# Database: Beta Turn DB
# Date of preparation: Jan. 3, 2007
# Total number of sequences: 14302
# Total number of residues: 3118986
# Size: 7085.542 kB
# Version: 1
# This database consists of a subset of non-redundant proteins in the PDB selected from the PISCES server [1] that exhibit less than 95% sequence identity and better than 3.0 Angstrom resolution (for X-ray structures). The data set includes both X-ray and NMR structures. The resulting set of proteins was then manually edited to remove proteins with transmembrane helices or transmembrane beta barrels. TMHMM [2], TMB-Hunt [3] and literature surveys were used in this culling process. The identity and position of the beta turns were determined by VADAR [4] using definitions suggested by Wilmot and Thornton [5].
# Example:
# >Sequence name to 25 characters; SwissProt ID; PDB ID
# MEILPQCDFKLGGAPRALDNQAGTRKEALKHLIAVNMAQPGDSGG
# --------------1111---------2222--------iiii--
# ANGLKCNMCSVLRIAGSSTHQNELANGAILVTQNGCLDIPANNAL
# ------3333------------4444-------!!!!--------
# INASSTGLK
# --%%%%---
# Where
# 1111 = type I turn
# 2222 = type II turn
# iiii = type I' turn
# 3333 = type III turn
# %%%% = type III' turn
# 4444 = type IV turn
# !!!! = type II' turn
# References:
# 1. Wang G, Dunbrack RL Jr. PISCES: a protein sequence culling server. Bioinformatics. 2003 Aug 12;19(12):1589-91.
# 2. Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001 Jan 19;305(3):567-80.
# 3. Garrow AG, Agnew A, Westhead DR. TMB-Hunt: a web server to screen sequence sets for transmembrane beta-barrel proteins. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W188-92.
# 4. Willard L, Ranjan A, Zhang H, Monzavi H, Boyko RF, Sykes BD, Wishart DS. VADAR: a web server for quantitative evaluation of protein structure quality. Nucleic Acids Res. 2003 Jul 1;31(13):3316-9.
# 5. Wilmot CM, Thornton JM.
Analysis and prediction of the different types of beta-turn in proteins. J Mol Biol. 1988 Sep 5;203(1):221-32. >ASPARAGINE SYNTHETASE; SWP:P00963; PDB:12ASA; AYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSGAEKAVQVKVKALP -----------------------------------2222----!!!!--------3333- DAQFEVVHSLAKWKRQTLGQHDFSAGEGLYTHMKALRPDEDRLSPLHSVYVDQWDWERVM -----------------------2222---------1111-------------------- GDGERQFSTLKSTVEAIWAGIKATEAAVSEEFGLAPFLPDQIHFVHSQELLSRYPDLDAK 2222-3333------------------------------------3333--------333 GRERAIAKDLGAVFLVGIGGKLSDGHRHDVRAPDYDDWSTPSELGHAGLNGDILVWNPVL 3----------------------------------------3333------------111 EDAFELSSMGIRVDADTLKHQLALTGDEDRLELEWHQALLRGEMPQTIGGGIGQSRLTML 1--------------------------3333-------1111------------3333-- LLQLPHIGQVQAGVWPAAVRESVPSLL -----3333------------------ >2E8 (IGG1=KAPPA=) ANTIBOD; SWP:NA; PDB:12E8H; EVQLQQSGAEVVRSGASVKLSCTASGFNIKDYYIHWVKQRPEKGLEWIGWIDPEIGDTEY ------------2222------------1111-------2222----------------- VPKFQGKATMTADTSSNTAYLQLSSL 3333---------1111--------- >Igk-C protein; SWP:Q58EU4; PDB:12E8L; DIVMTQSQKFMSTSVGDRVSITCKASQNVGTAVAWYQQKPGQSPKLMIYSASNRYTGVPD -------------2222---------------------2222------------222211 RFTGSGSGTDFTLTISNMQSEDLADYFCQQYSSYPLTFGAGTKLELKRADAAPTVSIFPP 11----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSATDQDSKDSTYSMSSTLT 3333-------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -3333------------1111--------3333- >T4 LYSOZYME; SWP:P00720; PDB:146L; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM ---------------------------1111----------------------------- MQQKRWDELAVNMAKSRWYNQTPNRAKRIITTWRTGTWDAYK -------------------------------------3333- >T4 LYSOZYME; SWP:P00720; PDB:152L; MNCFEMLRCDEGLRLKIYKDCEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK 
-------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRCALINMVFQMGETGVAGFTNSLRM ---------------------------11113333------------------------- LQQKRWDEAAVNLAKSRWYNQCPNRAKRVITTFRTGTWDAYKNC 1111---------------------------------3333--- >T4 LYSOZYME; SWP:P00720; PDB:157L; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFAAALAA ---------------------------1111----------------------------- LAAKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYK -------------------------------------3333- >Putative uncharacterized ; SWP:A0A5E3; PDB:15C8L; DIVLTQSPAIMSASLGERVTMTCTASSSVSSSNLHWYQQKPGSSPKLWIYSTSNLASGVP -------------2222------------1111------2222------------22221 ARFSGSGSGTSYSLTISSMEAEDAATYYCHQYHRSPYTFGGGTKLEIKRADAAPTVSIFP 111----------------1111------------------------------------- PSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTL -3333-------------------------iiii-------------------------- TLTKDEYERHNSYTCEATHKTSTSPIVKSFNRN ------------------3333----------- >T4 LYSOZYME; SWP:P00720; PDB:169LA; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNCNGVITK -------------------1111------------------------------iiii-33 DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRCALINMVFQMGETGVAGFTNSLRM 33---------------------33331111-1111------------------------ LQQKRWDAAAAALAAAAWAAATPNRAKRVITTFRTGTWDAYK -----------------3333--3333--------------- >3-PHOSPHOGLYCERATE KINASE; SWP:P07378; PDB:16PK; EKKSINECDLKGKKVLIRVDFNVPVKNGKITNDYRIRSALPTLKKVLTEGGSCVLMSHLG ---3333--2222------------iiii------------------------------- RPKGIPMAQAGKIRSTGGVPGFQQKATLKPVAKRLSELLLRPVTFAPDCLNAADVVSKMS -----3333----1111-22223333---------------------1111----11112 PGDVVLLENVRFYKEEGSKKAKDREAMAKILASYGDVYISDAFGTAHRDSATMTGIPKIL 222-----11113333---------------1111------3333----3333------- GNGAAGYLMEKEISYFAKVLGNPPRPLVAIVGGAKVSDKIQLLDNMLQRIDYLLIGGAMA 
----------------------------------3333-------1111-------3333 YTFLKAQGYSIGKSKCEESKLEFARSLLKKAEDRKVQVILPIDHVCHTEFKAVDSPLITE ----------!!!!--1111---------------------------------------- DQNIPEGHMALDIGPKTIEKYVQTIGKCKSAIWNGPMGVFEMVPYSKGTFAIAKAMGRGT ----2222----------------1111----------33331111-------------- HEHGLMSIIGGGDSASAAELSGEAKRMSHVSTGGGASLELLEGKTLPGVTVLDDK ----------------------3333-------------1111--3333------ >VP16, VMW65, ATIF; SWP:P06492; PDB:16VPA; SRMPSPPMPVPPAALFNRLLDDLGFSAGPALCTMLDTWNEDLFSALPTNADLYRECKFLS ----------3333----------1111-----3333-----1111--333311111111 TLPSDVVEWGDAYVPERTQIDIRAHGDVAFPTLPATRDGLGLYYEALSRFFHAELRAREE -----------------------------------3333--------------------- SYRTVLANFCSALYRYLRASVRQLHRQAHMRGRDRDLGEMLRATIADRYYRETARLARVL ----------------------------1111---------------------------- FLHLYLFLTREILWAAYAEQMMRPDLFDCLCCDLESWRQLAGLFQPFMFVNGALTVRGVP ----------------------33331111-----3333----------------iiii- IEARRLRELNHIREHLNLPLVRSAATEEPGAPLTTPPTLHGNQARASGYFMVLIRAKLDS -------------1111-----3333-2222--------1111-3333--------1111 YSSAAPRLSFL ----------- >T4 LYSOZYME; SWP:P00720; PDB:174LA; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLAAAADLAAAKAALAAAIGRNTNGVITK -------------------1111------------------------1111--------- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM -----------------------33331111----------------------------- LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYKNL ----------------3333-----------------3333--- >T4 LYSOZYME; SWP:P00720; PDB:176LA; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHTLKVDGNSNAAKSELDKAIGRNTNGVITK -------------------3333------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM ---------------------------3333---------------11111111------ LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYKNL 1111-------------3333----------------3333--- >T4 LYSOZYME; SWP:P00720; PDB:189L; MNLFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPDLNVAKSELDKAIGRNCNGVITK -------------------3333--------------3333------------------- 
DEAEKLFNQDVDAAVRGILRNPKLKPVYDSLDAVRRCALINMVFQMGETGVAGFTDSLRM ---------------------------3333-----------3333--3333---3333- LQQKRWDEAAANLAKSRWYNQTPDRAKRVITTFRTGTWDAYKNL ----3333---------1111----------------3333--- >SIGNAL RECOGNITION PARTIC; SWP:P16254; PDB:1914; MVLLESEQFLTELTRLFQKCRSSGSVFITLKKYDEGLEPAENKCLLRATDGKRKISTVVS -----------------------------------------------------------1 SKEVNKFQMAYSNLLRANMDGLKKRAQGGEQKLFQTWEEFSRAAEKLYLADPMKVRVVLK 1113333-------------------------------------------3333------ YRHVDGNLCIKVTDDLVCLVYRTDQAQDVKKIEKFHSQLMRLMVAKESRNV --1111------------------1111----------------------- >LYSOZYME; SWP:P00720; PDB:192L; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLAAAKAALAAAIGRNTNGVITK -------------------1111--------------3333------------------- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM ---------------------------33333333------------------------- LQQKRWAAAAAALAKSRWYNQTPNRAKRVITTFRTGTWDAYK 1111---------------------------------3333- >NINE-HAEM CYTOCHROME C; SWP:Q9RN68; PDB:19HCA; AALEPTDSGAPSAIVMFPVGEKPNPKGAAMKPVVFNHLIHEKKIADCETCHHTGDPVSCS -----3333--------------1111------------------1111-1111---111 TCHTVEGKAEGDYITLDRAMHATDIAARAKGNTPTSCVSCHQSETKERRECAGCHAITTP 1--11113333------------------------------------3333-1111---- KDDEAWCATCHDITPSMTPSEMQKGIAGTLLPGDNEALAAETVLAEATVAPVSPMLAPYK -------------1111-------------3333--------1111------3333---- VVIDALADKYEPSDFTHRRHLTSLMESIKDDKLAQAFHDKPEILCATCHHRSPLSLTPPK ---1111--------------------1111--------11113333------------1 CGSCHTKEIDAADPGRPNLMAAYHLECMGCHKGMAVARPRDTDCTTCHKAAA 111------1111--------------------------1111-1111---- >Proto-oncogene protein c-; SWP:P01100; PDB:1A02F; RRIRRERNKMAAAKSRNRRRELTDTLQAETDQLEDEKSALQTEIANLLKEKEK -------------------------------------------------1111 >CALCYCLIN (RABBIT, CA2+); SWP:P30801; PDB:1A03A; MASPLDQAIGLLIGIFHKYSGKEGDKHTLSKKELKELIQKELTIGSKLQDAEIVKLMDDL --------------------3333------------------------------------ DRNKDQEVNFQEYITFLGALAMIYNEALKG 
--------3333------------------ >NITRATE/NITRITE RESPONSE ; SWP:P10957; PDB:1A04A; EPATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPG ------------------------1111-------------------------------- MNGLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQAA --------------------------3333----1111---------------------- AGEMVLSEALTPVLAASLERDVNQLTPRERDILKLIAQGLPNKMIARRLDITESTVKVHV ------3333----------1111------------------------------------ KHMLKKMKLKSRVEAAVWVHQERIF ------------------------- >3-ISOPROPYLMALATE DEHYDRO; SWP:Q56268; PDB:1A05A; MKKIAIFAGDGIGPEIVAAARQVLDAVDQAAHLGLRCTEGLVGGAALDASDDPLPAASLQ ---------!!!!----------------------------------------------- LAMAADAVILGAVGGPRWDAYPPAKRPEQGLLRLRKGLDLYANLRPAQIFPQLLDASPLR -1111---------3333---3333------------------------11111111--3 PELVRDVDILVVRELTGDIYFGQPRGLEVIDGKRRGFNTMVYDEDEIRRIAHVAFRAAQG 333--------------3333--------iiii------------------------111 RRKQLCSVDKANVLETTRLWREVVTEVARDYPDVRLSHMYVDNAAMQLIRAPAQFDVLLT 1--------11113333---------33331111-----3333-------3333------ GNMFGDILSDEASQLTGSIGMLPSASLGEGRAMYEPIHGSAPDIAGQDKANPLATILSVA -------------33331111------2222---------3333---------------- MMLRHSLNAEPWAQRVEAAVQRVLDQGLRTADIAAPGTPVIGTKAMGAAVVNALNLK -----------------------------3333-2222-------------3333-- >CALCIUM/CALMODULIN-DEPEND; SWP:Q63450; PDB:1A06; WKQAEDIRDIYDFRDVLGTGAFSEVILAEDKRTQKLVAIKCIAKENEIAVLHKIKHPNIV -----1111----------3333---------------------------1111-1111- ALDDIYESGGHLYLIMQLVSGGELFDRIVEKGFYTERDASRLIFQVLDAVKYLHDLGIVH ---------------------------1111----------------------1111--- RDLKPENLLYYSLDEDSKIMISDFPGYVAPEVLAQKPYSKAVDCWSIGVIAYILLCGYPP ---3333------1111-------3333-------------------------------- FYDENDAKLFEQILKAEYEFDSPYWDDISDSAKDFIRHLMEKDPEKRFTCEQALQHPWIA ----3333----1111------1111--3333----------3333---------3333- GDTALDKNIHQSVSEQIKKNFAKSKWKQAFNATAVVRHM -----------------------1111---3333-1111 >DNA; SWP:P07270; PDB:1A0AA; MKRESHKHAEQARRNRLAVALHELASLIPAEWKQQNVSAAPSKATTVEAACRYIRHLQQN 
---3333---------------------33333333------------------------ GST --- >XYLOSE ISOMERASE; SWP:P19148; PDB:1A0CA; NKYFENVSKIKYEGPKSNNPYSFKFYNPEEVIDGKTMEEHLRFSIAYWHTFTADGTDQFG ---1111------1111---------1111-iiii3333------3333-------1111 KATMQRPWNHYTDPMDIAKARVEAAFEFFDKINAPYFCFHDRDIAPEGDTLRETNKNLDT -----1111------------------------------1111----------------- IVAMIKDYLKTSKTKVLWGTANLFSNPRFVHGASTSCNADVFAYSAAQVKKALEITKELG -------------------------3333---1111------------------------ GENYVFWGGREGYETLLNTDMEFELDNFARFLHMAVDYAKEIGFEGQFLIEPKPKEPTKH -------1111---3333------------------------------------------ QYDFDVANVLAFLRKYDLDKYFKVNIEANHATLAFHDFQHELRYARINGVLGSIDANTGD -----------------1111----------1111-3333-----1111----------1 MLLGWDTDQFPTDIRMTTLAMYEVIKMGGFDKGGLNFDAKVRRASFEPEDLFLGHIAGMD 111---------------------1111-------------1111-3333---------- AFAKGFKVAYKLVKDRVFDKFIEERYASYKDGIGADIVSGKADFRSLEKYALERSQIVNK ------------1111---------3333---------------------1111------ SGRQELLESILNQYLFA ----------------- >XYLOSE ISOMERASE; SWP:P54273; PDB:1A0DA; PYFDNISTIAYEGPASKNPLAFKFYNPEEKVGDKTMEEHLRFSVAYWHTFTGDGSDPFGA -------------------------1111-!!!!3333--------3333---------- GNMIRPWNKYSGMDLAKARVEAAFEFFEKLNIPFFCFHDVDIAPEGETLKETYKNLDIIV ----1111-----------------------------1111------------------- DMIEEYMKTSKTKLLWNTANLFTHPRFVHGAATSCNADVFAYAAAKVKKGLEIAKRLGAE -----3333--------------3333---1111-------------------------- NYVFWGGREGYETLLNTDMKLELDNLARFLHMAVDYAKEIGFDGQFLIEPKPKEPTKHQY -----1111---3333---------------------1111------------------- DFDVATALAFLQTYGLKDYFKFNIEANHATLAGHTFEHELRVARIHGMLGSVDANQGDML ---------------3333-----3333-1111----------1111----------111 LGWDTDEFPTDLYSTTLAMYEILKNGGLGRGGLNFDAKVRRGSFEPEDLFYAHIAGMDSF 1---------3333-------------!!!!--------3333-3333------------ AVGLKVAHRLIEDRVFDEFIEERYKSYTEGIGREIVEGTADFHKLEAHALQLGEIQNQSG ------------------------1111-------------------3333--------- RQERLKTLLNQYLLEVC ----------------- >XYLOSE ISOMERASE; SWP:P45687; PDB:1A0EA; 
AEFFPEIPKVQFEGKESTNPLAFKFYDPEEIIDGKPLKDHLKFSVAFWHTFVNEGRDPFG ---1111------1111---------3333-iiii3333------3333-------1111 DPTADRPWNRYTDPMDKAFARVDALFEFCEKLNIEYFCFHDRDIAPEGKTLRETNKILDK -----1111------------------------------1111----------------- VVERIKERMKDSNVKLLWGTANLFSHPRYMHGAATTCSADVFAYAAAQVKKALEITKELG -------------------------3333---1111------------------------ GEGYVFWGGREGYETLLNTDLGFELENLARFLRMAVDYAKRIGFTGQFLIEPKPKEPTKH -------1111----1111----------------------------------------- QYDFDVATAYAFLKSHGLDEYFKFNIEANHATLAGHTFQHELRMARILGKLGSIDANQGD -------------11113333-----3333-1111------------------------1 LLLGWDTDQFPTNVYDTTLAMYEVIKAGGFTKGGLNFDAKVRRASYKVEDLFIGHIAGMD 111---------------------1111-------------1111-3333---------- TFALGFKVAYKLVKDGVLDKFIEEKYRSFREGIGRDIVEGKVDFEKLEEYIIDKETIELP -------------------------3333-!!!!----------------1111------ SGKQEYLESLINSYIVKTILELR ------------------1111- >MEIZOTHROMBIN; SWP:P00735; PDB:1A0HA; SPLLETCVPDRGREYRGRLAVTTHGSRCLAWSSEQAKALSKDQDFNPAVPLAENFCRNPD ---------%%%%--------3333---------1111-------------------111 GDEEGAWCYVADQPGDFEYCDLNYCEEPVDGDLGDRLGEDPDP 11111-------2222--------------------------- >Prothrombin [Precursor]; SWP:P00735; PDB:1A0HB; IVEGQDAEVGLSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTVDDLL -------2222-----------------------------1111--3333----3333-- VRIGKHSRTRYERKVEKISMLDKIYIHPRYNWKENLDRDIALLKLKRPIELSDYIHPVCL ---------------------------------------------------1111----- PDKQTAAKLLHAGFKGRVTGWGNRRETWTTSVAEVQPSVLQVVNLPLVERPVCKASTRIR ---3333-----------------------3333-------------------------- ITDNMFCAGYKPGEGKRGDACEGDSGGPFVMKSPYNNRWYQMGIVSWGEGCDRDGKYGFY ---------------------------------2222----------------------- THVFRLKKWIQKVIDRLGS ---1111-----1111--- >DNA LIGASE; SWP:P00969; PDB:1A0I; VNIKTNPFKAVSFVESAIKKALDNAGYLIAEIKYDGVRGNICVDNTANSYWLSRVSKTIP -------------------------------------------1111-----1111---- ALEHLNGFDVRWKRLLNDDRCFYKDGFMLDGELMVKGVDFNTGSGLLRTKWTDTKNQEFH 
-1111------------1111-3333------------2222------------------ RKKDKVPFKLHTGHLHIKLYAILPLHIVESGEDCDVMTLLMQEHVKNMLPLLQEYFPEIE ----------1111---------33333333----------------------------- WQAAESYEVYDMVELQQLYEQKRAEGHEGLIVKDPMCIYKRGKKSGWWKMKPENEADGII ---------------------------------1111----------------------- QGLVWGTKGLANEGKVIGFEVLLESGRLVNATNISRALMDEFTETVKEATLSQWGFFDAC ----------------------3333---------------------------------- TINPYDGWACQISYMEETPDGSLRHPSFVMFR ----2222---------1111----------- >TRYPSIN; SWP:P35033; PDB:1A0JA; IVGGYECRKNSASYQASLQSGYHFCGGSLISSTWVVSAAHCYKSRIQVRLGEHNIAVNEG -------11111111----------------------1111------------1111--- TEQFIDSVKVIMHPSYNSRNLDNDIMLIKLSKPASLNSYVSTVALPSSCASSGTRCLVSG ------------1111--------------------1111-------------------- WGNLSGSSSNYPDTLRCLDLPILSSSSCNSAYPGQITSNMFCAGFMEGGKDSCQGDSGGP -------------------------------2222-1111------------2222---- VVCNGQLQGVVSWGYGCAQRNKPGVYTKVCNYRSWISSTMSSN --iiii---------------------3333------------ >PROFILIN; SWP:Q42449; PDB:1A0K; SWQSYVDDHLMCDVEGNHLTAAAILGQDGSVWAQSAKFPQLKPQEIDGIKKDFEEPGFLA -------------iiii--------1111-----1111----------------222233 PTGLFLGGEKYMVIQGEQGAVIRGKKGPGGVTIKKTNQALVFGFYDEPMTGGQCNLVVER 33---iiii-------2222-----!!!!------------------------------- LGDYLIESEL ---------- >BETA-TRYPTASE; SWP:P20231; PDB:1A0LA; IVGGQEAPRSKWPWQVSLRVHGPY -----------1111--------- >SITE-SPECIFIC RECOMBINASE; SWP:P0A8P8; PDB:1A0P; QDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATAQSDDLQALLAER -------------------------------------------3333------------- LSSARLLSAVRRLFQYLYREKFREDDPSAHLKDLSEAQVERLLQAPLIDQPLELRDKAML -------------------------1111------------------------------- EVLYATGLRVSELVGLTMSDISLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPW --------333311113333--3333------------------------------3333 LLNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEKLSPHVLRHAFATHLLNHGA ------------1111----------------1111-1111------------------- DLRVVQMLLSDLSTTQIYTHVATERLRQLHQ ---------------------------1111 >29G11 FAB; SWP:NA; 
PDB:1A0QH; VQLQESDAELVKPGASVKISCKASGYTFTDHVIHWVKQKPEQGLEWIGYISPGNGDIKYN -----------2222-----------1111-----------------------------3 EKFKGKATLTADKSSSTAYMQLNSL 333---------1111--------- >29G11 FAB; SWP:NA; PDB:1A0QL; IELTQSPSSLSASLGGKVTITCKASQDIKKYIGWYQHKPGKQPRLLIHYTSTLLPGIPSR ------------2222-----------%%%%------2222------------2222333 FRGSGSGRDYSFSISNLEPEDIATYYCLQYYNLRTFGGGTKLEIKRADAAPTVSIFPPSS 3----------------3333-------------------------------------33 EQLTSGGASVVCFLNNFYSKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLT 33-------------------------iiii----------------------------3 KDEYERHNSYTCEATHKTSTSPIVKSFNRNE 3331111--------3333------------ >Phosducin; SWP:P19632; PDB:1A0RP; FEGQASHTGPKGVINDWRKFKLESEFSRKMSVQEYELIHKDKEDENCLRKYRRQCMQDMH ------------------------------------1111-------------------- QKLSFGPRYGFVYELESGEQFLETIEKEQKITTIVVHIYEDGIKGCDALNSSLICLAAEY ----------------------------1111-------2222----------------1 PMVKFCKIKASNTGAGDRFSSDVLPTLLVYKGGELLSNFISVTEQLAEEFFTGDVESFLN 111-----3333-------1111-------iiii------3333------3333------ EYGLLPEK -------- >REGULATOR OF CHROMOSOME C; SWP:P18754; PDB:1A12A; KKVKVSHRSHSTEPGLVLTLGQGDVGQLGLGENVMERKKPALVSIPEDVVQAEAGGMHTV ------1111------------1111----1111-------------------------- CLSKSGQVYSFGCNDEGALGRDTSVEGSEMVPGKVELQEKVVQVSAGDSHTAALTDDGRV --1111-------1111-------2222--------------------------1111-- FLWGSFRDNNGVIGLLEPMKKSMVPVQVQLDVPVVKVASGNDHLVMLTADGDLYTLGCGE -------1111-----2222---------------------------1111-------11 QGQLGRVPELFANRGGRQGLERLLVPKCVMLKSRGSRGHVRFQDAFCGAYFTFAISHEGH 11----3333----33333333---------------------------------1111- VYGFGLSNYHQLGTPGTESCFIPQNLTSFKNSTKSWVGFSGGQHHTVCMDSEGKAYSLGR ------1111---------------3333-1111-------------------------- AEYGRLGLGEGAEEKSIPTLISRLPAVSSVACGASVGYAVTKDGRVFAWGMGTNYQLGTG 2222----2222----------------------------1111-------1111----- QDEDAWSPVEMMGKQLENRVVLSVSSGGQHTVLLVKDKEQS ------------1111------------------------- >Ig heavy chain V region 3; SWP:P01749; PDB:1A14H; 
QVQLQQSGAELVKPGASVRMSCKASGYTFTNYNMYWVKQSPGQGLEWIGIFYPGNGDTSY ---------------------------3333----------------------------- NQKFKDKATLTADKSSNTAYMQLSSLS -1111---------------------- >Ig kappa chain V-V region; SWP:P01645; PDB:1A14L; DIELTQTTSSLSASLGDRVTISCRASQDISNYLNWYQQNPDGTVKLLIYYTSNLHSEVPS -------------2222-----------iiii----------------------222211 RFSGSGSGTDYSLTISNLEQEDIATYFCQQDFTLPFTFGGGTAA 11----------------3333---------------------- >SERINE/THREONINE PROTEIN ; SWP:P53041; PDB:1A17; PPADGALKRAEELKTQANDYFKAKDYENAIKFYSQAIELNPSNAIYYGNRSLAYLRTECY --3333--------------1111------------------------------1111-- GYALGDATRAIELDKKYIKGYYRRAASNMALGKFRAALRDYETVVKVKPHDKDAKMKYQE -------------1111-----------1111---------------1111--------- CNKIVKQKAFERAIAGDEHKRSVVDSLDIESMTIEDEYS -----------------------1111-1111------- >RADR ZINC FINGER PEPTIDE; SWP:P08154; PDB:1A1IA; RPYACPVESCDRRFSRSADLTRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGEKP ------1111-----3333------1111----------------------3333----- FACDICGRKFARSDERKRHTKIHLR --------------------1111- >P53; SWP:P04637; PDB:1A1UA; EYFTLQIRGRERFEKIREYNEALELKDAQ ------------------------3333- >HMTCP-1; SWP:P56278; PDB:1A1X; AGEDVGAPPDHLWVHQEGIYRDEYQRTWVAVVEEETSFLRARVQQIQVPLGDAARPSHLL ---------------2222--1111---------1111----------------333311 TSQLPLMWQLYPEERYMDNNSRLWQIQHHLMVRGVQELLLKLLPDD 11---------------1111----------iiii----------- >TISSUE FACTOR; SWP:P24055; PDB:1A21A; TGRAYNLTWKSTNFKTILEWEPKSIDHVYTVQISTRLENWKSKCFLTAETECDLTDEVVK -----------iiii-------------------1111---------------3333333 DVGQTYMARVLSYPARNTTGFPEEPPFRNSPEFTPYLDTNLGQPTIQSFEQVGTKLNVTV 3--------------------------------3333----------------------- QDARTLVTFLSLRAVFGKDLNYTLYYWRKKTATTNTNEFLIDVDKGENYCFSVQAVIPSR ----------3333-!!!!-------------------------------------1111 KRKQRSPESLTECT -------------- >GROWTH HORMONE; SWP:P01241; PDB:1A22A; FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQTSLCFSESIPT ----------------------------------------------3333--3333---- 
PSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEER -----------------------1111--------------iiii3333----------- IQTLMGRLEGQIFKQTYSKFDTDALLKNYGLLYCFRKDMDKVETFLRIVQCRSVEGSCGF ---------3333---------3333---------------------------------- >Growth hormone receptor [; SWP:P10912; PDB:1A22B; PKFTKCRSPERETFSCHWTLGPIQLFYTRRNTQEWTQEWKECPDYVSAGENSCYFNSSFT -------------------------------------------------------3333- SIWIPYCIKLTSNGGTVDEKCFSVDEIVQPDPPIALNWTLLNGIHADIQVRWEAPRNADI -----------1111-------3333----------------------------1111-1 QKGWMVLEYELQYKEVNETKWKMMDPILTTSVPVYSLKVDKEYEVRVRSKQRNSGNYGEF 111-----------1111-------------------1111---------2222------ SEVLYVTLPQMS ------------ >PROTEIN KINASE C (BETA); SWP:P04410; PDB:1A25A; ERRGRIYIQAHIDREVLIVVVRDAKNLVPMDPNGLSDPYVKLKLIPDPKSESKQKTKTIK 1111--------------------------1111------------1111---------- CSLNPEWNETFRFQLKESDKDRRLSVEIWDWDLTSRNDFMGSLSFGISELQKAGVDGWFK ---------------3333--------------------------3333----------- LLSQEEGEYFNV --3333------ >Prothrombin [Precursor]; SWP:P00734; PDB:1A2CL; TFGSGEADCGLRPLFEKKSLEDKTERELLESYIDGR -------22221111--------3333--------- >METHYLAMINE OXIDASE; SWP:P12807; PDB:1A2VA; PARPAHPLDPLSTAEIKAATNTVKSYFAGKKISFNTVTLREPARKAYIQWKEQGGPLPPR -----1111-----------------2222------------------------------ LAYYVILEAGKPGVKEGLVDLASLSVIETRALETVQPILTVEDLCSTEEVIRNDPAVIEQ -------2222---------1111---------------3333----------------- CVLSGIPANEMHKVYCDPWTIGYDERWGTGKRLQQALVYYRSDEDDSQYSHPLDFCPIVD ------11111111---------3333---------------11113333---------- TEEKKVIFIDIPNRRRKVSKHKHANFYPKHMIEKVGAMRPEAPPINVTQPEGVSFKMTGN 3333--------------------------------------------1111-----!!! 
VMEWSNFKFHIGFNYREGIVLSDVSYNDHGNVRPIFHRISLSEMIVPYGSPEFPHQRKHA !--iiii----------------------------------------------------- LDIGEYGAGYMTNPLSLGCDCKGVIHYLDAHFSDRAGDPITVKNAVCIHEEDDGLLFKHS 3333----1111---3333--------------1111----------------------- DFRDNFATSLVTRATKLVVSQIFTAANEYCLYWVFMQDGAIRLDIRLTGILNTYILGDDE ---iiii-------------------------------------------------1111 EAGPWGTRVYPNVNAHNHQHLFSLRIDPRIDGDGNSAAACDAKSSPYPLGSPENMYGNAF ---------2222----------------------------------2222--1111--- YSEKTTFKTVKDSLTNYESATGRSWDIFNPNKVNPYSGKPPSYKLVSTQCPPLLAKEGSL --------3333-----3333-------1111-----------------------2222- VAKRAPWASHSVNVVPYKDNRLYPSGDHVPQWSGDGVRGMREWIGDGSENIDNTDILFFH ----3333---------2222-1111--2222-----------!!!!------------- TFGITHFPAPEDFPLMPAEPITLMLRPRHFFTENPGLDIQPSYAMTTSEAKRAV --------3333---------------------1111----------------- >Troponin I, fast skeletal; SWP:P02643; PDB:1A2XB; EEKRNRAITARRQHLKSVMLQIAATELEKEE ---------------------------3333 >MONOCLONAL ANTIBODY D1.3; SWP:P01635; PDB:1A2YA; DIVLTQSPASLSASVGETVTITCRASGNIHNYLAWYQQKQGKSPQLLVYYTTTLADGVPS -------------2222-----------iiii------2222------------222233 RFSGSGSGTQYSLKINSLQPEDFGSYYCQHFWSTPRTFGGGTKLEIK 33----!!!!--------1111------------------------- >Ig heavy chain V region P; SWP:P01820; PDB:1A2YB; QVQLQESGPGLVAPSQSLSITCTVSGFSLTGYGVNWVRQPPGKGLEWLGMIWGDGNTDYN ------------1111-----------3333--------2222--------1111----- SALKSRLSISKDNSKSQVFLKMNSLHTDDTARYYCARERDYRLDYWGQGTTLTVSS 1111--------1111---------1111---------%%%%-------------- >PYRROLIDONE CARBOXYL PEPT; SWP:O07883; PDB:1A2ZA; MKKVLITGFEPFGGDSKNPTEQIAKYFDRKQIGNAMVYGRVLPVSVKRATIELKRYLEEI --------------------------2222-!!!!------------------------- KPEIVINLGLAPTYSNITVERIAVNIIDARIPDNDGYQPIDEKIEEDAPLAYMATLPVRA ----------2222------------------1111------------------------ ITKTLRDNGIPATISYSAGTYLCNYVMFKTLHFSKIEGYPLKAGFIHVPYTPDQVVNKFF -----1111-----------------------------------------33331111-- LLGKNTPSMCLEAEIKAIELAVKVSLDYLEKDRDDIKIPL 
2222------------------------------------ >RIBOSOMAL PROTEIN S15; SWP:P05766; PDB:1A32; LTQERKREIIEQFKVHENDTGSPEVQIAILTEQINNLNEHLRVHKKDHHSRRGLLKMVGK ---------------------------------------333311111111--------- RRRLLAYLRNKDVARYREIVEKLGL ------------------------- >PEPTIDYLPROLYL ISOMERASE; SWP:Q27450; PDB:1A33; KDRRRVFLDVTIDGNLAGRIVMELYNDIAPRTCNNFLMLCTGMAGTGKISGKPLHYKGST -----------iiii----------------------------------------2222- FHRVIKNFMIQGGDFTKGDGTGGESIYGGMFDDEEFVMKHDEPFVVSMANKGPNTNGSQF ----2222----------------1111-------------------------------- FITTTPAPHLNNIHVVFGKVVSGQEVVTKIEYLKTNSKNRPLADVVILNCGELV ------3333------------3333---1111--1111--------------- >SATELLITE TOBACCO MOSAIC ; SWP:P17574; PDB:1A34A; TGDNSNVVTMIRAGSYPKVNPTPTWVRAIPFEVSVQSGIAFKVPVGSLFSANFRTDSFTS -----------------------------------2222----3333--3333-1111-- VTVMSVRAWTQLTPPVNEYSFVRLKPLFKTGDSTEEFEGRASNINTRASVGYRIPTNLRQ --------------2222---------1111-----------1111--------3333-- NTVAADNVCEVRSNCRQVALVISCCFN --3333--------------------- >14-3-3 PROTEIN ZETA; SWP:P29312; PDB:1A37A; MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVVGARRSSWR ------------------------------------3333-------------------- VVSSIEQEKKQQMAREYREKIETELRDICNDVLSLLEKFLIPNAAESKVFYLKMKGDYYR -------3333----------------------------------3333----------- YLAEVAAGDDKKGIVDQSQQAYQEAFEISKIRLGLALNFSVFYYACSLAKTAFDEAIADD -------3333---------------3333----------33333333-----3333--- STLIMQLLRDNLTLW --------------- >MANNITOL-SPECIFIC EII; SWP:P00550; PDB:1A3AA; LFKLGAENIFLGRKAATKEEAIRFAGEQLVKGGYVEPEYVQAMLDREKLTPTYLGESIAV ----3333----------------------------------------------%%%%-- PHGTVEAKDRVLKTGVVFCQYPEGVRFGEEEDDIARLVIGIAARNNEHIQVITSLTNALD ---33331111---------1111-----1111--------------------------- DESVIERLAHTTSVDEVLELLAGRK 3333----------------1111- >PYRIMIDINE OPERON REGULAT; SWP:P39765; PDB:1A3C; QKAVILDEQAIRRALTRIAHEMIERNKCILVGIKTRGIYLAKRLAERIEQIEGNPVTVGE ------------------------------------------------------------ 
IDITLYRNDEPLVKGADIPVDITDQKVILVDDVLYTGRTVRAGMDALVDVGRPSSIQLAV ---------------------2222----------------------------------- LVDRGHRELPIRADYIGKNIPTSKSEKVMVQLDEVDQNDLVAIYEN ----------------------1111-----3333----------- >PHOSPHOLIPASE A2; SWP:P15445; PDB:1A3D; NLYQFKNMIKCTVPSRSWWDFADYGCYCGRGGSGTPVDDLDRCCQVHDNCYNEAEKISGC ------------33333333------2222--------------------------2222 WPYFKTYSYECSQGTLTCKGDNNACAASVCDCDRLAAICFAGAPYNDNNYNIDLKARCQ 3333--------------1111-----------------------1111---3333--- >NUCLEAR FACTOR-KAPPA-B P5; SWP:Q04860; PDB:1A3QA; GPYLVIVEQPKQRGFRFRYGCEGPSHGGLPGASSEKGRKTYPTVKICNYEGPAKIEVDLV ------------------3333-------------------------------------- THSDPPRAHAHSLVGKQCSELGICAVSVGPKDMTAQFNNLGVLHVTKKNMMGTMIQKLQR --------------2222---------------------------1111----------- QRLRSRPQGLTEAEQRELEQEAKELKKVMDLSIVRLRFSAFLRSLPLKPVISQPIHDSKS --------------------------------------------------------1111 PGASNLKISRMDKTAGSVRGGDEVYLLCDKVQKDDIEVRFYEDDENGWQAFGDFSPTDVH ----------------3333-----------1111-------------------3333-% KQYAIVFRTPPYHKMKIERPVTVFLQLKRKRGGDVSDSKQFTYYP %%%------------------------------------------ >Genome polyprotein; SWP:P04936; PDB:1A3RH; VQLQQSGAELVRPGASVKLSCTTSGFNIKDIYIHWVKQRPEQGLEWIGRLDPANGYTKYD -----------2222-------22221111--------2222-----------------1 PKFQGKATITVDTSSNTAYLHLSSL 111--------3333---------- >Putative uncharacterized ; SWP:Q52L64; PDB:1A3RL; DIVMTQSPSSLTVTTGEKVTMTCKSSQSLLNSR -------------2222---------------- >PYRUVATE KINASE; SWP:P00549; PDB:1A3WA; SRLERLTSLSDLRRTSIIGTIGPKTNNPETLVALRKAGLNIVRMNFSHGSYEYHKSVIDN ---------------------3333------------------------1111------- ARKSEELYPGRPLAIALDTKGPEIRTGTTTNDVDYPIPPNHEMIFTTDDKYAKACDDKIM ------------------------------------------------1111-------- YVDYKNITKVISAGRIIYVDDGVLSFQVLEVVDTLKVKALNAGKICSHKGVNLPGTDVDL ---1111----2222----%%%%-----------------------------2222---- PALSEKDKEDLRFGVKNGVHMVFASFIRTANDVLTIREVLGEQGKDVKIIVKIENQQGVN ----------------------------3333--------3333-----------1111- 
NFDEILKVTDGVMVARGDLGIEIPAPEVLAVQKKLIAKSNLAGKPVICATQMLESMTYNP -----------------3333--3333-------------------------3333---- RPTRAEVSDVGNAILDGADCVMLSGETAKGNYPINAVTTMAETAVIAEQAIAYLPNYDDM --3333-------3333------3333--------------------------------- RNCTPKPTSTTETVAASAVAAVFEQKAKAIIVLSTSGTTPRLVSKYRPNCPIILVTRCPR -----------------------------------------------------------3 AARFSHLYRGVFPFVFEKEPVSDWTDDVEARINFGIEKAKEFGILKKGDTYVSIQGFKAG 333----2222------------------------------------------------- AGHSNTLQVSTV ------------ >TOPOISOMERASE I; SWP:P68698; PDB:1A41; NAKRDRIFVRVYNVMKRINCFINKNIKKSSTDSNYQLAVFMLMETMFFKENETVGLLTLK --3333------------------1111---------------------1111------1 NKHIEISPDEIVIKFVGKDKVSHEFVVHKSNRLYKPLLKLTDDSSPEEFLFNKLSERKVY 111---1111--------------------3333-------1111---------3333-- ECIKQFGIRIKDLRTYGVNYTFLYNFWTNVKSISPLPSPKKLIALTIKQTAEVVGHTPSI ---1111-3333------------------------------------------------ SKRAYMATTILEMVKDKNFLDVVSKTTFDEFLSIVVDHVKS ---------------1111-------3333----------- >PHOSPHATIDYLETHANOLAMINE-; SWP:P13696; PDB:1A44; PVDLSKWSGPLSLQEVDERPQHPLQVKYGGAEVDELGKVLTPTQVKNRPTSITWDGLDPG --3333--3333----------------------2222--3333---------2222111 KLYTLVLTDPDAPSRKDPKYREWHHFLVVNMKGNNISSGTVLSDYVGSGPPKGTGLHRYV 1------------33331111----------!!!!1111-----------2222------ WLVYEQEGPLKCDEPILSNRSGDHRGKFKVASFRKKYELGAPVAGTCYQAEWDDYVPKLY ---------------------2222---------1111--------------1111---- EQLSG ----- >CATALASE A; SWP:P15202; PDB:1A4EA; DVREDRVVTNSTGNPINEPFVTQRIGEHGPLLLQDYNLIDSLAHFNRENIPQRNPHAHGS --1111---1111----1111----------1111---------1111------------ GAFGYFEVTDDITDICGSAMFSKIGKRTKCLTRFSTVGGDKGSADTVRDPRGFATKFYTE -----------1111--3333-2222-------------1111---------------11 EGNLDWVYNNTPVFFIRDPSKFPHFIHTQKRNPQTNLRDADMFWDFLTTPENQVAIHQVM 11---------------3333-----------------3333------33331111---- ILFSDRGTPANYRSMHGYSGHTYKWSNKNGDWHYVQVHIKTDQGIKNLTIEEATKIAGSN ---3333---1111------------3333----------1111---------------- 
PDYCQQDLFEAIQNGNYPSWTVYIQTMTERDAKKLPFSVFDLTKVWPQGQFPLRRVGKIV -----------1111------------33331111--1111-----3333---------- LNENPLNFFAQVEQAAFAPSTTVPYQEASADPVLQARLFSYADAHRYRLGPNFHQIPVNC ------3333-1111--1111-2222-------------------------11113333- PYASKFFNPAIRDGPMNVNGNFGSEPTYLANDKSYTYIQQDRPIQQHQEVWNGPAIPYHW 1111---1111--------%%%%------1111-----1111--1111------------ ATSPGDVDFVQARNLYRVLGKQPGQQKNLAYNIGIHVEGACPQIQQRVYDMFARVDKGLS -----1111-------3333-2222----------3333-----------3333------ EAIKKVAE -------- >HEMOGLOBIN; SWP:P01990; PDB:1A4FA; VLSAADKTNVKGVFSKISGHAEEYGAETLERMFTAYPQTKTYFPHFDLQHGSAQIKAHGK --------------1111--3333------------------1111--2222-------- KVVAALVEAVNHIDDIAGALSKLSDLHAQKLRVDPVNFKFLGHCFLVVVAIHHPSALTAE -----------33333333--------------3333---------------3333---- VHASLDKFLCAVGTVLTAKYR --------------------- >Hemoglobin subunit beta; SWP:P02118; PDB:1A4FB; VHWSAEEKQLITGLWGKVNVADCGAEALARLLIVYPWTQRFFSSFGNLSSPTAILGNPMV --------------11113333---------------33331111--------------- RAHGKKVLTSFGDAVKNLDNIKNTFAQLSELHCDKLHVDPENFRLLGDILIIVLAAHFAK ----------------1111-------------------------------------!!! 
EFTPDCQAAWQKLVRVVAHALARKYH !--------------------1111- >METHYLENETETRAHYDROFOLATE; SWP:P11586; PDB:1A4IA; APAEILNGKEISAQIRARLKNQVTQLKEQVPGFTPRLAILQVGNRDDSNLYINVKLKAAE -----------------------------2222--------------------------- EIGIKATHIKLPRTTTESEVMKYITSLNEDSTVHGFLVQLPLDSENSINTEEVINAIAPE -----------1111--------------1111--------------------1111333 KDVDGLTSINAGRLARGDLNDCFIPCTPKGCLELIKETGVPIAGRHAVVVGRSKIVGAPM 31111--------1111------------------3333--2222-------3333---- HDLLLWNNATVTTCHSKTAHLDEEVNKGDILVVATGQPEMVKGEWIKPGAIVIDCGINYK --------------1111------1111-------------3333-2222---------- VVGDVAYDEAKERASFITPVPGGVGPMTVAMLMQSTVESAKRFLE ------3333-------------3333------------------ >IMMUNOGLOBULIN, DIELS ALD; SWP:NA; PDB:1A4JH; QVQLLESGPELKKPGETVKISCKASGYTFTNYGMNWVKQAPGKGLKWMGWINTYTGEPTY ------------2222-----------1111--------2222----------------- ADDFKGRFAFSLETSASTAYLQINNLKNEDTATYFCVQAERLRRTFDYWGAGTTVTVSSA 1111---------1111---------3333----------3333---------------- STKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG ---------------------------------------%%%%--2222-------1111 LYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKV ----------3333------------1111------- >Ig kappa chain C region; SWP:KAC_HUMAN; PDB:1A4JL; ELVMTQTPLSLPVSLGDQASISCRSSQSLVHSNGNTYLHWYLQKPGQSPKLLIYKVSNRF -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDLGVYFCSQSTHVPPTFGGGTKLEIKRTVAAPSV 222--------------------1111--------------------------------- FIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSL -----3333-------------------------iiii---------------------- SSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRG ------3333------------1111----------- >ADENOSINE DEAMINASE; SWP:P03958; PDB:1A4MA; TPAFNKPKVELHVHLDGAIKPETILYFGKKRGIALPADTVEELRNIIGMDKPLSLPGFLA -------------1111------------------------------------------3 KFDYYMPVIAGCREAIKRIAYEFVEMKAKEGVVYVEVRYSPHLLANSKVDPMPWNQTEGD 333-33332222---------------------------3333---------%%%%---- 
VTPDDVVDLVNQGLQEGEQAFGIKVRSILCCMRHQPSWSLEVLELCKKYNQKTVVAMDLA -------------------------------11111111---------2222-------- GDETIEGSSLFPGHVEAYEGAVKNGIHRTVHAGEVGSPEVVREAVDILKTERVGHGYHTI -3333-3333----------------------------------------------3333 EDEALYNRLLKENMHFEVCPWSSYLTGAWDPKTTHAVVRFKNDKANYSLNTDDPLIFKST ---------1111------33331111--3333-3333----------------1111-3 LDTDYQMTKKDMGFTEEEFKRLNINAAKSSFLPEEEKKELLERLYREYQ 333-----------------------1111------------------- >S100A10; SWP:P08206; PDB:1A4PA; PSQMEHAMETMMFTFHKFAGDKGYLTKEDLRVLMEKEFPGFLENQKDPLAVDKIMKDLDQ -------------------3333-----------------------1111--------11 CRDGKVGFQSFFSLIAGLTIACNDYFVVHMKQ 11----3333---------------------- >BETAINE ALDEHYDE DEHYDROG; SWP:P56533; PDB:1A4SA; AQLVDSMPSASTGSVVVTDDLNYWGGRRIKSKDGATTEPVFEPATGRVLCQMVPCGAEEV 3333-3333-2222---------%%%%--------------------------------- DQAVQSAQAAYLKWSKMAGIERSRVMLEAARIIRERRDNIAKLEVINNGKTITEAEYDID ------------3333-------------------------------------------- AAWQCIEYYAGLAPTLSGQHIQLPGGAFAYTRREPLGVCAGILAWNYPFMIAAWKCAPAL -----------1111--------iiii-------------------3333---------1 ACGNAVVFKPSPMTPVTGVILAEIFHEAGVPVGLVNVVQGGAETGSLLCHHPNVAKVSFT 111-------1111-3333------3333-2222-------------------------- GSVPTGKKVMEMSAKTVKHVTLELGGKSPLLIFKDCELENAVRGALMANFLTQGQVCTNG -----------3333-----------------1111--------------%%%%-1111- TRVFVQREIMPQFLEEVVKRTKAIVVGDPLLTETRMGGLISKPQLDKVLGFVAQAKKEGA -----3333-----------1111---1111------------------------1111- RVLCGGEPLTPSDPKLKNGYFMSPCVLDNCRDDMTCVKEEIFGPVMSVLPFDTEEEVLQR ------------1111--------------11113333---------------------- ANNTTFGLASGVFTRDISRAHRVAANLEAGTCYINTYSISPVEVPFGGYKMSGFGRENGQ ----------------------------------------3333----!!!!------33 ATVDYYSQLKTVIVEMGDVDSLF 331111----------------- >INDOLE-3-GLYCEROLPHOSPHAT; SWP:Q06121; PDB:1A53; PRYLKGWLKDVVQLSLRRPSFRASRQRPIISLNERILEFNKRNITAIIAEYKRKSPSGLD -----3333-----1111---------------------1111-----------1111-- VERDPIEYSKFMERYAVGLSILTEEKYFNGSYETLRKIASSVSIPILMKDFIVKESQIDD 
-------------------------------------1111------------3333--- AYNLGADTVLLIVKILTERELESLLEYARSYGMEPLIEINDENDLDIALRIGARFIGINS -----------3333-------------1111---------------------------- RDLETLEINKENQRKLISMIPSNVVKVAESGISERNEIEELRKLGVNAFLIGSSLMRNPE ---------------3333-1111---------3333----1111--------------3 KIKEFIL 3331111 >FERRICYTOCHROME C-552; SWP:P95339; PDB:1A56; DADLAKKNNCIACHQVETKVVGPALKDIAAKYADKDDAATYLAGKIKGGSSGVWGQIPMP ----33333333------------------------------------------------ PNVNVSDADAKALADWILTLK ----------------3333- >CYCLOPHILIN; SWP:Q27450; PDB:1A58; MSKKDRRRVFLDVTIDGNLAGRIVMELYNDIAPRTCNNFLMLCTGMAGTGKISGKPLHYK -1111---------iiii----------------------------------------22 GSTFHRVIKNFMIQGGDFTKGDGTGGESIYGGMFDDEEFVMKHDEPFVVSMANKGPNTNG 22-----2222----------------1111----------------------------- SQFFITTTPAPHLNNIHVVFGKVVSGQEVVTKIEYLKTNSKNRPLADVVILNCGELV ---------3333-------------------1111--1111--------------- >CITRATE SYNTHASE; SWP:O34002; PDB:1A59; EPTIHKGLAGVTADVTAISKVNSDTNSLLYRGYPVQELAAKCSFEQVAYLLWNSELPNDS ----2222--------------1111---iiii--------------------------- ELKAFVNFERSHRKLDENVKGAIDLLSTACHPMDVARTAVSVLGANHARAQDSSPEANLE -------3333-----------1111----3333---------1111-1111-------- KAMSLLATFPSVVAYDQRRRRGEELIEPREDLDYSANFLWMTFGEEAAPEVVEAFNVSMI ------------------1111------1111---------------------------- LYAEHSFNASTFTARVITSTLADLHSAVTGAIGALKGPLHGGANEAVMHTFEEIGIRKDE -----------------1111---------------1111-------------------- SLDEAATRSKAWMVDALAQKKKVMGFGHRVYKNGDSRVPTMKSALDAMIKHYDRPEMLGL 3333------------1111--2222--------1111-----------11113333--- YNGLEAAMEEAKQIKPNLDYPAGPTYNLMGFDTEMFTPLFIAARITGWTAHIMEQVADNA -------------------------------3333------------------------- LIRPLSEYNGPEQRQVP ----------------- >FRUCTOSE-1,6-BISPHOSPHATE; SWP:P14223; PDB:1A5CA; LPADVAEELATTAQKLVQAGKGILAADESTQTIKKRFDNIKLENTIENRASYRDLLFGTK -3333------------2222---------3333---1111--------------1111- GLGKFISGAILFEETLFQKNEAGVPMVNLLHNENIIPGIKVDKGLVNIPCTDEEKSTQGL 
3333---------3333--1111-3333--1111-------------------------2 DGLAERCKEYYKAGARFAKWRTVLVIDTAKGKPTDLSIHETAWGLARYASICQQNRLVPI 222------------------------1111----------------------------- VEPEILADGPHSIEVCAVVTQKVLSCVFKALQENGVLLEGALLKPNMVTAGYECTAKTTT ------------------------------------3333----------1111------ QDVGFLTVRTLRRTVPPALPGVVFLSGGQSEEEASVNLNSINALGPHPWALTFSYGRALQ ---------------3333------!!!!-----------3333-----------3333- ASVLNTWQGKKENVAKAREVLLQRAEANSLATYGKYKGGAGG ------iiii1111----------------1111-------- >MONOCLONAL ANTI-E-SELECTI; SWP:NA; PDB:1A5FH; EVALQQSGAELVKPGASVKLSCAASGFTIKDAYMHWVKQKPEQGLEWIGRIDSGSSNTNY ---------------------------3333--------2222----------------- DPTFKGKATITADDSSNTAYLQMSSLTSEDTAVYYCARVYAMDYWGQGTSVTVSSAKTTP 1111-----------------------1111----------------------------- PSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLS -----------------------------------%%%%--2222--------------- SSVSVPTSTETVTCNVAHAPSSTKVDKKIVPR -------------------------------- >TISSUE PLASMINOGEN ACTIVA; SWP:P00750; PDB:1A5HA; IKGGLFADIASHPWQAAIFAKHRRSP -------33331111----------- -- >DELTA PRIME; SWP:P28631; PDB:1A5T; MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLCQQPQGHKSCG ---11113333--------------------2222------------------!!!!--- HCRGCQLMQAGTHPDYYTLAPEKGKNTLGVDAVREVTEKLNEHARLGGAKVVWVTDAALL ------------1111-----1111-------------1111-1111------------- TDAAANALLKTLEEPPAETWFFLATREPERLLATLRSRCRLHYLAPPPEQYAVTWLSREV ----------1111------------3333-33331111--------------------- TMSQDALLAALRLSAGSPGAALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQ -----------1111------3333------------------1111-3333-------- APARLHWLATLLMDALKRVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQLMSVTGI -------------1111---1111------------------------------------ NRELLITDLLLRIEHYLQPGVVLP -------------33332222--- >L-LACTATE DEHYDROGENASE; SWP:P16115; PDB:1A5Z; MKIGIVGLGRVGSSTAFALLMKGFAREMVLIDVDKKRAEGDALDLIHGTPFTRRANIYAG --------3333-----------------------------------3333--------- 
DYADLKGSDVVIVAAGVPQKPGETRLQLLGRNARVMKEIARNVSKYAPDSIVIVVTNPVD ----2222---------------3333-------------------1111-------333 VLTYFFLKESGMDPRKVFGSGTVLDTARLRTLIAQHCGFSPRSVHVYVIGEHGDSEVPVW 3-----------1111---!!!!----------------3333---------1111--33 SGAMIGGIPLQNMCQVCQKCDSKILENFAEKTKRAAYEIIERKGATHYAIALAVADIVES 33--iiii------------------------------3333------------------ IFFDEKRVLTLSVYLEDYLGVKDLCISVPVTLGKHGVERILELNLNEEELEAFRKSASIL 1111-------------iiii-----------1111------------------------ KNAINEITAEEN ------1111-- >RHO; SWP:P0AG30; PDB:1A62; NLTELKNTPVSELITLGENGLENLARRKQDIIFAILKQHAKSGEDIFGDGVLEILQDGFG ----1111--------------3333-------------1111-----------3333-- FLRSADSSYLAGPDDIYVSPSQIRRFNLRTGDTISGKIRPPKEGERYFALLKVNEVNFDK ---1111----1111-------------2222---------2222----------%%%%- PE -- >CORE NFATC1; SWP:O95644; PDB:1A66A; MKDWQLPSHSGPYELRIEVQPKSHHRARYETEGSRGAVKASAGGHPIVQLHGYLENEPLM ---------!!!!---------------3333---------------------------- LQLFIGTADDRLLRPHAFYQVHRITGKTVSTTSHEAILSNTKVLEIPLLPENSMRAVIDC ------------------------1111-----------------------%%%%----- AGILKLRNSDIELRKGETDIGRKNTRVRLVFRVHVPQPSGRTLSLQVASNPIECSQRS ------3333-3333------------------------------------------- >HLA-DR3; SWP:P01903; PDB:1A6AA; HVIIQAEFYLNPDQSGEFMFDFDGDEIFHVDMAKKETVWRLEEFGRFASFEAQGALANIA ----------1111-------iiii------1111-----33331111------------ VDKANLEIMTKRSNYTPITNVPPEVTVLTNSPVELREPNVLICFIDKFTPPVVNVTWLRN ----------1111---------------------------------------------- GKPVTTGVSETVFLPREDHLFRKFHYLPFLPSTEDVYDCRVEHWGLDEPLLKHWEF ---------------1111------------------------------------- >HLA class II histocompati; SWP:P01912; PDB:1A6AB; PRFLEYSTSECHFFNGTERVRYLDRYFHNQEENVRFDSDVGEFRAVTELGRPDAEYWNSQ --------------!!!!---------iiii-----3333------3333---------3 KDLLEQKRGRVDNYCRHNYGVVESFTVQRRVHPKVTVYPSKTQPLQHHNLLVCSVSGFYP 333----------------------1111------------------------------- GSIEVRWFRNGQEEKTGVVSTGLIHNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPL --------iiii----------------------------------------3333---- 
TVEWRAR ------- >TOBACCO RINGSPOT VIRUS CA; SWP:Q88894; PDB:1A6CA; AVTVVPDPTCCGTLSFKVPKDAKKGKHLGTFDIRQAIMDYGGLHSQEWCAKGIVNPTFTV ------3333-------------------------------1111--------------- RMHAPRNAFAGLSIACTFDDYKRIDLPALGNECPPSEMFELPTKVFMLKDADVHEWQFNY -------------------------1111---3333--------3333------------ GELTGHGLCNWANVATQPTLYFFVASTNQVTMAADWQCIVTMHVDMGPVIDRFELNPTMT ------------------------------------------------------------ WPIQLGDTFAIDRYYEAKEIKLDGSTSMLSISYNFGGPVKHSKKHAISYSRAVMSRNLGW ------------------------------------------------------------ SGTISGSVKSVSSLFCTASFVIFPWECEAPPTLRQVLWGPHQIMHGDGQFEIAIKTRLHS ------------3333----------------3333------------------------ AATTEEGFGRLGILPLSGPIAPDAHVGSYEFIVHINTWRPDSQVHPPMFSSSELYNWFTL ---------------------3333----------------------------------- TNLKPDANTGVVNFDIPGYIHDFASKDATVTLASNPLSWLVAATGWHYGEVDLCISWSRS ----------------------------------------1111---------------- KQAQAQEGSVSITTNYRDWGAYWQGQARIYDLRRTEAEIPIFLGSYAGATPSGALGKQNY -3333------------------------------------------------------- VRISIVNAKDIVALRVCLRPKSIKFWGRSATLF --------------------------------- >Thermosome subunit beta; SWP:P48425; PDB:1A6DB; KDAMKENIEAAIAISNSVRSSLGPRGMDKMLVDSLGDIVITNDGVTILKEMDVEHPAAKM -----------------1111-1111------1111------3333-------------- MVEVSKTQDSFVGDGTTTAVIIAGGLLQQAQGLINQNVHPTVISEGYRMASEEAKRVIDE ---------------------------------1111-3333------------------ ISTKIGADEKALLLKMAQTSLNSKSASVAKDKLAEISYEAVKSVAELRDGKYYVDFDNIQ -----1111--------------1111---------------------------1111-- VVKKQGGAIDDTQLINGIIVDKEKVHPGMPDVVKDAKIALLDAPLEIKKPEFDTNLRIED -------------------------1111------------------------------- PSMIQKFLAQEENMLREMVDKIKSVGANVVITQKGIDDMAQHYLSRAGIYAVRRVKKSDM ----------------------1111---------------------------------- DKLAKATGASIVSTIDEISSSDLGTAERVEQVKVGEDYMTFVTGCKNPKAVSILVRGETE -------------3333-3333------------------------------------33 HVVDEMERSITDSLHVVASALEDGAYAAGGGATAAEIAFRLRSYAQKIGGRQQLAIEKFA 
33--------------------------iiii-----------------3333------- DAIEEIPRALAENAGLDPIDILLKLRAEHAKGNKTYGINVFTGEIEDMVKNGVIEPIRVG -----------------3333-----------1111----------3333-----3333- KQAIESATEAAIMILRIDDVIA -------------1111----- >RIBONUCLEASE P PROTEIN; SWP:P25814; PDB:1A6F; AHLKKRNRLKKNEDFQKVFKHGTSVANRQFVLYTLDQPENDELRVGLSVSKKIGNAVMRN ---3333-----------------------------1111---------------3333- RIKRLIRQAFLEEKERLKEKDYIIIARKPASQLTYEETKKSLQHLFRKSSLYK ------------3333----------1111----------------------- >TETRACYCLINE REPRESSOR PR; SWP:P0ACT4; PDB:1A6I; SRLDKSKVINSALELLNEVGIEGLTTRKLAQKLGVEQPTLYWHVKNKRALLDALAVEILA --------------------1111-----------33333333----------------- RHHDYSLPAAGESWQSFLRNNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRFMT -------------------------------2222------------------------1 ENGFSLRDGLYAISAVSHFTLGAVLEQQEHLPPLLREALQIMDSDDGEQAFLHGLESLIR 111-----------------------1111------------------------------ GFEVQLTALLQIV -3333--!!!!-- >NITROGEN REGULATORY IIA P; SWP:P31222; PDB:1A6JA; LQLSSVLNRECTRSRVHCQSKKRALEIISELAAKQLSLPPQVVFEAILTREKMGSTGIGN -3333--3333---------------------------3333--------1111----%% GIAIPHGKLEEDTLRAVGVFVQLETPIAFDAIDNQPVDLLFALLVPADQTKTHLHTLSLV %%-------3333--------------------------------1111-1111------ AKRLADKTICRRLRAAQSDEELYQIITDTE ------------1111-3333--------- >MYOGLOBIN; SWP:P02185; PDB:1A6M; VLSEGEWQLVLHVWAKVEADVAGHGQDILIRLFKSHPETLEKFDRFKHLKTEAEMKASED ---------------------------------------------3333----------- LKKHGVTVLTALGAILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISEAIIHVLHSRHP ---------------1111--3333--------------3333---------------33 GDFGADAQGAMNKALELFRKDIAAKYKELGY 33----------------------------- >PHOSPHATASE 2C; SWP:P35813; PDB:1A6Q; GAFLDKPKMEKHNAQGQGNGLRYGLSSMQGWRVEMEDAHTAVIGLPSGLESWSFFAVYDG -----------------iiii-------!!!!-------------iiii----------- HAGSQVAKYCCEHLLDHITNNQDFKGSAGAPSVENVKNGIRTGFLEIDEHMRVMSEKKHG ---3333-------------3333------------------------------------ ADRSGSTAVGVLISPQHTYFINCGDSRGLLCRNRKVHFFTQDHKPSNPLEKERIQNAGGS 
-------------------------------%%%%--------1111-------1111-- VMIQRVNGSLAVSRALGDFDYKCVHGKGPTEQLVSPEPEVHDIERSEEDDQFIILACDGI -----iiii--------3333--22221111--------------3333-------3333 WDVMGNEELCDFVRSRLEVTDDLEKVCNEVVDTCLYKGSRDNMSVILICFPNAPKVSPEA 1111------------------------------1111---------------------- VKKEAELDKYLECRVEEIIKGVPDLVHVMRTLASENIPSLPPGGELASKRNVIEAVYNRL ----------------3333---3333-----1111--------3333------------ NPY --- >GAG POLYPROTEIN; SWP:P03322; PDB:1A6S; GEAVIKVISSACKTYCGKTSPSKKEIGAMLSLLQKEGLLMSPSDLYSPGSWDPITAALSQ 3333---------------------------1111-------3333----3333------ RAMILGKSGELKTWGLVLGALKAAREE 1111----------------------- >FAB1-IA; SWP:Q5XFY8; PDB:1A6TA; QSVLSQSPAILSASPGEKVIMTCSPSSSVSYMQWYQQKPGSSPKPWIYSTSNLASGVPGR -------------2222--------------------2222------------2222111 FSGGGSGTSFSLTISGVEAEDAATYYCQQYSSHPLTFGGGTKLELKRADAAPTVSIFPPS 1----------------1111--------------------------------------- SEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL ----------------------------iiii---------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFNR ----------------1111---------- >FAB1-IA; SWP:NA; PDB:1A6TB; EVQLQQSGPDLVKPGASVKISCKASGYSFSTYYMHWVKQSHGKSLEWIGRVDPDNGGTSF ------------2222-----------1111--------2222----------------- NQKFKGKAILTVDKSSSTAYMELSL -1111-------------------- >Ig heavy chain V region B; SWP:P01751; PDB:1A6VH; QVQLQQPGAELVKPGASVKLSCKASGYTFTSYWMHWVKQRPGRGLEWIGRIDPNSGGTKY ----------------------------1111---------------------------- NEKFKSKATLTVDKPSSTAYMQLSSLTSEDSAVYYCARYDYYGSSYFDYWGQGTTVTV 3333---------1111----------------------------------------- >REVERBA ORPHAN NUCLEAR RE; SWP:P20393; PDB:1A6YA; LLCKVCGDVASGFHYGVLACEGCKGFFRRSIQQNIQYKRCLKNENCSIVRINRNRCQQCR -------------iiii--3333---3333-----------%%%%------1111-3333 FKKCLSVGMSRDAVRFGR ---------1111----- >HFE; SWP:Q30201; PDB:1A6ZA; RSHSLHYLFMGASEQDLGLSLFEALGYVDDQLFVFYDHESRRVEPRTPWVSSRISSQMWL -------------2222----------%%%%------1111-----33333333------ 
QLSQSLKGWDHMFTVDFWTIMENHNHSKESHTLQVILGCEMQEDNSTEGYWKYGYDGQDH -----------------------------------------1111---------%%%%-- LEFCPDTLDWRAAEPRAWPTKLEWERHKIRARQNRAYLERDCPAQLQQLLELGRGVLDQQ ----1111-----3333-----1111--------------------------2222---- VPPLVKVTHHVTSSVTTLRCRALNYYPQNITMKWLKDKQPMDAKEFEPKDVLPNGDGTYQ -----------1111--------------------%%%%--1111--------1111--- GWITLAVPPGEEQRYTCQVEHPGLDQPLIVIW ----------3333------3333-------- >FERREDOXIN; SWP:P00221; PDB:1A70; AAYKVTLVTPTGNVEFQCPDDVYILDAAEEEGIDLPYSCRAGSCSSCAGKLKTGSLNQDD --------1111------1111------1111------------1111---------111 QSFLDDDQIDEGWVLTCAAYPVSDVTIETHKKEELTA 1-------------1111------------1111--- >INTRON 3 (I-PPO) ENCODED ; SWP:Q94702; PDB:1A73A; ALTNAQILAVIDSWEETVGQFPVITHHVPLGGGLQGTLHCYEIPLAAPYGVGFAKNGPTR -----------------1111---------iiii---------------2222------- WQYKRTINQVVHRWGSHTVPFLLEPDNINGKTCTASHLCHNTRCHNPLHLCWESLDDNKG ------iiii----11111111-----%%%%-----11113333-1111----------- RNWCPGPNGGCVHAVVCLRQGPLYGPGATVAGPQQRGSHFVV -----1111-----------1111------------------ >PARVALBUMIN; SWP:P02621; PDB:1A75A; AGILADADCAAAVKACEAADSFSYKAFFAKCGLSGKSADDIKKAFVFIDQDKSGFIEEDE -1111------3333----------------3333-------------1111----3333 LKLFLQVFKAGARALTDAETKAFLKAGDSDGDGAIGVEEWVALVKA ----33332222---------------1111----------3333- >FLAP ENDONUCLEASE-1 PROTE; SWP:Q58839; PDB:1A76; GVQFGDFIPKNIISFEDLKGKKVAIDGMNALYQFLTSIRLRDGSPLRNRKGEITSAYNGV ----1111-----33332222------------------1111----1111--------- FYKTIHLLENDITPIWVFDGEPPKLKEKTRKVRREMKEKAELKMKEAIKKEDFEEAAKYA ------------------------------------------------3333--333333 KRVSYLTPKMVENCKYLLSLMGIPYVEAPSEGEAQASYMAKKGDVWAVVSQDYDALLYGA 33---------------------------------------------------3333--- PRVVRNLTTTKEMPELIELNEVLEDLRISLDDLIDIAIFMGTDYNPGGVKGIGFKRAYEL -----------------------------------------1111--------------- VRSGVAKDVLKKEVEYYDEIKRIFKEPKVTDNYSLSLKLPDKEGIIKFLVDENDFNYDRV 1111---------2222------------------------------------------- KKHVDKLYNLIANKT --------------- 
>GALECTIN-1; SWP:P56217; PDB:1A78A; ASAGVAVTNLNLKPGHCVEIKGSIPPDCKGFAVNLGEDASNFLLHFNARFDLHGDVNKIV ------------2222---------------------1111----------iiii----- CNSKEADAWGSEQREEVFPFQQGAEVMVCFEYQTQKIIIKFSSGDQFSFPVRKVLPSIPF ----iiii------------------------1111----1111---------------- LSLEGLAFKSITTE -------------- >TRNA ENDONUCLEASE; SWP:Q58819; PDB:1A79A; KITGLLDGDRVIVFDKNGISKLSARHYGNVEGNFLSLSLVEALYLINLGWLEVKYKDNKP ------!!!!----3333----1111---------------------------------- LSFEELYEYARNVEERLCLKYLVYKDLRTRGYIVKTGLKYGADFRLYERGANIDKEHSVY ---------------------------1111-----3333-------------------- LVKVFPEDSSFLLSELTGFVRVAHSVRKKLLIAIVDADGDIVYYNMTYVKP -----------------------------------1111------------ >Regulatory protein E2; SWP:P17383; PDB:1A7GE; ATTPIIHLKGDANILKCLRYRLSKYKQLYEQVSSTWHWTCTDGKHKNAIVTLTYISTSQR ------------------------3333---------3333------------------- DDFLNTVVIPNTVSVSTGYMTI ---------1111--------- >GAMMAS CRYSTALLIN; SWP:P06504; PDB:1A7HA; MYKIQIFEKGDFNGQMHETTEDCPSIMEQFHMREVHSCKVLEGAWIFYELPNYRGRQYLL ---------%%%%-----------3333----------------------%%%%------ DKKEYRKPVDWGAASPAVQSFRRIVE ------3333---------------- >QCRP2 (LIM1); SWP:Q05158; PDB:1A7I; NKCGACGRTVYHAEEVQCDGRSFHRCCFLCMVCRKNLDSTTVAIHDAEVYCKSCYGKKYG -----------------------3333-----------------!!!!----3333---- >PHOSPHORIBULOKINASE; SWP:P12033; PDB:1A7J; SKKHPIISVTGSSTSTVKHTFDQIFRREGVKAVSIEGDAFHRFNRADMKAELDRRYAAGD 1111-------------------------------3333--------------3333--- ATFSHFSYEANELKELERVFREYGETGQGRTRTYVARTGVAPGNFTDWRDFDSDSHLLFY ---33331111-----------------------------2222---------------- EGLHGAVVNSEVNIAGLADLKIGVVPVINLEWIQKIHRDRATRGYTTEAVTDVILRRMHA -------------3333---------3333------1111--------3333-3333--- YVHCIVPQFSQTDINFQRVPVVDTSNPFIARWIPTADESVVVIRFRNPRGIDFPYLTSMI -------------------------1111-----3333-------------3333----2 HGSWMSRANSIVVPGNKLDLAMQLILTPLIDRVVRESKV 222----------11113333-----------1111--- >MALE-B363; SWP:P02928; PDB:1A7LA; 
EGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWA --------1111--------------------------3333----3333---------3 HDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP 333----1111-------33331111--3333---iiii--------------------- NPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGV -----1111------3333----------3333-----1111------%%%%-3333--- DNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYG ---------------------------------1111-------3333------------ VTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVA ------iiii-------------1111---------------3333-------------- LKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKD 3333--3333----------1111-----1111-------------------3333--33 GS 33 >Ig heavy chain V region P; SWP:P01820; PDB:1A7QH; QVQLQESGPGLVAPSQSLSITCTVSGFSLTGYGVNWVRQLPGKGLEWLGMIWGDGNTAYN ---------------------------1111--------2222--------1111----- SALKSRLSISKDNSKSQVFLEMDSLHTDDTARYYCARERDYRLDYWGQGTTVTVSS 1111--------1111---------1111--------------------------- >Ig kappa chain V-V region; SWP:P01635; PDB:1A7QL; DIVLTQSPASLSASVGETVTITCRAGGNTHNYLAWYQQKQGKSPQLLVYYTTTLAAGVPS -------------2222-----------iiii------2222------------222233 RFSGSGSGTQYSLKINSLQPDDFGSYYCQHFWSTPRSFGGGTKLEI 33----------------3333------------------------ >HEPARIN BINDING PROTEIN; SWP:P20160; PDB:1A7S; IVGGRKARPRQFPFLASIQNQGRHFCGGALIHARFVMTAASCFPGVSTVVLGAYDLRRRE -------22221111----iiii--------1111---3333------------1111-1 RQSRQTFSISSMSENGYDPQQNLNDLMLLQLDREANLTSSVTILPLPLQNATVEAGTRCQ 111---------------1111---------------1111------2222--2222--- VAGWGSQRSGGRLSRFPRFVNVTVTPEDQCRPNNVCTGVLTRRGGICNGDGGTPLVCEGL -------2222--------------3333-3333------------2222------iiii AHGVASFSLGPCGRGPDFFTRVALFRDWIDGVLNNPGPGPA ----------2222------3333----------------- >METALLO-BETA-LACTAMASE; SWP:P25910; PDB:1A7TA; SVKISDDISITQLSDKVYTYVSLAEIEGWGMVPSNGMIVINNHQAALLDTPINDAQTEML ----1111-----1111-----------------------%%%%---------------- 
VNWVTDSLHAKVTTFIPNHWHGDCIGGLGYLQRKGVQSYANQMTIDLAKEKGLPVPEHGF --------------------11111111--------------------1111-------- TDSLTVSLDGMPLQCYYLGGGHATDNIVVWLPTENILFGGCMLKDNQTTSIGNISDADVT -------iiii--------------------1111----3333-1111----------11 AWPKTLDKVKAKFPSARYVVPGHGNYGGTELIEHTKQIVNQYIESTS 11----------1111------------------------------- >CHLOROPEROXIDASE T; SWP:O31168; PDB:1A7UA; PFITVGQENSTSIDLYYEDHGAGQPVVLIHGFPLSGHSWERQSAALLDAGYRVITYDRRG -------!!!!-----------------------33333333----1111-------222 FGQSSQPTTGYDYDTFAADLNTVLETLDLQDAVLVGFSMGTGEVARYVSSYGTARIAKVA 2---------------------------------------------------1111---- FLASLEPFLLKTDDNPDGAAPKEFFDGIVAAVKADRYAFYTGFFNDFYNLDENLGTRISE -----------1111-----3333-----------3333------1111---2222---- EAVRNSWNTAASGGFFAAAAAPTTWYTDFRADIPRIDVPALILHGTGDRTLPIENTARVF ---------1111------33331111-11111111--------1111---3333----- HKALPSAEYVEVEGAPHGLLWTHAEEVNTALLAFLAK ---1111----------3333------------3333 >HISTONE HMFB; SWP:P19267; PDB:1A7W; MELPIAPIGRIIKDAGAERVSDDARITLAKILEEMGRDIASEAIKLARHAGRKTIKAEDI --------------------3333-----------------------1111----3333- ELAVRRFK ---3333- >SYK KINASE; SWP:P43405; PDB:1A81A; SANHLPFFFGNITREEAEDYLVQGGMSDGLYLLRQSRNYLGGFALSVAHGRKAHHYTIER --------------------------2222------------------%%%%-------- ELNGTYAIAGGRTHASPADLCHYHSQESDGLVCLLKKPFNRPQGVQPKTGPFEDLKENLI 3333---2222----3333----------------------------------------- REYVKQTWNLQGQALEQAIISQKPQLEKLIATTAHEKMPWFHGKISREESEQIVLIGSKT ----------------------------3333-33333333----------1111----2 NGKFLIRARDNNGSYALCLLHEGKVLHYRIDKDKTGKLSIPEGKKFDTLWQLVEHYSYKA 222-----------------------------1111---2222----------------- DGLLRVLTVPCQKI -------------- >COLICIN N; SWP:P08083; PDB:1A87; SAKVGEITITPDNSKPGRYISSNPEYSLLAKLIDAESIKGTEVYTFHTRKGQYVKVTVPD ---!!!!----3333-------3333%%%%-------iiii-------2222------%% SNIDKMRVDYVNWKGPKYNNKLVKRFVSQFLLFRKEEKEKNEKEALLKASELVSGMGDKL %%1111------------3333-------------------------------------- 
GEYLGVKYKNVAKEVANDIKNFHGRNIRSYNEAMASLNKVLANPKMKVNKSDKDAIVNAW ------------------11111111----------------3333--3333-------- KQVNAKDMANKIGNLGKAFKVADLAIKVEKIREKSIEGYNTGNWGPLLLEVESWIIGGVV ---------------3333-----------------------------------1111-- AGVAISLFGAVLSFLPISGLAVTALGVIGIMTISYLSSFIDANRVSNINNIISSVIR -----------1111-2222----------------11113333------3333--- >CHLOROPEROXIDASE L; SWP:P49323; PDB:1A88A; GTVTTSDGTNIFYKDWGPRDGLPVVFHHGWPLSADDWDNQMLFFLSHGYRVIAHDRRGHG ----1111---------1111-----------3333--------1111-------2222- RSDQPSTGHDMDTYAADVAALTEALDLRGAVHIGHSTGGGEVARYVARAEPGRVAKAVLV ---------------------------------------------11112222------- SAVPPVMVKSDTNPDGLPLEVFDEFRAALAANRAQFYIDVPSGPFYGFNREGATVSQGLI ---------1111----3333-----------3333-------1111--2222--3333- DHWWLQGMMGAANAHYECIAAFSETDFTDDLKRIDVPVLVAHGTDDQVVPYADAAPKSAE ------3333--------------------1111---------------3333------- LLANATLKSYEGLPHGMLSTHPEVLNPDLLAFVKS ---------------3333--3333---------- >TETANUS NEUROTOXIN; SWP:P04958; PDB:1A8D; MKNLDCWVDNEEDIDVILKKSTILNLDINNDIISDISGFNSSVITYPDAQLVPGINGKAI ------------------1111------%%%%-------------1111----------- HLVNNESSEVIVHKAMDIEYNDMFNNFTVSFWLRVPKVSASHLEQYGTNEYSIISSMKKH ----1111------1111-1111----------------------1111----------- SLSIGSGWSVSLKGNNLIWTLKDSAGEVRQITFRDLPDKFNAYLANKWVFITITNDRLSS iiii------------------1111-----------3333--------------1111- ANLYINGVLMGSAEITGLGAIREDNNITLKLDRCNNNNQYVSIDKFRIFCKALNPKEIEK ----iiii---------------------------1111--------------------- LYTSYLSITFLRDFWGNPLRYDTEYYLIPVASSSKDVQLKNITDYMYLTNAPSYTNGKLN --1111------1111------------1111--------2222------------1111 IYYRRLYNGLKFIIKRYTPNNEIDSFVKSGDFIKLYVSYNNNEHIVGYPKDGNAFNNLDR ---------------------------2222-------%%%%------2222-------- ILRVGYNAPGIPLYKKMEAVKLRDLKTYSVQLKLYDDKNASLGLVGTHNGQIGNDPNRDI -------2222------------1111------------------------!!!!----- LIASNWYFNHLKDKILGCDWYFVPTDEGWTND ----3333-1111--1111------1111--- >PROTEIN DISULFIDE OXIDORE; SWP:Q51760; PDB:1A8L; 
MGLISDADKKVIKEEFFSKMVNPVKLIVFVRKDHCQYCDQLKQLVQELSELTDKLSYEIV ----------------1111--------------1111---------1111--------- DFDTPEGKELAKRYRIDRAPATTITQDGKDFGVRYFGLPAGHEFAAFLEDIVDVSREETN 3333-------1111----------iiii------------------------------- LMDETKQAIRNIDQDVRILVFVTPTCPYCPLAVRMAHKFAIENTKAGKGKILGDMVEAIE --------1111-------------1111---------------------------3333 YPEWADQYNVMAVPKIVIQVNGEDRVEFEGAYPEKMFLEKLLSALS -----1111----------iiii------------------1111- >HIV CAPSID; SWP:P12493; PDB:1A8O; DIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWTETLLVQNANPDCKTILKALGPGATLE ---------------------1111-----------3333-----------------333 ETACQG 31111- >NADPH\:FERREDOXIN OXIDORE; SWP:Q44532; PDB:1A8P; SNLNVERVLSVHHWNDTLFSFKTTRNPSLRFENGQFVMIGLEVDGRPLMRAYSIASPNYE -------------------------3333---------------------------1111 EHLEFFSIKVQNGPLTSRLQHLKEGDELMVSRKPTGTLVTSDLLPGKHLYMLSTGTGLAP --------------333311112222------------3333----------------33 FMSLIQDPEVYERFEKVVLIHGVRQVNELAYQQFITEHLPQSEYFGEAVKEKLIYYPTVT 331111-3333-------------1111---------3333------------------- RESFHNQGRLTDLMRSGKLFEDIGLPPINPQDDRAMICGSPSMLDESCEVLDGFGLKISP --------3333-3333----------------------------------1111----- RMGEPGDYLIERAFVEK ----------------- >BROMOPEROXIDASE A1; SWP:P33912; PDB:1A8Q; PICTTRDGVEIFYKDWGQGRPVVFIHGWPLNGDAWQDQLKAVVDAGYRGIAHDRRGHGHS ----1111----------------------------------1111-------2222--- TPVWDGYDFDTFADDLNDLLTDLDLRDVTLVAHSMGGGELARYVGRHGTGRLRSAVLLSA ------------------------------------------------1111-------- IPPVMIKSDKNPDGVPDEVFDALKNGVLTERSQFWKDTAEGFFSANRPGNKVTQGNKDAF -------3333----3333---------------------------2222--3333---- WYMAMAQTIEGGVRCVDAFGYTDFTEDLKKFDIPTLVVHGDDDQVVPIDATGRKSAQIIP ---1111-----------------3333------------------3333---3333-22 NAELKVYEGSSHGIAMVPGDKEKFNRDLLEFLNK 22----------3333------------------ >GTP CYCLOHYDROLASE I; SWP:P27511; PDB:1A8RA; PSLSKEAALVHEALVARGLETPLRPPVHEMDNETRKSLIAGHMTEIMQLLNLDLADDSLM -----------------------------------------------1111-33331111 
ETPHRIAKMYVDEIFSGLDYANFPKITLIENKMKVDEMVTVRDITLTSTCESHFVTIDGK ------------1111--3333--------3333-------------------------- ATVAYIPKDSVIGLSKINRIVQFFAQRPQVQERLTQQILIALQTLLGTNNVAVSIDAVHY ------------3333-------------3333-------------------------11 CVKARGIRDATSATTTTSLGGLFKSSQNTRHEFLRAVRHHN 11--!!!!--------------------------------- >CHLOROPEROXIDASE F; SWP:O31158; PDB:1A8S; TTFTTRDGTQIYYKDWGSGQPIVFSHGWPLNADSWESQMIFLAAQGYRVIAHDRRGHGRS ----1111----------------------3333--------1111-------2222--- SQPWSGNDMDTYADDLAQLIEHLDLRDAVLFGFSTGGGEVARYIGRHGTARVAKAGLISA ------------------------------------------------1111-------- VPPLMLKTEANPGGLPMEVFDGIRQASLADRSQLYKDLASGPFFGFNQPGAKSSAGMVDW -------1111----3333----------------------1111--2222--------- FWLQGMAAGHKNAYDCIKAFSETDFTEDLKKIDVPTLVVHGDADQVVPIEASGIASAALV ----3333--------------------1111---------------3333--------2 KGSTLKIYSGAPHGLTDTHKDQLNADLLAFIKG 222----2222--3333---------------- >CALSEQUESTRIN; SWP:P07221; PDB:1A8Y; GLDFPEYDGVDRVINVNAKNYKNVFKKYEVLALLYHEPPEDDKASQRQFEMEELILELAA ----------------3333---------------------------------------- QVLEDKGVGFGLVDSEKDAAVAKKLGLTEEDSIYVFKEDEVIEYDGEFSADTLVEFLLDV --1111------------------------------------------------------ LEDPVELIEGERELQAFENIEDEIKLIGYFKNKDSEHYKAFKEAAEEFHPYIPFFATFDS ---------------3333------------1111------------------------- KVAKKLTLKLNEIDFYEAFMEEPVTIPDKPNSEEEIVNFVEEHRRSTLRKLKPESMYETW ----------------2222-------------------------------1111--333 EDDMDGIHIVAFAEEADPDGYEFLEILKSVAQDNTDNPDLSIIWIDPDDFPLLVPYWEKT 3------------3333----------------3333--------33333333------- FDIDLSAPQIGVVNVTDADSVWMEPSAEELEDWLEDVL -------------------------------------- -------------------------------------------------- -------------------------- -------------------------- >PUTRESCINE-BINDING PROTEI; SWP:P31133; PDB:1A99A; QKTLHIYNWSDYIAPDTVANFEKETGIKVVYDVFDSNEVLEGKLMAGSTGFDLVVPSASF -------------1111------------------------------------------- LERQLTAGVFQPLDKSKLPEWKNLDPELLKLVAKHDPDNKFAMPYMWATTGIGYNVDKVK 
----1111-----33331111----------33332222--------------------- AVLGENAPVDSWDLILKPENLEKLKSCGVSFLDAPEEVFATVLNYLGKDPNSTKADDYTG ---111111113333-3333---3333-------3333-----1111------------- PATDLLLKLRPNIRYFHSSQYINDLANGDICVAIGWAGDVWQASNRAKEAKNGVNVSFSI --------3333------------------------------------------------ PKEGAMAFFDVFAMPADAKNKDEAYQFLNYLLRPDVVAHISDHVFYANANKAATPLVSAE 1111----------1111--------------3333---3333------33331111333 VRENPGIYPPADVRAKLFTLKVQDPKIDRVRTRAWTKVKSG 3--1111---------------------------------- >U2 RNA HAIRPIN IV; SWP:P09661; PDB:1A9NA; VKLTAELIEQAAQYTNAVRDRELDLRGYKIPVIENLGATLDQFDAIDFSDNEIRKLDGFP ---33331111----1111----------------3333--------------------- LLRRLKTLLVNNNRICRIGEGLDQALPDLTELILTNNSLVELGDLDPLASLKSLTYLCIL -1111---------------3333-1111-----------33333333--1111------ RNPVTNKKHYRLYVIYKVPQVRVLDFQKVKLKERQEAEKMFK -3333-2222-------3333--%%%%-----------2222 >U2 small nuclear ribonucl; SWP:P08579; PDB:1A9NB; IRPNHTIYINNMNDKIKKEELKRSLYALFSQFGHVVDIVALKTMKMRGQAFVIFKELGSS ----------------3333----------------------3333-------------- TNALRQLQGFPFYGKPMRIQYAKTDSDIISKMRG ------2222-iiii------------------- >MAP KINASE P38; SWP:Q16539; PDB:1A9U; ERPTFYRQELNKTIWEVPERYQNLSPVGSGAYGSVCAAFDTKTGLRVAVKKLSRPFQSII ---------%%%%----3333--------3333--------------------------- HAKRTYRELRLLKHMKHENVIGLLDVFTPARSLEEFNDVYLVTHLMGADLNNIVKCQKLT ----------------1111-----------3333------------------1111--3 DDHVQFLIYQILRGLKYIHSADIIHRDLKPSNLAVNEDCELKILDFGLARHTDDEMTGYV 333---------------1111------3333---1111--------------------3 ATRWYRAPEIMLNWMHYNQTVDIWSVGCIMAELLTGRTLFPGTDHIDQLKLILRLVGTPG 333------1111----------------------------------------------3 AELLKKISSESARNYIQSLTQMPKMNFANVFIGANPLAVDLLEKMLVLDSDKRITAAQAL 3331111--------1111------3333-22223333----------3333-------- AHAYFAQYHDPDDEPVADPYDQSFESRDLLIDEWKSLTYDEVISFVPPPLD -3333----1111--------3333----3333------------------ >Hemoglobin subunit epsilo; SWP:P02100; PDB:1A9WE; VHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPKV 
--------------1111----------------333333333333-----------333 KAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFGK 3-----------333333333333--------------3333---------------!!! EFTPEVQAAWQKLVSAVAIALAHKY !------------------------ >CARBAMOYL PHOSPHATE SYNTH; SWP:P00968; PDB:1A9XA; MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVINVNSNPATIMTDPEM ------------------22223333--------------------------33331111 ADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATA ------------------------------------------3333-------------- DAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGG ----------------1111-----------------------------------2222- IAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAM ------------------1111-------2222---------1111-------------- GIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEM --3333------------------------------------------------------ NPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIP -------------------------1111-3333---1111------------------- RFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEA --3333-----------------------------------------------1111--- LTKIRRELKDAGADRIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVG -----------11113333-------------------3333-----------------3 ITGLNADFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDT 333---------1111------------3333-------------------%%%%----- AYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVN ------------------------------22223333-----------3333------- CNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVP -33331111------------------------------------3333----------- VIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRAAMEI -------------------------------------1111--3333------------- VYDEADLRRYFQTAVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACS ----------------------------------------------------3333---- LPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSK ------3333---------------------------%%%%----------1111----- 
ATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTG ----3333---------3333-------------------33331111------------ EVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATH ----------------------------------3333-3333------1111------- GTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQ ------1111-------3333-----------------------------3333------ YKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK ----------------3333-1111------------- >Carbamoyl-phosphate synth; SWP:P0A6F1; PDB:1A9XB; IKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIG -------1111--------------------------------3333------------1 NVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRL 111-3333---------------------1111--------1111--------------- LREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGL -----------------3333----------2222-3333-------------------- PQAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNG ----3333------------3333--------------1111-----1111--------- PGDPAPCDYAITAIQKFLETDIPVFGILGHQLLALASGAKTVKMKFGHHGGNHPVKDVEK ---3333---------1111--------------1111---------------------- NVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGNPEASPGPHD --------------3333-1111----------------1111-------3333----11 AAPLFDHFIELIEQYRKT 113333--------3333 >UDP-GALACTOSE 4-EPIMERASE; SWP:P09147; PDB:1A9Y; MRVLVTGGSGYIGSHTCVQLLQNGHDVIILDNLCNSKRSVLPVIERLGGKHPTFVEGDIR ------1111--------------------------3333-----------------111 NEALMTEILHDHAIDTVIHFAGLKAVGESVQKPLEYYDNNVNGTLRLISAMRAANVKNFI 1--------1111-----------3333-----------------------1111----- FSSAATVYGDQPKIPYVESFPTGTPQSPFGKSKLMVEQILTDLQKAQPDWSIALLRYFNP ----------------1111--------------------------1111---------- VGAHPSGDMGEDPQGIPNNLMPYIAQVAVGRRDSLAIFGNDYPTEDGTGVRDYIHVMDLA ---3333------------------------------------1111------------- DGHVVAMEKLANKPGVHIYNLGAGVGNSVLDVVNAFSKACGKPVNYHFAPRREGDLPAYW ---------2222--------------------------------------2222----- ADASKADRELNWRVTRTLDEMAQDTWHWQSRHPQGYPD 
-------------------------------1111--- >FIBRITIN; SWP:P10104; PDB:1AA0; VSGLNNAVQNLQVEIGNNSAGIKGQVVALNTLVNGTNPNGSTVEERGLTNSIKANETNIA 3333------------1111---------------------3333--------------- SVTQEVNTAKGNISSLQGDVQALQEAGYIPEAPRDGQAYVRKDGEWVLLSTFL ----------------------1111---------------------3333-- >RECA; SWP:P0A7G6; PDB:1AA3; INFYGELVDLGVKEKLIEKAGAWYSYKGEKIGQGKANATAWLKDNPETAKEIEKKVRELL ----------1111------------------------------3333------------ LSN --- >INFLUENZA VIRUS MATRIX PR; SWP:P03485; PDB:1AA7A; MSLLTEVETYVLSIIPSGPLKAEIAQRLEDVFAGKNTDLEVLMEWLKTRPILSPLTKGIL -3333------1111---------------1111-----------1111----------- GFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDKAVKLYRKLKREITFHGAKEISLSYS ----------%%%%--33333333------------------1111---------1111- AGALASCMGLIYNRMGAVTTEVAFGLVCATCEQIADSQ ----------------------------------3333 >HIV-1 NUCLEOCAPSID PROTEI; SWP:P05888; PDB:1AAF; MQRGNFRNQRKIIKCFNCGKEGHIAKNCRAPRKRGCWKCGKEGHQMKDCTERQAN -----------------------3333-----------------3333------- >ALZHEIMER'S DISEASE AMYLO; SWP:P05067; PDB:1AAPA; VREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCG -3333-------------------3333---------------------------- >GYRASE A; SWP:P0AES4; PDB:1AB4; VGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTI -3333------------------1111-1111---3333--------------------- VRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNY ----1111-------------1111----1111-----33333333-3333-------11 DGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLM 11------------3333--------------------------------1111333333 EHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVETIIVHEIPYQVNKARLIE 33-------------3333------------------------------2222------- KIAELVKEKRVEGISALRDESDKDGMRIVIEGEVVLNNLYSQTQLQVSFGINMVALHHGQ ---------------------3333-------------3333--------------%%%% PKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAP ------------------------------------------------------------ TPAEAKTALVANPWQLGNVAAMLEDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQK 
----------------1111--------11111111----------------11113333 LTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEIT --------------------------------------------------------- >ADENYLYL CYCLASE; SWP:P26769; PDB:1AB8A; LYHQSYDCVCVMFASIPDFKEFYTESDVNKEGLECLRLLNEIIADFDDLLSKPKFSGVEK ---------------1111---------------------------3333-3333----- IKTIGSTYMAATGLSAIPSQQYMHIGTMVEFAYALVGKLDAINKHSFNDFKLRVGINHGP ---!!!!--------------3333----------------1111--------------- VIAGVIGAQKPQYDIWGNTVNVASRMDSTGVLDKIQVTEETSLILQTLGYTCTCFVN ------------------------------2222---3333---------------- >GLUTAREDOXIN; SWP:P00276; PDB:1ABA; MFKVYGYDSNIHKCGPCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQ -------3333-------------1111----------2222-----------------2 IGLTMPQVFAPDGSHIGGFDQLREYFK 222------1111---------3333- >ABRIN-A; SWP:Q7DM12; PDB:1ABRA; EDRPIKFSTEGATSQSYKQFIEALRERLRGGLIHDIPVLPDPTTLQERNRYITVELSNSD --------------------------------iiii----3333-3333----------- TESIEVGIDVTNAYVVAYRAGTQSYFLRDAPSSASDYLFTGTDQHSLPFYGTYGDLERWA ------------------------------1111-------------------------- HQSRQQIPLGLQALTHGISFFRSGGNDNEEKARTLIVIIQMVAEAARFRYISNRVRVSIQ --3333------------------------------------------------------ TGTAFQPDAAMISLENNWDNLSRGVQESVQDTFPNQVTLTNIRNEPVIVDSLSHPTVAVL -------3333-----------------------------1111------11113333-- ALMLFVCNPPN ----------- >Abrin-a [Precursor]; SWP:P11140; PDB:1ABRB; IVEKSKICSSRYEPTVRIGGRDGMCVDVYDNGYHNGNRIIMWKCKDRLEENQLWTLKSDK --------------------iiii---2222--2222-----------1111----1111 TIRSNGKCLTTYGYAPGSYVMIYDCTSAVAEATYWEIWDNGTIINPKSALVLSAESSSMG -----------------------1111--1111----1111----1111--------222 GTLTVQTNEYLMRQGWRTGNNTSPFVTSISGYSDLCMQAQGSNVWMADCDSNKKEQQWAL 2---------1111----------------2222---------------3333------- YTDGSIRSVQNTNNCLTSKDHKQGSTILLMGCSNGWASQRWVFKNDGSIYSLYDDMVMDV 1111---3333----------2222------3333--------1111------------- KGSDPSLKQIILWPYTGKPNQIWLTLF %%%%----------------------- >DELTA SUBUNIT OF THE F1F0; SWP:P0ABA4; PDB:1ABV; 
SEFITVARPYAKAAFDFAVEHQSVERWQDMLAFAAEVTKNEQMAELLSGALAPETLAESF --3333-----------------3333--------------------------------- IAVCGEQLDENGQNLIRVMAENGRLNALPDVLEQFIHLRAVSEAT -----------------------3333------------------ >ALPHA-T-ALPHA; SWP:NA; PDB:1ABZ; DWLKARVEQELQALEARGTDSNAELRAMEAKLKAEIQK 3333--------3333---------------3333--- >GLUCOAMYLASE; SWP:P69328; PDB:1AC0; CTTPTAVAVTFDLTATTTYGENIYLVGSISQLGDWETSDGIALSADKYTSSDPLWYVTVT ----------------------------3333---3333------------1111----- LPAGESFEYKFIRIESDDSVEWESDPNREYTVPQACGTSTATVTDTWR ------------------------------------------------ >KEX1(DELTA)P; SWP:P09620; PDB:1AC5; LPSSEEYKVAYELLPGLSEVPDPSNIPQMHAGHIPLRSEDADEQDSSDLEYFFWKFTNND ----1111-33332222----1111----------------3333--------------3 SNGNVDRPLIIWLNGGPGCSSMDGALVESGPFRVNSDGKLYLNEGSWISKGDLLFIDQPT 333-------------------------------1111----1111-1111-------22 GTGFSVEQNKDEGKIDKNKFDEDLEDVTKHFMDFLENYFKIFPEDLTRKIILSGESYAGQ 22--------3333-1111----------------------3333--------------- YIPFFANAILNHNKFSKIDGDTYDLKALLIGNGWIDPNTQSLSYLPFAMEKKLIDESNPN -----------------1111---------------------------------1111-- FKHLTNAHENCQNLINSASTDEAAHFSYQECENILNLLLSYTRESSQKGTADCLNMYNFN ---------------------1111--3333--------------------------333 LKDSYPSCGMNWPKDISFVSKFFSTPGVIDSLHLDSDKIDHWKECTNSVGTKLSNPISKP 3-----iiii--------------2222-1111-1111---------------------3 SIHLLPGLLESGIEIVLFNGDKDLICNNKGVLDTIDNLKWGGIKGFSDDAVSFDWIHKSK 333------------------------------------iiii---1111---------1 STDDSEEFSGYVKYDRNLTFVSVYNASHMVPFDKSLVSRGIVDIYSNDVMIIDNNGKNVM 111-----------iiii----------3333--3333---------------iiii--- ITT --- >T-CELL RECEPTOR ALPHA; SWP:P06323; PDB:1AC6A; DSVTQTEGQVALSEEDFLTIHCNYSASGYPALFWYVQYPGEGPQFLFRASRDKEKGSSRG -------------------------------------2222---------2222---iii FEATYNKEATSFHLQKASVQESDSAVYYCALSGGNNKLTFGAGTKLTIKP i-----1111---------1111--------------------------- >ANTHRAX PROTECTIVE ANTIGE; SWP:P13423; PDB:1ACC; SSSQGLLGYYFSDLNFQAPMVVTSSTTGDLSIPSSELENIPSENQYFQSAIWSGFIKVKK 
--------------------------------333311113333---------------- SDEYTFATSADNHVTMWVDDQEVINSNKIRLEKGRLYQIKIQYQRENPTEKGLDFKLYWT ---------3333----iiii----------2222------------------------- DSQNKKEVISSDNLQLPELKQKSSVPDRDNDGIPDSLEVEGYTVDVKNKRTFLSPWISNI ---------3333-------------1111---------------------------333 HEKKGLTKYKSSPEKWSTASDPYSDFEKVTGRIDKNVSPEARHPLVAAYPIVHVDMENII 3----------1111-1111---33331111--1111-----1111-------------- LSKNETISKNTSTSRTHTSEVVSAGFSNSNSSTVAIDHSLSLAGGLNTADTARLNANIRY ------------------------------------------------------------ VNTGTAPIYNVLPTTSLVLGKNQTLATIKAKENQLSQILAPNNYYPSKNLAPIALNAQDD ------------------------------2222------------1111---------- FSSTPITMNYNQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQE ---------------------------------------------11113333------- TTARIIFNGKDLNLVERRIAAVNPSTTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEF ---------------------------------------------------iiii----- DFNFDQQTSQNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADES ----------------1111--33331111---2222-----1111--1111-------- VVKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQ ---1111-----1111-----33331111--------1111-------1111------11 DGKTFIDFKKYNDKLPLYISNPNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKK 11---------%%%%-----1111-------3333------------2222--------3 GYEIG 333-- >PROFILIN I; SWP:P68696; PDB:1ACF; SWQTYVDTNLVGTGAVTQAAILGLDGNTWATSAGFAVTPAQGTTLAGAFNNADAIRAGGF 3333-----3333---------1111-----2222----------3333----------- DLAGVHYVTLRADDRSIYGKKGSSGVITVKTSKAILVGVYNEKIQPGTAANVVEKLADYL --------------------!!!!----------------1111---------------- IGQGF 1111- >ASPARTATE CARBAMOYLTRANSF; SWP:P0A786; PDB:1ACMA; ANPLYQKHIISINDLSRDDLNLVLATAAKLKANPQPELLKHKVIASCFFEASTATRLSFQ -1111-----3333------------------------2222------------------ TSMHRLGASVVGFSDSANTSLGKKGETLADTISVISTYVDAIVMRHPQEGAARLATEFSG ---1111--------1111---------------1111---------2222--3333-!! 
NVPVLNAGDGSNQHPTQTLLDLFTIQQTEGRLDNLHVAMVGDLKYGRTVHSLTQALAKFD !!------!!!!-3333--------------------------------------1111- GNRFYFIAPDALAMPEYILDMLDEKGIAWSLHSSIEEVMAEVDILYMTRVQKERLDPSEY --------3333--3333----------------3333------------3333-33331 ANVKAQFVLRASDLHNAKANMKVLHPLPRVDEIATDVDKTPHAWYFQQAGNGIFARQALL 111------3333----1111------------3333--1111----------------- ALVLNRDLVL ---------- >NATURAL SCORPION PEPTIDE ; SWP:P56215; PDB:1ACW; VSCEDCPEHCSTQKAQAKCDNDKCVCEPI --------3333-------%%%%------ >ACTINOXANTHIN; SWP:P01551; PDB:1ACX; APAFSVSPASGASDGQSVSVSVAAAGETYYIAQCAPVGGQDACNPATATSFTTDASGAAS ------------------------------------------------------------ FSFTVRKSYAGQTPSGTPVGSVDCATDACNLGAGNSGLNLGHVALTFG ------------3333------1111--------3333---------- >HIV-1 GP120 (MN ISOLATE); SWP:GC1_MOUSE; PDB:1ACYL; DIVMTQSPASLVVSLGQRATISCRASESVDS -------------2222-------------- >FAB FRAGMENT, ANTIBODY A5; SWP:NA; PDB:1AD0A; QTVLTQSPSSLSVSVGDRVTITCRASSSVTYIHWYQQKPGLAPKSLIYATSNLASGVPSR -------------2222--------------------2222------------2222111 FSGSGSGTDYTFTISSLQPEDIATYYCQHWSSKPPTFGQGTKVEVKRTVAAPSVFIFPPS 1----!!!!--------3333--------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTL 3331111---------------------iiii---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC 3333------------1111--------2222- >FAB FRAGMENT, ANTIBODY A5; SWP:NA; PDB:1AD0B; EVQLLESGGGLVQPGGSLRLSCATSGFTFTDYYMNWVRQAPGKGLEWLGFIGNKA ------------2222-----------1111------------------------ >DIHYDROPTEROATE SYNTHETAS; SWP:O05701; PDB:1AD1A; TKTKIMGILNVTPDSFSDGGKFNNVESAVTRVKAMMDEGADIIDVGGVSTRPGHEMITVE -----------3333--iiii-----------------------------2222------ EELNRVLPVVEAIVGFDVKISVDTFRSEVAEACLKLGVDIINDQWAGLYDHRMFQVVAKY ------------1111-----------------1111-----11113333-------111 DAEIVLMHNGNGNRDEPVVEEMLTSLLAQAHQAKIAGIPSNKIWLDPGIGFAKTRNEEAE 1---------1111------------------------1111-----2222--------- VMARLDELVATEYPVLLATSRKRFTKEMMGYDTTPVERDEVTAATTAYGIMKGVRAVRVH 
------------------22223333-------3333------------1111------- NVELNAKLAKGIDFLKENENARHN -------------------1111- >RIBOSOMAL PROTEIN L1; SWP:P27150; PDB:1AD2; KRYRALLEKVDPNKIYTIDEAAHLVKELATAKFDETVEVHAKLGIDPRRSDQNVRGTVSL --3333----1111---------3333------------------1111----------- PHGLGKQVRVLAIAKGEKIKEAEEAGADYVGGEEIIQKILDGWMDFDAVVATPDVMGAVG ----3333------!!!!--------------------1111---------1111----- SKLGRILGPRGLLPNPKAGTVGFNIGEIIREIKAGRIEFRNDKTGAIHAPVGKACFPPEK -------1111---1111-----------------------1111-------1111---- LADNIRAFIRALEAHKPEGAKGTFLRSVYVTTTMGPSVRINPHS ------------1111---------------------------- >ALDEHYDE DEHYDROGENASE (C; SWP:P11883; PDB:1AD3A; SISDTVKRAREAFNSGKTRSLQFRIQQLEALQRMINENLKSISGALASDLGKNEWTSYYE ---------------1111---------------3333--------------3333---- EVAHVLEELDTTIKELPDWAEDEPVAKTRQTQQDDLYIHSEPLGVVLVIGAWNYPFNLTI ---------------------------3333------------------------1111- QPMVGAVAAGNAVILKPSEVSGHMADLLATLIPQYMDQNLYLVVKGGVPETTELLKERFD ------1111--------------------1111-------------------------- HIMYTGSTAVGKIVMAAAAKHLTPVTLELGGKSPCYVDKDCDLDVACRRIAWGKFMNSGQ ------3333-------3333--------------------3333------3333-iiii TCVAPDYILCDPSIQNQIVEKLKKSLKDFYGEDAKQSRDYGRIINDRHFQRVKGLIDNQK ----------3333------------------33331111-------------1111--- VAHGGTWDQSSRYIAPTILVDVDPQSPVMQEEIFGPVMPIVCVRSLEEAIQFINQREKPL --------1111----------11111111----------------------1111---- ALYVFSNNEKVIKKMIAETSSGGVTANDVIVHITVPTLPFGGVGNSGMGAYHGKKSFETF -------3333---3333----------------1111----!!!!------3333-111 SHRRSCLVKSLLNEEAHKARYPPSPA 1---------------3333------ >RETINOBLASTOMA TUMOR SUPP; SWP:P06400; PDB:1AD6; VMNTIQQLMMILNSASDQPSENLISYFNNCTVNPKESILKRVKDIGYIFKEKFAKAVGQG -----------1111-----------1111------------------------------ CVEIGSQRYKLGVRLYYRVMESMLKSEEERLSIQNFSKLLNDNIFHMSLLACALEVVMAT 3333-------------------------------3333--------------------- YSRSTSQNLDSGTDLSFPWILNVLNLKAFDFYKVIESFIKAEGNLTREMIKHLERCEHRI -----------------------------------------1111--------------- MESLA 1111- 
>FAB FRAGMENT CTM01; SWP:NA; PDB:1AD9H; EIQLVQSGAEVKKPGSSVKVSCKASGYTFTDYYINWMRQAPGQGLEWIGWIDPGSGNTKY ------------2222-----------3333----------------------------- NEKFKGRATLTVDTSTNTAYMELSSL 3333--------3333---------- >FAB FRAGMENT CTM01; SWP:NA; PDB:1AD9L; DIQMTQSPSTLSASVGDRVTITCRSSKSLLHS -------------2222--------------- >IGG4 REA; SWP:P01861; PDB:1ADQA; PSVFLFPPKPKDTLMISRTPEVTCVVVDVSQEDPQVQFNWYVDGVQVHNAKTKPREQQFN --------3333--3333------------------------------------------ STYRVVSVLTVLHQNWLDGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEE ---------------1111-------------------------------------3333 MTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRW -----------------------------------------1111--------------- QEGNVFSCSVMHEALHNHYTQKSLSL -----------1111----------- >Prolactin-binding protein; SWP:Q9QV16; PDB:1ADQH; EVQLVESGGGLVQPGRSLRLSCVTSGFTFDDYAMHWVRQSPGKGLEWVSGISWNTGTIIY ---------------------------3333----------------------------- ADSVKGRFIISRDNAKNSLYLQMNSL -------------1111--------- >IGL@ protein; SWP:Q8N355; PDB:1ADQL; YVLTQPPSVSVAPGQTARITCGGNNIGSKSVHWYQQKPGQAPVLVVYDDSDRPPGIPERF -------------------------1111-------2222----------------3333 SGSNSGNTATLTISRVEAGDEADYYCQVWDSSSDH ----!!!!--------3333---------3333-- >P22 C2 REPRESSOR; SWP:P69202; PDB:1ADR; MNTQLMGERIRARRKKLKIRQAALGKMVGVSNVAISQWERSETEPNGENLLALSKALQCS ------------------------------3333--------------------1111-3 PDYLLKGDLSQTNVAY 333------------- >ADENOVIRUS SINGLE-STRANDE; SWP:P03265; PDB:1ADT; PIVSAWEKGMEAARALMDKYHVDNDLKANFKLLPDQVEALAAVCKTWLNEEHRGLQLTFT -------------------------3333---1111------------------------ SNKTFVTMMGRFLQAYLQSFAEVTYKHHEPTGCALWLHRCAEIEGELKCLHGSIMINKEH ------------------------22223333----------2222--1111-------- VSNTDARCCVHDAACPANQFSGKSCGMFFSEGAKAQVAFKQIKAFMQALYPNAQTGHGHL --------1111---2222-1111------------------------------------ LMPLRCECNSFLGRQLPKLTPFALSNAEDLDADLISDKSVLASVHHPALIVFQCCNPNCD -----3333--------------1111--------------------------------- 
FKISAPDLLNALVMVRSLWSENFTELPRMVVPQFKWSTKHQYRNVSLPVAHSDARQNPFD -------------------------------------1111------------------- F - >PSEUDOAZURIN; SWP:P80401; PDB:1ADWA; ATHEVHMLNKGESGAMVFEPAFVRAEPGDVINFVPTDKSHNVEAIKEILPEGVESFKSKI ----------1111-----------2222---------------3333-2222-----22 NESYTLTVTEPGLYGVKCTPHFGMGMVGLVQVGDAPENLDAAKTAKMPKKARERMDAELA 22---------------33331111----------11113333---------------11 QVN 11- >TROPINONE REDUCTASE-I; SWP:P50162; PDB:1AE1A; RWSLKGTTALVTGGSKGIGYAIVEELAGLGARVYTCSRNEKELDECLEIWREKGLNVEGS ---2222-------------------------------------------1111------ VCDLLSRTERDKLMQTVAHVFDGKLNILVNNAGVVIHKEAKDFTEKDYNIIMGTNFEAAY --1111--------------%%%%--------------1111------------------ HLSQIAYPLLKASQNGNVIFLSSIAGFSALPSVSLYSASKGAINQMTKSLACEWAKDNIR ---------3333---------1111---2222--------------------3333--- VNSVAPGVILQKEEIDNFIVKTPMGRAGKPQEVSALIAFLCFPAASYITGQIIWADGGFT ---------------------3333---1111---------3333----------iiii- ANGGF ----- >ANTIBODY CTM01; SWP:NA; PDB:1AE6H; QIQLQQSGPELVKPGASVKISCKASGYTFTDYYINWMKQKPGQGLEWIGWIDPGSGNTKY ------------2222-----------3333----------------------------- NEKFKGKATLTVDTSSSTAYMQLSSL 1111---------------------- -------------------------------- >PHOSPHOLIPASE A2; SWP:P00608; PDB:1AE7; NLVQFSYLIQCANHGKRPTWHYMDYGCYCGAGGSGTPVDELDRCCKIHDDCYDEAGKKGC ------------iiii-3333-----------------3333------------------ FPKMSAYDYYCGENGPYCRNIKKKCLRFVCDCDVEAAFCFAKAPYNNANWNIDTKKRCQ -----------3333------------------------------3333---3333--- >LAMBDA INTEGRASE; SWP:P03700; PDB:1AE9A; RSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKT ---------------11113333--------------------3333-%%%%-------- GVKIAIPTALHIDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKAS ------1111--1111---------------------1111----------------333 GLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKFRDDRGREWDKIEI 3--------3333--------------------------------------- >ACTINIDIN; SWP:P00785; PDB:1AEC; LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 
-----3333-------------------------------------------------!! NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN !!!!!!--3333------------3333-------------------------------- EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV -------1111----------------------------------------iiii----- KNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPVKY ----1111-iiii-------!!!!%%%%---------- >APOLIPOPHORIN III; SWP:P10762; PDB:1AEP; NIAEAVQQLNHTIVNAAHELHETLGLPTPDEALNLLTEQANAFKTKIAEVTTSLKQEAEK ---------------------3333--------------------------------111 HQGSVAEQLNAFARNLNNSIHDAATSLNLQDQLNSLQSALTNVGHQWQDIATKTQASAQE 1-------------------------------------------------------1111 AWAPVQSALQEAAEKTKEAAANLQNSIQSAVQK -3333----------------------1111-- >APOPTOSIS REGULATOR BCL-X; SWP:P53563; PDB:1AF3; SQSNRELVVDFLSYKLSQKGYSWSQFSDVIPMAAVKQALREAGDEFELRYRRAFSDLTSQ ----------------1111--3333---1111--------------------------- LHITPGTAYQSFEQVVNELFRDGVNWGRIVAFFSFGGALCVESVDKEMQVLVSRIASWMA ---11113333----33331111--------------------11113333--------- TYLNDHLEPWIQENGGWDTFVDLYG ------------------------- >I-CREI; SWP:P05725; PDB:1AF5; KYNKEFLLYLAGFVDGDGSIIAQIKPNQSYKFKHQLSLTFQVTQKTQRRWFLGKLVDEIG -----3333-----------------------------------3333------------ VGYVRDRGSVSDYILSEIKPLHNFLTQLQPFLKLKQKQANLVLKIIEQLPLEVCTWVDQI ------!!!!-------------3333---------------------------3333-- AALNDS ------ >CHEMOTAXIS RECEPTOR METHY; SWP:P07801; PDB:1AF7; SVLLQMTQRLALSDAHFRRICQLIYQRAGIVLADHKRDMVYNRLVRRLRALGLDDFGRYL --------------------------------1111------------------------ SMLEANQNSAEWQAFINALTTNLTAFFREAHHFPILAEHARRRHGEYRVWSAAASTGEEP -----11113333--3333----------------------------------!!!!--- YSIAITLADALGMAPGRWKVFASDIDTEVLEKARSGIYRLSELKTLSPQQLQRYFMRGTG -------------2222---------------------33331111-------------- PHEGLVRVRQELANYVEFSSVNLLEKQYNVPGPFDAIFCRNVMIYFDKTTQEDILRRFVP ---------3333--------1111----------------3333------------333 LLKPDGLLFAGHSENFSNLVREFSLRGQTVYALS 3---------11113333-1111----------- >MERP; SWP:P04129; 
PDB:1AFI; ATQTVTLAVPGMTCAACPITVKKALSKVEGVSKVDVGFEKREAVVTFDDTKASVQKLTKA ---------------1111----------------------------3333-3333---- TADAGYPSSVKQ -3333------- >3-ALPHA-HYDROXYSTEROID DE; SWP:P23457; PDB:1AFSA; MDSISLRVALNDGNFIPVLGFGTTVPEKVAKDEVIKATKIAIDNGFRHFDSAYLYEVEEE -1111--------------------1111------------1111------3333-3333 VGQAIRSKIEDGTVKREDIFYTSKLWSTFHRPELVRTCLEKTLKSTQLDYVDLYIIHFPM --------1111--3333-------1111-1111-------------------------- ALQPGDIFFPRDEHGKLLFETVDICDTWEAMEKCKDAGLAKSIGVSNFNCRQLERILNKP -----------1111-------3333--------------------------------22 GLKYKPVCNQVECHLYLNQSKMLDYCKSKDIILVSYCTLGSSRDKTWVDQKSPVLLDDPV 22-----------1111---------1111------1111---3333-1111-1111--- LCAIAKKYKQTPALVALRYQLQRGVVPLIRSFNAKRIKELTQVFEFQLASEDMKALDGLN -----1111-----------1111---------------1111-----3333---1111- RNFRYNNAKYFDDHPNHPF -------1111--1111-- >ANTIBODY FAB25.3 FRAGMENT; SWP:NA; PDB:1AFVH; QVQLQQPGSVLVRPGASVKLSCKASGYTFTSSWIHWAKQRPGQGLEWIGEIHPNSGNTNY ---------------------------------------------------1111----- NEKFKGKATLTVDTSSSTAYVDLSSLTSEDSAVYYCARWRYGSPYYFDYWGQGTTLTVSS -1111-------3333----------3333------------------------------ AKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD -----------------------------------------iiii-2222---------- LYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPK -------------3333---------1111---------- >3-KETOACETYL-COA THIOLASE; SWP:P27796; PDB:1AFWA; KNSLLEKRPEDVVIVAANRSAIGKGFKGAFKDVNTDYLLYNFLNEFIGRFPEPLRADLNL 3333---1111----------------1111-------------------3333------ IEEVACGNVLNVGAGATEHRAACLASGIPYSTPFVALNRQCSSGLTAVNDIANKIKVGQI ----------2222--------------3333------!!!!------------------ DIGLALGVESMTNNYKNVNPLGMISSEELQKNREAKKCLIPMGITNENVAANFKISRKDQ ---------33333333-1111-----------3333----------------------- DEFAANSYQKAYKAKNEGLFEDEILPIKLPDGSICQSDEGPRPNVTAESLSSIRPAFIGT -----------------1111-------1111-------------3333----------- TTAGNASQVSDGVAGVLLARRSVANQLNLPVLGRYIDFQTVGVPPEIMGVGPAYAIPKVL 
-3333--------------------------------------33331111--------- EATGLQVQDIDIFEINEAFAAQALYCIHKLGIDLNKVNPRGGAIALGHPLGCTGARQVAT -----3333-----------------------1111-----3333----3333------- ILRELKKDQIGVVSMCIGTGMGAAAIFIKE 1111-2222--------------------- >PLASTOCYANIN; SWP:P00289; PDB:1AG6; VEVLLGGDDGSLAFLPGDFSVASGEEIVFKNNAGFPHNVVFDEDEIPSGVDAAKISMSEE ------1111-----------2222----------------1111-22223333---111 DLLNAPGETYKVTLTEKGTYKFYCSPHQGAGMVGKVTVN 1---2222---------------33331111-------- >CONOTOXIN GS; SWP:P15472; PDB:1AG7; ACSGRGSRCQCCMGLRCGRGNPQKCIGAHDV -----------2222---------------- >ALDEHYDE DEHYDROGENASE; SWP:P20000; PDB:1AG8A; VPTPNQQPEVLYNQIFINNEWHDAVSKKTFPTVNPSTGDVICHVAEGDKADVDRAVKAAR -----------------------1111--------------------------------- AAFQLGSPWRRMDASERGRLLNRLADLIERDRTYLAALETLDNGKPYIISYLVDLDMVLK ---22223333-3333-------------------------------------------- CLRYYAGWADKYHGKTIPIDGDYFSYTRHEPVGVCGQIIPWNFPLLMQAWKLGPALATGN ----1111-----------------------------------------------1111- VVVMKVAEQTPLTALYVANLIKEAGFPPGVVNVIPGFGPTAGAAIASHEDVDKVAFTGST ------3333----------------2222----------------------------33 EVGHLIQVAAGKSNLKRVTLEIGGKSPNIIMSDADMDWAVEQAHFALFFNQGQCCCAGSR 33----------------------------1111--------------%%%%-1111--- TFVQEDIYAEFVERSVARAKSRVVGNPFDSRTEQGPQVDETQFKKVLGYIKSGKEEGLKL ---3333------------------1111------------------------------- LCGGGAAADRGYFIQPTVFGDLQDGMTIAKEEIFGPVMQILKFKSMEEVVGRANNSKYGL ----------------------11111111------------------------------ AAAVFTKDLDKANYLSQALQAGTVWVNCYDVFGAQSPFGGYKLSGSGRELGEYGLQAYTE --------------------------------1111--------------33333333-- VKTVTVRVPQKNS ------------- >FLAVODOXIN; SWP:P23243; PDB:1AG9A; AITGIFFGSDTGNTENIAKMIQKQLGKDVADVHDIAKSSKEDLEAYDILLLGIPTWYYGE -------------------------1111----3333-3333------------------ AQCDWDDFFPTLEEIDFNGKLVALFGCGDQEDYAEYFCDALGTIRDIIEPRGATIVGHWP -------3333-----2222------------1111-3333-------1111-------- TAGYHFEASKGLADDDHFVGLAIDEDRQPELTAERVEKWVKQISEELHLDEILNA 
2222-----------------------1111------------------------ >OMEGA-AGATOXIN-IVB; SWP:P37045; PDB:1AGG; EDNCIAEDYGKCTWGGTKCCRGRPCRCSMIGTNCECTPRLIMEGLSFA ---------------------------3333----------------- >ANGIOGENIN; SWP:P10152; PDB:1AGI; AQDDYRYIHFLTQHYDAKPKGRNDEYCFNMMKNRRLTRPCKDRNTFIHGNKNDIKAICED -------------------------------------------------3333------- RNGQPYRGDLRISKSEFQITICKHKGGSSRPPCRYGATEDSRVIVVGCENGLPVHFDESF ------------------------------------------------iiii----1111 ITPRH ----- >EPIDERMOLYTIC TOXIN A; SWP:P09331; PDB:1AGJA; EVSAEEIKKHEEKWNKYYGVNAFNLPKELFSKVDEKDRQKYPYNTIGNVFVKGQTSATGV --------------------3333-3333----3333----1111-----2222------ LIGKNTVLTNRHIAKFANGDPSKVSFRPSINTDDNGNTETPYGEYEVKEILQEPFGAGVD ---------33333333--3333---------1111---1111---------1111---- LALIRLKPDQNGVSLGDKISPAKIGTSNDLKDGDKLELIGYPFDHKVNQMHRSEIELTTL --------1111-1111---------11112222-------3333--------------- SRGLRYYGFTVPGNSGSGIFNSNGELVGIHSSKVSHLDREHQINYGVGIGNYVKRIINEK ----------3333------1111-------------1111------------------- NE -- >GLIAL CELL-DERIVED NEUROT; SWP:Q07731; PDB:1AGQA; NRGCVLTAIHLNVTDLGLGYETKEELIFRYCSGSCEAAETMYDKILKNLSRSRVGQACCR 3333-------3333--------------------------------------------- PVAFDDDLSFLDDSLVYHILRKHSAKRCGCI ------------------------------- >Regulator of G-protein si; SWP:P49799; PDB:1AGRE; VSQEEVKKWAESLENLINHECGLAAFKAFLKSEYSEENIDFWISCEEYKKIKSPSKLSPK -3333-3333------------------3333--3333---------3333-1111---- AKKIYNEFISVQATKEVNLDSCTREETSRNMLEPTITCFDEAQKKIFNLMEKDSYRRFLK ---------1111----------------3333----------------------3333- SRFYLDLT ----1111 >AGITOXIN 2; SWP:P46111; PDB:1AGT; GVPINVSCTGSPQCIKPCKDAGMRFGKCMNRKCHCTPK ---------3333--3333---------2222------ >GLUTAMINASE-ASPARAGINASE; SWP:P10172; PDB:1AGX; KNNVVIVATGGTIAGAGASSTNSATYSAAKVPVDALIKAVPQVNDLANITGIQALQVASE -----------------------------------11113333--------------333 SITDKELLSLARQVNDLVKKPSVNGVVITHGTDTMEETAFFLNLVVHTDKPIVLVGSMRP 3------------------1111------------------------------------1 
STALSADGPLNLYSAVALASSNEAKNKGVMVLMNDSIFAARDVTKGINIHTHAFVSQWGA 111-----------------------------%%%%--3333-------------1111- LGTLVEGKPYWFRSSVKKHTNNSEFNIEKIQGDALPGVQIVYGSDNMMPDAYQAFAKAGV -------------------------3333--------------------------1111- KAIIHAGTGNGSMANYLVPEVRKLHDEQGLQIVRSSRVAQGFVLRNAEQPDDKYGWIAAH -------------3333--------------------------------3333------- DLNPQKARLLMALALTKTNDAKEIQNMFWNY ------------1111---3333-3333--- >ALDOSE REDUCTASE; SWP:P80276; PDB:1AH4; SHLVLYTGAKMPILGLGTWKSPPGKVTEAVKVAIDLGYRHIDCAHVYQNENEVGLGLQEK ----1111---------22221111--------1111------3333------------- LQGQVVKREDLFIVSKLWCTDHEKNLVKGACQTTLRDLKLDYLDLYLIHWPTGFKPGKDP ------3333-------1111-1111---------------------------------- FPLDGDGNVVPDESDFVETWEAMEELVDEGLVKAIGVSNFNHLQVEKILNKPGLKYKPAV ---1111-------------------------------------------2222------ NQIEVHPYLTQEKLIEYCKSKGIVVTAYSPLGSPDRPWAKPEDPSLLEDPRIKAIAAKYN -----1111---------1111------11111111---1111-3333------------ KTTAQVLIRFPMQRNLIVIPKSVTPERIAENFQVFDFELSPEDMNTLLSYNRNWRVCALM -----------1111--------------1111------3333---------------33 SCASHKDYPFHEEY 33--1111------ >PHOSPHOLIPASE C; SWP:P09598; PDB:1AH7; WSAEDKHKEGVNSHLWIVNRAIDIMSRNTTLVKQDRVAQLNEWRTELENGIYAADYENPY -------3333----------------------------------------3333----- YDNSTFASHFYDPDNGKTYIPFAKQAKETGAKYFKLAGESYKNKDMKQAFFYLGLSLHYL -%%%%1111----------2222-----------------1111---------------- GDVNQPMHAANFTNLSYPQGFHSKYENFVDTIKDNYKVTDGNGYWNWKGTNPEEWIHGAA ----3333----1111-2222------33333333---------------3333------ VVAKQDYSGIVNDNTKDWFVKAAVSQEYADKWRAEVTPMTGKRLMDAQRVTAGYIQLWFD -----3333--------------------------------------------------- TYGDR ----- >INITIATION FACTOR 1; SWP:P69222; PDB:1AH9; AKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYD ------------------------1111---------3333------------------1 LSKGRIVFRSR 111-------- >ANTHOPLEURIN-A; SWP:P01530; PDB:1AHL; GVSCLCDSDGPSVRGNTLSGTLWLYPSGCPSGWHNCKAHGPTIGWCCKQ -----1111---2222--------------------------------- >TOXIN II; 
SWP:P01484; PDB:1AHO; VKDGYIVDDVNCTYFCGRNAYCNEECTKLKGESGYCQWASPYGNACYCYKLPDHVRTKGP -------1111---------------1111---------1111--------1111----- GRCH ---- >AFRICAN HORSE SICKNESS VI; SWP:P36325; PDB:1AHSA; TGPYAGAVEVQQSGRYYVPQGRTRGGYINSNIAEVCMDAGAAGQVNALLAPRRGDAVMIY -1111------2222-----------------------------3333------------ FVWRPLRIFCDPQGASLESAPGTFVTVDGVNVAAGDVVAWNTIAPVNVGNPGARRSILQF ----------1111------------iiii--2222------------------------ EVLWYT ------ >IMMUNOGLOBULIN FAB 5G9; SWP:P13726; PDB:1AHWA; DIKMTQSPSSMYASLGERVTITCKASQDIRKYLNWYQQKPWKSPKTLIYYATSLADGVPS ----------------------------%%%%--------------------------11 RFSGSGSGQDYSLTISSLESDDTATYYCLQHGESPYTFGGGTKLEINRADAAPTVSIFPP 11---------------------------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ---------------------------------- >Tissue factor [Precursor]; SWP:P13726; PDB:1AHWB; EIQLQQSGAELVRPGALVKLSCKASGFNIKDYYMHWVKQRPEQGLEWIGLIDPENGNTIY ---------------------------1111----------------------------- DPKFQGKASITADTSSNTAYLQLSSLTSEDTAVYYCARDNSYYFDYWGQGTTLTVSSAKT 3333----------------------1111------------------------------ TPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYT -------------------------------------%%%%------------------- LSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKI -------1111-----------3333-------- >Tissue factor [Precursor]; SWP:P13726; PDB:1AHWC; TNTVAAYNLTWKSTNFKTILEWEPKPVNQVYTVQISTKSGDWKSKCFYTTDTECDLTDEI -------------%%%%--------------------------------------33333 VKDVKQTYLARVFSYPAGNEPLYENSPEFTPYLETNLGQPTIQSFEQVGTKVNVTVEDER 333--------------------------3333--------------------------- TLVRRNNTFLSLRDVFGKDLIYTLYYWKSSSSGKKTAKTNTNEFLIDVDKGENYCFSVQA ----------3333-!!!!----------------------------------------- VIPSRTVNRKSTDSPVECMG -3333--------------- >FAB59.1; SWP:GC1_MOUSE; PDB:1AI1H; QVKLQESGPAVIKPSQSLSLTCIVSGFSITRTNYCWHWIRQAPGKGLEWMGRICYEGSIY 
-----------------------------------------2222--------------- YSPSIKSRSTISRDTSLNKFFIQLISVTNEDTAMYYCSRENHMYETYFDVWGQGTTVTVS -3333----------------------3333---------3333---------------- SAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQS ------------------------------------------------------------ DLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR --------------------------1111----------- >FAB59.1; SWP:GC1_MOUSE; PDB:1AI1L; DIVMTQSPASLVVSLGQRATISCRASESVDSYGKSFMHWYQQKPGQPPKVLIYIASNLES -------------2222-------------iiii-------------------------- GVPARFSGSGSRTDFTLTIDPVEADDAATYYCQQNNEDPPTFGAGTKLEMRRADAAPTVS --1111----------------1111---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS -----3333--------------------------------------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR -----3333------------3333---------- >ANTI-IDIOTYPIC FAB 409.5.; SWP:NA; PDB:1AIFH; EVKLQESGGGLVQPGGSMKLSCVASGFTFNNYWMSWVRQSPEKGLEWVAEIRLNSDNFAT ---------------------------------------3333----------------- HYAESVKGKFIISRDDSKSRLYLQMNSLRAEDTGIYYCVLRPLFYYAVDYWGQGTSVTVS --------------3333------------------------------------------ SAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQS ------------------------------------------%%%%-------------- DLYTLSSSVTVPSTWRPSETVTCNVAHPASSTKVDKKI -------------------------------------- >ANTI-IDIOTYPIC FAB 409.5.; SWP:NA; PDB:1AIFL; DIQLTQSPAFMAASPGEKVTITCSVSSSISSSNLHWYQQKSETSPKPWIYGTSNLASGVP -------------2222------------------------------------------- VRFSGSGSGTSYSLTISSMEAEDAATYYCQQWNSYPYTFGGGTKLEIKRADAAPTVSIFP -------------------1111------------------------------------- PSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTL ---3333----------------------------------------------------- TLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC --33333333------------------------- >HP1 INTEGRASE; SWP:P21442; PDB:1AIHA; ETELAFLYERDIYRLLAECDNSRNPDLGLIVRICLATGARWSEAETLTQSQVMPYKITFT -----------------------1111------------333311111111--------- 
NTKSKKNRTVPISDELFDMLPKKRGRLFNDAYESFENAVLRAEIELPKGQLTHVLRHTFA ----------------3333----------3333------------22221111------ SHFMMNGGNILVLKEILGHSTIEMTMRYAHFAPSHLESAVKFNPLSNPAQ --------------------3333---3333-----3333---3333--- >NONSTRUCTURAL PROTEIN NS1; SWP:P03495; PDB:1AIL; MDSNTVSSFQVDCFLWHVRKQVVDQELGDAPFLDRLRRDQKSLRGRGSTLGLNIEAATHV -3333-----------------1111---------------------------------- GKQIVEKILK ---------- >ELONGATION FACTOR TU; SWP:P07157; PDB:1AIPA; KPHVNVGTIGHVDHGKTTLTAALTYVTAAENPTAHVEYETAKRHYSHVDCPGHADYIKNM ----------1111------------1111---------3333----------------- ITGAAQMDGAILVVSAADGPMPQTREHILLARQVGVPYIVVFMNKVDMVDDPELLDLVEM --------------3333--3333--------------------3333--3333------ EVRDLLNQYEFPGDEVPVIRGSALLALEQMHRNPKTRRGENEWVDKIWELLDAIDEYIPT --------------------------------1111------------------------ PVRDVDKPFLMPVEDVFTITGRGTVATGRIERGKVKVGDEVEIVGLAPETRRTVVTGVEM --3333-----------------------------------------------------% HRKTLQEGIAGDNVGVLLRGVSREEVERGQVLAKPGSITPHTKFEASVYVLKKEEGGRHT %%%-----2222---------1111--------2222--------------3333----- GFFSGYRPQFYFRTTDVTGVVQLPPGVEMVMPGDNVTFTVELIKPVALEEGLRFAIREGG --2222-----!!!!--------2222---2222--------------2222-------- RTVGAGVVTKILE ------------- >TATA-BINDING PROTEIN; SWP:P62001; PDB:1AISA; MVDMSKVKLRIENIVASVDLFAQLDLEKVLDLCPNSKYNPEEFPGIICHLDDPKVALLIF ---1111-----------------333333332222--3333-----------------1 SSGKLVVTGAKSVQDIERAVAKLAQKLKSIGVKFKRAPQIDVQNMVFSGDIGREFNLDVV 111--------3333------------1111----------------------------- ALTLPNCEYEPEQFPGVIYRVKEPKSVILLFSSGKIVCSGAKSEADAWEAVRKLLRELDK ---------1111-----------------3333----------------------1111 Y - >TATA-BINDING PROTEIN; SWP:P29095; PDB:1AISB; NLAFALSELDRITAQLKLPRHVEEEAARLYREAVRKGLIRGRSIESVMAACVYAACRLLK -------------1111---------------3333------3333-------------- VPRTLDEIADIARVDKKEIGRSYRFIARNLNLTPKKLFVKPTDYVNKFADELGLSEKVRR ---3333--1111-------------3333--3333---3333----------------- RAIEILDEAYKRGLTSGKSPAGLVAAALYIASLLEGEKRTQREVAEVARVTEVTVRNRYK 
---------1111-2222------------------------------------------ ELVEKLKIKVPIA ---1111------ >ENDOGLUCANASE Z; SWP:P07103; PDB:1AIW; MGDCANANVYPNWVSKDWAGGQPTHNEAGQSIVYKGNLYTANWYTASVPGSDSSWTQVGS --------------------------2222------------------------------ CN -- >DIHYDROPTEROATE SYNTHASE; SWP:P26282; PDB:1AJ2; MKLFAQGTSLDLSHPHVMGILNVTPDSFSDGGTHNSLIDAVKHANLMINAGATIIDVGGE ----iiii---------------1111--------------------1111--------- STRPGAAEVSVEEELQRVIPVVEAIAQRFEVWISVDTSKPEVIRESAKVGAHIINDIRSL ------------------3333----------------3333-------------1111- SEPGALEAAAETGLPVCLMHMQGNPKTMQEAPKYDDVFAEVNRYFIEQIARCEQAGIAKE -2222------------------1111------------------------------333 KLLLDPGFGFGKNLSHNYSLLARLAEFHHFNLPLLVGMSRKSMIGQLLNVGPSERLSGSL 3-----2222------------3333-1111------2222---------1111------ ACAVIAAMQGAHIIRVHDVKETVEAMRVVEATLSAKENKRYE ------1111-------------------------1111--- >TROPONIN C; SWP:P09860; PDB:1AJ4; ADIYKAAVEQLTEEQKNEFKAAFDIFVLGAEDGSISTKELGKVMRMLGQNPTPEELQEMI ---333311113333----------3333------3333------------3333----- DEVDEDGSGTVDFDEFLVMMVRSMKDDSKGKTEEELSDLFRMFDKNADGYIDLEELKIML -------------------------------------------3333----3333----- QATGETITEDDIEELMKDGDKNNDGRIDYDEFLEFMKGVE ---------------------------------------- >GYRASE; SWP:P0AES6; PDB:1AJ6; VLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDEALAGHCKEIIVTIHADNSVSV ----------3333-------------------------1111---------1111---- QDDGRGIPTGIVSAAEVIMTVLHAGGKFSGGLHGVGVSVVNALSQKLELVIQHEGKIHRQ ----------------------2222--------------------------%%%%---- IYEHGVPQAPLAVTGETEKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSGVSI --iiii---------------------------------3333-----------2222-- RLRDKRDGKEDHFH -------------- >IMMUNOGLOBULIN 48G7 GERML; SWP:GC1_HUMAN; PDB:1AJ7H; QVQLQQSGAELVKPGASVKLSCTASGFNIKDTYMHWVKQRPEQGLEWIGRIDPANGNTKY ------------2222-----------3333--------2222----------------- DPKFQGKATITADTSSNTAYLQLSSLTSEDTAVYYCASYYGIYWGQGTTLTVSSASTKGP -1111----------------------1111----------------------------- 
SVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLS -----------------------------------iiii--------------------- SVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC ------1111--------------------------- >IMMUNOGLOBULIN 48G7 GERML; SWP:GC1_HUMAN; PDB:1AJ7L; DIQMTQSPSSLSASLGERVSLTCRASQEISGYLSWLQQKPDGTIKRLIYAASTLDSGVPK -------------2222-----------iiii------1111------------222233 RFSGSRSGSDYSLTISSLESEDFADYYCLQYASYPRTFGGGTKVEIKRTVAAPSVFIFPP 33----!!!!--------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -----1111--------1111------------- >CITRATE SYNTHASE; SWP:Q53554; PDB:1AJ8A; LAKGLEDVYIDQTNICYIDGKEGKLYYRGYSVEELAELSTFEEVVYLLWWGKLPSLSELE -2222---------------------iiii------------------------------ NFKKELAKSRGLPKEVIEIMEALPKNTHPMGALRTIISYLGNIDDSGDIPVTPEEVYRIG ------------3333---11111111----------------1111------------- ISVTAKIPTIVANWYRIKNGLEYVPPKEKLSHAANFLYMLHGEEPPKEWEKAMDVALILY ----------------1111------3333------------------------------ AEHEINASTLAVMTVGSTLSDYYSAILAGIGALKGPIHGGAVEEAIKQFMEIGSPEKVEE ---------------1111---------------1111---------------3333--- WFFKALQQKRKIMGAGHRVYKTYDPRARIFKKYASKLGDKKLFEIAERLERLVEEYLSKK -----------2222------------------------------------------333 GISINVDYWSGLVFYGMKIPIELYTTIFAMGRIAGWTAHLAEYVSHNRIIRPRLQYVGEI 3---1111------1111-3333-------------------3333-------------- GKKYLPIELRR -----3333-- >LOW-DENSITY LIPOPROTEIN R; SWP:P01130; PDB:1AJJ; PCSAFEFHCLSGECIHSSWRCDGGPDCKDKSDEENCA --2222--1111---3333-------11111111--- >ASPARTATE AMINOTRANSFERAS; SWP:P00503; PDB:1AJSA; APPSVFAEVPQAQPVLVFKLIADFREDPDPRKVNLGVGAYRTDDCQPWVLPVVRKVEQRI ---1111-------3333-----1111-1111---------1111--------------- ANNSSLNHEYLPILGLAEFRTCASRLALGDDSPALQEKRVGGVQSLGGTGALRIGAEFLA --3333-----11113333---------1111---------------------------- RWYNGTNNKDTPVYVSSPTWENHNGVFTTAGFKDIRSYRYWDTEKRGLDLQGFLSDLENA 
------------------------------------------------------------ PEFSIFVLHACAHNPTGTDPTPEQWKQIASVMKRRFLFPFFDSAYQGFASGNLEKDAWAI 2222----------------------------------------2222---3333----- RYFVSEGFELFCAQSFSKNFGLYNERVGNLTVVAKEPDSILRVLSQMQKIVRVTWSNPPA -----------------11111111----------3333-----------1111------ QGARIVARTLSDPELFHEWTGNVKTMADRILSMRSELRARLEALKTPGTWNHITDQIGMF -----------------------------------------1111----3333------- SFTGLNPKQVEYLINQKHIYLLPSGRINMCGLTTKNLDYVATSIHEAVTK ---------------------3333--1111-3333-------------- >P1 NUCLEASE; SWP:P24289; PDB:1AK0; WGALGHATVAYVAQHYVSPEAASWAQGILGSSSSSYLASIASWADEYRLTSAGKWSASLH ------------3333--------------------3333----------1111-3333- FIDAEDNPPTNCNVDYERDCGSSGCSISAIANYTQRVSDSSLSSENHAEALRFLVHFIGD --------------3333--1111-----------1111---1111------------33 MTQPLHDEAYAVGGNKINVTFDGYHDNLHSDWDTYMPQKLIGGHALSDAESWAKTLVQNI 331111-----iiii-----iiii--------------------3333------------ ESGNYTAQAIGWIKGDNISEPITTATRWASDANALVCTVVMPHGAAALQTGDLYPTYYDS --1111-3333-22221111-----------------------3333------------- VIDTIELQIAKGGYRLANWINEIH ------------------------ >INOSINE-5'-MONOPHOSPHATE ; SWP:P50097; PDB:1AK5; AKYYNEPCHTFNEYLLIPGLSTVDCIPSNVNLSTPLVKFQKGQQSEINLKIPLVSAIMQS ---------1111--------11113333----------2222--------------111 VSGEKMAIALAREGGISFIFGSQSIESQAAMVHAVKNFKAHNELVDSQKRYLVGAGINTR 1--------------------------------------------1111----------- DFRERVPALVEAGADVLCIDSSDGFSEWQKITIGWIREKYGDKVKVGAGNIVDGEGFRYL 3333---------------------3333----------!!!!----------------- ADAGADFIKIGIGRGQATAVIDVVAERNKYFEETGIYIPVCSDGGIVYDYHMTLALAMGA -----------------------------------------------3333--------- DFIMLGRYFARFEESPTRKVTINGSVMKEYWGEGSSRGVDSYVPYAGKLKDNVEASLNKV -----3333--1111----------------1111------------------------- KSTMCNCGALTIPQLQSKAKITLVSSVSI ----1111----------------3333- >DESTRIN; SWP:DEST_HUMAN; PDB:1AK6; SASGVQVADEVCRIFYDMKVRKCSTPEEIKKRKKAVIFCLSADKKCIIVEEGKEILVGDV --------3333-1111-------1111------------2222------------3333 
GVTITDPFKHFVGMLPEKDCRYALYDASFETKESRKEELMFFLWAPELAPLKSKMIYASS -------33331111---------------3333-----------11113333------- KDAIKKKFQGIKHECQANGPEDLNRACIAEKLGGSLIVAFEGCPV ------------------1111--3333---1111---------- >BILE-SALT ACTIVATED LIPAS; SWP:P30122; PDB:1AKN; AKLGSVYTEGGFVEGVNKKLSLFGDSIDIFKGIPFAAAPKALEKPERHPGWQGTLKAKSF -------1111------------------------------------------------- KKRCLQATLTQDSTYGNEDCLYLNIWVPQGRKEVSHDLPVMIWIYGGAFLMGASQGANFL ----------------------------------------------%%%%---------- SNYLYDGEEIATRGNVIVVTFNYRVGPLGFLSTGDSNLPGNYGLWDQHMAIAWVKRNIEA -------------------------3333-----3333--3333------------3333 FGGDPDNITLFGESAGGASVSLQTLSPYNKGLIKRAISQSGVGLCPWAIQQDPLFWAKRI ---1111------------------3333------------1111-------3333---- AEKVGCPVDDTSKMAGCLKITDPRALTLAYKLPLGSTEYPKLHYLSFVPVIDGDFIPDDP -----------------1111-------------------------------------33 VNLYANAADVDYIAGTNDMDGHLFVGMDVPAINSNKQDVTEEDFYKLVSGLTVTKGLRGA 333333------------3333-3333------1111----------------------- NATYEVYTEPWAQDSSQETRKKTMVDLETDILFLIPTKIAVAQHKSHAKSANTYTYLFSQ ------------3333-------------------------------------------- PSRMPIYPKWMGADHADDLQYVFGKPFATPLGYRAQDRTVSKAMIAYWTNFARTGDPNTG ---11111111--2222---1111-----22223333------------------1111- HSTVPANWDPYTLEDDNYLEINKQMDSNSMKLHLRTNYLQFWTQTYQALPTVTSAGASLL -----------3333----------1111-----3333------3333----------33 PPEDNSQ 33-%%%% >EXONUCLEASE III; SWP:P09030; PDB:1AKO; MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKLGYNVFYHGQK ---------3333------------------------3333-----------------22 GHYGVALLTKETPIAVRRGFPGDDEEAQRRIIMAEIPSLLGNVTVINGYFPQGESRDHPI 22-----------------22223333----------1111-------------1111-- KFPAKAQFYQNLQNYLETELKRDNPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFL --------------------1111------------3333-------------------- PEEREWMDRLMSWGLVDTFRHANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAEC ----------------------------------1111-1111-----------3333-- CVETGIDYEIRSMEKPSDHAPVWATFRR -------3333----------------- >APOKEDARCIDIN; SWP:P41249; 
PDB:1AKP; ASAAVSVSPATGLADGATVTVSASGFATSTSATALQCAILADGRGACNVAEFHDFSLSGG -------------2222----------------------3333----------------- EGTTSVVVRRSFTGYVMPDGPEVGAVDCDTAPGGCEIVVGGNTGEYGNAAISFG ----------------3333-------3333----------------------- >ALPHA TRYPSIN; SWP:P00761; PDB:1AKSA; IVGGYTCAANSIPYQVSLNSGSHFCGGSLINSQWVVSAAHCYKSRIQVRLGEHNIDVLEG -----------1111---------------1111---1111------------1111--- NEQFINAAKIITHPNFNGNTLDNDIMLIKLSSPATLNSRVATVSLPRSCAAAGTECLISG ------------1111--------------------1111-------------------- WGNTK ----- >Trypsin [Precursor]; SWP:P00761; PDB:1AKSB; SSGSSYPSLLQCLKAPVLSNSSCKSSYPGQITGNMICVGFLQGGKDSCQGDSGGPVVCNG --------------------------2222-1111------------------------- QLQGIVSWGYGCAQKNKPGVYTKVCNYVNWIQQT ----------------------------3333-- >ADENYLATE KINASE; SWP:P07170; PDB:1AKY; ESIRMVLIGPPGAGKGTQAPNLQERFHAAHLATGDMLRSQIAKGTQLGLEAKKIMDQGGL ---------22223333-------------------------------------1111-- VSDDIMVNMIKDELTNNPACKNGFILDGFPRTIPQAEKLDQMLKEQGTPLEKAIELKVDD ------------------3333------------------------------------33 ELLVARITGRLIHPASGRSYHKIFNPPKEDMKDDVTGEALVQRSDDNADALKKRLAAYHA 33----1111------------------2222----------1111-------------- QTEPIVDFYKKTGIWAGVDASQPPATVWADILNKLGKN --------------------------------3333-- >SCAFFOLDING PROTEIN GPD; SWP:P03637; PDB:1AL01; EQSVRFQTALASIKLIQASAVLDLTEDDFDFLTSNKVWIATDRSRARRCVEACVYGTLDF -------------------------3333---------1111------------------ VGYPRFPAPVEFIAAVIAYYVHPVNIQTACLIMEGAEFTENIINGVERPVKAAELFAFTL --------3333------------------1111-------------------------- RVRAGNTDVLTDAEENVRQKLRA -3333-----11113333----- >Scaffolding protein B; SWP:P03633; PDB:1AL0B; MEQLTKNQGATCDDKSAQIYARFDKNDWRIQPAEFYRFHDAEVNTFGYF --------------3333-----3333---3333--------------- >CYS REGULON TRANSCRIPTION; SWP:P45600; PDB:1AL3; TWPDKGSLYVATTHTQARYALPGVIKGFIERYPRVSLHMHQGSPTQIAEAVSKGNADFAI -------------------------------1111------------------------- ATEALHLYDDLVMLPCYHWNRSIVVTPEHPLATKGSVSIEELAQYPLVTYTFGFTGRSEL 
-----1111----------------1111-1111---33331111-----2222-3333- DTAFNRAGLTPRIVFTATDADVIKTYVRLGLGVGVIASMAVDPVSDPDLVKLDANGIFSH ----1111----------------------------1111-----1111----------- STTKIGFRRSTFLRSYMYDFIQRFAPHLTRDVVDTAVALRSNEDIEAMFKDIKLPEK -------1111-------------3333------------------1111------- >GLYCOLATE OXIDASE; SWP:P05414; PDB:1AL7; MEITNVNEYEAIAKQKLPKMVYDYYASGAEDQWTLAENRNAFSRILFRPRILIDVTNIDM ----3333--------------------!!!!-------3333----------------- TTTILGFKISMPIMIAPTAMQKMAHPEGEYATARAASAAGTIMTLSSWATSSVEEVASTG ---iiii-------------33333333-----------------1111------3333- PGIRFFQLYVYKDRNVVAQLVRRAERAGFKAIALTVDTPRLGRREADIKNRFVLPPFLTL -------------------------------------------3333-------1111-3 KNFEGIDLGLSSYVAGQIDRSLSWKDVAWLQTITSLPILVKGVITAEDARLAVQHGAAGI 333-----3333--1111----3333---1111--------------------------- IVSNHGARQLDYVPATIMALEEVVKAAQGRIPVFLDGGVRRGTDVFKALALGAAGVFIGR ---%%%%-------3333--------%%%%----------3333----1111------33 PVVFSLAAEGEAGVKKVLQMMRDEFELTMALSGCRSLKEISRSHIAADWD 33---------------------------------3333-1111--1111 >ALPHA-LACTALBUMIN; SWP:P12065; PDB:1ALC; KQFTKCELSQNLYDIDGYGRIALPELICTMFHTSGYDTQAIVENDESTEYGLFQISNALW -----------1111-2222-------------%%%%-------------1111------ CKSSQSPQSRNICDITCDKFLDDDITDDIMCAKKILDIKGIDYWIAHKALCTEKLEQWLC --3333----1111-3333--------------------11113333-------3333-- EK -- >ALLOPHYCOCYANIN; SWP:P72504; PDB:1ALLA; SIVTKSIVNADAEARYLSPGELDRIKSFVTSGERRVRIAETMTGARERIIKQAGDQLFGK 3333------1111---------------------------------------------- RPDVVSPGGNAYGADMTATCLRDLDYYLRLITYGIVAGDVTPIEEIGVVGVREMYKSLGT 3333-2222------------------------------3333----2222--------- PIEAIAEGVRAMKSVATSLLSGADAAEAGSYFDYLIGAMS 3333------------1111---------------1111- >Allophycocyanin beta chai; SWP:P72505; PDB:1ALLB; MQDAITSVINSSDVQGKYLDASAIQKLKAYFATGELRVRAATTISANAANIVKEAVAKSL ------------1111-------------------------------------------- LYSDVTRPGGNMYTTRRYAACIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLG 
--33332222------------------------------3333----2222-------- VPIGATVQAIQAMKEVTAGLVGGGAGKEMGIYFDYICSGLS -----------------------3333----------1111 >INTERLEUKIN-6; SWP:P05231; PDB:1ALU; LTSSERIDKQIRYILDGISALRKETCNKSNMCENLNLPKMAEKDGCFQSGFNEETCLVKI ----------------------------------------3333---2222--------- ITGLLEFEVYLEYLQNRFESSEEQARAVQMSTKVLIQFLQKKAKNLDAITTPDPTTNASL ------------------------------------------------------------ LTKLQAQNQWLQDMTTHLILRSFKEFLQSSLRALRQM ---1111-------------------------3333- >CALPAIN; SWP:P04574; PDB:1ALVA; EEVRQFRRLFAQLAGDDMEVSATELMNILNKVVTRHPDLKTDGFGIDTCRSMVAVMDSDT 1111----------1111------------3333----------------------1111 TGKLGFEEFKYLWNNIKKWQAIYKQFDVDRSGTIGSSELPGAFEAAGFHLNEHLYSMIIR --------------------------1111----3333-----1111------------- RYSDEGGNMDFDNFISCLVRLDAMFRAFKSLDKDGTGQIQVNIQEWLQLTMYS ----------------------------1111--------------------- >CD40 LIGAND; SWP:P29965; PDB:1ALY; GDQNPQIAAHVISEASSKTTSVLQWAEKGYYTMSNNLVTLENGKQLTVKRQGLYYIYAQV ----------------------------------1111---------------------- TFCSNREASSQAPFIASLCLKSPGRFERILLRAANTHSSAKPCGQQSIHLGGVFELQPGA ---------------------2222-------------------------------2222 SVFVNVTDPSQVSHGTGFTSFGLLKL -------3333--------------- >MXE GYRA INTEIN; SWP:P72065; PDB:1AM2; ASITGDALVALPEGESVRIADIVPGARPNSDNAIDLKVLDRHGNPVLADRLFHSGEHPVY ---1111----iiii---11112222-------------1111----------------- AVRTVEGLRVTGTANHPLLCLVDVAGVPTLLWKLIDEIKPGDYAVIQRSAFSTVGVPGLV ---1111-----1111-------iiii------3333-2222-------------2222- RFLEAHHRDPDAKAIADELTDGRFYYAKVASVTDAGVQPVYSLRVDTADHAFITNGFVSH --------1111--------1111----------------------3333---iiii--- N - >PEPSIN; SWP:P56272; PDB:1AM5; RVTEQMKNEADTEYYGVISIGTPPESFKVIFDTGSSNLWVSSSHCSAQACSNHNKFKPRQ --------%%%%----------------------------------3333------1111 SSTYVETGKTVDLTYGTGGMRGILGQDTVSVGGGSDPNQELGESQTEPGPFQAAAPFDGI --------------------------------------------------3333------ LGLAYPSIAAAGAVPVFDNMGSQSLVEKDLFSFYLSGGGANGSEVMLGGVDNSHYTGSIH 
----33332222--3333------------------%%%%----------1111------ WIPVTAEKYWQVALDGITVNGQTAACEGCQAIVDTGTSKIVAPVSALANIMKDIGASENQ ------------------iiii-----------3333-----3333---3333------- GEMMGNCASVQSLPDITFTINGVKQPLPPSAYIEGDQAFCTSGLGSSGVPSNTSELWIFG -------------------iiii----3333----------------------------- DVFLRNYYTIYDRTNNKVGFAPAA --3333------------------ >LYSOZYME; SWP:P03706; PDB:1AM7A; MVEINNQRKAFLDMLASEGTDNGRQKTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKS ----------------------------iiii--2222----------------1111-- TGAGRYQLLSRDAYRKQLGLKDFSPKSQDAVALQQIKERGALPMIDRGDIRQAIDRCSNI ---1111-----------------------------1111----------------1111 ASLPGAGYGQFEHKADSLIAKFKEAGGTVR ----1111---------------------- >STEROL REGULATORY ELEMENT; SWP:P36956; PDB:1AM9A; QSRGEKRTAHNAIEKRYRSSINDKIIELKDLVVGTEAKLNKSAVLRKAIDYIRFLQHSNQ ---------------------------------3333----------------------- KLKQENLSLRTAVHKSKSLK -------------------- >MOLYBDATE TRANSPORT PROTE; SWP:P37329; PDB:1AMF; GKITVFAAASLTNAMQDIATQFKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKW -------3333------------------------------------------------- MDYAVDKKAIDTATRQTLLGNSLVVVAPKASVQKDFTIDSKTNWTSLLNGGRLAVGDPEH ----------3333-------------1111----------------iiii--------- VPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKV ----------------33331111----3333----1111-------33333333----- VATFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTIK ----1111----------2222---------------------1111---- >ANIONIC TRYPSIN; SWP:P00763; PDB:1AMHA; IVGGYTCQENSVPYQVSLNSGYHFCGGSLINDQWVVSAAHCYKSRIQVRLGEHNINVLEG -----------1111----------------------1111------------1111--- NEQFVNAAKIIKHPNFDRKTLNNDIMLIKLSSPVKLNARVATVALPSSCAPAGTQCLISG ------------1111--------------------3333-------------------- WGNTLSSGVNEPDLLQCLDAPLLPQADCEASYPGKITDNMVCVGFLEGGKSSCQGDSGGP -----------------------3333----2222-1111---------------2222- VVCNGELQGIVSWGYGCALPDNPGVYTKVCNYVDWIQDTIAAN --%%%%---------------------3333------------ >GAMMA B-CRYSTALLIN; SWP:P02526; PDB:1AMM; 
GKITFYEDRGFQGHCYECSSDCPNLQPYFSRCNSIRVDSGCWMLYERPNYQGHQYFLRRG --------%%%%------------3333-------------------%%%%--------- DYPDYQQWMGFNDSIRSCRLIPQHTGTFRMRIYERDDFRGQMSEITDDCPSLQDRFHLTE ---3333----------------------------%%%%-----------3333------ VHSLNVLEGSWVLYEMPSYRGRQYLLRPGEYRRYLDWGAMNAKVGSLRRVMDFY ----------------%%%%------------3333------------------ >GRAMICIDIN SYNTHETASE 1; SWP:P14687; PDB:1AMUA; GTHEEEQYLFAVNNTKAEYPRDKTIHQLFEEQVSKRPNNVAIVCENEQLTYHELNVKANQ ---------3333------1111------------1111----!!!!------------- LARIFIEKGIGKDTLVGIMMEKSIDLFIGILAVLKAGGAYVPIDIEYPKERIQYILDDSQ -----1111-2222--------3333-----------------11113333-----3333 ARMLLTQKHLVHLIHNIQFNGQVEIFEEDTIKIREGTNLHVPSKSTDLAYVIYTSPKGTM ------1111-3333-------------3333-----------1111------------- LEHKGISNLKVFFENSLNVTEKDRIGQFASISFDASVWEMFMALLTGASLYIILKDTIND -------------------3333------1111---------1111-------3333--- FVKFEQYINQKEITVITLPPTYVVHLDPERILSIQTLITAGSATSPSLVNKWKEKVTYIN --------1111------333311111111---------------------1111----- AYGPTETTICATTWVATKETIGHSVPIGAPIQNTQIYIVDENLQLKSVGEAGELCIGGEG ---3333-----------------------2222-----1111---2222-------111 LARGYWKRPELTSQKFVDNPFVPGEKLYKTGDQARWLSDGNIEYLGRIDNQVKIRGHRVE 1--------------------2222---------------------3333---iiii--3 LEEVESILLKHMYISETAVSVHKDHQEQPYLCAYFVSEKHIPLEQLRQFSSEELPTYMIP 333-------1111---------1111--------------3333---------3333-- SYFIQLDKMPLTSNGKIDRKQLPEPDLTF -----------1111--1111-------- >1,4-ALPHA-D-GLUCAN GLUCAN; SWP:P04063; PDB:1AMY; QVLFQGFNWESWKHNGGWYNFLMGKVDDIAAAGITHVWLPPASQSVAEQGYMPGRLYDLD -------1111--2222---3333----------------------1111----111111 ASKYGNKAQLKSLIGALHGKGVKAIADIVINHRTAEHKDGRGIYCIFEGGTPDARLDWGP 11---------------------------------------------------2222-33 HMICRDDRPYADGTGNPDTGADFGAAPDIDHLNLRVQKELVEWLNWLKADIGFDGWRFDF 33---------------------------------------------------------3 AKGYSADVAKIYIDRSEPSFAVAEIWTSLAYGGDGKPNLNQDQHRQELVNWVDKVGGKGP 333----------1111--------------------------------------1111- 
ATTFDFTTKGILNVAVEGELWRLRGTDGKAPGMIGWWPAKAVTFVDNHDTGSTQHMWPFP -------------3333-------1111---3333-3333-------------------1 SDRVMQGYAYILTHPGTPCIFYDHFFDWGLKEEIDRLVSVRTRHGIHNESKLQIIEADAD 111-------1111---------------------------1111------------111 LYLAEIDGKVIVKLGPRYDVGNLIPGGFKVAAHGNDYAVWEKI 1----iiii-----------1111------------------- >USF; SWP:P22415; PDB:1AN4A; MDEKRRAQHNEVERRRRDKINNWIVQLSKIIPDSSMESTKSGQSKGGILSKASDYIQELR ------------------33333333-1111-----------------------3333-- QSNHR ----- >STREPTOCOCCAL PYROGENIC E; SWP:Q8NKX2; PDB:1AN8; KKDISNVKSDLLYAYTITPYDYKDCRVNFSTTHTLNIDTQKYRGKDYYISSEMSYEASQK --3333------1111----------------------3333-2222-------3333-- FKRDDHVDVFGLFYILNSHTGEYIYGGITPAQNNKVNHKLLGNLFISGESQQNLNNKIIL -2222------------!!!!------------------------2222----2222--- EKDIVTFQEIDFKIRKYLMDNYKIYDATSPYVSGRIEIGTKDGKHEQIDLFDSPNEGTRS ----------------------1111-------------1111----------!!!!333 DIFAKYKDNRIINMKNFSHFDIYLEK 3-3333------3333---------- >MALTODEXTRIN-BINDING PROT; SWP:P02928; PDB:1ANF; KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDII -----------1111-----------3333-----------3333-----1111------ FWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD --3333----1111-------3333----33333333-iiii------------------ LLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKD -------3333-------3333----------3333-----1111-----------1111 VGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKV ------------------1111--1111--------1111-------3333----1111- NYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLG ---------iiii-------------1111----------------------3333---- AVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEA ---3333----------------1111----------------------1111------- LKDAQTRITK ---------- >ANNEXIN V; SWP:P08758; PDB:1ANXA; QVLRGTVTDFPGFDERADAETLRKAMKGLGTDEESILTLLTSRSNAQRQEISAAFKTLFG -------------------------------3333---1111------------------ RDLLDDLKSELTGKFEKLIVALMKPSRLYDAYELKHALKGAGTNEKVLTEIIASRTPEEL 
--------------------33333333---------------3333------------- RAIKQVYEEEYGSSLEDDVVGDTSGYYQRMLVVLLQANRDPDAGIDEAQVEQDAQALFQA -------------------1111------------------------------------- GELKWGTDEEKFITIFGTRSVSHLRKVFDKYMTISGFQIEETIDRETSGNLEQLLLAVVK 1111---------------------------------3333------------------- SIRSIPAYLAETLYYAMKGAGTDDHTLIRVMVSRSEIDLFNIRKEFRKNFATSLYSMIKG -------------3333--------------1111------------------------- DTSGDYKKALLLLCGE ---------------- >GLUTAMINE PHOSPHORIBOSYLP; SWP:P00497; PDB:1AO0A; CGVFGIWGHEEAPQITYYGLHSLQHRGQEGAGIVATDGEKLTAHKGQGLITEVFQNGELS ----------------------3333----------------------3333-----333 KVKGKGAIGHVRYATGYENVQPLLFRSQNNGSLALAHNGNLVNATQLKQQLENQGSIFQT 3--------------3333---------------------1111-------1111----- SSDTEVLAHLIKRSGHFTLKDQIKNSLSMLKGAYAFLIMTETEMIVALDPNGLRPLSIGM -3333---------------------1111---------1111----------------- MGDAYVVASETCAFDVVGATYLREVEPGEMLIINDEGMKSERFSMNINRSICSMEYIYFS !!!!-----3333-1111-------2222-----------------------3333---- RPDSNIDGINVHSARKNLGKMLAQESAVEADVVTGVPDSSISAAIGYAEATGIPYELGLI 1111-iiii--------------------------------------------------- KNRYVGRTFIQPSQALREQGVRMKLSAVRGVVEGKRVVMVDDSIVRGTTSRRIVTMLREA -----3333------------------33332222------------------------- GATEVHVKISSPPIAHPCFYGIDTSTHEELIASSHSVEEIRQEIGADTLSFLSVEGLLKG ---------------------3333----3333--3333--------------------- IGRKYDDSNCGQCLACFTGKYPTEIYQDTVLPHVK --------%%%%-------------1111-3333- >GLANDULAR KALLIKREIN-13; SWP:P36368; PDB:1AO5A; VVGGFNCEKNSQPWQVAVYYQKEHICGGVLLDRNWVLTAAHCYVDQYEVWLGKNKLFQEE -------22221111-----------------------1111-----------------3 PSAQHRLVSKSFPHPGFNMSLLMLQTIPP 333----------1111------------ >HLA-A 0201; SWP:NA; PDB:1AO7D; KEVEQNSGPLSVPEGAIASLNCTYSDRGSQSFFWYRQYSGKSPELIMSIYSNGDKEDGRF -------------------------1111--------2222------------------- TAQLNKASQYVSLLIRDSQPSDSATYLCAVTTDSWGKLQFGAGTQVVVTPDIQNP -----1111-----------------------1111------------------- >HLA-A 0201; SWP:NA; PDB:1AO7E; 
GVTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPN -----------------------------------1111---------2222-------- GYNVSRSTTEDFPLRLLSAAPSQTSVYFCASRPGLAGGRPEQYFGPGTRLTVTEDLKNVF -------------------3333------------------------------------- PPEVAVFLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLYALSSRLRVSATFWQNPR -------------------------iiii--1111----------------3333--111 NHFRCQVQFYGLAKPVTQIVSAEAWGRAD 1---------------------------- >T-FIMBRIN; SWP:P13797; PDB:1AOA; YSEEEKYAFVNWINKALENDPDCRHVIPMNPNTDDLFKAVGDGIVLCKMINLSVPDTIDE -3333-----------1111--1111---111133333333-------3333-2222-33 RAINKKKLTPFIIQENLNLALNSASAIGCHVVNIGAEDLRAGKPHLVLGLLWQIIKIGLF 33----------------------1111--111133331111-3333------------- ADIELSRNEALTLEELMKLSPEELLLRWANFHLENSGWQKINNFSADIKDSKAYFHLLNQ 3333-------3333------------------1111-------3333------------ IAPKGQKEGEPRIDINMSGFNETDDLKRAESMLQQADKLGCRQFVTPADVVSGNPKLNLA ------------------1111------------3333-------33331111------- FVANLFN ------- >COAGULOGEN; SWP:P02681; PDB:1AOCA; ADTNAPICLCDEPGVLGRTQIVTTEIKDKIEKAVEAVAQESGVSGRGFSIFSHHPVFREC -------2222-----------3333----------------------1111-------- GKYECRTVRPEHSRCYNFPPFTHFKSECPVSTRDCEPVFGYTVAGEFRVIVQAPRAGFRQ ---3333-3333-3333------------------------1111--------3333--- CVWQHKCRFGSNSCGYNGRCTQQRSVVRLVTYNLEKDGFLCESFRTCCGCPCRSF ------------------------------------------------------- >DIHYDROFOLATE REDUCTASE; SWP:P22906; PDB:1AOEA; MLKPNVAIIVAALKPALGIGYKGKMPWRLRKEIRYFKDVTTRTTKPNTRNAVIMGRKTWE --------------------iiii----------------------------------11 SIPQKFRPLPDRLNIILSRSYENEIIDDNIIHASSIESSLNLVSDVERVFIIGGAEIYNE 113333-----------1111-----1111----33333333-----------------3 LINNSLVSHLLITEIEHPSPESIEMDTFLKFPLESWTKQPKSELQKFVGDTVLEDDIKEG 3333333-----------3333---------3333----3333----!!!!-------!! 
DFTYNYTLWTRK !!---------- >TRYPANOTHIONE REDUCTASE; SWP:P28593; PDB:1AOGA; SKIFDLVVIGAGSGGLEAAWNAATLYKKRVAVIDVQMVHGPPFFSALGGTCVNVGCVPKK -----------3333----------------------------------3333------- LMVTGAQYMEHLRESAGFGWEFDRTTLRAEWKNLIAVKDEAVLNINKSYDEMFRDTEGLE -------------3333-----1111---3333--------------------------- FFLGWGSLESKNVVNVRESADPASAVKERLETEHILLASGSWPHMPNIPGIEHCISSNEA --------------------1111-----------------------2222----3333- FYLPEPPRRVLTVGGGFISVEFAGIFNAYKPKDGQVTLCYRGEMILRGFDHTLREELTKQ ---------------------------------------------2222----------- LTANGIQILTKENPAKVELNADGSKSVTFESGKKMDFDLVMMAIGRSPRTKDLQLQNAGV -1111--------------1111-----1111-----------------33333333--- MIKNGGVQVDEYSRTNVSNIYAIGDVTNRVMLTPVAINEAAALVDTVFGTTPRKTDHTRV ---------1111---2222----1111-------------------------------- ASAVFSIPPIGTCGLIEEVASKRYEVVAVYLSSFTPLMHKVSGSKYKTFVAKIITNHSDG ---------------33331111------------3333----1111------------- TVLGVHLLGDNAPEIIQGIGICLKLNAKISDFYNTIGVHPTSAEELCSMRTPSYYYVKGE --------2222----------1111-33331111------3333-----------iiii KMEKP ----- >CELLULOSOME-INTEGRATING P; SWP:Q06851; PDB:1AOHA; AVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGELIVDPN -----------2222-----------1111----------1111--------3333---3 PTKSFDTAVYPDRKMIVFLFAEDSGTGAYAITEDGVFATIVAKVKSGAPNGLSVIKFVEV 333-------1111-------3333-1111--------------1111------------ GGFANNDLVEQKTQFFDGGVNVG ----1111--------------- >GP70; SWP:P03390; PDB:1AOL; QVYNITWEVTNGDRETVWAISGNHPLWTWWPVLTPDLCMLALSGPPHWGLEYQAPYSSPP ----------1111----------2222-------3333-22223333-----2222--- GPPCCSGSSGSSAGCSRDCDEPLTSLTPRCNTAWNRLKLDQVTHKSSEGFYVCPGSHRPR ---1111----2222-3333--1111------------------1111------1111-- EAKSCGGPDSFYCASWGCETTGRVYWKPSSSWDYITVDNNLTTSQAVQVCKDNKWCNPLA ------3333----2222-----1111--------------3333----1111------- IQFTNAGKQVTSWTTGHYWGLRLYVSGRDPGLTFGIRLRYQNLGPRVP ----3333---3333---------%%%%---------------1111- ---------------------------------------- >SULFITE REDUCTASE HEMOPRO; SWP:P17846; PDB:1AOP; 
LLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILPVHQMLHSVGLDAL -----2222-----------------3333----------------------1111---- NDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPTYLPRKFKTTVVIPPQNDIDLHAND --------------3333---------------------------------11111111- MNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQR -------%%%%--------------2222--------------3333------------- DWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDD ------1111-3333------------------------------------------!!! NWHLTLFIENGRILDYPARPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKI !------2222----2222-------------------1111-------3333------- AKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVM ----------3333-----------1111---3333------------11111111---- RVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDEL ----3333--1111--------2222-------1111----------------------- IGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD --------2222------1111------3333---- >ALDEHYDE FERREDOXIN OXIDO; SWP:Q51739; PDB:1AORA; MYGNWGRFIRVNLSTGDIKVEEYDEELAKKWLGSRGLAIYLLLKEMDPTVDPLSPENKLI ----------------------------------------------11111111------ IAAGPLTGTSAPTGGRYNVVTKSPLTGFITMANSGGYFGAELKFAGYDAIVVEGKAEKPV ---1111---2222----------------------------1111-------------- YIYIKDEHIEIRDASHIWGKKVSETEATIRKEVGSEKVKIASIGPAGENLVKFAAIMNDG ----!!!!--------2222--------------------------1111---------- HRAAGRGGVGAVMGSKNLKAIAVEGSKTVPIADKQKFMLVVREKVNKLRNDPVAGGGLPK -------3333-----------------------------------------1111---- YGTAVLVNIINENGLYPVKNFQTGVYPYAYEQSGEAMAAKYLVRNKPCYACPIGCGRVNR ------------------%%%%---1111------------------2222--------- LPTVGETEGPEYESVWALGANLGINDLASIIEANHMCDELGLDTISTGGTLATAMELYEK -------------------1111------------------------------------- GHIKDEELGDAPPFRWGNTEVLHYYIEKIAKREGFGDKLAEGSYRLAESYGHPELSMTVK ---3333!!!!---22223333-------------------------11113333---ii KLELPAYDPRGAEGHGLGYATNNRGGCHIKNYMISPEILGYPYKMDPHDVSDDKIKMLIL ii-----3333----------1111--1111--------------1111----------- 
FQDLTALIDSAGLCLFTTFGLGADDYRDLLNAALGWDFTTEDYLKIGERIWNAERLFNLK ---------------------3333---------------------------------11 AGLDPARDDTLPKRFLEEPMPEGPNKGHTVRLKEMLPRYYKLRGWTEDGKIPKEKLEELG 11-3333----3333-------1111----3333------3333-1111--3333----- IAEFY 3333- >Proto-oncogene tyrosine-p; SWP:P06241; PDB:1AOTF; SIQAEEWYFGKLGRKDAERQLLSFGNPRGTFLIRESETTKGAYSLSIRDWDDMKGDHVKH ------------------------------------------------------------ YKIRKLDNGGYYITTRAQFETLQQLVQHYSERAAGLSSRLVVPSHK -----1111----3333----------3333--------------- >ARGININE REPRESSOR; SWP:P0A6D0; PDB:1AOY; MRSSAKQEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRN --------3333-----3333--------------------------------------1 AKMEMVYCLPAELGVPTT 111--------------- >ASCORBATE OXIDASE; SWP:P37064; PDB:1AOZA; SQIRHYKWEVEYMFWAPNCNENIVMGINGQFPGPTIRANAGDSVVVELTNKLHTEGVVIH --------------------------iiii------------------------------ WHGILQRGTPWADGTASISQCAINPGETFFYNFTVDNPGTFFYHGHLGMQRSAGLYGSLI 2222-22221111-2222-----2222---------------------3333-------- VDPPQGKKEPFHYDGEINLLLSDWWHQSIHKQEVGLSSKPIRWIGEPQTILLNGRGQFDC ---------------------------3333--------------------iiii----- SIAAKYDSNLEPCKLKGSESCAPYIFHVSPKKTYRIRIASTTALAALNFAIGNHQLLVVE --11111111--------1111----------------------------2222------ ADGNYVQPFYTSDIDIYSGESYSVLITTDQNPSENYWVSVGTRARHPNTPPGLTLLNYLP iiii------------2222----------1111-------------------------- NSVSKLPTSPPPQTPAWDDFDRSKNFTYRITAAMGSPKPPVKFNRRIFLLNTQNVINGYV -1111----------1111-------------2222------------------------ KWAINDVSLALPPTPYLGAMKYNLLHAFDQNPPPEVFPEDYDIDTPPTNEKTRIGNGVYQ ---%%%%------------------------------11111111--------------- FKIGEVVDVILQNANMMKENLSETHPWHLHGHDFWVLGYGDGKFSAEEESSLNLKNPPLR -2222---------------------------------------33331111-------- NTVVIFPYGWTAIRFVADNPGVWAFHCHIEPHLHMGMGVVFAEGVEKVGRIPTKALACGG -------------------------------------------3333----3333----- TAKSLINNPKNP ------------ >MODIFIER PROTEIN 1; SWP:P83917; PDB:1AP0; HMVEEVLEEEEEEYVVEKVLDRRVVKGKVEYLLKWKGFSDEDNTWEPEENLDCPDLIAEF 
----------------------------------------------1111-----11111 LQSQKTAHETDKS 111---1111--- >MONOCLONAL ANTIBODY C219; SWP:Q6KB05; PDB:1AP2A; DIVMTQSPSSLTVTAGEKVTMSCKSSQSLLNSGNQKNYLTWYQQKPGQPPKLLIYWASTR -------------2222---------------------------2222------------ ESGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCQNDYSYPLTFGAGTKLEP ------------------------1111------------------------ >MONOCLONAL ANTIBODY C219; SWP:NA; PDB:1AP2B; EVQLQQSGAELVRPGASVKLSCTASGFNIKDDFMHWVKQRPEQGLEWIGRIDPANDNTKY ------------2222-----------1111----------------------------- APKFQDKATIIADTSSNTAYLQLSSLTSEDTAVYYCARREVYSYYSPLDVWGAGTTVTVP 1111--------3333----------3333------------------------------ >POKEWEED ANTIVIRAL PROTEI; SWP:Q03464; PDB:1APA; INTITFDVGNATINKYATFMKSIHNQAKDPTLKCYGIPMLPNTNLTPKYLLVTLQDSSLK ------3333-3333------------------iiii----1111----------1111- TITLMLKRNNLYVMGYADTYNGKCRYHIFKDISNTTERNDVMTTLCPNPSSRVGKNINYD ----------------------------1111-1111----------------------- SSYPALEKKVGRPRSQVQLGIQILNSGIGKIYGVDSFTEKTEAEFLLVAIQMVSEAARFK ------------3333---1111---33332222---3333------------------- YIENQVKTNFNRAFYPNAKVLNLEESWGKISTAIHNAKNGALTSPLELKNANGSKWIVLR --------1111----3333-----------3333--iiii------------------1 VDDIEPDVGLLKYVNGTCQAT 1113333-------------- >ANTHOPLEURIN-B; SWP:P01531; PDB:1APF; GVPCLCDSDGPRPRGNTLSGILWFYPSGCPSGWHNCKAHGPNIGWCCKK ------------------------1111--------------------- >FIBRILLIN; SWP:P35555; PDB:1APJ; SAQDLRMSYCYAKFEGGKCSSPKSRNHSKQECCCALKGEGWGDPCELCPTEPDEAFRQIC -----------------------------------------------------------3 PYGSGIIVGPDDSA 333----------- >CONCANAVALIN A; SWP:P81461; PDB:1APNA; ADTIVAVELDTYPNTPHIGIDIKSVRSKKTAKWNMQNGKVGTAHIIYNSVDKRLSAVVSY -----------------------------------2222--------3333--------- PNADSATVSYDVDLDNVLPEWVRVGLSASTGLYKETNTILSWSFTSKLKSNSTHETNALH ------------3333-------------------------------------------- FMFNQFSKDQKDLILQGDATTGTDGNLELTRVSGSSVGRALFYAPVHIWESSAVVASFEA ---------1111--------2222------------------------1111------- 
TFTFLIKSPDSHPADGIAFFISNIDSSIPSGSTGRLLGLFPDAN ----------------------------2222!!!!-------- >EGF-LIKE MODULE OF BLOOD ; SWP:P00743; PDB:1APO; KDGDQCEGHPCLNQGHCKDGIGDYTCTCAEGFEGKNCEFSTR ---------------------------------1111----- >COMPLEMENT PROTEASE C1R; SWP:P00736; PDB:1APQ; AVDLDECASRSKSGEEDPQPQCQHLCHNYVGGYFCSCRPGYELQEDRHSCQAE ----3333--------------------2222-----------3333------ >ACYLPHOSPHATASE; SWP:P00818; PDB:1APS; STARPLKSVDYEVFGRVQGVCFRMYAEDEARKIGVVGWVKNTSKGTVTGQVQGPEEKVNS 3333-------------------------------------------------------- MKSWLSKVGSPSSRIDRTNFSNEKTISKLEYSNFSVRY -------------------------------------- >CYTOSOLIC ASCORBATE PEROX; SWP:P48534; PDB:1APXA; GKSYPTVSPDYQKAIEKAKRKLRGFIAEKKCAPLILRLAWHSAGTFDSKTKTGGPFGTIK ------------------------------------------1111-1111-----3333 HQAELAHGANNGLDIAVRLLEPIKEQFPIVSYADFYQLAGVVAVEITGGPEVPFHPGRED -3333-3333---------3333---3333------------------------------ KPEPPPEGRLPDATKGSDHLRDVFGKAMGLSDQDIVALSGGHTIGAAHKERSGFEGPWTS -----------1111--------------------------------3333--------- NPLIFDNSYFTELLTGEKDGLLQLPSDKALLTDSVFRPLVEKYAADEDVFFADYAEAHLK 1111--------1111-------3333-3333---------------------------- LSELGFAEA --2222--- >ASPARTYLGLUCOSAMINIDASE; SWP:P20933; PDB:1APYA; SPLPLVVNTWPFKNATEAAWRALASGGSALDAVESGCAMCEREQCDGSVGFGGSPDELGE -------------------------------------------2222--------1111- TTLDAMIMDGTTMDVGAVGDLRRIKNAIGVARKVLEHTTHTLLVGESATTFAQSMGFINE -------------------------3333-----------------------1111---- DLSTSASQALHSDWLARNCQPNYWRNVIPDPSKYCGPYKPP -----------------------------1111-------- >N(4)-(beta-N-acetylglucos; SWP:P20933; PDB:1APYB; TIGMVVIHKTGHIAAGTSTNGIKFKIHGRVGDSPIPGAGAYADDTAGAAAATGNGDILMR -------3333----------22222222--1111-------------------333311 FLPSYQAVEYMRRGEDPTIACQKVISRIQKHFPEFFGAVICANVTGSYGAACNKLSTFTQ 11-------3333--------------33331111-------1111--------1111-- FSFMVYNSEKNQPTEEKVDCI ------3333----------- >1,3-1,4-BETA-GLUCANASE; SWP:P12257; PDB:1AQ0A; 
IGVCYGMSANNLPAASTVVSMFKSNGIKSMRLYAPNQAALQAVGGTGINVVVGAPNDVLS ------------------------------------------2222--------3333-- NLAASPAAAASWVKSNIQAYPKVSFRYVCVGNEVAGGATRNLVPAMKNVHGALVAAGLGH ----------------1111--------------!!!!---------------1111111 IKVTTSVSQAILGVFSPPSAGSFTGEAAAFMGPVVQFLARTNAPLMANIYPYLAWAYNPS 1------3333-----3333----3333-----------------------------333 AMDMGYALFNASGTVVRDGAYGYQNLFDTTVDAFYTAMGKHGGSSVKLVVSESGWPSGGG 3-3333-----------!!!!---------------------1111-------------2 TAATPANARFYNQHLINHVGRGTPRHPGAIETYIFAMFNENQKDSGVEQNWGLFYPNMQH 222--------------3333-1111-------------1111--3333-----1111-- VYPINF ------ >CYCLIN-DEPENDENT PROTEIN ; SWP:P24941; PDB:1AQ1; MENFQKVEKIGEGTYGVVYKARNKLTGEVVALKKIVPSTAIREISLLKELNHPNIVKLLD 1111--------1111----------------------------3333---1111----- VIHTENKLYLVFEFLHQDLKKFMDASALTGIPLPLIKSYLFQLLQGLAFCHSHRVLHRDL ---%%%%------------------------3333---------------1111------ KPQNLLINTEGAIKLADFGLEVVTLWYRAPEILLGCKYYSTAVDIWSLGCIFAEMVTRRA 3333---1111-------------1111---1111------------------------- LFPGDSEIDQLFRIFRTLGTPDEVVWPGVTSMPDYKPSFPKWARQDFSKVVPPLDEDGRS ---------------------33332222--11111111------3333----------- LLSQMLHYDPNKRISAKAALAHPFFQDVTKPVPHLRL --3333--1111--3333---3333------------ >MS2 PROTEIN CAPSID; SWP:P03612; PDB:1AQ3A; ASNFTQFVLVDNGGTGDVTVAPSNFANGVAEWISSNSRSQAYKVTCSVRQSSAQNRKYSI ------------------------2222--------3333-----------1111----- KVEVPKVATQTVGGVELPVAAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPS -----------iiii-----------------1111----------------2222---- AIAANSGIY -1111---- >CARTILAGE MATRIX PROTEIN; SWP:P05099; PDB:1AQ5A; GSHMEEDPCECKSIVKFQTKVEELINTLQQKLEAVAKRIEALENKII -----------------------------------------3333-- >CYTOCHROME B5; SWP:P00173; PDB:1AQA; KYYTLEEIQKHKDSKSTWVILHHKVYDLTKFLEEHPGGEEVLREQAGGDATENFEDVGHS -----------------------------3333-1111-3333----------1111--3 TDARELSKTYIIGELHPDDRSKIA 333-3333---------3333--- >X11; SWP:Q02410; PDB:1AQCA; EDLIDGIIFAANYLGSTQLLSDKTPSKNVRQAQEAVSRIKAQKLTEVDLFILTQRIKVLN 
-3333--------------------3333----------------------1111----- ADTQETDHPLRTISYIADIGNIVVLARRRYKICHVFESEDAQLIAQSIGQAFSVAYQEFL --------3333------!!!!---------------1111------------------- R - >FAB B7-15A2; SWP:NA; PDB:1AQKH; VQLVESGGGVVQPGRSLRLSCAASGFTFNNYAIHWVRQAPGKGLEWVAFISYDGSKNYYA -----------2222-----------3333--------2222--------1111-----1 DSVKGRFTISRDNSKNTLFLQMNSLRPEDTAIYYCARVLFQQLVLYAPFDIWGQGTMVTV 111----------------------1111------------------------------- SSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPQPVTVSWNSGALTSGVHTFPAVLQ ---------------3333-----------------------%%%%-------------1 SSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC 111----------1111-----------3333------------- >FAB B7-15A2; SWP:NA; PDB:1AQKL; NVLTQPPSVSGAPGQRVTISCTGSNSNIGAGFTVHWYQHLPGTAPKLLIFANTNRPSGVP -----------2222--------11111111--------1111----------------3 DRFSGSKSGTSASLAITGLQAEDEADYYCQSYDSSLSARFGGGTRLTVLGQPKAAPSVTL 333----------------3333------------------------------------- FPPSSEELQANKATLVCLISDFYPGAVTVAWKADSSPVNAGVETTKPSKQSNNKYAASSY -------------------------------------------------1111------- LSLTPEQWKSHKSYSCQVTHEGSTVEKTVAPAECS -------1111------------------------ >CU-METALLOTHIONEIN; SWP:P07215; PDB:1AQS; QNEGHECQCQCGSCKNNEQCQKSCSCPTGCNSDDKCPCGN -------------1111----------------------- >ATP SYNTHASE; SWP:P0A6E6; PDB:1AQT; STYHLDVVSAEQQMFSGLVEKIQVTGSEGELGIYPGHAPLLTAIKPGMIRIVKQHGHEEF --------1111-------------1111----2222----------------------- IYLSGGILEVQPGNVTVLADTAIRGQDLDEARAMEAKRKAEEHISSSHGDVDYAQASAEL ----------2222---------3333--------------------------------- AKAIAQLRVIELTKK --------------- >ESTROGEN SULFOTRANSFERASE; SWP:P49891; PDB:1AQUA; EYYEVFGEFRGVLMDKRFTKYWEDVEMFLARPDDLVIATYPKSGTTWISEVVYMIYKEGD 1111----iiii--3333--33331111--1111-------------------------- AIFNRIPYLECRNEDLINGIKQLKEKESPRIVKTHLPPKLLPASFWEKNCKMIYLCRNAK 1111--------!!!!------1111----------3333-----1111----------- DVAVSYYYFLLMITSYPNPKSFSEFVEKFMQGQVPYGSWYDHVKAWWEKSKNSRVLFMFY 
------------1111------------1111-2222--------------1111---33 EDMKEDIRREVVKLIEFLERKPSAELVDRIIQHTSFQEMKNNPSTNYTMMPEEMMNQKVS 33-------------1111----------------------3333-11113333--3333 PFMRKGIIGDWKNHFPEALRERFDEHYKQQMKDCTVKFRME --------3333------------------1111------- >RESTRICTOCIN; SWP:P04389; PDB:1AQZA; ATWTCINQQLEDKRLLYSQAKAESNSHHAPLSDGKTGSSYPHWFTNGYDGNGKLIKGRTP -----------------------------------1111-----iiii1111--2222-- IKFGKADCDRPPKHSQNGMGKDDHYLLEFPTFPDGHDYKFDSKKPKENPGPARVIYTYPN ----3333------1111-1111--------1111---1111------------------ KVFCGIVAHQRGNQGDLRLCSH ----------!!!!-------- >REI; SWP:P01607; PDB:1AR2; TPDIQMTQSPSSLSASVGDRVTITVQASQDIIKHLNWYQQTPGKAPKLLIYEASNLQAGV ---------------2222---------------------2222---------------- PSRFSGSGSGTDYTFTISSLQPEDIATYYCQQYQSLPYTFGQGTKLQIT --------!!!!--------3333------------------------- >ACHROMOBACTER PROTEASE I; SWP:P15636; PDB:1ARB; GVSGSCNIDVVCPEGDGRRDIIRAVGAYSKSGTLACTGSLVNNTANDRKMYFLTAHHCGM ---3333-11111111---3333------iiii---------1111--------3333-- GTASTAASIVVYWNYQNSTCRAPNTPASGANGDGSMSQTQSGSTVKATYATSDFTLLELN ------------------------3333---------------------1111------- NAANPAFNLFWAGWDRRDQNYPGAIAIHHPNVAEKRISNSTSPTSFVAWGGGAGTTHLNV ---3333---------------------2222---------------2222--------- QWQPSGGVTEPGSSGSPIYSPEKRVLGQLHGGPSSCSATGTNRSDQYGRVFTSWTGGGAA ------------2222---1111-----------1111!!!!--------3333!!!!11 ASRLSDWLDPASTGAQFIDGLDS 11------1111----------- >NEBULIN; SWP:P20929; PDB:1ARK; TAGKIFRAMYDYMAADADEVSFKDGDAIINVQAIDEGWMYGTVQRTGRTGMLPANYVEAI ----------------------------------------------------1111---- >N-acetylmuramoyl-L-alanin; SWP:P00806; PDB:1AROL; RVQFKQRESTDAIFVHCSATKPSQNVGVREIRQWHKEQGWLDVGYHFIIKRDGTVEAGRD --------------------1111------------------------------------ EMAVGSHAKGYNHNSIGVCLVGGIDDKGKFDANFTPAQMQSLRSLLVTLLAKYEGAVLRA -------2222-----------------------3333---------------------1 HHEVAPKACPFDLKRWWEKNELVTSDRG 111------------------------- >DNA-directed RNA polymera; SWP:P00573; PDB:1AROP; 
KNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEARFRKMFERQLLITTLLPK ------3333-----------3333--------3333----3333--3333--------- MIARINDWFEEVKAKRGKRPTAFQFLQEIKPEAVAYITIKTTLACLTSADNTTVQAVASA -------------------1111------3333-------------------3333---- IGRAIEDEARFGRIRDLEAKHFKKFMQVVEADMLSKGLLGGEAWSSWHKEDSIHVGVRCI -----------------3333----------------------------3333------- EMLIESTGMVSLHRQNSETIELAPEYAEAIATRAGALAGISPMFQPCVVPPKPWTGITGG ------------------------------------------------------------ GYWANGRRPLALVRTHSKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKW ----------------------------3333-----3333-----------3333---- VYRKDKARKSRRISLEFMLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGL 3333--------------------1111---------1111------------------- LTLAKGKPIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENTWWA ---------------------------------------------3333--------333 EQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAMLRDEVGGRAVNLLP 3---1111------------------------------------3333---1111----- SETVQDIYGIVAKKVNEILQADAINGGTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFG ----------------------1111-------3333---1111-3333----------- FRQQVLEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLKSAAKL ---------11111111-------3333-------------------------------1 LAAEVKDKKTGEILRKRSAVHWVTPDGFPVWQEYKKPIQTRLNLMFLGQFRLQPTINTNK 111--------------------1111--------------------------------- DSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHEKYGIESFALIHDSFGTIPADAANLF -------3333-------------------1111----------%%%%---3333----- KAVRETMVDTYESSDVLADFYDQFADQLHESQLDKMPALPAKGNLNLRDILESD ------------------------1111--1111-------------1111--- >ASPARTATE AMINOTRANSFERAS; SWP:P00509; PDB:1ARS; MFENITAAPADPILGLADLFRADERPGKINLGIGVYKDETGKTPVLTSVKKAEQYLLENE -1111--------------------------------1111------------------- TTKNYLGIDGIPEFGRCTQELLFGKGSALINDKRARTAQTPGGTGALRVAADFLAKNTSV ------1111-------------------1111--------------------------- KRVWVSNPSWPNHKSVFNSAGLEVREYAYYDAENHTLDFDALINSLNEAQAGDVVLFHGC ---------1111----1111------------------------11112222------- 
CHNPTGIDPTLEQWQTLAQLSVEKGWLPLFDFAYQGFARGLEEDAEGLRAFAAMHKELIV ---------------------------------2222---3333---------------- ASSYSKNFGLYNERVGACTLVAADSETVDRAFSQMKAAIRANYSNPPAHGASVVATILSN -----1111-----------------------------------------------1111 DALRAIWEQELTDMRQRIQRMRQLFVNTLQEKGANRDFSFIIKQNGMFSFSGLTKEQVLR -------------------------------------3333------------------- LREEFGVYAVASGRVNVAGMTPDNMAPLCEAIVAVL ---------1111--1111-3333------------ >ANTICHYMOTRYPSIN; SWP:P01011; PDB:1AS4A; GLASANVDFAFSLYKQLVLKAPDKNVIFSPLSISTALAFLSLGAHNTTLTEILKGLKFNL --------------------1111----------------1111--------------33 TETSEAEIHQSFQHLLRTLNQSSDELQLSMGNAMFVKEQLSLLDRFTEDAKRLYGSEAFA 33----------------------------------2222-------------------- TDFQDSAAAKKLINDYVKNGTRGKITDLIKDLDSQTMMVLVNYIFFKAKWEMPFDPQDTH -33333333---------1111----------1111------------------3333-- QSRFYLSKKKWVMVPMMSLHHLTIPYFRDEELSCTVVELKYTGNASALFILPDQDKMEEV -----------------------------1111-------------------2222---- EAMLLPETLKRWRDSLEFREIGELYLPKFSISRDYNLNDILLQLGIEEAFTSKADLSGIT 11113333---------------------------------1111-33331111------ GARNLAVSQVVHKAVLDVFEEGTEASRATAVKITLL ------------------3333-------------- >HEMOGLOBIN (OXY); SWP:P28316; PDB:1ASH; ANKTRELCMKSLEHAKVDTSNEARQDGIDLYKHMFENYPPLRKYFKSREEYTAEDVQNDP -----------1111-------------------------3333---------------- FFAKQGQKILLACHVLCATYDDRETFNAYTRELLDRHARDHVHMPPEVWTDFWKLFEEYL ------------------1111---------------1111---1111-----------3 GKKTTLDEPTKQAWHEIGREFAKEINK 333------------------------ >THERMOSOME; SWP:P48424; PDB:1ASS; MSGIVIDKEKVHSKMPDVVKNAKIALIDSALEIKKTEIEAKVQISDPSKIQDFLNQETNT -----------1111--------------------------11111111----------- FKQMVEKIKKSGANVVLCQKGIDDVAQHYLAKEGIYAVRRVKKSDMEKLAKATGAKIVTD -------3333---------------------------------------1111------ LDDLTPSVLGEAETVEERKIGDDRMTFVMGCK -------------------------------- >17-HEDGEHOG; SWP:Q02936; PDB:1AT0; CFTPESTALLESGVRKPLGELSIGDRVLSTANGQAVYSEVILFDRNLEQQNFVQLHTDGG 
--1111---3333---3333---------1111-----------------------1111 AVLTVTPAHLVSVWQPESQKLTFVFADRIEEKNQVLVRDVETGELRPQRVVKVGSVRSKG -----1111------1111-----3333-----------1111----------------- VVAPLTREGTIVVNSVAASCYA -----3333---iiii------ >HERPES SIMPLEX VIRUS TYPE; SWP:Q69527; PDB:1AT3A; RAVPIYVAGFLALYDSGDPGELALDPDTVRAALPPENPLPINVDHRARCEVGRVLAVVND ------------------3333-------1111------------3333----------1 PRGPFFVGLIACVQLERVLETAASAAILSREERLLYLITNYLPSVSLSTKPDRTLFAHVA 111---------3333-----------------------------------1111----- LCAIGRRLGTIVTYDTSLDAAIAPFRHLDPATREGVRREAAEAELALAGRTWAPGVEALT ----------------3333--------------------------2222---------- HTLLSTAVNNMMLRDRWSLVAERRRQAGIAGHTYLQA ------1111--------------1111--------- >ASCARIS TRYPSIN INHIBITOR; SWP:P19398; PDB:1ATA; EAEKCTKPNEQWTKCGGCEGTCAQKIVPCTRECKPPRCECIASAGFVRDAQGNCIKFEDC ------------------------------------------------1111---3333- PK -- >PERIPLASMIC MOLYBDATE-BIN; SWP:Q7SIH2; PDB:1ATG; ELKVVTATNFLGTLEQLAGQFAKQTGHAVVISSGSSGPVYAQIVNGAPYNVFFSADEKSP ------3333--------------------------------1111-------------- EKLDNQGFALPGSRFTYAIGKLVLWSAKPGLVDNQGKVLAGNGWRHIAISNPQIAPYGLA ---------2222--------------2222----3333---------------3333-- GTQVLTHLGLLDKLTAQERIVEANSVGQAHSQTASGAADLGFVALAQIIQAAAKIPGSHW -------------------------------------------3333--1111------- FPPANYYEPIVQQAVITKSTAEKANAEQFMSWMKGPKAVAIIKAAGYVLPQ --1111----------1111----------------------1111----- >GLYCYL-TRNA SYNTHETASE; SWP:P56206; PDB:1ATIA; AASSLDELVALCKRRGFIFQSSEIYGGLQGVYDYGPLGVELKNNLKQAWWRRNVYERDDM ---3333-----1111---2222------------------------------3333--- EGLDASVLTHRLVLHYSGHEATFADPMVDNWTPPRYFNMMFQDLRGPRGGRGLLAYLRPE ---------3333-1111-------------------------------1111------- TAQGIFVNFKNVLDATSRKLGFGIAQIGKAFRNEITPRNFIFRVREFEQMEIEYFVRPGE 33333333------------------------------!!!!---------------111 DEYWHRYWVEERLKWWQEMGLSRENLVPYQQPPESSAHYAKATVDILYRFPHGSLELEGI 1--------------------3333------3333-1111---------1111------- 
AQRTDFDLGSHTKDQEALGITARVLRNEHSTQRLAYRDPETGKWFVPYVIEPSAGVDRGV --!!!!3333---1111------------------------------------------- LALLAEAFTREELPNGEERIVLKLKPQLAPIKVAVIPLVKNRPEITEYAKRLKARLLALG ------------1111--------3333-------------3333--------------- LGRVLYEDTGNIGKAYRRHDEVGTPFAVTVDYDTIGQSKDGTTRLKDTVTVRDRDTMEQI -------------------------------3333--1111-1111-------------- RLHVDELEGFLRERLRW --3333----------- >ATROLYSIN C; SWP:P15167; PDB:1ATLA; LPQRYIELVVVADHRVFMKYNSDLNTIRTRVHEIVNFINGFYRSLNIHVSLTDLEIWSNE -------------------%%%%------------------1111--------------- DQINIQSASSDTLNAFAEWRETDLLNRKSHDNAQLLTAIELDEETLGLAPLGTMCDPKLS -----------------------3333-------------------------2222---- IGIVQDHSPINLLMGVTMAHELGHNLGMEHDGKDCLRGASLCIMRPGLTKGRSYEFSDDS -------------------------------1111-!!!!-1111--------------- MHYYERFLKQYKPQCILNKP -------------1111--- >Deoxyribonuclease-1 [Prec; SWP:P00639; PDB:1ATND; LKIAAFNIRTFGETKMSNATLASYIVRIVRRYDIVLIQEVRDSHLVAVGKLLDYLNQDDP -----------3333---3333-----1111----------1111--------------- NTYHYVVSEPLGRNSYKERYLFLFRPNKVSVLDTYQYDDGCCGNDSFSREPAVVKFSSHS -----------3333---------1111-------------------------------- TKVKEFAIVALHSAPSDAVAEINSLYDVYLDVQQKWHLNDVMLMGDFNADCSYVTSSQWS -------------3333---------------------------------3333333311 SIRLRTSSTFQWLIPDSADTTATSTNCAYDRIVVAGSLLQSSVVPGSAAPFDFQAAYGLS 11---------------------------------33333333--------3333----- NEMALAISDHYPVEVTLT ---3333----------- >ANTITHROMBIN III; SWP:P41361; PDB:1ATTA; VEDVCTAKPRDIPVNPMCIYRATEGQGSEQKIPGATNRRVWELSKANSHFATAFYQHLAD ------------------------------------------------------------ SKNNNDNIFLSPLSISTAFAMTKLGACNNTLTQLMEVFKFDTISEKTSDQIHFFFAKLNC --1111---------------------------------3333----------------- RLYRKANKSSELVSANRLFGDKSITFNETYQDISEVVYGAKLQPLDFKGNAEQSRLTINQ --------------------1111---------------------3333----------- WISNKTEGRITDVIPPQAINEFTVLVLVNTIYFKGLWKSKFSPENTRKELFYKADGESCS -----%%%%--------------------------------3333-------1111---- 
VLMMYQESKFRYRRVAESTQVLELPFKGDDITMVLILPKLEKTLAKVEQELTPDMLQEWL ---------------%%%%-----------------------3333------------11 DELTETLLVVHMPRFRIEDSFSVKEQLQDMGLEDLFSPEKSRLPGIVAEGRSDLYVSDAF 11-------------------3333-------3333------1111-------------- HKAFLEVNEEGSEAAASTVISIAGRSLRVTFKANRPFLVLIREVALNTIIFMGRVANPCV -------1111-------------------------------3333-------------- D - >ATX IA; SWP:P01533; PDB:1ATX; GAACLCKSDGPNTRGNSMSGTIWVFGCPSGWNNCEGRAIIGYCCKQ -----3333------------------2222--------------- >VON WILLEBRAND FACTOR; SWP:P04275; PDB:1ATZA; QPLDVILLLDGSSSFPASYFDEMKSFAKAFISKANIGPRLTQVSVLQYGSITTIDVPWNV ------------------------------------1111-------------------- VPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTSEMHGARPGASKAVVILVTDVSVDS ----------1111-----------------------22221111--------------- VDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGPAGDSNVVKLQRIEDLPTMVTLGNSF --------1111--------------------!!!!3333-----3333-3333---333 LHKL 3--- >INTERFERON-BETA; SWP:P01574; PDB:1AU1A; MSYNLLGFLQRSSNFQCQKLLWQLNGRLEYCLKDRMNFDIPEEIKQLQQFQKEDAALTIY ---------------------------33331111-----3333---------------- EMLQNIFAIFRQDSSSTGWNETIVENLLANVYHQINHLKTVLEEKLEKEDFTRGKLMSSL --------1111-3333----------------------------1111----------- HLKRYYGRILHYLKAKEYSHCAWTIVRVEILRNFYFINRLTGYLRN -------------1111----------------------------- >PIT-1; SWP:Q00286; PDB:1AU7A; GMRALEQFANEFKVRRIKLGYTQTNVGEALAAVHGSEFSQTTICRFENLQLSFKNACKLK ----------------3333---------------------------------------- AILSKWLEEAEQKRRTTISIAAKDALERHFGEHSKPSSQEIMRMAEELNLEKEVVRVWFC ------1111--------3333--------------3333-------------------- NRRQREKRVK ----1111-- >PHOSPHATIDYLINOSITOL TRAN; SWP:P24280; PDB:1AUA; QQEKEFLESYPQNCPPDALPGTPGNLDSAQEKALAELRKLLEDAGFIERLDDSTLLRFLR -----1111-----11112222----------------------------3333------ ARKFDVQLAKEMFENCEKWRKDYGTDTILQDFHYDEKPLIAKFYPQYYHKTDKDGRPVYF -%%%%------------------3333------1111--3333--------1111----- EELGAVNLHEMNKVTSEERMLKNLVWEYESVVQYRLPACSRAAGHLVETSCTIMDLKGIS 
-3333--3333--------------------------------------------2222- ISSAYSVMSYVREASYISQNYYPERMGKFYIINAPFGFSTAFRLFKPFLDPVTVSKIFIL --------------------------------------------1111-3333------- GSSYQKELLKQIPAENLPVKFGGKSEVDESKGGLYLSDIGPWRDPKYIGPEGEAPE ---33331111--11113333------1111--1111--11111111-1111---- >PYROGLUTAMYL PEPTIDASE-1; SWP:P46107; PDB:1AUGA; MEKKVLLTGFDPFGGETVNPSWEAVKRLNGAAEGPASIVSEQVPTVFYKSLAVLREAIKK ------------iiii---------1111---!!!!------------------------ HQPDIIICVGQAGGRMQITPERVAINLNEARIPDNEGNQPVGEDISQGGPAAYWTGLPIK ---------------------------------1111--------2222----------- RIVEEIKKEGIPAAVSYTAGTFVCNHLFYGLMDEISRHHPHIRGGFIHIPYIPEQTLQKS ------1111----------------------------1111---------3333----- APSLSLDHITKALKIAAVTAAVHEDDIETG ------------------------------ >SERINE/THREONINE PHOSPHAT; SWP:Q08209; PDB:1AUIA; TDRVVKAVPFPPSHRLTAKEVFDNDGKPRVDILKAHLMKEGRLEESVALRIITEGASILR ----3333--------3333--1111----------1111------------------11 QEKNLLDIDAPVTVCGDIHGQFFDLMKLFEVGGSPANTRYLFLGDYVDRGYFSIECVLYL 11-------------------------------3333----------------------- WALKILYPKTLFLLRGNHECRHLTEYFTFKQECKIKYSERVYDACMDAFDCLPLAALMNQ ---------------11113333-------------------------1111-----%%% QFLCVHGGLSPEINTLDDIRKLDRFKEPPAYGPMCDILWSDPLEDFGNEKTQEHFTHNTV %--------1111-33331111----------------------2222------------ RGCSYFYSYPAVCEFLQHNNLLSILRAHEAQDAGYRMYRKSQTTGFPSLITIFSAPNYLD ----------------1111----------1111----------------------2222 VYNNKAAVLKYENNVMNIRQFNCSPHPYWLPNFMDVFTWSLPFVGEKVTEMLVNVLNICS -----------%%%%--------------2222---3333-------------------- SFEEAKGLDRINERMPPR --------3333------ >SERINE/THREONINE PHOSPHAT; SWP:P06705; PDB:1AUIB; SYPLEMCSHFDADEIKRLGKRFKKLDLDNSGSLSVEEFMSLPELQQNPLVQRVIDIFDTD ----------------------------------------3333--1111-------111 GNGEVDFKEFIEGVSQFSVKGDKEQKLRFAFRIYDMDKDGYISNGELFQVLKMMVGNNLK 1----------------1111---------33331111------------3333------ DTQLQQIVDKTIINADKDGDGRISFEEFCAVVGGLDIHKKMVVDV -------------------------------33333333------ >ARYLSULFATASE 
A; SWP:P15289; PDB:1AUK; RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLTPSRAALLTG -------------11111111--------------------------------------- RLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQ --3333-------1111----3333-------1111--------------%%%%-3333- GFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEA ----------------1111-----------------------!!!!------3333--- RYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVG -------------1111------------------3333--------------------- TLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWP ----------1111--------------!!!!---!!!!--2222-3333--------22 GHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSY 22----------3333------1111---------------------------------- PDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENY -3333------!!!!---------1111---33333333-----------33331111-- NLLGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCP 1111---------------------------3333---3333----2222---3333--- >PR-5D; SWP:P25871; PDB:1AUN; SGVFEVHNNCPYTVWAAATPVGGGRRLERGQSWWFWAPPGTKMARIWGRTNCNFDGAGRG ---------------------------2222------2222-------------1111-- WCQTGDCGGVLECKGWGKPPNTLAEYALNQFSNLDFWDISVIDGFNIPMSFGPTKPGPGK ------%%%%--------------------%%%%-----------------------!!! 
CHGIQCTANINGECPGSLRVPGGCNNPCTTFGGQQYCCTQGPCGPTELSRWFKQRCPDAY !-------3333--3333-2222--3333---3333---------3333------1111- SYPQDDPTSTFTCTSWTTDYKVMFCPYG -----1111---------------1111 >CARBOXYLESTERASE; SWP:Q53547; PDB:1AUOA; MTEPLILQPAKPADACVIWLHGLGADRYDFMPVAEALQESLLTTRFVLPQAPTRPVTING ---------------------22223333-------3333---------------3333- GYEMPSWYDIKAMSPARSISLEELEVSAKMVTDLIEAQKRTGIDASRIFLAGFSQGGAVV -------------------------------------------3333------3333--- FHTAFINWQGPLGGVIALSTYAPTFGDELELSASQQRIPALCLHGQYDDVVQNAMGRSAF ---------------------11111111--3333----------------3333----- EHLKSRGVTVTWQEYPMGHEVLPQEIHDIGAWLAARLG ---1111------------------------------- >NAD-SPECIFIC GLUTAMATE DE; SWP:P24295; PDB:1AUP; SKYVDRVIAEVEKKYADEPEFVQTVEEVLSSLGPVVDAHPEYEEVALLERMVIPERVIEF --------------1111-----------------------------3333--------- RVPWEDDNGKVHVNTGYRVQFNGAIGPYLGGLRFAPSVNLSIMKFLGFEQAFKDSLTTLP -----1111-------------1111--------1111---------------------- MGGAKGGSDFDPNGKSDREVMRFCQAFMTELYRHIGPDIDVPAGDLGVGAREIGYMYGQY -----------2222--------------3333-----------2222------------ RKIVGGFYNGVLRPEATGYGSVYYVEAVMKHENDTLVGKTVALAGFGNVAWGAAKKLAEL -----------------------------1111-------------3333-------111 GAKAVTLSGPDGYIYDPEGITTEEKINYMLEMRASGRNKVQDYADKFGVQFFPGEKPWGQ 1-------1111---3333-------------1111--3333------------------ KVDIIMPCATQNDVDLEQAKKIVANNVKYYIEVANMPTTNEALRFLMQQPNMVVAPSKAV ---------------3333--1111-----------------------1111---3333- NAGGVLVVGFETAEEVDSKLHQVMTDIHDGSAAAAERYGLGYNLVAGANIVGFQKIADAM -----------------------------------1111--------------------- MAQGIAW ------- >Vitamin K-dependent prote; SWP:P04070; PDB:1AUTC; LIDGKMTRRGDSPWQVVLLDSKKKLACGAVLIHPSWVLTAAHCMDESKKLLVRLGEYDLR -------22221111----1111----------------1111----------------- RWEKWELDLDIKEVFVHPNYSKSTTDNDIALLHLAQPATLSQTIVPICLPDSGLAERAE ----------------------------------------------------------- >Vitamin K-dependent prote; SWP:P04070; PDB:1AUTL; QCLVLPLEHPCASLCCGHGTCIGIGSFSCDCRSGWEGRFCQREVSFLNCSLDNGGCTHYC 
------------1111--------------------1111------------iiii---- LEEVGWRRCSCAPGYKLGDDLLQCHPAVKFPCGRPWK --1111-----2222--3333---------------- >SACY; SWP:P15401; PDB:1AUUA; MKIKRILNHNAIVVKDQNEEKILLGAGIAFNKKKNDIVDPSKIEKTFIRKDTPDY ---------------3333-----2222----2222--3333------------- >SYNAPSIN IA; SWP:P17599; PDB:1AUVA; AARVLLVIDEPHTDWAKYFKGKKIHGEIDIKVEQAEFSDLNLVAHANGGFSVDEVLRNGV ---------33333333-2222-iiii--------3333-----1111------------ KVVRSLKPDFVLIRQHAFSARNGDYRSLVIGLQYAGIPSINSLHSVYNFCDKPWVFAQVR --------------------------------1111----------1111---------- LHKKLGTEEFPLINQTFYPNHKELSSTTYPVVVKGHAHSGGKVKVDNQHDFQDIASVVAL ----------------------------------------------------------11 TKTYATTEPFIDAKYDVRIQKIGQNYKAYRTLEQIASDRYKLWVDTCSEIFGGLDICAVE 11-----------------------------------------------iiii------- ALHGKDGRDHIIEVVGSSPLIGDHQDEDKQLIVELVVNKAQA ---3333----------------3333--------------- >TURNIP YELLOW MOSAIC VIRU; SWP:P03608; PDB:1AUYA; SPLTIKQPFQSEVLFAGTKDAEASLTIANIDSVSTLTTFYRHASLESLWVTIHPTLQAPT -------------------------3333-----1111----------------3333-- FPTTVGVCWVPAQSPVTPAQITKTYGGQIFCIGGAIQTLSPLIVKCPLEMMQPRVKDSIQ ----------1111--3333---3333---2222------------3333---------- YLDSPKLLISITAQPTAPPASTCIITVSGTLSMHSPLITDTST ------------------------------------------- >Alpha-amylase/subtilisin ; SWP:P07596; PDB:1AVAC; ADPPPVHDTDGHELRADANYYVLSANRAHGGGLTMAPGHGRHCPLFVSQDPNGQHDGFPV -------1111--------------3333--------iiii----------1111----- RITPYGVAPSDKIIRLSTDVRISFRAYTTCLQSTEWHIDSELAAGRRHVITGPVKDPSPS ---------------------------1111-----------iiii-----------333 GRENAFRIEKYSGAEVHEYKLMSCGDWCQDLGVFRDLKGGAWFLGATEPYHVVVFKKAPP 31111-------------------------------2222-------------------- A - >ARCELIN-1; SWP:P19329; PDB:1AVBA; SNDASFNVETFNKTNLILQGDATVSSEGHLLLTNVKGNEEDSMGRAFYSAPIQINDRTID -----------3333---------1111----------2222-------------3333- NLASFSTNFTFRINAKNIENSAYGLAFALVPVGSRPKLKGRYLGLFNTTNYDRDAHTVAV ----------------1111----------2222----!!!!---------1111----- 
VFDTVSNRIEIDVNSIRPIATESCNFGHNNGEKAEVRITYDSPKNDLRVSLLYPSSEEKC ------------------------33332222---------1111--------1111--- HVSATVPLEKEVEDWVSVGFSATSGSKKETTETHNVLSWSFSSNFI ------3333----------------1111---------------- >ANNEXIN VI; SWP:ANX6_BOVIN; PDB:1AVC; YRGSIRDFPDFNPSQDAETLYNAMKGFGSDKEAIINLITSRSNKQRQEICQNYKSLYGKD ---------------------1111----3333---1111-------------------- LIADLKYELTGKFERLIVGLMRPPAYADAKEIKDAISGIGTDEKCLIEILASRTNEQIHQ ----------3333---1111--------------------------------------- LVAAYKDAYERDLEADITGDTSGHFRKMLVVLLQGTREEDDVVSEDLVQQDVQDLYEAGE ----------------1111----------3333------------------------11 LKWGTDEAQFIYILGNRSKQHLRLVFDEYLKTTGKPIEASIRGELSGDFEKLMLAVVKCI 11----3333-------------------------3333-2222---------------- RSTAEYFAERLFKAMKGLGTRDNTLIRIMVSRSELDMLDIREIFRTKYEKSLYSMIKNDT --------------------3333------------------------------------ SGEYKKTLLKLCGGQFFPEAAQVAYQMWELSAVARVELKGTVRPAGDFNPDADAKALRKA ------------------3333--------------------------3333-------- MKGLGTDEDTIIDIITHRSNAQRQQIRQTFKSHFGRDLMADLKSELSGDLARLILGLMMP -------------1111------------------------------------------- PAHYDAKQLKKAMEGAGTDEKALIEILATRTNAEIQAINKAYKEDYHKTLEDALSSDTSG 3333------1111-----------1111------------------------------- HFKRILISLATGNREEGGEDRERAREDAQVAAEILTRFMMILCTRSYPDLRRVFQEFVKM ------------------------------------------------------------ TNYDVEHTIKKEMSGDVRDVFVAIVQSVKNKPLFFADKLYKSMKGAGTEEKTLTRIMVSR ---3333---------------------------------1111-------------111 SEIDLLNIRREFIEKYDKSLHQAIEGDTSGHFLKALLAICGG 1----------------------------3333--------- >Triabin [Precursor]; SWP:Q27049; PDB:1AVGI; AEGDDCSIEKAMGDFKPEEFFNGTWYLAHGPGVTSPAVCQKFTTSGSKGFTQIVEIGYNK ----3333-------1111--------------------------------------111 FESNVKFQCNQVDNKNGEQYSFKCKSSDNTEFEADFTFISVSYDNFALVCRSITFTSQPK 1------------------------1111------------------------------- EDRYLVFERTKSDTDPDAKEIC ----------------3333-- >11S REGULATOR; SWP:Q06323; PDB:1AVOA; LRVQPEAQAKVDVFREDLCTKTENLLGSYFPKKISELDAFLKEPALNEANLSNLKAPLDI 
------------------------------------------3333---1111------- >Proteasome activator comp; SWP:Q06323; PDB:1AVOB; AVNCNEKIVVLLQRLKPEIKDVIEQLNLVTTWLQLQIPRIEDGNNFGVAVQEKVFELMTS ----3333-------------------------1111----------------------- LHTKLEGFHTQISKYFSERGDAVTKAAKQPHVGDYRQLVHELDEAEYRDIRLMVMEIRNA ----------------------------1111---------------------------- YAVLYDIILKNFEKLKKPRG --------11113333---- >LAMBDA EXONUCLEASE; SWP:P03697; PDB:1AVQA; HMTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFH -------------3333-2222----1111---33333333------------------- TLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDSMRTACSPDG ------------------------------------------------------------ LCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKNAWYFANYDPRMK -1111---------3333--------3333-------------------------1111- REGLHYVVIERDEKYMASFDEIVPEFIEKMDEALAEIGFVFGEQWR -----------3333-------------------1111---3333- >TROPONIN C; SWP:P02588; PDB:1AVSA; QAEARAFLSEEMIAEFKAAFDMFDADGGGDISTKELGTVMRMLGQNPTKEELDAIIEEVD --1111-----------------1111------------3333------------33331 EDGSGTIDFEEFLVMMVRQMK 111-------------3333- >Trypsin inhibitor A [Prec; SWP:P01070; PDB:1AVWB; DFVLDNEGNPLENGGTYYILSDITAFGGIRAAPTGNERCPLTVVQSRNELDKGIGTIISS -----------2222------------------!!!!----------1111--------- PYRIRFIAEGHPLSLKFDSFAVIMLCVGIPTEWSVVEDLPEGPAVKIGENKDAMDGWFRL -------2222-----------3333-----------------------1111------- ERVSEFNNYKLVFCPQDKCGDIGISIDHDDGTRRLVVSKNKPLVVQFQKLD --------------------------------------------------- >FIBRITIN; SWP:P10104; PDB:1AVYA; TNKIKAIETDIASVRQEVNTAKGNISSLQGDVQALQEAGYIPEAPRDGQAYVRKDGEWVL -----------------------------------------------------%%%%--3 LSTFLSPA 333----- >MENKES COPPER-TRANSPORTIN; SWP:Q04656; PDB:1AW0; LTQETVINIDGMTCNSCVQSIEGVISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGA -------------------------------------1111------3333--------- IEDMGFDATLSD ------------ >TRIOSEPHOSPHATE ISOMERASE; SWP:P50921; PDB:1AW2A; RHPVVMGNWKLNGSKEMVVDLLNGLNAELEGVTGVDVAVAPPALFVDLAERTLTEAGSAI 
--------------------------1111-----------3333--------------- ILGAQNTDLNNSGAFTGDMSPAMLKEFGATHIIIGHSERREYHAESDEFVAKKFAFLKEN --------------2222-----3333------------------------------111 GLTPVLCIGESDAQNEAGETMAVCARQLDAVINTQGVEALEGAIIAYEPIWAIGTGKAAT 1---------3333-----------------------1111-------3333-------3 AEDAQRIHAQIRAHIAEKSEAVAKNVVIQYGGSVKPENAAAYFAQPDIDGALVGGAALDA 333-----------3333-3333-----------3333--11111111-----3333--- KSFAAIAKAAAEAKA -----------1111 >5-AMINOLEVULINATE DEHYDRA; SWP:P05373; PDB:1AW5; HTAEFLETEPTEISSVLAGGYNHPLLRQWQSERQLTKNLIFPLFISDNPDDFTEIDSAPN -----------3333-1111--3333-1111----------------1111--------- INRIGVNRLKDYLKPLVAKGLRSVILFGVPLIPGTKDPVGTAADDPAGPVIQGIRFIREK ----11111111---------------------------3333-11113333---3333- FPELYIICDVCLCEYTSHGHCGVLYDDGTINRERSVSRLAAVAVNYAKAGAHCVAPSDID -----------11113333---------------------------1111--------22 GRIRDIKRGLINANLAHKTFVLSYAAKFSGNLYGPACYQLPPAGRGLARRALERDSEGAD 22--------11111111----------------------1111---------------- GIIVKPSTFYLDIVRDASEICKDLPICAYHVSGEYALHAAAEKGVVDLKTIAFESHQGFL ------1111----------1111----------------1111---------------- RAGARLIITYLAPEFLDWLDE ----------3333--3333- >GLUTATHIONE S-TRANSFERASE; SWP:Q9ZP62; PDB:1AW9; APLKLYGMPLSPNVVRVATVLNEKGLDFEIVPVDLTTGAHKQPDFLALNPFGQIPALVDG -------1111--------------------------------3333-1111------!! 
DEVLFESRAINRYIASKYASEGTDLLPATASAAKLEVWLEVESHHFYPNASPLVFQLLVR !!------------------------1111---------------3333----------- PLLGGAPDAAVVDKHAEQLAKVLDVYEAHLARNKYLAGDEFTLADANHASYLLYLSKTPK 1111------------------------3333--1111---3333------------111 AGLVAARPHVKAWWEAIVARPAFQKTVAAIPLPPPP 1--1111-------------------1111------ >GA BINDING PROTEIN ALPHA; SWP:Q00422; PDB:1AWCA; IQLWQFLLELLTDKDARDCISWVGDEGEFKLNQPELVAQKWGQRKNKPTMNYEKLSRALR -3333-------11111111--------------------------1111---------1 YYYDGDMICKVQGKRFVYKFVCDLKTLIGYSAAELNRLVIECEQKKLARM 111-------2222--------3333------------------------ >GA-binding protein beta c; SWP:Q00421; PDB:1AWCB; DLGKKLLEAARAGQDDEVRILMANGAPFTTDWLGTSPLHLAAQYGHFSTTEVLLRAGVSR ------------------------------1111-------------------1111--- DARTKVDRTPLHMAASEGHANIVEVLLKHGADVNAKDMLKMTALHWATEHNHQEVVELLI --------------------------1111-1111-1111-------------------- KYGADVHTQSKFCKTAFDISIDNGNEDLAEILQ ---------1111-3333--1111--------- >FERREDOXIN; SWP:P56408; PDB:1AWD; YKVTLKTPSGEETIECPEDTYILDAAEEAGLDLPYSCRAGACSSCAGKVESGEVDQSDQS ------1111------1111------1111------------1111---------1111- FLDDAQMGKGFVLTCVAYPTSDVTILTHQEAALY ------------3333------------3333-- >ITK; SWP:Q03526; PDB:1AWJ; KKPLPPTPEDNRRSFQEPEETLVIALYDYQTNDPQELALRCDEEYYLLDSSEIHWWRVQD --------------------------------3333------------------------ KNGHEGYAPSSYLVEKS ---------1111---- >TRYPTOPHANASE; SWP:P28796; PDB:1AX4A; AKRIVEPFRIKMVEKIRVPSREEREAALKEAGYNPFLLPSSAVYIDLLTDSGTNAMSDHQ ---------------------------------1111-3333------------------ WAAMITGDEAYAGSRNYYDLKDKAKELFNYDYIIPAHQGRGAENILFPVLLKYKQKEGKA -------------3333------------------------------------------- KNPVFISNFHFDTTAAHVELNGCKAINIVTEKAFDSETYDDWKGDFDIKKLKENIAQHGA ------------------1111-------3333-1111--------------------33 DNIVAIVSTVTCNSAGGQPVSMSNLKEVYEIAKQHGIFVVMDSARFCENAYFIKARDPKY 33----------1111----------------------------------------3333 KNATIKEVIFDMYKYADALTMSAKDPLLNIGGLVAIRDNEEIFTLARQRCVPMEGFVTYG 
-----------3333----------------------------------------1111- GLAGRDMAAMVQGLEEGTEEEYLHYRIGQVKYLGDRLREAGIPIQYPTGGHAVFVDCKKL --3333--------3333-------------------1111--------------3333- VPQIPGDQFPAQAVINALYLESGVRAVEIGSFLLGRDPATGEQKHADMEFMRLTIARRVY 11113333---------------------3333---1111-------------------- TNDHMDYIADALIGLKEKFATLKGLEFEYEPPVLRHFTARLKPI 3333------------3333--------------3333------ >OBESITY PROTEIN; SWP:P41159; PDB:1AX8; IQKVQDDTKTLIKTIVTRINDILDFIPGLHPILTLSKMDQTLAVYQQILTSMPSRNVIQI ----------------------1111---------------------3333--3333--- SNDLENLRDLLHVLAFSKSCHLPEASGLETLDSLGGVLEASGYSTEVVALSRLQGSLQDM ---------------1111-----------33333333-2222----------------- LWQLDLSPGC ---1111--- >PCNA; SWP:P12004; PDB:1AXCA; MFEARLVQGSILKKVLEALKDLINEACWDISSSGVNLQSMDSSHVSLVQLTLRSEGFDTY -------3333-------3333--------3333------1111--------3333---- RCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALVFEAPEKVSDYEMKLMDLD ------------------11111111------1111------------------------ VEQLGIPEQEYSCVVKMPSGEFARICRDLSHIGDAVVISCAKDGVKFSASGELGNGNIKL ---------------------------3333---------3333---------------- SQTSEEEAVTIEMNEPVQLTFALRYLNFFTKATPLSSTVTLSMSADVPLVVEYKIADMGH -------------------------------3333--------2222------------- LKYYLAPKI --------- >GLUTATHIONE S-TRANSFERASE; SWP:P12653; PDB:1AXDA; APMKLYGAVMSWNLTRCATALEEAGSDYEIVPINFATAEHKSPEHLVRNPFGQVPALQDG -------1111-----------------------111111113333--1111-------- DLYLFESRAICKYAARKNKPELLREGNLEEAAMVDVWIEVEANQYTAALNPILFQVLISP -----------------------------------------------------------1 MLGGTTDQKVVDENLEKLKKVLEVYEARLTKCKYLAGDFLSLADLNHVSVTLCLFATPYA 111------------------------------3333----------------------- SVLDAYPHVKAWWSGLMERPSVQKVAALM ----------------------------- >ATRACOTOXIN-HVI; SWP:P56207; PDB:1AXH; SPTCIPSGQPCPYNENCCSQSCTFKENENGNTVKRCD -----2222---11111111----------------- >GROWTH HORMONE; SWP:P01241; PDB:1AXIA; TIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPSLCFSESIPTPSNR ------------------------------------------------1111-------- 
EETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEERIQTL --3333-------------11113333----------2222------------------- MGRLTGQIFKQTYSKFDTALLKNYGLLYCFRRDMTYVATYLRIVQCRSVEGSCGF -11113333---------------------------------------------- >Growth hormone receptor [; SWP:P10912; PDB:1AXIB; EPKFTKCRSPERETFSCHWTDEGPIQLFYTRRNEWKECPDYVSAGENSCYFNSSFTSIAI -----------------------------------------1111------3333----- PYCIKLTSNGGTVDEKCFSVDEIVQPDPPIALNWTLLNVSLTGIHADIQVRWEAPRNADI -------1111-------3333--------------------------------1111-1 QKGWMVLEYELQYKEVNETKWKMMDPILTTSVPVYSLKVDKEYEVRVRSKQRNSGNYGEF 111-----------1111-------------------1111---------2222------ SEVLYVTLPQM ----------- >ANNEXIN III; SWP:P12429; PDB:1AXN; SASIWVGHRGTVRDYPDFSPSVDAEAIQKAIRGIGTDEKMLISILTERSNAQRQLIVKEY --1111--------1111-------------------------1111------------- QAAYGKELKDDLKGDLSGHFEHLMVALVTPPAVFDAKQLKKSMKGAGTNEDALIEILTTR -------------------------1111----------1111----------------- TSRQMKDISQAYYTVYKKSLGDDISSETSGDFRKALLTLADGRRDESLKVDEHLAKQDAQ --------------------------------------1111------------------ ILYKAGENRWGTDEDKFTEILCLRSFPQLKLTFDEYRNISQKDIVDSIKGELSGHFEDLL -----1111--------------------------------------------------- LAIVNCVRNTPAFLAERLHRALKGIGTDEFTLNRIMVSRSEIDLLDIRTEFKKHYGYSLY --------------------------------------1111------------------ SAIKSDTSGDYEITLLKICGGDD ----------------------- >OXY-COPE CATALYTIC ANTIBO; SWP:GC1_HUMAN; PDB:1AXSH; QVQLLESGAELMKPGASVKISCKATGYTFSSFWIEWVKQRPGHGLEWIGEILP ---------------------------3333---------------------- >OXY-COPE CATALYTIC ANTIBO; SWP:GC1_HUMAN; PDB:1AXSL; ELVLTQSPSSMYASLGERVTITCKASQDINSYLNWFQQKPGKSPKTLIYRTNRLVDGVPS -------------2222-----------%%%%------2222----------------11 RFSGSGSGQDYSLTISSLEYEDMGIYYCLQYDEFPYTFGSGTKLEIKRTVAAPSVFIFPP 11----------------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNR 
---3333----------1111---------- >Igh protein; SWP:Q6PIP8; PDB:1AXTH; EVKLEESGGGLVQPGGSMKLSCVVSGLTFSRFWMSWVRQSPEKGLEWVAEIRLKS ------------2222-----------3333------------------------ >If kappa light chain [Fra; SWP:A2NHM3; PDB:1AXTL; ELVMTQTPLSLPVSLGDQASISCRSSQSLVHS -------------2222--------------- >TP7 FAB; SWP:NA; PDB:1AY1H; EVQLQESGPGLVKPYQSLSLSCTVTGYSITSDYAWNWIRQFPGNKLEWMGYITYSGTTDY ------------------------------------------------------------ NPSLKSRISITRDTSKNQFFLQLNSV -------------1111--------- >TYPE 4 PILIN; SWP:P02974; PDB:1AY2; FTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGKWPEN -----------------------------------------------------------3 NTSAGVASPPSDIKGKYVKEVEVKNGVVTATMLSSGVNNEIKGKKLSLWARRENGSVKWF 333-----3333-----------iiii----------3333------------------- CGQPVTRTDDDTVADAKDGKEIDTKHLPSTCRDNFDAK ----------------------3333-3333--1111- >n/a; SWP:P35235; PDB:1AYAA; MRRWFHPNITGVEAENLLLTRGVDGSFLARPSKSNPGDFTLSVRRNGAVTHIKIQNTGDY ----------------------------------2222---------------------- YDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKYPLN ---------------------2222--1111---------- >PHOSPHOENOLPYRUVATE CARBO; SWP:P22259; PDB:1AYL; MRVNNGLTPQELEAYGISDVHDIVYNPSYDLLYQEELDPSLTGYERGVLTNLGAVAVDTG -----------3333----------------------1111-3333---1111------- IFTGRSPKDKYIVRDDTTRDTFWWADKGKGKNDNKPLSPETWQHLKGLVTRQLSGKRLFV -----3333--------------1111--------------------------------- VDAFCGANPDTRLSVRFITEVAWQAHFVKNMFIRPSDEELAGFKPDFIVMNGAKCTNPQW -------3333------------------------3333------------3333-1111 KEQGLNSENFVAFNLTERMQLIGGTWYGGEMKKGMFSMMNYLLPLKGIASMHCSANVGEK 1111---------------------------------------1111----------111 GDVAVFFGLSGTGKTTLSTDPKRRLIGDDEHGWDDDGVFNFEGGCYAKTIKLSKEAEPEI 1-------22223333---1111----------1111---------------3333---- YNAIRRDALLENVTVREDGTIDFDDGSKTENTRVSYPIYHIDNIVKPVSKAGHATKVIFL 11112222-------1111--33333333------------------------------- TADAFGVLPPVSRLTADQTQYHFLSGFTAKLAPTPTFSACFGAAFLSLHPTQYAEVLVKR 
--1111-------------------------------22223333---3333-------- MQAAGAQAYLVNTGWNGTGKRISIKDTRAIIDAILNGSLDNAETFTLPMFNLAIPTELPG ---------------1111--------------11111111----------------222 VDTKILDPRNTYASPEQWQEKAETLAKLFIDNFDKYTDTPAGAALVAAGPKL 23333-3333-----------------------------------3333--- >Genome polyprotein; SWP:Q82122; PDB:1AYM1; NPVERYVDEVLNEVLVVPNINQSHPTTSNAAPVLDAAETGHTNKIQPEDTIETRYVQSSQ 3333--------------------------3333-3333------3333----------- TLDEMSVESFLGRSGCIHESVLDIVDNYNDQSFTKWNINLQEMAQIRRKFEMFTYARFDS -3333---------------------3333------------3333-------------- EITMVPSVAAKDGHIGHIVMQYMYVPPGAPIPTTRDDYAWQSGTNASVFWQHGQPFPRFS ---------1111--------------------11113333---------2222------ LPFLSIASAYYMFYDGYDGDTYKSRYGTVVTNDMGTLCSRIVTSEQLHKVKVVTRIYHKA --------------------1111--3333------------------------------ KHTKAWCPRPPRAVQYSHTHTTNYKLSSEVHNDVAIRPRTNLTTV -----------------------------1111-------1111- >Genome polyprotein [Fragm; SWP:P23008; PDB:1AYM2; SDRIIQITRGDSTITSQDVANAVVGYGVWPHYLTPQDATAIDKPTQPDTSSNRFYTLDSK 1111----!!!!-----------2222------3333---------!!!!---------- MWNSTSKGWWWKLPDALKDMGIFGENMFYHFLGRSGYTVHVQCNASKFHQGTLLVVMIPE --1111----------1111---------------------------------------- HQLATVNKGNVNAGYKYTHPGEAGREVGTQVENEKQPSDDNWLNFDGTLLGNLLIFPHQF -------!!!!--3333---3333--------1111---3333-----33333333---- INLRSNNSATLIVPYVNAVPMDSMVRHNNWSLVIIPVCQLQSNNISNIVPITVSISPMCA -3333-----------------3333-----------------3333------------- EFSGARAKTVVQ ------------ >Genome polyprotein; SWP:Q82122; PDB:1AYM3; GLPVYVTPGSGQFMTTDDMQSPCALPWYHPTKEIFIPGEVKNLIEMCQVDTLIPINSTQS ------2222---1111-------2222-------------33331111--------333 NIGNVSMYTVTLSPQTKLAEEIFAIKVDIASHPLATTLIGEIASYFTHWTGSLRFSFMFC 3----1111------------------1111--1111-----1111-------------- GTANTTLKVLLAYTPPGIGKPRSRKEAMLGTHVVWDVGLQSTVSLVVPWISASQYRFTTP -1111------------------------------------------------------- DTYSSAGYITCWYQTNFVVPPNTPNTAEMLCFVSGCKDFCLRMARDTDLHKQTGPITQ 
3333-------------------------------1111------------------- >Genome polyprotein [Fragm; SWP:P23008; PDB:1AYM4; GAQVSRQSLNYFNINYFKDAASSGASRLD ------------------3333------- >ALPHA-2-MACROGLOBULIN; SWP:Q7SIH1; PDB:1AYOA; EFPFALEVQTLPQTCDGPKAHTSFQISLSVSYIGSRPASNMAIVDVKMVSGFIPLKPTVK ----------------3333----------------------------2222--3333-- MLERSNVSRTEVSNNHVLIYLDKVTNETLTLTFTVLQDIPVRDLKPAIVKVYDYYETDEF ------------%%%%---------------------------------------1111- AVAEYSAPCS ------1111 >UBIQUITIN-CONJUGATING ENZ; SWP:P06104; PDB:1AYZA; STPARRRLMRDFKRMKEDAPPGVSASPLPDNVMVWNAMIIGPADTPYEDGTFRLLLEFDE -------------------2222----1111-------------1111----------11 EYPNKPPHVKFLSEMFHPNVYANGEICLDILQNRWTPTYDVASILTSIQSLFNDPNPASP 11--------------11111111---333311111111---------3333-------- ANVEAATLFKDHKSQYVKRVKETVEKSWEDDMD --------------------------------- >SIV PROTEASE; SWP:P05896; PDB:1AZ5; PQFHLWKRPVVTAHIEGQPVEVLLDTGADDSIVTGIELGPHYTPKIVGFINTKEYKNVEV --------------iiii------1111-------------------------------- EVLGKRIKGTIMTGDTPINIFGRNLLTALGMSLNF -iiii---------------------1111----- >CELLOBIOHYDROLASE I; SWP:P62694; PDB:1AZ6; TQSHAGQCGGIGYSGPTVCASGTTCQVLNPYYSQCL ---------2222---------------1111---- >MUTH; SWP:P06722; PDB:1AZO; PRPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGAPE ------------------1111--------1111-------------------1111--- QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 3333-----------1111-------------------3333----1111---------3 ASIPLAQRRVGSPLLWSPNEEEDRQLREDWEELDIVLGQVERITARHGEYLQIRPLTEAI 3333333---------------------------11113333-1111------------- GARGERILTLPRGFYLKKNFTSALLARHFLIQ 1111----------------------1111-- >VC1; SWP:P30803; PDB:1AZSA; DMMFHKIYIQKHDNVSILFADIEGFTSLASQCTAQELVMTLNELFARFDKLAAENHCLRI ---------------------------3333-3333------------------------ KILGDCYYCVSGLPEARADHAHCCVEMGMDMIEAISLVREMTGVNVNMRVGIHSGRVHCG --!!!!---------------------------------3333----------------- VLGLRKWQFDVWSNDVTLANHMEAGGKAGRIHITKATLSYLNGDYEVEPGCGGERNAYLK 
--------------------------2222-------1111---------3333-33331 EHSIETFLIL 111------- >Guanine nucleotide-bindin; SWP:P04896; PDB:1AZSC; VYRATHRLLLLGAGESGKSTIVKQMRILHVNGEKATKVQDIKNNLKEAIETIVAAMSNLV 1111--------22223333------------3333------------------------ PPVELANPENQFRVDYILSVMNVPDFDFPPEFYEHAKALWEDEGVRACYERSNEYQLIDC ------3333-------1111-------3333------------------3333---111 AQYFLDKIDVIKQDDYVPSDQDLLRCRVLTSGIFETKFQVDKVNFHMFDVGGQRDERRKW 1------3333-1111-----------------------%%%%---------33331111 IQCFNDVTAIIFVVASSSYNMVIREDNQTNRLQEALNLFKSIWNNRWLRTISVILFLNKQ --------------3333-----------------------1111--3333--------- DLLAEKVLAGKSKIEDYFPEFARYTTPEDATPEPGEDPRVTRAKYFIRDEFLRISTASGD ------------3333---3333-----------------------------------%% GRHYCYPHFTCAVDTENIRRVFNDCRDIIQRMHLRQYEL %%--------3333------------3333---1111-- >PROLINE IMINOPEPTIDASE; SWP:P52279; PDB:1AZWA; MRTLYPEITPYQQGSLKVDDRHTLYFEQCGNPHGKPVVMLHGGPGGGCNDKMRRFHDPAK ------------------------------1111--------------3333-------- YRIVLFDQRGSGRSTPHADLVDNTTWDLVADIERLRTHLGVDRWQVFGGSWGSTLALAYA -------2222-------------------------1111-------------------- QTHPQQVTELVLRGIFLLRRFELEWFYQEGASRLFPDAWEHYLNAIPPVERADLMSAFHR --3333-------------------------33333333---33333333---------- RLTSDDEATRLAAAKAWSVWEGATSFLHVDEDFVTGHEDAHFALAFARIENHYFVNGGFF --------------------1111----------11113333-----------1111--- EVEDQLLRDAHRIADIPGVIVHGRYDVVCPLQSAWDLHKAWPKAQLQISPASGHSAFEPE -1111-----1111--------------------------3333---------------- NVDALVRATDGFA ------------- >COLLAGENASE; SWP:P00771; PDB:1AZZA; IVGGVEAVPNSWPHQAALFIDDMYFCGGSLISPEWILTAAHCMDGAFVDVVLGAHNIRED -------22221111----------------1111---3333-------------3333- EATQVTIQSTDFTVHENYNSFVISNDIAVIRLPVPVTLTAAIATVGLPSTDVGVGTVVTP 1111----------1111--------------------1111------------------ TGWGLPSDSALGISDVLRQVDVPIMSNADCDAVYGIVTDGNICIDSTGGKGTCNGDSGGP ------1111-------------------3333----1111----2222------2222- LNYNGLTYGITSFGAAAGCEAGYPDAFTRVTYFLDWIQTQTGITP 
--iiii-----------1111-------3333------------- >DNA LIGASE; SWP:O87703; PDB:1B04A; DRQQAERRAAELRELLNRYGYEYYVLDRPSVPDAEYDRLMQELIAIEEQYPELKTSDSPT 3333---------------------------------------------1111------- QRIGGPPLEAFRKVAHRVPMMSLANAFGEGDLRDFDRRVRQEVGEAAYVCELAIDGLAVS 3333-------------------------------------------------------- VRYEDGYFVQGATRGDGTTGEDITENLKTIRSLPLRLKEPVSLEARGEAFMPKASFLRLN -------------!!!!------3333--3333--------------------------- EERKARELFANPRNAAAGSLRQLDPKVAASRQLDLFVYGLADAEALGIASHSEALDYLQA ------------------1111-33331111----------3333----3333-----11 LGFKVNPERRRCANIDEVIAFVSEWHDKRPQLPYEIDGIVIKVDSFAQQRALGATAKSPR 11--------------------------1111---------------------------- WAIAYKFPAE ---------- >SUPEROXIDE DISMUTASE; SWP:Q08713; PDB:1B06A; VIQLKRYEFPQLPYKVDALEPYISKDIIDVHYNGHHKGYVNGANSLLDRLEKLIKGDLPQ --------------1111----------------------------------------22 GQYDLQGILRGLTFNINGHKLHAIYWNNMAPAGKGGGKPGGALADLIDKQYGSFDRFKQV 22-3333------------------1111--3333------------------------- FSESANSLPGSGWTVLYYDNESGNLQIMTVENHFMNHIAELPVILIVDEFEHAYYLQYKN ----1111-----------------------------2222--------3333----!!! 
KRGDYLNAWWNVVNWDDAEKRLQKYLNK !---------------------1111-- >C-REACTIVE PROTEIN; SWP:P02741; PDB:1B09A; QTDMSRKAFVFPKESDTSYVSLKAPLTKPLKAFTVCLHFYTELSSTRGYSIFSYATKRQD ---2222----------------------------------3333----------1111- NEILIFWSKDIGYSFTVGGSEILFEVPEVTVAPVHICTSWESASGIVEFWVDGKPRVRKS -------2222-----iiii------------------------------iiii------ LKKGYTVGAEASIILGQEQDSFGGNFEGSQSLVGDIGNVNMWDFVLSPDEINTIYLGGPF -2222---------------2222--1111------------------------------ SPNVLNWRALKYEVQGEVFTKPQLWP -----1111----------------- >FOLD BIFUNCTIONAL PROTEIN; SWP:P24186; PDB:1B0AA; AAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAVVLVGSNPASQIYVASKRKACEEV -------------------------1111----------------------------111 GFVSRSYDLPETTSEAELLELIDTLNADNTIDGILVQLPLPAGIDNVKVLERIHPDKDVD 1--------11113333----------3333-----------------1111-3333111 GFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGLNAVVIGASNIVGRPMSMELLL 1----------------------------1111--2222-------3333---------- AGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPGDWIKEGAIVIDVGINRLENGKVV --------1111-3333-1111--------2222-3333-2222---------1111--- GDVVFEDAAKRASYITPVPGGVGPMTVATLIENTLQACVEYHDPQDE -----3333------------3333---------------------- >HEMOGLOBIN; SWP:P41260; PDB:1B0B; LSAAQKDNVKSSWAKASAAWGTAGPEFFMALFDAHDDVFAKFSGLFSGAAKGTVKNTPEM ------------------3333---------------------1111--11111111--- AAQAQSFKGLVSNWVDNLDNAGALEGQCKTFAANHKARGISAGQLEAAFKVLAGFMKSYG ----------------1111---------------1111---------------3333-- GDEGAWTAVAGALMGMIRPDM ----------------3333- >SINR PROTEIN; SWP:P06533; PDB:1B0NA; MIGQRIKQYRKEKGYSLSELAEKAGVAKSYLSSIERNLQTNPSIQFLEKVSAVLDVSVHT -3333-----1111---------------------------------------------- LLDEKHETLDSEWEKLVRDAMTSGVSKKQFREFLDYQKWRKSQ ---1111--------------------------------1111 >Protein sinI; SWP:P23308; PDB:1B0NB; FELDQEWVELMVEAKEANISPEEIRKYLLLN --------------1111------------- >HISTIDINE PERMEASE; SWP:P02915; PDB:1B0UA; NKLHVIDLHKRYGGHEVLKGVSLQARAGDVISIIGSSGSGKSTFLRCINFLEKPSEGAII -----------!!!!----------2222------------------------------- 
VNGQNINLVRDKDGQLKVADKNQLRLLRTRLTMVFQHFNLWSHMTVLENVMEAPIQVLGL iiii------1111-----3333-----------------1111---------------- SKHDARERALKYLAKVGIDERAQGKYPVHLSGGQQQRVSIARALAMEPDVLLFDEPTSAL -------------1111-3333---3333--------------1111-------1111-- DPELVGEVLRIMQQLAEEGKTMVVVTHEMGFARHVSSHVIFLHQGKIEEEGDPEQVFGNP 3333-----------1111-----------------------iiii-----3333----- QSPRLQQFLKGSLKKLEH ----------3333---- >BENCE-JONES KAPPA I PROTE; SWP:P01594; PDB:1B0WA; DIQMTQSPSSLSASVGDRVTITCQASQDISDYLIWYQQKLGKAPNLLIYDASTLETGVPS -------------2222-----------iiii------2222------------222211 RFSGSGSGTEYTFTISSLQPEDIATYYCQQYDDLPYTFGQGTKVEIKR 11----------------1111-------------------------- >EPHA4 RECEPTOR TYROSINE K; SWP:Q03137; PDB:1B0XA; FSAVVSVGDWLQAIKMDRYKDNFTAAGYTTLEAVVHMSQDDLARIGITAITHQNKILSSV -----------11113333---3333-------1111-----1111-------------- QAMRTQMQQMHG ------------ >HIPIP; SWP:P00260; PDB:1B0YA; SAPANAVAADNATAIALKYNQDATKSERVAAARPGLPPEEQQCANCQFMQADAAGATDEW --1111-1111----------3333-3333------1111-33331111---22221111 KGCQLFPGKLINVNGWCASWTLKAG --1111----------1111----- >PRION PROTEIN; SWP:P04273; PDB:1B10A; LGGYMLGSAMSRPMMHFGNDWEDRYYRENMNRYPNQVYYRPVDQYNNQNNFVHDCVNITI -------------------3333------3333--------------------------- KQHTVTTTTKGENFTETDIKIMERVVEQMCTTQYQKESQAYYDG --------------3333-------------------------- >SIGNAL PEPTIDASE I; SWP:P00803; PDB:1B12A; RSFIYEPFQIPSGSMMPTLLIGDFILVEKFAYGIKDPIYQKTLIETGHPKRGDIVVFKYP ------------1111---2222--------------------------2222-----11 EDPKLDYIKRAVGLPGDKVTYDPVSKELTIQPGCSSGQACENALPVTYSNVEPSDFVQTF 11-----------2222------------------------------------------- SRRNGGEATSGFFEVPKNETKENGIRLSERKETLGDVTHRILTVPIAQDQVGMYYQQPGQ ---------------1111------------------------1111--3333---2222 QLATWIVPPGQYFMMGDNRDNSADSRYWGFVPEANLVGRATAIWMSFDGLRLSRIGGIH 2222---2222------1111--3333----3333---------------3333----- >ALPHA-AMYLASE/TRYPSIN INH; SWP:P01087; PDB:1B1UA; GTSCIPGMAIPHNPLDSCRWYVSTRTCGVGPRLATQEMKARCCRQLEAIPAYCRCEAVRI 
--------------3333-------------------------------3333------- LMDGVVTPSGQHEGRLLQDLPGCPRQVQRAFAPKLVTEVECNLATIHGGPFCLSLLG ------1111---------2222----3333-------------3333--------- >LACTOFERRIN; SWP:O77811; PDB:1B1XA; APRKSVRWCTISPAEAAKCAKFQRNMKKVRGPSVSCIRKTSSFECIQAIAANKADAVTLD -----------3333-----------1111----------3333----1111-------3 GGLVYEAGLHPYKLRPVAAEVYQTRGKPQTRYYAVAVVKKGSGFQLNQLQGVKSCHTGLG 333-----------------------------------------11112222-----222 RSAGWNIPIGTLRPYLNWTGPPEPLQKAVANFFSASCVPCADGKQYPNLCRLCAGTEADK 21111------3333--------33333333------------------1111-----22 CACSSQEPYFGYSGAFKCLENGAGDVAFVKDSTVFENLPDEAERDKYELLCPDNTRKPVD 22----1111--------1111-------33333333------1111----------111 AFKECHLARVPSHAVVARSVDGREDLIWKLLHRAQEEFGRNKSSAFQLFGSTPGEQDLLF 11111----------------------------------------------3333----- KDSALGFVRIPSQIDSGLYLGANYLTATQNLRETAAEVAARRERVVWCAVGPEEERKCKQ 2222------3333-------------------3333----------------------- WSDVSNRKVACASASTTEECIALVLKGEADALNLDGGFIYVAGKCGLVPVLAENQKSQNS --1111----------------------------3333----1111-------------- NAPDCVHRPPEGYLAVAVVRKSDADLTWNSLSGKKSCHTGVGRTAAWNIPMGLLFNQTGS ---3333-------------------11112222-----2222----------------- CKFDKFFSQSCAPGADPQSSLCALCVGNNENENKCMPNSEERYYGYTGAFRCLAEKAGDV -1111------22221111--1111--1111-2222-3333------------------- AFVKDVTVLQNTDGKNSEPWAKDLKQEDFELLCLDGTRKPVAEAESCHLARAPNHAVVSQ ---33331111%%%%--3333---3333-----------11111111------------3 SDRAQHLKKVLFLQQDQFGGNGPDCPGKFCLFKSETKNLLFNDNTECLAELQGKTTYEQY 333------------------1111--------%%%%----1111--------------- LGSEYVTSITNLRRCSSSPLLEACAFLRA -----------3333----------1111 >BETA-AMYLASE; SWP:P16098; PDB:1B1YA; MKGNYVQVYVMLPLDAVSVNNRFEKGDELRAQLRKLVEAGVDGVMVDVWWGLVEGKGPKA 3333------------------------------1111----------3333-3333--- YDWSAYKQLFELVQKAGLKLQAIMSFHQCGGNVGDAVNIPIPQWVRDVGTRDPDIFYTDG --3333-------------------------2222----------------1111---33 HGTRNIEYLTLGVDNQPLFHGRSAVQMYADYMTSFRENMKDFLDAGVIVDIEVGLGPAGE 
33-------3333--------------------------3333------------2222- LRYPSYPQSHGWSFPGIGEFICYDKYLQADFKAAAAAVGHPEWEFPNDAGQYNDTPERTQ ------3333-------------3333--------11113333-------11113333-- FFRDNGTYLSEKGRFFLAWYSNNLIKHGDRILDEANKVFLGYKVQLAIKIAGVHWWYKVP -----------------------------------------------------2222--- SHAAELTAGYYNLHDRDGYRTIARMLKRHRASINFTCAEMRDSEQPPDAMSAPEELVQQV --3333------1111----3333--1111------11113333-1111----------- LSAGWREGLNVSCENALPRYDPTAYNTILRNARPHGINQSGPPEHKLFGFTYLRLSNQLV --------------------------------1111-1111--------------3333- EGQNYVNFKTFVDRMHANLPRDPYVDPMAPLPRSGPEISIEMILQAAQPKIQPFPFQEHT --------------------------------------3333------------------ DLPVGPTGGMGGQAEGPTCG -------------------- >DNA REPAIR PROTEIN RAD51; SWP:Q06609; PDB:1B22A; EEESFGPQPISRLEQCGINANDVKKLEEAGFHTVEAVAYAPKKELINIKGISEAKADKIL --------------------------------3333-------3333--------3333- AEAAKLVPMG ---------- >I-DMOI; SWP:P21505; PDB:1B24A; VSGISAYLLGLIIGDGGLYKLKYKGNRSEYRVVITQKSENLIKQHIAPLQFLIDELNVKS -----------------------!!!!----------3333------------------- KIQIVKGDTRYELRVSSKKLYYYFANLERIRLFNREQIAFIKGLYVAEGDKTLKRLRIWN ----------------------------3333-----------------3333------- KNKALLEIVSRWLNNLGVRNTIHLDDHRHGVYVLNISLRDRIKFVHTILS -3333--------1111---------1111------1111---------- >FORMALDEHYDE FERREDOXIN O; SWP:O93738; PDB:1B25A; MYGWWGRILRVNLTTGEVKVQEYPEEVAKKFIGGRGLAAWILWNEARGVEPLSPENKLIF -----------------------3333----------------------1111------- AAGPFNGLPTPSGGKLVVAAKSPLTGGYGDGNLGTMASVHLRRAGYDALVVEGKAKKPVY --3333-----------------------------------1111--------------- IYIEDDNVSILSAEGLWGKTTFETERELKEIHGKNVGVLTIGPAGENLVKYAVVISQEGR ---!!!!-----3333----------------------------1111------------ AAGRPGMGAVMGSKKLKAVVIRGTKEIPVADKEELKKLSQEAYNEILNSPGYPFWKRQGT --1111------------------------------------------1111-------- MAAVEWCNTNYALPTRNFSDGYFEFARSIDGYTMEGMKVQQRGCPYCNMPCGNVVLDAEG -------1111----%%%%---------------1111-----2222---------1111 
QESELDYENVALLGSNLGIGKLNEVSVLNRIADEMGMDTISLGVSIAHVMEAVERGILKE --------------1111--3333------------------------------------ GPTFGDFKGAKQLALDIAYRKGELGNLAAEGVKAMAEKLGTHDFAMHVKGLEVSGYNCYI --2222----------1111--------------------3333---iiii------111 YPAMALAYGTSAIGAHHKEAWVIAWEIGTAPIEYKISYDPIKAQKVVELQRLRGGLFEML 1---------1111--3333---------3333----------------------1111- TACRLPWVEVGLSLDYYPKLLKAITGVTYTWDDLYKAADRVYSLIRAYWVREFNGKWDRK ------------3333------------------------------------iiii--33 MDYPPKRWFTEGLKSGPHKGEHLDEKKYDELLSEYYRIRGWDERGIPKKETLKELDLDFV 33--3333-------1111---------------------------------11113333 IPELEKVTNLE ----------- >LECTIN; SWP:Q9ZP49; PDB:1B2PA; NNIIFSKQPDDNHPQILHATESLEILFGTHVYRFIMQTDCNLVLYDNNNPIWATNTGGLG -----------------2222-----!!!!------1111-----!!!!------2222- NGCRAVLQPDGVLVVITNENVTVWQSPVAGKAGHYVLVLQPDRNVVIYGDALWATQTVR ---------------------------------------1111---------------- >ANTIBODY (LIGHT CHAIN); SWP:NA; PDB:1B2WH; VQLVQSGGGVVQPGRSLKLSCLASGYIFTSSWINWVKQRPGRGLEWIGRIDPSDGEVHYN -----------------------3333--------------------------------- QDFKDRFTISRDKSKNTLYLQMNSLRPEDTAVYYCARGFLPWFADWGQGTLVTVSSASTK -3333-------1111---------3333------------------------------- GPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS -------------------------------------%%%%------------3333--- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -------3333------------1111------------ >ALPHA-AMYLASE; SWP:P04746; PDB:1B2YA; YSPNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPPNENVAIYNPFRPWWER -----2222-----2222-------------1111--------------------1111- YQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGNAVSAGTSSTCGSYFNP ---------3333------------1111-------------1111-----1111----1 GSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLTGLLDLALEKDYVRSKIAE 111-------1111-------3333---1111--------iiii---1111--------- YMNHLIDIGVAGFRLDASKHMWPGDIKAILDKLHNLNSNWFPAGSKPFIYQEVIDLGGEP ----------------3333-3333----1111---3333-2222--------------- IKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKNWGEGWGFVPSDRALVFVDNHDN 
-33333333----3333----------%%%%--1111--3333---1111------3333 QRGHGAGGASILTFWDARLYKMAVGFMLAHPYGFTRVMSSYRWPRQFQNGNDVNDWVGPP -------3333-3333-------------------------------%%%%1111----- NNNGVIKEVTINPDTTCGNDWVCEHRWRQIRNMVIFRNVVDGQPFTNWYDNGSNQVAFGR -iiii------1111--iiii-3333-3333--------2222----------------- GNRGFIVFNNDDWSFSLTLQTGLPAGTYCDVISGDKINGNCTGIKIYVSDDGKAHFSISN ------------------------------------------------1111------11 SAEDPFIAIHAESKL 11-------1111-- >XYLANASE; SWP:P56588; PDB:1B30A; ASVSIDAKFKAHGKKYLGTIGDQYTLTKNTKNPAIIKADFGQLTPENSMKWDATEPNRGQ ---------1111--------3333---------------------11111111--2222 FTFSGSDYLVNFAQSNGKLIRGHTLVWHSQLPGWVSSITDKNTLISVLKNHITTVMTRYK -------------------------------3333---------------------1111 GKIYAWDVLNEIFNEDGSLRNSVFYNVIGEDYVRIAFETARSVDPNAKLYINDYNLDSAG -------------1111----3333------------------1111----------222 YSKVNGMVSHVKKWLAAGIPIDGIGSQTHLGAGAGSAVAGALNALASAGTKEIAITELDI 2-------------1111------------22221111------1111------------ AGASSTDYVNVVNACLNQAKCVGITVWGVADPDSWRSSSSPLLFDGNYNPKAAYNAIANA --------------1111-----------3333--3333-----1111---------111 L 1 >ALLOPHYCOCYANIN, ALPHA CH; SWP:P00315; PDB:1B33A; SIVTKSIVNADAEARYLSPGELDRIKSFVSSGEKRLRIAQILTDNRERIVKQAGDQLFQK ----------1111---------------------------------------------- RPDVVSPGGNAYGQEMTATCLRDLDYYLRLITYGIVAGDVTPIEEIGIVGVREMYKSLGT 3333-2222--------------------------------------2222--------- PIDAVAAGVSAMKNVASSILSAEDAAEAGAYFDYVAGALA 3333-----------1111--------------------- >Phycobilisome 7.8 kDa lin; SWP:P20116; PDB:1B33N; GRLFKITACVPSQTRIRTQRELQNTYFTKLVPYENWFREQQRIQKMGGKIVKVELATGKQ --------------------3333-------1111--------1111-------1111-- GINTGLA ------- >SMALL NUCLEAR RIBONUCLEOP; SWP:P13641; PDB:1B34A; KLVRFLMKLSHETVTIELKNGTQVHGTITGVDVSMNTHLKAVKMTLKNREPVQLETLSIR --------2222-----1111----------1111------------------------3 GNNIRYFILPDSLPLDTLLV 333------11113333--- >SMALL NUCLEAR RIBONUCLEOP; SWP:P43330; PDB:1B34B; TGPLSVLTQSVKNNTQVLINCRNNKKLLGRVKAFDRHCNMVLENVKEMDRYISKMFLRGD 
----------------------------------1111-------------------333 SVIVVLRNPLIAGK 3------------- >CRICKET PARALYSIS VIRUS, ; SWP:P13418; PDB:1B35A; VMGEDQQIPRNEAQHGVHPISIDTHRISNNWSPQAMCIGEKVVSIRQLIKRFGIFGDANT --------33331111----3333------3333---------33331111--------- LQADGSSFVVAPFTVTSPTKTLTSTRNYTQFDYYYYLYAFWRGSMRIKMVAETQDGTGTP ----------1111--------------33333333-------------------2222- RKKTNFTWFVRMFNSLQDSFNSLISTSSSAVTTTVLPSGTINMGPSTQVIDPTVEGLIEV ----------------333311111111-----------1111-------3333------ EVPYYNISHITPAVTIDDGTPSMEDYLKGHSPPCLLTFSPRDSISATNHIITASFMRALG -----------------------------------------------------------1 DDFSFMYLLGVPPLVNVARA 111----------------- >Genome polyprotein [Fragm; SWP:P13418; PDB:1B35B; ENSHIENEDKRLTSEQKEIVHFVSEGVTPSTTALPDIVNLSTNYLDKNTREDRIHSIKDF ------1111------!!!!--------------------3333---------------1 LSRPIIIATNLWSVSDPVEKQLYTANFPEVLISNAMYQDKLKGFVGLRATLVVKVQVNSQ 111---------11112222----------1111------2222---------------1 PFQQGRLMLQYIPYAQYMPNRVTLINETLQGRSGCPRTDLELSVGTEVEMRIPYVSPHLY 111----------3333--------------1111-----1111---------------- YNLITGQGSFGSIYVVVYSQLHDQVSGTGSIEYTVWAHLEDVDVQYPTGANIFTGNEAYI ------------------------------------------------------------ KGTSRYDAAQKAHAA ----3333------- >Genome polyprotein [Fragm; SWP:P13418; PDB:1B35C; SKPTVQGKIGECKLRGQGRMANFDGMDMSHKMALSSTNEIETNEGLAGTSLDVMDLSRVL ------------------1111--------------------2222--------333311 SIPNYWDRFTWKTSDVINTVLWDNYVSPFKVKPYSATITDRFRCTHMGKVANAFTYWRGS 11---------11112222---------------1111---------------------- MVYTFKFVKTQYHSGRLRISFIPYYYNTTISTGTPDVSRTQKIVVDLRTSTAVSFTVPYI --------------------------3333-----3333--------------------- GSRPWLYCIRPESSWLSKDNTDGALMYNCVSGIVRVEVLNQLVAAQNVFSEIDVICEVNG ---------1111-------22221111----------------1111------------ GPDLEFAGPTCPRYVPYAGDFTLADTRKIEAERTQEYSNNED 1111-----------------3333----------------- >Genome polyprotein [Fragm; SWP:P13418; PDB:1B35D; AASELKQLETNNSPSTALGQISEGLTTLSHIPVLGNIFSTPAWISAKAADLAKLFGF 
--------------------3333--1111-----1111-1111------------- >NEUROTOXIN CSE-I; SWP:P01491; PDB:1B3CA; KDGYLVEKTGCKKTCYKLGENDFCNRECKWKHIGGSYGYCYGFGCYCEGLPDSTQTWPLP ------1111----------3333----------------%%%%------3333------ NKTC ---- >PLASTOCYANIN; SWP:P50057; PDB:1B3IA; ASVQIKMGTDKYAPLYEPKALSISAGDTVEFVMNKVGPHNVIFDKVPAGESAPALSNTKL ------------------------------------------------------------ AIAPGSFYSVTLGTPGTYSFYCTPHRGAGMVGTITVE -------------------------3333-------- >MHC CLASS I HOMOLOG MIC-A; SWP:Q29983; PDB:1B3JA; EPHSLRYNLTVLSWDGSVQSGFLTEVHLDGQPFLRCDRQKCRAKPQGQWAEDVLGNKTWD ---------------------------%%%%-------------3333------------ RETRDLTGNGKDLRMTLAHIKDQKEGLHSLQEIRVCEIHEDNSTRSSQHFYYDGELFLSQ ----------------3333------------------3333---------iiii----- NLETKEWTMPQSSRAQTLAMNVRNFLKEDAMADCLQELRRYLKSGVVLRRTVPPMVNVTR ---------------------------3333----------1111--------------- SEASEGNITVTCRASGFYPWNITLSWRQDGVSLSHDTQQWGDVLPDGNGTYQTWVATRIC ------------------------------------------------------------ QGEEQRFTCYMEHSGNHSTHPVPS --3333------------------ >INOSINE MONOPHOSPHATE DEH; SWP:P12268; PDB:1B3OA; TSYVPDDGLTAQQLFNCGDGLTYNDFLILPGYIDFTADQVDLTSALTKKITLKTPLVSSP -----------3333------1111----------1111--------------------- MDTVTEAGMAIAMALTGGIGFIHHNCTPEFQANEVRKVKKDYPLASKDAKKQLLCGAAIG 1111----------------------3333-----------1111--------------- THEDDKYRLDLLAQAGVDVVVLDSSQGNSIFQINMIKYIKDKYPNLQVIGGNVVTAAQAK ------------------------------------------1111--------3333-- NLIDAGVDALRVGMGSRPQATAVYKVSEYARRFGVPVIADGGIQNVGHIAKALALGASTV ----------------------------3333------------3333----1111---- MMGSLLAATTEAPGEYDKGSIHKFVPYLIAGIQHSCQDIGAKSLTQVRAMMYSGELKFEK --3333--3333-------1111-------------1111-------------------- RTSSAQV -1111-- >CHEMOTAXIS PROTEIN CHEA; SWP:Q56310; PDB:1B3QA; SQTVRVDIEKLDNLMDLMGELVIARSRILETLKKYNIKELDESLSHLSRITLDLQNVVMK ------------------------------------------------------------ IRMVPISFVFNRFPRMVRDLAKKMNKEVNFIMRGEDTELDRTFVEEIGEPLLHLLRNAID 
----33333333---------1111--------1111--3333----------------- HGIEPKEERIAKGKPPIGTLILSARHEGNNVVIEVEDDGRGIDKEKIIRKAIEKGLIDES ---------1111---------------------------------------------11 KAATLSDQEILNFLFVPGFSGVGMDVVKNVVESLNGSMGIESEKDKGTKVTIRLPLTLAI 11---1111-3333-1111------------1111-------2222-------------- ICALLVKVNNLVYAIPIANIDTILSISKEDIQRVQDRDVIVIRGEVIPVYRLWEVLQIEH -------iiii----3333-------3333-----------iiii-----3333------ KEELEEMEAVIVRVGNRKYGIVVDDLLGQDDIVIKSLGKVFSEVKEFSGAAILGDGSIAL ------------------------------------33331111--------3333---- IINVSGIV --3333-- >NUCLEAR PROTEIN EBNA1; SWP:Q69477; PDB:1B3TA; KGGWFGKHRGQGGSNPKFENIAEGLRALLARSHVERTTDEGTWVAGVFVYGGSKTSLYNL -------2222----------------3333------3333------------------- RRGTALAIPQCRLTPLSRLPFGMAPGPGPQPGPLRESIVCYFMVFLQTHIFAEVLKDAIK -------1111--------------------1111------------------------- DLVMTKPAPTCNIRVTVCSFDDGVDLP --1111--3333-------1111---- >PROTEIN PHOSPHATASE PP2A; SWP:P30153; PDB:1B3UA; AAADGDDSLYPIAVLIDELRNEDVQLRLNSIKKLSTIALALGVERTRSELLPFLTDTIYD ---1111----------1111-3333-------------------------3333----- EDEVLLALAEQLGTFTTLVGGPEYVHCLLPPLESLATVEETVVRDKAVESLRAISHEHSP --------------3333--3333---------------3333----------3333-33 SDLEAHFVPLVKRLAGGDWFTSRTSACGLFSVCYPRVSSAVKAELRQYFRNLCSDDTPMV 33-----------1111-------3333-33331111--------------1111----- RRAAASKLGEFAKVLELDNVKSEIIPMFSNLASDEQDSVRLLAVEACVNIAQLLPQEDLE -----------11113333-----------1111-3333----------3333-3333-- ALVMPTLRQAAEDKSWRVRYMVADKFTELQKAVGPEITKTDLVPAFQNLMKDCEAEVRAA ---------1111-----------------------------------1111-------- ASHKVKEFCENLSADCRENVIMSQILPCIKELVSDANQHVKSALASVIMGLSPILGKDNT --------11113333--------------3333----------------3333------ IEHLLPLFLAQLKDECPEVRLNIISNLDCVNEVIGIRQLSQSLLPAIVELAEDAKWRVRL ----------1111-----------------------------------1111------- AIIEYMPLLAGQLGVEFFDEKLNSLCMAWLVDHVYAIREAATSNLKKLVEKFGKEWAHAT -------------3333---------3333---3333----------------------- 
IIPKVLAMSGDPNYLHRMTTLFCINVLSEVCGQDITTKHMLPTVLRMAGDPVANVRFNVA -----3333----------------3333-----------------1111---------- KSLQKIGPILDNSTLQSEVKPILEKLTQDQDVDVKYFAQEALTVLSLA -----3333----------------1111-------------1111-- >ACETYLCHOLINESTERASE; SWP:P22303; PDB:1B41A; DAELLVTVRGGRLRGIRLKTPGGPVSAFLGIPFAEPPMGPRRFLPPEPKQPWSGVVDATT -1111--3333---------------------------1111------------------ FQSVCYQYVDTLYPGFEGTEMWNPNRELSEDCLYLNVWTPYPRPTSPTPVLVWIYGGGFY ------------2222--3333----------------------------------iiii SGASSLDVYDGRFLVQAERTVLVSMNYRVGAFGFLALPGSREAPGNVGLLDQRLALQWVQ --11111111-------------------------------------------------- ENVAAFGGDPTSVTLFGESAGAASVGMHLLSPPSRGLFHRAVLQSGAPNGPWATVGMGEA -3333---1111------------------3333------------1111---------- RRRATQLAHLVGCPNDTELVACLRTRPAQVLVNHEWHVLPQESVFRFSFVPVVDGDFLSD --------1111----------11113333---3333----------------------- TPEALINAGDFHGLQVLVGVVKDEGSYFLVYGAPGFSKDNESLISRAEFLAGVRVGVPQV -------------------------3333---2222-----------------1111--- SDLAAEAVVLHYTDWLHPEDPARLREALSDVVGDHNVVCPVAQLAGRLAAQGARVYAYVF ----------------1111---------------------------------------- EHRASTLSWPLWMGVPHGYEIEFIFGIPLDPSRNYTAEEKIFAQRLMRYWANFARTGDPN ---1111--3333--2222---1111---3333--3333--------------------- EPPKAPQWPPYTAGAQQYVSLDLRPLEVRRGLRAQACAFWNRFLPKLLSAT --------------------------------3333--------------- >ACETYLCHOLINESTERASE; SWP:P01403; PDB:1B41B; TMCYSHTTTSRAILTNCGENSCYRKSRRHPPKMVLGRGCGCPPGDDNLEVKCCTSPDKCN --------------------------------------------1111------------ Y - >FEN-1; SWP:O93634; PDB:1B43A; GVPIGEIIPRKEIELENLYGKKIAIDALNAIYQFLSTIRQKDGTPLMDSKGRITSHLSGL ---3333------33332222------------------1111----1111--------- FYRTINLMEAGIKPVYVFDGEPPEFKKKELEKRREAREEAEEKWREALEKGEIEEARKYA ------------------------------------3333------------------33 QRATRVNEMLIEDAKKLLELMGIPIVQAPSEGEAQAAYMAAKGSVYASASQDYDSLLFGA 3311113333--------------------3333------------------3333---- PRLVRNLTITGKRKLPGKNVYVEIKPELIILEEVLKELKLTREKLIELAILVGTDYNPGG 
-----1111-----2222-----------------------------------1111--- IKGIGLKKALEIVRHSKDPLAKFQKQSDVDLYAIKEFFLNPPVTDNYNLVWRDPDEEGIL 2222-------------3333-3333---3333--------------------------- KFLCDEHDFSEERVKNGLERLKKAIKSGKQSTLESWFKR ----------------------------3333------- >GLUTATHIONE S-TRANSFERASE; SWP:P24472; PDB:1B48A; AAKPKLYYFNGRGRMESIRWLLAAAGVEFEEEFLETREQYEKMQKDGHLLFGQVPLVEID --------------3333----3333-----------------1111-----------ii GMMLTQTRAILSYLAAKYNLYGKDLKERVRIDMYADGTQDLMMMIAVAPFKTPKEKEESY ii--------------------------------------------3333-3333----- DLILSRAKTRYFPVFEKILKDHGEAFLVGNQLSWADIQLLEAILMVEELSAPVLSDFPLL --------------------------------3333-------------11113333--- QAFKTRISNIPTIKKFLQPGSQRKPPPDGPYVEVVRIVLKF -----------------1111---------------1111- >ARGININE REPRESSOR; SWP:O31408; PDB:1B4BA; ALVDVFIKLDGTGNLLVLRTLPGNAHAIGVLLDNLDWDEIVGTICGDDTCLIICRTPKDA 3333-------!!!!-----2222-------3333-3333-------------------- KKVSNQLLSML -------1111 >S-100 PROTEIN, BETA CHAIN; SWP:P04631; PDB:1B4CA; MSELEKAMVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMET ----------------------------3333--3333---3333--------------- LDEDGDGECDFQEFMAFVSMVTTACHEFFEHE --------------------------3333-- >EPHB2; SWP:P29323; PDB:1B4FA; PDYTSFNTVDEWLEAIKMGQYKESFANAGFTSFDVVSQMMMEDILRVGVTLAGHQKKILN -------------11113333----1111--33331111--------------------- SIQVMRAQMNQIQS ----------3333 >POTASSIUM CHANNEL; SWP:Q63734; PDB:1B4G; MISSVCVSYRGRKSGNKPPSKTCLKEEMA -%%%%------------3333-------- >ANTIBODY; SWP:NA; PDB:1B4JH; VQLQQPGADLVMPGAPVKLSCLASGYIFTSSWINWVKQRPGRGLEWIGRIDPSDGEVHYN -----------------------------------------------------------1 QDFKDKATLTVDKSSSTAYIQLNSLTSEDSAVYYCARGFLPWFADWGQGTLVTVSAASTK 111--------------------------------------------------------- GPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS ------------------------------------%%%%-------------1111--- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -----------------------3333------------ >ANTIBODY; SWP:NA; PDB:1B4JL; 
NIVMTQSPKSMYVSIGERVTLSCKASENVDTYVSWYQQKPEQSPKLLIYGASNRYTGVPD --------------------------------------3333------------222233 RFTGSGSATDFTLTISSVQAEDLADYHCGQSYNYPFTFGSGTKLEIKRTVAAPSVFIFPP 33---------------------------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT -----------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -----3333------------------------- >GLUTATHIONE S-TRANSFERASE; SWP:P08010; PDB:1B4PA; PMILGYWNVRGLTHPIRLLLEYTDSSYEEKRYAMGDAPDYDRSQWLNEKFKLGLDFPNLP -----------------------------------------33331111----------- YLIDGSRKITQSNAIMRYLARKHHLCGETEEERIRVDVLENQAMDTRLQLAMVCYSPDFE ---!!!!---3333-------------------------------------------333 RKKPEYLEGLPEKMKLYSEFLGKQPWFAGNKITYVDFLVYDVLDQHRIFEPKCLDAFPNL 3-------------------!!!!-3333--------------------11111111333 KDFVARFEGLKKISDYMKSGRFLSKPIFAKMAFWNPK 3---------3333--------------1111----- >HUMAN THIOLTRANSFERASE; SWP:P35754; PDB:1B4QA; AQEFVNSKIQPGKVVVFIKPTCPYSRRAQEILSQLPIKQGLLEFVDITATNHTNEIQDYL ---------2222-----------------3333-----------------3333----- QQLTGARTVPRVFIGKDSIGGSSDLVSLQQSGELLTRLKQIGALQ ---------------------3333---1111------1111--- ------------------------------------------------------------ -------------------- >MIP-1A; SWP:P10147; PDB:1B50A; SLAADTPTACCFSYTSRQIPQNFIAAYFETSSQCSKPGVIFLTKRSRQVCADPSEEWVQK ------------------------------------------------------3333-- YVSDLELSA --------- >FATTY ACID BINDING PROTEI; SWP:Q01469; PDB:1B56; TVQQLEGRWRLVDSKGFDEYMKELGVGIALRKMGAMAKPDCIITCDGKNLTIKTESTLKT --1111-------------------------------------------------3333- TQFSCTLGEKFEETTADGRKTQTVCNFTDGALVQHQEWDGKESTITRKLKDGKLVVECVM -----2222-----1111---------iiii------iiii--------iiii------i NNVTCTRIYEKVE iii---------- >FRUCTOSE-BISPHOSPHATE ALD; SWP:P11604; PDB:1B57A; SKIFDFVKPGVITGDDVQKVFQVAKENNFALPAVNCVGTDSINAVLETAAKVKAPVIVQF -1111-------!!!!-------------------------------------------- 
SNGGASFIAGKGVKSDVPQGAAILGAISGAHHVHQMAEHYGVPVILHTDHCAKKLLPWID ------3333------2222---------------3333------------3333----- GLLDAGEKHFAATGKPLFSSHMIDLSEESLQENIEICSKYLERMSKIGMTLEIELGCTGG ------------------------1111-------------------------------- EELYTQPEDVDYAYTELSKISPRFTIAASFGNVHGVYKPGNVVLTPTILRDSQEYVSKKH -----3333-------3333------------------------3333---------111 NLPHNSLNFVFHGGSGSTAQEIKDSVSYGVVKMNIDTDTQWATWEGVLNYYKANEAYLQG 1------------2222------------------------------------1111--- QLGNPKGEDQPNKKYYDPRVWLRAGQTSMIARLEKAFQELNAIDVL ---1111----3333-3333-----------------1111----- >DEOXYCYTIDYLATE HYDROXYME; SWP:P08773; PDB:1B5EA; MISDSMTVEEIRLHLGLALKEKDFVVDKTGVKTIEIIGASFVADEPFIFGALNDEYIQRE ------------------1111----1111------------------------------ LEWYKSKSLFVKDIPGETPKIWQQVASSKGEINSNYGWAIWSEDNYAQYDMCLAELGQNP ---3333--3333-----333311111111------------1111------------11 DSRRGIMIYTRPSMQFDYNKDGMSDFMCTNTVQYLIRDKKINAVVNMRSNDVVFGFRNDY 11--------1111----2222--------------%%%%-------------------- AWQKYVLDKLVSDLNAGDSTRQYKAGSIIWNVGSLHVYSRHFYLVDHWWKTGETHISKKD -----------------1111----------------3333---------------3333 Y - >CARDOSIN A; SWP:Q9XFX3; PDB:1B5FA; GSAVVALTNDRDTSYFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKCINSKACRAHSMYES ---------%%%%-----------------------------------3333------11 SDSSTYKENGTFGAIIYGTGSITGFFSQDSVTIGDLVVKEQDFIEATDEADNVFLHRLFD 111111--------------------------!!!!-------------3333------- GILGLSFQTISVPVWYNMLNQGLVKERRFSFWLNRNVDEE ------------------1111------------------ >Preprocardosin A [Precurs; SWP:Q9XFX3; PDB:1B5FB; EELQVDCNTLSSMPNVSFTIGGKKFGLTPEQYILKVK -----33331111------iiii----3333------ >INTERFERON TAU; SWP:P56828; PDB:1B5L; CYLSRKLMLDARENLKLLDRMNRLSPHSCLQDRKDFGLPQEMVEGDQLQKDQAFPVLYEM 1111--------------------------------------------1111-------- LQQSFNLFYTEHSSAAWDTTLLEQLCTGLQQQLDHLDTCRGMDPIVTVKKYFQGIYDYLQ -----------------------------------------------------------1 EKGYSDCAWEIVRVEMMRALTVSTTLQKRLTK 111----------------------------- >ASPARTATE AMINOTRANSFERAS; 
SWP:Q56232; PDB:1B5PA; MRGLSRRVQAMKPSATVAVNAKALELRRQGVDLVALTAGEPDFDTPEHVKEAARRALAQG ----3333------------------------------------------------1111 KTKYAPPAGIPELREALAEKFRRENGLSVTPEETIVTVGGSQALFNLFQAILDPGDEVIV -----1111--------------------3333-------------------2222---- LSPYWVSYPEMVRFAGGVVVEVETLPEEGFVPDPERVRRAITPRTKALVVNSPNNPTGAV ----3333----------------3333----3333-11111111--------------- YPKEVLEALARLAVEHDFYLVSDEIYEHLLYEGEHFSPGRVAPEHTLTVNGAAKAFAMTG -----------------------1111---------3333-1111------------111 WRIGYACGPKEVIKAMASVSRQSTTSPDTIAQWATLEALTNQEASRAFVEMAREAYRRRR 1-------3333-------1111------------------------------------- DLLLEGLTALGLKAVRPSGAFYVLMDTSPIAPDEVRAAERLLEAGVAVVPGTDFAAFGHV -------1111---------------3333---------------------11112222- RLSYATSEENLRKALERFARVL ----------------3333-- >POLYAMINE OXIDASE; SWP:Q546R6; PDB:1B5QA; PRVIVVGAGMSGISAAKRLSEAGITDLLILEATDHIGGRMHKTNFAGINVELGANWVEGV -------------------1111-------------!!!!----%%%%------------ NGGKMNPIWPIVNSTLKLRNFRSDFDYLAQNVYKEDGGVYDEDYVQKRIELADSVEEMGE -----3333---------------1111-----1111----------------------- KLSATLHASGRDDMSILAMQRLNEHQPNGPATPVDMVVDYYKFDYEFAEPPRVTSLQNTV --11113333--------------------------------3333---3333-3333-- PLATFSDFGDDVYFVADQRGYEAVVYYLAGQYLKTDDKSGKIVDPRLQLNKVVREIKYSP ----------------33333333-------------------1111-----------11 GGVTVKTEDNSVYSADYVMVSASLGVLQSDLIQFKPKLPTWKVRAIYQFDMAVYTKIFLK 11----1111-------------------------------------------------- FPRKFWPEGKGREFFLYASSRRGYYGVWQEFEKQYPDANVLLVTVTDEESRRIEQQSDEQ --------2222--------2222------33332222--------------1111---- TKAEIMQVLRKMFPGKDVPDATDILVPRWWSDRFYKGTFSNWPVGVNRYEYDQLRAPVGR ------------1111-----------33331111----------------3333--!!! 
VYFTGEHTSEHYNGYVHGAYLSGIDSAEILINCAQKKMC !---1111---2222------------------------ >DIHYDROLIPOAMIDE ACETYLTR; SWP:P11961; PDB:1B5SA; AAAKPATTEGEFPETREKMSGIRRAIAKAMVHSKHTAPHVTLMDEADVTKLVAHRKKFKA -------1111------------------------------------------------- IAAEKGIKLTFLPYVVKALVSALREYPVLNTSIDDETEEIIQKHYYNIGIAADTDRGLLV --1111-------------------3333----1111----------------1111--- PVIKHADRKPIFALAQEINELAEKARDGKLTPGEMKGASCTITNIGSAGGQWFTPVINHP ----3333----------------------3333----------3333------------ EVAILGIGRIAEKPIVRDGEIVAAPMLALSLSFDHRMIDGATAQKALNHIKRLLSDPELL ---------------------------------3333--3333-------------1111 LM -- >MUTL; SWP:P23367; PDB:1B63A; MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGGAKLIRIRDNG ---------------------3333---------1111--------iiii---------- CGIKKDELALALARHATSKIASLDDLEAIISLGFRGEALASISSVSRLTLTSRTAEQQEA ---11113333------------------------------------------1111--- WQAYAEGRDMNVTVKPAAHPVGTTLEVLDLFYNTPARRKFLRTEKTEFNHIDEIIRRIAL ------1111------------------2222--3333---------------------- ARFDVTINLSHNGKIVRQYRAVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWV -1111-----iiii--------22223333----------1111---------------- ADPNHTTPALAEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVD -3333-3333-------iiii---------------------------------3333-- VNVHPAKHEVRFHQSRLVHDFIYQGVLSVLQ ---1111------------------------ >ELONGATION FACTOR 1-BETA; SWP:P24534; PDB:1B64; MLVAKSSILLDVKPWDDETDMAKLEECVRSIQADGLVWGSSKLVPVGYGIKKLQIQCVVE ---------------33333333---------2222------------------------ DDKVGTDMLEEQITAFEDYVQSMDVAAFNKI ---------------1111------------ >PROTEIN (AMINOPEPTIDASE); SWP:Q59632; PDB:1B65A; KPRARDLGLPFTGVTGPYNAITDVDGVGVGFQTIIENEPRPGRKRPARSGVTAILPHMQS --3333---------1111----2222------------2222-------------3333 ETPVPVYAGVHRFNGNGEMTGTHWIEDGGYFLGPVVITNTHGIGMAHHATVRWMVDRYAS --------------------------------------1111---------------333 TYQTDDFLWIMPVVAETYDGALNDINGFPVTEADVRKALDNVASGPVQEGNCGGGTGMIT 3-----------------3333-1111-----------1111---------!!!!----i 
YGFKGGTGTASRVVEFGGRSFTIGALVQANHGQRDWLTIAGVPVGQHMRDGTPQSQLSII iii------------iiii-------------3333--iiii3333-----3333----- VVLATDLPLMPHQLKRLARRASIGIGRNGTPGGNNSGDIFIAFSTANQRPMQHRSAPFLD -------------------------1111---1111--------------1111------ VEMVNDEPLDTVYLAAVDSVEEAVVNAMIAAEDMGGTPFDRLLVQAIDHERLRAVLRQYG -----1111---------------------------1111----------------1111 RLA --- >6-PYRUVOYL TETRAHYDROPTER; SWP:P27213; PDB:1B66A; LRRRARLSRLVSFSASHRLHSPSLSAEENLKVFGKCNNPNGHGHNYKVVVTIHGEIDPVT -------------------------------------1111------------------- GMVMNLTDLKEYMEEAIMKPLDHKNLDLDVPYFADVVSTTENVAVYIWENLQRLLPVGAL ----3333---------1111---3333-3333----------------------2222- YKVKVYETDNNIVVYKGE -------1111------- >HISTONE HMFA; SWP:P48781; PDB:1B67A; GELPIAPIGRIIKNAGAERVSDDARIALAKVLEEMGEEIASEAVKLAKHAGRKTIKAEDI ------------1111-------------------------------1111----3333- ELARKMFK ----1111 >METHIONINE AMINOPEPTIDASE; SWP:P50579; PDB:1B6A; KVQTDPPSVPICDLYPNGVFPKGQECEYPEEKKALDQASEEIWNDFREAAEAHRQVRKYV ---------3333-3333------------------------------------------ MSWIKPGMTMIEICEKLEDCSRKLIKENGLNAGLAFPTGCSLNNCAAHYTPNAGDTTVLQ ----2222--------------1111-!!!!----------!!!!------2222----1 YDDICKIDFGTHISGRIIDCAFTVTFNPKYDTLLKAVKDATNTGIKCAGIDVRLCDVGEA 111---------iiii----------3333------------------22223333---- IQEVMESYEVEIDGKTYQVKPIRNLNGHSIGQYRIHAGKTVPIVKGGEATRMEEGEVYAI -----------iiii------1111-----2222------------------2222---- ETFGSTGKGVVHDDMECSHYMKNFDVGHVPIRLPRTKHLLNVINENFGTLAFCRRWLDRL ----------------------1111--------------------!!!!--3333-111 GESKYLMALKNLCDLGIVDPYPPLCDIKGSYTAQFEHTILLRPTCKEVVSRGDDY 1-------------------------2222-----------1111--1111---- >IMMUNOGLOBULIN; SWP:Q6GMX8; PDB:1B6DA; DIQMTQSPSSLSASVGDRVTITCQASQDISSYLNWYQQKPGKAPKLLIHAASSLETGVPS ----------------------------!!!!------2222------------222233 RFSGSGSGTDFSFTISSLQPEDLATYYCQQYDSLPLTFGGGTKVEIKRTVAAPSVFIFPP 33----------------1111-------------------------------------- 
SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRG -----------------1111----------- >CD94; SWP:Q13241; PDB:1B6E; CSCQEKWVGYRCNCYFISSEQKTWNESRHLCASQKSSLLQLQNTDELDFMSSSQQFYWIG ---------%%%%---------3333--------------------3333---------- LSYSEEHTAWLWENGSALSQYLFPSFETFNTKNCIAYNPNGNALDESCEDKNRYICKQQL ---3333----1111---1111---3333--------1111-----1111---------- I - >MAJOR POLLEN ALLERGEN BET; SWP:P15494; PDB:1B6FA; GVFNYETETTSVIPAARLFKAFILDGDNLFPKVAPQAISSVENIEGNGGPGTIKKISFPE --------------3333-------3333----1111-----------2222-------- GFPFKYVKDRVDEVDHTNFKYNYSVIEGGPIGDTLEKISNEIKIVATPDGGSILKISNKY ---------------------------3333----------------------------- HTKGDHEVKAEQVKASKELGETLLRAVESYLLAHSDAYN -----------------------------3333------ >HALOALKANE DEHALOGENASE; SWP:P22643; PDB:1B6G; MVNAIRTPDQRFSNLDQYPFSPNYLDDLPGYPGLRAHYLDEGNSDAEDVFLCLHGEPTWS -------3333----------------2222-----------1111------------33 YLYRKMIPVFAESGARVIAPDFFGFGKSDKPVDEEDYTFEFHRNFLLALIERLDLRNITL 33-------------------2222-------3333------------------------ VVQDWGGFLGLTLPMADPSRFKRLIIMNALMTDPVTQPAFSAFVTQPADGFTAWKYDLVT --!!!!--333333333333------------333333333333--1111---------- PSDLRLDQFMKRWAPTLTEAEASAYAAPFPDTSYQAGVRKFPKMVAQRDQAIDISTEAIS -------------1111--------3333-3333-------3333--------------- FWQNDWNGQTFMAIGMKDKLLGPDVMYPMKALINGCPEPLEIADAGHFVQEFGEQVAREA --------------1111---3333-------2222-----------3333--------- LKHFAETE ---1111- >ROP; SWP:P03051; PDB:1B6Q; MTKQEKTALNMARFIRSQTLTLLEKLNELDPDEQADICESLHDHADELYRSCLARF -------------------------1111--------------------------- >PROTEIN (N5-CARBOXYAMINOI; SWP:P09029; PDB:1B6RA; MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTR -------------------3333-------33331111-3333----------------- QLARHPAFVNRDVFPIIADRLTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVK -1111--2222--3333------------------------33331111----------- 
RRTGQWRLRANETEQLPAECYGECIVEQGINFSGEVSLVGARGFDGSTVFYPLTHNLHQD --------111111113333----------------------1111------------%% GILRTSVAFPQANAQQQARAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELAPRVHNS %%-------------------------------------------------------111 GHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYD 1-3333------------1111----------------------3333--1111------ KEVRPGRKVGHLNLTDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG ---2222---------------------3333-3333-------1111- >RIBONUCLEASE; SWP:P00656; PDB:1B6VA; KETAAAKFERQHMDSSTSAASSSNYCNQMMKSRNLTKDRCKPVNTFVHESLADVKAVCSQ ---------------------1111-----1111-1111---------------3333-- KKVTCKNGQTNCYQSKSTMRITDCRETGSSKYPNCAYKTTQANKHIIVACGGKPYVPVHF ----1111------------------1111------------------------------ DASV ---- >HOMEOBOX PROTEIN HOX-B1; SWP:P14653; PDB:1B72A; ARTFDWMKVLRTNFTTRQLTELEKEFHFNKYLSRARRVEIAATLELNETQVKIWFQNRRM ---3333----------------------------------------------------- KQKKRERE ----1111 >Pre-B-cell leukemia trans; SWP:P40424; PDB:1B72B; RKRRNFNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIRYKKN ----------------33333333-------------------------------33333 IGKFQEEANIYAA 333---------- >GLUTAMATE RACEMASE; SWP:P56868; PDB:1B74A; MKIGIFDSGVGGLTVLKAIRNRYRKVDIVYLGDTARVPYGIRSKDTIIRYSLECAGFLKD ---------3333---------1111------3333---------------------111 KGVDIIVVACNTASAYALERLKKEINVPVFGVIEPGVKEALKKSRNKKIGVIGTPATVKS 1-------------------------------3333----3333---------3333--- GAYQRKLEEGGADVFAKACPLFAPLAEEGLLEGEITRKVVEHYLKEFKGKIDTLILGCTH ------1111-----------3333--------3333--3333-3333------------ YPLLKKEIKKFLGDAEVVDSSEALSLSLHNFIKDDGSSSLELFFTDLSPNLQFLIKLILG ---33333333-------3333-----3333------------------3333------- RDYPVKLAEGVF ------------ >SLIDING CLAMP; SWP:O80164; PDB:1B77A; MKLSKDTIAILKNFASINSGILLSQGKFIMTRAVNGTTYAEANISDEIDFDVALYDLNSF -------------1111---------------1111------------------------ LSILSLVSDDAEISMHTDGNIKIADTRSTVYWPAADKSTIVFPNKPIQFPVASVITEIKA ---11111111----1111----------------3333--------------------- 
EDLQQLLRVSRGLQIDTIAITNKDGKIVINGYNKVEDSGLTRPKYSLTLTDYDGSNNFNF ----------------------%%%%------33331111-------------------- VINMANMKIQPGNYKVMLWGAGDKVAAKFESSQVSYVIAMEADSTHDF --3333--------------!!!!----------------1111---- >PYROPHOSPHATASE; SWP:Q57679; PDB:1B78A; KIYFATGNPNKIKEANIILKDLKDVEIEQIKISYPEIQGTLEEVAEFGAKWVYNILKKPV -------3333-------3333-------------------------------------- IVEDSGFFVEALNGFPGTYSKFVQETIGNEGILKLLEGKDNRNAYFKTVIGYCDENGVRL ---------1111--!!!!----------------2222--------------1111--- FKGIVKGRVSEEIRSKGYGFAYDSIFIPEEEERTFAEMTTEEKSQISHRKKAFEEFKKFL -------------------!!!!----!!!!--1111-3333-----------------1 LDRI 111- >DNAB HELICASE; SWP:P03005; PDB:1B79A; PPHSIEAEQSVLGGLMLDNERWDDVAERVVADDFYTRPHRHIFTEMARLQESGSPIDLIT -----------------3333--------1111----------------1111------- LAESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRE -----1111-3333---------------------------- >CARBAMATE KINASE; SWP:NA; PDB:1B7BA; GKKMVVALGGNAILSNDASAHAQQQALVQTSAYLVHLIKQGHRLIVSHGNGPQVGNLLLQ --------3333-------------------------1111------------------- QQAADSEKNPAMPLDTCVAMTQGSIGYWLSNALNQELNKAGIKKQVATVLTQVVVDPADE -----3333----------------------------------------------11113 AFKNPTKPIGPFLTEAEAKEAMQAGAIFKEDAGRGWRKVVPSPKPIDIHEAETINTLIKN 333-----------------1111-----------------------1111------111 DIITISCGGGGIPVVGQELKGVEAVIDKDFASEKLAELVDADALVILTGVDYVCINYGKP 1-----2222-----1111---------------------------------------11 DEKQLTNVTVAELEEYKQAGHFAPGSMLPKIEAAIQFVESQPNKQAIITSLENLGSMSGD 11--------------------3333-----------------------3333------- EIVGTVV ------- >TRANSPOSASE INHIBITOR PRO; SWP:Q46731; PDB:1B7EA; SAEAIRKAGAMQTVKLAQEFPELLAIEDTTSLSYRWWVHSVLLLEATTFRTVGLLHQEWW ---------------3333----------------------------------------- MRPDDPADADEKESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDKLAHNERFV -------3333-----------------3333---------------------------- VRSKHPRKDVESGLYLYDHLKNQPELGGYQISIPQKGVRPARKASLSLRSGRITLKQGNI -------------------3333------------------------------------- 
TLNAVLAEEINPPKGETPLKWLLLTSEPVESLAQALRVIDIYTHRWRIEEFHKAWKTGAG ------------2222--------------3333----------3333------------ AERQRMPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVESQSAETVLTP --------3333-------------------------1111---3333-----3333--- DECQLLGYLDKGKRKRKEKGSLQWAYMAIARLGGFMDSKRTGIASWGALWEGWEALQSKL -----------------------------3333------------------------333 DGFLAAKDLMAQ 3----------- >SXL-LETHAL PROTEIN; SWP:P19339; PDB:1B7FA; SNTNLIVNYLPQDMTDRELYALFRAIGPINTCRIMRDYKTGYSYGYAFVDFTSEMDSQRA ----------11113333----------------------------------3333---- IKVLNGITVRNKRLKVSYARPGGESIKDTNLYVTNLPRTITDDQLDTIFGKYGSIVQKNI ---2222-!!!!----------3333----------1111--------3333-------- LRDKLTGRPRGVAFVRYNKREEAQEAISALNNVIPEGGSQPLSVRLA -----------------------------2222-2222--------- >Glyceraldehyde-3-phosphat; SWP:P39460; PDB:1B7GO; MVNVAVNGYGTIGKRVADAIIKQPDMKLVGVAKTSPNYEAFIAHRRGIRIYVPQQSIKKF ---------3333---------1111-----------------1111-----1111---- EESGIPVAGTVEDLIKTSDIVVDTTPNGVGAQYKPIYLQLQRNAIFQGGEKAEVADISFS 3333---------------------22223333-------------11113333-----1 ALCNYNEALGKKYIRVVSCNTTALLRTICTVNKVSKVEKVRATIVRRAADQKEVKKGPIN 1113333------------------------------------------3333------- SLVPDPATVPSHHAKDVNSVIRNLDIATMAVIAPTTLMHMHFINITLKDKVEKKDILSVL --------------------1111---------------------------3333---11 ENTPRIVLISSKYDAEATAELVEVARDLKRDRNDIPEVMIFSDSIYVKDDEVMLMYAVHQ 112222-----------------------2222-------1111---!!!!-------33 ESIVVPENIDAIRASMKLMSAEDSMRITNESLGILKGYLI 33---------------------------1111------- >PHENYLALANYL-TRNA SYNTHET; SWP:P27001; PDB:1B7YA; VDVSLPGASLFSGGLHPITLMERELVEIFRALGYQAVEGPEVESEFFNFDALNIPEHHPA -1111------------------------1111----------3333----------333 RDMWDTFWLTGEGFRLEGPLGEEVEGRLLLRTHTSPMQVRYMVAHTPPFRIVVPGRVFRF 3----------------1111------------3333----------------------- EQTDATHEAVFHQLEGLVVGEGIAMAHLKGAIYELAQALFGPDSKVRFQPVYFPFVEPGA -----------------------3333-------------------------1111---- QFAVWWPEGGKWLELGGAGMVHPKVFQAVDAYRERLGLPPAYRGVTGFAFGLGVERLAML 
---------------------------------1111-----------------3333-- RYGIPDIRYFFGGRLKFLEQFKGVL ------3333----3333------- >Phenylalanyl-tRNA synthet; SWP:P27002; PDB:1B7YB; MRVPFSWLKAYVPELESPEVLEERLAGLGFETDRIERVFPIPRGVVFARVLEAHPIPGTR --------1111-------------1111------------3333----------2222- LKRLVLDAGRTVEVVSGAENARKGIGVALALPGTELPGLGQKVGERVIQGVRSFGMALSP -----------------3333-------------------------------------33 RELGVGEYGGGLLEFPEDALPPGTPLSEAWPEEVVLDLEVTPNRPDALGLLGLARDLHAL 33-------------1111-----3333------------11111111--------3333 GYALVEPEAALKAEALPLPFALKVEDPEGAPHFTLGYAFGLRVAPSPLWMQRALFAAGMR -------------------------3333------------------------------- PINNVVDVTNYVMLERAQPMHAFDLRFVGEGIAVRRAREGERLKTLDGVERTLHPEDLVI -----------------------3333----------2222---1111-----3333--- AGWRGEESFPLGLAGVMGGAESEVREDTEAIALEVACFDPVSIRKTARRHGLRTEASHRF -------------------1111-1111-------------------------------- ERGVDPLGQVPAQRRALSLLQALAGARVAEALLEAGSPKPPEAIPFRPEYANRLLGTSYP ------------------------------------------------------------ EAEQIAILKRLGCRVEGEGPTYRVTPPSHRLDLRLEEDLVEEVARIQGYETIPLALPAFF --------------------------3333-----------------3333--------- PAPDNRGVEAPYRKEQRLREVLSGLGFQEVYTYSFMDPEDARRFRLDPPRLLLLNPLAPE -1111-----------------3333----------3333-----------------111 KAALRTHLFPGLVRVLKENLDLDRPERALLFEVGRVFREREETHLAGLLFGEGVGLPWAK 1------3333------------------------------------------------- ERLSGYFLLKGYLEALFARLGLAFRVEAQAFPFLHPGVSGRVLVEGEEVGFLGALHPEIA ----3333----------------------1111---------2222------------- QELELPPVHLFELRLPLPDKPLAFQDPSRHPAAFRDLAVVVPAPTPYGEVEALVREAAGP -----------------------------------------3333--------------- YLESLALFDLYQGPPLPEGHKSLAFHLRFRHPKRTLRDEEVEEAVSRVAEALRAR ----------------1111----------------3333--------------- >RECOMBINANT LIGNIN PEROXI; SWP:P06181; PDB:1B80A; RATCSNGKTVGDSCCAWFDVLDDIQQNLFHGGQCGAEAHESIRLVFHDSIAISPAMEAQG ---1111----333--------------%%%%--------------------33331111 KFGGGGADGSIMIFDDIETAFHPNIGLDEIVKLQKPFVQKHGVTPGDFIAFAGAVALSNC 
--------3333----11113333-3333------------------------------2 PGAPQMNFFTGRAPATQPAPDGLVPEPFHTVDQIINRVNDAGEFDELELVMLSAHSVAAV 222----------------------1111------------------------------- NDVDPTVQGLPFDSTPGIFDSQFFVETQLRGTAFPGSGGNQGEVESPLPGEIRIQSDHTI ---1111-------1111------3333-----------2222----2222--------- ARDSRTACEWQSFVNNQSKLVDDFQFIFLALTQLGQDPNAMTDCSDVIPQSKPIPGNLPF --3333------2222----------------22221111---3333------------- SFFPAGKTIKDVEQACAETPFPTLTTLPGPETSVQRIPPPPGA ---22223333----1111--------------------2222 >T CELL RECEPTOR V-ALPHA D; SWP:Q5R1B3; PDB:1B88A; MQQVRQSPQSLTVWEGETAILNCSYENSAFDYFPWYQQFPGEGPALLISILSVSNKKEDG -------------2222---------3333--------------------3333------ RFTIFFNKREKKLSLHIADSQPGDSATYFCAASASFGDNSKLIWGLGTSLVVNP --------------------3333------------------------------ >CLATHRIN HEAVY CHAIN; SWP:P49951; PDB:1B89A; RLAELEEFINGPNMYDAAKLLYNNVSNFGRLASTLVHLGEYQAAVDGARKANSTRTWKEV 3333--1111---1111-----3333---------1111-3333---------------- CFACVDGKEFRLAQCGLHIVVHADELEELINYYQDRGYFEELITLEAALGLERAHGFTEL ---------3333--1111--3333---------------------3333---------- AILYSKFKPQKREHLELFWSRVNIPKVLRAAEQAHLWAELVFLYDKYEEYDNAIITNHPT -------3333------3333----------1111---------1111------------ DAWKEGQFKDIITKVANVELYYRAIQFYLEFKPLLLNDLLVLSPRLDHTRAVNYFSKVKQ ----------------3333-----------3333-------1111-------------3 LPLVKPYLRSVQNHNNKSVNESLNNLFITEEDYQALRTSIDAYDNFDNISLAQRLEKHEL 3333333---------3333----------------------------------1111-- IEFRRIAAYLFKG ------------- >ASPARTYL-TRNA SYNTHETASE; SWP:Q52428; PDB:1B8AA; MYRTHYSSEITEELNGQKVKVAGWVWEVKDLGGIKFLWIRDRDGIVQITAPKKKVDPELF -----1111-3333----------------1111------1111------3333------ KLIPKLRSEDVVAVEGVVNFTPKAKLGFEILPEKIVVLNRAETPLPLDPTGKVKAELDTR -3333-2222----------1111-----------------------1111--------- LNNRFMDLRRPEVMAIFKIRSSVFKAVRDFFHENGFIEIHTPKIIATATEGGTELFPMKY ---3333--3333------------------1111------------------------! 
FEEDAFLAESPQLYKEIMMASGLDRVYEIAPIFRAEEHNTTRHLNEAWSIDSEMAFIEDE !!!--------------1111--------------------------------------- EEVMSFLERLVAHAINYVREHNAKELDILNFELEEPKLPFPRVSYDKALEILGDLGKEIP --------------------------1111----------------------1111---2 WGEDIDTEGERLLGKYMMENENAPLYFLYQYPSEAKPFYIMKYDNKPEICRAFDLEYRGV 222----------------------------3333-1111--1111----------%%%% EISSGGQREHRHDILVEQIKEKGLNPESFEFYLKAFRYGMPPHGGFGLGAERLIKQMLDL -------------------1111------33331111----------------------- PNIREVILFPRDRRRLTP -3333------1111--- >PARVALBUMIN; SWP:P02618; PDB:1B8CA; AFAGVLNDADIAAALEACKAADSFNHKAFFAKVGLTSKSADDVKKAFAIIAQDKSGFIEE -2222--------------2222-------33331111--------------1111--33 DELKLFLQNFKADARALTDGETKTFLKAGDSDGDGKIGVDDWTALVKA 33----33331111---------------1111-----------1111 >RHODOPHYTAN PHYCOERYTHRIN; SWP:O36005; PDB:1B8DA; MKSVITTTISAADAAGRFPSSSDLESIQGNIQRAAARLEAAQKLSGNHEAVVKEAGDACF ------------------------------------------------------------ AKYSYLKNAGEAGDSPEKINKCYRDIDHYMRLINYSLVVGGTGPVDEWGIAGSREVYRAL ---33332222----------------------------------------3333----- NLPGSAYIAAFTFTRDRLCVPRDMSSQAGVEFTSALDYVINSLC --3333--------3333--1111----------------1111 >R-phycoerythrin beta chai; SWP:O36004; PDB:1B8DB; MLDAFSRVVVTSDAKAAYVGGSDLQSLKSFINDGNKRLDAVNYIVSNASCIVSDAVSGMI ------------1111----3333------------------------------------ CENPGLIAPGGCYTNRRMAACLRDGEIILRYVSYALLAGDSSVLDDRCLNGLKETYIALG -------2222-----------------------------3333----2222-------- VPTASSSRAVSIMKATATAFITNTASGRKVEVAAGDCQALQAEAASYFDKVGSSID -------------------------------------------------------- >ULTRABITHORAX HOMEOTIC PR; SWP:P83949; PDB:1B8IA; FYPWMARQTYTRYQTLELEKEFHTNHYLTRRRRIEMAHALSLTERQIKIWFQNRRMKLKK ----------3333------3333------------------------------------ EI -- >NEUROTROPHIN-3; SWP:P20783; PDB:1B8KA; YSVCDSESLWVTDKSSAIDIRGHQVTVLGEIVKQYFYETRCKEGCRGIDDKHWNSQCKTS ------------------1111----------------------2222-2222------- QTYVRALTSENNKLVGWRWIRIDTSCVCAL ------------------------------ >Neurotrophin-5 
[Precursor; SWP:P34130; PDB:1B8MB; GELAVCDAVSGWVTDRRTAVDLRGREVEVLGEVPAAGGSPLRQYFFETRCKAAGGPGAGG --------------------1111------------------------------------ GGCRGVDRRHWVSECKAKQSYVRALTADAQGRVGWRWIRIDTACVCTLLSRTGRA --22223333-----------------1111------------------------ >PURINE NUCLEOSIDE PHOSPHO; SWP:P55859; PDB:1B8OA; NGYTYEDYQDTAKWLLSHTEQRPQVAVICGSGLGGLVNKLTQAQTFDYSEIPNFPESTVP ---3333-------1111-----------2222-1111--------33332222----22 GHAGRLVFGILNGRACVMMQGRFHMYEGYPFWKVTFPVRVFRLLGVETLVVTNAAGGLNP 22--------iiii---------3333--3333--------1111-------------11 NFEVGDIMLIRDHINLPGFSGENPLRGPNEERFGVRFPAMSDAYDRDMRQKAHSTWKQMG 112222----------3333--1111---3333---------------------3333-- EQRELQEGTYVMLGGPNFETVAECRLLRNLGADAVGMSTVPEVIVARHCGLRVFGFSLIT ---------------------------1111---------------1111---------- NKVIMDYESQGKANHEEVLEAGKQAAQKLEQFVSLLMASI -----1111-------------------------3333-- >MALATE DEHYDROGENASE; SWP:Q9ZF99; PDB:1B8PA; KTPMRVAVTGAAGQICYSLLFRIANGDMLGKDQPVILQLLEIPNEKAQKALQGVMMEIDD ---------11113333------------1111-------------------------11 CAFPLLAGMTAHADPMTAFKDADVALLVGARPRGPGMERKDLLEANAQIFTVQGKAIDAV 111111-------3333-2222-----------2222----------------------- ASRNIKVLVVGNPANTNAYIAMKSAPSLPAKNFTAMLRLDHNRALSQIAAKTGKPVSSIE ------------------------11113333----------------------1111-- KLFVWGNHSPTMYADYRYAQIDGASVKDMINDDAWNRDTFLPTVGKRGAAIIDARGVSSA --------1111---1111-iiii------------------------------------ ASAANAAIDHIHDWVLGTAGKWTTMGIPSDGSYGIPEGVIFGFPVTTENGEYKIVQGLSI -----------------iiii---------2222-2222--------iiii--------- DAFSQERINVTLNELLEEQNGVQHLLG ---------------------3333-- >DEFENSIN-LIKE PEPTIDE 1; SWP:P82172; PDB:1B8WA; FVQHRPRDCESINGVCRHKDTVNCREIFLADCYNDGQKCCRK --------1111------------------------------ >AML-1B; SWP:P08515; PDB:1B8XA; SPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGLEFPNLPYYIDG --------------------1111-----------1111--------------------- DVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVD 
---------3333--1111----3333--------------------1111-3333---- FLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKK ---3333------1111----------1111-----------------1111-------- RIEAIPQIDKYLKSSKYIAWPLQGWQATFGGGDHPPKSDLVPRGSRRASVGSRMHYPGAF -----1111-1111---------1111--------------------------------- TYSPTPVTSGIGIGMSAMGS -------------------- >HISTONELIKE PROTEIN HU; SWP:P36206; PDB:1B8ZA; MNKKELIDRVAKKAGAKKKDVKLILDTILETITEALAKGEKVQIVGFGSFEVVPKFKPGK -----------------------------------1111--------------------- ALKEKVK ------- >PROTEIN (METHYLGLYOXAL SY; SWP:P0A733; PDB:1B93A; MELTTRTLPARKHIALVAHDHCKQMLMSWVERHQPLLEQHVLYATGTTGNLISRATGMNV ------------------1111----------33331111-------------------- NAMLSGPMGGDQQVGALISEGKIDVLIFFWDPLNAVPHDPDVKALLRLATVWNIPVATNV ----3333---------1111----------------------------1111------- ATADFIIQSPHFNDAVDILIPDYQRYLA --------1111----------3333-- >TRIOSEPHOSPHATE ISOMERASE; SWP:P36204; PDB:1B9BA; TRKLILAGNWKMHKTISEAKKFVSLLVNELHDVKEFEIVVCPPFTALSEVGEILSGRNIK ------------------------------------------3333-------2222--- LGAQNVFYEDQGAFTGEISPLMLQEIGVEYVIVGHSERRRIFKEDDEFINRKVKAVLEKG -------------2222-----3333--------3333---------------------- MTPILCVGETLEEREKGLTFCVVEKQVREGFYGLDKEEAKRVVIAYEPVWAIGTGRVATP ------------------------------222233333333-----1111--------- QQAQEVHAFIRKLLSEMYDEETAGSIRILYGGSIKPDNFLGLIVQKDIDGGLVGGASLKE ------------------3333-------------------3333--------3333--- SFIELARIMRGV -------1111- >INTEGRASE; SWP:P12497; PDB:1B9DA; SPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQETAYFLLKLAGRWPVKTVHTD 1111-------iiii--------------------------------------------- NGSNFTSTTVKAACWWAGIKQEFGGVIESMNKELKKIIGQVRDQAEHLKTAVQMAVFIHN --1111----------------------------------1111---------------- KKRKGGYSAGERIVDIIATDIQT ----------------------- >3-AMINO-5-HYDROXYBENZOIC ; SWP:O52552; PDB:1B9HA; KAPEFPAWPQYDDAERNGLVRALEQGQWWRMGGDEVNSFEREFAAHHGAAHALAVTNGTH ----------------------------3333------------1111------------ 
ALELALQVMGVGPGTEVIVPAFTFISSSQAAQRLGAVTVPVDVDAATYNLDPEAVAAAVT ------1111-2222--------3333----1111------------------------1 PRTKVIMPVHMAGLMADMDALAKISADTGVPLLQDAAHAHGARWQGKRVGELDSIATFSF 111-------iiii---------------------1111----iiii1111--------- QNGKLMTAGEGGAVVFPDGETEKYETAFLRHSCGRPRDDRRYFHKIAGSNMRLNEFSASV 1111------------2222---------------------------------------- LRAQLARLDEQIAVRDERWTLLSRLLGAIDGVVPQGGDVRADRNSHYMAMFRIPGLTEER ----------------------------2222-----3333-----------2222---- RNALVDRLVEAGLPAFAAFRAIYRTDAFWELGAPDESVDAIARRCPNTDAISSDCVWLHH --------1111--------11113333-------------------------------3 RVLLAGEPELHATAEIIADAVARA 333--3333-----------1111 >EPIMERASE; SWP:P80449; PDB:1B9LA; AQPAAIIRIKNLRLRTFIGIKEEEINNRQDIVINVTIHYPADKARTSEDINDALNYRTVT --------------------3333---------------------1111----------- KNIIQHVENNRFSLLEKLTQDVLDIAREHHWVTYAEVEIDKLHALRYADSVSMTLSWQR -------------3333---------------------------2222----------- >MODE; SWP:P46930; PDB:1B9MA; QAEILLTLKLQQKLFADPRRISLLKHIALSGSISQGAKDAGISYKSAWDAINENQLSEHI ---------%%%%-----------------------------------------1111-- LVERATGGAVLTRYGQRLIQLYDLLAQIQQKAFDVLSDDDALPLNSLLAAISRFSLQTSA ------------------------------------------1111-------------- RNQWFGTITARDHDDVQQHVDVLLADGKTRLKVAITAQSGARLGLDEGKEVLILLKAPWV -----------------------1111------------------2222------1111- GITQDEAVAQNADNQLPGIISHIERGAEQCEVLALPDGQTLCATVPVNEATSLQQGQNVT ----33331111----------------------1111-------333311112222--- AYFNADSVIIATLC ---1111------- >ALPHA-LACTALBUMIN; SWP:P00709; PDB:1B9OA; KQFTKCELSQLLKDIDGYGGIALPELICTMFHTSGYDTQAIVENDESTEYGLFQISNKLW -----------1111-2222-------------%%%%-------------1111-3333- CKSSQVPQSRNICDISCDKFLDDDITDDIMCAKKILDIKGIDYWLAHKALCTEKLEQWLC --3333----1111---1111-------------------3333%%%%-------1111- EKL --- >COLLAGEN ALPHA 1; SWP:P32018; PDB:1B9PA; CAVELRSPGISRFRRKIAKRSIKTLEHKRENAKE --%%%%--3333----------------3333-- >TERPREDOXIN; SWP:P33007; PDB:1B9RA; PRVVFIDEQSGEYAVDAQDGQSLMEVATQNGVPGIVAECGGSCVCATCRIEIEDAWVEIV 
--------------------------3333--------iiii-------------1111- GEANPDENDLLQSTGEPMTAGTRLSCQVFIDPSMDGLIVRVPLPA ----------3333---------3333------------------ >NEURAMINIDASE; SWP:P03474; PDB:1B9VA; EPEWTYPRLSCQGSTFQKALLISPHRFGEIKGNSAPLIIREPFVACGPKECRHFALTHYA ----------------------------1111--------------1111---------- AQPGGYYNGTRKDRNKLRHLVSVKLGKIPTVENSIFHMAAWSGSACHDGREWTYIGVDGP ------2222-------------------3333--------------------------1 DNDALVKIKYGEAYTDTYHSYAHNILRTQESACNCIGGDCYLMITDGSASGISKCRFLKI 111------!!!!----------------------iiii--------------------- REGRIIKEILPTGRVEHTEECTCGFASNKTIECACRDNSYTAKRPFVKLNVETDTAEIRL iiii----------------------1111------------------------------ MCTKTYLDTPRPDDGSIAGPCESNGDKWLGGIKGGFVHQRMASKIGRWYSRTMSKTNRMG ------------2222---3333------------------------------------- MELYVRYDGDPWTDSDALTLSGVMVSIEEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKD ---------3333----------------------------------------------- TWHSAATAIYCLMGSGQLLWDTVTGVDMAL ------------------------------ >MEROZOITE SURFACE PROTEIN; SWP:Q25659; PDB:1B9WA; MSSEHRCIDTNVPENAACYRYLDGTEEWRCLLYFKEDAGKCVPAPNMTCKDKNGGCAPEA -3333---------------1111------2222--iiii-------1111-iiii1111 ECKMNDKNEIVCKCTKEGSEPLFEGVFCSHH ----1111-------2222--%%%%------ >IGG1-KAPPA AN02 FAB (HEAV; SWP:NA; PDB:1BAFH; DVQLQESGPGLVKPSQSQSLTCTVTGYSITSDYAWNWIRQFPGNKLEWMGYMSYSGSTRY ------------2222------------------------------------1111---- NPSLRSRISITRDTSKNQFFLQLKSVTTEDTATYFCARGWPLAYWGQGTQVSVSEAKTTP 3333---------1111---------3333------------------------------ PSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLS -----------------------------------%%%%--------------------- SSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC ---------------------1111------------ ---------------------------------- >ROUS SARCOMA VIRUS PROTEA; SWP:O92805; PDB:1BAIA; LAMTMEHKDRPLVRVILTNTGSHPVKQRSVYITALLDTGADDTVISEEDWPTDWPVMEAA -----3333---------------------------1111-----1111-1111------ NPQIHGIGGGIPVRKSRDMIELGVINRDGSLERPLLLFPLVAMTPVNILGRDCLQGLGLR 
-----3333----------------3333------------------------------- LTNL ---- >ENDONUCLEASE BamH I; SWP:P23940; PDB:1BAM; MEVEKEFITDEAKELLSKDKLIQQAYNEVKTSICSPIWPATSKTFTINNTEKNCNGVVPI ---------3333-------------------------1111--------2222------ KELCYTLLEDTYNWYREKPIDVYKEFIENSELKRVGMEFETGNISSAHRSMNKLLLGLKH ---------------------------%%%%--------------------------111 GEIDLAIILMPIKQLAYYLTDRVTNFEELEPYFELTEGQPFIFIGFNAEAYNSNVPLIPK 1-----------3333--------33333333-1111----------------------- GSDGMSKRSIKKWKDKVENK 1111---------------- >ACIDIC FIBROBLAST GROWTH ; SWP:P03968; PDB:1BARA; PKLLYCSNGGYFLRILPDGTVDGTKDRSDQHIQLQLAAESIGEVYIKSTETGQFLAMDTD -------------------------3333------------------------------- GLLYGSQTPNEECLFLERLEENGYNTYISKKHAEKHWFVGLKKNGRSKLGPRTHFGQKAI ---------1111------3333--------3333------1111---3333-------- LFLPLPV ------- >PLASTOCYANIN; SWP:Q51883; PDB:1BAWA; ETFTVKMGADSGLLQFEPANVTVHPGDTVKWVNNKLPPHNILFDDKQVPGASKELADKLS --------1111-----------2222----------------11112222--------- HSQLMFSPGESYEITFSSDFPAGTYTYYCAPHRGAGMVGKITVEG ------2222------3333--------33331111--------- >GLUTATHIONE S-TRANSFERASE; SWP:P19157; PDB:1BAYA; PPYTIVYFPVRGRCEAMRMLLADQGQSWKEEVVTQLPKFEDGDLTLYQSNAILRHLGRSL ----------!!!!-------1111---------------!!!!-------------111 GLYGKNQREAAQMDMVNDGVEDLRGKYVTLIYTNYENGKNDYVKALPGHLKPFETLLSQN 1--------------------------------------------3333----------% QGGKAFIVGDQISFADYNLLDLLLIHQVLAPGCLDNFPLLSAYVARLSARPKIKAFLSSP %%%--------------------------22221111----------------------3 EHVNRPINGNGKQ 333----1111-- ------------------------------------------------- ---------------------------------- ---------------------------------- ---------------------------------- >INTEGRASE; SWP:P22886; PDB:1BB8; EKRRDNRGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLRE -----------------1111---------------------1111--2222-------- KIAELQKDIHD ----------- >AMPHIPHYSIN 2; SWP:O08839; PDB:1BB9; TTGRLDLPPGFMFKVQAQHDYTATDTDELQLKAGDVVLVIPFQNPEEQDEGWLMGVKESD 
-------2222-------------1111---2222--------1111-2222-------- WNQHKELEKCRGVFPENFTERVQ 1111-3333-----1111----- >BOVINE PANCREATIC POLYPEP; SWP:P01302; PDB:1BBA; APLEPEYPGDNATPEQMAQYAAELRRYINMLTRPRY --------------3333------------------ >POLLEN ALLERGEN 5; SWP:P10414; PDB:1BBG; DDGLCYEGTNCGKVGKYCCSPIGKYCVCYDSKAICNKNCT ------------2222--------------3333------ >CYTOCHROME C'; SWP:P00154; PDB:1BBHA; AGLSPEEQIETRQAGYEFMGWNMGKIKANLEGEYNAAQVEAAANVIAAIANSGMGALYGP ------------------------------------------------11113333---- GTDKNVGDVKTRVKPEFFQNMEDVGKIAREFVGAANTLAEVAATGEAEAVKTAFGDVGAA -----!!!!----3333------------------------1111--------------- CKSCHEKYRAK ----------- >IGG4-KAPPA B72.3 FAB (HEA; SWP:GC4_HUMAN; PDB:1BBJH; VQLQQSDAELVKPGASVKISCKASGYTFTDHAIHWAKQKPEQGLEWIGYISPGNDDIKYN --------------------------3333-----------------------------3 EKFKGKATLTADKSSSTAYMQLNSLTSEDSAVYFCKRSYYGHWGQGTTLTVSSASTKGPS 333----------------------3333------------------------------- VFPLAPCSRSTSESTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSS ---------------------------------%%%%-------------3333------ VVTVPSSSLGTKTYTCNVDHKPSNTKVDKRV ----3333-----------3333-------- >IGG4-KAPPA B72.3 FAB (HEA; SWP:GC4_HUMAN; PDB:1BBJL; DIQMTQSPASLSVSVGETVTITCRASENIYSNLAWYQQKQGKSPQLLVYAATNLADGVPS ------------------------------------------------------111111 RFSGSGSGTQYSLKINSLQSEDFGSYYCQHFWGTPYTFGGGTRLEIKRADAAPTVFIFPP 11---------------------------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------------------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNR -----1111---------------------- >BILIN BINDING PROTEIN; SWP:P09464; PDB:1BBPA; NVYHDGACPEVKPVDNFDWSNYHGKWWEVAKYPNSVEKYGKCGWAEYTPEGKSVKVSNYH -----------------3333------------3333----------------------- VIHGKEYFIEGTAYPVGDSKIGKIYHKLTYGGVTKENVFNVLSTDNKNYIIGYYCKYDED -iiii------------------------%%%%-------------------------11 KKGHQDFVWVLSRSKVLTGEAKTAVENYLIGSPVVDSQKLVYSDFSEAACKVN 
11---------------------------------3333------3333---- >LYSYL-TRNA SYNTHETASE; SWP:P13030; PDB:1BBUA; VVDLNNELKTRREKLANLREQGIAFPNDFRRDHTSDQLHAEFDGKENEELEALNIEVAVA -----------------------------------------11113333----------- GRMMTRRIMGKASFVTLQDVGGRIQLYVARDDLPEGVYNEQFKKWDLGDILGAKGKLFKT --------!!!!------1111------1111-2222---3333---------------1 KTGELSIHCTELRLLTKALRPLPDDQEARYRQRYLDLISNDESRNTFKVRSQILSGIRQF 111----------------------3333------------------------------- MVNRGFMEVETPMMQVIPGGAAARPFITHHNALDLDMYLRIAPELYLKRLVVGGFERVFE -1111------------------------------------------------------- INRNFRNEGISVRHNPEFTMMELYMAYADYKDLIELTESLFRTLAQDILGKTEVTYGDVT ----------------------------3333-----------------------!!!!- LDFGKPFEKLTMREAIKKYRPETDMADLDNFDSAKAIAESIGIHVEKSWGLGRIVTEIFE -------------------11113333-----------1111---1111----------- EVAEAHLIQPTFITEYPAEVSPLARRNDVNPEITDRFEFFIGGREIGNGFSELNDAEDQA --3333----------3333------3333----------%%%%---------------- QRFLDQVAAKDAGDDEAMFYDEDYVTALEHGLPPTAGLGIGIDRMVMLFTNSHTIRDVIL ---------11111111---3333-----------------------------3333--- FPAMRP ------ ------------------------------------------------------------ --------- >ABL TYROSINE KINASE; SWP:P00519; PDB:1BBZA; NLFVALYDFVASGDNTLSITKGEKLRVLGYNHNGEWCEAQTKNGQGWVPSNYITPVNS ------------%%%%---2222-------1111------1111----1111------ >7-FE FERREDOXIN; SWP:Q45560; PDB:1BC6; AYVITEPCIGTKDASCVEVCPVDCIHEGEDQYYIDPDVCIDCGACEAVCPVSAIYHEDFV -----3333------3333----------------3333----3333-2222---1111- PEEWKSYIQKNRDFFKK 3333--------1111- >ETS domain-containing pro; SWP:P28324; PDB:1BC8C; MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLS ----------------33331111--------------------------1111------ RALRYYYVKNIIKKVNGQKFVYKFVSYPEILNM ------1111----2222---------1111-- >CYTOHESIN-1; SWP:Q15438; PDB:1BC9; MKNMQRNKQVAMGRKKFNMDPKKGIQFLIENDLLKNTCEDIAQFLYKGEGLNKTAIGDYL -----------------------------------%%%%-3333---------------- GERDEFNIQVLHAFVELHEFTDLNLVQALRQFLWSFRLPGEAQKIDRMMEAFAQRYCQCN 
-----33333333------1111-3333---------------------------3333- NGVFQSTDTCYVLSFAIIMLNTSLHNPNVKDKPTVERFIAMNRGINDGGDLPEELLRNLY ------3333---------------1111--------------3333----3333----- ESIKNEPFKIPELEHHHHHH --1111---------3333- >UBIQUINOL CYTOCHROME C OX; SWP:P31800; PDB:1BCCA; YAQALQSVPETQVSQLDNGVRVASEQSSQPTCTVGVWIDAGSRYESEKNNGAGYFLEHLA ----1111--------------------------------3333-1111--3333----- FKGTKNRPQNALEKEVESMGAHLNAYSSREHTAYYIKALSKDVPKAVELLADIVQNCSLE ---3333--------3333----------------------3333--------------3 DSQIEKERDVIVRELQENDTSMREVVFNYLHATAFQGTGLAQSVEGPSENIRKLSRADLT 333-----------------3333----------222233333333-------------- EYLSTHYTAPRMVLAAAGGVEHQQLLELAQKHFGGVPFTYDDDAVPTLSKCRFTGSQIRH -------3333-----------------------------3333---------------- REDGLPLAHVAIAVEGPGWAHPDLVALQVANAIIGHYDRTYGGGLHSSSPLASIAVTNKL -3333--------------------3333-------------------3333-------- CQSFQTFSICYSETGLFGFYFVCDRMSIDDMMFVLQGQWMRLCTSISESEVLRGKNFLRN -----------------------1111-------------------3333---------- ALVSHLDGTTPVCEDIGRELLTYGRRIPLEEWEERLAEVDARMVREVCSKYIYDQCPAVA ---------------------------3333---------3333---------------- GPGPIEQLPDYNRIRSGMFWLR ----1111-------------- >UBIQUINOL CYTOCHROME C OX; SWP:UCR2_BOVIN; PDB:1BCCB; PPHPQDLEITKLPNGLVIASLENYSPGSTIGVFIKAGSRYENSSNLGTSHLLRLASSLTT -----------3333----------------------1111------------------- KGASSFKITRGIEAVGGKLSVESTRENMAYTVECLRDDVEILMEFLLNVTTAPEFRPWEV -----3333---1111----------------------3333------------------ ADLQPQLKIDKAVAFQNPQTHVIENLHAAAYRNALADSLYCPDYRIGKVTSVELHDFVQN ------------------------------------------1111-------------- HFTSARMALVGLGVSHPVLKNVAEQLLNIRGGLGLSGAKAKYRGGEIREQNGDSLVHAAI --3333---------------3333----------------------------------- VAESAAIGGAEANAFSVLQHVLGANPHVKRGNPFDVSAFNASYSDSGLFGFYTISQAAYA -----2222-----------------------------------------------1111 GQVIKAAYNQVKTIAQGNVSNENVQAAKNKLKAKYLMSVESSEGFLEEVGSQALAAGSYN 3333--------3333-------------------------------------------- 
PPSTVLQQIDAVADADVIKAAKKFVSRQKSMAASGNLGHTPFVDEL ------------------------------------1111-3333- >Ubiquinol-cytochrome c re; SWP:P13272; PDB:1BCCE; SHTDIKVPNFSDYRRPPDDYSTKSSRESDPSRKGFSYLVTAVTTLGVAYAAKNVVTQFVS 3333-----------1111----3333-3333---------------------------- SMSASADVLAMSKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTKKEIDQEAAVEVSQLRDP ----33331111----3333-2222-----iiii-------33333333---1111---- QHDLERVKKPEWVILIGVCTHLGCVPIANAGDFGGYYCPCHGSHYDASGRIRKGPAPLNL -1111---1111------------------------------------------------ EVPSYEFTSDDMVIVG --------1111---- >Ubiquinol-cytochrome c re; SWP:P00129; PDB:1BCCF; SRWLEGIRKWYYNAAGFNKYGLMRDDTIYENDDVKEAIRRLPENLYDDRMFRIKRALDLN --3333-----------1111-3333-----------1111------------------1 MRQQILPKEQWTKYEEDVPYLEPYLKEVIRERKEREEWDK 111---3333---1111---3333------------1111 >TOXIN BJXTR-IT; SWP:P56637; PDB:1BCG; KKNGYPLDRNGKTTCSGVNAIAPHYCNSECTKVYYAESGYCCWGACYCFGLEDDKPIGPM -------1111----!!!!----------------------iiii------1111----- KDITKKYCDVQI ------------ >BACTERIOPHAGE MU TRANSPOS; SWP:P07636; PDB:1BCO; EHLDAMQWINGDGYLHNVFVRWFNGDVIRPKTWFWQDVKTRKILGWRCDVSENIDSIRLS ---2222--------------1111---------------------------3333---- FMDVVTRYGIPEDFHITIDNTRGAANKWLTGGAPNRYRFKVKEDDPKGLFLLMGAKMHWT ----------------------1111------1111------------------------ SVVAGKGWGQAKPVERAFGVGGLEEYVDKHPALAGAYTGPYGDRAVDAELFLKTLAEGVA --2222--------3333---3333-11111111------3333--3333---------- MFNARTGRETEMCGGKLSFDDVFEREYARTIVRKPTEEQKRMLLLPAEAVNVSRKGEFTL ---------3333-------------1111----------1111--------1111---- KVGGSLKGAKNVYYNMALMNAGVKKVVVRFDPQQLHSTVYCYTLDGRFICEAECL --!!!!--------3333------------3333--------1111--------- >PERTUSSIS TOXIN; SWP:P04977; PDB:1BCPA; DPPATVYRYDSRPPEDVFQNGFTAWGNNDNVLEHLTGRSCQVGSSNSAFVSTSSSRRYTE ------------3333--------------------1111--------------3333-- VYLEHRMQEAVEAERAGRGTGHFIGYIYEVRADNNFYGAASSYFEYVDTYGDNAGRILAG -------------1111---------------1111--------------3333-3333- ALATYQSEYLAHRRIPPENIRRVTRVYHNGITGETTTTEYSNARYVSQQTRANPNPYTSR 
---------------3333----------1111--------3333--------------- RSVASIVGTLVRMAPVVGACMARQAESSEEAMVLVYYESIAYSF -----------------3333-3333---------3333----- >Pertussis toxin subunit 2; SWP:P04978; PDB:1BCPB; PGIVIPPQEQITQHGSPYGRCANKTRALTVAELRGSGDLQEYLRHVTRGWSIFALYDGTY -------3333-----iiii-2222---3333----3333-------------------- LGGEYGGVIKDGTPGGAFDLKTTFCIMTTRNTGQPATDHYYSNVTATRLLSSTNSRLCAV -!!!!-------2222-------------------------------------------- FVRSGQPVIGACTSPYDGKYWSMYSRLRKMLYLIYVAGISVRVHVSKEEQYYDYEDATFE --------------------1111------------------------------------ TYALTGISICNPGSSLC ----------2222--- >Pertussis toxin subunit 3; SWP:P04979; PDB:1BCPC; GIVIPPKALFTQQGGAYGRCPNGTRALTVAELRGNAELQTYLRQITPGWSIYGLYDGTYL -----3333------iiii-2222----------------3333---------------- GQAYGGIIKDAPPGAGFIYRETFCITTIYKTGQPAADHYYSKVTATRLLASTNSRLCAVF 3333-------2222--------------------------------------------- VRDGQSVIGACASPYEGRYRDMYDALRRLLYMIYMSGLAVRVHVSKEEQYYDYEDATFQT ------------------1111-------------------------------------- YALTGISLCNPAASIC ----------1111-- >PERTUSSIS TOXIN; SWP:P04980; PDB:1BCPD; DVPYVLVKTNMVVTSVAMKPYEVTPTRMLVCGIAAKLGAAASSPDAHVPFCFGKDLKRPG -----------------------------------22223333----------------- SSPMEVMLRAVFMQQRPLRMFLGPKQLTFEGKPALELIRMVECSGKQDCP -3333-----------------------%%%%------------------ >Pertussis toxin subunit 5; SWP:P04981; PDB:1BCPF; LPTHLYKNFTVQELALKLKGKNQEFCLTAFMSGRSLVRACLSDAGHEHDTWFDTMLGFAI ------------------!!!!-------------------------------------- SAYALKSRIALTVEDSPYPGTPGDLLELQICPLNGYCE -------------------------------2222--- >ALANINE RACEMASE; SWP:P10724; PDB:1BD0A; NDFHRDTWAEVDLDAIYDNVENLRRLLPDDTHIMAVVKANAYGHGDVQVARTALEAGASR ---------------------------1111---------iiii---------1111--- LAVAFLDEALALREKGIEAPILVLGASRPADAALAAQQRIALTVFRSDWLEEASALYSGP ----3333-------------------3333----1111------3333----------- FPIHFHLKMDTGMGRLGVKDEEETKRIVALIERHPHFVLEGLYTHFATADEVNTDYFSYQ ---------------------------------1111----------1111--------- 
YTRFLHMLEWLPSRPPLVHCANSAASLRFPDRTFNMVRFGIAMYGLAPSPGIKPLLPYPL -------3333-----------------1111-------3333-----33331111---- KEAFSLHSRLVHVKKLQPGEKVSYGATYTAQTEEWIGTIPIGYADGWLRRLQHFHVLVDG ----------------2222--2222----------------1111----1111---iii QKAPIVGRICMDQCMIRLPGPLPVGTKVTLIGRQGDEVISIDDVARHLETINYEVPCTIS i---------------------2222-------!!!!-------------33333333-3 YRVPRIFFRHKRIMEVRNAIG 333-----%%%%-----1111 >T-cell receptor alpha cha; SWP:P04437; PDB:1BD2D; QQVKQNSPSLSVQEGRISILNCDYTNSMFDYFLWYKKYPAEGPTFLISISSIKDKNADGR ------------2222---------1111--------2222--------1111------- FTVFLNKSAKHLSLHIVPSQPGDSAVYFCAAMEGAQKLVFGQGTRLTINPNIQNPDPAVY -----3333----------3333------------------------------------- QLRDSKSVCLFTDFDSQTNVSQSKDSDVYITDKTVLDMRFKSNSAVAWSNKSDFACANAF ------------------------1111--------------------------3333-- NNSIIPEDTF ---------- >Uracil phosphoribosyltran; SWP:Q26998; PDB:1BD3D; QEESILQDIITRFPNVVLMKQTAQLRAMMTIIRDKETPKEEFVFYADRLIRLLIEEALNE ------------1111-----------------3333--------------------111 LPFQKKEVTTPLDVSYHGVSFYSKICGVSIVRAGESMESGLRAVCRGVRIGKILIQRDET 1--------1111------------------3333---------2222------------ TAEPKLIYEKLPADIRERWVMLLDPMCATAGSVCKAIEVLLRLGVKEERIIFVNILAAPQ -----------1111-------------------------1111-1111----------- GIERVFKEYPKVRMVTAAVDICLNSRYYIVPGIGDFGDRYFGTM --------1111-----------1111----------------- >CIRCULARLY PERMUTED BB2-C; SWP:P62697; PDB:1BD7A; EHKIILYENPNFTGKKMEIVDDDVPSFHAHGYQEKVSSVRVQSGTWVGYQYPGYRGLQYL ---------%%%%------------3333----------------------%%%%----- LEKGDYKDNSDFGAPHPQVQSVRRIRDMQG -------3333------------------- >P19INK4D CDK4/6 INHIBITOR; SWP:P55273; PDB:1BD8; RAGDRLSGAAARGDVQEVRRLLHRELVHPDALNRFGKTALQVMMFGSTAIALELLKQGAS ---------------------------1111-1111-3333--3333-------1111-1 PNVQDTSGTSPVHDAARTGFLDTLKVLVEHGADVNVPDGTGALPIHLAVQEGHTAVVSFL 111-1111-3333--------------1111------1111-3333-------------- AAESDLHRRDARGLTPLELALQRGAQDLVDILQGHM ----1111-1111-3333--1111--------1111 >CIS-BIPHENYL-2,3-DIHYDROD; SWP:P47227; 
PDB:1BDB; MKLKGEAVLITGGASGLGRALVDRFVAEGAKVAVLDKSAERLAELETDHGDNVLGIVGDV --2222------------------------------------------!!!!------33 RSLEDQKQAASRCVARFGKIDTLIPNAGIWDYSTALVDLPEESLDAAFDEVFHINVKGYI 33----------------------------%%%%3333-3333----------------- HAVKACLPALVASRGNVIFTISNAGFYPNGGGPLYTAAKHAIVGLVRELAFELAPYVRVN ---------------------3333----------------------------------- GVGSGGINSDLRGPSSLGPLADMLKSVLPIGRMPEVEEYTGAYVFFATRGDAAPATGALL -------------3333-3333-11113333---3333---------33331111----- NYDGGLGVRGFFSGAGGNDLLEQLNIH ----3333-1111---1111------- >VPR PROTEIN; SWP:P12520; PDB:1BDE; YGDTWAGVEAIIRILQQLLFIHFRIGCRHSRIG --------------------1111--------- >RNA POLYMERASE ALPHA SUBU; SWP:P00574; PDB:1BDFA; QGSVTEFLKPRLVDIEQVSSTHAKVTLEPLERGFGHTLGNALRAILLSSMPGCAVTEVEI ------------------------------2222-------------------------2 DGVLHEYSTKEGVQEDILEILLNLKGLAVRVQGKDEVILTLNKSGIGPVTAADITHDGDV 222-1111-2222----------1111----------------------3333---3333 EIVKPQHVICHLTDENASISMRIKVQRGRGYVPASTRIERPIGRLLVDACYSPVERIAYN ---1111-------------------------3333------------------------ VEAARVEQRTDLDKLVIEMETNGTIDPEEAIRRAATILAEQLEAFV ---------------------------------------1111--- >HEXOKINASE; SWP:Q26609; PDB:1BDG; FSDQQLFEKVVEILKPFDLSVVDYEEICDRMGESMRLGLQKSTNEKSSIKMFPSYVTKTP -------------3333--3333----------------1111----------------- NGTETGNFLALDLGGTNYRVLSVTLEGGKSPRIQERTYCIPAEKMSGSGTELFKYIAETL ----------------------------------------3333---3333--------- ADFLENNGMKDKKFDLGFTFSFPCVQKGLTHATLVRWTKGFSADGVEGHNVAELLQTELD ---------------------------3333------!!!!----22223333------- KRELNVKCVAVVNDTVGTLASCALEDPKCAVGLIVGTGTNVAYIEDSSKVELMDGVKEPE -------------------------1111--------------------3333------- VVINTEWGAFGEKGELDCWRTQFDKSMDIDSLHPGKQLYEKMVSGMYLGELVRHIIVYLV -----3333-1111-1111-------3333--22223333-------------------1 EQKILFRGDLPERLKVRNSLLTRYLTDVERDPAHLLYNTHYMLTDDLHVPVVEPIDNRIV 111-%%%%--3333------3333-3333-------3333------------3333---- RYACEMVVKRAAYLAGAGIACILRRINRSEVTVGVDGSLYKFHPKFCERMTDMVDKLKPK 
---------------------1111-----------3333--2222------------11 NTRFCLRLSEDGSGKGAAAIAASC 11------1111------------ >ACETYL-COA CARBOXYLASE; SWP:P02905; PDB:1BDO; EISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEADKSGTVK -------------------1111----2222--2222------iiii------------- AILVESGQPVEFDEPLVVIE ----2222--2222------ >HIV-1 PROTEASE; SWP:P04587; PDB:1BDQA; PQITLWQRPLVTIKIGGQLKEALLDTGADDSIVAGIELPGRWKPKMVGGIGGFIKVRQYD --------------iiii------1111-------------------------------- QILIEICGHKAIGTVLVGPTPINIIGRNLLTQIGCTLNF -----iiii----------------3333---------- >BDS-I; SWP:P11494; PDB:1BDS; AAPCFCSGKPGRGDLWILRGTCPGGYGYTSNCYKWPNICCYPH ----------------------1111----------------- >PROTEIN KINASE C; SWP:P09215; PDB:1BDYA; MAPFLRISFNSYELGSLQAEDDASQPFCAVKMKEALTTDRGKTLVQKKPTMYPEWKSTFD --------------1111------------------3333-------------2222--- AHIYEGRVIQIVLMRAAEDPMSEVTVGVSVLAERCKKNNGKAEFWLDLQPQAKVLMCVQY ---2222--------2222------------------iiii------------------- FLE --- >GLUTAMATE MUTASE; SWP:Q05488; PDB:1BE1; MEKKTIVLGVIGSDCHAVGNKILDHSFTNAGFNVVNIGVLSSQEDFINAAIETKADLICV -----------------------------------------------1111--------- SSLYGQGEIDCKGLREKCDEAGLKGIKLFVGGNIVVGKQNWPDVEQRFKAMGFDRVYPPG ---3333---------------------------------------1111---------- TSPETTIADMKEVLGVE -----------3333-- >NUCLEOSIDE DIPHOSPHATE TR; SWP:P52175; PDB:1BE4A; ANSERTFIAIKPDGVQRGLMGEIIKRFEQKGFRLVAMKFMRASEDLLKEHYIDLKDRPFF --------------1111--------------------------------3333------ AGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSD ----3333----------2222-----------1111------------1111------- SVESAEKEIALWFRPEELVNYKSCAQNWIYE --3333------------------1111--- >BIFUNCTIONAL AMYLASE/SERI; SWP:P01088; PDB:1BEA; SCVPGWAIPHNPLPSCRWYVTSRTCGIGPRLPWPELKRRCCRELADIPAYCRCTALSILM ------------3333-------------------------------3333--------- DGAIPPGPDAQLEGRLEDLPGCPREVQRGFAATLVTEAECNLATISGVAECPWILG ----------------------3333---------3333----1111---1111-- >BETA-LACTOGLOBULIN; SWP:P02755; PDB:1BEBA; 
QTMKGLDIQKVAGTWYSLAMAASDISLLDAQSAPLRVYVEELKPTPEGDLEILLQKWENG --------1111-----------3333--1111-----------1111------------ ECAQKKIIAEKTKIPAVFKIDALNENKVLVLDTDYKKYLLFCMENSAEPEQSLVCQCLVR -------------1111----%%%%-------------------33331111-------- TPEVDDEALEKFDKALKALPMHIRLSFNPTQLEEQC ---------------1111-----------1111-- >14.3.D T CELL ANTIGEN REC; SWP:Q8K1Z5; PDB:1BEC; AVTQSPRNKVAVTGGKVTLSCQQTNNHNNMYWYRQDTGHGLRLIHYSYGAGSTEKGDIPD -----------2222---------------------------------2222-------- GYKASRPSQEQFSLILELATPSQTSVYFCASGGGRGSYAEQFFGPGTRLTVLEDLRQVTP -------3333--------3333----------2222----------------3333--- PKVSLFEPSKAEIANKQKATLVCLARGFFPDHVELSWWVNGKEVHSGVSTDPQAYKESNY --------3333--------------------------iiii------------------ SYCLSSRLRVSATFWHNPRNHFRCQVQFHGLSEEDKWPEGSPKPVTQNISAEAWGRAD ----------3333-----------------1111----------------------- >DSBA OXIDOREDUCTASE; SWP:P32557; PDB:1BED; AQFKEGEHYQVLKTPASSSPVVSEFFSFYCPHCNTFEPIIAQLKQQLPEGAKFQKNHVSF ---2222-------------------1111-3333------------2222--------- MGGNMGQAMSKAYATMIALEVEDKMVPVMFNRIHTLRKPPKDEQELRQIFLDEGIDAAKF -!!!!----------------------------3333-------------1111-3333- DAAYNGFAVDSMVRRFDKQFQDSGLTGVPAVVVNNRYLVQGQSVKSLDEYFDLVNYLLTL --1111--------------1111--------------------------------3333 K - >DENGUE VIRUS NS3 SERINE P; SWP:Q9Q4T1; PDB:1BEFA; WDVPSPPPVGKAELEDGAYRIKQKGILGYSQIGAGVYKEGTFHTMWHVTRGAVLMHKGKR -11113333----------------------------%%%%---3333------------ IEPSWADVKKDLVSCGGGWKLEGEWKEGEEVQVLALEPGKNPRAVQTKPGLFKTNAGTIG -------1111------------------------------------------------- AVSLDFSPGTSGSPIIDKKGKVVGIYGNGVVTRSGAYVSAIAQTEKSIEDNPEIEDD --33332222------3333----------------------33333333--3333- >PHOSPHATIDYLETHANOLAMINE ; SWP:P30086; PDB:1BEHA; VDLSKWSGPLSLQEVDEQPQHPLHVTYAGAAVDELGKVLTPTQVKNRPTSISWDGLDSGK --1111-3333----------------------2222--3333---------22221111 LYTLVLTDPDAPSRKDPKYREWHHFLVVNMKGNDISSGTVLSDYVGSGPPKGTGLHRYVW ------------33331111----------!!!!3333-----------2222------- 
LVYEQDRPLKCDEPILSNRSGDHRGKFKVASFRKKYELRAPVAGTCYQAEWDDYVPKLYE --------------------2222---------1111----------------------1 QLSG 111- >POTASSIUM CHANNEL TOXIN S; SWP:P29187; PDB:1BEI; RSCIDTIPKSRCTAFQCKHSMYRLSFCRKTCGTC -------3333-3333------------------ >BETA-NERVE GROWTH FACTOR; SWP:P01139; PDB:1BET; GEFSVCDSVSVWVGDKTTATDIKGKEVTVLAEVNINNSVFRQYFFETKCRASNPVESGCR --------------------1111------------------------------111122 GIDSKHWNSYCTTTHTFVKALTTDEKQAAWRFIRIDTACVCVLSRKA 223333----------------------------------------- >Genome polyprotein; SWP:P12915; PDB:1BEV1; QAAGALVAGTSTSTHSVATDSTPALQAAETGATSTARDESMIETRTIVPTHGIHETSVES ---------------------3333-----------3333------------1111---- FFGRSSLVGMPLLATGTSITHWRIDFREFVQLRAKMSWFTYMRFDVEFTIIATSSTGQNV -------------3333------------------3333----------------1111- TTEQHTTYQVMYVPPGAPVPSNQDSFQWQSGCNPSVFADTDGPPAQFSVPFMSSANAYST ---------------------33331111---------1111------------------ VYDGYARFMDTDPDRYGILPSNFLGFMYFRTLEDAAHQVRFRIYAKIKHTSCWIPRAPRQ -----------3333---3333-------------------------------------- APYKKRYNLVFSGDSDRICSNRASLTSY ---------------------------- >Genome polyprotein; SWP:P12915; PDB:1BEV2; EACGYSDRVAQLTLGNSTITTQEAANICVAYGCWPAKLSDTDATSVDKPTEPGVSADRFY -------------!!!!-----------2222------3333----------3333---- TLRSKPWQADSKGWYWKLPDALNNTGMFGQNAQFHYLYRGGWAVHVQCNATKFHQGTLLV -------1111--------1111------------------------------------- LAIPEHQIATQEQPAFDRTMPGSEGGTFQEPFWLEDGTSLGNSLIYPHQWINLRTNNSAT --------------3333---3333----3333-----3333---------1111----- LILPYVNAIPMDSAIRHSNWTLAIIPVAPLKYAAETTPLVPITVTIAPMETEYNGLRRAI ------------1111-------------------------------------------- ASNQ ---- >Genome polyprotein; SWP:P12915; PDB:1BEV3; GLPTKPGPGSYQFMTTDEDCSPCILPDFQPTPEIFIPGKVNNLLEIAQVESILEANNREG ------2222---1111-------1111-----------------1111--------222 VEGVERYVIPVSVQDALDAQIYALRLELGGSGPLSSSLLGTLAKHYTQWSGSVEITCMFT 2--3333------------------------3333-------1111-------------- GTFMTTGKVLLAYTPPGGDMPRNREEAMLGTHVIWDFGLQSSITLVIPWISASHFRGVSN 
-1111-----------------3333---------------------------------- DDVLNYQYYAAGHVTIWYQTNMVIPPGFPNTAGIIMMIAAQPNFSFRIQKDREDMTQTAI -1111-1111--------------2222------------1111-------1111----- LQ -- >Genome polyprotein; SWP:P12915; PDB:1BEV4; STINYNNINYYSHAASAAQNKQDFTQDPSKFTQPIADVIK ------------3333-----------3333--------- >CAMPATH-1H ANTIBODY; SWP:NA; PDB:1BEYH; QVQLQESGPGLVRPSQTLSLTCTVSGFTFTDFYMNWVRQPPGRGLEWIGFIRDKAKGYTT ---------------------------3333----------------------------- EYNPSVKGRVTMLVDTSKNQFSLRLSSVTAADTAVYYCAREGHTAAPFDYWGQGSLVTVS --------------3333----------3333---------------------------- SASTKGPSVFPLAPAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSV --------------------------------%%%%-------------3333------- VTVPSSSLGTQTYICNVNHKPSNTKVDKKV -------------------3333------- >CALCICLUDINE; SWP:P81658; PDB:1BF0; WQPPWYCKEPVRIGSCKKQFSSFYFKWTAKKCLPFLFSGCGGNANRFQTIGECRKKCLGK ----3333-------------------------------------------1111----- >ISOAMYLASE; SWP:P10342; PDB:1BF2; AINSMSLGASYDAQQANITFRVYSSQATRIVLYLYSAGYGVQESATYTLSPAGSGVWAVT --1111-----1111-------------------------------------iiii---- VPVSSIKAAGITGAVYYGYRAWGPNWPYASNWGKGSQAGFVSDVDANGDRFNPNKLLLDP -3333-----------------1111--111122222222----1111---1111---11 YAQEVSQDPLNPSNQNGNVFASGASYRTTDSGIYAPKGVVLVPSTQSTGTKPTRAQKDDV 11--------1111-3333---3333-----1111-------------------3333-- IYEVHVRGFTEQDTSIPAQYRGTYYGAGLKASYLASLGVTAVEFLPVQETQNDANDVVPN ----3333----11113333-------------------------------1111----- SDANQNYWGYMTENYFSPDRRYAYNKAAGGPTAEFQAMVQAFHNAGIKVYMDVVYNHTAE -1111--------1111-3333----2222------------1111-----------333 GGTWTSSDPTTATIYSWRGLDNATYYELTSGNQYFYDNTGIGANFNTYNTVAQNLIVDSL 3--------------3333-3333----2222-------------1111----------- AYWANTMGVDGFRFDLASVLGNSCLNGAYTASAPNCPNGGYNFDAADSNVAINRILREFT ----1111----------1111-------3333--1111----1111------------- VRPAAGGSGLDLFAEPWAIGGNSYQLGGFPQGWSEWNGLFRDSLRQAQNELGSMTIYVTQ --1111------------------2222-2222----3333------------------- 
DANDFSGSSNLFQSSGRSPWNSINFIDVHDGMTLKDVYSCNGANNSQAWPYGPSDGGTST ---111133333333--1111-----------3333------------------------ NYSWDQGMSAGTGAAVDQRRAARTGMAFEMLSAGTPLMQGGDEYLRTLQCNNNAYNLDSS ----iiii-----3333-----------------------1111---iiii--1111-33 ANWLTYSWTTDQSNFYTFAQRLIAFRKAHPALRPSSWYSGSQLTWYQPSGAVADSNYWNN 33--------------------------3333------3333----3333---3333--- TSNYAIAYAINGPSLGDSNSIYVAYNGWSSSVTFTLPAPPSGTQWYRVTDTCDWNDGAST ------------1111-----------------------------------3333----- FVAPGSETLIGGAGTTYGQCGQSLLLLISK --2222-----2222--------------- >STAT-1; SWP:P42224; PDB:1BF5A; LDKQKELDSKVRNVKDKVMCIEHEIKSLEDLQDEYDFKCKTLQNREHLLLKKMYLMLDNK ----------------------------------------3333----3333-------- RKEVVHKIIELLNVTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAES ------------------------------------1111-------------------- LQQVRQQLKKLEELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHP ----------------------1111------------------3333-------1111- QRPLVLKTGVQFTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKV -1111----------------------------------3333----------------- MNMEESTNGSLAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLE ---------------------------------------------------2222----- TTSLPVVVISNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSV ----------3333----------3333-------1111--------------------- TKRGLNVDQLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHL ------------------11111111--3333---------------------------- LPLWNDGCIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFH ------------------------------------------------------------ AVEPYTKKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRGIKTEL --------3333--3333-------------------------3333-3333-------- ISVS ---- >PHOSPHOTRIESTERASE HOMOLO; SWP:P45548; PDB:1BF6A; SFDPTGYTLAHEHLHIDLSGFKNNVDCRLDQYAFICQEMNDLMTRGVRNVIEMTNRYMGR --1111------------3333-3333--------------------------------- NAQFMLDVMRETGINVVACTGYYQDAFFPEHVATRSVQELAQEMVDEIEQGIDGTELKAG -----------------------3333-3333-------------------%%%%----- 
IIAEIGTSEGKITPLEEKVFIAAALAHNQTGRPISTHTSFSTMGLEQLALLQAHGVDLSR -------2222---------------------------%%%%---------1111-1111 VTVGHCDLKDNLDNILKMIDLGAYVQFDTIGKNSYYPDEKRIAMLHALRDRGLLNRVMLS ------------------1111------2222----3333--------11111111---- MDITRRSHLKANGGYGYDYLLTTFIPQLRQSGFSQADVDVMLRENPSQFFQ ----3333-1111----3333------------------------------ >BASIC FIBROBLAST GROWTH F; SWP:P09038; PDB:1BFG; DPKRLYCKNGGFFLRIHPDGRVDGVREKSDPHIKLQLQAEERGVVSIKGVSANRYLAMKE ----------------1111------1111----------2222-----3333-----11 DGRLLASKSVTDECFFFERLESNNYNTYRSRKYTSWYVALKRTGQYKLGSKTGPGQKAIL 11--------1111------1111--------1111----1111---3333-22221111 FLPMSA ------ >CAMPATH-1G ANTIBODY; SWP:NA; PDB:1BFOA; DIKMTQSPSFLSASVGDRVTLNCKASQNIDKYLNWYQQKLGESPKLLIYNTNNLQTGIPS -------------2222-----------!!!!------2222------------222211 RFSGSGSGTDFTLTISSLQPEDVATYFCLQHISRPRTFGTGTKLELKRANAAPTVSIFPP 11----------------1111-------------------------------------- STEQLATGGASVVCLMNKFYPRDISVKWKIDGTERNGVLNSVTDQDSADSTYSMSSTLSL 3333-------------------------iiii--------------------------- TKADYQSHNLYTCQVVHKTSSSPVVAKNFNRNEC 3333-------------------------3333- >CAMPATH-1G ANTIBODY; SWP:NA; PDB:1BFOB; EVKLLESGGGLVQPGGSMRLSCAGSGFTFTDFYMNWIRQPAGKAPEWLGFIRDKAKGYTT ------------2222-----------3333--------2222----------1111--- EYNPSVKGRFTISRDNTQNMLYLQMNTLRAEDTATYYCAREGHTAAPFDYWGQGVMVTVS --------------3333----------1111---------------------------- SAQTTAPSVYPLAPGCGDTTSSTVTLGCLVKGYFPEPVTVTWNSGALSSDVHTFPAVLQS -----------------------------------------%%%%--------------- GLYTLTSSVTSSTWPSQTVTCNVAHPASSTKVDKKV -------------1111------------------- >NUCLEAR FACTOR NF-KAPPA-B; SWP:P25799; PDB:1BFS; ASNLKIVRMDRTAGCVTGGEEIYLLCDKVQKDDIQIRFYEEEENGGVWEGFGDFSPTDVH --------------1111-----------1111---------1111--------3333-% RQFAIVFKTPKYKDVNITKPASVFVQLRRKSDLETSEPKPFLYYPE %%%------------------------------------------- >FV4155; SWP:NA; PDB:1BFVH; QVQLQESGGGLVNLGGSMTLSCVASGFTFNTYYMSWVRQTPEKTLELVAAINSDGEPIYY 
------------2222-----------3333--------1111--------3333----- PDTLKGRVTISRDNAKKTLYLQMSSLNFEDTALYYCARLNYAVYGMDYWGQGTTVTVSS 3333---------1111---------1111----------3333--------------- >FV4155; SWP:KV2G_MOUSE; PDB:1BFVL; DIELTQSPPSLPVSLGDQVSISCRSSQSLVSNNRRNYLHWYLQKPGQSPKLVIYKVSNRF -------------2222--------------------------2222------------2 SGVPDRFSGSGSGTDFTLKISRVAAEDLGLYFCSQSSHVPLTFGSGTKLEIKR 2223333----------------3333-------------------------- >CYTOCHROME B5; SWP:P00173; PDB:1BFX; DKDVKYYTLEEIQKHKDSKSTWVILHHKVYDLTKFLEEHPGGEEVLREQAGGDATENFED --------------------------------3333------------2222-3333-33 VGHSTDARELSKTYIIGELHPDDRSKIAKPSETL 33---33331111--------------------- >STAT3B; SWP:P42227; PDB:1BG1A; VVTEKQQMLEQHLQDVRKRVQDLEQKMKVVENLQDDFDFNYKTLKSQGDSVTRQKMQQLE ---------------------------------------------------2222----- QMLTALDQMRRSIVSELAGLLSAMEYVQKTLTDEELADWKRRQQIACIGGPPNICLDRLE ---------------------------------------------1111----------- NWITSLAESQLQTRQQIKKLEELQQKVSYKGDPIVQHRPMLEERIVELFRNLMKSAFVVE -------------------------------3333------------------------- RQPCMPMHPDRPLVIKTGVQFTTKVRLLVKFPELNYQLKIKVCIDKDSGDVAALRGSRKF ----3333-------2222-----------3333-------------------------- NILGTNTKVMNMEESNNGSLSAEFKHLTLREQRCGNGGRANCDASLIVTEELHLITFETE ----------------------------------------2222--1111---------- VYHQGLKIDLETHSLPVVVISNICQMPNAWASILWYNMLTNNPKNVNFFTKPPIGTWDQV --iiii---------------3333-------------------1111------------ AEVLSWQFSSTTKRGLSIEQLTTLAEKLLGPGVNYSGCQITWAKFCKENMAGKGFSFWVW ----------------------------------------3333---------------- LDNIIDLVKKYILALWNEGYIMGFISKERERAILSTKPPGTFLLRFSESSKEGGVTFTWV ------------3333------------------------------1111---------- EKDISGSTQIQSVEPYTKQQLNNMSFAEIIMGYKIMDATNILVSPLVYLYPDIPKEEAFG ----------------3333----3333--------1111------------------33 KYCRAAPLKTKFICVTPF 33---------------- >KINESIN; SWP:P33176; PDB:1BG2; DLAECNIKVMCRFRPLNESEVNRGDKYIAKFQGEDTVVIASKPYAFDRVFQSSTSQEQVY --------------------------------------iiii--------1111------ 
NDCAKKIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDPEGMGIIPRIVQDIFNYIYSM ----------1111---------2222--------1111--------------------- DENLEFHIKVSYFEIYLDKIRDLLDVSKTNLSVHEDKNRVPYVKGCTERFVCSPDEVMDT ---------------%%%%--1111-----------------2222-------------- IDEGKSNRHVAVTNMNEHSSRSHSIFLINVKQENTQTEQKLSGKLYLVDLAGSEKVSKTG --------2222------------------------------------------------ AEGAVLDEAKNINKSLSALGNVISALAEGSTYVPYRDSKMTRILQDSLGGNCRTTIVICC -------------------------1111----3333------1111------------- SPSSYNESETKSTLLFGQRAKTI --3333----------------- >HEXOKINASE; SWP:P05708; PDB:1BG3A; MIAAQLLAYYFTELKDDQVKKIDKYLYAMRLSDEILIDILTRFKKEMKNGLSRDYNPTAS ---------------3333------3333-------------------------3333-- VKMLPTFVRSIPDGSEKGDFIALDLGGSSFRILRVQVNVSMESEIYDTPENIVHGSGTQL ---------------------------------------------------1111----- FDHVADCLGDFMEKKKIKDKKLPVGFTFSFPCRQSKIDEAVLITWTKRFKASGVEGADVV ------------33333333-------------------------!!!!----2222--- KLLNKAIKKRGDYDANIVAVVNDTVGTMMTCGYDDQQCEVGLIIGTGTNACYMEELRHID ------------------------------------------------------333333 LVEGDEGRMCINTEWGAFGDDGSLEDIRTEFDRELDRGSLNPGKQLFEKMVSGMYMGELV 33-----------3333-1111-1111--------1111-2222---1111--------- RLILVKMAKEGLLFEGRITPELLTRGKFNTSDVSAIEKDKEGIQNAKEILTRLGVEPSDV -------1111-%%%%--3333-2222-3333-----------------3333------- DCVSVQHICTIVSFRSANLVAATLGAILNRLRDNKGTPRLRTTVGVDGSLYKMHPQYSRR --------------------------------1111-----------------1111--- FHKTLRRLVPDSDVRFLLSESGTGKGAAMVTAVAYRLAEQHRQIEETLAHFRLSKQTLME ----------------------3333--------------------3333---------- VKKRLRTEMEMGLRKETNSKATVKMLPSFVRSIPDGTEHGDFLALDLGGTNFRVLLVKIR --------------3333------------------------------------------ SRTVEMHNKIYSIPLEIMQGTGDELFDHIVSCISDFLDYMGIKGPRMPLGFTFSFPCHQT -------------3333------------------------------------------- NLDCGILISWTKGFKATDCEGHDVASLLRDAVKRREEFDLDVVAVVNDTVGTMMTCAYEE 1111------!!!!----2222----------3333-------------------33331 PTCEIGLIVGTGTNACYMEEMKNVEMVEGNQGQMCINMEWGAFGDNGCLDDIRTDFDKVV 
111----------------33333333----------------1111------------- DEYSLNSGKQRFEKMISGMYLGEIVRNILIDFTKKGFLFRGQISEPLKTRGIFETKFLSQ 1111-----3333---11113333-------------%%%%--3333-2222-3333--1 IESDRLALLQVRAILQQLGLNSTCDDSILVKTVCGVVSKRAAQLCGAGMAAVVEKIRENR 111-------------------3333---------------------------------- GLDHLNVTVGVDGTLYKLHPHFSRIMHQTVKELSPKCTVSFLLSEDGSGKGAALITAVGV ---------------------------------1111----------------------- RL -- >ENDO-1,4-BETA-XYLANASE; SWP:P56588; PDB:1BG4; ASVSIDAKFKAHGKKYLGTIGDQYTLTKNTKNPAIIKADFGQLTPENSMKWDATEPNRGQ ---------1111--------3333---------------------11111111--2222 FTFSGSDYLVNFAQSNGKLIRGHTLVWHSQLPGWVSSITDKNTLISVLKNHITTVMTRYK -------------------------------3333---------------------1111 GKIYAWDVLNEIFNEDGSLRNSVFYNVIGEDYVRIAFETARSVDPNAKLYINDYNLDSAG -------------1111--------------------------3333----------222 YSKVNGMVSHVKKWLAAGIPIDGIGSQTHLGAGAGSAVAGALNALASAGTKEIAITELDI 2-------------1111------------22221111---------------------2 AGASSTDYVNVVNACLNQAKCVGITVWGVADPDSWRSSSSPLLFDGNYNPKAAYNAIANA 222--------------3333--------3333--3333-----1111---------111 L 1 >N-(1-D-CARBOXYLETHYL)-L-N; SWP:Q44297; PDB:1BG6; SKTYAVLGLGNGGHAFAAYLALKGQSVLAWDIDAQRIKEIQDRGAIIAEGPGLAGTAHPD --------------------1111-------------------------3333------- LLTSDIGLAVKDADVILIVVPAIHHASIAANIASYISEGQLIILNPGATGGALEFRKILR ----3333-1111-------3333-------3333-2222-------------------1 ENGAPEVTIGETSSMLFTCRSERPGQVTVNAIKGAMDFACLPAAKAGWALEQIGSVLPQY 111-------------------2222---------------3333-------33333333 VAVENVLHTSLTNVNAVMHPLPTLLNAARCESGTPFQYYLEGITPSVGSLAEKVDAERIA ------------3333---3333------------------------------------- IAKAFDLNVPSVCEWYPATIYEAVQGNPAYRGIAGPINLNTRYFFEDVSTGLVPLSELGR --1111----3333------------3333----------3333---------------- AVNVPTPLIDAVLDLISSLIDTDFRKEGRTLEKLGLSGLTAAGIRSAVE ----------------------3333---3333--2222------1111 >FERRITIN; SWP:P07229; PDB:1BG7; DSQVRQNFHRDCEAAINRMVNMELYASYTYLSMAFYFDRDDIALHNVAKFFKEQSHEERE 
-1111---------------------------------3333------------------ HAEKLMKDQNKRGGRIVLQDVQKPERDEWGNTLEAMQAALQLEKTVNQALLDLPEEQVKS ------------------------------------------------------------ IKQLGDYITNLKRLGLPQNGMGEYLFDKHTMGE -----------------------------1111 >GRANULOCYTE COLONY-STIMUL; SWP:P35833; PDB:1BGC; SLPQSFLLKCLEQVRKIQADGAELQERLCAAHKLCHPEELMLLRHSLGIPQAPLSSCSSQ -----------------------------------3333-----1111-----1111333 SLQLRGCLNQLHGGLFLYQGLLQALAGISPELAPTLDTLQLDVTDFATNIWLQMEDLGAA 3-----------------------iiii3333---------------------------- PAMPTFTSAFQRRAGGVLVASQLHRFLELAYRGLRYLA -------------------------------------- >GRANULOCYTE COLONY-STIMUL; SWP:P35834; PDB:1BGEA; PLPQSFLLKCLEQMRKVQADGTALQETLCATHQLCHPEELVLLGHALGIPQPPLSSCSSQ --3333-----------------------------3333--------------1111333 ALQLMGCLRQLHSGLFLYQGLLQALAGISPELAPTLDTLQLDTTDFAINIWQQMEDLGMA 3-----------------------iiii3333----------------------1111-- PTMPAFTSAFQRRAGGVLVASNLQSFLELAYRALRHFAK --------------------------------------- >STAT-4; SWP:P42228; PDB:1BGF; GGSQWNQVQQLEIKFLEQVDQFYDDNFPMEIRHLLAQWIETQDWEVASNNETMATILLQN -------1111---33331111-1111-3333------11113333-------------- LLIQLDEQLGRVSKEKNLLLIHNLKRIRKVLQGKFHGNPMHVAVVISNCLREERRILAAA -------------------------------------3333------------------- NMPI ---- >BGK; SWP:P29186; PDB:1BGK; VCRDWFKETACRHAKSLGNCRTSQKYRANCAKTCELC -----------------------3333--1111---- >BARLEY GRAIN PEROXIDASE; SWP:NA; PDB:1BGP; AEPPVAPGLSFDFYWQTCPRAESIVREFVQEAVRKDIGLAAGLLRLHFHDCFVQGCDASV -----2222--1111--1111------------------------------------111 LLDGSATGPGEQQAPPNLTLRPSAFKAVNDIRDRLERECRGAVVSCSDILALAARDSVVV 11111--1111---1111--3333------------1111------------------11 SGGPDYRVPLGRRDSRSFASTQDVLSDLPGPSSNVQSLLALLGRLGLDATDLVTISGGHT 11---------------------------1111---------1111-------------- IGLAHCSSFEDRLFPRPDPTISPTFLSRLKRTCPAKGTDRRTVLDVRTPNVFDNKYYIDL ----33333333-----1111-------------2222------3333------------ VNREGLFVSDQDLFTNAITRPIVERFAQSQQDFFEQFGVSIGKMGQMRVRTSDQGEVRRN 
------33333333------------------------------------!!!!-----3 CSVRNPGPG 333------ >GLUTAMATE DEHYDROGENASE; SWP:P24295; PDB:1BGVA; SKYVDRVIAEVEKKYADEPEFVQTVEEVLSSLGPVVDAHPEYEEVALLERMVIPERVIEF --------------1111----------------------3333-----1111------- RVPWEDDNGKVHVNTGYRVQFNGAIGPYKGGLRFAPSVNLSIMKFLGFEQAFKDSLTTLP -----1111-------------1111--------1111---------------------- MGGAKGGSDFDPNGKSDREVMRFCQAFMTELYRHIGPDIDVPAGDLGVGAREIGYMYGQY -----------2222--------------3333--1111-----2222------------ RKIVGGFYNGVLTGKARSFGGSLVRPEATGYGSVYYVEAVMKHENDTLVGKTVALAGFGN -------1111----3333----3333--------------1111--2222--------- VAWGAAKKLAELGAKAVTLSGPDGYIYDPEGITTEEKINYMLEMRASGRNKVQDYADKFG --------------------1111---3333-------------------3333--1111 VQFFPGEKPWGQKVDIIMPCATQNDVDLEQAKKIVANNVKYYIEVANMPTTNEALRFLMQ --------3333------------------------------------------------ QPNMVVAPSKAVNAGGVLVSGFEMSQNSERLSWTAEEVDSKLHQVMTDIHDGSAAAAERY 1111---3333------------------------------------------------- GLGYNLVAGANIVGFQKIADAMMAQGIAW ----------------------1111--- >DNA polymerase I, thermos; SWP:P19821; PDB:1BGXT; MRGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSRGEPVQAVYGFAKSLLKALKEDGD ----------2222-----1111------1111--------------------3333--- AVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQLALIKELVDLLGLARLEVPGYEADD ---------------1111------------3333---33331111----------3333 VLASLAKKAEKEGYEVRILTADKDLYQLLSDRIHVLHPEGYLITPAWLWEKYGLRPDQWA -------------------------11112222----------3333-------333311 DYRALTGDESDNLPGVKGIGEKTARKLLEEWGSLEALLKNLDRLKPAIREKILAHMDDLK 11-------------------------1111-----------------1111-----111 LSWDLAKVRTDLPLEVDFAKRREPDRERLRAFLERLEFGSLLHEFGLLESPKALEEAPWP 1-3333---------------------11113333------------------------- PPEGAFVGFVLSRKEPMWADLLALAAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLA ---------------1111--------iiii-----33331111-------3333----- LREGLGLPPGDDPMLLAYLLDPSNTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRL --------------------3333-3333-----------3333---------------1 
EGEERLLWLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGH 1111111------------------------1111-----------------3333---- PFNLNSRDQLERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTK -----3333-3333----------------3333--2222-11113333----------- LKSTYIDPLPDLIHPRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIA -------1111--------------------------------------33333333--- EEGWLLVALDYSQIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRR ------------3333--------1111--------3333---------1111-3333-- AAKTINFGVLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYV -------3333--3333-------3333-------------------------------- ETLFGRRRYVPDLEARVKSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARML -1111-----------3333---3333--------------------3333--------- LQVHDELVLEAAEAVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE ------------3333-------------------------3333--- ----------------------------- >CIRCULIN A; SWP:P56871; PDB:1BH4; CGESCVWIPCISAALGCSCKNKVCYRNGIP ----------1111---------------- >SUBTILISIN DY; SWP:P00781; PDB:1BH6A; AQTVPYGIPLIKADKVQAQGYKGANVKVGIIDTGIASSHTDLKVVGGASFVSGESYNTDG ----33331111----------2222---------3333-----------2222------ NGHGTHVAGTVAALDNTTGVLGVAPNVSLYAIKVLNSSGSGSYSAIVSGIEWATQNGLDV -----------------------1111--------1111--------------1111--- INMSLGGPSGSTALKQAVDKAYASGIVVVAAAGNSGNSGSQNTIGYPAKYDSVIAVGAVD -------------------------------------!!!!-----3333---------1 SNKNRASFSSVGSELEVMAPGVSVYSTYPSNTYTSLNGTSMASPHVAGAAALILSKYPTL 111--1111--1111----------------------3333---------------1111 SASQVRNRLSSTATNLGDSFYYGKGLINVEAAAQ -----------------3333!!!!--3333--- >BAND 3; SWP:P02730; PDB:1BH7; IQLFDRILLFKPPKYHPDPYVKRVKTWRMHL ---1111------------------3333-- >TAFII18; SWP:Q15543; PDB:1BH9A; LFSKELRCMMYGFGDDQNPYTESVDILEDLVIEFITEMTHKAMSI --3333----1111------------------------------- >TAFII18; SWP:T2D9_HUM; PDB:1BH9B; FSEEQLNRYEMYRRSAFPKAAIKRLIQSITGTSVSQNVVIAMSGISKVFVGEVVEEALDV ------------------------------------------------------------ CEKWGEMPPLQPKHMREAVRRLKSKGQIP ----------3333--------------- 
>UTROPHIN; SWP:P46939; PDB:1BHDA; LQQTNSEKILLSWVRQTTRPYSQVNVLNFTTSWTDGLAFNAVLHRHKPDLFSWDKVVKMS -----------------1111--------1111----------3333------------- PIERLEHAFSKAQTYLGIEKLLDPEDVAVRLPDKKSIIMYLTSLFEVL ----------------------3333---------------------- >POLYGALACTURONASE; SWP:P26509; PDB:1BHE; SDSRTVSEPKTPSSCTTLKADSSTATSTIQKALNNCDQGKAVRLSAGSTSVFLSGPLSLP ------------------------------------2222-------------------2 SGVSLLIDKGVTLRAVNNAKSFENAPSSCGVVDKNGKGCDAFITAVSTTNSGIYGPGTID 222----2222------3333---2222-------------------------------- GQGGVKLQDKKVSWWELAADAKVKKLKQNTPRLIQINKSKNFTLYNVSLINSPNFHVVFS -1111-1111--3333-------------------------------------------- DGDGFTAWKTTIKTPSTARNTDGIDPMSSKNITIAYSNIATGDDNVAIKAYKGRAETRNI --------------1111--------------------------------2222------ SILHNDFGTGHGMSIGSETMGVYNVTVDDLKMNGTTNGLRIKSDKSAAGVVNGVRYSNVV -------------------------------------------1111------------- MKNVAKPIVIDTVYEKKEGSNVPDWSDITFKDVTSETKGVVVLNGENAKKPIEVTMKNVK --------------------------------------------2222------------ LTSDSTWQIKNVNVKK -1111----------- >BETA-GLUCURONIDASE; SWP:P08236; PDB:1BHGA; GLQGGMLYPQESPSRECKELDGLWSFRADFSDNRRRGFEEQWYRRPLWESGPTVDMPVPS -----------------------------------1111-3333-3333----------- SFNDISQDWRLRHFVGWVWYEREVILPERWTQDLRTRVVLRIGSAHSYAIVWVNGVDTLE -1111--3333---------------3333------------------------------ HEGGYLPFEADISNLVQVGPLPSRLRITIAINNTLTPTTLPPGTIQYLTDTSKYPKGYFV -----------------------------------1111--------------------- QNTYFDFFNYAGLQRSVLLYTTPTTYIDDITVTTSVEQDSGLVNYQISVKGSNLFKLEVR ------------------------------------------------------------ LLDAENKVVANGTGTQGQLKVPGVSLWWPYLMHERPAYLYSLEVQLTAQTSLGPVSDFYT ------------------------------------------------------------ LPVGIRTVAVTKSQFLINGKPFYFHGVNKHEDADIRGKGFDWPLLVKDFNLLRWLGANAF ----------------iiii---------------!!!!--3333--------------- RTSHYPYAEEVMQMCDRYGIVVIDECPGVGLALPQFFNNVSLHHHMQVMEEVVRRDKNHP -2222----1111-------------------3333-------------------1111- 
AVVMWSVANEPASHLESAGYYLKMVIAHTKSLDPSRPVTFVSNSNYAADKGAPYVDVICL -----------33333333--------3333-------------------3333------ NSYYSWYHDYGHLELIQLQLATQFENWYKKYQKPIIQSEYGAETIAGFHQDPPLMFTEEY --2222--2222--------------------------------2222------------ QKSLLEQYHLGLDQKRRKYVVGELIWNFADFMTEQSPTRVLGNKKGIFTRQRQPKSAAFL --------------1111-----------------1111---------1111-------- LRERYWKIANE -------1111 >CRE-BP1; SWP:P15336; PDB:1BHI; MSDDKPFLCTAPGCGQRFTNEDHLAVHKHKHEMTLKFG --------------------------------1111-- >BETA-PUROTHIONIN; SWP:P01543; PDB:1BHP; KSCCKSTLGRNCYNLCRARGAQKLCANVCRCKLTSGLSCPKDFPK ---------------------------------------1111-- >HEPATOCYTE GROWTH FACTOR; SWP:P14210; PDB:1BHTA; RRNTIHEFKKSAKTTLIKIDPALKIKTKKVNTADQCANRCTRNKGLPFTCKAFVFDKARK ---1111------------1111------------------------------------- QCLWFPFNSMSSGVKKEFGHEFDLYENKDYIRNCIIGKGRSYKGTVSITKSGIKCQPWSS -------1111-------1111----3333-----!!!!---------1111----1111 MIPHEHSFLPSSYRGKDLQENYCRNPRGEEGGPWCFTSNPEVRYEVCDIPQCSEVE --------33332222--------1111----------3333-------------- >METALLOPROTEINASE INHIBIT; SWP:P01077; PDB:1BHU; APSCPAGSLCTYSGTGLSGARTVIPASDMEKAGTDGVKLPASARSFANGTHFTLRYGPAR ----------------------------3333---------------------------- KVTCVRFPCYQYATVGKVAPGAQLRSLPSPGATVTVGQDLGD -------------------%%%%------------------- >P64K; SWP:Q51225; PDB:1BHY; GSADAEYDVVVLGGGPGGYSAAFAAADEGLKVAIVERYKTLGGVCLNVGCIPSKALLHNA ------------------------------------------------------------ AVIDEVRHLAANGIKYPEPELDIDMLRAYKDGVVSRLTGGLAGMAKSRKVDVIQGDGQFL --------3333---------3333----------------------------------- DPHHLEVSLTAGDAYEQAAPTGEKKIVAFKNCIIAAGSRVTKLPFIPEDPRIIDSSGALA -------------2222-------------------------1111--1111-3333333 LKEVPGKLLIIGGGIIGLEMGTVYSTLGSRLDVVEMMDGLMQGADRDLVKVWQKQNEYRF 3-----------------------1111------------22223333-------3333- DNIMVNTKTVAVEPKEDGVYVTFEGANAPKEPQRYDAVLVAAGRAPNGKLISAEKAGVAV ----------------------------------------------1111-3333----- TDRGFIEVDKQMRTNVPHIYAIGDIVGQPMLAHKAVHEGHVAAENCAGHKAYFDARVIPG 
3333----1111---1111---3333------------------1111------------ VAYTSPEVAWVGETELSAKASARKITKANFPWAASGRAIANGCDKPFTKLIFDAETGRII ------------------------------3333-------------------------- GGGIVGPNGGDMIGEVYLAIEMGCDAADIGKTIHPHPTLGESIGMAAEVALGTCTDLPPQ -----2222---------------33331111------3333------------------ KK -- >Bromelain inhibitor [Prec; SWP:P27478; PDB:1BI6H; EEYKCYCTDTYSDCPGFCKTCKAEFGKYICLDLISPNDCVK --------------1111-----iiii-------------- >CYCLIN-DEPENDENT KINASE 6; SWP:Q00534; PDB:1BI7A; DQQYECVAEIGEGAYGKVFKARDLKNGGRFVALKRVRVQEHPNVVRLFDVCTVSRTDRET ----------------------------------------1111---------------- KLTLVFEHVDQDLTTYLDKVPEPGVPTETIKDMMFQLLRGLDFLHSHRVVHRDLKPQNIL -------------------------------------------3333-------3333-- VTSSGQIKLADFGLARIYSFQMALTSVVVTLWYRAPEVLLQSSYATPVDLWSVGCIFAEM -1111-------------------------1111----------1111------------ FRRKPLFRGSSDVDQLGKILDVIGLPGEEDWPRDVALPRQAFHSKSAQPIEKFVTDIDEL ----------3333-----------------------3333-------3333-------- GKDLLLKCLTFNPAKRISAYSALSHPYFQ -----------3333---3333------- >RETINAL DEHYDROGENASE TYP; SWP:Q63639; PDB:1BI9A; MASLQLLPSPTPNLEIKYTKIFINNEWQNSESGRVFPVCNPATGEQVCEVQEADKVDIDK ----------------------%%%%---3333--------------------3333--- AVQAARLAFSLGSVWRRMDASERGRLLDKLADLVERDRATLATMESLNGGKPFLQAFYID ---------33331111-3333-----------------------------3333----- LQGVIKTLRYYAGWADKIHGMTIPVDGDYFTFTRHEPIGVCGQIIPWNFPLLMFTWKIAP -------------1111-------------------------------3333-------- ALCCGNTVVIKPAEQTPLSALYMGALIKEAGFPPGVVNILPGYGPTAGAAIASHIGIDKI -1111-------1111----------------2222--------------1111------ AFTGSTEVGKLIQEAAGRSNLKRVTLELGGKSPNIIFADADLDYAVEQAHQGVFFNQGQC ------------------------------------1111--------------%%%%-1 CTAGSRIFVEESIYEEFVKRSVERAKRRIVGSPFDPTTEQGPQIDKKQYNKILELIQSGV 111------3333-----------1111---1111------------------------- AEGAKLECGGKGLGRKGFFIEPTVFSNVTDDMRIAKEEIFGPVQEILRFKTMDEVIERAN ----------------------------1111-------------------------111 
NSDFGLVAAVFTNDINKALMVSSAMQAGTVWINCYGEFGLREYSEVKTVTVKIPQKNS 1----------------------------------1111------------------- >BirA BIFUNCTIONAL PROTEIN; SWP:P06709; PDB:1BIA; MKDNTVPLKLIALLANGEFHSGEQLGETLGMSRAAINKHIQTLRDWGVDVFTVPGKGYSL -------------------------------3333--------1111------------- PEPIQLLNAKQILGQLDGGSVAVLPVIDSTNQYLLDRIGELKSGDACIAEYQQAGSPFGA ------------1111--------------------3333-2222------3333-2222 NLYLSMFWRLEQPAAAIGLSLVIGIVMAEVLRKLGADKVRVKWPNDLYLQDRKLAGILVE -------------------------------11111111---------%%%%-------- LTGAAQIVIGAGINMAMWITLQEAGINLDRNTLAAMLIRELRAALELFEQEGLAPYLSRW -------------------3333-----------------------------1111---- EKLDNFINRPVKLIIGDKEIFGISRGIDKQGALLLEQDGIIKPWMGGEISLR ---1111--------------------1111--------------------- >6-PHOSPHOFRUCTO-2-KINASE/; SWP:P25114; PDB:1BIF; CPTLIVMVGLPARGKTYISKKLTRYLNFIGVPTREFNVGQYRRDMVKTYKSFEFFLPDNE ------------------------------------3333-----------33331111- EGLKIRKQCALAALNDVRKFLSEEGGHVAVFDATNTTRERRAMIFNFGEQNGYKTFFVES ------------------------------------------------1111-------- ICVDPEVIAANIVQVKLGSPDYVNRDSDEATEDFMRRIECYENSYESLDEEQDRDLSYIK ---------------1111--2222----------------1111-----1111------ IMDVGQSYVVNRVADHIQSRIVYYLMNIHVTPRSIYLCRHGESELNLKGRIGGDPGLSPR -%%%%-------------------1111--------------3333-------------- GREFSKHLAQFISDQNIKDLKVFTSQMKRTIQTAEALSVPYEQFKVLNEIDAGVCEEMTY ------------3333----------3333--3333-------3333----!!!!----- EEIQDHYPLEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVICHQAVMRC ----------------------2222---------------------------------- LLAYFLDKAAEELPYLKCPLHTVLKLTPVAYGCKVESIFLNVAAVNTHRDRPQNVDISRP --------33331111------------2222-----------------------2222- SEEALVTVPAHQ ----1111---- >TOXIN BMTX1; SWP:Q9NII6; PDB:1BIG; FTDVKCTGSKQCWPVCKQMFGKPNGKCMNGKCRCYS -------3333------------------------- >HEMOLIN; SWP:P25033; PDB:1BIHA; KYPVLKDQPAEVLFRENNPTVLECIIEGNDQGVKYSWKKDGKSYNWQEHNAALRKDEGSL --------------2222--------------------------3333------------ 
VFLRPQASDEGHYQCFAETPAGVASSRVISFRKTYLIASPAKTHEKTPIEGRPFQLDCVL -----1111---------3333-------------------------------------- PNAYPKPLITWKKRLSGADPNADVTDFDRRITAGPDGNLYFTIVTKEDVSDIYKYVCTAK --------------22221111-----3333-------------1111------------ NAAVDEEVVLVEYEIKGVTKDNSGYKGEPVPQYVSKDMMAKAGDVTMIYCMYGSNPMGYP 1111------------------------------------2222---------------- NYFKNGKDVNGNPEDRITRHNRTSGKRLLFKTTLPEDEGVYTCEVDNGVGKPQKHSLKLT ----------------------iiii-------1111----------------------- VVSAPKYEQKPEKVIVVKQGQDVTIPCKVTGLPAPNVVWSHNAKPLSGGRATVTDSGLVI ----------------------------------------%%%%---------1111--- KGVKNGDKGYYGCRATNEHGDKYFETLVQVN ------------------------------- >MHC CLASS I H-2DD; SWP:P01900; PDB:1BIIA; GSHSLRYFVTAVSRPGFGEPRYMEVGYVDNTEFVRFDSDAENPRYEPRARWIEQEGPEYW -------------1111-------------------1111--------3333---3333- ERETRRAKGNEQSFRVDLRTALRYYNQSAGGSHTLQWMAGCDVESDGRLLRGYWQFAYDG -------------------------------------------1111----------iii CDYIALNEDLKTWTAADMAAQITRRKWEQAGAAERDRAYLEGECVEWLRRYLKNGNATLL i-----3333--------------------------------------------1111-- RTDPPKAHVTHHRRPEGDVTLRCWALGFYPADITLTWQLNGEELTQEMELVETRPAGDGT -------------1111---------------------%%%%------------------ FQKWASVVVPLGKEQKYTCHVEHEGLPEPLTLRW ---------22223333-----1111-------- >BIKUNIN; SWP:P02760; PDB:1BIK; SCQLGYSAGPCMGMTSRYFYNGTSMACETFQYGGCMGNGNNFVTEKECLQTCRTVAACNL 3333----------------------------------------------------1111 PIVRGPCRAFIQLWAFDAVKGKCVLFPYGGCQGNGNKFYSEKECREYCGV -----------------1111----------------------------- >LEGHEMOGLOBIN A; SWP:P02238; PDB:1BINA; VAFTEKQDALVSSSFEAFKANIPQYSVVFYTSILEKAPAAKDLFSFLANGVDPTNPKLTG ------------------------------------3333---1111----1111----- HAEKLFALVRDSAGQLKASGTVVADAALGSVHAQKAVTDPQFVVVKEALLKTIKAAVGDK -------------------------------------3333---------------!!!! 
WSDELSRAWEVAYDELAAAIKKA ----------------------- >COMPLEMENT FACTOR D; SWP:P00746; PDB:1BIO; ILGGREAEAHARPYMASVQLNGAHLCGGVLVAEQWVLSAAHCLEDAADGKVQVLLGAHSL -------22221111----------------1111---3333------------------ SQPEPSKRLYDVLRAVPHPDSQPDTIDHDLLLLQLSEKATLGPAVRPLPWQRVDRDVAPG ---1111----------11111111--------------------------------222 TLCDVAGWGIVNHAGRRPDSLQHVLLPVLDRATCNRRTHHDGAITERLMCAESNRRDSCK 2----------------------------3333--3333-----1111----------22 GDSGGPLVCGGVLEGVVTSGSRVCGNRKKPGIYTRVASYAAWI 22-----------------------1111-----3333----- >AP ENDONUCLEASE 1; SWP:P27695; PDB:1BIX; LYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSENK ----------1111---------------------------------------------- LPAELQELPGLSHQYWSAPSDKEGYSGVGLLSRQCPLKVSYGIGDEEHDQEGRVIVAEFD ---11111111---------------------------------3333----------11 SFVLVTAYVPNAGRGLVRLEYRQRWDEAFRKFLKGLASRKPLVLCGDLNVAHEEIDLRNP 11----------2222--------------------1111------------1111--33 KGNKKNAGFTPQERQGFGELLQAVPLADSFRHLYPNTPYAYTFWTYMMNARSKNVGWRLD 331111----------------------------------------%%%%1111------ YFLLSHSLLPALCDSKIRSKALGSDHCPITLYLAL ----33331111-----1111-------------- >Prolactin-binding protein; SWP:Q9QV16; PDB:1BJ1H; EVQLVESGGGLVQPGGSLRLSCAASGYTFTNYGMNWVRQAPGKGLEWVGWINTYTGEPTY ------------2222-----------1111--------2222----------------- AADFKRRFTFSLDTSKSTAYLQMNSLRAEDTAVYYCAKYPHYYGSSHWYFDVWGQGTLVT 3333--------1111----------1111------------------------------ VSSASTKGPSVFPLAPSGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLY -------------------------------------%%%%--2222-------3333-- SLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK --------1111------------1111---------- >SERINE HYDROXYMETHYLTRANS; SWP:P34896; PDB:1BJ4A; DADLWSSHDAMLAQPLKDSDVEVYNIIKKESNRQRVGLELIASENFASRAVLEALGSCLN --------------3333----------------------1111-------3333--333 NKYSEGYPGQRYYGGTEFIDELETLCQKRALQAYKLDPQCWGVNVQPYSGSPANFAVYTA 3-----2222-----3333------------1111-1111-------------------- LVEPHGRIMGLDLPDGGHLTHGFMTDKKKISATSIFFESMPYKVNPDTGYINYDQLEENA 
--2222------1111-3333---3333--3333-------------------------- RLFHPKLIIAGTSCYSRNLEYARLRKIADENGAYLMADMAHISGLVAAGVVPSPFEHCHV ----------------------------1111-------1111----------3333--- VTTTTHKTLRGCRAGMIFYRKGVKSVDPATGKEILYNLESLINSAVFPGLQGGPHNHAIA -----!!!!--------------------------------------------------- GVAVALKQAMTLEFKVYQHQVVANCRALSEALTELGYKIVTGGSDNHLILVDLRSKGTDG --------------------------------1111--2222-----------1111--- GRAEKVLEACSIACNKNTCPGDRSALRPSGLRLGTPALTSRGLLEKDFQKVAHFIHRGIE -----------------------3333--------3333--------------------- LTLQIQSDTGVAATLKEFKERLAGDKYQAAVQALREEVESFASLFPLPGL ------1111--------------1111-------------1111----- >DNA (ACGCC); SWP:Q74084; PDB:1BJ6A; NVKCFNCGKEGHTARNCRAPRKKGCWKCGKEGHQMKDCTERQ ------------3333-----------------3333----- >D 2; SWP:Q28133; PDB:1BJ7; IDPSKIPGEWRIIYAAADNKDKIVEGGPLRNYYRRIECINDCESLSITFYLKDQGTCLLL -3333-------------3333-2222-------------------------%%%%---- TEVAKRQEGYVYVLEFYGTNTLEVIHVSENMLVTYVENYDGERITKMTEGLAKGTSFTPE ---------------------------1111----------------------------- ELEKYQQLNSERGVPNENIENLIKTDNCPP ---------1111-1111---3333----- >TRANSCRIPTION REGULATORY ; SWP:P22915; PDB:1BJAA; SKVTYIIKASNDVLNEKTATILITIAKKDFITAAEVREVHPDLGNAVVNSNIGVLIKKGL -------1111----------------2222--------3333----------------- VEKSGDGLIITGEAQDIISNAATLYAQENAPELLK ---!!!!----3333-------------3333--- >NEUROCALCIN DELTA; SWP:P61602; PDB:1BJFA; NSKLRPEVMQDLLESTDFTEHEIQEWYKGFLRDCPSGHLSMEEFKKIYGNFFPYGDASKF ----3333-------------------------3333---------3333------3333 AEHVFRTFDANGDGTIDFREFIIALSVTSRGKLEQKLKWAFSMYDLDGNGYISKAEMLEI --------3333-------------------3333---------1111------------ VQAIYKMVSSVMKMPEDESTPEKRTEKIFRQMDTNRDGKLSLEEFIRGAKSDPSIVRLLQ ---1111--1111-1111--------------1111----------------33333333 C - >AGKISTRODOTOXIN; SWP:P14421; PDB:1BJJA; NLLQFNKMIKEETGKNAIPFYAFYGCYCGWGGQGKPKDGTDRCCFVHDCCYGRLVNCNTK 3333---------------------------------3333--------3333----333 SDIYSYSLKEGYITCGKGTNCEEQICECDRVAAECFRRNLDTYNNGYMFYRDSKCTETSE 
3-------------------------------------3333-3333---3333------ EC -- >LOC - LAMBDA 1 TYPE LIGHT; SWP:NA; PDB:1BJMA; SVLTQPPSASGTPGQRVTISCSGSSSNIGENSVTWYQHLSGTAPKLLIYEDNSRASGVSD -----------2222---------------------------------------2222-- RFSASKSGTSASLAISGLQPEDETDYYCAAWDDSLDVAVFGTGTKVTVLGQPKANPTVTL ------------------1111-------------------------------------- FPPSSEELQANKATLVCLISDFYPGAVTVAWKADGSPVKAGVETTKPSKQSNNKYAASSY ------3333----------------------%%%%-------------1111------- LSLTPEQWKSHRSYSCQVTHEGSTVEKTVAPTECS ---11111111--------------------2222 >PHOSPHOSERINE AMINOTRANSF; SWP:P23721; PDB:1BJNA; QIFNFSSGPAMLPAEVLKQAQQELRDWNGLGTSVMEVSHRGKEFIQVAEEAEKDFRDLLN ----------------------1111%%%%--3333-1111------------------- VPSNYKVLFCHGGGRGQFAAVPLNILGDKTTADYVDAGYWAASAIKEAKKYCTPNVFDAK -1111-------3333---------!!!!------------------3333--------- VTVDGLRAVKPMREWQLSDNAAYMHYCPNETIDGIAIDETPDFGADVVVAADFSSTILSR --iiii----3333---1111----------------------1111--------2222- PIDVSRYGVIYAGAQNIGPAGLTIVIVREDLLGKANIACPSILDYSILNDNGSMFNTPPT --3333---------------------3333----11113333-------%%%%------ FAWYLSGLVFKWLKANGGVAEMDKINQQKAELLYGVIDNSDFYRNDVAKRNRSRMNVPFQ ------------------------------------1111-------3333--------- LADSALDKLFLEESFAAGLHALKGHRVVGGMRASIYNAMPLEGVKALTDFMVEFERRHG --1111------------------3333-------33333333---------------- >TOPOISOMERASE II; SWP:P06786; PDB:1BJT; RKSRITNYPKLEDANKAGTKEGYKCTLVLTEGDSALSLAVAGLAVVGRDYYGCYPLRGKM -------1111--1111-1111-------------------------------------- LNVREALKNAEIQAIKKIMGLQHRKKYEDTKSLRYGHLMIMTDSHIKGLIINFLESSFLG -------3333---------------------------------3333------------ LLDIQGFLLEFITPIIKVSITKPTKNTIAFYNMPDYEKWREEESHKFTWKQKYYKGLGTS 1111---------------------------3333-------1111-----------111 LAQEVREYFSNLDRHLKIFHSLQWLRQYEPFINKELILFSLADNIRSIPNVLDGFKPGQR 1----------------------------------------------------------- KVLYGCFKKNLKSELKVAQLAPYVSECTAYHHGEQSLAQTIIGLAQNFVGSNNIYLLLPN ---------------3333-----------3333-------------------------- 
GAFGTRATGGKDAAAARYIYTELNKLTRKIFHPADDPLYKYIQEDEKTVEPEWYLPILPM -----1111-----3333-----3333----33333333----iiii------------- ILVNGAEGIGTGWSTYIPPFNPLEIIKNIRHLMNDEELEQMHPWFRGWTGTIEEIEPLRY -------------------------------1111---------2222-------2222- RMYGRIEQIGDNVLEITELPARTWTSTIKEYLLLGLSGNDKIKPWIKDMEEQHDDNIKFI ---------2222----------3333-----------2222------------------ ITLSPEEMAKTRKIGFYERFKLISPISLMNMVAFDPHGKIKKYNSVNEILSEFYYVRLEY -----------------1111-------------1111------3333------------ YQKRKDHMSERLQWEVEKYSFQVKFIKMIIEKELTVTNKPRNAIIQELENLGFPRFNKEG -----------------------------------2222---------1111----1111 KPYYGSPEELYGTYEYLLGMRIWSLTKERYQKLLKQKQEKETELENLLKLSAKDIWNTDL -------------3333---3333----------------------1111---------- KAFEVGYQEFLQRDAEARG ------------------- >FRUCTOSE-1,6-BISPHOSPHATA; SWP:P00637; PDB:1BK4A; FDTDISTMTRFVMEEGRKAGGTGEMTQLLNSLCTAVKAISTAVRKAGIKLDVLSNDLVMN ------------------------------------------------------------ MLKSSFATCVLVSEEDKNAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSIGTIFGIYRKKS --3333------1111------3333---------------------------------- TDEPSTKDALQPGRNLVAAGYALYGSATMLVLAGGSGVNSFMLDPAIGEFILVDKNVKIK ----3333----1111-------------------------------------------- KKGNIYSLNEGYAKDFDPAVTEYIQKKKFPPDNSSPYGARYVGSMVADVHRTLVYGGIFL ---------1111----------------1111----------3333------------- YPANKKSPDGKLRLLYECNPMAFIMEKAGGMATTGKEAILDIVPTDIHQRAPVILGSPDD ---3333------------------1111--------3333----1111----------- VQEFLEIYKKHAVK --------1111-- >KARYOPHERIN ALPHA; SWP:Q02821; PDB:1BK5A; LPQMTQQLNSDDMQEQLSATVKFRQILSREHRPPIDVVIQAGVVPRLVEFMRENQPEMLQ 3333--1111-3333----------1111----------------------1111----- LEAAWALTNIASGTSAQTKVVVDADAVPLFIQLLYTGSVEVKEQAIWALGNVAGDSTDYR ---------1111--------1111----------------------------------- DYVLQCNAMEPILGLFNSNKPSLIRTATWTLSNLCRGKKPQPDWSVVSQALPTLAKLIYS ------------3333--------------------------3333-------------- MDTETLVDACWAISYLSDGPQEAIQAVIDVRIPKRLVELLSHESTLVQTPALRAVGNIVT --------------1111--------------------1111-3333----------111 
GNDLQTQVVINAGVLPALRLLLSSPKENIKKEACWTISNITAGNTEQIQAVIDANLIPPL 13333----------------------------------1111----------------- VKLLEVAEYKTKKEACWAISNASSGGLQRPDIIRYLVSQGCIKPLCDLLEIADNRIIEVT -------3333-----------1111----------1111-----3333----------- LDALENILKMGEADKEARGLNINENADFIEKAGGMEKIFNCQQNENDKIYEKAYKIIETY ----------------------------------------1111-3333----------- FG -- >ANTIMICROBIAL PROTEIN 1; SWP:Q7M1F3; PDB:1BK8; LCNERPSQTWSGNCGNTAHCDKQCQDWEKASHGACHKRENHWKCFCYFNC -------------------------------------%%%%--------- >TRANSLATION INITIATION FA; SWP:P56635; PDB:1BKB; KWVSTKYVEAGELKEGSYVVIDGEPCRVVEIEKSKTGKHGSAKARIVAVGVFDGGKRTLS --------3333-2222---iiii------------------------------------ LPVDAQVEVPIIEKFTAQILSVSGDVIQLDRDYKTIEVPKYVEEEAKGRLAPGAEVEVWQ -1111-----------------1111----------------333311112222------ ILDRYKIIRVKG !!!!-------- >FK506 BINDING PROTEIN; SWP:P20071; PDB:1BKF; GVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDKNKPFKFMLGKQEVIRGWE ----------------2222---------1111----3333--------------3333- EGVAQMSVGQRAKLTISPDYAYGATGVPGIIPPHATLVFDVELLKLE -3333-2222------3333-!!!!-2222-2222------------ >NADPH-FLAVIN OXIDOREDUCTA; SWP:Q56691; PDB:1BKJA; NNTIETILAHRSIRKFTAVPITDEQRQTIIQAGLAASSSSMLQVVSIVRVTDSEKRNELA ------1111---------------------------2222------------------- QFAGNQAYVESAAEFLVFCIDYQRHATINPDVQADFTELTLIGAVDSGIMAQNCLLAAES 1111--3333------------------1111---3333-------------------11 MGLGGVYIGGLRNSAAQVDELLGLPENSAVLFGMCLGHPDQNPEVKPRLPAHVVVHENQY 11-------3333------------------------------------3333------- QELNLDDIQSYDQTMQAYYSTWSQEVTGKLAGESRPHILPYLNSKGLAKR --------------------3333----3333--1111----1111---- >THYMIDYLATE SYNTHASE A; SWP:P42326; PDB:1BKPA; TQFDKQYNSIIKDIINNGISDEEFDVRTKWDSDGTPAHTLSVISKQMRFDNSEVPILTTK --------------------3333------------------------------------ KVAWKTAIKELLWIWQLKSNDVNDLNMMGVHIWDQWKQEDGTIGHAYGFQLGKKNRSLNG --------------------3333-1111-1111---1111-----3333-------iii EKVDQVDYLLHQLKNNPSSRRHITMLWNPDELDAMALTPCVYETQWYVKHGKLHLEVRAR 
i--------------1111--------11111111-------------iiii-------- SNDMALGNPFNVFQYNVLQRMIAQVTGYELGEYIFNIGDCHVYTRHIDNLKIQMEREQFE ------------------------------------------1111-----3333----- APELWINPEVKDFYDFTIDDFKLINYKHGDKLLFEVAV ------1111-1111-1111------------------ >SPECTRIN BETA CHAIN; SWP:Q01082; PDB:1BKRA; KSAKDALLLWCQMKTAGYPNVNIHNFTTSWRDGMAFNALIHKHRPDLIDFDKLKKSNAHY ------------1111----------3333-------------3333-3333-3333--- NLQNAFNLAEQHLGLTKLLDPEDISVDHPDEKSIITYVVTYYHYFSKM -------------------3333------------------------- >BMKTX; SWP:Q9NII7; PDB:1BKT; VGINVKCKHSGQCLKPCKDAGMRFGKCINGKCDCTPK -----------------1111---------------- >CALCITONIN; SWP:P01262; PDB:1BKU; CSNLSTCVLGKLSQELHKLQTYPRTDVGAGTP ----3333------------------------ >GALECTIN-7; SWP:P47929; PDB:1BKZA; SNVPHKSSLPEGIRPGTVLRIRGLVPPNASRFHVNLLCGEEQGSDAALHFNPRLDTSEVV --------1111-2222--------1111-----------2222---------1111--- FNSKEQGSWGREERGPGVPFQRGQPFEVLIIASDDGFKAVVGDAQYHHFRHRLPLARVRL ----iiii------------2222--------1111----%%%%---------3333--- VEVGGDVQLDSVRIF --------------- >MULTIPLE ANTIBIOTIC RESIS; SWP:P27246; PDB:1BL0A; DAITIHSILDWIEDNLESPLSLEKVSERSGYSKWHLQRMFKKETGHSLGQYIRSRKMTEI 3333-------------------3333--------------------------------- AQKLKESNEPILYLAERYGFESQQTLTRTFKNYFDVPPHKYRMTNMQGESRFLHPL ---------------1111-----------------3333--------2222---- >PARATHYROID HORMONE RECEP; SWP:Q03431; PDB:1BL1; SEAVKFLTNETREREVFDRLGMIYTVGYSVC 3333-------33333333------3333-- ----------------------------- >FRUCTOSE PERMEASE; SWP:P26380; PDB:1BLE; MNIVLARIDDRFIHGQILTRWIKVHAADRIIVVSDDIAQDEMRKTLILSVAPSNVKASAV --------1111----------------------3333-3333-3333------------ SVSKMAKAFHSPRYEGVTAMLLFENPSDIVSLIEAGVPIKTVNVGGMRFENHRRQITKSV -----------1111----------------------------------1111---1111 SVTEQDIKAFETLSDKGVKLELRQLPSDASEDFVQILRNVT ------------------------1111------------- >Cytosol aminopeptidase; SWP:P00727; PDB:1BLLE; TKGLVLGIYSKEDEPQFTSAGENFNKLVSGKLREILNISGPPLKAGKTRTFYGLHEDFPS 
-------------------------1111--------------2222-------1111-- VVVVGLGKKTAGIDEQENWHEGKENIRAAVAAGCRQIQDLEIPSVEVDPCGDAQAAAEGA -------1111--------------------------1111-------iiii-------- VLGLYEYDDLKQKRKVVVSAKLHGSEDQEAWQRGVLFASGQNLARRLMETPANEMTPTKF -------1111---------------------------------------3333------ AEIVEENLKSASIKTDVFIRPKSWIEEQEMGSFLSVAKGSEEPPVFLEIHYKGSPNASEP -------3333---------3333-1111-------3333---------------1111- PLVFVGKGITFDSGGISIKAAANMDLMRADMGGAATICSAIVSAAKLDLPINIVGLAPLC --------------------2222--1111------------------------------ ENMPSGKANKPGDVVRARNGKTIQVDNTDAEGRLILADALCYAHTFNPKVIINAATLTGA ---------2222---1111------1111------------3333-----------333 MDIALGSGATGVFTNSSWLWNKLFEASIETGDRVWRMPLFEHYTRQVIDCQLADVNNIGK 3----------------------------------------------------------- YRSAGACTAAAFLKEFVTHPKWAHLDIAGVMTNKDEVPYLRKGMAGRPTRTLIEFLFRFS ---1111-------------------3333------1111-------3333--------- Q - >MONOCLONAL ANTIBODY MRK-1; SWP:NA; PDB:1BLNB; EVILVESGGGLVKPGGSLKLSCAASGFTFSSYTMSWVRQTPEKRLEWVATISSGGGNTYY ------------2222-----------3333--------1111----------------- PDSVKGRFTISRDNAKNNLYLQMSSL 3333---------------------- >FERREDOXIN; SWP:P00208; PDB:1BLU; ALMITDECINCDVCEPECPNGAISQGDETYVIEPSLCTECVGHYETSQCVEVCPVDCIIK ----1111---3333--1111-----------3333-iiii------------------- DPSHEETEDELRAKYERITG 1111---------------- >CYCLIN-DEPENDENT KINASE 6; SWP:Q00534; PDB:1BLXA; GLCRADQQYECVAEIGEGAYGKVFKARDLKNGGRFVALKRVRVQTGEEGMPLSTIREVAV ---3333----------1111------------------------1111--3333----- LRHLETFEHPNVVRLFDVCTVSRTDRETKLTLVFEHVDQDLTTYLDKVPEPGVPTETIKD ----11111111-----------3333--------------------------------- MMFQLLRGLDFLHSHRVVHRDLKPQNILVTSSGQIKLADFGLARIYSFQMALTSVVVTLW ------------1111------3333---1111---------------iiii------11 YRAPEVLLQSSYATPVDLWSVGCIFAEMFRRKPLFRGSSDVDQLGKILDVIGLPGEEDWP 11---1111---3333--------------------------------------3333-- RDVALPRQAFHSKSAQPIEKFVTDIDELGKDLLLKCLTFNPAKRISAYSALSHPYFQDLE 
-----3333-------3333-------------------3333--333311111111--- RCKEN ----- >Cyclin-dependent kinase 4; SWP:Q60773; PDB:1BLXB; VCVGDRLSGAAARGDVQEVRRLLHRELVHPDALNRFGKTALQVMMFGSPAVALELLKQGA ----------------------------1111-1111-3333--1111-------1111- SPNVQDASGTSPVHDAARTGFLDTLKVLVEHGADVNALDSTGSLPIHLAIREGHSSVVSF ----------3333--------------1111-1111-1111-3333------------- LAPESDLHHRDASGLTPLELARQRGAQNLMDILQGHMMIP 1111-1111-1111-------1111--------1111--- >IMMUNOGLOBULIN OPG2 FAB, ; SWP:NA; PDB:1BM3H; EVQLVQSGGGLVNPGRSLKLSCAASGFTFSSYGMSWVRQTPEKRLEWVAAISGGGTYIHY ------------2222-----------3333--------1111--------1111----- PDSVKGRFTISRDNAKNNLYLQMSSLRSEDTALYYCTRHAMDHWGQGTSVTVSAAKTTPP 3333----------------------3333------------------------------ SVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSS ----------------------------------%%%%---------------------- SVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC -------------------3333------------- >IMMUNOGLOBULIN OPG2 FAB, ; SWP:NA; PDB:1BM3L; DELLTQSPATLSVTPGDSVSLSCRASQSISNNLHWYQQKSHESPRLLIKYASQSISGIPS ------------------------------------------------------222233 RFSGSGSGTDFTLSINSVETEDFGMYFCQQSNSWPLTFGGGSKLEIKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT ----1111---------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -----3333--------3333------------- >MOLONEY MURINE LEUKEMIA V; SWP:P26807; PDB:1BM4A; CAKVKGITQGPNESPSAFLERLKEAYRRYTPY ----------%%%%----------3333---- >TRANSCRIPTION FACTOR MBP1; SWP:P39678; PDB:1BM8; QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETH ------iiii------3333----------------------------------1111-- EKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF -------3333------------------3333-3333- >REPLICATION TERMINATOR PR; SWP:P14382; PDB:1BM9A; EEKRSSTGFLVKQRAFLKLYMITMTEQERLYGLKLLEVLRSEFKEIGFKPNHTEVYRSLH ------------------------1111---------------3333---3333------ 
ELLDDGILKQIKVKKEGAKLQEVVLYQFKDYEAAKLYKKQLKVELDRCKKLIEKALSDNF --1111--------1111------------------------------------------ >BETA=2=-MICROGLOBULIN; SWP:P01888; PDB:1BMG; IQRPPKIQVYSRHPPEDGKPNYLNCYVYGFHPPQIEIDLLKNGEKIKSEQSDLSFSKDWS ----------------------------------------iiii---------------- FYLLSHAEFTPNSKDQYSCRVKHVTLEQPRIVKWDRDL ----------------------1111------------ >LQH III ALPHA-LIKE TOXIN; SWP:P56678; PDB:1BMR; VRDGYIAQPENCVYHCFPGSSGCDTLCKEKGGTSGHCGFKVGHGLACWCNALPDNVGIIV ----------------------------------------------------1111---- EGEKCHS ------- >METHIONINE SYNTHASE; SWP:P13009; PDB:1BMTA; QAEWRSWEVNKRLEYSLVKGITEFIEQDTEEARQQATRPIEVIEGPLMDGMNVVGDLFGE ----------------1111-----------------3333------------------- GKMFLPQVVKSARVMKQAVAYLEPFIEASKEQGKTNGKMVIATVKGDVHDIGKNIVGVVL ---3333--------------33331111--------------2222------------- QCNNYEIVDLGVMVPAEKILRTAKEVNADLIGLSGLITPSLDEMVNVAKEMERQGFTIPL -----------------------1111---------3333-------------------- LIGGATTSKAHTAVKIEQNYSGPTVYVQNASRTVGVVAALLSDTQRDDFVARTRKEYETV ---1111---------1111--------3333------1111------------------ RIQHGR ------ >HALOALKANE DEHALOGENASE; SWP:P59336; PDB:1BN7A; IGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIA -------------iiii-----------------------3333---33333333----- PDLIGMGKSDKPDLDYFFDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERV --2222----------3333---------1111----------------------3333- KGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRELIIDQNAFIEGVLPKCVVRPLTE -------------3333-3333--------------------3333----1111------ VEMDHYREPFLKPVDREPLWRFPNEIPIAGEPANIVALVEAYMNWLHQSPVPKLLFWGTP ------3333-33333333--1111--iiii--------------1111----------- GVLIPPAEAARLAESLPNCKTVDIGPGLHYLQEDNPDLIGSEIARWLPGLA -----------------------------3333-------------3333- >PECTATE LYASE; SWP:P39116; PDB:1BN8A; ADLGHQTLGSNDGWGAYSTGTTGGSKASSSNVYTVSNRNQLVSALGKETNTTPKIIYIKG -3333-------3333!!!!-!!!!--1111---------------1111---------- TIDMNVDDNLKPLGLNDYKDPEYDLDKYLKAYDPSTWGKKEPSGTQEEARARSQKNQKAR 
------1111---3333--1111---------3333------------------------ VMVDIPANTTIVGSGTNAKVVGGNFQIKSDNVIIRNIEFQDAYDYFPQWDPTDGSSGNWN ------------------------------------------------------------ SQYDNITINGGTHIWIDHCTFNDGSRPDSTSPKYYGRKYQHHDGQTDASNGANYITMSYN ----------------------!!!!3333---iiii----------------------- YYHDHDKSSIFGSSDSKTSDDGKLKITLHHNRYKNIVQRAPRVRFGQVHVYNNYYEGSTS -------------11113333------------------------------------111 SSSYPFSYAWGIGKSSKIYAQNNVIDVPGLSAAKTISVFSGGTALYDSGTLLNGTQINAS 1-----------2222----------22223333----2222---------iiii----- AANGLSSSVGWTPSLHGSIDASANVKSNVINQAGAGKLN 1111----------------3333--------------- >BOVINE NEUTROPHIL BETA-DE; SWP:P46170; PDB:1BNB; APLSCGRNGGVCIPIRCPVPMRQIGTCFGRPVKCCRSW -----3333----------------------------- >BRAIN DERIVED NEUROTROPHI; SWP:P23560; PDB:1BNDA; GQLSVCDSISEWVTAADKKTAVDMSGGTVTVLEKVPVSKGQLKQYFYETKCNPMGYTKEG -------------3333------------------------------------------- CRGIDKRHWNSQCRTTQSYVRALTMDSKKRIGWRFIRIDTSCVCTLTIK ------------------------------------------------- >COLLAGEN XVIII; SWP:P39060; PDB:1BNLA; HSHRDFQPVLHLVALNAPLSGGMRGIRGADFQCFQQARAVGLAGTFRAFLSSRLQDLYSI ---1111--------------------------------------------11113333- VRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVLRHPTWPQKSVW -3333-------1111-----3333---------2222---1111-11113333------ HGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAYIVLCIENSF ---1111--1111%%%%----3333-----1111---------1111----------- >MONOCYTE CHEMOATTRACTANT ; SWP:MCP3_HUMAN; PDB:1BO0; QPVGINTSTTCCYRFINKKIPKQRLESYRRTTSSHCPREAVIFKTKLDKEICADPTQKWV --------------------3333-------3333---------1111------------ QDFMKHLDKKTQTPKL ---------------- >PHOSPHATIDYLINOSITOL PHOS; SWP:P78356; PDB:1BO1A; KLFRASEPILSVLMWGVNHTINELSNVPVPVMLMPDDFKAYSKIKVDNHLFNKENLPSRF ------3333-------------1111------3333----------------------- KFKEYCPMVFRNLRERFGIDDQDYQNSVTRSAPINSDSQTRFLTTYDRRFVIKTVSSEDV --------------1111--------------------------1111-------3333- AEMHNILKKYHQFIVECHGNTLLPQFLGMYRLTVDGVETYMVVTRNVFSHRLTVHRKYDL 
----------------%%%%-------------%%%%----------------------- KGSTVAREASDKEKAKDLPTFKDNDFLNEGQKLHVGEESKKNFLEKLKRDVEFLAQLKIM -----------3333--------------------------------------------- DYSLLVGIHDVDRAEQEEMEVEERAEDEEFDPSVDVYAMKSHESSPKKEVYFMAIIDILT ---------------------------------3333----------------------- AGAEISTVNPEQYSKRFNEFMSNILT --------3333-------3333--- >PROTEIN (SERRATIA MARCESC; SWP:Q53396; PDB:1BO4A; GIIRTCRLGPDQVKSMRAALDLFGREFGDVATYSQHQPDSDYLGNLLRSKTFIALAAFDQ --------1111-----------------3333------------1111---------%% EAVVGALAAYVLPKFEQPRSEIYIYDLAVSGEHRRQGIATALINLLKHEANALGAYVIYV %%---------------------------1111--------------------------- QADYGDDPAVALYTKLG ----------------- >ANNEXIN I; SWP:P04083; PDB:1BO9A; TFNPSSDVAALHKAIMVKGVDEATIIDILTKRNNAQRQQIKAAYLQETGKPLDETLKKAL -----------------------1111------111133333333--------------- TGHLEEVVLALLK -------3333-- >HISTONE ACETYLTRANSFERASE; SWP:Q12341; PDB:1BOB; FKPETWTSSANEALRVSIVGENAVQFSPLFTYPIYGDSEKIYGYKDLIIHLAFDSVTFKP -3333---3333------------------3333-1111--------------------- YVNVKYSAKLGDDNIVDVEKKLLSFLPKDDVIVRDEAKWVDCFAEERKTHNLSDVFEKVS ----------------------1111------------------3333--3333------ EYSLNGEEFVVYKSSLVDDFARRMHRRVQIFSLLFIEAANYIDETDPSWQIYWLLNKKTK -----------------------------3333--2222---1111-------------- ELIGFVTTYKYWHYIDKKFRAKISQFLIFPPYQNKGHGSCLYEAIIQSWLEDKSITEITV ----------------------------1111-------------------1111----- EDPNEAFDDLRDRNDIQRLRKLGYDAVFQKHSDLSDEFLESSRKSLKLEERQFNRLVEML ------------------------3333------3333----------3333-------- LLLNNS ------ >ANTIBODY (CB 4-1); SWP:Q7TS98; PDB:1BOGA; DIKMTQSPSSMYTSLGERVTITCKASQDINSFLTWFLQKPGKSPKTLIYRANRLMIGVPS -------------2222-----------%%%%------2222----------------33 RFSGSGSGQTYSLTISSLEYEDMGIYYCLQYDDFPLTFGAGTKLDLKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGTASVVCFLNNFYPKEINVKWKIDGSERQNGVLDSWTEQDSKDSTYSMSSTLT 3333-------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC 
-----1111--------1111--------1111- >ANTIBODY (CB 4-1); SWP:NA; PDB:1BOGB; QDQLQQSGAELVRPGASVKLSCKALGYIFTDYEIHWVKQTPVHGLEWIGGIHPGSSGTAY ------------2222-----------1111----------------------------- NQKFKGKATLTADKSSTTAFMELSSLTSEDSAVYYCTRKDYWGQGTLVTVSAAKTTAPSV 3333---------1111---------1111------------------------------ YPLVPVCGGTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSGLYTLSSSV -------------------------------------------------iiii------- TVTSNTWPSQTITCNVAHPASSTKVDKKIEPRV --3333------------1111----------- >RHODANESE; SWP:P00586; PDB:1BOH; VHQVLYRALVSTKWLAESVRAGKVGPGLRVLDASWYSPGTREARKEYLERHVPGASFFDI ------------------------1111-----------------------2222---33 EECRDKASPYEVMLPSEAGFADYVGSLGISNDTHVVVYDGDDLGSFYAPRVWWMFRVFGH 33--1111---------------------1111-------1111--3333-----1111- RTVSVLNGGFRNWLKEGHPVTSEPSRPEPAIFKATLNRSLLKTYEQVLENLESKRFQLVD -----2222----1111-------------------3333-------------------- SRAQGRYLGTQPEPDAVGLDSGHIRGSVNMPFMNFLTEDGFEKSPEELRAMFEAKKVDLT --3333-----------------2222---3333--3333-------------------- KPLIATRKGVTACHIALAAYLCGKPDVAIYDGSWFEWFHRAPPETWVSQGKG ---------3333------1111------3333--------3333--3333- >RIBONUCLEASE RH; SWP:P08056; PDB:1BOLA; SSCSSTALSCSNSANSDTCCSPEYGLVVLNMQWAPGYGPDNAFTLHGLWPDKCSGAYAPS ---1111---1111--1111-------------2222--------------1111---11 GGCDSNRASSSIASVIKSKDSSLYNSMLTYWPSNQGNNNVFWSHEWSKHGTCVSTYDPDC 11-3333-----------------------------------------333333331111 YDNYEEGEDIVDYFQKAMDLRSQYNVYKAFSSNGITPGGTYTATEMQSAIESYFGAKAKI ----2222---------------------3333--------------------------- DCSSGTLSDVALYFYVRGRDTYVITDALSTGSCSGDVEYPTK --%%%%------------------------------------ >N-4 CYTOSINE-SPECIFIC MET; SWP:P11409; PDB:1BOOA; NFGKKPAYTTSNGSMYIGDSLELLESFPEESISLVMTSPPFALQRKKEYGNLEQHEYVDW ---------1111-----33333333--------------------------3333---- FLSFAKVVNKKLKPDGSFVVDFGGAYMKGVPARSIYNFRVLIRMIDEVGFFLAEDFYWFN ------------1111------------------3333---------------------- PSKLPSPIEWVNKRKIRVKDAVNTVWWFSKTEWPKSDITKVLASIPPNLLQISNSESNGQ 
--33333333--------------------------1111-------------------- YLANCKLMGIKAHPARFPAKLPEFFIRMLTEPDDLVVDIFGGSNTTGLVAERESRKWISF -----------------3333---------2222------!!!!------1111------ EMKPEYVAASAFRFLDNNISEEKITDIYNRILNGESLDLNSI ---------3333------3333-------1111---3333- >TRANSCRIPTION FACTOR PML; SWP:P29590; PDB:1BOR; EEEFQFLRCQQCQAEAKCPKLLPCLHTLCSGCLEASGMQCPICQAPWPLGADTPAL ---------------------------------------------3333------- >4,5-DIOXYGENASE ALPHA CHA; SWP:P22635; PDB:1BOUA; IDVHAYLAEFDDIPGTRVFTAQRARKGYNLNQFAMSLMKAENRERFKADESAYLDEWNLT ------------2222-------------------------------------1111--- PAAKAAVLARDYNAMIDEGGNVYFLSKLFSTDGKSFQFAAGSMTGMTQEEYAQMMIDGGR ---------------1111-3333-----1111-------1111---------------- SPAGVRSIKGGY -2222--3333- >Protocatechuate 4,5-dioxy; SWP:P22636; PDB:1BOUB; ARVTTGITSSHIPALGAAIQTGTSDNDYWGPVFKGYQPIRDWIKQPGNMPDVVILVYNDH -----------3333-------1111------------------2222------------ ASAFDMNIIPTFAIGCAETFKPADEGWGPRPVPDVKGHPDLAWHIAQSLILDEFDMTIMN -------------------------------------------------1111------- QMDVDHGCTVPLSMIFGEPEEWPCKVIPFPVNVVTYPPPSGKRCFALGDSIRAAVESFPE ----3333----------------------------------------------1111-- DLNVHVWGTGGMSHQLQGPRAGLINKEFDLNFIDKLISDPEELSKMPHIQYLRESGSEGV -----------------1111------------------3333---3333-----1111- ELVMWLIMRGALPEKVRDLYTFYHIPASNTALGAMILQPEETAGTPLEPRKVMSGHSL --------1111--------------!!!!--------3333------------1111 >MULTIDRUG-EFFLUX TRANSPOR; SWP:P39075; PDB:1BOWA; RLGEVFVLDEEEIRIIQTEAEGIGPENVLNASYSKLKKFIESNNSYGATFSFQPYTSIDE -----------------------33331111----3333--------------------- MTYRHIFTPVLISSITPDMEITTIPKGRYACIAYNFSPEHYFLNLQKLIKYIADRQLTVV ------------------------------------3333-------------------- SDVYELIIPIHYEYRVEMKIRIL ----------------------- >Prolactin receptor [Precu; SWP:P16471; PDB:1BP3B; LPPGKPEIFKCRSPNKETFTCWWRPGTDGTNYSLTYHREGETLMHECPDYITGGPNSCHF ------------------------------------------------------------ GKQYTSMWRTYIMMVNATNSSFSDELYVDVTYIVQPDPPLELAVEVKQPEDRKPYLWIKW 
3333------------------------3333---------------------------- SPPTLIDLKTGWFTLLYEIRLKPEKAAEWEIHFAGQQTEFKILSLHPGQKYLVQVRCKPD ---------------------------------!!!!----------------------- HGYWSAWSPATFIQIPS ----------------- >PAPAIN; SWP:P00784; PDB:1BP4; IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRS -----3333--------------3333-----------------------------1111 YGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNQGA !!!!---------3333----3333----------3333--------------------- LLYSIANQPVSVVLQAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPNYILIKNSWGTG -----------------3333------------------------------------111 WGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 1-iiii--------3333iiii---------- >DNA POLYMERASE BETA; SWP:P06766; PDB:1BPB; DDTSSSINFLTRVTGIGPSAARKLVDEGIKTLEDLRKNEDKLNHHQRIGLKYFEDFEKRI ----------------------1111-----------3333-------------1111-- PREEMLQMQDIVLNEVKKLDPEYIATVCGSFRRGAESSGDMDVLLTHPNFTSESSKQPKL 3333---------------1111------3333-------------33333333--3333 LHRVVEQLQKVRFITDTLSKGETKFMGVCQLPSENEYPHRRIDIRLIPKDQYYCGVLYFT -----------------------------------------------1111------333 GSDIFNKNMRAHALEKGFTINEYTIRPLGVTGVAGEPLPVDSEQDIFDYIQWRYREPKDR 3----------------------------------------3333-3333-----3333- SE -- >DNAK; SWP:P0A6Y8; PDB:1BPR; SIEGRVKDVLLLDVTPLSLGIETMGGVMTTLIAKNTTIPTKHSQVFSTAEDNQSAVTIHV ------------------------------------------------------------ LQGERKRAADNKSLGQFNLDGINPAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKIT --------------------------------------3333------------------ IKASSGLNEDEIQKMVRDAEANAEADRKFEELVQTRNQGDHLLHSTRKQVEEA ----------------------------------------------------- >AUREOLYSIN; SWP:P81177; PDB:1BQBA; AAATGTGKGVLGDTKDININSIDGGFSLEDLTHQGKLSAYNFNDQTGQATLITNEDENFV --------1111---------2222----------------------------------- KDDQRAGVDANYYAKQTYDYYKNTFGRESYDNHGSPIVSLTHVNHYGGQDNRNNAAWIGD 1111------------------------1111-------------iiii----------- KMIYGDGDGRTFTNLSGANDVVAHEITHGVTQQTANLEYKDQSGALNESFSDVFGYFVDD 
------------------------------------------------------------ EDFLMGEDVYTPGKEGDALRSMSNPEQFGQPSHMKDYVYTEKDNGGVHTNSGIPNKAAYN -----3333-2222---------3333-----3333-----%%%%--------------- VIQAIGKSKSEQIYYRALTEYLTSNSNFKDLKDALYQAAKDLYEQQTAEQVYEAWNEVGV ----------------------1111-----------------------------1111- E - >BETA-MANNANASE; SWP:NA; PDB:1BQCA; ATGLHVKNGRLYEANGQEFIIRGVSHPHNWYPQHTQAFADIKSHGANTVRVVLSNGVRWS ------iiii--1111----------33331111-------1111--------------- KNGPSDVANVISLCKQNRLICMLEVHDTTGYGEQSGASTLDQAVDYWIELKSVLQGEEDY --------------1111-------1111----2222------------33332222--- VLINIGNEPYGNDSATVAAWATDTSAAIQRLRAAGFEHTLVVDAPNWGQDWTNTMRNNAD ------------3333---------------1111---------%%%%-11113333--- QVYASDPTGNTVFSIHMYGVYSQASTITSYLEHFVNAGLPLIIGEFGHDHSDGNPDEDTI -----3333--------3333--------------------------1111--------- MAEAERLKLGYIGWSWSGNGGGVEYLDMVYNFDGDNLSPWGERIFYGPNGIASTAKEAVI -------------------33331111--%%%%-------------2222-------333 FG 3- >D-GLUCARATE DEHYDRATASE; SWP:P42206; PDB:1BQG; GAPVITDLKVVPVAGHDSMLLNLSGAHGPLFTRNILILTDSSGHVGVGEVPGGEGIRKTL ---------------------1111--------------1111----------------- EDARHLLINQSIGNYQSLLNKVRNAFADLRIAVHAVTAVESALLDLLGQHLQVPVAALLG --333322223333---------1111--------------------------3333--- EGQQRDAVEMLGYLFYVGDRNKTDLGYRSEHEADNEWFRLRNKEALTPESVVALAEAAYD ------------------1111-------------33331111----------------- RYGFKDFKLKGGVLRGEDEIAAVTALSERFPDARITLDPNGAWSLKEAVALCRDQHHVLA --------------3333-----------1111-----%%%%------------1111-- YAEDPCGAENGYSGREVMAEFRRSTGLRTATNMIATDWRQMGHAIQLQSVDIPLADPHFW --------iiii-------------------------------------------3333- TMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHVAAAAPGNITAIDTHWIWQDGQRL -------------------------------------1111---------3333------ TKEPLQIKGGLVEVPKKPGLGVELDWDALMKAHEVYKSM -------iiii---------------------------- >T-cell surface glycoprote; SWP:P01731; PDB:1BQHG; KPQAPELRIFPKKMDAELGQKVDLVCEVLGSVSQGCSWLFQNSSSKLPQPTFVVYMASSH ------------------------------------------------------------ 
NKITWDEKLNSSKLFSAMRDTNNKYVLTLNKFSKENEGYYFCSVISNSVMYFSSVVPVLQ -------3333---------------------1111---------%%%%----------- KV -- >PSEUDOAZURIN; SWP:P19567; PDB:1BQK; ADFEVHMLNKGKDGAMVFEPASLKVAPGDTVTFIPTDKGHNVETIKGMIPDGAEAFKSKI ----------1111-----------2222---------------2222-2222-----22 NENYKVTFTAPGVYGVKCTPHYGMGMVGVVQVGDAPANLEAVKGAKNPKKAQERLDAALA 22---------------33331111----------11113333----------------1 ALGN 111- >Matrix metalloproteinase-; SWP:P50281; PDB:1BQQM; IQGLKWQHNEITFCIQNYTPKVGEYATYEAIRKAFRVWESATPLRFREVPYAYIREGHEK ------------------3333--------------3333-------------------- QADIMIFFAEGFHGDSTPFDGEGGFLAHAYFPGPNIGGDTHFDSAEPWTVRNEDLNGNDI -----------------------------------2222---3333---%%%%------- FLVAVHELGHALGLEHSSDPSAIMAPFYQWMDTENFVLPDDDRRGIQQLYGGES 3333-----1111--------1111-------1111-----1111--------- >GP130; SWP:P40189; PDB:1BQUA; GLPPEKPKNLSCIVNEGKKMRCEWDGGRETHLETNFTLKSEWATHKFADCKAKRDTPTSC --------------2222-----------------------1111-------3333---- TVDYSTVYFVNIEVWVEAENALGKVTSDHINFDPVYKVKPNPPHNLSVINSLSSILKLTW -------------------1111---------3333------------------------ TNPSIKSVIILKYNIQYRTKDASTWSQIPPEDTASTRSSFTVQDLKPFTEYVFRIRCMKE ----1111----------1111------3333--------------------------11 DGKGYWSDWSEEASGITYEDRPSKEPSF 11-----------------3333----- >PLASMINOGEN ACTIVATOR; SWP:Q91516; PDB:1BQYA; VFGGDECNINEHRSLVVLFNSNGFLCGGTLINQDWVVTAAHCDSNNFQLLFGVHSKKILN -------11111111-----------------------1111------------------ EDEQTRDPKEKFFCPNRKKDDEVDKDIMLIKLDSSVSNSEHIAPLSLPSSPPSVGSVCRI -------------1111---1111----------------------------2222---- MGWGKTIPTKEIYPDVPHCANINILDHAVCRTAYSWRQVANTTLCAGILQGRDTCHFDSG ------1111--------------------------------------------2222-- GPLICNGIFQGIVSWGGHPCGQPGEPGVYTKVFDYLDWIKSIIAGNKDATCPP ------------------------------3333------------------- >MYOSIN; SWP:P10587; PDB:1BR1A; AQKPLSDDEKFLFVDKNFVNNPLAQADWSAKKLVWVPSEKHGFEAASIKEEKGDEVTVEL -----3333------------3333--------------------------!!!!----- 
QENGKKVTLSKDDIQKMNPPKFSKVEDMAELTCLNEASVLHNLRERYFSGLIYTYSGLFC ------------------1111----1111------------------------------ VVINPYKQLPIYSEKIIDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTGESGA ------------------------------3333----------------------2222 GKTENTKKVIQYLAVVASSHKGKQGPSFSYGELEKQLLQANPILEAFGNAKTVKNDNSSR -----------------------!!!!----3333----1111----------------- FGKFIRINFDVTGYIVGANIETYLLEKSRAIRQAKDERTFHIFYYLIAGASEQMRNDLLL ---------1111--------------------2222----------------------- EGFNNYTFLSNGHVPIPAQQDDEMFQETLEAMTIMGFTEEEQTSILRVVSSVLQLGNIVF ---------------2222--3333------------3333-----------3333---- KKERNTDQASMPDNTAAQKVCHLMGINVTDFTRSILTPRIKVGRDVVQKAQTKEQADFAI ---------------33333333------------------------------3333--- EALAKAKFERLFRWILTRVNKALDASFLGILDIAGFEIFEINSFEQLCINYTNEKLQQLF -----------------3333---------------------3333-------------- NHTMFILEQEEYQREGIEWNFIDFGLDLQPCIELIERPTNPPGVLALLDEECWFPKATDT -------------------------------------------------1111------- SFVEKLIQEQGNHAKFQKSKQLKDKTEFCILHYAGKVTYNASAWLTKNMDPLNDNVTSLL ------------1111-----3333------1111----------------------333 NQSSDKFVADLWKDVDRIVGLFRTVGQLYKEQLTKLMTTLRNTNPNFVRCIIPNHEKRAG 3-------------1111------1111--1111-----1111----------------- KLDAHLVLEQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILAANAIPKGFMDGKQACI -----------11113333-3333---------------3333------------3333- LMIKALELDPNLYRIGQSKIFFRTGVLAHLEEERDLKITDVIIAFQAQCRGYLARKAFAK ---1111---------------2222---------------------------------- RQQQLGS -3333-- >Myosin light polypeptide ; SWP:P02607; PDB:1BR1B; FSEEQTAEFKEAFQLFDRTGDGKILYSQCGDVMRALGQNPTNAEVMKVLGNPKSDEMNLK -----------3333------------------------------1111-----1111-- TLKFEQFLPMMQTIAKNKDQGCFEDYVEGLRVFDKEGNGTVMGAEIRHVLVTLGEKMTEE --33333333---------------33331111-----------3333---------333 EVEQLVAGHEDSNGCINYEELVRMVLSG 3----2222-1111-------------- >MYOSIN; SWP:P10587; PDB:1BR2A; LVWVPSEKHGFEAASIEVTVELQENGKKVTLSKDDIQKMNPPKFSKVEDMAELTCLNEAS -------------------------------3333-----3333----3333----3333 
VLHNLRERYFSGLIYTYSGLFCVVINPYKQLPIYSEKIIDMYKGKKRHEMPPHIYAIADT --------1111-----------------------------22223333----------- AYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAVVASGELEKQLLQANPILEAFGNA ------------------2222-------------------------------------- KTVKNDNSSRFGKFIRINFDVTGYIVGANIETYLLEKSRAIRQAKDERTFHIFYYLIAGA -3333--------------3333------------3333----------3333------- SEQMRNDLLLEGFNNYTFLSNGHVPIPAQQDDEMFQETLEAMTIMGFTEEEQTSILRVVS 3333-------11111111----------3333--------------------------- SVLQLGNIVFKKEQASMPDNTAAQKVCHLMGINVTDFTRSILTPKAQTKEQADFAIEALA ---3333-------------------------3333------------1111-------- KAKFERLFRWILTRVNKALDASFLGILDIAGFEIFEINSFEQLCINYTNEKLQQLFNHTM ------------------------------------------------------------ FILEQEEYQREGIEWNFIDFGLDLQPCIELIERPTNPPGVLALLDEECATDTSFVEKLIQ --------1111------------------------------------------------ EQGNHAKFQKSKTEFCILHYAGKVTYNASAWLTKNMDPLNDNVTSLLNQSSDKFVADLWK ------------------1111-----------------------3333----------- RTVGQLYKEQLTKLMTTLRNTNPNFVRCIIPNHEKRAGKLDAHLVLEQLRCNGVLEGIRI ----------------3333-----------------------------1111------- CRQGFPNRIVFQEFRQRYEILAANAIPKGFMDGKQACILMIKALELDPNLYRIGQSKIFF -----------------33331111----------------1111-3333---1111--- RTGVLAHLEEERD 2222--------- >METALLOPROTEINASE-2 INHIB; SWP:P16035; PDB:1BR9; CSCSPVHPQQAFCNADVVIRAKAVSEKEVDSGNDIYGNPIKRIQYEIKQIKMFKGPEKDI ---------------------------------1111----------------------- EFIYTAPSSAVCGVSLDVGGKKEYLIAGKAEGDGKMHITLCDFIVPWDTLSTTQKKSLNH -------3333--------------------%%%%---1111---1111------3333- RYQMGCECKITRCPMIPCYISSPDECLWMDWVTEKNINGHQAKFFACIKRSDGSCAWYRG 3333------------------------3333------3333------------------ AA -- >RUBREDOXIN; SWP:P24297; PDB:1BRFA; AKWVCKICGYIYDEDAGDPDNGISPGTKFEELPDDWVCPICGAPKSEFEKLED ------------3333-3333--22223333-1111-------3333------ >BROMOPEROXIDASE A2; SWP:P29715; PDB:1BRT; PFITVGQENSTSIDLYYEDHGTGQPVVLIHGFPLSGHSWERQSAALLDAGYRVITYDRRG -------!!!!-----------------------33333333----1111-------222 
FGQSSQPTTGYDYDTFAADLNTVLETLDLQDAVLVGFSTGTGEVARYVSSYGTARIAKVA 2---------------------------------------------------1111---- FLASLEPFLLKTDDNPDGAAPQEFFDGIVAAVKADRYAFYTGFFNDFYNLDENLGTRISE -----------1111-----3333-----------3333---------33332222---- EAVRNSWNTAASGGFFAAAAAPTTWYTDFRADIPRIDVPALILHGTGDRTLPIENTARVF ---------1111------33331111-11111111--------1111---3333----- HKALPSAEYVEVEGAPHGLLWTHAEEVNTALLAFLAK ---3333------------------------------ >Elastase-2A [Precursor]; SWP:P08419; PDB:1BRUP; VVGGEDARPNSWPWQVSLQYDSS -------22221111-------- >PYRIMIDINE NUCLEOSIDE PHO; SWP:P77836; PDB:1BRWA; MRMVDLIAKKRDGKALTKEEIEWIVRGYTNGDIPDYQMSALAMAIYFRGMTEEETAALTM -3333----------------------------3333----------------------- AMVQSGEMLDLSSIRGVKVDKHSTGGVGDTTTLVLGPLVASVGVPVAKMSGRGLGHTGGT ----------3333----------------3333-----1111----------!!!!--- IDKLESVPGFHVEISKDEFIRLVNENGIAIIGQTGDLTPADKKLYALRDVTATVNSIPLI ------2222---------------------------3333-----3333---------- ASSIMSKKIAAGADAIVLDVKTGAGAFMKKLDEARRLARVMVDIGKRVGRRTMAVISDMS --------3333----------1111---3333------------1111----------- QPLGYAVGNALEVKEAIETLKGNGPHDLTELCLTLGSHMVYLAEKAPSLDEARRLLEEAI ------------------1111------------------1111---------------- RSGAAIAAFKTFLAAQGGDASVVDDLDKLPKAAYTSTVTAAADGYVAEMAADDIGTAAMW -------------1111----11111111------------------------------- LGAGRAKKEDVIDLAVGIVLHKKIGDRVQKGEALATIHSNRPDVLDVKEKIEAAIRLSPQ ------1111--1111------------2222-----------3333----3333----- PVARPPLIYETIV ------------- >Ribosome-inactivating pro; SWP:P33185; PDB:1BRYY; DVSFRLSGATTTSYGVFIKNLREALPYERKVYNIPLLRSSISGSGRYTLLHLTNYADETI -----2222------------1111-----%%%%--------3333-------1111--- SVAVDVTNVYIMGYLAGDVSYFFNEASATEAAKFVFKDAKKKVTLPYSGNYERLQTAAGK ------------------------------3333-1111--------------------- IRENIPLGLPALDSAITTLYYYTASSAASALLVLIQSTAESARYKFIEQQIGKRVDKTFL 3333------------------1111---------------------------------- PSLATISLENNWSALSKQIQIASTNNGQFESPVVLIDGNNQRVSITNASARVVTSNIALL 
------------------------iiii--------1111------11113333------ LNRNNIA -3333-- >8-AMINO-7-OXONANOATE SYNT; SWP:P12998; PDB:1BS0A; SWQEKINAALDARRAADALRRRYPVAQGAGRWLVADDRQYLNFSSNDYLGLSHHPQIIRA -------------1111-----------------------------11111111------ WQQGAEQFGIGSGGSGHVSGYSVVHQALEEELAEWLGYSRALLFISGFAANQAVIAAMMA --------------1111-----------------------------------------3 KEDRIAADRLSHASLLEAASLSPSQLRRFAHNDVTHLARLLASPCPGQQMVVTEGVFSMD 333----11113333--------------2222--------------------------- GDSAPLAEIQQVTQQHNGWLMVDDAHGTGVIGEQGRGSCWLQKVKPELLVVTFGKGFGVS -------------1111---------2222-2222-3333--------------3333-- GAAVLCSSTVADYLLQFARHLIYSTSMPPAQAQALRASLAVIRSDEGDARREKLAALITR -----------------3333------3333----------------------------- FRAGVQDLPFTLADSCSAIQPLIVGDNSRALQLAEKLRQQGCWVTAIRPPTVPAGTARLL --3333----------------------------------------------2222---- TLTAAHEMQDIDRLLEVLHGNG --11113333------------ >BETA LACTAMASE; SWP:P14559; PDB:1BSG; SDAERRLAGLERASGARLGVYAYDTGSGRTVAYRADELFPMCSVFKTLSSAAVLRDLDRN ---------------------------------1111----------------------- GEFLSRRILYTQDDVEQADGAPETGKPQNLANGMTVEELCEVSITASDNCAANLMLRELG 3333-----------3333---1111-----------------1111------------- GPAAVTRFVRSLGDRVTRLDRWEPELNSAEPGRVTDTTSPRAITRTYGRLVLGDALNPRD ---------1111----------------1111--------------------------- RRLLTSWLLANTTSGDRFRAGLPDDWTLGDKTGAGRYGTNNDAGVTWPPGRAPIVLTVLT -------1111--11113333-1111---------%%%%--------------------- AKTEQDAARDDGLVADAARVLAETLG ---1111------------------- >SUPEROXIDE DISMUTASE; SWP:P80293; PDB:1BSMA; AVYTLPELPYDYSALEPYISGEIMELHHDKHHKAYVDGANTALDKLAEARDKADFGAINK ----------1111----------------------------------------1111-- LEKDLAFNLAGHVNHSVFWKNMAPKGSAPERPTDELGAAIDEFFGSFDNMKAQFTAAATG ------------------------------------------------------------ IQGSGWASLVWDPLGKRINTLQFYDHQNNLPAGSIPLLQLDMWEHAFYLQYKNVKGDYVK ------------------------------2222--------3333----!!!!------ SWWNVVNWDDVALRFSEARVA 3333----------------- >UBIQUITIN-LIKE PROTEIN 7,; SWP:Q9SHE7; PDB:1BT0A; 
MLIKVKTLTGKEIEIDIEPTDTIDRIKERVEEKEGIPPVQQRLIYAGKQLADDKTAKDYN ------1111-------1111---------------3333----%%%%--11113333-- IEGGSVLHLVLAL -2222-------- >CATECHOL OXIDASE; SWP:Q9ZP19; PDB:1BT3A; APIQAPEISKCVVPPADLPPGAVVDNCCPPVASNIVDYKLPAVTTMKVRPAAHTMDKDAI ------3333----1111--------------------------------3333------ AKFAKAVELMKALPADDPRNFYQQALVHCAYCNGGYDQVNFPDQEIQVHNSWLFFPFHRW -------------1111----3333------------1111---------1111------ YLYFYERILGKLIGDPSFGLPFWNWDNPGGMVLPDFLNDSTSSLYDSNRNQSHLPPVVVD --------------1111-----11111111--3333-1111-------1111------1 LGYNGADTDVTDQQRITDNLALMYKQMVTNAGTAELFLGKAYRAGDAPSPGAGSIETSPH 111------------------------1111-3333------2222-------------- IPIHRWVGDPRNTNNEDMGNFYSAGRDIAFYCHHSNVDRMWTIWQQLARDYTDSDWLNAT --------3333----1111--11113333------------3333------3333---- FLFYDENGQAVKVRIGDSLDNQKMGYKYAKTPLPWL ----1111-----3333--3333---------1111 >ACTIVIN RECEPTOR TYPE II; SWP:P27038; PDB:1BTEA; ETQECLFFNANWERDRTNQTGVEPCYGRRHCFATWKNISGSIEIVKQGCWLDDINCYDRT --------11111111---------------------iiii------------------- DCIEKKDSPEVYFCCCEGNMCNEKFSYFPEME -----------------2222-------3333 >BRUTON'S TYROSINE KINASE; SWP:Q06187; PDB:1BTKA; AAVILESIFLKRSQQKKKTSPLNFKKCLFLLTVHKLSYYEYDFERGRRGSKKGSIDVEKI ----------------1111----------------------1111---------3333- TCVETVVPEKNPPPERQIMEQISIIERFPYPFQVVYDEGPLYVFSPTEELRKRWIHQLKN ------------3333---3333------------1111--------------------- VIRYNSDLVQKYHPCFWIDGQYLCCSQTAKNAMGCQILEN -1111------------iiii-------1111-------- >BETA-SPECTRIN; SWP:Q62261; PDB:1BTN; MEGFLNRKHEWEAHNKKASSRSWHNVYCVINNQEMGFYKDAKSAASGIPYHSEVPVSLKE ------------%%%%--------------%%%%-------------------------- AICEVALDYKKKKHVFKLRLSDGNEYLFQAKDDEEMNTWIQAISSA -----3333----------1111-------------------1111 >HEMOPOIETIC CELL KINASE; SWP:P08631; PDB:1BU1A; IIVVALYDYEAIHHEDLSFQKGDQMVVLEESGEWWKARSLATRKEGYIPSNYVARVD -------------------2222-------!!!!--------------1111----- >CALCIUM-BINDING PROTEIN; SWP:P56503; PDB:1BU3; 
AFSGILADADVAAALKACEAADSFNYKAFFAKVGLTAKSADDIKKAFFVIDQDKSGFIEE ---------------11112222----------3333-------------1111----33 DELKLFLQVFSAGARALTDAETKAFLKAGDSDGDGAIGVDEWAALVKA 33----33331111---------------1111--------------- >GLYCEROL KINASE; SWP:P08859; PDB:1BU6O; KKYIVALDQGTTSSRAVVMDHDANIISVSQREFEQIYPKPGWVEHDPMEIWATQSSTLVE -------------------1111---------------2222------------------ VLTKADISSDQIAAIGITNQRETTIVWEKETGKPIYNAIVWQCRRTAEICEHLKRDGLED -3333--1111----------------------------3333----3333--1111--- YIRSNTGLVIDPYFSGTKVKWILDHVEGSRERARRGELLFGTVDTWLIWKMTQGRVHVTD -------------------------------3333----------------iiii----- YTNASRTMLFNIHTLDWDDKMLEVLDIPREMLPEVRRSSEVYGQTNIGGKGGTRIPISGI 3333-----------------------1111----------------------------- AGDQQAALFGQLCVKEGMAKNTYGTGCFMLMNTGEKAVKSENGLLTTIACGPTGEVNYAL --------1111--2222--------------!!!!------------------------ EGAVFMAGASIQWLRDEMKLINDAYDSEYFATKVQNTNGVYVVPAFTGLGAPYWDPYARG -----------------------------1111---iiii-------------------- AIFGLTRGVNANHIIRATLESIAYQTRDVLEAMQADSGIRLHALRVDGGAVANNFLMQFQ -----1111--------------------------------------------------- SDILGTRVERPEVREVTALGAAYLAGLAVGFWQNLDELQEKAVIEREFRPGIETTERNYR --------------3333---------------3333-3333------------------ YAGWKKAVKRAMAWEEH ----------------- >PANCREATIC LIPASE RELATED; SWP:P54318; PDB:1BU8A; KEVCYGHLGCFSNDKPWAGMLQRPLKIFPWSPEDIDTRFLLYTNENPNNYQKISATEPDT ----!!!!-----------3333--------3333-------3333-----------333 IKFSNFQLDRKTRFIVHGFIDKGEDGWLLDMCKKMFQVEKVNCICVDWRRGSRTEYTQAS 3-----1111----------2222----------3333---------3333--------- YNTRVVGAEIAFLVQVLSTEMGYSPENVHLIGHSLGAHVVGEAGRRLEGHVGRITGLDPA -----------------------3333-----------------1111------------ EPCFQGLPEEVRLDPSDAMFVDVIHTDSAPIIPYLGFGMSQKVGHLDFFPNGGKEMPGCQ 2222---3333--1111-------------------------------2222---2222- KNILSTIVDINGIWEGTQNFVACNHLRSYKYYASSILNPDGFLGYPCSSYEKFQQNDCFP --------33331111-----3333------------3333-----------1111---- CPEEGCPKMGHYADQFEGKTATVEQTVYLNTGDSGNFTRWRYKVSVTLSGAKKLSGYILV 
-1111----1111--1111--------------!!!!----------------------- ALYGNNGNSKQYEIFKGSLKPEARHVRDIDVDINVGEIQKVKFLWNNRPTLGASQITVQS ---1111------------2222------------------------------------- GVD --- >BUTYRYL-COA DEHYDROGENASE; SWP:Q06319; PDB:1BUCA; MDFNLTDIQQDFLKLAHDFGEKKLAPTVTERDHKGIYDKELIDELLSLGITGAYFEEKYG -----------------------3333-----------------11113333---1111- GSGDDGGDVLSYILAVEELAKYDAGVAITLSATVSLCANPIWQFGTEAQKEKFLVPLVEG -1111--3333------------------------------------------------- TKLGAFGLTEPNAGTDASGQQTIATKNDDGTYTLNGSKIFITNGGAADIYIVFAMTDKSK ---------1111--3333-------1111----------2222---------------- GNHGITAFILEDGTPGFTYGKKEDKMGIHTSQTMELVFQDVKVPAENMLGEEGKGFKIAM 1111------2222-------------1111------------1111---2222------ MTLDGGRIGVAAQALGIAEAALADAVEYSKQRVQFGKPLCKFQSISFKLADMKMQIEAAR ---------------------------------iiii1111------------------- NLVYKAACKKQEGKPFTVDAAIAKRVASDVAMRVTTEAVQIFGGYGYSEEYPVARHMRDA -----------------------------------------!!!!--------------3 KITQIYEGTNEVQLMVTGGALLR 333-----3333-------1111 >ACUTOLYSIN A; SWP:NA; PDB:1BUDA; FQRYMEIVIVVDHSMVKKYNGDSDSIKAWVYEMINTITESYSYLKIDISLSGLEIWSGKD ------------------%%%%-----------------3333----------------- LIDVEASAGNTLKSFGEWRAKDLIHRISHDNAQLLTATDFDGATIGLAYVASMCNPKRSV ---------------------3333--------------------------2222----- GVIQDHSSVNRLVAITLAHEMAHNLGVSHDEGSCSCGGKSCIMSPSISDETIKYFSDCSY --------3333------------------!!!!------1111---------------- IQCRDYISKENPPCILN -----------3333-- >IMIPENEM-HYDROLYSING BETA; SWP:P52663; PDB:1BUEA; NTKGIDEIKNLETDFNGRIGVYALDTGSGKSFSYRANERFPLCSSFKGFLAAAVLKGSQD -2222-----------------------------1111---!!!!--------------- NRLNLNQIVNYNTRSLEFHSPITTKYKDNGMSLGDMAAAALQYSDNGATNIILERYIGGP ---1111---1111-------3333----------------------------------- EGMTKFMRSIGDEDFRLDRWELDLNTAIPGDERDTSTPAAVAKSLKTLALGNILSEHEKE -------1111----------------2222----------------------------- TYQTWLKGNTTGAARIRASVPSDWVVGDKTGSCGAYGTANDYAVVWPKNRAPLIISVYTT -----1111--11111111-1111----------%%%%---------------------- 
KNEKEAKHEDKVIAEASRIAIDNLK --1111--3333------------- >BETA2-BUNGAROTOXIN; SWP:P00617; PDB:1BUNA; NLINFMEMIRYTIPCEKTWGEYADYGCYCGAGGSGRPIDALDRCCYVHDNCYGDAEKKHK 3333----1111-33333333-----------------3333------------------ CNPKTQSYSYKLTKRTIICYGAAGTCARIVCDCDRTAALCFGNSEYIEGHKNIDTARFCQ -1111-------%%%%---------3333-----------1111--3333---3333--- >Beta bungarotoxin B2 chai; SWP:P00989; PDB:1BUNB; RKRHPDCDKPPDTKICQTVVRAFYYKPSAKRCVQFRYGGCNGNGNHFKSDHLCRCECLEY ---1111------------------1111------------------------------- R - >PROMYELOCYTIC LEUKEMIA ZI; SWP:Q05516; PDB:1BUOA; MGMIQLQNPSHPTGLLCKANQMRLAGTLCDVVIMVDSQEFHAHRTVLACTSKMFEILFHR -------1111---------------------------------------------3333 NSQHYTLDFLSPKTFQQILEYAYTATLQAKAEDLDDLLYAAEILEIEYLEEQCLKMLETI -----------------------------1111--------------------------- Q - >70 KILODALTON HEAT SHOCK ; SWP:P19120; PDB:1BUPA; GPAVGIDLGSTYSCVGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVAMNP ------------------%%%%-----1111------------------3333-333333 TNTVFDAKRLIGRRFDDAVVQSDMKHWPFMVVNDAGRPKVQVEYKGETKSFYPEEVSSMV 33---333322221111------1111------iiii------iiii----3333----- LTKMKEIAEAYLGKTVTNAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAY -----------------------1111------------------------------111 GLDKKVGAERNVLIFDLGGGTFDVSILTIEDGIFEVKSTAGDTHLGGEDFDNRMVNHFIA 1----------------------------iiii------------3333----------- EFKRKHKKDISENKRAVRRLRTACERAKRTLSSSTQASIEIDSLYEGIDFYTSITRARFE ---------1111----------------1111-----------iiii------------ ELNADLFRGTLDPVEKALRDAKLDKSQIHDIVLVGGSTRIPKIQKLLQDFFNGKELNKSI ------------------1111-3333-------3333----------1111-------- NPDEAVAYGAAVQAAILS 1111-------------- -------------------------------------------------------- >BET V 1; SWP:P15494; PDB:1BV1; GVFNYETETTSVIPAARLFKAFILDGDNLFPKVAPQAISSVENIEGNGGPGTIKKISFPE ------------------------3333--------------------2222-------- GLPFKYVKDRVDEVDHTNFKYNYSVIEGGPIGDTLEKISNEIKIVATPDGGSILKISNKY ---------------1111--------!!!!---------------1111---------- 
HTKGDHEVKAEQVKASKEMGETLLRAVESYLLAHSDAYN --!!!!---------------------------1111-- >ALPHA-2-MACROGLOBULIN; SWP:P01023; PDB:1BV8A; EEFPFALGVQTLPQTCDEPKAHTSFQISLSVSYTGSRSASNMAIVDVKMVSGFIPLKPTV -----------------3333------------------------------------333 KMLERSNHVSRTEVSSNHVLIYLDKVSNQTLSLFFTVLQDVPVRDLKPAIVKVYDYYETD 3----------------------------------------------------------- EFAIAEYNAPCSKDLGNA ------------------ >HULYS11; SWP:P00698; PDB:1BVKA; DIQMTQSPSSLSASVGDRVTITCRASGNIHNYLAWYQQKPGKAPKLLIYYTTTLADGVPS ----------------------------iiii------2222------------222233 RFSGSGSGTDYTFTISSLQPEDIATYYCQHFWSTPRTFGQGTKVEIKR 33----!!!!--------1111-------------------------- >Lysozyme C [Precursor]; SWP:P00698; PDB:1BVKB; QVQLQESGPGLVRPSQTLSLTCTVSGFSLTGYGVNWVRQPPGRGLEWIGMIWGDGNTDYN ---------------------------1111--------------------1111----- SALKSRVTMLKDTSKNQFSLRLSSVTAADTAVYYCARERDYRLDYWGQGSLVTVSS -------------------------------------------------------- >TRANSCRIPTION FACTOR GAMB; SWP:Q17034; PDB:1BVOA; PYVEITEQPHPKALRFRYECEGRSAGSIPGVNTTAEQKTFPSIQVHGYRGRAVVVVSCVT -----------------3333--------1111--------------------------- KEGPEHKPHPHNLVGKEGCKKGVCTVEINSTTMSYTFNNLGIQCVKKKDVEEALRLRQEI -------------------iiii----------------------3333----------- RVDPFRTGFGHAKEPGSIDLNAVRLCFQVFLEGQQRGRFTEPLTPVVSDIIYDKK --1111--3333-1111-----------------2222----------------- >VP7 core protein; SWP:P69361; PDB:1BVP1; MDTIAARALTVMRACATLQEARIVLEANVMEILGIAINRYNGLTLRGVTMRPTSLAQRNE --------------3333-------3333------------------------------- MFFMCLDMMLSAAGINVGPISPDYTQHMATIGVLATPEIPFTTEAANEIARVTGETSTWG -----------------------------------1111---------------1111-- PARQPYGFFLETEETFQPGRWFMRAAQAVTAVVCGPDMIQVSLNAGARGDVQQIFQGRND ------1111------2222---2222----------------2222----3333----- PMMIYLVWRRIENFAMAQGNSQQTQAGVTVSVGGVDMRAGRIIAWDGQAALHVHNPTQQN ---------------3333------------iiii------------------------- AMVQIQVVFYISMDKTLNQYPALTAEIFNVYSFRDHTWHGLRTAILNRTTLPNMLPPIFP 
---------------1111--------------------------1111----------- PNDRDSILTLLLLSTLADVYTVLRPEFAIHGVNPMPGPLTRAIARAAYV ----------------------------2222------------1111- >HOLLIDAY JUNCTION DNA HEL; SWP:P40832; PDB:1BVSA; MIFSVRGEVLEVALDHAVIEAAGIGYRVNATPSALATLNQGSQARLVTAMVVREDSMTLY --------------------iiii-------3333------------------------- GFSDAENRDLFLALLSVSGVGPRLAMATLAVHDAAALRQALADSDVASLTRVPGIGRRGA ------------3333-----------3333----------------------------- ERIVLELADKVGPVNAVRGSVVEALVGLGFAAKQAEEATDQVLDGEATSSALRAALSLLG ------------------------3333--3333--------------3333--1111-- KTR --- >GLUTAMATE DEHYDROGENASE; SWP:Q56304; PDB:1BVUA; QDPFEIAVKQLERAAQYMDISEEALEFLKRPQRIVEVSIPVEMDDGSVKVFTGFRVQYNW --------------1111------------------------1111-------------1 ARGPTKGGIRWHPEETLSTVKALAAWMTWKTAVMDLPYGGGKGGVICNPKEMSDREKERL 111------------3333----------------------------3333--------- ARGYVRAIYDVISPYTDIPAPDVYTNPQIMAWMMDEYETISRRKDPSFGVITGKPPSVGG -------3333----------2222--------------------3333-----3333-- IVARMDATARGASYTVREAAKALGMDLKGKTIAIQGYGNAGYYMAKIMSEEYGMKVVAVS --------------------------2222--------3333------------------ DTKGGIYNPDGLNADEVLAWKKKTGSVKDFPGATNITNEELLELEVDVLAPSAIEEVITK -------1111------------------2222-------1111---------------- KNADNIKAKIVAELANGPTTPEADEILYEKGILIIPDFLCNAGGVTVSYFEWVQNITGDY --1111-----------------------------------3333--------------- WTVEETRAKLDKKMTKAFWDVYNTHKEKNINMRDAAYVVAVSRVYQAMKDRGWIKK -------------------------------------------------------- >Bifunctional P-450:NADPH-; SWP:P14779; PDB:1BVYF; NTPLLVLYGSNMGTAEGTARDLADIAMSKGFAPQVATLDSHAGNLPREGAVLIVTASYNG -------------------------3333-----------2222-------------iii HPPDNAKQFVDWLDQASADEVKGVRYSVFGCGDKNWATTYQKVPAFIDETLAAKGAENIA i-11113333----------2222--------33331111-------------------- DRGEADASDDFEGTYEEWREHMWSDVAAYFNL -----11113333------------------- >TYROSINE AMINOTRANSFERASE; SWP:P33447; PDB:1BW0A; WDVSMSNHAGLVFNPIRTVSDNAKPSPSPKPIIKLSVGDPTLDKNLLTSAAQIKKLKEAI 
-----3333-------------------------------1111---------------- DSQECNGYFPTVGSPEAREAVATWWRNSFVHKEELKSTIVKDNVVLCSGGSHGILMAITA ---------1111------------------333311113333----------------- ICDAGDYALVPQPGFPHYETVCKAYGIGMHFYNCRPENDWEADLDEIRRLKDDKTKLLIV --2222-------------------------------%%%%----------1111----- TNPSNPCGSNFSRKHVEDIVRLAEELRLPLFSDEIYAGMVFKGKDPNATFTSVADFETTV ---------------------------------1111-------1111---3333----- PRVILGGTANLVVPGWRLGWLLYVDPHGNGPSFLEGLKRVGMLVCGPCTVVQAALGEALL ------------1111--------1111-------------------3333--------- NTPQEHLDQIVAKIEESAMYLYNHIGECIGLAPTMPRGAMYLMSRIDLEKYRDIKTDVEF --3333-----------------33332222---------------3333---------- FEKLLEEENVQVLPGTIFHAPGFTRLTTTRPVEVYREAVERIKAFCQRHAA -------------3333--2222-------3333------------1111- >INOSAMINE-PHOSPHATE AMIDI; SWP:P08078; PDB:1BWDA; RSLVSVHNEWDPLEEVIVGTAVGARVPTADRSVFAVEYAGDYESQEQIPSGAYPDRVLKE -------------------------------------------3333------3333--- TEEELHVLAAELTKLGVTVRRPGPRDHSALIKTPDWETDGFHDYCPRDGLLSVGQTIIET --------------------------------3333--------3333------------ PMALRSRFLESLAYKDLLLEYFASGSRWLSAPKPRLTDDSYAPQAPAGERLTDEEPVFDA ---3333-1111------------------------3333-33332222---------11 ANVLRFGTDLLYLVSDSGNELGAKWLQSAVGDTYTVHPCRKLYASTHVDSTIVPLRPGLV 11----------------------------3333-------------1111----2222- LTNPSRVNDENMPDFLRSWENITCPELVDIGFTGDKPHCSVWIGMNLLVVRPDLAVVDRR --1111-1111-1111-------------------------3333--------------- QTALIRLLEKHGMNVLPLQLTHSRTLGGGFHCATLDVRRTGALETYQF --------1111---------1111----------------------- >NADPH DEHYDROGENASE 1; SWP:Q02899; PDB:1BWKA; SFVKDFKPQALGDTNLFKPIKIGNNELLHRAVIPPLTRMRALHPGNIPNRDWAVEYYTQR ----------11111111---!!!!-----------------------1111------11 AQRPGTMIITEGAFISPQAGGYDNAPGVWSEEQMVEWTKIFNAIHEKKSFVWVQLWVLGW 112222---------3333--1111----3333------------------------!!! 
AAFPDNLARDGLRYDSASDNVFMDAEQEAKAKKANNPQHSLTKDEIKQYIKEYVQAAKNS !----------------------------------------------------------- IAAGADGVEINSANGYLLNQFLDPHSNTRTDEYGGSIENRARFTLEVVDALVEAIGHEKV ------------%%%%---------------------------------------1111- GLRLSPYGVFNSMSGGAETGIVAQYAYVAGELEKRAKAGKRLAFVHLVEPRVTNPFLTEG ----1111------3333------------------------------3333-1111222 EGEYEGGSNDFVYSIWKGPVIRAGNFALHPEVVREEVKDKRTLIGYGRFFISNPDLVDRL 2-------3333-------------1111-----1111--------1111---------- EKGLPLNKYDRDTFYQMSAHGYIDYPTYEEALKLGWDKS ---------3333-----2222----------------- >ALPHA-BETA T CELL RECEPTO; SWP:A2NTY6; PDB:1BWMA; AVTQSPRNKVAVTGGKVTLSCNQTNNHNNMYWYRQDTGHGLRLIHYSYGAGSTEKGDIPD ------------------------------------------------------------ GYKASRPSQENFSLILELATPSQTSVYFCASGGQGRAEQFFGPGTRLTVLGSDYKDDDDK -------1111------------------------------------------------- RSGGGGSGGGGSGGSGAQQQVRQSPQSLTVWEGTTILNCSYEDSTFDYFPWYRQFPGKSP ------------------------------------------------------------ ALLIAISLVSNKKEDGRFTIFFNKREKKLSLHITDSQPGDSATYFCAATGSFNKLTFGAG ------------------------------------------------------------ TRLAVSPY -------- >NONSPECIFIC LIPID-TRANSFE; SWP:P24296; PDB:1BWOA; IDCGHVDSLVRPCLSYVQGGPGPSGQCCDGVKNLHNQARSQSDRQSACNCLKGIARGIHN ---------3333--1111-----------------------------------1111-- LNEDNARSIPPKCGVNLPYTISLNIDCSRV ----------1111-------11111111- >AGGLUTININ; SWP:Q38789; PDB:1BWUA; RNILRNDEGLYGGQSLDVNPYHFIMQEDCNLVLYDHSTSVWASNTGILGKKGCRAVLQSD -------------------------1111-----!!!!--------2222-------111 GNFVVYDAEGRSLWASHSVRGNGNYVLVLQEDGNVVIYRSDIWSTN 1----------------------------1111------------- >II lectin [Precursor] [Fr; SWP:Q38785; PDB:1BWUD; RNILTNDEGLYGGQSLDVNPYHLIMQEDCNLVLYDHSTAVWSSNTDIPGKKGCKAVLQSD ----------2222---!!!!-------------------------2222-------111 GNFVVYDAEGASLWASHSVRGNGNYVLVLQEDGNVVIYRSDIWSTNTYR 1----------------------------1111---------------- >RIBULOSE BISPHOSPHATE CAR; SWP:O98949; PDB:1BWVA; RIKNSRYESGVIPYAKMGYWNPDYQVKDTDVLALFRVTPQPGVDPIEAAAAVAGESSTAT 
----1111----3333----1111--1111---------2222----------------- WTVVWTDLLTAADLYRAKAYKVDQVPNNPEQYFAYIAYELDLFEEGSIANLTASIIGNVF ----3333--3333-------------3333-------3333-2222----------111 GFKAVKALRLEDMRLPLAYLKTFQGPATGVILERERLDKFGRPLLGCTTKPKLGLSGKNY 13333----------33331111------------------------------------- GRVVYEALKGGLDFVDDENINSQPFMRWRERYLFTMEAVNKASAATGEVKGHYLNVTAAT ----------------1111--3333---------------------------------- MEEMYARANFAKELGSVIIMIDLVIGYTAIQTMAKWARDNDMILHLHRAGNSTYSRQKNH -----------1111-------3333---------------------2222-----1111 GMNFRVICKWMRMAGVDHIHAGTVVGKLEGDPIITRGFYKTLLLPKLERNLQEGLFFDME --------------------------------------------------1111------ WASLRKVMPVASGGIHAGQMHQLIHYLGEDVVLQFGGGTIGHPDGIQAGATANRVALEAM %%%%---------------1111------------3333--1111--------------- ILARNENRDYLTEGPEILREAAKTCGALRTALDLWKDITFNYTSTDTSDFV --------3333--------3333----------1111------------- >Ribulose bisphosphate car; SWP:O98950; PDB:1BWVS; VRITQGTFSFLPDLTDEQIKKQIDYMISKKLAIGIEYTNDIHPRNAYWEIWGLPLFDVTD -------1111---------------1111-----------------------------3 PAAVLFEINACRKARSNFYIKVVGFSSVRIESTIISFIVNRPKHEPGFNLMRQEDKSRSI 333----------------------------------------------------!!!!- KYTIHSYESYKPEDERY -----3333--1111-- >IG KAPPA CHAIN V-I REGION; SWP:P01607; PDB:1BWWA; TPDIQMTQSPSSLSASVGDRVTITCQASQDIIKYLNWYQQKPGKAPKLLIYEASNLQAGV ---------------2222---------------------2222------------2222 PSRFSGSGSGTDYTFTISSLQPEDIATYYCQQYQSLPYTFGQGTKLQIT 1111----!!!!--------1111------------------------- >PARATHYROID HORMONE; SWP:P01270; PDB:1BWX; SVSEIQLMHNLGKHLNSMERVEWLRKKLQDVHNFVALGA -------3333-----3333------------------- >HEART FATTY ACID BINDING ; SWP:P10790; PDB:1BWYA; VDAFVGTWKLVDSKNFDDYMKSLGVGFATRQVGNMTKPTTIIEVNGDTVIIKTQSTFKNT ----------------3333---------------------------------------- EISFKLGVEFDETTADDRKVKSIVTLDGGKLVHVQKWNGQETSLVREMVDGKLILTLTHG ------------------------------------------------------------ TAVCTRTYEKQA ------------ >PROTEIN (FERREDOXIN:NADP+; SWP:P00455; PDB:1BX0A; 
HSKKMEEGITVNKFKPKTPYVGRCLLNTKITGDDAPGETWHMVFSHEGEIPYREGQSVGV -----2222-----3333-------------3333----------iiii---2222---- IPDGEDKNGKPHKLRLYSIASSALGDFGDAKSVSLCVKRLIYTNDAGETIKGVCSNFLCD -----1111------------3333------------------1111----------111 LKPGAEVKLTGPVGKEMLMPKDPNATIIMLGTGTGIAPFRSFLWKMFFEKHDDYKFNGLA 12222---------1111---1111----------------------------------- WLFLGVPTSSSLLYKEEFEKMKEKAPDNFRLDFAVSREQTNEKGEKMYIQTRMAQYAVEL -------1111-------------1111------1111--3333---3333--------- WEMLKKDNTYFYMCGLKGMEKGIDDIMVSLAAAEGIDWIEYKRQLKKAEQWNVLVY -----1111------3333------------1111-3333-----1111------- >HLA class II histocompati; SWP:P04229; PDB:1BX2B; TRPRFLWQPKRECHFFNGTERVRFLDRYFYNQEESVRFDSDVGEFRAVTELGRPDAEYWN --------------------------------------1111------3333-------- SQKDILEQARAAVDTYCRHNYGVVESFTVQRRVQPKVTVYPSKTQPLQHHNLLVCSVSGF ----------3333---------33331111----------------------------- YPGSIEVRWFLNGQEEKAGMVSTGLIQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTS ----------iiii-----------------------------2222-------1111-- PLTVEWRARSE ----------- >ADENOSINE KINASE; SWP:P55263; PDB:1BX4A; VRENILFGMGNPLLDISAVVDKDFLDKYSLKPNDQILAEDKHKELFDELVKKFKVEYHAG -2222--------------------1111---------3333------------------ GSTQNSIKVAQWMIQQPHKAATFFGCIGIDKFGEILKRKAAEAHVDAHYYEQNEQPTGTC ------------------------------------------------------------ AACITGDNRSLIANLAAANCYKKEKHLDLEKNWMLVEKARVCYIAGFFLTVSPESVLKVA ----!!!!-------3333--333311113333--3333--------11113333----- HHASENNRIFTLNLSAPFISQFYKESLMKVMPYVDILFGNETEAATFAREQGFETKDIKE ---1111--------3333----------3333--------------------------- IAKKTQALPKMNSKRQRIVIFTQGRDDTIMATESEVTAFAVLDQDQKEIIDTNGAGDAFV ----1111---1111-------!!!!-----1111----------1111-3333------ GGFLSQLVSDKPLTECIRAGHYAASIIIRRTGCTFPEKPDFH -----1111-----------------1111%%%%-------- >HIRUSTASIN; SWP:P80302; PDB:1BX7; GNTCGGETCSAAQVCLKGKCVCNEVHCRIRCKYGLKKDENGCEYPCSCAKA ---iiii--3333--%%%%-----------1111---1111---------- >OSMOLARITY SENSOR PROTEIN; SWP:P02933; PDB:1BXDA; 
TGQEMPMEMADLNAVLGEVIAAESGYEREIETALYPGSIEVKMHPLSIKRAVANMVVNAA ------------------------------------------------------------ RYGNGWIKVSSGTEPNRAWFQVEDDGPGIAPEQRKHLFQPFVRGDSARTISGTGLGLAIV -----------------------------------------------------------3 QRIVDNHNGMLELGTSERGGLSIRAWLPVPVTRAQGTTKEG 3331111--------%%%%---------------------- >PROTEIN (CMTI-I); SWP:NA; PDB:1BXJA; RVCPRILLECKKDSDCLAECVCLEHGYCG ----------------------1111--- >DTDP-GLUCOSE 4,6-DEHYDRAT; SWP:P27830; PDB:1BXKA; MRKILITGGAGFIGSALVRYIINETSDAVVVVDKLTYAGNLMSLAPVAQSERFAFEKVDI -------1111------------------------1111----3333--1111-----11 CDRAELARVFTEHQPDCVMHLAAESHVDRSIDGPAAFIETNIVGTYTLLEAARAYWNALT 11--------------------------------3333----------------3333-- EDKKSAFRFHHISTDEVYGDLHSTDDFFTETTPYAPSSPYSASKASSDHLVRAWLRTYGL --------------3333----------1111---------------------------- PTLITNCSNNYGPYHFPEKLIPLMILNALAGKSLPVYGNGQQIRDWLYVEDHARALYCVA -----------22223333--------1111----------------------------- TTGKVGETYNIGGHNERKNLDVVETICELLEELAPNKPHGVAHYRDLITFRYAIDASKIA ---2222----------3333---------------------3333----------3333 RELGCVPQETFESGMRKTVQWYLANESWWKQVQDGSYQGER ------------------------------------1111- >RIBULOSE BISPHOSPHATE CAR; SWP:P42721; PDB:1BXNA; YKMGYWDGDYVPKDTDLLALFRITPQDGVDPVEAAAAVAGESSTATWTVVWTDRLTACDM -------------------------------------------------3333------- YRAKAYRVDPVPNNPEQFFCYVAYDLSLFEEGSIANLTASIIGNVFSFKPIKAARLEDMR ----------2222----------1111----3333-------33333333--------- FPVAYVKTFAGPSTGIIVERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDF -33331111-------------------------3333---------------------- MKDDENINSQPFMHWRDRFLFVMDAVNKASAATGEVKGSYLNVTAGTMEEMYRRAEFAKS ------------------------------------------------------------ LGSVIIMVDLIVGWTCIQSMSNWCRQNDMILHLHRAGHGTYTRQKNHGVSFRVIAKWLRL --------3333----------------------2222---------------------- AGVDHMHTGTAVGKLEGDPLTVQGYYNVCRDAYTQTDLTRGLFFDQDWASLRKVMPVASG -------------------------------------1111------%%%%--------- 
GIHAGQMHQLIHLFGDDVVLQFGGGTIGHPQGIQAGATANRVALEAMVLARNEGRDILNE --3333----------------3333-------------------------------111 GPEILRDAARWCGPLRAALDTWGDI 1-------1111------------- >Ribulose bisphosphate car; SWP:Q59102; PDB:1BXNI; MRITQGTFSFLPELTDEQITKQLEYCLNQGWAVGLEYTDDPHPRNTYWEMFGLPMFDLRD -----2222-----------------1111-----------------------------3 AAGILMEINNARNTFPNHYIRVTAFDSTHTVESVVMSFIVNRPADEPGFRLVRQEEPGRT 333-----------------------------------------------------!!!! LRYSIESYA --------- >PENICILLOPEPSIN; SWP:P00798; PDB:1BXOA; AASGVATNTPTANDEEYITPVTIGGTTLNLNFDTGSADLWVFSTELPASQQSGHSVYNPS ----------2222--------iiii----------------111133332222---333 ATGKELSGYTWSISYGDGSSASGNVFTDSVTVGGVTAHGQAVQAAQQISAQFQQDTNNDG 3----2222-----1111-------------iiii--------------3333------- LLGLAFSSINTVQPQSQTTFFDTVKSSLAQPLFAVALKHQQPGVYDFGFIDSSKYTGSLT -----3333---------3333-1111-----------------------1111------ YTGVDNSQGFWSFNVDSYTAGSQSGDGFSGIADTGTTLLLLDDSVVSQYYSQVSGAQQDS -----1111----------!!!!---------1111----------------2222---- NAGGYVFDCSTNLPDFSVSISGYTATVPGSLINYGPSGDGSTCLGGIQSNSGIGFSIFGD -------1111--------iiii----3333------------------iiii-----33 IFLKSQYVVFDSDGPQLGFAPQA 331111----------------- >ALDEHYDE DEHYDROGENASE; SWP:P51977; PDB:1BXSA; DVPAPLTNLQFKYTKIFINNEWHSSVSGKKFPVFNPATEEKLCEVEEGDKEDVDKAVKAA -----------------%%%%---3333-------------------------------- RQAFQIGSPWRTMDASERGRLLNKLADLIERDRLLLATMEAMNGGKLFSNAYLMDLGGCI ----22221111-3333------------------------------------------- KTLRYCAGWADKIQGRTIPMDGNFFTYTRSEPVGVCGQIIPWNFPLLMFLWKIGPALSCG ------3333----------------------------------------------1111 NTVVVKPAEQTPLTALHMGSLIKEAGFPPGVVNIVPGYGPTAGAAISSHMDVDKVAFTGS -------3333----------------2222--------------1111----------- TEVGKLIKEAAGKSNLKRVSLELGGKSPCIVFADADLDNAVEFAHQGVFYHQGQCCIAAS -------------------------------1111--------------%%%%-1111-- RLFVEESIYDEFVRRSVERAKKYVLGNPLTPGVSQGPQIDKEQYEKILDLIESGKKEGAK ----3333----------1111----1111------------------------1111-- 
LECGGGPWGNKGYFIQPTVFSDVTDDMRIAKEEIFGPVQQIMKFKSLDDVIKRANNTFYG -----------------------33331111----------------------------- LSAGIFTNDIDKAITVSSALQSGTVWVNCYSVVSAQCPFGGFKMSGNGRELGEYGFHEYT ---------------------------------1111----!!!!-------33331111 EVKTVTIKISQKNS -------------- >STREPTOCOCCAL SUPERANTIGE; SWP:NA; PDB:1BXTA; SSQPDPTPEQLNKSSQFTGVMGNLRCLYDNHFVEGTNVRSTGQLLQHDLIFPIKDLKLKN ------1111--3333-------3333-----------------1111------------ YDSVKTEFNSKDLATKYKNKDVDIFGSNYYYNCKTCMYGGVTEHHRNQIEGKFPNITVKV ----------------1111-----------------------2222------------- YEDNENILSFDITTNKKQVTVQELDCKTRKILVSRKNLYEFNNSPYETGYIKFIESSGDS -%%%%-------------------------------------------------1111-- FWYDMMPAPGAIFDQSKYLMLYNDNKTVSSSAIAIEVHLTKK -------------3333----1111---1111---------- >PLASTOCYANIN; SWP:P55020; PDB:1BXVA; VAIKMGADNGMLAFEPSTIEIQAGDVQWVNNKLAPHNVVVEGQPELSHKDLAFSPGETFE ------1111-----------2222-----------------3333-------2222--- ATFSEPGTYTYYCEPHRGAGMVGKIVVQ -------------1111----------- >OUTER MEMBRANE PROTEIN A; SWP:P02934; PDB:1BXWA; MAPKDNTWYTGAKLGWSQYHDTGLINNNGPTHENKLGAGAFGGYQVNPYVGFEMGYDWLG ----------------------------------------------1111---------- RMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTYSNVYGKNHDTGV ------------------------------------------------------------ SPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFG ------------1111------------------------------------ >RIBOSOMAL PROTEIN L30; SWP:P74909; PDB:1BXYA; MPRLKVKLVKSPIGYPKDQKAALKALGLRRLQQERVLEDTPAIRGNVEKVAHLVRVEVVE -----------2222--------1111--2222----------------3333------- >PIX; SWP:Q14155; PDB:1BY1A; MKGFDTTAINKSYYNVVLQNILETENEYSKELQTVLSTYLRPLQTSEKLSSANISYLMGN ------------------------------------------------------------ LEEICSFQQMLVQSLEECTKLPEAQQRVGGCFLNLMPQMKTLYLTYCANHPSAVNVLTEH 3333----------3333-------------------------------3333------- SEELGEFMETKGASSPGILVLTTGLSKPFMRLDKYPTLLKELERHMEDYHTDRQDIQKSM --3333---------!!!!---11111111-----3333------------3333----- AAFKNLSAQCQEVRKRKELELQILTEAIR 
--------33331111------------- >MAC-2 BINDING PROTEIN; SWP:Q08380; PDB:1BY2; AVNDGDMRLADGGATNQGRVEIFYRGQWGTVCDNLWDLTDASVVCRALGFENATQALGRA --2222-----------------iiii-----2222---------1111--------%%% AFGQGSGPIMLDEVQCTGTEASLADCKSLGWLKSNCRHERDAGVVCTNETTL %--------------------3333----2222---1111------------ >RETINOIC ACID RECEPTOR RX; SWP:Q6LC96; PDB:1BY4A; TKHICAICGDRSSGKHYGVYSCEGCKGFFKRTVRKDLTYTCRDNKDCLIDKRQRNRCQYC --------------------------------------------------1111---333 RYQKCLAMGMKREAVQEER 31111-----1111----- >PLASMINOGEN ACTIVATOR INH; SWP:P05120; PDB:1BY7A; EDLCVANTLFALNLFKHLAKASPTQNLFLSPWSISSTMAMVYMGSRGSTEDQMAKVLQFN ------------------3333-------------------1111-------------33 EVGAADKIHSSFRSLSSAINLLESVNKLFGEKSASFREEYIRLCQKYYSSEPQAVDFLEC 33----------------------------3333-------------------------- AEEARKKINSWVKTQTKGKIPNLLPEGSVDGDTRMVLVNAVYFKGKWKTPFEKKLYPFRV -----------------------------1111--------------------------- NSAQRTPVQMMYLREKLNIGYIEDLKAQILELPYAGDVSMFLLLPDEIADVSTGLELLES 1111------------------1111---------------------------------- EITYDKLNKWTSKDKMAEDEVEVYIPQFKLEEHYELRSILRSMGMEDAFNKGRANFSGMS -----------3333------------------------------3333------3333- ERNDLFLSEVFHQAMVDVNEEGTGPQFVADHPFLFLIMHKITNCILFFGRFSSP ---------------------------------------1111----------- >REGULATORY PROTEIN E2; SWP:P03120; PDB:1BY9; TTPIVHLKGDANTLKCLRYRFKKHCTLYTAVSSTWHWTAIVTLTYDSEWQRDQFLSQVKI -----------------------1111--------------------------------- PKTITVSTGFMS 1111-------- >POLYANDROCARPA LECTIN; SWP:P16108; PDB:1BYFA; DYEILFSDETMNYADAGTYCQSRGMALVSSAMRDSTMVKAILAFTEVKGHDYWVGADNLQ --------------------1111----3333---------------------------- DGAYNFLWNDGVSLPTDSDLWSPNEPSNPQSWQLCVQIWSKYNLLDDVGCGGARRVICEK -------1111---1111---2222---1111---------------------------- ELD --- >C-TERMINAL SRC KINASE; SWP:P41240; PDB:1BYGA; GWALNMKELKLLQTIGKGEFGDVMLGDYRGNKVAVKCIKNDAQAFLAEASVMTQLRHSNL ----3333---------3333----------------------3333---------1111 
VQLLGVIVEEGLYIVTEYMAKGSLVDYLRSRGRSVLGGDCLLKFSLDVCEAMEYLEGNNF ------------------1111---------3333--------------------1111- VHRDLAARNVLVSEDNVAKVSDFGLLPVKWTAPEALREKKFSTKSDVWSFGILLWEIYSF -----3333---1111----------1111-3333---------------------1111 GRVPYPRIPLKDVVPRVEKGYKMDAPDGCPPAVYEVMKNCWHLDAAMRPSFLQLREQLEH ----1111333333331111-----22223333----------3333------------- IKTHEL ------ >DETHIOBIOTIN SYNTHASE; SWP:P13000; PDB:1BYI; SKRYFVTGTDTEVGKTVASCALLQAAKAAGYRTAGYKPVASGSEKTPEGLRNSDALALQR --------------------------1111---------------1111---------11 NSSLQLDYATVNPYTFAEPTSPHIISAQEGRPIESLVMSAGLRALEQQADWVLVEGAGGW 11----3333----------3333------------------------------------ FTPLSDTFTFADWVTQEQLPVILVVGVKLGCINHAMLTAQVIQHAGLTLAGWVANDVTPP --------3333---------------2222----------------------------- GKRHAEYMTTLTRMIPAPLLGEIPWLAENPENAATGKYINLALL -------------------------iiii-----3333-3333- >TREHALOSE OPERON REPRESSO; SWP:P36673; PDB:1BYKA; SDKVVAIIVTRLDSLSENLAVQTMLPAFYEQGYDPIMMESQFSPQLVAEHLGVLKRRNID ----------11113333----------1111-------%%%%-----------1111-- GVVLFGFTGITEEMLAHWQSSLVLLARDAKGFASVCYDDEGAIKILMQRLYDQGHRNISY ----------33331111--------------------------------1111------ LGVPHSDVTTGKRRHEAYLAFCKAHKLHPVAALPGLAMKQGYENVAKVITPETTALLCAT ---3333-----------------------------3333---3333--1111------- DTLALGASKYLQEQRIDTLQLASVGNTPLMKFLHPEIVTVDPGYAEAGRQAACQLIAQVT ---------------------------------3333----------------------- GRSEPQQIIIPATLS --------------- >PLASTOCYANIN; SWP:P07030; PDB:1BYPA; AEVLLGSSDGGLAFVPSDLSIASGEKITFKNNAGFPHNDLFDKKEVPAGVDVTKISMPEE ------1111-----------2222----------------1111-22223333---111 DLLNAPGEEYSVTLTEKGTYKFYCAPHAGAGMVGKVTVN 1---2222---------------33331111-------- >ENDONUCLEASE; SWP:Q46707; PDB:1BYRA; EPSVQVGYSPEGSARVLVLSAIDSAKTSIRMMAYSFTAPDIMKALVAAKKRGVDVKIVID ---------------------1111----------------------------------- ERGNTGRASIAAMNYIANSGIPLRTDSNFPIQHDKVIIVDNVTVETGSFNFTKAAETKNS 1111------------1111-------------------------------3333----- 
ENAVVIWNMPKLAESFLEHWQDRWNQGRDYRS -----------------------1111----- >HUMAN ERG POTASSIUM CHANN; SWP:Q12809; PDB:1BYWA; SRKFIIANARVENCAVIYCNDGFCELCGYSRAEVMQRPCTCDFLHGPCTQRRAAAQIAQA ----------1111---------------33332222111111111111----------- LLGAEERKVEIAFYRKDGSCFLCLVDVVPVKNEDGAVIMFILNFEVVMEK 1111----------1111-------------1111--------------- >ANTIBODY R24 (LIGHT CHAIN; SWP:NA; PDB:1BZ7A; DIQMTQITSSLSVSLGDRVIISCRASQDIGNFLNWYQQKPDGSLKLLIYYTSRLQSGVPS -------------2222---------------------1111------------2222-- RFSGWGSGTDYSLTISNLEEEDIATFFCQQGKTLPYTFGGGTKLEIKRTVAAPSVFIFPP ------------------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT -----------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVT -----1111----------------- >ANTIBODY R24 (LIGHT CHAIN; SWP:NA; PDB:1BZ7B; DVQLVESGGGLVQPGGSRKLSCAASGFTFSNFGMHWVRQAPEKGLEWVAYISSGGSSINY ---------------------------3333----------------------------- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTAIYYCTRGGTGTRSLYYFDYWGQGATLIV 1111--------3333----------1111------------------------------ SSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQ ------------------------------------------------------------ SGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDK ------------3333--------------------- >PARATHYROID HORMONE-RELAT; SWP:P12272; PDB:1BZG; AVSEHQLLHDKGKSIQDLRRRFFLHHLIAEIHTA --------------3333-----3333------- >RNASE A; SWP:NA; PDB:1BZQK; QVQLVESGGGLVQAGGSLRLSCAASGYAYTYIYMGWFRQAPGKEREGVAAMDSGGGGTLY ------------2222-------------------------------------------- ADSVKGRFTISRDKGKNTVYLQMDSLKPEDTATYYCAAGGYELRDRTYGQWGQGTQVTVS 3333--------2222----------3333-------------3333------------- SRGR ---- >HYPOXANTHINE-GUANINE PHOS; SWP:P00492; PDB:1BZYA; SPGVVISDDEPGYDLDLFCIPNHYAEDLERVFIPHGLIMDRTERLARDVMKEMGGHHIVA ------1111---3333---1111---------3333---------------1111---- LCVLKGGYKFFADLLDYIKALNRNSDRSIPMTVDFIRLKSYCNDQSTGDIKVIGGDDLST ----1111------------------------------------------------3333 
LTGKNVLIVEDIIDTGKTMQTLLSLVRQYNPKMVKVASLLVKRTPRSVGYKPDFVGFEIP 2222---------------------------------------3333------------- DKFVVGYALDYNEYFRDLNHVCVISETGKAKYKA -----iiii-iiii-------------------- >ANTIMICROBIAL PEPTIDE 1; SWP:P80915; PDB:1C01A; SAFTVWSGPGCNNRAERYSKCGCSAIHQKGGYDFSYTGQTAALYNQAGCSGVAHTRFGSS ----------------------------------------------%%%%---------- ARACNPFGWKSIFIQC ---------------- >PHOSPHOTRANSFERASE YPD1P; SWP:Q07688; PDB:1C02A; STIPSEIINWTILNEIISMDDDDSDFSKGLIIQFIDQAQTTFAQMQRQLDGEKNLTELDN ------------------33331111---------------------------------- LGHFLKGSSAALGLQRIAWVCERIQNLGRKMQHFFPNKTELVNTLSDKSIINGINIDEDD -----------------------------------------1111---1111--1111-- EEIKIQVDDKDENSIYLILIAKALNQSRLEFKLARIELSKYYNTNL ---------------------------------------------- >50S ribosomal protein L14; SWP:P04450; PDB:1C04D; MIQQESRLKVADNSGAREVLVIKVLGGSGRRYANIGDVVVATVKDATPGGVVKKGQVVKA --2222--------------------2222---2222---------2222--2222---- VVVRTKRGVRRPDGSYIRFDENACVIIRDDKSPRGTRIFGPVARELRDKDFMKIISLAPE ----3333--1111-------------1111-------------3333------------ VI -- >RIBOSOMAL PROTEIN S4 DELT; SWP:P81288; PDB:1C05A; MKLSEYGLQLQEKQKLRHMYGVNERQFRKTFEEAGKMPGKHGENFMILLESRLDNLVYRL ---3333-----------------------3333-----------------3333----- GLARTRRQARQLVTHGHILVDGSRVNIPSYRVKPGQTIAVREKSRNLQVIKEALEANNYI ----3333-----------%%%%---1111--2222----------3333---------- PDYLSFDPEKMEGTYTRLPERSELPAEINEALIVEFYSR -------1111--------3333-------11111111- >PROTEIN (EPIDERMAL GROWTH; SWP:P42566; PDB:1C07A; TWVVSPAEKAKYDEIFLKTDKDMDGFVSGLEVREIFLKTGLPSTLLAHIWSLCDTKDCGK --------------3333-1111----3333----------------------3333--- LSKDQFALAFHLISQKLIKGIDPPHVLTPEMIPPS -1111------3333------------1111---- >ANTI-HEN EGG WHITE LYSOZY; SWP:P01642; PDB:1C08A; DIVLTQSPATLSVTPGNSVSLSCRASQSIGNNLHWYQQKSHESPRLLIKYASQSISGIPS -------------2222-------------------------------------222233 RFSGSGSGTDFTLSINSVETEDFGMYFCQQSNSWPYTFGGGTKLEIK 33----------------1111------------------------- >ASPARTYL TRNA 
SYNTHETASE; SWP:P21889; PDB:1C0AA; MRTEYCGQLRLSHVGQQVTLCGWVNRRRDLGSLIFIDMRDREGIVQVFFDPDRADALKLA -----11113333--------------------------1111------3333-----33 SELRNEFCIQVTGTVRARDEKNINRDMATGEIEVLASSLTIINRADVLPLDSNHVNTEEA 33-2222-----------3333----1111--------------------1111------ RLKYRYLDLRRPEMAQRLKTRAKITSLVRRFMDDHGFLDIETPMLTKATPEGARDYLVPS ---33333333---------------------1111------------------------ RVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMT --2222---------------1111----------------1111--------------- APQVREVMEALVRHLWLEVKGVDLGDFPVMTFAEAERRYGSDKPDLRNPMELTDVADLLK ------------------------------------------------------333311 SVEFAVFAGPANDPKGRVAALRVPGGASLTRKQIDEYGNFVKIYGAKGLAYIKVNERAKG 11-3333-----1111------2222-------------3333------------33331 LEGINSPVAKFLNAEIIEDILDRTAAQDGDMIFFGADNKKIVADAMGALRLKVGKDLGLT 111--1111-----------------2222------------------------1111-- DESKWAPLWVIDFPMFEDDGEGGLTAMHHPFTSPKDMTAAELKAAPENAVANAYDMVING 3333------------------------1111-------------1111--------iii YEVGGGSVRIHNGDMQQTVFGILGINEEEQREKFGFLLDALKYGTPPHAGLAFGLDRLTM i--------------------------------------1111----------------- LLTGTDNIRDVIAFPKTTAAACLMTEAPSFANPTALAELSIQVVK ------3333------1111----------------1111----- >D-AMINO ACID OXIDASE; SWP:P80324; PDB:1C0PA; LMMHSQKRVVVLGSGVIGLSSALILARKGYSVHILARDLPEDVSSQTFASPWAGANWTPF -------------------------1111---------1111------3333-------- MTLTDGPRQAKWEESTFKKWVELVPTGHAMWLKGTRRFAQNEDGLLGHWYKDITPNYRPL -3333------------------1111-------------3333%%%%-1111------- PSSECPPGAIGVTYDTLSVHAPKYCQYLARELQKLGATFERRTVTSLEQAFDGADLVVNA 3333-2222-----------------------1111---------3333----------- TGLGAKSIAGIDDQAAEPIRGQTVLVKSPCKRCTMDSSDPASPAYIIPRPGGEVICGGTY !!!!---2222-1111----------------------1111------------------ GVGDWDLSVNPETVQRILKHCLRLDPTISSDGTIEGIEVLRHNVGLRPARRGGPRVEAER ------------------------1111----3333-------------2222------- IVLPLDRTKSPLSLGRGSARAAKEKEVTLVHAYGFSSAGYQQSWGAAEDVAQLVDEAFQR 
-----33331111--3333---------------!!!!3333------------------ YHG --- >ANTIBODY FRAGMENT FAB; SWP:NA; PDB:1C12A; DIELTQSPSSMSVSLGDTVSITCHASQGISSNIGWLQQKPGKSFKGLIYHGTNLEDGVPS -------------2222-----------%%%%----------------------222211 RFSGSGSGADYSLTISSLESEDFADYYCVQYVQFPFTFGSGTKLEIKRADAAPTVSIFPP 11----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 33333333---------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEA -----1111--------1111--------1111- >ANTIBODY FRAGMENT FAB; SWP:NA; PDB:1C12B; QVQLQESGPGLVKPSQSLSLTCTVTGYSITSDY ------------2222----------------- >MHC-LIKE PROTEIN T22; SWP:Q9BCZ1; PDB:1C16A; GSHSLRYFYTAVSRPGLGEPWFIIVGYVDDMQVLRFSSKEETPRMAPWLEQEEADNWEQQ ---------------------------!!!!----------------------------- TRIVTIQGQLSERNLMTLVHFYNKSMDDSHTLQWLQGCDVEPDRHLCLWYNQLAYDSEDL -------------------1111-------------------------------iiii-- PTLNENPSSCTVGNSTVPHISQDLKSHCSDLLQKYLEKGKERLLRSDPPKAHVTRHPRPE -----------------------3333--------------------------------- GDVTLRCWALGFYPADITLTWQLNGEELTQDMELVETRPAGDGTFQKWAAVVVPLGKEQS ----------------------%%%%---------------------------2222333 YTCHVYHEGLPEPLILRWGG 3-----1111---------- >L-PHENYLALANINE DEHYDROGE; SWP:Q59771; PDB:1C1DA; SIDSALNWDGEMTVTRFDAMTGAHFVIRLDSTQLGPAAGGTRAAQYSNLADALTDAGKLA 3333--------------1111---------1111------------3333--------- GAMTLKMAVSNLPMGGGKSVIALPAPRHSIDPSTWARILRIHAENIDKLSGNYWTGPDVN -------1111--------------3333-------------------iiii-----222 TNSADMDTLNDTTEFVFGRSLERGGAGSSAFTTAVGVFEAMKATVAHRGLGSLDGLTVLV 2------------------3333-----------------------------2222---- QGLGAVGGSLASLAAEAGAQLLVADTDTERVAHAVALGHTAVALEDVLSTPCDVFAPCAM ---3333---------------------------1111----3333-------------- GGVITTEVARTLDCSVVAGAANNVIADEAASDILHARGILYAPDFVANAGGAIHLVGREV ----3333------------------3333----1111----3333-------------- LGWSESVVHERAVAIGDTLNQVFEISDNDGVTPDEAARTLAGRRAREAS ------------------------------------------------- 
>CATALYTIC ANTIBODY 1E9 (L; SWP:NA; PDB:1C1EH; QIQLVQSGPELKKPGETVKISCKASGYMFTNYGMNWVKQAPGKALKLMGWINPYTGESTF ------------2222-----------1111----------------------------- ADDFKGRFAFFLETSATTAYLQINNL 3333--------3333---------- >BPT4 GENE 59 HELICASE ASS; SWP:P13342; PDB:1C1KA; MIKLRMPAGGERYIDGKSVYKLYLMIKQHMNGKYDVIKYNWCMRVSDAAYQKRRDKYFFQ --------1111----------------1111--3333%%%%-------3333-3333-- KLSEKYKLKELALIFISNLVANQDAWIGDISDADALVFYREYIGRLKQIKFKFEEDIRNI ----------------------1111--1111---------------------------- YYFSKKVEVSAFKEIFEYNPKVQSSYIFKLLQSNIISFETFILLDSFLNIIDKHDEQTDN ----------3333-----1111------------------------------------- LVWNNYSIKLKAYRKILNIDSQKAKNVFIETVKSCKY -------------1111-------------------- >CONGERIN I; SWP:P26788; PDB:1C1LA; GGLQVKNFDFTVGKFLTVGGFINNSPQRFSVNVGESMNSLSLHLDHRFNYGADQNTIVMN ----3333--2222---------------------1111----------!!!!------- STLKGDNGWETEQRSTNFTLSAGQYFEITLSYDINKFYIDILDGPNLEFPNRYSKEFLPF ---!!!!-------------2222----------------2222------1111------ LSLAGDARLTLVKLE --------------- >RAS-RELATED PROTEIN RAP-1; SWP:P62834; PDB:1C1YA; MREYKLVVLGSGGVGKSALTVQFVQGIFVEKYDPTIEDSYRKQVEVDCQQCMLEILDTAG ----------2222-------------------------------%%%%----------- TEQFTAMRDLYMKNGQGFALVYSITAQSTFNDLQDLREQILRVKDTEDVPMILVGNKCDL ----------------------1111------------------------------1111 EDERVVGKEQGQNLARQWCNCAFLESSAKSKINVNEIFYDLVRQINR 1111-----------------------------3333---------- >RAF proto-oncogene serine; SWP:P04049; PDB:1C1YB; SNTIRVFLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLLHEHKGKKARLDW --------%%%%------22223333------1111-3333------1111-------11 NTDAASLIGEELQVDFL 1133332222------- >CDC25A; SWP:P30304; PDB:1C25; MLIGDFSKGYLFHTVAGKHQDLKYISPEIMASVLNGKFANLIKEFVIIDCRYPYEYEGGH --1111------------1111-------------1111--------------------- IKGAVNLHMEEEVEDFLLKKPIVPTDGKRVIVVFHCEFSSERGPRMCRYVRERDRLGNEY 2222----3333----3333---------------------------------------- PKLHYPELYVLKGGYKEFFMKCQSYCEPPSYRPMHHEDFKE ----------2222-------3333---------------- >PROTEIN (30 
KD ADIPOCYTE ; SWP:Q60994; PDB:1C28A; MYRSAFSVGLETRVTVPNVPIRFTKIFYNQQNHYDGSTGKFYCNIPGLYYFSYHITVYMK ----------------------------1111---------------------------- DVKVSLFKKDKAVLFTYDQYQENVDQASGSVLLHLEVGDQVWLQVYYADNVNDSTFTGFL -----------------------------------2222-----------2222------ LYHDT ----- >SYNTHETIC PEPTIDE ANALOGU; SWP:P29187; PDB:1C2UA; RSIDTIPKSRCTAFQCKHSAKYRLSFCRKTCGT -----------1111----1111----1111-- >LUMAZINE SYNTHASE; SWP:Q9XH32; PDB:1C2YA; MNELEGYVTKAQSFRFAIVVARFNEFVTRRLMEGALDTFKKYSVNEDIDVVWVPGAYELG -----------------------3333---------------------------3333-- VTAQALGKSGKYHAIVCLGAVVKGDTSHYDAVVNSASSGVLSAGLNSGVPCVFGVLTCDN -----------------------------------------------------------3 MDQAINRAGGKAGNKGAESALTAIEMASLFEHHLK 333-3333-------------------3333---- >FLAVOCETIN-A: ALPHA SUBUN; SWP:Q8AV97; PDB:1C3AA; DFDCIPGWSAYDRYCYQAFSKPKNWEDAESFCEEGVKTSHLVSIESSGEGDFVAQLVAEK ----------!!!!-----------------1111!!!!--------------------- IKTSFQYVWIGLRIQNKEQQCRSEWSDASSVNYENLVKQFSKKCYALKKGTELRTWFNVY ------------------------------------3333-------------------- CGTENPEVCKYTPEC --------------- >Flavocetin-A beta chain; SWP:Q8AV98; PDB:1C3AB; GFCCPLGWSSYDEHCYQVFQQKMNWEDAEKFCTQQHKGSHLVSFHSSEEVDFVTSKTFPI -----------------------------------2222--------------------- LKYDFVWIGLSNVWNECTKEWSDGTKLDYKAWSGGSDCIVSKTTDNQWLSMDCSSKYYVV -----------1111-----1111------------------------------------ CKFQA ----- >ADENYLOSUCCINATE LYASE; SWP:Q9X0I0; PDB:1C3CA; VERYSLSPMKDLWTEEAKYRRWLEVELAVTRAYEELGMIPKGVTERIRNNAKIDVELFKK 3333---3333----------------------1111--2222----------------- IEEKTNHDVVAFVEGIGSMIGEDSRFFHYGLTSSDVLDTANSLALVEAGKILLESLKEFC ---------------------3333--22223333------------------------- DVLWEVANRYKHTPTIGRTHGVHAEPTSFGLKVLGWYSEMKRNVQRLERAIEEVSYGKIS ---------1111-----iiii-------------------------------------- GAVGNYANVPPEVEEKALSYLGLKPEPVSTQVVPRDRHAFYLSTLAIVAAGIERIAVEIR 1111-1111---------1111-------------------------------------- 
HLQRTEVLEVEEPFRKSAMPHKKNPITCERLTGLSRMMRAYVDPSLENIALWHERDISHS ---1111-----------1111-----------------------------!!!!-3333 SVERYVFPDATQTLYYMIVTATNVVRNMKVNEERMKKNIDLTKGLVFSQRVLLKLIEKGL -------------------------------------1111--3333------------- TRKEAYDIVQRNALKTWNSEKHFLEYLLEDEEVKKLVTKEELEELFDISYYLKHVDHIFE ---------------1111-----------3333------------3333-1111----1 RFEK 111- >C3D; SWP:P01024; PDB:1C3D; MLDAERLKHLIVTPSGAGEQNMIGMTPTVIAVHYLDETEQWEKFGLEKRQGALELIKKGY --33333333-----------------------------3333-1111------------ TQQLAFRQPSSAFAAFVKRAPSTWLTAYVVKVFSLAVNLIAIDSQVLCGAVKWLILEKQK ---11111111----1111--------------1111-----3333-------------1 PDGVFQEDAPVIHQEMIGGLRNNNEKDMALTAFVLISLQEAKDICEEQVNSLPGSITKAG 111---------3333--------------------------1111--1111-------- DFLEANYMNLQRSYTVAIAGYALAQMGRLKGPLLNKFLTTAKDKNRWEDPGKQLYNVEAT -----3333--------------1111-------------2222-------3333----- SYALLALLQLKDFDFVPPVVRWLNEQRYYGGGYGSTQATFMVFQALAQYQKDAP -----------3333--------3333----2222------------------- >HEAT SHOCK PROTEIN 40; SWP:P25294; PDB:1C3GA; ETVQVNLPVSLEDLFVGKKKSFKIGRKGPHGASEKTQIDIQLKPGWKAGTKITYKNQGDY ---------3333-----------------------------2222-------------- NPQTGRRKTLQFVIQEKSHPNFKRDGDDLIYTLPLSFKESLLGFSKTIQTIDGRTLPLSR ------------------------!!!!-------------------------------- VQPVQPSQTSTYPGQGMPTPKNPSQRGNLIVKYKVDYPISLNDAQKRAID ----1111---2222------3333-----------------33333333 >30 KD ADIPOCYTE COMPLEMEN; SWP:Q60994; PDB:1C3HA; AYMYRSAFSVGLETRVTVPNVPIRFTKIFYNQQNHYDGSTGKFYCNIPGLYYFSYHITVY ------------------------------1111-------------------------- MKDVKVSLFKKDKAVLFTYDQYQEKNVDQASGSVLLHLEVGDQVWLQVYGDGDHNGLYAD -----------------------%%%%-----------2222------------------ NVNDSTFTGFLLYHDTN ----------------- >AGGLUTININ; SWP:Q9ZQY5; PDB:1C3MA; ASDIAVQAGPWGGNGGKRWLQTAHGGKITSIIIKGGTCIFSIQFVYKDKDNIEYHSGKFG ----------------------iiii---------------------1111--------- VLGDKAETITFAEDEDITAISGTFGAYYHMTVVTSLTFQTNKKVYGPFGTVASSSFSLPL 
-----------1111--------------------------------------------- TKGKFAGFFGNSGDVLDSIGGVVVP ------------------------- >HDLP (HISTONE DEACETYLASE; SWP:O67135; PDB:1C3PA; KKVKLIGTLDYGKYRYPKNHPLKIPRVSLLLRFKDAMNLIDEKELIKSRPATKEELLLFH --------3333-------1111-----------------1111-------3333----- TEDYINTLMEAERCQCVPKGAREKYNIGGYENPVSYAMFTGSSLATGSTVQAIEEFLKGN -----------------2222-------3333-----------------------1111- VAFNPAGGMHHAFKSRANGFCYINNPAVGIEYLRKKGFKRILYIDLDAHHCDGVQEAFYD -----------------iiii------------1111--------------------111 TDQVFVLSLHQSPEYAFPFEKGFLEEIGEGKGKGYNLNIPLPKGLNDNEFLFALEKSLEI 1----------3333-------1111---1111--------------------------- VKEVFEPEVYLLQLGTDPLLEDYLSKFNLSNVAFLKAFNIVREVFGEGVYLGGGGYHPYA ---------------3333--1111----------------------------------- LARAWTLIWCELSGREVPEKLNNKAKELLKSIDFEEFDDEVDRSYMLETLKDPWRGGEVR ----------------------------1111------------1111------------ KEVKDTLEKAKA ------------ >HIS TAG; SWP:P39593; PDB:1C3QA; SMDAQSAAKCLTAVRRHSPLVHSITNNVVTNFTANGLLALGASPVMAYAKEEVADMAKIA ------------------------------------------------1111---3333- GALVLNIGTLSKESVEAMIIAGKSANEHGVPVILDPVGAGATPFRTESARDIIREVRLAA -------------------------1111------2222--------------------- IRGNAAEIAHTVGVTDWLIKGVDAGEGGGDIIRLAQQAAQKLNTVIAITGEVDVIADTSH --------------1111------------------------------------------ VYTLHNGHKLLTKVTGAGCLLTSVVGAFCAENPLFAAIAAISSYGVAAQLAAQQTADKGP -------------2222-------------------------------------1111-- GSFQIELLNKLSTVTEQDVQEWATIERVTVS ------------------------------- >1D8 UBIQUITIN; SWP:P62988; PDB:1C3TA; MQLFVKTLTGKTLTVELEPSDTVENLKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN -----------------33333333-----------1111----iiii------3333-- LQKESTIHLVLRLRGG ---------------- >THP12 CARRIER PROTEIN; SWP:Q27011; PDB:1C3YA; ETPREKLKQHSDACKAESGVSEESLNKVRNREEVDDPKLKEHAFCILKRAGFIDASGEFQ ----------1111--------33333333-----------------------1111--3 LDHIKTKFKENSEHPEKVDDLVAKCAVKKDTPQHSSADFFKCVHDNRS 333---------------------------3333--3333-------- >LUMAZINE SYNTHASE; 
SWP:Q9UVT8; PDB:1C41A; GPTPQQHDGSALRIGIVHARWNETIIEPLLAGTKAKLLACGVKESNIVVQSVPGSWELPI ---------------------3333------------1111-3333-------3333--- AVQRLYSASQLQSTGPFDALIAIGVLIKGETMHFEYIADSVSHGLMRVQLDTGVPVIFGV -------3333------------------------------------------------- LTVLTDDQAKARAGVIEGSHNHGEDWGLAAVEMGVRRRDWAAGKT ----3333--1111-------------------------1111-- >STEROL CARRIER PROTEIN 2; SWP:O62742; PDB:1C44A; SSAGDGFKANLVFKEIEKKLEEEGEQFVKKIGGIFAFKVKDGPGGKEATWVVDVKNGKGS ------3333-------------------------------2222--------------- VLPNSDKKADCTITMADSDLLALMTGKMNPQSAFFQGKLKITGNMGLAMKLQNLQLQPGK ----------------------1111-------1111-------------3333------ AKL --- >SHIGA-LIKE TOXIN I B SUBU; SWP:P08027; PDB:1C48A; TPDCVTGKVEYTKYNDDDTFTVKVGDKELFTNRWNLQSLLLSAQITGMTVTIKTNACHNG --------------1111-----!!!!-----3333---------------------222 GTFSEVIFR 2-------- >TOXIN K-BETA; SWP:P55928; PDB:1C49A; TISCTNEKQCYPHCKKETGYPNAKCMNRKCKCFGR -----3333----------------%%%%------ >GURMARIN; SWP:P25810; PDB:1C4EA; QCVKKDELCIPYYLDCCEPLECKKVNWWDHKCIG ---2222--2222--------------------- >ORNITHINE DECARBOXYLASE; SWP:P43099; PDB:1C4KA; SSSLKIASTQEARQYFDTDRVVVDAVGSDFTDVGAVIAMDYETDVIDAADATKFGIPVFA --------3333--------------------------11111111--3333-------- VTKDAQAISADELKKIFHIIDLEFDATVNAREIETAVNNYEDSILPPFFKSLKEYVSRYL ---3333-3333---------------------------------3333----------- IQFDCPGHQGGQYYRKHPAGREFYDFFGETVFRADLCNADVALGDLLIHEGPAVAAEKHA -----3333-3333--------------3333----33331111---------------- ARVYNADKTYFVLGGSSNANNTVTSALVSNGDLVLFDRNNHKSVYNSALAMAGGRPVYLQ -1111-----------------------2222----11113333---------------- TNRNPYGFIGGIYDSDFDEKKIRELAAKVDPERAKWKRPFRLAVIQLGTYDGTIYNAHEV ---1111-----3333-3333-------1111----------------1111-------- VKRIGHLCDYIEFDSAWVGYEQFIPMMRNSSPLLIDDLGPEDPGIIVVQSVHKQQAGFSQ ---1111-------11113333-3333---1111----1111-------3333----222 TSQIHKKDSHIKGQLRYCDHKHFNNSFNLFMSTSPFYPMYAALDVNAAMQEGEAGRKLWH 2------1111--3333------------------3333------------3333----- 
DLLITTIEARKKLIKAGSMFRPFVPPVVNGKKWEDGDTEDMANNIDYWRFEKGAKWHAYE -------------1111----------iiii3333-3333---3333---22223333-- GYGDNQYYVDPNKFMLTTPGINPETGDYEDFGVPATIVANYLRDHGIIPEKSDLNSILFL ---------1111----------------------------------------------- MTPAETPAKMNNLITQLLQLQRLIEEDAPLKQVLPSIYAANEERYNGYTIRELCQELHDF -----------------------1111-3333------------2222------------ YKNNNTFTYQKRLFLREFFPEQGMLPYEARQEFIRNHNKLVPLNKIEGEIALEGALPYPP -1111-----33333333-----------------------33332222----------- GVFCVAPGEKWSETAVKYFTILQDGINNFPGFAPEIQGVYFKQEGDKVVAYGEVYDAEVA -----2222----------------------------------!!!!---------3333 KNDDRYNN --3333-- >DNA NUCLEOTIDE EXCISION R; SWP:Q56243; PDB:1C4OA; TFRYRGPSPKGDQPKAIAGLVEALRDGEFTLLGATGTGTVTMAKVIEALGRPALVLAPNK ---------!!!!--------------------2222----------------------- ILAAQLAAEFRELFPENAVEYFISYYDYYQPEAYVPGKDLYIEKDASINPEIRLRHSTTR -------------1111------3333--------1111--------------------- SLLTRRDVIVVASVSAIYGGDPREYRARNLVGFVLFPATHYLSPEGLEEILKEIEKELWE ------------3333-------------------------------------------- RVRYFEERGEYAQRLKERTLYDLEMLRVMGTCPGVENYARYFTGKAPGEPPYTLLDYFPE -----1111----------------------2222--3333----2222---3333--11 DFLVFLDESHVTVPQLQGMYRGDYARKKTLVDYGFRLPSALDNRPLRFEEFLERVSQVVF 11-----3333-------------------1111--3333-----------1111----- VSATPGPFELAHSGRVVEQIIRPTGLLDPLVRVKPTENQILDLMEGIRERAARGERTLVT ---------------------1111----------2222-----------1111------ VLTVRMAEELTSFLVEHGIRARYLHHELDAFKRQALIRDLRLGHYDCLVGINLLREGLDI ------------------------1111-----------1111-----------2222-1 PEVSLVAILDADKEGFLRSERSLIQTIGRAARNAGEVWLYADRVSEAMQRAIEETNRRRA 111------1111-1111---------1111----------------------------- LQEAYNEHGITPETV --------------- >SHIGA-LIKE TOXIN I SUBUNI; SWP:P08027; PDB:1C4QA; TPDCVTGKVEYTKYNDDDTFTVKVGDKELATNRANLQSLLLSAQITGMTVTIKTNACHNG --------------1111-----!!!!-----3333---------------------222 GGFSEVIFR 2-------- >NEUREXIN-I BETA; SWP:Q9ULB1; PDB:1C4RA; HAGTTYIFSKGGGQITYKWPPNDRPSTRADRLAIGFSTVQKEAVLVRVDSSSGLGDYLEL 
-------------------1111---------------------------2222------ HIHQGKIGVKFNVGTDDIAIEESNAIINDGKYHVVRFTRSGGNATLQVDSWPVIERYPAG ---------------------------------------!!!!----------------- RQLTIFNSQATIIIGGKEQGQPFQGQLSGLYYNGLKVLNMAAENDANIAIVGNVRLVGEV ---------------3333------------iiii---------1111------------ >PROTEIN (2-HYDROXY-6-OXO-; SWP:O05149; PDB:1C4XA; TVEIIEKRFPSGTLASHALVAGDPQSPAVVLLHGAGPGAHAASNWRPIIPDLAENFFVVA ----------------------3333---------2222-33333333---3333----- PDLIGFGQSEYPETYPGHIMSWVGMRVEQILGLMNHFGIEKSHIVGNSMGGAVTLQLVVE --2222-----------3333--------------------------------------- APERFDKVALMGSVGAPMNARPPELARLLAFYADPRLTPYRELIHSFVYDPENFPGMEEI 3333-------------------------1111----------------33332222--- VKSRFEVANDPEVRRIQEVMFESMKAGMESLVIPPATLGRLPHDVLVFHGRQDRIVPLDT ---------------------------3333--33331111--------1111------- SLYLTKHLKHAELVVLDRCGHWAQLERWDAMGPMLMEHFRA ---------------------3333---------------- >UBIQUITIN-PROTEIN LIGASE ; SWP:Q05086; PDB:1C4ZA; NPYLRLKVRRDHIIDDALVRLEMIAMENPADLKKQLYVEFEGEQGVDEGGVSKEFFQLVV ---------------------------3333--------2222----------------- EEIFNPDIGMFTYDESTKLFWFNPSSFETEGQFTLIGIVLGLAIYNNCILDVHFPMVVYR -11111111-----1111----1111---------------------------------- KLMGKKGTFRDLGDSHPVLYQSLKDLLEYEGNVEDDMMITFQISQTDLFGNPMMYDLKEN 1111---3333---------------------1111-----------------------3 GDKIPITNENRKEFVNLYSDYILNKSVEKQFKAFRRGFHMVTNESPLKYLFRPEEIELLI 333---1111-------------3333------------------------3333----- CGSRNLDFQALEETTEYDGGYTRDSVLIREFWEIVHSFTDEQKRLFLQFTTGTDRAPVGG ---------3333-------------------------------------------2222 LGKLKMIIAKNGPDTERLPTSHTCFNVLLLPEYSSKEKLKERLLKAITYA 3333-----------------1111---------3333------------ >UBIQUITIN-PROTEIN LIGASE ; SWP:P51966; PDB:1C4ZD; SRRLMKELEEIRKCGMKNFRNIQVDEANLLTWQGLIVPDNPPYDKGAFRIEINFPAEYPF -1111-3333--------------------------------------------1111-- KPPKITFKTKIYHPNIDEKGQVCLPVISAENWKPATKTDQVIQSLIALVNDPQPEHPLRA ------------11111111---3333-----33333333-------------------- 
DLAEEYSKDRKKFCKNAEEFTKKY -------------1111------- >CYTOCHROME-C552; SWP:P04164; PDB:1C52; QADGAKIYAQCAGCHQQNGQGIPGAFPPLAGHVAEILAKEGGREYLILVLLYGLQGQIEV --33333333-----1111--2222-------------2222-----------------% KGMKYNGVMSSFAQLKDEEIAAVLNHIATAWGDAKKVKGFKPFTAEEVKKLRAKKLTPQQ %%%--------3333-------------111133332222----------3333------ VLAERKKLGLK ----1111--- >BUTANTOXIN; SWP:P59936; PDB:1C55A; WCSTCLDLACGASRECYDPCFKAFGRAHGKCMNNKCRCYT -------------------------------iiii----- >CHIMERIC DECARBOXYLASE AN; SWP:NA; PDB:1C5CH; QVQLLEPGTELVKPGASVKLSCRASGYSFTSYWMHWVKQRPGQGLEWIGLIDPSNGRTNF ------------2222-----------1111--------2222----------------- NDKFKSRATLTVDTSSSTAYMQLSSLTSEDSAVYYCVRIAYWGQGTLVTVSSASTKGPSV 3333--------3333----------1111------------------------------ FPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSV -----1111-!!!!-------------------iiii----------------------- VTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC ---1111---------------------------- >CHIMERIC DECARBOXYLASE AN; SWP:NA; PDB:1C5CL; EIQLTQSPSSLSASLGERVSLTCRTSQEISGYLSWLQQKPDGTIKRLIYDATKLDSGAPK -------------2222-----------iiii------1111------------111133 RFSGSRSGSDYSLTISSLESEDFADYYCLQYASFPRTFGGGTKLEIKRTVAAPSVFIFPP 33----!!!!--------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 33331111---------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -----1111--------1111--------2222- >MONOCLONAL ANTIBODY AGAIN; SWP:NA; PDB:1C5DH; EVKLLESGPGLVQPSQTLSLTCTVSGFPLTTNGVSWVRQPPGKGLEWIAAISSGGSPYYN ------------2222-----------3333--------2222--------1111----3 SALKSRLSINRDTSKSQVFLKMNSLQTEDTAIYFCTREDGWNYFDYWGPGTMVTVSSAQT 3331111------------------3333------------------------------- TAPSVYPLAPGCGDTTSSTVTLGCLVKGYFPEPVTVTWNSGALSSDVHTFPAVLQSGLYT -------------------------------------iiii-------------%%%%-- LTSSVTSSTWPSQTVTCNVAHPASSTKVDKKLER ---------------------------------- >MONOCLONAL ANTIBODY AGAIN; SWP:NA; PDB:1C5DL; 
DIQMTQSPPSLSASLGDKVTITCQASQDINKYIAWYQQKPGKAPRQLIRYTSILVLGTPS -------------2222---------------------2222----------------11 RFSGSGSGRDFSFSISNVASEDIASYYCLQYGNLYTFGAGTKLEIKRADAAPTVSIFPPS 11----------------3333-------------------------------------3 TEQLATGGASVVCLMNNFYPRDISVKWKIDGTERRDGVLDSVTDQDSKDSTYSMSSTLSL 3331111---------------------iiii---------------------------- TKADYESHNLYTCEVVHKTSSSPVVKSFNRNEC ----1111--------1111------------- >HEAD DECORATION PROTEIN; SWP:P03712; PDB:1C5EA; SDPAHTATAPGGLSAKAPAMTPLMLDTSSRKLVAWDGTTDGAAVGILAVAADQTSTTLTF -----------------2222-----------------2222---------1111----- YKSGTFRYEDVLWPEAASDETKKRTAFAGTAISIV ------3333---3333-------1111------- >CYTOCHROME C6; SWP:P57736; PDB:1C6RA; ADLALGKQTFEANCAACHAGGNNSVIPDHTLRKAAMEQFLQGGFNLEAITYQVENGKGAM --------------1111%%%%---1111----------2222-------------!!!! PAWSGTLDDDEIAAVAAYVYDQASGDKW ---------------------------- >CYTOCHROME C6; SWP:P0A3X9; PDB:1C6S; ADLANGAKVFSGNCAACHMGGGNVVMANKTLKKEALEQFGMYSEDAIIYQVQHGKNAMPA -3333---11113333------------------------------1111---------- FAGRLTDEQIQDVAAYVLDQAAKGWAG ----------333333333333----- >SIV INTEGRASE; SWP:Q87706; PDB:1C6VA; NSDLGTWQMDCTHLEGKIVIVAVHVASGFIEAEVIPQETGRQTALFLLKLAGRWPITHLH --1111-------iiii------------------------------------------- TDNGANFASQEVKMVAWWAGIEHTFGEAMNHHLKNQIDRIREQANSVETIVLMAVHCMNH --------3333-----------------------3333-----------------3333 KRRGGIGDMTPAERLINMITTEQEIQFQ ----------------------3333-- ------------------------------------------------------- >MAUROCALCIN; SWP:P60254; PDB:1C6WA; GDCLPHLKLCKENKDCCSKKCKRRGTNIEKRCR -----------33331111-------------- >PROTEASE; SWP:O09893; PDB:1C6YA; PQITLWQRPVVTIKIGGQLMEALIDTGADDTVLEEMDLPGRWKPKIIGGIGGFVKVRQYD --------------iiii------1111-------------------------------- QIPIEICGHKVIGTVLVGPTPTNIIGRNLLTQIGCTLNF -----iiii-------------------3333------- >CYTOCHROME C-553; SWP:P82599; PDB:1C75A; VDAEAVVQQKCISCHGGDLTGASAPAIDKAGANYSEEEILDIILNGQGGMPGGIAKGAEA 
---------------1111---------3333--------------!!!!---------- EAVAAWLAEKK ----------- >TYROSINE PHENOL-LYASE; SWP:P31011; PDB:1C7GA; MNYPAEPFRIKSVETVSMISRDERVKKMQEAGYNTFLLNSKDIYIDLLTDSGTNAMSDKQ ------------------------------%%%%11113333------------------ WAGMMIGDEAYAGSENFYHLEKTVKELFGFKHIVPTHQGRGAENLLSQLAIKPGQYVAGN --1111---------------------------------------------2222----- MYFTTTRFHQEKNGATFVDIVRDEAHDASLNLPFKGDIDLNKLATLIKEKGAENIAYICL ---------------------3333-1111---1111-------------3333------ AVTVNLAGGQPVSMANMRAVHEMASTYGIKIFYDATRCVENAYFIKEQEAGYENVSIKDI ----1111----------------1111--------------------2222-------- VHEMFSYADGCTMSGKKDCLVNIGGFLCMNDEEMFSAAKELVVVYEGMPSYGGLAGRDME ----1111--------1111---------------------------1111---3333-- AMAIGLREAMQYEYIEHRVKQVRYLGDKLREAGVPIVEPTGGHAVFLDARRFCPHLTQDQ -----------------------------1111-------1111-----3333---1111 FPAQSLAASIYMETGVRSMERGIVSAGRSKETGENHRPKLETVRLTIPRRVYTYAHMDVV ---------------------3333----------------------------------- ADGIIKLYQHKEDIRGLTFVYEPKQLRFFTARFDFI ---------3333------------3333------- >PARA-NITROBENZYL ESTERASE; SWP:P37967; PDB:1C7IA; THQIVTTQYGKVKGTTENGVHKWKGIPYAKPPVGQWRFKAPEPPEVWEDVLDATVYGPVC ------1111------%%%%------------!!!!------------------------ PQPSDLLSLSYKELPRQSEDCLYVNVFAPDTPSQNLPVMVWIHGGAFYLGAGSEPLYDGS -----3333-----------------------------------iiii--11111111-- KLAAQGEVIVVTLNYRLGPFGFMHLSSFDEAYSDNLGLLDQAAALKWVRENISAFGGDPD ---1111---------!!!!----33333333------------------3333---111 NVTVFGESAGGMSIAALLAMPAAKGLFQKAIMESGASRTMTKEQAASTAAAFLQVLGINE 1------------------1111------------------------------1111-11 SQLDRLHTVAAEDLLKAADQLRIAEKENIFQLFFQPALDPKTLPEEPEKSIAEGAASGIP 113333---3333---------1111-1111------------------------2222- LLIGTTRDEGYFFFTPDSDVYSQETLDAALEYLLGKPLAEKVADLYPRSLESQIHMVTDL ---------3333-1111-----------------------3333--------------- LFWRPAVAFASAQSHYAPVWMYRFDWHPEKPPYNKAFHTLELPFVFGNLDELERMAKAEI ------------------------------------2222-------------------- 
TDEVKQLSHTIQSAWTTFAKTGNPSTEAVNWPAYHEESRETVILDSEITIENDPESEKRQ -------------------------3333-----3333---------------------- KLF --- >ZINC ENDOPROTEASE; SWP:P56406; PDB:1C7KA; TVTVTYDPSNAPSFQQEIANAAQIWNSSVRNVQLRAGGNADFSYYEGNDSRGSYAQTDGH -------1111-------------------------------------3333-------- GRGYIFLDYQQNQQYDSTRVTAHETGHVLGLPDHYQGPCSELMSGGGPGPSCTNPYPNAQ ---------------------------------11113333--!!!!-3333-------- ERSRVNALWANG ------1111-- >CYSTALYSIN; SWP:Q56257; PDB:1C7NA; MIYDFTTKISRKNLGSLKWDLMYSQNPEVGNEVVPLSVADMEFKNPPELIEGLKKYLDET -------------------------11111111----------------------1111- VLGYTGPTEEYKKTVKKWMKDRHQWDIQTDWIINTAGVVPAVFNAVREFTKPGDGVIIIT ---------------------------1111-------------------2222------ PVYYPFFMAIKNQERKIIECELLEKDGYYTIDFQKLEKLSKDKNNKALLFCSPHNPVGRV ---------3333-----------iiii-------------3333--------------- WKKDELQKIKDIVLKSDLMLWSDEIHFDLIMPGYEHTVFQSIDEQLADKTITFTAPSKTF -----------------------1111----------1111-3333---------3333- NIAGMGMSNIIIKNPDIRERFTKSRDATSGMPFTTLGYKACEICYKECGKWLDGCIKVID -1111----------------------------3333----------------------- KNQRIVKDFFEVNHPEIKAPLIEGTYLQWIDFRALKMDHKAMEEFMIHKAQIFFDEGYIF -------------3333--------------3333--------------------3333- GDGGIGFERINLAAPSSVIQESLERLNKALKDLK 1111----------3333---------------- >PHOSPHOGLUCOSE ISOMERASE; SWP:P13376; PDB:1C7QA; AISFDYSNALPFMQENELDYLSEFVKAAHHMLHERKGPGSDFLGWVDWPIRYDKNEFSRI -----1111----3333-------------------2222--33333333---------- KQAAERIRNHSDALVVIGIGGSYLGARAAIEALSHTFHNQMNDTTQIYFAGQNISSTYIS ------------------!!!!--------------33331111---------------- HLLDVLEGKDLSINVISKSGTTTEPAIAFRIFRDYMEKKYGKEEARKRIYVTTDRTKGAL ---1111---------3333---------------------3333--------------- KKLADQEGYETFVIPDNIGGRYSVLTAVGLLPIAVAGLNIDRMMEGAASAYHKYNNPDLL --------------1111---1111--------------------------------111 TNESYQYAAVRNILYRKGKAIELLVNYEPSLHYVSEWWKQLFGESEGKDQKGLFPASVDF 1-------------1111---------3333----------------%%%%--------- 
TTDLHSMGQYVQEGRRNLIETVLHVKKPQIELTIQEDPENIDGLNFLAGKTLDEVNKKAF 1111--------------------------------3333---3333------------- QGTLLAHVDGGVPNLIVELDEMNEYTFGEMVYFFEKACGISGHLLGVNPFDQPGVEAYKK -------1111------------------------------------------1111--- NMFALLGKPGFEDEKAALMKRL -----------------3333- >BETA-N-ACETYLHEXOSAMINIDA; SWP:Q54468; PDB:1C7SA; DQQLVDQLSQLKLNVKMLDNRAGENGVDCAALGADWASCNRVLFTLSNDGQAIDGKDWVI -------1111---------3333---3333--2222----------------------- YFHSPRQTLRVDNDQFKIAHLTGDLYKLEPTAKFSGFPAGKAVEIPVVAEYWQLFRNDFL ---------------------!!!!-----1111--------------------3333-- PRWYATSGDAKPKMLANTDTENLDQFVAPFTGDQWKRTKDDKNILMTPASRFVSNADLQT --------------1111---3333-----!!!!---1111-------------1111-- LPAGALRGKIVPTPMQVKVHAQDADLRKGVALDLSTLVKPAADVVSQRFALLGVPVQTNG -3333--------------------1111----3333------------1111---1111 YPIKTDIQPGKFKGAMAVSGAYELKIGKKEAQVIGFDQAGVFYGLQSILSLVPSDGSGKI -------3333-!!!!-2222---------------------------33331111---- ATLDASDAPRFPYRGIFLDVARNFHKKDAVLRLLDQMAAYKLNKFHFHLSDDEGWRIEIP -------------------------------------1111---------1111----22 GLPELTEVGGQRCHDLSETTCLLPQYGQGPDVYGGFFSRQDYIDIIKYAQARQIEVIPEI 22------------1111-------------------------------1111------- DMPAHARAAVVSMEARYKKLHAAGKEQEANEFRLVDQTDTSNTTSVQFFNRQSYLNPCLD --------------------1111-----1111--1111-----1111-1111--1111- SSQRFVDKVIGEIAQMHKEAGQPIKTWHFGGAEAKNIRLGAGYTDKAKPEPGKGIIDQSN -----------------1111--------------33331111-3333------------ EDKPWAKSQVCQTMIKEGKVADMEHLPSYFGQEVSKLVKAHGIDRMQAWQDGLKDAESSK --2222---------------3333-------------1111----------1111---- AFATSRVGVNFWDTLYWGGFDSVNDWANKGYEVVVSNPDYVYMDFPYEVNPDERGYYWGT -------------11113333-----1111------1111---------1111---1111 RFSDERKVFSFAPDNMPQNAETSVDRDGNHFNAKSDKPWPGAYGLSAQLWSETQRTDPQM -------11111111---1111--1111----------------------1111------ EYMIFPRALSVAERSWHRAGWEQDYRAGREYKGGETHFVDTQALEKDWLRFANILGQREL ------------------1111---2222--2222------------------------- AKLDKGGVAYRLPVPGARVAGGKLEANIALPGLGIEYSTDGGKQWQRYDAKAKPAVSGEV 
---1111------------%%%%----------------iiii-----1111-------- QVRSVSPDGKRYSRAEKV -----1111--------- >CALCIUM VECTOR PROTEIN; SWP:P04573; PDB:1C7VA; EEEILRAFKVFDANGDGVIDFDEFKFIMQKVGEEPLTDAEVEEAMKEADEDGNGVIDIPE --------------iiii------------------3333----------iiii--3333 FMDLIKKS -------- >SPORE PROTEASE; SWP:P22321; PDB:1C8BA; MEKELDLSQYSVRTDLAVEAKDIALENQPKVIVKEKEEQGVKISMVEITEEGAEAIGKKK -----------------------------------------iiii-------3333---- GRYVTLESVGIREQDTEKQEEAMEEVFAKELNFFIKSLNIPDDASCLVVGLGNLSVTPDA -------------------------------------------------------1111- LGPKAVDNLLITRHLFELQPESVQDGFRPVSAIVPGVMGMTGIETSDIIFGVVKKVNPDF ----3333----33331111-------------3333----------------------- IIAIDALAARSIERVNATIQISDSGIHPGSGVGNKRKEISYETLPTVVDAVSITSDTIDF ----------3333------------------------3333------------------ ILKHFGREMKEQGLGMIGTLPDEEKRRLIHEVLAPLGHNLMVTPKEVDMFIEDMANVVAG -3333-3333------3333---3333--------3333--------------------- GLNAALHHEVDQENFGAYTH -------------------- >DNA-BINDING PROTEIN 7A; SWP:O59631; PDB:1C8CA; MATVKFKYKGEEKQVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQMLA ---------------3333------!!!!-------iiii-----------3333---11 KQKK 11-- >COAT PROTEIN; SWP:Q9IPS9; PDB:1C8NA; NSTVVSNSELILNLTPIALAYTVQSLPLIATQPAWLGTIADNYSKWRWVSLRIIYSPKCP ----------------------------11113333---1111----------------1 TTTSGTVAMCLSYDRNDVAPGSRVQLSQTYKAINFPPYAGYDGAAILNTDVTPTSAIYVD 111----------1111-----------2222---11111111-3333----1111---- VDVTRFDKAWYSTIGTAAFAALTAFDQNQFCPCTVHIGSDGGPAVAVPPGDIFFKYVIEL -1111-------------111133333333------------------------------ IEPINPTMN ----3333- >CYTOKINE RECEPTOR COMMON ; SWP:P32927; PDB:1C8PA; MIQMAPPSLNVTKDGDSYSLRWETMKMRYEHIDHTFEIQYRKDTATWKDSKTETLQNAHS ---------------------------------------------3333----------- MALPALEPSTRYWARVRVRTSRTGYNGIWSEWSEARSWDTES ------------------------------------------ >ACYL-COA THIOESTERASE II; SWP:P23911; PDB:1C8UA; SQALKNLLTLLNLEKIEEGLFRGQSEDLGLRQVFGGQVVGQALYAAKETVPEERLVHSFH 
----------------2222-------------3333--------3333-3333------ SYFLRPGDSKKPIIYDVETLRDGNSFSARRVAAIQNGKPIFYMTASFQAPEAGFEHQKTM -------1111-----------------------%%%%---------------------- PSAPAPDGLPSETQIAQSLAHLLPPVLKDKFICDRPLEVRPVEFHNPLKGHVAEPHRQVW -----1111--------1111--3333------------------3333----------- IRANGSVPDDLRVHQYLLGYASDLNFLPVALQPHGIGFLEPGIQIATIDHSMWFHRPFNL --------------------1111-3333-3333--1111-------------------- NEWLLYSVESTSASSARGFVRGEFYTQDGVLVASTVQEGVMRNHN -------------%%%%--------1111---------------- >TUBBY PROTEIN; SWP:P50586; PDB:1C8ZA; GSVDIEVQDLEEFALRPAPQGITIKCRITRDKKGMDRGMFPTYFLHLDREDGKKVFLLAG ------------1111--2222-------------------------------------- RKRKKSKTSNYLISVDPTDLSRGGDSYIGKLRSNLMGTKFTVYDNGVNPQKASSSTLESG ---------------3333--------------3333----------3333-3333---- TLRQELAAVCYETNVLGFKGPRKMSVIVPGMNMVHERVCIRPRNEHETLLARWQNKNTES -------------------------------1111--------1111------------- IIELQNKTPVWNDDTQSYVLNFHGRVTQASVKNFQIIHGNDPDYIVMQFGRVAEDVFTMD ---------------------iiii----1111----1111-----------1111---- YNYPLCALQAFAIALSSFDSKLACE --------------3333------- ------------------------------------- >MITOCHONDRIAL ACONITASE; SWP:P20004; PDB:1C96A; RAKVAMSHFEPHEYIRYDLLEKNIDIVRKRLNRPLTLSEKIVYGHLDDPANQEIERGKTY ------1111--------------------------------1111-3333---2222-- LRLRPDRVAMQDATAQMAMLQFISSGLPKVAVPSTIHCDHLIEAQLGGEKDLRRAKDINQ ----------1111---------------------------------------------- EVYNFLATAGAKYGVGFWRPGSGIIHQIILENYAYPGVLLIGTDSHTPNGGGLGGICIGV ------------------2222------------2222-----11113333--------- GGADAVDVMAGIPWELKCPKVIGVKLTGSLSGWTSPKDVILKVAGILTVKGGTGAIVEYH 3333--------------------------!!!!3333---------11112222----- GPGVDSISCTGMATICNMGAEIGATTSVFPYNHRMKKYLSKTGRADIANLADEFKDHLVP 3333-------------3333------------------1111----------3333--- DPGCHYDQVIEINLSELKPHINGPFTPDLAHPVAEVGSVAEKEGWPLDIRVGLIGSCTNS 2222--------3333---------1111--3333-11111111---------------- SYEDMGRSAAVAKQALAHGLKCKSQFTITPGSEQIRATIERDGYAQVLRDVGGIVLANAC 
3333-------------------------------------------------------- GPCIGQWDRKDIKKGEKNTIVTSYNRNFTGRNDANPETHAFVTSPEIVTALAIAGTLKFN 3333--------2222-----------2222---1111---------------------3 PETDFLTGKDGKKFKLEAPDADELPRAEFDPGQDTYQHPPKDSSGQRVAVSPTSQRLQLL 333-----------------------------------------------1111------ EPFDKWDGKDLEDLQILIKVKGKCTTDHISAAGPWLKFRGHLDNISNNLLIGAINIENRK ------------------------3333---!!!!1111--3333----1111------- ANSVRNAVTQEFGPVPDTARYYKQHGIRWVVIGDENYGEGASREHSALEPRHLGGRAIIT ----------------------1111---------2222---3333-------------- KSFARIHETNLKKQGLLPLTFADPADYNKIHPVDKLTIQGLKDFAPGKPLKCIIKHPNGT -----------1111---------3333--3333-----3333-2222-------1111- QETILLNHTFNETQIEWFRAGSALNRMKELQQK ----------3333------------------- >GENERAL TRANSCRIPTION FAC; SWP:Q00403; PDB:1C9BA; SDRAMMNAFKEITTMADRINLPRNIVDRTNNLFKQVYEQKSLKGRANDAIASACLYIACR ---------------------3333--------------------3333----------- QEGVPRTFKEICAVSRISKKEIGRCFKLILKALETSVDLITTGDFMSRFCSNLCLPKQVQ ----------------------------------------3333------1111-3333- MAATHIARKAVELDLVPGRSPISVAAAAIYMASQASAEKRTQKEIGDIAGVADVTIRQSY ----------1111-2222-------------1111------------------------ RLIYPRAPDLFPTDFKFDTPVDKLPQL --33331111----------------- >CASPASE-ACTIVATED DNASE; SWP:O54788; PDB:1C9FA; MCAVLRQPKCVKLRALHSACKFGVAARSCQELLRKGCVRFQLPMPGSRLCLYEDGTEVTD --------------1111-------------------1111-----------------11 DCFPGLPNDAELLLLTAGETWHGYVSD 11-------------1111-------- >FKBP12.6; SWP:P68106; PDB:1C9HA; GVEIETISPGDGRTFPKKGQTCVVHYTGMLQNGKKFDSSRDRNKPFKFRIGKQEVIKGFE ----------------2222---------2222----3333-------2222-------- EGAAQMSLGQRAKLTCTPDVAYGATGHPGVIPPNATLIFDVELLNLE --11112222------3333-!!!!-2222----------------- >ADENOSYLCOBINAMIDE KINASE; SWP:Q05599; PDB:1C9KA; MILVTGGARSGKSRHAEALIGDAPQVLYIATSQIARIQHHKDGRPAHWRTAECWRHLDTL ----------------------------------3333--1111-----------3333- ITADLAPDDAILLECITTMVTNLLFALDPEQWDYAAMERAIDDEIQILIAACQRCPAKVV -11111111-----------------------3333--------------3333------ 
LVTNEVGMGIVPENRLARHFRDIAGRVNQRLAAAADEVWLVVSGIGVKIK -----------------------------------------iiii----- >COLD-SHOCK PROTEIN; SWP:P41016; PDB:1C9OA; MQRGKVKWFNNEKGYGFIEVEGGSDVFVHFTAIQGEGFKTLEEGQEVSFEIVQGNRGPQA ----------1111-----2222-----3333---------2222--------1111--- ANVVKL ------ >APOPTOSIS INHIBITOR IAP H; SWP:P98170; PDB:1C9QA; RDHFALDRPSETHADYLLRTGQVVDISDTIYPRNPAMYSEEARLKSFQNWPDYAHLTPRE -2222-------3333-----------------3333-33333333-------------- LASAGLYYTGIGDQVQCFACGGKLKNWEPGDRAWSEHRRHFPNCFFVLGRNLNIRSE ----------!!!!------------------3333-------3333---------- >CHO REDUCTASE; SWP:O08782; PDB:1C9WA; STFVELSTKAKMPIVGLGTWQSPPGQVKEAVKVAIDAGYRHIDCAYAYYNEHEVGEAIQE -----1111---------22221111------------------1111------------ KIKEKAVRREDLFIVSKLWPTCFERKLLKEAFQKTLTDLKLDYLDLYLIHWPQGLQPGKE -------3333-------1111-------------------------------------- LFPKDDQGNVLTSKITFLDAWEVMEELVDEGLVKALGVSNFNHFQIERILNKPGLKHKPV ----3333-------------------1111--------------------2222----- TNQVECHPYLTQEKLIEYCHSKGITVTAYSPLGSPNRPWAKPEDPSLLEDPKIKEIAAKH ------1111---------1111------1111------------3333--------111 KKTSAQVLIRFHIQRNVVVIPKSVTPARIHENFQVFDFQLSDQEMATILGFNRNWRACLL 1-----------1111--------------1111--------------1111-------3 PETVNMEEYPYDAEY 333--1111------ >ALPHA-TOXIN; SWP:P0C216; PDB:1CA1; WDGKIDGTGTHAMIVTQGVSILENDLSKNEPESVRKNLEILKENMHELQLGSTYPDYDKN ---1111-3333--------------11113333------------------3333---- AYDLYQDHFWDPDTDNNFSKDNSWYLAYSIPDTGESQIRKFSALARYEWQRGNYKQATFY -11111111---------------------------------------1111-------- LGEAMHYFGDIDTPYHPANVTAVDSAGHVKFETFAEERKEQYKINTVGCKTNEDFYADIL ---------1111-3333--3333-------------3333--------1111---3333 KNKDFNAWSKEYARGFAKTGKSIYYSHASMSHSWDDWDYAAKVTLANSQKGTAGYIYRFL ----------------------------1111---------------------------- HDVSEGNDPSVGKNVKELVAYISTSGEKDAGTDDYMYFGIKTKDGKTQEWEMDNPGNDFM --1111---2222-------------2222-----------1111--------------2 TGSKDTYTFKLKDENLKIDDIQNMWIRKRKYTAFPDAYKPENIKVIANGKVVVDKDINEW 
222-------------3333--------------------------iiii---------- ISGNSTYNIK ---------- >5'-DEOXY-5'-METHYLTHIOADE; SWP:Q13126; PDB:1CB0A; AVKIGIIGGTGLDDPEILEGRTEKYVDTPFGKPSDALILGKIKNVDCVLLARHGRQHTIM --------2222-3333----------1111----------!!!!-----1111-----3 PSKVNYQANIWALKEEGCTHVIVTTACGSLREEIQPGDIVIIDQFIDRTTMRPQSFYDGS 333----------1111-------------33332222---------------------- HSCARGVCHIPMAEPFCPKTREVLIETAKKLGLRCHSKGTMVTIEGPRFSSRAESFMFRT 1111------------------------1111--------------------------11 WGADVINMTTVPEVVLAKEAGICYASIAMATDYDCWAVSVDRVLKTLKENANKAKSLLLT 11---------------1111--------------------------------------- TIPQIGSTEWSETLHNLKNMAQFSVLLP ----1111-------------------- >CALBINDIN D9K; SWP:P02632; PDB:1CB1; QKSPAELKSIFEKYAAKEGDPNQLSKEELKQLIQAEFPSLLKGPRTLDDLFQELDKNGDG -------------3333----------------------------------1111----- EVSFEEFQVLVKKISQ --3333----1111-- >CHONDROITINASE AC; SWP:Q59288; PDB:1CB8A; GTAELIMKRVMLDLKKPLRNMDKVAEKNLNTLQPDGSWKDVPYKDDAMTNWLPNNHLLQL -----------------2222-----------3333-11111111-----3333------ ETIIQAYIEKDSHYYGDDKVFDQISKAFKYWYDSDPKSRNWWHNEIATPQALGEMLILMR --------1111-2222-----------------------3333-------------111 YGKKPLDEALVHKLTERMKRGEPEKKTGANKTDIALHYFYRALLTSDEALLSFAVKELFY 1--------------1111--3333---------------------------------11 PVQFVHYEEGLQYDYSYLQHGPQLQISSYGAVFITGVLKLANYVRDTPYALSTEKLAIFS 11---------1111----------------------------2222------------- KYYRDSYLKAIRGSYMDFNVEGRGVSRPDILNKKAEKKRLLVAKMIDLKHTEEWADAIAR -------1111-----3333------2222--1111----------3333---------1 TDSTVAAGYKIEPYHHQFWNGDYVQHLRPAYSFNVRMVSKRTRRSESGNKENLLGRYLSD 111--1111------------------1111-------1111-----%%%%1111-1111 GATNIQLRGPEYYNIMPVWEWDKIPGITSRDYLTDRPLTKLWGEQGSNDFAGGVSDGVYG ---------1111-3333-11112222--------------------------------- ASAYALDYDSLQAKKAWFFFDKEIVCLGAGINSNAPENITTTLNQSWLNGPVISTAGKTG -------%%%%------------------------------------------------- RGKITTFKAQGQFWLLHDAIGYYFPEGANLSLSTQSQKGNWFHINNSHSKDEVSGDVFKL 
---------2222---%%%%--------------------33331111------------ WINHGARPENAQYAYIVLPGINKPEEIKKYNGTAPKVLANTNQLQAVYHQQLDMVQAIFY ----------------------3333---1111-------1111-----1111------- TAGKLSVAGIEIETDKPCAVLIKHINGKQVIWAADPLQKEKTAVLSIRDLKTGKTNRVKI ------iiii------------------------1111---------------------- DFPQQEFAGATVELK ---!!!!-------- >CYTOTOXIN 2; SWP:P01441; PDB:1CB9A; LKCKKLVPLFSKTCPAGKNLCYKMFMVAAPHVPVKRGCIDVCPKSSLLVKYVCCNTDKCN ----------------------------1111---------------------------- >COBALT-PRECORRIN-4 TRANSM; SWP:NA; PDB:1CBF; GLVPRGSHMKLYIIGAGPGDPDLITVKGLKLLQQADVVLYADSLVSQDLIAKSKPGAEVL -----3333----------1111-------1111-----------3333----2222--- KTAGMHLEEMVGTMLDRMREGKMVVRVHTGDPAMYGAIMEQMVLLKREGVDIEIVPGVTS -----3333--------1111---------3333-----------1111----------- VFAAAAAAEAELTIPDLTQTVILTRAEGRTPVPEFEKLTDLAKHKCTIALFLSSTLTKKV -----1111----2222---------------1111-----1111-------1111---- MKEFINAGWSEDTPVVVVYKATWPDEKIVRTTVKDLDDAMRTNGIRKQAMILAGWALDP ---------1111------2222--------3333------------------3333-- >CYANOGENIC BETA-GLUCOSIDA; SWP:P26205; PDB:1CBG; FKPLPISFDDFSDLNRSCFAPGFVFGTASSAFQYEGAAFEDGKGPSIWDTFTHKYPEKIK ---------1111-1111-1111------3333---1111--------------111111 DRTNGDVAIDEYHRYKEDIGIMKDMNLDAYRFSISWPRVLPKGKLSGGVNREGINYYNNL 11----!!!!------------1111--------3333-11111111------------- INEVLANGMQPYVTLFHWDVPQALEDEYRGFLGRNIVDDFRDYAELCFKEFGDRVKHWIT ----1111--------------------!!!!---------------------------- LNEPWGVSMNAYAYGTFAPGRCSDWLKLNCTGGDSGREPYLAAHYQLLAHAAAARLYKTK ----------------------3333-------1111----------------------- YQASQNGIIGITLVSHWFEPASKEKADVDAAKRGLDFMLGWFMHPLTKGRYPESMRYLVR -----------------------3333--------------------------------3 KRLPKFSTEESKELTGSFDFLGLNYYSSYYAAKAPRIPNARPAIQTDSLINATFEHNGKP 333----------2222-------------------2222--3333---------iiii- LGPMAASSWLCIYPQGIRKLLLYVKNHYNNPVIYITENGRNEFNDPTLSLQESLLDTPRI ------1111--3333----------------------------11113333---3333- DYYYRHLYYVLTAIGDGVNVKGYFAWSLFDNMEWDSGYTVRFGLVFVDFKNNLKRHPKLS 
-------------1111---------------!!!!-----------1111--------- AHWFKSFLKK -----1111- >CELLULAR RETINOIC ACID BI; SWP:P02695; PDB:1CBIA; PNFAGTWKMRSSENFDELLKALGVNAMLRKVAVAAASKPHVEIRQDGDQFYIKTSTTVRT ---------------------------------3333--------!!!!----------- TEINFKVGEGFEEETVDGRKCRSLPTWENENKIHCTQTLLEGDGPKTYWTRELANDELIL -----2222-----1111----------1111---------------------%%%%--- TFGADDVVCTRIYVRE ---!!!!--------- >PROTEIN (7,8-DIHYDRO-6-HY; SWP:P43777; PDB:1CBKA; MITAYIALGSNLNTPVEQLHAALKAISQLSNTHLVTTSSFYKSKPLGPQDQPDYVNAVAK ------------------------------------------------------------ IETELSPLKLLDELQRIENEQGRVRLRRWGERTLDLDILLYGNEIIQNERLTIPHYDMHN ------------------1111----2222-----------------1111---3333-- REFVIVPLFEIASDLVLPNSQIITELVKQFADHKMIKLNP 1111----1111----1111-333333331111------- >PROTEIN (FAB (BV04-01) AU; SWP:NA; PDB:1CBVH; EVQPVETGGGLVQPKGSLKLSCAASGFSFNTNAMNWVRQAPGKGLEWVARIRSKSNNYAT ------------2222-----------------------1111---------3333---- YYADSVKDRFTISRDDSQNMLYLQMNNLKTEDTAMYYCVRDQTGTAWFAYWGQGTLVTVS --3333----------------------1111---------------------------- AAKTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQS ------------------------------------------------------------ GLYTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLE -----------1111------------2222-------- >DELTA-ENDOTOXIN CYTB; SWP:Q04470; PDB:1CBY; CSAPIIRKPFKHIVLTVPSSDLDNFNTVFYVQPQYINQALHLANAFQGAIDPLNLNFNFE -------------------------------3333---------3333------------ KALQIANGIPNSAIVKTLNQSVIQQTVEISVMVEQLKKIIQEVLGLVINSTSFWNSVEAT ------------------------------------------------------------ IKGTFTNLDTQIDEAWIFWHSLSAHNTSYYYNILFSIQNEDTGAVMAVLPLAFEVSVDVE ------33332222--------1111------------3333-----------------3 KQKVLFFTIKDSARYEVKMKALTLVQALHSSNAPIVDIFNVNNYNLY 333----1111----------------------33331111------ >Periplasmic [NiFeSe] hydr; SWP:P13065; PDB:1CC1L; VKISIDPLTRVEGHLKIEVEVKDGKVVDAKCSGGMFRGFEQILRGRDPRDSSQIVQRICG ---------------------%%%%------------3333-22223333---3333--- 
VCPTAHCTASVMAQDDAFGVKVTTNGRITRNLIFGANYLQSHILHFYHLAALDYVKGPDV -------------------------------------------------1111------- SPFVPRYANADLLTDRIKDGAKADATNTYGLNQYLKALEIRRICHEMVAMFGGRMPHVQG -----------3333--------------------------------------------- MVVGGATEIPTADKVAEYAARFKEVQKFVIEEYLPLIYTLGSVYTDLFETGIGWKNVIAF -------------------------------------------3333------------- GVFPEDDDYKTFLLKPGVYIDGKDEEFDSKLVKEYVGHSFFDHSAPGGLHYSVGETNPNP -----1111----------iiii----3333----1111----------3333-----11 DKPGAYSFVKAPRYKDKPCEVGPLARMWVQNPELSPVGQKLLKELYGIEAKKFRDLGDKA 11-----------%%%%----------------------------------3333----- FSIMGRHVLRAEETWLTAVAVEKWLKQVQPGAETYVKSEIPDAAEGTGFTEAPRGALLHY ----------------------------2222-------------------1111----- LKIKDKKIENYQIVSATLWNANPRDDMGQRGPIEEALIGVPVPDIKNPVNVGRLVRSYDP ---%%%%-------!!!!------1111------3333-----33333333----1111- LGCAVH 3333-- >Periplasmic [NiFeSe] hydr; SWP:P13063; PDB:1CC1S; KKAPVIWVQGQGCTGCSVSLLNAVHPRIKEILLDVISLEFHPTVMASEGEMALAHMYEIA -------------------1111-----------------3333---------------- EKFNGNFFLLVEGAIPTAKEGRYCIVGEAKAHHHEVTMMELIRDLAPKSLATVAVGTCSA --2222-----------%%%%-----------------------3333------------ YGGIPAAEGNVTGSKSVRDFFADEKIEKLLVNVPGCPPHPDWMVGTLVAAWSHVLNPTEH --33332222----------------------------3333--------------1111 PLPELDDDGRPLLFFGDNIHENCPYLDKYDNSEFAETFTKPGCKAELGCKGPSTYADCAK -----1111-3333---3333-1111--1111----1111---3333--3333---3333 RRWNNGINWCVENAVCIGCVEPDFPDGKSPFYVAE ---%%%%-3333-----1111---1111-1111-- >CYTOCHROME C5; SWP:P11732; PDB:1CC5; GGGARSGDDVVAKYCNACHGTGLLNAPKVGDSAAWKTRADAKGGLDGLLAQSLSGLNAMP --------------3333---2222--2222-3333-----------3333--------- PKGTCADCSDDELKAAIGKMSGL --------3333----1111--- >METALLOCHAPERONE ATX1; SWP:P38636; PDB:1CC8A; AEIKHYQFNVVMTCSGCSGAVNKVLTKLEPDVSKIDISLEKQLVDVYTTLPYDFILEKIK -----------------------3333--------------------------------3 KTGKEVRSGKQL 333--------- >CLARA CELL 17 kD PROTEIN; SWP:P17559; PDB:1CCD; ICPGFLQVLEALLLGSESNYEAALKPFNPASDLQNAGTQLKRLVDTLPQETRINIVKLTE 
-----------------------3333--------------3333--------------- KILTSPLCEQDLRV -------------- >CYTOCHROME C551; SWP:P00101; PDB:1CCH; QDGEALFKSKPCAACHSVDTKMVGPALKEVAAKNAGVEGAADTLALHIKNGSQGVWGPIP --3333----3333----------------------1111-------1111--------- MPPNPVTEEEAKILAEWVLSLK -----------------1111- >CYTOCHROME C; SWP:P00055; PDB:1CCR; ASFSEAPPGNPKAGEKIFKTKCAQCHTVDKGAGHKQGPNLNGLFGRQSGTTPGYSYSTAD -3333----------------3333---2222-------2222-------2222--3333 KNMAVIWEENTLYDYLLNPKKYIPGTKMVFPGLKKPQERADLISYLKEATS -----------------3333-2222---------------------1111 >CHYMOTRYPSIN INHIBITOR; SWP:P56682; PDB:1CCVA; EECGPNEVFNTCGSACAPTCAQPKTRICTMQCRIGCQCQEGFLRNGEGACVLPENC ---2222-----------3333-----------------------------1111- >GLUTAMATE MUTASE; SWP:P80078; PDB:1CCWA; MEKKTIVLGVIGSDCHAVGNKILDHAFTNAGFNVVNIGVLSPQELFIKAAIETKADAILV ----------!!!!-------------1111----------3333--------------- SSLYGQGEIDCKGLRQKCDEAGLEGILLYVGGNIVVGKQHWPDVEKRFKDMGYDRVYAPG -----3333---------11112222-------------3333--------------222 TPPEVGIADLKKDLNIE 23333------------ >Methylaspartate mutase E ; SWP:P80077; PDB:1CCWB; MELKNKKWTDEEFHKQREEVLQQWPTGKEVDLQEAVDYLKKIPAEKNFAEKLVLAKKKGI ------------------3333----1111--------11113333-------------- TMAQPRAGVALLDEHIELLRYLQDEGGADFLPSTIDAYTRQNRYDECENGIKESEKAGRS ------------------------------------3333-------------------- LLNGFPGVNFGVKGCRKVLEAVNLPLQARHGTPDSRLLAEIIHAGGWTSNEGGGISYNVP -----3333---------1111--------------------1111--------1111-- YAKNVTIEKSLLDWQYCDRLVGFYEEQGVHINREPFGPLTGTLVPPSMSNAVGITEALLA ------------------------1111-------3333--------------------- AEQGVKNITVGYGECGNMIQDIAALRCLEEQTNEYLKAYGYNDVFVTTVFHQWMGGFPQD 1111--------------------------------1111-------------------- ESKAFGVIVTATTIAALAGATKVIVKTPHEAIGIPTKEANAAGIKATKMALNMLEGQRMP ---------------3333-------1111-----------------------2222--- MSKELETEMAVIKAETKCILDKMFELGKGDLAIGTVKAFETGVMDIPFGPSKYNAGKMMP ------------------------1111-----------------2222-1111------ 
VRDNLGCVRYLEFGNVPFTEEIKNYNRERLQERAKFEGRDVSFQMVIDDIFAVGKGRLIG --1111------!!!!-------------------------3333------3333----- RPE --- >CD58; SWP:P19256; PDB:1CCZA; FSQQIYGVVYGNVTFHVPSNVPLKEVLWKKQKDKVAELENSEFRAFSSFKNRVYLDTVSG -------2222---------------------------%%%%---!!!!----------- SLTIYNLTSSDEDEYEMESPNITDTMKFFLYVLEMVSKPMIYWECSNATLTCEVLEGTDV -------3333-------1111----------------------1111------------ ELKLYQGKEHLRSLRQKTMSYQWTNLRAPFKCKAVNRVSQESEMEVVNCPE -----!!!!--------------------------1111------------ >GRANULOCYTE COLONY-STIMUL; SWP:P09919; PDB:1CD9A; GPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIPWAPLSS ---------------------------------------3333--------------111 CPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIWQQMEE 1------3333-----------------iiii3333------------------------ LGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLAQP --------------------------------------------------- >Granulocyte colony-stimul; SWP:P40223; PDB:1CD9B; AGYPPASPSNLSCLMHLTTNSLVCQWEPGPETHLPTSFILKSFRSRADCQYQGDTIPDCV ---------------------------------------------2222----------- AKKRQNNCSIPRKNLLLYQYMAIWVQAENMLGSSESPKLCLDPMDVVKLEPPMLQALDIQ -2222-----3333---------------------------1111--------------- PGCLWLSWKPWKPSEYMEQECELRYQPQLKGANWTLVFHLPSSKDQFELCGLHQAPVYTL ----------3333---------------------------------------------- QMRCIRSSLPGFWSPWSPGLQLRPTM -------------------------- >CD2; SWP:NA; PDB:1CDCA; GTVWGALGHGINLNIPNFQMTDDIDEVRWERGSTLVAEFKRKMKPFLKSGAFEILANGDL -----2222-----------1111------!!!!--------------1111--1111-- KIKNLTRDDSGTYNVTVYSTNGTRILDKALDLRILE -----3333---------1111-------------- >ALCOHOL DEHYDROGENASE; SWP:P26325; PDB:1CDOA; ATVGKVIKCKAAVAWEANKPLVIEEIEVDVPHANEIRIKIIATGVCHTDLYHLFEGKHKD -2222--------------------------2222----------------------111 GFPVVLGHEGAGIVESVGPGVTEFQPGEKVIPLFISQCGECRFCQSPKTNQCVKGWANES 1-----------------------2222------------3333-1111-----3333-- PDVMSPKETRFTCKGRKVLQFLGTSTFSQYTVVNQIAVAKIDPSAPLDTVCLLGCGVSTG 
3333--------%%%%---2222-----------1111---11111111------3333- FGAAVNTAKVEPGSTCAVFGLGAVGLAAVMGCHSAGAKRIIAVDLNPDKFEKAKVFGATD ----------2222-------------------------------3333----1111--- FVNPNDHSEPISQVLSKMTNGGVDFSLECVGNVGVMRNALESCLKGWGVSVLVGWTDLHD --3333---3333--------------------------1111----------------- VATRPIQLIAGRTWKGSMFGGFKGKDGVPKMVKAYLDKKVKLDEFITHRMPLESVNDAID ---3333----------%%%%--------------------3333-----3333------ LMKHGKCIRTVLSL -1111--------- >CD59; SWP:P13987; PDB:1CDQ; LQCYNCPNPTADCKTAVNCSSDFDACLITKAGLQVYNKCWKFEHCNFNDVTTRLRENELT ----------------------------------------3333-------1111----- YYCCKKDLCNFNEQLEN ----------------- >CARDIOTOXIN VII4; SWP:P01452; PDB:1CDTA; LKCNKLIPIAYKTCPEGKNLCYKMMLASKKMVPVKRGCINVCPKNSALVKYVCCSTDRCN ------3333-----------------------------------1111----------- >TATA-BOX-BINDING PROTEIN; SWP:P20226; PDB:1CDWA; SGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGK ----------------------------------3333-----------------3333- MVCTGAKSEENSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQ --------------------------------------------------3333----33 QFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRK 33---1111-----------------3333--------3333----------------- >DNA-REPAIR PROTEIN XRCC1; SWP:P18887; PDB:1CDZA; ELPDFFQGKHFFLYGEFPGDERRKLIRYVTAFNGELEDYMSDRVQFVITAQEWDPSFEEA ---1111----------------------1111-------3333---------3333--- LMDNPSLAFVRPRWIYSCNEKQKLLPHQLYGVVPQA -------------------------3333------- ----------------------------------- >CAMPATH-1H:LIGHT CHAIN; SWP:NA; PDB:1CE1H; QVQLQESGPGLVRPSQTLSLTCTVSGFTFTDFYMNWVRQPPGRGLEWIGFIRDKAKGYTT ------------2222-----------1111--------2222---------3333---- EYNPSVKGRVTMLVDTSKNQFSLRLSSVTAADTAVYYCAREGHTAAPFDYWGQGSLVTVS --3333--------3333----------3333---------------------------- SASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQS --------------1111-iiii------------------%%%%--2222-------33 SGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVE 33----------3333------------1111-------- 
>IGKC protein; SWP:Q6GMW1; PDB:1CE1L; DIQMTQSPSSLSASVGDRVTITCKASQNIDKYLNWYQQKPGKAPKLLIYNTNNLQTGVPS -------------2222-----------!!!!------2222------------222233 RFSGSGSGTDFTFTISSLQPEDIATYYCLQHISRPRTFGQGTKVEIKRTVAAPSVFIFPP 33----------------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT ------------------------------------------------------------ LSKADYEKHKVYACEVTHQGLSSPVTKSFNR -3333------------1111---------- >V3 LOOP OF HIV-1 ENVELOPE; SWP:P20871; PDB:1CE4A; CTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHC ----------------1111-3333-----1111- >RIBOSOME-INACTIVATING PRO; SWP:Q6ITZ3; PDB:1CE7A; YERGDLDVTAQTTGAGYFSFITLLRDYVSSGSFSNAIPLLSQSGGGGEAGRFVLVELTNS ---------------------------------iiii---------------------11 GGDGITVAIDVTNLYVVAYQAGSQSYFLSGPGGRHGFTGTTRSSLPFNGSYPDLEQYGGQ 11-------1111------------------------------------3333------3 RKQIPLGIDQLIQSVTALKFPGSTRTGARSILILIQMISEAARFNPILWRARQYINSGAS 333--------------------33331111----------------------------- FLPDVYMLELETSWGQQSTQVQHSTDGVFNNPIALADPGGGVTLTNVRDVIASLAIMLFV --------------------1111---------------------33331111------- C - >Beta-galactoside-specific; SWP:Q6ITZ3; PDB:1CE7B; CSASEPTVRIVGRNGMNVDVRDDDFHDGNQIQLWPSKSNNDPNQLWTIKRDGTIRSNGSC ------------iiii---2222--2222------------1111----------iiii- LTTYGYTAGVYVMIFDCATAVGEATVWQIWGNGTIINPRSNLVLAASSGIKGTTLTVQTL ---------------3333----------1111----3333--------2222------- DYTLGQGWLAGNDTAPREVTIYGFNDLCMESGGGSVTVETCSSGKADKWALYGDGSIRPE --1111-----------------%%%%-----------------1111---3333---11 QNQAQCLTSGGDSVAGVNIVSCSGAASGQRWVFTNEGAILNLKNGLAMDVANPGGGRIII 11-------------------------------1111----------------------- YPATGKPNQMWLPVF -----1111------ >GCN4-PMSE; SWP:P03069; PDB:1CE9A; MSVKELEDKVEELLSKNYHLENEVARLKKLVGER --------------------------1111---- >MEROZOITE SURFACE PROTEIN; SWP:P04933; PDB:1CEJA; NISQHQCVKKQCPQNSGCFRHLDEREECKCLLNYKQEGDKCVENPNPTCNENNGGCDADA --------------------1111--------------------------------1111 
KCTEEDSGSNGKKITCECTKPDSYPLFDGIFCSSSN ------------------------------------ >CELLULASE CELC; SWP:P23340; PDB:1CEO; MVSFKAGINLGGWISQYQVFSKEHFDTFITEKDIETIAEAGFDHVRLPFDYPIIESDDNV -----------------------------3333----3333--------3333-----22 GEYKEDGLSYIDRCLEWCKKYNLGLVLDMHHAPGSTLFEDPNQQKRFVDIWRFLAKRYIN 22---------------------------------1111------------------111 EREHIAFELLNQVVEPDSTRWNKLMLECIKAIREIDSTMWLYIGGNNYNSPDELKNLADI 1--------------------------------------------%%%%11111111--- DDDYIVYNFHFYNPFFFTHQKAHWSESAMAYNRTVKYPGQYEGIEEFVKNNPKYSFMMEL -------------1111-2222----------------------------3333-----2 NNLKLNKELLRKDLKPAIEFREKKKCKLYCGEFGVIAIADLESRIKWHEDYISLLEEYDI 222--3333--------------------------1111--------------------- GGAVWNYKKMDFEIYNEDRKPVSQELVNILAR --------iiii---1111---------1111 >Cystatin [Precursor]; SWP:P01038; PDB:1CEWI; GAPVPVDENDEGLQRALQFAMAEYNRASNDKYSSRVVRVISAKRQLVSGIKYILQVEIGR ------1111-------------3333--------------------------------- TTCPKSSGDLQSCEFHDEPEMAKYTTCTFVVYSIPWLNQIKLLESKCQ ---3333----------------------------------------- >CUTINASE; SWP:P00590; PDB:1CEX; RTTRDDLINGNSASCADVIFIYARGSTETGNLGTLGPSIASNLESAFGKDGVWIQGVGGA ----3333--1111----------2222-------------------1111------!!! 
YRATLGDNALPRGTSSAAIREMLGLFQQANTKCPDATLIAGGYSQGAALAAASIEDLDSA !--3333--1111-------------------1111---------------------333 IRDKIAGTVLFGYTKNLQNRGRIPNYPADRTKVFCNTGDLVCTGSLIVAAPHLAYGPDAR 31111-------11111111--22223333-----22221111-----3333-------- GPAPEFLIEKVRAVRGS ----------------- >ARRESTIN; SWP:P08168; PDB:1CF1A; HVIFKKISRDKSVTIYLGKRDYIDHVERVEPVDGVVLVDPELVKGKRVYVSLTCAFRYGQ ------------------------3333-------------------------------- EDIDVMGLSFRRDLYFSQVQVFPPVGASGATTRLQESLIKKLGANTYPFLLTFPDYLPCS -2222--------------------1111--3333------------------------- VMLQPAPQDVGKSCGVDFEIKAFATHSTDVEEDKIPKKSSVRLLIRKVQHAPRDMGPQPR -----1111-----------------3333-----3333--------------------- AEASWQFFMSDKPLRLAVSLSKEIYYHGEPIPVTVAVTNSTEKTVKKIKVLVEQVTNVVL --------%%%%-------------2222------------------------------- YSSDYYIKTVAAEEAQEKVPPNSSLTKTLTLVPLLANNRERRGIALDGKIKHEDTNLASS ---------------------------------3333-----------1111-------- TIIKEGIDKTVMGILVSYQIKVKLTVSGLLGELTSSEVATEVPFRLMHPQPEDNFVFEEF ---2222----------------------------------------------------- ARQNLKDAGEYKE ------------- >Glyceraldehyde-3-phosphat; SWP:P10618; PDB:1CF2P; MKAVAINGYGTVGKRVADAIAQQDDMKVIGVSKTRPDFEARMALKKGYDLYVAIPERVKL ---------3333------3333--------------------1111------1111--- FEKAGIEVAGTVDDMLDEADIVIDCTPEGIGAKNLKMYKEKGIKAIFQGGEKHEDIGLSF ---------------1111-------22223333-------------11113333----- NSLSNYEESYGKDYTRVVSCNTTGLCRTLKPLHDSFGIKKVRAVIVRRGADPAQVSKGPI 333333332222--------------------------------------1111------ NAIIPNPPKLPSHHGPDVKTVLDINIDTMAVIVPTTLMHQHNVMVEVEETPTVDDIIDVF -----------------3333--------------------------------------- EDTPRVILISAEDGLTSTAEIMEYAKELGRSRNDLFEIPVWRESITVVDNEIYYMQAVHQ ---------3333-----------3333-2222-------3333---!!!!-------33 ESDIVPENVDAVRAILEMEEDKYKSINKTNKAMNIL 33------------------3333------------ >GLUCOSE OXIDASE; SWP:P13006; PDB:1CF3A; GIEASLLTDPKDVSGRTVDYIIAGGGLTGLTTAARLTENPNISVLVIESGSYESDRGPII 3333----33332222------------------11111111----------11113333 
EDLNAYGDIFGSSVDHAYETVELATNNQTALIRSGNGLGGSTLVNGGTWTRPHKAQVDSW -1111-1111-1111---------------------22223333--------3333---- ETVFGNEGWNWDNVAAYSLQAERARAPNAKQIAAGHYFNASCHGVNGTVHAGPRDTGDDY -----22223333------------------3333---3333------------------ SPIVKALMSAVEDRGVPTKKDFGCGDPHGVSMFPNTLHEDQVRSDAAREWLLPNYQRPNL -----------1111----------------------1111---3333--3333--1111 QVLTGQYVGKVLLSQNGTTPRAVGVEFGTHKGNTHNVYAKHEVLLAAGSAVSPTILEYSG -----------------------------1111--------------1111--------- IGMKSILEPLGIDTVVDLPVGLNLQDQTTATVRSRITSAGAGQGQAAWFATFNETFGDYS --33333333--------2222--------------3333---------------!!!!- EKAHELLNTKLEQWAEEAVARGGFHNTTALLIQYENYRDWIVNHNVAYSELFLDTAGVAS ------------------1111--------------------------------iiii-- FDVWDLLPFTRGYVHILDKDPYLHHFAYDPQYFLNELDLLGQAAATQLARNISNSGAMQT -------------------1111-------2222--------------------!!!!11 YFAGETIPGDNLAYDADLSAWTEYIPYHFRPNYHGVGTCSMMPKEMGGVVDNAARVYGVQ 11-----!!!!-1111-------3333---------------3333---------2222- GLRVIDGSIPPTQMSSHVMTVFYAMALKISDAILEDYASMQ -----3333--------3333-------------------- >Activated CDC42 kinase 1; SWP:Q07912; PDB:1CF4B; GSGLSAQDISQPLQNSFIHTGHGDSDPRHCWGFPDRIDELYLGN -----%%%%----------------------------3333--- >BETA-MOMORCHARIN; SWP:P29339; PDB:1CF5A; DVNFDLSTATAKTYTKFIEDFRATLPFSHKVYDIPLLYSTISDSRRFILLNLTSYAYETI ----1111------------1111------iiii-------3333--------1111--- SVAIDVTNVYVVAYRTRDVSYFFKESPPEAYNILFKGTRKITLPYTGNYENLQTAAHKIR ----------------------22223333-----------------1111--1111-11 ENIDLGLPALSSAITTLFYYNAQSAPSALLVLIQTTAEAARFKYIERHVAKYVATNFKPN 11---------------------3333--------------------------------- LAIISLENQWSALSKQIFLAQNQGGKFRNPVDLIKPTGQRFQVTNVDSDVVKGNIKLLLN ----------------------------------1111------3333------------ SRASTADEN --------- >TRANSCRIPTION FACTOR E2F-; SWP:Q16254; PDB:1CF7A; SRHEKSLGLLTTKFVSLLQEAKDGVLDLKLAADTLAVRQKRRIYDITNVLEGIGLIEKKS --------------------------------------3333-----------------2 KNSIQWK 222---- >Transcription factor Dp-2; SWP:Q14188; PDB:1CF7B; 
GKGLRHFSMKVCEKVQRKGTTSYNEVADELVSEFTNSNNHLAADSAYDQKNIRRRVYDAL --3333------------------------------11113333---------------- NVLMAMNIISKEKKEIKWIGLP ---1111----iiii------- >PROTEIN (CATALYTIC ANTIBO; SWP:NA; PDB:1CF8H; DVQLQESGPGLVKPSQSLSLTCTVTGYSITSGYAWNWIRQFPGNKLEWMGYIRYSGDTRY ----------------------------------------2222--------3333---- NPSLKSRISITRDTSKNQFFLQLNSV 3333---------1111--------- >EG628498 protein; SWP:A0A5E0; PDB:1CF8L; DIVLTQSPTIMSVSPGEKVTLTCSASSSVSSNYVYWYQQKPGSSPKVWIYSTSNLASGVP -------------2222------------3333------2222------------22223 ARFSGSGSGTSYSLTISSMEAEDAASYFCLQWSSFPYTFGGGTKLELKRADVAPTVSIFP 333----------------1111------------------------------------- PSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTL ------------------------------iiii-------------------------- TLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ----3333--------------------------- >COMPLEMENT 5A SEMI-SYNTHE; SWP:P01031; PDB:1CFAA; MLQKKIEEIAAKYKHSVVKKCCYDGASVNNDETCEQRAARISLGPRCIKAFTECCVVASQ ---------------------------------------------3333----------- LRANISHKDMC ----------- >DROSOPHILA NEUROGLIAN; SWP:P20241; PDB:1CFB; IVQDVPNAPKLTGITCQADKAEIHWEQQGDNRSPILHYTIQFNTSFTPASWDAAYEKVPN -----------------------------iiii----------3333-----------11 TDSSFVVQMSPWANYTFRVIAFNKIGASPPSAHSDSCTTQPDVPFKNPDNVVGQGTEPNN 11--------------------3333------------------------------1111 LVISWTPMPEIEHNAPNFHYYVSWKRDIPAAAWENNNIFDWRQNNIVIADQPTFVKYLIK --------3333---------------2222--------1111----------------- VVAINDRGESNVAAEEVVGYSGEDR ----1111----------------- >COAGULATION FACTOR IX; SWP:P00740; PDB:1CFH; YNSGKLEEFVQGNLERECMEEKCSFEEAREVFENTERTTEFWKQYVD ---------------------------------1111--1111---- >RESTRICTION ENDONUCLEASE; SWP:P56200; PDB:1CFR; MDIISKSGEGNKYTINSAIAFVAYASHIDINTTEFSKVLSGLRDFINDEAIRLGGKISDG --------!!!!------------11113333-3333----------------------- SFNKCNGDWYEWLIGIRAIEFFLESETNFIVVKMPNATSFDVMSIYKSCLSEFIYDLRSK -----------------------------------3333---3333-------------- 
LSLNNVNLITSNPDFSIIDIRGRREELKSMLKDISFSNISLSTISEIDNLYKNFIDYAEL -1111-------------------------11113333-3333-------1111----11 EHIKSFLSVKTTFRPDRRLQLAHEGSLMKALYTHLQTRTWTINPTGIRYYAAATSIGNAD 11----------------------------------1111----------------3333 VIGLKTVATHSITDVKSLPQSAVDEIFKINSVLDVDSCLSHIL -------3333-------------------------------- >MONOCLONAL ANTIBODY FV415; SWP:NA; PDB:1CFVH; QVQLQESGGGLVNLGGSMTLSCVASGFTFNTYYMSWVRQTPEKTLELVAAINSDGEPIYY ------------2222-----------3333--------1111--------3333----- PDTLKGRVTISRDNAKKTLYLQMSSLNFEDTALYYCARLNYAVYGMDYWGQGTTVTVSS 1111---------1111---------1111----------3333--------------- >MONOCLONAL ANTIBODY FV415; SWP:KV2G_MOUSE; PDB:1CFVL; DIELTQSPPSLPVSLGDQVSISCRSSQSLVSNNRRNYLHWYLQKPGQSPKLVIYKVSNRF -------------2222--------------------------2222------------2 SGVPDRFSGSGSGTDFTLKISRVAAEDLGLYFCSQSSHVPLTFGSGTKLEIKR 2221111----------------3333-------------------------- >DESULFOREDOXIN; SWP:P00273; PDB:1CFWA; ANEGDVYKCELCGQVVKVLEEGGGTLVCCGEDMVKQ -2222----------------------%%%%----- >COFILIN; SWP:Q03048; PDB:1CFYA; VAVADESLTAFNDLKLGKKYKFILFGLNDAKTEIVVKETSTDPSYDAFLEKLPENDCLYA ---------------------------1111------------33333333--------- IYDFEYEINGNEGKRSKIVFFTWSPDTAPVRSKMVYASSKDALRRALNGVSTDVQGTDFS ------------------------11113333----------3333-----------333 EVSYDSVLERVSR 3------------ >HYDROGENASE 2 MATURATION ; SWP:P37182; PDB:1CFZA; MRILVLGVGNILLTDEAIGVRIVEALEQRYILPDYVEILDGGTAGMELLGDMANRDHLII ---------1111-------------------1111----!!!!3333-3333------- ADAIVSKKNAPGTMMILRDEEVPALFTNKISPHQLGLADVLSALRFTGEFPKKLTLVGVI ---------2222----!!!!-----------------------1111------------ PESLEPHIGLTPTVEAMIEPALEQVLAALRESGVEAIPRSDS ----------33331111-----------1111----3333- >CARBOXYPEPTIDASE G2; SWP:P06621; PDB:1CG2A; QKRDNVLFQAATDEQPAVIKTLEKLVNIETGTGDAEGIAAAGNFLEAELKNLGFTVTRSK ------------------------------2222---------------1111------- SAGLVVGDNIVGKIKGRGGKNLLLMSHMDTVYLKGILAKAPFRVEGDKAYGPGIADDKGG --------------------------------2222--------!!!!--2222--3333 
NAVILHTLKLLKEYGVRDYGTITVLFNTDEEKGSFGSRDLIQEEAKLADYVLSFEPTSAG -----------1111-------------3333-1111------3333----------222 DEKLSLGTSGIAYVQVNITGKASHAGAAPELGVNALVEASDLVLRTMNIDDKAKNLRFNW 2----------------------11113333--------------1111--1111----- TIAKAGNVSNIIPASATLNADVRYARNEDFDAAMKTLEERAQQKKLPEADVKVIVTRGRP -------1111--------------3333----------1111--1111----------- AFNAGEGGKKLVDKAVAYYKEAGGTLGVEERTGGGTDAAYAALSGKPVIESLGLPGFGYH -------------------1111------------------------------------- SDKAEYVDISAIPRRLYMAARLIMDLGAG -------3333------------------ >HEMOGLOBIN; SWP:P56691; PDB:1CG5A; VLSSQNKKAIEELGNLIKANAEAWGADALARLFELHPQTKTYFSKFSGFEACNEQVKKHG ------------------------------------------1111---3333------- KRVMNALADATHHLDNLHLHLEDLARKHGENLLVDPHNFHLFADCIVVTLAVNLQAFTPV ------------1111------------------3333---------------------- THCAVDKFLELVAYELSSCYR --------------1111--- >Hemoglobin subunit beta; SWP:P56692; PDB:1CG5B; VKLSEDQEHYIKGVWKDVDHKQITAKALERVFVVYPWTTRLFSKLQGLFSANDIGVQQHA ------------------------------------1111-3333----1111------- DKVQRALGEAIDDLKKVEINFQNLSGKHQEIGVDTQNFKLLGQTFMVELALHYKKTFRPK ------------11113333-------------3333---------------!!!!---- EHAAAYKFFRLVAEALSSNYH --------------------- >NON HISTONE PROTEIN 6 A; SWP:P11632; PDB:1CG7A; MVTPREPKKRTTRKKKDPNAPKRALSAYMFFANENRDIVRSENPDITFGQVGKKLGEKWK -------------------------3333-------3333--11113333---3333--- ALTPEEKQPYEAKAQADKKRYESEKELYNATLA --3333----------------3333--3333- >Coat protein; SWP:P69475; PDB:1CGME; AYNPITPSKLIAFSASYVPVRTLLNFLVASQGTAFQTQAGRDSFRESLSALPSSVVDINS ----------------------------3333-----3333------------------- RFPDAGFYAFLNGPVLRPIFVSLLSSTDTRNRVIEVVDPSNPTTAESLNAVKRTDDASTA --------------3333--------------------------------3333------ ARAEIDNLIESISKGFDVYDRASFEAAFSVVWSEATTSKA ---------1111---------3333-------------- >IGG2B-KAPPA NC6.8 FAB (HE; SWP:NA; PDB:1CGSH; RVQLLESGAELMKPGASVQISCKATGYTFSEYWIEWVKERPGHGLEWIGEILPGSGRTNY ------------------------------------------------------------ 
REKFKGKATFTADTSSNTAYMQLSSLTSEDSAVYYCTRGYSSMDYWGQGTSVTVSAAKTT 3333----------------------1111------------------------------ PPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSGLYTM -----------------------------------------------------iiii--- SSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLE -------3333----------3333--------- >CYCLODEXTRIN GLYCOSYL-TRA; SWP:P30920; PDB:1CGT; DPDTAVTNKQSFSTDVIYQVFTDRFLDGNPSNNPTGAAYDATCSNLKLYCGGDWQGLINK -1111--11111111-----3333----3333---11111111----------------- INDNYFSDLGVTALWISQPVENIFATINYSGVTNTAYHGYWARDFKKTNPYFGTMADFQN ----------------------------iiii---1111---------3333-------- LITTAHAKGIKIVIDFAPNHTSPAMETDTSFAENGRLYDNGTLVGGYTNDTNGYFHHNGG -----1111---------------1111----%%%%--iiii-------1111------- SDFSSLENGIYKNLYDLADFNHNNATIDKYFKDAIKLWLDMGVDGIRVDAVKHMPLGWQK ----3333------------3333--------------1111-------3333------- SWMSSIYAHKPVFTFGEWFLGSAASDADNTDFANKSGMSLLDFRFNSAVRNVFRDNTSNM ------------------------------------------------------------ YALDSMINSTATDYNQVNDQVTFIDNHDMDRFKTSAVNNRRLEQALAFTLTSRGVPAIYY ---------------1111------1111----11113333-----------------22 GTEQYLTGNGDPDNRAKMPSFSKSTTAFNVISKLAPLRKSNPAIAYGSTQQRWINNDVYV 22---------1111-------------------3333--3333----------1111-- YERKFGKSVAVVAVNRNLSTSASITGLSTSLPTGSYTDVLGGVLNGNNITSTNGSINNFT ----!!!!-----------------------------1111------------------- LAAGATAVWQYTTAETTPTIGHVGPVMGKPGNVVTIDGRGFGSTKGTVYFGTTAVTGAAI -2222-----------------------2222-----------------!!!!--!!!!- TSWEDTQIKVTIPSVAAGNYAVKVAASGVNSNAYNNFTILTGDQVTVRFVVNNASTTLGQ ---1111------------------iiii---------------------------2222 NLYLTGNVAELGNWSTGSTAIGPAFNQVIHQYPTWYYDVSVPAGKQLEFKFFKKNGSTIT -------3333%%%%-1111---------------------------------------- WESGSNHTFTTPASGTATVTVNWQ ------------------------ >MODULE-SUBSTITUTED CHIMER; SWP:Q52MT0; PDB:1CH4A; VHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKV ---3333-------1111------------------3333-3333--------------- KAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLRVDPVNFKLLSHCLLVTLAAHLPA 
----------------3333------------------3333---------------333 EFTPAVHASLDKVLASVSTVLTSKYR 3--------------------1111- ------------------------------------------------------------ -------- >CHEB METHYLESTERASE; SWP:P04042; PDB:1CHD; LLSSEKLIAIGASTGGTEAIRHVLQPLPLSSPAVIITQHMPPGFTRSFAERLNKLCQISV ------------2222-------33331111----------------------------- KEAEDGERVLPGHAYIAPGDKHMELARSGANYQIKIHDGPPVNRHRPSVDVLFHSVAKHA ---2222----------2222------!!!!----------%%%%--------------! GRNAVGVILTGMGNDGAAGMLAMYQAGAWTIAQNEASCVVFGMPREAINMGGVSEVVDLS !!!----------2222------1111-------1111---------1111------333 QVSQQMLAKISAGQAIRI 3----------!!!!--- >CHITOSANASE; SWP:P33665; PDB:1CHKA; AGAGLDDPHKKEIAMELVSSAENSSLDWKAQYKYIEDIGDGRGYTGGIIGFCSGTGDMLE --!!!!----------------------1111-------------------3333----- LVQHYTDLEPGNILAKYLPALKKVNGSASHSGLGTPFTKDWATAAKDTVFQQAQNDERDR -------------3333------2222--2222--------------------------- VYFDPAVSQAKADGLRALGQFAYYDAIVMHGPGNDPTSFGGIRKTAMKKARTPAQGGDET ----------1111--------------------1111-------------3333----- TYLNGFLDARKAAMLTEAAHDDTSRVDTEQRVFLKAGNLDLNPPLKWKTYGDPYVINS ----------------------3333-------11111111-------iiii------ >CHLOROTOXIN; SWP:P45639; PDB:1CHL; MCMPCFTTDHQMARKCDDCCGGKGRGKCYGPQCLCR ---------------3333--iiii----------- >CREATINE AMIDINOHYDROLASE; SWP:P38488; PDB:1CHMA; QMPKTLRIRNGDKVRSTFSAQEYANRQARLRAHLAAENIDAAIFTSYHNINYYSDFLYCS ----------------------------------1111-------3333----------% FGRPYALVVTEDDVISISANIDGGQPWRRTVGTDNIVYTDWQRDNYFAAIQQALPKARRI %%%---------------3333---------------------3333------------- GIEHDHLNLQNRDKLAARYPDAELVDVAAACMRMRMIKSAEEHVMIRHGARIADIGGAAV --1111------------1111-------------------------------------- VEALGDQVPEYEVALHATQAMVRAIADTFEDVELMDTWTWFQSGINTDGAHNPVTTRKVN 1111----3333------------------------------!!!!--1111-------2 KGDILSLNCFPMIAGYYTALERTLFLDHCSDDHLRLWQVNVEVHEAGLKLIKPGARCSDI 222---------iiii-------------3333-------------1111-----3333- ARELNEIFLKHDVLQYRTFGYGHSFGTLSHYYGREAGLELREDIDTVLEPGMVVSMEPMI 
------------1111------------1111--1111----------2222-------- MLPEGLPGAGGYREHDILIVNENGAENITKFPYGPEKNIIR --------------------1111---------3333---- >L-ASPARTATE OXIDASE; SWP:P10902; PDB:1CHUA; NTLPEHSCDVLIIGSGAAGLSLALRLADQHQVIVLSKGPVTEFDETDSIDSHVEDTLIAG ---------------3333-------1111--------1111--3333------------ AGICDRHAVEFVASNARSCVQWLIDQGVLTTLVSKALNHPNIRVLERTNAVDLIVSDKIG -----------------------1111-----------1111------------1111-- LPGTRRVVGAWVWNRNKETVETCHAKAVVLATGGASKVYQYTTNPDISSGDGIAMAWRAG -------------------------------------------3333---------1111 CRVANLEFNQFHPTALYHPQARNFLLTEALRGEGAYLKRPDGTRFMPDFDERGELAPRDI --------------------%%%%--33331111----3333--3333-1111---3333 VARAIDHEMKRLGADCMFLDISHKPADFIRQHFPMIYEKLLGLGIDLTQEPVPIVPAAHY ---------1111-------3333---------3333--------1111----------- TCGGVMVDDHGRTDVEGLYAIGEVSYTGLHGANRMASNSLLECLVYGWSAAEDITRRMHD -------1111---2222---3333----!!!!-2222---------------------- ISTLPPWDESRVENPDERVVIQHNWHELRLFMWDYVGIVRTTKRLERALRRITMLQQEID -------------3333------------------------------------------- EYYAHFRVSNNLLELRNLVQVAELIVRCAMMRKESRGLHFTLDYPELLTHSGPSILSP ----------------------------3333---!!!!-1111-------------- ------------------------------------------------------------ >PNP OXIDASE; SWP:P38075; PDB:1CI0A; FTLNEKQLTDDPIDLFTKWFNEAKEDPRETLPEAITFSSAELPSGRVSSRILLFKELDHR ---3333---3333-----------3333-1111-------------------------- GFTIYSNWGTSRKAHDIATNPNAAIVFFWKDLQRQVRVEGITEHVNRETSERYFKTRPRG ----------------1111---------1111--------------------------- SKIGAWASRQSDVIKNREELDELTQKNTERFKDAEDIPCPDYWGGLRIVPLEIEFWQGRP --------2222----------------1111-------1111---------------11 SRLHDRFVYRRKTENDPWKVVRLAP 11----------1111--------- >Apocytochrome f [Precurso; SWP:P95522; PDB:1CI3M; YPFWAQQNYANPREATGRIVCANCHLAAKPAEIEVPQAVLPDSVFKAVVKIPYDHSVQQV 3333---------1111------------------------------------1111--- QADGSKGPLNVGAVLMLPEGFTIAPEDRIPEEMKEEVGPSYLFQPYADDKQNIVLVGPLP 1111-------------2222---1111-3333-----3333----1111---------3 
GDEYEEIVFPVLSPNPATNKSVAFGKYSIHLGANRGRGQIYPTGEKSNNAVYNASAAGVI 333-----------33331111------------------1111---------------- TAIAKADDGSAEVKIRTEDGTTIVDKIPAGPELIVSEGEEVAAGAALTNNPNVGGFGQKD -----------------------------------2222--2222--------------- TEIVLQSPN --------- >BARRIER-TO-AUTOINTEGRATIO; SWP:O75531; PDB:1CI4A; TTSQKHRDFVAEPGEKPVGSLAGIGEVLGKKLEERGFDKAYVVLGQFLVLKKDEDLFREW ----------------33332222--------1111--3333-------%%%%------- LKDTCGANAKQSRDCFGCLREWCDAFL --------------------------- >LYMPHOCYTE FUNCTION-ASSOC; SWP:P19256; PDB:1CI5A; SSQQIYGVKYGNVTFHVPSNQPLKEVLWKKQKDKVAELENSEFRAFSSFKNRVYLDTKSG -------2222------------------!!!!-----%%%%-----3333--------- SLTIYNLTSSDEDEYEMESPNITDSMKFFLYVGES -------1111-------3333------------- >TRANSCRIPTION FACTOR ATF-; SWP:P18848; PDB:1CI6A; EQNKTAATRYRQKKRAEQEALTGECKELEKKNEALKERADSLAKEIQYLKDLIEEV ----------------------------------------------------3333 ----------------------------------------------- >CARBOXYLESTERASE; SWP:Q9KX40; PDB:1CI9A; AASLAARLDAVFDQALRERRLVGAVAIVARHGEILYRRAQGLADREAGRPMREDTLFRLA -----------------------------iiii-----------1111---1111----- SVTKPIVALAVLRLVARGELALDAPVTRWLPEFRPRLADGSEPLVTIHHLLTHTSGLGYW --------------------11113333-1111-------------------------33 LLEGAGSVYDRLGISDGIDLRDFDLDENLRRLASAPLSFAPGSGWQYSLALDVLGAVVER 33-2222------------------------1111----2222----------------- ATGQPLAAAVDALVAQPLGMRDCGFVSAEPERFAVPYHDGQPEPVRMRDGIEVPLPEGHG ---------------1111---------3333---------------2222----2222- AAVRFAPSRVFEPGAYPSGGAGMYGSADDVLRALEAIRANPGFLPETLADAARRDQAGVG -----1111--1111--1111-----------------------------1111------ AETRGPGWGFGYLSAVLDDPAAAGTPQHAGTLQWGGVYGHSWFVDRALGLSVLLLTNTAY ----2222----------3333-----2222----3333------1111----------- EGMSGPLTIALRDAVYA ----------------- >CHLORAMPHENICOL ACETYLTRA; SWP:P00484; PDB:1CIA; MNYTKFDVKNWVRREHFEFYRHRLPCGFSLTSKIDITTLKKSLDDSAYKFYPVMIYLIAQ ------33331111---------------------------------------------- AVNQFDELRMAIKDDELIVWDSVDPQFTVFHQETETFSALSCPYSSDIDQFMVNYLSVME 
----3333-----------------------1111------------------------- RYKSDTKLFPQGVTPENHLNISALPWVNFDSFNLNVANFTDYFAPIITMAKYQQEGDRLL -1111---1111-----------1111-----------2222------------!!!!-- LPLSVQVHQAVCDGFHVARFINRLQELCNSKLK -------3333---------------1111--- >IG HEAVY CHAIN V REGIONS; SWP:NA; PDB:1CICB; QVQLQQPGSELVRPGASVKLSCKASGYTFTNYWMHWVKQRPGQGLEWIGNIYPGSGDSNY ------------2222-----------1111----------------------------- DEKFKSKATLTVDTSSSTAYMQLSGLTSEDSAVYYCARGLAFYFDHWGQGTTLTVSSALT 3333---------1111---------1111---------1111----------------- TPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPEPVTVTWNSGSLSSSVHTFPALLQSGLYT ------------------------------------------------------iiii-- MSSSVTVPSSTWPSETVTCSVAHPASSTTVDKKLEPS -------1111-----------3333----------- >T CELL SURFACE GLYCOPROTE; SWP:P05540; PDB:1CID; TSITAYKSEGESAEFSFPLNLGEESLQGELRWKAEKAPSSQSWITFSLKNQKVSVQKSTS -------2222-------------------------------------%%%%-------- NPKFQLSETLPLTLQIPQVSLQFAGSGNLTLTLDRGILYQEVNLVVMKVTQPDSNTLTCE -------------------3333------------------------------------- VMGPTSPKMRLILKQENQEARVSRQEKVIQVQAPEAGVWQCLLSEGEEVKMDSKIQV --------------------------------------------!!!!--------- >ORPHAN NUCLEAR RECEPTOR N; SWP:P12813; PDB:1CITA; GRCAVCGDNASCQHYGVRTCEGCKGFFKRTVQKSAKYICLANKDCPVDKRRRNRCQFCRF -------------iiii---------------------------------1111------ QKCLAVGMVKEVVRTDSLKGRRGRLPSKP ---1111-3333---1111---------- >CYCLODEXTRIN GLYCOSYLTRAN; SWP:P26827; PDB:1CIU; ASDTAVSNVVNYSTDVIYQIVTDRFVDGNTSNNPTGDLYDPTHTSLKKYFGGDWQGIINK -1111--11111111-----3333----3333---33331111-1111------------ INDGYLTGMGVTAIWISQPVENIYAVLPDSTFGGSTSYHGYWARDFKRTNPYFGSFTDFQ 1111--1111--------------------------3333----1111-1111------- NLINTAHAHNIKVIIDFAPNHTSPASETDPTYAENGRLYDNGTLLGGYTNDTNGYFHHYG ------1111---------------1111----%%%%-------------1111------ GTDFSSYEDGIYRNLFDLADLNQQNSTIDSYLKSAIKVWLDMGIDGIRLDAVKHMPFGWQ -----3333-----!!!!---------------------1111-------1111------ KNFMDSILSYRPVFTFGEWFLGTNEIDVNNTYFANESGMSLLDFRFSQKVRQVFRDNTDT 
------1111-----------2222----------------------------------- MYGLDSMIQSTASDYNFINDMVTFIDNHDMDRFYNGGSTRPVEQALAFTLTSRGVPAIYY ---------1111---1111------1111----------------------------22 GTEQYMTGNGDPYNRAMMTSFNTSTTAYNVIKKLAPLRKSNPAIAYGTTQQRWINNDVYI 22--------------------------------------3333----------1111-- YERKFGNNVALVAINRNLSTSYNITGLYTALPAGTYTDVLGGLLNGNSISVASDGSVTPF ----!!!!-----------------------------1111----------1111----- TLSAGEVAVWQYVSSSNSPLIGHVGPTMTKAGQTITIDGRGFGTTSGQVLFGSTAGTIVS --2222-----------------------2222--------------------------- WDDTEVKVKVPSVTPGKYNISLKTSSGATSNTYNNINILTGNQICVRFVVNNASTVYGEN -1111------------------1111----------------------------2222- VYLTGNVAELGNWDTSKAIGPMFNQVVYQYPTWYYDVSVPAGTTIQFKFIKKNGNTITWE ------3333iiii---------------------------------------------- GGSNHTYTVPSSSTGTVIVNWQQ ----------------------- >NADP-MALATE DEHYDROGENASE; SWP:P46489; PDB:1CIVA; LPAKQKPECFGVFCLTYDLKAEEETKSWKKIINVAVSGAAGMISNHLLFKLASGEVFGPD -----------1111---------3333---------1111------------------- QPISLKLLGSERSFAALEGVAMELEDSLYPLLRQVSIGIDPYEIFQDAEWALLIGAKPRG ----------1111--------------3333-------3333-2222------------ PGMERADLLDINGQIFAEQGKALNAVASPNVKVMVVGNPCNTNALICLKNAPNIPPKNFH ---3333-------------------------------------------33333333-- ALTRLDENRAKCQLALKAGVFYDKVSNVTIWGNHSTTQVPDFLNAKIHGIPVTEVIRDRK --------------------1111-----------------1111-----3333---333 WLEDEFTNMVQTRGGVLIKKWGRSSAASTAVSIVDAIRSLVTPTPEGDWFSTGVYTNGNP 3----3333-------3333------------------------2222-------2222- YGIAEDIVFSMPCRSKGDGDYEFVKDVIFDDYLSKKIKKSEDELLAEKKCVAHLTGEGIA --------------------------------------------------1111------ VCDLPEDTMLPGEM ---------2222- >TACHYSTATIN A; SWP:Q9U8X3; PDB:1CIXA; YSRCQLQGFNCVVRSYGLPTIPCCRGLTCRSYFPGSTYGRCQRY --------------2222-----2222----------------- >CRYIA(A); SWP:P0A366; PDB:1CIY; YTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQIEQLINQRIEEFA -3333------------------------------------------1111--------- RNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLLAVQN 
-----------------------33331111----------------------1111222 YQVPLLSVYVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYN 23333----------------------1111-3333------------------------ TGLERVWGPDSRDWVRYNQFRRELTLTVLDIVALFSNYDSRRYPIRTVSQLTREIYTNPV --3333------------------------3333----3333---------------333 LENFDGSFRGMAQRIEQNIRQPHLMDILNSITIYTDVHRGFNYWSGHQITASPVGFSGPE 3--------------1111------------------%%%%-----------2222---- FAFPLFGNAGNAAPPVLVSLTGLGIFRTLSSPLYRRIILGSGPNNQELFVLDGTEFSFAS ------------------------------------------------------------ LTTNLPSTIYRQRGTVDSLDVIPPQDNSVPPRAGFSHRLSHVTMLSQAAGAVYTLRAPTF ----------------3333---------------------------2222--------- SWQHRSAEFNNIIPSSQITQIPLTKSTNLGSGTSVVKGPGFTGGDILRRTSPGQISTLRV ---3333---------------1111---2222--------------------------- NITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSNLQSGSFRTVGFTT ----3333-------------------iiii-----------2222--3333-------- PFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVT --------------------------------3333- >ACTIN-FRAGMIN KINASE; SWP:Q94706; PDB:1CJAA; AGALWEIEKELFTKLPAPSSAINSHLQPAKPFKVDLSTAVSYNDIGDINWKNLQQFKGIE ----------------------3333--------3333----------11113333---- RSEKGTEGLFFVETESGVFIVKRSTNIESETFCSLLCMRLGLHAPKVRVVSSNSEEGTNM -------------1111------------------------------------------- LECLAAIDKSFRVITTLANQANILLMELVRGITLNKLTTTSAPEVLTKSTMQQLGSLMAL ------------33333333------------3333----3333---------------- DVIVNNSDRLPIAWTNEGNLDNIMLSERGATVVPIDSKIIPLDASHPHGERVRELLRTLI -------------------1111----%%%%----------------------------- AHPGHESSQFHSIRDIITLYTGYDVGTEGSISMQEGFLATVRECASFDLDAFERELLSWQ -3333-------------1111--------------------3333-------------- ESLQKCHNLSISPQAIPFILRMLRIFH -----------1111-------3333- >HYPOXANTHINE-GUANINE PHOS; SWP:P20035; PDB:1CJBA; PIPNNPGAGENAFDPVFVNDDDGYDLDSFMIPAHYKKYLTKVLVPNGVIKNRIEKLAYDI ----2222----------1111--1111---11111111--------------------- KKVYNNEEFHILCLLKGSRGFFTALLKHLSRIHNYSAVETSKPLFGEHYVRVKSYCNDQS ---------------1111------------------1111--------------!!!!- 
TGTLEIVSEDLSCLKGKHVLIVEDIIDTGKTLVKFCEYLKKFEIKTVAIACLFIKRTPLW -------------2222---------------------------------------1111 NGFKADFVGFSIPDHFVVGYSLDYNEIFRDLDHCCLVNDEGKKKYKAT ------------------iiii-iiii1111----------------- >ADRENODOXIN REDUCTASE; SWP:P08165; PDB:1CJCA; TPQICVVGSGPAGFYTAQHLLKHHSRAHVDIYEKQLVPFGLVRFGVAPDHPEVKNVINTF ---------------------------------------3333---11113333------ TQTARSDRCAFYGNVEVGRDVTVQELQDAYHAVVLSYGAEDHQALDIPGEELPGVFSARA -----1111------2222---------------------------2222---------- FVGWYNGLPENRELAPDLSCDTAVILGQGNVALDVARILLTPPDHLEKTDITEAALGALR -------3333------------------------------33331111--------333 QSRVKTVWIVGRRGPLQVAFTIKELREMIQLPGTRPMLDPADFLGLQDRIKEAARPRKRL 3------------3333-------------2222----333322221111---------- MELLLRTATEKPGVEEAARRASASRAWGLRFFRSPQQVLPSPDGRRAAGIRLAVTRLEGI -------------------1111-----------------1111---------------! GEATRAVPTGDVEDLPCGLVLSSIGYKSRPIDPSVPFDPKLGVVPNMEGRVVDVPGLYCS !!!----------------------------1111-----------iiii2222------ GWVKRGPTGVITTTMTDSFLTGQILLQDLKAGHLPSGPRPGSAFIKALLDSRGVWPVSFS 3333---------------------------------------------1111------- DWEKLDAEEVSRGQASGKPREKLLDPQEMLRLLGH -------------1111-------------1111- >ARYL SULFOTRANSFERASE; SWP:P50224; PDB:1CJMA; SRPPLEYVKGVPLIKYFAEALGPLQSFQARPDDLLINTYPKSGTTWVSQILDMIYQRVPF -------iiii--3333-11111111---1111---------3333-------------1 LEVNDPGEPETLKDTPPPRLIKSHLPLALLPQTLLDQKVKVVYVARNPKDVAVSYYHFHR 111-2222--3333-----------3333-33331111---------------------- MEKAHPEPGTWDSFLEKFMAGEVSYGSWYQHVQEWWELSRTHPVLYLFYEDMKENPKREI -----------------1111-2222-----------3333------3333--------- QKILEFVGRSLGDWKTTFTVAQNERFDADYAEKMAGCSLSFRS -----------3333----------------1111-------- >SEROTONIN N-ACETYLTRANSFE; SWP:Q29495; PDB:1CJWA; HTLPANEFRCLTPEDAAGVFEIEREAFISVSGNCPLNLDEVQHFLTLCPELSLGWFVEGR -----------3333--------------------------------3333-----iiii LVAFIIGSLWDEERLTQESLALHRPRGHSAHLHALAVHRSFRQQGKGSVLLWRYLHHVGA ---------------3333----1111----------1111------------------- 
QPAVRRAVLMCEDALVPFYQRFGFHPAGPCAIVVGSLTFTEMHCSL -----------3333-3333-------------!!!!--------- >4-HYDROXYPHENYLPYRUVATE D; SWP:P80064; PDB:1CJXA; YENPMGLMGFEFIEFASPTPGTLEPIFEIMGFTKVATHRSKNVHLYRQGEINLILNNEPN --1111---------------------1111----------------!!!!--------- SIASYFAAEHGPSVCGMAFRVKDSQKAYNRALELGAQPIHIDTGPMELNLPAIKGIGGAP -------------------------------1111--------2222-------2222-- LYLIDRFGEGSSIYDIDFVYLEGVERNPVGAGLKVIDHLTHNVYRGRMVYWANFYEKLFN -----------3333-----2222----!!!!-----------2222------------- FREARYFDIKGEYTGLTSKAMSAPDGMIRIPLNEESSKGAGQIEEFLMQFNGEGIQHVAF ----------------------1111---------3333--------------------- LTDDLVKTWDALKKIGMRFMTAPPDTYYEMLEGRLPDHGEPVDQLQARGILLDGSSVEGD ----------------------------------------3333------------iiii KRLLLQIFSETLMGPVFFEFIQRKGDDGFGEGNFKALFESIERDQVRRGVLAT ------------!!!!------------------------------------- >IGG1-KAPPA ANTIBODY 131 (; SWP:NA; PDB:1CK0H; QVQLQESGGGLVQPRGSLKLSCAASGFTFNTDAMNWVRQAPGKGLEWVARIRSK ------------2222-----------3333----------------------- >ENTEROTOXIN TYPE C-3; SWP:Q06535; PDB:1CK1A; ESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNINDKKLNN ------1111--3333-------3333-----------------1111------------ YDKVKTELLNEDLANKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTSGKTCMYGGITKHEG ----------------1111---------------11112222--------------222 NHFDNGNLQNVLIRVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSP 2-1111----------%%%%----------------------------------3333-- YETGYIKFIESNGNTFWYDMMPAPGDKFDQSKYLMIYKDNKMVDSKSVKIEVHLTTKNG ---------1111---------------3333----1111---3333------------ >INTEGRIN ALPHA-1; SWP:P18614; PDB:1CK4A; TQLDIVIVLDGSNSIYPWESVIAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYS ----------------3333--------3333----------------------1111-- STEEVLVAANKIGRQGGLQTMTALGIDTARKEAFTEARGARRGVKKVMVIVTDGESHDNY 3333----1111----------------------3333--2222-----------1111- RLKQVIQDCEDENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELAL --------3333-------------1111----------1111--3333------3333- VTIVKALGERIFA ------------- >n/a; 
SWP:Q64010; PDB:1CKAA; AEYVRALFDFNGNDEEDLPFKKGDILRIRDKPEEQWWNAEDSEGKRGMIPVPYVEKY -------------1111---2222--------1111----1111-----3333---- >CYTIDINE MONOPHOSPHATE KI; SWP:P0A6I0; PDB:1CKEA; AIAPVITIDGPSGAGKGTLCKAMAEALQWHLLDSGAIYRVLALAALHHHVDVASEDALVP ----------2222----------1111----3333--------------11113333-- LASHLDVRFVSTNGNLEVILEGEDVSGEIRTQEVANAASQVAAFPRVREALLRRQRAFRE -1111------iiii----iiii--3333--------------------------1111- LPGLIADGRDMGTVVFPDAPVKIFLDASSEERAHRRMLQLQVKGFSVNFERLLAEIKLVP ---------1111--1111----------------------------------1111--- AADALVLDSTTLSIEQVIEKALQYARQKLALA 1111---1111--------------------- >CASEIN KINASE I DELTA; SWP:Q06486; PDB:1CKIA; MELRVGNRYRLGRKIGSGSFGDIYLGTDIAAGEEVAIKLECVKTKHPQLHIESKIYKMMQ -----------------------------------------------3333--------- GGVGIPTIRWCGAEGDYNVMVMELLGPSLEDLFNFCSRKFSLKTVLLLADQMISRIEYIH -2222--------!!!!----------------1111----------------------- SKNFIHRDVKPDNFLMGLGKKGNLVYIIDFGLAKKYRDARTHQHIPYRENKNLTGTARYA ---------3333-------1111-----1111----------------------3333- SINTHLGIEQSRRDDLESLGYVLMYFNLGSLPWQGLKYERISEKKMSTPIEVLCKGYPSE 3333------3333----------------1111--------------3333-2222--- FATYLNFCRSLRFDDKPDYSYLRQLFRNLFHRQGFSYDYVFDWNMLKFGASR -------33331111---------------1111------33333333---- >MRNA CAPPING ENZYME; SWP:Q84424; PDB:1CKMA; NITTERAVLTLNGLQIKLHKVVGESRDDIVAKMKDLAMDDHKFPRLPGPNPVSIERKDFE ----------iiii----------------------------------------3333-3 KLKQNKYVVSEKTDGIRFMMFFTRVFGFKVCTIIDRAMTVYLLPFKNIPRVLFQGSIFDG 333-------------------------------1111----------3333-------- ELCVDIVEKKFAFVLFDAVVVSGVTVSQMDLASRFFAMKRSLKEFKNVPEDPAILRYKEW -----1111-----------iiii-11113333--------1111--1111--------- IPLEHPTIIKDHLKKANAIYHTDGLIIMSVDEPVIYGRNFNLFKLKPGTHHTIDFIIMSE -1111-----------------------------------------------------11 DGTIGIFDPNLRKNVPVGKLDGYYNKGSIVECGFADGTWKYIQGRSDKNQANDRLTYEKT 11----------------------2222------iiii------1111------------ LLNIEENITIDELLDLF --------3333----- >HEAT SHOCK SUBSTRATE BIND; SWP:P08109; 
PDB:1CKRA; SENVQDLLLLDVTPLSLGIETAGGVMTVLIKRNTTIPTKQTQTFTTYSDNQPGVLIQVYE ------------------------------------------------------------ GERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITIT -----------------------2222---------------------1111-------- NDKGRLSKEDIERMVQEAEKYKAEDEKQRDKVSSKNSLE --------------------------------------- >CYCLIN-DEPENDENT KINASE S; SWP:P33552; PDB:1CKSA; AHKQIYYSDKYFDEHYEYRHVMLPRELSKQVPKTHLMSEEEWRRLGVQQSLGWVHYMIHE -----------------------33331111-----------3333-------------- PEPHILLFRRPLPK -------------- >HIGH MOBILITY GROUP 1 PRO; SWP:P07155; PDB:1CKTA; KPRGKMSSYAFFVQTCREEHKKKHPDASVNFSEFSKKCSERWKTMSAKEKGKFEDMAKAD --------------------1111----------------3333-11113333------- KARYEREMKTY ------1111- >PROTEIN B; SWP:NA; PDB:1CKV; MSVNSNAYDAGIMGLKGKDFADQFFADENQVVHESDTVVLVLKKSDEINTFIEEILLTDY -------3333------------------------------------------------- KKNVNPTVNVEDRAGYWWIKANGKIEVDCDEISELLGRQFNVYDFLVDVSSTIGRAYTLG ------------------------------3333--2222-------------------- NKFTITSELMGLDRKLEDYHA -------%%%%---------- >POLYOMAVIRUS ENHANCER BIN; SWP:Q13951; PDB:1CL3A; VVPDQRSKFENEEFFRKLSRECEIKYTGFRDRPHEERQTRFQNACRDGRSEIAFVATGTN ----------------------------1111---------------------3333--- LSLQFFPASWQGEQRQTPSREYVDLEREAGKVYLKAPMILNGVCVIWKGWIDLHRLDGMG ------------------3333--3333-------------------------------- CLEFDEERAQQEDALAQQ -----3333--------- >GAG POLYPROTEIN; SWP:P07567; PDB:1CL4A; VPGLCPRCKRGKHWANECKSKTDNQGNPIPPH ----------------------------3333 >Ig gamma-1 chain C region; SWP:P01869; PDB:1CL7H; QIQLQQSGPELKKPGETVKISCKATNYAFTDYSMHWVKQAPGGDLKYVGWINTETDEPTF ---------------------------1111----------------------------- ADDFKGRFAFSLDTSTSTAFLQINNL -------------------------- >ENDONUCLEASE; SWP:P00642; PDB:1CL8A; SQGVIGIFGDYAKAHDLAVGEVSKLVKKALSNEYPQLSFRYRDSIKKTEINEALKKIDPD -!!!!----------------------------3333--------------------111 LGGTLFVSNSSIKPDGGIVEVKDDYGEWRVVLVAEAKHQGKDIINIRNGLLVGKRGDQDL 
1-----1111---1111-----1111--------------------------1111---- MAAGNAIERSHKNISEIANFMLSESHFPYVLFLEGSNFLTENISITRPDGRVVNLEYNSG ------------------1111------------1111--------3333-----1111- ILNRLDRLTAANYGMPINSNLCINKFVNHKDKSIMLQAASIYTQGDGREWDSKIMFEIMF ---3333----iiii----------------------------3333------------- DISTTSLRVLGRDLFEQLTSK ---------3333-3333--- >ENDOGLUCANASE CELD; SWP:P0C2S4; PDB:1CLC; IETKVSAAKITENYQFDSRIRLNSIGFIPNHSKKATIAANCSTFYVVKEDGTIVYTGTAT --1111----------1111-------2222----------------1111--------- SMFDNDTKETVYIADFSSVNEEGTYYLAVPGVGKSVNFKIAMNVYEDAFKTAMLGMYLLR ---------------3333---------2222--------1111------------1111 CGTSVSATYNGIHYSHGPCHTNDAYLDYINGQHTKKDSTKGWHDAGDYNKYVVNAGITVG --------iiii-------------1111---------------------3333------ SMFLAWEHFKDQLEPVALEIPEKNNSIPDFLDELKYEIDWILTMQYPDGSGRVAHKVSTR --------33331111---1111----3333--------3333--1111----------- NFGGFIMPENEHDERFFVPWSSAATADFVAMTAMAARIFRPYDPQYAEKCINAAKVSYEF ------1111-------------------------------------------------- LKNNPANVFANQSGFSTGEYATVSDADDRLWAAAEMWETLGDEEYLRDFENRAAQFSKKI -----------1111--------------------------------------------- EADFDWDNVANLGMFTYLLSERPGKNPALVQSIKDSLLSTADSIVRTSQNHGYGRTLGTT ----33333333------------------------------------------1111-- YYWGCNGTVVRQTMILQVANKISPNNDYVNAALDAISHVFGRNYYNRSYVTGLGINPPMN -2222-------------------------------------1111---2222------- PHDRRSGADGIWEPWPGYLVGGGWPGPKDWVDIQDSYQTNEIAINWNAALIYALAGFVNY --3333-------------------1111---3333------3333-------3333--- N - >CD2-LAC9; SWP:P08657; PDB:1CLD; QACDACRKKKWKCSKTVPTCTNCLKYNLDCVYS --------------------3333--------- >FERREDOXIN; SWP:P00195; PDB:1CLF; AYKIADSCVSCGACASECPVNAISQGDSIFVIDADTCIDCGNCANVCPVGAPVQE ----3333------3333--------------3333------33331111----- >PHOSPHORIBOSYL-AMINOIMIDA; SWP:P08178; PDB:1CLIA; TSLSYKDAGVDIDAGNALVGRIKGVVKKTRRPEVMGGLGGFGALCALPQKYREPVLVSGT --------------------------11113333-------------3333--------- DGVGTKLRLAMDLKRHDTIGIDLVAMCVNDLVVQGAEPLFFLDYYATGKLDVDTASAVIS 
----------3333-----------------1111------------------------- GIAEGCLQSGCSLVGGETAEMPGMYHGEDYDVAGFCVGVVEKSEIIDGSKVSDGDVLIAL --------------------1111-!!!!-----------1111---11112222----- GSSGPHSNGYSLVRKILEVSGCDPQTTELDGKPLADHLLAPTRIYVKSVLELIEKVDVHA --------3333----------3333------3333------------------------ IAHLTGGGFWENIPRVLPDNTQAVIDESSWQWPEVFNWLQTAGNVEHHEMYRTFNCGVGM -------------1111--------1111-----------1111-3333-----iiii-- IIALPAPEVDKALALLNANGENAWKIGIIKASDSEQRVVIE ----1111--------------------------------- >A5B7 MONOCLONAL ANTIBODY; SWP:GC1_MOUSE; PDB:1CLOH; EVKLVESGGGLVQPGGSLRLSCATSGFTFTDYYMNWVRQPPGKALEWLGFIGNKA ------------2222-----------1111--------2222------------ >Alpha-amylase inhibitor A; SWP:P80403; PDB:1CLVI; CIPKWNRCGPKMDGVPCCEPYTCTSDYYGNCS --2222--3333-------------------- >IGG FAB (HUMAN IGG1, KAPP; SWP:NA; PDB:1CLYH; EVNLVESGGGLVQPGGSLKVSCVTSGFTFSDYYMYWVRQTPEKRLEWVAYISQGGDITDY ------------2222-----------3333--------1111----------------- PDTVKGRFTISRDNAKNSLYLQMSRL 3333---------1111--------- >Ig gamma-3 chain C region; SWP:GC3_MOUSE; PDB:1CLZH; EVNLVESGGGLVQPGGSLKVSCVTSGFTFSDYYMYWVRQTPEKRLEWVAYISQGGDITDY ------------2222-----------1111--------1111----------------- PDTVKGRFTISRDNAKNSLYLQMSRL 1111---------------------- >Histone acetyltransferase; SWP:Q92831; PDB:1CM0B; KVIEFHVVGNSLNQKPNKKILMWLVGLQNVFSHQLPRMPKEYITRLVFDPKHKTLALIKD ----------------------------------1111-------1111---------ii GRVIGGICFRMFPSQGFTEIVFCAVTSNEQVKGYGTHLMNHLKEYHIKHDILNFLTYADE ii-----------------------1111-----------------1111--------11 YAIGYFKKQGFSKEIKIPKTKYVGYIKDYEGATLMGCELNPR 11----1111-------3333-------2222---------- >3-ISOPROPYLMALATE DEHYDRO; SWP:P30125; PDB:1CM7A; MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMRITTSHYDVGGAAIDNHGQPLPPA ------------!!!!-----------------------------------------333 TVEGCEQADAVLFGSVGGPKWEHLPPDQQPERGALLPLRKHFKLFSNLRPAKLYQGLEAF 3---1111---------------------3333-----------------------1111 CPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEKAFDTEVYHRFEIERIARIAF 
---33333333------------------------1111--------------------- ESARKRRHKVTSIDKANVLQSSILWREIVNEIATEYPDVELAHMYIDNATMQLIKDPSQF -------------------------------33331111-----------3333-1111- DVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIA ------------------3333-1111-----1111-----------1111--------- QILSLALLLRYSLDADDAACAIERAINRALEEGIRTGDLARGAAAVSTDEMGDIIARYVA -----------------------------1111--------------------------- EGV --- >PHOSPHORYLATED MAP KINASE; SWP:P53778; PDB:1CM8A; RSGFYRQEVTKTAWEVRAVYRDLQPVAVCSAVDGRTGAKVAIKKLYRPFQSELFAKRAYR ----------------3333---------------------------------------- ELRLLKHMRHENVIGLLDVFTPDETLDDFTDFYLVMPFMGTDLGKLMKHEKLGEDRIQFL ---------1111-----------3333-------------------------------- VYQMLKGLRYIHAAGIIHRDLKPGNLAVNEDCELKILDFGLARQADSEMGVVTRWYRAPE ---------------------3333---1111------1111-----------1111333 VILNWMRYTQTVDIWSVGCIMAEMITGKTLFKGSDHLDQLKEIMKVTGTPPAEFVQRLQS 3--2222---------------------------3333------------33331111-- DEAKNYMKGLPELEKKDFASILTNASPLAVNLLEKMLVLDAEQRVTAGEALAHPYFESLH ----------------3333-------------------3333---------3333---- QVQKYDDSRTLDEWKRVTYKEVLSFKP ---------3333-------------- >MET REPRESSOR; SWP:P0A8U6; PDB:1CMCA; AEWSGEYISPYAEHGKKSEQVKKITVSIPLKVLKILTDERTRRQVNNLRHATNSELLCEA -------------------------------------------1111------------- FLHAFTGQPLPDDADLRKERSDEIPEAAKEIMREMGINPETWEY -----------3333-3333-----------------3333--- >NEURONAL NITRIC OXIDE SYN; SWP:Q15701; PDB:1CMIA; KAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKKEFDKKYNPTWHCIVGRNFGS -----------------------------3333--------------------------- YVTHETKHFIYFYLGQVAILLFKSG -------------!!!!-------- >CHARYBDOTOXIN, ALPHA CHIM; SWP:P13487; PDB:1CMR; CTTSKECWSVCQRLHNTSKGWCDHRGCICES --1111------------------------- >PROCHYMOSIN A/B PRECURSOR; SWP:P00794; PDB:1CMS; GEVASVPLTNYLDSQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYCKSNACKNHQRFDP -------------------------------------------1111-3333------33 RKSSTFQNLGKPLSIHYGTGSMQGILGYDTVTVSNIVDIQQTVGLSTQEPGDVFTYAEFD 
33--------------!!!!------------------------------3333------ GILGMAYPSLASEYSIPVFDNMMNRHLVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSL ------3333-2222----------------------3333----------1111----- HWVPVTVQQYWQFTVDSVTISGVVVACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQ --------------------------2222-----3333--------------1111--- NQYGEFDIDCDNLSYMPTVVFEINGKMYPLTPSAYTSQDQGFCTSGFQSENHSQKWILGD ------------1111------%%%%----3333-------------------------- VFIREYYSVFDRANNLVGLAKAI --1111----------------- >HUMAN CYTOMEGALOVIRUS PRO; SWP:P16753; PDB:1CMVA; APVYVGGFLARYDQSPDLPRDVVEHWALPLNINHDDTAVVGHVAAMQSVRDGLFCLGCVT ----------1111----33333333-----iiii-------------1111-------- SPRFLEIVRRASEKSELVSRGPVSPLQPDKVVEFLSGSYAGLSLSPFKHVALCSVGRRRG ---------------3333----------------------------------------- TLAVYGRDPEWVTQRFPDLTAADRDGLRAQWGDPFRSDSYGLLGNSVDALYIRERLPKLR -----------11111111--------------------------------2222----- YDKQLVGVTERESYVKA --------3333----- >UBIQUITIN YUH1-UBAL; SWP:P35127; PDB:1CMXA; RAVVPIESNPEVFTNFAHKLGLKNEWAYFDIYSLTEPELLAFLPRPVKAIVLLFPINDVI ----------------------1111----------3333-------------------- WFKQSVKNACGLYAILHSLSNNQSLLEPGSDLDNFLKSQSDTSSSKNRFDDVTTDQFVLN -----2222------------3333-2222------------------------------ VIKENVQTFSTGQSEAPEATADTNLHYITYVEENGGIFELDGRNLSGPLYLGKSDPTATD ----3333---------3333-------------------1111---------------3 LIEQELVRVRVASYMENANEEDVLNFAMLGLGPN 3333333--------------------------- >GAIP (G-ALPHA INTERACTING; SWP:P49795; PDB:1CMZA; PSPEEVQSWAQSFDKLMHSPAGRSVFRAFLRTEYSEENMLFWLACEELKAEANQHVVDEK -------1111-3333-------------------3333-------3333----333333 ARLIYEDYVSILSPKEVSLDSRVREGINKKMQEPSAHTFDDAQLQIYTLMHRDSYPRFLS 33----------------------------------1111----------------3333 SPTYRALL ---3333- ------------------------------------------------------------ ------ >Capsid protein VP2; SWP:P12908; PDB:1CN3F; GGGGGGGGAASHQRVTPDWMLPLILGLYG ----------------3333----1111- >NITRATE REDUCTASE; SWP:P17571; PDB:1CNE; GRIHCRLVAKKELSRDVRLFRFSLPSPDQVLGLPIGKHIFVCATIEGKLCMRAYTPTSMV 
----------------------------------------------------------33 DEIGHFDLLVKVYFKNEHPKFPNGGLMTQYLDSLPVGSYIDVKGPLGHVEYTGRGSFVIN 33------------------3333-3333-----2222---------------------- GKQRNARRLAMICGGSGITPMYQIIQAVLRDQPEDHTEMHLVYANRTEDDILLRDELDRW -----------------3333-------------------------1111---3333--- AAEYPDRLKVWYVIDQVKRPEEGWKYSVGFVTEAVLREHVPEGGDDTLALASGPPPMIQF ------------------1111---------3333------------------3333--- AISPNLEKMKYDMANSFVVF -------------------- >CYTOCHROME C552; SWP:P82903; PDB:1CNOA; AGDIEAGKAKAAVCAACHGQNGISQVPIYPNLAGQKEQYLVAALKAYKAGQRQGGQAPVM ------------------1111---1111--2222-----------1111---!!!!--- QGQATALSDADIANLAAYYASNPAAA ---1111---------------1111 >Ciliary neurotrophic fact; SWP:P26441; PDB:1CNT1; PHRRDLCSRSIWLARKIRSDLTALTESYVKHQGLWSELTEAERLQENLQAYRTFHVLLAR ------------------------------------------------------------ LLEDQQVHFTPTEGDFHQAIHTLLLQVAAFAYQIEELMILLEYKIPRNEADGMLFEKKLW ----1111---------------------------------------------------- GLKVLQELSQWTVRSIHDLRFISSHQTGIP ----------------------1111---- >ACTOPHORIN; SWP:P37167; PDB:1CNUA; GIAVSDDCVQKFNELKLGHQHRYVTFKMNASNTEVVVEHVGGPNATYEDFKSQLPERDCR ----------------------------1111---------111133333333------- YAIFDYEFQVDGGQRNKITFILWAPDSAPIKSKMMYTSTKDSIKKKLVGIQVEVQATDAA ---------iiii-----------1111-----------------------------333 EISEDAVSERAKKD 3------------- >CONCANAVALIN B; SWP:P49347; PDB:1CNV; DISSTEIAVYWGQREDGLLRDTCKTNNYKIVFISFLDKFGCEIRKPELELEGVCGPSVGN 3333--------1111------3333-------------1111------2222------- PCSFLESQIKECQRMGVKVFLALGGPKGTYSACSADYAKDLAEYLHTYFLSERREGPLGK ------------1111---------------------------------------1111- VALDGIHFDIQKPVDELNWDNLLEELYQIKDVYQSTFLLSAAPGCLSPDEYLDNAIQTRH ---------------------------------------------------3333----- FDYIFVRFYNDRSCQYSTGNIQRIRNAWLSWTKSVYPRDKNLFLELPASQATAPGGGYIP ----------1111--2222----------------------------33331111---- PSALIGQVLPYLPDLQTRYAGIALWNRQADKETGYSTNIIRYL -------33332222------------------------1111 >3-ISOPROPYLMALATE DEHYDRO; 
SWP:P37412; PDB:1CNZA; MSKNYHIAVLPGDGIGPEVMAQALKVMDAVRSRFDMRITTSHYDVGGIAIDNHGHPLPKA ------------!!!!-----------------------------------------333 TVEGCEQADAILFGSVGGPKWENLPPESQPERGALLPLRKHFKLFSNLRPAKLYQGLEAF 3----------------1111---3333-------------------------2222111 CPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEKAFDTEVYHRFEIERIARIAF 1------3333-----------3333---------1111--------3333--------- ESARKRRRKVTSIDKANVLQSSILWREIVNDVAKTYPDVELAHMYIDNATMQLIKDPSQF --1111--------3333-----------------1111-----------3333-3333- DVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIA ------------------3333----------1111----------3333---------- QILSLALLLRYSLDANDAATAIEQAINRALEEGVRTGDLARGAAAVSTDEMGDIIARYVA -----------------------------------3333--2222--------------- EGV --- >ACTIVATOR OF METALLOTHION; SWP:P14772; PDB:1CO4A; MVVINGVKYACDSCIKSHKAAQCEHNDRPLKILKPRGRPPTT ----------1111---3333--------------------- >CYTOCHROME C2; SWP:P00083; PDB:1CO6A; QDAASGEQVFKQCLVCHSIGPGAKNKVGPVLNGLFGRHSGTIEGFAYSDANKNSGITWTE -3333--------------2222----------2222----2222--3333--------- EVFREYIRDPKAKIPGTKMIFAGVKDEQKVSDLIAYIKQFNADGSKK --------3333-2222-----------------------1111--- >Subtilisin-chymotrypsin i; SWP:P01053; PDB:1COAI; MKTEWPELVGKSVEEAKKVILQDKPEAQIIVLPVGTIVTMEYRIDRVRLFVDKLDNVAEV ----1111---------------1111-----2222------1111-----1111----- PRVG ---- >COIL-VALD; SWP:NA; PDB:1COI; EVEALEKKVAALESKVQALEKKVEALEHG ----------------------3333--- >SUPEROXIDE DISMUTASE; SWP:Q9X6W9; PDB:1COJA; VHKLEPKDHLKPQNLEGISNEQIEPHFEAHYKGYVAKYNEIQEKLADQNFADRSKANQNY ------3333--------3333-----------------------------3333-3333 SEYRELKVEETFNYMGVVLHELYFGMLTPGGKGEPSEALKKKIEEDIGGLDACTNELKAA ----------------------3333-2222----------------------------- AMAFRGWAILGLDIFSGRLVVNGLDAHNVYNLTGLIPLIVIDTYEHAYYVDYKNKRPPYI -------------------------------2222--------3333----!!!!----- DAFFKNINWDVVNERFEKAMKAYEALKDFIK ------------------------------- >RNA POLYMERASE ALPHA SUBU; SWP:P0A7Z4; PDB:1COO; 
FDPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLA -3333--3333--------------------------3333------------------- SRGLSLGMRLENWPPASIADE ----2222------------- ------------------------------------------------------------ ------ >CYTOCHROME C551; SWP:P00101; PDB:1COR; DGEALFKSKPCAACHSIDAKLVGPAFKEVAAKYAGQDGAADLLAGHIKNGSQGVWGPIPM ---3333--3333----------------------2222--------------------- PPNPVTEEEAKILAEWILSQK ------3333----------- >COILED SERINE; SWP:NA; PDB:1COSA; EWEALEKKLAALESKLQALEKKLEALEHG ----------------------3333--- >CYTOCHROME C2; SWP:P00096; PDB:1COT; DGDAAKGEKEFNKCKACHMIQAPDGTDIIKGGKTGPNLYGVVGRKIASEEGFKYGEGILE --------------1111---1111------------2222-------2222-------- VAEKNPDLTWTEADLIEYVTDPKPWLVKMTDDKGAKTKMTFKMGKNQADVVAFLAQNSPD ----1111-----------------------1111----------------------111 A 1 >NEMATODE ANTICOAGULANT PR; SWP:Q16938; PDB:1COUA; KATMQCGENEKYDSCGSKECDKKCKYDGVEEEDDEEPNVPCLVRVCHQDCVCEEGFYRNK ----------------------------3333-----3333------------------- DDKCVSAEDCELDNMDFIYPGTRNP -----3333---3333--------- >GLYCEROL-3-PHOSPHATE CYTI; SWP:P27623; PDB:1COZA; MKKVITYGTFDLLHWGHIKLLERAKQLGDYLVVAISTDEFNLQKQKKAYHSYEHRKLILE ------------------------------------------------------------ TIRYVDEVIPEKNWEQKKQDIIDHNIDVFVMGDDWEGKFDFLKDQCEVVYLPRTEGISTT -3333----------------1111------1111---1111------------------ KIKEEI --1111 >NITROGENASE IRON PROTEIN; SWP:P00456; PDB:1CP2A; MRQVAIYGKGGIGKSTTTQNLTSGLHAMGKTIMVVGCDPKADSTRLLLGGLAQKSVLDTL -------------------------1111--------1111--1111------------- REEGEDVELDSILKEGYGGIRCVESGGPEPGVGCAGRGIITSINMLEQLGAYTDDLDYVF --!!!!-3333----2222---------2222-3333---------1111---------- YDVLGDVVCGGFAMPIREGKAQEIYIVASGEMMALYAANNISKGIQKYAKSGGVRLGGII ---------------1111-----------3333-----------1111----------- CNSRKVANEYELLDAFAKELGSQLIHFVPRSPMVTKAEINKQTVIEYDPTCEQAEEYREL --------3333------------------------------3333-1111--------- ARKVDANELFVIPKPMTQERLEEILMQYG ---1111---------------------- >PENICILLIN AMIDOHYDROLASE; SWP:Q7WZI9; 
PDB:1CP9A; STQIKIERDNYGVPHIYANDTYSLFYGYGYAVAQDRLFQMEMAKRSTQGTVSEVFGKDYI --------1111--------------------------------------3333-3333- SFDKEIRNNYWPDSIHKQINQLPSQEQDILRGYADGMNAWIKQINTKPDDLMPKQFIDYD ------11113333----11113333--------------------3333--3333---- FLPSQWTSFDVAMIMVGTLANRFSDMNSEIDNLALLTALKDKYGEQLGVEFFNQINWLNN -----------------------------------------------------------1 PNAPTTISSEEFTYSD 111----3333----- >Penicillin G acylase [Pre; SWP:P06875; PDB:1CP9B; SNVWLVGKTKASGAKAILLNGPQFGWFNPAYTYGIGLHGAGFNIVGNTPFAYPAILFGHN --------------------------------------iiii------%%%%-------- GHVSWGSTAGFGDGVDIFAEQVSPEDPNSYLHQGQWKKMLSRQETLNVKGEQPITFEIYR -------------------------1111--iiii------------2222--------- TVHGNVVKRDKTTHTAYSKARAWDGKELTSLMAWVKQGQAQNWQQWLDQAQNQALTINWY 1111------1111------1111----------3333----------3333-------- YADKDGNIGYVHTGHYPDRQINHDPRLPVSGTGEWDWKGIQPFANNPKVYNPKSGYIANW --1111-------------22221111-----1111-----3333------3333----- NNSPAKNYPASDLFAFLWGSADRVKEIDNRIEAYDKLTADDMWAILQQTSRVDLNHRLFT ----2222----3333--------------3333------------------1111---- PFLTQATQGLPSNDNSVKLVSMLQQWDGINQLSSDGKHYIHPGSAILDIWLKEMLKATLG ------22221111--------1111------1111---------------------333 QTVPAPFDKWYLASGYETTQEGPTGSLNISTGAKLLYESLLEDKSPISQSIDLFSGQPQN 3-----------------1111------------------!!!!-------1111--333 DVIRKTLNTTYQKMIEKYGDNPANWQTPATALTFRENNFFGIPQALPQENFHQNEYHNRG 3-------------------3333-------------1111----2222----------- TENDLIVFTEEGVSAWDVVAPGQSGFISPQGKPSPHYQDQLSLYQQFGKKPLWLNSEDVA --------3333---------------3333--1111-----------------333333 PYIESTETLIIER 33----------- >C-PHYCOCYANIN (BETA SUBU; SWP:P07122; PDB:1CPCA; MKTPLTEAVAAADSQGRFLSSTEIQTAFGRFRQASASLAAAKALTEKASSLASGAANAVY ------------1111-------------------------------------------- SKFPYTTSQNGPNFASTQTGKDKCVRDIGYYLRMVTYCLVVGGTGPLDDYLIGGIAEINR ----1111--1111---------------------------------------------- TFDLSPSWYVEALKYIKANHGLSGDPAVEANSYIDYAINALS ----3333-------1111-------------------1111 >C-phycocyanin-1 beta chai; 
SWP:P07119; PDB:1CPCB; MLDAFAKVVSQADARGEYLSGSQIDALSALVADGNKRMDVVNRITGNSSTIVANAARSLF --3333------1111-------------------------------------------- AEQPQLIAPGGNAYTSRRMAACLRDMEIILRYVTYAIFAGDASVLDDRCLNGLKETYLAL ---33332222--------------------------------------2222------- GTPGSSVAVGVQKMKDAALAIAGDTNGITRGDCASLMAEVASYFDKAASAVA --------------------1111------------------------1111 >CYTOCHROME C'; SWP:P00147; PDB:1CPQ; ADTKEVLEAREAYFKSLGGSMKAMTGVAKAFDAEAAKVEAAKLEKILATDVAPLFPAGTS --------------------------3333---------------1111-3333-22223 STDLPGQTEAKAAIWANMDDFGAKGKAMHEAGGAVIAAANAGDGAAFGAALQKLGGTCKA 3332222----3333--------------------------------------------- CHDDYREED --------- >CYTOCHROME P450-TERP; SWP:P33006; PDB:1CPT; MDARATIPEHIARTVILPQGYADDEVIYPAFKWLRDEQPLAMAHIEGYDPMWIATKHADV -----------------3333-1111------------------2222------------ MQIGKQPGLFSNAEGSEILYDQNNEAFMRSISGGCPHVIDSLTSMDPPTHTAYRGLTLNW -----3333--------------------1111-------1111------------3333 FQPASIRKLEENIRRIAQASVQRLLDFDGECDFMTDCALYYPLHVVMTALGVPEDDEPLM -3333----------------------------1111---------------3333---- LKLTQDFFGVEAARRFHETIATFYDYFNGFTVDRRSCPKDDVMSLLANSKLDGNYIDDKY --------------------------------------------------%%%%------ INAYYVAIATAGHDTTSSSSGGAIIGLSRNPEQLALAKSDPALIPRLVDEAVRWTAPVKS ---------------------------------------3333----------------- FMRTALADTEVRGQNIKRGDRIMLSYPSANRDEEVFSNPDEFDITRFPNRHLGFGWGAHM ----------%%%%--2222-----3333--3333--1111-1111-----1111-1111 CLGQHLAKLEMKIFFEELLPKLKSVELSGPPRLVATNFVGGPKNVPIRFTKA 1111-------------3333------------------------------- >SERINE CARBOXYPEPTIDASE; SWP:P00729; PDB:1CPY; KIKDPKILGIDPNVTQYTGYLDVEDEDKHFFFWTFESRNDPAKDPVILWLNGGPGCSSLT -----------------------1111------------3333----------------- GLFFALGPSSIGPDLKPIGNPYSWNSNATVIFLDQPVNVGFSYSGSSGVSNTVAAGKDVY ----------------------1111---------2222-----------33333333-- NFLELFFDQFPEYVNKGQDFHIAGASYAGHYIPVFASEILSHKDRNFNLTSVLIGNGLTD -----------------------------------------------------------3 
PLTQYNYYEPMACGEGGEPSVLPSEECSAMEDSLERCLGLIESCYDSQSVWSCVPATIYC 333-----3333------------------------------------3333-------- NNAQLAPYQRTGRNVYDIRKDCEGGNLCYPTLQDIDDYLNQDYVKEAVGAEVDHYESCNF -----3333----1111-----------3333----3333--3333-------------- DINRNFLFAGDWMKPYHTAVTDLLNQDLPILVYAGDKDFICNWLGNKAWTDVLPWKYDEE ----------11113333------------------------------------1111-- FASQKVRNWTASITDEVAGEVKSYKHFTYLRVFNGGHMVPFDVPENALSMVNEWIHGGFS 1111-------------------!!!!----------1111------------------- L - >COPZ; SWP:Q47840; PDB:1CPZA; AQEFSVKGMSCNHCVARIEEAVGRISGVKKVKVQLKKEKAVVKFDEANVQATEICQAINE --------------------------------------------1111------------ LGYQAEVI -------- >VIRAL CHEMOKINE INHIBITOR; SWP:O73568; PDB:1CQ3A; SFSSSSSCTEEENKHHMGIDVIIKVTKQDQTPTNDKICQSVTEVTESEDESEEVVKGDPT --3333---------------------1111-------------22221111-------- TYYTVVGGGLTMDFGFTKCPKISSISEYSDGNTVNARLSSVSPGQGKDSPAITREEALSM ------iiii-------------------!!!!--------------------------- IKDCEMSINIKCSEEEKDSNIKTHPVLGSNISHKKVSYEDIIGSTIVDTKCVKNLEISVR -----------------------------------------------3333--------- IGDMCKESSELEVKDGFKYVDGSASEDAADDTSLINSAKLIACV --------1111-------iiii------------3333----- >SERINE PROTEINASE INHIBIT; SWP:P01053; PDB:1CQ4A; KTEWPELVGKSVEEAKKVILQDKPEAQIIVLPVGTIV ---3333---------------1111-----2222-- >PROFILIN; SWP:P25816; PDB:1CQA; SWQTYVDEHLMLAASAIVGHDGSVWAQSSSFPQFKPQEITGIMKDFEEPGHLAPTGLHLG ------------------1111-----1111---3333----3333-22223333---ii GIKYMVIQGEAGAVIRGKKGSGGITIKKTGQALVFGIYEEPVTPGQCNMVVERLGDYLID ii----------------!!!!------------------------------------11 QGL 11- >PROTEASE II; SWP:P82474; PDB:1CQDA; LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTAN -------1111-------!!!!-3333-----------------------------1111 HGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNEQS !!!!------------------3333----------3333-------------------- LQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIVKNS ----1111---------3333---------------------------%%%%-------- 
WGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK -1111-iiii--------1111%%%%---------- >THIOREDOXIN; SWP:P10599; PDB:1CQGA; MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPAKMIKPFFHSLSEKYSNVIFLEVDVD -----------------!!!!-------33333333-------3333-1111-------- DAQDVASEAEVKATPTFQFFKKGQKVGEFSGANKEKLEATINELV ------1111----------iiii--------3333--------- >CH3 DOMAIN OF MAK33 ANTIB; SWP:P01869; PDB:1CQKA; PAAPQVYTIPPPLEQMAKDLVSLTCMITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDG -----------1111-------------------------iiii------------1111 SYFVYSKLNVQKSNWEAGNTFTCSVLHEGLHNHHTEKSLSH ----------3333------------1111%%%%------- >RIBOSOMAL PROTEIN S6; SWP:P23370; PDB:1CQMA; MRRYEVNIVLNPNLDQSQLALEKEIIQRALENYGARVEKVAILGLRRLAYPIAKDPQGYF ----------1111----------------1111-----------------iiii----- LWYQVEMPEDRVNDLARELRIRDNVRRVMVVKSQEPFL -------1111----------1111------------- >TYPE 2 RHINOVIRUS 3C PROT; SWP:P04936; PDB:1CQQA; GPEEEFGMSLIKHNSCVITTENGKFTGLGVYDRFVVVPTHADPGKEIQVDGITTKVIDSY -------------------1111--------------1111-------iiii-------- DLYNKNGIKLEITVLKLDRNEKFRDIRRYIPNNEDDYPNCNLALLANQPEPTIINVGDVV ---1111------------------3333------------------------------- SYGNILLSGNQTARMLKYSYPTKSGYCGGVLYKIGQVLGIHVGGNGRDGFSAMLLRSYFT ------iiii------------2222------2222------------------3333-- >POU DOMAIN, CLASS 2, TRAN; SWP:P14859; PDB:1CQTA; EPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMC ---3333-3333---33331111-3333----1111------------1111---3333- KLKPLLEKWLNDAERKKRTSIETNIRVALEKSFLENQKPTSEEITMIADQLNMEKEVIRV ------------------------------------------------------------ WFCNRRQKEKRINP -------------- >FLAVOHEMOPROTEIN; SWP:P39662; PDB:1CQXA; MLTQKTKDIVKATAPVLAEHGYDIIKCFYQRMFEAHPELKNVFNMAHQEQGQQQQALARA -----------------------------------3333--------------------- VYAYAENIEDPNSLMAVLKNIANKHASLGVKPEQYPIVGEHLLAAIKEVLGNAATDDIIS ----1111----------------------3333----------------3333------ AWAQAYGNLADVLMGMESELYERSAEQPGGWKGWRTFVIREKRPESDVITSFILEPADGG --------------------------2222-------------------------1111- 
PVVNFEPGQYTSVAIDVPALGLQQIRQYSLSDMPNGRTYRISVKREGGGPQPPGYVSNLL -----------------1111--------------------------------------- HDHVNVGDQVKLAAPYGSFHIDVDAKTPIVLISGGVGLTPMVSMLKVALQAPPRQVVFVH ---------------------3333---------------------1111---------- GARNSAVHAMRDRLREAAKTYENLDLFVFYDQPLPEDVQGRDYDYPGLVDVKQIEKSILL ---------------------------------1111------------33333333--2 PDADYYICGPIPFMRMQHDALKNLGIHEARIHYEVFGPDLFAE 222------------------1111-3333------------- >DNA PRIMASE/HELICASE; SWP:P03692; PDB:1CR1A; MRERIREHLSSEESVGLLFSGCTGINDKTLGARGGEVIMVTSGSGMGKSTFVRQQALQWG -----------------------3333-----2222------2222-------------- TAMGKKVGLAMLEESVEETAEDLIGLHNRVRLRQSDSLKREIIENGKFDQWFDELFGNDT -------------------------1111-3333-------------------------- FHLYDSFAEAETDRLLAKLAYMRSGLGCDVIILDHISIRKMIDNLMTKLKGFAKSTGVVL -----------------------1111------------3333----------------- VVICHLKTDLRGSGALRQLSDTIIALERNQLVLVRILKCRFTGDTGIAGYMEYNKETGWL -------3333---------------------------3333------------------ EPSSY ----- >SEC18P (RESIDUES 22 - 210; SWP:P18759; PDB:1CR5A; TRHLKVSNCPNNSYALANVAAVSPNDFPNNIYIIIDNLFVFTTRHSNDIPPGTIGFNGNQ ----------3333--------1111---------------------------------- RTWGGWSLNQDVQAKAFDLFKYSGKQSYLGSIDIDISFRAVFDQDELAKQFVRCYESQIF ------2222-------3333---------------------------------2222-- SPTQYLIMEFQGHFFDLKIRNVQAIDLGDIEPTSAVATGIETKGILTKQTQINFFKGR ---------iiii------------1111---------3333----1111-------- >LOW DENSITY LIPOPROTEIN R; SWP:Q07954; PDB:1CR8A; PGGCHTDEFQCRLDGLCIPLRWRCDGDTDCMDSSDEKSCEGV ----2222----------3333----------3333------ >FAB ANTIBODY LIGHT CHAIN; SWP:NA; PDB:1CR9H; KVKLQQSGAELVRSGASVKLSCTASGFNIKDYYIQWVKQRPEQGLEWIGWIDPENGNSEY ------------2222------------1111-------2222----------------- APRFQGKATMTADTLSNTAYLQLSSL 3333---------------------- >Putative uncharacterized ; SWP:A0A5D9; PDB:1CR9L; DVVMTQTPLSLSVTIGQPASISCKSSQSLLDS -------------2222--------------- >CELLULAR RETINOL BINDING ; SWP:P02696; PDB:1CRB; PVDFNGYWKMLSNENFEEYLRALDVNVALRKIANLLKPDKEIVQDGDHMIIRTLSTFRNY 
--------------------1111--------1111--------!!!!------3333-- IMDFQVGKEFEEDLTGIDDRKCMTTVSWDGDKLQCVQKGEKEGRGWTQWIEGDELHLEMR ----2222--------------------!!!!------------------!!!!------ AEGVTCKQVFKKVH iiii---------- ------------------------------------------------------------ >CREATINE KINASE; SWP:P11009; PDB:1CRKA; TVHEKRKLFPPSADYPDLRKHNNCMAECLTPAIYAKLRDKLTPNGYSLDQCIQTGVDNPG ---------3333----1111---------------1111-1111-3333--3333---- HPFIKTVGMVAGDEESYEVFAEIFDPVIKARHNGYDPRTMKHHTDLDASKITHGQFDERY ------------3333---3333------------3333-------3333---------- VLSSRVRTGRSIRGLSLPPACSRAERREVENVVVTALAGLKGDLSGKYYSLTNMSERDQQ -----------2222---------------------1111-1111----3333------- QLIDDHFLFDKPVSPLLTCAGMARDWPDARGIWHNNDKTFLVWINEEDHTRVISMEKGGN -------------33331111--------------------------------------- MKRVFERFCRGLKEVERLIKERGWEFMWNERLGYVLTCPSNLGTGLRAGVHVKLPRLSKD -------------------1111--------------3333-------------3333-- PRFPKILENLRLQKRGTGGVDTAAVADVYDISNLDRMGRSEVELVQIVIDGVNYLVDCEK ------------------3333-------------------------------------- KLEKGQDIKVPPPLPQFGRK -------------------- >LIPASE; SWP:P20261; PDB:1CRL; APTATLANGDTITGLNAIINEAFLGIPFAEPPVGNLRFKDPVPYSGSLDGQKFTSYGPSC -----1111-----------------------!!!!-----------2222--------- MQQNPEGTYEENLPKAALDLVMQSKVFEAVSPSSEDCLTINVVRPPGTKAGANLPVMLWI ---1111-------------------------------------22222222-------- FGGGFEVGGTSTFPPAQMITKSIAMGKPIIHVSVNYRVSSWGFLAGDEIKAEGSANAGLK --%%%%---1111-3333----1111-------------1111----------------- DQRLGMQWVADNIAAFGGDPTKVTIFGESAGSMSVMCHILWNDGDNTYKGKPLFRAGIMQ ------------3333--1111------------------%%%%---iiii--------- SGAMVPSDAVDGIYGNEIFDLLASNAGCGSASDKLACLRGVSSDTLEDATNNTPGFLAYS --------1111---------------1111------1111--------1111-1111-! 
SLRLSYLPRPDGVNITDDMYALVREGKYANIPVIIGDQNDEGTFFGTSSLNVTTDAQARE !!!------------------------------------11113333------------- YFKQSFVHASDAEIDTLMTAYPGDITQGSPFDTGILNALTPQFKRISAVLGDLGFTLARR -----1111--------------3333-----!!!!---1111----------------- YFLNHYTGGTKYSFLSKQLSGLPVLGTFHSNDIVFQDYLLGSGSLIYNNAFIAFATDLDP ------------------2222-----22223333-----3333---------------- NTAGLLVKWPEYTSSSQSGNNLMMINALGLYTGKDNFRTAGYDALFSNPPSFFV -------------1111--------1111------------------3333--- >PROTEIN (SOLUBLE QUINOPRO; SWP:P13650; PDB:1CRUA; DVPLTPSQFAKAKSENFDKKVILSNLNKPHALLWGPDNQIWLTERATGKILRVNPESGSV ----------------------------------1111---------------------- KTVFQVPEIVNDADGQNGLLGFAFHPDFKNNPYIYISGTFKNPKSKELPNQTIIRRYTYN -----------1111-----------3333-----------1111--------------- KSTDTLEKPVDLLAGLPSSKDHQSGRLVIGPDQKIYYTIGDQGRNQLAYLFLPNQAQHTP 1111-------------------------1111-------iiii-!!!!-----1111-- TQQELNGKDYHTYMGKVLRLNLDGSIPKDNPSFNGVVSHIYTLGHRNPQGLAFTPNGKLL ----1111-1111-------1111--1111--iiii-----------------1111--- QSEQGPNSDDEINLIVKGGNYGWPNVAGYKDDSGYAYANYSAAANKSIKDLAQNGVKVAA ---------------2222-------------------3333----------iiii---- GVPVTKESEWTGKNFVPPLKTLYTVQDTYNYNDPTCGEMTYICWPTVAPSSAYVYKGGKK -----1111----------------1111---3333--3333------------------ AITGWENTLLVPSLKRGVIFRIKLDPTYSTTYDDAVPMFKSNNRYRDVIASPDGNVLYVL -2222--------1111-------1111----------------------3333------ TDTAGNVQKDDGSVTNTLENPGSLIKFT --------1111---------------- >TOLB PROTEIN; SWP:P19935; PDB:1CRZA; DSGVDSGRPIGVVPFQWAGPGAAPEDIGGIVAADLRNSGKFNPLDRARLPQQPGSAQEVQ -----------------------------------1111-----3333------3333-3 PAAWSALGIDAVVVGQVTPNPDGSYNVAYQLVDTGGAPGTVLAQNSYKVNKQWLRYAGHT 333-1111-----------1111-------------2222---------3333------- ASDEVFEKLTGIKGAFRTRIAYVVQTNGGQFPYELRVSDYDGYNQFVVHRSPQPLSPAWS --------------1111--------------------1111-----------------1 PDGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRHNGAPAFSPDGSKLAFALSKTGSL 111-------1111-----------------------------1111------------- 
NLYVDLASGQIRQVTDGRSNNTEPTWFPDSQNLAFTSDQAGRPQVYKVNINGGAPQRITW ----3333------------------1111-------1111-------1111-------- EGSQNQDADVSSDGKFVVSSNGGQQHIAKQDLATGGVQVLSSTFLDETPSLAPNGTVIYS ----------1111------iiii---------------------------1111----- SSQGGSVLNLVSTDGRFKARLPATDGQVKFPAWSPYL -----------1111---------------------- >CYSTATHIONINE GAMMA-SYNTH; SWP:P00935; PDB:1CS1A; RKQATIAVRSGLNDDEQYGCVVPPIHLSSTYNFTGFNEPRAHDYSRRGNPTRDVVQRALA --------2222-------------------------------3333------------- ELEGGAGAVLTNTGMSAIHLVTTVFLKPGDLLVAPHDCYGGSYRLFDSLAKRGCYRVLFV --------------------------2222----1111-----------1111------- DQGDEQALRAALAEKPKLVLVESPSNPLLRVVDIAKICHLAREVGAVSVVDNTFLSPALQ 1111-------------------------------------1111-------1111---- NPLALGADLVLHSCTYLNGHSDVVAGVVIAKDPDVVTELAWWANNIGVTGGAFDSYLLLR 3333-----------3333------------------------1111------------- GLRTLVPRMELAQRNAQAIVKYLQTQPLVKKLYHPSLPENQGHEIAARQQKGFGAMLSFE -------------------------1111----3333--2222----------------- LDGDEQTLRRFLGGLSLFTLAESLGGVESLISHAATMTHAGMAPEARAAAGISETLLRIS -----------1111-----------------3333--1111-----1111-1111---- TGIEDGEDLIADLENGFRAANKG ------------------1111- >AXONIN-1; SWP:P28685; PDB:1CS6A; RSYGPVFEEQPAHTLFPEGSAEEKVTLTCRARANPPATYRWKMNGTELKMGPDSRYRLVA ------------------------------------------iiii----1111------ GDLVISNPVKAKDAGSYQCVATNARGTVVSREASLRFGFLQEFSAEERDPVKITEGWGVM --------3333----------3333---------------------------------- FTCSPPPHYPALSYRWLLNEFPNFIPADGRRFVSQTTGNLYIAKTEASDLGNYSCFATSH ---------------------------------------------3333----------- IDFITKSVFSKFSQLSLAAEDARQYAPSIKAKFPADTYALTGQMVTLECFAFGNPVPQIK !!!!-----------------------------------2222----------------- WRKLDGSQTSKWLSSEPLLHIQNVDFEDEGTYECEAENIKGRDTYQGRIIIHAQPDWLDV ------------------------3333---------1111------------------- ITDTEADIGSDLRWSCVASGKPRPAVRWLRDGQPLASQNRIEVSGGELRFSKLVLEDSGM ------2222-------------------iiii----------!!!!------1111--- YQCVAENKHGTVYASAELTVQA ------1111------------ 
------------------------------------------------------------ ------------------------------------ >CATHEPSIN B; SWP:P07858; PDB:1CSBA; LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTN -----3333-11113333---------3333-----------1111- >CITRATE SYNTHASE; SWP:P23007; PDB:1CSC; ASSTNLKDVLAALIPKEQARIKTFRQQHGGTALGQITVDMSYGGMRGMKGLVYETSVLDP ----3333-------------------1111-----3333--%%%%-------------- DEGIRFRGFSIPECQKLLPKGGGGEPLPEGLFWLLVTGQIPTGAQVSWLSKEWAKRAALP ------------------------------------------------------------ SHVVTMLDNFPTNLHPMSQLSAAITALNSESNFARAYAEGILRTKYWEMVYESAMDLIAK ----------1111-----------3333------------3333--------------- LPCVAAKIYRNLYRAGSSIGAIDSKLDWSHNFTNMLGYTDAQFTELMRLYLTIHSDHEGG -------------2222-----1111-------------3333--------1111----- NVSAHTSHLVGSALSDPYLSFAAAMNGLAGPLHGLANQEVLGWLAQLQKAAGADASLRDY ----------1111---------------1111-------------3333---------- IWNTLNSGRVVPGYGHAVLRKTDPRYTCQREFALKHLPGDPMFKLVAQLYKIVPNVLLEQ ----1111----------------------------1111-------3333--------- GAAANPWPNVDAHSGVLLQYYGMTEMNYYTVLFGVSRALGVLAQLIWSRALGFPLERPKS -------------3333-------1111-------------------------------- MSTDGLIAL --------- >Eglin C; SWP:P01051; PDB:1CSEI; KSFPEVVGKTVDQAREYFTLHYPQYNVYFLPEGSPVTLDLRYNRVRVFYNPGTNVVNHVP --3333---------------1111-----2222------1111---------------- HVG --- >CITRATE SYNTHASE; SWP:P23007; PDB:1CSH; STNLKDVLASLIPKEQARIKTFRQQHGNTAVGQITVDMSYGGMRGMKGLIYETSVLDPDE --3333-------------------1111-----------%%%%---------------- GIRFRGFSIPECQKLLPKAGGGEEPLPEGLFWLLVTGQIPTPEQVSWVSKEWAKRAALPS ---iiii-----------2222------------------------------1111--33 HVVTMLDNFPTNLHPMSQLSAAITALNSESNFARAYAEGINRTKYWEFVYEDAMDLIAKL 33---11111111----------------------1111-3333---------------- PCVAAKIYRNLYRAGSSIGAIDSKLDWSHNFTNMLGYTDPQFTELMRLYLTIHSDHEGGN ------------2222-----1111-------------------------1111------ VSAHTSHLVGSALSDPYLSFAAAMNGLAGPLHGLANQEVLLWLSQLQKDLGADASDEKLR ---------1111------------1111------------------------------- 
DYIWNTLNSGRVVPGYGHAVLRKTDPRYTCQREFALKHLPSDPMFKLVAQLYKIVPNVLL ------1111--2222----------------------1111-------3333------- EQGKAKNPWPNVDAHSGVLLQYYGMTEMNYYTVLFGVSRALGVLAQLIWSRALGFPLERP ----------3333-33333333---1111------------------------------ KSMSTAGLEKLSAGG --------------- >C-SRC SH3 DOMAIN; SWP:P41240; PDB:1CSKA; GTECIAKYNFHGTAEQDLPFCKGDVLTIVAVTKDPNWYKAKNKVGREGIIPANYVQKR -------------1111---2222---------1111----1111-----3333---- >CASEIN KINASE-1; SWP:P40233; PDB:1CSN; NVVGVHYKVGRRIGEGSFGVIFEGTNLLNNQQVAIKFEPRRSDAPQLRDEYRTYKLLAGC ----------------------------------------1111----------1111-2 TGIPNVYYFGQEGLHNVLVIDLLGPSLEDLLDLCGRKFSVKTVAMAAKQMLARVQSIHEK 222--------!!!!----------33333333%%%%----------------------- SLVYRDIKPDNFLIGRPNSKNANMIYVVDFGMVKFYRDPVTKQHIPYREKKNLSGTARYM -------3333---------1111-----1111----------------------3333- SINTHLGREQSRRDDLEALGHVFMYFLRGSLPWQGLKAATNKQKYERIGEKKQSTPLREL ----------3333----------------1111------3333-----------3333- CAGFPEEFYKYMHYARNLAFDATPDYDYLQGLFSKVLERLNTTEDENFDWNLL 2222----------11111111-------------------------1111-- ----------------------- >YEAST HYPOTHETICAL PROTEI; SWP:P38197; PDB:1CT5A; TGITYDEDRKTQLIAQYESVREVVNAEAKNVKILLLVVSKLKPASDIQILYDHGVREFGE --------------------------------------22223333----3333------ NYVQELIEKAKLLPDDIKWHFIGGLQTNKCKDLAKVPNLYSVETIDSLKKAKKLNESRAK -------------1111--------1111------------------------------- FQPDCNPILCNVQINTSHEDQKSGLNNEAEIFEVIDFFLSEECKYIKLNGLTIGSWNRDF -1111----------------------------------3333----------------- ATLVEWKKKIDAKFGTSLKLSGSADFREAIRQGTAEVRIGTDIFG -----------------------------1111------3333-- >ASPARAGINE SYNTHETASE B; SWP:P22106; PDB:1CT9A; ASIFGVFDIKTDAVELRKKALELSRLMRHRGPDWSGIYASDNAILAHERLSIVDVNAGAQ -----------3333------------------------1111----------3333--- PLYNQQKTHVLAVNGEIYNHQALRAEYGDRYQFQTGSDCEVILALYQEKGPEFLDDLQGM ---1111---------1111----------------3333--------!!!!1111---- FAFALYDSEKDAYLIGRDHLGIIPLYMGYDEHGQLYVASEMKALVPVCRTIKEFPAGSYL 
---------------------------------------33333333------------- WSQDGEIRSYYHRDWFDYDAVKDNVTDKNELRQALEDSVKSHLMSDVPYGVLLSGGLDSS 3333--------333333331111---------------1111----------------- IISAITKKYALHSFAVGLPGSPDLKAAQEVANHLGTVHHEIHFTVQEGLDAIRDVIYHIE -----------------2222--------------------------------------- TYDVTTIRASTPMYLMSRKIKAMGIKMVLSGEGSDEVFGGYLYFHKAPNAKELHEETVRK --------------------------------3333----3333---------------- LLALHMYDCARANKAMSAWGVEARVPFLDKKFLDVAMRINPQDKMCKMEKHILRECFEAY --3333-3333-------------3333-3333-1111--1111------------3333 LPASVAWRQKEQFSDGVGYSWIDTLKEVAAQQVSDQQLETARFRFPYNTPTSKEAYLYRE --3333----11113333---------------------3333----------------- IFEELFPLPSAAECVPG -------33333333-- >TROPONIN C SITE III - SIT; SWP:P10246; PDB:1CTAA; KSEEELANAFRIFDKNADGYIDIEELGEILRATG -------------1111----------------- >RIBOSOMAL PROTEIN L7/L12; SWP:P0A7K2; PDB:1CTF; EFDVILKAAGANKVAVIKAVRGATGLGLKEAKDLVESAPAALKEGVSKDDAEALKKALEE --------!!!!----------------------1111---------------------- AGAEVEVK -------- >CYTOCHROME C6; SWP:Q09099; PDB:1CTJ; EADLALGKAVFDGNCAACHAGGGNNVIPDHTLQKAAIEQFLDGGFNIEAIVYQIENGKGA --------------33332222----1111----------2222-------------!!! 
MPAWDGRLDEDEIAGVAAYVYDQAAGNKW !--2222----------------1111-- >CYTIDINE DEAMINASE; SWP:P0ABF6; PDB:1CTT; MHPRFQTAFAQLADNLQSALEPILADKYFPALLTGEQVSSLKSATGLDEDALAFALLPLA -3333-3333--3333---------1111------------------------------- AACARTPLSNFNVGAIARGVSGTWYFGANMEFIGATMQQTVHAEQSAISHAWLSGEKALA 1111--------------1111---------22223333--------------------- AITVNYTPCGHCRQFMNELNSGLDLRIHLPGREAHALRDYLPDAFGPKDLEIKTLLMDEQ --------------------!!!!----2222---3333------3333-----2222-- DHGYALTGDALSQAAIAAANRSHMPYSKSPSGVALECKDGRIFSGSYAENAAFNPTLPPL ------------------1111--------------1111---------1111----333 QGALILLNLKGYDYPDIQRAVLAEKADAPLIQWDATSATLKALGCHSIDRVLLA 3-----------3333------------------------1111---------- ------------------------------------------------------------ ----------- >RUVA PROTEIN; SWP:P08576; PDB:1CUK; MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTHFVVREDAQLL --------------------iiii------33331111-2222----------------- YGFNNKQERTLFKELIKTNGVGPKLALAILSGMSAQQFVNAVEREEVGALVKLPGIGKKT -----------------1111--------------------------3333-2222---- AERLIVEMKDRFKGLHGDLFTPTDDAEQEAVARLVALGYKPQEASRMVSKIARPDASSET -----------1111-------------------3333-3333----3333--------- LIREALRAAL ---------- >ALPHA SPECTRIN; SWP:P07751; PDB:1CUNA; MVHQFFRDMDDEESWIKEKKLLVSSEDYGRDLTGVQNLRKKHKRLEAELAAHEPAIQSVL ------------------------------------------------------------ DTGKKLSDDNTIGKEEIQQRLAQFVDHWKELKQLAAARGQRLEESLEYQQFVANVEEEEA ------11112222---------------------------------------------- WINEKMTLVASEDYGDTLAAIQGLLKKHEAFETDFTVHKDRVNDVCANGEDLIKKNNHHV -------1111------------------------------------------------- ENITAKMKGLKGKVSDLEKAAAQRKAKLDENSA --------------------------------- >AZURIN ISO-2; SWP:P12335; PDB:1CUOA; ASCETTVTSGDTMTYSTRSISVPASCAEFTVNFEHKGHMPKTGMGHNWVLAKSADVGDVA ----------------------1111-------------------------3333----- KEGAHAGADNNFVTPGDKRVIAFTPIIGGGEKTSVKFKVSALSKDEAYTYFCSYPGHFSM -3333-3333---2222----------2222------3333-1111-------2222--- MRGTLKLEE --------- >STAPHOPAIN; SWP:P81297; 
PDB:1CV8; NEQYVNKLENFKIRETQGNNGWCAGYTMSALLNATYNTNKYHAEAVMRFLHPNLQGQQFQ -------1111---------------------------------------1111------ FTGLTPREMIYFGQTQGRSPQLLNRMTTYNEVDNLTKNNKGIAILGSRVESRNGMHAGHA -------------1111------------------1111------------iiii----- MAVVGNAKLNNGQEVIIIWNPWDNGFMTQDAKNNVIPVSNGDHYQWYSSIYGY --------1111-------3333------1111----1111------------ >POLYDENYLATE BINDING PROT; SWP:P11940; PDB:1CVJA; ASLYVGDLHPDVTEAMLYEKFSPAGPILSIRVCRDMITRRSLGYAYVNFQQPADAERALD --------1111--------3333--------------------------1111------ TMNFDVIKGKPVRIMWSQRDPSLRKSGVGNIFIKNLDKSIDNKALYDTFSAFGNILSCKV -2222-iiii----------3333--1111------------------3333-------- VCDENGSKGYGFVHFETQEAAERAIEKMNGMLLNDRKVFVGRFKSRKER --------------------------------iiii--------3333- >TRIACYLGLYCEROL HYDROLASE; SWP:Q05489; PDB:1CVL; ADTYAATRYPVILVHGLAGTDKFANVVDYWYGIQSDLQSHGAKVYVANLSGFQSDDGPNG --1111-----------------------2222----1111------------1111--- RGEQLLAYVKQVLAATGATKVNLIGHSQGGLTSRYVAAVAPQLVASVTTIGTPHRGSEFA ---------------------------------------3333---------1111---- DFVQDVLKTDPTGLSSTVIAAFVNVFGTLVSSSHNTDQDALAALRTLTTAQTATYNRNFP ------------3333----------1111-------------1111------------- SAGLGAPGSCQTGAATETVGGSQHLLYSWGGTAIQPTSTVTGATDTSTGTLDVANVTDPS -----2222---------%%%%----------------------3333---3333--333 TLALLATGAVMINRASGQNDGLVSRCSSLFGQVISTSYHWNHLDEINQLLGVRGANAEDP 3----------1111--------3333-------------1111----iiii-1111--- VAVIRTHVNRLKLQGV -----------1111- >GINGIPAIN R; SWP:P95493; PDB:1CVRA; YTPVEEKENGRMIVIVAKKYEGDIKDFVDWKNQRGLRTEVKVAEDIASPVTANAIQQFVK ------1111------3333-----------1111------3333--------------- QEYEKEGNDLTYVLLVGDHKDIPAKITPGIKSDQVYGQIVGNDHYNEVFIGRFSCESKED -----------------3333-----2222--3333--------------------3333 LKTQIDRTIHYERNITTEDKWLGQALCIASAEGGPSADNGESDIQHENVIANLLTQYGYT ---------------1111--------------1111%%%%------------------- KIIKCYDPGVTPKNIIDAFNGGISLVNYTGHGSETAWGTSHFGTTHVKQLTNSNQLPFIF ----------3333----3333--------------------33331111---------- 
DVACVNGDFLFSMPCFAEALMRAQKDGKPTGTVAIIASTIDQYWAPPMRGQDEMNEILCE ------------------------iiii----------------3333------------ KHPNNIKRTFGGVTMNGMFAMVEKYKKDGENMLDTWTVFGDPSLLVRTLVPTEMQVTAPA -1111-----------------------------------1111---------------- NISASAQTFEVACDYNGAIATLSDDGDMVGTAIVKDGKAIIKLNESIADETNLTLTVVGY --1111--------2222-----iiii-------iiii--------1111---------- NKVTVIKDVKVE ------------ >Basic fibroblast growth f; SWP:P11362; PDB:1CVSC; MPVAPYWTSPEKMEKKLHAVPAAKTVKFKCPSSGTPQPTLRWLKNGKEFKPDHRIGGYKV --------3333--------2222-------------------iiii------2222--- RYATWSIIMDSVVPSDKGNYTCIVENEYGSINHTYQLDVVERSPHRPILQAGLPANKTVA 3333--------3333---------1111--------------------2222------2 LGSNVEFMCKVYSDPQPHIQWLKHIEVNGSKIGPDNLPYVQILKTAGVNTTDKEMEVLHL 222-----------------------------1111----------33333333------ RNVSFEDAGEYTCLAGNSIGLSHHSAWLTVL ---3333---------3333----------- >PROSTAGLANDIN H2 SYNTHASE; SWP:Q05769; PDB:1CVUA; ANPCCSNPCQNRGECMSTGFDQYKCDCTRTGFYGENCTTPEFLTRIKLLLKPTPNTVHYI -1111----iiii-------------2222---1111---------3333---------1 LTHFKGVWNIVNNIPFLRSLIMKYVLTSRSYLIDSPPTYNVHYGYKSWEAFSNLSYYTRA 1113333--------------------------------1111---3333--1111---- LPPVADDCPTPMGVKGNKELPDSKEVLEKVLLRREFIPDPQGSNMMFAFFAQHFTAQFFK ----1111-1111-------------------------1111------------------ TDHKRGPGFTRGLGHGVDLNHIYGETLDRQHKLRLFKDGKLKYQVIGGEVYPPTVKDTQV -33331111--3333---1111--------1111--iiii-----iiii----3333--- EMIYPPHIPENLQFAVGQEVFGLVPGLMMYATIWLREHQRVCDILKQEHPEWGDEQLFQT ----11113333-------1111-------------------------1111-------- SKLILIGETIKIVIEDYVQHLSGYHFKLKFDPELLFNQQFQYQNRIASEFNTLYHWHPLL ------------------------------33331111--------33333333-3333- PDTFNIEDQEYSFKQFLYNNSILLEHGLTQFVESFTRQIAGRVAGGRNVPIAVQAVAKAS -----------33332222------------------------------3333------- IDQSREMKYQSLNEYRKRFSLKPYTSFEELTGEKEMAAELKALYSDIDVMELYPALLVEK ----------------1111-----3333----------------1111--3333----- PRPDAIFGETMVELGAPFSLKGLMGNPICSPQYWKPSTFGGEVGFKIINTASIQSLICNN 
----------------------11111111----3333--3333------------3333 VKGCPFTSFNVQ 2222-------- >Coagulation factor VII [P; SWP:P08709; PDB:1CVWH; IVGGKVCPKGECPWQVLLLVNGAQLCGGTLINTIWVVSAAHCFDKIKN -------22221111-----------------------1111------ >TYPE IIA BACTERIOCIN CARN; SWP:P38580; PDB:1CW5A; VNYGNGVSCSKTKCSVNWGQAFQERYTAGINSFVSGVASGAGSIGRRP ----------------3333---------------3333--------- >TYPE IIA BACTERIOCIN LEUC; SWP:P34034; PDB:1CW6A; KYYGNGVHCTKSGCSVNWGEAFSAGVHRLANGGNGFW ---------%%%%---------------3333----- >INVASIN; SWP:P11922; PDB:1CWVA; LTLTAAVIGDGAPANGKTAITVEFTVADFEGKPLAGQEVVITTNNGALPNKITEKTDANG ---------------------------3333------------iiii---------1111 VARIALTNTTDGVTVVTAEVEGQRQSVDTHFVKGTIAADKSTLAAVPTSIIADGLMASTI ------------------------------------1111-------------------- TLELKDTYGDPQAGANVAFDTTLGNMGVITDHNDGTYSAPLTSTTLGVATVTVKVDGAAF -----1111---------------------------------------------iiii-- SVPSVTVNFTADPIPDAGRSSFTVSTPDILADGTMSSTLSFVPVDKNGHFISGMQGLSFT --------------------------------------------1111------------ QNGVPVSISPITEQPDSYTATVVGNSVGDVTITPQVDTLILSTLQKKISLFPVPTLTGIL ----------------------------------------1111---------------- VNGQNFATDKGFPKTIFKNATFQLQMDNDVANNTQYEWSSSFTPNVSVNDQGQVTITYQT ----------------2222-----%%%%1111---------------1111-------- YSEVAVTAKSKKFPSYSVSYRFYPNRWIYDGGRSLVSSLEASRQCQGSDMSAVLESSRAT ---------3333-----------------------------1111-1111---3333-- NGTRAPDGTLWGEWGSLTAYSSDWQSGEYWVKKTSTDFETMNMDTGALQPGPAYLAFPLC ---------------3333--------------1111----------------------- ALSI ---- >HEPATITIS C VIRUS CAPSID ; SWP:P27958; PDB:1CWXA; STNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLG ------------------------------1111---------- -------- >CAMP-DEPENDENT PROTEIN KI; SWP:P12369; PDB:1CX4A; RIIHPKTDDQRNRLQEACKDILLFKNLDPEQMSQVLDAMFEKLVKEGEHVIDQGDDGDNF -----------------11111111--3333-------------2222---2222----- YVIDRGTFDIYVKCDGVGRCVGNYDNRGSFGELALMYNTPRAATITATSPGALWGLDRVT -------------------------------3333------------------------- 
FRRIIVKNNAKKRKMYESFIESLPFLKSLEVSERLKVVDVIGTKVYNDGEQIIAQGDSAD ----------------------3333---3333--3333-------2222---2222--- SFFIVESGEVRITMKRNGAVEIARCLRGQYFGELALVTNKPRAASAHAIGTVKCLAMDVQ -------------------------2222---3333------------------------ AFERLLGPCMEIMKRNIATYEEQLVALFGTNMDIV -----3333-------------------------- >CYTOCHROME C2; SWP:P0C0X8; PDB:1CXC; QEGDPEAGAKAFNQCQTCHVIVDDSGTTIAGRNAKTGPNLYGVVGRTAGTQADFKGYGEG ----------33333333---------------------2222-------1111------ MKEAGAKGLAWDEEHFVQYVQDPTKFLKEYTGDAKAKGKMTFKLKKEADAHNIWAYLQQV --------------------------------1111------------------------ AVRP ---- >AVIAN SARCOMA VIRUS INTEG; SWP:P03354; PDB:1CXQA; GRGLGPLQIWQTDFTLEPRMAPRSWLAVTVDTASSAIVVTQHGRVTSVAAQHHWATAIAV -!!!!------------1111--------------------------------------- LGRPKAIKTDNGSCFTSKSTREWLARWGIAHTTGIPGQAMVERANRLLKDKIRVLAEGDG -----------3333-----------------------------------------1111 FMKRIPTSKQGELLAKAMYALNH -----3333-------------- >COLLAGENASE-3; SWP:P33435; PDB:1CXVA; YNVFPRTLKWSQTNLTYRIVNYTPDMSHSEVEKAFRKAFKVWSDVTPLNFTRIYDGTADI ---2222---------------1111---------------3333--------------- MISFGTKEHGDFYPFDGPSGLLAHAFPPGPNYGGDAHFDDDETWTSSSKGYNLFIVAAHE -----------------------------!!!!-----1111------------------ LGHSLGLDHSKDPGALMFPIYTYTFMLPDDDVQGIQFLYG -----------1111------------------------- >CYSTEINE AND GLYCINE-RICH; SWP:Q05158; PDB:1CXXA; AEKCSACGDSVYAAEKVIGAGKPWHKNCFRCAKCGKSLESTTLTEKEGEIYCKGCYAKN ------------------------3333------------------------------- >CYTOCHROME B5; SWP:P82291; PDB:1CXYA; TLPVFTLEQVAEHHSPDDCWMAIHGKVYDLTPYVPNHPGPAGMMLVWCGQESTEAWETKS -----3333-----1111----iiii---33331111--22223333------------- YGEPHSSLAARLLQRYLIGTL -------------1111---- >Serine/threonine-protein ; SWP:Q16512; PDB:1CXZB; WSLLEQLGLAGADLAAPGVQQQLELERERLRREIRKELKLKEGAENLRRATTDLGRSLGP ----1111----1111-----------------------------------1111----- VELLLRGSSRRLDLLHQQLQELHAHV ---------------------3333- >APOPTOTIC PROTEASE ACTIVA; SWP:O14727; PDB:1CY5A; 
MDAKARNCLLQHREALEKDIKTSYIMDHMISDGFLTISEEEKVRNEPTQQQRAAMLIKMI ------------------------------------------3333-------------1 LKKDNDSYVSFYNALLHEGYKDLAALLHDGIPV 111-----------------------3333--- >CARBONYL REDUCTASE; SWP:P08074; PDB:1CYDA; LNFSGLRALVTGAGKGIGRDTVKALHASGAKVVAVTRTNSDLVSLAKECPGIEPVCVDLG --2222-------------------1111--------3333----------------111 DWDATEKALGGIGPVDLLVNNAALVIMQPFLEVTKEAFDRSFSVNLRSVFQVSQMVARDM 1---------------------------1111---------------------------- INRGVPGSIVNVSSMVAHVTFPNLITYSSTKGAMTMLTKAMAMELGPHKIRVNSVNPTVV -------------1111---2222--------------------3333------------ LTDMGKKVSADPEFARKLKERHPLRKFAEVEDVVNSILFLLSDRSASTSGGGILVDAGYL ---------------------1111---3333------------1111-------iiii- AS -- >CYCLODEXTRIN GLUCANOTRANS; SWP:P31797; PDB:1CYG; AGNLNKVNFTSDVVYQIVVDRFVDGNTSNNPSGALFSSGCTNLRKYCGGDWQGIINKIND --------1111------1111---3333-------2222-1111--------------- GYLTDMGVTAIWISQPVENVFSVMNDASGSASYHGYWARDFKKPNPFFGTLSDFQRLVDA --1111-------------------3333--1111----1111--3333----------- AHAKGIKVIIDFAPNHTSPASETNPSYMENGRLYDNGTLLGGYTNDANMYFHHNGGTTFS 3333----------------3333----%%%%-------------3333----------- SLEDGIYRNLFDLADLNHQNPVIDRYLKDAVKMWIDMGIDGIRMDAVKHMPFGWQKSLMD 3333------------3333-------------------------1111----------- EIDNYRPVFTFGEWFLSENEVDANNHYFANESGMSLLDFRFGQKLRQVLRNNSDNWYGFN ----------------1111------------------3333------------------ QMIQDTASAYDEVLDQVTFIDNHDMDRFMIDGGDPRKVDMALAVLLTSRGVPNIYYGTEQ -----------3333------1111----22223333------3333-------2222-- YMTGNGDPNNRKMMSSFNKNTRAYQVIQKLSSLRRNNPALAYGDTEQRWINGDVYVYERQ ------------------------------------3333-------------------- FGKDVVLVAVNRSSSSNYSITGLFTALPAGTYTDQLGGLLDGNTIQVGSNGSVNAFDLGP !!!!-----------------------------1111-----------%%%%------22 GEVGVWAYSATESTPIIGHVGPMMGQVGHQVTIDGEGFGTNTGTVKFGTTAANVVSWSNN 22-----------------------2222-----------------!!!!---------- QIVVAVPNVSPGKYNITVQSSSGQTSAAYDNFEVLTNDQVSVRFVVNNATTNLGQNIYIV -------------------1111------------------------------------- 
GNVYELGNWDTSKAIGPMFNQVVYSYPTWYIDVSVPEGKTIEFKFIKKDSQGNVTWESGS --3333%%%%-------------------------------------------------- NHVYTTPTNTTGKIIVDWQN -------------------- >CYTOCHROME C6; SWP:P08197; PDB:1CYJ; ADLALGAQVFNGNCAACHMGGRNSVMPEKTLDKAALEQYLDGGFKVESIIYQVENGKGAM -------------33332222-3333-------------2222-------------!!!! PAWADRLSEEEIQAVAEYVFKQATDAAWKY ----------------------1111---- >CYCLOPHILIN B; SWP:P23284; PDB:1CYNA; GPKVTVKVYFDLRIGDEDVGRVIFGLFGKTVPKTVDNFVALATGEKGFGYKNSKFHRVIK -------------------------------------------1111--2222-----22 DFMIQGGDFTRGDGTGGKSIYGERFPDENFKLKHYGPGWVSMANAGKDTNGSQFFITTVK 22----------------1111-------------------------------------- TAWLDGKHVVFGKVLEGMEVVRKVESTKTDSRDKPLKDVIIADCGKIEVEKPFAIAKE 3333------------3333---1111--1111------------------------- >CYTOCHROME B5; SWP:P00171; PDB:1CYO; SKAVKYYTLEEIQKHNNSKSTWLILHYKVYDLTKFLEEHPGGEEVLREQAGGDATENFED -------3333-----3333----iiii---11111111-------1111--------11 VGHSTDARELSKTFIIGELHPDDRSKIT 11---------1111----33333333- >CYOA; SWP:P18400; PDB:1CYX; KPITIEVVSMDWKWFFIYPEQGIATVNEIAFPANTPVYFKVTSNSVMHSFFIPRLGSQIY ------------------3333-----------------------------1111----- AMAGMQTRLHLIANEPGTYDGICAEICGPGHSGMKFKAIATPDRAAFDQWVAKAKQSPNT -2222------------------------3333-------------------3333---- MSDMAAFEKLAAPSEYNQVEYFSNVKPDLFADVINKFM --3333----------------------------1111 >VCP-LIKE ATPASE; SWP:O05209; PDB:1CZ4A; MESNNGIILRVAEANSTDPGMSRVRLDESSRRLLDAEIGDVVEIEKVRKTVGRVYRARPE ---------------------------3333----------------------------- DENKGIVRIDSVMRNNCGASIGDKVKVRKVRTEIAKKVTLAPIIRKDQRLKFGEGIEEYV -------------------2222---------------------1111------------ QRALIRRPMLEQDNISVPGLTLAGQTGLLFKVVKTLPSKVPVEIGEETKIEIREEPASEV ---------2222-------------------------------1111------------ LEEGG ----- >Hexokinase-1; SWP:P19367; PDB:1CZAN; DDQVKKIDKYLYAMRLSDETLIDIMTRFRKEMKNGLSRDFNPTATVKMLPTFVRSIPDGS ----------3333-------------------------3333----------------- EKGDFIALDLGGSSFRILRVQVNHEKNQNVHMESEVYDTPENIVHGSGSQLFDHVAECLG 
---------------------------------------3333----------------- DFMEKRKIKDKKLPVGFTFSFPCQQSKIDEAILITWTKRFKASGVEGADVVKLLNKAIKK ---11111111---------------1111------!!!!----2222------------ RGDYDANIVAVVNDTVGTMMTCGYDDQHCEVGLIIGTGTNACYMEELRHIDLVEGDEGRM -------------------------1111----------------33331111------- CINTEWGAFGDDGSLEDIRTEFDRAIDAYSLNPGKQLFEKMVSGMYLGELVRLILVKMAK ----3333-1111-3333-------------22223333---3333------------11 EGLLFEGRITPELLTRGKFNTSDVSAIEKNKEGLHNAKEILTRLGVEPSDDDCVSVQHVC 11-%%%%--3333-2222------------------------------------------ TIVSFRSANLVAATLGAILNRLRDNKGTPRLRTTVGVDGSLYKTHPQYSRRFHKTLRRLV -----------------------------------------------------------1 PDSDVRFLLSESGSGKGAAMVTAVAYRLAEQHRQIEETLAHFHLTKDMLLEVKKRMRAEM 111------1111--------------------------1111----------------- ELGLRKQTHNNAVVKMLPSFVRRTPDGTENGDFLALDLGGTNFRVLLVKIRSGKKRTVEM -----1111--------------------------------------------------- HNKIYAIPIEIMQGTGEELFDHIVSCISDFLDYMGIKGPRMPLGFTFSFPCQQTSLDAGI -------3333-------------------------------------------1111-- LITWTKGFKATDCVGHDVVTLLRDAIKRREEFDLDVVAVVNDTVGTMMTCAYEEPTCEVG ----!!!!----2222-------------------------------------1111--- LIVGTGSNACYMEEMKNVEMVEGDQGQMCINMEWGAFGDNGCLDDIRTHYDRLVDEYSLN -------------33331111-----------3333-1111-1111-------------2 AGKQRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETLKTRGIFETKFLSQIESDRL 2223333-------------------1111-%%%%--3333-2222-------------- ALLQVRAILQQLGLNSTCDDSILVKTVCGVVSRRAAQLCGAGMAAVVDKIRENRGLDRLN ---------------------------------------------------1111----- VTVGVDGTLYKLHPHFSRIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRT ---------------------------1111--------------------------- >DNA POLYMERASE ACCESSORY ; SWP:P04525; PDB:1CZDA; MKLSKDTTALLKNFATINSGIMLKSGQFIMTRAVNGTTYAEANISDVIDFDVAIYDLNGF ------------3333----------------1111------------------------ LGILSLVNDDAEISQSEDGNIKIADARSTIFWPAADPSTVVAPNKPIPFPVASAVTEIKA -1111--1111----1111----------------3333-------------------33 EDLQQLLRVSRGLQIDTIAITVKEGKIVINGFNKVEDSALTRVKYSLTLGDYDGENTFNF 
33--------1111--------%%%%------33331111-------------------- IINMANMKMQPGNYKLLLWAKGKQGAAKFEGEHANYVVALEADSTHDF --3333--------------!!!!----------------3333---- >POLYGALACTURONASE II; SWP:P26214; PDB:1CZFA; DSCTFTTAAAAKAGKAKCSTITLNNIEVPAGTTLDLTGLTSGTKVIFEGTTTFQYEEWAG -------------3333-----------2222-------2222----------------- PLISMSGEHITVTGASGHLINCDGARWWDGKGTSGKKKPKFFYAHGLDSSSITGLNIKNT --------------2222-----3333---!!!!-------------------------- PLMAFSVQANDITFTDVTINNADGDTQGGHNTDAFDVGNSVGVNIIKPWVHNQDDCLAVN ----------------------3333---------------------------------- SGENIWFTGGTCIGGHGLSIGSVGDRSNNVVKNVTIEHSTVSNSENAVRIKTISGATGSV ----------------------------------------------------2222---- SEITYSNIVMSGISDYGVVIQQDYEDGKPTGKPTNGVTIQDVKLESVTGSVDSGATEIYL ------------------------iiii-----------------------1111----- LCGSGSCSDWTWDDVKVTGGKKSTACKNFPSVASC --2222-----------------------3333-- >CYTOCHROME C3; SWP:P38554; PDB:1CZJ; TFEIPESVTMSPKQFEGYTPKKGDVTFNHASHMDIACQQCHHTVPDTYTIESCMTEGCHD -----------3333-------------3333---3333-1111----------2222-- NIKERTEISSVYRTFHTTKDSEKSCVGCHRELKRQGPSDAPLACNSCHVQ ------1111-3333---------------3333--------1111---- >FLAVODOXIN; SWP:P10340; PDB:1CZNA; AKIGLFYGTQTGVTQTIAESIQQEFGGESIVDLNDIANADASDLNAYDYLIIGCPTWNVG --------------------------1111----3333-33333333------------- ELQSDWEGIYDDLDSVNFQGKKVAYFGAGDQVGYSDNFQDAMGILEEKISSLGSQTVGYW --------33331111-2222------------1111----------------------- PIEGYDFNESKAVRNNQFVGLAIDEDNQPDLTKNRIKTWVSQLKSEFGL -2222--------%%%%----------3333-------------1111- >FERREDOXIN I; SWP:P06543; PDB:1CZPA; ATFKVTLINEAEGTKHEIEVPDDEYILDAAEEQGYDLPFSCRAGACSTCAGKLVSGTVDQ ---------1111-------1111------1111------------1111---------1 SDQSFLDDDQIEAGYVLTCVAYPTSDVVIQTHKEEDLY 111-------------3333------------3333-- >D-PEPTIDE INHIBITOR; SWP:NA; PDB:1CZQA; RMKQIEDKIEEIESKQKKIENEIARIKKLLQLTVWGIKQLQARIL 3333----------------------------------------- >COAGULATION FACTOR V; SWP:P12259; PDB:1CZTA; 
GCSTPLGMENGKIENKQITASSFKKSWWGDYWEPFRARLNAQGRVNAWQAKANNNKQWLE -------1111--3333---------------1111-2222------------------- IDLLKIKKITAIITQGCKSLSSEMYVKSYTIHYSEQGVEWKPYRLKSSMVDKIFEGNTNT ------------------------------------------------------------ KGHVKNFFNPPIISRFIRVIPKTWNQSITLRLELFGCDIY ---------------------------------------- >LATENT MEMBRANE PROTEIN 1; SWP:Q12933; PDB:1CZYA; AMADLEQKVLEMEASTYDGVFIWKISDFPRKRQEAVAGRIPAIFSPAFYTSRYGYKMCLR ------------------------------------------------------------ IYLNGDGTGRGTHLSLFFVVMKGPNDALLRWPFNQKVTLMLLDQNNREHVIDAFRPDVTS -----!!!!-------------1111------------------------------1111 SSFQRPVNDMNIASGCPLFCPVSKMEAKNSYVRDDAIFIKAIVDLTGL 1111----------------3333--------%%%%--------2222 >TYPE II RESTRICTION ENZYM; SWP:P43642; PDB:1D02A; LSGRLNWQALAGLKASGAEQNLYNVFNAVFEGTKYVLYEKPKHLKNLYAQVVLPDDVIKE ---------------------------1111----------1111--1111------111 IFNPLIDLSTTQWGVSPAFAIENTETHKILFGEIKRQDGWVEGKDPSAGRGNAHERSCKL 1-----3333------------------------------22223333------------ FTPGLLKAYRTIGGINDEEILPFWVVFEGDITRDPKRVREITFWYDHYQDNYFMWRPNES ----------------3333--------3333------------!!!!-------22223 GEKLVQHFNEKLKKYLD 333----------1111 >BOVINE ENDOTHELIAL NITRIC; SWP:P29473; PDB:1D0CA; GPKFPRVKNWELGSITYDTLCAQSQQDGPCTPRRCLGSLVLPRKLQTRPSPGPPPAEQLL --------------------1111------3333-1111---------------3333-- SQARDFINQYYSSIKRSGSQAHEERLQEVEAEVASTGTYHLRESELVFGAKQAWRNAPRC ---------------2222-------------------------------------1111 VGRIQWGKLQVFDARDCSSAQEMFTYICNHIKYATNRGNLRSAITVFPQRAPGRGDFRIW -3333--------1111-----------------%%%%---------------------- NSQLVRYAGYRQQDGSVRGDPANVEITELCIQHGWTPGNGRFDVLPLLLQAPDEAPELFV -----------1111----3333-------1111-------------------------- LPPELVLEVPLEHPTLEWFAALGLRWYALPAVSNMLLEIGGLEFSAAPFSGWYMSTEIGT -1111----------3333-------------------iiii------------------ RNLCDPHRYNILEDVAVCMDLDTRTTSSLWKDKAAVEINLAVLHSFQLAKVTIVDHHAAT ----1111--------1111----1111-------------------------------- 
VSFMKHLDNEQKARGGCPADWAWIVPPISGSLTPVFHQEMVNYILSPAFRYQPDPW -------------------3333-----11113333-------------------- >ANTICOAGULANT PROTEIN; SWP:P17726; PDB:1D0DA; YNRLCIKPRDWIDECDSNEGGERAYFRNGKGGCDSFWICPEDHTGADYYSSYRDCFNACI -3333--1111----1111-------------------1111------------------ >REVERSE TRANSCRIPTASE; SWP:P03355; PDB:1D0EA; GSHMTWLSDFPQAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQ -3333----33333333-----1111---------------------3333--------- RLLDQGILVPCQSPWNTPLLPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGL -----------------------2222----------3333------------3333--- PPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDPEMGISGQLTWTRLPQGFKNSPTLF 3333---------3333--------3333-----3333--------------1111---- DEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQGTRALLQTLGNLGYRASAKKA ------------------------------------------------3333---3333- QICQKQVKYLGYLLKEGQR ------------------- >TNFRSF10B/DR5; SWP:Q6UXM8; PDB:1D0GR; SSPSEGLCPPGHHISEDGRDCISCKYGQDYSTHWNDLLFCLRCTRCDSGEVELSPCTTTR ---iiii-2222--3333------2222------------------1111---------- NTVCQCEEGTFREEDSPEMCRKCRTGCPRGMVKVGDCTPWSDIECVHK ------2222--1111-------------------------------- >HORSE PLASMA GELSOLIN; SWP:Q28372; PDB:1D0NA; VEHPEFLKAGKEPGLQIWRVEKFDLVPVPPNLYGDFFTGDAYVILKTVQLRNGILQYDLH --3333--------------%%%%----1111-----------------1111------- YWLGNECSQDESGAAAIFTVQLDDYLNGRAVQHREVQGFESATFLGYFKSGLKYKKGGVA ---1111----------------1111--------2222-3333---3333------333 SGFKHVVPNEVVVQRLLQVKGRRVVRATEVPVSWESFNNGDCFILDLGNNIYQWCGSKSN 3-------------------------------3333-------------------1111- RFERLKATQVSKGIRDNERSGRAQVSVFEEGAEPEAMLQVLGPKPTLPEATEDTVKEDAA ------------------iiii------2222---------------------------- NRKLAKLYKVSNGAGPMVVSLVADENPFAQGALRSEDCFILDHGKDGKIFVWKGKQANME ----------------------------3333-1111-----3333-------1111333 ERKAALKTASDFISKMDYPKQTQVSVLPEGGETPLFRQFFKNWRDPDQTEGLGLAYLSSH 31111--------1111-1111-----2222-33331111-----------------111 IAHVERVPFDAATLHTSTAMAAQHGMDDDGTGQKQIWRVEGSNKVPVDPATYGQFYGGDS 
1--------33331111------------------------------3333----1111- YIILYNYRHGSRQGQIIYNWQGAQSTQDEVAASAILTAQLDEELGGTPVQSRVVQGKEPA ---------------------1111---------------------------------33 HLMSLFGGKPMIVYKGGTSREGGQTAPASTRLFQVRASSSGATRAVEIIPKAGALNSNDA 33-----------------------------------1111------------------- FVLKTPSAAYLWVGAGASEAEKTGAQELLRVLRAQPVQVAEGSEPDSFWEALGGKATYRT -------------1111-3333-----3333----------------------------- SPRLKDKKMDAHPPRLFACSNKIGRFVIEEVPGEFMQEDLATDDVMLLDTWDQVFVWVGK ----3333---------------------------3333-1111-----1111-----11 DSQDEEKTEALTSAKRYIDTDPAHRDRRTPITVVKQGFEPPSFVGWFLGWDDSYWSVDPL 1111111111----------1111-1111-----2222-11111111---1111------ DRALAELAA --------- >DNA PRIMASE; SWP:Q9X4D0; PDB:1D0QA; GHRIPEETIEAIRRGVDIVDVIGEYVQLKRQGRNYFGLCPFHGEKTPSFSVSPEKQIFHC !!!!--------1111-----1111-----!!!!------------------1111---- FGCGAGGNAFTFLMDIEGIPFVEAAKRLAAKAGVDLSVYELD -----------------------------------3333--- ------------------------------ >MYOSIN S1DC MOTOR DOMAIN; SWP:P08799; PDB:1D0XA; NPIHDRTSDYHKYLKVKQGDSDLFKLTVSDKRYIWYNPDPKERDSYECGEIVSETSDSFT 33331111--------------3333---------------------------------- FKTVDGQDRQVKKDDANQRNPIKFDGVEDMSELSYLNEPAVFHNLRVRYNQDLIYTYSGL --3333-----3333-----3333----3333----------------1111-----!!! 
FLVAVNPFKRIPIYTQEMVDIFKGRRRNEVAPHIFAISDVAYRSMLDDRQNQSLLITGES !--------------------22221111-----------------------------22 GAGKTENTKKVIQYLASVAGRNGVLEQQILQANPILEAFGNAKTTRNNNSSRFGKFIEIQ 22------------------------------3333-------1111------------- FNNAGFISGASIQSYLLEKSRVVFQSETERNYHIFYQLLAGATAEEKKALHLAGPESFNY -1111--------------3333--2222--3333-------3333-------3333111 LNQSGCVDIKGVSDSEEFKITRQAMDIVGFSQEEQMSIFKIIAGILHLGNIKFEKGAGEG 1-------2222----------------------------------3333---------- AVLKDKTALNAASTVFGVNPSVLEKALMEPRILAGRDLVAQHLNVEKSSSSRDALVKALY ------------------------------------------------------------ GRLFLWLVKKINNVLCQERKAYFIGVLDISGFEIFKVNSFEQLCINYTNEKLQQFFNHHM ------------------------------------------------------------ FKLEQEEYLKEKINDSQATIDLIDGRQPPGILALLDEQSVFPNATDNTLITKLHSHFSKK ----------------------------------------1111------------2222 NAKYEEPRFSKTEFGVTHYAGQVMYEIQDWLEKNKDPLQQDLELCFKDSSDNVVTKLFND 1111-------------1111-----2222---------------1111-3333-----1 PNIASRAFITVAAQYKEQLASLMATLETTNPHFVRCIIPNNKQLPAKLEDKVVLDQLRCN 111----------------------1111--------------------3333------- GVLEGIRITRKGFPNRIIYADFVKRYYLLAPNVPRDAEDSQKATDAVLKHLNIDPEQYRF --------3333-------------1111------------------------3333--- GITKIFFRAGQLARIEEARE -------2222--------- >DIHYDROFOLATE REDUCTASE; SWP:Q60034; PDB:1D1GA; AKVIFVLAMDVSGKIASSVESWSSFEDRKNFRKITTEIGNVVMGRITFEEIGRPLPERLN ---------1111-----------------------------------------2222-- VVLTRRPKTSNNPSLVFFNGSPADVVKFLEGKGYERVAVIGGKTVFTEFLREKLVDELFV -----------1111--------------1111----------------1111------- TVEPYVFGKGIPFFDEFEGYFPLKLLEMRRLNERGTLFLKYSVE -------------------------------3333--------- >HANATOXIN TYPE 1; SWP:P56852; PDB:1D1HA; ECRYLFGGCKTTSDCCKHLGCKFRDKYCAWDFTFS ---2222---1111--------------------- >PROFILIN II; SWP:P35080; PDB:1D1JA; AGWQSYVDNLMCDGCCQEAAIVGYCDAKYVWAATAGGVFQSITPIEIDMIVGKDREGFFT -3333----------------------------2222-11113333---------1111- NGLTLGAKKCSVIRDSLYVDGDCTMDIRTKSQGGEPTYNVAVGRAGRALVIVMGKEGVHG 
----iiii----------2222--------------------------------2222-- GTLNKKAYELALYLRRSD ------------------ ------------------------------------------------------------ --------------------------------------- >TYROSINE PHOSPHATASE (E.C; SWP:P40347; PDB:1D1QA; IEKPKISVAFIALGNFCRSPMAEAIFKHEVEKANLENRFNKIDSFGTSNYHVGESPDHRT ------------------------------11113333------------2222------ VSICKQHGVKINHKGKQIKTKHFDEYDYIIGMDESNINNLKKIQPEGSKAKVCLFGDWNT ----1111----------3333------------------11112222-----1111--- NDGTVQTIIEDPWYGDIQDFEYNFKQITYFSKQFLKKEL -----------1111------------------------ >HYPOTHETICAL 11.4 KD PROT; SWP:P08245; PDB:1D1RA; KGDGVVRIQRQTSGRKGKGVCLITGVDLDDAELTKLAAELKKKCGCGGAVKDGVIEIQGD ----------------------------3333---------------------------- KRDLLKSLLEAKGMKVKLAGGLE --------3333----------- >ALCOHOL DEHYDROGENASE CLA; SWP:P40394; PDB:1D1TA; GTAGKVIKCKAAVLWEQKQPFSIEEIEVAPPKTKEVRIKILATGICRTDDHVIKGTMVSK -2222--------------------------2222----------3333----------- FPVIVGHEATGIVESIGEGVTTVKPGDKVIPLFLPQCRECNACRNPDGNLCIRSDITGRG ----------------2222---2222------------3333-1111--1111------ VLADGTTRFTCKGKPVHHFLNTSTFTEYTVVDESSVAKIDDAAPPEKVCLIGCGFSTGYG -1111-------------%%%%---------3333----111111113333--------- AAVKTGKVKPGSTCVVFGLGGVGLSVIMGCKSAGASRIIGIDLNKDKFEKAMAVGATECI --------2222-------3333--------------------3333------------- SPKDSTKPISEVLSEMTGNNVGYTFEVIGHLETMIDALASCHMNYGTSVVVGVPPSAKML 1111---------------------------------1111------------------- TYDPMLLFTGRTWKGCVFGGLKSRDDVPKLVTEFLAKKFDLDQLITHVLPFKKISEGFEL ------3333------%%%%-3333--------1111---3333---------------- LNSGQSIRTVLTF 1111--------- >TRNA SYNTHETASE; SWP:NA; PDB:1D2DA; MVYDKIAAQGEVVRKLKAEKAPKAKVTEAVECLLSLKAEYKEKTGKEYVPGLEHHH ---3333--------------3333------------------------------- >ELONGATION FACTOR TU (EF-; SWP:P49410; PDB:1D2EA; KPHVNVGTIGHVDHGKTTLTAAITKILAEGGGAKFKKYEEIDNAPEERARGITINAAHVE ----------2222-----------3333------------------------------- YSTAARHYAHTDCPGHADYVKNMITGTAPLDGCILVVAANDGPMPQTREHLLLARQIGVE 
--3333--------3333-------------------3333--3333-------1111-- HVVVYVNKADAVQDSEMVELVELEIRELLTEFGYKGEETPIIVGSALCALEQRDPELGLK -------3333------------------1111-3333---------------3333--- SVQKLLDAVDTYIPVPTRDLEKPFLLPVESVYSIPGRGTVVTGTLERGILKKGDECEFLG ------------------1111----------------------------2222-----% HSKNIRTVVTGIEMFHKSLDRAEAGDNLGALVRGLKREDLRRGLVMAKPGSIQPHQKVEA %%%----------%%%%-----2222---------3333--------2222--------- QVYILTKEEGGRHKPFVSHFMPVMFSLTWDMACRIILPPGKELAMPGEDLKLTLILRQPM -----3333-------2222-----!!!!---------------2222------------ ILEKGQRFTLRDGNRTIGTGLVTDTPAMTEEDKNIKW --2222-----!!!!-------------3333----- >MALY PROTEIN; SWP:P23256; PDB:1D2FA; LLPFTISDMDFATAPCIIEALNQRLMHGVFGYSRWKNDEFLAAIAHWFSTQHYTAIDSQT ------------------------3333----------------------------3333 VVYGPSVIYMVSELIRQWSETGEGVVIHTPAYDAFYKAIEGNQRTVMPVALEKQADGWFC -------------------2222--------3333----1111----------------- DMGKLEAVLAKPECKIMLLCSPQNPTGKVWTCDELEIMADLCERHGVRVISDEIHMDMVW -------3333-------------------------------1111------1111---- GEQPHIPWSNVARGDWALLTSGSKSFNIPALTGAYGIIENSSSRDAYLSALKGRDGLSSP -------3333----------3333--1111----------------------------- SVLALTAHIAAYQQGAPWLDALRIYLKDNLTYIADKMNAAFPELNWQIPQSTYLAWLDLR ----------------------------------------------------------33 PLNIDDNALQKALIEQEKVAIMPGYTYGEEGRGFVRLNAGCPRSKLEKGVAGLINAIRAV 33-------------------------3333----------3333--------------- R - >LOW-DENSITY LIPOPROTEIN R; SWP:P01130; PDB:1D2JA; VATCRPDEFQCSDGNCIHGSRQCDREYDCKDLSDEVGCVN ----------3333----1111----3333---------- >LIPOPROTEIN RECEPTOR RELA; SWP:Q07954; PDB:1D2LA; GSPPQCQPGEFACANSRCIQERWKCDGDNDCLDNSDEAPALCHQH ------------3333------------------1111------- >N-ETHYLMALEIMIDE-SENSITIV; SWP:P18708; PDB:1D2NA; EDYASYIMNGIIKWGDPVTRVLDDGELLVQQTKNSDRTPLVSVLLEGPPHSGKTALAAKI -3333---------3333-----------------------------2222--------- AEESNFPFIKICSPDKMIGFSETAKCQAMKKIFDDAYKSQLSCVVVDDIERLLDYVPIGP -3333--------1111------------------1111-----------1111------ 
RFSNLVLQALLVLLKKAPPQGRKLLIIGTTSRKDVLQEMEMLNAFSTTIHVPNIATGEQL ------------1111--2222--------------11111111---------------- LEALELLGNFKDKERTTIAQQVKGKKVWIGIKKLLMLIEMSLQMDPEYRVRKFLALLREE ----------3333-------2222---------------11113333------------ GASPLD --1111 >COLLAGEN ADHESIN; SWP:Q53654; PDB:1D2OA; ETTSSIGEKVWDDKDNQDGKRPEKVSVNLLANGEKVKTLDVTSETNWKYEFKDLPKYDEG ------------%%%%--------------iiii-------3333--------------- KKIEYTVTEDHVKDYTTDINGTTITNKYTPGETSATVTKNWDDNNNQDGKRPTEIKVELY -----------2222-------------2222---------------------------- QDGKATGKTAILNESNNWTHTWTGLDEKAKGQQVKYTVEELTKVKGYTTHVDNNDMGNLI iiii--------3333------------iiii-----------2222-------1111-- TTNKYTP ------- >SEX HORMONE-BINDING GLOBU; SWP:P04278; PDB:1D2SA; PPAVHLSNGPGQEPIAVMTFDLTKITKTSSSFEVRTWDPEGVIFYGDTNPKDDWFMLGLR -------!!!!---------3333------------------------3333-------- DGRPEIQLHNHWAQLTVGAGPRLDDGRWHQVEVKMEGDSVLLEVDGEEVLRLRQVSGHPI ---------1111----------------------!!!!----iiii------------- MRIALGGLLFPASNLRLPLVPALDGCLRRDSWLDKQAEISASAPTSLRSC ----------3333-------------------3333------------- >ACID PHOSPHATASE; SWP:Q9S1A6; PDB:1D2TA; GNDTTTKPDLYYLKNSEAINSLALLPPPPAVGSIAFLNDQAMYEQGRLLRNTERGKLAAE --33331111---3333--3333------2222---------------1111-------- DANLSSGGVANAFSGAFGSPITEKDAPALHKLLTNMIEDAGDLATRSAKDHYMRIRPFAF 333333333333--3333---3333----------------3333--3333--------- YGVSTCNTQDKLSKNGSYPSGHTSIGWATALVLAEINPQRQNEILKRGYELGQSRVICGY ------------------------------------3333-------------------- HWQSDVDAARVVGSAVVATLHTNPAFQQQLQKAKAEFAQHQK -3333------------------------------------- >MYELOPEROXIDASE; SWP:P05164; PDB:1D2VA; CPEQDKYRTITGMCNNRRSPTLGASNRAFVRWLPAEYEDGFSLPYGWTPGVKRNGFPVAL --------1111---3333-2222------------1111---22222222-iiii---- ARAVSNEIVRFPTDQLTPDQERSLMFMQWGQLLDHDLDFTPEPA -----------3333---11113333------1111-------- >Myeloperoxidase [Precurso; SWP:P05164; PDB:1D2VC; VNCETSCVQQPPCFPLKIPPNDPRIKNQADCIPFFRSPACPGSNITIRNQINALTSFVDA -1111---------------------1111---------2222----------------- 
SMVYGSEEPLARNLRNMSNQLGLLAVNQRFQDNGRALLPFDNLHDDPCLLTNRSARIPCF -----------1111----------------iiii----------3333--3333----- LAGDTRSSEMPELTSMHTLLLREHNRLATELKSLNPRWDGERLYQEARKIVGAMVQIITY ---1111---------------------------3333---------------------- RDYLPLVLGPTAMRKYLPTYRSYNDSVDPRIANVFTNAFRYGHTLIQPFMFRLDNRYQPM --3333-----------------1111----3333----3333----------1111--- EPNPRVPLSRVFFASWRVVLEGGIDPILRGLMATPAKLNRQNQIAVDEIRERLFEQVMRI ------333322223333---------------------1111--------2222----- GLDLPALNMQRSRDHGLPGYNAWRRFCGLPQPETVGQLGTVLRNLKLARKLMEQYGTPNN ------------1111--------1111----------------------------3333 IDIWMGGVSEPLKRKGRVGPLLACIIGTQFRKLRDGDRFWWENEGVFSMQQRQALAQISL ------3333--2222-----------------1111--1111-----------1111-- PRIICDNTGITTVSKNNIFMSNSYPRDFVNCSTLPALNLASWREA ----------------3333---------3333-----1111--- >DEATH DOMAIN OF PELLE; SWP:Q05652; PDB:1D2ZA; LDNTMAIRLLPLPVRAQLCAHLDALDVWQQLATAVKLYPDQVEQISSQKQRGRSASNEFL -----3333---------------------------------------1111-------- NIWGGQYNHTVQTLFALFKKLKLHNAMRLIKDYVSEDLHKYI ----1111----------1111-------1111-33331111 >Protein Tube; SWP:P22812; PDB:1D2ZB; LSSKYSRNTELRRVEDNDIYRLAKILDENSCWRKLMSIIPKGMDVQACSGAGCLNFPAEI -----11113333-3333--------2222-------------3333--2222-333333 KKGFKYTAQDVFQIDEAANRLPPDQSKSQMMIDEWKTSGKLNERPTVGVLLQLLVQAELF 33---------------33331111---------1111----------------1111-- SAADFVALDFLNESTPARPVDGPGALISLE ---------------------1111----- >HALOPHILIC MALATE DEHYDRO; SWP:Q07841; PDB:1D3AA; TKVSVVGAAGTVGAAAGYNIALRDIADEVVFVDIPD ----------3333---------------------- >SMALL NUCLEAR RIBONUCLEOP; SWP:P62323; PDB:1D3BA; GVPIKVLHEAEGHIVTCETNTGEVYRGKLIEAEDNMNCQMSNITVTYRDGRVAQLEQVYI -3333--1111-------1111----------1111----------1111---------- RGCKIRFLILPD 1111-------- >Small nuclear ribonucleop; SWP:Q66K91; PDB:1D3BB; SKMLQHIDYRMRCILQDGRIFIGTFKAFDKHMNLILCDCDEFRKIKPKNSKQAEREEKRV --3333--------1111----------1111----------------1111-------- LGLVLLRGENLVSMTVEGPPP ------3333----------- >CYCLODEXTRIN GLYCOSYLTRAN; 
SWP:P43379; PDB:1D3CA; APDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGTCTNLRLYCGGDWQGIINK -1111--11111111-----3333----3333--!!!!-1111----------------- INDGYLTGMGVTAIWISQPVENIYSIINYSGVNNTAYHGYWARDFKKTNPAYGTIADFQN -----3333--------------------------1111----1111-3333-------- LIAAAHAKNIKVIIDFAPNHTSPASSDQPSFAENGRLYDNGTLLGGYTNDTQNLFHHNGG -----1111------------------1111-%%%%--iiii-------3333------- TDFSTTENGIYKNLYDLADLNHNNSTVDVYLKDAIKMWLDLGIDGIRMNAVKHMPFGWQK -------------!!!!---11113333----------1111-------1111------- SFMAAVNNYKPVFTFGQWFLGVNEVSPENHKFANESGMSLLDFRFAQKVRQVFRDNTDNM --------------------2222------------------------------------ YGLKAMLEGSAADYAQVDDQVTFIDNHDMERFHASNANRRKLEQALAFTLTSRGVPAIYY ---------------1111------1111----1111----------3333-------22 GTEQYMSGGTDPDNRARIPSFSTSTTAYQVIQKLAPLRKCNPAIAYGSTQERWINNDVLI 22--------------------------------------3333----------1111-- YERKFGSNVAVVAVNRNLNAPASISGLVTSLPQGSYNDVLGGLLNGNTLSVGSGGAASNF -------------------------------------1111------------------- TLAAGGTAVWQYTAATATPTIGHVGPMMAKPGVTITIDGRGFGSSKGTVYFGTTAVSGAD --2222-----------------------2222-----------------!!!!---333 ITSWEDTQIKVKIPAVAGGNYNIKVANAAGTASNVYDNFEVLSGDQVSVRFVVNNATTAL 3---1111------------------3333----------------------------22 GQNVYLTGSVSELGNWDPAKAIGPMYNQVVYQYPNWYYDVSVPAGKTIEFKFLKKQGSTV 22-------3333iiii1111---------------------2222---------!!!!- TWEGGSNHTFTAPSSGTATINVNWQP -------------------------- >DIHYDROOROTATE DEHYDROGEN; SWP:Q02127; PDB:1D3GA; MATGDERFYAEHLMPTLQGLLDPESAHRLAVRFTSLGLLPFQDSDMLEVRVLGHKFRNPV ---------------------------------1111-------1111--iiii------ GIAAGFDKHGEAVDGLYKMGFGFVEIGSVTPKPQEGNPRPRVFRLPEDQAVINRYGFNSH ------1111----------------------------------3333------------ GLSVVEHRLRARQQKQAKLTEDGLPLGVNLGKNKTSVDAAEDYAEGVRVLGPLADYLVVN --------------------------------1111-------------3333------- VSSPNTAGLGKAELRRLLTKVLQERDGLRRVHRPAVLVKIAPDLTSQDKEDIASVVKELG ---------3333-----------11111111---------------------------- 
IDGLIVTNTTVSRPAGLQGALRSETGGLSGKPLRDLSTQTIREMYALTQGRVPIIGVGGV -------------2222-1111-------3333--------------------------- SSGQDALEKIRAGASLVQLYTALTFWGPPVVGKVKRELEALLKEQGFGGVTDAIGADHRR ---------1111------3333-------------------1111--333322221111 >ARGINASE; SWP:P07824; PDB:1D3VA; KPIEIIGAPFSKGQPRGGVEKGPAALRKAGLVEKLKETEYNVRDHGDLAFVDVPNDSPFQ ----------1111-3333---------------1111--------------------!! IVKNPRSVGKANEQLAAVVAETQKNGTISVVLGGDHSMAIGSISGHARVHPDLCVIWVDA !!--------------------1111-------------------33331111------- HTDINTPLTTSSGNLHGQPVAFLLKELKGKFPDVPGFSWVTPCISAKDIVYIGLRDVDPG -----1111--------3333--3333------2222-------1111------------ EHYIIKTLGIKYFSMTEVDKLGIGKVMEETFSYLLGRKKRPIHLSFDVDGLDPVFTPATG -----1111-------------------------------------1111-3333----- TPVVGGLSYREGLYITEEIYKTGLLSGLDIMEVNPTLGKTPEEVTRTVNTAVALTLSCFG -------------------3333----------1111--3333----------------- TKREGNHK -3333--- >DNA TOPOISOMERASE VI A SU; SWP:Q57815; PDB:1D3YA; QAKIFAQTTKMLEFAKQLLETDDFSTLREAYYVSKNWGEARFDDQQASNNVIEDLEAALG ------------------1111--------------!!!!----------------1111 VLREHLGFIPEEDGSSVVGPLKIIEETPEGELVVDCTKLGTGAYNIPNDVTKLNLETDAD -3333---------------------1111-------------------1111------- FILAIETSGMFARLNAERFWDKHNCILVSLKGVPARATRRFIKRLHEEHDLPVLVFTDGD --------------------1111------------------------------------ PYGYLNIYRTLKVDKLSIPAARLIGVTPQDIIDYDLPTHPLKEQDIKRIKDGLKNDDFVR -------------11111111-----3333-1111----------------------333 SFPEWQKALKQMLDMGVRAEQQSLAKYGLKYVVNTYLPEKIKDESTWLP 3------------------111133331111-----------3333--- >QUINONE REDUCTASE; SWP:P15559; PDB:1D4AA; VGRRALIVLAHSERTSFNYAMKEAAAAALKKKGWEVVESDLYAMNFNPIISRKDITGKLK ------------1111-------------1111------3333----------------- DPANFQYPAESVLAYKEGHLSPDIVAEQKKLEAADLVIFQFPLQWFGVPAILKGWFERVF 3333---------------------------------------%%%%------------- IGEFAYTYAAMYDKGPFRSKKAVLSITTGGSGSMYSLQGIHGDMNVILWPIQSGILHFCG ------1111!!!!1111------------3333-1111-----3333--------1111 
FQVLEPQLTYSIGHTPADARIQILEGWKKRLENIWDETPLYFAPSSLFDLNFQAGFLMKK ----------1111------------------3333-------3333---1111------ EVQDEEKNKKFGLSVGHHLGKSIPTDNQIKARK -----1111----3333iiii----1111---- >HUMAN CELL DEATH-INDUCING; SWP:Q9UHD4; PDB:1D4BA; MEYLSALNPSDLLRSVSNISSEFGRRVWTSAPPPQRPFRVCDHKRTIRKGLTAATRQELL --------------------------------------------------------3333 AKALETLLLNGVLTLVLEEDGTAVDSEDFFQLLEDDTCLMVLQSGQSWSPTRSGVLHHHH -------------------------33331111--------------------------- HH -- >FLAVOCYTOCHROME C FUMARAT; SWP:P83223; PDB:1D4DA; VLADFHGEMGGCDSCHVSDKGGVTNDNLTHENGQCVSCHGDLKELAAAAPVSPHKSHLIG -3333------3333--3333---3333-----------------------1111----- EIACTSCHKGHEKSVAYCDACHSFGFDMPFGGKWERKFVPVDADKAAQDKAIAAGVKETT --1111---------3333--------------------1111------3333------- DVVIIGSGGAGLAAAVSARDAGAKVILLEKEPIPGGNTKLAAGGMNAAETKPQAKLGIED ------------------------------------3333-------------1111--- KKQIMIDDTMKGGRNINDPELVKVLANNSSDSIDWLTSMGADMTDVGRMGGASVNRSHRP 3333---------------------1111-------1111--------2222-------2 TGGAGVGAHVAQVLWDNAVKRGTDIRLNSRVVRILEDGKVTGVLVKGEYTGYYVIKADAV 222--------------------------------------------------------- VIAAGGFAKNNERVSKYDPKLKGFKATNHPGATGDGLDVALQAGAATRDLQYIQAHPTYS ----------333311111111------1111----------------3333-------- PAGGVMITEAVRGNGAIVVNREGNRFMNEITTRDKASAAILQQKGESAYLVFDDSIRKSL 1111---------------1111----1111-----------2222------3333---- KAIEGYVHLNIVKEGKTIEELAKQIDVPAAELAKTVTAYNGFVSGKDAQFERPDLPRELV ------1111-------------------------------------------------- VAPFYALEIAPAVHHTMGGLVIDTKAEVKSEKTAKPITGLYAAGEVTGGVHGANRLGGNA ----------------------1111----------2222---3333-1111---2222- ISDIVTYGRIAGASAAKFAK -------------------- >Genome polyprotein; SWP:P21404; PDB:1D4M1; GDVEEAIERAVVHVADTMRSGPSNSASVPALTAVETGHTSQVTPSDTMQTRHVKNYHSRS ---------------------------3333-----------3333------------11 ESTVENFLGRSACVYMEEYKTTDNDVNKKFVAWPINTKQMVQMRRKLEMFTYLRFDMEVT 11----------------------1111-------------------------------- 
FVITSRQDPGTTLAQDMPVLTHQIMYVPPGGPIPAKVDDYAWQTSTNPSIFWTEGNAPAR ---------------------------2222----1111-3333--------2222---- MSIPFISIGNAYSNFYDGWSNFDQRGSYGYNTLNNLGHIYVRHVSGSSPHPITSTIRVYF --------------------3333----3333---------------------------- KPKHTRAWVPRPPRLCQYKKAFSVDFTPTPITDTRKDINTVTTV ------------------------------------1111---- >Genome polyprotein; SWP:P21404; PDB:1D4M2; SDRVRSITLGNSTITTQECANVVVGYGRWPTYLRDDEATAEDQPTQPDVATCRFYTLDSI --------!!!!---------------------1111---------!!!!---------- KWEKGSVGWWWKFPEALSDMGLFGQNMQYHYLGRAGYTIHVQCNASKFHQGCLLVVCVPE --1111----------1111-------------------------1111----------- AEMGGAVVGQAFSATAMANGDKAYEFTSATQSDQTKVQTAIHNAGMGVGVGNLTIYPHQW ------2222--3333--!!!!----------1111---3333-----1111-------- INLRTNNSATIVMPYINSVPMDNMFRHYNFTLMVIPFVKLDYADTASTYVPITVTVAPMC -3333------------------------------------------------------- AEYNGLRLAQAQ ------------ >Genome polyprotein; SWP:P21404; PDB:1D4M3; GLPTMNTPGSTQFLTSDDFQSPCALPQFDVTPSMNIPGEVKNLMEIAEVDSVVPVNNVQD ------2222---1111-------2222-----------------1111--------111 TTDQMEMFRIPVTINAPLQQQVFGLRLQPGLDSVFKHTLLGEILNYYAHWSGSMKLTFVF 1-3333----------2222-------1111---1111---------------------- CGSAMATGKFLIAYSPPGANPPKTRKDAMLGTHIIWDIGLQSSCVLCVPWISQTHYRLVQ --1111-----------------33331111----------------------------- QDEYTSAGYVTCWYQTGMIVPPGTPNSSSIMCFASACNDFSVRMLRDTPFISQDNKLQ -3333-------------------------------1111------------------ >NADP(H) TRANSHYDROGENASE; SWP:P11024; PDB:1D4OA; GTHTEINLDNAIDMIREANSIIITPGYGLCAAKAQYPIADLVKMLSEQGKKVRFGIHPVA --------------------------------------------------------1111 GRMPGQLNVLLAEAGVPYDIVLEMDEINHDFPDTDLVLVIGANDTVNSAAQEDPNSIIAG --2222----------1111--111133331111--------1111-3333-1111-222 MPVLEVWKSKQVIVMKRSLGVGYAAVDNPIFYKPNTAMLLGDAKKTCDALQAKVRES 2---3333-------------1111--3333-1111--------------------- >T CELL SIGNAL TRANSDUCTIO; SWP:O60880; PDB:1D4TA; MDAVAVYHGKISRETGEKLLLATGLDGSYLLRDSESVPGVYCLCVLYHGYIYTYRVSQTE 
1111----------------33332222--------2222------iiii--------11 TGSWSAETAPGVHKRYFRKIKNLISAFQKPDQGIVIPLQYPVEK 11------2222------3333---------------------- >C. ELEGANS ACTIN 1/3; SWP:P10983; PDB:1D4XA; EVAALVVDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGVGQKDSYVGDEAQSKRGILTL -----------------2222--------------------------------3333--- KYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETF ----iiii---------------------1111-----------3333------------ NTPAMYVAIQAVLSLYASGRTTGVVLDSGDGVTHTVPIYEGYALPHAILRLDLAGRDLTD ---------------1111-------------------iiii-3333------------- YLMKILTERGYSFTTTAEREIVRDIKEKLCYVALDFEQEMATAASSSSLEKSYELPDGQV ----3333--------------------------3333--------1111---------- ITVGNERFRCPEAMFQPSFLGMESAGIHETSYNSIMKCDIDIRKDLYANTVLSGGTTMYP ---------3333----1111-------------11113333-----------1111-22 GIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDESGP 22------------3333------1111---------333311113333----------- SIVHRKCF 3333---- >Gelsolin [Precursor]; SWP:P06396; PDB:1D4XG; VEHPEFLKAGKEPGLQIWRVEKFDLVPVPTNLYGDFFTGDAYVILKTVQLRNGNLQYDLH --3333----------------------3333-----------------3333------- YWLGNECSQDESGAAAIFTVQLDDYLNGRAVQHREVQGFESATFLGYFKSGLKYKKGGVA ---1111------------------%%%%------2222-3333---1111--------- SGFK ---- >DNA POLYMERASE; SWP:Q7SIG7; PDB:1D5AA; MILDADYITEDGKPVIRVFKKEKGEFKIDYDRDFEPYIYALLKDDSAIEDIKKITAERHG ---------iiii----------------------------------3333------iii TTVRVTRAERVKKKFLGRPVEVWKLYFTHPQDVPAIRDKIREHPAVVDIYEYDIPFAKRY i---------------------------3333----1111--1111-------------- LIDRGLIPMEGDEELRMLAFDIETLAHAGAAAGAGPILMISYADEEGARVITWKNIDLPY -------------------------------------------1111------------- VESVSTEKEMIKRFLKVIQEKDPDVLITYNGDNFDFAYLKKRSEMLGVKFILGRDGSEPK -----------------------------3333------1111----------------- IQRMGDRFAVEVKGRIHFDLYPVIRRTINLPTYTLETVYEPVFGQPAEKVYAEEIAEAWA --------------------3333---------3333--3333----------------- SGEGLERVARYSMEDAKATYELGKEFFPMEAQLSRLVGQSLWDVSRSSTGNLVEWFLLRK 
-1111-------------------------------------1111-------------- AYERNDVAPNKPDERELARRTESYAGGYVKEPEKGLWENIVYLDYKSLYPSIIITHNVSP --------------3333-----------------------------------1111-11 DTLNREGCREYDVAPQVGHRFCKDFPGFIPSLLGDLLEERQKVKKKMKATVDPIERKLLD 11--2222-------------------------------------1111--3333----- YRQRAIKILANSYYGYYAYANARWYCRECAESVTAWGRQYIETTMREIEEKFGFKVLYAD ------------3333--1111-------------------------------------- TDGFFATIPGADAETVKNKAKEFLNYINPRLPGLLELEYEGFYRRGFFVTKKKYAVIDEE -----------------------33331111--------------------------111 DKITTRGLEIVRRDWSEIAKETQARVLEAILKHGDVEEAVRIVKEVTEKLSRHEVPPEKL 1----------------------------------3333---------1111---3333- VIYEAGPHVAAAATVISYIVLKGPGRVGDRAIPFDEFDPAKHRYDAEYYIENQVLPAVER --------------------------------------------1111-----3333--- ILRAFGYRKEDLR --1111-3333-- >Ig gamma-1 chain C region; SWP:P01857; PDB:1D5BB; QVQLQQSGAELMKPGASVKISCKATGYTFSSFWIEWVKQRPGHGLEWIGEILPGSGGTHY ------------2222-----------1111----------------------------- NEKFKGKATFTADKSSNTAYMQLSSL 3333---------1111--------- >RAB6 GTPASE; SWP:Q26000; PDB:1D5CA; KYKLVFLGEQAVGKTSIITRFYDTFDNNYQSTIGIDFLSKTLYLDEGPVRLQLWDTAGQE --------2222-------------1111-----------------------------33 RFRSLIPSYIRDSAAAIVVYDITNRQSFENTTKWIQDILNERGKDVIIALVGNKTDLGDL 3311113333----------1111-----------------!!!!--------1111111 RKVTYEEGQKAQEYNTFHETSAKAGHNIKVLFKKTASKL 1--3333----1111------1111--3333-------- >HUMAN PHOSPHATASE HPTP1E; SWP:Q12923; PDB:1D5GA; PKPGDIFEVELAKNDNSLGISVTGGVNTSVRHGGIYVKAVIPQGAAESDGRIHKGDRVLA ----------------------------------------------3333--2222---- VNGVSLEGATHKQAVETLRNTGQVVHLLLEKGQSPT ------------------------------------ >Enterotoxin type B [Precu; SWP:P01552; PDB:1D5MC; SQPDPKPDELHKSSKFTGLMENMKVLYDDNHVSAINVKSIDQFLYFDLIYSIKDTKLGNY -----2222--3333-------3333-----------------1111------------- DNVRVEFKNKDLADKYKDKYVDVFGANYYYQCYFSKKKRKTCMYGGVTEHNGNQLDKYRS ---------------1111------------------------------2222------- 
ITVRVFEDGKNLLSFDVQTNKKKVTAQELDYLTRHYLVKNKKLYEFNNSPYETGYIKFIE ------iiii-------------------------------------------------i NENSFWYDMMPAPGDKFDQSKYLMMYNDNKMVDSKDVKIEVYLTTK iii----------------------1111---3333---------- >PHOSPHOINOSITIDE PHOSPHOT; SWP:O00633; PDB:1D5RA; RRYQEDGFDLDLTYIYPNIIAMGFPAERLEGVYRNNIDDVVRFLDSKHKNHYKIYNLCAE ---------------1111----------!!!!--------------------------- RHYDTAKFNCRVAQYPFEDHNPPQLELIKPFCEDLDQWLSEDDNHVAAIHCKAGKGRTGV ----3333---------2222--1111--------------------------------- MICAYLLHRGKFLKAQEALDFYGEVRTRDKKGVTIPSQRRYVYYYSYLLKNHLDYRPVAL -------------3333-------------------------------1111-------- LFHKMMFETIPMFSGGTCNPQFVVCQLKVKIYSSNSGPTRREDKFMYFEFPQPLPVCGDI -------------iiii--------!!!!------------------------------- KVEFFHKQNKMLKKDKMFHFWVNTFFIPKEYLVLTLTKNDLDKANKDKANRYFSPNFKVK ----------------------3333----------3333--33331111---------- LYFTKTV ------- >GUANINE NUCLEOTIDE DISSOC; SWP:P21856; PDB:1D5TA; MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESSSITPLEELYKRFQLLE -----------------------1111------------!!!!----3333--1111333 GPPETMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVVEGSFVYKGGKIYKVP 3-3333-3333----------1111-----------1111-----------iiii----- STETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEGVDPQNTSMRDVYRKFDLGQD -----1111---------------------11111111---1111-------1111---- VIDFTGHALALYRTDDYLDQPCLETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR --------------3333----------------------------2222---------- LSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVARCKQLICDPSYVPDRVRKAGQVIRI --1111--------------iiii-----iiii---------33331111---------- ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISYAHNVAAQGKYIAIASTTVETT ----------%%%%-------3333------------3333---2222------------ DPEKEVEPALGLLEPIDQKFVAISDLYEPIDDGSESQVFCSCSYDATTHFETTCNDIKDI 3333-33333333----------------------------------------------- YKRMAGSAFDF ----------- >S12 TRANSCRIPTION FACTOR ; SWP:Q99958; PDB:1D5VA; MLVKPPYSYIALITMAIQNAPEKKITLNGIYQFIMDRFPFYRENKQGWQNSIRHNLSLNE -------------------------3333--------3333------------------- 
CFVKVPRDDKKPGKGSYWTLDPDSYNMFENGSFL -------3333----------------------- >ROB TRANSCRIPTION FACTOR; SWP:P27292; PDB:1D5YA; QAGIIRDLLIWLEGHLDQPLSLDNVAAKAGYSKWHLQRMFKDVTGHAIGAYIRARRLSKS 1111-----------------3333------3333-----------3333---------- AVALRLTARPILDIALQYRFDSQQTFTRAFKKQFAQTPALYRRSPEWSAFGIRPPLRLGE ------------------------------------------------1111-------- FTMPEHKFVTLEDTPLIGVTQSYSCSLEQISDFRHEMRYQFWHDFLGNAPTIPPVLYGLN -------------------------1111---------------3333------------ ETRPSQDKDDEQEVFYTTALAQDQADGYVLTGHPVMLQGGEYVMFTYEGLGTGVQEFILT --------------------3333----2222-----------------1111------- VYGTCMPMLNLTRRKGQDIERYYPAEDDRPINLRCELLIPIRRKLAAA ------1111-------------------------------------- >HLA class II histocompati; SWP:P13760; PDB:1D5ZB; GDTRPRFLEQVKHECHFFNGTERVRFLDRYFYHQEEYVRFDSDVGEYRAVTELGRPDAEY ----------------------------------------1111------3333------ WNSQKDLLEQKRAAVDTYCRHNYGVGESFTVQRRVYPEVTVYPALLVCSVNGFYPGSIEV ------------3333---------33331111--------------------------- RWFRNGQEEKTGVVSTGLIQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSLTSPLTVEWR ---iiii----------------------------------------3333--------- A - --------------------------------------------------------- >DEFENSIN-LIKE PEPTIDE-2; SWP:P82140; PDB:1D6BA; IMFFEMQACWSHSGVCRDKSERNCKPMAWTYCENRNQKCCEY -------3333------1111--------------------- >CHOLECYSTOKININ TYPE A RE; SWP:P32238; PDB:1D6GA; MDVVDSLLVNGSNITPPCELGLENETLFCLDQPRPSKEWQPAQVILL ----3333-----------3333------------------------ >ADENOSINE-5'PHOSPHOSULFAT; SWP:NA; PDB:1D6JA; HASALTRSERTELRNQRGLTIWLTGLSASGKSTLAVELEHQLVRDRRVHAYRLDGDNIRF -------------------------1111--------------1111------------- GLNKDLGFSEADRNENIRRIAEVAKLFADSNSIAITSFISPYRKDRDTARQLHEVATPGE 1111-----------------------1111----------------------------- ETGLPFVEVYVDVPVEAPYEAPANPEVHVKNYELPVQDAVKQIIDYLDTKGYLPAKK ----------------------------------3333---------1111------ ---------------------------------------------------------- >RIBONUCLEASE P; SWP:P0A0H5; PDB:1D6TA; 
MLLEKAYRIKKNADFQRIYKKGHSVANRQFVVYTCNNKEIDHFRLGISVSKKLGNAVLRN ---3333----3333--------------------------------------------- KIKRAIRENFKVHKSHILAKDIIVIARQPAKDMTTLQIQNSLEHVLKIAKVFNKKIK ------------1111------------3333------------------------- >Ig gamma-1 chain C region; SWP:P01857; PDB:1D6VH; QVQLQQSGAELMKPGASVKISCKATGYTFSSYWIEWVKQRPGHGLEWIGEILPGSGSTNY ------------2222-----------3333--------2222----------------- NEKFKGKATFTADTSSNTAYMQLSSLTSEDSAVYYCARGHSYYFYDGDYWGQGTSVTVSS -------------1111---------3333------------------------------ ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSS ----------------------------------------%%%%--2222-------333 GLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK 3--------------------------1111---------- >Ig kappa chain C region; SWP:P01834; PDB:1D6VL; DIKMTQSPSSMYASLGERVTITCKASQDINSYLSWFQQKPGKSPKTLIYRANRLVDGVPS -------------2222-----------%%%%------2222------------222233 RFSGSGSGQDYSLTISSLEYEDMGIYYCLQYDEFPYTFGSGTKLEIKRTVAAPSVFIFPP 33----------------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 33333333---------------------------------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNR -----------------1111---------- >HUMAN ORNITHINE DECARBOXY; SWP:P11926; PDB:1D7KA; EEFDCHFLDEGFTAKDILDQKINEVSSSDDKDAFYVADLGDILKKHLRWLKALPRVTPFY -------------------1111-----------------------------1111---- AVCNDSKAIVKTLAATGTGFDCASKTEIQLVQSLGVPPERIIYANPCKQVSQIKYAANNG -----3333----------------------1111-3333--------3333-------- VQMMTFDSEVELMKVARAHPKAKLVLRIATDDSKAVCRLSVKFGATLRTSRLLLERAKEL ------------------1111-----------------------------------111 NIDVVGVSFHVGSGCTDPETFVQAISDARCVFDMGAEVGFSMYLLDIGGGFPGSEDVKLK 1----------1111-3333---------------1111--------------------- FEEITGVINPALDKYFPSDSGVRIIAEPGRYYVASAFTLAVNIIAKKIVLKEQTEQTFMY ----------------1111--------33333333------------------------ YVNDGVYGSFNCILYDHAHVKPLLQKRPKPDERYYSSSIWGPTCDGLDRIVERCDLPEMH 
----3333------------------------------------1111-----------2 VGDWMLFENMGAYTVAAASTFNGFQRPTIYYVMSGPAWQLMQQFQNPDFPP 222----------3333--2222---------------------------- >CORTEXILLIN I; SWP:O15813; PDB:1D7MA; EMANRLAGLENSLESEKVSREQLIKQKDQLNSLLASLESEGAEREKRLRELEAKLDETLK ------------------------------------------------------------ NLELEKLARMELEARLAKTEKDRAILELKLAEAIDEKSKLE -------------------------------------3333 >ENOYL-[ACYL-CARRIER PROTE; SWP:P80030; PDB:1D7OA; GLPIDLRGKRAFIAGIADDNGYGWAVAKSLAAAGAEILVGTWVPALNIFETSLRRGKFDQ -----2222--------------------------------3333----------11111 SRVLPDGSLMEIKKVYPLDAVFDNPEDVPEDVKANKRYAGSSNWTVQEAAECVRQDFGSI 1113333----------------3333-3333---3333--------------------- DILVHSLANGPEVSKPLLETSRKGYLAAISASSYSFVSLLSHFLPIMNPGGASISLTYIA ---------1111--3333-----------------------3333-2222------333 SERIIPGYGGGMSSAKAALESDTRVLAFEAGRKQNIRVNTISAGPLGSRAAKAIGFIDTM 3---2222iiii------------------------------------------------ IEYSYNNAPIQKTLTADEVGNAAAFLVSPLASAITGATIYVDNGLNSMGVALDSPVF --------------3333---------3333------------3333---1111--- >Coagulation factor VIII [; SWP:P00451; PDB:1D7PM; LNSCSMPLGMESKAISDAQITASSYFTNMFATWSPSKARLHLQGRSNAWRPQVNNPKEWL ---------3333--3333--------1111--1111-2222------------------ QVDFQKTMKVTGVTTQGVKSLLTSMYVKEFLISSSQDGHQWTLFFQNGKVKVFQGNQDSF -------------------!!!!----------------------iiii----------- TPVVNCLDPPLLTRYLRIHPQSWVHQIALRMEVLGCEAQ --------------------------------------- >N-TERMINAL HISTIDINE TAG; SWP:P47813; PDB:1D7QA; PKNKGKGGKNRRRGKNENESEKRELVFKEDGQEYAQVIKMLGNGRLEAMCFDGVKRLCHI ----------------------------2222-----------------1111------- RGKLRKKVWINTSDIILVGLRDYQDNKADVILKYNADEARSLKAYGELPEHAKINETDTF -3333-----------------------------3333----------1111-------- GPGDDDEIQFDDIGDDDEDIDDI ----------------------- >FERREDOXIN REDUCTASE; SWP:Q52437; PDB:1D7YA; ALKAPVVVLGAGLASVSFVAELRQAGYQGLITVVGDEAERPYDRPPLSKDFMAHGDAEKI -------------------------------------------3333-3333---3333- 
RLDCKRAPEVEWLLGVTAQSFDPQAHTVALSDGRTLPYGTLVLATGAAPRALPTLQGATM ---1111----------------------1111------------------3333----- PVHTLRTLEDARRIQAGLRPQSRLLIVGGGVIGLELAATARTAGVHVSLVETQPRLMSRA -------------3333-2222-------------------------------------- APATLADFVARYHAAQGVDLRFERSVTGSVDGVVLLDDGTRIAADMVVVGIGVLANDALA -------------1111------------%%%%--------------------------- RAAGLACDDGIFVDAYGRTTCPDVYALGDVTRQRNPLSGRFERIETWSNAQNQGIAVARH 1111---------1111---2222---1111----------------------------- LVDPTAPGYAELPWYWSDQGALRIQVAGLASGDEEIVRGEVSLDAPKFTLIELQKGRIVG --3333------------!!!!-------------------------------iiii--- ATCVNNARDFAPLRRLLAVGAKPDRAALADPATDLRKLAAA -----3333-------1111--------------------- >SGS1 RECQ HELICASE; SWP:P35187; PDB:1D8BA; ELNNLRMTYERLRELSLNLGNRMVPPVGNFMPDSILKKMAAILPMNDSAFATLGTVEDKY ------------------1111-----------------------3333----------- RRRFKYFKATIADLSKKRSSE --------------------- >MALATE SYNTHASE G; SWP:P37330; PDB:1D8CA; QTITQSRLRIDANFKRFVDEEVLPGTGLDAAAFWRNFDEIVHDLAPENRQLLAERDRIQA ----!!!!--------------3333---------------------------------- ALDEWHRSNPGPVKDKAAYKSFLRELGYLVPQPERVTVETTGIDSEITSQAGPQLVVPAN -------------------------------------------3333------------- ARYALNAANARWGSLYDALYGSDIIPQEGAVSGYDPQRGEQVIAWVRRFLDESLPLENGS --------3333-----------------------------------------------3 YQDVVAFKVVDKQLRIQLKNGKETTLRTPAQFVGYRGDAAAPTCILLKNNGLHIELQIDA 333------%%%%----3333----------------1111-------iiii------11 NGRIGKDDPAHINDVIVEAAISTILDCEDSVAAVDAEDKILLYRNLLGLQGTLQRKLNDD 11-33331111----------------1111----------------------------- RHYTAADGSEISLHGRSLLFIRNVGHLTIPVIWDSEGNEIPEGILDGVTGAIALYDLKVQ ----1111-------------------------1111----------------------- KNSRTGSVYIVKPKHGPQEVAFANKLFTRIETLGAPNTLKGIDEERRTSLNLRSCIAQAR ----------------------------------2222------3333----------11 NRVAFINTGFLDRTGDEHSVEAGPLRKNQKSTPWIKAYERNNVLSGLFCGLRGKAQIGKG 11-----------------------3333-----------------11112222------ WAPDLADYSQKGDQLRAGANTAWVPSPTAATLHALHYHQTNVQSVQANIAQTEFNAEFEP 
--------------1111---------------3333-----------1111-3333--- LLDDLLTIPVAENANWSAQEIQQELDNNVQGILGYVVRWVEQGIGCSKVPDIHNVALEDR --------------------------------------------------1111----33 ATLRISSQHIANWLRHGILTKEQVQASLENAKVVDQQNAGDPAYRPAGNFANSCAFKAAS 33-----------------------------------1111-------3333-------- DLIFLGVKQPNGYTEPLLHAWRLREKES ----33332222-3333----------- >MRNA TRIPHOSPHATASE CET1; SWP:O13297; PDB:1D8HA; HMYRNVPIWAQKWKPTIKALQSINVKDLKIDPSFLNIIPDDDLTKSVQDWVYATIYSIAP ------3333----------------------1111------------------333311 ELRSFIELEMKFGVIIDAKGPDRVNPPVSSQCVFTELDAHLTPNIDASLFKELSKYIRGI 111111-----------------------------2222--------------------- SEVTENTGKFSIIESQTRDSVYRVGPRFLRMSTDIKTGRVGQFIEKRHVAQLLLYSPKDS --3333-------------------------------------------------1111- YDVKISLNLELPVPDNDPPEKYKSQSPISERTKDRVSYIHNDSCTRIDITKVENHSETTH ----------------3333--------------------1111---------------- EVELEINTPALLNAFDNITNDSKEYASLIRTFLNNGTIIRRKLSSLSY -----------------3333-----------------------1111 >GENERAL TRANSCRIPTION FAC; SWP:P29084; PDB:1D8JA; ALSGSSGYKFGVLAKIVNYMKTRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTEALV --------------------------------------1111------------------ NNPKIEVIDGKYAFKPKYNVR -3333---------------- >L-RHAMNOSE ISOMERASE; SWP:P32170; PDB:1D8WA; TQLEQAWELAKQRFAAVGIDVEEALRQLDRLPVSHCWQGDDVSGFENYPGKARNASELRA --------------1111---------1111---3333-%%%%----------------- DLEQARLIPGPKRLNLHAIYLESDTPVSRDQIKPEHFKNWVEWAKANQLGLDFNPSCFSH ---------------------------1111-3333--------1111-----------1 PLSADGFTLSHADDSIRQFWIDHCKASRRVSAYFGEQLGTPSVNIWIPDGKDITVDRLAP 111----1111------------------------------------------------- RQRLLAALDEVISEKLNPAHHIDAVESKLFGIGAESYTVGSNEFYGYATSRQTALCLDAG ----------------1111----------2222------3333-------------111 HFHPTEVISDKISAALYVPQLLLHVSRPVRWDSDHVVLLDDETQAIASEIVRHDLFDRVH 1-22223333-3333---------------------------------------1111-- IGLDFFDASINRIAAWVIGTRNKKALLRALLEPTAELRKLEAPGDYTARLALLEEQKSLP ----------------------------1111--------1111---------3333--- 
WQAVWEYCQRHDTPAGSEWLESVRAYEKEILSRR --------1111---------------------- >INTERFERON-GAMMA; SWP:P07353; PDB:1D9CA; QGQFFREIENLKEYFNASSPDVAKGGPLFSEILKNWKDESDKKIIQSQIVSFYFKLFENL -----------------------------------1111--------------------1 KDNQVIQRSMDIIKQDMFQKFLNGSSEKLEDFKKLIQIPVDDLQIQRKAINELIKVMNDL 1111111--------------%%%%---------1111---------------------- S - >3-DEOXY-D-MANNO-OCTULOSON; SWP:P17579; PDB:1D9EA; MKQKVVSIGDINVANDLPFVLFGGMNVLESRDLAMRICEHYVTVTQKLGIPYVFKASFDK -------!!!!--1111------------------------------------------- ANRSSIHSYRGPGLEEGMKIFQELKQTFGVKIITDVHEPSQAQPVADVVDVIQLPAFLAR ----1111-----------------------------1111-------------3333-- QTDLVEAMAKTGAVINVKKPQFVSPGQMGNIVDKFKEGGNEKVILCDRGANFGYDNLVVD --------3333-------11113333--------1111--------------------1 MLGFSIMKKVSGNSPVIFDVTHALQRAQVAELARAGMAVGLAGLFIEAHPDPEPSALPLA 111-------%%%%-----1111-------------3333-----------------333 KLEPFLKQMKAIDDLVKGFEELDTSK 3--------------1111------- --------- >H-2 class II histocompati; SWP:P06343; PDB:1D9KD; GSERHFVHQFQPFCYFTNGTQRIRLVIRYIYNREEYVRFDSDVGEYRAVTELGRPDAEYW -----------------!!!!---------!!!!-----1111------3333------- NKQYLERTRAELDTVCRHNYEKTETPTSLRRLEQPSVVISLSRTEALNHHNTLVCSVTDF ----------------------3333-3333----------------------------- YPAKIKVRWFRNGQEETVGVSSTQLIRNGDWTFQVLVMLEMTPRRGEVYTCHVEHPSLKS ------------------------------------------------------1111-- PITVEWRA -------- >METHYL-CPG-BINDING PROTEI; SWP:Q9UIS9; PDB:1D9NA; MAEDWLDCPALGPGWKRREVFRKSGATCGRSDTYYQSPTGDRIRSKVELTRYLGPACDLT ---------------------------------------------3333---------11 LFDFKQGILCYPAPK 11------------- >CYCLIN-DEPENDENT KINASE 4; SWP:CDN5_MOUSE; PDB:1D9SA; HMLGGSSDAGLATAAARGQVETVRQLLEAGADPNALNRFGRRPIQVMMMGSAQVAELLLL --------3333--1111------------------1111-------------------- HGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRAGARLDVCDAWGRLPVDLAEEQGHRD --------------3333------------------------------------------ IARYLHAATGD ----------- >P.69 PERTACTIN; SWP:P14283; PDB:1DABA; 
DWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTIKVSGRQAQGILLENPAAELQFRNGS -------------------1111-------------------------1111-------- VTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIAD --------------------------------------------------1111------ STLQGAGGVQIERGANVTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPAS ----1111------------------------------------------------3333 GAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDALAGGAVPGGAV ------------------------------------------------------------ PGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVPGG ------------------------------------------------------------ SLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQ ---1111-----------3333-------%%%%--------------------%%%%--- GDIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLA ------------------------------------------------------------ SDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRN -----------2222-------------------3333---------------------- SGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPP ----------------3333-------2222---!!!!--------------------- >ENDOGLUCANASE SS; SWP:P38686; PDB:1DAQA; MSTKLYGDVNDDGKVNSTDAVALKRYVLRSGISINTDNADLNEDGRVNSTDLGILKRYIL ---------------3333----------------------------------------- KEIDTLPYKNG ----------- >ELONGATION FACTOR G; SWP:Q5SHN5; PDB:1DAR; MAVKVEYDLKRLRNIGIAAHIDAGKTTTTERILYYTGRIAAVTTCFWKDHRINIIDTPGH -------3333--------2222-----------------------%%%%---------- VDFTIEVERSMRVLDGAIVVFDSSQGVEPQSETVWRQAEKYKVPRIAFANKMDKTGADLW ---------------------1111-------------------------1111------ LVIRTMQERLGARPVVMQLPIGREDTFSGIIDVLRMKAYTYGNDLGTDIREIPIPEEYLD ----------------------!!!!----------------------------1111-- QAREYHEKLVEVAADFDENIMLKYLEGEEPTEEELVAAIRKGTIDLKITPVFLGSALKNK ----------------3333-----------------------------------1111- GVQLLLDAVVDYLPSPLDIPPIKGTTPEGEVVEIHPDPNGPLAALAFKIMADPYVGRLTF --------------3333------------------1111-------------------- IRVYSGTLTSGSYVYNTTKGRKERVARLLRMHANHREEVEELKAGDLGAVVGLKETITGD 
------------------------------------------2222----------2222 TLVGEDAPRVILESEEDPTFRVSTHQTIISGMGELLKREFKVDANVGKPQVAYRETITKP ------------------------------------------------------------ VDVEGKFIRQTGGRGQYGHVKIKVEPLPRGSGFEFVNAIVGGVIPKEYIPAVQKGIEEAM ---------------------------2222-------------1111------------ QSGPLIGFPVVDIKVTLYDGSYHEVDSSEMAFKIAGSMAIKEAVQKGDPVILEPIMRVEV ------------------------------------------------------------ TTPEEYMGDVIGDLNARRGQILGMEPRGNAQVIRAFVPLAEMFGYATDLRSKTQGRGSFV --3333--------------------!!!!-------33332222--------------- MFFDHYQEVPKQVQEKLIK ------------------- >GDP-MANNOSE 4,6-DEHYDRATA; SWP:P0AC88; PDB:1DB3A; SKVALITGVTGQDGSYLAEFLLEKGYEVHGIKRPKFHLHYGDLSDTSNLTRILREVQPDE -------1111------------------------------------------------- VYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAIRFLGLEKKTRFYQASTSELYGLVQE -------33331111-------------------------------------3333---- IPQKETTPFYPRSPYAVAKLYAYWITVNYRESYGMYACNGILFNHESPRRGETFVTRKIT ---1111---------------------------------------11113333------ RAIANIAQGLESCLYLGNMDSLRDWGHAKDYVKMQWMMLQQEQPEDFVIATGVQYSVRQF -----1111-----------------3333------1111-------------------- VEMAAAQLGIKLRFEGTGVEEKGIVVSVTGHDAPGVKPGDVIIAVDPRYFRPAEETLLGD ----------------!!!!------------11112222-----3333----------- PTKAHEKLGWKPEITLREMVSEMVANDLEAAKKHS -------------------------------1111 >IGG1-KAPPA DB3 FAB (HEAVY; SWP:GC1_MOUSE; PDB:1DBBH; QIQLVQSGPELKKPGETVKISCKASGYAFTNYGVNWVKEAPGKELKWMGWINIYTGEPTY ---------------------------3333----------------------------- VDDFKGRFAFSLETSASTAYLEINNLKNEDTATYFCTRGDYVNWYFDVWGAGTTVTVSSA -1111-------1111----------1111------------------------------ KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ----------------------------------------iiii---------------- YTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR --------------------------------------- >Putative uncharacterized ; SWP:A0A5D7; PDB:1DBBL; DVVMTQIPLSLPVNLGDQASISCRSSQSLIHSNGNTYLHWYLQKPGQSPKLLMYKVSNRF ------------------------------1111-------------------------- 
YGVPDRFSGSGSGTDFTLKISRVEAEDLGIYFCSQSSHVPPTFGGGTKLEIKRADAAPTV ---3333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ---------1111--------------------------------------3333----- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR ------1111-------------------------- >CHORISMATE MUTASE; SWP:P19080; PDB:1DBFA; MMIRGIRGATTVERDTEEEILQKTKQLLEKIIEENHTKPEDVVQMLLSATPDLHAVFPAK -------------------------------------3333---------------3333 AVRELSGWQYVPVTCMQEMDVTGGLKKCIRVMMTVQTDVPQDQIRHVYLEKAVVLRPDLS ----2222------------2222---------------1111-------3333-----3 LTKNTEL 333---- >HUMAN SOS 1; SWP:Q07889; PDB:1DBHA; EQTYYDLVKAFAEIRQYIRELNLIIKVFREPFVSNSKLFSANDVENIFSRIVDIHELSVK ------------------------------33333333---------------------- LLGHIEDTVETDEGSPHPLVGSCFEDLAEELAFDPYESYARDILRPGFHDRFLSQLSKPG -----------3333----3333-------1111----------1111---------222 AALYLQSIGEGFKEAVQYVLPRLLLAPVYHCLHYFELLKQLEEKSEDQEDKECLKQAITA 2-------2222------3333-------------------------------------- LLNVQSGEKICSKSLAKRRLSESAAIKKNEIQKNIDGWEGKDIGQCCNEFIEGTLTRVGA ---------1111-------1111-------1111------3333--------------- KHERHIFLFDGLICCKSNHGQPRLPGASNAEYRLKEKFFRKVQINDKDDTNEYKHAFEII --------1111-----2222--2222--------------------------------- LKDENSVIFSAKSAEEKNNWAALISLQYRSTL 2222------------------------3333 >AK.1 SERINE PROTEASE; SWP:Q45670; PDB:1DBIA; WTPNDTYYQGYQYGPQNTYTDYAWDVTKGSSGQEIAVIDTGVDYTHPDLDGKVIKGYDFV ----1111----3333-----3333----1111---------1111-------------- DNDYDPMDLNNHGTHVAGIAAAETNNATGIAGMAPNTRILAVRALDRNGSGTLSDIADAI ---------------------------------1111--------1111----------- IYAADSGAEVINLSLGCDCHTTTLENAVNYAWNKGSVVVAAAGNNSYENVIAVGAVDQYD ---1111------------------------1111-----------1111------1111 RLASFSNYGTWVDVVAPGVDIVSTITGNRYAYMSGTSMASPHVAGLAALLASQGRNNIEI --1111--3333----------------------1111-----------3333------- RQAIEQTADKISGTGTYFKYGRINSYNAVTY ----1111----2222-----------1111 >LEUKOAGGLUTININ; SWP:P93248; PDB:1DBNA; 
SDELSFTINNFVPNEADLLFQGEASVSSTGVLQLTKVENGQPQKYSVGRALYAAPVRIWG -----------2222-----------1111-------iiii------------------- NTTGSVASFSTSFTFVVKAPNPDITSDGLAFYLAPPDSQIPSGSVSKYLGLFNNSNSDSS --------------------3333----------1111------1111---------111 NQIVAVEFDTYFAHSYDPWDPNYRHIGIDVNGIESIKTVQWDWINGGVAFATITYLAPNK 1-----------33331111-----------------------2222--------3333- TLIASLVYPSNQTTFSVAASVDLKEILPEWVRVGFSAATGYPTEVETHDVLSWSFTSTL -------3333----------3333---------------1111--------------- >PURINE REPRESSOR; SWP:P15039; PDB:1DBQA; KSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGL --------11113333----------1111----------------------1111---- LVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHR -------3333------1111--------------------------------------- EIGVIPGPAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFC -------------------1111---3333------------------------------ GGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNML -3333--------1111--------------1111------------------------- LDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR ----------------------------1111---- >OROTIDINE 5'-PHOSPHATE DE; SWP:P25971; PDB:1DBTA; MKNNLPIIALDFASAEETLAFLAPFQQEPLFVKVGMELFYQEGPSIVKQLKERNCELFLD ---------------------3333-------------------------1111------ LKLHDIPTTVNKAMKRLASLGVDLVNVHAAGGKKMMQAALEGLEEGTPAGKKRPSLIAVT ----------------1111-------1111----------------------------- QLTSTSEQIMKDELLIEKSLIDTVVHYSKQAEESGLDGVVCSVHEAKAIYQAVSPSFLTV -1111--------------------------1111------1111-3333---1111--- TPGIRMSEDAANDQVRVATPAIAREKGSSAIVVGRSITKAEDPVKAYKAVRLEWEGI -----1111-!!!!---------1111------3333-------------------- >TRANSCRIPTIONAL REGULATOR; SWP:P10958; PDB:1DBWA; MQDYTVHIVDDEEPVRKSLAFMLTMNGFAVKMHQSAEAFLAFAPDVRNGVLVTDLRMPDM -----------------------1111--------------3333--------------- SGVELLRNLGDLKINIPSIVITGHGDVPMAVEAMKAGAVDFIEKPFEDTVIIEAIERASE ----------------------2222--------------------3333--------11 HLV 11- >CYSTEINYL-TRNA(PRO) DEACY; SWP:P45202; PDB:1DBXA; 
TPAIDLLKKQKIPFILHTYDHDPGDEAAEKLGIDPNRSFKTLLVAENGDQKKLACFVLAT ---------------------------------1111-------22221111------11 ANLNLKKAAKSIGVKKVEADKDAAQKSTGYLVGGISPLGQKKRVKTVINSTALEFETIYV 11-------1111-----------------2222--------------3333-------- SGGKRGLSVEIAPQDLAKVLGAEFTDIVDE ---2222----------------------- >CHLOROPLAST THIOREDOXIN M; SWP:P23400; PDB:1DBYA; MEAGAVNDDTFKNVVLESSVPVLVDFWAPWCGPCRIIAPVVDEIAGEYKDKLKCVKLNTD ------3333----1111---------1111--------------------------333 ESPNVASEYGIRSIPTIMVFKGGKKCETIIGAVPKATIVQTVEKYLN 3--------------------------------3333---------- >BSOBI RESTRICTION ENDONUC; SWP:P70985; PDB:1DC1A; KPFENHLKSVDDLKTTYEEYRAGFIAFALEKNKRSTPYIERARALKVAASVAKTPKDLLY 3333----1111------------------------------------3333-3333--- LEDIQDALLYASGISDKAKKFLTEDDKKESINNLIENFLEPAGEEFIDELIFRYLLFQGD 3333----------33331111-------------------!!!!--------------- SLGGTMRNIAGALAQQKLTRAIISALDIANIPYKWLDSRDKKYTNWMDKPEDDYELETFA --------------------------1111------3333------------2222---- KGISWTINGKHRTLMYNITVSLVKKNVDICLFNCEPQQPEKYLLLGELKGGIDPAGADEH ------iiii----------1111-------------1111-----------3333---- WKTANTALTRIRNKFSEKGLSPKTIFIGAAIEHSMAEEIWDQLQSGSLTNSANLTKTEQV ---------------1111---------------------------------1111---- GSLCRWIINI ------1111 >NITROGEN REGULATION PROTE; SWP:P41789; PDB:1DC7A; MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGNEVLAALASKTPDVLLSDIRMPGM --------------3333----------------3333---------------------- DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS --3333--------------------1111----------------3333---------3 HYQE 333- >RAB GERANYLGERANYLTRANSFE; SWP:Q08602; PDB:1DCEA; HGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKRQAGELDESVLELTSQILGANPD -------------------3333----------------------------------111 FATLWNCRREVLQHLETEKSPEESAALVKAELGFLESCLRVNPKSYGTWHHRCWLLSRLP 1------------3333--------------------------------------1111- EPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAPAEELAFTDSLITRNFSNYSSW -----------------1111-----------1111----------1111---------- 
HYRSCLLPQLHPQPDSGPQGRLPENVLLKELELVQNAFFTDPNDQSAWFYHRWLLGRAEP ------3333------------------------------------------3333---- HDVLCCVHVSREEACLSVCFSRPLTVGSRMGTLLLMVDEAPLSVEWRTPDGRNRPSHVWL ----------1111----------------------%%%%-------1111--------- CDLPAASLNDQLPQHTFRVIWTGSDSQKECVLLKDRPECWCRDSATDEQLFRCELSVEKS ---3333-------------------------2222------------------------ TVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKETLQYFSTLKAVDPMRAAY ----------------1111----------------1111-------------3333--- LDDLRSKFLLENSVLKMEYADVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRALPPA ------------------------------------11111111---------------- LAALRCLEVLQASDNALENVDGVANLPRLQELLLCNNRLQQSAAIQPLVSCPRLVLLNLQ ---1111------------1111--1111-------------111133331111----22 GNSLCQEEGIQERLAEMLPSVSSILT 223333-----------1111----- >Geranylgeranyl transferas; SWP:Q08603; PDB:1DCEB; TQQKDVTIKSDAPDTLLLEKHADYIASYGSKKDDYEYCMSEYLRMSGVYWGLTVMDLMGQ --3333--------------------1111-----33333333------------1111- LHRMNKEEILVFIKSCQHECGGVSASIGHDPHLLYTLSAVQILTLYDSIHVINVDKVVAY -------------11111111----2222--------------11111111--------- VQSLQKEDGSFAGDIWGEIDTRFSFCAVATLALLGKLDAINVEKAIEFVLSCMNFDGGFG -11111111----1111--------------11113333----------11111111--- CRPGSESHAGQIYCCTGFLAITSQLHQVNSDLLGWWLCERQLPSGGLNGRPEKLPDVCYS -2222--------------11113333-3333-----33333333----2222------- WWVLASLKIIGRLHWIDREKLRSFILACQDEETGGFADRPGDMVDPFHTLFGIAGLSLLG -----------1111----------1111---------2222--------------1111 EEQIKPVSPVFCMPEEVLQRVNVQPELVS 3333---------3333------------ >ETR1 PROTEIN; SWP:P49333; PDB:1DCFA; HMSNFTGLKVLVMDENGVSRMVTKGLLVHLGCEVTTVSSNEECLRVVSHEHKVVFMDVCM ----2222---------------------------------------3333--------- PGVENYQIALRIHEKFTQRHQRPLLVALSGNTDKSTKEKCMSFGLDGVLLKPVSLDNIRD --1111------------------------------------------------------ VLSDLLEPRVLYE ------------- >DIENOYL-COA ISOMERASE; SWP:Q62651; PDB:1DCIA; AYESIQVTSAQKHVLHVQLNRPEKRNAMNRAFWRELVECFQKISKDSDCRAVVVSGAGKM ----------2222------3333----3333-------------1111----------- 
FTSGIDLMDMASDILQPPGDDVARIAWYLRDLISRYQKTFTVIEKCPKPVIAAIHGGCIG -----------------------------------3333---1111-------------- GGVDLISACDIRYCTQDAFFQVKEVDVGLAADVGTLQRLPKVIGNRSLVNELTFTARKMM -----1111-----1111----3333-------33333333------------------- ADEALDSGLVSRVFPDKDVMLNAAFALAADISSKSPVAVQGSKINLIYSRDHSVDESLDY ------------------------------33333333---------3333--------- MATWNMSMLQTQDIIKSVQAAMEKKDSKSITFSKL ------1111----------1111-3333------ >YHHP PROTEIN; SWP:P37618; PDB:1DCJA; MTDLFSSPDHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFME -------------2222----------------2222-------1111--------1111 HELVAKETDGLPYRYLIRKGG --------------------- >DCOH; SWP:P80095; PDB:1DCOA; HRLSAEERDQLLPNLRAVGWNELEGRDAIFKQFHFKDFNRAFGFMTRVALQAEKLDHHPE ---3333--------1111----------------------------------------- WFNVYNKVHITLSTHECAGLSERDINLASFIEQVAVSMT ---!!!!-------------------------------- >PYK2-ASSOCIATED PROTEIN B; SWP:Q7SIG6; PDB:1DCQA; LTKEIISEVQRMTGNDVCCDCGAPDPTWLSTNLGILTCIECSGIHRELGVHYSRMQSLTL -----------2222----------------------3333---33331111----1111 DVLGTSELLLAKNIGNAGFNEIMECCLPSEDPVKPNPGSDMIARKDYITAKYMERRYARK -----------------------------------1111---------------1111-- KHADTAAKLHSLCEAVKTRDIFGLLQAYADGVDLTEKIPLANGHEPDETALHLAVRSVDR --------------------------------1111----------------------11 TSLHIVDFLVQNSGNLDKQTGKGSTALHYCCLTDNAECLKLLLRGKASIEIANESGETPL 11-----------------1111-----------------------------1111---- DIAKRLKHEHCEELLTQALSGRFNSHVHVEYEWRLL ------------------------------------ >DEACETOXYCEPHALOSPORIN C ; SWP:P18548; PDB:1DCS; MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSE ------------1111-------------------------------------------- AEKRAVTSPVPTMRRGFTGLSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREV ------------------------------------------------------------ LRATGTEPDGGVEAFLDCEPLLRFRYFPQLRMAPHYDLSMVTLIQQTPCANGFVSLQAEV -1111--2222-------------------------------------1111-------i GGAFTDLPYRPDAVLVFCGAIATLVTGGQVKAPRHHVAAPIAGSSRTSSVFFLRPNADFT 
iii------1111------------iiii-----------2222-----------1111- FSVPLARECGFDVSLDGETATFQDWIGGNYVNIRRTSKA ------1111----------3333--------------- >DNA (CYTOSINE-5) METHYLAS; SWP:P20589; PDB:1DCTA; MNLISLFSGAGGLDLGFQKAGFRIICANEYDKSIWKTYESNHSAKLIKGDISKISSDEFP ------------------------------1111----1111-------3333-3333-- KCDGIIGGPPCQSWSEGGSLRGIDDPRGKLFYEYIRILKQKKPIFFLAENVKGMMAQRHN ----------3333----------3333----------------------3333-3333- KAVQEFIQEFDNAGYDVHIILLNANDYGVAQDRKRVFYIGFRKELNINYLPPIPHLIKPT ----------------------3333---------------3333--------------3 FKDVIWDLKDNPIPALDKNKTNGNKCIYPNHEYFIGSYSTIFMSRNRVRQWNEPAFTVQA 333-1111--------%%%%--3333-2222-------3333-------1111------- SGRQCQLHPQAPVMLKVSKNLNKFVEGKEHLYRRLTVRECARVQGFPDDFIFHYESLNDG 3333---3333-------------22221111--------------1111-----3333- YKMIGNAVPVNLAYEIAKTIKSAL ------------------------ >FRUCTOSE-1,6-BISPHOSPHATA; SWP:P46275; PDB:1DCUA; KRSGYEIITLTSWLLQQEQKGIIDAELTIVLSSISMACKQIASLVQRANISNLTGTEDQK -----------------1111-------------------------33331111--3333 KLDVISNEVFSNCLRSSGRTGIIASEEEDVPVAVEESYSGNYIVVFDPLDGSSNLDAAVS -------------1111-------------------3333------------3333---- TGSIFGIYSPNDECLPNTLGTEEQRCIVNVCQPGSNLLAAGYCMYSSSVIFVLTIGKGVF -------------------------------3333------------------------- VFTLDPLYGEFVLTQENLQIPKSGKIYSFNEGNYKLWDENLKKYIDDLKEPGPSGKPYSA -----------------------------3333-------------3333-3333----- RYIGSLVGDFHRTLLYGGIYGYPRDKKSKNGKLRLLYECAPMSFIVEQAGGKGSDGHQRV ----------------------------------------------1111--------11 LDIQPTEIHQRVPLYIGSTEEVEKVEKYLA 11-----------------------1111- >50S RIBOSOMAL PROTEIN L7/; SWP:P29396; PDB:1DD3A; MTIDEIIEAIEKLTVSELAELVKKLEDKFGVTAAAPVAVAAAPVAGAAAGAAQEEKTEFD ---------1111------------------1111------------------------- VVLKSFGQNKIQVIKVVREITGLGLKEAKDLVEKAGSPDAVIKSGVSKEEAEEIKKKLEE -----!!!!---------------------------1111------------------11 AGAEVELK 11------ >RIBOSOME RECYCLING FACTOR; SWP:Q9X1B9; PDB:1DD5A; VNPFIKEAKEKMKRTLEKIEDELRKMRTGKPSPAILEEIKVDYYGVPTPVNQLATISISE 
----------------------1111-------1111-----iiii--3333-------1 ERTLVIKPWDKSVLSLIEKAINASDLGLNPINDGNVIRLVFPSPTTEQREKWVKKAKEIV 111--------------------------------------------------------- EEGKIAIRNIRREILKKIKEDQKEGLIPEDDAKRLENEIQKLTDEFIEKLDEVFEIKKEE ------------------------------------------------------------ IMEF ---- >INDUCIBLE NITRIC OXIDE SY; SWP:P29477; PDB:1DD7A; MNPKSLTRGPRDKPTPLEELLPHAIEFINQYYGSFKEAKIEEHLARLEAVTKEIETTGTY ---------------1111----------------------------------------- QLTLDELIFATKMAWRNAPRCIGRIQWSNLQVFDARNCSTAQEMFQHICRHILYATNNGN --3333-------------------1111-----1111-----------------%%%%- IRSAITVFPQRSDGKHDFRLWNSQLIRYAGYQTIRGDAATLEFTQLCIDLGWKPRYGRFD ------------------------------------3333-------1111--------- VLPLVLQADGQDPEVFEIPPDLVLEVTMELGLKWYALPAVANMLLEVGGLEFPACPFNGW -------iiii-------3333------------------------iiii---------- YMNVAVLHSFQKQNVTIMDHHTASESFMKHMQNEYVLSPFYYYQIEPWKTHIWQN ----------1111----------------1111-----------3333------ >DNA PRIMASE; SWP:P02923; PDB:1DD9A; TLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSHEVIARFAIGFAPPGWDNVLKRFGG ------------------3333-------1111--------------------------- NPENRQSLIDAGMLVTNRSYDRFRERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETD --------1111--------------------1111----------------------11 IFHKGRQLYGLYEAQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGSTTADHIQLLF 111111-2222-------------------------1111-------------------- RATNNVICCYDGDRAGRDAAWRALETALPYMTDGRQLRFMFLPDGEDPDTLVRKEGKEAF --------------------------3333-2222-------2222-------------- EARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLISQVPGETLRIYLRQELGNK ---1111----------3333-1111-------------3333----------------- LGILDDSQLE ------3333 >BID; SWP:P70444; PDB:1DDBA; MDSEVSNGSGLGAKHITDLLVFGFLQSSGCTRQELEVLGRELPVQAYWEADLEDELQTDG ------------------------------3333-------------------------- SQASRSFNQGRIEPDSESQEEIIHNIARHLAQIGDEMDHNIQPTLVRQLAAQFMNGSLSE ---------------------3333----------3333------33333333------- EDKRNCLAKALDEVKTAFPRDMENDKAMLIMTMLLAKKVASHAPSLLRDVFHTTVNFINQ 
---3333-----------------------------------1111-------------- NLFSYVRNLVRNEMD ---3333-------- >FAS; SWP:P25445; PDB:1DDF; METVAINLSDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQ --------------33333333---------3333---3333-------------3333- LLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSLVL ------------3333------------33333333------------------------ EHHHHHH ------- >SULFITE REDUCTASE (NADPH); SWP:P38038; PDB:1DDGA; IHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPA ------1111---------------------------!!!!----2222----------- LVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLV ------1111------------------------------------------11113333 GDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTV -------------------------------3333------------3333--------- GVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIA ------iiii----3333------------------1111----1111------!!!!-- PFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLTRIDLAWSRDQKEK -------------------------3333-2222------------------1111---- VYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFL -3333------------1111--------------------------------------- SELRVERRYQRDVY ---1111------- >PLASMINOGEN; SWP:P00747; PDB:1DDJA; SFDCGKPQVEPKKCPGRVVGGCVAHPHSWPWQVSLRTRFGMHFCGGTLISPEWVLTAAHC --------------3333------22221111----1111---------1111---3333 LEKSPRPSSYKVILGAHQEVNLEPHVQEIEVSRLFLEPTRKDIALLKLSSPAVITDKVIP 3333-3333-------------1111----------1111--------------1111-- ACLPSPNYVVADRTECFITGWGETQGTFGAGLLKEAQLPVIENKVCNRYEFLNGRVQSTE ----2222--2222--------------2222---------3333--1111%%%%-1111 LCAGHLAGGTDSCQGDAGGPLVCFEKDKYILQGVTSWGLGCARPNKPGVYVRVSRFVTWI ---------------2222----------------3333------------3333----- EGVMRNN ------- >RNA (5'-R(P*UP*UP*UP*UP*U; SWP:O89511; PDB:1DDLA; MEQDKILAHQASLNTKPSLLPPPVGNPPPVISYPFQITLASLGTEDAADSVSIASNSVLA ---------------------------------------------------3333-3333 TYTALYRHAQLKHLKATIHPTYMAPKYPTSVALVWVPANSTATSTQVLDTYGGLHFCIGG 
--1111--------------1111------------1111--11111111---------- SVNSVKPIDVEANLTNLNPIIKASTTFTDTPKLLYYSKAQATAPTSPTCYLTIQGQIELS ------------------------------------------------------------ SPLLQASS -------- >GLGF-DOMAIN PROTEIN HOMER; SWP:Q9Z214; PDB:1DDWA; GEQPIFSTRAHVFQIDPNTKKNWVPTSKHAVTVSYFYDSTRNVYRIISLDGSKAIINSTI -------------------------------------------------!!!!------- TPNTFTKTSQKFGQWADSRANTVYGLGFSSEHHLSKFAEKFQEFKEAAR -----------------1111---------------------------- >CARBONIC ANHYDRASE; SWP:Q43060; PDB:1DDZA; VMSDLEKKFIELEAKLVAQPAGQAMPGKSNIFANNEAWRQEMLKQDPEFFNRLANGQSPE ---------------11112222----------------------1111--3333----- YLWIGCADSRVPANQLLDLPAGEVFVHRNIANQCIHSDISFLSVLQYAVQYLKVKHILVC --------------1111-2222-----------11113333------------------ GHYGCGGAKAALGDSRLGLIDNWLRHIRDVRRMNAKYLDKCKDGDEELNRLIELNVLEQV -2222------------3333---------------3333-------------------- HNVCATSIVQDAWDAGQELTVQGVVYGVGDGKLRDLGVVVNSSDDISKFYRTKSDSGALK --------------------------3333---------------3333--3333--333 AGNPNAPLVQVTKGGESELDSTMEKLTAELVQQTPGKLKEGANRVFVNNENWRQKMLKQD 3----------------------------11112222-----3333-------------1 PQFFSNLAHTQTPEILWIGCADSRVPANQIINLPAGEVFVHRNIANQCIHSDMSFLSVLQ 111----------------3333-----1111-2222-----------1111-------- YAVQYLKVKRVVVCGHYACGGCAAALGDSRLGLIDNWLRHIRDVRRHNQAELSRITDPKD ---------------2222----1111---------------------3333-------- SLNRLIEINVLEQMHNVCATSIVQDAWDAGQELEVQGVVYGVGDGKLRDMGVVAKANDDI ----------------------------------------3333---------------- G - >RIBONUCLEASE ALPHA-SARCIN; SWP:P00655; PDB:1DE3A; AVTWTCLNDQKNPKTNKYETKRLLYNQNKAESNSHHAPLSDGKTGSSYPHWFTNGYDGDG ------------1111---------3333------------------------------- KLPKGRTPIKFGKSDCDRPPKHSKDGNGKTDHYLLEFPTFPDGHDYKFDSKKPKENPGPA -------------3333----------------------3333---1111---------- RVIYTYPNKVFCGIIAHTKENQGELKLCSH ------------------------------ >Transferrin receptor prot; SWP:P02786; PDB:1DE4C; LYWDDLKRKLSEKLDSTDFTSTIKLLNENSYVPREAGSQKDENLALYVENQFREFKLSKV 
-------------1111------11113333---2222---------------------- WRDQHFVKIQVKDSAQNSVIIVDKNGRLVYLVENPGGYVAYSKAATVTGKLVHANFGTKK --------------------------------------2222------------------ DFEDLYTPVNGSIVIVRAGKITFAEKVANAESLNAIGVLIYMDQTKFPIVNAELSFFGHA --------2222---------3333-------------------------1111------ HLGTGDPYTPGFPSFNHTQFPPSRSSGLPNIPVQTISRAAAEKLFGNMEGDCPSDWKTDS -----1111-----3333----------------------------------3333---- TCRMVTSESKNVKLTVSNVLKEIKILNIFGVIKGFVEPDHYVVVGAQRDAWGPGAAKSGV ------3333--------------------------3333-------------------- GTALLLKLAQMFSDMVLKDGFQPSRSIIFASWSAGDFGSVGATEWLEGYLSSLHLKAFTY ---------------------------------3333------------1111------- INLDKAVLGTSNFKVSASPLLYTLIEKTMQNVKHPVTGQFLYQDSNWASKVEKLTLDNAA -----------------3333-----3333---------------3333-----1111-- FPFLAYSGIPAVSFCFCEDTDYPYLGTTMDTYKELIERIPELNKVARAAAEVAGQFVIKL ---------------------1111-----3333-------------------------- THDVELNLDYERYNSQLLSFVRDLNQYRADIKEMGLSLQWLYSARGDFFRATSRLTTDFG --------3333-----------3333--------------------------------- NAEKTDRFVMKKLNDRVMRVEYHFLSPYVSPKESPFRHVFWGSGSHTLPALLENLKLRKQ --1111-------------3333------3333----------1111-------3333-- NNGAFNETLFRNQLALATWTIQGAANALSGDVWDI --------------------------33333333- ------------------------------------------------------ --------------------------------------- >Immunoglobulin G-binding ; SWP:P02976; PDB:1DEEB; QVQLVESGGGVVQPGKSLRLSCAASGFTFSGYGMHWVRQAPGKGLEWVALISYDESNKYY ---------------------------3333--------2222--------1111----- ADSVKGRFTISRDNSKNTLYLQMNSLRAEDTAVYYCAKVKFYDPTAPNDYWGQGTLVTVS 3333---------1111---------3333------------1111-------------- SGSASAPTLFPLVSCENSNPSSTVAVGCLAQDFLPDSITFSWKYKNNSDISSTRGFPSVL -------------2222--------------------------1111------------- RGGKYAATSQVLLPSKDVAQGTNEHVVCKVQHPNGNKEKDVPL -------------------------------1111-------- >Immunoglobulin G-binding ; SWP:P02976; PDB:1DEEG; DQQSAFYEILNMPNLNEAQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPK -3333---1111-----------------3333-----------3333--- 
>DEOXYNUCLEOSIDE MONOPHOSP; SWP:P04531; PDB:1DEKA; MKLIFLSGVKRSGKDTTADFIMSNYSAVKYQLAGPIKDALAYAWGVFAANTDYPLTRKEF ------------------------------1111------------------------33 EGIDYDRETNLNLTKLEVITIMEQAFCYLNGKSPIKGVFVFDDEGKESVNFVAFNKITDV 33---1111-------------------1111--2222---------------------- INNIEDQWSVRRLMQALGTDLIVNNFDRMYWVKLFALDYLDKFNSGYDYYIVPDTRQDHE 1111----------------------1111-----------1111-----------3333 MDAARAMGATVIHVVRPGQKSNDTHITEAGLPIRDGDLVITNDGSLEELFSKIKNTLKVL ----1111----------------1111-----2222----------------------- >DENDROTOXIN I; SWP:P00979; PDB:1DEM; QPLRKLCILHRNPGRCYQKIPAFYYNQKKKQCEGFTWSGCGGNSNRFKTIEECRRTCIRK ---3333----------------------------------------------------- >PROCATHEPSIN X; SWP:Q9UBR2; PDB:1DEUA; RGQTCYRPLRGDGLAPLGRTTYPRPHEYLSPAD --------2222-----------2222--3333 >Zinc finger FYVE domain-c; SWP:O95405; PDB:1DEVB; SQSPNPNNPAEYCSTIPPLQQAQASGALSSPPPTVMVPVGV ----11113333----3333------1111----------- >M-CALPAIN; SWP:Q07009; PDB:1DF0A; AGIAMKLAKDREAAEGLGSHERAIKYLNQDYETLRNECLEAGALFQDPSFPALPSSLGFK ------------------1111--2222----------1111----3333--3333---- ELGPYSSKTRGIEWKRPTEICADPQFIIGGATRTDICQGALGDSWLLAAIASLTLNEEIL ---------------3333-----------------------3333----1111-3333- ARVVPLDQSFQENYAGIFHFQFWQYGEWVEVVVDDRLPTKDGELLFVHSAEGSEFWSALL 1111-----------------------------------%%%%-----1111-------- EKAYAKINGCYEALSGGATTEGFEDFTGGIAEWYELRKPPPNLFKIIQKALEKGSLLGCS -----11113333----3333-------------1111------------1111------ IDIGHAYSVTGAEEVQKLIRIRNPWGQGEFWMSFSDFLRHYSRLEICNLTPDTLTCDSYK ----------------------1111-----------------------1111------- KWKLTKMDGNWRRGSTAGGCRNYPNTFWMNPQYLIKLEEEDEDDEDGRGCTFLVGLIQKH -----------2222----3333--3333------------------------------- RRRQRKMGEDMHTIGFGIYEVSKNFFLTERSDTFINLREVLNRFKLPPGEYVLVPSTFEP 1111-2222--------------------------------------------------- HKNGDFCIRVFSEKKADYQTVDDEIEANIEEIANEEDIGDGFRRLFAQLAGEDAEISAFE --------------------------------------------------1111------ 
LQTILRRVLAKKSDGFSIETCKIMVDMLDEDGSGKLGLKEFYILWTKIQKYQKIYREIDV -------3333-----------------------------------------------11 DRSGTMNSYEMRKALEEAGFKLPCQLHQVIVARFADDELIIDFDNFVRCLVRLEILFKIF 11----3333--------------3333-------------------------------- KQLDPENTGTIQLDLISWLSFSVL ---1111----------------- --------------------------------------------------------- >DIHYDROFOLATE REDUCTASE; SWP:P0A546; PDB:1DF7A; MVGLIWAQATSGVIGRGGDIPWRLPEDQAHFREITMGHTIVMGRRTWDSLPAKVRPLPGR --------3333---iiii----3333-------2222--------33333333--2222 RNVVLSRQADFMASGAEVVGSLEEALTSPETWVIGGGQVYALALPYATRCEVTEVDIGLP ------------2222----3333------------------3333-------------- REAGDALAPVLDETWRGETGEWRFSRSGLRYRLYSYHRS -2222-------------------1111----------- >Bowman-Birk type trypsin ; SWP:P01062; PDB:1DF9C; SHDEPSESSEPCCDSCDCTKSIPPQCHCANIRLNSCHSACKSCICTRSMPGKCRCLDTDD ------------------------------------1111-------------------- FCYKPCESMDKD ------------ >PI-SCEI ENDONUCLEASE; SWP:P17255; PDB:1DFAA; CFAKGTNVLMADGSIECIENIEVGNKVMGKDGRPREVIKLPRGRETMYSVVQKSQHRAHK ---------1111---3333--------1111---------------------------- SDSSREVPELLKFTCNATHELVVRTPRSVRRLSRTIKGVEYFEVITFEMGQKKAPDGRIV ---------------1111----------------------------------3333--- ELVKEVSKSYPISEKAYFEWTIEARDLSLLGSHVRKATYQTYAPILYENDHFFDYMQLTI ----------------------33331111-------------------3333------- EGPKVLAYLLGLWIGDGLSDRATFSVDSRDTSLMERVTEYAEKLNLCAEYKNTENPLWDA -----------------1111-----3333-----------1111--------------- IVGLGFLKDGVKNIPSFLSTDNIGTRETFLAGLIDSDGYVTDEHGIKATIKTIHTSVRDG -1111------------------------------------------------3333--- LVSLARSLGLVVSVNAEPAKVDMNGTKHKISYAIYMSGGDVLLNVLSKCAGSKKFRPAPA -----1111------------iiii------------!!!!---------1111------ AAFARECRGFYFELQELKEDDYYGITLSDDSDHQFLLANQVVVHN ---------------------------1111-----1111----- >IGG1-KAPPA 3D6 FAB (HEAVY; SWP:GC1_HUMAN; PDB:1DFBH; EVQLVESGGGLVQPGRSLRLSCAASGFTFNDYAMHWVRQAPGKGLEWVSGISWDSSSIGY ---------------------------3333--------2222----------------- 
ADSVKGRFTISRDNAKNSLYLQMNSLRAEDMALYYCVKGRDYYDSGGYFTVAFDIWGQGT 1111---------3333---------1111------------------------------ MVTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFP ----------------------------------------------%%%%---------- AVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -----------------3333------------1111------------ >IGKC protein; SWP:Q6GMX8; PDB:1DFBL; DIQMTQSPSTLSASVGDRVTITCRASQSISRWLAWYQQKPGKVPKLLIYKASSLESGVPS ----------------------------!!!!------2222------------2222-- RFSGSGSGTEFTLTISSLQPDDFATYYCQQYNSYSFGPGTKVDIKRTVAAPSVFIFPPSD ----------------------------------------------------------33 EQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLS 331111------------------------------------------------------ KADYEKHKVYACEVTHQGLSSPVTKSFNRGEC ---------------1111------------- >FASCIN; SWP:Q16658; PDB:1DFCA; EAVQIQFGLINCGNKYLTAEAFGFKVNASASSLKKKQIWTLEAAVCLRSHLGRYLAADKD ----------1111-------------------1111--------------------111 GNVTCEREVPGPDCRFLIVAHDDGRWSLQSEAHRRYFGGTEDRLSCFAQTVSPAEKWSVH 1---------1111-------------------------------------1111----- IAMHPQVNIYSVTRKRYAHLSARPADEIAVDRDVPWGVDSLITLAFQDQRYSVQTADHRF -----------1111--------------------------------------------- LRHDGRLVARPEPATGYTLEFRSGKVAFRDCEGRYLAPSGPSGTLKAGKATKVGKDELFA -1111------1111------------------------1111-----------1111-- LEQSCAQVVLQAANERNVSTMDLSANQDEETDQETFQLEIDRDTKKCAFRTHTGKYWTLT -----------1111---------------3333-------------------------3 ATGGVQSTASSKNASCYFDIEWRDRRITLRASNGKFVTSKKNGQLAASVETAGDSELFLM 333----------1111-------------1111-------------------------- KLINRPIIVFRGEHGFIGCRKVTGTLDANRSSYDVFQLEFNDGAYNIKDSTGKYWTVGSD -----------1111-------------------------iiii----1111-------- SAVTSSGDTPVDFFFEFCDYNKVAIKVGGRYLKGDHAGVLKASAETVDPASLWEY --------------------------iiii-----------------3333---- >L36 RIBOSOMAL PROTEIN; SWP:P80256; PDB:1DFEA; MKVRASVKRICDKCKVIRRHGRVYVICENPKHKQRQG ----------------------------3333----- >ENDONUCLEASE BGLII; SWP:Q45488; PDB:1DFMA; 
KIDITDYNHADEILNPQLWKEIEETLLKPLHVKASDQASKVGSLIFDPVGTNQYIKDELV --------3333------------------------2222-------------------1 PKHWKNNIPIPKRFDFLGTDIDFGKRDTLVEVQFSNYPFLLNNTVRSELFHKSNDIDEEG 111-------33331111------!!!!-------------------------------- KVAIIITKGHFPASNSSLYYEQAQNQLNSLAEYNVFDVPIRLVGLIEDFETDIDIVSTTY -------------2222-------------1111-------------------------- ADKRYSRTITKRDTVKGKVIDTNTPNTRRRKRGTIVTY -----------------------1111----------- >DEFENSIN HNP-3; SWP:NA; PDB:1DFNA; DCYCRIPACIAGERRYGTCIYQGRLWAFCC ---------2222-------%%%%------ >SERINE HYDROXYMETHYLTRANS; SWP:P00477; PDB:1DFOA; LKREMNIADYDAELWQAMEQEKVRQEEHIELIASENYTSPRVMQAQGSQLTNKYAEGYPG -3333-1111---------------------1111--------------1111----222 KRYYGGCEYVDIVEQLAIDRAKELFGADYANVQPHSGSQANFAVYTALLEPGDTVLGMNL 2------------------------------------------------2222------1 AHGGHLTHGSPVNFSGKLYNIVPYGIDATGHIDYADLEKQAKEHKPKMIIGGFSAYSGVV 111-1111-11113333---------1111------------------------------ DWAKMREIADSIGAYLFVDMAHVAGLVAAGVYPNPVPHAHVVTTTTHKTLAGPRGGLILA ---------1111-------1111--1111----1111---------------------- KGGSEELYKKLNSAVFPGGQGGPLMHVIAGKAVALKEAMEPEFKTYQQQVAKNAKAMVEV ---------------------------------------3333----------------- FLERGYKVVSGGTDNHLFLVDLVDKNLTGKEADAALGRANITVNKNSVPNDPKSPFVTSG -------2222----------3333----------------------------1111--- IRVGTPAITRRGFKEAEAKELAGWMCDVLDSINDEAVIERIKGKVLDICARYPVYA --------1111------------------1111-------------3333----- ------------------------------- ------------------------------ >50S ribosomal protein L25; SWP:P68919; PDB:1DFUP; MFTINAEVRKEQGKGASRRLRAANKFPAIIYGGKEAPLAIELDHDKVMNMQAKAEFYSEV -----------------------------------------------------3333--- LTIVVDGKEIKVKAQDVQRHPYKPKLQHIDFVRA ---------------------------------- >DESULFOFERRODOXIN; SWP:P22076; PDB:1DFX; PKHLEVYKCTHCGNIVEVLHGGGAELVCCGEPMKHMVEGSTDGAMEKHVPVIEKVDGGYL -2222----------------------iiii-----2222--------------2222-- IKVGSVPHPMEEKHWIEWIELLADGRSYTKFLKPGDAPEAFFAIDASKVTAREYCNLHGH 
----------1111------------------2222------------------------ WKAEN ----- >INTERFERON-INDUCED GUANYL; SWP:P32455; PDB:1DG3A; HMTGPMCLIENTNGRLMANPEALKILSAITQPMVVVAIVGLYRTGKSYLMNKLAGKKHTK -----------iiii-------------------------3333---------------- GIWMWCVPHPKKPGHILVLLDTEGLGDVEKGDNQNDSWIFALAVLLSSTFVYNSIGTINQ --------1111-----------22221111-1111------------------------ QAMDQLYYVTELTHRIRSKSDSADFVSFFPDFVWTLRDFSLDLQPLTPDEYLTYSLKLKK ------33333333------33333333-------------------------1111--- GTSQKDETFNLPRLCIRKFFPKKKCFVFDRPVHELDPEFVQQVADFCSYIFSNSKTKTLS --3333----------------------------------------------------22 GGIQVNGPRLESLVLTYVNAISSGDLPCMENAVLALAQIENSAAVQKAIAHYEQQMGQKV 22------------------1111------------------------------------ QLPTESLQELLDLHRDSEREAIEVFIRSSFKDVDHLFQKELAAQLEKKRDDFCKQNQEAS -----3333----------------1111--2222------------------------- SDRCSGLLQVIFSPLEEEVKAGIYSKPGGYRLFVQKLQDLKKKYYEEPRKGIQAEEILQT ---------------------11112222--------------1111---1111------ YLKSKESMTDAILQTDQTLTEKEKEIEVERVKAESAQASAKMLHEMQRKNEQMMEQKERS ------------------------------------------------------------ YQEHLKQLTEKMENDRVQLLKEQERTLALKLQEQEQLLKEGFQKESRIMKNEIQDLQTKM ------------------------------------------------------------ >APO2L/TNF-RELATED APOPOTI; SWP:P50591; PDB:1DG6A; QRVAAHITGTRKNEKALGRKINSWESSRSGHSFLSNLHLRNGELVIHEKGFYYIYSQTYF ---------------------------------------iiii----------------- RFQEKENTKNDKQMVQYIYKYTSYPAPILLMKSARNSCWSKDAEYGLYSIYQGGIFELKE ---------------------------------------1111---------------22 NDRIFVSVTNEHLIDMDHEASFFGAFLVG 22-------1111---1111--------- >TYROSINE PHOSPHATASE; SWP:P11064; PDB:1DG9A; AEQVTKSVLFVCLGNICRSPIAEAVFRKLVTDQNISDNWVIDSGAVSDWNVGRSPDPRAV ------------------------------11113333---------1111--------- SCLRNHGINTAHKARQVTKEDFVTFDYILCMDESNLRDLNRKSNQVKNCRAKIELLGSYD ---1111----------3333--------------------3333---------3333-1 PQKQLIIEDPYYGNDADFETVYQQCVRCCRAFLEKVR 111------11113333-------------------- >CATALASE; SWP:P04040; PDB:1DGFA; 
RDPASDQMQHWKEQRAAQKADVLTTGAGNPVGDKLNVITVGPRGPLLVQDVVFTDEMAHF -3333-------------------1111------------------1111---------1 DRERIPERVVHAKGAGAFGYFEVTHDITKYSKAKVFEHIGKKTPIAVRFSTVAGESGSAD 111-----------------------3333--3333-2222-------------1111-- TVRDPRGFAVKFYTEDGNWDLVGNNTPIFFIRDPILFPSFIHSQKRNPQTHLKDPDMVWD -------------1111---------------3333-----1111--------------- FWSLRPESLHQVSFLFSDRGIPDGHRHMNGYGSHTFKLVNANGEAVYCKFHYKTDQGIKN ----3333--------1111---1111------------1111----------1111--- LSVEDAARLSQEDPDYGIRDLFNAIATGKYPSWTFYIQVMTFNQAETFPFNPFDLTKVWP ------------------------1111------------3333------1111-----3 HKDYPLIPVGKLVLNRNPVNYFAEVEQIAFDPSNMPPGIEASPDKMLQGRLFAYPDTHRH 333----------------3333-1111--1111-2222--------------------- RLGPNYLHIPVNCPYRARVANYQRDGPMCMQDNQGGAPNYYPNSFGAPEQQPSALEHSIQ ----33333333-1111--------------%%%%---------------3333------ YSGEVRRFNTANDDNVTQVRAFYVNVLNEEQRKRLCENIAGHLKDAQIFIQKKAVKNFTE ---------1111-----------------------------1111-------------- VHPDYGSHIQALLDKYN ----------------- >ALDEHYDE OXIDOREDUCTASE; SWP:Q9REC4; PDB:1DGJA; METKTLIVNGMARRLLVSPNDLLVDVLRSQLQLTSVKVGCGKGQCGACTVILDGKVVRAC -------iiii------1111-----------3333---------1111--iiii--111 IIKMSRVAENASVTTLEGIGAPDCLHPLQHAWIQHGAAQCGFCTPGFIVSAKALLDENVA 1-3333-2222---3333--1111----------------1111---------3333--- PSREDVRDWFQKHHNICRCTGYKPLVDAVMDAAAILRGEKTVEEISFKMPADGRIWGSSI ---------------------3333---------1111--3333-----3333-2222-- PRPSAVAKVTGLAEFGADAALRMPENTLHLALAQAKVSHALIKGIDTSEAEKMPGVYKVL -1111---------33333333-1111-------------------------2222---- THKDVKGKNRITGLITFPTNKGDGWERPILNDSKIFQYGDALAIVCADSEANARAAAEKV 3333------------1111-----------------------------------3333- KFDLELLPEYMSAPEAMAPDAIEIHPGTPNVYYDQLEEKGEDTVPFFNDPANVVAEGSYY -----------3333--1111---2222--------------3333--1111-------- TQRQPHLPIEPDVGYGYINEQGQVVIHSKSVAIHLHALMIAPGLGLEFPKDLVLVQNTTG ------------------------------------------------------------ GTFGYKFSPTMEALVGVAVMATGRPCHLRYNYEQQQNYTGKRSPFWTTMRYAADRQGKIL 
---1111---3333-----------------3333------------------1111--- AMETDWSVDHGPYSEFGDLLTLRGAQYIGAGYGIANIRGTGRTVATNHCWGAAFRGYGAP -------------2222----3333-2222------------------------------ ESEFPSEVLMDELAEKLGMDPFELRALNCYREGDTTSSGQIPEVMSLPEMFDKMRPYYEE ------------------------------------------------------------ SKKRVKERSTAEIKRGVGVALGVYGAGLDGPDTSEAWVELNDDGSVTLGNSWEDHGQGAD ---------1111---------------------------1111---------------- AGSLGTAHEALRPLGITPENIHLVMNDTSKTPNSGPAGGSRSQVVTGNAIRVACEMLIEG ----------3333--1111------1111-------%%%%-----------------11 MRKPGGGFFTPAEMKAEGRPMRYDGKWTAPAKDCDAKGQGSPFACYMYGLFLTEVAVEVA 112222----------------------------1111---------------------- TGKATVEKMVCVADIGKICNKLVVDGQIYGGLAQGVGLALSEDYEDLKKHSTMGGAGIPS -----------------------------------------------11113333----3 IKMIPDDIEIVYVETPRKDGPFGASGVGEMPLTAPHAAIINGIYNACGARVRHLPARPEK 333-------------1111%%%%----1111---------------------------- VLEAMP ------ >LECTIN; SWP:P08902; PDB:1DGLA; ADTIVAVELNSYPNTDIGDPNYPHIGIDIKSIRSKSTARWNMQTGKVGTVHISYNSVAKR -------------3333------------------------------------------- LSAVVSYSGSSSTTVSYDVDLNNVLPEWVRVGLSATTGLYKETNTILSWSFTSKLKTNSI ------2222---------3333------------------------------------- ADANSLHFSFHQFSQNPKDLILQGDAFTDSDGNLELTKVSSSGDPQGNSVGRALFYAPVH ----------------------------1111-------3333----------------- IWEKSAVVASFDATFTFLIKSPDREPADGITFFIANTDTSIPSGSGGRLLGLFPDAN --1111-----------------------------1111--2222-1111------- >ICEBERG (PROTEASE INHIBIT; SWP:P57730; PDB:1DGNA; ADQLLRKKRRIFIHSVGAGTINALLDCLLEDEVISQEDMNKVRDENDTVMDKARVLIDLV --3333----3333--------------3333--3333---------3333--------- TGKGPKSCCKFIKHLCEEDPQLASKMGLH --------------------3333----- >DNA LIGASE; SWP:Q9ZHI0; PDB:1DGSA; MTREEARRRINELRDLIRYHNYRYYVLADPEISDAEYDRLLRELKELEERFPEFKSPDSP ----------------------------------3333------------3333----33 TEQVGARPLEPTFRPVRHPTRMYSLDNAFTYEEVLAFEERLEREAEAPSLYTVEHKVDGL 33---------------------------------------------------------- 
SVLYYEEGVWSTGSGDGEVGEEVTQNLLTIPTIPRRLKGVPDRLEVRGEVYMPIEAFLRL -----iiii--------------3333--3333--------------------------- NEELEERGEKVFKNPRNAAAGSLRQKDPRVTAKRGLRATFYALGLGLGLEESGLKSQYEL -----------------------------3333--------------------------- LLWLKEKGFPVEHCYEKALGAEGVEEVYRRGLAQRHALPFEADGVVLKLDDLTLWGELGY --------------------------------------------------3333------ TARAPRFALAYKFPAEEKETRLLDVVFQVGRTGRVTPVGVLEPVFIEGSEVSRVTLHNES -----------------------------1111--------------------------- YIEELDIRIGDWVLVHKAGGVIPEVLRVLKERRTGKERPIRWPEACPECGHRLVKEGKVH --1111----------2222--------3333-----------------------!!!!- RCPNPLCPAKRFEAIRHYASRKAMDIEGLGEKLIERLLEKGLVRDVADLYHLRKEDLLGL ---11111111--------3333--33333333----3333---33331111-3333--- ERMGEKSAQNLLRQIEESKHRGLERLLYALGLPGVGEVLARNLARRFGTMDRLLEASLEE ----1111--------3333-----------22223333---------333311113333 LIEVEEVGELTARAILETLKDPAFRDLVRRLKEAGVSMESK 3333---3333---------3333----------------- >CANAVALIN; SWP:P50477; PDB:1DGWA; NNPYLFRSNKFLTLFKNQHGSLRLLQRFNEDTEKLENLRDYRVLEYCSKPNTLLLPHHSD -1111-1111----------------1111----1111---------------------- SDLLVLVLEGQAILVLVNPDGRDTYKLDQGDAIKIQAGTPFYLINPDNNQNLRILKFAIT ---------------------------2222----2222--------------------- FRRPGTVEDFFLSSTKRLPSYLSAFSKNFLEASYDSPYDEIEQTLLQEEQEGVIVKMP --2222--------------3333------------3333------------------ >Canavalin [Precursor]; SWP:P50477; PDB:1DGWX; DKPFNLRSRDPIYSNNYGKLYEITPEKNSQLRDLDILLNCLQMNEGALFVPHYNSRATVI ----1111---------------1111-3333-----------2222------------- LVANEGRAEVELVGLE ---------------- >Canavalin [Precursor]; SWP:P50477; PDB:1DGWY; QLRRYAATLSEGDIIVIPSSFPVALKAASDLNMVGIGVNAENNERNFLAGHKENVIRQIP ---------2222----2222------------------2222----------3333--- RQVSDLTFPGSGEEVEELLENQKESYFVDGQP -------------------------------- >TRANSCRIPTION FACTOR CREB; SWP:Q01147; PDB:1DH3A; KREVRLMKNREAARESRRKKKEYVKSLENRVAVLENQNKTLIEELKALKDLYSHK ----------------------------------------------3333----- >Alpha-amylase inhibitor 1; SWP:P02873; PDB:1DHKB; 
ATETSFIIDAFNKTNLILQGDATVSSNGNLQLSYNSYDSMSRAFYSAPIQIRDSTTGNVA -----------3333---------1111----1111------------------------ SFDTNFTMNIRTHRSAVGLDFVLVPVDTVTVEFDTFLSRISIDVNNNDIKSVPWDVHDYD -------------------------------------------iiii-------333322 GQNAEVRITYNSSTKVFSVSLSNPSTGKSNNVSTTVELEKEVYDWVSVGFSATSGAYQWS 22-------------------------------------3333------------!!!!- YETHDVLSWSFSSKF --------------- >7,8-DIHYDRONEOPTERIN ALDO; SWP:P56740; PDB:1DHN; MQDTIFLKGMRFYGYHGALSAENEIGQIFKVDVTLKVDLSEAGRTDNVIDTVHYGEVFEE ------------------3333------------------------1111--3333---- VKSIMEGKAVNLLEHLAERIANRINSQYNRVMETKVRITKENPPIPGHYDGVGIEIVREN ---------------------------3333----------------------------- K - >DIHYDROPTERIDINE REDUCTAS; SWP:P11348; PDB:1DHR; EARRVLVYGGRGALGSRCVQAFRARNWWVASIDVVENEEASASVIVKMTDSFTEQADQVT --------1111----------1111----------1111-------------------- AEVGKLLGDQKVDAILCVAGGWAGGNAKSKSLFKNCDLMWKQSIWTSTISSHLATKHLKE ------!!!!---------------1111-----------------------------22 GGLLTLAGAKAALDGTPGMIGYGMAKGAVHQLCQSLAGKNSGMPSGAAAIAVLPVTLDTP 22------3333---1111---------------1111-----2222------------- MNRKSMPEADFSSWTPLEFLVETFHDWITGNKRPNSGSLIQVVTTDGKTELTPAYF -----1111-1111-3333---------------2222------iiii-------- >LUMAZINE SYNTHASE; SWP:P61711; PDB:1DI0A; TSFKIAFIQARWHADIVDEARKSFVAELAAKTGGSVEVEIFDVPGAYEIPLHAKTLARTG ------------3333----------------------------3333------------ RYAAIVGAAFVIDGGIYDHDFVATAVINGMMQVQLETEVPVLSVVLTPHHFHES ------------------------------------------------------ >ARISTOLOCHENE SYNTHASE; SWP:Q03471; PDB:1DI1A; TPPPTQWSYLCHPRVKEVQDEVDGYFLENWKFPSFKAVRTFLDAKFSEVTCLYFPLALDD -----------1111------------------3333----------------1111111 RIHFACRLLTVLFLIDDVLEHMSFADGEAYNNRLIPISRGDVLPDRTKPEEFILYDLWES 1-----------------1111----------------------1111------------ MRAHDAELANEVLEPTFVFMRAQTDRARLSIHELGHYLEYREKDVGKALLSALMRFSMGL -------------------1111----------------3333----------------- RLSADELQDMKALEANCAKQLSVVNDIYSYDKEEEALCSAVKVLAEESKLGIPATKRVLW 
------------------------------1111----3333---1111----------- SMTREWETVHDEIVAEKIASPDGCSEAAKAYMKGLEYQMSGNEQWSKTTR -------------------1111--------------------------- >DOUBLE STRANDED RNA BINDI; SWP:Q91836; PDB:1DI2A; MPVGSLQELAVQKGWRLPEYTVAQESGPPHKREFTITCRVETFVETGSGTSKQVAKRVAA ---------------------------1111----------------------------- EKLLTKFKT --------- >MOLYBDENUM COFACTOR BIOSY; SWP:P28694; PDB:1DI6A; ATLRIGLVSISDRDKGIPALEEWLTSALTTPFELETRLIPDEQAIIEQTLCELVDEMSCH ------------------------------------------------------------ LVLTTGGTGPARRDVTPDATLAVADREMPGFGEQMRQISLHFVPTAILSRQVGVIRKQAL ----------1111------1111--------------3333-3333------------- ILNLPGQPKSIKETLEGVKDAEGNVVVHGIFASVPYCIQLLEGPYVETAPEVVAAFRPKS -------------------1111-----3333------1111------3333-----333 ARR 3-- >DELTA-SLEEP-INDUCING PEPT; SWP:P80220; PDB:1DIPA; MDLVKNHLMYAVREEVEILKEQIRELVEKNSQLERENTLLKTLASPEQLEKFQSRLSPEE ------1111-------------------------------------------------- PAPETPEAPEAPGGSAV ----------------- >RIBOSOMAL PROTEIN L9; SWP:P02417; PDB:1DIV; MKVIFLKDVKGKGKKGEIKNVADGYANNFLFKQGLAIEATPANLKALEAQKQKEQRQAAE ------------3333-------------3333--------------------------- ELANAKKLKEQLEKLTVTIPAKAGEGGRLFGSITSKQIAESLQAQHGLKLDKRKIELADA ------------------------iiii----------------------3333------ IRALGYTNVPVKLHPEVTATLKVHVTEQK ----------------------------- >PSEUDOURIDINE SYNTHASE I; SWP:P07649; PDB:1DJ0A; PPVYKIALGIEYDGSKYYGWQRQNEVRSVQEKLEKALSQVANEPITVFCAGRTDAGVHGT -------------1111----2222----------------------------2222--- GQVVHFETTALRKDAAWTLGVNANLPGDIAVRWVKTVPDDFHARFSATARRYRYIIYNHR ------------3333-----11111111--------1111--1111------------- LRPAVLSKGVTHFYEPLDAERMHRAAQCLLGENDFTSFRAVQCQSRTPWRNVMHINVTRH -----1111----------------3333-----3333-1111----------------! 
GPYVVVDIKANAFVHHMVRNIVGSLMEVGAHNQPESWIAELLAAKDRTLAAATAKAEGLY !!!----------2222----------1111--1111----33333333------1111- LVAVDYPDRYDLPKPPMGPLFLAD ------3333--------!!!!-- >ADENYLOSUCCINATE SYNTHETA; SWP:Q96529; PDB:1DJ2A; IGSLSQVSGVLGCQWGDEGKGKLVDILAQHFDIVARCQGGANAGHTIYNSEGKKFALHLV 1111----------------------3333------------------3333-------- PSGILNEDTTCVIGNGVVVHLPGLFKEIDGLESNGVSCKGRILVSDRAHLLFDFHQEVDG 1111---------1111-------------3333---2222---1111---3333----- LRESELAKSFIGTTKRGIGPAYSSKVIRNGIRVGDLRHMDTLPQKLDLLLSDAAARFQGF --3333-------------------------3333--3333---------------3333 KYTPEMLREEVEAYKRYADRLEPYITDTVHFINDSISQKKKVLVEGGQATMLDIDFGTYP --------------------3333------------------------1111-------- FVTSSSPSAGGICTGLGIAPSVVGDLIGVVKAYTTRVGSGPFPTENLGTGGDLLRLAGQE -------3333-------3333-------------------1111----------1111- FGTTTGRPRRCGWLDIVALKFSCQINGFASLNLTKLDVLSDLNEIQLGVAYKRSDGTPVK ----------------------------------33331111----------1111---- SFPGDLRLLEELHVEYEVLPGWKSDISSVRNYSDLPKAAQQYVERIEELVGVPIHYIGIG ----3333-----------------1111-3333-------------------------- PGRDALIYK -1111---- >ADENYLOSUCCINATE SYNTHETA; SWP:O24396; PDB:1DJ3A; ADRVSSLSNVSGVLGSQWGDEGKGKLVDVLAPRFDIVARCQGGANAGHTIYNSEGKKFAL -3333----------------3333----3333---------3333-----1111----- HLVPSGILHEGTLCVVGNGAVIHVPGFFGEIDGLQSNGVSCDGRILVSDRAHLLFDLHQT ---1111-1111----3333--------------------2222---1111---3333-- VDGLREAELANSFIGTTKRGIGPCYSSKVTRNGLRVCDLRHMDTFGDKLDVLFEDAAARF ----------------------------------3333---11111111----------- EGFKYSKGMLKEEVERYKKFAERLEPFIADTVHVLNESIRQKKKILVEGGQATMLDIDFG -----------------------3333-----------1111---------1111----- TYPFVTSSSPSAGGICTGLGIAPRVIGDLIGVVKAYTTRVGSGPFPTELLGEEGDVLRKA ----------3333-------3333-------------------1111--3333------ GMEFGTTTGRPRRCGWLDIVALKYCCDINGFSSLNLTKLDVLSGLPEIKLGVSYNQMDGE -------------------------------------33332222--------------- KLQSFPGDLDTLEQVQVNYEVLPGWDSDISSVRSYSELPQAARRYVERIEELAGVPVHYI 
-------3333----------------------3333-3333------------------ GVGPGRDALIYK ----3333---- >FERREDOXIN THIOREDOXIN RE; SWP:Q55389; PDB:1DJ7A; NNKTLAAMKNFAEQYAKRTDTYFCSDLSVTAVVIEGLARHKEELGSPLCPCRHYEDKEAE ----------------1111---------------------------------------- VKNTFWNCPCVPMRERKECHCMLFLTPDNDFAGDAQDIPMETLEEVKAS ---1111--3333-----1111---3333-------------------- >Ferredoxin-thioredoxin re; SWP:Q55781; PDB:1DJ7B; MNVGDRVRVTSSVVVYHHPEHKKTAFDLQGMEGEVAAVLTEWQGRPISANLPVLVKFEQR -2222------------1111------2222----------iiii------------%%% FKAHFRPDEVTLI %----1111---- >PROTEIN HNS-DEPENDENT EXP; SWP:P26604; PDB:1DJ8A; NKKPVNSWTCEDFLAVDESFQPTAVGFAEALNNKDKPEDAVLDVQGIATVTPAIVQACTQ ---3333-333311113333---------------3333--------------------- DKQANFKDKVKGEWDKIKK 1111--------------- >TRIMETHYLAMINE DEHYDROGEN; SWP:P16099; PDB:1DJNA; ARDPKHDILFEPIQIGPKTLRNRFYQVPHCIGAGSDKPGFQSAHRSVKAEGGWAALNTEY --3333-1111---!!!!--------------!!!!------------------------ CSINPESDDTHRLSARIWDEGDVRNLKAMTDEVHKYGALAGVELWYGGAHAPNMESRATP ---1111-------------------------3333----------!!!!---------- RGPSQYASEFETLSYCKEMDLSDIAQVQQFYVDAAKRSRDAGFDIVYVYGAHSYLPLQFL ----------1111------------------------1111--------%%%%-3333- NPYYNKRTDKYGGSLENRARFWLETLEKVKHAVGSDCAIATRFGVDTVYGPGQIEAEVDG --------1111------------------------------------------1111-- QKFVEMADSLVDMWDITIGDIAEWGEDAGPSRFYQQGHTIPWVKLVKQVSKKPVLGVGRY ------3333-------------3333--3333-22223333------------------ TDPEKMIEIVTKGYADIIGCARPSIADPFLPQKVEQGRYDDIRVCIGCNVCISRWEIGGP --------------------3333--1111-------1111-------3333-------- PMICTQNATAGEEYRRGWHPEKFRQTKNKDSVLIVGAGPSGSEAARVLMESGYTVHLTDT ------1111---1111--------------------3333-------1111-------- AEKIGGHLNQVAALPGLGEWSYHRDYRETQITKLLKKNKESQLALGQKPMTADDVLQYGA ---------33332222---------------3333-1111------------------- DKVIIATGARWNTDGTNCLTHDPIPGADASLPDQLTPEQVMDGKKKIGKRVVILNADTYF ----------------3333---22221111----------------------------- MAPSLAEKLATAGHEVTIVSGVHLANYMHFTLEYPNMMRRLHELHVEELGDHFCSRIEPG 
---------1111-----------33331111---------1111------------222 RMEIYNIWGDGSKRTYRGPGVSPRDANTSHRWIEFDSLVLVTGRHSECTLWNELKARESE 2-------------------------------------------------------3333 WAENDIKGIYLIGDAEAPRLIADATFTGHRVAREIEEANPQIAIPYKRETIAWGTPHMPG -1111------!!!!------------------1111-3333---------------222 GNFKIEYKV 2-------- >Heat-labile enterotoxin B; SWP:P32890; PDB:1DJRD; APQTITELCSEYRNTQIYTINDKILSYTESMAGKREMVIITFKSGETFQVEVPGSQHIDS --------1111-------------------2222------3333--------1111333 QKKAIERMKDTLRITYLTETKIDKLCVWNNKTPNSIAAISMKN 3------------------------------------------ >FIBROBLAST GROWTH FACTOR ; SWP:P21802; PDB:1DJSA; TLEPEGAPYWTNTEKEKRLHAVPAANTVKFRCPAGGNPPTRWLKNGKEFKQEHRIGGYKV ----------------------2222-----------------%%%%--11112222--- RNQHWSLIESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVV -----------3333---------1111--------------------2222------22 GGDVEFVCKVYSDAQPHIQWIKHVEKPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEY 22-------------------------------------3333---------3333---- TCLAGNSIGISFHSAWLTVLPA -----1111------------- >ALPHA-LIKE NEUROTOXIN BMK; SWP:P45697; PDB:1DJTA; VRDAYIAKPHNCVYECARNEYCNDLCTKNGAKSGYCQWVGKYGNGCWCIELPDNVPIRVP --------------------------1111---------1111--------1111----- GKCH ---- >PHOSPHOINOSITIDE-SPECIFIC; SWP:P10688; PDB:1DJXA; EIETFYKMLTQRAEIDRAFEEAAGSAETLSVERLVTFLQHQQREEEAGPALALSLIERYE ------3333-3333--------------------------------------------- PSETAKAQRQMTKDGFLMYLLSADGNAFSLAHRRVYQDMDQPLSHYLVSSSHNTYLLEDQ ---------------------3333---3333---------3333--------------- LTGPSSTEAYIRALCKGCRCLELDCWDGPNQEPIIYHGYTFTSKILFCDVLRAIRDYAFK -----3333----1111-----------%%%%----2222----------------1111 ASPYPVILSLENHCSLEQQRVMARHLRAILGPILLDQPLDGVTTSLPSPEQLKGKILLKG -----------------------------!!!!-----2222-----33332222----- KKLKLVPELSDMIIYCKSVHFGGFSSPGTSGQAFYEMASFSESRALRLLQESGNGFVRHN -----33331111-------------------1111------------------------ VSCLSRIYPAGWRTDSSNYSPVEMWNGGCQIVALNFQTPGPEMDVYLGCFQDNGGCGYVL 
---------1111------3333-1111------1111-------------iiii----- KPAFLRDPNTTFNSRALTQGPWWRPERLRVRIISGQQLPKVNKNKNSIVDPKVIVEIHGV -3333-1111--3333---3333------------------------------------3 GRDTGSRQTAVITNNGFNPRWDMEFEFEVTVPDLALVRFMVEDYDSSSKNDFIGQSTIPW 333---------------------------3333------------------------11 NSLKQGYRHVHLLSKNGDQHPSATLFVKISIQD 11-----------1111--1111---------- >HEME-BINDING PROTEIN A; SWP:Q54450; PDB:1DK0A; AFSVNYDSSFGGYSIHDYLGQWASTFGDVNHTNGNVTDANSGGFYGGSLSGSQYAISSTA ------3333---------------------2222-3333-------------------- NQVTAFVAGGNLTYTLFNEPAHTLYGQLDSLSFGDGLSGGDTSPYSIQVPDVSFGGLNLS --------------!!!!------------------------------------------ SLQAQGHDGVVHQVVYGLMSGDTGALETALNGILDDYGLSVNSTFDQVAAATA -33331111--------1111-----------3333---1111---------- >30S RIBOSOMAL PROTEIN S15; SWP:P80378; PDB:1DK1A; PITKEEKQKVQEFARFPGDTGSTEVQVALLTLRINRLSEHLKVHKKDHHSHRGLLVGQRR --11113333---------------------------------111133333333----- RLLRYLQREDPERYRLIEKLGI ---------------------- >ANNEXIN 24(CA32); SWP:NA; PDB:1DK5A; HHHHMASLTVPAHVPSAAEDCEQLRSAFKGWGTNEKLIISILAHRTAAQRKLIRQTYAET ---------------------------------3333---1111---------------- FGEDLLKELDRELTHDFEKLVLVWTLDPSERDAHLAKEATKRWTKSNFVLVELACTRSPK ---3333-------------------3333---------------------------333 ELVLAREAYHARYKKSLEEDVAYHTTGDHRKLLVPLVSSYRYGGEEVDLRLAKAESKILH 3----------------------------------------------3333--------- EKISDKAYSDDEVIRILATRSKAQLNATLNHYKDEHGEDILKQLEDGDEFVALLRATIKG ------1111------------3333---3333-----3333------------------ LVYPEHYFVEVLRDAINRRGTEEDHLTRVIATRAEVDLKIIADEYQKRDSIPLGRAIAKD --------------------------------1111---------------3333----- TRGDYESMLLALLGQE ---------------- >AXIN; SWP:O15169; PDB:1DK8A; GSASPTPPYLKWAESLHSLLDDQDGISLFRTFLKQEGCADLLDFWFACTGFRKLEPCDSN ----------3333-------------------1111-------------3333------ EEKRLKLARAIYRKYILDNNGIVSRQTKPATKSFIKGCIMKQLIDPAMFDQAQTEIQATM -----------------11113333-------------------1111------------ EENTYPSFLKSDIYLEYTRTGSESPKV 
----------------3333------- >ANTIFUNGAL PEPTIDE; SWP:P81418; PDB:1DKCA; AGCIKNGGRCNASAGPPYCCSSYCFQIAGQSYGVCKNR ----------1111------------------------ >Retinoic acid receptor al; SWP:P10276; PDB:1DKFB; PEVGELIEKVRKAHQETFPALCQLGKYTTSEQRVSLDIDLWDKFSELSTKCIIKTVEFAK --------------1111-3333------------------------------------- QLPGFTTLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAGFGP -2222-----------------------------1111---1111------------!!! LTDLVFAFANQLLPLEMDDAETGLLSAICLICGDRQDLEQPDRVDMLQEPLLEALKVYVR !----------3333--------------------------------------------- KRRPSRPHMFPKMLMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLEN --1111---------------------------------------------- >NUCLEOTIDE EXCHANGE FACTO; SWP:P09372; PDB:1DKGA; AEQVDPRDEKVANLEAQLAEAQTRERDGILRVKAEMENLRRRTELDIEKAHKFALEKFIN ----3333-------------------------------------------------111 ELLPVIDSLDRALEVAMSAMVEDIELTLKSMLDVVRKFGVEVIAETNVPLDPNVHQAIAM 11111---------------------------------------------1111------ VESDDVAPGNVLGIMQKGYTLNGRTIRAAMVTVAKAKA ------2222---------------------------- >NUCLEOTIDE EXCHANGE FACTO; SWP:P04475; PDB:1DKGD; KIIGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQDGETLVGQPAKRQAVTNP -------1111---------------1111----------1111-----3333-----11 QNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAWVEVKGQKMAPPQISAEVLKK 11--------------------3333-------3333-----iiii--3333-------- MKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALAYGLD --------------------11113333--------1111-------------------- KTGNRTIAVYDLGGGTFDISIIEIDEKTFEVLATNGDTHLGGEDFDSRLINYLVEEFKKD ----------------------------------------3333---------------- QGIDLRNDPLAMQRLKEAAEKAKIELSSAQQTDVNLPYITADATGPKHMNIKVTRAKLES ---3333-3333-----------------------------1111--------------- LVEDLVNRSIELLKVALQDAGLSVSDIDDVILVGGQTRMPMVQKKVAEFFGKEPRKDVNP --3333-----------1111-3333--------11113333----------------11 DEAVAIGAAVQGGVLT 11-------------- >PYROGENIC EXOTOXIN B ZYMO; SWP:Q5X9P3; PDB:1DKIA; LDKVNLGGELSGSNMYVYNGFVIVSGDKRSPEILGYSTSGSFDVNGKENIASFMESYVEQ 
------!!!!---------------------------------2222------------- IKENKKL ---1111 >PHYTASE; SWP:P07102; PDB:1DKQA; QSEPELKLESVVIVSRAGVRAPTKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGH -------------------------33331111---------2222-------------- YQRQRLVADGLLAKKGCPQSGQVAIIADVDERTRKTGEAFAAGLAPDCAITVHTQTDTSS ------1111--------2222-------3333-----------------------1111 PDPLFNPLKTGVCQLDNANVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLK -3333--1111--------------1111---------------------3333---111 REKQDESCSLTQALPSELKVSADNVSLTGAVSLASMLTEIFLLQQAQGMPEPGWGRITDS 1-------3333--------1111--------------------1111---%%%%----- HQWNTLLSLHNAQFYLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFI -------------------3333-----------------------2222---------- AGHDTNLANLGGALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQM -------------------2222----2222----------------------------1 RDKTPLSLNTPPGEVKLTLAGCEERNAQGMCSLAGFTQIVNEARIPACSL 111---3333--------1111---1111---------------3333-- >PHOSPHORIBOSYL PYROPHOSPH; SWP:P14193; PDB:1DKUA; NLKIFSLNSNPELAKEIADIVGVQLGKCSVTRFSDGEVQINIEESIRGCDCYIIQSTSDP --------------------------------1111---------2222----------- VNEHIMELLIMVDALKRASAKTINIVIPYYGYARQDRKARSREPITAKLFANLLETAGAT -------------------------------1111------------------------- RVIALDLHAPQIQGFFDIPIDHLMGVPILGEYFEGKNLEDIVIVSPDHGGVTRARKLADR --------3333------------3333------------------1111---------- LKAPIAIIDKRMNIVGNIEGKTAILIDDIIDTAGTITLAANALVENGAKEVYACCTHPVL --------------------------------3333------------------------ SGPAVERINNSTIKELVVTNSIKLKIERFKQLSVGPLLAEAIIRVHEQQSVSYLF !!!!---------------------------------------------3333-- >SUBSTRATE BINDING DOMAIN ; SWP:P04475; PDB:1DKZA; VLLLDVTPLSLGIETMGGVMTTLIAKNTTIPTKHSQVFSTAEDNQSAVSIHVLQGERKRA --------------2222-----------------------2222-------------33 ADNKSLGQFNLDGINPAPRGMPQIEVTFDIDADGILHVSAKDKNSGKEQKITIKASSGLN 33---------------2222---------1111-------------------1111--- EDEIQKMVRDAEANAEADRKFEELVQTRNQGDHLLHSTRKQVEEAGDKLPADDKTAIESA 
--------------------------------------------!!!!------------ LTALETALKGEDKAAIEAKMQELAQVSQKLMEIAQ ------1111------------------------- >J-ATRACOTOXIN-HV1C; SWP:P82228; PDB:1DL0A; AICTGADRPCAACCPCCPGTSCKAESNGVSYCRKDEP ------------------------3333--------- >CLASS I ALPHA-1,2-MANNOSI; SWP:P32906; PDB:1DL2A; GAGEMRDRIESMFLESWRDYSKHGWGYDVYGPIEHTSHNMPRGNQPLGWIIVDSVDTLML -----------------------2222----1111-----1111---------------- MYNSSTLYKSEFEAEIQRSEHWINDVLDFDIDAEVNVFETTIRMLGGLLSAYHLSDVLEV ------------------------------------------------------------ GNKTVYLNKAIDLGDRLALAFLSTQTGIPYSSINLHSGQAVKNHADGGASSTAEFTTLQM -3333--------------11113333-----------------%%%%--3333------ EFKYLAYLTGNRTYWELVERVYEPLYKNNDLLNTYDGLVPIYTFPDTGKFGASTIRFGSR --------------------------1111----%%%%------1111---------222 GDSFYEYLLKQYLLTHETLYYDLYRKSMEGMKKHLLAQSKPSSLWYIGEREQGLHGQLSP 2---------------------------------------------------1111---- KMDHLVCFMGGLLASGSTEGLSIHEARRRPFFSKSDWDLAKGITDTCYQMYKQSSSGLAP --3333-----------iiii33331111---------------------1111------ EIVVFNDGNIKDGWWRSSVGDFFVKPLDRHNLQRPETVESIMFMYHLSHDHKYREWGAEI ----------------1111----1111---------------------3333------- ATSFFENTCVDCNDPKLRRFTSLSDCITLPTKKSNNMESFWLAETLKYLYILFLDEFDLT ----------1111-----------------------1111---------1111------ KVVFNTEAHPFPVLDEEILKSQSLTTGWSL ----1111-----------1111------- >PHOSPHORIBOSYLANTRANILATE; SWP:Q56320; PDB:1DL3A; MVRVKICGITNLEDALFSVESGADYVGFVFYPKSKRYISPEDARRISVELPVERVGVFVN ----------3333----------------1111-----------3333----------- EEPEKILDVASYVQLNAVQLHGEEPIELCRKIAERILVWKAVGVSNERDMERALNYREFP ------------------------3333-----------------3333------3333- ILLDTDWSLILPYRDRFRYLVLSGGLNPENVRSAIDVVRPFAVDVSSGVEAFPGKKDHDS ------1111----------------3333---------------3333--2222----- IKMFIKNAKGL ----------- >PROTEIN-L-ISOASPARTATE O-; SWP:Q56308; PDB:1DL5A; MREKLFWILKKYGVSDHIAKAFLEIPREEFLTKSYPLSYVYEDIVLVSYDDGEEYSTSSQ 3333-----1111-33333333---3333------3333--------------------- 
PSLMALFMEWVGLDKGMRVLEIGGGTGYNAAVMSRVVGEKGLVVSVEYSRKICEIAKRNV -------------2222------!!!!-------1111---------------------- ERLGIENVIFVCGDGYYGVPEFSPYDVIFVTVGVDEVPETWFTQLKEGGRVIVPINLKLS 1111---------3333-3333-----------------------2222----------- RRQPAFLFKKKDPYLVGNYKLETRFITAGGNLGNLLERNRKLLREFPFNREILLVRSHIF -----------------------------1111----3333------------------- VELVDLLTRRLTEIDGTFYYAGPNGVVEFLDDRMRIYGDAPEIENLLTQWESCGYRSFEY -------------iiii----------------------3333-------------3333 LMLHVGYNAFSHISCSI ----------------- >Ig heavy chain V region P; SWP:P01820; PDB:1DL7H; QVQLKESGPGLVAPSQSLSITCTVSGFSLTGYGVNWVRQPPGKGLEWLGMIWGDGSTDYN ---------------------------------------------------1111----- SALKSRLNISKDKSKSQVFLRMYSLQTDDTARYYCARDYGPYWGQGTLVTVS ---1111-----1111---------1111----------------------- >Putative uncharacterized ; SWP:Q0VDX6; PDB:1DL7L; QAVVTQESALTTSPGETVTLTCRSSTGAVTTSNYANWVQEKPDHLFTGLIGGTKHRTPGA ------------2222-------------3333--------------------------- PARFSGSLIGDKAALTITGAQTEDEAIYFCALWYSNHWVFGGGTKLTVL 1111----------------1111------------------------- >DELTA-ENDOTOXIN CRYIIIA; SWP:P0A379; PDB:1DLC; TTKDVIQKGISVVGDLLGVVGFPFGGALVSFYTNFLNTIWPSEDPWKAFMEQVEALMDQK ---------------1111----------------------------------------- IADYAKNKALAELQGLQNNVEDYVSALSSWQKNPVSSRNPHSQGRIRELFSQAESHFRNS ------------------------------------------------------------ MPSFAISGYEVLFLTTYAQAANTHLFLLKDAQIYGEEWGYEKEDIAEFYKRQLKLTQEYT -111122223333----------------------1111--------------------- DHCVKWYNVGLDKLRGSSYESWVNFNRYRREMTLTVLDLIALFPLYDVRLYPKEVKTELT -----------1111--------------------------------------------- RDVLTDPIVGVNNLRGYGTTFSNIENYIRKPHLFDYLHRIQFHTRFQPGYYGNDSFNYWS -------------iiii--33331111---------------------1111-------- GNYVSTRPSIGSNDIITSPFYGNKSSEPVQNLEFNGEKVYRAVANTNLAVWPSAVYSGVT ------------------------------------------------------------ KVEFSQYNDQTDEASTQTYDSKRNVGAVSWDSIDQLPPETTDEPLEKGYSHQLNYVMCFL ------------------------------3333---------3333------------- 
MQGSRGTIPVLTWTHKSVDFFNMIDSKKITQLPLVKAYKLQSGASVVAGPRFTGGDIIQC 2222----------33331111--1111-----1111---%%%%---------------- TENGSAATIYVTPDVSYSQKYRARIHYASTSQITFTLSLDGAPFNQYYFDKTINKGDTLT --------------------------------------iiii-----------2222--1 YNSFNLASFSTPFELSGNNLQIGVTGLSAGDKVYIDKIEFIPVN 111------------------------2222------------- ------------ >ANTI-DANSYL IMMUNOGLOBULI; SWP:NA; PDB:1DLFH; EVKLEESGGGLVQPGGSMKLSCATSGFTFSDAWMDWVRQSPEKGLEWVAEIRNKA ------------2222-----------3333------------------------ >UDP-GLUCOSE DEHYDROGENASE; SWP:Q07172; PDB:1DLJA; MKIAVAGSGYVGLSLGVLLSLQNEVTIVDILPSKVDKINNGLSPIQDEYIEYYLKSKQLS --------3333-----1111----------------1111------------------- IKATLDSKAAYKEAELVIIATPTNYNSRINYFDTQHVETVIKEVLSVNSHATLIIKSTIP --------------------------1111---3333----------------------2 IGFITEMRQKFQTDRIIFSPEFLRESKALYDNLYPSRIIVSCEENDSPKVKADAEKFALL 222--------------------2222-3333----------1111-------------- LKSAAKKNNVPVLIMGASEAEAVKLFANTYLALRVAYFNELDTYAESRKLNSHMIIQGIS ------------------------------------------------------------ YDDRIGMHYNNPSFGYGGYSLPKDTKQLLANYNNIPQTLIEAIVSSNNVRKSYIAKQIIN -1111------------------------3333-----3333------------------ VLKEQESPVKVVGVYRLIMKSNSDNFRESAIKDVIDILKSKDIKIIIYEPMLNKLESEDQ -1111--------------2222--2222-------------------3333---1111- SVLVNDLENFKKQANIIVTNRYDNELQDVKNKVYSRDIFGRD ----------------------1111--1111---------- >LECTIN SCAFET PRECURSOR; SWP:Q9ZP48; PDB:1DLPA; NNILFGLSHEGSHPQTLHAAQSLELSSFRFTMQSDCNLVLFDSDVRVWASNTAGATGCRA ------------------------------------------------------------ VLQSDGLLVILTAQNTIRWSSGTKGSIGNYVLVLQPDRTVTIYGPGLWDSGTSNKGSVVV ------------------------------------------------------------ ANNGNSILYSTNDNHPQTLHATQSLQLSPYRLSMETDCNLVLFDRDDRVWSTNTAGKGTG ----------------------------------1111---------------------- CRAVLQPNGRMDVLTNQNIAVWTSGNSRSAGRYVFVLQPDRNLAIYGGALWTT ----------------------------------------------------- >HEMOGLOBIN; SWP:P15160; PDB:1DLWA; SLFEQLGGQAAVQAVTAQFYANIQADATVATFFNGIDMPNQTNKTAAFLCAALGGPNAWT 
---1111---------------------33332222--------------1111------ GRNLKEVHANMGVSNAQFTTVIGHLRSALTGAGVAAALVEQTVAVAETVRGDVVTV -----------------------------1111------------3333------- >HEMOGLOBIN; SWP:Q08753; PDB:1DLYA; SLFAKLGGREAVEAAVDKFYNKIVADPTVSTYFSNTDMKVQRSKQFAFLAYALGGASEWK ---------------------333333333333---3333----------1111------ GKDMRTAHKDLVPHLSDVHFQAVARHLSDTLTELGVPPEDITDAMAVVASTRTEVLNMPQ --3333-1111--------------------1111------------------1111--- Q - >ANNEXIN XII E105K MUTANT ; SWP:P26256; PDB:1DM5A; VVQGTVKPHASFNSREDAETLRKAMKGIGTDEKSITHILATRSNAQRQQIKTDYTTLFGK ------------------------------------------3333-------------- HLEDELKSELSGNYEAAALALLRKPDEFLAEQLHAAMKGLGTDKNALIDILCTQSNAQIH ------------------3333-------------------------------------- AIKAAFKLLYKEDLEKEIISETSGNFQRLLVSMLQGGRKEDEPVNAAHAAEDAAAIYQAG -----------------1111-----------3333--1111-----------------3 EGQIGTDESRFNAVLATRSYPQLHQIFHEYSKISNKTILQAIENEFSGDIKNGLLAIVKS 333--------------------------3333--------------------------- VENRFAYFAERLHHAMKGLGTSDKTLIRILVSRSEIDLANIKETFQAMYGKSLYEFIADD -------------1111-------------1111-------------------------- CSGDYKDLLLQITGH --------------- >HYPOTHETICAL 15.5 KD PROT; SWP:P45802; PDB:1DM9A; PAVEVRLDKWLWAARFYKTRALAREMIEGGKVHYNGQRSKPSKIVELNATLTLRQGNDER ---------------------------------iiii--1111--2222----------- TVIVKAITEQRRPASEAALLYEETAESVEKREKMALARKLNALT -------------------------------------------- >CD6 METALLOTHIONEIN-1; SWP:P55949; PDB:1DMC; SPCQKCTSGCKCATKEECSKTCTKPCSCCPK -------------3333-------------- >RIBOSOMAL PROTEIN L4; SWP:P38516; PDB:1DMGA; AQVDLLNVKGEKVGTLEISDFVFNIDPNYDVMWRYVDMQLSDWSKKLNKKMKKLALRSAL ------1111--------3333-------------------------------------- SVKYRENKLLVLDDLKLERPKTKSLKEILQNLQLSDKKTLIVLPWKEEGYMNVKLSGRNL ---1111-------------3333-----11111111---------3333----1111-- PDVKVIIADNPNNSKNGEKAVRIDGLNVFDMLKYDYLVLTRDMVSKIEEVLG ---------------------------------------------------- >CATECHOL 1,2-DIOXYGENASE; SWP:P07773; PDB:1DMHA; 
VKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYL --1111--------1111-------------------------1111------------- NQLGANQEAGLLSPGLGFDHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGYAR ---1111-----3333--------------1111-------------------------- MDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANTKGFYSHFDPTGEQQAFNMRRSI -----1111----------1111-------------1111-22221111--2222----- ITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQIN --1111--------------1111------1111--------------2222-------- VAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVDNQ 2222-11111111--2222----------------------------------iiii--- VVDRPRLAV --------- >DNA POLYMERASE PROCESSIVI; SWP:P10226; PDB:1DMLA; APCQVVLQGAELNGILQAFAPLRTSLLDSLLVMGDRGILIHNTIFGEQVFLPLEHSQFSR -------------------11111111------1111------iiii------3333--- YRWRGPTAAFLSLVDQKRSLLSVFRANQYPDLRRVELAITGQAPFRTLVQRIWTTTSDGE --------------11111111------1111---------------------------- AVELASETLMKRELTSFVVLVPQGTPDVQLRLTRPQLTKVLNATGADSATPTTFELGVNG ----------------------------------------3333--1111---------- KFSVFTTSTCVTFAAREEGNAKTVYGENTHRTFSVVVDDCSMRAVLRRLQVGGGTLKFFL -------------------1111------------------------------------- TTPVPSLCVTATGPNAVSAVFLLKPQK --------------------------- >DNA polymerase; SWP:P07917; PDB:1DMLB; DDVAARLRAAGFGAVGAGATAEETRRMLHRAFDTLA -------1111------------------------- >DMSO REDUCTASE; SWP:Q52675; PDB:1DMR; LANGTVMSGSHWGVFTATVENGRATAFTPWEKDPHPSPMLAGVLDSIYSPTRIKYPMVRR ---------1111------iiii------1111---3333--------1111-------- EFLEKGVNADRSTRGNGDFVRVSWDQALDLVAAEVKRVEETYGPEGVFGGSYGWKSPGRL ----!!!!--1111----------------------------3333-------------- HNCTTLLRRMLTLAGGYVNGAGDYSTGAAQVIMPHVVGTLEVYEQQTAWPVLAENTEVMV -----------1111----------3333--3333------------------------- FWAADPIKTSQIGWVIPEHGAYPGLEALKAKGTKVIVIDPVRTKTVEFFGAEHITPKPQT ----3333----------3333------3333------------------------2222 DVAIMLGMAHTLVAEDLYDKDFIANYTSGFDKFLPYLDGETDSTPKTAEWAEGISGVPAE ------------1111------------3333---------------------------- 
TIKELARLFESKRTMLAAGWSMQRMHHGEQAHWMLVTLASMLGQIGLPGGGFGLSYHYSG -------------------3333--------------------2222-------1111-2 GGTPSTSGPALAGITDGGAATKGPEWLAASGASVIPVARVVDMLENPGAEFDFNGTRSKF 222-------------1111---------------1111------2222---iiii---- PDVKMAYWVGGNPFVHHQDRNRMVKAWEKLETFVVHDFQWTPTARHADIVLPATTSYERN -----------1111-----------3333----------3333----------1111-- DIETIGDYSNTGILAMKKIVEPLYEARSDYDIFAAVAERLGKGAEFTEGKDEMGWIKSFY ----------------------!!!!-----------1111-3333%%%%---------- DDAAKQGKAAGVQMPAFDAFWAEGIVEFPVTDGADFVRYASFREDPLLNPLGTPTGLIEI -------1111--------------------1111-2222------------1111---- YSKNIEKMGYDDCPAHPTWMEPLERLDGPGAKYPLHIAASHPFNRLHSQLNGTVLREGYA --3333---1111-----------2222-----------------!!!!----3333--- VQGHEPCLMHPDDAAARGIADGDVVRVHNDRGQILTGVKVTDAVMKGVIQIYEGGWYDPS iiii----------1111-2222-----1111------------2222------------ DVTEPGTLDKYGDVNVLSADIGTSKLAQGNCGQTVLAEVEKYTGPAVTLTGFVAPKAAE 1111--------------------------1111--------------------3333- >BGLI RESTRICTION ENDONUCL; SWP:O68557; PDB:1DMUA; MYNLHREKIFMSYNQNKQYLEDNPEIQEKIELYGLNLLNEVISDNEEEIRADYNEANFLH 1111------------------------------------------------------33 PFWMNYPPLDRGKMPKGDQIPWIEVGEKAVGSKLTRLVSQREDITVREIGLPTGPDERYL 331111---------------3333---------------1111---------------- LTSPTIYSLTNGFTDSIMMFVDIKSVGPRDSDYDLVLSPNQVSGNGDWAQLEGGIQNNQQ -------1111---------------1111-------1111--------1111------- TIQGPRSSQIFLPTIPPLYILSDGTIAPVVHLFIKPIYAMRSLTKGDTGQSLYKIKLASV --------------------1111---------------3333----------------- PNGLGLFCNPGYAFDSAYKFLFRPGKDDRTKSLLQKRVRVDLRVLDKIGPRVMTIDMDK ----------33333333---------11111111--------3333------------ >IGM-KAPPA COLD AGGLUTININ; SWP:Q6GMV9; PDB:1DN0A; EIVLTQSPATLSLSPGERATLSCGASQSVSSNYLAWYQQKPGQAPRLLIYDASSRATGIP -------------2222-----------2222-------2222------------2222- DRFSGSGSGTDFTLTISRLEPEDFAVYYCQQYGSSPLTFGGGTKVEIKRTVAAPSVFIFP -------!!!!--------1111------------------------------------- PSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTL 
-33333333---------------------%%%%-------------------------- TLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC ----3333----------1111--------2222- >IGM-KAPPA COLD AGGLUTININ; SWP:NA; PDB:1DN0B; EVQLQQWGAGLLKPSETLSLTCAVYGGSFSDYYWSWIRQPPGKGLEWIGEINHSGSTNYN ------------2222-----------------------2222--------1111----3 PSLKSRVTISVDTSKNQFSLKLSSVTAADTAVYYCARPPHDTSGHYWNYWGQGTLVTVSS 3331111------------------3333------------------------------- GSASAPTLFPLVSCTSSVAVGCLAQDFLPDSITFSWKYKNNSDISSTRGFPSVLRGGKYA -------------------------------------1111-------------iiii-- ATSQVLLPSKDVTDEHVVCKVQHPNGNKEKNVPLPV ----------------------1111---------- >SYNTAXIN BINDING PROTEIN ; SWP:Q64320; PDB:1DN1A; IGLKAVVGEKIMHDVIKKVKKKGEWKVLVVDQLSMRMLSSCCKMTDIMTEGITIVEDINK -3333----------3333-2222------------3333--33331111------1111 RREPLPSLEAVYLITPSEKSVHSLISDFKDPPTAKYRAAHVFFTDSCPDALFNELVKSRA ----3333--------3333---------3333------------------------333 AKVIKTLTEINIAFLPYESQVYSLDSADSFQSFYSPHKAQMKNPILERLAEQIATLCATL 3---------------------------------3333---------------------- KEYPAVRYRGEYKDNALLAQLIQDKLDAYKADDPTMGEGPDKARSQLLILDRGFDPSSPV --------33333333----------------1111---3333-------3333--1111 LHELTFQAMSYDLLPIENDVYKYETSGIGEARVKEVLLDEDDDLWIALRHKHIAEVSQEV ----------------iiii-------------------------------3333----- TRSLKDFSSSKRMMRDLSQMLKKMPQYQKELSKYSTHLHLAEDCMKHYQGTVDKLCRVEQ -------------------33333333-3333---------------------------- DLAMGTDAEGEKIKDPMRAIVPILLDANVSTYDKIRIILLYIFLKNGITEENLNKLIQHA ------3333-------------------------------------------------- QIPPEDSEIITNMAHLGVPIVTDSTLRRRSKPERKERISEQTYQLSRWTPIIKDIMEDTI --3333-----------------3333----------------------3333-----11 EDKLDTKHYPYISTRRSGPRLIIFILGGVSLNEMRCAYEVTQANGKWEVLIGSTHILTPQ 11--1111---------------------------------------------------- KLLDTLKKLNKTDEEI -----1111------- >PYRIDOXINE 5'-PHOSPHATE O; SWP:P28225; PDB:1DNLA; GGLRRRDLPADPLTLFERWLSQACEAKLADPTAVVATVDEHGQPYQRIVLLKHYDEKGVF ---3333---3333---------1111--1111-----1111------------1111-- 
YTNLGSRKAHQIENNPRVSLLFPWHTLERQVVIGKAERLSTLEVKYFHSRPRDSQIGAWV --1111-----------------3333------------3333-3333--------3333 SKQSSRISARGILESKFLELKQKFQQGEVPLPSFWGGFRVSLEQIEFWQGGEHRLHDRFL -2222---3333-------------------1111---------------2222------ YQRENDAWKIDRLAP ---%%%%-------- >DNA PHOTOLYASE; SWP:P00914; PDB:1DNPA; TTHLVWFRQDLRLHDNLALAAACRNSSARVLALYIATPRQWATHNMSPRQAELINAQLNG --------------------11111111-------------1111--------------- LQIALAEKGIPLLFREVDDFVASVEIVKQVCAENSVTHLFYNYQYEVNERARDVEVERAL -----1111---------3333---------1111------------------------1 RNVVCEGFDDSVILPPGAVMTGNHEMYKVFTPFKNAWLKRLREGMPECVAAPKVRSSGSI 111-----------2222--1111----------------1111----------3333-- EPSPSITLNYPRQSFDTAHFPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSAS ---------------3333---------------------33331111------------ LATGGLSPRQCLHRLLAEQPQALDGGAGSVWLNELIWREFYRHLITYHPSLCKHRPFIAW ------------------1111---2222------------------------------- TDRVQWQSNPAHLQAWQEGKTGYPIVDAAMRQLNSTGWMHNRLRMITASFLVKDLLIDWR 1111------------------3333-------------------------------333 EGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIFNPTTQGEKFDHEGEFIRQWLP 3--------11113333------1111------1111--3333-----11113333--33 ELRDVPGKVVHEPWKWAQKAGVTLDYPQPIVEHKEARVQTLAAYEAARK 33----3333---------------------3333-------------- >GALLERIA MELLONELLA DENSO; SWP:Q90125; PDB:1DNV; VYIIPRPFSNFGKKLSTYTKSHKFMIFGLANNVIGPTGTGTTAVNRLLTTCLAEIPWQKL ----------------------------------------------------------33 PLYMNQSEFDLLPPGSRVVECNVKVIFRTNRIAFETSSTVTKQATLNQISNVQTAIGLNK 33---------------------------------------------------------- LGWGINRAFTAFQSDQPMIPTATTAPKYEPVTGDTGYRGMIADYYGADSTNDTAFGNAGN --------------------------------------------------3333------ YPHHQVSSFTFLQNYYCMYQQTNQGTGGWPCLAEHLQQFDSKTVNNQCLIDVTYKPKMGL --3333------------------------3333-------------------------- IKSPLNYKIIGQPTVKGTISVGDNLVNMRGAVVTNPPEATQNVAESTHNLTRNFPADLFN ------------------------------------------------------------ IYSDIEKSQVLHKGPWGHENPQIQPSVHIGIQAVPALTTGALLINSSPLNSWTDSMGYID 
------------------------------------------------------------ VMSSCTVMEAQPTHFPFSTEANTNPGNTIYRINLTPNSLTSAFNGLYGNGATLGN -----------------------3333---------------------------- >NON-RIBOSOMAL PEPTIDE SYN; SWP:O30409; PDB:1DNYA; YVAPTNAVESKLAEIWERVLGVSGIGILDNFFQIGGHSLKAMAVAAQVHREYQVELPLKV -----------------1111--------------------------------------- LFAQPTIKALAQYVAT ---------------- >HEAT SHOCK LOCUS U; SWP:P32168; PDB:1DO0A; SEMTPREIVSELDKHIIGQDNAKRSVAIALRNRWRRMQLNEELRHEVTPKNILMIGPTGV -----------------------------------11113333----------------- GKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSIIRDLTDAAVKMVRVQAIEKNRY ----------3333-------3333--------3333----------------------- RAEELAEERILDVLIPPAKNNWGQTEQQQEPSAARQAFRKKLREGQLDDKEIEIDARKLK ----------3333-----------1111------------------------------- IKDAMKLLIEEEAAKLVNPEELKQDAIDAVEQHGIVFIDEIDKICKRGESSGPDVSREGV -------------33333333------------------1111-------3333------ QRDLLPLVEGCTVSTKHGMVKTDHILFIASGAFQIAKPSDLIPELQGRLPIRVELQALTT -----3333------------1111-----------3333-33331111----------- SDFERILTEPNASITVQYKALMATEGVNIEFTDSGIKRIAEAAWQVNESTENIGARRLHT --------------------------------3333----------------!!!!---- VLERLMEEISYDASDLSGQNITIDADYVSKHLDALVADEDLSRFIL ------------1111------------------------------ >HUMAN COPPER CHAPERONE FO; SWP:O14618; PDB:1DO5A; QNLGAAVAILGGPGTVQGVVRFLQLTPERCLIEGTIDGLEPGLHGLHVHQYGDLTNNCNS -------------------------1111--------------------------!!!!- CGNHFNPDGASHGGPQDSDRHRGDLGNVRADADGRAIFRMEDEQLKVWDVIGRSLIIDEG -----1111----1111---1111------1111-----------33332222------- EDDLGRGGHPLSKITGNSGERLACGIIARSAGLF --%%%%--1111---------------------- >CYTOCHROME B5; SWP:P00169; PDB:1DO9A; DKDVKYYTLEEIKKHNHSKSTWLILHHKVYDLTKFLEEHPGGEEVLREQAGGDATENFED -------3333---------------------1111------3333--2222-3333-33 VGHSTDARELSKTFIIGELHPDDRSKLSKPMETL 33-------3333------1111----------- >Rho GDP-dissociation inhi; SWP:P19803; PDB:1DOAB; EPTAEQLAQIAAENEEDEHSVNYKPPAQKSIQEIQELDKDDESLRKYKEALLGRVAVSAD 
-----3333------------------------11113333------------------- PNVPNVVVTRLTLVCSTAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNREIVS --------------1111-----------3333--------------------------- GMKYIQHTYRKGVKIDKTDYMVGSYGPRAEEYEFLTPMEEAPKGMLARGSYNIKSRFTDD ---------iiii-----------------------------3333------------11 DRTDHLSWEWNLTIKKEWKD 11------------------ >ADENYLOSUCCINATE LYASE; SWP:Q8ZY28; PDB:1DOFA; HVSPFDWRYGSEEIRRLFTNEAIINAYLEVERALVCALEELGVAERGCCEKVNKASVSAD --1111----33331111--------------------1111--2222---------333 EVHDILSLVLLLEQKSGCRYVHYGATSNDIIDTAWALLIRRALAAVKEKARAVGDQLASM 3------------------2222--3333------------------------------- ARKYKTLEMVGRTHGQWAEPITLGFKFANYYYELYIACRQLALAEEFIRAKIGGAVGTMA ---1111-----iiii----------------------------1111----------33 SWGELGLEVRRRVAERLGLPHHVITTQVAPRESFAVLASALALMAAVFERLAVEIRELSR 331111--------1111-----------------------------------------1 PEIGEVVEGGANPTASERIVSLARYVRALTHVAFENVALWHERDLTNSANERVWIPEALL 111------------------------------3333--!!!!-1111------------ ALDEILTSALRVLKNVYIDEERITENLQKALPYILTEFHMNRMIKEGASRAEAYKKAKEV ------------1111-------------3333-3333-----1111---------1111 KALTFEYQKWPVERLIEDALSLKLC ----3333----------1111--- >2FE-2S FERREDOXIN; SWP:P00217; PDB:1DOI; PTVEYLNYEVVDDNGWDMYDDDVFGEASDMDLDDEDYGSLEVNEGEYILEAAEAQGYDWP ----------------1111-3333-1111--3333------2222------1111---- FSCRAGACANCAAIVLEGDIDMDMQQILSDEEVEDKNVRLTCIGSPDADEVKIVYNAKHL --------1111---------------------------3333-------------1111 DYLQNRVI 1111---- >MONOCYTE CHEMOATTRACTANT ; SWP:P13500; PDB:1DOKA; MQPDAINAPVTCCYNFTNRKISVQRLASYRRITSSKCPKEAVIFKTIVAKEICADPKQKW -3333----------------3333--------3333--------1111-----1111-- VQDSMDHLDKQT ------------ >RNA POLYMERASE ALPHA SUBU; SWP:Q9Z9H6; PDB:1DOQA; EQEEELDLPLEELGLSTRVLHSLKEEGIESVRALLALNLKDLKNIPGIGERSLEEIKEAL --------3333---3333------------------11113333---3333-------- EKKGFTLKE --------- >ALDOLASE CLASS II; SWP:P11604; PDB:1DOSA; SKIFDFVKPGVITGDDVQKVFQVAKENNFALPAVNCVGTDSINAVLETAAKVKAPVIVQF 
-1111-------!!!!-------------------------------------------- SNGGASFIAGKGVKSDVPQGAAILGAISGAHHVHQMAEHYGVPVILHTAKKLLPWIDGLL ------3333------2222---------------3333---------3333-------- DAGEKHFAATGKPLFSSHMSEESLQENIEICSKYLERMSKIGMTLEGCTGGEEDGVDNSH --------------------------------------1111------------------ MDASALYTQPEDVDYAYTELSKISPRFTIAASFGNVYKAGNVVLTPTILRDSQEYVSKKH --------3333------------------------------------------------ NLPHNSLNFVGSGSTAQEIKDSVSYGVVKMNIDTDTQWATWEGVLNYYKANEAYLQGQLG ----------1111------------------------------------1111------ NPKGEDQPNKKYYDPRVWLRAGQTSMIARLEKAFQELNAIDVL 3333----3333-3333-----------------1111----- >ALPHA-CATENIN; SWP:P26231; PDB:1DOVA; ESQFLKEELVVAVEDVRKQGDLMKSAAGEFADDPCSSVKRGNMVRAARALLSAVTRLLIL --------------------------------1111------------------------ ADMADVYKLLVQLKVVEDGILKLRNAGNEQDLGIQYKALKPEVDKLNIMAAKRQQELKDV ----------------------1111--------------------------3333--33 GNRDQMAAARGILQKNVPILYTASQACLQHPDVAAYKANRDLIYKQLQQAVTGISNAAQA 33-----------------------33331111-----------------------1111 T - >Catenin beta-1; SWP:Q02248; PDB:1DOWB; HPTNVQRLAEPSQLKHAVVNLINYQDDAELA ---------3333------------3333-- >TRAM PROTEIN; SWP:P07294; PDB:1DP3A; AKVQAYVSDEIVYKINKIVERRRAEGAKSTDVSFSSISTMLLELGLRVYEAQMER ----------------------3333-----------------%%%%3333---- >ATRIAL NATRIURETIC PEPTID; SWP:P18910; PDB:1DP4A; SDLTVAVVLPLTNTSYPWSWARVGPAVELALARVKARPDLLPGWTVRMVLGSSENAAGVC ---------------1111-----------------33332222----------1111-- SDTAAPLAAVDLKWEHSPAVFLGPGCVYSAAPVGRFTAHWRVPLLTAGAPALGIGVKDEY -------------------------3333---------------------3333-33332 ALTTRTGPSHVKLGDFVTALHRRLGWEHQALVLYADRLGDDRPCFFIVEGLYMRVRERLN 222-----3333------------------------------------------------ ITVNHQEFVEGDPDHYPKLLRAVRRKGRVIYICSSPDAFRNLMLLALNAGLTGEDYVFFH --------11111111------------------------------1111-3333----- LDVFGQSLKSAQGLVPQKPWERGDGQDRSARQAFQAAKIITYKEPDNPEYLEFLKQLKLL -1111------!!!!--1111------------1111---------3333---------- 
ADKKFNFTVEDGLKNIIPASFHDGLLLYVQAVTETLAQGGTVTDGENITQRMWNRSFQGV -----------------------------------1111-1111----1111------11 TGYLKIDRNGDRDTDFSLWDMDPETGAFRVVLNYNGTSQELMAVSEHKLYWPLGYPPPDV 11-----------------------------------------%%%%---1111------ PKCGF 1111- ------------------------------- >MHC class II regulatory f; SWP:P22670; PDB:1DP7P; TVQWLLDNYETAEGVSLPRSTLYNHYLLHSQEQKLEPVNAASFGKLIRSVFMGLRTRRLG ------------------------------1111-------------------------- TRGNSKYHYYGLRIKA 2222------------ >DIHYDROLIPOYL-TRANSACETYL; SWP:P10802; PDB:1DPB; IPPIPPVDFAKYGEIEEVPMTRLMQIGATNLHRSWLNVPHVTQFESADITELEAFRVAQK ---------1111----------------------------------------------- AVAEKAGVKLTVLPLLLKACAYLLKELPDFNSSLAPSGQALIRKKYVHIGFAVDTPDGLL ---1111-------------------3333----1111----------------1111-- VPVIRNVDQKSLLQLAAEAAELAEKARSKKLGADAMQGACFTISSLGHIGGTAFTPIVNA -----3333----------------1111--3333----------1111----------- PEVAILGVSKASMQPVWDGKAFQPRLMLPLSLSYDCRVINGAAAARFTKRLGDLLADIRA -----------------------------------------------------------3 ILL 333 >DIPEPTIDE-BINDING PROTEIN; SWP:P23847; PDB:1DPE; KTLVYCSEGSPEGFNPQLFISGTTYDASSVPLYNRLVEFKIGTTEVIPGLAEKWEVSEDG --------------3333---------------------2222-------------1111 KTYTFHLRKGVKWHDNKEFKPTRELNADDVVFSFDRQKNAQNPYHKVSGGSYEYFEGMGL ---------------1111------3333----3333-1111-1111------------- PELISEVKKVDDNTVQFVLTRPEAPFLADLAMDFASILSKEYADAMMKAGTPEKLDLNPI ----------------------1111-----3333----------------3333----- GTGPFQLQQYQKDSRIRYKAFDGYWGTKPQIDTLVFSITPDASVRYAKLQKNECQVMPYP ----------1111------1111------------------------1111-------- NPADIARMKQDKSINLMEMPGLNVGYLSYNVQKKPLDDVKVRQALTYAVNKDAIIKAVYQ 3333-3333-1111---------------1111-1111--------------------ii GAGVSAKNLIPPTMWGYNDDVQDYTYDPEKAKALLKEAGLEKGFSIDLWAMPVQRPYNPN ii--------1111---1111--------------11111111-----------1111-- ARRMAEMIQADWAKVGVQAKIVTYEWGEYLKRAKDGEHQTVMMGWTGDNGDPDNFFATEF --------------------------------1111--------------3333------ 
SCAASEQGSNYSKWCYKPFEDLIQPARATDDHNKRVELYKQAQVVMHDQAPALIIAHSTV ---------1111--3333----------------------------------------- FEPVRKEVKGYVVDPLGKHHFENVSIE -------------1111---1111--- >GLUCOSE 6-PHOSPHATE DEHYD; SWP:P11411; PDB:1DPGA; VSEIKTLVTFFGGTGDLAKRKLYPSVFNLYKKGYLQKHFAIVGTARQALNDDEFKQLVRD ---------------3333----------3333--------------------------- CIKDFTDDQAQAEAFIEHFSYRAHDVTDAASYAVLKEAIEEAADKFDIDGNRIFYMSVAP -3333-------------------11113333--------------------------33 RFFGTIAKYLKSEGLLADTGYNRLMIEKPFGTSYDTAAELQNDLENAFDDNQLFRIDHYL 33----------------------------------------------1111----3333 GKEMVQNIAALRFGNPIFDAAWNKDYIKNVQVTLSEVLGVEERAGYYDTAGALLDMIQNH -3333------11113333---3333----------------3333-------------- TMQIVGWLAMEKPESFTDKDIRAAKNAAFNALKIYDEAEVNKYFVRAQYGAGDSADFKPY ----------------------------1111--------------------------33 LEELDVPADSKNNTFIAGELQFDLPRWEGVPFYVRSGKRLAAKQTRVDIVFKAGTFNFGS 3322221111-------------3333--------------------------------- EQEAQEAVLSIIIDPKGAIELKLNAKSVEDAFNTRTIDLGWTVSDEDKKNTPEPYERMIH ----------------------------------------------------3333---- DTMNGDGSNFADWNGVSIAWKFVDAISAVYTADKAPLETYKSGSMGPEASDKLLAANGDA ------1111------------------------------------3333----1111-- WVFKG ----- >PROTEINASE A; SWP:P07267; PDB:1DPJA; GGHDVPLTNYLNAQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHE ---------%%%%-----------------------------1111-3333------333 ASSSYKANGTEFAIQYGTGSLEGYISQDTLSIGDLTIPKQDFAEATSEPGLTFAFGKFDG 31111----------1111------------!!!!-------------33331111---- ILGLGYDTISVDKVVPPFYNAIQQDLLDEKRFAFYLGDTSKDTENGGEATFGGIDESKFK -----33332222--------1111------------1111-------------1111-- GDITWLPVRRKAYWEVKFEGIGLGDEYAELESHGAAIDTGTSLITLPSGLAEMINAEIGA ----------------------!!!!---------------------------------- KKGWTGQYTLDCNTRDNLPDLIFNFNGYNFTIGPYDYTLEVSGSCISAITPMDFPEPVGP --1111----11111111------iiii----1111----%%%%---------------- LAIVGDAFLRKYYSIYDLGNNAVGLAKAI --------3333-----1111-------- >DPS; SWP:P27430; PDB:1DPSA; 
SKATNLLYTRNDVSDSEKKATVELLNRQVIQFIDLSLITKQAHWNMRGANFIAVHEMLDG -----------------------------------------3333--2222--------- FRTALIDHLDTMAERAVQLGGVALGTTQVINSKTPLKSYPLDIHNVQDHLKELADRYAIV ------------------------------------------------------------ ANDVRKAIGEAKDDDTADILTAASRDLDKFLWFIECNIE -----3333-------------------------1111- >D-DOPACHROME TAUTOMERASE; SWP:P30046; PDB:1DPTA; PFLELDTNLPANRVPAGLEKRLCAAAASILGKPADRVNVTVRPGLAMALSGSTEPCAQLS ---------1111-2222--------------1111------------iiii-------- ISSIGVVGTAEDNRSHSAHFFEFLTKELALGQDRILIRFFPLESWQIGKIGTVMTFL -------------------------1111-1111--------1111--iiii3333- >PHOSPHOLIPASE A2; SWP:Q9DF52; PDB:1DPYA; NLIQFKNMIQCAGTRIWTAYVAYGCYCGKGGSGTPVDELDRCCYTHDHCYNEAEKIPGCN ---------------3333-----------------3333-------------------3 PNIKTYSYTCTQPNLTCTDSADTCAQFLCECDRTAAICFASAPYNSNNIMLSSTSCQ 333-----------------------------------------3333----3333- >ENDONUCLEASE; SWP:P95484; PDB:1DQ3A; CIDGKAKIIFENEGEEHLTTMEEMYERYKHLGEFYDEEYNRWGIDVSNVPIYVKSFDPES --1111---------------------3333-----1111-----1111----------- KRVVKGKVNVIWKYELGKDVTKYEIITNKGTKILTSPWHPFFVLTPDFKIVEKRADELKE ----------------1111------3333-----1111-----1111-----1111-22 GDILIGGMPDGEDYKFIFDYWLAGFIAGDGCFDKYHSHVKGHEYIYDRLRIYDYRIETFE 22------------------------------------2222------------3333-- IINDYLEKTFGRKYSIQKDRNIYYIDIKARNITSHYLKLLEGIDNGIPPQILKEGKNAVL ----------------------------3333-----11113333--3333----3333- SFIAGLFDAEGHVSNKPGIELGMVNKRLIEDVTHYLNALGIKARIREKLRKDGIDYVLHV ------------------------------------1111-------------------- EEYSSLLRFYELIGKNLQNEEKREKLEKVLSNHKGGNFGLPLNFNAFKEWASEYGVEFKT ------------3333---------------------------------3333------- NGSQTIAIINDERISLGQWHTRNRVSKAVLVKMLRKLYEATKDEEVKRMLHLIEGLEVVR !!!!----iiii-----3333-------------------------------1111---- HITTTNEPRTFYDLTVENYQNYLAGENGMIFVHN ---------------------------------- >HMG-COA REDUCTASE; SWP:P04035; PDB:1DQAA; LSDAEIIQLVNAKHIPAYKLETLIETHERGVSIRRQLLSKKLSEPSSLQYLPYRDYNYSL 
---------------33331111--------------3333--11111111-----3333 VMGACCENVIGYMPIPVGVAGPLCLDEKEFQVPMATTEGCLVASTNRGCRAIGLGGGASS 2222--------------------%%%%------------------------1111---- RVLADGMTRGPVVRLPRACDSAEVKAWLETSEGFAVIKEAFDSTSRFARLQKLHTSIAGR -----------------------------------------1111------------!!! NLYIRFQSRSGDAMGMNMISKGTEKALSKLHEYFPEMQILAVSGNYCTDKKPAAINWIEG !--------!!!!--------------------1111---------------3333---- RGKSVVCEAVIPAKVVREVLKTTTEAMIEVNINKNLVGSAMAGSIGGYNAHAANIVTAIY --------------------------------------------------3333------ IACGQDAAQNVGSSNCITLMEASGPTNEDLYISCTMPSIEIGTVGGGTNLLPQQACLQML -----------1111--------1111-----------------!!!!--3333---111 GVQGACKDNPGENARQLARIVCGTVMAGELSLMAALAAGHLVKSHMIHN 1----3333---------------------------------------- >TACHYCITIN; SWP:P91818; PDB:1DQCA; YLAFRCGRYSPCLDDGPNVNLYSCCSFYNCHKCLARLENCPKGLHYNAYLKVCDWPSKAG ------1111------------------------------%%%%--1111----1111-- CTSVNKECHLWKT ------------- >FAB HGR-2 F6; SWP:NA; PDB:1DQDH; EVQLQESGPSLVKPSQTLSLTCSVTGDSITSGYWNWIRKFPGNKLEYMGYISYSGSTYYN ------------2222-----------------------2222--------1111----3 PSLKSRLSITRDTSRNQYYLQLKSVTPEDTATYYCASPPGYYGSGPYAMDYWGQGTSVTV 333----------------------1111------------------------------- SSAKTTPPSVYPLAPGSAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQS -----------------------------------------%%%%--1111-------ii DLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKISPG ii---------1111-----------3333----------- >EG628498 protein; SWP:A0A5E0; PDB:1DQDL; DIVLSQSPAIMSASPGEKVTITCSASSSVSYMHWFQQKPGTSPKLCIYTTSNLASGVPAR -------------2222--------------------2222------------2222111 FSGSGSGTSYSLTISRMEAEDAATYYCQQRSTYPPTFGSGTKLEIKRADAAPTVSIFPPS 1----------------3333--------------------------------------3 SEQLTSGGASVVCFLNNFYPRDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL 333-------------------------%%%%---------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFNRNECA 3333------------------------3333-- >PHEROMONE-BINDING PROTEIN; SWP:P34174; PDB:1DQEA; 
SQEVMKNLSLNFGKALDECKKEMTLTDAINEDFYNFWKEGYEIKNRETGCAIMCLSTKLN --------------------1111--3333----1111------3333--------1111 MLDPEGNLHHGNAMEFAKKHGADETMAQQLIDIVHGCEKSTPANDDKCIWTLGVATCFKA --1111-----------1111--------------------------------------- EIHKLNWAPSMDVAVGE --1111-------2222 >MANNOSE RECEPTOR; SWP:Q61830; PDB:1DQGA; DARQFLIYNEDHKRCVDALSAISVQTATCNPEAESQKFRWVSDSQIMSVAFKLCLGVPSK 3333-----1111------1111------11111111-----------1111-------- TDWASVTLYACDSKSEYQKWECKNDTLFGIKGTELYFNYGNRQEKNIKLYKGSGLWSRWK 2222-------1111--------iiii--2222-------%%%%---------1111--- VYGTTDDLCSRGYE 2222----1111-- >SUPEROXIDE REDUCTASE; SWP:P82385; PDB:1DQIA; MISETIRSGDWKGEKHVPVIEYEREGELVKVKVQVGKEIPHPNTTEHHIRYIELYFLPEG 3333--------------------!!!!---------------1111----------222 ENFVYQVGRVEFTAHGESVNGPNTSDVYTEPIAYFVLKTKKKGKLYALSYCNIHGLWENE 2----------------1111--------------------------------------- VTLE ---- >Lysozyme C [Precursor]; SWP:P00698; PDB:1DQJB; EVQLQESGPSLVKPSQTLSLTCSVTGDSVTSDYWSWIRKFPGNKLEYMGYISYSGSTYYH ------------2222-----------1111--------3333--------1111----- PSLKSRISITRDTSKNQYYLQLNSVTTEDTATYYCASWGGDVWGAGTTVTVSSAKTTAPS ------------1111---------3333-------3333-------------------- VYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSS ----------------------------------iiii---------------------- VTVTSSTWPSQSITCNVAHPASSTKVDKKI ------------------3333-------- >IGM MEZ IMMUNOGLOBULIN; SWP:NA; PDB:1DQLH; VQLVESGGGLVQPGGSLRLSCAASGFTFSSYAMHWVRQAPGKGLEWVAVISSDGGNKYYT --------------------------3333--------2222--------1111-----3 DSVKGRFTISRNDSKNTLYLQMNSLRTEDTAVFYCARGNPPYSSGWGGGDYWGQGTMVTV 333----------------------1111--------------1111------------- SS -- >IGKC protein; SWP:Q6GMW1; PDB:1DQLL; DIQMTQSPSSLSASVGDRVTITCRASQDIRNDLGWYQQKPGKAPKKLIYAASSLQSGVPS --------------------------------------2222------------222233 RFSGSGSGTDFTLTISSLQPEDFATYYCLQQNSNWTFGQGTKVDIK 33----------------1111------------------------ >GUANINE PHOSPHORIBOSYLTRA; SWP:Q24973; PDB:1DQPA; 
MICSVTGKPVKDVLSTFFKDRNDVLESEVKKFHLLATFEECKALAADTARRMNEYYKDVA ------------------------33333333-----------------------1111- EPVTLVALLTGAYLYASLLTVHLTFPYTLHFVKVSSYKGTRQESVVFDEEDLKQLKEKRE ---------1111-----3333----------------1111------------1111-- VVLIDEYVDSGHTIFSIQEQIKHAKICSCFVKDVDAIKKHSALADTKMFYGYTPMPKGSW --------------------1111---------------3333------------2222- LIGFGLDDNGLRRGWAHLFDINLSESEVTEFRRRLTEHIKGLNINGVNRY --iiii-iiii1111-----------------------1111-2222--- >ANTI-LYSOZYME ANTIBODY HY; SWP:NA; PDB:1DQQB; EVQLQESGPSLVKPSQTLSLTCSVTGDSVTSDYWSWIRKFPGNKLEYMGYISYSGSTYYH ------------1111-----------1111--------------------1111----3 PSLKSRISITRDTSKNQYYLQLNSVTTEDTATYYCASWGGDVWGAGTTVTVSSAKTTAPS 333----------------------3333-------1111-------------------- VYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSS ------1111------------------------iiii---------------------- VTVTSSTWPSQSITCNVAHPASSTKVDKKI -------------------1111------- >CYTOTOXIC T LYMPHOCYTE AS; SWP:P09793; PDB:1DQTA; IQVTQPSVVLASSHGVASFPCEYSPSHNTDEVRVTVLRQTNDQMTEVCATTFTEKNTVGF -----------1111-----------1111---------2222----------------1 LDYPFCSGTFNESRVNLTIQGLRAVDTGLYLCKVELMYPPPYFVGMGNGTQIYVIDP 111-------2222--------3333------------------------------- >ISOCITRATE LYASE; SWP:P28298; PDB:1DQUA; SYIEEEDQRYWDEVAAVKNWWKDSRWRYTKRPFTAEQIVAKRGNLKIEYPSNVQAKKLWG -----------------3333-3333-------33331111-------3333-------- ILERNFKNKEASFTYGCLDPTMVTQMAKYLDTVYVSGWQSSSTASSTDEPSPDLADYPMN ------------------3333--3333-------3333-----1111---------333 TVPNKVNHLWMAQLFHDRKQREERMTTPKDQRHKVANVDYLRPIIADADTGHGGLTAVMK 3--------------------------33331111------------!!!!--3333--- LTKLFVERGAAGIHIEDQAPGTKGKVLVPISEHINRLVAIRAQADIMGTDLLAIARTDSE ------------------------------------------------------------ AATLITSTIDHRDHPFIIGSTNPDIQPLNDLMVMAEQAGKNGAELQAIEDEWLAKAGLKL ---------11111111----1111----------------------------------- FNDAVVDAINNSPLPNKKAAIEKYLTQSKGKSNLEARAIAKEIAGTDIYFDWEAPRTREG 
----------------3333-------2222---------------------11111111 YYRYQGGTQCAINRAVAYAPFADLIWMESKLPDYKQAKEFADGVHAVWPEQKLAYNLSPS -----------------1111--------------------------------------- FNWKKAMPRDEQETYIKRLGALGYAWQFITLAGLHTTALISDTFAKAYAKQGMRAYGELV -3333------------3333-----------------------------!!!!------ QEPEMANGVDVVTHQKWSGANYVDNMLKMITGG 33331111-33333333---------------- >SYNAPTOTAGMIN III; SWP:P40748; PDB:1DQVA; GAPCGRISFALRYLYGSDQLVVRILQALDLPAKDSNGFSDPYVKIYLLPDRKKKFQTKVH ---------------------------------3333----------------------- RKTLNPIFNETFQFSVPLAELAQRKLHFSVYDFDRFSRHDLIGQVVLDNLLELAEQPPDR --------------------------------------------------3333------ PLWRDILEGGSEKADLGELNFSLCYLPTAGLLTVTIIKASNLKAMDLTGFSDPYVKASLI ------------------------------------------------------------ SEGRRLKKRKTSIKKNTLNPTYNEALVFDVAPESVENVGLSIAVVDYDCIGHNEVIGVCR ---1111------------------------3333------------------------- VGPEAADPHGREHWAEMLANPRKPVEHWHQLVEEK ---33333333--1111------------------ >OROTIDINE 5'-PHOSPHATE DE; SWP:P03962; PDB:1DQWA; MHKATYKERAATHPSPVAAKLFNIMHEKQTNLCASLDVRTTKELLELVEALGPKICLLKT --------------------------------------------------3333------ HVDILTDFSMEGTVKPLKALSAKYNFLLFEDRKFADIGNTVKLQYSAGVYRIAEWADITN 1111----3333--------------------------------------3333------ AHGVVGPGIVSGLKQAAEEVTKEPRGLLMLAELSCKGSLSTGEYTKGTVDIAKSDKDFVI --3333----------------------------2222---------------------- GFIAQRDMGGRDEGYDWLIMTPGVGLDDKGDALGQQYRTVDDVVSTGSDIIIVGRGLFAK ---------3333-------------3333-------------1111------3333--- GRDAKVEGERYRKAGWEAYLRRCGQQD ---------------------1111-- >ANTIGEN 85-C; SWP:P31953; PDB:1DQZA; RPGLPVEYLQVPSASMGRDIKVQFQGGGPHAVYLLDGLRAQDDYNGWDINTPAFEEYYQS -------------1111-------------------1111----3333--------2222 GLSVIMPVGGQSSFYTDWYQPSQSNGQNYTYKWETFLTREMPAWLQANKGVSPTGNAAVG ---------2222----------------------------------------------- LSMSGGSALILAAYYPQQFPYAASLSGFLNPSESWWPTLIGLAMNDSGGYNANSMWGPSS -1111----------3333----------1111-----------------3333---111 
DPAWKRNDPMVQIPRLVANNTRIWVYCGNGTPSDLGGDNIPAKFLEGLTLRTNQTFRDTY 1-------3333--------------------3333------------------------ AADGGRNGVFNFPPNGTHSWPYWNEQLVAMKADIQHVLNG 1111--------------3333------------------ >BETA-SPECTRIN; SWP:Q00963; PDB:1DRO; GSGTGAGEGHEGYVTRKHEWDSTTKKASNRSWDKVYMAAKAGRISFYKDQKGYKSNPELT --------------------------------------------------3333------ FRGEPSYDLQNAAIEIASDYTKKKHVLRVKLANGALFLLQAHDDTEMSQWVTSLKAQSDS ---------------------1111----------------------------------- TA -- >DIHYDRODIPICOLINATE REDUC; SWP:P04036; PDB:1DRW; HDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVT ----------1111------------2222------------------------------ VQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADI ---33331111--------------------------------------------3333- AIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHKVDAPSGTALAMGEAIAHA ----------------------------------------------------------11 LDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSRM 11-3333-------------2222--------------------2222------------ TFANGAVRSALWLSGKESGLFDMRDVLDLNNL ----------3333-------------1111- >CLAVAMINATE SYNTHASE 1; SWP:Q05581; PDB:1DS1A; TSVDCTAYGPELRALAARLPRTPRADLYAFLDAAHTAAASLPGALATALDTFNAEGSEDG ----3333-------3333--3333------------1111---------------1111 HLLLRGLPVEADADLPTTPSSTPAPEDRSLLTMEAMLGLVGRRLGLHTGYRELRSGTVYH ----------3333----------1111---------------------1111%%%%--- DVYPSPGAHHLSSETSETLLEFHTEMAYHRLQPNYVMLACSRADHERTAATLVASVRKAL --------1111-------------1111--------------1111-------333333 PLLDERTRARLLDRRMPCCVDVAFRGGVDDPGAIAQVKPLYGDADDPFLGYDRELLAPED 33--------2222------3333-----3333---------1111-------------- PADKEAVAALSKALDEVTEAVYLEPGDLLIVDNFRTTHARTPFSPRWDGKDRWLHRVYIR --------------1111-----2222----1111------------------------- TDRNGQLSGGERAGDVVAFTPRG --iiii----------------- >RAS-RELATED C3 BOTULINUM ; SWP:P15153; PDB:1DS6A; MQAIKCVVVGDGAVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDSKPVNLGLWDTAG ----------2222---------------------------------------------- 
QEDYDRLRPLSYPQTDVFLICFSLVSPASYENVRAKWFPEVRHHCPSTPIILVGTKLDLR 3333-------2222-------1111------------------1111-------3333- DDKDTIEKLKEKKLAPITYPQGLALAKEIDSVKYLECSALTQRGLKTVFDEAIRAVLCPQ ---------1111----------------------------2222-------3333---- P - >Rho GDP-dissociation inhi; SWP:P52566; PDB:1DS6B; GNYKPPPQKSLKELQEMDKDDESLIKYKKTLLGDGPVVTDPKAPNVVVTRLTLVCESAPG -----------------1111---------------------------------1111-- PITMDLTGDLEALKKETIVLKEGSEYRVKIHFKVNRDIVSGLKYVQHTYRTGVKVDKATF ----1111------------2222-------------------------iiii------- MVGSYGPRPEEYEFLTPVEEAPKGMLARGTYHNKSFFTDDDKQDHLSWEWNLSIKKEWG -----------------------3333-----------1111----------------- >ANTICANCER ANTIBODY B1; SWP:NA; PDB:1DSFH; QLVESGGGLVKPGGSLKLSCAASGFIFSDNYMYWVRQTPEKCLEWVATISDGGTYIDYSD ----------2222-----------------------1111-----------------33 SVKGRFTISRDNAKNNLYLQMSSL 33---------------------- >Glyceraldehyde-3-phosphat; SWP:P56649; PDB:1DSSG; SKIGINGFGRIGRLVLRAALEMGAQVVAVNDPFIALEYMVYMFKYDSTHGMFKGEVKAED -------------------1111-------1111------------------------%% GALVVDGKKITVFNEMKPENIPWSKAGAEYIVESTGVFTTIEKASAHFKGGAKKVIISAP %%--iiii--------3333-3333--------------3333--3333----------- SADAPMFVCGVNLEKYSKDMKVVSNASTTNCLAPVAKVLHENFEIVEGLMTTVHAVTATQ -------22223333-1111-------3333-------------------------1111 KTVDGPSAKDWRGGRGAAQNIIPSSTGAAKAVGKVIPELDGKLTGMAFRVPTPNVSVVDL ------11113333-1111------33333333--3333--------------------- TVRLGKECSYDDIKAAMKAASEGPLQGVLGYTEDDVVSCDFTGDNRSSIFDAKAGIQLSK --------3333----------1111----------33332222------3333------ TFVKVVSWYDNEFGYSQRVIDLIKHMQKVDSA ---------3333------------------- >NUCLEIC ACID BINDING PROT; SWP:P11284; PDB:1DSVA; PPGLCPRCKKGYHWKSECKSKFDKDGNPLPP -------------3333------%%%%---- >KV1.2 VOLTAGE-GATED POTAS; SWP:P15386; PDB:1DSXA; ERVVINISGLRFEVQLKTLAQFPETLLGDPKKRMRYFDPLRNEYFFDRNRPSFDAILYYY ------iiii-----------11111111-3333----1111-------3333------- QSGGRLRRPVNVPLDIFSEEIRFYELG --------1111--------------- >PROTEIN KINASE C, ALPHA T; SWP:P05696; 
PDB:1DSYA; TEKRGRIYLKAEVTDEKLHVTVRDAKNLIPMDPNGLSDPYVKLKLIPDPKNESKQKTKTI -------------------------------1111------------1111--------- RSTLNPQWNESFTFKLKPSDKDRRLSVEIWDWDRTTRNDFMGSLSFGVSELMKMPASGWY ----------------1111--------------------------3333---------- KLLNQEEGEYYNVPIPE ---3333---------- >RETINOIC ACID RECEPTOR AL; SWP:P10276; PDB:1DSZA; PCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKNMVYTCHRDKNCIINKVTRNRCQYCRLQ ------------iiii-------------1111-------------1111---------- KCFEVGMSKESVRND -------1111---- >SUPEROXIDE DISMUTASE; SWP:P09223; PDB:1DT0A; AFELPPLPYAHDALQPHISKETLEFHHDKHHNTYVVNLNNLVPGTEFEGKTLEEIVKTSS ---------1111----------------------------2222-1111---------- GGIFNNAAQVWNHTFYWNCLSPNAGGQPTGALADAINAAFGSFDKFKEEFTKTSVGTFGS ------------------------------------------------------------ GWGWLVKKADGSLALASTIGAGCPLTIGDTPLLTCDVWEHAYYIDYRNLRPKYVEAFWNL -------1111-----------3333-----------33333333!!!!3333---1111 VNWAFVAEQFEGKTYKV ----------------- >NEURO-ONCOLOGICAL VENTRAL; SWP:P51513; PDB:1DT4A; MKDVVEIAVPENLVGAILGKGGKTLVEYQELTGCRIQISKKGEFLPGTRNRKVTITGTPA ---------1111------iiii------------------------------------- ATQAAQYLITQRI --------3333- >PROTEIN (EUKARYOTIC PEPTI; SWP:P46055; PDB:1DT9A; PSAADRNVEIWKIKKLIKSLEAARGNGTSMISLIIPPKDQISRVAKMLADEFGTASNIKS ---------------------------------------3333-----------3333-- RVNRLSVLGAITSVQQRLKLYNKVPPNGLVVYCGTIVTEEGKEKKVNIDFEPFKPINTSL ----------------3333------------------%%%%------------------ YLCDNKFHTEALTALLSDDSKFGFIVIDGSGALFGTLQGNTREVLHKFTVDLPKKHGRGG --------33333333-------------------------------------------- QSALRFARLRMEKRHNYVRKVAETAVQLFISGDKVNVAGLVLAGSADFKTELSQSDMFDQ --3333------------------1111----------------%%%%------------ RLQSKVLKLVDISYGGENGFNQAIELSTEVLSNVKFIQEKKLIGRYFDEISQDTGKYCFG 3333--------------------------------------------3333-------- VEDTLKALEMGAVEILIVYENLDIMRYVLILYLTPEQEKDKSHFTESMPLLEWFANNYKK ------------------------------------------------------------ FGATLEIVTDKSQEGSQFVKGFGGIGGILRYRVDFQGM 
-------------------------------------- >Metallocarboxypeptidase i; SWP:P81511; PDB:1DTDB; DESFLCYQPDQVCCFICRGAAPLPSEGECNPHPTAPWCREGAVEWVPYSTGQCRTTCIPY ----------------------3333--------3333---------!!!!--------- V - >RNA-BINDING NEUROONCOLOGI; SWP:Q9UNW9; PDB:1DTJA; MKELVEMAVPENLVGAILGKGGKTLVEYQELTGARIQISKKGEFLPGTRNRRVTITGSPA ---------11113333------------------------------------------- ATQAAQYLISQRVT --------3333-- >DENDROTOXIN K; SWP:P00981; PDB:1DTK; AAKYCKLPLRIGPCKRKIPSFYYKWKAKQCLPFDYSGCGGNANRFKTIEECRRTCVG -3333------------------1111------------------------------ >CARDIAC TROPONIN C; SWP:P09860; PDB:1DTLA; YKAAVEQLTEEQKNEFKAAFDIFVLGAEDGSISTKELGKVMRMLGQNPTPEELQEMIDEV 3333----------------3333--2222-----------1111--------------- DEDGSGTVDFDEFLVMMVRSMKKSEEELSDLFRMFDKNADGYIDLEELKIMLQATTITED 1111-------------------------------1111----333333332222----- DIEELMKDGDKNNDGRIDYDEFLEFMKGV ---------1111---------------- >ALPHA-DENDROTOXIN; SWP:P00980; PDB:1DTX; PRRKLCILHRNPGRCYDKIPAFYYNQKKKQCERFDWSGCGGNSNRFKTIEECRRTCIG --3333-------------------1111----------------------------- >APO LACTOFERRIN; SWP:Q9TUM0; PDB:1DTZA; ASKKSVRWCTTSPAESKKCAQWQRRMKKVRGPSVTCVKKTSRFECIQAISTEKADAVTLD -----------3333----------3333------------------3333--------- GGLVYDAGLDPYKLRPIAAEVYGTENQPQTHYYAVAIAKKGTNFQLNQLQGLKSCHTGLG -----------------------3333-----------------11112222-----222 RSAGWNIPMGLLRPFLDWTGPPEPLQKAVAKFFSASCVPCVDGKEYPNLCQLCAGTGENK 21111------3333--------3333----------2222111133331111--!!!!- CACSSQEPYFGYSGAFKCLQDGAGDVAFVKDSTVFESLPAKADRDQYELLCPNNTRKPVD ---3333-----------3333-------11111111--3333-------1111---111 AFQECHLARVPSHAVVARSVNGKEDLIWKLLVKAQEKFGRGKPSAFQLFGSPAGQKDLLF 11111------------------------------------------------------- KDSALGLLRIPKKIDSGLYLGSNYITAIRGLRETAAEVELRRAQVVWCAVGSDEQLKCQE 3333------3333--------------1111---------------------------- WSRQSNQSVVCATASTTEDCIALVLKGEADALSLDGGYIYIAGKCGLVPVLAESQQSPES --1111-----------------1111-------3333--3333---------------- 
SGLDCVHRPVKGYLAVAVVRKANDKITWNSLRGKKSCHTAVDRTAGWNIPMGPLFKDTDS ---3333-----------------------2222--------------3333-------- CRFDEFFSQSCAPGSDPRSKLCALCAGNEEGQLKCVPNSSERLYGYTGAFRCLAENVGDV ---------------1111------------------1111------------------- AFVKDVTVLDNTDGKGTEQWAKDLKLGDFELLCLNGTRKPVTEAESCHLPVAPNHAVVSR ---3333----%%%%---1111----------1111------1111-------------3 IDKVAHLRQVLLRQQAHFGRNGEDCPGKFCLFQSKTKNLLFNDNTECLAKLQGKTTYDEY 333---------------1111--3333-1111----------------------3333- LGPQYVTAIAKLRRCSTSPLLEACAFLMR -----------------3333-------- >DNA POLYMERASE III; SWP:P28689; PDB:1DU2A; MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR ---------3333---3333----------------1111----3333----3333---- LASVNLSRLPYEPKLK ---------------- >DEATH RECEPTOR 5; SWP:Q7Z360; PDB:1DU3A; SSPSEGLCPPGHHISEDGRDCISCKYGQDYSTHWNDLLFCLRCTRCDSGEVELSPCTTTR --------------1111------2222------------------1111---------- NTVCQCEEGTFREEDSPEMCRKDCTPWSDI ------2222--3333-------------- >ZEAMATIN; SWP:P33679; PDB:1DU5A; AVFTVVNQCPFTVWAASVPVGGGRQLNRGESWRITAPAGTTAARIWARTGCKFDASGRGS --------------------------2222------2222-------------1111--- CRTGDCGGVLQCTGYGRAPNTLAEYALKQFNNLDFFDISLIDGFNVPMSFLPDGGSGCSR -----iiii--------------------%%%%--------------------------- GPRCAVDVNARCPAELRQDGVCNNACPVFKKDEYCCVGSAANDCHPTNYSRYFKGQCPDA ------3333--3333-iiii---3333--3333--!!!!1111--3333------3333 YSYPKDDATSTFTCPAGTNYKVVFCP -------------------------- >HOMEOBOX PROTEIN PBX1; SWP:P41778; PDB:1DU6A; SSGHIEGRHMNKQATEILNEYFYSHLSNPYPSEEAKEELAKKCGITVSQVSNWFGNKRIR ----------------------1111-----3333----------3333----------- YKKN ---- >chimera of GLUTATHIONE S-; SWP:P08515; PDB:1DUGA; SPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGLEFPNLPYYIDG ----------1111------------------1111------1111------------11 DVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVD 11-------------1111-------------------------3333--1111------ FLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKK 
---------------2222-1111---3333-------------11111111-------- RIEAIPQIDKYLKSSKYIAWPLQGWQATFGGGDHPPKSDPQQHHLGGAKQAGDV -----3333----1111------1111--------------------------- >SPINDLE ASSEMBLY CHECKPOI; SWP:Q13257; PDB:1DUJA; GSITLRGSAEIVAEFFSFGINSILYQRGIYPSETFTRVQKYGLTLLVTTDLELIKYLNNV ---------------------------------------iiii----------------- VEQLKDWLYKCSVQKLVVVISNIESGEVLERWQFDIECDKTAKDDSAPREKSQKAIQDEI ----------------------------------------------------3333---- RSVIRQITATVTFLPLLEVSCSFDLLIYTDKDLVVPEKWEESGPQFITNSEEVRLRSFTT -----------------------------------3333--------------------- TIHKVNS ------- >4.5 S RNA DOMAIN IV; SWP:P07019; PDB:1DULA; FDLNDFLEQKVLVREAIINSTKERAKPEIIKGSRKRRIAAGSGQVQDVNRLLKQFDDQRK ----------3333-------3333--------------11113333------------- K - >DEOXYURIDINE 5'-TRIPHOSPH; SWP:P11204; PDB:1DUN; MLAYQGTQIKEKRDEDAGFDLCVPYDIMIPVSDTKIIPTDVKIQVPPNSFGWVTGKSSMA ------------1111-------------2222------------2222------33331 KQGLLINGGIIDEGYTGEIQVICTNIGKSNIKLIEGQKFAQLIILQHHSNSRQPWDENKI 111--------1111------------------2222------------------1111- >2[4FE-4S] FERREDOXIN; SWP:P00193; PDB:1DURA; AYVINDSCIACGACKPECPVNCIQEGSIYAIDADSCIDCGSCASVCPVGAPNPED ----3333---3333--1111----------3333-------------------- >MJ0882; SWP:Q58292; PDB:1DUSA; FSEKPTTKSDVKIVEDILRGKKLKFKTDSGVFSYGKVDKGTKILVENVVVDKDDDILDLG -----------------iiii------1111-2222--------------1111------ CGYGVIGIALADEVKSTTADINRRAIKLAKENIKLNNLDNYDIRVVHSDLYENVKDRKYN !!!!-----1111--------------------11111111-------!!!!-------- KIITNPPIRAGKEVLHRIIEEGKELLKDNGEIWVVIQTKQGAKSLAKYKDVFGNVETVTI -------3333---------3333--2222------------------------------ KGGYRVLKSKKL iiii-------- >Ornithine carbamoyltransf; SWP:P04391; PDB:1DUVG; SGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEAKLTGKNIALIFEKDSTRTRC ---------1111----------------------------2222--------------- SFEVAAYDQGARVTYLGPSGSQIGHKESIKDTARVLGRMYDGIQYRGYGQEIVETLAEYA -----------------------1111-------3333---------------------- 
SVPVWNGLTNEFHPTQLLADLLTMQEHLPGKAFNEMTLVYAGDARNNMGNSMLEAAALTG ------------3333---------------1111------------------------- LDLRLVAPQACWPEAALVTECRALAQQNGGNITLTEDVAKGVEGADFIYTDVWVSMGEAK -------3333--------------1111------------2222---------2222-- EKWAERIALLREYQVNSKMMQLTGNPEVKFLHCLPAFHDDQTTLGKKMAEEFGLHGGMEV ---------3333-------33331111------------------3333---------- TDEVFESAASIVFDQAENRMHTIKAVMVATLSK ------3333----------------------- >NONAHEME CYTOCHROME C; SWP:Q9XCU0; PDB:1DUWA; EPTDSGAPSAIVMFPVSAKPNPKGAAMKPAVFNHLAHEKKIANCETCHHTGDPVACSTCH --3333--------------1111-------------1111-1111-1111---3333-- TTEGKAEGNFVTLDRAMHATNIAKRAKGNTPVSCVSCHEQQTKERRECAGCHAIVTPKRD 33333333------------------------------------3333-1111------- QAWCATCHNVTSSMTPEQMQQGIKGKLPPDQNEALAAETVLNHKPVQPLTAMQGPYKVSI ----------3333-------------------------1111------1111------- DALADKYEPSNFTHRRHMASLMERIKGDKLAEAFHNKPETLCATCHHRSPLSATPPKCGS ----------------------1111----------11113333------------3333 CHTKEIDPANPNRPNLKAAYHLQCMGCHQGMNVGRPKNTDCTTCHKARP ------------------------------------11113333----- >ETS domain-containing pro; SWP:P19419; PDB:1DUXC; VTLWQFLLQLLREQGNGHIISWTSRDGGEFKLVDAEEVARLWGLRKNKTNMNYDKLSRAL -----------------------3333----------------1111----3333----- RYYYDKNIIRKVSGQKFVYKFVSYPE --3333-----2222----------- >BIOTIN CARBOXYLASE; SWP:P24182; PDB:1DV1A; MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVCIGPAPSVKSY ---------------------1111-------1111------------------333311 LNIPAIISAAEITGAVAIHPGYGFLSENANFAEQVERSGFIFIGPKAETIRLMGDKVSAI 11--------1111--------!!!!---------1111--------------------- AAMKKAGVPCVPGSDGPLGDDMDKNRAIAKRIGYPVIIKASMRVVRGDAELAQSISMTRA ---1111-------------3333----------------------3333---------- EANDMVYMEKYLENPRHVEIQVLADGQGNAIYLAERDCSMQRRHQKVVEEAPAPGITPEL -----------------------------------------iiii-------2222---- RRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVTEMITGVDLIKEQ ---------------------------iiii----------1111--------------- 
LRIAAGQPLSIKQEEVHVRGHAVECRINAEDPNTFLPSPGKITRFHAPGGFGVRWESHIY -----------3333----------------------------------2222------2 AGYTVPPYYDSMIGKLICYGENRDVAIARMKNALQELIIDGIKTNVDLQIRIMNDENFQH 222-----------------------------3333------------------------ GGTNIHYLEKKLGL ---1111------- >APO-D-ALANYL CARRIER PROT; SWP:P55153; PDB:1DV5A; ADEAIKNGVLDILADLTGSDDVKKNLDLNLFETGLLDSMGTVQLLLELQSQFGVDAPVSE --------------3333--3333------1111-------------------------- FDRKEWDTPNKIIAKVEQAQ -3333--------------- >ASIALOGLYCOPROTEIN RECEPT; SWP:P07306; PDB:1DV8A; CPVNWVEHERSCYWFSRSGKAWADADNYCRLEDAHLVVVTSWEEQKFVQHHIGPVNTWMG -2222--iiii------------------1111--------------------------- LHDQNGPWKWVDGTDYETGFKNWRPEQPDDWYGHGLGGGEDCAHFTDDGRWNDDVCQRPY --1111---1111--1111----2222----3333----------1111-----1111-- RWVCETEL -------- >Ig kappa chain V-V region; SWP:P01644; PDB:1DVFC; DIQLTQSPSSLSASLGDRVTISCRASQDISNYLNWYQQKPDGTVKLLIYYTSRLHSGVPS -------------2222-----------iiii------1111------------222211 RFSGSGSGTDYSLTISNLEQEDIATYFCQQGNTLPWTFGGGTKLEIK 11----!!!!--------1111------------------------- >FV D1.3; SWP:NA; PDB:1DVFD; QVQLQQSGTELVKSGASVKLSCTASGFNIKDTHMNWVKQRPEQGLEWIGRIDPANGNIQY ---------------------------3333--------2222----------------- DPKFRGKATITADTSSNTAYLQLSL 1111--------3333--------- >PRP18; SWP:NA; PDB:1DVKA; MRIQEAIAQDKTISVIIDPSQIGSTEGKPLLSMKCNLYIHEILSRWKASLEAYHPELFLD -33333333--------3333---1111--------------------3333-1111--- TKKALFPLLLQLRRNQLAPDLLISLATVLYHLQQPKEINLAVQSYMKLSIGNVAWPIGVA -----------1111------------------1111----------------------- NIMIDERTRLWITSIKRLITFEEWYTSNH ------------------------1111- >FERTILITY INHIBITION PROT; SWP:P29367; PDB:1DVOA; PPKWKVKKQKLAEKAAREAELTAKKAQARQALSIYLNLPTLDEAVNTLKPWWPGLFDGDT -------------------------------3333------------3333-----!!!! 
PRLLACGIRDVLLEDVAQRNIPLSHKKLRRAMKAITRSESYLCAMKAGACRYDTEGYVTE ----2222---------------------------------11112222---1111---- HISQEEEVYAAERLDKIRRQNRIKAELQAVLD -------------------------------- >HEPATOCYTE GROWTH FACTOR-; SWP:Q960X8; PDB:1DVPA; MFRSSFCKNLENATSHLRLEPDWPSILLICDEINQKDVTPKNAFAAIKKKMNSPNPHSSC --------------3333------------------------------------------ YSLLVLESIVKNCGAPVHEEVFTKENCEMFSSFLESTPHENVRQKMLELVQTWAYAFRSS -------------3333---------------------------------------1111 DKYQAIKDTMTILKAKGHTFPELREMFTADTAPNWADGRVCHRCRVEFTFTNRKHHCRNC ---3333----------------------------------------------------- GQVFCGQCTAKQCPLPKYGIEKEVRVCDGCFAALQRG ----3333------3333------------------- >CYTOCHROME C551; SWP:P00099; PDB:1DVVA; EDPEVLAKNKGCMACHAIDTKMVGPAYKDVAAKYAGQAGAEAYLAQRIKNGSQGVWGPIP -3333-----1111------------------------3333------------------ MPPNAVSDDEAQTLAKWILSQK -----------------1111- >CYTOCHROME C; SWP:P81238; PDB:1DW0A; GDTSPAQLIAGYEAAAGAPADAERGRALFLSTQTGGKPDTPSCTTCHGADVTRAGQTRTG ------------------------------------1111----------------1111 KEIAPLAPSATPDRFTDSARVEKWLGRNCNSVIGRDCTPGEKADLLAWLAAQ ----------1111---------------------------------3333- >CYANATE LYASE; SWP:P00816; PDB:1DWKA; IQSQINRNIRLDLADAILLSKAKKDLSFAEIADGTGLAEAFVTAALLGQQALPADAARLV -------------------------------2222---------1111------------ GAKLDLDEDSILLLQIPLRGCIDDRIPTDPTYRFYELQVYGTTLKALVHEKFGDGIISAI --------------------------------3333------------------------ NFKLDVKKVADPEGGERAVITLDGKYLPTKPF ----------1111------------------ >FERREDOXIN I; SWP:P07485; PDB:1DWLA; TIVIDHEECIGCESCVELCPEVFAMIDGEEKAMVTAPDSTAECAQDAIDACPVEAISKE -------------3333-----------------------3333------1111----- >LINUM USITATISSINUM TRYPS; SWP:P82381; PDB:1DWMA; SRRCPGKNAWPELVGKSGNMAAATVERENRNVHAIVLKEGSAMTKDFRCDRVWVIVNDHG ---------3333---------------1111-----2222------1111-----1111 VVTSVPHIT --------- >PHAGE COAT PROTEIN; SWP:Q38062; PDB:1DWNA; SKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLD 
------------------------------------------------------------ QADVVDCSTSVCGELPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVV ------33332222----------------1111-------------------------- NLVPLGR ------- >RIBOSOMAL PROTEIN L1; SWP:O52704; PDB:1DWUA; MDRENILKAVKEARSLAKPRNFTQSLDLIINLKELDLSRPENRLKEQVVLPNGRGKEPKI -3333--------1111---------------------3333-------1111------- AVIAKGDLAAQAEEMGLTVIRQDELEELGKNKKMAKKIANEHDFFIAQADMMPLVGKTLG -----3333-----------------3333-----------------1111-------33 PVLGPRGKMPQPVPANANLTPLVERLKKTVLINTRDKPLFHVLVGNEKMSDEELAENIEA 333333-------1111----------------------------1111----------- ILNTVSRKYEKGLYHVKSAYTKLTMGPPAQIEK ----333311111111------1111------- >ACETYLCHOLINESTERASE; SWP:P07140; PDB:1DX4A; DRLVVQTSSGPVRGRSVTVQGREVHVYTGIPYAKPPVEDLRFRKPVPAEPWHGVLDATGL ------1111--------%%%%--------------!!!!-------------------- SATCVQERYEYFPGFSGEEIWNPNTNVSEDCLYINVWAPATTNGLPILIWIYGGGFMTGS -----------22223333----------------------------------%%%%--- ATLDIYNADIMAAVGNVIVASFQYRVGAFGFLHLAPEMPSEFAEEAPGNVGLWDQALAIR --3333------1111----------3333---3333-3333------------------ WLKDNAHAFGGNPEWMTLFGESAGSSSVNAQLMSPVTRGLVKRGMMQSGTMNAPWSHMTS -----3333--1111----------------------------------3333------- EKAVEIGKALINDCNCNASMLKTNPAHVMSCMRSVDAKTISVQQWNSYSGILSFPSAPTI -----------1111-3333---------------3333----3333------------- DGAFLPADPMTLMKTADLKDYDILMGNVRDEGTYFLLYDFIDYFDKDDATALPRDKYLEI -------1111------1111--------3333------3333----------------- MNNIFGKATQAEREAIIFQYTSWEGNPGYQNQQQIGRAVGDHFFTCPTNEYAQALAERGA --1111------------------------------------------------------ SVHYYYFTHRTSTSLWGEWMGVLHGDEIEYFFGQPLNNSLQYRPVERELGKRMLSAVIEF ----------1111--3333--2222------11113333-------------------- AKTGNPAQDGEEWPNFSKEDPVYYIFSTDDKIEKLARGPLAARCSFWNDYLPKVRSW ----------------3333-------------------------------3333-- >Thrombomodulin [Precursor; SWP:P07204; PDB:1DX5I; VEPVDPCFRANCEYQCQPLDQTSYLCVCAEGFAPIPHEPHRCQMFCNQTACPADCDPNTQ -----3333-------------------2222--1111-----------------1111- 
ASCECPEGYILDDGFICTDIDECENGGFCSGVCHNLPGTFECICGPDSALAGQIGTDC -----2222-----------3333-----------2222------------------- >RUBREDOXIN; SWP:Q9XG40; PDB:1DX8A; MEIDEGKYECEACGYIYEPEKGDKFAGIPPGTPFVDLSDSFMCPACRSPKNQFKSIKKVI ----------------------3333------3333------------3333-------- AGFAENQKYG ---------- >2-DEHYDRO-3-DEOXY-GALACTA; SWP:P23522; PDB:1DXEA; DVFPNKFKAALAAKQVQIGCWSALSNPISTEVLGLAGFDWLVLDGEHAPNDISTFIPQLM --------------------------------1111--------------3333-----1 ALKGSASAPVVRVPTNEPVIIKRLLDIGFYNFLIPFVETKEEAELAVASTRYPPEGIRGV 111--------------------------------------------------------- SVSHRANMFGTVADYFAQSNKNITILVQIESQQGVDNVDAIAATEGVDGIFVGPSDLAAA ---3333iiii---33331111---------------------2222-----------11 LGHLGNASHPDVQKAIQHIFNRASAHGKPSGILAPVEADARRYLEWGATFVAVGSDLGVF 112222-----------------1111--------------------------------- RSATQKLADTFKK -------3333-- >ORNITHINE CARBAMOYLTRANSF; SWP:P08308; PDB:1DXHA; AFNMHNRNLLSLMHHSTRELRYLLDLSRDLKRAKYTGTEQQHLKRKNIALIFEKTSTRTR ---2222----1111---------------------------2222---------3333- CAFEVAAYDQGANVTYIDPNSSQIGHKESMKDTARVLGRMYDAIEYRGFKQEIVEELAKF -----------------3333--------------3333----------3333------- AGVPVFNGLTDEYHPTQMLADVLTMREHSDKPLHDISYAYLGDARNNMGNSLLLIGAKLG ---------3333------------------3333------------------------- MDVRIAAPKALWPHDEFVAQCKKFAEESGAKLTLTEDPKEAVKGVDFVHTDVWVSMGEPV -------3333------------------------------2222---------333333 EAWGERIKELLPYQVNMEIMKATGNPRAKFMHCLPAFHNSETKVGKQIAEQYPNLANGIE 33-------3333-------33331111------------------3333-3333----- VTEDVFESPYNIAFEQAENRMHTIKAILVSTLADI -3333--3333------------------------ >D-XYLOSE ISOMERASE; SWP:P37031; PDB:1DXIA; MSFQPTPEDRFTFGLWTVGWQGRDPFGDATRPALDPVETVQRLAELGAYGVTFHDDDLIP -----1111----1111-----------------3333---------------------- FGSSDTERESHIKRFRQALDATGMTVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTI ------------------3333-------------3333--------1111--------- GNIDLAAELGAKTYVAWGGREGAESGGAKDVRDALDRMKEAFDLLGEYVTAQGYDLRFAI 
------------------------------------------------------------ EPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK ---------------------------3333----------1111--------------- LFHIDLNGQSGIKYDQDLRFGAGDLRAAFWLVDLLETAGYEGPRHFDFKPPRTEDFDGVW -----------------------3333-----------------------1111------ ASAAGCMRNYLILKDRAAAFRADPEVQEALRAARLDQLAQPTAADGLDALLADRAAFEDF 3333------------------3333----1111-3333--------------------- DVDAAAARGMAFEHLDQLAMDHLLGARG 33333333-------------------- >CLASS II CHITINASE; SWP:O81934; PDB:1DXJA; DVGSVIDASLFDQLLKHRNDPACEGKGFYSYNAFVTAARSFGGFGTTGDTNTRKREVAAF 3333----------1111-1111-iiii------------2222---------------- LAQTSHETTGGAAGSPDGPYAWGYCFVTERDKSNKYCDPGTPCPAGKSYYGRGPIQLTHN -----1111--2222--1111----------------1111--2222-----1111---- YNYAQAGRALGVDLINNPDLVARDAVISFKTAIWFWMTPQGNKPSCHDVITNRWTPSAAD -------------11113333-------------1111-!!!!-----1111-------- VAANRTPGFGVITNIINGGIECGRGPSPASGDRIGFYKRYCDVLHLSYGPNLNCRDQRPF 1111---3333------------------------------------------------- GG -- >DIHYDROLIPOAMIDE DEHYDROG; SWP:P31023; PDB:1DXLA; SDENDVVIIGGGPGGYVAAIKAAQLGFKTTCIEKRGALGGTCLNVGCIPSKALLHSSHMY -----------3333-------1111--------------3333---------------- HEAKHSFANHGVKVSNVEIDLAAMMGQKDKAVSNLTRGIEGLFKKNKVTYVKGYGKFVSP -------1111--------3333------------------------------------- SEISVDTIEGENTVVKGKHIIIATGSDVKSLPGVTIDEKKIVSSTGALALSEIPKKLVVI -------------------------------------------3333------------- GAGYIGLEMGSVWGRIGSEVTVVEFASEIVPTMDAEIRKQFQRSLEKQGMKFKLKTKVVG -----------------------------3333--------------------------- VDTSGDGVKLTVEPSAGGEQTIIEADVVLVSAGRTPFTSGLNLDKIGVETDKLGRILVNE ----------------------------------------------------------11 RFSTNVSGVYAIGDVIPGPMLAHKAEEDGVACVEYLAGKVGHVDYDKVPGVVYTNPEVAS 11------------------3333-------------------1111------------- VGKTEEQVKETGVEYRVGKFPFMANSRAKAIDNAEGLVKIIAEKETDKILGVHIMAPNAG --------------------3333----1111-------------------------333 ELIHEAAIALQYDASSEDIARVCHAHPTMSEAIKEAAMATYDKPIHI 
3-------------3333---------3333---------------- >QUINONE REDUCTASE; SWP:Q64669; PDB:1DXQA; AARRALIVLAHSEKTSFNYAMKEAAVEALKKRGWEVLESDLYAMNFNPIISRNDITGELK ------------11113333-------------------3333----------------- DSKNFQYPSESSLAYKEGRLSPDIVAEHKKLEAADLVIFQFPLQWFGVPAILKGWFERVL --------------------3333-------------------%%%%------------- VAGFAYTYAAMYDNGPFQNKKTLLSITTGGSGSMYSLQGVHGDMNVILWPIQSGILRFCG 2222--3333!!!!1111------------3333-1111---3333-3333--------- FQVLEPQLVYSIGHTPPDARMQILEGWKKRLETVWEETPLYFAPSSLFDLNFQAGFLLMK -----------------3333-------33333333-------3333------------- EVQEEQKKNKFGLSVGHHLGKSIPADNQIKARK ----3333-----3333iiii----1111---- >P53-LIKE TRANSCRIPTION FA; SWP:O15350; PDB:1DXSA; SLVSFLTGLGCPNCIEYFTSQGLQSIYHLQNLTIEDLGALKIPEQYRMTIWRGLQDL 3333--1111111133331111----3333------------3333------3333- >DYSTROPHIN; SWP:P11532; PDB:1DXXA; DSYEREDVQKKTFTKWVNAQFSKFGKQHIENLFSDLQDGRRLLDLLEGLTGQKLPKEKGS ---------------------1111-----1111-1111--------------------- TRVHALNNVNKALRVLQNNNVDLVNIGSTDIVDGNHKLTLGLIWNIILHWQVKNVMKNIM ----------------1111--2222----1111-------------------------- AGLQQTNSEKILLSWVRQSTRNYPQVNVINFTTSWSDGLALNALIHSHRPDLFDWNSVVS --1111-------------1111--------1111-------------3333-3333111 QQSATQRLEHAFNIARYQLGIEKLLDPEDVDTTYPDKKSILMYITSLFQVLPQQVSIE 1------------------------3333----------------------------- >D-2-HYDROXYISOCAPROATE DE; SWP:P17584; PDB:1DXY; MKIIAYGARVDEIQYFKQWAKDTGNTLEYHTEFLDENTVEWAKGFDGINSLQTTPYAAGV --------3333----------------------3333-1111-------------3333 FEKMHAYGIKFLTIRNVGTDNIDMTAMKQYGIRLSNVPAYSPAAIAEFALTDTLYLLRNM ----1111----------1111-----1111---------3333---------------- GKVQAQLQAGDYEKAGTFIGKELGQQTVGVMGTGHIGQVAIKLFKGFGAKVIAYDPYPMK -----------3333------3333-------------------1111------------ GDHPDFDYVSLEDLFKQSDVIDLHVPGIEQNTHIINEAAFNLMKPGAIVINTARPNLIDT --3333---------------------3333--------33332222------1111--- QAMLSNLKSGKLAGVGIDTYEYETEDLLNLAKHGSFKDPLWDELLGMPNVVLSPHIAYYT ------3333------------------------------------1111-----1111- 
ETAVHNMVYFSLQHLVDFLTKGETSTEVTG -----------------------1111--- >COLLAGEN ALPHA1(XVIII) CH; SWP:P39061; PDB:1DY0A; AHTHQDFQPVLHLVALNTPLSGGMRGIRGADFQCFQQARAVGLSGTFRAFLSSRLQDLYS -------------------------3333-----------------------11113333 IVRRADRGSVPIVNLKDEVLSPSWDSLFSGSQGQLQPGARIFSFDGRDVLRHPAWPQKSV --3333-------1111-----3333----%%%%-2222---1111-33333333----- WHGSDPSGRRLMESYCETWRTETTGATGQASSLLSGRLLEQKAASCHNSYIVLCIENSFM ----1111--1111%%%%----3333-----1111---------3333------------ >COLLAGEN ALPHA1(XV) CHAIN; SWP:O35206; PDB:1DY2A; RPVLHLVALNTPVAGDIRADFQCFQQARAAGLLSTFRAFLSSHLQDLSTVVRKAERFGLP ---------------------------1111--------------3333--3333----- IVNLKGQVLFNNWDSIFSGDGGQFNTHIPIYSFDGRDVMTDPSWPQKVVWHGSNPHGVRL --1111-----3333---------1111---1111-33333333---------1111--1 VDKYCEAWRTTDMAVTGFASPLSTGKILDQKAYSCANRLIVLCIENSF 111%%%%----1111-----3333---------1111----------- >RIBONUCLEASE A; SWP:P00656; PDB:1DY5A; KETAAAKFERQHMDSSTSAASSSNYCNQMMKSRNLTKDRCKPVNTFVHESLADVQAVCSQ ---------------------1111-----1111----------------------1111 KNVACKGQTNCYQSYSTMSITDCRETGSSKYPNCAYKTTQANKHIIVACEGNPYVPVHFD -------------------------1111------------------------------- ASV --- >CARBAPENEM-HYDROLYSING BE; SWP:Q54488; PDB:1DY6A; NKSDAAAKQIKKLEEDFDGRIGVFAIDTGSGNTFGYRSDERFPLCSSFKGFLAAAVLERV --------------1111------------------1111---!!!!------------- QQKKLDINQKVKYESRDLEYHSPITTKYKGSGMTLGDMASAALQYSDNGATNIIMERFLG -----1111---1111-------3333--------------------------------- GPEGMTKFMRSIGDNEFRLDRWELELNTAIPGDKRDTSTPKAVANSLNKLALGNVLNAKV ---------1111----------------2222--------------------------- KAIYQNWLKGNTTGDARIRASVPADWVVGDKTGSCGAYGTANDYAVIWPKNRAPLIVSIY -------1111--11111111-3333----------%%%%-------------------- TTRKSKDDKHSDKTIAEASRIAIQAID ----1111--------------3333- >PROTEASE/HELICASE NS3 (P7; SWP:Q81755; PDB:1DY9A; APITAYSQQTRGLLGCIITSLTGRDKNQVDGEVQVLSTATQSFLATCVNGVCWTVYHGAG -------------------------------------1111------%%%%---3333!! 
SKTLAGPKGPITQMYTNVDQDLVGWPAPPGARSMTPCTCGSSDLYLVTRHADVIPVRRRG !!---1111--------1111--------------------------1111--------- DSRGSLLSPRPVSYLKGSSGGPLLCPSGHVVGIFRAAVCTRGVAKAVDFIPVESM ----------33332222------1111-----------iiii-------3333- >LAMININ ALPHA 2 CHAIN; SWP:Q60675; PDB:1DYKA; HGPCVAESEPALLTGSKQFGLSRNSHIAIAFDDTKVKNRLTIELEVRTEAESGLLFYMAR ------------2222-----1111------3333------------------------1 INHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDGQWHKIKIVRVKQEGILYVDDAS 111--------iiii--------------------------------!!!!----!!!!- SQTISPKKADILDVVGILYVGGLPINYTTRRIGPVTYSLDGCVRNLHMEQAPVDLDQPTS -----------------------------------------------1111--1111--- SFHVGTCFANAESGTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRPTGVLLGVSSQKMDG ------------------------------------------------------------ MGIEMIDEKLMFHVDNGAGRFTAIYDAEIPGHMCNGQWHKVTAKKIKNRLELVVDGNQVD -----iiii-------------------2222-------------!!!!----iiii--- AQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKLTKGTGKPLEVNFAK ----3333--------------2222-1111-------------------------1111 ALELRGVQPVSCPT -------------- >DYNAMIN; SWP:Q05193; PDB:1DYNA; ILVIRKGWLTINNIGIMKGGSKEYWFVLTAENLSWYKDDEEKEKKYMLSVDNLKLRDVEK -------------------------------------1111--------2222------- GFMSSKHIFALFNTEQRNVYKDYRQLELACETQEEVDSWKASFLRAGVYPERV ------------1111--------------------------3333------- >ENDO-1,4-BETA-XYLANASE Y; SWP:P51584; PDB:1DYOA; PDAGYYYHDTFEGSVGQWTARGPAEVLLSGRTAYKGSESLLVRNRTAAWNGAQRALNPRT --------------!!!!----------------------------3333------3333 FVPGNTYCFSVVASFIEGASSTTFCKLQYVDGSGTQRYDTIDKTVGPNQWVHLYNPQYRI ------------------------------1111-------------------------- PSDATDYVYVETADDTINFYIDEAIGAVAGTVI 1111-----------------------2222-- >KAPPA-CARRAGEENASE; SWP:P43478; PDB:1DYPA; SQPPIAKPGETWILQAKRSDEFNVKDATKWNFQTENYGVWSWKNENATVSKGKLKLTTKR --11112222----3333-------3333-------------3333---iiii------- ESHQRTFWDGCNQQQVANYPLYYTSGVAKSRATGNYGYYEARIKGASTFPGVSPAFWYST ---------1111----------------------------------------------- 
IDRSLTKEGDVQYSEIDVVELTQKSAVRESDHDLHNIVVKNGKPTWRPGSFPQTNHNGYH ------2222---------------1111----------iiii---3333-1111----- LPFDPRNDFHTYGVNVTKDKITWYVDGEIVGEKDNLYWHRQNLTLSQGLRAPHTQWKCNQ ---1111---------1111----iiii----------------------------%%%% FYPSANKSAEGFPTSEVDYVRTWVKV --------2222-------------- >STAPHYLOCOCCAL ENTEROTOXI; SWP:P13163; PDB:1DYQA; EINEKDLRKKSELQGTALGNLKQIYYYNEKAKTENKESHDQFRQHTILFKGFFTDHSWYN --1111--3333-!!!!------------------------------------------- DLLVRFDSKDIVDKYKGKKVDLYGAYAGYQCAGGTPNKTACMYGGVTLHDNNRLTEEKKV --------------2222---------111122222222---------2222-------- PINLWLDGKQNTVPLETVKTNKKNVTVQELDLQARRYLQEKYNLYNSDVFDGKVQRGLIV -----iiii----1111-------------------------11113333---------- FHTSTEPSVNYDLFGAQGQYSNTLLRIYRDNKTINSENMHIDIYLYTS -----------1111-333311113333-------2222--------- >ENDOGLUCANASE; SWP:Q7SIG5; PDB:1DYSA; GNPFSGRTLLVNSDYSSKLDQTRQAFLSRGDQTNAAKVKYVQEKVGTFYWISNIFLLRDI -1111---------------------1111----------------------3333---- DVAIQNARAAKARGENPIVGLVLYNLPDRDCSAGESSGELKLSQNGLNRYKNEYVNPFAQ ----------1111---------------3333-------1111---------------- KLKAASDVQFAVILEPDAIGNMVTGTSAFCRNARGPQQEAIGYAISQLQASHIHLYLDVA ----1111------2222--------------------------1111-1111------- NGGWLGWADKLEPTAQEVATILQKAGNNAKIRGFSSNVSNYNPYSTSNPPPYTSGSPSPD 3333--1111--------------------------2222---------3333------- ESRYATNIANAMRQRGLPTQFIIDQSRVALSGARSEWGQWCNVNPAGFGQPFTTNTNNPN ------------1111-------------2222--1111------------------111 VDAIVWVKPGGESDGQCGMGGAPAAGMWFDAYAQMLTQNAHDEIA 1-----------------2222-2222---------11113333- >EOSINOPHIL CATIONIC PROTE; SWP:P12724; PDB:1DYTA; RPPQFTRAQWFAIQHISLNPPRCTIAMRAINNYRWRCKNQNTFLRTTFANVVNVCGNQSI -1111----------------33333333-1111-----------------3333----- RCPHNRTLNNCHRSRFRVPLLHCDLINPGAQNISNCRYADRPGRRFYVVACDNRDPRDSP -1111---------------------1111-1111-------------------111133 RYPVVPVHLDTTI 33----------- >CYCLOPHILIN 3; SWP:P52011; PDB:1DYWA; MSRSKVFFDITIGGKASGRIVMELYDDVVPKTAGNFRALCTGENGIGKSGKPLHFKGSKF 
-----------iiii--------------------------1111-1111----2222-- HRIIPNFMIQGGDFTRGNGTGGESIYGEKFPDENFKEKHTGPGVLSMANAGPNTNGSQFF -----------------------1111-------------2222---------------- LCTVKTEWLDGKHVVFGRVVEGLDVVKAVESNGSQSGKPVKDCMIADCGQLK -----1111------------3333----11113333--------------- >MODIFIER 1 PROTEIN; SWP:P23197; PDB:1DZ1A; HMKEESEKPRGFARGLEPERIIGATDSSGELMFLMKWKNSDEADLVPAKEANVKCPQVVI ----------3333--------------------------------3333---------- SFYEERLTWH --3333---- >STAGE 0 SPORULATION PROTE; SWP:CAA05307; PDB:1DZ3A; SIKVCIADDNRELVSLLDEYISSQPDMEVIGTAYNGQDCLQMLEEKRPDILLLDIIMPHL -----------------------1111--------------------------------- DGLAVLERIRAGFEHQPNVIMLTAFGQEDVTKKAVELGASYFILKPFDMENLAHHIRQVY -----------------------22223333---1111----------2222-------- GKT --- >CHORIONIC GONADOTROPIN; SWP:P01215; PDB:1DZ7A; APDVQDCPECTLQENPFFSQPGAPILQCMGCCFSRAYPTPLRSKKTMLVQKNVTSESTCC ------------------------------------------------------------ VAKSYNRVTVMGGFKVENHTACHCSTCYYHKS ----------%%%%------------------ >RIBONUCLEASE 1; SWP:Q16869; PDB:1DZAA; KESAAAKFERQHMDSGNSTYCNQMMRRRNMTQGRCKPVNTFVHESLVDVQNVCFQEKVTC -------------3333--------1111----------------------1111----1 KNGQGNCYKSNSSMHITDCRLTNGSRYPNCAYRTSPKERHIIVACEGSPYVPVHFDASVE 111------------------%%%%----------------------------------- >SCFV FRAGMENT 1F9; SWP:P00703; PDB:1DZBA; QVKLQQSGAELVKPGASVKLSCTASGFNIKDTYMHWVKQRPEQGLEWIGRIDPANGNTKY ------------2222-----------3333----------------------------- DPKFQGKATITADTSSNTAYLQLSSLTSEDTAVYYCARWDWYFDVWGQGTTVTVSSGDIE 3333---------1111---------3333---------2222----------------- LTQSPSSMYTSLGERVTITCKASQDINSYLRWFQQKPGKSPKTLIYYATSLADGVPSRFS -----------------------------------2222------------22223333- GSGSGQDYSLTISSLESDDTTTYYCLQHGESPYTFGGGTKLEIK ---!!!!--------1111------------------------- >DNA-DIRECTED RNA POLYMERA; SWP:P20434; PDB:1DZFA; NERNISRLWRAFRTVKEMVKDRGYFITQEEVELPLEDFKAKYCDSMGRPQRKMMSFQANP -1111---------------------3333-------------1111--3333------- 
TEESISKFPDMGSLWVEFCDEPSVGVKTMKTFVIHIQEKNFQTGIFVYQNNITPSAMKLV -------1111------------------------------------------3333-11 PSIPPATIETFNEAALVVNITHHELVPKHIRLSSDEKRELLKRYRLKESQLPRIQRADPV 11---------3333---11111111--------------------3333----1111-- ALYLGLKRGEVVKIIRKSETSGRYASYRICM ------2222-----1111------------ >ODORANT-BINDING PROTEIN; SWP:P81245; PDB:1DZKA; FELSGKWITSYIGSSDLEKIGENAPFQVFMRSIEFDDKESKVYLNFFSKENGICEEFSLI ---------------3333-2222------------1111---------iiii------- GTKQEGNTYDVNYAGNNKFVVSYASETALIISNINVDEEGDKTIMTGLLGKGTDIEDQDL ----%%%%----------------1111--------1111-------------------- EKFKEVTRENGIPEENIVNIIERDDCPA -------1111-3333--3333------ >DTDP-4-DEHYDRORHAMNOSE 3,; SWP:P26394; PDB:1DZRA; MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLR --------3333---------1111-----------------------------2222-- GLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAH -----!!!!--------------------1111-2222------3333------2222-- GFVTLSEYAEFLYKATNYYSPSSEGSILWNDEAIGIEWPFSQLPELSAKDAAAPLLDQAL -------------------3333----11113333-----------3333----3333-- LTE --- >Periplasmic [Fe] hydrogen; SWP:P07603; PDB:1E08D; VKQIKDYMLDRINGVYGADAKFPVRASQDNTQVKALYKSYLEKPLGHKSHDLLHTHWFDK --3333-------------------1111------------------------------- SKGVKELTTAGKLPNPRASEFEGPYPYE -3333-3333----1111---------- >PRU AV 1; SWP:O24248; PDB:1E09A; GVFTYESEFTSEIPPPRLFKAFVLDADNLVPKIAPQAIKHSEILEGDGGPGTIKKITFGE -------------3333-------3333--------------------2222-------- GSQYGYVKHKIDSIDKENYSYSYTLIEGDALGDTLEKISYETKLVASPSGGSIIKSTSHY ---------------------------3333---------------1111---------- HTKGNVEIKEEHVKAGKEKASNLFKLIETYLKGHPDAYN --------3333--------------------------- >Serine/threonine-protein ; SWP:P35465; PDB:1E0AB; GSISLPSDFEHTIHVGFDAVTGEFTGMPEQWARLLQTSNITKSEQK ---------------------------3333--------------- >SWI6 PROTEIN; SWP:P40381; PDB:1E0BA; QVENYDSWEDLVSSIDTIERKDDGTLEIYLTWKNGAISHHPSTITNKKCPQKMLQFYESH -----1111-----------1111-------1111-----33333333------------ L - 
>SULFURTRANSFERASE; SWP:P52197; PDB:1E0CA; MDDFASLPLVIEPADLQARLSAPELILVDLTSAARYAEGHIPGARFVDPKRTQLGQPPAP !!!!-------3333---1111------------------2222---3333-------11 GLQPPREQLESLFGELGHRPEAVYVVYDDEGGGWAGRFIWLLDVIGQQRYHYLNGGLTAW 11----------------1111-------------------------------------- LAEDRPLSRELPAPAGGPVALSLHDEPTASRDYLLGRLGAADLAIWDARSPQEYRGEKVL 1111--------------------3333------1111-1111----------------- AAKGGHIPGAVNFEWTAAMDPSRALRIRTDIAGRLEELGITPDKEIVTHQTHHRSGLTYL ------2222---3333--1111----1111----1111-1111--------1111---- IAKALGYPRVKGYAGSWGEWGNHPDTPVEL ------------3333------1111---- >HUMAN IMMUNODEFICIENCY VI; SWP:P04584; PDB:1E0EA; FLEKIEPAQEEHEKYHSNVKELSHKFGIPNLVARQIVNSCAQCQQK ----------------------------3333--------1111-- --------------------------------------------------------- >MEMBRANE-BOUND LYTIC MURE; SWP:P23931; PDB:1E0GA; DSITYRVRKGDSLSSIAKRHGVNIKDVMRWNSDTANLQPGDKLTLFVK -------2222-----3333-------------1111----------- >WWPROTOTYPE; SWP:NA; PDB:1E0MA; SMGLPPGWDEYKTHNGKTYYYNHNTKTSTWTDPRMSS -------------------------------3333-- >PYRUVATE KINASE; SWP:P14178; PDB:1E0TA; MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKT ----------3333-3333-------------------3333------------------ AAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVG ----------------2222---------------3333--1111----1111------- NTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIF -----iiii--------------------------------------------------- GCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGI ---------------3333------------1111-------33331111---------- MVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATMRPTDAEAGDVANAILDGTDAV ----3333---3333--------------------------3333--------------- MLSGEPLEAVSIMATICERTDRVMNSRLEITEAVCRGAVETAEKLDAPLIVVATQGGKSA -----3333----------1111------------------------------------- RAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAH ---1111----------------33332222----------------------------2 KGDVVVMVSGALVPSGTTNTASVHVL 
222----------------------- >FERREDOXIN; SWP:P00216; PDB:1E0ZA; PTVEYLNYETLDDQGWDMDDDDLFEKAADAGLDGEDYGTMEVAEGEYILEAAEAQGYDWP -----------3333------33333333---1111------2222-------------- FSCRAGACANCASIVKEGEIDMDMQQILSDEEVEEKDVRLTCIGSPAADEVKIVYNAKHL --------1111----------------3333-------3333----------------- DYLQNRVI -------- >AFX; SWP:P98177; PDB:1E17A; SRRNAWGNQSYAELISQAIESAPEKRLTLAQIYEWMVRTVPYFKDKGDSNSSAGWKNSIR ---------------------------------------33333333------------- HNLSLHSKFIKVHNEATGKSSWWMLNPEGG ---------------%%%%------3333- >CARBAMATE KINASE-LIKE CAR; SWP:P95474; PDB:1E19A; GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL --------1111--2222---------------------1111----------------- LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK ----------------------------------------1111--------------11 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 113333----------------------------iiii-------------1111----- LVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA -1111-----2222------iiii------------------1111-------------- LYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAV -2222-----------------------1111----------------------1111-- EALEGKTGTQVLP -1111-------- >BOTULINUM NEUROTOXIN TYPE; SWP:Q45894; PDB:1E1HA; MAYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTFTNPEEGDLNPPPEAKQ -1111-----------2222-----------2222----------1111----------- VPVSYYDSTYLSTDNEKDNYLKGVTKLFERIYSTDLGRMLLTSIVRGIPFWGGSTIDTEL ------1111---------------------------------------------1111- KVIDTNCINVIQPDGSYRSEELNLVIIGPSADIIQFECKSFGHDVLNLTRNGYGSTQYIR --1111-----1111----------------1111--------1111------------- FSPDFTFGFEESLGAGKFATDPAVTLAHELIHAEHRLYGIAINPNRVFKVNTNAY ------------------------------------------1111--------- >Botulinum neurotoxin type; SWP:Q45894; PDB:1E1HB; EMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDVASTLNKAKSIIGTTASLQ ----------------3333---------------------------------------- YMKNVFKEKYLLSEDTSGKFSVDKLKFDKLYKMLTEIYTEDNFVNFFKVINRKTYLNFDK 
-------1111---1111------------------------------------------ AVFRINIVPDENYTIKDGFNLKANLSTNFNGQNTEINSRNFTRL --------1111----!!!!------%%%%------3333---- >LYSYL-TRNA SYNTHETASE; SWP:P14825; PDB:1E1OA; AIDFNDELRNRREKLAALRQQGVAFPNDFRRDHTSDQLHEEFDAKDNQELESLNIEVSVA 3333--------------3333-----------3333----1111--------------- GRMMTRRIMGKASFVTLQDVGGRIQLYVARDSLPEGVYNDQFKKWDLGDIIGARGTLFKT --------!!!!------1111------1111-2222---3333-2222----------1 QTGELSIHCTELRLLTKALRPLPDQEVRYRQRYLDLIANDKSRQTFVVRSKILAAIRQFM 111----------------------3333------------------------------- VARGFMEVETPMMQVIPGGASARPFITHHNALDLDMYLRIAPELYLKRLVVGGFERVFEI 1111-------------------------------------------------------- NRNFRNEGISVHNPEFTMMELYMAYADYHDLIELTESLFRTLAQEVLGTTKVTYGEHVFD --------------------------3333-----------------------!!!!--- FGKPFEKLTMREAIKKYRPETDMADLDNFDAAKALAESIGITVEKSWGLGRIVTEIFDEV -----------------1111---------------1111---3333------------- AEAHLIQPTFITEYPAEVSPLARRNDVNPEITDRFEFFIGGREIGNGFSELNDAEDQAER 3333----------33331111--3333----------iiii------------------ FQEQVNAKAAGDDEAMFYDEDYVTALEYGLPPTAGLGIGIDRMIMLFTNSHTIRDVILFP -------11111111---------3333-----------------------3333----- AMRP ---- >RIBONUCLEASE 1; SWP:P07998; PDB:1E21A; AFQRQHMDSDSSPSSSSTYCNQMMRRRNMTQGRCKPVNTFVHEPLVDVQNVCFQEKVTCK 3333---1111-------------1111--------------------3333-------- NGQGNCYKSNSSMHITDCRLTNGSRYPNCAYRTSPKERHIIVACEGSPYVPVHFDASVE ----------------------------------------------------------- >EXTENDED-SPECTRUM BETA-LA; SWP:P37321; PDB:1E25A; SPLLKEQIESIVIGKKATVGVAVWGPDDLEPLLINPFEKFPMQSVFKLHLAMLVLHQVDQ -----------2222---------1111------1111---!!!!------------111 GKLDLNQTVIVNRAKVLQN 1--1111----1111---- >HLA CLASS I HISTOCOMPATIB; SWP:P18464; PDB:1E27A; GSHSMRYFYTAMSRPGRGEPRFIAVGYVDDTQFVRFDSDAASPRTEPRAPWIEQEGPEYW -------------2222----------!!!!-----1111--------3333-------- DRNTQIFKTNTQTYRENLRIALRYYNQSEAGSHTWQTMYGCDVGPDGRLLRGHNQYAYDG -------------------------------------------1111----------iii 
KDYIALNEDLSSWTAADTAAQITQRKWEAAREAEQLRAYLEGLCVEWLRRHLENGKETLQ i-----1111------3333-------------------------------------111 RADPPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 1-------------1111--------------------iiii--1111------------ FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP ---------22221111-----1111---------- >CYTOCHROME C549; SWP:Q55013; PDB:1E29A; VELTESTRTIPLDEAGGTTTLTARQFTNGQKIFVDTCTQCHLQGKTKTNNNVSLGLADLA ---3333----------------------------------------------------- GAEPRRDNVLALVEFLKNPKSYDGEDDYSELHPNISRPDIYPEMRNYTEDDIFDVAGYTL --------------------1111---------333311111111--------------- IAPKLDERWGGTIYF -----1111------ >THYMIDINE KINASE; SWP:P03176; PDB:1E2KA; MPTLLRVYIDGPHGMGKTTTTQLLVADDIVYVPEPMTYWRVLGASETIANIYTTQHRLDQ ----------------------1111--------3333---------------------- GEISAGDAAVVMTSAQITMGMPYAVTDAVLAPHIGGEAGPPPALTLIFDRHPIAALLCYP -----------------------------3333-----------------3333------ AARYLMGSMTPQAVLAFVALIPPTLPGTNIVLGALPEDRHIDRLAKRQRPGERLDLAMLA ---1111----------1111----------------------3333-2222-------- AIRRVYGLLANTVRYLQCGGSWREDWGQLSGTGPRPHIGDTLFTLFRAPELLAPNGDLYN ---------------1111-33333333--------33333333---3333-1111---- VFAWALDVLAKRLRSMHVFILDYDQSPAGCRDALLQLTSGMVQTHVTTPGSIPTICDLAR ------------1111---------3333--------1111------1111--------- TFAREMGE -------- >N-HYDROXYARYLAMINE O-ACET; SWP:Q00267; PDB:1E2TA; HMTSFLHAYFTRLHCQPLGVPTVEALRTLHLAHNCAIPFENLDVLLPREIQLDETALEEK ----------------------------------------3333---------------- LLYARRGGYCFELNGLFERALRDIGFNVRSLLGRVILSHPASLPPRTHRLLLVDVEDEQW -1111----------------1111----------------------------------- IADVGFGGQTLTAPLRLQAEIAQQTPHGEYRLMQEGSTWILQFRHHEHWQSMYCFDLGVQ --------------------------------------------%%%%------------ QQSDHVMGNFWSAHWPQSHFRHHLLMCRHLPDGGKLTLTNFHFTRYHQGHAVEQVNVPDV 3333----------11111111----------------!!!!----%%%%---------- PSLYQLLQQQFGLGVNDVKHGFTEAELAAVMAAF -----------------------------3333- >CYTOCHROME F; SWP:P23577; PDB:1E2WA; 
YPVFAQQNYANPREANGRIVCANCHLAQKAVEIEVPQAVLPDTVFEAVIELPYDKQVKQV 3333---------1111-3333-------------------------------1111--- LANGKKGDLNVGMVLILPEGFELAPPDRVPAEIKEKVGNLYYQPYSPEQKNILVVGPVPG 1111--------------------3333-----------------1111---------33 KKYSEMVVPILSPDPAKNKNVSYLKYPIYFGGNRGRGQVYPDGKKSNFTIYNASAAGKIV 33-----------33331111------------------1111----------------- AITALSEKKGGFEVSIEKANGEVVVDKIPAGPDLIVKEGQTVQADQPLTNNPNVGGFGQA -----------------3333---------------2222--2222-------------- ETEIVLQNPAR ----------- >TRYPAREDOXIN PEROXIDASE; SWP:Q9TZX2; PDB:1E2YA; GAAKLNHPAPEFDDALPNGTFKKVSLSSYKGKYVVLFFYPDFTFVCPTEIIQFSDDAKRF ------------------------3333-------------------------------- AEINTEVISCSCDSEYSHLQWTSVDRKKGGLGPAIPLADKTKAIARAYGVLDEDSGVAYR -------------------3333-3333----------1111------------------ GVFIIDPNGKLRQIIINDPIGRNVEEVIRLVEALQFVEEHG -----1111-------------3333--------------- >P97; SWP:Q01853; PDB:1E32A; NRPNRLIVDEAINEDNSVVSLSQPKMDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDE -1111---------1111-----------------------%%%%--------1111333 KIRMNRVVRNNLRVRLGDVISIQPCPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFL 3---3333------2222------1111----------1111------3333-------- EAYRPIRKGDIFLVRGGMRAVEFKVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNE ---------------%%%%-----------------1111----------------1111 VGYDDVGGCRKQLAQIKEMVELPLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVAN -1111----------------11113333-----------------------------33 ETGAFFFLINGPEIMSKLAGESESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEV 33--------3333-------------------1111--------1111----------- ERRIVSQLLTLMDGLKQRAHVIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEI ---------------------------------3333----------------------- LQIHTKNMKLADDVDLEQVANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAE ----1111--1111------------3333----------------3333-------333 VMNSLAVTMDDFRWALSQ 3------3333---1111 >[NIFE] HYDROGENASE SMALL ; SWP:AAF43137; PDB:1E3DA; SRPSVVYLHAAECTGCSEALLRTYQPFIDTLILDTISLDYHETIMAAAGEAAEEALQAAV -------------------1111-----------------3333---------------- 
NGPDGFICLVEGAIPTGMDNKYGYIAGHTMYDICKNILPKAKAVVSIGTCACYGGIQAAK ----------------%%%%----iiii--------3333--------------1111-- PNPTAAKGINDCYADLGVKAINVPGCPPNPLNMVGTLVAFLKGQKIELDEVGRPVMFFGQ -------3333-3333-----------------------1111----------3333--- SVHDLCERRKHFDAGEFAPSFNSEEARKGWCLYDVGCKGPETYNNCPKVLFNETNWPVAA 3333-1111--1111----------------1111--3333---3333--%%%%-3333- GHPCIGCSEPNFWDDMTPFYQN -----1111-3333---1111- >[NIFE] HYDROGENASE SMALL ; SWP:AAF43138; PDB:1E3DB; TPRSNYTGPIVVDPLTRIEGHLRIEVEVEGGVIKEARSCATLFRGIETILKGRDPRDAQH ----------------------------iiii--------------3333---3333--- FTQRTCGVCTYTHALASTRCLEDAINKPIPANATYIRNLVLGNQFMHDHLVHFYHLHALD -----------------------------3333-----------------------3333 FVDVTSALLADPAKAAKLANSISPRKATTEEFAAVQAKLKTFVASGQLGPFTNAYFLGGH --3333-----------------------------------------!!!!--1111--3 EGYYMDPEANLVCTAHYLQALRAQVEVAKGMAVFGAKNPHTQFTVAGGVTCYEALTPERI 333--------------------------3333--------------------------- KQFRELYVKARAFIEEVYIPDLLLVASYYKDWGKIGGTNNFMAFGEFPAPGGERDLNSRW ------------------------3333--------------------2222--1111-- YKPGVIYDRKVGSVQPFDPSKIEEHVRHSWYEGKARAPFEGETNPHFTFMGDTDKYSWNK ------%%%%-------3333----1111-------3333--------2222-------- APRYDGHAVETGPLAQMLVAYGHNHKTIKPTIDAVLGKLNLGPEALFSTLGRTAARGIQT ---iiii-------------1111------------------3333-------------- LVIAQQMENWLNEYENNIVKDKQIVEDYAVPTSARGVGFADVSRGGLSHWMTIEDGKIDN ---------------1111----------------------1111--------------- FQLVVPTTWNLGPRDDKGVPSAAEAALVGTPVADPKRPVEILRTIHSFDPCIACSTH ----3333------1111--------2222---1111--------1111-3333--- >ALCOHOL DEHYDROGENASE, CL; SWP:Q9QYY9; PDB:1E3IA; GTQGKVIKCKAAIAWKTGSPLCIEEIEVSPPKACEVRIQVIATCVCPTDINATDPKKKAL -2222--------------------------2222------------------3333--- FPVVLGHECAGIVESVGPGVTNFKPGDKVIPFFAPQCKRCKLCLSPLTNLCGKLRNFKYP ----------------2222---2222------------3333-------3333----33 TIDQELMEDRTSRFTCKGRSIYHFMGVSSFSQYTVVSEANLARVDDEANLERVCLIGCGF 33----1111-----%%%%----%%%%-----------------11111111-------- 
SSGYGAAINTAKVTPGSTCAVFGLGCVGLSAIIGCKIAGASRIIAIDINGEKFPKAKALG -------------2222---------------------------------------1111 ATDCLNPRELDKPVQDVITELTAGGVDYSLDCAGTAQTLKAAVDCTVLGWGSCTVVGAKV -----3333---3333-------------------------------------------- DEMTIPTVDVILGRSINGTFFGGWKSVDSVPNLVSDYKNKKFDLDLLVTHALPFESINDA -----33331111------%%%%--3333-------------3333------3333---- IDLMKEGKSIRTILTF ---------------- >NADP(H)-DEPENDENT KETOSE ; SWP:O96496; PDB:1E3JA; DNLSAVLYKQNDLRLEQRPIPEPKEDEVLLQMAYVGICGSDVHYYEHGRIADFIVKDPMV --------2222-----------1111--------------------------------- IGHEASGTVVKVGKNVKHLKKGDRVAVEPGVPCRRCQFCKEGKYNLCPDLTFCATPPDDG ------------1111---2222---------------11113333-----2222----- NLARYYVHAADFCHKLPDNVSLEEGALLEPLSVGVHACRRAGVQLGTTVLVIGAGPIGLV --------1111----11113333-------------------2222------------- SVLAAKAYGAFVVCTARSPRRLEVAKNCGADVTLVVDPAKEEESSIIERIRSAIGDLPNV -----1111--------3333---------------1111----------1111------ TIDCSGNEKCITIGINITRTGGTLMLVGMGSQMVTVPLVNACAREIDIKSVFRYCNDYPI ------------------2222--------------3333-------------------- ALEMVASGRCNVKQLVTHSFKLEQTVDAFEAARKKADNTIKVMISCRQ -----------3333-----3333-----------1111--------- >POU domain, class 2, tran; SWP:P14859; PDB:1E3OC; EEPSDLEELEQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNM ------------------------------------------------------------ SKLKPLLEKWLNDAEKRTSIETNIRVALEKSFMENQKPTSEDITLIAEQLNMEKEVIRVW ----3333------------------------------3333------------------ FSNRRQKEKRIN ------1111-- >GUANOSINE PENTAPHOSPHATE ; SWP:Q53597; PDB:1E3PA; NETHYAEAVIDNGAFGTRTIRFETGRLARQAAGSAVAYLDDDTMVLSATTASKNPKDQLD -----------!!!!------------1111-------%%%%-------------3333- FFPLTVDVEERMYAAGKIPGSFFRREGRPSEDAILTCRLIDRPLRPSFKKGLRNEIQVVA ------------1111----1111--------------------11112222-------- TIMALNPDHLYDVVAINAASASTQLAGLPFSGPIGGVRVALIRGQWVAFPTHTELEDAVF -----1111-------------1111---------------iiii-----33331111-- DMVVAGRVLEDGDVAIMMVEAEATEKTIQLVKDGAEAPTEEVVAAGLDAAKPFIKVLCKA 
--------1111-----------1111--------------------------------- QADLAAKAAKPTGEFPVFLDYQDDVLEALSAAVRPELSAALTIAGKQDREAELDRVKALA ---------------------3333-------------1111---3333----------- AEKLLPEFEGREKEISAAYRALTKSLVRERVIAEKKRIDGRGVTDIRTLAAEVEAIPRVH ----------3333----------------------1111-------------------- GSALFERGETQILGVTTLNMLRMEQQLDTLSPVTRKRYMHNYNFPPYSVGETGRVGSPKR ------!!!!---------3333---------------------1111------------ REIGHGALAERAIVPVLPTREEFPYAIRQVSEALGSNGSTSMGSVCASTMSLLNAGVPLK ------------3333--3333-------------------------------------- APVAGIAMGLISQEINGETHYVALTDILGAEDAFGDMDFKVAGTKEFVTALQLDTKLDGI ---------------------------33331111------------------------- PASVLAAALKQARDARLHILDVMMEAIDTPDEMSPNAPRIITVNQIQEDTGAEIYIGAAD 3333-----------------------------1111--------1111----------1 GPAAEAGSVVKTTFGAFVSLLDGLLHLGVGQKVQVEIAEIDSRGK 111-------------------------------------1111- >ALPHA-AMYLASE; SWP:P00692; PDB:1E43A; VNGTLMQYFEWYTPNDGQHWKRLQNDAEHLSDIGITAVWIPPAYKGLSQSDNGYGPYDLY ---------1111----------------------------------1111------111 DLGEFQQKGTVRTKYGTKSELQDAIGSLHSRNVQVYGDVVLNHKAGADATEDVTAVEVNP 1-----iiii--1111------------1111--------------------------11 ANRNQETSEEYQIKAWTDFRFPGRGNTYSDFKWHWYHFDGADWDESRKISRIFKFRGEGK 11------------------1111---------1111----------------------- AWDWEVSSENGNYDYLMYADVDYDHPDVVAETKKWGIWYANELSLDGFRIDAAKHIKFSF -------2222----------------------------------------1111----- LRDWVQAVRQATGKEMFTVAEYWQNNAGKLENYLNKTSFNQSVFDVPLHFNLQAASSQGG ---------------------------------------------------------iii GYDMRKLLNGTVVSKHPLKSVTFVDNHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQ i-3333-222233331111------11112222------3333----------------- VFYGDMYGTKGDSQREIPALKHKIEPILKARKQYAYGAQHDYFDHHDIVGWTREGDSSVA -3333--------------3333--------------------------------1111- NSGLAALITDGPGGAKRMYVGRQNAGETWHDITGNRSEPVVINSEGWGEFHVNGGSVSIY --------------------1111------1111--------1111-------------- VQR --- >L-FUCULOSE 1-PHOSPHATE AL; SWP:P11550; PDB:1E4CP; 
MERNKLARQIIDTCLEMTRLGLNQGTAGNVSVRYQDGMLITPTGIPYEKLTESHIVFIDG ---------------------------------!!!!--------3333-3333----11 NGKHEEGKLPQSEWRFHMAAYQSRPDANAVVHNHAVHCTAVSILNRSIPAIHYMIAAAGG 11--2222--1111---------3333-------------3333-----------1111- NSIPCAPYATFGTRELSEHVALALKNRKATLLQHHGLIACEVNLEKALWLAHEVEVLAQL ---------2222--------1111----------------------------------- YLTTLAITDPVPVLSDEEIAVVLEKF -------------------------- >VANCOMYCIN/TEICOPLANIN A-; SWP:P25051; PDB:1E4EA; NRIKVAILFGGCSEEHDVSVKSAIEIAANINKEKYEPLYIGITKSGVWKMCEKPCAEWEN ------------1111---------1111------------------------------- ENCYSAVLSPDKKMHGLLVKKNHEYEINHVDVAFSALHGKSGEDGSIQGLFELSGIPFVG ----------3333------%%%%--------------2222------------------ CDIQSSAICMDKSLTYIVAKNAGIATPAFWVINKDDRPVAATFTYPVFVKPARSGSSFGV -------------------1111---------1111--3333------------%%%%-- KKVNSADELDYAIESARQYDSKILIEQAVSGCEVGCAVLGNSAALVVGEVDQIRLQYGIF ----3333-------3333----------------------------------------- RIHQEVEPEKGSENAVITVPADLSAEERGRIQETVKKIYKTLGCRGLARVDMFLQDNGRI 3333--3333-----------------------------1111-----------1111-- VLNEVNTLPGFTSYSRYPRMMAAAGISLPELIDRLIVLALK -----------1111-----3333----------------- >Cell division protein Fts; SWP:Q9WZU0; PDB:1E4FT; TVFYTSIDIGSRYIKGLVLGKRDQEWEALAFSSVKSRGLDEGEIKDAIAFKESVNTLLKE ---------------------------------------iiii----------------- LEEQLQKSLRSDFVISFSSVSFEREDTVIERDFGEEKRSITLDILSEMQSEALEKLKENG ------------------------------------------------------------ KTPLHIFSKRYLLDDERIVFNPLDMKASKIAIEYTSIVVPLKVYEMFYNFLQDTVKSPFQ ------------%%%%-----2222----------------------------------- LKSSLVSTAEGVLTTPEKDRGVVVVNLGYNFTGLIAYKNGVPIKISYVPVGMKHVIKDVS ---------------------------3333------iiii------------------- AVLDTSFEESERLIITHGNAVYNDLKEEEIQYRGLDGNTIKTTTAKKLSVIIHARLREIM 1111-----------------------------1111----------------------- SKSKKFFREVEAKIGIPGGVVLTGGGAKIPRINELATEVFKSPVRTGCYANSDRPSIINA -----------------------3333-2222---------------3333--------- DEVANDPSFAAAFGNVFA -33333333--------- 
>BETA-GLUCOSIDASE; SWP:P22073; PDB:1E4IA; TIFQFPQDFMWGTATAAYQIEGAYQEDGRGLSIWDTFAHTPGKVFNGDNGNVACDSYHRY -----1111------3333---1111-------------22222222----!!!!----- EEDIRLMKELGIRTYRFSVSWPRIFPNGDGEVNQKGLDYYHRVVDLLNDNGIEPFCTLYH -------3333--------1111-1111-------------------1111--------- WDLPQALQDAGGWGNRRTIQAFVQFAETMFREFHGKIQHWLTFNEPWCIAFLSNMLGVHA --------------3333--------------2222------------------------ PGLTNLQTAIDVGHHLLVAHGLSVRRFRELGTSGQIGIAPNVSWAVPYSTSEEDKAACAR ------------------------------------------------------------ TISLHSDWFLQPIYQGSYPQFLVDWFAEQGATVPIQDGDMDIIGEPIDMIGINYYSMSVN --------------------------1111-----2222--------------------- RFNPEAGFLQSEEINMGLPVTDIGWPVESRGLYEVLHYLQKYGNIDIYITENGACINDEV --11111111----------1111---3333--------1111----------------- VNGKVQDDRRISYMQQHLVQVHRTIHDGLHVKGYMAWSLLDNFEWAEGYNMRFGMIHVDF iiii--3333--------------1111---------------!!!!------------- RTQVRTPKQSYYWYRNVVSNNWLETRR --------------------------- >Myrosinase MA1; SWP:P29736; PDB:1E4MM; EITCQENLPFTCGNTDALNSSSFSSDFIFGVASSAYQIEGTIGRGLNIWDGFTHRYPNKS -------------3333-3333-1111------3333---2222-----------3333- GPDHGNGDTTCDSFSYWQKDIDVLDELNATGYRFSIAWSRIIPRGKRSRGVNEKGIDYYH 1111----!!!!------------------------3333-11113333----------- GLISGLIKKGITPFVTLFHWDLPQTLQDEYEGFLDPQIIDDFKDYADLCFEEFGDSVKYW ------1111------------3333----!!!!-------------------------- LTINQLYSVPTRGYGSALDAPGRCSPTVDPSCYAGNSSTEPYIVAHHQLLAHAKVVDLYR ----3333----------------11111111---3333--------------------- KNYTHQGGKIGPTMITRWFLPYNDTDRHSIAATERMKEFFLGWFMGPLTNGTYPQIMIDT --3333------------------------------------------------------ VGERLPSFSPEESNLVKGSYDFLGLNYYFTQYAQPSPNPVNSTNHTAMMDAGAKLTYINA !!!!-----------2222-------------------1111---3333---------11 SGHYIGPLFEKDKADSTDNIYYYPKGIYSVMDYFKNKYYNPLIYVTENGISTPGDENRNQ 11---------33331111---3333--------------------------3333---- SMLDYTRIDYLCSHLCFLNKVIKEKDVNVKGYLAWALGDNYEFNKGFTVRFGLSYIDWNN ---3333----------------------------------2222-----------1111 VTDRDLKKSGQWYQSFISP 
------------------- >BETA-DEFENSIN 8; SWP:CAC44635; PDB:1E4RA; NEPVSCIRNGGICQYRCIGLRHKIGTCGSPFKCCK 11113333---------3333------1111---- >BETA-DEFENSIN 1; SWP:Q09753; PDB:1E4SA; DHYNCVSSGGQCLYSACPIFTKIQGTCYRGKAKCCK -----3333--------3333-------3333---- >BETA-DEFENSIN 7; SWP:CAC44542; PDB:1E4TA; NSKRACYREGGECLQRCIGLFHKIGTCNFRFKCCKFQ ----3333---------3333---------------- >TRANSCRIPTIONAL REPRESSOR; SWP:O95628; PDB:1E4UA; MSRSPDAKEDPVECPLCMEPLEIDDINFFPCTCGYQICRFCWHRIRTDENGLCPACRKPY ---------------------1111------------3333--1111------------- PEDPAVYKPLSQEELQRI ------------------ >TAB2; SWP:NA; PDB:1E4XH; QVQLQQPGAELVKPGASVKLSCKASGFTFTNYWMHWVKQRPGQGLEWIGEILPSNGRTNY ------------2222------------1111---------------------------- NEKFKTKATLTVDKSSNTAYMQLSSLTSEDSAVYYCARSPSDYWGQGTTLTVSSAKTTAP 3333---------1111---------3333------------------------------ SVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSS ----------------------------------%%%%---------------------- SVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRVP ----3333------------1111------------ >TAB2; SWP:NA; PDB:1E4XL; DIQMTQTPSSLSASLGDRVTISCRASQDISHYLNWFQQKPDGTVKLLIYYTSTLHSGVPS -------------2222-----------iiii------1111------------222233 RFSGSGSGTDYSLTISNLEEEDIAFYFCQQGGALPFTFGSGTKLAIKRADAAPTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLDSWTDQDSKDSTYSMSSTLT -----------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -----3333--------1111------------- >ADENYLATE KINASE; SWP:P05082; PDB:1E4YA; MRIILLGALVAGKGTQAQFIMEKYGIPQISTGDMLRAAVKSGSELGKQAKDIMDAGKLVT -------2222-3333-----1111-----------------3333-----3333----3 DELVIALVKERIAQEDCRNGFLLDGFPRTIPQADAMKEAGINVDYVLEFDVPDELIVDRI 333----------1111-----------3333----3333-----------3333----- VGRRVHAPSGRVYHVKFNPPKVEGKDDVTGEELTTRKDDQEETVRKRLVEYHQMTAPLIG ------1111-----------2222----------1111--------------------- YYSKEAEAGNTKYAKVDGTKPVAEVRADLEKILG --------------------3333---------- 
>DELTA-AMINOLEVULINIC ACID; SWP:P13716; PDB:1E51A; MQPQSVLHSGYFHPLLRAWQTATTTLNASNLIYPIFVTDVPDDIQPITSLPGVARYGVKR ------3333--------1111-----3333---------------3333---------- LEEMLRPLVEEGLRCVLIFGVPSRVPKDERGSAADSEESPAIEAIHLLRKTFPNLLVACD -1111--3333----------------1111-------3333---------1111----- VCLCPYTSHGHCGLLSGAFRAEESRQRLAEVALAYAKAGCQVVAPSDMMDGRVEAIKEAL ------3333----------3333------------------------2222-------- MAHGLGNRVSVMSYSAKFASCFYGPFRDAAKSSPAFGDRRCYQLPPGARGLALRAVDRDV ----1111----------------3333-------------------------------- REGADMLMVKPGMPYLDIVREVKDKHPDLPLAVYHVSGEFAMLWHGAQAGAFDLKAAVLE -----------1111----------1111------------------------------- AMTAFRRAGADIIITYYTPQLLQWLK --------------1111-------- >EXCINUCLEASE ABC SUBUNIT ; SWP:P07025; PDB:1E52A; LEPDNVPMDMSPKALQQKIHELEGLMMQHAQNLEFEEAAQIRDQLHQLRELFIAAS -3333----------------------3333-----3333---------------- >PHYSALIS MOTTLE VIRUS; SWP:P36351; PDB:1E57A; SPAIVLPFQFEATTFGTAETAAQVSLQTADPITKLTAPYRHAQIVECKAILTPTDLAVSN ------------------------3333-------3333--------------------- PLTVYLAWVPANSPATPTQILRVYGGQSFVLGGAISAAKTIEVPLNLDSVNRMLKDSVTY ------------------------------------------------------------ TDTPKLLAYSRAPTNPSKIPTASIQISGRIRLSKPMLIAN ---------------------------------------- >PHOSPHOGLYCERATE MUTASE; SWP:P31217; PDB:1E58A; AVTKLVLVRGESQWNKENRFTGWYDVDLSEKGVSEAKAAGKLLKEEGYSFDFAYTSVLKR --------------------!!!!-------------------1111------------- AIHTLWNVLDELDQAWLPVEKSWKLNERHYGALQGLNKAETAEKYGDEQVKQWRRGFAVT ---------11111111----3333----!!!!----------------------1111- PPELTKDDERYPGHDPRYAKLSEKELPLTESLALTIDRVIPYWNETILPRMKSGERVIIA ----1111--33331111---3333-------------------------1111------ AHGNSLRALVKYLDNMSEEEILELNIPTGVPLVYEFDENFKPLKRYYLGNADEIAAKAAA --------------------1111--2222------1111---------------1111- VANQGK -1111- >XYLANASE D; SWP:P54865; PDB:1E5BA; TGCSVTATRAEEWSDGFNVTYSVSGSSAWTVNLALNGSQTIQASWNANVTGSGSTRTVTP -----------------------------------!!!!--------------------- NGSGNTFGVTVMKNGSSTTPAATCAGS 
------------iiii----------- >RUBREDOXIN:OXYGEN OXIDORE; SWP:Q9F0J6; PDB:1E5DA; QATKIIDGFHLVGAIDWNSRDFHGYTLSPMGTTYNAYLVEDEKTTLFDTVKAEYKGELLC -----2222------------------1111-------------------1111------ GIASVIDPKKIDYLVIQHLELDHAGALPALIEACQPEKIFTSSLGQKAMESHFHYKDWPV ------1111---------1111------------------------------------- QVVKHGETLSLGKRTVTFYETRMLHWPDSMVSWFADEKVLISNDIFGQNIAASERFSDQI ---2222-------------2222----------1111---!!!!----------1111- PVHTLERAMREYYANIVNPYAPQTLKAIETLVGAGVAPEFICPDHGVIFRGADQCTFAVQ ----------------3333---------------------------------------- KYVEYAEQKPTNKVVIFYDSMWHSTEKMARVLAESFRDEGCTVKLMWCKACHHSQIMSEI ----------------------------------------------1111---------1 SDAGAVIVGSPTHNNGILPYVAGTLQYIKGLRPQNKIGGAFGSFGWSGESTKVLAEWLTG 111---------%%%%------------------------------------------11 MGFDMPATPVKVKNVPTHADYEQLKTMAQTIARALKAKLAA 11--------------------------------------- >METHIONINE GAMMA-LYASE; SWP:O15564; PDB:1E5EA; ERMTPATACIHANPQKDQFGAAIPPIYQTSTFVFDNCQQGGNRFAGQESGYIYTRLGNPT ---3333---------1111----------------------1111------3333---- VSNLEGKIAFLEKTEACVATSSGMGAIAATVLTILKAGDHLISDECLYGCTHALFEHALT -----------------------------------2222--------------------- KFGIQVDFINTAIPGEVKKHMKPNTKIVYFETPANPTLKIIDMERVCKDAHSQEGVLVIA ---------3333---3333-1111----------------------------------- DNTFCSPMITNPVDFGVDVVVHSATKYINGHTDVVAGLICGKADLLQQIRMVGIKDITGS --3333----3333--------33333333------------------------------ VISPHDAWLITRGLSTLNIRMKAESENAMKVAEYLKSHPAVEKVYYPGFEDHEGHDIAKK -------------------------------------1111----3333--2222----- QMRMYGSMITFILKSGFEGAKKLLDNLKLITLAVSLGGCESLIQHPASMTHAVVPKEERE ------------1111-------1111-----------------33331111-------1 AAGITDGMIRLSVGIEDADELIADFKQGLDALLR 111------------------------------- >MOLYBDOPTERIN-GUANINE DIN; SWP:P32173; PDB:1E5KA; MTTITGVVLAGGKARRMGGVDKGLLELNGKPLWQHVADALMTQLSHVVVNANRHQEIYQA --------------------3333--iiii3333-------------------3333-11 SGLKVIEDSLADYPGPLAGMLSVMQQEAGEWFLFCPCDTPYIPPDLAARLNHQRKDAPVV 
11------3333-----------------------1111---1111-------iiii--- WVHDGERDHPTIALVNRAIEPLLLEYLQAGERRVMVFMRLAGGHAVDFSDHKDAFVNVNT --------1111---3333-------------------1111-----1111-1111---3 PEELARWQ 3331111- >BETA KETOACYL ACYL CARRIE; SWP:P73283; PDB:1E5MA; KKRVVVTGLGAITPIGNTLQDYWQGLMEGRNGIGPITRFDASDQACRFGGEVKDFDATQF -------------------------1111-----------1111-----------1111- LDRKEAKRMDRFCHFAVCASQQAINDAKLVINELNADEIGVLIGTGIGGLKVLEDQQTIL -----11113333------------------3333------------------------- LDKGPSRCSPFMIPMMIANMASGLTAINLGAKGPNNCTVTACAAGSNAIGDAFRLVQNGY ---3333-1111---------------------------!!!!----------------- AKAMICGGTEAAITPLSYAGFASARALSFRNDDPLHASRPFDKDRDGFVMGEGSGILILE -----------------------------11111111-2222------------------ ELESALARGAKIYGEMVGYAMTCDAYHITAPVPDGRGATRAIAWALKDSGLKPEMVSYIN -----1111----------------------1111-----------1111-3333----- AHGTSTPANDVTETRAIKQALGNHAYNIAVSSTKSMTGHLLGGSGGIEAVATVMAIAEDK -----3333-------------3333-------------!!!!----------------- VPPTINLENPDPECDLDYVPGQSRALIVDVALSNSFGFGGHNVTLAFKKYQ ----------1111------------------------------------- >APHRODISIN; SWP:P09465; PDB:1E5PA; FAELQGKWYTIVIAADNLEKIEEGGPLRFYFRHIDCYKNCSEEITFYVITNNQCSKTTVI 1111------------3333-2222------------%%%%--------%%%%------- GYLKGNGTYETQFEGNNIFQPLYITSDKIFFTNKNDRAGQETNIVVAGKGNALTPEENEI ---1111-----------------1111-------1111--------------------- LVQFAHEKKIPVENILNILATDTCPE ----------1111---3333----- >PROLINE OXIDASE; SWP:O09345; PDB:1E5RA; MRSHILGKIELDQTRLAPDLAYLAAVPTVEEFSNGFWKHVPLWNAPTAHVEHVPYLKEIV ------------3333------------------------------3333---3333--- TTVFDGTHLQMARSRNLKNAIVIPHRDFRYFRTFMVLEDSPLAFHSNEDTVIHMRPGEIW -----1111------------------------------1111---!!!!----2222-- FLDAATVHSAVNFSEISRQSLCVDFAFDGPFDEKEIFADATLYAPGSTPDLPERRPFTAE --3333-------------------------3333---3333------------------ HRRRILSLGQVIERENFRDILFLLSKVHYKYDVHPSETYDWLIEISKQAGDEKMVVKAEQ ------------1111---------3333----3333---------1111---------- IRDFAVEARALSERFSLTSW -------------------- 
>MOESIN; SWP:P26038; PDB:1E5WA; MPKTISVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWFFGLQYQDTKGFSTWLK ----------1111------1111----------------3333------1111-----1 LNKKVTAQDVRKESPLLFKFRAKFYPEDVSEELIQDITQRLFFLQVKEGILNDDIYCPPE 1111111--------------------3333--------------------------333 TAVLLASYAVQSKYGDFNKEVHKSGYLAGDKLLPQRVLEQHKLNKDQWEERIQVWHEEHR 3-----------------------1111-----3333---2222------------1111 GMLREDAVLEYLKIAQDLEMYGVNYFSIKNKKGSELWLGVDALGLNIYEQNDRLTPKIGF -------------33331111--------1111-------1111----1111-------- PWSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRILALCMGNHELYMRRRKPDTI 3333-----------------------------3333---------------3333---- EVQQMKAQAREEKHQKQMERAMLENEKKKREMAEKEKEKIEREKEE -----------------3333--1111-3333-------------- >THREONINE SYNTHASE; SWP:Q39144; PDB:1E5XA; IETAVKPPHRTEDNIRDENAVNPFSAKYVPFNAAPGSTESYSLDEIVYRGLLDVEHDEAL -----1111---1111-----------------------1111--------------333 KRFDGAYWRDLFDSRVGKSTWPYGSGVWSKKEWVLPEIDDDDIVSAFEGNSNLFWAERFG 3-------------2222-----------3333-11113333--------------3333 KQFLGNDLWVKHCGISHTGSFKDLGTVLVSQVNRLRKKRPVVGVGCASTGDTSAALSAYC -----------11111111----------------------------------------- ASAGIPSIVFLPANKISAQLVQPIANGAFVLSIDTDFDGCKLIREITAELPIYLANSLNS 1111-------1111--1111--------------3333-----3333---------333 LRLEGQKTAAIEILQQFDWQVPDWVIVPGGNLGNIYAFYKGFKCQELGLVDRIPRVCAQA 3-------------1111------------3333----------1111-----------2 ANANPLYLHYKSGWKDFKPVSIDRAVYALKKCNGIVEEATEEELDAAQADSTGFICPHTG 222----------1111------------1111------3333------1111--3333- VALTALFKLRNQGVIAPTDRTVVVSTAHGLKFTQSKIDYHSNAIPDACRFSNPPVDVKAD ---------3333--1111--------3333-------1111-----1111--------3 FGAVDVLKSYLGSNTLTS 333--------------- >CHROMOSOME SEGREGATION SM; SWP:Q9X0R4; PDB:1E69A; MRLKKLYLKGFKSFGRPSLIGFSDRVTAIVGPNGSGKSNIIDAIKWVFGEKFDMIFAGSE ----------!!!!-----------------3333------------------------- NLPPAGSAYVELVFEENGEEITVARELKRTGENTYYLNGSPVRLKDIRDRFAGTGLGVDF ---------------------------3333-----iiii--3333-------------3 
YSIVGQGQIDRIVNAYQRVNESFNRFISLLFFGGEGRLEISIRKPGRRDQKLSLLSGGEK 333-----------------------------------------------3333------ ALVGLALLFALMEIKPSPFYVLDEVDSPLDDYNAERFKRLLKENSKHTQFIVITHNKIVM ----------3333---------------3333---------3333------------33 EAADLLHGVTMVNGVSAIVPVEV 33--------------------- >GLUTATHIONE S-TRANSFERASE; SWP:Q9ZVQ3; PDB:1E6BA; KLKLYSYWRSSCAHRVRIALALKGLDYEYIPVNLLKGDQFDSDFKKINPMGTVPALVDGD ------1111----------1111--------33331111-3333--3333------!!! VVINDSFAIIMYLDEKYPEPPLLPRDLHKRAVNYQAMSIVLSGIQPTAWVNNAITKGFTA !----------------------------------------------------------- LEKLLVNCAGKHATGDEIYLADLFLAPQIHGAINRFQINMEPYPTLAKCYESYNELPAFQ ----1111----------3333-----------------3333-------1111--3333 NALPEKQPDAPSST --33331111---- >SHIKIMATE KINASE; SWP:P10880; PDB:1E6CA; MTEPIFMVGARGCGMTTVGRELARALGYEFVDTDIFMQHTSGMTVADVVAAEGWPGFRRR -----------------------1111--------------------------------- ESEALQAVATPNRVVATGGGMVLLEQNRQFMRAHGTVVYLFAPAEELALRLQASLQAHQR -----------------1111---------------------3333-------------- PTLTGRPIAEEMEAVLREREALYQDVAHYVVDATQPPAAIVCELMQTMRL -1111----------------------------------------1111- >SPECTRIN ALPHA CHAIN; SWP:P07751; PDB:1E6GA; TGKELVLVLYDYQEKSPRELTIKKGDILTLLNSTNKDWWKVEVNDRQGFIPAAYLKKLD ---------------1111-------------------------------3333----- >SPECTRIN ALPHA CHAIN; SWP:P07751; PDB:1E6HA; DETGKELVLVLYDYQEKSPREVTIKKGDILTLLNSTNKDWWKIEVNDRQGFVPAAYLKKL -----------------1111---2222--------1111------------3333---- D - >TRANSCRIPTIONAL ACTIVATOR; SWP:Q03330; PDB:1E6IA; RGPHDAAIQNILTELQNHAAAWPFLQPVNKEEVPDYYDFIKEPMDLSTMEIKLESNKYQK -1111------------11111111---33331111----------------1111---3 MEDFIYDARLVFNNCRMYNGENTSYYKYANRLEKFFNNKVKEIPEYSHLID 333----------------1111----------------11111111---- >EG628498 protein; SWP:A0A5E0; PDB:1E6OH; EVQLQQSGAELARPGASVKMSCKASGYTFTSYTMHWVKQRPGQGLEWIGYINPSSGYSNY ------------2222-----------1111--------2222----------------- NQKFKDKATLTADKSSSTAYMQLSSLTSEDSAVYYCSRPVVRLGYNFDYWGQGSTLTVSS 
3333---------1111---------3333------------------------------ AKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ----------------3333--------------------%%%%--1111-------%%% LYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVP %---------1111------------1111--------- >GDP-FUCOSE SYNTHETASE; SWP:P32055; PDB:1E6UA; AKQRVFIAGHRGMVGSAIRRQLEQRGDVELVLRTRDELNLLDSRAVHDFFASERIDQVYL --------1111--------33331111-----3333-3333------------------ AAAKVGGIVANNTYPADFIYQNMMIESNIIHAAHQNDVNKLLFLGSSCIYPKLAKQPMAE ---------------------------------1111--------1111---------33 SELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRSVMPTNLYGPHDNFHPSNSHVI 33------3333----------------------------------2222--1111---- PALLRRFHEATAQKAPDVVVWGSGTPMREFLHVDDMAAASIHVMELAHEVWLENTQPMLS ---------------------------------------------------11111111- HINVGTGVDCTIRELAQTIAKVVGYKGRVVFDASKPDGTPRKLLDVTRLHQLGWYHEISL -------------------------------------------------1111------- EAGLASTYQWFLENQ ----------1111- >METHYL-COENZYME M REDUCTA; SWP:Q49605; PDB:1E6VA; LFMKALKEKFEESPEEKYTKFYIFGGWKQSERKKEFKEWADKIVEERGVPHYNPDIGVPL ------------3333---------1111-----------------------1111---- GQRKLMSYQVSGTDVFVEGDDLTFVNNAAMQQMWDDIRRTVIVGMDTAHRVLERRLGKEV ---------2222----3333-3333----------1111-------------------- TPETINEYMETLNHALPGGAVVQEHMVEIHPGLTWDCYAKIITGDLELADEIDDKFLIDI --------------1111-----------3333-----------3333----1111--33 EKLFPEEQAEQLIKAIGNRTYQVCRMPTIVGHVCDGATMYRWAAMQIAMSFICAYKIAAG 33------------------------3333----1111---------------------- EAAVSDFAFASKHAEVINMGEMLPARRARGENEPGGVPFGVLADCVQTMRKYPDDPAKVA 3333-------------------3333-----3333------33333333-1111----- LEVIAAGAMLYDQIWLGSYMSGGVGFTQYATAVYPDNILDDYVYYGLEYVEDKYGIAEAE -----------------1111----11113333---------------------2222-- PSMDVVKDVATEVTLYGLEQYERYPAAMETHFGGSQRAAVCAAAAGCSTAFATGHAQAGL -----------------3333--------------------------------------- NGWYLSQILHKEGQGRLGFYGYALQDQCGAANSLSVRSDEGLPLELRGPNYPNYAMNVGH ------------------1111-------1111---------3333-11111111----3 
LGEYAGIVQAAHAARGDAFCVHPVIKVAFADENLVFDFTEPRKEFAKGALREFEPAGERD 333-----------------------11111111--3333-------1111-------33 LIVPA 33--- >Methyl coenzyme M reducta; SWP:Q49601; PDB:1E6VB; DTVDLYDDRGNCVAEEVPIEVLSPMRNEAIQSIVNDIKRTVAVDLEGIENALQNATVGGK ------1111--------11113333--------------------------------ii GMKIPGREMDVDIVDNAEAIADEIEKMIRVYQDDDTNVEPMYDGKRLLVQLPSERVKVMA ii-2222-----3333-------------------------iiii------3333----- DPYSGTLQAGMAVVHAIIDVCEVDMWDANMVKAAVFGRYPQTIDYFGGNVASMLDVPMKQ 1111-------------------1111--------!!!!-----2222-------1111- EGVGYALRNIMVNHIVAATRKNTMQAVCLAATLQQTAMFEMGDALGPFERLHLLGYAYQG -22221111---------%%%%----------------1111--!!!!------------ LNADNMVYDIVKKHGKEGTVGTVVREVVERALEDGVIEVKEELPSFKVYKANDMDLWNAY -------------------------------1111------------------------- AAAGLVAAVMVNQGAARAAQGVSATILYYNDLLEYETGLPGVDFGRAEGTAVGFSFFSHS -------------3333---3333----------------2222---------------- IYGGGGPGIFHGNHIVTRHSKGFAIPPVAAAMALDAGTQMFSPEVTSKLIGDVFGEIDEF -----3333-1111---------------3333--------1111-------3333-333 REPMKYITEAAAEEAK 3----------3333- >Methyl coenzyme M reducta; SWP:Q49604; PDB:1E6VC; FYYPGETDVAENRRKYMNPNYELKKLREIPDEDIVRLMGHREPGEEYPSVHPPLEEMEEP --------------3333-----------3333--------3333-------1111---- ECPIRELVEPTEGAKAGDRIRYIQFTDSVYFAPIHPYIRARMYMWRYRGVDTGSLSGRQI -3333-------------------------------------------------1111-- IEVRERDLEKIAKELLETEIFDPARSGVRGATVHGHALRLDENGLMLHALRRYRLNEETG -----------------3333-------------1111--1111---1111--------- EVEYVKDQVGIELDEPIPVGAPADEDDLKERTTIYRIDGTPYREDEELLQVVQRIHELRT ------1111-------------------------1111-3333---------------- LAGYRPEE -------- >SHORT CHAIN 3-HYDROXYACYL; SWP:O70351; PDB:1E6WA; SVKGLVAVITGGASGLGLSTAKRLVGQGATAVLLDVPNSEGETEAKKLGGNCIFAPANVT -2222-----1111---------------------3333---------1111-----111 SEKEVQAALTLAKEKFGRIDVAVNCAGIAVAIKTYHEKKNQVHTLEDFQRVINVNLIGTF 1-----------------------------------3333-------------------- NVIRLVAGVMGQNEPDQGGQRGVIINTASVAAFEGQVGQAAYSASKGGIVGMTLPIARDL 
---------1111--1111----------------2222--------------------3 APIGIRVVTIAPGLFATPLLTKVRNFLASQVPFPSRLGDPAEYAHLVQMVIENPFLNGEV 333-------------3333---33331111-------3333----------1111---- IRLDGAIRMQP ---iiii---- >METHYL-COENZYME M REDUCTA; SWP:P07962; PDB:1E6YA; AADIFSKFKKDMEVKFAQEFGSNKQTGGDITDKTAKFLRLGPEQDPRKVEMIKAGKEIAE ----------------------------1111---------------------------- KRGIAFYNPMMHSGAPLGQRAITPYTISGTDIVCEPDDLHYVNNAAMQQMWDDIRRTCIV -------1111---------------2222----3333-3333----------------- GLDMAHETLEKRLGKEVTPETINHYLEVLNHAMPGAAVVQEMMVETHPALVDDCYVKVFT -------------------------------3333-----------33331111------ GDDALADEIDKQFLIDINKEFSEEQAAQIKASIGKTSWQAIHIPTIVSRTTDGAQTSRWA -333311113333--3333--------------------------------3333----- AMQIGMSFISAYAMCAGEAAVADLSFAAKAALVSMGEMLPARARGPNEPGGLSFGHLSDI -----------------3333--------------------------3333--------- VQTSRVSEDPAKIALEVVGAGCMLYDQIWLGSYMSGGVGFTQYATAAYTDDILDNNTYYD -3333--------------------------1111----11113333------------- VDYINDKYNGAATVGKDNKVKASLEVVKDIATESTLYGIETYEKFPTALEDHFGGSQRAT -------%%%%----------------------------------3333----------- VLAAAAGVACSLATGNANAGLSGWYLSMYLHKEAWGRLGFFFDLQDQGATNVLSYQGDEG ------------------------------------------3333-3333----1111- LPDELRGPNYPNYAMNVGHQGGYAGIAQAAHSGRGDAFTVNPLLKVCFADDLLPFNFAEP -3333-11111111-----------------1111--------------3333--1111- RREFGRGAIREFVPAGERSLVIPA ------1111-------3333--- >Methyl-coenzyme M reducta; SWP:P07955; PDB:1E6YB; SDTVDIYDDRGKLLESNVDIMSLAPTRNAAIQSIIMDTKRSVAVNLAGIQGALASGKMGG -------1111-------3333-1111--------------------------------2 KGRQILGRGLNYDIVGNADAIAENVKKLVQVDEGDDTNVIKVKGGKSLLIQSPKSRIIAG 222----------3333--------------2222-------iiii-------------- ADFMSATTVGAAAVTQTIMDMFGTDPYDAPIVKSAVWGSYPQTMDLMGGQVQGILSIPQN -------------------1111-1111--------!!!!-----2222-------3333 NEGLGFSLRNIMANHVAAISNRNAMNASALSSIYEQSGIFEMGGAVGMFERHQLLGLAYQ -----1111--3333----%%%%----------------------!!!!----------- 
GLNANNLLYDIVKENGKDGTIGTVIESVVRRAIEAGIISVDKTAPSGYNFYKANDVPKWN ------------1111---------------------------3333------------- ACAAVGTLAATLVNCGAGRAAQNVSSTLLYFNDILEKETGLPGCDYGKVEGTAVGFSFFS -------------------3333-------------------2222-------------- HSIYGGGGPGVFNGNHVVTRHSRGFAIPCVCAAVALDAGTQMFSIESTSGLIGDVFGAIP -------3333-1111---------3333----1111------3333--------11113 EFREPIKAVAGV 333--------- >Methyl-coenzyme M reducta; SWP:P07964; PDB:1E6YC; AYERQYYPGATSVAANRRKHMSGKLEKLREISDEDLTAVLGHRAPGSDYPSTHPPLAEMG -------------------------------------------2222-------3333-- EPASTRENVAATPGAAAGDRVRYIQFADSMYNAPATPYFRSYFAAINFRGVDPGTLSGRQ ---3333------------------------------------------------1111- IVEARERDMEQCAKVQMETEITDHALAGVRGATVHGHSVRLQEDGVMFDMLDRRRLENGT ------------------3333-------------1111--1111---1111----iiii IIMDKDQVAIPLDRKVDLGKPMSSEEAAKRTTIYRVDNVAFRDDAEVVEWVHRIFDQRTK -----1111-------------------------1111-3333----------------- FGFQPK ------ >ATP synthase delta chain,; SWP:P05630; PDB:1E79H; QMSFTFASPTQVFFNSANVRQVDVPTQTGAFGILAAHVPTLQVLRPGLVVVHAEDGTTSK ------------------------------------------------------------ YFVSSGSVTVNADSSVQLLAEEAVTLDMLDLGAAKANLEKAQSELLGAADEATRAEIQIR ------------------------3333----------------1111------------ IEANEALVKAL ----------- >RECOMBINATION ENDONUCLEAS; SWP:P13340; PDB:1E7DA; MLLTGKLYKEEKQKFYDAQNGKCLICQRELNPDVQANHLDHDHELNGPKAGKVRGLLCNL ----3333------------------------1111----------1111---------- CNAAEGQMKHKFNRSGLKGQGVDYLEWLENLLTYLKSDYTQNNIHPNFVGDKSKEFSRLG ----------------3333-------------1111-2222--1111-----------3 KEEMMAEMLQRGFEYNESDTKTQLIASFKKQLRKSLK 333------------------------------1111 >RECOMBINATION ENDONUCLEAS; SWP:P13340; PDB:1E7LA; MLLTGKLYKEEKQKFYDAQNGKCLICQRELNPDVQANHLDHDHELNGPKAGKVRGLLCNL ---!!!!---------1111------------1111----------1111---------- CDAAEGQMKHKFNRSGLKGQGVDYLEWLENLLTYLKSDYTQNNIHPNFVGDKSKEFSRLG ---------------3333-------------------1111--3333-------1111- KEEMMAEMLQRGFEYNESDTKTQLIASFKK --------1111---1111----------- 
>FUMARATE REDUCTASE FLAVOP; SWP:P17412; PDB:1E7PA; MKVQYCDSLVIGGGLAGLRAAVATQQKGLSTIVLSLIPVKRSHSAAAQGGMQASLGNSKM ---------------33333333-------------------3333-------------- SDGDNEDLHFMDTVKGSDWGCDQKVARMFVNTAPKAIRELAAWGVPWTRIHKGDRMAIIN 2222-----------------------------------1111----------------- AQKTTITEEDFRHGLIHSRDFGGTKKWRTCYTADATGHTMLFAVANECLKLGVSIQDRKE -----------2222--------------------------------3333--------- AIALIHQDGKCYGAVVRDLVTGDIIAYVAKGTLIATGGYGRIYKNTTNAVVCEGTGTAIA ------%%%%----------------------------1111------------------ LETGIAQLGNMEAVQFHPTPLFPSGILLTEGCRGDGGILRDVDGHRFMPDYEPEKKELAS 1111-----1111----------------3333--------------------------- RDVVSRRMIEHIRKGKGVQSPYGQHLWLDISILGRKHIETNLRDVQEICEYFAGIDPAEK -------------------------------------------------------1111- WAPVLPMQHYSMGGIRTDYRGEAKLKGLFSAGEAACWDMHGFNRLGGNSVSEAVVAGMIV -----------------1111----------3333----!!!!----1111--------- GEYFAEHCANTQVDLETKTLEKFVKGQEAYMKSLVESKGTEDVFKIKNRMKDVMDDNVGI ---------------3333----------------------3333--------------- FRDGPHLEKAVKELEELYKKSKNVGIKNKRLHANPELEEAYRVPMMLKVALCVAKGALDR --3333----------1111--------------3333---------------------- TESRGAHNREDYPKRDDINWLNRTLASWPNPEQTLPTLEYEALDVNEMEIAPGYRGYGAK ---!!!!-1111------------------------------------------------ GNYIENPLSVKRQEEIDKIQSELEAAGKDRHAIQEALMPYELPAKYKARNERLGD -----3333--------------3333-3333----------------------- >PHOSPHATIDYLINOSITOL 3-KI; SWP:O02697; PDB:1E7UA; ASEETLAFQRQLNALIGYDVTDVSNVHDDELEFTRRRLVTPRMAEVAGRDPKLYAMHPWV ------------------1111-------------------------------------- TSKPLPEYLLKKITNNCVFIVIHRSTTSQTIKVSADDTPGTILQSFFTKMAKNERDFVLR -----11113333%%%%----------------1111----------------------- VCGRDEYLVGETPIKNFQWVRQCLKNGEEIHLVLDTPPDPALDEVRKETVSLWDCDRKFR 2222--------1111-------------------------------------------- VKIRGIDIPVLPRTADLTVFVEANIQYGQQVLCQRRTSPKPFTEEVLWNVWLEFSIKIKD --------------------------%%%%--------------------------3333 LPKGALLNLQIYCGAKQLLYYVNLLLIDHRFLLRHGEYVLHMWQLSGKGFNADKLTSATN 
-2222----------------------1111-------------------3333------ PDKENSMSISILLDNYCHPIALPKHRPTDRVRAEMPNQLRKQLEAIIATDPLNPLTAEDK -3333--------------------------------------------3333------- ELLWHFRYESLKDPKAYPKLFSSVKWGQQEIVAKTYQLLAKREVWDQSALDVGLTMQLLD ------3333--3333---1111-1111---------3333-------------3333-1 CNFSDENVRAIAVQKLESLEDDDVLHYLLQLVQAVKFEPYHDSALARFLLKRGLRNKRIG 111------------11113333---------3333------------------------ HFLFWFLRSEIAQSRHYQQRFAVILEAYLRGCGTAMLHDFTQQVQVIDMLQKVTIDIKSL ----------------------------------------------------------11 SAEKYDVSSQVISQLKQKLENLQNLNLPQSFRVPYDPGLKAGALVIEKCKVMASKKKPLW 11------------------------------1111--------3333-----1111--- LEFKCADPTALSNETIGIIFKHGDDLRQDMLILQILRIMESIWETESLDLCLLPYGCIST ------1111---------------------------------1111------------- GDKIGMIEIVKDATTIAKIQQSTVGNTGAFKDEVLSHWLKEKCPIEEKFQAAVERFVYSC ------------------------------------------------------------ AGYCVATFVLGIGDRHNDNIMISETGNLFHIDFGHINKERVPFVLTPDFLFVMGTSGKKT ---------------1111---1111---------------------------------- SLHFQKFQDVCVKAYLALRHHTNLLIILFSMMLMTGMPQLTSKEDIEYIRDALTVGKSEE ------------------------------------------------------------ DAKKYFLDQIEVCRDKGWTVQFNWFLHLVLGI -------------------------------- >PTERIDINE REDUCTASE; SWP:Q9U1F8; PDB:1E7WA; TVPVALVTGAAKRLGRSIAEGLHAEGYAVCLHYHRSAAEANALSATLNARRPNSAITVQA ----------------------1111---------------------------------- DLSNVATAPVSSAPVTLFTRCAELVAACYTHWGRCDVLVNNASSFYPTPLLREAMETATA ---------------------------------------------------3333----- DLFGSNAIAPYFLIKAFAHRVAGTPAKHRGTNYSIINMVDAMTNQPLLGYTIYTMAKGAL ---1111-------------11113333-----------1111---2222---------- EGLTRSAALELAPLQIRVNGVGPGLSVLVDDMPPAVWEGHRSKVPLYQRDSSAAEVSDVV -----------1111------------3333--------------------3333----- IFLCSSKAKYITGTCVKVDGGYSLTRA ----3333----------iiii----- >CYTOCHROME C'; SWP:P00138; PDB:1E85A; FAKPEDAVKYRQSALTLMASHFGRMTPVVKGQAPYDAAQIKANVEVLKTLSALPWAAFGP --3333---------------3333--1111------------------11113333-22 
GTEGGDARPEIWSDAASFKQKQQAFQDNIVKLSAAADAGDLDKLRAAFGDVGASCKACHD 22-!!!!-3333------------------------------------------------ AYRK ---- >EARLY ACTIVATION ANTIGEN ; SWP:Q07108; PDB:1E87A; SSCSEDWVGYQRKCYFISTVKRSWTSAQNACSEHGATLAVIDSEKDMNFLKRYAGREEHW ---1111--%%%%------------------1111------------------------- VGLKKEPGHPWKWSNGKEFNNWFNVTGSDKCVFLKNTEVSSMECEKNLYWICNKPYK -----2222---1111------------------1111----1111----------- >FIBRONECTIN; SWP:P02751; PDB:1E88A; YGHCVTDSGVVYSVGMQWLKTQGNKQMLCTCLGNGVSCQETAVTQTYGGNSNGEPCVLPF ------------------------------------------------------------ TYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQKYSFCTDHTVLVQTRGGNSNGALCHFPF --------------------------3333-------%%%%------------------- LYNNHNYTDCTSEGRRDNMKWCGTTQNYDADQKFGFCPMA -iiii------2222------------1111--------- >HYDROXYNITRILE LYASE; SWP:P52705; PDB:1E89A; MVTAHFVLIHTICHGAWIWHKLKPALERAGHKVTALDMAASGIDPRQIEQINSFDEYSEP ----------22223333--------1111-------2222-----1111--3333---- LLTFLEKLPQGEKVIIVGEACAGLNIAIAADRYVDKIAAGVFHNSLLPDTVHSPSYTVEK ----11112222--------------------1111----------------1111---- LLESFPDWRDTEYFTFTNITGETITTMKLGFVLLRENLFTKCTDGEYELAKMVMRKGSLF -------!!!!------1111-----------------11113333-------------- QNVLAQRPKFTEKGYGSIKKVYIWTDQDKIFLPDFQRWQIANYKPDKVYQVQGGDHKLQL ---1111-------1111------1111---3333---------------------1111 TKTEEVAHILQEVADAYA ------------------ >S100A12; SWP:P80511; PDB:1E8AA; TKLEEHLEGIVNIFHQYSVRKGHFDTLSKGELKQLLTKELANTIKNIKDKAVIDEIFQGL ----------------1111--1111-------------1111--1111----------- DANQDEQVDFQEFISLVAIALKAAHYH 1111----3333-----------3333 >UDP-N-ACETYLMURAMOYLALANY; SWP:P22188; PDB:1E8CA; RNLRDLLAPWVPDAPSRALRETLDSRVAAAGDLFVAVVGHQADGRRYIPQAIAQGVAAII ------11111111---------3333-2222---------------------------- AEAKDEATDGEIREHGVPVIYLSQLNERLSALAGRFYHEPSDNLRLVGVTGTNGKTTTTQ --2222-2222------------3333-----------1111------------------ LLAQWSQLLGEISAVGTVGNGLLGKVIPTENTTGSAVDVQHELAGLVDQGATFCAEVSSH ------1111-----1111--2222---------3333---------------------- 
GLVQHRVAALKFAASVFTNLSRDHLDYHGDEHYEAAWLLYSEHHCGQAIINADDEVGRRW ------1111-------------3333---------3333----------1111------ LAKLPDAVAVSEDHINPNCHGRWLKATEVNYHDSGATIRFSSSWGDGEIESHLGAFNVSN ---1111--------1111----------------------1111--------3333--- LLLALATLLALGYPLADLLKTAARLQPVCGREVFTAPGKPTVVVDYAHTPDALEKALQAA --------1111--------3333---2222----2222-------------------33 RLHCAGKLWCVFGCGGDRDKGKRPLGAIAEEFADVAVVTDDNPRTEEPRAIINDILAGLD 33----------------3333--------------------!!!!-------------3 AGHAKVEGRAEAVTCAVQAKENDVVLVAGKGHEDYQIVGNQRLDYSDRVTVARLLGVIAR 333----------------1111------!!!!-------------------------33 SH 33 >VANILLYL-ALCOHOL OXIDASE; SWP:P56216; PDB:1E8GA; EFRPLTLPPKLSLSDFNEFIQDIIRIVGSENVEVISVDGSYMKPTHTHDPTHVMDQDYFL -------2222----------------3333--------3333-----------2222-- ASAIVAPRNVADVQSIVGLANKFSFPLWPISIGRNSGYGGAAPRVSGSVVLDMGKNMNRV ----------------------------------2222!!!!--2222---3333----- LEVNVEGAYCVVEPGVTYHDLHNYLEANNLRDKLWLDVPDLGGGSVLGNAVERGVGYTPY ----1111----3333---------1111-----------1111-------------111 GDHWMMHSGMEVVLANGELLRTGMGALPDPKRPETMGLKPEDQPWSKIAHLFPYGFGPYI 13333--------1111------1111----3333---3333---1111----------3 DGLFSQSNMGIVTKIGIWLMPNPGGYQSYLITLPKDGDLKQAVDIIRPLRLGMALQNVPT 333-------------------------------3333---------------------- IRHILLDAAVLGDKRSYSSRTEPLSDEELDKIAKQLNLGRWNFYGALYGPEPIRRVLWET ------------3333---------------------------------3333------- IKDAFSAIPGVKFYFPEDTPENSVLRVRDKTMQGIPTYDELKWIDWLPNGAHLFFSPIAK -------2222---3333-1111------1111----33333333--------------- VSGEDAMMQYAVTKKRCQEAGLDFIGTFTVGMREMHHIVCIVFNKKDLIQKRKVQWLMRT -----------------1111----------------------1111------------- LIDDCAANGWGEYRTHLAFMDQIMETYNWNNSSFLRFNEVLKNAVDPNGIIAPGKSGVWP -----1111------1111----3333-%%%%-------------1111--2222----1 SQYSHVTWKL 1113333--- >ENDOGLUCANASE; SWP:CAB92325; PDB:1E8PA; ASCWAQSQGYNCCNNPSSTKVEYTDASGQWGVQNGQWCGIDYSYGQ --3333--------1111------1111----%%%%----1111-- >ENDO-1,4-BETA-XYLANASE; SWP:P14768; PDB:1E8RA; 
MGNQQCNWYGTLYPLCVTTTNGWGWEDQRSCIARSTCAAQPAPFGIVGSG ---------------3333------%%%%----3333------------- >PHOSPHATIDYLINOSITOL 3-KI; SWP:P48736; PDB:1E8YA; MSEESQAFQRQLTALIGYDVTDVSNVHDDELEFTRRGLVTPRMAEVASRDPKLYAMHPWV ------------------1111-------------------------------------- TSKPLPEYLWKKIANNCIFIVIHRSTTSQTIKVSPDDTPGAILQSFFTKMAEQDFVLRVC -----11111111----------!!!!------1111333333333333---------22 GRDEYLVGETPIKNFQWVRHCLKNGEEIHVVLDTPPDPALDEVRKEECDRKFRVKIRGID 22--------3333-------1111-----------3333-------------------- IPVLPRNTDLTVFVEANIQHGQQVLCQRRTSPKPFTEEVLWNVWLEFSIKIKDLPKGALL -------------------------------------------------3333-2222-- NLQIYCLLYYVNLLLIDHRFLLRRGEYVLHMWQISGFNADKLTSATNPDKENSMSISILL ----------------1111------------------3333------------------ DNHPIARAEMPNQLRKQLEAIIATDPLNPLTAEDKELLWHFRYESLKHPKAYPKLFSSVK ------------------------1111---------------33333333---1111-1 WGQQEIVAKTYQLLARREVWDQSALDVGLTMQLLDCNFSDENVRAIAVQKLESLEDDDVL 111-------------3333---------3333-1111------------11113333-- HYLLQLVQAVKFEPYHDSALARFLLKRGLRNKRIGHFLFWFLRSEIAQSRHYQQRFAVIL --------3333------------------------------------------------ EAYLRGCGTAMLHDFTQQVQVIEMLQKVTLDIKSLSAEKYDVSSQVISQLKQKLENLQNS ------------------------------------------------------------ QLPESFRVPYDPGLKAGALAIEKCKVMASKKKPLWLEFKCADPTALSNETIGIIFKHGDD -------1111--------3333------------------1111--------------- LRQDMLILQILRIMESIWETESLDLCLLPYGCISTGDKIGMIEIVKDATTIAKIQQSTVG -----------------------------------2222--------------------- NTGFKDEVLNHWLKEKSPTEEKFQAAVERFVYSCAGYCVATFVLGIGDRHNDNIMITETG ----1111-----------------------------------------1111---1111 NLFHIDFGHERVPFVLTPDFLFVMGTSGKKTSPHFQKFQDICVKAYLALRHHTNLLIILF ----------------33333333------------------------------------ SMMLMTGMPQLTSKEDIEYIRDALTVGKNEEDAKKYFLDQIEVCRDKGWTVQFNWFLHLV -------------3333------------------------------------------- L - >HEAT SHOCK PROTEIN HSLV; SWP:P31059; PDB:1E94A; TTIVSVRRNGHVVIAGDGQATLGNTVMKGNVKKVRRLYNDKVIAGFAGGTADAFTLFELF 
-------------------------------------iiii------------------- ERKLEMHQGHLVKAAVELAKDWRTDRMLRKLEALLAVADETASLIITGNGDVVQPENDLI --3333iiii----------------3333--------3333----1111---------- AIGSGGPYAQAAARALLENTELSAREIAEKALDIAGDICIYTNHFHTIEELSYK --1111---------1111-------------------1111------------ >INORGANIC PYROPHOSPHATASE; SWP:P00817; PDB:1E9GA; TYTTRQIGAKNTLEYKVYIEKDGKPVSAFHDIPLYADKENNIFNMVVEIPRWTNAKLEIT -----------1111-----iiii--3333-------1111--------2222------- KEETLNPIIQDTKKGKLRFVRNCFPHHGYIHNYGAFPQTWEDPNVSHPETKAVGDNDPID --2222------iiii-------------------------------------------- VLEIGETIAYTGQVKQVKALGIMALLDEGETDWKVIAIDINDPLAPKLNDIEDVEKYFPG ---------2222-------------iiii--------1111-3333--3333----222 LLRATNEWFRIYKIPDGKPENQFAFSGEAKNKKYALDIIKETHDSWKQLIAGKSSDSKGI 2---------1111---------%%%%---------------------1111----iiii DLTNVTLPDTPTYSKAASDAIPPASLKADAPIDKSIDKWFFISG ------1111-----3333------------------------- >CAMP SPECIFIC PHOSPHODIES; SWP:CAC03757; PDB:1E9KA; VPEVDNPHCPNPWLNEDLVKSLRENLLQHEKSKTARKS -------------3333--------------------- >CONJUGAL TRANSFER PROTEIN; SWP:Q04230; PDB:1E9RA; VGQGEFGGAPFKRFLRGTRIVSGGKLKRMTREKAKQVTVAGVPMPRDAEPRHLLVNGATG --------------------------------------iiii--1111---------222 TGKSVLLRELAYTGLLRGDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRN 2-------------1111-----------------1111---1111------3333---1 DYDWQRYALSVVPRGKTDEAEEWASYGRLLLRETAKKLALIGTPSMRELFHWTTIATFDD 111-------------3333---------------------------------------- LRGFLEGTLAESLFAGSNEASKALTSARFVLSDKLPEHVTMPDGDFSIRSWLEDPNGGNL ----2222--3333---------------------3333--------------1111--- FITWREDMGPALRPLISAWVDVVCTSILSLPEEPKRRLWLFIDELASLEKLASLADALTK ----1111-1111-------------1111--1111-------1111------------- GRKAGLRVVAGLQSTSQLDDVYGVKEAQTLRASFRSLVVLGGSRTDPKTNEDMSLSLGEH 3333--------------------------1111--------1111-------------- EVERDRALERVRERVVMPAEIANLPDLTAYVGFAGNRPIAKVPLEIKQFANRQPAFVEGT ----------------33331111------------------------------------ >INTESTINAL TREFOIL FACTOR; SWP:Q07654; 
PDB:1E9TA; EEYVGLSANQCAVPAKDRVDCGYPHVTPKECNNRGCCFDSRIPGVPWCFKPLQEAECTF ----------------------------3333---------3333-------------- >UREASE ALPHA SUBUNIT; SWP:P14916; PDB:1E9ZA; MKLTPKELDKLMLHYAGELAKKRKEKGIKLNYVEAVALISAHIMEEARAGKKTAAELMQE -----------------------1111-------------------3333---------- GRTLLKPDDVMDGVASMIHEVGIEAMFPDGTKLVTVHTPIEANGKLVPGELFLKNEDITI 1111-1111---3333----------1111----------------2222---------- NEGKKAVSVKVKNVGDRPVQIGSHFHFFEVNRCLDFDREKTFGKRLDIAAGTAVRFEPGE -------------------------3333-------3333-------------------- EKSVELIDIGGNRRIFGFNALVDRQADNESKKIALHRAKERGFHGAKSDDNYVKTIKE ---------!!!!-----------------------------2222------------ >UREASE ALPHA SUBUNIT; SWP:P14917; PDB:1E9ZB; MKKISRKEYVSMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSNN ---------------2222---!!!!-------------------------2222----- PSKEELDLIITNALIVDYTGIYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEA ----------------1111--------%%%%--------3333----3333--1111-- LAGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGGTGPADGTNATTITPGRRNLK ------------------------------------------------------------ WMLRAAEEYSMNLGFLAKGNASNDASLADQIEAGAIGFIHEDWGTTPSAINHALDVADKY ----3333----------------3333--1111-----3333--------------111 DVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTN 1--------------3333----iiii-----3333---------3333-3333------ PTIPFTVNTEAEHMDMLMVCHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGAFSITSS -----1111------3333--------33333333-----3333---------------- DSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNFRIKRYLSKYTINPAIAHGISE ------1111------------------1111---------------------------- YVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFAH -----------------------------%%%%-------1111---------------- HGKAKYDANITFVSQAAYDKGIKEELGLERQVLPVKNCRNVTKKDMQFNNTTAHIEVNPE --3333-----------------------------------3333--------------- TYHVFVDGKEVTSKPANKVSLAQLFSIF ---------------------3333--- >GLUTAMATE SYNTHASE [NADPH; SWP:Q05755; PDB:1EA0A; CGVGFIAAIDGKPRRSVVEKGIEALKAVWHRGAVDADGKTGDGAGIHVAVPQKFFKDHVK 
-------3333---3333-----11113333---3333---------------------1 VIGHRAPDNKLAVGQVFLPRISLDAQEACRCIVETEILAFGYYIYGWRQVPINVDIIGEK 111------------------3333------------1111------------1111333 ANATRPEIEQIIVGNNKGVSDEQFELDLYIIRRRIEKAVKGEQINDFYICSLSARSIIYK 31111---------1111--3333------------------------------------ GMFLAEQLTTFYPDLLDERFESDFAIYHQRYSTNTFPTWPLAQPFRMLAHNGEINTVKGN ---3333-333333333333-----------------3333---1111-----1111--- VNWMKAHETRMEHPAFGTHMQDLKPVIGVGLSDSGSLDTVFEVMVRAGRTAPMVKMMLVP -----3333---3333----------------------------1111-3333------- QALTTTPDNHKALIQYCNSVMEPWDGPAALAMTDGRWVVGGMDRNGLRPMRYTITTDGLI --------3333----3333----------------------------------1111-- IGGSETGMVKIDETQVIEKGRLGPGEMIAVDLQSGKLYRDRELKDHLATLKPWDKWVQNT -----------3333-------2222-----1111----------3333----------- THLDELVKTASLKGEPSDMDKAELRRRQQAFGLTMEDMELILHPMVEDGKEAIGSMGDDS ---3333--3333------------------------------3333------------- PIAVLSDKYRGLHHFFRQNFSQVTNPPIDSLRERRVMSLKTRLGNLGNILDEDETQTRLL -3333-----3333------------------3333-----------3333--------- QLESPVLTTAEFRAMRDYMGDTAAEIDATFPVDGGPEALRDALRRIRQETEDAVRGGATH -------------------3333-----------1111---------------------- VILTDEAMGPARAAIPAILATGAVHTHLIRSNLRTFTSLNVRTAEGLDTHYFAVLIGVGA ----11111111--------------------3333------------------------ TTVNAYLAQEAIAERHRRGLFGSMPLEKGMANYKKAIDDGLLKIMSKMGISVISSYRGGG ------------------1111-----------------------------33332222- NFEAIGLSRALVAEHFPAMVSRISGIGLNGIQKKVLEQHATAYNEEVVALPVGGFYRFRK --------------------------3333--------3333------------------ SGDRHGWEGGVIHTLQQAVTNDSYTTFKKYSEQVNKRPPMQLRDLLELRSTKAPVPVDEV ----------------------------------------3333-----------3333- ESITAIRKRFITPGMSMGALSPEAHGTLNVAMNRIGAKSDSGEGGEDPARFRPDKNGDNW -3333---------------------------1111----------3333---1111--- NSAIKQVASGRFGVTAEYLNQCRELEIKVAQGAKPGEGGQLPGFKVTEMIARLRHSTPGV --------------3333------------3333----------------------2222 MLISPPPHHDIYSIEDLAQLIYDLKQINPDAKVTVKLVSRSGIGTIAAGVAKANADIILI 
------------3333-----------1111----------------------------- SGNSGGTGASPQTSIKFAGLPWEMGLSEVHQVLTLNRLRHRVRLRTDGGLKTGRDIVIAA -1111---------------3333-------3333----------------3333----- MLGAEEFGIGTASLIAMGCIMVRQCHSNTCPVGVCVQDDKLRQKFVGTPEKVVNLFTFLA ---------3333----------1111----------3333------3333--------- EEVREILAGLGFRSLNEVIGRTDLLHQVDLDLNPRLAQVDPGGRNEVPDTLDARIVADAR ------3333---3333---3333--------3333-------------3333------- PLFEEGEKMQLAYNARNTQRAIGTRLSSMVTRKFGMFGLQPGHITIRLRGTAGQSLGAFA ---------------1111--------------------2222----------------- VQGIKLEVMGDANDYVGKGLSGGTIVVRPTTSSPLETNKNTIIGNTVLYGATAGKLFAAG ----------------2222---------1111--------------2222--------- QAGERFAVRNSGATVVVEGCGSNGCEYMTGGTAVILGRVGDNFAAGMTGGMAYVYDLDDS -------------------------------------------2222------------3 LPLYINDESVIFQRIEVGHYESQLKHLIEEHVTETQSRFAAEILNDWAREVTKFWQVVPK 333-----------------------------------------------1111------ EMLNRLEVPVHL ------------ >ACETYLCHOLINESTERASE; SWP:P04058; PDB:1EA5A; SELLVNTKSGKVMGTRVPVLSSHISAFLGIPFAEPPVGNMRFRRPEPKKPWSGVWNASTY 1111--1111--------%%%%--------------!!!!-------------------- PNNCQQYVDEQFPGFSGSEMWNPNREMSEDCLYLNIWVPSPRPKSTTVMVWIYGGGFYSG -----------2222---1111--------------------------------iiii-- SSTLDVYNGKYLAYTEEVVLVSLSYRVGAFGFLALHGSQEAPGNVGLLDQRMALQWVHDN ---3333------1111-----------1111---------------------------3 IQFFGGDPKTVTIFGESAGGASVGMHILSPGSRDLFRRAILQSGSPNCPWASVSVAEGRR 333---1111------------------11111111--------1111------------ RAVELGRNLNCNLNSDEELIHCLREKKPQELIDVEWNVLPFDSIFRFSFVPVIDGEFFPT -----3333------------3333-3333---1111----------------------- SLESMLNSGNFKKTQILLGVNKDEGSFFLLYGAPGFSKDSESKISREDFMSGVKLSVPHA ----------------------1111------22221111----------------1111 NDLGLDAVTLQYTDWMDDNNGIKNRDGLDDIVGDHNVICPLMHFVNKYTKFGNGTYLYFF -------------1111-----------------------------3333---------- NHRASNLVWPEWMGVIHGYEIEFVFGLPLVKELNYTAEEEALSRRIMHYWATFAKTGNPN ---1111--3333--2222------33333333--------------------------- 
EPHSQESKWPLFTTKEQKFIDLNTEPMKVHQRLRVQMCVFWNQFLPKLLNAT ------------3333------------------------------------ >PMS1 PROTEIN HOMOLOG 2; SWP:P54278; PDB:1EA6A; CSGQVVLSLSTAVKELVENSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLE --------------------1111---------iiii------------11111111--- ALSSLCALSDVTISTCHASAKVGTRLMFDHNGKIIQKTPYPRPRGTTVSVQQLFSTLPVR -----1111-------3333--------1111---------------------1111--- HKEFQRNIKKEYAKMVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTGGSPSIKENIGS ------------------------------------!!!!-------------------- VFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQCTHGVGRSSTDRQF ------1111-------------1111-3333---------------------------- FFINRRPCDPAKVCRLVNEVYHMYNRHQYPFVVLNISVDSECVLLQEEKLLLAVLKTSLI --%%%%------------------1111----------3333------------------ GMFD ---- >SERINE PROTEASE; SWP:Q9S3L6; PDB:1EA7A; RASQQIPWGIKAIYNNDTLTSTTGGSGINIAVLDTGVNTSHPDLVNNVEQCKDFTGATTP ---------------1111-----2222---------1111--3333------------- INNSCTDRNGHGTHVAGTALADGGSDQAGIYGVAPDADLWAYKVLLDSGSGYSDDIAAAI -----------------------1111------1111--------3333----------- RHAADQATATGTKTIISMSLGSSANNSLISSAVNYAYSKGVLIVAAAGNSGYSQGTIGYP ------------------------------------1111-------------------1 GALPNAIAVAALENVQQNGTYRVADYSSRGYISTAGDYVIQEGDIEISAPGSSVYSTWYN 111-------------iiii---3333---3333------2222-------------111 GGYNTISGTSMATPHVSGLAAKIWAENPSLSNTQLRSNLQERAKSVDIKGGYGAAIGDDY 1------3333---------------------------------------2222------ ASGFGFARVQ ---------- >Cyclomaltodextrinase; SWP:Q59226; PDB:1EA9C; MFLEAVYHRPRKNFSYAYNGTTVHLRIRTKKDDMTAVYALAGDKYMWDHTMEYVPMTKLA -1111-----!!!!---------------------------------------------- TDELFDYWECEVTPPYRRVKYGFLLQQGHEKRWMTEYDFLTEPPANPDRLFEYPFINPVD ----------------------------------------------1111---------- VFQPPAWVKDAIFYQIFPERFANGDTRNDPEGTLPWGSADPTPSCFFGGDLQGVIDHLDH -----3333-------3333------------------------------3333--3333 LSKLGVNAVYFTPLFKATTNHKYDTEDYFQIDPQFGDKDTLKKLVDLCHERGIRVLLDAV -------------------------------3333------------------------- 
FNHSGRTFPPFVDVLKNGEKSKYKDWFHIRSLPLEVVDGIPTYDTFAFEPLMPKLNTEHP ----3333--------------1111----------------------1111---33333 DVKEYLLKAAEYWIRETGIDGWRLDVANEVSHQFWREFRRVVKQANPDAYILGEVWHESS 333-----------------------------3333----------------------33 IWLEGDQFDAVMNYPFTNAVLDFFIHQIADAEKFSFMLGKQLAGYPRQASEVMFNLLDSH 33----------3333---1111------------------------3333-------11 DTARLLTQADGDKRKMKLAVLFQFTYFGTPCIYYGDEVGLDGGHDPGCRKCMEWDETKHD 11---3333--------------------------1111----3333------------3 KDLFAFYQTVIRLRQAHAALRTGTFKFLTAEKNSRQIAYLREDDQDTILVVMNNDKAGHT 333--------------3333--------------------------------------- LTLPVRHAQWTHLWQDDVLTAAHGQLTVKLPAYGFAVLKASSD ------------------------------------------- >ASPARTIC PROTEINASE (SAP2; SWP:P43097; PDB:1EAGA; QAVPVTLHNEQVTYAADITVGSNNQKLNVIVDTGSSDLWVPDVNVDCQVTYSDQTADFCK --------------------1111-----------------1111-----11111111-- QKGTYDPSGSSASQDLNTPFKIGYGDGSSSQGTLYKDTVGFGGVSIKNQVLADVDSTSID -----33331111----------1111-------------iiii---------------- QGILGVGYKTNEAGGSYDNVPVTLKKQGVIAKNAYSLYLNSPDAATGQIIFGGVDNAKYS -------1111-----------------------------1111----------1111-- GSLIALPVTSDRELRISLGSVEVSGKTINTDNVDVLLDSGTTITYLQQDLADQIIKAFNG ----------------------iiii---%%%%----1111-----3333-----1111- KLTQDSNGNSFYEVDCNLSGDVVFNFSKNAKISVPASEFAASLDGQPYDKCQLLFDVNDA ----3333------------------%%%%------1111------1111---------- NILGDNFLRSAYIVYDLDDNEISLAQVKYTSASSISALT ---33331111-----1111------------------- >Genome polyprotein; SWP:P06210; PDB:1EAH1; ANNLPDTQSSGPAHSKETPALTAVETGATNPLVPSDTVQTRHVIQKRTRSESTVESFFAR -----------------3333-----------3333------------1111-------- GACVAIIEVDNDSKLFSVWKITYKDTVQLRRKLEFFTYSRFDMEFTFVVTSNYTDANNGH ------------------------------------------------------------ ALNQVYQIMYIPPGAPIPGKWNDYTWQTSSNPSVFYTYGAPPARISVPYVGIANAYSHFY ------------------------------------2222-------------------- DGFAKVPLAGQASTEGDSLYGAASLNDFGSLAVRVVNDHNPTKLTSKIRVYMKPKHVRVW -----------3333---2222-------------------------------------- 
CPRPPRAVPYYGPGVDYKDGLAPLPGKGLTTY ---------------------------1111- >Genome polyprotein; SWP:P06210; PDB:1EAH2; SVRVMQLTLGNSTITTQEAANSVVAYGRWPEYIKDSEANPVDQPTEPDVAACRFYTLDTV 1111----!!!!-----------2222------1111---------!!!!---------- TWRKESRGWWWKLPDALKDMGLFGQNMFYHYLGRAGYTVHVQCNASKFHQGALGVFAVPE --1111----------1111-------------------------1111----------- MCLAGDSTTHMFTKYENANPGEKGGEFKGSFTLDTNATNPARNFCPVDYLFGSGVLAGNA -------------3333---3333-----------3333-------3333-----33331 FVYPHQIINLRTNNCATLVLPYVNSLSIDSMTKHNNWGIAILPLAPLDFATESSTEIPIT 111-----1111-----------------3333--------------------------- LTIAPMCCEFNGLRNITVPRTQ ---------------------- >Genome polyprotein; SWP:P06210; PDB:1EAH3; GLPVLNTPGSNQYLTADNYQSPCAIPEFDVTPPIDIPGEVRNMMELAEIDTMIPLNLTNQ ------2222---1111----------------------------1111--------333 RKNTMDMYRVELNDAAHSDTPILCLSLSPASDPRLAHTMLGEILNYYTHWAGSLKFTFLF 3--1111--------------------1111---1111---------------------- CGSMMATGKLLVSYAPPGAEAPKSRKEAMLGTHVIWDIGLQSSCTMVVPWISNTTYRQTI --1111-----------------33331111----------------------------- NDSFTEGGYISMFYQTRVVVPLSTPRKMDILGFVSACNDFSVRLLRDTTHISQEA -3333-------------------------------1111--------------- >Chymotrypsin/elastase iso; SWP:P07851; PDB:1EAIC; GQESCGPNEVWTECTGCEMKCGPDENTPCPLMCRRPSCECSPGRGMRRTNDGKCIPASQC -11112222-------------------------------3333----1111---3333- P - >COXSACKIE VIRUS AND ADENO; SWP:P78310; PDB:1EAJA; FARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQVIILYS ----------------2222----------1111-----------1111----------% GDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHL %%%--33332222--------3333----------3333--------------------- VVLV ---- >72 KDA TYPE IV COLLAGENAS; SWP:P08253; PDB:1EAKA; SPIIKFPGDVAPKTDKELAVQYLNTFYGCPKESCNLFVLKDTLKKMQKFFGLPQTGDLDQ -----------------------------1111-------------------------33 NTIETMRKPRCGNPDVANYNFFPRKPKWDKNQITYRIIGYTPDLDPETVDDAFARAFQVW 33--3333--------------------------------33333333-----------3 
SDVTPLRFSRIHDGEADIMINFGRWEHGDGYPFDGKDGLLAHAFAPGTGVGGDSHFDDDE 333--------------------------------------------!!!!--------- LWTLGEGQVVRVKYGNADGEYCKFPFLFNGKEYNSCTDTGRSDGFLWCSTTYNFEKDGKY ----------------2222-------%%%%------2222-----------3333---- GFCPHEALFTMGGNAEGQPCKFPFRFQGTSYDSCTTEGRTDGYRWCGTTEDYDRDKKYGF ----1111------iiii-------!!!!------2222-----------3333------ CPETAMSTVGGNSEGAPCVFPFTFLGNKYESCTSAGRSDGKMWCATTANYDDDRKWGFCP ------------iiii-------iiii------2222----------------------- DQGYSLFLVAAHQFGHAMGLEHSQDPGALMAPIYTYTKNFRLSQDDIKGIQELYGASPDI ---------------1111-----1111-------------------------------- D - >ILEAL LIPID BINDING PROTE; SWP:P10289; PDB:1EAL; AFTGKYEIESEKNYDEFMKRLALPSDAIDKARNLKIISEVKQDGQNFTWSQQYPGGHSIT ------------------------------------------------------------ NTFTIGKECDIETIGGKKFKATVQMEGGKVVVNSPNYHHTAEIVDGKLVEVSTVGGVSYE -------------------------------------------%%%%------------- RVSKKLA ------- >IGG2B-KAPPA 17E8 FAB (HEA; SWP:NA; PDB:1EAPB; EVQLQESGTELVKPGASVKISCKASGYISTDHAIHWVKQRPEQGLEWIGYISPGNGDIKY ------------2222-----------1111----------------------------- NEKFKVKATLTADQSSSTAYMQLNSLTSEDSAVYFCKRSYYGSSYVDYWGQGTTLTVSSA 3333--------3333----------3333------------------------------ KTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGGLSSSVHTFPALLQSGL ---------------------------------------%%%%-------------iiii YTMSSSVTVPGGGWPSATVTCSVAHPASSTTVDKKL ------------------------------------ >RUNT-RELATED TRANSCRIPTIO; SWP:Q03347; PDB:1EAQA; SVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVAGNDE -------2222-----1111---------2222--------------2222------111 NYSAELRNATAAKNQVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKITVDG 1-----------%%%%------------1111---------------------------- P - >UREASE ACCESSORY PROTEIN ; SWP:P50049; PDB:1EARA; MVITKIVGHIDDLSHQIKKVDWLEVEWEDLNKRILRKETENGTDIAIKLENSGTLRYGDV -----------3333----------3333---------1111-------------2222- LYESDDTLIAIRTKLEKVYVIKPQTMQEMGKMAFEIGNRHTMCIIEDDEILVRYDKTLEK 
------------------------------------1111-------------------- LIDEVGVSYEQSERRFKEPFKY ---------------------- >SUPPRESSOR OF TUMORIGENIC; SWP:Q9Y5Y6; PDB:1EAWA; VVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRFRYSD -------22221111-------------------------3333--------- >SUPPRESSOR OF TUMORIGENIC; SWP:Q9Y5Y6; PDB:1EAXA; VVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTA -------22221111----2222---------1111---3333---111111111111-- FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPD -----1111--2222----------1111--------------------1111------1 ASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFL 111--2222----------2222----------------------2222-1111----11 SGGVDSCQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTG 11----2222--------1111---------------2222------3333--------- V - >Chemotaxis protein cheA; SWP:P07363; PDB:1EAYC; PRRIILSRLKAGEVDLLEEELGHLTTLTDVVKGADSLSAILPGDIAEDDITAVLCFVIEA ---------2222--------------------------------3333----3333-11 DQITFET 11----- >TANDEM PH DOMAIN CONTAINI; SWP:Q9HB21; PDB:1EAZA; SAVIKAGYCVKQGAVMKNWKRRYFQLDENTIGYFKSELEKEPLRVIPLKEVHKVQECKQS -----------------------------------3333-------3333------1111 DIMMRDNLFEIVTTSRTFYVQADSPEEMHSWIKAVSGAIVA 1111--------1111-----------------------11 >NEUTRAL PROTEASE II; SWP:P46076; PDB:1EB6A; TEVTDCKGDAESSLTTALSNAAKLANQAAEAAESGDESKFEEYFKTTDQQTRTTVAERLR ------------------------------------------------------------ AVAKEAGSTSGGSTTYHCNDPYGYCEPNVLAYTLPSKNEIANCDIYYSELPPLAQKCHAQ -------------------------2222----3333------3333--------2222- DQATTTLHEFTHAPGVYQPGTEDLGYGYDAATQLSAQDALNNADSYALYANAIELKC ------------3333--------------1111----------------------- >CYTOCHROME C551 PEROXIDAS; SWP:P14532; PDB:1EB7A; DALHDQASALFKPIPEQVTELRGQPISEQQRELGKKLFFDPRLSRSHVLSCNTCHNVGTG --------------------%%%%---------------11111111--3333--1111- GADNVPTSVGHGWQKGPRNSPTVFNAVFNAAQFWDGRAKDLGEQAKGPIQNSVEMHSTPQ ----------3333--------2222----------3333-------------------- 
LVEQTLGSIPEYVDAFRKAFPKAGKPVSFDNMALAIEAYEATLVTPDSPFDLYLKGDDKA --------------------------------------------------------1111 LDAQQKKGLKAFMDSGCSACHNGINLGGQAYFPFGLVKKPDADKGRFAVTKTQSDEYVFR ---------------3333---1111---------------------------------- AAPLRNVALTAPYFHSGQVWELKDAVAIMGNAQLGKQLAPDDVENIVAFLHSLSGKQPRV ---2222------1111---------------------3333--------1111------ EYPLLPASTETTPRPAE --------1111----- >DIHYDROLIPOAMIDE DEHYDROG; SWP:P11959; PDB:1EBDA; AIETETLVVGAGPGGYVAAIRAAQLGQKVTIVEKGNLGGVCLNVGCIPSKALISASHRYE -----------3333--------------------------------------------- QAKHSEEMGIKAENVTIDFAKVQEWKASVVKKLTGGVEGLLKGNKVEIVKGEAYFVDANT ----3333---------------------------------------------------- VRVVNGDSAQTYTFKNAIIATGSRPIELPNFKFSNRILDSTGALNLGEVPKSLVVIGGGY ----!!!!----------------------------------1111-------------- IGIELGTAYANFGTKVTILEGAGEILSGFEKQMAAIIKKRLKKKGVEVVTNALAKGAEER -----------------------------3333--------------------------- EDGVTVTYEANGETKTIDADYVLVTVGRRPNTDELGLEQIGIKMTNRGLIEVDQQCRTSV ---------%%%%----------------------3333-----1111----1111---1 PNIFAIGDIVPGPALAHKASYEGKVAAEAIAGHPSAVDYVAIPAVVFSDPECASVGYFEQ 111---1111-------------------------------------------------3 QAKDEGIDVIAAKFPFAANGRALALNDTDGFLKLVVRKEDGVIIGAQIIGPNASDMIAEL 333-----------3333-------------------------------2222------- GLAIEAGMTAEDIALTIHAHPTLGEIAMEAAEVAL ---1111-3333---------3333---------- >HOMOSERINE DEHYDROGENASE; SWP:P31116; PDB:1EBFA; STKVVNVAVIGAGVVGSAFLDQLLAMKSTITYNLVLLAEAERSLISKDFSPLNVGSDWKA ------------3333-----------------------------3333--------333 ALAASTTKTLPLDDLIAHLKTSPKPVILVDNTSSAYIAGFYTKFVENGISIATPNKKAFS 3---------3333---3333------------3333-------1111--------1111 SDLATWKALFSNKPTNGFVYHEATVGAGLPIISFLREIIQTGDEVEKIEGIFSGTLSYIF ------------2222----1111-!!!!------------------------------- NEFSTSQANDVKFSDVVKVAKKLGYTEPDPRDDLNGLDVARKVTIVGRISGVEVESPTSF ------------------------------------------------------------ PVQSLIPKPLESVKSADEFLEKLSDYDKDLTQLKKEAATENKVLRFIGKVDVATKSVSVG 
---------------33331111----------------------------1111----- IEKYDYSHPFASLKGSDNVISIKTKRYTNPVVIQGAGAGAAVTAAGVLGDVIKIAQRL ----33331111------------------------------------------3333 >GLUCARATE DEHYDRATASE; SWP:P76637; PDB:1EC7A; FTTPVVTEMQVIPVAGHDSMLMNLSGAHAPFFTRNIVIIKDNSGHTGVGEIPGGEKIRKT ----------------------1111---------------------------------- LEDAIPLVVGKTLGEYKNVLTLVRNTFALRTTIHVVTGIEAAMLDLLGQHLGVNVASLLG ---333322223333--------------------------------------3333-!! DGQQRSEVEMLGYLFFVGNRKATPLPYQSQPDDSCDWYRLRHEEAMTPDAVVRLAEAAYE !!---------------------------1111-33333333------------------ KYGFNDFKLKGGVLAGEEEAESIVALAQRFPQARITLDPNGAWSLNEAIKIGKYLKGSLA --------------3333-----------1111-----iiii------------3333-- YAEDPCGAEQGFSGREVMAEFRRATGLPTATNMIATDWRQMGHTLSLQSVDIPLADPHFW --------iiii-------------------------------------------3333- TMQGSVRVAQMCHEFGLTWGSHSNNHFDISLAMFTHVAAAAPGKITAIDTHWIWQEGNQR -------------------------------------1111---------3333------ LTKEPFEIKGGLVQVPEKPGLGVEIDMDQVMKAHELYQKHGLGARDDAMGMQYLIPGWTF --------iiii---------------------------------33333333------- DNKRPCMVR ----3333- >ERYTHROCRUORIN (AQUO MET); SWP:P02229; PDB:1ECA; LSADQISTVQASFDKVKGDPVGILYAVFKADPSIMAKFTQFAGKDLESIKGTAPFETHAN -----------33331111---------------------222233331111-------- RIVGFFSKIIGELPNIEADVNTFVASHKPRGVTHDQLNNFRAGFVSYMKAHTDFAGAEAA ---------1111-------------1111-----------------------3333--- WGATLDTFFGMIFSKM ----------3333-- >ENDOCELLULASE E1; SWP:P54583; PDB:1ECEA; AGGGYWHTSGREILDANNVPVRIAGINWFGFETCNYVVHGLWSRDYRSMLDQIKSLGYNT --------!!!!--1111-----------1111--------------------------- IRLPYSDDILKPGTMPNSINFYQMNQDLQGLTSLQVMDKIVAYAGQIGLRIILDRHRPDC -----3333-2222----------1111------------------------------33 SGQSALWYTSSVSEATWISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRL 33------1111---------------2222---------------------1111---- AAERAGNAVLSVNPNLLIFVEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDY --------3333-------------iiii--2222-1111-------------------- 
ATSVYPQTWFSDPTFPNNMPGIWNKNWGYLFNQNIAPVWLGEFGTTLQSTTDQTWLKTLV 3333--3333-11113333-----------1111-------------------------- QYLRPTAQYGADSFQWTFWSWNPDSGDTGGILKDDWQTVDTVKDGYLAPIKSSIFDPV ----3333!!!!--------------------1111------33333333-------- >GLUTAMINE PHOSPHORIBOSYLP; SWP:P00496; PDB:1ECFA; CGIVGIAGVMPVNQSIYDALTVLQHRGQDAAGIITIDANNCFRLRKANGLVSDVFEARHM ----------------------3333----------1111---------3333------- QRLQGNMGIGHVRYPTAGSSSASEAQPFYVNSPYGITLAHNGNLTNAHELRKKLFEEKRR ---------------22221111------------------------------------- HINTTSDSEILLNIFASELDNFRHYPLEADNIFAAIAATNRLIRGAYACVAMIIGHGMVA ---------------------------3333---------------------2222---- FRDPNGIRPLVLGKRDIDENRTEYMVASESVALDTLGFDFLRDVAPGEAIYITEEGQLFT -----------------1111------------1111-------2222----1111---- RQCADNPVSNPCLFEYVYFARPDSFIDKISVYSARVNMGTKLGEKIAREWEDLDIDVVIP ------------3333----1111-%%%%--------------------1111------- IPETSCDIALEIARILGKPYRQGFVKNRYVGRTFIMPGQQLRRKSVRRKLNANRAEFRDK --1111---------------------------------------3333---3333---- NVLLVDDSIVRGTTSEQIIEMAREAGAKKVYLASAAPEIRFPNVYGIDMPSATELIAHGR --------------------------------------------------33333333-- EVDEIRQIIGADGLIFQDLNDLIDAVRAENPDIQQFECSVFNGVYVTKDVDQGYLDFLDT -------------------------33333333----3333---1111------------ LRNDDAKAVQRQ ------------ >ECTATOMIN; SWP:P49343; PDB:1ECIA; GVIPKKIWETVCPTVEPWAKKCSGDIATYIKRECGKL ------------3333-3333---------------- >Ectatomin subunit B; SWP:P49344; PDB:1ECIB; WSTIVKLTICPTLKSMAKKCEGSIATMIKKKCDK ---------------3333---3333-------- >ENDO-OXABICYCLIC TRANSITI; SWP:P07022; PDB:1ECMA; NPLLALREKISALDEKLLALLAERRELAVEVGKAKLLSHRPVRDIDRERDLLERLITLGK -----------------------------------3333------------------333 AHHLDAHYITRLFQLIIEDSVLTQQALLQQH 3--------------------------1111 >REPLICATION TERMINATOR PR; SWP:P16525; PDB:1ECRA; DLVDRLNTTFRQMEQELAIFAAHLEQHKLLVARVFSLPEVKKEDEHNPLNRIEVKQHLGN ---------------------3333---------------3333---------------- DAQSLALRHFRHLFIQQQSENRSSKAAVRLPGVLCYQVDNLSQAALVSHIQHINKLKTTF 
-------3333-------1111-------------------------------------- EHIVTVESELPTAARFEWVHRHLPGLITLNAYRTLTVLHDPATLRFGWANKHIIKNLHRD -----3333--1111-------2222-3333--------------------------333 EVLAQLEKSLKSPRSVAPWTREEWQRKLEREYQDIAALPQNAKLKIKRPVKVQPIARVWY 3--------------------------------3333-1111-----------------3 KGDQKQVQHACPTPLIALINRDNGAGVPDVGELLNYDADNVQHRYKPQAQPLRLIIPRLH 333---------------------------------1111---------------3333- LYVAD ----- >BLEOMYCIN RESISTANCE PROT; SWP:P13081; PDB:1ECSA; TDQATPNLPSRDFDSTAAFYERLGFGIVFRDAGWMILQRGDLMLEFFAHPGLDPLASWFS --------------------1111--------------!!!!------11111111---- CCLRLDDLAEFYRQCKSVGIQETSSGYPRIHAPELQGWGGTMAALVDPDGTLLRLIQNEL ---------------1111---------------------------1111---------- >GAG POLYPROTEIN; SWP:Q03859; PDB:1ED1A; SVLSGKKADELEKIRLRPGGKKKYMLKHVVWAANELDRFGLAESLLENKEGCQKILSVLA ----------------3333----3333--------1111------------------33 PLVPTGSENLKSLYNTVCVIWCIHAEEKVKHTEEAKQIVQRHLVVETGTAETMP 331111------------------------------------------------ >Beta-2-microglobulin [Pre; SWP:P07151; PDB:1ED3B; IQKTPQIQVYSRHPPENGKPNFLNCYVSQFHPPQIEIELLKNGKKIPNIEMSDLSFSKDW ---------------2222-------------------------------------1111 SFYILAHTEFTPTETDVYACRVKHVTLKEPKTVTWDRDM -----------------------3333--------1111 >CHITINASE A1; SWP:P20533; PDB:1ED7A; AWQVNTAYTAGQLVTYNGKTYKCLQPHTSLAGWEPSNVPALWQLQ --------2222-------------------------3333---- >ENDOGLUCANASE A; SWP:P17901; PDB:1EDG; MYDASLIPNLQIPQKNIPNNDGMNFVKGLRLGWNLGNTFDAFNGTNITNELDYETSWSGI --3333-------------------------------1111-------33331111---- KTTKQMIDAIKQKGFNTVRIPVSWHPHVSGSDYKISDVWMNRVQEVVNYCIDNKMYVILN --3333----3333---------1111--1111----------------3333------- THHDVDKVKGYFPSSQYMASSKKYITSVWAQIAARFANYDEHLIFEGMNEPRLVGHANEW -------------3333------------------11113333---------2222-111 WPELTNSDVVDSINCINQLNQDFVNTVRATGGKNASRYLMCPGYVASPDGATNDYFRMPN 1-3333------------------------!!!!--------22223333--3333---- DISGNNNKIIVSVHAYCPWNFAGLAMADGGTNAWNINDSKDQSEVTWFMDNIYNKYTSRG 
-2222------------3333---3333------1111------------------1111 IPVIIGECGAVDKNNLKTRVEYMSYYVAQAKARGILCILWDNNNFSGTGELFGFFDRRSC ------------------------------1111-------------------------- QFKFPEIIDGMVKYAFGLIN ---3333------------- >E-CADHERIN; SWP:P09803; PDB:1EDHA; VIPPISCPENEKGEFPKNLVQIKSNRDKETKVFYSITGQGADKPPVGVFIIERETGWLKV -----------------------3333--------------------------------- TQPLDREAIAKYILYSHAVSSNGEAVEDPMEIVITVTDQNDNRPEFTQEVFEGSVAEGAV ----3333-----------1111--------------------------------1111- PGTSVMKVSATDADDDVNTYNAAIAYTIVSQDPELPHKNMFTVNRDTGVISVLTSGLDRE --------------3333---------------------------------------333 SYPTYTLVVQAADLQGEGLSTTAKAVITVKD 3------------iiii-------------- >STAPHYLOCOCCAL PROTEIN A; SWP:P38507; PDB:1EDI; AQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLGEAQKLNDSQAPK ----------------1111--------------3333------------1111-- >Coagulation factor IX [Pr; SWP:P00740; PDB:1EDMB; VDGDQCESNPCLNGGSCKDDINSYECWCPFGFEGKNCEL ---1111----iiii-------------2222-1111-- >BETA-KETO ACYL CARRIER PR; SWP:Q93X62; PDB:1EDOA; SPVVVVTGASRGIGKAIALSLGKAGCKVLVNYARSAKAAEEVSKQIEAYGGQAITFGGDV ---------------------1111---------------------3333---------- SKEADVEAMMKTAIDAWGTIDVVVNNAGITRDTLLIRMKKSQWDEVIDLNLTGVFLCTQA ---------------------------------3333-3333------------------ ATKIMMKKRKGRIINIASVVGLIGNIGQANYAAAKAGVIGFSKTAAREGASRNINVNVVC -----------------3333---2222---------------------1111------- PGFIASDMTAKLGEDMEKKILGTIPLGRTGQPENVAGLVEFLALSPAASYITGQAFTIDG -----33331111------11111111---3333----------3333----------ii GIAI ii-- >CHITINASE A; SWP:O83008; PDB:1EDQA; AAPGKPTIAWGNTKFAIVEVDQAATAYNNLVKVKNAADVSVSWNLWNGDTGTTAKVLLNG -------------------------3333----------------------------iii KEAWSGPSTGSSGTANFKVNKGGRYQMQVALCNADGCTASDATEIVVADTDGSHLAPLKE i-------------------------------1111---------------1111----- PLLEKNKPYKQNSGKVVGSYFVEWGVYGRNFTVDKIPAQNLTHLLYGFIPICGGNGINDS --!!!!---------------1111-3333-3333-3333-------------2222333 LKEIEGSFQALQRSCQGREDFKVSIHDPFAALQKAQKGVTAWDDPYKGNFGQLMALKQAH 
3--2222-------2222-----------------2222-1111---------------1 PDLKILPSIGGWTLSDPFFFMGDKVKRDRFVGSVKEFLQTWKFFDGVDIDWEFPGGKGAN 111-------11113333----------------------3333--------2222---1 PNLGSPQDGETYVLLMKELRAMLDQLSVETGRKYELTSAISAGKDKIDKVAYNVAQNSMD 111-1111----------------------------------3333----3333-1111- HIFLMSYDFYGAFDLKNLGHQTALNAPAWKPDTAYTTVNGVNALLAQGVKPGKIVVGTAM ----------3333------------1111--------------3333-3333------- YGRGWTGVNGYQNNIPFTGTATGPVKGTWENGIVDYRQIAGQFMSGEWQYTYDATAEAPY -----------%%%%1111----------2222-----------!!!!------------ VFKPSTGDLITFDDARSVQAKGKYVLDKQLGGLFSWEIDADNGDILNSMNASLGNSAGVQ ---1111-----------------------------3333----------1111------ >ENDO-BETA-N-ACETYLGLUCOSA; SWP:P04067; PDB:1EDT; KQGPTSVAYVEVNNNSMLNVGKYTLADGGGNAFDVAVIFAANINYDTGTKTAYLHFNENV ----------3333-3333-------------------------------------3333 QRVLDNAVTQIRPLQQQGIKVLLSVLGNHQGAGFANFPSQQAASAFAKQLSDAVAKYGLD -----3333-----1111--------------1111------------------------ GVDFDDEYAEYGNNGTAQPNDSSFVHLVTALRANMPDKIISLYNIGPAASRLSYGGVDVS ---------2222--------------------------------3333---------33 DKFDYAWNPYYGTWQVPGIALPKAQLSPAAVEIGRTSRSTVADLARRTVDEGYGVYLTYN 33-------2222--------3333------2222-------------1111-------- LDGGDRTADVSAFTRELYGSEAVRT ------------------------- >EH DOMAIN BINDING PROTEIN; SWP:O88339; PDB:1EDUA; NIVHNYSEAEIKVREATSNDPWGPSSSLSEIADLTYNVVAFSEISIWKRLNDHGKNWRHV --------------1111------3333------------------3333--!!!!---- YKATLEYLIKTGSERVSQQCKENYAVQTLKDFQYVDRDGKDQGVNVREKAKQLVALLRDE -----3333--------------------------1111--------------------- DRLREERAHALKTKEKLAQTATA ----------------1111--- >ALPHA 1-MACROGLOBULIN; SWP:Q63041; PDB:1EDYA; EAPFTLKVNTLPLNFDKAEHHRKFQIHINVSYIGERPNSNMVIVDVKMVSGFIPVKPSVK ------------------------------------------------2222-------- KLQDQSNIQRTEVNTNHVLIYIEKLTNQTMGFSFAVEQDIPVKNLKPAPVKVYDYYETDE ----1111---------------------------------------------1111--- FAIEEYSAPFSSDS -------------- >5,10-METHYLENETETRAHYDROF; SWP:Q02046; PDB:1EDZA; 
KPGRTILASKVAETFNTEIINNVEEYKKTHNGQGPLLVGFLANNDPAAKMYATWTQKTSE ------3333-------------------------------------------------1 SMGFRYDLRVIEDKDFLEEAIIQANGDDSVNGIMVYFPVFGNAQDQYLQQVVCKEKDVEG 111---------3333-3333--1111--------------3333--1111-11111111 LNHVYYQNLYHNVRYLDKENRLKSILPCTPLAIVKILEFLKIYNNLLPEGNRLYGKKCIV -3333---1111-----------------------------------2222--------- INRSEIVGRPLAALLANDGATVYSVDVNNIQKFTRGESLKLNKHHVEDLGEYSEDLLKKC ---------------1111---------------------------------3333---- SLDSDVVITGVPSENYKFPTEYIKEGAVCINFACTKNFSDDVKEKASLYVPMTGKVTIAM ------------------3333-2222--------------------------------- LLRNMLRLVRNVELSKE ------------1111- >2-PYRONE SYNTHASE; SWP:P48391; PDB:1EE0A; GLATILAIGTATPPNCVAQADYADYYFRVTKSEHMVDLKEKFKRICEKTAIKKRYLALTE -----------------3333------11113333------------------------- DYLQENPTMCEFMAPSLNARQDLVVTGVPMLGKEAAVKAIDEWGLPKSKITHLIFCTTAG ----------2222-------------------------------3333----------- VDMPGADYQLVKLLGLSPSVKRYMLYQQGAAGGTVLRLAKDLAENNKGSRVLIVCSEITA ----------------1111-------------------------2222---------33 ILFHGPNENHLDSLVAQALFGDGAAALIVGSGPHLAVERPIFEIVSTDQTILPDTEKAMK 33----1111-----------------------3333-----------------1111-- LHLREGGLTFQLHRDVPLMVAKNIENAAEKALSPLGITDWNSVFWMVHPGGRAILDQVER ---1111-----1111---------------3333---1111--------3333------ KLNLKEDKLRASRHVLSEYGNLISACVLFIIDEVRKRSMAEGKSTTGEGLDCGVLFGFGP ----1111-------------!!!!---------------------iiii---------- GMTVETVVLRSVRVT --------------- >PECTATE LYASE; SWP:Q9RHW0; PDB:1EE6A; APTVVHETIRVPAGQTFDGKGQTYVANPNTLGDGSQAENQKPIFRLEAGASLKNVVIGAP -----------2222---iiii------------------------2222---------- AADGVHCYGDCTITNVIWEDVGEDALTLKSSGTVNISGGAAYKAYDKVFQINAAGTINIR ------------------------------------------------------------ NFRADDIGKLVRQNGGTTYKVVMNVENCNISRVKDAILRTDSSTSTGRIVNTRYSNVPTL -----------------------------------------1111--------------- FKGFKSGNTTASGNTQY ----2222--------- >MUTM (FPG) PROTEIN; SWP:O50606; PDB:1EE8A; PELPEVETTRRRLRPLVLGQTLRQVVHRDPARYRNTALAEGRRILEVDRRGKFLLFALEG 
------------33332222--------3333--3333-----------!!!!------- GVELVAHLGMTGGFRLEPTPHTRAALVLEGRTLYFHDPRRFGRLFGVRRGDYREIPLLLR ---------------------------1111-----1111-------222211113333- LGPEPLSEAFAFPGFFRGLKESARPLKALLLDQRLAAGVGNIYADEALFRARLSPFRPAR ------3333--------------33331111---2222--------------1111333 SLTEEEARRLYRALREVLAEAVELGGSTLSDQSYRQPDGLPGGFQTRHAVYGREGLPCPA 3--------------------1111---3333---3333----3333--2222------- CGRPVERRVVAGRGTHFCPTCQGEGP -----------------3333----- >THIOL:DISULFIDE INTERCHAN; SWP:P21892; PDB:1EEJA; DDAAIQQTLAKMGIKSSDIQPAPVAGMKTVLTNSGVLYITDDGKHIIQGPMYDVSGTAPV ---------1111----------2222------------1111----------------- NVTNKMLLKQLNALEKEMIVYKAPQEKHVITVFTDITCGYCHKLHEQMADYNALGITVRY -------------3333-----------------1111---------------------- LAFPRQGLDSDAEKEMKAIWCAKDKNKAFDDVMAGKSVAPASCDVDIADHYALGVQLGVS ---1111-----------1111-----------------------3333----------- GTPAVVLSNGTLVPGYQPPKEMKEFLDEHQKMTSGK ------1111-------------------------- >GLUTATHIONE-S-TRANSFERASE; SWP:P78417; PDB:1EEMA; SARSLGKGSAPPGPVPEGSIRIYSMRFCPFAERTRLVLKAKGIRHEVININLKNKPEWFF -----2222------2222-----11113333------1111--------1111----11 KKNPFGLVPVLENSQGQLIYESAITCEYLDEAYPGKKLLPDDPYEKACQKMILELFSKVP 113333------1111-------------------------------------------- SLVGSFIRSQNKEDYAGLKEEFRKEFTKLEEVLTNKKTTFFGGNSISMIDYLIWPWFERL -----1111------------------------------1111---3333---------- EAMKLNECVDHTPKLKLWMAAMKEDPTVSALLTSEKDWQGFLELYLQNSPEACDYGL 111111111111--------------------------------1111---1111-- >INOSINE 5'-MONOPHOSPHATE ; SWP:P49058; PDB:1EEPA; NKITKEALTFDDVSLIPRKSSVLPSEVSLKTQLTKNISLNIPFLSSAMDTVTESQMAIAI --------1111----------3333-------1111----------1111--------- AKEGGIGIIHKNMSIEAQRKEIEKVKTYKDFPNACKDLNNKLRVGAAVSIDIDTIERVEE ------------------------1111--1111--1111-------------------- LVKAHVDILVIDSAHGHSTRIIELIKKIKTKYPNLDLIAGNIVTKEAALDLISVGADCLK -------------------------------1111----------------1111----- VGIGPGSICTTRIVAGVGVPQITAICDVYEACNNTNICIIADGGIRFSGDVVKAIAAGAD 
-----11113333------------------2222-----------3333----3333-- SVMIGNLFAGTKESPSEEIIYNGKKFKSMVPYSGKLKDILTQLKGGLMSGMGYLGAATIS ----3333--3333--------------------3333-------------1111----- DLKINSKFVKISHS -------------- >KAPPA-4 IMMUNOGLOBULIN (L; SWP:P01625; PDB:1EEQA; DIVLTQSPDSLAVSLGERATINCKSSQSVLDSS -------------2222---------------- >ERYTHROPOIETIN; SWP:P01588; PDB:1EERA; APPRLICDSRVLERYLLEAKEAEKITTGCAEHCSLNEKITVPDTKVNFYAWKRMEVGQQA ----1111-----------------1111-----------------333311113333-- VEVWQGLALLSEAVLRGQALLVKSSQPWEPLQLHVDKAVSGLRSLTTLLRALGAQKEAIS ---------------------------3333----------------------------1 NSDAASAAPLRTITADTFRKLFRVYSNFLRGKLKLYTGEACRTGDR 111-----------------------------------3333---- >Erythropoietin receptor [; SWP:P19235; PDB:1EERB; DPKFESKAALLAARGPEELLCFTERLEDLVCFWEEAASAGVGPGQYSFSYQLEDEPWKLC -----------------------------------------1111------2222----- RLHQAPTARGAVRFWCSLPTADTSSFVPLELRVTAASGAPRYHRVIHINEVVLLDAPVGL ------------------3333------------1111--------1111---------- VARLADESGHVVLRWLPPPETPMTSHIRYEVDVSAGQGAGSVQRVEILEGRTECVLSNLR ----------------------3333---------------------------------- GRTRYTFAVRARMAEPSFGGFWSEWSEPVSLLT --------------------------------- >PROPANEDIOL DEHYDRATASE; SWP:Q59470; PDB:1EEXA; MRSKRFEALAKRPVNQDGFVKEWIEEGFIAMESPNDPKPSIKIVNGAVTELDGKPVSDFD -----------3333--------1111-----1111-------iiii-------3333-- LIDHFIARYGINLNRAEEVMAMDSVKLANMLCDPNVKRSEIVPLTTAMTPAKIVEVVSHM -----------3333-----------------1111333333331111--------1111 NVVEMMMAMQKMRARRTPSQQAHVTNVKDNPVQIAADAAEGAWRGFDEQETTVAVARYAP -------------------------1111-------------------------3333-- FNAIALLVGSQVGRPGVLTQCSLEEATELKLGMLGHTCYAETISVYGTEPVFTDGDDTPW -------------------------------1111------------------------- SKGFLASSYASRGLKMRFTSGSGSEVQMGYAEGKSMLYLEARCIYITKAAGVQGLQNGSV ---------1111-------2222-1111-iiii------------------------!! 
SCIGVPSAVPSGIRAVLAENLICSSLDLECASSNDQTFTHSDMRRTARLLMQFLPGTDFI !!--33332222-----------1111--------------------------------- SSGYSAVPNYDNMFAGSNEDAEDFDDYNVIQRDLKVDGGLRPVREEDVIAIRNKAARALQ -------3333--------1111------------------------------------- AVFAGMGLPPITDEEVEAATYAHGSKDMPERNIVEDIKFAQEIINKNRNGLEVVKALAQG -----------------------3333--------------------------------- GFTDVAQDMLNIQKAKLTGDYLHTSAIIVGDGQVLSAVNDVNDYAGPATGYRLQGERWEE ------------3333--11112222---------3333------2222----------- IKNIPGALDPN ---2222---- >Diol dehydrase beta subun; SWP:Q59471; PDB:1EEXB; GFLTEVGEARQGTQQDEVIIAVGPAFGLAQTVNIVGIPHKSILREVIAGIEEEGIKARVI ----------------------1111------1111--------------1111------ RCFKSSDVAFVAVEGNRLSGSGISIGIQSKGTTVIHQQGLPPLSNLELFPQAPLLTLETY ------------------1111-----3333---------1111------3333------ RQIGKNAARYAKRESPQPVPTLNDQMARPKYQAKSAILHIKETKYVVTGKNPQELRVA -----------------------11113333-----------11112222-------- >Diol dehydrase gamma subu; SWP:Q59472; PDB:1EEXG; SARVSDYPLANKHPEWVKTATNKTLDDFTLENVLSNKVTAQDMRITPETLRLQASIAKDA --3333-3333-3333-------3333-----------3333---------------111 GRDRLAMNFERAAELTAVPDDRILEIYNALRPYRSTKEELLAIADDLESRYQAKICAAFV 1-------------1111------------2222-------------------------- REAATLYVERKKLKGDD -------1111-2222- >Moesin; SWP:P26038; PDB:1EF1C; AEASADLRADAMAKDRSEEERTTEAEKNERVQKHLKALTSELANARDESKKTANDIHAEN -------------22221111-3333---------------1111-1111-3333----- RLGRDKYKTLRQIRQGNTKQRIDEFES -----------1111------------ >DNA-DIRECTED RNA POLYMERA; SWP:O26147; PDB:1EF4A; MIPVRCLSCGKPVSAYFNEYQRRVADGEDPKDVLDDLGLKRYCCRRMLISHVETW ------------3333------------33333333----33333333------- >RGL; SWP:Q60695; PDB:1EF5A; EDTCIIRISVEDNNGNMYKSIMLTSQDKTPAVIQRAMSKHNLESDPAEEYELVQVISEDK ------------------------------------------------------------ ELVIPDSANVFYAMNSQVNFDFILRKKN ----------3333-------------- >METHYLMALONYL COA DECARBO; SWP:P52045; PDB:1EF8A; MSYQYVNVVTINKVAVIEFNYGRKLNALSKVFIDDLMQALSDLNRPEIRCIILRAPSGSK 
----------!!!!------3333----------------11111111-------2222- VFSAGHDIHELDPLSYDDPLRQITRMIQKFPKPIISMVEGSVWGGAFEMIMSSDLIIAAS ------3333----1111----------------------------------------11 TSTFSMTPVNLGVPYNLVGIHNLTRDAGFHIVKELIFTASPITAQRALAVGILNHVVEVE 11----3333----------1111---------------------------------333 ELEDFTLQMAHHISEKAPLAIAVIKEELRVLGEAHTMNSDEFERIQGMRRAVYDSEDYQE 3-----------1111--------------1111-------------------------- GMNAFLEKRKPNFVGH ---------------- >ELONGATION FACTOR; SWP:P02990; PDB:1EFCA; TKPHVNVGTIGHVDHGKTTLTAAITTVLAKTYGGAARAFDQIDNAPEEKARGITINTSHV -----------2222----------------------3333--------%%%%------- EYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQVGV ---1111--------3333-------------------3333--3333------------ PYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWEAK --------1111------------------1111-3333----------1111-3333-- ILELAGFLDSYIPEPERAIDKPFLLPIEDVFSISGRGTVVTGRVERGIIKVGEEVEIVGI ----------------3333-----------------------------2222------- KETQKSTCTGVEMFRKLLDEGRAGENVGVLLRGIKREEIERGQVLAKPGTIKPHTKFESE ------------!!!!-----2222---------1111-2222----------------- VYILSKDEGGRHTPFFKGYRPQFYFRTTDVTGTIELPEGVEMVMPGDNIKMVVTLIHPIA ----3333-------1111------------------------2222------------- MDDGLRFAIREGGRTVGAGVVAKVLS -2222-----iiii------------ >Ferrichrome-binding perip; SWP:P07822; PDB:1EFDN; GIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLEL --1111------------1111--------3333---------3333----1111----- LTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA -----------2222------3333----------------------------------- ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ -------------1111---------------1111----1111------1111------ GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPA ---1111----33333333----------------------------------------- VWFYGATLSAMHFVRVLDNAIG ---------------------- >MINI-PROINSULIN; SWP:P30410; PDB:1EFEA; FVNQHLCGSHLVEALYLVCGERGFFYTPKTRRYPGDVKRGIVEQCCTSICSLYQLENYCN 
-------3333-----------------------%%%%---------------------- >Protein Nef; SWP:P04324; PDB:1EFNB; RPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIHSQRRQDILDLWIYHTQGYFPDWQNYTPG --------------------------2222------------------------------ PGVRYPLTFGWCYKLVPVREVLEWRFDSRLAFHHVARELHPEYF ----------------------------1111-3333--3333- >ELECTRON TRANSFER FLAVOPR; SWP:P38974; PDB:1EFPA; AVLLLGEVTNGALNRDATAKAVAAVKALGDVTVLCAGASAKAAAEEAAKIAGVAKVLVAE ------------------------1111-------------------------------- DALYGHRLAEPTAALIVGLAGDYSHIAAPATTDAKNVMPRVAALLDVMVLSDVSAILDAD 3333---3333-------3333--------------------1111-------------- TFERPIYAGNAIQVVKSKDAKKVFTIRTASFDAAGEGGTAPVTETAAAADPGLSSWVADE -----%%%%-----------------1111------------------------------ VAESDRPELTSARRVVSGGRGLGSKESFAIIEELADKLGAAVGASRAAVDSGYAPNDWQV -------3333--------3333-3333-------1111---------------3333-- GQTGKVVAPELYVAVGISGAIQHLAGMKDSKVIVAINKDEEAPIFQIADYGLVGDLFSVV -------------------33331111-----------11111111--------111133 PELTGKL 333333- >Electron transfer flavopr; SWP:P38975; PDB:1EFPB; MKVLVPVKRLIDYNVKARVKSDGSGVDLANVKMSMNPFDEIAVEEAIRLKEKGQAEEIIA -----------1111--------------------------------------------- VSIGVKQAAETLRTALAMGADRAILVVAADDVQQDIEPLAVAKILAAVARAEGTELIIAG ----3333-------3333-----------3333--3333---------1111------- KQAIDNDMNATGQMLAAILGWAQATFASKVEIEGAKAKVTREVDGGLQTIAVSLPAVVTA ------------------------------------------------------------ DLRLNEPRYASLPNIMKAKKKPLDEKTAADYGVDVAPRLEVVSVREPEGRKAGIKVGSVD 1111-------------1111-----3333---------------------------333 ELVGKL 31111- >ELONGATION FACTOR TU; SWP:P02997; PDB:1EFUB; AEITASLVKELRERTGAGMMDCKKALTEANGDIELAIENMRKSGAIKAAKKAGNVAADGV ---3333---------------------iiii------------------1111------ IKTKIDGNYGIILEVNCQTDFVAKDAGFQAFADKVLDAAVAGKITDVEVLKAQFEEERVA -----!!!!---------3333-----------------1111----------------- LVAKIGENINIRRVAALEGDVLGSYQHGARIGVLVAAKGADEELVKHIAMHVAASKPEFI ----------------------------------------3333---------------- 
KPEDVSAEVVEKEYQVQLDIAMQSGKPKEIAEKMVEGRMKKFTGEVSLTGQPFVMEPSKT 1111-----------------------3333---------------------1111---- VGQLLKEHNAEVTGFIRFEVGEGIEKVETDFAAEVAAMSKQS ------------------2222-------------1111--- >ELECTRON TRANSFER FLAVOPR; SWP:P13804; PDB:1EFVA; QSTLVIAEHANDSLAPITLNTITAATRLGGEVSCLVAGTKCDKVAQDLCKVAGIAKVLVA ---------%%%%-3333-------3333---------------------2222------ QHDVYKGLLPEELTPLILATQKQFNYTHICAGASAFGKNLLPRVAAKLEVAPISDIIAIK ----22223333------------------------------------------------ SPDTFVRTIYAGNALCTVKCDEKVKVFSVRGTSFDAAATSGGSASSEKASSTSPVEISEW ---------iiii----------------1111--------------------------- LDQKLTKSDRPELTGAKVVVSGGRGLKSGENFKLLYDLADQLHAAVGASRAAVDAGFVPN -----------3333--------11113333---------------------------11 DMQVGQTGKIVAPELYIAVGISGAIQHLAGMKDSKTIVAINKDPEAPIFQVADYGIVADL 11--1111---------------33331111-----------1111-3333-------33 FKVVPEMTEILK 33------1111 >Electron transfer flavopr; SWP:P38117; PDB:1EFVB; LRVLVAVKRVIDYAVKIRVKPDRTGVVTDGVKHSMNPFCEIAVEEAVRLKEKKLVKEVIA -----------1111----1111----2222------------------1111------- VSCGPAQCQETIRTALAMGADRGIHVEVPPAEAERLGPLQVARVLAKLAEKEKVDLVLLG ----3333-------1111---------33331111------------------------ KQAIDDDCNQTGQMTAGFLDWPQGTFASQVTLEGDKLKVEREIDGGLETLRLKLPAVVTA --------------------------------!!!!------1111-------------- DLRLNEPRYATLPNIMKAKKKKIEVIKPGDLGVDLTSKLSVISVEDPPQRTAGVKVETTE 3333------------3333------3333------------------------------ DLVAKLKEIGRI ------1111-- >HLA-CW3 (HEAVY CHAIN); SWP:NA; PDB:1EFXA; GSHSMRYFYTAVSRPGRGEPHFIAVGYVDDTQFVRFDSDAASPRGEPRAPWVEQEGPEYW ---------------------------!!!!-------------------1111-3333- DRETQKYKRQAQTDRVSLRNLRGYYNQSEAGSHIIQRMYGCDVGPDGRLLRGYDQYAYDG ----------------------1111-----------------1111----------%%% KDYIALNEDLRSWTAADTAAQITQRKWEAAREAEQLRAYLEGLCVEWLRRYLKNGKETLQ %-----1111--------------------------------------------1111-- RAEHPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQWDGEDQTQDTELVETRPAGDGT --------------------------------------iiii------------------ 
FQKWAAVVVPSGEEQRYTCHVQHEGLPEPLTLRWEPSS ------------3333------3333------------ >POLY (ADP-RIBOSE) POLYMER; SWP:P26446; PDB:1EFYA; KSKLAKPIQDLIKMIFDVESMKKAMVEFEIDLQKMPLGKLSKRQIQSAYSILNEVQQAVS -----------------------------------3333--------------------- DGGSESQILDLSNRFYTLIPHDFGMKKPPLLSNLEYIQAKVQMLDNLLDIEVAYSLLRGG ------------------------------------------------------------ NEDGDKDPIDINYEKLRTDIKVVDKDSEEAKIIKQYVKNTHAATHNAYDLKVVEIFRIER -----------------------------------------1111--------------2 EGESQRYKPFKQLHNRQLLWHGSRTTNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMV 222------1111----------3333------------33333333-----------33 SKSANYCHTSQADPIGLILLGEVALGNMYELKNASHITKLPKGKHSVKGLGKTAPDPTAT 33-1111--1111---------------------------2222-----------3333- TTLDGVEVPLGNGISTGINDTCLLYNEYIVYDVAQVNLKYLLKLKFNYKT --iiii-------------------------1111--------------- >ENDOGLUCANASE I; SWP:P07981; PDB:1EG1A; QPGTSTPEVHPKLTTYKCTKSGGCVAQDTSVVLDWNYRWMHDANYNSCTVNGGVNTTLCP ---------------------------------3333----1111----iiii--1111- DEATCGKNCFIEGVDYAASGVTTSGSSLTMNQYMPSSSGGYSSVSPRLYLLDSDGEYVML --------------3333---------------------------------1111----- KLNGQELSFDVDLSALPCGENGSLYLSQMDENGGANQYNTAGANYGSGYCDAQCPVQTWR ------------33332222-----------%%%%1111-!!!!------3333-----i NGTLNTSHQGFCCNEMDILEGNSRANALTPHSCTATACDSAGCGFNPYGSGYKSYYGPGD iii-1111-------------1111--------3333--------3333-------2222 TVDTSKTFTIITQFNTDNGSPSGNLVSITRKYQQNGVDIPSAQPGGDTISSCPSASAYGG --3333---------11113333-------------------2222-----3333----- LATMGKALSSGMVLVFSIWNDNSQYMNWLDSGNAGPCSSTEGNPSNILANNPNTHVVFSN -----------------------%%%%---!!!!---3333---------1111------ IRWGDIGSTT ----2222-- >MODIFICATION METHYLASE RS; SWP:P14751; PDB:1EG2A; GTTRHVYDVCDCLDTLAKLPDDSVQLIICDPPYNIMLADWDDHMDYIGWAKRWLAEAERV ---------------1111----------------3333-----3333------------ LSPTGSIAIFGGLQYQGEAGSGDLISIISHMRQNSKMLLANLIIWNYPNGMSAQRFFANR -1111------------2222-3333---------------------------------- HEEIAWFAKTKKYFFDLDAVREPYDEETKAAYMKDKRLNPESVEKGRNPTNVWRMSRLNG 
---------1111--3333---------------3333---------------------- NSLERVGHPTQKPAAVIERLVRALSHPGSTVLDFFAGSGVTARVAIQEGRNSICTDAAPV -3333--1111-3333---------2222------!!!!------------------333 FKEYYQKQLTFLRSYEIVEGAANFGAALQR 3------3333--------33333333--- >DYSTROPHIN; SWP:P11532; PDB:1EG3A; PASQHFLSTSVQGPWERAISPNKVPYYINHETQTTCWDHPKMTELYQSLADLNNVRFSAY 33331111---!!!!----1111------1111----------------1111------- RTAMKLRRLQKALCLDLLSLSAACDALDQHNLKQNDQPMDILQIINCLTTIYDRLEQEHN -------------1111----------1111--1111----------------------- NLVNVPLCVDMCLNWLLNVYDTGRTGRIRVLSFKTGIISLCKAHLEDKYRYLFKQVASST --------------------1111---------------------------------111 GFCDQRRLGLLLHDSIQIPRQLGEVASFGGSNIEPSVRSCFQFANNKPEIEAALFLDWMR 1-----------------3333-3333--------------1111------------333 LEPQSMVWLPVLHRVAAAET 3-3333-------------- >AMINOTRANSFERASE; SWP:Q9X218; PDB:1EG5A; RVYFDNNATTRVDDRVLEEIVFYREKYGNPNSAHGGIEANLHEKAREKVAKVLGVSPSEI ----1111--------------------1111-----------------------1111- FFTSCATESINWILKTVAETFEKRKRTIITTPIEHKAVLETKYLSKGFKVKYVPVDSRGV ------------------------------1111---------------------1111- VKLEELEKLVDEDTFLVSIAANNEVGTIQPVEDVTRIVKKKNKETLVHVDAVQTIGKIPF ----------1111---------------------------1111-----1111------ SLEKLEVDYASFSAHKFHGPKGVGITYIRKGVPIRPLIHGGGQERGLRSGTQNVPGIVGA -1111-------1111------------2222----------%%%%-------------- ARAEIAVEELSEAAKHEKLRSKLVSGLNLGAHIITPLEISLPNTLSVSFPNIRGSTLQNL -----------------------------------3333-1111----2222-------- LSGYGIYVSTHVLDAGVDRRIAQGAIRISLCKYNTEEEVDYFLKKIEEILSFL ----------3333-------1111-----1111------------------- >FORMYLTETRAHYDROFOLATE SY; SWP:P21164; PDB:1EG7A; DIEIAQAAKMKPVMELARGLGIQEDEVELYGKYKAKISLDVYRRLKDKPDGKLILVTAIT -3333-----------------1111---------------1111--------------- PTPAGEGKTTTSVGLTDALARLGKRVMVCLREPSLGPSFGIKGGAAGGGYAQVVPMEDIN -1111-----------------------------3333---------!!!!--------- LHFTGDIHAVTYAHNLLAAMVDNHLQQGNVLNIDPRTITWRRVIDLNERALRNIVIGLGG ----------------------------1111-1111----------3333--------1 
KANGVPRETGFDISVASEVMACLCLASDLMDLKERFSRKVVGYTYDGKPVTAGDLEAQGS 111---------1111------1111-----------------1111---3333------ MALLMKDAIKPNLVQTLENTPAFIHGGPFANIAHGCNSIIATKTALKLADYVVTEAGFGA --11113333-----1111---------------------------------------33 DLGAEKFYDVKCRYAGFKPDATVIVATVRALKMHGGVPKSDLATENLEALREGFANLEKH 33--------------------------33331111-3333------------------- IENIGKFGVPAVVAINAFPTDTEAELNLLYELCAKAGAEVALSWAKGGEGGLELARKVLQ --3333-----------33333333---------------------3333---------- TLESRPSNFHVLYNLDLSIKDKIAKIATEIYGADGVNYTAEADKAIQRYESLGYGNLPVV -------------1111--------------------------------1111------- MAKTQYSFSDDMTKLGRPRNFTITVREVRLSAGGRLIVPITGAIMTMPGLPKRPAACNID ----------1111---------------------------------------3333--- IDADGVITG --------- >GTP-BINDING PROTEIN ERA; SWP:P06616; PDB:1EGAA; DKSYCGFIAIVGRPNVGKSTLLNKLLGQKISITSRKAQTTRHRIVGIHTEGAYQAIYVDT -------------------------------------------------!!!!------- PGLHMEEKRAINRLMNKAASSSIGDVELVIFVVEGTRWTPDDEMVLNKLREGKAPVILAV -----------------1111--------------------------------------- NKVDNVQEKADLLPHLQFLASQMNFLDIVPISAETGLNVDTIAAIVRKHLPEATHHFPED -3333--3333--------3333--------1111-----------1111---------- YITDRSQRFMASEIIREKLMRFLGAELPYSVTVEIERFVSNERGGYDINGLILVEREGQK ----------------------!!!!--------------1111-----------3333- KMVIGNKGAKIKTIGIEARKDMQEMFEAPVHLELWVKVKSGWADDERALRSL -3333--------------------------------3333-------1111 >MEDIUM CHAIN ACYL-COA DEH; SWP:P11310; PDB:1EGDA; LGFSFEFTEQQKEFQATARKFAREEIIPVAAEYDKTGEYPVPLIRRAWELGLMNTHIPEN -------3333--------------3333------------------------1111111 CGGLGLGTFDACLISEELAYGCTGVQTAIEGNSLGQMPIIIAGNDQQKKKYLGRMTEEPL 1-----------------3333------------------------------3333---- MCAYCVTEPGAGSDVAGIKTKAEKKGDEYIINGQKMWITNGGKANWYFLLARSDPDPKAP -------1111--1111----------------------2222------------11111 ANKAFTGFIVEADTPGIQIGRKELNMGQRCSDTRGIVFEDVKVPKENVLIGDGAGFKVAM 111-------1111-------------1111------------1111---2222------ GAFDKERPVVAAGAVGLAQRALDEATKYALERKTFGKLLVEHQAISFMLAEMAMKVELAR 
---------------------------1111--iiii33333333--------------- MSYQRAAWEVDSGRRNTYYASIAKAFAGDIANQLATDAVQILGGNGFNTEYPVEKLMRDA ---------1111-----------------------------3333-33333333----3 KIYQIYGGTSQIQRLIVAREHIDKYKN 333-------------------1111- >EPIDERMAL GROWTH FACTOR; SWP:P01132; PDB:1EGF; NSYPGCPSSYDGYCLNGGVCMHIESLDSYTCNCVIGYSGDRCQTRDLRWWELR ---------------------------------2222---------------- >MACROPHAGE MANNOSE RECEPT; SWP:P22897; PDB:1EGIA; CPEDWGASSSLCFKLYAKGKHEKKTWFESRDFCRALGGDLASINNKEEQQTIWRLITASG ------------------1111-----------1111-------------------1111 SYHKLFWLGLTYGGFTWSDGSPVSYENWAYGEPNNYQNVEYCGELKGDPTMSWNDINCEH 2222------------3333--------2222-1111----------3333-----1111 LNNWICQIQ --------- >CYTOKINE RECEPTOR COMMON ; SWP:NA; PDB:1EGJH; EVQLQQSGPELVKPGTSVKMSCKASGYTFTDYYMKWVKHSHGKSLEWIGDINP ---------------------------1111---------------------- >GLUTAREDOXIN; SWP:P68688; PDB:1EGO; MQTVIFGRSGCPYCVRAKDLAEKLSNERDDFQYQYVDIRAEGITKEDLQQKAGKPVETVP -----------------------1111---------1111-------------------- QIFVDQQHIGGYTDFAAWVKENLDA ---iiii------------------ >MADS BOX TRANSCRIPTION EN; SWP:Q02078; PDB:1EGWA; GRKKIQITRIMDERNRQVTFTKRKFGLMKKAYELSVLCDCEIALIIFNSSNKLFQYASTD -----------------------------------1111--------1111--------3 MDKVLLKYTEY 333-------- >VASODILATOR-STIMULATED PH; SWP:P50552; PDB:1EGXA; MSETVICSSRATVMLYDDGNKRWLPAGTGPQAFSRVQIYHNPTANSFRVVGRKMQPDQQV ----------------3333--------------------3333----------3333-- VINCAIVRGVKYNQATPNFHQWRDARQVWGLNFGSKEDAAQFAAGMASALEALEG ------------------------------------------------------- >ENDOGLUCANASE Z; SWP:P07103; PDB:1EGZA; SVEPLSVNGNKIYAGEKAKSFAGNSLFWSNNGWGGEKFYTADTVASLKKDWKSSIVRAAM -------!!!!------------------22223333----------------------- GVQESGGYLQDPAGNKAKVERVVDAAIANDMYAIIGWHSHSAENNRSEAIRFFQEMARKY ---2222-------------------1111----------3333---------------1 GNKPNVIYEIYNEPLQVSWSNTIKPYAEAVISAIRAIDPDNLIIVGTPSWSQNVDEASRD 111------------------------------3333-----------%%%%-3333--- 
PINAKNIAYTLHFYAGTHGESLRNKARQALNNGIALFVTEWGTVNADGNGGVNQTETDAW -------------1111---------------------------1111------------ VTFMRDNNISNANWALNDKNEGASTYYPDSKNLTESGKKVKSIIQSWPYKA --------------------3333--------------------------- >RIBOSOME RECYCLING FACTOR; SWP:Q9WX76; PDB:1EH1A; MTLKELYAETRSHMQKSLEVLEHNLAGLRTGRANPALLLHLKVEYYGAHVPLNQIATVTA ------------------------1111-----33333333---iiii--3333------ PDPRTLVVQSWDQNALKAIEKAIRDSDLGLNPSNKGDALYINIPPLTEERRKDLVRAVRQ -1111------3333----------3333------------------------------- YAEEGRVAIRNIRREALDKLKKLAKELHLSEDETKRAEAEIQKITDEFIAKADQLAEKKE ------------------------------------------------------------ QEILG ----- >EPS15; SWP:P42566; PDB:1EH2; PWAVKPEDKAKYDAIFDSLSPVNGFLSGDKVKPVLLNSKLPVDILGRVWELSDIDHDGML --------------------------3333----------------------1111---- DRDEFAVAMFLVYCALEKEPVPMSLPPALVPPSKR -----------1111----------1111-1111- >GLYCOSYLTREHALOSE TREHALO; SWP:Q55088; PDB:1EH9A; TFAYKIDGNEVIFTLWAPYQKSVKLKVLEKGLYEMERDEKGYFTITLNNVKVRDRYKYVL -------------------------------------3333------------------- DDASEIPDPASRYQPEGVHGPSQIIQESKEFNNETFLKKEDLIIYEIHVGTFTPEGTFEG -------1111-----1111--------------------------------33333333 VIRKLDYLKDLGITAIEIMPIAQFPGKRDWGYDGVYLYAVQNSYGGPEGFRKLVDEAHKK ---------------------------------------------------------111 GLGVILDVVYNHVGPEGNYMVKLGPYFSQKYKTPWGLTFNFDDAESDEVRKFILENVEYW 1----------------------------------------------------------- IKEYNVDGFRLDAVHAIIDTSPKHILEEIADVVHKYNRIVIAESDLNDPRVVNPKEKCGY --------------------------------3333------------3333--1111-- NIDAQWVDDFHHSIHAYLTGERQGYYTDFGNLDDIVKSYKDVFVYDGKYSNFRRKTHGEP --------------3333-----3333---3333----------------1111------ VGELDGCNFVVYIQNHDQVGNRGKGERIIKLVDRESYKIAAALYLLSPYIPMIFMGEEYG ------------------1111----3333--3333------1111---------3333- EENPFYFFSDFSDSKLIQGVREGRKKENGQDTDPQDESTFNASKLSWKIDEEIFSFYKIL ------------3333----------------1111---3333----------------- IKMRKELSIACDRRVNVVNGENWLIIKGREYFSLYVFSKSSIEVKYSGTLLLSSNNSFPQ 
------------------------------------------------------------ HIEEGKYEFDKGFALYK ----------------- >D-ALANINE:D-LACTATE LIGAS; SWP:P71454; PDB:1EHIA; KKRVALIFGGNSSEHDVSKRSAQNFYNAIEATGKYEIIVFAIAQNGFFLDTESSKKILAL -----------11113333----------3333---------1111---3333------- EDEQPIVDAFMKTVDASDPLARIHALKSAGDFDIFFPVVHGNLGEDGTLQGLFKLLDKPY ---------1111-1111-3333-1111-------------------3333--------- VGAPLRGHAVSFDKALTKELLTVNGIRNTKYIVVDPESANNWSWDKIVAELGNIVFVKAA ------------3333----3333----------333311113333-------------- NQGSSVGISRVTNAEEYTEALSDSFQYDYKVLIEEAVNGARELEVGVIGNDQPLVSEIGA --%%%%------3333-------------------------------------------- HTVPNQGSGDGWYDYNNKFVDNSAVHFQIPAQLSPEVTKEVKQMALDAYKVLNLRGEARM -------------3333----------------3333----------------------- DFLLDENNVPYLGEPNTLPGFTNMSLFKRLWDYSDINNAKLVDMLIDYGFEDFAQNKKLS ----1111----------------3333-3333--------------------------- >5'-(D(5HT)P*(6-4)T)-3'; SWP:NA; PDB:1EHLH; EVQLQQSGTVLARPGASVKMSCKASGYSFTSFWMHWVKQRPGQGLEWIGTIYPGNSDTSY ------------2222-----------1111--------1111----------------- NQKFKGKAKLTAVTSASTAYMEVSSL 3333--------1111---------- >HEAT-STABLE ENTEROTOXIN B; SWP:P22542; PDB:1EHS; STQSNKKDLCEHYRQIAKESCKKGFLGVRDGTAGACFGAQIMVAAKGC ---1111----------3333------------%%%%---3333---- >NUCLEOSIDE DIPHOSPHATE KI; SWP:O00746; PDB:1EHWA; HMGTRERTLVAVKPDGVQRRLVGDVIQRFERRGFTLVGMKMLQAPESVLAEHYQDLRRKP -!!!!------------------------1111-----------3333-------3333- FYPALIRYMSSGPVVAMVWEGYNVVRASRAMIGHTDSAEAAPGTIRGDFSVHISRNVIHA -------1111---------2222-----------3333--------------------- SDSVEGAQREIQLWFQSSELVSW ---------------3333---- >SCAFFOLDIN PROTEIN; SWP:Q45996; PDB:1EHXA; MQDPTINPTSISAKAGSFADTKITLTPNGNTFNGISELQSSQYTKGTNEVTLLASYLNTL ----------------------------------3333--------------33333333 PENTTKTLTFDFGVGTKNPKLTITVLPKDIPGLE -------------3333----------------- >PROTEIN (SOLUBLE EPOXIDE ; SWP:O31243; PDB:1EHYA; AIRRPEDFKHYEVQLPDVKIHYVREGAGPTLLLLHGWPGFWWEWSKVIGPLAEHYDVIVP ---3333--------------------------------33333333-3333-------- 
DLRGFGDSEKPDLNDLSKYSLDKAADDQAALLDALGIEKAYVVGHDFAAIVLHKFIRKYS -2222------11111111-------------1111----------------------11 DRVIKAAIFDPIQPDFESWYSQFHQLDMAVEVVGSSREVCKKYFKHFFDHWSYRDELLTE 11-----------------------------1111-------------1111-------- EELEVHVDNCMKPDNIHGGFNYYRANIRPDAALWTDLDHTMSDLPVTMIWGLGDTCVPYA -----------2222------------1111---3333------------------1111 PLIEFVPKYYSNYTMETIEDCGHFLMVEKPEIAIDRIKTAFR ----3333---------------3333--------------- >P8MTCP1; SWP:P56277; PDB:1EI0A; DPCQKQAAEIQKCLQANSYLESKCQAVIQELKKCAAQY -------------------3333--------------- >DNA GYRASE B; SWP:P06982; PDB:1EI1A; SNSSDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDEALAGHCKEIIV ----3333----!!!!----3333------------------------------------ TIHADNSVSVQDDGRGIPTGIHPEEGVSAAEVIMTVLHAGGKFDDNSYKVSGGLHGVGVS --1111------------------------------------------------------ VVNALSQKLELVIQREGKIHRQIYEHGVPQAPLAVTGETEKTGTMVRFWPSLETFTNVTE --------------iiii------iiii----------------------1111------ FEYEILAKRLRELSFLDSGVSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNIF -3333-------------------------------------------1111-------- YFSTEKDGIGVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKEG -----iiii-------------------iiii-1111----------------------3 YSKKAKVSATGDDAREGLIAVVSVKVPDPKFSSQTKDKLVSSEVKSAVEQQMNELLAEYL 333------3333-------------------1111----3333---------------- LENPTDAKIVVGKIIDAARAREAARRAREMT ------------------------------- >D-AMINOPEPTIDASE; SWP:Q9ZBA9; PDB:1EI5A; KFDTSALEAFVRHIPQNYKGPGGVVAVVKDGEVVLQHAWGFADLRTRTPMTLDTRMPICS ----------------------------iiii------------------1111------ VSKQFTCAVLLDAVGEPELLDDALEAYLDKFEDERPAVRDLCNNQSGLRDYWALSVLCGA ---------------3333--------1111-----3333----------3333-1111- DPEGVFLPAQAQSLLRRLKTTHFEPGSHYSYCNGNFRILADLIEAHTGRTLVDILSERIF 1111---------1111------2222----3333------------------------- APAGMKRAELISDTALFDECTGYEGDTVRGFLPATNRIQWMGDAGICASLNDMIAWEQFI 11111111----3333-------------------------------------------- DATRDDESGLYRRLSGPQTFKDGVAAPYGFGLNLHETGGKRLTGHGGALRGWRCQRWHCA 
1111-1111----------1111-----iiii----iiii--------2222-------1 DERLSTIAMFNFEGGASEVAFKLMNIALGVSSSEVSRVEADSAWFGSWLDDETGLVLSLE 111-------------------------------------3333---------------- DAGHGRMKARFGTSPEMMDVVSANEARSAVTTIRRDGETIELVRASENLRLSMKRVKGEA ---------------------1111----------!!!!----3333------------- KHDIIGRYHSDELDADLLLVSEGGAIYGAFEGFLGKSDMYPLYSVGSDVWLLPVQRSMDA ----------1111-------iiii------1111----------2222----------- PSPGEWKLVFRRDDKGEITGLSVGCWLARGVEYRRVQP ------------1111--------3333---------- >PHOSPHONOACETATE HYDROLAS; SWP:Q51782; PDB:1EI6A; TNLISVNSRSYRLSSAPTIVICVDGCEQEYINQAIQAGQAPFLAELTGFGTVLTGDCVVP -----iiii-------------2222--------1111-3333-3333------------ SFTNPNNLSIVTGAPPSVHGICGNFFFDQETQEEVLMNDAKYLRAPTILAEMAKAGQLVA --------------3333--------------------3333------------------ VVTAKDKLRNLLGHQLKGICFSAEKADQVNLEEHGVENILARVGMPVPSVYSADLSEFVF ----3333----2222-----3333----3333-----3333---------3333----- AAGLSLLTNERPDFMYLSTTDYVQHKHAPGTPEANAFYAMMDSYFKRYHEQGAIVAITAD --------------------3333---2222-----------------1111-------- HGMNAKTDAIGRPNILFLQDLLDAQYGAQRTRVLLPITDPYVVHHGALGSYATVYLRDAV -------1111---------------2222--------3333-3333---------3333 PQRDAIDFLAGIAGVEAVLTRSQACQRFELPEDRIGDLVVLGERLTVLGSAADKHDLSGL 3333--------------------------1111--------1111----3333--1111 TVPLRSHGGVSEQKVPLIFNRKLVGLDGRLRNFDIIDLALNHLA --------1111------------------1111---------- >COAT PROTEIN; SWP:P03570; PDB:1EI7A; SYSITTPSQFVFLSSAWADPIELINLCTNALGNQFQTQQARTVVQRQFSEVWKPSPQVTV --------3333-----------------1111-----------------------1111 RFPDSDFKVYRYNAVLDPLVTALLGAFDTRNRIIEVENQANPTTAETLDATRRVDDATVA ---------1111-----------3333-------------------------------- IRSAINNLIVELIRGTGSYNRSSFESSSGLVWTSGPAT --------------2222-------------------- >PALMITOYL PROTEIN THIOEST; SWP:P45478; PDB:1EI9A; DPPAPLPLVIWHGMGDSCCNPLSMGAIKKMVEKKIPGIHVLSLEIGKTLREDVENSFFLN 3333--------2222---1111-----------2222---------------------- VNSQVTTVCQILAKDPKLQQGYNAMGFSQGGQFLRAVAQRCPSPPMVNLISVGGQHQGVF 
----------33333333------------------------------------------ GLPRCPGESSHICDFIRKTLNAGAYNKAIQERLVQAEYWHDPIREDIYRNHSIFLADINQ -2222-1111------------11113333--3333--------------------1111 ERGVNESYKKNLMALKKFVMVKFLNDTIVDPVDSEWFGFYRSGQAKETIPLQESTLYTQD -----------3333-------1111------3333----2222-----3333-3333-1 RLGLKAMDKAGQLVFLALEGDHLQLSEEWFYAHIIPFLE 111----1111---------2222---------3333-- >EIAV CAPSID PROTEIN P26; SWP:P69732; PDB:1EIA; TPRGYTTWVNTIQTNGLLNEASQNLFGILSVDCTSEEMNAFLDVVPGQAGQKQILLDAID -----3333-3333---------------2222--------------------------- KIADDWDNRHPLPNAPLVAPPQGPIPMTARFIRGLGVPRERQMEPAFDQFRQTYRQWIIE ------------------------------1111---3333--3333------------- AMSEGIKVMIGKPKAQNIRQGAKEPYPEFVDRLLSQIKSEGHPQEISKFLTDTLTIQNAN --------1111-3333---11113333---------------------------1111- EECRNAMRHLRPEDTLEEKMYACRDIG ----1111--3333------1111--- >EOTAXIN-2; SWP:O00175; PDB:1EIGA; VVIPSPCCMFFVSKRIPENRVVSYQLSSRSTCLKAGVIFTTKKGQQSCGDPKQEWVQRYM ----------------1111--------------------3333-----3333------- KNLDAKQKKASPR -3333-------- >HYPOTHETICAL PROTEIN MTH1; SWP:O27652; PDB:1EIJA; MRQQLEMQKKQIMMQILTPEARSRLANLRLTRPDFVEQIELQLIQLAQMGRVRSKITDEQ --------3333---------------11113333----------3333----------- LKELLKRVAGKK ------------ >RNA POLYMERASE SUBUNIT RP; SWP:O27122; PDB:1EIKA; MKREILKHQLVPEHVILNESEAKRVLKELDAHPEQLPKIKTTDPVAKAIGAKRGDIVKII -----------------3333----------3333----33333333%%%%--------- RKSPTAEEFVTYRLVQD ----------------- >MU-AGATOXIN-I; SWP:P11057; PDB:1EIT; ECVPENGHCRDWYDECCEGFYCSCRQPPKCICRNNN ---2222----------------------------- >HYPOTHETICAL PROTEIN MTH5; SWP:O26638; PDB:1EIWA; VTAEIRLYITEGEVEDYRVFLERLEQSGLEWRPATPEDADAVIVLAGLWGTRRDEILGAV -------------3333-----------------1111---------------------- DLARKSSKPIITVRPYGLENVPPELEAVSSEVVGWNPHCIRDALEDALDVI ------------------------3333---------3333---------- >FTSJ; SWP:P28692; PDB:1EJ0A; GLRSRAWFKLDEIQQSDKLFKPGMTVVDLGAAPGGWSQYVVTQIGGKGRIIACDLLPMDP --------------------2222--------------------1111------------ 
IVGVDFLQGDFRDELVMKALLERVGDSKVQVVMSDMAPNMSGTPAVDIPRAMYLVELALE 2222-----3333------3333!!!!---------------1111-------------- MCRDVLAPGGSFVVKVFQGEGFDEYLREIRSLFTKVKVRKPDSSRARSREVYIVATGRKP ------2222--------2222-------1111-------11113333------------ >NICOTINAMIDE MONONUCLEOTI; SWP:O26253; PDB:1EJ2A; MRGLLVGRMQPFHRGHLQVIKSILEEVDELIICIGSAQLSHSIRDPFTAGERVMMLTKAL ----------------------1111---------1111--1111--------------- SENGIPASRYYIIPVQDIECNALWVGHIKMLTPPFDRVYSGNPLVQRLFSEDGYEVTAPP 1111-1111----------3333-----1111-----------------1111------- LFYRDRYSGTEVRRRMLDDGDWRSLLPESVVEVIDEINGVERIKHLA --1111---------------1111-3333----------------- >WISKOTT-ALDRICH SYNDROME ; SWP:P42768; PDB:1EJ5A; SGFKHVSHVGWDPQNGFDVNNLDPDLRSLFSRAGISEAQLTDAETSKLIYDFIEDQGGLE -----------------3333----3333------3333------------3333--333 AVRQEMRRQGGSGGSQSSEGLVGALMHVMQKRSRAIHSSDEGEDQAG 3-----1111---------3333------3333-------------- >LAMBDA2; SWP:P11079; PDB:1EJ6A; ANVWGVRLADSLSSPTIETRTRQYTLHDLCSDLDANPGREPWKPLRNQRTNNIVAVQLFR --!!!!-------------------------3333------------------------- PLQGLVLDTQLYGFPGAFDDWERFMREKLRVLKYEVLRIYPISNYSNEHVNVFVANALVG -------3333-----3333--------------------33331111------------ AFLSNQAFYDLLPLLIINDTMIGDLLGTGASLSQFFQSHGDVLEVAAGRKYLQMENYSND -1111--11111111------3333------3333---!!!!-----------3333--1 DDDPPLFAKDLSDYAKAFYSDTYEVLDRFFWTHDSSAGVLVHYDKPTNGHHYLLGTLTQM 111--iiii------------3333-3333----3333---------------------- VSAPPYIINATDAMLLESCLEQFSANVRARPAQPVTRLDQCYHLRWGAQYVGEDSLTYRL -----------------------3333--1111-------------1111----3333-- GVLSLLATNGYQLARPIPRQLTNRWLSSFVSQIMSDGVNETPLWPQERYVQIAYDSPSVV -------------------------------1111-----------------------33 DGATQYGYVRKNQLRLGMRISALQSLSDTPSPVQWLPQYTIDQAAMDEGDLMVSRLTQLP 33---------------------------------------------------------- LRPDYGNIWVGDALSYYVDYNRSHRVVLSSELPQLPDTYFDGDEQYGRSLFSLARKIGDR --------------------1111---3333----1111--3333--------------- SLVKDTAVLKHAYQAIDPNTGKEYLRSRQSVAYFGASAGHSGADQPLVIEPWIQGKISGV 
----------1111------------------------3333---3333-------2222 PPPSSVRQFGYDVARGAIVDLARPFPSGDYQFVYSDVDQVVDGHDDLSISSGLVESLLSS -------------------1111-------------------1111-------------- CMHATAPGGSFVVKINFPTRPVWHYIEQKILPNITSYMLIKPFVTNNVELFFVAFGVHQH -----2222---------3333-------1111--------------------------- SSLTWTSGVYFFLVDHFYRYETLSTISRQLPSFGYVDDGSSVTGIETISIENPGFSNMTQ --------------------------1111-----------------------------3 AARIGISGLCANVGNARKSIAIYESHGARVLTITSRRSPASARRKSRLRYLPLIDPRSLE 333---------------------iiii---------3333---1111-------3333- VQARTILPADPVLFENVSGASPHVCLTMMYNFEVSSAVYDGDVVLDLGTGPEAKILELIP -------------------------------------------------33333333--- ATSPVTCVDIRPTAQPSGCWNVRTTFLELDYLSDGWITGVRGDIVTCMLSLGAAAAGKSM ----------------3333---------1111--3333----------------1111- TFDAAFQQLIKVLSKSTANVVLVQVNCPTDVVRSIKGYLEIDSTNKRYRFPKFGRDEPYS ------------1111---------------------------------1111------- DMDALEKICRTAWPNCSITWVPLSYDLRWTRLALLESTTLSSASIRIAELMYKYMPIMRI --------33331111----------3333---1111----------------------- DIHGLPMEKRGNFIVGQNCSLVIPGFNAQDVFNCYFNSALAFSTEDVNAAMIPQVSAQFD -------------2222---------1111------------3333-------------- ATKGEWTLDMVFSDAGIYTMQALVGSNANPVSLGSFVVDSPDVDITDAWPAQLDFTIAGT 1111--------------------1111---------------------------3333- DVDITVNPYYRLMTFVRIDGQWQIANPDKFQFFSTLVMNVKLDIADKYLLYYIRDVQSRD ------3333-------iiii----1111-------------3333-------------- VGFYIQHPLQLLNTITLPTNEDLFLSAPDMREWAVKESGNTICILNSQGFVLPQDWDVLT --------3333------------------------iiii---2222-----1111---- DTISWSPSIPTYIVPPGDYTLTPL -----3333--------------- >Major core protein lambda; SWP:P15024; PDB:1EJ6B; NKKTAQLLHADTPRLVTWDAGLCTSFKIVPIVPAQVPQDVLAYTFFTSSYAIQSPFPEAA --3333-----------------------------------11113333----------- VSRIVVHTRWASNVDFDRDSSVIMAPPTENNIHLFKQLLNTETLSVRGANPLMFRANVLH --------1111-------------1111-3333-----1111------3333------- MLLEFVLDNLYLNRHTGFSQDHTPFTEGANLRSLPGPDAEKWYSIMYPTRMGTPNVSKIC -----3333-----------------------------3333----3333---------- 
NFVASCVRNRVGRFDRAQMMNGAMSEWVDVFETSDALTVSIRGRWMARLARMNINPTEIE --1111-------------2222--------------------------1111-3333-- WALTECAQGYVTVTSPYAPSVNRLMPYRISNAERQISQIIRIMNIGNNATVIQPVLQDIS ------iiii----------------------------------2222---3333----- VLLQRISPLQIDPTIISNTMSTVSESTTQTLSPASSILGKLRPFSSFRVALAGWLYNGVV -------------------------1111--3333--------------------3333- TTVIDDSSYPKDGGSVTSLENLWDFFILALALPLTTDPCAPVKAFMTLANMMVGFETIPM ----3333-1111------------------1111-1111-------33332222----- DNQIYTQSRRASAFSTPHTWPRCFMNIQLISPIDAPILRQWAEIIHRYWPNPSQIRYGAP -----11113333--3333-3333----------------------------------33 NVFGSANLFTPPEVLLLPIDHQPANVTTPTLDFTNELTNWRARVCELMKNLVDNQRYQPG 33--------2222---------------------------------------1111111 WTQSLVSSMRGTLDKLKLIKSMTPMYLQQLAPVELAVIAPMLPFPPFQVPYVRLDRDRVP 13333----------1111-------------------1111------------3333-- TMVGVTRQSRDTITQPALSLSTTNTTVGVPLALDARAITVALLSGKYPPDLVTNVWYADA --------------33333333-----------------------------3333----- IYPMYADTEVFSNLQRDMITCEAVQTLVTLVAQISETQYPVDRYLDWIPSLRASAATAAT 3333-----3333-------------------------------3333------------ FAEWVNTSMKTAFDLSDMLLEPLLSGDPRMTQLAIQYQQYNGRTFNVIPEMPGSVIADCV ----------1111-----3333---------------1111------------------ QLTAEVFNHEYNLFGIARGDIIIGRVQSTHLWSPLAPPPDLVFDRDTPGVHIFGRDCRIS ---------3333-------------------1111--1111-1111------------- FGMNGAAPMIRDETGMMVPFEGNWIFPLALWQMNTRYFNQQFDAWIKTGELRIRIEMGAY --iiii--------------------3333-----------3333--------------- PYMLHYYDPRQYANAWNLTSAWLEEITPTSIPSVPFMVPISSDHDISSAPAVQYIISTEY -------1111-----------1111---------------------------------- NDRSLFCTNSSSPQTIAGPDKHIPVERYNILTNPDAPPTQIQLPEVVDLYNVVTRYAYET -3333---1111-----------3333-----11111111-------------------- PPITAVVMGVP -3333------ >Sigma-2 protein; SWP:P11314; PDB:1EJ6D; ARAAFLFKTVGFGGLQNVPINDELSSHLLRAGNSPWQLTQFLDWISLGRGLATSALVPTA --------------------3333----1111-------------%%%%----1111-33 GSRYYQMSCLLSGTLQIPFRPNHRWGDIRFLRLVWSAPTLDGLVVAPPQVLAQPALQAQA 
33--------------1111---------iiii---1111------3333---------- DRVYDCDDYPFLARDPRFKHRVYQQLSAVTLLNLTGFGPISYVRVDEDMWSGDVNQLLMN ----3333-------------------------------------3333----------- YFGHTFAEIAYTLCQASANRPWEHDGTYARMTQIILSLFWLSYVGVIHQQNTYRTFYFQC 2222-------------------------------------1111--1111-iiii---- NRRGDAAEVWILSCSLNHSAQIRPGNRSLFVMPTSPDWNMDVNLILSSTLTGCLCSGSQL ----------------------------------1111---------------------- PLIDNNSVPAVSRNIHGWTGRAGNQLHGFQVRRMVTEFCDRLRRDGVMTQAQQNQIEALA ---3333----------------------3333--------3333--------------- DQTQQFKRDKLEAWAREDDQYNQANPNSTMFRTKPFTNAQWGRGNTGATSAAIAALI ------------------------1111--------3333----3333----1111- >LYS7; SWP:P40202; PDB:1EJ8A; SSAVAILETFQKYTIDQKKDTAVRGLARIVQVGENKTLFDITVNGVPEAGNYHASIHEKG ------------1111-------------------------------------------- DVSKGVESTGKVWHKFDEPIECFNESDLGKNLYSGKTFLSAPLPTWQLIGRSFVISKSLN -11111111------------------1111------------33332222--------- HPENEPSSVKDYSFLGVIAR 3333---------------- >Bdellastasin; SWP:P82107; PDB:1EJAB; TTPCGPVTCSGAQMCEVDKCVCSDLHCKVKCEHGFKKDDNGCEYACICADAPQ ---!!!!---------------------------------------------- >LUMAZINE SYNTHASE; SWP:P50861; PDB:1EJBA; AVKGLGKPDQVYDGSKIRVGIIHARWNRVIIDALVKGAIERMASLGVEENNIIIETVPGS ------1111---1111-------------------------1111-1111-------33 YELPWGTKRFVDRQAKLGKPLDVVIPIGVLIKGSTMHFEYISDSTTHALMNLQEKVDMPV 33------------1111---------------------------------3333----- IFGLLTCMTEEQALARAGIDEAHSMHNHGEDWGAAAVEMAVKFGKNAF --------------1111-3333--------------------1111- >UDP-N-ACETYLGLUCOSAMINE E; SWP:P33038; PDB:1EJDA; MDKFRVQGPTRLQGEVTISGAKNAALPILFAALLAEEPVEIQNVPKLKDIDTTMKLLTQL -------------------------------1111-----------3333-------111 GTKVERGSVWIDASNVNNFSAPYDLVKTMRASIWALGPLVARFGQGQVSLPGGCAIGARP 1--------------------33331111-----------------------------33 VDLHIFGLEKLGAEIKLEEGYVKASVNGRLKGAHIVMDKVSVGATVTIMSAATLAEGTTI 33------1111-----2222------------------------------1111----- IENAAREPEIVDTANFLVALGAKISGQGTDRITIEGVERLGGGVYRVLPDRIETGTFLVA 
-----------------1111----2222------------------------------- AAISGGKIVCRNAQPDTLDAVLAKLREAGADIETGEDWISLDMHGKRPKAVTVRTAPHPA -------------1111-------------------------iiii-------------- FPTDMQAQFTLLNLVAEGTGVITETIFENRFMHVPELIRMGAHAEIESNTVICHGVEKLS -1111-------------------------3333---1111-----!!!!---------- GAQVMATDLRASASLVLAGCIAEGTTVVDRIYHIDRGYERIEDKLRALGANIERVKGE ---------------------------------1111--------1111--------- >FMN-BINDING PROTEIN; SWP:NA; PDB:1EJEA; GSQAAHMMSMDFEDFPVESAHRILTPRPTVMVTTVDEEGNINAAPFSFTMPVSIDPPVVA -----1111------11111111------------1111--------------------- FASAPDHHTARNIESTHEFVINITPADIIERMWVTARDIPAGENELEAAGLAWTSSRRVK ---1111-----------------3333---3333----2222-3333------------ PPRIVEAPGHLECELLRMFEVGDHNLITGSVVSASVRSGAVKEGLLDVESVKPVLHVGGN ---3333-------------!!!!------------2222-iiii--3333------!!! KFVVGDHVRHVE !----------- >PROGESTERONE RECEPTOR P23; SWP:Q15185; PDB:1EJFA; MQPASAKWYDRRDYVFIEFCVEDSKDVNVNFEKSKLTFSCLGGSDNFKHLNEIDLFHCID ----------1111----------------------------1111-------------3 PNDSKHKRTDRSILCCLRKGESGQSWPRLTKERAKLNWLSVDFNNWKDWE 333-----------------2222-----------1111---1111---- >CRAMBIN (PRO22,SER22/LEU2; SWP:P01542; PDB:1EJGA; TTCCPSIVARSNFNVCRLPGTPEALCATYTGCIIIPGATCPGDYAN ----------------1111--------------------1111-- >SERINE HYDROXYMETHYLTRANS; SWP:P50431; PDB:1EJIA; MADRDATLWASHEKLSQPLKDSDAEVYSIIKKESNRQRVGLELIASENFASRAVLEALGS ------3333-------3333----------------------1111-------1111-- SLNNKYSEGYPGQRYYGGTEFIDELELCQKRALQAYHLDPQCWGVNVQPYSGSPANFAVY -1111----------------------------1111-3333------------------ TALVEPHGRIGLDLPDGGHLTHGFTDKKKISATSIFFESPYKVYPETGYINYDQLEENAS ----2222----1111--1111--------3333-------------------------- LFHPKLIIAGTSCYSRNLDYARLRKIADDNGAYLADAHISGLVAAGVVPSPFEHCHVVTT ---------------------------1111------------------3333------- TTHKTLRGCRAGIFYRKGVRSVDPKTGKETYYELESLINSAVFPGLQGGPHNHAIAGVAV --!!!!-----------------1111--------------------------------- ALKQATTEFKIYQLQVLANCRALSDALTELGYKIVTGGSDNHLILDLRSKGTDGGRAEKV 
---------------------------------2222--------3333---3333---- LEACSIACNKNTCPGDKSALRPSGLRLGTPALTSRGLLEEDFQKVAHFIHRGIELTLQIQ -----------------2222---------3333-------------------------- SHATKATLKEFKEKLAGDEKIQSAVATLREEVENFASNFSLPGLPDF ------3333--------3333-------------1111-------- >Importin subunit alpha-2; SWP:P52293; PDB:1EJLI; GTVNWSVEDIVKGINSNNLESQLQATQAARKLLSREKQPPIDNIIRAGLIPKFVSFLGKT ------------1111--------------------------------3333--333311 DCSPIQFESAWALTNIASGTSEQTKAVVDGGAIPAFISLLASPHAHISEQAVWALGNIAG 11-------------1111-------------------1111-3333------------- DGSAFRDLVIKHGAIDPLLALLAVPDLSTLACGYLRNLTWTLSNLCRNKNPAPPLDAVEQ -------------------1111--3333------------3333--------------- ILPTLVRLLHHNDPEVLADSCWAISYLTDGPNERIEMVVKKGVVPQLVKLLGATELPIVT -------1111--------------1111--------1111--------1111-3333-- PALRAIGNIVTGTDEQTQKVIDAGALAVFPSLLTNPKTNIQKEATWTMSNITAGRQDQIQ -------1111-------------------------3333-------------------- QVVNHGLVPFLVGVLSKADFKTQKEAAWAITNYTSGGTVEQIVYLVHCGIIEPLMNLLSA -------------------------------1111----------1111-----1111-- KDTKIIQVILDAISNIFQAAEKLGETEKLSIMIEECGGLDKIEALQRHENESVYKASLNL -----------------------------------------1111--------------- IEKYFS ------ >Pancreatic trypsin inhibi; SWP:P00974; PDB:1EJMB; RPDFCLEPPYTGPCRLRIIRYFYNAKAGLCQTFVYGGCRAKRNNFKSAEDCLRTCGGA -3333-------------------1111------------------------------ >Igk-V21-4 protein; SWP:A0A5E6; PDB:1EJOL; DIVLTQSPASLAVSLGQRATISCRASESVDSYGNSFMHWYQQKPGQPPKLLIYRASNLES -------------2222-------------iiii--------2222------------22 GIPARFSGSGSRTDFTLTINPVEADDVATYYCQQSNEDPLTFGAGTKLELKRADAAPTVS 223333----------------1111---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVRWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----33331111---------------------iiii----------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRA ---------------------3333----------- >UREASE ALPHA SUBUNIT; SWP:P18316; PDB:1EJXA; MELTPREKDKLLLFTAALVAERRLARGLKLNYPESVALISAFIMEGARDGKSVASLMEEG 
-----------------------1111------------------------3333--333 RHVLTREQVMEGVPEMIPDIQVEATFPDGSKLVTVHNPII 3---3333-22221111----------------------- >Urease subunit beta; SWP:P18315; PDB:1EJXB; MIPGEYHVKPGQIALNTGRATCRVVVENHGDRPIQVGSHYHFAEVNPALKFDRQQAAGYR -2222----------2222-----------------11113333-3333--33332222- LNIPAGTAVRFEPGQKREVELVAFAGHRAVFGFRGEVMGPL ---2222----2222---------!!!!---!!!!------ >Urease subunit alpha; SWP:P18314; PDB:1EJXC; SNISRQAYADMFGPTVGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQML --------------2222---!!!!-------------------2222--2222-----3 AADCVDLVLTNALIVDHWGIVKADIGVKDGRIFAIGKAGNPDIQPNVTIPIGAATEVIAA 333------------1111--------iiii--------3333--------1111----2 EGKIVTAGGIDTHIHWICPQQAEEALVSGVTTMVGGGTGPAAGTHATTCTPGPWYISRML 222--------------------------------------------------------- QAADSLPVNIGLLGKGNVSQPDALREQVAAGVIGLIHEDWGATPAAIDCALTVADEMDIQ 3333--------------------------------3333-------------------- VALHSDTLNESGFVEDTLAAIGGRTIHTFHTEGAGGGAPIITACAHPNILPSSTNPTLPY -----3333---3333----iiii-----3333------------1111-----1111-- TLNTIDEHLDMLMVCVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQAMGRVGEVILRT 1111-----------3333----3333------------------2222--1111----- WQVAHRMKVQRGALAEETGDNDNFRVKRYIAKYTINPALTHGIAHEVGSIEVGKLADLVV -------------1111-------------------------3333----2222------ WSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGALGSARHHCRLTFLSQ -3333---------iiii-------1111------------1111--------------- AAAANGVAERLNLRSAIAVVKGCRTVQKADMVHNSLQPNITVDAQTYEVRVDGELITSEP --------------------------33332222----------------iiii------ ADVLPMAQRYFLF ------3333--- >GTP-BINDING PROTEIN YPT51; SWP:P36017; PDB:1EK0A; VTSIKLVLLGEAAVGKSSIVLRFVSNDFAENKEPTIGAAFLTQRVTINEHTVKFEIWDTA ----------2222--------------1111--------------!!!!---------- GQERFASLAPYYRNAQAALVVYDVTKPQSFIKARHWVKELHEQASKDIIIALVGNKIDLQ -33333333--1111-------11113333--------------1111------------ EGGERKVAREEGEKLAEEKGLLFFETSAKTGENVNDVFLGIGEKIPLK -------3333----------------1111---------1111---- >KAPPA-4 IMMUNOGLOBULIN LI; SWP:P01625; 
PDB:1EK3A; DIVMTQSPDSLAVSPGERATINCKSSQNLLDSS -------------2222---------------- >UDP-GALACTOSE 4-EPIMERASE; SWP:Q14376; PDB:1EK6A; MAEKVLVTGGAGYIGSHTVLELLEAGYLPVVIDNFHNAFRGGGSLPESLRRVQELTGRSV --------1111----------1111---------------------------------- EFEEMDILDQGALQRLFKKYSFMAVIHFAGLKAVGESVQKPLDYYRVNLTGTIQLLEIMK -----1111-----------------------3333-----------------------1 AHGVKNLVFSSSATVYGNPQYLPLDEAHPTGGCTNPYGKSKFFIEEMIRDLCQADKTWNA 111---------------------1111--------------------------1111-- VLLRYFNPTGAHASGCIGEDPQGIPNNLMPYVSQVAIGRREALNVFGNDYDTEDGTGVRD -----------3333------------------------------------1111----- YIHVVDLAKGHIAALRKLKEQCGCRIYNLGTGTGYSVLQMVQAMEKASGKKIPYKVVARR -----------------1111--------------------------------------2 EGDVAACYANPSLAQEELGWTAALGLDRMCEDLWRWQKQNPSGFGT 222------------------------------------1111--- >Enteropeptidase [Precurso; SWP:P98072; PDB:1EKBB; IVGGSDSREGAWPWVVALYFDDQQVCGASLVSRDWLVSAAHCVYRNME -------22221111-----------------------1111------ >RIBONUCLEASE HII; SWP:Q57599; PDB:1EKEA; IIIGIDEAGRGPVLGPVVCAFAIEKEREEELKKLGVKELTKNKRAYLKKLLENLGYVEKR -----------------------333333331111----3333----------------- ILEAEEINQLNSINLNDIEINAFSKVAKNLIEKLNIRDDEIEIYIDACSTNTKKFEDSFK -------------------------------1111------------------------- DKIEDIIKERNLNIKIIAEHKADAKYPVVSAASIIAKAERDEIIDYYKKIYGDIGSGYPS ---3333-------------3333----------------------1111-------333 DPKTIKFLEDYFKKHKKLPDIARTHWKTCKRILDKSKQT 3-----------------11111111-------1111-- >FRATAXIN; SWP:Q16595; PDB:1EKGA; LDETTYERLAEETLDSLAEFFEDLADKPYTFEDYDVSFGSGVLTVKLGGDLGTYVINKQT ---------------------3333-11111111----iiii-----%%%%--------1 PNKQIWLSSPSSGPKRYDWTGKNWVYSHDGVSLHELLAAELTKALKTKLDLSSLAYSGK 111---------------------------------------1111----1111----- >BETA-CARBONIC ANHYDRASE; SWP:P17067; PDB:1EKJA; EASERIKTGFLHFKKEKYDKNPALYGELAKGQSPPFMVFACSDSRVCPSHVLDFQPGEAF --------------------3333--3333------------11113333---------- VVRNVANLVPPYDQAKYAGTGAAIEYAVLHLKVSNIVVIGHSACGGIKGLLSFPFDGTYS 
----%%%%----3333-------------------------------------------- TDFIEEWVKIGLPAKAKVKAQHGDAPFAELCTHCEKEAVNASLGNLLTYPFVREGLVNKT --3333-------------------3333--------------3333-3333---1111- LALKGGYYDFVKGSFELWGLEFGLSSTFSV ------------------------------ >HYDROXYETHYLTHIAZOLE KINA; SWP:P39593; PDB:1EKQA; MDAQSAAKCLTAVRRHSPLVHSITNNVVTNFTANGLLALGASPVMAYAKEEVADMAKIAG ------------------------3333-------------------1111---3333-- ALVLNIGTLSKESVEAMIIAGKSANEHGVPVILDPVGAGATPFRTESARDIIREVRLAAI ------------------------1111------2222---------------------- RGNAAEIAHTVGGDIIRLAQQAAQKLNTVIAITGEVDVIADTSHVYTLHNGHKLLTKVTG -----------------------1111-------------1111--------3333-222 AGLLTSVVGAFCAVEENPLFAAIAAISSYGVAAQLAAQQTADKGPGSFQIELLNKLSTVT 2--------------------------------------!!!!----------------- EQDVQEWATIERV ------------- >MOLYBDENUM COFACTOR BIOSY; SWP:P30747; PDB:1EKRA; GEAHMVDVSAKAETVREARAEAFVTMRSETLAMIIDGRHHKGDVFATARIAGIQAAKRTW --------------------------------------3333---------------333 DLIPLCHPLMLSKVEVNLQAEPEHNRVRIETLCRLTGKTGVEMEALTAASVAALTIYDMC 3-1111--------------3333------------------------------------ KAVQKDMVIGPVRLLAKSSGDFK 3333------------------- >INTERFERON GAMMA; SWP:P01579; PDB:1EKUA; MQDPYVKEAENLKKYFNAGHSDVADNGTLFLGILKNWKEESDRKIMQSQIVSFYFKLFKN ---3333------1111---1111-----3333-------------------------11 FKDDQSIQKSVETIKEDMNVKFFNSNKKKRDDFEKLTNYSVTDLNVQRKAIDELIQVMAE 11------------------1111---------------3333----------------- FSTEEQQE -------- >MATERNAL EFFECT PROTEIN (; SWP:P25159; PDB:1EKZA; MDEGDKKSPISQVHEIGIKRNMTVHFKVLREEGPAHMKNFITACIVGSIVTEGEGNGKKV --------------------------------------------------------1111 SKKRAAEKMLVELQKL -----------3333- >I-309; SWP:P22362; PDB:1EL0A; SKSMQVPFSRCCFSFAEQEIPLRAILCYRNTSSICSNEGLIFKLKRGKEACALDTVGWVQ --------------------3333--------3333-------3333---------3333 RHRKMLRHCPSKRK -------------- >BASEPLATE STRUCTURAL PROT; SWP:P10929; PDB:1EL6A; SRLADFLGFRPKTGDIDVMNRQSVGSVTISQLAKGFYEPNIESAINDVHNFSIKDVGTII 
--3333-----2222--%%%%-2222-33331111-------------1111--2222-- TNKTGVSPEGVSQTDYWAFSGTVTDDSLPPGSPITVLVFGLPVSATTGMTAIEFVAKVRV ------------------------33332222-----iiii------------------- ALQEAIASFTAINSYKDHPTDGSKLEVTYLDNQKHVLSTYSTYGITISQEIISESKPGYG -----1111-----------1111-----------------iiii--------------- TWNLLGAQTVTLDNQQTPTVFYHFERTA ---------------------------- >MALTODEXTRIN-BINDING PROT; SWP:P58300; PDB:1ELJA; MKIEEGKVVIWHAMQPNELEVFQSLAEEYMALPEVEIVFEQKPNLEDALKAAIPTGQGPD --------------3333----------------------------------1111---- LFIWAHDWIGKFAEAGLLEPIDEYVTEDLLNEFAPMAQDAMQYKGHYYALPFAAETVAII ----1111----1111----3333-----1111---------iiii-------------- YNKEMVSEPPKTFDEMKAIMEKYYDPANEKYGIAWPINAYFISAIAQAFGGYYFDDKTEQ -1111--------------------1111--------3333---1111------------ PGLDKPETIEGFKFFFTEIWPYMAPTGDYNTQQSIFLEGRAPMMVNGPWSINDVKKAGIN -1111-------------3333-------------1111-------3333----1111-- FGVVPLPPIIKDGKEYWPRPYGGVKLIYFAAGIKNKDAAWKFAKWLTTSEESIKTLALEL ----------iiii---------------2222--------------------------- GYIPVLTKVLDDPEIKNDPVIYGFGQAVQHAYLMPKSPKMSAVWGGVDGAINEILQDPQN -----3333--3333-----------3333------------------------------ ADIEGILKKYQQEILNNMQ ------------------- >TARGET OF MYB1; SWP:O60784; PDB:1ELKA; SDFLLGNPFSSPVGQRIEKATDGSLQSEDWALNMEICDIINETEEGPKDALRAVKKRIVG ------11113333-------3333-----------------1111-----------222 NKNFHEVMLALTVLETCVKNCGHRFHVLVASQDFVESVLVRTILPKNNPPTIVHDKVLNL 2--------------------3333-------------3333-3333------------- IQSWADAFRSSPDLTGVVTIYEDLRRKGLEFPM -------1111-------------1111----- >GAMMA-D CRYSTALLIN; SWP:P08209; PDB:1ELPA; GKITFYEDRGFQGRHYECSSDHSNLQPYLGRCNSVRVDSGCWMIYEQPNYLGPQYFLRRG --------%%%%------------3333-------------------%%%%--------- DYPDYQQWMGLNDSIRSCRLIPHAGSHRLRLYEREDYRGQMIEITEDCSSLQDRFHFNEI ---3333---------------------------%%%%---------------------- HSLNVLEGSWVLYELPNYRGRQYLLRPGEYRRYHDWGAMNAKVGSLRRVIDIY -------------------------------3333------------------ >TPR2A-DOMAIN OF HOP; SWP:P31948; PDB:1ELRA; 
GKQALKEKELGNDAYKKKDFDTALKHYDKAKELDPTNMTYITNQAAVYFEKGDYNKCREL --------------1111---------------1111-----------1111-------- CEKAIEVGRENREDYRQIAKAYARIGNSYFKEEKYKDAIHFYNKSLAEHRTPDVLKKCQQ ------3333-------------------1111--------------------------- AEKILKEQ -------- >ELASTASE; SWP:Q7SIG3; PDB:1ELT; VVGGRVAQPNSWPWQISLQYKSGSSYYHTCGGSLIRQGWVMTAAHCVDSARTWRVVLGEH -------22221111------!!!!----------1111---3333-------------- NLNTNEGKEQIMTVNSVFIHSGWNSDDVAGGYDIALLRLNTQASLNSAVQLAALPPSNQI 1111---------------111111111111--------------1111------2222- LPNNNPCYITGWGKTSTGGPLSDSLKQAWLPSVDHATCSSSGWWGSTVKTTMVCAGGGAN -2222----------------------------3333--3333!!!!-1111-------- SGCNGDSGGPLNCQVNGSYYVHGVTSFVSSSGCNASKKPTVFTRVSAYISWMNGIM --2222--------iiii----------3333--2222-----3333--------- >L-CYSTEINE/L-CYSTINE C-S ; SWP:Q9ZHG9; PDB:1ELUA; QFPGLANKTYFNFGGQGILPTVALEAITAMYGYLQENGPFSIAANQHIQQLIAQLRQALA -3333------3333----3333------------------------------------- ETFNVDPNTITITDNVTTGCDIVLWGLDWHQGDEILLTDCEHPGIIAIVQAIAARFGITY ------1111----3333----1111---2222--------------------------- RFFPVAATLNQGDAAAVLANHLGPKTRLVILSHLLWNTGQVLPLAEIMAVCRRHQGNYPV ----1111--------------1111---------------------------------- RVLVDGAQSAGSLPLDFSRLEVDYYAFTGHKWFAGPAGVGGLYIHGDCLGEINPTYVGWR --------2222------------------1111-2222-----11111111-----111 SITYGAKGEPTGWAEGGKRFEVATSAYPQYAGLLAALQLHQRQGTAEERYQAICQRSEFL 1---1111------!!!!-------3333----------3333----------------- WRGLNQLPHVHCLATSAPQAGLVSFTVDSPLGHRAIVQKLEEQRIYLRTIADPDCIRACC ------1111-------------------------------------------------- HYITDEEEINHLLARLADFGP 1111-----------1111-- >COMPLEMENT C1S COMPONENT; SWP:P09871; PDB:1ELVA; LDCGIPESIENGKVEDPESTLFGSVIRYTCEEPYYYMEGGGEYHCAGNGSWVNEVLGPEL --------------------2222---------------------1111------!!!!- PKCVPVCGVPREPFIIGGSDADIKNFPWQVFFDNPWAGGALINEYWVLTAAHVVEGNREP ---------------------33331111--------------------33331111--- TMYVGSTSVQKMLTPEHVFIHPGWKLLAVPEGRTNFDNDIALVRLKDPVKMGPTVSPICL 
--------------------1111-----2222------------------1111----- PGTSSDYNLMDGDLGLISGWGRTEKRDRAVRLKAARLPVAPLRKCKEVAYVFTPNMICAG ---3333--2222---------1111--------------33331111----1111---- GEKGMDSCKGDSGGAFAVQDPNDKTKFYAAGLVSWGPQCGTYGLYTRVKNYVDWIMKTMQ -%%%%--2222--------1111--------------2222-----3333---------- ENS --- >TPR1-DOMAIN OF HOP; SWP:P31948; PDB:1ELWA; EQVNELKEKGNKALSVGNIDDALQCYSEAIKLDPHNHVLYSNRSAAYAKKGDYQKAYEDG -------------1111---------------1111------------------------ CKTVDLKPDWGKGYSRKAAALEFLNRFEEAKRTYEEGLKHEANNPQLKEGLQNMEAR ------11113333----------------------11111111---------1111 >MLN64 PROTEIN; SWP:Q14849; PDB:1EM2A; SFSAQEREYIRQGKEATAVVDQILAQEENWKFEKNNEYGDTVYTIEVPFHGKTFILKTFL -------------------------3333------1111--------------------- PCPAELVYQEVILQPERVLWNKTVTACQILQRVEDNTLISYDVSAGAAGGVVSPRDFVNV --3333-------3333---1111-----------------------iiii--------- RRIERRRDRYLSSGIATSHSAKPPTHKYVRGENGPGGIVLKSASNPRVCTFVWILNTDLK -----1111---------1111--3333-------------------------------- GRLPRYLIHQSLAATFEFAFHLRQRISELGA ---3333------------------------ >PROTEIN G; SWP:P06654; PDB:1EM7A; TTYKLILNGKTLKGETTTEAVDAETAERVFKEYAKKNGVDGEWTYDDATKTFTVTE ----------------------------------1111--------1111------ >DNA POLYMERASE III CHI SU; SWP:P28905; PDB:1EM8A; MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAYRLDEALWARP --------------iiii-----------------------------------3333--- AESFVPHNLAGEGPRGGAPVEIAWPQKRSSSRRDILISLRTSFADFATAFTEVVDFVPYE -------------2222--------------------------1111------------3 DSLKQLARERYKAYRVAGFNLNTATWK 333------------------------ >DNA polymerase III subuni; SWP:P28632; PDB:1EM8B; GEIAIAIPAHVRLVMVANDLPALTDPLVSDVLRALTVSPDQVLQLTPEKIAMLPQGSHCN -------1111----------1111-------1111-3333------3333--2222--- SWRLGTDEPLSLEGAQVASPALTDLRANPTARAALWQQICTYEHDFFPRN -----------------------------------------3333----- >GAG POLYPROTEIN CAPSID PR; SWP:P03322; PDB:1EM9A; PVVIKTEGPAWTPLEPKLITRLADTVRTKGLRSPITMAEVEALMSSPLLPHDVTNLMRVI 
----3333---------------------1111--------1111---3333-------- LGPAPYALWMDAWGVQLQTVIAAATRDPRHPANGQGRGERTNLNRLKGLADGMVGNPQGQ --------------------------1111--------------1111-2222------- AALLRPGELVAITASALQAFREVARLA ----3333------------------- >FIBRILLIN; SWP:P35555; PDB:1EMN; SAVDMDECKEPDVCKHGQCINTDGSYRCECPFGYILAGNECVDTDECSVGNPCGNGTCKN --------------------------------------------3333--3333------ VIGGFECTCEEGFEPGPMMTCE ---------------------- >NIT-FRAGILE HISTIDINE TRI; SWP:O76463; PDB:1EMSA; MATGRHFIAVCQMTSDNDLEKNFQAAKNMIERAGEKKCEMVFLPECFDFIGLNKNEQIDL ---------------------------------1111--------3333----------- AMATDCEYMEKYRELARKHNIWLSLGGLHHKDPSDAAHPWNTHLIIDSDGVTRAEYNKLH ----------------1111-----------1111-----------1111---------- LFDLEIPGKVRLMESEFSKAGTEMIPPVDTPIGRLGLSICYDVRFPELSLWNRKRGAQLL ------------1111-------------1111------3333----------------- SFPSAFTLNTGLAHWETLLRARAIENQCYVVAAAQTGAHNPKRQSYGHSMVVDPWGAVVA ------3333-------------1111-------------------------1111---- QCSERVDMCFAEIDLSYVDTLREMQPVFSHRRSDLYTLHINEKSSETGGLKFARFNIPAD -------------------------1111--1111----------------!!!!--333 HIFYSTPHSFVFVNLKPVTDGHVLVSPKRVVPRLTDLTDAETADLFIVAKKVQAMLEKHH 3----1111---------2222----------1111---------------------111 NVTSTTICVQDGKDAGQTVPHVHIHILPRRAGDFPRSNEQMAEEAVVYRNLM 1----------1111--------------------------------1111- >IGG ANTIBODY (LIGHT CHAIN; SWP:NA; PDB:1EMTH; QVHLQESGPELVRPGASVKISCKTSGYVFSSSWMNWVKQRPGQGLKWIGRIYPGNGNTNY ------------2222------------1111---------------------------- NEKFKGKATLTADKSSNTAYMQLSSLTSVDSAVYFCATSSAYWGQGTLLTVSAAKTTPPS 3333---------1111---------3333------------------------------ VYPLAPGNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVPS -----------------------------iiii------------%%%%---------33 SPRPSETVTCNVAHPASSTKVDKKIVPR 33-----------3333----------- >HETEROPODATOXIN 2; SWP:NA; PDB:1EMXA; DDCGKLFSGCDTNADCCEGYVCRLWCKLDW ----2222---1111--------------- >MYOGLOBIN; SWP:P02186; PDB:1EMY; GLSDGEWELVLKTWGKVEADIPGHGETVFVRLFTGHPETLEKFDKFKHLKTEGEMKASED 
-----------------------------------33331111--1111----------- LKKQGVTVLTALGGILKKKGHHEAEIQPLAQSHATKHKIPIKYLEFISDAIIHVLQSKHP ---------------1111--3333--------------3333----------------1 AEFGADAQGAMKKALELFRNDIAAKYKELGFQG 111-----------------------1111--- >AGGLUTININ ISOLECTIN I/AG; SWP:P11218; PDB:1EN2A; RCGSQGGGGTCPALWCCSIWGWCGDSEPYCGRTCENKCWSGERSDHRCGAAVGNPPCGQD ----------2222---1111----3333--------1111-1111--3333-----222 RCCSVHGWCGGGNDYCSGSKCQYRC 2--1111----3333--1111---- >ENTEROTOXIN H; SWP:P0A0M0; PDB:1ENFA; DLHDKSELTDLALANAYGQYNHPFIKENIKSDEISGEKDLIFRNQGDSGNDLRVKFATAD ---3333-----------1111------------------------iiii---------- LAQKFKNKNVDIYGASFYYKCEKISENISECLYGGTTLNSEKLAQERVIGANVWVDGIQK ----2222---------2222----------------3333-------------iiii-- ETELIRTNKKNVTLQELDIKIRKILSDKYKIYYKDSEISKGLIEFDMKTPRDYSFDIYDL -----------------------------1111----------------------1111- KGENDYEIDKIYEDNKTLKSDDISHIDVNLYT ---3333----1111---1111---------- >HIV-1 ENVELOPE PROTEIN CH; SWP:P03069; PDB:1ENVA; QIEDKIEEILSKIYHIENEIARIKKLIGEARQLLSGIVQQQNNLLRAIEAQQHLLQLTVW 3333-------------------------------------------------------- GIKQLQARILAVERYLKWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDK ------------------------------------------------------- >TRANSCRIPTION ELONGATION ; SWP:P07273; PDB:1ENWA; GSHMPRNSKNDGVDTAIYHHKLRDQVLKALYDVLAKESEHPPQSILHTAKAIESEMNKVN ---------------------------------------------------1111----- NCDTNEAAYKARYRIIYSNVISKNNPDLKHKIANGDITPEFLATCDAKDLAPAP ----3333---------------------3333-------------3333---- >TRANSCRIPTION ELONGATION ; SWP:P07273; PDB:1EO0A; MDSKEVLVHVKNLEKNKSNDAAVLEILHVLDKEFVPTEKLLRETKVGVEVNKFKKSTNVE -3333-----------------------3333-------------------------333 ISKLVKKMISSWKDAIN 3-----------3333- >HYPOTHETICAL PROTEIN MTH1; SWP:O27243; PDB:1EO1A; MKIAIASSGTDLGSEVSRFFGRAPYFMIVEMKKGNIESSEVIENPSASASGGAGIRTAQI ------------------------------------------------------------ IANNGVKAVIASSPGPNAFEVLNELGIKIYRATGTSVEENLKLFTEGNLEEIRSPGSGRG 
-1111-------------------------------3333-3333--------------- RRRR ---- >PROTOCATECHUATE 3,4-DIOXY; SWP:P20371; PDB:1EO2A; ELKETPSQTGGPYVHIGLLPKQANIEVFEHNLDNNLVQDNTQGQRIRLEGQVFDGLGLPL ----------1111-11111111--------------1111------------1111--- RDVLIEIWQADTNGVYPSQADTQGKQVDPN ----------1111---1111--------- >GOLGI-ASSOCIATED ATPASE E; SWP:P60520; PDB:1EO6A; MKWMFKEDHSLEHRCVESAKIRAKYPDRVPVIVEKVSGSQIVDIDKRKYLVPSDITVAQF ------------------------1111-------2222------------1111----- MWIIRKRIQLPSEKAIFLFVDKTVPQSSLTMGQLYEKEKDEDGFLYVAYSGENTFG ----------3333-----------1111----------1111------------- >Hemagglutinin [Precursor]; SWP:P03437; PDB:1EO8B; GLFGAIAGFIENGWEGMIDGWYGFRHQNSEGTGQAADLKSTQAAIDQINGKLNRVIEKTN 1111---------3333----------3333----------------------------- EKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWSYNAELLVALENQHTIDLTDSEMNKLFE ------------------------------------------------------------ KTRRQLRENAEEMGNGCFKIYHKCDNACIESIRNGTYDHDVYRDEALNNRFQIKG -----!!!!----------------------------3333-------------- >HEMAGGLUTININ (HA1 CHAIN); SWP:NA; PDB:1EO8H; QVQLQQSGAELMKPGPSVKISCKATGYSFSTYFIEWIRQRPGHGLEWIGEILPGSDNTNF ---------------------------1111----------------------------- NEKFKDRATFTADTPSNTAYMQLSSL 3333---------1111--------- >EG628498 protein; SWP:A0A5E0; PDB:1EO8L; QIILTQSPAIMSASPGEKVTMTCSASSDISYMHWYQQKSDTSPKIWIYDTSKLASGVPAR -------------2222------------------------------------2222333 FSGSGSGTSYSLTISTMEAEDAATYYCHQRSSYPTFGGGTKLEIKRADAAPTVSIFPPSK 3----------------3333--------------------------------------- IQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLT 3333-------------------------------------------------------- KDEYERHNSYTCEATHKTSTSPIVKSFNRN ---1111----------------------- >ENDO-BETA-N-ACETYLGLUCOSA; SWP:P36913; PDB:1EOKA; NGVCIAYYITDGRNPTFKLKDIPDKVDMVILFGLKYWSLQDTTKLPGGTGMMGSFKSYKD -------------33333333-------------3333--1111----!!!!-------- LDTQIRSLQSRGIKVLQNIDDDVSWQSSKPGGFASAAAYGDAIKSIVIDKWKLDGISLDI --------1111---------3333---2222---------------------------- 
EHSGAKPNPIPTFPGYAATGYNGWYSGSMAATPAFLNVISELTKYFGTTAPNNKQLQIAS --------------3333------1111-------------3333-1111---------- GIDVYAWNKIMENFRNNFNYIQLQSYGANVSRTQLMMNYATGTNKIPASKMVFGAYAEGG 1111---------3333-------2222------------------3333-----3333- TNQANDVEVAKWTPTQGAKGGMMIYTYNSNVSYANAVRDAVK -------------1111--------1111------------- >GAG POLYPROTEIN CAPSID PR; SWP:P03322; PDB:1EOQA; MDIMQGPSESFVDFANRLIKAVEGSDLPPSARAPVIIDCFRQKSQPDIQQLIRTAPSTLT ---------3333--------------3333-------------3333------------ TPGEIIKYVLDRQKTAP 3333------------- >EOTAXIN; SWP:P51671; PDB:1EOT; GPASVPTTCCFNLANRKIPLQRLESYRRITSGKCPQKAVIFKTKLAKDICADPKKKWVQD ------------------3333---------------------------------3333- SMKYLDQKSPTPK ------------- >ASPARTYL-TRNA SYNTHETASE; SWP:P04802; PDB:1EOVA; AKDNYGKLPLIQSRDSDRTGQKRVKFVDLDEAKDSDKEVLFRARVHNTRQQGATLAFLTL -------------3333-------3333-----2222----------------------- RQQASLIQGLVKANKEGTISKNMVKWAGSLNLESIVLVRGIVKKVDEPIKSATVQNLEIH -!!!!---------------------11112222--------------1111-------- ITKIYTISETPEALPILLEDASRSEAEAEAAGLPVVNLDTRLDYRVIDLRTVTNQAIFRI ----------------------------1111-----------33331111--------- QAGVCELFREYLATKKFTEVHTPKLLGAPSEGGSSVFEVTYFKGKAYLAQSPQFNKQQLI ----------3333--------------------------!!!!---------------- VADFERVYEIGPVFRAENSNTHRHMTEFTGLDMEMAFEEHYHEVLDTLSELFVFIFSELP --------------------1111------------------------------------ KRFAHEIELVRKQYPVEEFKLPKDGKMVRLTYKEGIEMLRAAGKEIGDFEDLSTENEKFL ---------------------3333---------------------2222---------- GKLVRDKYDTDFYILDKFPLEIRPFYTMPDPANPKYSNSYDFFMRGEEILSGAQRIHDHA ------------------3333-1111--3333----------%%%%------------- LLQERMKAHGLSPEDPGLKDYCDGFSYGCPPHAGGGIGLERVVMFYLDLKNIRRASLFPR ------1111-1111---------1111------------------------1111---- DPKRLRP 1111--- >DTDP-6-DEOXY-D-XYLO-4-HEX; SWP:O27818; PDB:1EP0A; EFRFIKTSLDGAIIIEPEVYTDERGYFMETFNEAIFQENGLEVRFVQDNESMSVRGVLRG --------2222---------1111-----------1111-------------2222--- LHFQREKPQGKLVRVIRGEIFDVAVDLRKNSDTYGEWTGVRLSDENRREFFIPEGFAHGF 
---------------------------2222-2222------3333------2222---- LALSDECIVNYKCTELYHPEYDSGIPWDDPDIGIDWPLEMVDDLIISEKDRNWKPLRENP -----------------3333----1111-------3333------3333----3333-- VYL --- >DIHYDROOROTATE DEHYDROGEN; SWP:P54322; PDB:1EP3A; MTENNRLSVKLPGLDLKNPIIPASGCFGFGEEYAKYYDLNKLGSIMVKATTLHPRFGNPT ----1111--2222--------2222------3333-3333------------------- PRVAETASGMLNAIGLQNPGLEVIMTEKLPWLNENFPELPIIANVAGSEEADYVAVCAKI -----------------------------------1111---------3333-------- GDAANVKAIELNISCPNVKHGGQAFGTDPEVAAALVKACKAVSKVPLYVKLSPNVTDIVP --1111----------1111---1111--------------------------------- IAKAVEAAGADGLTMINTLMGVRFDLKTRQPILANITGGLSGPAIKPVALKLIHQVAQDV --------------------------------1111-----3333----------1111- DIPIIGMGGVANAQDVLEMYMAGASAVAVGTANFADPFVCPKIIDKLPELMDQYRIESLE -------------------1111------3333--1111------------1111----- SLIQEVKEGKK ----------- >Dihydroorotate dehydrogen; SWP:P56968; PDB:1EP3B; SQLQEMMTVVSQREVAYNIFEMVLKGTLVDEMDLPGQFLHLAVPNGAMLLRRPISISSWD ---------------2222-------3333--------------1111------------ KRAKTCTILYRIGDETTGTYKLSKLESGAKVDVMGPLGNGFPVAEVTSTDKILIIGGGIG 3333---------1111----11112222---------------------------!!!! 
VPPLYELAKQLEKTGCQMTILLGFASENVKILENEFSNLKNVTLKIATDDGSYGTKGHVG 3333---------------------3333------------------1111------333 MLMNEIDFEVDALYTCGAPAMLKAVAKKYDQLERLYISMESRMACGIGACYACVEHDKED 3-3333-----------3333-------1111------------------------3333 ESHALKVCEDGPVFLGKQLSL -----3333------------ >Structural polyprotein; SWP:P05674; PDB:1EP5B; VMKLESDKTFPIMLEGKINGYACVVGGKLFRPMHVEGKIDNDVLAALKTKKASKYDLEYA -------------%%%%-------%%%%---1111-----3333-------3333----- DVPQNMRADTFKYTHEKPQGYYSWHHGAVQYENGRFTVPKGVGAKGDSGRPILDNQGRVV --3333-----------------1111----%%%%---2222----2222---1111--- AIVLGGVNEGSRTALSVVMWNEKGVTVKYTPENCEQW --------!!!!--------3333------2222--- >THIOREDOXIN CH1, H-TYPE; SWP:P80028; PDB:1EP7A; GGSVIVIDSKAAWDAQLAKGKEEHKPIVVDFTATWCGPCKMIAPLFETLSNDYAGKVIFL --------------------------------1111----------------2222---- KVDVDAVAAVAEAAGITAMPTFHVYKDGVKADDLVGASQDKLKALVAKHAAA --3333-------------------iiii------------------3333- >NEURAL CELL ADHESION MOLE; SWP:P13596; PDB:1EPFA; LQVDIVPSQGEISVGESKFFLCQVAGDAKDKDISWFSPNGEKLSPNQQRISVVWNDDDSS ------------2222--------------------1111---------------1111- TLTIYNANIDDAGIYKCVVTAEDGTQSEATVNVKIFQKLMFKNAPTPQEFKEGEDAVIVC -------1111---------------------------------------2222------ DVVSSLPPTIIWKHKGRDVILKKDVRFIVLSNNYLQIRGIKKTDEGTYRCEGRILARGEI -------------iiii--33333333--1111-------1111---------3333--- NFKDIQVIV --------- >PORCINE E-TRYPSIN; SWP:P00761; PDB:1EPTA; IVGGYTCAANSIPYQVSLNSGSHFCGGSLINSQWVVSAAHCYK -------22221111----------------------3333-- >S-SEC1; SWP:O62547; PDB:1EPUA; ALKTAVHEKINDVVLAVKKNAEWKVLIVDQLSRVSACCKHEISEGITLVEDINRRREPLP -------------3333-----------3333--1111--------------------11 LLEAVYLITPTEESVKCLADFQNPDNPQYRGAHIFFTEACPEELFKELCKSTTARFIKTL 11----------3333------------------------------------3333---- KEINIAFLPYESQIFSLDSPDTFQVYYNPSRAQGGIPNKERCAEQIATLCATLGEYPSVR ---------------------------3333----------------------------- YRSDFDENASFAQLVQQKLDAYRADDPTGEGPQKDRSQLLILDRGFDPISPLLHELTFQA 
-3333----------------3333-------1111------1111--1111-------- AYDLLPIENDVYKYEVLLDEKDDLWVERHQHIAVVSQNVTKKLKQFADEKRGIKDLSQLK -------%%%%-------1111-3333---3333-3333--------------------- KPQYQKELSKYSTHLHLAEDCKQYQQHVDKLCKVEQDLAGTDADGEKIRDHRNIVPILLD -----3333---3333-3333--------------------1111--------3333--1 QKISAYDKIRIILLYIIHKGGISEENLAKLVQHAHIPAEEKWIINDQNLGVPIIQDGGRR 111---------------------------------3333------1111---------- KIPQPYHTHNRKERQADHTYQSRWTPYKDIEAAVEDKLDTRHYPFLNGGGKSGPRLIIFV ---11111111---------------------1111--3333--1111------------ VGGISYSERSAYEVTQTAKNNWEVILGSTHILTPEGLLRDLRKISNP ----3333--------------------------------------- >BOTULINUM NEUROTOXIN TYPE; SWP:P10844; PDB:1EPWA; PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYTFGYKPEDFNK -------1111-----------1111------------2222-------22223333--- SSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKSKPLGEKLLEMIINGIPYLGD -------------1111------------------------------------------1 RRVPLEEFNTNIASVTVNKLISNPGEVERKKGIFANLIIFGPGPVLNENETIDIGIQNHF 1111111----1111-------2222------------------1111-------%%%%1 ASREGFGGIMQMKFCPEYVSVFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYG 111--------------------3333-%%%%---------------------------- IKVDDLPIVPNEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVD -------------1111------3333----!!!!1111--------------------- RLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKSLMFGFTETNI -------3333-------------1111---1111------------------------- AENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNISDKDMEKEYRGQNKAINKQAY -----------------------1111-------!!!!3333--2222-------3333- EEISKEHLAVYKIQMCKSVGICIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIEN ---3333------------------3333-----3333---1111----3333------- DFPINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQYLYSQ --3333---------------------------------------------------111 TFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGLFAGWVKQIVNDFVIE 1-1111--------------1111-----------1111--3333--------------- ANKSNTMDKIADISLIVPYIGLALNVGNETAKGNFENAFEIAGASILLEFIPELLIPVVG 
-------3333-----11113333----1111------------1111------------ AFLLESYIDNKNKIIKTIDNALTKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALN ------2222-------------------------------------------------- YQAQALEEIIKYRYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLM -----------------33331111----------------------------------- KKMIPLAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYT ------------------------------1111-----------1111-----3333-- NDTILIEMFNKYNSEILNNIILNLRYKDNNLIDLSGYGAKVEVYDGVELNDKNQFKLTSS ----------------1111-----------------------1111------------1 ANSKIRVTQNQNIIFNSVFLDFSVSFWIRIPKYKNDGIQNYIHNEYTIINCMKNNSGWKI 111------------------------------1111---------------%%%%---- SIRGNRIIWTLIDINGKTKSVFFEYNIREDISEYINRWFFVTITNNLNNAKIYINGKLES --!!!!------1111-----------------2222----------------iiii--- NTDIKDIREVIANGEIIFKLDGDIDRTQFIWMKYFSIFNTELSQSNIEERYKIQSYSEYL ---3333-----------------1111-------------------------------- KDFWGNPLMYNKEYYMFNAGNKNSYIKLKKDSPVGEILTRSKYNQNSKYINYRDLYIGEK -1111------------1111-------1111---------------------------- FIIRRKSNSQSINDDIVRKEDYIYLDFFNLNQEWRVYTYKYFKKEEEKLFLAPISDSDEF -----------------2222-------!!!!------------------------1111 YNTIQIKEYDEQPTYSCQLLFKKDEESTDEIGLIGIHRFYESGIVFEEYKDYFCISKWYL -----------------------3333--------------------------------- KEVKRKPYNLKLGCNWQFIPKDEGWTE 3333----1111---------1111-- >APOLIPOPHORIN-III; SWP:P13276; PDB:1EQ1A; DAPAGGNAFEEMEKHAKEFQKTFSEQFNSLVNSKNTQDFNKALKDGSDSVLQQLSAFSSS ----------------------3333-3333--------3333----------------- LQGAISDANGKAKEALEQARQNVEKTAEELRKAHPDVEKEANAFKDKLQAAVQTTVQESQ -----------3333--------------------3333---3333-------------- KLAKEVASNMEETNKKLAPKIKQAYDDFVKHAEEVQKKLHEAATKQ -----3333-------1111-------------------3333--- >ADP-L-GLYCERO-D-MANNOHEPT; SWP:P17963; PDB:1EQ2A; MIIVTGGAGFIGSNIVKALNDKGITDILVVDNLKDGTKFVNLVDLNIADYMDKEDFLIQI -----1111----------1111--------------33331111--------------1 MAGEEFGDVEAIFHEGASSTTEWDGKYMMDNNYQYSKELLHYCLEREIPFLYASSAATYG 
111---------------1111----------------------------------1111 GRTSDFIESREYEKPLNVYGYSKFLFDEYVRQILPEANSQIVGFRYFNVYGPREGHKGSM --------3333--------------------3333-----------------3333111 ASVAFHLNTQLNNKRDFVYVGDVADVNLWFLENGVSGIFNLGTGRAESFQAVADATYQAF 1--------3333----------------------------------------------- TQADLTNLRAAGYDKPFKTVAEGVTEYMAWLN --------1111-------------------- >OUTER MEMBRANE LIPOPROTEI; SWP:P02937; PDB:1EQ7A; SSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYR --------------------------------------------------1111-- >CHYMOTRYPSIN; SWP:NA; PDB:1EQ9A; IVGGKDAPVGKYPYQVSLRLSGSHRCGASILDNNNVLTAAHCVDLSN -------22221111----------------1111---3333----- >EXO-(B)-(1,3)-GLUCANASE; SWP:P29717; PDB:1EQCA; AWDYDNNVIRGVNLGGWFVLEPYMTPSLFEPFQNGNDQSGVPVDEYHWTQTLGKEAALRI --3333--------------333333333333-!!!!1111------------------- LQKHWSTWITEQDFKQISNLGLNFVRIPIGYWAFQLLDNDPYVQGQVQYLEKALGWARKN ---------3333----1111--------3333---2222----3333---------111 NIRVWIDLHGAPGSQNGFDNSGLRDSYNFQNGDNTQVTLNVLNTIFKKYGGNEYSDVVIG 1---------2222---1111------1111-------------------3333------ IELLNEPLGPVLNMDKLKQFFLDGYNSLRQTGSVTPVIIHDAFQVFGYWNNFLTVAEGQW -------3333-----------------1111--------%%%%22221111-3333--- NVVVDHHHYQVFSGGELSRNINDHISVACNWGWDAKKESHWNVAGEWSAALTDCAKWLNG ------------3333-----------------3333-----------------2222-2 VNRGARYEGAYDNAPYIGSCQPLLDISQWSDEHKTDTRRYIEAQLDAFEYTGGWVFWSWK 222-3333--%%%%-----3333-3333-------------------------------- TENAPEWSFQTLTYNGLFPQPVTDRQFPNQCGFH ---3333-----1111----1111---------- >RNA POLYMERASE II TRANSCR; SWP:P21675; PDB:1EQFA; GTTVHCDYLNRPHKSIHRRRTDPMVTLSSILESIINDMRDLPNTYPFHTPVNAKVVKDYY ------------------1111------------------22221111---33331111- KIITRPMDLQTLRENVRKRLYPSREEFREHLELIVKNSATYNGPKHSLTQISQSMLDLCD ------------------------------------------1111-------------- EKLKEKEDKLARLEKAINPLLDDDDQVAFSFILDNIVTQKMMAVPDSWPFHHPVNKKFVP ------------------3333-----------------333322221111---333311 DYYKVIVNPMDLETIRKNISKHKYQSRESFLDDVNLILANSVKYNGPESQYTKTAQEIVN 
11----------------1111-----------------------1111----------- VCYQTLTEYDEHLTQLEKDICTAKEAA --------------------------- >ORYZACYSTATIN-I; SWP:P09229; PDB:1EQKA; MSSDGGPVLGGVEPVGNENDLHLVDLARFAVTEHNKKANSLLEFEKLVSVKQQVVAGTLY ----------------3333---------------------------------------- YFTIEVKEGDAKKLYEAKVWEKPWMDFKELQEFKPVDASANA ------------------------------------------ ------------- >PHEROMONE ER-2; SWP:P26886; PDB:1ERD; DPMTCEQAMASCEHTMCGYCQGPLYMTCIGITTDPECGLP ------------33333333----------1111-2222- >TRANSCRIPTIONAL REPRESSOR; SWP:NA; PDB:1ERJA; HYLVPYNQRANHSKPIPPFLLDLDSQSVPDALKKQTNDYYILYNPALPREIDVELHKSLD ----1111--------1111--------1111---1111----1111------------- HTSVVCCVKFSNDGEYLATGCNKTTQVYRVSDGSLVARLSDSSDLYIRSVCFSPDGKFLA ----------1111--------------------------------------1111---- TGAEDRLIRIWDIENRKIVMILQGHEQDIYSLDYFPSGDKLVSGSGDRTVRIWDLRTGQC ------------1111------------------3333---------------------- SLTLSIEDGVTTVAVSPGDGKYIAAGSLDRAVRVWDSETGFLVERLDTGHKDSVYSVVFT -----------------------------------------------------------1 RDGQSVVSGSLDRSVKLWNLTCEVTYIGHKDFVLSVATTQNDEYILSGSKDRGVLFWDKK 111-----------------------------------2222------------------ SGNPLLMLQGHRNSVISVAVANGSSLGPEYNVFATGSGDCKARIWKYKKI -----------------------1111----------------------- >PHEROMONE ER-10; SWP:P12350; PDB:1ERP; DLCEQSALQCNEQGCHNFCSPEDKPGCLGMVWNPELCP --33331111---------3333------3333----- >PHEROMONE ER-11; SWP:P26887; PDB:1ERY; DECANAAAQCSITLCNLYCGPLIEICELTVMQNCEPPFS ------1111-3333------------------------ >DD-TRANSPEPTIDASE; SWP:P39042; PDB:1ES5A; KPTIAAVGGYAMNNGTGTTLYTKAADTRRSTGSTTKIMTAKVVLAQSNLNLDAKVTIQKA -----------------------1111----------------------1111----333 YSDYVVANNASQAHLIVGDKVTVRQLLYGLMLPSGCDAAYALADKYGSGSTRAARVKSFI 3--------------2222----------------------------------------- GKMNTAATNLGLHNTHFDSFDGIGNGANYSTPRDLTKIASSAMKNSTFRTVVKTKAYTAK -------1111-------------1111----------------------1111------ TVTKTGSIRTMDTWKNTNGLLSSYSGAIGVKTGAGPEAKYCLVFAATRGGKTVIGTVLAS 
--1111-----------3333--2222--------------------%%%%--------- TSIPARESDATKIMNYGFAL ----------------1111 >PLATELET-ACTIVATING FACTO; SWP:Q29460; PDB:1ES9A; ENPASKPTPVQDVQGDGKWMSLHHRFVADSKDKEPEVVFIGDSLVQLMHQCEIWRELFSP -3333------------------------------------33333333----3333333 LHALNFGIGGDSTQHVLWRLENGELEHIRPKIVVVWVGTNNHGHTAEQVTGGIKAIVQLV 3------22223333-------1111-----------1111------------------- NERQPQARVVVLGLLPRGQHPNPLREKNRRVNELVRAALAGHPRAHFLDADPGFVHSDGT ---3333------------------------------------------------1111- ISHHDMYDYLHLSRLGYTPVCRALHSLLLRLL -33331111----3333--------------- >ESTERASE; SWP:P22266; PDB:1ESC; DPVPTVFFGDSYTANFGIAPVTNQDSERGWCFQAKENYPAVATRSLADKGITLDVQADVS ---------1111-2222----1111-1111----------------------------- CGGALIHHFWEKQELPFGAGELPPQQDALKQDTQLTVGSLGGNTLGFNRILKQCSDELRK 22221111------2222-----3333--1111-------3333----------3333-- PSLLPGDPVDGDEPAAKCGEFFGTGDGKQWLDDQFERVGAELEELLDRIGYFAPDAKRVL ---------1111-1111----------------------------------1111---- VGYPRLVPEDTTKCLTAAPGQTQLPFADIPQDALPVLDQIQKRLNDAMKKAAADGGADFV ---------3333----2222----!!!!3333------------------3333----- DLYAGTGANTACDGADRGIGGLLEDSQLELLGTKIPWYAHPNDKGRDIQAKQVADKIEEI -3333----1111-------1111------------%%%%------------------11 LN 11 >CU, ZN SUPEROXIDE DISMUTA; SWP:P0AGD1; PDB:1ESO; ASEKVEMNLVTSQGVGQSIGSVTITETDKGLEFSPDLKALPPGEHGFHIHAKGSCQPATK ----------1111------------1111------------------------------ DGKASAAESAGGHLDPQNTGKHEGPEGAGHLGDLPALVVNNDGKATDAVIAPRLKSLDEI --------------1111-----1111--1111------1111-------1111------ KDKALMVHVGGDNMSDQPKPLGGGGERYACGVIK ------------------2222------------ >MONOCYTE CHEMOTACTIC PROT; SWP:P80075; PDB:1ESRA; PDSVSIPITCCFNVINRKIPIQRLESYTRITNIQCPKEAVIFKTQRGKEVCADPKERWVR 3333---------------3333--------3333--------3333-----3333---- DSMKHLDQIFQNLKP --------------- >AMYLOMALTASE; SWP:O87172; PDB:1ESWA; MELPRAFGLLLHPTSLPGPYGVGVLGREARDFLRFLKEAGGRYWQVLPLGPTGYGDSPYQ ------------1111--------------------1111------------22221111 
SFSAFAGNPYLIDLRPLAERGYVRLEDPGFPQGRVDYGLLYAWKWPALKEAFRGFKEKAS -------3333------------------------------------------------- PEEREAFAAFREREAWWLEDYALFMALKGAHGGLPWNRWPLPLRKREEKALREAKSALAE -------------3333-------------iiii1111----1111-------------- EVAFHAFTQWLFFRQWGALKAEAEALGIRIIGDMPIFVAEDSAEVWAHPEWFHLDEEGRP -----------------------1111--------------------3333---1111-- TVVAGVPPDYFSETGQRWGNPLYRWDVLEREGFSFWIRRLEKALELFHLVRIDHFRGFEA --------3333------------------------------------------------ YWEIPASCPTAVEGRWVKAPGEKLFQKIQEVFGEVPVLAEDLGVITPEVEALRDRFGLPG ----3333--1111---------------------------------------1111--- MKVLQFAFDDGMENPFLPHNYPAHGRVVVYTGTHDNDTTLGWYRTATPHEKAFMARYLAD --3333----1111--3333-1111-------1111------1111------------11 WGITFREEEEVPWALMHLGMKSVARLAVYPVQDVLALGSEARMNYPGRPSGNWAWRLLPG 11----3333-------------------3333----3333---2222---------222 ELSPEHGARLRAMAEATERL 2-------------1111-- >VPR PROTEIN; SWP:Q73369; PDB:1ESXA; MEQAPEDQGPQREPYNDWTLELLEELKNEAVRHFPRIWLHSLGQHIYETYGDTWTGVEAL -------------------------------------3333------------------- IRILQQLLFIHFRIGCRHSRIGIIQQRRTRNGASKS ------------------------%%%%-------- >SUPERANTIGEN SPE-H; SWP:P0C0I6; PDB:1ET9A; NSYNTTNRHNLESLYKHDSNLIEADSIKNSPDIVTSHMLKYSVKNLSVFFEKDWISQEFK ----------------3333-------------------------------11113333- DKEVDIYALSAQERYEAFGGITLTNSEKKEIKVPVNVWDKSKQQPPMFITVNKPKVTAQE ------------------------------------------------------------ VDIKVRKLLIKKYDIYNNREQKYSKGTVTLDLNSGKDIVFDLYYFGNGDFNSMLKIYSNN -------------1111--------------1111-------------3333-3333--- ERIDSTQFHVDVSIS ---1111-------- >FLT3 LIGAND; SWP:P49771; PDB:1ETEA; TQDCSFQHSPISSDFAVKIRELSDYLLQDYPVTVASNLQDDELCGGLWRLVLAQRWMERL -------------3333---------1111----------------------------33 KTVAGSKMQGLLERVNTEIHFVTKCAFQPPPSCLRFVQTNISRLLQETSEQLVALKPWIT 33--3333------------1111------3333-----------------------111 RQNFSRCLELQCQP 1--1111------- >TRIACYLGLYCEROL ACYL-HYDR; SWP:P00591; PDB:1ETHA; SEVCFPRLGCFSDDAPWAGIVQRPLKILPWSPKDVDTRFLLYTNQNQNNYQELVADPSTI 
------------------------------------------3333----------3333 TNSNFRMDRKTRFIIHGFIDKGEEDWLSNICKNLFKVESVNCICVDWKGGSRTGYTQASQ -----------------------3333-----3333----------3333---------- NIRIVGAEVAYFVEVLKSSLGYSPSNVHVIGHSLGSHAAGEAGRRTNGTIERITGLDPAE -------------------------------------------1111------------- PCFQGTPELVRLDPSDAKFVDVIHTDAAPIIPNLGFGMSQTVGHLDFFPNGGKQMPGCQK ------------3333-------------------------------2222---2222-- NILSQIVDIDGIWEGTRDFVACNHLRSYKYYADSILNPDGFAGFPCDSYNVFTANKCFPC 3333----------------------------3333------------3333-------- PSEGCPQMGHYADRFPGKTNGVSQVFYLNTGDASNFARWRYKVSVTLSGKKVTGHILVSL 3333----1111------------------------------------------------ FGNEGNSRQYEIYKGTLQPDNTHSDEFDSDVEVGDLQKVKFIWYNVINPTLPRVGASKIT -----------------2222--------------------------3333--------- VERNDGKVYDFCSQETVREEVLLTLNPC --3333---------------------- >FACTOR FOR INVERSION STIM; SWP:P11028; PDB:1ETKA; DVLTVKPLRDSVKQALKNYFAQLVNDLYELVLAEVEQPLLDMVMAYTRGNQTRAALMMGI ------3333---------1111-----------------------iiii---------- NRGTLRKKLKKYGMN ---------1111-- >FAB NC10.14 - LIGHT CHAIN; SWP:NA; PDB:1ETZH; QVTLKESGPGILQPSQTLSLTCSFSGFSLSTSGMGVGWIRQPSGEGLEWLADIWWNDKKY -----------------------------------------2222--------1111--- YNPSLKSRLTVSKDTSSNQVFLKITSVDTSDTATYHCARRTFSYYYGSSFYYFDNWGQGT -33331111-----1111---------3333-------------iiii------------ TLTVSSAKTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFP ------------------------------------------------------------ ALLQSGLYTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLEPSGP -------------------------------3333------------- >Putative uncharacterized ; SWP:Q0VDX6; PDB:1ETZL; FAVVTQESALTTSPGETVTLTCRSSTGAVTTSNYAIWVQEKPDHLFSGLIGGTNNRVPGV ------------2222-------1111--3333-----------------------2222 PARFSGSLIGDKAALTVTGAQTEDEAIYFCALWYSNHWVFGGGTKLTVLGQPKSSPSVTL 1111----!!!!--------1111------------------------------------ FTPSSEELETNKATLVCTITDFYPGVVTVDWKVDGTPVTQGMETTQPSKQSNNKYMASSY ---3333-------------------------iiii-------------1111------- 
LTLTARAWERHSSYSCQVTHEGHTVEKSLSRAECS ---3333---------------------------- >DIMETHYL SULFOXIDE REDUCT; SWP:Q57366; PDB:1EU1A; ANGEVMSGCHWGVFKARVENGRAVAFEPWDKDPAPSHQLPGVLDSIYSPTRIKYPMVRRE --------1111------iiii------1111---1111--------1111--------- FLEKGVNADRSTRGNGDFVRVTWDEALDLVARELKRVQESYGPTGTFGGSYGWKSPGRLH ---!!!!-3333-----------------------------3333--------------- NCQVLMRRALNLAGGFVNSSGDYSTAAAQIIMPHVMGTLEVYEQQTAWPVVVENTDLMVF ----------1111----------3333--3333-------------------------- WAADPMKTNEIGWVIPDHGAYAGMKALKEKGTRVIINPVRTETADYFGADVVSPRPQTDV ---3333-----------------------------------------------2222-- ALMLGMAHTLYSEDLHDKDFLENCTTGFDLFAAYLTGESDGTPKTAEWAAEICGLPAEQI ----------1111-------------------1111------------------3333- RELARSFVAGRTMLAAGWSIQRMHHGEQAHWMLVTLASMIGQIGLPGGGFGLSYHYSNGG ----3333---------3333--------------------2222-------1111-222 SPTSDGPALGGISDGGEGGATSIPCARVVDMLLNPGGEFQFNGATATYPDVKLAYWAGGN 2----------------------1111------2222---iiii---------------1 PFAHHQDRNRMLKAWEKLETFIVQDFQWTATARHADIVLPATTSYERNDIESVGDYSNRA 111-----------3333----------3333----------1111-------------- ILAMKKVVDPLYEARSDYDIFAALAERLGKGAEFTEGRDEMGWISSFYEAAVKQAEFKNV ----------!!!!-----------1111-3333iiii-----------------1111- AMPSFEDFWSEGIVEFPITEGANFVRYADFREDPLFNPLGTPSGLIEIYSKNIEKMGYDD ------------------------2222------------1111------3333---111 CPAHPTWMEPAERLGGAGAKYPLHVVASHPKSRLHSQLNGTSLRDLYAVAGHEPCLINPA 1-----------2222-----------------!!!!----3333---iiii-------- DAAARGIADGDVLRVFNDRGQILVGAKVSDAVMPGAIQIYEGGWYDPLDPSEEGTLDKYG --1111-2222---------------------2222------------1111-------- DVNVLSLDVGTSKLAQGNCGQTILADVEKYAGAPVTVTVFDTPKGA -3333-------------1111---------------1111-2222 >TREHALOSE/MALTOSE BINDING; SWP:O51923; PDB:1EU8A; IEEGKIVFAVGGAPNEIEYWKGVIAEFEKKYPGVTVELKRQATDTEQRRLDLVNALRGKS ------------3333--------------------------------------3333-- SDPDVFLMDVAWLGQFIASGWLEPLDDYVQKDNYDLSVFFQSVINLADKQGGKLYALPVY --------3333-----------------------1111----------iiii------- 
IDAGLLYYRKDLLEKYGYSKPPETWQELVEMAQKIQSGERETNPNFWGFVWQGKQYEGLV -------------1111-------------------------1111---------3333- CDFVEYVYSNGGSLGEFKDGKWVPTLNKPENVEALQFMVDLIHKYKISPPNTYTEMTEEP -----------------iiii---------------------------1111-------- VRLMFQQGNAAFERNWPYAWGLHNADDSPVKGKVGVAPLPHFPGHKSAATLGGWHIGISK ---------------3333-----1111-2222--------2222-------------11 YSDNKALAWEFVKFVESYSVQKGFAMNLGWNPGRVDVYDDPAVVSKSPHLKELRAVFENA 11---------------------------------3333-----------------1111 VPRPIVPYYPQLSEIIQKYVNSALAGKISPQEALDKAQKEAEELVKQ -----1111-------------------------------------- >CYTOCHROME B5; SWP:P04166; PDB:1EUEA; DPAVTYYRLEEVAKRNTAEETWMVIHGRVYDITRFLSEHPGGEEILLEQAGADATESFED -------3333-----3333----iiii---11111111---33331111---------- IGHSPDAREMLKQYYIGDVHPNDLKP ---------3333------3333--- >DUODENASE; SWP:P80219; PDB:1EUFA; IIGGHEAKPHSRPYMAFLLFKTS -------22221111-------- >NADP DEPENDENT NON PHOSPH; SWP:Q59931; PDB:1EUHA; TKQYKNYVNGEWKLSENEIKIYEPASGAELGSVPAMSTEEVDYVYASAKKAQPAWRALSY -------iiii------------------------------------------------- IERAAYLHKVADILMRDKEKIGAILSKEVAKGYKSAVSEVVRTAEIINYAAEEGLRMEGE ----------------------------------------------------1111---- VLEGGSFEAASKKKIAVVRREPVGLVLAISPFNYPVNLAGSKIAPALIAGNVIAFKPPTQ --3333-3333-------------------3333-1111-------1111---------- GSISGLLLAEAFAEAGLPAGVFNTITGRGSEIGDYIVEHQAVNFINFTGSTGIGERIGKM -----------------2222------3333--3333-3333----------------11 AGMRPIMLELGGKDSAIVLEDADLELTAKNIIAGAFGYSGQRCTAVKRVLVMESVADELV 11----------------1111--------------%%%%-----------3333----- EKIREKVLALTIGNPEDDADITPLIDTKSADYVEGLINDANDKGATALTEIKREGNLICP ------1111---3333-----------------------1111---------------- ILFDKVTTDMRLAWEEPFGPVLPIIRVTSVEEAIEISNKSEYGLQASIFTNDFPRAFGIA ------11113333---------------------------------------------1 EQLEVGTVHINNKTQRGTDNFPFLGAKKSGAGIQGVKYSIEAMTTVKSVVFDIK 111--------------1111-------------------1111---------- >FERRITIN 1; SWP:P23887; PDB:1EUMA; LKPEMIEKLNEQMNLELYSSLLYQQMSAWCSYHTFEGAAAFLRRHAQEEMTHMQRLFDYL 
------------------------------1111-------------------------- TDTGNLPRINTVESPFAEYSSLDELFQETYKHEQLITQKINELAHAAMTNQDYPTFNFLQ 1111-------------------------------------------------------- WYVSEQHEEEKLFKSIIDKLSLAGKSGEGLYFIDKELSTLD ---------------------------3333-----1111- >ULP1 PROTEASE; SWP:Q02724; PDB:1EUVA; GSLVPELNEKDDDQVQKALASRENTQLMNRDNIEITVRDFKTLAPRRWLNDTIIEFFMKY -----------------1111--------%%%%--3333----2222------------- IEKSTPNTVAFNSFFYTNLSERGYQGVRRWMKRKKTQIDKLDKIFTPINLNQSHWALGII -1111-----------------333311111111--1111---------%%%%------- DLKKKTIGYVDSLSNGPNAMSFAILTDLQKYVMEESKHTIGEDFDLIHLDCPQQPNGYDC -1111----------------------------1111---1111---------------- GIYVCMNTLYGSADAPLDFDYKDAIRMRRFIAHLILTDALK ----------1111-----------------------1111 >Ubiquitin-like protein SM; SWP:Q12306; PDB:1EUVB; PETHINLKVSDGSSEIFFKIKKTTPLRRLMEAFAKRQGKEMDSLRFLYDGIRIQADQTPE --------------------1111----------1111-1111----iiii--2222333 DLDMEDNDIIEAHREQIGG 3---2222----------- >DEOXYURIDINE 5'-TRIPHOSPH; SWP:P06968; PDB:1EUWA; MMKKIDVKILDPRVGKEFPLPTYATSGSAGLDLRACLNDAVELAPGDTTLVPTGLAIHIA ----------3333----------1111---------------2222------------- DPSLAAMMLPRSGLGHKHGIVLGNLVGLIDSDYQGQLMISVWNRGQDSFTIQPGERIAQM 1111-----------------1111----1111------------------2222----- IFVPVVQAEFNLVEDF ---------------- >GLUTAMATE DEHYDROGENASE; SWP:O74024; PDB:1EUZA; IDPFEMAVKQLERAAQYMDISEEALEWLKKPMRIVEVSVPIEMDDGSVKVFTGFRVQHNW -------------1111---------3333------------1111-------------1 ARGPTKGGIRWHPAETLSTVKALATWMTWKVAVVDLPYGGGKGGIIVNPKELSEREQERL 111--------1111----------------1111-------------1111-------- ARAYIRAVYDVIGPWTDIPAPDVYTNPKIMGWMMDEYETIMRRKGPAFGVITGKPLSIGG -------3333----------2222---------------%%%%-3333-----3333-- SLGRGTATAQGAIFTIREAAKALGIDLKGKKIAVQGYGNAGYYTAKLAKEQLGMTVVAVS ---1111-------------------2222------------------------------ DSRGGIYNPDGLDPDEVLKWKREHGSVKDFPGATNITNEELLELEVDVLAPAAIEEVITE -------1111---------------2222------33331111--------------33 
KNADNIKAKIVAEVANGPVTPEADDILREKGILQIPDFLCNAGGVTVSYFEWVQNINGYY 331111-----------------------------1111--------------------- WTEEEVREKLDKKMTKAFWEVYNTHKDKNIHMRDAAYVVAVSRVYQAMKDRGWVKK -------------------------1111-------------------1111---- >MINE; SWP:P18198; PDB:1EV0A; RSDAEPHYLPQLRKDILEVICKYVQIDPEMVTVQLEQKDGDISILELNVTLPEAEELK 1111---3333---------------3333---------------------------- >Fibroblast growth factor ; SWP:P21802; PDB:1EV2E; NKRAPYWTNTEKMEKRLHAVPAANTVKFRCPAGGNPMPTMRWLKNGKEFKQEHRIGGYKV --------3333--------2222-------------------%%%%--11112222--- RNQHWSLIMESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASDV 3333--------3333---------1111--------------------2222------- EFVCKVYSDAQPHIQWIKHVPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAGN -----------------------------11113333---------3333---------3 SIGISFHSAWLTVL 333----------- >GLUTATHIONE S-TRANSFERASE; SWP:P00502; PDB:1EV4A; SGKPVLHYFNARGRMECIRFLLAAAGVEFDEKFIQSPEDLEKLKKDGNLMFDQVPMVEID -----------!!!!---------------------------------1111------ii GMKLAQTRAILNYIATKYDLYGKDMKERALIDMYSEGILDLTEMIMQLVICPPDQKEAKT ii-------------------------------------------3333---1111---- ALAKDRTKNRYLPAFEKVLKSHGQDYLVGNKLTRVDIHLLELLLYVEEFDASLLTSFPLL ---------------------------%%%%-3333---------------1111----- KAFKSRISSLPNVKKFLQPGSQRKLPMDAKQIEEARKIYKF ---------------------------3333---------- >TYPE IIE RESTRICTION ENDO; SWP:P50187; PDB:1EV7A; EPDDDLERVRATLYSLDPDGDRTAGVLRDTLDQLYDGQRTGRWNFDQLHKTEKTHMGTLV ---3333----3333-1111----------------1111---3333-33331111---- EINLHREFQFGDGFETDYEIAGVQVDCKFSMSQGAWMLPPESIGHICLVIWASDQQCAWT ------------1111---%%%%--------2222---1111-----------1111--- AGLVKVIPQFLGTANRDLKRRLTPEGRAQVVKLWPDHGKLQENLLLHIPGDVRDQIFSAK ------1111----1111-----3333---------------3333--3333-------- SQHGQARVNELFRRVHGRLIGRAVIATVAQQDDFMKRVRGSGGARSILRPEGIIILGHQD ---------------------------------3333--------1111----------- KVANDLGLPVPRKGQVVAARVVPADEGDQRQTAEIQGRRWAVAVPGDPIVEAPVV 3333--------------------------------------------------- >MENA EVH1 
DOMAIN; SWP:Q03173; PDB:1EVHA; SEQSICQARAAVMVYDDANKKWVPAGGSTGFSRVHIYHHTGNNTFRVVGRKIQDHQVVIN ----------------1111---2222------------1111----------------- CAIPKGLKYNQATQTFHQWRDARQVYGLNFGSKEDANVFASAMMHALEVLN ---2222-------------------------------------------- >THREONYL-TRNA SYNTHETASE; SWP:P00955; PDB:1EVLA; RDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEVFVRSKLKEYQYQEVKGPFMMDR ---------------1111---------------------------------------33 VLWEKTGHWDNYKDAMFTTSSENREYCIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSC 33-3333----3333-----iiii---------------------3333----------- HRNEPSGSLHGLMRVRGFTQDDAHIFCTEEQIRDEVNGCIRLVYDMYSTFGFEKIVVKLS ----3333-------------------3333---------------3333---------- TRPEKRIGSDEMWDRAEADLAVALEENNIPFEYQLGEGAFYGPKIEFTLYDCLDRAWQCG ------------------------1111----------1111--------1111------ TVQLDFSLPSRLSASYVGEDNERKVPVMIHRAILGSMERFIGILTEEFAGFFPTWLAPVQ ---------1111----1111--------------------------iiii-3333---- VVIMNITDSQSEYVNELTQKLSNAGIRVKADLRNEKIGFKIREHTLRRVPYMLVCGDKEV ------3333----------------------------------1111------------ ESGKVAVRTRRGKDLGSMDVNEVIEKLQQEIRSRSLKQLEE --------1111-------------------------2222 >ONCOSTATIN M; SWP:P13725; PDB:1EVSA; GSCSKEYRVLLGQLQKQTDLMQDTSRLLDPYIRIQGLDVPKLREHCRERPGAFPSEETLR -------------------33331111-----1111--3333------2222-------- GLGRRGFLQTLNATLGCVLHRLADLEQRLPKAQDLERSGLNIEDLEKLQMARPNILGLRN ------------------------3333--3333-1111-3333---------------- NIYCMAQLLDNASDAFQRKLEGCRFLHGYHRFMHSVGRVFSKW ----3333-------------------------------1111 >GLYCEROL-3-PHOSPHATE DEHY; SWP:P90551; PDB:1EVYA; KDELLYLNKAVVFGSGAFGTALAMVLSKKCREVCVWHMNEEEVRLVNEKRENVLFLKGVQ -------------------------------------------------------2222- LASNITFTSDVEKAYNGAEIILFVIPTQFLRGFFEKSGGNLIAYAKEKQVPVLVCTKGIE -1111---------2222-------3333----------------1111----------- RSTLKFPAEIIGEFLPSPLLSVLAGPSFAIEVATGVFTCVSIASADINVARRLQRIMSTG ---------------3333--------------------------3333---------11 DRSFVCWATTDTVGCEVASAVKNVLAIGSGVANGLGMGLNARAALIMRGLLEIRDLTAAL 
11------------------------------1111---------------------111 GGDGSAVFGLAGLGDLQLTCSSELSRNFTVGKKLGKGLPIEEIQRAVAEGVATADPLMRL 1--1111--------------1111----------------------------------- AKQLKVKMPLCHQIYEIVYKKKNPRDALADLLSCGLQDEGLPPLFK ---------------------------------------------- >FIXL; SWP:P10955; PDB:1EW0A; GSHMLETEDVVRARDAHLRSILDTVPDATVVSATDGTIVSFNAAAVRQFGYAEEEVIGQN ------------1111-----1111-------1111---------------333322223 LRILMPEPYRHEHDGYLQRYMATGEKRIIGIDRVVSGQRKDGSTFPMKLAVGEMRSGGER 333-----3333---------------2222-------1111-------------iiii- FFTGFIRDLT ---------- >ALLERGEN EQU C 1; SWP:Q95182; PDB:1EW3A; VAIRNFDISKISGEWYSIFLASDVKEKIEENGSMRVFVDVIRALDNSSLYAEYQTKVNGE ------1111-------------3333-2222------------------------iiii CTEFPMVFDKTEEDGVYSLNYDGYNVFRISEFENDEHIILYLVNFDKDRPFQLFEFYARE ------------2222-------------------------------------------- PDVSPEIKEEFVKIVQKRGIVKENIIDLTKIDRCFQLRG --------------------3333--1111---3333-- >CYAY PROTEIN; SWP:P27838; PDB:1EW4A; MNDSEFHRLADQLWLTIEERLDDWDGDSDIDCEINGGVLTITFENGSKIIINRQEPLHQV --------------------1111----------iiii----1111--------1111-- WLATKQGGYHFDLKGDEWICDRSGETFWDLLEQAATQQAGETVSFR ---3333------%%%%----------------------------- >DEHALOPEROXIDASE; SWP:Q9NAV8; PDB:1EW6A; GFKQDIATIRGDLRTYAQDIFLAFLNKYPDERRYFKNYVGKSDQELKSMAKFGDHTEKVF ----------------------------------1111---------------------- NLMMEVADRATDCVPLASDANTLVQMKQHSSLTTGNFEKLFVALVEYMRASGQSFDSQSW ----------%%%%-----------3333---3333-------------------3333- DRFGKNLVSALSSAGMK -----------1111-- >BACTERICIDAL/PERMEABILITY; SWP:P17213; PDB:1EWFA; VNPGVVVRISQKGLDYASQQGTAALQKELKRIKIPDYSDSFKIKHLGKGHYSFYSMDIRE ----------------------------1111---------------------------- FQLPSSQISMVPNVGLKFSISNANIKISGKWKAQKRFLKMSGNFDLSIEGMSISADLKLG ----------2222--------------------!!!!---------------------- SNPTSGKPTITCSSCSSHINSVHVHISKSKVGWLIQLFHKKIESALRNKMNSQVCEKVTN ---------------------------1111----------------------------- SVSSELQPYFQTLPVMTKIDSVAGINYGLVAPPATTAETLDVQMKGEFYSENHHNPPPFA 
------------------------------------------------------------ PPVMEFPAAHDRMVYLGLSDYFFNTAGLVYQEAGVLKMTLRDDMIPKESKFRLTTKFFGT ----------------------------------------3333-3333----3333--- FLPEVAKKFPNMKIQIHVSASTPPHLSVQPTGLTFYPAVDVQAFAVLPNSALASLFLIGM ---3333---------------------3333--------------1111---------- HTTGSMEVSAESNRLVGELKLDRLLLELKHSNIGPFPVELLQDIMNYIVPILVLPRVNEK ----------%%%%---------------------------------------------- LQKGFPLPTPARVQLYNVVLQPHQNFLLFGADVVYK ---------2222---------2222---------- >REPLICATION PROTEIN A; SWP:P27694; PDB:1EWIA; MVGQLSEGAIAAIMQKGDTNIKPILQVINIRPITTGNSPPRYRLLMSDGLNTLSSFMLAT --------3333---%%%%----------------------------------------3 QLNPLVEEEQLSSNCVCQIHRFIVNTLKDGRRVVILMELEVLKSAEAVGVKIGN 333-----------------------iiii------------------------ >METABOTROPIC GLUTAMATE RE; SWP:P23385; PDB:1EWKA; RSVARMDGDVIIGALFSVHHQPPAEKVPERKCGEIREQYGIQRVEAMFHTLDKINADPVL ----------------------33331111------1111-------------1111--- LPNITLGSEIRDSCWHSSVALEQSIEFIRKPIAGVIGPGSSSVAIQVQNLLQLFDIPQIA ------------%%%%-------3333------------------------1111----- YSATSIDLSDKTLYKYFLRVVPSDTLQARAMLDIVKRYNWTYVSAVHTEGNYGESGMDAF ----3333-----1111------------------------------------------- KELAAQEGLCIAHSDKIYSNAGEKSFDRLLRKLRERLPKARVVVCFCEGMTVRGLLSAMR -----------------1111------------1111----------3333--------- RLGVVGEFSLIGSDGWADRDEVIEGYEVEANGGITIKLQSPEVRSFDDYFLKLRLDTNTR ------------3333--333322223333------------3333--3333-1111--- NPWFPEFWQHRFQCRLPGHLLENPNFKKVCTGNESLEENYVQDSKMGFVINAIYAMAHGL 1111-----------2222---------------1111----1111-------------- QNMHHALCPGHVGLCDAMKPIDGRKLLDFLIKSSFVGVSGEEVWFDEKGDAPGRYDIMNL -------2222---3333-----------1111---1111-----1111----------- QYTEANRYDYVHVGTWHEGVLNIDDYKI ----------------iiii---3333- >3-METHYL-ADENINE DNA GLYC; SWP:P29372; PDB:1EWNA; HLTRLGLEFFDQPAVPLARAFLGQVLVRRLPNGTELRGRIVETQAYLGPEDEAAHSRGGR -----3333---------3333-------1111--------------1111--1111--- QTPRNRGMFMKPGTLYVYIIYGMYFCMNISSQGDGACVLLRALEPLEGLETMRQLRSTVL 
-3333-11112222------------------2222-----------------1111--- KDRELCSGPSKLCQALAINKSFDQRDLAQDEAVWLERGPAVVAAARVGVGHAGEWARKPL 3333---3333--1111-3333---1111-----------------------3333---- RFYVRGSPWVSVVDRVAEQD ---2222------3333--- >DNA MISMATCH REPAIR PROTE; SWP:Q56215; PDB:1EWQA; EGLKGEGPGPLPPLLQQYVELRDQYPDYLLLFQVGDFYECFGEDAERLARALGLVLTHKT --------------------33331111-----!!!!----------------------- SKDFTTPAGIPLRAFEAYAERLLKGFRLAVADQVEPAEEAEGLVRREVTQLLTPGTLLQE ----------3333---------------------3333-------------1111--11 SLLPREANYLAAIATGDGWGLAFLDVSTGEFKGTVLKSKSALYDELFRHRPAEVLLAPEL 11--------------------------------------------1111-------333 LENGAFLDEFRKRFPVLSEAPFEPEGEGPLALRRARGALLAYAQRTQGGALSLQPFRFYD 3-------------------------------------------3333-----------3 PGAFRLPEATLRALEVFEPLRGQDTLFSVLDETRTAPGRRLLQSWLRHPLLDRGPLEARL 333---------------------3333-------------------------------- DRVEGFVREGALREGVRRLLYRLADLERLATRLELGRASPKDLGALRRSLQILPELRALL ------------------3333----------1111------------------------ GEEVGLPDLSPLKEELEAALVEDPPLKVSEGGLIREGYDPDLDALRAAHREGVAYFLELE --------------------------3333----2222---------------------- ERERERTGIPTLKVGYNAVFGYYLEVTRPYYERVPKEYRPVQTLKDRQRYTLPEKEKERE --------1111--------------3333----3333---------------------- VYRLEALIRRREEEVFLEVRERAKRQAEALREAARILAELDVYAALAEVAVRYGYVRPRF ------------------------------------------------------------ GDRLQIRAGRHPVVERRTEFVPNDLEAHELVLITGPNAGKSTFLRQTALIALLAQVGSFV ------------3333-------------------------------------------- PAEEAHLPLFDGIYTRIGAGKSTFVEEEVALILKEATENSLVLLDEVGRGTSSLDGVAIA ------------------------------------1111-------------------- TAVAEALHERRAYTLFATHYFELTALGLPRLKNLHVAAREEAGGLVFYHQVLPGPASKSY -------------------3333----1111----------------------------- GVEVAAAGLPKEVVARARALLQAAAR -------------------------- >RK-1 DEFENSIN; SWP:P81655; PDB:1EWSA; MPCSCKKYCDPWEVIDGSCGLFNSKYICCREK ---------1111------------------- >COAGULATION FACTOR XIII A; SWP:P00488; PDB:1EX0A; 
AFGGRRAVPPNNSNAAEDDLPTVEQEFLNVTSVHLFKERWDTNKVDHHTDKYENNKLIVR --3333------3333---------------------1111-------3333-------- RGQSFYVQIDFSRPYDPRRDLFRVEYVIGRYPQENKGTYIPVPIVSELQSGKWGAKIVMR ---------------1111-------------1111------------2222-------- EDRSVRLSIQSSPKCIVGKFRMYVAVWTPYGVLRTSRNPETDTYILFNPWCEDDAVYLDN !!!!-------1111------------1111------3333------1111--1111--- EKEREEYVLNDIGVIFYGEVNDIKTRSWSYGQFEDGILDTCLYVMDRAQMDLSGRGNPIK ------------------1111--------1111-----------1111-3333------ VSRVGSAMVNAKDDEGVLVGSFDNIYAYGVPPSAWTGSVDILLEYRSSENPVRYGQCWVF ------------------------------3333----3333-------------3333- AGVFNTFLRCLGIPARIVTNYFSAHDNDANLQMDIFLEEDGNVNSKLTKDSVWNYHCWNE --------------------------%%%%-------1111------------------- AWMTRPDLPVGFGGWQAVDSTPQENSDGMYRCGPASVQAIKHGHVCFQFDAPFVFAEVNS ----11112222------------1111---------------------3333------- DLIYITAKKDGTHVVENVDATHIGKLIVTKQIGGDGMMDITDTYKFQEGQEEERLALETA -------1111----------------------------1111---2222---------3 LMYGAKKPLNTSRSNVDMDFEVENAVLGKDFKLSITFRNNSHNRYTITAYLSANITFYTG 333----------------------2222---------------------------1111 VPKAEFKKETFDVTLEPLSFKKEAVLIQAGEYMGQLLEQASLHFFVTARINETRDVLAKQ ---------------------------33331111-2222-------------------- KSTVLTIPEIIIKVRGTQVVGSDMTVTVQFTNPLKETLRNVWVHLDGPGVTRPMKKMFRE ------------------2222------------------------2222---------- IRPNSTVQWEEVCRPWVSGHRKLIASMSSDSLRHVYGELDVQIQR -2222---------------------------------------- >PROTEIN MAF; SWP:Q02169; PDB:1EX2A; MTKPLILASQSPRRKELLDLLQLPYSIIVSEVEEKLNRNFSPEENVQWLAKQKAKAVADL ------------------------------------11113333------------3333 HPHAIVIGADTMVCLDGECLGKPQDQEEAASMLRRLSGRSHSVITAVSIQAENHSETFYD 1111----------iiii-----------------2222-----------1111------ KTEVAFWSLSEEEIWTYIETKEPMDKAGAYGIQGRGALFVKKIDGDYYSVMGLPISKTMR --------------------1111-2222-----3333---------------------- ALRHF 3333- >GUANYLATE KINASE; SWP:P15454; PDB:1EX7A; SRPIVISGPSGTGKSTLLKKLFAEYPDSFGFSVSSTTRTPRAGEVNGKDYNFVSVDEFKS 
--------2222------------1111------------22222222------------ MIKNNEFIEWAQFSGNYYGSTVASVKQVSKSGKTCILDIDMQGVKSVKAIPELNARFLFI -1111-------iiii---------------------------------3333------- APPSVEDLKKRLEGRGTETEESINKRLSAAQAELAYAETGAHDKVIVNDDLDKAYKELKD ------------3333-------------------------------------------- FIFAEK ------ >LACTONIZING LIPASE; SWP:P26876; PDB:1EX9A; STYTQTKYPIVLAHGMLGFDNILGVDYWFGIPSALRRDGAQVYVTEVSQLDTSEVRGEQL -1111----------------iiii--2222----------------------------- LQQVEEIVALSGQPKVNLIGHSHGGPTIRYVAAVRPDLIASATSVGAPHKGSDTADFLRQ ----------------------------------3333---------1111------333 IPPGSAGEAVLSGLVNSLGALISFLSSGSTGTQNSLGSLESLNSEGAARFNAKYPQGIPT 32222----------------------------3333-----------------2222-- SACGEGAYKVNGVSYYSWSGSSPLTNFLDPSDAFLGASSLTFKNGTANDGLVGTCSSHLG ---------iiii------------11113333---3333--%%%%------3333---- MVIRDNYRMNHLDEVNQVFGLTSLFETSPVSVYRQHANRLKNASL -----------------iiii------3333---------1111- >TRANSCRIPTION FACTOR 1; SWP:P04445; PDB:1EXEA; MNKTELIKAIAQDTGLTQVSVSKMLASFEKIITETVAKGDKVQLTGFLNIKPVARQARKG ------------------------------------------------------------ FNPQTQEALEIAPSVGVSVKPGESLKKAAEGLKYEDFAK ---------------------3333--3333-3333--- >EXO-1,4-BETA-D-GLYCANASE; SWP:P07986; PDB:1EXG; ASSGPAGCQVLWGVNQWNTGFTANVTVKNTSSAPVDGWTLTFSFPSGQQVTQAWSSTVTQ ------------------------------------------------------------ SGSAVTVRNAPWNGSIPAGGTAQFGFNGSHTGTNAAPTAFSLNGTPCTVG ----------1111------------------------------------ >DNAJ PROTEIN; SWP:P08622; PDB:1EXKA; GVTKEIRIPTLEECDVCHGSGAKPGTQPQTCPTCHGSGQVQMRQGFFAVQQTCPHCQGRG 3333----------1111-------------1111------------------1111--- TLIKDPCNKCHGHGRVERS -------1111-------- >5'-EXONUCLEASE; SWP:P06229; PDB:1EXNA; RNLIVDGTNLGFRFKHNNSKKPFASSYVSTIQSLAKSYSARTTIVLGDKGKSVFRLEHLP ----------------------3333--------------------------------11 EYKGNRDEKYAQRTEEEKALDEQFFEYLKDAFELCKTTFPTFTIRGVEADDAAYIVKLIG 11--------------------------------1111-----22223333-------33 
HLYDHVWLISTDGDWDTLLTDKVSRFSFTTRREYHLRDYEHHNVDDVEQFISLKAIGDLG 33-------------1111---------------33331111----------------11 DNIRGVEGIGAKRGYNIIREFGNVLDIIDQLPLPGKQKYIQNLNASEELLFRNLILVDLP 11---2222---------------------------3333------------------11 TYCVDAIAAVGQDVLDKFTKDILEIAE 11-----1111---------------- >POL POLYPROTEIN; SWP:P04585; PDB:1EXQA; SSPGIWQLDCTHLEGKVILVAVHVASGYIEAEVIPAETGQETAYFLLKLAGRWPVKTIHT 3333--------iiii--------------------------------1111-------- DNGSNFTGATVRAACDWAGIKQEDGIPYVESMNKELKKIIGQVRDQAEHLKTAVQMAVFI --1111------------------------------------1111--3333-------- HNKKRKGGIGGYSAGERIVDIIATDIQ --------------------------- >CALMODULIN; SWP:P07463; PDB:1EXRA; EQLTEEQIAEFKEAFALFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNG ------------------1111-------------1111-----------33331111-- TIDFPEFLSLMARKMKEQDSEEELIEAFKVFDRDGNGLISAAELRHVMTNLGEKLTDDEV -------------------------------1111------------------------- DEMIREADIDGDGHINYEEFVRMMVS -------------------------- >BETA-LACTOGLOBULIN; SWP:P04119; PDB:1EXSA; VEVTPIMTELDTQKVAGTWHTVAMAVSDVSLLDAKSSPLKAYVEGLKPTPEGDLEILLQK ----------1111----------------------1111--------1111-------- RENDKCAQEVLLAKKTDIPAVFKINALDENQLFLLDTDYDSHLLLCMENSASPEHSLVCQ -----------------2222----iiii-------------------33331111---- SLARTLEVDDQIREKFEDALKTLSVPMRILPAQLEEQCRV ------------------3333-------------2222- >TUMOR NECROSIS FACTOR REC; SWP:P19438; PDB:1EXTA; SVCPQGKYIHPQNNSICCTKCHKGTYLYNDCPGPGQDTDCRECESGSFTASENHLRHCLS ---2222--1111--------2222-------2222-------2222------------- CSKCRKEMGQVEISSCTVDRDTVCGCRKNQYRHYWSENLFQCFNCSLCLNGTVHLSCQEK ----3333------------------1111---------------------------111 QNTVCTCHAGFFLRENECVSCSNCKKSLECTKLCLPQIEN 1------2222--%%%%--1111-2222-3333------- >IGG RECEPTOR FCRN LARGE S; SWP:P55899; PDB:1EXUA; HLSLLYHLTAVSSPAPGTPAFWVSGWLGPQQYLSYNSLRGEAEPCGAWYWEKETTDLRIK --------------2222--------!!!!-------------------3333------- EKLFLEAFKALGGKGPYTLQGLLGCELGPDNTSVPTAKFALNGEEFMNFDLKQGTWGGDW 
-----3333-------------------------------iiii---------------- PEALAISQRWQQQDKAANKELTFLLFSCPHRLREHLERGRGNLEWKEPPSMRLKARPSSP ------------2222------------------------3333--------------22 GFSVLTCSAFSFYPPELQLRFLRNGLAAGTGQGDFGPNSDGSFHASSSLTVKSGDEHHYC 22--------------------iiii-----------1111----------22221111- CIVQHAGLAQPLRVEL ----1111-------- >STEM CELL FACTOR; SWP:P21583; PDB:1EXZA; CRNRVTNNVKDVTKLVANLPKDYMITLKYVPGMDVLPSHCWISEMVVQLSDSLTDLLDKF ---------------11111111------2222---3333--------------3333-- SNISEGLSNYSIIDKLVNIVDDLVECVKENSSKDLKKSFKSPEPRLFTPEEFFRIFNRSI -----------------------------------------------------------3 DAFKDFVVASDCV 333---------- >ANTITERMINATION FACTOR NU; SWP:P04381; PDB:1EY1A; MKPAARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATNTA -1111----3333-------------3333---3333-------3333----------33 YLDGLMKPYLSRLLEELGQVEKAVLRIALYELSKRSDVPYKVAINEAIELAKSFGAEDSH 33-3333-----3333-------------3333------3333---3333---------3 KFVNGVLDKAAPVIRPNKK 333---------------- >STAPHYLOCOCCAL NUCLEASE; SWP:P00644; PDB:1EY4A; KLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEAAAFTKKM -------------1111----%%%%-----2222------------2222---------- VENAKKIEVEFDKGQRTDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKPNNTHEQHL 1111-------------1111-------iiii---------------------1111--- RKSEAQAKKEKLNIWS ------------1111 >STAPHYLOCOCCAL NUCLEASE; SWP:P00644; PDB:1EYAA; LHKEPATLIKAIDGDTVKLMYKGQPMVFRLLLVDIPETKHPKKGVEKYGPEASAFTKKMV ------------1111----iiii---------------------2222----------1 ENAKKIEVEFDKGQRTDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKGNNTHEQLLR 111-------------1111-------iiii-----------------iiii1111---- KAEAQAKKEKLNIWS ----------!!!!- >HOMOGENTISATE 1,2-DIOXYGE; SWP:Q93099; PDB:1EYBA; AELKYISGFGNECSSEDPRCPGSLPEGQNNPQVCPYNLYAEQLSGSAFTCPRSTNKRSWL -------2222-----3333-------------2222---------11113333------ YRILPSVSHKPFESIDEGHVTHNWDEVDPDPNQLRWKPFEIPKASQKKVDFVSGLHTLCG ----3333--------!!!!--1111----------------3333---3333------- AGDIKSNNGLAIHIFLCNTSENRCFYNSDGDFLIVPQKGNLLIYTEFGKLVQPNEICVIQ 
--3333--------------------------------------1111---2222----- RGRFSIDVFEETRGYILEVYGVHFELPDLGPIGANGLANPRDFLIPIAWYEDRQVPGGYT ----------------------------!!!!------3333------------2222-- VINKYQGKLFAAKQDVSPFNVVAWHGNYTPYKYNLKNFVINSVAFDHADPSIFTVLTAKS ----iiii-------------------------3333-------------1111------ VRPGVAIADFVIFPPRWGVADKTFRPPYYHRNCSEFGLIRGFLPGGGSLHSTTPHGPDAD -2222-------------------------------------2222-------------- CFEKASKVKLAPERIADGTAFFESSLSLAVTKWGLKASRLKSHFTPNSRN ---------------2222-------------------------1111-- >DIHYDROPTEROATE SYNTHASE ; SWP:O06274; PDB:1EYEA; PVQVMGVLNVTDDSFSDGGCYLDLDDAVKHGLAMAAAGAGIVDVGGETSRVIPVVKELAA ------------1111------------------------------3333--------11 QGITVSIDTMRADVARAALQNGAQMVNDVSGGRADPAMGPLLAEADVPWVLMHWRAVSAD 11----------------1111-----11113333----------------------111 TPHVPVRYGNVVAEVRADLLASVADAVAAGVDPARLVLDPGLGFAKTAQHNWAILHALPE 1-------------------------1111-3333-----2222---------------- LVATGIPVLVGASRKRFLGALLAGPDGVMRPTDGRDTATAVISALAALHGAWGVRVHDVR -----------2222----11113333---3333------------1111---------- ASVDAIKVVEAWMGAE -----------1111- >EPSIN; SWP:O88339; PDB:1EYHA; HNYSEAEIKVREATSNDPWGPSSSLMSEIADLTYNVVAFSEIMSMIWKRLNDHGKNWRHV -----------1111--------------------------------3333-!!!!---- YKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQYVDRDGKDQGVNVREKAKQLVALL ------------------------3333---1111---1111------------------ RDEDRLREERAHALKTKEKLAQTA --3333-------------1111- >CHYMOTRYPSIN INHIBITOR; SWP:P10822; PDB:1EYLA; EFDDDLVDAEGNLVENGGTYYLLPHIWAHGGGIETAKTGNEPCPLTVVRSPNEVSKGEPI -------1111---2222-------1111--------!!!!----------1111----- RISSQFLSLFIPRGSLVALGFANPPSCAASPWWTVVDSPQGPAVKLSQQKLPEKDILVFK -----------1111---------3333---------1111----------3333----- FEKVSHSNIHVYKLLYCQHDEEDVKCDQYIGIHRDRNGNRRLVVTEENPLELVLLKAKS ----------------------------------1111--------------------- >CHALCONE-FLAVONONE ISOMER; SWP:P28012; PDB:1EYQA; SITAITVENLEYPAVVTSPVTGKSYFLGGAGERGLTIEGNFIKFTAIGVYLEDIAVASLA 
------iiii--------------------------iiii-------------------- AKWKGKSSEELLETLDFYRDIISGPFEKLIRGSKIRELSGPEYSRKVMENCVAHLKSVGT --222233331111-----------------------------------------1111- YGDAEAEAMQKFAEAFKPVNFPPGASVFYRQSPDGILGLSFSPDTSIPEKEAALIENKAV ---------------1111--2222------1111----------------------333 SSAVLETMIGEHAVSPDLKRCLAARLPALLNE 3--3333--1111---------------3333 >Reaction center protein L; SWP:P51762; PDB:1EYSC; CEGPPPGTEQIGYRGVGMENYYVKRQRALSIQANQPVESLPAADSTGPKASEVYQSVQVL ------------2222--------------1111--------------3333-------- KDLSVGEFTRTMVAVTTWVSPKEGCNYCHVPGNWASDDIYTKVVSRRMFELVRAANSDWK -----------------------3333--2222-----3333----------------33 AHVAETGVTCYTCHRGNPVPKYAWVTDPGPKYPSGLKPTGQNYGSKTVAYASLPFDPLTP 33!!!!--3333-%%%%-----------------------------3333-------111 FLDQANEIRITGNAALAGSNPASLKQAEWTFGLMMNISDSLGVGCTSCHNTRAFNDWTQS 1------------------------------------------1111--3333--3333- TPKRTTAWYAIRHVRDINQNYIWPLNDVLPASRKGPYGDPLRVSCMTCHQAVNKPLYGAQ ---------------------3333----3333-1111-----3333-%%%%--%%%%-- MAKDYPGLYK 33331111-- >N-UTILIZING SUBSTANCE PRO; SWP:P95020; PDB:1EYVA; GRHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAH ------------------------------------3333---3333------------- IDDLITAHLRGWTLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSP --------iiii3333------------------3333-----------------1111- GFVNGVLGQVM ----------- >R-PHYCOERYTHRIN; SWP:Q7SIG0; PDB:1EYXA; MKSVITTVISAADSAGRFPSSSDLESVQGNIQRASARLEAAEKLASNHEAVVKEAGDACF ------------1111---3333------------------------------------- GKYGYLKNPGEAGENQEKINKCYRDIDHYMRLVNYSLVIGGTGPLDEWGIAGAREVYRTL -------2222--------------------------------------2222------- NLPTSAYIAAFAFTRDRLCGPRDMSAQAGVEYSTALDYIINSLS --3333---------------------------------3333- >R-phycoerythrin beta chai; SWP:Q7SIF9; PDB:1EYXB; MLDAFSRVISNADAKAAYVGGSDLQALRTFISDGNKRLDAVNYIVSNSSCIVSDAISGMI --3333------1111----3333-----------------------------------1 CENPGLITPGGCYTNRRMAACLRDGEIILRYISYALLAGDSSVLEDRCLNGLKETYIALG 
1113333------3333-----------------------3333----2222-------- VPTNSTVRAVSIMKAAVGAFISNTASQRKGEVIEGDCSALAAEIASYCDRISAAVS -3333--------------------------------3333---------3333-- >ALDEHYDE DEHYDROGENASE; SWP:Q56694; PDB:1EZ0A; TDNVFYATNAFTGEALPLAFPVHTEVEVNQAATAAAKVARDFRRLNNSKRASLLRTIASE -----------------------------------------1111--------------- LEARSDDIIARAHLETALPEVRLTGEIARTANQLRLFADVVNSGSYHQAILDTPNPTRAP -------------------------------------------3333------------- LPKPDIRRQQIALGPVAVFGASNFPLAFSAAGGDTASALAAGCPVIVKGHTAHPGTSQIV --------------------1111----1111-----------------3333------- AECIEQALKQEQLPQAIFTLLQGNQRALGQALVSHPEIKAVGFTGSVGGGRALFNLAHER -------------3333-----------------1111-----------------1111- PEPIPFYGELGAINPTFIFPSAMRAKADLADQFVASMTMGCGQFCTKPGVVFALNTPETQ -------------------------1111---------%%%%-1111------------- AFIETAQSLIRQQSPSTLLTPGIRDSYQSQVVSRGSDDGIDVTFSQAESPCVASALFVTS ---------1111--------------------1111----------------------- SENWRKHPAWEEEIFGPQSLIVVCENVADMLSLSEMLAGSLTATIHATEEDYPQVSQLIP -------1111----------------------1111----------3333-3333---- RLEEIAGRLVFNGWPTGVEVGYAMVHGGPYPASTHSASTSVGAEAIHRWLRPVAYQALPE -----------------------------------------1111-1111--------33 SLLPDSLKAENPLEIARAVDGKAA 33-33331111-------iiii-- >SYNTAXIN-1A; SWP:P32851; PDB:1EZ3A; RDRFMDEFFEQVEEIRGFIDKIAENVEEVKRKHSAILASPNPDEKTKEELEELMSDIKKT ------------------------------------------------------------ ANKVRSKLKSIEQSIEQEEGLNRSSADLRIRKTQHSTLSRKFVEVMSEYNATQSDYRERC ------------------1111-------------------------------------- KGRI ---- >LACTATE DEHYDROGENASE; SWP:P56511; PDB:1EZ4A; SMPNHQKVVLVGDGAVGSSYAFAMAQQGIAEEFVIVDVVKDRTKGDALDLEDAQAFTAPK -------------3333---------------------3333----------3333---- KIYSGEYSDCKDADLVVITAGALVNKNLNILSSIVKPVVDSGFDGIFLVAANPVDILTYA -----3333--------------------------------------------------- TWKFSGFPKERVIGSGTSLDSSRLRVALGKQFNVDPRSVDAYIMGEHGDSEFAAYSTATI -------3333---!!!!----------------3333---------------3333--i 
GTRPVRDVAKEQGVSDDDLAKLEDGVRNKAYDIINLKGATFYGIGTALMRISKAILRDEN iii------1111-----------------------------------------1111-- AVLPVGAYMDGQYGLNDIYIGTPAIIGGTGLKQIIESPLSADELKKMQDSAATLKKVLND ----------2222------------1111------------------------------ GLAELEN ------- >CHOLESTERYL ESTER TRANSFE; SWP:P34929; PDB:1EZEA; APDVSSALDKLKEFGNTLEDKAWEVINRIKQSEFPAKT ---------3333-----------1111---------- >FARNESYL-DIPHOSPHATE FARN; SWP:P37268; PDB:1EZFA; NSLKTCYKYLNQTSRSFAAVIQALDGEMRNAVCIFYLVLRALDTLEDDMTISVEKKVPLL --------------------1111!!!!-------------------11113333----- HNFHSFLYQPDWRFMESKEKDRQVLEDFPTISLEFRNLAEKYQTVIADICRRMGIGMAEF --3333--1111--------33331111------11113333-----------------1 LDKHVTSEQEWDKYCHYVAGLVGIGLSRLFSASEFEDPLVGEDTERANSMGLFLQKTNII 111---3333--------------------------3333-------------------- RDYLEDQQGGREFWPQEVWSRYVKKLGDFAKPENIDLAVQCLNELITNALHHIPDVITYL ------1111----3333------3333--3333-------------3333--------1 SRLRNQSVFNFCAIPQVMAIATLAACYNNQQVFKGAVKIDATNMPAVKAIIYQYMEEIYH 111--------------------------3333-------------------------11 RIPDSDPSSSKTRQIISTIRTQN 111111------------1111- >THERMAL HYSTERESIS PROTEI; SWP:O16119; PDB:1EZGA; QCTGGADCTSCTGACTGCGNCPNAVTCTNSQHCVKANTCTGSTDCNTAQTCTNSKDCFEA -------1111---------1111--------1111--------------------1111 NTCTDSTNCYKATACTNSSGCP --------1111---------- >CMP-N-ACETYLNEURAMINIC AC; SWP:Q57385; PDB:1EZIA; EKQNIAVILARQNSKGLPLKNLRKNGISLLGHTINAAISSKCFDRIIVSTDGGLIAEEAK ------------------3333---------------3333------------------1 NFGVEVVLRPAASSISGVIHALETIGSNSGTVTLLQPTSPLRTGAHIREAFSLFDEKIKG 111--------------------------------1111-----------1111------ SVVSACPEHHPLKTLLQINEYAPRHLSDLEQPRQQLPQAFRPNGAIYINDTASLIANNCF ---------3333-----------3333-------------------------------- FIAPTKLYISHQDSIDIDTELDLQQAENILN ---------3333------------------ >NUCLEOCAPSID PHOSPHOPROTE; SWP:P14252; PDB:1EZJA; ENTSSMKEMATLLTSLGVIQSAQEFESSRDASYVFARRALKSANYAEMTFNVCGLILSAE --------------------3333-------------------3333------------- 
KSSARKVDENKQLLKQIQESVESFRDIYKRFSEYQKEQNSLLMSNLSTLHIITD --------------------------------------------1111------ >NUCLEOSIDE HYDROLASE; SWP:P83851; PDB:1EZRA; PRKIILDCDPGIDDAVAIFLAHGNPEIELLAITTVVGNQSLEKVTQNARLVADVAGIVGV -----------------------1111--------------------------------- PVAAGCTKPLVRGVRNASHIHGETGMGNVSYPPEFKTKLDGRHAVQLIIDLIMSHEPKTI -------------------------!!!!------------------------------- TLVPTGGLTNIAMAVRLEPRIVDRVKEVVLMGGGYHTGNASPVAEFNVFIDPEAAHIVFN -------------------3333------------------------3333--------- ESWNVTMVGLDLTHLALATPAVQKRVREVGTKPAAFMLQILDFYTKVYEKEHDTYGKVHD --------33333333--3333---3333------------------------------3 PCAVAYVIDPTVMTTERVPVDIELNGALTTGMTVADFRYPRPKNCRTQVAVKLDFDKFWC 333-----1111-------------1111------------------------------- LVIDALERIGDP ------------ >ECOTIN; SWP:P23827; PDB:1EZSA; PYPQAEKGMKRQVIQLTPQEDESTLKVELLIGQTLEVDCNLHRLGGKLENKTAYYVFDKV -----2222-----------1111------------------------------------ SSPVSTRMACPDGKKEKKFVTAYLGDAGMLRYNSKLPIVVYTPDNVDVKYRVWKAEEKID ------------------------3333----3333------1111-------------- NAVVR ----- >Ig heavy chain V region M; SWP:P18531; PDB:1EZVX; EVKLQESGAGLVQPSQSLSLTCSVTGYSITSGYYWNWIRLFPGNKLEWVGYISNVGDNNY ------------2222-----------1111---------1111--------3333---- NPSLKDRLSITRDTSKNQFFLKLNSVTTEDTATYYCARSEYYSVTGYAMDYWGQGTTVTV 3333---------1111---------3333------------------------------ SSAWRHP ------- >Ig kappa chain V-V region; SWP:P01647; PDB:1EZVY; DIELTQTPVSLAASLGDRVTISCRASQDINNFLNWYQQKPDGTIKLLIYYTSRLHAGVPS --------------------------------------1111------------222211 RFSGSGSGTDYSLTISNLEPEDIATYFCQHHIKFPWTFGAGTKLEIK 11--------------------------------------------- >COENZYME F420-DEPENDENT N; SWP:Q8TXY4; PDB:1EZWA; AEVSFGIELLPDDKPTKIAHLIKVAEDNGFEYAWICDHYNNYSYMGVLTLAAVITSKIKL -------------3333--------1111--------1111------------------- GPGITNPYTRHPLITASNIATLDWISGGRAIIGMGPGDKATFDKMGLPFPCKIPIWNPEA ----------3333---------------------------------------1111--- 
EDEVGPATAIREVKEVIYQYLEGGPVEYEGKYVKTGTADVKARSIQGSDIPFYMGAQGPI -------------------1111------1111------------!!!!----------- MLKTAGEIANGVLVNASNPKDFEVAVPKIEEGAKEAGRSLDEIDVAAYTCFSIDKDEDKA -----------------3333------------1111-1111------------------ IEATKIVVAFIVMGSPDVVLERHGIDTEKAEQIAEAIGKGDFGTAIGLVDEDMIEAFSIA -----------1111-----1111-----------------33333333----------- GDPDTVVDKIEELLKAGVTQVVVGSPIGPDKEKAIELVGQEVIPHFK -------------1111------------------------3333-- >Intimin; SWP:P19809; PDB:1F00I; ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGY ----------------------------!!!!---------------------------- AKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWL ---------------------------------------1111----------------2 QYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIA 222------------------3333-----------------------1111-------- TPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTII -----------------------1111-----3333---------11113333------- SWVQQTAQDAKSGVASTYDLVKQNPLNNIKASESNAYATCVK -----------------------------1111--------- >Tir; SWP:Q9KWH9; PDB:1F02T; MDQAANAAESATKDQLTQEAFKNPENQKVNIDANGNAIPSGELKDDIVEQIAQQAKEAGE --1111------------3333---------3333--------3333-------333311 VARQQA 111111 >TRANSALDOLASE; SWP:P37837; PDB:1F05A; MESALDQLKQFTTVVADTGDFHAIDEYKPQDATTNPSLILAAAQMPAYQELVEEAIAYGR -------------------33333333-----------------3333------------ KLGGSQEDQIKNAIDKLFVLFGAEILKKIPGRVSTEVDARLSFDKDAMVARARRLIELYK -------------------------1111--------1111------------------1 EAGISKDRILIKLSSTWEGIQAGKELEEQHGIHCNMTLLFSFAQAVACAEAGVTLISPFV 111-3333---------------------------------------------------- GRILDWHVANTDKKSYEPLEDPGVKSVTKIYNYYKKFSYKTIVMGASFRNTGEIKALAGC ----------------3333--------------1111---------------------- DFLTISPKLLGELLQDNAKLVPVLSAKAAQASDLEKIHLDEKSFRWLHNEDQMAVEKLSD ------------------------33331111---------------------------- GIRKFAADAVKLERMLTERMFN ---------------------- >MESO-DIAMINOPIMELATE D-DE; SWP:P04964; PDB:1F06A; 
MTNIRVAIVGYGNLGRSVEKLIAKQPDMDLVGIFSRRATLDTKTPVFDVADVDKHADDVD -------------------3333------------------------33331111----- VLFLCMGSATDIPEQAPKFAQFACTVDTYDNHRDIPRHRQVMNEAATAAGNVALVSTGWD ----------333333331111--------1111-------------------------- PGMFSINRVYAAAVLAEHQQHTFWGPGLSQGHSDALRRIPGVQKAVQYTLPSEDALEKAR --------------------------------------2222------------------ RGEAGDLTGKQTHKRQCFVVADAADHERIENDIRTMPDYFVGYEVEVNFIDEATFDSEHT -------1111----------3333--------------2222----------------- GMPHGGHVITTGDTGGFNHTVEYILKLDRNPDFTASSQIAFGRAAHRMKQQGQSGAFTVL ---------------------------------------------------------111 EVAPYLLSPENLDDLIARDV 1-3333-------------- >COENZYME F420-DEPENDENT N; SWP:Q50744; PDB:1F07A; MKFGIEFVPNEPIEKIVKLVKLAEDVGFEYAWITDHYNNKNVYETLALIAEGTETIKLGP -----------3333--------1111--------1111----------1111------- GVTNPYVRSPAITASAIATLDELSNGRATLGIGPGDKATFDALGIEWVKPVSTIRDAIAM ------------------------------------------------------------ MRTLLAGEKTESGAQLMGVKAVQEKIPIYMGAQGPMMLKTAGEISDGALINASNPKDFEA ---------1111----------------------------------------3333--- AVPLIKEGAEAAGKSIADIDVAAYTCCSIDEDAAAAANAAKIVVAFIAAGSPPPVFERHG ---------1111-3333-----------------------------11113333-1111 LPADTGKKFGELLGKGDFGGAIGAVDDALMEAFSVVGTPDEFIPKIEALGEMGVTQYVAG -1111----------------1111------------3333--------1111------- SPIGPDKEKSIKLLGEVIASF -----------------1111 >REPLICATION PROTEIN E1; SWP:P03116; PDB:1F08A; GSRATVFKLGLFKSLFLCSFHDITRLFKNDKTTNQQWVLAVFGLAEVFFEASFELLKKQC ------------------3333------1111----------------------3333-- SFLQMQKRSHEGGTCAVYLICFNTAKSRETVRNLMANMLNVREECLMLQPPKIRGLSAAL ---------3333----------------------------3333--------------- FWFKSSLSPATLKHGALPEWIRAQTTLN -------3333------33333333--- >PHOSPHODIESTERASE 4B; SWP:Q07343; PDB:1F0JA; SISRFGVNTENEDHLAKELEDLNKWGLNIFNVAGYSHNRPLTCIMYAIFQERDLLKTFRI 3333--------------1111-1111--------%%%%--------------------- SSDTFITYMMTLEDHYHSDVAYHNSLHAADVAQSTHVLLSTPALDAVFTDLEILAAIFAA ------------11111111--------------------3333---------------- 
AIHDVDHPGVSNQFLINTNSELALMYNDESVLENHHLAVGFKLLQEEHCDIFMNLTKKQR ---------------1111------%%%%--------------------1111------- QTLRKMVIDMVLATDMSKHMSLLADLKTMVETKKVTSSGVLLLDNYTDRIQVLRNMVHCA --------------1111-----------------1111-----3333------------ DLSNPTKSLELYRQWTDRIMEEFFQQGDKERERGMEISPMCDKHTASVEKSQVGFIDYIV --1111-3333-------------------1111---2222------------------- HPLWETWADLVQPDAQDILDTLEDNRNWYQSMIPQAPANRDCQGLMEKFQF -----------------------------1111-------------1111- >UDP-N-ACETYLGLUCOSAMINE-N; SWP:P17443; PDB:1F0KA; KRLMVMAGGTGGHVFPGLAVAHHLMAQGWQVRWLGTADRMEADLVPKHGIEIDFIRISGL ---------3333----------------------11113333--1111----------i RGKGIKALIAAPLRIFNAWRQARAIMKAYKPDVVLGMGGYVSGPGGLAAWSLGIPVVLHE iii33331111---------------------------3333-------1111------- QNGIAGLTNKWLAKIATKVMQAFPGAFPNAEVVGNPVRTDVLALPLPQQRLAGREGPVRV -------------------------------------3333---------2222------ LVVGGSQGARILNQTMPQVAAKLGDSVTIWHQSGKGSQQSVEQAYAEAGQPQHKVTEFID ----11113333----------!!!!-------2222--------11113333------- DMAAAYAWADVVVCRSGALTVSEIAAAGLPALFVPFQHKDRQQYWNALPLEKAGAAKIIE -------------------------------------1111------------------3 QPQLSVDAVANTLAGWSRETLLTMAERARAASIPDATERVANEVSRVARAL 333-3333----1111------------1111--------------1111- >DIPHTHERIA TOXIN; SWP:P00587; PDB:1F0LA; GADDVVDSSKSFVMENFSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWKGFYSTDNKY 1111--3333-------------2222-3333---------iiii-3333---------- DAAGYSVDNENPLSGKAGGVVKVTYPGLTKVLALKVDNAETIKKELGLSLTEPLMEQVGT -------1111--------------------------------------------3333- EEFIKRFGDGASRVVLSLPFAEGSSSVEYINNWEQAKALSVELEINFETRGKRGQDAMYE -------iiii---------2222-------33333333------3333---!!!!---- YMAQACACINLDWDVIRDKTKTKIESLKEHGPIKNKMSESPNKTVSEEKAKQYLEEFHQT -3333------------------------------------------------------- ALEHPELSELKTVTGTNPVFAGANYAAWAVNVAQVIDSETADNLEKTTAALSILPGIGSV ----1111----33333333----------------3333----------1111--3333 MGIADGAVHHNTEEIVAQSIALSSLMVAQAIPLVGELIGFAAYNFVESIINLFQVVHNSY 
---iiii-------------------------------3333----------------11 NRPAYSPGHKTQPFLHDGYAVSWNTVEDSIIRTGFQGESGHDIKITAENTPLPIAGVLLP 11-------------iiii-----3333---1111------------------------- TIPGKLDVNKSKTHISVNGRKIRMRCRAIDGDVTFCRPKSPVYVGNGVHANLHVAFHRSS ----------------iiii------------------------2222------------ SEKIHSNEISSDSIGVLGYQKTVDHTKVNSKLSLFFEIKS ----1111--------------%%%%-------------- >ANTIGEN 85B; SWP:P31952; PDB:1F0NA; SRPGLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLDGLRAQDDYNGWDINTPAFEWY --------------1111----------2222----------------3333-------2 YQSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVKPTGSA 222---------2222----------1111------------------------------ AIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWG --------------------------------1111-----------------3333--- PSSDPAWERNDPTQQIPKLVANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSSNLKFQ 1111-------3333--------------------3333--------------------- DAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQSSLGAG --------------------------------------1111-- >D-LACTATE DEHYDROGENASE; SWP:P06149; PDB:1F0XA; NKAFLNELARLVGSSHLLTDPAKTARYRKGFRSGQGDALAVVFPGSLLELWRVLKACVTA ------------1111---33333333------------------------------111 DKIILMQAANTGLTEGSTPNGNDYDRDVVIISTLRLDKLHVLGKGEQVLAYPGTTLYSLE 1-------------------------------1111-----!!!!-----1111------ KALKPLGREPHSVIGSSCIGASVIGGICNNSGGSLVQRGPAYTEMSLFARINEDGKLTLV --3333-------1111-----------------1111-------------1111----- NHLGIDLGETPEQILSKLDDDRIKDDDVRHDGRHAHDYDYVHRVRDIEADTPARYNADPD ---------3333----------3333---------2222-----1111-----111111 RLFESSGCAGKLAVFAVRLDTFEAEKNQQVFYIGTNQPEVLTEIRRHILANFENLPVAGE 11--2222----------------------------3333-------------------- YMHRDIYDIAELPPRMKNWRDKYEHHLLLKMAGDGVGEAKSWLVDYFKQAEGDFFVCTPE ------------3333---------------!!!!------------------------- EGSKAFLHRFAAAGAAIRYQAVHSDEVEDILALDIALRRNDTEWYEHLPPEIDSQLVHKL -------------------------------------1111-------33331111---- YYGHFMCYVFHQDYIVKKGVDVHALKEQMLELLQQRGAQYPAEHNVGHLYKAPETLQKFY 
----1111--------2222-------------1111----------------------- RENDPTNSMNPGIGKTSKRKNW ---1111----2222------- >L-3-HYDROXYACYL-COA DEHYD; SWP:Q16836; PDB:1F0YA; KIIVKHVTVIGGGLMGAGIAQVAAATGHTVVLVDQTEDILAKSKKGIEESLRKVAKKKFA ------------3333-------1111---------------------------111111 ENPKAGDEFVEKTLSTIATSTDAASVVHSTDLVVEAIVENLKVKNELFKRLDKFAAEHTI 113333---------------3333------------------------3333--3333- FASNTSSLQITSIANATTRQDRFAGLHFFNPVPVMKLVEVIKTPMTSQKTFESLVDFSKA --------3333------3333--------3333--------1111-------------- LGKHPVSCKDTPGFIVNRLLVPYLMEAIRLYERGDASKEDIDTAMKLGAGYPMGPFELLD ----------22223333------------------------------------------ YVGLDTTKFIVDGWHEMDAENPLHQPSPSLNKLVAENKFGKKTGEGFYKYK -----------------11111111--------1111---1111------- >F124 IMMUNOGLOBULIN (KAPP; SWP:NA; PDB:1F11B; EVQLQQSGPELVKPGASVKMSCKASGYTFTDYYMKWVKQSHGKSLEWIGDINPNNGGTGY ------------2222-----------3333----------------------------- NQKFKGKATLTVDKSSSTAYMQLNSL 3333---------1111--------- >COAT PROTEIN; SWP:P14767; PDB:1F15A; ERCRPGYTFTSITLKPPKIDRGSYYGKRLLLPDSVTEYDKKLVSRLQIRVNPLPKFDSTV ---2222------------2222--------3333--1111-----------2222---- WVTVRKVPASSDLSVAAISAMFADGASPVLVYQYAASGVQANNKLLYDLSAMRADIGDMR ---------------------1111----------2222------------------333 KYAVLVYSKDDALETDELVLHVDIEHQRIPTSGVLPV 3------------------------------------ >CYTOCHROME C549; SWP:P82603; PDB:1F1CA; LTEELRTFPINAQGDTAVLSLKEIKKGQQVFNAACAQCHALGVTRTNPDVNLSPEALALA -3333-----1111------------------------2222-1111------------- TPPRDNIAALVDYIKNPTTYDGFVEISELHPSLKSSDIFPKMRNISEDDLYNVAGYILLQ ------------------1111---------111111111111---------------33 PKVRGEQWG 33-!!!!-- >HISTONE FOLD PROTEIN; SWP:NA; PDB:1F1EA; ELPKAAIERIFRQGIGERRLSQDAKDTIYDFVPTAEYVANAAKSVLDASGKKTLEEHLKA --------------!!!!---------------------------3333----------- LADVLVEGVEDYDGELFGRATVRRILKRAGIERASSDAVDLYNKLICRATEELGEKAAEY --------1111--------------1111------------------------------ ADEDGRKTVQGEDVEKAITYSPKGGEL -1111----3333--------%%%%-- >CYTOCHROME C6; 
SWP:P00118; PDB:1F1FA; DVAAGASVFSANCAACHMGGRNVIVANKTLSKSDLAKYLKGFDDDAVAAVAYQVTNGKNA -------------1111%%%%3333-------------2222---------------!!! MPGFNGRLSPLQIEDVAAYVVDQAEKGW !--------------------------- >COPPER-ZINC SUPEROXIDE DI; SWP:P00445; PDB:1F1GA; VQAVAVLKGDAGVSGVVKFEQASESEPTTVSYEIAGNSPNAERGFHIHEFGDATNGCVSA ----------------------1111----------------------------!!!!-- GPHFNPFKKTHGAPTDEVRHVGDMGNVKTDENGVAKGSFKDSLIKLIGPTSVVGRSVVIH ----1111----1111---1111------------------------11112222----- AGQDDLGKGDTEESLKTGNAGPRPACGVIGLTN ----iiii--3333------------------- >CASPASE-7 PROTEASE; SWP:P55210; PDB:1F1JA; YQYNMNFEKLGKCIIINNKNFDKVTGMGVRNGTDKDAEALFKCFRSLGFDVIVYNDCSCA ---------------------3333----2222--------------------------- KMQDLLKKASEEDHTNAACFACILLSHGEENVIYGKDGVTPIKDLTAHFRGDRSKTLLEK --------1111-1111-----------2222--1111--3333-333311111111--- PKLFFIQACRGTELDDGIQKIPVEADFLFAYSTVPGYYSWRSPGRGSWFVQALCSILEEH ---------------------1111--------2222---------3333---------1 GKDLEIMQILTRVNDRVARHFESQSDDPHFHEKKQIPCVVSMLTKELYFS 111-----------------------3333-------------------- >OUTER SURFACE PROTEIN C; SWP:Q9AGB1; PDB:1F1MA; PNLTEISKKITESNAVVLAVKEVETLLTSIDELAKAIGKKIKSDVSLDNEADHNGSLMSG ---------------------------------1111----------------------- AYLISTLITKKISAIKDSGELKAEIEKAKKCSEEFTAKLKGEHTDLGKEGVTDDNAKKAI -----------1111---1111-------------------3333--------------- LKTNNDKTKGADELEKLFESVKNLSKAAKEMLTNSVKELTSP 1111-------------------------------------- >HYALURONATE LYASE; SWP:Q53591; PDB:1F1SA; SEHPQPVTTQIEKSVNTALNKNYVFNKADYQYTLTNPSLGKIVGGILYPNATGSTTVKIS -----------------1111--------------3333---!!!!-------------- DKSGKIIKEVPLSVTASTEDNFTKLLDKWNDVTIGNYVYDTNDSNMQKLNQKLDETNAKN 1111------------------------------3333-1111----------------- IEAIKLDSNRTFLWKDLDNLNNSAQLTATYRRLEDLAKQITNPHSTIYKNEKAIRTVKES ------1111---1111------------------------1111-2222---------- LAWLHQNFYNVNKDIEGSANWWDFEIGVPRSITGTLSLMNNYFTDAEIKTYTDPIEHFVP ---------1111--11113333---------------1111----------3333---- 
DAEYFRKTLVNPFKALGGNLVDMGRVKIIEGLLRKDNTIIEKTSHSLKNLFTTATKAEGF 1111-1111--------------------------------------3333--------- YADGSYIDHTNVAYTGAYGNVLIDGLTQLLPIIQETDYKISNQELDMVYKWINQSFLPLI 1111------------------------33331111-------------------3333- VKGELMDMSRGRSISREAASSHAAAVEVLRGFLRLANMSNEERNLDLKSTIKTIITSNKF iiii-3333------33333333-----------1111--3333----------1111-- YNVFNNLKSYSDIANMNKLLNDSTVATKPLKSNLSTFNSMDRLAYYNAKKDFGFALSLHS -1111----------------1111----------------------1111--------3 KRTLNYEGMNDENTRGWYTGDGMFYIYNSDQSHYSNHFWPTVNPYKMAGTTEKDAKREDT 333-----%%%%---1111----------1111-%%%%----11112222---------- TKEFMSKHSKDAKEKTGQVTGTSDFVGSVKLNDHFALAAMDFTNWDRTLTAQKGWVILND 33333333-----1111--------------------------1111----------!!! KIVFLGSNIKNTNGIGNVSTTIDQRKDDSKTPYTTYVNGKTIDLKQASSQQFTDTKSVFL !--------------------------3333-----iiii-------------------- ESKEPGRNIGYIFFKNSTIDIERKEQTGTWNSINRTSKNTSIVSNPFITISQKHDNKGDS ----------------------------3333-3333----------------------- YGYMMVPNIDRTSFDKLANSKEVELLENSSKQQVIYDKNSQTWAVIKHDNQESLINNQFK ----------------------------3333-----1111-------------%%%%-- MNKAGLYLVQKVGNDYQNVYYQPQTMTKTDQLAI ----------------------1111-------- >HOMOPROTOCATECHUATE 2,3-D; SWP:Q44048; PDB:1F1UA; TNFVPTPSVPAPDIVRCAYMEIVVTDLAKSREFYVDVLGLHVTEEDENTIYLRSLEEFIH ---------------------------------------------1111----1111--- HNLVLRQGPIAAVAAFAYRVKSPAEVDAAEAYYKELGCRTERRKEGFTKGIGDSVRVEDP ---------------------3333-----------------1111-2222-------11 LGFPYEFFYETEHVERLTQRYDLYSAGELVRLDHFNQVTPDVPRGRAYLEDLGFRVSEDI 11--------------11111111------------------------------------ KDSDGVTYAAWMHRKQTVHDTALTGGNGPRMHHVAFATHEKHNIIQICDKMGALRISDRI -1111----------------------------------3333--------11111111- ERGPGRHGVSNAFYLYILDPDGHRIEIYTQDYYTGDPDNPTITWDVHDNQRRDWWGNPVV -------2222-------1111-------------1111-----111133331111---3 PSWYTEASLVLDLDGNPQPVIV 333--------1111------- >HOMOPROTOCATECHUATE 2,3-D; SWP:Q45135; PDB:1F1XA; EIPKPVAPAPDILRCAYAELVVTDLAKSRNFYVDVLGLHVSYEDENQIYLRSFEEFIHHN 
---------------------------------------------------1111----- LVLTKGPVAALKAMAFRVRTPEDVDKAEAYYQELGCRTERRKDGFVKGIGDALRVEDPLG -------------------3333-----------------1111-2222-------1111 FPYEFFFETTHVERLHMRYDLYSAGELVRLDHFNQVTPDVPRGRKYLEDLGFRVTEDIQD --------------111111111111---------------------------------1 DEGTTYAAWMHRKGTVHDTALTGGNGPRLHHVAFSTHEKHNIIQICDKMGALRISDRIER 111----------------------------------3333--------11113333--- GPGRHGVSNAFYLYILDPDNHRIEIYTQDYYTGDPDNPTITWNVHDNQRRDWWGNPVVPS -----2222-------1111-------------1111-----111133331111---333 WYTEASKVLDLDGNVQEII 3--------1111------ >TNSA ENDONUCLEASE; SWP:P13988; PDB:1F1ZA; FSEVQIARRIKEGRGQGHGKDYIPWLTVQEVPSSGRSHRIYSHKTGRVHHLLSDLELAVF ------------2222-!!!!-----3333------------1111-------------- LSLEWESSVLDIREQFPLLPSDTRQIAIDSGIKHPVIRGVDQVMSTDFLVDCKDGPFEQF -----3333---------------------------iiii-------------------- AIQVKPAAALQDERTLEKLELERRYWQQKQIPWFIFTDKEINPVVKENIEWLYSVKTEEV -----3333-----------------1111------1111-------------------- SAELLAQLSPLAHILQEKGDENIINVCKQVDIAYDLELGKTLSEIRALTANGFIKFNIYK ------------------------------------2222----------------1111 SFRANKCADLCISQVVNMEE 3333-3333----------- >NITRIC-OXIDE SYNTHASE; SWP:P29476; PDB:1F20A; SWKRNKFRLTYVAEAPDLTQGLSNVHKKRVSAARLLSRQNLQSPKSSRSTIFVRLHTNGN ------------------------------------------1111----------%%%% QELQYQPGDHLGVFPGNHEDLVNALIERLEDAPPANHVVKVEMLEERNTALGVISNWKDE 1111-2222----------------1111------------------------------- SRLPPCTIFQAFKYYLDITTPPTPLQLQQFASLATNEKEKQRLLVLSKGLQEYEEWKWGK ----------------------------3333---------------------------- NPTMVEVLEEFPSIQMPATLLLTQLSLLQPRYYSISSSPDMYPDEVHLTVAIVSYHTRDG ----------1111--3333-----------------33332222--------------- EGPVHHGVCSSWLNRIQADDVVPCFVRGAPSFHLPRNPQVPCILVGPGTGIAPFRSFWQQ -----------1111-2222--------1111----1111------!!!!---------- RQFDIQHKGMNPCPMVLVFGCRQSKIDHIYREETLQAKNKGVFRELYTAYSREPDRPKKY ----------------------1111-2222------1111------------------3 
VQDVLQEQLAESVYRALKEQGGHIYVCGDVTMAADVLKAIQRIMTQQGKLSEEDAGVFIS 333------------------------------------------1111----------- RLRDDNRYHEDIFGV --1111--------- >THYMIDYLATE SYNTHASE; SWP:P13100; PDB:1F28A; NAEEQQYLNLVQYIINHGEDRPDRTGTGTLSVFAPSPLKFSLRNKTFPLLTTKRVFIRGV 3333------------------1111----------------%%%%-------------- IEELLWFIRGETDSLKLREKNIHIWDANGSREYLDSIGLTKRQEGDLGPIYGFQWRHFGA -----------------1111-11111111----111111112222----------2222 EYIDCKTNYIGQGVDQLANIIQKIRTSPYDRRLILSAWNPADLEKMALPPCHMFCQFYVH --------2222--------------1111--------33331111-------------- IPSNNHRPELSCQLYQRSCDMGLGVPFNIASYALLTCMIAHVCDLDPGDFIHVMGDCHIY ----------------------------------------1111---------------1 KDHIEALQQQLTRSPRPFPTLSLNRSITDIEDFTLDDFNIQNYHPYETIKMKMSI 11111113333-----------------1111-3333------------------ >1-AMINOCYCLOPROPANE-1-CAR; SWP:Q7M523; PDB:1F2DA; AGVAKFAKYPLTFGPSPISNLNRLSQHLGSKVNVYAKREDCNSGLAFGGNKLRKLEYIVP -3333----------------------%%%%------1111-------3333-3333-33 DIVEGDYTHLVSIGGRQSNQTRMVAALAAKLGKKCVLIQEDWVPIPEAEKDVYNRVGNIE 33----------------3333-----------------------3333--1111----- LSRIMGADVRVIEDGFDIGMRKSFANALQELEDAGHKPYPIPAGCSEHKYGGLGFVGFAD -------------------------------1111------2222--1111--------- EVINQEVELGIKFDKIVVCCVTGSTTAGILAGMAQYGRQDDVIAIDASFTSEKTKEQTLR -------------------------------3333--3333------------------- IANNTAKLIGVEHEFKDFTLDTRFAYPCYGVPNEGTIEAIRTCAEQEGVLTDPVYEGKSM ----3333-------------1111--2222-3333------------------3333-- QGLIALIKEDYFKPGANVLYVHLGGAPALSAYSSFFPTKTA ------------2222--------333311113333----- >GLUTATHIONE S-TRANSFERASE; SWP:O33705; PDB:1F2EA; MKLFISPGACSLAPHIALRETGADFEAVKVDLAVRKTEAGEDFLTVNPSGKVPALTLDSG -----2222--------------------------------3333-1111------1111 ETLTENPAILLYIADQNPASGLAPAEGSLDRYRLLSRLSFLGSEFHKAFVPLFAPATSDE ----------------3333----2222--------------------3333-------- AKAAAAESVKNHLAALDKELAGRDHYAGNAFSVADIYLYVMLGWPAYVGIDMAAYPALGA ------------------3333---------------------3333---3333------ YAGKIAQRPAVGAALKAEGLA 
--------------------- >Early growth response pro; SWP:P08046; PDB:1F2IG; NLLNYVVPKMRPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSRSDHLTT 1111------------1111---------------------------------------- HIRTHT -1111- >FRUCTOSE-BISPHOSPHATE ALD; SWP:P07752; PDB:1F2JA; SKRVEVLLTQLPAYNRLKTPYEAELIETAKKMTAPGKGLLAADESTGSCSKRFAGIGLSN ------11111111----1111-----------2222-------3333----1111---- TAEHRRQYRALMLECEGFEQYISGVILHDETVYQKAKTGETFPQYLRRRGVVPGIKTDCG -----------1111------------3333----1111--------------------- LEPLVEGAKGEQMTAGLDGYIKRAKKYYAMGCRFCKWRNVYKIQNGTVSEAVVRFNAETL -------2222-----2222-------1111------------iiii------------- ARYAILSQLCGLVPIVEPEVMIDGTHDIETCQRVSQHVWSEVVSALHRHGVVWEGCLLKP -------1111----------------------------------------3333----- NMVVPGAESGLKGHAEQVAEYTVKTLARVIPPALPGVTFLSGGLSEVMASEYLNAMNNCP -----1111---------------------3333------2222---------------- LPRPWKLTFSYARALQSSAIKRWGGKESGVEAGRRAFMHRAKMNSLAQLGKYNRADD -----------1111-------iiii1111----------------1111--3333- >PROFILIN II; SWP:P19984; PDB:1F2KA; SWQTYVDTNLVGTGAVTQAAIIGHDGNTWATSAGFAVSPANGAALANAFKDATAIRSNGF ----------3333--------1111-----2222----------3333----------- ELAGTRYVTIRADDRSVYGKKGSAGVITVKTSKAILIGVYNEKIQPGTAANVVEKLADYL --------------------!!!!------1111------33333333------------ IGQGF ----- >FRACTALKINE; SWP:P78423; PDB:1F2LA; VTKCNITCSKMTSKIPVALLIHYQQNQASCGKRAIILETRQHRLFCADPKEQWVKDAMQH ---------------3333-------1111--------1111-----3333--------- LDRQ 1111 >CAPSID PROTEIN; SWP:Q86527; PDB:1F2NA; LSSNTWPLHSVEFLADFKRSSTSADATTYDCVPFNLPRVWSLARCYSMWKPTRWDVVYLP -------------------------------3333-3333-3333--------------- EVSATVAGSIEMCFLYDYADTIPRYTGKMSRTAGFVTSSVWYGAEGCHLLSGGSARNAVV ----------------3333-----------2222---111133333333---------- ASMDCSRVGWKRVTSSIPSSVDPNVVNTILPARLAVRSSIKPTVSDTPGKLYVIASMVLR ----2222---------111133333333------------------------------- DPVDPTLNT ---3333-- >HIGH AFFINITY IMMUNOGLOBU; SWP:P12319; PDB:1F2QA; KPKVSLNPPWNRIFKGENVTLTCNGNNFSTKWFHNGSLSEETNSSLNIVNAKFEDSGEYK 
-------------2222------1111------iiii--------------3333----- CQHQQVNESEPVYLEVFSDWLLLQASAEVVMEGQPLFLRCHGWRNWDVYKVIYYKDGEAL ------------------------------2222-------2222--------------- KYWYENHNISITNATVEDSGTYYCTGKVWQLDYESEPLNITVIKAPR --------------3333----------------------------- >DNA fragmentation factor ; SWP:O54786; PDB:1F2RI; MELSRGASAPDPDDVRPLKPCLLRRNHSRDQHGVAASSLEELRSKACELLAIDKSLTPIT -------------------------------------3333-----------1111---- LVLAEDGTIVDDDDYFLCLPSNTKFVALACNEKWTYNDSD -----------------------------------%%%%- >RAD50 ABC-ATPASE; SWP:P58301; PDB:1F2TA; MKLERVTVKNFRSHSDTVVEFKEGINLIIGQNGSGKSSLLDAILVGLYWPLRIKDIKKDE ----------!!!!----------------2222-------------------------- FTKVGARDTYIDLIFEKDGTKYRITRRFLKGEIHAMKRLVGNEWKHVTEPSSKAISAFME --2222----------iiii-------------------!!!!----------------- KLIPYNIFLNAIYIRQGQIDAILES --------------2222--1111- >DNA double-strand break r; SWP:P58301; PDB:1F2TB; AREAALSKIGELASEIFAEFTEGKYSEVVVRAEENKVRLFVVWEGKERPLTFLSGGERIA ------------------1111----------%%%%------iiii--3333-------- LGLAFRLAMSLYLAGEISLLILDEPTPYLDEERRRKLITIMERYLKKIPQVILVSHDEEL --------------------------2222-------------3333---------1111 KDAADHVIRISLENGSSKVEVVS 1111--------iiii------- >PRECORRIN-8X METHYLMUTASE; SWP:P21638; PDB:1F2VA; PEYDYIRDGNAIYERSFAIIRAEADLSRFSEEEADLAVRMVHACGSVEATRQFVFSPDFV -------------------------11113333------------33331111--1111- SSARAALKAGAPILCDAEMVAHGVTRARLPAGNEVICTLRDPRTPALAAEIGNTRSAAAL --------------------11113333---------1111------------3333-33 KLWSERLAGSVVAIGNAPTALFFLLEMLRDGAPKPAAILGMPVGFVGAAESKDALAENSY 33-1111---------------------------------------------------ii GVPFAIVRGRLGGSAMTAAALNSLARPGL ii--------------------------- >ANTIBODY HEAVY CHAIN; SWP:NA; PDB:1F2XK; QVQLVESGGGSVQAGGSLRLSCAASGYTVSTYCMGWFRQAPGKEREGVATILGGSTYYGD ------------2222-----------1111--------2222--------!!!!---11 SVKGRFTISQDNAKNTVYLQMNSLKPEDTAIYYCAGSTVASTGWCSRLRPYDYHYRGQGT 11---------1111---------3333------------333333331111-------- QVTVSS ------ >MAJOR 
PEPSIN INHIBITOR PI; SWP:P19400; PDB:1F32A; FLFSMSTGPFICTVKDNQVFVANLPWTMLEGDDIQVGKEFAARVEDCTNVKHDMAPTCTK --------------iiii--%%%%-----!!!!---------------------3333-- PPPFCGPQDMKMFNFVGCSVLGNKLFIDQKYVRDLTAKDHAEVQTFREKIAAFEEQSPPP -1111-1111----2222--------%%%%------------------------------ PPSFCTV -3333-- >PEPSIN A; SWP:P00791; PDB:1F34A; IGDEPLENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCSSLACSDHNQFNPDD --------%%%%-----------------------------1111-3333------3333 SSTFEATQELSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGIL 1111---------1111------------%%%%---------------3333-------- GLAYPSISASGATPVFDNLWDQGLVSQDLFSVYLSSNDDSGSVVLLGGIDSSYYTGSLNW ---33332222--3333--1111------------%%%%----------3333------- VPVSVEGYWQITLDSITMDGETIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENS -----------------iiii------------3333-----3333----1111------ DGEMVISCSSIDSLPDIVFTINGVQYPLSPSAYILQDDDSCTSGFEGMDVPTSSGELWIL ------3333----------iiii----1111-------------------1111----- GDVFIRQYYTVFDRANNKVGLAPVA ---3333------------------ >OLFACTORY MARKER PROTEIN; SWP:Q64288; PDB:1F35A; AEDGPQKQQLEPLVLDQDLTQQRLRVESLKQRGEKKQDGEKLIRPAESVYRLDFIQQQKL ------------------------------------2222---1111------------- QFDHWNVVLDKPGKVTITGTSQNWTPDLTNLTRQLLDPAAIFWRKEDSDADWNEADALEF --------------------11113333---1111------------------------- GERLSDLAKIRKVYFLITFGEGVEPANLKASVVFNQL ----3333-----------22223333---------- >REPRESSOR PROTEIN CI; SWP:P03034; PDB:1F39A; ASASAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLI -1111-------1111-2222----2222----------2222-----1111-------- RDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG -iiii------1111-----3333-----------3333-- >GLUTATHIONE S-TRANSFERASE; SWP:P13745; PDB:1F3AA; AGKPVLHYFNARGRMECIRWLLAAAGVEFEEKFIQSPEDLEKLKKDGNLMFDQVPMVEID -----------!!!!---------------------------------1111------ii GMKLAQTRAILNYIATKYDLYGKDMKERALIDMYSEGILDLTEMIGQLVLCPPDQREAKT ii-------------------------------------------3333--1111----- ALAKDRTKNRYLPAFEKVLKSHGQDYLVGNRLTRVDIHLLEVLLYVEEFDASLLTPFPLL 
---------------------------%%%%-3333-----------------1111--- KAFKSRISSLPNVKKFLQPGSQRKPPMDAKQIQEARKAFKI -----------------2222-------------------- >CATALYTIC ANTIBODY 4B2; SWP:NA; PDB:1F3DH; EIQLQQSGPELVKPGASVKVSCKASGYSFIDYNIHWVKQSHGKSLEWIGYIVPYSGGTTF ------------2222-----------3333--------%%%%----------------- NQKFKGKATLTVDKSSSTAFMHLNSL 3333---------1111--------- >SURVIVIN; SWP:O15392; PDB:1F3HA; TLPPAWQPFLKDHRISTFKNWPFLEGCACTPERMAEAGFIHCPTENEPDMAQCFFCFKEL --1111333333333333---------------3333------1111------------- EGWEPDDDPIEEHKKHSSGCAFLSVKKQFEELTLGEFLKLDRERAKNKIAKETNNKKKEF ---1111------------3333-----1111---------------------------- EETAKKVRRAIEQLAA ---------------- >PROTEIN ARGININE METHYLTR; SWP:O70467; PDB:1F3LA; DLQEDEDGVYFSSYGHYGIHEEMLKDKVRTESYRDFIYQNPHIFKDKVVLDVGCGTGILS ----------3333------------3333---------33332222------!!!!--- MFAAKAGAKKVIAVDQSEILYQAMDIIRLNKLEDTIVLIKGKIEEVSLPVEKVDVIISEW ---------------------------11111111------1111--------------- MGYFLLFESMLDSVLYAKSKYLAKGGSVYPDICTISLVAVSDVSKHADRIAFWDDVYGFN -222222223333---------2222-------------------------3333iiii- MSCMKKAVIPEAVVEVVDHKTLISDPCDIKHIDCHTTSISDLEFSSDFTLRTTKTAMCTA -------3333------3333----------------3333------------------- VAGYFDIYFEKNCHNRVVFSTGPQSTKTHWKQTIFLLEKPFPVKAGEALKGKITVHKNKK ---------2222--------1111--1111------------2222----------111 DPRSLIVTLTLNSSTQTYSLQ 1---------%%%%------- >TRANSCRIPTION INITIATION ; SWP:P13984; PDB:1F3UA; AERGELDLTGAKQNTGVWLVKVPKYLSQQWAKASGRGEVGKLRIAKTQGRTEVSFTLNED -2222-----1111---------------1111!!!!---------------------33 LANIHDIGGKPASVSAPREHPFVLQSVGGQTLTVFTESSSDKLSLEGIVVQRAECRPA 33-----------------------------------1111----------------- >Transcription initiation ; SWP:P35269; PDB:1F3UB; GPSSQNVTEYVVRVPKNTTKKYNIMAFNAADKVNFATWNQARLERDLSNKKIYQEEEMRK ---------------------------3333--3333---------1111---------- LREEARRKKYGIVLKEFRPEDQPWLLRVNGKSGRKFKGIKKGGVTENTSYYIFTQCPDGA 1111-------------3333--------1111----------------------1111- FEAFPVHNWYNFTPLARHR 
-------------3333-- >TUMOR NECROSIS FACTOR REC; SWP:Q15628; PDB:1F3VA; HEEWVGSAYLFVESSLDKVVLSDAYAHPQQKVAVYRALQAALAESGGSPDVLQMLKIHRS --------------------------1111-----------------1111--------- DPQLIVQLRFCGRQPCGRFLRAYREGALRAALQRSLAAALAQHSVPLQLELRAGAERLDA ----------------------1111--------------------------!!!!3333 LLADEERCLSCILAQQPDRLRDEELAELEDALRNLKCG 3333------------------------------3333 >DIADENOSINE 5',5'''-P1,P4; SWP:O04841; PDB:1F3YA; GPLGSMDSPPEGYRRNVGICLMNNDKKIFAASRLDIPDAWQMPQGGIDEGEDPRNAAIRE ----------------------1111---------1111--------------------- LREETGVTSAEVIAEVPYWLTYDFPPKVREKLNIQWGSDWKGQAQKWFLFKFTGQDQEIN ------------------------3333-----1111-----------------3333-- LLGDGSEKPEFGEWSWVTPEQLIDLTVEFKKPVYKEVLSVFAPHL --------------------------3333-----------1111 >GLUCOSE-SPECIFIC PHOSPHOC; SWP:P08837; PDB:1F3Z; TIEIIAPLSGEIVNIEDVPDVVFAEKIVGDGIAIKPTGNKMVAPVDGTIGKIFETNHAFS -------------3333--3333--3333-----------------------3333---- IESDSGVELFVHFGIDTVELKGEGFKRIAEEGQRVKVGDTVIEFDLPLLEEKAKSTLTPV --1111---------3333iiii------2222--2222--------------------- VISNMDEIKELIKLSGSVTVGETPVIRIKK ---3333-----------2222-------- >INTERLEUKIN-12 BETA CHAIN; SWP:P29460; PDB:1F42A; IWELKKDVYVVELDWYPDAPGEMVVLTCDTPEEDGITWTLDQSSEVLGSGKTLTIQVKEF ----2222-------------------------------!!!!---------------33 GDAGQYTCHKGGEVLSHSLLLLHKKEDGIWSTDILKDQKEPKNKTFLRCEAKNYSGRFTC 33-------iiii------------iiii------------------------------- WWLTTISTDLTFSVKSSRGGVTCGAATLSAERVRGDNKEYEYSVECQEDSACPAAEESLP ---------------------------------------------------1111----- IEVMVDAVHKLKYENYTSSFFIRDIIKPDPPKNLQLKPLKNSRQVEVSWEYPDTWSTPHS --------------------3333---------------------------1111--333 YFSLTFCVQVQGKSKRRVFTDKTSATVICRKNASISVRAQDRYYSSSWSEWASVPCS 3----------------------------2222-------1111------------- >Interleukin-12 subunit al; SWP:P29459; PDB:1F45B; QNLLRAVSNMLQKARQTLEFYPCSTVEACLPLELTKNESCTSFITNGSSFMMALCLSSIY 3333------------3333---3333---3333-------------------------- 
EDLKMYQVEFKTMNAKLLMDPKRQIFLDQNMLAVIDELMQALYKTKIKLCILLHAFRIRA -------------------1111------------------------------------- VTIDRVMSYLNAS ------------- >CELL DIVISION PROTEIN ZIP; SWP:P77173; PDB:1F46A; RKEAVIIMNVAAHHGSELNGELLLNSIQQAGFIFGDMNIYHRHLSPDGSGPALFSLANMV ------------2222------------------2222---------------------- KPGTFDPEMKDFTTPGVTIFMQVPSYGDELQLFKLMLQSAQHIADEVGGVVLDDQRRMMT -----1111-------------------------------------------1111---- PQKLREYQDIIREVKDANA ---------------1111 >ROP ALA2ILE2-6; SWP:P03051; PDB:1F4NA; GTKQEKTILNMARFIRSQALTILEKANELDADEIADIAESIHDHADEIYRSALARFGDDG --------------------------1111------------------------2222-- >FLAVODOXIN; SWP:P00323; PDB:1F4PA; PKALIVYGSTTGNTEYTAETIARELADAGYEVDSRDAASVEAGGLFEGFDLVLLGCSTWG -------------------------1111------3333--22222222----------- DDSIELQDDFIPLFDSLEETGAQGRKVACFGCGDSSWEYFCGAVDAIEEKLKNLGAEIVQ ------3333-----3333--2222----------------------------------- DGLRIDGDPRAARDDIVGWAHDVRGAI -------3333------------1111 >Regulatory protein alcR; SWP:P21228; PDB:1F4SP; SMADTRRRQNHSCDPCRKGKRRCDAPENRNEANENGWVSCSNCKRWNKDCTFNWLSSQRS --------------------------------1111-------1111-----3333---- KNSS ---- >ANTIBODY S-20-4, FAB FRAG; SWP:NA; PDB:1F4XH; EVQLEESGGGLVTPGGSLRLSCAASGYVFSTYDMSWVRQTPEKRLEWVAFISSGGGRTSY ------------2222-----------3333--------1111--------2222----- PDTVKGRFTISRDDAKNTLYLQMSSLQSEDTAMYYCTRHFYAVLDYWGRGTTLTVSSAKT 3333--------3333----------1111---------%%%%----------------- TPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYT -------------------------------------%%%%-------------iiii-- LSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVP -------1111------------1111--------- >Putative uncharacterized ; SWP:Q0VDX6; PDB:1F4XL; QAVVTQESALTTSPGETVTLTCRSSTGTVTTSNYANWVQEKPDHLFTGLIGATNNRAAGV ------------2222-------3333--1111-----------------------2222 PVRFSGSLIGGKAALTITGAQTEDEAIYFCALWYSGHWVFGGGTKLTVLGQPKSSPSVTL 3333----iiii--------1111------------------------------------ 
FPPSSEELETNKATLVCTITDFYPGVVTVDWKVDGTPVTQGMETTNPSKQSNNKYMASSY ---3333-------------------------iiii-------------1111------- LTLTARAWERHSSYSCQVTHEGHTVEKSLS -------1111--------iiii------- >SPORULATION INITIATION PH; SWP:P06535; PDB:1F51A; ISDTALTNELIHLLGHSRHDWMNKLQLIKGNLSLQKYDRVFEMIEEMVIDAKHESKLSNL --3333------------------------------------------------------ KTPHLAFDFLTFNWKTHYMTLEYEVLGEIKDLSAYDQKLAKLMRKLFHLFDQAVSRESEN -3333-----1111-----------------1111------------------------- HLTVSLQTDHPDRQLILYLDFHGAFADPSAFDDIRQNGYEDVDIMRFEITSHECLIEIGL ------------------------------------------------------------ D - >YEAST KILLER TOXIN-LIKE P; SWP:NA; PDB:1F53A; IDHVPCRGGENFLKIWSHSGGQQSVDCYANRGRIDFGGWWVDKISTGNNDLIYYDANGDS ------------------------------------------------------1111-- VRVDRWHDITYPNRPPKVNSIEIL ------------------------ >CALMODULIN; SWP:P06787; PDB:1F54A; SSNLTEEQIAEFKEAFALFDKDNNGSISSSELATVMRSLGLSPSEAEVNDLMNEIDVDGN ---------------3333--------3333-------------3333------------ HQIEFSEFLALMSRQLK ----------------- >PLANTACYANIN; SWP:O82080; PDB:1F56A; AVYNIGWSFNVNGARGKSFRAGDVLVFKYIKGQHNVVAVNGRGYASCSAPRGARTYSSGQ -------------2222--2222------2222----------------2222------- DRIKLTRGQNYFICSFPGHCGGGMKIAINAK ---------------22221111-------- >Envelope glycoprotein gp1; SWP:P05877; PDB:1F58H; DVQLQQSGPDLVKPSQSLSLTCTVTGYSITSGYSWHWIRQFPGNKLEWMGYIHYSAGTNY ------------2222-----------3333---------------------1111---- NPSLKSRISITRDTSKNQFFLQLNSV 3333---------1111--------- >BETA-1,4-XYLANASE; SWP:P77853; PDB:1F5JA; ALTSNASGTFDGYYYELWKDTGNTTMTVYTQGRFSCQWSNINNALFRTGKKYNQNWQSLG ---------iiii----------------iiii---------------------3333-- TIRITYSATYNPNGNSYLCIYGWSTNPLVEFYIVESWGNWRPPGATSLGQVTIDGGTYDI ----------------------------------------------------iiii---- YRTTRVNQPSIVGTATFDQYWSVRTSKRTSGTVTVTDHFRAWANRGLNLGTIDQITLCVE ---------------------------------3333-----1111-------------- GYQSSGSANITQNTFSQSS ------------------- >GAF; SWP:P36088; PDB:1F5MA; 
STGFHHADHVNYSSNLNKEEILEQLLLSYEGLSDGQVNWVCNLSNASSLIWHAYKSLAVD ----3333----1111----------------3333------------------------ INWAGFYVTQASEENTLILGPFQGKVACQMIQFGKGVCGTAASTKETQIVPDVNKYPGHI -------------------------------2222----------------11112222- ACDGETKSEIVVPIISNDGKTLGVIDIDCLDYEGFDHVDKEFLEKLAKLINKSCVF --1111---------1111------------------------------------- >INTERFERON-INDUCED GUANYL; SWP:P32455; PDB:1F5NA; MTGPMCLIENTNGRLMANPEALKILSAITQPMVVVAIVGLYRTGKSYLMNKLAGKKKGFS ----------iiii---------------------------------------------- LGSTVQSHTKGIWMWCVPHPKKPGHILVLLDTEGLGDVEKGDNQNDSWIFALAVLLSSTF ---------------------2222-----------1111--1111-------------- VYNSIGTINQQAMDQLYYVTELTHRIRSKSSVEDSADFVSFFPDFVWTLRDFSLDLEADG --------3333-------3333---------33333333-------------------- QPLTPDEYLTYSLKLKKGTSQKDETFNLPRLCIRKFFPKKKCFVFDRPVHRRKLAQLEKL ----------1111-----------------------------------11111111--- QDEELDPEFVQQVADFCSYIFSNSKTKTLSGGIQVNGPRLESLVLTYVNAISSGDLPCME 3333------------------------2222------------------1111------ NAVLALAQIENSAAVQKAIAHYEQQMGQKVQLPTESLQELLDLHRDSEREAIEVFIRSSF -------------------------------------------------------1111- KDVDHLFQKELAAQLEKKRDDFCKQNQEASSDRCSGLLQVIFSPLEEEVKAGIYSKPGGY -2222----------------------------------------------11112222- RLFVQKLQDLKKKYYEEPRKGIQAEEILQTYLKSKESMTDAILQTDQTLTEKEKEIEVER -------------1111---1111------------------3333-------------- VKAESAQASAKMLHEMQRKNEQMMEQKERSYQEHLKQLTEKMENDRVQLLKEQERTLALK ------------------------------------------------------------ LQEQEQLLKEGFQKESRIMKNEIQDLQTKM -------------------------3333- >V-cyclin; SWP:P89883; PDB:1F5QB; FQGFLDSSLLNEEDCRQMIYRSEREHDARMVGVNVDQHFTSQYRKVLTTWMFCVCKDLRQ ------1111------------------------3333---------------------- DNNVFPLAVALLDELFLSTRIDRENYQSTAAVALHIAGKVRAYMPIKATQLAYLCGGATT ---------------------3333----------------------------------- ADKLLTLEVKSLDTLSWVADRCLSTDLICYILHIMHAPREDYLNIYNLCRPKIFCALCDG ----------------------3333------1111-3333-----------------33 
RSAMKRPVLITLACMHLTMNQKYDYYENRIDGVCKSLYITKEELHQCCDLVDIAIVSFDE 33--------------------------------1111--------------------11 NYFKINA 11----- >OXYGEN-INSENSITIVE NADPH ; SWP:P17117; PDB:1F5VA; MTPTIELICGHRSIRHFTDEPISEAQREAIINSARATSSSSFLQCSSIIRITDKALREEL -------1111---------------------------2222------------------ VTLTGGQKHVAQAAEFWVFCADFNRHLQICPDAQLGLAEQLLLGVVDTAMMAQNALIAAE -1111--3333------------------1111---3333-------------------1 SLGLGGVYIGGLRNNIEAVTKLLKLPQHVLPLFGLCLGWPADNPDLKPRLPASILVHENS 111------3333-------------------------------------3333------ YQPLDKGALAQYDEQLAEYYLTRGSNNRRDTWSDHIRRTIIKESRPFILDYLHKQGWATR --------------------1111--------------1111--1111----1111---- >RHO-GEF VAV; SWP:P27870; PDB:1F5XA; MKGDEIYEDLMRLESVPTPPKMTEYDKRCCCLREIQQTEEKYTDTLGSIQQHFMKPLQRF -3333--------------------3333----------3333----------------- LKPQDMETIFVNIEELFSVHTHFLKELKDALAGPGATTLYQVFIKYKERFLVYGRYCSQV -3333----------------------------------1111--3333----------- ESASKHLDQVATAREDVQMKLEECSQRANNGRFTLRDLLMVPMQRVLKYHLLLQELVKHT -------------3333-------3333------1111-----------------1111- QDATEKENLRLALDAMRDLAQCVNEVKR ------------------1111------ >LOW-DENSITY LIPOPROTEIN R; SWP:P01130; PDB:1F5YA; GSAVGDRCERNEFQCQDGKCISYKWVCDGSAECQDGSDESQETCLSVTCKSGDFSCGGRV --------------3333---3333-----1111--1111-------------------- NRCIPQFWRCDGQVDCDNGSDEQGC ----3333-------------1111 >ELONGATION FACTOR EEF1A; SWP:P02994; PDB:1F60A; GKEKSHINVVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKEAAELGKGSFKYAWVLD -------------1111--------------------------3333------3333--- KLKAERERGITIDIALWKFETPKYQVTVIDAPGHRDFIKNMITGTSQADCAILIIAGGVG -----3333-----------1111---------1111----------------------- EFEAGISKDGQTREHALLAFTLGVRQLIVAVNKMDSVKWDESRFQEIVKETSNFIKKVGY --33331111----------------------3333%%%%-------------------- NPKTVPFVPISGWNGDNMIEATTNAPWYKGWEKETKAGVVKGKTLLEAIDAIEQPSRPTD -1111---------2222------3333------1111----------1111-----111 KPLRLPLQDVYKIGGIGTVPVGRVETGVIKPGMVVTFAPAGVTTEVKSVEMHHEQLEQGV 
1----------------------------2222-----------------!!!!-----2 PGDNVGFNVKNVSVKEIRRGNVCGDAKNDPPKGCASFNATVIVLNHPGQISAGYSPVLDC 222---------3333-2222-----------------------------2222-----! HTAHIACRFDELLEKNDRRSGKKLEDHPKFLKSGDAALVKFVPSKPMCVEAFSEYPPLGR !!!----------------------------2222---------------33333333-- FAVRDMRQTVAVGVIKSVDK ----%%%%------------ >Elongation factor 1-beta; SWP:P32471; PDB:1F60B; PAAKSIVTLDVKPWDDETNLEEMVANVKAIEMEGLTWGAHQFIPIGFGIKKLQINCVVED --------------1111--------1111-2222----------iiii----------- DKVSLDDLQQSIEEDEDHVQSTDIAAMQKL --------------3333------------ >TRANSCRIPTION FACTOR WSTF; SWP:NA; PDB:1F62A; ARCKVCRKKGEDDKLILCDECNKAFHLFCLRPALYEVPDGEWQCPACQPAT -------------------------3333-3333---------3333---- >HISTONE H3; SWP:P17317; PDB:1F66C; AVSRSQRAGLQFPVGRIHRHLKSRTTSHGRVGATAAVYSAAILEYLTAEVLELAGNASKD -------------------------!!!!--1111------------------------- LKVKRITPRHLQLAIRGDEELDSLIKATIAGGGVIPHIHKSLI --------------3333----------2222------1111- >HISTONE ACETYLTRANSFERASE; SWP:Q92830; PDB:1F68A; GDQLYTTLKNLLAQIKSHPSAWPFMEPVKKSEAPDYYEVIRFPIDLKTMTERLRSRYYVT -----------------33331111---3333-3333----------------------- RKLFVADLQRVIANCREYNPPDSEYCRCASALEKFFYFKLKEG ------------------------------------------- >SAR1; SWP:Q9CQC9; PDB:1F6BA; SSVLQFLGLYKKTGKLVFLGLDNAGKTTLLHMLKDDPTLHPTSEELTIAGMTFTTFDLGG --------2222--------2222------3333-------------!!!!--------- RRVWKNYLPAINGIVFLVDCADHERLLESKEELDSLMTDETIANVPILILGNKIDRPEAI -3333-3333--------11111111------------3333----------3333---- SEERLREMFGLYGQTTGKGSVSLKELNARPLEVFMCSVLKRQGYGEGFRWMAQYID ----------2222-----------------------1111----------1111- >PLACENTAL LACTOGEN; SWP:P16038; PDB:1F6FA; AQHPPYCRNQPGKCQIPLQSLFDRATTVANYNSKLAGEMVNRFDEQYVINCHTSSITTPN ----1111---------------------------------3333-----1111------ SKAEAINTEDKILFKLVISLLHSWDEPLHHAVTELANPALLTKAQEIKEKAKVLVDGVEV ------------------------------------------------------------ IQKRIHPGEKNEPYPVWSEQSSLTSQDENVRRVAFYRLFHCLHRDSSKIYTYLRILKCRL 
----------------1111-1111----------------------------------- TSC --- >Prolactin receptor [Precu; SWP:P05710; PDB:1F6FB; GKPEIHKCRSPDKETFTCWWNPGTDGGLPTNYSLTYSKEGEKTTYECPDYKTSGPNSCFF ------------------------iiii---------2222---------1111------ SKQYTSIWKIYIITVNATNQMGSSSSDPLYVDVTYIVEPEPPRNLTLEVKKKTYLWVKWS 3333---------------------------3333------------------------- PPTITDVKTGWFTMEYEIRLKPEEAEEWEIHFTGHQTQFKVFDLYPGQKYLVQTRCKPDH -3333-1111----------------------!!!!--------2222------------ GYWSRWSQESSVEMP --------------- >Ig kappa chain V-I region; SWP:P01600; PDB:1F6LL; DIQMTQSPASLSASVGETVTITCRASENIYSYLAWYQQKQGKSPQLLVYNAKTLAEGVPS -------------2222-----------------------------------------11 RFSGSGSGTQFSLKINSLQPEDFGSYYCQHHYGTPFTFGSGTKLEI 11----------------1111------------------------ >ALPHA-LACTALBUMIN; SWP:P00711; PDB:1F6RA; EQLTKCEVFRELKDLKGYGGVSLPEWVCTTFHTSGYDTQAIVQNNDSTEYGLFQINNKIW -----------3333-2222-------------%%%%-------------1111------ CKDDQNPHSSNICNISCDKFLDDDLTDDIMCVKKILDKVGINYWLAHKALCSEKLDQWLC --3333----1111-3333--------------------1111-3333-----3333--2 EKL 222 >DNA TRANSPOSITION PROTEIN; SWP:P03763; PDB:1F6VA; GSRIAKRTAINKTKKADVKAIADAWQINGEKELELLQQIAQKPGALRILNHSLRLAAMTA -------------1111-3333----------------1111------------------ HGKGERVNEDYLRQAFRELDLDVDISTLLRN -------3333-----------1111----- >BILE SALT ACTIVATED LIPAS; SWP:P19835; PDB:1F6WA; AKLGAVYTEGGFVEGVNKKLGLLGDSVDIFKGIPFAAPTKALENPQPHPGWQGTLKAKNF -------1111----------------------------2222----------------- KKRCLQATITQDSTYGDEDCLYLNIWVPQGRKQVSRDLPVMIWIYGGAFLMGSGHGANFL -------1111-----------------------------------iiii--1111---- NNYLYDGEEIATRGNVIVVTFNYRVGPLGFLSTGDANLPGNYGLRDQHMAIAWVKRNIAA ------3333---------------3333-----3333------------------3333 FGGDPDNITLFGESAGGASVSLQTLSPYNKGLIRRAISQSGVALSPWVIQKNPLFWAKKV ---1111------------------3333------------1111------3333----- AEKVGCPVGDAARMAQCLKVTDPRALTLAYKVPLAGLEYPMLHYVGFVPVIDGDFIPDDP -1111----------------------------------3333---------------33 
INLYANAADIDYIAGTNNMDGHIFASIDMPAINKGNKKVTEEDFYKLVSEFTITKGLRGA 333333------------11113333--3333---------------------------- KTTFDVYTESWAQDPSQENKKKTVVDFETDVLFLVPTEIALAQHRANAKSAKTYAYLFSH ---------%%%%------------------------------3333------------- PSRMPVYPKWVGADHADDIQYVFGKPFATPTGYRPQDRTVSKAMIAYWTNFAKTGDPNMG ------1111---22223333-------3333-3333------------------3333- DSAVPTHWEPYTTENSGYLEITKKMGSSSMKRSLRTNFLRYWTLTYLALPTVT -------------------------1111---------------3333----- >N-ACETYL-NEURAMINATE LYAS; SWP:P44539; PDB:1F74A; MRDLKGIFSALLVSFNEDGTINEKGLRQIIRHNIDKMKVDGLYVGGSTGENFMLSTEEKK ---------------1111--------------------------33331111------- EIFRIAKDEAKDQIALIAQVGSVNLKEAVELGKYATELGYDCLSAVTPFYYKFSFPEIKH ------------------------------------------------------------ YYDTIIAETGSNMIVYSIPFLTGVNMGIEQFGELYKNPKVLGVKFTAGDFYLLERLKKAY ------------------------------------1111-------------------1 PNHLIWAGFDEMMLPAASLGVDGAIGSTFNVNGVRARQIFELTKAGKLKEALEIQHVTND 111-----1111-3333-------------------------1111-------------- LIEGILANGLYLTIKELLKLEGVDAGYCREPMTSKATAEQVAKAKDLKAKFLS ------------------1111------------------------------- >UNDECAPRENYL PYROPHOSPHAT; SWP:O82827; PDB:1F75A; NINAAQIPKHIAIIMDGNGRWAKQKKMPRIKGHYEGMQTVRKITRYASDLGVKYLTLYAF ---------------------------3333----------------------------- NYLMKLPGDFLNTFLPELIEKNVKVETIGFIDDLPDHTKKAVLEAKEKTKHNTGLTLVFA 3333-------------------------3333---------------1111-------- LNYGGRKEIISAVQLIAERYKSGEISLDEISETHFNEYLFTANMPDPELLIRTSGEERLS -------------------------3333-333333331111------------------ NFLIWQCSYSEFVFIDEFWPDFNEESLAQCISIYQNR --33331111-------3333---------------- >DIHYDROOROTATE DEHYDROGEN; SWP:P05021; PDB:1F76A; YYPFVRKALFQLDPERAHEFTFQQLRRITGTPFEALVRQKVPAKPVNCGLTFKNPLGLAA 3333----1111-----------33332222--3333----------------------- GLDKDGECIDALGAGFGSIEIGTVTPRPQPGNDKPRLFRLVDAEGLINRGFNNLGVDNLV --1111----------------------------------1111---------------- ENVKKAHYDGVLGINIGKNKDTPVEQGKDDYLICEKIYAYAGYIAINISSPNTPGLRTLQ 
--1111------------33331111----------1111------------2222---- YGEALDDLLTAIKNKQNDLQAHHKYVPIAVKIAPDLSEEELIQVADSLVRHNIDGVIATN ------------------------------------------------1111-------- TTLDRSLVQGKNCDQTGGLSGRPLQLKSTEIIRRLSLELNGRLPIIGVGGIDSVIAAREK ----1111---1111-----3333--------------iiii------------------ IAAGASLVQIYSGFIFKGPPLIKEIVTHI ----------3333---3333-------- >RHOGAP PROTEIN; SWP:Q98935; PDB:1F7CA; AQLDSIGFSIIKKCIHAVETRGINEQGLYRIVGVNSRVQKLLSILMDPETEICAEWEIKT ---------------------1111-2222---3333-----1111------3333---- ITSALKTYLRMLPGPLMMYQFQRSFIKAAKLENQESRVSEIHSLVHRLPEKNRQMLHLLM --------------33331111-----1111-1111--------1111------------ NHLAKVADNHKQNLMTVANLGVVFGPTLLRPTVAAIMDIKFQNIVIEILIENHEKIFNTV ------------------------------------------------------------ PE -- >POL POLYPROTEIN; SWP:P16088; PDB:1F7DA; MIIEGDGILDKRSEDAGYDLLAAKEIHLLPGEVKVIPTGVKLMLPKGYWGLIIGKSSIGS -----------1111-------------2222------------2222------333311 KGLDVLGGVIDEGYRGEIGVIMINVSRKSITLMERQKIAQLIILPCKHEVLEQGKVVM 11--------1111------------------2222---------------------- >BLOOD COAGULATION FACTOR ; SWP:P08709; PDB:1F7EA; SDGDQCASSPCQNGGSCKDQLQSYICFCLPAFEGRNCETHKDDGSA ----3333---%%%%-------------1111-1111--------- >HOLO-(ACYL CARRIER PROTEI; SWP:P96618; PDB:1F7LA; GIYGIGLDITELKRIASMAGRQKRFAERILTRSELDQYYELSEKRKNEFLAGRFAAKEAF -----------------------3333---3333--3333-------------------- SKAFGTGIGRQLSFQDIEIRKDQNGKPYIICTKLSPAAVHVSITHTKEYAAAQVVIER --------11113333-----1111-----3333------------------------ >ACTIN DEPOLYMERIZING FACT; SWP:Q39250; PDB:1F7SA; ASGMAVHDDCKLRFLELKAKRTHRFIVYKIEEKQKQVVVEKVGQPIQTYEEFAACLPADE -------------------------------1111-----------------11111111 CRYAIYDFDFVTAENCQKSKIFFIAWCPDIAKVRSKMIYASSKDRFKRELDGIQVELQAT -----------1111------------1111----------------------------- DPTE ---- >ARGINYL-TRNA SYNTHETASE; SWP:Q05506; PDB:1F7UA; ASTANMISQLKKLSIAEPAVAKDSHPDVNIVDLMRNYISQELSKISGVDSSLIFPALEWT ---------1111---3333111133333333----------------33331111---- 
NTMERGDLLIPIPRLRIKGANPKDLAVQWAEKFPCGDFLEKVEANGPFIQFFFNPQFLAK -3333-----3333---------------1111-!!!!------!!!!------------ LVIPDILTRKEDYGSCKLVENKKVIIEFSSPNIAKPFHAGHLRSTIIGGFLANLYEKLGW --------!!!!-------------------1111--3333--------------1111- EVIRMNYLGDWGKQFGLLAVGFERYGNEEALVKDPIHHLFDVYVRINKDIEEEGDSIPLE -----------3333------------3333----------------1111-------11 QSTNGKAREYFKRMEDGDEEALKIWKRFREFSIEKYIDTYARLNIKYDVYSGESQVSKES 113333----------------------------------1111-------1111----- MLKAIDLFKEKGLTHEDKGAVLIDLTKFNKKLGKAIVQKSDGTTLYLTRDVGAAMDRYEK --------1111----iiii-----33333333-----1111------------------ YHFDKMIYVIASQQDLHAAQFFEILKQMGFEWAKDLQHVNFGMVQGMSTRKGTVVFLDNI ----------3333-----------11111111--------------------------- LEETKEKMHEVMKKNENKYAQIEHPEEVADLVGISAVMIQDMQGKRINNYEFKWERMLSF ----------3333-3333-------------------------3333----3333---- EGDTGPYLQYAHSRLRSVERNASGITQEKWINADFSLLKEPAAKLLIRLLGQYPDVLRNA ---3333--------------3333333311111111----------------------- IKTHEPTTVVTYLFKLTHQVSSCYDVLWVAGQTEELATARLALYGAARQVLYNGMRLLGL ------------------------------------------------------------ TPVERM ------ ------------------------------------------------------------ ------------- >CREB-BINDING PROTEIN; SWP:P45481; PDB:1F81A; SPQESRRLSIQRCIQSLVHACQCRNANCSLPSCQKMKRVVQHTKGCKRKTNGGCPVCKQL ------------------------1111-----------------3333----------- IALCCYHAKHCQENKCPVPFCLNIKHK --------------------------- >TRANSTHYRETIN THR119MET V; SWP:P02766; PDB:1F86A; CPLMVKVLDAVRGSPAINVAVHVFRKAADDTWEPFASGKTSESGELHGLTTEEEFVEGIY --------------------------1111----------1111------3333------ KVEIDTKSYWKALGISPFHEHAEVVFTANDSGPRRYTIAALLSPYSYSTMAVVTN ----------1111--------------1111----------1111--------- >32.5 KDA PROTEIN YLR351C; SWP:P49954; PDB:1F89A; SASKILSQKIKVALVQLSGSSPDKMANLQRAATFIERAMKEQPDTKLVVLPECFNSPYST -----------------------------------------1111-------1111--11 DQFRKYSEVINPKEPSTSVQFLSNLANKFKIILVGGTIPELDPKTDKIYNTSIIFNEDGK 113333---------3333------------------------------------1111- 
LIDKHRKVHLFHESETLSPGEKSTTIDTKYGKFGVGICYDMRFPELAMLSARKGAFAMIY ----------3333-------------1111-----!!!!-------------------- PSAFNTVTGPLHWHLLARSRAVDNQVYVMLCSPARNLQSSYHAYGHSIVVDPRGKIVAEA -------3333----------1111----------1111-----------1111------ GEGEEIIYAELDPEVIESFRQAVPLTKQRRF ------------------------------- >NEURAMINIDASE; SWP:P03472; PDB:1F8EA; RDFNNLTKGLCTINSWHIYGKDNAVRIGEDSDVLVTREPYVSCDPDECRFYALSQGTTIR ---------------------------------------------------------111 GKHSNGTIHDRSQYRALISWPLSSPPTV 13333------1111-----2222---- >BENZYL ALCOHOL DEHYDROGEN; SWP:Q59096; PDB:1F8FA; LKDIIAAVTPCKGADFELQALKIRQPQGDEVLVKVVATGMCHTDLIVRDQKYPVPLPAVL ----------2222--------------------------3333---------------- GHEGSGIIEAIGPNVTELQVGDHVVLSYGYCGKCTQCNTGNPAYCSEFFGRNFSGADSEG -----------1111---2222--------------111133331111--------1111 NHALCVNDHFFAQSSFATYALSRENNTVKVTKDVPIELLGPLGCGIQTGAGACINALKVT ---------%%%%---------3333--------3333---------------------2 PASSFVTWGAGAVGLSALLAAKVCGASIIIAVDIVESRLELAKQLGATHVINSKTQDPVA 222-------3333--------------------3333----1111-------------- AIKEITDGGVNFALESTGSPEILKQGVDALGILGKIAVVGAPQLGTTAQFDVNDLLLGGK ------------------3333----11112222--------2222---------1111- TILGVVEGSGSPKKFIPELVRLYQQGKFPFDQLVKFYAFDEINQAAIDSRKGITLKPIIK -----iiii-3333--------1111----1111---1111------------------- IA -- >NICOTINAMIDE NUCLEOTIDE T; SWP:Q2RSB2; PDB:1F8GA; KIAIPKERRPGEDRVAISPEVVKKLVGLGFEVIVEQGAGVGASITDDALTAAGATIASTA --------2222-------------1111-----22223333-------1111-----33 AQALSQADVVWKVQRPTAEEGTDEVALIKEGAVLCHLGALTNRPVVEALTKRKITAYAEL 33-1111---------3333--3333--2222-----1111------------------- PRISRAQSDILSSQSNLAGYRAVIDGAYEFARAFPTAAGTVPPARVLVFGVGVAGLQAIA --3333-----------------------------3333--------------------- TAKRLGAVVATDVRAATKEQVESLGGKFITVDDEATAETAGGYAKEGEEFRKKQAEAVLK --1111-------3333----1111------3333---1111--------------3333 ELVKTDIAITTALIPGKPAPVLITEEVTKKPGSVIIDLAVEAGGNCPLSEPGKIVVKHGV 3333---------2222------------2222---1111-----11112222---iiii 
KIVGHTNVPSRVAADASPLFAKNLLNFLTPHVDKDTKTLVKLEDETVSGTCVTRDGAIVH ------3333-----------------3333--1111---1111---------iiii--1 PALTGQGA 111----- >ISOCITRATE LYASE; SWP:P0A5H3; PDB:1F8MA; ASVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQL -------------------3333-------3333-1111--------------------- HDLEWVNALGALTGNMAVQQVRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQV ---------------------------------------1111---------1111---- VRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGGALNVYELQKALIAAGVAGSH ---------------------------------!!!!--------------1111----- WEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSD ----3333--1111------3333-------------------------3333------- VDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEA -33331111----1111-----------------3333---------------------- VKAEYPDQMLAYNCSPSFNWKKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFD ----1111------11113333-------------------------------------- LAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAGYFDRIATTVDPNSSTTALTG ---------------------3333---33333333-------------1111----222 STEEGQF 23333-- >NEUROPEPTIDE Y (PNPY); SWP:P01304; PDB:1F8PA; YPSKPDNPGEDAPAEDLARYYSALRHYINLITRQRY -----------2222--------------------- >ANTIBODY FAB FRAGMENT (LI; SWP:NA; PDB:1F8TH; GVQLQESGPGLVKPSQSLSLTCTVTGYSITSDYAWNWIRQFPGNKLEWMGYITYSGSTGY ------------2222-----------1111---------------------1111---- NPSLKSRISITRDTSKNQFFLQLNSVTTEDTATYYCASYDDYTWFTYWGQGTLVTVSAAK 1111----------------------1111------------------------------ TTPPSVFPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY ---------------------------------------iiii----------------- TLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC ------------------------1111------------ >Igk protein; SWP:Q58EU8; PDB:1F8TL; DVQMTQTPLTLSVTIGQPASISCESSQSLLYSNGKTYLNWLLQRPGQSPKRLIYLVSKLD -------------2222-------------1111---------2222------------2 SGVPDRFTGSGSGTDFTLRISRVEAEDLGVYYCVQGTHFPRTFGGGTKLEIKRADAAPTV 2223333----------------1111--------------------------------- 
SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----33331111---------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ------33331111--------1111------------- >RNA; SWP:Q9J7Z0; PDB:1F8VA; NRRNKARKVVSRSTALVPMAPASQRTGPAPRKPRKRNQALVRNPRLTDAGLAFLKCAFAA ------------1111------------------------------------------11 PDFSVDPGKGIPDNFHGRTLAIKDCNTTSVVFTPNTDTYIVVAPVPGFAYFRAEVAVGAQ 11------------------------------------------2222-------2222- PTTFVGVPYPTYATNFGAGSQNGLPAVNNYSKFRYASMACGLYPTSNMMQFSGSVQVWRV --------11113333--1111-------------------------------------- DLNLSEAVNPAVTAITPAPGVFANFVDKRINGLRGIRPLAPRDNYSGNFIDGAYTFAFDK -------------------------------3333------------3333--------- STDFEWCDFVRSLEFSESNVLGAATAMKLLAPGGGTDTTLTGLGNVNTLVYKISTPTGAV -------------------2222--------%%%%--------------------2222- NTAILRTWNCIELQPYTDSALFQFSGVSPPFDPLALECYHNLKMRFPVAVSSREN ---------------1111-3333-----------------1111---------- >NUCLEOSIDE 2-DEOXYRIBOSYL; SWP:Q9R5V5; PDB:1F8YA; PKKTIYFGAGWFTDRQNKAYKEAMEALKENPTIDLENSYVPLDNQYKGIRVDEHPEYLHD -----------------------------11113333--3333-2222-1111-1111-- KVWATATYNNDLNGIKTNDIMLGVYIPDEEDVGLGMELGYALSQGKYVLLVIPDEDYGKP --------------1111-------1111------------1111-------3333---- INLMSWGVSDNVIKMSQLKDFNFNKPRFDFYEGAVY -3333--------333311111111----------- ------------------------------- >BUCANDIN; SWP:P81782; PDB:1F94A; MECYRCGVSGCHLKITCSAEETFCYKWLNKISNERWLGCAKTCTEIDTWNVYNKCCTTNL ------1111-------3333--------------------------1111--------- CNT --- >JUNCTION ADHESION MOLECUL; SWP:O88792; PDB:1F97A; KGSVYTAQSDVQVPENESIKLTCTYSGFSSPRVEWKFVQGSTTALVCYNSQITAPYADRV -------------2222---------------------!!!!-----%%%%-3333---- TFSSSGITFSSVTRKDNGEYTCMVSEEGGQNYGEVSIHLTVLVPPSKPTISVPSSVTIGN --1111------1111----------------------------------------2222 RAVLTCSEHDGSPPSEYSWFKDGISMLTTRAFMNSSFTIDPKSGDLIFDPVTAFDSGEYY --------------------iiii---------------------------3333----- CQAQNGYGTAMRSEAAHMDAVELNVGG --------------------------- 
>R-PHYCOCYANIN; SWP:P59858; PDB:1F99A; MKTPLTEAIAAADSQGRFLSNTELQVVNGRYNRATSSLEAAKALTANADRLISGAANAVY --3333------1111-------------------------------------------- SKFPYTTQMPGPNYSSTAIGKAKCARDIGYYLRMVTYCLVVGGTGPMDDYLVAGLEEINR --3333----1111-------------------------------------2222----1 TFELSPSWYIEALKYIKNNHGLSGDVANEANTYIDYAINTLS 111-3333------------------------------1111 >R-phycocyanin beta chain; SWP:P59859; PDB:1F99B; MLDAFAKVVAQADARGEFLSNTQIDALLAIVSEGNKRLDVVNKITNNASAIVTNAARALF ------------1111-------------------------------------------- AEQPQLISPGGNAYTSRRMAACLRDMEIVLRYVSYAMIAGDASVLDDRCLNGLRETYQAL ---33332222----------------------------------------------333 GTPGASVAVAIQKMKDAALALVNDTTGTPAGDCASLVAEIATYFDRAAAAVA 3-------------------1111------------------------1111 >HYPOTHETICAL PROTEIN MJ05; SWP:Q57961; PDB:1F9AA; LRGFIIGRFQPFHKGHLEVIKKIAEEVDEIIIGIGSAQKSHTLENPFTAGERILMITQSL ----------------------1111---------1111--3333--------------3 KDYDLTYYPIPIKDIEFNSIWVSYVESLTPPFDIVYSGNPLVRVLFEERGYEVKRPEMFN 333-------------3333-----1111-----------------1111---------3 RKEYSGTEIRRRMLNGEKWEHLVPKAVVDVIKEIKGVERLRKLA 333---------------1111---------------------- >REGULATORY PROTEIN E2; SWP:P06790; PDB:1F9FA; HMTPIIHLKGDRNSLKCLRYRLRKHSDHYRDISSTWHWEKTGILTVTYHSETQRTKFLNT ------------------------------------------------------------ VAIPDSVQILVGYMTM ---1111--------- >ACIDIC LECTIN; SWP:Q9SM56; PDB:1F9KA; ETQSFNFDHFEENSKELNLQRQASIKSNGVLELTKLTKNGVPVWKSTGRALYAEPIKIWD -------------------!!!!--1111--------iiii------------------- STTGNVASFETRFSFNITQPYAYPEPADGLTFFMVPPNSPQGEDGGNLGVFKPPEGDNAF --------------------------------------------1111------------ AVEFDTFQNTWDPQVPHIGIDVNSIVSSKTLHFQLENGGVANVVIKYDSPTKILNVVLAF --------1111-----------------------2222---------1111-------- HSVGTVYTLSNIVDLKQEFPNSEWVNVGLSATTGYQKNAVETHEIISWSFTSSL 1111---------3333------------------2222--------------- >ARGININE REPRESSOR/ACTIVA; SWP:P17893; PDB:1F9NA; KGQRHIKIREIITSNEIETQDELVDMLKQDGYKVTQATVSRDIKELHLVKVPTNNGSYKY 
-----------1111-------------------1111--------------1111---- SLPADQRFNPLSKLKRALMDAFVKIDSASHMIVLKTMPGNAQAIGALMDNLDWDEMMGTI -3333------------------------------------------1111-3333---- CGDDTILIICRTPEDTEGVKNRLLELL -----------------------1111 >PLATELET FACTOR 4; SWP:P02776; PDB:1F9RA; GDLQCLCVKTTSQVRPRHITSLEVIKAGPHCAVPQLIATLKNGRKICLDLQAPLYKKIIK --------------3333---------3333--------1111-----33333333---- KLLES ----- >KINESIN-LIKE PROTEIN KAR3; SWP:P17119; PDB:1F9TA; GNIRVYCRIRPALKNLENSDTSLINVNEFDDNSGVQSMEVTKIQNTAQVHEFKFDKIFDQ ------------2222-------------------------3333-------------11 QDTNVDVFKEVGQLVQSSLDGYNVCIFAYGQTGSGKTFTMLNPGDGIIPSTISHIFNWIN 11--------3333---1111---------2222-------------------------- KLKTKGWDYKVNCEFIEIYNENIVDLLKHEIRHDQETKTTTITNVTSCKLESEEMVEIIL 3333--------------%%%%-------------------2222--------------- KKANKLRSTASTASNEHSSRSHSIFIIHLSGSNATGAHSYGTLNLVDLAGSERRETQNIN -------------33331111--------------------------------------- KSLSCLGDVIHALGQHIPFRNSKLTYLLQYSLTGDSKTLMFVNISPSSSHINETLNSLRF -----------1111--1111------3333-!!!!----------3333---------- ASKV ---- >KINESIN-LIKE PROTEIN KAR3; SWP:P17119; PDB:1F9VA; GNIRVYCRIRPALKNLENSDTSLINVNEFDDNSGVQSMEVTKIQNTAQVHEFKFDKIFDQ ------------2222--1111--------1111-------3333-------------11 QDTNVDVFKEVGQLVQSSLDGYNVCIFAYGQTGSGKTFTMLNPGDGIIPSTISHIFNWIN 11-------3333---3333----------2222-------------------------- KLKTKGWDYKVNCEFIEIYNENIVDLLRKHEIRHDQETKTTTITNVTSCKLESEEMVEII -3333-------------%%%%--1111-------------------------------- LKKANEHSSASHSIFIIHLSGSNAGAHSYGTLNLVDLAGSERINVSQVVGDRLRETQNIN -----3333----------------------------------1111------------- KSLSCLGDVIHALGQPDRHIPFRNSKLTYLLQYSLTGDSKTLMFVNISPSSSHINETLNS --------------------1111------3333-!!!!----------3333------- LRFASKVNSTRLV ------------- >PROTEIN (6-HYDROXYMETHYL-; SWP:P26281; PDB:1F9YA; TVAYIAIGSNLASPLEQVNAALKALGDIPESHILTVSSFYRTPPLGPQDQPDYLNAAVAL ------------------------1111-------------------------------- 
ETSLAPEELLNHTQRIELQQGRVRKAERWGPRTLDLDIMLFGNEVINTERLTVPHYDMKN -----------------1111-------------------!!!!---3333---1111-- RGFMLWPLFEIAPELVFPDGEMLRQILHTRAFDKLNKW 3333-------1111-1111------------------ >GLYOXALASE I; SWP:Q59384; PDB:1F9ZA; MRLLHTMLRVGDLQRSIDFYTKVLGMKLLRTSENPEYKYSLAFVGYGPETEEAVIELTYN ----------------------------------1111---------3333--------2 WGVDKYELGTAYGHIALSVDNAAEACEKIRQNGGNVTREAGPVKGGTTVIAFVEDPDGYK 222---------------------------------------2222--------1111-- IELIEEGN -------- >BETA-AMYLASE; SWP:P10537; PDB:1FA2A; APIPGVMPIGNYVSLYVMLPLGVVNADNVFPDKEKVEDELKQVKAGGCDGVMVDVWWGII --2222-3333--------------------3333--------1111--------1111- EAKGPKQYDWSAYRELFQLVKKCGLKIQAIMSFHQCGGNVGDAVFIPIPQWILQIGDKNP 3333----------------1111------------------------------333333 DIFYTNRAGNRNQEYLSLGVDNQRLFQGRTALEMYRDFMESFRDNMADFLKAGDIVDIEV 33---3333-------3333---------------------------------------- GCGAAGELRYPSYPETQGWVFPGIGEFQCYDKYMVADWKEAVKQAGNADWEMPGKGAGTY --2222-------3333-------------3333--------11111111----1111-- NDTPDKTEFFRPNGTYKTDMGKFFLTWYSNKLIIHGDQVLEEANKVFVGLRVNIAAKVSG --3333------------------------------------------------------ IHWWYNHVSHAAELTAGFYNVAGRDGYRPIARMLARHHATLNFTCLEMRDSEQPAEAKSA -2222-1111----------2222----------1111----------3333-3333--- PQELVQQVLSSGWKEYIDVAGENALPRYDATAYNQMLLKLRPNGVNLNGPPKLKMSGLTY ------------1111-----------------------------1111----------- LRLSDDLLQTDNFELFKKFVKKMHADLDPSPNAISPAVLERSNSAITIDELMEATKGSRP ---3333---3333-----------------------------------3333------- FPWYDVTDMPVDGSNPFD ------------------ >THIOREDOXIN F; SWP:P09856; PDB:1FAAA; LELALGTQEMEAIVGKVTEVNKDTFWPIVKAAGDKPVVLDMFTQWCGPCKAMAPKYEKLA ------------2222----1111-3333--!!!!-------1111-------------- EEYLDVIFLKLDCNQENKTLAKELGIRVVPTFKILKENSVVGEVTGAKYDKLLEAIQAAR --1111-------3333----1111----------%%%%--------------------- S - >FADD PROTEIN; SWP:Q61160; PDB:1FADA; AAPPGEAYLQVAFDIVCDNVGRDWKRLARELKVSEAKMDGIEEKYPRSLSERVRESLKVW -----------------------------------------------3333--------- 
KNAEKKNASVAGLVKALRTCRLNLVADLVEEAQES --------3333----------------------- >LARGE T ANTIGEN; SWP:P03074; PDB:1FAFA; MDRVLSRADKERLLELLKLPRQLWGDFGRMQQAYKQQSLLLHPDKGGSHALMQELNSLWG -----3333-----1111-------3333------------3333--3333--------- TFKTEVYNLRMNLGGTGFQ ----3333----------- >IGG2B-KAPPA R19.9 FAB (HE; SWP:NA; PDB:1FAIH; QVQLQQSGAELVRAGSSVKMSCKASGYTFTSYGVNWVKQRPGQGLEWIGYINPGKGYLSY ---------------------------3333--------2222----------------- NEKFKGKTTLTVDRSSSTAYMQLRSLTSEDAAVYFCARSFYGGSDLAVYYFDSWGQGTTL 33331111----1111----------3333------------------------------ TVSSAKTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPAL --------------------------------------------%%%%------------ LQSALYTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKL --------------3333----------------------- >Pancreatic trypsin inhibi; SWP:P00974; PDB:1FAKI; APDFCLEPPYDGPCRALHLRYFYNAKAGLCQTFYYGGCLAKRNNFESAEDCMRTC -3333-------------------1111--------------------------- >DUAL ADAPTOR OF PHOSPHOTY; SWP:Q9UN19; PDB:1FAOA; PSLGTKEGYLTKQGGLVKTWKTRWFTLHRNELKYFKDQMSPEPIRILDLTECSAVQFDYS -2222----------------------!!!!-----1111-------3333--------- QERVNCFCLVFPFRTFYLCAKTGVEADEWIKILRWKLSQI ----------1111----------------------1111 >RAF-1; SWP:P04049; PDB:1FAQ; LTTHNFARKTFLKLAFCDICQKFLLNGFRCQTCGYKFHEHCSTKVPTMCVDW ----------------3333-------------------------------- >FASCICULIN 1; SWP:P0C1Y9; PDB:1FAS; TMCYSHTTTSRAILTNCGENSCYRKSRRHPPKMVLGRGCGCPPGDDYLEVKCCTSPDKCN ----------------!!!!------------------------3333-------2222- Y - >THIOREDOXIN M; SWP:P07591; PDB:1FB6A; VQDVNDSSWKEFVLESEVPVMVDFWAPWCGPCKLIAPVIDELAKEYSGKIAVYKLNTDEA ----3333----1111---------11113333----------1111--------3333- PGIATQYNIRSIPTVLFFKNGERKESIIGAVPKSTLTDSIEKYL ------------------iiii-----------------3333- >Fructose-bisphosphate ald; SWP:P07764; PDB:1FBAA; TTYFNYPSKELQDELREIAQKIVAPGKGILAADESGPTMGKRLQDIGVENTEDNRRAYRQ -------3333------------2222---------------3333-------------- LLFSTDPKLAENISGVILFHETLYQKADDGTPFAEILKKKGIILGIKVDKGVVPLFGSED 
-11113333---------3333----1111-3333--1111-------------2222-- EVTTQGLDDLAARCAQYKKDGCDFAKWRCVLKIGKNTPSYQSILENANVLARYASICQSQ ------2222-------1111------------1111--------------------111 RIVPIVEPEVLPDGDHDLDRAQKVTETVLAAVYKALSDHHVYLEGTLLKPNMVTAGQSAK 1-----------------------------------1111-3333--------------- KNTPEEIALATVQALRRTVPAAVTGVTFLSGGQSEEEATVNLSAINNVPLIRPWALTFSY -------------------3333------!!!!--------------------------- GRALQASVLRAWAGKKENIAAGQNELLKRAKANGDAAQGKYVAGSAGAGSGSLFVANHAY 1111-------iiii1111----------------1111----1111------------- >FRUCTOSE 1,6-BISPHOSPHATA; SWP:P00636; PDB:1FBCA; DTNIVTLTRFVMEQGRKARGTGEMTQLLNSLCTAVKAISTAVRKAGIAHKLDVLSNDLVI --------------1111-------------------------------3333------- NVLKSSFATCVLVTEEDKNAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSIGTIFGIYRKN ---1111------1111------3333------------33331111------------- STDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMVNGVNCFMLDPAIGEFILVDRNVKI -----3333----1111-----------------1111---------------------- KKKGSIYSINEGYAKEFDPAITEYIQRKKFPPDNSAPYGARYVGSMVADVHRTLVYGGIF ---------11111111-------------1111-------------------------- MYPANKKSPKGKLRLLYECNPMAYVMEKAGGLATTGKEAVLDIVPTDIHQRAPIILGSPE ----3333-----3333---------1111--------3333----1111---------- DVTELLEIYQKHA ------------- >GUINEA FOWL LYSOZYME; SWP:GC1_MOUSE; PDB:1FBIH; QVQLQQPGAELVKPGASVKLSCKASGYTFTSYWMHWVKQGPGQGLEWIGEIDPSDSYPNY ----------------------------1111--------------------1111---- NEKFKGKATLTVDKSSSTAYMQLSSLTSEDSAVYYCASLYYYGTSYGVLDYWGQGTSVTV 3333---------1111-------------------------3333-------------- SSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQ ---------------------------------------------3333----------i SDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVP iii---------3333------------------------- >FIBROBLAST (INTERSTITIAL); SWP:P21692; PDB:1FBL; FVLTPGNPRWENTHLTYRIENYTPDLSREDVDRAIEKAFQLWSNVSPLTFTKVSEGQADI ---2222---------------3333---------------3333--------------- MISFVRGDHRDNSPFDGPGGNLAHAFQPGPGIGGDAHFDEDERWTKNFRDYNLYRVAAHE 
-----------------------------!!!!-----3333------------------ LGHSLGLSHSTDIGALMYPNYIYTGDVQLSQDDIDGIQAIYGPSENPVQPSGPQTPQVCD --1111-----1111---------------------1111----------------1111 SKLTFDAITTLRGELMFFKDRFYMRTNSFYPEVELNFISVFWPQVPNGLQAAYEIADRDE ----------iiii----!!!!--------------1111---------------1111- VRFFKGNKYWAVRGQDVLYGYPKDIHRSFGFPSTVKNIDAAVFEEDTGKTYFFVAHECWR ----!!!!---------------3333----3333------------------------- YDEYKQSMDTGYPKMIAEEFPGIGNKVDAVFQKDGFLYFFHGTRQYQFDFKTKRILTLQK --------------3333-2222---------iiii----!!!!---------------1 ANSWFNC 111---- >HEAT SHOCK FACTOR PROTEIN; SWP:P22121; PDB:1FBQA; PAFVNKLWSMVNDKSNEKFIHWSTSGESIVVPNRERFVQEVLKKYFKHSNFASFVRQLNM ------------3333------------------------1111---------------- YGWHKVQDVKSGSMLSNNDSRWEFENERH -----------1111--1111-------- ------------------------------------------------------------ --------------------------------- >FRUCTOSE-2,6-BISPHOSPHATA; SWP:P16119; PDB:1FBTA; RSIYLCRHGESELNLRGRIGGDSGLSARGKQYAYALANFIRSQGISSLKVWTSHKRTIQT -------------1111-----------------------1111----------3333-3 AEALGVPYEQWKALNEIDAGVCEETYEEIQEHYPEEFALRDQDKYRYRYPKGESYEDLVQ 333-------3333----!!!!--------------------------2222-------- RLEPVIELERQENVLVICHQAVRCLLAYFLDKSSDELPYLKCPLHTVLKLTPVAYGCRVE -3333-3333--------3333-----1111-33331111------------1111---- SIYLNV ------ >Immunoglobulin G-binding ; SWP:P38507; PDB:1FC2C; FNKEQQNAFYEILHLPNLNEEQRNGFIQSLKDDPSQSANLLAEA ---3333-------1111-------------------------- >SPO0A; SWP:P52934; PDB:1FC3A; NKPKNLDASITSIIHEIGVPAHIKGYLYLREAIAMVYHDIELLGSITKVLYPDIAKKYNT -------------------1111-----------------3333---------------- TASRVERAIRHAIEVAWSRGNLESISSLFGYTVSVSKAKPTNSEFIAMVADKLRLEHKA ------------------------1111--3333------------------------- >2-AMINO-3-KETOBUTYRATE CO; SWP:KBL_ECOLI; PDB:1FC4A; HREFYQQLTNDLETARAEGLFKEERIITSAQQADITVADGSHVINFCANNYLGLANHPDL ---------------1111-----------------1111---------1111------- IAAAKAGDSHGFGASVRFICGTQDSHKELEQKLAAFLGEDAILYSSCFDANGGLFETLLG 
---------------3333---3333---------------------------3333--1 AEDAIISDALNHASIIDGVRLCKAKRYRYANNDQELEARLKEAREAGARHVLIATDGVFS 111----1111-------1111-------2222--------------------------- DGVIANLKGVCDLADKYDALVVDDSHAVGFVGENGRGSHEYCDVGRVDIITGTLGKALGG -------------------------------1111----1111----------------- ASGGYTAARKEVVEWLRQRSRPYLFSNSLAPAIVAASIKVLEVEAGSELRDRLWANARQF --------------------3333------------------1111-------------- REQSAAGFTLAGADHAIIPVLGDAVVAQKFARELQKEGIYVTGFFYPVVPKGQARIRTQS ---1111------------------------------------------2222------3 AAHTPEQITRAVEAFTRIGKQLGVIA 333----------------------- >PHOTOSYSTEM II D1 PROTEAS; SWP:O04073; PDB:1FC6A; VTSEQLLFLEAWRAVDRAYVDKSFNGQSWFKLRETYLKKEPDRRAQTYDAIRKLAVLDDP ----------------------%%%%---------------------------3333-11 FTRFLEPSRLAALRRGTAGSVTGVGLEITYDGGSGKDVVVLTPAPGGPAEKAGARAGDVI 11----------------------------1111---------2222--1111-2222-- VTVDGTAVKGSLYDVSDLLQGEADSQVEVVLHAPGAPSNTRTLQLTRQKVTINPVTFTTC --iiii---------------2222-------2222------------------------ SNVAAAALPPGAAKQQLGYVRLATFNSNTTAAAQQAFTELSKQGVAGLVLDIRNNGGGLF ---3333-2222-------------1111------------------------------- PAGVNVARLVDRGDLVLIADSQGIRDIYSADGNSIDSATPLVVLVNRGTASASEVLAGAL -------------------3333----------------------1111----------- KDSKRGLIAGERTFGKGLIQTVVDLSDGSGVAVTVARYQTPAGVDINKIGVSPDVQLDPE ------------------------1111-----------1111--------------111 VLPTDLEGVCRVLGSDAAPRLF 1----------1111------- >FERREDOXIN; SWP:P00198; PDB:1FCA; AYVINEACISCGACEPECPVDAISQGGSRYVIDADTCIDCGACAGVCPVDAPVQA ----3333---3333--1111-------------------3333--1111----- >FLAVOCYTOCHROME C SULFIDE; SWP:Q06530; PDB:1FCDA; AGRKVVVVGGGTGGATAAKYIKLADPSIEVTLIEPNTDYYTCYLSNEVIGGDRKLESIKH ---------------------------------------------3333----3333--- GYDGLRAHGIQVVHDSATGIDPDKKLVKTAGGAEFGYDRCVVAPGIELIYDKIEGYSEEA ------------------------------------------------3333----1111 AAKLPHAWKAGEQTAILRKQLEDMADGGTVVIAPPAAPFRCPPGPYERASQVAYYLKAHK ----------3333--------------------------3333---------------- 
PMSKVIILDSSQTFSKQSQFSKGWERLYGFGTENAMIEWHPGPDSAVVKVDGGEMMVETA ---------------3333----------3333---------1111------------11 FGDEFKADVINLIPPQRAGKIAQIAGLTNDAGWCPVDIKTFESSIHKGIHVIGDASIANP 11--------------------3333--1111---------------------------- MPKSGYSANSQGKVAAAAVVVLLKGEEPGTPSYLNTCYSILAPAYGISVAAIYRPNADGS -----------------------------------------1111--------------- AIESVPDSGGVTPVDAPDWVLEREVQYAYSWYNNIVHDTFG ----2222--------3333--------------------- >Cytochrome subunit of sul; SWP:Q06529; PDB:1FCDC; EPTAEMLTNNCAGCHGTHGNSVGPASPSIAQMDPMVFVEVMEGFKSGEIASTIMGRIAKG -------1111----2222-------------3333----------------3333-111 YSTADFEKMAGYFKQQTYQPAKQSFDTALADTGAKLHDKYCEKCHVEGGKPLADEEDYHI 13333--------------------3333----------------iiii----------- LAGQWTPYLQYAMSDFREERRPMEKKMASKLRELLKAEGDAGLDALFAFYASQQ 2222-----------1111----3333------1111----------------- >FC RECEPTOR FC(GAMMA)RIIA; SWP:P12318; PDB:1FCGA; APPKAVLKLEPPWINVLQEDSVTLTCQGARSPESDSIQWFHNGNLIPTHTQPSYRFKANN ----------------2222--------------------iiii-1111---------11 NDSGEYTCQTGQTSLSDPVHLTVLFEWLVLQTPHLEFQEGETIMLRCHSWKDKPLVKVTF 11-------1111------------------------2222-------2222-------- FQNGKSQKFSHLDPTFSIPQANHSHSGDYHCTGNIGYTLFSSKPVTITVQV -iiii----------------1111-------------------------- >PEROXISOMAL TARGETING SIG; SWP:P50542; PDB:1FCHA; SATYDKGYQFEEENPLRDHPQPFEEGLRRLQEGDLPNAVLLFEAAVQQDPKHMEAWQYLG -------------1111------------1111---------------1111-------- TTQAENEQELLAISALRRCLELKPDNQTALMALAVSFTNESLQRQACEILRDWLRYTPAY ---1111---------------1111-----------1111----------------111 AHLVTRILGSLLSDSLFLEVKELFLAAVRLDPTSIDPDVQCGLGVLFNLSGEYDKAVDCF 1-----------------------------1111-------------------------- TAALSVRPNDYLLWNKLGATLANGNQSEEAVAAYRRALELQPGYIRSRYNLGISCINLGA ------1111----------------3333----------1111-----------1111- HREAVEHFLEALNMQRKSGGAMSENIWSTLRLALSMLGQSDAYGAADARDLSTLLTMFGL ----------------------3333------------3333-------------1111- PQ -- >O-ACETYLSERINE SULFHYDRYL; SWP:P12674; PDB:1FCJA; 
SKIYEDNSLTIGHTPLVRLNRIGNGRILAKVESRNPSFSVKCRIGANMIWDAEKRGVLKP -----3333---------------------11112222--------------------22 GVELVEPTNGNTGIALAYVAAARGYKLTLTMPETMSIERRKLLKALGANLVLTEGAKGMK 22-----------------------------3333------------------3333--- GAIQKAEEIVASDPQKYLLLQQFSNPANPEIHEKTTGPEIWEDTDGQVDVFISGVGTGGT --------------------3333-------------------iiii------------- LTGVTRYIKGTKGKTDLITVAVEPTDSPVIAQALAGEEIKPGPHKIQGIGAGFIPGNLDL ---------11111111------1111-----1111---------2222-----111133 KLIDKVVGITNEEAISTARRLMEEEGILAGISSGAAVAAALKLQEDESFTNKNIVVILPS 33-------------------------------------------3333----------- SG -- -------------------------------------------------------- >HYALURONOGLUCOSAMINIDASE; SWP:Q08169; PDB:1FCQA; EFNVYWNVPTFMCHKYGLRFEEVSEKYGILQNWMDKFRGEEIAILYDPGMFPALLVARNG --------33333333---------------2222---3333------------------ GVPQLGNLTKHLQVFRDHLINQIPDKSFPGVGVIDFESWRPIFRQNWASLQPYKKLSVEV -3333-------------------1111-------------3333-!!!!---------- VRREHPFWDDQRVEQEAKRRFEKYGQLFMEETLKAAKRMRPAANWGYYAYPYCYNLTPNQ ----1111-------------------------------1111---2222------3333 PSAQCEATTMQENDKMSWLFESEDVLLPSVYLRWNLTSGERVGLVGGRVKEALRIARQMT ---------------33333333------------------------------------- TSRKKVLPYYWYKYQDRRDTDLSRADLEATLRKITDLGADGFIIWGSSDDINTKAKCLQF -------------1111-----------------1111--------3333---------- REYLNNELGPAVKR -------------- >FERREDOXIN CHLOROPLASTIC ; SWP:P07839; PDB:1FCT; MAMAMRSTFAARVGAKPAVRGARPASRMSCMA ----------3333-1111------------- >RETINOIC ACID RECEPTOR GA; SWP:P13631; PDB:1FCYA; ASPQLEELITKVSKAHQETFPSLCQLGKYTTNSSADHRVQLDLGLWDKFSELATKCIIKI ----------------1111-3333----------------------------------- VEFAKRLPGFTGLSIADQITLLKAACLDILMLRICTRYTPEQDTMTFSDGLTLNRTQMHN ------2222------------------------1111-1111---1111---------- AGFGPLTDLVFAFAGQLLPLEMDDTETGLLSAICLICGDRMDLEEPEKVDKLQEPLLEAL --!!!!----------3333----------------1111-------------------- RLYARRRRPSQPYMFPRMLMKITDLRGISTKGAERAITLKMEIPGPMPPLIREMLE 
-------1111--------------------------3333--------------- >BETA-DEFENSIN 2; SWP:O15263; PDB:1FD3A; GIGDPVTCLKSGAICHPVFCPRRYKQIGTCGLPGTKCCKKP --------1111--------2222-------2222------ >IMMUNOGLOBULIN G BINDING ; SWP:P19909; PDB:1FD6A; MTTFKLIINGKTLKGETTTEAVDAATAEKVFKQYANDNGIDGEWTYDDATKTFTVTE ----------------------3333--------------------3333------- >MACROPHAGE INFECTIVITY PO; SWP:P20380; PDB:1FD9A; TDKDKLSYSIGADLGKNFKNQGIDVNPEAMAKGMQDAMSGAQLALTEQQMKDVLNKFQKD ----------------------------3333---------------------------- LMAKRTAEFNKKADENKVKGEAFLTENKNKPGVVVLPSGLQYKVINSGNGVKPGKSDTVT -------------------------33332222--1111--------------1111--- VEYTGRLIDGTVFDSTEKTGKPATFQVSQVIPGWTEALQLMPAGSTWEIYVPSGLAYGPR ------1111------3333-----1111--------11112222------3333----- SVGGPIGPNETLIFKIHLISVKKS ------------------------ >FRUCTOSE 1,6-BISPHOSPHATE; SWP:P79226; PDB:1FDJA; AHRFPALTPEQKKELSDIAQRIVANGKGILAADESVGTMGNRLQRIKVENSEENRRQFRE 1111-------------------iiii-------3333-----1111------------- ILFTVDNSINQSIGGVILFHETLYQKDSQGKLFRNILKEKGIVVGIKLDQGGAPLAGTNK ------------------3333---------3333--1111------------------- ETTIQGLDGLSERCAQYKKDGVDFGKWRAVLRIADQCPSSLAIQENANTLARYASICQQN ------2222-----------------------2222--------------------111 GLVPIVEPEVIPDGDHDLEHCQYVTEKVLAAVYKALNDHHVYLEGTLLKPNMVTAGHACT 1-----------------------------------1111-3333----------1111- KKYTPEQVAMATVTALHRTVPAAVPGICFLSGGMSEEDATLNLNAINLCPLPKPWKLSFS --------------------3333------%%%%----------1111------------ YGRALQASALAAWGGKAENKKATQEAFMKRAVVNCQAAKGQYVHTGSSGAASTQSLFTAS -1111-------iiii1111----------------1111-------------------- YTY --- >FATTY ACID-BINDING PROTEI; SWP:O15540; PDB:1FDQA; VEAFCATWKLTNSQNFDEYMKALGVGFATRQVGNVTKPTVIISQEGDKVVIRTLSTFKNT 3333---------------------3333---------------!!!!------------ EISFQLGEEFDETTADDRNCKSVVSLDGDKLVHIQKWDGKETNFVREIKDGKMVMTLTFG ----2222-----1111-------------------%%%%------------------!! 
DVVAVRHYEKA !!--------- >FLAVODOXIN REDUCTASE; SWP:P28861; PDB:1FDR; ADWVTGKVTKVQNWTDALFSLTVHAPVLPFTAGQFTKLGLEIRVQRAYSYVNSPDNPDLE --------------1111------------2222------------------1111---- FYLVTVPDGKLSPRLAALKPGDEVQVVSEAAGFFVLDEVPHCETLWMLATGTAIGPYLSI -----1111-----11112222------------3333---------------------- LRLGKDLDRFKNLVLVHAARYAADLSYLPLMQELEKRYEGKLRIQTVVSRETAAGSLTGR ------1111----------33331111-------1111-------------2222---- IPALIESGELESTIGLPMNKETSHVMLCGNPQMVRDTQQLLKETRQMTKHLRRRPGHMTA ------------------3333----------------------------1111------ EHYW ---- >COPPER TRANSPORT PROTEIN ; SWP:O00244; PDB:1FE0A; PKHEFSVDMTCGGCAEAVSRVLNKLGGVKYDIDLPNKKVCIESEHSMDTLLATLKKTGKT ----------3333-------------------1111----------------------- VSYLGL ------ >PHOSPHOLIPASE A2; SWP:Q9DF52; PDB:1FE5A; NLIQFKNMIQCAGTRPWTAYVNYGCYCGKGGSGTPVDELDRCCYTHDNCYNEAEKIPGCN ---------------3333-----------------3333-------------------3 PNIKTYSYTCTEPNLTCTDTADTCARFLCNCDRTAAICFASAPYNSNNVMISSSTNCQ 333-----------------------------------------3333--1111---- >VON WILLEBRAND FACTOR; SWP:NA; PDB:1FE8H; DVKLVQSGPGLVAPSQSLSITCTVSGFSLTTYGVSWVRQPPGKGLEWLGVIWGDGNTTYH ---------------------------1111--------2222--------1111----3 SALISRLSISKDNSRSQVFLKLNSLHTDDTATYYCAGNYYGMDYWGQGTSVTVSSAETTA 333---------1111---------1111------------------------------- PSVYKLEPVSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVT -----------------------------%%%%-------------%%%%---------3 SSTWPSQSITCNVAHPASSTKVDKKIEPRG 333-----------3333------------ >TRYPANOTHIONE REDUCTASE; SWP:P39040; PDB:1FECA; SRAYDLVVIGAGSGGLEAGWNAASLHKKRVAVIDLQKHHGPPHYAALGGTCVNVGCVPKK -------------------------------------------------3333------- LMVTGANYMDTIRESAGFGWELDRESVRPNWKALIAAKNKAVSGINDSYEGMFADTEGLT -------------3333-----3333---3333----------------------2222- FHQGFGALQDNHTVLVRESADPNSAVLETLDTEYILLATGSWPQHLGIEGDDLCITSNEA --------------------1111-----------------------2222----3333- FYLDEAPKRALCVGGGYISIEFAGIFNAYKARGGQVDLAYRGDMILRGFDSELRKQLTEQ 
------------------------------2222-----------2222----------- LRANGINVRTHENPAKVTKNADGTRHVVFESGAEADYDVVMLAIGRVPRSQTLQLEKAGV -1111--------------1111-----3333-----------------11113333--- EVAKNGAIKVDAYSKTNVDNIYAIGDVTDRVMLTPVAINEGAAFVDTVFANKPRATDHTK --1111----1111---1111---3333-------------------------------- VACAVFSIPPMGVCGYVEEDAAKKYDQVAVYESSFTPLMHNISGSTYKKFMVRIVTNHAD --------------------3333------------333333331111------------ GEVLGVHMLGDSSPEIIQSVAICLKMGAKISDFYNTIGVHPTSAEELCSMRTPAYFYEKG ---------2222---------------33331111------3333-------------- KRVEK ----- >PERIPLASMIC HYDROGENASE 1; SWP:P29166; PDB:1FEHA; MKTIIINGVQFNTDEDTTILKFARDNNIDISALCFLNNCNNDINKCEICTVEVEGTGLVT -----iiii--------------------------%%%%-------1111---------3 ACDTLIEDGMIINTNSDAVNEKIKSRISQLLDIHEFKCGPCNRRENCEFLKLVIKYKARA 333---2222--------------------1111---1111-1111-------------- SKPFLPKDKTEYVDERSKSLTVDRTKCLLCGRCVNACGKNTETYAMKFLNKNGKTIIGAE -------3333------------1111-----------------------iiii----22 DEKCFDDTNCLLCGQCIIACPVAALSEKSHMDRVKNALNAPEKHVIVAMAPSVRASIGEL 22-3333--------------------------------1111------3333---3333 FNMGFGVDVTGKIYTALRQLGFDKIFDINFGADMTIMEEATELVQRIENNGPFPMFTSCC -----------------1111-------------------------1111---------- PGWVRQAENYYPELLNNLSSAKSPQQIFGTASKTYYPSISGLDPKNVFTVTVMPCTSKKF ----------33331111-----------3333---1111--3333-------------- EADRPQMEKDGLRDIDAVITTRELAKMIKDAKIPFAKLEDSEADPAMGEYSGAGAIFGAT ---1111-iiii----------------1111-3333---------------------22 GGVMEAALRSAKDFAENAELEDIEYKQVRGLNGIKEAEVEINNNKYNVAVINGASNLFKF 22----------------------3333------------%%%%---------------- MKSGMINEKQYHFIEVMACHGGCVNGGGQPHVNPKDLEKVDIKKVRASVLYNQDEHLSKR 33331111----------2222---1111------3333-3333---------1111--- KSHENTALVKMYQNYFGKPGEGRAHEILHFKYKK 1111-------------2222------------- >50S RIBOSOMAL PROTEIN L25; SWP:P56930; PDB:1FEUA; MEYRLKAYYREGEKPSALRRAGKLPGLMYNRHLNRKVYVDLVEFDKVFRQASIHHVIVLE ------------------------------------------------------------ 
LPDGQSLPTLVRQVNLDKRRRRPEHVDFFVLSDEPVEMYVPLRFVGTPAGVRAGGVLQEI 1111-------------------------------------------1111--------- HRDILVKVSPRNIPEFIEVDVSGLEIGDSLHASDLKLPPGVELAVSPEETIAAVVPPEDV --------3333--------33332222--3333---2222----1111----------- EKLAE ----- >TRF2-INTERACTING TELOMERI; SWP:Q9NYB0; PDB:1FEXA; GRIAFTDADDVAILTYVKENARSPSSVTGNALWKAMEKSSLTQHSWQSLKDRYLKHLRG ------------------------1111-------3333-------------------- >PEPTIDE METHIONINE SULFOX; SWP:P27110; PDB:1FF3A; SLFDKKHLVSPADALPGRNTPMPVATLHAVNGHSMTNVPDGMEIAIFAMGFWGVERLFWQ ---------3333-------------------------2222------------------ LPGVYSTAAGYTGGYTPNPTYREVCSGDTGHAEAVRIVYDPSVISYEQLLQVFWENHDPA 2222-------------------3333------------1111----------------- QGMRQGNDHGTQYRSAIYPLTPEQDAAARASLERFQAAMLAADDDRHITTEIANATPFYY ----!!!!--1111-------------------------1111----------------- AEDDHQQYLHKNPYGYCGIGGIGVCLPPEA -3333-3333-------------------- >MUSCARINIC TOXIN/ACETYLCH; SWP:P18328; PDB:1FF4A; LTCVTTKSIGGVTTEDCPAGQNVCFKRWHYVTPKNYDIIKGCAATCPKVDNNDPIRCCGT ------1111-------2222--------------------------------------2 DKCND 222-- >SACCHAROPINE REDUCTASE; SWP:Q9P4R4; PDB:1FF9A; ATKSVLMLGSGFVTRPTLDVLTDSGIKVTVACRTLESAKKLSAGVQHSTPISLDVNDDAA ----------3333-------1111----------------2222--------1111--- LDAEVAKHDLVISLIPFHATVIKSAIRQKKHVVTTSYVSPAMMELDQAAKDAGITVMNEI ----1111------------------------------3333-------1111------- GLDPGIDHLYAIKTIEEVHAAGGKIKTFLSYCGGLPAPESSDNPLGYKFSWSSRGVLLAL ----3333----------------------------3333--1111----------3333 RNAASFYKDGKVTNVAGPELMATAKPYFIYPGFAFVAYPNRDSTPYKERYQIPEADNIVR -------iiii-----33331111-----3333--------------11111111----- GTLRYQGFPQFIKVLVDIGFLSDEEQPFLKEAIPWKEATQKIVKASSASEQDIVSTIVSN ----2222----------1111---1111------------------------------- ATFESTEEQKRIVAGLKWLGIFSDKKITPRGNALDTLCATLEEKMQFEEGERDLVMLQHK -------------------1111------------------------2222--------- FEIENKDGSRETRTSSLCEYGAPIGSGGYSAMAKLVGVPCAVAVKFVLDGTISDRGVLAP ----1111--------------2222---------------------------------- 
MNSKINDPLMKELKEKYGIECKEKVVA -3333---------------------- >CUTM, FLAVOPROTEIN OF CAR; SWP:P19915; PDB:1FFVA; KKIITVNVNGKAQEKAVEPRTLLIHFLREELNLTGAHIGCETSHCGACTVDIDGRSVKSC -------iiii------1111------------------------1111--iiii--111 THLAVQCDGSEVLTVEGLANKGVLHAVQEGFYKEHGLQCGFCTPGMLMRAYRFLQENPNP 1-33332222---3333--iiii----------------1111--------3333----- TEAEIRMGMTGNLCRCTGYQNIVKAVQYAARKLQE ------1111---------------------1111 >Carbon monoxide dehydroge; SWP:P19913; PDB:1FFVB; DAEARELALAGMGASRLRKEDARFIQGKGNYVDDIKMPGMLHMDIVRAPIAHGRIKKIHK ------33332222---11113333-----1111--2222-------------------- DAALAMPGVHAVLTAEDLKPLKLHWMPTLAGDVAAVLADEKVHFQMQEVAIVIADDRYIA -----2222----3333-1111-----1111------------2222------------- ADAVEAVKVEYDELPVVIDPIDALKPDAPVLREDLAGKTSGAHGPREHHNHIFTWGAGDK ---1111-----------3333--1111---3333------------1111--------- AATDAVFANAPVTVSQHMYYPRVHPCPLETCGCVASFDPIKGDLTTYITSQAPHVVRTVV --------------------------------------1111------------------ SMLSGIPESKVRIVSPDIGGGFGNKVGIYPGYVCAIVASIVLGRPVKWVEDRVENISTTA ------3333----------iiii----3333------------------3333------ FARDYHMDGELAATPDGKILGLRVNVVADHGAFDACADPTKFPAGLFHICSGSYDIPRAH -------------1111---------------------1111---1111-!!!!------ CSVKGVYTNKAPGGVAYSFRVTEAVYLIERMVDVLAQKLNMDKAEIRAKNFIRKEQFPYT -----------------------------------------3333-------1111---- TQFGFEYDSGDYHTALKKVLDAVDYPALRAEQAARRADPNSPTLMGIGLVTFTEVVGAGP 1111---------------------------------1111------------------3 SKMCDILGVGMFDSCEIRIHPTGSAIARMGTITQGQGHQTTYAQIIATELGIPSEVIQVE 333--iiii----------1111-------------3333------------3333---- EGDTSTAPYGLGTYGSRSTPVAGAAIALAARKIHAKARKIAAHMLEVNENDLDWEVDRFK --1111-------%%%%------------------------------3333--------- VKGDDSKFKTMADIAWQAYHQPPAGLEPGLEAVHYYDPPNFTYPFGIYLCVVDIDRATGE ---3333---------------2222---------------------------------- TKVRRFYALDDCGTRINPMIIEGQIHGGLTEGYAVAMGQQMPFDAQGNLLGNTLMDYFLP -------------------------------------------1111-----3333---- TAVETPHWETDHTVTPSPHHPIGAKGVAESPHVGSIPTFTAAVVDAFAHVGVTHLDMPHT 
3333------------1111------1111----------------3333---------- SYRVWKSLKEHNLAL --------1111--- >Carbon monoxide dehydroge; SWP:P19914; PDB:1FFVC; MIPPRFEYHAPKSVGEAVALLGQLGSDAKLLAGGHSLLPMMKLRFAQPEHLIDINRIPEL ------------------------1111------------1111------------1111 RGIREEGSTVVIGAMTVENDLISSPIVQARLPLLAEAAKLIADPQVRNRGTIGGDIAHGD -----!!!!---111133331111-------------1111-3333-------------3 PGNDHPALSIAVEAHFVLEGPNGRRTVPADGFFLGTYMTLLEENEVMVEIRVPAFAQGTG 333----------------1111-----------2222---1111----------2222- WAYEKLKRKTGDWATAGCAVVMRKSGNTVSHIRIALTNVAPTALRAEAAEAALLGKAFTK --------2222------------!!!!------------------------2222---- EAVQAAADAAIAICEPAEDLRGDADYKTAMAGQMVKRALNAAWARCA ----------1111----1111--------------------1111- >ISOLEUCYL-TRNA SYNTHETASE; SWP:P41972; PDB:1FFYA; MDYEKTLLMPKTDFPMRGGLPNKEPQIQEKWDAEDQYHKALEKNKGNETFILHDGPPYAN --3333------------3333-------------------1111--------------- GNLHMGHALNKILKDFIVRYKTMQGFYAPYVPGWDTHGLPIEQALTKKGVDRKKMSTAEF ---------------------1111--------------------3333----------- REKCKEFALEQIELQKKDFRRLGVRGDFNDPYITLKPEYEAAQIRIFGEMADKGLIYKGK -------------------1111---1111--11113333-------------------- KPVYWSPSSESSLAEAEIEYHDKRSASIYVAFNVKDDKGVVDADAKFIIWTTTPWTIPSN ------1111---3333-------------------1111------------1111---- VAITVHPELKYGQYNVNGEKYIIAEALSDAVAEALDWDKASIKLEKEYTGKELEWVVAQH ------------------------------3333-------------------------- PFLDRESLVINGDHVTTDAGTGCVHTAPGHGEDDYIVGQQYELPVISPIDDKGVFTEEGG --------------------------1111-------------------------2222- QFEGMFYDKANKAVTDLLTEKGALLKLDFITHSYPHDWRTKKPVIFRATPQWFASISKVR -------1111--1111-------------------------------------3333-- QDILDAIENTNFKVNWGKTRIYNMVRDRGEWVISRQRVWGVPLPVFYAENGEIIMTKETV ------1111-------------------------------------------------- NHVADLFAEHGSNIWFEREAKDLLPEGFTHPGSPNGTFTKETDIMDVWFDSGSSHRGVLE ----------3333----3333----------1111------------------------ TRPELSFPADMYLEGSDQYRGWFNSSITTSVATRGVSPYKFLLSHGFVMDGEGKKMSKSL -3333----------1111------------------------------1111----111 
GNVIVPDQVVKQKGADIARLWVSSTDYLADVRISDEILKQTSDDYRKIRNTLRFMLGNIN 1--------------------11113333-----------------------------11 DFNPDTDSIPESELLEVDRYLLNRLREFTASTINNYENFDYLNIYQEVQNFINVELSNFY 113333---3333----------------------1111--------------------- LDYGKDILYIEQRDSHIRRSMQTVLYQILVDMTKLLAPILVHTAEEVWSHTPHVKEESVH --3333-----1111--------------------3333-----------2222---333 LADMPKVVEVDQALLDKWRTFMNLRDDVNRALETARNEKVIGKSLEAKVTIASNDKFNAS 3----------------------------------1111--------------1111333 EFLTSFDALHQLFIVSQVKVVDKLDDQATAYEHGDIVIEHADGEKCERCWNYSEDLGAVD 33333-------------------------1111------------------------!! ELTHLCPRCQQVVKSLV !!--------------- >HISTIDINOL PHOSPHATE AMIN; SWP:P06986; PDB:1FG7A; TVTITDLARENVRNLTPYQSARRLGGNGDVWLNANEYPTAVEFQLTQQTLNRYPECQPKA --3333-----------------2222---------------------1111-------- VIENYAQYAGVKPEQVLVSRGADEGIELLIRAFCEPGKDAILYCPPTYGYSVSAETIGVE -----------1111-------------------2222---------------------- CRTVPTLDNWQLDLQGISDKLDGVKVVYVCSPNNPTGQLINPQDFRTLLELTRGKAIVVA ------1111------11112222----------------3333-------2222----- DEAYIEFCPQASLAGWLAEYPHLAILRTLSKAFALAGLRCGFTLANEEVINLLKVIAPYP -1111--3333-11111111----------11113333---------------------- LSTPVADIAAQALSPQGIVARERVAQIIAEREYLIAALKEIPCVEQVFDSETNYILARFK ----------------------------------------1111---------------- ASSAVFKSLWDQGIILRDQNKQPSLSGCLRITVGTREESQRVIDALRAEQV ------------------1111--2222-----------------1111-- >HYDROXYLAMINE OXIDOREDUCT; SWP:Q50925; PDB:1FGJA; DISTVPDETYDALKLDRGKATPKETYEALVKRYKDPAHGAGKGTMGDYWEPIAISIYMDP -----3333-------------------------1111----1111-----1111----- NTFYKPPVSPKEVAERKDCVECHSDETPVWVRAWKRSTHANLDKIRNLKSDDPLYYKKGK --------------3333------------------3333-3333---1111-3333--- LEEVENNLRSMGKLGEKETLKEVGCIDCHVDVNKKDKADHTKDIRMPTADTCGTCHLREF --------1111-----------3333-----------3333-----33333333-3333 AERESERDTMVWPNGQWPAGRPSHALDYTANIETTVWATMPQREVAEGCTMCHTNQNKCD ---3333----------2222------------3333----3333333311113333--- 
NCHTRHEFSAAESRKPEACATCHSGVDHNNWEAYTMSKHGKLAEMNRDKWNWEVRLKDAF --------3333--3333------1111-----1111--------3333-33333333-- SKGGQNAPTCAACHMEYEGEYTHNITRKTRWANYPFVPGIAENITSDWSEARLDSWVLTC 3333-----------------------------3333--3333----------------- TQCHSERFARSYLDLMDKGTLEGLAKYQEANAIVHKMYEDGTLTGQKTNRPNPPEPEKPG -------------------------------------1111-2222-------------- FGIFTQLFWSKGNNPASLELKVLEMGENNLAKMHVGLAHVNPGGWTYTEGWGPMNRAYVE --3333----!!!!-3333----------------------------------------- IQDEYTKMQELSALQARVN ------------------- >FGF RECEPTOR 1; SWP:P11362; PDB:1FGKA; ELPEDPRWELPRDRLVLGKPLGQVVLAEAIGLPNRVTKVAVKMLKSDATEKDLSDLISEM ----3333--1111------------------------------1111------------ EMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLQARRPPEEQLSSKDLVSCA -------------------------------1111------1111--------------- YQVARGMEYLASKKCIHRDLAARNVLVTEDNVMKIADFGLARDIHHIDYYKKTTNGRLPV ----------1111------3333---1111-----1111--1111-1111-1111---1 KWMAPEALFDRIYTHQSDVWSFGVLLWEIFTLGGSPYPGVPVEELFKLLKEGHRMDKPSN 111-------------------------1111----2222--------1111-------- CTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRIVALTS ----------1111-3333--------------1111- >Anti-colorectal carcinoma; SWP:Q7TS98; PDB:1FGNL; DIKMTQSPSSMYASLGERVTITCKASQDIRKYLNWYQQKPWKSPKTLIYYATSLADGVPS -------------2222-----------iiii--------------------------11 RFSGSGSGQDYSLTISSLESDDTATYYCLQHGESPYTFGGGTKLEINRADAAPTVSIFPP 11----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -3333------------1111------------- >FOLYLPOLYGLUTAMATE SYNTHE; SWP:P15925; PDB:1FGS; MNYTETVAYIHSFPRLAGDHRRILTLLHALGNPQQQGRYIHVTGTNGKGSAANAIAHVLE -3333----1111--------------11113333-----------3333---------1 ASGLTVGLYTSPFIMRFNERIMIDHEPIPDAALVNAVAFVRAALERLQQQQADFNVTEFE 111------------3333---%%%%------------------------1111------ FITALAYWYFRQRQVDVAVIEVGDSTNVITPVVSVLTEVALDTITAIAKHKAGIIKRGIP 
----------1111------------------------------------3333-2222- VVTGNLVPDAAAVVAAKVATTGSQWLRFDRDFSVPKAKLHGWGQRFTYEDQDGRISDLEV --------------------------2222-------------------1111------- PLVGDYQQRNMAIAIQTAKVYAKQTEWPLTPQNIRQGLAASHWPARLEKISDTPLIVIDG ---3333------------------------------1111-2222-------------- AHNPDGINGLITALKQLFSQPITVIAGYAAMADRLTAAFSTVYLVPVPGTPRGRLKDSWQ ---------------------------3333---1111--------1111---------- EALAASLNDVPDQPIVITGSLYLASAVRQTLLG ---------1111------3333---------- >Prolactin-binding protein; SWP:Q9QV16; PDB:1FGVH; EVQLVESGGGLVQPGGSLRLSCATSGYTFTEYTMHWMRQAPGKGLEWVAGINPKNGGTSY ------------2222-----------1111--------2222----------------- ADSVKGRFTISVDKSKNTLYLQMNSLRAEDTAVYYCARWRGLDVRYFDVWGQGTLVTVSS 3333---------1111---------3333------------------------------ >DNA rearranged by a t(2; SWP:Q6LBV5; PDB:1FGVL; DIQMTQSPSSLSASVGDRVTITCRASQDINNYLNWYQQKPGKAPKLLIYYTSTLESGVPS -------------2222-----------iiii------2222------------222233 RFSGSGSGTDYTLTISSLQPEDFATYYCQQGNTLPPTFGAGTKVEIK 33----!!!!--------3333------------------------- >GRP1; SWP:O08967; PDB:1FGYA; TFFNPDREGWLLKLGGRVKTWKRRWFILTDNCLYYFEYTTDKEPRGIIPLENLSIREVLD ----------------------------%%%%-----1111--------2222------- PRKPNCFELYNPSHKGQVIKACKTEADGRVVEGNHVVYRISAPSPEEKEEWKSIKASISR ----------3333----------1111-------------------------------- DPFYD 1111- >CATHEPSIN V; SWP:O60911; PDB:1FH0A; LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ -----3333--------------3333------------------------------111 GNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVAQDTGFTVVAPGKE 1-!!!!------------------3333------------3333---------------- KALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSDN --------------------3333---------1111-------------------1111 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV ----------1111-iiii------%%%%-1111------- >NODULATION PROTEIN F; SWP:P04685; PDB:1FH1A; LTLEIISAINKLVLGLADVLWDLEQLNIGDVVEAVRG 33333333-333333333333-----3333--3333- >Ig heavy chain V region 6; SWP:P18528; 
PDB:1FH5H; SGGGLVKPAGSLKLSCAASGFTFSSYYMYWVRQTPDKRLEWVATISDGGSYTYYPDSVKG ------2222-----------1111--------1111--------1111-----3333-- RFTISRDNAKNNLYLQMSSLKSEDTAMYYCARDAMDYWGQGTLVTVSAAKTTPPSVYPLA --------------------1111------------------------------------ VTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVTSSTWPSETVTC ------------------%%%%--------------------------33331111---- NVAHPASSTKVDKKIVPR ---3333----------- >BETA-1,4-XYLANASE; SWP:Q59277; PDB:1FH9A; ATTLKEAADGAGRDFGFALDPNRLSEAQYKAIADSEFNLVVAENAMKWDATEPSQNSFSF --------1111-------3333--------------------11111111--2222--- GAGDRVASYAADTGKELYGHTLVWHSQLPDWAKNLNGSAFESAMVNHVTKVADHFEGKVA ----------------------------3333----------------------2222-- SWDVVNEAFADGGGRRQDSAFQQKLGNGYIETAFRAARAADPTAKLCINDYNVEGINAKS ---------2222------------1111------------------------------- NSLYDLVKDFKARGVPLDCVGFQSHLIVGQVPGDFRQNLQRFADLGVDVRITELDIRMRT --------------------------2222-1111-------1111-------------- PSDATKLATQAADYKKVVQACMQVTRCQGVTVWGITDKYSWVPDVFPGEGAALVWDASYA -----------------------1111--------3333-3333-2222------1111- KKPAYAAVMEAF ------------ >GLUTATHIONE TRANSFERASE; SWP:P31670; PDB:1FHE; PAKLGYWKLRGLAQPVRLFLEYLGEEYEEHLYGRDDREKWMSEKFNMGLDLPNLPYYIDD -------------3333---------------111133331111---------------- KCKLTQSVAIMRYIADKHGMLGTTPEERARISMIEGAAMDLRIGFGRVCYNPKFEEVKEE -----3333-----3333-------3333----3333-------3333--1111-----3 YVKELPKTLKMWSDFLGDRHYLTGSSVSHVDFMLYETLDSIRYLAPHCLDEFPKLKEFKS 333------------------------33333333-333311113333------------ RIEALPKIKAYMESKRFIKWPLNGWAASFGAGDA 3333------------------------------ >SEED COAT PEROXIDASE; SWP:O22443; PDB:1FHFA; QLTPTFYRETCPNLFPIVFGVIFDASFTDPRIGASLMRLHFHDCFVQGCDGSVLLNNTDT ----1111---------------1111---------------3333----3333---111 IESEQDALPNINSIRGLDVVNDIKTAVENSCPDTVSCADILAIAAEIASVLGGGPGWPVP 1-3333---2222----------------------3333----------1111------- LGRRDSLTANRTLANQNLPAPFFNLTQLKASFAVQGLNTLDLVTLSGGHTFGRARCSTFI -------------1111--1111---------1111------------------3333-1 
NRLYNFSNTGNPDPTLNTTYLEVLRARCPQNATGDNLTNLDLSTPDQFDNRYYSNLLQLN 111--%%%%---1111-3333----------------------------33333333--- GLLQSDQELFSTPGADTIPIVNSFSSNQNTFFSNFRVSMIKMGNIGVLTGDEGEIRLQCN --11113333-----------------------------------------------111 FVNG 1--- >TELOKIN; SWP:P56276; PDB:1FHGA; AEEKPHVKPYFTKTILDMEVVEGSAARFDCKVEGYPDPEVMWFKDDNPVKESRHFQIDYD -------------------------------------------%%%%----3333----1 EEGNCSLTISEVCGDDDAKYTCKAVNSLGEATCTAELLVETM 111---------3333---------1111------------- >HEMOGLOBIN (ALPHA CHAIN); SWP:P01952; PDB:1FHJA; VLSPADKTNIKSTWDKIGGHAGDYGGEALDRTFQSFPTTKTYFPHFDLSPGSAQVKAHGK ----------------!!!!---------------3333---1111-------------- KVADALTTAVAHLDDLPGALSALSDLHAYKLRVDPVNFKLLSHCLLVTLACHHPTEFTPA -----------3333------------------3333----------------1111--- VHASLDKFFTAVSTVLTSKYR ----------------1111- >HEMOGLOBIN (ALPHA CHAIN); SWP:P02056; PDB:1FHJB; VHLTAEEKSLVSGLWGKVNVDEVGGEALGRLLIVYPWTQRFFDSFGDLSTPDAVMSNAKV --------------11113333---------------33331111--------------- KAHGKKVLNSFSDGLKNLDNLKGTFAKLSELHCDKLHVDPENFKLLGNVLVCVLAHHFGK ----------------3333------------------3333----------------33 EFTPQVQAAYQKVVAGVANALAHKYH 33-------------------1111- >UNC-89; SWP:O01761; PDB:1FHOA; MGDTGKLGRIIRHDAFQVWEGDEPPKLRYVFLFRNKIMFTEQDASTSPPSYTHYSSIRLD ------------------------------------------------------------ KYNIRQHTTDEDTIVLQPQEPGLPSFRIKPKDFETSEYVRKAWLRDIAEEQEKYAAERD --------------------------------------------------3333----- >O-SUCCINYLBENZOATE SYNTHA; SWP:P29208; PDB:1FHUA; MRSAQVYRWQIPMDLKTRDGLYVCLREGEREGWGEISPLPGFSQETWEEAQSVLLAWVNN --------------------------!!!!--------2222------------------ WLAGDCELPQMPSVAFGVSCALAELTDTLPQAANYRAAPLCNGDEKVAKVKVGLYEAVRD 1111-------------------------------------------------------- GMVVNLLLEAIPDLHLRLDANRAWTPLKGQQFAKYVNPDYRDRIAFLEEPCKTRDDSRAF ----------1111-----%%%%---------111111111111---------------- ARETGIAIAWDESLREPDFAFVAEEGVRAVVIKPTLTGSLEKVREQVQAAHALGLTAVIS ----------3333---------2222-----3333--------------1111------ 
SSIESSLGLTQLARIAAWLTPDTIPGLDTLDLMQAQQVRRWPGSTLPVVEVDALERLL -------------------1111-----3333--------2222-----3333----- >MEVALONATE 5-DIPHOSPHATE ; SWP:P32377; PDB:1FI4A; VYTASVTAPVNIATLKYWGKRDTKLNLPTNSSISVTLSQDDLRTLTSAATAPEFERDTLW ------------------------------------------------------------ LNGEPHSIDNERTQNCLRDLRQLRKEESKDASLPTLSQWKLHIVSENNFPTAAGLASSAA -----------3333--------------1111-3333-----------------3333- GFAALVSAIAKLYQLPQSTSEISRIARKGSGSACRSLFGGYVAWEGKAEDGHDSAVQIAD ----------1111---3333--------!!!!1111----------1111--------3 SSDWPQKACVLVVSDIKKDVSSTQGQLTVATSELFKERIEHVVPKRFEVRKAIVEKDFAT 333----------------------------3333------------------------- FAKETDSNSFHATCLDSFPPIFYNDTSKRIISWCHTINQFYGETIVAYTFDAGPNAVLYY -------------1111------------------------------------------- LAENESKLFAFIYKLFGSVPGWDKKFTTEQLEAFNHQFESSNFTARELDLELQKDVARVI 3333-----------1111------------------1111-------33331111---- LTQVGSGPQETNESLIDAKTGL ---------------------- >EH DOMAIN PROTEIN REPS1; SWP:O54916; PDB:1FI6A; WKITDEQRQYYVNQFKTIQPDLNGFIPGSAAKEFFTKSKLPILELSHIWELSDFDKDGAL ---3333-------3333--------3333-----3333-3333--------1111---- TLDEFCAAFHLVVARKNGYDLPEKLPESLMPK -------------------------------- >NATURAL KILLER CELL PROTE; SWP:P18291; PDB:1FI8A; IIGGHEAKPHSRPYMAYLQIMDE -------22221111-------- >MANNOSE-BINDING PROTEIN-A; SWP:P19999; PDB:1FIFA; AIEVKLANMEAEINTLKSKLELTNKLHAFSMGKKSGKKFFVTNHERMPFSKVKALCSELR ----------------------------1111-2222----------3333-----1111 GTVAIPRNAEENKAIQEVAKTSAFLGITDEVTEGQFMYVTGGRLTYSNWKKDQPDDWYGH ----------------------------3333-----1111--------2222----333 GLGGGEDCVHIVDNGLWNDDSCQRPYTAVCEFPA 3----------1111-----3333---------- >IGG1-KAPPA 1F7 FAB (HEAVY; SWP:NA; PDB:1FIGH; DVQLQQSGPELEKPGASVKISCKASGFSLPGHNINWIVQRNGKSLEWIGNIDPYYGGTNF ---------------------------------------2222----------------- NPKFKGKATLTVDKSSSTLYMHLTSL -------------------------- >PROFILIN; SWP:P07737; PDB:1FIL; AGWNAYIDNLMADGTCQDAAIVGYKDSPSVWAAVPGKTFVNITPAEVGVLVGKDRSSFYV 
-3333----------------------------22223333-------1111-----111 NGLTLGGQKCSVIRDSLLQDGEFSMDLRTKSTGGAPTFNVTVTKTDKTLVLLMGKEGVHG 1---iiii-----------------------2222---------1111------222233 GLINKKCYEMASHLRRSQY 33-----------3333-- >SSO1 PROTEIN; SWP:P32867; PDB:1FIOA; MHDFVGFMNKISQINRDLDKYDHTINQVDSLHKRLLTEVNEEQASHLRHSLDNFVAQATD 1111-----------------------------------3333----------------- LQFKLKNEIKSAQRDGIHDTNKQAQAENSRQRFLKLIQDYRIVDSNYKEENKEQAKRQYM -------------1111------------------------------------------- IIQPEATEDEVEAAISDVGGQQIFSQALLEAKTALAEVQARHQELLKLEKSMAELTQLFN --1111------------------------------------------------------ DMEELVIEQQ ---------- >n/a; SWP:P0A6R3; PDB:1FIPA; PLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQALLDMVMQYTRGNQTRAALMMGINR 3333---------1111---------------------------iiii------------ GTLRKKLKKYGMN -------1111-- >FRAGILE HISTIDINE PROTEIN; SWP:P49789; PDB:1FIT; SFRFGQHLIKPSVVFLKTELSFALVNRKPVVPGHVLVCPLRPVERFHDLRPDEVADLFQT ---!!!!--3333-----------------2222----------3333------------ TQRVGTVVEKHFHGTSLTFSQDGPEAGQTVKHVHVHVLPRKAGDASWRSEEEAAEAAALR ---------1111---------1111--------------2222----3333------33 VYFQ 33-- >TYPE II RESTRICTION ENZYM; SWP:P31032; PDB:1FIUA; MQPLFTQERRIFHKKLLDGNILATNNRGVVSNADGSNTRSFNIAKGIADLLHSETVSERL -----------------------------11111111-----------1111-------- PGQTSGNAFEAICSEFVQSAFEKLQHIRPGDWNVKQVGSRNRLEIARYQQYAHLTALAKA -------------------33333333----------3333-3333-3333--------- AEENPELAAALGSDYTITPDIIVTRNLIADAEINRNEFLVDENIATYASLRAGNGNMPLL ------------3333------------3333-1111---11111111------------ HASISCKWTIRSDRAQNARSEGLNLVRNRKGRLPHIVVVTAEPTPSRISSIALGTGEIDC -------------------------1111--------------3333------------- VYHFALYELEQILQSLNYEDALDLFYIMVNGKRLKDISDLPLDLAV ----------------------------1111---3333-3333-- >BETA-ACROSIN HEAVY CHAIN; SWP:Q9GL10; PDB:1FIWA; IIGGQDAAHGAWPWMVSLQIFTYHNNR -------22221111------------ >BETA-ACROSIN HEAVY CHAIN; SWP:P08001; PDB:1FIZA; VVGGMSAEPGAWPWMVSLQIFMYHNNR -------22221111------------ >Outer surface protein 
A [; SWP:P14013; PDB:1FJ1B; QIQLVQSGPELKKPGETVKISCKASGYTFTDYSMYWVKQAPGKGLKRMGWINTETGEPTY ------------2222-----------1111--------2222----------------- ADDFKGRFALSLDTSASTAYLHISNLKNEDTATYFCARGLDSWGQGTSVTVSSAKTTPPS 1111--------3333----------3333------------------------------ VYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSGLYTMSSS ------------------------------------------------------------ VTVPSSTWPSQTVTCSVAHPASSTTVDKKLEPS ---3333-----------3333----------- >Outer surface protein A [; SWP:P14013; PDB:1FJ1E; SLDEKNSVSVDLPGEMKVLVSKEKNKDGKYDLIATVDKLELKGTSDKNNGSGVLEGVKAD ------------------------------------------------------------ KCKVKLTISDDLGQTTLEVFKEDGKTLVSKKVTSKDKSSTEEKFNEKGEVSEKIITRADG --------1111--------------------------------------------1111 TRLEYTGIKSDGSGKAKEVLKGYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGEVSVELN --------3333-----------------3333------!!!!----------------- DTDSSAATKKTAAWNSGTSTLTITVNSKKTKDLVFTKENTITVQQYDSNGTKLEGSAVEI ----3333----------------iiii-------1111-------1111---------- TKLDEIKNALK -3333--1111 >PROTEIN (ACYL PROTEIN THI; SWP:O75608; PDB:1FJ2A; MSTPLPAIVPAARKATAAVIFLHGLGDTGHGWAEAFAGIRSSHIKYICPHAPVRPVTLNM ----------------------------------------1111---------------- NVAMPSWFDIIGLSPDSQEDESGIKQAAENIKALIDQEVKNGIPSNRIILGGFSQGGALS -------------1111---------------------1111-1111------------- LYTALTTQQKLAGVTALSCWLPLRASFPQGPIGGANRDISILQCHGDCDPLVPLMFGSLT ----------------------3333-------1111---------------3333---- VEKLKTLVNPANVTFKTYEGMMHSSCQQEMMDVKQFIDKLLPPI --------3333-------------------------------- >NUCLEOLIN RBD1; SWP:P08199; PDB:1FJ7A; GSHMLEDPVEGSESTTPFNLFIGNLNPNKSVAELKVAISELFAKNDLAVVDVRTGTNRKF ----------%%%%-----------1111------------------------------- GYVDFESAEDLEKALELTGLKVFGNEIKLEKPKGRDGTRGC ---------------------iiii---------------- >PEPTIDYL PROLYL CIS/TRANS; SWP:Q9Y237; PDB:1FJDA; GSGPKGGGNAVKVRHILCEKHGKIMEAMEKLKSGMRFNEVAAQYSEDKARQGGDLGWMTR -----------------1111-----------------------------1111------ GSMVGPFQEAAFALPVSGMDKPVFTDPPVKTKFGYHIIMVEGRK 
----3333------------------------------------ >Nucleolin; SWP:P08199; PDB:1FJEB; GSHMVEGSESTTPFNLFIGNLNPNKSVAELKVAISELFAKNDLAVVDVRTGTNRKFGYVD ------------------------------------------------------------ FESAEDLEKALELTGLKVFGNEIKLEKPKGRDSKKVRAARTLLAKNLSFNITEDELKEVF --3333-----------iiii--------------3333--------------------1 EDALEIRLVSQDGKSKGIAYIEFKSEADAEKNLEEKQGAEIDGRSVSLYYTGEKG 111-------------------------------------%%%%----------- >30S ribosomal protein S9; SWP:P80374; PDB:1FJGI; EQYYGTGRRKEAVARVFLRPGNGKVTVNGQDFNEYFQGLVRAVAALEPLRAVDALGRFDA --------------------------iiii1111----33333333-3333--------- YITVRGGGKSGQIDAIKLGIARALVQYNPDYRAKLKPLGFLTRDARVVERKKYGKHKARR ----------------------3333-3333---33331111------------------ APQYSKR ------- >30S ribosomal protein S18; SWP:Q5SLQ0; PDB:1FJGR; PSRKAKVKATLGEFDLRDYRNVEVLKRFLSETGKILPRRRTGLSGKEQRILAKTIKRARI -----3333-----1111---33331111-------3333-------------------- LGLLPFTEKLVRK ------------- >3ALPHA-HYDROXYSTEROID DEH; SWP:Q9ZFY9; PDB:1FJHA; MSIIVISGCATGIGAATRKVLEAAGHQIVGIDIRDAEVIADLSTAEGRKQAIADVLAKCS -------3333-----------------------------------------------11 KGMDGLVLCAGLGPQTKVLGNVVSVNYFGATELMDAFLPALKKGHQPAAVVISSVASAHL 11---------------3333-------------------1111---------3333--- AFDKNPLALALEAGEEAKARAIVEHAGEQGGNLAYAGSKNALTVAVRKRAAAWGEAGVRL 11111111--------------1111---------------------------1111--- NTIAPGAFVPPMGRRAEPSEMASVIAFLMSPAASYVHGAQIVIDGGIDAVMRPTQF ----------2222--3333---------3333----------iiii----1111- >HYPOTHETICAL 17.1 KDA PRO; SWP:P12994; PDB:1FJJA; AKLISNDLRDGDKLPHRHVFNGGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGW --------2222--------------------------2222------------------ WHWVVVNLPADTRVLPQGFGSGLVAPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHA --------1111---2222------2222----1111---------2222---------- LDIERIDVDEGASGAVGFNVHFHSLASASITAFS --------1111---------------------- ------------------------------------------------------------ ----- >MICROCYSTIN-LR TOXIN; SWP:P20653; PDB:1FJMA; 
LNLDSIIGRLLEVQGSRPGKNVQLTENEIRGLCLKSREIFLSQPILLELEAPLKICGDIH ----------1111--2222---------------------------------------- GQYYDLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNHE -----------------------------------------------1111-----1111 CASINRIYGFYDECKRRYNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSMEQI 3333--------------3333-------1111-----%%%%--------1111-----3 RRIMRPTDVPDQGLLCDLLWSDPDKDVQGWGENDRGVSFTFGAEVVAKFLHKHDLDLICR 333--------------------1111-----1111------------------------ AHQVVEDGYEFFAKRQLVTLFSAPNYCGEFDNAGAMMSVDETLMCSFQILKPAD ----1111----%%%%-----------------------1111----------- >DEFENSIN MGD-1; SWP:P80571; PDB:1FJNA; GFGCPNNYQCHRHCKSIPGRCGGYCGGWHRLRCTCYRCG %%%%-----------------------%%%%-------- >METHUSELAH ECTODOMAIN; SWP:O97148; PDB:1FJRA; DILECDYFDTVDISAAQKLQNGSYLFEGLLVPAILTGEYDFRILPDDSKQKVARHIRGCV -----1111---1111--1111---iiii--3333--------1111-----------33 CKLKPCVRFCCPHDHIMDNGVCYDNMSDEELAELDPFLNVTLDDGSVSRRHFKNELIVQW 33---------1111--iiii--------------------1111--------------- DLPMPCDGMFYLDNREEQDKYTLFENGTFFRHFDRVTLRKREYCLQHLTFADGNATSIRI ---------------1111----1111----1111---1111------------------ APHNCLIV -------- >NONSPECIFIC LIPID-TRANSFE; SWP:P19656; PDB:1FK5A; AISCGQVASAIAPCISYARGQGSGPSAGCCSGVRSLNNAARTTADRRAACNCLKNAAAGV ----------3333--1111-----------------------------------33332 SGLNAGNAASIPSKCGVSIPYTISTSTDCSRVN 222---------1111-------11113333-- >FK506 BINDING PROTEIN; SWP:P18203; PDB:1FKL; VVTPGGRFPKRLDGKKF ---------22------ >GYP1P; SWP:Q08484; PDB:1FKMA; NSIIQRISKFDNILKDKTIINQQDLRQISWNGIPKIHRPVVWKLLIGYLPVNTKRQEGFL ---------------------------3333--3333--------------3333----- QRKRKEYRDSLKHTFSDQHSRDIPTWHQIEIDIPRTNPHIPLYQFKSVQNSLQRILYLWA --------------------------------111111111111---------------- IRHPASGYVQGINDLVTPFFETFLTEYLPPSQIDDVEIKDPSTYVDEQITDLEADTFWCL --3333----3333----------111111111111---1111----------------- TKLLEQITDNYIHGQPGILRQVKNLSQLVKRIDADLYNHFQNEHVEFIQFAFRWNCLLRE ------1111-2222------------------------------3333-3333------ 
FQGTVIRWDTYLSETSSLNEFHVFVCAAFLIKWSDQLEDFQETITFLQNPPTKDWTETDI ---------------------------------3333--------1111--11113333- ELLSEAFIWQSLYK -------------- >ALPHA-LACTALBUMIN; SWP:P00712; PDB:1FKQA; MEQLTKCEVFQKLKDLKDYGGVSLPEWVCVAFHTSGYDTQAIVQNNDSTEYGLFQINNKI ------------3333-2222-------------iiii-------------1111----- WCKDDQNPHSRNICNISCDKFLDDDLTDDIVCAKKILDKVGINYWLAHKALCSEKLDQWL ---3333----1111-3333--------------------1111----3333--3333-- CEKL 1111 >ENDOTHELIAL-MONOCYTE ACTI; SWP:Q12904; PDB:1FL0A; IDVSRLDLRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVIL -3333----------------1111------------------1111-3333-------- LCNLKPAKMRGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKK ----------------------1111------22222222---3333------------- IWEQIQPDLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIKL 33333333---1111---iiii---2222--------------- >PROTEASE; SWP:NA; PDB:1FL1A; AQGLYVGGFVDVVSEPLPITIEHLPETEVGWTLGLFQVSHGIFCTGAITSPAFLELASRL --------------------%%%%-------------1111------------------- ADTSHVARAPVKNLPKEPLLEILHTWLPGLSLSFQHVSLCALGRRRGTVAVYGHDAEWVV 11111111------------------------------------2222------------ SRFSSVSKSERAHILQHVSSCRLEDLSTPNFVSPLETLMAKAIDAGFIRDRLDLLKTDRG --1111---------------------------3333----------2222--------- VASILSPVYLKA ------------ >ALKYL HYDROPEROXIDE REDUC; SWP:P35340; PDB:1FL2A; AYDVLIVGSGPAGAAAAIYSARKGIRTGLMGERFGGQILDTVDIENYISVPKTEGQKLAG --------------------1111----------------------2222---------- ALKVHVDEYDVDVIDSQSASKLIPAAVEGGLHQIETASGAVLKARSIIVATGAKWRNMNV --------------------------2222-----1111--------------------2 PGEDQYRTKGVTYCPHCDGPLFKGKRVAVIGGGNSGVEAAIDLAGIVEHVTLLEFAPEMK 2221111------333333332222----------------------------------- ADQVLQDKLRSLKNVDIILNAQTTEVKGDGSKVVGLEYRDRVSGDIHNIELAGIFVQIGL ------------------------------------------------------------ LPNTNWLEGAVERNRMGEIIIDAKCETNVKGVFAAGDCTTVPYKQIIIATGEGAKASLSA ---3333------1111----1111---2222---1111--------------------- FDYLIRTKTA ---------- >BLUE FLUORESCENT ANTIBODY; SWP:NA; PDB:1FL3H; 
AALLESGGGLVKPGGSLKLSCTASGITFSRYIMSWVRQIPEKRLEWVASISSGGITYYPD -----------2222------------1111-------3333--------1111----33 SVAGRFTISRDNVRNILYLQMSSLRSEDTALYYCARGQGRPYWGQGTSVTVSAAKTTPPS 33----------------------1111-------------------------------- VYPAAPGCGDTTGSSVTLGCLVKGYFPEPVTVTWNSGGSSVHTFPALLQSGLYTMSSSVT ---------------------------------%%%%-----------iiii-------- VPSSTWPSTVTCSVAHPASSTTVDKKLE -1111----------1111--------- >BLUE FLUORESCENT ANTIBODY; SWP:NA; PDB:1FL3L; AALTQSPVSNPVTLGTSASISCRSTKSLLHSNGITYLYWYLQKPGQSPQLLIYQMSNLAS ------------2222-------------1111-------------------------22 GVPNRFSSSGSGTDFTLRINTVEAEDVGVYYCAQNLELPPTFGAGTKLELKRADAAPTVS 22--------------------3333---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ---------------------------------%%%%----------------------- STLTLTKDEYERHNGYTCEATHKTSTSPIVKSFN -----3333------------------------- >ANTIBODY GERMLINE PRECURS; SWP:NA; PDB:1FL5H; QVQLVESGGGLVQPGGSLRLSCATSGFTFTDYYMSWVRQPPGKALEWLGFIRNKA ---------------------------3333--------2222------------ >Follitropin subunit beta ; SWP:P01225; PDB:1FL7B; CELTNITIAIEKEECRFCISINTAWCAGYCYTRDLVYKDPARPKIQKTCTFKELVYETVR -----------3333--------------------------------------------- VPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPSYCSFGEM -------------------------------------1111------ >HAEMAGGLUTININ-ESTERASE-F; SWP:P07975; PDB:1FLCA; EKIKICLQKQVNSSFSLHNGFGGNLYATEEKRMFELVKPKAGASVLNQSTWIGFGDSRTD ------------------------------------------------------------ KSNSAFPRSADVSAKTADKFRFLSGGSLMLSMFGPPGKVDYLYQGCGKHKVFYEGVNWSP -------------3333-----2222----------------------------333333 HAAINCYRKNWTDIKLNFQKNIYELASQSHCMSLVNALDKTIPLQVTAGTAGNCNNSFLK 33------------------------------------------------3333------ NPALYTQEVKPSENKCGKENLAFFTLPTQFGTYECKLHLVASCYFIYDSKEVYNKRGCDN ----------1111---------------------------------------------- YFQVIYDSFGKVVGGLDNRVSPYTGNSGDTPTMQCDMLQLKPGRYSVRSSPRFLLMPERS ------------------------------------------------------------ 
YCFDMKEKGPVTAVQSIWGKGRESDYAVDQACLSTPGCMLIQKQKPYIGEADDHHGDQEM ----------------------------------2222----------3333-------- RELLSGLDYEARCISQSGWVNETSPFTEKYLLPPKFGRCPLAAKEESIPKIPDGLLIPTS ---3333-------1111------------------------------------------ GTDTTVT ------- >Hemagglutinin-esterase-fu; SWP:P07975; PDB:1FLCB; IDDLIIGVLFVAIVETGIGGYLLGSRKESGGGVTKESAEKGFEKIGNDIQILKSSINIAI -!!!!--------------------------------1111------------------- EKLNDRISHDEQAIRDLTLEIENARSEALLGELGIIRALLVGNISIGLQESLWELASEIT -----------------3333--------------------------------------- NRAGDLAVEVSPGCWIIDNNICDQSCQNFIFKFNETAPVPTI --3333----2222---3333--------------------- >Elafin [Precursor]; SWP:P19957; PDB:1FLEI; TKPGSCPIILIRCAMLNPPNRCLKDTDCPGIKKCCEGSCGMACFVPQ -----------------------1111-------------------- >QUINOPROTEIN ETHANOL DEHY; SWP:Q9Z4J7; PDB:1FLGA; KDVTWEDIANDDKTTGDVLQYGMGTHAQRWSPLKQVNADNVFKLTPAWSYSFGDEKQRGQ ---------3333------22221111---------11111111--------%%%%---- ESQAIVSDGVIYVTASYSRLFALDAKTGKRLWTYNHRLPDDIRPCCDVVNRGAAIYGDKV ------!!!!---------------------------------------------!!!!- FFGTLDASVVALNKNTGKVVWKKKFADHGAGYTMTGAPTIVKDGKTGKVLLIHGSSGDEF ------------3333----------3333--------------------------1111 GVVGRLFARDPDTGEEIWMRPFVEGHMGRLNGKDSTVTGDVKAPSWPDDRNSPTGKVESW ----------------------2222---iiii------1111-----2222-----333 SHGGGAPWQSASFDAETNTIIVGAGNPGPWNTWARTAKGGNPHDYDSLYTSGQVGVDPSS 3-------------1111------------3333--2222-------------------- GEVKWFYQHTPNDAWDFSGNNELVLFDYKAKDGKIVKATAHADRNGFFYVVDRSNGKLQN ------------------------------------------1111-------------- AFPFVDNITWASHIDLKTGRPVEREGQRPPLPEPGQKHGKAVEVSPPFLGGKNWNPMAYS --------------------------------2222----------3333---------- QDTGLFYVPANHWKEDYWTEEVSYTKGSAYLGMGFRIKRMYDDHVGSLRAMDPVSGKVVW ------------------------2222-------------------------------- EHKEHLPLWAGVLATAGNLVFTGTGDGYFKAFDAKSGKELWKFQTGSGIVSPPITWEQDG ---------------------------------------------------------%%% EQYLGVTVGYGGAVPLWGGDMADLTRPVAQGGSFWVFKLPSW 
%-----------3333-!!!!---1111-------------- >FLI-1; SWP:Q01543; PDB:1FLIA; PGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKL --------------3333---------------------------------3333--333 SRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHP 3--------------3333------------------- >CARBONIC ANHYDRASE III; SWP:P14141; PDB:1FLJA; AKEWGYASHNGPEHWHELYPIAKGDNQSPIELHTKDIRHDPSLQPWSVSYDPGSAKTILN ------3333333333333333----------3333---1111-------1111------ NGKTCRVVFDDTFDRSMLRGGPLSGPYRLRQFHLHWGSSDDHGSEHTVDGVKYAAELHLV -------------------!!!!---------------1111-----iiii--------- HWNPKYNTFGEALKQPDGIAVVGIFLKIGREKGEFQILLDALDKIKTKGKEAPFNHFDPS ---11113333--------------------3333-----3333--2222---------1 CLFPACRDYWTYHGSFTTPPCEECIVWLLLKEPMTVSSDQMAKLRSLFASAENEPPVPLV 111--------------------------------------------------------- GNWRPPQPIKGRVVRASFK ------------------- >TNF RECEPTOR ASSOCIATED F; SWP:Q13114; PDB:1FLKA; LESVDKSAGQVARNTGLLESQLSRHDQMLSVHDIRLADMDLRFQVLETASYNGVLIWKIR ---------3333----------------------------------------------- DYKRRKQEAVMGKTLSLYSQPFYTGYFGYKMCARVYLNGDGMGKGTHLSLFFVIMRGEYD --------1111---------------------------!!!!----------------- ALLPWPFKQKVTLMLMDQGSSRRHLGDAFKPDPNSSSFKKPTGEMNIASGCPVFVAQTVL -------------------------------3333--------------------3333- ENGTYIKDDTIFIKVIVDTSDLPDP ------%%%%--------------- >FMN-BINDING PROTEIN; SWP:Q46604; PDB:1FLMA; MLPGTFFEVLKNEGVVAIATQGEDGPHLVNTWNSYLKVLDGNRIVVPVGGMHKTEANVAR --3333---------------1111------1111----------------------111 DERVLMTLGSRKVAGRNGPGTGFLIRGSAAFRTDGPEFEAIARFKWARAALVITVVSAEQ 1---------------------------------------3333---------------- TL -- >FLP RECOMBINASE; SWP:P03870; PDB:1FLOA; PQFDILCKTPPKVLVRQFVERFERPSGEKIALCAAELTYLCWMITHNGTAIKRATFMSYN ---------3333------1111--33331111------------iiii----------- TIISNSLSFDIVNKSLQFKYKTQKATILEASLKKLIPAWEFTIIPYYSDITDIVSSLQLQ --1111-----------------------------1111--------------------- FESKGNSHSKKMLKALLSEGESIWEITEKILNSFEYTSRFTKTKTLYQFLFLATFINCGR 
-----------------------------1111---------------------1111-3 FSDIKNVDPKSFKLVQNKYLGVIIQCLVTETKTSVSRHIYFFSARGRIDPLVYLDEFLRN 33311111111-------------------1111--------------3333-------- SEPVLKRVNRTGNSSSNKQEYQLLKDNLVRSYNKALKKNAPYSIFAIKNGPKSHIGRHLM ------------------------1111-------------3333-2222-1111----- TSFLSMKGLTELTNVVGNWSDKTTYTHQITAIPDHYFALVSRYYAYDPISKEMIALKDET ---------1111-------------------3333-----------1111--------- NPIEEWQHIEQLKGSAEGSIRYPAWNGIISQEVLDYLSSYINRRI ---------------3333--3333----3333------------ >HEMOGLOBIN I (AQUO MET); SWP:P41260; PDB:1FLP; SLEAAQKSNVTSSWAKASAAWGTAGPEFFMALFDAHDDVFAKFSGLFSGAAKGTVKNTPE -------------------3333---------------------1111--11111111-- MAAQAQSFKGLVSNWVDNLDNAGALEGQCKTFAANHKARGISAGQLEAAFKVLSGFMKSY -----------------1111---------------1111-3333----------3333- GGDEGAWTAVAGALMGEIEPDM -----------------3333- >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1FLRH; EVKLDETGGGLVQPGRPMKLSCVASGFTFSDYWMNWVRQSPEKGLEWVAQIRNKPYNYET ---------------------------3333----------------------1111--- YYSDSVKGRFTISRDDSSVYLQMNNLRVEDMGIYYCTGSYYGMDYWGQGTSVTVSSAKTT --3333--------------------3333------------------------------ APSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTL ------------------------------------%%%%--2222-------------- SSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR ------3333-----------3333----------- >Vascular endothelial grow; SWP:P17948; PDB:1FLTX; GRPFVEMYSEIPEIIHMTEGRELVIPCRVTSPNITVTLKKFPLDTLIPDGKRIIWDSRKG -----------------2222---------1111---------------------1111- FIISNATYKEIGLLTCEATVNGHLYKTNYLTHRQT ------3333---------iiii------------ >Molybdopterin-converting ; SWP:P30748; PDB:1FM0D; MIKVLFFAQVRELVGTDATEVAADFPTVEALRQHMAAQSDRWALALEDGKLLAAVNQTLV -----------------------------------------------1111---%%%%-- SFDHPLTDGDEVAFFPPVTGG 1111--2222----------- >Molybdopterin-converting ; SWP:P30749; PDB:1FM0E; AETKIVVGPQPFSVGEEYPWLAERDEDGAVVTFTGKVRVNALTLEHYPGMTEKALAEIVD ------------3333-------3333-------------------1111---------- 
EARNRWPLGRVTVIHRIGELWPGDEIVFVGVTSAHRSSAFEAGQFIMDYLKTRAPFWKRE --------------------2222------------------------------------ ATPEGDRWVEARESDQQAAKRW -1111------3333---3333 >MAJOR POLLEN ALLERGEN BET; SWP:P43185; PDB:1FM4A; GVFNYETEATSVIPAARMFKAFILDGDKLVPKVAPQAISSVENIEGNGGPGTIKKINFPE ---------------------------------1111-----------2222------22 GFPFKYVKDRVDEVDHTNFKYNYSVIEGGPVGDTLEKISNEIKIVATPDGGCVLKISNKY 22-------------------------!!!!---------------1111---------- HTKGNHEVKAEQVKASKEMGETLLRAVESYLLAHSDAYN --------3333---------------------1111-- >EIAV PROTEASE; SWP:P32542; PDB:1FMB; VTYNLEKRPTTIVLINDTPLNVLLDTGADTSVLTTAHYNRLKYRGRKYQGTGIGGVGGNV --------------%%%%------1111---------1111------------------- ETFSTPVTIKKKGRHIKTRMLVADIPVTILGRDILQDLGAKLVL ----------%%%%------------------------------ >7 ALPHA-HYDROXYSTEROID DE; SWP:P25529; PDB:1FMCA; MFNSDNLRLDGKCAIITGAGAGIGKEIAITFATAGASVVVSDINADAANHVVDEIQQLGG --3333--2222------------------------------------------------ QAFACRCDITSEQELSALADFAISKLGKVDILVNNAGGGGPKPFDMPMADFRRAYELNVF -------1111------------------------------------------------- SFFHLSQLVAPEMEKNGGGVILTITSMAAENKNINMTSYASSKAAASHLVRNMAFDLGEK -------------1111--------3333------3333---------------1111-- NIRVNGIAPGAILTDALKSVITPEIEQKMLQHTPIRRLGQPQDIANAALFLCSPAASWVS -------------3333------------1111------3333------------1111- GQILTVSGGGVQELN ------iiii----- >Foot and mouth disease vi; SWP:Q65095; PDB:1FMD1; TTTTGESADPVTTTVENYGGETQVQRRHHTDVAFVLDRFVKVTVSDNQHTLDVMQAHKDN ---3333------3333---------3333-3333----------------3333-1111 IVGALLRAATYYFSDLEIAVTHTGKLTWVPNGAPVSALNNTTNPTAYHKGPVTRLALPYT -----1111--------------------22223333--1111----------------- APHRVLATAYTGAHLPTSFNFGAVKAETITELLVRMKRAELYCPRPILPIQPTGDRHKQP ---------------3333----------------------------------------- LVAPAKQ ------- >Genome polyprotein [Fragm; SWP:P15072; PDB:1FMD2; DKKTEETTLLEDRILTTRNGHTTSTTQSSVGVTFGYATAEDSTSGPNTSALETRVHQAER --------------------------------------------3333------3333-- 
FFKMALFDWVPSQNFGHMHKVVLPHEPKGVYGGLVKSYAYMRNGWDVEVTAVGNQFNGGC -------------2222-----------33333333-----------------1111--- LLVALVPEMGDISDREKYQLTLYPHQFINPRTNMTAHITVPYVGVNRYDQYKQHRPWTLV ------------3333---1111-----3333-----------------1111------- VMVVAPLTTNTAGAQQIKVYANIAPTNVHVAGELPSKE -------------------------------------- >Polyprotein; SWP:Q9YQQ5; PDB:1FMD3; GIFPVACSDGYGNMVTTDPKTADPAYGKVYNPPRTALPGRFTNYLDVAEACPTFLMFENV ------------------------------------------3333-------------- PYVSTRTDGQRLLAKFDVSLAAKHMSNTYLAGLAQYYTQYTGTINLHFMFTGPTDAKARY ---------------------3333--------1111---------------1111---- MVAYVPPGMDAPDNPEEAAHCIHAEWDTGLNSKFTFSIPYISAADYTYTASHEAETTCVQ -------------33331111------------------------------3333--111 GWVCVYQITHGKADADALVVSASAGKDFELRLPVDARQQ 1-----------------------1111----------- ------------------------------- ------------------------------- >RETINOL DEHYDRATASE; SWP:Q26490; PDB:1FMJA; PFPYEFRELNPEEDKLVKANLGAFPTTYVKLGPKGYMVYRPYLKDAANIYNMPLRPTDVF --------------------3333-------1111---3333------1111--1111-- VASYQRSGTTMTQELVWLIENDLNFEAAKTYMSLRYIYLDGFMIYDPEKQEEYNDILPNP ---2222-----------------3333--3333---1111----333333331111-33 ENLDMERYLGLLEYSSRPGSSLLAAVPPTEKRFVKTHLPLSLMPPNMLDTVKMVYLARDP 33-----------1111-----33331111--------3333-2222------------- RDVAVSSFHHARLLYLLNKQSNFKDFWEMFHRGLYTLTPYFEHVKEAWAKRHDPNMLFLF -----------1111--1111--------1111-2222---------1111-1111---3 YEDYLKDLPGCIARIADFLGKKLSEEQIQRLCEHLNFEKFKNNGAVNMEDYREIGILADG 333-------------1111----------------------3333-33333333--222 EHFIRKGKAGCWRDYFDEEMTKQAEKWIKDNLKDTDLRYPNM 2--------3333----------------1111-----3333 >Heparin-binding growth fa; SWP:Q7SIF8; PDB:1FMMS; QKPKLLYCSNGGYFLRIFPDGKVDGTRDRSDPYIQLQFYAESVGEVYIKSLETGQYLAMD ---------------------------3333----------------------------- SDGQLYASQSPSEECLFLERLEENNYNTYKSKVHADKDWFVGIKKNGKTKPGSRTHFGQK -----------3333--------------------------------------------- AILFLPLPVSSD ------------ >METHIONYL-TRNA FMET FORMY; SWP:P23882; PDB:1FMTA; 
SESLRIIFAGTPDFAARHLDALLSSGHNVVGVFTQPDRPLMPSPVKVLAEEKGLPVFQPV ----------------------1111----------------3333---1111------- SLRPQENQQLVAELQADVMVVVAYGLILPKAVLEMPRLGCINVHGSLLPRWRGAAPIQRS ----------------------------3333---1111--------------------- LWAGDAETGVTIMQMDVGLDTGDMLYKLSCPITAEDTSGTLYDKLAELGPQGLITTLKQL --------------------------------1111------------------------ ADGTAKPEVQDETLVTYAEKLSKEEARIDWSLSAAQLERCIRAFNPWPMSWLEIEGQPVK ----------3333-------3333---1111---------------------%%%%--- VWKASVIDTATNAAPGTILEANKQGIQVATGDGILNLLSLQPAGKKAMSAQDLLNSRREW -------------2222----3333----------------2222---3333----3333 FVPGNRLV -2222--- >MONOCLONAL ANTIBODY AGAIN; SWP:NA; PDB:1FN4A; DIKLTQSPSLLSASVGDRVTLSCKGSQNINNYLAWYQQKLGEAPKLLIYNTNSLQTGIPS ----------------------------------------------------------11 RFSGSGSGTDYTLTISSLQPEDVATYFCYQYNNGYTFGAGTKLELKRTAPTVSIFPPSTE 11---------------------------------------------------------- QLATGGASVVCLMNNFYPRDISVKWKIDGTERRDGVLDSVTDQDSKDSTYSMSSTLSLTK ------------------------------------------------------------ ADYESHNLYTCEVVHKTSSSPVVKSFNRNEC ------------------------------- >MONOCLONAL ANTIBODY AGAIN; SWP:NA; PDB:1FN4B; QVQLLESGPGLVRPSETLSLTCTVSGFSLTSFSVSWVRHPSGKGPEWMGRMWYDGYTAYN ------------------------------------------------------------ SALKSRLSISRDTSKNQVFLKMNSL 1111--------------------- >OUTER-CAPSID PROTEIN SIGM; SWP:Q98639; PDB:1FN9A; MEVCLPNGHQVVDLINNAFEGRVSIYSAQEGWDKTISAQPDMMVCGGAVVCMHCLGVVGS -----------------------------!!!!-----------!!!!-----------3 LQRKLKHLPHHRCNQQIRHQDYVDVQFADRVTAHWKRGMLSFVAQMHEMMNDVSPDDLDR 333--------------3333----------------------------1111------- VRTEGGSLVELNWLQVDPNSMFRSIHSSWTDPLQVVDDLDTKLDQYWTALNLMIDSSDLI -----------3333-1111---11111111-----------------------1111-- PNFMMRDPSHAFNGVKLGGDARQTQFSRTFDSRSSLEWGVMVYDYSELEHDPSKGRAYRK ------3333----------1111------3333----------------1111------ ELVTPARDFGHFGLSHYSRATTPILGKMPAVFSGMLTGNCKMYPFIKGTAKLKTVRKLVE ---3333---1111----------%%%%-----3333----------------------- 
AVNHAWGVEKIRYALGPGGMTGWYNRTMQQAPIVLTPAALTMFPDTIKFGDLNYPVMIGD ---------------2222----------3333---3333-------------------- PMILG ----- >FIBRONECTIN CELL-ADHESION; SWP:P02751; PDB:1FNA; RDLEVVAATPTSLLISWDAPAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPG --------------------------------2222---------1111----------- VDYTITVYAVTGRGDSPASSKPISINYRTEI ------------------------------- >FERREDOXIN-NADP+ REDUCTAS; SWP:P00455; PDB:1FND; HSKKMEEGITVNKFKPKTPYVGRCLLNTKITGDDAPGETWHMVFSHEGEIPYREGQSVGV --------------3333-------------3333----------iiii---2222---- IPDGEDKNGKPHKLRLYSIASSALGDFGDAKSVSLCVKRLIYTNDAGETIKGVCSNFLCD -----1111------------3333------------------1111----------111 LKPGAEVKLTGPVGKEMLMPKDPNATIIMLGTGTGIAPFRSFLWKMFFEKHDDYKFNGLA 12222---------1111---1111----------------------------------- WLFLGVPTSSSLLYKEEFEKMKEKAPDNFRLDFAVSREQTNEKGEKMYIQTRMAQYAVEL -----------2222---------1111------1111--1111--------3333---- WEMLKKDNTYVYMCGLKGMEKGIDDIMVSLAAAEGIDWIEYKRQLKKAEQWNVEVY -----1111------3333------------1111-3333-----1111------- >MHC CLASS II I-EK, ALPHA ; SWP:P04224; PDB:1FNGA; IKEEHTIIQAEFYLLPDKRGEFMFDFDGDEIFHVDIEKSETIWRLEEFAKFASFEAQGAL -------------------------iiii------1111-----3333------3333-- ANIAVDKANLDVMKERSNNTPDANVAPEVTVLSRSPVNLGEPNILICFIDKFSPPVVNVT ----------------%%%%-----------------2222------------------- WLRNGRPVTEGVSETVFLPRDDHLFRKFHYLTFLPSTDDFYDCEVDHWGLEEPLRKHWEF --iiii-----------------------------1111-------1111---------- EE -- >FIBRONECTIN; SWP:P02751; PDB:1FNHA; PAPTDLKFTQVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSSSVVVSGL ------------------------------------------------------------ MVATKYEVSVYALKDTLTSRPAQGVVTTLENVSPPRRARVTDATETTITISWRTKTETIT ------------------------------------------------------------ GFQVDAVPANGQTPIQRTIKPDVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDASTA -------------------1111---------------------!!!!------------ IDAPSNLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREVVPRPRPGVTEATITG ------------1111---------------------------------1111------- LEPGTEYTIYVIALKNNQKSEPLIGRKKT 
----------------------------- >LOW AFFINITY IMMUNOGLOBUL; SWP:O75015; PDB:1FNLA; EDLPKAVVFLEPQWYSVLEKDSVTLKCQGAYSPEDNSTQWFHNESLISSQASSYFIDAAT -----------------2222----------1111------iiii--------------3 VNDSGEYRCQTNLSTLSDPVQLEVHIGWLLLQAPRWVFKEEDPIHLRCHSWKNTALHKVT 333-------1111------------------------2222--------2222------ YLQNGKDRKYFHHNSDFHIPKATLKDSGSYFCRGLVGSKNVSSETVNITITQA --iiii----------------3333--------------------------- >CELL DIVISION CONTROL PRO; SWP:Q8ZYK1; PDB:1FNNA; AIVVDDSVFSPSYVPKRLPHREQQLQQLDILLGNWLRNPGHHYPRATLLGRPGTGKTVTL ---------1111----2222-----------------------------2222------ RKLWELYKDKTTARFVYINGFIYRNFTAIIGEIARSLNIPFPRRGLSRDEFLALLVEHLR ------1111--------3333--3333-------------------------------- ERDLYMFLVLDDAFNLAPDILSTFIRLGQEADKLGAFRIALVIVGHNDAVLNNLDPSTRG -----------3333-3333-----3333-3333------------33331111--3333 IMGKYVIRFSPYTKDQIFDILLDRAKAGLAEGSYSEDILQMIADITGAQTPLDTNRGDAR -2222------------------------2222----------------1111------- LAIDILYRSAYAAQQNGRKHIAPEDVRKSSKEVLFGISEEVLIGLPLHEKLFLLAIVRSL -------------1111----3333----------------3333--------------- KISHTPYITFGDAEESYKIVCEEYGERPRVHSQLWSYLNDLREKGIVETRQNTTLISIGT ---------------------1111----3333--------1111--------------- EPLDTLEAVITKLIKEELR ------------------- >PEPTIDASE T; SWP:P26311; PDB:1FNOA; DKLLERFLHYVSLDTQSKSGVRQVPSTEGQWKLLRLLKQQLEEGLVNITLSEKGTLATLP --------------------------3333--------------------1111------ ANVEGDIPAIGFISHVDTSPDFSGKNVNPQIVENYRGGDIALGIGDEVLSPVFPVLHQLL ------------------3333------------------------------3333--22 GQTLITTDGKTLLGADDKAGVAEITALAVLKGNPIPHGDIKVAFTPDEEVGKGAKHFDVE 22--------------------------------------------3333-1111--333 AFGAQWAYTVDGGGVGELEFENFNAASVNIKIVGNNVHPGTAKGVVNALSLAARIHAEVP 3------------2222--------------------33332222-3333-----11111 ADEAPETTEGYEGFYHLASKGTVDRAEHYIIRDFDRKQFEARKRKEIAKKVGKGLHPDCY 1113333-!!!!---------3333----------------------------------- IELVIEDSYYNREKVVEHPHILDIAQQARDCHITPEKPIRGGTDGAQLSFGLPCPNLFTG 
----------------------------1111----------3333-------------- GYNYHGKHEFVTLEGEKAVQVIVRIAELTAKRG --1111--------------------------- >von Willebrand factor [Pr; SWP:P04275; PDB:1FNSH; QVQLKESGPGLVAPSQSLSITCTVSGFSLTDYGVDWVRQPPGKGLEWLGMIWGDGSTDYN ------------2222-----------3333--------2222--------1111----- SALKSRLSITKDNSKSQVFLKMNSLQTDDTARYYCVRDPADYGNYDYALDYWGQGTSVTV --%%%%------1111---------1111------------1111--------------- SSAKTTPPSVYPLAPGSSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYT -------------------------------------%%%%--1111------------- LSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDCG -------3333-----------3333-------------- >von Willebrand factor [Pr; SWP:P04275; PDB:1FNSL; DIQMTQSPSSLSASLGDRVTISCSASQDINKYLNWYQQKPDGAVKLLIFYTSSLHSGVPS -------------2222-----------%%%%------1111------------222233 RFSGSGSGTDYSLTISNLEPEDIATYYCQQYEKLPWTFGGGTKLEVKRADAAPTVSIFPP 33----!!!!--------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -----1111--------3333--------1111- >Proteasome component PUP2; SWP:P32379; PDB:1FNTS; EYDRGVSTFSPEGRLFQVEYSLEAIKLGSTAIGIATKEGVVLGVEKRATSPLLESDSIEK ---------1111----------3333---------------------------3333-- IVEIDRHIGCAMSGLTADARSMIEHARTAAVTHNLYYDEDINVESLTQSVCDLALRFGEG ----1111---------------------------------3333--------------- ASGEERLMSRPFGVALLIAGHDADDGYQLFHAEPSGTFYRYNAKAIGSGSEGAQAELLNE --------------------------------1111-------------------3333- WHSSLTLKEAELLVLKILKQVMEEKLDENNAQLSCITKQDGFKIYDNEKTAELIKELKEK --------------------------3333-----------------------3333--3 EAAE 333- >EXOTOXIN TYPE A PRECURSOR; SWP:P62560; PDB:1FNUA; QQDPDPSQLHRSSLVKNLQNIYFLYEGDPVTHENVKSVDQLLSHDLIYNVSGPNYDKLKT ----1111--1111--3333-3333----------------1111------2222----- ELKNQEMATLFKDKNVDIYGVEYYHLCYLCENAERSACIYGGVTNHEGNHLEIPKKIVVK ----------1111---------2222--1111--------------------------- 
VSIDGIQSLSFDIETNKKMVTAQELDYKVRKYTIDNKQLYTNGPSKYETGYIKFIPKNKE --iiii------------------------------------------------------ SFWFDFFPEPEFTQSKYLMIYKDNETLDNKTSQIEVYLTTK ------------3333----1111---3333---------- >ELAV-like protein 3; SWP:Q60900; PDB:1FNXH; MDSKTNLIVNYLPQNMTQDEFKSLFGSIGDIESCKLVRDKITGQSLGYGFVNYSDPNDAD --3333------3333-------3333-----------------------------3333 KAINTLNGLKLQTKTIKVSYARPSSASIRDANLYVSGLPKTMSQKEMEQLFSQYGRIITS -3333------------------------------------------------------- RILLDQATGVSRGVGFIRFDKRIEAEEAIKGLNGQKPLGAAEPITVKFANNPSQ ------------------------------------------------------ >BARK AGGLUTININ I,POLYPEP; SWP:Q41159; PDB:1FNYA; TGSLSFSFPKFAPNQPYLINQGDALVTSTGVLQLTNVVNGVPSSKSLGRALYAAPFQIWD --------------------------1111-------iiii------------------- STTGNVASFVTSFTFIIQAPNPATTADGLAFFLAPVDTQPLDLGGMLGIFKFNKSNQIVA --------------------1111----------1111----!!!!------1111---- VEFDTFSNGDWDPKGRHLGINVNSIESIKTVPWNWTNGEVANVFISYEASTKSLTASLVY -------1111------------------------2222---------1111-------- PSLETSFIIDAIVDVKIVLPEWVRFGFSATTGIDKGYVQTNDVLSWSFESNLPG 1111---------3333------------------------------------- >ALLOGENEIC H-2KB MHC CLAS; SWP:Q5R1F1; PDB:1FO0A; KVTQTQTSISVMEKTTVTMDCVYETQDSSYFLFWYKQTASGEIVFLIRQDSYKKENATVG -------------------------------------3333--------1111-----!! 
HYSLNFQKPKSSIGLIITATQIEDSAVYFCAMRGDYGGSGNKLIFGTGTLLSVKP !!----3333----------3333-----------iiii---------------- >T-cell receptor beta chai; SWP:P04214; PDB:1FO0B; VTLLEQNPRWRLVPRGQAVNLRCILKNSQYPWMSWYQQDLQKQLQWLFTLRSPGDKEVKS --------------------------1111--------1111---------2222----- LPGADYLATRVTDTELRLQVANMSQGRTLYCTCSADRVGNTLYFGEGSRLIV 2222------------------------------------------------ >NUCLEAR RNA EXPORT FACTOR; SWP:Q9UBU9; PDB:1FO1A; TIPYGRKYDKAWLLSMIQSKCFTPIEFHYENTRAALKAVNYKILDRENRRISIIIELKPE ---3333----------------------!!!!--1111--------------------- QVEQLKLIMSKRYDGSQQVLDLKGLRSDPDLVAQNIDVVLNRRSCMAATLRIIEENIPEL ---------1111-1111-----3333-----------1111--------------1111 LSLNLSNNRLYRLDDMSSIVQKAPNLKILNLSGNELKSERELDKIKGLKLEELWLDGNSL ------------1111-3333-1111-----------33333333---------222233 CDTFRDQSTYISAIRERFPKLLRLDGHELPPPIAF 33-----------33331111--iiii-------- >THIOREDOXIN; SWP:Q57755; PDB:1FO5A; MSKVKIELFTSPMCPHCPAAKRVVEEVANEMPDAVEVEYINVMENPQKAMEYGIMAVPTI -----------------!!!!--------------------------------------- VINGDVEFIGAPTKEALVEAIKKRL -------------3333--3333-- >ALPHA-1,3-MANNOSYL-GLYCOP; SWP:P27115; PDB:1FO8A; LAVIPILVIACDRSTVRRCLDKLLHYRPSAELFPIIVSQDCGHEETAQVIASYGSAVTHI ----------------------------3333--------------------!!!!---- RQPDLSNIAVQPDHRKFQGYYKIARHYRWALGQIFHNFNYPAAVVVEDDLEVAPDFFEYF ----------1111---------------------1111-------1111--1111---- QATYPLLKADPSLWCVSAWNDNGKEQMVDSSKPELLYRTDFFPGLGWLLLAELWAELEPK ---------1111----------1111-1111-----------------3333---3333 WPKAFWDDWMRRPEQRKGRACVRPEISRTMTFGLKFIKLNQQFVPFTQLDLSYLQQEAYD -----------3333iiii--------------1111-------1111--3333-3333- RDFLARVYGAPQLQVEKVRTNDRKELGEVRVQYTGRDSFKAFAKALGVMDDLKSGVPRAG -------------3333-----3333-----------------1111-----iiii2222 YRGIVTFLFRGRRVHLAPPQTWDGYDPSWT iiii----iiii------1111---1111- >BETA-1,4-GALACTANASE; SWP:P48842; PDB:1FOBA; ALTYRGADISSLLLLEDEGYSYKNLNGQTQALETILADAGINSIRQRVWVNPSDGSYDLD --------3333-----------1111---------1111-----------1111----- 
YNLELAKRVKAAGMSLYLDLHLSDTWADPSDQTTPSGWSTTDLGTLKWQLYNYTLEVCNT ---------------------------1111---2222---------------------- FAENDIDIEIISIGNEIRAGLLWPLGETSSYSNIGALLHSGAWGVKDSNLATTPKIMIHL -1111-----------1111---1111-----------------1111------------ DDGWSWDQQNYFYETVLATGELLSTDFDYFGVSYYPFYSASATLASLKTSLANLQSTYDK -1111-----------3333--1111------------1111------------------ PVVVVETNWPVSCPNPAYAFPSDLSSIPFSVAGQQEFLEKLAAVVEATTDGLGVYYWEPA --------------------1111------------------------------------ WIGNAGLGSSCADNLMVDYTTDEVYESIETLGEL 2222-iiii------------------------- >RAS-RELATED C3 BOTULINUM ; SWP:Q60610; PDB:1FOEA; QLSDADKLRKVICELLETERTYVKDLNCLERYLKPLQKETFLTQDELDVLFGNLTEVEFQ --------------------------------3333----------------3333---- VEFLKTLEDGVRLVPDLEKLEKVDQFKKVLFSLGGSFLYYADRFKLYSAFCASHTKVPKV ---------1111--3333--3333----------------------------------- LVKAKTDTAFKAFLDAQNPRQQHSSTLESYLIKPIQRVLKYPLLLRELFALTDAESEEHY --3333-----------333333333333111133333333-------11111111---- HLDVAIKTNKVASHINEQKIHEEFGAVFDQLIAEQTGEKKEVADLSGDLLLHTSVIWLNP ------------------------------------------------------------ PASLGKWKKEPELAAFVFKTAVVLVYKDGSKQKKKLVGSHRLSIYEEWDPFRFRHIPTEA 3333-------------1111-----------------------------------3333 LQVRALPSADAEANAVCEIVHVKSESEGRPERVFHLCCSSPESRKDFLKSVHSILRDKHR ------------------------------------------------------------ RQ -- >PROCARBOXYPEPTIDASE A-S6; SWP:P05805; PDB:1FONA; SWSWQVSLQYEKDGAFHHTCGGSLIAPDWVVTAGHCISTSRTYQVVLGEYDRSVLEGSEQ -----------%%%%-----------------3333------------------------ VIPINAGDLFVHPLWNSNCVACGNDIALVKLSRSAQLGDKVQLANLPPAGDILPNEAPCY ----2222---111111111111------------------------2222--2222--- ISGWGRLYTGGPLPDKLQQALLPTVDYEHCSQWDWWGITVKKTMVCAGGDTRSGCNGDSG ---%%%%------------------3333--1111!!!!-1111---------1111222 GPLNCPAADGSWQVHGVTSFVSAFGCNTIKKPTVFTRVSAFIDWIDETIASN 2-----3333-----------1111--2222-----3333--------1111 >IGG2A-KAPPA 17-IA FAB (HE; SWP:NA; PDB:1FORH; QGQLQQSGAELVRPGSSVKISCKASGYAFSSFWVNWVKQRPGQGLEWIGQIYPGDGDNKY 
---------------------------1111--------2222---------1111---- NGKFKGKATLTADKSSTTAYMQLYSLTSEDSAVYFCARSGNYPYAMDYWGQGTSVTVSSA 3333---------1111---------3333------------------------------ KTTAPSVYPLAPVCGGTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSGL ---------------------------------------%%%%----------------- YTLSSSVTVTSSTWPSQTITCNVAHPASSTKVDKKIEPR ---------11113333-------1111----------- >CAMP-DEPENDENT PROTEIN KI; SWP:P06244; PDB:1FOTA; YSLQDFQILRTLGTGSFGRVHLIRSRHNGRYYAMKVLKKEIVVRLKQVEHTNDERLMLSI -1111---------1111------------------------1111--3333----3333 VTHPFIIRMWGTFQDAQQIFMIMDYIEGGELFSLLRKSQRFPNPVAKFYAAEVCLALEYL --1111------------------------------------------------------ HSKDIIYRDLKPENILLDKNGHIKITDFGFAKYVPDVTYLCGTPDYIAPEVVSTKPYNKS ----------3333-------------1111------------11113333--------- IDWWSFGILIYEMLAGYTPFYDSNTMKTYEKILNAELRFPPFFNEDVKDLLSRLITRDLS ------------------------3333--3333-----1111--------------333 QRLGNLQNGTEDVKNHPWFKEVVWEKLLSRNIETPYEPPIDINYGVQGEDPYADLFRDF 32222---33331111------33331111-------------------1111------ >GLUTAREDOXIN 3; SWP:P37687; PDB:1FOVA; ANVEIYTKETCPYCHRAKALLSSKGVSFQELPIDGNAAKREEMIKRSGRTTVPQIFIDAQ --------------------------------2222--------------------iiii HIGGYDDLYALDARGGLDPLLK -----------1111-3333-- ------------------------------------------------------------ ---------------------------- >Isoliquiritigenin 2'-O-me; SWP:P93324; PDB:1FP1D; QTEDSACLSAMVLTTNLVYPAVLNAAIDLNLFEIIAKATPPGAFMSPSEIASKLPASTQH ---------------3333---------------1111-2222-------11113333-1 SDLPNRLDRMLRLLASYSVLTSTTRTIEDGGAERVYGLSMVGKYLVPDESRGYLASFTTF 111-----------1111--------1111---------------1111----------- LCYPALLQVWMNFKEAVVDEDFMGKDKKMNQIFNKSMVDVCATEMKRMLEIYTGFEGIST ---3333-11113333------------------------------------3333---- LVDVGGGSGRNLELIISKYPLIKGINFDLPQVIENAPPLSGIEHVGGDMFASVPQGDAMI ------------------3333------33331111--2222-----3333--------- LKAVCHNWSDEKCIEFLSNCHKALSPNGKVIIVEFILPEEPNTSEESKLVSTLDNLMFIT 
---3333-------------11111111-------------------------------- VGGRERTEKQYEKLSKLSGFSKFQVACRAFNSLGVMEFYK ---------------1111--------------------- >ISOFLAVONE O-METHYTRANSFE; SWP:O24529; PDB:1FP2A; RKPSEIFKAQALLYKHIYAFIDSMSLKWAVEMNIPNIIQNHGKPISLSNLVSILQVPSSK ---3333-------------------------------3333--------------3333 IGNVRRLMRYLAHNGFFEIITKEEESYALTVASELLVRGSDLCLAPMVECVLDPTLSGSY -----------------------------3333---2222-------------------- HELKKWIYEEDLTLFGVTLGSGFWDFLDKNPEYNTSFNDAMASDSKLINLALRDCDFVFD -----1111---------------------------------------------333322 GLESIVDVGGGTGTTAKIICETFPKLKCIVFDRPQVVENLSGSNNLTYVGGDMFTSIPNA 22-------!!!!---------1111------33332222-----------1111----- DAVLLKYILHNWTDKDCLRILKKCKEAVTNDGKRGKVTIIDMVIDKKKDENQVTQIKLLM -------1111--------------1111iiii-----------1111------------ DVNMACLNGKERNEEEWKKLFIEAGFQHYKISPLTGFLSLIEIYP ---3333--------------1111---------!!!!------- >N-ACYL-D-GLUCOSAMINE 2-EP; SWP:P17560; PDB:1FP3A; MEKERETLQAWKERVGQELDRVMAFWLEHSHDREHGGFFTCLGRDGRVYDDLKYVWLQGR ------------------------------------------1111-------------- QVWMYCRLYRKLERFHRPELLDAAKAGGEFLLRHARVAPPEKKCAFVLTRDGRPVKVQRS -----------3333-3333----------------------------1111-------- IFSECFYTMAMNELWRVTAEARYQSEAVDMMDQIVHWVREDPSGLGRPQLPGAVASESMA ----------------------------------------3333-----1111----333 VPMMLLCLVEQLGEEDEELAGRYAQLGHWCARRILQHVQRDGQAVLENVSEDGEELSGCL 3---------------------------------1111-%%%%------1111----333 GRHQNPGHALEAGWFLLRHSSRSGDAKLRAHVIDTFLLLPFRSGWDADHGGLFYFQDADG 3------------------3333-3333----------------------------1111 LCPTQLEWAMKLWWPHSEAMIAFLMGYSESGDPALLRLFYQVAEYTFRQFRDPEYGEWFG ----1111---3333--------------------------------------------- YLNREGKVALTIKGGPFKGCFHVPRCLAMCEEMLSALLSRLA --1111------------------------------1111-- >IGE HEAVY CHAIN EPSILON-1; SWP:P01854; PDB:1FP5A; VSAYLSRPSPFDLFIRKSPTITCLVVDLAPSKGTVNLTWSRASGKPVNHSTRKEEKQRNG ----------------------------------------1111---------------- TLTVTSTLPVGTRDWIEGETYQCRVTHPHLPRALMRSTTKTSGPRAAPEVYAFATPEWPG 
--------------1111--------3333------------------------------ SRDKRTLACLIQNFMPEDISVQWLHNEVQLPDARHSTTQPRKTKGSGFFVFSRLEVTRAE ------------------------!!!!--3333-------------------------- WEQKDEFICRAVHEAASPSQTVQRAVSV --3333------1111-%%%%------- >Genome polyprotein; SWP:P04936; PDB:1FPN1; LVVPNINSSNPTTSNSAPALDAAETGHTSSVQPEDVIETRYVQTSQTRDEMSLESFLGRS ----------------3333-3333------3333------------3333-3333---- GCIHESKLEVTLANYNKENFTVWAINLQEMAQIRRKFELFTYTRFDSEITLVPCISALSQ -------------1111------------3333--------------------------- DIGHITMQYMYVPPGAPVPNSRDDYAWQSGTNASVFWQHGQAYPRFSLPFLSVASAYYMF --------------------1111-3333--------2222------------------- YDGYDEQDQNYGTANTNNMGSLCSRIVTEKHIHKVHIMTRIYHKAKHVKAWCPRPPRALE ----1111----3333-------------------------------------------- YTRAHRTNFKIEDRSIQTAIVTRPIITTA ----------------------------- >Genome polyprotein; SWP:P04936; PDB:1FPN2; RIIQITRGDSTITSQDVANAIVAYGVWPHYLSSKDASAIDKPSQPDTSSNRFYTLRSVTW ---------------------2222------3333---------!!!!------------ SSSSKGWWWKLPDALKDMGIFGENMFYHYLGRSGYTIHVQCNASKFHQGTLIVALIPEHQ 1111----------1111------------------------------------------ IASALHGNVNVGYNYTHPGETGREVKAETRLNPDLQPTEEYWLNFDGTLLGNITIFPHQF -----------3333---3333---------1111----1111----------------- INLRSNNSATIIAPYVNAVPMDSMRSHNNWSLVIIPICPLETSSAINTIPITISISPMCA -3333-----------------3333---------------------------------- EFSGARAKRQ ---------- >Genome polyprotein; SWP:P04936; PDB:1FPN3; GLPVFITPGSGQFLTTDDFQSPCALPWYHPTKEISIPGEVKNLVEICQVDSLVPINNTDT ------2222---1111-------2222-------------33331111--------333 YINSENMYSVVLQSSINAPDKIFSIRTDVASQPLATTLIGEISSYFTHWTGSLRFSFMFC 3--3333-----1111-----------1111--1111----------------------- GTANTTVKLLLAYTPPGIAEPTTRKDAMLGTHVIWDVGLQSTISMVVPWISASHYRNTSP -1111------------------------------------------------------- GRSTSGYITCWYQTRLVIPPQTPPTARLLCFVSGCKDFCLRMARDTNLHLQSGAIAQ ----------------------------------3333------------------- >CHAPERONE PROTEIN HSCB; SWP:P36540; PDB:1FPOA; 
MDYFTLFGLPARYQLDTQALSLRFQDLQRQYHPDKFASGSQAEQLAAVQQSATINQAWQT ----1111------------------3333-33331111--------------------- LRHPLMRAEYLLSLHGFDLASEQHTVRDTAFLMEQLELREELDEIEQAKDEARLESFIKR -----------1111--3333------3333----------------------------- VKKMFDTRHQLMVEQLDNETWDAAADTCRKLRFLDKLRSSAEQLEEKLLDF --------------------------------------------------- >PROTEIN-TYROSINE PHOSPHAT; SWP:P29350; PDB:1FPRA; GFWEEFESLQKQEVKNLHQRLEGQRPENKGKNRYKNILPFDHSRVILQGRDSNIPGSDYI 3333---3333--1111--3333----3333--3333---1111------3333------ NANYIKNQLLGPDENAKTYIASQGCLEATVNDFWQMAWQENSRVIVMTTREVEKGRNKCV ----------1111---------------------------------------------- PYWPEVGMQRAYGPYSVTNCGEHDTTEYKLRTLQVSPLDNGDLIREIWHYQYLSWPDHGV ----2222---!!!!---------------------3333-------------------- PSEPGGVLSFLDQINQRQESLPHAGPIIVHSSAGIGRTGTIIVIDMLMENISTKGLDCDI -----------------3333--------------------------------------- DIQKTIQMVRAQRSGMVQTEAQYKFIYVAIAQFIETTKKKLEVL --------1111--------3333-------------------- >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1FPTH; QVQLQQSGAELVRPGTSVKVSCKASGYAFTNYLIQWIKQRPGQGLEWIGVINPGSGGTDY ------------2222------------1111---------------------------- NANFKGKATLTADKSSSIVYMQLSSL 3333--------3333---------- >CALCIUM-BINDING PROTEIN N; SWP:Q06389; PDB:1FPWA; MGAKTSKLSKDDLTCLKQSTYFDRREIQQWHKGFLRDCPSGQLAREDFVKIYKQFFPFGS ----------------------3333-----------1111------------------- PEDFANHLFTVFDKDNNGFIHFEEFITVLSTTSRGTLEEKLSWAFELYDLNHDGYITFDE ---------------------3333----3333---3333-----3333-------3333 MLTIVASVYKMMGSMVTLNEDEATPEMRVKKIFKLMDKNEDGYITLDEFREGSKVDPSII -------------1111------------------------------------------- GALNLYDGLI ---------- >CYCLIN-DEPENDENT KINASE I; SWP:Q16667; PDB:1FPZA; TPIHISWLSLSRVNCSQFLGLCALPGCKFKDVRRNVQKDTEELKSCGIQDIFVFCTRGEL ---------1111----------2222-2222--3333-----1111------------- SKYRVPNLLDLYQQCGIITHHHPIADGGTPDIASCCEIMEELTTCLKNYRKTLIHSYGGL ------------1111--------2222--3333-----------1111----------- GRSCLVAACLLLYLSDTISPEQAIDSLRDLRGSGAIQTIKQYNYLHEFRDKLAAHL 
-------------------------------1111--------------------- >REGULATOR OF G-PROTEIN SI; SWP:O46469; PDB:1FQIA; KLVDIPTKRVERWAFNFSELIRDPKGRQSFQHFLRKEFSGENLGFWEACEDLKYGDQSKV -----------11113333---------------1111-----------------1111- KEKAEEIYKLFLAPGARRWINIDGKTDITVKGLKHPHRYVLDAAQTHIYLKKDSYARYLK ------------2222------3333----------1111-------------------- SPIYKELAKAIEP --33333333--- >Retinal rod rhodopsin-sen; SWP:P04972; PDB:1FQJC; FGDDIPGMEGLGTDITVICPWEAFNHLELHELAQYGII 1111---2222----------1111--3333-1111-- >RIESKE-TYPE FERREDOXIN OF; SWP:P37332; PDB:1FQTA; MKFTRVCDRRDVPEGEALKVESGGTSVAIFNVDGELFATQDRCTHGDWSLSDGGYLEGDV -------1111-2222-----iiii------iiii-------1111----3333--!!!! VECSLHMGKFCVRTGKVKSPPPCEALKIFPIRIEDNDVLVDFEAGYLAP ---------------------------------!!!!---1111----- >S-phase kinase-associated; SWP:P63208; PDB:1FQVB; PSIKLQSSDGEIFEVDVEIAKQSVTIKTMLEDLGMDPVPLPNVNAAILKKVIQWCTHHKD ---------------3333--------------------11113333----------111 DIPVWDQEFLKVDQGTLFELILAANYLDIKGLLDVTCKTVANMIKGKTPEEIRKTFNIKN 1----------------------------3333--------------------------- DFTEEEEAQVRKENQWC --3333-------3333 >COLICIN E9 IMMUNITY PROTE; SWP:P13479; PDB:1FR2A; LKHSISDYTEAEFLQLVTTICNADTSSEEELVKLVTHFAEMTEHPSGSDLIYYPKEGDDD ---1111------------1111--------------------11113333---2222-- SPSGIVNTVKQWRAANGKSGFKQ ----------------------- >Colicin-E9; SWP:P09883; PDB:1FR2B; ESKRNKPGKATGKGKPVGDKWLDDAGKDSGAPIPDRIADKLRDKEFKSFDDFRKAVWEEV --1111-----------11113333-!!!!----------2222---------------- SKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDMDNIRV --33331111------3333------1111-!!!!---------3333-----1111--- TTPKRHIDIHR ----------- >MOLYBDATE/TUNGSTATE BINDI; SWP:Q7SIF7; PDB:1FR3A; MKISGRNKLEATVKEIVKGTVMAKIVMDYKGTELVAAITIDSVADLDLVPGDKVTALVKA ----------------------------iiii------3333------2222------11 TEMEVLK 11----- >BACTERIOPHAGE FR CAPSID; SWP:P03614; PDB:1FR5A; ASNFEEFVLVDNGGTGDVKVAPSNFANGVAEWISSNSRSQAYKVTCSVRQSSANNRKYTV ------------------------2222--------3333-----------1111----- 
KVEVPKVATGVELPVAAWRSYMNMELTIPVFATNDDCALIVKALQGTFKTGNPIATAIAA ----------------------------3333----------------2222-------- NSGIY ----- >HETEROCYST [2FE-2S] FERRE; SWP:P11053; PDB:1FRD; ASYQVRLINKKQDIDTTIEIDEETTILDGAEENGIELPFSCHSGSCSSCVGKVVEGEVDQ ---------1111-------1111------1111------------1111---------1 SDQIFLDDEQMGKGFALLCVTYPRSNCTIKTHQEPYLA 111-------------3333------------3333-- --------------------------------------- >n/a; SWP:NA; PDB:1FRGH; EVLLVESGGDLVKPGGFLKLSCAASGFTFSSFGMSWVRHTPDKRLEWVATISNGGGYTYY ---------------------------3333--------1111--------1111----- QDSVKGRFTISRDNAKNTLFLEMTSLKSEDAGLYYCARRERYDEKGFAYWGRGTLVTVSA 1111--------3333----------1111------------------------------ AKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSD ----------------%%%%---------------------------------------- LYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR ----------3333-----------3333----------- >FERREDOXIN I; SWP:P00235; PDB:1FRRA; AYKTVLKTPSGEFTLDVPEGTTILDAAEEAGYDLPFSCRAGACSSCLGKVVSGSVDESEG -------1111------2222------1111------------1111---------3333 SFLDDGQMEEGFVLTCIAIPESDLVIETHKEEELF -------------3333-----------3333--- ----------------------------- >ATP SYNTHASE EPSILON SUBU; SWP:P00837; PDB:1FS0G; KITKAMEMVAASKMRKSQDRMAASRPYAETMRKVIGHLAHYKHPYLEDRDVKRVGYLVVS ------------------------------------------1111-------------- TDRGLCGGLNINLFKKLLAEMKTWTDKGVQCDLAMIGSKGVSFFNSVGGNVVAQVTGMGD -----!!!!--------------3333------------------------------!!! 
NPSLSELIGPVKVMLQAYDEGRLDKLYIVSNKFINTMSQVPTISQLLPLPKHKSWDYLYE !-3333-----------1111--------------------------------------- PDPKALLDTLLRRYVESQVYQGVVENLASEQAARMVAMK --------------------------------------- >CYCLIN A/CDK2-ASSOCIATED ; SWP:Q13309; PDB:1FS1A; WDSLPDELLLGIFSCLCLPELLKVSGVCKRWYRLASDESLW ------------11113333---1111---------1111- >GLUCOSAMINE-6-PHOSPHATE D; SWP:P09375; PDB:1FS5A; MRLIPLTTAEQVGKWAARHIVNRINAFKPTADRPFVLGLPTGGTPMTTYKALVEMHKAGQ -----------------------------3333---------1111-------------- VSFKHVVTFNMDEYVGLPKEHPESYYSFMHRNFFDHVDIPAENINLLNGNAPDIDAECRQ -----------------1111-----------3333---3333----1111--------- YEEKIRSYGKIHLFMGGVGNDGHIAFNEPASSLASRTRIKTLTHDTRVANSRFFDNDVNQ -----3333---------1111-------------------------------%%%%111 VPKYALTVGVGTLLDAEEVMILVLGSQKALALQAAVEGCVNHMWTISCLQLHPKAIMVCD 1-----------1111--------3333-------------------------------3 EPSTMELKVKTLRYFNELEAENIKGL 3331111-----------3333---- >CYTOCHROME C NITRITE REDU; SWP:Q9S1E5; PDB:1FS7A; KTAHSQGIEGKAMSEEWARYYPRQFDSWKKTKESDNITDMLKEKPALVVAWAGYPFSKDY ---11112222---------------------------3333-3333---22221111-- NAPRGHYYALQDNINTLRTGAPVDGKTGPLPSACWTCKSPDVPRIIEQDGELEYFTGKWA ----3333-------3333-------------3333--------------3333---333 KYGDEIVNTIGCYNCHDDKSAELKSKVPYLDRGLSAAGFKTFAESTHQEKRSLVCAQCHV 31111-----3333--------------------1111--3333-3333---3333---- EYYFKKTEWKDDKGVDKTAMVVTLPWSKGISTEQMEAYYDEINFADWTHGISKTPMLKAQ ----------1111-----------1111----------1111----------------- HPDWELYKTGIHGQKGVSCADCHMPYTQEGAVKYSDHKVGNPLDNMDKSCMNCHRESEQK ------------1111-3333-------!!!!--------3333---------------- LKDIVKQKFERKEFLQDIAFDNIGKAHLETGKAMELGATDAELKEIRTHIRHAQWRADMA ---------------------------------1111-3333------------------ IAGHGSFFHAPEEVLRLLASGNEEAQKARIKLVKVLAKYGAIDYVAPDFETKEKAQKLAK --1111------------------------------11111111---------------- VDMEAFIAEKLKFKQTLEQEWKKQAIAKGRLNPESLKGVDEKSSYYDKTKK -------------------------------33332222---1111----- >P-SELECTIN; SWP:P16109; PDB:1FSB; 
TASCQDMSCSKQGECLETIGNYTCSCYPGFYGPECEYVRE --------%%%%---------------------------- >GERE; SWP:Q65GF8; PDB:1FSEA; SKPLLTKREREVFELLVQDKTTKEIASELFISEKTVRNHISNAMQKLGVKGRSQAVVELL --------------1111------------------------------------------ RMGELEL ------- >HYPOXANTHINE-GUANINE PHOS; SWP:Q26997; PDB:1FSGA; GSHMASKPIEDYGKGKGRIEPMYIPDNTFYNADDFLVPPHCKPYIDKILLPGGLVKDRVE -3333--3333---2222------------3333---3333------------------- KLAYDIHRTYFGEELHIICILKGSRGFFNLLIDYLATIQKYSGRESSVPPFFEHYVRLKS ---------2222--------1111----------------------------------- YQNDNSTGQLTVLSDDLSIFRDKHVLIVEDIVDTGFTLTEFGERLKAVGPKSMRIATLVE -------------------2222------------------------------------- KRTDRSNSLKGDFVGFSIEDVWIVGCCYDFNEMFRDFDHVAVLSDAARKKFEK --1111------------------iiii-iiii1111---------------- >DISHEVELLED-1; SWP:P51141; PDB:1FSHA; EAPLTVKSDMSAIVRVMQLPDSGLEIRDRMWLKITIANAVIGADVVDWLYTHVEGFKERR --------3333------------------iiii-------------3333--------- EARKYASSMLKHGFLRHTVNKITFSEQCYYVFGD ---------------------------------- >MAJOR POLLEN ALLERGEN BET; SWP:NA; PDB:1FSKB; NIVLTQSPKSMSVSVGERVTLSCKASENVDTYVFWFQQKPDQSPKLLLYGPSNRYTGVPD ------------------------------------------------------------ RFTGSGSTTDFTLTISSVQAEDLADYHCGQSYSYPYTFGGGTKLEIKRADAAPTVSIFPP ------------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT --3333-----------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -----------------------------3333- >MAJOR POLLEN ALLERGEN BET; SWP:NA; PDB:1FSKC; QVQLQQPGTELVRPGASVILSCKASGYTFTSYWINWVKQRPGQGLEWVGNIFPSDSYTNY ---------------------------1111----------------------------- NQKFKDKATLTVDKSSSTAYMQVNSPTSEDSAVYYCTRGARDTWFAYWGQGTLVTVSVAK 3333--------3333----------3333---------2222----------------- TTPPSVFPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%------------------ TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC ------------------------1111------------ >RHO 
GDP-DISSOCIATION INHI; SWP:P52565; PDB:1FSOA; MVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNREIVSG -------------3333----------33331111----2222----------------- MKYIQHTYRAGVAIDATDYMVGSYGPRAEEYEFLTPVEEAPKGMLARGSYSIKSRFTDDD --------iiii------------------------------3333-----------111 KTDHLSWEWNFTIKKDWK 1----------------- >N-ACETYLGALACTOSAMINE-4-S; SWP:P15848; PDB:1FSU; SRPPHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLTPSRSQLLTGR ------------------1111-------------------------------------- YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF -1111-------1111----1111-3333--1111--------------33333333--- DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIAL ----------------------1111-----------------2222------------- ITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTA 1111-------------------------3333--------------------------- ALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGV --11113333-----------3333---------2222-3333--------1111----- KNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNIDPNFVDS ------1111------------2222-------3333----------------------- SPCSAFNTSVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNVSEIPSSDPPTKTLWLFDIDR ---------------!!!!--------------1111----------1111-----3333 DPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCDPKATGVWGPWM 1111----3333--------------1111--------1111-3333------- >CYTOCHROME C554; SWP:Q57142; PDB:1FT5A; ADAPFEGRKKCSSCHKAQAQSWKDTAHAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDC ------------------------33333333--2222-----1111-1111----1111 VGCHVDGFGQKGGYTIESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKD 1111--2222----3333-1111---3333----3333------------------3333 LAKKGQDFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVKAMHE -1111---------------2222------------33333333--333311111111-- HYKLEGVFEGEPKFKFHDEFQASAKPAKKGK -------------1111-------------- >CARBON MONOXIDE OXIDATION; SWP:P72322; PDB:1FT9A; PPRFNIANVLLSPDGETFFRGFRSKIHAKGSLVCTGEGDENGVFVVVDGRLRVYLVGEER --------------11112222-----2222-------------------------iiii 
EISLFYLTSGDMFCMHSGCLVEATERTEVRFADIRTFEQKLQTCPSMAWGLIAILGRALT ------------------------------------------------------------ SCMRTIEDLMFHDIKQRIAGFFIDHANTTGRQTGVIVSVDFTVEEIANLIGSSRQTTSTA ------------------------------------------------------------ LNSLIKEGYISRQGRGHYTIPNLVRLKAAA -------------2222----3333----- >FRUCTOSE-1,6-BISPHOSPHATA; SWP:P09467; PDB:1FTAA; DVVTLTRFVMEEGRKARGTGELTQLLNSLCTAVKAISSAVRKAGIAHLYGIAGVKKLDVL ---3333-------------------------------------3333-1111--3333- SNDLVMNMLKSSFATCVLVSEEDKHAIIVEPEKRGKYVVCFDPLDGSSNIDCLVSVGTIF ---------3333------1111------3333----------2222-3333-------- GIYRKKSTDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMDCGVNCFMLDPAIGEFILV -----------3333---3333------------------3333---------------- DKDVKIKKKGKIYSLNEAYAKDFDPAVTEYIQRKKFPPDNSAPYGARYVGSMVADVHRTL ------------------3333--------------1111-------------------- VYGGIFLYPANKKSPNGKLRLLYECNPMAYVMEKAGGMATTGKEAVLDVIPTDIHQRAPI ----------3333------------------1111--------3333----1111---- ILGSPDDVLEFLKVYEKHS ------------------- >ACYL CARRIER PROTEIN SYNT; SWP:P0A2W6; PDB:1FTHA; MIVGHGIDIEELASIESAVTRHEGFAKRVLTALEMERFTSLKGRRQIEYLAGRWSAKEAF ----------3333---------3333---3333--3333--3333-------------- SKAMGTGISKLGFQDLEVLNNERGAPYFSQAPFSGKIWLSISHTDQFVTASVILEEN -----------1111-----1111--------------------------------- >MUSCLE FATTY ACID BINDING; SWP:P41496; PDB:1FTPA; VKEFAGIKYKLDSQTNFEEYMKAIGVGAIERKAGLALSPVIELEILDGDKFKLTSKTAIK 3333-----------------1111--------1111----------------------- NTEFTFKLGEEFDEETLDGRKVKSTITQDGPNKLVHEQKGDHPTIIIREFSKEQCVITIK ------2222-----1111-------------------------------1111------ LGDLVATRIYKAQ !!!!--------- >FORMYLMETHANOFURAN\:TETRA; SWP:Q49610; PDB:1FTRA; MEINGVEIEDTFAEAFEAKMARVLITAASHKWAMIAVKEATGFGTSVIMCPAEAGIDCGY --iiii------------------------------------------------------ VPPEETPDGRPGVTIMIGHNDEDELKEQLLDRIGQCVMTAPTASAFDAMPEAEKEDEDRV -11111111---------------------------1111---------3333------- GYKLSFFGDGYQEEDELDGRKVWKIPVVEGEFIVEDSFGITTGVAGGNFYIMAESQPAGL 
-------iiii-----iiii------3333------------------------------ QAAEAAVDAIKGVEGAYAPFPGGIVASASKVGSKQYDFLPASTNDAYCPTVEDNELPEGV --------33332222---2222------------1111----33331111-----2222 KCVYEIVINGLNEEAVKEAMRVGIEAACQQPGVVKISAGNFGGKLGQYEIHLHDLF --------------------------1111----------iiii------3333-- >FTSY; SWP:P10121; PDB:1FTS; RSLLKTKENLGSGFISLFRGKKIDDDLFEELEEQLLIADVGVETTRKIITNLTEGASRKQ 1111------3333---2222--------------------------------------- LRDAEALYGLLKEEMGEILAKVDEPLNVEGKAPFVILMVGVNGVGKTTTIGKLARQFEQQ --3333---------------------------------------------------111 GKSVMLAAGDTFRAAAVEQLQVWGQRNNIPVIAQHTGADSASVIFDAIQAAKARNIDVLI 1--------1111-----------1111------22223333---------1111----- ADTAGRLQNKSHLMEELKKIVRVMKKLDVEAPHEVMLTIDASTGQNAVSQAKLFHEAVGL -----3333--------------33331111--------3333----------------- TGITLTKLDGTAKGGVIFSVADQFGIPIRYIGVGERIEDLRPFKADDFIEALFAR -----------------------------------1111---------------- >THYROID TRANSCRIPTION FAC; SWP:P23441; PDB:1FTT; MRRKRRVLFSQAQVYELERRFKQQKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKRQ ---------------------------3333----------3333--------------- AKDKAAQQ -------- >FUSHI TARAZU PROTEIN; SWP:P02835; PDB:1FTZ; DSKRTRQTYTRYQTLELEKEFHFNRYITRRRRIDIANALSLSERQIKIWFQNRRMKSKKD -------------------3333------------------------------------- RTLDSSPEH --%%%%--- >PHOSPHOCARRIER PROTEIN HP; SWP:P07515; PDB:1FU0A; MEKKEFHIVAETGIHARPATLLVQTASKFNSDINLEYKGKSVNLKIMGVMSLGVGQGSDV ---------1111-----------------------%%%%--------------2222-- TITVDGADEAEGMAAIVETLQKEGLA -----1111-----------1111-- >DNA REPAIR PROTEIN XRCC4; SWP:Q13426; PDB:1FU1A; MERKISRIHLVSEPSITHFLQVSWEKTLESGFVITLTDGHSAWTGTVSESEISQEADDMA ---------1111-------------3333-----------------3333-----1111 MEKGKYVGELRKALLSGAGPADVYTFNFSKESYFFFEKNLKDVSFRLGSFNLEKVENPAE -3333-------------3333-----------------iiii-------------3333 VIRELIYLDTTAENQAKNEHLQKENERLLRDWNDVQGRFEKVSAKEALETDLYK ------------------------------------------------------ >U-SHAPED TRANSCRIPTIONAL ; SWP:Q9VPQ6; PDB:1FU9A; 
GSAAEVMKKYCSTCDISFNYVKTYLAHKQFYCKNKP ----------3333-----3333------------- >L-FUCOSE ISOMERASE; SWP:P11552; PDB:1FUIA; MKKISLPKIGIRPVIDGRRMGVRESLEEQTMNMAKATAALLTEKLRHACGAAVECVISDT ---------------------3333---------------------1111---------- CIAGMAEAAACEEKFSSQNVGLTITVTPCWCYGSETIDMDPTRPKAIWGFNGTERPGAVY --------------1111--------------1111------------------------ LAAALAAHSQKGIPAFSIYGHDVQDADDTSIPADVEEKLLRFARAGLAVASMKGKSYLSL --------1111------------1111-----------------------2222----- GGVSMGIAGSIVDHNFFESWLGMKVQAVDMTELRRRIDQKIYDEAELEMALAWADKNFRY ---%%%%1111-3333-------------------------------------------- GEDENNKQYQRNAEQSRAVLRESLLMAMCIRDMMQGNSKLADIGRVEESLGYNAIAAGFQ -----3333-----------------------------3333---3333----------- GQRHWTDQYPNGDTAEAILNSSFDWNGVREPFVVATENDSLNGVAMLMGHQLTGTAQVFA -----------------------1111--------%%%%--------------------- DVRTYWSPEAIERVTGHKLDGLAEHGIIHLINSGSAALDGSCKQRDSEGNPTMKPHWEIS --------------------1111---------------------1111-----1111-- QQEADACLAATEWCPAIHEYFRGGGYSSRFLTEGGVPFTMTRVNIIKGLGPVLQIAEGWS -------1111-----33331111------------------------------------ VELPKDVHDILNKRTNSTWPTTWFAPRLTGKGPFTDVYSVMANWGANHGVLTIGHVGADF -----------11111111-----------!!!!-3333-1111---------------- ITLASMLRIPVCMHNVEETKVYRPSAWAAHGMDIEGQDYRACQNYGPLYKR ----1111--------3333-----3333---3333--------------- >PR3; SWP:P15637; PDB:1FUJA; IVGGHEAQPHSRPYMASLQMRGNP -------22221111--------- >EUKARYOTIC INITIATION FAC; SWP:P10081; PDB:1FUKA; IKQFYVNVEEEEYKYECLTDLYDSISVTQAVIFCNTRRKVEELTTKLRNDKFTVSAIYSD ---------3333---------1111-------------------------------111 LPQQERDTIMKEFRSGSSRILISTDLLARGIDVQQVSLVINYDLPANKENYIHRIGRGGG 13333------------------33331111---------------33331111------ VAINFVTNEDVGAMRELEKFYSTQIEELPSDIATLLN ------33333333-----------------1111-- >FUMARASE C; SWP:P05042; PDB:1FURA; VRSEKDSMGAIDVPADKLWGAQTQRSLEHFRISTEKMPTSLIHALALTKRAAAKVNEDLG -----1111----1111---------------------------------------1111 LLSEEKASAIRQAADEVLAGQHDDEFPLAIWQTGSGTQSNMNMNEVLANRASELLGGVRG 
--3333---------------1111-------1111-----------------------1 MERKVHPNDDVNKSQSSNDVFPTAMHVAALLALRKQLIPQLKTLTQTLNEKSRAFADIVK 111------1111--3333---------------------------------1111---- IGRTNLQDATPLTLGQEISGWVAMLEHNLKHIEYSLPHVAELALGGTAVGTGLNTHPEYA ---iiii-----------------------------3333---------------1111- RRVADELAVITCAPFVTAPNKFEALATCDALVQAHGALKGLAASLMKIANDVRWLASGPR ------------------------------------------------------------ CGIGEISIPENEPGSSIMPGKVNPTQCEALTMLCCQVMGNDVAINMGGASGNFELNVFRP --------------1111----------------------------1111-!!!!----- MVIHNFLQSVRLLADGMESFNKHCAVGIEPNRERINQLLNESLMLVTALNTHIGYDKAAE -----------------------3333-----------11111111-3333--3333--- IAKKAHKEGLTLKAAALALGYLSEAEFDSWVRPEQM -------------------------------3333- >RIBONUCLEASE F1; SWP:P10282; PDB:1FUS; SATTCGSTNYSASQVRAAANAACQYYQNDDTAGSSTYPHTYNNYEGFDFPVDGPYQEFPI -------------------------1111--!!!!-------1111-------------- KSGGVYTGGSPGADRVVINTNCEYAGAITHTGASGNNFVGCSGTN 1111--------------1111-------2222!!!!---2222- >HYPOTHETICAL 19.5 KDA PRO; SWP:P77368; PDB:1FUXA; EFQVTSNEIKTGEQLTTSHVFSGFGCEGGNTSPSLTWSGVPEGTKSFAVTVYDPDAPTGS ---------2222---------iiii--------------2222---------------- GWWHWTVVNIPATVTYLPVDAGRRDGTKLPTGAVQGRNDFGYAGFGGACPPKGDKPHHYQ ----------1111---2222-1111---2222----3333---------2222------ FKVWALKTEKIPVDSNSSGALVGYLNANKIATAEITPVYEIKLE -------------1111-------3333---------------- >FIRST ZINC FINGER OF U-SH; SWP:Q9VPQ6; PDB:1FV5A; GSLLKPARFMCLPCGIAFSSPSTLEAHQAYYCSHRI ----------3333-----3333------------- >Prolactin-binding protein; SWP:Q9QV16; PDB:1FVCB; EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPTNGYTRY ------------2222-----------3333--------3333----------------- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRWGGDGFYAMDYWGQGTLVTVSS -1111--------1111---------3333----------2222---------------- >IGG1-KAPPA 4D5 FAB (HEAVY; SWP:NA; PDB:1FVDB; EVQLVESGGGLVQPGGSLRLSCAASGFNIKDTYIHWVRQAPGKGLEWVARIYPTNGYTRY ------------2222-----------3333--------2222----------------- 
ADSVKGRFTISADTSKNTLYLQMNSLRAEDTAVYYCSRWGGDGFYAMDVWGQGTLVTVSS 1111---------1111---------1111------------------------------ ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSS ----------------------------------------%%%%-------------333 GLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC 3----------3333-----------3333------------- >PEPTIDE METHIONINE SULFOX; SWP:P54149; PDB:1FVGA; KIVSPQEALPGRKEPLVVAAKHHVNGNRTVEPFPEGTQAVFGGCFWGAERKFWTLKGVYS ---3333--------------------------2222--------------1111----- TQVGFAGGYTPNPTYKEVCSGKTGHAEVVRVVFQPEHISFEELLKVFWENHDPTQGRQGN -----------------3333------------1111--------------------!!! DHGSQYRSAIYPTSAEHVGAALKSKEDYQKVLSEHGFGLITTDIREGQTFYYAEDYHQQY !--1111-------------------------1111-----------------3333-11 LSKDPDGYC 11-1111-- >CHLORELLA VIRUS DNA LIGAS; SWP:O41026; PDB:1FVIA; AITKPLLAATLENIEDVQFPCLATPKIAGIRSVKQTQMLSRTFKPIRNSVMNRLLTELLP ------------1111-----------------------1111----------------2 EGSDGEISIEGATFQDTTSAVMTGHAKFSYYWFDYVTDDPLKKYIDRVEDMKNYITVHPH 222---------3333----------------------11113333-------------1 ILEHAQVKIIPLIPVEINNITELLQYERDVLSKGFEGVMIRKPDGKYKFGRSTLKEGILL 111---------------------------1111-------1111-------3333---- KMKQFKDAEATIISMTALFKSGKVEEDVMGSIEVDYDGVVFSIGTGFDADQRRDFWQNKE ----------------------3333----------------------------333333 SYIGKMVKFKYFEMPRFPVFIGIR 33---------------------- >DISULFIDE BOND FORMATION ; SWP:P24991; PDB:1FVKA; AQYEDGKQYTTLEKPVAGAPQVLEFFSFFCPHCYQFEEVLHISDNVKKKLPEGVKMTKYH ---2222--------2222-------1111----------------11112222------ VNFMGGDLGKDLTQAWAVAMALGVEDKVTVPLFEGVQKTQTIRSASDIRDVFINAGIKGE -----3333--------------3333----------------3333----------333 EYDAAWNSFVVKSLVAQQEKAAADVQLRGVPAMFVNGKYQLNPQGMDTSNMDVFVQQYAD 3---------------------1111---------------3333--------------- TVKYLSEK ----1111 >FLAVOPROTEIN 390; SWP:P12745; PDB:1FVPA; MNKWNYGVFFVNFYNKGQQEPSKTMNNALETLRIIDEDTSIYDVINIDDHYLVKKDSEDK ------------------------------------------------3333---1111- 
KLAPFITLGEKLYVLATSENTVDIAAKYALPLVFKWDDINEERLKLLSFYNASASKYNKN -----------------------3333--------------------1111--------- IDLVRHQLMLHVNVNEAETVAKEELKLYIENYVACTQPSNFNGSIDSIIQSNVTGSYKDC -----------------3333-------3333---------------------------- LSYVANLAGKFDNTVDFLLCFESMQDQNKKKSVMIDLNNQVIKFRQDNNLI -----3333-%%%%-----------33333333------------------ >COPPER-TRANSPORTING ATPAS; SWP:P38995; PDB:1FVQA; AREVILAVHGMTCSACTNTINTQLRALKGVTKCDISLVTNECQVTYDNEVTADSIKEIIE ------------3333-------1111-------------------3333---------- DCGFDCEILRDS ------------ >TYROSINE-PROTEIN KINASE T; SWP:Q02763; PDB:1FVRA; PTIYPVLDWNDIKFQDVIGEGNFGQVLKARIKKDGLRMDAAIKRMKEYRDFAGELEVLCK -------1111--------!!!!---------iiii--------------------1111 LGHHPNIINLLGACEHRGYLYLAIEYAPHGNLLDFLRKSRVLETDPAFAIANSTASTLSS ---1111--------iiii-------1111---------3333----------------- QQLLHFAADVARGMDYLSQKQFIHRDLAARNILVGENYVAKIADFGLSRGQEVYVKKLPV ---------------------------3333---2222--------------------11 RWMAIESLNYSVYTTNSDVWSYGVLLWEIVSLGGTPYCGMTCAELYEKLPQGYRLEKPLN 11---------------------------------------------3333------111 CDDEVYDLMRQCWREKPYERPSFAQILVSLNRMLEERKTYVNTTLYEKFTYAGIDCSAE 1---------1111-1111-------------3333--------!!!!------1111- >BOTROCETIN ALPHA CHAIN; SWP:P22029; PDB:1FVUA; DCPSGWSSYEGNCYKFFQQKMNWADAERFCSEQAKGGHLVSIKIYSKEKDFVGDLVTKNI --2222--iiii-----------------11112222-----1111-------------- QSSDLYAWIGLRVENKEKQCSSEWSDGSSVSYENVVERTVKKCFALEKDLGFVLWINLYC -----------------------1111--------3333--------2222-------11 AQKNPFVCKSPPP 11----------- >Botrocetin beta chain; SWP:P22030; PDB:1FVUB; DCPPDWSSYEGHCYRFFKEWMHWDDAEEFCTEQQTGAHLVSFQSKEEADFVRSLTSEMLK --2222-----------------------11112222------------3333-111122 GDVVWIGLSDVWNKCRFEWTDGMEFDYLIAEYECVASKPTNNKWWIIPCTRFKNFVCEFQ 22----------------1111----------------1111-----1111--------- A - >GLUTATHIONE TRANSFERASE Z; SWP:O43708; PDB:1FW1A; KPILYSYFRSSCSWRVRIALALKGIDYKTVPINLIKDGGQQFSKDFQALNPMKQVPTLKI ------1111-------------------------iiii1111-3333-1111------i 
DGITIHQSLAIIEYLEETRPTPRLLPQDPKKRASVRMISDLIAGGIQPLQNLSVLKQVGE iii------------------------------------------3333-3333333333 EMQLTWAQNAITCGFNALEQILQSTAGIYCVGDEVTMADLCLVPQVANAERFKVDLTPYP 33------------------3333-----------3333----------------1111- TISSINKRLLVLEAFQVSHPCRQPDTPT --------11111111--11111111-- >PHOSPHOGLYCERATE KINASE; SWP:P00560; PDB:1FW8A; SKYSLAPVAKELQSLLGKDVTFLNDCVGPEVEAAVKASAPGSVILLENLRYHIEEEGSRK 1111------------------------------11112222-----11111111----- VDGQKVKASKEDVQKFRHELSSLADVYINDAFGTAHRAHSSMVGFDLPQRAAGFLLEKEL iiii----------------1111------3333----3333------------------ KYFGKALENPTRPFLAILGGAKVADKIQLIDNLLDKVDSIIIGGGMAFTFKKVLENTEIG ----------------------33333333------------------------------ DSIFDKAGAEIVPKLMEKAKAKGVEVVLPVDFIIADAFSADANTKTVTDKEGIPAGWQGL ----3333-----------1111------------------------3333--2222--- DNGPESRKLFAATVAKAKTIVWNGPPGVFEFEKFAAGTKALLDEVVKSSAAGNTVIIGGG --3333-------3333----------3333----------------------------3 DTATVAKKYGVTDKISHVSTGGGASLELLEGKELPGVAFLSEKKSLSSKLSVQDLDLKDK 333-------3333----------------------3333----------3333--2222 RVFIRVDFNVPLDGKKITSNQRIVAALPTIKYVLEHHPRYVVLASHLGRPNGERN ------------------------------------------------------- ----------------------------------- >GUANINE NUCLEOTIDE EXCHAN; SWP:P47224; PDB:1FWQA; ELVSAEGRNRKAVLCQRCGSRVLQPGTALFSRRQLFLPSMRKKPALSDGSNPDGDLLQEH --------------------------------------3333------------------ WLVEDMFIFENVGFTKDVGNIKFLVCADCEIGPIGWHCLDDKNSFYVALERVSHE ----3333-----------------1111-------------------1111--- >NITROUS OXIDE REDUCTASE; SWP:Q51705; PDB:1FWXA; ADGSVAPGQLDDYYGFWSSGQSGEMRILGIPSMRELMRVPVFNRCSATGWGQTNESVRIH -----2222---------!!!!-------------------------------------- ERTMSERTKKFLAANGKRI 1111--------1111--- >ATP SYNTHASE ALPHA CHAIN; SWP:P06450; PDB:1FX0A; KVVNTGTVLQVGDGIARIHGLDEVMAGELVEFEEGTIGIALNLESNNVGVVLMGDGLMIQ -----------iiii-----33332222---1111-------------------3333-- EGSSVKATGRIAQIPVSEAYLGRVINALAKPIDGRGEITASESRLIESPAPGIMSRRSVY 
------------------------------------------------------------ EPLQTGLIAIDAMIPVGRGQRELIIGDRQTGKTAVATDTILNQQGQNVICVYVAIGQKAS ------3333------2222-----------------------2222----------333 SVAQVVTNFQERGAMEYTIVVAETADSPATLQYLAPYTGAALAEYFMYRERHTLIIYDDL 3----------------------3333--------------------------------- SKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSSLLGEGSMTALPIVETQA -----------1111----iiii--1111----1111----1111-------------%% GDVSAYIPTNVISITDGQIFLSADLFNAGIRPAINVGISVSRVGSAAQIKAMKKVAGKLK %%--------1111-------3333---------------3333---------------- LELAQFAELEAFAQFASDLDKATQNQLARGQRLRELLKQPQSAPLTVEEQVMTIYTGTNG ----33333333-------3333---------------------------------1111 YLDSLELDQVRKYLVELRTYVKTNKPEFQEIISSTKTFTEEAEALLKEAIQEQMERF ---------3333-------------------3333--------------------- >ATP synthase subunit beta; SWP:P00825; PDB:1FX0B; NLGRIAQIIGPVLNVAFPPGKMPNIYNALIVKGRDTAGQPMNVTCEVQQLLGNNRVRAVA -----------------------2222--------------------------------- MSATDGLTRGMEVIDTGAPLSVPVGGPTLGRIFNVLGEPVDNLRPVDTRTTSPIHRSAPA ---22222222----------------2222--1111----------------------1 FTQLDTKLSIFETGIKVVNLLAPYRRGGKIGLFGGAGVGKTVLIMELINNIAKAHGGVSV 111---------------------2222-----------------------1111----- FGGVGERTREGNDLYMEMKESGVINEQNIAESKVALVYGQMNEPPGARMRVGLTALTMAE ------3333--------1111------------------------1111---------- YFRDVNEQDVLLFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLSTEMGSLQERITSTKE ----------------3333-------3333-----------1111-----------333 GSITSIQAVYVPADDLTDPAPATTFAHLDATTVLSRGLAAKGIYPAVDPLDSTSTMLQPR 3----------iiii---3333-3333--------1111--------------1111111 IVGEEHYEIAQRVKETLQRYKELQDIIAILGLDELSEEDRLTVARARKIERFLSQPFFVA 1-3333-----------------3333----11111111--------------------- EVFTGSPGKYVGLAETIRGFQLILSGELDSLPEQAFYLVGNIDEATA ------------3333------1111-33331111-----3333--- >RECEPTOR-TYPE ADENYLATE C; SWP:Q99279; PDB:1FX2A; NNNRAPKEPTDPVTLIFTDIESSTALWAAHPDLMPDAVAAHHRMVRSLIGRYKCYEVKTV 3333---1111------------------------------------------------! 
GDSFMIASKSPFAAVQLAQELQLCFLHHDWGTNALDDSYREFEEQRAEGECEYTPPTAHM !!!----------------------------------------------1111---1111 DPEVYSRLWNGLRVRVGIHTGLCDIRHDEVTKGYDYYGRTPNMAARTESVANGGQVLMTH 3333-------------------------------------------11112222----- AAYMSLSAEDRKQIDVTALGDVALRGVSDPVKMYQLNTVPSRNFAALRLDREYFD --333333331111---------2222-----------2222------------- >PROTEIN-EXPORT PROTEIN SE; SWP:P44853; PDB:1FX3A; QPVLQIQRIYVKDVSFEAPNLPHIFQQEWKPKLGFDLSTETTQVGDDLYEVVLNISVETT -----------------------3333-----------------2222------------ LEDSGDVAFICEVKQAGVFTISGLEDVQMAHCLTSQCPNMLFPYARELVSNLVNRGTFPA -3333------------------------------------------------1111--- LNLSPVNFDALFVEYMNRQQAEN ----------------------- >RECEPTOR-TYPE ADENYLATE C; SWP:Q99280; PDB:1FX4A; DNDSAPKEPTGPVTLIFTDIESSTALWAAHPDLMPDAVATHHRLIRSLITRYECYEVKTV -1111--1111------------------------------------------------! GDSFMIASKSPFAAVQLAQELQLCFLRLDWETNAVDESYREFEEQRAEGECEYTPPTASL !!!-----------------------------------------------------1111 DPEVYSRLWNGLRVRVGIHTGLCDIRYDEVTKGYDYYGRTSNMAARTESVANGGQVLMTH ----------------------------1111---------------11112222----- AAYMSLSGEDRNQLDVTTLGATVLRGVPEPVRMYQLNAVPGRNFAALRLDR --333333331111---------2222-----------2222--------- >ANTI-H(O) LECTIN I; SWP:P22972; PDB:1FX5A; SDDLSFKFKNFSQNGKDLSFQGNASVIETGVLQLNKVGNNLPDETGGIARYIAPIHIWNC -----------1111-----------1111------------------------------ NTGELASFITSFSFFMETSANPKAATDGLTFFLAPPDSPLRRAGGYFGLFNDTKCDSSYQ --------------------3333----------1111----!!!!---------3333- TVAVEFDTIGSPVNFWDPGFPHIGIDVNCVKSINAERWNKRYGLNNVANVEIIYEASSKT -------------1111-------------------------1111---------1111- LTASLTYPSDQTSISVTSIVDLKEILPEWVSVGFSGSTYIGRQATHEVLNWYFTSTFINT --------------------3333--------------2222------------------ >FERREDOXIN II; SWP:P00209; PDB:1FXD; PIEVNDDCMAEACVEICPDVFEMNEEGDKAVVINPDSDLDCVEEAIDSCPAEAIVRS ----1111--3333--1111---3333------1111-3333------1111----- >FERREDOXIN I; SWP:P00250; PDB:1FXIA; 
ASYKVTLKTPDGDNVITVPDDEYILDVAEEEGLDLPYSCRAGACSTCAGKLVSGPAPDED ----------------------1111------------------1111------------ QSFLDDDQIQAGYILTCVAYPTGDCVIETHKEEALY -------3333---1111------------------ >PREFOLDIN; SWP:O26774; PDB:1FXKA; QNVQHQLAQFQQLQQQAQAISVQKQTVEQINETQKALEELSRAADDAEVYKSSGNILIRV --------------------------------------3333-1111-----!!!!---- AKDELTEELQEKLETLQLREKTIERQEERVKKLQEQVNIQEAK --------------------------------------1111- >Prefoldin alpha subunit; SWP:O27646; PDB:1FXKC; AALAEIVAQLNIYQSQVELIQQQMEAVRATISELEILEKTLSDIQGKDGSETLVPVGAGS -3333-------1111-------------------------1111-2222------%%%% FIKAELKDTSEVIMSVGAGVAIKKNFEDAMESIKSQKNELESTLQKMGENLRAITDIMMK -------1111-----iiii------------------------------------3333 LSPQAEELLAAVA ------------- >PARANEOPLASTIC ENCEPHALOM; SWP:P26378; PDB:1FXLA; SKTNLIVNYLPQNMTQEEFRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYIDPKDAEKA ----------1111--------3333---------------------------------- INTLNGLRLQTKTIKVSYARPSSASIRDANLYVSGLPKTMTQKELEQLFSQYGRIITSRI ---2222-!!!!----------3333----------1111--------3333-------- LVDQVTGVSRGVGFIRFDKRIEAEEAIKGLNGQKPSGATEPITVKFA ------------------3333-------2222-------------- >GLUCOSE-1-PHOSPHATE THYMI; SWP:Q9HU22; PDB:1FXOA; KRKGIILAGGSGTRLHPATLAISKQLLPVYDKPMIYYPLSTLMLAGIREILIISTPQDTP -----------3333-1111--1111------3333------1111--------3333-- RFQQLLGDGSNWGLDLQYAVQPSPDGLAQAFLIGESFIGNDLSALVLGDNLYYGHDFHEL --------3333-------------3333--11113333-------1111---2222--- LGSASQRQTGASVFAYHVLDPERYGVVEFDQGGKAISLEEKPLEPKSNYAVTGLYFYDQQ -------------------3333------1111--------------------------- VVDIARDLKPSPRGELEITDVNRAYLERGQLSVEIMGRGYAWLDTGTHDSLLEAGQFIAT ----1111--1111--3333-----1111-------3333-------------------- LENRQGLKVACPEEIAYRQKWIDAAQLEKLAAPLAKNGYGQYLKRLLTETVY ------------------------------3333----------3333---- >FERREDOXIN I; SWP:P00210; PDB:1FXRA; ARKFYVDQDECIACESCVEIAPGAFAMDPEIEKAYVKDVEGASQEEVEEAMDTCPVQCIH ------3333----3333-------------------1111------------1111--- WEDE ---- >PLATELET-ACTIVATING FACTO; 
SWP:Q29459; PDB:1FXWF; SNPAAIPHAAEDIQGDDRWMSQHNRFVLDCKDKEPDVLFVGDSMVQLMQQYEIWRELFSP -1111------------------------------------33333333--------333 LHALNFGIGGDTTRHVLWRLKNGELENIKPKVIVVWVGTNNHENTAEEVAGGIEAIVQLI 3------2222-----------1111-----------1111------------------- NTRQPQAKIIVLGLLPRGEKPNPLRQKNAKVNQLLKVSLPKLANVQLLDTDGGFVHSDGA ---1111------------------------------------------------1111- ISCHDMFDFLHLTGGGYAKICKPLHELIMQLL -33331111----3333--------------- >EXONUCLEASE I; SWP:P04995; PDB:1FXXA; QSTFLFHDYETFGTHPALDRPAQFAAIRTDSEFNVIGEPEVFYCKPADDYLPQPGAVLIT -----------------------------1111--------------------------- GITPQEARAKGENEAAFAARIHSLFTVPKTCILGYNNVRFDDEVTRNIFYRNFYDPYAWS -----------------------1111--------3333-----------------3333 WQHDNSRWDLLDVMRACYALRPEGINWPEGLPSFRLEHLTKANGIEHSNAHDAMADVYAT -%%%%----------------2222----------------------------------- IAMAKLVKTRQPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWRGNTSWVAPLA ------------------1111-----33333333-----------3333---------- WHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLAVPVKLVHINKCPVLAQANT -1111-------1111-3333--------------------------1111-----3333 LRPEDADRLGINRQHCLDNLKILRENPQVREKVVAIFAEAPSDNVDAQLYNGFFSDADRA ----------------------------------1111-----33331111--------- AMKIVLETEPRNLPALDITFVDKRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPE ----111133333333-----3333-----------3333-------------------- FLQGYADELQMLVQQYADDKEKVALLKALWQYADEIVEH ------------------3333----------------- >COAGULATION FACTOR XA-TRY; SWP:P00742; PDB:1FXYA; IVGGYNCKDGEVPWQALLINEENEGFCGGTILSEFYILTAAHCLYQAKRFKVRVGDRNTE -------22221111----1111---------1111---3333----------------- QEEGGEAVHEVEVVIKHNRFTKETYDFDIAVLRLKTPITFRMNVAPASLPTAPPATGTKC ------------------------------------------------------------ LISGWGNTASSGADYPDELQCLDAPVLSQAKCEASYPGKITSNMFCVGF ------------------------------------------------- >GLYCININ G1; SWP:P04776; PDB:1FXZA; NECQIQKLNALKPDNRIESEGGLIETWNPNNKPFQCAGVALSRCTLNRNALRRPSYTNGP 1111-----------------------1111----------------------------- 
QEIYIQQGKGIFGMIYPGCPSTRHQKIYNFREGDLIAVPTGVAWWMYNNEDTPVVAVSII ------------------------------2222----2222------------------ DTNSLENQLDQMPRRFYLAGNQEQEFLKYQQGGSILSGFTLEFLEHAFSVDKQIAKNLQG ---3333----------------1111------3333--3333--3333----------- EKGAIVTVKGGLSVIKPICTMRLRHNIGQTSSPDIYNPQAGSVTTATSLDFPALSWLRLS -----------------1111-------------------------3333--3333---- AEFGSLRKNAMFVPHYNLNANSIIYALNGRALIQVVNCNGERVFDGELQEGRVLIVPQNF ------2222--------------------------1111--------2222-------- VVAARSQSDNFEYVSFKTNDTPMIGTLAGANSLLNALPEEVIQHTFNLKSQQARQIKNNN -------------------------------3333------------------------- PFKFLVPPQES ----------- >ESA1 HISTONE ACETYLTRANSF; SWP:Q08649; PDB:1FY7A; ARVRNLNRIIMGKYEIEPWYFSPYPIELTDEDFIYIDDFTLQYFGSKKQYERYRKKCTLR ----------!!!!--------------------------------------3333---- HPPGNEIYRDDYVSFFEIDGRKQRTWCRNLCLLSKLFLDHKTLYYDVDPFLFYCMTRRDE ------------------3333--------------------33331111--------11 LGHHLVGYFSKEKESADGYNVACILTLPQYQRMGYGKLLIEFSYELSKKENKVGSPEKPL 11------------1111--------3333-----------------1111--------- SDLGLLSYRAYWSDTLITLLVEHQKEITIDEISSMTSMTTTDILHTAKTLNILRYYKGQH -------------------------------------------------------iiii- IIFLNEDILDRYNRLKAKKRRTIDPNRLIWKPP -----------------------3333------ >PROTEINASE INHIBITOR; SWP:Q40378; PDB:1FYBA; DRICTNCCAGTKGCKYFSDDGTFVCEGESDPRNPKACTLNCDPRIAYGVCPRSEEKKNDR -----------------3333--------------------3333--------------- ICTNCCAGTKGCKYFSDDGTFVCEGESDPRNPKACPRNCDPRIAYGICPLA ---------------1111---------------1111-3333-------- >ASPARTYL DIPEPTIDASE; SWP:P36936; PDB:1FYEA; MELLLLSNSTLPGKAWLEHALPLIANQLNGRRSAVFIPFAGVTQTWDEYTDKTAEVLAPL ----------22222222---------iiii------3333---------------3333 GVNVTGIHRVADPLAAIEKAEIIIVGGGNTFQLLKESRERGLLAPMADRVKRGALYIGWS -----1111----------------------------------------1111------- AGANLACPTIRTTNDMPIVDPNGFDALDLFPLQINPHFTNTREQRIRELLVVAPELTVIG ---------1111---------------------------------------1111---- LPEGNWIQVSNGQAVLGGPNTTWVFKAGEEAVALEAGHRF 
-2222----%%%%------------2222-----2222-- >INTERFERON-GAMMA; SWP:P01579; PDB:1FYHA; MQDPYVKEAENLKKYFNAGHSDVADNGTLFLGILKNWKEESDRKIMQSQIVSFYFKLFKN ---------------------3333----33333333-------------------3333 FKDDQSIQKSVETIKEDMNVKFFNSNKKKRDDFEKLTNYSVTDLNVQRKAIDELIQVMAE 1111------------------%%%%---------1111-------------------11 LGANVSGEFVKEAENLKKYFNDNGTLFLGILKNWKEESDRKIMQSQIVSFYFKLFKNFKD 11------------------------3333------------------------------ DQSIQKSVETIKEDMNVKFFNSNKKKRDDFEKLTNYSVTDLNVQRKAIHELIQVMAELSP -----------------1111--------------------------------------- AA -- >Interferon-gamma receptor; SWP:P15260; PDB:1FYHB; VPTPTNVTIESYNMNPIVYWEYQIMPQVPVFTVEVKNYGVKNSEWIDACINISHHYCNIS -----------%%%%---------------------2222------------------11 DHVGDPSNSLWVRVKARVGQKESAYAKSEEFAVCRDGKIGPPKLDIRKEEKQIMIDIFHP 11--1111---------!!!!----------3333------------------------3 SVFVETTCYIRVYNVYVRMNGSEIQYKILTQKEDDCDEIQCQLAIPVSSLNSQYCVSAEG 333---------------!!!!----------3333-----------%%%%--------- VLHVWGVTTEKSKEVCITIFN --------------------- >MULTIFUNCTIONAL AMINOACYL; SWP:P07814; PDB:1FYJA; DSLVLYNRVAVQGDVVRELKAKKAPKEDVDAAVKQLLSLKAEYKEKTGQEYKPGNPP ------------------------------------------3333----------- >TOLL-LIKE RECEPTOR 1; SWP:Q15399; PDB:1FYVA; NIPLEELQRNLQFHAFISYSGHDSFWVKNELLPNLEKEGQICLHERNFVPGKSIVENIIT -------------------3333-----------3333---------------------- CIEKSYKSIFVLSPNFVQSEWCHYELYFAHHNLFHEGSNSLILILLEPIPQYSIPSSYHK -1111--------------------3333--------------------1111-333333 LKSLARRTYLEWPKEKSKRGLFWANLRAAINIKLTEQAK 33------------3333--------------------- >TOLL-LIKE RECEPTOR 2; SWP:O60603; PDB:1FYXA; SRNIYDAFVSYSERDAYWVENLMVQELENFNPPFKLLHKRDFIHGKWIIDNIIDSIEKSH -----------3333-----------1111-----------------3333----1111- KTVFVLSENFVKSEWKYELDFSHFRLFDENNDAAILILLEPIEKKAIPQRFKLRKIMNTK -----------------------1111---------------3333-------------- TYLEWPMDEAQREGFWVNLRAAIKS -------3333-------------- >FIBRIN; SWP:Q6NSD8; PDB:1FZCA; 
IEVLKRKVIEKVQHIQLLQKNVRAQLVDMKRLEVDIDIKIRSCRGSCSRALAREVDLKDY 1111---3333-----------------------------1111---------------- EDQQKQLEQVIAKD -------------- >Fibrinogen beta chain [Pr; SWP:P02675; PDB:1FZCB; LYIDETVNSNIPTNLRVLRSILENLRSKIQKLESDVSAQMEYCRTPCTVSCNIPVVSGKE ----3333---------------------------------------------------- CEEIIRKGGETSEMYLIQPDSSVKPYRVYCDMNTENGGWTVIQNRQDGSVDFGRKWDPYK -------------------3333------------------------------------- QGFGNVATNTDGKNYCGLPGEYWLGNDKISQLTRMGPTELLIEMEDWKGDKVKAHYGGFT -------------------------------1111------------------------- VQNEANKYQISVNKYRGTAGNALMDGASQLMGENRTMTIHNGMFFSTYDRDNDGWLTSDP --3333--------------------1111!!!!1111-2222---1111--------33 RKQCSKEDGGGWWYNRCHAANPNGRYYWGGQYTWDMAKHGTDDGVVWMNWKGSWYSMRKM 333333--------------------2222--11111111------3333---------- SMKIRPFF -------- >FIBRINOGEN-420; SWP:P02671; PDB:1FZDA; GGWLLIQQRMDGSLNFNRTWQDYKRGFGSLNDEGEGEFWLGNDYLHLLTQRGSVLRVELE ------------------------------1111------------1111---------- DWAGNEAYAEYHFRVGSEAEGYALQVSSYEGTAGDALIEGSVEEGAEYTSHNNMQFSTFD ----------------3333--------------------3333-1111-2222---111 RDADQWEENCAEVYGGGWWYNNCQAANLNGIYYPGGSYDPRNNSPYEIENGVVWVSFRGA 1-------3333--------------------------3333-----------3333--- DYSLRAVRMKIRPLVTQ ------------3333- >Transcriptional regulator; SWP:P0C1U6; PDB:1FZPD; AITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIVVKAV -------3333-------------------1111--3333--------3333---1111- KILSQEDYFDKKRNEHDERTVLILVNAQQRKKIESLLSRV ----------------------------------3333-- >ADP-RIBOSYLATION FACTOR-L; SWP:Q9WUL7; PDB:1FZQA; GLLSILRKLKSAPDQEVRILLLGLDNAGKTTLLKQLASEDISHITPTQGFNIKSVQSQGF ---3333----------------2222-------------------2222------iiii KLNVWDIGGQRKIRPYWRSYFENTDILIYVIDSADRKRFEETGQELTELLEEEKLSCVPV -------------------3333--------11111111------------3333----- LIFANKQDLLTAAPASEIAEGLNLHTIRDRVWQIQSCSALTGEGVQDGMNWVCKNV -----3333-------------1111---------------2222------1111- >PHOSPHOGLYCERATE MUTASE; SWP:P36623; PDB:1FZTA; 
MTTEAAPNLLVLTRHGESEWNKLNLFTGWKDPALSETGIKEAKLGGERLKSRGYKFDIAF -----------------3333----------------------------3333------- TSALQRAQKTCQIILEEVGEPNLETIKSEKLNERYYGDLQGLNKDDARKKWGAEQVQIWR -------------------3333----3333----!!!!--------------------- RSYDIAPPNGESLKDTAERVLPYYKSTIVPHILKGEKVLIAAHGNSLRALIMDLEGLTGD ------2222-------------------3333------------------------111 QIVKRELATGVPIVYHLDKDGKYVSKELIDN 1----------------1111---------- >PLACENTA GROWTH FACTOR; SWP:P49763; PDB:1FZVA; SSEVEVVPFQEVWGRSYCRALERLVDVVSEYPSEVEHMFSPSCVSLLRCTGCCGDENLHC -------------------------3333-------------------------1111-- VPVETANVTMQLLKIRSGDRPSYVELTFSQHVRCECRPLR ---------------------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P21734; PDB:1FZYA; SRAKRIMKEIQAVKDDPAAHITLEFVSESDIHHLKGTFLGPPGTPYEGGKFVVDIEVPME ------------11111111-----------------------1111----------111 YPFKPPKMQFDTKVYHPNISSVTGAICLDILKNAWSPVITLKSALISLQALLQSPEPNDP 1--------------1111--------3333----3333----------------1111- QDAEVAQHYLRDRESFNKTAALWTRLYAS ----------------------------- >HEMOGLOBIN ALPHA CHAIN; SWP:P01966; PDB:1G08A; VLSAADKGNVKAAWGKVGGHAAEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGA ------------3333!!!!-------------------1111-----2222-------- KVAAALTKAVEHLDDLPGALSELSDLHAHKLRVDPVNFKLLSHSLLVTLASHLPSDFTPA -----------11113333----------------------------------1111--- VHASLDKFLANVSTVLTSKYR --------------1111--- >Hemoglobin subunit beta; SWP:P02070; PDB:1G08B; MLTAEEKAAVTAFWGKVKVDEVGGEALGRLLVVYPWTQRFFESFGDLSTADAVMNNPKVK -------------1111-----------------------3333---------------- AHGKKVLDSFSNGMKHLDDLKGTFAALSELHCDKLHVDPENFKLLGNVLVVVLARNFGKE ---------------1111------------------3333---------------!!!! 
FTPVLQADFQKVVAGVANALAHRYH --------------------1111- >ENDOGLUCANASE; SWP:P19424; PDB:1G0CA; PAGMQAVKSPSEAGALQLVELNGQLTLAGEDGTPVQLRGMSTHGLQWFGEIVNENAFVAL ---1111-3333--------%%%%----1111-----------11113333--------- SNDWGSNMIRLAMYIGENGYATNPEVKDLVYEGIELAFEHDMYVIVDWHVHAPGDPRADV -1111----------%%%%-----------------------------------111111 YSGAYDFFEEIADHYKDHPKNHYIIWELANEPSPNNNGGPGLTNDEKGWEAVKEYAEPIV 11------------1111-3333------------2222--------------------- EMLREKGDNMILVGNPNWSQRPDLSADNPIDAENIMYSVHFYTGSHGASHIGYPEGTPSS ---------------%%%%-----3333-------------1111--------2222333 ERSNVMANVRYALDNGVAVFATEWGTSQANGDGGPYFDEADVWLNFLNKHNISWANWSLT 3-----------1111-----------1111----------------1111--------- NKNEISGAFTPFELGRTDATDLDPGANQVWAPEELSLSGEYVRARIKGIEYTPIDRTK ---3333-----------------1111--1111------------------------ >PROTEIN-GLUTAMINE GAMMA-G; SWP:P52181; PDB:1G0DA; GLIVDVNGRSHENNLAHRTREIDRERLIVRRGQPFSITLQCSDSLPPKHHLELVLHLGKR --------3333--11113333-------------------------------------- DEVVIKVQKEHGARDKWWFNQQGAQDEILLTLHSPANAVIGHYRLAVLVMSPDGHIVERA ----------------------------------1111------------1111------ DKISFHMLFNPWCRDDMVYLPDESKLQEYVMNEDGVIYMGTWDYIRSIPWNYGQFEDYVM ---------1111--1111---------------------1111--------1111---- DICFEVLDNSPAALKNSEMDIEHRSDPVYVGRTITAMVNSNGDRGVLTGRWEEPYTDGVA ---------3333----------------------------------------------1 PYRWTGSVPILQQWSKAGVRPVKYGQCWVFAAVACTVLRCLGIPTRPITNFASAHDVDGN 111----3333---1111-------3333------------------------3333--- LSVDFLLNERLESLDSRQRSDSSWNFHCWVESWMSREDLPEGNDGWQVLDPTPQELSDGE -------1111---3333-----------------1111--------------------- FCCGPCPVAAIKEGNLGVKYDAPFVFAEVNADTIYWIVQKDGQRRKITEDHASVGKNIST ----------1111------------------------1111------------------ KSVYGNHREDVTLHYKYPEGSQKEREVYKKAGRRVTRLQLSIKHAQPVFGTDFDVIVEVK ----------3333---2222--------------------------------------- NEGGRDAHAQLTMLAMAVTYNSLRRGECQRKTISVTVPAHKAHKEVMRLHYDDYVRCVSE -------------------------------------------------3333-----11 
HHLIRVKALLDAPGPIMTVANIPLSTPELLVQVPGKAVVWEPLTAYVSFTNPLPVPLKGG 11-----------------------------------2222------------------- VFTLEGAGLLSATQIHVNGAVAPSGKVSVKLSFSPMRTGVRKLLVDFDSDRLKDVKGVTT -----2222------------2222-----------------------1111-------- VVVHKK ------ >INOSITOL MONOPHOSPHATASE; SWP:Q57573; PDB:1G0HA; MKWDEIGKNIAKEIEKEILPYFGRKDKSYVVGTSPSGDETEIFDKISEDIALKYLKSLNV -3333------------3333--3333------1111----------------3333--- NIVSEELGVIDNSSEWTVVIDPIDGSFNFINGIPFFAFCFGVFKNNEPYYGLTYEFLTKS ----------------------------1111-----------%%%%-------3333-- FYEAYKGKGAYLNGRKIKVKDFNPNNIVISYYPSKKIDLEKLRNKVKRVRIFGAFGLEMC ----2222---iiii-------1111---------------------------------- YVAKGTLDAVFDVRPKVRAVDIASSYIICKEAGALITDENGDELKFDLNATDRLNIIVAN -----------------3333-1111---1111--------------------------- SKEMLDIILDLL 3333-3333--- >TRIHYDROXYNAPHTHALENE RED; SWP:Q12634; PDB:1G0OA; KYDAIPGPLGPQSASLEGKVALVTGAGRGIGREMAMELGRRGCKVIVNYANSTESAEEVV -----------11112222----------------------------------------- AAIKKNGSDAACVKANVGVVEDIVRMFEEAVKIFGKLDIVCSNSGVVSFGHVKDVTPEEF ---1111--------1111----------3333-----------------3333------ DRVFTINTRGQFFVAREAYKHLEIGGRLILMGSITGQAKAVPKHAVYSGSKGAIETFARC ----------------------2222--------1111---------------------- MAIDMADKKITVNVVAPGGIKTDMYHAVCREYIPNGENLSNEEVDEYAAVQWSPLRRVGL -----------------------------------1111-------------1111---3 PIDIARVVCFLASNDGGWVTGKVIGIDGGACM 333---------3333----------%%%%-- >HYPOTHETICAL 23.7 KDA PRO; SWP:P36651; PDB:1G0SA; MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVRREIFERGHAA -----------1111------------------------1111----------------- VLLPFDPVRDEVVLIEQIRIAAYDTSETPWLLEMVAGMIEEGESVEDVARREAIEEAGLI ------1111--------3333-----------------2222----------------- VKRTKPVLSFLASPGGTSERSSIMVGEVDATTASNEDIRVHVVSREQAYQWVEEGKIDNA ------------3333------------3333---------------------------- ASVIALQWLQLHHQALKNEWA --------------------- >CREATINE KINASE; SWP:Q9XSC6; PDB:1G0WA; PFSNSHNTLKLRFPAEDEFPDLSGHNNHMAKVLTPELYAELRAKSTPSGFTVDDVIQTGV 
---------11113333----1111-3333----------1111-1111------3333- DNPGHPYIMTVGCVAGDEESYDVFKELFDPIIEDRHGGYKPTDEHKTDLNPDNLQGGDDL ----1111--------3333---3333------------1111------1111------- DPNYVLSSRVRTGRSIRGFCLPPHCSRGERRAIEKLAVEALSSLDGDLAGRYYALKSMTE 3333-----------2222---------------------1111!!!!-----3333--- AEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDNKTFLVWINEEDHLRVISMQ -----------------33331111-22222222----1111------------------ KGGNMKEVFTRFCNGLTQIETLFKSKNYEFMWNPHLGYILTCPSNLGTGLRAGVHIKLPH -----------------------1111--------------1111!!!!----------- LGKHEKFSEVLKRLRLQKRGVGGVFDVSNADRLGFSEVELVQMVVDGVKLLIEMEQRLEQ ---1111----------------------------------------------------- GQAIDDLMPAQK ----1111---- >LEUCOCYTE IMMUNOGLOBULIN-; SWP:Q8NHL6; PDB:1G0XA; HLPKPTLWAEPGSVITQGSPVTLRCQGTQEYRLYREKKTAPWITRIPQELVKKGQFPIPS ---------------2222--------------------3333---33331111------ ITWEHAGRYRCYYGSDTAGRSESSDPLELVVTGAYIKPTLSAQPSPVVNSGGNVTLQCDS -3333----------1111-----------------------------2222-------- QVAFDGFILCKEGEHPQCLNSQPHARGSSRAIFSVGPVSPSRRWWYRCYAYDSNSPYEWS ---------------------1111-------------3333---------3333----- LPSDLLELLVLG ------------ >PEPTIDYL-LYS METALLOENDOP; SWP:P81054; PDB:1G12A; TYNGCSSSEQSALAAAASAAQSYVAESLSYLQTHTAATPRYTTWFGSYISSRHSTVLQHY ----------------------------------------3333---------------- TDMNSNDFSSYSFDCTCTAAGTFAYVYPNRFGTVYLCGAFWKAPTTGTDSQAGTLVHESS --11113333--------1111----1111-------3333------------------- HFTRNGGTKDYAYGQAAAKSLATMDPDKAVMNADNHEYFSENNPAQS -3333--------------------3333------------------ >RAS-RELATED PROTEIN SEC4; SWP:P07560; PDB:1G16A; SIKILLIGDSGVGKSCLLVRFVEDKFNPIDFKIKTVDINGKKVKLQIWDTAGQERFRTIT --------2222-------------------------%%%%--------iiii------- TAYYRGAGIILVYDITDERTFTNIKQWFKTVNEHANDEAQLLLVGNKSDETRVVTADQGE 3333---------1111------------------3333--------------------- ALAKELGIPFIESSAKNDDNVNEIFFTLAKLIQEKI --------------1111-3333------------- >RECA PROTEIN; SWP:P26345; PDB:1G19A; MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTGSIALDVALGIGGLPRG 
---3333---------------1111--1111---------------------------- RVIEIYGPESSGKTTVALHAVANAQAAGGVAAFIDAEHALDPDYAKKLGVDTDSLLVSQP ------------------------1111-----------------3333-3333------ DTGEQALEIADMLIRSGALDIVVIDSVAALVPRAELEGEHVGLQARLMSQALRKMTGALN -------------------------3333-----------------------------11 NSGTTAIFINQLTGGKALKFYASVRMDVRRVETLKDGTNAVGNRTRVKVVKNKCLAPFKQ 11---------------------------------------------------------- AEFDILYGKGISREGSLIDMGVDQGLIRKSGAWFTYEGEQLGQGKENARNFLVENADVAD -----------3333----------------------------------------3333- EIEKKIKEKLG ----------- >IMMUNOGLOBULIN-LIKE DOMAI; SWP:Q8WZ42; PDB:1G1CA; SMEAPKIFERIQSQTVGQGSDAHFRVRVVGKPDPECEWYKNGVKIERSDRIYWYWPEDNV ---------------------------------------iiii-------------1111 CELVIRDVTGEDSASIMVKAINIAGETSSHAFLLVQAK --------3333---------1111------------- >Paired amphipathic helix ; SWP:Q60520; PDB:1G1EB; SLQNNQPVEFNHAINYVNKIKNRFQGQPDIYKAFLEILHTYQKEQRNAKEAGGNYTPALT --------------------------3333------------------------------ EQEVYAQVARLFKNQEDLLSEFGQFLPDA ----------------------3333--- >SCAFFOLDING PROTEIN; SWP:Q45996; PDB:1G1KA; ASLKVTVGTANGKPGDTVTVPVTFADVAKMKNVGTCNFYLGYDASLLEVVSVDAGPIVKN ------------2222---------3333-------------1111--------1111-3 AAVNFSSSASNGTISFLFLDNTITDELITADGVFANIKFKLKSVTAKTTTPVTFKDGGAF 333-------------------1111---------------------------------- GDGTMSKIASVTKTNGSVTIDPG -1111------------------ >CONOTOXIN EVIA; SWP:P60513; PDB:1G1PA; DDCIKYGFCSLPILKNGLCCSGACVGVCADL ------------------1111--------- >E-SELECTIN; SWP:P16581; PDB:1G1TA; WSYNTSTEAMTYDEASAYCQQRYTHLVAIQNKEEIEYLNSILSYSPSYYWIGIRKVNNVW --------------------------------------------1111-------%%%%- VWVGTQKPLTEEAKNWAPGEPNNRQKDEDCVEIYIKREKDVGMWNDERCSKKKLALCYTA ---------3333---2222----2222-----------2222----1111--------- ACTNTSCSGHGECVETINNYTCKCDPGFSGLKCEQIV --1111iiii--------------2222-1111---- >30S ribosomal protein S15; SWP:P80378; PDB:1G1XB; PITKEEKQKVIQEFARFPGDTGSTEVQVALLTLRINRLSEHLKVHKKDHHSHRGLLMMVG 
----------------------------------------33331111------------ QRRRLLRYLQREDPERYREIVEKLGLRG -------------3333----1111--- >30S RIBOSOMAL PROTEIN S6; SWP:NA; PDB:1G1XC; DLRDYRNVEVLKRFLILPRTGLSGKEQRILAKTIKRARILGLLPFT --------3333----------3333-------------------- >CDK-ACTIVATING KINASE ASS; SWP:P51948; PDB:1G25A; MDDQGCPRCKTTKYRNPSLKLMVNVCGHTLCESCVDLLFVRGAGNCPECGTPLRKSNFRV -!!!!-------------------------1111----1111------------------ QLFED ----- >GRANULIN A; SWP:P28799; PDB:1G26A; VVHCDMEVICPDGYTCCRLPSGAWGCCPFTQ ----------3333----------------- >Maltose transport protein; SWP:Q9YGA6; PDB:1G291; MAGVRLVDVWKVFGEVTAVREMSLEVKDGEFMILLGPSGCGKTTTLRMIAGLEEPSRGQI ------------!!!!----------2222------2222-------------------- YIGDKLVADPEKGIFVPPKDRDIAMVFQSYALYPHMTVYDNIAFPLKLRKVPRQEIDQRV -!!!!---3333----3333---------------------------------------- REVAELLGLTELLNRKPRELSGGQRQRVALGRAIVRKPQVFLMDEPLSNLDAKLRVRMRA ----11113333---3333-------------------------1111------------ ELKKLQRQLGVTTIYVTHDQVEAMTMGDRIAVMNRGVLQQVGSPDEVYDKPANTFVAGFI ---------------------------------iiii----------------------- GSPPMNFLDAIVTEDGFVDFGEFRLKLLPDQFEVLGELGYVGREVIFGIRPEDLYDAMFA ------------1111-----------------------2222------3333--3333- QVRVPGENLVRAVVEIVENLGSERIVRLRVGGVTFVGSFRSESRVREGVEVDVVFDMKKI -----------------------------!!!!------1111--2222------3333- HIFDKTTGKAIF ------------ >FUSION PROTEIN (F); SWP:NA; PDB:1G2CA; LEGEVNKIKSALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDKQLLPIVNK -1111--------------------------------------------- ---------------------------------------- >Early growth response pro; SWP:P08046; PDB:1G2DC; MERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQHTGLNQHIRTHTGE --------3333------------3333-------------------------------- KPFACDICGRKFATLHTRDRHTKIHLRQK ---------------------3333---- >Early growth response pro; SWP:P08046; PDB:1G2FC; MERPYACPVESCDRRFSQKTNLDTHIRIHTGQKPFQCRICMRNFSQQASLNAHIRTHTGE --------3333------------------------------------------------ KPFACDICGRKFATLHTRTRHTKIHLRQK 
---------------------1111---- >TRANSCRIPTIONAL REGULATOR; SWP:P44694; PDB:1G2HA; SAVISLDEFENKTLDEIIGFYEAQVLKLFYAEYPSTRKLAQRLGVSHTAIANKLKQYGIG ----1111-------------------3333----------------------------- K - >PROTEASE I; SWP:O59413; PDB:1G2IA; KVLFLTANEFEDVELIYPYHRLKEEGHEVYIASFERGTITGKHGYSVKVDLTFDKVNPEE ------2222-1111-------1111--------------1111-------3333-3333 FDALVLPGGRAPERVRLNEKAVSIARKFSEGKPVASICHGPQILISAGVLRGRKGTSYPG ---------33333333----------1111-----!!!!----3333-2222----333 IKDDINAGVEWVDAEVVVDGNWVSSRVPADLYAWREFVKLLK 3-----------------!!!!----33333333-------- >ULTRASPIRACLE PROTEIN; SWP:Q7SIF6; PDB:1G2NA; AAVQELSIERLLEMESLVADPSEEFQFLRVGPDSNVPPKFRAPVSSLCQIGNKQIAALVV -------------3333-------------1111--3333-------------------- WARDIPHFSQLEMEDQILLIKGSWNELLLFAIAWRSMEFLTEETTSPPQLMCLMPGMTLH ----2222--------------------------3333---------------2222--3 RNSALQAGVGQIFDRVLSELSLKMRTLRVDQAEYVALKAIILLNPDVKGLKNRQEVEVLR 333----------------------------------------1111----3333----- EKMFLCLDEYCRRSRSSEEGRFAALLLRLPALRSISLKSFEHLFFFHLVADTSIAGYIRD --------------3333-------------------------------1111------- ALRNHA ------ >PURINE NUCLEOSIDE PHOSPHO; SWP:O53359; PDB:1G2OA; DPDELARRAAQVIADRTGIGEHDVAVVLGSGWLPAVAALGSPTTVLPQAELPGFVPPTAA ----------------------------22221111----------33332222----22 GHAGELLSVPIGAHRVLVLAGRIHAYEGHDLRYVVHPVRAARAAGAQIMVLTNAAGGLRA 22--------!!!!---------3333---3333-------1111-------------11 DLQVGQPVLISDHLNLTARSPLVGGEFVDLTDAYSPRLRELARQSDPQLAEGVYAGLPGP 112222---------------------------------------1111----------- HYETPAEIRMLQTLGADLVGMSTVHETIAARAAGAEVLGVSLVTNLAAGITGEPLSHAEV -----------1111---------------1111------------2222---------- LAAGAASATRMGALLADVIARF ------------------1111 >ADENINE PHOSPHORIBOSYLTRA; SWP:P49435; PDB:1G2QA; MPIASYAQELKLALHQYPNFPSEGILFEDFLPIFRNPGLFQKLIDAFKLHLEEAFPEVKI ----------1111-------2222----3333---------------------1111-- DYIVGLESRGFLFGPTLALALGVGFVPVRKAGKLPGECFKATYEKEYGSDLFEIQKNAIP 
-------3333------------------2222---------------------1111-2 AGSNVIIVDDIIATGGSAAAAGELVEQLEANLLEYNFVMELDFLKGRSKLNAPVFTLL 222------------------------------------------------------- >HYPOTHETICAL CYTOSOLIC PR; SWP:Q97S59; PDB:1G2RA; RKIPLRKSVVSNEVIDKRDLLRIVKNKEGQVFIDPTGKANGRGAYIKLDNAEALEAKKKK ---------------3333------1111----1111----------------------- VFNRSFSMEVEESFYDELIAYVDHKVKRRELGLE ----------3333--------------1111-- >EOTAXIN-3; SWP:Q9Y258; PDB:1G2SA; TRGSDISKTCCFQYSHKPLPWTWVRSYEFTSNSCSQRAVIFTTKRGKKVCTHPRKKWVQK -------------------3333-------3333--------1111-----3333----- YISLLKTPKQL ----------- >PHOSPHOLIPASE A2; SWP:Q9DF52; PDB:1G2XA; NLQQFKNMIQCAGTRTWTAYINYGCYCGKGGSGTPVDKLDRCCYTHDHCYNQADSIPGCN 3333-----------3333----------------------------------------3 PNIKTYSYTCTQPNITCTRTADACAKFLCDCDRTAAICFASAPYNINNIMISASNSCQ 333-----------------------------------------3333--1111---- >GP31; SWP:P17313; PDB:1G31A; QQLPIRAVGEYVILVSEPAQAGDEEVTESGLIIGKRVQGEVPELCVVHSVGPDVPEGFCE -------!!!!--------3333----2222-------------------11112222-2 VGDLTSLPVGQIRNVPHPFVALGLKQPKEIKQKFVTCHYKAIPCLYK 222----3333--------------3333--------3333------ >MODIFICATION METHYLASE TA; SWP:P14385; PDB:1G38A; VETPPEVVDFMVSLAEAPRGGRVLEPACAHGPFLRAFREAHGTGYRFVGVEIDPKALDLP -----------1111--2222------!!!!---------------------3333---1 PWAEGILADFLLWEPGEAFDLILGNPPYGIVGEASKYPIHVFKAVKDLYKKAFSTWKGKY 111-----3333--------------------3333-----3333-------1111!!!! 
NLYGAFLEKAVRLLKPGGVLVFVVPATWLVLEDFALLREFLAREGKTSVYYLGEVFPQKK ----------11112222-------3333-3333---------------------2222- VSAVVIRFQKSGKGLSLWDTQESESGFTPILWAEYPHWEGEIIRFETEETRKLEISGMPL ----------------------1111--------------------------------33 GDLFHIRFAARSPEFKKHPAVRKEPGPGLVPVLTGRNLKPGWVDYEKNHSGLWMPKERAK 33--------3333---1111----2222----3333-2222------------333333 ELRDFYATPHLVVAHTKGTRVVAAWDERAYPWREEFHLLPKEGVRLDPSSLVQWLNSEAM 33--1111--------------------------------2222---------------- QKHVRTLYRDFVPHLTLRMLERLPVRREYGFHT ---------------3333------3333---- >BETA-CATENIN ARMADILLO RE; SWP:P35222; PDB:1G3JA; HAVVNLINYQDDAELATRAIPELTKLLNDEDQVVVNKAAVMVHQLSKKEASRHAIMRSPQ -------------------------1111------------------3333-3333-333 MVSAIVRTMQNTNDVETARCTAGTLHNLSHHREGLLAIFKSGGIPALVKMLGSPVDSVLF 3------------------------1111--------------------1111------- YAITTLHNLLLHQEGAKMAVRLAGGLQKMVALLNKTNVKFLAITTDCLQILAYGNQESKL ------------2222-------------1111------------------2222----- IILASGGPQALVNIMRTYTYEKLLWTTSRVLKVLSVCSSNKPAIVEAGGMQALGLHLTDP -------------1111------------------------------------1111--- SQRLVQNCLWTLRNLSDAATKQEGMEGLLGTLVQLLGSDDINVVTCAAGILSNLTCNNYK 3333----------11111111-----------3333----------------------- NKMMVCQVGGIEALVRTVLRAGDREDITEPAICALRHLTSRHQEAEMAQNAVRLHYGLPV --------------------!!!!------------1111-1111-------1111---- VVKLLHPPSHWPLIKATVGLIRNLALCPANHAPLREQGAIPRLVQLLVRAHQDTQRRFVE --------------------------3333---------------------1111---ii GVRMEEIVEGCTGALHILARDVHNRIVIRGLNTIPLFVQLLYSPIENIQRVAAGVLCELA ii3333----------------------1111-------1111----------------- QDKEAAEAIEAEGATAPLTELLHSRNEGVATYAAAVLFRMSE ------------------------------------------ >Transcription factor 7-li; SWP:P70062; PDB:1G3JB; PQLNSGGGDELGANDELIRFKDEGEQEEDLADVKSSLVNES --------1111----------------------------- >ATP-DEPENDENT PROTEASE HS; SWP:P43772; PDB:1G3KA; TTIVSVRRNGQVVVGGDGQVSLGNTVMKGNARKVRRLYNGKVLAGFAGGTADAFTLFELF -------iiii----------!!!!------------iiii------------------- 
ERKLEMHQGHLLKSAVELAKDWRTDRALRKLEAMLIVADEKESLIITGIGDVVQPEEDQI ------iiii---------------3333--------------------------3333- LAIGSGGNYALSAARALVENTELSAHEIVEKSLRIAGDICVFTNTNFTIEELP ---1111--------------------------------1111---------- >ESTROGEN SULFOTRANSFERASE; SWP:P49888; PDB:1G3MA; SELDYYEKFEEVHGILMYKDFVKYWDNVEAFQARPDDLVIATYPKSGTTWVSEIVYMIYK -----------iiii--3333------1111--1111--------------------111 EGDVEKCKEDVIFNRIPFLECRKENLMNGVKQLDEMNSPRIVKTHLPPELLPASFWEKDC 1--3333---1111--------------------------------3333-3333----- KIIYLCRNAKDVAVSFYYFFLMVAGHPNPGSFPEFVEKFMQGQVPYGSWYKHVKSWWEKG --------------------------------------1111-22223333--------- KSPRVLFLFYEDLKEDIRKEVIKLIHFLERKPSEELVDRIIHHTSFQEMKNNPSTNYTTL -1111---3333-------------1111----------------------3333-1111 PDEIMNQKLSPFMRKGITGDWKNHFTVALNEKFDKHYEQQMKESTLKFRT 3333-3333---------3333---------------------------- >V-cyclin; SWP:O40946; PDB:1G3NC; LCEDRIFYNILEIEPRFLTSDSVFGTFQQSLTSHMRKLLGTWMFSVCQEYNLEPNVVALA --3333--3333--1111-3333--------3333-----------------3333---- LNLLDRLLLIKQVSKEHFQKTGSACLLVASKLRSLTPISTSSLCYAAADSFSRQELIDQE ------1111---3333------------------------------------------- KELLEKLAWRTEAVLATDVTSFLLLKLVGGSQHLDFWHHEVNTLITKALVDPLTGSLPAS ------%%%%----3333----------------------------33333333------ IISAAGCALLVPANVIPQGVVPQLASILGCDVSVLQAAVEQILTSVSDFDLRI ------3333-3333----3333---------------------3333----- >MINOR COAT PROTEIN; SWP:P69168; PDB:1G3P; AETVESCLAKSHTENSFTNVKDDKTLDRYANYEGCLWNATGVVVCTGDETQCYGTWVPIG ------1111---------------------iiii----------1111----------- LAIPEYGDTPIPGYTYINPLDGTYPPGTEQNPANPNPSLEESQPLNTFMFQNNRFRNRQG -----------------1111------3333------------------%%%%----iii ALTVYTGTVTQGTDPVKTYYQYTPVSSKAMYDAYWNGKFRDCAFHSGFNEDIFVCEYQGQ i-----------------------------------1111-------------------- SSDLPQPPVNA ----------- >CELL DIVISION INHIBITOR; SWP:Q8U3I1; PDB:1G3QA; MGRIISIVSGKGGTGKTTVTANLSVALGDRGRKVLAVDGDLTMANLSLVLGVDDPDVTLH ---------------------------1111--------1111----1111------333 
DVLAGEANVEDAIYMTQFDNVYVLPGAVDWEHVLKADPRKLPEVIKSLKDKFDFILIDCP 31111--3333------2222---------------3333-------1111--------- AGLQLDAMSAMLSGEEALLVTNPEISCLTDTMKVGIVLKKAGLAILGFVLNRYGRSDRDI ---------1111--------------------------------------22221111- PPEAAEDVMEVPLLAVIPEDPAIREGTLEGIPAVKYKPESKGAKAFVKLAEEIEKLA -------------------3333--------3333-1111----------------- >HEAT SHOCK PROTEIN HSLU; SWP:P43773; PDB:1G41A; SEMTPREIVSELDQHIIGQADAKRAVAIALRNRWRRMQLQEPLRHEVTPKNILMIGPTGV -----------------------------------1111----3333------------- GKTEIARRLAKLANAPFIKVEATKFTVGKEVDSIIRDLTDSAMKLVRQQEIAKNRLIDDE ----------1111------3333-----3333-----------------------1111 AAKLINPEELKQKAIDAVEQNGIVFIDEIDKICKKGEYSGADVSREGVQRDLLPLVEGST -----3333------------------1111----------------------3333--- VSTKHGMVKTDHILFIASGAFQVARPSDLIPELQGRLPIRVELTALSAADFERILTEPHA --1111---1111-----------3333-33331111----------------------- SLTEQYKALMATEGVNIAFTTDAVKKIAEAAFRVNEKTENIGARRLHTVMERLMDKISFS ----------------------------------------!!!!---------------3 ASDMNGQTVNIDAAYVADALGEVVENEDLSRFIL 3332222--------------33333333----- >SCAFFOLDING PROTEIN; SWP:Q45996; PDB:1G43A; AGTGVVSVQFNNGSSPASSNSIYARFKVTNTSGSPINLADLKLRYYYTQDADKPLTFWCD ------------------------------------3333-------------------- HAGYMSGSNYIDATSKVTGSFKAVSPAVTNADHYLEVALNSDAGSLPAGGSIEIQTRFAR ------------1111-----------2222--------1111---2222---------1 NDWSNFDQSNDWSYTAAGSYMDWQKISAFVGGTLAYGSTP 111---33331111---------------iiii------- >PINCH PROTEIN; SWP:P48059; PDB:1G47A; MANALASATCERCKGGFAPAEKIVNSNGELYHEQCFVCAQCFQQFPEGLFYEFEGRKYCE ------------------3333------------------------------2222---- HDFQMLFAPC ---------- >REPRESSOR PROTEIN C; SWP:NA; PDB:1G4DA; KSIWCSPQEIMAADGMPGSVAGVHYRANVQGWTKRKKEGVKGGKAVEYDVMSMPTKEREQ -----33331111-----------------------------------1111-------- VIAHLGLST --------- >PHOSPHOLIPASE A2; SWP:P00593; PDB:1G4IA; ALWQFNGMIKCKIPSSEPLLDFNNYGCYCGLGGSGTPVDDLDRCCQTHDNCYKQAKKLDS ------------11111111---------------------------------------- 
CKVLVDNPYTNNYSYSCSNNEITCSSENNACEAFICNCDRNAAICFSKVPYNKEHKNLDK -1111-3333-------%%%%---3333-----------------1111--3333---33 KNC 33- >BETA-ARRESTIN1; SWP:P17870; PDB:1G4MA; GTRVFKKASPNGKLTVYLGKRDFVDHIDLVEPVDGVVLVDPEYLKERRVYVTLTCAFRYG --------1111---------------------------3333----------------- REDLDVLGLTFRKDLFVANVQSFPPAPEDKKPLTRLQERLIKKLGEHAYPFTFEIPPNLP 1111-2222-----------------------------------1111-------1111- CSVTLQPGPEDTGKACGVDYEVKAFCAENLEEKIHKRNSVRLVIRKVQYAPERPGPQPTA -------11111111-------------------3333---------------------- ETTRQFLMSDKPLHLEASLDKEIYYHGEPISVNVHVTNNTNKTVKKIKISVRQYADICLF ------------------------2222-------------------------------- NTAQYKCPVAMEEADDTVAPSSTFCKVYTLTPFLANNREKRGLALDGKLKHEDTNLASST --------------------------------33331111-------3333--------- LLREGANREILGIIVSYKVKVKLVVSRGSDVAVELPFTLMHPKPKDDDIVFEDFAR ------3333---------------------------------------------- >Effector protein sptP; SWP:P74873; PDB:1G4US; SKQPLLDIALKGLKRTLPQLEQMDGNSLRENFQEMASGNGPLRSLMTNLQNLNKIPEAKQ ---------------33331111----11113333-!!!!----------11113333-- LNDYVTTLTNIQVGVARFSQWGTCGGEVERWVDKASTHELTQAVKKIHVIAKELKNVTAE ----------------3333---------------------------------------- LEKIEAGAPMPQTMSGPTLGLARFAVSSIPINQQTQVKLSDGMPVPVNTLTFDGKPVALA --3333-----------iiii---3333----1111--1111---------iiii----- GSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQAKQLPPYFRGSYTFGEVHTNSQKV -----------------------------------1111--1111----!!!!------- SSASQGEAIDQYNMQLSCGEKRYTIPVLHVKNWPDHQPLPSTDQLEYLADRVKNKHLPMI -----------------!!!!------------2222----------------------- HCLGGVGRTGTMAAALVLKDNPHSNLEQVRADFRDSRNNRMLEDASQFVQLKAMQAQLLM ------3333------33331111-------------1111--3333------------- >Effector protein sptP; SWP:P74873; PDB:1G4WR; LLDIALKGLKRTLPQLEQMDGNSLRPLRSLMTNLQNLNKQLNDYVTTLTNIQVGVARFSQ 3333-1111---3333---3333--3333---------1111--------------1111 WEVERWVDASTHELTQAVKKIHVIAKELKNVTAELEKIPQTMSGPTLGLARFAVSSIPIN -3333----3333---------------------------------------3333---3 
QQTQVKLSDGMPVPVNTLTFDGKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSE 333---1111---------iiii-----------------------------------33 DQMQAKQLPPYFRGSYTFGEVHTNSQKVSSASQGEAIDQYNMQLSCGEKRYTIPVLHVKN 33-------1111----!!!!------3333--------------!!!!----------- WPDHQPLPSTDQLEYLADRVKNSNQNGAPGRSSSDKHLPMIHCLGGVGRTGTMAAALVLK -2222---3333---------------2222-1111------------------------ DNPHSNLEQVRADFRDSRNNRMLEDASQFVQLKAMQAQLLM ------------------1111------------------- >Small conductance calcium; SWP:P70604; PDB:1G4YB; DTQLTKRVKNAAANVLRETWLIYKNTKLVKKIDHAKVRKHQRKFLQAIHQLRSVKMEQRK ------------------------------------------------------------ LNDQANTLVDLAKTQLHHHHH -----------3333------ >DNA CYTOSINE METHYLTRANSF; SWP:O14717; PDB:1G55A; EPLRVLELYSGVGGMHHALRESCIPAQVVAAIDVNTVANEVYKYNFPHTQLLAKTIEGIT ---------!!!!--------------------------------1111-----3333-- LEEFDRLSFDMILMSPPNSFLHILDILPRLQKLPKYILLENVKGFEVSSTRDLLIQTIEN ----3333-----------------1111------------2222--------------- GFQYQEFLLSPTSLGIPNSRLRYFLIAKLQSEPLPFQAPGQVLMEFPKLSVKMLKDFLED ---------------------------------11112222-----------3333--11 DTDVNQYLLPPKSLLRYALLLDIVQPTRRSVCFTKGYGSYIEGTGSVLQTAEDVQVENIY 113333----------1111-------------1111---2222---------------- KSLTNLSQEEQITKLLILKLRYFTPKEIANLLGFPPEFGFPEKITVKQRYRLLGNSLNVH --2222-----------------------1111-1111--3333--------1111---- VVAKLIKILYE ----------- >3,4-DIHYDROXY-2-BUTANONE ; SWP:P24199; PDB:1G57A; LLSSFGTPFERVENALAALREGRGVMVLDENEGDMIFPAETMTVEQMALTIRHGSGIVCL -3333-------------1111---------------3333------------------- CITEDRRKQLDLPMMVENNTSAYGTGFTVTIEAAEGVTTGVSAADRITTVRAAIADGAKP -------1111---------1111------------------------------222233 SDLNRPGHVFPLRAQAGGVLTRGGHTEATIDLMTLAGFKPAGVLCELTNDDGTMARAPEC 33------------2222---------------1111-----------1111-------- IEFANKHNMALVTIEDLVAYRQAHE ----1111----------------- >AMYLOSUCRASE; SWP:Q9ZEU2; PDB:1G5AA; SPNSQYLKTRILDIYTPEQRAGIEKSEDWRQFSRRMDTHFPKLMNELDSVYGNNEALLPM -----------1111-----------------------------------!!!!------ 
LEMLLAQAWQSYSQRNSSLKDIDIARENNPDWILSNKQVGGVCYVDLFAGDLKGLKDKIP ----------------------------1111--3333-----3333------------- YFQELGLTYLHLMPLFKCPEGKSDGGYAVSSYRDVNPALGTIGDLREVIAALHEAGISAV ----------------------%%%%---------3333-------------1111---- VDFIFNHTSNEHEWAQRCAAGDPLFDNFYYIFPDRRMPDQYDRTLREIFPDQHPGGFSQL --------1111-----11113333-----------------------3333-------1 EDGRWVWTTFNSFQWDLNYSNPWVFRAMAGEMLFLANLGVDILRMDAVAFIWKQMGTSCE 111-------1111---1111--------------1111-------3333---2222--- NLPQAHALIRAFNAVMRIAAPAVFFKSEAIVHPDQVVQYIGQDECQIGYNPLQMALLWNT -------------------3333--------3333-11111111---------------- LATREVNLLHQALTYRHNLPEHTAWVNYVRSHDDIGWTFADEDAAYLGISGYDHRQFLNR --------------------------------------------1111------------ FFVNRFDGSFARGVPFQYNPSTGDCRVSGTAAALVGLAQDDPHAVDRIKLLYSIALSTGG 1111-2222--------------------------3333-1111---------------- LPLIYLGDEVGTLNDDDWSQDSNKSDDSRWAHRPRYNEALYAQRNDPSTAAGQIYQDLRH ------1111------33331111----3333----33331111-1111----------- MIAVRQSNPRFDGGRLVTFNTNNKHIIGYIRNNALLAFGNFSEYPQTVTAHTLQAMPFKA -------3333-----------1111----%%%%--------------33331111---- HDLIGGKTVSLNQDLTLQPYQVMWLEIA -----------------2222------- >SERINE/THREONINE PROTEIN ; SWP:P03772; PDB:1G5BA; MRYYEKIDGSKYRNIWVVGDLHGCYTNLMNKLDTIGFDNKKDLLISVGDLVDRGAENVEC -------3333--------------------------1111------------------- LELITFPWFRAVRGNHEQMMIDGLSERGNVNHWLLNGGGWFFNLDYDKEILAKALAHKAD -----1111---------------1111--3333---3333-----------------11 ELPLIIELVSKDKKYVICHADYPFDEYEFGKPVDHQQVIWNRERISNSQNGIVKEIKGAD 11-------%%%%--------------2222--3333----------1111--------- TFIFGHTPAVKPLKFANQMYIDTGAVFCGNLTLIQVQGA -----------------------3333------------ >BETA-CARBONIC ANHYDRASE; SWP:Q50565; PDB:1G5CA; IIKDILRENQDFRFRDLSDLKHSPKLCIITCMDSRLIDLLERALGIGRGDAKVIKNAGNI --------1111---3333-------------1111----------2222---------- VDDGVIRSAAVAIYALGDNEIIIVGHTDCGMARLDEDLIVSRMRELGVEEEVIENFSIDV ------------------------------------------------------------ LNPVGDEEENVIEGVKRLKSSPLIPESIGVHGLIIDINTGRLKPLYLDE 
--------------------11113333--------------------- >FUSION PROTEIN; SWP:P35936; PDB:1G5GA; DGRPLAAAGIVVTGDKAVNIYTSSQTGSIIIKLLPNMPKDKEACAKAPLEAYNRTLTTLL --1111------------------------------------------------------ TPLGDSIRRIQESGLSQLAVAVGKMQQFVNDQFNKTAQELDCIKITQQVGVELNLYLTEL ---------33333333------------------------------------------- TTVFGPQITSPALTQLTIQALYNLAGGNMDYLLTKLGVGNNQLSSLISSGLITGNPILYD ----------------3333----iiii3333------1111----3333---------- SQTQLLGIQVTLPSVGNLNNMRATYLETLSVSTTKGFASALVPKVVTQVGSVIEELDTSY 1111--------------------------------------------!!!!-------- CIETDLDLYCTRIVTFPMSPGIYSCLSGNTSACMYSKTEGALTTPYMTLKGSVIANCKMT ---3333----------------------3333------3333-----%%%%---3333- TCRCADPPGIISQNYGEAVSLIDRQSCNILSLDGITLRLSGEFDATYQKNISIQDSQ ----------------------3333------------------------------- >MITOCHONDRIAL DNA POLYMER; SWP:Q9QZM2; PDB:1G5HA; EALVDLCRRRHFLSGTPQQLSTAALLSGCHARFGPLGVELRKNLASQWWSSVVFREQVFA ---------------3333--------------------------------1111----- VDSLHQEPGSSQPRDSAFRLVSPESIREILQDSKEQLVAFLENLLKTSGKLRATLLHGAL ----------3333---------------------------------------------- EHYVNCLDLVNRKLPFGLAQIGVCFHPVSTRVGEKTEASLVWFTPTRTSSQWLDFWLRHR --------------------------------------------3333------------ LLWWRKFASPSNFSSADCQDELGRKGSKLYYSFPWGKEPIETLWNLGDQELLHTYPGNVS ----1111-----------1111---------1111------------------222211 TIQGRDGRKNVVPCVLSVSGDVDLGTLAYLYDSFQLRKVLKLHPCLAPIKVALDVGKGPT 11---!!!!---------------------1111--------3333-------------- VELRQVCQGLLNELLENGISVWPGYSETVHSSLEQLHSKYDESVLFSVLVTETTLENGLI --------------1111----3333------------------------3333------ QLRSRDTTKEHISKLRDFLVKYLASASNVAAALDHHHHH ---3333---1111----------------1111----- >PROTEIN (APOPTOSIS REGULA; SWP:P10415; PDB:1G5MA; HAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDDVEENRTEAPEGTESEVVHLALRQAGDD -------3333---------3333------------------------3333-------- FSRRYRGDFAEMSSQLHLTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVESVN 3333-----3333-----1111----------1111------------------------ 
REMSPLVDNIALWMTEYLNRHLHTWIQDNGGWDAFVELYGPSMR ------------------------------3333---------- >EPIDERMIN MODIFYING ENZYM; SWP:P30197; PDB:1G5QA; MYGKLLICATASINVININHYIVELKQHFDEVNILFSPSSKNFINTDVLKLFCDNLYDEI -----------1111---------------------3333----33331111-----333 KDPLLNNINIVENHEYILVLPASANTINKIANGICDNLLTTVCLTGYQKLFIFPNMNIRM 3---------1111---------------1111------------3333-------3333 WGNPFLQKNIDLLKNNDVKVYSPDMNKSFEISSGRYKNNITMPNIENVLNFVLN ------------------------------------------------------ >COB(I)ALAMIN ADENOSYLTRAN; SWP:P31570; PDB:1G5TA; ERGIIIVFTGNGKGKTTAAFGTAARAVGHGKNVGVVQFIKGTWPNGERNLLEPHGVEFQV --------------------------1111--------------------3333------ MATGFTWETQNREADTAACMAVWQHGKRMLADPLLDMVVLDELTYMVAYDYLPLEEVISA -3333--3333--------------------1111------3333-1111---------- LNARPGHQTVIITGRGCHRDILDLADTVSELRPVKHA ----1111---------33333333------------ >PROFILIN; SWP:Q9LEI8; PDB:1G5UA; SWQTYVDDHLMCDIDGHRLTAAAIIGHDGSVWAQSSSFPQFKSDEVAAVMKDFDEPGSLA 1111------------------------------1111---------------------- PTGLHLGGTKYMVIQGEPGAVIRGKKGSGGITVKRTGQALIIGIYDEPLTPGQCNMIVER ----------------2222---------------------------------------3 LGDYLLDQGL 333------- >FATTY ACID-BINDING PROTEI; SWP:P05413; PDB:1G5WA; VDAFLGTWKLVDSKNFDDYMKSLGVGFATRQVASMTKPTTIIEKNGDILTLKTHSTFKNT 3333-------------------------------------------------------- EISFKLGVEFDETTADDRKVKSIVTLDGGKLVHLQKWDGQETTLVRELIDGKLILTLTHG -------------1111---------%%%%------------------%%%%-------- TAVCTRTYEKEA ------------ >OUTER SURFACE PROTEIN C; SWP:O31117; PDB:1G5ZA; PNLTEISKKITESNAVVLAVKEVETLLASIDELATKAIGKKIGNNGLEANQSKNTSLLSG %%%%--------------------------------------1111-------------- AYAISDLIAEKLNVLKNEELKEKIDTAKQCSTEFTNKLKSEHAVLGLDNLTDDNAQRAIL -----------1111-3333--------------------3333---------------1 KKHANKDKGAAELEKLFKAVENLSKAAQDTLKNAVKELTSPIVA 111-------------------------------1111------ >ADENINE-SPECIFIC METHYLTR; SWP:P23192; PDB:1G60A; MLEINKIHQMNCFDFLDQVENKSVQLAVIDPPYNLSKADWDSFDSHNEFLAFTYRWIDKV 
---------------11112222--------------1111------------------- LDKLDKDGSLYIFNTPFNCAFICQYLVSKGMIFQNWITWDKRDGMGSAKRRFSTGQETIL 33331111------3333--------1111------------------------------ FFSKSKNHTFNYDEVRVPYESTDRIKHASEKGILKNGKRWFPNPNGRLCGEVWHFSTPKP ----------3333-----3333----1111---iiii----1111-------------- RDLIERIIRASSNPNDLVLDCFMGSGTTAIVAKKLGRNFIGCDMNAEYVNQANFVLNQ ------------2222------!!!!----------------------------1111 >TRANSLATION INITIATION FA; SWP:Q60357; PDB:1G61A; MIIRKYFSGIPTIGVLALTTEEITLLPIFLDKDDVNEVSEVLETKCLQTNIGGSSLVGSL ------iiii-3333-----------1111--------------------iiii-3333- SVANKYGLLLPKIVEDEELDRIKNFLKENNLDLNVEIIKSKNTALGNLILTNDKGALISP ----------1111------------1111-------------3333-----------33 ELKDFKKDIEDSLNVEVEIGTIAELPTVGSNAVVTNKGCLTHPLVEDDELEFLKSLFKVE 331111---------------iiii-3333-----------1111--------------- YIGKGTANKGTTSVGACIIANSKGAVVGGDTTGPELLIIEDALGL --------------------1111---1111-------------- >RIBOSOME ANTI-ASSOCIATION; SWP:Q12522; PDB:1G62A; MATRTQFENSNEIGVFSKLTNTYCLVAVGGSENFYSAFEAELGDAIPIVHTTIAGTRIIG -----------3333----1111-------3333-------!!!!-------iiii---- RMTAGNRRGLLVPTQTTDQELQHLRNSLPDSVKIQRVEERLSALGNVICCNDYVALVHPD -----3333---1111------------3333----------3333-----------111 IDRETEELISDVLGVEVFRQTISGNILVGSYCSLSNQGGLVHPQTSVQDQEELSSLLQVP 1--------------------iiii-3333-----------1111--------------- LVAGTVNRGSSVVGAGMVVNDYLAVTGLDTTAPELSVIESIFRL --------------------------1111---------1111- >EPIDERMIN MODIFYING ENZYM; SWP:P30197; PDB:1G63A; MYGKLLICATASINVININHYIVELKQHFDEVNILFSPSSKNFINTDVLKLFCDNLYDEI -----------1111---------------------3333--------1111-----333 KDPLLNHINIVENHEYILVLPASANTINKIANGICDNLLTTVCLTGYQKLFIFPNMNIRM 3----------------------------1111------------3333-------3333 WGNPFLQKNIDLLKNNDVKVYSPDMNKNNITMPNIENVLNFVLN -------------------------------------------- >ACETYL XYLAN ESTERASE II; SWP:O59893; PDB:1G66A; SCPAIHVFGARETTASPGYGSSSTVVNGVLSAYPGSTAEAINYPACGGQSSCGGASYSSS -----------2222---!!!!----------2222------------3333-------- 
VAQGIAAVASAVNSFNSQCPSTKIVLVGYSQGGEIMDVALCGGGDPNQGYTNTAVQLSSS ------------------1111----------------------3333------------ AVNMVKAAIFMGDPMFRAGLSYEVGTCAAGGFDQRPAGFSCPSAAKIKSYCDASDPYCCN -------------------1111-------1111-2222-1111---------------- GSNAATHQGYGSEYGSQALAFVKSKLG --3333---3333-------------- >BETA-LACTAMASE PSE-4; SWP:P16897; PDB:1G6AA; SKFQQVEQDVKAIEVSLSARIGVSVLDTQNGEYWDYNGNQRFPLTSTFKTIACAKLLYDA 1111----------1111------------------1111-------------------1 EQGKVNPNSTVEIKKADLVTYSPVIEKQVGQAITLDDACFATMTTSDNTAANIILSAVGG 111--1111----3333----3333--2222----------------------------- PKGVTDFLRQIGDKETRLDRIEPDLNEGKLGDLRDTTTPKAIASTLNKFLFGSALSEMNQ ----------------------------2222---------------------------- KKLESWMVNNQVTGNLLRSVLPAGWNIADKSGAGGFGARSITAVVWSEHQAPIIVSIYLA ------1111--11113333-2222---------iiii---------------------- QTQASMEERNDAIVKIGHSIFDVYTS --------------------3333-- >ANTIFUNGAL PROTEIN; SWP:Q9RCK8; PDB:1G6EA; MINRTDCNENSYLEIHNNEGRDTLCFANAGTMPVAIYGVNWVESGNNVVTLQFQRNLSDP ------------------------------------------------------------ RLETITLQKWGSWNPGHIHEILSIRIY -------2222---------------- >PROTEIN KINASE RAD53; SWP:P22216; PDB:1G6GA; GENIVCRVICTTGQIPIRDLSADISQVLKEKRSIKKVWTFGRNPACDYHLGNISRLSNKH ----------------------33333333------------3333------1111---- FQILLGEDGNLLLNDISTNGTWLNGQKVEKNSNQLLSQGDEITVGVGVESDILSLVIFIN -----1111-------------iiii----------2222-------3333--------- DKFKQCL ------- >HIGH-AFFINITY BRANCHED-CH; SWP:Q58663; PDB:1G6HA; TMEILRTENIVKYFGEFKALDGVSISVNKGDVTLIIGPNGSGKSTLINVITGFLKADEGR -------------!!!!----------2222------2222-------1111-------- VYFENKDITNKEPAELYHYGIVRTFQTPQPLKEMTVLENLLIGEICPGESPLNSLFYKKW --iiii-22223333-1111-------3333----------11112222----1111--- IPKEEEMVEKAFKILEFLKLSHLYDRKAGELSGGQMKLVEIGRALMTNPKMIVMDEPIAG -------------------3333---3333-------------3333--------1111- VAPGLAHDIFNHVLELKAKGITFLIIEHRLDIVLNYIDHLYVMFNGQIIAEGRGEEEIKN ----------------1111----------1111---------%%%%------------- VLSDPKVVEIYIGE --------1111-- 
------------------------------------------------------------ -- >CAG-ALPHA; SWP:Q7BK04; PDB:1G6OA; LSAEDKKFLEVERALKEAALNPLRHATEELFGDFLKENITEICYNGNKVVWVLKNNGEWQ --------------------------------1111-----------------1111--- PFDVRDRKAFSLSRLHFARCCASFKKKTIDNYENPILSSNLANGERVQIVLSPVTVNDET ---1111---3333--------1111--------------1111------------1111 ISISIRIPSKTTYPHSFFEEQGFYNLLDNKEQAISAIKDGIAIGKNVIVCGGTGSGKTTY -------------3333------1111-3333-------------------2222----- IKSIEFIPKEERIISIEDTEEIVFKHHKNYTQLFFGGNITSADCLKSCLRRPDRIILGEL -------1111------------------------!!!!--------------------- RSSEAYDFYNVLCSGHKGTLTTLHAGSSEEAFIRLANSSSNSAARNIKFESLIEGFKDLI -----------1111------------3333------1111--1111-------3333-- DIVHINHHKQCDEFYIK -----1111-------- ------------------------------------------------------------ ------ >HNRNP arginine N-methyltr; SWP:P38074; PDB:1G6Q1; DYYFDSYDHYGIHEEMLQDTVRTLSYRNAIIQNKDLFKDKIVLDVGCGTGILSMFAAKHG --------3333--------------------3333----------!!!!---------- AKHVIGVDMSSIIEMAKELVELNGFSDKITLLRGKLEDVHLPFPKVDIIISEWMGYFLLY --------------------1111----------3333---------------------- ESMMDTVLYARDHYLVEGGLIFPDKCSIHLAGLEDSQYKDEKLNYWQDVYGFDYSPFVPL --------------------------------------------3333------------ VLHEPIVDTVERNNVNTTSDKLIEFDLNTVKISDLAFKSNFKLTAKRQDMINGIVTWFDI ----------3333-----------3333-3333-------------------------- VFPAPKGKRPVEFSTGPHAPYTHWKQTIFYFPDDLDAETGDTIEGELVCSPNEKNNRDLN ----2222-------1111--1111------------2222-------------1111-- IKISYKFESNGIDGNSRSRKNEGSYLMH -------------3333----------- >EPSP SYNTHASE; SWP:P07638; PDB:1G6SA; MESLTLQPIARVDGTINLPGSKSVSNRALLLAALAHGKTVLTNLLDSDDVRHMLNALTAL ---------------------3333--------------------------------111 GVSYTLSADRTRCEIIGNGGPLHAEGALELFLGNAGTAMRPLAAALCLGSNDIVLTGEPR 1-----1111--------------%%%%---------------1111----------333 MKERPIGHLVDALRLGGAKITYLEQENYPPLRLQGGFTGGNVDVDGSVSSQFLTALLMTA 3------------1111-------2222--------------------3333-------- PLAPEDTVIRIKGDLVSKPYIDITLNLMKTFGVEIENQHYQQFVVKGGQSYQSPGTYLVE 
----------------------------1111-----%%%%------------------- GDASSASYFLAAAAIKGGTVKVTGIGRNSMQGDIRFADVLEKMGATICWGDDYISCTRGE --3333-------------------1111------------------------------- LNAIDMDMNHIPDAAMTIATAALFAKGTTTLRNIYNWRVKETDRLFAMATELRKVGAEVE -------11113333------1111--------1111-----3333------1111---- EGHDYIRITPPEKLNFAEIATYNDHRMAMCFSLVALSDTPVTILDPKCTAKTFPDYFEQL --------------------%%%%----------------------------1111---- ARISQAA 1111--- ------------------------------------------------ >PANCREATIC TRYPSIN INHIBI; SWP:P00974; PDB:1G6XA; RPDFCLEPPYAGACRARIIRYFYNAKAGLCQTFVYGGCRAKRNNFKSAEDCLRTCGGA -3333-------------------3333------------------------------ >CLR4 PROTEIN; SWP:NA; PDB:1G6ZA; SPKQEEYEVERIVDEKLDRNGAVKLYRIRWLNYSSRSDTWEPPENLSGCSAVLAEWKRRK -----------------1111--------------------3333--------------- RRLKGSNS -------- >DNA PRIMASE; SWP:Q9P9H1; PDB:1G71A; MLMREVTKEERSEFYSKEWSAKKIPKFIVDTLESREFGFDHNGEGPSDRKNQYSDIRDLE -------------------3333-3333--3333--------------------3333-- DYIRATSPYAVYSSVAFYENPREMEGWRGAELVFDIDAKDLPLKRCNHEPGTVCPICLED ------------------------------------3333-------------------- AKELAKDTLIILREELGFENIHVVYSGRGYHIRILDEWALQLDSKSRERILAFISASEIE -----------------------------------3333--------------------- NVEEFRRFLLEKRGWFVLKHGYPRVFRLRLGYFILRVNVPHLLSIGIRRNIAKKILDHKE ------------3333----3333--------1111-3333-1111-------------- EIYEGFVRKAILASFPEGVGIESMAKLFALSTRFSKAYFDGRVTVDIKRILRLPSTLHSK ---------------2222--------------3333--3333---------2222---- VGLIATYVGTKEREVMKFNPFRHAVPKFRKKEVREAYKLWRESL ------------------1111---1111--------------- >INHIBITORS OF APOPTOSIS-L; SWP:Q9NR28; PDB:1G73A; AVPIAQKSEPHSLSSEALMRRAVSLVTDSTSTDLSQTTYALIEAITEYTKAVYTLTSLYR ------------3333-------------------------------------------- QYTSLLGKMNSEEEDEVWQVIIGARAEMTSKHQEYLKLETTWMTAVGLSEMAAEAAYQTG --1111------------------------------------------------------ ADQASITARNHIQLVKLQVEEVHQLSRKAETKLAEAQ ---------------------------------1111 >Baculoviral IAP repeat-co; SWP:P98170; PDB:1G73C; 
LPRNPSMADYEARIFTFGTWIYSVNKEQLARAGFYALGEGDKVKCFHCGGGLTDWKPSED ---3333------1111---------------------!!!!------------------ PWEQHAKWYPGCKYLLEQKGQEYINNIHL --------1111----------------- >ENDOPLASMIC RETICULUM PRO; SWP:P52555; PDB:1G7DA; PGCLPAYDALAGQFIEASSREARQAILKQGQDGLSGVKETDKKWASQYLKIMGKILDQGE --------------------------------3333---3333---------3333--33 DFPASELARISKLIENKMSEGKKEELQRSLNILTAFRKKGAEKEEL 3333333333------------------------------------ >ENDOPLASMIC RETICULUM PRO; SWP:P52555; PDB:1G7EA; LHTKGALPLDTVTFYKVIPKSKFVLVKFDTQYPYGEKQDEFKRLAENSASSDDLLVAEVG ----------1111--3333------------------3333-----3333--------- ISDYGDKLNMELSEKYKLDKESYPVFYLFRDGDFENPVPYSGAVKVGAIQRWLKGQGVYL ------3333-------------------------------------------1111--- GM -- >ADIPOCYTE LIPID-BINDING P; SWP:P04117; PDB:1G7NA; CDAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDLVTIRSESTFKNT 3333----------------------------------------!!!!------------ EISFKLGVEFDEETVDGRKVKSIITLDGGALVQVQKWDGKSTTIKRKRDGDKLVVECVMK ----2222-----1111---------iiii------iiii--------!!!!-------- GVTSTRVYERA ----------- >GLUTAREDOXIN 2; SWP:P39811; PDB:1G7OA; MKLYIYDHCPYCLKARMIFGLKNIPVELHVLLNDDAETPTRMVGQKQVPILQKDDSRYMP -----3333----------------------3333------------------------- ESMDIVHYVDKLDGKPLLTGKRSPAIEEWLRKVNGYANKLLLPRFAKSAFDEFSTPAARK ----------1111--------3333---------3333----------3333------- YFVDKKEASAGNFADLLAHSDGLIKNISDDLRALDKLIVKPNAVNGELSEDDIQLFPLLR -----3333---------------------------------1111---3333------- NLTLVAGINWPSRVADYRDNMAKQTQINLLSSMAI ----3333---------------------3333-- >TRANSLATION INITIATION FA; SWP:O26359; PDB:1G7RA; KIRSPIVSVLGTTLLDHIRGSAVASQHIGATEIPDVIEGICGDFLKKFSIRETLPGLFFI --------------------------2222------------3333---1111------- DTPGAFTTLRKRGGALADLAILIVDINEGFKPQTQEALNILRYRTPFVVAANKIDRIHGW -----3333-2222----------------3333------------------33332222 RVHEGRPFETFSKQDIQVQQKLDTKVYELVGKLHEEGFESERFDRVTDFASQVSIIPISA --------3333---------------------1111----1111--3333-------33 
ITGEGIPELLTLGLAQQYLREQLKIEEDSPARGTILEVKEETGLGTIDAVIYDGILRKDD 332222-------------3333---------------------------------1111 TIATSKDVISTRIRSLLKPRPLKFQKVDEVVAAAGIKIVAPGIDDVAGSPLRVVTDPEKV -------------------------------------------------------3333- REEILSEIEDIKIDTDEAGVVVKADTLGSLEAVVKILRDYVPIKVADIGDVSRRDVVNAG -------3333----------------------------------------3333----- IALQEDRVYGAIIAFNVKVIPSAAQELKNSDIKLFQGNVIYRLEEYEEWVRGIEEEKKKK -33333333----------3333---------------3333------------------ WEAIIKPASIRLIPKLVFRQSKPAIGGVEVLTGVIRQGYPLNDDGETVGTVESQDKGENL -----------------------------------2222--------------------- KSASRGQKVAAIKDAVYGKTIHEGDTLYVDIPENHYHILKEQLLTDEELDLDKIAEIKRK ---------------------2222-------------------3333------------ KN -- >TRANSLATION INITIATION FA; SWP:O26359; PDB:1G7SA; MKIRSPIVSVLGHVDHGKTTLLDHIRGSAVASRITQHIGATEIPMDVIEGICGDFLKKFS ------------2222-------------------------------------3333--- IRETLPGLFFIDTPGHEAFTTLRKRGGALADLAILIVDINEGFKPQTQEALNILRMYRTP 1111--------------3333---------------1111--3333------------- FVVAANKIDRIHGWRVHEGRPFMETFSKQDIQVQQKLDTKVYELVGKLHEEGFESERFDR ------33332222--2222-----1111-------------------1111----1111 VTDFASQVSIIPISAITGEGIPELLTMLMGLAQQYLREQLKIEEDSPARGTILEVKEETG --3333-----------2222---------------3333--1111-----------222 LGMTIDAVIYDGILRKDDTIAMMTSKDVISTRIRSLLKPRPLESRKKFQKVDEVVAAAGI 2-------------1111------------------------------------------ KIVAPGIDDVMAGSPLRVVTDPEKVREEILSEIEDIKIDTDEAGVVVKADTLGSLEAVVK ------11112222------------------1111------------------------ ILRDMYVPIKVADIGDVSRRDVVNAGIALQEDRVYGAIIAFNVKVIPSAAQELKNSDIKL --1111-----------3333------33333333---------------1111------ FQGNVIYRLMEEYEEWVRGIEEEKKKKWMEAIIKPASIRLIPKLVFRQSKPAIGGVEVLT ----------------------------1111--------2222---------------- GVIRQGYPLMNDDGETVGTVESMQDKGENLKSASRGQKVAMAIKDAVYGKTIHEGDTLYV ---2222---------------------------------------2222--2222---- DIPENHYHILKEQLLTDEELDLMDKIAEIKRKKNPD ------------------------------------ ----------- >IMMUNOGLOBULIN E; 
SWP:P01854; PDB:1G84A; SRDFTPPTVKILQSSSDGGGHFPPTIQLLCLVSGYTPGTINITWLEDGQVMDVDLSTAST ------------------------------------------------------------ TQEGELASTQSELTLSQKHWLSDRTYTCQVTYQGHTFEDSTKKSA --!!!!---------33331111---------------------- >ENDOCELLULASE 9G; SWP:P37700; PDB:1G87A; TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF -------------3333------1111--------11113333----------------- NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG -----------------------------------------------2222--------- KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA -------3333----------3333----------------------------------- KNLFDMADKAKSDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWG ------------11111111----------------------------------3333-- KEQQTDIIAYKWGQCWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSYTP -2222---------1111--------------------------------iiii----11 KGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQIDYALGSTGRSFVV 11-------------------------11113333----------------1111---22 GYGVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIA 22---------3333------1111-------2222-----1111----11113333--3 CDYNAGFTGALAKMYKHSGGDPIPNFKAIEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQ 333--------------------------------------------------------- TGWPARVTDKISFKYFMDLSEIVAAGIDPLSLVTSSYSEGKNTKVSGVLPWDVSNNVYYV ----------------------1111-1111--------1111---------1111---- NVDLTGENIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNIPVY ---2222-------1111---------2222---33331111------------------ DNGVKVFGNEP iiii------- >FLAGELLAR TRANSCRIPTIONAL; SWP:P11164; PDB:1G8EA; MHTSELLKHIYDINLSYLLLAQRLIVQDKASAMFRLGINEEMATTLAALTLPQMVKLAET ---------------------------------1111--------1111----------- NQLVCHFRFDSHQTITQLTQDSRVDDLQQIHTGIMLST ----------3333------------------------ >NEURONAL CALCIUM SENSOR 1; SWP:P36610; PDB:1G8IA; SNSKLKPEVVEELTRKTYFTEKEVQQWYKGFIKDCPSGQLDAAGFQKIYKQFFPFGDPTK -------------1111-----------------3333--------------3333---- 
FATFVFNVFDENKDGRIEFSEFIQALSVTSRGTLDEKLRWAFKLYDLDNDGYITRNEMLD ---------1111----3333------------------------1111----------- IVDAIYQMVLPEEENTPEKRVDRIFAMMDKNADGKLTLQEFQEGSKADPSIVQALSLYDG ----------3333--------------1111--------------------1111-iii LV i- >ARSENITE OXIDASE; SWP:Q7SIF4; PDB:1G8KA; NDRITLPPANAQRTNMTCHFCIVGCGYHVYKWPELEEGGRAPEQNALGLDFRKQLPPLAV -------1111---------3333--------1111----11113333-------2222- TLTPAMTNVVTEHDGARYDIMVVPDKACVVNSGLSSTRGGKMASYMYTPTGDGKERLSAP --3333-----1111---------1111--iiii---33333333--1111-1111---- RLYAADEWVDTTWDHAMALYAGLIKKTLDKDGPQGVFFSCFDHGGAGGGFENTWGTGKLM -------------------------------3333---------2222------------ FSAIQTPMVRIHNRPAYNSECHATREMGIGELNNAYEDAQLADVIWSIGNNPYESQTNYF ----------1111--------------------3333------------3333--3333 LNHWLPNLQGATTSKKKERFPNENFPQARIIFVDPRETPSVAIARHVAGNDRVLHLAIEP ---------1111-----------------------------------1111------22 GTDTALFNGLFTYVVEQGWIDKPFIEAHTKGFDDAVKTNRLSLDECSNITGVPVDMLKRA 22---------------------------------------------------------- AEWSYKPKASGQAPRTMHAYEKGIIWGNDNYVIQSALLDLVIATHNVGRRGTGCVRMGGH -------1111------------------------------1111---2222-------- QEGYTRPPYPGDKKIYIDQELIKGKGRIMTWWGCNNFQTSNNAQALREAILQRSAIVKQA ----------------------------------3333---------------------- MQKARGATTEEMVDVIYEATQNGGLFVTSINLYPTKLAEAAHLMLPAAHPGEMNLTSMNG 3333------------------------------3333---------------------- ERRIRLSEKFMDPPGTAMADCLIAARIANALRDMYQKDGKAEMAAQFEGFDWKTEEDAFN -------------!!!!------------------1111-33331111-----3333--- DGFRRAGQPGAPAIDSQGGSTGHLVTYDRLRKSGNNGVQLPVVSWDESKGLVGTEMLYTE -1111--2222----1111-3333---------3333--------------------111 GKFDTDDGKAHFKPAPWNGLPATVQQQKDKYRFWLNNGRNNEVWQTAYHDQYNSLMQERY 1---1111-------------------------------3333!!!!------------- PMAYIEMNPDDCKQLDVTGGDIVEVYNDFGSTFAMVYPVAEIKRGQTFMLFGYVNGIQGD -----------------2222-----3333--------33332222----------3333 VTTDWTDRDIIPYYKGTWGDIRKVGSMSEFKRTVSFKSRRFG ------1111--1111-------------------------- >Arsenite oxidase small su; 
SWP:Q7SIF3; PDB:1G8KB; RTTLAYPATAVSVAKNLAANEPVSFTYPDTSSPCVAVKLGAPVPGGVGPDDDIVAYSVLC ------------3333-2222-------1111----------2222-1111--------- THMGCPTSYDSSSKTFSCPCHFTEFDAEKAGQMICGEATADLPRVLLRYDAASDALTAVG ----------1111-----------1111---------------------1111------ VDGLIYGRQANVI ------------- >MOLYBDOPTERIN BIOSYNTHESI; SWP:P12281; PDB:1G8LA; LMSLDTALNEMLSRVTPLTAQETLPLVQCFGRILASDVVSPLDVPGFDNSAMDGYAVRLA -----------1111---------33332222-------------------------333 DIASGQPLPVAGKSFAGQPYHGEWPAGTCIRIMTGAPVPEGCEAVVMQEQTEQMDNGVRF 33333---------------------------2222--2222----3333---1111--- TAEVRSGQNIRRRGEDISAGAVVFPAGTRLTTAELPVIASLGIAEVPVIRKVRVALFSTG ----2222---2222--2222---2222--3333----1111-----------------3 DELQLPGQPLGDGQIYDTNRLAVHLMLEQLGCEVINLGIIRDDPHALRAAFIEADSQADV 333------------------------1111----------------------------- VISSGGVSVGEADYTKTILEELGEIAFWKLAIKPGKPFAFGKLSNSWFCGLPGNPVSATL ------------------------------------------------------------ TFYQLVQPLLAKLSGNTASGLPARQRVRTASRLKKTPGRLDFQRGVLQRNADGELEVTTT -----------3333----------------------------------1111------- GHQGSHIFSSFSLGNCFIVLERDRGNVEVGEWVEVEPFNALFG --------3333--------3333---2222-------3333- >AICAR TRANSFORMYLASE-IMP ; SWP:P31335; PDB:1G8MA; RQQLALLSVSEKAGLVEFARSLNALGLGLIASGGTATALRDAGLPVRDVSDLTGFPELGG -----------2222-------1111-------------1111----3333------iii RVKTLHPAVHAGILARNIPEDNADNKQDFSLVRVVVCNLYPFVKTVSSPGVTVPEAVEKI i----3333---------------1111----------------------------1111 DIGGVALLRAAAKNHARVTVVCDPADYSSVAKEAASKDKDTSVETRRHLALKAFTHTAQY -------------3333-----3333---------1111--------------------- DAAISDYFRKEYSKGVSQLPLRYGNPHQSPAQLYTTRPKLPLTVVNGSPGFINLCDALNA ------------2222--------1111-------------------------------- WQLVKELKQALGIPAAASFKHVSPAGAAVGIPLSEEEAQVCVHDLHKTLTPLASAYARSR -------------------%%%%------------------33331111----------- GADRSSFGDFIALSDICDVPTAKIISREVSDGVVAPGYEEEALKILSKKKNGGYCVLQDP ------------------------1111----------------3333%%%%------11 
NYEPDDNEIRTLYGLQLQKRNNAVIDRSLFKNIVTKNKTLPESAVRDLIVASIAVKYTQS 11---------iiii----------3333-----------3333----------1111-- NSVCYAKDGQVIGIGAGQQSRIHCTRLAGDKANSWWLRHHPRVLSKFKAGVKRAEVSNAI ------%%%%-----------------------------3333----------------- DQYVTGTIGEDEDLVKWQAFEEVPAQLTEAEKKQWIAKLTAVSLSSDAFFPFRDNVDRAK -----------------------------------1111--------------------- RIGVQFIVAPSGSAADEVVIEACNELGITLIHTNLRLFHH ---------------------------------------- >MAGNESIUM-CHELATASE 38 KD; SWP:P26239; PDB:1G8PA; RPVFPFSAIVGQEDMKLALLLTAVDPGIGGVLVFGDRGTGKSTAVRALAALLPEIEAVEG ----3333----------------3333-----------1111-----1111-----111 CPVSSPNVEMIPDWATVLSTNVIRKPTPVVDLPLGVSEDRVVGALDIERAISKGEKAFEP 1-----3333-3333-----------------22223333--------------1111-- GLLARANRGYLYIDECNLLEDHIVDLLLDVAQSGENVVERDGLSIRHPARFVLVGSGNPE 3333-2222-----1111-3333----------------%%%%----------------- EGDLRPQLLDRFGLSVEVLSPRDVETRVEVIRRRDTYDADPKAFLEEWRPKDMDIRNQIL ----33331111------------------------------------------------ EARERLPKVEAPNTALYDCAALCIALGSDGLRGELTLLRSARALAALEGATAVGRDHLKR -----1111--3333------------------------------1111----------- VATMALSHRLRVARTVEETLP -----1111------------ >CD81 ANTIGEN, EXTRACELLUL; SWP:P18582; PDB:1G8QA; FVNKDQIAKDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKN ------------------------1111-----------1111---11111111------ NLCPSGSNIISNLFKEDCHQKIDDLFSGKH -------33331111--------------- >FIBRILLARIN-LIKE PRE-RRNA; SWP:Q58108; PDB:1G8SA; MEDIKIKEIFENIYEVDLGDGLKRIATKSIVKGKKVYDEKIIKIGDEEYRIWNPNKSKLA ------------------------------2222---------!!!!-----1111---- AAIIKGLKVMPIKRDSKILYLGASAGTTPSHVADIADKGIVYAIEYAPRIMRELLDACAE --1111------1111-------------------1111------------------222 RENIIPILGDANKPQEYANIVEKVDVIYEDVAQPNQAEILIKNAKWFLKKGGYGMIAIKA 2--------1111-1111--------------1111------------2222------33 RSIDVTKDPKEIFKEQKEILEAGGFKIVDEVDIEPFEKDHVMFVGIWEGK 33------------------------------------------------ >LEUCOAGGLUTINATING PHYTOH; SWP:P05087; PDB:1G8WA; 
SNDIYFNFQRFNETNLILQRDASVSSSGQLRLTNLNGNGEPRVGSLGRAFYSAPIQIWDN -----------3333---------1111-------------------------------- TTGTVASFATSFTFNIQVPNNAGPADGLAFALVPVGSQPKDKGGFLGLFDGSNSNFHTVA ---------------------------------1111----!!!!--------------- VEFDTLYNKDWDPTERHIGIDVNSIRSIKTTRWDFVNGENAEVLITYDSSTNLLVASLVY -------3333------------------------2222---------1111-------- PSQKTSFIVSDTVDLKSVLPEWVSVGFSATTGINKGNVETNDVLSWSFASKLS 1111---------3333------------------------------------ >MYELOID PROGENITOR INHIBI; SWP:P55773; PDB:1G91A; MDRFHATSADCCISYTPRSIPCSLLESYFETNSECSKPGVIFLTKKGRRFCANPSDKQVQ --------------------3333-------3333------------------------- VCMRMLKLDTRIKTRKN -3333------------ >ALPHA-AMYLASE; SWP:P29957; PDB:1G94A; TPTTFVHLFEWNWQDVAQECEQYLGPKGYAAVQVSPPNEHITGSQWWTRYQPVSYELQSR -------2222-------------1111----------------1111----------33 GGNRAQFIDMVNRCSAAGVDIYVDTLINHMAAGSGTGTAGNSFGNKSFPIYSPQDFHESC 33------------1111------------------1111---%%%%11111111----- TINNSDYGNDRYRVQNCELVGLADLDTASNYVQNTIAAYINDLQAIGVKGFRFDASKHVA --3333------------iiii---1111--------------3333-------3333-3 ASDIQSLMAKVNGSPVVFQEVIDQGGEAVGASEYLSTGLVTEFKYSTELGNTFRNGSLAW 333----1111------------------33333333----------------------- LSNFGEGWGFMPSSSAVVFVDNHDNQRGHGGAGNVITFEDGRLYDLANVFMLAYPYGYPK ----3333---1111--------1111----3333-3333-------------------- VMSSYDFHGDTDAGGPNVPVHNNGNLECFASNWKCEHRWSYIAGGVDFRNNTADNWAVTN ------iiii-----------iiii---------3333-------------1111----- WWDNTNNQISFGRGSSGHMAINKEDSTLTATVQTDMASGQYCNVLKGELSADAKSCSGEV ------------!!!!--------------------------3333---1111------- ITVNSDGTINLNIGAWDAMAIHKNAKLN ---1111--------------1111--- >ACETATE KINASE; SWP:P38502; PDB:1G99A; MKVLVINAGSSSLKYQLIDMTNESALAVGLCERIGIDNSIITQKKFDGKKLEKLTDLPTH --------------------------------------------1111------------ KDALEEVVKALTDDEFGVIKDMGEINAVGHRVVHGGEKFTTSALYDEGVEKAIKDCFELA --------------------3333-----------3333----------------33331 PLHNPPNMMGISACAEIMPGTPMVIVFDTAFHQTMPPYAYMYALPYDLYEKHGVRKYGFH 
111--------------1111-------3333---1111-----3333------------ GTSHKYVAERAALMLGKPAEETKIITCHLGNGSSITAVEGGKSVETSMGFTPLEGLAMGT ------------1111-3333-----------------iiii------------------ RCGSIDPAIVPFLMEKEGLTTREIDTLMNKKSGVLGVSGLSNDFRDLDEAASKGNRKAEL --------------1111------------------------------------------ ALEIFAYKVKKFIGEYSAVLNGADAVVFTAGIGENSASIRKRILTGLDGIGIKIDDEKNK -----------------1111-------------------------3333-----3333- IRGQEIDISTPDAKVRVFVIPTNEELAIARETKEIVET ---------1111------------------------- >H14; SWP:A2KD53; PDB:1G9EA; QVQLQESGGGLVQAGGSLRLSCAASGRTGSTYDMGWFRQAPGKERESVAAINWDSARTYY ------------2222-----------------------2222----------------- ASSVRGRFTISRDNAKKTVYLQMNSLKPEDTAVYTCGAGEGGTWDSWGQGTQVTVSS ---------------------------------------iiii-------------- >LECTIN; SWP:P05046; PDB:1G9FA; AETVSFSWNKFVPKQPNMILQGDAIVTSSGKLQLNKVDTPKPSSLGRALYSTPIHIWDKE -----------2222-----------1111------------------------------ TGSVASFAASFNFTFYAPDTKRLADGLAFFLAPIDTKPQTHAGYLGLFNENESGDQVVAV --------------------------------1111----!!!!----2222-------- EFDTFRNSWDPPNPHIGINVNSIRSIKTTSWDLANNKVAKVLITYDASTSLLVASLVYPS ------1111-----------------------2222---------1111--------11 QRTSNILSDVVDLKTSLPEWVRIGFSAATGLDIPGESHDVLSWSFASNLPHLDLTSFVLH 11---------3333-------------------------------------3333---- E - >CELLULASE CEL48F; SWP:P37698; PDB:1G9GA; ASSPANKVYQDRFESMYSKIKDPANGYFSEQGIPYHSIETLMVEAPDYGHVTTSEAMSYY ---------------------3333---1111---------------1111--------- MWLEAMHGRFSGDFTGFDKSWSVTEQYLIPTEKDQPNTSMSRYDANKPATYAPEFQDPSK ------------------------------3333-333311111111---------1111 YPSPLDTSQPVGRDPINSQLTSAYGTSMLYGMHWILDVDNWYGFGARADGTSKPSYINTF -----1111------3333--------------------3333--%%%%----------- QRGEQESTWETIPQPCWDEHKFGGQYGFLDLFTKDTGTPAKQFKYTNAPDADARAVQATY --11111111-----------------1111----------------3333--------- WADQWAKEQGKSVSTSVGKATKMGDYLRYSFFDKYFRKIGQPSQAGTGYDAAHYLLSWYY ------1111----------------------1111-2222-------1111-------- 
AWGGGIDSTWSWIIGSSHNHFGYQNPFAAWVLSTDANFKPKSSNGASDWAKSLDRQLEFY -------------------3333-----------3333---1111--------------- QWLQSAEGAIAGGATNSWNGRYEAVPSGTSTFYGMGYVENPVYADPGSNTWFGMQVWSMQ 11113333--------2222-----2222--iiii---------------3333------ RVAELYYKTGDARAKKLLDKWAKWINGEIKFNADGTFQIPSTIDWEGQPDTWNPTQGYTG -----------------------3333----1111-----------------1111---- NANLHVKVVNYGTDLGCASSLANTLTYYAAKSGDETSRQNAQKLLDAMWNNYSDSKGIST 1111-------------------------------------------------1111--- VEQRGDYHRFLDQEVFVPAGWTGKMPNGDVIKSGVKFIDIRSKYKQDPEWQTMVAALQAG ---11113333------2222---1111---22223333-------1111------1111 QVPTQRLHRFWAQSEFAVANGVYAILFPD ----------------------------- >SERRALYSIN; SWP:O69771; PDB:1G9KA; GTSSAFTQIDNFSHFYDRGDHLVNGKPSFTVDQVADQLTRSGASWHDLNNDGVINLTYTF -------------1111-----iiii----------1111------1111---------- LTAPPVGYASRGLGTFSQFSALQKEQAKLSLESWADVAKVTFTEGPAARDDGHMTFANFS ----22221111------------------------------------------------ ASNGGAAFAYLPNSSRKGESWYLINKDYQVNKTPGEGNYGRQTLTHEIGHTLGLSHPGDY 3333------1111-2222-----11111111--2222-----------1111------- NPTYRDAVYAEDTRAYSVMSYWSEKNTGQVFTKTGEGAYASAPLLDDIAAVQKLYGANLE --3333--1111----1111--3333------2222---------------------333 TRADDTVYGFNSTADRDFYSATSSTDKLIFSVWDGGGNDTLDFSGFSQNQKINLTAGSFS 3--------------1111---1111----------------1111--------2222-- DVGGMTGNVSIAQGVTIENAIGGSGNDLLIGNDAANVLKGGAGNDIIYGGGGADVLWGGT -iiii------2222--------------------------------------------- GSDTFVFGAVSDSTPKAADIIKDFQSGFDKIDLTAITKLGGLNFVDAFTGHAGDAIVSYH --------3333-3333-------2222----3333%%%%----------2222------ QASNAGSLQVDFSGQGVADFLVTTVGQVATYDIVA ----------------------------1111--- >POLYADENYLATE-BINDING PRO; SWP:P11940; PDB:1G9LA; GPLGSAAAATPAVRTVPQYKYAAGVRNPQQHLNAQPQVTMQQPAVHVQGQEPLTASMLAS ------------------------------------------------------------ APPQEQKQMLGERLFPLIQAMHPTLAGKITGMLLEIDNSELLHMLESPESLRSKVDEAVA 1111---3333--33333333-------3333----------1111--3333-------- VLQAHQAKEAAQKAVNSATGVPTV --1111------------------ >IGKV1-5 protein; 
SWP:Q6GMW0; PDB:1G9ML; ELELTQSPATLSVSPGERATLSCRASESVSSDLAWYQQKPGQAPRLLIYGASTRATGVPA -------------2222---------------------2222------------222233 RFSGSGSGAEFTLTISSLQSEDFAVYYCQQYNNWPPRYTFGQGTRLEIKRTVAAPSVFIF 33----------------1111-------------------------------------- PPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQKSKDSTYSLSST --33331111---------------------%%%%------------------------- LTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRG ---------------------------------- >NHE-RF; SWP:O14745; PDB:1G9OA; RMLPRLCCLEKGPNGYGFHLHGEKGKLGQYIRLVEPGSPAEKAGLLAGDRLVEVNGENVE -----------1111-------2222--------2222--1111-2222----iiii-11 KETHQQVVSRIRAALNAVRLLVVDPETDEQL 11--------1111----------------- >OMEGA-ATRACOTOXIN-HV2A; SWP:P82852; PDB:1G9PA; LLACLFGNGRCSSNRDCCELTPVCKRGSCVSSGPGLVGGILGGIL ------------3333-3333---%%%%----------------- >HYPOXANTHINE PHOSPHORIBOS; SWP:P36766; PDB:1G9TA; MKHTVEVMIPEAEIKARIAELGRQITERYKDSGSDMVLVGLLRGSFMFMADLCREVQVSH ---------3333---------------1111----------1111-----3333----- EVDFMTASRDLKILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILSLREPKSLAICTLL --------------------2222------------------------------------ DKPSRREVNVPVEFIGFSIPDEFVVGYGIDYAQRYRHLPYIGKVILL -1111-------------------------%%%%1111--------- >INTERLEUKIN-13; SWP:P35225; PDB:1GA3A; GGPVPPSTALRELIEELVNITQNQKAPLCNGSMVWSINLTAGMYCAALESLINVSGCSAI -----3333----------------------------3333------------------3 EKTQRMLSGFCPHKVSAGQFSSLHVRDTKIEVAQFVKDLLLHLKKLFREGRFN 333-------------------------------------------------- >SERINE-CARBOXYL PROTEINAS; SWP:P42790; PDB:1GA6A; AGTAKGHNPTEFPTIYDASSAPTAANTTVGIITIGGVSQTLQDLQQFTSANGLASVNTQT -------3333------1111--1111--------------------------------- IQTGSSNGDYSDDQQGQGEWDLDSQSIVGSAGGAVQQLLFYMADQSASGNTGLTQAFNQA ----1111----3333---------------------------1111!!!!--------- VSDNVAKVINVSLGWCEADANADGTLQAEDRIFATAAAQGQTFSVSSGDEGVYECNNRGY ------------------------------------1111---------!!!!------- PDGSTYSVSWPASSPNVIAVGGTTLYTTSAGAYSNETVWNEGLDSNGKLWATGGGYSVYE 
-!!!!-----1111-------------1111------------1111------------- SKPSWQSVVSGTPGRRLLPDISFDAAQGTGALIYNYGQLQQIGGTSLASPIFVGLWARLQ --3333-------------------3333-----iiii-----3333------------- SANSNSLGFPAASFYSAISSTPSLVHDVKSGNNGYGGYGYNAGTGWDYPTGWGSLDIAKL --%%%%-----------11111111---------iiii-----------!!!!------- SAYIRSNGF --------- >GALACTOSYL TRANSFERASE LG; SWP:P96945; PDB:1GA8A; DIVFAADDNYAAYLCVAAKSVEAAHPDTEIRFHVLDAGISEANRAAVAANLRGGGGNIRF ------3333----------------------------------------2222------ IDVNPEDFAGFPLNIRHISITTYARLKLGEYIADCDKVLYLDIDVLVRDSLTPLWDTDLG ---33331111---1111--------3333----------------------------!! DNWLGASIDLFVERQEGYKQKIGADGEYYFNAGVLLINLKKWRRHDIFKSSEWVEQYKDV !!-------3333-22223333-1111--------------11113333-------1111 QYQDQDILNGLFKGGVCYANSRFNFPTNYAFASRHTDPLYRDRTNTVPVAVSHYCGPAKP --3333-----2222-----1111-3333-----------------------------11 WHRDCTAWGAERFTELAGSLTTVPEEWRGKL 11----2222------1111---1111---- >D-GLYCERALDEHYDE-3-PHOSPH; SWP:P06977; PDB:1GADO; TIKVGINGFGRIGRIVFRAAQKRSDIEIVAINDLLDADYMAYMLKYDSTHGRFDGTVEVK ------------------3333-------------------------------------% DGHLIVNGKKIRVTAERDPANLKWDEVGVDVVAEATGLFLTDETARKHITAGAKKVVMTG %%%--iiii--------3333-3333--------------3333---------------- PSKDNTPMFVKGANFDKYAGQDIVSNASCTTNCLAPLAKVINDNFGIIEGLMTTVHATTA ---------22223333-----------------------------------------33 TQKTVDGPSHKDWRGGRGASQNIIPSSTGAAKAVGKVLPELNGKLTGMAFRVPTPNVSVV 33------33333333-3333-------33333333-3333------------------- DLTVRLEKAATYEQIKAAVKAAAEGEMKGVLGYTEDDVVSTDFNGEVCTSVFDAKAGIAL ------------------------1111----------33332222------3333---- NDNFVKLVSWYDNETGYSNKVLDLIAHISK -------------------------1111- >CHIMERIC 48G7 FAB; SWP:GC1_HUMAN; PDB:1GAFH; QVQLQQSGAELVKPGASVKLSCTASGFNIKDTYMHWVKQRPKQGLEWIGRIDPANVDTKY ------------2222-----------3333--------2222----------------- DPKFQDKATITADTSSKTTYLQLSSLTSEDTAVYYCASYYGIYWGQGTTLTVSSASTKGP 3333---------1111---------3333-------2222------------------- 
SVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLS ----------------------------------%%%%--2222-------1111----- SVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -----3333------------1111------------ >GLUCOAMYLASE-471; SWP:P22832; PDB:1GAI; ATLDSWLSNEATVARTAILNNIGADGAWVSGADSGIVVASPSTDNPDYFYTWTRDSGLVI -------------------------1111---2222------------------------ KTLVDLFRNGDTDLLSTIEHYISSQAIIQGVSNPSGDLSSGGLGEPKFNVDETAYTGSWG ------11113333------------3333--111111113333----1111-------- RPQRDGPALRATAMIGFGQWLLDNGYTSAATEIVWPLVRNDLSYVAQYWNQTGYDLWEEV --3333---------------1111----------------------1111---1111-- NGSSFFTIAVQHRALVEGSAFATAVGSSCSWCDSQAPQILCYLQSFWTGSYILANFDSSR ----------------------1111-----------------1111------------- SGKDTNTLLGSIHTFDPEAGCDDSTFQPCSPRALANHKEVVDSFRSIYTLNDGLSDSEAV ---------------1111---11111111-----------1111--3333---1111-- AVGRYPEDSYYNGNPWFLCTLAAAEQLYDALYQWDKQGSLEITDVSLDFFKALYSGAATG ----11112222------------------------------3333-------1111--- TYSSSSSTYSSIVSAVKTFADGFVSIVETHAASNGSLSEQFDKSDGDELSARDLTWSYAA --1111---------------------11111111------------------------- LLTANNRRNSVVPPSWGETSASSVPGTCAATSASGTYSSVTVTSWPSIVATG ------1111------3333-------------------------------- >FERTILIZATION PROTEIN; SWP:Q25063; PDB:1GAKA; FDDVVVSRQEQSYVQRGMVNFLDEEMHKLVKRFRDMRWNLGPGFVFLLKKVNRERMMRYC ------------------------------------------------------------ MDYARYSKKILQLKHLPVNKKTLTKMGRFVGYRNYGVIRELYADVFRDVQGFRGPKMTAA ------------------------------------------------------------ MRKYSSKDPGTFPCKNE -------1111------ >Ferredoxin-1, chloroplast; SWP:P27787; PDB:1GAQB; ATYNVKLITPEGEVELQVPDDVYILDQAEEDGIDLPYSCRAGSCSSCAGKVVSGSVDQSD -------------------------3333---------------1111------------ QSYLDDGQIADGWVLTCHAYPTSDVVIETHKEEELTGA --------------3333-------------------- >Coat protein; SWP:P07234; PDB:1GAVJ; ATLRSFVLVDNGGTGNVTVVPVSNANGVAEWLSNNSRSQAYRVTASYRASGADKRKYTIK ------------------------%%%%-------3333--------------------- LEVPKIVTQVVNGVELPVSAWKAYASIDLTIPIFAATDDVTVISKSLAGLFKVGNPIAEA 
----------iiii-----------------1111------------------------- ISSQSGFYA --------- >FERREDOXIN-NADP+ REDUCTAS; SWP:Q9SLP6; PDB:1GAWA; PATAKAKKESKKQEEGVVTNLYKPKEPYVGRCLLNTKITGDDAPGETWHMVFSTEGKIPY -------------2222-----3333-------------1111----------iiii--- REGQSIGVIADGVDKNGKPHKVRLYSIASSAIGDFGDSKTVSLCVKRLIYTNDAGEIVKG 2222---------1111------------3333--------------------------- VCSNFLCDLQPGDNVQITGPVGKEMLMPKDPNATIIMLATGTGIAPFRSFLWKMFFEKHD -----11112222---------1111---1111-------------------------11 DYKFNGLGWLFLGVPTSSSLLYKEEFGKMKERAPENFRVDYAVSREQTNAAGERMYIQTR 11-------------11112222---------1111------1111--1111-------- MAEYKEELWELLKKDNTYVYMCGLKGMEKGIDDIMVSLAEKDGIDWFDYKKQLKRGDQWN -------------1111--------3333------------------------1111--- VEVY ---- >(1,3-1,4)-BETA-D-GLUCAN 4; SWP:P27051; PDB:1GBG; QTGGSFYEPFNNYNTGLWQKADGYSNGNMFNCTWRANNVSMTSLGEMRLSLTSPSYNKFD -------------1111---------!!!!----3333---1111---------2222-- CGENRSVQTYGYGLYEVNMKPAKNVGIVSSFFTYTGPTDGTPWDEIDIEFLGKDTTKVQF -----------------------------------3333-----------3333------ NYYTNGVGNHEKIVNLGFDAANSYHTYAFDWQPNSIKWYVDGQLKHTATTQIPQTPGKIM ---iiii-----------3333---------1111----iiii----------------- MNLWNGAGVDEWLGSYNGVTPLYAHYNWVRYTKR --------3333---------------------- >GRB2; SWP:P29354; PDB:1GBQA; MEAIAKYDFKATADDELSFKRGDILKVLNEECDQNWYKAELNGKDGFIPKNYIEMKP ------------1111------------------------%%%%----1111----- >AUSTRALIAN BLACK SWAN EGG; SWP:P00717; PDB:1GBS; RTDCYGNVNRIDTTGASCKTAKPEGLSYCGVPASKTIAERDLKAMDRYKTIIKKVGEKLC --11113333------33333333------------------3333-------------- VEPAVIAGIISRESHAGKVLKNGWGDRGNGFGLMQVDKRSHKPQGTWNGEVHITQGTTIL -3333--------%%%%---iiii1111---1111-1111-------------------- TDFIKRIQKKFPSWTKDQQLKGGISAYNAGAGNVRSYARMDIGTTHDDYANDVVARAQYY ----------3333---------------3333---1111---2222------------1 KQHGY 111-- >METHIONINE GAMMA-LYASE; SWP:P13254; PDB:1GC0A; LPGFATRAIHHGYDPQDHGGALVPPVYQTATFTFPSNPTLNLLEARMASLEGGEAGLALA ---------22223333iiii--------------------------------------- 
SGMGAITSTLWTLLRPGDEVLLGNTLYGCTFAFLHHGIGEFGVKLRHVDMADLQALEAAM --------------2222------------------3333--------1111----1111 TPATRVIYFESPANPNMHMADIAGVAKIARKHGATVVVDNTYCTPYLQRPLELGADLVVH 1111-----------------------3333-----------------3333-------- SATYLSGHGDITAGIVVGSQALVDRIRLQGLKDMTGAVLSPHDAALLMRGIKTLNLRMDR ---3333----------------------------------------------------- HCANAQVLAEFLARQPQVELIHYPQPGGMIAFELKGGIGAGRRFMNALQLFSRAVSLGDA ----------33331111-------!!!!----1111----------------------- ESLAQHPASMTHSSYTPEERAHYGISEGLVRLSVGLEDIDDLLADVQQALKASA -----33331111-------1111-1111--------3333------------- >ADP-DEPENDENT GLUCOKINASE; SWP:Q7M537; PDB:1GC5A; MKESLKDRIRLWKRLYVNAFENALNAIPNVKGVLLAYNTNIDAIKYLDADDLEKRVTEKG -------------------------3333------------------------------- KEKVFEIIENPPEKISSIEELLGGILRSIKLGKAMEWFVESEEVRRYLREWGWDELRIGG -3333-3333-------------------------------------------------3 QAGIMANLLGGVYRIPTIVHVPQNPKLQAELFVDGPIYVPVFEGNKLKLVHPKDAIAEEE 333---------------------33331111------------------3333------ ELIHYIYEFPRGFQVFDVQAPRENRFIANADDYNARVYMRREFREGFEEITRNVELAIIS --------------------------------3333---3333---33331111------ GLQVLKEYYPDGTTYKDVLDRVESHLNILNRYNVKSHFEFAYTANRRVREALVELLPKFT 3333----1111-3333------------1111---------------------3333-- SVGLNEVELASIMEIIGDEELAKEVLEGHIFSVIDAMNVLMDETGIERIHFHTYGYYLAL ----------------------------------------------------2222---- TQYRGEEVRDALLFASLAAAAKAMKGNLERIEQIRDALSVPTNERAIVLEEELEKEFTEF ----3333---------------------3333---1111-------------------- ENGLIDMVDRQLAFVPTKIVASPKSTVGIGDTISSSAFVSEFGMRKR 2222------------------------------------------- >GLUCOSE/GALACTOSE-BINDING; SWP:P23905; PDB:1GCA; ADTRIGVTIYKYDDNFMSVVRKAIEKDGKSAPDVQLLMNDSQNDQSKQNDQIDVLLAKGV ----------1111----------------1111---------------------1111- KALAINLVDPAAAGTVIEKARGQNVPVVFFNKEPSRKALDSYDKAYYVGTDSKESGVIQG --------1111--------1111-----------------1111-----3333------ DLIAKHWQANQGWDLNKDGKIQYVLLKGEPGHPDAEARTTYVVKELNDKGIQTEQLALDT 
---------11111111-----------2222---------------------------- AMWDTAQAKDKMDAWLSGPNANKIEVVIANNDAMAMGAVEALKAHNKSSIPVFGVDALPE %%%%-------------1111---------------------11111111---------- ALALVKSGAMAGTVLNDANNQAKATFDLAKNLAEGKGAADGTSWKIENKIVRVPYVGVDK -------------------------------1111-1111------%%%%--------33 DNLSEFTQK 331111--- >SUBTILISIN; SWP:P29600; PDB:1GCI; AQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASFVPGEPSTQDGN ----3333--------1111--2222-----------1111--------2222------- GHGTHVAGTIAALNNSIGVLGVAPSAELYAVKVLGASGSGSVSSIAQGLEWAGNNGMHVA ----------------------1111--------1111---------------------- NLSLGSPSPSATLEQAVNSATSRGVLVVAASGNSGAGSISYPARYANAMAVGATDQNNNR --------------------1111-----------------3333---------1111-- ASFSQYGAGLDIVAPGVNVQSTYPGSTYASLNGTSMATPHVAGAAALVKQKNPSWSNVQI 1111--2222----------------------3333---------------1111----- RNHLKNTATSLGSTNLYGSGLVNAEAATR ------------3333!!!!--3333--- ------------------------------- >GCN4P-II; SWP:P03069; PDB:1GCMA; RMKQIEDKIEEILSKIYHIENEIARIKKLIG -------------------------3333-- >GLUCAGON; SWP:P01274; PDB:1GCN; HSQGTFTSDYSKYLDSRRAQDFVQWLMNT ----3333---1111--------1111-- >GROWTH FACTOR RECEPTOR-BO; SWP:P29354; PDB:1GCQA; STYVQALFDFDPQEDGELGFRRGDFIHVMDNSDPNWWKGACHGQTGMFPRNYVTPV -------------2222---2222----------------%%%%----1111---- >Proto-oncogene vav; SWP:P27870; PDB:1GCQC; GSHMPKMEVFQEYYGIPPPPGAFGPFLRLNPGDIVELTKAEAEHNWWEGRNTATNEVGWF ------------------2222-------2222-------3333---------------- PCNRVHPYV 1111----- >HEMOGLOBIN; SWP:Q9YGW2; PDB:1GCVA; AFTACEKQTIGKIAQVLAKSPEAYGAECLARLFVTHPGSKSYFEYKDYSAAGAKVQVHGG ---------------------------------------1111-----1111-------- KVIRAVVKAAEHVDDLHSHLETLALTHGKKLLVDPQNFPMLSECIIVTLATHLTEFSPDT -----------33333333--------------3333----------------------- HCAVDKLLSAICQELSSRYR -------------1111--- >Hemoglobin subunit beta; SWP:Q9YGW1; PDB:1GCVB; VHWTQEERDEISKTFQGTDMKTVVTQALDRMFKVYPWTNRYFQKRTDFRSSIHAGIVVGA ------------------------------------3333-3333--------------- 
LQDAVKHMDDVKTLFKDLSKKHADDLHVDPGSFHLLTDCIIVELAYLRKDCFTPHIQGIW ------11113333--------------3333---------------!!!!--------- DKFFEVVIDAISKQYH -----------1111- >GLUCAN 1,4-ALPHA-MALTOTET; SWP:P13507; PDB:1GCYA; DQAGKSPNAVRYHGGDEIILQGFHWNVVREAPNDWYNILRQQAATIAADGFSAIWMPVPW -----1111--------------1111--------------------------------- RDFSSWSKSGGGEGYFWHDFNKNGRYGSDAQLRQAASALGGAGVKVLYDVVPNHMNRGYP -----------------------1111------------1111------------1111- DKEINLPAGQGFWRNDCADPGNYPNDCDDGDRFIGGDADLNTGHPQVYGMFRDEFTNLRS ------------1111--------1111----!!!!----1111---------------- QYGAGGFRFDFVRGYAPERVNSWMTDSADNSFCVGELWKGPSEYPNWDWRNTASWQQIIK ----------3333-3333--------1111--------3333-1111------------ DWSDRAKCPVFDFALKERMQNGSIADWKHGLNGNPDPRWREVAVTFVDNHDTGYSPGQNG ----------------------3333---3333--33331111-------------2222 GQHHWALQDGLIRQAYAYILTSPGTPVVYWDHMYDWGYGDFIRQLIQVRRAAGVRADSAI -------1111-------1111--------------------------------1111-- SFHSGYSGLVATVSGSQQTLVVALNSDLGNPGQVASGSFSEAVNASNGQVRVWRS -----------------------------3333------------iiii------ >MACROPHAGE MIGRATION INHI; SWP:P14174; PDB:1GD0A; PMFIVNTNVPRASVPDGFLSELTQQLAQATGKPPQYIAVHVVPDQLMAFGGSSEPCALCS ---------3333-2222--------------3333------------iiii-------- LHSIGKIGGAQNRSYSKLLCGLLAERLRISPDRVYINYYDMNAANVGWNNSTFALEHH -----------------------------1111--------3333--iiii------- >Glyceraldehyde-3-phosphat; SWP:P00362; PDB:1GD1O; AVKVGINGFGRIGRNVFRAALKNPDIEVVAVNDLTDANTLAHLLKYDSVHGRLDAEVSVN -------------------1111------------------------------------! 
GNNLVVNGKEIIVKAERDPENLAWGEIGVDIVVESTGRFTKREDAAKHLEAGAKKVIISA !!!--------------3333-3333--------------3333---------------- PAKNEDITIVMGVNQDKYDPKAHHVISNASCTTNCLAPFAKVLHEQFGIVRGMMTTVHSY ---------22223333-3333-------------------------------------- TNDQRILDLPHKDLRRARAAAESIIPTTTGAAKAVALVLPELKGKLNGMAMRVPTPNVSV 1111--------3333--1111-------33333333-3333------------------ VDLVAELEKEVTVEEVNAALKAAAEGELKGILAYSEEPLVSRDYNGSTVSSTIDALSTMV -------------------------1111----------33332222------3333--- IDGKMVKVVSWYDNETGYSHRVVDLAAYIASKGL -----------------------------1111- ------------------------------------------------------------ ----- >LYSOZYME; SWP:P48816; PDB:1GD6A; KTFTRCGLVHELRKHGFEENLMRNWVCLVEHESSRDTSKTNTNRNGSKDYGLFQINDRYW ------------1111-1111-----------%%%%------1111----1111------ CSKGASPGKDCNVKCSDLLTDDITKAAKCAKKIYKRHRFDAWYGWKNHCQGSLPDISSC --------1111-3333-------------------!!!!--3333---------1111 >CSAA PROTEIN; SWP:Q9AQH8; PDB:1GD7A; MTPLEAFQILDLRVGRVLRAEPHEKARKPSYKLWVDLGPLGVKQSSAQITELYRPEDLVG ----------------------3333-----------1111--------22223333222 RLVVCAVNLGAKRVAGFLSEVLVLGVPDEAGRVVLLAPDREVPLGGKVF 2------------iiii----------1111-----------2222--- >50S RIBOSOMAL PROTEIN L17; SWP:Q9Z9H5; PDB:1GD8A; SSHRLALYRNQAKSLLTHGRITTTVPKAKELRGFVDHLIHLAKRGDLHARRLVLRDLQDV 3333-----------------------------------------3333----------- KLVRKLFDEIAPRYRDRQGGYTRVLKLAERRRGDGAPLALVELVE -----------1111------------------------------ >ASPARTATE AMINOTRANSFERAS; SWP:O59096; PDB:1GDEA; ALSDRLELVSASEIRKLFDIAAGMKDVISLGIGEPDFDTPQHIKEYAKEALDKGLTHYGP -----3333----------3333----------------3333-------1111-----1 NIGLLELREAIAEKLKKQNGIEADPKTEIMVLLGANQAFLMGLSAFLKDGEEVLIPTPAF 111--------------------3333------3333-----1111-2222--------1 VSYAPAVILAGGKPVEVPTYEEDEFRLNVDELKKYVTDKTRALIINSPCNPTGAVLTKKD 111----------------3333----33333333-1111-------------------- LEEIADFVVEHDLIVISDEVYEHFIYDDARHYSIASLDGMFERTITVNGFSKTFAMTGWR ------------------1111---!!!!---33332222-----------11111111- 
LGFVAAPSWIIERMVKFQMYNATCPVTFIQYAAAKALKDERSWKAVEEMRKEYDRRRKLV -----------------1111--------------------------------------- WKRLNEMGLPTVKPKGAFYIFPRIRDTGLTSKKFSELMLKEARVAVVPGSAFGKAGEGYV ----1111---------------3333-------------------------3333---- RISYATAYEKLEEAMDRMERVLKERKLV ---------------------------- >Glycerate dehydrogenase; SWP:P36234; PDB:1GDHA; KKKILITWPLPEAAMARARESYDVIAHGDDPKITIDEMIETAKSVDALLITLNEKCRKEV ----------3333---1111------------3333--3333-------3333------ IDRIPENIKCISTYSIGFDHIDLDACKARGIKVGNAPHGVTVATAEIAMLLLLGSARRAG 11113333---------1111-------------------3333---------------- EGEKMIRTRSWPGWEPLELVGEKLDNKTLGIYGFGSIGQALAKRAQGFDMDIDYFDTHRA --------------1111----------------3333---------------------- SSSDEASYQATFHDSLDSLLSVSQFFSLNAPSTPETRYFFNKATIKSLPQGAIVVNTARG 3333-1111------33331111---------3333----333311112222------11 DLVDNELVVAALEAGRLAYAGFDVFAGEPNINEGYYDLPNTFLFPHIGSAATQAREDMAH 11-3333-----------------2222---3333-----------1111---------- QANDLIDALFGGADMSYALA --------1111-------- >CYTOCHROME C6; SWP:Q8WKJ8; PDB:1GDVA; ADLDNGEKVFSANCAACHAGGNNAIMPDKTLKKDVLEANSMNTIDAITYQVQNGKNAMPA -----------------2222----1111-------1111--------------!!!!-- FGGRLVDEDIEDAANYVLSQSEKGW --------------------1111- >RIBOSOME RECYCLING FACTOR; SWP:O66928; PDB:1GE9A; MIKELEDIFKEAEKDMKKAVEYYKNEIAGLRTSRASTALVEEIKVEYYGSKVPIKQLGTI --3333------------------------------3333------iiii---------- SVPEHNQIVIQVWDQNAVPAIEKAIREELNLNPTVQGNVIRVTLPPLTEERRRELVRLLH -------------3333------------------------------------------- KITEEARVRVRNVRREAKEMIEELEGISEDEKKRALERLQKLTDKYIDEINKLMEAKEKE -----------------------------------------------------------1 IMSV 111- >Papaya proteinase 4 [Prec; SWP:P05994; PDB:1GECE; LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDLQS -----3333--------------3333-----------------------------1111 YGCNRGYQSTSLQYVAQNGIHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGS !!!!-----------------3333----------3333--------------------- LLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILIKNS 
----1111---------3333---------------------------iiii-------- WGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIKN -1111-iiii--------3333%%%%---------- >GLUCOSE 1-DEHYDROGENASE; SWP:P40288; PDB:1GEEA; MYKDLEGKVVVITGSSTGLGKSMAIRFATEKAKVVVNYRSKEDEANSVLEEIKKVGGEAI -3333-----------------------------------3333--------1111---- AVKGDVTVESDVINLVQSAIKEFGKLDVMINNAGLENPVSSHEMSLSDWNKVIDTNLTGA ----1111-------------------------------1111----------------- FLGSREAIKYFVENDIKGTVINMSSVHEKIPWPLFVHYAASKGGMKLMTETLALEYAPKG -----------1111---------1111---2222--------------------3333- IRVNNIGPGAINTPINAEKFADPEQRADVESMIPMGYIGEPEEIAAVAAWLASSEASYVT ------------3333-11113333-------3333---3333---------3333---- GITLFADGGMTLYPSFQAGRG ------iiii--3333%%%%- >HOLLIDAY JUNCTION RESOLVA; SWP:Q9V301; PDB:1GEFA; MYRKGAQAERELIKLLEKHGFAVVRSAGSKKVDLVAGNGKKYLCIEVKVTKKDHLYVGKR ----------------1111-----2222------------------------------- DMGRLIEFSRRFGGIPVLAVKFLNVGWRFIEVSPKIEKFVFTPSSGVSLEVLLGIQKTLE -----------------------------------------3333--------3333--- >ACETOIN REDUCTASE; SWP:Q48436; PDB:1GEGA; KKVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGHAVAVKVDVS ---------------------------------------------1111--------333 DRDQVFAAVEQARKTLGGFDVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAA 3------------1111---------------3333------------------------ VEAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYC ----1111---------1111---2222--------------------3333-------- PGIVKTPMWAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPDSDY ----------------------2222--------3333---3333---------3333-- MTGQSLLIDGGMVFN --------------- >RIBULOSE-1,5-BISPHOSPHATE; SWP:O93627; PDB:1GEHA; YVDKGYEPSKKRDIIAVFRVTPAEGYTIEQAAGAVAAESSTGTWTTLYPWYEQERWADLS --------------------------3333-------------------------3333- AKAYDFHDMGDGSWIVRIAYPFHAFEEANLPGLLASIAGNIFGMKRVKGLRLEDLYFPEK --------------------1111--------------3333---------------333 LIREFDGPAFGIEGVRKMLEIKDRPIYGVVPKPKVGYSPEEFEKLAYDLLSNGADYMKDD 31111-------------------------1111---3333------------------1 
ENLTSPWYNRFEERAEIMAKIIDKVENETGEKKTWFANITADLLEMEQRLEVLADLGLKH 111--1111--------------------------------------------------- AMVDVVITGWGALRYIRDLAADYGLAIHGHRAMHAAFTRNPYHGISMFVLAKLYRLIGID ---3333--1111-----------------2222-----1111----------------- QLHVGTAEGGKWDVIQNARILRESHYKPDENDVFHLEQKFYSIKAAFPTSSGGLHPGNIQ ------------------3333------1111-------!!!!----------------- PVIEALGTDIVLQLGGGTLGHPDGPAAGARAVRQAIDAIMQGIPLDEYAKTHKELARALE --------------3333--11113333---------1111----3333----------- KWGHVTP ------- >GELATINASE A; SWP:P08253; PDB:1GEN; LGPVTPEICKQDIVFDGIAQIRGEIFFFKDRFIWRTVTPRDKPMGPLLVATFWPELPEKI --------------------%%%%----!!!!-----1111------3333-1111---- DAVYEAPQEEKAVFFAGNEYWIYSASTLERGYPKPLTSLGLPPDVQRVDAAFNWSKNKKT -----------------------!!!!-------3333---3333--------3333--- YIFAGDKFWRYNEVKKKMDPGFPKLIADAWNAIPDNLDAVVDLQGGGHSYFFKGAYYLKL ---!!!!-----1111--------3333------------------------!!!!---- ENQSLKSVKFGSIKSDWLGC 1111-------3333----- >TRYPTOPHAN SYNTHASE ALPHA; SWP:Q8U094; PDB:1GEQA; MFKDGSLIPYLTAGDPDKQSTLNFLLALDEYAGAIELGIPFSDPIADGKTIQESHYRALK --2222-----2222------------3333------------1111------------- NGFKLREAFWIVKEFRRHSSTPIVLMTYYNPIYRAGVRNFLAEAKASGVDGILVVDLPVF ---3333-------3333--------------------------3333---------111 HAKEFTEIAREEGIKTVFLAAPNTPDERLKVIDDMTTGFVYLVSLYEIPKTAYDLLRRAK 1-------------------1111--------1111------------3333-------- RICRNKVAVGFGVSKREHVVSLLKEGANGVVVGSALVKIIGEKGREATEFLKKKVEELLG --------------3333----1111------3333------!!!!----------1111 I - >GLUTATHIONE REDUCTASE; SWP:Q83PT1; PDB:1GESA; KHYDYIAIGGGSGGIASINRAAMYGQKCALIEAKELGGTCVNVGCVPKKVMWHAAQIREA ----------3333-------1111---------22223333------------------ IHMYGPDYGFDTTINKFNWETLIASRTAYIDRIHTSYENVLGKNNVDVIKGFARFVDAKT ---3333----------3333--------------------1111--------------- LEVNGETITADHILIATGGRPSHPDIPGVEYGIDSDGFFALPALPERVAVVGAGYIGVEL -------------------------2222----33331111------------------- GGVINGLGAKTHLFEMFDAPLPSFDPMISETLVEVMNAEGPQLHTNAIPKAVVKNTDGSL 
----1111------------11113333--------------------------1111-- TLELEDGRSETVDCLIWAIGREPANDNINLEAAGVKTNEKGYIVVDKYQNTNIEGIYAVG ---1111---------------------3333-----1111----1111---2222---- DNTGAVELTPVAVAAGRRLSERLFNNKPDEHLDYSNIPTVVFSHPPIGTVGLTEPQAREQ --------------------------1111------------------------------ YGDDQVKVYKSSFTAMYTAVTTHRQPCRMKLVCVGSEEKIVGIHGIGFGMDEMLQGFAVA -3333---------3333----------------1111--------2222---------- LKMGATKKDFDNTVAIHPTAAEEFVTMR -----33331111------3333----- >Capsid protein; SWP:P03642; PDB:1GFF1; VPHDLSHLVFEAGKIGRLKTISWTPVVAGDSFECDMVGAIRLSPLRRGLAVDSRVDIFSF --------------------------2222------------------------------ YIPHRHIYGQQWINFMKDGVNASPLPPVTCSSGWDSAAYLGTIPSSTLKVPKFLHQGYLN --3333------------1111---------------1111---1111--3333------ IYNNYFKPPWSDDLTYANPSNMPSEDYKWGVRVANLKSIWTAPLPPDTRTSENMTTGTST -------1111------1111-1111------------------1111------------ IDIMGLQAAYAKLHTEQERDYFMTRYRDIMKEFGGHTSYDGDNRPLLLMRSEFWASGYDV -3333-------------------3333-1111----1111------------------- DGTDQSSLGQFSGRVQQTFNHKVPRFYVPEHGVIMTLAVTRFPPTHEMEMHYLVGKENLT --------------------------------------------------3333-----3 YTDIACDPALMANLPPREVSLKEFFHSSPDSAKFKIAEGQWYRTQPDRVAFPYNALDGFP 333---3333---------1111-----3333----2222---------3333------- FYSALPSTDLKDRVLVNTNNYDEIFQSMQLAHWNMQTKFNINVYRHMPTTRDSIMTS ----------1111---11113333-----------------------1111----- >Major spike protein; SWP:P03644; PDB:1GFF2; MFQKFISKHNAPINSTQLAATKTPAVAAPVLSVPNLSRSTILINATTTAVTTHSGLCHVV ------------------------------------------------------------ RIDETNPTNHHALSIAGSLSNVPADMIAFAIRFEVADGVVPTAVPALYDVYPIETFNNGK -----------------------------------2222--------------------- AISFKDAVTIDSHPRTVGNDVYAGIMLWSNAWTASTISGVLSVNQVNREATVLQPLK --------------------------------------------------------- ------------ >ERYTHROID MEMBRANE PROTEI; SWP:P11171; PDB:1GG3A; MHCKVSLLDDTVYECVVEKHAKGQDLLKRVCEHLNLLEEDYFGLAIWDNATSKTWLDSAK -----------------1111-----------1111--------------------1111 
EIKKQVRGVPWNFTFNVKFYPPDPAQLTEDITRYYLCLQLRQDIVAGRLPCSFATLALLG -33332222-------------3333---------------------------------- SYTIQSELGDYDPELHGVDYVSDFKLAPNQTKELEEKVMELHKSYRSMTPAQADLEFLEN ------------------3333--------3333-------------------------- AKKLSMYGVDLHKAKDLEGVDIILGVCSSGLLVYKDKLRINRFPWPKVLKISYKRSSFFI 3333----------------------3333-----------------------!!!!--- KIRPGEQEQYESTIGFKLPSYRAAKKLWKVCVEHHTFFR --------------------------------3333--- >UDP-N-ACETYLMURAMOYLALANY; SWP:P11880; PDB:1GG4A; ISVTLSQLTDILNGELQGADITLDAVTTDTRKLTPGCLFVALKGERFDAHDFADQAKAGG ---3333--1111---------------1111----------------3333-------- AGALLVSRPLDIDLPQLIVKDTRLAFGELAAWVRQQVPARVVALTGSSGKTSVKETAAIL --------------------------------3333-----------------------3 SQCGNTLYTAGNLNNDIGVPTLLRLTPEYDYAVIELGANHQGEIAWTVSLTRPEAALVNN 333-------------33333333-3333------------------------------- LASLAGVAKAKGEIFSGLPENGIAINADNNDWLNWQSVIGSRKVWRFSPNAANSDFTATN --3333--------11111111---1111-3333----!!!!--------1111------ IHVTSHGTEFTLQTPTGSVDVLLPLPGRHNIANALAAAALSSVGATLDAIKAGLANLKAV -------------1111---------3333---------------3333----1111--2 PGRLFPIQLAENQLLLDDSYNANVGSTAAVQVLAEPGYRVLVVGDAELGAESEACHVQVG 222------2222---------3333----------------------1111-------- EAAKAAGIDRVLSVGKQSHAISTASGVGEHFADKTALITRLKLLIAEQQVITILVKGSRS ----------------3333-------------------------------------333 AAEEVVRALQ 3--------- >Glyceraldehyde-3-phosphat; SWP:P22512; PDB:1GGAO; TIKVGINGFGRIGRMVFQALCDDGLLGNEIDVVAVVDMNTDARYFAYQMKYDSVHGKFKH --------------------1111----------------3333---------------- SVSTTKSKPSVAKDDTLVVNGHRILCVKAQRNPADLPWGKLGVEYVIESTGLFTVKSAAE ------------------iiii---------3333-3333--------------3333-- GHLRGGARKVVISAPASGGAKTFVMGVNHNNYNPREQHVVSNASCTTNCLAPLVHVLVKE 3333-------------------22221111-3333---------------------111 GFGISTGLMTTVHSYTATQKTVDGVSVKDWRGGRAAALNIIPSTTGAAKAVGMVIPSTQG 1--------------1111---------3333--1111--------3333----3333-- KLTGMAFRVPTADVSVVDLTFIATRDTSIKEIDAALKRASKTYMKNILGYTDEELVSADF 
-----------------------------------------1111----------33332 ISDSRSSIYDSKATLQNNLPNERRFFKIVSWYDNEWGYSHRVVDLVRHMAARDRAAKL 222---------------2222------------------------------------ >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1GGBH; QVQLQESGPGILQPSQTLSLTCSFSGFSLSTYGMGVS ------------------------------2222--- >Ig gamma-2A chain C regio; SWP:GCAA_MOUSE; PDB:1GGIH; QVQLKESGPGILQPSQTLSLTCSFSGFSLSTYGMGVSWIRQPSGKGLEWLAHIFWDGDKR ------------2222--------------2222-------------------1111--- YNPSLKSRLKISKDTSNNQVFLKITSVDTADTATYYCVQEGYIYWGQGTSVTVSSAKTTA -1111--------1111---------------------2222------------------ PSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLS ------------------------------------iiii-------------------- SSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR ----------------------------------- >Ig gamma-2A chain C regio; SWP:GCAA_MOUSE; PDB:1GGIL; DIVLTQSPGSLAVSLGQRATISCRASESVDDDGNSFLHWYQQKPGQPPKLLIYRSSNLIS ----------------------------------------------------------22 GIPDRFSGSGSRTDFTLTINPVEADDVATYYCQQSNEDPLTFGAGTKLEIKRADAAPTVS 223333----!!!!---------------------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ---------------------------------iiii----------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR -----3333------------1111---------- >CELLULAR RETINOL-BINDING ; SWP:P82980; PDB:1GGLA; PPNLTGYYRFVSQKNMEDYLQALNISLAVRKIALLLKPDKEIEHQGNHMTVRTLSTFRNY --------------------1111-3333--3333------------------------- TVQFDVGVEFEEDLRSVDGRKCQTIVTWEEEHLVCVQKGEVPNRGWRHWLEGEMLYLELT ----2222-----1111-----------%%%%------------------!!!!------ ARDAVCEQVFRKVH !!!!---------- >LECTIN 1 A CHAIN; SWP:Q7SIF0; PDB:1GGPA; STDASYNADIKEERDAAEGPMAHGIPALNAGALDEARAYATVDSANTDEEVSVAVDVTNL ---------------------------------1111----------------------- AVVAYRAGSNSYFHAAAPGSSLSHLFSRSSQHTLGFDNTYGDMAQAAGSNRKAIPLGAAA -------------11113333----1111---------3333-------3333------- LESGIASLNSKNPLARTLMVIIQMLVEAARFRYIQNNVDVSIETQSAFAADAAMISLENN 
-------------3333---3333------3333----------------3333------ WANLSALVQGSSGG -------------- >Sugar binding protein; SWP:Q7SIF1; PDB:1GGPB; CAAATVRIAGRDGFCADVNGEGQNGAAIILKKCAENDNQLWTLKREATIRSNGGCLTTAA ----------%%%%---------------------1111----1111------------- AEQAKAGIYDCTQATAELSAWEIADNGTIINPASSLVLSSGAANSLLDLGVQTNSYASAQ ---------1111--1111----3333----3333--------2222---------1111 GWRTGNETSASVTQISGSAQLCMQAG -----------------%%%%----- >OUTER SURFACE PROTEIN C; SWP:Q07337; PDB:1GGQA; GPNLTEISKKITDSNAVLLAVKEVEALLSSIDEIAAKAIGKKIHQNNGLDTENNHNGSLL -------------------------------------2222------------------- AGAYAISTLIKQKLDGLKNEGLKEKIDAAKKCSETFTNKLKEKHTDLGKEGVTDADAKEA -------------1111--1111-------------------3333-------------- ILKTNGTKTKGAEELGKLFESVEVLSKAAKEMLANSVKELTS -1111--------------------------------1111- >CDC4P; SWP:Q09196; PDB:1GGWA; STDDSPYKQAFSLFDRHGTGRIPKTSIGDLLRACGQNPTLAEITEIESTLPAEVDMEQFL ----------33333333----3333---3333-----3333------------------ QVLNRPNGFDMPGDPEEFVKGFQVFDKDATGMIGVGELRYVLTSLGEKLSNEEMDELLKG ---1111--------------------------3333----------------------- VPVKDGMVNYHDFVQMILAN -------%%%%--------- >CALMODULIN-RELATED PROTEI; SWP:P27482; PDB:1GGZA; LTEEQVTEFKEAFSLFDKDGDGCITTRELGTVMRSLGQNPTEAELRDMMSEIDRDGNGTV ----------------1111-------------1111---------------1111---- DFPEFLGMMARKMKDTDNEEEIREAFRVFDKDGNGFVSAAELRHVMTRLGEKLSDEEVDE -----------------------------1111-------------1111---------- MIRAADTDGDGQVNYEEFVRVLVS -----1111--------------- >C-PHYCOCYANIN ALPHA SUBUN; SWP:P72509; PDB:1GH0A; MKTPLTEAVSVADSQGRFLSSTEIQVAFGRFRQAKAGLEAAKALTSKADSLISGAAQAVY ------------1111-------------------------------------------- NKFPYTTQMQGPNYAADQRGKDKCARDIGYYLRMVTYCLIAGGTGPMDEYLIAGIDEINR --3333----1111-------------------------------------2222----1 TFELSPSWYIEALKYIKANHGLSGDAAVEANSYLDYAINALS 111-3333-----------------------------3333- >C-phycocyanin beta chain; SWP:P72508; PDB:1GH0B; MFDAFTKVVSQADTRGEMLSTAQIDALSQMVAESNKRLDVVNRITSNASTIVSNAARSLF 
------------1111-------------------------------------------- AEQPQLIAPGGAYTSRRMAACLRDMEIILRYVTYAVFAGDASVLEDRCLNGLRETYLALG ---33332222-------------------------------------2222-------- TPGSSVAVGVGKMKEAALAIVNDPAGITPGDCSALASEIAGYFDRAAAAVS -3333-----------------------------------------3333- >THIOREDOXIN-LIKE PROTEIN; SWP:O43396; PDB:1GH2A; VGVKPVGSDPDFQPELSGAGSRLAVVKFTMRGCGPCLRIAPAFSSMSNKYPQAVFLEVDV -------3333-------!!!!---------------------------1111-----11 HQCQGTAATNNISATPTFQFFRNKVRIDQYQGADAVGLEEKIKQHLE 11-----1111----------iiii---------------------- >LARGE T ANTIGEN; SWP:P03070; PDB:1GH6A; SHMREESLQLMDLLGLERSAWGNIPLMRKAYLKKCKEFHPDKGGDEEKMKKMNTLYKKME -3333------1111--------------------------------------------1 DGVKYAHQPDFGGFWDATEIPTYGTDEWEQWWNAFNEENLFCSEEMPSSDDEAT 111--------------------------------------------------- >TRANSLATION ELONGATION FA; SWP:O27734; PDB:1GH8A; MGDVVATIKVMPESPDVDLEALKKEIQERIPEGTELHKIDEEPIAFGLVALNVMVVVGDA -----------------3333--------------------------------------- EGGTEAAEESLSGIEGVSNIEVTDVRRLM --------3333--3333----------- >8.3 KDA PROTEIN (GENE MTH; SWP:O27252; PDB:1GH9A; MYIIFRCDCGRALYSREGAKTRKCVCGRTVNVKDRRIFGRADDFEEASELVRKLQEEKYG ------1111-------------3333--------------------------------- SCHFTNPSKRE -----1111-- >GH1; SWP:P08287; PDB:1GHC; MAGPSVTELITKAVSASKERKGLSLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLV ----3333----1111------%%%%-1111----------1111--------3333--- QTKGTGASGSFRLSK --------------- >ACETYLTRANSFERASE; SWP:P16966; PDB:1GHEA; AQLRRVTAESFAHYRHGLAQLLFETVHGGASVGFADLDQQAYAWCDGLKADIAAGSLLLW ------3333-1111----------1111------------------------------- VVAEDDNVLASAQLSLCQKPNGLNRAEVQKLVLPSARGRGLGRQLDEVEQVAVKHKRGLL ---!!!!-----------1111----------1111---3333---------1111---- HLDTEAGSVAEAFYSALAYTRVGELPGYCATPDGRLHPTAIYFKTL ----2222------1111------------1111------------ >ANTI-ANTI-IDIOTYPE GH1002; SWP:NA; PDB:1GHFH; VQLQQSGPELKKPGETVKISCKLWYTFTDYGMNWVKQAPGKGLKWMGWIQTNTEEPTYGA -------------------------1111-----------------------------11 
EFKGRFAFSLETSAFTAYKQINNLKNEDMATYFCARVEAGFDYWAQGTTLTVSSAKTTPP 11---------1111--------------------------------------------- SVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSS ------!!!!------------------------%%%%-------------%%%%----- SVTVPSSPRPSETVTCNVAHPASSTKVDKKII -------------------------------- >ANTI-ANTI-IDIOTYPE GH1002; SWP:NA; PDB:1GHFL; DIQMTQTTSSLSASLGDRVTISCRESQDISNSLNWYQQKPDGTVKLLIYYTSRLHSGVPS --------------------------------------1111------------222233 RFSGSGTGTDYSLTISNLEQEDFATYFCQQGNTLPYTFGGGTKLEIKRADAAQTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNR ---3333------------------------ >DNA-DAMAGE-INDUCIBLE PROT; SWP:Q47143; PDB:1GHHA; MRIEVTIAKTSPLPAGAIDALAGELSRRIQYAFPDNEGHVSVRYAAANNLSVIGATKEDK -------3333---------------------1111------------------------ QRISEILQETWESADDWFVSE ------------3333----- >E2, THE DIHYDROLIPOAMIDE ; SWP:P20708; PDB:1GHJ; AIDIKAPTFPESIADGTVATWHKKPGEAVKRDELIVDIETDKVVMEVLAEADGVIAEIVK ------------------------------------------------------------ NEGDTVLSGELLGKLTEGG 2222--2222--------- >PHEASANT EGG WHITE LYSOZY; SWP:P00702; PDB:1GHLA; GKVYGRCELAAAMKRMGLDNYRGYSLGNWVCAAKFESNFNTGATNRNTDGSTDYGILQIN -----------------2222---3333--------%%%%------1111----1111-3 SRWWCNDGRTPGSKNLCHIPCSALLSSDITASVNCAKKIVSDGNGMNAWVAWRKHCKGTD 333-----------1111-3333--------------3333--!!!!--------22223 VNVWIRGCRL 333-2222-- >BETA-LACTAMASE; SWP:P00807; PDB:1GHPA; KELNDLEKKYNAHIGVYALDTKSGKEVKFNSDKRFAYASTSKAINSAILLEQVPYNKLNK -----------------------------1111---------------3333-3333--- KVHINKDDIVAYSPILEKYVGKDITLKALIEASMTYSDNTANNKIIKEIGGIKKVKQRLK ----3333------33332222-------------------------------------1 ELGDKVTNPVRYDIELQYYSPKSKKDTSTPAAFGKTLNKLIANGKLSKENKKFLLDLMLN 111----------------1111------------------------------------- NKSGDTLIKDGVPKDYKVADKSGQAITYASRNDVAFVYPKGQSEPIVLVIFTNKDNKSDK 
1111--3333--1111----------%%%%-------------------------1111- PNDKLISETAKSVMKEF -3333------------ >1,3-BETA-GLUCANASE; SWP:P15737; PDB:1GHSA; IGVCYGVIGNNLPSRSDVVQLYRSKGINGMRIYFADGQALSALRNSGIGLILDIGNDQLA -----------------------------------3333---2222--------1111-- NIAASTSNAASWVQNNVRPYYPAVNIKYIAAGNEVQGGATQSILPAMRNLNAALSAAGLG ----------------1111----------------3333------------------33 AIKVSTSIRFDEVANSFPPSAGVFKNAYMTDVARLLASTGAPLLANVYPYFAYRDNPGSI 33------1111-----3333----3333------------------------------- SLNYATFQPGTTVRDQNNGLTYTSLFDAMVDAVYAALEKAGAPAVKVVVSESGWPSAGGF -----------------------------------------1111-------------22 AASAGNARTYNQGLINHVGGGTPKKREALETYIFAMFNENQKTGDATERSFGLFNPDKSP 22--------------3333-3333-------------------3333------1111-- AYNIQF ------ >IGG1-KAPPA HC19 FAB (HEAV; SWP:NA; PDB:1GIGH; QVQLKESGPGLVAPSQSLSITCTVSGFLLISNGVHWVRQPPGKGLEWLGVIWAGGNTNYN ---------------------------3333--------------------1111----- SALMSRVSISKDNSKSQVFLKMKSLQTDDTAMYYCARDFYDYDVFYYAMDYWGQGTSVTV ------------1111---------1111------------------------------- SSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQ ------------------------------------------%%%%-------------% SDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVP %%%---------3333------------1111--------- >IOTA TOXIN COMPONENT IA; SWP:Q46220; PDB:1GIQA; IERPEDFLKDKENAIQWEKKEAERVEKNLDTLEKEALELYKKDSEQISNYSQTRQYFYDY ------!!!!--------------1111------------------------1111---- QIESNPREKEYKNLRNAISKNKIDKPINVYYFESPEKFAFNKEIRTENQNEISLEKFNEL ----3333---------------------------1111--------------------- KETIQDKLFKQDGFKDVSLYEPGNGDEKPTPLLIHLKLPKNTGMLPYINSNDVKTLIEQD ----------------------2222---------------------------------- YSIKIDKIVRIVIEGKQYIKAEASIVNSLDFKDDVSKGDLWGKENYSDWSNKLTPNELAD ------------iiii--------------!!!!-----------11111111------- VNDYMRGGYTAINNYLISNGPLNNPNPELDSKVNNIENALKLTPIPSNLIVYRRSGPQEF ------------------3333----------------1111-------------3333- GLTLTSPEYDFNKIENIDAFKEKWEGKVITYPNFISTSIGSVNMSAFAKRKIILRINIPK 
--11111111-------------2222-------------------1111--------22 DSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGTVTKLILDATLIN 22---1111---------------------------!!!!----------- >RIBOSOME-INACTIVATING PRO; SWP:P09989; PDB:1GISA; MDVSFRLSGATSSSYGVFISNLRKALPNERKLYDIPLLRSSLPGSQRYALIHLTNYADET ------2222---------------------%%%%-------3333--------1111-- ISVAIDVTNVYIMGYRAGDTSYFFNQASATEAAKYVFKDAMRKVTLPYSGNYERLQTAAG ----------------!!!!-----------1111-1111-------------------- KIRENIPLGLPALDSAITTLFYYNANSAASALMVLIQSTSEAARYKFIEQQIGKRVDKTF -3333------------------3333--------------------------------- LPSLAIISLENSWSALSKQIQIASTNNGQFESPVVLINAQNQRVTITNVDAGVVTSNIAL -------------------------iiii--------1111------11113333----- LLNRNNMA --1111-- >C-REL PROTO-ONCOGENE PROT; SWP:P16236; PDB:1GJIA; PYIEIFEQPRQRGMRFRYKCEGRSAGSIPGEHSTDNNKTFPSIQILNYFGKVKIRTTLVT -----------------3333--------------------------------------- KNEPYKPHPHDLVGKDCRDGYYEAEFGPERRVLSFQNLGIQCVKKKDLKESISLRISKKI -------------2222iiii----------------------3333-3333-------- NPFNVPEEQLHNIDEYDLNVVRLCFQAFLPDEHGNYTLALPPLISNPIYDNRAPNTAELR -----3333----------------------------------------3333------- ICRVNKNCGSVKGGDEIFILCDKVQKDDIEVRFVLDNWEAKGSFSQADVHRQVAIVFRTP ---------3333-----------3333------!!!!------3333------------ PFLRDITEPITVKMQLRRPSDQEVSEPMDFRYLPD ----------------------------------- >LAP2; SWP:P42166; PDB:1GJJA; MPEFLEDPSVLTKDKLKSELVANNVTLPAGEQRKDVYVQLYLQHLTARNRDVTELTNEDL --------------------------------------------------1111------ LDQLVKYGVNPGPIVGTTRKLYEKKLLKLREQG ---------------------------3333-- >IMMUNOGLOBULIN G BINDING ; SWP:P19909; PDB:1GJSA; MKAIFVLNAQHDEAVDANSLAEAKVLANRELDKYGVSDYYKNLINNAKTVEGVKALIDEI ------------------------------------------3333-------------- LAALP ----- >MALTODEXTRIN GLYCOSYLTRAN; SWP:O33838; PDB:1GJWA; MLLREINRYCKEKATGKRIYAVPKLWIPGFFKKFDEKSGRCFVDPYELGAEITDWILNQS ----------------------3333-3333------------------------3333- REWDYSQPLSFLKGEKTPDWIKRSVVYGSLPRTTAAYNHKGSGYYEENDVLGFREAGTFF 
-------3333------3333--------3333----3333-------1111-------- KMMLLLPFVKSLGADAIYLLPVSRMSDLFKKGDAPSPYSVKNPMELDERYHDPLLEPFKV ---------1111----------------------1111--1111------3333----- DEEFKAFVEACHILGIRVILDFIPRTAARDSDLIREHPDWFYWIKVEELADYTPPRAEEL -----------1111-------3333-----3333-1111----33331111----3333 PFKVPDEDELEIIYNKENVKRHLKKFTLPPNLIDPQKWEKIKREEGNILELIVKEFGIIT -----3333-------------1111--3333-------3333----------------- PPGFSDLINDPQPTWDDVTFLRLYLDHPEASKRFLDPNQPPYVLYDVIKASKFPGKEPNR ------2222-----------------3333------------3333------------- ELWEYLAGVIPHYQKKYGIDGARLDMGHALPKELLDLIIKNVKEYDPAFVMIAEELDMEK --------3333-------------3333----------------1111-------1111 DKASKEAGYDVILGSSWYFAGRVEEIGKLPDIAEELVLPFLASVETPDTPRIATRKYASK ----1111------33331111--3333----1111---------1111-1111--3333 MKKLAPFVTYFLPNSIPYVNTGQEIGEKQPMNLGLDTDPNLRKVLSPTDEFFGKLAFFDH -----------2222----2222-----------------1111-1111-22223333-- YVLHWDSPDRGVLNFIKKLIKVRHEFLDFVLNGKFENLTTKDLVMYSYEKNGQKIVIAAN ---1111--------------------3333--------1111------iiii------- VGKEPKEITGGRVWNGKWSDEEKVVLKPLEFALVVQ --------------------------2222------ ------------------------------------------------------------ --------------------- >SORCIN; SWP:P05044; PDB:1GJYA; MDPLYGYFASVAGQDGQIDADELQRCLTQSGIAGGYKPFNLETCRLMVSMLDRDMSGTMG -3333-3333--1111--------------3333-----3333----------------- FNEFKELWAVLNGWRQHFISFDSDRSGTVDPQELQKALTTMGFRLNPQTVNSIAKRYSTS -----------------11111111----3333----1111----3333---------ii GKITFDDYIACCVKLRALTDSFRRRDSAQQGMVNFSYDDFIQCVMTV ii--------------------3333---------3333----1111 >MEGF/TGFALPHA44-50 CHIMER; SWP:P01132; PDB:1GK5A; NSYPGCPSSYDGYCLNGGVCMHIESLDSYTCNCVIGYSGDRCEHADLLA ---------------------------------------%%%%------ ------------------------------------------------------- >VIMENTIN; SWP:P08670; PDB:1GK7A; GSNEKVELQELNDRFANYIDKVRFLEQQNKILLAELEQL -----------------------------------1111 >RIBULOSE BISPHOSPHATE CAR; SWP:P00877; PDB:1GK8A; TKAGAGFKAGVKDYRLTYYTPDYVVRDTDILAAFRMTPQPGVPPEECGAAVAAESSTGTW 
------------3333---1111--1111---------2222------------------ TTVWTDGLTSLDRYKGRCYDIEPVPGEDNQYIAYVAYIDLFEEGSVTNMFTSIVGNVFGF --3333---3333----------2222----------33332222----------33331 KALRALRLEDLRIPPAYVKTFVGPHGIQVERDKLNKYGRGLLGCTIKPKLGLSAKNYGRA 111----------33331111--------------------------------------- VYECLRGGLDFTDDENVNSQPFMRWRDRFLFVAEAIYKAQAETGEVKGHYLNATAGTCEE ----1111-----1111--3333------------------------------------- MMKRAVAKELGVPIIMHDYLTGGFTANTSLAIYCRDNGLLLHIHRAMHAVIDRQRNHGIH ------3333-------3333-----------------------2222-----1111--- FRVLAKALRMSGGDHLHSGTVVGKLEGEREVTLGFVDLMRDDYVEKDRSRGIYFTQDWSM -----------------------------------------------1111--------- PGVMPVASGGIHVWHMPALVEIFGDDACLQFGGGTLGHPWGNAPGAAANRVALEACTQAR -----------3333----------------1111--1111------------------1 NEGRDLAREGGDVIRSACKWSPELAAACEVWKEIKFEFDTIDKL 111-3333----------------------1111---------- >Ribulose bisphosphate car; SWP:P00873; PDB:1GK8I; MVWTPVNNKMFETFSYLPPLTDEQIAAQVDYIVANGWIPCLEFAEADKAYVSNESAIRFG -----------2222-----------------1111--------3333----3333---- SVSCLYYDNRYWTMWKLPMFGCRDPMQVLREIVACTKAFPDAYVRLVAFDNQKQVQIMGF --2222------------2222-3333-----------1111------------------ LVQRP ----- >PENICILLIN G ACYLASE ALPH; SWP:P06875; PDB:1GK9A; QSSSEIKIVRDEYGMPHIYANDTWHLFYGYGYVVAQDRLFQMEMARRSTQGTVAEVLGKD -1111-----1111-------------------------------------3333--333 FVKFDKDIRRNYWPDAIRAQIAALSPEDMSILQGYADGMNAWIDKVNTNPETLLPKQFNT 3-------11113333----11113333--------------------------3333-- FGFTPKRWEPFDVAMIFVGTMANRFSDSTSEIDNLALLTALKDKYGVSQGMAVFNQLKWL --------3333------------------------------------------------ VNPSAPTTIAVQESNYPLKFNQQNSQTA -1111----3333----------3333- >Penicillin G acylase [Pre; SWP:P06875; PDB:1GK9B; SNMWVIGKSKAQDAKAIMVNGPQFGWYAPAYTYGIGLHGAGYDVTGNTPFAYPGLVFGHN ------1111----------------------------iiii------%%%%-------- GVISWGSTAGFGDDVDIFAERLSAEKPGYYLHNGKWVKMLSREETITVKNGQAETFTVWR ----------------------3333-----iiii------------------------- 
TVHGNILQTDQTTQTAYAKSRAWDGKEVASLLAWTHQMKAKNWQEWTQQAAKQALTINWY 1111------1111------1111------------1111---------1111------- YADVNGNIGYVHTGAYPDRQSGHDPRLPVPGTGKWDWKGLLPFEMNPKVYNPQSGYIANW --1111-------------22221111-----1111-----3333------3333----- NNSPQKDYPASDLFAFLWGGADRVTEIDRLLEQKPRLTADQAWDVIRQTSRQDLNLRLFL ----2222----1111--------------------------------1111--3333-- PTLQAATSGLTQSDPRRQLVETLTRWDGINLLNDDGKTWQQPGSAILNVWLTSMLKRTVV ------11111111------------------3333----------------------33 AAVPMPFDKWYSASGYETTQDGPTGSLNISVGAKILYEAVQGDKSPIPQAVDLFAGKPQQ 33-----3333-------1111------------------!!!!-------1111--333 EVVLAALEDTWETLSKRYGNNVSNWKTPAMALTFRANNFFGVPQAAAEETRHQAEYQNRG 3-------------------3333-------------1111----3333----------- TENDMIVFSPTTSDRPVLAWDVVAPGQSGFIAPDGTVDKHYEDQLKMYENFGRKSLWLTK -------------------------------1111------------1111--------- QDVEAHKESQEVLHVQR ---1111---------- >Crustacyanin-A2 subunit; SWP:P80007; PDB:1GKAB; DGIPSFVTAGKCASVANQDNFDLRRYAGRWYQTHIIENAYQPVTRCIHSNYEYSTNDYGF ---1111--------------3333------------1111------------------- KVTTAGFNPNDEYLKIDFKVYPTKEFPAAHMLIDAPSVFAAPYEVIETDYETYSCVYSCI ----------------------33331111------------------------------ TTDNYKSEFAFVFSRTPQTSGPAVEKTAAVFNKNGVEFSKFVPVSHTAECVYRA -----------------1111----------1111-3333------3333---- >92 KDA TYPE IV COLLAGENAS; SWP:P14780; PDB:1GKDA; FEGDLKWHHHNITYWIQNYSEDLPRAVIDDAFARAFALWSAVTPLTFTRVYSRDADIVIQ --------------------------------------3333---------1111----- FGVAEHGDGYPFDGKDGLLAHAFPPGPGIQGDAHFDDDELWSLGKGQGYSLFLVAAHQFG --------------------------!!!!------------------------------ HALGLDHSSVPEALMYPMYRFTEGPPLHKDDVNGIRHLY 1111-----1111--------------3333-------- ------------------------------------------------------------ ------------------------------------------------------------ ---------------- >HISTIDINE AMMONIA-LYASE; SWP:P21310; PDB:1GKMA; TELTLKPGTLTLAQLRAIHAAPVRLQLDASAAPAIDASVACVEQIIAEDRTAYGINTGFG -----2222------------------3333--------------1111--2222----1 
LLASTRIASHDLENLQRSLVLSHAAGIGAPLDDDLVRLIMVLKINSLSRGFSGIRRKVID 111-------------------------------------------1111----3333-- ALIALVNAEVYPHIPLKGSVGASGDLAPLAHMSLVLLGEGKARYKGQWLSATEALAVAGL ----------------------------------1111-----iiii--------1111- EPLTLAAKEGLALLNGTQASTAYALRGLFYAEDLYAAAIACGGLSVEAVLGSRSPFDARI -----2222-------------------------------------1111--3333---- HEARGQRGQIDTAACFRDLLGDSSEVSLSHKNADKVQDPYSLRCQPQVMGACLTQLRQAA -----------------------------!!!!-----3333------------------ EVLGIEANAVSDNPLVFAAEGDVISGGNFHAEPVAMAADNLALAIAEIGSLSERRISLMM -----1111--------1111-----3333------------------------------ DKHMSQLPPFLVENGGVNSGFMIAQVTAAALASENKALSHPHSVDSLPTSANQEDHVSMA 1111---2222-------!!!!-------------------1111--------------- PAAGKRLWEMAENTRGVLAIEWLGACQGLDLRKGLKTSAKLEKARQALRSEVAHYDRDRF -------------------------------2222------------3333--------- FAPDIEKAVELLAKGSLTGLLPAGVLPSL 3333----------1111-------1111 >COMPLEMENT RECEPTOR TYPE ; SWP:P17927; PDB:1GKNA; EAEAHCQAPDHFLFAKLKTQTTASDFPIGTSLKYECRPEYYGRPFSITCLDNLVWSSPKD ------------------------------------1111-------------------- VCKRKSCKTPPDPVNGMVHVITDIQVGSRITYSCTTGHRLIGHSSAECILSGNTAHWSTK ------------------------------------------------------------ PPICQRIP -------- >HYDANTOINASE; SWP:Q7SIE9; PDB:1GKPA; PLLIKNGEIITADSRYKADIYAEGETITRIGQNLEAPPGTEVIDATGKYVFPGFIDPHVH ------------------------------------2222----2222------------ IYLPFMATFAKDTHETGSKAALMGGTTTYIEMCCPSRNDDALEGYQLWKSKAEGNSYCDY ----!!!!----3333-------------------1111------------2222----- TFHMAVSKFDEKTEGQLREIVADGISSFIFLSYKNFFGVDDGEMYQTLRLAKELGVIVTA ---------1111-------------------2222------------------------ HCENAELVGRLQQKLLSEGKTGPEWHEPSRPEAVEAEGTARFATFLETTGATGYVVHLSC ---3333--------1111--33333333-3333-------------------------- KPALDAAMAAKARGVPIYIESVIPHFLLDKTYAERGGVEAMKYIMSPPLRDKRNQKVLWD ----------1111-------3333----3333---3333----------3333------ ALAQGFIDTVGTDHCPFDTEQKLLGKEAFTAIPNGIPAIEDRVNLLYTYGVSRGRLDIHR -1111------------3333-1111-3333------1111------------------- 
FVDAASTKAAKLFGLFPRKGTIAVGSDADLVVYDPQYRGTISVKTQHVNNDYNGFEGFEI ----------------------2222-------1111----3333---------2222-- DGRPSVVTVRGKVAVRDGQFVGEKGWGKLLRREPMYF --------iiii---iiii---2222----------- >NON-ATP DEPENDENT L-SELEC; SWP:P81006; PDB:1GKRA; MFDVIVKNCRLVSSDGITEADILVKDGKVAAISADTSDVEASRTIDAGGKFVMPGVVDEH ------------------------iiii------------------iiii---------- VHIIDMDLKNRYGRFELDSESAAVGGITTIIEMPITFPPTTTLDAFLEKKKQAGQRLKVD ----!!!!-----3333------------------------------------------- FALYGGGVPGNLPEIRKMHDAGAVGFSMMAASVPGMFDAVSDGELFEIFQEIAACGSVIV -------22223333---1111-------------------------------------- VHAENETIIQALQKQIKAAGGKDMAAYEASQPVFQENEAIQRALLLQKEAGCRLIVLHVS ---------------------------1111----------------------------- NPDGVELIHQAQSEGQDVHCESGPQYLNITTDDAERIGPYMKVAPPVRSAEMNIRLWEQL -----------1111-------3333---1111---------------3333-------- ENGLIDTLGSDHGGHPVEDKEPGWKDVWKAGNGALGLETSLPMMLTNGVNKGRLSLERLV ---------------1111-3333-3333------1111--------------------- EVMCEKPAKLFGIYPQKGTLQVGSDADLLILDLDIDTKVDASQFRSLHKYSPFDGMPVTG -------------3333---2222---------------3333-------1111------ APVLTMVRGTVVAEKGEVLVEQGFGQFVTR ------iiii----------2222------ >Reverse gyrase; SWP:O29238; PDB:1GKUB; AAAAAAAAAAAAAAAAAAASLCLFPEDFLLKEFVEFFRKCVGEPRAIQKMWAKRILRKES -----------------------1111-----------------3333------1111-- FAATAPTGVGKTSFGLAMSLFLALKGKRCYVIFPTSLLVIQAAETIRKYAEKAGVGTENL ----------3333--------1111-------------------------------333 IGYYHGRIPKREKENFMQNLRNFKIVITTTQFLSKHYRELGHFDFIFVDDVDAILKASKN 3------------------1111-------3333----------------3333------ VDKLLHLLGFHYDLKTKSWVGEARGCLMVSTATAKKGKKAELFRQLLNFDIGSSRITVRN -----1111---------------------------3333-------------------- VEDVAVNDESISTLSSILEKLGTGGIIYARTGEEAEEIYESLKNKFRIGIVTATKKGDYE --------------3333-----------------------3333-----2222------ KFVEGEIDHLIGTAHRGLDLPERIRFAVFVGCPSFRVTIEDIDSLSPQMVKLLAYLYRNV ----------------------------------------1111---------------- DEIERLLPAVERHIDEVREILKKVMGKERPQAKDVVVREGEVIFPDLRTYIQGSGRTSRL 
------3333------------------------------------------3333---- FAGGLTKGASFLLEDDSELLSAFIERAKLYDIEFKSIDEVDFEKLSRELDESRDRYRRRQ 1111-----------------------1111--------------------------111 EFDLIKPALFIVESPTKARQISRFFGKPSVKVLDGAVVYEIPMQKYVLMVTASIGHVVDL 1---------------------1111---------------------------------- ITNRGFHGVLVNGRFVPVYASIKDNSRSRIEALRKLAHDAEFVIVGTDPDTEGEKIAWDL ------------------------------------------------------------ KNLLSGCGAVKRAEFHEVTRRAILEALESLRDVDENLVKAQVVRRIEDRWIGFVLSQKLW ---2222------------------1111------------------------------- ERFNNRNLSAGRAQTLVLGWIIDRFQESRERRKIAIVRDFDLVLEHDEEEFDLTIKLVEE -----------------------3333---------2222-------------------- REELRTPLPPYTTETMLSDANRILKFSVKQTMQIAQELFENGLITYHRTDSTRVSDVGQR --------------------------------------1111------------------ IAKEYLGDDFVGREWGESGAHECIRPTRPLTRDDVQRLIQEGVLVVEGLRWEHFALYDLI --------------------------------------3333-------3333------- FRRFMASQCRPFKVVVKKYSIEFDGKTAEEERIVRAEGRAYELYRAVWVKNELPTGTFRV -----1111--------------------------------------------------- KAEVKSVPKVLPFTQSEIIQMMKERGIGRPSTYATIVDRLFMRNYVVEKYGRMIPTKLGI -------------3333-----------1111--------1111----%%%%-------- DVFRFLVRRYAKFVSEDRTRDLESRMDAIERGELDYLKALEDMYAEIKSID --------------3333----------1111------------3333--- >[3-METHYL-2-OXOBUTANOATE ; SWP:Q00972; PDB:1GKZA; VRLTPTMMLYSGRSQDGSHLLKSGRYLQQELPVRIAHRIKGFRSLPFIIGCNPTILHVHE ----------------3333---------------------11113333----------- LYIRAFQKLTDFPPIKDQADEAQYCQLVRQLLDDHKDVVTLLAEGLRESRKHIEDEKLVR --------1111--------------------1111------------------1111-- YFLDKTLTSRLGIRMLATHHLALHEDKPDFVGIICTRLSPKKIIEKWVDFARRLCEHKYG --------------------------2222!!!!-------------------------- NAPRVRINGHVAARFPFIPMPLDYILPELLKNAMRATMESHLDTPYNVPDVVITIANNDV ---------1111-----3333--------------33333333-------------333 DLIIRISDRGGGIAHKDLDRVMDYHFTTAPMHGFGFGLPTSRAYAEYLGGSLQLQSLQGI 3------------1111--1111---------------------------------2222 GTDVYLRLRHIDGREE ---------------- >Protease inhibitors [Prec; SWP:P80060; 
PDB:1GL0I; KCTPGQVKQQDCNTCTCTPTGVWGCTLMGCQP -----------------1111----------- >Protease inhibitors [Prec; SWP:P80060; PDB:1GL1I; ISCEPGKTFKDKCNTCRCGADGKSAACTLKACPN ---2222--------------------------- ------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ >Syntaxin-8; SWP:Q9Z2Q7; PDB:1GL2D; DAGLDALSSIISRQKQMGQEIGNELDEQNEIIDDLANLVENTDEKLRTEARRVTL 3333--------------------------------------------------- >Basement membrane-specifi; SWP:Q05793; PDB:1GL4B; PIMVTVEEQRSQSVRPGADVTFICTAKSKSPAYTLVWTRLHNGKLPSRAMDFNGILTIRN --------------2222---------------------%%%%--1111--iiii----- VQPSDAGTYVCTGSNMFAMDQGTATLHVQ -3333------------------------ >TYROSINE-PROTEIN KINASE T; SWP:P24604; PDB:1GL5A; GSEIVVAMYDFQATEAHDLRLERGQEYIILEKNDLHWWRARDKYGSEGYIPSNYVTGKKS --------------------------------------------------3333------ NNLDQYD ------- >GLUTATHIONE S-TRANSFERASE; SWP:P19157; PDB:1GLQA; PPYTIVYFPVRGRCEAMRMLLADQGQSWKEEVVTIDTWMQGLLKPTCLYGQLPKFEDGDL ----------!!!!-------1111-----------------11111111------!!!! 
TLYQSNAILRHLGRSLGLYGKNQREAAQMDMVNDGVEDLRGKYVTLIYTNYENGKNDYVK --------------------------------------------------3333------ ALPGHLKPFETLLSQNQGGKAFIVGDQISFADYNLLDLLLIHQVLAPGCLDNFPLLSAYV -3333----------%%%%---------3333-------------11111111------- ARLSARPKIKAFLSSPEHVNRPINGNGKQ -----3333-------------------- >PROTEIN (GLUCOCORTICOID R; SWP:P06536; PDB:1GLUA; MKPARPCLVCSDEASGCHYGVLTCGSCKVFFKRAVEGQHNYLCAGRNDCIIDKIRRKNCP ------------------------------------------------------111133 ACRYRKCLQAGMNLEARKTKK 33-----1111---------- >GLUTATHIONE SYNTHASE; SWP:P04425; PDB:1GLV; MIKLGIVMDPIANINIKKDSSFAMLLEAQRRGYELHYMEMGDLYLINGEARAHTRTLNVK ---------3333-3333--3333----1111------3333---iiii----------- QNYEEWFSFVGEQDLPLADLDVILMRKDPPFDTEFIYATYILERAEEKGTLIVNKPQSLR -3333----------1111--------------------------1111------3333- DCNEKLFTAWFSDLTPETLVTRNKAQLKAFWEKHSDIILKPLDASIFRVKEGDPNLGVIA ----3333--3333--------3333-----------------------2222------- ETLTEHGTRYCMAQNYLPAIKDGDKRVLVVDGEPVPYCLARGGGGEPRPLTESDWKIARQ ----%%%%----------3333-------iiii-----------------1111------ IGPTLKEKGLIFVGLDIIGDRLTEINVTSPTCIREIEAEFPVSITGMLMDAIEARLQQQ -------------------------------3333-1111------------------- >RECG; SWP:Q9WY48; PDB:1GM5A; FTSSLFLWGEALPTLLEEFLNEVEKMLKNQVNTRRIHQLLKELDDPLLENKDLEEKLQAF -----3333------3333------3333---------3333--3333-----------3 LDYVKEIPNLPEARKRYRIQKSLEMIEKLRSWFLIDYLECSGEEVDLSTDIQYAKGVGPN 333--------33333333---3333--------------------------------33 RKKKLKKLGIETLRDLLEFFPRDYEDRRKIFKLNDLLPGEKVTTQGKIVSVETKKFQNMN 33-1111-----3333-------------------------------------------- ILTAVLSDGLVHVPLKWFNQDYLQTYLKQLTGKEVFVTGTVKSNAYTGQYEIHNAEVTPK ------------------------------------------------------------ EGEYVRRILPIYRLTSGISQKQMRKIFEENIPSLCCSLKETLPERILEKRKLLGVKDAYY ------------------3333--------------------33333333---------- GMHFPKTFYHLEKARERLAYEELFVLQLAFQKIRKEREKHGGIPKKIEGKLAEEFIKSLP ------3333--------3333----------3333--------------3333-3333- FKLTNAQKRAHQEIRNDMISEKPMNRLLQGDVGSGKTVVAQLAILDNYEAGFQTAFMVPT 
-----------------------------------------------------------3 SILAIQHYRRTVESFSKFNIHVALLIGATTPSEKEKIKSGLRNGQIDVVIGTHALIQEDV 333----------------------------3333------------------------- HFKNLGLVIIDEQHRFEALMNKGKMVDTLVMSATPIPRSMALAFYGDLDVTVIDEMPPGR ------------------------------------------------------------ KEVQTMLVPMDRVNEVYEFVRQEVMRGGQAFIVYPLIKSAVEMYEYLSKEVFKLGLMHGR -----------1111--------1111----------3333---1111------------ LSQEEKDRVMLEFAEGRYDILVSTTVIEVGIDVPRANVMVIENPERFGLAQLHQLRGRVG --------------------------------3333------------------3333-- RGGQEAYCFLVVGDVGEEAMERLRFFTLNTDGFKIAEYDLKTRGPGEKQHGLSGFKVADL -------------------------1111---3333------------------------ YRDLKLLEW ---1111-- >SALIVARY LIPOCALIN; SWP:P81608; PDB:1GM6A; VVTSNFDASKIAGEWYSILLASDAKENIEENGSMRVFVEHIRVLDNSSLAFKFQRKVNGE -------1111------------3333-2222-----------1111---------iiii CTDFYAVCDKVGDGVYTVAYYGENKFRLLEVNYSDYVILHLVDVNGDKTFQLMEFYGRKP -----------2222----------------3333---------!!!!------------ DVEPKLKDKFVEICQQYGIIKENIIDLTKIDRCFQLRG -------------1111--1111--3333---3333-- >HEAT SHOCK PROTEIN 16.9B; SWP:Q41560; PDB:1GMEA; SIVRRSNVFDPFADLWADPFDTFRSIVPAISGGGSETAAFANARMDWKETPEAHVFKADL ---------11111111--------3333------3333----------3333------2 PGVKKEEVKVEVEDGNVLVVSGERTKEKEDKNDKWHRVERSSGKFVRRFRLLEDAKVEEV 2221111------------------------------------------------1111- KAGLENGVLTVTVPKAEVKKPEVKAIQISG ----iiii---------------------- >PROTEIN KINASE C, EPSILON; SWP:P09216; PDB:1GMIA; MVVFNGLLKIKICEAVSLKPTAWSLRDVGPRPQTFLLDPYIALNVDDSRIGQTATKQKTN --------------------3333------------------------------------ SPAWHDEFVTDVCNGRKIELAVFHDAPIGYDDFVANCTIQFEELLQNGSRHFEDWIDLEP ---------------------------------------3333----------------- EGKVYVIIDLSGSSG --------------- ------------------------------------------------------------ ----- >CCT-GAMMA; SWP:P80318; PDB:1GMLA; DSCVLRGVMINKDVTHPRMRRYIKNPRIVLLDSSLEYKDFTRILQMEEEYIHQLCEDIIQ ---------------1111-------------------3333---------------333 
LKPDVVITEKGISDLAQHYLMRANVTAIRRVRKTDNNRIARACGARIVSRPEELREDDVG 3-------------------1111-------------------------3333-3333-- TGAGLLEIKKIGDEYFTFITDCKDPKACTILLRG ----------!!!!---------1111------- >UREE; SWP:P18317; PDB:1GMUA; MLYLTQRLEIPAAATASVTLPIDVRVKSRVKVTLNDGRDAGLLLPRGLLLRGGDVLSNEE --------------------3333---------1111-------------2222---111 GTEFVQVIAADEEVSVVRCDDPFMLAKACYALGNRHVPLQIMPGELRYHHDHVLDDMLRQ 1-------------------------------1111-----2222-------------11 FGLTVTFGQLPFEPEAGA 11---------------- >GLPE PROTEIN; SWP:P09390; PDB:1GMXA; MDQFECINVADAHQKLQEKEAVLVDIRDPQSFAMGHAVQAFHLTNDTLGAFMRDNDFDTP ------------------------------------2222---3333--------1111- VMVMYHGNSSKGAAQYLLQQGYDVVYSIDGGFEAWQRQFPAEVAYGA -----------------1111------2222-------1111----- >CATHEPSIN B; SWP:P07858; PDB:1GMYA; KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL ------3333-11113333---------3333--------------iiii---------- LTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPC ----3333-!!!!--------------------2222----------------------- TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSD -------------2222--3333------------------------------------- FLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDH 1111-------------------------iiii---------1111-iiii-------22 CGIESEVVAGIPRT 22------------ >PHOSPHOLIPASE A2; SWP:NA; PDB:1GMZA; DLWQFGKMILKETGKLPFPYYVTYGCYCGVGGRGGPKDATDRCCFVHDCCYGKLTSCKPK --------------------------1111-------3333------------1111111 TDRYSYSRKDGTIVCGEDPCRKEICECDKAAAVCFRENLDTYNKKYMSYLKSLCKKDDC 1----------------3333----------------3333-3333---3333------ >TRANSCRIPTION FACTOR GATA; SWP:P17679; PDB:1GNF; GSEARECVNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIR --------------------1111----3333-------------- >GLNK; SWP:P77118; PDB:1GNKA; MKLVTVIIKPFKLEDVREALSSIGIQGLTVTEVKGFGRVNFLPKVKIDVAIADDQLDEVI --------1111--------1111---------------------------3333----- DIVSKAAYTGKIGDGKIFVAELQRVIRIRTGEADEAAL ---------------------------1111-!!!!-- >HYBRID CLUSTER PROTEIN; 
SWP:Q01770; PDB:1GNLA; SNAMFCYQCQETVGNKGCTQVGVCGKKPETAALQDALIYVTKGLGQIATRLRAEGKAVDH --------1111%%%%-----1111--------------------------1111----- RIDRLVTGNLFATITNANFDDDILAERVRMTCAAKKELAASLTDKSGLSDAALWEASEKS -------------2222---------------------1111--22223333-------- AMLAKAGTVGVMATTDDDVRSLRWLITFGLKGMAAYAKHADVLGKHENSLDAFMQEALAK -----11111111---------------------------1111--3333---------- TLDDSLSVADLVALTLETGKFGVSAMALLDAANTGTYGHPEITKVNIGVGSNPGILISGH --1111------------------------------------------------------ DLRDLEMLLKQTEGTGVDVYTHSEMLPAHYYPAFKKYAHFKGNYGNAWWKQKEEFESFNG -----------2222------!!!!3333-3333--1111------3333-----3333- PVLLTTNCLVPPKDSYKDRVYTTGIVGFTGCKHIPGEIGEHKDFSAIIAHAKTCPAPTEI ------------33331111--!!!!-2222-------------------1111------ ESGEIIGGFAHNQVLALADKVIDAVKSGAIKKFVVMAGCDGRAKSRSYYTDFAEGLPKDT ------------------------------------------3333-------------- VILTAGCAKYRYNKLNLGDIGGIPRVLDAGQCNDSYSLAVIALKLKEVFGLEDVNDLPIV -------------------iiii-------3333------------------1111---- YNIAWYEQKAVIVLLALLSLGVKNIHLGPTLPAFLSPNVAKVLVEQFNIGGITSPQDDLK -----------------1111----------1111------------------------3 AFF 333 >HYBRID CLUSTER PROTEIN; SWP:P31101; PDB:1GNTA; MFCFQCQETAKNTGCTVKGMCGKPEETANLQDLLIFVLRGIAIYGEKLKELGQPDRSNDD -----1111%%%%-----1111--------------------------1111---1111- FVLQGLFATITNANWDDARFEAMISEGLARRDKLRNAFLAVYKAKNGKDFSEPLPEAATW ---------2222--3333-----------------------------------3333-- TGDSTAFAEKAKSVGILATENEDVRSLRELLIIGLKGVAAYAEHAAVLGFRKTEIDEFML --3333----11111111---------------------------1111--3333----- EALASTTKDLSVDEMVALVMKAGGMAVTTMALLDEANTTTYGNPEITQVNIGVGKNPGIL --3333------------------------------------------------------ ISGHDLKDMAELLKQTEGTGVDVYTHGEMLPANYYPAFKKYPHFVGNYGGSWWQQNPEFE ---------------2222------!!!!3333-3333--1111------1111------ SFNGPILLTTNCLVPLKKENTYLDRLYTTGVVGYEGAKHIADRPAGGAKDFSALIAQAKK ----------------1111-3333---!!!!-2222------2222---3333---111 CPPPVEIETGSIVGGFAHHQVLALADKVVEAVKSGAIKRFVVMAGCDGRQKSRSYYTEVA 
1----------------------------------------------------------- ENLPKDTVILTAGCAKYRYNKLNLGDIGGIPRVLDAGQCNDSYSLAVIALKLKEVFGLDD ---1111-----3333----------iiii-------3333------------------3 INDLPVSYDIAWYEQKAVAVLLALLFLGVKGIRLGPTLPAFLSPNVAKVLVENFNIKPIG 333-----------------------------------1111------------------ TVQDDIAAMMAGK --------1111- >GABARAP; SWP:O95166; PDB:1GNUA; MKFVYKEEHPFEKRRSEGEKIRKKYPDRVPVIVEKAPKARIGDLDKKKYLVPSDLTVGQF --3333------------------1111-------1111------------1111----- YFLIRKRIHLRAEDALFFFVNNVIPPTSATMGQLYQEHHEEDFFLYIAYSDESVYGL ----------1111-----%%%%--1111----------1111-------------- >GLUTATHIONE S-TRANSFERASE; SWP:P46422; PDB:1GNWA; GIKVFGHPASIATRRVLIALHEKNLDFELVHVELKDGEHKKEPFLSRNPFGQVPAFEDGD ------1111----------------------33333333--3333-1111------!!! LKLFESRAITQYIAHRYENQGTNLLQTDSKNISQYAIMAIGMQVEDHQFDPVASKLAFEQ !---------------1111---------------------------------------- IFKSIYGLTTDEAVVAEEEAKLAKVLDVYEARLKEFKYLAGETFTLTDLHHIPAIQYLLG ---1111------------------------------1111---------------3333 TPTKKLFTERPRVNEWVAEITKRPASEKVQ --333333333333-----11113333--- >BETA-GLUCOSIDASE; SWP:Q59976; PDB:1GNXA; ALTFPEGFLWGSATASYQIEGAAAEDGRTPSIWDTYARTPGRVRNGDTGDVATDHYHRWR ----2222------3333------iiii--3333----22222222----!!!!------ EDVALMAELGLGAYRFSLAWPRIQPTGRGPALQKGLDFYRRLADELLAKGIQPVATLYHW ------------------3333-2222-----3333----------1111---------- DLPQELENAGGWPERATAERFAEYAAIAADALGDRVKTWTTLNEPWCSAFLGYGSGVHAP --3333111133333333------------------------------------------ GRTDPVAALRAAHHLNLGHGLAVQALRDRLPADAQCSVTLNIHHVRPLTDSDADADAVRR ------------------------------1111-------------------------- IDALANRVFTGPMLQGAYPEDLVKDTAGLTDWSFVRDGDLRLAHQKLDFLGVNYYSPTLV -----------1111--------1111----11112222--------------------- SAHSPWPGADRVAFHQPPGETTAMGWAVDPSGLYELLRRLSSDFPALPLVITENGAAFHD -----2222------------1111------------------1111------------- YADPEGNVNDPERIAYVRDHLAAVHRAIKDGSDVRGYFLWSLLDNFEWAHGYSKRFGAVY --1111---------------------1111---------------!!!!---------- 
VDYPTGTRIPKASARWYAEVARTGVLPT --1111---------------------- >XYLANASE 10C; SWP:Q59675; PDB:1GNYA; GNVVIEVDMANGWRGNASGSTSHSGITYSADGVTFAALGDGVGAVFDIARPTTLEDAVIA ----------------------------1111------2222-----------2222--- MVVNVSAEFKASEANLQIFAQLKEDWSKGEWDCLAGSSELTADTDLTLTCTIDEDDDKFN ---------------------2222----------3333--------------1111--- QTARDVQVGIQAKGTPAGTITIKSVTITLAQEA --------------------------------- >DNA-directed RNA polymera; SWP:Q57840; PDB:1GO3E; MYKILEIADVVKVPPEEFGKDLKETVKKILMEKYEGRLDKDVGFVLSIVDVKDIGEGKVV -------------3333----------------2222----------------------2 HGDGSAYHPVVFETLVYIPEMYELIEGEVVDVVEFGSFVRLGPLDGLIHVSQIMDDYVSY 222----------------2222---------1111------------1111-------- DPKAIIGKETGKVLEIGDYVRARIVAISLKASKIALTMRQPYLGKLEWIEEEKAKKQ --------------2222--------------------------3333--------- >Uncharacterized protein M; SWP:Q60351; PDB:1GO3F; MIGKKILGERYVTVSEAAEIMYNRAQIGELSYEQGCALDYLQKFAKLDKEEAKKLVEELI -----------------------1111--------------------------------1 SLGIDEKTAVKIADILPEDLDDLRAIYYKRELPENAEEILEIVRKYI 111-----------------------2222--1111----------- >MAD1 (MITOTIC ARREST DEFI; SWP:AAH09964; PDB:1GO4E; FSREEADTLRLKVEELEGERSRLEEEKRMLEAQLERRALQGDYDQSRTKVLHMSLNPTSV ---3333--------------------------3333------1111------------- ARQRLREDHSQLQAECERLRGLLRAME --------------------------- --------------------------------------- >PHOSPHOLIPASE A2; SWP:P81165; PDB:1GODA; SMYQLWKMILQETGKNAVPSYGLYGCNCGVGSRGKPKDATDRCCFVHKCCYKKLTDCSPK 3333------------3333---------------------------------------- TDSYSYSWKDKTIVCGDNNPCLQEMCECDKAVAICLRENLDTYNKNYKIYPKPLCKKADA --------------------------------------1111-1111----1111----- C - >CHITINASE B; SWP:Q54276; PDB:1GOIA; TRKAVIGYYFIPTNQINNYTETDTSVVPFPVSNITPAKAKQLTHINFSFLDINSNLECAW -----------3333-------3333---3333-----1111----------1111---- DPATNDAKARDVVNRLTALKAHNPSLRIMFSIGGWYYSNDLGVSHANYVNAVKTPASRAK 1111------------------1111--------33331111-------1111------- FAQSCVRIMKDYGFDGVNIDWEYPQAAEVDGFIAALQEIRTLLNQQTITDGRQALPYQLT 
------------------------3333-----------------------3333----- IAGAGGAFFLSRYYSKLAQIVAPLDYINLMTYDLAGPWEKVTNHQAALFGDAAGPTFYNA -----3333---3333----1111-----------1111-----------1111------ LREANLGWSWEELTRAFPSPFSLTVDAAVQQHLMMEGVPSAKIVMGVPFYGRAFKGVSGG 1111------------------------------22223333------------------ NGGQYSSHSTPGEDPYPSTDYWLVGCEECVRDKDPRIASYRQLEQMLQGNYGYQRLWNDK iiii------------------2222---------------------------------- TKTPYLYHAQNGLFVTYDDAESFKYKAKYIKQQQLGGVMFWHLGQDNRNGDLLAALDRYF --------1111------3333-------------------1111-1111---------- NAADYDDSQLDMGTGLRYTGVGPGNLPIMTAPAYVPGTTYAQGALVSYQGYVWQTKWGYI -1111-1111-----------1111---------2222--2222---iiii--------- TSAPGSDSAWLKVGRV --22223333------ >KINESIN HEAVY CHAIN; SWP:P48467; PDB:1GOJA; SSSANSIKVVARFRPQNRVEIESGGQPIVTFQGPDTCTVDSKEAQGSFTFDRVFDMSCKQ ----------------3333--------------------1111----------111133 SDIFDFSIKPTVDDILNGYNGTVFAYGQTGAGKSYTMMGTSIDDPDGRGVIPRIVEQIFT 33------------1111---------2222-3333----11111111------------ SILSSAANIEYTVRVSYMEIYMERIRDLLAPQNDNLPVHEEKNRGVYVKGLLEIYVSSVQ 3333-3333-----------%%%%--1111----------1111---2222--------- EVYEVMRRGGNARAVAATNMNQESSRSHSIFVITITQKNVETGSAKSGQLFLVDLAGSEK ----------------------1111---------------------------------- VGKTGASGQTLEEAKKINKSLSALGMVINALTDGKSSHVPYRDSKLTRILQESLGGNSRT --------------3333---------------------1111------3333------- TLIINCSPSSYNDAETLSTLRFGMRAKSIKNKAKVNAELSPAELKQMLAKAKTQ --------3333-------------1111------------------------- >GT-ALPHA/GI-ALPHA CHIMERA; SWP:P04695; PDB:1GOTA; SAEEKHSRELEKKLKEDAEKDARTVKLLLLGAGESGKSTIVKQKIIHQDGYSLEECLEFI --------------------1111-------2222------------------------- AIIYGNTLQSILAIVRATTLNIQYGDSARQDDARKLHADTIEEGTPKESDIIQRLWKDSG -----------------1111----3333------------2222--------------- IQACFDRASEYQLNDSAGYYLSDLERLVTPGYVPTEQDVLRSRVKTTGIIETQFSFKDLN ------3333---1111------3333-2222------1111-------------%%%%- FRFDVGGQRSERKKWIHCFEGVTAIIFCVALSDYDLVLAEDEENRHESKLFDSICNNKWF 
----------3333--1111---------1111----3333------------1111--- TDTSIILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAGNYIKVQFLELNRRDVKEIY ----------------3333--3333-1111----------------3333--------- SHTCATDTQNVKFVFDAVTDIIIKEN ---1111------------------- >GT-ALPHA/GI-ALPHA CHIMERA; SWP:P04697; PDB:1GOTB; SELDQLRQEAEQLKNQIRDARKACADATLSQITNNIDPVGRIQMRTRRTLRGHLAKIYAM ---------------------1111-------1111------------------------ HWGTDSRLLLSASQDGKLIIWDSYTTNKVHAIPLRSSWVMTCAYAPSGNYVACGGLDNIC --1111--------------------------------------3333------1111-- SIYNLKTREGNVRVSRELAGHTGYLSCCRFLDDNQIVTSSGDTTCALWDIETGQQTTTFT -------------------------------1111--------------1111------- GHTGDVMSLSLAPDTRLFVSGACDASAKLWDVREGMCRQTFTGHESDINAICFFPNGNAF -----------1111--------------------------------------1111--- ATGSDDATCRLFDLRADQELMTYSHDNIICGITSVSFSKSGRLLLAGYDDFNCNVWDALK -------------1111-------1111---------3333------------------- ADRAGVLAGHDNRVSCLGVTDDGMAVATGSWDSFLKIWN -------------------1111---------------- >Guanine nucleotide-bindin; SWP:P02698; PDB:1GOTG; LTEKDKLKMEVDQLKKEVTLERMLVSKCCEEFRDYVEERSGEDPLVKGIPEDKNPFKE ----------------1111------------------33331111---33331111- >RIBONUCLEASE; SWP:P00649; PDB:1GOUA; AVINTFDGVADYLIRYKRLPNDYITKSQASALGWVASKGDLAEVAPGKSIGGDVFSNREG -------------------1111------1111-3333-3333-2222--------1111 RLPSAGSRTWREADINYVSGFRNADRLVYSSDWLIYKTTDHYATFTRIR ----2222---------------------1111------iiii------ >(S)-2-HYDROXY-ACID OXIDAS; SWP:P05414; PDB:1GOX; MEITNVNEYEAIAKQKLPKMVYDYYASGAEDQWTLAENRNAFSRILFRPRILIDVTNIDM ----3333---------3333-------!!!!-------3333----------------- TTTILGFKISMPIMIAPTAMQKMAHPEGEYATARAASAAGTIMTLSSWATSSVEEVASTG ---iiii------------3333-3333-----------------1111-------3333 PGIRFFQLYVYKDRNVVAQLVRRAERAGFKAIALTVDTPRLGRREADIKNRFVLPPFLTL -------------------------------------------3333-------1111-3 KNFEGIDLGLSSYVAGQIDRSLSWKDVAWLQTITSLPILVKGVITAEDARLAVQHGAAGI 333-------------------3333--3333---------------------------- 
IVSNHGARQLDYVPATIMALEEVVKAAQGRIPVFLDGGVRRGTDVFKALALGAAGVFIGR ---%%%%-------3333--------iiii----------------------------33 PVVFSLAAEGEAGVKKVLQMMRDEFELTMALSGCRSLKEISRSHIAADWD 33---------------------------1111--3333-1111--1111 >CATION-INDEPENDENT MANNOS; SWP:P11717; PDB:1GP0A; DCQVTNPSTGHLFDLSSLSGRAGFTAAYSEKGLVYMSICGENENCPPGVGACFGQTRISV --------------3333-3333------------------11112222----------- GKANKRLRYVDQVLQLVYKDGSPCPSKSGLSYKSVISFVCRPEAGPTNRPMLISLDKQTC ---------iiii----------1111--------------------------------- TLFFSWHTPLACE -------3333-- >GLUTATHIONE PEROXIDASE; SWP:P00435; PDB:1GP1A; RTVYAFSARPLAGGEPFNLSSLRGKVLLIENVASLGTTVRDYTQMNDLQRRLGPRGLVVL -1111----3333----33332222---------------------------1111---- GFPCNQFGHQENAKNEEILNCLKYVRPGGGFEPNFMLFEKCEVNGEKAHPLFAFLREVLP ------%%%%---3333----------iiii-------------1111------------ TPSDDATALMTDPKFITWSPVCRNDVSWNFEKFLVGPDGVPVRRYSRRFLTIDIEPDIET -1111------3333------1111----------1111------11113333------- LLSQ 1111 >G PROTEIN GI ALPHA 1; SWP:P16874; PDB:1GP2G; SIAQARKLVEQLKMEANIDRIKVSKAAADLMAYCEAHAKEDPLLTPVPASENPF 3333-------3333------3333-----------33331111---------- >LEUCOANTHOCYANIDIN DIOXYG; SWP:Q96323; PDB:1GP6A; VAVERVESLAKSGIISIPKEYIRPKEELESINDVFLEEKKEDGPQVPTIDLKNIESDDEK --------1111-----3333--33331111-----3333----------1111------ IRENCIEELKKASLDWGVMHLINHGIPADLMERVKKAGEEFFSLSVEEKEKYANDQATGK ----------------------------------------11113333-1111--1111- IQGYGSKLANNASGQLEWEDYFFHLAYPEEKRDLSIWPKTPSDYIEATSEYAKCLRLLAT ----------3333-------------3333-3333----1111---------------- KVFKALSVGLGLEPDRLEKEVGGLEELLLQMKINYYPKCPQPELALGVEAHTDVSALTFI -------1111-1111------------------------1111---------------- LHNMVPGLQLFYEGKWVTAKCVPDSIVMHIGDTLEILSNGKYKSILHRGLVNKEKVRISW -----------iiii------2222----------1111--------------------- AVFCEPPKDKIVLKPLPEMVSVESPAKFPPRTFAQHIEHKLFGKEQEEL ---------------3333-1111------------------------- >PHOSPHOLIPASE A2; SWP:P80966; PDB:1GP7A; HLIQFGNMIQCTVPGFLSWIKYADYGCYCGAGGSGTPVDKLDRCCQVHDNCYTQAQKLPA 
3333-------------3333-----------------3333---------------333 CSSIMDSPYVKIYSYDCSERTVTCKADNDECAAFICNCDRVAAHCFAASPYNNNNYNIDT 3--1111----------%%%%---1111-----------------1111--3333----- TTRC ---- ---------------------------------------- >CORE GP32; SWP:P03695; PDB:1GPC; GFSSEDKGEWKLKLDNAGNGQAVIRFLPSKNDEQAPFAILVNHGFKKNGKWYIETCSSTH ---3333-------1111----------------------------iiii-----3333- GDYDSCPVCQYISKNDLYNTDNKEYSLVKRKTSYWANILVVKDPAAPENEGKVFKYRFGK --1111------1111--------------------------11111111--------33 KIWDKINAMIAVDVEMGETPVDVTCPWEGANFVLKVKQVSGFSNYDESKFLNQSAIPNID 33----3333--1111-----1111-------------iiii--1111-------2222- DESFQKELFEQMVDLSEMTSKDKFKSFEELNTKFGQVM --------1111--3333-------------------- >GLUCOSE OXIDASE; SWP:P81156; PDB:1GPEA; YLPAQQIDVQSSLLSDPSKVAGKTYDYIIAGGGLTGLTVAAKLTENPKIKVLVIEKGFYE -------3333----33332222----------------------1111----------1 SNDGAIIEDPNAYGQIFGTTVDQNYLTVPLINNRTNNIKAGKGLGGSTLINGDSWTRPDK 1113333-1111-3333-1111-------1111---------22223333--------33 VQIDSWEKVFGMEGWNWDNMFEYMKKAEAARTPTAAQLAAGHSFNATCHGTNGTVQSGAR 33---------2222-----------------------------3333------------ DNGQPWSPIMKALMNTVSALGVPVQQDFLCGHPRGVSMIMNNLDENQVRVDAARAWLLPN -----------------1111----------------------1111---3333-----1 YQRSNLEILTGQMVGKVLFKQTASGPQAVGVNFGTNKAVNFDVFAKHEVLLAAGSAISPL 111------------------1111----------1111--------------1111--- ILEYSGIGLKSVLDQANVTQLLDLPVGINMQDQTTTTVSSRASSAGAGQGQAVFFANFTE ------------------------------------------3333----------3333 TFGDYAPQARDLLNTKLDQWAEETVARGGFHNVTALKVQYENYRNWLLDEDVAFAELFMD -!!!!-------------------1111-------------------------------- TEGKINFDLWDLIPFTRGSVHILSSDPYLWQFANDPKFFLNEFDLLGQAAASKLARDLTS iiii---------------------1111-------2222----------------3333 QGAMKEYFAGETLPGYNLVQNATLSQWSDYVLQNFRPNWHAVSSCSMMSRELGGVVDATA !!!!1111-----!!!!-1111---------------------------1111---1111 KVYGTQGLRVIDGSIPPTQVSSHVMTIFYGMALKVADAILDDYAKSA -2222------3333----------------------------1111 >EXOGLUCANASE I; SWP:Q09431; PDB:1GPIA; 
QAGTNTAENHPQLQSQQCTTSGGCKPLSTKVVLDSNWRWVHSTSGYTNCYTGNEWDTSLC ---------------------------------3333----1111--------------- PDGKTCAANCALDGADYSGTYGITSTGTALTLKFVTGSNVGSRVYLMADDTHYQLLKLLN ---------------3333----------------!!!!--------------------- QEFTFDVDMSNLPCGLNGALYLSAMDADGGMSKYPGNKAGAKYGTGYCDSQCPKDIKFIN ------------2222---------111133331111--3333-----1111------ii GEANVGNWTETGSNTGTGSYGTCCSEMDIWEANNDAAAFTPHPCTTTGQTRCSGDDCARN ii---------1111-------------------------------------!!!!---- TGLCDGDGCDFNSFRMGDKTFLGKGMTVDTSKPFTVVTQFLTNDNTSTGTLSEIRRIYIQ ----3333---3333-------2222--1111---------11111111----------i NGKVIQNSVANIPGVDPVNSITDNFCAQQKTAFGDTNWFAQKGGLKQMGEALGNGMVLAL iii--------2222--------------------------------------------- SIWDDHAANMLWLDSDYPTDKDPSAPGVARGTCATTSGVPSDVESQVPNSQVVFSNIKFG ---------3333----11111111--------1111-3333----1111---------- DIGSTFSGTS 2222------ >GLUTAMYL-TRNA REDUCTASE; SWP:Q9UXR8; PDB:1GPJA; MEDLVSVGITHKEAEVEELEKARFESDEAVRDIVESFGLSGSVLLQTSNRVEVYASGARD ---------3333------------1111-----------------1111-------111 RAEELGDLIHDDAWVKRGSEAVRHLFRVASGLESMMVGEQEILRQVKKAYDRAARLGTLD 1--------1111----------------------2222--------------------- EALKIVFRRAINLGKRAREETRISEGAVSIGSAAVELAERELGSLHDKTVLVVGAGEMGK ---------------------1111-------------------1111------------ TVAKSLVDRGVRAVLVANRTYERAVELARDLGGEAVRFDELVDHLARSDVVVSATAAPHP ------------------------------------1111----1111------------ VIHVDDVREALRKRDRRSPILIIDIANPRDVEEGVENIEDVEVRTIDDLRVIARENLERR -------------------------------2222--2222------------------- RKEIPKVEKLIEEELSTVEEELEKLKERRLVADVAKSLHEIKDRELERALRRLKTVLQDF ----------------------------------------------3333--------33 AEAYTKRLINVLTSAIMELPDEYRRAASRALRRASELNG 33------------------------------------- >RP2 LIPASE; SWP:P16233; PDB:1GPL; AEVCYSHLGCFSDEKPWAGTSQRPIKSLPSDPKKINTRFLLYTNENQNSYQLITATDIAT -------------------3333-------3333--------1111-------1111--- IKASNFNLNRKTRFIIHGFTDSGENSWLSDMCKNMFQVEKVNCICVDWKGGSKAQYSQAS 
------1111-------22221111---------3333---------3333---3333-- QNIRVVGAEVAYLVQVLSTSLNYAPENVHIIGHSLGAHTAGEAGKRLNGLVGRITGLDPA -----------------------3333-------------------%%%%---------- EPYFQDTPEEVRLDPSDAKFVDVIHTDISPILPSLGFGMSQKVGHMDFFPNGGKDMPGCK ---22223333-------------------------------------2222---2222- TGISCNHHRSIEYYHSSILNPEGFLGYPCASYDEFQESGCFPCPAKGCPKMGHFADQYPG -11111111----------3333-----------1111-----1111----1111----1 KTNAVEQTFFLNTGASDNFTRWRYKVSVTLSGKKVTGHILVSLFGNKGNSKQYEIFKGTL 111-----------------------------------------3333------------ KPDSTHSNEFDSDVDVGDLQMVKFIWYNNVINPTLPRVGASKIIVETNVGKQFNFCSPET 2222---------------------------1111-----------1111---------- VREEVLLTLTPC ------------ >GMP SYNTHETASE; SWP:P04079; PDB:1GPMA; ENIHKHRILILDFGSQYTQLVARRVRELGVYCELWAWDVTEAQIRDFNPSGIILSGGPES -1111---------1111-----------------------------------------3 TTEENSPRAPQYVFEAGVPVFGVCYGMQTMAMQLGGHVEASNEREFGYAQVEVVNDSALV 333------3333------------------1111---------------------3333 RGIEDALTADGKPLLDVWMSHGDKVTAIPSDFITVASTESCPFAIMANEEKRFYGVQFHP -------1111-----------------1111----------------1111------11 EVTHTRQGMRMLERFVRDICQCEALWTPAKIIDDAVARIREQVGDDKVILGLSGGVDSSV 11---------------1111---------------------!!!!-------------- TAMLLHRAIGKNLTCVFVDNGLLRLNEAEQVLDMFGDHFGLNIVHVPAEDRFLSALAGEN --------!!!!-----------2222----------------------------2222- DPEAKRKIIGRVFVEVFDEEALKLEDVKWLAQGTIYPDVIESAAKMGLVEPLKELFKDEV --------------------1111----------3333-----------1111--3333- RKIGLELGLPYDMLYRHPFPGPGLGVRVLGEVKKEYCDLLRRADAIFIEELRKADLYDKV ---------3333------1111----------------------------11111111- SQAFTVFLPVRSVGVMGDGRKYDWVVSLRAVETIDFMTAHWAHLPYDFLGRVSNRIINEV ------------------------------------------------------------ NGISRVVYDISGKPPATIEWE --------------------- >ANTIBODY M41; SWP:NA; PDB:1GPOH; EVKLQESGPSLVKPSQTLSLTCSVTGDSITSDFWSWIRQFPGNRLEYMGFVQYSGETAYN ---------------------------1111--------------------1111----3 PSLKSRISITRDTSKNQYYLDLNSVTTEDTAVYYCANWHGDYWGQGTTVTVSSAKTTPPS 
333---------1111---------3333-------1111-------------------- VYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSS ---------------------------------%%%%--1111-------%%%%------ VTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD ---1111------------1111----------- >ENDONUCLEASE PI-SCEI; SWP:P17255; PDB:1GPPA; CFAKGTNVLMADGSIECIENIEVGNKVMGKDGRPREVIKLPRGSETMYSVVQKSMPELLK --2222---1111---3333-2222---1111---------------------------- FTCNATHELVVRTPRSVRRLSRTIKGVEYFEVITFEMGQKKAPDGRIVELVKEVSKSYPV ---1111----------------iiii--------------1111-------------33 SEGPERANELVESYRKASNKAYFEWTIEARDLSLLGSHVRKATYQTYAPIGAAFARECRG 33-------------------------33333333------------------------- FYFELQELKEDDYYGTLSDDSDHQFLLANQVVVH --------------------------1111---- >GLUCOSE PERMEASE; SWP:P20166; PDB:1GPR; EPLQNEIGEEVFVSPITGEIHPITDVPDQVFSGKMMGDGFAILPSEGIVVSPVRGKILNV ----1111-------------3333--3333--1111----------------------- FPTKHAIGLQSDGGREILIHFGIDTVSLKGEGFTSFVSEGDRVEPGQKLLEVDLDAVKPN 1111------1111----------3333-2222----2222--2222-----33333333 VPSLMTPIVFTNLAEGETVSIKASGSVNREQEDIVKIE -------------2222----------2222------- ----------------------------------------------- >GAMMA-1-H THIONIN; SWP:P20230; PDB:1GPT; RICRRRSAGFKGPCVSNKNCAQVCMQEGWGGGNCDGPLRRCKCMRRC ------2222--------------3333------!!!!--------- >TRANSKETOLASE; SWP:P23254; PDB:1GPUA; QFTDIDKLAVSTIRILAVDTVSKANSGHPGAPLGMAPAAHVLWSQMRMNPTNPDWINRDR --3333------------------------------------1111--1111--1111-- FVLSNGHAVALLYSMLHLTGYDLSIEDLKQFRQLGSRTPGHPEFELPGVEVTTGPLGQGI ----------------1111---3333--2222---------3333--------2222-- SNAVGMAMAQANLAATYNKPGFTLSDNYTYVFLGDGCLQEGISSEASSLAGHLKLGNLIA ------------------2222-----------3333-----------------1111-- IYDDNKITIDGATSISFDEDVAKRYEAYGWEVLYVENGNEDLAGIAKAIAQAKLSKDKPT -------11113333----------1111-------3333--------------1111-- LIKMTTTIGYGSLHAGSHSVHGAPLKADDVKQLKSKFGFNPDKSFVVPQEVYDHYQKTIL ------2222-1111-3333--------------1111-1111----3333--------- KPGVEANNKWNKLFSEYQKKFPELGAELARRLSGQLPANWESKLPTYTAKDSAVATRKLS 
------------------------------1111--22221111---3333--------- ETVLEDVYNQLPELIGGSADLTPSNLTRWKEALDFQPPSSGSGNYSGRYIRYGIREHAMG -------3333---------3333------------1111---1111------------- AIMNGISAFGANYKPYGGTFLNFVSYAAGAVRLSALSGHPVIWVATHDSIGVGEDGPTHQ ---------%%%%------3333-------------------------333333331111 PIETLAHFRSLPNIQVWRPADGNEVSAAYKNSLESKHTPSIIALSRQNLPQLEGSSIESA -------1111--------------------------------------------3333- SKGGYVLQDVANPDIILVATGSEVSLSVEAAKTLAAKNIKARVVSLPDFFTFDKQPLEYR -------------------!!!!-----------1111-------------3333----- LSVLPDNVPIMSVEVLATTCWGKYAHQSFGIDRFGASGKAPEVFKFFGFTPEGVAERAQK -----------------1111-----------------------1111------------ TIAFYKGDKLISPLKKAF ----2222---3333--- >COMPLEMENT C1R COMPONENT; SWP:P00736; PDB:1GPZA; IKCPQPKTLDEFTIIQNLQPQYQFRDYFIATCKQGYQLIEGNQVLHSFTAVCQDDGTWHR --------------------------------2222----------------1111---- AMPRCKIKDCGQPRNLPNGDFRYTTTMGVNTYKARIQYYCHEPYYKMQTEQGVYTCTAQG ---------------2222------2222-2222----------------------1111 IWKNEQKGEKIPRCLPVCGKPVNPVEQRQQIIGGQKAKMGNFPWQVFTNIHGRGGGALLG -------------------------------------22221111--------------- DRWILTAAHTLYPKEHASLDVFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDI ------3333-------------------------------------------------- ALLELENSVTLGPNLLPICLPDNDTFYDLGLMGYVSGFGIAHDLRFVRLPVANPQACENS -----------1111------------2222---------------------3333---- QNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSRGYGFYTKVLNYV ------------1111--------------------------2222--------3333-- DWIKKEME -------- >MALIC ENZYME; SWP:P40927; PDB:1GQ2A; KKGYEVLRDPHLNKGAFTLEERQQLNIHGLLPPCFLGQDAQVYSILKNFERLTSDLDRYI ---3333-3333------------------------------------3333-------- LLSLQDRNEKLFYKVLTSDIERFPIVYTPTVGLACQHYGLAFRRPRGLFITIHDRGHIAT --3333---------3333---------3333------3333--------1111--3333 LQSWPESVIKAIVVTDGERILGLGDLGCYGGIPVGKLALYTACGGVKPHQCLPVLDVGTD 3333---------------!!!!--!!!!-----------------3333---------- NETLLKDPLYIGLRHKRIRGQAYDDLLDEFEAVTSRYGNCLIQFEDFANANAFRLLHKYR 
----------------------------------------------------------11 NKYCTFNDDIQGTASVAVAGLLAALRITKNRLSDHTVLFQGAGEAALGIANLIVAQKEGV 11----3333-------------1111---1111--------3333---------3333- SKEEAIKRIWVDSKGLIVKGRASLTPEKEHFAHEHCEKNLEDIVKDIKPTVLIGVAAIGG -----1111--1111--2222---33331111---------------------------- AFTQQILQDAAFNKRPIIFALSNPTSKAECTAEQLYKYTEGRGIFASGSPFDPVTLPSGQ -----------------------3333---3333----iiii-------------3333- TLYPGQGNNSYVFPGVALGVISCGLKHIGDDVFLTTAEVIAQEVSEENLQEGRLYPPLVT -------3333-----------------------------1111----1111----3333 IQQVSLKIAVRIAKEAYRNNTASTYPQPEDLEAFIRSQVYSTDYNCFVADSYTWPEEAKV ----------------------------------1111---------------------- K - >PROCLAVAMINATE AMIDINO HY; SWP:P37819; PDB:1GQ6A; SPRYAQIPTFMRLPHDPQPRGYDVVVIGAPYDGGTSYRPGARFGPQAIRSESGLIHGVGI -1111---2222-------------------1111----3333-------3333------ DRGPGTFDLINCVDAGDINLTPFDMNIAIDTAQSHLSGLLKANAAFLMIGGDHSLTVAAL ----1111-------------------------------------------3333----- RAVAEQHGPLAVVHLDAHSDTNPAFYGGRYHHGTPFRHGIDEKLIDPAAMVQIGIRGHLD ------------------------2222--1111-----------1111----------- YARGHGVRVVTADEFGELGVGGTADLIREKVGQRPVYVSVDIDVVDPAFAPGTGTPAPGG --1111------------------------!!!!------1111-3333----------- LLSREVLALLRCVGDLKPVGFDVMEVSPLYDHGGITSILATEIGAELLYQYARAH -3333-----3333------------3333-%%%%-------------------- >PECTIN METHYLESTERASE; SWP:P83218; PDB:1GQ8A; SSTVGPNVVVAADGSGDYKTVSEAVAAAPEDSKTRYVIRIKAGVYRENVDVPKKKKNIMF ----------1111-----3333-3333-----------------------1111----- LGDGRTSTIITASKNVQDGSTTFNSATVAAVGAGFLARDITFQNTAGAAKHQAVALRVGS ---3333-------3333--3333----------------------1111---------- DLSAFYRCDILAYQDSLYVHSNRQFFINCFIAGTVDFIFGNAAVVLQDCDIHARRPGSGQ --------------------------------------------------------2222 KNMVTAQGRTDPNQNTGIVIQKSRIGATSDLQPVQSSFPTYLGRPWKEYSRTVVMQSSIT ----------1111----------------3333-------------------------- NVINPAGWFPWDGNFALDTLYYGEYQNTGAGAATSGRVTWKGFKVITSSTEAQGFTPGSF ---3333----!!!!1111---------1111-1111--1111----------------- 
IAGGSWLKATTFPFSLGL -33333333--------- >CYTOCHROME C'; SWP:P00148; PDB:1GQAA; ADAEHVVEARKGYFSLVALEFGPLAAMAKGEMPYDAAAAKAHASDLVTLTKYDPSDLYAP -----------------------------------------------3333--3333-22 GTSADDVKGTAAKAAIWQDADGFQAKGMAFFEAVAALEPAAGAGQKELAAAVGKVGGTCK 223333------3333---------------------3333------------------- SCHDDFRVKR ---------- >RELEASE FACTOR 2; SWP:P07012; PDB:1GQEA; INPVNNRIQDLTERSDVLRGYLDYDAKKERLEEVNAELEQPDVWNEPERAQALGKERSSL -3333--------------1111------------11111111----------------- EAVVDTLDQKQGLEDVSGLLELAVEADDEETFNEAVAELDALEEKLAQLEFRRFSGEYDS -------------------------------------------------3333--1111- ADCYLDIQAGSGGTEAQDWASLERYLRWAESRGFKTEIIEESEGEVAGIKSVTIKISGDY -----------------------------1111--------------------------- AYGWLRTETGVHRLVRKSPFDSGGRRHTSFSSAFVYPEVDDDIDIEINPADLRIDVYRAS ----1111---------1111-----------------2222-----3333--------- GAGGQHVNRTESAVRITHIPTGIVTQCQNDRSQHKNKDQAKQKAKLYEVEQKKNAEKQAE -----1111--------------------------------------------------- DNKSDIGWGSQIRSYVLDDSRIKDLRTGVETRNTQAVLDGSLDQFIEASLKAGL ------------------------------------1111-3333----1111- >ALPHA-GLUCURONIDASE; SWP:Q8VP74; PDB:1GQIA; EDGYDMWLRYQPIADQTLLKTYQKQIRHLHVAGDSPTINAAAAELQRGLSGLLNKPIVAR --1111--------------------------------------------1111------ DEKLKDYSLVIGTPDNSPLIASLNLGERLQALGAEGYLLEQTRINKRHVVIVAANSDVGV ------------3333-3333----33331111----------iiii------------- LYGSFHLLRLIQTQHALEKLSLSSAPRLQHRVVNHWDNLNRVVERGYAGLSLWDWGSLPN ----------1111--2222-----------------1111------------3333--- YLAPRYTDYARINASLGINGTVINNVNADPRVLSDQFLQKIAALADAFRPYGIKMYLSIN --3333-------1111-----------------3333---------3333--------1 FNSPRAFGDVDTADPLDPRVQQWWKTRAQKIYSYIPDFGGFLVKADSEGQPGPQGYGRDH 111----------1111-----------------1111--------iiii-3333----- AEGANMLAAALKPFGGVVFWRAFVYHPDIEDRFRGAYDEFMPLDGKFADNVILQIKNGPI ----------3333-----------3333-3333-----3333----1111--------- DFQPREPFSALFAGMSRTNMMMEFQITQEYFGFATHLAYQGPLFEESLKTETHARGEGST 
--------3333---------------1111!!!!--------------------2222- IGNILEGKVFKTRHTGMAGVINPGTDRNWTGHPFVQSSWYAFGRMAWDHQISAATAADEW -------------------------1111--1111------------1111--------- LRMTFSNQPAFIEPVKQMMLVSREAGVNYRSPLGLTHLYSQGDHYGPAPWTDDLPRADWT -------3333--------------------iiii------------1111----1111- AVYYHRASKTGIGFNRTKTGSNALAQYPEPIAKAWGDLNSVPEDLILWFHHLSWDHRMQS 3333---1111-----1111-3333-----------3333-1111-------1111-333 GRNLWQELVHKYYQGVEQVRAMQRTWDQQEAYVDAARFAQVKALLQVQEREAVRWRNSCV 3---------------------------3333---------------------------- LYFQSVAGRPIPANYEQPEHDLEYYKMLARTTYVPEPWHPASSSRVLK -----------1111--------------------33333333----- >3-DEHYDROQUINATE DEHYDRAT; SWP:P24670; PDB:1GQNA; MKTVTVKNLIIGEGMPKIIVSLMGRDINSVKAEALAYREATFDILEWRVDHFMDIASTQS -----!!!!---------------------------1111-------3333--3333--- VLTAARVIRDAMPDIPLLFTFRSAKEGGEQTITTQHYLTLNRAAIDSGLVDMIDLELFTG -----------1111-------3333-----------------------------1111- DADVKATVDYAHAHNVYVVMSNHDFHQTPSAEEMVSRLRKMQALGADIPKIAVMPQSKHD -----------1111--------------------------1111-----------3333 VLTLLTATLEMQQHYADRPVITMSMAKEGVISRLAGEVFGSAATFGAVKQASAPGQIAVN -------------------------33333333-3333--------------2222---- DLRSVLMILHNA --------1111 >DEHYDROQUINASE; SWP:P54517; PDB:1GQOA; PHFLILNGPNVNRLGSREPEVFGRQTLTDIETDLFQFAEALHIQLTFFQSNHEGDLIDAI -------2222------1111-----------------1111---------3333----- HEAEEQYSGIVLNPGALSHYSYAIRDAVSSISLPVVEVHLSNLYAREEFRHQSVIAPVAK -3333---------3333---3333----------------3333-3333----3333-- GQIVGLGAEGYKLAVRYLLSQ ------3333-------1111 >DOC1/APC10; SWP:P53068; PDB:1GQPA; SVLVLDDRIVDAATKDLYVNGFQNPTPENLQHMFHQGIEILDSARMINVTHLALWKPSSF ----------33333333------------------------------1111-------- KLGNPVDFALDDNYDTFWQSDGGQPHQLDIMFSKRMDICVMAIFFSMIADESYAPSLVKV 22223333----1111-----------------------------3333!!!!------- YAGHSPSDARFYKMLEVRNVNGWVALRFLLKCQFIRLLFPVNHENGKDTHLRGIRLYVPS ----3333----------------------------------%%%%-------------- >UDP-N-ACETYLMURAMATE-L-AL; SWP:P45066; PDB:1GQQA; 
VQQIHFIGIGGAGMSGIAEILLNEGYQISGSDIADGVVTQRLAQAGAKIYIGHAEEHIEG -------1111-------------------------------1111-------1111222 ASVVVVSSAIKDDNPELVTSKQKRIPVIQRAQMLAEIMRFRHGIAVAGTHGKTTTTAMIS 2-----3333----3333-----------------3333--------------------- MIYTQAKLDPTFVSRYLIAEADEFLHLQPMVSVVTNMEFEKMKATYVKFLHNLPFYGLAV -----------------------3333--------------------------1111--- MCADDPVLMELVPKVGRQVITYGFSEQADYRIEDYEQTGFQGHYTVICPNNERINVLLNV -1111-----3333----------1111---------!!!!------1111--------- PGKHNALNATAALAVAKEEGIANEAILEALADFQGAGRRFDQLGEFIRPNGKVRLVDDYG -----------------------------------------------1111--------- HHPTEVGVTIKAAREGWGDKRIVMIFQPHRYSRTRDLFDDFVQVLSQVDALIMLDVYAAG -------------3333-------------------------3333-------------- EAPIVGADSKSLCRSIRNLGKVDPILVSDTSQLGDVLDQIIQDGDLILAQGAGSVSKISR ---2222---------3333--------3333----1111-2222--------------- GLAESWKN -------- >EOSINOPHIL-DERIVED NEUROT; SWP:P10153; PDB:1GQVA; MKPPQFTWAQWFETQHINMTSQQCTNAMQVINNYQRRCKNQNTFLLTTFANVVNVCGNPN --1111------------------------------------------------1111-- MTCPSNKTRKNCHHSGSQVPLIHCNLTTPSPQNISNCRYAQTPANMFYIVACDNRDQRRD --1111-----------------------11111111------------------3333- PPQYPVVPVHLDRII 3333----------- >MYO-INOSITOL-1-PHOSPHATE ; SWP:P71703; PDB:1GR0A; TEVRVAIVGVGNCASSLVQGVEYYYNADDTSTVPGLMHVRFGPYHVRDVKFVAAFDVDAK -----------------------11111111-2222----!!!!3333---------111 KVGFDLSDAIFASENNTIKIADVAPTNVIVQRGPTLDGIGKYYADTIELSDAEPVDVVQA 1---33331111----------------------!!!!---------------------- LKEAKVDVLVSYLPVGSEEADKFYAQCAIDAGVAFVNALPVFIASDPVWAKKFTDARVPI -1111--------2222-------------------------1111-------1111--- VGDDIKSQVGATITHRVLAKLFEDRGVQLDRTMQLNVGGNMDFLNMLEDVHIGPSDHVGW ----------------------1111------------------3333----------33 LDDRKWAYVRLEGRAFGDVPLNLEYKLEVWDSPNSAGVIIDAVRAAKIAKDRGIGGPVIP 33------------2222-------------------------------1111------- ASAYLMKSPPEQLPDDIARAQLEEFIIG ---------------------------- >COLLAGEN X; SWP:Q03692; PDB:1GR3A; 
MPVSAFTVILSKAYPAIGTPIPFDKILYNRQQHYDPRTGIFTCQIPGIYYFSYHVHVKGT ----------------------------1111---------------------------- HVWVGLYKNGTPVMYTYDEYTKGYLDQASGSAIIDLTENDQVWLQLPNAESNGLYSSEYV -------iiii---------2222------------2222-------3333-----1111 HSSFSGFLVAPM ------------ >GREA PROTEIN; SWP:P0A6W5; PDB:1GRJ; QAIPMTLRGAEKLREELDFLKSVRRPEIIAAIAEAREHGDLKENAEYHAAREQQGFCEGR -----------------------------------11113333----------------- IKDIEAKLSNAQVIDVTKMPNNGRVIFGATVTVLNLDSDEEQTYRIVGDDEADFKQNLIS --------------3333-------2222------------------3333-3333---- VNSPIARGLIGKEEDDVVVIVEFEVIKVEYL --------2222------------------- >MAJOR SPERM PROTEIN 31/40; SWP:P53017; PDB:1GRWA; SVPPGDIQTQPGTKIVFNAPYDDKHTYHIKVINSSARRIGYGIKTTNMKRLGVDPPCGVL ----------------------------------------------3333---------- DPKEAVLLAVSCDAFAFGQEDTNNDRITVEWTNTPDGAAKQFRREWFQGDGMVRRKNLPI 2222-----------3333-----------------------3333-------------- EYNP ---- >POLY (ADP-RIBOSE) POLYMER; SWP:O88554; PDB:1GS0A; ESQLDLRVQELLKLICNVQTMEEMMIEMKYDTKRAPLGKLTVAQIKAGYQSLKKIEDCIR -----------------------------------3333--------------------- AGQHGRALVEACNEFYTRIPHDFGLSIPPVIRTEKELSDKVKLLEALGDIEIALKLVKEH ----------------------!!!!---------------------------------- PLDQHYRNLHCALRPLDHESNEFKVISQYLQSTHAPTHKDYTMTLLDVFEVEKEGEKEAF ------------------------------11113333--------------22223333 REDLPNRMLLWHGSRLSNWVGILSHGLRVAPPEAPITGYMFGKGIYFADMSSKSANYCFA --------------3333------------33331111-----------3333--3333- SRLKNTGLLLLSEVALGQCNELLEANPKAQGLLRGKHSTKGMGKMAPSPAHFITLNGSTV -------------------------11111111--------------3333---iiii-- PLGPASDTGILNPEGYTLNYNEFIVYSPNQVRMRYLLKIQFNFLQ --------------------------1111--------------- >ACETYLGLUTAMATE KINASE; SWP:P11445; PDB:1GS5A; MMNPLIIKLGGVLLDSEEALERLFSALVNYRESHQRPLVIVHGGGCVVDELMKGLNLPVK -----------3333----------------------------3333-----1111---- KKNGLRVTPADQIDIITGALAGTANKTLLAWAKKHQIAAVGLFLGDGDSVKVTQLDEELG -iiii---1111--------------------1111------1111---------3333- 
HVGLAQPGSPKLINSLLENGYLPVVSSIGVTDEGQLMNVNADQAATALAATLGADLILLS ----------------1111----------1111-------------------------- DVSGILDGKGQRIAEMTAAKAEQLIEQGIITDGMIVKVNAALDAARTLGRPVDIASWRHA ------1111--------------------!!!!------------------------33 EQLPALFNGMPMGTRILA 33---1111--------- >APOLIPOPROTEIN E; SWP:P02649; PDB:1GS9A; SGQRWELALGRFWDYLRWVQTLSEQVQEELLSSQVTQELRALMDETMKELKAYKSELEEQ ------------------3333----------------------------------1111 LTPVAEETRARLSKELQAAQARLGADMEDVRGRLVQYRGEVQAMLGQSTEELRVRLASHL -------------------------------------------iiii------------- RKLRKRLLRDADDLQKRLAVYQAG ------------------------ >GLUTATHIONE SYNTHETASE; SWP:P04425; PDB:1GSA; MIKLGIVMDPIANINIKKDSSFAMLLEAQRRGYELHYMEMGDLYLINGEARAHTRTLNVK ---------3333-3333----------1111------3333---iiii----------- QNYEEWFSFVGEQDLPLADLDVILMRKDPPFDTEFIYATYILERAEEKGTLIVNKPQSLR ---------------3333--------------------------1111----------- DCNEKLFTAWFSDLTPETLVTRNKAQLKAFWEKHSDIILKPLDGMGGASIFRVKEGDPNL ----------3333------------------------------iiii-----2222--- GVIAETLTEHGTRYCMAQNYLPAIKDGDKRVLVVDGEPVPYCLARIPQGGETRGNLAAGG --------iiii----------3333-------iiii-----------------3333-- RGEPRPLTESDWKIARQIGPTLKEKGLIFVGLDIIGDRLTEINVTSPTCIREIEAEFPVS -----------------------------------------------------1111--- ITGMLMDAIEARLQ ----------1111 >SPORE COAT PROTEIN A; SWP:P07788; PDB:1GSKA; TLEKFVDALPIPDTLKPVQQSKEKTYYEVTMEECTHQLHRDLPPTRLWGYNGLFPGPTIE --------------------1111--------------1111-------%%%%------- VKRNENVYVKWMNNLPSTHFLPIDHTIHEPEVKTVVHLHGGVTPDDSDGYPEAWFSKDFE -2222------------------1111----------2222--1111--1111------- QTGPYFKREVYHYPNQQRGAILWYHDHAMALTRLNVYAGLVGAYIIHDPKEKRLKLPSDE --1111---------------------22223333------------33333333--!!! 
YDVPLLITDRTINEDGSLFYPSAPENPSPSLPNPSIVPAFCGETILVNGKVWPYLEVEPR !-----------1111-----------3333---------------iiii---------- KYRFRVINASNTRTYNLSLDNGGDFIQIGSDGGLLPRSVKLNSFSLAPAERYDIIIDFTA ------------------1111-------1111-------------2222-------111 YEGESIILANSAGCGGDVNPETDANIMQFRVTKPLAQKDESRKPKYLASYPSVQHERIQN 1-----------------1111---------------------------1111------- IRTLKLAGTQDEYGRPVLLLNNKRWHDPVTETPKVGTTEIWSIINPTRGTHPIHLHLVSF ----------1111-----%%%%1111------2222----------------------- RVLDRRPFDIARYQESGELSYTGPAVPPPPSEKGWKDTIQAHAGEVLRIAATFGPYSGRY ----------------------------3333---------2222--------------- VWHCHILEHEDYDMMRPMDITD -----3333------------- >GRIFFONIA SIMPLICIFOLIA L; SWP:P24146; PDB:1GSL; NTVNFTYPDFWSYSLKNGTEITFLGDATRIPGALQLTKTDANGNPVRSSAGQASYSEPVF ---------------2222----------2222------1111----------------- LWDSTGKAASFYTSFTFLLKNYGAPTADGLAFFLAPVDSSVKDYGGFLGLFRHETAADPS --1111-----------------------------1111-----1111---3333--333 KNQVVAVEFDTWINKDWNDPPYPHIGIDVNSIVSVATTRWENDDAYGSSIATAHITYDAR 3------------3333-----------------------3333---------------- SKILTVLLSYEHGRDYILSHVVDLAKVLPQKVRIGFSAGVGYDEVTYILSWHFFSTLDGT ----------------------3333------------------------------2222 NK -- >MUCOSAL ADDRESSIN CELL AD; SWP:Q13477; PDB:1GSMA; VKPLQVEPPEPVVAVALGASRQLTCRLACADRGASVQWRGLDTSLGAVQSDTGRSVLTVR ---------------2222----------------------------------------- NASLSAAGTRVCVGSCGGRTFQHTVQLLVYAFPNQLTVSPAALVPGDPEVACTAHKVTPV --3333---------iiii----------------------------------------- DPNALSFSLLVGGQELEGAQALGPEVQEEEEEPQGDEDVLFRVTERWRLPPLGTPVPPAL 1111-----------2222----------------------------------------- YCQATMRLPGLELSHRQAIPVLIEGR -------2222----------1111- >GLYCINAMIDE RIBONUCLEOTID; SWP:P15640; PDB:1GSOA; MKVLVIGNGGREHALAWKAAQSPLVETVFVAPGNAGTALEPALQNVAIGVTDIPALLDFA ---------------------3333--------------1111-----1111-------- QNEKIDLTIVGPEAPLVKGVVDTFRAAGLKIFGPTAGAAQLEGSKAFTKDFLARHKIPTA 1111--------------------1111------3333---------------------- 
EYQNFTEVEPALAYLREKGAPIVIKAKGVIVAMTLEEAEAAVHDMLAGNAFGDAGHRIVI --------3333---------------------3333------1111--2222------- EEFLDGEEASFIVMVDGEHVLPMATSQDHKRVGDKDTGPNTGGMGAYSPAPVVTDDVHQR ---------------------------------%%%%------------3333------- TMERIIWPTVKGMAAEGNTYTGFLYAGLMIDKQGNPKVIEFNCRFGDLETQPIMLRMKSD -------------1111-------------1111------------3333---1111--- LVELCLAACESKLDEKTSEWDERASLGVVMAAGGYPGDYRTGDVIHGLPLEEVAGGKVFH -------11111111-------------------------------------2222---- AGTKLAQVVTNGGRVLCVTALGHTVAEAQKRAYALMTDIHWDDCFCRKDIGWRAIER ---------------------------------1111---2222----2222----- >CLASS-MU GLUTATHIONE S-TR; SWP:P20136; PDB:1GSUA; VVTLGYWDIRGLAHAIRLLLEYTETPYQERRYKAGPAPDFDPSDWTNEKEKLGLDFPNLP ---------!!!!-------1111-----------------3333--1111--------- YLIDGDVKLTQSNAILRYIARKHNMCGETEVEKQRVDVLENHLMDLRMAFARLCYSPDFE ---!!!!------------------------------------------------1111- KLKPAYLEQLPGKLRQLSRFLGSRSWFVGDKLTFVDFLAYDVLDQQRMFVPDCPELQGNL --------------------!!!!--------3333----------------3333---- SQFLQRFEALEKISAYMRSGRFMKAPIFWYTALWNNK ----------3333-1111---------1111----- >Transcription factor SOX-; SWP:P48432; PDB:1GT0D; DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLR ----------------3333------------------------1111------------ ALHMKEHPDYKYRPRRKTKT ------1111---------- >ODORANT-BINDING PROTEIN; SWP:P07435; PDB:1GT1A; QEEEAEQNLSELSGPWRTVYIGSTNPEKIQENGPFRTYFRELVFDDEKGTVDFYFSVKRD --------3333------------3333-2222-------------------------ii GKWKNVHVKATKQDDGTYVADYEGQNVFKIVSLSRTHLVAHNINVDKHGQTTELTELFVK ii----------3333-----------------------------1111----------- LNVEDEDLEKFWKLTEDKGIDKKNVVNFLENENHPHPE ---------------1111-3333-------------- >MTH169; SWP:O26271; PDB:1GTDA; KFVEVRIRLKKGLNPEAATIERALALLGYEVEDTDTTDVITFTDEDSLEAVEREVEDCQR ------------------------------------------------------------ LLCNPVIHDYDVSINES ---3333---------- >DIHYDROPYRIMIDINE DEHYDRO; SWP:Q28943; PDB:1GTEA; APVLSKDVADIESILALNPRTQSHAALHSTLAKKLDKKHWKRNPDKNCFHCEKLENNFDD 
--1111------1111-------------------3333-----1111-----2222--- IKHTTLGERGALREAMRCLKCADAPCQKSCPTHLDIKSFITSISNKNYYGAAKMIFSDNP -----------------------3333--1111---------1111---------1111- LGLTCGMVCPTSDLCVGGCNLYATEEGSINIGGLQQFASEVFKAMNIPQIRNPCLPSQEK ---------33333333--33333333------------------------1111-1111 MPEAYSAKIALLGAGPASISCASFLARLGYSDITIFEKQEYVGGLSTSEIPQFRLPYDVV -3333--------------------------------------3333---1111-3333- NFEIELMKDLGVKIICGKSLSENEITLNTLKEEGYKAAFIGIGLPEPKTDDIFQGLTQDQ -------1111---------2222------1111---------------3333---3333 GFYTSKDFLPLVAKSSKAGMCACHSPLPSIRGAVIVLGAGDTAFDCATSALRCGARRVFL ---3333-------------------------------------------1111------ VFRKGFVNIRAVPEEVELAKEEKCEFLPFLSPRKVIVKGGRIVAVQFVRTEQDETGKWNE ----3333---3333----1111--------------iiii------------------- DEDQIVHLKADVVISAFGSVLRDPKVKEALSPIKFNRWDLPEVDPETMQTSEPWVFAGGD 1111------------------3333-1111----1111------------1111---33 IVGMANTTVESVNDGKQASWYIHKYIQAQYGASVSAKPELPLFYTPVDLVDISVEMAGLK 33-------------------------1111-------------3333-------iiii- FINPFGLASAAPTTSSSMIRRAFEAGWGFALTKTFSLDKDIVTNVSPRIVRGTTSGPMYG ---------3333-----------------------3333-------------------- PGQSSFLNIELISEKTAAYWCQSVTELKADFPDNIVIASIMCSYNKNDWMELSRKAEASG ------------------------------1111----------------------1111 ADALELNLSCPHGMGLACGQDPELVRNICRWVRQAVQIPFFAKLTPNVTDIVSIARAAKE ----------------3333--------------------------------------11 GGADGVTATNTVSGLMGLKADGTPWPAVGAGKRTTYGGVSGTAIRPIALRAVTTIARALP 11----------------1111------1111--------------------------22 GFPILATGGIDSAESGLQFLHSGASVLQVCSAVQNQDFTVIQDYCTGLKALLYLKSIEEL 22-----------------1111------3333-----------------------3333 QGWDGQSPGTESHQKGKPVPRIAELMGKKLPNFGPYLEQRKKIIAEEKMRLKEQNERKPF ---!!!!------iiii----1111----------------------------------- IPKKPIPAIKDVIGKALQYLGTFGELSNIEQVVAVIDEEMCINCGKCYMTCNDSGYQAIQ -------3333----3333--3333-----------3333-------------------- FDPETHLPTVTDTCTGCTLCLSVCPIIDCIRMVSRTTPYEPKRGL ----------1111---3333----2222---------------- >TRP RNA-BINDING 
ATTENUATI; SWP:Q9X6J6; PDB:1GTFA; SDFVVIKALEDGVNVIGLTRGADTRFHHSEKLDKGEVLIAQFTEHTSAIKVRGKAYIQTR --------------------------------2222------1111------------11 HGVIESEGK 11------- >PORPHOBILINOGEN DEAMINASE; SWP:P06983; PDB:1GTKA; DNVLRIATRQSPLALWQAHYVKDKLMASHPGLVVELVPMVGLFVKELEVALLENRADIAV ----------------------------1111------------------1111------ HSMKDVPVEFPQGLGLVTICEREDPRDAFVSNNYDSLDALPAGSIVGTSSLRRQCQLAER -1111-----2222---------------------3333-2222-----3333------- RPDLIIRSLRGNVGTRLSKLDNGEYDAIILAVAGLKRLGLESRIRAALPPEISLPAVGQG 3333-------------------------------11113333-----3333---2222- AVGIECRLDDSRTRELLAALNHHETALRVTAERAMNTRLEGGCQVPIGSYAELIDGEIWL ------1111------3333----------------1111-1111--------iiii--- RALVGAPDGSQIIRGERRGAPQDAEQMGISLAEELLNNGAREILAEVYNGDAPA -----1111----------3333------------------------------- >GLUTAMATE DEHYDROGENASE; SWP:P80319; PDB:1GTMA; ADPYEIVIKQLERAAQYMEISEEALEFLKRPQRIVEVTIPVEMDDGSVKVFTGFRVQHNW -3333---------1111--------3333------------1111-------------1 ARGPTKGGIRWHPEETLSTVKALAAWMTWKTAVMDLPYGGGKGGIIVDPKKLSDREKERL 111--------1111--------------------------------3333--------- ARGYIRAIYDVISPYEDIPAPDVYTNPQIMAWMMDEYETISRRKTPAFGIITGKPLSIGG -------3333-1111-----2222---------------%%%%-3333-----3333-- SLGRIEATARGASYTIREAAKVLGWDTLKGKTIAIQGYGNAGYYLAKIMSEDFGMKVVAV --3333---------------------2222----------------------------- SDSKGGIYNPDGLNADEVLKWKNEHGSVKDFPGATNITNEELLELEVDVLAPAAIEEVIT --------1111---------------2222------3333------------------- KKNADNIKAKIVAEVANGPVTPEADEILFEKGILQIPDFLCNAGGVTVSYFEWVQNITGY --3333---------------3333---1111-------1111----------------- YWTIEEVRERLDKKMTKAFYDVYNIAKEKNIHMRDAAYVVAVQRVYQAMLDRGWVKH --------------------------------------------------------- >4-HYDROXYPHENYLACETATE DE; SWP:Q46978; PDB:1GTTA; MKGTIFAVALNHRSQLDAWQEAFQQSPYKAPPKTAVWFIKPRNTVIGCGEPIPFPQGEKV -----------3333-----1111-----------------3333-2222----2222-- LSGATVALIVGKTATKVREEDAAEYIAGYALANDVSLPEESFYRPAIKAKCRDGFCPIGE 
-----------------33333333--------------------3333--2222----- TVALSNVDNLTIYTEINGRPADHWNTADLQRNAAQLLSALSEFATLNPGDAILLGTPQAR ---------------iiii-----3333------------------2222---------- VEIQPGDRVRVLAEGFPPLENPVVDEREVTTRKSFPTLPHPHGTLFALGLNYADHPEEPL ---2222-----2222--------3333-------------------------------- VFLKAPNTLTGDNQTSVRPNNIEYMHYEAELVVVIGKQARNVSEADAMDYVAGYTVCNDY -----3333---------------------------------3333-1111--------- AIRDYLENYYRPNLRVKSRDGLTPMLSTIVPKEAIPDPHNLTLRTFVNGELRQQGTTADL -3333-------3333--2222--------3333--1111------iiii-----3333- IFSVPFLIAYLSEFMTLNPGDMIATGTPKGLSDVVPGDEVVVEVEGVGRLVNRIVSEETA ----------3333---2222-------------2222-----2222--------3333- K - >THYMIDYLATE KINASE; SWP:O05891; PDB:1GTVA; MLIAIEGVDGAGKRTLVEKLSGAFRAAGRSVATLAFPRYGQSVAADIAAEALHGEHGDLA -------2222-------------1111---------2222---------1111-!!!!- SSVYAMATLFALDRAGAVHTIQGLCRGYDVVILDRYVASNAAYSAARLHENAAGKAAAWV -----------------------------------3333------1111-1111------ QRIEFARLGLPKPDWQVLLAVSAELAGERSRGRAQRDPGRARDNYERDAELQQRTGAVYA ------------------------------------1111--3333-------------- ELAAQGWGGRWLVVGADVDPGRLAATLA -----2222-----1111---------- >3-DEHYDROQUINATE DEHYDRAT; SWP:P15474; PDB:1GTZA; RSLANAPIMILNGPNLNLLGQAQPEIYGSDTLADVEALCVKAAAAHGGTVDFRQSNHEGE -3333-------2222-2222-3333-----------------1111------------- LVDWIHEARLNHCGIVINPAAYSHTSVAILDALNTCDGLPVVEVHISNIHQREPFRHHSY --------------------3333-----------2222--------1111-3333--33 VSQRADGVVAGCGVQGYVFGVERIAALAG 33----------3333------------- >CYTOCHROME C''; SWP:Q9RQB9; PDB:1GU2A; DVTNAEKLVYKYTNIAHSANPMYEAPSITDGKIFFNRKFKTPSGKEAACASCHTNNPANV -------------------3333-----------------3333---3333----1111- GKNIVTGKEIPPLAPRVNTKRFTDIDKVEDEFTKHCNDILGADCSPSEKANFIAYLLTET -----------------3333--3333----------------------------1111- KPTK ---- >ENDOGLUCANASE C; SWP:P14090; PDB:1GU3A; TFDDGPEGWVAYGTDGPLDTSTGALCVAVPAGSAQYGVGVVLNGVAIEEGTTYTLRYTAT -1111--------------1111------22222222----------2222--------- 
ASTDVTVRALVGQNGAPYGTVLDTSPALTSEPRQVTETFTASATYPATPAADDPEGQIAF -------------------------------------------------2222------- QLGGFSADAWTLCLDDVALDSE ---------------------- >CAAT/ENHANCER BINDING PRO; SWP:P17676; PDB:1GU4A; DKHSDEYKIRRERNNIAVRKSRDKAKMRNLETQHKVLELTAENERLQKKVEQLSRELSTL 1111-------------------------------------------------------- RNLFKQ ------ >CYTOCHROME C552; SWP:P32050; PDB:1GU6A; VEAKNETFAPQHPDQYLSWKATSEQSERVDALAEDPRLVILWAGYPFSRDYNKPRGHAFA ---3333-3333-----------------3333-3333---22221111------3333- VTDVRETLRTGAPKNAEDGPLPMACWSCKSPDVARLIQKDGEDGYFHGKWARGGPEIVNN ------3333-------------3333--3333---------------33331111---- LGCADCHNTASPEFAKGKPELTLSRPYAARAMEAIGKPFEKAGRFDQQSMVCGQCHVEYY -1111--3333-------------3333----1111-3333-3333-------------- FDGKNKAVKFPWDDGMKVENMEQYYDKIAFSDWTNSLSKTPMLKAQHPEYETWTAGIHGK --3333-----1111-------------------------------------------11 NNVTCIDCHMPKVQNAEGKLYTDHKIGNPFDNFAQTCANCHTQDKAALQKVVAERKQSIN 11-3333--------------------333333333333--------------------- DLKIKVEDQLVHAHFEAKAALDAGATEAEMKPIQDDIRHAQWRWDLAIASHGIHMHAPEE --------------------1111-3333--------------------1111------- GLRMLGTAMDKAADARTKLARLLATKGITHEIQIPDISTKEKAQQAIGLNMEQIKAEKQD -----------------------1111-----------------1111------------ FIKTVIPQWEEQARKNGLLSQ --------------------- >2,4-DIENOYL-COA REDUCTASE; SWP:Q8WZM3; PDB:1GU7A; MITAQAVLYTQHGEPKDVLFTQSFEIDDDNLAPNEVIVKTLGSPVNPSDINQIQGVYPSK -------------3333---------1111-1111----------3333----------- PAKTTGFGTTEPAAPCGNEGLFEVIKVGSNVSSLEAGDWVIPSHVNFGTWRTHALGNDDD ----1111-------------------1111---2222------------------1111 FIKLPNPAQSKANGKPNGLTINQGATISVNPLTAYLMLTHYVKLTPGKDWFIQNGGTSAV ----------1111------------------------------2222------1111-- GKYASQIGKLLNFNSISVIRDRPNLDEVVASLKELGATQVITEDQNNSREFGPTIKEWIK -----------------------------------------------3333--------1 QSGGEAKLALNCVGGKSSTGIARKLNNNGLMLTYGGMSFQPVTIPTSLYIFKNFTSAGFW 111------------------1111-------------------3333------------ 
VTELLKNNKELKTSTLNQIIAWYEEGKLTDAKSIETLYDGTKPLHELYQDGVANSKDGKQ ------------------------------------------3333-------3333--- LITY ---- >ALKYLHYDROPEROXIDASE D; SWP:P0A5N5; PDB:1GU9A; EKLKAALPEYAKDIKLNLSSITRSSVLDQEQLWGTLLASAAATRNPQVLADIGAEATDHL 3333---1111---------1111---3333---------1111-----------1111- SAAARHAALGAAAIGNNVFYRGRGFLEGRYDDLRPGLRMNIIANPGIPKANFELWSFAVS -----------------------------1111-----3333------------------ AINGCSHCLVAHEHTLRTVGVDREAIFEALKAAAIVSGVAQALATIEALS ----------------1111------------------------------ >D-ALLOSE-BINDING PERIPLAS; SWP:P39265; PDB:1GUDA; AAEYAVVLKTLSNPFWVDMKKGIEDEAKTLGVSVDIFASPSEGDFQSQLQLFEDLSNKNY ----------------------------------------2222---------------- KGIAFAPLSSVNLVMPVARAWKKGIYLVNLDEKIDMDNLKKAGGNVEAFVTTDNVAVGAK --------1111--------1111---------------1111----------------- GASFIIDKLGAEGGEVAIIEGKAGNASGEARRNGATEAFKKASQIKLVASQPADWDRIKA ---------3333--------2222----------------1111-------%%%%---- LDVATNVLQRNPNIKAIYCANDTMAMGVAQAVANAGKTGKVLVVGTDGIPEARKMVEAGQ ----------1111------------------11112222-------------------- MTATVAQNPADIGATGLKLMVDAEKSGKVIPLDKAPEFKLVDSILVTQ -----------------------3333---1111-------------- >LAMINARINASE 16A; SWP:Q9WXN1; PDB:1GUIA; SINNGTFDEPIVNDQANNPDEWFIWQAGDYGISGARVSDYGVRDGYAYITIADPGTDTWH ---1111------33331111-----1111------------iiii----------1111 IQFNQWIGLYRGKTYTISFKAKADTPRPINVKILQNHDPWTNYFAQTVNLTADWQTFTFT ---------2222----------------------------------------------- YTHPDDADEVVQISFELGEGTATTIYFDDVTVSPQ ---1111---------------------------- >GLUTATHIONE TRANSFERASE A; SWP:O15217; PDB:1GULA; RPKLHYPNGRGRMESVRWVLAAAGVEFDEEFLETKEQLYKLQDGNHLLFQQVPMVEIDGM ---------!!!!-------------------------------------------iiii KLVQTRSILHYIADKHNLFGKNLKERTLIDMYVEGTLDLLELLIMHPFLKPDDQQKEVVN --------------------------------------------3333--3333------ MAQKAIIRYFPVFEKILRGHGQSFLVGNQLSLADVILLQTILALEEKIPNILSAFPFLQE --------3333-------------%%%%--------------333311111111----- YTVKLSNIPTIKRFLEPGSKKKPPPDEIYVRTVYNIF 
------------------------------------- >GALACTOSE-1-PHOSPHATE URI; SWP:P09148; PDB:1GUQA; TQFNPVDHPHRRYNPLTGQWILVSPHRAKRPWQGAQETPAKQVLPAHDPDCFLCAGNVRV ---3333------------------3333------------------1111--2222-11 TGDKNPDYTGTYVFTNDFAALMSDTPDAPESHDPLMRCQSARGTSRVICFSPDHSKTLPE 11--------------------------------------------------11113333 LSVAALTEIVKTWQEQTAELGKTYPWVQVFENKGAAMGCSNPHPGGQIWANSFLPNEAER -------------------3333----------3333----------------------- EDRLQKEYFAEQKSPMLVDYVQRELADGSRTVVETEHWLAVVPYWAAWPFETLLLPKAHV ---------------------------1111---1111----1111-------------- LRITDLTDAQRSDLALALKKLTSRYDNLFQCSFPYSMGWHGAPFNGEENQHWQLHAHFYP -1111-------------------------------------------3333-------- PLLRSATVRKFMVGYEMLAETQRDLTAEQAAERLRAVSDIHFRESGV ----1111-----3333-----------------------3333--- >MOLYBDATE BINDING PROTEIN; SWP:P08854; PDB:1GUTA; SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKS ---------------------------2222-------3333------2222------33 TDVMILA 33----- >MYB PROTO-ONCOGENE PROTEI; SWP:P06876; PDB:1GUUA; KTRWTREEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPE ------------------------------2222---------------- >RETINOBLASTOMA PROTEIN; SWP:P06400; PDB:1GUXA; NTIQQLMMILNSASDQPSENLISYFNNCTVNPKESILKRVKDIGYIFKEKFAKAVGQGCV ---------1111-----------1111-------------------------------- EIGSQRYKLGVRLYYRVMESMLKSENFSKLLNDNIFHMSLLACALEVVMATYSFPWILNV -------------------------33331111---------------------3333-- LNLKAFDFYKVIESFIKAEGNLTREMIKHLERCEHRIMESLAWLSDSPLFDLIKQSK ---3333-----------1111-----------------11112222---------- >Retinoblastoma-associated; SWP:P06400; PDB:1GUXB; TSLSLFYKKVYRLAYLRLNTLCERLLSEHPELEHIIWTLFQHTLQNEYELMRDRHLDQIM ----------------------------3333------------------22223333-- MCSMYGICKVKNIDLKFKIIVTAYKDLPHAVQETFKRVLIKEEEYDSIIVFYNSVFMQRL --------1111--------------11113333-------------------------- KTNILQYASTRPPTLSPIPHI ----1111------------- >MALATE DEHYDROGENASE; SWP:P80039; PDB:1GUZA; MKITVIGAGNVGATTAFRLAEKQLARELVLLDVVEGIPQGKALDMYESGPVGLFDTKVTG 
--------3333-----------------------3333--------------------- SNDYADTANSDIVIITAGLPRKPGMTREDLLMKNAGIVKEVTDNIMKHSKNPIIIVVSNP --3333---------------2222----------------------------------- LDIMTHVAWVRSGLPKERVIGMAGVLDAARFRSFIAMELGVSMQDINACVLGGHGDAMVP ---------------1111----------------------3333--------!!!!--- VVKYTTVAGIPISDLLPAETIDKLVERTRNGGAEIVEHLKQGSAFYAPASSVVEMVESIV 3333--iiii1111--------------------------------------------11 LDRKRVLPCAVGLEGQYGIDKTFVGVPVKLGRNGVEQIYEINLDQADLDLLQKSAKIVDE 11------------2222------------1111-------------------------- NCKML 3333- >MALATE DEHYDROGENASE; SWP:P80039; PDB:1GV0A; MKITVIGAGNVGATTAFRLAEKQLARELVLLDVVEGIPQGKALDMYESGPVGLFDTKVTG --------3333------------------------------------------------ SNDYADTANSDIVVITAGLPRKPGMTLSMNAGIVREVTGRIMEHSKNPIIVVVSNPLDIM ---3333----------------------------------------------------- THVAWQKSGLPKERVIGMAGVLDSARFRSFIAMELGVSMQDVTACVLGGHGDAMVPVVKY ----------3333-----------------------3333--------!!!!---3333 TTVAGIPVADLISAERIAELVERTRTGGAEIVNHLKQGSAFYSPATSVVEMVESIVLDRK --iiii3333--------------------3333--------------------1111-- RVLTCAVSLDGQYGIDGTFVGVPVKLGKNGVEHIYEIKLDQSDLDLLQKSAKIVDENCKM ----------2222------------1111--------------------------3333 L - >MANGANESE SUPEROXIDE DISM; SWP:BAB77594; PDB:1GV3A; SIGFIDRQLGTNPAELPPLPYGYDALEKAIDAETMKLHHDKHHAAYVNNLNNALKKHPEL ---1111--------------11111111------------------------3333333 QNSSVEALLRDLNSVPEDIRTTVRNNGGGHLNHTIFWQIMSPDGGGQPTGDIAQEINQTF 3---------3333-3333-----------------11111111---------------- GSFEEFKKQFNQAGGDRFGSGWVWLVRNPQGQLQVVSTPNQDNPIMEGSYPIMGNDVWEH ---------------------------1111--------------------------333 AYYLRYQNRRPEYLNNWWNVVNWSEINRRTQAS 3----!!!!------------------------ >PROGRAMED CELL DEATH PROT; SWP:Q9Z0X1; PDB:1GV4A; TVPQIRAPSHVPFLLIGGGTAAFAAARSIRARDPGARVLIVSEDPELPYMRPPLSKELWF --------------------------------------------------3333-3333- SDDPNVTKTLQFRQWNGKERSIYFQPPSFYVSAQDLPNIENGGVAVLTGKKVVHLDVRGN 
---3333------1111--------3333--33331111-----------------1111 MVKLNDGSQITFEKCLIATGGTPRSLSAIDRAGAEVKSRTTLFRKIGDFRALEKISREVK ---1111-------------------------33331111--------------1111-- SITVIGGGFLGSELACALGRKSQASGIEVIQLFPEKGNMGKILPQYLSNWTMEKVKREGV -------------------------------------------------------1111- KVMPNAIVQSVGVSGGRLLIKLKDGRKVETDHIVTAVGLEPNVELAKTGGLEIDSDFGGF -------------iiii----1111-----------------1111-------------- RVNAELQARSNIWVAGDAACFYDIKLGRRRVEHHDHAVVSGRLAGENMTGAAKPYWHQSM --1111--2222---1111---------------------------1111---------- FWSDLGPDVGYEAIGLVDSSLPTVGVFAKATAQDNPKSATEQSGTGIRSESETESEASEI -----1111--------1111---------1111-------------3333--------- TIPPSAPAVPQVPVEGEDYGKGVIFYLRDKVVVGIVLWNVFNRMPIARKIIKDGEQHEDL --------------3333------------------------3333-------------- NEVAKLFNIH --3333---- >ANGIOGENIN; SWP:P03950; PDB:1GV7A; DNSRYTHFLTQHYDAKPQGRDDRYCESIMRRRGLTSPCKDINTFIHGNKRSIKAICSQKN -----------------------------1111--------------3333-3333---- VACKNGQTNCYISKSSFQVTTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQSI --1111------------------------------------------iiii----3333 FR -- >P58/ERGIC-53; SWP:Q62902; PDB:1GV9A; PHRRFEYKYSFKGPHLVQSDGTVPFWAHAGNAIPSADQIRIAPSLKSQRGSVWTKTKAAF -----1111--------1111-2222--------1111---------------------- ENWEVEVTFRVTGRGRIGADGLAIWYTENQGLDGPVFGSADMWNGVGIFFDSFDNNPAIV -----------------------------------iiii--------------------- VVGNNGQINYDHQNDGATQALASCQRDFRNKPYPVRAKITYYQKTLTVMINNGFTPDKND ----------3333---------------------------iiii--------------- YEFCAKVENMVIPTQGHFGISAATGGLADDHDVLSFLTFQLTE ------------------------------------------- >OVOTRANSFERRIN; SWP:P56410; PDB:1GVCA; SYYAVAVVKKGTDFMIKDLRGKTSCHTGLGRSAGWNIPIGTLIHRGDIEWEGIESGSVEQ --------3333--33332222-----22221111-----------------2222---- AVAKFFSASCVPGATTEQKLCRQCKGDAKTKCLRNAPYSGYSGAFQCLKDGKGDVAFVKH ----------2222--33331111--1111-----1111-------------------11 TTVQENAPEEKDEYELLCLDGTRQPVDSYKTCNWARV 11333311111111---1111---11111111----- >MYB PROTO-ONCOGENE PROTEI; SWP:P06876; 
PDB:1GVDA; LIKGPWTKEEDQRLIKLVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPE --------------------------------22223333------------ >AFLATOXIN B1 ALDEHYDE RED; SWP:P38918; PDB:1GVEA; ARPATVLGAMEMGRRMDVTSSSRSVRAFLQRGHTEIDTAFVYANGQSETILGDLGLGLGR --------1111----------------1111------1111iiii----1111--2222 SGCKVKIATKAAPMFGKTLKPADVRFQLETSLKRLQCPRVDLFYLHFPDHGTPIEETLQA -------------iiii-------------------------------11113333---- CHQLHQEGKFVELGLSNYVSWEVAEICTLCKKNGWIMPTVYQGMYNAITRQVETELFPCL ---------------------------------------------11113333------- RHFGLRFYAFNPLAGGLLTGRYKYQDKDGKNPESRFFGNPFSQLYMDRYWKEEHFNGIAL ----------1111-1111---3333-------1111-------------3333------ VEKALKTTYGPTAPSMISAAVRWMYHHSQLKGTQGDAVILGMSSLEQLEQNLALVEEGPL ---------1111-----------------3333------------------3333---- EPAVVDAFDQAWNLVAHECPNYFR 3333----------3333------ >TAGATOSE-BISPHOSPHATE ALD; SWP:P42908; PDB:1GVFA; SIISTKYLLQDAQANGYAVPAFNIHNAETIQAILEVCSEMRSPVILAGTPGTFKHIALEE ----3333----------------------------------------3333-------- IYALCSAYSTTYNMPLALHLDHHESLDDIRRKVHAGVRSAMIDGSHFPFAENVKLVKSVV -------------------------------------------1111------------- DFCHSQDCSVEAELGRLGSAFLTDPQEAKRFVELTGVDSLAVAIGTAHGLYSKTPKIDFQ ---1111----------------------------------------------------- RLAEIREVVDVPLVLHGASDVPDEFVRRTIELGVTKVNVATELKIAFAGAVKAWFAENPQ -----------------22223333--------------3333--------------111 GNDPRYYMRVGMDAMKEVVRNKINVCGSANRIS 1-3333--------------------------- >FLAVOHEMOPROTEIN; SWP:P24232; PDB:1GVHA; MLDAQTIATVKATIPLLVETGPKLTAHFYDRMFTHNPELKEIFNMSNQRNGDQREALFNA --------------------------------------1111---1111----------- IAAYASNIENLPALLPAVEKIAQKHTSFQIKPEQYNIVGEHLLATLDEMFSPGQEVLDAW ------33333333---------3333---3333-------------------------- GKAYGVLANVFINREAEIYNENASKAGGWEGTRDFRIVAKTPRSALITSFELEPVDGGAV ------------------------2222-------------------------1111--- AEYRPGQYLGVWLKPEGFPHQEIRQYSLTRKPDGKGYRIAVKREEGGQVSNWLHNHANVG --------------1111-------------------------2222------------- 
DVVKLVAPAGDFFMAVADDTPVTLISAGVGQTPMLAMLDTLAKAGHTAQVNWFHAAENGD ----------------1111---------------------1111------------333 VHAFADEVKELGQSLPRFTAHTWYRQPSEADRAKGQFDSEGLMDLSKLEGAFSDPTMQFY 3----------1111------------3333------------3333------1111--- LCGPVGFMQFTAKQLVDLGVKQENIHYECFGPHKVL ---------------1111-1111------------ >C-ETS-1 PROTEIN; SWP:P14921; PDB:1GVJA; MNHKPKGTFKDYVRDRADLNKDKPVIPAAALAGYTGSGPIQLWQFLLELLTDKSCQSFIS ----------------1111-------------------------------1111----- WTGDGWEFKLSDPDEVARRWGKRKNKPKMNYEKLSRGLRYYYDKNIIHKTAGKRYVYRFV ---!!!!------------------1111------------1111----2222------- CDLQSLLGYTPEELHAMLDVKP -3333----------------- >Elastase-1 [Precursor]; SWP:P00772; PDB:1GVKB; VVGGTEAQRNSWPSQISLQYRSGSSWAHTCGGTLIRQNWVMTAAHCVDRELTFRVVVGEH -------11111111------!!!!-----------------3333-------------- NLNQNNGTEQYVGVQKIVVHPYWNTDDVAAGYDIALLRLAQSVTLNSYVQLGVLPRAGTI 1111---------------111133333333--------------1111------2222- LANNSPCYITGWGLTRTNGQLAQTLQQAYLPTVDYAICSSSSYWGSTVKNSMVCAGGDGV -2222----------------------------3333------!!!!-1111-------- RSGCQGDSGGPLHCLVNGQYAVHGVTSFVSRLGCNVTRKPTVFTRVSAYISWINNVIASN ---2222--------iiii----------3333--2222-----3333--------1111 >EPSILON; SWP:Q57231; PDB:1GVNA; VTYEKTFEIEIINELSASVYNRVLNYVLNHELNKNDSQLLEVNLLNQLKLAKRVNLFDYS --------------------------------1111-------------1111-1111-- LEELQAVHEYWRSMNRYSKQVLNKEKVA ---------------------------- >Zeta toxin; SWP:Q54944; PDB:1GVNB; ANIVNFTDKQFENRLNDNLEELIQGKKAVESPTAFLLGGQPGSGKTSLRSAIFEETQGNV -3333-----------------2222-------------2222------------iiii- IVIDNDTFKQQHPNFDELVKLYEKDVVKHVTPYSNRMTEAIISRLSDQGYNLVIEGTGRT ---33331111-----------11111111------------------------------ TDVPIQTATMLQAKGYETKMYVMAVPKINSYLGTIERYETMYADDPMTARATPKQAHDIV -----------1111-----------------------------1111----3333---- VKNLPTNLETLHKTGLFSDIRLYNREGVKLYSSLETPSISPKETLEKELNRKVSGKEIQP -----------3333--------1111----33331111--------------------- TLERIEQKMVLNKHQETPEFKAIQQKLESLQPP ---------------------------1111-- >GENE V 
PROTEIN; SWP:P03669; PDB:1GVP; MIKVEIKPSQAQFTTRSGVSRQGKPYSLNEQLCYVDLGNEYPVLVKITLDEGQPAYAPGL ------3333---------1111--------------------------2222------- YTVHLSSFKVGQFGSLMIDRLRLVPAK ---3333---1111------------- >KALLIKREIN; SWP:Q8WMN9; PDB:1GVZA; IIGGWECEKHSKPWQVAVYHQGHFQCGGVLVHPQWVLTAAHCMSDDYQIWLGRHNLSKDE -------22221111----------------1111---1111------------1111-1 DTAQFHQVSDSFLDPQFDLSLLKKKYLRP 111----------1111------------ >CLATHRIN COAT ASSEMBLY PR; SWP:Q00380; PDB:1GW5S; MIRFILIQNRAGKTRLAKWYMQFDDDEKQKLIEEVHAVVTVRDAKHTNFVEFRNFKIIYR --------1111-------------------------3333-1111-----!!!!----- RYAGLYFCICVDVNDNNLAYLEAIHNFVEVLNEYFHNVCELDLVFNFYKVYTVVDEMFLA -!!!!------1111-----------------------3333----------------ii GEIRETSQTKVLKQLLMLQSLE ii---------------1111- >GLUTATHIONE S-TRANSFERASE; SWP:O04941; PDB:1GWCA; GDDLKLLGAWPSPFVTRVKLALALKGLSYEDVEEDLYKKSELLLKSNPVHKKIPVLIHNG --------3333----------------------1111-------------------iii APVCESMIILQYIDEVFASTGPSLLPADPYERAIARFWVAYVDDKLVAPWRQWLRGKTEE i---------------1111---------------------------------------- EKSEGKKQAFAAVGVLEGALRECSKGGGFFGGDGVGLVDVALGGVLSWMKVTEALSGDKI -----------------------iiii-1111---------------------------- FDAAKTPLLAAWVERFIELDAAKAALPDVGRLLEFAKAREA -3333---------3333----------------------- >CATALASE; SWP:CAD27348; PDB:1GWEA; TTPHATGSTRQNGAPAVSDRQSLTVGSEGPIVLHDTHLLETHQHFNRMNIPERRPHAKGS -1111----1111------------1111--1111------------------------- GAFGEFEVTEDVSKYTKALVFQPGTKTETLLRFSTVAGELGSPDTWRDVRGFALRFYTEE -----------3333--33332222-------------11111111-------------- GNYDLVGNNTPIFFLRDPMKFTHFIRSQKRLPDSGLRDATMQWDFWTNNPESAHQVTYLM ----------------3333-----1111---------------33333333-------- GPRGLPRTWREMNGYGSHTYLWVNAQGEKHWVKYHFISQQGVHNLSNDEATKIAGENADF 1111---3333------------1111----------1111------------------- HRQDLFESIAKGDHPKWDLYIQAIPYEEGKTYRFNPFDLTKTISQKDYPRIKVGTLTLNR --------1111------------3333------1111-----3333------------- NPENHFAQIESAAFSPSNTVPGIGLSPDRMLLGRAFAYHDAQLYRVGAHVNQLPVNRPKN 
---3333-1111--3333-2222-------------------------11113333---- AVHNYAFEGQMWYDHTGDRSTYVPNSNGDSWSDETGPVDDGWEADGTLTREAQALRADDD ---------------!!!!------------------------------------1111- DFGQAGTLVREVFSDQERDDFVETVAGALKGVRQDVQARAFEYWKNVDATIGQRIEDEVK --------------------------1111----------------------------11 RHEGDGIPGVEAGGEARI 11----2222---1111- >CYTOCHROME P450 154C1; SWP:Q9L142; PDB:1GWIA; ARIPLDPFVTDLDGESARLRAAGPLAAVELPGGVPVWAVTHHAEAKALLTDPRLVKDINV -----1111----------1111------2222-----------------3333--3333 WGAWRRGEIPADWPLIGLANPGRSMLTVDGAEHRRLRTLVAQALTVRRVEHMRGRITELT ---------1111-3333-----3333--------------------------------- DRLLDELPADGGVVDLKAAFAYPLPMYVVADLMGIEEARLPRLKVLFEKFFSTQTPPEEV ---1111-------3333-----------------3333------------11113333- VATLTELASIMTDTVAAKRAAPGDDLTSALIQASENGDHLTDAEIVSTLQLMVAAGHETT ----------------------------------iiii---------------------- ISLIVNAVVNLSTHPEQRALVLSGEAEWSAVVEETLRFSTPTSHVLIRFAAEDVPVGDRV --------------------------3333-------------------------!!!!- IPAGDALIVSYGALGRDERAHGPTADRFDLTRTSGNRHISFGHGPHVCPGAALSRMEAGV -2222-------33333333-1111-------------1111-11111111--------- ALPALYARFPHLDLAVPAAELRNKPVVTQNDLFELPVRLAHHH --------1111----3333-----1111-------------- >MORPHINONE REDUCTASE; SWP:Q51990; PDB:1GWJA; TSFSNPGLFTPLQLGSLSLPNRVIMAPLTRSRTPDSVPGRLQQIYYGQRASAGLIISEAT ------1111---!!!!-------------------------------3333-------- NISPTARGYVYTPGIWTDAQEAGWKGVVEAVHAKGGRIALQLWHVGRVSHELVQPDGQQP --3333-------------------------1111---------!!!!-3333-%%%%-- VAPSALKAEGAECFVEFEDGTAGLHPTSTPRALETDGIPGIVEDYRQAAQRAKRAGFDMV -------2222-----1111-------------1111----------------------- EVHAANACLPNQFLATGTNRRTDQYGGSIENRARFPLEVVDAVAEVFGPERVGIRLTPFL ----%%%%-3333--1111---1111---------------------3333--------- ELFGLTDDEPEAMAFYLAGELDRRGLAYLHFNEPDWIGGDITYPEGFREQMRQRFKGGLI -%%%%--------------------------------------2222------------- YCGNYDAGRAQARLDDNTADAVAFGRPFIANPDLPERFRLGAALNEPDPSTFYGGAEVGY ------------------------3333-------------------3333-----2222 
TDYPFLDNGHDRLG -------------- >NON-CATALYTIC PROTEIN 1; SWP:Q9C171; PDB:1GWMA; MNVRATYTVIFKNASGLPNGYDNWGWGCTLSYYGGAMIINPQEGKYGAVSLKRNSGSFRG -----------------2222-----------iiii-----2222--------------- GSLRFDMKNEGKVKILVENSEADEKFEVETISPSDEYVTYILDVDFDLPFDRIDFQDAPG -------------------1111---------------------------------3333 NGDRIWIKNLVHSTGSADDFVDPINLEHHHHHH ---------------3333---3333------- >PEROXIDASE C1A; SWP:P00433; PDB:1GWUA; MQLTPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASILRLHFHDCFVNGCDASILLDNTT -----1111--1111--------------1111------------------3333---11 SFRTEKDAFGNANSARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQSVTLAGGPSWRV 11-3333---------3333------------------------------1111------ PLGRRDSLQAFLDLANANLPGPFFTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQCRF --------------------1111---------1111--3333-------------3333 IMDRLYNFSNTGLPDPTLNTTYLQTLRGLCPLNGNLSALVDFDLRTPTIFDNKYYVNLEE -------%%%%---1111------------22221111---------------------- QKGLIQSDQELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMDRMGNITPLTGTQGQIRL ----33333333-1111----------------------------------!!!!----- NCRVVNS 1111--- >STICHOLYSIN II; SWP:P07845; PDB:1GWYA; ALAGTIIAGASLTFQVLDKVLEELGKVSRKIAVGIDNESGGTWTALNAYFRSGTTDVILP -2222--3333----------1111----------------------------------- EFVPNTKALLYSGRKDTGPVATGAVAAFAYYMSSGNTLGVMFSVPFDYNWYSNWWDVKIY ---2222------------------------1111------------------------- SGKRRADQGMYEDLYYGNPYRGDNGWHEKNLGYGLRMKGIMTSAGEAKMQIKISR -------------------------------iiii-------------------- >2-C-METHYL-D-ERYTHRITOL 2; SWP:P36663; PDB:1GX1A; ERIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKL -------------------iiii--------------------------1111--3333- FPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKLPHIPQRVFIAEDLGC -33331111---------------1111----------------1111------------ HDDVNVKATTTEKLGFTGRGEGIACEAVALLIK ----------%%%%3333--------------- >SERINE/THREONINE-PROTEIN ; SWP:O96017; PDB:1GXCA; PWARLWALQDGFANLECVNDNYWFGRDKSCEYCFDEPLLKRTDKYRTYSKKHFRIFREVG -----------------------------------3333---3333-------------1 
PKNSYIAYIEDHSGNGTFVNTELVGKGKRRPLNNNSEIALSLSRNKVFVFFDLTVD 111---------------iiii--2222----2222-------------------- >MYOSIN BINDING PROTEIN C,; SWP:Q14896; PDB:1GXEA; RQEPPKIHLDCPGRIPDTIVVVAGNKLRLDVPISGDPAPTVIWQKAITQGNKAPARPAPD ------------------------------------------------------------ APEDTGDSDEWVFDKKLLCETEGRVRVETTKDRSIFTVEGAEKEDEGVYTVTVKNPVGED -----------------------------------------3333--------------- QVNLTVKVID ---------- >COLICIN E8 IMMUNITY PROTE; SWP:P09881; PDB:1GXGA; MELKNSISDYTETEFKKIIEDIINCEGDEKKQDDNLEHFISVTEHPSGSDLIYYPEGNND -----1111-----------------------------------33331111-------- GSPEAVIKEIKEWRAANG ------------------ >Photosystem I reaction ce; SWP:P12975; PDB:1GXIE; ALNRGDKVRIKRTESYWYGDVGTVASVEKSGILYPVIVRFDRVNYNGFSGSASGVNTNNF ----------------2222-------------------------1111----------- AENELELVQAAAK -1111-------- >CHROMOSOME SEGREGATION SM; SWP:Q9X0R4; PDB:1GXJA; GFSRAVRAVFEEKERFPGLVDVVSNLIEVDEKYSLAVSVLLGGTAQNIVVRNVDTAKAIV -----------33331111--3333----3333--------------------------- EFLKQNEAGRVTILPLDLIDGSFNRISGLENERGFVGYAVDLVKFPSDLEVLGGFLFGNS --------------1111-------2222--2222--3333----1111-------!!!! 
VVVETLDDAIRMKKKYRLNTRIATLDGELISGRGAITGGRE -----------------------1111---1111------- >PECTATE LYASE; SWP:Q9F7L3; PDB:1GXMA; MTGRMLTLDGNPAANWLNNARTKWSASRADVVLSYQQNNGGWPKNLDYNSVGNGGGGNES -------2222-------3333--3333----11111111------3333---------- GTIDNGATITEMVFLAEVYKSGGNTKYRDAVRKAANFLVNSQYSTGALPQFYPLKGGYSD ---iiii-------------------------------33331111-------------- HATFNDNGMAYALTVLDFAANKRAPFDTDVFSDNDRTRFKTAVTKGTDYILKAQWKQNGV ----%%%%----------1111----------------------------------iiii LTVWCAQHGALDYQPKKARAYELESLSGSESVGVLAFLMTQPQTAEIEQAVRAGVAWFNS ------------------1111---------------1111------------------1 PRTYLEGYTYDSSLAATNPIVPRAGSKMWYRFYDLNTNRGFFSDRDGSKFYDITQMSLER 111-------3333--------2222-----------------1111----3333----- RTGYSWGGNYGTSIINFAQKVGYL --------3333------------ >PHOSPHATE REGULON TRANSCR; SWP:P08402; PDB:1GXQA; PMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEFKLLHFFMTHPERVYSREQLLNHVW --1111---iiii----------------------------1111--------------- GTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF ------3333------------1111-3333-------------- >TRANSDUCIN-LIKE ENHANCER ; SWP:Q04724; PDB:1GXRA; DYFQGAMGSKPAYSFHVTADGQMQPVPFPPDALIGPGIPRHARQINTLNHGEVVCAVTIS ----------------------------1111--2222---------------------- NPTRHVYTGGKGCVKVWDISHPGNKSPVSQLDCLNRDNYIRSCKLLPDGCTLIVGGEAST --------------------3333----------1111-------3333----------- LSIWDLAAPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGH ---------------------------1111------1111------1111--------- TDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGYCPTGEWLAVG ---------1111-------------------------------------1111------ MESSNVEVLHVNKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQ ---------2222------------------1111------------------------- SKESSSVLSCDISVDDKYIVTGSGDKKATVYE ------------1111---------------- >HYDROXYNITRILE LYASE; SWP:Q8W4X3; PDB:1GXSA; QQEDDRILGLPGQPNGVAFGMYGGYVTIDDNNGRALYYWFQEADTADPAAAPLVLWLNGG 3333-----2222----------------1111-------------3333---------- 
PGCSSIGLGAMQELGAFRVHTNGESLLLNEYAWNKAANILFAESPAGVGFSYSNTSSDLS ---3333------------1111-----1111------------2222------------ MGDDKMAQDTYTFLVKWFERFPHYNYREFYIAGESGHFIPQLSQVVYRNRNNSPFINFQG --------------------1111------------------------33331111---- LLVSSGLTNDHEDMIGMFESWWHHGLISDETRDSGLKVCPGTSFMHPTPECTEVWNKALA ---------------------1111----------------------3333--------- EQGNINPYTIYTPTCDREPSPYQRRFW -----1111------------------ >Hydroxynitrile lyase; SWP:Q8W4X3; PDB:1GXSB; LPPYDPCAVFNSINYLNLPEVQTALHANVSGIVEYPWTVCSNTIFDQWGQAADDLLPVYR ----1111--------------------%%%%----------------------3333-- ELIQAGLRVWVYSGDTDSVVPVSSTRRSLAALELPVKTSWYPWYMAPTEREVGGWSVQYE --------------------3333----1111-------------1111----------- GLTYVTVRGAGHLVPVHRPAQAFLLFKQFLKGEPMPAE ------------1111---------------------- >HYDROGENASE MATURATION PR; SWP:P30131; PDB:1GXUA; NTSCGVQLRIRGKVQGVGFRPFVWQLAQQLNLHGDVCNDGDGVEVRLREDPEVFLVQLYQ ----------------------------------------------------------11 HCPPLARIDSVEREPFIWSALPTEFTIR 111111---------------------- >T-CELL ECTO-ADP-RIBOSYLTR; SWP:P20974; PDB:1GXYA; PLMLDTAPNAFDDQYEGCVNKMEEKAPLLLQEDFNMNAKLKVAWEEAKKRWNNIKPSRSY ------1111----2222-----------------------------------3333--- PKGFNDFHGTALVAYTGSIAVDFNRAVREFKENPGQFHYKAFHYYLTRALQLLSNGDCHS 2222--------3333------------33333333--3333--------1111------ VYRGTKTRFHYTGAGSVRFGQFTSSSLSKKVAQSQEFFSDHGTLFIIKTCLGVYIKEFSF ---------------------------3333-------1111------------1111-- RPDQEEVLIPGYEVYQKVRTQGYNEIFLDSPKRKKSNYNCLYS 3333-----1111--------------------------1111 >NUCLEAR TRANSPORT FACTOR ; SWP:P13662; PDB:1GY6A; DKPIWEQIGSSFIQHYYQLFDNDRTQLGAIYIDASCLTWEGQQFQGKAAIVEKLSSLPFQ --3333----------------3333-11111111---iiii-----------1111--- KIQHSITAQDHQPTPDSCIISMVVGQLKADEDPIMGFHQMFLLKNINDAWVCTNDMFRLA ---------------------------------------------%%%%----------- LHNFG ----- >NUCLEAR TRANSPORT FACTOR ; SWP:P33331; PDB:1GY7A; DFNTLAQNFTQFYYNQFDTDRSQLGNLYRNESMLTFETSQLQGAKDIVEKLVSLPFQKVQ 
-------------------11113333-1111---!!!!--------------------- HRITTLDAQPASPYGDVLVMITGDLLIDEEQNPQRFSQVFHLIPDGNSYYVFNDIFRLNY --------------------------!!!!--------------!!!!------------ S - >UDP-GALACTOSE 4-EPIMERASE; SWP:Q8T8E9; PDB:1GY8A; HMRVLVCGGAGYIGSHFVRALLRDTNHSVVIVDSLVGTHGKSDHVETRENVARKLQQSDG -------1111-----------------------1111---1111--------------- PKPPWADRYAALEVGDVRNEDFLNGVFTRHGPIDAVVHMCAFLAVGESVRDPLKYYDNNV --1111---------1111--------1111------------3333------------- VGILRLLQAMLLHKCDKIIFSSSAAIFGNPTMNAEPIDINAKKSPESPYGESKLIAERMI ------------------------3333---------1111------------------- RDCAEAYGIKGICLRYFNACGAHEDGDIGEHYQGSTHLIPIILGRVMSDIAPDDKRMPIF ----------------------3333-----2222------------------------- GTDYPTPDGTCVRDYVHVCDLASAHILALDYVEKLGPNDKSKYFSVFNLGTSRGYSVREV -----1111----------------------111111111111----------------- IEVARKTTGHPIPVRECGRREGDPAYLVAASDKAREVLGWKPKYDTLEAIMETSWKFQRT -------------------2222---------------------------------3333 HPNGYA --!!!! >LACCASE 2; SWP:Q12718; PDB:1GYCA; AIGPAASLVVANAPVSPDGFLRDAIVVNGVFPSPLITGKKGDRFQLNVVDTLTNHTMLKS ---------------1111-------iiii--------2222-----------3333--- TSIHWHGFFQAGTNWADGPAFVNQCPIASGHSFLYDFHVPDQAGTFWYHSHLSTQYCDGL ----2222-22221111----------2222--------------------!!!!1111- RGPFVVYDPKDPHASRYDVDNESTVITLTDWYHTAARLGPRFPLGADATLINGLGRSAST ----------1111------1111----------1111------------iiii-----1 PTAALAVINVQHGKRYRFRLVSISCDPNYTFSIDGHNLTVIEVDGINSQPLLVDSIQIFA 111-------2222------------------2222------iiii------------22 AQRYSFVLNANQTVGNYWIRANPNFGTVGFAGGINSAILRYQGAPVAEPTTTQTTSVIPL 22---------------------------2222-------2222---------------- IETNLHPLARMPVPGSPTPGGVDKALNLAFNFNGTNFFINNASFTPPTVPVLLQILSGAQ 1111-------------2222-----------------%%%%------------1111-- TAQDLLPAGSVYPLPAHSTIEITLPATALAPGAPHPFHLHGHAFAVVRSAGSTTYNYNDP 1111--2222----------------1111------------------2222-------- IFRDVVSTGTPAAGDNVTIRFQTDNPGPWFLHCHIDFHLEAGFAIVFAEDVADVKAANPV ---------3333---------------------33331111-------3333-3333-- 
PKAWSDLCPIYDGLSEANQ 3333------11113333- >ARABINAN ENDO-1,5-ALPHA-L; SWP:P95470; PDB:1GYHA; GAKQVDVHDPVMTREGDTWYLFSTGPGITIYSSKDRVNWRYSDRAFATEPTWAKRVSPSF --------------!!!!------2222---------------------11113333--- DGHLWAPDIYQHKGLFYLYYSVSAFGKNTSAIGVTVNKTLNPASPDYRWEDKGIVIESVP -----------iiii--------2222-------------1111--------------22 QRDLWNAIAPAIIADDHGQVWMSFGSFWGGLKLFKLNDDLTRPAEPQEWHSIAKLERSVL 22------------------------!!!!------1111-----------------333 MDDSQAGSAQIEAPFILRKGDYYYLFASWGLCCRKGDSTYHLVVGRSKQVTGPYLDKTGR 31111-------------!!!!-----------!!!!-----------1111---1111- DMNQGGGSLLIKGNKRWVGLGHNSAYTWDGKDYLVLHAYEAADNYLQKLKILNLHWDGEG 3333---------1111----------iiii--------1111-------------1111 WPQVDEKELDSYISQRLK ----3333---------- >CYTOCHROME C3, A DIMERIC ; SWP:Q9R638; PDB:1GYOA; LDVPCKVVITAPEGEDPHPRFGKVEMSHAKHRNVSCVSCHHMFDGCGDFQKCADCHIDRD -----------2222-----------333311113333-1111-------1111------ DRSYERGFYKAWHSESEISCRGCHKAMKAKNEQTGPIGCLQGCHEA ---1111--------------------1111------1111----- >CYTOSOL AMINOPEPTIDASE; SWP:P11648; PDB:1GYTA; MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTL --------3333----------------------------------3333---------- LLHHVPNVLSERILLIGCGKERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGR --------------------------------------3333--------3333--2222 NNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFNVPTRRELTSGERAIQHGLAI --------------1111--1111------------------3333-------------- AAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVG -------------3333------------------------------------------3 QGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAV 333----------------------------------------2222--1111------- YGVMRMVAELQLPINVIGVLAGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLC ---------------------------------------3333------11113333--- DVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHNPLAHELIAASEQSGDRAWRL ------1111----------3333------------------------------------ PLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKG 
--3333-1111----------------------11111111--------------1111- ATGRPVALLAQFLLNRAGFNGEE ----------------------- >HYPOTHETICAL PROTEIN YDCE; SWP:P31992; PDB:1GYXA; PHIDIKCFPRELDEQQKAALAADITDVIIRHLNSKDSSISIALQQIQPESWQAIWDAEIA ----------------------------------3333--------3333--------33 PQMEALIKKPGYSMNA 33-------------- >HYPOTHETICAL TRNA/RRNA ME; SWP:P39290; PDB:1GZ0A; SEIYGIHAVQALLERAPERFQEVFILKGREDKRLLPLIHALESQGVVIQLANRQYLDEKS ---------------1111--------------1111---------------3333---- DGAVHQGIIARVKPGRQYQENDLPDLIASLDQPFLLILDGVTDPHNLGACLRSADAAGVH ----iiii----------3333-------------------------------------- AVIVPKDRSAQLNATAKKVACGAAESVPLIRVTNLARTRLQEENIWIVGTAGEADHTLYQ ---------------------3333--------3333---1111------------1111 SKTGRLALVGAEGEGRRLTREHCDELISIPAGSVSSLNVSVATGICLFEAVRQRS ---------------33331111--------------3333----------1111 >OVOCLEIDIN; SWP:Q9PRS8; PDB:1GZ2A; GCGPGWVPTPGGCLGFFSRELSWSRAESFCRRWGPGSHLAAVRSAAELRLLAELLNARGG --2222--2222---------------------2222----------------------- DGSGEGADGRVWIGLHRPAGSRSWRWSDGTAPRFASWHRTAKARRGGRCAALRDEEAFTS -----------------2222----1111----------3333------------%%%%- WAARPCTERNAFVCKAAA ----1111---------- >ESTRADIOL 17 BETA-DEHYDRO; SWP:P97852; PDB:1GZ6A; SPLRFDGRVVLVTGAGGGLGRAYALAFAERGALVVVNDLGGDFKGVGKGSSAADKVVEEI ----2222-------------------1111----------------------------- RRRGGKAVANYDSVEAGEKLVKTALDTFGRIDVVVNNAGILRDRSFSRISDEDWDIIQRV 1111--------3333----------------------------3333------------ HLRGSFQVTRAAWDHKKQNYGRIITASASGIYGNFGQANYSAAKLGLLGLANTLVIEGRK ---------------1111-------3333---2222---------------------11 NNIHCNTIAPNAGSRTETVPEDLVEALKPEYVAPLVLWLCHESCEENGGLFEVGAGWIGK 11-------------33333333----3333---------1111---------------- LRWERTLGAIVRKRNQPTPEAVRDNWVKICDFSNASKPKSIQESTGGIIEVLHKIDS -----------------3333----------2222---------------------- >LIPASE 2; SWP:P32946; PDB:1GZ7A; APTATLANGDTITGLNAIVNEKFLGIPFAEPPVGTLRFKPPVPYSASLNGQQFTSYGPSC -----1111-----------------------!!!!-----------2222--------- 
MQMNPMGSFEDTLPKNALDLVLQSKIFQVVLPNDEDCLTINVIRPPGTRASAGLPVMLWI -------1111----------3333-------------------22221111-------- FGGGFELGGSSLFPGDQMVAKSVLMGKPVIHVSMNYRVASWGFLAGPDIQNEGSGNAGLH --%%%%--3333----------1111-------------1111----------------- DQRLAMQWVADNIAGFGGDPSKVTIYGESAGSMSTFVHLVWNDGDNTYNGKPLFRAAIMQ ------------------1111------------------%%%%---iiii--------- SGCMVPSDPVDGTYGTEIYNQVVASAGCGSASDKLACLRGLSQDTLYQATSDTPGVLAYP --------1111-----------11111111------1111--------1111-1111-! SLRLSYLPRPDGTFITDDMYALVRDGKYAHVPVIIGDQNDEGTLFGLSSLNVTTDAQARA !!!------------------------------------11113333-1111-------- YFKQSFIHASDAEIDTLMAAYTSDITQGSPFDTGIFNAITPQFKRISALLGDLAFTLARR -----1111--------------3333-----!!!!-----3333--------------- YFLNYYQGGTKYSFLSKQLSGLPVLGTFHGNDIIWQDYLVGSGSVIYNNAFIAFANDLDP ----------------1111-------22223333-----3333---------------- NKAGLWTNWPTYTSSSQSGNNLMQINGLGLYTGKDNFRPDAYSALFSNPPSFFV -------------1111------------------------------3333--- >CELL DIVISION PROTEIN KIN; SWP:P24941; PDB:1GZ8A; MENFQKVEKIGEGTYGVVYKARNKLTGEVVALKKIRVPSTAIREISLLKELNHPNIVKLL 1111--------1111-------------------------------1111-1111---- DVIHTENKLYLVFEFLHQDLKKFMDASALTGIPLPLIKSYLFQLLQGLAFCHSHRVLHRD ----%%%%----------------1111-----------------------1111----- LKPQNLLINTEGAIKLADFGLARAFGVPVRTYTHEVVTLWYRAPEILLGKYYSTAVDIWS -3333---1111-----22223333-----1111----11113333-------------- LGCIFAEMVTRRALFPGDSEIDQLFRIFRTLGTPDEVVWPGVTSMPDYKPSFPKWARQDF ----------------------------------33332222--11111111------33 SKVVPPLDEDGRSLLSQMLHYDPNKRISAKAALAHPFFQDVTKPVPHLRL 33-3333--------------3333--333311111111----------- >ERYTHRINA CRISTA-GALLI LE; SWP:P16404; PDB:1GZCA; VETISFSFSEFEPGNDNLTLQGAALITQSGVLQLTKINQNGMPAWDSTGRTLYTKPVHMW -----------2222-----------1111-------1111------------------- DSTTGTVASFETRFSFSIEQPYTRPLPADGLVFFMGPTKSKPAQGYGYLGVFNNSKQDNS --------------------------------------------!!!!---------333 YQTLAVEFDTFSNPWDPPQVPHIGIDVNSIRSIKTQPFQLDNGQVANVVIKYDAPSKILH 
3-----------1111------------------------2222---------1111--- VVLVYPSSGAIYTIAEIVDVKQVLPDWVDVGLSGATGAQRDAAETHDVYSWSFQASLPE -----1111---------3333----------------2222----------------- >GLUCOSE-6-PHOSPHATE ISOME; SWP:P08059; PDB:1GZDA; AALTQNPQFKKLQTWYHEHRSDLNLRRLFEGDKDRFNHFSLNLNTNHGRILLDYSKNLVT -1111-------------3333-3333------3333----------------------- EAVMQMLVDLAKSRGVEAARERMFNGEKINFTEDRAVLHVALRNRSNTPILVDGKDVMPE -------------------------------------3333--1111----iiii--333 VNRVLEKMKSFCKRVRSGEWKGYSGKSITDVINIGIGGSDLGPLMVTEALKPYSAEGPRV 3--------------------1111----------!!!!----------33331111--- WFVSNIDGTHIAKTLATLNPESSLFIIASKTFTTQETITNAETAKEWFLQSAKDPSAVAK ----------3333----3333------1111-----------------------3333- HFVALSTNTTKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDNFEQLLSGAH ------------1111-3333----111111111111----------------------- WMDQHFRTTPLEKNAPVLLALLGIWYINFFGCETHAMLPYDQYLHRFAAYFQQGDMESNG ---------3333---------------------------3333---3333--------- KYITKSGTRVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGLH ---1111----------------3333---3333--------------------%%%%-- HKILLANFLAQTEALMKGKSTEEARKELQAAGKSPEDFEKLLPHKVFEGNRPTNSIVFTK ---------------------------------333311113333--------------- LTPFILGALIAMYEHKIFVQGVIWDINSFDQWGVELGKQLAKKIEPELDGSSPVTSHDSS -------------------------------3333----------------------333 TNGLINFIKQEREA 3------------- >CELLULAR TUMOR ANTIGEN P5; SWP:P04637; PDB:1GZHA; SSVPSQKTYQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPG ----------1111-------------------1111--------------------222 TRVRAMAIYKQSQHMTEVVRRCPHHERCAPPQHLIRVEGLRVEYLDDRNTFRHSVVVPYE 2---------3333-------3333----1111--------------------------- PPECTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCACPGRDRRTE -------------1111---iiii---------1111----------------------- EENLRKK ------- >Tumor suppressor p53-bind; SWP:Q12888; PDB:1GZHB; LNKTLFLGYAFLLTMATTSDKLASRSPPFNKQYTESQLRAGAGYILEDFNTAYQCLLIAD ---1111-------------------------------1111------------------ 
QHCRTRKYFLCLASGIPCVSHVWVHDSCHANQLQNYRNYLLPAGYSLEEQRILDWQPREN ---------------------------1111---3333--------3333---------1 PFQNLKVLLVSDQQQNFLELWSEILMTGGAASVKQHHSSAHNKDIALGVFDVVVTDPSCP 111---------------------------------1111-----1111------1111- ASVLKCAEALQLPVVSQEWVIQCLIVGERIGFKQHPKYKHDYVSH ----------------------------------11111111--- >RAC-BETA SERINE/THREONINE; SWP:P31751; PDB:1GZKA; KVTMNDFDYLKLLGKGTFGKVILVREKATGRYYAMKILRKEVIVTESRVLQNTRHPFLTA --1111---------3333--------------------3333----3333---1111-- LKYAFQTHDRLCFVMEYANGGELFFHLSRERVFTEERARFYGAEIVSALEYLHSRDVVYR ----------------------------------------------------1111---- DIKLENLMLDKDGHIKITDFGLTPEYLAPEVLEDNDYGRAVDWWGLGVVMYEMMCGRLPF --3333---1111--------------3333------3333------------------- YNQDHERLFELILMEEIRFPRTLSPEAKSLLAGLLKKDPKQRLGGGPSDAKEVMEHRFFL ------3333---------------------------3333----1111------3333- SINWQDVVQKKLLPPFKPQVTSEVDTRYFDD ------1111-----------33331111-- >Guanine nucleotide exchan; SWP:O52623; PDB:1GZSB; GSLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSKNKDQCCNLLISKGI -------------------------------------------------------1111- NIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSANSKYPRMFINQHQQA -------------1111-----%%%%--1111--11113333------3333-------- SFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQNTP ---------------1111--------------------1111-- >3-DEHYDROQUINATE DEHYDRAT; SWP:P36918; PDB:1H05A; LIVNVINGPNLGRLGRRGTTHDELVALIEREAAELGLKAVVRQSDSEAQLLDWIHQAADA -------2222-2222----------------1111------------------------ AEPVILNAGGLTHTSVALRDACAELSAPLIEVHISNVHAREEFRRHSYLSPIATGVIVGL -------!!!!----------3333----------1111-3333----3333-------- GIQGYLLALRYLAEHVGT ------------------ >LYSOZYME; SWP:P15057; PDB:1H09A; VKKNDLFVDVSSHNGYDITGILEQMGTTNTIIKISESTTYLNPCLSAQVEQSNPIGFYHF -2222-----3333---------------------------1111---1111-------- ARFGGDVAEAEREAQFFLDNVPMQVKYLVLDYEDDPSGDAQANTNACLRFMQMIADAGYK -----------------1111---------------------------------1111-- PIYYSYKPFTHDNVDYQQILAQFPNSLWIAGYGLNDGTANFEYFPSMDGIRWWQYSSNPF 
--------------3333-------------!!!!----3333----------------- DKNIVLLDDEEDDKPKTAGTWKQDSKGWWFRRNNGSFPYNKWEKIGGVWYYFDSKGYCLT -----------------------3333----1111---------iiii----1111---- SEWLKDNEKWYYLKDNGAMATGWVLVGSEWYYMDDSGAMVTGWVKYKNNWYYMTNERGNM -----%%%%----1111----------------1111--------!!!!-----2222-- VSNEFIKSGKGWYFMNTNGELADNPSFTKEPDGLITVA -------!!!!--------------------------- >ALANINE--GLYOXYLATE AMINO; SWP:P21549; PDB:1H0CA; HKLLVTPPKALLKPLSIPNQLLLGPGPSNLPPRIMAAGGLQMIGSMSKDMYQIMDEIKEG -------3333------------------------3333--------------------- IQYVFQTRNPLTLVISGSGHCALEAALVNVLEPGDSFLVGANGIWGQRAVDIGERIGVHP -----------------3333--------------------------------------- MTKDPGGHYTLQEVEEGLAQHKPVLLFLTHGESSTGVLQPLDGFGELCHRYKCLLLVDSV ------------------------------------------3333-------------- ASLGGTPLYMDRQGIDILYSGSQKALNAPPGTSLISFSDKAKKKMYSRKTKPFSFYLDIK -2222-----1111---------3333---------------------------1111-- WLANFWGCDDQPRMYHHTIPVISLYSLRESLALIAEQGLENSWRQHREAAAYLHGRLQAL ------------------------------------------------------------ GLQLFVKDPALRLPTVTTVAVPAGYDWRDIVSYVIDHFDIEIMGGLGPSTGKVLRIGLLG -------3333-3333---------3333----------------!!!!---------!! 
CNATRENVDRVTEALRAALQHCPKKK !!------------------------ >ANTIBODY FAB FRAGMENT, LI; SWP:P01654; PDB:1H0DA; DIVLTQSPASLAVSLGQRATISCRASESVDNYI -------------2222---------------- >Angiogenin [Precursor]; SWP:ANGI_HUMAN; PDB:1H0DB; EVMLVESGGGLVKPGGSLKLSCAASGFTFSSYTMSWVRQTPEKRLEWVATISSGGGNTYY ------------2222-----------3333--------1111----------------- PDSVKGRFTISRDIAKNTLYLQMSSL 3333---------1111--------- >FORMATE DEHYDROGENASE (LA; SWP:Q934F5; PDB:1H0HA; ATMALKTVDAKQTTSVCCYCSVGCGLIVHTDKKTNRAINVEGDPDHPINEGSLCAKGAST ----1111----------------------------------1111--iiii-3333-33 WQLAENERRPANPLYRAPGSDQWEEKSWDWMLDTIAERVAKTREATFVTKNAKGQVVNRC 33---1111-------2222-----------------------1111---1111------ DGIASVGSAAMDNEECWIYQAWLRSLGLFYIEHQARIHSATVAALAESYGRGAMTNHWID -------3333---------------------3333--------------------3333 LKNSDVILMMGSNPAENHPISFKWVMRAKDKGATLIHVDPRYTRTSTKCDLYAPLRSGSD 1111--------3333-3333-------1111----------3333---------2222- IAFLNGMTKYILEKELYFKDYVVNYTNASFIVGEGFAFEEGLFAGYNKETRKYDKSKWGF --------------------------1111--3333--iiii------------------ ERDENGNPKRDETLKHPRCVFQIMKKHYERYDLDKISAICGTPKELILKVYDAYCATGKP --1111----1111-1111--------1111---------------------------11 DKAGTIMYAMGWTQHTVGVQNIRAMSINQLLLGNIGVAGGGVNALRGEANVQGSTDHGLL 11-----------1111------------1111---2222--------------1111-1 MHIYPGYLGTARASIPTYEEYTKKFTPVSKDPQSANWWSNFPKYSASYIKSMWPDADLNE 1112222----1111---------------1111-----3333---------1111---- AYGYLPKGEDGKDYSWLTLFDDMFQGKIKGFFAWGQNPACSGANSNKTREALTKLDWMVN -3333---2222--3333------------------3333---------3333------- VNIFDNETGSFWRGPDMDPKKIKTEVFFLPCAVAIEKEGSISNSGRWMQWRYVGPEPRKN ---------333322223333-----------1111------1111-------------- AIPDGDLIVELAKRVQKLLAKTPGKLAAPVTKLKTDYWVNDHGHFDPHKIAKLINGFALK --------------------------3333---3333--1111----------------- DFKVGDVEYKAGQQIATFGHLQADGSTTSGCWIYTGSYTEKGNMAARRDKTQTDMQAKIG ---!!!!--2222---3333----------3333----11113333------33331111 LYPGWTWAWPVNRRIIYNRASVDLNGKPYAPEKAVVEWNAAEKKWVGDVPDGPWPPQADK 
-1111----%%%%-----11111111---1111------1111----------------- EKGKRAFIMKPEGYAYLYGPGREDGPLPEYYEPMECPVIEHPFSKTLHNPTALHFATEEK ------1111--------1111--------------------------1111-------- AVCDPRYPFICSTYRVTEHWQTGLMTRNTPWLLEAEPQMFCEMSEELATLRGIKNGDKVI ---3333--------1111!!!!-333333333333-----------------2222--- LESVRGKLWAKAIITKRIKPFAIQGQQVHMVGIPWHYGWSFPKNGGDAANILTPSVGNPN --1111--------1111----iiii-----------3333------1111--------- TGIPETKAFMVNVTKA ---------------- >Formate dehydrogenase sub; SWP:Q8GC87; PDB:1H0HB; SKGFFVDTTRCTACRGCQVACKQWHGNPATPTENTGFHQNPPDFNFHTYKLVRMHEQEID ------3333----------------------------------1111----------ii GRIDWLFFPDQCRHCIAPPCKATADMEDESAIIHDDATGCVLFTPKTKDLEDYESVISAC ii-------------------------1111--------------3333----------1 PYDVPRKVAESNQMAKCDMCIDRITNGLRPACVTSCPTGAMNFGDLSEMEAMASARLAEI 111----1111------%%%%1111----------------------------------- KAAYSDAKLCDPDDVRVIFLTAHNPKLYHEYAVA ---1111---1111---------3333-1111-- >CARDIOTOXIN-3; SWP:P01444; PDB:1H0JA; LKCNKLVPLFYKTCPAGKNLCYKMFMVATPKVPVKRGCIDVCPKSSLLVKYVCCNTDRCN ------3333---------------1111----------------1111----------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:P52013; PDB:1H0PA; KGPKVTDRVYFDMEIGGKPIGRIVIGLFGKTVPKTATNFIELAKKPKGEGYPGSKFHRVI --------------iiii---------------------------2222-2222------ ADFMIQGGDFTRGDGTGGRSIYGEKFADENFKLKHYGAGWLSMANAGADTNGSQFFITTV -------------------1111------------------------------------- KTPWLDGRHVVFGKILEGMDVVRKIEQTEKLPGDRPKQDVIIAASGHIAVDTPFSVEREA -3333------------3333---1111-------------------------------- VV -- >SERINE PROTEASE INHIBITOR; SWP:Q9NQ38; PDB:1H0ZA; ESGKATSYAELCNEYRKLVRNGKLACTRENDPIQGPDGKVHGNTCSMCEVFFQAEEEEKK ------3333-3333-------------------1111-------------------333 KKEGESRN 3---3333 >ENDO-1,4-BETA-XYLANASE; SWP:Q8RJN8; PDB:1H12A; AFNNNPSSVGAYSSGTYRNLAQEMGKTNIQQKVNSTFDNMFGYNNTQQLYYPYTENGVYK ---------3333--------1111-----------------------------iiii-- AHYIKAINPDEGDDIRTEGQSWGMTAAVMLNKQEEFDNLWRFAKAYQKNPDNHPDAKKQG 
-------3333----------------1111------------------1111-3333-- VYAWKLKLNQNGFVYKVDEGPAPDGEEYFAFALLNASARWGNSGEFNYYNDAITMLNTIK --------1111------------------------------------------------ NKLMENQIIRFSPYIDNLTDPSYHIPAFYDYFANNVTNQADKNYWRQVATKSRTLLKNHF ----%%%%---1111----3333-3333---1111------------------------- TKVSGSPHWNLPTFLSRLDGSPVIGYIFNGQANPGQWYEFDAWRVIMNVGLDAHLMGAQA ----------------1111-------2222--1111-3333------------------ WHKSAVNKALGFLSYAKTNNSKNCYEQVYSYGGAQNRGCAGEGQKAANAVALLASTNAGQ -------------------1111------iiii------------------1111----- ANEFFNEFWSLSQPTGDYRYYNGSLYMLAMLHVSGNFKFYNNTF -------1111----1111------------------------- >FORMATE ACETYLTRANSFERASE; SWP:P09373; PDB:1H16A; SELNEKLATAWEGFTKGDWQNEVNVRDFIQKNYTPYEGDESFLAGATEATTTLWDKVMEG --------1111----3333-------------------1111----------------- VKLENRTHAPVDFDTAVASTITSHDAGYINKQLEKIVGLQTEAPLKRALIPFGGIKMIEG -------------------1111------3333---------2222---3333------- SCKAYNRELDPMIKKIFTEYRKTHNQGVFDVYTPDILRCRKSGVLTGLPDAYGRGRIIGD ---------------------------3333----------------------------- YRRVALYGIDYLMKDKLAQFTSLQADLENGVNLEQTIRLREEIAEQHRALGQMKEMAAKY --------------------------------------------------------3333 GYDISGPATNAQEAIQWTYFGYLAAVKSQNGAAMSFGRTSTFLDVYIERDLKAGKITEQE --------------------------------------3333------------------ AQEMVDHLVMKLRMVRFLRTPEYDELFSGDPIWATESIGGMGLDGRTLVTKNSFRFLNTL -------------------3333------------------1111----3333------- YTMGPSPEPNMTILWSEKLPLNFKKFAAKVSIDTSSLQYENDDLMRPDFNNDDYAIACCV ---------------1111---------------------------1111---------- SPMIVGKQMQFFGARANLAKTMLYAINGGVDEKLKMQVGPKSEPIKGDVLNYDEVMERMD ---2222-----------3333----iiii------------------------------ HFMDWLAKQYITALNIIHYMHDKYSYEASLMALHDRDVIRTMACGIAGLSVAADSLSAIK --------------------------3333------------------------------ YAKVKPIRDEDGLAIDFEIEGEYPQFGNNDPRVDDLAVDLVERFMKKIQKLHTYRDAIPT --------1111------------2222-3333--------------1111-2222---- QSVLTITSNVVYGKKTGNTPDGRRAGAPFGPGANPMHGRDQKGAVASLTSVAKLPFAYAK 
----!!!!----------1111-2222---!!!!-2222-----------3333333311 DGISYTFSIVPNALGKDDEVRKTNLAGLMDGYFHHEASIEGGQHLNVNVMNREMLLDAME 11-------3333----------------------1111-----------3333------ NPEKYPQLTIRVSGYAVRFNSLTKEQQQDVITRTFTQSM 33331111---------3333--------1111------ >Protein THO1; SWP:P40040; PDB:1H1JS; GSADYSSLTVVQLKDLLTKRNLSVGGLKNELVQRLIKDDEESKG -----------------1111----------------------- >ENDO TYPE CELLULASE ENGI; SWP:Q8TG26; PDB:1H1NA; AKVFQWFGSNESGAEFGSQNLPGVEGKDYIWPDPNTIDTLISKGMNIFRVPFMMERLVPN ------------11111111---2222-------------1111--------3333---- SMTGSPDPNYLADLIATVNAITQKGAYAVVDPHNYGRYYNSIISSPSDFETFWKTVASQF 1111-----------------1111------------%%%%------------------1 ASNPLVIFDTDNEYHDMDQTLVLNLNQAAIDGIRSAGATSQYIFVEGNSWTGAWTWTNVN 111------------------------------1111----------%%%%3333----- DNMKSLTDPSDKIIYEMHQYLDSDGSGTSATCVSSTIGQERITSATQWLRANGKKGIIGE --1111-1111----------1111--------1111----------------------- FAGGADNVCETAITGMLDYMAQNTDVWTGAIWWAAGPWWGDYIFSMEPDNGIAYQQILPI --------------------------------------!!!!--------------3333 LTPYL 3333- >CYTOCHROME C-552; SWP:P74917; PDB:1H1OA; VSSDCMVCHGMTGRDTLYPIVPRLAGQHKSYMEAQLKAYKDHSRADQNGEIYMWPVAQAL 1111-------------1111--2222---------------------------3333-- DSAKITALADYFNAQKPPMQSSGIKHAGAKEGKAIFNQGVTNEQIPACMECHGSDGQGAG ----------------------------------------1111--3333--------!! 
PFPRLAGQRYGYIIQQLTYFHNGTRVNTLMNQIAKNITVAQMKDVAAYLSSL !!--2222-------------------------1111--------------- >D-RIBULOSE-5-PHOSPHATE 3-; SWP:Q9SE42; PDB:1H1YA; AAKIAPSMLSSDFANLAAEADRMVRLGADWLHMDIMDGHFVPNLTIGAPVIQSLRKHTKA ------3333-1111--------1111-------------------3333----1111-- YLDCHLMVTNPSDYVEPLAKAGASGFTFHIEVSRDNWQELIQSIKAKGMRPGVSLRPGTP ---------3333---------------3333------------1111-------11113 VEEVFPLVEAENPVELVLVMTVEPGFGGQKFMPEMMEKVRALRKKYPSLDIEVDGGLGPS 333------------------------------------------1111--------111 TIDVAASAGANCIVAGSSIFGAAEPGEVISALRKSVEGSQ 1--------------3333-----------------3333 >METALLOCARBOXYPEPTIDASE I; SWP:P01075; PDB:1H20A; EQHADPICNKPCKTHDDCSGAWFCQACWNSARTCGPYVG ------2222----3333----------3333------- >SPLIT-SORET CYTOCHROME C; SWP:P81040; PDB:1H21A; GRFDQVGGAFGWKPHKLDPKECAQVAYDGYWYKGFGCGFGAFYSIVGLMGEKYGAPYNQF 22222222----------------------2222---------------------1111- PFAMLEANKGGISDWGTICGALYGAAATFSLFWGRKEVHPMVNELFRWYEVTKLPIFNPG 3333-------%%%%-----------3333----3333--------------------!! 
DAAQGVKGDLPMSASDSVLCHISVSKWCYENKIEATSKQRSERCGRLTADAAFKAAEIIN !!-------------------------------1111----------------------- TKIDQGKDFKSTFPMQASVSSCGECHMTKGNDANWAKGIMDCTPCHSGTAATQNKFVNHP ----!!!!-------------3333--2222----------3333---3333-------- >ALCOHOL DEHYDROGENASE; SWP:Q9Y9P9; PDB:1H2BA; KAARLHEYNKPLRIEDVDYPRLEGRFDVIVRIAGAGVCHTDLHLVQGMWHELLQPKLPYT ----------------------!!!!-----------3333-----1111---------- LGHENVGYIEEVAEGVEGLEKGDPVILHPAVTDGTCLACRAGEDMHCENLEFPGLNIDGG ------------2222---2222---------------11111111-----2222----- FAEFMRTSHRSVIKLPKDISREKLVEMAPLADAGITAYRAVKKAARTLYPGAYVAIVGVG -------3333----1111--------3333-----------------2222-------3 GLGHIAVQLLKVMTPATVIALDVKEEKLKLAERLGADHVVDARRDPVKQVMELTRGRGVN 333--------------------3333----1111----------------1111----- VAMDFVGSQATVDYTPYLLGRMGRLIIVGYGGELRFPTIRVISSEVSFEGSLVGNYVELH -------------3333--2222------------------------------------- ELVTLALQGKVRVEVDIHKLDEINDVLERLEKGEVLGRAVLIP -----1111---------1111-------1111---------- >MATRIX PROTEIN VP40; SWP:Q05128; PDB:1H2CA; VSSAFILEAMVNVISGPKVLMKQIPIWLPLGVADQKTYSFDSTTAAIMLASYTITHFGKA ---------------2222---------------------------------------11 TNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQEFVLPPVQLPQYFTFDLTALKLITQPLPA 11-----------2222-3333-------3333--------------------------- ATWTD 1111- >PHOSPHATASE; SWP:Q9ALU0; PDB:1H2EA; ATTLYLTRHGETKWNVERRMQGWQDSPLTEKGRQDAMRLGKRLEAVELAAIYTSTSGRAL --------------------!!!!----------------1111-----------3333- ETAEIVRGGRLIPIYQDERLREIHLGDWEGKTHDEIRQMDPIAFDHFWQAPHLYAPQRGE ------iiii------3333----!!!!---------------------3333------- RFCDVQQRALEAVQSIVDRHEGETVLIVTHGVVLKTLMAAFKDTPLDHLWSPPYMYGTSV 3333---------------2222---------------------3333-------2222- TIIEVDGGTFHVAVEGDVSHIEEVKEV -----%%%%--------1111------ >DNA REPAIR PROTEIN RAD52 ; SWP:P43351; PDB:1H2IA; LCFGQCQYTAEEYQAIQKALRQRLGPEYISSRMAGGGQKVCYIEGHRVINLANEMFGYNG -2222-------------3333--1111-----1111------3333---------1111 WAHSITQQNVDFVDLNNGKFYVGVCAFVRVQLKDGSYHEDVGYGVSEGLKSKALSLEKAR 
-------------------------------3333------------------------- KEAVTDGLKRALRSFGNALGNCILDKDYLRSLNKLPRQLPLEVDLTKAKRQDLEPSVEEA ----------3333--1111-11113333------------------------------- RYNSCR ------ >FACTOR INHIBITING HIF1; SWP:Q969Q7; PDB:1H2KA; EPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIENEEPVVLTDTNLVYPAL -------------3333--------------1111------1111----------3333- KWDLEYLQENIGNGDFSVYSASTHKFLYYDEKKMANFQNFKPRSNREEMKFHEFVEKLQD -----------------------------33331111------------3333------- IQQRGGEERLYLQQTLNDTVGRKIVMDFLGFNWNWINKQQGKRGWGQLTSNLLLIGMEGN ----------------1111-------1111-------------------------2222 VTPAHYDEQQNFFAQIKGYKRCILFPPDQFECLYPYPVHHPCDRQSQVDFDNPDYERFPN -------------------------11111111---1111-2222---1111-3333--- FQNVVGYETVVGPGDVLYIPMYWWHHIESLLNGGITITVNFWYKGAPTPEYPLKAHQKVA 1111-------2222----2222------2222--------------------3333--- IMRNIEKMLGEALGNPQEVGPLLNTMIKGRYN --------------1111--------2222-- >SENSORY RHODOPSIN II; SWP:P42196; PDB:1H2SA; MVGLTTLFWLGAIGMLVGTLAFAWAGRDAGSGERRYYVTLVGISGIAAVAYVVMALGVGW -----------------------1111--------------------------1111--- VPVAERTVFAPRYIDWILTTPLIVYFLGLLAGLDSREFGIVITLNTVVMLAGFAGAMVPG --------3333------------------------------------------1111-- IERYALFGMGAVAFLGLVYYLVGPMTESASQRSSGIKSLYVRLRNLTVILWAIYPFIWLL ----------------------------1111---------------------------- GPPGVALLTPTVDVALIVYLDLVTKVGFGFIALDAAATLRAEHGE ----------------------------------------1111- >PROLYL ENDOPEPTIDASE; SWP:P23687; PDB:1H2WA; MLSFQYPDVYRDETAIQDYHGHKVCDPYAWLEDPDSEQTKAFVEAQNKITVPFLEQCPIR -----------1111---iiii---1111---1111------------------------ GLYKERMTELYDYPKYSCHFKKGKRYFYFYNTGLQNQRVLYVQDSLEGEARVFLDPNILS --------1111---------!!!!-------------------1111------3333-1 DDGTVALRGYAFSEDGEYFAYGLSASGSDWVTIKFMKVDGAKELPDVLERVKFSCMAWTH 111---------1111--------iiii---------2222-----------------11 DGKGMFYNAYPQQDGKSDGTETSTNLHQKLYYHVLGTDQSEDILCAEFPDEPKWMGGAEL 11-------------------------------22223333------1111--------- 
SDDGRYVLLSIREGCDPVNRLWYCDLQQESNGITGILKWVKLIDNFEGEYDYVTNEGTVF 1111--------------------3333---------------------------!!!!- TFKTNRHSPNYRLINIDFTDPEESKWKVLVPEHEKDVLEWVACVRSNFLVLCYLHDVKNT ----2222--------1111-3333-----------------------------%%%%-- LQLHDLATGALLKIFPLEVGSVVGYSGQKKDTEIFYQFTSFLSPGIIYHCDLTKEELEPR ---------------------------1111--------1111----------------- VFREVTVKGIDASDYQTVQIFYPSKDGTKIPMFIVHKKGIKLDGSHPAFLYGYGGFNISI ----------3333---------1111---------2222--------------%%%%-- TPNYSVSRLIFVRHMGGVLAVANIRGGGEYGETWHKGGILANKQNCFDDFQCAAEYLIKE --------------------------------------!!!!---------------111 GYTSPKRLTINGGSNGGLLVATCANQRPDLFGCVIAQVGVMDMLKFHKYTIGHAWTTDYG 1--3333-----!!!!----------3333-----------11113333-----3333-- CSDSKQHFEWLIKYSPLHNVKLPEADDIQYPSMLLLTADHDDRVVPLHSLKFIATLQYIV 33333333-3333-3333------1111---------1111------------------- GRSRKQNNPLLIHVDTKAGHGAGKPTAKVIEEVSDMFAFIARCLNIDWIP --3333--------------2222-------------------------- >GROWTH-ARREST-SPECIFIC PR; SWP:Q14393; PDB:1H30A; HCDGRGGLKLSQDMDTCEDILPCVPFSVAKSVKSLYLGRPVIRLRFKRLQPTRLVAEFDF --3333------------------------------------------------------ RTFDPEGILLFAGGHQDSTWIVLALRAGRLELQLRYNGVGRVTSSGPVINHGMWQTISVE --------------1111-------iiii------------------------------- ELARNLVIKVNRDAVMKIAVAGDLFQPERGLYHLNLTVGGIPFHEKDLVQPINPRLDGCM ---------iiii------------------------------1111------------- RSWNWLTVKVNTRMQCFSVTERGSFYPGSGFAFYSLDYMRSTWEVEVVAHIRPAADTGVL -----------1111--------------------------------------------- FALWAPDLRAVPLSVALVDYHKKQLVVLAVEHTALALMEIKVCDGQEHVVTVSLRDGEAT ----1111---------------------!!!!-------1111----------2222-- LEVDGTRGQSEVSAAQLQERLAVLERHLRSPVLTFAGGLPDVPVTSAPVTAFYRGCMTLE --iiii------------------------------------1111-------------- VNRRLLDLDEAAYKHSDITAHSCPPVEPAAA %%%%--1111----1111------------- >DIHEME CYTOCHROME C; SWP:Q939U1; PDB:1H32A; GPDDPLVINGEIEIVTRAPTPAHLADRFDEIRSGWTFRTDDTQALEMDDFENSGMVFVEE 1111---iiii---------3333--------3333--33333333-33333333----- 
ARAVWDRPEGTEGKACADCHGAVDDGMYGLRAVYPKYVESAGKVRTVEQMINACRTSRMG --3333---1111-3333---3333-22221111---3333------------------- APEWDYIGPDMTAMVALIASVSRGMPVSVAIDGPAQSTWEKGREIYYTRYGQLDLSCASC ----1111-----------1111--------!!!!---------1111--1111-3333- HEQYFDHYIRADHLSQGQINGFPSYRLKNARLNAVHDRFRGIRDTRGVPFAVGSPEFVAL --------!!!!------1111-------------------1111-----2222------ ELYVASRGNGLSVEGPSVRN -----1111----------- >Cytochrome c; SWP:Q939U4; PDB:1H32B; AEVAPGDVAIDGQGHVARPLTDAPGDPVEGRRLMTDRSVGNCIACHEVTEMQFPGTVGPS ---3333---1111---------------------1111--------3333--------- LDGVAARYPEAMIRGILVNSKNVFPETVMPAYYRVEGFNRPGIAFTSKPIEGEIRPLMTA 2222----3333------3333-2222--------------------------------- GQIEDVVAYLMTLTQ --------------- >BOWMAN-BIRK TYPE PROTEINA; SWP:P01056; PDB:1H34A; KPCCDHCSCTKSIPPQCRCTDLRLDSCHSACKSCICTLSIPAQCVCDDIDDFCYEPC ---------------------------1111-------------------------- >ATP-PHOSPHORIBOSYLTRANSFE; SWP:P10366; PDB:1H3DA; TRLRIAMQKSGRLSDDSRELLARCGIKINLHTQRLIAMAENMPIDILRVRDDDIPGLVMD ---------------------1111------------------------3333------- GVVDLGIIGENVLEEELLNRRAQGEDPRYFTLRRLDFGGCRLSLATPVDEAWDGPLSLNG --------------------1111----------------------1111---3333222 KRIATSYPHLLKRYLDQKGISFKSCLLNGSVEVAPRAGLADAICDLVSTGATLEANGLRE 2--------------1111----------33331111----------------------- VEVIYRSKACLIQRDGEMEESKQQLIDKLLTRIQGVIQARESKYIMMHAPTERLDEVIAL ---------------------------------------------------------111 LPGAERPTILPLAMHMVSSETLFWETMEKLKALGASSILVLPIEKMME 1----------------------------------------------- >TYROSYL-TRNA SYNTHETASE; SWP:P83453; PDB:1H3EA; HTPEEALALLKRGAEEIVPEEELLAKLKEGRPLTVKLGADPTRPDLHLGHAVVLRKMRQF --3333--3333------3333-------------------------------------- QELGHKVVLIIGDFTGMIGDPSGRSKTRPPLTLEETRENAKTYVAQAGKILRQEPHLFEL 1111---------3333--------------3333------------------1111--- RYNSEWLEGLTFKEVVRLTSLMTVAQMLEREDFKKRYEAGIPISLHELLYPFAQAYDSVA ------1111------3333--3333-----------------3333------------- 
IRADVEMGGTDQRFNLLVGREVQRAYGQSPQVCFLMPLLVGLDGREKMSKSLDNYIGLTE --------1111-----------1111-------------3333----1111-------- PPEAMFKKLMRVPDPLLPSYFRLLTDLEEEEIEALLKAGPVPAHRVLARLLTAAYALPQI --------11113333-----------3333---3333--------------1111---- PPRIDRAFYESLGYAWEAFGRDKEAGPEEVRRAEARYDEVAKGGIPEEIPEVTIPASELK ---------3333-3333---3333-----------------------------3333-i EGRIWVARLFTLAGLTPSNAEARRLIQNRGLRLDGEVLTDPMLQVDLSRPRILQRGKDRF iii-------1111------------------iiii---1111----------------- VRVRLSD ------- >TYROSYL-TRNA SYNTHETASE; SWP:P83453; PDB:1H3FA; GHTPEEALALLKRGAEEIVPEEELLAKLKEGRPLTVKLGADPTRPDLHLGHAVVLRKMRQ -----------2222----3333---1111----------1111---------------- FQELGHKVVLIIGDFTRENAKTYVAQAGKILRQEPHLFELRYNSEWLEGLTFKEVVRLTS -1111----------------------------3333--------3333----------- LMTVAQMLEREDFKKRYEAGIPISLHELLYPFAQAYDSVAIRADVEMGGTDQRFNLLVGR --3333----------1111---3333---------------------1111-------- EVQRAYGQSPQVCFLMPLLVGLDGREKMSKSLDNYIGLTEPPEAMFKKLMRVPDPLLPSY ---1111-------------3333----3333----1111--------11113333---- FRLLTDLEEEEIEALLKAGPVPAHRVLARLLTAAYALPQIPPRIDRAFYESLGYAWEAFG ---------------------------------1111------------3333-1111-- RDKEAGPEEVRRAEARYDEVAKEEIPEVTIPASELKEGRIWVARLFTLAGLTPSNAEARR -1111-3333--------------------3333-iiii--------------------- LIQNRGLRLDGEVLTDPMLQVDLSRPRILQRGKDRFVRVRLSD -1111---iiii---1111------------------------ >CYCLOMALTODEXTRINASE; SWP:CAD32957; PDB:1H3GA; PTAIEHEPPFWWAGQHKGLQLVHGRDIGREAALDYPGVRLVSTTRVPNANYLFVDLEIGP -------------------------3333-----2222---------1111-------33 EAQPGSFDIVFKGDGRSERYRYRLLAREQGSAQRQGFGPGDAIYQIPDRFANGDPSNDNV 33----------%%%%-----------2222------1111-----1111---3333--- AGREQADRRHGGGRHGGDIRGTIDHLDYIAGLGFTQLWPTPLVENDAAAYSYHGYAATDH ------1111---------------------------------------3333-----11 YRIDPRYGSNEDFVRLSTEARKRGGLIQDVVLSHIGKHHWWKDLPTPDWINYGGKFVPTQ 11-3333-3333--------1111-----------1111--------------------- HHRVAVQDPYAAQADSENFTKGWFVEGPDLNQTNPLVANYLIQNNIWWIEYAGLSGLRID 
---111111113333---------------1111-------------------------- TYGYSDGAFLTEYTRRLAEYPRLNVGEEWSTRVPVVARWQRGKANFDGYTSHLPSLDFPL -1111--------------1111------------33332222-1111------------ VDARNALSKTGEENGLNEVYETLSLDYLYPEPQNLVLFGGNHDARFSAAGEDFDRWRNLV --------3333------------------1111--------------%%%%3333---- FLTPRIPQFYSGDEILTSTVKGRDDASYRRDFPGGWAGDKANAFSGAGLTSQQRAAQDLV ---------2222----------3333--------2222------2222----------- RKLANWRKNQPVIHNGRLHFGPEENTWVYFRYNKDKRIVANNNDKPTLPTARFQELKGAP ---------3333---------%%%%------1111------------3333---iiii- SGVDFLSGKTVGLGRELRLAPKSVVVIELPGLPE ---------------------------------- >RNA POLYMERASE SIGMA FACT; SWP:O87834; PDB:1H3LA; STAERSARFERDALEFLDQMYSAALRMTRNPADAEDLVQETYAKAYASFHQFREGTNLKA ----------------------3333---------------------3333--------- WLYRILTNTFINSYR --------------- >LEUCYL-TRNA SYNTHETASE; SWP:Q7SIE4; PDB:1H3NA; MEKYNPHAIEAKWQRFWEEKGFMKAKDLPGGRGKQYVLVMFPYPSGDLHMGHLKNYTMGD --------------------1111------------------------------------ VLARFRRMQGYEVLHPMGWDAFGLPAENAALKFGVHPKDWTYANIRQAKESLRLMGILYD ------------------------------1111------------------1111---3 WDREVTTCEPEYYRWNQWIFLKMWEKGLAYRAKGLVNWCPKCQTVLANEQVVEGRCWRHE 333--1111--------------1111------------3333---3333-iiii---11 DTPVEKRELEQWYLRITAYAERLLKDLEGLNWPEKVKAMQRAWIGRSEGAEILFPVEGKE 11---------------------1111-----3333-------------------2222- VRIPVFTTRPDTLFGATFLVLAPEHPLTLELAAPEKREEVLAYVEAAKRKTEIERQAEGR --------11111111-----1111-3333--3333--------------3333------ EKTGVFLGAYALNPATGERIPIWTADYVLFGYGTGAIMAVPAHDQRDYEFARKFGLPIKK ------------------------11113333-------3333----------------- VIERPGEPLPEPLERAYEEPGIMVNSGPFDGTESEEGKRKVIAWLEEKGLGKGRVTYRLR -------------------------!!!!-----3333---------------------- DWLISRQRYWGTPIPMVHCEACGVVPVPEEELPVLLPDLKDVEDIRPKGKSPLEAHPEFY ---------------------------3333---------3333-----------3333- ETTCPKCGGPAKRDTDTMDTFFDSSWYYLRYTDPHNDRLPFDPEKANAWMPVDQYIGGVE -------------------------33331111------------------------333 
HAVLHLLYSRFFTKFLHDLGMVKVEEPFQGLFTQGMVLAWTDFGPVEVEGSVVRLPEPTR 3--3333---------1111----------------------------!!!!-------- IRLEIPESALSLEDVRKMGAELRPHEDGTLHLWKPAVMSKSKGNGVMVGPFVKEQGADIA ---------------1111-----1111----------3333------------------ RITILFAAPPENEMVWTEEGVQGAWRFLNRIYRRVAEDREALLETSGVFQAEALEGKDRE --------3333-----------------------------1111----3333-!!!!-- LYGKLHETLKKVTEDLEALRFNTAIAALMEFLNALYEYRKDRPVTPVYRTAIRYYLQMLF ---------------1111----------------------------------------- PFAPHLAEELWHWFWPDSLFEAGWPELDEKALEK -----------------3333------3333--- >TRANSCRIPTION INITIATION ; SWP:O00268; PDB:1H3OA; FLLQAPLQRRILEIGKKHGITELHPDVVSYVSHATQQRLQNLVEKISET --------------1111------------------------------- >Transcription initiation ; SWP:Q16514; PDB:1H3OB; HVLTKKKLQDLVREVDPNEQLDEDVEELLQIADDFIESVVTAACQLARHRKSSTLEVKDV ---------------------3333------------------------------3333- QLHLERQWNWI ----------- >ANTIBODY FAB FRAGMENT; SWP:NA; PDB:1H3PH; EVQLVESGGGLVKPGGSLKLSCAASGFTFSSYAMSWVRQSPEKRLEWVAEVSSDGSYAYY ---------------------------3333--------1111----------------- PDTLTGRFTISRDNAKNTLYLEMTSL -------------1111--------- >HYPOTHETICAL 62.8 KDA PRO; SWP:O94312; PDB:1H3ZA; SERVNYKPGMRVLTKMSGFPWWPSMVVTESKMTSVARKSKPKRAGTFYPVIFFPNKEYLW ------2222-----2222--------3333-33331111-------------------- TGSDSLTPLTSEAISQFLEKPKPKTASLIKAYKMAQSTPDLDSLSVPS -3333-----------------------------3333-3333----- >LACTOFERRIN; SWP:P02788; PDB:1H45A; RRRSVQWCAVSQPEATKCFQWQRNMRRVRGPPVSCIKRDSPIQCIQAIAENRADAVTLDG 1111---------------------1111------------------1111-------33 GFIYEAGLAPYKLRPVAAEVYGTERQPRTHYYAVAVVKKGGSFQLNELQGLKSCHTGLRR 33--------------------3333-----------------11112222-----2222 TAGWNVPIGTLRPFLDWTGPPEPIEAAVARFFSASCVPGADKGQFPNLCRLCAGTGENKC 1111------3333--------3333----------2222333333331111--!!!!-- AFSSQEPYFSYSGAFKCLRDGAGDVAFIGESTVFEDLSDEAERDEYELLCPDNTRKPVDK --1111-----------3333-------11113333--33331111---1111---1111 FKDCHLARVPSHAVVARSVNGKEDAIWNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFK 
1111---------------------------------2222----1111-2222-----1 DSAIGFSRVPPRIDSGLYLGSG 111------------------- >Gamma-crystallin D; SWP:P07320; PDB:1H4AX; GKITLYEDRGFQGRHYECSSDHPNLQPYLSRCNSARVDSGCWMLYEQPNYSGLQYFLHRG --------%%%%------------3333-------------------%%%%--------- DYADHQQWMGLSDSVRSCRLIPHSGSHRIRLYEREDYRGQMIEFTEDCSCLQDRFRFNEI -----1111-------------------------%%%%-----------3333------- HSLNVLEGSWVLYELSNYRGRQYLLMPGDYRRYQDWGATNARVGSLRRVIDFS ---------------%%%%------------3333------------------ >POLCALCIN BET V 4; SWP:Q39419; PDB:1H4BA; ADDHPQDKAERERIFKRFDANGDGKISAAELGEALKTLGSITPDEVKHMMAEIDTDGDGF --------------------------3333-----3333--3333--------------- ISFQEFTDFGRANRGLLKDVAKIF ------------------------ >XYLANASE; SWP:Q7SIE3; PDB:1H4GA; IVTDNSIGNHDGYDYEFWKDSGGSGTMILNHGGTFSAQWNNVNNILFRKGKKFNETQTHQ ---------iiii-----------------!!!!-----------------------333 QVGNMSINYGANFQPNGNAYLCVYGWTVDPLVEYYIVDSWGNWRPPGATPKGTITVDGGT 3------------------------------------------------------iiii- YDIYETLRVNQPSIKGIATFKQYWSVRRSKRTSGTISVSNHFRAWENLGMNMGKMYEVAL ------------------------------------3333-----1111----------- TVEGYQSSGSANVYSNTLRINGNPL -------------------iiii-- >GLUCAN 1,3-BETA-GLUCOSIDA; SWP:P23776; PDB:1H4PA; YYDYDHGSLGEPIRGVNIGGWLLLEPYITPSLFEAFRTNDDNDEGIPVDEYHFCQYLGKD --11113333--------------33333333-1111-11112222--------3333-- LAKSRLQSHWSTFYQEQDFANIASQGFNLVRIPIGYWAFQILDDDPYVSGLQESYLDQAI --------------3333----1111--------3333---1111-----3333------ GWARNNSLKVWVDLHGAAGSQNGFDNSGLRDSYKFLEDSNLAVTINVLNYILKKYSAEEY ---1111---------2222---1111------11113333-----------11113333 LDIVIGIELINEPLGPVLDMDKMKNDYLAPAYEYLRNNIKSDQVIIIHDAFQPYNYWDDF -------------3333-------------------------------%%%%22221111 MTENDGYWGVTIDHHHYQVFASDQLERSIDEHIKVACEWGTGVLNESHWIVCGEFAAALT -3333---------------3333----------------1111---------------- DCIKWLNSVGFGARYDGSWVNGDQTSSYIGSCANNDDIAYWSDERKENTRRYVEAQLDAF --2222-2222-3333----!!!!---------11111111------------------- 
EMRGGWIIWCYKTESSLEWDAQRLMFNGLFPQPLTDRKYPNQCGTISN ----------------1111----1111----1111----1111---- >MERLIN; SWP:P35240; PDB:1H4RA; KTFTVRIVTMDAEMEFNCEMKWKGKDLFDLVCRTLGLRETWFFGLQYTIKDTVAWLKMDK --------3333------1111-----------------1111-----!!!!----1111 KVLDHDVSKEEPVTFHFLAKFYPENAEEELVQEITQHLFFLQVKKQILDEKIYCPPEASV 1111--------------------3333-------------------------------- LLASYAVQAKYGDYDPSVHKRGFLAQEELLPKRVINLYQMTPEMWEERITAWYAEHRGRA --------------1111-22221111---33331111---------------1111--- RDEAEMEYLKIAQDLEMYGVNYFAIRNKKGTELLLGVDALGLHIYDPENRLTPKISFPWN ----------33331111--------1111-------1111----1111--------111 EIRNISYSDKEFTIKPLDKKIDVFKFNSSKLRVNKLILQLCIGNHDLFMRRRKA 1------!!!!------3333----------------------------1111- >Histidyl-tRNA synthetase; SWP:P56194; PDB:1H4VB; TARAVRGTKDLFGKELRMHQRIVATARKVLEAAGALELVTPIFEETQVFEKGVGAKEMFT ----2222----------------------1111----------3333------------ FQDRGGRSLTLRPEGTAAMVRAYLEHGMKVWPQPVRLWMAGPMFRAERPYRQFHQVNYEA -----------------------11111111----------------------------- LGSENPILDAEAVVLLYECLKELGLRRLKVKLSSVGDPEDRARYNAYLREVLSPHREALS ---------------------------------------------------11111111- EDSKERLEENPMRILDSKSERDQALLKELGVRPMLDFLGEEARAHLKEVERHLERLSVPY 33333333-33331111---------------3333-----------------1111--- ELEPALVRGLDYYVRTAFEVHHSALGGGGRYDGLSELLGGPRVPGVGFAFGVERVALALE --1111---1111-----------------1111-1111--------------------- AEGFGLPEEKGPDLYLIPLTEEAVAEAFYLAEALRPRLRAEYALAPRKPAKGLEEALKRG -------------------3333--------1111---------------------1111 AAFAGFLGEDELRAGEVTLKRLATGEQVRLSREEVPGYLLQALG ---------------------1111-----3333---------- >TRYPSIN IVA; SWP:P35030; PDB:1H4WA; IVGGYTCEENSLPYQVSLNSGSHFCGGSLISEQWVVSAAHCYKTRIQVRLGEHNIKVLEG -------22221111----------------------1111------------1111--- NEQFINAVKIIRHPKYNRDTLDNDIMLIKLSSPAVINARVSTISLPTAPPAAGTECLISG ------------1111--------------------1111-------------------- WGNTLSFGADYPDELKCLDAPVLTQAECKASYPGKITNSMFCVGFLEGGKDSCQRDSGGP 
-------------------------------2222-1111------------2222---- VVCNGQLQGVVSWGHGCAWKNRPGVYTKVYNYVDWIKDTIAANS --iiii---------------------3333------------- >ANTI-SIGMA F FACTOR ANTAG; SWP:O32723; PDB:1H4XA; AFQLEMVTRETVVIRLFGELDHHAVEQIRAKISTAIFQGAVTTIIWNFERLSFMDSGVGL -------2222------------------------------------1111--------- VLGRMRELEAVAGRTILLNPSPTMRKVFQFSGLGPWMMDATEEEAIDRVR --------1111----------------11113333----------1111 >MALTOSE PHOSPHORYLASE; SWP:Q7SIE1; PDB:1H54A; MKRIFEVQPWNVITHTFDPKDKRLQESMTSLGNGYMGMRGDFEEGYSGDSLQGIYLGGVW -----------------3333-----1111-----------1111----------2222- YPDKTRVGWWKNGYPKYFGKVVNAVNFIKLPIEINGEPVDLAKDKISDFTLDLDMHQGVL ----------2222-------------------iiii--1111----------------- NRSFVVERGAVRVALNFQRFLSVAQPELSVQKVTVKNLSDAEVDVTLKPSIDADVMNEEA -------!!!!-------------1111----------------------------3333 NYDRFWDVLATDQQADRGSIVAKTTPNPFGTPRFTSGMEMRLVTDLKNVAITQPNEKEVT -------------1111-------------------------------------1111-- TAYTGKLAPQASAELEKRVIVVTSRDYDTQESLTAAMHQLSDKVAQSSYEDLLNAHTAIW -------2222-----------3333-----------------1111------------- AQRWEKSDVVIKGDDESQQGIRFNLFQLFSTYYGEDARLNIGPKGFTGEKYGGATYWDTE -----------------------------------1111--1111-----------3333 AFAFPVYLGITDPKVTRNLLMYRYKQLDGAYINAQEQGLKGALFPMVTFDGIECHNEWEI ----------------------------------1111------------------3333 TFEEIHRNGDIAFAIYNYTRYTGDDSYVLHEGAKVLTEISRFWADRVHFSKRNNQYMIHG ----3333------------------------------------------1111------ VTGADEYENNVDNNWDTNMLAQWTLKYTLEILGKVDQDTAKQLDVSDEEKTKWQDIVDRM -------------------------------1111------------------------- YLPYDKDLNIFVQHDGFLDKDIEPVSSIPADQRPINQNWSWDKILRSPYIKQGDVLQGIW -----1111----2222------3333-3333-3333----------------------- DFIDDYTPEQKKANFDFYEPLTVHESSLSPAIHSVLAADLHYEDKAVELYSRTARLDLDN -1111------------3333----1111--------1111--------1111-3333-1 YNNDTTDGLHITSMTGAWIAVVQGFAGMRVRDGQLHYAPFLPKTWTSYTFRQVFRDRLIE 1113333--3333-----------------iiii-------1111--------iiii--- VSVHADGPHFKLLSGEPLTIDVAGAAAAAAAA 
---1111--------------iiii------- >Insulin-like growth facto; SWP:P24593; PDB:1H59B; SALAEGQSCGVYTERCAQGLRCLPRQDEEKPLHALLHGRGVCLNE ---2222--1111---2222----1111----------------- >MURINE T CELL RECEPTOR (T; SWP:A2N3K1; PDB:1H5BA; GDQVEQSPSALSLHEGTDSALRCNFTTTMRSVQWFRQNSRGSLISLFYLASGTKENGRLK -------------2222--------------------3333--------------!!!!- SAFDSERARYSTLHIRDAQLEDSGTYFCAAEASSGSWQLIFGSGTQLTVMPVT ------------------3333------------------------------- >MYOTOXIN; SWP:P01475; PDB:1H5OA; YKQCHKKGGHCFPKEKICLPPSSDFGKMDCRWRWKCCKKGSG --3333------1111--------------1111-------- >NUCLEAR AUTOANTIGEN SP100; SWP:P23497; PDB:1H5PA; MDENINFKQSELPVTCGEVKGTLYKERFKQGTSKKCIQSEDKKWFTPREFEIEGDRGASK ------1111-----iiii-----1111-!!!!-----1111------------------ NWKLSIRCGGYTLKVLMENKFLPEPPSTRKKVTIK ----------------------------------- >NADP-DEPENDENT MANNITOL D; SWP:O93868; PDB:1H5QA; PGFTISFVNKTIIVTGGNRGIGLAFTRAVAAAGANVAVIYRSAADAVEVTEKVGKEFGVK ------2222---------3333-------------------1111-------------- TKAYQCDVSNTDIVTKTIQQIDADLGPISGLIANAGVSVVKPATELTHEDFAFVYDVNVF ------1111-------------------------------3333--------------- GVFNTCRAVAKLWLQKQQKGSIVVTSSMSSQIINQSSLNGSLTQVFYNSSKAACSNLVKG --------------------------3333------2222---3333------------- LAAEWASAGIRVNALSPGYVNTDQTAHMDKKIRDHQASNIPLNRFAQPEEMTGQAILLLS ----3333-------------3333---3333--1111-1111---3333---------3 DHATYMTGGEYFIDGGQLIW 333----------iiii--- >GLUCOSE-1-PHOSPHATE THYMI; SWP:P37744; PDB:1H5RA; KMRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDT ------------3333-1111--1111------3333------1111--------3333- PRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPK ------!!!!1111------------3333------3333-------1111---2222-- LMEAAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLEPKSNYAVTGLYFYDN --------------------3333------1111-------------------------- DVVQMAKNLKPSARGELEITDINRIYLEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIA -----1111--1111--3333-----1111-------1111------------------- TIEERQGLKVSCPEEIAFRKGFIDVEQVRKLAVPLIKNNYGQYLYKMTKD 
-----------------1111----------3333----------1111- >UPPER COLLAR PROTEIN; SWP:P04332; PDB:1H5WA; RQKRNRWFIHYLNYLQSLAYQLFEWENLPPTINPSFLEKSIHQFGYVGFYKDPVISYIAC ------------------1111------1111---------------------------- NGALSGQRDVYNQATVFRAASPVYQKEFKLYNYRDMKEEDMGVVIYNNDMAFPTTPTLEL --------1111--------1111--------3333-----------1111--------- FAAELAELKEIISVNQNAQKTPVLIRANDSLKQVYNQYEGNAPVIFAHEALDSDSIEVFK -----------------3333--------------------------3333--------- TDAPYVVDKLNAQKNAVWNEMMTFLGIKNSNDEQIESSGTVFLKSREEACEKINELYGLN ----------------------1111---------------------------------- VKVKFRYDI --------- >HISF; SWP:Q8ZY16; PDB:1H5YA; HMALRIIPCLDIDGGAKVVVKGVNFQGIREVGDPVEMAVRYEEEGADEIAILDITAAPEG -------------3333--1111-3333----------------------------3333 RATFIDSVKRVAEAVSIPVLVGGGVRSLEDATTLFRAGADKVSVNTAAVRNPQLVALLAR ------------------------------------------------------------ EFGSQSTVVAIDAKWNGEYYEVYVKGGREATGLDAVKWAKEVEELGAGEILLTSIDRDGT --3333------------------iiii--------------------------1111-- GLGYDVELIRRVADSVRIPVIASGGAGRVEHFYEAAAAGADAVLAASLFHFRVLSIAQVK ------------3333-----------3333----1111---------1111-------- RYLKERGVEVRI ---1111----- >Putative snRNP Sm-like pr; SWP:Q9V0Y8; PDB:1H641; ERPLDVIHRSLDKDVLVILKKGFEFRGRLIGYDIHLNVVLADAEMIQDGEVVKRYGKIVI -3333----2222-------------------1111----------iiii---------- RGDNVLAISPT 3333------- >CHLOROPLAST OUTER ENVELOP; SWP:Q41009; PDB:1H65A; VREWSGINTFAPATQTKLLELLGNLKQEDVNSLTILVGKGGVGKSSTVNSIIGERVVSIS ----3333--3333-----------1111------------------------------- PFQSEGPRPVVSRSRAGFTLNIIDTPGLIEGGYINDALNIIKSFLLDKTIDVLLYVDRLD --------------iiii-----------iiii---------1111-------------- AYRVDNLDKLVAKAITDSFGKGIWNKAIVALTHAQFSPPDGLPYDEFFSKRSEALLQVVR -----------------------1111-----------%%%%------------------ SGASLKKDAQASDIPVVLIENSGRCNKNDSDEKVLPNGIAWIPHLVQTITEVALNKSESI -----11113333--------1111--1111---1111---------------------- FVDKNLIDKLAAAD ---------3333- >CALPONIN ALPHA; SWP:Q9PSG0; PDB:1H67A; 
MPQTERQLRVWIEGATGRRIGDNFMDGLKDGVILCELINKLQPGSVQKVNDPVQNWHKLE ----3333----------------3333-------------------------------- NIGNFLRAIKHYGVKPHDIFEANDLFENTNHTQVQSTLIALASQAKTK --------------3333--3333------------------------ >PRECURSOR FORM OF GLUCOSE; SWP:P75002; PDB:1H6DA; QAATLPAGASQVPTTPAGRPMPYAIRPMPEDRRFGYAIVGLGKYALNQILPGFAGCQHSR -----3333-----------------------------------------3333------ IEALVSGNAEKAKIVAAEYGVDPRKIYDYSNFDKIAKDPKIDAVYIILPNSLHAEFAIRA ----------------1111-1111--3333------1111-------3333-------- FKAGKHVMCEKPMATSVADCQRMIDAAKAANKKLMIGYRCHYDPMNRAAVKLIRENQLGK 1111---------------------------------3333--------------1111- LGMVTTDNSDVMDQNDPAQQWRLRRELAGGGSLMDIGIYGLNGTRYLLGEEPIEVRAYTY ------------111111111111------3333-------------------------- SDPNDERFVEVEDRIIWQMRFRSGALSHGASSYSTTTTSRFSVQGDKAVLLMDPATGYYQ -1111---------------3333------------------------------------ NLISVQTPGHANQSMMPQFIMPANNQFSAQLDHLAEAVINNKPVRSPGEEGMQDVRLIQA ------2222------------------------------------3333---------- IYEAARTGRPVNTDWGYVRQGGY ------------------2222- >T-BOX TRANSCRIPTION FACTO; SWP:O15119; PDB:1H6FA; DPKVHLEAKELWDQFHKRGTEMVITKSGRRMFPPFKVRCSGLDKKAKYILLMDIIAADDC ---------------1111-----1111--------------1111-------------- RYKFHNSRWMVAGKADPEMPKRMYIHPDSPATGEQWMSKVVTFHKLKLTNNISDKHGFTI ----%%%%-----------------1111------1111---1111----1111------ LNSMHKYQPRFHIVRANDILKLPYSTFRTYLFPETEFIAVTAYQNDKITQLKIDNNPFAK -2222------------3333----------1111--------------------33331 GFRD 111- >ALPHA-1 CATENIN; SWP:P35221; PDB:1H6GA; DLRRQLRKAVDHVSDSFLETNVPLLVLIEAAKNGNEKEVKEYAQVFREHANKLIEVANLA ---------------1111---------------------------------------33 CSISNNEEGVKLVRSASQLEALCPQVINAALALAAKPQSKLAQENDLFKEQWEKQVRVLT 33---3333--------------------------1111--------------------- DAVDDITSIDDFLAVSENHILEDVNKCVIALQEKDVDGLDRTAGAIRGRAARVIHVVTSE ---1111-----------------------1111-------------------------- DNYEPGVYTEKVLEATKLLSNTVPRFTEQVEAAVEALSSDPAQPDENEFIDASRLVYDGI 
------------------------------------------------------------ RDIRKAVL -------- >NEUTROPHIL CYTOSOL FACTOR; SWP:Q15080; PDB:1H6HA; AVAQQLRAESDFEQLPDDVAISANIADIEEKRGFTSHFVFVIEVKTKGGSKYLIYRRYRQ 3333-------33331111--------------------------1111-------3333 FHALQSKLEERFGPDSKSSALACTLPTLPAKVYVGVKQEIAEMRIPALNAYMKSLLSLPV ------------1111--1111--------------------------------111133 WVLMDEDVRIFFYQSPYDSEQVP 33------------3333----- >AQUAPORIN-1; SWP:P29972; PDB:1H6IA; LFWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQTAVQDNVKVSLAFGLSIATLAQSVG 3333--------------------3333-----------------------------111 HISGAHLNPAVTLGLLLSCQISIFRALMYIIAQCVGAIVATAILSGITSSLTGNSLGRND 1------------3333-------------------------3333-------------- LADGVNSGQGLGIEIIGTLQLVLCVLATTDRRRRDLGGSAPLAIGLSVALGHLLAIDYTG ------------------------------------------------------------ CGINPARSFGSAVITHNFSNHWIFWVGPFIGGALAVLIYDFILAP -----------------1111-1111------------------- >CBP80; SWP:Q09161; PDB:1H6KA; TEDHLESLICKVGEKSACSLESNLEGLAGVLEADLPNYKSKILRLLCTVARLLPEKLTIY 3333------2222--------------------------------------3333---- TTLVGLLNARNYNFGGEFVEAMIRQLKESLKANNYNEAVYLVRFLSDLVNCHVIAAPSMV ----------------------------------3333--------3333---------- AMFENFVSVTQEEDVPQVRRDWYVYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRR ------3333-----3333-------1111--------------------------1111 QKTHVPMLQVWTADKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQH -3333------------------------------%%%%-----3333------3333-- NLPPFTPPPHTEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLHCIIKSHW --------------------------33331111----1111----------------33 KERKTCAAQLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHIDVMYTTLLIELCKLQPGS 33-------1111------3333--------1111------3333--------------- LPQVLAQATEMLYMRLDTMNTTCVDRFINWFSHHLSNFQFRWSWEDWSDCLSQDPESPKP ---------------1111------------------%%%%-33333333---1111--- KFVREVLEKCMRLSYHQRILDIVPPTFSALCPSNPTCIYKYGDESSNSLPGHSVALCLAV ----------1111----1111-33331111-----------------2222-------- AFKSKATNDEIFSILKDVPNFNPLKIEVFVQTLLHLAAKSFSHSFSALAKFHEVFKTLAE 
----------------------------------1111---------------------- SDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAVANWIFSSELSRDFTRLFVW ---------------1111---------------------------33331111------ EILHSTIRKMNKHVLKIQKELEEAKEKEQIERLQEKVESAQSEQKNLFLVIFQRFIMILT ---------------------3333---3333---------------------------- EHLVRCETDGTSVLTPWYKNCIERLQQIFLQHHQIIQQYMVTLENLLFTAELDPHILAVF ------1111-----------------------3333-----------11113333---- QQFCALQA -------- >Nuclear cap-binding prote; SWP:P52298; PDB:1H6KZ; KSCTLYVGNLSFYTTEEQIYELFSKSGDIKKIIMGLDKMKCGFCFVEYYSRADAENAMRY ----------1111---------1111---------1111-------------------- INGTRLDDRIIRTDWDAG 2222-%%%%--------- >PHYTASE; SWP:O66037; PDB:1H6LA; KLSDPYHFTVNAAAETEPVDTAGDAADDPAIWLDPKNPQNSKLITTNKKSGLAVYSLEGK ------------------------------------3333------1111-----1111- MLHSYHTGKLNNVDIRYDFPLNGKKVDIAAASNRSEGKNTIEIYAIDGKNGTLQSITDPN --------------------iiii----------2222-------------------111 RPIASAIDEVYGFSLYHSQKTGKYYAMVTGKEGEFEQYELNADKNGYISGKKVRAFKMNS 1----------------------------------------------------------- QTEGMAADDEYGSLYIAEEDEAIWKFSAEPDGGSNGTVIDRADGRHLTPDIEGLTIYYAA ----------------------------1111---------------------------i DGKGYLLASSQGNSSYAIYERQGQNKYVADFQITDGPETDGTSDTDGIDVLGFGLGPEYP iii------------------!!!!----------1111----------------3333- FGLFVAQNGENIDHGQKANQNFKMVPWERIADKIGFHPQVNKQVDPRKMTDRS ------------iiii---------33333333-----1111--1111----- >TELOMERIC REPEAT BINDING ; SWP:P54274; PDB:1H6OA; EDAGLVAEAEAVAAGWMLDFLCLSLCRAFRDGRSEDFRRTRNSAEAIIHGLSSLTACQLR ------------------------------------------------------------ TIYICQFLTRIAAGKTLQFENDERITPLESALMIWGSIEKEHDKLHEEIQNLIKIQAIAV ------------1111---3333------------------------------------- CMENGNFKEAEEVFERIFGDPSKLLMIISQKDTFHSFFQHFSYNHMMEKIKSYVNYVLSE -----------------------3333--------------------------------- KSSTFLMKAAAKVVE --------------- >TELOMERIC REPEAT BINDING ; SWP:Q15554; PDB:1H6PA; AGEARLEEAVNRWVLKFYFHEALRAFRGSRYGDFRQIRDIMQALLVRPLGVSRLLRVMQC 
-------------------------1111--------------1111------------- LSRIEEGENLSFDMEAELTPLESAINVLEMIKTEFTLTEAVVESSRKLVKEAAVIICIKN ---1111-----1111---------------------3333------------------- KEFEKASKILKKHMRNDLLNIIREKNLAHPVIQNFSYETFQQKMLRFLESHLDDAEPYLL ------------------------------------------------1111-------- TMAKKALK ----1111 >TRANSLATIONALLY CONTROLLE; SWP:Q10344; PDB:1H6QA; MLLYKDVISGDELVSDAYDLKEVDDIVYEADCQMVTVKQGGDVDIGANPSAEDAEENAEE ------------------------------------------------------------ GTETVNNLVYSFRLSPTSFDKKSYMSYIKGYMKAIKARLQESNPERVPVFEKNAIGFVKK -------------------3333-------------------1111-------------- ILANFKDYDFYIGESMDPDAMVVLMNYREDGITPYMIFFKDGLVSEKF ----3333----3333----------------------3333------ >GREEN FLUORESCENT PROTEIN; SWP:P42212; PDB:1H6RA; SKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFIVTTGKLPVPWPTLV -!!!!---------------iiii----------3333----------------3333-3 TTFLQCFARYPDHMKRHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIE 3333333---111111113333-------------2222-----------!!!!------ LKGIDFKEDGNILGHKLEYNYNSHCVYIVADKQKNGIKVNFKIRHNIEDGSVQLADHYQQ ------1111-1111---------------3333------------1111---------- NTPIGDGPVLLPDNHYLCYQSALSKDPNEKRDHMVLLEFVTAAGITH -------------------------1111------------------ >INTERNALIN B; SWP:P25147; PDB:1H6TA; GPLGSETITVPTPIKQIFSDDAFAETIKDNLKKKSVTDAVTQNELNSIDQIIANNSDIKS --3333------3333------------------3333------1111------------ VQGIQYLPNVTKLFLNGNKLTDIKPLANLKNLGWLFLDENKVKDLSSLKDLKKLKSLSLE 2222--1111------------3333--1111-----------11111111--------- HNGISDINGLVHLPQLESLYLGNNKITDITVLSRLTKLDTLSLEDNQISDIVPLAGLTKL ------3333--1111------------3333------------------3333--1111 QNLYLSKNHISDLRALAGLKNLDVLELFSQECLNKPINHQSNLVVPNTVKNTDGSLVTPE ------------3333--1111----------------------------1111------ IISDDGDYEKPNVKWHLPEFTNEVSFIFYQPVTIGKAKARFHGRVTQPLKE --%%%%-----------1111------------!!!!-------------- >INTERNALIN H; SWP:Q9ZEY1; PDB:1H6UA; GSITQPTAINVIFPDPALANAIKIAAGKSNVTDTVTQADLDGITTLSAFGTGVTTIEGVQ 
-------1111------------------1111------1111--------------333 YLNNLIGLELKDNQITDLAPLKNLTKITELELSGNPLKNVSAIAGLQSIKTLDLTSTQIT 31111------------3333------------------3333--3333----------- DVTPLAGLSNLQVLYLDLNQITNISPLAGLTNLQYLSIGNAQVSDLTPLANLSKLTTLKA -3333--1111------------3333--1111------------3333--1111----- DDNKISDISPLASLPNLIEVHLKNNQISDVSPLANTSNLFIVTLTNQTITNQPVFYNNNL -------3333--1111------------3333--1111--------------------- VVPNVVKGPSGAPIAPATISDNGTYASPNLTWNLTSFINNVSYTFNQSVTFKNTTVPFSG -------1111--------%%%%------------------------------------- TVTQPLTE -------- >THIOREDOXIN REDUCTASE; SWP:O89049; PDB:1H6VA; SYDFDLIIIGGGSGGLAAAKEAAKFDKKVMVLDFVTPTPLGTNWGLGGTCVNVGCIPKKL -----------3333-------1111-----------1111------3333--------- MHQAALLGQALKDSRNYGWKLEDTVKHDWEKMTESVQNHIGSLNWGYRVALREKKVVYEN ------------3333-------------------------------------------- AYGKFIGPHKIMATNNKGKEKVYSAERFLIATGERPRYLGIPGDKEYCISSDDLFSLPYC ------2222----1111----------------------2222-----3333------- PGKTLVVGASYVALECAGFLAGIGLDVTVMVRSILLRGFDQDMANKIGEHMEEHGIKFIR --------------------1111------------------------------------ QFVPTKIEQIEAGTPGRLKVTAKSTNSEETIEDEFNTVLLAVGRDSCTRTIGLETVGVKI ---------------------------------------------------3333----- NEKTGKIPVTDEEQTNVPYIYAIGDILEGKLELTPVAIQAGRLLAQRLYGGSTVKCDYDN ----------------1111---3333--------------------------------- VPTTVFTPLEYGCCGLSEEKAVEKFGEENIEVYHSFFWPLEWTVPSRDNNKCYAKVICNL ----------------3333------1111--------33331111------------11 KDNERVVGFHVLGPNAGEVTQGFAAALKCGLTKQQLDSTIGIHPVCAEIFTTLSVTKRSG 11------------3333--3333--1111--------------33331111---3333- GDILQSGCCG ---------- >PYRUVATE PHOSPHATE DIKINA; SWP:O76283; PDB:1H6ZA; VAKKWVYYFGGGNADGNKNMKELLGGKGANLAEMVNLGIPVPPGFTITTEACKTYQETET ----------------3333---------------------------3333--------- IPQEVADQVRENVSRVEKEMGAKFGDPANPLLFSVRSGAAASDTVLNLGLNKVTVDAWVR -3333-----------------2222------------3333--------3333--3333 RAPRLERFVYDSYRRFITMYADIVMQVGREDFEEALSRMKERRGTKFDTDLTASDLKELC 
-3333-----------------------1111---11113333---3333-1111----- DGYLELFELKTGCSFPQDPVMQLFAAIKAVFRSWGNPRATIYRRMNNITGLLGTAVNVQA -------------------------------1111----3333----------------- MVFGNINDRSATGVAFSRSPSTGENFFFGEYLVNAQGEDVVAGIRTPQQINHSLSLRWAK -------------------------------------3333----------3333----- AHGVGEEERRKRYPSMEEAMPENYRLLCDVRKRLENHYRDMQDLEFTVQDGRLWLLQCRN ----3333------3333---------------3333----------------------- GKRTIHAAVRIAIDMVNEGLISREEAVLRIDPYQVDHLMHPNLEPGAEKANKPIGRGLAA ---------------3333-----------3333-1111----3333------------- SPGAAVGQVVFDAESAKEWSGRGKKVIMVRLETSPEDLAGMDAACGILTARGGMTSHAAV -----------3333------------------11111111-----------1111---- VARGMGKCCVSGCGDMVIRGKSFKLNGSVFREGDYITIDGSKGLIYAGKLKLRSPDLKGS --1111-------------------------------1111------------------3 FQTILQWCQEMKRLGVRTNADTPADAAKARSFGAEGVGLCRTEHMFFEGSRINFIREMIL 333----------------------3333-----------------------33333333 ADSASGRKAALDKLLPIQRADFVGILRAMRGLPVTIRLLDPPLHEFVPHDAAAQFELAQK --3333---3333------------3333------------3333-------3333---- LGMPAEKVRNRVNALHELNPMLGHRGCRLGITYPEIYNMQVRAIIEAAIAVSEEGSSVIP -------33333333-----------3333--1111-------------3333------- EIMVPLVGKKEELSLIREEVVKTAEAVITKSGKRVHYTVGTMIEVPRAAVTADSIAQKAD ----------------------------------------------------3333---- FFSFGTNDLTQMGCGFSRDDAGPFLRHYGNLGIYAQDPFQSIDQEGIGELVRIAVTKGRR -----------1111--1111--3333---------1111--1111-----------333 VKPMLKMGICGEHGGDPATIGFCHKVGLDYVSCSPFRVPVAIVAAAHASIKDRRAAMK 3---------3333-------------------1111--------------------- >NG,NG-DIMETHYLARGININE DI; SWP:Q9I4E3; PDB:1H70A; FMFKHIIARTPARSLVDGLTSSHLGKPDYAKALEQHNAYIRALQTCDVDITLLPPDERFP -----------1111---------------------------1111---------3333- DSVFVEDPVLCTSRCAIITRPGAESRRGETEIIEETVQRFYPGKVERIEAPGTVEAGDIM 33333333---1111-------3333--3333--------2222-----------1111- MVGDHFYIGESARTNAEGARQMIAILEKHGLSGSVVRLEKVLHLKTGLAYLEHNNLLAAG -!!!!-----1111------------1111------------1111-----iiii---!! 
EFVSKPEFQDFNIIEIPEEESYAANCIWVNERVIMPAGYPRTREKIARLGYRVIEVDTSE !!--3333--------33331111----iiii-------------3333--------333 YRKIDGGVSSMSLRF 3-----3333----- >Homoserine kinase; SWP:Q58504; PDB:1H72C; MKVRVKAPCTSANLGVGFDVFGLCLKEPYDVIEVEAIDDKEIIIEVDDKNIPTDPDKNVA -------------!!!!------------------------------1111--1111--- GIVAKKMIDDFNIGKGVKITIKKGVKAGSGLGSSAASSAGTAYAINELFKLNLDKLKLVD --------1111----------------------------------1111---------- YASYGELASSGAKHADNVAPAIFGGFTMVTNYEPLEVLHIPIDFKLDILIAIPNISINTK ------------------------------------------------------------ EAREILPKAVGLKDLVNNVGKACGMVYALYNKDKSLFGRYMMSDKVIEPVRGKLIPNYFK --1111----3333--------------1111-------1111---3333----2222-- IKEEVKDKVYGITISGSGPSIIAFPKEEFIDEVENILRDYYENTIRTEVGKGVEVV ----1111------!!!!-------3333--------------------------- >GLUTAREDOXIN-LIKE PROTEIN; SWP:Q47414; PDB:1H75A; MRITIYTRNDCVQCHATKRAMENRGFDFEMINVDRVPEAAEALRAQGFRQLPVVIAGDLS ---------------------1111------3333--------1111--------!!!!- WSGFRPDMINRLHPAP ----3333-1111--- >SEROTRANSFERRIN; SWP:P09571; PDB:1H76A; QKTVRWCTISNQEANKCSSFRENMSKAVKNGPLVSCVKKSSYLDCIKAIRDKEADAVTLD ------------------------------------------------1111-------3 AGLVFEAGLAPYNLKPVVAEFYGQKDNPQTHYYAVAVVKKGSNFQWNQLQGKRSCHTGLG 333--------------------1111-----------2222--11112222-----222 RSAGWIIPMGLLYDQLPEPRKPIEKAVASFFSSSCVPCADPVNFPKLCQQCAGKGAEKCA 23333------3333------3333----------2222----33331111--!!!!--- CSNHEPYFGYAGAFNCLKEDAGDVAFVKHSTVLENLPDKADRDQYELLCRDNTRRPVDDY -3333----------------------11113333-----3333----1111---11111 ENCYLAQVPSHAVVARSVDGQEDSIWELLNQAQEHFGRDKSPDFQLFSSSHGKDLLFKDS 111---------------------------------2222--------1111-----111 ANGFLKIPSKMDSSLYLGYQYVTALRNLREEECKKVRWCAIGHEETQKCDAWSINSGGKI 1------1111------------------------------------------1111--- ECVSAENTEDCIAKIVKGEADAMSLDGGYIYIAGKCGLVPVLAENYKTEGENCVNTPEKG ---------------------------------1111------------1111------- YLAVAVVKKSSGPDLNWNNLKGKKSCHTAVDRTAGWNIPMGLLYNKINSCKFDQFFGEGC 
-------3333333311112222-----22223333--------------1111------ APGSQRNSSLCALCIGSERAPGRECLANNHERYYGYTGAFRCLVEKGDVAFVKDQVVQQN 22221111--1111--------2222----1111------------------22221111 TDGKNKDDWAKDLKQMDFELLCQNGAREPVDNAENCHLARAPNHAVVARDDKVTCVAEEL iiii--3333---3333----1111---11111111------------3333-------- LKQQAQFGRHVTDCSSSFCMFKSNTKDLLFRDDTQCLARVGKTTYESYLGADYITAVANL -------3333-------1111--------1111-------------------------3 RKCSTSKLLEACTFHSA 333-------1111--- >TUBULIN-SPECIFIC CHAPERON; SWP:O75347; PDB:1H7CA; PRVRQIKIKTGVVRRLVKERVYEKEAKQQEEKIEKRAEDGENYDIKKQAEILQESRIPDC -----------------------------------------3333--------------- QRRLEAAYLDLQRILENEKDLEEAEEYKEARLVLDSVKL -----------------3333------------3333-- >AMINOLEVULINIC ACID SYNTH; SWP:Q64452; PDB:1H7DA; MVAAAMLLRSCPVLSQGPTGLLGKVAKTYQFLFSIGRCPILATQGPTCS ---------------------3333-3333------------------- >3-DEOXY-MANNO-OCTULOSONAT; SWP:P42216; PDB:1H7EA; SKAVIVIPARYGSSRLPGKPLLDIVGKPMIQHVYERALQVAGVAEVWVATDDPRVEQAVQ -----------------------iiii3333--------2222----------------1 AFGGKAIMTRNDHESGTDRLVEVMHKVEADIYINLQGDEPMIRPRDVETLLQGMRDDPAL 111-------------------3333---------1111---3333----------1111 PVATLCHAISAAEAAEPSTVKVVVNTRQDALYFSRSPIPYPRNAEKARYLKHVGIYAYRR ---------3333--1111-----1111--------------3333-------------- DVLQNYSQLPESMPEQAESLEQLRLMNAGINIRTFEVAATGPGVDTPACLEKVRALMAQE ----3333-----------3333--1111----------------3333----------- LAENA ----- >PMS1 PROTEIN HOMOLOG 2; SWP:P54278; PDB:1H7SA; GQVVLSLSTAVKELVENSLDAGATNIDLKLKDYGVDLIEVSDNGCGVEEENFEGLTLADL ------------------------------%%%%-------------3333-3333---- TQVETFGFRGEALSSLCALSDVTISTCHASAKVGTRLFDHNGKIIQKTPYPRPRGTTVSV ---------------------------3333-------1111------------------ QQLFSTLPVRHKEFQRNIKKEYAKVQVLHAYCIISAGIRVSCTNQLGQGKRQPVVCTGGS -2222-----------------------------2222-------!!!!----------- PSIKENIGSVFGQKQLQSLIPFVQLPPSDSVCEEYGLSCSDALHNLFYISGFISQCTHGV ---------------1111-------------1111-3333---------------2222 
GRSSTDRQFFFINRRPCDPAKVCRLVNEVYHYNRHQYPFVVLNISVDSECVDINQILLQE -----------iiii-----------------1111----------3333---------3 EKLLLAVLKTSLIGFDS 333-------------- >ADENOVIRUS FIBRE PROTEIN; SWP:P04501; PDB:1H7ZA; KNNTLWTGPKPEANCIIEYGKQNPDSKLTLILVKNGGIVNGYVTLMGASDYVNTLFKNKN -----------------2222-------------!!!!----------3333-------- VSINVELYFDATGHILPDSSSLKTDLELKYKQTADFSARGFMPSTTAYPFVLPNAGTHNE ---------1111---1111-----------------1111-----------------11 NYIFGQCYYKASDGALFPLEVTVMLNKRLPDSRTSYVMTFLWSLNAGLAPETTQATLITS 11--------1111---------------------------------------------- PFTFSYIREDD ----------- >IOTA-CARRAGEENASE; SWP:Q9F5I8; PDB:1H80A; VSPKTYKDADFYVAPTQQDVNYDLVDDFGANGNDTSDDSNALQRAINAISRKPNGGTLLI ------3333------------3333-------------------------1111----- PNGTYHFLGIQMKSNVHIRVESDVIIKPTWNGDGKNHRLFEVGVNNIVRNFSFQGLGNGF --------------------1111-------------------------------!!!!- LVDFKDSRDKNLAVFKLGDVRNYKISNFTIDDNKTIFASILVDVTERNGRLHWSRNGIIE ---1111---------------------------------------iiii---------- RIKQNNALFGYGLIQTYGADNILFRNLHSEGGIALRMETDNLLMKNYKQGGIRNIFADNI -------1111-----------------------------33331111------------ RCSKGLAAVMFGPHFMKNGDVQVTNVSSVSCGSAVRSDSGFVELFSGCAQTPAARVTQKD ------------!!!!------------------------------------------33 ACLDKAKLEYGIEPGSFGTVKVFDVTARFGYNADLKQDQLDYFSTSNPMCKRVCLPTKEQ 33---------------------------------1111-1111--1111------3333 WSKQGQIYIGPSLAAVIDTTPETSKYDYDVKTFNVKRINFPVNSHKTIDTNTESSRVCNY ----------------------------------------2222----1111------11 YGMSECSSSRWER 11----------- >Transforming protein Myb; SWP:P01104; PDB:1H8AC; NPELNKGPWTKEEDQRVIEHVQKYGPKRWSDIAKHLKGRIGKQCRERWHNHLNPEVKKTS ------------------------------3333-----3333---------3333---- WTEEEDRIIYQAHKRLGNRWAEIAKLLPGRTDNAVKNHWNSTMRR -------------------33331111--------------3333 >ALPHA-ACTININ 2, SKELETAL; SWP:P35609; PDB:1H8BA; MADTDTAEQVIASFRILASDKPYILAEELRRELPPDQAQYCIKRMPAYSGPGSVPGALDY ---------------------------------3333---3333---------------- AAFSSALYGESDL ------------- 
>FAS-ASSOCIATED FACTOR 1; SWP:Q9UNN5; PDB:1H8CA; NAEPVSKLRIRTPSGEFLERRFLASNKLQIVFDFVASKGFPWDEYKLLSTFPRRDVTQLD -----------1111-------11113333----1111---------------------1 PNKSLLEVKLFPQETLFLEAKE 1113333--------------- >MAJOR AUTOLYSIN; SWP:P06653; PDB:1H8GA; TDGNWYWFDNSGEATGWKKIADKWYYFNEEGAKTGWVKYKDTWYYLDAKEGAVSNAFIQS --------1111-------%%%%----1111----------------------------1 ADGTGWYYLKPDGTLADRPEFTVEPDGLITVK 111------1111---3333---1111----- >SPECTRIN ALPHA CHAIN; SWP:P07751; PDB:1H8KA; KELVLVLYDYQEKSPREVTVKKGDILTLLNSTNKDWWKVEVDDRQGFIPAAYLKKLD --------------------------------1111--------------------- >CARBOXYPEPTIDASE GP180 RE; SWP:Q90240; PDB:1H8LA; QAVQPVDFRHHHFSDMEIFLRRYANEYPSITRLYSVGKSVELRELYVMEISDNPGIHEAG --------------------------3333----------------------2222-222 EPEFKYIGNMHGNEVVGRELLLNLIEYLCKNFGTDPEVTDLVQSTRIHIMPSMNPDGYEK 2---------1111--------------1111-------------------------111 SQEGDRGGTVGRNNSNNYDLNRNFPDQFFQVTDPPQPETLAVMSWLKTYPFVLSANLHGG 12222---2222-1111-1111----------------------1111------------ SLVVNYPFDDDEQGIAIYSKSPDDAVFQQLALSYSKENKKMYQGSPCKDLYPTEYFPHGI ----------1111-------------------3333-3333----11111111-2222- TNGAQWYNVPGGMQDWNYLNTNCFEVTIELGCVKYPKAEELPKYWEQNRRSLLQFIKQVH ---3333-----------------------------3333-----------------111 RGIWGFVLDATDGRGILNATISVADINHPVTTYKDGDYWRLLVQGTYKVTASARGYDPVT 1-------------------------------1111----------------2222---- KTVEVDSKGGVQVNFTLSRT -------------------- >SYNAPTOBREVIN HOMOLOG 1; SWP:P36015; PDB:1H8MA; MRIYYIGVFRSGGEKALELSEVKDLSQFGFFERSSVGQFMTFFAETVASRTGAGERQSIE ------------------------------------------------------------ EGNYIGHVYARSEGICGVLITDKQYPVRPAYTLLNKILDEYLVAHPKEEWADVTETNDAL ---------------------1111--------------------3333-------3333 KMKQLDTYISKYQDPSQADA ---------3333------- >MUTANT AL2 6E7S9G; SWP:NA; PDB:1H8NA; KDIVLTQSHKFMSTSVGDRVSITCKASQDVGTAVAWYQQKPGQSPKLLIYWASTRHTGVP --------------2222---------------------2222------------22223 DRFTGSGSGTDFTLTISNVQSEDLADYFCQQYSSYPLTFGAGTKLELQVQLQESGGELVR 
333----------------1111------------------------------------2 PGASVKLSCKASGYTFTSYWINWVKQRPGQGLEWIGNIYPSDSYTNYNQKFKDKATLTVD 222-----------1111--------2222-----------------3333--------- KSSSTAYMQLSSLTSEDSAVYFCARWGYWGQGTLVTVSA 1111---------3333-------2222----------- >SEMINAL PLASMA PROTEIN PD; SWP:P02784; PDB:1H8PA; EECVFPFVYRNRKHFDCTVHGSLFPWCSLDADYVGRWKYCAQRDYAKCVFPFIYGGKKYE --------%%%%------2222------------------3333---------iiii--- TCTKIGSMWMSWCSLSPNYDKDRAWKYC ---2222----------3333------- >ECHOVIRUS 11 COAT PROTEIN; SWP:P29813; PDB:1H8TA; GDVVEAVENAVARVADTIGSGPSNSQAVPALTAVETGHTSQVTPSDTVQTRHVKNYHSRS ---------------------------1111-----------3333------------11 ESSIENFLSRSACVYMGEYHTTNSDQTKLFASWTISARRMVQMRRKLEIFTYVRFDVEVT 11-1111------------------1111------------------------------- FVITSKQDQGTQLGQDMPPLTHQIMYIPPGGPIPKSVTDYTWQTSTNPSIFWTEGNAPPR -----------------------------------1111-------------2222---- MSIPFISIGNAYSNFYDGWSHFSQNGVYGYNTLNHMGQIYVRHVNGSSPLPMTSTVRMYF --------------------1111----3333---------------------------- KPKHVKAWVPRPPRLCQYKNASTVNFSPTDITDKRNSITYIPDTVKPDV ------------------------------------1111--------- >Genome polyprotein; SWP:P29813; PDB:1H8TB; SDRVRSITLGNSTITTQESANVVVGYGRWPEYLRDDEATAEDQPTQPDVATCRFYTLESV --------!!!!---------------------1111----------3333--------- TWEKDSPGWWWKFPDALKDMGLFGQNMYYHYLGRAGYTIHVQCNASKFHQGCLLVVCVPE ----------------1111-------------------------1111----------- AEMGCSTVDGTVNEHGLSEGETAKKFSATGTNGTNTVQSIVTNAGMGVGVGNLTIFPHQW ------1111---3333----------------------3333-----11111111---- INLRTNNCATIVMPYINNVPMDNMFRHHNFTLMIIPFVPLNYSSDFSTYVPITVTVAPMC -3333------------------------------------------------------- AEYNGLRLSTAL ------------ >Genome polyprotein; SWP:P29813; PDB:1H8TC; GLPVINTPGSNQFLTSDDFQSPSAMPQFDVTPELNIPGEVQNLMEIAEVDSVVPVNNVAG ------2222---1111-------2222-----------------1111--------222 NLETMDIYRIPVQSGNHQSSQVFGFQVQPGLDGVFKHTLLGEILNYYAHWSGSIKLTFVF 2--3333---------3333-------11113333--------1111------------- 
CGSAMATGKFLLAYAPPGANAPKSRKDAMLGTHIIWDVGLQSSCVLCIPWISQTHYRLVQ --1111-----------------33331111----------------------------- QDEYTSAGNVTCWYQTGIVVPAGTPTSCSIMCFVSACNDFSVRLLKDTPFIQQAALLQ -3333----------------------------------------------------- >EOSINOPHIL GRANULE MAJOR ; SWP:P13727; PDB:1H8UA; RYLLVRSLQTFSQAWFTCRRCYRGNLVSIHNFNINYRIQCSVSALNQGQVWIGGRITGSG ---------------------------------------1111----------------- RCRRFQWVDGSRWNFAYWAAHQPWSRGGHCVALCTRGGYWRRAHCLRRLPFICSY ------1111-------------2222----------------1111-------- >Y-BOX BINDING PROTEIN; SWP:P67809; PDB:1H95A; MKKVIATKVLGTVKWFNVRNGYGFINRNDTKEDVFVHQTAIKKNNPRKYLRSVGDGETVE ------------------------------------1111-------------------- FDVVEGEKGAEAANVTGPG ------------------- >GLOBIN-3; SWP:P80721; PDB:1H97A; TLTKHEQDILLKELGPHVDTPAHIVETGLGAYHALFTAHPQYISHFSRLEGHTIENVMQS -------------1111-----------------------3333-1111---33331111 EGIKHYARTLTEAIVHMLKEISNDAEVKKIAAQYGKDHTSRKVTKDEFMSGEPIFTKYFQ 3333-------------1111--------------1111--------------------1 NLVKDAEGKAAVEKFLKHVFPMMAAEI 111--------------------1111 >FERREDOXIN; SWP:P03942; PDB:1H98A; PHVICEPCIGVKDQSCVEVCPVECIYDGGDQFYIHPEECIDCGACVPACPVNAIYPEEDV ----3333-----3333--1111-------------------3333--1111---3333- PEQWKSYIEKNRKLAGL 3333------------- >TRANSCRIPTION ANTITERMINA; SWP:P39805; PDB:1H99A; GAMEKFKTLLYDIPIECMEVSEEIISYAKLQLGKKLNDSIYVSLTDHINFAIQRNQKGLD 1111---------3333-------------------------------------1111-- IKNALLWETKRLYKDEFAIGKEALVMVKNKTGVSLPEDEAGFIALHIVNAELNEEMPNII --1111-----------------------------3333-----------1111------ NITKVMEEILSIVKYHFKIEFNEESLHYYRFVTDLKFFAQRLFNGTHMEDDFLLDTVKEK ---------------------1111----------------1111-----1111-3333- YHRAYECTKKIQTYIEREYEHKLTSDELLYLTIDIERVVK -3333----------------------------------- >Core-binding factor subun; SWP:Q13951; PDB:1H9DB; PRVVPDQRSKFENEEFFRKLSRECEIKYTGFRDRPHEERQARFQNACRDGRSEIAFVATG --------------3333------------1111----------------------1111 TNLSLQFFPTPSREYVDLEREAGKVYLKAPMILNGVCVIWKGWIDLQRLDGMGCLEFDEE 
-----------3333-----------------iiii------------------------ RAQQE -1111 >Trypsin inhibitor 2; SWP:P12071; PDB:1H9HI; GCPRILIRCKQDSDCLAGCVCGPNGFCGSP ----------1111---------------- >MOLYBDENUM-BINDING-PROTEI; SWP:Q44529; PDB:1H9MA; MKISARNVFKGTVSALKEGAVNAEVDILLGGGDKLAAVVTLESARSLQLAAGKEVVAVVK -------------------------------------------------2222------1 APWVLLMTDSSGYRLSARNILTGTVKTIETGAVNAEVTLALQGGTEITSMVTKEAVAELG 111------iiii---------------------------2222-------3333----- LKPGASASAVIKASNVILGVP -2222------1111------ >PHOSPHATIDYLINOSITOL 3-KI; SWP:P27986; PDB:1H9OA; GSPIPHHDEKTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVI -------------------------22222222-------2222------iiii------ NKTATGYGFAEPYNLYSSLKELVLHYQHTSLVQHNDSLNVTLAYPVYA --1111------------------------33333333------1111 >RETINOID X RECEPTOR, BETA; SWP:P28702; PDB:1H9UA; MPVDRILEAELAVPVTNICQAADKQLFTLVEWAKRIPHFSSLPLDDQVILLRAGWNELLI -3333------------------3333----------3333------------------- ASFSHRSIDVRDGILLATGLHVHRNSAHSAGVGAIFDRVLTELVSKMRDMRMDKTELGCL ----1111-------1111---3333-1111----------------------------- RAIILFNPDAKGLSNPSEVEVLREKVYASLETYCKQKYPEQQGRFAKLLLRLPALRSIGL ------1111---------------------------3333------------------- KCLEHLFFFKLIGDTPIDTFL --------------3333--- >SEED LECTIN; SWP:P81637; PDB:1H9WA; ADTIVAVELDSYPNTDIGDPSYPHIGIDIKSIRSKSTARWNMQTGKVGTAHISYNSVAKR -------------3333-------------------------2222-------------- LSAVVSYTGSSSTTVSYDVDLNNVLPEWVRVGLSATTGLYKETNTILSWSFTSKLKTANS -------------------3333------------------------------------- LHFSFNQFSQNPKDLILQGDATTDSDGNLELTKVSSSGDPQGSSVGRALFYAPVHIWEKS -----------1111--------1111-------1111-------------------222 AVVASFDATFTFLIKSPDRDPADGITFFIANTDTSIPSGSGGRLLGLFPDAN 2-----------------------------1111--2222!!!!-------- >GAMMA CRYSTALLIN S; SWP:P22914; PDB:1HA4A; GQYKIQIFEKGDFSGQMYETTEDCPSIMEQFHMREIHSCKVLEGVWIFYELPNYRGRQYL ----------%%%%-----------3333----------------------%%%%----- LDKKEYRKPIDWGAASPAVQSFRRIVE -------3333---------------- >MACROPHAGE 
INFLAMMATORY P; SWP:O89093; PDB:1HA6A; ASNYDCCLSYIQTPLPSRAIVGFTRQMADEACDINAIIFHTKKRKSVCADPKQNWVKRAV ---------------3333----------------------------------------- NLLSLRVKKM --3333---- >PHEROMONE; SWP:NA; PDB:1HA8A; GECEQCFSDGGDCTTCFNNGTGPCANCLAGYPAGCSNSDCTAFLSQCYGGC ------------------------1111--1111---3333---------- >TRYPSIN INHIBITOR II; SWP:P82409; PDB:1HA9A; SGSDGGVCPKILKKCRRDSDCPGACICRGNGYCG --1111----------3333-------1111--- >HIV-1 REVERSE TRANSCRIPTA; SWP:POL_HV1B1; PDB:1HAR; PISPIETVPVKLKPGMDGPKVAQWPLTAAKIAALVAICTEMEKEGKISKIGPENPYNTPV ------------2222-------------------------1111-----1111------ FAIWAKLVDFRELNKRTQDFWEVKSVTVLDVGDAYFSVPLDEDFRKYTAFTIPSINNETP ---------33331111-------------1111------3333---------2222--- GIRYQYNVLPQGWKGSPAIFQSSMTKILAPFKAANPDIVIYQYMDDLYVGSDLAIGAHRT ---------2222-----------------------------2222-------3333--- KIEELRQHLLRWGLTT ---------1111--- >ACYL-COA BINDING PROTEIN; SWP:P07107; PDB:1HB6A; SQAEFDKAAEEVKHLKTKPADEEMLFIYSHYKQATVGDINTERPGMLDFKGKAKWDAWNE ---------3333-------------------------------1111----------11 LKGTSKEDAMKAYIDKVEELKKKYGI 11------------------------ >ACYL-COA BINDING PROTEIN; SWP:NA; PDB:1HBKA; HMAQVFEECVSFINGLPRTINLPNELKLDLYKYYKQSTIGNCNIKEPSAHKYIDRKKYEA ------------1111-------------------------------3333--------- WKSVENLNREDAQKRYVDIVSEIFPYWQD -1111------------------1111-- >METHYL-COENZYME M REDUCTA; SWP:P11558; PDB:1HBNA; ADKLFINALKKKFEESPEEKKTTFYTLGGWKQSERKTEFVNAGKEVAAKRGIPQYNPDIG --1111---------1111--------!!!!------------------------1111- TPLGQRVLMPYQVSTTDTYVEGDDLHFVNNAAMQQMWDDIRRTVIVGLNHAHAVIEKRLG ------------2222----3333-3333----------1111----------------- KEVTPETITHYLETVNHAMPGAAVVQEHMVETHPALVADSYVKVFTGNDEIADEIDPAFV -----------------3333-----------33331111-----------11113333- IDINKQFPEDQAETLKAEVGDGIWQVVRIPTIVSRTCDGATTSRWSAMQIGMSMISAYKQ -3333-------------!!!!---------------1111------------------- AAGEAATGDFAYAAKAEVIHMGTYLPVRARGENEPGGVPFGYLADICQSSRVNYEDPVRV 
---3333--------------------------3333----------3333-1111---- SLDVVATGAMLYDQIWLGSYMSGGVGFTQYATAAYTDNILDDFTYFGKEYVEDKYGLCEA ------------------1111----33333333---------------------2222- PNNMDTVLDVATEVTFYGLEQYEEYPALLEDQFGGSRAAVVAAAAGCSTAFATGNAQTGL -------------------------3333------------------------------- SGWYLSMYLHKEQHSRLGFYYDLQDQGASNVFSIRGDEGLPLELRGPNYPNYAMNVGHQG ---------------------3333-3333----------3333-11111111------- EYAGISQAPHAARGDAFVFNPLVKIAFADDNLVFDFTNVRGEFAKGALREFEPAGERALI ------------------------11111111--1111-------1111-------3333 TPA --- >Methyl-coenzyme M reducta; SWP:P11560; PDB:1HBNB; AKFEDKVDLYDDRGNLVEEQVPLEALSPLRNPAIKSIVQGIKRTVAVNLEGIENALKTAK ---------------------3333-3333------------------------------ VGGPACKIMGRELDLDIVGNAESIAAAAKEMIQVTEDDDTNVELLGGGKRALVQVPSARF --2222-2222----3333---------------2222-------iiii------3333- DVAAEYSAAPLVTATAFVQAIINEFDVSMYDANMVKAAVLGRYPQSVEYMGANIATMLDI ---------------------------3333-----------------2222-------3 PQKLEGPGYALRNIMVNHVVAATLKNTLQAAALSTILEQTAMFEMGDAVGAFERMHLLGL 333--22223333-3333----%%%%----------------------!!!!-------- AYQGMNADNLVFDLVKANGKEGTVGSVIADLVERALEDGVIKVEKELTDYKVYGTDDLAM -----2222--------------------------------------------------- WNAYAAAGLMAATMVNQGAARAAQGVSSTLLYYNDLIEFETGLPSVDFGKVEGTAVGFSF ----------------------1111------------------2222------------ FSHSIYGGGGPGIFNGNHIVTRHSKGFAIPCVAAAMALDAGTQMFSPEATSGLIKEVFSQ ---------3333-1111---------3333----1111------3333-------3333 VDEFREPLKYVVEAAAEIKNE -3333------------1111 >Methyl-coenzyme M reducta; SWP:P11562; PDB:1HBNC; AQYYPGTTKVAQNRRNFCNPEYELEKLREISDEDVVKILGHRAPGEEYPSVHPPLEEMDE ------------------1111--------------------2222-------3333--- PEDAIREMVEPIDGAKAGDRVRYIQFTDSMYFAPAQPYVRSRAYLCRYRGADAGTLSGRQ -------------------------------------------------------1111- IIETRERDLEKISKELLETEFFDPARSGVRGKSVHGHSLRLDEDGMMFDMLRRQIYNKDT -----------------------------------1111--1111---1111-------- GRVEMVKNQIGDELDEPVDLGEPLDEETLMEKTTIYRVDGEAYRDDVEAVEIMQRIHVLR 
-------1111-------------------------1111-3333--------------- SQGGFNL ------- >HEMOGLOBIN D; SWP:P02001; PDB:1HBRA; MLTAEDKKLIQQAWEKAASHQEEFGAEALTRMFTTYPQTKTYFPHFDLSPGSDQVRGHGK ---3333---------11113333-------------3333-1111--1111-------- KVLGALGNAVKNVDNLSQAMAELSNLHAYNLRVDPVNFKLLSQCIQVVLAVHMGKDYTPE --------------3333---------------3333----------------3333--- VHAAFDKFLSAVSAVLAEKYR -------------1111---- >REGULATORY PROTEIN GAL4; SWP:P04386; PDB:1HBWA; TRAHLTEVESRLERLEQLFLLIFPREDLDMILKMDSLRDIEALLTGLFVQDNVNKDA ----------------------------3333----3333----------------- >SERUM RESPONSE FACTOR; SWP:P11831; PDB:1HBXA; GKKTRGRVKIKMEFIDNKLRRYTTFSKRKTGIMKKAYELSTLTGTQVLLLVASETGHVYT ------------------------------------------------------------ FATRKLQPMITSETGKALIQTCLNSPD ---1111--------------1111-- >ARTHROPODAN HEMOCYANIN; SWP:P04254; PDB:1HC1; TGNAQKQQDINHLLDKIYEPTKYPDLKDIAENFNPLGDTSIYNDHGAAVETLMKELNDHR -----------11111111--------------1111------%%%%------------- LLEQRHWYSLFNTRQRKEALMLFAVLNQCKEWYCFRSNAAYFRERMNEGEFVYALYVSVI ------------------------------3333-------1111-3333---------- HSKLGDGIVLPPLYQITPHMFTNSEVIDKAYSAKMTQKPGTFNVSFKNREQRVAYFGEDI ----3333---3333-3333--------------------------------3333--33 GMNIHHVTWHMDFPFWWEDSYGYHLDRKGELFFWVHHQLTARFDFERLSNWLDPVDELHW 33-----------1111-3333------------------------3333--------11 DRIIREGFAPLTSYKYGGEFPVRPDNIHFEDVDGVAHVHDLEITESRIHEAIDHGYITDS 11-----------------------------2222-3333------------------11 DGHTIDIRQPKGIELLGDIIESSKYSSNVQYYGSLHNTAHVMLGRQGDPHGKFNLPPGVM 11------1111---------------1111-------------3333---------111 EHFETATRDPSFFRLHKYMDNIFKKHTDSFPPYTHDNLEFSGMVVNGVAIDGELITFFDE 11111---1111----3333-----3333----3333----------------------- FQYSLINAVDSGENIEDVEINARVHRLNHNEFTYKITMSNNNDGERLATFRIFLCPIEDN ----1111---------------------------------------------------- NGITLTLDEARWFCIELDKFFQKVPSGPETIERSSKDSSVTVPDMPSFQSLKEQADNAVN -------1111-----------------------1111---------------------- GGLDLSAYERSCGIPDRMLLPKSKPEGMEFNLYVAVTDGDKDTEGHHAQCGVHGEAYPDN 
--------------3333-----3333-----------1111------------------ RPLGYPLERRIPDERVIDGVSNIKHVVVKIVHHL ---------------3333--------------- >PROLYL-TRNA SYNTHETASE; SWP:Q5SM28; PDB:1HC7A; KGLTPQSQDFSEWYLEVIQKAELADYGPVRGTIVVRPYGYAIWENIQQVLDRMFKETGHQ ----3333----------1111------2222----------------------1111-- NAYFPLFIPMSFLFSPELAVVTHAGGEELEEPLAVRPTSETVIGYMWSKWIRSWRDLPQL --------3333--3333-----%%%%-------------------------3333---- LNQWGNVVRWEMRTRPFLRTSEFLWQEGHTAHATREEAEEEVRRMLSIYARLAREYAAIP --------------2222------------------------------------------ VIEGLKTEKEKFAGAVYTTTIEALMKDGKALQAGTSHYLGENFARAFDIKFQDRDLQVKY ------3333-1111---------1111----------!!!!----------1111---- VHTTSWGLSWRFIGAIIMTHGDDRGLVLPPRLAPIQVVIVPIYKDESRERVLEAAQGLRQ ---------------------1111---1111-----------3333------------- ALLAQGLRVHLDDRDQHTPGYKFHEWELKGVPFRVELGPKDLEGGQAVLASRLGGKETLP --1111---------------------------------3333-------1111-----3 LAALPEALPGKLDAFHEELYRRALAFREDHTRKVDTYEAFKEAVQEGFALAFHCGDKACE 333------------------------1111----------1111--------------- RLIQEETTATTRCVPFEAEPEEGFCVRCGRPSAYGKRVVFAKAY -------------------------------------------- >RIBOSOMAL PROTEIN L11; SWP:P56210; PDB:1HC8A; TFITKTPPAAVLLKKAAGIESGSGEPNRNKVATIKRDKVREIAELKMPDLNAASIEAAMR ------------------------3333------3333-------3333----------- MIEGTARSMGIVVE -------------- >ALPHA-BUNGAROTOXIN; SWP:P01378; PDB:1HC9A; IVCHTTATSPISAVTCPPGENLCYRKMWCDVFCSSRGKVVELGCAATCPSKKPYEEVTCC ----------------2222---------1111--------------------------- STDKCNPHPKQRPG -------1111--- >HISACTOPHILIN; SWP:P13231; PDB:1HCD; MGNRAFKSHHGHFLSAEGEAVKTHHGHHDHHTHFHVENHGGKVALKTHCGKYLSIGDHKQ --------------------------------------!!!!------------------ VYLSHHLHGDHSLFHLEHHGGKVSIKGHHHHYISADHHGHVSTKEHHDHDTTFEEIII ---------------------------%%%%----!!!!------------------- >Choriogonadotropin subuni; SWP:P01233; PDB:1HCNB; KEPLRPRCRPINATLAVEKEGCPVCITVNTTICAGYCPTMTRVLQGVLPALPQVVCNYRD ------------------------------------------------------------ 
VRFESIRLPGCPRGVNPVVSYAVALSCQCALCRRSTTDCGGPKDHPLTCD --------------------------------3333-------------- >HUMAN/CHICKEN ESTROGEN RE; SWP:P03372; PDB:1HCQA; MKETRYCAVCNDYASGYHYGVWSCEGCKAFFKRSIQGHNDYMCPATNQCTIDKNRRKSCQ -----------------iiii------------------------------1111----- ACRLRKCYEVGMMK -------1111--- >ALPHA-1,2-MANNOSIDASE; SWP:Q9P8T8; PDB:1HCUA; KRGSPNPTRAAAVKAAFQTSWNAYHHFAFPHDDLHPVSNSFDDERNGWGSSAIDGLDTAI --------------------------------------------%%%%-----------1 LMGDADIVNTILQYVPQINFTTTAVANQGSSVFETNIRYLGGLLSAYDLLRGPFSSLATN 111----------3333------------------------------------3333--- QTLVNSLLRQAQTLANGLKVAFTTPSGVPDPTVFFNPTVRRSGASSNNVAEIGSLVLEWT -------------------11111111--------------------3333--------- RLSDLTGNPQYAQLAQKGESYLLNPKGSPEAWPGLIGTFVSTSNGTFQDSSGSWSGLMDS -----------------3333----------2222-------------------2222-- FYEYLIKMYLYDPVAFAHYKDRWVLGADSTIGHLGSHPSTRKDLTFLSSYNGQSTSPNSG -------------------------------------1111---------!!!!------ HLASFGGGNFILGGILLNEQKYIDFGIKLASSYFGTYTQTASGIGPEGFAWVDSVTGAGG ------------------------------------1111-------------------- SPPSSQSGFYSSAGFWVTAPYYILRPETLESLYYAYRVTGDSKWQDLAWEALSAIEDACR --3333------------------------------------------------------ AGSAYSSINDVTQANGGGASDDMESFWFAEALKYAYLIFAEESDVQVQATGGNKFVFNTE !!!!-----1111----------3333---------------3333-------------- AHPFSIRS -------- >IMMUNOGLOBULIN G; SWP:A2KD53; PDB:1HCV; VQLQESGGGLVQAGGSLRLSCAASGRTGSTYDMGWFRQAPGKERESVAAINWDSARTYYA -----------2222------------1111-------2222-----------------3 SSVRGRFTISRDNAKKTVYLQMNSLKPEDTAVYTCGAGEGGTWDSWGQGTQVTVSS 333---------1111---------1111---------iiii-------------- >CYTOCHROME F; SWP:P36438; PDB:1HCZ; YPIFAQQNYENPREATGRIVCANCHLASKPVDIEVPQAVLPDTVFEAVVKIPYDMQLKQV 3333---------1111------------------------------------1111--- LANGKKGALNVGAVLILPEGFELAPPDRISPEMKEKIGNLSFQNYRPNKKNILVIGPVPG 1111--------------------3333-----------------1111---------33 QKYSEITFPILAPDPATNKDVHFLKYPIYVGGNRGRGQIYPDGSKSNNTVYNATAGGIIS 
33-----------33331111------------------1111----------------- KILRKEKGGYEITIVDASNERQVIDIIPRGLELLVSEGESIKLDQPLTSNPNVGGFGQGD ----1111--------1111---------------2222--2222--------------- AEIVLQDPLR ------3333 >HETEROGENEOUS NUCLEAR RIB; SWP:Q14103; PDB:1HD0A; KMFIGGLSWDTTKKDLKDYFSKFGEVVDCTLKLDPITGRSRGFGFVLFKESESVDKVMDQ -------1111--------1111------------------------------------- KEHKLNGKVIDPKRA ----iiii------- >PEROXIREDOXIN 5 RESIDUES ; SWP:P30044; PDB:1HD2A; APIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSKTHLPGFVEQAE ---2222----------1111-------2222--------2222---------------- ALKAKGVQVVACLSVNDAFVTGEWGRAHKAEGKVRLLADPTGAFGKETDLLLDDSLVSIF --1111-----------------------2222-----1111-----------1111--- GNRRLKRFSMVVQDGIVKALNVEPDGTGLTCSLAPNIISQL ------------iiii------1111---1111----1111 >ENDOGLUCANASE; SWP:O93782; PDB:1HD5A; ADGKSTRYWDCCKPSCGWAKKAPVNQPVFSCNANFQRLTDFDAKSGCEPGGVAYSCADQT -------------1111--------------1111----1111-1111-------1111- PWAVNDDFAFGFAATSIAGSNEAGWCCACYELTFTSGPVAGKKMVVQSTSTSNHFDLNIP ----1111--------222233332222----------2222----------------22 GGGVGIFDGCTPQFGGLPGQRYGGISSRNECDRFPDALKPGCYWRFDWFKNADNPSFSFR 22--3333-3333-------------333311113333----33331111---------- QVQCPAELVARTGCRRNDDGNFPAV ----3333-------1111------ >PHEROMONE ER-22; SWP:P58548; PDB:1HD6A; DICDIAIAQCSLTLCQDCENTPICELAVKGSCPPPWS ----------33333333------------------- >DNA-(APURINIC OR APYRIMID; SWP:P27695; PDB:1HD7A; LYEDPPDQKTSPSGKPATLKICSWNVDGLRAWIKKKGLDWVKEEAPDILCLQETKCSEGL ----------1111---------------------------------------------- SHQYWSAPYSGVGLLSRQCPLKVSYGIGDEEHDQEGRVIVAEFDSFVLVTAYVPNAGRGL -----------------------------1111---------1111----------2222 VRLEYRQRWDEAFRKFLKGLASRKPLVLCGDLNVAHEEIDLRNPKGNKKNAGFTPQERQG ------------------------------------3333--33331111---------- FGELLQAVPLADSFRHLYPNTPYAYTFWTYMMNARSKNVGWRLDYFLLSHSLLPALCDSK -----------------1111---------%%%%1111----------33331111---- IRSKALGSDHCPITLYLAL -1111-------------- >PENICILLIN-BINDING PROTEI; SWP:P04287; PDB:1HD8A; 
LNIKTMIPGVPQIDAESYILIDYNSGKVLAEQNADVRRDPASLTKMMTSYVIGQAMKAGK --------------------------------1111---!!!!----------------- FKETDLVTIGNLKPGMQVPVSQLIRDINLQSGNDACVAMADFAAGSQDAFVGLMNSYVNA -1111-------2222-------------------------------------------- LGLKNTHFQTVHGLDADGQYSSARDMALIGQALIRDVPNEYSIYKEKEFTFNGIRQLNRN ------------------------------------33333333------iiii-----3 GLLWDNSLNVDGIKTGHTDKAGYNLVASATEGQMRLISAVMGGRTFKGREAESKKLLTWG 333-3333---------1111---------!!!!----------3333------------ FRFFETVNPLKVGKEFASEPVWFGDSDRASLGVDKDVYLTIPRGRMKDLKASYVLNSSEL ----------2222---------------------------22221111----------- HAPLQKNQVVGTINFQLDGKTIEQRPLVVLQEIPEGN ----2222--------iiii----------------- >3-ALPHA, 20 BETA-HYDROXYS; SWP:P19992; PDB:1HDCA; NDLSGKTVIITGGARGLGAEAARQAVAAGARVVLADVLDEEGAATARELGDAARYQHLDV ---------------3333---------------------------------------11 TIEEDWQRVVAYAREEFGSVDGLVNNAGISTGMFLETESVERFRKVVEINLTGVFIGMKT 11-------------------------------3333-3333------------------ VIPAMKDAGGGSIVNISSAAGLMGLALTSSYGASKWGVRGLSKLAAVELGTDRIRVNSVH 3333-------------1111-------3333---------------------------- PGMTYTPMTAETGIRQGEGNYPNTPMGRVGEPGEIAGAVVKLLSDTSSYVTGAELAVDGG -----3333-------22221111------3333---------3333----------iii WTTGPTVKYVMGQ i------------ >SPHERULIN 3A; SWP:P09353; PDB:1HDFA; SVCKGVSGNPAKGEVFLYKHVNFQGDSWKVTGNVYDFRSVSGLNDVVSSVKVGPNTKAFI ----------2222------%%%%---------------2222----------------- FKDDRFNGNFIRLEESSQVTDLTTRNLNDAISSIVATFE ---%%%%-------------3333--------------- -------------------- >ARYLSULFATASE; SWP:AAG03573; PDB:1HDHA; KRPNFLVIVADDLGFSDIGAFGGEIATPNLDALAIAGLRLTDFHTASTSPTRSMLLTGTD -------------11111111--------------------------------------3 HHIAGIGTMAEALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGLKPEQT 333-----1111-3333--2222---------3333--1111-------------11113 PHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPEGFYSSDA 333--------------1111-----3333---1111-----!!!!-----2222----- FGDKLLQYLKERDQSRPFFAYLPFSAPHWPLQAPREIVEKYRGRYDAGPEALRQERLARL 
--------11111111-----------------3333-11111111-------------- KELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGRVVDYLRR ------1111----------1111----------------------------------33 QGELDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSYVWYGPRW 333333--------------11113333--------------1111--1111-------- AQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLDLAGVRHP ----------2222-3333--------3333-----------1111-------------- GKRWRGREIAEPRGRSWLGWLSGETEAAHDENTVTGWELFGMRAIRQGDWKAVYLPAPVG ---%%%%------------1111------1111-----iiii----!!!!---------- PATWQLYDLARDPGEIHDLADSQPGKLAELIEHWKRYVSETGVV -------33331111----------------------------- >PHOSPHOGLYCERATE KINASE; SWP:Q7SIB7; PDB:1HDIA; NKLTLDKLNVKGKRVVMRVDFNVPMAAAQITNNARIKAAVPSIKFCLDDGAKSVVLMSHL ---3333--2222----------------------------------------------- GRPDGSPMPDKYSLQPVAAELKSALGKAVLFLKDCVGPAVEKACADPAAGSVILLENLRF --%%%%-3333--3333------------------------------2222-----1111 HVEEEGKGKDASGNKAAGEPAKIKAFRASLSALGDVYVNDAFGTAHRAHSSMVGVNLPKK 3333-----3333----------------3333-------3333----3333-------- AGAFLMKKELNYFAAAAESPERPFLAILGGAKVADKIQLINNMLDKVNEMIIGGGMAFTF --------------------------------1111------1111-------3333--- LKVLNNMEIGTSLFDEAGKKIVKNLMSKAAANGVKITLPVDFVTADKFDEQAKIGQATVA --------!!!!--3333------------------------------1111-----333 SGIPAGWMGLDCGPKSSAKYSEAVARAKQIVWNGPVGVFEWEAFAQGTKALMDEVVKATS 3----------------------1111----------3333-----------------11 RGCITIIGGGDTATCCAKWNTEDNVSHVSTGGGASLELLEGKVLPGVDALSNV 11--------------11111111-------------1111--3333------ >HUMAN HSP40; SWP:P25685; PDB:1HDJ; MGKDYYQTLGLARGASDEEIKRAYRRQALRYHPDKNKEPGAEEKFKEIAEAYDVLSDPRK ----3333-------1111--------3333------2222------------------- REIFDRYGEEGLKGSGC -------3333------ >EOSINOPHIL LYSOPHOSPHOLIP; SWP:Q05315; PDB:1HDKA; SLLPVPYTEAASLSTGSTVTIKGRPLVCFLNEPYLQVDFHTEMKEESDIVFHFQVCFGRR -------------2222----------3333------------1111--------2222- VVMNSREYGAWKQQVESKNMPFQDGQEFELSISVLPDKYQVMVNGQSSYTFDHRIKPEAV 
------iiii------------2222--------1111----iiii---------3333- KMVQVWRDISLTKFNVSYL ------------------- >SERINE PROTEINASE INHIBIT; SWP:Q9NQ38; PDB:1HDLA; KNEDQEMCHEFQAFMKNGKLFCPQDKKFFQSLDGIMFINKCATCKMILEKEAKSQ -1111----3333--iiii-------------------------------3333- >BILIVERDIN IX BETA REDUCT; SWP:P30043; PDB:1HDOA; MAVKKIAIFGATGQTGLTTLAQAVQAGYEVTVLVRDSSRLPSEGPRPAHVVVGDVLQAAD ---------1111----------1111--------3333--------------1111--- VDKTVAGQDAVIVLLGTRNDLSPTTVMSEGARNIVAAMKAHGVDKVVACTSAFLLWDPTK ----2222--------!!!!-------------------------------1111-1111 VPPRLQAVTDDHIRMHKVLRESGLKYVAVMPPHIGDQPLTGAYTVTLDGRGPSRVISKHD -3333------------------------------------------------------- LGHFMLRCLTTDEYDGHSTYPSHQY ------1111-1111---------- >OCT-2 POU HOMEODOMAIN; SWP:P09086; PDB:1HDP; RRKKRTSIETNVRFALEKSFLANQKPTSEEILLIAEQLHMEKEVIRVWFCNRRQKEKRIN ---------3333---------------------3333---------------------- PCS --- >DIHYDROPTERIDINE REDUCTAS; SWP:P09417; PDB:1HDR; EARRVLVYGGRGALGSRCVQAFRARNWWVASVDVVENEEASASIIVKMTDSFTEQADQVT --------1111------------------------------------------------ AEVGKLLGEEKVDAILCVAGGWAGGNAKSKSLFKNCDLMWKQSIWTSTISSHLATKHLKE ------!!!!---------------1111-----------------------------22 GGLLTLAGAKAALDGTPGMIGYGMAKGAVHQLCQSLAGKNSGMPPGAAAIAVLPVTLDTP 22------3333---1111---------------1111-----2222-----------33 MNRKSMPEADFSSWTPLEFLVETFHDWITGKNRPSSGSLIQVVTTEGRTELTPAYF 33--------1111-3333----------2222-2222------iiii-------- >HEMOGLOBIN S (DEOXY) (BET; SWP:P01972; PDB:1HDSA; VLSAANKSNVKAAWGKVGGNAPAYGAQALQRMFLSFPTTKTYFPHFDLSHGSAQQKAHGQ -------------------3333-------------3333--3333-------------- KVANALTKAQGHLNDLPGTLSNLSNLHAHKLRVNPVNFKLLSHSLLVTLASHLPTNFTPA -3333--3333-----3333-------------3333------------3333----333 VHANLNKFLANDSTVLTSKYR 3-------------------- >Hemoglobin subunit beta-3; SWP:P02074; PDB:1HDSB; MLTAEEKAAVTGFWGKVDVDVVGAQALGRLLVVYPWTQRFFQHFGNLSSAGAVMNNPKVK --3333--3333---------------------3333--------------3333----- 
AHGKRVLDAFTQGLKHLDDLKGAFAQLSGLHCNKLHVNPQNFRLLGNVLALVVARNFGGQ ----------------------------------------3333-------3333----- FTPNVQALFQKVVAGVANALAHKYH --3333------------------- >EXOENZYME S; SWP:Q51451; PDB:1HE1A; ASSAVVFKQMVLQQALPMTLKGLDKASELATLTPEGLAREHSRLASGDGALRSLSTALAG 3333----------3333----------1111---------3333!!!!----------- IRAGSQVEESRIQAGRLLERSIGGIALQQWGTTGGAASQLVLDASPELRREITDQLHQVM ---------------------iiii3333------------------------------- SEVALLRQAVESEVS --------------- >HIGH AFFINITY NERVE GROWT; SWP:P04629; PDB:1HE7A; SHMPASVQLHTAVEMHHWCIPFSVDGQPAPSLRWLFNGSVLNETSFIFTEFLEPAANETV -----------------------------------iiii----1111------------- RHGCLRLNQPTHVNNGNYTLLAANPFGQASASIMAAFMDNPFEFNPE ----------3333---------1111------------1111---- ------------------------------------------------------------ ---------------------------- >HCV HELICASE; SWP:P26664; PDB:1HEIA; NSSPPAVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYKVLVLNPSVAATLGFGAYMSKAH ------------------33331111------1111------------------------ GVDPNIRTGVRTITTGSPITYSTYGKFLADGGCSGGAYDIIICDECHSTDATSILGIGTV ---------------------------1111-------------1111-3333------- LDQAETAGARLVVLATATPPGSVTVSHPNIEEVALSTTGEIPFYGKAIPLEVIKGGRHLI ----1111----------2222--------------------iiii--3333-------- FCHSKKKCDELAAKLVALGINAVAYYRGLDVSVIPTNGDVVVVSTDALMTGFTGDFDSVI -------------------------2222---------------3333------------ DCNTCVTQTVDFSLDPTFTIETTTLPQDAVSRTQRRGRTGRGKPGIYRFVAPGERPSGMF ------------------------------------------------------------ DSSVLCECYDAGCAWYELMPAETTVRLRAYMNTPGLPVCQDHLEFWEGVFTGLTHIDAHF 3333--------------3333---------------------------1111------- LSQTKQSGENFPYLVAYQATVCARAQAPPPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQ ----1111--------------1111------33331111-------------------- NEVTLTHPITKYIMTCMSADLEV ----------------------- >GAG POLYPROTEIN, CORE PRO; SWP:P03351; PDB:1HEKA; SGDPLTWSKALKKLEKVTVQGSQKLTTGNCNWALSLVDLFHDTNFVKEKDWQLRDVIPLL --------------------------3333--------1111-3333----33331111- 
EDVTQTLSGQEREAFERTWWAISAVKGLQINNVVDGKASFQLLRAKYE --1111--3333------------------------------------ >CHEY; SWP:P0AE67; PDB:1HEY; DKELKFLVVGNGGTGKSTVRNLLKELGFNNVEDAEDGVDALNKLQAGGYGFVISDWNMPN -----------------3333---------------------1111-------------- MDGLELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLNK ---------------1111---------3333---------------------------- IFEKLGM ------- >SEPTUM SITE-DETERMINING P; SWP:Q9X0D7; PDB:1HF2A; MVDFKMTKEGLVLLIKDYQNLEEVLNAISARITQMGGFFAKGDRISLMIENHNKHSQDIP ------1111---------3333----------------3333-------33331111-- RIVSHLRNLGLEVSQILVGKVQSRTTVESTGKVIKRNIRSGQTVVHSGDVIVFGNVNKGA ------1111----------------------------2222--------------1111 EILAGGSVVVFGKAQGNIRAGLNEGGQAVVAALDLQTSLIQIAGFITHSKGEENVPSIAH ------------------------1111-------------!!!!--------------- VKGNRIVIEPFDKVSF ---------1111--- >CLATHRIN ASSEMBLY PROTEIN; SWP:O55011; PDB:1HF8A; GSAVSKTVCKATTHEIMGPKKKHLDYLIQCTNEMNVNIPQLADSLFERTTNSSWVVVFKS --------3333--------------------1111------------------------ LITTHHLMVYGNERFIQYLASRNTLFNLSNFLDKSGLQGYDMSTFIRRYSRYLNEKAVSY ------------------1111-----1111----------------------------- RQVAFDFTKVKRGADGVMRTMNTEKLLKTVPIIQNQMDALLDFNVNSNELTNGVINAAFM -----1111------3333---------------------3333-1111----------- LLFKDAIRLFAAYNEGIINLLEKYFDMKKNQCKEGLDIYKKFLTRMTRISEFLKVAEQVG --------------------1111---3333----------------------------- IDRGDIPDLSQAPSSLLDALEQH -3333------------------ >FIBROBLAST COLLAGENASE; SWP:P03956; PDB:1HFC; PRWEQTHLTYRIENYTPDLPRADVDHAIEKAFQLWSNVTPLTFTKVSEGQADIMISFVRG ---------------33333333------------1111--------------------- DHRDNSPFDGPGGNLAHAFQPGPGIGGDAHFDEDERWTNNFREYNLHRVAAHELGHSLGL ----------------------!!!!-----1111--------------------1111- SHSTDIGALMYPSYTFSGDVQLAQDDIDGIQAIYGRS ----1111----------------------------- >Periplasmic [Fe] hydrogen; SWP:P07598; PDB:1HFEL; SRTVMERIEYEMHTPDPKADPDKLHFVQIDEAKCIGCDTCSQYCPTAAIFGEMGEPHSIP ----iiii-------11111111------3333----3333----------2222----- 
HIEACINCGQCLTHCPENAIYEAQSWVPEVEKKLKDGKVKCIAMPAPAVRYALGDAFGMP 3333----3333--1111-----------------1111------3333--3333----2 VGSVTTGKMLAALQKLGFAHCWDTEFTADVTIWEEGSEFVERLTKKSDMPLPQFTSCCPG 222----------3333------------------------------------------- WQKYAETYYPELLPHFSTCKSPIGMNGALAKTYGAERMKYDPKQVYTVSIMPCIAKKYEG --------33333333------------------------3333---------------- LRPELKSSGMRDIDATLTTRELAYMIKKAGIDFAKLPDGKRDSLMGESTGGATIFGVTGG -33331111----------------------3333-------------3333----2222 VMEAALRFAYEAVTGKKPDSWDFKAVRGLDGIKEATVNVGGTDVKVAVVHGAKRFKQVCD ----------------------3333------------------------3333------ DVKAGKSPYHFIEYMACPGGCVCGGGQPVMPGVLEAM ----------------2222---1111--2222---- >FACTOR H, 15TH AND 16TH C; SWP:P08603; PDB:1HFH; EKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYTCEGGFRISEENETTCYMGKWSSPPQ --------------------------------------------------iiii------ CEGLPCKSPPEISHGVVAHMSDSYQYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCI ------------------------------------------------------------ >MIGRATION INHIBITORY FACT; SWP:Q9Y063; PDB:1HFOA; PIFTLNTNIKATDVPSDFLSSTSALVGNILSKPGSYVAVHINTDQQLSFGGSTNPAAFGT ---------1111-1111--------------3333------------iiii-------- LMSIGGIEPSRNRDHSAKLFDHLNTKLGIPKNRMYIHFVNLNGDDVGWNGTTF -------1111------------------1111--------3333--iiii-- >LACCASE 1; SWP:Q9Y780; PDB:1HFUA; AIVNSVDTMTLTNANVSPDGFTRAGILVNGVHGPLIRGGKNDNFELNVVNDLDNPTMLRP ----------------1111-------iiii-------1111-----------1111--- TSIHWHGLFQRGTNWADGADGVNQCPISPGHAFLYKFTPAGHAGTFWYHSHFGTQYCDGL ---------22221111-2222-----2222-------iiii---------!!!!1111- RGPMVIYDDNDPHAALYDEDDENTIITLADWYHIPAPSIQQPDATLINGKGRYVGGPAAE ----------1111------1111----------3333--------iiii--2222---- LSIVNVEQGKKYRMRLISLSCDPNWQFSIDGHELTIIEVDGELTEPHTVDRLQIFTGQRY ------2222------------------2222------%%%%------------2222-- SFVLDANQPVDNYWIRAQPNKGRNGLAGTFANGVNSAILRYAGAANADPTTSANPNPAQL ----------------------iiii---2222-------2222---------------- NEADLHALIDPAAPGIPTPGAADVNLRFQLGFSGGRFTINGTAYESPSVPTLLQIMSGAQ 
3333-------------2222-----------iiii--iiii------------1111-- SANDLLPAGSVYELPRNQVVELVVPAGVLGGPHPFHLHGHAFSVVRSAGSSTYNFVNPVK 3333--2222------------------------------------2222---------- RDVVSLGVTGDEVTIRFVTDNPGPWFFHCHIEFHLMNGLAIVFAEDMANTVDANNPPVEW -------2222-------------------33331111-------3333-3333--3333 AQLCEIYDDLPPEATSIQTV ------11113333------ >ALPHA-LACTALBUMIN; SWP:P00713; PDB:1HFX; KQLTKCALSHELNDLAGYRDITLPEWLCIIFHISGYDTQAIVKNSDHKEYGLFQINDKDF ---------1111-2222---------------%%%%-------------1111------ CESSTTVQSRNICDISCDKLLDDDLTDDIMCVKKILDIKGIDYWLAHKPLCSDKLEQWYC ----------1111-3333--------------------3333-%%%%-------1111- EAQ --- >TRIOSEPHOSPHATE ISOMERASE; SWP:P95583; PDB:1HG3A; AKLKEPIIAINFKTYIEATGKRALEIAKAAEKVYKETGVTIVVAPQLVDLRMIAESVEIP --------------3333---------------------------3333---1111---- VFAQHIDPIKPGSHTGHVLPEAVKEAGAVGTLLNHSENRMILADLEAAIRRAEEVGLMTM -------------2222-----------------1111---------------------- VCSNNPAVSAAVAALNPDYVAVEPPELIGTGIPVSKAKPEVITNTVELVKKVNPEVKVLC ------------1111-------3333-----3333----------------3333---- GAGISTGEDVKKAIELGTVGVLLASGVTKAKDPEKAIWDLV -----------------------3333-------------- >ULTRASPIRACLE; SWP:P20153; PDB:1HG4A; FSIERIIEAEQRAETQCGDRALTFLRVGPYSTVQPDYKGAVSALCQVVNKQLFQMVEYAR ----------------!!!!-------1111--3333----------------------- MMPHFAQVPLDDQVILLKAAWIELLIANVAWCSIVSLQPQQLFLNQSFSYHRNSAIKAGV -2222---3333------------------1111---------------------1111- SAIFDRILSELSVKMKRLNLDRRELSCLKAIILYNPDIRGIKSRAEIEMCREKVYACLDE ---------------1111------------------2222------------------- HCRLEHPGDDGRFAQLLLRLPALRSISLKCQDHLFLFRITSDRPLEELFLEQLEAPPPPG -----3333----------------------------------3333------------- >HPLC-12 TYPE III ANTIFREE; SWP:P19614; PDB:1HG7A; MNQASVVANQLIPINTALTLVMMRSEVVTPVGIPAEDIPRLVSMQVNRAVPLGTTLMPDM ------------------3333-----------33333333---------2222--1111 VKGYAA 2222-- >ENDOPOLYGALACTURONASE; SWP:Q07181; PDB:1HG8A; DPCSVTEYSGLATAVSSCKNIVLNGFQVPTGKQLDLSSLQNDSTVTFKGTTTFATTADND 
1111--3333------------------2222-------2222--------------111 FNPIVISGSNITITGASGHVIDGNGQAYWDGKGSNSNSNQKPDHFIVVQKTTGNSKITNL 1--------------2222----3333----!!!!------------------------- NIQNWPVHCFDITGSSQLTISGLILDNRAGDKPNAKSGSLPAAHNTDGFDISSSDHVTLD --------------------------3333------!!!!-------------------- NNHVYNQDDCVAVTSGTNIVVSNMYCSGGHGLSIGSVGGKSDNVVDGVQFLSSQVVNSQN ------------------------------------------------------------ GCRIKSNSGATGTINNVTYQNIALTNISTYGVDVQQDYLNGGPTGKPTNGVKISNIKFIK ------2222----------------------------%%%%------------------ VTGTVASSAQDWFILCGDGSCSGFTFSGNAITGGGKTSSCNYPTNTCPS -----1111---------------------------------------- >HUMAN GROWTH HORMONE; SWP:P01241; PDB:1HGU; PTIPLSRLFQNAMLRAHRLHQLAFDTYEEFEEAYIPQKYSFLQAPQASLCFSESIPTPSN ---3333-----------------------------------3333-------------- REQAQQKSNLQLLRISLLLIQSWLEPVGFLRSVFANSLVYGASDSDVYDLLKDLEEGIQT -------1111-33333333----------3333------------------------11 LMGRLEDGSPRTGQAFKQTYAKFDANSHNDDALLKNYGLLYCFRKDMDKVETFLRIVQCR 11-----------------------------3333--3333---------1111-1111- SVEGSCG ------- >PH75 INOVIRUS MAJOR COAT ; SWP:P82889; PDB:1HGVA; MDFNPSEVASQVTNYIQAIAAAGVGVLALAIGLSAAWKYAKRFLKG ----3333-------3333---3333--------------3333-- >HYPOXANTHINE-GUANINE-XANT; SWP:P51900; PDB:1HGXA; MDDLERVLYNQDDIQKRIRELAAELTEFYEDKNPVMICVLTGAVFFYTDLLKHLDFQLEP 1111------------------------1111--------1111------1111------ DYIICSSLTISKDLKTNIEGRHVLVVEDIIDTGLTMYQLLNNLQMRKPASLKVCTLCDKD -----------------2222--------------------------------------- IGKKAYDVPIDYCGFVVENRYIIGYGFDFHNKYRNLPVIGILKE ----------------------------%%%%3333-------- >HOLLIDAY JUNCTION RESOLVI; SWP:Q9UWX8; PDB:1HH1A; SAVERNIVSRLRDKGFAVVRAPAPIPDIIALKNGVIILIEMKSRKDIEGKIYVRREQAEG -----------1111----------------iiii----------1111----------- IIEFARKSGGSLFLGVKKPGVLKFIPFEKLRRTETGNYVADSEIEGLDLEDLVRLVEAKI -------------------------3333---1111------------------------ SRTLD ----- ----- >NEUTROPHIL CYTOSOL FACTOR; SWP:P19878; PDB:1HH8A; 
SLVEAISLWNEGVLAADKKDWKGALDAFSAVQDPHSRICFNIGCMYTILKNMTEAEKAFT ---------------1111--------1111----------------------------- RSINRDKHLAVAYFQRGMLYYQTEKYDLAIKDLKEALIQLRGNQLIDYKILGLQFKLFAC -----1111-----------1111---------------iiii----3333--------- EVLYNIAFMYAKKEEWKKAEEQLALATSMKSEPRHSKIDKAMECVWKQKLYEPVVIPVGR ----------1111----------3333---3333---------1111--------2222 LFRPNERQVAQL --------1111 >GUINEA FOWL LYSOZYME; SWP:P00704; PDB:1HHL; KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFESNFNSQATNRNTDGSTDYGVLQINS ------------11112222---3333--------%%%%------1111----1111-33 RWWCNDGRTPGSRNLCNIPCSALQSSDITATANCAKKIVSDGDGMNAWVAWRKHCKGTDV 33-----------1111--3333-------------------!!!!--3333--2222-3 RVWIKGCRL 3332222-- >CALRETICULIN; SWP:P18418; PDB:1HHNA; SKKIKDPDAAKPEDWDERAKIDDPTDSKPEDWDKPEHIPDPDAKKPEDWDEEMDGEWEPP -----3333------------------------------1111--11113333------- VIQNPEYKGEWKPRQIDNPDYKGTWIHPEIDNPEYSPDANI ---1111------------------------1111------ >IGG2A KAPPA ANTIBODY CB41; SWP:Q7TS98; PDB:1HI6A; DIKMTQSPSSMYTSLGERVTITCKASQDINSFLTWFLQKPGKSPKTLIYRANRLMIGVPS -------------2222-----------%%%%------2222------------222233 RFSGSGSGQTYSLTISSLEYEDMGIYYCLQYDDFPLTFGAGTKLDLKRADAAPTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKEINVKWKIDGSERQNGVLDSWTEQDSKDSTYSMSSTLT -----------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -3333------------3333--------3333- >IGG2A KAPPA ANTIBODY CB41; SWP:NA; PDB:1HI6B; QDQLQQSGAELVRPGASVKLSCKALGYIFTDYEIHWVKQTPVHGLEWIGGIHPGSSGTAY ------------2222-----------1111----------------------------- NQKFKGKATLTADKSSTTAFMELSSLTSEDSAVYYCTRKDYWGQGTLVTVSAAKTTAPSV 3333---------1111---------3333------------------------------ YPLVPVCGGTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSGLYTLSSSV --------------------------------%%%%-------------iiii------- TVTSNTWPSQTITCNVAHPASSTKVDKKIEPRV ---3333----------3333------------ >PS2 PROTEIN; SWP:P04155; PDB:1HI7A; 
EAQTETCTVAPRERQNCGFPGVTPSQCANKGCCFDDTVRGVPWCFYPNTIDVPPEEECEF ---------3333-----33333333-1111----------------------------- >DIPEPTIDE TRANSPORT PROTE; SWP:P26902; PDB:1HI9A; MKLYMSVDMEGISGLPDDTFVDSGKRNYERGRLIMTEEANYCIAEAFNSGCTEVLVNDSH --------1111-----11111111---------------------1111---------! SKMNNLMVEKLHPEADLISGDVKPFSMVEGLDDTFRGALFLGYHARASTPGVMSHSMIFG !!!---1111-1111-------1111-2222--------------------------333 VRHFYINDRPVGELGLNAYVAGYYDVPVLMVAGDDRAAKEAEELIPNVTTAAVKQTISRS 3----iiii------------1111-------------------2222------------ AVKCLSPAKRGRLLTEKTAFALQNKDKVKPLTPPDRPVLSIEFANYGQAEWANLMPGTEI -----------------------3333---------------------------2222-- KTGTTTVQFQAKDMLEAYQAMLVMTELAMRTSFC 2222-----------------------1111--- >IGG2A-KAPPA 17/9 FAB (HEA; SWP:NA; PDB:1HILA; DIVMTQSPSSLTVTAGEKVTMSCTSSQSLFNSGKQKNYLTWYQQKPGQPPKVLIYWASTR -------------2222--------------1111---------2222------------ ESGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCQNDYSNPLTFGGGTKLELKRADAAPT 2222--------------------3333-------------------------------- VSIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYS ------33331111---------------------iiii--------------------- MSSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR -----------1111--------1111---------- >IGG2A-KAPPA 17/9 FAB (HEA; SWP:NA; PDB:1HILB; EVQLVESGGDLVKPGGSLKLSCAASGFSFSSYGMSWVRQTPDKRLEWVATISNGGGYTYY ------------2222-----------3333--------1111--------1111----- PDSVKGRFTISRDNAKNTLYLQMSSLKSEDSAMYYCARRERYDENGFAYWGQGTLVTVSA 3333----------------------3333----------%%%%---------------- AKTTAPSVYPLAPVSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSS -----------------------------------iiii--------------------- SVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR --------------------1111---------- >ENDO-1,4-BETA-XYLANASE; SWP:Q59962; PDB:1HIXA; ITTNQTGTNNGYYYSFWTDGGGSVSMNLASGGSYGTSWTNCGNFVAGKGWANGARRTVNY --------iiii-----------------!!!!--------------------------- SGSFNPSGNAYLTLYGWTANPLVEYYIVDNWGTYRPTGTYKGTVTSDGGTYDVYQTTRVN ---------------------------------------------%%%%----------- 
APSVEGTKTFNQYWSVRQSKRTGGSITAGNHFDAWARYGMPLGSFNYYMIMATEGYQSSG --1111--------------------3333-----1111--------------------- SSSIS ----- >THYMOSIN BETA9; SWP:P21752; PDB:1HJ0A; ADKPDLGEINSFDKAKLKKTETQEKNTLPTKETIEQEKQAK -----------------------------3333-------- >OESTROGEN RECEPTOR BETA; SWP:Q62986; PDB:1HJ1A; LSPEQLVLTLLEAEPPNVLVSRPSMPFTEASMMMSLTKLADKELVHMIGWAKKIPGFVEL ----------1111---------------------------------------2222--- SLLDQVRLLESCWMEVLMVGLMWRSIDHPGKLIFAPDLVLDRDEGKCVEGILEIFDMLLA ----------------------1111-2222---1111--3333---2222--------- TTSRFRELKLQHKEYLCVKAMILLNSSRKLTHLLNAVTDALVWVIAKSGISSQQQSVRLA ---------------------------------------------1111----------- NLLMLLSHVRHISNKGMEHLLSM ----------------------- >TRYPSIN I; SWP:P35031; PDB:1HJ8A; IVGGYECKAYSQPHQVSLNSGYHFCGGSLVNENWVVSAAHCYKSRVEVRLGEHNIKVTEG -------22221111---------------1111---1111------------1111--- SEQFISSSRVIRHPNYSSYNIDNDIMLIKLSKPATLNTYVQPVALPTSCAPAGTMCTVSG ------------1111--------------------1111-------------------- WGNTMSSTADSNKLQCLNIPILSYSDCNNSYPGMITNAMFCAGYLEGGKDSCQGDSGGPV ---------3333-----------------2222-1111----3333----2222----- VCNGELQGVVSWGYGCAEPGNPGVYAKVCIFNDWLTSTMASY -iiii------------2222-----3333-------1111- >BETA-1,4-GALACTANASE; SWP:P83691; PDB:1HJQA; ALQYKGVDWSSVMVEERAGVRYKNVNGQEKPLEYILAENGVNMVRQRVWVNPWDGNYNLD --------1111---1111----1111---3333--1111-----------1111----- YNIQLARRAKAAGLGLYINFHYSDTWADPAHQTTPAGWPSDINNLAWKLYNYTLDSMNRF ---------1111--------------1111---2222---------------------- ADAGIQVDIVSIGNEITQGLLWPLGKTNNWYNIARLLHSAAWGVKDSRLNPKPKIMVHLD 1111-----------1111---1111-----------------1111------------- NGWNWDTQNWWYTNVLSQGPFEMSDFDMMGVSFYPFYSASATLDSLRRSLNNMVSRWGKE 3333----------1111---1111------------3333------------------- VAVVETNWPTSCPYPRYQFPADVRNVPFSAAGQTQYIQSVANVVSSVSKGVGLFYWEPAW -------------------3333-----------------------2222------1111 IHNANLGSSCADNTMFTPSGQALSSLSVFHRI --3333----------1111-------3333- >HOLLIDAY JUNCTION RESOLVA; SWP:P0A814; PDB:1HJRA; 
AIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYAGVTEIITQFQP --------3333--------------------------3333------------------ DYFAIEQVFMAKNADSALKLGQARGVAIVAAVNQELPVFEYAARQVKQTVVGIGSAEKSQ ---------------3333-----------3333-------1111-----------3333 VQHMVRTLLKLPANPQADAADALAIAITHCHVSQNAMQ -------------------------------------- >BETA-1,4-GALACTANASE; SWP:P83692; PDB:1HJSA; ALTYRGVDWSSVVVEERAGVSYKNTNGNAQPLENILAANGVNTVRQRVWVNPADGNYNLD --------1111---1111----1111---------1111-----------1111----- YNIAIAKRAKAAGLGVYIDFHYSDTWADPAHQTMPAGWPSDIDNLSWKLYNYTLDAANKL ---------1111--------------1111---2222---------------------- QNAGIQPTIVSIGNEIRAGLLWPTGRTENWANIARLLHSAAWGIKDSSLSPKPKIMIHLD 1111-----------1111------------------------1111------------- NGWDWGTQNWWYTNVLKQGTLELSDFDMMGVSFYPFYSSSATLSALKSSLDNMAKTWNKE 1111-----------3333--1111------------1111------------------- IAVVETNWPISCPNPRYSFPSDVKNIPFSPEGQTTFITNVANIVSSVSRGVGLFYWEPAW -------------------1111-----------------------2222------1111 IHNANLGSSCADNTMFSQSGQALSSLSVFQRI --1111----------1111-----3333--- >CHITINASE-3 LIKE PROTEIN ; SWP:P36222; PDB:1HJXA; YKLVCYYTSWSQYREGDGSCFPDALDRFLCTHIIYSFANISNDHIDTWEWNDVTLYGMLN --------------!!!!--1111-1111-----------%%%%----1111-------- TLKNRNPNLKTLLSVGGWNFGSQRFSKIASNTQSRRTFIKSVPPFLRTHGFDGLDLAWLY 3333-3333-------1111---------------------------------------- PGRRDKQHFTTLIKEMKAEFIKEAQPGKKQLLLSAALSAGKVTIDSSYDIAKISQHLDFI -2222-----------------------------------------------3333---- SIMTYDFHGAWRGTTGHHSPLFRGQEDASPDRFSNTDYAVGYMLRLGAPASKLVMGIPTF ----------2222----------1111--------------------3333-------- GRSFTLASSETGVGAPISGPGIPGRFTKEAGTLAYYEICDFLRGATVHRILGQQVPYATK -----------2222-------------2222---------2222----1111------! 
GNQWVGYDDQESVKSKVQYLKDRQLAGAMVWALDLDDFQGSFCGQDLRFPLTNAIKDALA !!!-----------------1111-------1111-3333-------------------- AT -- >HEAT SHOCK PROTEIN HSP82; SWP:P02829; PDB:1HK7A; TKPLWTRNPSDITQEEYNAFYKSISNDWEDPLYVKHFSVEGQLEFRAILFIPKRAPFDLF --3333-3333------------------------------------------------- ESKKKKNNIKLYVRRVFITDEAEDLIPEWLSFVKGVVDSEDLPLNQNKIMKVIRKNIVKK ------------iiii----------3333----------------3333---------- LIEAFNEIAEDSEQFEKFYSAFSKNIKLGVHEDTQNRAALAKLLRYNSTKSVDELTSLTD ---------------------------------1111---1111---1111--------- YVTRMPEHQKNIYYITGESLKAVEKSPFLDALKAKNFEVLFLTDPIDEYAFTQLKEFEGK -----1111---------3333---1111--3333---------------------iiii TLVDITKDF ---3333-- >ANAEROBIC RIBONUCLEOTIDE-; SWP:P07071; PDB:1HK8A; DSRVFPTQRDLMAGIVSKHIAKNMVPSFIMKAHESGIIHVHDIDYSPALPFTNCCLVDLK -------------------3333--3333------------1111--------------- GMLENGFKLGNAQIETPKSIGVATAIMAQITAQVASHQYGGTTFANVDKVLSPYVKRTYA --------!!!!--------------------3333------------------------ KHIEDAEKWQIADALNYAQSKTEKDVYDAFQAYEYEVNTLFSSNGQTPFVTITFGTGTDW ------1111--3333------------------3333---1111--------------- TERMIQKAILKNRIKGLGRDGITPIFPKLVMFVEEGVNLYKDDPNYDIKQLALECASKRM -----------------1111------------2222--1111----------------- YPDIISAKNNKAITGSSVPVSPMGCRSFLSVWKDSTGNEILDGRNNLGVVTLNLPRIALD ---------------------------------1111---2222--------------11 SYIGTQFNEQKFVELFNERMDLCFEALMCRISSLKGVKATVAPILYQEGAFGVRLKPDDD 11!!!!---------------------------222233333333---1111---11113 IIELFKNGRSSVSLGYIGIHELNILVGRDIGREILTKMNAHLKQWTERTGFAFSLYSTPA 333--iiii--------------------3333--------------------------- ENLCYRFCKLDTEKYGSVKDVTDKGWYTNSFHVSVEENITPFEKISREAPYHFIATGGHI -----------------22223333---!!!!---------------3333---1111-- SYVELPDMKNNLKGLEAVWDYAAQHLDYFGVNMPVDKCFTCGSTHEMTPTENGFVCSICG -------1111--------------------------------------3333------- ETDPKKMNTIRRTCAYLGNPN --3333--------------- >HFQ PROTEIN; SWP:P25521; PDB:1HK9A; SLQDPFLNALRRERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPS 
----------1111------1111----------1111-----------3333------- RPVSH ----- >NK CELL ACTIVATING RECEPT; SWP:O95944; PDB:1HKFA; SKAQVLQSVAGQTLTVRCQYPPTGSLYEKKGWCKEASALVCIRLVTSSKPRTMAWTSRFT --------2222----------!!!!-----------------------------!!!!- IWDDPDAGFFTVTMTDLREEDSGHYWCRIYRPSDNSVSKSVRFYLVVS ---3333----------3333--------------------------- >GAMMA LACTAMASE; SWP:Q8GJP7; PDB:1HKHA; GYITVGNENSTPIELYYEDQGSGQPVVLIHGYPLDGHSWERQTRELLAQGYRVITYDRRG -------!!!!---------------------------3333----1111-------222 FGGSSKVNTGYDYDTFAADLHTVLETLDLRDVVLVGFSMGTGELARYVARYGHERVAKLA 2---------------------------------------------------1111---- FLASLEPFLVQRDDNPEGVPQEVFDGIEAAAKGDRFAWFTDFYKNFYNLDENLGSRISEQ -----------1111----3333---------------------1111------------ AVTGSWNVAIGSAPVAAYAVVPAWIEDFRSDVEAVRAAGKPTLILHGTKDNILPIDATAR --------11113333-11113333----------3333--------------3333--- RFHQAVPEADYVEVEGAPHGLLWTHADEVNAALKTFLAK -----3333--------------------------3333 >REPLICATION PROTEIN; SWP:Q52546; PDB:1HKQA; QSNKLIESSHTLTLNEKRLVLCAASLIDSRKPLPKDGYLTIRADTFAEVFGIDVKHAYAA -------3333----------------3333--2222---------------3333---- LDDAATKLFNRDIRRYVKGKVVERMRWVFHVKYREGQGCVELGFSPTIIPHLTMLHKEFT -------1111-----!!!!--------------1111------11111111-------- SYQLK ----- >HEAT-SHOCK TRANSCRIPTION ; SWP:P22813; PDB:1HKS; GSGVPAFLAKLWRLVDDADTNRLICWTKDGQSFVIQNQAQFAKELLPLNYKHNNMASFIR ------3333----------------------------1111-----------------3 QLNMYGFHKITSIDNGGLRFDRDEIEFSHPFFKRNSPFLLDQIKRK 333------------------------------------------- >CALCIUM/CALMODULIN-DEPEND; SWP:P11798; PDB:1HKXA; MTTIEDEDTKVRKQEIIKVTEQLIEAISNGDFESYTKMCDPGMTAFEPEALGNLVEGLDF --------------------------1111-----11111111---3333-------333 HRFYFENLWSRNSKPVHTTILNPHIHLMGDESACIAYIRITQYLDAGGIPRTAQSEETRV 33333----3333----------------------------------------------- WHRRDGKWQIVHFHRSGAPSV --------------------- >MICRONEME PROTEIN 5 PRECU; SWP:Q9U966; PDB:1HKYA; DYKDDDDKVKLTCYQNGVSFTGGKAISEAKAASSQACQELCEKDAKCRFFTLASGKCSLF 
--3333----------------------------------3333---------------- ADDAALRPTKSDGAVSGNKRCILLED -------------------------- >N-ACETYLNEURAMINATE LYASE; SWP:P06995; PDB:1HL2A; TNLRGVMAALLTPFDQQQALDKASLRRLVQFNIQQGIDGLYVGGSTGEAFVQSLSEREQV --------------1111--------------1111-------33331111--------- LEIVAEEAKGKIKLIAHVGCVSTAESQQLAASAKRYGFDAVSAVTPFYYPFSFEEHCDHY -------2222------------------------------------------------- RAIIDSADGLPMVVYNIPARSGVKLTLDQINTLVTLPGVGALKQTSGDLYQMEQIRREHP ----------------3333---------------2222--------3333-------11 DLVLYNGYDEIFASGLLAGADGGIGSTYNIMGWRYQGIVKALKEGDIQTAQKLQTECNKV 11-----3333---------------3333-----------1111--------------- IDLLIKTGVFRGLKTVLHYMDVVSVPLCRKPFGPVDEKYLPELKALAQQLMQERG -----------------1111--------------3333------------1111 >PUTATIVE ALPHA-L-FUCOSIDA; SWP:Q9WYE2; PDB:1HL9A; RYKPDWESLREHTVPKWFDKAKFGIFIHWGIYSVPGWATPDAWFFQNPYAEWYENSLRIK ----3333------3333------------3333-------1111---3333------22 ESPTWEYHVKTYGENFEYEKFADLFTAEKWDPQEWADLFKKAGAKYVIPTTKHHDGFCLW 22----------111133333333--1111----------------------1111---- GTKYTDFNSVKRGPKRDLVGDLAKAVREAGLRFGVYYSGGLDWRFTTEPIRYPEDLSYIR -------3333!!!!-----------1111---------------------3333----- PNTYEYADYAYKQVMELVDLYLPDVLWNDMGWPEKGKEDLKYLFAYYYNKHPEGSVNDRW --------------------------------3333--------------1111------ GVPHWDFKTAELPGYKWEFTRGIGLSFGYNRNEMLSVEQLVYTLVDVVSKGGNLLLNVGP ------------------------------------------------------------ KGDGTIPDLQERLLGLGEWLRKYGDAIYGTSVWERCCAKTEDGTEIRFTRKCNRIFVIFL 1111--3333------------33332222---------1111-------!!!!------ GIPTGEKIVIEDLNLSAGTVRHFLTGERLSFKNVGKNLEITVPKKLLETDSITLVLEAVE ----------------------1111-------!!!!-----33331111---------- >HEMOGLOBIN (DEOXY); SWP:P80018; PDB:1HLB; GGTLAIQAQGDLTLAQKKIVRKTWHQLMRNKTSFVTDVFIRIFAYDPSAQNKFPQMAGMS -2222-------3333--------------1111------3333-3333---3333---- ASQLRSSRQMQAHAIRVSSIMSEYVEELDSDILPELLATLARTHDLNKVGADHYNLFAKV 1111------------------------11113333--------1111-1111------- LMEALQAELGSDFNEKTRDAWAKAFSVVQAVLLVKHG 
--------------3333------------------- >HUMAN LECTIN; SWP:P05162; PDB:1HLCA; ELEVKNMDMKPGSTLKITGSIADGTDGFVINLGQGTDKLNLHFNPRFSESTIVCNSLDGS ---------2222---------------------1111--------1111---------- NWGQEQREDHLCFSPGSEVKFTVTFESDKFKVKLPDGHELTFPNRLGHSHLSYLSVRGGF -------------------------3333----1111------1111------------- NMSSFKLKE --------- >HORSE LEUKOCYTE ELASTASE ; SWP:P05619; PDB:1HLEA; MEQLSTANTHFAVDLFRALNESDPTGNIFISPLSISSALAMIFLGTRGNTAAQVSKALYF -----------------------------------------3333--------------1 DTVEDIHSRFQSLNADINKPGAPYILKLANRLYGEKTYNFLADFLASTQKMYGAELASVD 111-------------------------------1111---------------------3 FQQAPEDARKEINEWVKGQTEGKIPELLVKGMVDNMNT 333----------------iiii--------------- >Leukocyte elastase inhibi; SWP:P05619; PDB:1HLEB; EENFNADHPFIFFIRHNPSANILFLGRFSSP ----------------1111----------- >LIPASE, GASTRIC; SWP:P07098; PDB:1HLGA; SPEVTMNISQMITYWGYPNEEYEVVTEDGYILEVNRIPYGKKNSGQRPVVFLQHGLLASA 1111-------3333----------1111-------------------------2222-- TNWISNLPNNSLAFILADAGYDVWLGNSRGNTWARRNLYYSPDSVEFWAFSFDEMAKYDL ------1111------1111----------2222--111111113333------------ PATIDFIVKKTGQKQLHYVGHSQGTTIGFIAFSTNPSLAKRIKTFYALAPVATVKYTKSL -----------------------------------3333--------------------- INKLRFVPQSLFKFIFGDKIFYPHNFFDQFLATEVCSREMLNLLCSNALFIICGFDSKNF ----------------------!!!!1111----------3333------------1111 NTSRLDVYLSHNPAGTSVQNMFHWTQAVKSGKFQAYDWGSPVQNRMHYDQSQPPYYNVTA 111133331111--------------------------------------------1111 MNVPIAVWNGGKDLLADPQDVGLLLPKLPNLIYHKEIPFYNHLDFIWAMDAPQEVYNDIV -----------------------3333-------------1111---11113333----- SMISEDKK -------- >ALPHA-2A ADRENERGIC RECEP; SWP:P08913; PDB:1HLLA; TSSIVHLCAISLDRYWSITQAIEYNLKRTPRR -----------3333-3333-3333------- >HEMOGLOBIN (CYANO MET); SWP:P80017; PDB:1HLM; GATQSFQSVGDLTPAEKDLIRSTWDQLMTHRTGFVADVFIRIFHNDPTAQRKFPQMAGLS ------------3333-------------3333---------------11111111---- PAELRTSRQMHAHAIRVSALMTTYIDEMDTEVLPELLATLTRTHDKNHVGKKNYDLFGKV 
3333--------------------1111--------------3333-----33333333- LMEAIKAELGVGFTKQVHDAWAKTFAIVQGVLITKHAS ---3333-------3333-------------------- >HIGH-POTENTIAL IRON-SULFU; SWP:P80882; PDB:1HLQA; AAPLVAETDANAKSLGYVADTTKADKTKYPKHTKDQSCSTCALYQGKTAPQGACPLFAGK -----1111---1111---1111-33331111333333331111-!!!!----1111--- EVVAKGWCSAWAKKA --1111-1111---- >MAJOR CENTROMERE AUTOANTI; SWP:P07199; PDB:1HLVA; MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG ----------------------11113333----------------------------!! VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN !!---------1111-------------1111---3333-------------1111---- GWLDRFRRRRS ----------- >HONGOTOXIN 1; SWP:P59847; PDB:1HLYA; TVIDVKCTSPKQCLPPCKAQFGIRAGAKCMNGKCKCYPH ---------3333----------------%%%%------ >PHOSPHOGLUCOSE ISOMERASE; SWP:Q9N1E2; PDB:1HM5A; AALTRNPQFQKLQQWHREHGSELNLRHLFDTDKERFNHFSLTLNTNHGHILLDYSKNLVT -3333--------------1111--------11111111--------------------- EEVMHMLLDLAKSRGVEAARESMFNGEKINSTEDRAVLHVALRNRSNTPIVVDGKDVMPE ----------------------1111----1111---3333--3333----%%%%----- VNKVLDKMKAFCQRVRSGDWKGYTGKTITDVINIGIGGSDLGPLMVTEALKPYSSGGPRV ---------------------1111----------!!!!----------33332222--- WFVSNIDGTHIAKTLACLNPESSLFIIASKTFTTQETITNAKTAKDWFLLSAKDPSTVAK --------------11113333------1111---------------3333--3333111 HFVALSTNTAKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDNFEQLLSGAH 1-----------3333-3333----1111111111111111------------------- WMDQHFRTTPLEKNAPVLLAMLGIWYINCFGCETQAVLPYDQYLHRFAAYFQQGDMESNG ---------3333---------------------------3333---------------- KYITKSGARVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGLH ---1111------------------1111-3333--------------------%%%%-- HKILLANFLAQTEALMKGKSTEEARKELQAAGKSPEDLMKLLPHKVFEGNRPTNSIVFTK ----------------------------1111---------3333--------------- LTPFILGALIAMYEHKIFVQGVVWDINSFDQWGVELGKQLAKKIEPELDGSSPVTSHDSS ------------------------------3333---------3333------------- TNGLINFIKQQREAKI ----------1111-- >ANNEXIN 1; SWP:P19619; PDB:1HM6A; 
AMVSEFLKQAWFIDNEEQEYIKTVKGSKGGPGSAVSPYPTFNPSSDVEALHKAITVKGVD --------------------------2222-------------------------2222- EATIIEILTKRTNAQRQQIKAAYLQEKGKPLDEALKKALTGHLEEVALALLKTPAQFDAD ------------------------------------------------3333-------- ELRAAMKGLGTDEDTLNEILASRTNREIREINRVYKEELKRDLAKDITSDTSGDYQKALL ------------------------------------------------------------ SLAKGDRSEDLAINDDLADTDARALYEAGERRKGTDLNVFITILTTRSYPHLRRVFQKYS -3333-----------------------1111--------------------------11 KYSKHDMNKVLDLELKGDIENCLTVVVKCATSKPMFFAEKLHQAMKGIGTRHKTLIRIMV 11---1111--------------------------------------------------1 SRSEIDMNDIKACYQKLYGISLCQAILDETKGDYEKILVALCG 111---------------------------------------- >UDP-N-ACETYLGLUCOSAMINE-1; SWP:Q97R46; PDB:1HM9A; SNFAIILAAGKGTRMKSDLPKVLHKVAGISMLEHVFRSVGAIQPEKTVTVVGHKAELVEE -----------3333----3333--iiii---------3333-----------3333--- VLAGQTEFVTQSEQLGTGHAVMMTEPILEGLSGHTLVIAGDTPLITGESLKNLIDFHINH -----------------------33332222----------1111--------------- KNVATILTAETDNPFGYGRIVRNDNAEVLRIVEQKDATDFEKQIKEINTGTYVFDNERLF -------------2222-----1111------3333-3333------------------- EALKNINTNNAQGEYYITDVIGIFRETGEKVGAYTLKDFDESLGVNDRVALATAESVMRR --1111---1111--3333------------------3333------------------- RINHKHMVNGVSFVNPEATYIDIDVEIAPEVQIEANVILKGQTKIGAETVLTNGTYVVDS --------------1111---1111--------------------2222--2222----- TIGAGAVITNSMIEESSVADGVTVGPYAHIRPNSSLGAQVHIGNFVEVKGSSIGENTKAG ------------------2222--------------2222-------------2222--- HLTYIGNCEVGSNVNFGAGTITVNYDGKNKYKTVIGDNVFVGSNSTIIAPVELGDNSLVG ----------------2222---------------------------------2222--2 AGSTITKDVPADAIAIGRGRQINKDEYATRLPHHPKNQ 222------2222----------22221111--1111- >HIGH MOBILITY GROUP PROTE; SWP:P63159; PDB:1HME; FKDPNAPKRPPSAFFLFCSEYRPKIKGEHPGLSIGDVAKKLGEMWNNTAADDKQPYEKKA -----------1111-----------------3333------------3333-------- AKLKEKYEKDIAAYRAK ---------3333---- ------------------------------------------------------------ -------- >MUSCLE FATTY ACID BINDING; SWP:P05413; 
PDB:1HMT; VDAFLGTWKLVDSKNFDDYMKSLGVGFATRQVASMTKPTTIIEKNGDILTLKTHSTFKNT 3333----------------1111--------1111--------!!!!------------ EISFKLGVEFDETTADDRKVKSIVTLDGGKLVHLQKWDGQETTLVRELIDGKLILTLTHG ----2222-----1111---------iiii------iiii--------iiii------!! TAVCTRTYEKE !!--------- >CHONDROITIN ABC LYASE I; SWP:P59807; PDB:1HN0A; ATSNPAFDPKN -------3333 >P19 ARF PROTEIN; SWP:NA; PDB:1HN3A; GSHMGRRFLVTVRIQRAGRPLQERVFLVKFVRSRRPRTAS ---------------------3333--------------- >PROPHOSPHOLIPASE A2; SWP:P00592; PDB:1HN4A; AFRSMIKCAIPGSHPLMDFNNYGCYCGLGGSGTPVDELDRCCETHDNCYRDAKNLDSCKF ---------22223333------------------3333---------------333311 LVDNPYTESYSYSCSNTEITCNSKNNACEAFICNCDRNAAICFSKAPYNKEHKNLDTKKY 11-1111-------iiii---1111-----------------1111--3333---1111- C - >CD2; SWP:P06729; PDB:1HNF; TNALETWGALGQDINLDIPSFQMSDDIDDIKWEKTSDKKKIAQFRKEKETFKEKDTYKLF --------2222-----3333-------------1111-------3333----3333--1 KNGTLKIKHLKTDDQDIYKVSIYDTKGKNVLEKIFDLKIQERVSKPKISWTCINTTLTCE 111-------3333---------------------------------------------- VMNGTDPELNLYQDGKHLKLSQRVITHKWTTSLSAKFKCTAGNKVSKESSVEPVSCPEK ------------%%%%--------------------------1111------------- >BETA-KETOACYL-ACYL CARRIE; SWP:P24249; PDB:1HNJA; MYTKIIGTGSYLPEQVRTNADLEKMVDTSDEWIVTRTGIRERHIAAPNETVSTMGFEAAT -----------------33331111--------------------1111----------- RAIEMAGIEKDQIGLIVVATTSATHAFPSAACQIQSMLGIKGCPAFDVAAACAGFTYALS --------1111-----------------------1111----------!!!!------- VADQYVKSGAVKYALVVGSDVLARTCDPTDRGTIIIFGDGAGAAVLAASEEPGIISTHLH --------------------3333--1111--1111------------------------ ADGSYGELLTLPNADRVNPENSIHLTMAGNEVFKVAVTELAHIVDETLAANNLDRSQLDW -33331111--------3333---------------------------1111-3333--- LVPHQANLRIISATAKKLGMSMDNVVVTLDRHGNTSAASVPCALDEAVRDGRIKPGQLVL --------------------3333---3333---!!!!---------1111--2222--- LEAFGGGFTWGSALVRF ----------------- >H-NS; SWP:P0ACF8; PDB:1HNR; AQRPAKYSYVDENGETKTWTGQGRTPAVIKKAMDEQGKSLDDFLIKQ ----------3333--------------------------------- 
------------------------------ >VACUOLAR ATP SYNTHASE SUB; SWP:P41807; PDB:1HO8A; GATKILMDSTHFNEIRSIIRSRSVAWDALARSEELSEIDASTAKALESILVKKVNGKTLI ------------------3333-------------------------------------- PLIHLLSTSDNEDCKKSVQNLIAELLSSDKYGDDTVKFFQEDPKQLEQLFDVSLKGDFQT ----------1111-----------------------------3333--1111---3333 VLISGFNVVSLLVQNGLHNVKLVEKLLKNNNLINILQNIEQMDTCYVCIRLLQELAVIPE ------------------3333---------------3333-------------333333 YRDVIWLHEKKFMPTLFKILQRATDHLGIQLQYHSLLLIWLLTFNPVFANELVQKYLSDF 33-----3333----------33331111------------------------------- LDLLKLVKITIKEKVSRLCISIILQCCSTRVKQHKKVIKQLLLLGNALPTVQSLSERKYS -----------------------11113333----------------------------- DEELRQDISNLKEILENEYQELTSFDEYVAELDSKLLCWSPPHVDNGFWSDNIDEFKKDN ----------------------3333--------------3333-------3333---%% YKIFRQLIELLQAKVRNGDVNAKQEKIIIQVALNDITHVVELLPESIDVLDKTGGKADIM %%---------------------3333---------------1111-------3333--- ELLNHSDSRVKYEALKATQAIIGYTFK 3333----------------3333--- >5'-NUCLEOTIDASE; SWP:P07024; PDB:1HP1A; YEQDKTYKITVLHTNDHHGHFWRNEYGEYGLAAQKTLVDGIRKEVAAEGGSVLLLSGGDI -2222-----------iiii---1111------------------1111----------- NTGVPESDLQDAEPDFRGMNLVGYDAMAIGNHEFDNPLTVLRQQEKWAKFPLLSANIYQK ---3333----------------------1111---3333-------------------- STGERLFKPWALFKRQDLKIAVIGLTTDDTAKIGNPEYFTDIEFRKPADEAKLVIQELQQ ----------------------------1111------2222------------------ TEKPDIIIAATHMGHYDNGEHGSNAPGDVEMARALPAGSLAMIVGGHSQDPVCMAAENKK ---------------2222-!!!!-------33332222----------------2222- QVDYVPGTPCKPDQQNGIWIVQAHEWGKYVGRADFEFRNGEMKMVNYQLIPVNLKKKRVL ----2222------iiii------%%%%---------iiii------------------- YTPEIAENQQMISLLSPFQNKGKAQLEVKIGETNGRLEGDRDKVRFVQTNMGRLILAAQM ---------------------------------------1111----------------- DRTGADFAVMSGGGIRDSIEAGDISYKNVLKVQPFGNVVVYADMTGKEVIDYLTAVAQMK ----------3333-----------------------------------------1111- PDSGAYPQFANVSFVAKDGKLNDLKIKGEPVDPAKTYRMATLNFNATGGDGYPRLDNKPG 
--1111----------iiii-----iiii--1111------------2222---1111-- YVNTGFIDAEVLKAYIQKSSPLDVSVYEPKGEVSWQ ------------------------1111-------- >TITYUSTOXIN K ALPHA; SWP:P46114; PDB:1HP2A; VFINAKCRGSPECLPKCKEAIGKAAGKCMNGKCKCYP --------3333----------------%%%%----- >HU-P8; SWP:P56277; PDB:1HP8; MPQKDPCQKQACEIQKCLQANSYMESKCQAVIQELRKCCAQYPKGRSVVCSGFEKEEEEN ----1111------------iiii3333---------1111-3333-3333--------- LTRKSASK -------- >H PROTEIN OF THE GLYCINE ; SWP:P16048; PDB:1HPCA; SNVLDGLKYAPSHEWVKHEGSVATIGITDHAQDHLGEVVFVELPEPGVSVTKGKGFGAVE ---------1111-----!!!!----------------------2222--2222------ SVKATSDVNSPISGEVIEVNTGLTGKPGLINSSPYEDGWMIKIKPTSPDELESLLGAKEY 1111----------------3333---3333----1111----------3333------- TKFCEEEDAAH ----------- ---------------------- >HIGH POTENTIAL IRON SULFU; SWP:P38524; PDB:1HPI; MERLSEDDPAAQALEYRHDASSVQHPAYEEGQTCLNCLLYTDASAQDWGPCSVFPGKLVS ----1111---1111---3333--1111----33331111-1111-----1111------ ANGWCTAWVAR ----1111--- >LIPASE; SWP:P29183; PDB:1HPLA; NEVCYERLGCFSDDSPWAGIVERPLKILPWSPEKVNTRFLLYTNENPDNFQEIVADPSTI -------------------3333--------3333-------1111-------------1 QSSNFNTGRKTRFIIHGFIDKGEESWLSTMCQNMFKVESVNCICVDWKSGSRTAYSQASQ 111--1111----------2222----------3333---------3333---------- NVRIVGAEVAYLVGVLQSSFDYSPSNVHIIGHSLGSHAAGEAGRRTNGAVGRITGLDPAE ----------------------1111-------------------%%%%----------2 PCFQGTPELVRLDPSDAQFVDVIHTDIAPFIPNLGFGMSQTAGHLDFFPNGGKEMPGCQK 222---3333--1111-------------------------------2222---2222-- NVLSQIVDIDGIWQGTRDFAACNHLRSYKYYTDSILNPDGFAGFSCASYSDFTANKCFPC --------------------3333-------3333------------------------- SSEGCPQMGHYADRFPGRTKGVGQLFYLNTGDASNFARWRYRVDVTLSGKKVTGHVLVSL 1111----1111----1111---------------------------------------- FGNKGNSRQYEIFQGTLKPDNTYSNEFDSDVEVGDLEKVKFIWYNNVINLTLPKVGASKI -1111------------------------------------------------------- TVERNDGSVFNFCSEETVREDVLLTLTAC ---1111---------------------- >PANCREATIC SECRETORY TRYP; SWP:P00995; PDB:1HPT; 
DSLGREAKCYNELNGCTYEYRPVCGTDGDTYPNECVLCFENRKRQTSILIQKSGPC --------%%%%------------1111----3333-------------------- >CYTOTOXIC NECROTIZING FAC; SWP:Q47106; PDB:1HQ0A; SIESTSKSNFQKLSRGNIDVLKGRGSISSTRQRAIYPYFEAANADEQQPLFFYIKKDRFD ------------1111-3333--------------------------------------- NHGYDQYFYDNTVGPNGIPTLNTYTGEIPSDSSSLGSTYWKKYNLTNETSIIRVSNSARG ----1111----------------------1111---1111----1111----------- ANGIKIALEEVQEGKPVIITSGNLSGCTTIVARKEGYIYKVHTGTTKSLAGFTSTTGVKK ------1111-2222------------------iiii-----------2222-------- AVEVLELLTKEPIPRVEGIMSNDFLVDYLSENFEDSLITYSSSEKKPDSQITIIRDNVSV ------------------------------------------33333333----1111-- FPYFLDNIPEHGFGTSATVLVRVDGNVVVRSLSESYSLNADASEISVLKVFSKKF ----------------------iiii------------1111------------- >SIGNAL RECOGNITION PARTIC; SWP:P07019; PDB:1HQ1A; GFDLNDFLEQLRQDDKVLVRMEAIINSMTMKERAKPEIIKGSRKRRIAAGSGMQVQDVNR ------------------------1111--------------------1111-3333--- LLKQFDDMQRMMKKMK ---------------- >HISTIDINE DECARBOXYLASE; SWP:P00862; PDB:1HQ6A; SELDAKLNKLGVDRIAISPYKQWTRGYMEPGNIGNGYVTGLKVDAGVRRAETKNAYIGQI -------1111--------------2222-------------------3333-------- NMTTAS ------ >NKG2-D; SWP:O54709; PDB:1HQ8A; GYCGPCPNNWICHRNNCYQFFNEEKTWNQSQASCLSQNSSLLKIYSKEEQDFLKLVKSYH ------2222--iiii------------------1111-------3333---1111---- WMGLVQIPANGSWQWEDGSSLSYNQLTLVEIPKGSCAVYGSSFKAYTEDCANLNTYICMK --------------1111---1111---------------%%%%----1111-------- RAV --- >RUVB; SWP:Q5SL87; PDB:1HQCA; ALRPKTLDEYIGQERLKQKLRVYLEAAKARKEPLEHLLLFGPPGLGKTTLAHVIAHELGV -----3333-------------------------------------3333---------- NLRVTSGPAIEKPGDLAAILANSLEEGDILFIDEIHRLSRQAEEHLYPAMEDFVMDIVIG ------3333--------------2222-----3333----------------------- QGPAARTIRLELPRFTLIGATTRPGLITAPLLSRFGIVEHLEYYTPEELAQGVMRDARLL -----------------------------3333-----------------------3333 GVRITEEAALEIGRRSRGTMRVAKRLFRRVRDFAQVAGEEVITRERALEALAALGLDELG ------------3333---------33331111--------------------------- 
LEKRDREILEVLILRFGGGPVGLATLATALSEDPGTLEEVHEPYLIRQGLLKRTPRGRVP -------------1111----------------------------1111----1111--- TELAYRHLGYPPPV -------------- >PHENOL HYDROXYLASE P2 PRO; SWP:P19731; PDB:1HQI; MSSLVYIAFQDNDNARYVVEAIIQDNPHAVVQHHPAMIRIEAEKRLEIRRETVEENLGRA ------------------------------------------------------------ WDVQEMLVDVITIGGNVDEDDDRFVLEWKN -3333------------------------- >6,7-DIMETHYL-8-RIBITYLLUM; SWP:O66529; PDB:1HQKA; MQIYEGKLTAEGLRFGIVASRFNHALVDRLVEGAIDCIVRHGGREEDITLVRVPGSWEIP ---------2222---------3333------------1111-3333-------3333-- VAAGELARKEDIDAVIAIGVLIRGATPHFDYIASEVSKGLANLSLELRKPITFGVITADT --------1111--------------3333------------------------------ LEQAIERAGTKHGNKGWEAALSAIEMANLFKSLR ----1111-1111--------------------- >LECTIN; SWP:NA; PDB:1HQLA; SVSFTFPNFWSDVEDSIIFQGDANTTAGTLQLCKTNQYGTPLQWSAGRALYSDPVQLWDN -------------------------iiii------1111--------------------3 KTESVASFYTEFTFFLKITGNGPADGLAFFLAPPDSDVKDAGEYLGLFNKSTATQPSKNQ 333-----------------------------1111----!!!!----3333--3333-- VVAVEFDTWTNPNFPEPSYRHIGINVNSIVSVATKRWEDSDIFSGKIATARISYDGSAEI -------------------------------------3333------------------- LTVVLSYPDGSDYILSHSVDMRQNLPESVRVGISASTGNNQFLTVYILSWRFSSNL -------------------3333--------------------------------- >DNA-DIRECTED RNA POLYMERA; SWP:Q9KWU8; PDB:1HQMA; LKAPVFTATTQGDHYGEFVLEPLERGFGVTLGNPLRRILLSSIPGTAVTSVYIEDVLHEF -------------------------------------1111------------------- STIPGVKEDVVEIILNLKELVVRFLDPRWRTTLILRAEGPKEVRAVDFTPSADVEIMNPD --2222---------3333------------------------3333------------- LHIATLEEGGKLYMEVRVDRGVGYVPAERHGIKDRINAIPVDAIFSPVRRVAFQVEDTRL ------------------------------------------------------------ GQRTDLDKLTLRIWTDGSVTPLEALNQAVAILKEHLNYFANPE ------------------------------------------- >DNA-directed RNA polymera; SWP:Q9KWU7; PDB:1HQMC; KIKRFGRIREVIPLPPLTEIQVESYKKALQADVPPEKRENVGIQAAFKETFPIEEGDKGK -----------------3333---1111------------------3333---------- GGLVLDFLEYRIGDPPFSQDECREKDLTYQAPLYARLQLIHKDTGLIKEDEVFLGHLPLM 
-------------------3333------------------------------------- TEDGSFIINGADRVIVSQIHRSPGVYFTPDPARPGRYIASIIPLPKRGPWIDLEVEASGV ------------------------------------------------------------ VTMKVNKRKFPLVLLLRVLGYDQETLVRELSAYGDLVQGLLDEAVLAMRPEEAMVRLFTL ------------------------------------3333--3333-----------111 LRPGDPPKKDKALAYLFGLLADPKRYDLGEAGRYKAEEKLGVGLSGRTLVRFEDGEFKDE 1---------------------------------3333---------------------- VFLPTLRYLFALTAGVPGHEVDDIDHLGNRRIRTVGELMADQFRVGLARLARGVRERMVM --1111---1111---------------------1111--3333---------------- GSPDTLTPAKLVNSRPLEAALREFFSRSQLSQFKDETNPLSSLRHKRRISALGPGGLTRE -1111---------3333---------1111------3333------------------- RAGFDVRDVHRTHYGRICPVETPEGANIGLITSLAAYARVDALGFIRTPYRRVKNGVVTE ------------------------------------------------------------ EVVYMTASEEDRYTIAQANTPLEGDRIATDRVVARRRGEPVIVAPEEVEFMDVSPKQVFS -------------------------------------------------------3333- LNTNLIPFLEHDDANRALMGSNMQTQAVPLIRAQAPVVMTGLEERVVRDSLAALYAEEDG --------------3333----------------------3333---------------- EVVKVDGTRIAVRYEDGRLVHPLRRYARSNQGTAFDQRPRVRVGQRVKKGDLLADGPASE ----------------------------1111---------------------------- EGFLALGQNVLVAIMPFDGYNFEDAIVISEELLKRDFYTSIHIERYEIEARDTKLGPERI ----------------iiii----------3333-------------------------- TRDIPHLSEAALRDLDEEGIVRIGAEVKPGDILVGRTSFKGEQEPSPEERLLRSIFGEKA ---2222----------------------------------------------------- RDVKDTSLRVPPGEGGIVVGRLRLRRGDPGVELKPGVREVVRVFVAQKRKLQVGDKLANR ------------------------2222-------------------------------- HGNKGVVAKILPVEDMPHLPDGTPVDVILNPLGVPSRMNLGQILETHLGLAGYFLGQRYI ------------------1111-------------1111-------------1111---- SPVFDGATEPEIKELLAEAFNLYFGKRQGEGFGVDKREKEVLARAEKLGLVSPGKSPEEQ ----------------------3333--------3333--1111-1111------3333- LKELFDLGKVVLYDGRTGEPFEGPIVVGQMFIMKLYHMVEDKMHARSTGPYSLITQQPLG -1111--------------------------------3333------------------- GKAQFGGQRFGEMEVWALEAYGAAHTLQEMLTIKSDDIEGRNAAYQAIIKGEDVPEPSVP 
----------3333---------33331111----------------------------3 ESFRVLVKELQALALDVQTLDEKDNPVDIFEGL 333------3333-------1111--------- >DNA-directed RNA polymera; SWP:Q9EVV4; PDB:1HQME; MAEPGIDKLFGMVDSKYRLTVVVAKRAQQLLRHRFKNTVLEPEERPKMRTLEGLYDDPNA --22223333----3333------------------------------------------ VTWAMKELLTGRLFFGENLVPEDRLQKEMERLYPTEEE ---------------------------3333------- >ISOCITRATE DEHYDROGENASE; SWP:P39126; PDB:1HQSA; MAQGEKITVSNGVLNVPNNPIIPFIEGDGTGPDIWNAASKVLEAAVEKAYKGEKKITWKE ---------iiii--------------!!!!------------------iiii------- VYAGEKAYNKTGEWLPAETLDVIREYFIAIKGPLTTPVGGGIRSLNVALRQELDLFVLRP ---------------3333----------------------------------------- VRYFTGVPSPVKRPEDTDMVIFRENTEDIYAGIEYAKGSEEVQKLISFLQNELNVNKIRF ---2222-----3333-----------1111----2222--------------------3 PETSGIGIKPVSEEGTSRLVRAAIDYAIEHGRKSVTLVHKGNIMKFTEGAFKNWGYELAE 333------------------------------------1111----------------- KEYGDKVFTWAQYDRIAEEQGKDAANKAQSEAEAAGKIIIKDSIADIFLQQILTRPNEFD --3333-------------------------------------3333--3333-3333-- VVATMNLNGDYISDALAAQVGGIGIAPGANINYETGHAIFEATHGTAPKYAGLDKVNPSS ----------------------------------------------1111---------- VILSGVLLLEHLGWNEAADLVIKSMEKTIASKVVTYDFARLMDGATEVKCSEFGEELIKN ---------1111---------------------3333-------------------111 MD 1- >PROGRAMMED CELL DEATH PRO; SWP:P12815; PDB:1HQVA; PGPGGGPGPAALPDQSFLWNVFQRVDKDRSGVISDNELQQALSNGTWTPFNPVTVRSIIS -------------------------1111---------3333------------------ MFDRENKAGVNFSEFTGVWKYITDWQNVFRTYDRDNSGMIDKNELKQALSGFGYRLSDQF -----------3333-----------------1111-------------1111---3333 HDILIRKFDRQGRGQIAFDDFIQGCIVLQRLTDIFRRYDTDQDGWIQVSYEQYLSMVF --------3333--------------------------1111---------------- >Actin-binding protein; SWP:P15891; PDB:1HQZ1; LEPIDYTTHSREIDAEYLKIVRGSDPDTTWLIISPNAKKEYEPESTGSSFHDFLQLFDET --------3333------------3333-------1111-------------1111-111 KVQYGLARVSPPGSDVEKIIIIGWCPDSAPLKTRASFAANFAAVANNLFKGYHVQVTARD 1------------------------1111------------------------------3 
EDDLDENELLMKISNAAGA 333---------1111--- >Translation initiation fa; SWP:Q5SHR1; PDB:1HR0W; AKEKDTIRTEGVVTEALPNATFRVKLDSGPEILAYISGKMRMHYIRILPGDRVVVEITPY --------------------------------------3333---------------333 DPTRGRIVYRK 3---------- >MITOCHONDRIAL PROCESSING ; SWP:P11914; PDB:1HR6A; ARTDNFKLSSLANGLKVATSNTPGHFSALGLYIDAGSRFEGRNLKGCTHILDRLAFKSTE -3333-----1111---------------------3333----2222----1111---11 HVEGRAMAETLELLGGNYQCTSSRENLMYQASVFNQDVGKMLQLMSETVRFPKITEQELQ 11-------------------------------1111----------------------- EQKLSAEYEIDEVWMKPELVLPELLHTAAYSGETLGSPLICPRGLIPSISKYYLLDYRNK ---------------3333----------%%%%1111----33331111----------- FYTPENTVAAFVGVPHEKALELTGKYLGDWQSTHPPITKKVAQYTGGESCIPPAPVFGNL --3333--------------------1111------------------------------ PELFHIQIGFEGLPIDHPDIYALATLQTLLGGGGSFSAGGPGKGMYSRLYTHVLNQYYFV -------------1111-----------------------------------3333---- ENCVAFNHSYSDSGIFGISLSCIPQAAPQAVEVIAQQMYNTFANKDLRLTEDEVSRAKNQ ----------------------33331111-------------1111------------- LKSSLLMNLESKLVELEDMGRQVLMHGRKIPVNEMISKIEDLKPDDISRVAEMIFTGNVN --------------------------------------11113333-------1111--- NAGNGKGRATVVMQGDRGSFGDVENVLKAYGLGNSSS 1111-----------3333--------1111------ >Mitochondrial-processing ; SWP:P10507; PDB:1HR6B; PGTRTSKLPNGLTIATEYIPNTSSATVGIFVDAGSRAENVKNNGTAHFLEHLAFKGTQNR -------1111-----------------------11113333---------1111----- PQQGIELEIENIGSHLNAYTSRENTVYYAKSLQEDIPKAVDILSDILTKSVLDNSAIERE -3333----1111------------------3333------------------------- RDVIIRESEEVDKMYDEVVFDHLHEITYKDQPLGRTILGPIKNIKSITRTDLKDYITKNY ------------------------------3333-----3333----------------- KGDRMVLAGAGAVDHEKLVQYAQKYFGHVPKSESPVPLGSPRGPLPVFCRGERFIKENTL 1111---------3333--------1111-------1111-------------------- PTTHIAIALEGVSWSAPDYFVALATQAIVGNWDRAIGTGTNSPSPLAVAASQNGSLANSY ------------1111-------------------------------------------- MSFSTSYADSGLWGMYIVTDSNEHNVRLIVNEILKEWKRIKSGKISDAEVNRAKAQLKAA 
-------------------1111-3333-------------------------------- LLLSLDGSTAIVEDIGRQVVTTGKRLSPEEVFEQVDKITKDDIIMWANYRLQNKPVSMVA -1111-----------------------------1111---------------------- LGNTSTVPNVSYIEEKLNQ ---1111------------ >HEREGULIN ALPHA; SWP:Q02297; PDB:1HRE; GTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCQPGFTGARCTENVPMKVQNQEK --------11113333-------------------------------------------- AEELYQK ------- >n/a; SWP:P03366; PDB:1HRHA; YQLEKEPIVGAETFYVDGAANRETKLGKAGYVTNKGRQKVVPLTNTTNQKTELQAIYLAL -------2222---------------------1111------------------------ QDSGLEVNIVTDSQYALGIIQAQPDKSESELVNQIIEQLIKKEKVYLAWVPGGNEQVDKL ---------------------------------------------------3333----- VSAGI ----- >RENIN; SWP:P00797; PDB:1HRNA; TTSSVILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKCSRLYTACVYHKLFD ---------iiii-----------------------------111133333333-----1 ASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITVGGITVTQMFGEVTEMPALPFMLAEFD 111--------------1111------------iiii----------------1111--- GVVGMGFIEQAIGRVTPIFDNIISQGLKEDVFSFYYNRDSENSLGGQIVLGGSDPQHYEG ------3333------------1111---------------------------3333--- NFHYINLIKTGVWQIQMKGVSVGSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGA ----------------------------2222-----1111------------------- KKRLFDYVVKCNEGPTLPDISFHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPP ---------33331111--------------3333------------------------- TGPTWALGATFIRKFYTEFDRRNNRIGFALAR --------------------1111-------- >CYTOCHROME C2; SWP:P00080; PDB:1HROA; SAPPGDPVEGKHLFHTICITCHTDIKGANKVGPSLYGVVGRHSGIEPGYNYSEANIKSGI ------------1111--------2222------2222-------2222----------- VWTPDVLFKYIEHPQKIVPGTKMGYPGQPDPQKRADIIAYLETLK ------------3333-2222------------------------ >Hirudin variant-1; SWP:P01050; PDB:1HRTI; VVYTDCTESGQNLCLCEGSNVCGQGNKCILGSDGEKNQCVTGEGTPKPQSHNDGDFEEIP ----------------------2222-----2222------------------------- EEYLQ ----- >YRDC GENE PRODUCT; SWP:P45748; PDB:1HRUA; NNLQRDAIAAAIDVLNEERVIAYPTEAVFGVGCDPDSETAVRLLELKQRPVDKGLILIAA ---------------------------------11113333---------3333------ 
NYEQLKPYIDDTLTDVQRETIFSRWPGPVTFVFPAPATTPRWLTGRFDSLAVRVTDHPLV 33333333-------------1111----------11111111----------------- VALCQAYGKPLVSTSANLSGLPPCRTVDEVRAQFGAAFPVVPGETGGRLNPSEIRDALTG -----------------2222-------------1111---------------------- ELFR ---- >HUMAN SRY; SWP:Q05066; PDB:1HRYA; DRVKRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQ ------------------------------3333----------3333--3333------ AMHREKYPNYKYR ------------- >LEUKOTRIENE A-4 HYDROLASE; SWP:P09960; PDB:1HS6A; PEIVDTCSLASPASVCRTKHLHLRCSVDFTRRTLTGTAALTVQSQEDNLRSLVLDTKDLT -----------1111--------------------------------------------- IEKVVINGQEVKYALGERQSYKGSPMEISLPIALSKNQEIVIEISFETSPKSSALQWLTP -----iiii---------!!!!------------2222----------1111------33 EQTSGKEHPYLFSQCQAIHCRAILPCQDTPSVKLTYTAEVSVPKELVALMSAIRDGETPD 33----------------3333------1111----------1111-------------- PEDPSRKIYKFIQKVPIPCYLIALVVGALESRQIGPRTLVWSEKEQVEKSAYEFSETESM -----------------3333---------------------3333-------------- LKIAEDLGGPYVWGQYDLLVLPPSFPYGGMENPCLTFVTPTLLAGDKSLSNVIAHEISHS ---------------------1111------2222---3333----1111---------- WTGNLVTNKTWDHFWLNEGHTVYLERHICGRLFGEKFRHFNALGGWGELQNSVKTFGETH --3333--------------------------------------------------1111 PFTKLVVDLTDIDPDVAYSSVPYEKGFALLFYLEQLLGGPEIFLGFLKAYVEKFSYKSIT --------22223333---3333------------------------------2222--- TDDWKDFLYSYFKDKVDVLNQVDWNAWLYSPGLPPIKPNYDMTLTNACIALSQRWITAKE -----------11113333---------------------------------------33 DDLNSFNATDLKDLSSHQLNEFLAQTLQRAPLPLGHIKRMQEVYNFNAINNSEIRFRWLR 331111--1111--------------1111--3333--------3333------------ LCIQSKWEDAIPLALKMATEQGRMKFTRPLFKDLAAFDKSHDQAVRTYQEHKASMHPVTA --11113333------------3333----------3333-----------1111----- MLVGKDLKVD ---------- >SYNTAXIN VAM3; SWP:Q12241; PDB:1HS7A; TNQKTKELSNLIETFAEQSRVLEKECTKIGSKRDSKELRYKIETELIPNCTSVRDKIESN ---------------------------2222---3333---------------------- ILIHQNGKLSADFKNLKTKYQSLQQSYNQRKSLFPLK 3333---3333------------------1111---- >n/a; SWP:P01891; PDB:1HSBA; 
GSHSMRYFYTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW -------------3333----------!!!!-----1111--------1111---3333- DRNTRNVKAQSQTDRVDLGTLRGYYNQSEAGSHTIQMMYGCDVGSDGRFLRGYRQDAYDG -------------------------------------------1111----------%%% KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQWRAYLEGTCVEWLRRYLENGKETLQ %-----3333-----------------------------------------------111 RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 1-------------------------------------iiii--2222------------ FQKWVAVVVPSGQEQRYTCHVQHEGLPKPL ---------22223333-----1111---- >FUSION PROTEIN CONSISTING; SWP:P0AEX9; PDB:1HSJA; KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDII -----------1111--------------------------3333-----3333------ FWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD --3333---------------3333---------1111iiii---------------333 LLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKD 3----------3333-----------------33333333----------------3333 VGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKV ------------------------1111-------------------3333--------- NYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLG ---------iiii-------------3333-3333------------------------- AVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEA ---------3333-------------------1111------------------------ LAAAQTNAAAEFMSKINDINDLVNATFQVKKFFRDTKKKFNLNYEEIYILNHILRSESNE ------1111-----------------------1111---------------1111---- ISSKEIAKCSEFKPYYLTKALQKLKDLKLLSKKRSLQDERTVIVYVTDTQKANIQKLISE ------------3333-------3333---------------------3333-------- LEEYIKN 3333--- >UDP-N-ACETYLENOLPYRUVOYLG; SWP:P61431; PDB:1HSKA; NKDIYQALQQLIPNEKIKVDEPLKRYTYTKTGGNADFYITPTKNEEVQAVVKYAYQNEIP ------------3333-----3333-1111------------3333-------------- VTYLGNGSNIIIREGGIRGIVISLLSLDHIEVSDDAIIAGSGAAIIDVSRVARDYALTGL ------------1111-------1111-----!!!!-------3333------------3 EFACGIPGSIGGAVYMNAGAYGGEVKDCIDYALCVNEQGSLIKLTTKELELDYRNSIIQK 
333----------------iiii3333--------1111--------------------- EHLVVLEAAFTLAPGKMTEIQAKMDDLTERRESKQPLEYPSCGSVFQRPPGHFAGKLIQD ---------------3333----------------1111---------2222------11 SNLQGHRIGGVEVSTKHAGFMVNVDNGTATDYENLIHYVQKTVKEKFGIELNREVRIIGE 112222----------1111---------------------------------------- HPK --- >HISTIDINE-BINDING PROTEIN; SWP:P0AEU0; PDB:1HSLA; AIPQKIRIGTDPTYAPFESKNAQGELVGFDIDLAKELCKRINTQCTFVENPLDALIPSLK --------------------1111--------------------------3333------ AKKIDAIMSSLSITEKRQQEIAFTDKLYAADSRLVVAKNSDIQPTVASLKGKRVGVLQGT -------------3333---------------------------33332222----2222 TQETFGNEHWAPKGIEIVSYQGQDNIYSDLTAGRIDAAFQDEVAASEGFLKQPVGKDYKF ---------3333-----------------------------------1111--1111-- GGPAVKDEKLFGVGTGMGLRKEDNELREALNKAFAEMRADGTYEKLAKKYFDFDVYGG ------3333---------3333----------------------------------- >PHOSPHOLIPASE C-GAMMA (SH; SWP:P19174; PDB:1HSQ; GSPTFKCAVKALFDYKAQREDELTFIKSAIIQNVEKQEGGWWRGDYGGKKQLWFPSNYVE -------------------------2222------------------------------- EMVNPEGIHRD ----------- >0.19 ALPHA-AMYLASE INHIBI; SWP:P01085; PDB:1HSSA; MCYPGQAFQVPALPACRPLLRLQCNGSQVPEAVLRDCCQQLAHISEWCRCGALYSMLDSM --2222------3333------------------------33333333------------ YKEHGAFPRCRREVVKLTAASITAVCRLPIVVDASGDGAYVCKDVAAYPDA ------2222----------------------3333------3333----- >HISTONE H5; SWP:P02259; PDB:1HSTA; SHPTYSEMIAAAIRAEKSRGGSSRQSIQKYIKSHYKVGHNADLQIKLSIRRLLAAGVLKQ ------------1111---------------------1111-----------3333---- TKGVGASGSFRLAK -------------- >ALPHA-AMYLASE ISOZYME 1; SWP:P00693; PDB:1HT6A; HQVLFQGFNWESWKQSGGWYNMMMGKVDDIAAAGVTHVWLPPPSHSVSNEGYMPGRLYDI --------1111--2222---3333-----1111-------------1111----11111 DASKYGNAAELKSLIGALHGKGVQAIADIVINHRCADYKDSRGIYCIFEGGTSDGRLDWG 111---------------1111-----------------1111-----------2222-3 PHMICRDDTKYSDGTANLDTGADFAAAPDIDHLNDRVQRELKEWLLWLKSDLGFDAWRLD 333-1111---------------1111---1111-------------------------- FARGYSPEMAKVYIDGTSPSLAVAEVWDNMATGGDGKPNYDQDAHRQNLVNWVDKVGGAA 
-1111--------1111---------------1111--------------------!!!! SAGMVFDFTTKGILNAAVEGELWRLIDPQGKAPGVMGWWPAKAVTFVDNHDTGSTQAMWP ---------------3333-3333--1111---3333-3333------------------ FPSDKVMQGYAYILTHPGIPCIFYDHFFNWGFKDQIAALVAIRKRNGITATSALKILMHE -1111-------1111---------------------------1111-1111-------1 GDAYVAEIDGKVVVKIGSRYDVGAVIPAGFVTSAHGNDYAVWEK 111----%%%%----------3333-2222-------------- >CYCLIC PARATHYROID HORMON; SWP:P01270; PDB:1HTH; SVSEIQLHNLGHLNEERVEWLRKKLQDVHNF -----------11113333------------ >Rho guanine nucleotide ex; SWP:O15085; PDB:1HTJF; ESDIIFQDLEKLKSRPAHLGVFLRYIFSQADPSPLLFYLCAEVYQQASPKDSRSLGKDIW 333333333333------------------------------------------------ NIFLEKNAPLRVKIPELQAEIDSRLRNSEDARGVLCEAQEAAPEIQEQIHDYRTKRTLGL -------------------------------------------------------1111- GSLYGENDLLDLDGDPLRERQVAEKQLAALGDILSAYAADRSAPDFALNTYSHAGIRL ----33333333---------------------11111111----------1111--- >Gastricsin [Precursor]; SWP:P20142; PDB:1HTRB; SVTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSE ----33331111-----------------------------1111-3333------3333 SSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGI ------------------------------iiii--------------1111-------- MGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQQGSSGGAVVFGGVDSSLYTGQI ---------------------------------------------------1111----- YWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVDTGTSLLTVPQQYMSALLQATGAQ -------------------!!!!----1111-----1111-----3333----------- EDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSSYILSNNGYCTVGVEPTYLSSQNGQP -1111----11111111------iiii----3333-------------------1111-- LWILGDVFLRSYYSVYDLGNNRVGFATAA --------1111-----1111-------- >PGC protein; SWP:Q8IUM8; PDB:1HTRP; AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL -----------------1111--------------1111---- >HISTIDYL-TRNA SYNTHETASE; SWP:P04804; PDB:1HTTA; NIQAIRGMNDYLPGETAIWQRIEGTLKNVLGSYGYSEIRLPIVEQTPLFKRAIGEVTDVV ----2222---3333---------------1111----------3333--------3333 EKEMYTFEDRNGDSLTLRPEGTAGCVRAGIEHGLLYNQEQRLWYIGPMFRHERPQKGRYR 
------------------------------------------------------1111-- QFHQLGCEVFGLQGPDIDAELIMLTARWWRALGISEHVTLELNSIGSLEARANYLDEESR ------------------------------------------------------------ EHFAGLCKLLESAGIAYTVNQRLVRGLDYYNRTVFEWVTNQGTVCAGGRYDGLVEQLGGR ----------1111-----1111--------------------------11113333--- ATPAVGFAMGLERLVLLVQAVNPEFKADPVVDIYLVASGADTQSAAMALAERLRDELPGV --------------------------------------------------------2222 KLMTNHGGGNFKKQFARADKWGARVAVVLGESEVANGTAVVKDLRSGEQTAVAQDSVAAH ------------------1111------------------------------3333---- LRTLLG ------ >HI0065; SWP:P44492; PDB:1HTWA; MESLTQYIPDEFSMLRFGKKFAEILLKLHTEKAIMVYLNGDLGAGKTTLTRGMLQGIGHQ -------------------------3333-----------2222----------1111-- GNVKSPTYTLVEEYNIAGKMIYHFDLYRLADPEELEFMGIRDYFNTDSICLIEWSEKGQG -----1111------iiii------1111-3333------3333---------3333--- ILPEADILVNIDYYDDARNIELIAQTNLGKNIISAFSN -------------!!!!--------------------- >EIF4GII; SWP:O43432; PDB:1HU3A; SDPENIKTQELFRKVRSILNKLTPQFNQLKQVSGLTVDTEERLKGVIDLVFEKAIDEPSF -3333--------------------3333--1111---3333--------------3333 SVAYANCRCLVTLKVPNFRKLLLNRCQKEFEKDKAAKDKARRRSIGNIKFIGELFKLKLT -------------------------------1111-------------------1111-3 EAIHDCVVKLLKNHDEESLECLCRLLTTIGKDLDFEKAKPRDQYFNQEKIVKERKTSSRI 333---------------------------33331111-----3333--3333-----33 RFLQDVIDLRLCNWVS 33-------1111--- >HU PROTEIN; SWP:P0A3H0; PDB:1HUEA; MNKTELINAVAETSGLSKKDATKAVDAVFDSITEALRKGDKVQLIGFGNFEVRERAARKG ---3333----------------------------1111--------------------- RNPQTGEEMEIPASKVPAFKPGKALKDAVK ------------------------------ >TYROSINE PHOSPHATASE YOPH; SWP:O68720; PDB:1HUFA; LSLSDLHRQVSRLVQQESGDCTGKLRGNVAANKETTFQGLTIASGARESEKVFAQTVLSH -------------11111111-------------------3333--3333--------11 VANVVLTQEDTAKLLQSTVKHNLNNYDLRSVGNGNSVLVSLRSDQMTLQDAKVLLEAALR 11----3333---------------------iiii------------------------- QES --- >Insulin [Precursor]; SWP:P01308; PDB:1HUIB; EVNQHLCGSELVEALELVCGERGFFYEPK --------3333----------------- >INTERLEUKIN-5; 
SWP:P05113; PDB:1HULA; IPTSALVKETLALLSTHRTLLIANETLRIPVPVHKNHQLCTEEIFQGIGTLESQTVQGGT -3333------------------1111--------11111111-------1111------ VERLFKNLSLIKKYIDGQKKKCGEERRRVNQFLDYLQEFLGVMNTEWI -------------------1111------------------------- >HUMAN MACROPHAGE INFLAMMA; SWP:P13236; PDB:1HUMA; APMGSDPPTACCFSYTARKLPRNFVVDYYETSSLCSQPAVVFQTKRSKQVCADPSESWVQ --------------------1111----------------------------3333---- EYVYDLELN --------- >MANNOSE-BINDING PROTEIN; SWP:P11226; PDB:1HUP; AASERKALQTEMARIKKWLTFSLGKQVGNKFFLTNGEIMTFEKVKALCVKFQASVATPRN --------------------------!!!!------------------1111-------- AAENGAIQNLIKEEAFLGITDEKTEGQFVDLTGNRLTYTNWNEGEPNNAGSDEDCVLLLK --------------------3333-----1111--------2222--2222-------11 NGQWNDVPCSTSHLAVCEFPI 11-----1111---------- >RAB5C; SWP:P35278; PDB:1HUQA; ICQFKLVLLGESAVGKSSLVLRFVKGQFHEYQESTIGAAFLTQTVCLDDTTVKFEIWDTA ----------2222---------------------------------------------- GQERYHSLAPMYYRGAQAAIVVYDITNTDTFARAKNWVKELQRQASPNIVIALAGNKADL -333311111111----------1111------------------1111-------3333 ASKRAVEFQEAQAYADDNSLLFMETSAKTAMNVNEIFMAIAKKL ---------------1111-------1111----------1111 >RIBOSOMAL PROTEIN S7; SWP:P22744; PDB:1HUS; RDVLPDPIYNSKLVTRLINKIIDGKKSKAQKILYTAFDIIRERTGKDPEVFEQALKNVPV ------1111-----------%%%%-----------------------------1111-- LEVRARRVGGANYQVPVEVRPDRRVSLGLRWLVQYARLRNEKTEERLANEIDAANNTGAA --------------------3333----------3333-------------3333----- VKKREDTHKAEAN -----1111---- ------------------------------------------------------------ -------------------- >HUMAN GROWTH HORMONE; SWP:P01241; PDB:1HUW; FPTIPLSRLADNAWLRADRLNQLAFDTYQEFEEAYIPKEQIHSFWWNPQTSLCPSESIPT ------------------------------------------------33331111---- PSNKEETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEG ------3333-------------11113333----------2222--------------- IQTLMGRLEALLKNYGLLYCFNKDMSKVSTYLRTVQCRSVEGSCGF ---------3333--------------------------------- >ACTIVATOR OF (R)-2-HYDROX; SWP:P11568; PDB:1HUXA; 
SIYTLGIDVGSTASKCIILKDGKEIVAKSLVAVGTGTSGPARSISEVLENAHMKKEDMAF -------------------%%%%---------------3333-----------1111--- TLATGYGRNSLEGIADKQMSELSCHAMGASFIWPNVHTVIDIGGQDVKVIHVENGTMTNF ------11113333------------------1111------2222-------------- QMNDKCAAGTGRFLDVMANILEVKVSDLAELGAKSTKRVAISSTCTVFAESEVISQLSKG ------2222-------------1111----1111---------3333--------1111 TDKIDIIAGIHRSVASRVIGLANRVGIVKDVVMTGGVAQNYGVRGALEEGLGVEIKTSPL --------------------3333-----------3333---------1111-----111 AQYNGALGAALYAYKKAAK 1-------------1111- >ELONGIN C; SWP:Q03071; PDB:1HV2A; MSQDFVTLVSKDDKEYEISRSAAMISPTLKAMIEGPFRESKGRIELKQFDSHILEKAVEY ------------------3333-------------3333----------3333------- LNYNLKYSGVSEDDDEIPEFEIPTEMSLELLLAADYLSI ----------------------3333------------- >STROMELYSIN 3; SWP:Q02853; PDB:1HV5A; MFVLSGGRWEKTDLTYRILRFPWQLVREQVRQTVAEALQVWSEVTPLTFTEVHEGRADIM ----iiii-------------11113333-----------3333---------------- IDFARYWHGDNLPFDGPGGILAHAFFPKTHREGDVHFDYDETWTIGDNQGTDLLQVAAHE --------------------------1111-------1111------------------- FGHVLGLQHTTAAKALMSPFYTFRYPLSLSPDDRRGIQHLYG --1111--------1111-----------3333--------- >PUTATIVE ATP-DEPENDENT RN; SWP:Q58083; PDB:1HV8A; VEYNFNELNLSDNILNAIRNKGFEKPTDIQKVIPLFLNDEYNIVAQARTGSGKTASFAIP ---3333----3333-----------3333----------------------3333---- LIELVNENNGIEAIILTPTRELAIQVADEIESLKGNKNLKIAKIYGGKAIYPQIKALKNA -----------------------------1111---------------3333-------- NIVVGTPGRILDHINRGTLNLKNVKYFILDEADELNGFIKDVEKILNACNKDKRILLFSA --------------------1111------3333-------------------------- TPREILNLAKKYGDYSFIKAKINANIEQSYVEVNENERFEALCRLLKNKEFYGLVFCKTK --------------------3333---------33333333------------------- RDTKELASLRDIGFKAGAIHGDLSQSQREKVIRLFKQKKIRILIATDVSRGIDVNDLNCV ---------1111---------------------3333---------------------- INYHLPQNPESYHRIGRTGRAGKKGKAISIINRREYKKLRYIERAKLKIKKLK -------3333--------------------1111------------------ >UDP-N-ACETYLGLUCOSAMINE P; SWP:P17114; PDB:1HV9A; 
NAMSVVILAAGKGTRMYSDLPKVLHTLAGKAMVQHVIDAANELGAAHVHLVYGHGGDLLK ------------1111----1111--iiii3333------1111---------------- QALKDDNLNWVLQAEQLGTGHAMQQAAPFFADDEDILMLYGDVPLISVETLQRLRDAKPQ -------------------------3333-1111--------1111--------111122 GGIGLLTVKLDDPTGYGRITRENGKVTGIVEHKDATDEQRQIQEINTGILIANGADMKRW 22----------2222-----iiii-----3333-3333--------------------3 LAKLTNNNAQGEYYITDIIALAYQEGREIVAVHPQRLSEVEGVNNRLQLSRLERVYQSEQ 333----3333------------------------3333--------------------- AEKLLLAGVMLRDPARFDLRGTLTHGRDVEIDTNVIIEGNVTLGHRVKIGTGCVIKNSVI ----1111----1111---------------------------------2222------- GDDCEISPYTVVEDANLAAACTIGPFARLRPGAELLEGAHVGNFVEMKKARLGKGSKAGH 2222-------------------------2222--2222-------------2222---- LTYLGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAA ---------------2222---------------------------------2222--22 GTTVTRNVGENALAISRVPQTQKEGWRRP 22------2222----------------- >ALPHA-AMYLASE; SWP:P06279; PDB:1HVXA; AAPFNGTMMQYFEWYLPDDGTLWTKVANEANNLSSLGITALWLPPAYKGTSRSDVGYGVY ------------1111-----------------1111-------------1111------ DLYDLGEFNQKGAVRTKYGTKAQYLQAIQAAHAAGMQVYADVVFDHKGGADGTEWVDAVE 1111-----iiii--1111------------1111------------------------- VNPSDRNQEISGTYQIQAWTKFDFPGRGNTYSSFKWRWYHFDGVDWDESRKLSRIYKFRG -1111------------------3333---------1111-------------------2 IGKAWDWEVDTENGNYDYLMYADLDMDHPEVVTELKSWGKWYVNTTNIDGFRLDAVKHIK 222-------2222----------1111---------------------------11111 FSFFPDWLSYVRSQTGKPLFTVGEYWSYDINKLHNYIMKTNGTMSLFDAPLHNKFYTASK 111------------------------------------%%%%----------------- SGGTFDMRTLMTNTLMKDQPTLAVTFVDNHDTEPGQALQSWVDPWFKPLAYAFILTRQEG iiii-3333----3333-1111------11112222------3333-------------- YPCVFYGDYYGIPQYNIPSLKSKIDPLLIARRDYAYGTQHDYLDHSDIIGWTREGVTEKP ----3333----1111---3333--------------------------------1111- GSGLAALITDGPGGSKWMYVGKQHAGKVFYDLTGNRSDTVTINSDGWGEFKVNGGSVSVW --------------------3333------3333--------1111-------------- VPR --- >THYMIDYLATE SYNTHASE; SWP:P04818; PDB:1HVYA; 
PPHGELQYLGQIQHILRGVRKDDRTGTGTLSVFGMQARYSLRDEFPLLTTKRVFWKGVLE --3333----------------1111---------------------------3333--- ELLWFIKGSTNAKELSSKGVKIWDANGSRDFLDSLGFSTREEGDLGPVYGFQWRHFGAEY ----------33333333--11111111----111111112222----------2222-- RDMESDYSGQGVDQLQRVIDTIKTNPDDRRIIMCAWNPRDLPLMALPPCHALCQFYVVNS -1111-2222--------------1111--------3333-----------------%%% ELSCQLYQRSGDMGLGVPFNIASYALLTYMIAHITGLKPGDFIHTLGDAHIYLNHIEPLK %-------------------------------1111---------------1111----- IQLQREPRPFPKLRILRKVEKIDDFKAEDFQIEGYNPHPTIKMEMAV -1111---------------3333-1111------------------ >FATTY ACID METABOLISM REG; SWP:P09371; PDB:1HW1A; AQSPAGFAEEYIIESIWNNRFPPGTILPAERELSELIGVTRTTLREVLQRLARDGWLTIQ ---------------1111--2222--------------------------1111----- HGKPTKVNNFWETSGLNILETLARLDHESVPQLIDNLLSVRTNISTIFIRTAFRQHPDKA --------3333--3333-------3333------------------------------- QEVLATANEVADHADAFAELDYNIFRGLAFASGNPIYGLILNGMKGLYTRIGRHYFANPE ------2222-------------------------------------------1111--- ARSLALGFYHKLSALCSEGAHDQVYETVRRYGHESGEIWHRMQKNL --------------------3333-----------------3333- >CATABOLITE GENE ACTIVATOR; SWP:P03020; PDB:1HW5A; VLGKPQTDPTLEWFLSHCHIHKYPSKSTLIHQGEKAETLYYIVKGSVAVLIKDEEGKEMI --------------1111-----2222---2222------------------1111---- LSYLNQGDFIGELGLFEEGQERSAWVRAKTACEVAEISYKKFRQLIQVNPDILMRLSAQM ----2222---1111--------------------------------------------- ARRLQVLAEKVGNLAFLDVTGRIAQTLLNLAKQPDAMTHPDGMQIKITRQEIGQIVGCSR ----------------------------1111------1111-----3333--------- ETVGRILKMLEDQNLISAHGKTIVVYGT ---------------------------- >2,5-DIKETO-D-GLUCONIC ACI; SWP:P06632; PDB:1HW6A; TVPSIVLNDGNSIPQLGYGVFKVPPADTQRAVEEALEVGYRHIDTAAIYGNEEGVGAAIA ------1111-------------3333-----------------3333------------ ASGIARDDLFITTKLWNDEPAAAIAESLAKLALDQVDLYLVHWPTPAADNYVHAWEKMIE ----3333------------------------------------3333------------ LRAAGLTRSIGVSNHLVPHLERIVAATGVVPAVNQIELHPAYQQREITDWAAAHDVKIES -1111---------------------------------3333---------1111----- 
WGPLGQGKYDLFGAEPVTAAAAAHGKTPAQAVLRWHLQKGFVVFPKSVRRERLEENLDVF -1111-----11113333------------------1111--------------1111-- DFDLTDTEIAAIDAMDP ----------------- >HEAT SHOCK PROTEIN HSP33; SWP:P0A6Y5; PDB:1HW7A; HDQLHRYLFENFAVRGELVTVSETLQQILENHDYPQPVKNVLAELLVATSLLTATLKFDG ---------------------------------------------------3333----- DITVQLQGDGPMNLAVINGNNNQQMRGVARVQGEIPENADLKTLVGNGYVVITITPSEGE -------------------1111------------------------------------- RYQGVVGLEGDTLAACLEDYFMRSEQLPTRLFIRTGDVDGKPAAGGMLLQVMPAQNAQQD -------------------------------------iiii------------3333333 DFDHLATLTETIKTEELLTLPANEVLWRLYHEEEVTVYDPQDVEFKCTC 3----------------------------3333---------------- >EBULIN; SWP:Q9AVR2; PDB:1HWMA; IDYPSVSFNLAGAKSTTYRDFLKNLRDRVATGTYEVNGLPVLRRESEVQVKNRFVLVRLT ---------2222----------------------iiii---------3333-------- NYNGDTVTSAVDVTNLYLVAFSANGNSYFFKDATELQKSNLFLGTTQHTLSFTGNYDNLE 1111-------------------------1111333311112222---------3333-- TAAGTRRESIELGPNPLDGAITSLWYDGGVARSLLVLIQMVPEAARFRYIEQEVRRSLQQ -----3333--------------1111---3333-------------------------- LTSFTPNALMLSMENNWSSMSLEVQLSGDNVSPFSGTVQLQNYDHTPRLVDNFEELYKIT ------3333--------------3333-------------1111--------------- GIAILLFRCVA ----------- >Ribosome-inactivating pro; SWP:Q9AVR2; PDB:1HWMB; ETCAIPAPFTRRIVGRDGLCVDVRNGYDTDGTPIQLWPCGTQRNQQWTFYNDKTIRSMGK --------------2222----2222--2222----------1111---1111---iiii CMTANGLNSGSYIMITDCSTAAEDATKWEVLIDGSIINPSSGLVMTAPSGASRTTLLLEN -------2222-----3333----------1111----------------2222------ NIHAASQGWTVSNDVQPIATLIVGYNEMCLQANGENNNVWMEDCDVTSVQQQWALFDDRT ---1111-------------------------------------3333-------1111- IRVNNSRGLCVTSNGYVSKDLIVIRKCQGLATQRWFFNSDGSVVNLKSTRVMDVKESDVS ----------------2222---------1111----1111----1111-----3333-- LQEVIIFPATGNPNQQWRTQVPQI -----------1111--------- >Heme-responsive zinc fing; SWP:P12351; PDB:1HWTC; RIPLSCTICRKRKVKCDKLRPHCQQCTKTGVAHLCHYMEQTWAEEAEKELLKDNELKKLR --------------------------11113333-------------------------- ERVKSLEKTL 
---------- >PII PROTEIN; SWP:NA; PDB:1HWUA; MKQVTAIIKPFKLDEVRESLAEVGVTGLTVTEVKGFGYVVDFLPKVKIEVVVDDKVVEQA --------1111--------1111----------------------------1111---- VDAIIKAARTGKIGDGKIFVQEVEQVIRIRTGETGPDAV -----------2222------------------!!!!-- >ALPHA AMYLASE (PPA); SWP:P00690; PDB:1HX0A; YAPQTQSGRTSIVHLFEWRWVDIALECERYLGPKGFGGVQVSPPNENIVVTNPSRPWWER -----2222-----2222-------------1111--------------------1111- YQPVSYKLCTRSGNENEFRDMVTRCNNVGVRIYVDAVINHMCGSGAAAGTGTTCGSYCNP ---------1111------------1111-------------1111-----1111----1 GNREFPAVPYSAWDFNDGKCKTASGGIESYNDPYQVRDCQLVGLLDLALEKDYVRSMIAD 111-1111--3333--3333-1111---1111--------iiii---3333--------- YLNKLIDIGVAGFRIDASKHMWPGDIKAVLDKLHNLNTNWFPAGSRPFIFQEVIDLGGEA ----------------3333-3333----1111---3333-2222--------------- IKSSEYFGNGRVTEFKYGAKLGTVVRKWSGEKMSYLKNWGEGWGFMPSDRALVFVDNHDN -33333333----3333----------iiii333311113333---1111------3333 QRGHGAGGSSILTFWDARLYKIAVGFMLAHPYGFTRVMSSYRWARNFVNGEDVNDWIGPP -------3333-3333-------------------------------iiii1111----- NNNGVIKEVTINADTTCGNDWVCEHRWREIRNMVWFRNVVDGQPFANWWDNGSNQVAFGR -iiii------1111--%%%%-3333-3333--------2222----------------! 
GNRGFIVFNNDDWQLSSTLQTGLPGGTYCDVISGDKVGNSCTGIKVYVSSDGTAQFSISN !!!---------------------------------!!!!--------1111------11 SAEDPFIAIHAESKL 11-------1111-- >BAG family molecular chap; SWP:Q99933; PDB:1HX1B; GNSPQEEVELKKLKHLEKSVEKIADQLEELNKELTGIQQGFLPKDLQAEALCKLDRRVKA --3333------------------------------------------------------ TIEQFMKILEEIDTLILPENFKDSRLKRKGLVKKVQAFLAECDTVEQNICQE -----------1111--1111-----------------------1111---- >BSTI; SWP:Q90248; PDB:1HX2A; NFVCPPGQTFQTCASSCPKTCETRNKLVLCDKKCNQRCGCISGTVLKSKDSSECVHPSKC ----2222-----------3333--------------------------------3333- >10 KDA CHAPERONIN; SWP:P09621; PDB:1HX5A; IKPLEDKILVQATTASGLVIPPQEGTVVAVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGG -------------3333------------------3333--------2222-----2222 TEIKYNGEEYLILSARDVLAVV ----iiii-----3333----- >MAJOR CAPSID PROTEIN; SWP:P22535; PDB:1HX6A; LRNQQAMAANLQARQIVLQQSYPVIQQVETQTFDPANRSVFDVTPANVGIVKGFLVKVTA --3333---------------------------3333----------------------- AITNNHATEAVALTDFGPANLVQRVIYYDPDNQRHTETSGWHLHFVNTAKQGAPFLSSMV -------------1111-----------1111---------------------2222--- TDSPIKYGDVMNVIDAPATIAAGATGELTMYYWVPLAYSETDLTGAVLANVPQSKQRLKL --------------------2222--------------11112222-------------- EFANNNTAFAAVGANPLEAIYQGAGAADCEFEEISYTVYQSYLDQLPVGQNGYILPLIDL ---1111---2222-1111---1111-----------------------------3333- STLYNLENSAQAGLTPNVDFVVQYANLYRYLSTIAVFDNGGSFNAGTDINYLSQRTANFS --------------2222--------------------iiii---1111------2222- DTRKLDPKTWAAQTRRRIATDFPKGVYYCDNRDKPIYTLQYGNVGFVVNPKTVNQNARLL -------------3333-----2222----3333-----2222----------------- MGYEYFTSRT ---------- >SYNAPSE-ENRICHED CLATHRIN; SWP:Q9VI75; PDB:1HX8A; QGLAKSVCKATTEECIGPKKKHLDYLVHCANEPNVSIPHLANLLIERSQNANWVVVYKSL -------------------------------1111-----------1111---------- ITTHHLMAYGNERFMQYLASSNSTFNLSSFLDKGTGGMGVPGGRMGYDMSPFIRRYAKYL -----------------------------------------!!!!--------------- NEKSLSYRAMAFDFCKVEGSLRSMNAEKLLKTLPVLQAQLDALLEFDCQSNDLSNGVINM 
------------1111---3333--------------------3333-3333-------- SFMLLFRDLIRLFACYNDGIINLLEKYFDMNKKHARDALDLYKKFLVRMDRVGEFLKVAE -------------------------3333------------------------------- NVGIDKGDIPDLTKAPSSLLDALEQHLATL ----3333-------3333----------- >3BETA/17BETA-HYDROXYSTERO; SWP:P19871; PDB:1HXHA; TNRLQGKVALVTGGASGVGLEVVKLLLGEGAKVAFSDINEAAGQQLAAELGERSMFVRHD -1111---------------------1111--------------------1111-----1 VSSEADWTLVMAAVQRRLGTLNVLVNNAGILLPGDMETGRLEDFSRLLKINTESVFIGCQ 111-------------------------------3333---------------------- QGIAAMKETGGSIINMASVSSWLPIEQYAGYSASKAAVSALTRAAALSCRKQGYAIRVNS -----------------1111---1111-------------------------------- IHPDGIYTPMMQASLPKGVSKEMVLHDPKLNRAGRAYMPERIAQLVLFLASDESSVMSGS ---------------22223333-------1111---3333---------3333------ ELHADNSILGMGL ----%%%%----- >PEROXISOME TARGETING SIGN; SWP:Q9U763; PDB:1HXIA; NNTDYPFEANNPYYHENPEEGLSLKLANLAEAALAFEAVCQKEPEREEAWRSLGLTQAEN -----------------------1111---------------1111-----------111 EKDGLAIIALNHARLDPKDIAVHAALAVSHTNEHNANAALASLRAWLL 1--------------1111----------------------------- >GAMMA-DELTA T-CELL RECEPT; SWP:NA; PDB:1HXMA; AIELVPEHQTVPVSIGVPATLRCSMKGEAIGNYYINWYRKTQGNTMTFIYREKDIYGPGF -------------2222-----------3333------------------------2222 KDNFQGDIDIAKNLAVLKILAPSERDEGSYYCACDTLGMGGEYTDKLIFGKGTRVTVEPR --------3333----------3333---------------------------------- SQPHTKPSVFVMKNGTNVACLVKEFYPKDIRINLVSSKKITEFDPAIVISPSGKYNAVKL -------------------------------------------------1111------- GKYEDSNSVTCSVQHDNKTVHSTDFE ----3333------%%%%--1111-- >GAMMA-DELTA T-CELL RECEPT; SWP:NA; PDB:1HXMB; AGHLEQPQISSTKTLSKTARLECVVSGITISATSVYWYRERPGEVIQFLVSISYDGTVRK ----------------------------------------2222--------3333---- ESGIPSGKFEVDRIPETSTSTLTIHNVEKQDIATYYCALWEAQQELGKKIKVFGPGTKLI 2222----------1111---------1111----------------------------- ITDKQLDADVSPKPTIFLPSIAETKLQKAGTYLCLLEKFFPDVIKIHWEEKKSNTILGSQ -------------------3333------------------------------------- 
EGNTMKTNDTYMKFSWLTVPEKSLDKEHRCIVRHENNKNGVDQEIIFPPI ----------------------3333-------1111------------- >HEMOPEXIN; SWP:P20058; PDB:1HXN; ESTRCDPDLVLSAMVSDNHGATYVFSGSHYWRLDTNRDGWHSWPIAHQWPQGPSTVDAAF -33331111-------1111-----!!!!--------------3333------------- SWEDKLYLIQDTKVYVFLTKGGYTLVNGYPKRLEKELGSPPVISLEAVDAAFVCPGSSRL -------------------------2222--3333------------------------- HIMAGRRLWWLDLKSGAQATWTELPWPHEKVDGALCMEKPLGPNSCSTSGPNLYLIHGPN ---!!!!----33331111-------------------------------------!!!! LYCYRHVDKLNAAKNLPQPQRVSRLLGCTH ------------------------------ >GUANINE NUCLEOTIDE EXCHAN; SWP:Q08326; PDB:1HXRA; ELVSAEGRNRKAVLCQRCGSRVLQPGTALFSRRQLFLPSMRKKPDGDVLEEHWLVNDMFI ---1111----------------2222-----------------------------3333 FENVGFTKDVGNVKFLVCADCEIGPIGWHCLDDKNSFYVALERVSHE ---------iiii----------------3333------3333---- >Genome polyprotein; SWP:P03300; PDB:1HXS1; GSSSTAATSRDALPNTEASGPTHSKEIPALTAVETGATNPLVPSDTVQTRHVVQHRSRSE -------1111---------------3333-3333------3333------------111 SSIESFFARGACVTIMTVDNPASTTNKDKLFAVWKITYKDTVQLRRKLEFFTYSRFDMEL 1--------------------1111----------------------------------- TFVVTANFTETNNGHALNQVYQIMYVPPGAPVPEKWDDYTWQTSSNPSIFYTYGTAPARI -------------------------------------3333----------2222----- SVPYVGISNAYSHFYDGFSKVPLKDQSAALGDSLYGAASLNDFGILAVRVVNDHNPTKVT ----------------------11113333---2222-1111------------------ SKIRVYLKPKHIRVWCPRPPRAVAYYGPGVDYKDGTLTPLSTKDLTTY --------------------------------2222-------1111- >Genome polyprotein; SWP:P03300; PDB:1HXS2; ACGYSDRVLQLTLGNSTITTQEAANSVVAYGRWPEYLRDSEANPVDQPTEPDVAACRFYT ----1111----!!!!-----------2222------3333---------!!!!------ LDTVSWTKESRGWWWKLPDALRDMGLFGQNMYYHYLGRSGYTVHVQCNASKFHQGALGVF ------1111----------1111-------------------------1111------- AVPEMCLAGDSNTTTMHTSYQNANPGEKGGTFTGTFTPDNNQTSPARRFCPVDYLLGNGT -------------------3333--3333----------------------1111----- LLGNAFVFPHQIINLRTNNCATLVLPYVNSLSIDSMVKHNNWGIAILPLAPLNFASESSP 33331111-----3333-----------------3333---------------------- 
EIPITLTIAPMCCEFNGLRNITLPRLQ --------------------------- >Genome polyprotein; SWP:P03300; PDB:1HXS3; GLPVMNTPGSNQYLTADNFQSPCALPEFDVTPPIDIPGEVKNMMELAEIDTMIPFDLSAT ------2222---1111-------2222-------------33331111--------111 KKNTMEMYRVRLSDKPHTDDPILCLSLSPASDPRLSHTMLGEILNYYTHWAGSLKFTFLF 1----1111------------------1111---1111---------------------- CGSMMATGKLLVSYAPPGADPPKKRKEAMLGTHVIWDIGLQSSCTMVVPWISNTTYRQTI --1111-----------------33331111----------------------------- DDSFTEGGYISVFYQTRIVVPLSTPREMDILGFVSACNDFSVRLLRDTTHIEQKA -3333-------------------------------1111--------------- >Genome polyprotein; SWP:P03300; PDB:1HXS4; GAQVSSQKVGAHENSNRAYGGSTINYTTINYYRDSASNAASKQDFSQDPSKFTEPIKDVL ---------------------------------3333-----------3333-------- IKTAPMLN 1111---- >TRIGGER FACTOR; SWP:P47480; PDB:1HXVA; KLANGDIAIIDFTGIVDNKKLASASAQNYELTIGSNSFIKGFETGLIAMKVNQKKTLALT ---------------------------------------------1111----------- FPSDYHVKELQSKPVTFEVVLKAIK -3333-3333--------------- >DELTA CRYSTALLIN I; SWP:P24057; PDB:1HY0A; DPIMQMLSTSISTEQRLSEVDIQASIAYAKALEKAGILTKTELEKILSGLEKISEELSKG --3333--------------------------------------------------1111 VIVVTQSDEDIQTANERRLKELIGDIAGKLHTGRSRNEQVVTDLKLFMKNSLSIISTHLL ----3333---------------33333333----------------------------- QLIKTLVERAAIEIDVILPGYTHLQKAQPIRWSQFLLSHAVALTRDSERLGEVKKRINVL ------------1111-----%%%%-----3333-------------------------- PLGSGALAGNPLDIDREMLRSELEFASISLNSMDAISERDFVVEFLSVATLLLIHLSKMA 2222----------------------------------3333------------------ EDLIIYSTSEFGFLTLSDAFSTGSSLMPQKKNPDSLELIRSKSGRVFGRLASILMVLKGL ----------------3333------1111--------------------------2222 PSTYNKDLQEDKEAVIDVVDTLTAVLQVATGVISTLQISKENMEKALTPEMLATDLALYL ----3333-----------------------------------33333333--------- VRKGMPFRQAHTASGKAVHLAETKGIAINNLTLEDLKSISPLFSSDVSQVFNFVNSVEQY 1111----------------------1111---------3333--3333----------- TALGGTAKSSVTTQIEQLRELMKKQKE -2222---------------------- >YERSINIA PESTIS VIRULENCE; SWP:P31493; PDB:1HY5A; 
TSFSDSIKQLAAETLPKYQQLNSLDAELQKNHDQFATGSGPLRGSITQCQGLQFCGGELQ --------------33333333--------3333-------------------------- AEASAILNTPVCGIPFSQWGTIGGAASAYVASGVDLTQAANEIKGLAQQQKLLSL ----------iiii3333---------------------------------3333 >STROMELYSIN-1; SWP:P08254; PDB:1HY7A; FRTFPGIPKWRKTHLTYRIVNYTPDLPKDAVDSAVEKALKVWEEVTPLTFSRLYEGEADI ---2222---------------33333333-----------3333--------------- MISFAVREHGDFYPFDGPGNVLAHAYAPGPGINGDAHFDDDEQWTKDTTGTNLFLVAAHE -----------------------------!!!!--------------------------- IGHSLGLFHSANTEALMYPLLTRFRLSQDDINGIQSLYGPPP --1111-----1111-----1111------------------ >COCAINE AND AMPHETAMINE R; SWP:Q16568; PDB:1HY9A; YGQVPMCDAGEQCAVRKGARIGKLCDCPRGTSCNSFLLKCL ---------------------------2222---------- >L-LACTATE/MALATE DEHYDROG; SWP:Q60176; PDB:1HYEA; MKVTIIGASGRVGSATALLLAKEPFMKDLVLIGREHSINKLEGLREDIYDALAGTRSDAN ------1111------------1111-------1111--------------2222----- IYVESDENLRIIDESDVVIITSGVPRKEGMSRMDLAKTNAKIVGKYAKKIAEICDTKIFV --------3333------------------------------------------------ ITNPVDVMTYKALVDSKFERNQVFGLGTHLDSLRFKVAIAKFFGVHIDEVRTRIIGEHGD ------------------1111---!!!!----------------3333---------11 SMVPLLSATSIGGIPIQKFERFKELPIDEIIEDVKTKGEQIIRFGPAAAILNVVRCIVNN 11--1111--iiii33333333--------------1111----------------1111 EKRLLTLSAYVDGEFDGIRDVCIGVPVKIGRDGIEEVVSIELDKDEIIAFRKSAEIIKKY -----------------------------1111--------------------------- CEEVKNL ---1111 >L-2-HYDROXYISOCAPROATE DE; SWP:P14295; PDB:1HYHA; ARKIGIIGLGNVGAAVAHGLIAQGVADDYVFIDANEAKVKADQIDFQDAMANLEAHGNIV ---------3333-------1111------------------------3333-------- INDWAALADADVVISTLGNIKLQQFAELKFTSSMVQSVGTNLKESGFHGVLVVISNPVDV --33331111--------3333-------------------------------------- ITALFQHVTGFPAHKVIGTGTLLDTARMQRAVGEAFDLDPRSVSGYNLGEHGNSQFVAWS ------------1111--!!!!----------------3333-------2222----333 TVRVMGQPIVTLIDLAAIEEEARKGGFTVLNGKGYTSYGVATSAIRIAKAVMADAHAELV 3--iiii3333---------------------------------------1111------ 
VSNRRDDMGMYLSYPAIIGRDGVLAETTLDLTTDEQEKLLQSRDYIQQRFDEIVDTL ----3333----------1111------------------------------3333- >AGOUTI RELATED PROTEIN; SWP:O00253; PDB:1HYKA; CVRLHESCLGQQVPCCDPCATCYCRFFNAFCYCRKLGTAMNPCSRT ----------------2222-----------------3333----- >HYDROLYZED CUCURBITA MAXI; SWP:P19873; PDB:1HYMA; SSCPGKSSWPHLVGVGGSVAKAIIERQNPNVKAVILEEGTPVTK -----------2222----------------------------- >Band 3 anion transport pr; SWP:P02730; PDB:1HYNP; KVYVELQELVMDEKNQELRWMEAARWVQLEENLGENGAWGRPHLSHLTFWSLLELRRVFT ---------------------------------1111----------3333-------11 KGTVLLDLQETSLAGVANQLLDRFIFEDQIRPQDREELLRALLLKHSHAGELEALGGVKP 11----------------------------1111------1111---3333-3333---- AVLTRSGDPSQPLLPQHSSLETQLFCEQGDGGTEGHSPSGILEKIPPDSEATLVLVGRAD ---1111-----------------------3333-------1111-------------11 FLEQPVLGFVRLQEAAELEAVELPVPIRFLFVLLGPEAPHIDYTQLGRAAATLMSERVFR 11---------------3333--------------------------------------- IDAYMAQSRGELLHSLEGFLDCSLVLPPTDAPSEQALLSLVPVQRELLRRRYQ -------3333--------1111------------------------------ >FUMARYLACETOACETATE HYDRO; SWP:P35505; PDB:1HYOA; MSFIPVAEDSDFPIQNLPYGVFSTQSNPKPRIGVAIGDQILDLSVIKHLFTGPALSKHQH ------1111--3333-------3333--------!!!!----1111----3333---33 VFDETTLNNFMGLGQAAWKEARASLQNLLSASQARLRDDKELRQRAFTSQASATMHLPAT 33-------------------------------3333-----------3333-------- IGDYTDFYSSRQHATNVGIMFRGKENALLPNWLHLPVGYHGRASSIVVSGTPIRRPMGQM ----------------------3333--1111----------1111-2222--------- RPDNSKPPVYGACRLLDMELEMAFFVGPGNRFGEPIPISKAHEHIFGMVLMNDWSARDIQ --1111------------------------2222--3333-1111--------------- QWEYVPLGPFLGKSFGTTISPWVVPMDALMPFVVPNPKQDPKPLPYLCHSQPYTFDINLS ---------3333-----------33333333-----------3333------------- VSLKGEGMSQAATICRSNFKHMYWTMLQQLTHHSVNGCNLRPGDLLASGTISGSDPESFG -----------------1111----------1111-----2222-----------1111- SMLELSWKGTKAIDVGQGQTRTFLLDGDEVIITGHCQGDGYRVGFGQCAGKVLPAL ------%%%%-----iiii-----2222---------2222--------------- >HYDROPHOBIC PROTEIN FROM ; SWP:P24337; PDB:1HYP; 
PSCPDLSICLNILGGSLGTVDDCCALIGGLGDIEAIVCLCIQLRALGILNLNRNLQLILN -----3333--1111-1111-----------------------3333------------- SCGRSYPSNATCPRT --------------- >CELL DIVISION INHIBITOR (; SWP:O29562; PDB:1HYQA; VRTITVASGKGGTGKTTITANLGVALAQLGHDVTIVDADITMANLELILGMEGLPVTLQN -------------3333---------1111--------------------------3333 VLAGEARIDEAIYVGPGGVKVVPAGVSLEGLRKANPEKLEDVLTQIMESTDILLLDAPAG 1111--3333----2222------------------------------------------ LERSAVIAIAAAQELLLVVNPEISSITDGLKTKIVAERLGTKVLGVVVNRITTLGIEMAK -3333------------------------------------------------------- NEIEAILEAKVIGLIPEDPEVRRAAAYGKPVVLRSPNSPAARAIVELANYIA -----------------3333--------3333-1111-------------- >HIV-1 REVERSE TRANSCRIPTA; SWP:NA; PDB:1HYSD; QITLKESGPGIVQPSQPFRLTCTFSGFSLSTSGIGVTWIRQPSGKGLEWLATIWWDDDNR -----------------------------------------------------1111--- YNPSLKSRLTVSKDTSNNQAFLNMMTVETADTAIYYCAQSAITSVTDSAMDHWGQGTSVT -3333--------3333-----------3333---------------------------- VSSAATTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL --------------------------------------------%%%%-2222------- QSDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKI %%%%---------3333------------1111------- >ALKYL HYDROPEROXIDE REDUC; SWP:P19480; PDB:1HYUA; MLDTNMKTQLRAYLEKLTKPVELIATLDDSAKSAEIKELLAEIAELSDKVTFKEDNTLPV -------------1111-----------------------------1111---------- RKPSFLITNPGSQQGPRFAGSPLGHEFTSLVLALLWTGGHPSKEAQSLLEQIRDIDGDFE --------2222----------!!!!-------------------------1111----- FETYYSLSCHNCPDVVQALNLMAVLNPRIKHTAIDGGTFQNEITERNVMGVPAVFVNGKE -----1111----------------1111---------3333-1111--------iiii- FGQGRMTLTEIVAKVDTGAEKRAAEALNKRDAYDVLIVGSGPAGAAAAVYSARKGIRTGL --------------------------1111---------------------1111----- MGERFGGQVLDTVDIENYISVPKTEGQKLAGALKAHVSDYDVDVIDSQSASKLVPAATEG -----!!!!--------2222------------------------------------222 GLHQIETASGAVLKARSIIIATGAKWRNMNVPGEDQYRTKGVTYCPHCDGPLFKGKRVAV 2-----1111--------------------22221111----------33332222---- 
IGGGNSGVEAAIDLAGIVEHVTLLEFAPEMKADQVLQDKVRSLKNVDIILNAQTTEVKGD ---3333-----------------------------------1111-------------- GSKVVGLEYRDRVSGDIHSVALAGIFVQIGLLPNTHWLEGALERNRMGEIIIDAKCETSV ----------------------------------3333------1111----1111---2 KGVFAAGDCTTVPYKQIIIATGEGAKASLSAFDYLIRTKIA 222---1111------------------------------- >HEAD-TO-TAIL JOINING PROT; SWP:P03727; PDB:1HYWA; MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGMTQ --3333--------3333--------------------------------1111---- >MALT REGULATORY PROTEIN; SWP:P06993; PDB:1HZ4A; EIKDIREDTMHAEFNALRAQVAINDGNPDEAERLAKLALEELPPGWFYSRIVATSVLGEV ---3333---------------1111------------1111------------------ LHCKGELTRSLALMQQTEQMARQHDVWHYALWSLIQQSEILFAQGFLQTAWETQEKAFQL ---------------------1111----------------------------------- INEQHLEQLPMHEFLVRIRAQLLWAWARLDEAEASARSGIEVLSSYQPQQQLQCLAMLIQ -----3333-------------------------------1111--3333---------- CSLARGDLDNARSQLNRLENLLGNGKYHSDWISNANKVRVIYWQMTGDKAAAANWLRHTA --------------------1111-------------------1111------------- KPEFANNHFLQGQWRNIARAQILLGEFEPAEIVLEELNENARSLRLMSDLNRNLLLLNQL ---%%%%1111----------1111----------------------------------- YWQAGRKSDAQRVLLDALKLANRTGFISHFVIEGEAMAQQLRQLIQLNTLPELEQHRAQR -1111---------------------3333------------------------------ ILREIN ------ >PROTEIN L; SWP:Q51912; PDB:1HZ6A; AMEEVTIKANLIFANGSTQTAEFKGTFEKATSEAYAYADTLKKDNGEWTVDVADKGYTLN ------------1111------------------------------------%%%%---- IKFAG ----- >AU-BINDING PROTEIN/ENOYL-; SWP:Q13825; PDB:1HZDA; EDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKSDKKVRTIIIRSEV ---------!!!!---------3333----3333----------1111-----------2 PGIFCAGADLKERAKMSSSEVGPFVSKIRAVINDIANLPVPTIAAIDGLALGGGLELALA 222-----333311113333--------------1111-------------------333 CDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGL 3-----1111----3333-------3333------------------------------- ISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYA -------1111-------------1111--------------1111-------------- 
QTIPTKDRLEGLLAFKEKRPPRYKGE -1111---------1111-------- >COMPLEMENT FACTOR C4A; SWP:P0C0L4; PDB:1HZFA; SPGGVASLLRLPRGCGEQTIYLAPTLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRI ----3333------3333------------------3333-1111--------------- QQFRKADGSYAAWLSRDSSTWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGS 11111111----1111---------------------------------------1111- FQDPCPVLDRSQGGLVGNDETVALTAFVTIALHHGLAVFQDEGAEPLKQRVEASISKASS -----------!!!!------------------3333----------------------- FLGEKASAGLLGAHAAAITAYALTLTKAPADLRGVAHNNLAAQETGDNLYWGQAPALWIE -----------------------1111---------------------------3333-- TTAYALLHLLLHEGKAEADQASAWLTRQGSFQGGFRSTQDTVIALDALSAYWIASHT -------3333-------------------2222---3333-----------1111- >IMMUNOGLOBULIN HEAVY CHAI; SWP:NA; PDB:1HZHH; QVQLVQSGAEVKKPGASVKVSCQASGYRFSNFVIHWVRQAPGQRFEWMGWINPYNGNKEF ---------------------------1111--------2222----------------- SAKFQDRVTFTADTSANTAYMELRSL 1111--------3333---------- >IMMUNOGLOBULIN HEAVY CHAI; SWP:NA; PDB:1HZHL; EIVLTQSPGTLSLSPGERATFSCRSSHSIRSRRVAWYQHKPGQAPRLVIHGVSNRASGIS -------------2222----------------------2222------------2222- DRFSGSGSGTDFTLTITRVEPEDFALYYCQVYGASSYTFGQGTKLERKRTVAAPSVFIFP -------!!!!--------1111---------%%%%------------------------ PSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTL -3333-------------------------%%%%-------------------------- TLSKADYEKHKVYACEVTHQGLRSPVTKSFNRGEC ----3333----------1111------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q16828; PDB:1HZMA; MIDTLRPVPFASEMAISKTVAWLNEQLELGNERLLLMDCRPQELYESSHIESAINVAIPG -----------------------------------------33333333----------% IMLRRLQKGNLPVRALFTRGEDRDRFTRRCGTDTVVLYDESSSDWNENTGGESLLGLLLK %%%----------------------3333------------------------------- KLKDEGCRAFYLEGGFSKFQAEFSLHCETNLDGS --3333---------------------------- >CHOLECYSTOKININ TYPE A RE; SWP:P32238; PDB:1HZNA; IFSANAWRAYDTASAERRLSGTPISFILL 3333--------3333------3333--- >BETA-LACTAMASE; SWP:P52664; PDB:1HZOA; NTIEEQLNTLEKYSQGRLGVALINTEDNSQITYRGEERFAMASTSKVMAVAAVLKASEKQ 
---------------------------------1111----------------------2 AGLLDKNITIKKSDLVAYSPITEKHLTTGMTLAELSAATLQYSDNTAMNKILDYLGGPAK 222-------3333------3333--------------------------------3333 VTQFARSINDVTYRLDRKEPELNTAIHGDPRDTTSPIAMAKSLQALTLGDALGQSQRQQL -----1111----------------2222------------------------------- VTWLKGNTTGDNSIKAGLPKHWVVGDKTGSGDYGTTNDIAVIWPENHAPLILVVYFTQQE ---1111--11113333-1111---------%%%%------------------------1 QNAKYRKDIIAKAAEIVTKEISNS 111----------------1111- >ISOPENTENYL DIPHOSPHATE D; SWP:Q46822; PDB:1HZTA; LHLAFSSWLFNAKGQLLVTRRALSKKAWPGVWTNSVCGHPQLGESNEDAVIRRCRYELGV ----------1111-------1111--2222----------------------------- EITPPESIYPDFRYRATDPSGIVENEVCPVFAARTTSALQINDDEVMDYQWCDLADVLHG --------1111-----1111--------------------1111--------------- IDATPWAFSPWMVMQATNREARKRLSAFTQLKL ---3333-----------------3333----- >BETA-KETOACYL [ACP] REDUC; SWP:P25716; PDB:1I01A; MNFEGKIALVTGASRGIGRAIAETLAARGAKVIGTATSENGAQAISDYLGANGKGLMLNV --2222-------------------1111-------------------!!!!------33 TDPASIESVLEKIRAEFGEVDILVNNANLLMRMKDEEWNDIIETNLSSVFRLSKAVMRAM 33---------------------------------------------------------- MKKRHGRIITIGSVVGTMAAAKAGLIGFSKSLAREVASRGITVNVVAPGFIETDMTRALS -----------------------------------3333-------------3333---- DDQRAGILAQVPAGRLGGAQEIANAVAFLASDEAAYITGETLHVNGGM ----------3333---3333------------1111-------iiii >EPIDERMAL GROWTH FACTOR R; SWP:Q08509; PDB:1I07A; KKYAKSKYDFVARNSSELSVMKDDVLEILDDRRQWWKVRNASGDSGFVPNNILDIMRTP -------------1111---2222----%%%%-------1111---------------- >DELTA CRYSTALLIN I; SWP:Q7SIE0; PDB:1I0AA; GRFVGSVDPIMEILSSSISTEQRLTEVDIQASMAYAKALEKASILTKTELEKILSGLEKI ---------3333---3333-------------------1111----------------- SEESSKGVLVMTQSDEDIQTAIERRLKELIGDIAGKLQTGRSRNEQVVTDLKLLLKSSIS -----------3333---------------33333333---3333--------------- VISTHLLQLIKTLVERAAIEIDIIMPGYTHLQKALPIRWSQFLLSHAVALTRDSERLGEV -------------------1111-----%%%%-----3333------------------3 KKRITVLPLGSGVLAGNPLEIDRELLRSELDMTSITLNSIDAISERDFVVELISVATLLM 
333------------------3333--------------------3333----------- IHLSKLAEDLIIFSTTEFGFVTLSDAYSTGSSLLPQKKNPDSLELIRSKAGRVFGRLAAI -----------------------3333---1111-------------------------- LMVLKGIPSTFSKDLQEDKEAVLDVVDTLTAVLQVATGVISTLQINKENMEKALTPELLS ---2222----3333----------------------------------3333-3333-- TDLALYLVRKGMPIRQAQTASGKAVHLAETKGITINNLTLEDLKSISPLFASDVSQVFSV -------1111----------------------3333-33333333333311113333-- VNSVEQYTAVGGTAKSSVTAQIEQLRELLKKQK --1111--2222--------------------- >CREATINE KINASE,M CHAIN; SWP:P06732; PDB:1I0EA; NKFKLNYKPEEEYPDLSKHNNHMAKVLTLELYKKLRDKETPSGFTVDDVIQTGVDNPGHP ---11113333---------3333---3333--------1111-3333------------ FIMTVGCVAGDEESYEVFKELFDPIISDRHGGYKPTDKHKTDLNHENLKGGDDLDPNYVL ----------3333---33333333----%%%%----------3333-------1111-- SSRVRTGRSIKGYTLPPHCSRGERRAVEKLSVEALNSLTGEFKGKYYPLKSMTEKEQQQL --------------1111----------------3333!!!!-----3333--------- IDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDNKSFLVWVNEEDHLRVISMEKGGNMK 1111---------3333---------------1111------------------------ EVFRRFCVGLQKIEEIFKKAGHPFMWNQHLGYVLTCPSNLGTGLRGGVHVKLAHLSKHPK -----------------1111--------------1111--------------3333111 FEEILTRLRLQKRGTSVFDVSNADRLGSSEVEQVQLVVDGVKLMVEMEKKLEKGQSIDDM 1-----------------------------------------------3333-------- IPAQK ----- >HYPOXANTHINE-GUANINE PHOS; SWP:Q4DRC4; PDB:1I0IA; YEFAEKILFTEEEIRTRIKEVAKRIADDYKGKGLRPYVNPLVLISVLKGSFMFTADLCRA 1111----------------------1111-----------------1111--------- LCDFNVPVRMEFICVSSYGEGLTSSGQVRMLLDTRHSIEGHHVLIVEDIVQTALTLNYLY -1111--------------------------------2222------------------- HMYFTRRPASLKTVVLLDKREGRRVPFSADYVVANIPNAFVIGYGLDYDDTYRELRDIVV ------------------1111--------------------iiii-%%%%1111----- LRPEVYA -3333-- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1I0RA; MDVEAFYKISYGLYIVTSESNGRKCGQIANTVFQLTSKPVQIAVCLNKENDTHNAVKESG -----1111----------iiii-----------------------1111---------- AFGVSVLELETPMEFIGRFGFRKSSEFEKFDGVEYKTGKTGVPLVTQHAVAVIEAKVVKE 
-------11113333-------1111-1111------1111----2222----------- CDVGTHTLFVGEAVDAEVLKDAEVLTYADYHLMKKGKTPRT ----------------------------------------- >GUANYL-SPECIFIC RIBONUCLE; SWP:P00651; PDB:1I0VA; ACDYTCGSNCYSSSDVSTAQAAGYKLHEDGETVGSNSYPHKYNNYEGFDFSVSSPYYEWP -----!!!!------------------------1111------3333------------- ILSSGDVYSGGSPGADRVVFNENNQLAGVITHTGASGNNFVECT -3333---------------1111-------2222!!!!----- >L-LACTATE DEHYDROGENASE H; SWP:P07195; PDB:1I0ZA; ATLKEKLIAPVAEEEATVPNNKITVVGVGQVGMACAISILGKSLADELALVDVLEDKLKG --3333----------------------3333-------1111----------------- EMMDLQHGSLFLQTPKIVADKDYSVTANSKIVVVTAGVRQQEGESRLNLVQRNVNVFKFI -------3333----------3333---------------22223333------------ IPQIVKYSPDCIIIVVSNPVDILTYVTWKLSGLPKHRVIGSGCNLDSARFRYLMAEKLGI -------1111----------------------1111---!!!!---------------- HPSSCHGWILGEHGDSSVAVWSGVNVAGVSLQELNPEMGTDNDSENWKEVHKMVVESAYE 3333---------1111--3333--iiii3333-1111--------3333---------- VIKLKGYTNWAIGLSVADLIESMLKNLSRIHPVSTMVKGMYGIENEVFLSLPCILNARGL ----------------------1111----------2222---------------1111- TSVINQKLKDDEVAQLKKSADTLWDIQKDLKD -------------------------3333--- >L-LACTATE DEHYDROGENASE M; SWP:P00338; PDB:1I10A; ATLKDQLIYNLLKEEQTPQNKITVVGVGAVGMACAISILMKDLADELALVDVIEDKLKGE -3333----------------------3333-------1111------------------ MMDLQHGSLFLRTPKIVSGKDYNVTANSKLVIITAGARQQEGESRLNLVQRNVNIFKFII ------3333----------3333---------------22223333------------- PNVVKYSPNCKLLIVSNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGVH ------1111-----------------------1111--!!!!----------------3 PLSCHGWVLGEHGDSSVPVWSGMNVAGVSLKTLHPDLGTDKDKEQWKEVHKQVVESAYEV 333---------1111--3333--iiii3333---2222--1111--------------- IKLKGYTSWAIGLSVADLAESIMKNLRRVHPVSTMIKGLYGIKDDVFLSVPCILGQNGIS -----------------------------------2222---------------1111-- DLVKVTLTSEEEARLKKSADTLWGIQKELQF ------------------------3333--- >TRANSCRIPTION FACTOR SOX-; SWP:P35710; PDB:1I11A; PHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMTNLEKQPYYEEQARLS 
--------------------33331111-------------------------------- KQHLEKYPDY --3333---- >GLUCOSAMINE-PHOSPHATE N-A; SWP:P43577; PDB:1I12A; LPDGFYIRRMEEGDLEQVTETLKVLTTVGTITPESFCKLIKYWNEATVWNDKKIMQYNPM -2222-----1111---------------------------------------------- VIVDKRTETVAATGNIIIERKIIHELGLCGHIEDIAVNSKYQGQGLGKLLIDQLVTIGFD ----------------------%%%%-----------3333-----------------11 YGCYKIILDCDEKNVKFYEKCGFSNAGVEMQIRK 11--------3333----1111------------ >PRION-LIKE PROTEIN; SWP:Q9QUG3; PDB:1I17A; RVAENRPGAFIKQGRKLDIDFGAEGNRYYAANYWQFPDGIYYEGCSEANVTKEMLVTSCV -------------------------------3333-----------33333333------ NATQAANQAEFSREKQDSKLHQRVLWRLIKEICSAKHCDFWLERGAA -------------3333--------------------3333------ >CHOLESTEROL OXIDASE; SWP:Q7SID9; PDB:1I19A; VAPLPTPPNFPNDIALFQQAYQNWSKEIMLDATWVCSPKTPQDVVRLANWAHEHDYKIRP ----------1111--------3333-------------3333----------------- RGAMHGWTPLTVEKGANVEKVILADTMTHLNGITVNTGGPVATVTAGAGASIEAIVTELQ ------------2222-------------------------------------------- KHDLGWANLPAPGVLSIGGALAVNAHGAALPAVGQTTLPGHTYGSLSNLVTELTAVVWNG -----------3333----------------2222---------3333------------ TTYALETYQRNDPRITPLLTNLGRCFLTSVTMQAGPNFRQRCQSYTDIPWRELFAPKGAD --------1111--3333--iiii------------------------3333---2222- GRTFEKFVAESGGAEAIWYPFTEKPWMKVWTVSPSLVGKPPQAREVSGPYNYIFSDNLPE -----------------------------------22221111--------3333----- PITDMIGAINAGNPGIAPLFGPAMYEITKLGLAATNANDIWGWSKDVQFYIKATTLRLTE --------11111111--------------------------3333-----1111----- GGGAVVTSRANIATVINDFTEWFHERIEFYRAKGEFPLNGPVEIRCCGLDQAADVKVPSV -------1111-------------------1111----------------3333------ GPPTISATRPRPDHPDWDVAIWLNVLGVPGTPGMFEFYREMEQWMRSHYNNDDATFRPEW -----1111-1111-------------2222-------------------1111----33 SKGWAFGPDPYTDNDIVTNKMRATYIEGVPTTENWDTARARYNQIDPHRVFTNGFMDKLL 33-----------------------22221111------------1111---3333---- P - >IG GAMMA-2A CHAIN C REGIO; SWP:P20760; PDB:1I1CA; SVFIFPPKTKDVLGGGLTPKVTCVVVDISQNDPEVRFSWFIDDVEVHTAQTHAPEKQSNS 
-------3333-----------------3333--------%%%%---------------- TLRSVSELPIVERDWLNGKTFKCKVNSGAFPAPIEKSISKPEGTPRGPQVYTMAPPKEEM ----------33331111-------------------------------------3333- TQSQVSITCMVKGFYPPDIYTEWKMNGQPQENYKNTPPTMDTDGSYFLYSKLNVKKETWQ ------------------------iiii------------1111----------3333-- QGNTFTCSVLHEGLENEHTEKSLSH ----------1111%%%%------- >TRANSCRIPTIONAL REGULATOR; SWP:P42180; PDB:1I1GA; IDERDKIILEILEKDARTPFTEIAKKLGISETAVRKRVKALEEKGIIEGYTIKINPKKLG --3333------------3333--1111--------------------------3333-- YSLVTITGVDTKPEKLFEVAEKLKEYDFVKELYLSSGDHMIMAVIWAKDGEDLAEIISNK -------------------------1111-------------------3333-------3 IGKIEGVTKVCPAIILEKLK 333----------------- >MELANOMA DERIVED GROWTH R; SWP:Q16674; PDB:1I1JA; GPMPKLADRKLCADQECSHPISMAVALQDYMAPDCRFLTIHRGQVVYVFSKLKGRGRLFW -------------1111----------------3333---2222--------!!!!---- GGSVQGDYYGDLAARLGYFPSSIVREDQTLKPGKVDVKTDKWDFYC -------2222--------3333----------------1111--- >PROTEIN-L-ISOASPARTATE O-; SWP:P22061; PDB:1I1NA; WKSGGASHSELIHNLRKNGIIKTDKVFEVMLATDRSHYAKCNPYMDSPQSIGFQATISAP ---------------1111----------11113333----1111------%%%%---33 HMHAYALELLFDQLHEGAKALDVGSGSGILTACFARMVGCTGKVIGIDHIKELVDDSVNN 33--------33332222------!!!!----------1111------------------ VRKDDPTLLSSGRVQLVVGDGRMGYAEEAPYDAIHVGAAAPVVPQALIDQLKPGGRLILP --------1111-------3333-3333---------------333311112222----- VGPAGGNQMLEQYDKLQDGSIKMKPLMGVIYVPLTDKEKQWSRW --2222---------1111----------------3333----- >ANTHRANILATE SYNTHASE COM; SWP:P00898; PDB:1I1QA; KPTLELLTCDAAYRENPTALFHQVCGDRPATLLLESADIDSKDDLKSLLLVDSALRITAL ------------------------!!!!---------1111------------------! 
GDTVTIQALSDNGASLLPLLDTALPAGVENDVLPAGRVLRFPPVSPLLDENARLCSLSVF !!!------3333----3333---2222--------------------33331111-333 DAFRLLQGVVNIPTQEREAMFFGGLFAYDLVAGFEALPHLEAGNNCPDYCFYLAETLMVI 3----1111---1111----------33331111-------------------------- DHQKKSTRIQASLFTASDREKQRLNARLAYLSQQLTQPAPPLPVTPVPDMRCECNQSDDA -------------------------------3333-----------1111---------- FGAVVRQLQKAIRAGEIFQVVPSRRFSLPCPSPLAAYYVLKKSNPSPYMFFMQDNDFTLF -----------------------------------------------------1111--- GASPESSLKYDAASRQIEIYPIAGTRPRGRRADGTLDRDLDSRIELDMRTDHKELSEHLM -----------1111---------------1111-------------------------- LVDLARNDLARICTPGSRYVADLTKVDRYSYVMHLVSRVVGELRHDLDALHAYRACMNMG -------------2222-----------1111-----------1111-----------33 TLSGAPKVRAMQLIADAEGQRRGSYGGAVGYFTAHGDLDTCIVIRSALVENGIATVQAGA 33---------------------2222-----3333-------------iiii------- GIVLDSVPQSEADETRNKARAVLRAIATAHHA --11113333-----------------1111- >Anthranilate synthase com; SWP:P00905; PDB:1I1QB; ADILLLDNIDSFTWNLADQLRTNGHNVVIYRNHIPAQTLIDRLATMKNPVLMLSPGPGVP -----------3333-----1111------------------1111------------33 SEAGCMPELLTRLRGKLPIIGICLGHQAIVEAYGGYVGQILHGKATSIEHDGQAMFAGLA 33!!!!------2222--------------1111------------------!!!!---- NPLPVARYHSSNVPAGLTINAHFNGMVMAVRHDADRVCGFQFHPESILTTQGARLLEQTL --------------------------------1111------1111--1111-------- AWAQQK -1111- >Interleukin-6 homolog [Fr; SWP:Q98823; PDB:1I1RB; EFEKDLLIQRLNWMLWVIDECFRDLCYRTGICKGILEPAAIFHLKLPAINDTDHCGLIGF -------------------------------2222--------------3333------- NETSCLKKLADGFFEFEVLFKFLTTEFGKSVINVDVMELLTKTLGWDIQEELNKLTKTHY --------------------------11112222-3333------------3333----- SPPKFDRGLLGRLQGLKYWVRHFASFYVLSAMEKFAGQAVRVLDSIP -----------1111--3333---------------------1111- >ENDO-1,4-BETA-XYLANASE; SWP:P23360; PDB:1I1WA; AAQSVDQLIKARGKVYFGVATDQNRLTTGKNAAIIQANFGQVTPENSMKWDATEPSQGNF ---------1111--------3333--------------------11113333--2222- NFAGADYLVNWAQQNGKLIRGHTLVWHSQLPSWVSSITDKNTLTNVMKNHITTLMTRYKG 
------------1111--------------3333---------------------1111- KIRAWDVVNEAFNEDGSLRQTVFLNVIGEDYIPIAFQTARAADPNAKLYINDYNLDSASY ------------1111-----------3333-----------3333-------------- PKTQAIVNRVKKWRAAGVPIDGIGSQTHLSAGQGASVLQALPLLASAGTPEVAITELDVA -------------1111------------2222-------------------------22 GASSTDYVNVVNACLNVSSCVGITVWGVADPDSWRASTTPLLFDGNFNPKPAYNAIVQNL 22--------------1111--------3333--3333-----1111------------- QQ -- >SULFOLIPID BIOSYNTHESIS P; SWP:O48917; PDB:1I24A; GSRVMVIGGDGYCGWATALHLSKKNYEVCIVDNLVRRLFDHQLGLESLTPIASIHDRISR -------1111----------------------3333----------------------- WKALTGKSIELYVGDICDFEFLAESFKSFEPDSVVHFGEQRSAPYSMIDRSRAVYTQHNN --------------1111-----------------------3333--------------- VIGTLNVLFAIKEFGEECHLVKLGTMGEYGTPNIDIEEGYITITHNGRTDTLPYPKQASS --------------1111-------3333---------------iiii-----------3 FYHLSKVHDSHNIAFTCKAWGIRATDLNQGVVYGVKTDETEMHEELRNRLDYDAVFGTAL 333---------------------------------3333--1111-------------- NRFCVQAAVGHPLTVYGKGGQTRGYLDIRDTVQCVEIAIANPAKAGEFRVFNQFTEQFSV ---------------!!!!-------3333-------------2222------------- NELASLVTKAGSKLGLDVKKMTVPNPRVEAEEHYYNAKHTKLMELGLEPHYLSDSLLDSL ---------3333-----------------------------1111-------------- LNFAVQFKDRVDTKQIMPSVSWKKIGVKTKSMT ------3333-3333-----3333--------- >HUWENTOXIN-II; SWP:P82959; PDB:1I25A; LFECSFSCEIEKEGDKPCKKKKCKGGWKCKFNMCVKV -----------------------1111---------- >PTU-1; SWP:NA; PDB:1I26A; AEKDCIAPGAPCFGTDKPCCNPRAWCSSYANKCL ------2222------------------------ >TRANSCRIPTION FACTOR IIF; SWP:P35269; PDB:1I27A; GPLGSGDVQVTEDAVRRYLTRKPMTTKDLLKKFQTKKTGLSSEQTVNVLAQILKRLNPER 2222-------------------------11113333----------------------- KMINDKMHFSLKE --%%%%------- >50S RIBOSOMAL PROTEIN L1P; SWP:P54050; PDB:1I2AA; MDREALLQAVKEARELAKPRNFTQSFEFIATLKEIDMRKPENRIKTEVVLPHGRGKEAKI --------------------------------------3333-------1111------- AVIGTGDLAKQAEELGLTVIRKEEIEELGKNKRKLRKIAKAHDFFIAQADLMPLIGRYMG ----!!!!----1111----3333--------------1111-----1111-------33 
VILGPRGKMPKPVPANANIKPLVERLKKTVVINTRDKPYFQVLVGNEKMTDEQIVDNIEA 333333-------1111--------1111----!!!!--------1111----------- VLNVVAKKYEKGLYHIKDAYVKLTMGPAVKVK --------11111111------1111------ >4-AMINO-4-DEOXYCHORISMATE; SWP:P28305; PDB:1I2KA; MFLINGHKQESLAVSDRATQFGDGCFTTARVIDGKVSLLSAHIQRLQDACQRLMISCDFW ---iiii-----1111---------------iiii---------------1111------ PQLEQEMKTLAAEQQNGVLKVVISRGSGGRGYSTLNSGPATRILSVTAYPAHYDRLRNEG -----------------------------!!!!2222----------------------- ITLALSPVRLGRNPHLAGIKHLNRLEQVLIRSHLEQTNADEALVLDSEGWVTECCAANLF ------------3333-----------------------------1111----------- WRKGNVVYTPRLDQAGVNGIMRQFCIRLLAQSSYQLVEVQASLEESLQADEMVICNALMP --!!!!-----------------------------------33331111------1111- VMPVCACGDVSFSSATLYEYLAPLCERPN ------!!!!------------------- >GTP-BINDING NUCLEAR PROTE; SWP:P17080; PDB:1I2MA; QVQFKLVLVGDGGTGKTTFVKRHLKKYVATLGVEVHPLVFHTNRGPIKFNVWDTAGQEKF ----------2222------1111-----2222----------------------3333- GGLRDGYYIQAQCAIIMFDVTSRVTYKNVPNWHRDLVRVCENIPIVLCGNKVDIKDRKVK --!!!!-2222-------1111-1111--------------------------------- AKSIVFHRKKNLQYYDISAKSNYNFEKPFLWLARKLIGDPNLEFV --3333-------------%%%%1111-----------1111--- >BETA-LACTAMASE; SWP:P00808; PDB:1I2SA; DDFAKLEEQFDAKLGIFALDTGTNRTVTYRPDERFAFASTIKALTVGVLLQQKSIEDLNQ -----------------------------1111---!!!!-------------3333--- RITYTRDDLVNYNPITEKHVDTGMTLKELADASLRYSDNTAQNLILKQIGGPESLKKELR ----3333------33331111------------------------1111---------1 KIGDEVTNPERFEPELNEVNPGETQDTSTARALATSLQAFALEDKLPSEKRELLIDWMKR 111----------------2222----------------------------------111 NTTGDALIRAGVPEGWEVADKTGAGSYGTRNDIAIIWPPKGDPVVLAVLSSRDKKDAKYD 1--11111111--------------%%%%------------------------1111--- DKLIAEATKVVLKAL --------------- >HYD PROTEIN; SWP:O95071; PDB:1I2TA; HRQALGERLYPRVQAMQPAFASKITGMLLELSPAQLLLLLASEDSLRARVDEAMELIIAH ----------------3333-------1111----------------------------- G - >DEFENSIN HELIOMICIN; SWP:P81544; PDB:1I2VA; DKLIGSCVWGAVNYTSDCNGECLLRGYKGGHCGSFANVNCWCET 
----------------------1111------------------ >CLATHRIN COAT ASSEMBLY PR; SWP:P20172; PDB:1I31A; IGWRREGIKYRRNELFLDVLESVNLLMSPQGQVLSAHVSGRVVMKSYLSGMPECKFGMND ---------------------------3333----------------------------- KIKQSIAIDDCTFHQCVRLSERSISFIPPDGEFELMRYRTTKDIILPFRVIPLVREVGRT -------------3333------------------------------------------- KLEVKVVIKSNFKPSLLAQKIEVRIPTPLNTSGVQVICMKGKAKYKASENAIVWKIKRMA ------------1111-----------1111--------------3333----------- GMKESQISAEIELLPTNDKKKWARPPISMNFEVPFAPSGLKVRYLKVFEPKLNYSDHDVI -----------------------------------3333---------------3333-- KWVRYIGRSGIYETRC ---------------- >GLYCERALDEHYDE 3-PHOSPHAT; SWP:Q27890; PDB:1I32A; APIKVGINGFGRIGRMVFQAICDQGLIGTEIDVVAVVDMSTNAEYFAYQMKHDTVHGRPK -----------------------------------------3333--------------- YTVEAVKSSPSVETADVLVVNGHRIKCVKAQRNPADLPWGKLGVDYVIESTGLFTDKLKA --------------------------------3333-3333--------------3333- EGHIKGGAKKVVISAPASGGAKTIVMGVNQHEYSPASHHVVSNASCTTNCLAPIVHVLTK ---1111-----------------2222-----1111-------3333------------ ENFGIETGLMTTIHSYTATQKTVDGVSLKDWRGGRAAAVNIIPSTTGAAKAVGMVIPSTK ----------------3333------33333333-1111-------3333-11113333- GKLTGMSFRVPTPDVSVVDLTFRATRDTSIQEIDKAIKKAAQTYMKGILGFTDEELVSAD ------------------------------------------1111----------3333 FINDNRSSVYDSKATLQNNLPGEKRFFKVVSWYDNEWAYSHRVVDLVRYMAAKDAASS 2222--------------------------------------------------1111 >PROTEIN KINASE BYR2; SWP:P28829; PDB:1I35A; CILRFIACNGQTRAVQSRGDYQKTLAIALKKFSLEDASKFIVCVSQSSRIKLITEEEFKQ ------------------------------------3333-------------------- ICFNSSSPERDRLIIVPKEKPCPSFEDLRRSWEIE --3333----------3333--------------- >CONSERVED HYPOTHETICAL PR; SWP:O27779; PDB:1I36A; LRVGFIGFGEVAQTLASRLRSRGVEVVTSLEGRSPSTIERARTVGVTETSEEDVYSCPVV -------------------1111------2222--------------------------- ISAVTPGVALGAARRAGRHVRGIYVDINNISPETVRMASSLIEKGGFVDAAIMGSVRRKG ----1111-------1111-------------------3333------------3333-1 ADIRIIASGRDAEEFMKLNRYGLNIEVRGREPGDASAIKMLRSSYTKGVSALLWETLTAA 
111-----111133333333---------------------------------------- HRLGLEEDVLEMLEYTEGNDFRESAISRLKSSCIHARRRYEEMKEVQDMLAEVIDPVMPT 1111----------------------------------------------------3333 CIIRIFDKLKDARLQGCA ----------3333---- >RIBONUCLEASE HII; SWP:O29634; PDB:1I39A; MKAGIDEAGKGCVIGPLVVAGVACSDEDRLRKLGVKDSKKLSQGRREELAEEIRKICRTE --------1111-----------------333333333333-----------3333---- VLKVSPENLDERMAAKTINEILKECYAEIILRLKPEIAYVDSPDVIPERLSRELEEITGL ----------------------------------------------3333---------- RVVAEHKADEKYPLVAAASIIAKVEREREIERLKEKFGDFGSGYASDPRTREVLKEWIAS ------3333---------------------------------3333----------333 GRIPSCVRMRWKTVSNLRQK 3--11111111--------- >RESPONSE REGULATOR RCP1; SWP:Q55169; PDB:1I3CA; NPPKVILLVEDSKADSRLVQEVLKTSTIDHELIILRDGLAAAFLQQQGEYENSPRPNLIL ---------------------3333-----------3333--1111!!!!---------- LDLNLPKKDGREVLAEIKQNPDLKRIPVVVLTTSHNEDDVIASYELHVNCYLTKSRNLKD -------------------1111------------3333----1111------------- LFKVQGIESFWLETVTLPAAPG ---------------------- >HEMOGLOBIN GAMMA CHAINS; SWP:P02096; PDB:1I3DA; GHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPKV --------------11113333-------------------1111--------------- KAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFGK ----------------1111------------------3333---------------!!! 
EFTPEVQASWQKMVTAVASALSSRYH !--------------------3333- >INTRON-ASSOCIATED ENDONUC; SWP:P13299; PDB:1I3JA; KFCKCGVRIQTSAYTCSKCRNRSGENNSFFNHKHSDITKSKISEKMKGKKPSNIKKISCD --3333---1111--3333---!!!!1111---------------2222-1111------ GVIFDCAADAARHFKISSGLVTYRVKSDKWNWFYIN ----------------3333---------------- >EARLY 35 KDA PROTEIN; SWP:P08160; PDB:1I3PA; PVEIDVSQTIIRDCQVDKQTRELVYINKIMNTQLTKPVLMMFNISGPIRSVTRKNNNLRD --------------------------3333----------------------------33 RIKSKPDEQFDQLEKDYDSIKYFKDEHYSVSCQNGSVLKSKFAKILKSHDYTDKKSIEAY 33-------1111-----------------------------3333-------------- EKYCLPKLVDERNDYYVAVCVLKPGFENGSNQVLSFEYNPIGNKVIVPFAHEINDTGLYE ---3333----------------3333--------------------------1111--- YDVVAYVDSVQFDGEQFEEFVQSLILPSSFKNSEKVLYYNEAKSMIYKALEFTTKYNWKI --------------------1111------------------------------------ FCNGFIYDKKSKVLYVKLHNVTSALNKNVILNTI ---------------------------------- >ANTIBODY VHH LAMA DOMAIN; SWP:NA; PDB:1I3UA; VQLQESGGGLVQAGDSLKLSCEASGDSIGTYVIGWFRQAPGKERIYLATIGRNLVGPSDF -----------2222-----------3333--------2222------------------ YTRYADSVKGRFAVSRDNAKNTVNLQMNSLKPEDTAVYYCAAKTTTWGGNDPNNWNYWGQ -------2222------1111---------3333-------------33331111----- GTQVTV ------ >EWS/FLI1 ACTIVATED TRANSC; SWP:O35324; PDB:1I3ZA; MDLPYYHGCLTKRECEALLLKGGVDGNFLIRDSESVPGALCLCVSFKKLVYSYRIFREKH --1111-----------------------------2222------%%%%----------- GYYRIETDAHTPRTIFPNLQELVSKYGKPGQGLVVHLSNPIMR -------1111-------------------------------- >ARFAPTIN 2; SWP:P53365; PDB:1I49A; SRTVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPEL --------------------------------------------------------3333 QEEFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAY ------------------------------------------------------------ RTDLEELSLGPRDAGTRGRLESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQL ------3333-------------------------------------------------- LLFHNAVSAYFAGNQKQLEQT --------------------- >ANNEXIN IV; SWP:P13214; PDB:1I4AA; 
ASGFNAAEDAQTLRKAMKGLGTDEDAIINVLAYRSTAQRQEIRTAYKTTIGRDLMDDLKS --------------1111-----------1111--------------------------- ELSGNFEQVILGMMTPTVLYDVQELRKAMKGAGTDEGCLIEILASRTPEEIRRINQTYQL -----------1111-----------1111------------------------------ QYGRSLEDDIRSDTSFMFQRVLVSLSAGGRDESNYLDDALMRQDAQDLYEAGEKKWGTDE ------------------------1111--------------------3333-3333--- VKFLTVLCSRNRNHLLHVFDEYKRIAQKDIEQSIKSETSGSFEDALLAIVKCMRNKSAYF ------------------------------------------------------------ AERLYKSMKGLGTDDDTLIRVMVSRAEIDMLDIRANFKRLYGKSLYSFIKGDTSGDYRKV ------------------------1111-------------------------------- LLILCGGDD --------- >ARFAPTIN 2; SWP:P53365; PDB:1I4DA; SRTVDLELELQIELLRETKRKYESVLQLGRALTAHLYSLLQTQHALGDAFADLSQKSPEL --------------------------------------------------------3333 QEEFGYNAETQKLLCKNGETLLGAVNFFVSSINTLVTKTMEDTLMTVKQYEAARLEYDAY ------------------------------------------------------------ RTDLEELSESAQATFQAHRDKYEKLRGDVAIKLKFLEENKIKVMHKQLLLFHNAVSAYFA ---------3333----------------------------------------------1 GNQKQLEQ 111----- >BETA-2-MICROGLOBULIN; SWP:P01892; PDB:1I4FA; GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW -------------3333----------!!!!-----1111--------3333-------- DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG -------------------------------------------1111----------%%% KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ %-----1111------3333------------------1111---------------111 RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 1-------------------------------------iiii--2222------------ FQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWE ---------22223333-----1111--------- >50S RIBOSOMAL PROTEIN L22; SWP:P48286; PDB:1I4JA; MEAKAIARYVRISPRKVRLVVDLIRGKSLEEARNILRYTNKRGAYFVAKVLESAAANAVN -----------------------2222-------------2222---------------- NHDALEDRLYVKAAYVDEGPAVLPRARGRADIIKKRTSHITVILGEKHGK ----3333------------------------------------------ >PUTATIVE SNRNP SM-LIKE PR; SWP:O29386; PDB:1I4KA; 
PRPLDVLNRSLKSPVIVRLKGGREFRGTLDGYDIHMNLVLLDAEEIQNGEVVRKVGSVVI -3333-3333----------------------1111------------------------ RGDTVVFVSPAP 3333-------- >MAJOR PRION PROTEIN; SWP:P04156; PDB:1I4MA; GAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPNQVYYRPMDEYSNQNNFVHD ------2222------------------------3333--------3333---------- CVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYERESQAYY --------------3333------------------------------ >INDOLE-3-GLYCEROL PHOSPHA; SWP:Q56319; PDB:1I4NA; RRLWEIVEAKKKDILEIDGENLIVQRRNHRFLEVLSGKERVKIIAEFKKASPSAGDINAD ------------3333-1111------------------------------------111 ASLEDFIRMYDELADAISILTEKHYFKGDPAFVRAARNLTCRPILAKDFYIDTVQVKLAS 1---------------------------3333---1111------------3333----1 SVGADAILIIARILTAEQIKEIYEAAEELGMDSLVEVHSREDLEKVFSVIRPKIIGINTR 111------3333----------------------------------------------- DLDTFEIKKNVLWELLPLVPDDTVVVAESGIKDPRELKDLRGKVNAVLVGTSIMKAENPR ---------3333-3333-1111------------33332222----------------- RFLEEMRAWSE ----------- >CRUSTACYANIN; SWP:P80029; PDB:1I4UA; DKIPDFVVPGKCASVDRNKLWAEQTPNRNSYAGVWYQFALTNNPYQLIEKCVRNEYSFDG ---1111----------------11113333----------------------------- KQFVIESTGIAYDGNLLKRNGKLYPNPFGEPHLSIDYENSFAAPLVILETDYSNYACLYS ----------1111-----------1111------------------------------- CIDYNFGYHSDFSFIFSRSANLADQYVKKCEAAFKNINVDTTRFVKTVQGSSCPYDTQKT ----------------------------------1111-3333------11113333111 L 1 >MITOCHONDRIAL REPLICATION; SWP:P14908; PDB:1I4WA; PIPGIKDISKLKFFYGFKYLWNPTVYNKIFDKLDLTKTYKHPEELKVLDLYPGVGIQSAI -------------iiii----------------3333---3333---------------- FYNKYCPRQYSLLEKRSSLYKFLNAKFEGSPLQILKRDPYDWSTYSNLIDEERIFVPEVQ --------------------------2222----------3333---------------- SSDHINDKFLTVANVTGEGSEGLIMQWLSCIGNKNWLYRFGKVKMLLWMPSTTARKLLAR ------------------3333--------1111!!!!-----------------1111- PGMHSRSKCSVVREAFTDTKLIAISDANELKGFDSQCIEEWDPILFSAAEIWPTKGKPIA --11113333---------------33333333-------------3333---------- LVEMDPIDFDFDVDNWDYVTRHLMILKRTPLNTVMDSLGHGGQQYFNSRITDKDLLKKCP 
-------------------------11111111--1111------1111---1111--33 IDLTNDEFIYLTKLFMEWPFKP 33-------------------- >METHEMERYTHRIN; SWP:P02244; PDB:1I4YA; GFPIPDPYVWDPSFRTFYSIIDDEHKTLFNGIFHLAIDDNADNLGELRRCTGKHFLNEQV ----------3333---------------------------------------------- LMQASQYQFYDEHKKEHETFIHALDNWKGDVKWAKSWLVNHIKTIDFKYKGKI --1111------------------------------------------2222- >4-DIPHOSPHOCYTIDYL-2-C-ME; SWP:Q46893; PDB:1I52A; HLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSR -------------1111----3333------------------3333-------2222-3 FAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALS 333-----1111-------------------!!!!------1111-------------11 ETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNE 11------------------2222--------2222---------------------111 GATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTR 1----3333--1111--------1111----3333---------- >AZURIN; SWP:P00282; PDB:1I53A; AQCSVDIQGNDQMQFNTNAITVDKSCKQFTVNLSHPGNLPKNVMGHNWVLSTAADMQGVV ---------1111---------3333-------------1111--------3333----- TDGMASGLDKDYLKPDDSRVIAQTKLIGSGEKDSVTFDVSKLKEGEHYMFFCTFPGHSAL ------3333------3333-------2222----------------------2222--- MKGTLTLK -------- >CHEMOTAXIS PROTEIN CHEA; SWP:Q56310; PDB:1I58A; GSHMVPISFVFNRFPRMVRDLAKKMNKEVNFIMRGEDTELDRTFVEEIGEPLLHLLRNAI ---------3333---------1111--------1111---------------------- DHGIEPKEERIAKGKPPIGTLILSARHEGNNVVIEVEDDGRGIDKEKIIRKAIEKGLIDE -----3333-1111-------------!!!!---------------------------33 SKAATLSDQEILNFLFVPGFSTKEKVSEVSGRGVGMDVVKNVVESLNGSISIESEKDKGT 331111----------22223333--3333-------------1111-------2222-- KVTIRLPLT --------- >URACIL PHOSPHORIBOSYLTRAN; SWP:P70881; PDB:1I5EA; GKVYVFDHPLIQHKLTYIRDKNTGTKEFRELVDEVATLMAFEITRDLPLEEVEIETPVSK -------3333--------11113333----------------3333------------- ARAKVIAGKKLGVIPILRAGIGMVDGILKLIPAAKVGHIGLYRDPQTLKPVEYYVKLPSD -----------------3333--3333---3333-----------------------111 VEERDFIIVDPMLATGGSAVAAIDALKKRGAKSIKFMCLIAAPEGVKAVETAHPDVDIYI 
1-------------------------1111----------------------1111---- AALDERLNDHGYIVPGLGDAGDRLFGTK -------1111----------------- >TRYPAREDOXIN II; SWP:O77093; PDB:1I5GA; SGLKKFFPYSTNVLKGAAADIALPSLAGKTVFFYFSASWCPPSRAFTPQLIDFYKAHAEK --11111111----!!!!---33332222-------1111------------------11 KNFEVMLISWDESAEDFKDYYAKMPWLALPFEDRKGMEFLTTGFDVKSIPTLVGVEADSG 11------------------1111-----3333--------1111-----------1111 NIITTQARTMVVKDPEAKDFPWPN -------------1111------- >E3 ubiquitin-protein liga; SWP:Q62940; PDB:1I5HW; GSPVDSNDLGPLPPGWEERTHTDGRVFFINHNIKKTQWEDPRMQNVAITG ------------2222----1111---------------3333------- >APOLIPOPROTEIN CII; SWP:P02655; PDB:1I5JA; TFLTQVKESLSSYWESAKTAAQNLYEKTYLPAVDEKLRDLYSKSTAAMSTYTGIFTDQVL ----------------------------------11113333-------3333------1 SVLKGEE 111---- >CHEMOTAXIS PROTEIN CHEA; SWP:P09384; PDB:1I5NA; DISDFYQTFFDEADELLADEQHLLDLVPESPDAEQLNAIFRAAHSIKGGAGTFGFTILQE --------------------------3333-3333------------------------- TTHLENLLDEARRGEQLNTDIINLFLETKDIQEQLDAYKNSEEPDAASFEYICNALRQLA -------------------------------------1111------------------- LEAK ---- >PESTICIDIAL CRYSTAL PROTE; SWP:P21253; PDB:1I5PA; MNNVLNSGRTTICDAYNVVAHDPFSFEHKSLDTIQKEWMEWKRTDHSLYVAPVVGTVSSF -----------------------3333-----------------------3333------ LLKKVGSLIGKRILSELWGIIFPSGSTNLMQDILRETEQFLNQRLNTDTLARVNAELIGL ------3333-----------2222----------------------------------- QANIREFNQQVDNFLNPTQNPVPLSITSSVNTMQQLFLNRLPQFQIQGYQLLLLPLFAQA -----------------------------------------111122223333------- ANMHLSFIRDVILNADEWGISAATLRTYRDYLRNYTRDYSNYCINTYQTAFRGLNTRLHD ---------------1111-------------------------------1111------ MLEFRTYMFLNVFEYVSIWSLFKYQSLMVSSGANLYASGSGPQQTQSFTAQNWPFLYSLF -----------------1111---------------------------3333------11 QVNSNYILSGISGTRLSITFPNIGGLPGSTTTHSLNSARVNYSGGVSSGLIGATNLNHNF 111111------------------------------------------------------ NCSTVLPPLSTPFVRSWLDSGTDREGVATSTNWQTESFQTTLSLRCGAFSARGNSNYFPD ------1111-----------------------------------------------222 
YFIRNISGVPLVIRNEDLTRPLHYNQIRNIESPSGTPGGARAYLVSVHNRKNNIYAANEN 2------------3333---------------2222---------------------111 GTMIHLAPEDYTGFTISPIHATQVNNQTRTFISEKFGNQGDSLRFEQSNTTARYTLRGNG 1---------------3333-----3333------------------------------- NSYNLYLRVSSIGNSTIRVTINGRVYTVSNVNTTTNNDGVNDNGARFSDINIGNIVASDN --------------------iiii-----------------iiii--------------- TNVTLDINVTLNSGTPFDLMNIMFVPTNLPPLY ----------1111-----------1111---- >KINESIN-LIKE PROTEIN KIF1; SWP:P33173; PDB:1I5SA; ASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPKETPKSFSFDYSYWSHTSPE ----------------1111-------!!!!----1111------------------333 DINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLC 3---------------------------------2222----------2222-------- EDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLSK ------1111-1111-----------%%%%--1111------------------2222-- LAVTSYNDIQDLMDSGNKARTVAATNMNETSSRSHAVFNIIFTQKRHDAETNITTEKVSK -----------------33333333---3333---------------------------- ISLVDLAGSERAKGTRLKEGANINKSLTTLGKVISALAEMDIPYRDSVLTWLLRENLGGN -------1111-------------------------------1111------3333-333 SRTAMVAALSPADINYDETLSTLRYADRAK 3----------3333-----------3333 >IOLI PROTEIN; SWP:P42419; PDB:1I60A; KLCFNEATTLENSNLKLDLELCEKHGYDYIEIRTDKLPEYLKDHSLDDLAEYFQTHHIKP ----33331111----------1111-------------3333-3333------------ LALNALVFFNNRDEKGHNEIITEFKGETCKTLGVKYVVAVPLVTEQKIVKEEIKKSSVDV ------------------------------------------------3333-------- LTELSDIAEPYGVKIALEFVGHPQCTVNTFEQAYEIVNTVNRDNVGLVLDSFHFHAGSNI -------3333----------1111----------------1111-------------33 ESLKQADGKKIFIYHIDDTEDFPIGFLTDEDRVWPGQGAIDLDAHLSALKEIGFSDVVSV 3311113333-----------------3333--2222------------1111------- ELFRPEYYKLTAEEAIQTAKKTTVDVVSKYFS ---3333------------------3333--- >HYDROGEN PEROXIDE-INDUCIB; SWP:P11721; PDB:1I6AA; ETMSGPLHIGLIPTVGPYLLPHIIPMLHQTFPKLEMYLHEAQTHQLLAQLDSGKLDAVIL -----------1111---3333--------1111-------------------------- ALVKESEAFIEVPLFDEPMLLAIYEDHPWANREAVPMADLAGEKLLMLEDGHCLRDQAMG 
--1111-----------------1111-1111---33332222----------------- FCFEAGADEDTHFRATSLETLRNMVAAGSGITLLPALAVPPERKRDGVVYLPAIKPEPRR ---1111---1111--------------------3333------iiii------------ TIGLVYRPGSPLRSRYEQLAEAIRARMDGHFD ------2222-3333-----------2222-- >NEUROTOXIN V-5; SWP:P58779; PDB:1I6FA; KDGYPVDSKGCKLSCVANNYCDNQCKMKKASGGHCYAMSCYCEGLPENAKVSDSATNICG ------1111-------------------------%%%%------1111----------- >TRYPTOPHANYL-TRNA SYNTHET; SWP:P00953; PDB:1I6LA; KTIFSGIQPSGVITIGNYIGALRQFVELQHEYNCYFCIVDQHAITVWQDPHELRQNIRRL ---------------------------3333--------3333----------------- AALYLAVGIDPTQATLFIQSEVPAHAQAAWLQCIVYIGELERTQFKEKSAGKEAVSAGLL ---------1111----3333---------1111-3333---------2222----3333 TYPPLAADILLYNTDIVPVGEDQKQHIELTRDLAERFNKRYGELFTIPEARIPKVGARIS -3333---3333-------3333------------------------------------3 LVDPTKKSKSDPNPKAYITLLDDAKTIEKKIKSAVTDSEGTIRYDKEAKPGISNLLNIYS 333----1111-1111--1111----------------------3333------------ TLSGQSIEELERQYEGKGYGVFKADLAQVVIETLRPIQERYHHWESEELDRVLDEGAEKA -------------2222------------------------------------------- NRVASEVRKEQAGLGR ---------------- >CARBONIC ANHYDRASE; SWP:P61517; PDB:1I6PA; KDIDTLISNNALWSKMLVEEDPGFFEKLAQAQKPRFLWIGCSDSRVPAERLTGLEPGELF -3333---------3333--11113333------------3333--3333----2222-- VHRNVANLVIHTDLNCLSVVQYAVDVLEVEHIIICGHYGCGGVQAAVENPELGLINNWLL ---------1111-----------------------2222------------3333---- HIRDIWFKHSSLLGEMPQERRLDTLCELNVMEQVYNLGHSTIMQSAWKRGQKVTIHGWAY ----------3333--3333---------------------------------------- GIHDGLLRDLDVTATNRETLEQRYRHGISNLKLK 3333------------------------3333-- >30S RIBOSOMAL PROTEIN S8P; SWP:P54041; PDB:1I6UA; SLMDPLANALNHISNCERVGKKVVYIKPASKLIGRVLKVQDNGYIGEFEFIEDGRAGIFK ----------------3333-------------------1111----------------- VELIGKINKCGAIKPRFPVKKFGYEKFEKRYLPARDFGILIVSTTQGVSHEEAKKRGLGG ---------------------------------2222------1111------1111--- RLLAYVY ------- >BAG-FAMILY MOLECULAR CHAP; SWP:Q60739; PDB:1I6ZA; 
GSPEFMLIGEKSNPEEEVELKKLKDLEVSAEKIANHLQELNKELSGIQQGFLAKELQAEA ------------------------------------------------------------ LCKLDRKVKATIEQFMKILEEIDTMVLPEQFKDSRLKRKNLVKKVQVFLAECDTVEQYIC ---------------------1111----------------------------------- QETERLQSTNLALAE --------------- >APOLIPOPROTEIN(A); SWP:P08519; PDB:1I71A; DCYHGDGQSYRGSFSTTVTGRTCQSWSSMTPHWHQRTTEYYPNGGLTRNYCRNPDAEIRP ---!!!!---------1111----1111--------33331111---------------- WCYTMDPSVRWEYCNLTQCPVME -----1111-------------- >PROBABLE MANGANESE-DEPEND; SWP:O68579; PDB:1I74A; SKILVFGHQNPDSDAIGSSAYAYLKRQLGVDAQAVALGNPNEETAFVLDYFGIQAPPVVK -------------------------1111------------------------------- SAQAEGAKQVILTDHNEFQQSIADIREVEVVEVVDHHRVANFETANPLYRLEPVGSASSI 3333------------3333---3333----------------------------3333- VYRLYKENGVAIPKEIAGVLSGLISDTLLLKSPTTHASDPAVAEDLAKIAGVDLQEYGLA ---------------------------%%%%11111111--------------------- LKAGTNLASKTAAQLVDIDAKTFELNGSQVRVAQVNTVDINEVLERQNEIEEAIKASQAA -1111-1111--------------iiii----------3333------------------ NGYSDFVLITDILNSNSEILALGNNTDKVEAAFNFTLKNNHAFLAGAVSRKKQVVPQLTE ------------------------3333---------%%%%-------3333-------- SFNG ---- >NEUTROPHIL COLLAGENASE; SWP:P22894; PDB:1I76A; MLTPGNPKWERTNLTYRIRNYTPQLSEAEVERAIKDAFELWSVASPLIFTRISQGEADIN --2222---------------1111---------------1111---------------- IAFYQRDHGDNSPFDGPNGILAHAFQPGQGIGGDAHFDAEETWTNTSANYNLFLVAAHEF ----------------------------!!!!-----1111------------------- GHSLGLAHSSDPGALMYPNYAFRETSNYSLPQDDIDGIQAIYG ----------1111----------------------------- >CYTOCHROME C3; SWP:Q9L915; PDB:1I77A; APAAPDKPLEFKGSQKTVMFPHAVHAKVECVTCHHQVDGKESFAKCGSSGCHDDLAGKQG ---------------------3333---3333----iiii----1111-----------1 EKSLYYVVHTKKELKHTNCIGCHSKVVEGKPELKKDLTACAKSKCHP 111-----------------------11111111--------3333- >HOMER 2B; SWP:Q9QWW1; PDB:1I7AA; EQPIFTTRAHVFQINWVPASKQAVTVSYFYDVTRNSYRIISVDGAKVIINSTITPNMTFT -------------------------------1111-------!!!!-------1111--- KTSQKFGQWADSRANTVFGLGFSSELQLTKFAEKFQEVREAAR 
-----------3333-----------------------1111- >DNA TOPOISOMERASE III; SWP:P14294; PDB:1I7DA; MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQPDAYDSRYAR --------------3333-------2222---%%%%-------------3333------- WNLADLPIVPEKWQLQPRPSVTKQLNVIKRFLHEASEIVHAGDPDREGQLLVDEVLDYLQ --1111-----------3333-----------------------------------1111 LAPEKRQQVQRCLINDLNPQAVERAIDRLRSNSEFVPLCVSALARARADWLYGINMTRAY ---3333-----------------1111-------------------------------- TILGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIW ----1111-----------------------------------------1111------- QPSEACEPYQDEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIE --1111----1111---3333--------------------------------------- AAKRFGLSAQNVLDICQKLYETHKLITFPRSDCRYLPEEHFAGRHAVMNAISVHAPDLLP ------------------------------------33331111-----3333-1111-- QPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEAKVYNLIARQYLMQFCPDAVF 33331111-----3333----------------------------------1111----- RKCVIELDIAKGKFVAKARFLAEAGWRTLLGSKERDEENDGTPLPVVAKGDELLCEKGEV ------------------------3333----3333------------------------ VERQTQPPRHFTDATLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGF -------------------------------------------1111------------- LTKKGRYIHSTDAGKALFHSLPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLY -----------------11113333--3333---------1111--3333---------- QLIDQAKRTPVRQFRGIVAP -----1111-3333------ >PEROXISOME PROLIFERATOR A; SWP:Q07869; PDB:1I7GA; ETADLKSLAKRIYEAYLKNFNMNKVKARVILSGSNNPPFVIHDMETLCMAEKTLQNKEAE ------------------------------------------------------------ VRIFHCCQCTSVETVTELTEFAKAIPGFANLDLNDQVTLLKYGVYEAIFAMLSSVMNKDG ------------------------2222------------------------11111111 MLVAYGNGFITREFLKSLRKPFCDIMEPKFDFAMKFNALELDDSDISLFVAAIICCGDRP --%%%%--------1111-------3333------------------------------- GLLNVGHIEKMQEGIVHVLRLHLQSNHPDDIFLFPKLLQKMADLRQLVTEHAQLVQIIKK ---3333-------------------1111------------------------------ TESDAALHPLLQEIYRDMY -1111-------------- >FERREDOXIN; SWP:P25528; PDB:1I7HA; 
PKIVILPHQDLCPDGAVLEANSGETILDAALRNGIEIEHACEKSCACTTCHCIVREGFDS --------------------2222------1111----1111-----1111-----3333 LPESSEQEDDMLDKAWGLEPESRLSCQARVTDEDLVVEIPRYTINHARE -----------1111---1111-1111-----------------1111- >UBIQUITIN-CONJUGATING ENZ; SWP:O00762; PDB:1I7KA; PVGKRLQQELMTLMMSGDKGISAFPESDNLFKWVGTIHGAAGTVYEDLRYKLSLEFPSGY -----------------2222----1111----------2222-2222--------1111 PYNAPTVKFLTPCYHPNVDTQGNISLDILKEKWSALYDVRTILLSIQSLLGEPNIDSPLN --------------11111111---333311111111-----------1111-------- THAAELWKNPTAFKKYLQETYSKQVT -------------------------- >SYNAPSIN II; SWP:Q9Z1H0; PDB:1I7NA; KAKVLLVVDEPHTDWAKCFRGKKILGDYDIKVEQAEFSELNLVAHADGTYAVDMQVLRNG ---------33333333-2222-------------3333-----1111------------ TKVVRSFRPDFVLIRQHAFGMAENEDFRHLVIGMQYAGLPSINSLESIYNFCDKPWVFAQ ----------------------------------1111----------11113333---- MVAIFKTLGGEKFPLIEQTYYPNHREMLTLPTFPVVVKIGHAHSGMGKVKVENHYDFQDI ---------3333---------3333----------------iiii-------------- ASVVALTQTYATAEPFIDAKYDIRVQKIGNNYKAYMRTSISGNWKTNTGSAMLEQIAMSD ---------------------------!!!!---------!!!!---------------- RYKLWVDACSEMFGGLDICAVKAVHGKDGKDYIFEVMDCSMPLIGEHQVEDRQLITDLVI -----------iiii----------1111--------1111------------------- SKMNQLLS -------- >ANTHRANILATE SYNTHASE; SWP:P00897; PDB:1I7QA; TKPQLTLLKVQASYRGDPTTLFHQLCGARPATLLLESAEINDKQNLQSLLVIDSALRITA -------------------------!!!!------------------------------- LGHTVSVQALTANGPALLPLLDEALPPEVRNQARPNGRELTFPAIDAVQDEDARLRSLSV !!!!------3333-----------3333----2222----------------1111--- FDALRTILTLVDSPADEREAVMLGGLFAYDLVAGFENLPALRQDQRCPDFCFYLAETLLV -----3333----1111----------33331111------------------------- LDHQRGSARLQASVFSEQASEAQRLQHRLEQLQAELQQPPQPIPHQKLENMQLSCNQSDE ----------------------------------1111---------1111--------- EYGAVVSELQEAIRQGEIFQVVPSRRFSLPCPAPLGPYQTLKDNNPSPYMFFMQDDDFTL ------------------------------------------------------1111-- FGASPESALKYDAGNRQIEIYPIAGTRPRGRRADGSLDLDLDSRIELEMRTDHKELAEHL 
------------1111---------------1111------------------------- MLVDLARNDLARICQAGSRYVADLTKVDRYSFVMHLVSRVVGTLRADLDVLHAYQACMNM --------------2222-----------1111-----------1111-----------3 GTLSGAPKVRAMQLIAALRSTRRGSYGGRVGYFTAVRNLDTCIVIRSAYVEDGHRTVQAG 333-------------------!!!!-------1111-------------iiii------ AGVVQDSIPEREADETRNKARAVLRAIATAHHAKEVF ---1111---------------------1111----- >Anthranilate synthase com; SWP:P00900; PDB:1I7QB; ADILLLDNVDSFTYNLVDQLRASGHQVVIYRNQIGAEVIIERLQHMEQPVLMLSPGPGTP -----------1111-----1111------11113333--------------------33 SEAGCMPELLQRLRGQLPIIGICLGHQAIVEAYGGQVGQAGEILHGKASAIAHDGEGMFA 33!!!!------2222--------------1111---------------------!!!!- GMANPLPVARYHSLVGSNIPADLTVNARFGEMVMAVRDDRRRVCGFQFHPESILTTHGAR -------------------1111-----!!!!------1111------1111--1111-- LLEQTLAWALAK ------------ >Epithelial-cadherin [Prec; SWP:P09803; PDB:1I7WB; LKAADSDPTAPPYDSLLVFDYEGGEAASLSLDYLNEWGNRFKKLADMY ------1111---------------------------3333------- >CHIMERA OF IG GAMMA-1 CHA; SWP:P01834; PDB:1I7ZA; DLVLTQSPASLAVSLGQRATISCRASKSVST -------------2222-------------- >CHALCONE SYNTHASE 2; SWP:P30074; PDB:1I88A; MVSVSEIRKAQRAEGPATILAIGTANPANCVEQSTYPDFYFKITNSEHKTELKEKFQRMC -------------------------------3333------11111111----------- DKSMIKRRYMYLTEEILKENPNVCEYMAPSLDARQDMVVVEVPRLGKEAAVKAIKEWGQP -------------------3333------------------------------------3 KSKITHLIVCTTSGVDMPGADYQLTKLLGLRPYVKRYMMYQQGFAGGTVLRLAKDLAENN 333---------------------------1111-------------------------2 KGARVLVVCSEVTAVTFRGPSDTHLDSLVGQALFGDGAAALIVGSDPVPEIEKPIFEMVW 222---------1111----1111---3333----------------2222--------- TAQTIAPDSEGAIDVHLREAGLTFHLLKDVPGIVSKNITKALVEAFEPLGISDYNSIFWI --------2222-----1111-------------------------1111--1111---- AHPGGPAILDQVEQKLALKPEKMNATREVLSEYGNMSSACVLFILDEMRKKSTQNGLKTT ----3333----------3333--------------3333-------------------- GEGLEWGVLFGFGPGLTIETVVLRSVAI iiii------------------------ >ENDO-1,4-BETA-XYLANASE A; SWP:Q60037; PDB:1I8AA; 
MVATAKYGTPVIDGEIDEIWNTTEEIETKAVAMGSLDKNATAKVRVLWDENYLYVLAIVK ----------------3333--------------1111----------1111-------- DPVLNKDNSNPWEQDSVEIFIDENNHKTGYYEDDDAQFRVNYMNEQTFGTGGSPARFKTA ---------1111------------------1111-----1111----22223333---- VKLIEGGYIVEAAIKWKTIKPTPNTVIGFNIQVNDANEKGQRVGIISWSDPTNNSWRDPS ---2222--------------2222-----------1111-------------3333-11 KFGNLRLIK 11------- >RIBOFLAVIN SYNTHASE; SWP:P29015; PDB:1I8DA; MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVS -----------------1111-------33332222-------iiii-------!!!!-- FDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQ -----3333-3333--2222--------1111----------------------2222-- IWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARV ----------11112222---iiii-------1111-------------3333------- NIEIDPQTQAVVDTVERVLAARENAM -------------------------- >PUTATIVE SNRNP SM-LIKE PR; SWP:Q8ZYG5; PDB:1I8FA; ATLGATLQDSIGKQVLVKLRDSHEIRGILRSFDQHVNLLLEDAEEIIDGNVYKRGTMVVR -------1111--------%%%%---------1111----------iiii---------3 GENVLFISPVP 333-------- >EPIDERMAL GROWTH FACTOR R; SWP:P18529; PDB:1I8KA; DIELTQSPASLSVATGEKVTIRCMTSTDIDDDMNWYQQKPGEPPKFLISEGNTLRPGVPS -------------2222-----------%%%%------2222------------222233 RFSSSGTGTDFVFTIENTLSEDVGDYYCLQSFNVPLTFGCGTKLEI 33----------------3333------------------------ >Ig heavy chain V region 5; SWP:P18529; PDB:1I8KB; QVKLQQSGGGLVKPGASLKLSCVTSGFTFRKFGMSWVRQTSDKCLEWVASISTGGYNTYY ------------2222-----------3333--------1111--------1111----- SDNVKGRFTISRENAKNTLYLQMSSLKSEDTALYYCTRGYSSTSYAMDYWGQGTTVTVS 3333--------3333----------3333----------------------------- >T LYMPHOCYTE ACTIVATION A; SWP:P33681; PDB:1I8LA; VIHVTKEVKEVATLSCGHNVSVEELAQTRIYWQKEKKMVLTMMSGDMNIWPEYKNRTIFD --------------------3333------------------%%%%-------------3 ITNNLSIVILALRPSDEGTYECVVLKYEKDAFKREHLAEVTLSVKADFPTPSISDFEIPT 333---------3333-------------------------------------------- SNIRRIICSTSGGFPEPHLSWLENGEELNAINTTVSQDPETELYAVSSKLDFNMTTNHSF 
------------------------------------------------------------ MCLIKYGHLRVNQTFNWNT -----!!!!---------- >Cytotoxic T-lymphocyte pr; SWP:P16410; PDB:1I8LC; MHVAQPAVVLASSRGIASFVCEYASPGKATEVRVTVLRQADSQVTEVCAATYMMGNELTF ---------------------------------------%%%%----------------- LDDSICTGTSSGNQVNLTIQGLRAMDTGLYICKVELMYPPPYYLGIGNGAQIYVIDPE ----------------------3333-------------------------------- >ANTI-PLATELET PROTEIN; SWP:Q01747; PDB:1I8NA; ETITAGNEDCWSKRPGWKLPDNLLTKTEFTSVDECRKMCEESAVEPSCYILQINTETNEC -------------------1111------------------------------------- YRNNEGDVTWSSLQYDQPNVVQWHLHACS --------3333----------------- >CYTOCHROME C2; SWP:P00091; PDB:1I8OA; DAKAGEAVFKQCMTCHRADKNMVGPALAGVVGRKAGTAAGFTYSPLNHNSGEAGLVWTAD 3333----------------------2222-------2222---------1111---333 NIVPYLADPNAFLKKFLTEKGKADQAVGVTKMTFKLANEQQRKDVVAYLATLK 3----------------11111111-----------------------1111- >UDP-GALACTOPYRANOSE MUTAS; SWP:P37747; PDB:1I8TA; MYDYIIVGSGLFGAVCANELKKLNKKVLVIEKRNHIGGNAYTEDCEGIQIHKYGAHIFHT ------------------3333--------------!!!!----iiii------------ NDKYIWDYVNDLVEFNRFTNSPLAIYKDKLFNLPFNMNTFHQMWGVKDPQEAQNIINAQK -------------------------!!!!------------------------------- KKYGDKVPENLEEQAISLVGEDLYQALIKGYTEKQWGRSAKELPAFIIKRIPVRFTFDNN 3333----------------------------------3333------------------ YFSDRYQGIPVGGYTKLIEKMLEGVDVKLGIDFLKDKDSLASKAHRIIYTGPIDQYFDYR ---------2222--------2222------3333-----1111--------------11 FGALEYRSLKFETERHEFPNFQGNAVINFTDANVPYTRIIEHKHFDYVETKHTVVTKEYP 11----------------------------1111------3333---------------- LEWKVGDEPYYPVNDNKNMELFKKYRELASREDKVIFGGRLAEYKYYDMHQVISAALYQV --------------------------------------3333------------------ KNIMSTD ------- >GRANULIN-1; SWP:P81013; PDB:1I8XA; VIHCDAATICPDGTTCSLSPYGVWYCSPFS --!!!!------------1111-------- >30S ribosomal protein S8; SWP:P24319; PDB:1I94H; MLTDPIADMLTRIRNATRVYKESTEVPASRFKEEILKILAREGFIKGYERVEVDGKPYLR -----3333-------1111----------3333-------------------------- 
IHLKYGPRRQGPDPRPEQVIKHIRRISRPGRRVYVGVKEIPRVRRGLGIAILSTPKGVLT ------------------------------------1111-------------------- DREARKLGVGGELICEVW --3333------------ >30S ribosomal protein S17; SWP:P24321; PDB:1I94Q; PKKVLTGVVVSDKMQKTVTVLVERQFPHPLYGKVIKRSKKYLAHDPEERYKVGDVVEIIE --------------------------------------------1111------------ ARPISKRKRFRVLRLVEEGRLDLVEKYLVRRQNYASLSKRGGKA --------------------3333--------3333----1111 >HYPOTHETICAL PROTEIN RV21; SWP:O33253; PDB:1I9GA; TGPFSIGERVQLTDAKGRRYTMSLTPGAEFHTHRGSIAHDAVIGLEQGSVVKSSNGALFL ----2222----------------2222---------333322222222----------- VLRPLLVDYVMSMPRGPQVIYPKDAAQIVHEGDIFPGARVLEAGAGSGALTLSLLRAVGP --------1111--------3333-----1111-2222------!!!!3333------11 AGQVISYEQRADHAEHARRNVSGCYGQPPDNWRLVVSDLADSELPDGSVDRAVLDMLAPW 11-----------------------------------3333----------------333 EVLDAVSRLLVAGGVLMVYVATVTQLSRIVEALRAKQCWTEPRAWETLQRGWNVVGLAVR 3---------2222----------------------------------------!!!!-- PQHSMRGHTAFLVATRRLAPGAVA ------------------------ >RECOMBINANT MONOCLONAL AN; SWP:NA; PDB:1I9IH; EVKLVESGGGLVKPGGSLKLSCAASGFTFSTYALSWVRQTADKRLEWVASIVSGGNTYYS ------------2222-----------1111--------1111--------3333----- GSVKGRFTISRDIARNILYLQMSSLRSEDTAMYYCAREYYGYVGLAYWGQGTLVTVSAAK --2222------1111---------3333------------------------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%------------------ TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC --------3333-----------3333------------- >CD40 LIGAND; SWP:NA; PDB:1I9RH; QVQLVQSGAEVVKPGASVKLSCKASGYIFTSYYMYWVKQAPGQGLEWIGEINPSNGDTNF ---------------------------3333----------------------------- NEKFKSKATLTVDKSASTAYMELSSLRSEDTAVYYCTRSDGRNDMDSWGQGTLVTVSSAS 3333---------1111---------1111---------%%%%----------------- TKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGL -----------1111-!!!!---------------------------------------- YSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK ---------3333-------------------------- >CD40 LIGAND; 
SWP:NA; PDB:1I9RL; DIVLTQSPATLSVSPGERATISCRASQRVSSSTYSYMHWYQQKPGQPPKLLIKYASNLES -------------2222-------------------------2222------------22 GVPARFSGSGSGTDFTLTISSVEPEDFATYYCQHSWEIPPTFGGGTKLEIKRTVAAPSVF 223333----------------3333---------------------------------- IFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLS ----3333-------------------------iiii----------------------- STLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNR ---------1111---------------------- >PHOSPHATIDYLINOSITOL PHOS; SWP:O43001; PDB:1I9ZA; YDPIHEYVNHELRKRENEFSEHKNVKIFVASYNLNGCSATTKLENWLFPENTPLADIYVV ----------33333333---------------iiii-----3333-------------- GFQEIVQLTSADPAKRREWESCVKRLLNGKCTSGPGYVQLRSGQLVGTALMIFCKESCLP ---------------------------1111-------------!!!!------333311 SIKNVEGTVKKTGLGNKGAVAIRFDYEDTGLCFITSHLAAGYTNYDERDHDYRTIASGLR 11-----------------------!!!!------------------------------- FRRGRSIFNHDYVVWFGDFNYRISLTYEEVVPCIAQGKLSYLFEYDQLNKQMLTGKVFPF -%%%%1111----------------3333----1111--------------1111--222 FSELPITFPPTYKFDIGTDIYDTSDKHRVPAWTDRILYRGELVPHSYQSVPLYYSDHRPI 2-------------2222-----3333--------------------------------- YATYEANIVKVDREKKKILFEELYNQRKQEVRDASQ --------------------------------1111 >POLYGALACTURONASE; SWP:O74213; PDB:1IA5A; ATTCTFSGSNGASSASKSKTSCSTIVLSNVAVPSGTTLDLTKLNDGTHVIFSGETTFGYK -------1111------1111-----------2222-------2222------------- EWSGPLISVSGSDLTITGASGHSINGDGSRWWDGEGGNGGKTKPKFFAAHSLTNSVISGL ------------------2222-----3333---!!!!---------------------- KIVNSPVQVFSVAGSDYLTLKDITIDNSDGDDNGGHNTDAFDIGTSTYVTISGATVYNQD --------------------------3333------------------------------ DCVAVNSGENIYFSGGYCSGGHGLSIGSVGGRSDNTVKNVTFVDSTIINSDNGVRIKTNI ----------------------------------------------------------22 DTTGSVSDVTYKDITLTSIAKYGIVVQQNYGDTSSTPTTGVPITDFVLDNVHGSVVSSGT 22-----------------------------1111--------------------1111- NILISCGSGSCSDWTWTDVSVSGGKTSSKCTNVPSGASC ------2222-----------------------1111-- >CELLULASE CEL9M; SWP:Q9EYQ2; PDB:1IA6A; 
AGTHDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDH ---------------3333---1111-----1111---11113333-------------- VKFGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVG -------------------------1111---------------------1111------ EGNADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKC 33331111--3333-----------1111-------------------1111-------- LNAAKELYAMGKANQGVGNGQSFYQATSFGDDLAWAATWLYTATNDSTYITDAEQFITLN ---------------------------------------------3333-----3333-1 KMQDKWTMCWDDMYVPAALRLAQITGKQIYKDAIEFNFNYWKTQVTTTPGGLKWLSNWGV 111-----1111-----------------------------------1111--------- LRYAAAESMVMLVYCKQNPDQSLLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHRA -------------------------------------1111---2222---------333 ANGYTYADNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLAGAIK 3------1111-----2222-------------11113333------------------- FFG --- >CHK1 CHECKPOINT KINASE; SWP:O14757; PDB:1IA8A; AVPFVEDWDLVQTLGEGAYGEVQLAVNRVTEEAVAVKIVDMKRCPENIKKEICINKMLNH ---1111--------------------------------1111-----------1111-1 ENVVKFYGHRREGNIQYLFLEYCSGGELFDRIEPDIGMPEPDAQRFFHQLMAGVVYLHGI 111--------!!!!-------11113333------------------------------ GITHRDIKPENLLLDERDNLKISDFGLATVFRYNNRERLLNKMCGTLPYVAPELLKRREF -------3333---1111------1111----%%%%---------3333-3333------ HAEPVDVWSCGIVLTAMLAGELPWDQPSDSCQEYSDWKEKKTYLNPWKKIDSAPLALLHK -3333----------------------3333-----111111113333---3333----- ILVENPSARITIPDIKKDRWYNKPLKKGAKRP ----3333--3333----3333---------- >TRANSIENT RECEPTOR POTENT; SWP:Q923J1; PDB:1IA9A; YYYSAVERNNLMRLSQSIPFVPVPPRGEPVTVYRLEESSPSILNNSMSSWSQLGLCAKIE ------------1111----------------------33331111-------------- FLSKMGGGLRRAVKVLCTWSEHDILKSGHLYIIKSFLPEVINTWSSIYKEDTVLHLCLRE --------------------%%%%-2222--------------3333------------- IQQQRAAQKLTFAFNQMKPKSIPYSPRFLEVFLLYCHSAGQWFAVEECMTGEFRKYNNNN ------------------1111--------------1111-----------------111 GDEIIPTNTLEEIMLAFSHWTYEYTRGELLVLDLQGVGENLTDPSVIKAEEKRSCDMVFG 
1-----------------------iiii--------!!!!-------1111-3333---- PANLGEDAIKNFRAKHHCNSCCRKLKLPDLKRNDYT ----1111--------------1111--11111111 >ASTACIN; SWP:P07584; PDB:1IAB; AAILGDEYLWSGGVIPYTFAGVSGADQSAILSGMQELEEKTCIRFVPRTTESDYVEIFTS ----3333-2222----------------------------------------------- GSGCWSYVGRISGAQQVSLQANGCVYHGTIIHELMHAIGFYHEHTRMDRDNYVTINYQNV -----------------------------------------11111111------3333- DPSMTSNFDIDTYSRYVGEDYQYYSIMHYGKYSFSIQWGVLETIVPLQNGIDLTDPYDKA 1111-1111------------1111------2222-2222--------------3333-- HMLQTDANQINNLYTNECSL -------------------- >ADAMALYSIN II; SWP:P34179; PDB:1IAG; NLPQRYIELVVVADRRVFMKYNSDLNIIRTRVHEIVNIINKFYRSLNIRVSLTDLEIWSG --------------------%%%%------------------3333-------------- QDFITIQSSSSNTLNSFGEWRERVLLIWKRHDNAQLLTAINFEGKIIGKAYTSSMCNPRS ------------------------3333--------------%%%%-------2222--- SVGIVKDHSPINLLVAVTMAHELGHNLGMEHDGKDCLRGASLCIMRPGLTPGRSYEFSDD ----------3333------------------1111-!!!!-1111-------------- SMGYYQKFLNQYKPQCILNKP -------------3333---- >Igk-C protein; SWP:Q569Y8; PDB:1IAIH; QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMNWVKQAPGKGLKWMAWINTYTGEPTY ---------------------------1111----------------------------- ADDFKGRFAFSLETSASTAYLQINNLKNEDTATYFCARDGYYENYYAMDYWGQGTSVTVS 3333---------1111---------3333-----------iiii--------------- SAKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQS ---------------------------------------------3333---------%% DLYTLSSSVTVTSSTTPSQSITCNVAHPASSTKVDKKID %%-------------1111-------------------- >Igk-C protein; SWP:Q58EU4; PDB:1IAIL; DIVMTQSHKFMSTSVGDRVSITCKASQDVSTAVAWYQQKPGQSPKLLIYSASYQYTGVPD -------------2222-------------------------------------2222-- RFTGSGSRTDFTFTINSVQAEDLAVYYCHQHYSTPFTFGSGTKLEIKRADAAPTVSIFPP ------------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT ------------------------------------------------------------ LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ---3333--------------------------- --------- 
>INTERCELLULAR ADHESION MO; SWP:P05362; PDB:1IAM; QTSVSPSKVILPRGGSVLVTCSTSCDQPKLLGIETPLPKKELLLPGNNRKVYELSNVQED -----------2222--------------------------------------------- SQPMCYSNCPDGQSTAKTFLTVYWTPERVELAPLPSWQPVGKQLTLRCQVEGGAPRAQLT --------3333--------------------------2222------------3333-- VVLLRGEKELKREPAVGEPAEVTTTVLVRRDHHGAQFSCRTELDLRPQGLELFENTSAPY ----!!!!--------------------3333------------1111------------ QLQTF ----- >GUANINE NUCLEOTIDE EXCHAN; SWP:Q92888; PDB:1IAPA; SQFQSLEQVKRRPAHLMALLQHVALQFEPGPLLCCLHADMLGSLGPKEAKKAFLDFYHSF 111133331111----------------------------11113333------------ LEKTAVLRVPVPPNVAFELDRTRADLISEDVQRRFVQEVVQSQQVAVGRQLEDFRSKRLM -1111-----------------3333-------------------------------111 GMTPWEQELAQLEAWVGRDRASYEARERHVAERLLMHLEEMQHTISTDEEKSAAVVNAIG 1---------3333--------------------------3333---------------- LYMRHLGVRT ---1111--- >Interleukin-4 receptor al; SWP:P24394; PDB:1IARB; FKVLQEPTCVSDYMSISTCEWKMNGPTNCSTELRLLYQLVFLLSEAHTCIPENNGGAGCV ---------------------------3333-----------------------1111-- CHLLMDDVVSADNYTLDLWAGQQLLWKGSFKPSEHVKPRAPGNLTVHDTLLLTWSNPYPP --------3333------------------3333------------------------11 DNYLYNHLTYAVNIWSENDPADFRIYNVTYLEPSLRIAAGISYRARVRAWAQAYNTTWSE 11------------------------------------------------3333------ WSPSTKWH -------- >PHOSPHOGLUCOSE ISOMERASE; SWP:P06744; PDB:1IATA; AALTRDPQFQKLQQWYREHRSELNLRRLFDANKDRFNHFSLTLNTNHGHILVDYSKNLVT 3333--------------3333---------11111111--------------------- EDVMRMLVDLAKSRGVEAARERMFNGEKINYTEGRAVLHVALRNRSNTPILVDGKDVMPE ----------------------1111----1111---3333--3333----iiii----- VNKVLDKMKSFCQRVRSGDWKGYTGKTITDVINIGIGGSDLGPLMVTEALKPYSSGGPRV ---------------------1111----------!!!!----------33332222--- WYVSNIDGTHIAKTLAQLNPESSLFIIASKTFTTQETITNAETAKEWFLQAAKDPSAVAK -------------3333-3333------3333-----------------------3333- HFVALSTNTTKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDNFEQLLSGAH ------------3333-1111----1111111111111111------------------- 
WMDQHFRTTPLEKNAPVLLALLGIWYINCFGCETHAMLPYDQYLHRFAAYFQQGDMESNG ---------3333---------------------------3333---------------- KYITKSGTRVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGLH ---1111----------------3333---3333--------------------%%%%-- HKILLANFLAQTEALMRGKSTEEARKELQAAGKSPEDLERLLPHKVFEGNRPTNSIVFTK ----------------------------1111--------3333---------------- LTPFMLGALVAMYEHKIFVQGIIWDINSFDQWGVELGKQLAKKIEPELDGSAQVTSHDAS ------------------------------1111-----------3333----------- TNGLINFIKQQREARV ----------3333-- >1-AMINOCYCLOPROPANE-1-CAR; SWP:P18485; PDB:1IAYA; ILSKLATNESPYFDGWKAYDSDPFHPLKNPNGVIQMGLAENQLCLDLIEDWIKRNPKGSI --3333----------------------1111---------------------------- CSSFKAIANFQDYHGLPEFRKAIAKFMEKTRGGRVRFDPERVVMAGGATGANETIIFCLA ---------------3333-----------%%%%---1111------------------- DPGDAFLVPSPYYPAFNRDLRWRTGVQLIPIHCESSNNFKITSKAVKEAYENAQKSNIKV 2222--------3333--------------------%%%%-------------1111--- KGLILTNPSNPLGTTLDKDTLKSVLSFTNQHNIHLVCDEIYAATVFDTPQFVSIAEILDE --------------------------------------1111----------33331111 QEMTYCNKDLVHIVYSLSKDMGLPGFRVGIIYSFNDDVVNCARKMSSFGLVSTQTQYFLA 1111--1111------------1111---------------------------------- AMLSDEKFVDNFLRESAMRLGKRHKHFTNGLEVVGIKCLKNNAGLFCWMDLRPLLRESTF 1111---------------------------1111---------------3333------ DSEMSLWRVIINDVKLNVSPGSSFECQEPGWFRVCFANMDDGTVDIALARIRRFVGVEK -------------------3333------------------------------------ >EQUINATOXIN II; SWP:P17723; PDB:1IAZA; AGAVIDGASLSFDILKTVLEALGNVKRKIAVGVDNESGKTWTALNTYFRSGTSDIVLPHK -----3333----------3333------------------------------------- VPHGKALLYNGQKDRGPVATGAVGVLAYLMSDGNTLAVLFSVPYDYNWYSNWWNVRIYKG -2222------------------------1111--------------------------- KRRADQRMYEELYYNLSPFRGDNGWHTRNLGYGLKSRGFMNSSGHAILEIHVSKA ------------------------------%%%%--------------------- >PUMILIO 1; SWP:Q14671; PDB:1IB2A; GRSRLLEDFRNNRYPNLQLREIAGHIMEFSQDQHGSRFIQLKLERATPAERQLVFNEILQ --------1111-----33332222----------------3333------------333 
AAYQLMVDVFGNYVIQKFFEFGSLEQKLALAERIRGHVLSLALQMYGCRVIQKALEFIPS 3--------------------------------2222--------3333----------- DQQNEMVRELDGHVLKCVKDQNGNHVVQKCIECVQPQSLQFIIDAFKGQVFALSTHPYGC -------1111-----------------------3333-------2222----------- RVIQRILEHCLPDQTLPILEELHQHTEQLVQDQYGNYVIQHVLEHGRPEDKSKIVAEIRG ----------3333-------------3333--------------------------222 NVLVLSQHKFASNVVEKCVTHASRTERAVLIDEVCTMNDGPHSALYTMMKDQYANYVVQK 23333----3333------------------3333---!!!!-----1111--------- MIDVAEPGQRKIVMHKIRPHIA -1111-------3333------ >CONSERVED PROTEIN SP14.3; SWP:NA; PDB:1IB8A; GSGVDAIATIVELVREVVEPVIEAPFELVDIEYGKIGSDMILSIFVDKPEGITLNDTADL ------------------------------------------------------------ TEMISPVLDTIKPDPFPEQYFLEITSPGLERPLKTKDAVAGAVGKYIHVGLYQAIDKQKV ---3333-------------------------------3333------------------ FEGTLLAFEEDELTMEYMDKTRKKTVQIPYSLVSKARLAVKLLE --------%%%%------%%%%---------------------- >GLUCOSE PERMEASE; SWP:P69786; PDB:1IBA; MAPALVAAFGGKENITNLDACITRLRVSVADVSKVDQAGLKKLGAAGVVVAGSGVQAIFG 3333----1111-----------------------3333--------------------- TKSDNLKTEMDEYIRNFG -3333------------- >IGG2B-KAPPA 40-50 FAB (HE; SWP:NA; PDB:1IBGH; VHLVQSGPGLVAPSQSLSITCTVSGFSLTTYGVHWFRQPPGKGLEWLGLIWAGGNTDYNS --------------------------3333--------2222------------------ ALMSRLSINKDNSKSQVFLKMNSL --1111-----1111--------- >CYSTATHIONINE BETA-LYASE; SWP:P53780; PDB:1IBJA; ASVSTLLVNLDNKFDPFDAMSTPLYQTATFKQPSAIENGPYDYTRSGNPTRDALESLLAK -33331111--3333---------------------------3333-------------- LDKADRAFCFTSGMAALSAVTHLIKNGEEIVAGDDVYGGSDRLLSQVVPRSGVVVKRVNT ------------------------2222---------3333-----3333--------11 TKLDEVAAAIGPQTKLVWLESPTNPRQQISDIRKISEMAHAQGALVLVDNSIMSPVLSRP 11-------------------------------------1111----------1111-33 LELGADIVMHSATKFIAGHSDVMAGVLAVKGEKLAKEVYFLQNSEGSGLAPFDCWLCLRG 33--------3333----------------------------1111-------------- IKTMALRIEKQQENARKIAMYLSSHPRVKKVYYAGLPDHPGHHLHFSQAKGAGSVFSFIT ------------------------1111----1111-------1111------------- 
GSVALSKHLVETTKYFSIAVSFGSVKSLISMPCFMSHASIPAEVREARGLTEDLVRISAG -3333---------------------------------------------1111------ IEDVDDLISDLDIAFKTFPL -------------------- >ASPERGILLOPEPSIN; SWP:Q12567; PDB:1IBQA; SKGSAVTTPQNNDEEYLTPVTVGKSTLHLDFDTGSADLWVFSDELPSSEQTGHDLYTPSS ---------------------!!!!----------------111133332222-----11 SATKLSGYSWDISYGDGSSASGDVYRDTVTVGGVTTNKQAVEAASKISSEFVQDTANDGL 11--2222----------------------iiii-------------3333--------- LGLAFSSINTVQPKAQTTFFDTVKSQLDSPLFAVQLKHDAPGVYDFGYIDDSKYTGSITY ----3333--------------1111-----------------------1111------- TDADSSQGYWGFSTDGYSIGDGSSSSSGFSAIADTGTTLILLDDEIVSAYYEQVSGAQES ----1111----------!!!!-----------1111-----3333-------2222--- YEAGGYVFSCSTDLPDFTVVIGDYKAVVPGKYINYAPVSTGSSTCYGGIQSNSGLGLSIL --------3333--------!!!!----3333------2222------------------ GDVFLKSQYVVFNSEGPKLGFAAQA 3333--------------------- >DNA FRAGMENTATION FACTOR ; SWP:P19909; PDB:1IBXA; MLQKPKSVKLRALRSPRKFGVAGRSCQEVLRKGCLRFQLPERGSRLCLYEDGTELTEDYF -------------------------3333----3333---3333---------------- PSVPDNAELVLLTLGQAWQGH --------------------- >DNA FRAGMENTATION FACTOR ; SWP:NA; PDB:1IBXB; SGEIRTLKPCLLRRNYSREQHGVAASCLEDLRSKACDILAIDKSLTPVTLVLAEDGTIVD ------------------------------------3333--1111-------------- DDDYFLCLPSNTKFVALASNEKWAYNNSD --3333----------------------- >NITROSOCYANIN; SWP:Q820S6; PDB:1IBYA; EHNFNVVINAYDTTIPELNVEGVTVKNIRAFNVLNEPETLVVKKGDAVKVVVENKSPISE -------------------iiii-------------------2222-------------- GFSIDAFGVQEVIKAGETKTISFTADKAGAFTIWCQLHPKNIHLPGTLNVVE ----1111-----2222---------------------1111---------- ------------------------------------------------------------ ------------------- >PROTEINASE K; SWP:P06873; PDB:1IC6A; AAQTNAPWGLARISSTSPGTSTYYYDESAGQGSCVYVIDTGIEASHPEFEGRAQMVKTYY -----------1111----------33332222---------1111--iiii-------- YSSRDGNGHGTHCAGTVGSRTYGVAKKTQLFGVKVLDDNGSGQYSTIIAGMDFVASDKNN ------------------------------------1111-----------------111 
RNCPKGVVASLSLGGGYSSSVNSAAARLQSSGVMVAVAAGNNNADARNYSPASEPSVCTV 1-1111--------------------------------------3333--1111------ GASDRYDRRSSFSNYGSVLDIFGPGTDILSTWIGGSTRSISGTSMATPHVAGLAAYLMTL ---1111--1111--3333-------------%%%%-----3333--------------- GKTTAASACRYIADTANKGDLSNIPFGTVNLLAYNNYQA ---3333-----------------2222----------- >Ig heavy chain V region 3; SWP:P01823; PDB:1IC7H; DVQLQESGPSLVKPSQTLSLTCSVTGDSITSAYWSWIRKFPGNRLEYMGYVSYSGSTYYN ------------2222-----------1111--------------------1111----- PSLKSRISITRDTSKNQYYLDLNSVTTEDTATYYCANWAGDYWGQGTLVTVSAA --%%%%------1111---------3333-------3333-------------- >HEPATOCYTE NUCLEAR FACTOR; SWP:P20823; PDB:1IC8A; KELENLSPEEAAHQKAVVETLLQEDPWRVAKMVKSYLQQHNIPQREVVDTTGLNQSHLSQ ------------------------3333---------1111-3333-------------- HLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHARNRFKWGPASQQILFQAYERQKNPSK -------------------------3333------------------------------- EERETLVEECNRAECIQRGVSPSQAQGLGSNLVTEVRVYNWFANRRKEEA --3333---------------1111--!!!!--1111------------- ---------------------------------------- >HLA class II histocompati; SWP:P04233; PDB:1ICFI; LTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHH -------1111---2222-----1111--------1111-----1111------------ NCSES ----- >TUMOR NECROSIS FACTOR REC; SWP:P19438; PDB:1ICHA; PATLYAVVENVPPLRWKEFVKRLGLSDHEIDRLELQNGRCLREAQYSMLATWRRRTPRRE ------------1111---------3333----3333----------------------- ATLELLGRVLRDMDLLGCLEDIEEALC 3333------1111---------3333 >12-OXOPHYTODIENOATE REDUC; SWP:Q9XG54; PDB:1ICPA; QVDKIPLMSPCKMGKFELCHRVVLAPLTRQRSYGYIPQPHAILHYSQRSTNGGLLIGEAT ----3333----!!!!---------------2222--3333----11112222------- VISETGIGYKDVPGIWTKEQVEAWKPIVDAVHAKGGIFFCQIWHVGRVSNKDFQPNGEDP --1111------------------------3333----------!!!!-33332222--- ISCTDRGLTPQIMSNGIDIAHFTRPRRLTTDEIPQIVNEFRVAARNAIEAGFDGVEIHGA ------------1111------------1111---------------------------% HGYLIDQFMKDQVNDRSDKYGGSLENRCRFALEIVEAVANEIGSDRVGIRISPFAHYNEA %%%--------------1111-3333----------------1111-----1111-%%%% 
GDTNPTALGLYMVESLNKYDLAYCHVVEPRMKTCTESLVPMRKAYKGTFIVAGGYDREDG ---------------3333----------------------------------------- NRALIEDRADLVAYGRLFISNPDLPKRFELNAPLNKYNRDTFYTSDPIVGYTDYPFLE --------------3333-------------------3333------2222------- >PROTEIN LLR18A; SWP:P52778; PDB:1ICXA; GIFAFENEQSSTVAPAKLYKALTKDSDEIVPKVIEPIQSVEIVEGNGGPGTIKKIIAIHD ----------------------1111--3333-3333----------------------- GHTSFVLHKLDAIDEANLTYNYSIIGGEGLDESLEKISYESKILPGPDGGSIGKINVKFH --------------1111------------1111------------%%%%---------- TKGDVLSETVRDQAKFKGLGLFKAIEGYVLAHPDY ------3333----3333-------------1111 >PHOQ HISTIDINE KINASE; SWP:P23837; PDB:1ID0A; RELHPVAPLLDNLTSALNKVYQRKGVNISLDISPEISFVGEQNDFVEVMGNVLDNACKYC --------------------3333--------1111----3333---------------- LEFVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQGVGLAVAREITE --------------------------1111--1111--------1111-----------1 QYEGKIVAGESMLGGARMEVIFGRQH 111-------3333------------ >PUTATIVE POTASSIUM CHANNE; SWP:P31069; PDB:1ID1A; HRKDHFIVCGHSILAINTILQLNQRGQNVTVISNLPEDDIKQLEQRLGDNADVIPGDSND ----------------------1111---------3333-----------------3333 SSVLKKAGIDRCRAILALSDNDADNAFVVLSAKDMSSDVKTVLAVSDSKNLNKIKMVHPD -------3333--------------------1111-----------3333-3333----- IILSPQLFGSEILARVLNGEEINNDMLVSMLLN ---3333---------------3333------- >AMICYANIN; SWP:P22365; PDB:1ID2A; QDKITVTSEKPVAAADVPADAVVVGIEKMKYLTPEVTIKAGETVYWVNGEVMPHNVAFKK ------------3333-1111-----%%%%--------2222----------------22 GIVGEDAFRGEMMTKDQAYAITFNEAGSYDYFCTPHPFMRGKVIVE 22------------------------------1111---------- >HISTONE H3; SWP:P02303; PDB:1ID3A; PHRYKPGTVALREIRRFQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAIGALQESVE ----2222----------------------------3333--------3333-------- AYLVSLFEDTNLAAIHAKRVTIQKKEIKLARRLRGER -------------3333-----3333----------- >Histone H2A.1; SWP:P04911; PDB:1ID3C; QSRSAKAGLTFPVGRVHRLLRRGNYAQRIGSGAPVYLTAVLEYLAAEILELAGNAARDNK -----------------------------3333-----------------------1111 
KTRIIPRHLQLAIRNDDELNKLLGNVTIAQGGVLPNIHQNLLPKKSAKAT -----------------3333------2222------3333--------- >Histone H2B.2; SWP:P02294; PDB:1ID3D; RKETYSSYIYKVLKQTHPDTGISQKSMSILNSFVNDIFERIATEASKLAAYNKKSTISAR ----3333--------3333---------------------------------------- EIQTAVRLILPGELAKHAVSEGTRAVTKYSSST --------------------------------- >HIV-2 PROTEASE; SWP:P04584; PDB:1IDAA; PQFSLWKRPVVTAYIEGQPVEVLLDTGADDSIVAGIELGNNYSPKIVGGIGGFINTKEYK --------------iiii------1111--------------------1111-------- NVEIEVLNKKVRATIMTGDTPINIFGRNILTALGMSLNL -----%%%%------------------------------ >PECTIN LYASE A; SWP:Q01172; PDB:1IDK; VGVSGSAEGFAKGVTGGGSATPVYPDTIDELVSYLGDDEARVIVLTKTFDFTDSEGTTTG --------1111--!!!!--------------------------------1111------ TGCAPWGTASACQVAIDQDDWCENYEPDAPSVSVEYYNAGTLGITVTSNKSLIGEGSSGA --------1111-----%%%%---------------3333-------------------- IKGKGLRIVSGAENIIIQNIAVTDINPKYVWGGDAITLDDCDLVWIDHVTTARIGRQHYV --------2222-------------1111------------------------------- LGTSADNRVSLTNNYIDGVSDYSATCDGYHYWAIYLDGDADLVTMKGNYIYHTSGRSPKV ---2222----------------------------------------------------- QDNTLLHAVNNYWYDISGHAFEIGEGGYVLAEGNVFQNVDTVLETYEGEAFTVPSSTAGE -----------------------2222---------------------------3333-- VCSTYLGRDCVINGFGSSGTFSEDSTSFLSDFEGKNIASASAYTSVASRVVANAGQGNL -------------------------11111111--------3333---------2222- >SCYTALONE DEHYDRATASE; SWP:P56221; PDB:1IDPA; DEITFSDYLGLMTCVYEWADSYDSKDWDRLRKVIAPTLRIDYRSFLDKLWEAMPAEEFVG ------------------------------------------1111-------------- MVSSKQVLGDPTLRTQHFIGGTRWEKVSEDEVIGYHQLRVPHQRYKDTTMKEVTMKGHAH ---1111--1111----------------------------------------------- SANLHWYKKIDGVWKFAGLKPDIRWGE ---------iiii-------------- >IRON SUPEROXIDE DISMUTASE; SWP:P17670; PDB:1IDSA; AEYTLPDLDWDYGALEPHISGQINELHHSKHHATYVKGANDAVAKLEEARAKEDHSAILL ----------1111----------------------------------------1111-- NEKNLAFNLAGHVNHTIWWKNLSPNGGDKPTGELAAAIADAFGSFDKFRAQFHAAATTVQ ------------------11111111---------------------------------- 
GSGWAALGWDTLGNKLLIFQVYDHQTNFPLGIVPLLLLDMWEHAFYLQYKNVKVDFAKAF ----------------------------------------1111----!!!!-----333 WNVVNWADVQSRYAAATS 3----------------- >NEURAL CELL ADHESION MOLE; SWP:P13590; PDB:1IE5A; GKDIQVIVNVPPSVRARQSTMNATANLSQSVTLACDADGFPEPTMTWTKDGEPIEQEDNE ------------------------------------------------------1111-- EKYSFNYDGSELIIKKVDKSDEAEYICIAENKAGEQDATIHLKVFAK ------------------------------1111------------- >IMPERATOXIN A; SWP:P59868; PDB:1IE6A; GDCLPHLKRCKADNDCCGKKCKRRGTNAEKRCR ---------------1111-------------- >VITAMIN D3 RECEPTOR; SWP:P11473; PDB:1IE9A; DSLRPKLSEEQQRIIAILLDAHHKTYDPTYSDFCQFRPPVRVNDGGGSVTLELSQLSMLP -----------------------------1111----------1111--------1111- HLADLVSYSIQKVIGFAKMIPGFRDLTSEDQIVLLKSSAIEVIMLRSNESFTMDDMSWTC -------------------2222------------------------1111--------- GNQDYKYRVSDVTKAGHSLELIEPLIKFQVGLKKLNLHEEEHVLLMAICIVSPDRPGVQD -3333--3333-1111-3333-----------3333------------------2222-- AALIEAIQDRLSNTLQTYIRCRHPPPGSHLLYAKMIQKLADLRSLNEEHSKQYRCLSFQP -------------------------1111-----------------------------22 ECSMKLTPLVLEVFG 221111--------- >BRUC.D4.4; SWP:NA; PDB:1IEHA; DVQLQASGGGLVQPGGSLRVSCAASGFTFSSYHMAWVRQAPGKGLEWVSTINPGDGSTYY ------------------------------------------------------------ ADSVKGRFTISRDNAKNTLYLQMNSLKSEDTAVYYCAKYSGGALDAWGQGTQVTVSSQSE ---1111----------------------------------------------------- QKLISEEDLNHHHHH --------------- >OVOTRANSFERRIN; SWP:P02789; PDB:1IEJA; KSVIRWCTISSPEEKKCNNLRDLTQQERISLTCVQKATYLDCIKAIANNEADAITLDGGQ -----------------------1111------------------1111----------- VFEAGLAPYKLKPIAAEVYEHTEGSTTSYYAVAVVKKGTEFTVNDLQGKTSCHTGLGRSA -----------------------------------2222--11112222-----222211 GWNIPIGTLLHRGAIEWEGIESGSVEQAVAKFFSASCVPGATIEQKLCRQCKGDPKTKCA 11----------------3333---------------2222--33331111--3333--1 RNAPYSGYSGAFHCLKDGKGDVAFVKHTTVNENAPDQKDEYELLCLDGSRQPVDNYKTCN 111----------------------1111333333331111---1111---11111111- WARVAAHAVVARDDNKVEDIWSFLSKAQSDFGVDTKSDFHLFGPPGKKDPVLKDLLFKDS 
--------------3333-------------1111-------------1111-----111 AIMLKRVPSLMDSQLYLGFEYYSAIQSMR 1------1111-------------3333- >INTERFERON REGULATORY FAC; SWP:P15314; PDB:1IF1A; RMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFRSWAIHTGRYK --------3333----------------------------1111---3333--------- AGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRM --------------------------------------------- >INTESTINAL FATTY ACID BIN; SWP:P02693; PDB:1IFC; AFDGTWKVDRNENYEKFMEKMGINVVKRKLGAHDNLKLTITQEGNKFTVKESSNFRNIDV ------------------3333--------------------!!!!-------------- VFELGVDFAYSLADGTELTGTWTMEGNKLVGKFKRVDNGKELIAVREISGNELIQTYTYE --2222-----3333---------!!!!------------------------------ii GVEAKRIFKKE ii--------- >INOVIRUS; SWP:P03619; PDB:1IFK; ADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRAS -----------------------------------------------3333 ----------------------------------------------------- >MAJOR COAT PROTEIN ASSEMB; SWP:P03623; PDB:1IFP; MQSVITDVTGQLTAVQADITTIGGAIIVLAAVVLGIRWIKAQFF 3333---------------------------1111-----3333 >VESICLE TRAFFICKING PROTE; SWP:O08547; PDB:1IFQA; SVLLTIARVADGLPLAASQEDEQSGRDLQQYQSQAKQLFRKLNEQSPTRCTLEAGATFHY -----------------------3333----------3333-1111-------------- IIEQGVCYLVLCEAAFPKKLAFAYLEDLHSEFDEQHGKKVPTVSRPYSFIEFDTFIQKTK --%%%%------33333333----------------1111--------3333-------- KLYI ---- >LAMIN A/C; SWP:P02545; PDB:1IFRA; GSHRTSGRVAVEEVDEEGKFVRLRNKSNEDQSMGNWQIKRQNGDDPLLTYRFPPKFTLKA --------------1111--------------2222-----!!!!-------------22 GQVVTIWAAGAGATHSPPTDLVWKAQNTWGCGNSLRTALINSTGEEVAMRKLV 22-----1111-----------------------------1111--------- >PROTEIN LLR18B; SWP:P52779; PDB:1IFVA; GVFAFEDEHPSAVAQAKLFKALTKDSDDIIPKVIEQIQSVEIVEGNGGPGTVKKITASHG ------------------------33333333---------------2222--------- GHTSYVLHKIDAIDEASFEYNYSIVGGTGLDESLEKITFESKLLSGPDGGSIGKIKVKFH --------------1111--------11111111------------%%%%---------- TKGDVLSDAVREEAKARGTGLFKAVEGYVLANPNY ----------------------------------- >POLYADENYLATE-BINDING PRO; 
SWP:P04147; PDB:1IFWA; GPLGSPRNANDNNQFYQQKQRQALGEQLYKKVSAKTSNEEAAGKITGMILDLPPQEVFPL ----------------3333------------1111-3333------------3333--- LESDELFEQHYKEASAAYESFKKEQEQQTEQA ------------------------1111---- >THIAMIN PYROPHOSPHOKINASE; SWP:P35202; PDB:1IG0A; EECIENPERIKIGTDLINIRNKMNLKELIHPNEDENSTLLILNQKIDIPRPLFYKIWKLH -------------2222------3333--------------------------------- DLKVCADGAANRLYDYLDDDETLRIKYLPNYIIGDLDSLSEKVYKYYRKNKVTIIKQTTQ -------------------33331111--------3333--------1111--------- YSTDFTKCVNLISLHFNSPEFRSLISNKDNLQSNHGIELEKGIHTLYNTMTESLVFSKVT --------------------------------%%%%----3333------11113333-- PISLLALGGIGGRFDQTVHSITQLYTLSENASYFKLCYMTPTDLIFLIKKNGTLIEYDPQ ------------3333--------------3333-------------------------- FRNTCIGNCGLLPIGEATLVKETRGLKWDVKNWPTSVVTGRVSSSNRFVGDNCCFIDTKD ------------2222--------------------1111-------------------- DIILNVEIFVDKLIDFL --------33333333- >THIAMIN PYROPHOSPHOKINASE; SWP:NA; PDB:1IG3A; HSSGLVPRGSHMEHAFTPLEPLLPTGNLKYCLVVLNQPLDARFRHLWKKALLRACADGGA ----------------1111-------------------3333---1111-----!!!!- NHLYDLTEGERESFLPEFVSGDFDSIRPEVKEYYTKKGCDLISTPDQDHTDFTKCLQVLQ ------22221111--------1111---------------------------------- RKIEEKELQVDVIVTLGGLGGRFDQIMASVNTLFQATHITPVPIIIIQKDSLIYLLQPGK -----------------------------------1111--------!!!!--------- HRLHVDTGMEGSWCGLIPVGQPCNQVTTTGLKWNLTNDVLGFGTLVSTSNTYDGSGLVTV ----------------------------------------2222---------------- ETDHPLLWTMAIKS -------------- >VITAMIN D-DEPENDENT CALCI; SWP:P02633; PDB:1IG5A; KSPEELKGIFEKYAAKEGDPNQLSKEELKLLLQTEFPSLLKGPSTLDELFEELDKNGDGE ------------3333--1111-------------3333--------------1111--- VSFEEFQVLVKKISQ ----------3333- >MODULATOR RECOGNITION FAC; SWP:Q14865; PDB:1IG6A; RADEQAFLVALYKYMKERKTPIERIPYLGFKQINLWTMFQAAQKLGGYETITARRQWKHI ---------------1111-3333---------3333------------------3333- YDELGGNPGSTSAATCTRRHYERLILPYERFIKGEEDKPLPPIKPRK ---------3333--3333-----3333------------1111--- >HOMEOTIC PROTEIN MSX-1; SWP:P13297; PDB:1IG7A; 
RKPRTPFTTAQLLALERKFRQKQYLSIAERAEFSSSLSLTETQVKIWFQNRRAKAKRL ----------------------------------1111-------------------- >HEXOKINASE PII; SWP:P04807; PDB:1IG8A; DVPKELMQQIENFEKIFTVPTETLQAVTKHFISELEKGLSKKGGNIPMIPGWVMDFPTGK -------------------3333----------------3333----------------- ESGDFLAIDLGGTNLRVVLVKLGGDRTFDTTQSKYRLPDAMRTTQNPDELWEFIADSLKA -------------------------------------3333----3333----------- FIDEQFPQGISEPIPLGFTFSFPASQNKINEGILQRWTKGFDIPNIENHDVVPMLQKQIT -----1111----------------------------iiii----2222----------1 KRNIPIEVVALINDTTGTLVASYYTDPETKMGVIFGTGVNGAYYDVCSDIEKLQGKLSDD 111------------------------------------------33331111----111 IPPSAPMAINCEYGSFDNEHVVLPRTKYDITIDEESPRPGQQTFEKMSSGYYLGEILRLA 11111-----------1111-----------------2222-------3333-------- LMDMYKQGFIFKNQDLSKFDKPFVMDTSYPARIEEDPFENLEDTDDLFQNEFGINTTVQE ----1111--2222-1111------3333--------1111------------------- RKLIRRLSELIGARAARLSVCGIAAICQKRGYKTGHIAADGSVYNRYPGFKEKAANALKD ----------------------------------------------2222---------- IYGWTQTSLDDYPIKIVPAEDGSGAGAAVIAALAQKRIAEGKSVGIIGA -------3333--------------------------1111-------- >Ig heavy chain V region M; SWP:P01783; PDB:1IGCH; DVQLVESGGGLVQPGGSRKLSCAASGFTFSSFGMHWVRQAPEKGLEWVAYISSGSSTLHY ------------2222-----------------------------------1111----- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTGMYYCARWGNYPYYAMDYWGQGTSVTVSS 3333----------------------1111------------------------------ AKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ----------------------------------------%%%%---------------- LYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC -------------------------3333------------- >IGG1-KAPPA B13I2 FAB (HEA; SWP:NA; PDB:1IGFH; EVQLVESGGDLVKPGGSLKLSCAASGFTFSRCAMSWVRQTPEKRLEWVAGISSGGSYTFY ---------------------------3333----------------------------- PDTVKGRFIISRNNARNTLSLQMSSL 3333---------------------- >IGG2A-KAPPA 26-10 FAB (HE; SWP:NA; PDB:1IGJB; VQLQQSGPELVKPGASVRMSCKSSGYIFTDFYMNWVRQSHGKSLDYIGYISPYSGVTGYN 
-----------2222--------------------------------------------- QKFKGKATLTVDKSSSTAYMELRSLTSEDSAVYYCAGSSGNKWAMDYWGHGASVTVSSAK ------------1111---------3333------------------------------- TTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLY ---------------------------------------iiii----------------- TLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEP -----------------------3333---------- >INSULIN-LIKE GROWTH FACTO; SWP:P01344; PDB:1IGL; AYRPSETLCGGELVDTLQFVCGDRGFYFSRPASRVSRRSRGIVEECCFRSCDLALLETYC ----------3333------------------------------------------3333 ATPAKSE ------- >IGM-KAPPA POT FV (HEAVY C; SWP:NA; PDB:1IGMH; EVHLLESGGNLVQPGGSLRLSCAASGFTFNIFVMSWVRQAPGKGLEWVSGVFGSGGNTDY ------------2222-----------1111----------------------------- ADAVKGRFTITRDNSKNTLYLQMNSLRAEDTAIYYCAKHRVSYVLTGFDSWGQGTLVTVS -3333---------------------3333------------------------------ SGSASAPTL --------- >DNA rearranged by a t(2; SWP:Q6LBV5; PDB:1IGML; DIQMTQSPSSLSASVGDRVTITCQASQDISNYLAWYQQKPGKAPELRIYDASNLETGVPS ------------------------------------------------------2222-- RFSGSGSGTDFTFTISSLQPEDIATYYCQQYQNLPLTFGPGTKVDIKRTVAAPSV ------------------1111--------------------------------- >RAP1; SWP:P11938; PDB:1IGNA; KASFTDEEDEFILDVVRKNPTRRTTHTLYDEISHYVPNHTGNSIRHRFRVYLSKRLEYVY ------------------3333---------33333333------------3333----- EVDKFGKLVRDDDGNLIKTKVLPPSIKRKFSADEDYTLAIAVKKQFYRDLFQIDPDTGRS --1111----1111---------------------------------------------- LIRTQSRRGPIAREFFKHFAEEHAAHTENAWRDRFRKFLLAYGIDDYISYYEAEEPMKNL -----------2222--------------------------------------------- TPTPGNYNS --------- >FAMILY 11 XYLANASE; SWP:Q7SID8; PDB:1IGOA; ATTITSNQTGTHDGYDYELWKDSGNTSMTLNSGGAFSAQWSNIGNALFRKGKKFDSTKTH -----------iiii----------------!!!!-----------------------33 SQLGNISINYNATFNPGGNSYLCVYGWTKDPLTEYYIVDNWGTYRPTGTPKGTFTVDGGT 33-----------------------------------------------------iiii- YDIYETTRINQPSIIGIATFKQYWSVRQTKRTSGTVSVSEHFKKWESLGMPMGKMYETAL ------------1111--------------------3333-----1111----------- 
TVEGYQSNGSANVTANVLTIGGKPL -------------------iiii-- >TRANSCRIPTIONAL REPRESSOR; SWP:P07674; PDB:1IGQA; KKAIVQVEHDERPARLILNRRPPAEGYAWLKYEDDGQEFEANLADVKLVALIEG --------%%%%----1111---2222--------------3333--------- >INSULIN-LIKE GROWTH FACTO; SWP:P08069; PDB:1IGRA; EICGPGIDIRNDYQQLKRLENCTVIEGYLHILLISKAEDYRSYRFPKLTVITEYLLLFRV -----------33333333-------------------------3333------------ AGLESLGDLFPNLTVIRGWKLFYNYALVIFEMTNLKDIGLYNLRNITRGAIRIEKNADLC ----3333-----------------------2222--------------------1111- YLSTVDWSLILDAVSNNYIVGNKPPKECGDLCPGTMEEKPMCEKTTINNEYNYRCWTTNR -11113333---3333-------3333----2222-----------%%%%------1111 CQKMCPSTCGKRACTENNECCHPECLGSCSAPDNDTACVACRHYYYAGVCVPACPPNTYR -----3333------------1111--------------------iiii-----2222-- FEGWRCVDRDFCANILSAESSDSEGFVIHDGECMQECPSGFIRNGSQSMYCIPCEGPCPK %%%%-----------1111---------%%%%-----1111---2222------------ VCEEEKKTKTIDSVTSAQMLQGCTIFKGNLLINIRRGNNIASELENFMGLIEVVTGYVKI -------------33333333--------------------3333--1111--------- RHSHALVSLSFLKNLRLILGEEQLEGNYSFYVLDNQNLQQLWDWDHRNLTIKAGKMYFAF --3333--3333----------------------1111---------------------- NPKLCVSEIYRMEEVTGTKGRQSKGDINTRNNGERASCEKEQKLISEEDLN 1111-----------------------------------------3333-- >IGG2A INTACT ANTIBODY - M; SWP:A0A5D8; PDB:1IGTA; DIVLTQSPSSLSASLGDTITITCHASQNINVWLSWYQQKPGNIPKLLIYKASNLHTGVPS ----------------------------------------------------------33 RFSGSGSGTGFTLTISSLQPEDIATYYCQQGQSYPLTFGGGTKLEIKRADAAPTVSIFPP 33---------------------------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 33333333---------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ----3333-------------------------- >Igh protein; SWP:Q6PIP8; PDB:1IGTB; EVKLQESGGGLVQPGGSLKLSCATSGFTFSDYYMYWVRQTPEKRLEWVAYISNGGGSTYY ---------------------------1111--------1111----------------- PDTVKGRFTISRDNAKNTLYLQMSRLSK ---------------------------- >ISOCITRATE LYASE; SWP:P05313; PDB:1IGWA; 
KTRTQQIEELQKEWTQPRWEGITRPYSAEDVVKLRGSVNPECTLAQLGAAKMWRLLHGES ---------------3333------------1111------------------------- KKGYINSLGALTGGQALQQAKAGIEAVYLSGWQVAADANLAASMYPDQSLYPANSVPAVV --------------------------------------1111---------1111----- ERINNTFRRADQIQWSAGIEPGDPRYVDYFLPIVADAEAGFGGVLNAFELMKAMIEAGAA --------------1111-2222-------------!!!!-------------------- AVHFEDQLASVKKCGKVLVPTQEAIQKLVAARLCADVTGVPTLLVARTDADAADLITSDC -------1111--------3333-------------------------3333-------- DPYDSEFITGERTSEGFFRTHAGIEQAISRGLAYAPYADLVWCETSTPDLELARRFAQAI 33331111----1111-----------------3333----------------------- HAKYPGKLLAYNCSFQQQLSDMGYKFQFITLAGIHSMWFNMFDLANAYAQGEGMKHYVEK ---2222----------------------------------------3333--------- VQQPEFAAAKDGYTFVSHQQEVGTGYFDKVTTIIQG -------3333--3333------------------- >DNA POLYMERASE; SWP:Q38087; PDB:1IH7A; MKEFYLTVEQIGDSIFERYIDSNGRERTREVEYKPSLFAHCPESQATKYFDIYGKPCTRK ----------!!!!------1111-----------------3333-----1111------ LFANMRDASQWIKRMEDIGLEALGMDDFKLAYLSDTYNYEIKYDHTKIRVANFDIEVTSP -------------------------------------------3333-----------11 DGFPEPSQAKHPIDAITHYDSIDDRFYVFDLLNSPYGNVEEWSIEIAAKLQEQGGDEVPS 11--3333------------1111---------1111-----3333---3333-----33 EIIDKIIYMPFDNEKELLMEYLNFWQQKTPVILTGWNVESFDIPYVYNRIKNIFGESTAK 331111------------------------------3333--------------3333-- RLSPHRKTRVKVIENMYGSREIITLFGISVLDYIDLYKKFSFTNQPSYSLDYISEFELNV --1111------------------2222-------------------------------- GKLKYDGPISKLRESNHQRYISYNIIDVYRVLQIDAKRQFINLSLDMGYYAKIQIQSVFS -------1111------------------------------------------3333--- PIKTWDAIIFNSLKEQNKVIPQGRSHPVQPYPGAFVKEPIPNRYKYVMSFDLTSLYPSII 3333-------------------------------------------------------- RQVNISPETIAGTFKVAPLHDYINAVAERPSDVYSCSPNGMMYYKDRDGVVPTEITKVFN -----1111--------3333---------------1111-------------------- QRKEHKGYMLAAQRNGEIIKEALHNPNLSVDEPLDVDYRFDFSDEIKEKIKKLSAKSLNE -------------------------------------------------1111------- 
MLFRAQRTEVAGMTAQINRKLLINSLYGALGNVWFRYYDLRNATAITTFGQMALQWIERK -------------------------------1111------------------------- VNEYLNEVCGTEGEAFVLYGDTDSIYVSADKIIDKVGESKFRDTNHWVDFLDKFARERME ------1111--------------------------3333--3333-------------- PAIDRGFREMCEYMNNKQHLMFMDREAIAGPPLGSKGIGGFWTGKKRYALNVWDMEGTRY -------------------------------2222--------2222-------iiii-- AEPKLKIMGLETQKSSTPKAVQKALKECIRRMLQEGEESLQEYFKEFEKEFRQLNYISIA -------------1111--------------------------------1111-3333-- SVSSANNIAKYDVGGFPGPKCPFHIRGILTYNRAIPQVVEGEKVYVLPLREGNPFGDKCI ------3333--iiii----------------------2222-------2222------- AWPSGTEITDLIKDDVLHWMDYTVLLEKTFIKPLEGFTSAAKLDYEKKASLFDMFDF --------3333----1111-----------------------------1111---- >CYCLIN-DEPENDENT KINASE 6; SWP:P42773; PDB:1IHBA; WGNELASAAARGDLEQLTSLLQNNVNVNAQNGFGRTALQVMKLGNPEIARRLLLRGANPD -------------------3333--1111-1111-3333-------------1111---- LKDRTGFAVIHDAARAGFLDTLQTLLEFQADVNIEDNEGNLPLHLAAKEGHLRVVEFLVK -------3333------------------------1111-3333---------------- HTASNVGHRNHKGDTACDLARLYGRNEVVSLMQANG ----1111-1111-------1111-------3333- >CYCLOPHILIN 40; SWP:P26882; PDB:1IHGA; SHPSPQAKPSNPSNPRVFFDVDIGGERVGRIVLELFADIVPKTAENFRALCTGEKGIGPT ----------1111--------iiii--------------------------1111---- TGKPLHFKGCPFHRIIKKFMIQGGDFSNQNGTGGESIYGEKFEDENFHYKHDKEGLLSMA ------2222-------------------------1111--------------------- NAGSNTNGSQFFITTVPTPHLDGKHVVFGQVIKGMGVAKILENVEVKGEKPAKLCVIAEC -----------------3333------------3333---1111---------------- GELKEGDDWGIFPKDGSGDSHPDFPEDADVDLKDVDKILLISEDLKNIGNTFFKSQNWEM ---2222----------------3333---1111------------------1111---- AIKKYTKVLRYVEGSRAAAEDADGAKLQPVALSCVLNIGACKLKMSDWQGAVDSCLEALE --------------------3333------------------1111-----------333 IDPSNTKALYRRAQGWQGLKEYDQALADLKKAQEIAPEDKAIQAELLKVKQKIKAQKDKE 31111-----------1111---------------1111--------------------- KAAY ---- >INAD; SWP:P13217; PDB:1IHJA; GELIHMVTLDKTGKKSFGICIVRGEVKDSPNTKTTGIFIKGIVPDSPAHLCGRLKVGDRI 
----------2222----------------------------2222--------2222-- LSLNGKDVRNSTEQAVIDLIKEADFKIELEIQTF --iiii-1111--------1111----------- >GLIA-ACTIVATING FACTOR; SWP:P31371; PDB:1IHKA; TDLDHLKGILRRRQLYCRTGFHLEIFPNGTIQGTRKDHSRFGILEFISIAVGLVSIRGVD --------1111----1111-----1111---------1111-------2222------- SGLYLGMNEKGELYGSEKLTQECVFREQFEENWYNTYSSNLYKHVDTGRRYYVALNKDGT -------1111-------------------%%%%---------------------1111- PREGTRTKRHQKFTHFLPRPVDPDKVPELYKDILSQS --1111-1111----------111111111111---- >CAPSID PROTEIN; SWP:Q83884; PDB:1IHMA; DPLAMDPVAGSSTAVATAGQVNPIDPWIINNFVQAPQGEFTISPNNTPGDVLFDLSLGPH ---------3333-----------3333--------------3333-----------111 LNPFLLHLSQMYNGWVGNMRVRIMLAGNAFTAGKIIVSCIPPGFGSHNLTIAQATLFPHV 1------------------------------------------------33331111--- IADVRTLDPIEVPLEDVRNVLFHNNDRNQQTMRLVCMLYTPLRTGGGTGDSFVVAGRVMT --3333------------------------------------------1111-------- CPSPDFNFLFLVPPTVEQKTRPFTLPNLPLSSLSNSRAPLPISSMGISPDNVQSVQFQNG --3333------2222-1111-------3333----------------3333-------- RCTLDGRLVGTTPVSLSHVAKIRGTSNGTVINLTELDGTPFHPFEGPAPIGFPDLGGCDW --3333--------3333----------------3333---3333---2222-------- HINMTQFGHSSQTQYDVDTTPDTFVPHLGSIQANGIGSGNYVGVLSWISPPSHPSGSQVD --------------------11113333-----------------------3333----- LWKIPNYGSSITEATHLAPSVYPPGFGEVLVFFMSKMPGPGAYNLPCLLPQEYISHLASE ---------3333------------------------------------3333------- QAPTVGEAALLHYVDPDTGRNLGEFKAYPDGFLTCVPNGASSGPQQLPINGVFVFVSWVS ---------------------------3333-------11113333-------------- RFYQLKPVGTAS ------------ >HYPOTHETICAL PROTEIN MTH9; SWP:NA; PDB:1IHNA; SHFSDCRFGSVTYRGREYRSDIVVHVDGSVTPRRKEISRRKYGTSHVAEEELEELLEEKP ------2222--iiii--------1111-----------------------33331111- ESIIIGSGVHGALETGFRSDATVLPTCEAIKRYNEERSAGRRVAAIIHVTC -------2222-------------3333--------1111----------- >PANTOATE--BETA-ALANINE LI; SWP:P31663; PDB:1IHOA; MLIIETLPLLRQQIRRLRMEGKRVALVPTMGNLHDGHMKLVDEAKARADVVVVSIFVNPM -----------------1111------------------------------------333 
QFDRPEDLARYPRTLQEDCEKLNKRKVDLVFAPSVKEIYPNGTETHTYVDVPGLSTMLEG 3------1111-----------1111-------3333-11111111------1111-333 ASRPGHFRGVSTIVSKLFNLVQPDIACFGEKDFQQLALIRKMVADMGFDIEIVGVPIMRA 3-2222----------------------3333---------------------------1 KDGLALSSRNGYLTAEQRKIAPGLYKVLSSIADKLQAGERDLDEIITIAGQELNEKGFRA 111---3333------------------------1111---------------3333--- DDIQIRDADTLLEVSETSKRAVILVAAWLGDARLIDNKMVEL --------------1111----------!!!!---------- >PHYTASE; SWP:P34752; PDB:1IHP; SCDTVDQGYQCFSETSHLWGQYAPFFSLANESVISPEVPAGCRVTFAQVLSRHGARYPTD -----------3333---!!!!-----1111-------2222----------------33 SKGKKYSALIEEIQQNATTFDGKYAFLKTYNYSLGADDLTPFGEQELVNSGIKFYQRYES 33------------------!!!!-3333----------------------------333 LTRNIVPFIRSSGSSRVIASGKKFIEGFQSTKLKDPRAQPGQSSPKIDVVISEASSSNNT 3------------3333-----------------11112222-----------1111-33 LDPGTCTVFEDSELADTVEANFTATFVPSIRQRLENDLSGVTLTDTEVTYLMDMCSFDTI 33------1111---------3333------------2222--3333--------3333- STTKLSPFCDLFTHDEWINYDYLQSLKKYYGHGAGNPLGPTQGVGYANELIARLTHSPVH -----3333-----------------------3333--3333------------------ DDTSSNHTLDSSPATFPLNSTLYADFSHDNGIISILFALGLYNGTKPLSTTTVENITQTD -----3333--3333---------------------1111-1111---------3333ii GFSSAWTVPFASRLYVEMMQCQAEQEPLVRVLVNDRVVPLHGCPVDALGRCTRDSFVRGL ii3333--2222--------3333--------iiii---------1111----------- SFARSGGDWAECFA -------3333--- -------------------------------------- >ARSENICAL PUMP-DRIVING AT; SWP:P08690; PDB:1IHUA; MQFLQNIPPYLFFTGKGGVGKTSISCATAIRLAEQGKRVLLVSTDPASNVGQVFSQTIGN 3333-----------2222-------------1111------------3333-------- TIQAIASVPGLSALEIDPQAAAQQYRARIVDPIKGVLPDDVVSSINEQLSGACTTEIAAF ----1111---------------------3333-------------1111---------- DEFTGLLTDASLLTRFDHIIFDTAPTGHTIRLLQLPGAWSSFIASCLGPMAGLEKQREQY --------3333----------------------------------3333-----3333- AYAVEALSDPKRTRLVLVARLQKSTLQEVARTHLELAAIGLKNQYLVINGVLPKTEAAND --------1111------------------------1111------------3333---- 
TLAAAIWEREQEALANLPADLAGLPTDTLFLQPVNMVGVSALSRLLSTQPQRPDIPSLSA 3333-------------3333---------------------3333----------3333 LVDDIARNEHGLIMLMGKGGVGKTTMAAAIAVRLADMGFDVHLTTSDPANNLQVSRIDPH ---3333----------2222-------------1111---------------------- EETERYRQHVLETKGKELDEAGKRLLEEDLRSPCTEEIAVFQAFSRVIREAGKRFVVMDT --------------2222------------------------------------------ APTGHTLLLLDATTPMMLLQDPERTKVLLVTLPETTPVLEAANLQADLERAGIHPWGWII ------------------------------------------------1111-------- NNSLSIADTRSPLLRMRAQQELPQIESVKRQHASRVALVPVLASEPTGIDKLKQLAGHHH ---1111--------------------------------------------3333----- >HIV-1 INTEGRASE; SWP:P04586; PDB:1IHVA; MIQNFRVYYRDSRDPVWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRD ----------1111----------------------------3333------ >STAPHYLOCOCCAL NUCLEASE; SWP:P00644; PDB:1IHZA; KLHKEPATLIKAIDGDTLKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEASAFTKKM ---------------------iiii---------------------2222---------- LENAKKLEVEFDKGQRTDKYGRGLAYLYADGKMLNEALVRQGLAKVAYVYKPNNTHEQHL 1111-------------1111-------iiii---------------------1111--- RKSEAQAKKEKLNIWS -------1111!!!!- >PHOSPHOENOLPYRUVATE CARBO; SWP:P51058; PDB:1II2A; PPTIHRNLLSPELVQWALKIEKDSRLTARGALAVMSYAKTGRSPLDKRIVDTDDVRENVD --------------------1111--1111------------3333-----33331111- WGKVNMKLSEESFARVRKIAKEFLDTREHLFVVDCFAGHDERYRLKVRVFTTRPYHALFM -----------------------1111------------1111---------3333---- RDMLIVPTPEELATFGEPDYVIYNAGECKADPSIPGLTSTTCVALNFKTREQVILGTEYA -----------1111---------1111--1111------------------------33 GEMKKGILTVMFELMPQMNHLCMHASANVGKQGDVTVFFGLSGTGKTTLSADPHRNLIGD 33-------------1111----------1111-------22223333---1111----- DEHVWTDRGVFNIEGGCYAKAIGLNPKTEKDIYDAVRFGAVAENCVLDKRTGEIDFYDES -----1111-----------2222--------11112222--------------111133 ICKNTRVAYPLSHIEGALSKAIAGHPKNVIFLTNDAFGVMPPVARLTSAQAMFWFVMGYT 33-------33332222-----------------1111---------------------- ANVPGVEAGGTRTARPIFSSCFGGPFLVRHATFYGEQLAEKMQKHNSRVWLLNTGYAGGR ----------------------3333---3333--------------------------1 
ADRGAKRMPLRVTRAIIDAIHDGTLDRTEYEEYPGWGLHIPKYVAKVPEHLLNPRKAWKD 111--------------------1111----------------22223333-3333---- VRQFNETSKELVAMFQESFSARFAAKASQEMKSAVPRYVEFA -----------------------111133331111------- >MRE11 NUCLEASE; SWP:Q8U1N9; PDB:1II7A; MKFAHLADIHLGYEQFHKPQREEEFAEAFKNALEIAVQENVDFILIAGDLFHSSRPSPGT ---------22222222------------------------------------------- LKKAIALLQIPKEHSIPVFAIEGNHDRTQRGPSVLNLLEDFGLVYVIGMRKEKVENEYLT ----------------------3333---------------------------------- SERLGNGEYLVKGVYKDLEIHGMKYMSSAWFEANKEILKRLFRPTDNAILMLHQGVREVS ---1111-------!!!!--------3333---22223333--------------3333- EARGEDYFEIGLGDLPEGYLYYALGHIHKRYETSYSGSPVVYPGSLERWDFGDYEVRYEW ----------3333--------------------iiii-----------3333------- DGIKFKERYGVNKGFYIVEDFKPRFVEIKVRPFIDVKIKGSEEEIRKAIKRLIPLIPKNA ------------------%%%%-----------------------------3333-1111 YVRLNIGWRKPFDLTEIKELLNVEYLKIDTWRI --------------------------------- >ENZYME IIB OF THE CELLOBI; SWP:P17409; PDB:1IIBA; KKHIYLFSSAGMSTSLLVSKMRAQAEKYEVPVIIEAFPETLAGEKGQNADVVLLGPQIAY -------------------------------------3333---3333------333311 MLPEIQRLLPNKPVEVIDSLLYGKVDGLGVLKAAVAAIKKAAA 11------1111--------------------------3333- >PEPTIDE N-MYRISTOYLTRANSF; SWP:P14743; PDB:1IICA; AMKDHKFWRTQPVKDFDEKVVEEGPIDKPKTPEDISDKPLPLLSSFEWCSIDVDNKKQLE -------1111---2222------------3333--------1111-----1111----- DVFVLLNENYVEDRDAGFRFNYTKEFFNWALKSPGWKKDWHIGVRVKETQKLVAFISAIP ------------1111----------------22223333------1111---------- VTLGVRGKQVPSVEINFLCVHKQLRSKRLTPVLIKEITRRVNKCDIWHALYTAGIVLPAP ----iiii------------1111-----------------1111--------------- VSTCRYTHRPLNWKKLYEVDFTGLPDGHTEEDMIAENALPAKTKTAGLRKLKKEDIDQVF ----------------1111----22223333------------2222---3333----- ELFKRYQSRFELIQIFTKEEFEHNFIGEESLPLDKQVIFSYVVEQPDGKITDFFSFYSLP ------3333---------------------1111---------1111------------ FTILNNTKYKDLGIGYLYYYATDADFQFKDRFDPKATKALKTRLCELIYDACILAKNANM ----------------------1111---1111----------------------1111- 
DVFNALTSQDNTLFLDDLKFGPGDGFLNFYLFNYRAKPITGGLNPDNSNDIKRRSNVGVV -------!!!!----1111------------------------1111------------- ML -- >HLA-DR ANTIGENS ASSOCIATE; SWP:P04233; PDB:1IIEA; YGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKVFESWMHHWLLFEM ----3333--------1111-------3333----------------------------- SRHSLEQKPTDAPPK --------------- >GLUCOSE-1-PHOSPHATE THYMI; SWP:P26393; PDB:1IINA; MKTRKGIILAGGSGTRLYPVTMAVSQQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQD -------------3333-1111--1111------3333------1111--------3333 TPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGHDDCALVLGDNIFYGHDLP -------!!!!1111----------------------3333-------1111---1111- KLMEAAVNKESGATVFAYHVNDPERYGVVEFDQKGTAVSLEEKPLQPKSNYAVTGLYFYD ---------------------3333------3333------------------------1 NSVVEMAKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFI 111---1111--1111--3333-----1111-------1111------------------ ATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMV ----1111----------1111----------3333------------- - >GLYCOSYLTRANSFERASE GTFB; SWP:P96559; PDB:1IIRA; MRVLLATCGSRGDTEPLVALAVRVRDLGADVRMCAPPDCAERLAEVGVPHVPVGPRAKPL ---------3333-----------1111-------1111----1111------------- TAEDVRRFTTEAIATQFDEIPAAAEGCAAVVTTGLLAAAIGVRSVAEKLGIPYFYAFHCP --------------------3333----------3333--------------------33 SYVPSPYYPPPPIDIPAQWERNNQSAYQRYGGLLNSHRDAIGLPPVEDIFTFGYTDHPWV 33------------------------------------1111------------------ AADPVLAPLQPTDLDAVQTGAWILPDERPLSPELAAFLDAGPPPVYLGFGAPADAVRVAI -------------------------------------1111----------3333----- DAIRAHGRRVILSRGWADLVLPDDGADCFAIGEVNHQVLFGRVAAVIHHGGAGTTHVAAR ---1111-----2222--------1111------33331111------------------ AGAPQILLPQMADQPYYAGRVAELGVGVAHDGPIPTFDSLSAALATALTPETHARATAVA ----------!!!!-------------------------------1111----------1 GTIRTDGAAVAARLLLDAVSRE 111---------------1111 >PLASMA RETINOL-BINDING PR; SWP:P41263; PDB:1IIUA; MDCRVSSFKVKENFDKNRYSGTWYAMAKKDPEGLFLQDNVVAQFTVDENGQMSATAKGRV ---1111-------3333----------------------------1111---------- 
RLFNNWDVCADMIGSFTDTEDPAKFKMKYWGVASFLQKGNDDHWVVDTDYDTYALHYSCR --------------------1111--------3333------------------------ ELNEDGTCADSYSFVFSRDPKGLPPEAQKIVRQRQIDLCLDRKYRVIVHNGFCS --1111------------1111-3333--------11112222-------1111 >LYSOZYME; SWP:Q7SID7; PDB:1IIZA; KRFTRCGLVNELRKQGFDENLMRDWVCLVENESARYTDKIANVNKNGSRDYGLFQINDKY -----------------1111-----------%%%%---------------1111----- WCSKGSTPGKDCNVTCSQLLTDDITVASTCAKKIYKRTKFDAWSGWDNHCNHSNPDISSC ---------1111-3333-------------------!!!!------------------- >PLASMODIAL SPECIFIC LAV1-; SWP:P14725; PDB:1IJ5A; EIFSQELTQREANVKKVHENLEELQKKLDHTSFAHDRLEAQIAQKEQEQKAKLAEYDQKV ---3333-------------------1111------------------------------ QNEFDARERAEREREAARGDAAAEKQRLASLLKDLEKPMLSEEDTNILRQLFLSSAVSGS ----------------------------------------3333------------2222 GKFSFQDLKQVLAKYADTIPEGPLKKLFVMVENDTKGRMSYITLVAVANDLAALVADFRK --------------3333----3333------------------------1111--3333 IDTNSNGTLSRKEFREHFVRLGFDKKSVQDALFRYADEDESDDVGFSEYVHLGLCLLVLR ------------------1111--3333--------1111-------------------- ILYAFADFDKSGQLSKEEVQKVLEDAHIPESARKKFEHQFSVVDVDDSKSLSYQEFVMLV ---1111---------------------11111111------------------------ LLMFH ----- >VON WILLEBRAND FACTOR; SWP:P04275; PDB:1IJBA; SEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMMERLRVSQKWVRVAVVEYH -----------------------------------------1111--1111--------- DGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKYTLFQIFSKIDRPEASRIAL -------1111----------1111--------------------------1111----- LLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHANLKQIRLIEKQAPENKAFVLSS -------33331111-------1111--------1111-----------3333------3 VDELEQQRDEIVSYLCDLAPEA 333-1111-------3333--- >PHOSPHOLIPASE A2; SWP:Q7SID6; PDB:1IJLA; SLIQFETLIMKVVKKSGMFWYSAYGCYCGWGGHGRPQDATDRCCFVHDCCYGKVTGCDPK ---------------3333------------------------------1111------- MDSYTYSEENGDIVCGGDDPCKREICECDRVAADCFRDNLDTYNSDTYWRYPRQDCEESP --------%%%%------3333----------------3333-3333----3333----- EPC --- >LOW-DENSITY LIPOPROTEIN R; SWP:P01130; PDB:1IJQA; 
IAYLFFTNRHEVRKMTLDRSEYTSLIPNLRNVVALDTEVASNRIYWSDLSQRMICSTQLY --------------------------------------1111------1111-------- DTVISRDIQAPDGLAVDWIHSNIYWTDSVLGTVSVADTKGVKRKTLFRENGSKPRAIVVD ------------------------------------1111--------2222-------- PVHGFMYWTDWGTPAKIKKGGLNGVDIYSLVTENIQWPNGITLDLLSGRLYWVDSKLHSI --------------------1111--------------------1111------------ SSIDVNGGNRKTILEDEKRLAHPFSLAVFEDKVFWTDIINEAIFSANRLTGSDVNLLAEN ---1111---------------------!!!!---------------------------- LLSPEDMVLFHNLTQPRGVNWCERTTLSNGGCQYLCLPAPQINPHSPKFTCACPDGMLLA ----------3333-----3333----%%%%-----------1111-------2222--1 RDMRSCLT 111----- >FIBROBLAST GROWTH FACTOR ; SWP:P08620; PDB:1IJTA; GIKRLRRLYCNVGIGFHLQALPDGRIGGAHADTRDSLLELSPVERGVVSIFGVASRFFVA -----------!!!!-----1111--------1111-------2222------------- MSSKGKLYGSPFFTDECTFKEILLPNNYNAYESYKYPGMFIALGKNGKTKKGNRVSPTMK -1111--------1111-------%%%%----3333-------1111---1111-11111 VTHFLPRL 111----- >SECRETED FRIZZLED-RELATED; SWP:P97401; PDB:1IJXA; AACEPVRIPLCKSLPWEMTKMPNHLHHSTQANAILAMEQFEGLLGTHCSPDLLFFLCAMY -------3333------------------------3333--3333---1111-------- APICTIDFQHEPIKPCKSVCERARQGCEPILIKYRHSWPESLACDELPVYDRGVCISPEA -------3333---------------------------33333333--1111-------- IVTAD ----- >FRIZZLED HOMOLOG 8; SWP:Q61091; PDB:1IJYA; ELACQEITVPLCKGIGYEYTYMPNQFNHDTQDEAGLEVHQFWPLVEIQCSPDLKFFLCSM --------3333-----------1111----------------------1111------- YTPICLEDYKKPLPPCRSVCERAKAGCAPLMRQYGFAWPDRMRCDRLPEQGNPDTLCMDY -------------------------------1111---33333333-----1111----- ER -- >PYRUVATE DEHYDROGENASE; SWP:NA; PDB:1IK6A; VAGVVMMANMAKAINMALHEEMERDERVVVLGELVTEGLYERFGPERVIDTPLNEGGILG ------------------------3333-----11113333------------------- FAMGMAMAGLKPVAEIQFVLGADELLNHIAKLRYKAPLVVRTPVGSPEAIFVHTPGLVVV -----1111-----------------------------------------1111------ MPSTPYNAKGLLKAAIRGDDPVVFLEPKILYRAPREEVPEGDYVVEIGKARVAREGDDVT --------------------------3333---------------2222----------- 
LVTYGAVVHKALEAAERVKASVEVVDLQTLNPLDFDTVLKSVSKTGRLIIAHDSPKTGGL ----3333-----1111--------------------------------------2222- GAEVRALVAEKALDRLTAPVIRLAGPDVPTVERIIKAIEYVMRY -----------3333----------------------------- >n/a; SWP:NA; PDB:1IKFH; EVKLVESGGGLVQPGGSLKLSCATSGFTFSDYYMYWVRQNSEKRLEWVAFISNGGGSAFY ------------2222-----------3333--------1111--------3333----- ADIVKGRFTISRDNAKNTLYLQMSRLKSEDTAMYYCTRHTLYDTLYGNYPVWFADWGQGT 3333--------3333----------3333-------------1111------------- LVTVSAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFP ----------------------------------------------%%%%---------- AVLQSDLYTLSSSVTVPSSSRPSETVTCNVAHPASSTKVDKKIVPRDC ----------------1111------------1111------------ >NF-kappa-B inhibitor alph; SWP:P25963; PDB:1IKND; DGDSFLHLAIIHEEKALTMEVIRLAFLNFQNNLQQTPLHLAVITNQPEIAEALLGAGCDP --------------3333------------1111-3333--3333-3333---------- ELRDFRGNTPLHLACEQGCLASVGVLTQSCTTPHLHSILKATNYNGHTCLHLASIHGYLG ---1111-3333------------------1111--3333-------------1111333 IVELLVSLGADVNAQEPCNGRTALHLAVDLQNPDLVSLLLKCGADVNRVTYQGYSPYQLT 3----1111-1111-------------------------1111------1111-3333-2 WGRPSTRIQQQLGQLTLENLQMLPESEDEESYDTES 222--------3333-1111---------------- >Ephrin-B2 [Precursor]; SWP:P52800; PDB:1IKOP; SIVLEPIYWNSSNSKFLPGQGLVLYPQIGDKLDIICPKVDSKTVGQYEYYKVYMVDKDQA ---------1111-------------2222------------------------------ DRCTIKKENTPLLNCARPDQDVKFTIKFQEFSPNLWGLEFQKNKDYYIISTSNGSLEGLD -----1111-------1111-------------1111-----------------3333-- NQEGGVCQTRAMKILMKVGQD --------------------- >EXOTOXIN A; SWP:P11439; PDB:1IKPA; EEAFDLWNECAKACVLDLKDGVRSSRMSVDPAIADTNGQGVLHYSMVLEGGNDALKLAID ----3333---------1111---------3333--------------2222------!! 
NALSITSDGLTIRLEGGVEPNKPVRYSYTRQARGSWSLNWLVPIGHEKPSNIKVFIHELN !!---------------------------------------------------------1 AGNQLSHMSPIYTIEMGDELLAKLARDATFFVRAHESNEMQPTLAISHAGVSVVMAQKRW 111--------------------------------------------------------3 SEWASGKVLCLLDQLDGVYNYLAQQRCNLDDTWEGKIYRVLAGNPAKHDLDIKPTVISHR 333--------3333--------------3333--------------------------- LHFPEGGSLAALTAHQACHLPLETFTRHRQPRGAEQLEQCGYPVQRLVALYLAARLSWNQ --1111--------------3333--------------------------------1111 VDQVIRNALASPGSGGDLGEAIREQPEQARLALTLAAAESERFVRQGTGNDEAGAANADV ------------------------------------------111111113333------ VSLTCPVAAGECAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHR -----1111-----1111----------3333----------1111-------------- QLEERGYVFVGYHGTFLEAAQSIVFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEP -----------------------------------3333-------33333333------ DARGRIRNGALLRVYVPRSSLPGFYRTSLTLAAPEAAGEVERLIGHPLPLRLDAITGPEE 1111------------33331111-----3333--------------------------- EGGRLETILGWPLAERTVVIPSAIPTDPRNVGGDLDPSSIPDKEQAISALPDYASQPGK ----------3333------------1111-----3333-33331111----------- >ESTRADIOL 17 BETA-DEHYDRO; SWP:P51659; PDB:1IKTA; LQSTFVFEEIGRRLKDIGPEVVKKVNAVFEWHITKGGNIGAKWTIDLKSGSGKVYQGPAK 3333--------------------------------------------!!!!-------- GAADTTIILSDEDFMEVVLGKLDPQKAFFSGRLKARGNIMLSQKLQMILKDYAKL ------------------------------------------------------- >MONOCLONAL ANTIBODY G3-51; SWP:Q52L64; PDB:1IL1A; QLQQSGAELVRSGASVKLSCATSDFNIKDYYIHWVRQRPEQGLEWIGWLDPENGDTESAP ----------2222-----------3333-----------------------------33 KFQGKATMTADTSSNTAYLQLSSLTSEASAVYYCNAISTTRDYYALDYWGQGTSVTVSSA 33---------1111---------3333-------------------------------- KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ---------------------------------------%%%%--2222-------%%%% YTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR ---------1111-----------3333----------- >CONSERVED HYPOTHETICAL PR; SWP:O26981; PDB:1ILOA; MMKIQIYGTGCANCQMLEKNAREAVKELGIDAEFEKIKEMDQILEAGLTALPGLAVDGEL 
--------------------------------------3333-3333--------iiii- KIMGRVASKEEIKKILS ----------------- >Interleukin-1 receptor an; SWP:P18510; PDB:1ILR1; SKMQAFRIWDVNQKTFYLRNNQLVAGYLQGPNVNLEEKIDVVPIEPHALFLGIHGGKMCL ---------1111-----%%%%-------1111--------------------%%%%--- SCVKSGDETRLQLEAVNITDLSENRKQDKRFAFIRSDSGPTTSFESAACPGWFLCTAMEA ----!!!!--------3333-11111111--------!!!!-------2222-------- DQPVSLTNMPDEGVMVTKFYFQEDE ------------------------- >Unique short US2 glycopro; SWP:P09713; PDB:1IM3D; PWFQIEDNRCYIDNGKLFARGSIVGNMSRFVFDPKADYGGVGENLYVHADDVEFVPGESL -----------------------------------------------3333---2222-- KWNVRNLDVMPIFETLALRLVLQGDVIWLRCVPEL ---------3333--------iiii---------- >DBH; SWP:NA; PDB:1IM4A; HIVIFVDFDYFFAQVEEVLNPQYKGKPLVVCVYSTSGAVATANYEARKLGVKAGMPIIKA -------------------1111--------------------3333----2222----- MQIAPSAIYVPMRKPIYEAFSNRIMNLLNKHADKIEVASIDEAYLDVTNKVEGNFENGIE ---1111-----3333------------1111----------------1111--3333-- LARKIKQEILEKEKITVTVGVAPNKILAKIIADKSKPNGLGVIRPTEVQDFLNELDIDEI -------------------------------------------3333----111133332 PGIGSVLARRLNELGIQKLRD 222--------1111--1111 >180aa long hypothetical P; SWP:O58727; PDB:1IM5A; PEEALIVVDMQRDFMPGGALPVPEGDKIIPKVNEYIRKFKEKGALIVATRDWHPENHISF ----------33332222---2222--------------1111----------------3 RERGGPWPRHCVQNTPGAEFVVDLPEDAVIISKATEPDKEAYSGFEGTDLAKILRGNGVK 333--------22221111-----1111-------1111---1111--------1111-- RVYICGVATEYCVRATALDALKHGFEVYLLRDAVKGIKPEDEERALEEMKSRGIKIVQF -------1111------------------1111----------------1111------ >YECO; SWP:P43985; PDB:1IM8A; FIFDENVAEVFPDIQRSVPGYSNIITAIGLAERFVTADSNVYDLGCSRGAATLSARRNIN ---------3333----2222------------------------!!!!------1111- QPNVKIIGIDNSQPVERCRQHIAAYHSEIPVEILCNDIRHVEIKNASVILNFTLQFLPPE ---------------------1111-----------3333------------3333-111 DRIALLTKIYEGLNPNGVLVLSEKFRFEDTKINHLLIDLHHQFKRANGYSELEVSQKRTA 1------------2222------------------------------33333333----- LENVRTDSIETHKVRLKNVGFSQVELWFQCFNFGSIAVK 
3333------------3333---------!!!!------ >Nuclear factor of activat; SWP:O94916; PDB:1IMHC; KKSPMLCGQYPVKSEGKELKIVVQPETQHRARYLTEGSRGSVKDRTQQGFPTVKLEGHNE ----1111-----%%%%---------------1111-------1111------------- PVVLQVFVGNDSGRVKPHGFYQACRVTGRNTTPCKEVDIEGTTVIEVGLDPSNNMTLAVD --------------------------------------iiii-------3333------- CVGILKLRNADVEARIGIAGSKKKSTRARLVFRVNIMRKDGSTLTLQTPSSPILCTQPAG ---------------33333333------------------------------------- VPEILKKSLHSCSVKGEEEVFLIGKNFLKGTKVIFQENVSDENSWKSEAEIDMELFHQNH ------------3333-----------------------3333----------------- LIVKVPPYHDQHITLPVSVGIYVVTNAGRSHDVQPFTYTPD ------------------------1111------------- >CCG1-INTERACTING FACTOR B; SWP:Q96IU4; PDB:1IMJA; AASVEQREGTIQVQGQALFFREALPGSGQARFSVLLLHGIRFSSETWQNLGTLHRLAQAG ------------iiii----------------------11113333-------------- YRAVAIDLPGLGHSKEAAAPAPIGELAPGSFLAAVVDALELGPPVVISPSLSGMYSLPFL -------2222--1111----------1111------------------------3333- TAPGSQLPGFVPVAPICTDKINAANYASVKTPALIVYGDQDPMGQTSFEHLKQLPNHRVL -2222-----------1111-3333------------1111------------------- IMKGAGHPCYLDKPEEWHTGLLDFLQGL -------3333----------------- >DNA LIGASE III; SWP:P49916; PDB:1IMOA; GSADETLCQTKVLLDIFTGVRLYLPPSTPDFSRLRRYFVAFDGDLVQEFDMTSATHVLGS --1111----------2222----1111---3333--3333------------------- RDKNPAAQQVSPEWIWACIRKRRLVAPC ---1111---3333-------------- >HYPOTHETICAL PROTEIN HI02; SWP:P71346; PDB:1IMUA; MTLNITSKQMDITPAIREHLEERLAKLGKWQTQLISPHFVLNKVPNGFSVEASIGTPLGN -------------------------------------------1111--------1111- LLASATSDDMYKAINEVEEKLERQLNKLQHKSESRRADERLKDSFEN ----------------------------------------------- >PIGMENT EPITHELIUM-DERIVE; SWP:P36955; PDB:1IMVA; TGALVEEEDPFFKVPVNKLAAAVSNFGYDLYRVRSSMSPTTNVLLSPLSVATALSALSLG --------3333--------------------------------------------3333 ADERTESIIHRALYYDLISSPDIHGTYKELLDTVTAPQKNLKSASRIVFEKKLRIKSSFV -------------1111------------------3333----------------3333- APLEKSYGTRPRVLTGNPRLDLQEINNWVQAQMKGKLARSTKEIPDEISILLLGVAHFKG 
----------------3333------------%%%%------------------------ QWVTKFDSRKTSLEDFYLDEERTVRVPMMSDPKAVLRYGLDSDLSCKIAQLPLTGSMSII ------1111--------1111-------------------3333--------------- FFLPLKVTQNLTLIEESLTSEFIHDIDRELKTVQAVLTVPKLKLSYEGEVTKSLQEMKLQ --------------11113333------------------------------------33 SLFDSPDFSKITGKPIKLTQVEHRAGFEWNEDGAGTTHLTFPLDYHLNQPFIFVLRDTDT 33-----1111------------------------------------------------- GALLFIGKILDPRGP ----------3333- >YAJQ PROTEIN; SWP:P44096; PDB:1IN0A; PSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQL ---------------------------1111----------1111--------------- IEILIGSCIKRGIEHSSLDIPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIKV -------------3333---------!!!!-----------------------1111--- QTQIQGEQVRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD ------------------------------------------ >HOLLIDAY JUNCTION DNA HEL; SWP:Q56313; PDB:1IN4A; QFLRPKSLDEFIGQENVKKKLSLALEAAKMRGEVLDHVLLAGPPGLGKTTLAHIIASELQ 1111--3333-------------------------------------------------- TNIHVTSGPVLVKQGDMAAILTSLERGDVLFIDEIHRLNKAVEELLYSAIEDFQIDIQPF ------3333----------11112222-----3333----------------------- TLVGATTRSGLLSSPLRSRFGIILELDFYTVKELKEIIKRAASLMDVEIEDAAAEMIAKR -------1111-33331111----------------------1111-------------- SRGTPRIAIRLTKRVRDMLTVVKADRINTDIVLKTMEVLNIDDEGLDEFDRKILKTIIEI %%%%-------------------------------------1111--------------- YRGGPVGLNALAASLGVEADTLSEVYEPYLLQAGFLARTPRGRIVTEKAYKHLKYEVP iiii----------------------------------1111---------------- >IGG1-LAMBDA CHA255 FAB (H; SWP:NA; PDB:1INDH; EVTLVESGGDSVKPGGSLKLSCAASGFTLSGETMSWVRQTPEKRLEWVATTLSGGGFTFY ------------2222-----------3333--------1111----------------- SASVKGRFTISRDNAQNNLYLQLNSLRSEDTALYFCASHRFVHWGHGTLVTVSAKTTPPS 1111---------1111---------3333--------%%%%------------------ VYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLESDLYTLSSS ---------------------------------------2222----------------- VTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR -------------------1111---------- >SPERMIDINE SYNTHASE; SWP:Q9WZC2; 
PDB:1INLA; RTLKELERELQPRQHLWYFEYYTGNNVGLFMKMNRVIYSGQSDIQRIDIFENPDLGVVFA --3333------------------------------------------------------ LDGITMTTEKDEFMYHEMLAHVPMFLHPNPKKVLIIGGGDGGTLREVLKHDSVEKAILCE %%%%---3333--------------------------3333--------3333------- VDGLVIEAARKYLKQTSCGFDDPRAEIVIANGAEYVRKFKNEFDVIIIDSLFTEEFYQAC -3333--------111133331111-----3333-1111--------------------- YDALKEDGVFSAETEDPFYDIGWFKLAYRRISKVFPITRVYLGFMTTYPSGMWSYTFASK ----1111---------------------------------------2222--------- GIDPIKDFDPEKVRKFNKELKYYNEEVHVASFALPNFVKKELGLM ----1111---------------------1111------------ >INOSITOL POLYPHOSPHATE 1-; SWP:P21327; PDB:1INP; MSDILQELLRVSEKAANIARACRQQETLFQLLIEEKKEGEKNKKFAVDFKTLADVLVQEV -------------------3333--3333------------------------------- IKENMENKFPGLGKKIFGEESNELTNDLGEKIIMRLGPTEEETVALLSKVLNGNKLASEA ----------3333-----------1111------------3333-----iiii------ LAKVVHQDVFFSDPALDSVEINIPQDILGIWVDPIDSTYQYIKGSADITPNQGIFPSGLQ ---1111----------------1111-----------------------iiii---333 CVTVLIGVYDIQTGVPLMGVINQPFVSQDLHTRRWKGQCYWGLSYLGTNIHSLLPPVSTR 3----------------------------------------------------------- SNSEAQSQGTQNPSSEGSCRFSVVISTSEKETIKGALSHVCGERIFRAAGAGYKSLCVIL ---------------------------1111----------------------------- GLADIYIFSEDTTFKWDSCAAHAILRAMGGGMVDLKECLERNPDTGLDLPQLVYHVGNEG -------------3333--------1111------------------1111--------- AAGVDQWANKGGLIAYRSEKQLETFLSRLLQHLAPVATHT -!!!!------------------3333------------- >TROPOMODULIN; SWP:Q9DEA6; PDB:1IO0A; NSTDVEETLKRIQNNDPDLEEVNLNNIMNIPVPTLKACAEALKTNTYVKKFSIVGTRSND ---------------1111----2222---3333------3333---------------- PVAFALAEMLKVNNTLKSLNVESNFISGSGILALVEALQSNTSLIELRIDNQSQPLGNNV --------1111------------------------------------------------ EMEIANMLEKNTTLLKFGYHFTQQGPRLRASNAMMNNNDLVRKRRL ------3333------------------------------------ >PHASE 1 FLAGELLIN; SWP:P06179; PDB:1IO1A; NIKGLTQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEI --------------------------------------------1111------------ 
TQRLNEIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDTLNVQ ---------------iiii---------------2222---------3333--------- QKYKVSDTAATVTGYADTTIALDNSTFKASATGLGGTDQKIDGDLKFDDTTGKYYAKVTV -----------------------1111---1111-------------------------2 TGGTGKDGYYEVSVDKTNGEVTLAGGATSPLTGGLPATATEDVKNVQVANADLTEAKAAL 222--------------------%%%%---2222-1111---------1111-------- TAAGVTGTASVVKMSYTDNNGKTIDGGLAVKVGDDYYSATQNKDGSISINTTKYTADDGT -----------------1111----------!!!!------1111--------------- SKTALNKLGGADGKTEVVSIGGKTYAASKAEGHNFKAQPDLAEAAATTTENPLQKIDAAL ---------1111------!!!!--33332222--------------------------- AQVDTLRSDLAAVQNRFNSAITNLGNTVNNLTSAR ----------------------------------- >RIBONUCLEASE HII; SWP:O74035; PDB:1IO2A; MKIAGIDEAGRGPVIGPMVIAAVVVDENSLPKLEELKVRDSKKLTPKRREKLFNEILGVL -------------------------3333-------33331111-----------3333- DDYVILELPPDVIGSREGTLNEFEVENFAKALNSLKVKPDVIYADAADVDEERFARELGE --------33331111---------------1111-----------------------11 RLNFEAEVVAKHKADDIFPVVSAASILAKVTRDRAVEKLKEEYGEIGSGYPSDPRTRAFL 11----------3333---------------------------------3333------- ENYYREHGEFPPIVRKGWKTLKKIAEKVESEKK --------------1111---------3333-- >CYTOCHROME P450 CYP119; SWP:Q55080; PDB:1IO7A; MYDWFSEMRKKDPVYYDGNIWQVFSYRYTKEVLNNFSKFSSDLTGYHERLEDLRNGKIRF --------------------------------------------3333----1111---- DIPTRYTMLTSDPPLHDELRSMSADIFSPQKLQTLETFIRETTRSLLDSIDPREDDIVKK -3333-3333------------1111--------------------11113333-3333- LAVPLPIIVISKILGLPIEDKEKFKEWSDLVAFRLGKPGEIFELGKKYLELIGYVKDHLN ----------------1111----------3333-------------------------- SGTEVVSRVVNSNLSDIEKLGYIILLLIAGNETTTNLISNSVIDFTRFNLWQRIREENLY --------1111---------------1111--------------1111-----1111-- LKAIEEALRYSPPVMRTVRKTKERVKLGDQTIEEGEYVRVWIASANRDEEVFHDGEKFIP --------------------------!!!!--2222-----------------1111-11 DRNPNPHLSFGSGIHLCLGAPLARLEARIAIEEFSKRFRHIEILDTEKVPNEVLNGYKRL 11-----1111-11111111-------------1111----------------------- VVRLKS ------ >ARCELIN-5A; SWP:Q42460; PDB:1IOAA; 
ATETSFNFPNFHTDDKLILQGNATISSKGQLQLTGVGSNELPRVDSLGRAFYSDPIQIKD -----------1111----------1111-------1111-------------------- SNNVASFNTNFTFIIRAKNQSISAYGLAFALVPVNSPPQKKQEFLGIFNTNNPEPNARTV ------------------3333----------1111----!!!!---------1111--- AVVFNTFKNRIDFDKNFIKPYVNENCDFHKYNGEKTDVQITYDSSNNDLRVFLHFTVSQV -----1111-----------------3333-------------1111------------- KCSVSATVHLEKEVDEWVSVGFSPTSGLTEDTTETHDVLSWSFSSKFR --------3333----------------3333---------------- >APOC-I; SWP:P02654; PDB:1IOJ; TPDVSSALDKLKEFGNTLEDKARELISRIKQSELSAKMREWFSETFQKVKEKLKIDS ------3333--------------------------------------3333----- >CHAPERONIN 60; SWP:Q9Z462; PDB:1IOKA; AAKEVKFNSDARDRMLKGVNILADAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIE --------------------------------------------------33333333-- LSDKFENMGAQMVREVASRTNDEAGDGTTTATVLAQAIVREGLKAVAAGMNPMDLKRGID --3333----------33331111------------------------------------ VATAKVVEAIKSAARPVNDSSEVAQVGTISANGESFIGQQIAEAMQRVGNEGVITVEENK ------------------3333-----1111----------------------------- GMETEVEVVEGMQFDRGYLSPYFVTNADKMIAELEDAYILLHEKKLSSLQPQKPLLIVAE -------------------3333------------------------------------- DVEIAAVKAPGFGDRRKAMLQDIAILTGGIDMLGRAKKVSINKDNTTIVDGAGEKAEIEA -----------!!!!--------------------------------------------- RVSQIRQQIEETTSDYDREKLQERVAKLAGGVAVIRVGGMTEIEVKERKDRVDDALNATR -----------------------3333-------------11113333----------11 AAVQEGIVVGGGVALVQGAKVLEGLSGANSDQDAGIAIIRRALEAPMRQIAENAGVDGAV 11-------%%%%--------1111---------------3333-------1111-3333 VAGKVRESSDKAFGFNAQTEEYGDMFKFGVIDPAKVVRTALEDAASVAGLLITTEAMIAE ---------1111----------3333--------------------------------- KP -- >CITRATE SYNTHASE; SWP:Q9LCX9; PDB:1IOMA; VARGLEGVLFTESRMCYIDGQQGKLYYYGIPIQELAEKSSFEETTFLLLHGRLPRRQELE -2222--------------1111---iiii3333-------------------------- EFSAALARRRALPAHLLESFKRYPVSAHPMSFLRTAVSEFGMLDPTEGDISREALYEKGL ------1111--------3333-11113333-------3333---1111----------- DLIAKFATIVAANKRLKEGKEPIPPREDLSHAANFLYMANGVEPSPEQARLMDAALILHA 
---------------1111------3333------------------------------- EHGFNASTFTAIAAFSTETDLYSAITAAVASLKGPRHGGANEAVMRMIQEIGTPERAREW --------------1111---------------1111-3333----------3333---- VREKLAKKERIMGMGHRVYKAFDPRAGVLEKLARLVAEKHGHSKEYQILKIVEEEAGKVL ----1111--2222--------1111---------------------------------- NPRGIYPNVDFYSGVVYSDLGFSLEFFTPIFAVARISGWVGHILEYQELDNRLLRPGAKY 1111---3333------1111-3333---------------------------------- VGELDVPYVPLEAR ---------3333- >PROBABLE CELL DIVISION IN; SWP:O58346; PDB:1IONA; TRIISIVSGKGGTGKTTVTANLSVALGEGRKVLAVDGDLTANLSLVLGVDDVNITLHDVL ----------------------------------------------------------11 AGDAKLEDAIYTQFENVYILPGAVDWEHVIKADPRKLPEVIKSLKGKYDFILIDCPAGLQ 11--3333-----2222-----------11113333-----1111--------------3 LRASALSGEEAILVTNPEISCLTDTKVGVLKKAGLAILGFILNRYGRSERDIPPEAAQDV 333---------------------------1111-------------1111--------- DVPLLAVIPEDPVIREGTLEGIPAVKYKPESKGAQAFIKLAEEVDKLAGIKAKI ----------------------3333-1111----------------------- >SF11-RNASE; SWP:Q7SID5; PDB:1IOOA; DFEYLQLVLTWPASFCYANHCERIAPNNFTIHGLWPDNVKTRLHNCKPKPTYSYFTGKML ------------------------------------------------------------ NDLDKHWMQLKFEQDYGRTEQPSWKYQYIKHGSCCQKRYNQNTYFGLALRLKDKFDLLRT --------1111------------------3333-------------------------- LQTHRIIPGSSYTFQDIFDAIKTVSQENPDIKCAEVTKGTPELYEIGICFTPNADSMFRC -1111-2222--------------------------2222----------1111------ PQSDTCDKTAKVLFRR ------1111------ >D-ALA\:D-ALA LIGASE; SWP:P07862; PDB:1IOW; MTDKIAVLLGGTSAEREVSLNSGAAVLAGLREGGIDAYPVDPKEVDVTQLKSMGFQKVFI ------------1111--------------1111------3333-11113333------- ALHGRGGEDGTLQGMLELMGLPYTGSGVMASALSMDKLRSKLLWQGAGLPVAPWVALTRA ---2222-------------------------------------1111------------ EFEKGLSDKQLAEISALGLPVIVKPSREGSSVGMSKVVAENALQDALRLAFQHDEEVLIE --------------3333----------%%%%------3333------1111-------- KWLSGPEFTVAILGEEILPSIRIQPSGTFYDYEAKFLSDETQYFCPAGLEASQEANLQAL ------------!!!!--------------3333---------------3333------- 
VLKAWTTLGCKGWGRIDVMLDSDGQFYLLEANTSPGMTSHSLVPMAARQAGMSFSQLVVR --------------------1111-------------1111------------------- ILELAD -1111- >BETACELLULIN; SWP:P35070; PDB:1IOXA; RKGHFSRCPKQYKHYCIKGRCRFVVAEQTPSCVCDEGYIGARCERVDLFY --------33331111-----------------------1111------- >BEM1 PROTEIN; SWP:P29366; PDB:1IP9A; GAMGSSTSGLKTTKIKFYYKDDIFALMLKGDTTYKELRSKIAPRIDTDNFKLQTKLFDGS ------------------!!!!------------------3333---------------- GEEIKTDSQVSNIIQAKLKISVHDI ------------------------- >RNA 2'-O-RIBOSE METHYLTRA; SWP:Q7SID4; PDB:1IPAA; MRITSTANPRIKELARLLERKHRDSQRRFLIEGAREIERALQAGIELEQALVWEGGLNPE ----1111---------------1111-------------1111--------1111---- EQQVYAALLALLEVSEAVLKKLSVRDNPAGLIALARMPERTLEEYRPSPDALILVAVGLE -------------------1111------------------------1111--------- KPGNLGAVLRSADAAGAEAVLVAGGVDLYSPQVIRNSTGVVFSLRTLAASESEVLDWIKQ --------------------------1111----1111--------------------11 HNLPLVATTPHAEALYWEANLRPPVAIAVGPEHEGLRAAWLEAAQTQVRIPMQGQADSLN 11------1111--3333------------1111--3333-------------------- VSVSAALLLYEALRQRLL ------------------ >BETA-CONGLYCININ, BETA CH; SWP:P25974; PDB:1IPJA; NNPFYLRSSNSFQTLFENQNGRIRLLQRFNKRSPQLENLRDYRIVQFQSKPNTILLPHHA -1111-3333-------1111------1111-33331111-------------------- DADFLLFVLSGRAILTLVNNDDRDSYNLHPGDAQRIPAGTTYYLVNPHDHQNLKIIKLAI ----------------------------2222----2222-------------------- PVNKPGRYDDFFLSSTQAQQSYLQGFSHNILETSFHSEFEEINRVLLGQQEGVIVELSAK ---2222--------1111-3333--3333-------3333------------------- SSSRKTISSEDEPFNLRSRNPIYSNNFGKFFEITPEKNPQLRDLDIFLSSVDINEGALLL -----1111-----1111------1111-----333333331111--------------- PHFNSKAIVILVINEGDANIELVGIKLEVQRYRAELSEDDVFVIPAAYPFVVNATSNLNF ------------------------------------2222----2222------------ LAFGINAENNQRNFLAGEKDNVVRQIERQVQELAFPGSAQDVERLLKKQRESYFVDA ------2222----------3333--3333-------1111---1111--------- >ARGINYL-TRNA SYNTHETASE; SWP:Q93RP5; PDB:1IQ0A; MLRRALEEAIAQALKEMGVPVRLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKD 
----------------------------------------1111-----3333----333 RLPLPEFVEEAVPVGGYLNFRLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHV 3---1111--------------------------------------------------33 GHLRNIALGDAIARILAYAGREVLVLNYIDDTGRQAAETLFALRHYGLTWDGKEKYDHFA 33---------------------------1111----------1111-------3333-- GRAYVRLHQDPEYERLQPAIEEVLHALERGELREEVNRILLAQMATMHALNARYDLLVWE -----------3333--------------------------------1111--------- SDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASPVIPGLEDPFFVLLRSNGTAT ------------------1111-------2222--------------------1111--- YYAKDIAFQFWKMGILEGLRFRPYENPYYPGLRTSAPEGEAYTPKAEETINVVDVRQSHP ----------1111--------------1111---------------------------- QALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGAVSVDEVLEEATRRARAIVEEKN -------------3333------------%%%%--------------------------1 PDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARAHSIL 111----------------------3333----------1111----------------- RKAGEWGAPDLSQATPYERALALDLLDFEEAVLEAAEERTPHVLAQYLLDLAASWNAYYN ---------3333---------3333---------------------------------- ARENGQPATPVLTAPEGLRELRLSLVQSLQRTLATGLDLLGIPAPEVM --%%%%----1111-----------------------1111------- >RALBP1-INTERACTING PROTEI; SWP:Q8NFH8; PDB:1IQ3A; GSLQDNSSYPDEPWRITEEQREYYVNQFRSLQPDPSSFISGSVAKNFFTKSKLSIPELSY ------------------------------------------------------------ IWELSDADCDGALTLPEFCAAFHLIVARKNGYPLPEGLPPTLQPEFIVTD -------------3333--------------------------------- >50S RIBOSOMAL PROTEIN L5; SWP:P08895; PDB:1IQ4A; MNRLKEKYLNEVVPALMSKFNYKSIMQVPKIEKIVINMGVGDAVQNPKALDSAVEELTLI -----------------------3333---------------1111-------------- AGQRPVVTRAKKSIAGFRLRQGMPIGAKVTLRGERMYEFLDKLISVSLPRARDFRGVSKK -------------------2222-----------------------3333---------- SFDGRGNYTLGIKEQLIFPEIDYDKVNKVRGMDIVIVTTANTDEEARELLALLGMPFQK -----------------11111111-------------------------1111----- >(R)-SPECIFIC ENOYL-COA HY; SWP:O32472; PDB:1IQ6A; AQSLEVGQKARLSKRFGAAEVAAFAALSEDFNPLHLDPAFAATTAFERPIVHGMLLASLF ----2222------------------------1111-3333--3333----3333----- 
SGLLGQQLPGKGSIYLGQSLSFKLPVFVGDEVTAEVEVTALREDKPIATLTTRIFTQGGA ---------2222-------------2222-------------------------1111- LAVTGEAVVKLP ------------ >OVOTRANSFERRIN; SWP:P02789; PDB:1IQ7A; RIQWCAVGKDEKSKCDRWSVVSNGDVECTVVDETKDCIIKIMKGEADAVALDGGLVYTAG ---------------------iiii-------3333------------------------ VCGLVPVMAERYDDESCSKDERPASYFAVAVARKDSNVNWNNLKGKKSCHTAVGRTAGWV --------------------------------1111--33332222-----22221111- IPMGLIHNRTGTCNFDEYFSEGCAPGSPPNSRLCQLCQGSGGIPPEKCVASSHEKYFGYT -------------3333------2222---1111-----------2222-3333------ GALRCLVEKGDVAFIQHSTVEENTGGKNKADWAKNLQMDDFELLCTDGRRANVMDYRECN ---------------11113333------3333---3333----3333---11111111- LAEVPTHAVVVRPEKANKIRDLLERQEKRFGVNGSEKSKFMMFESQNKDLLFKDLTKCLF -----------3333--------------------3333-1111%%%%----1111---- KVREGTTYKEFLGDKFYTVISSLKTCNPSDILQMCSFLEGK ----------------------------------------- >ARCHAEOSINE TRNA-GUANINE ; SWP:O58843; PDB:1IQ8A; KMLKFEIKARDGAGRIGKLEVNGKKIETPAIMPVVNPKQMVVEPKELEKMGFEIIITNSY ----------!!!!------iiii-----------3333--------------------- IIYKDEELRRKALELGIHRMLDYNGIIEVDSGSFQLMKYGSIEVSNREIIEFQHRIGVDI ---------------------------------1111----------------1111--- GTFLDIPTPPDAPREQAVKELEITLSRAREAEEIKEIPMNATIQGSTYTDLRRYAARRLS ------------3333----------------------------!!!!----------11 SMNFEIHPIGGVVPLLESYRFRDVVDIVISSKMALRPDRPVHLFGAGHPIVFALAVAMGV 11-------------1111-----------3333-1111---2222-3333----1111- DLFDSASYALYAKDDRYMTPEGTKRLDELDYFPCSCPVCSKYTPQELREMPKEERTRLLA -----3333---------1111--1111-------3333---333311113333------ LHNLWVIKEEIKRVKQAIKEGELWRLVDERARSHPKLYSAYKRLLEHYTFLEEFEPITKK --------------------------------------------------3333------ SALFKISNESLRWPVVRRAKERAKSINERFGELVEHPIFGRVSRYLSLTYPFAQSEAEDD ------3333--------------------------------3333-------------- FKIEKPTKEDAIKYVMAIAEYQFGEGASRAFDDAKVELSKTGMPRQVKVNGKRLATVRAD ------1111-------------2222---1111-------------------------- DGLLTLGIEGAKRLHRVLPYPRMRVVVNKEAEPFARKGKDVFAKFVIFADPGIRPYDEVL 
----------------------------1111--1111---3333----11112222--- VVNENDELLATGQALLSGREMIVFQYGRAVKVRKGVE --1111------------3333--------------- >ALPHA-NEUROTOXIN; SWP:P01426; PDB:1IQ9A; LECHNQQSSQPPTTKTCPGETNCYKKVWRDHRGTIIERGCGCPTVKPGIKLNCCTTDKCN ------!!!!------2222---------3333------------2222----------- N - >DI-HEME PEROXIDASE; SWP:P55929; PDB:1IQCA; ANEPIQPIKAVTPENADMAELGKMLFFDPRLSKSGFISCNSCHNLSMGGTDNITTSIGHK -------------------------------3333--3333--1111----------222 WQQGPINAPTVLNSSMNLAQFWDGRAKDLKEQAAGPIANPKEMASTHEIAEKVVASMPQY 2---------2222------1111---3333---33331111------------------ RERFKKVFGSDEVTIDRITTAIAQFEETLVTPGSKFDKWLEGDKNALNQDELEGYNLFKG --------------------------------------11111111-------------- SGCVQCHNGPAVGGSSYQKMGVFKPYETKNPAAGRMDVTGNEADRNVFKVPTLRNIELTY -1111---1111---------------------3333---3333--------2222---- PYFHDGGAATLEQAVETMGRIQLNREFNKDEVSKIVAFLKTLTGDQPDFKLPILPPSNND --1111---------------------------------------------------111 TPRSQPYE 1------- >IgG VH protein [Precursor; SWP:Q9Y298; PDB:1IQDB; QVQLVQSGAEVKKPGASVKVSCKVSGYTLTELPVHWVRQAPGKGLEWVGSFDPESGESIY ------------2222-----------3333----------------------------- AREFQGSVTMTADTSTNIAYMELSSL 1111---------1111--------- >HYPOTHETICAL PROTEIN MTH1; SWP:O27908; PDB:1IQOA; LFIATLKGIFTLKDLPEEFRPFVDYKAGLEKKKLSDDDEIAIISIKGTQSNHVLFLSSYN -----------1111--------------------------------------------- SVDEIRKELEEAGAKINHTTLKILEGHL -3333----------------------- >RFCS; SWP:Q8U4J3; PDB:1IQPA; SEEIREVKVLEKPWVEKYRPQRLDDIVGQEHIVKRLKHYVKTGSMPHLLFAGPPGVGKTT ------------3333-----1111----------------------------------- AALALARELFGENWRHNFLELNASDERGINVIREKVKEFARTKPIGGASFKIIFLDEADA ---------!!!!--------3333-------------------iiii--------1111 LTQDAQQALRRTMEMFSSNVRFILSCNYSSKIIEPIQSRCAIFRFRPLRDEDIAKRLRYI -------------1111----------3333-33331111-------------------- AENEGLELTEEGLQAILYIAEGDMRRAINILQAAAALDKKITDENVFMVASRARPEDIRE --------------------------------3333-----3333--------3333--- 
MMLLALKGNFLKAREKLREILLKQGLSGEDVLVQMHKEVFNLPIEEPKKVLLADKIGEYN ----3333-----------------------------1111---3333------------ FRLVEGANEIIQLEALLAQFTLIGKK --1111-------------------- >S3-RNASE; SWP:O80323; PDB:1IQQA; YDYFQFTQQYQLAVCNSNRTLCKDPPDKLFTVHGLWPSNMVGPDPSKCPIKNIRKREKLL -------------1111------------------------------------------- EHQLEIIWPNVFDRTKNNLFWDKEWMKHGSCGYPTIDNENHYFETVIKMYISKKQNVSRI --3333---33331111----------3333-------------------1111------ LSKAKIEPDGKKRALLDIENAIRNGADNKKPKLKCQKKGTTTELVEITLCSDKSGEHFID -1111--------3333--------iiii--------!!!!----------1111----- CPHPFEPISPHYCPTNNIKY -----3333----------- >PHOTOLYASE; SWP:P61497; PDB:1IQRA; GPLLVWHRGDLRLHDHPALLEALARGPVVGLVVLDPNNLKTTPRRRAWFLENVRALREAY ----------------------1111--------3333---------------------- RARGGALWVLEGLPWEKVPEAARRLKAKAVYALTSHTPYGRYRDGRVREALPVPLHLLPA 1111--------1111-------------------------------------------- PHLLPPDLPRAYRVYTPFSRLYRGAAPPLPPPEALPKGPEEGEIPREDPGLPLPEPGEEA --------------33333333-------------------------------------- ALAGLRAFLEAKLPRYAEERDRLDGEGGSRLSPYFALGVLSPRLAAWEAERRGGEGARKW -----------3333---1111--1111--33331111-----------3333------- VAELLWRDFSYHLLYHFPWMAERPLDPRFQAFPWQEDEALFQAWYEGKTGVPLVDAAMRE ------------------3333-------------------------------------- LHATGFLSNRARMNAAQFAVKHLLLPWKRCEEAFRHLLLDGDRAVNLQGWQWAGGLGVDA -------------------------------------1111----------1111-2222 APYFRVFNPVLQGERHDPEGRWLKRWAPEYPSYAPKDPVVDLEEARRRYLRLARD -3333---------------3333--3333------------------------- >RIBOSOMAL PROTEIN S7; SWP:O59230; PDB:1IQVA; IKVMGRWSTEDVEVKDPSLKPYINLEPRVHIVERLINKVMRSGGSSKKVRAYEVVKEAFK --%%%%--1111---33331111---------------1111------------------ IIEKRTGKNPIQVLVWAIENAAPREDTTSVMFGGIRYHVAVDISPLRRLDVALRNIALGA ------------------------------------------------------------ SAKCYRTKMSFAEALAEEIILAANKDPKSYAYSKKLEIERIAESSR ---------------------11113333----------------- >ANTIBODY M-HFE7A, LIGHT C; SWP:NA; PDB:1IQWH; QVQLQQPGAELVKPGASVKLSCKASGYTFTSYWMQWVKQRPGQGLEWIGEIDPSDSYTNY 
------------2222-----------3333----------------------------- NQKFKGKATLTVDTSSSTAYMQLSSLTSEDSAVYYCARNRDYSNNWYFDVWGTGTTVTVS 3333----------------------3333-----------iiii--------------- SAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQS -----------------------------------------%%%%-------------%% DLYTLSSSVTVPSSTWPSQTVTCNVAHPASSTKVDKKIVPR %%---------3333-----------3333----------- >Igk-V21-4 protein; SWP:A0A5E6; PDB:1IQWL; DIVLTQSPASLAVSLGQRATISCKASQSVDYDGDSYMNWYQQKPGQPPKLLIYAASNLES -------------2222-------------%%%%--------2222------------22 GIPARFSGSGSGTDFTLNIHPVEEEDAATYYCQQSNEDPRTFGGGTKLEIKRADAAPTVS 221111----------------3333---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----33331111---------------------iiii----------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR ---------1111--------1111---------- >FERREDOXIN; SWP:P10245; PDB:1IQZA; PKYTIVDKETCIACGACGAAAPDIYDYDEDGIAYVTLDDNQGIVEVPDILIDDMMDAFEG ------3333-----3333-1111---1111---3333--------3333---------- CPTDSIKVADEPFDGDPNKFE 1111--------iiii-1111 >Ribulose bisphosphate car; SWP:Q43832; PDB:1IR1S; KVWPTQNMKRYETLSYLPPLTTDQLARQVDYLLNNKWVPCLEFETDHGFVYREHHNSPGY -----------2222-----------------1111--------------------2222 YDGRYWTMWKLPMFGCTDPAQVLNELEECKKEYPNAFIRIIGFDSNRQVQCVSFIAYKPA ------------2222-3333-----------1111----------------------22 GY 22 >INSULIN RECEPTOR; SWP:P06213; PDB:1IR3A; SSVFVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRE ------1111-3333-----------------------2222----------1111---- RIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPG ---------3333--1111-------------------1111---------1111----- RPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIETD ---------------------------------3333---1111------1111------ RKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVM --------3333-----------3333-----------1111------------------ DGGYLDQPDNCPERVTDLMRMCWQFNPKMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENK 
-------------------------3333----------1111-3333----1111---- >EXONUCLEASE RECJ; SWP:Q93R48; PDB:1IR6A; PLALLPLKGLREAAALLEEALRQGKRIRVHGDYDADGLTGTAILVRGLAALGADVHPFIP ------------------------------------------------------------ HRLEEGYGVLMERVPEHLEASDLFLTVDCGITNHAELRELLENGVEVIVTDHHTPGKTPP 3333-----3333----1111-----------11113333-------------------- PGLVVHPALTPDLKEKPTGAGVAFLLLWALHERLGLPPPLEYADLAAVGTIADVAPLWGW -----1111-----------------------------3333------------------ NRALVKEGLARIPASSWVGLRLLAEAVGYTGKAVEVAFRIAPRINAASRLGEAEKALRLL --------1111----3333----1111---3333------------11113333----- LTDDAAEAQALVGELHRLNARRQTLEEAMLRKLLPQADPEAKAIVLLDPEGHPGVMGIVA --------------------------------3333-3333----------1111----- SRILEATLRPVFLVAQGKGTVRSLAPISAVEALRSAEDLLLRYGGHKEAAGFAMDEALFP --------------!!!!-----------------3333------3333-----3333-- AFKARVEAYAARFPDPVREVALLDL --------3333--3333---3333 >HEMOGLOBIN ALPHA CHAIN; SWP:P01922; PDB:1IRDA; VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGK ----------------!!!!----------------------1111--2222-------- KVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPA -----------1111------------------3333----------------3333--- VHASLDKFLASVSTVLTSKYR --------------1111--- >HEMOGLOBIN ALPHA CHAIN; SWP:P02023; PDB:1IRDB; VHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKV --------------11111111---------------33331111--------------- KAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGK ----------------1111------------------3333---------------!!! 
EFTPPVQAAYQKVVAGVANALAHKYH !--------------------3333- >TISSUE FACTOR PATHWAY INH; SWP:P10646; PDB:1IRHA; EFHGPSWCLTPADRGLCRANENRFYYNSVIGKCRPFKYSGCGGNENNFTSKQECLRACKK --------------------------------------------------3333------ G - >INTERLEUKIN-2; SWP:P60568; PDB:1IRL; APTSSSTKKTQLQLEHLLLDLQMILNGINNYKNPKLTRMLTAKFYMPKKATELKHLQCLE --3333-----------------------------11113333---------3333---- EELKPLEEVLNLAQSKNFHLRPRDLISNINVIVLELKGSETTFMCEYADETATIVEFLNR ---3333-3333---------3333---------------------------3333---- WITFCQSIISTLT ------------- >RUBREDOXIN; SWP:P00268; PDB:1IRO; MKKYTCTVCGYIYNPEDGDPDNGVNPGTDFKDIPDDWVCPLCGVGKDQFEEVE -------------3333-3333--22223333-1111-------3333----- >OMEGA TRANSCRIPTIONAL REP; SWP:Q57468; PDB:1IRQA; IMGDKTVRVRADLHHIIKIETAKNGGNVKEVMDQALEEYIRKYLPDKL --------------------------3333-------------2222- >20S PROTEASOME; SWP:P60901; PDB:1IRUA; SRGSSAGFDRHITIFSPEGRLYQVEYAFKAINQGGLTSVAVRGKDCAVIVTQKKVPDKLL -------1111----3333-3333------------------------------------ DSSTVTHLFKITENIGCVMTGMTADSRSQVQRARYEAANWKYKYGYEIPVDMLCKRIADI 3333-------------------------------------------------------- SQVYTQNAEMRPLGCCMILIGIDEEQGPQVYKCDPAGYYCGFKATAAGVKQTESTSFLEK 3333----------------------------------------------------3333 KVKKKFDWTFEQTVETAITCLSTVLSIDFKPSEIEVGVVTVENPKFRILTEAEIDAHLVA -----------------------------1111------1111----------------1 LAER 111- >Proteasome subunit alpha ; SWP:P25787; PDB:1IRUB; AERGYSFSLTTFSPSGKLVQIEYALAAVAGGAPSVGIKAANGVVLATEKKQKSILYDERS ------------1111-------------------------------------------- VHKVEPITKHIGLVYSGMGPDYRVLVHRARKLAQQYYLVYQEPIPTAQLVQRVASVMQEY --------------------------------------------------------3333 TQSGGVRPFGVSLLICGWNEGRPYLFQSDPSGAYFAWKATAMGKNYVNGKTFLEKRYNED ----------------------------3333----------2222-------------- LELEDAIHTAILTLKESFEGQMTEDNIEVGICNEAGFRRLTPTEVKDYLAAIA ----------------------------------------------------- >Proteasome subunit alpha ; SWP:Q53XP2; PDB:1IRUC; 
SRRYDSRTTIFSPEGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNIHKLLDEVFF 3333-------3333-3333---------------------------------------- SEKIYKLNEDMACSVAGITSDANVLTNELRLIAQRYLLQYQEPIPCEQLVTALCDIKQAY ---------------------------------------------3333---------11 TQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYGGWKATCIGNNSAAAVSMLKQDYKE 11---------------------------------------------------------- GEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLTRENGKTVIRVLKQKEVEQLIKKHE -------------------------3333--------------------------3333- EEEAKAEREK -3333----- >Proteasome subunit alpha ; SWP:O14818; PDB:1IRUD; SYDRAITVFSPDGHLFQVEYAQEAVKKGSTAVGVRGRDIVVLGVEKKSVAKLQDERTVRK ---------1111-3333------------------------------------------ ICALDDNVCMAFAGLTADARIVINRARVECQSHRLTVEDPVTVEYITRYIASLKQRYTQS ------------------------------------------------------------ NGRRPFGISALIVGFDFDGTPRLYQTDPSGTYHAWKANAIGRGAKSVREFLEKNYTDEAI --------------------------3333----------2222------1111-3333- ETDDLTIKLVIKALLEVVQSGGKNIELAVMRRDQSLKILNPEEIEKYVAEIEKEKEENEK -3333------------------------------------------------------- KKQ --- >Proteasome subunit alpha ; SWP:P28066; PDB:1IRUE; YDRGVNTFSPEGRLFQVEYDIEAIKLGSTAIGIQTSEGVCLAVEKRITSPLMEPSSIEKI --------1111---------3333---------1111----------11113333---- VEIDAHIGCAMSGLIADAKTLIDKARVETQNHWFTYNETMTVESVTQAVSNLALQFGEED ---1111------3333------------------------------------------- ADPGAMSRPFGVALLFGGVDEKGPQLFHMDPSGTFVQCDARAIGSASEGAQSSLQELYHK -------------------1111------3333----------1111-----------11 SMTLKEAIKSSLIILKQVMEEKLNATNIELATVQPGQNFHMFTKEELEEVIKDI 113333-------------------------------------------3333- >Proteasome subunit alpha ; SWP:P25786; PDB:1IRUF; NQYDNDVTVWSPQGRIHQIEYAMEAVKQGSATVGLKSKTHAVLVALKRAQSELAAHQKKI 1111-1111-1111-3333------3333---------------------1111------ LHVDNHIGISIAGLTADARLLCNFMRQECLDSRFVFDRPLPVSRLVSLIGSKTQIPTQRY ---1111---------------------------------3333--------------22 GRRPYGVGLLIAGYDDMGPHIFQTCPSANYFDCRAMSIGARSQSARTYLERHMSEFMECN 
22------------------------------------2222---------33331111- LNELVKHGLRALRETLPAEQDLTTKNVSIGIVGKDLEFTIYDDDDVSPFLEGLEERPQ -----------3333-------1111------1111------3333------------ >Proteasome subunit alpha ; SWP:P25788; PDB:1IRUG; SSIGTGYDLSASTFSPDGRVFQVEYAMKAVENSSTAIGIRCKDGVVFGVEKLVLSKLYEE ---------1111-1111------------------------------------3333-- GSNKRLFNVDRHVGMAVAGLLADARSLADIAREEASNFRSNFGYNIPLKHLADRVAMYVH ---------1111---------------------------------------------33 AYTLYSAVRPFGCSFMLGSYSVNDGAQLYMIDPSGVSYGYWGCAIGKARQAAKTEIEKLQ 33-----------------------------1111-------------33331111---3 MKEMTCRDIVKEVAKIIYIVHDEVKDKAFELELSWVGELTNGRHEIVPKDIREEAEKYAK 333-3333----------------------------3333-------------------- ESLKE 3333- >Proteasome subunit beta t; SWP:P28072; PDB:1IRUH; TTIMAVQFDGGVVLGADSRTTTGSYIANRVTDKLTPIHDRIFCCRSGSAADTQAVADAVT ------------------------------------------------------------ YQLGFHSIELNEPPLVHTAASLFKEMCYRYREDLMAGIIIAGWDPQEGGQVYSVPMGGMM ---------------------------1111----------------------------- VRQSFAIGGSGSSYIYGYVDATYREGMTKEECLQFTANALALAMERDGSSGGVIRLAAIA --------3333----------------------------------1111---------3 ESGVERQVLLGDQIPKFAVATL 333-------3333-------- >Proteasome subunit beta t; SWP:P70195; PDB:1IRUI; TTIAGVVYKDGIVLGADTRATEGMVVADKNCSKIHFISPNIYCCGAGTAADTDMTTQLIS -------1111----------!!!!------------1111------------------- SNLELHSLSTGRLPRVVTANRMLKQMLFRYRGYIGAALVLGGVDVTGPHLYSIYPHGSTD ---------------------------1111------------1111------1111--- KLPYVTMGSGSLAAMAVFEDKFRPDMEEEEAKNLVSEAIAAGIFNDLGSGSNIDLCVISK -------1111---------------3333---------------1111----------- NKLDFLRPYTVPNKKGTRLGRYRCEKGTTAVLTEKITPLE ------------------------2222------------ >Proteasome subunit beta t; SWP:P33672; PDB:1IRUJ; SIMSYNGGAVMAMKGKNCVAIAADRRFGIQAQLVTTDFQKIFPMGDRLYIGLAGLATDVQ 3333----------------------------------------2222------------ TVAQRLKFRLNLYELKEGRQIKPYTLMSMVANLLYEKRFGPYYTEPVIAGLDPKTFKPFI ---------------------3333----------------------------------- 
CSLDLIGCPMVTDDFVVSGTCAEQMYGMCESLWEPNMDPDHLFETISQAMLNAVDRDAVS ---1111------------------------------3333----------3333-1111 GMGVIVHIIEKDKITTRTLKARMD ------------------------ >Proteasome subunit beta t; SWP:P49721; PDB:1IRUK; MEYLIGIQGPDYVLVASDRVAASNIVQMKDDHDKMFKMSEKILLLCVGEAGDTVQFAEYI ----------------------!!!!---------------------------------- QKNVQLYKMRNGYELSPTAAANFTRRNLADCLRSRTPYHVNLLLAGYDEHEGPALYYMDY ---------------3333---------------------------------------11 LAALAKAPFAAHGYGAFLTLSILDRYYTPTISRERAVELLRKCLEELQKRFILNLPTFSV 11----------3333---------------3333------------------------- RIIDKNGIHDLDNISFPKQ ---1111------------ >Proteasome subunit beta t; SWP:P28074; PDB:1IRUL; TTTLAFKFRHGVIVAADSRATAGAYIASQTVKKVIEINPYLLGTMAGGAADCSFWERLLA -------1111----------!!!!----------------------------------- RQCRIYELRNKERISVAAASKLLANMVYQYKGMGLSMGTMICGWDKRGPGLYYVDSEGNR --------------------------3333------------------------1111-- ISGATFSVGSGSVYAYGVMDRGYSYDLEVEQAYDLARRAIYQATYRDAYSGGAVNLYHVR --------1111-----------11113333---------------1111---------- EDGWIRVSSDNVADLHEKYSG -------------3333---- >Proteasome subunit beta t; SWP:P20618; PDB:1IRUM; RFSPYVFNGGTILAIAGEDFAIVASDTRLSEGFSIHTRDSPKCYKLTDKTVIGCSGFHGD ------------------------------------------------------------ CLTLTKIIEARLKMYKHSNNKAMTTGAIAAMLSTILYSRRFFPYYVYNIIGGLDEEGKGA ------------------------------------1111-------------1111--- VYSFDPVGSYQRDSFKAGGSASAMLQPLLDNQVGFKNMQNVEHVPLSLDRAMRLVKDVFI ----1111----------1111-------------------------------------- SAAERDVYTGDALRICIVTKEGIREETVSLRKD -----3333---------3333----------- >Proteasome subunit beta t; SWP:P28070; PDB:1IRUN; TQNPMVTGTSVLGVKFEGGVVIAADMLGSYGSLARFRNISRIMRVNNSTMLGASGDYADF ---------------2222----------!!!!------------1111----------- QYLKQVLGQMVIDEELLGDGHSYSPRAIHSWLTRAMYSRRSKMNPLWNTMVIGGYADGES -------------3333----------------------1111------------iiii- FLGYVDMLGVAYEAPSLATGYGAYLAQPLLREVLEKQPVLSQTEARDLVERCMRVLYYRD -----1111-------------------------------3333---------------1 
ARSYNRFQTATVTEKGVEIEGPLSTETNWDIAHMISG 111---------3333------------1111----- >LYSYL-TRNA SYNTHETASE; SWP:O57963; PDB:1IRXA; HWADYIADKIIRERGEKEKYVVESGITPSGYVHVGNFRELFTAYIVGHALRDKGYEVRHI --------------------------------------------------1111------ HMWDDYDRFRKVPRNVPQEWKDYLGMPISEVPDPWGCHESYAEHFMRKFEEEVEKLGIEV ----------------11111111--1111--1111----------------3333---- DLLYASELYKRGEYSEEIRLAFEKRDKIMEILNKYREIAKQPPLPENWWPAMVYCPEHRR ---3333-----------------------------1111-------------------- EAEIIEWDGGWKVKYKCPEGHEGWVDIRSGNVKLRWRVDWPMRWSHFGVDFEPAGKDHLV -------------------------3333-------------------------3333-- AGSSYDTGKEIIKEVYGKEAPLSLMYEFVGIKGQNVILLSDLYEVLEPGLVRFIYARHRP -------------------------------------33333333-------------11 NKEIKIDLGLGILNLYDEFEKVERIYFGVEGEELRRTYELSMPKKPERLVAQAPFRFLAV 11------1111-------------------3333---1111-----------3333--3 LVQLPHLTEEDIINVLIKQGHIPRDLSKEDVERVKLRINLARNWVKKYAPEDVKFSILEK 3331111---------1111-----------------------------3333------- PPEVEVSEDVREAMNEVAEWLENHEEFSVEEFNNILFEVAKRRGISSREWFSTLYRLFIG ---------------------------------------------3333----------- KERGPRLASFLASLDRSFVIKRLRLEG ----------11113333--------- >HMTH1; SWP:P36639; PDB:1IRYA; MGASRLYTLVLVLQPQRVLLGMKKRGFGAGRWNGFGGKVQEGETIEDGARRELQEESGLT -------------------------------------------3333------------- VDALHKVGQIVFEFVGEPELMDVHVFCTDSIQGTPVESDEMRPCWFQLDQIPFKDMWPDD -------------2222--------------------1111-----1111-3333---33 SYWFPLLLQKKKFHGYFKFQGQDTILDYTLREVDTV 33----1111-------------------------- >ARR10-B; SWP:O49397; PDB:1IRZA; TAQKKPRVLWTHELHNKFLAAVDHLGVERAVPKKILDLMNVDKLTRENVASHLQKFRVAL -------------------------------3333-333322223333-------3333- KKVS ---- >RIBOSOME RECYCLING FACTOR; SWP:Q8GRF5; PDB:1IS1A; MINEIKKDAQERMDKSVEALKNNLSKVRTGRAHPSLLSGISVEYYGAATPLNQVANVVAE -----------------------1111-----33331111---iiii--3333------- DARTLAITVFDKELTQKVEKAIMMSDLGLNPMSAGTIIRVPLPPLTEERRKDLVKIVRGE ----------1111-------------------!!!!----------------------- 
AEGGRVAVRNIRRDANNDLKALLKDKEISEDEDRKAQEEIQKLTDVAVKKIDEVLAAKEK ----------------------1111---------------------------------- ELMEV 1111- >CONGERIN II; SWP:Q9YIC2; PDB:1IS3A; DRAEVRNIPFKLGMYLTVGGVVNSNATRFSINVGESTDSIAMHMDHRFSYGADQNVLVLN ----------2222--------2222-----------------------!!!!------- SLVHNVGWQQEERSKKFPFTKGDHFQTTITFDTHTFYIQLSNGETVEFPNRNKDAAFNLI -------------------2222--------1111----1111------1111------- YLAGDARLTFVRLE -------------- >GTP CYCLOHYDROLASE I; SWP:P22288; PDB:1IS8A; RPRSEEDNELNLPNLAAAYSSILRSLGEDPQRQGLLKTPWRAATAMQFFTKGYQETISDV -----------------------1111-11111111--------------3333-3333- LNDAIFDEDHDEMVIVKDIDMFSMCEHHLVPFVGRVHIGYLPNKQVLGLSKLARIVEIYS ------------------------------------------------------------ RRLQVQERLTKQIAVAITEALQPAGVGVVIEATHMCMVMRGKMNSKTVTSTMLGVFREDP ---------------------------------3333----1111--------3333--- KTREEFLTLIRS ------------ >RIBOSOME RECYCLING FACTOR; SWP:P16174; PDB:1ISEA; ISDIRKDAEVRMDKCVEAFKTQISKIRTGRASPSLLDGIVVEYYGTPTPLRQLASVTVED ---------------------1111------11111111---iiii--3333-------- SRTLKINVFDRSMSPAVEKAIMASDLGLNPNSAGSDIRVPLPPLTEERRKDLTKIVRGEA ---------3333-------33331111-------------------------------- EQARVAVRNVGRDANDKVKALLKDKEISEDDDRRSQDDVQKLTDAAIKKIEAALADKEAE ---------------------------3333----------------------------- LMQF ---- >BONE MARROW STROMAL CELL ; SWP:Q10588; PDB:1ISIA; WRAEGTSAHLRDIFLGRCAEYRALLSPEQRNKDCTAIWEAFKVALDKDPCSVLPSDYDLF -------------------3333--3333-------------1111------3333---- ITLSRHSIPRDKSLFWENSHLLVNSFADNTRRFMPLSDVLYGRVADFLSWCRQKADSGLD --------2222---------------iiii---33333333--2222------------ YQSCPTSEDCENNPVDSFWKRASIQYSKDSSGVIHVMLNGSEPTGAYPIKGFFADYEIPN -----3333-----------------1111--------1111--------3333--3333 LQKEKITRIEIWVMHEIGGPNVESCGEGSMKVLEKRLKDMGFQYSCINDYRPVKLLQCVD -3333----------2222----2222----------1111---------------3333 HSTHPDCALK 11111111-- >LIPASE; SWP:P37957; PDB:1ISPA; EHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNYNNGPVLSRFVQK 
---------22221111--------1111-1111-------11113333----------- VLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGGANRLTTGKALPGTDPNQ -------------------------------------------1111---------1111 KILYTSIYSSADMIVMNYLSRLDGARNVQIHGVGHIGLLYSSQVNSLIKEGLNGGGQNT --------1111---3333--2222--------3333-------------1111----- >HIGH POTENTIAL IRON SULFU; SWP:P33678; PDB:1ISUA; GTNAAMRKAFNYQDTAKNGKKCSGCAQFVPGASPTAAGGCKVIPGDNQIAPGGYCDAFIV ----------------iiii33331111----1111---1111------1111-1111-- KK -- >HEMOGLOBIN; SWP:Q7SID0; PDB:1IT2A; PIIDQGPLPTLTDGDKKAINKIWPKIYKEYEQYSLNILLRFLKCFPQAQASFPKFSTKKS -----------------------3333-----------------3333---3333----- NLEQDPEVKHQAVVIFNKVNEIINSMDNQEEIIKSLKDLSQKHKTVFKVDSIWFKELSSI 3333--------------------1111-------------------------------- FVSTIDGGAEFEKLFSIICILLRSAY --1111--------------1111-- >HUMANIZED ANTIBODY HFE7A,; SWP:NA; PDB:1IT9H; QVQLVQSGAEVKKPGASVKVSCKASGYTFTSYWMQWVKQAPGQGLEWMGEIDPSDSYTNY ---------------------------1111--------2222----------------- NQKFKGKATLTVDTSTSTAYMELSSLRSEDTAVYYCARNRDYSNNWYFDVWGEGTLVTVS 1111--------3333----------3333------------------------------ SASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQS -----------------------------------------------2222-------11 SGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKV 11----------3333------------1111------- >HUMANIZED ANTIBODY HFE7A,; SWP:NA; PDB:1IT9L; EIVLTQSPGTLSLSPGERATLSCKASQSVDYDGDSYMNWYQQKPGQAPRLLIYAASNLES -------------2222-----------------------------------------22 GIPDRFSGSGSGTDFTLTISRLEPEDFAVYYCQQSNEDPRTFGQGTKLEIKRTVAAPSVF 223333----------------1111---------------------------------- IFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLS -----3333--------------------------------------------------- STLTLSKADYEKHKVYACEVTHQGLSSPVTKSFN -----3333------------3333--------- >Interleukin-1 receptor ty; SWP:P14778; PDB:1ITBB; CKEREEKIILVSSANEIDVRPCPLNPNEHKGTITWYKDDSKTPVSTEQASRIHQHKEKLW ----------------------------------------------3333---------- 
FVPAKVEDSGHYYCVVRNSSYCLRIKISAKFVENEPNLCYNAQAIFKQKLPVAGDGGLVC !!!!3333--------------------------2222--3333---------------- PYMEFFKNENNELPKLQWYKDCKPLLLDNIHFSGVKDRLIVMNVAEKHRGNYTCHASYTY ----3333%%%%-------iiii-----------!!!!------1111------------ LGKQYPITRVIEFITLEENKPTRPVIVSPANETMEVDLGSQIQLICNVTGQLSDIAYWKW --------------------------------------------------1111------ NGSVIDEDDPVLGEDYYSVENPANKRRSTLITVLNISEIESRFYKHPFTCFAKNTHGIDA --------------------3333---------------3333----------1111--- AYIQLIYPVT ---------- >HEMOGLOBIN (CYANO MET); SWP:P06148; PDB:1ITHA; GLTAAQIKAIQDHWFLNIKGCLQAAADSIFFKYLTAYPGDLAFFHKFSSVPLYGLRSNPA -----------------3333---------------33333333--111133331111-- YKAQTLTVINYLDKVVDALGGNAGALMKAKVPSHDAMGITPKHFGQLLKLVGGVFQEEFS -------------------------------3333------------------------- ADPTTVAAWGDAAGVLVAAMK --------------------- >CATALASE-PEROXIDASE; SWP:O59651; PDB:1ITKA; KRPKSNQDWWPSKLNLEILDQNARDVGPVEDDFDYAEEFQKLDLEAVKSDLEELMTSSQD ----3333-1111-3333-1111------1111-----11113333------3333--11 WWPADYGHYGPLFIRMAWHSAGTYRTADGRGGAAGGRQRFAPINSWPDNANLDKARRLLL 11-2222-------------1111--------11111111-33333333----------- PIKQKYGQKISWADLMILAGNVAIESMGFKTFGYAGGREDAFEEDKAVNWGPEDEFETQE -----!!!!---------------1111----------------1111------------ RFDEPGEIQEGLGASVMGLIYVNPEGPDGNPDPEASAKNIRQTFDRMAMNDKETAALIAG ---2222-2222---2222---1111iiii--------------1111-3333------- GHTFGKVHGADDPEENLGPEPEAAPIEQQGLGWQNKNMITSGIEGPWTQSPTEWDMGYIN -----------3333----3333-3333---------------------1111------- NLLDYEWEPEKGPGGAWQWAPKSEELKNSVPDAHDPDEKQTPMMLTTDIALKRDPDYREV -----------1111--------1111-----------------33333333-------- METFQENPMEFGMNFAKAWYKLTHRDMGPPERFLGPEVPDEEMIWQDPLPDADYDLIGDE -----------------------1111-3333--1111----1111-------------- EIAELKEEILDSDLSVSQLVKTAWASASTYRDSDKRGGANGARLRLEPQKNWEVNEPEQL --------------------------1111--------22221111-33333333----- ETVLGTLENIQTEFNDSRSDGTQVSLADLIVLGGNAAVEQAAANAGYDVEIPFEPGRVDA ------------------------------------------1111-------------- 
GPEHTDAPSFDALKPKVDGVRNYIQDDITRPAEEVLVDNADLLNLTASELTALIGGMRSI 3333-33333333-----1111--------3333------1111---------------- GANYQDTDLGVFTDEPETLTNDFFVNLLDMGTEWEPAADSEHRYKGLDRDTGEVKWEATR --2222-2222---------3333----3333--------------------------33 IDLIFGSNDRLRAISEVYGSADAEKKLVHDFVDTWSKVMKLDRFDLE 331111-------------------------------1111-3333- >RENAL DIPEPTIDASE; SWP:P16444; PDB:1ITUA; DFFRDEAERIMRDSPVIDGHNDLPWQLLDMFNNRLQDERANLTTLAGTHTNIPKLRAGFV ------------------------------%%%%--33331111---------------- GGQFWSVYTPCDTQNKDAVRRTLEQMDVVHRMCRMYPETFLYVTSSAGIRQAFREGKVAS ---------1111----------------------3333--------------------- LIGVEGGHSIDSSLGVLRALYQLGMRYLTLTHSCNTPWADNWLVDTGDSEPQSQGLSPFG ---------%%%%-------1111-----------------3333-----1111------ QRVVKELNRLGVLIDLAHVSVATMKATLQLSRAPVIFSHSSAYSVCASRRNVPDDVLRLV ---------------2222----------------------3333--------------- KQTDSLVMVNFYNNYISCTNKANLSQVADHLDHIKEVAGARAVGFGGDFDGVPRVPEGLE -----------3333-------3333------------1111-----2222----2222- DVSKYPDLIAELLRRNWTEAEVKGALADNLLRVFEAVEQASNLTQAPEEEPIPLDQLGGS 1111---------------------------------11111111-------3333---- CRTHYGYSS --------- >MMP9; SWP:P14780; PDB:1ITVA; DDACNVNIFDAIAEIGNQLYLFKDGKYWRFSEGRGSRPQGPFLIADKWPALPRKLDSVFE -3333-------------------------------------3333-1111--------- EPLSKKLFFFSGRQVWVYTGASVLGPRRLDKLGLGADVAQVTGALRSGRGKMLLFSGRRL ----------!!!!----!!!!-------1111-1111---------2222----!!!!- WRFDVKAQMVDPRSASEVDRMFPGVPLDTHDVFQFREKAYFCQDRFYWRVSSRSELNQVD ----1111--1111--3333-2222---------%%%%----!!!!------%%%%---- QVGYVTYDILQCPED ---3333-------- >ISOCITRATE DEHYDROGENASE; SWP:P16100; PDB:1ITWA; STPKIIYTLTDEAPALATYSLLPIIKAFTGSSGIAVETRDISLAGRLIATFPEYLTDTQK ----------------------------3333--------------33333333-3333- ISDDLAELGKLATTPDANIIKLPNISASVPQLKAAIKELQQQGYKLPDYPEEPKTDTEKD -------------1111----------------------1111----------------- VKARYDKIKGSAVNPVLREGNSDRRAPLSVKNYARKHPHKMGAWSADSKSHVAHMDNGDF ----3333----3333----------------------------1111----------33 
YGSEKAALIGAPGSVKIELIAKDGSSTVLKAKTSVQAGEIIDSSVMSKNALRNFIAAEIE 33------------------1111-----------2222--------------------- DAKKQGVLLSVHLKATMMKVSDPIMFGQIVSEFYKDALTKHAEVLKQIGFDVNNGIGDLY -------------3333----------------------------1111-3333------ ARIKTLPEAKQKEIEADIQAVYAQRPQLAMVNSDKGITNLHVPSDVIVDASMPAMIRDSG -3333-3333----------3333-------3333--11111111-3333-------%%% KMWGPDGKLHDTKAVIPDRCYAGVYQVVIEDCKQHGAFDPTTMGSVPNVGLMAQKAEEYG %--1111-------------3333--------------3333----------%%%%3333 SHDKTFQIPADGVVRVTDESGKLLLEQSVEAGDIWRMCQAKDAPIQDWVKLAVNRARATN 1111-------------1111--------2222--------------------------- TPAVFWLDPARAHDAQVIAKVERYLKDYDTSGLDIRILSPVEATRFSLARIREGKDTISV -------3333-------------1111-1111--------------------------- TGNVLRDYLTDLFPIMELGTSAKMLSIVPLMSGGGLFETGAGGSAPKHVQQFLEEGYLRW -----------------------------1111-----------3333------------ DSLGEFLALAASLEHLGNAYKNPKALVLASTLDQATGKILDNNKSPARKVGEIDNRGSHF -3333------------1111------------------1111----------------- YLALYWAQALAAQTEDKELQAQFTGIAKALTDNETKIVGELAAAQGKPVDIAGYYHPNTD -----------------------------------------1111--------------- LTSKAMRPSATFNAALAPLA ---------------3333- >GLYCOSYL HYDROLASE; SWP:P20533; PDB:1ITXA; LQPATAEAADSYKIVGYYPSWAAYGRNYNVADIDPTKVTHINYAFADICWNGIHGNPDPS ------3333--------1111-3333-3333-3333------------iiii----111 GPNPVTWTCQNEKSQTINVPNGTIVLGDPWIDTGKTFAGDTWDQPIAGNINQLNKLKQTN 1---------1111-----2222-------------2222-------------------1 PNLKTIISVGGWTWSNRFSDVAATAATREVFANSAVDFLRKYNFDGVDLDWEYPVSGGLD 111-------33331111----------------------------------------11 GNSKRPEDKQNYTLLLSKIREKLDAAGAVDGKKYLLTIASGASATYAANTELAKIAAIVD 11--1111--------------------------------------1111---------- WINIMTYDFNGAWQKISAHNAPLNYDPAASAAGVPDANTFNVAAGAQGHLDAGVPAAKLV ----------1111-----------33331111--3333----------1111-3333-- LGVPFYGRGWDGCAQAGNGQYQTCTGGSSVGTWEAGSFDFYDLEANYINKNGYTRYWNDT --------------2222---------------2222------------iiii------- AKVPYLYNASNKRFISYDDAESVGYKTAYIKSKGLGGAMFWELSGDRNKTLQNKLKADL 
-----------------------------------------33331111---------- >TRANSKETOLASE; SWP:Q7SIC9; PDB:1ITZA; AATGELLEKSVNTIRFLAIDAVEKANSGHPGLPMGCAPMGHVLYDEVMRYNPKNPYWFNR --------------------------------------------------1111--1111 DRFVLSAGHGCMLQYALLHLAGYDSVKEEDLKQFRQWGSRTPGHPENFETPGVEVTTGPL ------3333------------33333333--2222---------1111---------22 GQGIANAVGLALAEKHLAARFNKPDSEIVDHYTYVILGDGCQMEGIANEACSLAGHWGLG 22--------------------1111-----------3333-----------------11 KLIAFYDDNHISIDGDTEIAFTEDVSTRFEALGWHTIWVKNGNTGYDDIRAAIKEAKAVT 11---------11113333----------1111-------3333-----------3333- DKPTLIKVTTTIGFGSPNKANSYSVHGSALGAKEVEATRQNLGWPYDTFFVPEDVKSHWS ----------2222-------3333----------------------------------- RHTPEGAALEADWNAKFAEYEKKYADDAATLKSIITGELPTGWVDALPKYTPESPGDATR ---------------------------------------22221111---1111------ NLSQQCLNALANVVPGLIGGSADLASSNMTLLKMFGDFQKDTAEERNVRFGVREHGMGAI -------------1111------1111----1111---1111------------------ CNGIALHSPGFVPYCATFFVFTDYMRGAMRISALSEAGVIYVMTHDSIGLGEDGPTHQPI -------1111------33333333-------------------------33331111-- EHLVSFRAMPNILMLRPADGNETAGAYKVAVLNRKRPSILALSRQKLPHLPGTSIEGVEK -----3333-----------------------1111-------------22223333--- GGYTISDNSTGNKPDLIVMGTGSELEIAAKAADELRKEGKTVRVVSFVSWELFDEQSDEY ---------%%%%-------!!!!-----------1111-------------1111---- KESVLPAAVTARISIEAGSTLGWQKYVGAQGKAIGIDKFGASAPAGTIYKEYGITVESII -----3333----------2222----1111-----------------------3333-- AAAKSF ------ >GAMMA1-ADAPTIN; SWP:O43747; PDB:1IU1A; GIPSITAYSKNGLKIEFTFERSNTNPSVTVITIQASNSTELDMTDFVFQAAVPKTFQLQL ---------iiii-----------1111------------------------3333---- LSPSSSIVPAFNTGTITQVIKVLNPQKQQLRMRIKLTYNHKGSAMQDLAEVNNFPPQSWQ ---------%%%%----------1111------------iiii-----------3333-- >MICROBIAL TRANSGLUTAMINAS; SWP:P81453; PDB:1IU4A; DSDDRVTPPAEPLDRMPDPYRPSYGRAETVVNNYIRKWQQVYSHRDGRKQQMTEEQREWL 1111-------3333-------iiii------------------iiii------------ SYGCVGVTWVNSGQYPTNRLAFASFDEDRFKNELKNGRPRSGETRAEFEGRVAKESFDEE 
--3333------------------------------------------------------ KGFQRAREVASVMNRALENAHDESAYLDNLKKELANGNDALRNEDARSPFYSALRNTPSF --------------1111--------------------3333-------33331111333 KERNGGNHDPSRMKAVIYSKHFWSGQDRSSSADKRKYGDPDAFRPAPGTGLVDMSRDRNI 3----%%%%1111-------------11113333----1111-----------1111--- PRSPTSPGEGFVNFDYGWFGAQTEADADKTVWTHGNHYHAPNGSLGAMHVYESKFRNWSE -----3333------------------------------1111-------------1111 GYSDFDRGAYVITFIPKSWNTAPDKVKQGWP ---------------3333------------ >RUBREDOXIN; SWP:P24297; PDB:1IU5A; AKYVCKICGYIYDEDAGDPDNGVSPGTKFEEIPDDWVCPICGAPKSEFEKL ------------3333-3333--22223333-1111-------3333---- >PYRROLIDONE-CARBOXYLATE P; SWP:O58321; PDB:1IU8A; MKILLTGFEPFGGDDKNPTMDIVEALSERIPEVVGEILPVSFKRAREKLLKVLDDVRPDI ----------iiii---------------1111--------------------------- TINLGLAPGRTHISVERVAVNMIDARIPDNDGEQPKDEPIVEGGPAAYFATIPTREIVEE ------2222------------------1111--------2222---------------- MKKNGIPAVLSYTAGTYLCNFAMYLTLHTSATKGYPKIAGFIHVPYTPDQVLEKKNTPSM -1111-----------------------------------------33331111------ SLDLEIKGVEIAIRVAQSALHSSQLR ---------------------1111- >HIGH-POTENTIAL IRON-SULFU; SWP:P80176; PDB:1IUAA; AAPANAVTADDPTAIALKYNQDATKSERVAAARPGLPPEEQHCANCQFMQANVGEGDWKG --1111-1111----------3333-3333------3333-333311111111-!!!!-- CQLFPGKLINVNGWCASWTLKAG 1111----------1111----- >FERREDOXIN; SWP:Q8IED5; PDB:1IUEA; AFYNITLRTNDGEKKIECNEDEYILDASERQNVELPYSCRGGSCSTCAAKLVEGEVDNDD ------------------1111------1111------------1111---------111 QSYLDEEQIKKKYILLCTCYPKSDCVIETHKEDELHDM 1-------------3333-------------------- >CENTROMERE ABP1 PROTEIN; SWP:P49777; PDB:1IUFA; HMGKIKRRAITEHEKRALRHYFFQLQNRSGQQDLIEWFREKFGKDISQPSVSQILSSKYS ----------3333---------------3333----------------3333------- YLDNTVEKPWDVKRNRPPKYPLLEAALFEWQVQQGDDATLSGETIKRAAAILWHKIPEYQ 1111--------------------------3333-------------------------- DQPVPNFSNGWLEGFRKRHILH --------3333---------- >PUTATIVE ASPARTATE AMINOT; SWP:P83786; PDB:1IUGA; 
DWLLTPGPVRLHPKALEALARPQLHHRTEAAREVFLKARGLLREAFRTEGEVLILTGSGT -----------------1111---1111-----------------------------333 LAMEALVKNLFAPGERVLVPVYGKFSERFYEIALEAGLVVERLDYPYGDTPRPEDVAKEG 3----------2222------------------------------------3333----- YAGLLLVHSETSTGALADLPALARAFKEKNPEGLVGADMVTSLLVGEVALEAMGVDAAAS ----------1111------------1111-----------2222----3333------- GSQGLMCPPGLGFVALSPRALERLKPRGYYLDLARELKAQKEGESAWTPAINLVLAVAAV -----------------3333-------1111---33331111----------------- LEEVLPRLEEHLALKAWQNALLYGVGEEGGLRPVPKRFSPAVAAFYLPEGVPYARVKEAF ---3333-------------------1111--------3333---------3333----- AQRGAVIAGGQGPLKGKVFRLSLMGAYDRYEALGVAGMFREVLEEIL ----------!!!!--------------------------------- >2'-5' RNA LIGASE; SWP:Q84CU4; PDB:1IUHA; MRLFYAVFLPEEVRAALVEAQTKVRPFRGWKPVPPHQLHLTLLFLGERPEEELPDYLALG -------------------33333333------1111-----------3333-------- HRLARLEAPFRARLRGTGYFPNEGTPRVWFAKAEAEGFLRLAEGLRAGVEELLGEEAVRI --1111----------------------------3333-----------------33332 PGWDKPFKPHITLARRKAPAPRVPPVLFGLEWPVEGFALVRSELKPKGPVYTVLEKFSLR 222--------------------------------------------------------- GEH --- >HYPOTHETICAL PROTEIN TT13; SWP:Q53VV7; PDB:1IUJA; MFVTMNRIPVRPEYAEQFEEAFRQRARLVDRMPGFIRNLVLRPKNPGDPYVVMTLWESEE ----------3333----------33331111------------1111------------ AFRAWTESPAFKEGHARSGTLPKEAFLGPNRLEAFEVVLDSE --------3333------------------------------ >HYPOTHETICAL PROTEIN TT14; SWP:Q8GHJ5; PDB:1IUKA; MNDQELRAYLSQAKTIAVLGAHKDPSRPAHYVPRYLREQGYRVLPVNPRFQGEELFGEEA -----------------------1111-------------------3333----iiii-- VASLLDLKEPVDILDVFRPPSALMDHLPEVLALRPGLVWLQSGIRHPEFEKALKEAGIPV --1111------------11111111--------------2222---------------- VADRCLMVEHKRLFRG ---------------- >GLYCEROL-3-PHOSPHATE ACYL; SWP:P10349; PDB:1IUQA; ASHSRKFLDVRSEEELLSCIKKETEAGKLPPNVAAGEELYQNYRNAVIESGNPKADEIVL ----3333-------------------------------------------1111----- SNTVALDRILLDVEDPFVFSSHHKAIREPFDYYIFGQNYIRPLIDFGNSFVGNLSLFKDI 
---------------------------------------3333-3333----3333---- EEKLQQGHNVVLISNHQTEADPAIISLLLEKTNPYIAENTIFVAGDRVLADPLCKPFSIG -----------------1111------------------------3333-33333333-- RNLICVYSKKHFDIPELTETKRKANTRSLKEALLLRGGSQLIWIAPSGGRDRPDPSTGEW -------3333--3333----------------3333-------3333------------ YPAPFDASSVDNRRLIQHSDVPGHLFPLALLCHDIPPPRVIAFNGAGLSVAPEISFEEIA --------------3333-------------3333------------------------1 ATHKNPEEVREAYSKALFDSVAQYNVLKTAISGKQGLGASTADVSLSQPW 111---------------------------1111--33331111------ >KIAA0730 PROTEIN; SWP:Q9NZJ4; PDB:1IURA; MHHHHHHLVPRGSILKEVTSVVEQAWKLPESERKKIIRRLYLKWHPDKNPENHDIANEVF ----------------------1111-------------------3333----3333--- KHLQNEINRLEKQAFLDQNADRASRRTF ---------------------------- >CULLIN-3 HOMOLOGUE; SWP:Q9JLV5; PDB:1IUYA; MAAKQGESDPERKETRQKVDDDRKHEIEAAIVRIMKSRKKMQHNVLVAEVTQQLKARFLP ------------------------------------------------------------ SPVVIKKRIEGLIEREYLARTPEDRKVYTYVA ------------1111---------------- >PLASTOCYANIN; SWP:P56274; PDB:1IUZ; AQIVKLGGDDGSLAFVPSKISVAAGEAIEFVNNAGFPHNIVFDEDAVPAGVDADAISYDD -------1111-----------2222----------------1111-22223333----- YLNSKGETVVRKLSTPGVYGVYCEPHAGAGMKMTITVQ ---2222---------------33331111-------- >HYPOTHETICAL PROTEIN; SWP:P83694; PDB:1IV0A; MRVGALDVGEARIGLAVGEEGVPLASGRGYLVRKTLEEDVEALLDFVRREGLGKLVVGLP -----------------------------------------------3333--------- LRTDLKESAQAGKVLPLVEALRARGVEVELWDERFTTK -------------3333--------------------- >2-C-methyl-D-erythritol 2; SWP:Q8RQP5; PDB:1IV3A; RIGYGEDSHRLEEGRPLYLCGLLIPSPVGALAHSDGDAAMHALTDALLSAYGLGDIGLLF ------------------iiii--------------------------------3333-- PDTDPRWRGERSEVFLREAMRLVEARGAKLLQASLVLTLDRPKLGPHRKALVDSLSRLMR 1111--2222-------------1111----------------3333------------- LPQDRIGLTFKTSEGLAPSHVQARAVVLLD -1111-------iiii-------------- >MALTOOLIGOSYL TREHALOSE S; SWP:Q53688; PDB:1IV8A; MISATYRLQLNKNFNFGDVIDNLWYFDLGVSHLYLSPVLMASPGSNHGYDVIDHSRINDE ----------1111--------3333---------------2222-------1111--11 
LGGEKEYRRLIETAHTIGLGIIQDIVPNHMAVNSLNWRLMDVLMGSYYTYFDFFPEDDKI 11------------1111--------------11113333------1111---3333--- RLPILGEDLDTVISKGLLKIVKDGDEYFLEYFKWKLPLTEVGNDIYDTLQKQNYTLMSWK ------------------------------!!!!-------------1111------333 NPPSYRRFFDVNTLIGVNVEDHVFQESHSILDLDVDGYRIDHIDGLYDPEKYINDLRSII 3------!!!!---------3333-----1111--------1111--------------- KNIIIVEKILGFQEELKLNSDGTTGYDFLNYSNLLFNFNQEIMDSIYENFTAEKISISES ----------1111------------------1111-------------------3333- IKIKAQIIDELFSYEVRLASQLGISYDILRDYLSCIDVYRTYANQIVKECDKTNEIEEAT ------------3333-------------------------%%%%-3333---------- KRNPEAYTKLQQYMPAVYAAYEDTFLFRYNRLISINEVGSDLRYYKISPDQFHVFNQKRR ------------3333-------3333----3333-22221111---3333-------22 GKITLNATSTHDTKFSEDVRMKISVLSEFPEEWKNVEEWHSIINPKVSRNDEYRYYQVLV 22---------------------3333--------------------------------- GSFYEGFSNDFERIQHMISVREAINTSWRNQNKEYENRVMELVEETFTNKDFIKSFMKFE -------3333--------3333---3333------------------------------ SIRRIGMIKSLSLVALKIMSAGIPDFYQGTEIWRYLLTDPDNRVPVDFKKLHEILEKSKF --------------------------2222------------------------1111-- EKNMLESMDDGRIKMYLTYLLSLRKQLAEDFLKGEYKGLDLEEGLCGFIRFNKILVIIKT -3333-3333---------------------------------------%%%%------- KGSVNYKLKLEEGAIYTDVLTGEEIKKEVQINELPRILVRM 1111------------------------------------- ------------------------------------------------ >ISOVALERYL-COA DEHYDROGEN; SWP:P26440; PDB:1IVHA; VDDAINGLSEEQRQLRQTMAKFLQEHLAPKAQEIDRSNEFKNLREFWKQLGNLGVLGITA ---3333-------------------3333---------1111-------------1111 PVQYGGSGLGYLEHVLVMEEISRASGAVGLSYGAHSNLCINQLVRNGNEAQKEKYLPKLI 3333-------------------------------------------------------- SGEYIGALAMSEPNAGSDVVSMKLKAEKKGNHYILNGNKFWITNGPDADVLIVYAKTDLA -----------------3333--------------------2222------------111 AVPASRGITAFIVEKGMPGFSTSKKLDKLGMRGSNTCELIFEDCKIPAANILGHENKGVY 1-3333-------2222-----------------------------3333---2222--- VLMSGLDLERLVLAGGPLGLMQAVLDHTIPYLHVREAFGQKIGHFQLMQGKMADMYTRLM 
-----------------------------3333---iiii-------------------- ACRQYVYNVAKACDEGHCTAKDCAGVILYSAECATQVALDGIQCFGGNGYINDFPMGRFL ------------1111-----------------------------3333-3333------ RDAKLYEIGAGTSEVRRLVIGRAFNAD --3333-----------------1111 >IGG-KAPPA M29B FV (LIGHT ; SWP:NA; PDB:1IVLA; DIELTQSPATLSVTPGNSVSISCRASQSIGNRLFWYQQKSHESPRLLIKYASQSISGIPS -------------2222-----------!!!!----------------------222233 RFSGSGSGTDFTLSINSVETEDLAVYFCQQVSEWPFTFGGGTKLEIK 33----------------1111------------------------- >LYSOZYME M; SWP:P08905; PDB:1IVMA; KVYERCEFARTLKRNGMAGYYGVSLADWVCLAQHESNYNTRATNYNRGDQSTDYGIFQIN ----------------2222----------------------------------1111-- SRYWCNDGKTPRAVNACGINCSALLQDDITAAIQCAKRVVRDPQGIRAWVAWRAHCQNRD --------------3333-3333---------------3333--33333333-------- LSQYIRNCGV 3333------ >THIOESTERASE I; SWP:P29679; PDB:1IVNA; ADTLLILGDSLSAGYRMSASAAWPALLNDKWSKTSVVNASISGDTSQQGLARLPALLKQH --------3333-----3333-------------------2222---------------- QPRWVLVELGGNDGLRGFQPQQTEQTLRQILQDVKAANAEPLLMQIRLPANYGRRYNEAF ---------1111-----3333------------1111----------1111-------- SAIYPKLAKEFDVPLLPFFMEEVYLKPQWMQDDGIHPNRDAQPFIADWMAKQLQPLVNH --------1111---------33333333-1111---3333------------3333-- >EPIDERMAL GROWTH FACTOR R; SWP:P00533; PDB:1IVOA; EEKKVCQGTSNKLTQLGTFEDHFLSLQRMFNNCEVVLGNLEITYVQRNYDLSFLKTIQEV -----------------3333--------2222-----------------3333------ AGYVLIALNTVERIPLENLQIIRGNMYYENSYALAVLSNYDANKTGLKELPMRNLQEILH ---------------1111---------------------1111---------------- GAVRFSNNPALCNVESIQWRDIVSSDFLSNMSMDFQNHLGSCQKCDPSCPNGSCWGAGEE ------------1111-3333--------------------------------------- NCQKLTKIICAQQCSGRCRGKSPSDCCHNQCAAGCTGPRESDCLVCRKFRDEATCKDTCP ---------------------------1111-------------------!!!!------ PLMLYNPTTYQMDVNPEGKYSFGATCVKKCPRNYVVTDHGSCVRACGADSYEMEEDGVRK --------------1111------------1111--3333-------------------- CKKCEGPCRKVCNGIGIGEFKDSLSINATNIKHFKNCTSISGDLHILPVAFRGDSFTHTP -------------2222--1111-------1111------------3333---------- 
PLDPQELDILKTVKEITGFLLIQAWPENRTDLHAFENLEIIRGRTKQHGQFSLAVVSLNI --3333---1111------------------3333-----------iiii---------- TSLGLRSLKEISDGDVIISGNKNLCYANTINWKKLFGTSGQKTKIISNRGENSCKATGQV -------------------------1111--------2222------------3333--- CHALCSPEGCWGPEPRDCVSCRNVSRGRECV -------------1111--------%%%%-- >HUMAN PROTECTIVE PROTEIN; SWP:P10619; PDB:1IVYA; APDQDEIQRLPGLAKQPSFRQYSGYLKSSGSKHLHYWFVESQKDPENSPVVLWLNGGPGC -3333-----------------------!!!!-----------3333------------- SSLDGLLTEHGPFLVQPDGVTLEYNPYSWNLIANVLYLESPAGVGFSYSDDKFYATNDTE ---------------3333-----1111------------2222----3333-------- VAQSNFEALQDFFRLFPEYKNNKLFLTGESYAGIYIPTLAVLVMQDPSMNLQGLAVGNGL ---------------3333--------------------------3333----------- SSYEQNDNSLVYFAYYHGLLGNRLWSSLQTHCCSQNKCNFYDNKDLECVTNLQEVARIVG --------------1111------------------------------------------ NSGLNIYNLYAPCAGGVPSHFRYEKDTVVVQDLGNIFTRLPLKRMWHQALLRSGDKVRMD ----1111----------------------------1111-----33333333------- PPCTNTTAASTYLNNPYVRKALNIPEQLPQWDMCNFLVNLQYRRLYRSMNSQYLKLLSSQ 2222-------1111----1111-3333----------------------------3333 KYQILLYNGDVDMACNFMGDEWFVDSLNQKMEVQRRPWLVKYGDSGEQIAGFVKEFSHIA -------------------------------------------------------2222- FLTIKGAGHMVPTDKPLAAFTMFSRFLNKQPY ---------3333------------1111--- >HYPOTHETICAL PROTEIN 1110; SWP:Q9D1H1; PDB:1IVZA; GSSGSSGSSSSQHFNLNFTITNLPYSQDIAQPSTTKYQQTKRSIENALNQLFRNSSIKSY -------------------------3333----------------------1111----- FSDCQVLAFRSVSNNNNHTGVDSLCNFSPLARRVDRVAIYEEFLRMTHNGTQLLNFTLDR ---------------------------3333---3333---------------------3 KSVFVDSGPSSG 333--------- ------------------------------------------------------- >GASTRIC H/K-ATPASE; SWP:P19156; PDB:1IWCA; MGKAENYELYQVELGPGPSGDMAAKMSKKKAGRG ---1111---------------3333-------- >ERVATAMIN B; SWP:P60994; PDB:1IWDA; LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS -----3333--------------3333-----------------------------1111 HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRLRVVSINGFQRVTRNNESAL 
!!!!------------------3333---------------------------------- QSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVRNSW ---1111----------------------------------------iiii--------- GQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTKA 1111-iiii--------1111%%%%---------- >ADENYLOSUCCINATE SYNTHETA; SWP:P28650; PDB:1IWEA; ATGSRVTVVLGAQWGDEGKGKVVDLLATDADIVSRCQGGNNAGHTVVVDGKEYDFHLLPS ------------------------3333-------------------%%%%--------- GIINTKAVSFIGNGVVIHLPGLFEEAEKNEKKGLKDWEKRLIISDRAHLVFDFHQAVDGL ---1111----1111-------------3333---3333-----------3333------ QEVQRQAQEGKNIGTTKKGIGPTYSSKAARTGLRICDLLSDFDEFSARFKNLAHQHQSMF ------------------------------------------------------1111-1 PTLEIDVEGQLKRLKGFAERIRPMVRDGVYFMYEALHGPPKKVLVEGANAALLDIDFGTY 111-----------------3333-------------------------1111------- PFVTSSNCTVGGVCTGLGIPPQNIGDVYGVVKAYTTRVGIGAFPTEQINEIGDLLQNRGH --------3333-------1111-------------------1111-------------- EWGVTTGRKRRCGWLDLMILRYAHMVNGFTALALTKLDILDVLSEIKVGISYKLNGKRIP -----------------------------------33331111----------iiii--- YFPANQEILQKVEVEYETLPGWKADTTGARKWEDLPPQAQSYVRFVENHMGVAVKWVGVG ----3333-----------------1111-3333-------------------------- KSRESMIQLF ---------- >OUTER-MEMBRANE LIPOPROTEI; SWP:P39178; PDB:1IWLA; DAASDLKSRLDKVSSFHASFTQKVTDVQEGQGDLWVKRPNLFNWHMTQPDESILVSDGKT ------------------------------------------------------------ LWFYNPFVEQATATWLKDATGNTPFMLIARNQSSDWQQYNIKQNGDDFVLTPKASNGNLK ----1111------3333-------------33331111----!!!!------------- QFTINVGRDGTIHQFSAVEQDDQRSSYQLKSQQNGAVDAAKFTFTPPQGVTVDDQRK ------1111--------1111----------------------------------- >OUTER MEMBRANE LIPOPROTEI; SWP:P24208; PDB:1IWMA; GKSPDSPQWRQHQQDVRNLNQYQTRGAFAYISDQQKVYARFFWQQTGQDRYRLLLTNPLG -------------------------------3333---------------------1111 STELELNAQPGNVQLVDNKGQRYTADDAEEMIGKLTGMPIPLNSLRQWILGLPGDATDYK --------2222----1111--------------------3333--3333--!!!!---- LDDQYRLSEITYSQNGKNWKVVYGGYDTKTQPAMPANMELTDGGQRIKLKMDNWIVK 
-1111---------------------------------------------------- >GLYCEROL DEHYDRATASE ALPH; SWP:Q59476; PDB:1IWPA; MKRSKRFAVLAQRPVNQDGLIGEWPEEGLIAMDSPFDPVSSVKVDNGLIVELDGKRRDQF ------------------------1111-----1111-------%%%%---%%%%1111- DMIDRFIADYAINVERTEQAMRLEAVEIARMLVDIHVSREEIIAITTAITPAKAVEVMAQ -----------------3333---------1111-----------------------111 MNVVEMMMALQKMRARRTPSNQCHVTNLKDNPVQIAADAAEAGIRGFSEQETTVGIARYA 1-------------------------1111-------------------------3333- PFNALALLVGSQCGRPGVLTQCSVEEATELELGMRGLTSYAETVSVYGTEAVFTDGDDTP --------------2222-----------------------------------1111-33 WSKAFLASAYASRGLKMRYTSGTGSEALMGYSESKSMLYLESRCIFITKGAGVQGLQNGA 33--------1111-------2222-1111-%%%%------------------------! VSCIGMTGAVPSGIRAVLAENLIASMLDLEVASANDQTFSHSDIRRTARTLMQMLPGTDF !!!--33332222-----------1111---------------------3333------- IFSGYSAVPNYDNMFAGSNFDAEDFDDYNILQRDLMVDGGLRPVTEAETIAIRQKAARAI --------3333--------1111------------------------------------ QAVFRELGLPPIADEEVEAATYAHGSNEMPPRNVVEDLSAVEEMMKRNITGLDIVGALSR ------------3333--------3333---------------------3333-----11 SGFEDIASNILNMLRQRVTGDYLQTSAILDRQFEVVSAVNDINDYQGPGTGYRISAERWA 11-----------3333--11112222--1111---3333------2222---------- EIKNIPGVVQPDTIE ----2222-1111-- >Glycerol dehydrase beta s; SWP:O08505; PDB:1IWPB; FTLKTREGGVASADERADEVVIGVGPAFDKHQHHTLIDMPHGAILKELIAGVEEEGLHAR ------------------------1111------1111--------------1111---- VVRILRTSDVSFMAWDAANLSGSGIGIGIQSKGTTVIHQRDLLPLSNLELFSQAPLLTLE --------------------1111-----3333---------1111------3333---- TYRQIGKNAARYARKESPSPVPVVNDQMVRPKFMAKAALFHIKETKHVVQDAEPVTLHID -------------------------11113333-----------11112222-------- LVRE ---- >Glycerol dehydratase smal; SWP:Q59475; PDB:1IWPG; KTMRVQDYPLATRCPEHILTPTGKPLTDITLEKVLSGEVGPQDVRISRQTLEYQAQIAEQ ---1111-3333-3333--1111-3333-----------3333----------------- MQRHAVARNFRRAAELIAIPDERILAIYNALRPFRSSQAELLAIADELEHTWHATVNAAF -----------33333333------------2222------------------------- VRESAEVYQQRHKLRKGS ------------------ 
>PCOC COPPER RESISTANCE PR; SWP:Q47454; PDB:1IX2A; PELKSSVPQADSAVAAPEKIQLNFSENLTVKFSGAKLTTGKGSSHSPPVAAKVAPGADPK --------2222----------------3333-------------------------111 SVIIPREPLPAGTYRVDWRAVSSDTHPITGNYTFTVK 1--------------------3333------------ >FKBP; SWP:O52980; PDB:1IX5A; MVDKGVKIKVDYIGKLESGDVFDTSIEEVAKEAGIYAPDREYEPLEFVVGEGQLIQGFEE ---------------------------3333-------------------------3333 AVLDMEVGDEKTVKIPAEKAYGNRNEMLIQKIPRDAFKEADFEPEEGMVILAEGIPATIT ---------------3333-------------3333------------------------ EVTDNEVTLDFNHELAGKDLVFTIKIIEVVE ------------1111--------------- >SUPEROXIDE DISMUTASE; SWP:P00448; PDB:1IX9A; SYTLPSLPYAYDALEPHFDKQTMEIHHTKHHQTYVNNANAALESLPEFANLPVEELITKL ---------1111--------------------------1111-3333---333311111 DQLPADKKTVLRNNAGGHANHSLFWKGLKKGTTLQGDLKAAIERDFGSVDNFKAEFEKAA 1111111-----------------1111-------------------------------- ASRFGSGWAWLVLKGDKLAVVSTANQDSPLMGEAISGASGFPIMGLDVWEHAYFLKFQNR -------------!!!!------!!!!3333-3333------------3333----!!!! RPDYIKEFWNVVNWDEAAARFAAKK -----3333---------------- >LYSR-TYPE REGULATORY PROT; SWP:P27102; PDB:1IXCA; EFRQLKYFIAVAEAGNAAAAKRLHVSQPPITRQQALEADLGVVLLEIELTAAGHAFLEDA ------------------------------------------------------------ RRILELAGRSGDRSRAAARGDVGELSVAYFGTPIYRSLPLLLRAFLTSTPTATVSLTHTK -------------------------------3333-------------1111-------- DEQVEGLLAGTIHVGFSRFFPRHPGIEIVNIAQEDLYLAVHRSQSGKFGKTCKLADLRAV ----------------------1111--------------33333333----33331111 ELTLFPRGGRPSFADEVIGLFKHAGIEPRIARVVEDATAALALTAGAASSIVPASVAAIR ---------------------1111---------------------------3333---- WPDIAFARIVGTRVKVPISCIFRKEKQPPILARFVEHVRRSAKD 2222------1111------------------------------ >CYLINDROMATOSIS TUMOUR-SU; SWP:Q9NQC7; PDB:1IXDA; GSSGSSGLAMPPGNSHGLEVGSLAEVKENPPFYGVIRWIGQPPGLNEVLAGLELEDECAG ---------3333-----2222-------------------------------------- CTDGTFRGTRYFTCALKKALFVKLKSCRPDSRFASLQPSGPSSG -----%%%%-------------3333------------------ >PHOSPHATE-BINDING PROTEIN; SWP:P06128; PDB:1IXH; 
EASLTGAGATFPAPVYAKWADTYQKETGNKVNYQGIGSSGGVKQIIANTVDFGASDAPLS ---------1111----------------------------------------------- DEKLAQEGLFQFPTVIGGVVLAVNIPGLKSGELVLDGKTLGDIYLGKIKKWDDEAIAKLN ------------------------22222222-----------------1111------1 PGLKLPSQNIAVVRRADGSGTSFVFTSYLAKVNEEWKNNVGTGSTVKWPIGLGGKGNDGI 111--------------------------------------------------------- AAFVQRLPGAIGYVEYAYAKQNNLAYTKLISADGKPVSPTEENFANAAKGADWSKTFAQD ------2222----3333-1111-------1111-----------1111--3333----- LTNQKGEDAWPITSTTFILIHKDQKKPEQGTEVLKFFDWAYKTGAKQANDLDYASLPDSV ------------------------------------------------1111----3333 VEQVRAAWKTNIKDSSGKPLY -------------1111---- >METHYLTRANSFERASE; SWP:O50082; PDB:1IXKA; LSPSLDKLLRLGYSKLFADRYFQLWGERAIRIAEAEKPLPRCFRVNTLKISVQDLVKRLN ---------------------------------------------3333----------1 KKGFQFKRVPWAKEGFCLTREPFSITSTPEFLTGLIYIQEASSYPPVALDPKPGEIVADA 111-----1111-----------1111---1111-----3333--------2222----- AAPGGKTSYLAQLRNDGVIYAFDVDENRLRETRLNLSRLGVLNVILFHSSSLHIGELNVE -------------------------------------------------3333-1111-- FDKILLDAPCTGSGTIHRTDDIKFCQGLQRLLEKGLEVLKPGGILVYSTCSLEPEENEFV -----------3333--------------------11112222---------3333---- IQWALDNFDVELLPLKYGEPALTNPFGIELSEEIKNARRLYPDVHETSGFFIAKIRKL -----------------------2222---3333------1111-------------- >HYPOTHETICAL PROTEIN PH11; SWP:O58863; PDB:1IXLA; IPVEQRTHKLTSRILVGKPILIKEGYAEVELETIDEKVDEKGLVHGGFTFGLADYAALAV -------11113333-------2222------------1111------------------ NEPTVVLGKAEVRFTKPVKVGDKLVAKAKIIEDLGKKKIVEVKVYREEEVVLEGKFYCYV -1111-------------2222-----------!!!!--------!!!!----------- LEKHVLD ---1111 >SPORULATION RESPONSE REGU; SWP:P06535; PDB:1IXMA; SDTALTNELIHLLGHSRHDWMNKLQLIKGNLSLQKYDRVFEMIEEMVIDAKHESKLSNLK ------------------------------------------------------------ TPHLAFDFLTFNWKTHYMTLEYEVLGEIKDLSAYDQKLAKLMRKLFHLFDQAVSRESENH ---------1111------------------3333------------------1111--- LTVSLQTDHPDRQLILYLDFHGAFADPSAFDIMRFEITSHECLIEIGL 
-------------------------3333------------------- >HOLLIDAY JUNCTION DNA HEL; SWP:Q9F1Q3; PDB:1IXRA; MIRYLRGLVLKKEAGGFVLLAGGVGFFLQAPTPFLQALEEGKEVGVHTHLLLKEEGLSLY --------------------------------------2222------------------ GFPDEENLALFELLLSVSGVGPKVALALLSALPPRLLARALLEGDARLLTSASGVGRRLA ------------------------------------------------3333------33 ERIALELKGKVPPHL 33-3333-------- >PROBABLE 26S PROTEASOME R; SWP:P50086; PDB:1IXVA; NYPLHQACMENEFFKVQELLHSKPSLLLQKDQDGRIPLHWSVSFQAHEITSFLLSKMENV ----------------------1111----1111-3333-----------------1111 NLDDYPDDSGWTPFHIACSVGNLEVVKSLYDRPLKPDLNKITNQGVTCLHLAVGKKWFEV 3333--------------------------------1111-1111-------1111---- SQFLIENGASVRIKDKFNQIPLHRAASVGSLKLIELLCGLGKSAVNWQDKQGWTPLFHAL ----1111------1111-3333---------------1111------1111-3333--- AEGHGDAAVLLVEKYGAEYDLVDNKGAKAEDVALNEQVKKFFLNNVVDA ---------------------------3333-----------1111--- >COAGULATION FACTORS IX/X-; SWP:P23806; PDB:1IXXA; DCLSGWSSYEGHCYKAFEKYKTWEDAERVCTEQAKGAHLVSIESSGEADFVAQLVTQNMK --2222--%%%%---------------------2222----------------------- RLDFYIWIGLRVQGKVKQCNSEWSDGSSVSYENWIEAESKTCLGLEKETDFRKWVNIYCG ----------------------1111--------3333-------3333--------111 QQNPFVCEA 1-------- >ATP-DEPENDENT METALLOPROT; SWP:Q9LCZ4; PDB:1IXZA; TEAPKVTFKDVAGAEEAKEELKEIVEFLKNPSRFHEMGARIPKGVLLVGPPGVGKTHLAR ------3333-------------------------------------------------- AVAGEARVPFITASGSDFVEMFVGVGAARVRDLFETAKRHAPCIVFIDEIDAVGRNDERE -----------------1111---------------1111--------3333-------- QTLNQLLVEMDGFEKDTAIVVMAATNRPDILDPALLRPGRFDRQIAIDAPDVKGREQILR -------------1111---------3333-3333-2222-------------------- IHARGKPLAEDVDLALLAKRTPGFVGADLENLLNEAALLAAREGRRKITMKDLEEAAS 1111----1111--------2222---------------------------------- >OMSVP3; SWP:P05586; PDB:1IY6A; AVSVDCSEYPKCACTMEYRPLCGSDNKTYGNKCNFCCAVVESNGTLTLSHFGKC ----------------------1111---------------------------- >LEVODIONE REDUCTASE; SWP:Q9LBG2; PDB:1IY8A; RFTDRVVLITGGGSGLGRATAVRLAAEGAKLSLVDVSSEGLEASKAAVLETAPDAEVLTT 
-2222----------------------------------------------1111----- VADVSDEAQVEAYVTATTERFGRIDGFFNNAGIEGKQNPTESFTAAEFDKVVSINLRGVF --1111--------------------------------3333------------------ LGLEKVLKIMREQGSGMVVNTASVGGIRGIGNQSGYAAAKHGVVGLTRNSAVEYGRYGIR ----------------------1111---------------------------3333--- INAIAPGAIWTPMVENSMKQLDPENPRKAAEEFIQVNPSKRYGEAPEIAAVVAFLLSDDA ---------------------3333------1111-1111---3333---------3333 SYVNATVVPIDGGQSAAY ----------iiii---- >SPERMIDINE SYNTHASE; SWP:P70998; PDB:1IY9A; SELWYTEKQTKNFGITMKVNKTLHTEQTEFQHLEMVETEEFGNMLFLDGMVMTSEKDEFV ---------1111---------------------------------iiii---3333--- YHEMVAHVPLFTHPNPEHVLVVGGGDGGVIREILKHPSVKKATLVDIDGKVIEYSKKFLP -----------------------3333--------3333--------------------- SIAGKLDDPRVDVQVDDGFMHIAKSENQYDVIMVDSTEPVGPAVNLFTKGFYAGIAKALK --1111-1111----------1111----------------------------------- EDGIFVAQTDNPWFTPELITNVQRDVKEIFPITKLYTANIPTYPSGLWTFTIGSKKYDPL ---------------------------------------1111--------------111 AVEDSRFFDIETKYYTKDIHKAAFVLPKFVSDLI 1-3333---------------1111-33333333 >RIBONUCLEASE; SWP:Q7XZV5; PDB:1IYBA; FAQDFDFFYFVQQWPGSYCDTKQSCCYPKTGKPASDFGIHGLWPNNNDGSYPSNCDSNSP -------------3333----------1111--------------1111------1111- YDQSQVSDLISRMQQNWPTLACPSGTGSAFWSHEWEKHGTCAENVFDQHGYFKKALDLKN -33331111--------------------------------3333--------------- QINLLEILQGAGIHPDGGFYSLNSIKNAIRSAIGYAPGIECNVDESGNSQLYQIYICVDG --------1111-------------------------------1111-----------11 SGSNLIECPIFPRGKCGSSIEFPTF 11----------------------- >SCARABAECIN; SWP:Q86SC0; PDB:1IYCA; ELPKLPDDKVLIRSRSNCPKGKVWNGFDCKSPFAFS ------------------------------3333-- >BRANCHED-CHAIN AMINO ACID; SWP:P00510; PDB:1IYEA; KADYIWFNGEMVRWEDAKVHVMSHALHYGTSVFEGIRCYDSHKGPVVFRHREHMQRLHDS ------iiii--3333---11113333-------------1111---------------- AKIYRFPVSQSIDELMEACRDVIRKNNLTSAYIRPLIFVGDVGMGVNPPAGYSTDVIIAA -----------------------1111---------------------2222-------- FPWGAYLGAEALEQGIDAMVSSWNRAAPNTIPTAAKAGGNYLSSLLVGSEARRHGYQEGI 
----1111-3333-------------2222------3333-----------1111----- ALDVNGYISEGAGENLFEVKDGVLFTPPFTSSALPGITRDAIIKLAKELGIEVREQVLSR --1111-------------iiii----1111---------------------------33 ESLYLADEVFMSGTAAEITPVRSVDGIQVGEGRCGPVTKRIQQAFFGLFTGETEDKWGWL 33---------------------iiii-!!!!--------------1111----1111-- DQVN ---- >PARKIN; SWP:O60260; PDB:1IYFA; MIVFVRFNSSHGFPVEVDSDTSIFQLKEVVAKRQGVPADQLRVIFAGKELRNDWTVQNCD ---------------------3333------1111--------------------3333- LDQQSIVHIVQRPWRK ---------------- >HYPOTHETICAL PROTEIN (201; SWP:NA; PDB:1IYGA; GSSGSSGMEAVLNELVSVEDLKNFERKFQSEQAAGSVSKSTQFEYAWCLVRSKYNEDIRR ----------------3333----------------------------3333-------- GIVLLEELLPKGSKEEQRDYVFYLAVGNYRLKEYEKALKYVRGLLQTEPQNNQAKELERL --------1111-3333-----------11113333------------------------ IDKAMKKSGPSSG ------------- >DELETED IN SPLIT HAND/SPL; SWP:Q13437; PDB:1IYJA; QPVDLGLLEEDDEFEEFPHVWEDNWDDDNVEDDFSNQLRAELEKH ---------1111---------------------------3333- >Breast cancer type 2 susc; SWP:O35923; PDB:1IYJB; FPQFNKDLMSSLQNARDLQDIRIKNKERHHLCPQPGSLYLTKSSTLPRISLQAAVGDSVP ------3333-------------3333----------1111------------------- SACSPKQLYMYGVSKACISVNSKNAEYFQFAIEDHFGKEALCAGKGFRLADGGWLIPSDD ------3333---3333------3333---3333-----3333----------------- GKAGKEEFYRALCDTPGVDPKLISSVWVSNHYRWIVWKLAAMEFAFPKEFANRCLNPERV ----3333----------3333---------------------------------3333- LLQLKYRYDVEIDNSSRSALKKILERDDTAAKTLVLCVSDIISLSKVDTIELTDGWYAVK -----------------3333--------------------------------------- AQLDPPLLALVKSGRLTVGQKIITQGAELVGSPDACAPLEAPDSLRLKISANSTRPARWH ---3333---------2222-----------------------------1111------- SKLGFFHDPRPFPLPLSSLFSDGGNVGCVDVIVQRVYPLQWVEKTVSGSYIFRNEREEEK --------------3333------------------------------------------ EALRFSRDVSTVWKLRVTSYKKREKSALLSIWRPSSDLPSLLTEGQRYRIYHLSVSKSKN ----------------------------------1111---------------------- KFEWPSIQLTATKRTQYQQLPVSSETLLQLYQPRELLPFSKLSDPAFQPPCSEVDVVGVV -----------1111----------------------3333--------%%%%------- 
VSVVKPIGLAPLVYLSDECLHLLVVKFGIDLNEDIKPRVLIAASNLQWRPESTSRVPTLF ----------------1111---------------------------------------- AGNFSVFSASPKEAHFQERVTNMKHAIENIDTFYKEAEKKLIQVLKGDSPK ------------------------3333-------------1111------ >MYRISTOYL-COA:PROTEIN N-M; SWP:P30418; PDB:1IYKA; EGPIDKLKTPEDVPNDPLPLISDFEWSTLDIDDNLQLDELYKLLYDNYVEDIDATFRFKY --------3333-----------------1111-----------------1111------ SHEFFQWALKPPGWRKDWHVGVRVKSTGKLVAFIAATPVTFKLNKSNKVIDSVEINFLCI ----------22221111-------------------------1111------------- HKKLRNKRLAPVLIKEITRRVNKQNIWQALYTGGSILPTPLTTCRYQHRPINWSKLHDVG 1111-----------------1111-------------------------------1111 FSHLPPNQTKSSMVASYTLPNNPKLKGLRPMTGKDVSTVLSLLYKYQERFDIVQLFTEEE ----2222----------------2222---3333----------3333----------- FKHWMLGHDENSDSNVVKSYVVEDENGIITDYFSYYLLPFTVLDNAQHDELGIAYLFYYA -----------------------1111--------------------------------- SDSFEKPNYKKRLNELITDALITSKKFGVDVFNCLTCQDNTYFLKDCKFGSGDGFLNYYL --1111----------------3333----------!!!!----1111------------ FNYRTFPMDGGIDKKTKEVVEDQTSGIGVVLL -------------------------------- ------------------------------------------------------- >CHLOROPLASTIC ASCORBATE P; SWP:Q8LNY5; PDB:1IYNA; AASDSAQLKSAREDIKELLKTKFCHPIMVRLGWHDAGTYNKNIEEWPQRGGANGSLRFDV -----------------------------------11111111---3333---3333-33 ELKHGANAGLVNALNLLKPIKDKYSGVTYADLFQLASATAIEEAGGPKIPMKYGRVDVTE 33-3333----------------1111--------------1111--------------3 PEQCPEEGRLPDAGPPSPAQHLRDVFYRMGLNDKEIVALSGAHTLGRSRPDRSGWGKPET 333-----------------------1111------------------1111-------1 KYTKDGPGAPGGQSWTAQWLKFDNSYFKDIKERRDEDLLVLPTDAALFEDPSFKVYAEKY 111-------------------------------1111-----3333------------- AADPEAFFKDYAEAHAKLSNLGAKFGPAEGFSLEG -------------------2222---1111----- >DNA FRAGMENTATION FACTOR ; SWP:O00273; PDB:1IYRA; TGISRETSSDVALASHILTALREKQAPELSLSSQDLELVTKEDPKALAVALNWDIKKTET -------------3333--------------3333---11113333-3333--------- VQEACERELALRLQQTQSLHSLR -------------1111------ >BETA-LACTAMASE TOHO-1; SWP:Q47066; 
PDB:1IYSA; NSVQQQLEALEKSSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAAAAVLKQSESD ---------------------------------1111------------------33331 KHLLNQRVEIKKSDLVNYNPIAEKHVNGTMTLAELGAAALQYSDNTAMNKLIAHLGGPDK 111-------3333------33332222-------------------------------- VTAFARSLGDETFRLDRTEPTLNTAIPGDPRDTTTPLAMAQTLKNLTLGKALAETQRAQL -----1111----------------2222------------------------------- VTWLKGNTTGSASIRAGLPKSWVVGDKTGSGDYGTTNDIAVIWPENHAPLVLVTYFTQPE ---1111--11113333-1111---------%%%%------------------------- QKAERRRDILAAAAKIVTHGF -----3333------1111-- >DIHYDROLIPOAMIDE ACETYLTR; SWP:P10802; PDB:1IYU; SEIIRVPDIGGDGEVIELLVKTGDLIEVEQGLVVLESAKASMEVPSPKAGVVKSVSVKLG --------------------2222---------------------------------222 DKLKEGDAIIELEPAAGAR 2--2222------------ >ENOLASE; SWP:Q8GR70; PDB:1IYXA; SIITDVYAREILDSRGNPTIEVEVYTESGAFGRGMVPSGASTGEYEAVELRDGDKARYGG ------------1111---------3333------------------------3333iii KGVTKAVDNVNNIIAEAIIGYDVRDQMAIDKAMIALDGTPNKGKLGANAILGVSIAVARA i--------------------1111-------------3333------------------ AADYLEVPLYHYLGGFNTKVLPTPMMNIINGGSHADNSIDFQEFMIMPVGAPTFKEALRM ------------------------------!!!!-------------1111--------- GAEVFHALAAILKSRGLATSVGDEGGFAPNLGSNEEGFEVIIEAIEKAGYVPGKDVVLAM ------------1111-----1111--------3333----------------------- DAASSEFYDKEKGVYVLADSGEGEKTTDEMIKFYEELVSKYPIISIEDGLDENDWDGFKK --3333---3333----3333-----------------------------1111------ LTDVLGDKVQLVGDDLFVTNTQKLSEGIEKGIANSILIKVNQIGTLTETFEAIEMAKEAG ----3333-----3333---------------------3333--------------1111 YTAVVSHRSGETEDSTISDIAVATNAGQIKTGSLSRTDRIAKYNQLLRIEDQLGEVAEYK --------------3333-----------------3333-------------!!!!---! 
GLKSFYNLKAA !!!-1111--- >QUINONE OXIDOREDUCTASE; SWP:Q8L3C8; PDB:1IZ0; KAWVLKRLGGPLELVDLPEPEAEEGEVVLRVEAVGLNFADHLRLGAYLTRLHPPFIPGEV ------2222------------2222----------3333-------------------- VGVVEGRRYAALVPQGGLAERVAVPKGALLPLPEGLSPEEAAAFPVSFLTAYLALKRAQA ---iiii-----------------3333----22223333-------------------- RPGEKVLVQAAAGALGTAAVQVARAGLRVLAAASRPEKLALPLALGAEEAATYAEVPERA 2222-----1111---------------------1111-------------3333----- KAWGGLDLVLEVRGKEVEESLGLLAHGGRLVYIAPIPPLRLRRNLAVLGFWLTPLLREGA 1111-----------3333-11112222------------------------3333---- LVEEALGFLLPRLGRELRPVVGPVFPFAEAEAAFRALLDRGHTGKVVVRL --------3333-------------3333---------3333-------- >QUINONE OXIDOREDUCTASE; SWP:Q8L3C8; PDB:1IZ0A; KAWVLKRLGGPLELVDLPEPEAEEGEVVLRVEAVGLNFADHLRLGAYLTRLHPPFIPGEV ------2222------------2222----------3333-------------------- VGVVEGRRYAALVPQGGLAERVAVPKGALLPLPEGLSPEEAAAFPVSFLTAYLALKRAQA ---iiii-----------------3333----22223333-------------------- RPGEKVLVQAAAGALGTAAVQVARAGLRVLAAASRPEKLALPLALGAEEAATYAEVPERA 2222-----1111---------------------1111-------------3333----- KAWGGLDLVLEVRGKEVEESLGLLAHGGRLVYIAPIPPLRLRRNLAVLGFWLTPLLREGA 1111-----------3333-11112222------------------------3333---- LVEEALGFLLPRLGRELRPVVGPVFPFAEAEAAFRALLDRGHTGKVVVRL --------3333-------------3333---------3333-------- >PROLIFERATING CELL NUCLEA; SWP:O73947; PDB:1IZ5A; PFEIVFEGAKEFAQLIDTASKLIDEAAFKVTEDGISMRAMDPSRVVLIDLNLPSSIFSKY ------------------------------3333------1111--------3333---- EVVEPETIGVNLDHLKKILKRGKAKDTLILKKGEENFLEITIQGTATRTFRVPLIDVEEP ------------------33331111---------------------------------- ELPFTAKVVVLGEVLKAAVKAASLVSDSIKFIARENEFIMKAEGETQEVEIKLTLEDEGL ----------3333-------3333--------2222-----------------1111-- LDIEVQEETKSAYGVSYLSDMVKGLGKADEVTIKFGNEMPMQMEYYIRDEGRLTFLLAPR ---------------------11111111------2222--------------------- >INITIATION FACTOR 5A; SWP:O50089; PDB:1IZ6A; GDKTKVQVSKLKPGRYIIIDDEPCRIVNITVSSPGKHGSAKARIEAVGIFDGKVRSIVKP 
------3333--------%%%%------------1111---------------------1 TSAEVDVPIIDKKTAQVIAITPDTVQIMDMETYETFEVPIDTGVADEIRDQLKEGINVEY 111-----------------------------------3333--3333----2222---- WETLGRIKIMRIKGEG --iiii-----2222- >macrophomate synthase int; SWP:Q9UVD4; PDB:1IZCA; AKSYSEQPELHAKAPYRSAMLTYPGNLRQALKDAMADPSKTLMGVAHGIPSTFVTKVLAA --33333333---1111-------------------3333------------------11 TKPDFVWIDVEHGMFNRLELHDAIHAAQHHSEGRSLVIVRVPKHDEVSLSTALDAGAAGI 11--------------------------1111--------------------1111---- VIPHVETVEEVREFVKEMYYGPIGRRSFSPWTFSPGIADASLFPNDPYNVATSNNHVCII ------3333-------------------1111------------1111----------- PQIESVKGVENVDAIAAMPEIHGLMFGPGDYMIDAGLDLNGALSGVPHPTFVEAMTKFST -----------------3333--------------------------------------- AAQRNGVPIFGGALSVDMVPSLIEQGYRAIAVQFDVWGLSRLVHGSLAQARASAKQFAG --1111--------1111-----------------------------------3333-- >ASPARTIC PROTEINASE; SWP:Q9URD0; PDB:1IZDA; AATGSVTTNPTSNDEEYITQVTVGDDTLGLDFDTGSADLWVFSSQTPSSERSGHDYYTPG ----------2222--------!!!!----------------111133332222-----1 SSAQKIDGATWSISYGDGSSASGDVYKDKVTVGGVSYDSQAVESAEKVSSEFTQDTANDG 111--2222-----3333-------------iiii-------------------1111-- LLGLAFSSINTVQPTPQKTFFDNVKSSLSEPIFAVALKHNAPGVYDFGYTDSSKYTGSIT -----3333---------3333-3333-----------------------1111------ YTDVDNSQGFWGFTADGYSIGSDSSSDSITGIADTGTTLLLLDDSIVDAYYEQVNGASYD -----1111----------!!!!----------1111-----3333-------2222--- SSQGGYVFPSSASLPDFSVTIGDYTATVPGEYISFADVGNGQTFGGIQSNSGIGFSIFGD 1111----1111--------!!!!----3333------%%%%-------2222-----33 VFLKSQYVVFDASGPRLGFAAQA 33--------------------- >PROTEINASE; SWP:Q536T2; PDB:1IZIA; PQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMNLPGRWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------2222-------- QILIEICGHKVIGTVLVGPTPTNVIGRNLLTQIGCTLNF -----iiii----------------3333-1111----- >Photosystem II reaction c; SWP:Q8CM25; PDB:1IZLD; GILLFPCAYLALGGWLTGTTFVTSWYTHGLASSYLEGCNFLTVAVSTEAQGDFTRWCQLG 3333---------------------------%%%%------------------------- 
GLWTFIALHGAFGLIGFMLRQFEIARLVGVRPYNAIAFSAPIAVFVSVFLIYPLGFAPSF --1111---------3333------------3333------------------------- GVAAIFRFLLFFQGFHNWTLNPFHMMGVAGVLGGALLCAIHGATVENTYSMVTANRFWSQ ----1111----1111-----3333---------------1111---------------- IFGIAFSNKRWLHFFMLFVPVTGLWMSAIGVVGLALNLRSYDFISQEIRAAEDPEFETFY -1111-----%%%%---------------------3333---------1111------11 TKNLLLNEGIRAWMAPQDQPHENFVFPEEVLPRGNAL 11-1111-------------------%%%%------- >HYPOTHETICAL PROTEIN HI08; SWP:P44882; PDB:1IZMA; LISHSDNQQLKSAGIGFNATELHGFLSGLLCGGLKDQSWLPLLYQFSNDNHAYPTGLVQP --3333----1111---------------1111--------------%%%%--------- VTELYEQISQTLSDVEGFTFELGLTEDENVFTQADSLSDWANQFLLGIGLAQPELAKEKG ----------------------------3333-------------------1111----- EIGEAVDDLQDICQLGYDEDDNEEELAEALEEIIEYVRTIALFYSHFN -----------1111--------------------------------- >CAPZ ALPHA-1 SUBUNIT; SWP:P13127; PDB:1IZNA; RVSDEEKVRIAAKFITHAPPGEFNEVFNDVRLLLNNDNLLREGAAHAFAQYNMDQFTPVK --------------11112222----------------3333----------1111---- IEGYDDQVLITEHGDLGNGRFLDPRNKISFKFDHLRKEASDPQPEDTESALKQWRDACDS ----------3333---------------------------------------------- ALRAYVKDHYPNGFCTVYGKSIDGQQTIIACIESHQFQPKNFWNGRWRSEWKFTITPPTA -------------------------------------3333------------------- QVAAVLKIQVHYYEDGNVQLVSHKDIQDSVQVSSDVQTAKEFIKIIENAENEYQTAISEN ------------------------------------------------------------ YQTMSDTTFKALRRQLPVTRTKIDWNKILSYKIGK -------3333-----3333---3333-------- >F-actin-capping protein s; SWP:P14315; PDB:1IZNB; SDQQLDCALDLMRRLPPQQIEKNLSDLIDLVPSLCEDLLSSVDQPLKIARDKVVGKDYLL ----------1111-3333-----------3333-----------------1111----- CDYNRDGDSYRSPWSNKYDPPLEDGAMPSARLRKLEVEANNAFDQYRDLYFEGGVSSVYL 1111-!!!!--------------------------------------------------- WDLDHGFAGVILIKKAGDGSKKIKGCWDSIHVVEVQEKSSGRTAHYKLTSTVMLWLQTNK -----------------1111--------------------------------------- TGSGTMNLGGSLTRQMEKDETVSDSSPHIANIGRLVEDMENKIRSTLNEIYFGKTKDIVN 1111------------------1111--------------------------------11 
GLRSIDAIPDNQKYKQLQRELSQVLTQRQI 11---------------------------- >CYTOCHROME P450 152A1; SWP:O31440; PDB:1IZOA; PHDKSLDNSLTLLKEGYLFIKNRTERYNSDLFQARLLGKNFICMTGAEAAKVFYDTDRFQ --------------!!!!------1111-------%%%%--------------------- RQNALPKRVQKSLFGVNAIQGMDGSAHIHRKMLFLSLMTPPHQKRLAELMTEEWKAAVTR 2222-------------3333-------------1111---------------------3 WEKADEVVLFEEAKEILCRVACYWAGVPLKETEVKERADDFIDMVDAFGAVGPRHWKGRR 333--------------------------3333-----------1111---3333----- ARPRAEEWIEVMIEDARAGLLKTTSGTALHEMAFHTQEDGSQLDSRMAAIELINVLRPIV ---------------1111----2222---------1111-----------------333 AISYFLVFSALALHEHPKYKEWLRSGNSREREMFVQEVRRYYPFGPFLGALVKKDFVWNN 3--------------3333--------------------------------------%%% CEFKKGTSVLLDLYGTNHDPRLWDHPDEFRPERFAEREENLFDMIPQGGGHAEKGHRCPG %--2222------3333-3333--1111-3333-----------------1111------ EGITIEVMKASLDFLVHQIEYDVPEQSLHYSLARMPSLPESGFVMSGIRRK --------------------------------------3333--------- >ANTI-CEA MAB T84.66, LIGH; SWP:A0N8W2; PDB:1J05L; DIVLTQSPASLAVSLGQRATMSCRAGESVDI -------------2222-------------- >GLUTAREDOXIN-LIKE PROTEIN; SWP:O57917; PDB:1J08A; GLISEEDKRIIKEEFFSKMVNPVKLIVFIGKEHCQYCDQLKQLVQELSELTDKLSYEIVD ---------------1111----------------------------------------1 FDTPEGKELAEKYRIDRAPATTITQDGKDFGVRYFGIPAGHEFAAFLEDIVDVSKGDTDL 111-------1111----------iiii----------!!!!------------------ MQDSKEEVSKIDKDVRILIFVTPTCPYCPLAVRMAHKFAIENTKAGKGKILGDMVEAIEY -------1111----------1111-3333--------------------------1111 PEWADQYNVMAVPKIVIQVNGEDKVQFEGAYPEKMFLEKLLSALS ----1111----------iiii---------3333-----3333- >GLUTAMYL-TRNA SYNTHETASE; SWP:P27000; PDB:1J09A; MVVTRIAPSPTGDPHVGTAYIALFNYAWARRNGGRFIVRIEDTDRARYVPGAEERILAAL --------------3333-----------1111---------------2222-------- KWLGLSYDEGPDVGGPHGPYRQSERLPLYQKYAEELLKRGWAYRAFETPEELEQIRKEKG --------------------3333------------------------------------ GYDGRARNIPPEEAEERARRGEPHVIRLKVPRPGTTEVKDELRGVVVYDNQEIPDVVLLK -----1111---------------------------------------3333-------1 
SDGYPTYHLANVVDDHLMGVTDVIRAEEWLVSTPIHVLLYRAFGWEAPRFYHMPLLRNPD 111---3333---------------33331111------------------------111 KTKISKRKSHTSLDWYKAEGFLPEALRNYLCLMGFSMPDGREIFTLEEFIQAFTWERVSL 1---1111---3333------3333-----------1111-------------3333--- GGPVFDLEKLRWMNGKYIREVLSLEEVAERVKPFLREAGLSWESEAYLRRAVELMRPRFD -----------------------------------1111---------------1111-- TLKEFPEKARYLFTEDYPVSEKAQRKLEEGLPLLKELYPRLRAQEEWTEAALEALLRGFA 3333----3333------------------------------------------------ AEKGVKLGQVAQPLRAALTGSLETPGLFEILALLGKERALRRLERALA -----3333----------------------1111------------- >1-AMINOCYCLOPROPANE-1-CAR; SWP:O57809; PDB:1J0AA; MHPKIFALLAKFPRVELIPWETPIQYLPNISREIGADVYIKRDDLTGLGIGGNKIRKLEY --------1111-----------------------------------!!!!--------- LLGDALSKGADVVITVGAVHSNHAFVTGLAAKKLGLDAILVLRGKEELKGNYLLDKIMGI -----1111--------1111----------1111------------------------- ETRVYDAKDSFELMKYAEEIAEELKREGRKPYVIPPGGASPIGTLGYVRAVGEIATQSEV --------1111----------------------2222-3333----------------- KFDSIVVAAGSGGTLAGLSLGLSILNEDIRPVGIAVGRFGEVMTSKLDNLIKEAAELLGV ----------------------1111---------------------------------- KVEVRPELYDYSFGEYGKITGEVAQIIRKVGTREGIILDPVYTGKAFYGLVDLARKGELG --------------2222--------------------3333------------------ EKILFIHTGGISGTFHYGDKLLSLL ---------3333----3333---- >HYPOTHETICAL PROTEIN 1810; SWP:NA; PDB:1J0GA; GSEGAATMSKVSFKITLTSDPRLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAIITND -------------------1111-------33333333-----3333------------- GIGINPAQTAGNVFLKHGSELRIIPRDRVGSC --------3333-------------------- >NEOPULLULANASE; SWP:P38940; PDB:1J0HA; MRKEAIYHRPADNFAYAYDSETLHLRLRTKKDDIDRVELLHGDPYDWQNGAWQFQMMPMR -3333-----!!!!---------------2222---------1111-%%%%--------- KTGSDELFDYWFAEVKPPYRRLRYGFVLYSGEEKLVYTEKGFYFEVPTDDTAYYFCFPFL ----1111---------------------!!!!----1111---------3333------ HRVDLFEAPDWVKDTVWYQIFPERFANGNPSISPEGSRPWGSEDPTPTSFFGGDLQGIID 3333----3333----------------3333-2222-2222------------------ 
HLDYLVDLGITGIYLTPIFRSPSNHKYDTADYFEVDPHFGDKETLKTLIDRCHEKGIRVM ------------------------------1111-1111-------------1111---- LDAVFNHCGYEFAPFQDVWKNGESSKYKDWFHIHEFPLQTEPRPNYDTFAFVPQMPKLNT --------1111--------!!!!1111-----------------------1111----- ANPEVKRYLLDVATYWIREFDIDGWRLDVANEIDHEFWREFRQEVKALKPDVYILGEIWH ----------------------------3333----------------3333-------- DAMPWLRGDQFDAVMNYPFTDGVLRFFAKEEISARQFANQMMHVLHSYPNNVNEAAFNLL -3333---------------------------------------11113333-------- GSHDTSRILTVCGGDIRKVKLLFLFQLTFTGSPCIYYGDEIGMTGGNDPECRKCMVWDPM -1111----1111------------1111------2222------------------333 QQNKELHQHVKQLIALRKQYRSLRRGEISFLHADDEMNYLIYKKTDGDETVLVIINRSDQ 3------------------3333------------------------------------- KADIPIPLDARGTWLVNLLTGERFAAEAETLCTSLPPYGFVLYAIEHW -----------------------------------2222--------- >CYTOCHROME C3; SWP:P00132; PDB:1J0PA; AAPKAPADGLKMDKTKQPVVFNHSTHKAVKCGDCHHPVNGKEDLQKCATAGCHDNMDKKD ---------------------333311111111----iiii----1111-------1111 KSAKGYYHAMHDKGTKFKSCVGCHLETAGADAAKKKELTGCKGSKCHS -1111----------------------!!!!------------3333- >INTERLEUKIN-18; SWP:Q14116; PDB:1J0SA; YFGKLESKLSVIRNLNDQVLFIDQGNRPLFEDMTDSDCRDNAPRTIFIISMYKDSQPRGM -------------1111-----1111----------3333-------------------- AVTISVKCEKISTLSCENKIISFKEMNPPDNIKDTKSDIIFFQRSVPGHDNKMQFESSSY ------------------------------------1111-------------------- EGYFLACEKERDLFKLILKKEDELGDRSIMFTVQNED ---------!!!!---------11111111------- >MOLT-INHIBITING HORMONE; SWP:P55847; PDB:1J0TA; ASFIDNTCRGVMGNRDIYKKVVRVCEDCTNIFRLPGLDGMCRNRCFYNEWFLICLKAANR ---------3333----3333------3333-------3333-%%%%--------1111- EDEIEKFRVWISILNAGQ ------------1111-- >DOWNSTREAM OF TYROSINE KI; SWP:Q9P104; PDB:1J0WA; QSERFNVYLPSPNLDVHGECALQITYEYICLWDVQNPRVKLISWPLSALRRYGRDTTWFT ----------1111----------1111----------------1111------------ FEAGRCETGEGLFIFQTRDGEAIYQKVHSAALAIAEL ------------------------------------- >Glyceraldehyde-3-phosphat; SWP:P46406; PDB:1J0XO; 
VKVGVNGFGRIGRLVTRAAFNSGKVDVVAINDPFIDLHYMVYMFQYDSTHGKFHGTVKAE -------------------------------1111------------------------i NGKLVINGKAITIFQERDPANIKWGDAGAEYVVESTGVFTTMEKAGAHLKGGAKRVIISA iii--iiii--------3333-3333--------------3333---------------- PSADAPMFVMGVNHEKYDNSLKIVSNASTTNCLAPLAKVIHDHFGIVEGLMTTVHAITAT --------22221111-1111-------3333-------------------------333 QKTVDGPSGKLWRDGRGAAQNIIPASTGAAKAVGKVIPELNGKLTGMAFRVPTPNVSVVD 3---------3333--3333-------33331111-3333-------------------- LTCRLEKAAKYDDIKKVVKQASEGPLKGILGYTEDQVVSCDFNSDTHSSTFDAGAGIALN -----------------------1111----------33332222------3333----- DHFVKLISWYDNEFGYSNRVVDLMVHMASKE ---------------------------1111 >RADIXIN; SWP:P26043; PDB:1J19A; PKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTVGLREVWFFGLQYVDSKGYSTWLKL ---------------------------------------3333------1111-----33 NKKVTQQDVKKENPLQFKFRAKFFPEDVSEELIQEITQRLFFLQVKEAILNDEIYCPPET 333333--------------------3333------------------------------ AVLLASYAVQAKYGDYNKEIHKPGYLANDRLLPQRVLEQHKLTKEQWEERIQNWHEEHRG ----------------3333-22221111---3333-------------------1111- MLREDSMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEHDDKLTPKIGFP -------------1111-2222------3333-------1111----1111--------3 WSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRILALCMGNHELYMRRRKPDTIE 333------!!!!------1111--------3333---------------1111------ VQQMKAQARVDSSGAA ---------------- >GLYCOGEN SYNTHASE KINASE-; SWP:P49841; PDB:1J1BA; SKVTTVVATPGQGPDRPQEVSYTDTKVIGNGSFGVVYQAKLCDSGELVAIKKVLQDKRFK -------------------------------------------------------1111- NRELQIMRKLDHCNIVRLRYFFYSSGEKKDEVYLNLVLDYVPETVYRVARHYSRAKQTLP ------1111-1111----------3333------------------------------- VIYVKLYMYQLFRSLAYIHSFGICHRDIKPQNLLLDPDTAVLKLCDFGSAKQLVRGEPNV ----------------------------3333----1111------1111---2222--- SYICSRYYRAPELIFGATDYTSSIDVWSAGCVLAELLLGQPIFPGDSGVDQLVEIIKVLG ---------3333----------------------------------------------- TPTREQIREMNPNYTEFKFPQIKAHPWTKVFRPRTPPEAIALCSRLLEYTPTARLTPLEA 
-------------------------3333--33333333----------3333------- CAHSFFDELRDPNVKLPNGRDTPALFNFTTQELSSNPPLATILIPPHARIQAAA --333333331111-1111---------33333333-3333---3333------ >Troponin T, cardiac muscl; SWP:P45379; PDB:1J1DB; QTEREKKKKILAERRKVLAIDHLNEDQLREKAKELWQTIYNLEAEKFDLQEKFKQQKYEI -------------------11113333--------------------------------- NVLRNRINDN ----3333-- >Troponin I, cardiac muscl; SWP:P19429; PDB:1J1DC; AKKKSKISASRKLQLKTLLLQIAKQELEREAEERRGEKGRALSTRAQPLELAGLGFAELQ -------3333------------------------------------------------- DLARQLHARVDKVDEERYDIEAKVTKNITEIADLTQKIFDLRRRVRISADAMMQALLG -------------------------------------3333----------------- >SMALL PROTEIN B; SWP:Q8RR57; PDB:1J1HA; MAPVLENRRARHDYEILETYEAGIALKGTEVKSLRAGKVDFTGSFARFEDGELYLENLYI --------------------------1111--3333------------------------ APYEKGSYANVDPRRKRKLLLHKHELRRLLGKVEQKGLTLVPLKIYFNERGYAKVLLGLA ---------------------3333------3333------------1111--------- RGK --- >META CLEAVAGE COMPOUND HY; SWP:Q84II3; PDB:1J1IA; AYVERFVNAGGVETRYLEAGKGQPVILIHGGGAGAESEGNWRNVIPILARHYRVIAMDML --------iiii-------------------22223333-------------------22 GFGKTAKPDIEYTQDRRIRHLHDFIKAMNFDGKVSIVGNSMGGATGLGVSVLHSELVNAL 22--------------------------------------------------3333---- VLMGSAGLVVEYDFTREGMVHLVKALTNDGFKIDDAMINSRYTYATDEATRKAYVATMQW ---------------------------1111----------------------------- IREQGGLFYDPEFIRKVQVPTLVVQGKDDKVVPVETAYKFLDLIDDSWGYIIPHCGHWAM -1111----33331111--------1111---3333---------------------333 IEHPEDFANATLSFLSLR 3------------3333- >TRANSLIN; SWP:Q15631; PDB:1J1JA; MSVSEIFVELQGFLAAEQDIREEIRKVVQSLEQTAREILTLLQGVHQGAGFQDIPKRCLK -------------------------------------------1111---1111------ AREHFGTVKTHLTSLKTKFPAEQYYRFHEHWRFVLQRLVFLAAFVVYLETETLVTREAVT --------------3333-333333333333----------------------------- EILGIEPDREKGFHLDVEDYLSGVLILASELSRLSVNSVTAGDYSRPLHISTFINELDSG --------------------------------------------3333------------ FRLLNLKNDSLRKRYDGLKYDVKKVEEVVYDLSIRGF 
1111--------------------------------- >PIRIN; SWP:O00625; PDB:1J1LA; SSKKVTLSVLSREQSEGVGARVRRSIGRPELKNLDPFLLFDEFKGGRPGGFPDHPHRGFE ------------------------2222-------------------------------- TVSYLLEGGSAHEDFCGHTGKNPGDLQWTAGRGILHAEPCSEEPAHGLQLWVNLRSSEKV ---------------------2222----!!!!---------------------3333-- EPQYQELKSEEIPKPSKDGVTVAVISGEALGIKSKVYTRTPTLYLDFKLDPGAKHSQPIP -------3333-----iiii--------iiii-----------------2222------2 KGWTSFIYTISGDVYIGPDDAQQKIEPHHTAVLGEGDSVQVENKDPKRSHFVLIAGEPLR 222--------------3333--------------------------------------- EPVIQHGPFVNTNEEISQAILDFRNAKNGFERAKTWKSKIGN -----!!!!--------------------1111-----1111 >ALGQ2; SWP:Q9KWT5; PDB:1J1NA; KEATWVTDKPLTLKIHMHFRDKWVWDENWPVAKESFRLTNVKLQSVANKAATNSQEQFNL -1111--------------------1111------------------1111--------- MMASGDLPDVVGGDNLKDKFIQYGQEGAFVPLNKLIDQYAPHIKAFFKSHPEVERAIKAP 1111------------------------------------------------------11 DGNIYFIPYVPDGVVARGYFIREDWLKKLNLKPPQNIDELYTVLKAFKEKDPNGNGKADE 11------------------------------------------------1111------ VPFIDRHPDEVFRLVNFWGARSSGSDNYMDFYIDNGRVKHPWAETAFRDGMKHVAQWYKE ---------------------------------iiii--1111--------------111 GLIDKEIFTRKAKAREQMFGGNLGGFTHDWFASTMTFNEGLAKTVPGFKLIPIAPPTNSK 1--1111-----------1111---------3333---------2222---------111 GQRWEEDSRQKVRPDGWAITVKNKNPVETIKFFDFYFSRPGRDISNFGVPGVTYDIKNGK 1------------------1111-------------------------2222----iiii AVFKDSVLKSPQPVNNQLYDMGAQIPIGFWQDYDYERQWTTPEAQAGIDMYVKGKYVMPG ------------------1111--------------1111-------------------- FEGVNMTREERAIYDKYWADVRTYMYEMGQAWVMGTKDVDKTWDEYQRQLKLRGLYQVLQ ------3333---------------------------3333------------------- MMQQAYDRQYKN ------------ >ANTIVIRAL PROTEIN S; SWP:P23339; PDB:1J1QA; INTITFDAGNATINKYATFMESLRNEAKDPSLKCYGIPMLPNTNSTIKYLLVKLQGASLK ------3333------------------3333-iiii------------------1111- TITLMLRRNNLYVMGYSDPYDNKCRYHIFNDIKGTEYSDVENTLCPSSNPRVAKPINYNG -------------------%%%%----------3333--------1111----------- 
LYPTLEKKAGVTSRNEVQLGIQILSSDIGKISGQGSFTEKIEAKFLLVAIQMVSEAARFK ------------3333--------------2222-------------------------- YIENQVKTNFNRDFSPNDKVLDLEENWGKISTAIHNSKNGALPKPLELKNADGTKWIVLR --------1111-------------------------iiii--------1111------3 VDEIKPDVGLLNYVNGTCQAT 3333333-------------- >ALGINATE LYASE; SWP:P84143; PDB:1J1TA; STIPSSITSGSIFDLEGDNPNPLVDDSTLVFVPLEAQHITPNGNGWRHEYKVKESLRVAM ------1111--------------1111---3333----1111---------3333--11 TQTYEVFEATVKVEMSDGGKTIISQHHASDTGTISKVYVSDTDESGFNDSVANNGIFDVY 11-------------2222----------------------------------------- VRLRNTSGNEEKFALGTMTSGETFNLRVVNNYGDVEVTAFGNSFGIPVEDDSQSYFKFGN ----1111----------2222--------iiii----iiii------------------ YLQSQDPYTLDKCGEAGNSNSFKNCFEDLGITESKVTMTNVTYTRETN --------------2222------------------------------ >CHROMOSOMAL REPLICATION I; SWP:P03004; PDB:1J1VA; VTIDNIQKTVAEYYKIKVADLLSKRRSRSVARPRQAALAKELTNHSLPEIGDAFGGRDHT ----------------3333------3333-------3333----------------333 TVLHACRKIEQLREESHDIKEDFSNLIRTLSS 3---------3333--------------1111 >ATP-DEPENDENT RNA HELICAS; SWP:Q8TZH8; PDB:1J24A; GVKVVVDSRELRSEVVKRLKLLGVKLEVKTLDVGDYIISEDVAIERKSANDLIQSIIDGG ------3333---------------------------------------------1111- LFDQVKRLKEAYSRPIMIVEGSLYGIRNVHPNAIRGAIAAVTVDFGVPIIFSSTPEETAQ -----------------------------3333--------------------------- YIFLIAKREQEER ------------- >IMMATURE COLON CARCINOMA ; SWP:Q8R035; PDB:1J26A; GSSGSSGEHAKQASSYIPLDRLSISYCRSSGPGGQNVNKVNSKAEVRFHLASADWIEEPV -----------------3333---------------------------333311113333 RQKIALTHKNKINKAGELVLTSESSRYQFRNLAECLQKIRDMIAEASGPSSG -------3333----------------------------------------- >HYPOTHETICAL PROTEIN TT17; SWP:Q84BR1; PDB:1J27A; MKAYLGLYTARLETPARSLKEKRALIKPALERLKARFPVSAARLYGLDAWGYEVVGFTLL ---------------------------------------------1111----------- GNDPAWVEETMRAAARFLAEAGGFQVALEEFRLEAFEL -----------------1111----------------- >URICASE; SWP:Q45697; PDB:1J2GA; RVMYYGKGDVFAYRTYLKPLTGVRTIPESPFSGRDHILFGVNVKISVGGTKLLTSFTKGD 
-------------------------3333-------------------11113333---- NSLVVATDSMKNFIQKHLASYTGTTIEGFLEYVATSFLKKYSHIEKISLIGEEIPFETTF 1111-3333-------------------------------1111---------------- AVKNGNRAASELVFKKSRNEYATAYLNMVRNEDNTLNITEQQSGLAGLQLIKVSGNSFVG --iiii------------------------1111-------------------------- FIRDEYTTLPEDSNRPLFVYLNIKWKYKNTEDSFGTNPENYVAAEQIRDIATSVFHETET ---1111---------------------3333----3333--3333-------------- LSIQHLIYLIGRRILERFPQLQEVYFESQNHTWDKIVEEIPESEGKVYTEPRPPYGFQCF -----------------1111------------------2222----------------- TVTQED --3333 >ADP-ribosylation factor-b; SWP:Q9UJY5; PDB:1J2JB; IFEDEEKSKMLARLLKSSHPEDLRAANKLIKEMVQEDQKRM -------------1111-3333------------------- >DISINTEGRIN TRIFLAVIN; SWP:P21859; PDB:1J2LA; GEECDCGSPSNPCCDAATCKLRPGAQCADGLCCDQCRFKKKRTICRIARGDFPDDRCTGQ -------1111----------2222----1111%%%%--2222----------------- SADCPRWN -------- >17-KDA PKC-POTENTIATED IN; SWP:O18734; PDB:1J2MA; GPGGSPGGLQKRHARVTVKYDRRELQRRLDVEKWIDGRLEELYRGREADMPDEVNIDELL -------------------------------------------------------3333- ELESEEERSRKIQGLLKSCTNPTENFVQELLVKLRGLHK ---------------3333-------------------- >Fusion of Rhombotin-2 and; SWP:P25801; PDB:1J2OA; GSLLTCGGCQQNIGDRYFLKAIDQYWHEDCLSCDLCGCRLGEVGRRLYYKLGRKLCRRDY --------------------------1111------------------------------ LRLGGSGGHMGSGGDVMVVGEPTLMGGEFGDEDERLITRLENTQFDAANGIDDE ------------------------------------------------------ >PROTEASOME ALPHA SUBUNIT; SWP:O29760; PDB:1J2PA; PQMGYDRAITVFSPDGRLFQVEYAREAVKRGATAIGIKCKEGVILIADKRVGSKLLEKDT -------1111-1111-3333-----3333--------1111----------11113333 IEKIYKIDEHICAATSGLVADARVLIDRARIEAQINRLTYDIPITVKELAKKICDFKQQY -------1111---------------------------------------------3333 TQYGGVRPFGVSLLIAGVNEVPKLYETDPSGALLEYKATAIGMGRMAVTEFFEKEYRDDL ---------------------------1111----------1111--------------- SFDDAMVLGLVAMGLSIESELVPENIEVGYVKVDDRTFKEVSPEELKPYVERANERIREL ---------------------1111------3333------3333-3333---------- LKK --- >Proteasome subunit beta [; 
SWP:Q9P996; PDB:1J2QH; TTTVGLVCKDGVVMATEKRATMGNFIASKAAKKIYQIADRMAMTTAGSVGDAQFLARIIK ---------------------!!!!----------------------------------- IEANLYEIRRERKPTVRAIATLTSNLLNSYRYFPYLVQLLIGGIDSEGKSIYSIDPIGGA ---------------------------1111-----------------------1111-- IEEK ---- >HYPOTHETICAL ISOCHORISMAT; SWP:P37347; PDB:1J2RA; MLELNAKTTALVVIDLQEGILPFAGGPHTADEVVNRAGKLAAKFRASGQPVFLVRVGWSA ----3333--------33333333--------------------1111----------11 DYAEALKQPVDAPSPAKVLPENWWQHPAALGTTDSDIEIIKRQWGAFYGTDLELQLRRRG 11-----------------1111---3333--3333----------2222------1111 IDTIVLCGISTNIGVESTARNAWELGFNLVIAEDACSAASAEQHNNSINHIYPRIARVRS ---------1111---------1111-----1111---------------3333------ VEEILNAL ----1111 >Acyl-[acyl-carrier-protei; SWP:O25927; PDB:1J2ZA; SKIAKTAIISPKAEINKGVEIGEFCVIGDGVKLDEGVKLHNNVTLQGHTFVGKNTEIFPF ---1111--1111--------2222--1111--2222--------------2222----- AVLGTQPQDLKYKGEYSELIIGEDNLIREFCMINPGTEGGIKKTLIGDKNLLMAYVHVAH --------3333------------------------3333------------------22 DCVIGSHCILANGVTLAGHIEIGDYVNIGGLTAIHQFVRIAKGCMIAGKSALGKDVPPYC 22--------2222--------------2222--2222--2222------------2222 TVEGNRAFIRGLNRHRMRQLLESKDIDFIYALYKRLFRPIPSLRESAKLELEEHANNPFV ----------------------------------1111---------------------- KEICSFILESSRGVAYKSS ------1111--------- >144AA LONG HYPOTHETICAL R; SWP:Q96XZ7; PDB:1J30A; DLKGTKTAENLKQGFIGESMANRRYLYFAKRADEEGYPEIAGLLRSIAEGETAHAFGHLD -2222------------------------------------------------------- FIRQGGLTDPATDKPIGTLEQMIESAIAGETYEWTQMYPGFAKVAREEGFPEVAEWFETL --1111---------------------------------------1111----------- ARAEKSHAEKFQNVLKQLKGG -----------------3333 >HYPOTHETICAL PROTEIN PH06; SWP:O58376; PDB:1J31A; VKVGYIQEPKILELDKNYSKAEKLIKEASKEGAKLVVLPELFDTGYNFESREEVFDVAQQ ---------2222---------------1111--------3333-----3333------- IPEGETTTFLELARELGLYIVAGTAEKSGNYLYNSAVVVGPRGYIGKYRKIHLFYREKVF ---------------------------!!!!--------1111----------!!!!--- FEPGDLGFKVFDIGFAKVGVICFDWFFPESARTLALKGAEIIAHPANLVPYAPRAPIRAL 
----------------------3333--------1111------------3333------ ENRVYTITADRVGEERGLKFIGKSLIASPKAEVLSIASETEEEIGVVEIDLNLARNKRLN --------------iiii---------1111------------------3333------- DNDIFKDRREEYYFR --3333--3333--- >ASPARTATE AMINOTRANSFERAS; SWP:Q8RR70; PDB:1J32A; MKLAARVESVSPSMTLIIDAKAKAMKAEGIDVCSFSAGEPDFNTPKHIVEAAKAALEQGK ---3333-------3333-------1111------------------------------- TRYGPAAGEPRLREAIAQKLQRDNGLCYGADNILVTNGGKQSIFNLMLAMIEPGDEVIIP ----3333--------------------3333-------------------2222----- APFWVSYPEMVKLAEGTPVILPTTVETQFKVSPEQIRQAITPKTKLLVFNTPSNPTGMVY ---3333----------------3333---------11111111---------------- TPDEVRAIAQVAVEAGLWVLSDEIYEKILYDDAQHLSIGAASPEAYERSVVCSGFAKTYA ------------1111------1111---%%%%---3333-3333-----------1111 MTGWRVGFLAGPVPLVKAATKIQGHSTSNVCTFAQYGAIAAYENSQDCVQEMLAAFAERR 1111--------------------------3333-------------------------- RYMLDALNAMPGLECPKPDGAFYMFPSIAKTGRSSLDFCSELLDQHQVATVPGAAFGADD ---------2222--------------3333--3333--------------3333--111 CIRLSYATDLDTIKRGMERLEKFLHGIL 1-----------------------1111 >COAGULATION FACTOR IX-BIN; SWP:P23806; PDB:1J34A; DCPSGWSSYEGHCYKPFKLYKTWDDAERFCTEQAKGGHLVSIESAGEADFVAQLVTENIQ --2222-----------------------11112222----------------------- NTKSYVWIGLRVQGKEKQCSSEWSDGSSVSYENWIEAESKTCLGLEKETGFRKWVNIYCG ----------------------1111--------3333-------3333--------111 QQNPFVCEA 1-------- >Coagulation factor IX/fac; SWP:P23807; PDB:1J34B; DCPSDWSSYEGHCYKPFSEPKNWADAENFCTQQHAGGHLVSFQSSEEADFVVKLAFQTFG --1111--iiii-----------------11112222----------------------- HSIFWMGLSNVWNQCNWQWSNAAMLRYKAWAEESYCVYFKSTNNKWRSRACRMMAQFVCE -----------1111---1111---------------------------1111------- FQA --- ---------------------------------- >ANGIOTENSIN CONVERTING EN; SWP:Q10714; PDB:1J36A; IQAKEYLENLNKELAKRTNVETEAAWAYRSAITDENEKKKNEISAELAKFMKEVASDTTK --------------------------------------------------------3333 FQWRSYQSEDLKRQFKALTKLGYAALPEDDYAELLDTLSAMESNFAKVKVCDYKDSTKCD 
-1111----3333---3333-3333-------------------1111---1111----- LALDPEIEEVISKSRDHEELAYYWREFYDKAGTAVRSQFERYVELNTKAAKLNNFTSGAE -------------------------------3333---------------1111------ AWLDEYEDDTFEQQLEDIFADIRPLYQQIHGYVRFRLRKHYGDAVVSETGPIPMHLLGNM -3333--1111----------3333----------------3333-------------11 WAQQWSEIADIVSPFPEKPLVDVSAEMEKQAYTPLKMFQMGDDFFTSMNLTKLPQDFWDK 11--3333------3333---------1111--------------1111----3333--- SIIEKPTDGRDLVCHASAWDFYLIDDVRIKQCTRVTQDQLFTVHHELGHIQYFLQYQHQP -------------------------------------------------------33333 FVYRTGANPGFHEAVGDVLSLSVSTPKHLEKIGLLKDYVRDDEARINQLFLTALDKIVFL 333----3333---3333-------------------------3333------------- PFAFTMDKYRWSLFRGEVDKANWNCAFWKLRDEYSGIEPPVVRSEKDFDAPAKYHISADV ------------------3333---------------------3333-3333-3333--- EYLRYLVSFIIQFQFYKSACIKAGQYDPDNVELPLDNCDIYGSARAGAAFHNMLSMGASK --------------------------3333---1111--2222---------3333---- PWPDALEAFNGERIMSGKAIAEYFEPLRVWLEAENIKNNVHIGWITSNKCVSSHHHHH -----3333---------3333------------------------------------ >50S RIBOSOMAL PROTEIN L13; SWP:O59300; PDB:1J3AA; MRIINADGLILGRLASRVAKMLLEGEEVVIVNAEKAVITGNREVIFSKYKQRTYPKRSDE -----2222------------1111------3333------------------------- IVRRTIRGMLPWKTDRGRKAFRRLKVYVGIPKEFQDKQLETIVEAHVSRLSRPKYVTVGE ------1111----------1111------3333-------33333333-------3333 VAKFLGGKF --------- >ATP-DEPENDENT PHOSPHOENOL; SWP:Q7SIC6; PDB:1J3BA; QRLEALGIHPKKRVFWNTVSPVLVEHTLLRGEGLLAHHGPLVVDTTPYTGRSPKDKFVVR --3333-----------------------------2222------------3333----- EPEVEGEIWWGEVNQPFAPEAFEALYQRVVQYLSERDLYVQDLYAGADRRYRLAVRVVTE 3333------3333------------------1111-----------3333--------- SPWHALFARNMFILPRRFGAFVPGFTVVHAPYFQAVPERDGTRSEVFVGISFQRRLVLIV -------------3333------------1111--3333------------1111----- GTKYAGEIKKSIFTVMNYLMPKRGVFPMHASANVGKEGDVAVFFGLSGTGKTTLSTDPER ---3333-----------3333------------1111-------22223333---1111 PLIGDDEHGWSEDGVFNFEGGCYAKVIRLSPEHEPLIYKASNQFEAILENVVVNPESRRV 
----------1111-----------22223333---------2222-------------- QWDDDSKTENTRSSYPIAHLENVVESGVAGHPRAIFFLSADAYGVLPPIARLSPEEAMYY 11113333-------3333----3333-------------1111---------------- FLSGYTARVPRATFSACFGAPFLPMHPGVYARMLGEKIRKHAPRVYLVNTGWTGGPYGVG ------------------3333---3333--------------------------2222- YRFPLPVTRALLKAALSGALENVPYRRDPVFGFEVPLEAPGVPQELLNPRETWADKEAYD ---3333----------1111-----------------22223333-3333--------- QQARKLARLFQENFQKYASGVAKEVAEAGPRTE ------------33331111-----1111---- >HIGH MOBILITY GROUP PROTE; SWP:P17741; PDB:1J3CA; MKKKDPNAPKRPPSAFFLFCSEYRPKIKSEHPGLSIGDTAKKLGEMWSEQSAKDKQPYEQ ----3333------3333------------3333-----------3333---1111--33 KAAKLKEKYEKDIAAYRAK 33-11113333-------- >SEQA PROTEIN; SWP:P36658; PDB:1J3EA; PLGSAMRELLLSDEYAEQKRAVNRFMLLLSTLYSLDAQAFAEATESLHGRTRVYFAADEQ ------------3333---------------------------1111----------333 TLLKNGNQTKPKHVPGTPYWVITNTNTGRKCSMIEHIMQSMQFPAELIEKVCGTI 31111--------2222----------------------------------1111 >AMPD PROTEIN; SWP:P82974; PDB:1J3GA; MLLDEGWLAEARRVPSPHYDCRPDDENPSLLVVHNISLPPGEFGGPWIDALFTGTIDPNA ---iiii-----------------------------------------3333-------- HPYFAGIAHLRVSAHCLIRRDGEIVQYVPFDKRAWHAGVSSYQGRERCNDFSIGIELEGT -----------------------------------------%%%%--------------% DTLAYTDAQYQQLAAVTNALITRYPAIANNMTGHCNIAPERKTDPGPSFDWARFRALVTP %%%---3333---------------3333---------1111-------3333------- SSHKEMT ------- >Bifunctional dihydrofolat; SWP:P13922; PDB:1J3KA; MMEQVCDVFDIYAICACCKVESKNEGKKNEVFNNYTFRGLGNKGVLPWKCISLDMKYFRA ---3333------------------1111---3333-----iiii-----3333------ VTTYVNESKYEKLKYKRCKYLNKETKKLQNVVVMGRTNWESIPKKFKPLSNRINVILSRT -----3333---------1111----------------11113333--2222-------- LKKEDFDEDVYIINKVEDLIVLLGKLNYYKCFILGGSVVYQEFLEKKLIKKIYFTRINST -3333---------3333----1111---------------------------------- YECDVFFPEINENEYQIISVSDVYTSNNTTLDFIIYKKTNN ----------3333-----------%%%%------------ >Bifunctional dihydrofolat; SWP:P13922; PDB:1J3KC; 
DDEEEDDFVYFNFNKEKEEKNKNSIHPNDFQIYNSLKYKYHPEYQYLNIIYDIMMNGNKQ -3333----1111------------11113333-------3333---------------- SDRTGVGVLSKFGYIMKFDLSQYFPLLTTKKLFLRGIIEELLWFIRGETNGNTLLNKNVR -1111-------------3333--------------------------------1111-1 IWEANGTREFLDNRKLFHREVNDLGPIYGFQWRHFGAEYTNMYDNYENKGVDQLKNIINL 111--------111111112222----------2222---1111-2222----------- IKNDPTSRRILLCAWNVKDLDQMALPPCHILCQFYVFDGKLSCIMYQRSCDLGLGVPFNI ---1111--------33331111-------------%%%%-------------------- ASYSIFTHMIAQVCNLQPAQFIHVLGNAHVYNNHIDSLKIQLNRIPYPFPTLKLNPDIKN -----------1111---------------1111-----3333-----------3333-1 IEDFTISDFTIQNYVHHEKISMDMAA 111-1111--------------3333 >DEMETHYLMENAQUINONE METHY; SWP:P83846; PDB:1J3LA; MEARTTDLSDLYPEGEALPVFKSFGGRARFAGRVRTLRVFEDNALVRKVLEEEGAGQVLF -----3333--1111---------------------------------1111-2222--- VDGGGSLRTALLGGNLARRAWEKGWAGVVVHGAVRDTEELREVPIGLLALAATPKKSAKE --iiii--------------1111-----------3333--------------------- GKGEVDVPLKVLGVEVLPGSFLLADEDGLLLLPEPPSGVRSGG ----------iiii--2222----1111--------------- >THE CONSERVED HYPOTHETICA; SWP:Q84BQ8; PDB:1J3MA; GMRKTLKATLAEARAQVEAALKEEGFGILTEIDVAATLKAKLGLEKPPYLILGACNPNLA ---------------------1111----------------------------------- ARALEALPEIGLLLPCNVVLREAEEGVEVLIQDPKEMFRVLPEATQRALAPVAEEARTRL ----------1111--------1111------3333------------------------ SRALSRL ---1111 >3-OXOACYL-(ACYL-CARRIER P; SWP:Q7SIC5; PDB:1J3NA; MRRVVVTGLGALTPIGVGQEAFHKAQLAGKSGVRPITRFDASALPVRIAAEVDVDPGAYL -------------------------1111-----------3333----------3333-- DRKELRRLDRFVQYALIAAQLALEDAGLKPEDLDPERVGTLVGTGIGGMETWEAQSRVFL 33331111---------------1111-3333-3333----------------------- ERGPNRISPFFIPMMIANMASAHIAMRYGFTGPSSTVVTACATGADALGSALRMIQLGEA --1111-11113333-----------------------!!!!------------------ DLVLAGGTEAAITPMAIGAFAVMRALSTRNEEPEKASRPFTLSRDGFVMGEGAGVLVLEA ------------3333----1111----33331111-2222------------------- YEHAKKRGARIYAELVGFGRSADAHHITEPHPEGKGAALAMARALKDAGIAPEQVGYINA 
----1111----------------------1111----------------1111------ HGTSTPVGDRAEVLAIKRVFGDHAKRLMVSSTKSMIGHLLGAAGAVEAIATVQALYHGVI -------------------!!!!1111---3333----!!!!------------------ PPTINLEDPDPELDLDFVPEPREAKVDYALSNSFAFGGHNAVLAFKRV ---------3333----------------------------------- >PHOSPHOGLUCOSE ISOMERASE; SWP:P84140; PDB:1J3QA; MKYKEPFGVKLDFETGIIENAKKSVRRLSDMKGYFIDEEAWKKMVEEGDPVVYEVYAIEQ -----------------2222-----33332222-------------------------- EEKEGDLNFATTVLYPGKVGNEFFMTKGHYHSKIDRAEVYFALKGKGGMLLQTPEGEARF --2222--------------------------1111----------------1111---- IEMEPGTIVYVPPYWAHRTINTGDKPFIFLALYPADAGHDYGTIAEKGFSKIVVEENGKV ---2222--------------------------1111----------------------- VVKDNPK ---3333 >CYTOCHROME C; SWP:P00001; PDB:1J3SA; GDVEKGKKIFIMKCSQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGYSYTAANKNKGIIWG ---------1111---------------------2222---------------------- EDTLMEYLENPKKYIPGTKMIFVGIKKKEERADLIAYLKKATNE 3333--------------------------------3333---- >INTERSECTIN 2; SWP:Q9NZM3; PDB:1J3TA; GSSGSSGVENLKAQALCSWTAKKDNHLNFSKHDIITVLEQQENWWFGEVHGGRGWFPKSY -----------------------------2222---------------iiii-------- VKIIPGSESGPSSG -------------- >ASPARTASE; SWP:Q9LCC6; PDB:1J3UA; VRIEKDFLGEKEIPKDAYYGVQTIRATENFPITGYRIHPELIKSLGIVKKSAALANMEVG -----1111----1111------------------------------------------- LLDKEVGQYIVKAADEVIEGKWNDQFIVDPIQGGAGTSINMNANEVIANRALELMGEEKG --3333---------------3333--------iiii--------------------222 NYSKISPNSHVNMSQSTNDAFPTATHIAVLSLLNQLIETTKYMQQEFMKKADEFAGVIKM 2----33331111--3333----------------------------------1111--- GRTHLQDAVPILLGQEFEAYARVIARDIERIANTRNNLYDINMGATAVGTGLNADPEYIS --%%%%-----3333-------------------3333---2222-----2222------ IVTEHLAKFSGHPLRSAQHLVDATQNTDCYTEVSSALKVCMINMSKIANDLRLMASGPRA ------------------------------------------------------------ GLSEIVLPARQPGSSIMPGKVNPVMPEVMNQVAFQVFGNDLTITSASEAGQFELNVMEPV --------------------------------------------------!!!!1111-- LFFNLIQSISIMTNVFKSFTENCLKGIKANEERMKEYVEKSIGIITAINPHVGYETAAKL 
----------------------3333-----------------33333333--------- AREAYLTGESIRELCIKYGVLTEEQLNEILNPYEMIHPGIAG 3333-----------------3333-------3333------ >GIDING PROTEIN-MGLB; SWP:Q9X9L0; PDB:1J3WA; LVLYGAPYERAVEVLEETLRETGARYALLIDRKGFVLAHKEALWAPKPPPLDTLATLVAG ------------------------------1111-------3333--------------- NAAATQALAKLLGEARFQEEVHQGERMGLYVDEAGEHALLVLVFDETAPLGKVKLHGKRA ---------1111-------------------------------1111------------ SEALARIAEEALAN -------------- >HIGH MOBILITY GROUP PROTE; SWP:P17741; PDB:1J3XA; MGKGDPNKPRGKMSSYAFFVQTSREEHKKKHPDSSVNFAEFSKKCSERWKTMSAKEKSKF --------------------------3333-------3333------3333-3333---- EDMAKSDKARYDREMKN ---------3333---- >SEX-DETERMINING REGION Y ; SWP:Q05066; PDB:1J46A; MQDRVKRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQK --------------------------11113333--------111133333333------ LQAMHREKYPNYKYRPRRKAKMLPK --------1111------------- >APOPROTEIN OF C1027; SWP:Q06110; PDB:1J48A; APAFSVSPASGLSDGQSVSVSVSGAAAGETYYIAQCAPVGGQDACNPATATSFTTDASGA ------------2222----------------------iiii-------------1111- ASFSFVVRKSYTGSTPEGTPVGSVDCATAACNLGAGNSGLDLGHVALTF --------------1111------3333--------1111--------- >D-LACTATE DEHYDROGENASE; SWP:P26297; PDB:1J4AA; TKIFAYAIREDEKPFLKEWEDAHKDVEVEYTDKLLTPETVALAKGADGVVVYQQLDYIAE --------3333----------1111---------33333333----------------- TLQALADNGITKMSLRNVGVDNIDMAKAKELGFQITNVPVYSPNAIAEHAAIQAARILRQ -----1111----------1111------------------3333--------------- DKAMDEKVARHDLRWAPTIGREVRDQVVGVVGTGHIGQVFMQIMEGFGAKVITYDIFRNP -------1111----------3333-------------------3333------------ ELEKKGYYVDSLDDLYKQADVISLHVPDVPANVHMINDESIAKMKQDVVIVNVSRGPLVD --1111----3333--------------3333--------33332222------3333-- TDAVIRGLDSGKIFGYAMDVYEGEVGIFNEDWEGKEFPDARLADLIARPNVLVTPKTAFY -------------------------------2222------------1111-----3333 TTHAVRNMVVKAFDNNLELVEGKEAETPVKV ------------------------------- >FUSE BINDING PROTEIN; SWP:Q96AE4; PDB:1J4WA; GSHMIDVPIPRFAVGIVIGRNGEMIKKIQNDAGVRIQFKPDDGTTPERIAQITGPPDRAQ 
---------3333-----2222--------------------------------3333-- HAAEIITDLLRSVQQEFNFIVPTGKTGLIIGKGGETIKSISQQSGARIELQRNPPPNADP ----------3333-------33331111-2222-------------------3333-11 NMKLFTIRGTPQQIDYARQLIEEKI 11-------------------1111 >S-100P PROTEIN; SWP:P25815; PDB:1J55A; MTELETAMGMIIDVFSRYSGSEGSTQTLTKGELKVLMEKELPGFLDAVDKLLKDLDANGD -----------------------1111-------------2222---------------- AQVDFSEFIVFVAAITSACHKYFEKAGL ---------------------------- >YVRK PROTEIN; SWP:O34714; PDB:1J58A; PQPIRGDKGATVKIPRNIERDRQNPDLVPPETDHGTVSNKFSFSDTHNRLEKGGYAREVT ----!!!!---------------------1111--------3333--------------1 VRELPISENLASVNRLKPGAIRELHWHKEAEWAYIYGSARVTIVDEKGRSFIDDVGEGDL 1113333---------2222------------------------1111-------2222- WYFPSGLPHSIQALEEGAEFLLVFDDGSFSENSTFQLTDWLAHTPKEVIAANFGVTKEEI ---2222------1111--------11111111-------1111-----------33331 SNLPGKEKYIFENQLPGSLKDDIVEGPNGEVPYPFTYRLLEQEPIESEGGKVYIADSTNF 111--------------3333----1111--------1111-----1111-----33333 KVSKTIASALVTVEPGARELHWHPNTHEWQYYISGKARTVFASDGHARTFNYQAGDVGYV 333----------2222-------------------------%%%%-------------- PFAGHYVENIGDEPLVFLEIFKDDHYADVSLNQWLALPETFVQAHLDLGKDFTDVLSKEK ------------------------------------------------33331111---- HPVV ---- >ANTIFREEZE PROTEIN TYPE 1; SWP:P04002; PDB:1J5BA; DVASDAKAAAELVAANAKAAAELVAANAKAAAEAVAR 3333--------------------------------- ------------------------------------------------------------ -------------------------------------- >APO-NEOCARZINOSTATIN; SWP:P01550; PDB:1J5HA; AAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYDVGQCAWVDTGVLACNPADFSSVTADAN ------------------------------------------------------------ GSASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISFN ---------------------------3333---------------------- >BEKM-1 TOXIN; SWP:Q9BKB7; PDB:1J5JA; RPTDIKCSESYQCFPVCKSRFGKTNGRCVNGFCDCF --------3333----------------iiii---- >METALLOTHIONEIN-1; SWP:P29499; PDB:1J5LA; PCEKCTSGCKCPSKDECAKTCSKPCSCCPT -------------11111111--------- >ASPARTATE DEHYDROGENASE; SWP:NA; PDB:1J5PA; 
HMTVLIIGMGNIGKKLVELGNFKIYAYDRISKDIPGVVRLDEFQVPSDVSTVVECASPEA ---------3333--------------------------------1111----------- VKEYSLQILKNPVNYIIISTSAFADEVFRERFFSELKNSPARVFFPSGAIGGLDVLSSIK -----3333-------------------------3333--------!!!!-3333---33 DFVKNVRIETIKPPKSLGLDLKGKTVVFEGSVEEASKLFPRNINVASTIGLIVGFEKVKV 33----------3333-------------------------------------3333--- TIVADPAMDHNIHIVRISSAIGNYEFKIENISMLTVYSILRTLRNLESKIIFG ----1111----------1111------------------------------- >MAJOR CAPSID PROTEIN; SWP:P30328; PDB:1J5QA; TFFKTVYRRYTNFAIESIQQTINGSVGFGNKVSTQISRNGDLITDIVVEFVLTKGGNGGT --------------------------2222--------------------------2222 TYYPAEELLQDVELEIGGQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQ ---------------iiii------------------------------------2222- KRFYVPLIFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPTMSV ---------11113333--33331111-----------2222--!!!!------------ WVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNIRLNFNHPTKYLAW ------------------------------------------------------------ NFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYGNTGTYNEQLAVLDSAKIQLNGQD ---3333---------2222-----1111-------------3333---------%%%%- RFATRKGSYFNKVQPYQSIGGVTPAGVYLYSFALKPAGRQPSGTCNFSRIDNATLSLTYK -----3333-------------------------1111---------------------- TCSIDATSPAAVLGNTETVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN ----1111--1111-1111----1111---------------iiii------- >URONATE ISOMERASE; SWP:Q9WXR9; PDB:1J5SA; HMFLGEDYLLTNRAAVRLFNEVKDLPIVDPHNHLDAKDIVENKPWNDIWEVEGATDHYVW ----1111---3333------1111---------33333333------------------ ELMRRCGVSEEYITGSRSNKEKWLALAKVFPRFVGNPTYEWIHLDLWRRFNIKKVISEET ---1111-3333------------------1111----------------------3333 AEEIWEETKKKLPEMTPQKLLRDMKVEILCTTDDPVSTLEHHRKAKEAVEGVTILPTWRP -----------3333------------------3333----------------------3 DRAMNVDKEGWREYVEKMGERYGEDTSTLDGFLNALWKSHEHFKEHGCVASDHALLEPSV 333-1111----------------1111---------------1111------------- YYVDENRARAVHEKAFSGEKLTQDEINDYKAFMMVQFGKMNQETNWVTQLHIGALRDYRD 
---------------------------------------3333---------------33 SLFKTLGPDSGGDISTNFLRIAEGLRYFLNEFDGKLKIVLYVLDPTHLPTISTIARAFPN 33------------------3333-------2222--------3333----------111 VYVGAPWWFNDSPFGMEMHLKYLASVDLLYNLAGMVTDSRKLLSFGSRTEMFRRVLSNVV 1------1111----------3333--3333---------3333---------------- GEMVEKGQIPIKEARELVKHVSYDGPKALFF ---1111------------------------ >ARCHEASE, POSSIBLE CHAPER; SWP:NA; PDB:1J5UA; HHHRKPIEHTADIAYEISGNSYEELLEEARNILLEEEGIVLDTEEKEKYPLEETEDAFFD ------------------------------------------------------------ TVNDWILEISKGWAPWRIKREGNELKVTFRKIRKKEGTEIKALTYHLLKFERDGDVLKTK --------------------------------------------2222----!!!!---- VVFDT ----- >GLYCYL-TRNA SYNTHETASE AL; SWP:Q9WY59; PDB:1J5WA; YLQDVIKLNDFWASKGCLLEQPYDEVGAGTFHPATFFGSLRKGPWKVAYVQPSRRPTENP ------------1111-----------33333333-3333------------------11 NRLQRYFQYQVIIKPSPENSQELYLESLEYLGINLKEHDIRFVEDNWESPTLGAWGVGWE 11-------------------------3333--3333-----------3333-------- VWLDGEITQFTYFQQIGGISLKDIPLEITYGLERIAYLQGVDNVYEVQWNENVKYGDVFL --------------------------------33331111--3333---11113333--- ENEREFSVFNFEEANVGLLFRHFDEYEKEFYRLVEKNLYLPAYDYILKCSHTFNLLDARG --------------------------------------------------------1111 AISVSQRQTYVKRIQAARKAARVFLEVQAN ------------------------------ >GLUCOSAMINE-6-PHOSPHATE D; SWP:Q9WZS0; PDB:1J5XA; SKTLKEITDQKNELKKFFENFVLNLEKITDEVLFVGCGSSYNLALTISYYFERVLKIRTK --------------------3333------------------------------------ AIPAGEVAFQKIPDLEERGLAFLFSRTGNTTEVLLANDVLKKRNHRTIGITIEEESRLAK --33331111--------------3333------------------------11113333 ESDLPLVFPVREEAIVTKSFSILLSLFLADKIAGNSTERFSELVGYSPEFFDISWKVIEK ----------------3333----------------3333-----------------111 IDLKEHDHFVFLGSEFFGVSLESALKCIESLTFSEAYSTLEYRHGPKALVKKGTLVFQKV 13333--------3333-----------------------33331111------------ SGDEQEKRLRKELESLGATVLEVGEGGDIPVSNDWKSAFLRTVPAQILGYQKAISRGISP -------------1111------2222---------3333------------------11 DKPPHLEKTVVL 112222------ >TRANSCRIPTIONAL REGULATOR; SWP:Q9X1T8; 
PDB:1J5YA; KTVRQERLKSIVRILERSKEPVSGAQLAEELSVSRQVIVQDIAYLRSLGYNIVATPRGYV ---------------------------------------------1111-----1111-- LAGGKSGVSRLVAVKHAPEEIKEELLCVVRNGGRIVDVIVEHPVYGEIRGIIDVSSEEEV --%%%%----------1111--------1111---------------------------- LKFVNLEAKTEPLLTLSGGVHLHTIEAPDEETERIRELKKKGFLIEE ---------------%%%%---------3333------1111----- >TATD-RELATED DEOXYRIBONUC; SWP:NA; PDB:1J6OA; VDTHAHLHFHQFDDDRNAVISSFEENNIEFVVNVGVNLEDSKKSLDLSKTSDRIFCSVGV -----11111111----------1111-----------------------1111------ HPHDAKEVPEDFIEHLEKFAKDEKVVAIGETGLDFFRNISPAEVQKRVFVEQIELAGKLN 333311111111---------3333----------------------------------- LPLVVHIRDAYSEAYEILRTESLPEKRGVIHAFSSDYEWAKKFIDLGFLLGIGGPVTYPK -----------------3333----------------------1111-----3333-333 NEALREVVKRVGLEYIVLETDCPFLPPQPFRGKRNEPKYLKYVVETISQVLGVPEAKVDE 3----------3333-----------3333-----3333--------------3333--- ATTENARRIFLEVKE --------------- >CYTOCHROME C MATURATION P; SWP:Q8EK44; PDB:1J6QA; SNLNLFYTPSEIVNGKTDTGVKPEAGQRIRVGGMVTVGSMVRDPNSLHVQFAVHDSLGGE -------3333-----1111----------------------3333--------3333-- ILVTYDDLLPDLFREGQGIVAQGVLGEDGKLAATEVLAKH ---------------------------------------- >METHIONINE SYNTHASE; SWP:NA; PDB:1J6RA; HHMPKVEIAPSEIKIPDNVLKAKLGFGGAEEIPEEFRKTVNRAYEELLDAAKPVVLWRDF --------3333---3333--1111-------3333------------------------ EVDGSLSFDDMRLTGELATKHLSGSKIITVFLATLGKKVDEKIEEYFRKGEDLLAFFIDG -------2222------------------------3333-------1111---------- IASEMVEYALRKVDAELRMKRSNLEGSFRISPGYGDLPLSLNKKIAEIFKEEVDVNVIED --------------------1111------2222--------------1111-------- SYVLVPRKTITAFVGWR ----------------- >UDP-N-ACETYLMURAMATE-ALAN; SWP:Q9WY73; PDB:1J6UA; HKIHFVGIGGIGSAVALHEFSNGNDVYGSNIEETERTAYLRKLGIPIFVPHSADNWYDPD ------1111---------1111-----------------1111-------3333----- LVIKTPAVRDDNPEIVRARERVPIENRLHYFRDTLKREKKEEFAVTGTDGKTTTTAVAHV ----33331111------------------------------------------------ LKHLRKSPTVFLGGIDSLEHGNYEKGNGPVVYELDESEEFFSEFSPNYLIITNARGDHLE 
-------------------------------------3333-----------------11 NYGNSLTRYRSAFEKISRNTDLVVTFAEDELTSHLGDVTFGVKKGTYTLERSASRAEQKA 11--------------1111-----1111--3333------------------1111--- VEKNGKRYLELKLKVPGFHNVLNALAVIALFDSLGYDLAPVLEALEEFRGVHRRFSIAFH --iiii----------3333-----------1111---------------2222------ DPETNIYVIDDYAHTPDEIRNLLQTAKEVFENEKIVVIFQPHRGNFAKALQLADEVVVTE -------------------------------------------------3333------- VYDSGKIWDSLKSLGKEAYFVEKLPELEKVISVSENTVFLFVGAGDIIYSSRRFVERYQS ----------------------33333333--------------3333------------ SK -- >AUTOINDUCER-2 PRODUCTION ; SWP:P44007; PDB:1J6WA; LLDSFKVDHTKNAPAVRIAKTLTPKGDNITVFDLRFCIPNKEILSPKGIHTLEHLFAGFR -3333--3333-----------1111-----------2222---3333------------ DHLNGDSIEIIDISPGCRTGFYSLIGTPNEQKVSEAWLASQDVLGVQDQASIPELNIYQC -----------------------------------------3333--333311113333- GSYTEHSLEDAHEIAKNVIARGIGVNKNEDLSLDN -1111---------------------3333----- >AUTOINDUCER-2 PRODUCTION ; SWP:Q9ZMW8; PDB:1J6XA; KNVESFNLDHTKVKAPYVRIADRKKGVNGDLIVKYDVRFKQPNRDHDPSLHSLEHLVAEI --3333--1111-------------1111-----------2222---------------1 IRNHANYVVDWSPGCQTGFYLTVLNHDNYTEILEVLEKTQDVLKAKEVPASNEKQCGWAA 111------------------------------------3333--------3333--333 NHTLEGAQNLARAFLDKRAEWSEVG 3---------------3333----- >PEPTIDYL-PROLYL CIS-TRANS; SWP:NA; PDB:1J6YA; HMASRDQVKASHILIKHQGSRRKASWKDPEGKIILTTTREAAVEQLKSIREDIVSGKANF ---------------------------------------3333----------------- EEVATRVSDCSSAKRGGDLGSFGRGQMQKPFEEATYALKVGDISDIVDTDSGVHIIKRTA 3333----3333------------------3333-------------------------- >ASPARTIC PROTEINASE; SWP:Q00663; PDB:1J71A; SDVPTTLINEGPSYAADIVVGSNQQKQTVVIDTGSSDLWVVDTDAECQVTYSGQTNNFCK --------------------1111-----------------1111-----------1111 QEGTFDPSSSSSAQNLNQDFSIEYGDLTSSQGSFYKDTVGFGGISIKNQQFADVTTTSVD -----33331111----------1111-------------iiii---------------- QGIMGIGFTADEAGYNLYDNVPVTLKKQGIINKNAYSLYLNSEDASTGKIIFGGVDNAKY -------3333--------------1111------------1111----------1111- 
TGTLTALPVTSSVELRVHLGSINFDGTSVSTNADVVLDSGTTITYFSQSTADKFARIVGA -------------------------------------1111-----3333---------- TWDSRNEIYRLPSCDLSGDAVFNFDQGVKITVPLSELILKDSDSSICYFGISRNDANILG ---1111-----------------%%%%----3333-----------------3333--3 DNFLRRAYIVYDLDDKTISLAQVKYTSSSDISAL 3331111-----1111------------------ >MACROPHAGE CAPPING PROTEI; SWP:P40121; PDB:1J72A; PFPGSVQDPGLHVWRVEKLKPVPVAQENQGVFFSGDSYLVLHNGPEEVSHLHLWIGQQSS ----3333-------------------2222-1111-------------------1111- RDEQGACAVLAVQLDDYLGGRPVQHREVQGNESDLFMSYFPRGLKYQEGGVESGFKHVVP ---------------------------2222-3333------------------------ NEVVVQRLYQVKGKKNIRATERALNWDSFNTGDCFILDLGQNIFAWCGGKSNILERNKAR ------------------------3333-1111-----!!!!-----1111--------- DLALAIRDSERQGKAQVEIVTDGEEPAEMIQVLGPKPALKEGNPEEDLTADKANAQAAAL ------------------------------------------------------------ YKVSDATGQMNLTKVADSSPFALELLISDDCFVLDNGLCGKIYIWKGRKANEKERQAALQ ----1111-------------3333-1111-----3333--------------------- VAEGFISRMQYAPNTQVEILPQGRESPIFKQFFKDWK -------------------------3333-------- >MMS2; SWP:Q15819; PDB:1J74A; VKVPRNFRLLEELEEGQKGVGDGTVSWGLEDDEDMTLTRWTGMIIGPPRTNYENRIYSLK -------------3333----------------------------------2222----- VECGPKYPEAPPSVRFVTKINMNGINNSSGMVDARSIPVLAKWQNSYSIKVVLQELRRLM ---1111--------------2222-------33333333---1111----------333 MSKENMKLPQPPEGQTYNN 33333------2222---- >5'-D(*TP*CP*GP*CP*GP*CP*G; SWP:Q9QY24; PDB:1J75A; NLEQKILQVLSDDGGPVKIGQLVKKCQVPKKTLNQVLYRLKKEDRVSSPEPATWSIG -----------------3333-------------------1111-----2222---- >HEMO; SWP:Q9RGD9; PDB:1J77A; ALTFAKRLKADTTAVHDSVDNLVMSVQPFVSKENYIKFLKLQSVFHKAVDHIYKDAELNK --------------------------1111--------------------3333-3333- AIPELEYMARYDAVTQDLKDLGEEPYKFDKELPYEAGNKAIGWLYCAEGSNLGAAFLFKH -22221111---------1111--------------3333----------1111------ AQKLDYNGEHGARHLAPHPDGRGKHWRAFVEHLNALNLTPEAEAEAIQGAREAFAFYKVV 3333--1111-3333--1111--------------------------------------- LRETFGLAADAEAPEGMMPH -------2222--2222--- >VITAMIN D BINDING PROTEIN; 
SWP:P02774; PDB:1J78A; CKEFSHLGKEDFTSLSLVLYSRKFPSGTFEQVSQLVKEVVSLTEACCYDTRTSALSAKSC -----------------------1111--------------------------------- ESNSPFPVHPGTAECCTKRKLCMAALKHQPQEFPTYVEPTNDEICEAFRKDPKEYANQFM --------------------3333------------------------------------ WEYSTNYGQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTLSNRVC ------1111-----------------3333----------------------------- SQYAAYGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMAKELPE ----------------------33333333----------------------3333-333 HTVKLCDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVCDPGNT 3-------1111------1111---------------------------------3333- KVMDKYTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKELSSFI ----------------3333------------1111------------------------ DKGQELCADYSENTFTEYKKKLAERLKAKLPDATPTELAKLVNKRSDFASNCCSINSPPL --------3333-----------------1111------------------------333 YCDSEIDAELKNI 3------3333-- >DIHYDROOROTASE; SWP:P05020; PDB:1J79A; SQVLKIRRPDDWHLHLRDGDMLKTVVPYTSEIYGRAIVMPNLAPPVTTVEAAVAYRQRIL ------------------------3333-------------------------------1 DAVPAPHDFTPLMTCYLTDSLDPNELERGFNEGVFTAALYPANATTNSSHGVTSVDAIMP 111--------------1111---------------------11111111---3333--- VLERMEKIGMPLLVHGEVTHADIDIFDREARFIESVMEPLRQRLTALKVVFEHITTKDAA -------------------11113333-3333-----------1111------------- DYVRDGNERLAATITPQHLMFNRNHMLVGGVRPHLYCLPILKRNIHQQALRELVASGFQR --11111111----3333---3333------1111-------3333-------3333--- VFLGTDSAPHARHRKESSCGCAGCFNAPTALGSYATVFEEMNALQHFEAFCSVNGPQFYG ----------3333-----------3333---------11113333-------------- LPVNDTFIELVREEQQVAESIALTDDTLVPFLAGETVRWSVK ----------------------1111---2222--------- >MMS2; SWP:Q16781; PDB:1J7DB; AGLPRRIIKETQRLLAEPVPGIKAEPDESNARYFHVVIAGPQDSPFEGGTFKLELFLPEE ------------------2222-------1111-------2222-2222----------- YPMAAPKVRFMTKIYHPNVDKLGRICLDILKDKWSPALQIRTVLLSIQALLSAPNPDDPL ---------------11111111---1111----1111----------------1111-- ANDVAEQWKTNEAQAIETARAWTRLYAMN --3333----------------------- >D-TYROSYL-TRNA(TYR) 
DEACY; SWP:P44814; PDB:1J7GA; MIALIQRVSQAKVDVKGETIGKIGKGLLVLLGVEKEDNREKADKLAEKVLNYRIFSDEND --------------iiii---------------2222----------------------- KMNLNVQQAQGELLIVSQFTLAADTQKGLRPSFSKGASPALANELYEYFIQKCAEKLPVS ------------------3333----------1111----------------3333---- TGQFAADMQVSLTNDGPVTFWLNV ------------------------ >HYPOTHETICAL PROTEIN HI07; SWP:P44839; PDB:1J7HA; MMTQIIHTEKAPAAIGPYVQAVDLGNLVLTSGQIPVNPATGEVPADIVAQARQSLENVKA ------------------------------------1111-----3333----------- IIEKAGLTAADIVKTTVFVKDLNDFAAVNAEYERFFKENNHPNFPARSCVEVARLPKDVG -------3333---------3333------------11113333----------2222-- LEIEAIAVRK ---------- >AMINOGLYCOSIDE 3'-PHOSPHO; SWP:P00554; PDB:1J7IA; RISPELKKLIEKYRCVKDTEGMSPAKVYKLVGENENLYLKMTDSRYKGTTYDVEREKDMM --------1111-------------------------------1111------------- LWLEGKLPVPKVLHFERHDGWSNLLMSEADGVLCSEEYEDEQSPEKIIELYAECIRLFHS --2222-----------%%%%---------------------3333-----------111 IDISDCPYTNSLDSRLAELDYLLNNDLADVDCENWEEDTPFKDPRELYDFLKTEKPEEEL 1-------------------------------3333------3333-------------- VFSHGDLGDSNIFVKDGKVSGFIDLGRSGRADKWYDIAFCVRSIREDIGEEQYVELFFDL --------------iiii------1111---3333--------------3333------- LGIKPDWEKIKYYILLDELF ---------------3333- >HYPOXANTHINE PHOSPHORIBOS; SWP:O33799; PDB:1J7JA; HTVEVMIPEAEIKARIAELGRQITERYKDSGSEMVLVGLLRGSFMFMADLCREVQVPHEV --------------------------1111----------1111------1111------ DFMTASRDVKILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILGLREPKSLAICTLLDK ------------------2222-------------------------------------3 PSRREVDVPVEFVGFSIPDEFVVGYGIDYAQRYRHLPYVGKVV 333--------------------iiii-%%%%1111------- >MATRIX METALLOPROTEINASE ; SWP:P08253; PDB:1J7MA; SWMSTVGGNSGGAPCVFPFTFLGNKYESCTSAGRSDGKMWCATTANYDDDRKWGFCPDQG --------------------iiii------2222-----------3333----------- >CALCIUM VECTOR PROTEIN; SWP:P04573; PDB:1J7QA; AAPKARALGPEEKDECMKIFDIFDRNAENIAPVSDTMDMLTKLGQTYTKRETEAIMKEAR --------1111------------------------------------------------ GPKGDKKNIGPEEWLTLCSKWVRQDD 
-------------------------- >INTERPHOTORECEPTOR RETINO; SWP:Q7SZI7; PDB:1J7XA; DPSVTHVLHQLCDILANNYAFSERIPTLLQHLPNLDYSTVISEEDIAAKLNYELQSLTED 3333----------------3333------1111-------------------------3 PRLVLKSKTDTLVPGDSIQAENIPEDEALQALVNTVFKVSILPGNIGYLRFDQFADVSVI 333---3333--------1111-------------------------------------- AKLAPFIVNTVWEPITITENLIIDLRYNVGGSSTAVPLLLSYFLDPETKIHLFTLHNRQQ ----3333---3333---------1111----1111----1111------------3333 NSTDEVYSHPKVLGKPYGSKKGVYVLTSHQTATAAEEFAYLQSLSRATIIGEITSGNLHS ---------------------------1111----------1111--------------- KVFPFGDTQLSVTVPIINFIDSNGDYWLGGGVVPDAIVLADEALDKAKEIIAFHPPLA ----2222------------1111--------------3333---------------- >ENDO-1,4-BETA GLUCANASE E; SWP:P94622; PDB:1J83A; QPTAPKDFSSGFWDFNDGTTQGFGVNPDSPITAINVENANNALKISNLNSKGSNDLSEGN -------1111--------iiii--1111---------%%%%-----3333--------3 FWANVRISADIWGQSINIYGDTKLTDVIAPTPVNVSIAAIPQSSTHGWGNPTRAIRVWTN 333-----1111-----2222---------------------1111-----------111 NFVAQTDGTYKATLTISTNDSPNFNTIATDAADSVVTNILFVGSNSDNISLDNIKFTK 1---1111--------1111---------1111------------------------- >YBAB; SWP:P44711; PDB:1J8BA; LGGLKQAQQQEKQKQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLEDDKELEDL ---------------------------iiii-----1111-------3333--------- IAAAFNDAVRRAEELQKEKASVTAG -------------------3333-- >UBIQUITIN-LIKE PROTEIN HP; SWP:Q9UHD9; PDB:1J8CA; MAENGESSGPPRPSRGPAAAQGSAAAPAEPKIIKVTVKTPKEKEEFAVPENSSVQQFKEA ----------------------------------------------------3333---- ISKRFKSQTDQLVLIFAGKILKDQDTLIQHGIHDGLTVHLVIK ---------------------33333333-------------- >LOW-DENSITY LIPOPROTEIN R; SWP:Q07954; PDB:1J8EA; GSHSCSSTQFKCNSGRCIPEHWTCDGDNDCGDYSDETHANCTNQ -----1111--1111---1111-----3333-111133331111 >SIRTUIN 2, ISOFORM 1; SWP:Q8IXJ6; PDB:1J8FA; GEADMDFLRNLFRLLDELTLEGVARYMQSERCRRVICLVGAGISTSAGIPDFRSPSTGLY ----------------------------3333-----------3333---3333---111 DNLEKYHLPYPEAIFEISYFKKHPEPFFALAKELYPGQFKPTICHYFMRLLKDKGLLLRC 1----------------------------3333------------------1111----- 
YTQNIDTLERIAGLEQEDLVEAHGTFYTSHCVSASCRHEYPLSWMKEKIFSEVTPKCEDC --------------3333--1111--------1111------------1111-------- QSLVKPDIVFFGESLPARFFSCMQSDFLKVDLLLVMGTSLQVQPFASLISKAPLSTPRLL ---------2222-----------1111--------------------33331111---- INKEKAGQSDPFLGMIMGLGGGMDFDSKKAYRDVAWLGECDQGCLALAELLGWKKELEDL ----------3333------------1111------------------1111-------- VRREHASIDAQS ------------ >T-cell receptor alpha cha; SWP:P01737; PDB:1J8HD; QSVTQLGSHVSVSEGALVLLRCNYSSSVPPYLFWYVQYPNQGLQLLLKYTSAATLVKGIN ----------------------------------------------------------ii GFEAEFKKSETSFHLTKPSAHMSDAAEYFCAVSESPFGNEKLTFGTGTRLTIIPNIQNPD ii-----1111---------1111----------1111---------------------- PAVYQLRSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKTVLDMRSMDFKSNSAVAWSNKS -----------------------------1111----------1111-----------33 DFACANAFNNSIIPEDTF 333333------3333-- >TRBC1 protein [Fragment]; SWP:Q8N2T6; PDB:1J8HE; VKVTQSSRYLVKRTGEKVFLECVQDMDHENMFWYRQDPGLGLRLIYFSYDVKMKEKGDIP ------------------------------------2222---------2222------- EGYSVSREKKERFSLILESASTNQTSMYLCASSSTGLPYGYTFGSGTRLTVVEDLNKVFP --------3333--------1111---------1111----------------1111--- PEVAVFEPSEAEISHTQKATLVCLATGFFPDHVELSWWVNGKEVHSGVSTDPQPLKEQPA --------------------------------------iiii--2222---------333 LNDSRYSLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGR 3-------------3333--3333-----------3333--------------------- A - >LYMPHOTACTIN; SWP:P47992; PDB:1J8IA; VGSEVSDKRTCVSLTTQRLPVSRIKTYTITEGSLRAVIFITKRGLKVCADPQATWVRDVV -------------------3333--------3333-----1111---------------- RSMDRKSNTRNNMIQTKPTGTQQSTNTAVTLTG -3333---------------------------- ------------------------------------------------------------ ---------------------------------- >Signal recognition 54 kDa; SWP:P70722; PDB:1J8MF; LLDNLRDTVRKFLTGSSSYDKAVEDFIKELQKSLISADVNVKLVFSLTNKIKERLKNEKP 3333-------------------------------------------------------- PTYIERREWFIKIVYDELSNLFGGDKEPKVIPDKIPYVIMLVGVQGTGKTTTAGKLAYFY 
2222---------------1111------------------------------------- KKKGFKVGLVGADVYRPAALEQLQQLGQQIGVPVYGEPGEKDVVGIAKRGVEKFLSEKME ------------------------------------2222-------------------- IIIVDTAGRHGYGEEAALLEEMKNIYEAIKPDEVTLVIDASIGQKAYDLASKFNQASKIG ----------2222------------------------1111-----------3333--- TIIITKMDGTAKGGGALSAVAATGATIKFIGTGEKIDELEVFNPRRFVARLHHHH -----1111-----------1111----------1111---------33331111 >PYELONEPHRITIC ADHESIN; SWP:Q47450; PDB:1J8RA; WNNIVFYSLGDVNSYQGGNVVITQRPQFITSWRPGIATVTWNQCNGPEFADGFWAYYREY --------------------1111------------------------------------ IAWVVFPKKVTQNGYPLFIEVHNKGSWSEENTGDNDSYFFLKGYKWDERAFDAGNLCQKP ----------1111---------!!!!---3333------------------------22 GEITRLTEKFDDIIFKVALPADLPLGDYSVKIPYTSGQRHFASYLGARFKIPYNVAKTLP 22-----------------1111----------------------------33331111- RENELFLFKNIGG ------------- >PHENYLALANINE-4-HYDROXYLA; SWP:P00439; PDB:1J8UA; VPWFPRTIQELDRFANQILSYGAELDADHPGFKDPVYRARRKQFADIAYNYRHGQPIPRV ------3333-3333------11111111-1111-----------------2222----- EYMEEEKKTWGTVFKTLKSLYKTHACYEYNHIFPLLEKYCGFHEDNIPQLEDVSQFLQTC ------------------------------------------1111-------------- TGFRLRPVAGLLSSRDFLGGLAFRVFHCTQYIRHGSKPMYTPEPDICHELLGHVPLFSDR -------------------3333----------3333---------------3333---- SFAQFSQEIGLASLGAPDEYIEKLATIYWFTVEFGLCKQGDSIKAYGAGLLSSFGELQYC ------------2222----------------------!!!!------------------ LSEKPKLLPLELEKTAIQNYTVTEFQPLYYVAESFNDAKEKVRNFAATIPRPFSVRYDPY ----------33331111---------------------------1111----------- TQRIEVL ------- >UROPORPHYRINOGEN DECARBOX; SWP:Q42967; PDB:1J93A; TQPLLLDAVRGKEVERPPVWLMRQAGRYMKSYQLLCEKYPLFRDRSENVDLVVEISLQPW ---------------------------------3333----3333---1111--1111-- KVFRPDGVILFSDILTPLSGMNIPFDIIKGKGPVIFDPLRTAADVEKVREFIPEKSVPYV ------------11113333------------------------3333---3333----- GEALTILRKEVNNQAAVLGFVGAPFTLASYVVEGGSSKNFTKIKRLAFAEPKVLHALLQK ----------%%%%---------------------------------------------- 
FATSMAKYIRYQADSGAQAVQIFDSWATELSPVDFEEFSLPYLKQIVDSVKLTHPNLPLI ------------1111--------------3333-------------------1111--- LYASGSGGLLERLPLTGVDVVSLDWTVDMADGRRRLGPNVAIQGNVDPGVLFGSKEFITN -----111111113333------3333-------------------3333---------- RINDTVKKAGKGKHILNLGHGIKVGTPEENFAHFFEIAKGLRY ------3333------------11113333-------1111-- >3ALPHA-HYDROXYSTEROID DEH; SWP:P52895; PDB:1J96A; DDSKYQCVKLNDGHFMPVLGFGTYAPAEVPKSKALEAVKLAIEAGFHHIDSAHVYNNEEQ ---------1111------------3333----------------------3333----- VGLAIRSKIADGSVKREDIFYTSKLWSNSHRPELVRPALERSLKNLQLDYVDLYLIHFPV --------1111--3333-------1111-3333-------------------------- SVKPGEEVIPKDENGKILFDTVDLCATWEAMEKCKDAGLAKSIGVSNFNHRLLEMILNKP -----------1111-------------------1111--------------------22 GLKYKPVCNQVECHPYFNQRKLLDFCKSKDIVLVAYSALGSHREEPWVDPNSPVLLEDPV 22-----------1111---------1111------1111--------1111-1111--- LCALAKKHKRTPALIALRYQLQRGVVVLAKSYNEQRIRQNVQVFEFQLTSEEMKAIDGLN --------------------1111-----------------1111-----------1111 RNVRYLTLDIFAGPPNYPFSDEY -------3333--1111------ >AUTOINDUCER-2 PRODUCTION ; SWP:O34667; PDB:1J98A; VESFELDHNAVVAPYVRHCGVHKVGTDGVVNKFDIRFCQPNKQAMKPDTIHTLEHLLAFT 3333--3333----------------------------2222------------------ IRSHAEKYDHFDIIDISPMGQTGYYLVVSGETTSAEIVDLLEDTMKEAVEITEIPAANEK ----3333--------------------------------------3333---2222333 QCGQAKLHDLEGAKRLMRFWLSQDKEELLKVFG 3--3333------------33333333--1111 >ALCOHOL SULFOTRANSFERASE; SWP:Q06520; PDB:1J99A; SDDFLWFEGIAFPTMGFRSETLRKVRDEFVIRDEDVIILTYPKSGTNWLAEILCLMHSKG ------iiii---------------------1111--------------------1111- DAKWIQSVPIWERSPWVESEIGYTALSETESPRLFSSHLPIQLFPKSFFSSKAKVIYLMR --3333--1111---1111--------------------3333-3333------------ NPRDVLVSGYFFWKNMKFLKKPKSWEEYFEWFCQGTVLYGSWFDHIHGWMPMREEKNFLL -----------1111---------------------22223333----3333--1111-- LSYEELKQDTGRTIEKICQFLGKTLEPEELNLILKNSSFQSMKENKMSNYSLLSVDYVVD --------------------------------------------3333-----1111--3 KTQLLRKGVSGDWKNHFTVAQAEDFDKLFQEKMADLPRELFPWE 
333--------3333-----------------11111111---- >OLIGORIBONUCLEASE; SWP:P45340; PDB:1J9AA; SHSFDKQNLIWIDLETGLDPEKERIIEIATIVTDKNLNILAEGPVLAVHQSDELLNKNDW ----1111----------1111-----------1111----------------------- CQKTHSENGLIERIKASKLTERAAELQTLDFLKKWVPKGASPICGNSIAQDKRFLVKYPD -----3333---------------------------2222-------------------3 LADYFHYRHLDVSTLKELAARWKPEILEGFKKENTHLALDDIRESIKELAYYREHFKLD 333-------3333---3333-3333----------3333------------------- >TERMINASE SMALL SUBUNIT; SWP:P03707; PDB:1J9IA; MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENE ---------------------1111--------------------------------333 KLRREVEE 3------- >STATIONARY PHASE SURVIVAL; SWP:P96112; PDB:1J9LA; MRILVTNDDGIQSKGIIVLAELLSEEHEVFVVAPDKERSATGHSITIHVPLWMKKVFISE ---------11113333--------------------2222-----------------11 RVVAYSTTGTPADCVKLAYNVVMDKRVDLIVSGVNRGPNMGMDILHSGTVSGAMEGAMMN 11--------------------%%%%-------------!!!!-------------1111 IPSIAISSANYESPDFEGAARFLIDFLKEFDFSLLDPFTMLNINVPAGEIKGWRFTRQSR ------------------------------3333-------------------------- RRWNDYFEERVSPFGEKYYWMMGEVIEDDDRDDVDYKAVREGYVSITPIHPFLTNEQCLK -----------1111--------------------------------------------- KLREVYD ------- >NADPH-CYTOCHROME P450 RED; SWP:P00388; PDB:1JA1A; PVKESSFVEKMKKTGRNIIVFYGSQTGTAEEFANRLSKDAHRYGMRGMSADPEEYDLADL -----------1111------------------------------------1111--333 SSLPEIDKSLVVFCMATYGEGDPTDNAQDFYDWLQETDVDLTGVKFAVFGLGNKTYEHFN 33333------------------1111-------------2222---------------- AMGKYVDQRLEQLGAQRIFELGLGDDDGNLEEDFITWREQFWPAVCEFFGVEATGEESSI ------------------------3333-------------------------------- RQYELVVHEDMDVAKVYTGEMGRLKSYENQKPPFDAKNPFLAAVTANRKLNQGTERHLMH -----------3333-------2222--------1111---------------------- LELDISDSKIRYESGDHVAVYPANDSALVNQIGEILGADLDVIMSLNNLDEESNKKHPFP ----2222----2222-----------------1111-1111-------3333------- CPTTYRTALTYYLDITNPPRTNVLYELAQYASEPSEQEHLHKMASSSGEGKELYLSWVVE --------------------------3333---------1111---------------11 
ARRHILAILQDYPSLRPPIDHLCELLPRLQARYYAIASSSKVHPNSVHICAVAVEYEAKS 11---------1111-----------------------33331111-----------333 GRVNKGVATSWLRAKEPAGENGGRALVPMFVRKSQFRLPFKSTTPVIMVGPGTGIAPFMG 3----------1111-------------------------3333------!!!!------ FIQERAWLREQGKEVGETLLYYGCRRSDEDYLYREELARFHKDGALTQLNVAFSREQAHK -------------------------1111-----------------------1111---- VYVQHLLKRDREHLWKLIHEGGAHIYVAGDARNMAKDVQNTFYDIVAEFGPMEHTQAVDY -3333------------------------------------------------------- VKKLMTKGRYSLNVWS ----1111-------- >MHC CLASS I RECOGNITION R; SWP:Q9JHN9; PDB:1JA3A; VKYWFCYGTKCYYFIMNKTTWSGCKANCQHYSVPIVKIEDEDELKFLQRHVIPEGYWIGL ------!!!!---------1111-----1111-------3333----------------- SYDKKKKEWAWIDNGPSKFDMKSRGCVFLSKARIEDTDCNIPYYCICGKKLDKFP ------------------------------------------------------- >1,3,6,8-TETRAHYDROXYNAPHT; SWP:Q9HFV6; PDB:1JA9A; SKPLAGKVALTTGAGRGIGRGIAIELGRRGASVVVNYGSSSKAAEEVVAELKKLGAQGVA -1111---------------------1111---------------------1111----- IQADISKPSEVVALFDKAVSHFGGLDFVMSNSGMEVWCDELEVTQELFDKVFNLNTRGQF ---1111-------------------------------3333------------------ FVAQQGLKHCRRGGRIILTSSIAAVMTGIPNHALYAGSKAAVEGFCRAFAVDCGAKGVTV ----------2222--------1111---------------------------1111--- NCIAPGGVKTDMFDENSWHYAPGGYKGMPQEKIDEGLANMNPLKRIGYPADIGRAVSALC ---------------3333-22222222------------1111---3333--------- QEESEWINGQVIKLTGGGI 3333----------iiii- >PHOSPHOLIPASE C BETA; SWP:Q91086; PDB:1JADA; NKEVTQLPEPQTASLAELQQKLFLKLLKKQEKELKELERKGSKRREELLQKYSVLFLEPV -------------3333------------------------------------------- YPRGLDSQVVELKERLEELIHLGEEYHDGIRRRKEQHATEQTAKITELAREKQIAELKAL ------------------------------------------------------------ KESSESNIKDIKKKLEAKRLDRIQVRSTSDKAAQERLKKEINNSHIQEVVQTIKLLTEKT ------------------------------------------------------------ ARYQQKLEEKQAENLRAIQEKEGQLQQEAVAEYEEKLKTLTVEVQEVKNYKEVFP --------------------------------------3333------------- >ALPHA-AMYLASE; SWP:P56634; PDB:1JAE; 
KDANFASGRNSIVHLFEWKWNDIADECERFLQPQGFGGVQISPPNEYLVADGRPWWERYQ -----2222-----2222-------------1111--------------22223333--- PVSYIINTRSGDESAFTDMTRRCNDAGVRIYVDAVINHMTGMNGVGTSGSSADHDGMNYP -------1111------------1111------------------1111----1111--- AVPYGSGDFHSPCEVNNYQDADNVRNCELVGLRDLNQGSDYVRGVLIDYMNHMIDLGVAG ----1111--------------------iiii---------------------1111--- FRVDAAKHMSPGDLSVIFSGLKNLNTDYGFADGARPFIYQEVIDLGGEAISKNEYTGFGC ----3333---------1111---3333--2222----------------33333333-- VLEFQFGVSLGNAFQGGNQLKNLANWGPEWGLLEGLDAVVFVDNHDNQRTGGSQILTYKN ------------1111--3333----3333---3333--------3333--3333-1111 PKPYKMAIAFMLAHPYGTTRIMSSFDFTDNDQGPPQDGSGNLISPGINDDNTCSNGYVCE ----------------------------1111----1111-------1111-------33 HRWRQVYGMVGFRNAVEGTQVENWWSNDDNQIAFSRGSQGFVAFTNGGDLNQNLNTGLPA 33-----------1111------------------------------------------- GTYCDVISGELSGGSCTGKSVTVGDNGSADISLGSAEDDGVLAIHVNAKL -----------iiii--------1111------1111-------1111-- >CYTOCHROME C'; SWP:P00142; PDB:1JAFA; QFQKPGDAIEYRQSAFTLIANHFGRVAAMAQGKAPFDAKVAAENIALVSTLSKLPLTAFG ---3333---------------------1111---------------------1111--2 PGTDKGHGTEAKPAVWSDAAGFKAAADKFAAAVDKLDAAGKTGDFAQIKAAVGETGGACK 222---------3333-------------------------------------------- GCHDKFKE -------- >DNA POLYMERASE BETA-LIKE ; SWP:P42494; PDB:1JAJA; MLTLIQGKKIVNHLRSRLAFEYNGQLIKILSKNIVAVGSLRREEKMLNDVDLLIIVPEKK --3333-------3333----iiii----3333----3333------------------- LLKHVLPNIRIKGLSFSVKVCGERKCVLFIEWEKKTYQLDLFTALAEEKPYAIFHFTGPV --------------------------------------------1111------------ SYLIRIRAALKKKNYKLNQYGLFKNQTLVPLKITTEKELIKELGFTYRIPKKRL ----------1111---3333--%%%%---------------------3333-- >BETA-N-ACETYLHEXOSAMINIDA; SWP:Q9Y691; PDB:1JAKA; DRKAPVRPTPLDRVIPAPASVDPGGAPYRITRGTHIRVDDSREARRVGDYLADLLRPATG 3333-----1111-----------------1111----------------------3333 YRLPVTAHGHGGIRLRLAGGPYGDEGYRLDSGPAGVTITARKAAGLFHGVQTLRQLLPPA -------------------------------1111------3333--------------1 
VEKDSAQPGPWLVAGGTIEDTPRYAWRSAMLDVSRHFFGVDEVKRYIDRVARYKYNKLHL 111-----------------------------------------------1111------ HLSDDQGWRIAIDSWPRLATYGGSTEVGGGPGGYYTKAEYKEIVRYAASRHLEVVPEIDM ---1111----3333-----1111-2222------------------1111--------- PGHTNAALASYAELNCDGVAPPLYTGTKVGFSSLCVDKDVTYDFVDDVIGELAALTPGRY ----------33331111----------------11113333----------1111---- LHIGGDEAHSTPKADFVAFMKRVQPIVAKYGKTVVGWHQLAGAEPVEGALVQYWGLDRTG -------1111----------------1111--------1111--2222------1111- DAEKAEVAEAARNGTGLILSPADRTYLDMKYTKDTPLGLSWAGYVEVQRSYDWDPAGYLP --------------------1111-1111--1111----1111--3333----3333-22 GAPADAVRGVEAPLWTETLSDPDQLDYMAFPRLPGVAELGWSPASTHDWDTYKVRLAAQA 223333---------1111-----------------------3333-3333--------- PYWEAAGIDFYRSPQVPWT ------------1111--- >YCHF PROTEIN; SWP:P44681; PDB:1JALA; MGFKCGIVGLPNVGKSTLFNALTKAGPFCTIEPNTGVVPMPDPRLDALAEIVKPERILPT ------------------------------------------------------------ TMEFVDIAGLVAGASKGEGLGNKFLANIRETDAIGHVVRCFENIDPLDDIDTINTELALA --------------3333--------------------------3333------------ DLDSCERAIQRLQKRAKGGDKEAKFELSVMEKILPVLENAGMIRSVGLDKEELQAIKSYN ---------------11113333--33333333---1111-3333----------1111- FLTLKPTMYIANVNEDGFENNPYLDRVREIAAKEGAVVVPVCAAIESEIAELDDEEKVEF 1111-------------------------3333----------3333---------3333 LQDLGIEEPGLNRVIRAGYALLNLQTYFTAGVKEVRAWTVSVGATAPKAAAVIHTDFEKG --------3333------------------3333------2222----------3333-- FIRAEVIAYEDFIQFNGENGAKEAGKWRLEGKDYIVQDGDVMHFRFNV -------3333-1111--------------1111--2222-------- >UBIQUITIN-CONJUGATING ENZ; SWP:P23567; PDB:1JASA; MSTPARRRLMRDFKRLQEDPPVGVSGAPSENNIMQWNAVIFGPEGTPFEDGTFKLVIEFS ------------------------------------------2222-3333--------- EEYPNKPPTVRFLSKMFHPNVYADGSICLDILQNRWSPTYDVSSILTSIQSLLDEPNPNS -----------------11111111---33331111---------------3333-3333 PANSQAAQLYQENKREYEKRVSAIVEQSWNDS --3333-------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P52490; PDB:1JATA; AASLPKRIIKETEKLVSDPVPGITAEPHDDNLRYFQVTIEGPEQSPYEDGIFELELYLPD 
----3333-----------2222----1111----------2222-1111--------11 DYPMEAPKVRFLTKIYHPNIDRLGRICLDVLKTNWSPALQIRTVLLSIQALLASPNPNDP 11------------------1111---1111111133333333------------1111- LANDVAEDWIKNEQGAKAKAREWTKLYAKKKP --3333-------------------------- >Ubiquitin-conjugating enz; SWP:P53152; PDB:1JATB; SKVPRNFRLLEELEKGEKESCSYGLADSDDITMTKWNGTILGPPHSNHENRIYSLSIDCG --------------------------1111-----------------2222--------1 PNYPDSPPKVTFISKINLPCVNPTTGEVQTDFHTLRDWKRAYTMETLLLDLRKEMATPAN 111--------------1111-------11113333--3333--------------3333 KKLRQPKEGETF ------2222-- >COENZYME F420H2:NADP+ OXI; SWP:O29370; PDB:1JAYA; MRVALLGGTGNLGKGLALRLATLGHEIVVGSRREEKAEAKAAEYRRIAGDASITGMKNED --------------------1111------------------------------------ AAEACDIAVLTIPWEHAIDTARDLKNILREKIVVSPLVPVSRGAKGFTYSSERSAAEIVA --------------------------3333------------1111-------------- EVLESEKVVSALHTIPAARFANLDEKFDWDVPVCGDDDESKKVVMSLISEIDGLRPLDAG -----------11113333--1111-------------------------2222------ PLSNSRLVESLTPLILNIMRFNGMGELGIKFL 3333---------------------------- >PHOTOSYSTEM I P700 CHLORO; SWP:P18083; PDB:1JB0C; AHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTD -------------3333---------------1111------3333------3333---- FLSIRVYLGAETTRSMGLAY -----------3333----- >PHOTOSYSTEM I P700 CHLORO; SWP:P20452; PDB:1JB0D; TTLTGQPPLYGGSTGGLLSAADTEEKYAITWTSPKEQVFEMPTAGAAVMREGENLVYFAR -----------------3333-------------------1111---------------3 KEQCLALAAQQLRPRKINDYKIYRIFPDGETVLIHPKDGVFPEKVNKGREAVNSVPRSIG 333--------3333----------3333------1111-3333-------------333 QNPNPSQLKFTGKKPYDP 3--33332222--1111- >PHOTOSYSTEM I P700 CHLORO; SWP:P25898; PDB:1JB0E; VQRGSKVKILRPESYWYNEVGTVASVDQTPGVKYPVIVRFDKVNYTGYSGSASGVNTNNF -2222-----1111-2222---------2222---------------2222--------- ALHEVQEVA 1111----- >HPRK PROTEIN; SWP:Q9RE09; PDB:1JB1A; ERRSHGVLVDIYGLGVLITGDSGVGKSETALELVQRGHRLIADDRVDVYQQDEQTIVGAA ----------%%%%------2222-3333----1111----------------------- 
PPILSHLLEIRGLGIIDVNLFGAGAVREDTTISLIVHLENWTPDQLIFDVPVPKITVPVK 3333-----------------1111----------------------------------2 VGRNLAIIIEVAANFRAKSGYDATKTFEKNLNHLIEH 222---------------------------------- >AGRIN; SWP:P31696; PDB:1JB3A; ELQRREEEANVVLTGTVEEIMNVDPVHHTYSCKVRVWRYLKGKDIVTHEILLDGGNKVVI ------------------------------------------------------------ GGFGDPLICDNQVSTGDTRIFFVNPAPQYMWPAHRNELMLNSSLMRITLRNLEEVEHCVE -2222--------2222---------3333-1111------------------------- EHRKLLA ------- >TELOMERE-BINDING PROTEIN ; SWP:P29549; PDB:1JB7A; YEYVELAKASLTSAQPQHFYAVVIDATFPYKTNQERYICSLKIVDPTLYLKQQKGAGDAS ----3333------------------------------------1111------------ DYATLVLYAKRFEDLPIIHRAGDIIRVHRATLRLYNGQRQFNANVFYSSSWALFSTDKRS ----------3333--------------------iiii-----3333------------- VTQEINNQDAVSDTTPFSFSSKHATIEKNEISILQNLRKWANQYFSSYSVISSDMYTALN --------------------------3333---------------------1111--111 KAQAQKGDFDVVAKILQVHELDEYTNELKLKDASGQVFYTLSLKLKFPHVRTGEVVRIRS 11111----------------1111------1111-------333311112222------ ATYDETSTQKKVLILSHYSNIITFIQSSKLAKELRAKIQDDHSVEVASLKKNVSLNAVVL ---1111--------1111-----1111-------------------1111--------- TEVDKKHAALPSTSLQDLFHHADSDKELQAQDTFRTQFYVTKIEPSDVKEWVKGYDRKTK ---3333------3333---1111-3333-----------------3333---------- KSSSLKGASGKGDNIFQVQFLVKDASTQLNNNTYRVLLYTQDGLGANFFNVKADNLHKNA ----1111---------------3333------------1111-1111------3333-- DARKKLEDSAELLTKFNSYVDAVVERRNGFYLIKDTKLIY --------------2222--------iiii---------- >Telomere-binding protein ; SWP:P16458; PDB:1JB7B; QQQSAFKQLYTELFNNEGDFSKVSSNLKKPLKCYVKESYPHFLVTDGYFFVAPYFTKEAV ---------------%%%%11111111--------------------------------- NEFHAKFPNVNIVDLTDKVIVINNWSLELRRVNSAEVFTSYANLEARLIVHSFKPNLQER ------111111112222--------------3333----%%%%---------------- LNPTRYPVNLFRDDEFKTTIQHFRHTALQAAINKTVKGDNLVDISKVADAAGKKGKVDAG --------1111------------------------------3333---1111----111 IVKASASKGDEFSDFSFKEGNTATLKIADIFVQEKG 1------------------------------1111- >FERREDOXIN-NADP REDUCTASE; 
SWP:Q41736; PDB:1JB9A; SRSKVSVAPLHLESAKEPPLNTYKPKEPFTATIVSVESLVGPKAPGETCHIVIDHGGNVP ----------1111---------3333-------------1111----------iiii-- YWEGQSYGVIPPGENPKKPGAPQNVRLYSIASTRYGDNFDGRTGSLCVRRAVYYDPETGK -2222---------------------------1111------------------------ EDPSKNGVCSNFLCNSKPGDKIQLTGPSGKIMLLPEEDPNATHIMIATGTGVAPFRGYLR -1111-------11112222---------1111----1111------------------- RMFMEDVPNYRFGGLAWLFLGVANSDSLLYDEEFTSYLKQYPDNFRYDKALSREQKGKMY ------1111-------------3333-------------1111---------------3 VQDKIEEYSDEIFKLLDGGAHIYFCGLKGMMPGIQDTLKKVAERRGESWDQKLAQLKKNK 333------------1111-------3333------------1111----------1111 QWHVEVY ------- >GUANYLATE CYCLASE ACTIVAT; SWP:P51177; PDB:1JBAA; GQQFSWEEAEENGAVGAADAAQLQEWYKKFLEECPSGTLFMHEFKRFFKVPDNEEATQYV -----3333---------------------1111-----------------------333 EAMFRAFDTNGDNTIDFLEYVAALNLVLRGTLEHKLKWTFKIYDKDRNGCIDRQELLDIV 3-------------------------------------3333------------------ ESIYKLKKACSVEVEAEQQGKLLTPEEVVDRIFLLVDENGDGQLSLNEFVEGARRDKWVM -----------------------1111------3333-------------1111------ KMLQMDLNP -3333---- >CHEMOTAXIS PROTEIN CHEY; SWP:P06143; PDB:1JBEA; ADKELKFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMP -1111-------------------1111----------------3333------------ NMDGLELLKTIRAAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLNKI --------------1111-----------------1111--------------------- FEKLGM ------ >COCHLIN; SWP:O43405; PDB:1JBIA; TAPIAITCFTRGLDIRKEKADVLCPGGCPLEEFSVYGNIVYASVSSICGAAVHRGVISNS ----------3333---------------------------3333--------------- GGPVRVYSLPGRENYSSVDANGIQSQMLSRWSASFTVTLE -------------------%%%%----------------- >CD3 Epsilon and gamma Ect; SWP:P22646; PDB:1JBJA; DDAENIEYKVSISGTSVELTCPLDSDENLKWEKNGQELPQKHDKHLVLQDFSEVEDSGYY ---------------------------------------------------3333----- VCYTPASNKNTYLYLKARVGSADDAKKDAAKKDDAKKDDAKKDGSQTNKAKNLVQVDGSR ---3333----------------------------------------------------- GDGSVLLTCGLTDKTIKWLKDGSIISPLNATKNTWNLGNNAKDPRGTYQCQGAKETSNPL 
-------------------iiii---------------3333------------------ QVYYRM ------ >CLPB PROTEIN; SWP:P03815; PDB:1JBKA; HMQALKKYTIDLTERAEQGKLDPVIGRDEEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIV ---------------1111-----------------1111----------22223333-- EGLAQRIINGEVPEGLKGRRVLALDMGALVAGAKYRGEFEERLKGVLNDLAKQEGNVILF ------------3333-------------2222-------------------2222---- IDELHTMVGAMDAGNMLKPALARGELHCVGATTLDEYRQYIEKDAALERRFQKVFVAEPS --3333----------------------------------1111--3333---------- VEDTIAILR ----3333- >C-PHYCOCYANIN ALPHA CHAIN; SWP:P50032; PDB:1JBOA; MKTPITEAIAAADTQGRFLSNTELQAVDGRFKRAVASMEAARALTNNAQSLIDGAAQAVY ------------1111-------------------------------------------- QKFPYTTTMQGSQYASTPEGKAKCARDIGYYLRMVTYCLVAGGTGPMDEYLIAGLSEINS ----1111--1111-------------------------------------2222----1 TFDLSPSWYIEALKYIKANHGLTGQAAVEANAYIDYAINALS 111-3333------------------------------1111 >C-phycocyanin beta chain; SWP:P50033; PDB:1JBOB; MLDAFAKVVAQADARGEFLTNAQFDALSNLVKEGNKRLDAVNRITSNASTIVANAARALF ------------1111-------------------------------------------- AEQPQLIQPGGAYTNRRMAACLRDMEIILRYVTYAILAGDSSVLDDRCLNGLRETYQALG ---33332222-------------------------------------2222-------- TPGSSVAVAIQKMKDAAIAIANDPNGITPGDCSALMSEIAGYFDRAAAAVA -3333--------------1111-----------------------3333- >CYSTATHIONINE BETA-SYNTHA; SWP:P35520; PDB:1JBQA; WIRPDAPSRCTWQLGRPASESPHHHTAPAKSPKILPDILKKIGDTPMVRINKIGKKFGLK --1111------22223333----------------3333-----------3333----- CELLAKCEFFNAGGSVKDRISLRMIEDAERDGTLKPGDTIIEPTSGNTGIGLALAAAVRG ------11111111--------------------2222-------3333----------- YRCIIVMPEKMSSEKVDVLRALGAEIVRTPTESHVGVAWRLKNEIPNSHILDQYRNASNP -----------3333--------------------------1111------3333----- LAHYDTTADEILQQCDGKLDMLVASVGTGGTITGIARKLKEKCPGCRIIGVDPEGSILAE --------------iiii------------------------1111-------------- PEELNQTEQTTYEVEGIGYDFIPTVLDRTVVDKWFKSNDEEAFTFARMLIAQEGLLCGGS 3333------------------11113333---------------------------333 AGSTVAVAVKAAQELQEGQRCVVILPDSVRNYMTKFLSDRWMLQKGFL 
3-----3333-33332222---------1111--1111---------- >FOLYLPOLYGLUTAMATE SYNTHA; SWP:P15925; PDB:1JBWA; MNYTETVAYIHSFPRLAKTGDHRRILTLLHALGNPQQQGRYIHVTGTNGKGSAANAIAHV -----------------------------11113333----------------------- LEASGLTVGLYTSPFIMRFNERIMIDHEPIPDAALVNAVAFVRAALERLQQQQADFNVTE -----------------3333---%%%%------------------------1111---- FEFITALAYWYFRQRQVDVAVIEVGIGGDTDSTNVITPVVSVLTEVALDHQKLLGHTITA ---------------------------1111------------------1111------- IAKHAGIIKRGIPVVTGNLVPDAAAVVAAKVATTGSQWLRFDRDFSVPKAKLHGWGQRFT ----11112222---------------------------2222----------------- YEDQDGRISDLEVPLVGDYQQRNMAIAIQTAKVYAKQTEWPLTPQNIRQGLAASHWPARL --1111----------3333------------------------------1111------ EKISDTPLIVIDGAHNPDGINGLITALKQLFSQPITVIAGILADKDYAAMADRLTAAFST -------------------------------------------3333------3333--- VYLVPVPGTRLKDSWQEALAASLNDVPDQPIVITGSLYLASAVRQTLLG -------------------------1111-------------------- >TROPONIN C, SKELETAL MUSC; SWP:P02588; PDB:1JC2A; EDAKGKSEEELANCFRIFDKNADGFIDIEELGEILRATGEHVIEEDIEDLMKDSDKNNDG -------------3333---------3333----3333----3333-------------- RIDFDEFLKMMEGVQ --------------- >METHYLMALONYL-COA EPIMERA; SWP:Q8VQN0; PDB:1JC4A; NEDLFICIDHVAYACPDADEASKYYQETFGWHELHREENPEQGVVEIAPAAKLTEHTQVQ ---------------------------------------1111----------------- VAPLNDESTVAKWLAKHNGRAGLHHAWRVDDIDAVSATLRERGVQLLYDEPKLGTGGNRI ----1111------1111------------------------------------------ NFHPKSGKGVLIELTQYPK --3333iiii--------- >VENOM BASIC PROTEASE INHI; SWP:P25660; PDB:1JC6A; KNRPTFCNLLPETGRCNALIPAFYYNSHLHKCQKFNYGGCGGNANNFKTIDECQRTCAAK ----3333------------------3333---------------------3333----- YGRSS ----- >TECHYLECTIN-5A; SWP:Q9U8W8; PDB:1JC9A; DPTDCADILLNGYRSSGGYRIWPKSWMTVGTLNVYCDMETDGGGWTVIQRRGNYGNPSDY --------1111----------1111-------------iiii-------------1111 FYKPWKNYKLGFGNIEKDFWLGNDRIFALTNQRNYMIRFDLKDKENDTRYAIYQDFWIEN -------------1111-------------------------1111-------------3 
EDYLYCLHIGNYSGDAGNSFGRHNGHNFSTIDKDHDTHETHCAQTYKGGWWYDRCHESNL 333----------------3333------1111-------3333--------------11 NGLYLNGEHNSYADGIEWRAWKGYHYSLPQVEMKIRPVEF 11---------------3333------------------- >ROD SHAPE-DETERMINING PRO; SWP:Q9WZ57; PDB:1JCFA; MLRKDIGIDLGTANTLVFLRGKGIVVNEPSVIAIDSTTGEILKVGLEAKNMIGKTPATIK ---------------------------------------------3333-2222-1111- AIRPMRDGVIADYTVALVMLRYFINKAKGGMNLFKPRVVIGVPIGITDVERRAILDAGLE -----iiii---------------------------------1111-------------- AGASKVFLIEEPMAAAIGSNLNVEEPSGNMVVDIGGGTTEVAVISLGSIVTWESIRIAGD ----------------1111-1111-------------------%%%%------------ EMDEAIVQYVRETYRVAIGERTAERVKIEIGNVFPSKENDELETTVSGIDLSTGLPRKLT -----------------------------------3333--------------------- LKGGEVREALRSVVVAIVESVRTTLEKTPPELVSDIIERGIFLTGGGSLLRGLDTLLQKE -3333-------------------1111----------------1111------------ TGISVIRSEEPLTAVAKGAGMVLDKVNILKKLQGAG ---------3333------------3333------- >Tryptophan biosynthesis p; SWP:P00909; PDB:1JCMP; MQCVLAKIVADKAIWVEARKQQQPLASFQNEVQPSTRHFYDALQGARTAFILECKKASPS --3333-----------------3333----------3333------------------- KGVIRDDFDPARIAAIYKHYASAISVLTDEKYFQGSFNFLPIVSQIAPQPILCKDFIIDP -----------33333333----------1111--3333--------------------- YQIYLARYYQADACLLMLSVLDDDQYRQLAAVAHSLEMGVLTEVSNEEEQERAIALGAKV ------1111------1111---------------------------------------- VGINNRDLCDLSIDLNRTRELAPKLGHNVTVISESGINTYAQVRELSHFANGFLIGSALM ---------------3333--3333--------------------1111------3333- AHDDLHAAVRRVLLGENKV ---------------%%%% >INOSINE MONOPHOSPHATE DEH; SWP:P20839; PDB:1JCNA; TGYVPEDGLTAQQLFASADDLTYNDFLILPGFIDFIADEVDLTSALTRKITLKTPLISSP ---------3333--------1111----------1111-------1111---------- MDTVTEADMAIAMALMGGIGFIHHNCTPEFQANEVRKVKNFEQGFITDPVVLSPGIPITE 1111-------------------------------------2222--------------- VGIVTSRDIDPRIELVVAPAGVTLKEANEILQRSKKGKLPIVNDCDELVRTDLKKNRDYP 
----3333--------------3333---3333-------------------------11 LASKDSQKQLLCGAAVGTREDDKYRLDLLTQAGVDVIVLDSSQGNSVYQIAMVHYIKQKY 11--1111----------3333-------------------------------------1 PHLQVIGGNVVTAAQAKNLIDAGVDGLRVGMGCGSICITQEVMACGRPQGTAVYKVAEYA 111--------3333---------------------1111-------------------- RRFGVPIIADGGIQTVGHVVKALALGASTVMMGSLLAATTEAPGEKGSIQKFVPYLIAGI 1111------------------1111------3333--1111-----3333--------- QHGCQDIGARSLSVLRSMMYSGELKFEKRTMSAQI ----------------------------------- >PROTEIN FARNESYLTRANSFERA; SWP:Q04631; PDB:1JCRA; FLSLDSPTYVLYRDRAEWADIDPVPQNDGPSPVVQIIYSEKFRDVYDYFRAVLQRDERSE --1111----33333333------------------------------------------ RAFKLTRDAIELNAANYTVWHFRRVLLRSLQKDLQEEMNYIIAIIEEQPKNYQVWHHRRV ------------1111-----------1111----------------------------- LVEWLKDPSQELEFIADILNQDAKNYHAWQHRQWVIQEFRLWDNELQYVDQLLKEDVRNN -------1111------1111--------------------1111----------1111- SVWNQRHFVISNTTGYSDRAVLEREVQYTLEMIKLVPHNESAWNYLKGILQDRGLSRYPN -----------------------------------1111-----------33333333-- LLNQLLDLQPSHSSPYLIAFLVDIYEDMLENQCDNKEDILNKALELCEILAKEKDTIRKE -----1111---------------------------------------------3333-- YWRYIGRSLQSKHSRESDIPASV -------------3333--3333 >CONSERVED PROTEIN MTH1692; SWP:O27727; PDB:1JCUA; MLIRKITRKNPSPDVLEEAISVMEGGGIVIYPTDTIYGLGVNALDEDAVRRLFRVKGRSP ---------------------3333----------------1111--------------- HKPVSICVSCVDEIPRFSRPSGDAMELMERILPGPYTVVLERNELIPDVITGGSSRVGIR ---------3333---------------------------------3333---------- VPDDEICRRIAARFPVTATSANISGKPPSPRLEEIVRDLDAVDLVLDAGDCLDMEPSTVI ---3333-3333------------------3333-------------------------- DLTVNPPRVLRRGKGPLDPVLLRGAGDV ---------------------------- >CARBONIC ANHYDRASE XII; SWP:O43570; PDB:1JD0A; KWTYFGPDGENSWSKKYPSCGGLLQSPIDLHSDILQYDASLTPLEFQGYNLSANKQFLLT -----11111111-----1111--------3333---1111----------1111----- NNGHSVKLNLPSDMHIQGLQSRYSATQLHLHWGNPNDPHGSEHTVSGQHFAAELHIVHYN ----------1111-------------------1111-------iiii------------ 
SDLYPDASTASNKSEGLAVLAVLIEMGSFNPSYDKIFSHLQHVKYKGQEAFVPGFNIEEL -----33331111----------------3333-----1111--2222---------111 LPERTAEYYRYRGSLTTPPCNPTVLWTVFRNPVQISQEQLLALETALYCTHMDDPSPREM 1--3333-------------------------------------------1111------ INNFRQVQKFDERLVYTSFS -------------------- >HYPOTHETICAL 13.9 KDA PRO; SWP:P40037; PDB:1JD1A; TTLTPVICESAPAAAASYSHAMKVNNLIFLSGQIPVTPDNKLVEGSIADKAEQVIQNIKN -------1111------------!!!!---------1111-------------------- VLEASNSSLDRVVKVNIFLADINHFAEFNSVYAKYFNTHKPARSCVAVAALPLGVDMEME -------1111---------1111--------------------------2222------ AIAAER ------ >APOPTOSIS 1 INHIBITOR; SWP:Q24306; PDB:1JD5A; GNYFPPEYAIETARLRTFEAWPRNLKQKP -----333-3333----11113333---- >Transcription factor 7-li; SWP:Q9NQB0; PDB:1JDHB; LGANDELISFKDEGEQEEKSSENSSAERDLADVKSSLV -------------------------3333--------- >CYTOCHROME C2, ISO-2; SWP:P81154; PDB:1JDLA; GDPAKGEAVFKKCMACHRVGPDAKNLVGPALTGVIDRQAGTAPGFNYSAINHAAGEAGLH --------3333------------------2222-------2222---------1111-- WTPENIIAYLPDPNAFLRKFLADAGHAEQAKGSTKMVFKLPDEQERKDVVAYLKQFSP --------3333---------11113333------------------------1111- >ATRIAL NATRIURETIC PEPTID; SWP:P17342; PDB:1JDNA; PQKIEVLVLLPQDDSYLFSLTRVRPAIEYALRSVEGLPPGTRFQVAYEDSDCGNRALFSL ------------3333--3333---------------------------%%%%------- VDRVAAARGAKPDLILGPVCEYAAAPVARLASHWDLPMLSAGALAAGFQHKDSEYSHLTR ------%%%%----------------------------------3333----1111---- VAPAYAKMGEMMLALFRHHHWSRAALVYSDDKLERNCYFTLEGVHEVFQEEGLHTSIYSF ---3333----------------------------------------------------- DETKDLDLEDIVRNIQASERVVIMCASSDTIRSIMLVAHRHGMTSGDYAFFNIELFNSSS 3333----------1111--------------------1111------------%%%%11 YGDGSWKRGDKHDFEAKQAYSSLQTVTLLRTVKPEFEKFSMEVKSSVEKQGLNMEDYVNM 11-------1111------1111---------3333------------------------ FVEGFHDAILLYVLALHEVLRAGYSKKDGGKIIQQTWNRTFEGIAGQVSIDANGDRYGDF -------------------1111-1111-----1111-----1111----1111------ SVIAMTDVEAGTQEVIGDYFGKEGRFEMRPNVKYPWGPLKLRIDENR ------3333----------1111------------1111------- 
>HYPOTHETICAL PROTEIN TM09; SWP:Q9X078; PDB:1JDQA; GSSHHHHHHSSGLVPRGSHMAKYQVTKTLDVRGEVCPVPDVETKRALQNMKPGEILEVWI ---------------------------------------------3333----------- DYPMSKERIPETVKKLGHEVLEIEEVGPSEWKIYIKVK -------------------------------------- >L-ARGININE\:GLYCINE AMIDI; SWP:P50440; PDB:1JDW; CPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYWPFYQKQGGHYFPKDHLKKA -------------------2222---------11113333------2222---------- VAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGLYSAMPRDILIVVGNEIIEAP ------------------------1111---1111--------3333----!!!!----- MAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADELYNQDYPIHSVEDRHKLAAQGK --1111--------------1111-----------3333----------------1111- FVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWMRRHLAPDYRVHIISFKDPNPMH -----------1111-----------1111------------------------------ IDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITPPTPIIPDDHPLWMSSKWLSMNVL --------2222---1111-1111---1111-----------1111-----1111----- MLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNANSLGGGFHCWTCDVRRRGTLQSYLD --1111---3333-------1111-------33331111---1111-------------- >5'-METHYLTHIOADENOSINE PH; SWP:P50389; PDB:1JE0A; PVHILAKKGEVAERVLVVGDPGRARLLSTLLQNPKLTNENRGFLVYTGKYNGETVSIATH ------2222---------3333----1111-------2222-------%%%%------- GIGGPSIAIVLEELAMLGANVFIRYGTTGALVPYINLGEYIIVTGASYNQGGLFYQYLRD --------------1111-------------33332222--------------------- NACVASTPDFELTNKLVTSFSKRNLKYYVGNVFSSDAFYAEDEEFVKKWSSRGNIAVEME --------------------1111-----------------1111----1111------- CATLFTLSKVKGWKSATVLVVSDNLAEELEKSVMDGAKAVLDTLTS ------------------------------------------1111 >HYPOTHETICAL 8.6 KDA PROT; SWP:P31065; PDB:1JE3A; MGSSHHHHHHSSGLVPRGSHMKNIVPDYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVV -----------------------------------------------1111--------- SDCPQSINNIPLDARNHGYTVLDIQQDGPTIRYLIQK --------3333------------------------- >HELIX-DESTABILIZING PROTE; SWP:P03696; PDB:1JE5A; MAKKIFTSALGTAEPYAYIAKPDYGNGFGNPRGVYKVDLTIPNKDPRCQRMVDEIVKCHE -----------------------------------------1111--------------- 
EAYAAAVEEYEANPPPLKPYEGDMPFFDNGDGTTTFKFKCYASFQDKKTKETKHINLVVV ------------------------------------------------------------ DSKGKKMEDVPIIGGGSKLKVKYSLVPYKWNTAVGASVKLQLESVMLVELATDWADEVEE 1111---------2222-------------3333-------------------3333--- N - >MHC CLASS I CHAIN-RELATED; SWP:NA; PDB:1JE6A; MEPHSLRYNLMVLSQDGSVQSGFLAEGHLDGQPFLRYDRQKRRAKPQGQWAEDVLGAETW ----------------------------iiii-----------------------3333- DTETEDLTENGQDLRRTLTHIKDQKGGLHSLQEIRVCEIHEDSSTRGSRHFYYNGELFLS ----------------------------------------------------iiii---- QNLETQESTVPQSSRAQTLAMNVTNFWKEDAMKTKTHYRAMQADCLQKLQRYLKSGVAIR -------------------------------------------------------1111- RTVPPMVNVTCSEVSEGNITVTCRASSFYPRNITLTWRQDGVSLSHNTQQWGDVLPDGNG ------------------------------------------------------------ TYQTWVATRIRQGEEQRFTCYMEHSGNHGTHPVPS -------------3333------iiii-------- >NITRATE/NITRITE RESPONSE ; SWP:P10957; PDB:1JE8A; RDVNQLTPRERDILKLIAQGLPNKIARRLDITESTVKVHVKHLKKKLKSRVEAAVWVHQE -1111-----------1111------1111------------------------------ RIF --- >HEMOGLOBIN ZETA CHAIN; SWP:P02008; PDB:1JEBA; SLTKTERTIIVSMWAKISTQADTIGTETLERLFLSHPQTKTYFPHFDLHPGSAQLRAHGS -----------------------------------3333---1111--2222-------- KVVAAVGDAVKSIDDIGGALSKLSELHAYILRVDPVNFKLLSHCLLVTLAARFPADFTAE -----------1111------------------3333----------------3333--- AHAAWDKFLSVVSSVLTEKYR ----------------3333- >Hemoglobin subunit beta-1; SWP:P02088; PDB:1JEBB; VHLTDAEKAAVSGLWGKVNADEVGGEALGRLLVVYPWTQRYFDSFGDLSSASAIMGNAKV ------------3333------------------333311113333-------------- KAHGKKVITAFNDGLNHLDSLKGTFASLSELHCDKLHVDPENFRLLGNMIVIVLGHHLGK ------------3333----------------------3333----------------33 DFTPAAQAAFQKVVAGVAAALAH 33--------------------- >EMERIN; SWP:P50402; PDB:1JEIA; DNYADLSDTELTTLLRRYNIPHGPVVGSTRRLYEKKIFEYETQRRRLSPPSSS -------------------------------------3333------------ >HYPOTHETICAL PROTEIN MJ12; SWP:Q58644; PDB:1JEOA; 
LEELDIVSNNILILKKFYTNDEWKNKLDSLIDRIIKAKKIFIFGVGRSGYIGRCFAMRLM -------------3333---3333---------------------3333----------1 HLGFKSYFVGETTTPSYEKDDLLILISGSGRTESVLTVAKKAKNINNNIIAIVEGNVVEF 111--------------1111--------------------1111---------3333-- ADLTIPLEVKKSKYLPMGTTFEETALIFLDLVIAEIMKRLNLDESEIIKRHNLL -----------1111!!!!-----------------------3333-------- >CUCUMBER STELLACYANIN; SWP:P29602; PDB:1JER; MQSTVHIVGDNTGWSVPSSPNFYSQWAAGKTFRVGDSLQFNFPANAHNVHEMETKQSFDA ------2222--------1111----1111--2222------2222-------------- CNFVNSDNDVERTSPVIERLDELGMHYFVCTVGTHCSNGQKLSINVVAAN --1111-------------------------!!!!1111----------- >OLIGO-PEPTIDE BINDING PRO; SWP:P06202; PDB:1JETA; ADVPAGVQLADKQTLVRNNGSEVQSLDPHKIEGVPESNVSRDLFEGLLISDVEGHPSPGV ---2222-------------------1111--------3333--------1111------ AEKWENKDFKVWTFHLRENAKWSDGTPVTAHDFVYSWQRLADPNTASPYASYLQYGHIAN ------%%%%------1111-1111---3333---------3333-1111---------- IDDIIAGKKPATDLGVKALDDHTFEVTLSEPVPYFYKLLVHPSVSPVPKSAVEKFGDKWT ---------1111------1111--------11113333-3333----------!!!!-- QPANIVTNGAYKLKNWVVNERIVLERNPQYWDNAKTVINQVTYLPISSEVTDVNRYRSGE 3333----------------------1111-3333------------------------- IDMTYNNMPIELFQKLKKEIPNEVRVDPYLCTYYYEINNQKAPFNDVRVRTALKLALDRD --------3333-------3333--------------1111----------------333 IIVNKVKNQGDLPAYSYTPPYTDGAKLVEPEWFKWSQQKRNEEAKKLLAEAGFTADKPLT 3-----------------1111-------3333--------------------3333--- FDLLYNTSDLHKKLAIAVASIWKKNLGVNVNLENQEWKTFLDTRHQGTFDVARAGWCADY ------------------------------------------------------------ NEPTSFLNTMLSDSSNNTAHYKSPAFDKLIADTLKVADDTQRSELYAKAEQQLDKDSAIV -3333-----1111--1111-----------1111------------------------- PVYYYVNARLVKPWVGGYTGKDPLDNIYVKNLYIIKH -----------1111------1111--3333------ >KU70; SWP:P12956; PDB:1JEYA; GRDSLIFLVDASKAMFESQSEDELTPFDMSIQCIQSVYISKIISSDRDLLAVVFYGTEKD -----------3333--------------------------1111--------------- KNSVNFKNIYVLQELDNPGAKRILELDQFKGQQGQKRFQDMMGHGSDYSLSEVLWVCANL 
-1111------------------------------------------------------- FSDVQFKMSHKRIMLFTNEDNPHGNDSAKASRARTKAGDLRDTGIFLDLMHLKKPGGFDI ----------------------11113333----------1111---------2222-33 SLFYRDIISVHFEESSKLEDLLRKVRAKETRKRALSRLKLKLNKDIVISVGIYNLVQKAL 333333------------------------------------------------------ KPPPIKLYRETNEPVKTKTRTFNTSTGGLLLPSDTKRSQIYGSRQIILEKEETEELKRFD -------1111-------------------1111--------------3333-1111--- DPGLMLMGFKPLVLLKKHHYLRPSLFVYPEESLVIGSSTLFSALLIKCLEKEVAALCRYT ----------3333-1111----------33332222----------------------- PRRNIPPYFVALVPQEEELDDQKIQVTPPGFQLVFLPFADDKRKMPFTEKIMATPEQVGK -------------------1111--------------3333------------------- MKAIVEKLRFTYRSDSFENPVLQQHFRNLEALALDLMEPEQAVDLTLPKVEAMNKRLGSL ----3333----1111---------------------------1111------------- VDEFKELVYPPDY ------------- >ATP-dependent DNA helicas; SWP:P13010; PDB:1JEYB; NKAAVVLCMDVGFTMSNSIPGIESPFEQAKKVITMFVQRQVFAENKDEIALVLFGTDGTD -----------3333-------------------------1111---------------- NPLSGGDQYQNITVHRHLMLPDFDLLEDIESKIQPGSQQADFLDALIVSMDVIQHETIGK ------------------------------------------------------------ KFEKRHIEIFTDLSSRFSKSQLDIIIHSLKKCDISLQFFLPFSLGGPFRLGGHGPSFPLK ------------------1111-------1111---------------2222-------- GITEQQKEGLEIVKMVMISLEGEDGLDEIYSFSESLRKLCVFKKIERHSIHWPCRLTIGS ---------------------11111111---------3333------------------ NLSIRIAAYKSILQERVKKTWTVVDAKTLKKEDIQKETVYCLNDDDETEVLKEDIIQGFR ------------------------------1111----------------1111------ YGSDIVPFSKVDEEQMKYKSEGKCFSVLGFCKSSQVQRRFFMGNQVLKVFAARDDEAAAV !!!!----3333-------------------3333-3333-----------2222----- ALSSLIHALDDLDMVAIVRYAYDKRANPQVGVAFPHIKHNYECLVYVQLPFMEDLRQYMF -------------------------------------1111---------1111------ SSLKNSKKYAPTEAQLNAVDALIDSMSLAKKDEKTDTLEDLFPTTKIPNPRFQRLFQCLL -----------------------1111---------------1111-------------- HRALHPREPLPPIQQHIWNMLNPPAEVTTKSQIPLSKIKTLFPLIEAKKK ----1111-----33331111--3333----------------------- >XYLOSE REDUCTASE; SWP:O74237; PDB:1JEZA; 
SIPDIKLSSGHLMPSIGFGCWKLANATAGEQVYQAIKAGYRLFDGAEDYGNEKEVGDGVK ------1111---------22223333------------------3333----------- RAIDEGLVKREEIFLTSKLWNNYHDPKNVETALNKTLADLKVDYVDLFLIHFPIAFKFVP --------1111-------1111-3333-------------------------------3 IEEKYPPGFYCGDGNNFVYEDVPILETWKALEKLVAAGKIKSIGVSNFPGALLLDLLRGA 333---!!!!--!!!!------3333---------------------------------- TIKPAVLQVEHHPYLQQPKLIEFAQKAGVTITAYSTLFAHDTIKAIAAKYNKTPAEVLLR -----------1111---------1111-------1111--------------------- WAAQRGIAVIPKSNLPERLVQNRSFNTFDLTKEDFEEIAKLDIGLRFNDPWDWDNIPIFV -3333----------11111111---------------1111------3333-------- >OBELIN; SWP:Q27709; PDB:1JF0A; KYAVKLQTDFDNPKWIKRHKFMFDYLDINGNGQITLDEIVSKASDDICKNLGATPAQTQR --------1111------------------------------------------------ HQDCVEAFFRGCGLEYGKETKFPEFLEGWKNLANADLAKWARNEPTLIREWGDAVFDIFD ---------1111-2222---------------------1111------------3333- KDGSGTITLDEWKAYGRISGISPSEEDCEKTFQHCDLDNSGELDVDEMTRQHLGFWYTLD ----------------3333-----------------1111------------------3 PEADGLYGNGVP 333-1111---- >MONOMER HEMOGLOBIN COMPON; SWP:P02216; PDB:1JF3A; GLSAAQRQVVASTWKDIAGADNGAGVGKECLSKFISAHPEMAAVFGFSGASDPGVAELGA --------------------iiii-------------3333-------1111-------- KVLAQIGVAVSHLGDEGKMVAEMKAVGVRHKGYGNKHIKAEYFEPLGASLLSAMEHRIGG -----------3333-------------3333!!!!--3333---------------!!! KMNAAAKDAWAAAYGDISGALISGLQS !-------------------------- >MONOMER HEMOGLOBIN COMPON; SWP:P15447; PDB:1JF4A; GLSAAQRQVVASTWKDIAGSDNGAGVGKECFTKFLSAHHDMAAVFGFSGASDPGVADLGA --------------------iiii-------------3333-1111--1111-------- KVLAQIGVAVSHLGDEGKMVAEMKAVGVRHKGYGNKHIKAEYFEPLGASLLSAMEHRIGG ---------1111--3333---------3333------3333---------------!!! 
KMNAAAKDAWAAAYADISGALISGLQS !-------------------------- >ARSENATE REDUCTASE; SWP:P30330; PDB:1JF8A; DKKTIYFISTGNSARSQMAEGWGKEILGEGWNVYSAGIETHGVNPKAIEAMKEVDIDISN ---------------------------------------------------1111--111 HTSDLIDNDILKQSDLVVTLCSDADNNCPILPPNVKKEHWGFDDPAGKEWSEFQRVRDEI 1------------------------------1111---------22223333-------- KLAIEKFKLR ------1111 >SELENOCYSTEINE LYASE; SWP:P77444; PDB:1JF9A; IFSVDKVRADFPVLSREVNGLPLAYLDSAASAQKPSQVIDAEAEFYRHGYAAVHRGIHTL -----------------iiii-----3333----3333---------------------- SAQATEKMENVRKRASLFINARSAEELVFVRGTTEGINLVANSWGNSNVRAGDNIIISQM ----------------------1111-----------------------2222----111 EHHANIVPWQMLCARVGAELRVIPLNPDGTLQLETLPTLFDEKTRLLAITHVSNVLGTEN 13333--------------------1111--3333-----1111---------------- PLAEMITLAHQHGAKVLVDGAQAVMHHPVDVQALDCDFYVFSGHKLYGPTGIGILYVKEA ---------1111------1111------3333--------3333-------------33 LLQEMPPWEGGGSMIATVSLSEGTTWTKAPWRFEAGTPNTGGIIGLGAALEYVSALGLNN 33----------------------------1111-------------------------- IAEYEQNLMHYALSQLESVPDLTLYGPQNRLGVIAFNLGKHHAYDVGSFLDNYGIAVRTG --------------3333--------1111-------!!!!------------------- HHCAMPLMAYYNVPAMCRASLAMYNTHEEVDRLVTGLQRIHRLLG %%%%----1111---------1111-------------------- >NITRIC-OXIDE REDUCTASE CY; SWP:P23295; PDB:1JFBA; APSFPFSRASGPEPPAEFAKLRATNPVSQVKLFDGSLAWLVTKHKDVCFVATSEKLSKVR ---------1111------------------1111-----------------1111--11 TRQGFPELSASGKQAAKAKPTFVDMDPPEHMHQRSMVEPTFTPEAVKNLQPYIQRTVDDL 11------------3333--3333------------3333-------------------- LEQMKQKGCANGPVDLVKEFALPVPSYIIYTLLGVPFNDLEYLTQQNAIRTNGSSTAREA --------1111----1111---------------1111-------3333-1111----- SAANQELLDYLAILVEQRLVEPKDDIISKLCTEQVKPGNIDKSDAVQIAFLLLVAGNATM ----------------------------------1111---------------------- VNMIALGVATLAQHPDQLAQLKANPSLAPQFVEELCRYHTASALAIKRTAKEDVMIGDKL -----------------------3333----------------------------!!!!- VRANEGIIASNQSANRDEEVFENPDEFNMNRKWPPQDPLGFGFGDHRCIAEHLAKAELTT 
-2222-----3333--3333--1111-1111-------1111-11111111--------- VFSTLYQKFPDLKVAVPLGKINYTPLNRDVGIVDLPVIF --------1111----3333----1111----------- >TRANSCRIPTION REGULATOR N; SWP:Q14919; PDB:1JFIA; ARFPPARIKKIMQTDEEIGKVAAAVPVIISRALELFLESLLKKACQVTQSRTMTTSHLKQ --------------3333---3333----------------------1111--3333111 CIE 1-- >TATA-binding protein-asso; SWP:Q01658; PDB:1JFIB; DDLTIPRAAINKMIKETLPNVRVANDARELVVNCCTEFIHLISSEANEICNKSEKKTISP -----3333--------------3333--------------------------------- EHVIQALESLGFGSYISEVKEVLQECKTVALKRRKASSRLENLGIPEEELLRQQQELFAK -----------1111--------------------------------------------- ARQQQAELAQQEWLQ ----------1111- >CALCIUM-BINDING PROTEIN; SWP:P38505; PDB:1JFJA; MAEALFKEIDVNGDGAVSYEEVKAFVSKKRAIKNEQLLQLIFKSIDADGNGEIDQNEFAK ------3333---------------3333---3333-----------------------3 FYGSIQGQDLSDDKIGLKVLYKLMDVDGDGKLTKEEVTSFFKKHGIEKVAEQVMKADANG 333------3333---------------------------3333-3333----------- DGYITLEEFLEFSL ----------3333 >ASPARTATE RACEMASE; SWP:O58403; PDB:1JFLA; MKTIGILGGMGPLATAELFRRIVIKTPAKRDQEHPKVIIFNNPQIPDRTAYILGKGEDPR ----------------------1111---3333--------1111--------------- PQLIWTAKRLEECGADFIIMPCNTAHAFVEDIRKAIKIPIISMIEETAKKVKELGFKKAG ---------------------3333-------1111---------------1111----- LLATTGTIVSGVYEKEFSKYGVEIMTPTEDEQKDVMRGIYEGVKAGNLKLGRELLLKTAK -----------------1111------3333----------3333--------------- ILEERGAECIIAGCTEVSVVLKQDDLKVPLIDPMDVIAEVAVKVALEK --1111-------3333----3333------3333------------- >RETINOIC ACID EARLY TRANS; SWP:O08603; PDB:1JFMA; DAHSLRCNLTIKDPTPADPLWYEAKCFVGEILILHLSNINKTMTSGDPGETANATEVKKC ------------------------------------------------------------ LTQPLKNLCQKLRNKVSNTKVDTHKTNGYPHLQVTMIYPQSQGRTPSATWEFNISDSYFF ---------------3333----------------------------------iiii--- TFYTENMSWRSANDESGVIMNKWKDDGEFVKQLKFLIHECSQKMDEFLKQSKEK ------------3333-----3333----------------------------- >Ig heavy chain V region 3; SWP:P01747; PDB:1JFQH; VQLQQSGVELVRAGSSVKMSCKASGYTFTSNGINWVKQRPGQGLEWIGYNNPGNGYITYN 
-----------2222-----------1111-----------------------------3 EKFKGKTTLTVDKSSNTAYMQLRSLTSEDSAVYFCARSEYYGGSYKFDYWGQGTTLTVSS 333---------1111---------1111------------------------------- AGTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ----------------------------------------%%%%---------------- LYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPR ----------3333-----------3333----------- >LIPASE; SWP:Q56008; PDB:1JFRA; NPYERGPAPTNASIEASRGPYATSQTSVSSLVASGFGGGTIYYPTSTADGTFGAVVISPG 1111-----3333---------------3333--------------1111---------2 FTAYQSSIAWLGPRLASQGFVVFTIDTNTTLDQPDSRGRQLLSALDYLTQRSSVRTRVDA 2223333--------1111---------1111-------------------1111---11 TRLGVMGHSMGGGGSLEAAKSRTSLKAAIPLTGWNTDKTWPELRTPTLVVGADGDTVAPV 11-------------------1111--------------1111--------1111---33 ATHSKPFYESLPGSLDKAYLELRGASHFTPNTSDTTIAKYSISWLKRFIDSDTRYEQFLC 33-----11111111----------1111----------------------33331111- PIPRPSLTIAEYRGTCPHTS -----1111----------- >THIOL:DISULFIDE INTERCHAN; SWP:P43221; PDB:1JFUA; TGDPACRAAVATAQKIAPLAHGEVAALTMASAPLKLPDLAFEDADGKPKKLSDFRGKTLL --3333---------3333-!!!!------------------1111---33332222--- VNLWATWCVPCRKEMPALDELQGKLSGPNFEVVAINIDTRDPEKPKTFLKEANLTRLGYF ----1111------------------1111----------1111----------1111-- NDQKAKVFQDLKAIGRALGMPTSVLVDPQGCEIATIAGPAEWASEDALKLIRAATG -1111------1111-----------1111----------1111------------ >TAT PROTEIN; SWP:P04610; PDB:1JFWA; MEPVDPRLEPWKHPGSQPKTACTTCYCKKCCFHCQVCFTTKALGISYGRKKRRQRRRPPQ -------------------------------------3333------------------- GSQTHQVSLSKQPTSQPRGDPTGPKE -------------------------- >1,4-BETA-N-ACETYLMURAMIDA; SWP:P25310; PDB:1JFXA; DTSGVQGIDVSHWQGSINWSSVKSAGMSFAYIKATEGTNYKDDRFSANYTNAYNAGIIRG ----------3333---------------------------1111-------1111---- AYHFARPNASSGTAQADYFASNGGGWSRDNRTLPGVLDIEHNPSGAMCYGLSTTQMRTWI -----3333------------------------------------1111----------- NDFHARYKARTTRDVVIYTTASWWNTCTGSWNGMAAKSPFWVAHWGVSAPTVPSGFPTWT -------------------------1111--1111-----------------1111---- 
FWQYSATGRVGGVSGDVDRNKFNGSAARLLALANNTA ---------2222------------------------ >PROTEIN-L-ISOASPARTATE O-; SWP:Q8TZR3; PDB:1JG1A; EKELYEKWMRTVEMLKAEGIIRSKEVERAFLKYPRYLSVEDKYKKYAHIDEPLPIPAGQT -----------------------------------111133331111--------iiii- VSAPHMVAIMLEIANLKPGMNILEVGTGSGWNAALISEIVKTDVYTIERIPELVEFAKRN ----------------2222------!!!!------------------------------ LERAGVKNVHVILGDGSKGFPPKAPYDVIIVTAGAPKIPEPLIEQLKIGGKLIIPVGSYH -1111---------3333-3333---------------333311112222--------11 LWQELLEVRKTKDGIKIKNHGGVAFVPLIGEYGWK 11--------1111---------------1111-- >GTP CYCLOHYDROLASE I FEED; SWP:P70552; PDB:1JG5A; PYLLISTQIRMEVGPTMVGDEHSDPELMQQLGASKRRVLGNNFYEYYVNDPPRIVLDKLE ---------3333------11113333-1111-----2222---------3333---333 CRGFRVLSMTGVGQTLVWCLHKE 3---------------------- >BACTERIOFERRITIN; SWP:Q59738; PDB:1JGCA; MKGDAKVIEFLNAALRSELTAISQYWVHFRLQEDWGLAKMAKKSREESIEEMGHADKIIA ---3333----------------------------------------------------- RILFLEGHPNLQKLDPLRIGEGPRETLECDLAGEHDALKLYREARDYCAEVGDIVSKNIF ------------------------------------------------------------ ESLITDEEGHVDFLETQISLYDRLGPQGFALLNAAPMDAA ------------------------------1111-1111- >SEGMENTATION PROTEIN EVEN; SWP:P06602; PDB:1JGGA; RYRTAFTRDQLGRLEKEFYKENYVSRPRRCELAAQLNLPESTIKVWFQNRRMKDKRQ ------3333-------3333------------1111-3333-----------3333 >IG KAPPA-CHAIN; SWP:NA; PDB:1JGLH; QIQLVQSGPELKKPGETVRISCKASDYMTSGMQWVQQMPGKGLKWIGWLNTQSGVPEYAE ------------2222---------------------2222-----------------33 DFKGRFAFSLETTAYLQINNL 33------------------- >MULTIPLE ANTIBIOTIC RESIS; SWP:P27245; PDB:1JGSA; LFNEIIPLGRLIHMVNQKKDRLLNEYLSPLDITAAQFKVLCSIRCAACITPVELKKVLSV --------------------------1111-------------1111------------- DLGALTRMLDRLVCKGWVERLPNPNDKRGVLVKLTTGGAAICEQCHQLVGQDLHQELTKN ------------1111------1111---------------------------------- LTADEVATLEYLLKKVLP -----------3333--- >ANTIBODY LIGHT CHAIN; SWP:NA; PDB:1JGUH; 
EVKLVESRGGLVKPGGSLQLSCAASGFTFSGYAMSWFRLTPEKRLEWVASIYNGFRIHYL ------------2222------------2222-------1111----------------3 DSVKGRFTISSDYARNILYLQMSTL 333---------------------- >TYROSYL-TRNA SYNTHETASE; SWP:P00952; PDB:1JH3A; ALFSGDIANLTAAEIEQGFKDVPSFVHEGGDVPLVELLVSAGISPSKRQAREDIQNGAIY ------11113333------------------1111--1111------------------ VNGERLQDVGAILTAEHRLEGRFTVIRRGKKKYYLIRYA iiii---1111--3333----------3333-------- >CYCLIC PHOSPHODIESTERASE; SWP:O04147; PDB:1JH6A; MEEVKKDVYSVWALPDEESEPRFKKLMEALRSEFTGPRFVPHVTVAVSAYLTADEAKKMF ------------------------------------------------------------ ESACDGLKAYTATVDRVSTGTFFFQCVFLLLQTTPEVMEAGEHCKNHFNCSTTTPYMPHL ---------------------1111----------------------------------- SLLYAELTEEEKKNAQEKAYTLDSSLDGLSFRLNRLALCKTDTEDKTLETWETVAVCNLN -------------------------2222------------1111--1111--------- P - >SULFATE ADENYLYLTRANSFERA; SWP:Q54506; PDB:1JHDA; MIKPVGSDELKPLFVYDPEEHHKLSHEAESLPSVVISSQAAGNAVMMGAGYFSPLQGFMN ---------------------------1111------------------1111------- VADAMGAAEKMTLSDGSFFPVPVLCLLENTDAIGDAKRIALRDPNVEGNPVLAVMDIEAI ------------1111-------------3333------------2222----------- EEVSDEQMAVMTDKVYRTTDMDHIGVKTFNSQGRVAVSGPIQVLNFSYFQADFPDTFRTA -------------------1111----1111------------------1111------- VEIRNEIKEHGWSKVVAFQTRNPMHRAHEELCRMAMESLDADGVVVHMLLGKLKKGDIPA -------1111------------------------------------------2222-33 PVRDAAIRTMAEVYFPPNTVMVTGYGFDMLYAGPREAVLHAYFRQNMGATHFIIGRDHAG 33------------------------------------------1111--------2222 VGDYYGAFDAQTIFDDEVPEGAMEIEIFRADHTAYSKKLNKIVMMRDVPDHTKEDFVLLS -----1111-3333----2222--------------1111---333311111111----- GTKVREMLGQGIAPPPEFSRPEVAKILMDYYQSINS -------1111---3333-----------3333--- >LEXA REPRESSOR; SWP:P03033; PDB:1JHFA; KALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKALARKGVIEIVSG -------------------------------------------------1111------- ASRGIRLLQEEEEGLPLVGRVAADEPLLAQQHIEGHYQVDPSLFKPNADFLLRVSGMSMK -------------------------11111111------1111-------------1111 
DIGIMDGDLLAVHKTQDVRNGQVVVARIDDEVTVKRLKKQGNKVELLPENSEFKPIVVDL ----2222----------2222-----iiii--------!!!!------3333-----11 RQQSFTIEGLAVGVIRN 11--------------- >TRP OPERON REPRESSOR; SWP:P03032; PDB:1JHGA; SAAMAEQRHQEWLRFVDLLKNAYQNDLHLPLLNLMLTPDEREALGTRVRIIEELLRGEMS ----------------------1111---------------------------------- QRELKNELGAGIATITRGSNSLKAAPVELRQWLEEVLLKSD ----------------------------------------- >APC10; SWP:Q9UM13; PDB:1JHJA; ATPNKTPPGADPKQLERTGTVREIGSQAVWSLSSCKPGFGVDQLRDDNLETYWQSDGSQP -1111----------1111----3333--------22223333----1111--------- HLVNIQFRRKTTVKTLCIYADYKSDESYTPSKISVRVGNNFHNLQEIRQLELVEPSGWIH --------------------3333!!!!-----------1111----------------- VPLTDNHKKPTRTFMIQIAVLANHQNGRDTHMRQIKIYTPV -----------------------%%%%-------------- >Ig heavy chain V region 3; SWP:P01749; PDB:1JHLH; QVQLQQSGAELVRPGASVKLSCKASGYTFISYWINWVKQRPGQGLEWIGNIYPSDSYTNY ------------2222-----------3333--------2222----------------- NQKFKDKATLTVDKSSSTAYMQLSSPTSEDSAVYYCTRDDNYGAMDYWGQGTTVTV 3333---------1111---------3333-------------------------- >Ig heavy chain V region 3; SWP:P01749; PDB:1JHLL; DIELTQSPSYLVASPGETITINCRASKSISKSLAWYQEKPGKTNNLLIYSGSTLQSGIPS -------------2222-------------------------------------222233 RFSGSGSGTDFTLTISSLEPEDFAMYICQQHNEYPWTFGGGTKLEIKR 33----------------1111-------------------------- >CALNEXIN; SWP:P24643; PDB:1JHNA; YKAPVPSGEVYFADSFDRGTLSGWILSKAKDGKWEVDEMKETKLPGDKGLVLMSRAKHHA ---------------3333----------------------------------------- ISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQFHDKTPYTIMFG -----------------------1111--------------------------------- PDKCGEDYKLHFIFRHKNPKTGVYEEKHAKRPDADLKTYFTDKKTHLYTLILNPDNSFEI ------------------------------------3333------------3333---- LVDQSIVNSGNPVNPSREIEDPEDQKPEDWDERPKIPDPDAVKPDDWNEDAPAKIPDEEA --------------------------1111-------------3333------------- TKPDGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANPKCESAPGCGVWQRPMIDNP -------------------------3333----------3333---------------11 
NYKGKWKPPMIDNPNYQGIWKPRKIPNPDFFEDLEPFKMTPFSAIGLELWSMTSDIFFDN 11------------------------1111------------------------------ FIVCGDRRVVDDWANDGWGL ------3333------3333 >MOG1 PROTEIN; SWP:P47123; PDB:1JHSA; MNNKEVELYGGAITTVVPPGFIDASTLREVPDTQAVYVNSRRDEEEFEDGLATNESIIVD -------iiii-----------1111----1111-------3333--------------- LLETVDKSDLKEAWQFHVEDLTELNGTTKWEALQEDTVQQGTKFTGLVMEVANKWGKPDL --------------------------------------2222----------11111111 AQTVVIGVALIRLTQFDTDVVISINVPLTKEEASQASNKELPARCHAVYQLLQEMVRKFH ------------3333-------------------1111--------------------- VVDTSLFA ---3333- >ABC TRANSPORTER; SWP:Q9X0M3; PDB:1JI0A; VSDIVLEVQSLHVYYGAIHAIKGIDLKVPRGQIVTLIGANGAGKTTTLSAIAGLVRAQKG ----------------------------2222------2222------------------ KIIFNGQDITNKPAHVINRGIALVPEGRRIFPELTVYENLGAYNRKDKEGIKRDLEWIFS ---iiii-2222------------------11113333---------------------- LFPRLKERLKQLGGTLSGGEQQLAIGRALSRPKLLDEPSLGLAPILVSEVFEVIQKINQE -3333--1111-----3333----------------1111-----------------111 GTTILLVEQNALGALKVAHYGYVLETGQIVLEGKASELLDNEVRKAYLGVA 1-----------------------iiii-----3333-----3333----- >ALPHA-AMYLASE I; SWP:Q60053; PDB:1JI1A; AANDNNVEWNGLFHDQGPLFDNAPEPTSTQSVTLKLRTFKGDITSANIKYWDTADNAFHW -------3333-----3333------1111--------2222------------------ VPMVWDSNDPTGTFDYWKGTIPASPSIKYYRFQINDGTSTAWYNGNGPSSTEPNADDFYI --------1111-----------------------!!!!----1111------------- IPNFKTPDWLKNGVMYQIFPDRFYNGDSSNDVQTGSYTYNGTPTEKKAWGSSVYADPGYD 2222--3333--------3333----3333--2222--iiii-----2222----22221 NSLVFFGGDLAGIDQKLGYIKKTLGANILYLNPIFKAPTNHKYDTQDYMAVDPAFGDNST 111-------------------------------------------1111-3333----- LQTLINDIHSTANGPKGYLILDGVFNHTGDSHPWFDKYNNFSSQGAYESQSSPWYNYYTF ----------------------------1111---1111-----3333---1111----- YTWPDSYASFLGFNSLPKLNYGNSGSAVRGVIYNNSNSVAKTYLNPPYSVDGWRLDAAQY ---------iiii---------2222--------1111------------------3333 VDANGNNGSDVTNHQIWSEFRNAVKGVNSNAAIIGEYWGNANPWTAQGNQWDAATNFDGF --iiii---------------------1111---------33331111------------ 
TQPVSEWITGKDYQNNSASISTTQFDSWLRGTRANYPTNVQQSMMNFLSNHDITRFATRS -----------1111-----------------3333-------------1111----111 GGDLWKTYLALIFQMTYVGTPTIYYGDEYGMQGGADPDNRRSFDWSQATPSNSAVALTQK 1------------------------3333--------------1111-3333-------- LITIRNQYPALRTGSFMTLITDDTNKIYSYGRFDNVNRIAVVLNNDSVSHTVNVPVWQLS -------3333-------------------------------------------3333-- MPNGSTVTDKITGHSYTVQNGMVTVAVDGHYGAVLAQ -2222-------------iiii--------------- >LIPASE; SWP:Q9L6D3; PDB:1JI3A; ASLRANDAPIVLLHGFTGWGREEMFGFKYWGGVRGDIEQWLNDNGYRTYTLAVGPLSSNW -------------------1111-------!!!!-------1111--------1111--- DRVCEAYVQLVGGTVDYGAAHAAKHGHARFGRTYPGLLPELKRGGRIHIIAHSQGGQTAR -------------------------------------3333------------------- MLVSLLENGSQEEREYAKAHNVSLSPLFEGGHHFVLSVTTIATPHDGTTLVNMVDFTDRF ------------------------3333---------------11113333--------- FDLQKAVLEAAAVASNVPYTSQVYDFKLDQWGLRRQPGESFDHYFERLKRSPVWTSTDTA --------1111---------------3333----2222-----------1111----33 RYDLSVSGAEKLNQWVQASPNTYYLSFSTERTYRGALTGNHYPELGMNAFSAVVCAPFLG 33----------------1111----------------------------------3333 SYRNPTLGIDSHWLENDGIVNTISMNGPKRGSNDRIVPYDGTLKKGVWNDMGTYNVDHLE ----1111-1111-------3333----2222------------------------1111 IIGVDPNPSFDIRAFYLRLAEQLASLQP ------1111------------1111-- >NEUTROPHIL-ACTIVATING PRO; SWP:P43313; PDB:1JI4A; MKTFEILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIVQ -----------------------------1111-------------------------11 LGHHPLVTLSEAIKLTRVKEETKTSFHSKDIFKEILEDYKYLEKEFKELSNTAEKEGDKV 11---------------------------------------------------1111--- TVTYADDQLAKLQKSIWMLQAHLA --------------------1111 >DLP-1; SWP:Q8RPQ2; PDB:1JI5A; QVIEVLNKQVADWSVLFTKLHNFHWYVKGPQFFTLHEKFEELYTESATHIDEIAERILAI 3333------------------------1111------------------------1111 GGKPVATMKEYLEISSIQEAAYGETAEGMVEAIMKDYEMMLVELKKGMEIAQNSDDEMTS ---------------------------------------------------1111----- DLLLGIYTELEKHAWMLRAFLN -----------------3333- >PESTICIDIAL CRYSTAL PROTE; SWP:Q06117; PDB:1JI6A; 
DAVGTGISVVGQILGVVGVPFAGALTSFYQSFLNTIWPSDADPWKAFMAQVEVLIDKKIE -------------------%%%%---3333-------1111------------------- EYAKSKALAELQGLQNNFEDYVNALNSWKKTPLSLRSKRSQDRIRELFSQAESHFRNSMP ---------------------------33333333----------------------333 SFAVSKFEVLFLPTYAQAANTHLLLLKDAQVFGEEWGYSSEDVAEFYHRQLKLTQQYTDH 3--22223333---------------3333---1111----------------------- CVNWYNVGLNGLRGSTYDAWVKFNRFRREMTLTVLDLIVLFPFYDIRLYSKGVKTELTRD --------1111---------------------3333--3333----------------- IFTDPIFSLNTLQEYGPTFLSIENSIRKPHLFDYLQGIEFHTRLQPGYFGKDSFNYWSGN ---------1111---------------------------------1111---------- YVETRPSIGSSKTITSPFYGDKSTEPVQKLSFDGQKVYRTIANTDVAAWPNGKVYLGVTK -------------------------------2222-------------1111-------- VDFSQYDDQKNETSTQTYDSKRNNGHVSAQDSIDQLPPETTDEPLEKAYSHQLNYAECFL ------------------------------3333---------3333------------- MQDRRGTIPFFTWTHRSVDFFNTIDAEKITQLPVVKAYALSSGASIIEGPGFTGGNLLFL -%%%%---------33331111-----------1111---1111---------------- KESSNSIAKFKVTLNSAALLQRYRVRIRYASTTNLRLFVQNSNNDFLVIYINKTMNKDDD --------------3333-------------------------------------1111- LTYQTFDLATTNSNMGFSGDKNELIIGAESFVSNEKIYIDKIEFIPVQL -3333-------------------------------------------- >ETS-RELATED PROTEIN TEL1; SWP:P41212; PDB:1JI7A; SIRLPAHLRLQPIYWSRDDVAQWLKWAENEFSLRPIDSNTFENGKALLLLTKEDFRYRSP ----1111--3333--------------1111----1111--33331111---------- HSGDELYELLQHILKQ ---------------- >DISSIMILATORY SIROHEME-SU; SWP:Q8ZUX1; PDB:1JI8A; MPVKCPGEYQVDGKKVILDEDCFMQNPEDWDEKVAEWLARELEGIQKMTEEHWKLVKYLR ------------------2222---3333-3333-------------------------- EYWETFGTCPPIKMVTKETGFSLEKIYQLFPSGPAHGACKVAGAPKPTGCV -1111--------3333---------------------------------- >METALLOTHIONEIN-III; SWP:P28184; PDB:1JI9A; KSCCSCCPAGCEKCAKDCVCKGEEGAKAEAEKCSCCQ -----------1111--1111-----3333------- >PHOSPHOLIPASE A2; SWP:O42187; PDB:1JIAA; HLLQFRKMIKKMTGKEPVVSYAFYGCYCGSGGRGKPKDATDRCCFVHDCCYEKVTGCDPK 3333-------------3333-----------------------------1111---333 
WDDYTYSWKNGTIVCGGDDPCKKEVCECDKAAAICFRDNLKTYKKRYMAYPDILCSSKSE 3-------iiii--------------------------3333-3333------------- KC -- >SIGNAL RECOGNITION PARTIC; SWP:P09132; PDB:1JIDA; AARSPADQDRFICIYPAYLNNKKTIAEGRRIPISKAVENPTATEIQDVCSAVGLNVFLEK ---11111111---3333-11113333----3333--------------1111------- NKMYSREWNRDVQYRGRVRVQLKQEDGSLCLVQFPSRKSVMLYAAEMIPKLKTR ---1111---3333---------1111---3333------------33333333 >DLP-2; SWP:Q8RPQ1; PDB:1JIGA; STKTNVVEVLNKQVANWNVLYVKLHNYHWYVTGPHFFTLHEKFEEFYNEAGTYIDELAER --------------------------------1111------------------------ ILALEGKPLATMKEYLATSSVNEGTSKESAEEMVQTLVNDYSALIQELKEGMEVAGEAGD -1111--------------------------------------------------1111- ATSADMLLAIHTTLEQHVWMLSAFLK -------------------------- >DNA POLYMERASE ETA; SWP:Q04049; PDB:1JIHA; MSKFTWKELIQLGSPSKAYESSLACIAHIDMNAFFAQVEQMRCGLSKEDPVVCVQWNSII ---------3333---33331111----------------1111-1111-----!!!!-- AVSYAARKYGISRMDTIQEALKKCSNLIPIHTAVFKKGEDFWQYHDGCGSWVQDPAKQIS ---1111----1111--------------------2222-----22221111-1111--- VEDHKVSLEPYRRESRKALKIFKSACDLVERASIDEVFLDLGRICFNMLMFDNEYELTGD ---------------------------------------------------------111 LKLKDALSNIREAFIGGNYDINSHLPLIPEKIKSLKFEGDVFNPEGRDLITDWDDVILAL 13333--------3333--1111-----3333----------1111-------------- GSQVCKGIRDSIKDILGYTTSCGLSSTKNVCKLASNYKKPDAQTIVKNDCLLDFLDCGKF --------------------------------3333----------3333---------- EITSFWTLGGVLGKELIDVLDLPHENSIKHIRETWPDNAGQLKEFLDAKVKQSDYDRSTS 1111-------------1111----------------3333----------11113333- NIDPLKTADLAEKLFKLSRGRYGLPLSSRPVVKSMMSNKNLRGKSCNSIVDCISWLEVFC --1111----------1111---------------------!!!!--------------- AELTSRIQDLEQEYNKIVIPRTVSISLKTKSYEVYRKSGPVAYKGINFQSHELLKVGIKF ----------------------------1111---------------------------- VTDLDIKGKNKSYYPLTKLSMTITNFDII ----------------------------- >TYROSYL-TRNA SYNTHETASE; SWP:NA; PDB:1JILA; TNVLIEDLKWRGLIYQQTDEQGIEDLLNKEQVTLYCGADPTADSLHIGHLLPFLTLRRFQ ---------------------------------------------3333---------33 
EHGHRPIVLIGGGTGMIGDPSGKSEERVLQTEEQVDKNIEGISKQMHNIFEFGTDHGAVL 33----------3333---2222----------------------1111----------- VNNRDWLGQISLISFLRDYGKHVGVNYMLGKDSIQSRLEHGISYTEFTYTILQAIDFGHL -33331111----------1111-------------3333--3333-------------- NRELNCKIQVGGSDQWGNITSGIELMRRMYGQTDAYGLTIPLVTKSDGKKFGKSESGAVW -----------3333-----------------------------1111------------ LDAEKTSPYEFYQFWINQSDEDVIKFLKYFTFLGKEEIDRLEQSKNEAPHLREAQKTLAE -1111---------11113333-------------------------3333--------- EVTKFIHGEDALNDAIRISQALF ----------------------- >Proteinase inhibitor [Pre; SWP:Q03026; PDB:1JIWI; SSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGGDTACLTRWLPSEPRA ------3333--------!!!!-------------1111------33333333------- WRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRRAT -----------1111---------2222----------------- >DNA BETA-GLUCOSYLTRANSFER; SWP:P04547; PDB:1JIXA; MKIAIINMGNNVINFKTVPSSETIYLFKVISEMGLNVDIISLKNGVYTKSFDEVDVNDYD ------------------------------1111----------1111-1111-1111-- RLIVVNSSINFFGGKPNLAILSAQKFMAKYKSKIYYLFTDIRLPFSQSWPNVKNRPWAYL --------------------------1111---------1111----33331111-3333 YTEEELLIKSPIKVISQGINLDIAKAAHKKVDNVIEFEYFPIEQYKIHMNDFQLSKPTKK -3333--------------------1111-1111------333311111111-------- TLDVIYGGSFRSGQRESKMVEFLFDTGLNIEFFGNAREKQFKNPKYPWTKAPVFTGKIPM ---------%%%%-----------------------3333--3333------------11 NMVSEKNSQAIAALIIGDKNYNDNFITLRVWETMASDAVMLIDEEFDTKHRIINDARFYV 11----3333-------1111-----3333------------33331111----3333-- NNRAELIDRVNELKHSDVLRKEMLSIQHDILNKTRAKKAEWQDAFKKAIDL ----------------------------------------------1111- >50S ribosomal protein L10; SWP:P60617; PDB:1JJ2H; KPGAMYRNSSKPAYTRREYISGIPGKKIAQFDMGNNGAGPTYPAQVELVVEKPVQIRHNA -3333----------3333---------------3333---------------------- LEAARVAANRYVQNSGAAANYKFRIRKFPFHVIRENKDGMRAPFGKPVGTAARVHGANHI --------3333------------------------------------------2222-- FIAWVNPDPNVEEAWRRAKMKVTPTINIDSSPAGNA ----------------1111---------------- >50S ribosomal protein L15; SWP:P60618; PDB:1JJ2L; 
ARSAYSYIREAWKRPKEGQIAELMWHRMQEWRNEPAVVRIERPTRLDRARSLGYKAKQGI --3333-------11113333--------3333----------------1111---2222 IVVRVAIRKGSSRRTRFNKGRRSKRMMVNRITRKKNIQRIAEERANRKFPNLRVLNSYSV ---------------------3333--1111----3333---------1111-------- GEDGRHKWHEVILIDPDHPAIKSDDQLSWISRTRHRLRTFRGLTSAGRRCRGLRGQGKGS --------------1111--1111--3333-3333-3333--------1111----2222 EKVRPSLRVNGAKA -----3333----- >50S ribosomal protein L37; SWP:P60619; PDB:1JJ2Y; RTGRFGPRYGLKIRVRVADVEIKHKKKHKCPVCGFKKLKRAGTGIWMCGHCGYKIAGGCY --1111-----------------------------------2222--------------- QPETVAGKAVMKA -------3333-- >PEPTIDE TRANSPORTER TAP1; SWP:Q03518; PDB:1JJ7A; GLLTPLHLEGLVQFQDVSFAYPNRPDVLVLQGLTFTLRPGEVTALVGPNGSGKSTVAALL --------------------1111-------------2222------2222-------11 QNLYQPTGGQLLLDGKPLPQYEHRYLHRQVAAVGQEPQVFGRSLQENIAYGLTQKPTMEE 11----------iiii3333---------------------------------------- ITAAAVKSGAHSFISGLPQGYDTEVDEAGSQLSGGQRQAVALARALIRKPCVLILDDATS ----------------2222--------3333------------3333----------11 ALDANSQLQVEQLLYESPERYSRSVLLITQHLSLVEQADHILFLEGGAIREGGTHQQLME 113333----------3333--------------1111------iiii------------ KKGCYWAMVQA ----------- >METALLOTHIONEIN; SWP:P30331; PDB:1JJDA; TLVKCACEPCLCNVDPSKAIDRNGLYYCSEACADGHTGGSKGCGHTGCNCHG ------1111----1111---------------------------------- >ENDO-1,4-BETA-XYLANASE Z; SWP:P10478; PDB:1JJFA; SLPTMPPSGYDQVRNGVPRGQVVNISYFSTATNSTRPARVYLPPGYSKDKKYSVLYLLHG ------2222---2222-------------------------22221111---------2 IGGSENDWFEGGGRANVIADNLIAEGKIKPLIIVTPNTNAAGPGIADGYENFTKDLLNSL 222-------------------1111---------------2222-3333---------- IPYIESNYSVYTDREHRAIAGLSMGGGQSFNIGLTNLDKFAYIGPISAAPNTYPNERLFP ------------3333----------------1111------------1111-3333-11 DGGKAAREKLKLLFIACGTNDSLIGFGQRVHEYCVANNINHVYWLIQGGGHDFNVWKPGL 11---------------1111-------------1111-------------1111----- WNFLQMADEAGLTRD ----------1111- >M156R; SWP:Q9Q8E9; PDB:1JJGA; MTVIKPSSRPRPRKNKNIKVNTYRTSAMDLSPGSVHEGIVYFKDGIFKVRLLGYEGHECI 
------------------------------------------------------------ LLDYLNYRQDTLDRLKERLVGRVIKTRVVRADGLYVDLRRFF -%%%%-------3333-------------------------- >CARBOXYLESTERASE; SWP:O28558; PDB:1JJIA; MLDMPIDPVYYQLAEYFDSLPKFDQFSSAREYREAINRIYEERNRQLSQHERVERVEDRT ----------------1111-1111---------------------3333---------- IKGRNGDIRVRVYQQKPDSPVLVYYHGGGFVICSIESHDALCRRIARLSNSTVVSVDYRL --1111---------------------iiii--3333----------------------- APEHKFPAAVYDCYDATKWVAENAEELRIDPSKIFVGGDSAGGNLAAAVSIMARDSGEDF -----------------------3333--1111--------------------1111--- IKHQILIYPVVNFVAPTPSLLEFGEGLWILDQKIMSWFSEQYFSREEDKFNPLASVIFAD --------------------------------------------3333--1111-1111- LENLPPALIITAEYDPLRDEGEVFGQMLRRAGVEASIVRYRGVLHGFINYYPVLKAARDA 2222----------1111-------------------------22221111--------- INQIAALLVFD ----------- >NEUROSERPIN; SWP:O35684; PDB:1JJOA; TITEWSVNMYNHLRGTGEDENILFSPLSIALAMGMMELGA -------33331111-------------------1111-- >Neuroserpin [Precursor]; SWP:O35684; PDB:1JJOC; ENQYVMKLANSLFVQNGFHVNEEFLQMLKMYFNAEVNHVDFSQNVAVANSINKWVENYTN --------------3333---------------------3333---------------%% SLLKDLVSPEDFDV %%-----3333--- --------------------------------- >THYROID AUTOANTIGEN; SWP:P12956; PDB:1JJRA; KVEYSEEELKTHISKGTLGKFTVPMLKEACRAYGLKSGLKKQELLEALTKHFQD ----------------1111-3333---------------3333---------- >CREB-BINDING PROTEIN; SWP:P45481; PDB:1JJSA; ALQDLLRTLKSPSSPQVLNILKSNPQLMAAFIKQRTAKYVAN --3333--------------------3333-1111------- >IMP-1 METALLO BETA-LACTAM; SWP:P52699; PDB:1JJTA; SLPDLKIEKLDEGVYVHTSFEEVNGWGVVPKHGLVVLVNAEAYLIDTPFTAKDTEKLVTW ----------2222--------!!!!-----------!!!!------------------- FVERGYKIKGSISSHFHSDSTGGIEWLNSRSIPTYASELTNELLKKDGKVQATNSFSGVN 3333------------11111111---1111-------------1111------------ YWLVKNKIEVFYPGPGHTPDNVVVWLPERKILFGGCFIKPYGLGNLGDANIEAWPKSAKL ---2222-------------------1111---!!!!--------1111-1111------ LKSKYGKAKLVVPSHSEVGDASLLKLTLEQAVKGLNESKK ----1111-----------3333-------------3333 >DEPHOSPHO-COA KINASE; SWP:P44920; 
PDB:1JJVA; MTYIVGLTGGIGSGKTTIANLFTDLGVPLVDADVVAREVVAKDSPLLSKIVEHFGAQILN ---------2222---------1111-----------1111-------------3333-- RAALRERVFNHDEDKLWLNNLLHPAIRERMKQKLAEQTAPYTLFVVPLLIENKLTALCDR ---------------------------------1111----------3333--3333--- ILVVDVSPQTQLARSANFEQIQRIMNSQVSQQERLKWADDVINNDAELAQNLPHLQQKVL ------------------------1111------------------3333---------- ELHQFYLQQAENKN -------------- >ribonucleoside-diphosphat; SWP:P09938; PDB:1JK0A; LNKELETLREENRVKSDMLKEKLSKDAENHKAYLKSHQVHRHKLKEMEKEEPLLNEDKER ---------------------------------------------3333-3333--3333 TVLFPIKYHEIWQAYKRAEASFWTAEEIDLSKDIHDWNNRMNENERFFISRVLAFFAASD -------3333------1111--3333--1111--------3333--------------- GIVNENLVENFSTEVQIPEAKSFYGFQIMIENIHSETYSLLIDTYIKDPKESEFLFNAIH 1111------3333---------------------------------3333-----1111 TIPEIGEKAEWALRWIQDADALFGERLVAFASIEGVFFSGSFASIFWLKKRGMMPGLTFS -3333------------------------------------------------3333--- NELICRDEGLHTDFACLLFAHLKNKPDPAIVEKIVTEAVEIEQRYFLDALPVALLGMNAD ------------------1111----3333------------3333----3333---333 LMNQYVEFVADRLLVAFGNKKYYKVENPFDFMEN 3---------------------------3333-- >Ribonucleoside-diphosphat; SWP:P49723; PDB:1JK0B; FQKERHDMKEAEKDEILLMENSRRFVMFPIKYHEIWAAYKKVEASFWTAEEIELAKDTED -3333-----33333333------------------------1111-3333-----3333 FQKLTDDQKTYIGNLLALSILIENFSAQLQNPEGKSFYGFQIMMENIYSEVYSMMVDAFF 11111111------------33333333-------------------------------- KDPKNIPLFKEIANLPEVKHKAAFIERWISNDDSLYAERLVAFAAKEGIFQAGNYASMFW ---33333333----3333----1111-----------------------3333------ LTDKKIMPGLAMANRNICRDRGAYTDFSCLLFAHLRTKPNPKIIEKIITEAVEIEKEYYS --------3333-------------------1111------------------------- NSLPHTYIEFVADGLLQGFGNEKYY -----3333-------1111----- >NEUROPHYSIN 2; SWP:P01180; PDB:1JK4A; LRQCLPCGPGGKGRCFGPSICCGDELGCFVGTAEALRCQEENYLPSPCQSGQKPCGSGGR --------%%%%-------------------333333333333----------------- CAAAGICCNDESCVTEPEC --2222---------1111 >SERINE/THREONINE PROTEIN ; SWP:P36873; PDB:1JK7A; 
KLNIDSIIQRLLEVRGSKPGKNVQLQENEIRGLCLKSREIFLSQPILLELEAPLKICGDI --3333-----1111--2222-------------------3333---------------- HGQYYDLLRLFEYGGFPPESNYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFLLRGNH ------------------------------------------------1111-----111 ECASINRIYGFYDECKRRYNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDLQSMEQ 13333-------------------------1111-----%%%%--------1111-3333 IRRIMRPTDVPDQGLLCDLLWSDPDKDVLGWGENDRGVSFTFGAEVVAKFLHKHDLDLIC 1111-----------------------------3333----------------------- RAHQVVEDGYEFFAKRQLVTLFSAPNYCGEFDNAGAMMSVDETLMCSFQILKPA -----1111----%%%%--------2222-----------1111---------- -------- >HLA class II histocompati; SWP:P01920; PDB:1JK8B; SPEDFVYQFKGMCYFTNGTERVRLVTRYIYNREEYARFDSDVGVYRAVTPLGPPAAEYWN --------------------------------------3333----------3333---- SQKEVLERTRAELDTVCRHNYQLELRTTLQRRVEPTVTISPSRTEALNHHNLLVCSVTDF --------------------------3333------------------------------ YPAQIKVRWFRNDQEETTGVVSTPLIRNGDWTFQILVMLEMTPQRGDVYTCHVEHPSLQN ----------!!!!----------------------------------------1111-- PIIVEWRAQS ---------- >D-TYR-TRNATYR DEACYLASE; SWP:P32147; PDB:1JKEA; MIALIQRVTRASVTVEGEVTGEIGAGLLVLLGVEKDDDEQKANRLCERVLGYRIFSDAEG --------------iiii---------------1111----------------------- KMNLNVQQAGGSVLVVSQFTLAADTERGMRPSFSKGASPDRAEALYDYFVERCRQQEMNT ----3333----------3333-------------------------------1111--- QTGRFAADMQVSLVNDGPVTFWLQV ------------------------- >MYO-INOSITOL-1-PHOSPHATE ; SWP:P11986; PDB:1JKFA; SVKVVTDKCTYKDNELLTKYSYENAVVTKTASGRFDVTPTVQDYVFKLDLKKPEKLGIML ------------------------------------------------------------ IGLGGNNGSTLVASVLANKHNVEFQTKEGVKQPNYFGSMTQCSTLKLGIDAEGNDVYAPF -1111------------1111----1111-----22223333-------1111-----11 NSLLPMVSPNDFVVSGWDINNADLYEAMQRSQVLEYDLQQRLKAKMSLVKPLPSIYYPDF 11-----3333--------------------------------1111---------3333 IAANQDERANNCINLDEKGNVTTRGKWTHLQRIRRDIQNFKEENALDKVIVLWTANTERY ------------------------------------------------------------ VEVSPGVNDTMENLLQSIKNDHEEIAPSTIFAAASILEGVPYINGSPQNTFVPGLVQLAE 
---------------------11113333------------------------------- HEGTFIAGDDLKSGQTKLKSVLAQFLVDAGIKPVSIASYNHGDSKVAMDEYYSELMLGGH --------------------------1111-------------------------iiii- NRISIHNVCEDSLLATPLIIDLLVMTEFCTRVSYKKVDPVKEDAGKFENFYPVLTFLSYW ----------------------------1111---------------------3333111 LKAPLTRPGFHPVNGLNKQRTALENFLRLLIGLPSQNELRFEERLL 1---------------------------1111-------1111--- >P15; SWP:Q9UKK6; PDB:1JKGA; ASVDFKTYVDQACRAAEEFVNVYYTTMDKRRRLLSRLYMGTATLVWNGNAVSGQESLSEF ---3333----------------------3333-11111111---iiii----------- FEMLPSSEFQISVVDCQPVHDEATPSQTTVLVVICGSVKFEGNKQRDFNQNFILTAQASP -------------------33332222------------2222----------------- SNTVWKIASDCFRFQDWAS ---------------3333 >BREFELDIN A ESTERASE; SWP:O68884; PDB:1JKMA; PGRLGDESSGPRTDPRFSPAMVEALATFGLDAVAAAPPVSASDDLPTVLAAVGASHDGFQ !!!!-111133331111--------1111----------1111----------------- AVYDSIALDLPTDRDDVETSTETILGVDGNEITLHVFRPAGVEGVLPGLVYTHGGGMTIL ---------1111------------1111---------2222------------%%%%-- TTDNRVHRRWCTDLAAAGSVVVMVDFRNAWTAEGHHPFPSGVEDCLAAVLWVDEHRESLG --------------1111--------------------------------------3333 LSGVVVQGESGGGNLAIATTLLAKRRGRLDAIDGVYASIPYISGGYAWDHERRLTELPSL -----------------------11113333-------------1111--------3333 VENDGYFIENGGMALLVRAYDPTGEHAEDPIAWPYFASEDELRGLPPFVVAVNELDPLRD 1111----------------1111----3333-111133332222----------1111- EGIAFARRLARAGVDVAARVNIGLVHGADVIFRHWLPAALESTVRDVAGFAADRARLR ------------------------2222---3333----------------------- >DNA-invertase hin; SWP:P03013; PDB:1JKOC; GRPRAINKHEQEQISRLLEKGHPRQQLAIIFGIGVSTLYRYFPASS -----------------1111-3333-------------------- >DEATH-ASSOCIATED PROTEIN ; SWP:P53355; PDB:1JKSA; TVFRQENVDDYYDTGEELGSGQFAVVKKCREKSTGLQYAAKFIKKRRTKSSRRGVSREDI ------3333-------------------------------------1111--------- EREVSILKEIQHPNVITLHEVYENKTDVILILELVAGGELFDFLAEKESLTEEEATEFLK ------3333-1111--------1111--------------------------------- QILNGVYYLHSLQIAHFDLKPENIMLLDRNVPKPRIKIIDFGLAHKIDFGNEFKNIFGTP 
-------------------3333-----------------1111--------------33 EFVAPEIVNYEPLGLEADMWSIGVITYILLSGASPFLGDTKQETLANVSAVNYEFEDEYF 33-3333------3333----------------1111------------------33331 SNTSALAKDFIRRLLVKDPKKRMTIQDSLQHPWIKPPQFE 111-------1111---3333--33331111--------- >PHOSPHORIBOSYLGLYCINAMIDE; SWP:P08179; PDB:1JKXA; MNIVVLISGNGSNLQAIIDACKTNKIKGTVRAVFSNKADAFGLERARQAGIATHTLIASA ------------------------------------1111------1111------3333 FDSREAYDRELIHEIDMYAPDVVVLAGFMRILSPAFVSHYAGRLLNIHPSLLPKYPGLHT ---------------1111--------------------2222----------------- HRQALENGDEEHGTSVHFVTDELDGGPVILQAKVPVFAGDSEDDITARVQTQEHAIYPLV ----1111-------------1111-----------3333-------------------- ISWFADGRLKMHENAAWLDGQRLPPQGYA -----------%%%%--iiii--1111-- >DEFENSE-RELATED PEPTIDE 1; SWP:P81929; PDB:1JKZA; KTCEHLADTYRGVCFTNASCDDHCKNKAHLISGTCHNWKCFCTQNC -----------------------------------iiii------- >S-ADENOSYLMETHIONINE DECA; SWP:P17707; PDB:1JL0A; HFFEGTEKLLEVWFSRQQGSGDLRTIPRSEWDILLKDVQCSIISVTKTDKQEAYVLSESS ---------------------1111-3333-----1111--------1111--------- MFVSKRRFILKTCGTTLLLKALVPLLKLARDYSGFDSIQSFFYSRKNFMKPSHQGYPHRN ------------!!!!3333-----------------------------3333------- FQEEIEFLNAIFPNGAGYCMGRMNSDCWYLYTLDFPVISQPDQTLEILMSELDPAVMDQF ---------------------2222----------------------------3333--- YMKDGVTAKDVTRESGIRDLIPGSVIDATMFNPCGYSMNGMKSDGTYWTIAITPEPEFSY --2222------33331111---------------------1111---------3333-- VSFETNLSQTSYDDLIRKVVEVFKPGKFVTTLFVNQSSKCQKIEGFKRLDCQSAMFNDYN ----------------------------------1111----2222-------------- FVFTSFAKKQ ---------- >RIBONUCLEASE HI; SWP:P00647; PDB:1JL1A; KQVEIFTAGSALGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELMAAIVALEALKEH -------------------------iiii------------------------1111--- AEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKNVDLWQRLDAALGQHQIKWEWVK -----------------------1111--1111-------------3333---------- GHAGHPENERADELARAAAMNPTLEDTGYQVE -2222------------1111----1111--- >CHIMERIC RNASE H; SWP:P00647; PDB:1JL2A; 
KQVEIFTDGSALGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELKAAIEGLKALKEP ------------------------------------------------------------ AEVDLYTDSHYLKKAFTEGWLEGWRKRGWRTAEGKPVKNRDLWEALLLAMAPHRVRFHFV ------------------------1111--1111---------------3333------- KGHAGHPENERADELARAAAMNPTLEDTGY ------------------1111----3333 >ARSENATE REDUCTASE; SWP:P45947; PDB:1JL3A; NKIIYFLCTGNSCRSQMAEGWAKQYLGDEWKVYSAGIEAHGLNPNAVKAMKEVGIDISNQ --------------------------------------------------1111--1111 TSDIIDSDILNNADLVVTLCGDAADKCPMTPPHVKREHWGFDDPARAQGTEEEKWAFFQR -----33331111-----------------1111--------3333-------------- VRDEIGNRLKEFAETGK ----------------- >OUTER PROTEIN YOPM; SWP:P17778; PDB:1JL5A; KSKTEYYNAWSEWERNAPPGNGEQREMAVSRLRDCLDRQAHELELNNLGLSSLPELPPHL -----------------2222--------------------------------------- ESLVASCNSLTELPELPQSLKSLLVDNNNLKALSDLPPLLEYLGVSNNQLEKLPELQNSS ----------------1111----------------1111---------------1111- FLKIIDVDNNSLKKLPDLPPSLEFIAAGNNQLEELPELQNLPFLTAIYADNNSLKKLPDL ------------------1111---------------1111------------------- PLSLESIVAGNNILEELPELQNLPFLTTIYADNNLLKTLPDLPPSLEALNVRDNYLTDLP 3333---------------1111-------------------1111-------------- ELPQSLTFLDVSENIFSGLSELPPNLYYLNASSNEIRSLCDLPPSLEELNVSNNKLIELP --1111----------------1111----------------1111-------------- ALPPRLERLIASFNHLAEVPELPQNLKQLHVEYNPLREFPDIPESVEDLRMNS --1111----------------1111----------------3333------- >INTERLEUKIN 3; SWP:P08700; PDB:1JLI; ANCSIMIDEIIHHLKRPPNPLLDPNNLNSEDMDILMERNLRTPNLLAFVRAVKHLENASA -3333-33331111-------------3333--333311113333-----3333222233 IESILKNLLPCLPLATAAPTRHPIHIKDGDWNEFRRKLTFYLKTLENAQAQQ 33------1111-----------------3333------------------- >GEPHYRIN; SWP:Q9NQX3; PDB:1JLJA; HQIRVGVLTVSDSCFRNLAEDRSGINLKDLVQDPSLLGGTISAYKIVPDEIEEIKETLID -------------1111---------------1111------------------------ WCDEKELNLILTTGGTGFAPRDVTPEATKEVIEREAPGMALAMLMGSLNVTPLGMLSRPV ------------------1111-----1111----------------------1111--- CGIRGKTLIINLPGSKKGSQECFQFILPALPHAIDLLRDAIVKVKEVHD 
---!!!!------------------3333-----1111---3333---- >PROTEIN TYROSINE PHOSPHAT; SWP:Q62132; PDB:1JLNA; GSPREKVAMEYLQSASRVLTRSQLRDVVASSHLLQSEFMEIPMNFVDPKEIDIPRHGTKN -------------------------------3333--1111-----3333--22221111 RYKTILPNPLSRVCLRPKNITDSLSTYINANYIRGYSGKEKAFIATQGPMINTVNDFWQM -1111--3333-----------3333--------2222-----------1111------- VWQEDSPVIVMITKLKEKNEKCVLYWPEKRGIYGKVEVLVTGVTECDNYTIRNLVLKQGS --------------------------------!!!!---------------------!!! HTQHVKHYWYTSWPDHKTPDSAQPLLQLMLDVEEDRLASEGRGPVVVHCSAGIGRTGCFI !-------------------------------------2222------------------ ATSIGCQQLKEEGVVDALSIVCQLRVDRGGMVQTSEQYEFVHHALCLFESRLSPETV ---------------------------2222-----------------1111----- >PHOSPHOLIPASE A2 INHIBITO; SWP:P04084; PDB:1JLTA; NLFQFGDMILQKTGKEAVHSYAIYGCYCGWGGQARAQDATDRCCFAQDCCYGRVNDCNPK ---------------33333333--------------3333---------1111----11 TATYTYSFENGDIVCGDNDLCLRAVCECDRAAAICLGENVNTYDKNYEYYSISHCTEESE 11------iiii--------------------------3333-3333-3333-------- QC -- >Phospholipase A2; SWP:P14420; PDB:1JLTB; NLFQFAKMINGKLGAFSVWNYISYGCYCGWGGQGTPKDATDRCCFVHDCCYGRVRGCNPK ---------------3333-------------------------------1111------ LAIYSYSFKKGNIVCGKNNGCLRDICECDRVAANCFHQNKNTYNKNYKFLSSSRCRQTSE --------iiii------!!!!----------------1111-1111---3333------ QC -- >GLUTATHIONE TRANSFERASE G; SWP:Q9GNE9; PDB:1JLVA; MDFYYLPGSAPCRAVQMTAAAVGVELNLKLTNLMAGEHMKPEFLKINPQHCIPTLVDNGF -----1111----------1111--------33331111-------1111------iiii ALWESRAICTYLAEKYGKDDKLYPKDPQKRAVVNQRLYFDMGTLYQRFADYYYPQIFAKQ ------------------3333-------------------------------------- PANAENEKKMKDAVDFLNTFLDGHKYVAGDSLTIADLTVLATVSTYDVAGFELAKYPHVA ------------------1111-----------------------------3333----- AWYERTRKEAPGAAINEAGIEEFRKYF ---------2222--------3333-- >GLUTATHIONE TRANSFERASE G; SWP:Q9GN60; PDB:1JLWA; MDFYYLPGSAPCRAVQMTAAAVGVELNLKLTNLMAGEHMKPEFLKLNPQHCIPTLVDEDG -----3333----------1111----------%%%%---3333--1111------1111 
FVLWESRAIQIYLVEKYGAHDADLAERLYPSDPRRRAVVHQRLFFDVAVLYQRFAEYYYP -----------------3333--------------------------------------- QIFGQKVPVGDPGRLRSMEQALEFLNTFLEGEQYVAGGDDPTIADLSILATIATYEVAGY --!!!!--------------------1111---1111----3333--------------- DLRRYENVQRWYERTSAIVPGADKNVEGAKVFGRYFT --------------11112222---------3333-- >AGGLUTININ; SWP:Q71QF2; PDB:1JLYA; AGLPVIMCLKSNNHQKYLRYQSDNIQQYGLLQFSADKILDPLAQFEVEPSKTYDGLVHIK ------------------------1111--------1111------------2222---- SRYTNKYLVRWSPNHYWITASANEPDENKSNWACTLFKPLYVEEGNMKKVRLLHVQLGHY -----------1111------------1111-----------2222--------3333-- TQNYTVGGSFVSYLFAESSQIDTGSKDVFHVIDWKSIFQFPKGYVTFKGNNGKYLGVITI ---------------------1111-------3333------------1111-------i NQLPCLQFGYDNLNDPKVAHQMFVTSNGTICIKSNYMNKFWRLSTDDWILVDGNDPRETN iii--------1111---------1111------3333-----2222-------1111-- EAAALFRSDVHDFNVISLLNMQKTWFIKRFTSGKPGFINCMNAATQNVDETAILEIIEL -----------2222------------------2222-----------1111------- >FOUR-HELIX BUNDLE MODEL; SWP:NA; PDB:1JM0A; DYLRELLKLELQAIKQYREALEYVKLPVLAKILEDEEKHIEWLETILG -------------------------3333------------------- >RIESKE IRON-SULFUR PROTEI; SWP:Q53766; PDB:1JM1A; NTDGLAGFPRYKVANIQQVQQQIKSSGCAVYFFAYPLTDEPCFLVDLQALTGQQITEIPN ----iiii------3333------------------1111-------3333--------1 PYYGKYAGPLGQIQTIKGVGPNGTIFAFSDVCVHLGCQLPAQVIVSSESDPGLYAKGADL 111-----1111-------1111----------------1111---3333----1111-- HCPCHGSIYALKDGGVVVSGPAPRPLPIVILDYDSSTGDIYAVGTNAPYFSAGIPRTTPQ ----------1111-------------------------------------------333 DNLLYDPRYSYSVPNNPSCSNG 3----3333------------- >Histone acetyltransferase; SWP:Q92831; PDB:1JM4B; GSHMSKEPRDPDQLYSTLKSILQQVKSHQSAWPFMEPVKRTEAPGYYEVIRFPMDLKTMS ---------3333-----------------3333--------------------3333-- ERLKNRYYVSKKLFMADLQRVFTNCKEYNPPESEYYKCANILEKFFFSKIKEAGLIDK --1111----------------------------3333-------------------- >PYRUVATE DEHYDROGENASE KI; SWP:Q64536; PDB:1JM6A; ASLAGAPKYIEHFSKFSPSPLSMKQFLDFGACEKTSFTFLRQELPVRLANIMKEINLLPD 
--------------------------------33333333-------------1111-11 RVLSTPSVQLVQSWYVQSLLDIMEFLDKDPEDHRTLSQFTDALVTIRNRHNDVVPTMAQG 11------------------3333----1111---------------1111-----3333 VLEYDPVSNQNIQYFLDRFYLSRISIRMLINQHTLIFDPKHIGSIDPNCSVSDVVKDAYD -----------------------------------------!!!!--------------- MAKLLCDKYYMASPDLEIQEVNATNATQPIHMVYVPSHLYHMLFELFKNAMRATVESHES ---------------------3333---------3333----------------1111-- SLTLPPIKIMVALGEEDLSIKMSDRGGGVPLRKIERLFSYMYSTAPTPAGFGYGLPISRL -----------------------------33333333-1111------------------ YAKYFQGDLQLFSMEGFGTDAVIYLKALSTDSVERLPVY -------------2222----------3333-------- >BREAST CANCER TYPE 1 SUSC; SWP:P38398; PDB:1JM7A; MDLSALRVEEVQNVINAMQKILECPICLELIKEPVSTKCDHIFCKFCMLKLLNQKKGPSQ -%%%%--3333------3333---------------1111----3333------------ CPLCKNDITKRSLQESTRFSQLVEELLKIICAFQLDTGLEYAN 3333--------------------------------------- >BRCA1-associated RING dom; SWP:Q99728; PDB:1JM7B; MEPDGRGAWAHSRAALDRLEKLLRCSRCTNILREPVCLGGCEHIFCSNCVSDCIGTGCPV -------------------1111-------------------------3333-------- CYTPAWIQDLKINRQLDSMIQLCSKLRNLLHDNELSD ------------------------------------- >Tumor necrosis factor rec; SWP:Q92956; PDB:1JMAB; CKEDEYPVGSECCPKCSPGYRVKEACGELTGTVCEPCPPGTYIAHLNGLSKCLQCQMCDP -1111--!!!!-----2222------1111-------2222------------------- AMGLRASRNCSRTENAVCGCSPGHFCIVQDHCAACRAYAT ----------1111------2222---------------- >REPLICATION PROTEIN A; SWP:P27694; PDB:1JMCA; KVVPIASLTPYQSKWTICARVTNKSQIRTWSNSRGEGKLFSLELVDESGEIRATAFNEQV ---3333-1111---------------------------------3333----------- DKFFPLIEVNKVYYFSKGTLKIANKQFTAVKNDYEMTFNNETSVMPCEDDHHLPTVQFDF --1111-2222------------3333-----------3333------------------ TGIDDLENKSKDSLVDIIGICKSYEDATKITVRSNNREVAKRNIYLMDTSGKVVTATLWG -111111112222----------------------------------1111--------- EDADKFDGSRQPVLAIKGARVSDFGGRSLSVLSSSTIIANPDIPEAYKLRGWFDAEGQ -------2222-----------2222-----3333----------------------- >HEPARIN COFACTOR II; SWP:P05546; PDB:1JMJA; 
LDLEKIFSEDDLQLFHGKSRIQRLNILNAKFAFNLYRVLKDQVNTFDNIFIAPVGISTAM ----3333----3333-----------------------11111111----3333----- GMISLGLKGETHEQVHSILHFKDFVNASSKYEITTIHNLFRKLTHRLFRRNFGYTLRSVN ---1111---------1111-------11113333------------------------- DLYIQKQFPILLDFKTKVREYYFAEAQIADFSDPAFISKTNNHIMKLTKGLIKDALENID ----3333---------------------1111--------------iiii--------1 PATQMMILNCIYFKGSWVNKFPVEMTHNHNFRLNEREVVKVSMMQTKGNFLAANDQELDC 111------------------1111------------------------------1111- DILQLEYVGGISMLIVVPHKMSGMKTLEAQLTPRVVERWQKSMTNRTREVLLPKFKLEKN -------------------1111----------------1111----------------- YNLVESLKLMGIRMLFDKNGNMAGISDQRIAIDLFKHQGTITVNEEGTQATTVTTVGFMP ------------33331111-1111------------------3333------------- LSTQVRFTVDRPFLFLIYEHRTSCLLFMGRVANPSRS -------------------1111---------1111- >Surfactin synthetase subu; SWP:Q08787; PDB:1JMKC; GGSDGLQDVTIMNQDQEQIIFAFPPVLGYGLMYQNLSSRLPSYKLCAFDFIEEEDRLDRY ------------1111--------3333------3333-1111----------------- ADLIQKLQPEGPLTLFGYSAGCSLAFEAAKKLEGQGRIVQRIIMVDSYKKQGVSSDVEAL --------------------------------1111------------------------ MNVNRDNEALNSEAVKHGLKQKTHAFYSYYVNLISTGQVKADIDLLTSGADFDIPEWLAS ------3333-33331111-----------------------------------1111-- WEEATTGAYRMKRGFGTHAEMLQGETLDRNAGILLEFLNTQT 1111------------1111---3333---------1111-- >PROTEIN L; SWP:Q51912; PDB:1JMLA; GMEEVTIKANLIFANGSTQTAEFKGTFEKATSEAYAYADTLKKDNGEWTVDVVPKAYTLN ------------1111---------3333------------------------------- IKFAG ----- >PROTEIN I/II V-REGION; SWP:P11657; PDB:1JMMA; QKDLADYPVKLKAYEDEQASIKAALAELEKHKNEDGNLTEPSAQNLVYDLEPNANLSLTT -3333-------------------------1111----------------1111------ DGKFLKASAVDDAFSKSTSKAKYVQKILQLDDLDITNLEQSNDVASSELYGNFGDKAGWS ----------------1111--------1111-3333--1111------------3333- TTVSNNSQVKWGSVLLERGQSATATYTNLQNSYYNGKKISKIVYKYTVDPKSKFQGQKVW ---!!!!---------2222-------------iiii-----------1111-------- LGIFTDPTLGVFASAYTGQVEKNTSIFIKNEFTFYDEDGKPINFDNALLSVASLNREHNS 
-----1111--------------------------1111-----------------1111 IEAKDYSGKFVKISGSSIGEKNGIYATDTLNFKQGEGGSRWTYKNSQAGSGWDSSDAPNS ------------2222----------------2222----------2222---1111--- WYGAGAIKSGPNNHVTVGATSATNVPVSDPVVPGKDNTDGKKPNIWYSLNGKIRAVNVPK 1111----------------3333-3333--22221111--------------------- VTKEKPTPPV ---------- >VISCOTOXIN A2; SWP:P32880; PDB:1JMNA; KSCCPNTTGRNIYNTCRFGGGSRQVCASLSGCKIISASTCPSDYPK -----3333-------3333-------------------------- >THROMBIN, LIGHT CHAIN; SWP:P05546; PDB:1JMOA; GEEDDDLDLEKIFSEDDDIDIVDSLSVSPTDSDVSAGNILQLFHGKSRIQRLNILNAKFA ------------1111------------------------3333---------------- FNLYRVLKDQVNTFDNIFIAPVGISTAMGMISLGLKGETHEQVHSILHFKDFVNASSKYE -----------1111----3333--------1111---------1111-------11113 ITTIHNLFRKLTHRLFRRNFGYTLRSVNDLYIQKQFPILLDFKTKVREYYFAEAQIADFS 333-----------------------------3333---------------------111 DPAFISKTNNHIMKLTKGLIKDALENIDPATQMMILNCIYFKGSWVNKFPVEMTHNHNFR 1------------1111----1111--1111------------------3333------- LNEREVVKVSMMQTKGNFLAANDQELDCDILQLEYVGGISMLIVVPHKMSGMKTLEAQLT -----------------------1111--------------------1111----1111- PRVVERWQKSMTNRTREVLLPKFKLEKNYNLVESLKLMGIRMLFDKNGNMAGISDQRIAI -------1111------------------------1111-33331111-3333------- DLFKHQGTITVNEEGTQATTVTTVGFMPLSTQVRFTVDRPFLFLIYEHRTSCLLFMGRVA -----------------------------------------------1111--------- NPSRS 1111- >VISCOTOXIN B; SWP:P08943; PDB:1JMPA; KSCCPNTTGRNIYNTCRLGGGSRERCASLSGCKIISASTCPSDYPK ----------------1111-------------------------- ---------------------------------------------- >TERMINAL DEOXYNUCLEOTIDYL; SWP:P09838; PDB:1JMSA; KKISQYACQRRTTLNNYNQLFTDALDILAENDELRENEGSCLAFMRASSVLKSLPFPITS -----1111--------3333-----------1111--------------3333-----3 MKDTEGIPCLGDKVKSIIEGIIEDGESSEAKAVLNDERYKSFKLFTSVFGVGLKTAEKWF 3332222---3333---------------------------------2222--------1 RMGFRTLSKIQSDKSLRFTQMQKAGFLYYEDLVSCVNRPEAEAVSMLVKEAVVTFLPDAL 111-------------------------------------------------1111---- 
VTMTGGFRRGKMTGHDVDFLITSPEATEDEEQQLLHKVTDFWKQQGLLLYCDILESTFEK ----3333--------------11113333----------------------------11 FKQPSRKVDALDHFQKCFLILKLDHGRVHSEKSEGKGWKAIRVDLVMCPYDRRAFALLGW 11---------------------3333---------------------1111-------- TGSRQFERDLRRYATHERKMMLDNHALYDRTKRVFLEAESEEEIFAHLGLDYIEPWERNA ---------------------------------------------1111----1111--- >SPLICING FACTOR U2AF 35 K; SWP:Q01081; PDB:1JMTA; SQTIALLNIYRNPQDGLRSAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNVCDNLGDH ------------------------------------------------------------ LVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSP --------------------------iiii-------- >PROTEIN MU-1; SWP:Q8V5E4; PDB:1JMUA; TINVTGDGNVFKPSAETSSTAVPSLSLSPGMLN --1111-------3333----------3333-- >UNIVERSAL STRESS PROTEIN ; SWP:P44880; PDB:1JMVA; MYKHILVAVDLSEESPILLKKAVGIAKRHDAKLSIIHVDVNFSDLYTGLIDVNMSSMQDR -----------1111--------------------------3333----------1111- ISTETQKALLDLAESVDYPISEKLSGSGDLGQVLSDAIEQYDVDLLVTGHHQDFWSKLMS ------------3333-------------------------------------------- STRQVMNTIKIDMLVVPLRD -----1111----------- >METHYL-ACCEPTING CHEMOTAX; SWP:P02941; PDB:1JMWA; MNQQGFVISNELRQQQSELTSTWDLMLQTRINLARSAARMMMDASNQQSSAKTDLLQNAK ------------------------------------------------------------ TTLAQAAAHYANFKNMTPLPAMAEASANVDEKYQRYQAALAELIQFLDNGNMDAYFAQPT -----------3333---1111-3333------------------------3333----- QGMQNALGEALGNYARVSENLYRQTF -----------------3333----- >AMINE DEHYDROGENASE; SWP:Q8VW85; PDB:1JMXA; EQGPSLLQNKCMGCHIPEGNDTYSRISHQRKTPEGWLMSIARMQVMHGLQISDDDRRTLV ------------------------1111-------------------------------- KYLADKQGLAPSETDGVRYAMERRLNTVEQFDTQLSETCGRCHSGARVALQRRPAKEWEH ---------33332222------1111----------------33331111--3333--- LVNFHLGQWPSLEYQAQARDRDWLPIALQQVVPDLAKRYPLESAAWAEWQKARPKADALP --------1111-----1111---------------------------------3333-- GQWAFSGHMLAKGDVRGVMSVTPDQGDTFKVEVKGAYADGTPFNGSGSAILYNGYEWRGN --------2222------------!!!!--------1111-------------------- 
VKVGDANLRQVFAALDGEMKGRMFEAEHDERGLDFTAVKEGKARLLAVQPAFIKAGGESE --!!!!--------%%%%------1111-------------------------------- ITLVGSGLAGKPDLGAGVEVTEVLEQTPTLVRLKARAAADAKPGQREVAVGTLKGVNLAV --------------2222-------------------1111--------!!!!------- YDKVEEVKVVPAFSIARIGENGASVPKVQGRFEAEAWGKDANGQPLRIGYLPASWKVEPF -------------------iiii----------------1111----------------- NERAVEDEDVKFAGKMQADGVFVPGGAGPNPERKMMTNNAGNLKVIATLADGGQTGEGHM 3333---3333-----1111---------3333%%%%----------------------- IVTVQRWNNPPLP ------------- >Quinohemoprotein amine de; SWP:Q8VW82; PDB:1JMXB; GPALKAGHEYMIVTNYPNNLHVVDVASDTVYKSCVMPDKFGPGTAMMAPDNRTAYVLNNH ----2222----------------1111-------------------1111--------- YGDIYGIDLDTCKNTFHANLSSVPGEVGRSMYSFAISPDGKEVYATVNPTQRLNDHYVVK ----------------------2222----------1111-------------------- PPRLEVFSTADGLEAKPVRTFPMPRQVYLMRAADDGSLYVAGPDIYKMDVKTGKYTVALP ----------!!!!------------------1111------------------------ LRNWNRKGYSAPDVLYFWPHQSPRHEFSMLYTIARFATADLLYGYLSVDLKTGKTHTQEF 1111-2222------------1111----------------------------------- ADLTELYFTGLRSPKDPNQIYGVLNRLAKYDLKQRKLIKAANLDHTYYCVAFDKKGDKLY ------------3333---------------1111-----------------1111---- LGGTFNDLAVFNPDTLEKVKNIKLPGGDMSTTTPQVFIR ----------------------------!!!!------- >Quinohemoprotein amine de; SWP:P0A182; PDB:1JMXG; AVAGCTATTDPGWEVDAFGGVSSLCQPMEADLYGCSDPCWPAQVPDMMSTYQDWNAQASN -1111----------1111-1111--3333--3333----------------1111---3 SAEDWRNLGTVFPKDK 3333333--------- >2C-METHYL-D-ERYTHRITOL 2,; SWP:P44815; PDB:1JN1A; MIRIGHGFDVHAFGEDRPLIIGGVEVPYHTGFIAHSDGDVALHALTDAILGAAALGDIGK ------------------------------333333333333--------1111--1111 LFPDTDMQYKNADSRGLLREAFRQVQEKGYKIGNVDITIIAQAPKMRPHIDAMRAKIAED ----1111----3333-------3333------------------3333----------- LQCDIEQVNVKATTTEKLGFTGRQEGIACEAVALLIR ----1111------iiii3333--------------- >monoclonal anti-estradiol; SWP:NA; PDB:1JN6B; EVQLQQSGAELARPGASVKLSCRTSGYSFTTYWMQWVRQRPGQGLEWIAAIYPGDDDARY 
---------------------------1111----------------------------- TQKFKGKATLTADRSSSIVYLQLNSLTSEDSAVYSCSRGRSLYYTMDYWGQGTSVTVTTP 3333----------------------1111----------1111---------------- PSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVSWNTGSLSSGVHTFPAVLQSDLYTLS -----------------------------------%%%%-------------%%%%---- SSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVP ------3333----------3333---------- >IMAGINAL DISC GROWTH FACT; SWP:Q9V3D4; PDB:1JNDA; SNLVCYYDSSSYTREGLGKLLNPDLEIALQFCSHLVYGYAGLRGENLQAYSMNENLDIYK ---------3333--1111--------3333--------------------------111 HQFSEVTSLKRKYPHLKVLLSVGGDHDIDPDHPNKYIDLLEGEKVRQIGFIRSAYELVKT 1-------33331111------%%%%--1111--------------------------11 YGFDGLDLAYQFPKNKPRKVIVDPHAALHKEQFTALVRDVKDSLRADGFLLSLTVLPNVN 11------------------------------------------1111-------22223 STWYFDIPALNGLVDFVNLATFDFLTPARNPEEADYSAPIYHPDGSKDRLAHLNADFQVE 333--33331111------------33331111---------1111---1111------- YWLSQGFPSNKINLGVATYGNAWKLTKDSGLEGVPVVPETSGPAPEGFQSQKPGLLSYAE --1111-1111--------------3333----------------------2222----- ICGKLSNPQNQFLKGNESPLRRVSDPTKRFGGIAYRPVDGQITEGIWVSYDDPDSASNKA -3333-3333---!!!!-------3333----------!!!!------------------ AYARVKNLGGVALFDLSYDDFRGQCSGDKYPILRAIKYRL ---1111-------3333-1111-------------1111 >DIHEME CYTOCHROME C NAPB; SWP:P44654; PDB:1JNIA; NQPPMVPHSVANYQVTKNVNQCLNCHSPENSRLSGATRISPTHFMDRDGKVSPRRYFCLQ ---------1111--1111-------1111-1111----3333--1111-------1111 CHVS ---- >monoclonal anti-estradiol; SWP:NA; PDB:1JNLH; EVQLQQSGAELVKPGASVRLSCSASGFNIKDTYMFWVKQRPEQGLDWIGRINPANGISKY ---------------------------3333--------2222----------------- DPRFQGKATLTADTSSNTAYLQLDNLTSEDTAVYYCAIEKDLPWGQGTLVTVSVAKTTPP 3333----------------------3333------------------------------ SVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSS -----------------------------------iiii--------------------- SVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC ------------------------------------ >PROTO-ONCOGENE C-JUN; SWP:P05412; PDB:1JNMA; 
KAERKRMRNRIAASKSRKRKLERIARLEEKVKTLKAQNSELASTANMLREQVAQLK --3333-------------------------------------------------- >T-CELL LEUKEMIA/LYMPHOMA ; SWP:P56280; PDB:1JNPA; RAETPAHPNRLWIWEKHVYLDEFRRSWLPVVIKSNEKFQVILRQEDVTLGEAMSPSQLVP --------------3333--1111-----------------------------3333--- YELPLMWQLYPKDRYRSADSMYWQILYHIKFRDVEDMLLEL ---------1111---------------------------- >ADENYLYLSULFATE REDUCTASE; SWP:O28603; PDB:1JNRA; VYYPKKYELYKADEVPTEVVETDILIIGGGFSGCGAAYEAAYWAKLGGLKVTLVEKAAVE ----------1111-------------------------------------------333 RSGAVAQGLSAINTYIDLTGRSERQNTLEDYVRYVTLDMMGLAREDLVADYARHVDGTVH 3-1111--------------------------------iiii------------------ LFEKWGLPIWKTPDGKYVREGQWQIMIHGESYKPIIAEAAKMAVGEENIYERVFIFELLK -----------1111-------------1111------------1111------------ DNNDPNAVAGAVGFSVREPKFYVFKAKAVILATGGATLLFRPRSTGEAAGRTWYAIFDTG 1111-----------------------------------------1111-----3333-- SGYYMGLKAGAMLTQFEHRFIPFRFKDGYGPVGAWFLFFKCKAKNAYGEEYIKTRAAELE 3333----------1111--------------------------1111-3333-333333 KYKPYGAAQPIPTPLRNHQVMLEIMDGNQPIYMHTEEALAELAGGDKKKLKHIYEEAFED 33-1111----------------1111---------------iiii-------------- FLDMTVSQALLWACQNIDPQEQPSEAAPAEPYIMGSHSGEAGFWVCGPEDLMPEEYAKLF 1111-------------1111--------------------------1111-33331111 PLKYNRMTTVKGLFAIGDCAGANPHKFSSGSFTEGRIAAKAAVRFILEQKPNPEIDDAVV ---2222--2222--!!!!----------------------------------------- EELKKKAYAPMERFMQYKDLSTADDVNPEYILPWQGLVRLQKIMDEYAAGIATIYKTNEK ----------------1111--1111-----------------------3333------- MLQRALELLAFLKEDLEKLAARDLHELMRAWELVHRVWTAEAHVRHMLFRKETRWPGYYY ------------------------------------------------------------ RTDYPELNDEEWKCFVCSKYDAEKDEWTFEKVPYVQVIEWSF 1111---3333----------1111----------------- >Adenylylsulfate reductase; SWP:O28604; PDB:1JNRB; PSFVNPEKCDGCKALERTACEYICPNDLMTLDKEKMKAYNREPDMCWECYSCVKMCPQGA -----1111--------3333--1111--------------1111----------1111- IDVRGYVDYSPLGGACVPMRGTSDIMWTVKYRNGKVLRFKFAIRTTPWGSIQPFEGFPEP 
-----3333---------------------1111------------2222---2222--- TEEALKSELLAGEPEIIGTSEFPQVKKKA 3333-----22223333------------ >PEPTIDYL-PROLYL CIS-TRANS; SWP:P39159; PDB:1JNSA; AKTAAALHILVKEEKLALDLLEQIKNGADFGKLAKKHSICPSGKRGGDLGEFRQGQMVPA ---------------------------------------1111-1111---------333 FDKVVFSCPVLEPTGPLHTQFGYHIIKVLYRN 3-----------------3333---------- >PHY3 PROTEIN; SWP:Q9ZWQ6; PDB:1JNUA; KSFVITDPRLPDNPIIFASDRFLELTEYTREEVLGNNCRFLQGRGTDRKAVQLIRDAVKE ------1111------------------33332222------1111-------------- QRDVTVQVLNYTKGGRAFWNLFHLQVMRDENGDVQYFIGVQQEM -----------3333----------------------------- >ELONGATION FACTOR 1-ALPHA; SWP:P35021; PDB:1JNYA; KPHLNLIVIGHVDHGKSTLVGRLLMDRGFIDEKTVKEAEEAAKKLGKESEKFAFLLDRLK ----------2222--------------------------------3333---------- EEMRFETKKYFFTIIDAPGHRDFVKNMITGASQADAAILVVSAKKGEYEAGMSVEGQTRE ------1111---------2222---1111----------------------1111---- HIILAKTMGLDQLIVAVNKMDLTEPPYDEKRYKEIVDQVSKFMRSYGFNTNKVRFVPVVA ------------------3333---------------------1111--1111------1 PSGDNITHKSENMKWYNGPTLEEYLDQLELPPKPVDKPLRIPIQDVYSISGVGTVPVGRV 111---------3333--------1111----3333------------2222-------- ESGVLKVGDKIVFMPAGKVGEVRSIETHHTKMDKAEPGDNIGFNVRGVEKKDIKRGDVVG -----2222-----------------%%%%-----2222---------3333-2222--- HPNNPPTVADEFTARIIVVWHPTALANGYTPVLHVHTASVACRVSELVSKLDPRTGQEAE 3333---------------------2222-----!!!!---------------------- KNPQFLKQGDVAIVKFKPIKPLCVEKYNEFPPLGRFAMRDMGKTVGVGIIVDVKP ------2222---------------11113333------%%%%------------ >HYPOTHETICAL PROTEIN HI13; SWP:P71376; PDB:1JO0A; TTLSTKQKQFLKGLAHHLNPVVMLGGNGLTEGVLAEIENALNHHELIKVKVAGADRETKQ -------------3333-------1111-------------------------------- LIINAIVRETKAAQVQTIGHILVLYRPSEEAKIQLPR -----------------!!!!---------------- >potassium large conductan; SWP:Q9Y691; PDB:1JO6A; MFIWTSGRTSSSYRHDEKRNIYQKIRDHDLLDKRKTVTALKAGED ---------------1111-1111--3333-3333---------- >ACTIN BINDING PROTEIN; SWP:P15891; PDB:1JO8A; 
PWATAEYDYDAAEDNELTFVENDKIINIEFVDDDWWLGELEKDGSKGLFPSNYVSLGN ------------1111---2222--------1111--------------1111----- >EARLY ENDOSOMAL AUTOANTIG; SWP:Q15075; PDB:1JOCA; QDERRALLERCLKGEGEIEKLQTKVLELQRKLDNTTAAVQELGRENQSLQIKHTQALNRK ------------------------------------------------------------ WAEDNEVQNCMACGKGFSVTVRRHHCRQCGNIFCAECSAKNALTPSSKKPVRVCDACFND ---3333----------1111------------3333----------------------- LQG --- >CARBOXY-CIS,CIS-MUCONATE ; SWP:P38677; PDB:1JOFA; PLHHLIGTWTPPGAIFTVQFDDEKLTCKLIKRTEIPQDEPISWTFDHERKNIYGAAKKWS -----------------------------------1111------1111----------- SFAVKSPTEIVHEASHPIGGHPRANDADTNTRAIFLLAAKQPPYAVYANPFYKFAGYGNV -----1111-----------1111-1111------------------------------- FSVSETGKLEKNVQNYEYQENTGIHGVFDPTETYLYSADLTANKLWTHRKLASGEVELVG ---1111-----------3333------1111-------1111-------3333------ SVDAPDPGDHPRWVAHPTGNYLYALEAGNRICEYVIDPATHPVYTHHSFPLIPPGIPDRD -----1111------3333------1111-------3333------------2222---- PETGKGLYRADVCALTFSGKYFASSRANKFELQGYIAGFKLRDCGSIEKQLFLSPTPTSG ---------------3333---------1111---------1111--------------! 
GHSNAVSPCPWSDEWAITDDQEGWLEIYRWKDEFLHRVARVRIPEPGFGNAIWYD !!!-----1111------------------%%%%----------2222------- >HYPOTHETICAL PROTEIN HI00; SWP:P43934; PDB:1JOGA; NLNVLDAAFYSLEQTVVQISDRNWFDQPSIVQDTLIAGAIQKFEFVYELSLKKRQLQQDA 3333----------------3333------------------------------------ INTDDIGAYGFKDILREALRFGLIGDSKWVAYRDRNITSHTYDQEKAAVYAQIDDFLIES ------------------1111--------------3333---3333-3333-------- SFLLEQLRQ -----3333 >AZURIN; SWP:P80546; PDB:1JOI; AECKVTVDSTDQMSFNTKAIEIDKSCKTFTVELTHSGSLPKNVMGHNWVLSSAADMPGIA ---------1111---------3333-------------1111--------3333----- SDGMAAGIDKNYLKEGDTRVIAHTKIIGAGEKDSVTFDVSKLAAGTDYAFFCSFPGHISM -3333-3333---2222----------2222------1111------------2222111 MKGTVTVK 1------- >YHCH PROTEIN; SWP:P44583; PDB:1JOPA; MIISSLTNPNFKVGLPKVIAEVCDYLNTLDLNALENGRHDINDQIYMNVMEPKAELHHEY -------111122223333-----3333-3333--------------------------- LDVQVLIRGTENIEVGATYPNLSKYEDYNEADDYQLCADIDDKFTVTMKPKMFAVFYPYE --------------------3333--------------------------------2222 PHKPCCVIKKLVVKVPVKLI ---------------3333- >RIBOSOME-BINDING FACTOR A; SWP:P45141; PDB:1JOSA; RSDRVAQEIQKEIAVILQREVKDPRIGMVTVSDVEVSSDLSYAKIFVTFLFDHDEMAIEQ ----------------------1111----------1111-------------------- GMKGLEKASPYIRSLLGKAMRLRIVPEIRFIYDQSLVEGM ---------------------------------------- >AGGLUTININ; SWP:P18674; PDB:1JOTA; GVTFDDGAYTGIREINFEYNSETAIGGLRVTYDLNGMPFVAEDHKSFITGFKPVKISLEF ---------------------------------iiii----------------------- PSEYIVEVSGYVGKVEGYTVIRSLTFKTNKQTYGPYGVTNGTPFSLPIENGLIVGFKGSI --------------iiii------------------------------------------ GYWLDYFSIYLSL ------------- >HI1317; SWP:Q9RP27; PDB:1JOVA; MKTTLLKTLTPELHLVQHNDIPVLHLKHAVGTAKISLQGAQLISWKPQNAKQDVLWLSEV ---------1111----!!!!------3333------%%%%-----2222-------111 EPFKNGNAIRGGVPICYPWFGGVKQPAHGTARIRLWQLSHYYISVHKVRLEFELFSDLNI 1--2222------------------22221111----------1111--------1111- IEAKVSMVFTDKCHLTFTHYGEESAQAALHTYFNIGDINQVEVQGLPETCFNSLNQQQEN 
------------------------------------1111-------------------- VPSPRHISENVDCIYSAENMQNQILDKSFNRTIALHHHNASQFVLWNPWHKKTSGMSETG -----------------------------------------------!!!!-22221111 YQKMLCLETARIHHLLEFGESLSVEISLK 1111------------2222--------- >ENVZ_ECOLI; SWP:P02933; PDB:1JOYA; MAAGVKQLADDRTLLMAGVSHDLRTPLTRIRLATEMMSEQDGYLAESINKDIEECNAIIE ----------1111-------1111------------------3333------------- QFIDYLR ------- >3'(2'),5'-BISPHOSPHATE NU; SWP:Q9Z1N4; PDB:1JP4A; HNVLMRLVASAYSIAQKAGTIVRCVIAEGDLGIVQKTSATDLQTKADRMVQMSICSSLSR -------------------------------------1111------------------- KFPKLTIIGEEDLPEVDQELIEDGQSEEILKQPCPSQYSAIKEEDLVVWVDPVDGTKEYT -1111-----------3333------3333----3333---3333-------------11 EGLLDNVTVLIGIAYEGKAIAGIINQPYYNYQAGPDAVLGRTIWGVLGLGAFGFQLKEAP 111111--------iiii-------111111111111--------2222----------2 AGKHIITTTRSHSNKLVTDCIAAMNPDNVLRVGGAGNKIIQLIEGKASAYVFASPGCKKW 222------------------1111--------------------------------333 DTCAPEVILHAVGGKLTDIHGNPLQYDKEVKHMNSAGVLAALRNYEYYASRVPESVKSAL 3--------1111----1111-----1111---1111------3333-1111----1111 IP -- >neural kinase, Nuk=Eph/El; SWP:P54763; PDB:1JPAA; KIFIDPFTFEDPNEAVREFAKEIDISCVKIEQVIGAGEFGEVCSGHLKLREIFVAIKTLK ----1111---------------3333---------1111-------------------- SGYTEKQRRDFLSEASIMGQFDHPNVIHLEGVVTKSTPVMIITEFMENGSLDSFLRQNDG ----------------3333--1111-------------------1111--------222 QFTVIQLVGMLRGIAAGMKYLADMNYVHRDLAARNILVNSNLVCKVSDFPIRWTAPEAIQ 2--------------------1111------3333---1111-------1111------- YRKFTSASDVWSYGIVMWEVMSYGERPYWDMTNQDVINAIEQDYRLPPPMDCPSALHQLM -------------------1111--2222-------------------2222-------- LDCWQKDRNHRPKFGQIVNTLDKMIRNPNSLKA ------3333----------------3333--- >AGGLUTININ; SWP:P30617; PDB:1JPC; DNILYSGETLSTGEFLNYGSFVFIMQEDCNLVLYDVDKPIWATNTGGLSRSCFLSMQTDG ----2222--2222---!!!!----1111-----!!!!------2222--------1111 NLVVYNPSNKPIWASNTGGQNGNYVCILQKDRNVVIYGTDRWATGTHT -----1111-------------------1111---------------- >L-ALA-D/L-GLU EPIMERASE; SWP:NA; PDB:1JPDX; 
MRTVKVFEEAWPLHTPSRSEARVVVVELEEEGIKGTGECTPYPRYGESDASVMAQIMSVV -----------------------------------------3333--------------3 PQLEKGLTREELQKILPAGAARNALDCALWDLAARRQQQSLADLIGITLPETVITAQTVV 333----3333----------------------1111----------------------- IGTPDQMANSASTLWQAGAKLLKVKLDNHLISERMVAIRTAVPDATLIVDANESWRAEGL -----------------------------------------1111-----%%%%--2222 AARCQLLADLGVAMLEQPLPAQDDAALENFIHPLPICADESCHTRSNLKALKGRYEMVNI -------1111--------11113333-----------1111-3333-3333-------- KLDKTGGLTEALALATEARAQGFSLMLGCMLCTSRAISAALPLVPQVSFADLDGPTWLAV 3333--------------1111----------3333---33331111-----3333---- DVEPALQFTTGELHL --------2222--- >Tissue factor [Precursor]; SWP:P13726; PDB:1JPSH; EVQLVESGGGLVQPGGSLRLSCAASGFNIKEYYMHWVRQAPGKGLEWVGLIDPEQGNTIY ------------2222-----------3333--------2222----------------- DPKFQDRATISADNSKNTAYLQMNSLRAEDTAVYYCARDTAAYFDYWGQGTLVTVSSAST 3333---------1111---------3333------------------------------ KGPSVFPLAPSSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSS ---------------------------------%%%%--2222-------3333------ VVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP ----1111------------1111--------- >Prolactin-binding protein; SWP:Q9QV16; PDB:1JPTH; EVQLVESGGGLVQPGGSLRLSCAASGFNIKEYYMHWVRQAPGKGLEWVGLIDPEQGNTIY ------------2222-----------3333--------2222----------------- DPKFQDRATISADNSKNTAYLQMNSLRAEDTAVYYCARDTAAYFDYWGQGTLVTVSSAST 3333---------1111---------3333------------------------------ KGPSVFPLAPSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSV --------------------------------%%%%--2222-------1111------- VTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP ---3333------------1111--------- >IGKC protein; SWP:Q6GMW1; PDB:1JPTL; DIQMTQSPSSLSASVGDRVTITCRASRDIKSYLNWYQQKPGKAPKVLIYYATSLAEGVPS -------------2222-----------iiii------2222------------222233 RFSGSGSGTDYTLTISSLQPEDFATYYCLQHGESPWTFGQGTKVEIKRTVAAPSVFIFPP 33----!!!!--------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 
3333-------------------------%%%%--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGE -----------------1111------------ >GP41 ENVELOPE PROTEIN; SWP:Q87972; PDB:1JPXA; GIVQQQQQLLDVVKRQQELLRLTVWGTKQEWERKVDFLEENITALLEEAQIQQEKNMYE -----------------------------1111-----------------33331111- >INTERLEUKIN 17F; SWP:Q96PD4; PDB:1JPYA; HTFFQKPESCPPVPGGSMKLDIGIINENQRVSMSRNIESRSTSPWNYTVTWDPNRYPSEV 3333-------------------2222--------3333------------1111----- VQAQCRNLGCINAQGKEDISMNSVPIQQETLVVRRKHQGCSVSFQLEKVLVTVGCTCVTP -----------3333----------------------!!!!------------------- V - ------------------------------------------------------------ ----------- >METHANE MONOOXYGENASE COM; SWP:P22868; PDB:1JQ4A; MQRVHTITAVTEDGESLRFECRSDEDVITAALRQNIFLMSSCREGGCATCKALCSEGDYD ----------3333-----------3333------------------------------- LKGCSVQALPPEEEEEGLVLLCRTYPKTDLEIELPYTH ----1111---3333----------------------- >GLYCEROL DEHYDROGENASE; SWP:P32816; PDB:1JQ5A; AAERVFISPAKYVQGKNVITKIANYLEGIGNKTVVIADEIVWKIAGHTIVNELKKGNIAA ----------------33333333-3333------------------------1111--- EEVVFSGEASRNEVERIANIARKAEAAIVIGVGGGKTLDTAKAVADELDAYIVIVPTAAS ------------------------------------------------------------ TDAPTSALSVIYSDDGVFESYRFYKKNPDLVLVDTKIIANAPPRLLASGIADALATWVEA --1111------1111---------------------11113333--------------- RSVIKSGGKTMAGGIPTIAAEAIAEKCEQTLFKYGKLAYESVKAKVVTPALEAVVEANTL -----------------------------------------1111--------------- LSGLGFESGGLAAAHAIHNGFTALEGEIHHLTHGEKVAFGTLVQLALEEHSQQEIERYIE -------------------------3333---------------1111------------ LYLCLDLPVTLEDIKLKDASREDILKVAKAATAEGETIHNAFNVTADDVADAIFAADQYA --1111---3333-------------------22223333-------------------- KAYKEK ------ >NADP-DEPENDENT ALCOHOL DE; SWP:P25984; PDB:1JQBA; MKGFAMLGINKLGWIEKERPVAGSYDAIVRPLAVSPCTSDIHTVFEGALGDRKNMILGHE -------2222-----------1111--------------------1111---------- AVGEVVEVGSEVKDFKPGDRVIVPCTTPDWRSLEVQAGFQQHSNGMLAGWKFSNFKDGVF --------1111---2222---------------11113333--2222--2222------ 
GEYFHVNDADMNLAILPKDMPLENAVMITDMMTTGFHGAELADIEMGSSVVVIGIGAVGL -------3333-----1111------------------------2222------------ MGIAGAKLRGAGRIIGVGSRPICVEAAKFYGATDILNYKNGHIEDQVMKLTNGKGVDRVI --------------------------------------------------iiii------ MAGGGSETLSQAVKMVKPGGIISNINYHGSGDALLIPRVEWGCGMAHKTIKGGLCPGGRL ----------------2222--------------------%%%%---------------- RAERLRDMVVYNRVDLSKLVTHVYHGFDHIEEALLLMKDKPKDLIKAVVIL --------1111--3333---------------3333---1111------- >CARBOXYPEPTIDASE A; SWP:O97389; PDB:1JQGA; HEIYDGHAVYQVDVASMDQVKLVHDFENDLMLDVWSDAVPGRPGKVLVPKFKREIFENFL -1111----------3333-----------------------------1111-------- KQSGVQYKLEVENVKEQLELEDQLLAAAAAKS 1111---------------------------- >SHORT CHAIN ACYL-COA DEHY; SWP:P15651; PDB:1JQIA; VYQSVELPETHQMLRQTCRDFAEKELVPIAAQLDKEHLFPTSQVKKMGELGLLAMDVPEE --------------------------1111--------------------1111---333 LSGAGLDYLAYSIALEEISRGCASTGVIMSVNNSLYLGPILKFGSSQQKQQWITPFTNGD 3----------------3333-------------------------------1111---- KIGCFALSEPGNGSDAGAASTTAREEGDSWVLNGTKAWITNSWEASATVVFASTDRSRQN --------1111--3333-------!!!!-----------1111----------1111-- KGISAFLVPMPTPGLTLGKKEDKLGIRASSTANLIFEDCRIPKENLLGEPGMGFKIAMQT -----------2222----------3333------------1111---2222-------- LDMGRIGIASQALGIAQASLDCAVKYAENRHAFGAPLTKLQNIQFKLADMALALESARLL -------------------------------%%%%1111--------------------- TWRAAMLKDNKKPFTKESAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERYYRDARI ----------------------------------------3333-3333--------333 TEIYEGTSEIQRLVIAGHLLRSYR 3------------------3333- >CARBON MONOXIDE DEHYDROGE; SWP:P31896; PDB:1JQKA; ETAWHRYEKQQPQCGFGSAGLCCRICLKGPCRIDPFGEGPKYGVCGADRDTIVARHLVRM -3333-3333---------------1111-------------1111-3333--------- IAAGTAAHSEHGRHIALAMQHISQGELHDYSIRDEAKLYAIAKTLGVATEGRGLLAIVGD ------------------------------------------------2222-1111--- LAAITLGDFQNQDYDKPCAWLAASLTPRRVKRLGDLGLLPHNIDASVAQTMSRTHVGCDA ------1111--1111-----------------1111----------------------- 
DPTNLILGGLRVAMADLDGSMLATELSDALFGTPQPVVSAANLGVMKRGAVNIAVNGHNP ----------------------------------------------1111--------33 MLSDIICDVAADLRDEAIAAGAAEGINIIGICCTGHEVMMRHGVPLATNYLSQELPILTG 33--------1111-----------------------------------1111--3333- ALEAMVVDVQCIMPSLPRIAECFHTQIITTDKHNKISGATHVPFDEHKAVETAKTIIRMA ------------3333---1111---------------------3333------------ IAAFGRRDPNRVAIPAFKQKSIVGFSAEAVVAALAKVNADDPLKPLVDNVVNGNIQGIVL -3333--1111---------------------3333-3333------3333--------- FVGCNTTKVQQDSAYVDLAKSLAKRNVLVLATGCAAGAFAKAGLMTSEATTQYAGEGLKG -----1111-------------------------------------3333---------- VLSAIGTAAGLGGPLPLVMHMGSCVDNSRAVALATALANKLGVDLSDLPLVASAPECMSE ----------------------3333-----------------1111------------- KALAIGSWAVTIGLPTHVGSVPPVIGSQIVTKLVTETAKDLVGGYFIVDTDPKSAGDKLY -----------------------3333--------------------------------- AAIQERRAGL ---------- >DNA polymerase III subuni; SWP:P28630; PDB:1JQLB; MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEHHTFSIDPNTD ----3333---------------------------------3333--------------3 WNAIFSLCQAMSLFASRQTLLLLLPENGPNAAINEQLLTLTGLLHDDLLLIVRGNKLSKA 333---3333-1111---------1111-1111-----------3333------------ QENAAWFTALANRSVQVTCQ -----3333-1111------ >PHOSPHOENOLPYRUVATE CARBO; SWP:P00864; PDB:1JQNA; QYSALRSNVSMLGKVLGETIKDALGEHILERVETIRKLSKSSRAGNDANRQELLTTLQNL --------------------------------------------------------1111 SNDELLPVARAFSQFLNLANTAEQYHSISPKGEAASNPEVIARTLRKLKNQPELSEDTIK 1111------------------------1111-11113333---------3333------ KAVESLSLELVLTAHPTEITRRTLIHKMVEVNACLKQLDNKDIADYEHNQLMRRLRQLIA --1111-----------------3333----------------3333------------- QSWHTDEIRKLRPSPVDEAKWGFAVVENSLWQGVPNYLRELNEQLEENLGYKLPVEFVPV -------------3333------------------------------------3333--- RFTSWMGGDRDGNPNVTADITRHVLLLSRWKATDLFLKDIQVLVSELSMVEATPELLALV ----2222-2222---------------------------------------3333---- GEEGAAEPYRYLMKNLRSRLMATQAWLEARLKGEELPKPEGLLTQNEELWEPLYACYQSL -----------------------------1111-----------3333------------ 
QACGMGIIANGDLLDTLRRVKCFGVPLVRIDIRQESTRHTEALGELTRYLGIGDYESWSE -----3333--------------1111-------3333---------------3333--- ADKQAFLIRELNSKRPLLPRNWQPSAETREVLDTCQVIAEAPQGSIAAYVISMAKTPSDV -------------------------------------------------------3333- LAVHLLLKEAGIGFAMPVAPLFETLDDLNNANDVMTQLLNIDWYRGLIQGKQMVMIGYSD ----------------------------------------3333---%%%%--------- SAKDAGVMAASWAQYQAQDALIKTCEKAGIELTLFHGRGGSIGRGGAPAHAALLSQPPGS -------------------------1111----------3333-------------2222 LKGGLRVTEQGEMIRFKYGLPEITVSSLSLYTGAILEANLLPPPEPKESWRRIMDELSVI 1111-----3333---------------------------------3333---------- SCDVYRGYVRENKDFVPYFRSATPEQELGKLPLGSRPGGVESLRAIPWIFAWTQNRLMLP -----------------------3333-----------1111--------1111---333 AWLGAGTALQKVVEDGKQSELEAMCRDWPFFSTRLGMLEMVFAKADLWLAEYYDQRLVDK 3-----------1111------------------------------------------33 ALWPLGKELRNLQEEDIKVVLAIANDSHLMADLPWIAESIQLRNIYTDPLNVLQAELLHR 33-------------------1111----1111--------------------------- SRQAEKEGQEPDPRVEQALMVTIAGIAAGMRNTG ----1111-------------------------- >PHOSPHOENOLPYRUVATE CARBO; SWP:P04711; PDB:1JQOA; IEYDALLVDRFLNILQDLHGPSLREFVQECYEVSADYEGKGDTTKLGELGAKLTGLAPAD -----------------------------------------3333-------3333---- AILVASSILHMLNLANLAEEVQIAHRRRNSDIEETLKRLVSEVGKSPEEVFEALKNQTVD -------------------------3333------------------------1111--- LVFTAHPTQSARRSLLQKNARIRNCLTQLNAKDITDDDKQELDEALQREIQAAFRTDEIR --------------------------3333----------------------1111---- RAQPTPQAEMRYGMSYIHETVWKGVPKFLRRVDTALKNIGINERLPYNVSLIRFSSWMGG -----------3333---------------------1111-----3333----------- DRDGNPRVTPEVTRDVCLLARMMAANLYIDQIEELMFELSMWRCNDELRVRAEELHSSSG -2222------------------------------3333--------------------- SKVTKYYIEFWKQIPPNEPYRVILGHVRDKLYNTRERARHLLASGVSEISAESSFTSIEE ---1111-----------3333---------------------------3333---3333 FLEPLELCYKSLCDCGDKAIADGSLLDLLRQVFTFGLSLVKLDIRQESERHTDVIDAITT -----------------3333--------------------------------------- 
HLGIGSYREWPEDKRQEWLLSELRGKRPLLPPDLPQTDEIADVIGAFHVLAELPPDSFGP ------------------------------1111-------------3333--3333--- YIISMATAPSDVLAVELLQRECGVRQPLPVVPLFERLADLQSAPASVERLFSVDWYMDRI -------3333--------1111-------------3333-------3333--------- KGKQQVMVGYSDSGKDAGRLSAAWQLYRAQEEMAQVAKRYGVKLTLFHGRGGTVGRGGGP -----------1111---------------------3333------------3333---- THLAILSQPPDTINGSIRVTVQGEVIEFCFGEEHLCFQTLQRFTAATLEHGMHPPVSPKP ----1111----%%%%-------------------------------------------- EWRKLMDEMAVVATEEYRSVVVKEARFVEYFRSATPETEYGRMNIGSRPITTLRAIPWIF -----------------------------------3333-3333-----1111------- SWTQTRFHLPVWLGVGAAFKFAIDKDVRNFQVLKEMYNEWPFFRVTLDLLEMVFAKGDPG --1111-3333--------------3333-----------3333---------1111--- IAGLYDELLVAEELKPFGKQLRDKYVETQQLLLQIAGHKDILEGDPFLKQGLVLRNPYIT ----------3333---------------------------3333--------------- TLNVFQAYTLKRIRDPNFKVTPQPPLSKEAGLVKLNPASEYPPGLEDTLILTMKGIAAGM --------------1111-----------3333--------------------------- QNTG ---- >DIPEPTIDYL PEPTIDASE I; SWP:P80067; PDB:1JQPA; DTPANCTYPDLLGTWVFQVGPRHPRSHINCSVMEPTEEKVVIHLKKLDTAYDEVGNSGYF ------3333-------------3333--------------------------------- TLIYNQGFEIVLNDYKWFAFFKYEVKGSRAISYCHETMTGWVHDVLGRNWACFVGKKMLS -----------%%%%----------!!!!--------------1111------------- LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS -----1111iiii-------------3333------------1111-------------- CSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEYYYVGG -----!!!!--3333------------3333-------------------------2222 FYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHPFNPFELTNHAVLLVGYGKD --------------------------3333------------------------------ PVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPKL --------------3333-iiii--------%%%%------------- >PEROXISOMAL MEMBRANE PROT; SWP:P80667; PDB:1JQQA; ISEFGSEPIDPSKLEFARALYDFVPENPEMEVALKKGDLMAILSKKDPLGRDSDWWKVRT --------------------------1111----------------1111---------3 KNGNIGYIPYNYIEIIKRR 333-----1111------- >INOSINE-5'-MONOPHOSPHATE ; SWP:P12269; 
PDB:1JR1A; GLTAQQLFNCGDGLTYNDFLILPGYIDFTADQVDLTSALTKKITLKTPLVSSPMDTVTEA --3333--------1111----------3333---------------------1111--- GMAIAMALTGGIGFIHHNCTPEFQANEVRKVKKYEQGFITDPVVDRVRFEAKMGSRLVIM ------------------------------------------------------------ TKREDLVVAPAGITLKEANEILQRSKLPIVNENDELVAIIARTDLKKNRDYPLASKDAKK ---------2222---------------------------3333------1111--1111 QLLCGAAIGTHEDDKYRLDLLALAGVDVVVLDSSQGNSIFQINMIKYMKEKYPNLQVIGG ----------3333-------3333--------------------------1111----- NVVTAAQAKNLIDAGVDALRVGMGCGSICITQEVLACGRPQATAVYKVSEYARRFGVPVI -------------------------1111-3333----------------3333------ ADGGIQNVGHIAKALALGASTVMMGSLLAATTEAPGEYFFSDGIRLKKYRGMGSLDAMIK --------------1111------1111--1111----------------11113333-- VAQGVSGAVQDKGSIHKFVPYLIAGIQHSCQDIGAKSLTQVRAMMYSGELKFEKRTSSAQ -------------3333------------------------------------------- VEGGVHSLHSYEKRLF ---------------- >UROPORPHYRINOGEN-III SYNT; SWP:P10746; PDB:1JR2A; MKVLLLKDAKEDDCGQDPYIRELGLYGLEATLIPVLSFEFLSLPSFSEKLSHPEDYGGLI ------------iiii3333--3333-------------------------3333----- FTSPRAVEAAELCLEQNNKTEVWERSLKEKWNAKSVYVVGNATASLVSKIGLDTEGETCG --3333--------1111------------3333-------------1111--------- NAEKLAEYICSRESSALPLLFPCGNLKREILPKALKDKGIAMESITVYQTVAHPGIQGNL ------------------------3333-3333--1111-------------1111---- NSYYSQQGVPASITFFSPSGLTYSLKHIQELSGDNIDQIKFAAIGPTTARALAAQGLPVS -------------------------------!!!!1111-------------1111---- CTAESPTPQALATGIRKALQ ----------------1111 >10 KDA ANTI-SIGMA FACTOR; SWP:P32267; PDB:1JR5A; MNKNIDTVREIITVASILIKFSREDIVENRANFIAFLNEIGVTHEGRKLNQNSFRKIVSE --3333------------1111-3333------------------------------111 LTQEDKKTLIDEFNEGFEGVYRYLEMYTNK 1----------------------------- >HELICASE NS3; SWP:P26664; PDB:1JR6A; GSVTVPHPNIEEVALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKCDELAAKLVALGI ------3333------------------3333-----------3333------------- NAVAYYRGLDVSVIPTNGDVVVVATDALMTGFTGDFDSVIDCNTSDGKPQDAVSRTQRRG --------------------------------------------------3333------ 
RTGRGKPGIYRFVAPGER ------------------ >HYPOTHETICAL 37.4 KDA PRO; SWP:P76621; PDB:1JR7A; GQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDL ---2222----1111-------------------33333333------------------ CANQLQPLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQY %%%%----------3333------2222-3333--------------------------- YARFVVKNVYLRQPHRVMELHNDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHL ---------1111----------1111-------------------------11111111 DNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPVMRYIDQFVQPKDFEEGVWL -----3333-------------------------1111------3333------------ SELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASN ---------1111-----2222------------------1111---------------- HYQTHQ ------ >ERV2 PROTEIN, MITOCHONDRI; SWP:Q12284; PDB:1JR8A; DDKVKKEVGRASWKYFHTLLARFPDEPTPEEREKLHTFIGLYAELYPCGECSYHFVKLIE -------------------1111------------------------------------- KYPVQTSSRTAAAMWGCHIHNKVNEYLKKDIYDCATILEDYDCGC ------------------------1111-------3333------ >MANGANESE SUPEROXIDE DISM; SWP:Q7SIC3; PDB:1JR9A; KFELPELPYAYDALEPTIDKETMNIHHTKHHNTYVTKLNGALEGHEDLKNKSLNDLISNL ---------1111----------------------------22223333----------- DAVPENIRTAVRNNGGGHANHSLFWKLMSPNGGGKPTGEVADKINDKYGSFEKFQEEFAA ---3333---------------3333-----------3333------------------- AAAGRFGSGWAWLVVNNGEIEIMSTPIQDNPLMEGKKPILGLDVWEHAYYLKYQNKRPDY ---------------!!!!------!!!!---------------1111----!!!!---- ISAFWNVVNWDEVAAQYSQAA --3333--------------- >SUBGROUP A ROUS SARCOMA V; SWP:P98162; PDB:1JRFA; GSSRCPPGQFRCSEPPGAHGECYPQDWLCDGHPDCDDGRDEWGCGTS ------------------3333-3333-------------------- >Interferon-gamma receptor; SWP:P15260; PDB:1JRHH; AVKLQESGPGILKPSQTLSLTCSFSGFSLTTYGMGVWIRQSSGKGLEWLAHIWWDDDKYY ------------------------------2222------2222--------1111---- NPSLKSRLTISKDTSRNQVFLKITSV 33331111----1111---------- >ScFv 6H8 protein [Fragmen; SWP:Q7TQM2; PDB:1JRHL; SVEMTQSPSSFSVSLGDRVTITCKASEDIYNRLAWYQQKPGNAPRLLISGATSLETEVPS ----------------------------%%%%--------------------------33 
RFSGSGSGKDYTLSITSLQTEDVATYYCQQYWSTWTFGGGTKLEIKRADAAPTVSIFPCF 33----------------3333-------------------------------------- LNNFYPKDINVKGVLNSWTDQDSKDSTYSMSSTCEATHKTSTSPIVK ----------------------------------------------- --------------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:O26734; PDB:1JRMA; VITMDCLREVGDDLLVNIEVSPASGKFGIPSYNEWRKRIEVKIHSPPQKGKANREIIKEF ---------!!!!-------------------3333--------3333------------ SETFGRDVEIVSGQKSRQKTIRIQGMGRDLFLKLVSEKFGLEIP ------------3333---------------------------- >XANTHINE DEHYDROGENASE, C; SWP:O54050; PDB:1JROA; MEIAFLLNGETRRVRIEDPTQSLLELLRAEGLTGTKEGCNEGDCGACTVMIRDAAGSRAV ------iiii-------1111------11113333---------1111----3333---- NACLMMLPQIAGKALRTIEGIAAPDGRLHPVQQAMIDHHGSQCGFCTPGFIVSMAAAHDR 3333-33332222---1111--1111-----------------1111----------111 DRKDYDDLLAGNLCRCTGYAPILRAAEAAAGEPPADWLQADAAFTLPAFLPETSDALADW 1-------1111----------------------3333-3333----------------- YLAHPEATLIAGGTDVSLWVTKALRDLPEVAFLSHCKDLAQIRETPDGYGIGAGVTIAAL --------------3333--------------11111111-----------1111----- RAFAEGPHPALAGLLRRFASEQVRQVATIGGNIANGSPIGDGPPALIAMGASLTLRRGQE --3333-------3333--3333-------------1111------1111---------- RRRMPLEDFFLEYRKQDRRPGEFVESVTLPKSAPGLRCYKLSKRFDQDISAVCGCLNLTL ------3333--------2222----------1111------------------------ KGSKIETARIAFGGMAGVPKRAAAFEAALIGQDFREDTIAAALPLLAQDFTPLSDMRASA ----------------------3333---------------3333---------3333-- AYRMNAAQAMALRYVRELSGEAVAVLEVMP -----------------------1111--- >Xanthine dehydrogenase; SWP:O54051; PDB:1JROB; SVGKPLPHDSARAHVTGQARYLDDLPCPANTLHLAFGLSTEASAAITGLDLEPVRESPGV 2222---1111---------3333---1111-------------------------2222 IAVFTAADLPHDNDASPAPSPEPVLATGEVHFVGQPIFLVAATSHRAARIAARKARITYA ----1111------------------------------------------1111------ PRPAILTLDQALAADSRFEGGPVIWARGDVETALAGAAHLAEGCFEIGGQEHFYLEGQAA ------------------------------------------------------------ LALPAEGGVVIHCSSQHPSEIQHKVAHALGLAFHDVRVEMRRMGGGFGGKESQGNHLAIA 
----------------3333-----------1111----------iiii--3333----- CAVAARATGRPCKMRYDRDDDMVITGKRHDFRIRYRIGADASGKLLGADFVHLARCGWSA ---------------------------------------3333----------------! DLSLPVCDRAMLHADGSYFVPALRIESHRLRTNTQSNTAFRGFGGPQGALGMERAIEHLA !!!-------1111!!!!------------------------------------------ RGMGRDPAELRALNFYDPPEKKTQTTHYGQEVADCVLGELVTRLQKSANFTTRRAEIAAW 1111-------1111----------1111------------------------------- NSTNRTLARGIALSPVKFGISFTLTHLNQAGALVQIYTDGSVALNHGGTEMGQGLHAKMV 1111-------------------3333---------1111-------------------- QVAAAVLGIDPVQVRITATDTSKVPNTSATAASSGADMNGMAVKDACETLRGRLAGFVAA ---------3333------1111-------%%%%-------------------------- REGCAARDVIFDAGQVQASGKSWRFAEIVAAAYMARISLSATGFYATPKLSWDRLRGQGR ----3333---------iiii--------------------------------1111--- PFLYFAYGAAITEVVIDRLTGENRILRTDILHDAGASLNPALDIGQIEGAYVQGAGWLTT -------------------------------------------------------1111- EELVWDHCGRLMTHAPSTYKIPAFSDRPRIFNVALWDQPNREETIFRSKAVGEPPFLLGI -----1111-----1111----1111------------------%%%%----3333---- SAFLALHDACAACGPHWPDLQAPATPEAVLAAVRRAEGRA ---------1111--------------------------- >N-acetylornithine carbamo; SWP:Q8A1E9; PDB:1JS1X; MKKFTCVQDIGDLKSALAESFEIKKDRFKYVELGRNKTLLMIFFNSSLRTRLSTQKAALN -----3333----------------111111112222---------------------11 LGMNVIVLDINQGAWKLETERGVIMDGDKPEHLLEAIPVMGCYCDIIGVRSFARFENREY 11--------1111-----------------3333------------------------- DYNEVIINQFIQHSGRPVFSMEAATRHPLQSFADLITIEEYKKTARPKVVMTWAPHPRPL ------------------------------------------------------------ PQAVPNSFAEWMNATDYEFVITHPEGYELDPKFVGNARVEYDQMKAFEGADFIYAKNWAA -----------1111--------2222--3333!!!!---------2222---------- YTGDNYGQILSTDRNWTVGDRQMAVTNNAYFMHCLPVRRNMIVTDDVIESPQSIVIPEAA -!!!!--------1111-33331111-----------2222--------1111------- NREISATVVLKRLLENLPHHHHHH -------------1111------- >DOPA DECARBOXYLASE; SWP:P80041; PDB:1JS3A; MNASDFRRRGKEMVDYMADYLEGIEGRQVYPDVQPGYLRPLIPATAPQEPDTFEDILQDV ----------------------3333-------22223333------------------- 
EKIIMPGVTHWHSPYFFAYFPTASSYPAMLADMLCGAIGCIGFSWAASPACTELETVMMD ---3333--1111------------------------------3333------------- WLGKMLQLPEAFLAGEAGEGGGVIQGSASEATLVALLAARTKVVRRLQAASPGLTQGAVL ---1111-3333--------------------------------------1111------ EKLVAYASDQAHSSVERAGLIGGVKLKAIPSDGKFAMRASALQEALERDKAAGLIPFFVV -------1111--------------------1111------------------------- ATLGTTSCCSFDNLLEVGPICHEEDIWLHVDAAYAGSAFICPEFRHLLNGVEFADSFNFN -------------3333----------------3333---33333333-3333------- PHKWLLVNFDCSAMWVKRRTDLTGAFKSGLITDYRHWQLPLGRRFRSLKMWFVFRMYGVK -----------------33333333-------3333------------------------ GLQAYIRKHVQLSHEFEAFVLQDPRFEVCAEVTLGLVCFRLKGSDGLNEALLERINSARK ----------------------3333---------------------------------- IHLVPCRLRGQFVLRFAICSRKVESGHVRLAWEHIRGLAAELLA -------iiii--------1111--------------------- >HEMOCYANIN; SWP:O61363; PDB:1JS8A; AIIRKNVNSLTPSDIKELRDAMAKVQADTSDNGYQKIASYHGIPLSCHYENGTAYACCQH -----1111--------------------1111---------------1111-------- GMVTFPNWHRLLTKQMEDALVAKGSHVGIPYWDWTTTFANLPVLVTEEKDNSFHHAHIDV -1111---------------1111--------1111-----3333-----1111----11 ANTDTTRSPRAQLFSFFYRQIALALEQTDFCDFEIQFEIGHNAIHSWVGGSSPYGMSTLH 11-------3333----------3333-3333----------------!!!!--1111-- YTSYDPLFYLHHSNTDRIWSVWQALQKYRGLPYNTANCEINKLVKPKFNLDTNPNAVTKA 33333333-----------------------------------------1111--3333- HSTGATSFDYHKLGYDYDNLNFHGMTIPELEEHLKEIQHEDRVFAGFLLRTIGQSADVNF --3333--3333---------iiii---------3333---------------------- DVCTKDGECTFGGTFCILGGEHEMFWAFDRLFKYDITTSLKHLRLDAHDDFDIKVTIKGI ---3333------------2222----------------------1111---------33 DGHVLSNKYLSPPTVFLAPA 33----3333---------- >HAEMAGGLUTININ (HA1 CHAIN; SWP:Q91CD4; PDB:1JSDA; DKICIGYQSTNSTETVDTLTETNVPVTHAKELLHTSHNGMLCATNLGHPLILDTCTIEGL -----------------1111----------------------1111------------- IYGNPSCDLLLGGREWSYIVERPSAVNGMCYPGNVENLEELRSLFSSASSYQRIQIFPDT ---11111111----------1111-------------------3333---------333 IWNVSYSGTSSACSDSFYRSMRWLTQKNNAYPIQDAQYTNNRGKSILFMWGINHPPTDTV 
3--------3333-------------%%%%------------------------------ QTNLYTRTDTTTSVTTEDINRTFKPVIGPRPLVNGLHGRIDYYWSVLKPGQTLRVRSNGN --------------------------------%%%%-----------2222--------- LIAPWYGHILSGESHGRILKTDLNSGNCVVQCQTERGGLNTTLPFHNVSKYAFGNCPKYV ---------------------------------1111-----------1111-------- GVKSLKLAVGLRNVPAR ----------------- >Hemagglutinin; SWP:Q3SC70; PDB:1JSDB; GLFGAIAGFIEGGWPGLVAGWYGFQHSNDQGVGMAADSDSTQKAIDKITSKVNNIVDKMN 11112222-----1111----------1111----------------------------- KQYGIIDHEFSEIETRLNMINNKIDDQIQDIWTYNAELLVLLENQKTLDEHDANVNNLYN ----------1111---------------------------------------------- KVKRALGSNAMEDGKGCFELYHKCDDQCMETIRNGTYNRR -----!!!!----------------------1111----- >LYSOZYME; SWP:P00703; PDB:1JSE; KVYGRCELAAAMKRLGLDNYRGYSLGNWVCAAKFESNFNTHATNRNTDGSTDYGILQINS ------------11112222---3333--------%%%%--------------1111--- RWWCNDGRTPGSKNLCNIPCSALLSSDITASVNCAKKIASGGNGMNAWVAWRNRCKGTDV --------2222-1111-3333------------------1111----------2222-3 HAWIRGCRL 3332222-- >ONCOGENE PRODUCT P14TCL1; SWP:P56279; PDB:1JSG; CPTLGEAVTDHPDRLWAWEKFVYLDEKQHAWLPLTIEIKDRLQLRVLLRREDVVLGRPMT ------------------------1111----------%%%%-----------------3 PTQIGPSLLPIMWQLYPDGRYRSSDSSFWRLVYHIKIDGVEDMLLELLPDD 333------------1111---1111------------------------- >HAEMAGGLUTININ (HA1 CHAIN; SWP:Q9DLP3; PDB:1JSMA; DQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLNGVKPLILRDCSVAGW -----------------1111----------------------iiii----!!!!----- LLGNPMCDEFLNVPEWSYIVEKDNPVNGLCYPENFNDYEELKHLLSSTNHFEKIRIIPRS ---11111111---------------------------------1111---------111 SWSNHDASSGVSSACPYNGRSSFFRNVVWLIKKNNAYPTIKRSYNNTNQEDLLILWGIHH 11111------3333-%%%%---1111-----%%%%------------------------ PNDAAEQTKLYQNPTTYVSVGTSTLNQRSVPEIATRPKVNGQSGRMEFFWTILKPNDAIN ---------------------1111-------------iiii-----------2222--- FESNGNFIAPEYAYKIVKKGGSAIMKSGLEYGNCNTKCQTPMGAINSSMPFHNIHPLTIG ---------------------------------------1111----------------- ECPKYVKSGRLVLATGLRNVP --------------------- >Hemagglutinin 
[Fragment]; SWP:Q4ZJH5; PDB:1JSMB; GLFGAIAGFIEGGWQGMVDGWYGYHHSNEQGSGYAADKESTQKAIDGTTNKVNSIIDKMN 1111---------1111----------3333----------------------------- TQFEAVGKEFNNLERRIENLNKKMEDGFLDVWTYNAELLVLMENERTLDFHDSNVKNLYD ----------1111---------------------------------------------- KVRLQLRDNAKELGNGCFEFYHKCDNECMESVKNGTYDYP ---------------------------------------- >CREB-binding protein; SWP:Q92793; PDB:1JSPB; GSHMRKKIFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKNPMDLS -------------------3333-----33333333---3333----------------- TIKRKLDTGQYQEPWQYVDDVWLMFNNAWLYNRKTSRVYKFCSKLAEVFEQEIDPVMQSL ------------3333-----------------------1111---------33331111 G - >CHOLESTEROL-REGULATED STA; SWP:Q99JV5; PDB:1JSSA; ASISTKLQNTLIQYHSIEEDEWRVAKKAKDVTVWRKPSEEFNGYLYKAQGVMDDVVNNVI -3333--------1111-1111-------------------------------------- DHIRPGPWRLDWDRLMTSLDVLEHFEENCCVMRYTTAGQLLNIISPREFVDFSYTVGYEE -------1111--------------1111----------%%%%--------------!!! GLLSCGVSVEWSETRPEFVRGYNHPCGWFCVPLKDSPSQSLLTGYIQTDLRGMIPQSAVD !-------------3333-----------------1111---------------3333-- TAMASTLANFYSDLRKGLR ------------------- >Cyclin-dependent kinase i; SWP:P46527; PDB:1JSUC; KPSACRNLFGPVDHEELTRDLEKHCRDMEEASQRKWNFDFQNHKPLEGKYEWQEVEKGSL -3333----------------------------1111--1111------------3333- PEFYYRPPR 3333----- >L-ASPARTATE AMMONIA-LYASE; SWP:P04422; PDB:1JSWA; MSNNIRIEEDLLGTREVPADAYYGVHTLRAIENFYISNNKISDIPEFVRGMVMVKKAAAM ------------------------------------------11113333---------- ANKELQTIPKSVANAIIAACDEVLNNGKCMDQFPVDVYQGGAGTSVNMNTNEVLANIGLE --3333--3333-----------!!!!-------------iiii---------------1 LMGHQKGEYQYLNPNDHVNKCQSTNDAYPTGFRIAVYSSLIKLVDAINQLREGFERKAVE 111-------------------3333---------------------------------- FQDILKMGRTQLQDAVPMTLGQEFRAFSILLKEEVKNIQRTAELLLEVNLGATAIGTGLN 1111-----iiii---------3333---------------------------------- TPKEYSPLAVKKLAEVTGFPCVPAEDLIEATSDCGAYVMVHGALKRLAVKMSKICNDLRL -----------3333--------------------------------------------- 
LSSGPRAGLNEINLPELQAGSSIMPAKVNPVVPEVVNQVCFKVIGNDTTVTMAAEAGQLQ ---2222----------------------------------------------------- LNVMEPVIGQAMFESVHILTNACYNLLEKCINGITANKEVCEGYVYNSIGIVTYLNPFIG -----------------------------3333----3333---1111---1111----- HHNGDIVGKICAETGKSVREVVLERGLLTEAELDDIFSV ----------1111--------------3333------- >GLUCOSE-INHIBITED DIVISIO; SWP:P17113; PDB:1JSXA; MLNKLSLLLKDAGISLTDHQKNQLIAYVNMLHKWNEMLVRHILDSIVVAPYLQGERFIDV -----------------------------------------------3333--------- GTGPGLPGIPLSIVRPEAHFTLLDSLGKRVRFLRQVQHELKLENIEPVQSRVEEFPSEPP -!!!!---------1111--------------------------------3333------ FDGVISRAFASLNDMVSWCHHLPGEQGRFYALKGQMPEDEIALLPEEYQVESVVKLQVPD ------------------1111-1111-------------11111111------------ GERHLVVIKANKI ------------- >Hypothetical transcriptio; SWP:P23217; PDB:1JT6A; NLKDKILGVAKELFIKNGYNATTTGEIVKLSESSKGNLYYHFKTKENLFLEILNIEESKW ------------------3333------1111---------------------------- QEQWKKEQIKAKTNREKFYLYNELSLTTEYYYPLQNAIIEFYTEYYKTNSINEKMNKLEN --------------------------3333-3333---------3333------------ KYIDAYHVIFKEGNLNGEWSINDVNAVSKIAANAVNGIVTFTHEQNINERIKLMNKFSQI -------------1111------------------------33333333----------- FLNGLS --1111 >PROBABLE TRANSLATION INIT; SWP:Q57887; PDB:1JT8A; MAEQQQEQQIRVRIPRKEENEILGIIEQMLGASRVRVRCLDGKTRLGRIPGRLKNRIWVR -------------------------------------------------3333------- EGDVVIVKPWEVQGDQKCDIIWRYTKTQVEWLKRKGYLDELL ------------------------------------------ >Beta-lactamase inhibitory; SWP:O87916; PDB:1JTDB; VAATSVVAWGGNNDWGEATVPAEAQSGVDAIAGGYFHGLALKGGKVLGWGANLNGQLTMP ------------1111----3333---------1111----iiii------1111----3 AATQSGVDAIAAGNYHSLALKDGEVIAWGGNEDGQTTVPAEARSGVDAIAAGAWASYALK 333-----------------iiii------1111----3333-----------------i DGKVIAWGDDSDGQTTVPAEAQSGVTALDGGVYTALAVKNGGVIAWGDNYFGQTTVPAEA iii---------1111-3333-----------------iiii------1111----3333 QSGVDDVAGGIFHSLALKDGKVIAWGDNRYKQTTVPTEALSGVSAIASGEWYSLALKNGK -----------------iiii---------1111-3333-----------------iiii 
VIAWGSSRTAPSSVQSGVSSIEAGPNAAYALKG ----%%%%--3333------------------- ------------------------------------------------------------ -- >17 BETA-HYDROXYSTEROID DE; SWP:P14061; PDB:1JTVA; ARTVVLITGCSSGIGLHLAVRLASDPSQSFKVYATLRDLKTQGRLWEAARALACPPGSLE ------------------------3333---------3333--------1111-2222-- TLQLDVRDSKSVAAARERVTEGRVDVLVCNAGLGLLGPLEALGEDAVASVLDVNVVGTVR ----1111----------3333---------------3333------------------- MLQAFLPDMKRRGSGRVLVTGSVGGLMGLPFNDVYCASKFALEGLCESLAVLLLPFGVHL ---------------------3333---2222--------------------1111---- SLIECGPVHTGSPEEVLDRTDIHTFHRFYQYLAHSKQVFREAAQNPEEVAEVFLTALRAP ----------------1111---------------------------------------- KPTLRYFTTERFLPLLRMRLDDPSGSNYVTAMHREVFG ---------1111--------3333------------- >HYDROXYNITRILE LYASE; SWP:Q945K2; PDB:1JU2A; LATTSDHDFSYLSFAYDATDLELEGSYDYVIVGGGTSGCPLAATLSEKYKVLVLERGSLP --------3333----3333--------------3333--------------------33 TAYPNVLTADGFVYNLQQEDDGKTPVERFVSEDGIDNVRGRVLGGTSIINAGVYARANTS 331111-3333---3333------------1111-------22221111--------111 IYSASGVDWDMDLVNQTYEWVEDTIVYKPNSQSWQSVTKTAFLEAGVHPNHGFSLDHEEG 11111-------------------------------------1111-------------- TRITGSTFDNKGTRHAADELLNKGNSNNLRVGVHASVEKIIFSNAPGLTATGVIYRDSNG --------1111---3333-11111111----------------------------1111 TPHQAFVRSKGEVIVSAGTIGTPQLLLLSGVGPESYLSSLNIPVVLSHPYVGQFLHDNPR -------2222------3333----------------1111------1111--------- NFINILPPNPIEPTIVTVLGISNDFYQCSFSSLPFTTPPFGFFPSSSYPLPNSTFAHFAS ---------------------1111-------------2222------------------ KVAGPLSYGSLTLKSSSNVRVSPNVKFNYYSNLTDLSHCVSGMKKIGELLSTDALKPYKV -----------------1111------2222--------------------33331111- EDLPGVEGFNILGIPLPKDQTDDAAFETFCRESVASYWHYHGGCLVGKVLDGDFRVTGIN ---!!!!-----------1111----------------------2222--1111-2222- ALRVVDGSTFPYTPASHPQGFYLMLGRYVGIKILQERSASD -----1111----------------------------1111 >COCAINE ESTERASE; SWP:Q9L9D7; PDB:1JU3A; NYSVASNVMVPMRDGVRLAVDLYRPDADGPVPVLLVRNPYDKFDVFAWSTQSTNWLEFVR 
-----------1111-------------------------333333331111------11 DGYAVVIQDTRGLFASEGEFVPHVDDEADAEDTLSWILEQAWCDGNVGMFGVSYLGVTQW 11-------2222---------1111-------------1111--------!!!!----- QAAVSGVGGLKAIAPSMASADLYRAPWYGPGGALSVEALLGWSALIGTGLITSRSDARPE -1111-3333----------3333----1111-------------------------111 DAADFVQLAAILNDVAGAASVTPLAEQPLLGRLIPWVIDQVVDHPDNDESWQSISLFERL 1---------------------3333--3333--------3333---3333111133331 GGLATPALITAGWYDGFVGESLRTFVAVKDNADARLVVGPWSHSNLTGRNADRKFGIAAT 111--------1111--------------------------1111----------3333- YPIQEATTMHKAFFDRHLRGETDALAGVPKVRLFVMGIDEWRDETDWPLPDTAYTPFYLG --------------------11112222--------------------1111-------- GSGAANTSTGGGTLSTSISGTESADTYLYDPADPVPSLGGTLLFHNGDNGPADQRPIHDR ------1111-------------------3333-----!!!!-----------3333--1 DDVLCYSTEVLTDPVEVTGTVSARLFVSSSAVDTDFTAKLVDVFPDGRAIALCDGIVRMR 111----------------------------------------1111----------333 YRETLVNPTLIEAGEIYEVAIDMLATSNVFLPGHRIMVQVSSSNFPKYDRNSNTGGVIAR 3----------2222---------------2222----------------------1111 EQLEEMCTAVNRIHRGPEHPSHIVLPIIKR -3333----------1111----------- >LEGINSULIN; SWP:Q39837; PDB:1JU8A; ADCNGACSPFEVPPCRSRDCRCVPIGLFVGFCIHPTG -------3333-----3333-----1111-------- >DIHYDROOROTATE DEHYDROGEN; SWP:P54321; PDB:1JUBA; MLNTTFANAKFANPFMNASGVHCMTIEDLEELKASQAGAYITKSSTLEKREGNPLPRYVD -----%%%%--------2222--------------------------------------- LELGSINSMGLPNLGFDYYLDYVLKNQKENAQEGPIFFSIAGMSAAENIAMLKKIQESDF 1111----------3333-------3333------------------------------- SGITELNLSCPNVPGEPQLAYDFEATEKLLKEVFTFFTKPLGVKLPPYFDLVHFDIMAEI ------------2222-1111----------------------------3333------- LNQFPLTYVNSVNSIGNGLFIDPEAESVVIKPKDGFGGIGGAYIKPTALANVRAFYTRLK 1111------------------1111-----%%%%-----3333----------1111-3 PEIQIIGTGGIETGQDAFEHLLCGATMLQIGTALHKEGPAIFDRIIKELEEIMNQKGYQS 333--------------------------------------------------------3 IADFHGKLKSL 333-------- >LYSOZYME; SWP:P37156; PDB:1JUG; KILKKQELCKNLVAQGMNGYQHITLPNWVCTAFHESSYNTRATNHNTDGSTDYGILQINS 
------------11112222---3333--------%%%%------1111----1111--- RYWCHDGKTPGSKNACNISCSKLLDDDITDDLKCAKKIAGEAKGLTPWVAWKSKCRGHDL -------------1111--1111---------------------3333------2222-3 SKFKC 333-- >QUERCETIN 2,3-DIOXYGENASE; SWP:Q7SIC2; PDB:1JUHA; SSLIVEDAPDHVRPYVIRHYSHARAVTVDTQLYRFYVTGPSSGYAFTLMGTNAPHSDALG -----------------2222%%%%--!!!!------3333%%%%--------------- VLPHIHQKHYENFYCNKGSFQLWAQSGNETQQTRVLSSGDYGSVPRNVTHTFQIQDPDTE -------------------------!!!!-------2222----2222------------ MTGVIVPGGFEDLFYYLGTNATDTTHTPYIPSISTLQSFDVYAELSFTPRTDTVNGTAPA ----------------------1111------33331111---1111------iiii--- NTVWHTGANALASTAGDPYFIANGWGPKYLNSQYGYQIVAPFVTATQAQDTNYTLSTISM ---------------------2222------------------3333!!!!--------- STTPSTVTVPTWSFPGACAFQVQEGRVVVQIGDYAATELGSGDVAFIPGGVEFKYYSEAY ---1111-----------------------!!!!-----2222----2222--------- FSKVLFVSSGSDGLDQNLVNGGEEWSSVSFPADW ---------------------------------- >SORCIN; SWP:P30626; PDB:1JUOA; FPGQTQDPLYGYFAAVAGQDGQIDADELQRCLTQSGIAGGYKPFNLETCRLMVSMLDRDM ------3333------!!!!---------------1111-----------------1111 SGTMGFNEFKELWAVLNGWRQHFISFDTDRSGTVDPQELQKALTTMGFRLSPQAVNSIAK --------------------------3333------------------------------ RYSTNGKITFDDYIACCVKLRALTDSFRRRDTAQQGVVNFPYDDFIQCVMSV ------------------------------1111-------------3333- >ADP-RIBOSYLATION FACTOR B; SWP:Q9NZ52; PDB:1JUQA; ESLESWLNKATNPSNRQEDWEYIIGFCDQINKELEGPQIAVRLLAHKIQSPQEWEALQAL -----------3333--------------------------------------------- TVLEACKNCGRRFHNEVGKFRFLNELIKVVSPKYLGDRVSEKVKTKVIELLYSWTALPEE ---------3333-----3333----3333----1111------------------3333 AKIKDAYHLKRQGIVQSDPPIPVDRTLI ---------1111----------1111- >DIHYDROFOLATE REDUCTASE; SWP:P04382; PDB:1JUVA; MIKLVFRYSPTKTVDGFNELAFGLGDGLPWGRVKKDLQNFKARTEGTIMIMGAKTFQSLP ------------1111-------!!!!1111------------2222------------- TLLPGRSHIVVCDLARDYPVTKDGDLAHFYITWEQYITYISGGEIQVSSPNAPFETMLDQ --2222------1111----1111----------------------------------11 
NSKVSVIGGPALLYAALPYADEVVVSRIVKRHRVNSTVQLDASFLDDISKREMVETHWYK 11-------------3333---------------------3333---3333--------- IDEVTTLTESVYK ------------- >GLCNAC1P URIDYLTRANSFERAS; SWP:Q16222; PDB:1JV1A; MNINDLKLTLSKAGQEHLLRFWNELEEAQQVELYAELQAMNFEELNFFFQKAIEGFRMEP ----------1111--11113333------------1111-------------------- VPREVLGSATRDQDQLQAWESEGLFQISQNKVAVLLLAGGQGTRLGVAYPKGMYDVGLPS -3333------3333-----------1111------------1111---3333----111 RKTLFQIQAERILKLQQVAEKYYGNKCIIPWYIMTSGRTMESTKEFFTKHKYFGLKKENV 1----------------------------------1111------------iiii1111- IFFQQGMLPAMSFDGKIILEEKNKVSMAPDGNGGLYRALAAQNIVEDMEQRGIWSIHVYC -----------1111-----1111------------------------1111-------- VDNILVKVADPRFIGFCIQKGADCGAKVVEKTNPTEPVGVVCRVDGVYQVVEYSEISLAT --1111-----------1111-----------1111-------iiii----1111----- AQKRSSDGRLLFNAGNIANHFFTVPFLRDVVNVYEPQLQHHVAQKKIPYVDTQGQLIKPD ----1111-------------------------3333-------------1111------ KPNGIKMEKFVFDIFQFAKKFVVYEVLREDEFSPLKNADSQNGKDNPTTARHALMSLHHC ---------11113333---------3333-------1111------------------- WVLNAGGHFIDENGSRLPAIPRLKDANDVPIQCEISPLISYAGEGLESYVADKEFHAPLI ----------1111----------1111-------3333------33332222------- IDENGVHELV -1111----- >INTEGRIN, ALPHA V; SWP:P06756; PDB:1JV2A; FNLDVDSPAEYSGPEGSYFGFAVDFFVPSASSRMFLLVGAPKANTTQPGIVEGGQVLKCD -------------2222----------1111---------------2222---------- WSSTRRCQPIEFDATGNRDYAKDDPLEFKSHQWFGASVRSKQDKILACAPLYHWRTEMKQ ------------------------------------------------1111-------- EREPVGTCFLQDGTKTVEYAPCRSQDIDADGQGFCQGGFSIDFTKADRVLLGGPGSFYWQ -------------------1111----11111111-----------------1111---- GQLISDQVAEIVSKYDPNVYSIKYNNQLATRTAQAIFDDSYLGYSVAVGDFNGDGIDDFV ---------------------------------3333---2222---------------- SGVPRAARTLGMVYIYDGKNMSSLYNFTGEQMAAYFGFSVAATDINGDDYADVFIGAPLF ------------------------------2222----------------------1111 MDRGSDGKLQEVGQVSVSLQRASGDFQTTKLNGFEVFARFGSAIAPLGDLDQDGFNDIAI ---1111-------------1111------------------------------------ 
AAPYGGEDKKGIVYIFNGRSTGLNAVPSQILEGQWAARSMPPSFGYSMKGATDIDKNGYP ------%%%%--------3333--------------------2222-------------- DLIVGAFGVDRAILYRARPVITVNAGLEVYPSILNQDNKTCSLPGTALKVSCFNVRFCLK ------------------------------------------------------------ ADGKGVLPRKLNFQVELLLDKLKQKGAIRRALFLYSRSPSHSKNMTISRGGLMQCEELIA -------------------3333------------------------------------- YLRDESEFRDKLTPITIFMEYRLDYRTAADTTGLQPILNQFTPANISRQAHILLDCGEDN ------------------------------------------------------------ VCKPKLEVSVDSDQKKIYIGDDNPLTLIVKAQNQGEGAYEAELIVSIPLQADFIGVVRNN -----------------------------------------------------------1 EALARLSCAFKTENQTRQVVCDLGNPMKAGTQLLAGLRFSVHQQSEMDTSVKFDLQIQSS 111--------------------------------------------------------- NLFDKVSPVVSHKVDLAVLAAVEIRGVSSPDHVFLPIPNWEHKENPETEEDVGPVVQHIY ---------------------------------3333----------3333--------- ELRNNGPSSFSKAMLHLQWPYKYNNNTLLYILHYDIDGPMNCTSDMEINPLRIKISSLDI ----------------------%%%%----------------------1111-------- HTLGCGVAQCLKIVCQVGRLDRGKSAILYVKSLLWTETFMNKENQNHSYSLKSSASFNVI ------------------------------------------------------------ EFPYKNLPIEDITNSTLVTTNVTWGIQ --------------------------- >Integrin beta-3 [Precurso; SWP:P05106; PDB:1JV2B; EFPVSEARVLEDRPLSDKGSGDSSQVTQVSPQRIALRLRPDDSKNFSIQVRQVEDYPVDI -----------------1111-----------------2222------------------ YYLMDLSYSMKDDLWSIQNLGTKLATQMRKLTSNLRIGFGAFVDKPVSPYMYISPPEALE ------33333333--3333------3333-------------------------1111- NPCYDMKTTCLPMFGYKHVLTLTDQVTRFNEEVKKQSVSRNRDAPEGGFDAIMQATVCDE ------------------------------3333-------------3333-------33 KIGWRNDASHLLVFTTDAKTHIALDGRLAGIVQPNDGQCHVGSDNHYSASTTMDYPSLGL 33-------------------22223333------------3333-1111---------- MTEKLSQKNINLIFAVTENVVNLYQNYSELIPGTTVGVLSMDSSNVLQLIVDAYGKIRSK -----1111-------33333333------2222-----1111-3333------------ VELEVRDLPEELSLSFNATCLNNEVIPGLKSCMGLKIGDTVSFSIEAKVRGCPQEKEKSF --------3333------------------------------------------------ TIKPVGFKDSLIVQVTFDCDKGEMCSGHGQCSCGDCLCDSDWTGYYCNCTTRTDTCMSSN 
---2222------------------------------------1111-----3333-111 GLLCSGRGKCECGSCVCIQPGSYGDTCEKCPTCPDACTFKKECVECKKFDREPYMTENTC 1-%%%%-----------------1111--1111-----3333------------------ NRYCRDEIESVKELKDTGKDAVNCTYKNEDDCVVRFQYYEDSSGKSILYVVEEPECPKG ---------------------------1111---------------------------- >IG KAPPA CHAIN PRECURSOR ; SWP:NA; PDB:1JV5B; QVQLQQPGAELVKPGTSVKLSCKASGYNFTSYWINWVKLRPGQGLEWIGDIYPGSGITNY ------------2222-----------3333--------2222----------------- NEKFKSKATLTVDTSSSTAYMQLSSLASEDSALYYCAGQYGNLWFAYWGQGTLVTVS 1111---------1111---------3333---------iiii-------------- >NAD(H)-DEPENDENT ALCOHOL ; SWP:P39462; PDB:1JVBA; RAVRLVEIGKPLSLQEIGVPKPKGPQVLIKVEAAGVCHSDVHRQGRFGNLRIVEDLGVKL ----------------------!!!!----------3333------!!!!---------- PVTLGHEIAGKIEEVGDEVVGYSKGDLVAVNPWQGEGNCYYCRIGEEHLCDSPRWLGINF ---------------1111---2222---------------11113333-----2222-- DGAYAEYVIVPHYKYYKLRRLNAVEAAPLTCSGITTYRAVRKASLDPTKTLLVVGAGGGL -----------3333------3333--------------------1111-----111133 GTAVQIAKAVSGATIIGVDVREEAVEAAKRAGADYVINASMQDPLAEIRRITESKGVDAV 33-----------------------------------3333----------iiii----- IDLNNSEKTLSVYPKALAKQGKYVVGLFGADLHYHAPLITLSEIQFVGSLVGNQSDFLGI -----3333--3333--2222-------------3333-1111----------------- RLAEAGKVKPITKTKLEEANEAIDNLENFKAIGRQVLIP --1111--------3333--------------------- >IMMUNOGLOBULIN LAMBDA LIG; SWP:NA; PDB:1JVKA; TALTQPASVSGSPGQSITVSCTGVSSIVGSYNLVSWYQQHPGKAPKLLTYEVNKRPSGVS -----------2222------------------------2222----------------- DRFSGSKSGNSASLTISGLQAEDEADYYCSSYDGSSTSVVFGGGTKLTVLGQPKAAPSVT -------!!!!--------3333------------------------------------- LFPPSSEELQANKATLVCLISDFYPGAVTVAWKADSSPVKAGVETTKPSKQSNNKYAASS ----33331111--------------------------------------1111------ YLSLTPEQWKSHRSYSCQVTHEGSTVEKTVAPTAC --------------------iiii----------- >BIFUNCTIONAL HISTIDINE BI; SWP:P33734; PDB:1JVNA; MPVVHVIDVESGNLQSLTNAIEHLGYEVQLVKSPKDFNISGTSRLILPGVGNYGHFVDNL ---------------------1111-------1111-3333------------------- 
FNRGFEKPIREYIESGKPIMGIVGLQALFAGSVESPKSTGLNYIDFKLSRFDDSEKPVPE ----------------------3333-----3333------------------------- IGWNSCIPSENLFFGLDPYKRYYFVHSFAAILNSEKKKNLENDGWKIAKAKYGSEEFIAA ------------iiii1111--------------------1111-------!!!!----- VNKNNIFATQFHPEKSGKAGLNVIENFLKQQSPPIPNYSAEEKELLMNDYSNYGLTRRII --!!!!-----3333-----------------------------------iiii------ ACLDVRTNDQGDLVVTKGDLGKPVQLAQKYYQQGADEVTFLNITDCPLKDTPMLEVLKQA -------1111----1111-----------1111------------3333---------- AKTVFVPLTVGGGIKDIVDVDGTKIPALEVASLYFRSGADKVSIGTDAVYAAEKYYELGN ------------------1111------------1111------3333------------ RGDGTSPIETISKAYGAQAVVISVDPKRVYVNSQADTKNKVFETEYPGPNGEKYCWYQCT -----3333------3333-------------3333-----------1111--------- IKGGRESRDLGVWELTRACEALGAGEILLNCIDKDGSNSGYDLELIEHVKDAVKIPVIAS -iiii--------------1111--------1111------------------------- SGAGVPEHFEEAFLKTRADACLGAGMFHRGEFTVNDVKEYLLEHGLKVRMDEE ----3333------------------1111--3333-----1111-------- >MACROPHAGE INFECTIVITY PO; SWP:Q09734; PDB:1JVWA; AASHEERMNNYRKRVGRLFMEQKAAQPDAVKLPSGLVFQRIARGSGKRAPAIDDKCEVHY 1111---------------------1111--3333---------------1111------ TGRLRDGTVFDSSRERGKPTTFRPNEVIKGWTEALQLMREGDRWRLFIPYDLAYGVTGGG ---1111----3333-------1111------------2222------3333--3333-- GMIPPYSPLEFDVELISIKDGGKGRTAEEVDEILRKAEED -----------------2222------------------- >HEMOLYSIN EXPRESSION MODU; SWP:P23870; PDB:1JW2A; MSEKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHRLAELTMNKLY --11113333----------3333-----------3333--------------------- DKIPSSVWKFIR ------------ >CONSERVED HYPOTHETICAL PR; SWP:O27635; PDB:1JW3A; MKGFEFFDVTADAGFWAYGHDLEEVFENAALAMFEVMTDTSLVEAAEERRVEITSEDRVS ---------------------3333-------3333---------------------333 LLYDWLDELLFIHDTEFILFSKFKVKIDEKDDGLHLTGTAMGEEIKEGHERRDEVKAVTF 3----------------------------------------------------------- HMMEILDEDGLIKARVILDL -------iiii--------- >ENOYL-ACP REDUCTASE; SWP:NA; PDB:1JW7A; GFLKGKKGLIVGVANNKSIAYGIAQSCFNQGATLAFTYLNESLEKRVRPIAQELNSPYVY 
1111-----------------------1111----------------------------- ELDVSKEEHFKSLYNSVKKDLGSLDFIVHSVAFAPKEALEGSLLETSKSAFNTAMEISVY --1111----------------------------3333---3333--------------- SLIELTNTLKPLLNNGASVLTLSYLGSTKYMAHYNVMGLAKAALESAVRYLAVDLGKHHI --------3333-----------3333---22223333----------------3333-- RVNALSAGPIRTLASSGIADFRMILKWNEINAPLRKNVSLEEVGNAGMYLLSSLSSGVSG -----------3333----3333--------1111----------------3333----- EVHFVDAGYHVMGMGAVEEKDNKATLLWDLHKEQ -------3333--------------3333----- >Molybdopterin biosynthesi; SWP:P12282; PDB:1JW9B; AELSDQEMLRYNRQIILRGFDFDGQEALKDSRVLIVGLGGLGCAASQYLASAGVGNLTLL -------------1111------------------------------------------- DFDTVSLSNLQRQTLHSDATVGQPKVESARDALTRINPHIAITPVNALLDDAELAALIAE -----3333---33333333---3333---------1111-----------------111 HDLVLDCTDNVAVRNQLNAGCFAAKVPLVSGAAIRMEGQITVFTYQDGEPCYRCLSRLFG 1--------------------------------------------2222-3333-1111- EAGVMAPLIGVIGSLQAMEAIKMLAGYGKPASGKIVMYDAMTCQFREMKLMRNPGCEVCG ---------------------------------------1111---------1111---- >BITISCETIN; SWP:Q7LZK5; PDB:1JWIA; CLPDWSSYKGHCYKVFKKVGTWEDAEKFCVENSGHLASIDSKEEADFVTKLASQTLFVYD -1111--iiii------------------------------------------------- AWIGLRDESKTQQCSPQWTDGSSVVYENVDEPTKCFGLDVHTEYRTWTDLPCGEKNPFIC -----------------1111------------------1111-------1111------ KS -- >Bitiscetin beta chain; SWP:Q7LZK8; PDB:1JWIB; GCLPDWSSYKGHCYKVFKVEKTWADAEKFCKELVNGGHLMSVNSREEGEFISKLALEKMR --2222--iiii---------------------2222----------------------- IVLVWIGLSHFWRICPLRWTDGARLDYRALSDEPICFVAESFHNKWIQWTCNRKKSFVCK ----------3333----1111---------------------------1111------- YRV --- >CSK HOMOLOGOUS KINASE; SWP:P42679; PDB:1JWOA; LSLMPWFHGKISGQEAVQQLQPPEDGLFLVRESARHPGDYVLCVSFGRDVIHYRVLHRDG 1111-------3333--------2222--------2222------iiii----------- HLTIDEAVFFCNLMDMVEHYSKDKGAICTKLVRPKRK ------------------------------------- >N-ACETYLMURAMOYL-L-ALANIN; SWP:Q9LCR3; PDB:1JWQA; MKVVVIDAGHGAKDSGAVGISRKNYEKTFNLAMALKVESILKQNPKLEVVLTRSDDTFLE 
----------!!!!----3333--3333---------------1111------------- LKQRVKVAENLKANVFVSIHANSSGSSASNGTETYYQRSASKAFANVMHKYFAPATGLTD -------------------------3333--------3333----------3333----- RGIRYGNFHVIRETTMPAVLLEVGYLSNAKEEATLFDEDFQNRVAQGIADGITEYLDVK -------3333-------------1111------------------------------- >DNA POLYMERASE IV (FAMILY; SWP:Q97W02; PDB:1JX4A; IVLFVDFDYFYAQVEEVLNPSLKGKPVVVCVFSGRFEDSGAVATANYEARKFGVKAGIPI ------------------3333-------------2222-------3333----2222-- VEAKKILPNAVYLPRKEVYQQVSSRINLLREYSEKIEIASIDEAYLDISDKVRDYREAYN ------1111------------------3333---------------1111--------- LGLEIKNKILEKEKITVTVGISKNKVFAKIAADAKPNGIKVIDDEEVKRLIRELDIADVP ------------------------------------------------------333322 GIGNITAEKLKKLGINKLVDTLSIEFDKLKGIGEAKAKYLISLARDEYNEPIRTRVRKSI 22--------1111--3333----3333--------------1111-------------- GRIVTKRNSRNLEEIKPYLFRAIEESYYKLDKRIPKAIHVVAVTEDLDIVSRGRTFPHGI ----------3333-----------------------------1111------------- SKETAYSESVKLLQKILEEDERKIRRIGVRFSKFI ----------------------------------- >LUXP PROTEIN; SWP:P54300; PDB:1JX6A; GYWGYQEFLDEFPEQRNLTNALSEAVRAQPVPLSKPTQRPIKISVVYPGQQVSDYWVRNI ------------------------------------------------------------ ASFEKRLYKLNINYQLNQVFTRPNADIKQQSLSLMEALKSKSDYLIFTLDTTRHRKFVEH -------1111----------2222--------------------------1111----- VLDSTNTKLILQNITTPVREWDKHQPFLYVGFDHAEGSRELATEFGKFFPKHTYYSVLYF -----------------3333----------------------------2222------- SEGYISDVRGDTFIHQVNRDNNFELQSAYYTKATKQSGYDAAKASLAKHPDVDFIYACST ------------------------------------------------------------ DVALGAVDALAELGREDIMINGWGGGSAELDAIQKGDLDITVMRMNDDTGIAMAEAIKWD --------------3333-----------------------------------------1 LEDKPVPTVYSGDFEIVTKADSPERIEALKKRAFRYSD 111--------------1111----------------- >HYPOTHETICAL PROTEIN YCHN; SWP:P39164; PDB:1JX7A; QKIVIVANGAPYGSESLFNSLRLAIALREQESNLDLRLFLSDAVTAGLRGQKPGEGYNIQ ---------2222-----------------1111------11111111---------333 QLEILTAQNVPVKLCKTCTDGRGISTLPLIDGVEIGTLVELAQWTLSADKVLTF 
3---3333-----------11111111---------3333--3333-------- >PUTATIVE TRYPSIN INHIBITO; SWP:Q42328; PDB:1JXCA; CPEIEAQGNECLKEYGGDVGFGFCAPRIFPTICYTRCRENKGAKGGRCRWGQGSNVKCLC -------------------3333-------3333-3333--------------------- DFCGDTPQ -------- >PLASTOCYANIN A; SWP:P00299; PDB:1JXGA; MIDVLLGADDGSLAFVPSEFSCSPGCKIVFKNNAGFPHNIVFDEDSIPSGVDASKISMSE -------1111-----------2222----------------1111-22223333---11 EDLLNAKGETFEVALSNKGEYSFYCSPHQGAGMVGKVTVN 11---2222----------------1111----------- >PHOSPHOMETHYLPYRIMIDINE K; SWP:P55882; PDB:1JXHA; MQRINALTIAGTDPSGGAGIQADLKTFSALGAYGCSVITALVAENTCGVQSVYRIEPDFV ------------1111-----------1111-------------1111-------3333- AAQLDSVFSDVRIDTTKIGMLAETDIVEAVAERLQRHHVRNVVLDTVMLLLSPSAIETLR ------1111-------------------------------------------------- VRLLPQVSLITPNLPEAAALLDAPHARTEQEMLAQGRALLAMGCEAVLMKGDWLFTREGE --1111-------------------------------------------------3333- QRFRVNTKNTHGTGCTLSAALAALRPRHRSWGETVNEAKAWLSAALAQADTLEVGKGIGP ---------2222----------3333--------------------3333--------- VHHFHAWW -1111--- >POSTSYNAPTIC DENSITY PROT; SWP:P31016; PDB:1JXMA; GFYIRALFDYCGFLSQALSFRFGDVLHVIDAGDEEWWQARRVGFIPSKRRVERREWSRLV --------------------2222------------------------------------ LSYETVTQMEVHYARPIIILGPTKDRANDDLLSEFPDKFGSCVPHTTRPKREYEIDGRDY --------------------2222----------1111------------11112222-- HFVSSREKMEKDIQAHKFIEAGQYNSHLYGTSVQSVREVAEQGKHCILDVSANAVRRLQA ----3333--------------------------------------------------11 AHLHPIAIFIRPRSLENVLEINKRITEEQARKAFDRATKLEQEFTECFSAIVEGDSFEEI 11-----------------------------------------3333------------- YHKVKRVIEDLSGPYIWVPARERL ------------------------ >TYROSYL-DNA PHOSPHODIESTE; SWP:Q9NUW8; PDB:1JY1A; LEDPGEGQDIWDLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILSPLFGTLVSSAQFNYC --1111--1111-2222------------33332222-3333--3333------------ FDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLCQAKLDIAFGTHHTK ------11113333----------------------3333-----------2222----- LLLYEEGLRVVIHTSNLIHADWHQKTQGIWLSPLYPRIADGTHKSGESPTHFKANLISYL 
---1111----------3333-----------------2222-----1111--------- TAYNAPSLKEWIDVIHKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHAAESW 33333333------1111-1111------------!!!!--------------------- PVVGQFSSVGSLGADESKWLCSEFKESLTLGSVPLYLIYPSVENVRTSLEGYPAGGSLPY --------------1111-----------------------------11113333----- SIQTAEKQNWLHSYFHKWSAETSGRSNAPHIKTYRPSPDFSKIAWFLVTSANLSKAAWGA 333311113333-------3333-1111--------1111-------------3333--- LEKNGTQLIRSYELGVLFLPSALGLDSFKVKATFPVPYDLPPELYGSKDRPWIWNIPYVK --%%%%------------3333-----------------------1111----------- APDTHGNWVPS --1111----- >Fibrinogen alpha chain [P; SWP:P02672; PDB:1JY2N; GWPFCSDEDWNTKCPSGCRMKGLIDEVDQDFTSRINKLRDSLF -----3333---------------------------------- >Fibrinogen beta chain [Pr; SWP:P02676; PDB:1JY2O; RKPPDADGCLHADPDLGVLCPTGCKLQDTLVRQERPIRKSIEDLRNTVDSV ------------3333----------------------------------- >Fibrinogen gamma-B chain ; SWP:P12799; PDB:1JY2P; RDNCCILDERFGSYCPTTCGIADFLNNYQTSVDKDLRTLEGILY -------3333--------------------------------- -------------------------------- >CALSEPRRP; SWP:Q9M7C7; PDB:1JY5A; HKEFDYFTLALTWSGTECLSCPTNACSRSEVETGFTIKGLWPDYDDGTWPSCCEGAKYDQ ------------3333----33331111---------------1111-----------33 NEISILSNDLSKYWPSYSCPSSSACGSFDASDLAYEWAKHGTCSSPVLGNQYEYFSTTLM 331111-----------------iiii------------3333----------------- LYFKYNISEILSESGYLPSNTAEYKVEGIMSAIQSALRVTPVVKCKSDAVEQVQICFDKT -----------1111---------3333-----------------------------111 LQLQECPSTASTCPSLVSLPIKN 1---------------------- >YOPE REGULATOR; SWP:P31491; PDB:1JYAA; YSFEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILFTLPSLDNNDEKET -----------------------------!!!!-------2222--------3333---- LLSHNIFSQDILKPILSWDEVGGHPVLWNRQPLNSLDNNSLYTQLELVQGAERLQ --1111---1111------3333--------1111-1111--------------- >PLASMA RETINOL-BINDING PR; SWP:P02753; PDB:1JYDA; ERDCRVSSFRVKENFDKARFSGTWYAMAKKDPEGLFLQDNIVAEFSVDETGQMSATAKGR ----3333-------3333----------------------------1111--------- VRLLNNWDVCADMVGTFTDTEDPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAVQYSC 
---1111--------------1111--------1111----------------------- RLLNLDGTCADSYSFVFSRDPNGLPPEAQKIVRQRQEELCLARQYRLIVHNGYC ---1111------------1111-------------11111111---------- >LACTOSE OPERON REPRESSOR; SWP:P03023; PDB:1JYEA; LLIGVATSSLALHAPSQIVAAILSRADQLGASVVVSMVERSGVEACKTAVHNLLAQRVSG --------3333--------------1111-----------------------1111--- LIINYPLDDQDAIAVEAACTNVPALFLDVSDQTPINSIIFSHEDGTRLGVEHLVALGHQQ -----------------------------1111--------------------1111--- IALLAGPLSSVSARLRLAGWHKYLTRNQIQPIAEREGDWSAMSGFQQTMQMLNEGIVPTA ------3333--------------1111-----------------------1111----- MLVANDQMALGAMRAITESGLRVGADISVVGYDDTEDSSCYIPPLTTIKQDFRLLGQTSV ----------------1111--------------3333---------------------- DRLLQLSQGQAVKGNQLLPVSLVKRKTTLAP -----1111---------------------- >DNA GYRASE INHIBITORY PRO; SWP:P33012; PDB:1JYHA; MNYEIKQEEKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAVYYDNPDETPA --------------------3333-----------------------------1111-33 EKLRCDTVVTVPGYFTLPENSEGVILTEITGGQYAVAVARVVGDDFAKPWYQFFNSLLQD 33---------1111-----2222--------------------3333--------1111 SAYEMLPKPCFEVYLNNGAEDGYWDIEMYVAVQPK ----------------3333--------------- >CTP:PHOSPHOCHOLINE CYTIDY; SWP:NA; PDB:1JYKA; EIRVKAIILAAGLGTRLRPLTENTPKALVQVNQKPLIEYQIEFLKEKGINDIIIIVGYLK -------------3333-1111--3333--%%%%3333------1111------------ EQFDYLKEKYGVRLVFNDKYADYNNFYSLYLVKEELANSYVIDADNYLFKNFRNDLTRST ----3333--------1111----3333---1111------------------------- YFSVYREDCTNEWFLVYGDDYKVQDIIVDSKAGRILSGVSFWDAPTAEKIVSFIDKAYVS -----------------1111-----------------------------------1111 GEFVDLYWDNVKDNIKELDVYVEELEGNSIYEIDSVQDYRKLEEILK --1111333311111111-------1111------------------ >PEPTIDE DEFORMYLASE; SWP:Q8I372; PDB:1JYMA; DEIKIVKYPDPILRRRSEVTNFDDNLKRVVRKFDIYESKGIGLSAPQVNISKRIIVWNAL ---------3333----------------------1111----3333------------- YEKRKEENERIFINPSIVEQSLVKLKLIEGCLSFGIEGKVERPSIVSISYYDINGYKHLK ----2222-------------------------------------------1111----- ILKGIHSRIFQHEFDHLNGTLFIDKTQVDKKKVRPKLNELIRDYKATHSEEPLEHH 
---------------1111-1111-----3333----------1111--------- >SICP; SWP:P74873; PDB:1JYOA; LQAHQDIIANIGEKLGLPLTFDDNNQCLLLLDSDIFTSIEAKDDIWLLNGMIIPLSPVCG ---------------------1111----------------!!!!--------------- DSIWRQIMVINGELAANNEGTLAYIDAAETLLLIHAITDLTNTYHIISQLESFVNQQEAL --------------1111-------1111------------------------------- KNILQEYAKV ---3333--- >Effector protein sptP; SWP:P74873; PDB:1JYOE; DKAYVAPEKFSSKVLTWLGKMPLFKNTEVVQKHTENIRVQDQKILQTFLHALTEKYGETA ---------%%%%---------3333--------------3333---------------- VNDALLMSRINMNKPLTQRLAVQITECVKAADEGFINLIKSK --------------3333------------------------ >GROWTH FACTOR RECEPTOR-BO; SWP:P29354; PDB:1JYRA; GSMAWFFGKIPRAKAEEMLSKQRHDGAFLIRESESAPGDFSLSVKFGNDVQHFKVLRDGA --1111-----------3333--2222--------2222------%%%%--------111 GKYFLWVVKFNSLNELVDYHRSTSVSRNQQIFLRDI 1----------------------------------- >MTA/SAH NUCLEOSIDASE; SWP:P24247; PDB:1JYSA; MKIGIIGAMEEEVTLLRDKIENRQTISLGGCEIYTGQLNGTEVALLKSGIGKVAAALGAT ----------------1111-------iiii------iiii------------------- LLLEHCKPDVIINTGSAGGLAPTLKVGDIVVSDEARYHDADVTAFGYEYGQLPGCPAGFK --------------------11112222-------------3333--22222222----- ADDKLIAAAEACIAELNLNAVRGLIVSGDAFINGSVGLAKIRHNFPQAIAVEMEATAIAH -------------1111---------------------------1111------------ VCHNFNVPFVVVRAISDVADQSFDEFLAVAAKQSSLMVESLVQKLA --1111----------------3333-------------------- >BETA-GALACTOSIDASE; SWP:P00722; PDB:1JZ8A; RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLRSLNGEWRFAWFPAPEAVPES -11111111--------------------------1111-------------3333-333 WLECDLPEADTVVVPSNWQMHGYDAPIYTNVTYPITVNPPFVPTENPTGCYSLTFNVDES 3----1111-------3333-------------------------------------333 WLQEGQTRIIFDGVNSAFHLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWS 3--------------------iiii---------------1111---------------- DGSYLEDQDMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGELRD --1111---------------------------------1111---------------11 YLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKLWSAEIPNLYRAV 
11-------!!!!--------------1111------------------3333------- VELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLLIRGVNRHEHHPLHGQVMDEQTM ----1111-----------------iiii--iiii---------------!!!!------ VQDILLMKQNNFNAVRCSHYPNHPLWYTLCDRYGLYVVDEANIETHGMVPMNRLTDDPRW -------1111-----------3333------------------1111-1111---3333 LPAMSERVTRMVQRDRNHPSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGAD --------------1111--------------3333------------------------ TTATDIICPMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCQYAHAMGNSLGGFAKY 1111-------------------------33332222----------------2222--- WQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQFCMNGLVFADRT ------1111----------------1111-----2222------!!!!------1111- PHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNELLHWMVALDGKPLASGEVPLDV ---------1111--------------------------------iiii----------- APQGKQLIELPELPQPESAGQLWLTVRVVQPNATAWSEAGHISAWQQWRLAENLSVTLPA 2222---------------------------------2222------------------- ASHAIPHLTTSEMDFCIELGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDND ----------1111----!!!!----------------------------------3333 IGVSEATRIDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLFIS !!!!1111-1111--------1111--------------------------iiii----- RKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGLGPQENYPDRLTA ------1111----------1111-----------------------------1111333 ACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQWRGDFQFNISRYSQQQLMETSH 3-------3333---------------------!!!!-----------------1111-1 RHLLHAEEGTWLNIDGFHMGIGGDDSWSPSVSAEFQLSAGRYHYQLVWCQK 111----------------------------3333---------------- >NEUROTOXIN 2; SWP:P01493; PDB:1JZAA; KEGYLVNKSTGCKYGCLKLGENEGCDKECKAKNQGGSYGYCYAFACWCEGLPESTPTYPL ------------------------------3333-------%%%%------3333----- PNKSCS ------ >GALACTOSE-SPECIFIC LECTIN; SWP:P21963; PDB:1JZNA; NNCPLDWLPMNGLCYKIFNQLKTWEDAEMFCRKYKPGCHLASFHRYGESLEIAEYISDYH ---1111--iiii-------------------------------3333------------ KGQENVWIGLRDKKKDFSWEWTDRSCTDYLTWDKNQPDHYQNKEFCVELVSLTGYRLWND -----------1111-----------------2222---%%%%------3333------- QVCESKDAFLCQCKF 
-1111---------- >Hypothetical 27.5 kDa pro; SWP:P40165; PDB:1JZTA; LKVVSSKLAAEIDKELGPQIGFTLQQLELAGFSVAQAVCRQFPLRGKTETEKGKHVFVIA ----------------------3333---------------------3333--------- GPGNNGGDGLVCARHLKLFGYNPVVFYPKRSERTEFYKQLVHQLNFFKVPVLSQDEGNWL ---------------------------------3333-------1111------1111-1 EYLKPEKTLCIVDAIFGFSFKPPREPFKGIVEELCKVQNIIPIVSVDVPTGWDVDKGPIS 1111111---------1111--------------1111----------2222-------- QPSINPAVLVSLTVPKPCSSHIRENQTTHYVGGRFIPRDFANKFGFEPFGYESTDQILKL ---------------3333---3333---------------1111------!!!!----- >LIPOCALIN Q83; SWP:Q9I9P7; PDB:1JZUA; MTVPDRSEIAGKWYVVALASNTEFFLREKDKMKMAMARISFLGEDELKVSYAVPKPNGCR ----3333-------------11111111------------------------------- KWETTFKKTSDDGEVYYSEEAKKKVEVLDTDYKSYAVIYATRVKDGRTLHMMRLYSRSPE -------------------3333--------------------%%%%------------- VSPAATAIFRKLAGERNYTDEMVAMLPRQEECTVDEV ------------------3333--------------- >FOCAL ADHESION KINASE 1; SWP:Q05397; PDB:1K04A; EISPPPTANLDRSNDKVYENVTGLVKAVIEMSSKIQPAPPEEYVPMVKEVGLALRTLLAT --------------------------------------1111------------------ VDETIPLLPASTHREIEMAQKLLNSDLGELINKMKLAQQYVMTSLQQEYKKQMLTAAHAL ---3333-3333------------------------------------------------ AVDAKNLLDVIDQARLKMLGQT ---------------------- >FEZ-1 BETA-LACTAMASE; SWP:Q9K578; PDB:1K07A; YPMPNPFPPFRIAGNLYYVGTDDLASYLIVTPRGNILINSDLEANVPMIKASIKKLGFKF ------------!!!!--------------------------1111------------33 SDTKILLISHAHFDHAAGSELIKQQTKAKYMVMDEDVSVILSGGKSDFHYANDSSTYFTQ 33---------3333-----------------3333-----iiii-------3333---- STVDKVLHDGERVELGGTVLTAHLTPGHTRGCTTWTMKLKDHGKQYQAVIIGSIGVNPGY -------2222---iiii----------------------iiii------------2222 KLVDNITYPKIAEDYKHSIKVLESMRCDIFLGSHAGMFDLKNKYVLLSKGQNNPFVDPTG -------1111----------------------3333---------------1111---- CKNYIEQKANDFYTELKKQETG ---------------------- >URE2 PROTEIN; SWP:P23202; PDB:1K0DA; QPLEGYTLFSHRSAPNGFKVAIVLSELGFHYNTIFLDFNLGEHRAPEFVSVNPNARVPAL ----------------------------------------111133333333-------- 
IDHGMDNLSIWESGAILLHLVNKYYKETGNPLLWSDDLADQSQINAWLFFQTSGHAPMIG ----%%%%----------------------2222-------------------------- QALHFRYFHSQKIASAVERYTDEVRRVYGVVEMALAERREALVMFDYPVWLVGDKLTIAD ------------3333-----------------------------------%%%%-3333 LAFVPWNNVVDRIGINIKIEFPEVYKWTKHMMRRPAVIKAL 333333333333---3333---------------------- >P-AMINOBENZOATE SYNTHASE ; SWP:P05041; PDB:1K0EA; TLSPAVITLLWRQDAAEFYFSRLSHLPWAMLLHSGYADHPYSRFDIVVAEPICTLTTFGK -------------3333-----1111-------%%%%--------------------!!! ETVVSESEKRTTTTDDPLQVLQQVLDRADIRPTHNEDLPFQGGALGLFGYDLGRRFESLP !---------------------------------1111----------11111111---- EIAEQDIVLPDMAVGIYDWALIVDHQRHTVSLLSHNDVNARRAWLESQQFSPQEDFTLTS ------------------------------------------------------------ DWQSNMTREQYGEKFRQVQEYLHSGDCYQVNLAQRFHATYSGDEWQAFLQLNQANRAPFS ------------------------------------------------------------ AFLRLEQGAILSLSPERFILCDNSEIQTRPIKGTLPRANSAKDRAENLMIVDLMRNDIGR ----1111-------------iiii--------------3333----------------- VAVAGSVKVPELFVVEPFPAVHHLVSTITAQLPEQLHASDLLRAAFPGGSITGAPKVRAM --------------------------------11113333-------3333--------- EIIDELEPQRRNAWCGSIGYLSFCGNMDTSITIRTLTAINGQIFCSAGGGIVADSQEEAE ----------!!!!-------1111-------------iiii---------1111----- YQETFDKVNRILKQLEK -------3333------ >GPFII; SWP:P03714; PDB:1K0HA; MADFDNLFDAAIARADETIRGYMGTSATITSGEQSGAVIRGVFDDPENISYAGQGVRVEG --------------------------------------------3333------------ SSPSLFVRTDEVRQLRRGDTLTIGEENFWVDRVSPDDGGSCHLWLGRGVPPAVNRRR ----------3333--------!!!!------------------------------- >P-HYDROXYBENZOATE HYDROXY; SWP:P20586; PDB:1K0IA; MKTQVAIIGAGPSGLLLGQLLHKAGIDNVILERQTPDYVLGRIRAGVLEQGMVDLLREAG ---------------------1111----------------------------------- VDRRMARDGLVHEGVEIAFAGQRRRIDLKRLSGGKTVTVYGQTEVTRDLMEAREACGATT ------------------%%%%-------------------------------1111--- VYQAAEVRLHDLQGERPYVTFERDGERLRLDCDYIAGCDGFHGISRQSIPAERLKVFERV ----------1111--------%%%%-------------11113333--3333------- 
YPFGWLGLLADTPPVSHELIYANHPRGFALCSQRSATRSQYYVQVPLSEKVEDWSDERFW -----------------------3333------------------11113333------- TELKARLPSEVAEKLVTGPSLEKSIAPLRSFVVEPMQHGRLFLAGDAAHIVPPTGAKGLN ---111133331111----------------------!!!!---3333----1111---- LAASDVSTLYRLLLKAYREGRGELLERYSAICLRRIWKAERFSWWMTSVLHRFPDTDAFS ------------------------------------------------------------ QRIQQTELEYYLGSEAGLATIAENYVGLPYEEIE ---------------------------------- >CHLORIDE INTRACELLULAR CH; SWP:O00299; PDB:1K0MA; PQVELFVKAGSDGAKIGNCPFSQRLFMVLWLKGVTFNVTTVDTKRRTETVQKLCPGGELP ---------1111----------------3333--------1111--------3333--- FLLYGTEVHTDTNKIEEFLEAVLCPPRYPKLAALNPESNTAGLDIFAKFSAYIKNSNPAL ---!!!!---------------------------3333-2222--------1111-3333 NDNLEKGLLKALKVLDNYLTSPLPEGVDETSAEDEGVSQRKFLDGNELTLADCNLLPKLH -----------------1111----11113333--------1111--------------- IVQVVCKKYRGFTIPEAFRGVHRYLSNAYAREEFASTCPDDEEIELAYEQVAKAL --------------3333-------------3333----------1111------ >DNA POLYMERASE ALPHA CATA; SWP:P09884; PDB:1K0PA; ICEEPTCRNRTRHLPLQFSRTGPLCPACMKA --1111------------------------- >CHEMOTAXIS PROTEIN CHEW; SWP:Q56311; PDB:1K0SA; MKTLADALKEFEVLSFEIDEQALAFDVDNIEMVIEKSDITPVPKSRHFVEGVINLRGRII -------------------------3333-----------1111----------%%%%-- PVVNLAKILGISFDEQKMKSIIVARTKDVEVGFLVDRVLGVLRITENQLDLTNVSDKFGK ---3333------------------%%%%------------------------------- KSKGLVKTDGRLIIYLDIDKIIEEITVKEGV -------%%%%-------------------- >COPZ; SWP:O32221; PDB:1K0VA; MEQKTLQVEGMSCQHCVKAVETSVGELDGVSAVHVNLEAGKVDVSFDADKVSVKDIADAI ------------3333--------------------3333------1111-3333----- EDQGYDVAKIEGR ------------- >L-RIBULOSE 5 PHOSPHATE 4-; SWP:P08203; PDB:1K0WA; MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSIMTADDMVVVS -3333-----------1111-------------1111---------1111-3333----- IETGEVVEGAKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHAN -------------1111---------1111------------------------3333-- YFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNA 
--------------------------------------1111-----2222--------- EDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLNKHYLRKH -----------------------1111---------------- >LECTIN; SWP:Q7SIC1; PDB:1K12A; VIPEGYTQENVAVRGKATQSAQLRGEHAANSEASNAIDGNRDSNFYHGSCTHSSGQANPW --2222----3333----------1111---3333--------1111------------- WRVDLLQVYTITSVTITNRGDCCGERISGAEINIGQHLASNGVNNPECSVIGSMATGETK ----------------------11112222---------iiii-----------2222-- TFHCPAPMIGRYVVTYLPTSESLHLCEVEVNVDKPAAA -------------------------------------- >B-CELL LYMPHOMA 3-ENCODED; SWP:P20749; PDB:1K1AA; EDGDTPLHIAVVQGNLPAVHRLVNLFQQGGRELDIYNNLRQTPLHLAVITTLPSVVRLLV 2222----------------------1111------1111-------------------1 TAGASPMALDRHGQTAAHLACEHRSPTCLRALLDSAAPGTLDLEARNYDGLTALHVAVNT 111-1111-1111-------1111------------2222-1111-3333---------- ECQETVQLLLERGADIDAVDIKSGRSPLIHAVENNSLSMVQLLLQHGANVNAQMYSGSSA ---------1111-1111-3333--------------------1111-1111-1111--- LHSASGRGLLPLVRTLVRSGADSSLKNCHNDTPLMVARSRRVIDILRG ----------------1111------1111-3333------------- >D-HYDANTOINASE; SWP:Q45515; PDB:1K1DA; MTKIIKNGTIVTATDTYEAHLLIKDGKIAMIGQNLEEKGAEVIDAKGCYVFPGGIDPHTH ------------------------------------------------------------ LDMPLGGTVTKDDFESGTIAAAFGGTTTIIDFCLTNKGEPLKKAIETWHNKANGKAVIDY ---------------------------------------3333--------2222----- GFHLMISEITDDVLEELPKVLEEEGITSLVFMAYKNVFQADDGTLYCTLLAAKELGALVM ------------------------------------------------------------ VHAENGDVIDYLTKKALADGNTDPIYHALTRPPELEGEATGRACQLTELAGSQLYVVHVT ----------------1111--3333-1111----------------------------- CAQAVEKIAEARNKGLDVWGETCPQYLVLDQSYLEKPNFEGAKYVWSPPLREKWHQEVLW ----------------------3333---3333------------------3333----- NALKNGQLQTLGSDQCSFDFKGQKELGRGDFTKIPNGGPIIEDRVSILFSEGVKKGRITL -----------------------------3333-------------------1111---- NQFVDIVSTRIAKLFGLFPKKGTIVVGSDADLVIFDPNIERVISAETHHMAVDYNAFEGM -------------------------------------------3333---------2222 KVTGEPVSVLCRGEFVVRDKQFVGKPGYGQYLKRAKYGT ----------iiii---%%%%------------------ 
>deoxy-D-mannose-octuloson; SWP:P45314; PDB:1K1EA; KLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLGIKMLMDADIQVAVLSGRDSP 3333-------2222--------1111-----------------1111------------ ILRRRIADLGIKLFFLGKLEKETACFDLMKQAGVTAEQTAYIGDDSVDLPAFAACGTSFA ----------------------------------3333------3333------------ VADAPIYVKNAVDHVLSTHGGKGAFREMSDMILQAQGKSSVFDTAQGFLKSVKSMGQ 111133331111------2222-----------1111-3333--------3333--- >BREAKPOINT CLUSTER REGION; SWP:P11274; PDB:1K1FA; VDPVGFAEAWKAQFPDSEPPRELRSVGDIEQELERAKASIRRLEQEVNQERFRIYLQTLL -------------1111-------3333-------------------------------- AKEK ---- >SF1-BO ISOFORM; SWP:Q15637; PDB:1K1GA; TRVSDKVMIPQDEYPEINFVGLLIGPRGNTLKNIEKECNAKIMIRGKGSVKEGKVGRKDG ---------3333----------------------------------------------- QMLPGEDEPLHALVTANTMENVKKAVEQIRNILKQGIETPEDQNDLRKMQLRELARLNGT --------------------------------3333---333311113333--------- LR -- >MAFG; SWP:O54790; PDB:1K1VA; LTDEELVTMSVRELNQHLRGLSKEEIIQLKQRRRTLKNRGY ---------3333-------------11113333------- >4-ALPHA-GLUCANOTRANSFERAS; SWP:O32462; PDB:1K1WA; INFIFGIHNHQPLGNFGWVFEEAYNRSYRPFEILEEFPEKVNVHFSGPLLEWIEENKPDY -----------2222----------------3333---------------------3333 LDLLRSLIKRGQLEIVVAGFYEPVLAAIPKEDRLVQIELKDYARKLGYDAKGVWLTERVW ------------------1111-1111-3333-----------1111-------2222-- QPELVKSLREAGIEYVVVDDYHFSAGLSKEELFWPYYTEDGGEVITVFPIDEKLRYLIPF ------------------3333-----3333---------------------1111---- RPVKKTIEYLESLSKVAVFHDDGEKFGVWPGTYWLREFFDAITEKINLTYSEYLSKFTPR --3333---------------3333------------------------3333------- GLVYLPIASYFESEWSLPAKQAKLFVEFVEQLKEEGKFEKYRVFVRGGIWKNFFFKYPES -------------3333--------------------11111111---3333----3333 NFHKRLVSKAVRDNPEARKYILKAQCNDAYWHGVFGGIYLPHLRRTVWENIIKAQRYLKP -------3333----------1111----------!!!!---------------1111-- ENKILDVDFDGRAEIVENDGFIATIKPHYGGSIFELSSKRKAVNYNDVLPRRWEHYHEVP -----------------1111----3333---------1111-1111------1111--- EAHELGKQIPEEIRRELAYDWQLRAILQDHFIKPEETLDNYRLVKYHELGDFVNQPYEYE 
---------33331111---------------11113333-------------------- IENGVKLWREGGVYAEEKIPARVEKKIELTEDGFIAKYRVLLEKPYKALFGVEINLAVHS ------------------------------------------------------------ VEKPEEFEAKEFEVNDPYGIGKVRIELDKAAKVWKFPIKTLSQSEAGWDFIQQGVSYTLF ---------------2222----------------------------------------- PIEKELEFTVRFREL --------------- >4-ALPHA-GLUCANOTRANSFERAS; SWP:O32462; PDB:1K1XA; MERINFIFGIHNHQPLGNFGWVFEEAYNRSYRPFMEILEEFPEMKVNVHFSGPLLEWIEE --------------22223333------------------1111---------------- NKPDYLDLLRSLIKRGQLEIVVAGFYEPVLAAIPKEDRLVQIEMLKDYARKLGYDAKGVW -3333-------1111-------1111-3333-3333------------1111------- LTERVWQPELVKSLREAGIEYVVVDDYHFMSAGLSKEELFWPYYTEDGGEVITVFPIDEK 2222--3333----1111------3333------3333--------iiii---------- LRYLIPFRPVKKTIEYLESLTSDDPSKVAVFHDDGEKFGVWPGTYEWVYEKGWLREFFDA --------3333-----1111--1111-------------2222----1111-------- ITSNEKINLMTYSEYLSKFTPRGLVYLPIASYFEMSEWSLPAKQAKLFVEFVEQLKEEGK ---1111------------------------3333-1111---------------1111- FEKYRVFVRGGIWKNFFFKYPESNFMHKRMLMVSKAVRDNPEARKYILKAQCNDAYWHGV 11111111---3333---------------------11113333----11111111---- FGGIYLPHLRRTVWENIIKAQRYLKPENKILDVDFDGRAEIMVENDGFIATIKPHYGGSI -!!!!3333-----------1111--------------------1111----3333---- FELSSKRKAVNYNDVLPRRWEHYHEQIPEEIRRELAYDWQLRAILQDHFIKPEETLDNYR ----------1111------1111---33331111---------------1111------ LVKYHELGDFVNQPYEYEMIENGVKLWREGGVYAEEKIPARVEKKIELTEDGFIAKYRVL -------------------2222------------------------------------- LEKPYKALFGVEINLAVHSVMEKPEEFEAKEFEVNDPYGIGKVRIELDKAAKVWKFPIKT ------------------------------------------------------------ LSQSEAGWDFIQQGVSYTMLFPIEKELEFTVRFREL ------------------------------------ >MANGANESE-DEPENDENT INORG; SWP:P95765; PDB:1K20A; SKILVFGHQNPDSDAIGSSYAFAYLAREAYGLDTEAVALGEPNEETAFVLDYFGVAAPRV ------------------------------------------------------------ ITSAKAEGAEQVILTDHNEFQQSVADIAEVEVYGVVDHHRVANFETANPLYMRLEPVGSA --3333------------1111---1111-----------------------------33 
SSIVYRMFKEHSVAVSKEIAGLMLSGLISDTLLLKSPTTHPTDKAIAPELAELAGVNLEE 33------1111-----------------------11113333-------------3333 YGLAMLKAGTNLASKSAEELIDIDAKTFELNGNNVRVAQVNTVDIAEVLERQAEIEAAIE ------1111-11113333----------iiii--------------------------- KAIADNGYSDFVLMITDIINSNSEILAIGSNMDKVEAAFNFVLENNHAFLAGAVSRKKQV -------------------------------------------iiii-------3333-- VPQLTESFNA ------1111 >LOW-AFFINITY PENICILLIN-B; SWP:NA; PDB:1K25A; QITRTVPAKRGTIYDRNGVPIAEDATSYNVYAVTTSPNRSYPNGQFASSFIGLAQLHENE ------------------------------------------------------------ DGSKSLLGTSGMESSLNSILAGTDGIITYGNIVPGTELVSQQTVDGKDVYTTLSSPLQSF ----------3333----------------------------------------3333-- METQMDAFLEKVKGKYMTATLVSAKTGEILATTQRPTFNADTKEGITEDFVWRDILYQSN -----------------------1111--------------------------3333--- YEPGSAMKVMTLASSIDNNTFPSGEYFNSSELSSNVGMSLLEQKMGDATWLDYLKRFKFG --------------1111------11113333---3333--------------------- VPTRFGLTDEYAGQLPADNIVSIAQSSFGQGISVTQTQMLRAFTAIANDGVMLEPKFISA -----------------------------------------------%%%%--------- IYDTNNQSVRKSQKEIVGNPVSKEAASTTRNHMILVGTDPLIITVPGQNVAVKSGTAQIA ---------------------3333-------------------2222------------ DEKNGGYLVGSTNYIFSAVTMNPAENPDFILYVTVQQPEHYSGIQLGEFATPILERASAM -----------------------------------------3333--------------- KESLNLQSPAKNLDKVTTESSYAMPSIKDISPGELAEALRRNIVQPIVVGTGTKIKETSV -1111----3333------------------------3333------------------- EEGTNLAPNQQVLLLSDKVEEIPDMYGWKKETAETFAKWLDIELEFEGSGSVVQKQDVRT ------------------------2222------------------------------22 NTAIKNIKKIKLTLGD 22-------------- >Baseplate structural prot; SWP:P17172; PDB:1K28D; LQRPGYPNLSVKLFDSYDAWSNNRFVELAATITTLTRDSLYGRNEGLQFYDSKNIHTKDG --2222-------------1111----3333-------------------33331111-- NEIIQISVANANDINNVKTRIYGCKHFSVSVDSKGDNIIAIELGTIHSIENLKFGRPFFP ---------------------------------------------3333----------- DAGESIKELGVIYQDRTLLTPAINAINAYVPDIPWTSTFENYLSYVREVALAVGSDKFVF ---------------3333------------------3333------------------- 
VWQDIGVNDYDINQEPYPIVGEPSKYPLAYDFVWLTKSNPHKRDPKNATIYAHSFLDSSI --------------------------------------3333-----------3333--- PITTGKGENSIVVSRSGAYSETYRNGYEEAIRLQTAQYDGYAKCSTIGNFNLTPGVKIIF ---------------!!!!----------------------------------------- NDSKNQFKTEFYVDEVIHELSNNNSVTHLYFTNATKLETIDPVKVKNEF ------------------------------------------------- >NUDIX HOMOLOG; SWP:Q8ZTD8; PDB:1K2EA; MIVTSGVLVENGKVLLVKHKRLGVYIYPGGHVEHNETPIEAVKREFEEETGIVVEPIGFT ---------%%%%----------------------------------------------- YGIIDENAVERPMPLVILEEVVKYPEETHIHFDLIYLVKRVGGDLKNGEWIDVREIDRIE ----1111---------------3333------------------------1111----- TFPNVRKVVSLALSTLYRLGKISKLAAALEHH -2222--------------------------- >SIAH-1A PROTEIN; SWP:P61092; PDB:1K2FA; SVLFPCKYASSGCEITLPHTEKAEHEELCEFRPYSCPCPGASCKWQGSLDAVMPHLMHQH -------3333------3333--------------------------1111--------- KSITTLQGEDIVFLATDINLPGAVDWVMMQSCFGFHFMLVLEKQEKYDGHQQFFAIVQLI ----------------1111-----------iiii----------1111----------- GTRKQAENFAYRLELNGHRRRLTWEATPRSIHEGIATAIMNSDCLVFDTSIAQLFAENGN -33331111--------------------3333-----1111--------------iiii LGINVTISMC ---------- >S-100 PROTEIN, ALPHA CHAI; SWP:P35467; PDB:1K2HA; GSELETAMETLINVFHAHSGKEGDKYKLSKKELKDLLQTELSSFLDVQKDADAVDKIMKE -3333--------3333-----------------------3333-----3333------- LDENGDGEVDFQEFVVLVAALTVACNNFFWENS ---1111--3333-------------1111--- >TYROSINE-PROTEIN KINASE B; SWP:Q06187; PDB:1K2PA; IDPKDLTFLKELGTGQFGVVKYGKWRGQYDVAIKMIKEGSMSEDEFIEEAKVMMNLSHEK --------------1111-----------------------3333-------3333-111 LVQLYGVCTKQRPIFIITEYMANGCLLNYLREMRHRFQTQQLLEMCKDVCEAMEYLESKQ 1-------------------1111-------1111--3333------------------- FLHRDLAARNCLVNDQGVVKVSDFGLSRYVLDDEYTSSVGSKFPVRWSPPEVLMYSKFSS ------3333---1111--------------------------3333-3333------33 KSDIWAFGVLMWEIYSLGKMPYERFTNSETAEHIAQGLRLYRPHLASEKVYTIMYSCWHE 33-------------------11113333----1111-----11113333---3333--- KADERPTFKILLSNILDV 1111-------------- >SORBITOL DEHYDROGENASE; SWP:Q59787; PDB:1K2WA; 
MRLDGKTALITGSARGIGRAFAEAYVREGARVAIADINLEAARATAAEIGPAACAIALDV -------------------------------------------------1111-----11 TDQASIDRCVAELLDRWGSIDILVNNAALFDLAPIVEITRESYDRLFAINVSGTLFMMQA 11-------------------------------1111-3333------------------ VARAMIAGGRGGKIINMASQAGRRGEALVGVYCATKAAVISLTQSAGLNLIRHGINVNAI ------------------3333---1111--------------------3333------- APGVVDGEHWDGVDAKFADYENLPRGEKKRQVGAAVPFGRMGRAEDLTGMAIFLATPEAD ------3333-------------22223333----3333---3333-----33331111- YIVAQTYNVDGGNWMS ---------iiii--- >PUTATIVE L-ASPARAGINASE; SWP:P37595; PDB:1K2XA; GKAVIAIHGGAGAISRAQMSLQQELRYIEALSAIVETGQKMLEAGESALDVVTEAVRLLE --------------3333-----------------------1111--------------- EPLFNAGIGAVFTRDETHELDACVMDGNTLKAGAVAGVSHLRNPVLAARLVMEQSPHVMM ------2222--1111--------------------------3333-------------- IGEGAENFAFARGMERVSPEIFSTSLRYEQLLAAR ---------1111------1111------------ >Putative L-asparaginase [; SWP:P37595; PDB:1K2XB; TVGAVALDLDGNLAAATSTGGMTNKLPGRVGDSPLVGAGCYANNASVAVSCTGTGEVFIR -------1111----------22222222--1111-------3333------------11 ALAAYDIAALMDYGGLSLAEACERVVMEKLPALGGSGGLIAIDHEGNVALPFNTEGMYRA 11----------------------------1111--------1111-------------- WGYAGDTPTTGIYRE --2222--------- >TRICORN PROTEASE; SWP:P96086; PDB:1K32A; MPNLLLNPDIHGDRIIFVCCDDLWEHDLKSGSTRKIVSNLGVINNARFFPDGRKIAIRVM ----------!!!!----iiii--------------------------1111-------- RGSSLNTADLYFYNGENGEIKRITYFSGKSTGRRMFTDVAGFDPDGNLIISTDAMQPFSS -1111-------------------------1111--------1111-----------111 MTCLYRVENDGINFVPLNLGPATHILFADGRRVIGRNTFELPHWKGYRGGTRGKIWIEVN 1------%%%%----------------iiii---------1111----1111-------2 SGAFKKIVDMSTHVSSPVIVGHRIYFITDIDGFGQIYSTDLDGKDLRKHTSFTDYYPRHL 222-------------------------1111---------------------------- NTDGRRILFSKGGSIYIFNPDTEKIEKIEIGDLESPEDRIISIPSKFAEDFSPLDGDLIA ----------iiii----------------------------3333--------%%%%-- FVSRGQAFIQDVSGTYVLKVPEPLRIRYVRRGGDTKVAFIHGTREGDFLGIYDYRTGKAE ----------1111----------------------------1111-------------- 
KFEENLGNVFAMGVDRNGKFAVVANDRFEIMTVDLETGKPTVIERSREAMITDFTISDNS --------------1111--------------------------------------1111 RFIAYGFPLKHGETDGYVMQAIHVYDMEGRKIFAATTENSHDYAPAFDADSKNLYYLSYR -----------1111--------------------------------1111--------- SLDPSPDRVVLNFSFEVVSKPFVIPLIPGSPNPTKLVPRSMTSEAGEYDLNDMYKRSSPI --------------------------2222-1111--3333----------3333----- NVDPGDYRMIIPLESSILIYSVPVHGEFAAYYQGAPEKGVLLKYDVKTRKVTEVKNNLTD --------------------------3333---------------1111----------- LRLSADRKTVMVRKDDGKIYTFPLEKPEDERTVETDKRPLVSSIHEEFLQMYDEAWKLAR ---1111------1111-----3333---------------------------------1 DNYWNEAVAKEISERIYEKYRNLVPLCKTRYDLSNVIVEMQGEYRTSHSYEMGGTFTDKD 111-3333--------------3333--------------3333---------------- PFRSGRIACDFKLDGDHYVVAKAYAGDYSNEGEKSPIFEYGIDPTGYLIEDIDGETVGAG -------------!!!!---------1111----3333-----2222----iiii--111 SNIYRVLSEKAGTSARIRLSGKGGDKRDLMIDILDDDRFIRYRSWVEANRRYVHERSKGT 1------1111---------------------------------------------%%%% IGYIHIPDMGMMGLNEFYRLFINESSYQGLIVDVRFNGGGFVSQLIIEKLMNKRIGYDNP -------------------3333----------2222----3333---3333-------- RRGTLSPYPTNSVRGKIIAITNEYAGSDGDIFSFSFKKLGLGKLIGTRTWGGVVGITPKR ---------------------1111!!!!-------1111-------------------- RLIDGTVLTQPEFAFWFRDAGFGVENYGVDPDVEIEYAPHDYLSGKDPQIDYAIDALIEE -1111--------------!!!!2222----------3333------------------- LRN --- ---------------------------------------------- >BETA-LACTAMASE OXA-2; SWP:P05191; PDB:1K38A; TLERSDWRKFFSEFQAKGTIVVADERQADRAMLVFDPVRSKKRYSPASTFIPHTLFALDA -----------1111--------------------3333------!!!!----------- GAVRDEFQIFRWDGVNRGFAGHNQDQDLRSAMRNSTVWVYELFAKEIGDDKARRYLKKID ----1111----------3333----------------------------------1111 YGNADPSTGDYWIEGSLAISAQEQIAFLRKLYRNELPFRVEHQRLVKDLMIVEAGRNWIL !!!!-----1111----------------------------------1111--------- RAKTGWEGRMGWWVGWVEWPTGSVFFALNIDTPNRMDDLFKREAIVRAILRSIEALPP ------------------1111---------11111111-----------1111---- >CEST; SWP:P58233; PDB:1K3EA; 
MSSRSELLLEKFAEKIGIGSISFNENRLCSFAIDEIYYISLSDANDEYMMIYGVCGKFPT -----------------------1111--------------------------------- DNSNFALEILNANLWFAENGGPYLCYEAGAQSLLLALRFPLDDATPEKLENEIEVVVKSM ----------------------------------------2222---------------- ENLYLVLHNQGITLKIEEISS --------------------- >GALACTOSE OXIDASE PRECURS; SWP:Q01745; PDB:1K3IA; ASAPIGSAISRNNWAVTCDSAQSGNECNKAIDGNKDTFWHTFYGANGDPKPPHTYTIDMK ----------1111-------22221111----1111-----!!!!-------------- TTQNVNGLSMLPRQDGNQNGWIGRHEVYLSSDGTNWGSPVASGSWFADSTTKYSNFETRP -----------------2222--------------------------------------- ARYVRLVAITEANGQPWTSIAEINVFQASSYTAPQPGLGRWGPTIDLPIVPAAAAIEPTS ----------1111--------------------2222---------------------- GRVLMWSSYRNDAFGGSPGGITLTSSWDPSTGIVSDRTVTVTKHDMFPGISMDGNGQIVV ----------3333--------------------------------------1111---- TGGNDAKKTSLYDSSSDSWIPGPDMQVARGYQSSATMSDGRVFTIGGSWSGGVFEKNGEV ----1111-----1111-------------------1111-------------------- YSPSSKTWTSLPNAKVNPMLTADKQGLYRSDNHAWLFGWKKGSVFQAGPSTAMNWYYTSG ----------1111-3333---1111--1111-------iiii---------------!! 
SGDVKSAGKRQSNRGVAPDAMCGNAVMYDAVKGKILTFGGSPDYQDSDATTNAHIITLGE !!---------1111----2222------1111--------------------------2 PGTSPNTVFASNGLYFARTFHTSVVLPDGSTFITGGQRRGIPFEDSTPVFTPEIYVPEQD 222----------------------1111-----------2222-----------3333- TFYKQNPNSIVRVYHSISLLLPDGRVFNGGGGLCDCTTNHFDAQIFTPNYLYNSNGNLAT ------------2222----1111---------!!!-----------3333-1111---- RPKITRTSTQSVKVGGRITISTDSSISKASLIRYGTATHTVNTDQRRIPLTLTNNGGNSY ------------2222---------------------iiii--------------%%%%- SFQVPSDSGVALPGYWMLFVMNSAGVPSVASTIRVTQ ------3333-----------1111------------ >FUNCTIONAL ANTI-APOPTOTIC; SWP:P90504; PDB:1K3KA; MDEDVLPGEVLAIEGIFMACGLNEPEYLYHPLLSPIKLYITGLMRDKESLFEAMLANVRF --------3333-------3333------------3333-----------3333------ HSTTGIDQLGLSMLQVSGDGNMNWGRALAILTFGSFVAQKLSNEPHLRDFALAVLPAYAY -3333------3333-------3333--------------11113333-3333------- EAIGPQWFRARGGWRGLKAYCTQVLT 3333---------3333----3333- >CONSERVED PROTEIN MT0001; SWP:O26109; PDB:1K3RA; MNRVDLSLFIPDSLTAETGDLKIKTYKVVLIARAASIFGVKRIVIYHDDADGEARFIRDI ------------1111-------------------1111--------------------- LTYMDTPQYLRRKVFPIMRELKHVGILPPLRTPHHPTGKPVTGEYRQGLTVKRVKKGTLV ------11111111------1111-------1111-----2222---------3333--- DIGADKLALCREKLTVNRIMSFRVVRLGKEILIEPDEPEDRYWGYEVLDTRRNLAESLKT ------------------------------------------------------------ VGADVVVATSRNASPITSILDEVKTRMRGAREAAILFGGPYKGLPEIDADIWVNTLPGQC ---------1111-3333--------3333-------------------------2222- TETVRTEEAVLATLSVFNMLTQ ---------------------- >SIGE; SWP:O30917; PDB:1K3SA; ESLLNRLYDALGLDEPLLIIDDGIQVYFNESDHTLECCPFPLPDDILTLQHFLRLNYTSA --------1111--------iiii------1111-------------------3333--- VTIGADADNTALVALYRLPQTSTEEEALTGFELFISNVKQLKEHYA -----1111---------1111------------------------ >GLYCERALDEHYDE-3-PHOSPHAT; SWP:P22513; PDB:1K3TA; MPIKVGINGFGRIGRMVFQALCEDGLLGTEIDVVAVVDMNTDAEYFAYQMRYDTVHGKFK -----------------------------------------3333--------------- YEVTTTKSSPSVAKDDTLVVNGHRILCVKAQRNPADLPWGKLGVEYVIESTGLFTAKAAA 
--------3333-------%%%%---------3333-3333--------------3333- EGHLRGGARKVVISAPASGGAKTLVMGVNHHEYNPSEHHVVSNASCTTNCLAPIVHVLVK ------------------------22221111-3333---------------------11 EGFGVQTGLMTTIHSYTATQKTVDGVSVKDWRGGRAAAVNIIPSTTGAAKAVGMVIPSTQ 11--------------3333------33333333-1111-------33333333-3333- GKLTGMSFRVPTPDVSVVDLTFTAARDTSIQEIDAALKRASKTYMKGILGYTDEELVSAD ------------------------------------------1111----------3333 FINDNRSSIYDSKATLQNNLPKERRFFKIVSWYDNEWGYSHRVVDLVRHMASKDRSARL 2222----------1111-2222------------------------------------ >CAPSID PROTEIN VP2; SWP:P18546; PDB:1K3VA; GVGVSTGTFNNQTEFQYLGEGLVRITAHASRLIHLNMPEHETYKRIHVLNSESGVAGQMV 1111---------------------------------------------3333----333 QDDAHTQMVTPWSLIDANAWGVWFNPADWQLISNNMTEINLVSFEQEIFNVVLKTITESA 3-----------------3333-------------------------------------- TSPPTKIYNNDLTASLMVALDTNNTLPYTPAAPRSETLGFYPWLPTKPTQYRYYLSCIRN ----------3333------1111------1111------1111---------------- LNPPTYTGQSQQITDSIQTGLHSDIMFYTIENAVPIHLLRTGDEFSTGIYHFDTKPLKLT --------------------3333----3333-------1111----------------- HSWQTNRSLGLPPKLLTEPTTEGDQHPGTLPAANTRKGYHQTINNSYTEATAIRPAQVGY ----1111---------------------------------------3333--------- NTPYMNFEYSNGGPFLTPIVPTADTQYNDDEPNGAIRFTMDYQHGHLTTSSQELERYTFN -----------------------------1111-------------3333---------- PQSKCGRAPKQQFNQQAPLNLENTNNGTLLPSDPIGGKSNMHFMNTLNTYGPLTALNNTA -----------------------3333-------iiii---3333-----1111------ PVFPNGQIWDKELDTDLKPRLHVTAPFVCKNNPPGQLFVKIAPNLTDDFNADSPQQPRII ------------------------------------------------------------ TYSNFWWKGTLTFTAKMRSSNMWNPIQQHTTTAENIGNYIPTNIGGIRMFPEYSQLIPRK -------------------------------11111111--1111--------------- LY -- >ENDONUCLEASE VIII; SWP:P50465; PDB:1K3XA; PEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKTYQSQLIGQHVTHVETRGKALLTHFSN ----------------2222--------111111111111---------!!!!----111 DLTLYSHNQLYGVWRVVDTGEEPQTTRVLRVKLQTADKTILLYSASDIEMLRPEQLTTHP 1------!!!!------2222------------------------------3333----- 
FLQRVGPDVLDPNLTPEVVKERLLSPRFRNRQFAGLLLDQAFLAGLGNYLRVEILWQVGL -------1111-------------3333---3333---3333------------------ TGNHKAKDLNAAQLDALAHALLEIPRFSYATRGALFRFKVFHRDGEPCERCGSIIEKTTL ----3333-------------------------------2222----------------% SSRPFYWCPGCQH %%%----1111-- >GLUTATHIONE S-TRANSFERASE; SWP:P08263; PDB:1K3YA; AEKPKLHYFNARGRMESTRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEID -----------!!!!-------1111-----------------1111-1111------ii GMKLVQTRAILNYIASKYNLYGKDIKERALIDMYIEGIADLGEMILLLPVCPPEEKDAKL ii-------------1111---------------------------3333-3333----- ALIKEKIKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPLL ---------------------------%%%%------------------11111111--- KALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEEARKIFRF ----------------------------------------- >XYLANASE; SWP:P96988; PDB:1K42A; MLVANINGGFESTPAGVVTDLAEGVEGWDLNVGSSVTNPPVFEVLETSDAPEGNKVLAVT -------------------3333-1111----3333------------------------ VNGVGNNPWDIEATAFPVNVRPGVTYTYTIWARAEQDGAVVSFTVGNQSFQEYGRLHEQQ ------------------------------------------------------------ ITTEWQPFTFEFTVSDQETVIRAPIHFGYAANVGNTIYIDGLAIASQP ----------------------------1111---------------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P71904; PDB:1K44A; TERTLVLIKPDGIERQLIGEIISRIERKGLTIAALQLRTVSAELASQHYAEHEGKPFFGS ------------1111---------1111-------------------3333--1111-- LLEFITSGPVVAAIVEGTRAIAAVRQLAGGTDPVQAAAPGTIRGDFALETQFNLVHGSDS --3333----------2222-----------------2222-------3333-------- AESAQREIALWFPGA --------------- >ANTIBODY FAB FRAGMENT HEA; SWP:P0A334; PDB:1K4CA; QVQLQQPGAELVKPGASVKLSCKASGYTFTSDWIHWVKQRPGHGLEWIGEIIPSYGRANY ------------2222------------1111-------2222----------------- NEKIQKKATLTADKSSSTAFMQLSSLTSEDSAVYYCARERGDGYFAVWGAGTTVTVSSAK 3333---------1111---------3333------------------------------ TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY -----------1111-----------------------%%%%-------------%%%%- TLSSSVTVPSSSWPSETVTCNVAHPASSTKVDKKIVPRD --------3333------------1111----------- >Voltage-gated potassium 
c; SWP:P0A334; PDB:1K4CB; DILLTQSPAILSVSPGERVSFSCRASQSIGTDIHWYQQRTNGSPRLLIKYASESISGIPS -------------2222---------------------2222------------222233 RFSGSGSGTDFTLSINSVESEDIANYYCQQSNRWPFTFGSGTKLEIKRADAAPTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRN -33331111--------3333----------- >3,4-DIHYDROXY-2-BUTANONE ; SWP:Q8TG90; PDB:1K4IA; FDAIPDVIQAFKNGEFVVVLDDPSRENEADLIIAAESVTTEQMAFMVRHSSGLICAPLTP --3333---------------1111----------------------------------- ERTTALDLPQMVTHNADPRGTAYTVSVDAEHPSTTTGISAHDRALACRMLAAPDAQPSHF ----------------1111----------1111-----------------11113333- RRPGHVFPLRAVAGGVRARRGHTEAGVELCRLAGKRPVAVISEIVDDGQEVEGRAVRAAP -----------11113333-------------------------------2222------ GMLRGDECVAFARRWGLKVCTIEDMIAHVEKTEGKL ------------1111-------------------- >NAMN ADENYLYLTRANSFERASE; SWP:P52085; PDB:1K4MA; MKSLQALFGGTFDPVHYGHLKPVETLANLIGLTRVTIIPNNVPPHRPQPEANSVQRKHML ---------------3333-----3333--------------1111-------------- ELAIADKPLFTLDERELKRNAPSYTAQTLKEWRQEQGPDVPLAFIIGQDSLLTFPTWYEY ------3333---3333-----------------------------3333--3333--33 ETILDNAHLIVCRRPGYPLEMAQPQYQQWLEDHLTHNPEDLHLQPAGKIYLAETPWFNIS 333333-------2222-------------------3333-------------------- ATIIRERLQNGESCEDLLPEPVLTYINQQGLYR -------1111--1111---------------- >PROTEIN EC4020; SWP:P52007; PDB:1K4NA; GHIANWQSIDELQDIASDLPRFIHALDELSRRLGLNITPLTADHISLRCHQNATAERWRR ----11113333------------------------1111-------------------- GFEQCGELLSENINGRPICLFKLHEPVQVAHWQFSIVELPWPGEKRYPHEGWEHIEIVLP ------------iiii------------!!!!---------------------------- GDPETLNARALALLSDEGLSLPGISVKTSRLPNPTLAVTDGKTTIKFHPWSIEEIVASEQ -3333-----1111---------------------------------------------- >MAJOR ENVELOPE PROTEIN E; SWP:P07720; PDB:1K4RA; SRCTHLENRDFVTGTQGTTRVTLVLELGGCVTITAEGKPSMDVWLDAIYQENPAKTREYC 
3333----------2222-------2222-----2222---------------------- LHAKLSDTKVAARCPTMGPATLAEEHQGGTVCKRDQSDRGWGNHCGLFGKGSIVACVKAA -------------1111----3333--------------3333----------------- CEAKKKATGHVYDANKIVYTVKVEPHTGDYVAANETHSGRKTASFTISSEKTILTMGEYG -2222-------1111---------------1111-1111-----1111------!!!!- DVSLLCRVASGVDLAQTVILELDKTVEHLPTAWQVHRDWFNDLALPWKHEGAQNWNNAER ------3333---1111-----1111-------------1111---------------11 LVEFGAPHAVKMDVYNLGDQTGVLLKALAGVPVAHIEGTKYHLKSGHVTCEVGLEKLKMK 11-----!!!!----------------2222-----!!!!-------------1111--- GLTYTMCDKTKFTWKRAPTDSGHDTVVMEVTFSGTKPCRIPVRAVAHGSPDVNVAMLITP 1111---1111----------------------------------2222----------- NPTIENNGGGFIEMQLPPGDNIIYVGELSHQWFQK ------------------------!!!!------- >DNA TOPOISOMERASE I; SWP:P11387; PDB:1K4TA; QKWKWWEEERYPEGIKWKFLEHKGPVFAPPYEPLPENVKFYYDGKVMKLSPKAEEVATFF ---3333---------------------------1111---iiii--------------- AKMLDHEYTTKEIFRKNFFKDWRKEMTNEEKNIITNLSKCDFTQMSQYFKAQTEARKQMS 1111------------------11113333------1111-------------------3 KEEKLKIKEENEKLLKEYGFCIMDNHKERIANFKIEPPGLFRGRGNHPKMGMLKRRIMPE 333-------------------------------------------1111-------333 DIIINCSKDAKVPSPPPGHKWKEVRHDNKVTWLVSWTENIQGSIKYIMLNPSSRIKGEKD 3-----1111-----2222------------------------------11113333--- WQKYETARRLKKCVDKIRNQYREDWKSKEMKVRQRAVALYFIDKLALRAGNEKEEGETAD --------3333-----------1111--------------------------2222--- TVGCCSLRVEHINLHPELDGQEYVVEFDFLGKDSIRYYNKVPVEKRVFKNLQLFMENKQP --1111-3333------iiii---------2222------------------------11 EDDLFDRLNTGILNKHLQDLMEGLTAKVFRTYNASITLQQQLKELTAPDENIPAKILSYN 11------------------22223333------------------1111---------- RANRAVAILCNHQRAPPKTFEKSMMNLQTKIDAKKEQLADARRDLKSAKADAKVMKDAKT -------------------------------------------------3333---3333 KKVVESKKKAVQRLEEQLMKLEVQATDREENKQIALGTSKLNLDPRITVAWCKKWGVPIE ----------------------------1111-------------------------333 KIYNKTQREKFAWAIDMADEDYEF 3-----------------1111-- >Neutrophil cytosol factor; SWP:P14598; PDB:1K4UP; 
SKPQPAVPPRPSADLILNRCSESTKRKLASAV -----------------------3333----- >Neutrophil cytosol factor; SWP:P19878; PDB:1K4US; QLKKGSQVEALFSYEATQPEDLEFQEGDIILVLSKVNEEWLEGESKGKVGIFPKVFVEDS ----------------------------------------------------3333---- AT -- >LIVER CARBOXYLESTERASE; SWP:P12337; PDB:1K4YA; PPVVDTVHGKVLGKFVSLEGFAQPVAVFLGVPFAKPPLGSLRFAPPQPAESWSHVKNTTS -----1111--------2222----------------!!!!------------------- YPPMCSQDAVSGHMLSELFTNRKENIPLKFSEDCLYLNIYTPADLTKRGRLPVMVWIHGG -------------------------------------------1111------------% GLMVGGASTYDGLALSAHENVVVVTIQYRLGIWGFFSTGDEHSRGNWGHLDQVAALRWVQ %%%---1111---------------------1111----3333----------------- DNIANFGGDPGSVTIFGESAGGQSVSILLLSPLTKNLFHRAISESGVALLSSLFRKNTKS -3333---1111-----!!!!---------1111------------1111---------- LAEKIAIEAGCKTTTSAVMVHCLRQKTEEELMEVTLKMKFMALDLVGDPKENTAFLTTVI ------1111------------1111---------------------------------- DGVLLPKAPAEILAEKKYNMLPYMVGINQQEFKLDQKTATELLWKSYPIVNVSKELTPVA ----------------------------------------------3333--3333---- TEKYLGGTDDPVKKKDLFLDMLADLLFGVPSVNVARHHRDAGAPTYMYEYRHGDEIFSVL --------------------------------------1111----------3333-111 GAPFLKEGATEEEIKLSKMVMKYWANFARNGNPNGEGLPQWPAYDYKEGYLQIGATTQAA 1---------------------------------2222---------------------- QKLKDKEVAFWTELWAKEAAR ------------1111----- >ADENYLYL CYCLASE-ASSOCIAT; SWP:P17555; PDB:1K4ZA; MPPRKELVGNKWFIENYENETESLVIDANKDESIFIGKCSQVLVQIKGKVNAISLSETES ----------------------------1111---------------------------- CSVVLDSSISGMDVIKSNKFGIQVNHSLPQISIDKSDGGNIYLSKESLNTEIYTSCSTAI -------1111--------------------------------3333------------- NVNLPIGEDDDYVEFPIPEQMKHSFADGKFKSAVFEH -------iiii--------------iiii-------- >BETA LACTAMASE OXA-10; SWP:P14489; PDB:1K55A; SITENTSWNKEFSAEAVNGVFVLCKSSSKSCATNDLARASKEYLPASTFIPNAIIGLETG ----333333331111------------------3333------!!!!------------ VIKNEHQVFKWDGKPRAMKQWERDLTLRGAIQVSAVPVFQQIAREVGEVRMQKYLKKFSY ---1111----------3333---------1111-------------------------! 
GNQNISGGIDKFWLEGQLRISAVNQVEFLESLYLNKLSASKENQLIVKEALVTEAAPEYL !!!----1111-------------------------------------1111-------- VHSKTGFSGVGTESNPGVAWWVGWVEKETEVYFFAFNMDIDNESKLPLRKSIPTKIMESE -----------3333-----------!!!!-----------3333------------111 GIIG 1--- >ENDOPOLYGALACTURONASE; SWP:P79074; PDB:1K5CA; CTVKSVDDAKDIAGCSAVTLNGFTVPAGNTLVLNPDKGATVTMAGDITFAKTTLDGPLFT ----3333---2222----------2222------2222--------------------- IDGTGINFVGADHIFDGNGALYWDGKGTNNGTHKPHPFLKIKGSGTYKKFEVLNSPAQAI ---------%%%%----3333----!!!!------------------------------- SVGPTDAHLTLDGITVDDFAGDTKNLGHNTDGFDVSANNVTIQNCIVKNQDDCIAINDGN ---------------------2222----------------------------------- NIRFENNQCSGGHGISIGSIATGKHVSNVVIKGNTVTRSMYGVRIKAQRTATSASVSGVT --------------------2222-----------------------1111--------- YDANTISGIAKYGVLISQSYPDDVGNPGTGAPFSDVNFTGGATTIKVNNAATRVTVECGN -----------------------------------------------1111--------- CSGNWNWSQLTVTGGKAGTIKSDKAKITGGQYL ------1111-----------!!!!-------- >Ran-specific GTPase-activ; SWP:P43487; PDB:1K5DB; NHDPQFEPIVSLPEQEIKTLEEDEEELFKMRAKLFRFASENDLPEWKERGTGDVKLLKHK --------------------1111----------------------------------33 EKGAIRLLMRRDKTLKICANHYITPMMELKPNAGSDRAWVWNTHADFADECPKPELLAIR 33--------------------------------------------1111---------- FLNAENAQKFKTKFEECRKEIEEREK -------------------------- >NUCLEOPLASMIN CORE; SWP:P05221; PDB:1K5JA; VSLIWGCELNEQNKTFEFKEHQLALRTVCLGDKAKDEFHIVEIVTQEEKSVPIATLKPSI ---------3333-----------------1111----------------------1111 LPMATMVGIELTPPVTFRLKAGSGPLYISGQHVA ---------------------------------- >TAT PROTEIN; SWP:P04613; PDB:1K5KA; MDPVDPNLEPWNHPGSQPRTPCNKCYCKKCCYHCQMCFITKGLGISYGRKKRRQRRRPPQ 1111-------------------------------------------------------- GNQAHQDPLPEQPSSQHRGDHPTGPKE 1111----------------------- >BETA-2-MICROGLOBULIN, LIG; SWP:Q29846; PDB:1K5NA; GSHSMRYFHTSVSRPGRGEPRFITVGYVDDTLFVRFDSDAASPREEPRAPWIEQEGPEYW -------------2222----------!!!!-----1111--------1111---3333- 
DRETQICKAKAQTDREDLRTLLRYYNQSEAGSHTLQNMYGCDVGPDGRLLRGYHQHAYDG -------------------------------------------1111----------iii KDYIALNEDLSSWTAADTAAQITQRKWEAARVAEQLRAYLEGECVEWLRRYLENGKETLQ i-----3333------3333-------------------------------------111 RADPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 1-------------1111--------------------iiii-3333------------- FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP ---------22221111-----1111---------- >BETA-2-MICROGLOBULIN, LIG; SWP:P01884; PDB:1K5NB; MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD ----------------2222---------------------iiii------------111 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM 1-----------------------3333--------1111 >MATING-TYPE PROTEIN ALPHA; SWP:P01367; PDB:1K61A; RGHRFTKENVRILESWFAKNIENPYLDTKGLENLMKNTSLSRIQIKNWVSNRRRKEKTIT -----------------1111--------------------------------------- >ARGININOSUCCINATE LYASE; SWP:P04424; PDB:1K62A; GAVDPIMEKFNASIAYDRHLWEVDVQGSKAYSRGLEKAGLLTKAEMDQILHGLDKVAEEW ------------3333-------------------1111--3333--------------- AQGTFKLNSNDEDIHTANERRLKELIGATAGKLHTGRSRNDQVVTDLRLWMRQTCSTLSG -------1111-3333----------3333---22223333------------------- LLWELIRTMVDRAEAERDVLFPGYTHLQRAQPIRWSHWILSHAVALTRDSERLLEVRKRI ------------------------%%%%-----3333----------------------- NVLPLGSGAIAGNPLGVDRELLRAELNFGAITLNSMDATSERDFVAEFLFWRSLCMTHLS ---------------------------------3333----3333--------------- RMAEDLILYCTKEFSFVQLSDAYSTGSSLMPRKKNPDSLELIRSKAGRVFGRCAGLLMTL -----------3333----3333---1111-----------------------------2 KGLPSTYNKDLQEDKEAVFEVSDTMSAVLQVATGVISTLQIHQENMGQALSPDMLATDLA 222----3333-----------------------------------11113333-3333- YYLVRKGMPFRQAHEASGKAVFMAETKGVALNQLSLQELQTISPLFSGDVICVWDYRHSV ---1111----------------------3333-333333333333---3333-3333-- EQYGALGGTARSSVDWQIRQVRALLQAQQA ----2222---------------------- >PHYTOCHROME RESPONSE REGU; SWP:Q8RTM8; PDB:1K66A; AVGNATQPLLVVEDSDEDFSTFQRLLQREGVVNPIYRCITGDQALDFLYQTGSYCNPDIA 
---1111------------------------------------------------3333- PRPAVILLDLNLPGTDGREVLQEIKQDEVLKKIPVVIMTTSSNPKDIEICYSYSISSYIV ----------------------------3333------------------1111------ KPLEIDRLTETVQTFIKYWLDIVVLPEMG ----------------------------- >PHYTOCHROME RESPONSE REGU; SWP:Q8RTN0; PDB:1K68A; AHKKIFLVEDNKADIRLIQEALANSTVPHEVVTVRDGMEAMAYLRQEGEYANASRPDLIL --------------------3333------------------1111---1111------- LLNLPKKDGREVLAEIKSDPTLKRIPVVVLSTSINEDDIFHSYDLHVNCYITKSANLSQL ---------------1111--1111-----------------1111-------------- FQIVKGIEEFWLSTATLPS ------------------- >ACETATE COA-TRANSFERASE A; SWP:P76458; PDB:1K6DA; MKTKLMTLQDATGFFRDGMTIMVGGFMGIGTPSRLVEALLESGVRDLTLIANDTAFVDTG ------33333333-2222------iiii--------------------------1111- IGPLIVNGRVRKVIASHIGTNPETGRRMISGEMDVVLVPQGTLIEQIRCGGAGLGGFLTP ----1111---------1111----------------------------1111------2 TGVGTVEGKQTLTLDGKTWLLERPLRADLALIRAHRCDTLGNLTYQLSARNFNPLIALAA 222----------!!!!--------------------1111-----1111-3333----- DITLVEPDELVETGELQPDHIVTPGAVIDHIIVSQES -----------2222-3333---3333---------- >ATP-DEPENDENT CLP PROTEAS; SWP:P15716; PDB:1K6KA; MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEA --------------------------------------------1111------------ FIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRNEVTGANVLVAIFSEQESQA -------------------------------------------------3333-1111-- AYLLRKHEVSRLDVVNFISHGT ----1111-------------- >6-phosphofructo-2-kinase/; SWP:P16118; PDB:1K6MA; NSPTMVIMVGLPARGKTYISTKLTRYLNFIGTPTKVFNLGQYRREAVSYKNYEFFLPDNM --------------------------------------------------3333-33333 EALQIRKQCALAALKDVHNYLSHEEGHVAVFDATNTTRERRSLILQFAKEHGYKVFFIES 333--------------------------------------------------------- ICNDPGIIAENIRQVKLGSPDYIDCDREKVLEDFLKRIECYEVNYQPLDEELDSHLSYIK ---------------1111--------------------3333-------1111------ IFDVGTRYMVNRVQDHIQSRTVYYLMNIHVTPRSIYLCRHGESELNIRGRIGGDSGLSVR --iiii---------------------------------------1111----------- 
GKQYAYALANFIQSQGISSLKVFTSRMKRTIQTAEALGVPYEQFKALNEIDAGVCEEMTY ------------3333----------3333--3333-------1111----!!!!----- EEIQEHYPEEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVICHQAVMRC ----------------------2222---------------------------------- LLAYFLDKSSEELPYLKCPLHTVLKLTPVAYGCKVESIYLNVEAVNTHREKPENVDITRE --------33331111---------------------------------------11113 PEEALDTVPAHY 333-1111---- >IMMUNOGLOBULIN FAB D3, LI; SWP:NA; PDB:1K6QH; EVQLQQSGAELVRPGALVKLSCKASGFNIKDYYMHWVKQRPEQGLELIGWIDPENGNTIY ------------2222-----------3333---------------------1111---- DPKFQDKASITADTSSNTAYLQLSSLTSEDTAVYYCARDTAAYFDYWGQGTTLTVSSAKT 3333----------------------3333------------------------------ TPPSVYPLAPGSANSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSS ----------------------------------%%%%---------------------- SVTVPSSTWPSETVTCNVAHPASSTKVDKKKIP -------------------3333---------- >IMMUNOGLOBULIN FAB D3, LI; SWP:NA; PDB:1K6QL; DIKMTQSPSSMSASLGESVTITCKASRDIKSYLSWYQQKPWKSPKTLIYYATSLADGVPS ----------------------------iiii----------------------222211 RFSGSGSGQDYSLTISSLESDDTATYYCLQHGESPFTFGSGTKLELKRADAAPTVSIFPP 11----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------!!!!--------------------------- LTKDEYERHNSYTCEATPIVKSFN --3333------------------ >HYPOTHETICAL PROTEIN YGBM; SWP:Q46891; PDB:1K77A; PRFAANLSFTEVPFIERFAAARKAGFDAVEFLFPYNYSTLQIQKQLEQNHLTLALFNTAP ------------3333-----1111--------1111----------------------- GDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVAGVVPAGEDAERYRAVFID -1111----1111--------------------------------22223333------- NIRYAADRFAPHGKRILVEALSPGVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQK --------3333---------33332222----------------1111----------- VDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPR ---------1111----------------------------------------------- GLTEEGLGWFDAWRGS -3333----------- >PAIRED BOX PROTEIN PAX5; SWP:Q02548; PDB:1K78A; 
GVNQLGGVFVNGRPLPDVVRQRIVELAHQGVRPCDISRQLRVSHGCVSKILGRYYETGSI --1111---2222-------------1111------------------------------ KPGVIGGSKPKVATPKVVEKIAEYKRQNPTMFAWEIRDRLLAERVCDNDTVPSVSSINRI ---------------------------1111----------------------------- IRTK ---- >RHAMNOGALACTURONAN ACETYL; SWP:Q00017; PDB:1K7CA; TTVYLAGDSTMAKNGGGSGTNGWGEYLASYLSATVVNDAVAGRSARSYTREGRFENIADV -------33332222-iiii--11113333---------2222-------------3333 VTAGDYVIVEFGHNDGGSLSTDNGRTDCSGTGAEVCYSVYDGVNETILTFPAYLENAAKL -2222------1111--3333------------------iiii----------------- FTAKGAKVILSSQTPNNPWETGTFVNSPTRFVEYAELAAEVAGVEYVDHWSYVDSIYETL ------------------1111-------------------------------------- GNATVNSYFPIDHTHTSPAGAEVVAEAFLKAVVCTGTSLKSVLTTTSFEGTCL -----1111------------------------------1111---------- >ALKALINE PHOSPHATASE; SWP:NA; PDB:1K7HA; EEDKAYWNKDAQDALDKQLGIKLREKQAKNVIFFLGDGMSLSTVTAARIYKGGLTGKFER -----------------1111--------------2222-----------------!!!! EKISWEEFDFAALSKTYNTDKQVTDSAASATAYLTGVKTNQGVIGLDANTVRTNCSYQLD --1111---------------------------------2222---333322223333-3 ESLFTYSIAHWFQEAGRSTGVVTSTRVTHATPAGTYAHVADRDWENDSDVVHDREDPEIC 333---------1111---------1111-3333------3333------1111-3333- DDIAEQLVFREPGKNFKVIMGGGRRGFFPEEALDIEDGIPGEREDGKHLITDWLDDKASQ -----------3333-------------1111-------------------------111 GATASYVWNRDDLLAVDIANTDYLMGLFSYTHLDTVLTRDAEMDPTLPEMTKVAIEMLTK 1-----------11113333--------------1111-3333----------------- DENGFFLLVEGGRIDHMHHANQIRQSLAETLDMEEAVSMALSMTDPEETIILVTADHGHT 1111-----------------------------------1111-1111------------ LTITGYADRNTDILDFAGISDLDDRRYTILDYGSGPGYHITEDGKRYEPTEEDLKDINFR -------22221111-------------------------1111-----------1111- YASAAPKHSATHDGTDVGIWVNGPFAHLFTGVYEENYIPHALAYAACVGTGRTFCD ----------------------2222-------1111---------------3333 >SECRETED PROTEASE C; SWP:P16317; PDB:1K7IA; ANTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWNGTNVFGKSANLT ------------1111--------%%%%------------3333-1111----------- 
FKFLQSVSSIPSGDTGFVKFNAEQIEQAKLSLQSWSDVANLTFTEVTGNKSANITFGNYT ---------1111---------------------------------!!!!---------- RDASGNLDYGTQAYAYYPGNYQGAGSSWYNYNQSNIRNPGSEEYGRQTFTHEIGHALGLA -1111---------------2222-----11113333-----------------1111-- HPGEYNAGEGDPSYNDAVYAEDSYQFSIMSFWGENETGADYNGHYGGAPMIDDIAAIQRL ------------3333--11113333--------1111--iiii---------------- YGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINL ---------------------1111---1111----------------1111-------- NEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGAD 2222---iiii------2222--------------------------------------- TLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQ --------------1111-3333-------2222--------1111-------------- EVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV -------1111-------2222-------------1111--- >PROTEIN YCIO; SWP:P45847; PDB:1K7JA; SQFFYIHPDNPQQRLINQAVEIVRKGGVIVYPTDSGYALGCKIEDKNAERICRIRQLPDG ----------------------1111---------------1111------------222 HNFTLCRDLSELSTYSFVDNVAFRLKNNTPGNYTFILKGTKEVPRRLLQEKRKTIGRVPS 2------3333----------------------------33333333------------- NPIAQALLEALGEPLSTSLLPGSEFTESDPEEIKDRLEKQVDLIIHGGYLGQKPTTVIDL -------------------2222-------------1111-------------------- TDDTPVVVREGVGDVKPFL %%%%----------3333- >HYPOTHETICAL PROTEIN YGGV; SWP:HAM1_ECOLI; PDB:1K7KA; HQKVVLATGNVGKVRELASLLSDGLDIVAQTDLGVDSAEETGLTFIENAILKARHAAKVT ---------------------111----3333---------------------------- ALPAIADDSGLAVDVLGGAPGIYSARYSGEDATDQKNLQKLLETKDVPDDQRQARFHCVL -------------1111--!!!!--1111------------------1111--------- VYLRHAEDPTPLVCHGSWPGVITREPAGTGGFGYDPIFFVPSEGKTAAELTREEKSAISH ----1111-----------------------!!!!----3333--3333----------- RGQALKLLLDALRNG --------------- >DELTA 2 CRYSTALLIN; SWP:P24058; PDB:1K7WA; TDPIMEKLNSSIAYDQRLSEVDIQGSMAYAKALEKAGILTKTELEKILSGLEKISEEWSK ---------------------------------1111----------------------- GVFVVKQSDEDIHTANERRLKELIGDIAGKLHTGRSRNDQVVTDLKLFMKNSLSIISTHL 
-----3333---------------3333-1111--3333--------------------- LQLIKTLVERAAIEIDVILPGYTHLQKAQPIRWSQFLLSHAVALTRDSERLGEVKKRINV -------------1111-----%%%%-----3333------------------------- LPLGSGALAGNPLDIDREMLRSELEFASISLNSMDAISERDFVVEFLSFATLLMIHLSKM -2222----------------------------------3333----------------- AEDLIIYSTSEFGFLTLSDAFSTGASLMPQKKNPDSLELIRSKAGRVFGRLASILMVLKG -----------------3333---3333-----------------------------222 LPSTYNKDLQEDKEAVFDVVDTLTAVLQVATGVISTLQISKENMEKALTPEMLATDLALY 2----3333---------------------------------------3333-------- LVRKGVPFRQAHTASGKAVHLAETKGITINKLSLEDLKSISPQFSSDVSQVFNFVNSVEQ -1111-----------------1111-1111---------111111113333-------- YTALGGTAKSSVTTQIEQLRELMKKQKEQ --2222----------------------- ------------------------------------ >FORMAMIDOPYRIMIDINE-DNA G; SWP:P05523; PDB:1K82A; PELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVLSVQRRAKYLLL ------------33332222-----------------3333------------!!!!--- ELPEGWIIIHLGMSGSLRILPEELPPEKHDHVDLVMSNGKVLRYTDPRRFGAWLWTKELE -1111---------------------1111-----1111------3333----------- GHNVLTHLGPEPLSDDFNGEYLHQKCAKKKTAIKPWLMDNKLVVGVGNIYASESLFAAGI ----2222--1111-----------1111----3333-3333------------------ HPDRLASSLSLAECELLARVIKAVLLRSIEQGGTTLKPGYFAQELQVYGRKGEPCRVCGT 11113333--------------------1111--------1111--2222---------- PIVATKHAQRATFYCRQCQK ------%%%%---------- ------------------------------------------------------------ ---------------------------- >PROLINE DEHYDROGENASE; SWP:P09546; PDB:1K87A; PQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAG ---------1111--3333------1111------------------------------- MVQGLLQEFSLSSQEGVALMCLAEALLRIPDKATRDALIRDKISNGNRSPSLFVNAATWG -------------------------1111---------------------------3333 LLFTGNEASLSRSLNRIIGKSGEPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEK ---------------3333--3333------------3333--------3333-3333-- GFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYS ------------------------------------iiii----------3333---333 
RAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEEADRLEISLDLLEKLCFEPELAGWNG 3---------3333--------1111--------1111-------------3333----- IGFVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLEGYPVYTR --------1111-----------------------------------------------3 KVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAGQNYYPGQYEFQCLHGMGE 333--------------3333-----------------3333---1111-----222233 PLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIADTSLPLD 33------3333--------------1111-------------11113333--1111333 ELVADPVTAVEKLAQQEGQTGLPHPKIPLPR 3----------------------1111-111 >PROBABLE TRANSLATION INIT; SWP:Q57562; PDB:1K8BA; EILIEGNRTIIRNFRELAKAVNRDEEFFAKYLLKETGSAGNLEGGRLILQRR --------------3333---------------------------------- >QA-2 ANTIGEN; SWP:P14429; PDB:1K8DA; GQFTVRPGLGEPWIV -----2222------ >ADENYLYL CYCLASE-ASSOCIAT; SWP:Q01518; PDB:1K8FA; PAVLELEGKKWRVENQENVSNLVIEDTELKQVAYIYKCVNTTLQIKGKINSITVDNCKKL ---------------------------1111----------------------------- GLVFDDVVGIVEIINSKDVKVQVMGKVPTISINKTDGCHAYLSKNSLDCEIVSAKSSEMN ------------------------------------------1111-------------- VLIPTEGGDFNEFPVPEQFKTLWNGQKLVTTVTEIAG ------------------------------------- >SMALL PROTEIN B; SWP:O66640; PDB:1K8HA; GKSDKIIPIAENKEAKAKYDILETYEAGIVLKGSEVKSLREKGTVSFKDSFVRIENGEAW ------------3333-------------------------------------------- LYNLYIAPYKHATIENHDPLRKRKLLLHKREIMRLYGKVQEKGYTIIPLKLYWKNNKVKV ---------------------------------------3333----------------- LIALAKGKKLYDR ------------- >MHC CLASS II H2-M ALPHA C; SWP:P28078; PDB:1K8IA; QNHTFRHTLFCQDGIPNIGLSETYDEDELFSFDFSQNTRVPRLPDFAEWAQGQGDASAIA -----------------------iiii---------------33333333----3333-- FDKSFCEMLMREVSPKLEGQIPVSRGLPVAEVFTLKPLEFGKPNTLVCFISNLFPPTLTV -------------3333---------------------2222------------------ NWQLHSAPVEGASPTSISAVDGLTFQAFSYLNFTPEPFDLYSCTVTHEIDRYTAIAYWVP ---iiii----------------------------1111--------------------- Q - >H2-M beta 2; SWP:Q31099; PDB:1K8IB; GFVAHVESTCVLNDAGTPQDFTYCVSFNKDLLACWDPDVGKIVPCEFGVLSRLAEIISNI 
-----------------------------------------------11113333----- LNEQESLIHRLQNGLQDCATHTQPFWDVLTHRTRAPSVRVAQTTPFNTREPVMLACYVWG ---3333---3333----3333-------------------------------------- FYPADVTITWMKNGQLVPSHSNKEKTAQPNGDWTYQTVSYLALTPSYGDVYTCVVQHSGT -----------iiii------------------------------2222-------1111 SEPIRGDWTP ---------- >ACTIN-LIKE PROTEIN 3; SWP:P61157; PDB:1K8KA; GRLPACVVDCGTGYTKLGYAGNTEPQFIIPSCIAIKEVMKGVDDLDFFIGDEAIEKPTYA -----------------------------------------3333----3333------- TKWPIRHGIVEDWDLMERFMEQVIFKYLRAEPEDHYFLLTEPPLNTPENREYTAEIMFES -----iiii---------------------3333-------------------------- FNVPGLYIAVQAVLALAASWTSRQVGERTLTGTVIDSGDGVTHVIPVAEGYVIGSCIKHI ---------------------1111----------------------iiii-1111---- PIAGRDITYFIQQLLRDREVGIPPEQSLETAKAVKERYSYVCPDLVKEFNKYDTDGSKWI ----------------------3333----------------------------3333-- KQYTGINAISKKEFSIDVGYERFLGPEIFFHPEFANPDFTQPISEVVDEVIQNCPIDVRR ------------------3333----33333333-1111--3333-----11113333-- PLYKNIVLSGGSTMFRDFGRRLQRDLKRTVDARLKLSEELSKPKPIDVQVITHHMQRYAV ----------11112222------------------------------------3333-- WFGGSMLASTPEFYQVCHTKKDYEEIGPSICRHNPVFGVMS -----11113333-------------3333----------- >Actin-like protein 2; SWP:P61161; PDB:1K8KB; GVVVDSGDGVTHICPVYEGFSLPHLTRRLDIAGRDITRYLIKLLLLRGYAFNHSADFETV ----------------------2222-----3333---------1111-----1111--- RMIKEKLCYVGYNIEQEQKLALETTVLVESYTLPDGRIIKVGGERFEAPEALFQPHLINV ------------------------1111----1111-----------------3333--- EGVGVAELLFNTIQAADIDTRSEFYKHIVLSGGSTMYPGLPSRLERELKQLYLERVLKGD -----------------------1111----1111-2222-------------------- VEKLSKFKIR ---------- >Actin-related protein 2/3; SWP:Q58CQ2; PDB:1K8KC; AYHSFLVEPISCHAWNKDRTQIAICPNNHEVHIYEKSGNKWVQVHELKEHNGQVTGVDWA ---------------1111-----------------!!!!-------------------- PDSNRIVTCGTDRNAYVWTLKGRTWKPTLVILRINRAARCVRWAPNEKKFAVGSGSRVIS 1111---------------------------------------1111------------- ICYFEQENDWWVCKHIKKPIRSTVLSLDWHPNSVLLAAGSCDFKCRIFSAYIKEVEERPA 
-----------------------------1111------------------1111----- PTPWGSKMPFGELMFESSSSCGWVHGVCFSANGSRVAWVSHDSTVCLADADKKMAVATLA -3333---2222-----------------1111------1111-----1111-------- SETLPLLAVTFITESSLVAAGHDCFPVLFTYDSAAGKLSFGGRLDVPTARERFQNLDKKA --------------------3333--------1111------------------------ AGLDSLHKNSVSQISVLSGGKAKCSQFCTTGMDGGMSIWDVRSLESALKDLKIV -------------------3333------------------------1111--- >Actin-related protein 2/3; SWP:Q3MHR7; PDB:1K8KD; MILLEVNNRIIEETLALKFENAAAGNKPEAVEVTFADFDGVLYHISNPNGDKTKVMVSIS --------3333-------------------------%%%%-----22221111------ LKFYKELQAHGADELLKRVYGSYLVNPESGYNVSLLYDLENLPASKDSIVHQAGMLKRNC 1111---------------!!!!----2222------1111---3333--------3333 FASVFEKYFQFQEEGKEGENRAVIHYRDDETMYVESKKDRVTVVFSTVFKDDDDVVIGKV ------------------------------------1111----------3333------ FMQEFKEGRRASHTAPQVLFSHREPPLELKDTDAAVGDNIGYITFVLFPRHTNASARDNT -----------1111----------3333-------1111-------3333--------- INLIHTFRDYLHYHIKCSKAYIHTRMRAKTSDFLKVLNRARPDA --1111-------------------------------1111--- >ACTIN-LIKE PROTEIN 3; SWP:NA; PDB:1K8KE; PAYHSSLMDPDTKLIGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIK --------1111-------------------------------------3333------- NEADRTLIYITLYISECLKKLQKCNSKSQGEKEMYTLGITNFPIPGEPGFPLNAIYAKPA 3333---------------------3333--------------2222----1111----- NKQEDEVMRAYLQQLRQETGLRLCEKVFDPQNDKPSKWWTCFVKRQFMNKSLSG ---------------------3333----------11111111--2222-1111 >Actin-related protein 2/3; SWP:P59998; PDB:1K8KF; TATLRPYLSAVRATLQAALCLENFSSQVVERHNKPEVEVRSSKELLLQPVTISRNEKEKV ----------------------------2222--3333---3333--------------- LIEGSINSVRVSIAVKQADEIEKILCHKFMRFMMMRAENFFILRRKPVEGYDISFLITNF ------------------------------------3333-------2222--------- HTEQMYKHKLVDFVIHFMEEIDKEISEMKLSVNARARIVAEEFLKNF -----3333---------------------------------3333- >Actin-related protein 2/3; SWP:Q3SYX9; PDB:1K8KG; ARFRKVDVDEYDENKFVDEDDGAGPDEGEVDSCLRQGNMTAALQAALKNPPINTKSQAVK 
-1111----------------------------1111-------1111------------ DRAGSIVLKVLISFKANDIEKAVQSLDKNGVDLLMKYIYKGFESPSDNSSAVLLQWHEKA --------------3333----1111--------------1111-!!!!----------- LAAGGVGSIVRVLTARKTV -----------1111---- >E2 component of Branched-; SWP:P11182; PDB:1K8MA; MGQVVQFKLSDIGEGIREVTVKEWYVKEGDTVSQFDSICEVQSDKASVTITSRYDGVIKK --------------------------2222--1111------------------------ LYYNLDDIAYVGKPLVDIETEALKDLE ---1111---------------3333- >TRIACYLGLYCEROL LIPASE, G; SWP:P80035; PDB:1K8QA; AFGKLHPTNPEVTMNISQMITYWGYPAEEYEVVTEDGYILGIDRIPYGRKNSENIGRRPV --------3333--------1111---------1111----------1111--2222--- AFLQHGLLASATNWISNLPNNSLAFILADAGYDVWLGNSRGNTWARRNLYYSPDSVEFWA -----2222--------1111------1111-------2222---------11113333- FSFDEMAKYDLPATIDFILKKTGQDKLHYVGHSQGTTIGFIAFSTNPKLAKRIKTFYALA -----------------------------------------1111-3333---------- PVATVKYTETLINKLMLVPSFLFKLIFGNKIFYPHHFFDQFLATEVCSRETVDLLCSNAL ---------11111111------------------3333-----------3333------ FIICGFDTMNLNMSRLDVYLSHNPAGTSVQNVLHWSQAVKSGKFQAFDWGSPVQNMMHYH ------1111-1111--------------------------------------------- QSMPPYYNLTDMHVPIAVWNGGNDLLADPHDVDLLLSKLPNLIYHRKIPPYNHLDFIWAM -------3333---------1111---------------------------1111---11 DAPQAVYNEIVSMMGTD 113333------3333- ------------------------------------------------------------ ---------------------- >CALMODULIN-SENSITIVE ADEN; SWP:P40136; PDB:1K8TA; DRIDVLKGEKALKASGLVPEHADAFKKIARELNTYILFRPVNKLATNLIKSGVATKGLNV ------------3333-3333--------------------1111-----------1111 HGKSSDWGPVAGYIPFDQDLSKKHGQQLAVEKGNLENKKSITEHEGEIGKIPLKLDHLRI -------1111-----1111-----------------------iiii------------- EELKENGIILKGKKEIDNGKKYYLLESNNQVYEFRISDENNEVQYKTKEGKITVLGEKFN ----------------%%%%---------------------------------------- WRNIEVMAKNVEGVLKPLTADYDLFALAPSLTEIKKQIPQKEWDKVVNTPNSLEKQKGVT ----------------------------------1111---------------------- NLLIKYGIERKPDSTKGTLSNWQKQMLDRLNEAVKYTGYTGGDVVNHGTNEIFIINPEGE 
---1111---------------------------1111-----------------1111- FILTKNWEMTGRFIEKNITGKDYLYYFNRSYNKIAPGNKAYIEWTDPITKAKINTIPTSA ------------------1111-------------------------------------- EFIKNLSSIRRSSNVGVYKDSGDKDEFAKKESVKKIAGYLSDYYNSANHIFSQEKKRKIS ----------------------------------------33331111---3333----- IFRGIQAYNEIENVLKSKQIAPEYKNYFQYLKERITNQVQLLLTHQKSNIEFKLLYKQLN --------------3333----------------------------1111----3333-- FTENETDNFEVFQKIIDE ---3333----------- >S100A6; SWP:P06703; PDB:1K8UA; APLDQAIGLLVAIFHKYSGREGDKHTLSKKELKELIQKELTIGSKLQDAEIARLEDLDRN ----------------1111--1111---------------1111--------------1 KDQEVNFQEYVTFLGALALIYNEALKG 111-------------3333-3333-- >NEUROPEPTIDE F; SWP:P41967; PDB:1K8VA; PDKDFIVNPSDLVLDNKAALRDYLRQINEYFAIIGRPRF --------%%%%-----3333--1111------------ >TRNA PSEUDOURIDINE SYNTHA; SWP:P09171; PDB:1K8WA; MDINGVLLLDKPQGMSSNDALQKVKRIYNANRAGHTGALDPLATGMLPICLGEATKFSQY -----------2222------------------------3333---------3333---- LLDSDKRYRVIARLGQRTDTSDADGQIVEERPVTFSAEQLAAALDTFRGDIEQIPSMYSA 1111-------------11111111-------------------1111------------ LKYQGKKLYEYARQGIEVPREARPITVYELLFIRHEGNELELEIHCSKGTYIRTIIDDLG --iiii3333-1111--------------------!!!!-------2222---------- EKLGCGAHVIYLRRLAVSKYPVERMVTLEHLRELVEQAEQQDIPAAELLDPLLMPMDSPA ----------------!!!!3333-------------------3333-3333--1111-3 SDYPVVNLPLTSSVYFKNGNPVRTSGAPLEGLVRVTEGENGKFIGMGEIDDEGRVAPRRL 333-----3333---1111------------------3333--------1111------- VVEY ---- >ARGININOSUCCINATE SYNTHAS; SWP:P22767; PDB:1K92A; TTILKHLPVGQRIGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPDEEDYDAIPRRAM -------2222----------------------------------1111-3333-----1 EYGAENARLIDCRKQLVAEGIAAIQCGAFHNTTGGLTYFNTTPLGRAVTGTMLVAAMKED 111-----------------------------iiii---------------------111 GVNIWGDGSTYKGNDIERFYRYGLLTNAELQIYKPWLDTDFIDELGGRHEMSEFMIACGF 1--------11113333---------1111---1111------------------1111- DYKMSVEKAYSTDSNMLGATHEAKDLEYLNSSVKIVNPIMGVKFWDESVKIPAEEVTVRF 
--------------1111----!!!!-11113333-------1111-------------- EQGHPVALNGKTFSDDVEMMLEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMALLH iiii---iiii-------------------2222------1111---------------- IAYERLLTGIHNEDTIEQYHAHGRQLGRLLYQGRWFDSQALMLRDSLQRWVASQITGEVT -----------------------------111111113333---------3333------ LELRRGNDYSILNTVSENLTYKPERLTMEKGDSVFSPDDRIGQLTMRNLDITDTREKLFG ----!!!!-------1111--3333----------------------------------- YAKTGLLSSSAASGVPQVENLENK -1111--------------1111- >GRANCALCIN; SWP:P28676; PDB:1K94A; SVYTYFSAVAGQDGEVDAEELQRCLTQSGINGTYSPFSLETCRIMIAMLDRDHTGKMGFN 3333------1111--------------1111-------------33331111------- AFKELWAALNAWKENFMTVDQDGSGTVEHHELRQAIGLMGYRLSPQTLTTIVKRYSKNGR -------------------1111-------------1111----------------iiii IFFDDYVACCVKLRALTDFFRKRDHLQQGSANFIYDDFLQGTMAI -----------------------1111--------------3333 >UPSTREAM BINDING FACTOR 1; SWP:P17480; PDB:1K99A; MKKLKKHPDFPKKPLTPYFRFFMEKRAKYAKLHPEMSNLDLTKILSKKYKELPEKKKMKY --------------------------------1111-------------------3333- IQDFQREKQEFERNLARFREDHPDLIQNAKK ------------------------------- >Alaserpin [Precursor]; SWP:P14754; PDB:1K9OI; GETDLQKILRESNDQFTAQMFSEVVKANPGQNVVLSAFSVLPPLGQLALASVGESHDELL -----------------------------------3333------------!!!!----- RALALPNDNVTKDVFADLNRGVRAVKGVDLKMASKIYVAKGLELNDDFAAVSRDVFGSEV --------3333----1111------------------------3333------------ QNVDFVKSVEAAGAINKWVEDQTNNRIKNLVDPDALDETTRSVLVNAIYFKGSWKDKFVK ---33333333-----------%%%%-----1111-----------------------33 ERTMDRDFHVSKDKTIKVPTMIGKKDVRYADVPELDAKMIEMSYEGDQASMIIILPNQVD 33------------------------------1111---------------------111 GITALEQKLKDPKALSRAEERLYNTEVEITLPKFKIETTTDLKEVLSNMNIKKLFTPGAA 1---------11113333---------------------------1111--33332222- RLENLLKTKESLTVDAAIQKAFIEVNEEGAEAAAANAFGIVPKSLILYPEVHIDRPFYFE ------------------------------------------------------------ LKIDGIPMFNGKVIEP ---------------- >POLCALCIN PHL P 7; SWP:O82040; PDB:1K9UA; 
DDMERIFKRFDTNGDGKISLSELTDALRTLGSTSADEVQRMMAEIDTDGDGFIDFNEFIS ----------1111---------------11113333--------1111----------- FCNANPGLMKDVAKVF ---------------- >Imidazole glycerol phosph; SWP:Q9X0C8; PDB:1K9VF; MRIGIISVGPGNIMNLYRGVKRASENFEDVSIELVESPRNDLYDLLFIPGVGHFGEGMRR ------------------------------------------------------------ LRENDLIDFVRKHVEDERYVVGVCLGMQLLFEESEEAPGVKGLSLIEGNVVKLRSRRLPH -1111----------------------------3333----------------------- MGWNEVIFKDTFPNGYYYFVHTYRAVCEEEHVLGTTEYDGEIFPSAVRKGRILGFQFHPE ---------------------------3333------iiii-------!!!!-----333 KSSKIGRKLLEKVIECSLSR 3--------------3333- >HALOTOLERANCE PROTEIN HAL; SWP:P32179; PDB:1KA1A; ALERELLVATQAVRKASLLTKRIQSEVISHKDSTTITKNDNSPVTTGDYAAQTIIINAIK ----------------------33333333-------1111------------------- SNFPDDKVVGEESSSGLSDAFVSGILNEIKANDEVYNKNYKKDDFLFTNDQFPLKSLEDV --1111-------2222-----------------3333----------3333-------- RQIIDFGNYEGGRKGRFWCLDPIDGTKGFLRGEQFAVCLALIVDGVVQLGCIGCPNLVLS ----1111--------------------1111----------iiii-----------333 SYGAQDLKGHESFGYIFRAVRGLGAFYSPSSDAESWTKIHVRHLKDTKDMITLEGVEKGH 3-----2222---------2222-----3333-------------3333-------1111 SSHDEQTAIKNKLNISKSLHLDSQAKYCLLALGLADVYLRLPIKLSYQEKIWDHAAGNVI ----------1111--------3333-----------------1111--3333------- VHEAGGIHTDAMEDVPLDFGNGRTLATKGVIASSGPRELHDLVVSTSCDVIQSR -1111---------------------------------------------1111 >PHOSPHOCARRIER PROTEIN HP; SWP:P02907; PDB:1KA5A; MEQNSYVIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAE ---------1111---3333----------------iiii------3333---------- ITIYADGSDESDAIQAISDVLSKEGLTK ---------------------------- >PUTATIVE P4-SPECIFIC DNA ; SWP:P10277; PDB:1KA8A; DADPTFDFIGYLETLPQTSGMYMGNASIIPRNYRKYLYHAYLAYMEANGYRNVLSLKMFG --------1111------------3333---1111----------1111----------- LGLPVMLKEYGLNYEKRHTKQGIQTNLTLKEESYGDWLPK -------1111-------1111--------3333------ >Imidazole glycerol phosph; SWP:Q7SIB9; PDB:1KA9F; SLAKRIVPCLDVHAGRVVKGVNFVNLRDAGDPVEAARAYDEAGADELVFLDISATHEERA 
------------%%%%-----------1111--------1111----------------- ILLDVVARVAERVFIPLTVGGGVRSLEDARKLLLSGADKVSVNSAAVRRPELIRELADHF ------------------------3333----3333------3333-------------- GAQAVVLAIDARWRGDFPEVHVAGGRVPTGLHAVEWAVKGVELGAGEILLTSMDRDGTKE 1111------------------%%%%--------------3333-------3333----- GYDLRLTRMVAEAVGVPVIASGGAGRMEHFLEAFQAGAEAALAASVFHFGEIPIPKLKRY -------------------------3333----1111---------1111---------- LAEKGVHVRLD -1111------ >Imidazole glycerol phosph; SWP:Q7SIC0; PDB:1KA9H; MKALLIDYGSGNLRSAAKALEAAGFSVAVAQDPKAHEEADLLVLPGQGHFGQVMRAFQES --------------------1111------------------------------------ GFVERVRRHLERGLPFLGICVGMQVLYEGSEEAPGVRGLGLVPGEVRRFRAGRVPQMGWN ---------1111-----!!!!-------3333--------------------------- ALEFGGAFAPLTGRHFYFANSYYGPLTPYSLGKGEYEGTPFTALLAKENLLAPQFHPEKS ----!!!!1111-----------------------iiii----------------1111- GKAGLAFLALARRYF --------------- >FIBER KNOB PROTEIN; SWP:P36711; PDB:1KACA; TPYDPLTLWTTPDPPPNCSLIQELDAKLTLCLTKNGSIVNGIVSLVGVKGNLLNIQSTTT ---1111---------------------------!!!!------------1111-1111- TVGVHLVFDEQGRLITSTPTALVPQASWGYRQGQSVSTNTVTNGLGFMPNVSAYPRPNAS --------1111----------1111-----!!!!-------3333----33331111-- EAKSQMVSLTYLQGDTSKPITMKVAFNGITSLNGYSLTFMWSGLSNYINQPFSTPSCSFS 3333------22221111------------------------------------------ YITQE ----- >HISTIDINOL DEHYDROGENASE; SWP:P06988; PDB:1KAEA; SFNTIIDWNSCTAEQQRQLLRPAISASESITRTVNDILDNVKARGDEALREYSAKFDKTT 1111--1111-------------------------------------------------- VTALKVSAEEIAAASERLSDELKQAAVAVKNIETFHTAQKLPPVDVETQPGVRCQQVTRP --------------3333------------------1111--------2222-------- VASVGLYIPGGSAPLFSTVLLATPASIAGCKKVVLCSPPPIADEILYAAQLCGVQDVFNV --------------3333-------------------------------1111------- GGAQAIAALAFGTESVPKVDKIFGPGNAFVTEAKRQVSQRLDGAAIDPAGPSEVLVIADS -3333------------------------------33331111---------------11 GATPDFVASDLLSQAEHGPDSQVILLTPAADARRVAEAVERQLAELPRAETARQALNASR 113333-------33331111------------------------1111-----3333-- 
LIVTKDLAQCVEISNQYGPEHLIIQTRNARELVDSITSAGSVFLGDWSPESAGDYASGTN ---------------------------33333333-------------3333-------- HVLPTYGYTATCSSLGLADFQKRTVQELSKEGFSALASTIETLAAAERLTAHKNAVTLRV ----iiii-------3333-------------------------1111------------ NALKEQA ---1111 >TRANSCRIPTION REGULATORY ; SWP:P22915; PDB:1KAFA; MEITSDMEEDKDLMLKLLDKNGFVLKKVEIYRSNYLAILEKRTNGIRNFEINNNGNMRIF ------------------1111--------%%%%--------iiii-----1111----- GYKMMEHHIQKFTDIGMSCKIAKNGNVYLDIKRSAENIEAVITVASEL ----3333----1111-----1111-------------------1111 >SHIKIMATE KINASE I; SWP:P24167; PDB:1KAGA; EKRNIFLVGPMGAGKSTIGRQLAQQLNMEFYDSDQEIEKRTGADVGWVFDLEGEEGFRDR ---------22223333------------------------------------------- EEKVINELTEKQGIVLATGGGSVKSRETRNRLSARGVVVYLETTIEKQLARTPLLHVETP -------1111-------1111---------------------3333------------- PREVLEALANERNPLYEEIADVTISAKVVANQIIHMLE 3333---------------------------------- >NICOTINATE-NUCLEOTIDE ADE; SWP:P54455; PDB:1KAMA; SKKIGIFGGTFDPPHNGHLLMANEVLYQAGLDEIWFMPNQIPDSFHRVEMLKLAIQSNPS ------------------------------------------------------3333-- FKLELVEMEREGPSYTFDTVSLLKQRYPNDQLFFIIGADMIEYLPKWYKLDELLNLIQFI ----11112222--------------1111------11111111---------------- GVKRPGFHVETPYPLLFADVPEFEVSSTMIRERFKSKKPTDYLIPDKVKKYVEENGLYES ---------------------------------------2222----------------- >RAP2A; SWP:P10114; PDB:1KAO; MREYKVVVLGSGGVGKSALTVQFVTGTFIEKYDPTIEDFYRKEIEVDSSPSVLEILDTAG ----------2222------------------1111---------%%%%----------- TEQFASMRDLYIKNGQGFILVYSLVNQQSFQDIKPMRDQIIRVKRYEKVPVILVGNKVDL ---3333---------------1111-----------------1111---------3333 ESEREVSSSEGRALAEEWGCPFMETSAKSKTMVDELFAEIVRQMNYA 1111---------------------3333------------------ >Alkaline metalloproteinas; SWP:Q03023; PDB:1KAPP; GRSDAYTQVDNFLHAYARGGDELVNGHPSYTVDQAAEQILREQASWQKAPGDSVLTLSYS -----------1111--------iiii----------1111------------------- FLTKPNDFFNTPWKYVSDIYSLGKFSAFSAQQQAQAKLSLQSWSDVTNIHFVDAGQGDQG -----3333-3333-3333----------------------------------------- 
DLTFGNFSSSVGGAAFAFLPDVPDALKGQSWYLINSSYSANVNPANGNYGRQTLTHEIGH ----------------------3333--------11111111--2222-----------1 TLGLSHPGDYNAGEGDPTYADATYAEDTRAYSVMSYWEEQNTGQDFKGAYSSAPLLDDIA 111-------2222---3333--1111----1111---3333---iiii----------- AIQKLYGANLTTRTGDTVYGFNSNTERDFYSATSSSSKLVFSVWDAGGNDTLDFSGFSQN --------------------------3333---1111----------------1111--- QKINLNEKALSDVGGLKGNVSIAAGVTVENAIGGSGSDLLIGNDVANVLKGGAGNDILYG -----2222---iiii------2222---------------------------------- GLGADQLWGGAGADTFVYGDIAESSAAAPDTLRDFVSGQDKIDLSGLDAFVNGGLVLQYV -------------------3333-3333-------2222----11113333--------- DAFAGKAGQAILSYDAASKAGSLAIDFSGDAHADFAINLIGQATQADIVV -----2222----------------------------------3333--- >QUINOHEMOPROTEIN ALCOHOL ; SWP:Q46444; PDB:1KB0A; TGPAAQAAAAVQRVDGDFIRANAARTPDWPTIGVDYAETRYSRLDQINAANVKDLGLAWS ------------------------------11111111---------33331111----- YNLESTRGVEATPVVVDGIMYVSASWSVVHAIDTRTGNRIWTYDPQIDRSTGFKGCCDVV ---------------iiii-----%%%%-------------------11111111----- NRGVALWKGKVYVGAWDGRLIALDAATGKEVWHQNTFEGQKGSLTITGAPRVFKGKVIIG --------------1111----------------1111--------------iiii---- NGGAEYGVRGYITAYDAETGERKWRWFSVPGDPSKPFEDESMKRAARTWDPSGKWWEAGG --3333-------------------------1111---3333---11113333-3333-- GGTMWDSMTFDAELNTMYVGTGNGSPWSHKVRSPKGGDNLYLASIVALDPDTGKYKWHYQ -----------1111------------3333-1111------------------------ ETPGDNWDYTSTQPMILADIKIAGKPRKVILHAPKNGFFFVLDRTNGKFISAKNFVPVNW ---------------------%%%%--------1111----------------------- ASGYDKHGKPIGIAAARDGSKPQDAVPGPYGAHNWHPMSFNPQTGLVYLPAQNVPVNLMD ----1111----3333-1111------3333----------1111--------------- DKKWEFNQAGPGKPQSGTGWNTAKFFNAEPPKSKPFGRLLAWDPVAQKAAWSVEHVSPWN 11112222-2222--1111----------------------------------------- GGTLTTAGNVVFQGTADGRLVAYHAATGEKLEAPTGTGVVAAPSTYMVDGRQYVSVAVGW -----1111--------------------------------------iiii--------- GGVYGLAARATERQGPGTVYTFVVGGKARMPETGQLLQGVKYDPAKVEAGTMLYVANCVF -3333-------------------------------------3333-------------- 
CHGVPGVDRGGNIPNLGYMDASYIENLPNFVFKGPAMVRGMPDFTGKLSGDDVESLKAFI --------------1111-3333--3333----1111------2222-!!!!-------- QGTADAIRP --------- >KB5-C20 T-CELL ANTIGEN RE; SWP:Q5R1D3; PDB:1KB5A; QQVRQSPQSLTVWEGETAILNCSYEDSTFNYFPWYQQFPGEGPALLISIRSVSDKKEDGR ------------------------------------------------------------ FTIFFNKREKKLSLHITDSQPGDSATYFCAARYQGGRALIFGTGTTVSVSPGSAD ------1111---------3333---------2222------------------- >KB5-C20 T-CELL ANTIGEN RE; SWP:NA; PDB:1KB5H; EVQLQQSGPELEKPGASVKISCKASGYSFTGYNMNWVKQSNGKSLEWIGNIDPYYGGISY ------------2222----------------------2222------------------ NQKFKGRATLTVDKSSSTAYMQLKSL -------------1111--------- >VITAMIN D3 RECEPTOR; SWP:P11473; PDB:1KB6A; PRICGVCGDRATGFHFNAMTCEGCKGFFRRSMKRKALFTCPFNGDCRITKDNRRHCQACR --------------iiii------------------------------11113333---- LKRCVDIGMMKEFILTDEEVQRKREMILKRKEEE ---------3333--------------------- >KAPPA-BUNGAROTOXIN; SWP:P01398; PDB:1KBAA; RTCLISPSSTPQTCPNGQDICFLKAQCDKFCSIRGPVIEQGCVATCPQFRSNYRSLLCCT -----------------------------3333----------------1111------- TDNCNH 2222-- >KINASE SUPPRESSOR OF RAS; SWP:Q61097; PDB:1KBEA; GSVTHRFSTKSWLSQVCNVCQKSMIFGVKCKHCRLKCHNKCTKEAPACR ----------------------------------------3333----- >NUCLEAR RECEPTOR COACTIVA; SWP:Q9Y6Q9; PDB:1KBHA; EGQSDERALLDQLHTLLSNTDATGLEEIDRALGIPELVNQGQALEPK ---------------------1111-------3333-33331111-- >CYTOCHROME B2; SWP:P00175; PDB:1KBIA; EPKLDMNKQKISPAEVAKHNKPDDCWVVINGYVYDLTRFLPNHPGGQDVIKFNAGKDVTA ---------------3333-1111--------------333311113333-------333 IFEPLHAPNVIDKYIAPEKKLGPLQGSMPPELVCPPYAPGETKEDIARKEQLKSLLPPLD 31111-1111-----3333---------1111-----2222----------3333--333 NIINLYDFEYLASQTLTKQAWAYYSSGANDEVTHRENHNAYHRIFFKPKILVDVRKVDIS 3--------------------------!!!!-------3333------------------ TDMLGSHVDVPFYVSATALCKLGNPLEGEKDVARGCGQGVTKVPQMISTLASCSPEEIIE --iiii-------------3333------------------------1111--------- AAPSDKQIQWYQLYVNSDRKITDDLVKNVEKLGVKALFVTVDAPSLGQREKDMKLKFSNT 
---1111----------------------------------------------------- KKTNVEESQGASRALSKFIDPSLTWKDIEELKKKTKLPIVIKGVQRTEDVIKAAEIGVSG -----------1111333311113333--3333------------3333----------- VVLSNHGGRQLDFSRAPIEVLAETMPILEQRNLKDKLEVFVDGGVRRGTDVLKALCLGAK ----%%%%-------3333---------------------------3333---------- GVGLGRPFLYANSCYGRNGVEKAIEILRDEIEMSMRLLGVTSIAELKPDLLDLSTLKART -----------------------------------3333--3333-1111--1111---- VGVPNDVLYNEVYEGPTLTEFEDA ------------------------ >PYRUVATE PHOSPHATE DIKINA; SWP:P22983; PDB:1KBLA; AKWVYKFEEGNASMRNLLGGKGCNLAEMTILGMPIPQGFTVTTEACTEYYNSGKQITQEI -----3333-3333--------------1111----------------1111-------- QDQIFEAITWLEELNGKKFGDTEDPLLVSVRSGARASMPGMMDTILNLGLNDVAVEGFAK -----------------2222----------------2222-----------3333---- KTGNPRFAYDSYRRFIQMYSDVVMEVPKSHFEKIIDAMKEEKGVHFDTDLTADDLKELAE --------------------------3333---------------3333----------- KFKAVYKEAMNGEEFPQEPKDQLMGAVKAVFRSWDNPRAIVYRRMNDIPGDWGTAVNVQT ---------iiii------------------3333-3333---1111-3333-------- MVFGNKGETSGTGVAFTRNPSTGEKGIYGEYLINAQGEDVVAGVRTPQPITQLENDMPDC ------1111-------------------------33333333-----3333-------- YKQFMDLAMKLEKHFRDMQDMEFTIEEGKLYFLQTRNGKRTAPAALQIACDLVDEGMITE ----------------------------------------------------1111---- EEAVVRIEAKSLDQLLHPTFNPAALKAGEVIGSALPASPGAAAGKVYFTADEAKAAHEKG ---111133333333--------------------------------------------- ERVILVRLETSPEDIEGMHAAEGILTVRGGMTSHAAVVARGMGTCCVSGCGEIKINEEAK ----------3333---1111--------1111----------------3333---1111 TFELGGHTFAEGDYISLDGSTGKIYKGDIETQEASVSGSFERIMVWADKFRTLKVRTNAD ---!!!!--2222----------------------------------------------- TPEDTLNAVKLGAEGIGLCRTEHMFFEADRIMKIRKMILSDSVEAREEALNELIPFQKGD --------1111--------3333--1111------------------------------ FKAMYKALEGRPMTVRYLDPPLHEFVPHTEEEQAELAKNMGLTLAEVKAKVDELHEFNPM -------iiii---------3333---------------------------------333 MGHRGCRLAVTYPEIAKMQTRAVMEAAIEVKEETGIDIVPEIMIPLVGEKKELKFVKDVV 3-----------------------------------------------3333-------- 
VEVAEQVKKEKGSDMQYHIGTMIEIPRAALTADAIAEEAEFFSFGTNDLTQMTFGFSRDD ------------------------3333----------------------------3333 AGKFLDSYYKAKIYESDPFARLDQTGVGQLVEMAVKKGRQTRPGLKCGICGEHGGDPSSV --------1111----1111---------------------1111--------------- EFCHKVGLNYVSCSPFRVPIARLAAAQAALNN -------------1111--------------- >PURPLE ACID PHOSPHATASE; SWP:P80366; PDB:1KBPA; RDMPLDSDVFRVPPGYNAPQQVHITQGDLVGRAMIISWVTMDEPGSSAVRYWSEKNGRKR ---11111111------------------------------------------------- IAKGKMSTYRFFNYSSGFIHHTTIRKLKYNTKYYYEVGLRNTTRRFSFITPPQTGLDVPY ------------------------------------------------------1111-- TFGLIGDLGQSFDSNTTLSHYELSPKKGQTVLFVGDLSYADRYPNHDNVRWDTWGRFTER --------------------------------------111122223333-------333 SVAYQPWIWTAGNHEIEFAPEINETEPFKPFSYRYHVPYEASQSTSPFWYSIKRASAHII 3----------1111---3333---2222--------1111------------!!!!--- VLSSYSAYGRGTPQYTWLKKELRKVKRSETPWLIVLMHSPLYNSYNHHFMEGEAMRTKFE --1111--2222---------11111111----------------22221111------- AWFVKYKVDVVFAGHVHAYERSERVSNIAYKITDGLCTPVKDQSAPVYITIGDAGNYGVI ------------------------------------------------------------ DSNMIQPQPEYSAFREASFGHGMFDIKNRTHAHFSWNRNQDGVAVEADSVWFFNRHWYPV --------3333-------------------------33331111--------------- DDST ---- >MAJOR OUTER MEMBRANE PROT; SWP:Q02219; PDB:1KBVA; ELPVIDAVTTHAPEVPPAIDRDYPAKVRVKMETVEKTMKMDDGVEYRYWTFDGDVPGRMI ----------------------------------------2222---------------- RVREGDTVEVEFSNNPSSTVPHNVDFHAATGQGGGAAATFTAPGRTSTFSFKALQPGLYI --2222--------1111-------1111-2222-3333--2222--------------- YHCAVAPVGMHIANGMYGLILVEPKEGLPKVDKEFYIVQGDFYTKGKKGAQGLQPFDMDK ------3333-------------1111-------------------2222---------- AVAEQPEYVVFNGHVGALTGDNALKAKAGETVRMYVGNGGPNLVSSFHVIGEIFDKVYVE ------------------!!!!----2222------------------2222-----222 GGKLINENVQSTIVPAGGSAIVEFKVDIPGNYTLVDHSIFRAFNKGALGQLKVEGAENPE 2-------------2222--------------------3333---------------333 IM 3- >SRC TYROSINE KINASE; SWP:P00524; PDB:1KC2A; AEEWYFGKITRRESERLLLNPENPRGTFLVRESETTKGAYCLSVSLNVAHYKIRKLDSGG 
-1111-----------1111---2222--------2222----------------3333- FYITSRTQFSSLQQLVAYYSKHADGLCHRLTNVCPT ---1111----------1111--------------- >NITRITE REDUCTASE; SWP:P25006; PDB:1KCBA; DISTLPRVKVDLVKPPFVHAHDQVAKTGPRVVEFTMTIEEKKLVIDREGTEIHAMTFNGS 3333-----------------------------------------1111-------iiii VPGPLMVVHENDYVELRLINPDTNTLLHNIDFHAATGALGGGALTQVNPGEETTLRFKAT --------2222--------1111-------1111-%%%%3333---2222--------- KPGVFVYHCAPEGMVPWHVTSGMNGAIMVLPRDGLKDEKGQPLTYDKIYYVGEQDFYVPK ----------2222---1111---------1111--1111-------------------- DEAGNYKKYETPGEAYEDAVKAMRTLTPTHIVFNGAVGALTGDHALTAAVGERVLVVHSQ 1111------3333--------3333------iiii----!!!!----2222-------- ANRDTRPHLEGGHGDYVWATGKFRNPPDLDQETWLIPGGTAGAAFYTFRQPGVYAYVNHN --------2222-----11111111-----------2222-------------------- LIEAFELGAAGHFKVTGEWNDDLMTSVVKPASM -------------------3333---------- >HYPOTHETICAL 30.2 KD PROT; SWP:Q10423; PDB:1KCFA; TVKLSFLQHICKLTGLSRSGRKDELLRRIVDSPIYPTSRVLGIDLGIKNFSYCFASQNED --------------------1111---------------------------------111 SKVIIHNWSVENLTEKNGLDIQWTEDFQPSSMADLSIQLFNTLHEKFNPHVILMERQRYR 1---------------1111-------3333----------------------------3 SGIATIPEWTLRVNMLESMLYALHYAEKRNYPFLLSLSPKSTYSYWASVLNKKSRVQMVK 333--------------------------------------------------------- ELIDGQKILFENEEALYKWNNGSRVEFKKDDMADSALIASGWMRWQAQLKHYRNFCKQFL --1111-----------------------------------------------3333--- >NKG2D ligand 3 [Precursor; SWP:Q9BZM4; PDB:1KCGC; DAHSLWYNFTIIHLPRHGQQWCEVQSQVDQKNFLSYDCGSDKVLSMGHLEEQLYATDAWG -------------------------------------------------------3333- KQLEMLREVGQRLRLELADTEPLTLQVRMSCECEADGYIRGSWQFSFDGRKFLLFDSNNR ---------------3333----------------------------------------- KWTVVHAGARRMKEKWEKDSGLTTFFKMVSMRDCKSWLRDFLMHRKKRLE -----1111----------------------------------------- >PHOSPHATIDYLINOSITOL TRAN; SWP:P53810; PDB:1KCMA; VLLKEYRVILPVSVDEYQVGQLYSVAEASKNETGGGEGVEVLVNEPYEKDDGEKGQYTHK ------------3333----------1111------------------1111-------- 
IYHLQSKVPTFVRMLAPEGALNIHEKAWNAYPYCRTVITNEYMKEDFLIKIETWHKPDLG ---1111-3333----2222-------------------3333----------------- TQENVHKLEPEAWKHVEAIYIDIADRSQVLSKDYKAEEDPAKFKSVKTGRGPLGPNWKQE ---1111-33331111-----111111113333-11111111-----------1111111 LVNQKDCPYMCAYKLVTVKFKWWGLQNKVENFIHKQEKRLFTNFHRQLFCWLDKWVDLTM 1--2222-------------------------------------------33331111-- DDIRRMEEETKRQLDE ---------------- >GELSOLIN; SWP:P06396; PDB:1KCQA; VVQRLFQVKGRRVVRATEVPVSWESFNNGDCFILDLGNNIHQWCGSNSNRYERLKATQVS ---------------------3333----------!!!!-----1111------------ KGIRDNERSGRARVHVSEEGTEPEAMLQVLGPKPALPAGTEDTA -------iiii------2222-3333------------------ >Major surface antigen [Pr; SWP:P03142; PDB:1KCRH; QVALQESGPGLVKPSQSLSLTCTVTGYSITSDYAWNWIRQFPGNKLEWMGYIRNGGSTTY ----------------------------------------------------3333---- NPSLASRISITRDTSKNQFFLQLNSVTTEDTATYYCARGGTGFTYWGAGTLVTVSAAATT ---1111-------------------3333------------------------------ PPSVYPLAPGSAAAAAAMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSALYTL ------------------------------------%%%%-------------------- SSSVTVPSSPRPSATVTCNVAHPASSTKVDKKIVPRDC ----------------------1111------------ >Major surface antigen [Pr; SWP:P03142; PDB:1KCRL; DIVLTQSPKSMSMSVGERVTLSCKASENVGTYVSWYQQKPEQSPKLLIYGASNRYTGVPD ----------------------------------------------------------33 RFTGSGSATDFTLKISSVQAEDLADYHCGQTYSYPTFGGGTKLAIKRADAAPTVSIFPPS 33----------------3333-------------------------------------3 SEQLTAGGASVVCFLNNFYPKDINVKWKIDGSERQNGVANSWTAQDSADSTYSMSSTLTL 333--------------------------------------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC --3333--------------------------- >PC287 IMMUNOGLOBULIN; SWP:NA; PDB:1KCUH; QVKLQQSGPGLVKPSQSLSLTCTVTGYSITSDYAWNWIRQFPGNKLEWMAYISYSGSTTY ---------------------------1111---------------------1111---- NPSLKSRISITRDTSKNQFFLQLNSVTTEDTAIYYCARGGTGFDYWGAGTTLTVSAAATT 1111---------1111---------3333------------------------------ PPSVYPLAPGSATAAASMVTLGCLVKGYFPEPVTVTWNSGALSSGVHTFPAVLQSDLYTL 
------------------------------------%%%%-------------------- SSSVTVPSSPWPSETVTCNVAHPASSTKVDKKIVPRD ----------------------1111----------- >PC287 IMMUNOGLOBULIN; SWP:NA; PDB:1KCUL; DIVLTQSPKSMSMSVGEKVTLSCKASENVDTYVSWYQQRPEQPPALLIYGASNRYTGVPD -------------2222-------------------------------------222233 RFTGSGSATDFTLTISSVQAEDLADYHCGQSYSYPLTFGGGTKLELKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVANSWTAQDSKDSTYSMSSTLT 33331111---------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPVVKSFNRNEC -3333------------1111--------1111- >Ig heavy chain V region 1; SWP:P18532; PDB:1KCVH; QVTLSQSGPGLVKPSQSLSLTCTVTSYSITSDYAWNWIRQFAGQSLEWMGYISYSGSTSY ------------2222-----------1111---------1111--------1111---- NPSLKSRISITRDTSKNQFFLQLNSVTTDDTATYYCARGGTGFPYWGTGTNVTVSAASTT 1111---------1111---------3333------------------------------ APSVFPLVPGSATAAASAVTLGCLVKGYFPEPVTVAWNEGALSSGVLTVSAVLQSGLYTL ------------------------------------iiii-------------------- SSNTTVASGTWPSASVTCLVAHPKSSTAADKKIEPKD ------1111------------1111----------- >Ig heavy chain V region 1; SWP:P18532; PDB:1KCVL; DIVMTQSPKSMGMSVGEAVTLNCKASENVGTYVSWYQQKPGQSPVLLIYGASNRYTGVPD -------------2222-------------------------------------222233 RFTGSGSATDFTLTISSVQADDDADYYCGQSYSSPLTFGGGTKLELKRADAAPTSSIFPP 33----------------1111-------------------------------------- SSEQLSSGGASVVCFLNSFYPKSIAVKWKVDGSKRANGTANSWTDQDSASSTYSMSSTLT 33331111---------------------iiii--------------------------- LTKDKYERHNSYTCEATHKTSSSPVVKSFNRNEC -----1111--------3333------------- >DIHYDROPYRIMIDINASE RELAT; SWP:P97427; PDB:1KCXA; DRLLIRGGRIINDDQSFYADVYLEDGLIKQIGENLIVPGGVKTIEANGRMVIPGGIDVNT -----------1111--------iiii------------------iiii----------- YLQKPSQGMTSADDFFQGTKAALAGGTTMIIDHVVPEPGSSLLTSFEKWHEAADTKSCCD -----iiii---------------------------2222-------------------- YSLHVDITSWYDGVREELEVLVQDKGVNSFQVYMAYKDLYQMSDSQLYEAFTFLKGLGAV 
------------3333-------------------2222--------------------- ILVHAENGDLIAQEQKRILEMGITGPEGHALSRPEELEAEAVFRAIAIAGRINCPVYITK ------------------------3333-1111--------------------------- VMSKSAADIIALARKKGPLVFGEPIAASLGTDGTHYWSKNWAKAAAFVTSPPLSPDPTTP -----------3333---------3333----3333-------------------1111- DYLTSLLACGDLQVTGSGHCPYSTAQKAVGKDNFTLIPEGVNGIEERMTVVWDKAVATGK ----------------------3333------1111------3333---------1111- MDENQFVAVTSTNAAKIFNLYPRKGRIAVGSDADVVIWDPDKMKTITAKSHKSTVEYNIF --------------------3333---2222---------------3333-------111 EGMECHGSPLVVISQGKIVFEDGNISVSKGMGRFIPRKPFPEHLYQRVRIRSKVFG 1------------iiii---iiii---2222---------3333------------ >BETA-METHYLASPARTASE; SWP:Q05514; PDB:1KCZA; MKIVDVLCTPGLTGFYFDDQRAIKKGAGHDGFTYTGSTVTEGFTQVRQKGESISVLLVLE ----------------------1111---!!!!------2222---------------11 DGQVAHGDCAAVQYSGAGGRDPLFLAKDFIPVIEKEIAPKLIGREITNFKPMAEEFDKMT 11-----------2222-------3333--------33332222---------------- VNGNRLHTAIRYGITQAILDAVAKTRKVTMAEVIRDEYNPGAEINAVPVFAQSGDDRYDN iiii----------------------------------2222---------------333 VDKMIIKEADVLPHALINNVEEKLGLKGEKLLEYVKWLRDRIIKLRVREDYAPIFHIDVY 3-----------------------1111-------------------1111-------ii GTIGAAFDVDIKAMADYIQTLAEAAKPFHLRIEGPMDVEDRQKQMEAMRDLRAELDGRGV ii----%%%%-------------------------------------------------- DAELVADEWCNTVEDVKFFTDNKAGHMVQIKTPDLGGVNNIADAIMYCKANGMGAYCGGT ------2222---------1111-------3333--3333-------------------- NETNRSAEVTTNIGMACGARQVLAKPGMGVDEGMMIVKNEMNRVLALVGRRK -----------------------------------------------3333- ----------------------------------- ----------------------------------- >CELLOBIOSE DEHYDROGENASE; SWP:Q01738; PDB:1KDGA; TPYDYIIVGAGPGGIIAADRLSEAGKKVLLLERGGPSTKQTGGTYVAPWATSSGLTKFDI -------------------------------------3333-----33331111-33331 PGLFESLFTDSNPFWWCKDITVFAGCLVGGGTSVNGALYWYPNDGDFSSSVGWPSSWTNH 111--1111---1111----------22221111--------3333-3333--1111--- APYTSKLSSRLPSTDHPSTDGQRYLEQSFNVVSQLLKGQGYNQATINDNPNYKDHVFGYS 
-----------------1111---------------1111----11111111-------- AFDFLNGKRAGPVATYLQTALARPNFTFKTNVMVSNVVRNGSQILGVQTNDPTLGPNGFI ----iiii--3333--------1111-------------!!!!-------11112222-- PVTPKGRVILSAGAFGTSRILFQSGIGPTDMIQTVQSNPTAAAALPPQNQWINLPVGMNA --1111------3333----------------------3333----3333----2222-- QDNPSINLVFTHPSIDAYENWADVWSNPRPADAAQYLANQSGVFAGASPKLNFWRAYSGS -----------1111-----1111-----------------3333-------------11 DGFTRYAQGTVRPGAASVNSSLPYNASQIFTITVYLSTGIQSRGRIGIDAALRGTVLTPP 11----------------------------------2222--------1111-------- WLVNPVDKTVLLQALHDVVSNIGSIPGLTMITPDVTQTLEEYVDAYDPATMNSNHWVSST ---3333-----------1111--2222-----1111---------3333---------- TIGSSPQSAVVDSNVKVFGTNNLFIVDAGIIPHLPTGNPQGTLMSAAEQAAAKILALAGG ----3333---1111-2222---------------------------------------- P - >PLASTOCYANIN; SWP:Q7SIB8; PDB:1KDJ; AKVEVGDEVGNFKFYPDSITVSAGEAVEFTLVGETGHNIVFDIPAGAPGTVASELKAASM ------1111-----------2222------------------2222--------1111- DENDLLSEDEPSFKAKVSTPGTYTFYCTPHKSANMKGTLTVK ------3333-----------------1111----------- >CHYMOTRYPSIN B, B CHAIN; SWP:P07338; PDB:1KDQA; VNGEDAIPGSWPWQVSLQDKTGFHFCGGSLISEDWVVTAAHCGVKTSDVVVAGEFDQGSD ------22221111----1111----------------3333--1111------------ EENIQVLKIAQVFKNPKFNMFTVRNDITLLKLATPAQFSETVSAVSLPNVDDDFPPGTVC --------------------------------------1111------1111--2222-- ATTGWGKTKY ---------- >Chymotrypsinogen B [Precu; SWP:P07338; PDB:1KDQB; TPEKLQQAALPIVSEADCKKSWGSKITDVMTCAGASGVDSCMGDSGGPLVCQKDGVWTLA ----------------------3333--------2222-----2222-----iiii---- GIVSWGSGVCSTSTPAVYSRVTALMPWVQQILEAN -------------------3333-3333--1111- >CBP; SWP:P45481; PDB:1KDXA; GVRKGWHEHVTQDLRSHLVHKLVQAIFPTPDPAALKDRRMENLVAYAKKVEGDMYESANS ----3333----------------------3333-------------------------- RDEYYHLLAEKIYKIQKELEE --------------------- >POSSIBLE G-T MISMATCHES R; SWP:P29588; PDB:1KEAA; DATNKKRKVFVSTILTFWNTDRRDFPWRHTRDPYVILITEILLRRTTAGHVKKIYDKFFV ---------------3333-----1111--------------22223333------1111 
KYKCFEDILKTPKSEIAKDIKEIGLSNQRAEQLKELARVVINDYGGRVPRNRKAILDLPG ---3333------------3333--------------------iiii----3333--222 VGKYTCAAVMCLAFGKKAAMVDANFVRVINRYFGGSYENLNYNHKALWELAETLVPGGKC 2--------------------------------!!!!--------------11112222- RDFNLGLMDFSAIICAPRKPKCEKCGMSKLCSYYEKC --------------------3333--3333--3333- >28B4 FAB; SWP:GC1_MOUSE; PDB:1KELH; EVKLVESGGGLGQPGGSLRLSCATSGFTFTDYYFNWARQPPGKALEWLGFIRNKAKGYTT ------------2222-----------3333--------2222---------3333---- EYSASVKGRFTISRDNSQGILYLQMNTLRAEDSATYYCARWGSYAMDYWGQGTSVTVSSA -----2222-------------------3333---------1111--------------- KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ---------------------------------------%%%%-------------%%%% YTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVP ---------3333------------1111--------- >HEMAGGLUTININ HA1; SWP:NA; PDB:1KENH; DVHLQESGPGLVKPSQSLSLTCYVTGYSITSGYYWTWIRQFPGNKLEWMGYISYDGSNNY ----------------------------------------1111---------------- NPSLKNRISITRDTSKNQFFLKLNSVTAEDTASYYCAAFYYDYDFFFDYWGQGTTLTVSS -------------1111---------3333------------------------------ AKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ----------------------------------------%%%%-------------iii LYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD i---------------------------------------- >F65A/Y131C-MI CARBONIC AN; SWP:P23589; PDB:1KEQA; GTRQSPINIQWKDSVYDPQLAPLRVSYDAASCRYLWNTGYAFQVEFDDSCEDSGISGGPL ---------3333---1111-------3333-------------------------!!!! 
GNHYRLKQFHFHWGATDEWGSEHAVDGHTYPAELHLVHWNSTKYENCKKASVGENGLAVI ---------------1111-----iiii-----------------3333----------- GVFLKLGAHHQALQKLVDVLPEVRHKDTQVAMGPFDPSCLMPACRDYWTYPGSLTTPPLA ---------3333--33331111-2222---------1111------------------- ESVTWIVQKTPVEVSPSQLSMFRTLLFSGRGEEEDVMVNNYRPLQPLRDRKLRSSFRL --------------3333-3333-----2222--------------!!!!-------- >DTDP-D-GLUCOSE 4,6-DEHYDR; SWP:P26391; PDB:1KEWA; MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADIC ------1111------------------------1111--1111-1111--------111 DSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYALLEVARKYWSALGE 1-----------------------3333---3333-------------------1111-- DKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRA --------------1111---1111----------1111--------------------- WRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHA ----------------------------------1111-----!!!!-------3333-- RALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGH ----------2222---------------------------------1111-----2222 DRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGR ------------------------------------------------------------ Q - >NEUROPILIN-1; SWP:O14786; PDB:1KEXA; FKCMEALGMESGEIHSDQITASSQYSTNWSAERSRLNYPENGWTPGEDSYREWIQVDLGL --------1111--1111-------11113333-2222----------1111-------- LRFVTAVGTQGAISKETKKKYYVKTYKIDVSSNGEDWITIKEGNKPVLFQGNTNPTDVVV -----------------------------------------iiii--------------- AVFPKPLITRFVRIKPATWETGISMRFEVYGCKIT ----------------------------------- >ERYTHRONOLIDE SYNTHASE; SWP:Q03133; PDB:1KEZA; SSALRDGYRQAGVSGRVRSYLDLLAGLSDFREHFDGSDGFSLDLVDMADGPGEVTVICCA --------3333-----3333-1111-3333----------------------------- GTAAISGPHEFTRLAGALRGIAPVRAVPQPGYEEGEPLPSSMAAVAAVQADAVIRTQGDK ------33333333---------------------------------------------- PFVVAGHSAGALMAYALATELLDRGHPPRGVVLIDVYPPGHQDAMNAWLEELTATLFDRE ---------------------------------------------------3333----- TVRMDDTRLTALGAYDRLTGQWRPRETGLPTLLVSAGEPMGPWPDDSWKPTWPFEHDTVA 
----1111---------------------------------------------------- VPGDHFTMVQEHADAIARHIDAWLGGG ---1111-----333333333333--- >FUMARATE REDUCTASE FLAVOP; SWP:P00363; PDB:1KF6A; MQTFQADLAIVGAGGAGLRAAIAAAQANPNAKIALISKVYPMRSHTVAAEGGSAAVAQDH ---------------------------1111--------11113333----------111 DSFEYHFHDTVAGGDWLCEQDVVDYFVHHCPTEMTQLELWGCPWSRRPDGSVNVRRFGGM 13333-----------------------------------------1111------iiii KIERTWFAADKTGFHMLHTLFQTSLQFPQIQRFDEHFVLDILVDDGHVRGLVAMNMMEGT -------!!!!---------------3333-------------%%%%------------- LVQIRANAVVMATGGAGRVYRYNTNGGIVTGDGMGMALSHGVPLRDMEFVQYHPTGLPGS -------------------------1111--------1111----3333----------- GILMTEGCRGEGGILVNKNGYRYLQDYGMGPETPLGEPKNKYMELGPRDKVSQAFWHEWR --------1111----1111--3333------------2222------------------ KGNTISTPRGDVVYLDLRHLGEKKLHERLPFICELAKAYVGVDPVKEPIPVRPTAHYTMG ------1111-------------------------------------------------- GIETDQNCETRIKGLFAVGECSSVGLHGANRLGSNSLAELVVFGRLAGEQATERAATAGN ----1111---2222---1111----!!!!-22223333--------------1111--- GNEAAIEAQAAGVEQRLKDLVNQDGGENWAKIRDEMGLAMEEGCGIYRTPELMQKTIDKL ---------------------------3333----------------------------- AELQERFKRVRITDTSSVFNTDLLYTIELGHGLNVAECMAHSAMARKESRGAHQRLDEGC ----3333-----------------------------------------!!!!---2222 TERDDVNFLKHTLAFRDADGTTRLEYSDVKITTLPPA ---3333---------1111----------------- >FUMARATE REDUCTASE FLAVOP; SWP:P00364; PDB:1KF6B; AEMKNLKIEVVRYNPEVDTAPHSAFYEVPYDATTSLLDALGYIKDNLAPDLSYRWSCRMA -------------1111-------------1111-------------1111--------- ICGSCGMMVNNVPKLACKTFLRDYTDGMKVEALANFPIERDLVVDMTHFIESLEAIKPYI --------%%%%--1111-33331111------------!!!!----------1111--- IGNSRTADQGTNIQTPAQMAKYHQFSGCINCGLCYAACPQFGLNPEFIGPAAITLAHRYN -----3333-----3333---3333------3333-----------------------11 EDSRDHGKKERMAQLNSQNGVWSCTFVGYCSEVCPKHVDPAAAIQQGKVESSKDFLIATL 11----3333------11111111---3333--1111-3333------------------ KPR --- >MONOCLONAL ANTIBODY LIGHT; SWP:NA; PDB:1KFAH; VMLVESGGGLVKPGGSLKLSCAASGFTFSSYAMSWVRQTPERRLEWVATITTRGYTFYPD 
-----------2222-----------3333--------1111----------------33 SVKGRFTVSRDNARNTLNLQMSSLRSEDTAMFYCTREGLLLDYFTMDYWGQGTSVTVSSA 33---------1111---------1111-------------------------------- KTTPPSVYPLAPSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSV --------------------------------%%%%------------------------ TVPSSSRPSETVTCNVAHPASSTKVDKKIVPAD --3333------------1111----------- >PHOSPHOGLUCOMUTASE 1; SWP:P47244; PDB:1KFIA; QVIPAPRVQVTQPYAGQKPGTSGLRKKVSEATQPNYLENFVQSIFNTLRKDELKPKNVLF --------------------------3333--2222--------33333333-------- VGGDGRYFNRQAIFSIIRLAYANDISEVHVGQAGLMSTPASSHYIRKVNEEVGNCIGGII -----2222-----------1111------2222-------------------------- LTASHNPGGKEHGDFGIKFNVRTGAPAPEDFTDQIYTHTTKIKEYLTVDYEFEKHINLDQ --------------------1111------------3333----------3333--1111 IGVYKFEGTRLEKSHFEVKVVDTVQDYTQLMQKLFDFDLLKGLFSNKDFSFRFDGMHGVA ---------2222--------------------------------1111-----%%%%-- GPYAKHIFGTLLGCSKESLLNCDPSEDFGGGHPDPNLTYAHDLVELLDIHKKKDVGTVPQ --------------3333--------%%%%-----------------1111--3333--- FGAACDGDADRNMILGRQFFVTPSDSLAVIAANANLIFKNGLLGAARSMPTSGALDKVAA -----1111-----------------------1111-----------3333-3333---- KNGIKLFETPTGWKFFGNLMDAGLINLCGEESFGTGSNHIREKDGIWAVLAWLTILAHKN -----------3333------------------------------------------111 KNTDHFVTVEEIVTQYWQQFGRNYYSRYDYEQVDSAGANKMMEHLKTKFQYFEQLKQGNK 1------------------------------------------3333------------- ADIYDYVDPVDQSVSKNQGVRFVFGDGSRIIFRLSGTGSVGATIRIYFEQFEQQQIQHET -----------------------1111--------------------------------- ATALANIIKLGLEISDIAQFTGRNEPTVIT ---------------3333----------- >MAJOR OUTER MEMBRANE LIPO; SWP:P02937; PDB:1KFNA; SSNAKIDQLSSDVQTLNAKVDQASNDANAARSDAQAAKDDAARANQRLDNMAT -----------------------------------------------1111-- >EXCINUCLEASE ABC SUBUNIT ; SWP:P07028; PDB:1KFTA; TSSLETIEGVGPKRRQMLLKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH --1111------3333-------3333----3333--------------3333--- >Calpain-2 catalytic subun; SWP:P17655; PDB:1KFUL; 
AGIAAKLAKDREAAEGLGSHERAIKYLNQDYEALRNECLEAGTLFQDPSFPAIPSALGFK -3333-------------3333---%%%%3333-------------1111--3333---- ELGPYSSKTRGMRWKRPTEICADPQFIIGGATRTDICQGALGDCWLLAAIASLTLNEEIL --1111---------3333-----------------------3333----1111--1111 ARVVPLNQSFQENYAGIFHFQFWQYGEWVEVVVDDRLPTKDGELLFVHSAEGSEFWSALL ------------------------------------------------------------ EKAYAKINGCYEALSGGATTEGFEDFTGGIAEWYELKKPPPNLFKIIQKALQKGSLLGCS ----1111-11112222------1111-------1111---------------------- IDITSAADSEAITFQKLVKGHAYSVTGAEEVESNGSLQKLIRIRNPWGEVEWTGRWNDNC --------------------------------iiii--------1111------------ PSWNTIDPEERERLTRRHEDGEFWMSFSDFLRHYSRLEICNLTPDTLTSDTYKKWKLTKM -------11113333------------3333----------1111--------------- DGNWRRGSTAGGCRNYPNTFWMNPQYLIKLEEEDEDEEDGESGCTFLVGLIQKHRRRQRK ----2222----3333-------------------------------------------- MGEDMHTIGFGIYEVPEELSGQTNIHLSKNFFLTNRARERSDTFINLREVLNRFKLPPGE ---------------------------3333----------------------------- YILVPSTFEPNKDGDFCIRVFSEKKADYQAVDDEIEANLEEFDISEDDIDDGVRRLFAQL ------------------------------------------------%%%%-----333 AGEDAEISAFELQTILRRVLAKRQDIKSDGFSIETCKIMVDMLDSDGSGKLGLKEFYILW 3------3333----------------------------3333----------------- TKIQKYQKIYREIDVDRSGTMNSYEMRKALEEAGFKMPCQLHQVIVARFADDQLIIDFDN ----------------------1111------------------------1111------ FVRCLVRLETLFKIFKQLDPENTGTIELDLISWLCFSVL ------------------3333----------------- >CHITINASE B; SWP:Q9REI6; PDB:1KFWA; PLTSTVNGYRNVGYFAQWGVYGRAFQAKQLDVSGTAKNLTHINYSFGNINNQTLTCFMAN -----iiii------1111-3333-3333-11113333---------------------- KAQGTGPNGSDGAGDAWADFGMGYAADKSVSGKADTWDQPLAGSFNQLKQLKAKNPKLKV -----11112222-----------11113333---1111---------------1111-- MISLGGWTWSKNFSKAAATEASRQKLVSSCIDLYIKGNLPNFEGRGGAGAAAGIFDGIDI -----111111113333----------------3333----iiii-22222222------ DWEWPGTNSGLAGNGVDTVNDRANFKALLAEFRKQLDAYGSTNNKKYVLSAFLPANPADI ---2222---2222------------------------3333------------------ 
DAGGWDDPANFKSLDFGSIQGYDLHGAWNPTLTGHQANLYDDPADPRAPSKKFSADKAVK ---11113333--------------3333------------1111--3333--------- KYLAAGIDPKQLGLGLAAYGRGWTGAKNVSPWGPATDGAPGTYETANEDYDKLKTLGTDH --1111-3333--------------------------------2222-3333-------- YDAATGSAWRYDGTQWWSYDNIATTKQKTDYIVSKGLGGGMWWELSGDRNGELVGAMSDK --1111-------------------------------------33331111--------- FRAAAPGPVTEAAPP --------------- >Glycoprotein gp42; SWP:P03205; PDB:1KG0C; HTFQVPQNYTKANCTYCNTREYTFSYKGCCFYFTKKKHTWNGCFQACAEKYPCTYFYGPT -----------------1111----!!!!---------3333-----------------1 PDILPVVTRNLNAIESLWVGVYRVGEGNWTSLDGGTFKVYQIFGSHCTYVSKFSTVPVSH 111----11111111---------------1111----------------1111------ HECSFLKPCLCVSQRS ---------------- >NECROSIS INDUCING PROTEIN; SWP:Q02039; PDB:1KG1A; DRCRYTLCCDGALKAVSACLHESESCLVPGDCCRGKSRLTLCSYGEGGNGFQCPTGYRQC --------------------1111-----------------------------2222--- >A/G-SPECIFIC ADENINE GLYC; SWP:P17802; PDB:1KG2A; MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP ---------------------1111------------------3333------------- TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS --------3333--3333---------------------iiii----------2222--- TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF ---------------------------------1111----------------2222--- NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKP -----------------33331111---------3333------ >T-CELL RECEPTOR ALPHA CHA; SWP:NA; PDB:1KGCD; KTTQPNSMESNEEEPVHLPCNHSTISGTDYIHWYRQLPSQGPEYVIHGLTSNVNNRMASL ----------2222----------------------2222-------------------- AIAEDRKSSTLILHRATLRDAAVYYCILPLAGGTSYGKLTFGQGTILTVHPNIQNPDPAV --1111----------3333---------------------------------------- YQLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKTVLDMRSMDFKSNSAVAWSNKS -----1111----------3333------1111----------1111-----------11 DFACANAFNNSIIPEDTFFPS 113333-1111--1111---- >T-CELL RECEPTOR ALPHA CHA; SWP:NA; PDB:1KGCE; GVSQSPRYKVAKRGQDVALRCDPISGHVSLFWYQQALGQGPEFLTYFQNEAQLDKSGLPS 
-----------2222--------2222--------2222--------!!!!---1111-3 DRFFAERPEGSVSTLKIQRTQQEDSAVYLCASSLGQAYEQYFGPGTRLTVTEDLKNVFPP 333---1111----------3333----------1111--------------1111---- EVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLKEQPAL -------------------------------------iiii--2222---------3333 NDSRYCLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRA -------------3333--1111-----------1111---------------------- D - >PERIPHERAL PLASMA MEMBRAN; SWP:O14936; PDB:1KGDA; HMRKTLVLLGAHGVGRRHIKNTLITKHPDRFAYPIPHTTRPPEENGKNYYFVSHDQMMQD ----------2222------------1111-------------2222------------- ISNNEYLEYGSHEDAMYGTKLETIRKIHEQGLIAILDVEPQALKVLRTAEFAPFVVFIAA 1111-------iiii----3333----1111-------3333-----3333--------- PTITPGLNEDESLQRLQKESDILQRTYAHYFDLTIINNEIDETIRHLEEAVELVC ---1111-------------------3333------------------------- >TRANSTHYRETIN; SWP:P02767; PDB:1KGIA; SKCPLMVKVLDAVRGSPAVDVAVKVFKKTADGSWEPFASGKTAESGELHGLTTDEKFTEG ----------------------------1111----------1111------3333---- VYRVELDTKSYWKALGISPFHEYAEVVFTANDSGHRHYTIAALLSPYSYSTTAVVSNPQN ------------1111----------------------------1111------------ ----------------------------------- >RIBONUCLEOTIDE REDUCTASE ; SWP:O69274; PDB:1KGNA; SNEYDEYIANHTDPVKAINWNVIPDEKDLEVWDRLTGNFWLPEKIPVSNDIQSWNKMTPQ -1111--1111------------------------1111-3333-3333----1111--- EQLATMRVFTGLTLLDTIQGTVGAISLLPDAETMHEEAVYTNIAFMESVHAKSYSNIFMT --------------------------3333------------------------------ LASTPQINEAFRWSEENENLQRKAKIIMSYYNGDDPLKKKVASTLLESFLFYSGFYLPMY --------------------------------------------------3333------ LSSRAKLTNTADIIRLIIRDESVHGYYIGYKYQQGVKKLSEAEQEEYKAYTFDLMYDLYE -1111-3333-------------------------1111--------------------- NEIEYTEDIYDDLGWTEDVKRFLRYNANKALNNLGYEGLFPTDETKVSPAILSSLS ----------3333-----------------1111-----3333---3333----- >DNA BINDING RESPONSE REGU; SWP:Q9WYN0; PDB:1KGSA; NVRVLVVEDERDLADLITEALKKEFTVDVCYDGEEGYALNEPFDVVILDILPVHDGWEIL ------------------------------------------------------------ 
KSRESGVNTPVLLTALSDVEYRVKGLNGADDYLPKPFDLRELIARVRALIRRKSESKSTK --1111---------------1111----------------------------------- LVCGDLILDTATKKAYRGSKEIDLTKKEYQILEYLVNKNRVVTKEELQEHLWVFSDVLRS --!!!!----------%%%%----------------2222------------3333---- HIKNLRKKVDKGFKKKIIHTVRGIGYVARDE ---------2222-------2222------- >EPHRIN TYPE-B RECEPTOR 2; SWP:P54763; PDB:1KGYA; AEETLMDSTTATAELGWMVHPPSGWEEVSGYDENMNTIRTYQVCNVFESSQNNWLRTKFI --------------------1111--------------------1111------------ RRRGAHRIHVEMKFSVRDCSSIPSVPGSCKETFNLYYYEADFDLATKTFPNWMENPWVKV -%%%%------------3333--------------------------------------- DTIAADESFSQVDLGGRVMKINTEVRSFGPVSRNGFYLAFQDYGGCMSLIAVRVFYRKCP ----------------1111---------------------------------------- R - >PROTEIN L; SWP:Q51912; PDB:1KH0A; EEVTIKANLIFANGSTQTAEFKGTKEKALSEVLAYADTLKKDNGEWTIDKRVTNGVIILN ----------1111--------------------------------------iiii---- IKFAG ----- >Phosphoenolpyruvate carbo; SWP:P35558; PDB:1KHBA; NLSAKVVQGSLDSLPQAVREFLENNAELCQPDHIHICDGSEEENGRLLGQMEEEGILRRL 3333-----3333----------------------------------------------3 KKYDNCWLALTDPRDVARIESKTVIVTQEQRDTVPIPKTGLSQLGRWMSEEDFEKAFNAR 333--------1111---3333------3333---------------------------- FPGCMKGRTMYVIPFSMGPLGSPLSKIGIELTDSPYVVASMRIMTRMGTPVLEALGDGEF 22222222-------------1111-----------------------------!!!!-- VKCLHSVGCPLPLQKPLVNNWPCNPELTLIAHLPDRREIISFGSGYGGNSLLGKKCFALR -----------------%%%%--3333------1111--------3333----------- MASRLAKEEGWLAEHMLVLGITNPEGEKKYLAAAFPSACGKTNLAMMNPSLPGWKVECVG ----------------------1111---------22223333-------2222------ DDIAWMKFDAQGHLRAINPENGFFGVAPGTSVKTNPNAIKTIQKNTIFTNVAETSDGGVY --------1111--------------22223333-------------------1111--- WEGIDEPLASGVTITSWKNKEWSSEDGEPCAHPNSRFCTPASQCPIIDAAWESPEGVPIE 2222----2222---1111---1111-----1111----333311111111-1111---- GIIFGGRRPAGVPLVYEALSWQHGVFVGAAMRSEAKIIMHDPFAMRPFFGYNFGKYLAHW --------------------------------------------1111------------ LSMAQHPAAKLPKIFHVNWFRKDKEGKFLWPGFGENSRVLEWMFNRIDGSTKLTPIGYIP 
3333------------------1111-----!!!!----------1111----1111--- KEDALNLKGLGHINMMELFSISKEFWDKEVEDIEKYLVDQVNADLPCEIEREILALKQRI 2222--2222------------------------------!!!!-3333----------1 SQM 111 >DNA CYTOSINE-5 METHYLTRAN; SWP:O88509; PDB:1KHCA; TEYQDDKEFGIGDLVWGKIKGFSWWPAMVVSWKATSKRQAMPGMRWVQWFGDGKFSEISA ---------2222-----------------3333------2222--------------11 DKLVALGLFSQHFNLATFNKLVSYRKAMYHTLEKARVRAGKTFSSSPGESLEDQLKPMLE 11--33333333---------------------------------2222----------- WAHGGFKPTGIEGLKPN ---------3333---- >ANTHRANILATE PHOSPHORIBOS; SWP:Q8VP84; PDB:1KHDA; THQPILEKLFKSQSMTQEESHQLFAAIVRGELEDSQLAAALISMKMRGERPEEIAGAASA --------1111------------------------------------------------ LLADAQPFPRPDYDFADIVGTGGDGTNSINISTASAFVAASCGAKVAKHGNRCDLLQAFG -1111----------------------------------1111---------33331111 IRLDMSAEDSRQALDDLNVCFLFAPQYHTGFRHAMPVRQQLKTRTIFNVLGPLINPARPP -1111-----------------3333-11111111--------------3333-1111-- KALIGVYSPELVLPIAQALKVLGYKNAAVVHGGGMDEVAIHTPTQVAELNNGEIESYQLS -------3333--------------------iiii--------------iiii------3 PQDFGLQSYSLNALQGGTPEENRDILARLLQGKGDAAHARQVAANVALLLKLFGQDNLRH 333------3333---------------1111----------------3333-------- NAQLALETIRSGTAFERVTALAAR -----------3333-----1111 >GUANIDINOACETATE METHYLTR; SWP:P10868; PDB:1KHHA; RWETPYMHSLAAAAASRGGRVLEVGFGMAIAASRVQQAPIKEHWIIECNDGVFQRLQNWA 1111----------1111-------!!!!-----1111---------------------1 LKQPHKVVPLKGLWEEVAPTLPDGHFDGILYDTYPLSEETWHTHQFNFIKTHAFRLLKPG 111---------33333333----------------1111-----------------222 GILTYCNLTSWGELMKSKYTDITAMFEETQVPALLEAGFQRENICTEVMALVPPADCRYY 2-----3333--1111------------------3333-3333----------1111--- AFPQMITPLVTKH ------------- >HEX1; SWP:P87252; PDB:1KHIA; GSASQTVTIPCHHIRLGDILILQGRPCQVIRISTSAATGQHRYLGVDLFTKQLHEESSFV ---------1111-2222---iiii----------------------------------- SNPAPSVVVQTMLGPVFKQYRVLDMQDGSIVAMTETGDVKQNLPVIDQSSLWNRLQKAFE ---2222--------------------------1111----------%%%%--------- SGRGSVRVLVVSDHGREMAVDMKVVHG 
-3333-------iiii----------- >ALPHA-TOXIN; SWP:Q9RF12; PDB:1KHOA; WDGKADGTGTHAMIATQGVTILENDLSSNEPEVIRNNLEILKQNMHDLQLGSTYPDYDKN ---1111-------------------11113333------------------3333---- AYDLYQDHFWDPDTDNNFTKDSKWYLSYSIPDTAESQIRKFSALARYEWKRGNYKQATFY ----1111--1111-------3333-------3333------------1111-------- LGEAMHYFGDADTPYHAANVTAVDSPGHVKFETFAEDRKDQYKINTTGSKTNDAFYSNIL --------------------3333-------------3333--------33333333111 TNEDFNSWSKEFARSFAKTAKDLYYSHANMSCSWDEWDYAAKVALANSQKGTSGYIYRFL 1---------------------------1111---------------------------- HDVSDGKDSSANKNVNELVAYITTGGEKYAGTDDYMYFGIKTKDGQTQEWTMDNPGNDFM --------------------------1111-----------1111--------------2 TGSQDTYTFKLKDKNLKIDDIQNMWIRKSKYTEFGDDYKPANIKVIANGNVVLNKDINEW 222-------------3333--------------------------iiii---------- ISGNSTYNIK ---------- >ADENYLATE KINASE; SWP:P43411; PDB:1KHTA; NKVVVVTGVPGVGSTTSSQLAMDNLRKEGVNYKMVSFGSVMFEVAKEENLVSDRDQMRKM --------2222---------------------------------1111---33331111 DPETQKRIQKMAGRKIAEMAKESPVAVDTHSTVSTPKGYLPGLPSWVLNELNPDLIIVVE -----------------3333-------------1111-----3333------------- TTGDEILMRRMSDETRVRDLDTASTIEQHQFMNRCAAMSYGVLTGATVKIVQNRNGLLDQ -----------------------------------------------------2222--- AVEELTNVLR ---------- >SMAD1; SWP:Q15797; PDB:1KHUA; PKHWCSIVYYELNNRVGEAFHASSTSVLVDGFTDPSNNKNRFCLGLLSNVNRNSTIENTR ----------!!!!-----------------------------3333-1111-------- RHIGKGVHLYYVGGEVYAECLSDSSIFVQSRNCNYHHGFHPTTVCKIPSGCSLKIFNNQE --!!!!-----%%%%------------------------1111----2222--------- FAQLLAQSVNHGFETVYELTKMCTIRMSFVKGWGAEYHRQDVTSTPCWIEIHLHGPLQWL --------------------1111---------2222---1111---------------- DKVLTQMGSPHNPISSVS --3333------------ >RNA-DIRECTED RNA POLYMERA; SWP:P27410; PDB:1KHVA; FCGEPIDYRGITAHRLVGAEPRPPVSGTRYAKVPGVPDEYKTGYRPANLGRSDPDSDKSL -------%%%%-------------------------3333--------!!!!--3333-- MNIAVKNLQVYQQEPKLDKVDEFIERAAADVLGYLRFLTKGERQANLNFKAAFNTLDLST ---------1111-------------------------iiii----------3333---- 
SCGPFVPGKKIDHVKDGVMDQVLAKHLYKCWSVANSGKALHHIYACGLKDELRPLDGKKR --1111--3333---------------------1111----------------------- LLWGCDVGVAVCAAAVFHNICYKLKMVARFGPIAVGVDMTSRDVDVIINNLTSKASDFLC --------------------------3333---2222----------------------- LDYSKWDSTMSPCVVRLAIDILADCCEQTELTKSVVLTLKSHPMTILDAMIVQTKRGLPS ----3333--------------1111-----------1111-----------------11 GMPFTSVINSICHWLLWSAAVYKSCAEIGLHCSNLYEDAPFYTYGDDGVYAMTPMMVSLL 11-----------------------1111----3333------!!!!------------- PAIIENLRDYGLSPTAADKTEFIDVCPLNKISFLKRTFELTDIGWVSKLDKSSILRQLEW -------1111---------------1111--%%%%------------------------ SKTTSRHMVIEETYDLAKEERGVQLEELQVAAAAHGQEFFNFVCRELERQQAYTQFSVYS ------------------------------------------------------------ YDAARKILADRKR --------3333- >SMAD2; SWP:Q15796; PDB:1KHXA; PVTYSEPAFWCSIAYYELNQRVGETFHASQPSLTVDGFTDPSNSERFCLGLLSNVNRNAT ----------------!!!!----------------------1111-3333--1111--- VEMTRRHIGRGVRLYYIGGEVFAECLSDSAIFVQSPNCNQRYGWHPATVCKIPPGCNLKI -------!!!!-----!!!!-------------------1111-1111----2222---- FNNQEFAALLAQSVNQGFEAVYQLTRMCTIRMSFVKGWGAEYRRQTVTSTPCWIELHLNG -----------3333--------3333------------------1111----------- PLQWLDKVLTQMGSPSVRCSM -------3333---------- >CLPB PROTEIN; SWP:P03815; PDB:1KHYA; DRLTNKFQLALADAQSLALGHDNQFIEPLHLSALLNQEGGSVSPLLTSAGINAGQLRTDI --------------------------3333--33332222------1111---------- NQALNRLPQVQPSQDLVRVLNLCDKLAQKRGDNFISSELFVLAALESRGTLADILKAAGA ---1111----------------------------3333----1111------------- TTANITQAIEQ ----------- >ANGIOSTATIN; SWP:P00747; PDB:1KI0A; LSECKTGNGKNYRGTMSKTKNGITCQKWSSTSPHRPRFSPATHPSEGLEENYCRNPDNDP -----!!!!---------1111----1111--------33331111--------111111 QGPWCYTTDPEKRYDYCDILECEEECMHCSGENYDGKISKTMSGLECQAWDSQSPHAHGY 11------1111------------------1111------1111----1111-------- IPSKFPNKNLKKNYCRNPDRELRPWCFTTDPNKRWELCDIPRCTTPPPSSGPTYQCLKGT 33331111---------------------1111------------------------!!! 
GENYRGNVAVTVSGHTCQHWSAQTPHTHERTPENFPCKNLDENYCRNPDGKRAPWCHTTN !---------1111----1111--------33331111---------------------1 SQVRWEYCKIPSC 111---------- >Intersectin-1; SWP:Q15811; PDB:1KI1B; DMLTPTERKRQGYIHELIVTEENYVNDLQLVTEIFQKPLMESELLTEKEVAMIFVNWKEL -------------------------------------------------------3333- IMCNIKLLKALRVRKKMSGEKMPVKMIGDILSAQLPHMQPYIRFCSRQLNGAALIQQKTD ---------------------------------33333333------------------- EAPDFKEFVKRLEMDPRCKGMPLSSFILKPMQRVTRYPLIIKNILENTPENHPDHSHLKH --------------3333---33331111---------------11111111-------- ALEKAEELCSQVNEGVREKENSDRLEWIQAHVQCEGLSEQLVFNSVTNCLGPRKFLHSGK ------------------------------------------------------------ LYKAKNNKELYGFLFNDFLLLTQITKPKVFSPKSNLQYMYKTPIFLNEVLVKLPTDPSGD --------------1111--------------------------1111------------ FHISHIDRVYTLRAESINERTAWVQKIKAASELYIETEKKKR ---------------3333----------------------- >ADENYLATE KINASE; SWP:P43410; PDB:1KI9A; KNKLVVVTGVPGVGGTTITQKAMEKLSEEGINYKMVNFGTVMFEVAQEENLVEDRDQMRK -----------------------------------------------------3333333 LDPDTQKRIQKLAGRKIAEMVKESPVVVDTHSTIKTPKGYLPGLPVWVLNELNPDIIIVV 33333------------3333--------------1111-----33333333-------- ETSGDEILIRRLNDETRNRDLETTAGIEEHQIMNRAAAMTYGVLTGATVKIIQNKNNLLD -------------3333-------------------------------------2222-- YAVEELISVLR ----3333--- >inosine-adenosine-guanosi; SWP:Q9GPQ4; PDB:1KICA; SAKNVVLDHAGNLDDFVAMVLLASNTEKVRLIGALCTDADCFVENGFNVTGKIMCLMHNN -----------3333---------3333-------------------------------- MNLPLFPIGKSAATAVNPFPKEWRCLAKNMDDMPILNIPENVELWDKIKAENEKYEGQQL --------------------------3333--3333-3333--------3333------- LADLVMNSEEKVTICVTGPLSNVAWCIDKYGEKFTSKVEECVIMGGAVDVRGNVFLPSTD ----1111-------------------------3333------------------1111- GTAEWNIYWDPASAKTVFGCPGLRRIMFSLDSTNTVPVRSPYVQRFGEQTNFLLSILVGT ---3333---------1111--------33331111------------1111-------- MWAMCTYYAWDALTAAYVVDQKVANVDPVPIDVVVDKQPNEGATVRTDAENYPLTFVARN -1111--------------3333--------------1111------------------- PEAEFFLDMLLRSARAC 
----------------- >GROEL (HSP60 CLASS); SWP:P06139; PDB:1KID; GLVPRGSEGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAVAKA -----------------1111---1111-----------------3333----------- GKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVISE -------------------------------------!!!!-----------------33 EIGMELEKATLEDLGQAKRVVINKDTTTIIDGVGEEAAIQGRVAQIRQQIEEATSDYDRE 33--3333-3333---------1111---------------------------------- KLQERVAKLAGGV ---------1111 >Coagulation factor X [Pre; SWP:P00743; PDB:1KIGH; IVGGRDCAEGECPWQALLVNEENEGFCGGTILNEFYVLTAAHCLHQAKRFTVRVGDRNTE ---------------------------------------1111----------------- QEEGNEMAHEVEMTVKHSRFVKETYDFDIAVLRLKTPIRFRRNVAPACLPEKDWAEATLM ----------------------------------------2222-------3333----- --------------------------------------------------- >DNA GYRASE SUBUNIT B; SWP:Q9LCX5; PDB:1KIJA; AIRVLKGLEGVRHRPAMYIGGTGVEGYHHLFKEILDNAVDEALAGYATEILVRLNEDGSL -----!!!!-3333----------------------------------------1111-- TVEDNGRGIPVDLMPEEGKPAVEVIYNTLHSGGKFEQGAYKVSGGLHGVGASVVNALSEW --------------1111---------------3333---------!!!!---------- TVVEVFREGKHHRIAFSRGEVTEPLRVVGEAPRGKTGTRVTFKPDPEIFGNLRFDPSKIR ------iiii------iiii----------2222----------3333!!!!--3333-- ARLREVAYLVAGLKLVFQDRQHGKEEVFLDKGGVASFAKALAEGEDLLYEKPFLIRGTHG ---------2222-----3333-------3333------1111---------------!! 
EVEVEVGFLHTQGYNAEILTYANMIPTRDGGTHLTAFKSAYSRALNQYAKKAGLNKEKGP !!-------------------iiii-1111------------------------------ QPTGDDLLEGLYAVVSVKLPNPQFEGQTKGKLLNPEAGTAVGQVVYERLLEILEENPRIA ------------------------------------------------------------ KAVYEKALRAAQAREAARKARELV --------------------1111 >Complexin-1; SWP:P63041; PDB:1KILE; KKEEERQEALRQAEEERKAKYAKMEAEREVMRQGIRDKYGI 3333------------------------------------- >APOLIPOPROTEIN A; SWP:P08519; PDB:1KIV; CYHGNGQSYRGTFSTTVTGRTCQSWSSMTPHRHQRTPENYPNDGLTMNYCRNPDADTGPW --!!!!---------1111----3333--------33331111--!!!!----------- CFTMDPSIRWEYCNLTRC ----3333---------- ----------------------------------- >LECTIN I; SWP:Q38784; PDB:1KJ1A; RNLLTNGEGLYAGQSLDVEPYHFIMQEDCNLVLYDHSTSVWASNTGILGKKGCKAVLQSD ----2222--2222---!!!!----1111-----!!!!--------2222-------111 GNFVVYDAEGRSLWASHSVRGNGNYVLVLQEDGNVVIYGSDIWSTGTYK 1----------------------------1111---------------- >II lectin [Precursor] [Fr; SWP:Q38783; PDB:1KJ1D; RNILMNDEGLYAGQSLDVEPYHLIMQEDCNLVLYDHSTAVWTTNTDIPGKKGCKAVLQSD ----2222--2222-----------1111-----------------2222-------111 GNFVVYDAEGRSLWASHSVRGNGNYVLVLQEDGNVVIYGSDIWSTNTYK 1-----1111-------------------1111---------------- >BETA-DEFENSIN 3; SWP:P81534; PDB:1KJ6A; GIINTLQKYYCRVRGGRCAVLSCLPKEEQIGKCSTRGRKCCRRKK -----1111--------------1111------------------ >INTEGRASE; SWP:P03700; PDB:1KJKA; DLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGH --------1111-------------------------------1111-- >BETA-2-MICROGLOBULIN; SWP:P16391; PDB:1KJMA; GSHSLRYFYTAVSRPGLGEPRFIAVGYVDDTEFVRFDSDAENPRMEPRARWMEREGPEYW -------------2222----------!!!!-----------------1111-------- EQQTRIAKEWEQIYRVDLRTLRGYYNQSEGGSHTIQEMYGCDVGSDGSLLRGYRQDAYDG ----------------------1111-------------------------------iii RDYIALNEDLKTWTAADFAAQITRNKWERARYAERLRAYLEGTCVEWLSRYLELGKETLL i-----1111------3333-------1111----------------------------- RSDPPEAHVTLHPRPEGDVTLRCWALGFYPADITLTWQLNGEDLTQDMELVETRPAGDGT -------------1111---------------------iiii------------------ 
FQKWASVVVPLGKEQNYTCRVEHEGLPKPLSQRWEPL ---------22221111-----3333----------- >MTH0777; SWP:O26871; PDB:1KJNA; TGKALVLGCPESPVQIPLAIYTSHKLKKKGFRVTVTANPAALRLVQVADPEGIYTDEVDL -----------1111----------3333-------------------1111-------- ESCINELAEGDYEFLAGFVPNDAAAAYLVTFAGILNTETLAIIFDRDADVLEELVNEIET -------2222------------------------------------------------- LDAEIIAARAHHNPAPLRVRIDRFEEKP ---------------------------- >PHOSPHORIBOSYLGLYCINAMIDE; SWP:P33221; PDB:1KJQA; TLLGTALRPAATRVMLLGSGELGKEVAIECQRLGVEVIAVDRYADAPAMHVAHRSHVINM ----2222---------------------------------------3333-------11 LDGDALRRVVELEKPHYIVPEIEAIATDMLIQLEEEGLNVVPCARATKLTMNREGIRRLA 11-------------------------------1111----------------------- AEELQLPTSTYRFADSESLFREAVADIGYPCIVKPVMSKGQTFIRSAEQLAQAWKYAQQG -1111----------------------------------------3333----------- GRAGAGRVIVEGVVKFDFEITLLTVSAVDGVHFCAPVGHRQEDGDYRESWQPQQMSPLAL 2222----------------------1111-----------iiii--------------- ERAQEIARKVVLALGGYGLFGVELFVCGDEVIFSEVSPRPHDTGMVTLISQDLSEFALHV --------------------------!!!!----------3333-3333----------- RAFLGLPVGGIRQYGPAASAVILPQLTSQNVTFDNVQNAVGADLQIRLFGKPEIDGSRRL ----------------------------------3333---------------------- GVALATAESVVDAIERAKHAAGQVKVQG --------------------3333---- >C5A; SWP:P01031; PDB:1KJS; MLQKKIEEIAAKYKHSVVKKCCYDGACVNNDETCEQRAARISLGPRCIKAFTECCVVASQ --------------3333-------------------------3333------------- LRANISHKDMQLGR -------------- >BETA-2-MICROGLOBULIN; SWP:Q95565; PDB:1KJVA; GSHSLRYFDIAVSRPGLGEPRYISVGYVDDTEFARYDSDAENRRYQPRARWMEREGPEYW -------------2222----------iiii-----1111-----33331111------- ERNTPIYKGKEQTFRVNLRTLRGYYNQSEGGSHTIQEMYGCDVGSDGSLLRGYEQFAYDG -------------------------------------------1111----------iii RDYIALNEDLKTWTAADFAARISRNKLERDGFADLHRAYLEGECVESLRRYLELGKETLL i-----1111-------------------------------------------------- RSDPPKAHVTLHPRPEGDVTLRCWALGFYPADITLTWQLNGEDLTQDMELVETRPAGDGT -------------3333---------------------iiii-3333------------- 
FQKWASVVVPLGKEQNYTCRVEHEGLPKPLSQRWEP ---------22221111-----1111---------- >POSTSYNAPTIC DENSITY PROT; SWP:P31016; PDB:1KJWA; GFYIRALFDYDKTKDCGFLSQALSFRFGDVLHVIDAGDEEWWQARRVHSDSETDDIGFIP ----------3333-----------2222------------------------------- SKRRVERREWSRLKWGSSSGSQGREDSVLSYETVTQMEVHYARPIIILGPTKDRANDDLL ------------------------------------------------2222-------- SEFPDKFGSCVPHTTRPKREYEIDGRDYHFVSSREKMEKDIQAHKFIEAGQYNSHLYGTS --1111------------11112222-------------------------iiii----- VQSVREVAEQGKHCILDVSANAVRRLQAAHLHPIAIFIRPRSLENVLEINKRITEEQARK ------------------3333----1111-----------3333--------------- AFDRATKLEQEFTECFSAIVEGDSFEEIYHKVKRVIEDLSGPYIWVPARERL -----------3333------------------------------------- >Regulator of G-protein si; SWP:O08773; PDB:1KJYB; DIEGLVELLNRVQSSGAHDQRGLLRKEDLVLPEFL -----------1111---------3333---1111 >EIF2GAMMA; SWP:Q9V1G0; PDB:1KK1A; SRQAEVNIGMVGHVDHGKTTLTKALTGVWTDSEELRRGITIKIGFADAEIRRCPNCGRYS ------------2222---------------3333------------------------- TSPVCPYCGHETEFVRRVSFIDAPGHEALMTTMLAGASLMDGAILVIAANEPCPRPQTRE -----------------------------------3333--------------------- HLMALQIIGQKNIIIAQNKIELVDKEKALENYRQIKEFIEGTVAENAPIIPISALHGANI ------------------1111----------------2222-1111------1111-33 DVLVKAIEDFIPTPKRDPNKPPKMLVLRSFDVNKPGKLVGGVLDGSIVQGKLKVGDEIEI 33--------------1111--------------------------------2222---- RPGVPYEEHGRIKYEPITTEIVSLQAGGQFVEEAYPGGLVGVGTKLDPYLTKGDLMAGNV -------iiii--------------iiii-----------------3333-%%%%2222- VGKPGKLPPVWDSLRLEVHLLERVVEQELKVEPIKRKEVLLLNVGTARTMGLVTGLGKDE --2222----------------------------2222-----!!!!------------- IEVKLQIPVCAEPGDRVAISRQIGSRWRLIGYGIIKE -----------2222---------------------- >MANGANESE SUPEROXIDE DISM; SWP:Q92450; PDB:1KKCA; QQYTLPPLPYPYDALQPYISQQIMELHHKKHHQTYVNGLNAALEAQKKAAEATDVPKLVS ----------1111---------------------------------------------- VQQAIKFNGGGHINHSLFWKNLAPEKSGGGKIDQAPVLKAAIEQRWGSFDKFKDAFNTTL ------------------1111-111122221111------------------------1 
LGIQGSGWGWLVTDGPKGKLDITTTHDQDPVTGAAPVFGVDMWEHAYYLQYLNDKASYAK 111---------------------------------------3333----!!!!------ GIWNVINWAEAENRYIAGDK 3333---------------- >RIBOSOME-BINDING FACTOR A; SWP:P09170; PDB:1KKGA; MAKEFGRPQRVAQEMQKEIALILQREIKDPRLGMMTTVSGVEMSRDLAYAKVYVTFLNDK ----------------------------1111------------------------3333 DEDAVKAGIKALQEASGFIRSLLGKAMRLRIVPELTFFYDNSLVEGMR ---------------3333----------------------------- >MEVALONATE KINASE; SWP:Q58487; PDB:1KKHA; PRGSHMIIETPSKVILFGEHAVVYGYRAISMAIDLTSTIEIKETQEDEIILNLNDLNKSL 2222---------------3333-----------------------------3333---- GLNLNEIKNINPNNFGDFKYCLCAIKNTLDYLNIEPKTGFKINISSKIPISCGLGSSASI --111111111111!!!!------------------------------------------ TIGTIKAVSGFYNKELKDDEIAKLGYMVEKEIQGKASITDTSTITYKGILEIKNNKFRKI ---------1111--------------------------3333----------------- KGEFEEFLKNCKFLIVYAEKRKKKTAELVNEVAKIENKDEIFKEIDKVIDEALKIKNKED -------1111-------------------33331111-------------1111----- FGKLMTKNHELLKKLNISTPKLDRIVDIGNRFGFGAKLTGAGGGGCVIILVNEEKEKELL ------------1111-----------------------------------3333----- KELNKEDVRIFNCRMMN --1111----------- >HPRK PROTEIN; SWP:Q9RE09; PDB:1KKMA; ERRSMHGVLVDIYGLGVLITGDSGVGKSETALELVQRGHRLIADDRVDVYQQDEQTIVGA -----------iiii-------------------1111---------------------- APPILSHLLEIRGLGIIDVMNLFGAGAVREDTTISLIVHLENWTPDKTFDRLGSGEQTQL -3333------------3333--3333--------------------------------- IFDVPVPKITVPVKVGRNLAIIIEVAAMNFRAKSMGYDATKTFEKNLNHLIEHNEE %%%%---------2222---------------1111-------------------- >MANNOSYL-OLIGOSACCHARIDE ; SWP:P31723; PDB:1KKTA; SNQAKADAVKEAFQHAWNGYMKYAFPHDELTPVSNGHADSRNGWGASAVDALSTAVIMGK ------------------------------------------------3333-------- ADVVNAILEHVADIDFSKTSDTVSLFETTIRYLAGMLSGYDLLQGPAKNLVDNQDLIDGL ---------3333-1111--------------------------1111----3333---- LDQSRNLADVLKFAFDTPSGVPYNNINITSHGNDGATTNGLAVTGTLVLEWTRLSDLTGD -----------3333-3333-------------------3333----------------- EEYAKLSQKAESYLLKPQPSSSEPFPGLVGSSININDGQFADSRVSWNGGDDSFYEYLIK 
----------3333----3333--2222-------------------2222--------- MYVYDPKRFETYKDRWVLAAESTIKHLKSHPKSRPDLTFLSSYSNRNYDLSSQHLTCFDG ------------------------------1111------------------3333---- GSFLLGGTVLDRQDFIDFGLELVDGCEATYNSTLTKIGPDSWGWDPKKVPSDQKEFYEKA --------------------------------1111--------1111-1111------- GFYISSGSYVLRPEVIESFYYAHRVTGKEIYRDWVWNAFVAINSTCRTDSGFAAVSDVNK -----------------------------------------------1111-----1111 ANGGSKYDNQESFLFAEVMKYSYLAHSEDAAWQVQKGGKNTFVYNTEAHPISVAR iiii------3333---------1111--1111--!!!!-----1111------- >TRANSCRIPTION REGULATORY ; SWP:P09547; PDB:1KKXA; NNKQYELFMKSLIENCKKRNMPLQSIPEIGNRKINLFYLYMLVQKFGGADQVTRTQQWSM --3333-------------------------------3333-3333-3333--------- VAQRLQISDYQQLESIYFRILLPYERHMISQEGIKETQAKRI ------------------------------------------ >SERINE HYDROXYMETHYLTRANS; SWP:Q7SIB6; PDB:1KL1A; MKYLPQQDPQVFAAIEQERKRQHAKIELIASENFVSRAVMEAQGSVLTNKYAEGYPGRRY --3333----------------------1111----------------------2222-- YGGCEYVDIVEELARERAKQLFGAEHANVQPHSGAQANMAVYFTVLEHGDTVLGMNLSHG ---3333---------------------------------------2222-----1111- GHLTHGSPVNFSGVQYNFVAYGVDPETHVIDYDDVREKARLHRPKLIVAAASAYPRIIDF -1111-11113333---------------------------------------------- AKFREIADEVGAYLMVDMAHIAGLVAAGLHPNPVPYAHFVTTTTHKTLRGPRGGMILCQE ------------------1111--1111----3333---------3333---------33 QFAKQIDKAIFPGIQGGPLMHVIAAKAVAFGEALQDDFKAYAKRVVDNAKRLASALQNEG 33------------------------------------------------------1111 FTLVSGGTDNHLLLVDLRPQQLTGKTAEKVLDEVGITVNKNTIPYDPESPFVTSGIRIGT --2222----------3333----------------------------1111-------- AAVTTRGFGLEEMDEIAAIIGLVLKNVGSEQALEEARQRVAALTD ----------------------1111------------------- >THREONINE SYNTHASE; SWP:P16120; PDB:1KL7A; PNASQVYRSTRSSSPKTISFEEAIIQGLATDGGLFIPPTIPQVDQATLFNDWSKLSFQDL -1111---1111----------------1111------------------3333------ AFAIRLYIAQEEIPDADLKDLIKRSYSTFRSDEVTPLVQNVTGDKENLHILELFHGPTYA ----11113333-------------1111-1111-----1111------------1111- 
FKDVALQFVGNLFEYFLQRTNANLPEGEKKQITVVGATSGDTGSAAIYGLRGKKDVSVFI --------------------1111-----------------3333----2222------- LYPTGRISPIQEEQTTVPDENVQTLSVTGTFDNCQDIVKAIFGDKEFNHNVGAVNSINWA --2222------------1111-------3333------------------------333 RILAQTYYFYSFFQATNGKDSKKVKFVVPSGNFGDILAGYFAKKGLPIEKLAIATNENDI 3----3333-----------------------3333------------------------ LDRFLKSGLYERSDKVAATLSPADILISSNFERLLWYLAREYLANGDDLKAGEIVNNWFQ --3333---------------------1111------------iiii------------- ELKTNGKFQVDKSIIEGASKDFTSERVSNEETSETIKKIYESSVNPKHYILDPHTAVGVC ------------------------------------------------------------ ATERLIAKDNDKSIQYISLSTAHPAKFADAVNNALSGFSNYSFEKDVLPEELKKLSTLKK ----------3333--------3333-----------33333333---3333-3333--- KLKFIERADVELVKNAIEEELAK ----------------------- >EUKARYOTIC TRANSLATION IN; SWP:P05198; PDB:1KL9A; LSCRFYQHKFPEVEDVVVNVRSIAEGAYVSLLEYNNIEGILLSELRIGRNECVVVIRVDK -----------2222------------------%%%%---3333-2222----------3 EKGYIDLSKRRVSPEEAIKCEDKFTKSKTVYSILRHVAEVLEYTKDEQLESLFQRTAWVF 333-----1111--------------------------------3333------------ DDKYKRPGYGAYDAFKHAVSDPSILDSLDLNEDEREVLINNINRR ----------------------1111------------------- >TRANSFORMING GROWTH FACTO; SWP:P18341; PDB:1KLAA; ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK -------------------------------------------------------3333- VLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS ---------------------------------------------------- >LAMININ; SWP:P02468; PDB:1KLO; CPCPGGSSCAIVPKTKEVVCTHCPTGTAGKRCELCDDGYFGDPLGSNGPVRLCRPCQCND -----------------------------------2222------------------%%% NIDPNAVGNCNRLTGECLKCIYNTAGFYCDRCKEGFFGNPLAPNPADKCKACACNPYGTV %-1111--------------%%%%--------2222--1111-3333-----------22 QQQSSCNPVTGQCQCLPHVSGRDCGTCDPGYYNLQSGQGCER 22-------------2222-1111---2222-1111------ >MEROMYCOLATE EXTENSION AC; SWP:Q10500; PDB:1KLPA; MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQTEDKYGVKIP ----------------------3333---------------3333--------------3 
DEDLAGLRTVGDVVAYIQKLEEENPEAAQALRAKIESENPDAVANVQARLEAESK 333-------------3333----------------------------------- >ZINC FINGER Y-CHROMOSOMAL; SWP:P08048; PDB:1KLRA; KTYQCQYCEFRSADSSNLKTHIKTKHSKEK --------------1111------------ >HLA class II histocompati; SWP:P13758; PDB:1KLUB; GDTRPRFLWQLKFECHFFNGTERVRLLERCIYNQEESVRFDSDVGEYRAVTELGRPDAEY ------------------!!!!------------------3333------3333------ WNSQKDLLEQRRAAVDTYCRHNYGVGESFTVQRRVEPKVTVYPSKTQPLQHHNLLVCSVS -------------------------33331111--------------------------- GFYPGSIEVRWFRNGQEEKAGVVSTGLIQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSV ------------iiii-----------------------------2222-------3333 TSPLTVEWRA ---------- >CYSTEINE RICH PROTEIN B; SWP:O25103; PDB:1KLXA; GGGTVKKDLKKAIQYYVKACELNEMFGCLSLVSNSQINKQKLFQYLSKACELNSGNGCRF -----------------------2222------1111------------1111------- LGDFYENGKYVKKDLRKAAQYYSKACGLNDQDGCLILGYKQYAGKGVVKNEKQAVKTFEK -------------------------1111------------------------------- ACRLGSEDACGIL ------------- >DIPETALIN; SWP:O96790; PDB:1KMAA; FQGNPCECPRALHRVCGSDGNTYSNPCMLTCAKHEGNPDLVQVHEGPCDEHDHDF ----------------1111----------------3333--------------- >VACUOLAR MORPHOGENESIS PR; SWP:P32912; PDB:1KMDA; KMSEKLRIKVDDVKINPKYVLYGVSTPNKRLYKRYSEFWKLKTRLERDVGSTIPYDFPEK -------------------------3333----3333----------------------- PGVLDRRWQRRYDDPEMIDERRIGLERFLNELYNDRFDSRWRDTKIAQDFLQLSKPN --------------3333-3333---------------3333------1111----- >CHEMOTAXIS PROTEIN CHEY; SWP:P07366; PDB:1KMIZ; SIKPADEHSAGDIIARIGSLTRMLRDSLRELGLDQAIAEAAEAIPDARDRLYYVVQMTAQ ---------------------------------------1111----------------- AAERALNSVEASQPHQDQMEKSAKALTQRWDDWFADPIDLADARELVTDTRQFLADVPAH ---3333-------------------------------3333---------3333----- TSFTNAQLLKIMMAQDFQDLTGQVIKRMMDVIQEIERQLLMVLLSQDQVDDLLDSLG -----3333-----------------------3333--------------------- >HISTIDYL-TRNA SYNTHETASE; SWP:P04804; PDB:1KMMA; NIQAIRGMNDYLPGETAIWQRIEGTLKNVLGSYGYSEIRLPIVEQTPLFKRAIGEVTDVV 
----2222---3333---------------1111----------3333--------3333 EKEMYTFEDRNGDSLTLRPEGTAGCVRAGIEHGLLYNQEQRLWYIGPMFRHERPQKGRYR --------1111-----------------1111---------------------1111-- QFHQLGCEVFGLQGPDIDAELIMLTARWWRALGISEHVTLELNSIGSLEARANYRDALVA ---------------------------------1111-----------------3333-- FLEQHALGDYLDEESREHFAGLCKLLESAGIAYTVNQRLVRGLDYYNRTVFEWVTNSLGS -----3333--3333--------------------1111--------------------- QGTVCAGGRYDGLVEQLGGRATPAVGFAMGLERLVLLVQAVNPEFKADPVVDIYLVASGA ---------1111----------------------------------------------- DTQSAAMALAERLRDELPGVKLMTNHGGGNFKKQFARADKWGARVAVVLGESEVANGTAV ----------------2222---------------------------------------- VKDLRSGEQTAVAQDSVAAHLRTLLG --3333------3333---------- >TRANSFORMING PROTEIN RHOA; SWP:P06749; PDB:1KMQA; IRKKLVIVGDGACGKTCLLIVNSKDQFPEVYVPTVFENYVADIEVDGKQVELALWDTAGL ---------2222-------------------------------%%%%-----------3 EDYDRLRPLSYPDTDVILMCFSIDSPDSLENIPEKWTPEVKHFCPNVPIILVGNKKDLRN 3333333--------------1111------------------1111-------3333-- DEHTRRELAKMKQEPVKPEEGRDMANRIGAFGYMECSAKTKDGVREVFEMATRAALQ --------1111----3333-----------------1111---------------- >RHO GDP-DISSOCIATION INHI; SWP:P52565; PDB:1KMTA; AMVPNVVVTGLTLVCSSAPGPLELDLTGDLESFKKQSFVLKEGVEYRIKISFRVNREIVS --------------1111------111133331111----2222---------------- GMKYIQHTYRKGVKIDKTDYMVGSYGPRAAAYEFLTPVEEAPKGMLARGSYSIKSRFTDD ---------iiii------------------------------3333-----------11 DKTDHLSWEWNLTIKKDW 11---------------- >DIHYDROFOLATE REDUCTASE; SWP:P00374; PDB:1KMVA; VGSLNCIVAVSQNMGIGKNGDLPWPPLRNEFRYFQRMTTTSSVEGKQNLVIMGKKTWFSI ----------1111---iiii---------------------2222----------1111 PEKNRPLKGRINLVLSRELKEPPQGAHFLSRSLDDALKLTEQPELANKVDMVWIVGGSSV 3333--2222------------2222------------------1111------------ YKEAMNHPGHLKLFVTRIMQDFESDTFFPEIDLEKYKLLPEYPGVLSDVQEEKGIKYKFE ---1111----------------------------------2222------%%%%----- VYEKND ------ >VASCULAR ENDOTHELIAL GROW; SWP:P15692; PDB:1KMXA; 
ARQENPCGPCSERRKHLFVQDPQTCKCSCKNTDSRCKARQLELNERTCRCDKPRR --------------3333------------------------------------- >ALLOPHYCOCYANIN; SWP:P59856; PDB:1KN1A; SIVTKSIVNADAEARYLSPGELDRIKSFVLSGARRVRIAQTLTENRERIVKQAGDQLFQK ----------1111---3333--------------------------------3333--- RPDVVSPGGNAYGEEMTATCLRDLDYYLRLVTYGIVSGDVTPIEEIGLVGVREMYKSLGT -1111-------1111-----------------------3333----------------- PISAVAEGVKCMKSVASSLLSGEDSAEAGFYFDYVVGAMQ 3333-------------------------------1111- >PHOSPHATIDYLETHANOLAMINE ; SWP:Q8VIN1; PDB:1KN3A; SMWTGPLSLHEVDEQPQHLLRVTYTEAEVEELGQVLTPTQVKHRPGSISWDGLDPGKLYT -1111--1111-------------------2222--3333---------22221111--- LILTDPDAPSRKKPVYREWHHFLVVNMKGNDISSGNVLSDYVGSGPPKGTGLHRYVWLVY ---------33331111----------!!!!1111-----------2222---------- QQDKPLRCDEPILTNRSGDHRGKFKTAAFRKKYHLGAPVAGTCYQAEWDSYVPKLYKQLS -----------------2222---------1111--------------3333----1111 >PROHORMONE CONVERTASE 1; SWP:P21662; PDB:1KN6A; FVNEWAAEIPGGQEAASAIAEELGYDLLGQIGSLENHYLFKHKSHPRRSRRSALHITKRL -------------------------------1111------------------1111--3 SDDDRVTWAEQQY 3333333------ >VOLTAGE-GATED POTASSIUM C; SWP:P15385; PDB:1KN7A; MEVAMVSAESSGCNSHMPYGYAAQARARERERLAHSRAAAAAAVAAATAAVEGTGGSGGG -------------------333333331111------------3333------------- PHHHHQTRGAYSSHD --------------- >ADENOVIRUS TYPE 5 FIBER P; SWP:P11818; PDB:1KNB; NDKLTLWTTPAPSPNCRLNAEKDAKLTLVLTKCGSQILATVSVLAVKGSLAPISGTVQSA -1111---------------------------!!!!-----------1111--1111--- HLIIRFDENGVLLNNSFLDPEYWNFRNGDLTEGTAYTNAVGFMPNLSAYPKSHGKTAKSN ------1111----------------!!!!--------3333-------2222--3333- IVSQVYLNGDKTKPVTLTITLNGTQETGDTTPSAYSMSFSWDWSGHNYINEIFATSSYTF -----22221111-----------------------------2222-2222--------- SYIAQE ------ >THIOL:DISULFIDE INTERCHAN; SWP:P30960; PDB:1KNGA; RPAPQTALPPLEGLQADNVQVPGLDPAAFKGKVSLVNVWASWCVPCHDEAPLLTELGKDK ----------2222-%%%%-------3333---------1111-3333----------33 RFQLVGINYKDAADNARRFLGRYGNPFGRVGVDANGRASIEWGVYGVPETFVVGREGTIV 
33---------------------------------33331111----------1111--- YKLVGPITPDNLRSVLLPQMEKAL --------------------1111 >ENDO-1,4-BETA-XYLANASE A; SWP:P26514; PDB:1KNMA; PPADGGQIKGVGSGRCLDVPDASTSDGTQLQLWDCHSGTNQQWAATDAGELRVYGDKCLD ----------1111----2222--2222-----------------1111----------- AAGTSNGSKVQIYSCWGGDNQKWRLNSDGSVVGVQSGLCLDAVGNGTANGTLIQLYTCSN ----2222---------1111----1111------------2222--2222--------- GSNQRWTRT 1111----- >GLUCONATE KINASE; SWP:P46859; PDB:1KNQA; TTNHDHHIYVLMGVSGSGKSAVASEVAHQLHAAFLDGDFLHPRRNIEKMASGEPLNDDDR --1111-------------------------------1111-------1111---3333- KPWLQALNDAAFAMQRTNKVSLIVCSALKKHYRDLLREGNPNLSFIYLKGDFDVIESRLK ------------------------------------2222-------------------- ARKGHFFKTQMLVTQFETLQEPGADETDVLVVDIDQPLEGVVASTIEVIKK -2222-----------------3333------------------------- >BSE634I RESTRICTION ENDON; SWP:Q8RT53; PDB:1KNVA; NLTNSNCVEEYKENGKTKIRIKPFNALIELYHHQTPTGSIKENLDKLENYVKDVVKAKGL 1111-------------------------------------------------------- AIPTSGAFSNTRGTWFEVMIAIQSWNYRVKRELNDYLIIKMPNVKTFDFRKIFDNETREK --------------------------------1111------3333-3333--------- LHQLEKSLLTHKQQVRLITSNPDLLIIRQKDLIKSEYNLPINKLTHENIDVALTLFKDIE ----------------------------1111-3333----------------3333--- GKCKWDSLVAGVGLKTSLRPDRRLQLVHEGNILKSLFAHLKMRYWNPKAEFKYYGASSEP ---1111--------------------------------------1111----------- VSKADDDALQTAATHTIVNVNSTPERAVDDIFSLTSFEDIDKMLDQIIKK ------------1111--1111-------------3333----------- >DIAMINOPIMELATE DECARBOXY; SWP:P00861; PDB:1KNWA; PHSLFSTDTDLTAENLLRLPAEFGCPVWVYDAQIIRRQIAALKQFDVVRFAQKACSNIHI --1111---------------------------------1111-------3333------ LRLMREQGVKVDSVSLGEIERALAAGYNPQTHPDDIVFTADVIDQATLERVSELQIPVNA ----1111--------------1111-33331111------------------------- GSVDMLDQLGQVSPGHRVWLRVNPGFGHGHSQKTNTGGENSKHGIWYTDLPAALDVIQRH ------------2222-------------1111------------1111----------- HLQLVGIHMHIGSGVDYAHLEQVCGAMVRQVIEFGQDLQAISAGGGLSVPYQQGEEAVDT ---------------------------------------------------2222----- 
EHYYGLWNAAREQIARHLGHPVKLEIEPGRFLVAQSGVLITQVRSVKQMGSRHFVLVDAG ----------------------------33333333------------!!!!-------3 FNDLMRPAMYGSYHHISALAADGRSLEHAPTVETVVAGPLCESGDVFTQQEGGNVETRAL 333-3333-----------1111--1111------------1111----2222------- PEVKAGDYLVLHDTGAYGASMSSNYNSRPLLPEVLFDNGQARLIRRRQTIEELLALELLH ---2222----------3333---%%%%--------iiii--------3333-3333--- H - >PROBABLE HPR(SER) KINASE/; SWP:P75548; PDB:1KNXA; MKKLLVKELIEQFQDCVNLIDGHTNTSNVIRVPGLKRVVFEMLGLFSSQIGSVAILGKRE ----333311111111--------1111-------------------------------- FGFLSQKTLVEQQQILHNLLKLNPPAIILTKSFTDPTVLLQVNQTYQVPILKTDFFSTEL -------3333---3333-----------1111---------1111---------3333- SFTVETYINEQFATVAQIHGVLLEVFGVGVLLTGRSGIGKSECALDLINKNHLFVGDDAI -------3333-------------iiii-------------------1111--------- EIYRLGNRLFGRAQEVAKKFMEIRGLGIINVERFYGLQITKQRTEIQLMVNLLSLTFERL -----------------------------3333--3333--------------------- GTELKKQRLLGVDLSFYEIPISPGRKTSEIIESAVIDFKLKHSGYNSALDFIENQKAILK --------iiii-------------3333-----------1111---------------- RKK --- >KANAMYCIN NUCLEOTIDYLTRAN; SWP:P05057; PDB:1KNYA; MNGPIIMTREERMKIVHEIKERILDKYGDDVKAIGVYGSLGRQTDGPYSDIEMMCVMSTE --------------------------!!!!---------------1111---------22 EAEFSHEWTTGEWKVEVNFYSEEILLDYASQVESDWPLTHGQFFSILPIYDSGGYLEKVY 22--------------------3333-1111---3333--1111---------------- QTAKSVEAQTFHDAICALIVEELFEYAGKWRNIRVQGPTTFLPSLTVQVAMAGAMLIGLH ------3333---------------------------3333------------------- HRICYTTSASVLTEAVKQSDLPSGYDHLCQFVMSGQLSDSEKLLESLENFWNGIQEWTER ------11113333-------2222-------------3333------------------ HGYIVDVSKRIPF ------------- >NONSTRUCTURAL RNA-BINDING; SWP:P03536; PDB:1KNZA; TQQMAVSIINSSFEAAVVAATSALENMGIEYDYQDIYSRVKNKFDFVMDDSGVKNNPIGK ------------------------1111--------------------3333-------- AITIDQALNNKFGSAIRNRNWLADTSRPAKLDEDVNKLRMMLGIDQKMRVLNACFSVKRI -----3333--------------1111-----------3333------------------ PGKSSSIIKCTKLMRDKLERGEVEVDDSFVDEKM -------------------------33333333- >VIM-2 METALLO-BETA-LACTAM; 
SWP:Q9K2N0; PDB:1KO3A; EYPTVSEIPVGEVRLYQIADGVWSHIATQSFDGAVYPSNGLIVRDGDELLLIDTAWGAKN ---3333-----------2222----------------------!!!!------------ TAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEVEGNEIP ------------------------1111-------------------------------- THSLEGLSSSGDAVRFGPVELFYPGAAHSTDNLVVYVPSASVLYGGCAIYELSRTSAGNV --------2222---!!!!--------------------------1111-3333-----1 ADADLAEWPTSIERIQQHYPEAQFVIPGHGLPGGLDLLKHTTNVVKAHTN 111-1111----------1111---------------------------- >NUCLEAR PORE COMPLEX PROT; SWP:P52948; PDB:1KO6A; MHPAGIILTKVGYYTIPSMDDLAKITNEKGECIVSDFTIGRKGYGSIYFEGDVNLTNLNL -3333-----------------3333--------------2222----------222233 DDIVHIRRKEVVVYLDDNQKPPVGEGLNRKAEVTLDGVWPTDKTSRCLIKSPDRLADINY 33----------------------!!!!-------------------------------- EGRLEAVSRKQGAQFKEYRPETGSWVFKVSHF -------------------1111--------- >HPR KINASE/PHOSPHATASE; SWP:Q9S1H5; PDB:1KO7A; MLTTKSLVERFELEMIAGEAGLNKQIKNTDISRPGLEMAGYFSHYASDRIQLLGTTELSF -----------------3333------------3333----11111111----------- YNLLPDEERKGRMRKLCRPETPAIIVTRDLEPPEELIEAAKEHETPLITSKIATTQLMSR 11113333--3333---3333--------------------------------------- LTTFLEHELARTTSLHGVLVDVYGVGVLITGDSGIGKSETALELIKRGHRLVADDNVEIR ---------------------iiii------2222---------1111------------ EISKDELIGRAPKLIEHLLEIRGLGIINVMTLFGAGSILTEKRLRLNIHLENEETLRILD -----------3333------------3333--1111----------------------- TEITKKTIPVRPGRNVAVIIEVAAMNYRLNIMGINTAEEFNDRLN ----------------------------------3333------- >TWITCHIN; SWP:Q23551; PDB:1KOA; YDNYVFDIWKQYYPQPVEIKHDHVLDHYDIHEELGTGAFGVVHRVTERATGNNFAAKFVM 1111--3333------------3333----------1111-------------------- TPHESDKETVRKEIQTMSVLRHPTLVNLHDAFEDDNEMVMIYEFMSGGELFEKVADEHNK --3333---------------1111-------------------------3333-1111- MSEDEAVEYMRQVCKGLCHMHENNYVHLDLKPENIMFTTKRSNELKLIDFGLTAHLDPKQ -3333-------------------------3333----1111-------3333---3333 SVKVTTGTAEFAAPEVAEGKPVGYYTDMWSVGVLSYILLSGLSPFGGENDDETLRNVKSC 
-------3333----3333---3333---------------------------------- DWNMDDSAFSGISEDGKDFIRKLLLADPNTRMTIHQALEHPWLTPGNAPGRDSQIPSSRY -----3333---3333----------1111----------3333-------------333 TKIRDSIKTKYDAWPEPLPPLGRISNYSSLRKHRPQEYSIRDAFWDRSEAQPRFIVKPYG 3---------1111---------1111-3333---1111------3333----------- TEVGEGQSANFYCRVIASSPPVVTWHKDDRELKQSVKYMKRYNGNDYGLTINRVKGDDKG --------------------------%%%%----1111----------------3333-- EYTVRAKNSYGTKEEIVFLNVTRHSEP --------------------------- >TWITCHIN; SWP:Q16980; PDB:1KOBA; INDYDKFYEDIWKKYVPQPVEVKQGSVYDYYDILEELGSGAFGVVHRCVEKATGRVFVAK ---1111--3333--------------------------1111----------------- FINTPYPLDKYTVKNEISIMNQLHHPKLINLHDAFEDKYEMVLILEFLSGGELFDRIAAE -------------------1111-1111-----------------------3333---11 DYKMSEAEVINYMRQACEGLKHMHEHSIVHLDIKPENIMCETKKASSVKIIDFGLATKLN 11-------------------------------1111---------------1111---- PDEIVKVTTATAEFAAPEIVDREPVGFYTDMWAIGVLGYVLLSGLSPFAGEDDLETLQNV -----------11113333------3333------------------------------- KRCDWEFDEDAFSSVSPEAKDFIKNLLQKEPRKRLTVHDALEHPWLKGDHSNLTSRIPSS ---------1111--3333----------3333---------3333-----------333 RYNKIRQKIKEKYADWPAPQPAIGRIANFSSLRKHRPQEYQIYDSYFDRKEA 3-------3333--------3333-11111111----1111----------- >ENDOSTATIN; SWP:P39061; PDB:1KOE; QPVLHLVALNTPLSGGMRGIRGADFQCFQQARAVGLSGTFRAFLSSRLQDLYSIVRRADR -------------------------------1111----------11113333--3333- GSVPIVNLKDEVLSPSWDSLFSGSQGQLQPGARIFSFDGRDVLRHPAWPQKSVWHGSDPS ------1111-----3333---------2222---1111-33333333---------111 GRRLMESYCETWRTETTGATGQASSLLSGRLLEQKAASCHNSYIVLCIENSF 1--1111%%%%----3333-----3333---------1111----------- >FORMALDEHYDE DEHYDROGENAS; SWP:P46154; PDB:1KOLA; GNRGVVYLGSGKVEVQKIDYPKMQDPRGKKIEHGVILKVVSTNICGSDQHMVRGRTTAQV --------%%%%------------1111----------------3333----------22 GLVLGHEITGEVIEKGRDVENLQIGDLVSVPFNVACGRCRSCKEMHTGVCLTVNPARAGG 22--------------------2222---------------11111111----3333--- AYGYVDMGDWTGGQAEYVLVPYADFNLLKLPDRDKAMEKIRDLTCLSDILPTGYHGAVTA 
---2222--------------3333-------------33331111-----------111 GVGPGSTVYVAGAGPVGLAAAASARLLGAAVVIVGDLNPARLAHAKAQGFEIADLSLDTP 1-2222---------------------------------------1111----1111--- LHEQIAALLGEPEVDCAVDAVGFEARGHGHEGAKHEAPATVLNSLMQVTRVAGKIGIPGL ---------------------1111-----3333--1111---------2222------- YVTEDPGAVDAAAKIGSLSIRFGLGWAKSHSFHTGQTPVMKYNRALMQAIMWDRINIAEV ----1111-3333-------3333-1111--------3333--------1111------- VGVQVISLDDAPRGYGEFDAGVPKKFVIDPHKTFSA ------3333------------------1111---- >PROTEIN YEBC; SWP:P24237; PDB:1KONA; GHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLDANPRLRAAVDKALSNNMTRDTLN --3333----1111-----------------------------------1111------- RAIARGANMETIIYEGYGPGGTAIMIECLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYL ------------------%%%%----------------------------------3333 FSKKGVISFEKGDEDTIMEAALEAGAEDVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKA ------------3333---------------1111------------------1111--- DSAEVSMIPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISDEVAATL --------------------------------3333---------3333---- >CARBONIC ANHYDRASE; SWP:Q50940; PDB:1KOPA; HTHWGYTGHDSPESWGNLSEEFRLCSTGKNQSPVNITETVSGKLPAIKVNYKPSMVDVEN -------11111111---3333-1111--------------------------------- NGHTIQVNYPEGGNTLTVNGRTYTLKQFHFHVPSENQIKGRTFPMEAHFVHLDENKQPLV -----------------iiii----------------iiii-----------1111---- LAVLYEAGKTNGRLSSIWNVMPMTAGKVKLNQPFDASTLLPKRLKYYRFAGSLTTPPCTE ----------3333--3333--------------3333---------------------- GVSWLVLKTYDHIDQAQAEKFTRAVGSENNRPVQPLNARVVIE -----------------------------------!!!!---- >ARGININOSUCCINATE SYNTHET; SWP:P59846; PDB:1KORA; MKIVLAYSGGLDTSIILKWLKETYRAEVIAFTADIGQGEEVEEAREKALRTGASKAIALD ---------------------1111--------------3333----------------- LKEEFVRDFVFPMMRAGAVYEGYYLLGTSIARPLIAKHLVRIAEEEGAEAIAHGATGKGN -----------3333----------3333------------------------------- DQVRFELTAYALKPDIKVIAPWREWSFQGRKEMIAYAEAHGIPVPPYSMDANLLHISYEG ------------1111---3333--------------1111----------3333----! 
GVLEDPWAEPPKGMFRMTQDPEEAPDAPEYVEVEFFEGDPVAVNGERLSPAALLQRLNEI !!!-1111--2222-----1111------------iiii---iiii-------------- GGRHGVGRVDIVENRFVGMKSRGVYETPGGTILYHARRAVESLTLDREVLHQRDMLSPKY -1111---------1111------------------------------------------ AELVYYGFWYAPEREALQAYFDHVARSVTGVARLKLYKGNVYVVGRKAPKSLYRGYDQKD --------------------------------------------------------3333 AEGFIKIQALRLRVRALVER -------------------- >VOLTAGE-DEPENDENT CHANNEL; SWP:P60590; PDB:1KOZA; DCVRFWGKCSQTSDCCPHLACKSKWPRNICVWDGSV ---------------1111----------------- >CREATINE AMIDINOHYDROLASE; SWP:Q7SIB5; PDB:1KP0A; AAMITKYHNGKKYTPFSAEMTRRRLRAWMAKSIDAVLFTSYHNINYYSGWLYCYFGRKYA -----------------3333--------------------------------iiii--- VIVKAVTISKGIDGGMPWRRSFGNIVYTDWKRDNFYSAVKKLVKGAKIGIEHDHVTLHRR ---------3333-3333------------11113333----2222----1111------ LKALPGTEFVDVGPVMWRVIKSSEELIRGARISDIGGAATAAAISAGVPEYEVAIATTAM ---1111------3333----3333---------------111122223333-------- VRIARFPYVELMDTWIWFQSGINTDGAHNPVTRVVRGDILSLNCFPMIFGYYTALERTLF -------------------!!!!--1111------------------iiii--------- LVDASLIWKNTAVHRRGLLIKPGARCKDIASELNMYRWDLLRYRTFGYGHSFGVLHYYGR --3333--------------2222---------------1111----------------- EAGVELREDITVLEPGMVVSMEPMVMPEGEPGAGGYREHDILVIKENTENITGFPFGPEH 3333--1111---2222---------2222--------------------------3333 NIIKA ----- >TOXIN; SWP:P16948; PDB:1KP6A; NNAFCAGFGLSCKWECWCTAHGTGNELRYATAAGCGDHLSKSYYDARAGHCLFSDDLRNQ -3333-----------------------------!!!!2222------------------ FYSHCSSLNNNMSCRSLSK -----1111---------- >PROTEIN KINASE C INTERACT; SWP:P49773; PDB:1KPF; DTIFGKIIRKEIPAKIIFEDDRCLAFHDISPQAPTHFLVIPKKHISQISVAEDDDESLLG -------------------1111-----------------------3333-3333----- HLMIVGKKCAADLGLNKGYRMVVNEGSDGGQSVYHVHLHVLGGRQMHWPPG ----------11111111--------1111--------------------- >CYCLOPROPANE-FATTY-ACYL-P; SWP:Q11195; PDB:1KPGA; DELKPHFANVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDTLQEAQIAKIDLALGKLGLQ ------------111133331111-3333------------------------1111--- 
PGTLLDVGCGWGATRAVEKYDVNVVGLTLSKNQANHVQQLVANSENLRSKRVLLAGWEQF --------!!!!-------------------------------------------3333- DEPVDRIVSIGAFEHFGHERYDAFFSLAHRLLPADGVLLHTITGLHPKEIHERGLPSFTF -----------3333-1111-------------------------1111-1111------ ARFLKFIVTEIFPGGRLPSIPVQECASANGFTVTRVQSLQPHYAKTLDLWSAALQANKGQ -----------2222-----------1111--------3333------------------ AIALQSEEVYERYKYLTGCAEFRIGYIDVNQFTCQK ---------------------1111----------- >CYCLOPROPANE-FATTY-ACYL-P; SWP:Q11196; PDB:1KPIA; QLKPPVEAVRSHYDKSNEFFKLWLDPSMTYSCAYFERPDMTLEEAQYAKRKLALDKLNLE -----------1111---------1111--------1111-------------1111--2 PGMTLLDIGCGWGSTMRHAVAEYDVNVIGLTLSENQYAHDKAMFDEVDSPRRKEVRIQGW 222---------------------------------------3333------------33 EEFDEPVDRIVSLGAFEHFADGAGDAGFERYDTFFKKFYNLTPDDGRMLLHTITIPDKEE 33------------3333--------3333-----1111--------------------- AQELGLTSPMSLLRFIKFILTEIFPGGRLPRISQVDYYSSNAGWKVERYHRIGANYVPTL -3333------------------2222---3333-----1111----------------- NAWADALQAHKDEAIALKGQETCDIYMHYLRGCSDLFRDKYTDVCQFTLVK --------------------------------------------------- >BETA-2-MICROGLOBULIN; SWP:P13747; PDB:1KPRA; GSHSLKYFHTSVSRPGRGEPRFISVGYVDDTQFVRFDNDAASPRMVPRAPWMEQEGSEYW -------------------------------------------------1111------- DRETRSARDTAQIFRVNLRTLRGYYNQSEAGSHTLQWMHGCELGPDGRFLRGYEQFAYDG ----------------------1111-----------------1111----------iii KDYLTLNEDLRSWTAVDTAAQISEQKSNDASEAEHQRAYLEDTCVEWLHKYLEKGKETLL i-----3333-------3333------------------------------------111 HLEPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQQDGEGHTQDTELVETRPAGDGT 1-------------1111------------------------------------------ FQKWAAVVVPSGEEQAYTCHVQHEGLPEPVTLRW ------------1111------1111-------- >Ran GTPase-activating pro; SWP:P46061; PDB:1KPSB; TDLSTFLSFPSPEKLLRLGPKVSVLIVQQTDTSDPEKVVSAFLKVASVFRDDASVKTAVL -----3333---------1111----------------------1111------------ DAIDALMKKAFSCSSFNSNTFLTRLLIHMGLLKSEDKIKAIPSLHGPLMVLNHVVRQDYF ------------3333----------------------------------------3333 
PKALAPLLLAFVTKPNGALETCSFARHNLLQTLYNI 1111-------------------------------- >KP4 TOXIN; SWP:Q90121; PDB:1KPTA; LGINCRGSSQCGLSGGNLMVRIRDQACGNQGQTWCPGERRAKVCGTGNSISAYVQSTNNC -------3333-----------------3333--2222---------------------- ISGTEACRHLTNLVNHGCRVCGSDPLYAGNDVSRGQLTVNYVNSC -------------1111-------------3333----------- >HOST FACTOR FOR Q BETA; SWP:Q7A104; PDB:1KQ1A; NIQDKALENFKANQTEVTVFFLNGFQMKGVIEEYDKYVVSLNSQGKQHLIYKHAISTYTV --------------------1111----------1111----iiii----3333------ >GLYCEROL DEHYDROGENASE; SWP:NA; PDB:1KQ3A; HMITTTIFPGRYVQGAGAINILEEELSRFGERAFVVIDDFVDKNVLGENFFSSFTKVRVN --------------22221111----1111----------------11111111------ KQIFGGECSDEEIERLSGLVEEETDVVVGIGGGKTLDTAKAVAYKLKKPVVIVPTIASTD ----------------11111111------------------------------------ APCSALSVIYTPNGEFKRYLFLPRNPDVVLVDTEIVAKAPARFLVAGMGDALATWFEAES 1111------1111---------------------11113333----------------- CKQKYAPNMTGRLGSMTAYALARLCYETLLEYGVLAKRSVEEKSVTPALEKIVEANTLLS -------1111----------------------------1111----------------- GLGFESGGLAAAHAIHNGLTVLENTHKYLHGEKVAIGVLASLFLTDKPRKMIEEVYSFCE ---------------------3333---3333---------------------------- EVGLPTTLAEIGLDGVSDEDLMKVAEKACDKNETIHNEPQPVTSKDVFFALKAADRYGRM ------3333--2222-------------1111-1111----3333-------------1 RKNL 111- >HYPOTHETICAL PROTEIN TM04; SWP:Q9WYT0; PDB:1KQ4A; HKIDILDKGFVELVDVGNDLSAVRAARVSFERDRHLIEYLKHGHETPFEHIVFTFHVKAP -----------------3333-----------------------3333------------ IFVARQWFRHRIASYNELSSYEFYIPSPERLEGYKTTIPPERVTEKISEIVDKAYRTYLE --------------------------33332222----3333------------------ LIESGVPREVARIVLPLNLYTRFFWTVNARSLNFLNLRADSHAQWEIQQYALAIARIFKE -1111-3333-1111------------3333--------1111----------------- KCPWTFEAFLKYAYKGDIL ------------------- >NEUTROPHIL CYTOSOL FACTOR; SWP:P14598; PDB:1KQ6A; GDTFIRHIALLGFEKRFVPSQHYVYFLVKWQDLSEKVVYRRFTEIYEFHKTLKEFPIEAG -----------------------------1111-----------------------1111 AINPENRIIPHLPAPKWFDGQRAAENRQGTLTEYCSTLSLPTKISRCPHLLDFFKVRPDD 
--3333----------------------------------3333-------1111-1111 LKLPTDNQTKKPETYL -----1111------- >HEPATOCYTE NUCLEAR FACTOR; SWP:Q63244; PDB:1KQ8A; YIALITMAIRDSAGGRLTLAEINEYLMGKFPFFRGSYTGWRNSVRHNLSLNDCFVKVLRD --------3333------------------3333-----3333----------------- PSRPWGKDNYWMLNP --------------- >OXYGEN-INSENSITIVE NAD(P); SWP:Q01234; PDB:1KQBA; DIISVALKRHSTKAFDASKKLTAEEAEKIKTLLQYSPSSTNSQPWHFIVASTEEGKARVA ---------------1111------------------2222------------------1 KSAAGTYVFNERKMLDASHVVVFCAKTAMDDAWLERVVDQEEADGRFNTPEAKAANHKGR 111-1111---------------------------------1111--------------- TYFADMHRVDLKDDDQWMAKQVYLNVGNFLLGVGAMGLDAVPIEGFDAAILDEEFGLKEK ---------------------------------1111--------------------111 GFTSLVVVPVGHHSVEDFNATLPKSRLPLSTIVTEC 1------------11113333------3333----- >FORMATE DEHYDROGENASE, NI; SWP:P24183; PDB:1KQFA; QARNYKLLRAKEIRNTCTYCSVGCGLLMYSLGDGAKNAREAIYHIEGDPDHPVSRGALCP ----1111--------------------------1111---------1111--iiii-33 KGAGLLDYVNSENRLRYPEYRAPGSDKWQRISWEEAFSRIAKLMKADRDANFIEKNEQGV 33-33331111----------2222------------------------------1111- TVNRWLSTGMLCASGASNETGMLTQKFARSLGMLAVDNQARVHGPTVASLAPTFGRGAMT ------------1111---------------------3333------------------- NHWVDIKNANVVMVMGGNAAEAHPVGFRWAMEAKNNNDATLIVVDPRFTRTASVADIYAP --3333-----------3333--3333---------------------3333-------- IRSGTDITFLSGVLRYLIENNKINAEYVKHYTNASLLVRDDFAFEDGLFSGYDAEKRQYD -2222---------------------------1111--1111--iiii------------ KSSWNYQLDENGYAKRDETLTHPRCVWNLLKEHVSRYTPDVVENICGTPKADFLKVCEVL 1111----1111----1111-1111--------3333----------------------- ASTSAPDRTTTFLYALGWTQHTVGAQNIRTMAMIQLLLGNMGMAGGGVNALRGHSNIQGL 11111111----------------------------------2222-------------- TDLGLLSTSLPGYLTLPSEKQVDLQSYLEANTPKATLADQVNYWSNYPKFFVSLMKSFYG ----------%%%%---1111-------------------------------------!! 
DAAQKENNWGYDWLPKWDQTYDVIKYFNMMDEGKVTGYFCQGFNPVASFPDKNKVVSCLS !!-3333iiii------------------1111----------3333-----------11 KLKYMVVIDPLVTETSTFWQNHGESNDVDPASIQTEVFRLPSTCFAEEDGSIANSGRWLQ 11----------33331111--3333--3333-----------1111------1111--- WHWKGQDAPGEARNDGEILAGIYHHLRELYQSEGGKGVEPLMKMSWNYKQPHEPQSDEVA --------!!!!-------------------------3333--------1111------- KENNGYALEDLYDANGVLIAKKGQLLSSFAHLRDDGTTASSCWIYTGSWTEQGNQMANRD ------------1111----2222---3333----------1111----1111------- NSDPSGLGNTLGWAWAWPLNRRVLYNRASADINGKPWDPKRMLIQWNGSKWTGNDIPDFG --1111---1111----%%%%---3333--1111---1111------------------- NAAPGTPTGPFIMQPEGMGRLFAINKMAEGPFPEHYEPIETPLGTNPLHPNVVSNPVVRL --2222----1111--------!!!!--------------1111-3333-----1111-- YEQDALRMGKKEQFPYVGTTYRLTEHFHTWTKHALLNAIAQPEQFVEISETLAAAKGINN 33331111-3333---------3333!!!!----------------------------22 GDRVTVSSKRGFIRAVAVVTRRLKPLNVNGQQVETVGIPIHWGFEGVARKGYIANTLTPN 22-----1111--------3333----iiii---------------------1111---- VGDANSQTPEYKAFLVNIEKA ---------1111-------- -------------------------------------- >Fusion Protein of and str; SWP:P03069; PDB:1KQLA; MDKVEELLSKNYHLENEVARLKKLVDDLEDELYAQKLKYKAISEELDHALNDMTS -3333-------------------------------------------------- >NICOTINAMIDE MONONUCLEOTI; SWP:Q9HAN9; PDB:1KQNA; KTEVVLLACGSFNPITNMHLRLFELAKDYMNGTGRYTVVKGIISPVGDAYKKKGLIPAYH ---------------3333---------------------------3333-2222----- RVIMAELATKNSKWVEVDTWESLQKEWKETLKVLRHHQEKLEAAVPKVKLLCGADLLESF ------1111--------3333------3333---------------------------- AVPNLWKSEDITQIVANYGLICVTRAGNDAQKFIYESDVLWKHRSNIHVVNEWIANDISS -2222-3333--------------------------------3333-------------- TKIRRALRRGQSIRYLVPDLVQEYIEKHNLYSSESEDRNAGVILAPLQRNTA ------1111--2222---------------3333-2222--------1111 >NH(3)-DEPENDENT NAD(+) SY; SWP:P08164; PDB:1KQPA; SMQEKIMRELHVKPSIDPKQEIEDRVNFLKQYVKKTGAKGFVLGISGGQDSTLAGRLAQL ------------------------------------------------3333-------- AVESIREEGGDAQFIAVRLPHGTQQDEDDAQLALKFIKPDKSWKFDIKSTVSAFSDQYQQ 
-----1111----------------3333------------------------------- ETGDQLTDFNKGNVKARTRMIAQYAIGGQEGLLVLGTDHAAEAVTGFFTKYGDGGADLLP --------------------------------------33331111--2222-------- LTGLTKRQGRTLLKELGAPERLYLKEPTADLLDEKPQQSDETELGISYDEIDDYLEGKEV 2222---------------3333-----------22223333-----------1111--- SAKVSEALEKRYSMTEHKRQVPASMFDDWWK --------------3333-----11111111 >DEAD RINGER PROTEIN; SWP:Q24573; PDB:1KQQA; GWSFEEQFKQVRQLYEINDDPKRKEFLDDLFSFMQKRGTPINRLPIMAKSVLDLYELYNL --------------3333-3333---------3333----------%%%%---------- VIARGGLVDVINKKLWQEIIKGLHLPSSITSAALTLRTQYMKYLYPYECEKKNLSTPAEL -11113333-----------1111-3333------------------------------- QAAIDGNRREG ----------- >VP4; SWP:P12473; PDB:1KQRA; LDGPYQPTTFNPPVDYWMLLAPTAAGVVVEGTNNTDRWLATILVEPNVTSETRSYTLFGT --------------------------------------------------------iiii QEQITIANASQTQWKFIDVVKTTQNGSYSQYGPLQSTPKLYAVMKHNGKIYTYNGETPNV ----------------------1111-------------------iiii----------- TTKYYSTTNYDSVNMTAFCDFYIIPREEESTCTEYINNGL --------3333------------3333------------ >CELLULAR RETINOL-BINDING ; SWP:Q8UVG6; PDB:1KQWA; PADFNGTWEMLSNDNFEDVMKALDIDFATRKIAVHLKQTKVIVQNGDKFETKTLSTFRNY --------------------1111-------3333---------!!!!------------ EVNFVIGEEFDEQTKGLDNRTVKTLVKWDGDKLVCVQKGEKENRGWKQWIEGDLLHLEIH ----2222--------------------!!!!------------------!!!!------ CQDKVCHQVFKKKN !!!!---------- >NEURAL GLOBIN; SWP:O76242; PDB:1KR7A; MVNWAAVVDDFYQELFKAHPEYQNKFGFKGVALGSLKGNAAYKTQAGKTVDYINAAIGGS ------------------3333---1111--11111111--------------------- ADAAGLASRHKGRNVGSAEFHNAKACLAKACSAHGAPDLGHAIDDILSHL ----------1111-----------------1111-----------1111 >BENZOATE 1,2-DIOXYGENASE ; SWP:P07771; PDB:1KRHA; SNHQVALQFEDGVTRFICIAQGETLSDAAYRQQINIPMDCREGECGTCRAFCESGNYDMP --------1111-------2222------1111------------1111----------3 EDNYIEDALTPEEAQQGYVLACQCRPTSDAVFQIQASSEVCKTKIHHFEGTLARVENLSD 333--3333----------3333-------------3333------------------11 STITFDIQLDDGQPDIHFLAGQYVNVTLPGTTETRSYSFSSQPGNRLTGFVVRNVPQGKM 
11-------2222--------------2222----------2222---------2222-- SEYLSVQAKAGDKMSFTGPFGSFYLRDVKRPVLMLAGGTGIAPFLSMLQVLEQKGSEHPV --------2222------------------------------------------------ RLVFGVTQDCDLVALEQLDALQQKLPWFEYRTVVAHAESQHERKGYVTGHIEYDWLNGGE -------1111-------------1111-----------------3333--3333%%%%- VDVYLCGPVPMVEAVRSWLDTQGIQPANFLFEKFSAN ------------------------------------- >PLASMINOGEN; SWP:P00747; PDB:1KRN; DCYHGDGQSYRGTSSTTTTGKKCQSWSSMTPHRHQKTPENYPNAGLTMNYCRNPDADKGP ---!!!!---------1111----1111--------33331111--------1111---- WCFTTDPSVRWEYCNLKKC -----1111---------- >FERRITIN; SWP:Q46106; PDB:1KRQA; MLSKEVVKLLNEQINKEMYAANLYLSMSSWCYENSLDGAGAFLFAHASEESDHAKKLITY -------------------------------1111------------------------- LNETDSHVELQEVKQPEQNFKSLLDVFEKTYEHEQFITKSINTLVEHMLTHKDYSTFNFL ------------------------------------------------------------ QWYVSEQHEEEALFRGIVDKIKLIGEHGNGLYLADQYIKNIALSR --------------------------!!!!--------------- >GALACTOSIDE O-ACETYLTRANS; SWP:P07464; PDB:1KRRA; NMPMTERIRAGKLFTDMCEGLPEKRLRGKTLMYEFNHSHPSEVEKRESLIKEMFATVGEN -----------------iiii-------------11111111------------------ AWVEPPVYFSYGSNIHIGRNFYANFNLTIVDDYTVTIGDNVLIAPNVTLSVTGHPVHHEL --------------------------------------------------------1111 RKNGEMYSFPITIGNNVWIGSHVVINPGVTIGDNSVIGAGSIVTKDIPPNVVAAGVPCRV 1111---------------------2222--------2222------------------- IREINDRDKHYYFKDYKVES ----3333------------ >PROTEIN EC1268, RPIA; SWP:P27252; PDB:1KS2A; TQDELKKAVGWAALQYVQPGTIVGVGTGSTAAHFIDALGTKGQIEGAVSSSDASTEKLKS -------------11112222---------------33331111--------------11 LGIHVFDLNEVDSLGIYVDGADEINGHQIKGGGAALTREKIIASVAEKFICIADASKQVD 11----3333--------------------------------1111-------3333--- ILGKFPLPVEVIPARSAVARQLVKLGGRPEYRQGVVTDNGNVILDVHGEILDPIAENAIN ----------------------1111-----2222-1111-----------3333----- AIPGVVTVGLFANRGADVALIGTPDGVKTIVK ----------------------1111------ >ENDOGLUCANASE A; SWP:O74705; PDB:1KS5A; QTMCSQYDSASSPPYSVNQNLWGEYQGTGSQCVYVDKLSSSGASWHTEWTWSGGEGTVKS 
--------------------1111--------------1111-----------2222--- YSNSGVTFNKKLVSDVSSIPTSVEWKQDNTNVNADVAYDLFTAANVDHATSSGDYELMIW -----------3333-----------------------------11111111-------- LARYGNIQPIGKQIATATVGGKSWEVWYGSTTQAGAEQRTYSFVSESPINSYSGDINAFF ------------------iiii----------iiii------------------------ SYLTQNQGFPASSQYLINLQFGTEAFTGGPATFTVDNWTASVN ---------1111------------------------------ >TRANSFORMING GROWTH FACTO; SWP:Q90999; PDB:1KS6A; QLPRLCKFCDVKATTCSNQDQCTSNCNITSICEKNNEVCAAVWRRNDENVTLETICHDPQ ------------------------------------------------------------ KRLYGHMLDDSSSEQCVMKEKKDDGGLMFMCSCTGEECNDVLIFSAI --%%%%---1111-----------------------3333------- >ENDO-B-1,4-GLUCANASE; SWP:O77044; PDB:1KS8A; MAYDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVK ----------------1111----1111-1111---1111-1111--------------- FGFPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQG -----------------------1111--------------------------------- DADHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTH --------3333----------1111---------------------------------- ARQLFDFANNYRGKYSDSITDARNFYASADYRDELVWAAAWLYRATNDNTYLNTAESLYD -------------1111-3333-------3333--------------------------1 EFGLQNWGGGLNWDSKVSGVQVLLAKLTNKQAYKDTVQSYVNYLINNQQKTPKGLLYIDM 1111111----1111-----------------------------------1111------ WGTLRHAANAAFIMLEAAELGLSASSYRQFAQTQIDYALGDGGRSFVCGFGSNPPTRPHH -----------------1111------------------1111---2222---------- RSSSCPPAPATCDWNTFNSPDPNYHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAG 3333---------3333---------2222-----1111----11113333--3333--- FQSALAALVALGY --------1111- >2-DEHYDROPANTOATE 2-REDUC; SWP:P77728; PDB:1KS9A; MKITVLGCGALGQLWLTALCKQGHEVQGWLRVPQPYCSVNLVETDGSIFNESLTANDPDF -------------------1111-------------------1111-------------- LATSDLLLVTLKAWQVSDAVKSLASTLPVTTPILLIHNGMGTIEELQNIQQPLLMGTTTH 1111-------1111-----1111-----------------33331111----------- AARRDGNVIIHVANGITHIGPARQQDGDYSYLADILQTVLPDVAWHNNIRAELWRKLAVN 
----!!!!--------------1111--3333--3333---------3333--------- CVINPLTAIWNCPNGELRHHPQEIMQICEEVAAVIEREGHHTSAEDLRDYVMQVIDATAE ---------------3333----------------1111------------------111 NISSMLQDIRALRHTEIDYINGFLLRRARAHGIAVPENTRLFEMVKRKESE 1-------1111--------------------------------------- >BACTERIOCHLOROPHYLL A PRO; SWP:Q46393; PDB:1KSAA; TTAHSDYEIVLEGGSSSWGKVKARAKVNAPPASPLLPADCDVKLNVKPLDPAKGFVRISA ------------------------------------------------------------ VFESIVDSTKNKLTIEADIANETKERRISVGEGMVSVGDFSHTFSFEGSVVNLFYYRSDA -----iiii--------------------------------------------------3 VRRNVPNPIYMQGRQFHDILMKVPLDNNDLIDTWEGTVKAIGSTGAFNDWIRDFWFIGPA 333-------------------------3333-----------------3333---!!!! FTALNEGGQRISRIEVNGLNTESGPKGPVGVSRWRFSHGGSGMVDSISRWAELFPSDKLN ------------------------------------------------------------ RPAQVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGMVGKFNNFNV -----------1111------------------------------------3333----- DAQLKVVLPKGYKIRYAAPQYRSQNLEEYRWSGGAYARWVEHVCKGGVGQFEILYAQ --------2222--------------------!!!!-----1111------------ >Retinal rod rhodopsin-sen; SWP:O43924; PDB:1KSGB; KDERAREILRGFKLNWMNLRDAETGKILWQGTEDLSVPGVEHEARVPKKILKCKAVSREL --------1111------------------------------------1111-------- NFSSTEQMEKFRLEQKVYFKGQCLEEWFFEFGFVIPNSTNTWQSLIIETKFFDDDLLVST ------------------------------------------------------------ SRVRLFYV -------- >ARF-LIKE PROTEIN 2; SWP:Q9D0J4; PDB:1KSHA; RELRLLMLGLDNAGKTTILKKFNGEDVDTISPTLGFNIKTLEHRGFKLNIWDVGGQKSLR ---------2222-------1111------------------iiii---------33331 SYWRNYFESTDGLIWVVDSADRQRMQDCQRELQSLLVEERLAGATLLIFANKQDLPGALS 111---2222-------11111111------------1111----------1111----- NAIQEALELDSIRSHHWRIQGCSAVTGEDLLPGIDWLLDDISSR -------3333------------1111---3333---------- >Retinal rod rhodopsin-sen; SWP:O43924; PDB:1KSHB; SAKDERAREILRGFKLNWMNLRDAETGKILWQGTEDLSVPGVEHEARVPKKILKCKAVSR ----------1111---------1111------------------------1111----- ELNFSSTEQMEKFRLEQKVYFKGQLEEWFFEFGFVIPNSTNTWQSLIEMPASVLTGNVII 
--------------------!!!!--------------------------3333------ ETKFFDDDLLVSTSRVRLFYV --------------------- >COPPER AMINE OXIDASE; SWP:Q43077; PDB:1KSIA; VQHPLDPLTKEEFLAVQTIVQNKYPISNNRLAFHYIGLDDPEKDHVLRYETHPTLVSIPR --1111---3333------3333-3333-----------------------3333----- KIFVVAIINSQTHEILINLRIRSIVSDNIHNGYGFPILSVDEQSLAIKLPLKYPPFIDSV -------%%%%-------1111----------------------11113333-------- KKRGLNLSEIVCSSFTMGWFGEEKNVRTVRLDCFMKESTVNIYVRPITGITIVADLDLMK 1111-1111-------------------------------3333---------------- IVEYHDRDIEAVPTAENTEYQVSKQSPPFGPKQHSLTSHQPQGPGFQINGHSVSWANWKF --------------------3333---------------1111-----------!!!!-- HIGFDVRAGIVISLASIYDLEKHKSRRVLYKGYISELFVPYQDPTEEFYFKTFFDSGEFG -------------------1111---------------------3333-----3333--- FGLSTVSLIPNRDCPPHAQFIDTYVHSANGTPILLKNAICVFEQYGNIMWRHTENGIPNE --------2222--1111--------1111--------------------------2222 SIEESRTEVNLIVRTIVTVGNDNVIDWEFKASGSIKPSIALSGILEIKGTNIKHKDEIKE -----------------------------1111--------------------3333--- DLHGKLVSANSIGIYHDHFYIYYLDFDIDGTHNSFEKTSLKTVRIKDGSSKRKSYWTTET -------2222---------------2222------------------------------ QTAKTESDAKITIGLAPAELVVVNPNIKTAVGNEVGYRLIPAIPAHPLLTEDDYPQIRGA ----3333---------------1111-1111-----------------1111-----11 FTNYNVWVTAYNRTEKWAGGLYVDHSRGDDTLAVWTKQNREIVNKDIVMWHVVGIHHVPA 11---------1111-1111--2222----3333-------------------------3 QEDFPIMPLLSTSFELRPTNFFERNPVLKTLSPRDVAWPGC 333---------------------1111------------- --- >S100 CALCIUM-BINDING PROT; SWP:P33764; PDB:1KSOA; ARPLEQAVAAIVCTFQEYAGRCGDKYKLCQAELKELLQKELATWTPTEFRECDYNKFMSV -----------------1111--1111-------------1111--1111---------- LDTNKDCEVDFVEYVRSLACLCLYCHEYFKDCP ---1111------------------3333---- >LATENT TRANSFORMING GROWT; SWP:P22064; PDB:1KSQA; SADQPKEEKKECYYNLNDASLCDNVLAPNVTKQECCCTSGAGWGDNCEIFPCPVLGTAEF ----------------%%%%----------3333-3333--------------------3 TEMCPKGKGFVPAGE 333------------ ------------------------------------------------------------ 
---------------------------------------- >51 KDA FK506-BINDING PROT; SWP:Q13451; PDB:1KT0A; VLKIVTPMIGDKVYVHYKGKLFDSPFVFSLGKGQVIKAWDIGVATMKRGEICHLLCKPEY -------2222-----------------2222--------------2222------3333 AYGSAGSLPKIPSNATLFFEIELLDFKGEDLFEDGGIIRRTKRKGEGYSNPNEGATVEIH -!!!!------1111------------------------------------2222----- LEGRCGGRMFDCRDVAFTVGEGEDHDIPIGIDKALEKMQREEQCILYLGPRYGFGEAGKP ----iiii---------22223333-------------2222------3333--3333-1 KFGIEPNAELIYEVTLKSFEKAKESWEMDTKEKLEQAAIVKEKGTVYFKGGKYMQAVIQY 111---------------------1111-------------------------------- GKIVSWLEMEYGLSEKESKASESFLLAAFLNLAMCYLKLREYTKAVECCDKALGLDSANE ------1111---3333------------------------------------------- KGLYRRGEAQLLMNEFESAKGDFEKVLEVNAARLQISMCQKKAKEHNERDRRIYANM ----------1111--3333------------------------------------- >FK506-BINDING PROTEIN FKB; SWP:Q9XSH5; PDB:1KT1A; KKDRGVLKIVHGEETPMIGDRVYVHYNGKLANPFVFSIGKGQVIKAWDIGVATMKKGEIC ----------------2222-----------------------------1111-2222-- HLLCAYGATGSLPKIPSNATLFFEVELLDFKGEDLLEDGGIIRRTKRRGEGYSNPNEGAR -------------------------------------------------------2222- VQIHLEGRCGGRVFDCRDVAFTVGEGEDHDIPIGIDKALEKMQREEQCILHLGPRYGFGE --------iiii---------22221111-------3333--2222------3333--33 AGKPKFGIEPNAELIYEVTLKSFEKAKESWEMDTKEKLEQAAIVKEKGTVYFKGGKYVQA 33-1111-2222---------------1111--------------------1111----- VIQYGKIVSWLEMEYGLSEKESKASESFLLAAFLNLAMCYLKLREYTKAVECCDKALGLD ----------1111-------------------------------3333------11111 SANEKGLYRRGEAQLLMNEFESAKGDFEKVLEVNPQNKAARLQIFMCQKKAKEHNERDRR 111-----------1111-----------33331111----------------------- TYANMFKKFAEQDA ---------1111- >MHC class II E-beta-k [Pr; SWP:Q31163; PDB:1KT2B; ADLIAYLKQATKGGGGSLVPRGSGGGGSRPWFLEYCKSECHFYNGTQRVRLLVRYFYNLE --------------------------------------------------------!!!! 
ENLRFDSDVGEFRAVTELGRPDAENWNSQPEFLEQKRAEVDTVCRHNYEIFDNFLVPRRV -----3333------3333-----------------------------------1111-- EPTVTVYPTKTQPLEHHNLLVCSVSDFYPGNIEVRWFRNGKEEKTGIVSTGLVRNGDWTF ----------3333-----------------------iiii------------------- QTLVMLETVPQSGEVYTCQVEHPSLTDPVTVEW ---------------------3333-------- >PLASMA RETINOL-BINDING PR; SWP:P18902; PDB:1KT6A; ERDCRVSSFRVKENFDKARFAGTWYAMAKKDPEGLFLQDNIVAEFSVDENGHMSATAKGR ----3333-------3333----------------------------1111--------- VRLLNNWDVCADMVGTFTDTEDPAKFKMKYWGVASFLQKGNDDHWIIDTDYETFAVQYSC ---------------------1111--------1111----------------------- RLLNLDGTCADSYSFVFARDPSGFSPEVQKIVRQRQEELCLARQYRLIPHNGYCD ---1111-----------------------------11112222----------- >ALPHA-N-ACETYLGALACTOSAMI; SWP:Q90744; PDB:1KTBA; LENGLARTPPMGWLAWERFRCNVNCREDPRQCISEMLFMEMADRIAEDGWRELGYKYINI ---------------3333----3333------3333-----------3333-------- DDCWAAKQRDAEGRLVPDPERFPRGIKALADYVHARGLKLGIYGDLGRLTCGGYPGTTLD ---------1111--------1111--------1111------------1111----111 RVEQDAQTFAEWGVDMLKLDGCYSSGKEQAQGYPQMARALNATGRPIVYSCSWPAYQGGL 1--------------------------------------------------3333----- PPKVNYTLLGEICNLWRNYDDIQDSWDSVLSIVDWFFTNQDVLQPFAGPGHWNDPDMLII --------------------------------------33333333-2222-------22 GNFGLSYEQSRSQMALWTIMAAPLLMSTDLRTISPSAKKILQNRLMIQINQDPLGIQGRR 22---------------1111-------1111-------------------3333----- IIKEGSHIEVFLRPLSQAASALVFFSRRTDMPFRYTTSLAKLGFPMGAAYEVQDVYSGKI ---1111--------%%%%------------------3333------------------- ISGLKTGDNFTVIINPSGVVMWYLCPKA ----1111------2222---------- >THIOLTRANSFERASE; SWP:P12309; PDB:1KTE; AQAFVNSKIQPGKVVVFIKPTCPFCRKTQELLSQLPFKEGLLEFVDITATSDTNEIQDYL ----3333-2222------------------3333--2222----1111----------- QQLTGARTVPRVFIGKECIGGCTDLESMHKRGELLTRLQQVGAVK -------------!!!!---------------------------- >DIADENOSINE TETRAPHOSPHAT; SWP:Q9U2M7; PDB:1KTGA; VVKAAGLVIYRKLAGKIEFLLLQASYPPHHWTPPKGHVDPGEDEWQAAIRETKEEANITK ------------iiii----------------------2222----------------33 
EQLTIHEDCHETLFYEAKGKPKSVKYWLAKLNNPDDVQLSHEHQNWKWCELEDAIKIADY 33---1111-------%%%%-------------------3333----------------- AEMGSLLRKFSAFLAGF ------------3333- >COLLAGEN ALPHA 3(VI) CHAI; SWP:P12111; PDB:1KTHA; ETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPV -3333-------------------1111------------------------------ >ALLERGEN DER P 2; SWP:P49278; PDB:1KTJA; SEVDVKDCANHEIKKVLVPGCHGSEPCIIHRGKPFQLEAVFEANQNTKTAKIEIKASIDG -----------------2222!!!!----2222------------------------iii LEVDVPGIDPNACHYMKCPLVKGQQYDIKYTWNVPKIAPKSENVVVTVKVMGDDGVLACA i---------1111------2222----------1111-------------1111----- IATHAKIRD --------- >EXOTOXIN TYPE C; SWP:NA; PDB:1KTKE; GAVVSQHPSRVIAKSGTSVKIECRSLDFQATTMFWYRQFPKQSLMLMATSAEGSKATYEQ ------------------------------------------------------------ GVEKDKFLINHASLTLSTLTVTSAHPEDSSFYICSALAGSGSSTDTQYFGPGTRLTVLED ------------1111--------3333-------------------------------- LKNFPPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPL ---------------3333----------------------------------------- KEQPALNDSRYALSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSA --------------------2222------------------------------------ EAWGRAD ------- >ANTI-HIS TAG ANTIBODY 3D5; SWP:NA; PDB:1KTRH; QVQLQQSGPEDVKPGASVKISCKASGYTFTDYYMNWVKQSPGKGLEWIGDINPNNGGTSY ------------2222-----------1111----------------------------- NQKFKGRATLTVDKSSSTAYMELRSLTSEDSSVYYCESQSGAYWGQGTTVTVSA 3333---------1111---------3333------------------------ ------------------------------------- >TRANSFORMING GROWTH FACTO; SWP:P10600; PDB:1KTZA; ENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSASASPCCVPQDLEPLTILY ----------3333---------------------------------------------- YVGRTPKVEQLSNMVVKSCKCS -!!!!----------------- >L1 LIPASE; SWP:O66015; PDB:1KU0A; ASPRANDAPIVLLHGFTGWGREEMLGFKYWGGVRGDIEQWLNDNGYRTYTLAVGPLSSNW -------------------1111iiii---!!!!-------1111--------1111--- DRACEAYAQLVGGTVDYGAAHAAKHGHARFGRTYPGLLPELKRGGRVHIIAHSQGGQTAR --------------------------------------3333------------------ 
MLVSLLENGSQEEREYAKEHNVSLSPLFEGGHRFVLSVTTIATPHDGTTLVNMVDFTDRF ----------3333----------3333-------------------3333---3333-- FDLQKAVLEAAAVASNAPYTSEIYDFKLDQWGLRREPGESFDHYFERLKRSPVWTSTDTA -----------------3333------3333----2222-----------3333----33 RYDLSVPGAETLNRWVKASPNTYYLSFSTERTYRGALTGNYYPELGMNAFSAIVCAPFLG 33----------------1111---------------------2222---------3333 SYRNAALGIDSHWLGNDGIVNTISMNGPKRGSNDRIVPYDGTLKKGVWNDMGTYNVDHLE ----1111-1111-------3333----2222------------------------1111 VIGVDPNPSFNIRAFYLRLAEQLASLRP ------1111------------1111-- >ARF GUANINE-NUCLEOTIDE EX; SWP:P39993; PDB:1KU1A; DREFIECTNAFNEKPKKGIPMLIEKGFIASDSDKDIAEFLFNNNNRMNKKTIGLLLCHPD -------------3333-------------------------1111-----------111 KVSLLNEYIRLFDFSGLRVDEAIRILLTKFRLPGESQQIERIIEAFSSAYCENQDYDPSK 1------3333--2222---------------------------------1111--3333 ISDNAEDDISTVQPDADSVFILSYSIIMLNTDLHNPQVKEHMSFEDYSGNLKGCCNHKDF ----22221111------------------33331111----3333----2222%%%%-- PFWYLDRVYCSIRDKEIVMP -------------------- >SIGMA FACTOR SIGA; SWP:Q9EZJ8; PDB:1KU2A; SDPVRQYLHEIGQVPLLTLEEEIDLARKVEEGMEAIKKLSEATGLDQELIREVVRAKILG ----------------------------------------------------------11 TARIQKIPGLKEKPDPKTVEEVDGKLKSLPKELKRYLHIAREGEAARQHLIEANLRLVVS 11----2222------333311113333-3333--------------------------- IAKKYTGRGLSFLDLIQEGNQGLIRAVEKFEYKRRFKFSTYATWWIRQAINRAIADQART -1111-------------------------3333---3333-----------1111---- IRIPVHMVETINKLSRTARQLQQELGREPSYEEIAEAMGPGWDAKRVEETLKIAQEPVSL ---3333------------------------------------------1111------- >SIGMA FACTOR SIGA; SWP:Q9EZJ8; PDB:1KU3A; ELEKALSKLSEREAMVLKMRKGLIDGREHTLEEVGAYFGVTRERIRQIENKALRKLKYHE ----3333---------------------------------------------------- S - >HPHA; SWP:O74098; PDB:1KU5A; GELPIAPVDRLIRKAGAERVSEQAAKVLAEYLEEYAIEIAKKAVEFARHAGRKTVKVEDI -----------------------------------------------1111--------- KLAIKS ------ >HYPOTHETICAL PROTEIN MJ22; SWP:Q58958; PDB:1KU9A; IIEEAKKLIIELFSELAKIHGLNKSVGAVYAILYLSDKPLTISDIEELKISKGNVSSLKK 
----------------------3333------1111----3333---------------- LEELGFVRKVWIKGERKNYYEAVDGFSSIKDIAKRKHDLIAKTYEDLKKLEEKCNEEEKE -----------2222---------3333--------------------1111-----333 FIKQKIKGIERKKISEKILEALNDLD 3------------------3333--- >METALLOPROTEINASE; SWP:O57413; PDB:1KUFA; QRFPQRYIELAIVVDHGMYTKYSSNFKKIRKRVHQMVSNINEMCRPLNIAITLALLDVWS ---------------------%%%%------------------3333------------- EKDFITVQADAPTTAGLFGDWRERVLLKKKNHDHAQLLTDTNFARNTIGWAYVGRMCDEK -------------------------3333--------------%%%%-------2222-- YSVAVVKDHSSKVFMVAVTMTHELGHNLGMEHDDKDKCKCDTCIMSAVISDKQSKLFSDC -----------3333------------------3333-----1111---1111------- SKDYYQTFLTNDNPQCILNAP -------------3333---- >Phosphoribosylaminoimidaz; SWP:Q9X0X0; PDB:1KUTA; TKIVKVTGDYALLEFKDDLTGKGSICAETTAILKYLSEKGIKTHLVEYIPPRTLKVIPLK ------!!!!--------------------------1111-------------------- FPLEVVVRLKKAGSFVRRYGGAEGEDLPVPLVEFFIKDDERHDPVCVDHLEILGIATKKQ -----------!!!!------2222------------3333----3333----------- AEKKEAAVKITLALKEFFERANFELWDIKYEFGLDKDGNVVLGDEISPDTFRLRKKGEIF ----------------------------------1111--------1111---------- DKDVYRRDLGDPLKKYREVLELCRSLNSQ ----------1111--------------- >CONSERVED PROTEIN; SWP:O27099; PDB:1KUUA; MYLGRILAVGRNSNGSFVAYRVSSRSFPNRTTSIQEERVAVVPVEGHERDVFRNPYIAYN -----------1111----------------------------2222-3333-------- CIRIVGDTAVVSNGSHTDTIADKVALGMNLRDAIGLSLLAMDYEKDELNTPRIAAAINGS ----!!!!-------------------------------------1111----------- EAFIGIVTADGLMVSRVPEETPVYISTYEQTEPAATEFKAGSPEEAAEFILKGGEFAAFT -------1111------3333--------------------------------1111--- HPVTAAAAFNDGEGWNLATREM ---------------------- >ALPHA-LIKE TOXIN; SWP:P59854; PDB:1KV0A; VRDGYIALPHNCAYGCLNNEYCNNLCTKDGAKIGYCNIVGKYGNACWCIQLPDNVPIRVP --------------------------1111---------------------1111----- GRCHPA ------ >PROTEIN-GLUTAMINE GAMMA-G; SWP:P21980; PDB:1KV3A; ETNGRDHHTADLCREKLVVRRGQPFWLTLSLTFSVVTGPAPSQEAGTKARFPLRDAVEEG 3333----3333-----------------------------3333--------------- 
DWTATVVDQQDCTLSLQLTTPANAPIGLYRLSLEASGHFILLFNAWCPADAVYLDSEEER -------------------------------------------1111--1111------- QEYVLTQQGFIYQGSAKFIKNIPWNFGQFQDGILDICLILLDVNPKFLKNAGRDCSRRSS --------------1111--------1111-------------3333--3333------- PVYVGRVGSGMVNCNDDQGVLLGRWDNNYGDGVSPMSWIGSVDILRRWKNHGCQRVKYGQ ---------------------------------1111----------------------- CWVFAAVACTVLRCLGIPTRVVTNYNSAHDQNSNLLIEYFRNEFGEIQGDKSEMIWNFHC -----------------------------------------1111--------------- WVESWMTRPDLQPGYEGWQALDPTPQEKSEGTYCCGPVPVRAIKEGDLSTKYDAPFVFAE -------1111---------------------------3333----1111--3333---- VNADVVDWIQQDDGSVHKSINRSLIVGLKISTKSVGRDEREDITHTYKYPEGSSEEREAF ---------------------------------2222-----3333---2222------- TRANHLNKLAEKEETGMAMRIRVGQSMNMGSDFDVFAHITNNTAEEYVCRLLLCARTVSY 1111------------------------------------------------------11 NGILGPECGTKYLLNLTLEPFSEKSVPLCILYEKYRDCLTESNLIKVRALLVEPVINSYL 11----------------------------3333-----1111----------------- LAERDLYLENPEIKIRILGEPKQKRKLVAEVSLQNPLPVALEGCTFTVEGAGLTEEQKTV ------------------------------------------------------------ EIPDPVEAGEEVKVRMDLVPLHMGLHKLVVNFESDKLKAVKGFRNVIIGPA ---------------------------------1111-------------- >MORICIN; SWP:P82818; PDB:1KV4A; AKIPIKAIKTVGKAVGKGLRAINIASTANDVFNFLKPKKRKA ---3333----------------------------------- >PROBABLE BLUE-COPPER PROT; SWP:P36649; PDB:1KV7A; RPTLPIPDLLTTDARNRIQLTIGAGQSTFGGKTATTWGYNGNLLGPAVKLQRGKAVTVDI ------------1111------------%%%%------------------2222------ YNQLTEETTLHWHGLEVPGEVDGGPQGIIPPGGKRSVTLNVDQPAATCWFHPHQHGKTGR -----------2222--3333--1111--2222--------------------2222--- QVAMGLAGLVVIEDDEILKLMLPKQWGIDDVPVIVQDKKFSADGQIDYQLDVMTAAVGWF --------------3333------2222------------1111---------------- GDTLLTNGAIYPQHAAPRGWLRLRLLNGCNARSLNFATSDNRPLYVIASDGGLLPEPVKV -----iiii----------------------------1111-------1111-------- SELPVLMGERFEVLVEVNDNKPFDLVTLPVSQMGMAIAPFDKPHPVMRIQPIAISASGAL -----2222--------2222----------2222------------------------- 
PDTLSSLPALPSLEGLTVRKLQLSMDPMLDMMGMQMLMEKYGDQAMAGMDFHHANKINGQ ------------2222-----------------------------22223333---%%%% AFDMNKPMFAAAKGQYERWVISGVGDMMLHPFHIHGTQFRILSENGKPPAAHRAGWKDTV --1111-------------------------------------iiii--1111------- KVEGNVSEVLVKFNHDAPKEHAYMAHCHLLEHEDTGMMLGFTV -----------------1111-------3333----------- >TYPE II QUINOHEMOPROTEIN ; SWP:Q8GR64; PDB:1KV9A; AGVDEAAIRATEQAGGEWLSHGRTYAEQRFSPLKQIDASNVRSLGLAWYMDLDNTRGLEA -----------------------1111---------11111111---------------- TPLFHDGVIYTSMSWSRVIAVDAASGKELWRYDPEVAKVKARTSCCDAVNRGVALWGDKV ----iiii-----%%%%-------------------1111-------------------- YVGTLDGRLIALDAKTGKAIWSQQTTDPAKPYSITGAPRVVKGKVIIGNGGAEYGVRGFV ---1111-------------------3333----------iiii------3333------ SAYDADTGKLAWRFYTVPGDPALPYEHPELREAAKTWQGDQYWKLGGGGTVWDSMAYDPE -------------------3333---3333---1111---3333--------------11 LDLLYVGTGNGSPWNREVRSPGGGDNLYLSSILAIRPDTGKLAWHYQVTPGDSWDFTATQ 11------------3333-1111-------------------------2222-------- QITLAELNIDGKPRKVLMQAPKNGFFYVLDRTNGKLISAEKFGKVTWAEKVDLATGRPVE --------iiii--------3333----------------------------3333---- APGVRYEKEPIVMWPSPFGAHNWHSMSFNPGTGLVYIPYQEVPGVYRNEGKDFVTRKAFN 2222-----------3333----------1111---------------!!!!-------- TAAGFADATDVPAAVVSGALLAWDPVKQKAAWKVPYPTHWNGGTLSTAGNLVFQGTAAGQ ---3333----3333--------------------------------------------- MHAYSADKGEALWQFEAQSGIVAAPMTFELAGRQYVAIMAGWGGVATLTGGESMNLPGMK -----------------------------%%%%----------3333---3333-2222- NRSRLLVFALDGKAQLPPPAPAPAKVERVPQPVTAAPEQVQAGKQLYGQFCSVCHGMGTI -----------------------------------3333----------------2222- SGGLIPDLRQSSDATREHFQQIVLQGALKPLGMPSFDDSLKPEEVEQIKLYVMSREYEDY ------3333---------------1111------------------------------- MARH ---- >SMK TOXIN; SWP:P19972; PDB:1KVEA; WSLRWRMQKSTTIAAIAGCSGAATFGGLAGGIVGCIAAGILAILQGFEVNWHNGGG ------------------------%%%%---------------------------- >Salt-mediated killer prot; SWP:P19972; PDB:1KVEB; GEATTIWGVGADEAIDKGTPSKNDLQNMSADLAKNGFKGHQGVACSTVKDGNKDVYMIKF 
------------------------------------%%%%---------!!!!------- SLAGGSNDPGGSPCSDD ----------------- >COPPER-TRANSPORTING ATPAS; SWP:Q04656; PDB:1KVIA; MDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVHHIKVSLEEKNATIIYDPKLQTPK --%%%%------------3333------1111---------3333-------3333---- TLQEAIDDMGFDAVIHNPD ------------------- >MEVALONATE KINASE; SWP:P17256; PDB:1KVKA; MLSEVLLVSAPGKVILHGEHAVVHGKVALAVALNLRTFLVLRPQSNGKVSLNLPNVGIKQ -------------------3333------------------------------------- VWDVATLQLLDTEKLKKVAGLPRDCVGNEGLSLLAFLYLYLAICRKQRTLPSLDIMVWSE --33331111----------------1111-------------1111------------- LPPGAGLGSSAAYSVCVAAALLTACEEVTNPLKDRGSIGSWPEEDLKSINKWAYEGERVI -------------------------------1111------------------------- HGNPSGVDNSVSTWGGALRYQQGKMSSLKRLPALQILLTNTKVPRSTKALVAGVRSRLIK ---------------------------------------------3333----------- FPEIMAPLLTSIDAISLECERVLGEMAAAPVPEQYLVLEELMDMNQHHLNALGVGHASLD -3333-------------------------3333---------------------3333- QLCQVTAAHGLHSKLTGAGGGGCGITLLKPGLERAKVEAAKQALTGCGFDCWETSIGAPG -----3333-----------------------3333------------------------ VSMHSATSIEDPVRQALG ----1111---------- >SRP19; SWP:O29010; PDB:1KVNA; MKESVVWTVNLDSKKSRAEGRRIPRRFAVPNVKLHELVEASKELGLKFRAEEKKYPKSWW ------3333-3333-3333---3333-----3333------------------------ EEGGRVVVEKRGTKTKLMIELARKIAEIREQKREQKKDKKKKKK -------------3333------------------3333----- >RC-RNASE4; SWP:Q9DFY6; PDB:1KVZA; MQDWATFKKKHLTDTWDVDCDNLMPTSLFDCKDKNTFIYSLPGPVKALCRGVIFSADVLS -------------------1111-----------------------1111---------- NSEFYLAECNVKPRKPCKYKLKKSSNRICIRCEHELPVHFAGVGICP --------------------------------%%%%----------- >Biphenyl-2,3-diol 1,2-dio; SWP:P17297; PDB:1KW3B; SIERLGYLGFAVKDVPAWDHFLTKSVGLMAAGSAGDAALYRADQRAWRIAVQPGELDDLA ---------------------------------!!!!----------------3333--- YAGLEVDDAAALERMADKLRQAGVAFTRGDEALMQQRKVMGLLCLQDPFGLPLEIYYGPA -------------------1111-----------------------1111---------- EIFHEPFLPSAPVSGFVTGDQGIGHFVRCVPDTAKAMAFYTEVLGFVLSDIIDIQMGPET 
-1111------------!!!!--------------------------------------- SVPAHFLHCNGRHHTIALAAFPIPKRIHHFMLQANTIDDVGYAFDRLDAAGRITSLLGRH -----------------------------------------------1111--------- TNDQTLSFYADTPSPMIEVEFGWGPRTVDSSWTVARHSRTAMWGHKSV -------------2222-----------1111---------------- >POLYHOMEOTIC; SWP:P39769; PDB:1KW4A; DRPPISSWSVDDVSNFIRELPGCQDYVDDFIQQEIDGQALLRLKEKHLVNAGKLGPALKI ---3333------------22221111--------333311113333------------- VAKVESIK -------- >LYSOZYME; SWP:P00720; PDB:1KW5A; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKMKPVYDSMDAVRRAAMINMVFQMGETGVAGFTNSLRM ------------------------------------------------------------ LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYK 1111-------3333----------------------3333- >HCASK/LIN-2 PROTEIN; SWP:O14936; PDB:1KWAA; RSRLVQFQKNTDEPMGITLKMNELNHCIVARIMHGGMIHRQGTLHVGDEIREINGISVAN ----------------------3333------2222--1111--2222----iiii3333 QTVEQLQKMLREMRGSITFKIVPSYREF ---------------------------- >ENDOGLUCANASE A; SWP:P04955; PDB:1KWFA; AGVPFNTKYPYGPTSIADNQSEVTAMLKAEWEDWKSKRITSNGAGGYKRVQRDASTNYDT ----------------------------------------2222--------3333---- VSQGMGYGLLLAVCFNEQALFDDLYRYVKSHFNGNGLMHWHIDANNNVTSHDGGDGAATD 3333--------1111----------------1111------1111------3333---- ADEDIALALIFADKLWGSSGAINYGQEARTLINNLYNHCVEHGSYVLKPGDRWGGSSVTN ------------------------------------------------------1111-3 PSYFAPAWYKVYAQYTGDTRWNQVADKCYQIVEEVKKYNNGTGLVPDWCTASGTPASGQS 333--------------3333-----------------iiii-------1111--2222- YDYKYDATRYGWRTAVDYSWFGDQRAKANCDMLTKFFARDGAKGIVDGYTIQGSKISNNH -----3333-------------------------------3333-----1111------- NASFIGPVAAASMTGYDLNFAKELYRETVAVKDSEYYGYYGNSLRLLTLLYITGNFPNPL 3333-----------------------------3333-------------1111---111 SDL 1-- >BETA-GALACTOSIDASE; SWP:O69315; PDB:1KWGA; MLGVCYYPEHWPKERWKEDARRMREAGLSHVRIGEFAWALLEPEPGRLEWGWLDEAIATL ------1111-3333--------3333---------3333---2222------------- 
AAEGLKVVLGTPTATPPKWLVDRYPEILPVDREGRRRRFGGRRHYCFSSPVYREEARRIV 1111-------3333--------3333---1111-----------1111----------- TLLAERYGGLEAVAGFQTDNEYGCHDTVRCYCPRCQEAFRGWLEARYGTIEALNEAWGTA ------1111----------2222------------------------------------ FWSQRYRSFAEVELPHLTVAEPNPSHLLDYYRFASDQVRAFNRLQVEILRAHAPGKFVTH %%%%---3333-----------------------------------------2222---- NFMGFFTDLDAFALAQDLDFASWDSYPLGFTDLMPLPPEEKLRYARTGHPDVAAFHHDLY --------------1111------------1111--------------1111-------- RGVGRGRFWVMEQQPGPVNWAPHNPSPAPGMVRLWTWEALAHGAEVVSYFRWRQAPFAQE -1111----------------------2222----------------------------1 QMHAGLHRPDSAPDQGFFEAKRVAEELAALALPPVAQAPVALVFDYEAAWIYEVQPQGAE 111----1111--------------3333----------------------------111 WSYLGLVYLFYSALRRLGLDVDVVPPGASLRGYAFAVVPSLPIVREEALEAFREAEGPVL 1-------------1111------2222-2222--------------------------- FGPRSGSKTETFQIPKELPPGPLQALLPLKVVRVESLPPGLLEVAEGALGRFPLGLWREW -2222---1111--1111-!!!!--------------2222-----3333---------- VEAPLKPLLTFQDGKGALYREGRYLYLAAWPSPELAGRLLSALAAEAGLKVLSLPEGLRL ----------1111------!!!!--------------------1111------1111-- RRRGTWVFAFNYGPEAVEAPASEGARFLLGSRRVGPYDLAVWEE --!!!!---------------2222---------2222------ >PROTEGRIN-3 PRECURSOR; SWP:P32196; PDB:1KWIA; ALSYREAVLRAVDRLNEQSSEANLYRLLELDGTPKPVSFTVKETVCPRPTRQPPELCDFK ----------------------------------------------------1111---2 ENGRVKQCVGTVTLDPLDITCNEVQ 222---------------------- >PROCARBOXYPEPTIDASE B; SWP:P15086; PDB:1KWMA; HHGGEHFEGEKVFRVNVEDENHINIIRELASTTQIDFWKPDSVTQIKPHSTVDFRVKAED iiii1111---------------------------------1111-----------3333 TVTVENVLKQNELQYKVLISNLRNVVEAQFDSRVR --------------------3333--3333----- >MAP KINASE ACTIVATED PROT; SWP:P49137; PDB:1KWPA; FHVKSGLQIKKNAIIDDYKVTSQVLGLGINGKVLQIFNKRTQEKFALKMLQDCPKARREV ----------------------3333---------------------------------- ELHWRASQCPHIVRIVDVYENLYAGRKCLLIVMECLDGGELFSRIQDRGDQAFTEREASE --------1111------------------------------------------------ 
IMKSIGEAIQYLHSINIAHRDVKPENLLYTSKRPNAILKLTDFGFAKETTSGPEKYDKSC ----------3333--------3333---------------------------------- DMWSLGVIMYILLCGYPPFYSNHGLAISPGMKTRIRMYEFPNPEWSEVSEEVKMLIRNLL ---------------------------33331111----------------------333 KTEPTQRMTITEFMNHPWIMQSTKVPQTPLHTSRVLKEDKERWEDVKEEMTSALATMRVD 3---------------3333----------3333-------------------------- YEQIKIKKIEDASNPLLLK ------------------- >BETA-1,3-GLUCURONYLTRANSF; SWP:O94766; PDB:1KWSA; MTIYVVTPTYARLVQKAELVRLSQTLSLVPRLHWLLVEDAEGPTPLVSGLLAASGLLFTH -----------1111----------1111------------------------------- LVVLTPWVHPRGVEQRNKALDWLRGRGGAVGGEKDPPPPGTQGVVYFADDDNTYSRELFE ---------------------1111------------2222-------------3333-1 EMRWTRGVSVWPVGLVGGLRFEGPQVQDGRVVGFHTAWEPSRPFPVDMAGFAVALPLLLD 111------------iiii-------%%%%--------3333----1111---------- KPNAQFDSTAPRGHLESSLLSHLVDPKDLEPRAANCTRVLVWHTRTEKPKMKQEEQLQRQ 1111--111122223333-1111-3333----%%%%------------------------ GRGSDPAIEV ----1111-- - >Histone H2A type 1; SWP:P06897; PDB:1KX5C; SGRGKQGGKTRAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLTA ------------------1111------------1111-----3333------------- EILELAGNAARDNKKTRIIPRHLQLAVRNDEELNKLLGRVTIAQGGVLPNIQSVLLPKKT ----------1111----3333-----------------------------3333----- ESSKSKSK -------- >Histone H2B 1.1; SWP:P02281; PDB:1KX5D; AKSAPAPKKGSKKAVTKTQKKDGKKRRKTRKESYAIYVYKVLKQVHPDTGISSKAMSIMN ---------------------------------------------1111----------- SFVNDVFERIAGEASRLAHYNKRSTITSREIQTAVRLLLPGELAKHAVSEGTKAVTKYTS ------------------1111-----------------------------------111 AK 1- >B LYMPHOCYTE STIMULATOR; SWP:Q9Y275; PDB:1KXGA; VTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEEKENKILVKETGYFFIYGQVLY ----------1111----iiii-----------------%%%%----------------- TDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETLPNNSCYSAGIAKLEEGDELQL ------------------!!!!-------------------------------------- AIPRENAQISLDGDVTFFGALKLL -----------1111--------- ---------------- >CELL DIVISION CONTROL PRO; SWP:P32797; PDB:1KXLA; 
KMARKDPTIEFCQLGLDTFETKYITMFGMLVSCSFDKPAFISFVFSDFTKNDIVQNYLYD ---------1111-----------------------1111-------------------- RYLIDYENKLELNEGFKAIMYKNQFETFDSKLRKIFNNGLRDLQNGRDENLSQYGIVCKM --------------------------------1111--3333------------------ NIKVKMYNGKLNAIVRECEPVPHSQISSIASPSQCEHLRLFYQRAFKRIGESAISRYFEE ------------------------1111------------------33333333------ YRRFFPI ------- >DIGA16; SWP:P09464; PDB:1KXOA; DVYHDGACPEVKPVDNFDWSQYHGKWWQVAAYPDHITKYGKCGWAEYTPEGKSVKVSRYS -----------------3333----------3333------------------------- VIHGKEYFSEGTAYPVGDSKIGKIYHSYTGVTQEGVFNVLSTDNKNYIIGYFCSYDKKGH -iiii------------------------------------------------------- MDLVWVLSRSMVLTGEAKTAVENYLIGSPVVDSQKLVYSDFSECK ----------------------3333-----3333---------- >Vitamin D-binding protein; SWP:P02774; PDB:1KXPD; LERGRDYEKNKVCKEFSHLGKEDFTSLSLVLYSRKFPSGTFEQVSQLVKEVVSLTEACCA -----------------------------------1111--------------------2 EGADPDCYDTRTSALSAKSCESNSPFPVHPLKHQPQEFPTYVEPTNDEICEAFRKDPKEY 2221111---------1111---------------------------------------- ANQFMWEYSTNYGQAPLSLLVSYTKSYLSMVGSCCTSASPTVCFLKERLQLKHLSLLTTL -----------1111------------------1111----------------------- SNRVCSQYAAYGEKKSRLSNLIKLAQKVPTADLEDVLPLAEDITNILSKCCESASEDCMA ---------------------------11113333--------------1111----333 KELPEHTVKLCDNLSTKNSKFEDCCQEKTAMDVFVCTYFMPAAQLPELPDVELPTNKDVC 3------------1111-------------------------------------3333-- DPGNTKVMDKYTFELSRRTHLPEVFLSKVLEPTLKSLGECCDVEDSTTCFNAKGPLLKKE ---------------------3333-------------3333------------------ LSSFIDKGQELCADYSENTFTEYKKKLAERLKAKLPDATPKELAKLVNKRSDFASNCCSI -------------3333-----------------3333---------------------- NSPPLYCDSEIDAELKNI ------------3333-- >ALPHA-AMYLASE, PANCREATIC; SWP:NA; PDB:1KXQE; QVQLVESGGGSVQAGGSLSLSCAASTYTDTVGWFRQAPGKEREGVAAIYRRTGYTYSADS ------------2222-----------------------------------------111 VKGRFTLSQDNNKNTVYLQMNSLKPEDTGIYYCATGNSVRLASWEGYFYWGQGTQVTVSS 1----------------------3333------------3333----------------- >ALPHA-AMYLASE, 
PANCREATIC; SWP:NA; PDB:1KXTB; QVQLVASGGGSVQAGGSLRLSCAASTFSSYPMGWYRQAPGKECELSARIFSDGSANYADS ------------2222---------------------2222--------3333----111 VKGRFTISRDNAANTAYLQMDSL 1---------------------- >CYCLIN H; SWP:P51946; PDB:1KXU; WTFSSEEQLARLRADANRKFRCKAVANGKVLPNDPVFLEPHEEMTLCKYYEKRLLEFCSV -----3333-----------1111--------------3333--------------3333 FKPAMPRSVVGTACMYFKRFYLNNSVMEYHPRIIMLTCAFLACKVDEFNVSSPQFVGNLR -----3333---------------1111----------------------3333-1111- ESPLGQEKALEQILEYELLLIQQLNFHLIVHNPYRPFEGFLIDLKTRYPILENPEILRKT ------------------------------------------------------------ ADDFLNRIALTDAYLLYTPSQIALTAILSSASRAGITMESYLSESLMLKENRTCLSQLLD -----------3333---1111---------1111----3333--%%%%----------- IMKSMRNLVKKYEPPRSEEVAVLKQKLERCHSAELA ---------------3333--------3333----- >ALPHA-AMYLASE, PANCREATIC; SWP:NA; PDB:1KXVC; VQLVESGGGTVPAGGSLRLSCAASGNTLCTYDMSWYRRAPGKGRDFVSGIDNDGTTTYVD -----------2222----------3333---------2222--------1111----33 SVAGRFTISQGNAKNTAYLQMDSLKPDDTAMYYCKPSLRYGLPGCPIIPWGQGTQVTVS 33----------------------1111-------------2222-------------- >GTP-BINDING PROTEIN YPT7P; SWP:P32939; PDB:1KY2A; SRKKNILKVIILGDSGVGKTSLMHRYVNDKYSQQYKATIGADFLTKEVTVDGDKVATMQV -------------2222------------------------------------------- WDTAGQERFQSLGVAFYRGADCCVLVYDVTNASSFENIKSWRDEFLVHANVNSPETFPFV -----333311111111----------1111---------------------3333---- ILGNKIDAEESKKIVSEKSAQELAKSLGDIPLFLTSAKNAINVDTAFEEIARSALQQNQA ----11113333------------1111--------1111-------------------- >GTP-BINDING PROTEIN YPT7P; SWP:P32939; PDB:1KY3A; NILKVIILGDSGVGKTSLMHRYVNDKYSQQYIGADFLTKEVTVDGDKVATMQVWDTAAFY ---------2222--------------1111--------------------------111 RGADCCVLVYDVTNASSFENIKSWRDEFLVHANVNSPETFPFVILGNKIDAEESKKIVSE 1---------1111---------------------3333--------11113333----- KSAQELAKSLGDIPLFLTSAKNAINVDTAFEEIARSALQQNQ -------1111--------1111------------------- >PROTEASE DO; SWP:P09376; PDB:1KY9A; QPSLAPLEKVPSVVSINVEGSTTVNTPRPRNFQQFFGGQQQKFALGSGVIIDADKGYVVT 
------1111------------------------------------------3333---- NNHVVDNATVIKVQLSDGRKFDAKVGKDPRSDIALIQIQNPKNLTAIKADSDALRVGDYT 11112222------1111----------1111-----------------3333-1111-- VAIGNPFGLGETVTSGIVSALGRSGYENFIQTDAAINRGNAGGALVNLNGELIGINTAIL -----------------------------------2222-------1111---------- APDGGNIGIGFAIPSNVKNLTSQVEYGQVKRGELGIGTELNSELAKAKVDAQRGAFVSQV --------------------------------------2222-----------------3 LPNSSAAKAGIKAGDVITSLNGKPISSFAALRAQVGTPVGSKLTLGLLRDGKQVNVNLE 333-----------------------3333--1111----------------------- >LACCASE; SWP:Q96UT7; PDB:1KYAA; GIGPVADLTITNAAVSPDGFSRQAVVVNGGTPGPLITGNMGDRFQLNVIDNLTNHTMLKS ---------------1111-------iiii--------2222-----------3333--- TSIHWHGFFQKGTNWADGPAFINQCPISSGHSFLYDFQVPDQAGTFWYHSHLSTQYCDGL ---------22221111----------2222--------------------!!!!----- RGPFVVYDPNDPAADLYDVDNDDTVITLVDWYHVAAKLGPAFPLGADATLINGKGRSPST ----------1111------1111----------1111------------iiii--1111 TTADLSVISVTPGKRYRFRLVSLSCDPNYTFSIDGHNMTIIETDSINTAPLVVDSIQIFA ----------2222------------------2222------iiii------------22 AQRYSFVLEANQAVDNYWIRANPNFGNVGFTGGINSAILRYDGAAAVEPTTTQTTSTAPL 22---------------------------2222-------2222---------------- NEVNLHPLVATAVPGSPVAGGVDLAINMAFNFNGTNFFINGASFTPPTVPVLLQIISGAQ 3333-------------2222-----------------iiii------------1111-- NAQDLLPSGSVYSLPSNADIEISFPATAAAPGAPHPFHLHGHAFAVVRSAGSTVYNYDNP 3333--2222----------------3333------------------2222-------- IFRDVVSTGTPAAGDNVTIRFRTDNPGPWFLHCHIDFHLEAGFAVVFAEDIPDVASANPV ---------1111---------------------33331111--------11113333-- PQAWSDLCPTYDARDPSDQ 3333------3333----- >ALPHA-ADAPTIN C; SWP:P42567; PDB:1KYFA; GSPGIRLGSSEDNFARFVCKNNGVLFENQLLQIGLKSEFRQNLGRMFIFYGNKTSTQFLN ----------1111-------------------------!!!!----------------- FTPTLICADDLQTNLNLQTKPVDPTVDGGAQVQQVVNIECISDFTEAPVLNIQFRYGGTF -------3333---------------2222-------------------------iiii- QNVSVKLPITLNKFFQPTEMASQDFFQRWKQLSNPQQEVQNIFKAKHPMDTEITKAKIIG 
-----------1111------------1111--1111----------------------- FGSALLEEVDPNPANFVGAGIIHTKTTQIGCLLRLEPNLQAQMYRLTLRTSKDTVSQRLC -----------1111--------1111-----------1111------------------ ELLSEQF ------- >Hypothetical 29.9 kDa pro; SWP:P94368; PDB:1KYHA; NVPFWTEEHVRATLPERTYGTALLLAGSDDPGAALLAGLGARSGLGKLVIGTSENVIPLI ----------------------------------------------------33331111 VPVLPEATYWRDGWKKAADAQLEETYRAIAIGPGLPQTESVQQAVDHVLTADCPVILDAG ---1111--------3333------------2222------------1111------!!! ALAKRTYPKREGPVILTPHPGEFFRTGVPVNELQKKRAEYAKEWAAQLQTVIVLKGNQTV !---------------------------33331111------------------------ IAFPDGDCWLNPTGNGALAKGGTGDTLTGILGLCCHEDPKHAVLNAVYLHGACAELWTDE --1111--------3333-2222------------------------------------- HSAHTLLAHELSDILPRVWKRFE -3333-3333-3333---3333- >SIROHEME BIOSYNTHESIS PRO; SWP:P15807; PDB:1KYQA; VKSLQLAHQLKDKILGGGEVGLTRLYKLPTGCKTVPDLHKSIIPKGKFIQ ---------2222-------------------------11113333333- >CAFFEIC ACID 3-O-METHYLTR; SWP:P28002; PDB:1KYZA; HISDEEANLFAMQLASASVLPMILKSALELDLLEIIAKAGPGAQISPIEIASQLPTTNPD ---------------------------------------2222-------1111---111 APVMLDRMLRLLACYIILTCSVRTQQDGKVQRLYGLATVAKYLVKNEDGVSISALNLMNQ 1-----------1111------------------------1111-1111--------111 DKVLMESWYHLKDAVLDGGIPFNKAYGMTAFEYHGTDPRFNKVFNKGMSDHSTITMKKIL 133333333-------------------33331111------------------------ ETYTGFEGLKSLVDVGGGTGAVINTIVSKYPTIKGINFDLPHVIEDAPSYPGVEHVGGDM ---1111---------!!!!-3333----1111------3333------2222-----11 FVSIPKADAVFMKWICHDWSDEHCLKFLKNCYEALPDNGKVIVAECILPVAPDSSLATKG 11-------------1111----------------------------------------- VVHIDVIMLAHNPGGKERTQKEFEDLAKGAGFQGFKVHCNAFNTYIMEFL ---------------------------1111---------iiii------ >6,7-DIMETHYL-8-RIBITYLLUM; SWP:Q9UUB1; PDB:1KZ1A; NPSDLKGPELRILIVHARGNLQAIEPLVKGAVETMIEKHDVKLENIDIESVPGSWELPQG ------1111---------3333------------------3333-------3333---- IRASIARNTYDAVIGIGVLIKGSTMHFEYISEAVVHGLMRVGLDSGVPVILGLLTVLNEE ---------------------------------------------------------333 
QALYRAGLNGGHNHGNDWGSAAVEMGLKAL 3-1111-iiii------------------- >GUANINE NUCLEOTIDE EXCHAN; SWP:Q64096; PDB:1KZ7A; EEEESLAILRRHVNELLDTERAYVEELLCVLEGYAAEDNPLAHLISTGLQNKKNILFGNE ---------------------------------------------3333---3333---- EIYHFHNRIFLRELESCIDCPELVGRCFLEREEFQIYEKYCQNKPRSESLWRQCSDCPFF ----------------33331111-------3333------------------1111--- QECQKKLDHKLSLDSYLLKPVQRITKYQLLLKELKYSKHCEGAEDLQEALSSILGILKAV -----------33331111--------------1111----------------------- NDSHLIAITGYDGNLGDLGKLLQGSFSVWTDHKKGELARFKPQRHLFLHEKAVLFCKKRE ---1111--------1111-----------------------------1111-------- ENGEGYEKAPSYSYKQSLNTAVGITENVKGDTKKFEIWYNAREEVYIIQAPTPEIKAAWV ---------------------------2222--------%%%%----------------- NAIRKVLTSQLQACREASQHRALEQSH --------------------------- >ACYL-HOMOSERINELACTONE SY; SWP:P54656; PDB:1KZFA; MLELFDVSYEELQTTFSDRLGWEVICSQGMESDEFDGPGTRYILGICEGQLVCSVRFTSL -------------------------1111---1111----------%%%%--------11 DRPNMITHTFQHCFSDVTLPAYGTESSRFFVDKARARALLGEHYPISQVLFLAMVNWAQN 11-33331111--1111------------------------------------------- NAYGNIYTIVSRAMLKILTRSGWQIKVIKEAFLTEKERIYLLTLPAGQDDKQQLGGDVVS ---------------------------------1111----------------------- RTGCPPVAVTTWPLTLPV ----3333---------- >PROTEASE; SWP:P03369; PDB:1KZKA; PQITLWKRPLVTIRIGGQLKEALLDTGADDTVLEEMNLPGKWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------1111-------- QIPVEICGHKAIGTVLVGPTPVNIIGRNLLTQIGCTLNF -----iiii-------------------3333------- >RIBOFLAVIN SYNTHASE; SWP:Q9Y7P0; PDB:1KZLA; MFTGLVEAIGVVKDVQGTIDNGFAMKIEAPQILDDCHTGDSIAVNGTCLTVTDFDRYHFT ----------------------------3333----2222---iiii-------1111-- VGIAPESLRLTNLGQCKAGDPVNLERAVLSSTRMGGHFVQGHVDTVAEIVEKKQDGEAID -----------3333-2222----------------------------------!!!!-- FTFRPRDPFVLKYIVYKGYIALDGTSLTITHVDDSTFSIMMISYTQSKVIMAKKNVGDLV ----------11112222---iiii----------------3333---3333--2222-- NVEVDQIGKYTEKLVEAHIADW ----3333-------------- >MAJOR SURFACE ANTIGEN P30; SWP:P13664; PDB:1KZQA; 
PLVANQVVTCPDKKSTAAVILTPTENHFTLKCPKTALTEPPTLAYSPNRQICPAGTTSSC ---------------------1111-------2222---3333---------2222---2 TSKAVTLSSLIPEAEDSWWTGDSASLDTAGIKLTVPIEKFPVTTQTFVVGCIKGDDAQSC 222--3333-11113333---3333----------1111---------------3333-- MVTVTVQARASSVVNNVARCSYGADSTLGPVKLSAEGPTTMTLVCGKDGVKVPQDNNQYC -------------%%%%----------------3333--------1111-----1111-- SGTTLTGCNEKSFKDILPKLTENPWQGNASSDKGATLTIKKEAFPAESKSVIIGCTGGSP ---3333----3333---------------3333-------------------------- EKHHCTVKLEFAG ------------- >INTESTINAL FATTY ACID-BIN; SWP:P12104; PDB:1KZWA; AFDSTWKVDRSENYDKFMEKMGVNIVKRKLAAHDNLKLTITQEGNKFTVKESSAFRNIEV ------------3333------------------------------------3333---- VFELGVTFNYNLADGTELRGTWSLEGNKLIGKFKRTDNGNELNTVREIIGDELVQTYVYE --2222------------------------------------------------------ GVEAKRIFKKD ----------- >Tumor suppressor p53-bind; SWP:Q12888; PDB:1KZYC; ALEEQRGPLPLNKTLFLGYAFLLTMATIPPFNKQYTESQLRAGAGYILEDFNEAQCNTAY -3333--------1111--------------3333-----1111---------3333--- QCLLIADQHCRTRKYFLCLASGIPCVSHVWVHDSCHANQLQNYRNYLLPAGYSLEEQRIL -----------------------------------------3333--------1111--- DWQPRENPFQNLKVLLVSDQQQNFLELWSEILMTGGAASVKQHHSSAHNKDIALGVFDVV ------1111------------------------------------------3333---- VTDPSCPASVLKCAEALQLPVVSQEWVIQCLIVGERIGFKQHPKYKHDYVSH --1111-----------------------------------33331111--- >BRCA1; SWP:O54952; PDB:1L0BA; RAERDISMVVSGLTPKEVMIVQKFAEKYRLALTDVITEETTHVIIKTDAEFVCERTLKYF ------------------------------------1111-------1111--------- LGIAGGKWIVSYSWVIKSIQERKLLSVHEFEVKGDVVTGSNHQGPRRSRESQLFEGLQIY --1111---------------------1111----------------------2222--- CCEPFTNMPKDELERMLQLCGASVVKELPLLTRDTGAHPIVLVQRLVMWDWVLDSISVYR -----------------1111--------!!!!--------------------------- CRDLDAYLVQ --3333---- ------------------------------------------------- >ANTI-SIGMA F FACTOR; SWP:O32727; PDB:1L0OA; MRNEMHLQFSARSENESFARVTVAAFVAQLDPTMDELTEIKTVVSEAVTNAIIHGYNNDP -----------3333----------3333---3333-----------------1111-11 
NGIVSISVIIEDGVVHLTVRDEGVGIPDIEEARQPLFTTKPELERSGMGFTIMENFMDEV 11-------------------------------2222--1111----------------- IVESEVNKGTTVYLKKHIVKS ----2222------------- >RNA polymerase sigma fact; SWP:O32728; PDB:1L0OC; DGTVKVSRSLKEMGNKIRKAKDELSKTRGRAPTVTEIADHLGISPEDVVLAQEAVRL --------------------------------------1111---------3333-- >SURFACE LAYER PROTEIN; SWP:Q50245; PDB:1L0QA; STFAYIANSESDNISVIDVTSNKVTATIPVGSNPGAVISPDGTKVYVANAHSNDVSIIDT -------1111---------------------------1111------1111-------- ATNNVIATVPAGSSPQGVAVSPDGKQVYVTNASSTLSVIDTTSNTVAGTVKTGKSPLGLA --------------------1111------------------------------------ LSPDGKKLYVTNNGDKTVSVINTVTKAVINTVSVGRSPKGIAVTPDGTKVYVANFDSSIS -1111------1111----------------------------1111------1111--- VIDTVTNSVIDTVKVEAAPSGIAVNPEGTKAYVTNVDKYFNTVSIDTGTNKITARIPVGP ------------------------1111--------2222-----1111----------- DPAGIAVTPDGKKVYVALSFNTVSVIDTATNTITATAVGKNPYASGQFIGSIPVQPVYPS -------1111------------------------------------------------- ADFKSNITSGYIFLSEPVQFTDLSKDATEWKWDFGDGSSSKKQNPTHTYSETGIYTVRLT ------------2222-------------------------------------------- VSNSNGTDSQISTVNVVLKGSPTPS --1111-----------2222---- >THERMAL HYSTERESIS PROTEI; SWP:Q9GTP0; PDB:1L0SA; SCTNTNSQLSANSKCEKSTLTNCVDKSEVFGTTCTGSRFDGVTITTSTSTGSRISGPGCK ---------1111----------------------------------------------- ISTCIITGGVPAPSAACKISGCTFSAN ------iiii---1111---------- >ASPARTYL-TRNA SYNTHETASE; SWP:P36419; PDB:1L0WA; MRRTHYAGSLRETHVGEEVVLEGWVNRRRDLGGLIFLDLRDREGLVQLVAHPASPAYATA -----1111-1111--------------------------1111------1111-3333- ERVRPEWVVRAKGLVRLRPEPNPRLATGRVEVELSALEVLAEAKTPPFPVDAGWRGEEEK ---2222-----------------1111----------------------3333------ EASEELRLKYRYLDLRRRRMQENLRLRHRVIKAIWDFLDREGFVQVETPFLTKSTPEGAR ---------33331111---------------------1111------------------ DFLVPYRHEPGLFYALPQSPQLFKQMLMVAGLDRYFQIARCFRDEDLRADRQPDFTQLDL -----3333--------------------------------------------------- EMSFVEVEDVLELNERLMAHVFREALGVELPLPFPRLSYEEAMERYGSDKPDLRFGLELK 
-----3333--------------------------------------------------- EVGPLFRQSGFRVFQEAESVKALALPKALSRKEVAELEEVAKRHKAQGLAWARVEEGGFS -3333------1111-------------------------3333---------------- GGVAKFLEPVREALLQATEARPGDTLLFVAGPRKVAATALGAVRLRAADLLGLKREGFRF !!!!--3333----------2222-------3333-------------1111-------- LWVVDFPLLEWDEEEEAWTYMHHPFTSPHPEDLPLLEKDPGRVRALAYDLVLNGVEVGGG ----------------------1111--3333-------------------iiii----- SIRIHDPRLQARVFRLLGIGEEEQREKFGFFLEALEYGAPPHGGIAWGLDRLLALMTGSP ---------------------3333--------1111----------------------- SIREVIAFPKNKEGKDPLTGAPSPVPEEQLRELGLMVVRP 3333------1111-----------3333-1111------ >TRANSCRIPTION ANTITERMINA; SWP:P39805; PDB:1L1CA; MKIAKVINNNVISVVNEQGKELVVMGRGLAFQKKSGDDVDEARIEKVFTLDNKDV ---------------3333----------2222------3333--------3333 >PEPTIDE METHIONINE SULFOX; SWP:Q9K1N8; PDB:1L1DA; YKKPSDAELKRTLTEEQYQVTQNSATEYAFSHEYDHLFKPGIYVDVVSGEPLFSSADKYD ----3333-------------------22223333------------------3333--- SGCGWPSFTRPIDAKSVTEHDDFSFNRRTEVRSRAADSHLGHVFPDGPRDKGGLRYCING ------------1111-------------------------------1111-------33 ASLKFIPLEQDAAGYGALKGEV 33----3333111133331111 >MYCOLIC ACID SYNTHASE; SWP:Q7D9R5; PDB:1L1EA; YDLSDDFFRLFLDPTQTYSCAYFERDDMTLQEAQIAKIDLALGKLNLEPGMTLLDIGCGW ------3333--1111--------1111-------------1111--2222------!!! 
GATMRRAIEKYDVNVVGLTLSENQAGHVQKMFDQMDTPRSRRVLLEGWEKFDEPVDRIVS !------------------------------3333-----------3333---------- IGAFEHFGHQRYHHFFEVTHRTLPADGKMLLHTIVRPTFKEGREKGLTLTHELVHFTKFI ---11113333------------1111--------------------------------- LAEIFPGGWLPSIPTVHEYAEKVGFRVTAVQSLQLHYARTLDMWATALEANKDQAIAIQS ----2222------------1111---------3333----------------------3 QTVYDRYMKYLTGCAKLFRQGYTDVDQFTLEK 333----------------------------- >GLUTAMATE DEHYDROGENASE 1; SWP:P00367; PDB:1L1FA; DPNFFKMVEGFFDRGASIVEDKLVEDLRTRESEEQKRNRVRGILRIIKPCNHVLSLSFPI --3333-------------------------2222------------------------- RRDDGSWEVIEGYRAQHSQHRTPCKGGIRYSTDVSVDEVKALASLMTYKCAVVDVPFGGA --------------------------------------------------1111------ KAGVKINPKNYTDNELEKITRRFTMELAKKGFIGPGIDVPAPDMSTGEREMSWIADTYAS ------3333-----------------1111--1111-----3333-------------- TIGHYDINAHACVTGKPISQGGIHGRISATGRGVFHGIENFINEASYMSILGMTPGFGDK 1111-1111-------3333-----3333------------------------------- TFVVQGFGNVGLHSMRYLHRFGAKCIAVGESDGSIWNPDGIDPKELEDFKLQHGSILGFP ------------------1111--------------1111-------------------- KAKPYEGSILEADCDILIPAASEKQLTKSNAPRVKAKIIAEGANGPTTPEADKIFLERNI -------3333------------------3333----------------------1111- MVIPDLYLNAGGVTVSYFEWLKNLNHVSYGRLTFKYERDSNYHLLMSVQESLERKFGKHG ---3333--------------------2222----------------------------- GTIPIVPTAEFQDRISGASEKDIVHSGLAYTMERSARQIMRTAMKYNLGLDLRTAAYVNA ------------------3333-------------------------------------- IEKVFKVYNEAGVTFT ---------------- >HEAT SHOCK PROTEASE HTRA; SWP:Q9WZ41; PDB:1L1JA; DYESPIVNVVEACAPAVVKIDVVKTTSFFDPYFEQFFKKWFGELPPGFERQVASLGSGFI ------------3333-------------3333-------1111----3333-------- FDPEGYILTNYHVVGGADNITVTMLDGSKYDAEYIGGDEELDIAVIKIKASDKKFPYLEF -3333----3333----------3333-----------1111------------------ GDSDKVKIGEWAIAIGNPLGFQHTVTVGVVSATNRRIPKPDGSGYYVGLIQTDAAINPGN -3333-----------1111------------------3333--------------1111 SGGPLLNIHGEVIGINTAIVNPQEAVNLGFAIPINTVKKFLDTILT 
---------------------------------------3333--- >RIBONUCLEOSIDE TRIPHOSPHA; SWP:Q59490; PDB:1L1LA; EISLSAEFIDRVKASVKPHWGKLGWVTYKRTYARWLPEKGRSENWDETVKRVVEGNINLD ----3333------------1111------------1111---------------11113 PRLQDSPSLELKQSLTEEAERLYKLIYGLGATPSGRNLWISGTDYQRRTGDSLNNCWFVA 333----------------------1111----------22223333---1111------ IRPQKYGDSKIVPSYLGKQEKAVSMPFSFLFDELMKGGGVGFSVARSNISQIPRVDFAID ------------11111111------------------------33331111-------- LQLVVDETSESYDASVKVGAVGKNELVQDADSIYYRLPDTREGWVLANALLIDLHFAQTN -----3333------1111--3333---1111-------3333------------33331 PDRKQKLILDLSDIRPYGAEIHGFGGTASGPMPLISMLLDVNEVLNNKAGGRLTAVDAAD 111-------1111-2222-----------3333----------3333------------ ICNLIGKAVVAGNAELALGSNDDQDFISMKQDQEKLMHHRWASNNSVAVDSAFSGYQPIA ---------!!!!------1111----1111------------------3333------- AGIRENGEPGIVNLDLSKNYGRIVDGYQAGIDGDVEGTNPCGEISLANGEPCNLFEVFPL ---------------------3333--22221111---1111----2222---------- IAEEQGWDLQEVFALAARYAKRVTFSPYDWEISREIIQKNRRIGISMSGIQDWLLTRLGN --1111-3333---------3333------------------------------------ RVVTGFKDDFDPETHEAIKVPVYDKRAIKMVDQLYKAVVKADQDYSKTLGCNESIKHTTV ------------------------------------------------------------ KPSGTVAKLAGASEGMHFHYGAYLIQRIRFQDSDPLLPALKACGYRTEADIYTENTTCVE ---3333------!!!!-------------1111------1111---------------- FPIKAVGADNPNFASAGTVSIAEQFATQAFLQTYWSDNAVSCTITFQDSEGDQVESLLRQ ----2222-1111-3333----------------------------11111111----11 YRFITKSTSLLPYFGGSLQQAPKEPIDKETYEKRSQEITGNVEEVFSQLNSDVKDLE 11--------------------------------3333------------------- >GENOME POLYPROTEIN: PICOR; SWP:P03300; PDB:1L1NA; GPGFDYAVAMAKRNIVTATTSKGEFTMLGVHDNVAILPTHASPGESIVIDGKEVEILDAK -------------------1111--------------1111-------iiii-------- ALEDQAGTNLEITIITLKRNEKFRDIRPHIPTQITETNDGVLIVNTSKYPNMYVPVGAVT ---1111------------------3333----------------1111----------- EQGYLNLGGRQTARTLMYNFPTRAGQCGGVITCTGKVIGMHVGGNGSHGFAAALKRSYFT -------------------------2222---2222------------------3333-- >Replication protein A 70 
; SWP:P27694; PDB:1L1OC; NTNWKTLYEVKSENLGQGDKPDYFSSVATVVYLRKENCMYQACPTQDCNKKVIDQQNGLY ----------3333------------------------------2222------------ RCEKCDTEFPNFKYRMILSVNIADFQENQWVTCFQESAEAILGQNAAYLGELKDKNEQAF --------------------------------------------------3333------ EEVFQNANFRSFIFRVRVKVETYIKATVMDVKPVDYREYGRRLVMSIRRSALM ----3333---------------------------3333-------------- >TRIGGER FACTOR; SWP:P0A850; PDB:1L1PA; GSHMQATWKEKDGAVEAEDRVTIDFTGSVDGEEFEGGKASDFVLAMGQGRMIPGFEDGIK --------------------------------------------2222---3333----- GHKAGEEFTIDVTFPEEYHAENLKGKAAKFAINLKKVEERELPELT --------------3333---------------------------- >ADENINE PHOSPHORIBOSYLTRA; SWP:Q967M2; PDB:1L1QA; TMSVADAHALIKTIPDFPTKGIAFKDLSDILSTPAALDAVRKEVTAHYKDVPITKVVGIE -------1111-------2222----3333---------------1111----------3 SRGFILGGIVANSLGVGFVALRKAGKLPGDVCKCTFDMEYQKGVTIEVQKRQLGPHDVVL 333-------------------2222----------------------3333-1111--- LHDDVLATGGTLLAAIELCETAGVKPENIYINVLYEIEALKGREKVGQKCTRLFSVIREH ------------------------1111--------3333-------------------- H - >HYPOTHETICAL PROTEIN MTH1; SWP:O27535; PDB:1L1SA; DYRVVFHIDEDDESRVLLLISNVRNLADLESVRIEVVAYSGVNVLRRDSEYSGDVSELTG ---------------------------------------------1111---------11 QGVRFCACSNTLRASGDGDDLLEGVDVVSSGVGHIVRRQTEGWAYIRP 11----------11113333-2222-------------1111------ >MUTM; SWP:P84131; PDB:1L1TA; PELPEVETIRRTLLPLIVGKTIEDVRIFWPNIIRHPRDSEAFAARMIGQTVRGLERRGKF ------------33332222--------3333-----------1111---------!!!! 
LKFLLDRDALISHLRMEGRYAVASALEPLEPHTHVVFCFTDGSELRYRDVRKFGTMHVYA -----------------------1111-----------1111------1111-------3 KEEADRRPPLAELGPEPLSPAFSPAVLAERAVKTKRSVKALLLDQTVVAGFGNIYVDESL 3331111--1111--1111------------------------1111------------- FRAGILPGRPAASLSSKEIERLHEEMVATIGEAVMKGGSFQHHLYVYGRQGNPCKRCGTP -----11111111--------------------1111--1111--2222----------- IEKTVVAGRGTHYCPRCQR -----%%%%---------- >CELLOBIOHYDROLASE; SWP:P38686; PDB:1L1YA; GPTKAPTKDGTSYKDLFLELYGKIKDPKNGYFSPDEGIPYHSIETLIVEAPDYGHVTTSE -------2222--------------3333-----------------------1111---- AFSYYVWLEAMYGNLTGNWSGVETAWKVMEDWIIPDSTEQPGMSSYNPNSPATYADEYED ------------------3333-------------33332222---1111---------1 PSYYPSELKFDTVRVGSDPVHNDLVSAYGPNMYLMHWLMDVDNWYGFGTGTRATFINTFQ 111------------------------------------1111-----!!!!-------- RGEQESTWETIPHPSIEEFKYGGPNGFLDLFTKDRSYAKQWRYTNAPDAEGRAIQAVYWA -11111111-----------------1111---------------3333----------- NKWAKEQGKGSAVASVVSKAAKMGDFLRNDMFDKYFMKIGAQDKTPATGYDSAHYLMAWY ----1111-----------------------------2222--------1111------- TAWGGGIGASWAWKIGCSHAHFGYQNPFQGWVSATQSDFAPKSSNGKRDWTTSYKRQLEF --------------------3333-----------3333--------------------- YQWLQSAEGGIAGGATNSWNGRYEKYPAGTSTFYGMAYVPHPVYADPGSNQWFGFQAWSM -11113333--------2222-----2222--iiii-----------1111--------- QRVMEYYLETGDSSVKNLIKKWVDWVMSEIKLYDDGTFAIPSDLEWSGQPDTWTGTYTGN -----------1111-----------------1111-----------------------1 PNLHVRVTSYGTDLGVAGSLANALATYAAATERWEGKLDTKARDMAAELVNRAWYNFYCS 111-------------------------------------------------------11 EGKGVVTEEARADYKRFFEQEVYVPAGWSGTMPNGDKIQPGIKFIDIRTKYRQDPYYDIV 11------------3333------2222---1111---22223333--3333-1111--- YQAYLRGEAPVLNYHRFWHEVDLAVAMGVLATYFPDMTYKVP ---1111--------------------------1111----- >ADP-DEPENDENT GLUCOKINASE; SWP:O58328; PDB:1L2LA; WESLYEKALDKVEASIRKVRGVLLAYNTNIDAIKYLKREDLEKRIEKVGKEEVLRYSEEL --------------1111-----------------------------------3333--- PKEIETIPQLLGSILWSIKRGKAAELLVVSREVREYMRKWGWDELRMGGQVGIMANLLGG 
-----------------1111--------------------------------------- VYGIPVIAHVPQLSELQASLFLDGPIYVPTRLIHPREFEDCIHYIYEFPRNFKVLDFEAP -------------33331111------------3333-----------2222-!!!!--- RENRFIGAADDYNPILYVREEWIERFEEIAKRSELAIISGLHPLTQENHGKPIKLVREHL -----------3333---3333-------1111-------11113333------------ KILNDLGIRAHLEFAFTPDEVVRLEIVKLLKHFYSVGLNEVELASVVSVMGEKELAERII ---1111----------------------1111--------------1111--------- SKDPADPIAVIEGLLKLIKETGVKRIHFHTYGYYLALTREKGEHVRDALLFSALAAATKA ------------------------------------------------------------ MKGNIEKLSDIREGLAVPIGEQGLEVEKILEKEFSLRDGIGSIEDYQLTFIPTKGIGDTI ------3333--3333------------3333----iiii--iiii-------------- SSSAFVSEFSLH -------3333- >REP PROTEIN; SWP:P27260; PDB:1L2MA; SGRFSIKAKNYFLTYPKCDLTKENALSQITNLQTPTNKLFIKICRELHENGEPHLHILIQ -----------------------------------------------1111--------- FEGKYNCTNQRFFDLVSPTRSAHFHPNIQGAKSSSDVKSYIDKDGDVLEWGTFQIDGR ----------1111--1111-------------3333--------------------- >ATP SYNTHASE B CHAIN; SWP:P00859; PDB:1L2PA; TDQLKKAKAEAQVIIEQANKRRSQILDEAKAEAEQERTKIVAQAQAEIEAERKRAREELR ---------------------------------------------------------111 K 1 >Hypothetical ABC transpor; SWP:Q58206; PDB:1L2TA; MIKLKNVTKTYKMGEEIIYALKNVNLNIKEGEFVSIMGPSGSGKSTMLNIIGCLDKPTEG ----------------------------2222------2222------------------ EVYIDNIKTNDLDDDELTKIRRDKIGFVFQQFNLIPLLTALENVELPLIFKYRGAMSGEE ---iiii-1111----------------1111--1111------3333------------ RRKRALECLKMAELEERFANHKPNQLSGGQQQRVAIARALANNPPIILADQPTGALDSKT --------------3333---1111--------------1111-------1111------ GEKIMQLLKKLNEEDGKTVVVVTHDINVARFGERIIYLKDGEVEREEKLR ------------------------33331111------iiii-------- >OROTIDINE 5'-PHOSPHATE DE; SWP:P08244; PDB:1L2UA; AVTNSPVVVALDYHNRDDALAFVDKIDPRDCRLKVGKEMFTLFGPQFVRELQQRGFDIFL --------------3333----11113333-------------3333---1111------ DLKFHDIPNTAAHAVAAAADLGVWMVNVHASGGARMMTAAREALVPFGKDAPLLIAVTVL ----------------------------1111-----------3333------------1 
TSMEASDLVDLGMTLSPADYAERLAALTQKCGLDGVVCSAQEAVRFKQVFGQEFKLVTPG 1113333---------------------1111------3333------------------ IRPQIMTPEQALSAGVDYMVIGRPVTQSVDPAQTLKAINASLQR ---------------------3333--------------1111- >Outer membrane virulence ; SWP:P08008; PDB:1L2WI; SVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVAHSVIGFIQRMF 2222%%%%-------------1111-------3333---------3333-------- >P24: PLANT TRANSCRIPTIONA; SWP:Q9LL85; PDB:1L3AA; TPKVFVGYSIYKGKAALTVEPRSPEFSPLDSGAFKLSREGMVMLQFAPAAGVRQYDWSRK -----------1111-----------------------------------%%%%------ QVFSLSVTEIGSIISLGTKDSCEFFHDPNKGRSDEGRVRKVLKVEPLPDGSGHFFNLSVQ ------------11111111--------------------------1111---------- NKLINLDENIYIPVTKAEFAVLVSAFNFVMPYLLGWHTAVNSFKPE ------------------------------3333-----1111--- >Histone acetyltransferase; SWP:Q09472; PDB:1L3EB; MGSGAHTADPEKRKLIQQQLVLLLHAHKCQRREQANGEVRQCNLPHCRTMKNVLNHMTHC --------3333--------------------3333------------------------ QSGKSCQVAHCASSRQIISHWKNCTRHDCPVCLPLKNAGDK -------3333---------------------3333----- >Precorrin-6y methyltransf; SWP:O26249; PDB:1L3IA; IPDDEFIKNPSVPGPTAEVRCLICLAEPGKNDVAVDVGCGTGGVTLELAGRVRRVYAIDR -1111---1111----------------1111------!!!!-----1111--------- NPEAISTTENLQRHGLGDNVTLEGDAPEALCKIPDIDIAVVGGSGGELQEILRIIKDKLK 3333-------1111-1111--------3333-----------iiii------------2 PGGRIIVTAILLETKFEAECLRDLGFDVNITELNIARGRALDRGTVSRNPVALIYTGV 222------------------1111---------------iiii-------------- >HETEROGENEOUS NUCLEAR RIB; SWP:P09651; PDB:1L3KA; KEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFVTYATVEE --3333--------3333---------1111----------------------------- VDAAMNARPHKVDGRVVEPKRAVSTVKKIFVGGIKEDTEEHHLRDYFEQYGKIEVIEIMT ----1111---iiii-------------------------------1111---------- DRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKAL -----------------------1111----iiii-------- >TRANSCRIPTIONAL ACTIVATOR; SWP:P33905; PDB:1L3LA; QHWLDKLTDLAAIEGDECILKTGLADIADHFGFTGYAYLHIQHRHITAVTNYHRQWQSTY ---------1111---------------1111---------------------------- 
FDKKFEALDPVVKRARSRKHIFTWSGEHERPTLSKDERAFYDHASDFGIRSGITIPIKTA 11111111----------------3333-3333---------3333-------------i NGFSFTASDKPVIDLDREIDAVAAAATIGQIHARISFLRTTPTAEDAAWLDPKEATYLRW iii--------------------------------1111--------------------- IAVGKTEEIADVEGVKYNSVRVKLREAKRFDVRSKAHLTALAIRRKLI 1111-----------3333--------1111-----------1111-- >POLLEN ALLERGEN PHL P 5B; SWP:Q40963; PDB:1L3PA; IPAGELQIIDKIDAAFKVAATAAATAPADDKFTVFEAAFNKAIKETTGGAYDTYKCIPSL -33331111-------------11113333----------------iiii--3333---- EAAVKQAYAATVAAAPQVKYAVFEAALTKAITAMSEVQKVSQ --------1111--3333------------------------ >PLATELET AGGREGATION INHI; SWP:Q90WC0; PDB:1L3XA; EAGEECDCGSPGNPCCDAATCKLRQGAQCAEGLCCDQCRFMKEGTICRRARGDDLDDYCN -----1111-----------------------------------------!!!!------ GISAGCPRNPFHA ------------- >INTEGRIN BETA-2:CYSTEINE-; SWP:P05107; PDB:1L3YA; ECDTINCERYNGQVCGGPGRGLCFCGKCRCHPGFEGSACQA -------------%%%%------2222-------------- >S-syntaxin; SWP:O46345; PDB:1L4AB; GKSASGIIMETQQAKQTLADIEARHADIMKLETSIRELHDMFMDMAMLVESQGEMIDRIE --!!!!--------------------------------------------3333------ YNVEAAVDYIETAKVDTKKAVK 1111--333311111111---- >Synaptosomal-associated p; SWP:Q8T3S4; PDB:1L4AC; KTELEEIQQQCNQVTDDSLESTRRMLNMCEESKEAGIRTLVMLDEQGEQLDRIEEGLDQI -----------------------------------------------3333-------33 NQDMKDAEKNLEG 33----------- >Synaptosomal-associated p; SWP:Q8T3S4; PDB:1L4AD; PSSGYVTRITNDAREDDMENNMKEVSSMIGNLRNMAIDMGNEIGSQNRQVDRIQQKAESN ---------------------------------------------------------111 ESRIDEANKKATKLL 1---11113333--- >Streptokinase C [Precurso; SWP:P00779; PDB:1L4DB; NNSQLVVSVAGTVEGTNQDISLKFFEIDLTSRPAMPHKLEKADLLKAIQEQLIANVHSND ------------2222-----------------------------------1111----- DYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYK --------1111---1111-----1111------------------------ >SFAE PROTEIN; SWP:P62609; PDB:1L4IA; GVALGATRVIYPEGQKQVQLAVTNNDDKSSYLIQSWIENAEGKKDARFVITPPLFSMQGK -----------2222----------1111---------1111------------------ 
KENTLRIIDATNGQMPEDRESLFWVNVKAIPAMDLQFAIVSRIKLLYRPQGLVIPPEQAP ----------iiii----------------------------------------333311 GKLEFTRELTLFNPTPYYLTVTDLKAGNKSLENTMVPPQGKVTVNIGGDITYKTINDYGA 11----------------------------------2222---------------1111- LTEQVRGVV --------- >AMIDE SYNTHASE; SWP:Q9KTV9; PDB:1L5AA; MLLAQKPFWQRHLAYPHINLDTVAHSLRLTGPLDTTLLLRALHLTVSEIDLFRARFSAQG -3333---------1111--------------------------3333--1111--1111 ELYWHPFSPPIDYQDLSIHLEAEPLAWRQIEQDLQRSSTLIDAPITSHQVYRLSHSEHLI ---------------3333----------------------------------1111--- YTRAHHIVLDGYGMMLFEQRLSQHYQSLLSGQTPTAAFKPYQSYLEEEAAYLTSHRYWQD ----1111-------------------------------3333----------------- KQFWQGYLREAPDLTLTSATYDPQLSHAVSLSYTLNSQLNHLLLKLANANQIGWPDALVA -----------------11113333----------3333--------1111--------- LCALYLESAEPDAPWLWLPFMNRWGSVAANVPGLMVNSLPLLRLSAQQTSLGNYLKQSGQ ---------1111---------2222---------------------------------- AIRSLYLHGRYRIEQIEQDQGLNAEQSYFMSPFINILPFESPHFADCQTELKVLASGSAE -------1111------1111-1111---------------------------------- GINFTFRGSPQHELCLDITADLASYPQSHWQSHCERFPRFFEQLLARFQQVEQDVARLLA --------3333--------1111------------------------------------ EPAA ---- >NEUROPHYSIN 1; SWP:P01175; PDB:1L5CA; AVLDLDVRTCLPCGPGGKGRCFGPSICCGDELGCFVGTAEALRCQEENYLPSPCQSGQKP ----------------------------1111-----1111--1111------------- CGSGGRCAAAGICCSPDGCEEDPACDPEAAFS --------------1111---3333------- >ACONITATE HYDRATASE 2; SWP:P36683; PDB:1L5JA; MLEEYRKHVAERAAEGIAPKPLDANQMAALVELLKNPPAGEEEFLLDLLTNRVPPGVDEA ------------1111---------------------2222-------------!!!!-- AYVKAGFLAAIAKGEAKSPLLTPEKAIELLGTMQGGYNIHPLIDALDDAKLAPIAAKALS -----------------1111--------------11113333-------3333---333 HTLLMFDNFYDVEEKAKAGNEYAKQVMQSWADAEWFLNRPALAEKLTVTVFKVTGETNTD 3---!!!!-------1111----------------1111--------------------3 DLSPAPDAWSRPDIPLHALAMLKNAREGIEPDQPGVVGPIKQIEALQQKGFPLAYVGDVV 33333331111--33331111----2222---2222------------------------ GTGSSRKSATNSVLWFMGDDIPHVPNKRGGGLCLGGKIAPIFFNTMEDAGALPIEVDVSN 
--------------------2222---------------------------------111 LNMGDVIDVYPYKGEVRNHETGELLATFELKTDVLIDEVRAGGRIPLIIGRGLTTKAREA 12222-----1111-----------------3333-----------------------11 LGLPHSDVFRQAKDVAESDRGFSLAQKMVGRACGVKGIRPGAYCEPKMTSVGSQDTTGPM 11----------------------------1111----2222-----------1111--- TRDELKDLACLGFSADLVMQSFCHTAAYPKPVDVNTHHTLPDFIMNRGGVSLRPGDGVIH -----1111-----------------------------------1111----2222-333 SWLNRMLLPDTVGTGGDSHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFKGKM 33333-----------1111---------------------------------------- QPGITLRDLVHAIPLYAIKQGLLTVEKKGKKNIFSGRILEIEGLPDLKVEQAFELTDASA 22223333------------------2222-3333--------11113333------333 ERSAAGCTIKLNKEPIIEYLNSNIVLLKWMIAEGYGDRRTLERRIQGMEKWLANPELLEA 3-----------------------------1111----------------3333------ DADAEYAAVIDIDLADIKEPILCAPNDPDDARPLSAVQGEKIDEVFIGSCMTNIGHFRAA 1111--------3333-------2222-----33332222--------11113333---- GKLLDAHKGQLPTRLWVAPPTRMDAAQLTEEGYYSVFGKSGARIEIPGCSLCMGNQARVA ---3333-----------------------------------------!!!!-------2 DGATVVSTSTRNFPNRLGTGANVFLASAELAAVAALIGKLPTPEEYQTYVAQVDKTAVDT 222---------2222-2222------------------------------3333----- YRYLNFNQLSQYTEKADGVIFQ ----11113333---1111--- >Nicotinate-nucleotide--di; SWP:Q05603; PDB:1L5OA; LHALLRDIPAPDAEAMARTQQHIDGLLKPPGSLGRLETLAVQLAGMPGLNGTPQVGEKAV ----1111--------------1111--2222----------------%%%%-------- LVMCADHGVWDEGVAVSPKIVTAIQAANMTRGTTGVCVLAAQAGAKVHVIDVGIDAEPIP ------3333-------3333-----3333----------1111--------------22 GVVNMRVARGCGNIAVGPAMSRLQAEALLLEVSRYTCDLAQRGVTLFGVGELGMANTTPA 22----------3333---------------------3333-----------2222---- AAMVSVFTGSDAKEVVGIGANLPPSRIDNKVDVVRRAIAINQPNPRDGIDVLSKVGGFDL ----------3333--------3333-----------------1111------------- VGMTGVMLGAARCGLPVLLDGFLSYSAALAACQIAPAVRPYLIPSHFSAEKGARIALAHL ----------------------------------33331111-------1111----111 SMEPYLHMAMRLGEGSGAALAMPIVEAACAMFHNMGELAASNIVLP 1-----------------------------------3333------ >FERREDOXIN; SWP:P21149; PDB:1L5PA; 
GTITAVKGGVKKQLKFEDDQTLFTVLTEAGLMSADDTCQGNKACGKCICKHVSGKVAAAE ------iiii------2222------1111---2222%%%%------------------3 DDEKEFLEDQPANARLACAITLSGENDGAVFEL 3331111---1111-1111---3333------- >MALTODEXTRIN PHOSPHORYLAS; SWP:P00490; PDB:1L5WA; SQPIFNDKQFQEALSRQWQRYGLNSAAEMTPRQWWLAVSEALAEMLRAQPFAKPVANQRH ------------------1111--3333-----------------1111----------- VNYISMEFLIGRLTGNNLLNLGWYQDVQDSLKAYDINLTDLLEEEIDPALGNGGLGRLAA ------------------------------3333-------------------------- CFLDSMATVGQSATGYGLNYQYGLFRQSFVDGKQVEAPDDWHRSNYPWFRHNEALDVQVG -----------------------------iiii--------33331111--3333----- IGGKVTKDGRWEPEFTITGQAWDLPVVGYRNGVAQPLRLWQATHAHPFDLTKFNDGDFLR -----1111-------------------------------------------1111-333 AEQQGINAEKLTKVLYPNDNAFEGKKLRLMQQYFQCACSVADILRRHHLAGRKLHELADY 3--3333----------------------------------------1111-33331111 EVIQLNDTHPTIAIPELLRVLIDEHQMSWDDAWAITSKTFAYTNHTLMPEALERWDVKLV -------1111-----------------------1111---------3333--------- KGLLPRHMQIINEINTRFKTLVEKTWPGDEKVWAKLAVVHDKQVHMANLCVVGGFAVNGV -------------------------2222----------%%%%----------------- AALHSDLVVKDLFPEYHQLWPNKFHNVTNGITPRRWIKQCNPALAALLDKSLQKEWANDL -------------------1111------------------------------------- DQLINLEKFADDAKFRQQYREIKQANKVRLAEFVKVRTGIEINPQAIFDIQIKRLHEYKR -------------------------------------------------------3333- QHLNLLHILALYKEIRENPQADRVPRVFLFGAKAAPGYYLAKNIIFAINKVADVINNDPL -----------------1111-------------1111-------------------333 VGDKLKVVFLPDYCVSAAEKLIPAADISEQISTAGKEASGTGNMKLALNGALTVGTLDGA 3------------3333---3333--------2222----------1111-------!!! 
NVEIAEKVGEENIFIFGHTVEQVKAILAKGYDPVKWRKKDKVLDAVLKELESGKYSDGDK !-------1111-------------------3333-1111------------1111--11 HAFDQMLHSIGKQGGDPYLVMADFAAYVEAQKQVDVLYRDQEAWTRAAILNTARCGMFSS 11---------111111113333-------------------------------3333-- DRSIRDYQARIWQAKR ----------1111-- >SURVIVAL PROTEIN E; SWP:Q8ZU79; PDB:1L5XA; KILVTNDDGVHSPGLRLLYQFALSLGDVDVVAPESPKSATGLGITLHKPLRYEVDLCGFR --------1111---------1111----------------------------------- AIATSGTPSDTVYLATFGLGRKYDIVLSGINLGDNTSLQVILSSGTLGAAFQAALLGIPA ------3333-----------------------------3333----------1111--- LAYSAYLENWNELLNNKEAVEIGAVVSSTASYVLKNGPQGVDVISVNFPRRLGRGVRAKL -----------3333----------------------2222-----------1111---- VKAAKLRYAQQVVERVDPRGVRYYWLYGRDLAPEPETDVYVVLKEGGIAITPLTLNLNAV ----------------1111-------------------------------------333 DAHREVDDSLNRVEYINASLSKLAAALEHH 3----------------------------- >NON-SPECIFIC LIPID TRANSF; SWP:P83210; PDB:1L6HA; AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSS ------3333---1111--1111--------3333--------1111-----3333---- CGIALPTCH --------- >MATRIX METALLOPROTEINASE-; SWP:P14780; PDB:1L6JA; VLFPGDLRTNLTDRQLAEEYLYRYGYTLGPALLLLQKQLSLPETGELDSATLKAMRTPRC --2222---------------1111----------------------------1111--- GVPDLGRFQTFEGDLKWHHHNITYWIQNYSEDLPRAVIDDAFARAFALWSAVTPLTFTRV -----------------------------33333333-----------3333-------- YSRDADIVIQFGVAEHGDGYPFDGKDGLLAHAFPPGPGIQGDAHFDDDELWSLGKGVVVP -3333------------------2222---------!!!!-------------------- TRFGNADGAACHFPFIFEGRSYSACTTDGRSDGLPWCSTTANYDTDDRFGFCPSERLYTR -----iiii--------------------------------3333--------1111--- DGNADGKPCQFPFIFQGQSYSACTTDGRSDGYRWCATTANYDRDKLFGFCPTRADSTVMG ---iiii-------iiii------2222------------------------3333---- GNSAGELCVFPFTFLGKEYSTCTSEGRGDGRLWCATTSNFDSDKKWGFCPDQGYSLFLVA -------------iiii------2222--------------------------------- AHEFGHALGLDHSSVPEALMYPMYRFTEGPPLHKDDVNGIRHLYG -----1111-----1111--------------------------- >THIOL:DISULFIDE INTERCHAN; SWP:P36655; PDB:1L6PA; 
DAPGRSQFVPADQAFAFDFQQNQHDLNLTWQIKDGYYLYRKQIRITPEHAKIADVQLPQG ---------1111--------!!!!-------2222--3333------------------ VWHEDEFYGKSEIYRDRLTLPVTINQASAGATLTVTYQGCADAGFCYPPETKTVPLSEVV ---------------------------2222---------3333---------------- A - >HYPOTHETICAL PROTEIN TA01; SWP:NA; PDB:1L6RA; HMIRLAAIDVDGNLTDRDRLISTKAIESIRSAEKKGLTVSLLSGNVIPVVYALKIFLGIN ---------------1111-------------1111------------------------ GPVFGENGGIMFDNDGSIKKFFSNEGTNKFLEEMSKRTSMRSILTNRWREASTGFDIDPE ----%%%%----1111-----------------3333-----3333-----------333 DVDYVRKEAESRGFVIFYSGYSWHLMNRGEDKAFAVNKLKEMYSLEYDEILVIGDSNNDM 3-----------------!!!!----2222---------------1111------1111- PMFQLPVRKACPANATDNIKAVSDFVSDYSYGEEIGQIFKHFELM -----------1111----1111------iiii------1111-- >PORPHOBILINOGEN SYNTHASE; SWP:P15002; PDB:1L6SA; TDLIQRPRRLRKSPALRAMFEETTLSLNDLVLPIFVEEEIDDYKAVEAMPGVMRIPEKHL -----3333----------------3333----------------3333------3333- AREIERIANAGIRSVMTFGISHHTDETGSDAWREDGLVARMSRICKQTVPEMIVMSDTCF -------1111-----------------3333----------------1111-------1 CEYTSHGHCGVLEHGVDNDATLENLGKQAVVAAAAGADFIAPSAAMDGQVQAIRQALDAA 1111111-------------------------1111---------2222--------111 GFKDTAIMSYSTKFASSFYGPFREAAGSALKGDRKSYQMNPMNRREAIRESLLDEAQGAD 11111----------------------------1111--1111-----------1111-- CLMVKPAGAYLDIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIK ------1111------1111--------------------1111---------------- RAGADLIFSYFALDLAEKKILR --------1111---------- >FRUCTOSE-6-PHOSPHATE ALDO; SWP:P78055; PDB:1L6WA; MELYLDTSDVVAVKALSRIFPLAGVTTNPSIIAAGKKPLDVVLPQLHEAMGGQGRLFAQV --------------------------------1111-3333------------------- MATTAEGMVNDALKLRSIIADIVVKVPVTAEGLAAIKMLKAEGIPTLGTAVYGAAQGLLS ------------------1111-----------------1111----------------- ALAGAEYVAPYVNRIDAQGGSGIQTVTDLHQLLKMHAPQAKVLAASFKTPRQALDCLLAG 1111-----------1111-----------------1111--------3333-------- CESITLPLDVAQQMISYPAVDAAVAKFEQDWQGAFGRTSI ---------------------------------------- ---------------------------------- 
>BILIARY GLYCOPROTEIN C; SWP:Q61353; PDB:1L6ZA; EVTIEAVPPQVAEDNNVLLLVHNLPLALGAFAWYKGNTTAIDKEIARFVPNSNMNFTGQA ---------------------------------------3333----------------- YSGREIIYSNGSLLFQMITMKDMGVYTLDMTDENYRRTQATVRFHVHQPVTQPFLQVTNT -------1111-------3333-------------------------------------- TVKELDSVTLTCLSNDIGANIQWLFNSQSLQLTERMTLSQNNSILRIDPIKREDAGEYQC --2222------------------%%%%----1111--------------3333------ EISNPVSVRRSNSIKLDIIFDPS ----------------------- >LYSOZYME; SWP:P00720; PDB:1L75; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNCNGVITK -------------------1111------------------------------%%%%--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRCALINMVFQMGETGVAGFTNSLRM ---------------------------11113333------------------------- LQQKRWAAAAAAAAKSRWYNQTPNRAKRVITTFRTGTWDAYK -------------------------------------3333- >CEPHALOSPORIN C DEACETYLA; SWP:P94388; PDB:1L7AA; MQLFDLPLDQLQTYKPEKTAPKDFSEFWKLSLEELAKVQAEPDLQPVDYPADGVKVYRLT ----------1111------1111------------------------------------ YKSFGNARITGWYAVPDKEGPHPAIVKYHGYNASYDGEIHEMVNWALHGYATFGMLVRGQ --2222-----------------------2222-iiii-------1111-------2222 QRSEDTSISPHGHALGWMTKGILDKDTYYYRGVYLDAVRALEVISSFDEVDETRIGVTGG ----------------111111111111------------------11111111------ SQGGGLTIAAAALSDIPKAAVADYPYLSNFERAIDVALEQPYLEINSFFRRNGSPETEVQ ----------------------------3333--------3333---------3333--- AMKTLSYFDIMNLADRVKVPVLMSIGLIDKVTPPSTVFAAYNHLETKKELKVYRYFGHEY ----333333331111---------1111---3333----1111---------------- IPAFQTEKLAFFKQILKG 3333-------------- >DNA LIGASE; SWP:P26996; PDB:1L7BA; MEKGGEALKGLTFVITGELSRPREEVKALLRRLGAKVTDSVSRKTSYLVVGENPGSKLEK -----------------------3333--------------------------------3 ARALGVPTLTEEELYRLLEARTGKKAEELVGS 333--------------3333---3333---- >nicotinamide nucleotide T; SWP:P0C186; PDB:1L7DA; MKIAIPKERRPGEDRVAISPEVVKKLVGLGFEVIVEQGAGVGASITDDALTAAGATIAST ---------2222-------------1111-----22223333-------1111-----3 AAQALSQADVVWKVQRPMTAEEGTDEVALIKEGAVLMCHLGALTNRPVVEALTKRKITAY 
333-1111----------1111--3333--2222------3333---------------- AMELMPRISRAQSMDILSSQSNLAGYRAVIDGAYEFARAFPMMMTAAGTVPPARVLVFGV 3333---33331111-----------------------------1111------------ GVAGLQAIATAKRLGAVVMATDVRAATKEQVESLGGKFITVKKQAEAVLKELVKTDIAIT -----------------------3333----1111-------3333----3333------ TALIPGKPAPVLITEEMVTKMKPGSVIIDLAVEAGGNCPLSEPGKIVVKHGVKIVGHTNV -------------333311112222---1111-----11112222---iiii------33 PSRVAADASPLFAKNLLNFLTPHVDKDTKTLVMKLEDETVSGTCVTRDGAIVHP 33--1111-----------3333----------1111-3333----%%%%---- >chimera of Fab2C4: "human; SWP:NA; PDB:1L7IH; EVQLVESGGGLVQPGGSLRLSCAASGFTFTDYTMDWVRQAPGKGLEWVADVNPNSGGSIY ------------2222-----------1111--------2222----------------- NQRFKGRFTLSVDRSKNTLYLQMNSL -1111--------------------- >PA-I GALACTOPHILIC LECTIN; SWP:Q05097; PDB:1L7LA; AWKGEVLANNEAGQVTSIIYNPGDVITIVAAGWASYGPTQKWGPQGDREHPDQGLICHDA ------1111----------2222------------------11111111------1111 FCGALVMKIGNSGTIPVNTGLFRWVAPNNVQGAITLIYNDVPGTYGNNSGSFSVNIGKDQ 2222----!!!!----!!!!------2222----------22221111------------ S - >PHOSPHOSERINE PHOSPHATASE; SWP:Q58989; PDB:1L7MA; KKKKLILFDFDSTLVNNETIDEIAREAGVEEEVKKITKEAEGKLNFEQSLRKRVSLLKDL ---------2222---------3333----------------------------1111-- PIEKVEKAIKRITPTEGAEETIKELKNRGYVVAVVSGGFDIAVNKIKEKLGLDYAFANRL 3333----1111--2222--------------------3333-----1111--------- IVKDGKLTGDVEGEVLKENAKGEILEKIAKIEGINLEDTVAVGDGANDISFKKAGLKIAF --%%%%----------1111--------------3333------1111--1111------ CAKPILKEKADICIEKRDLREILKYIK ---3333--------------3333-- >ANTI-TESTOSTERONE (LIGHT ; SWP:NA; PDB:1L7TH; EVKLVESGGGLVKPGGSLKLSCAASGFTFSRYALSWVRQTADKRLEWVASIVSGGNTYYS ------------2222-----------3333--------1111--------1111----1 GSVKGRFTISRDIARNILYLQMSSLRSEDTAMYYCARAYYGYVGLVHWGQGTLVTVSSAK 111--------3333----------1111------------------------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%-------------%%%%- TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC 
--------1111-----------3333------------- >If kappa light chain [Fra; SWP:A2NHM3; PDB:1L7TL; DVVVTQTPLSLPVSLGDQASISCRSSEVIVTRNGYTPIEWYLQKPGQSPKLLIYKAYKRF -------------2222--------------------------2222------------2 PGVPDRFSGSGSGTDFTLKISRVEAEDLGVYYCFDGSTVPPKFGGGTKLEIKRADAAPTV 2223333----!!!!--------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ----------------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRDEC --------3333----------1111--------1111- >Vitamin B12 import ATP-bi; SWP:P06611; PDB:1L7VC; SIVQLQDVAESTRLGPLSGEVRAGEILHLVGPNGAGKSTLLARAGTSGKGSIQFAGQPLE ----------3333-----------------2222----------------------333 AWSATKLALHRAYLSQQQTPPFATPVWHYLTLHQHDKTRTELLNDVAGALALDDKLGRST 3-3333------------------3333--1111-----------------1111---11 NQLSGGEWQRVRLAAVVLQITPQANPAGQLLLLDEPNSLDVAQQSALDKILSALCQQGLA 11--------------333333331111--------------------------1111-- IVSSHDLNHTLRHAHRAWLLKGGKLASGRREEVLTPPNLAQAYG ----------------------------3333------------ >HYPOTHETICAL PROTEIN ZK65; SWP:P34661; PDB:1L7YA; MSGGTAATTAGSKVTFKITLTSDPKLPFKVLSVPESTPFTAVLKFAAEEFKVPAATSAII ---------------------------------3333---------------3333---- TNDGVGVNPAQPAGNIFLKHGSELRLIPRDRVGH 3333------------------------------ >EUKARYOTIC TRANSLATION IN; SWP:P20415; PDB:1L8BA; KHPLQNRWALWFFKNDKSKTWQANLRLISKFDTVEDFWALYNHIQLSSNLMPGCDYSLFK -------------------3333----------------3333--3333-2222-----2 DGIEPMWEDEKNKRGGRWLITLNKQQRRSDLDRFWLETLLCLIGESFDDYSDDVCGAVVN 222--3333--1111-------1111---------------1111--1111--------- VRAKGDKIAIWTTECENRDAVTHIGRVYKERLGLPPKIVIGYQSHADTATKKNRFVV -3333-----------------------------1111------------------- >CREB-BINDING PROTEIN; SWP:P45481; PDB:1L8CA; ADPEKRKLIQQQLVLLLHAHKCQRREQANGEVRACSLPHCRTMKNVLNHMTHCQAGKACQ ------------------------------------3333---------------3333- VAHCASSRQIISHWKNCTRHDCPVCLPLKNASDKR -------------------------3333------ >DNA DOUBLE-STRAND BREAK R; SWP:P58301; PDB:1L8DA; 
KKLLEELETKKTTIEEERNEITQRIGELKNKIGDLKTAIEELKKAKGKCPVCGRELTDEH -----------------------------------------1111--------------- REELLSKYHLDLNNSKNTLAKLIDRKSELERELRRIDMEIKRL ------------------------------------------- >ENDOGLUCANASE; SWP:Q8J0K8; PDB:1L8FA; ANGQSTRYWDCCKPSCGWRGKGPVNQPVYSCDANFQRIHDFDAVSGCEGGPAFSCADHSP -------------1111--------------1111----1111-1111------1111-- WAINDNLSYGFAATALSGQTEESWCCACYALTFTSGPVAGKTMVVQSTSTGGDLGSNHFD ---1111--------222233332222----------2222------------------- LNIPGGGVGLFDGCTPQFGGLPGARYGGISSRQECDSFPEPLKPGCQWRFDWFQNADNPS --2222-------3333-------------333311113333--------1111------ FTFERVQCPEELVARTGCRRHDDGGFA --------33333333---1111---- >ENDOTHELIAL PROTEIN C REC; SWP:Q9UNN8; PDB:1L8JA; LQRLHMLQISYFRDPYHVWYQGNASLGGHLTHVLEGPDTNTTIIQLQPLQEPESWARTQS -------------1111--------iiii-------1111-------------------- GLQSYLLQFHGLVRLVHQERTLAFPLTIRCFLGCELPPEGSRAHVFFEVAVNGSSFVSFR --------------------------------------------------iiii------ PERALWQADTQVTSGVVTFTLQQLNAYNRTRYELREFLEDTCVQYVQKHI 1111---------------------------------------------- >T-CELL PROTEIN-TYROSINE P; SWP:P17706; PDB:1L8KA; IEREFEELDTQRRWQPLYLEIRNESHDYPHRVAKFPENRNRNRYRDVSPYDHSRVKLQNA 3333---------------3333------3333-3333-----1111--1111---2222 ENDYINASLVDIEEAQRSYILTQGPLPNTCCHFWLMVWQQKTKAVVMLNRIVEKESVKCA -----------3333----------11113333--------------------------- QYWPTDDQEMLFKETGFSVKLLSEDVKSYYTVHLLQLENINSGETRTISHFHYTTWPDFG ------------1111----------1111------------------------------ VPESPASFLNFLFKVRESGSLNPDHGPAVIHCSAGIGRSGTFSLVDTCLVLMEKGDDINI ------------------1111-------------------------------------- KQVLLNMRKYRMGLIQTPDQLRFSYMAIIEGAK ------1111----------------------- >L-3-PHOSPHOSERINE PHOSPHA; SWP:P78330; PDB:1L8LA; HSELRKLFYSADAVCFDVDSTVIREEGIDELAKICGVEDAVSEMTRRAMGGAVPFKAALT 3333-3333--------2222------3333------3333---1111------------ ERLALIQPSREQVQRLIAEQPPHLTPGIRELVSRLQERNVQVFLISGGFRSIVEHVASKL ----------------1111-------3333---3333------------------1111 
NIPATNVFANRLKFYFNGEYAGFDETQPTAESGGKGKVIKFLKEKFHFKKIIMIGDGATD --3333--------3333------------2222----------------------3333 MEACPPADAFIGFGGNVIRQQVKDNAKWYITDFVELLGELEE ---3333-----------1111-----------11113333- >ALPHA-D-GLUCURONIDASE; SWP:Q8VVD2; PDB:1L8NA; GYEPCWLRYERKDQYSRLRFEEIVAKRTSPIFQAAVEELQKGLRSMMEIEPQVVQEVNET ---!!!!-----1111-----------------------------------------111 ANSIWLGTLEDEEFERPLEGTLVHPEGYVIRSDVDPFRIYIIGKTDAGVLYGVFHFLRLL 1------1111----3333----1111-----------------3333------------ QMGENIAQLSIIEQPKNRLRMINHWDNMDGSIERGYAGRSIFFVDDQFVNQRIKDYARLL --------------------------1111-------------%%%%------------- ASVGINAISINNVNVHKTETKLITDHFLPDVAEVADIFRTYGIKTFLSINYASPIEIGGL 1111-----------3333-11111111----------1111-------1111------- PTADPLDPEVRWWWKETAKRIYQYIPDFGGFVVKADSEFRPGPFTYGRDHAEGANMLAEA ---1111-----------------1111--------%%%%-3333--------------- LAPFGGLVIWRCFVYNCQQDWRDRTTDRAKAAYDHFKPLDGQFRENVILQIKNGPMDFQV 3333---------------3333---3333-----3333----1111------------- REPVSPLFGAMPKTNQMMEVQITQEYTGQQKHLCFLIPQWKEVLDFDTYAKGKGSEVKKV ----3333--1111---------1111%%%%------------------------33331 IDGSLFDYRYSGIAGVSNIGSDPNWTGHTLAQANLYGFGRLAWNPDLSAEEIANEWVVQT 111------------------1111--1111------------1111------------- FGDDSQVVETISWMLLSSWRIYENYTSPLGVGWMVNPGHHYGPNVDGYEYSHWGTYHYAD ----------------------1111-iiii------------1111------------1 RDGIGVDRTVATGTGYTAQYFPENAAMYESLDTCPDELLLFFHHVPYTHRLHSGETVIQH 111-----------3333----------------3333-------1111-1111------ IYNTHFEGVEQAKQLRKRWEQLKGKIDEKRYHDVLERLTIQVEHAKEWRDVINTYFYRKS -------------------1111------------------------------------- GIDDQYGRKIY ---1111---- >CHROMOSOMAL REPLICATION I; SWP:O66659; PDB:1L8QA; DFLNPKYTLENFIVGEGNRLAYEVVKEALENLGSLYNPIFIYGSVGTGKTHLLQAAGNEA ---33333333---1111--------33332222-------------------------- KKRGYRVIYSSADDFAQAVEHLKKGTINEFRNYKSVDLLLLDDVQFLSGKERTQIEFFHI 1111---------------------3333---1111------3333-------------- FNTLYLLEKQIILASDRHPQKLDGVSDRLVSRFEGGILVEIELDNKTRFKIIKEKLKEFN 
----1111---------33332222-------1111------------------------ LELRKEVIDYLLENTKNVREIEGKIKLIKLKGFEGLERKERKERDKLQIVEFVANYYAVK -----------------------------------------------------------3 VEDILSDKRNKRTSEARKIAYLCRKVCSASLIEIARAFKRKDHTTVIHAIRSVEEEKKRK 333---------1111------------------------------------3333---- FKHLVGFLEKQAFDKIC ----------------- >DACHSHUND; SWP:Q9UI36; PDB:1L8RA; GSQNNECKVDLRGAKVASFTVEGCELICLPQAFDLFLKHLVGGLHTVYTKLKRLEITPVV -3333-----iiii------iiii------------------------------------ CNVEQVRILRGLGAIQPGVNRCKLISRKDFETLYNDCTNA ---------------1111--------------------- >VLSE1; SWP:NA; PDB:1L8WA; GGLVAEAFGFKSDPKKSDVKTYFTTVAAKLEKTKTDLNSLPTAVEGAIKEVSELLDKLVK ------------------------------------------------------------ AVKTAEGASSGTAAIGEVVADADAAKVADKASVKGIAKGIKEIVEAAGGSEKLKAVAAAK -----1111-----------3333-----3333--------------------------- GENNKGAGKLFGKAGAAAHGDSEAASKAAGAVSAVSGEQILSAIVTAADAAEQDGKKPEE -1111--1111---------3333----------------------1111------3333 AKNPIAAAIGDKDGGAEFGQDEKKDDQIAAAIALRGAKDGKFAVKDGEKEKAEGAIKGAA ----------3333---------3333---------2222----22221111-------- ESAVRKVLGAITGLIGDAVSSGLRKVGDS ----------------------------- >UPSTREAM BINDING FACTOR 1; SWP:P17480; PDB:1L8YA; GKLPESPKRAEEIWQQSVIGDYLARFKNDRVKALKAMEMTWNNMEKKEKLMWIKKAAEDQ --------------3333-----1111--1111--------1111--------------- KRYERELSEMRAPPAATNSSKKLE -----3333--1111--------- >RNA-DIRECTED RNA POLYMERA; SWP:P12823; PDB:1L9KA; ETLGEKWKSRLNALGKSEFQIYKKSGIQEVDRTLAKEGIKRGETDHHAVSRGSAKLRWFV ----------11111111-----2222--------------------------------- ERNLVTPEGKVVDLGCGRGGWSYYCGGLKNVREVKGLTKGGPGHEEPIPMSTYGWNLVRL ---------------!!!!--------------------------------2222----- QSGVDVFFIPPERCDTLLCDIGESSPNPTVEAGRTLRVLNLVENWLSNNTQFCVKVLNPY ----3333--------------------------------3333--2222---------- MSSVIEKMEALQRKHGGALVRNPLSRNSTHEMYWVSNASGNIVSSVNMISRMLINRFTMR 3333-----------------11111111-------------------------3333-- HKKATYEPDVDLGSGTRNIGI --------------------- >GRANULYSIN; SWP:P22749; PDB:1L9LA; 
GRDYRTCLTIVQKLKKMVDKPTQRSVSNAATRVCRTGRSRWRDVCRNFMRRYQSRVIQGL -------------1111----3333------1111------------------------1 VAGETAQQICEDLR 111------3333- >ROTAVIRUS-NSP2; SWP:Q03243; PDB:1L9VA; MAELACFCYPHLENDSYKFIPFNNLAIKAMLTAKVDKKDMDKFYDSIIYGIAPPPQFKKR ------------%%%%------3333---1111--3333--------------3333111 YNTNDNSRGMNFETIMFTKVAMLICEALNSLKVTQANVSNVLSRVVSIRHLENLVIRKEN 1---------1111------------3333--------------------------1111 PQDILFHSKDLLLKSTLIAIGQSKEIETTITAEGGEIVFQNAAFTMWKLTYLEHQLMPIL --3333-----------1111-------1111--------1111-----3333------- DQNFIEYKVTLNEDKPISDVHVKELVAELRWQYNKFAVITHGKGHYRIVKYSSVANHADR 1111----------------------------1111-------------1111------- VYATFKSNVKTGVNNDFNLLDQRIIWQNWYAFTSSMKQGNTLDVCKRLLFQKMKPEKNPF ---------2222-------1111-----------1111-3333---------------- KGLSTDRKMDEVS ------------- >GAMMA-GLUTAMYL HYDROLASE; SWP:Q92820; PDB:1L9XA; AKKPIIGILMQKCRNKVMKNYGRYYIAASYVKYLESAGARVVPVRLDLTEKDYEILFKSI --------------3333----------------1111---------------------- NGILFPGGSVDLRRSDYAKVAKIFYNLSIQSFDDGDYFPVWGTCLGFEELSLLISGECLL -------------------------------1111------------------------- TATDTVDVAMPLNFTGGQLHSRMFQNFPTELLLSLAVEPLTANFHKWSLSVKNFTMNEKL ---------------1111--1111--3333----------------------------- KKFFNVLTTNTDGKIEFISTMEGYKYPVYGVQWHPEKAPYEWKNLDGISHAPNAVKTAFY ---------------------------------3333-------3333--3333------ LAEFFVNEARKNNHHFKSESEEEKALIYQFSPIYTGNISSFQQCYIFD -------1111--------------3333-----1111---------- >SGTX1; SWP:P56855; PDB:1LA4A; TCRYLFGGCKTTADCCKHLACRSDGKYCAWDGTF ---2222---1111-------------------- >Hemoglobin subunit beta-1; SWP:P45720; PDB:1LA6B; VEWTDKERSIISDIFSHMDYDDIGPKALSRCLVVYPWTQRYFIMSNANVAAHGIKVLHGL -------------1111-3333---------------3333-1111-------------- DRGMKNMDNIADAYTDLSTLHSEKLHVDPDNFKLLSDCITIVLAAKMGHAFTAETQGAFQ -----33331111--------------3333---------------!!!!---------- KFLAAVVSALGK ------------ ------------------------------------------------------------ -------------------- >CAPSID PROTEIN; SWP:Q86801; 
PDB:1LAJA; NIASSSAPSLQHPTFIASKKCRAGYTYTSLDVRPTRTEKDKSFGQRLIIPVPVSEYPKKK -------------------------------------2222--------1111--1111- VSCVQVRLNPSPKFNSTIWVSLRRLDETTLLTSENVFKLFTDGLAVLIYQHVPTGIQPNN ----------3333-----------3333------------------------------- KITFDMSNVGAEIGDMGKYALIVYSKDDVLEADEMVIHIDIEHQRIPSASTLPV -----------33333333-----------1111-------------------- >LEUCINE AMINOPEPTIDASE; SWP:P00727; PDB:1LAM; TKGLVLGIYSKEKEEDEPQFTSAGENFNKLVSGKLREILNISGPPLKAGKTRTFYGLHED -----------1111-------------1111--------------2222-------111 FPSVVVVGLGKKTAGIDEQENWHEGKENIRAAVAAGCRQIQDLEIPSVEVDPCGDAQAAA 1---------1111-------------------------------------iiii----- EGAVLGLYEYDDLKQKRKVVVSAKLHGSEDQEAWQRGVLFASGQNLARRLMETPANEMTP ----1111--1111---------------------------------------3333--- TKFAEIVEENLKSASIKTDVFIRPKSWIEEQEMGSFLSVAKGSEEPPVFLEIHYKGSPNA ----------------------------------------------------------11 SEPPLVFVGKGITFDSGGISIKAAANMDLMRADMGGAATICSAIVSAAKLDLPINIVGLA 11---------------------2222--1111--------------------------- PLCENMPSGKANKPGDVVRARNGKTIQVDNTDAEGRLILADALCYAHTFNPKVIINAATL ------------2222---1111------1111-------------1111---------- TGAMDIALGSGATGVFTNSSWLWNKLFEASIETGDRVWRMPLFEHYTRQVIDCQLADVNN 3333---!!!!------------------------------------------------- IGKYRSAGACTAAAFLKEFVTHPKWAHLDIAGVMTNKDEVPYLRKGMAGRPTRTLIEFLF ----------------1111---------3333------1111-------3333------ RFSQ ---- >LAR; SWP:P10586; PDB:1LARA; MITDLADNIERLKANDGLKFSQEYESIDPGQQFTWENSNLEVNKPKNRYANVIAYDHSRV --------------%%%%-----1111-------3333-33331111-1111--3333-- ILTSIDGVPGSDYINANYIDGYRKQNAYIATQGPLPETMGDFWRMVWEQRTATVVMMTRL ----2222-1111-------2222----------1111---------------------- EEKSRVKCDQYWPARGTETCGLIQVTLLDTVELATYTVRTFALHKSGSSEKRELRQFQFM -%%%%--------------!!!!---------1111--------2222------------ AWPDHGVPEYPTPILAFLRRVKACNPLDAGPMVVHCSAGVGRTGCFIVIDAMLERMKHEK -------------------------1111------------------------------- TVDIYGHVTCMRSQRNYMVQTEDQYVFIHEALLEAATCGHTEVPARNLYAHIQKLGQVPP 
----------1111------3333-------------------3333------------- GESVTAMELEFKLLASSSRFISANLPCNKFKNRLVNIMPYELTRVCLQPIRGVEGSDYIN -----------3333----3333-3333-----1111--3333------2222-1111-- ASFLDGYRQQKAYIATQGPLAESTEDFWRMLWEHNSTIIVMLTKLREMGREKCHQYWPAE -----1111----------3333-----------------------iiii---------- RSARYQYFVVDPMAEYNMPQYILREFKVTDARDGQSRTIRQFQFTDWPEQGVPKTGEGFI ----!!!!---------1111--------------------------------------- DFIGQVHKTKEQFGQDGPITVHCSAGVGRTGVFITLSIVLERMRYEGVVDMFQTVKTLRT ---------------------------3333--------------------------111 QRPAMVQTEDQYQLCYRAALEYLGSF 1--------------------3333- >GLUCOCORTICOID RECEPTOR; SWP:P04151; PDB:1LATA; RPCLVCSDEASGCHYGVLTCEGCKAFFKRAVEGQHNYLCKYEGKCIIDKIRRKNCPACRY -------------iiii------------------------------11113333----- RKCLQAGMNLE ----------- >FERRITIN LIGHT CHAIN 1; SWP:P29391; PDB:1LB3A; SQIRQNYSTEVEAAVNRLVNLHLRASYTYLSLGFFFDRDDVALEGVGHFFRELAEEKREG 1111---------------------------------1111------------------- AERLLEFQNDRGGRALFQDVQKPSQDEWGKTQEAMEAALAMEKNLNQALLDLHALGSARA --------1111------------------------------------------------ DPHLCDFLESHYLDKEVKLIKKMGNHLTNLRRVASLGEYLFERLTLK ----------------------------------------------- >TNF RECEPTOR-ASSOCIATED F; SWP:Q9Y4K3; PDB:1LB6A; QQCNGIYIWKIGNFGMHLKCQEEEKPVVIHSPGFYTGKPGYKLCMRLHLQLPTAQRCANY --------------------1111--------------------------1111--2222 ISLFVHTMQGEYDSHLPWPFQGTIRLTILDQSEAPVRQNHEEIMDAKPELLAFQRPTIPR ---------1111-------------------3333----------11111111------ NPKGFGYVTFMHLEALRQRTFIKDDTLLVRCEVST -----------3333-------%%%%--------- >T7 LYSOZYME; SWP:P00806; PDB:1LBA; AKQRESTDAIFVHCSATKPSQNVGVREIRQWHKEQGWLDVGYHFIIKRDGTVEAGRDEMA -----------------3333--3333-------------------1111------1111 VGSHAKGYNHNSIGVCLVGGIDDKGKFDANFTPAQMQSLRSLLVTLLAKYEGAVLRAHHE ----2222-------------1111-------------------------------3333 VAPKACPSFDLKRWWEKNELVTSDRG -------------------------- >FERROCHELATASE; SWP:P16622; PDB:1LBQA; RSPTGIVLMNMGGPSKVEETYDFLYQLFADNDLIPISAKYQKTIAKYIAKFRTPKIEKQY 
---------------3333----------------------------------------- REIGGGSPIRKWSEYQATEVCKILDKTCPETAPHKPYVAFRYAKPLTAETYKQMLKDGVK ----------------------3333-1111-----------------------1111-- KAVAFSQYPHFSYSTTGSSINELWRQIKALDSERSISWSVIDRWPTNEGLIKAFSENITK -----------1111---------------1111-------------------------- KLQEFPQPVRDKVVLLFSAHSLPMDVVNTGDAYPAEVAATVYNIMQKLKFKNPYRLVWQS --------1111----------1111-------------------1111----------- QVGPKPWLGAQTAEIAEFLGPKVDGLMFIPIAFTSDHIETLHEIDLGVIGESEYKDKFKR -------------------1111------1111--------------------3333--- CESLNGNQTFIEGMADLVKSHLQSNQLYSNQLPLDFALGKSNDPVKDLSLVFGNHE ---!!!!---------------------3333-3333---------3333------ >MURAMOYL-PENTAPEPTIDE CAR; SWP:P00733; PDB:1LBU; DGCYTWSGTLSEGSSGEAVRQLQIRVAGYPGTGAQLAIDGQFGPATKAAVQRFQSAYGLA ----------2222--3333-----1111-2222--------------------1111-- ADGIAGPATFNKIYQLQDDDCTPVNFTYAELNRCNSDWSGGKVSAATARANALVTMWKLQ -------------11111111-11113333------------------------------ AMRHAMGDKPITVNGGFRSVTCNSNVGGASNSRHMYGHAADLGAGSQGFCALAQAARNHG ---1111---------------------11111111-------------------1111- FTEILGPGYPGHNDHTHVAGGDGRFWSAPSCGI -----2222------------------3333-- >fructose 1,6-bisphosphata; SWP:O30298; PDB:1LBVA; MDERDALRISREIAGEVRKAIASMPLRERVKDVGMGKDGTPTKAADRVAEDAALEILRKE --------------------11113333--------------3333-------------- RVTVVTEESGVLGEGDVFVALDPLDGTFNATRGIPVYSVSLCFSYSDKLKDAFFGYVYNL -------------------------33331111--------------3333--------- ATGDEYYADSSGAYRNGERIEVSDAEELYCNAIIYYPDRKFPFKRMRIFGSAATELCFFA --------1111--iiii----------------------------------------11 DGSFDCFLDIRPGKMLRIYDAAAGVFIAEKAGGKVTELDGESLGNKKFDMQERLNIVAAN 11--------------3333--------1111----1111--1111-------------3 EKLHPKLLELIK 333--------- >BILIVERDIN REDUCTASE A; SWP:P46844; PDB:1LC0A; MITNSGKFGVVVVGVGRAGSVRLRDLKDPRSAAFLNLIGFVSRRELGSLDEVRQISLEDA ---------------------------1111-----------------!!!!-------- LRSQEIDVAYICSESSSHEDYIRQFLQAGKHVLVEYPMTLSFAAAQELWELAAQKGRVLH 
-------------3333--------1111------------------------------- EEHVELLMEEFEFLRREVLGKELLKGSLRFTASPLEEERFGFPAFSGISRLTWLVSLFGE --3333-----------2222--------------3333--3333--------------- LSLISATLEERKEDQYMKMTVQLETQNKGLLSWIEEKGPGLKRNRYVNFQFTSGSLEEVP ----------3333----------1111---------2222------------------- SVGVNKNIFLKDQDIFVQKLLDQVSAEDLAAEKKRIMHCLGLASDIQKLC ----2222----------1111---------------------------- >L-THREONINE-O-3-PHOSPHATE; SWP:P97084; PDB:1LC5A; LFNTAHGGNIREPATVLGISPDQLLDFSANINPLGMPVSVKRALIDNLDCIERYPDADYF -------------------1111-------------3333---------------1111- HLHQALARHHQVPASWILAGNGETESIFTVASGLKPRRAMIVTPGFAEYGRALAQSGCEI ------------3333-----3333----------------------------1111--- RRWSLREADGWQLTDAILEALTPDLDCLFLCTPNNPTGLLPERPLLQAIADRCKSLNINL -----3333------3333--1111----------------------------------- ILDEAFIDFIPHETGFIPALKDNPHIWVLRSLTKFYAIPGLRLGYLVNSDDAAMARMRRQ ---1111--1111--33331111--------------3333------------------- QMPWSVNALAALAGEVALQDSAWQQATWHWLREEGARFYQALCQLPLLTVYPGRANYLLL ---------------3333----------------------------------------- RCEREDIDLQRRLLTQRILIRSCANYPGLDSRYYRVAIRSAAQNERLLAALRNVL ---1111-----3333------1111---1111---------------------- >LUCIFERASE; SWP:P08659; PDB:1LCI; AKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVNITYAEYFEMSVRL --------------------------------2222------------------------ AEAMKRYGLNTNHRIVVCSENSLQFFMPVLGALFIGVAVAPANDIYNERELLNSMNISQP ---------1111--------1111-------1111------11113333---------- TVVFVSKKGLQKILNVQKKLPIIQKIIIMDSKTDYQGFQSMYTFVTSHLPPGFNEYDFVP -----1111----------3333-----------iiii------3333-22221111--- ESFDRDKTIALIMNSLPKGVALPHRTACVRFSHARDPIFGNQIIPDTAILSVVPFHHGFG ---3333---------------3333---------------------------1111--- MFTTLGYLICGFRVVLMYRFEEELFLRSLQDYKIQSALLVPTLFSFFAKSTLIDKYDLSN -----------------------------1111------1111-3333---1111--111 LHEIASGGAPLSKEVGEAVAKRFHLPGIRQGYGLTETTSAILITPEGPGAVGKVVPFFEA 1----!!!!------------------------3333-----------------2222-- 
KVVDLDTGKTLGVNQRGELCVRGPMIMSGYVNNPEATNALIDKDGWLHSGDIAYWDEDEH -----------------------------------------1111----------1111- FFIVLIKYKGYQVAPAELESILLQHPNIFDAGVAGLPDDDAGELPAAVVVLEHGKTMTEK -------iiii--3333-------1111---------3333----------2222----- EIVDYVASQVTTAKKLRGGVVFVDEVPKLDARKIREILIKAKK ------11113333-1111------------------------ >FELINE LEUKEMIA VIRUS REC; SWP:P11261; PDB:1LCSA; PHQVYNVTWTITNLVTGTKANATSMLGTLTDAFPTMYFDLCDIIGNTWNPSDQEPFPGYG ---------------------------1111-------3333--1111--1111------ CDQPMRRWQQRNTPFYVCPGHANRKQCGGPQDGFCAVWGCETTGETYWRPTSSWDYITVK --------3333-----------3333-1111----2222-----1111----------- KGVTQGIYQCSGGGWCGPCYDKAVHSSTTGASEGGRCNPLILQFTQKGRQTSWDGPKSWG ---------!!!!-------1111---------------------3333----------- LRLYRSGYDPIALFSVSRQVMTITP ------------------------- >HTRA2 SERINE PROTEASE; SWP:O43464; PDB:1LCYA; PPASPRSQYNFIADVVEKTAPAVVYIEILDRHPFLGREVPISNGSGFVVAADGLIVTNAH ---3333-----------3333----------1111-------------1111----333 VVADRRRVRVRLLSGDTYEAVVTAVDPVADIATLRIQTKEPLPTLPLGRSADVRQGEFVV 3!!!!------3333-----------1111------------------3333-2222--- AMGSPFALQNTITSGIVSSAQRPNVEYIQTDAAIDFGNAGGPLVNLDGEVIGVNTMKVTA ---1111---------------------------3333------1111----------22 GISFAIPSDRLREFLHRRYIGVMMLTLSPSILAELQLREPSFPDVQHGVLIHKVILGSPA 22----------------------------------------------------2222-- HRAGLRPGDVILAIGEQMVQNAEDVYEAVRTQSQLAVQIRRGRETLTLYVTPEVTE 1111----------------3333-------------------------------- >PANCREATIC TRYPSIN INHIBI; SWP:P00974; PDB:1LD6A; RPDFCLEPPYAGACRAAAARYFYNAKAGLCQTFAYGACAAKRNNFKSAEDCLRTCGGA -3333------------------1111------------------------------- >MHC CLASS I H-2LD HEAVY C; SWP:P01897; PDB:1LD9A; GPHSMRYFETAVSRPGLGEPRYISVGYVDNKEFVRFDSDAENPRYEPQAPWMEQEGPEYW ------------------------------------3333--------3333-------- ERITQIAKGQEQWFRVNLRTLLGYYNQSAGGTHTLQWMYGCDVGSDGRLLRGYEQFAYDG ----------------------1111---------------------------------- CDYIALNEDLKTWTAADMAAQITRRKWEQAGAAEYYRAYLEGECVEWLHRYLKNGNATLL 
----------------3333----------------------3333---3333-3333-- RTDSPKAHVTHHPRSKGEVTLRCWALGFYPADITLTWQLNGEELTQDMELVETRPAGDGT ------------------------------------------------------------ FQKWASVVVPLGKEQNYTCRVYHEGLPE ---------------------------- >APO-L-LACTATE DEHYDROGENA; SWP:LDH_BACST; PDB:1LDB; MKNNGGARVVVIGAGFVGASYVFALMNQGIADEIVLIDANESKAIGDAMDFNHGKVFAPK --------------3333-------1111------------------------3333--- PVDIWDYDDCRDADLVVICAGANDLVDKNIAIFRSIVESVMASGFQGLFLVATNPVDILT -----1111---------------1111-------------------------------- YATWKFSGLPHERVIGSGTILDTARFRFLLGEYFSVAPQNVHAYIIGEHGDTELPVWSQA ---1111---------!!!!-----------1111-1111---------1111--3333- YIGVMPIDLERIFVNVRDAAYQIIEKKGATYYGIAMGLARVTRAILHNENAILTVSAYLD ------------3333------3333------3333--------1111------------ GLYGERDVYIGVPAVINRNGIREVIEIELNDDEKNRFHHSAATLKSVLARAFTR ---------------------------------------------1111----- >ANAPHASE PROMOTING COMPLE; SWP:Q12440; PDB:1LDDA; KYELTLQRSLPFIEGMLTNLGAMKLHKIHSFLKITVPKDWGYNRITLQQLEGYLNTLADE ------------------------------------3333-1111--------------- GRLKYIANGSYEIV -------------- >GLYCEROL UPTAKE FACILITAT; SWP:P11244; PDB:1LDFA; TLKGQCIAEFLGTGLLIFFGVGCVAALKVAGASFGQWEISVIFGLGVAMAIYLTAGVSGA -----------------------------------------------------3333--- HLNPAVTIALWLFACFDKRKVIPFIVSQVAGAFCAAALVYGLYYNLFFDFEQTHHIVRGS ----------------1111---------------------------------------3 VESVDLAGTFSTYPNPHINFVQAFAVEMVITAILMGLILALTDDGNGVPRGPLAPLLIGL 333-3333------1111------------------------1111---!!!!------- LIAVIGASMGPLTGTAMNPARDFGPKVFAWLAGWGNVAFTGGRDIPYFLVPLFGPIVGAI --------3333------3333-------1111-3333-iiii--3333----------- VGAFAYRKLIGRHL ---------1111- >CULLIN HOMOLOG 1; SWP:Q13616; PDB:1LDJA; LDQIWDDLRAGIQQVYTRQSMAKSRYMELYTHVYNYCTSVGLELYKRLKEFLKNYLTNLL 3333------------------------------1111--3333---------------- KDGEDLMDESVLKFYTQQWEDYRFSSKVLNGICAYLNRHWVRRECYEIYSLALVTWRDCL --------------------------------3333-----------3333--------- 
FRPLNKQVTNAVLKLIEKERNGETINTRLISGVVQSYVELGLNEDDAFAKGPTLTVYKES ------3333---------------3333----------------------------111 FESQFLADTERFYTRESTEFLQQNPVTEYMKKAEARLLEEQRRVQVYLHESTQDELARKC 1---------------------------3333--------------------3333---- EQVLIEKHLEIFHTEFQNLLDADKNEDLGRMYNLVSRIQDGLGELKKLLETHIHNQGLAA -------------------------------------2222------------------- IEKCGEAALNDPKMYVQTVLDVHKKYNALVMSAFNNDAGFVAALDKACGRFINNNAVTKM 3333--3333---------------------1111------------------------- AQSSSKSPELLARYCDSLLKKSSKNPEEAELEDTLNQVMVVFKYIEDKDVFQKFYAKMLA -----------------------------------------1111--------------- KRLVHQNSASDDAEASMISKLKQACGFEYTSKLQRMFQDIGVSKDLNEQFKKHLTNSEPL --1111---1111------------3333------------------------------- DLDFSIQVLSSGSWPFQQSCTFALPSELERSYQRFTAFYASRHSGRKLTWLYQLSKGELV -------------------------1111-------1111----------3333------ TNCFKNRYTLQASTFQMAILLQYNTEDAYTVQQLTDSTQIKMDILAQVLQILLKSKLLVL ------------------------------------------------------------ EDENANVDEVELKPDTLIKLYLGYKNKKLRVNINVPMKTEQKQEQETTHKNIEEDRKLLI -1111-------------------------------3333-------------------- QAAIVRIMKMRKVLKHQQLLGEVLTQLSSRFKPRVPVIKKCIDILIEKEYLERVDGEKDT --------------------------3333---3333--------1111----------- YSYLA ----- >RING-box protein 1; SWP:P62877; PDB:1LDJB; KKRFEVKKWNAVALWAWDIVVDNCAICRNHIMDLCIECQANQASATSEECTVAWGVCNHA -------------------------------------------3333------------- FHFHCISRWLKTRQVCPLDNREWEFQKY --------3333---------------- >L-LACTATE DEHYDROGENASE; SWP:P00344; PDB:1LDNA; MKNNGGARVVVIGAGFVGASYVFALMNQGIADEIVLIDANESKAIGDAMDFNHGKVFAPK -----------------------------------------------------1111--- PVDIWHGDYDDCRDADLVVICAGANQKPGETRLDLVDKNIAIFRSIVESVMASGFQGLFL ---------3333------------------1111------------------------- VATNPVDILTYATWKFSGLPHERVIGSGTILDTARFRFLLGEYFSVAPQNVHAYIIGEHG --------------------------!!!!-----------1111-3333---------1 DTELPVWSQAYIGVMPIRKLVESKGEEAQKDLERIFVNVRDAAYQIIEKKGATYYGIAMG 111--3333-----------------------------1111------------------ 
LARVTRAILHNENAILTVSAYLDGLYGERDVYIGVPAVINRNGIREVIEIELNDDEKNRF -------1111------------1111------------1111---------3333---- HHSAATLKSVLARAFT ---------------- ---------------------------------------------- >GROUP X SECRETORY PHOSPHO; SWP:O15496; PDB:1LE6A; GILELAGTVGCVGPRTPIAYMKYGCFCGLGGHGQPRDAIDWCCHGHDCCYTRAEEAGCSP -----------------1111--------------------------------1111-33 KTERYSWQCVNQSVLCGPAENKCQELLCKCDQEIANCLAQTEYNLKYLFYPQFLCEPDSP 33-------%%%%------------------------1111--3333---1111------ KCD --- ----------------------------------------------------- >MATING-TYPE PROTEIN A-1; SWP:P01367; PDB:1LE8B; RGHRFTKENVRILESWFAKNIENPYLDTKGLENLMKNTSLSRIQIKNWVAARRAKEKTIT -------------------3333-------------------------------1111-- IAPELADLLSGEPL -3333-1111---- >WEST-CENTRAL AFRICAN LEGU; SWP:P24146; PDB:1LED; NTVNFTYPDFWSYSLKNGTEITFLGDATRIPGALQLTKTDANGNPVRSSAGQASYSEPVF ---------------2222----------2222------1111----------------- LWDSTGKAASFYTSFTFLLKNYGAPTADGLAFFLAPVDSSVKDYGGFLGLFRHETAADPS --1111-----------------------------1111-----1111---3333--333 KNQVVAVEFDTWINKDWNDPPYPHIGIDVNSIVSVATTRWENDDAYGSSIATAHITYDAR 3------------3333-----------------------3333---------------- SKILTVLLSYEHGRDYILSHVVDLAKVLPQKVRIGFSAGVGYDEVTYILSWHFFSTLDGT ----------------------3333------------------------------2222 NK -- >LEUCINE DEHYDROGENASE; SWP:Q7SIB4; PDB:1LEHA; MEIFKYMEKYDYEQLVFCQDEASGLKAVIAIHDTTLGPALGGARMWTYNAEEEAIEDALR --------------------1111---------1111------------3333------- LARGMTYKNAAAGLNLGGGKTVIIGDPFADKNEDMFRALGRFIQGLNGRYITAEDVGTTV -------------------------1111--3333--------1111-------2222-- DDMDLIHQETDYVTGISPAFGSSGNPSPVTAYGVYRGMKAAAKEAFGSDSLEGLAVSVQG --------------------------------------------------2222------ LGNVAKALCKKLNTEGAKLVVTDVNKAAVSAAVAEEGADAVAPNAIYGVTCDIFAPCALG ------------1111---------3333----1111----11113333----------- AVLNDFTIPQLKAKVIAGSADNQLKDPRHGKYLHELGIVYAPDYVINAGGVINVADELYG ---33331111--------------3333------------3333---------3333-- 
YNRTRAMKRVDGIYDSIEKIFAISKRDGVPSYVAADRMAEERIAKVAKARSQFLQDQRNI -----------------------------3333-------------------------33 LNGR 33-- >LECTIN; SWP:P02870; PDB:1LENA; TETTSFSITKFSPDQQNLIFQGDGYTTKGKLTLTKAVKSTVGRALYSTPIHIWDRDTGNV --------------1111--------iiii------------------------------ ANFVTSFTFVIDAPSSYNVADGFTFFIAPVDTKPQTGGGYLGVFNSKEYDKTSQTVAVEF --------------1111----------1111----!!!!---------1111------- DTFYNAAWDPSNKERHIGIDVNSIKSVNTKSWNLQNGERANVVIAFNAATNVLTVTLTYP ----3333-1111---------------------2222---------------------- N - >Lectin [Precursor]; SWP:P02870; PDB:1LENB; VTSYTLNEVVPLKDVVPEWVRIGFSATTGAEFAAQEVHSWSFNSQLG ----------3333--------------------------------- >GLUCOAMYLASE; SWP:O85672; PDB:1LF6A; SIKIDRFNNISAVNGPGEEDTWASAQKQGVGTANNYVSKVWFTLANGAISEVYYPTIDTA --------------------------------------------iiii-------1111- DVKEIKFIVTDGKSFVPDETKDAISKVEKFTDKSLGYKLVNTDKKGRYRITKDIFTDVKR -----------------3333---------------------1111----------1111 NSLIMKAKFEALEGSIHDYKLYLAYDPHIKNQGSYNEGYVIKANNNEMLMAKRDNVYTAL --------------1111----------%%%%----------%%%%------!!!!---- SSNIGWKGYSIGYYKVNDIMTDLDENKQMTKHYDSARGNIIEGAEIDLTKNSEFEIVLSF -1111-------2222------------------------------3333---------- GQSDSEAAKTALETLGEDYNNLKNNYIDEWTKYCNTLNNFNGKANSLYYNSMMILKASED ---------------------------------1111--iiii----------------- KTNKGAYIASLSIPWGDGQRDDNTGGYHLVWSRDLYHVANAFIAAGDVDSANRSLDYLAK --2222--------3333------1111-------------------------------- VVKDNGMIPQNTWISGKPYWTGIQLDEQADPIILSYRLKRYDLYDSLVKPLADFIIKIGP ------------3333-------3333------------1111----------------- KTGQERWEEIGGYSPATMAAEVAGLTCAAYIAEQNKDYESAQKYQEKADNWQKLIDNLTY ----1111------------------------1111------------------3333-- TENGPLGNGQYYIRIAGLSDPDADFMINIANGGGVYDQKEIVDPSFLELVRLGVKSADDP -----!!!!----------1111------%%%%---1111---------1111--1111- KILNTLKVVDSTIKVDTPKGPSWYRYNHDGYGEPSKTELYHGAGKGRLWPLLTGERGMYE -------------------------2222-----1111---------------------- IAAGKDATPYVKAMEKFANEGGIISEQVWEDTGLPTDSASPLNWAHAEYVILFASNIEHK 
------3333--------1111-------------------------------------- VLDMPDIVYKRYVA 11113333------ >COMPLEMENT PROTEIN C8GAMM; SWP:P07360; PDB:1LF7A; ASPISTIQPKANFDAQQFAGTWLLVAVGSAGRRAEATTLHVAPQGTAMAVSTFRKLDGIC -3333--------3333--------------------------!!!!--------iiii- WQVRQLYGDTGVLGRFLLQARGARGAVHVVVAETDYQSFAVLYLERAGQLSVKLYARSLP -----------2222------------------------------iiii----------- VSDSVLSGFEQRVQEAHLTEDQIFYFPKYGFCEAADQFHVLDEV -------------1111-3333-------------1111----- >LIVER TRANSCRIPTION FACTO; SWP:P22361; PDB:1LFB; RFKWGPASQQILFQAYERQKNPSKEERETLVEECNRAECIQRGVSPSQAQGLGSNLVTEV --------------1111-------------------1111---3333-3333------- RVYNWFANRRKEEAFRHK ----------1111---- >RALGDS; SWP:Q03386; PDB:1LFDA; GDCCIIRVSLDVDNGNMYKSILVTSQDKAPTVIRKAMDKHNLDEDEPEDYELLQIISEDH -----------------------11113333------111133333333-------1111 KLKIPENANVFYAMNSAANYDFILKKR --------3333--1111--------- >P450 MONOOXYGENASE; SWP:Q8RN04; PDB:1LFKA; DPRPLHIRRQGLDPADELLAAGALTRVTAETHWMATAHAVVRQVMGDHQQFSTRRRWDPE ---3333--!!!!--3333----------------------------1111--------- LVGNLMDYDPPEHTRLRRKLTPGFTLRKMQRMAPYIEQIVNDRLDEMERAGSPADLIAFV ---3333------------3333------------------------3333---3333-- ADKVPGAVLCELVGVPRDDRDMFMKLCHGHLDASLSQKRRAALGDKFSRYLLAMIARERK ---------------1111------------3333------------------------- EPGEGMIGAVVAEYGDDATDEELRGFCVQVMLAGDDNISGMIGLGVLAMLRHPEQIDAFR -------------!!!!----------------------------------3333-1111 GDEQSAQRAVDELIRYLTVPYSPTPRIAREDLTLAGQEIKKGDSVICSLPAANRDPALAP ---------------------------------%%%%--2222------3333-3333-- DVDRLDVTREPIPHVAFGHGVHHCLGAALARLELRTVFTELWRRFPALRLADPAQDTEFR 1111-1111-----1111-11111111-----------------1111---1111----- LTTPAYGLTELMVAW --------------- >LIVER FATTY ACID BINDING ; SWP:P02692; PDB:1LFO; MNFSGKYQVQSQENFEPFMKAMGLPEDLIQKGKDIKGVSEIVHEGKKVKLTITYGSKVIH -------------------1111-------3333---------!!!!------!!!!--- NEFTLGEEELETMTGEKVKAVVKMEGDNKMVTTFKGIKSVTEFNGDTITNTMTLGDIVYK ---2222----3333------------------iiii------!!!!------!!!!--- 
RVSKRI ------ >HYPOTHETICAL PROTEIN AQ_1; SWP:O67517; PDB:1LFPA; SHWAQIKHKKAKVDAQRGKLFSKLIREIIVATRLGGPNPEFNPRLRTAIEQAKKANPWEN -------------------------------------3333-----------11113333 IERAIKKGAGELEGEQFEEVIYEGYAPGGVAVVLATTDNRNRTTSEVRHVFTKHGGNLGA ---------------------------------------------------1111----2 SGCVSYLFERKGYIEVPAKEVSEEELLEKAIEVGAEDVQPGEEVHIIYTVPEELYEVKEN 2223333---------3333-----------------------------3333------- LEKLGVPIEKAQITWKPISTVQINDEETAQKVIKLLNALEELDDVQQVIANFEIPEEILQ -1111------------------------------------3333--------------- K - >PEPV; SWP:P45494; PDB:1LFWA; MDLNFKELAEAKKDAILKDLEELIAIDSSEDLENATEEYPVGKGPVDAMTKFLSFAKRDG ------------------------------3333-3333-----------------1111 FDTENFANYAGRVNFGAGDKRLGIIGHMDVVPAGEGWTRDPFKMEIDEEGRIYGRGSADD -----iiii-------------------------------------1111---2222--- KGPSLTAYYGMLLLKEAGFKPKKKIDFVLGTNEETNWVGIDYYLKHEPTPDIVFSPDAEY --------------1111-------------1111------------------------- PIINGEQGIFTLEFSFKNDDTKGDYVLDKFKAGIATNVTPQVTRATISGPDLEAVKLAYE ------------------------------------------------------------ SFLADKELDGSFEINDESADIVLIGQGAHASAPQVGKNSATFLALFLDQYAFAGRDKNFL --------------!!!!----------11111111----------3333---------- HFLAEVEHEDFYGKKLGIFHHDDLMGDLASSPSMFDYEHAGKASLLNNVRYPQGTDPDTM ------2222---3333--------------------1111------------------- IKQVLDKFSGILDVTYNGFEEPHYVPGSDPMVQTLLKVYEKQTGKPGHEVVIGGGTYGRL -------3333--------------1111--------------------------3333- FERGVAFGAQPENGPMVMHAANEFMMLDDLILSIAIYAEAIYELTKDE ----------2222--2222---------------------------- >PRION-LIKE PROTEIN; SWP:Q9UKY0; PDB:1LG4A; AENRPGAFIKQGRKLDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINA -----------------------------3333---------------1111-------3 TQAANQGEFQKPDNKLHQQVLWRLVQELCSLKHCEFWLE 333-3333------------------------------- >VSV MATRIX PROTEIN; SWP:Q8B0H2; PDB:1LG7A; QLRYEKFFFTVKMTVRSNRPFRTYSDVAAAVSHWDHMYIGMAGKRPFYKILAFLGSSNLK ----------------------3333----1111------3333-----------1111- 
ATPAQPEYHAHEGRAYLPHRMGKTPPMLNVPEHFRRPFNIGLYKGTVELTMTIYDDESLE ------------------------------------------------------------ AAPMIWDHFNSSKFSDFREKALMFGLIVEKKASGAWVLDSVSH ---3333------1111-------------------------- >Mannose/glucose-specific ; SWP:P12307; PDB:1LGCB; ETSYTLNEVVPLKEFVPEWVRIGFSATTGAEFAAHEVLSWYFNSELSVTSS ----------3333------------------------------------- >CELL CYCLE CHECKPOINT PRO; SWP:Q96EP1; PDB:1LGPA; MQPWGRLLRLGAEEGEPHVLLRKREWTIGRRRGCDLSFPSNKLVSGDHCRIVVDEKSGQV --------2222------------------1111---1111------------------- TLEDTSTSGTVINKLKVVKKQTCPLQTGDVIYLVYRKNEPEHNVAYLYESLSE -------------------------2222------11111111---------- >TRIACYLGLYCEROL LIPASE; SWP:P21811; PDB:1LGYA; KVVAATTAQIQEFTKYAGIAATAYCRSVVPGNKWDCVQCQKWVPDGKIITTFTSLLSDTN -----3333----------3333-3333--------------1111-------------- GYVLRSDKQKTIYLVFRGTNSFRSAITDIVFNFSDYKPVKGAKVHAGFLSSYEQVVNDYF ------1111----------3333-----------3333---------------1111-- PVVQEQLTAHPTYKVIVTGHSLGGAQALLAGMDLYQREPRLSPKNLSIFTVGGPRVGNPT ---------1111------!!!!--------------33333333--------------- FAYYVESTGIPFQRTVHKRDIVPHVPPQSFGFLHPGVESWIKSGTSNVQICTSEIETKDC ----3333--------!!!!1111--3333------------------------------ SNSIVPFTSILDHLSYFDINEGSCL -1111----1111--%%%%------ >OMP SYNTHASE; SWP:P08870; PDB:1LH0A; MKPYQRQFIEFALNKQVLKFGEFTLKSGRKSPYFFNAGLFNTGRDLALLGRFYAEALVDS ------------------------3333-------3333--------------------- GIEFDLLFGPAYKGIPIATTTAVALAEHHDKDLPYCFNRKEAKDHGEGGSLVGSALQGRV -----------1111-----------------------------!!!!------------ MLVDDVITAGTAIRESMEIIQAHGATLAGVLISLDRQERGRGEISAIQEVERDYGCKVIS ----------3333------1111------------------------------------ IITLKDLIAYLEEKPDMAEHLAAVRAYREEFGV --3333---33333333---------------- >PYRIDOXAL KINASE; SWP:P82197; PDB:1LHPA; ECRVLSIQSHVVRGYVGNRAATFPLQVLGFEVDAVNSVQFSNHTGYSHWKGQVLNSDELQ ---------------!!!!------1111------------------------------- ELYDGLKLNHVNQYDYVLTGYTRDKSFLAMVVDIVQELKQQNPRLVYVCDPVMGDQRNGE ------1111-------------------------------1111--------------- 
GAMYVPDDLLPVYREKVVPVADIITPNQFEAELLTGRKIHSQEEALEVMDMLHSMGPDTV -----1111-------3333---------------------------------------- VITSSNLLSPRGSDYLMALGSQRTRGSVVTQRIRMEMHKVDAVFVGTGDLFAAMLLAWTH --------1111------------------------------------------------ KHPNNLKVACEKTVSAMHHVLQRTIKCAKAKSGEGVKPSPAQLELRMVQSKKDIESPEIV -1111---------------------------2222--3333----1111-3333----- VQATVL ------ >MYOGLOBIN; SWP:P56208; PDB:1LHT; GLSDDEWNHVLGIWAKVEPDLSAHGQEVIIRLFQLHPETQERFAKFKNLTTIDALKSSEE ------------33333333---------------3333---3333-------------- VKKHGTTVLTALGRILKQKNNHEQELKPLAESHATKHKIPVKYLEFICEIIVKVIAEKHP -----------------!!!!3333--------------3333----------------- SDFGADSQAAMKKALELFRNDMASKYKEFGFQG ------------------------3333----- >S-ADENOSYLHOMOCYSTEINE HY; SWP:P23526; PDB:1LI4A; DKLPYKVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKPLKGARIAGCLHMTVETAV --------3333-------------------------1111-2222-------------- LIETLVTLGAEVQWSSCNIFSTQDHAAAAIAKAGIPVYAWKGETDEEYLWCIEQTLYFKD -----1111--------1111--------------------------------------- GPLNMILDDGGDLTNLIHTKYPQLLPGIRGISEETTTGVHNLYKMMANGILKVPAINVND --------------------33331111-----------------1111--------111 SVTKSKFDNLYGCRESLIDGIKRATDVMIAGKVAVVAGYGDVGKGCAQALRGFGARVIIT 1--1111---------------------2222---------------------------- EIDPINALQAAMEGYEVTTMDEACQEGNIFVTTTGCIDIILGRHFEQMKDDAIVCNIGHF ----------1111----33331111--------------33331111------------ DVEIDVKWLNENAVEKVNIKPQVDRYRLKNGRRIILLAEGRLVNLGCAMGHPSFVMSNSF -------------------2222----1111-----%%%%-3333------3333----- TNQVMAQIELWTHPDKYPVGVHFLPKKLDEAVAEAHLGKLNVKLTKLTEKQAQYLGMSCD ------------3333--------3333---------1111----------------111 GPFKPDHYRY 1---1111-- >CYSTEINYL-TRNA SYNTHETASE; SWP:P21888; PDB:1LI5A; MLKIFNTLTRQKEEFKPIHAGEVGMYVCGITVYDLCHIGHGRTFVAFDVVARYLRFLGYK ------1111--------2222-------------------------------------- LKYVRNITDIDDKIIKRANENGESFVAMVDRMIAEMHKDFDALNILRPDMEPRATHHIAE ----------3333----1111------------------------------3333---- 
IIELTEQLIAKGHAYVADNGDVMFDVPTDPTYGVLSRQKRNPMDFVLWKMSKEGEPSWPS ----------------1111----333311113333----3333-------2222----1 PWGAGRPGWHIECSAMNCKQLGNHFDIHGGGSDLMFPHHENEIAQSTCAHDGQYVNYWMH 111------3333-----------------3333-------------------------- SGMVMVDREKMSKSLGNFFTVRDVLKYYDAETVRYFLMSGHYRSQLNYSEENLKQARAAL -----------3333----33331111--------1111-3333---------------- ERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTPEAYSVLFDMAREVNRLKAEDMAA ------22221111-------------------------------------3333----- ANAMASHLRKLSAVLGLLEQEPEAFL ----------3333------3333-- >LAMBDA III BENCE JONES PR; SWP:NA; PDB:1LILA; YEVTQPPSLSVSPGQTARITCSGEKLGDAYVCWYQQRPGQSPVVVIYQDNRRPSGIPERF ------------------------------------------------------------ SGSSSGNTATLTISGTQTLDEADYYCQVWDSNASVVFGGGTKLTVLGQPKAAPSVTLFPP ----------------3333---------------------------------------- SSEELQANKATLVCLISDFYPGAVTVAWKADSSPVKAGVETTTPSKQSNNKYAASSYLSL 33333333---------------------iiii--------------------------- TPEQWKSHRSYSCQVTHEGSTVEKTVAPTECS 33331111------------------------ >LQ2; SWP:P45628; PDB:1LIR; FTQESCTASNQCWSICKRLHNTNRGKCMNKKCRCYS -------3333------------------------- >PYRUVATE KINASE, ISOZYMES; SWP:P30613; PDB:1LIUA; QQQQLPAAMADTFLEHLCLLDIDSEPVAARSTSIIATIGPASRSVERLKEMIKAGMNIAR -%%%%3333-------11111111--------------1111------------------ LNFSHGSHEYHAESIANVREAVESFAGSPLSYRPVAIALDTKGPEIRTGILQGGPESEVE ---------------------------3333-------------------2222------ LVKGSQVLVTVDPAFRTRGNANTVWVDYPNIVRVVPVGGRIYIDDGLISLVVQKIGPEGL -2222------3333------------3333----2222----%%%%------------- VTQVENGGVLGSRKGVNLPGAQVDLPGLSEQDVRDLRFGVEHGVDIVFASFVRKASDVAA -----------------------------------------------------3333--- VRAALGPEGHGIKIISKIENHEGVKRFDEILEVSDGIMVARGDLGIEIPAEKVFLAQKMM -----1111---------------------1111-------3333---3333-------- IGRCNLAGKPVVCATQMLESMITKPRPTRAETSDVANAVLDGADCIMLSGETAKGNFPVE -----------------3333---------------------------3333-------- AVKMQHAIAREAEAAVYHRQLFEELRRAAPLSRDPTEVTAIGAVEAAFKCCAAAIIVLTT 
------------1111-------------------------------------------- TGRSAQLLSRYRPRAAVIAVTRSAQAARQVHLCRGVFPLLYREPPEAIWADDVDRRVQFG -------3333---------------------2222-----------3333--------- IESGKLRGFLRVGDLVIVVTGWRPGSGYTNIMRVLSI ----------2222----------------------- >Cytochrome B5 outer mitoc; SWP:P04166; PDB:1LJ0A; SDPAVTYYRLEEVAKRNTSEETWMVLHGRVYDLTRFLSEHPGGEEVLREQAGADATESFE -1111---3333-----3333----iiii---11111111---33331111--------- DVGHSPDAREMSKQYYIGDVHPNDLKPK -----------3333-----3333---- >NONSTRUCTURAL RNA-BINDING; SWP:P03536; PDB:1LJ2A; HSLQNVIPQQQAHIAELQVYNNKLERDLQNKIGSLTSSIEWYLRSMELDPEIKADIEQQI -3333------------------------------------3333-------------11 NSIDAINPLHAFDDLESVIRNLISDYDKLFLMFKGLIQRSNYQYSF 11---------------------------------1111------- >PLASMINOGEN ACTIVATOR INH; SWP:P05121; PDB:1LJ5A; VHHPPSYVAHLASDFGVRVFQQVAQASKDRNVVFSPYGVASVLAMLQLTTGGETQQQIQA --3333--------------------------------------3333------------ AMGFKIDDKGMAPALRHLYKELMGPWNKDEISTTDAIFVQRDLKLVQGFMPHFFRLFRST ----1111---------------3333------------1111--2222----------- VKQVDFSEVERARFIINDWVKTHTKGMISNLLGKGAVDQLTRLVLVNALYFNGQWKTPFP ----3333---------------iiii-----2222-1111------------------- DSSTHRRLFHKSDGSTVSVPMMAQTNKFNYTEFTTPDGHYYDILELPYHGDTLSMFIAAP -1111-----1111--------------------1111---------1111--------- YEKEVPLSALTNILSAQLISHWKGNMTRLPRLLVLPKFSLETEVDLRKPLENLGMTDMFR -3333-----------------1111------------------------1111-33331 QFQADFTSLSDQEPLHVAQALQKVKIEVNESGTVASSSTAVIVSARMAPEEIIMDRPFLF 111--3333-------------------1111---------------------------- VVRHNPTGTVLFMGQVMEP ----1111----------- >MANNITOL DEHYDROGENASE; SWP:O08355; PDB:1LJ8A; KLNKQNLTQLAPEVKLPAYTLADTRQGIAHIGVGGFHRAHQAYYTDALNTGEGLDWSICG --3333----1111-----3333-----------------------------1111---- VGLRSEDRKARDDLAGQDYLFTLYELGDTDDTEVRVIGSISDLLAEDSAQALIDKLASPE ---3333---------%%%%-----------------------3333----------333 IRIVSLTITEGGYCIDDSNGEFAHLPQIQHDLAHPSSPKTVFGFICAALTQRRAAGIPAF 3-------1111---------------------1111---------------1111---- 
TVSCDNLPHNGAVTRKALLAFAALHNAELHDWIKAHVSFPNAVDRITPTSTAHRLQLHDE ------------------------------------------------------------ HGIDDAWPVVCEPFVQWVLEDKFVNGRPAWEKVGVQFTDDVTPYEEKIGLLNGSHLALTY ----------------------------------------3333---------------- LGFLKGYRFVHETNDPLFVAYRAYDLDVTPNLAPVPGIDLTDYKQTLVDRFSNQAIADQL --------3333---------------3333---2222-----------11113333--- ERVCSDGSSKFPKFTVPTINRLIADGRETERAALVVAAWALYLKGVDENGVSYTIPDPRA ------1111------------------3333--------------1111---------- EFCQGLVSDDALISQRLLAVEEIFGTAIPNSPEFVAAFERCYGSLRDNGVTTTLKHLLKK ----11113333---11111111---3333--------------------------1111 P - >TRANSCRIPTIONAL REGULATOR; SWP:Q82ZP8; PDB:1LJ9A; TDILREIGMIARALDSISNIEFKELSLTRGQYLYLVRVCENPGIIQEKIAELIKVDRTTA -------------------1111----iiii---------2222---------------- ARAIKRLEEQGFIYRQEDASNKKIKRIYATEKGKNVYPIIVRENQHSNQVALQGLSEVEI -------1111------3333------------------------------2222----- SQLADYLVRMRKNVSEDWEFVKKG ------------------------ >ARCHAEAL SM-LIKE PROTEIN ; SWP:NA; PDB:1LJOA; GAMVLPNQMVKSMVGKIIRVEMKGEENQLVGKLEGVDDYMNLYLTNAMECKGEEKVRSLG ----------1111----------------------1111----------!!!!------ EIVLRGNNVVLIQPQ ----1111------- >GLUTATHIONE S-TRANSFERASE; SWP:P30712; PDB:1LJRA; MGLELFLDLVSQPSRAVYIFAKKNGIPLELRTVDLVKGQHKSKEFLQINSLGKLPTLKDG ----------------------------------3333----3333--3333------!! 
DFILTESSAILIYLSCKYQTPDHWYPSDLQARARVHEYLGWHADCIRGTFGIPLWVQVLG !!------------------3333---------------------2222-3333------ PLIGVQVPEEKVERNRTAMDQALQWLEDKFLGDRPFLAGQQVTLADLMALEELMQPVALG 1111-------------------------------1111---3333----------1111 YELFEGRPRLAAWRGRVEAFLGAELCQEAHSIILSILEQAAKKTLPTPSPEAYQAMLLRI -1111-------------------------3333---------------3333------1 ARIP 111- >BETA-2-MICROGLOBULIN; SWP:P01901; PDB:1LK2A; GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWMEQEGPEYW -------------2222----------!!!!-----1111--------3333---3333- ERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSDGRLLRGYQQYAYDG -------------------------------------------1111----------iii CDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAYLEGTCVEWLRRYLKNGNATLL i-----1111-----------------------------------------------111 RTDSPKAHVTHHSRPEDKVTLRCWALGFYPADITLTWQLNGEELIQDMELVETRPAGDGT 1-------------------------------------iiii--1111------------ FQKWASVVVPLGKEQYYTCHVYHQGLPEPLTLRW ---------22221111-----1111-------- >Beta-2-microglobulin [Pre; SWP:P01887; PDB:1LK2B; IQKTPQIQVYSRHPPENGKPNILNCYVTQFHPPHIEIQMLKNGKKIPKVEMSDMSFSKDW ---------------2222---------------------iiii------------1111 SFYILAHTEFTPTETDTYACRVKHDSMAEPKTVYWDRDM -----------------------3333--------1111 >INTERLEUKIN-10; SWP:NA; PDB:1LK3H; QVNLLQSGAALVKPGASVKLSCKASGYTFTDFYIHWVKQSHGKSLEWIGYINPNSGYTNY ------------2222-----------1111----------------------------- NEKFKNKATLTVDKSTSTGYMELSRLTSEDSANYSCTRGVPGNNWFPYWGQGTLVTVSSA 3333--------3333----------3333---------2222----------------- ETTAPSVYPLAPGTALKSNSMVTLGCLVKGYFPEPVTVTWNSGALSSGVHTFPAVLQSGL ---------------------------------------%%%%-------------iiii YTLTSSVTVPSSTWPSQTVTCNVAHPASSTKVDKKIVPR ---------1111------------1111---------- >INTERLEUKIN-10; SWP:NA; PDB:1LK3L; DTVLTQPPALTVSPGEKLTISCKASESVTSRMHWYQQKPGQQPKLLIYKASNLASGVPAR ------------2222-----------!!!!------2222------------2222333 FSGSGSGTDFTLTIDPVEADDTAIYFCQQSWNGPLTFGAGTKLELKRADAAPTVSIFPPS 3----------------1111--------------------------------------3 
TEQLATGGASVVCLMNNFYPRDISVKWKIDGTERRDGVLDSVTDQDSKDSTYSMSSTLSL 333-------------------------iiii---------------------------- TKADYESHNLYTCEVVHKTSSSPVVKSFNR ----------------1111---------- >D-RIBOSE-5-PHOSPHATE ISOM; SWP:O50083; PDB:1LK5A; MNVEEMKKIAAKEALKFIEDDMVIGLGTGSTTAYFIKLLGEKLKRGEISDIVGVPTSYQA --------------11112222------3333---------------------------- KLLAIEHDIPIASLDQVDAIDVAVDGADEVDPNLNLIKGRGAALTMEKIIEYRAGTFIVL ----1111----1111--------------1111----1111-------3333------- VDERKLVDYLCQKMPVPIEVIPQAWKAIIEELSIFNAKAELRMGVNKDGPVITDNGNFII -3333---2222--------3333--------1111----------------1111---- DAKFPRIDDPLDMEIELNTIPGVIENGIFADIADIVIVGTREGVKKLER ------------------------------------------------- >BIPHENYL-2,3-DIOL 1,2-DIO; SWP:P47228; PDB:1LKDA; SIRSLGYMGFAVSDVAAWRSFLTQKLGLMEAGTTDNGDLFRIDSRAWRIAVQQGEVDDLA ---------------------------------1111----------------3333--- FAGYEVADAAGLAQMADKLKQAGIAVTTGDASLARRRGVTGLITFADPFGLPLEIYYGAS -------------------1111-----------------------1111---------- EVFEKPFLPGAAVSGFLTGEQGLGHFVRCVPDSDKALAFYTDVLGFQLSDVIDMKMGPDV -1111------------!!!!-----------------------------------1111 TVPAYFLHCNERHHTLAIAAFPLPKRIHHFMLEVASLDDVGFAFDRVDADGLITSTLGRH -----------------------------------3333-------3333---------- TNDHMVSFYASTPSGVEVEYGWSARTVDRSWVVVRHDSPSMWGHKSV -----------1111------------1111---------------- >LEUKOCIDIN F SUBUNIT; SWP:P0A077; PDB:1LKFA; EGKITPVSVKKVDDKVTLYKTTATADSDKFKISQILTFNFIKDKSYDKDTLVLKATGNIN ---------------------------1111-----------1111-------------- SGFVKPNPNDYDFSKLYWGAKYNVSISSQSNDSVNVVDYAPKNQNEEFQVQNTLGYTFGG ------1111--------------------1111-------------------------- DISISNGLTAFSETINYKQESYRTTLSRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPT ------------------2222----1111----------------------1111---- YGNELFLAGRQSSAYAGQNFIAQHQMPLLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQR ---1111-------3333---3333-3333------------------------------ EMDLYQIRWNGFYWAGANYKNFKTRTFKSTYEIDWENHKVKLLDTKETENNK ---------------------------------------------------- >LEUKEMIA INHIBITORY 
FACTO; SWP:P09056; PDB:1LKI; NATCAIRHPCHGNLMNQIKNQLAQLNGSANALFISYYTAQGEPFPNNVEKLCAPNMTDFP --3333-------------------------------------------------1111- SFHGNGTEKTKLVELYRMVAYLSASLTNITRDQKVLNPTAVSLQVKLNATIDVMRGLLSN ------------------------------------1111-------------------- VLCRLCNKYRVGHVDVPPVPDHSDKEAFQRKKLGCQLLGTYKQVISVVVQAF ---------------------1111----------------------1111- >CALMODULIN; SWP:P06787; PDB:1LKJA; SSNLTEEQIAEFKEAFALFDKDNNGSISSSELATVMRSLGLSPSEAEVNDLMNEIDVDGN ---------------------------3333----------------------------- HQIEFSEFLALMSRQLKSNDSEQELLEAFKVFDKNGDGLISAAELKHVLTSIGEKLTDAE -------------------------------------------------3333------- VDDMLREVSDGSGEINIQQFAALLSK -------------------------- >HUMAN P56 TYROSINE KINASE; SWP:P07100; PDB:1LKKA; LEPEPWFFKNLSRKDAERQLLAPGNTHGSFLIRESESTAGSFSLSVRDFDQNQGEVVKHY ---11111111----------22222222--------2222------------------- KIRNLDNGGFYISPRITFPGLHELVRHYTNASDGLCTRLSRPCQT ----2222----3333---------------iiii---------- >RUBRERYTHRIN ALL-IRON(II); SWP:P24931; PDB:1LKOA; KSLKGSRTEKNILTAFAGESQARNRYNYFGGQAKKDGFVQISDIFAETADQEREHAKRLF --2222---------------------------1111----------------------3 KFLEGGDLEIVAAFPAGIIADTHANLIASAAGEHHEYTEMYPSFARIAREEGYEEIARVF 333--------------------------------------------------------- ASIAVAEEFHEKRFLDFARNIKEGRVFLREQATKWRCRNCGYVHEGTGAPELCPACAHPK -----------------------------------------------------------1 AHFELLGINW 111------- >TAILSPIKE PROTEIN; SWP:P12528; PDB:1LKTA; ANVVVSNPRPIFTESRSFKAVANGKIYIGQIDTDPVNPANQIPVYIENEDGSHVQITQPL ---------------------------------33331111------3333--------- IINAAGKIVYNGQLVKIVTVQGHSMAIYDANGSQVDYIANVLKY --1111---iiii---------------1111-------3333- >MYOSIN IE HEAVY CHAIN; SWP:Q54IK6; PDB:1LKXA; GVPDFVLLNQITENAFIENLTMRHKSDNIYTYIGDVVISTNPFKNLNIYKESDIKAYNGR ---3333----------------1111----------------------3333------- YKYEMPPHMYALANDAYRSMRQSQENQCVIISGESGAGKTEASKKIMQFLTFVSSNQSPN 3333---3333----------------------2222--------------3333--333 
GERISKMLLDSNPLLEAFGNAKTLRNDNSSRFGKYMEMQFNAVGSPIGGKITNYLLEKSR 3---------------------3333--------------3333--------------33 VVGRTQGERSFHIFYQMLKGLSQSKLDELGLTPNAPAYEYLKKSGCFDVSTIDDSGEFKI 33--------------1111------1111---3333-3333------33333333---- IVKAMETLGLKESDQNSIWRILAAILHIGNITFAEAAEQTTVKVSDTKSLAAAASCLKTD -----1111-3333-------------1111--------------3333----------- QQSLSIALCYRSVISVPMDCNQAAYSRDALAKALYERLFNWLVSKINTIINCTTEKGPVI ----3333-------------------------------------3333----------- GILDIYGFEVFQNNSFEQLNINFCNEKLQQLFIELTLKSEQEEYVREGIEWKNIEYFNNK ------------------------------------------------------------ PICELIEKKPIGLISLLDEACLIAKSTDQTFLDSICKQFEKNPHLQSYVVSKDRSIGDTC ------------------3333------------------------------33331111 FRLKHYAGDVTYDVRGFLDKNKDTLFGDLISSMQSSSDPLVQGLFPETAGSQFRNAMNAL ----1111-----2222---------------1111----3333---------------- ITTLLACSPHYVRCIKSNDNKQAGVIDEDRVRHQVRYLGLLENVRVRRAGFAGRIEYTRF --------------------------------------------------------3333 YNRYKMLCKKKQATELILQQHNIDKEEIRMGKTKVFIRNPTTLFYFEEKR ----3333---------------1111------------1111-3333-- >GLYCOGENIN-1; SWP:P13280; PDB:1LL2A; MTDQAFVTLTTNDAYAKGALVLGSSLKQHRTSRRLAVLTTPQVSDTMRKALEIVFDEVIT --------------------------1111---------1111----------------- VDILDSGDSAHLTLMKRPELGVTLTKLHCWSLTQYSKCVFMDADTLVLANIDDLFEREEL -1111--1111-----3333-------1111----------------------------- SAAPDPGWPDCFNSGVFVYQPSVETYNQLLHVASEQGSFDGGDQGLLNTFFNSWATTDIR -------1111-----------------------------------------3333-333 KHLPFIYNLSSISIYSYLPAFKAFGANAKVVHFLGQTKPWNYTYDTKTKSVRTHPQFLNV 3---1111---------------3333----------1111------------------- WWDIFTTSVVPLLQQ --------3333--- >CHITINASE 1; SWP:P54196; PDB:1LL7A; GGFRSVVYFVNWAIYGRGHNPQDLKADQFTHILYAFANIRPSGEVYLSDTWADTDKHYPG ---------1111-3333-3333-1111-----------3333--------------222 DKWDEPGNNVYGCIKQMYLLKKNNRNLKTLLSIGGWTYSPNFKTPASTEEGRKKFADTSL 2----------------------1111--------1111---3333-------------- KLMKDLGFDGIDIDWQYPEDEKQANDFVLLLKACREALDAYSAKHPNGKKFLLTIASPAG 
--------------------------------------------1111-----------3 PQNYNKLKLAEMDKYLDFWNLMAYDFSGSWDKVSGHMSNVFPSTTKPESTPFSSDKAVKD 3331111----3333------------3333--------------3333----------- YIKAGVPANKIVLGMPLYGRAFASTDGIGTSFNGVGGGSWENGVWDYKDMPQQGAQVTEL ------1111----------------2222----------2222-3333--2222----- EDIAASYSYDKNKRYLISYDTVKIAGKKAEYITKNGMGGGMWWESSSDKTGNESLVGTVV 1111------1111-----------------------------3333---1111------ NGLGGTGKLEQRENELSYPESVYDNLKNGMPS 11113333---------1111----1111--- >PAS KINASE; SWP:Q96RG2; PDB:1LL8A; GAMDPEFNKAIFTVDAKTTEILVANDKACGLLGYSSQDLIGQKLTQFFLRSDSDVVEALS --------------------------3333----111122223333----------1111 EEHMEADGHAAVVFGTVVDIISRSGEKIPVSVWMKRMRQERRLCCVVVLEPVER ---------------------1111----------------------------- >HEMOCYANIN (SUBUNIT TYPE ; SWP:P04253; PDB:1LLA; LHDKQIRICHLFEQLSSATHSDRLKNVGKLQPGAIFSCFHPDHLEEARHLYEVFWEAGDF ---------11113333---3333----------------------------------33 NDFIEIAKEARTFVNEGLFAFAAEVAVLHRDDCKGLYVPPVQEIFPDKFIPSAAINEAFK 33------1111-----------------1111------1111-1111------------ KESPILVDVTGNILDPEYRLAYYREDVGINAHHWHWHLVYPSTWNPKYFGKKKDRKGELF --------------3333----1111--------------11113333------------ YYMHQQMCARYDCERLSNGMHRMLPFNNFDEPLAGYAPHLTHVASGKYYSPRPDGLKLRD ---------------1111--------1111----------3333--------------- LGDIEISEMVRMRERILDSIHLGYVISEDGSHKTLDELHGTDILGALVESSYESVNHEYY 11113333------------------1111-----3333-----------1111-3333- GNLHNWGHVTMARIHDPDGRFHEEPGVMSDTSTSLRDPIFYNWHRFIDNIFHEYKNTLKP ----------1111-1111------333311113333-----------------1111-- YDHDVLNFPDIQVQDVTLHARVDNVVHTFMREQELELKHGINPGNARSIKARYYHLDHEP -3333--2222-------------------------------!!!!-------------- FSYAVNVQNNSASDKHATVRIFLAPKYDELGNEIKADELRRTAIELDKFKTDLHPGKNTV ---------------------------1111---3333---------------------- VRHSLDSSVTLSHQPTFEDLLHGVGLSEYCSCGWPSHLLVPKGNIKGMEYHLFVMLTDWD --33331111------------------------1111-----3333----------333 KDKVSVACVDAVSYCGARDHKYPDKKPMGFPFDRPIHTEHISDFLTNNMFIKDIKIKFHE 
3---------3333------------2222----------1111-1111----------- >L-LACTATE DEHYDROGENASE; SWP:P00343; PDB:1LLC; ASITDKDHQKVILVGDGAVGSSYAFAMVLQGIAQEIGIVDIFKDKTKGDAIDLSNALPFT ------------------------------------------3333-----1111-1111 SPKKIYSAEYSDAKDADLVVITAGAPKQPGETRLDLVNKNLKILKSIVDPIVDSGFNLIF ---------------------------------33333333------11113333----- LVAANPVDILTYATWKLSGFPKNRVVGSGTSLDTARFRQSIAEMVNVDARSVHAYIMGEH ------3333-------------------------------------------------- GDTEFPVWSHANIGGVTIAEWVKAHPEIKEDKLVKMFEDVRDAAYEIIKLKGATFYGIAT ---------------------------------------11113333--------3333- ALARISKAILNDENAVLPLSVYMDGQYGINDLYIGTPAVINRNGIQNILEIPLTDHEEES ----------------------------------------3333-----------3333- MQKSASQLKKVLTDAFAKNDI --------------------- >L-LACTATE DEHYDROGENASE; SWP:P19869; PDB:1LLDA; PTKLAVIGAGAVGSTLAFAAAQRGIAREIVLEDIAKERVEAEVLDMQHGSSFYPTVSIDG --------------------1111----------3333----------33331111---- SDDPEICRDADMVVITAGPRQKPGQSRLELVGATVNILKAIMPNLVKVAPNAIYMLITNP ----1111-----------------3333-----------3333----1111-------- VDIATHVAQKLTGLPENQIFGSGTNLDSARLRFLIAQQTGVNVKNVHAYIAGEHGDSEVP --------------1111---!!!!----------------3333---------1111-- LWESATIGGVPMSDWTPLPGHDPLDADKREEIHQEVKNAAYKIINGKGATNYAIGMSGVD 3333--iiii3333---2222---3333--------------1111-------------- IIEAVLHDTNRILPVSSMLKDFHGISDICMSVPTLLNRQGVNNTINTPVSDKELAALKRS ----1111-------------iiii-----------1111-------------------- AETLKETAAQFGF ------3333--- >LIPASE 3; SWP:P32947; PDB:1LLFA; APTAKLANGDTITGLNAIINEAFLGIPFAEPPVGNLRFKDPVPYSGSLNGQKFTSYGPSC -----1111-----------------------!!!!-----------2222--------- MQQNPEGTFEENLGKTALDLVMQSKVFQAVLPQSEDCLTINVVRPPGTKAGANLPVMLWI ---1111-------------------------------------22222222-------- FGGGFEIGSPTIFPPAQMVTKSVLMGKPIIHVAVNYRVASWGFLAGDDIKAEGSGNAGLK --%%%%--3333----------1111-------------1111----------------- DQRLGMQWVADNIAGFGGDPSKVTIFGESAGSMSVLCHLIWNDGDNTYKGKPLFRAGIMQ -----------3333---1111------------------%%%%---iiii--------- 
SGAMVPSDPVDGTYGNEIYDLFVSSAGCGSASDKLACLRSASSDTLLDATNNTPGFLAYS --------1111-----------11111111------1111--------1111-1111-! SLRLSYLPRPDGKNITDDMYKLVRDGKYASVPVIIGDQNDEGTIFGLSSLNVTTNAQARA !!!------------------------------------11113333-1111-------- YFKQSFIHASDAEIDTLMAAYPQDITQGSPFDTGIFNAITPQFKRISAVLGDLAFIHARR -----1111--------------3333-----!!!!---1111----------------- YFLNHFQGGTKYSFLSKQLSGLPIMGTFHANDIVWQDYLLGSGSVIYNNAFIAFATDLDP ----------------1111-------22223333-----3333---------------- NTAGLLVNWPKYTSSSQSGNNLMMINALGLYTGKDNFRTAGYDALMTNPSSFFV -------------1111--------1111--------------------1111- >Early growth response pro; SWP:P08046; PDB:1LLMC; MKPFQCRICMRNFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHRDIQHILPIL ----------------------3333---------------------------------- EDKVEELLSKNYHLENEVARLKKLVGE --------------------------- >ANTIVIRAL PROTEIN 3; SWP:Q40772; PDB:1LLNA; NIVFDVENATPETYSNFLTSLREAVDKLTCHGMIMATTLTEQPYVLVDLFGSGTFTLAIR -----1111--------------------iiii----------------1111------- RGNLYLEGYSDIYNGKCRYRIFDSESDAQETVCPGDKSKPGTQNNIPYESYKGMESKGGA ------------iiii--------111133332222----1111---------------3 RTKLGLGITLKSRMGIYGKDATDQKQYQKNEAEFLLIAVQMVTEASRFYIENVAKFDDAN 333------------22221111-------------------------------1111-- GYQPDPAISLEKNWDSVSVIAKVGTSGDSTVTLPGDLDENNKPWTTATMNDLKNDIMALL -------------3333-3333---------------1111------------------- THVTCKVK -------- >LIGNIN PEROXIDASE; SWP:P49012; PDB:1LLP; ATCANGKTVGDASCCAWFDVLDDIQANMFHGGQCGAEAHESIRLVFHDSIAISPAMEAKG --1111----33333333----------%%%%---------------3333--------- KFGGGGADGSIMIFDTIETAFHPNIGLDEVVAMQKPFVQKHGVTPGDFIAFAGAVALSNC --------3333--3333--3333-3333------------------------------2 PGAPQMNFFTGRKPATQPAPDGLVPEPFHTVDQIIARVNDAGEFDELELVWMLSAHSVAA 222----------------------1111-------------------------3333-- VNDVDPTVQGLPFDSTPGIFDSQFFVETQFRGTLFPGSGGNQGEVESGMAGEIRIQTDHT ----1111-------1111------3333-----------2222----2222-------- LARDSRTACEWQSFVGNQSKLVDDFQFIFLALTQLGQDPNAMTDCSDVIPLSKPIPGNGP 
-------------2222----------------22223333---1111------------ FSFFPPGKSHSDIEQACAETPFPSLVTLPGPATSVARIPPHKA ----22221111----1111----------------------- >ALCOHOL DEHYDROGENASE; SWP:Q9HTD9; PDB:1LLUA; TLPQTMKAAVVHAYGAPLRIEEVKVPLPGPGQVLVKIEASGVCHTDLHAAEGDWPVKPPL ------------2222--------------------------3333-------------- PFIPGHEGVGYVAAVGSGVTRVKEGDRVGIPWLYTACGCCEHCLTGWETLCESQQNTGYS ---------------2222---2222-------------3333---33331111-2222- VNGGYAEYVLADPNYVGILPKNVEFAEIAPILCAGVTVYKGLKQTNARPGQWVAISGIGG -----------3333----11113333-3333----------3333-2222-------33 LGHVAVQYARAMGLHVAAIDIDDAKLELARKLGASLTVNARQEDPVEAIQRDIGGAHGVL 33---------------------------1111-----1111------------------ VTAVSNSAFGQAIGMARRGGTIALVGLPPGDFPTPIFDVVLKGLHIAGSIVGTRADLQEA ----3333----11112222-------------------1111----------------- LDFAGEGLVKATIHPGKLDDINQILDQMRAGQIEGRIVLEM ---1111---------3333-------1111---------- >PEPTIDE DEFORMYLASE PDF1; SWP:NA; PDB:1LM4A; HMLTMKDIIRDGHPTLRQKAAELELPLTKEEKETLIAMREFLVNSQDEEIAKRYGLRGLA ---3333-----3333-------------------------------------------3 APQINISKRMIAVLIPDDGSGKSYDYMLVNPKIVSHSVQEAYLPTGEGLSVDDNVAGLVH 333---------------------------------------1111-------------- RHNRITIKAKDIEGNDIQLRLKGYPAIVFQHEIDHLNGVMFYDHIDKNHPLQPHTDAVEV ----------1111--------------------1111-3333--3333----3333--- >subdomain of Desmoplakin ; SWP:P15924; PDB:1LM5A; SSPIAAIFDTENLEKISITEGIERGIVDSITGQRLLEAQACTGGIIHPTTGQKLSLQDAV ---------1111-------------------------1111------------------ SQGVIDQDMATRLKPAQKAFIGFKMSAAEAVKEKWLPYEAGQRFLEFQYLTGGLVDPEVH ------------------------------------------------1111---3333- GRISTEEAIRKGFIDGRAAQRLQDTSSYAKILTCPKTKLKISYKDAINRSMVEDITGLRL -----------------------3333--------------------------------- LEAASVSSK --------- >subdomain of Desmoplakin ; SWP:P15924; PDB:1LM7A; SFQGIRQPVTVTELVDSGILRPSTVNELESGSYDEVGERIKDFLQGSSCIAGIYNETTKQ --------------------33331111------1111-3333----------------- KLGIYEAMKIGLVRPGTALELLEAQAATGFIVDPVSNLRLPVEEAYKRGLVGIEFKEKLL 
---3333--------------------------1111--------------3333----- SAERAVTGYNDPETGNIISLFQAMNKELIEKGHGIRLLEAQIATGGIIDPKESHRLPVDI 1111-------2222---3333-1111--3333-------1111---------------- AYKRGYFNEELSEILSDPSDDTKGFFDPNTEENLTYLQLKERCIKDEETGLCLLPLKE -------33333333---3333----------------1111---------------- >Transcription elongation ; SWP:Q15370; PDB:1LM8B; MDVFLMIRRHKTTIFTDAKESSTVFELKRIVEGILKRPPDEQRLYKDDQLLDDGKTLGEC --------!!!!------1111---------------1111----!!!!--11113333- GFTSQTARPQAPATVGLAFRADDTFEALCIEPFSSPPELPDVMKPQ --3333-1111----------------------------3333--- >Transcription elongation ; SWP:Q15369; PDB:1LM8C; MYVKLISSDGHEFIVKREHALTSGTIKAMLSGPNEVNFREIPSHVLSKVCMYFTYKVRYT ------1111-----3333----------------------3333--------------- NSSTEIPEFPIAPEIALELLMAANFLDC -----------1111------------- >Von Hippel-Lindau disease; SWP:P40337; PDB:1LM8V; RPVLRSVNSREPSQVIFCNRSPRVVLPVWLNFDGEPQPYPTLPPGTGRRIHSYRGHLWLF ------------------------------1111--------2222------2222---- RDAGTHDGLLVNQTELFVPSLNVDGQPIFANITLPVYTLKERCLQVVRSLVKPENYRRLD ----------iiii--------%%%%-------------------------33331111- IVRSLYEDLEDHPNVQKDLERLTQERIAHQ -3333------------------------- >Repressor protein CI; SWP:P03034; PDB:1LMB3; PLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALL ---------------------1111-------1111--------1111------------ AKILKVSVEEFSPSIAREIYEMYEAVS ------3333------------1111- >PEPTIDE DEFORMYLASE; SWP:P96113; PDB:1LMEA; HMYRIRVFGDPVLRKRAKPVTKFDENLKKTIERMIETMYHYDGVGLAAPQVGISQRFFVM ---------3333-------------------------1111----3333---------- DVGNGPVAVINPEILEIDPETEVAEEGLSFPEIFVEIERSKRIKVKYQNTRGEYVEEELE -----------------------------2222---------------1111-------- GYAARVFQHEFDHLNGVLIIDRISP ------------1111-3333---- >IMMUNOGENIC PROTEIN MPT63; SWP:P97155; PDB:1LMIA; SAYPITGKLGSELTMTDTVGQVVLGWKVSDLKSSTAVIPGYPVAGQVWEATATVNAIRGS -------2222------1111--------------------------------------- VTPAVSQFNARTADGINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMN 
---3333----1111--------------------2222--------------------- NGMEDLLIWEP ----------- >FIBRILLIN 1; SWP:P35555; PDB:1LMJA; TDIDECRISPDLCGRGQCVNTPGDFECKCDEGYESGFMMMKNCMDIDECQRDPLLCRGGV ---3333---1111--------------------------------1111-1111----- CHNTEGSYRCECPPGHQLSPNISACI -------------------------- >ANTI-PHOSPHATIDYLINOSITOL; SWP:NA; PDB:1LMKA; VQLQQSGTELMKPGRSLKISCKTTGYIFSNYWIEWVKQRPGHGLEWIGKILPGGGSNTYN --------------------------1111--------2222-----------------3 DKFKGKATFTADTSSNIAYMQLSSLTSEDSAVYYCARGEDYYAYWYVLDYWGQGTTVTVS 333----------------------1111----------3333----------------- SGGGGSDIELTQSPLSLPVSLGDQASISCRSSQSLVHSNGNTSLHWYLKKPGQSPKLLIY ------------------------------------1111---------2222------- KVSTRFSGVPDRFSGSGSGTDFTLKISRVEAEDLGVYFCSQSTHVPFTFGSGTKLELK -----------------------------3333------------------------- >LEISHMANOLYSIN; SWP:P08148; PDB:1LML; VVRDVNWGALRIAVSTEDLTDPAYHCARVGQHVKDHAGAIVTCTAEDILTNEKRDILVKH ---------------3333-1111---2222------------3333------------- LIPQAVQLHTERLKVQQVQGKWKVTDMVGDICGDFKVPQAHITEGFSNTDFVMYVASVPS ------------------------------1111---3333------------------- EEGVLAWATTCQTFSDGHPAVGVINIPAANIASRYDQLVTRVVTHEMAHALGFSGPFFED 2222---------1111---------3333------------------1111-------- ARIVANVPNVRGKNFDVPVINSSTAVAKAREQYGCDTLEYLEVEDQGGAGSAGSHIKMRN ---------iiii---------------------1111---------1111-----3333 AQDELMAPAAAAGYYTALTMAIFQDLGFYQADFSKAEVMPWGQNAGCAFLTNKCMEQSVT 1111-------------------3333----3333----2222--3333------iiii- QWPAMFCNAIRCPTSRLSLGACGVTRHPGLPPYWQYFTDPSLAGVSAFMDYCPVVVPYSD -3333-------3333--------------3333----1111---3333----------- GSCTQRASEAHASLLPFNVFSDAARCIDGAFRPKASYAGLCANVQCDTATRTYSVQVHGS -11113333-3333------1111--------------------------------2222 NDYTNCTPGLRVELSTVSNAFEGGGYITCPPYVEVCQGNVQAAKD ------2222--3333--------------3333-22223333-- ---------------------------------------- >LYSOZYME; SWP:P11941; PDB:1LMQ; KVYDRCELARALKASGMDGYAGNSLPNWVCLSKWESSYNTQATNRNTDGSTDYGIFQINS 
------------11112222---3333--------%%%%------1111----1111-33 RYWCDDGRTPGAKNVCGIRCSQLLTDDLTVAIRCAKRVVLDPNGIGAWVAWRLHCQNQDL 33-----------1111-3333------------------1111----------2222-3 RSYVAGCGV 3332222-- >TOXIN ADO1; SWP:P58608; PDB:1LMRA; ADDDCLPRGSKCLGENKQCCKGTTCMFYANRCVGV -------------------2222---3333----- >3-METHYLADENINE DNA GLYCO; SWP:P05100; PDB:1LMZA; MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACF --------------------------------------------3333------------ HQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFADFVWSFV ----3333---3333--3333------------------------------3333----- NHQPQMTQATTLSEIPTSTPASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGCC ----------3333----3333------3333-----------------------2222- CYPGNKP ------- >PHOSPHATIDYLCHOLINE TRANS; SWP:Q9UKL6; PDB:1LN1A; FSEEQFWEACAELQQPALAGADWQLLVETSGISIYRLLDKKTGLYEYKVFGVLEDCSPTL -3333-----3333---3333-------%%%%------------------------3333 LADIYMDSDYRKQWDQYVKELYEQECNGETVVYWEVKYPFPMSNRDYVYLRQRRDLDMEG --------------1111------------------------------------------ RKIHVILARSTSMPQLGERSGVIRVKQYKQSLAIESDGKKGSKVFMYYFDNPGGQIPSWL ------------1111--2222----------------------------------3333 INWAAKNGVPNFLKDMARACQNY -----------------3333-- >HYPOTHETICAL PROTEIN YHBY; SWP:P42550; PDB:1LN4A; MDLSTKQKQHLKGLAHPLKPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKT --------------3333------3333-------------------------------- LIVEAIVRETGACNVQVIGKTLVLYRPTKERKISLPLE -----------------!!!!------3333------- >SIGNAL RECOGNITION PARTIC; SWP:Q58440; PDB:1LNGA; MIIWPSYIDKKKSRREGRKVPEELAIEKPSLKDIEKALKKLGLEPKIYRDKRYPRQHWEI ---3333-33333333----3333--------------1111----------3333---- CGCVEVDYKGNKLQLLKEICKIIKGKN --------------------------- >GUANYL-SPECIFIC RIBONUCLE; SWP:P05798; PDB:1LNIA; DVSGTVCLSALPPEATDTLNLIASDGPFPYSQDGVVFQNRESVLPTQSYGYYHEYTVITP ------3333-3333------1111----3333-----1111-----2222-------22 GARTRGTRRIITGEATQEDYYTGDHYATFSLIDQTC 22------------2222------------------ >HEMOCYANIN; SWP:P83040; PDB:1LNLA; 
NLVRKSVRNLSPAERASLVAALKSLQEDSSADGFQSLASFHAQPPLCPAPAANKAFACCV -----1111---3333-----------------33333333-------1111-------- HGMATFPEWHRLYTVQFEDALRRHGSVVGIPYWDTVVPQEDLPAFFNDEIWDDALFHANF --1111------------------------------------------------------ TNPFNGADIDFNHQKIARDINVDKLAKEGPKGYDTWSFKQYIYALEQEDYCDFEVQFEIA ---------1111-----------------------------3333---3333------- HNAIHAWVGGTEEYSMGHLHYASYDPVFILHHSNTDRLFALWQELQKFRGHDPNEVNCAL --------------------11113333----------------3333----------33 EMMREPLKPFSFGAPYNLNPTTKEHSKPEDTFDYKGHFHYEYDHLELQGMNVQRLHDYIN 33------1111---------3333-------------------------3333111133 QQKEADRVFAGFLLEGIGTSAHLDFSICAIDGECTHAGYFDVLGGSLETPWQFDRLYKYE 33---------------------------------------------------------- ITDVLESKGLDVHDVFDIKITQTSWDNEDISTDRFPPPSVIYVPK -------------------------------1111---------- >X-PROLYL DIPEPTIDYL AMINO; SWP:P22346; PDB:1LNSA; MRFNHFSIVDKNFDEQLAELDQLGFRWSVFWDEKKILKDFLIQSPSDMTALQATAELDVI --------------------1111---1111---------1111--3333---------- EFLKSSIELDWEIFWNIALQLLDFVPNFDFEIGKAFEYAKNSNLPQIEAEMTTENIISAF -------------------1111-2222--2222-----1111----------------- YYLLCTRRKTGMILVEHWVSEGLLPLDNHYHFFNDKSLATFDSSLLEREVLWVESPVDSE -------1111-------1111----------%%%%-----1111------------111 QRGENDLIKIQIIRPKSTEKLPVVMTASPYHLGINDKANDLALHDMNVELEEKTSHEIHV 1--------------------------3333--------1111----------------- EQKLPQKLSAKAKELPIVDKAPYRFTHGWTYSLNDYFLTRGFASIYVAGVGTRSSDGFQT ------------------------------3333--1111--------2222-------- SGDYQQIYSMTAVIDWLNGRARAYTSRKKTHEIKASWANGKVAMTGKSYLGTMAYGAATT ---------------1111------1111-----1111---------------------- GVEGLELILAEAGISSWYNYYRENGLVRSPGGFPGEDLDVLAALTYSRNLDGADFLKGNA ---------------3333----------2222------------3333----------- EYEKRLAEMTAALDRKSGDYNQFWHDRNYLINTDKVKADVLIVHGLQDWNVTPEQAYNFW --------------------3333---33331111------------------------- KALPEGHAKHAFLHRGAHIYMNSWQSIDFSETINAYFVAKLLDRDLNLNLPPVILQENSK ---2222----------------------------------------------------- 
DQVWTMMNDFGANTQIKLPLGKTAVSFAQFDNNYDDETFKKYSKDFNVFKKDLFENKANE ----------------------------------3333--------------1111---- AVIDLELPSMLTINGPVELELRLKLNDTKGFLSAQILDFGQKKRLEDKVRVKDFKVLDRG ------------------------------------------------------------ RNFMLDDLVELPLVESPYQLVTKGFTNLQNQSLLTVSDLKADEWFTIKFELQPTIYHLEK --------------------------1111-1111-----------------------11 ADKLRVILYSTDFEHTVRDNRKVTYEIDLSQSKLIIPIESVKN 11-------------------------3333------------ >MULTIDRUG RESISTANCE OPER; SWP:P52003; PDB:1LNWA; NYPVNPDLPALAVFQHVRTRIQSELDCQRLDLTPPDVHVLKLIDEQRGLNLQDLGRQCRD ----1111-----------------1111----------------2222----------- KALITRKIRELEGRNLVRRERNPSDQRSFQLFLTDEGLAIHQHAEAISRVHDELFAPLTP -----------1111---------------------------------------3333-- VEQATLVHLLDQCLAAQ -------------1111 >SPO0B-ASSOCIATED GTP-BIND; SWP:P20964; PDB:1LNZA; FVDQVKVYVKGGDGGNGVAFRREKYVPKGGPAGGDGGKGGDVVFEVDEGLRTLDFRYKKH ----------------------------------------------3333---------- FKAIRGEHGSKNQHGRNADDVIKVPPGTVVTDDDTKQVIADLTEHGQRAVIARGGRGGRG ---------%%%%-----------2222---------------2222------------3 NSRFATPANPAPQLSENGEPGKERYIVLELKVLADVGLVGFPSVGKSTLLSVVSSAKPKI 333--1111-----------------------------------------1111------ ADYHFTTLVPNLGVETDDGRSFVADLPGLIEGAHQGVGLGHQFLRHIERTRVIVHVIDSG -----------------------------3333--%%%%3333----------------- LEGRDPYDDYLTINQELSEYNLRLTERPQIIVANKDPEAAENLEAFKEKLTDDYPVFPIS ----------------3333--1111---------------------------------- AVTREGLRELLFEVANQLENTPEFPLYDEEEL ----1111--------1111------------ >IF KAPPA LIGHT CHAIN; SWP:NA; PDB:1LO0Y; EVKLVESGGGLVKPGGSLKLSCAASGFSFRNYGMSWVRQTPEKRLEWVASISYGGLIYYP ------------2222-----------1111--------1111----------------3 DSIKGRFTISRDIAQNILYLQMSSLRSEDTAMYHCIRGDSFLV 333---------1111---------3333-------------- >STEROID HORMONE RECEPTOR ; SWP:O95718; PDB:1LO1A; AIPKRLCLVCGDIASGYHYGVASCEACKAFFKRTIQGNIEYSCPATNECEITKRRRKSCQ -----------------%%%%-------------1111-------------33333333- ACRFMKALKVGMLKEGVRLDRVRGGRQKYK ------------3333-3333--------- 
>IF KAPPA LIGHT CHAIN; SWP:NA; PDB:1LO4H; ELKLVETGGDLVKPGGSLTLSCEASGFTLRTYGMSWVRQTPQMRLEWVASISYGGLLYFS ------------2222-----------------------1111--------1111----1 DSVKGRFTISRDIVRNILTLQMSRLRSEDTAIYYCARGTSFVRYFDVWGAGTTVTVSSAK 111----------------------3333------------------------------- TTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%------------------ TLSSSVTVTSSTWPSQSITCNVAHPASSTQVDKKIVPRAA --------1111------------1111------------ >KALLIKREIN 6; SWP:Q92876; PDB:1LO6A; LVHGGPCDKTSHPYQAALYTSGHLLCGGVLIHPLWVLTAAHCKKPNLQVFLGKHNLRQRE -------11111111----------------1111---1111------------1111-1 SSQEQSSVVRAVIHPDYDAASHDQDIMLLRLARPAKLSELIQPLPLERDCSANTTSCHIL 111----------1111--------------------1111-------1111-------- GWGKTADGDFPDTIQCAYIHLVSREECEHAYPGQITQNMLCAGDEKYGKDSCQGDSGGPL ----1111----------------------2222-1111------------2222----- VCGDHLRGLVSWGNIPCGSKEKPGVYTNVCRYTNWIQKTIQ ---------------------------3333---------- >4-HYDROXYBENZOYL-COA THIO; SWP:P56653; PDB:1LO7A; ARSITMQQRIEFGDCDPAGIVWYPNYHRWLDAASRNYFIKCGLPPWRQTVVERGIVGTPI ----------3333-1111--3333-------------------3333------------ VSCNASFVCTASYDDVLTIETCIKEWRRKSFVQRHSVSRTTPGGDVQLVMRADEIRVFAM -----------2222-------------------------1111---------------- NDGERLRAIEVPADYIELCS -!!!!--------------- >LEGUME ISOLECTIN I (BETA ; SWP:P04122; PDB:1LOEA; TETTSFSITKFGPDQQNLIFQGDGYTTKERLTLTKAVRNTVGRALYSSPIHIWDSKTGNV --------------1111------------------------------------------ ANFVTSFTFVIDAPNSYNVADGFTFFIAPVDTKPQTGGGYLGVFNSKDYDKTSQTVAVEF ----------------------------1111----!!!!---------3333------- DTFYNTAWDPSNGDRHIGIDVNSIKSINTKSWALQNGKEANVVIAFNAATNVLTVSLTYP ----3333-3333---------------------2222---------------------- >Mannose/glucose-specific ; SWP:P12306; PDB:1LOEB; TSYTLNEVVPLKEFVPEWVRIGFSATTGAEFAAHEVLSWYFHSELA ---------3333--------------------------------- >CYCLOPHILIN A; SWP:P23869; PDB:1LOPA; MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMK 
---------------------------------1111-------2222------------ QKATKEPIKNEANNGLKNTRGTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGY ------------------2222----------------------3333-----1111--- CVFAEVVDGMDEVDKIKGVATGRSGMHQDVPKEDVIIESVTVSE --------------3333-----!!!!----------------- >AFFIBODY BINDING PROTEIN ; SWP:P38507; PDB:1LP1A; KFNKELSVAGREIVTLPNLNDPQKKAFIFSLWDDPSQSANLLAEAKKLNDAQAPK ---------------1111--------------3333------------1111-- >Immunoglobulin G-binding ; SWP:P38507; PDB:1LP1B; KFNKEQQNAFYEILHLPNLNEEQRNAFIQSLKDDPSQSANLLAEAKKLNDAQAP ---------------1111--------------3333------------1111- >AAV-2 CAPSID PROTEIN; SWP:P03135; PDB:1LP3A; GADGVGNSSGNWHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYS -----------------------------------%%%%-----------3333------ TPWGYFDFNRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNL ---------1111---------1111---------------------------------- TSTVQVFTDSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLE --------1111---------------------------------!!!!-3333---111 YFPSQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTT 1-------------------------------------3333------------------ QSRLQFSQAGASDIRDQSRNWLPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSL ---------33331111----------------------------1111----iiii--- VNPGPAMASHKDDEEKFFPQSGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQY -------------3333--------------------1111----3333-----3333-- GSVSTNLQRGNRQAATADVNTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGG ------------------------2222---------------------------1111- FGLKHPPPQILIKNTPVPANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNP ------------------------------------------------------------ EIQYTSNYNKSVNVDFTVDTNGVYSEPRPIGTRYLTRNL ------------------1111----------------- >BETA-2-MICROGLOBULIN; SWP:NA; PDB:1LP9E; MDSVTQTEGLVTLTEGLPVMLNCTYQSTYSPFLFWYVQHLNEAPKLLLKSFTDNKRPEHQ -------------2222------------------------------------------- GFHATLHKSSSSFHLQKSSAQLSDSALYYCALFLASSSFSKLVFGQGTSLSVVPNIQNPE -------1111---------3333------------------------------------ 
PAVYQLKDPRSQDSTLCLFTDFDSQINVPKTMESGTFITDKTVLDMKAMDSKSNGAIAWS -------1111-----------1111---------------------------------- NQTSFTCQDIFKET -----3333----- >LIPASE; SWP:P02703; PDB:1LPBA; GIIINLDEGELCLNSAQCKSNCCQHDTILSLSRCALKARENSECSAFTLYGVYYKCPCER ------2222---3333---------------------2222----------------22 GLTCEGDKSLVGSITNTNFGICHNV 22----------------------- >Pancreatic triacylglycero; SWP:P16233; PDB:1LPBB; KEVCYERLGCFSDDSPWSGITERPLHILPWSPKDVNTRFLLYTNENPNNFQEVAADSSSI -------------------3333-------3333--------3333---------3333- SGSNFKTNRKTRFIIHGFIDKGEENWLANVCKNLFKVESVNCICVDWKGGSRTGYTQASQ ----------------22221111---------1111---------3333---------- NIRIVGAEVAYFVEFLQSAFGYSPSNVHVIGHSLGAHAAGEAGRRTNGTIGRITGLDPAE -------------------------------------------1111------------- PCFQGTPELVRLDPSDAKFVDVIHTDGAPIVPNLGFGMSQVVGHLDFFPNGGVEMPGCKK --22223333--3333-------------------------------2222---2222-- NILSQIVDIDGIWEGTRDFAACNHLRSYKYYTDSIVNPDGFAGFPCASYNVFTANKCFPC 3333---1111----------3333-------33331111-----------1111----- PSGGCPQMGHYADRYPGKTNDVGQKFYLDTGDASNFARWRYKVSVTLSGKKVTGHILVSL 2222----1111----1111---------------------------------------- FGNKGNSKQYEIFKGTLKPDSTHSNEFDSDVDVGDLQMVKFIWYNNVINPTLPRVGASKI -1111------------2222--------------------------------------- IVETNVGKQFNFCSPETVREEVLLTLTPC ---1111---------------------- >DIHYDROLIPOAMIDE DEHYDROG; SWP:P14218; PDB:1LPFA; SQKFDVVVIGAGPGGYVAAIRAAQLGLKTACIEKYIGKEGKVALGGTCLNVGCIPSKALL -----------3333-------------------------------3333---------- DSSYKYHEAKEAFKVHGIEAKGVTIDVPAMVARKANIVKNLTGGIATLFKANGVTSFEGH ------------3333---------------------------------1111------- GKLLANKQVEVTGLDGKTQVLEAENVIIASGSRPVEIPPAPLSDDIIVDSTGALEFQAVP ----%%%%----------------------------3333--!!!!--3333-------- KKLGVIGAGVIGLELGSVWARLGAEVTVLEALDKFLPAADEQIAKEALKVLTKQGLNIRL -------------------1111------------11113333--------1111----- GARVTASEVKKKQVTVTFTDANGEQKETFDKLIVAVGRRPVTTDLLAADSGVTLDERGFI ---------%%%%-----------------------------------------1111-- 
YVDDHCKTSVPGVFAIGDVVRGAMLAHKASEEGVMVAERIAGHKAQMNYDLIPSVIYTHP --1111---2222---1111----3333-------------------1111--------- EIAWVGKTEQTLKAEGVEVNVGTFPFAASGRAMAANDTTGLVKVIADAKTDRVLGVHVIG ------------------------3333-------------------------------2 PSAAELVQQGAIGMEFGTSAEDLGMMVFSHPTLSEALHEAALAVNGHAIHIA 222----------1111--------------3333------3333------- >RETINOL-BINDING PROTEIN I; SWP:Q96R05; PDB:1LPJA; PADLSGTWTLLSSDNFEGYMLALGIDFATRKIAKLLKPQKVIEQNGDSFTIHTNSSLRNY -------------------------------3333---------!!!!------------ FVKFKVGEEFDEDNRGLDNRKCKSLVIWDNDRLTCIQKGEKKNRGWTHWIEGDKLHLEMF ----2222---------------------------------------------------- CEGQVCKQTFQRA ------------- >DOUBLESEX PROTEIN; SWP:P23023; PDB:1LPVA; SISPRTPPNCARCRNHGLKITLKGHKRYCKFRYCTCEKCRLTADRQRVMALQ ----------1111-----------1111----------------------- >LYSOZYME; SWP:P00720; PDB:1LPYA; NIFELRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITKDE 3333--1111-------1111------------------------------iiii----- AEKLFNQDVDAAVRGILRNAKKPYDSDAVRRAANFQGETRAGFTNSRQQKRWDEAAVNAK ------------------------------------33331111---------------- SRWYNQTPNRAKRVITTFRTGTWDAYK ----------------------3333- >STAGE 0 SPORULATION PROTE; SWP:P06534; PDB:1LQ1A; KKNLDASITSIIHEIGVPAHIKGYLYLREAISMVYNDIELLGSITKVLYPDIAKKFNTTA -----------------1111---------------3333-------------1111--- SRVERAIRHAIEVAWSRGNIDSISSLFAKPTNSEFIAMVADKLRLEH ------------------3333------------------------- >ALPHA3W; SWP:NA; PDB:1LQ7A; GSRVKALEEKVKALEEKVKALGGGGRIEELKKKWEELKKKIEELGGGGEVKKVEEEVKKL ----------------------------3333----------------3333-------- EEEIKKL ------- ----------------------------- >ACTVA-ORF6 MONOOXYGENASE; SWP:Q53908; PDB:1LQ9A; AEVNDPRVGFVAVVTFPVDGPATQHKLVELATGGVQEWIREVPGFLSATYHASTDGTAVV -1111----------------------------111133332222-------1111---- NYAQWESEQAYRVNFGADPRSAELREALSSLPGLMGPPKAVFMTPRGAILPS --------------1111------------2222------------------ >TAS PROTEIN; SWP:Q46933; PDB:1LQAA; MQYHRIPHSSLEVSTLGLGTMTFGEQNSEADAHAQLDYAVAQGINLIDVAEMYPVPPRPE 
-------------------1111----------------1111------1111----333 TQGLTETYVGNWLAKHGSREKLIIASKVSGPSRNNDKGIRPDQALDRKNIREALHDSLKR 3----------------3333-----------%%%%------------------------ LQTDYLDLYQVHWPQRPTNCFGKLGYSWTDSAPAVSLLDTLDALAEYQRAGKIRYIGVSN -------------------iiii------------------------------------- ETAFGVMRYLHLADKHDLPRIVTIQNPYSLLNRSFEVGLAEVSQYEGVELLAYSCLGFGT ----------------------------1111---------------------1111-11 LTGKYLNGAKPAGARNTLFSRFTRYSGEQTQKAVAAYVDIARRHGLDPAQMALAFVRRQP 111111----22223333----1111---------------1111-------------11 FVASTLLGATTMDQLKTNIESLHLELSEDVLAEIEAVHQVYTYPAP 11-----------------3333----------------------- >OSMOTICAL INDUCIBLE PROTE; SWP:P75170; PDB:1LQLA; MDKKYDITAVLNEDSSMTAISDQFQITLDARPKHTAKGFGPLAALLSGLAACELATANLM ------------%%%%----!!!!-------3333------------------------- APAKMITINKLLMNVTGSRSTNPTDGYFGLREINLHWEIHSPNSETEIKEFIDFVSKRCP 3333-------------------------------------------------------- AHNTLQGVSQLKINVNVTLVH ---3333-------------- >Viral interleukin-10 homo; SWP:P17150; PDB:1LQSL; TKPQCRPEDYATRLQDLRVTFHRVKPTLQREDDYSVWLDGTVVKGCWGCSVMDWLLRRYL -1111-1111---------33333333-----------------1111------------ EIVFPAGDHVYPGLKTELHSMRSTLESIYKDMRQCPLLGCGDKSVISRLSQEAERKSDNG ----3333--3333--------------------3333--3333------3333---%%% TRKGLSELDTLFSRLEEYLHSR %----------------1111- >FPRA; SWP:O05783; PDB:1LQTA; RPYYIAIVGSGPSAFFAAASLLKAADTTEDLDMAVDMLEMLPTPWGLVRSGVAPDHPKIK ------------------------1111-----------------3333---11113333 SISKQFEKTAEDPRFRFFGNVVVGEHVQPGELSERYDAVIYAVGAQSDRMLNIPGEDLPG -----------1111------2222--33333333-----------------2222-222 SIAAVDFVGWYNAHPHFEQVSPDLSGARAVVIGNGNVALDVARILLTDPDVLARTDIADH 2-3333-------1111-----------------3333---------33331111----- ALESLRPRGIQEVVIVGRRGPLQAAFTTLELRELADLDGVDVVIDPAELDGITDEDAAAV ----3333-----------3333--------3333-2222----33332222-------- GKVCKQNIKVLRGYADRERPGHRRMVFRFLTSPIEIKGKRKVERIVLGRNELVSDGSGRV -------------1111------------------------------------------- 
AAKDTGEREELPAQLVVRSVGYRGVPTPGLPFDDQSGTIPNVGGRINGSPNEYVVGWIKR --------------------------2222---1111----iiii2222------3333- GPTGVIGTNKKDAQDTVDTLIKNLGNAKEGAECKSFDHADQVADWLAARQPKLVTSAHWQ ----3333-------------------1111-----3333-------------------- VIDAFERAAGEPHGRPRVKLASLAELLRIGLG ---------3333------------------- >PEPTIDE DEFORMYLASE 2; SWP:O31410; PDB:1LQYA; MITMKDIIKEGHPTLRKVAEPVPLPPSEEDKRILQSLLDYVKMSQDPELAAKYGLRPGIG --1111-----3333--------------------------------------------- LAAPQINVSKRMIAVHVTDENGTLYSYALFNPKIVSHSVQQCYLTTGEGCLSVDRDVPGY -3333-------------1111---------------------1111--1111------- VLRYARITVTGTTLDGEEVTLRLKGLPAIVFQHEIDHLNGIMFYDRINPADPFQVPDGAI ------------1111--------------------1111-1111-----1111-2222- PIGR ---- >TOLA PROTEIN; SWP:P50600; PDB:1LR0A; ALAELLSDTTERQQALADEVGSEVTGSLDDLIVNLVSQQWRRPPSARNGSVEVLIELPDG ------------------------------------------1111----------1111 TITNASVSRSSGDKPFDSSAVAAVRNVGRIPEQQLPRATFDSLYRQRRIIFKPEDLSLHH ------------------------------------------------------------ HHH --- >DNA-BINDING PROTEIN H-NS; SWP:P08936; PDB:1LR1A; SEALKILNNIRTLRAQARESTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKL 33331111--------------3333------------------------------- >AUXIN BINDING PROTEIN 1; SWP:P13689; PDB:1LR5A; SCVRDNSLVRDISQMPQSSYGIEGLSHITVAGALNHGMKEVEVWLQTISPGQRTPIHRHS ----------3333----iiii----------3333------------2222-------- CEEVFTVLKGKGTLLMGSSSLKYPGQPQEIPFFQNTTFSIPVNDPHQVWNSDEHEDLQVL ----------------------------------------2222---------------- VIISRPPAKIFLYDDWSMPHTAAVLKFPFVWDEDCFEAAK --------------11113333------1111-------- >FOLLISTATIN; SWP:P21674; PDB:1LR7A; ETCENVDCGPGKKCRMNKKNKPRCVCAPDCSNITWKGPVCGLDGKTYRNECALLKARCKE --2222--2222----1111--------3333--------1111---------------- QPELEVQYQGKCK 1111--------- >BETA-ELICITIN CRYPTOGEIN; SWP:P15570; PDB:1LRIA; TACTASQQTAAYKTLVSILSDASFNQCSTDSGYSMLTAKALPTTAQYKLMCASTACNTMI ---------------3333--------------1111----------------------- KKIVTLNPPNCDLTVPTSGLVLNVYSYANGFSNKCSSL 
---1111---------------------------1111 >LEUCINE-RICH REPEAT VARIA; SWP:Q44534; PDB:1LRV; TPIGDCRVCSFRMSLLLTGRCTPGDACVAVESGRQIDRFFRNNPHLAVQYLADPFWERRA ----33331111---------2222------3333-------33331111---------- IAVRYSPVEALTPLIRDSDEVVRRAVAYRLPREQLSALMFDEDREVRITVADRLPLEQLE 3333--33333333-------------------3333-------------------3333 QMAADRDYLVRAYVVQRIPPGRLFRFMRDEDRQVRKLVAKRLPEESLGLMTQDPEPEVRR -1111--------3333-3333-1111---------------3333-1111--------- IVASRLRGDDLLELLHDPDWTVRLAAVEHASLEALRELDEPDPEVRLAIAGRL -1111-!!!!3333-----------3333------------------------ >METHANOL DEHYDROGENASE SU; SWP:P12293; PDB:1LRWA; NDQLVELAKDPANWVMTGRDYNAQNYSEMTDINKENVKQLRPAWSFSTGVLHGHEGTPLV -----33333333------1111---------33331111-------------------- VGDRMFIHTPFPNTTFALDLNEPGKILWQNKPKQNPTARTVACCDVVNRGLAYWPGDDQV !!!!--------------3333---------------3333------------------- KPLIFRTQLDGHIVAMDAETGETRWIMENSDIKVGSTLTIAPYVIKDLVLVGSSGAELGV -------1111-------------------3333--------------------3333-- RGYVTAYDVKSGEMRWRAFATGPDEELLLAEDFNAPNPHYGQKNLGLETWEGDAWKIGGG ----------------------1111-------3333---------1111!!!!------ TNWGWYAYDPEVDLFYYGSGNPAPWNETMRPGDNKWTMAIWGREATTGEAKFAYQKTPHD ---------1111------------3333------------------------------- EWDYAGVNVMMLSEQEDKQGQMRKLLTHPDRNGIVYTLDRTNGDLISADKMDDTVNWVKE ----------------1111---------1111------------------3333----- VQLDTGLPVRDPEFGTRMDHKARDICPSAMGYHNQGHDSYDPERKVFMLGINHICMDWEP ----------3333-------------1111----------------------------- FMLPYRAGQFFVGATLTMYPGPKGDRGNASGLGQIKAYDAISGEMKWEKMERFSVWGGTM -----2222-----------11113333-------------------------------- ATAGGLTFYATLDGFIKARDSDTGDLLWKFKLPSGVIGHPMTYKHDGRQYVAIMYGVGGW --------------------------------------------%%%%----------33 PGVGLVFDLADPTAGLGSVGAFKRLQEFTQMGGGVMVFSLDGESPYSDPNVGEYAPGEPT 33--------1111iiii3333-3333------------%%%%11113333--------- >Methanol dehydrogenase su; SWP:P29898; PDB:1LRWB; YDGTNCKAPGNCWEPKPDYPAKVEGSKYDPQHDPAELSKQGESLAVMDARNEWRVWNMKK 
-------2222----2222-------------3333------------------------ TGKFEYDVKKIDGYDETKAPPAE ------333322221111----- >factor essential for expr; SWP:P14304; PDB:1LRZA; MKFTNLTAKEFGAFTDSMPYSHFTQTVGHYELKLAEGYETHLVGIKNNNNEVIAACLLTA --------------1111---1111---------------------1111---------- VPVMKVFKYFYSNRGPVIDYENQELVHFFFNELSKYVKKHRCLYLHIDPYLPYQYLNHDG -----------%%%%---3333--------------1111-------------------- EITGNAGNDWFFDKMSNLGFEHTGFHKGFDPVLQIRYHSVLDLKDKTADDIIKNMDGLRK -------3333----1111-----------------------2222-----1111----- RNTKKVKKNGVKVRFLSEEELPIFRSFMDDKFYYNRLKYYKDRVLVPLAYINFDEYIKEL ----------------3333---3333-3333-------!!!!----------------- NEERDILNKDLNKALKDIEKRPENKKAHNKRDNLQQQLDANEQKIEEGKRLQEEHGNELP --------------------1111------------------------------------ ISAGFFFINPFEVVYYAGGTSNAFRHFAGSYAVQWEMINYALNHGIDRYNFYGVSGKFTE --------3333--------33331111------------------------------11 DAEDAGVVKFKKGYNAEIIEYVGDFIKPINKPVYAAYTAL 11---------1111------------------------- >SIGNAL RECOGNITION PARTIC; SWP:O07347; PDB:1LS1A; MFQQLSARLQEAIGRLRGRGRITEEDLKATLREIRRALMDADVNLEVARDFVERVREEAL -------------1111---------------------1111------------------ GKQVLESLTPAEVILATVYEALKEALGGEARLPVLKDRNLWFLVGLQGSGKTTTAAKLAL --1111--3333---------------------------------2222----------- YYKGKGRRPLLVAADTQRPAAREQLRLLGEKVGVPVLEVMDGESPESIRRRVEEKARLEA --1111-----------3333------------------2222----------------- RDLILVDTAGRLQIDEPLMGELARLKEVLGPDEVLLVLDAMTGQEALSVARAFDEKVGVT --------------------------------------33333333-------------- GLVLTKLDGDARGGAALSARHVTGKPIYFAGGLEPFYPERLAGRILGMG -----3333----------------------------------1111-- >PLASMEPSIN IV; SWP:Q8IM16; PDB:1LS5A; SENDSIELDDVANLMFYGEGQIGTNKQPFMFIFDTGSANLWVPSVNCDSIGCSTKHLYDA ----------%%%%--------1111-----------------1111-----------33 SASKSYEKDGTKVEISYGSGTVRGYFSKDVISLGDLSLPYKFIEVTDADDLEPIYSGSEF 33--------------1111------------!!!!-----------3333---1111-- DGILGLGWKDLSIGSIDPVVVELKKQNKIDNALFTFYLPVHDKHVGYLTIGGIESDFYEG 
-------1111------------1111--------------------------1111--- PLTYEKLNHDLYWQIDLDIHFGKYVMQKANAVVDSGTSTITAPTSFLNKFFRDMNVIKVP ---------------------------------1111-----33331111---------- FLPLYVTTCDNDDLPTLEFHSRNNKYTLEPEFYMDPLSDIDPALCMLYILPVDIDDNTFI -------1111---------3333----3333---------------------------- LGDPFMRKYFTVFDYEKESVGFAVAKNL -3333---------3333---------- >ARYL SULFOTRANSFERASE; SWP:P50225; PDB:1LS6A; SRPPLEYVKGVPLIKYFAEALGPLQSFQARPDDLLISTYPKSGTTWVSQILDMIYQGGDL -------iiii------------1111--1111-----2222-----------1111--3 EKCHRAPIFMRVPFLEFKAPGIPSGMETLKDTPAPRLLKTHLPLALLPQTLLDQKVKVVY 333---3333---1111-2222-----3333-----------3333-33331111----- VARNAKDVAVSYYHFYHMAKVHPEPGTWDSFLEKFMVGEVSYGSWYQHVQEWWELSRTHP ------------------1111-----------------22223333-------3333-- VLYLFYEDMKENPKREIQKILEFVGHSLPEETVDFMVQHTSFKEMKKNPMTNYTTVPQEF ----------------------------3333---------------1111-11113333 MDHSISPFMRKGMAGDWKTTFTVAQNERFDADYAEKMAGCSLSFRSEL -3333-----------1111---------------------------- >CYTOCHROME C6; SWP:P83391; PDB:1LS9A; VDAELLADGKKVFAGNCAACHLGGNNSVLADKTLKKDAIEKYLEGGLTLEAIKYQVNNGK -----------------1111iiii---3333----------2222-------------! 
GAMPAWADRLDEDDIEAVSNYVYDQAVNSKW !!!--1111----------------1111-- >LIPOVITELLIN (LV-1N, LV-1; SWP:Q91062; PDB:1LSHA; FQPGKVYRYSYDAFSISGLPEPGVNRAGLSGEMKIEIHGHTHNQATLKITQVNLKYFLGP -2222------------------------------------------------------- WPSDSFYPLTGGYDHFIQQLEVPVRFDYSAGRIGDIYAPPQVTDTAVNIVRGILNLFQLS ------------------1111------iiii------3333----------3333---- LKKNQQTFELQETGVEGICQTTYVVQEGYRTNEMAVVKTKDLNNCDHKVYKTMGTAYAER -------------1111-----------------------1111----------1111-- CPTCQKMNKNLRSTAVYNYAIFDEPSGYIIKSAHSEEIQQLSVFDIKEGNVVIESRQKLI -3333------------------3333-----------------1111------------ LEGIQSAPAASQAASLQNRGGLMYKFPSSAITKMSSLFVTKGKNLESEIHTVLKHLVENN --------------------------2222---------2222----------------- QLSVHEDAPAKFLRLTAFLRNVDAGVLQSIWHKLHQQKDYRRWILDAVPAMATSEALLFL ----1111----------1111-----------1111----------3333--------- KRTLASEQLTSAEATQIVASTLSNQQATRESLSYARELLNTSFIRNRPILRKTAVLGYGS ---1111--------------1111----------------3333--------------- LVFRYCANTVSCPDELLQPLHDLLSQSSDRAKEEEIVLALKALGNAGQPNSIKKIQRFLP -----1111---3333-------------------------------3333-3333---- GQGKSLDEYSTRVQAEAIMALRNIAKRDPRKVQEIVLPIFLNVAIKSELRIRSCIVFFES ---------------------3333----------------1111------------111 KPSVALVSMVAVRLRREPNLQVASFVYSQMRSLSRSSNPEFRDVAAACSVAIKMLGSKLD 1-----------3333----------------1111-3333----------1111---11 RLGCRYSKAVHVDTFNARTMAGVSADYFRINSPSGPLPRAVAAKIRGQGMGYASDIVEFG 111111-------------------------3333-------------iiii-------- LRAEGLQELLYDWKSVPEERPLASGYVKVHGQEVVFAELDKKMQEQIGAVVSKLEQGMDV ------1111------------------iiii-----------3333------------- LLTKGYVVSEVRYMQPVCIGIPMDLNLLVSGVTTNRANLSASFSSLPADMKLADLLATNI ----------------1111---------------------------------------- ELRVAATTSMSQHAVAIMGLTTDLAKAGMQTHYKTSAGLGVNGKIEMNARESNFKASLKP ---------------------1111-----------------------1111-------- FQQKTVVVLSTMESIVFVRDPSGSRILPVLPPKMTQKQIHDIMTARPVMRRKQSCSKSAA -------------------3333-----------------------------------11 LSSKVCFSARLRNAAFIRNALLYKITGDYVSKVYVQPTSSKAQITKVELELQAG 
11-----------3333------------------------------------- >Vitellogenin [Precursor]; SWP:Q91062; PDB:1LSHB; SKPKVVIVLRAVRADGKQQGLQTTLYYGLTSNGLPKAKIVAVELSDLSVWKLCAKFRLSA ------------3333----------------------------1111------------ HMKAKAAIGWGKNCQQYRAMLEASTGNLQSHPAARVDIKWGRLPSSLQRAKNALLENGAP -------------------------------------------3333------------- VIASKLEMEIMPKANQKHQVSVILAAMTPRRMNIIVKLPKVTYFQQGILLPFTF -------------------------------------1111------------- ------------------------------------------------------------ ------ >THROMBOSPONDIN 1; SWP:P07996; PDB:1LSLA; QDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPING ----------------------------------2222---------------------- GWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDC ----------------------------------------------------- >TRK SYSTEM POTASSIUM UPTA; SWP:Q58505; PDB:1LSSA; MYIIIAGIGRVGYTLAKSLSEKGHDIVLIDIDKDICKKASAEIDALVINGDCTKIKTLED --------3333-------1111---------------------------1111------ AGIEDADMYIAVTGKEEVNLMSSLLAKSYGINKTIARISEIEYKDVFERLGVDVVVSPEL -1111---------3333--------1111---------3333----1111--------- IAANYIEKLIER ------------ >LYSINE, ARGININE, ORNITHI; SWP:P02911; PDB:1LST; ALPQTVRIGTDTTYAPFSSKDAKGEFIGFDIDLGNEMCKRMQVKCTWVASDFDALIPSLK --------------------1111--------------------------1111------ AKKIDAIISSLSITDKRQQEIAFSDKLYAADSRLIAAKGSPIQPTLESLKGKHVGVLQGS -------------3333-------------------2222----33332222----2222 TQEAYANDNWRTKGVDVVAYANQDLIYSDLTAGRLDAALQDEVAASEGFLKQPAGKEYAF ----------1111----------------------------------11111111---- AGPSVKDKKYFGDGTGVGLRKDDTELKAAFDKALTELRQDGTYDKMAKKYFDFNVYGD ------3333---------1111--------------1111------------3333- >BETAINE-HOMOCYSTEINE METH; SWP:NA; PDB:1LT7A; KGILERLNAGEIVIGDGGFVFALEKRGYHPEAVRQLHREFLRAGSNVMQTFTVNEAAADI ---------------2222---------------------1111---------------- ARQVADEGDALVAGGVSQTPSYLSAKSETEVKKVFLQQLEVFMKKNVDFLIAEYFEHVEE ------------------------------------------------------------ 
AVWAVETLIASGKPVAATMAIGPEGDLHGVPPGEAAVRLVKAGASIIGVNCHFDPTISLK --------1111---------11111111------------------------------- TVKLMKEGLEAAQLKAHLMSQPLAYHTPDANKQGFIDLPEFPFGLEPRVATRWDIQKYAR ---------1111-------------111111111111------3333------------ EAYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPPRARARKEYWENLRIASGRPYNPS --1111------22223333---------------------3333---------1111-- MSKPD ----- >BETAINE-HOMOCYSTEINE METH; SWP:Q93088; PDB:1LT8A; KGILERLNAGEIVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLHREFLRAGSNV --3333-----------------1111---------3333-3333--------1111--- MQTFTFYASEDKGQEVNEAAADIARQVADEGDALVAGGVSQTPSYLSAKSETEVKKVFLQ --------------------------3333----------------------3333---- QLEVFMKKNVDFLIAEYFEHVEEAVWAVETLIASGKPVAATMAIGPEGDLHGVPPGEAAV --------------------------------------------11111111-------- RLVKAGASIIGVNCHFDPTISLKTVKLMKEGLEAAQLKAHLMSQPLAYHTPDANKQGFID --------------------------------1111-------------11111111111 LPEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPP 1------3333--------------1111------22223333----------------- ASEKHGSWGSGLDMHTKPWVRARARKEYWENLRIASGRPYNPSMSKPD ------2222-------3333---11111111-----1111------- >CORAL TREE LECTIN; SWP:P16404; PDB:1LTE; VETISFSFSEFEPGNDNLTLQGASLITQSGVLQLTKINQNGMPAWDSTGRTLYAKPVHIW -----------2222-----------1111-------1111------------------- DMTTGTVASFETRFSFSIEQPYTRPLPADGLVFFMGPTKSKPAQGYGYLGIFNQSKQDNS -1111---------------------------------------!!!!---------333 YQTLGVEFDTFSNPWDPPQVPHIGIDVNSIRSIKTQPFQLDNGQVANVVIKYDASSKLLH 3-----------1111------------------------2222---------1111--- AVLVYPSSGAIYTIAEIVDVKQVLPEWVDVGLSGATGAQRDAAETHDVYSWSFQASLPE -----1111---------3333----------------2222----------------- >PHOSPHOGLYCERATE KINASE; SWP:P27362; PDB:1LTKA; HHLGNKLSISDLKDIKNKKVLVRVDFNVPIENGIIKDTNRITATLPTINHLKKEGASKII --------------2222-------------------33331111--------------- LISHCGRPDGLRNEKYTLKPVAETLKGLLGEEVLFLNDCVGKEVEDKINAAKENSVILLE -------iiii-1111-3333--------------------------------------- 
NLRFHIEEEGKGVDANGNKVKANKEDVEKFQNDLTKLADVFINDAFGTAHRAHSSVGVKL 11113333-----1111----------------1111----------1111--------- NVKASGFLKKELEYFSKALENPQRPLLAILGGAKVSDKIQLIKNLLDKVDRIIGGGAYTF ----------------------------------1111---------------------- KKVLNNKIGTSLFDEAGSKIVGEIEKAKAKNVQIFLPVDFKIADNFDNNANTKFVTDEEG -------!!!!--3333---3333-------------------------------3333- IPDNWGLDAGPKSIENYKDVILTSKTVIWNGPQGVFEPNFAKGSIECLNLVVEVTKKGAI --------------------1111------------------------------------ TIVGGGDTASLVEQQNKKNEISHVSTGGGASLELLEGKELPGVLALSN -----3333-------3333-------------1111--3333----- >DNA REPLICATION INITIATOR; SWP:O27798; PDB:1LTLA; VDKSKTLTKFEEFFSLQDYKDRVFEAIEKYPNVRSIEVDYLDLEMFDPDLADLLIEKPDD -----------1111-1111------------------3333---------3333----- VIRAAQQAIRNIDRLRKNVDLNIRFSGISNVIPLRELRSKFIGKFVAVDGIVRKTDEIRP -------3333-1111----------------3333-3333------------------- RIVKAVFECRGCMRHHAVTQSTNMITEPSLCSECGGRSFRLLQDESEFLDTQTLKLQEPL -----------------------------------------3333--------------- ENLSGGEQPRQITVVLEDDLVDTLTPGDIVRVTGTLRTVRDERTKRFKNFIYGNYTEFL ----------------!!!!--------------------------------------- >HEAT-LABILE ENTEROTOXIN, ; SWP:P06717; PDB:1LTSA; RLYRADSRPPDEIKRSGGLMPRGHNEYFDRGTQMNINLYDHARGTQTGFVRYDDGYVSTS --------3333--------2222-1111-----------1111-2222---%%%%---- LSLRSAHLAGQSILSGYSTYYIYVIATAPNMFNVNDVLGVYSPHPYEQEVSALGGIPYSQ -------------2222---------------3333-!!!!--3333---------3333 IYGWYRVNFGVIDERLHRNREYRDRYYRNLNIAPAEDGYRLAGFPPDHQAWREEPWIHHA -------iiii-------1111-3333------33333333---11113333--3333-- PQGCG 2222- >Heat-labile enterotoxin A; SWP:P06717; PDB:1LTSC; GDTCNEETQNLSTIYLREYQSKVKRQIFSDYQSEVDIYNRI 3333------------------------1111---3333-- >PHENYLALANINE-4-HYDROXYLA; SWP:P30967; PDB:1LTZA; FVVPDITTRKNVGLSHDANDFTLPQPLDRYSAEDHATWATLYQRQCKLLPGRACDEFLEG ----------2222-----------3333-------------------2222-------- LERLEVDADRVPDFNKLNEKLMAATGWKIVAVPGLIPDDVFFEHLANRRFPVTWWLREPH -1111----------------------------------------------------111 
QLDYLQEPDVFHDLFGHVPLLINPVFADYLEAYGKGGVKAKALGALPMLARLYWYTVEFG 1----------------------------------------2222--------------- LINTPAGMRIYGAGILSSKSESIYCLDSASPNRVGFDLMRIMNTRYRIDTFQKTYFVIDS ---1111----3333--------1111-------------1111---------------3 FKQLFDADFAPLYLQLADAQPWGAGDIAPDDLVL 333------------1111---1111-1111--- >LECTIN; SWP:P05045; PDB:1LU1; ANIQSFSFKNFNSPSFILQGDATVSSGKLQLTKVKENGIPTPSSLGRAFYSSPIQIYDKS ------------1111--------iiii------1111--------------------11 TGAVASWATSFTVKISAPSKASFADGIAFALVPVGSEPRRNGGYLGVFDSDVYNNSAQTV 11------------------------------1111----!!!!---------3333--- AVEFDTLSNSGWDPSMKHIGIDVNSIKSIATVSWDLANGENAEILITYNAATSLLVASLV --------3333------------------------2222-------------------- HPSRRTSYILSERVDITNELPEYVSVGFSATTGLSEGYIETHDVLSWSFASKLPDDSTAE 3333----------3333----------------1111---------------------- PLDLASYLVRNVL ------------- >SOLUBLE SECRETED ANTIGEN ; SWP:Q10804; PDB:1LU4A; ADERLQFTATTLSGAPFDGASLQGKPAVLWFWTPWCPFCNAEAPSLSQVAAANPAVTFVG !!!!------1111---33332222-------1111----------------1111---- IATRADVGAMQSFVSKYNLNFTNLNDADGVIWARYNVPWQPAFVFYRADGTSTFVNNPTA -------------------------1111---1111----------1111---------- AMSQDELSGRVAAL -------------- >VENOM TOXIN PEPTIDE MTX4; SWP:Q7YT39; PDB:1LU8A; GCLEFWWKCNPNDDKCCRPKLKCSKLFKLCNFSF ---------1111--------------------- >METHYLENE TETRAHYDROMETHA; SWP:P55818; PDB:1LUAA; SKKLLFQFDTDATPSVFDVVVGYDGGADHITGYGNVTPDNVGAYVDGTIYTRGGKEKQST ----------------------1111----------3333------------!!!!1111 AIFVGGGDMAAGERVFEAVKKRFFGPFRVSCMLDSNGSNTTAAAGVALVVKAAGGSVKGK -------------------3333!!!!-----------------------------2222 KAVVLAGTGPVGMRSAALLAGEGAEVVLCGRKLDKAQAAADSVNKRFKVNVTAAETADDA ------------------------------------------------------------ SRAEAVKGAHFVFTAGAIGLELLPQAAWQNESSIEIVADYNAQPPLGIGGIDATDKGKEY ---3333----------------33331111--------------------3333----- GGKRAFGALGIGGLKLKLHRACIAKLFESSEGVFDAEEIYKLAKEMA -----------------------3333-------------------- >BACTERIAL LUCIFERASE; SWP:P07740; PDB:1LUCA; 
MKFGNFLLTYQPPELSQTEVMKRLVNLGKASEGCGFDTVWLLEHHFTEFGLLGNPYVAAA -----------1111---------------3333------------3333---------- HLLGATETLNVGTAAIVLPTAHPVRQAEDVNLLDQMSKGRFRFGICRGLYDKDFRVFGTD --1111----------3333----------------iiii---------3333------3 MDNSRALMDCWYDLMKEGFNEGYIAADNEHIKFPKIQLNPSAYTQGGAPVYVVAESASTT 333----------------------------------------2222------------- EWAAERGLPMILSWIINTHEKKAQLDLYNEVATEHGYDVTKIDHCLSYITSVDHDSNRAK ---1111-------------------------1111-1111------------------- DICRNFLGHWYDSYVNATKIFRIDYSYEINPVGTPEECIAIIQQDIDATGIDNICCGFEA ----------------------11111111------------------------------ NGSEEEIIASMKLFQSDVMPYLKEKQ -----------------3333----- >Alkanal monooxygenase bet; SWP:P07739; PDB:1LUCB; MKFGLFFLNFMNSKRSSDQVIEEMLDTAHYVDQLKFDTLAVYENHFSNNGVVGAPLTVAG ------------------------------1111-------------------------- FLLGMTKNAKVASLNHVITTHHPVRVAEEACLLDQMSEGRFAFGFSDCEKSADMRFFNRP -----------------1111-------------1111-----------3333-1111-3 TDSQFQLFSECHKIINDAFTTGYCHPNNDFYSFPKISVNPHAFTEGGPAQFVNATSKEVV 333----------------------------------------2222------------- EWAAKLGLPLVFRWDDSNAQRKEYAGLYHEVAQAHGVDVSQVRHKLTLLVNQNVDGEAAR ------------1111----------------1111--1111------------------ AEARVYLEEFVRESYSNTDFEQKMGELLSENAIGTYEESTQAARVAIECCGAADLLMSFE ----------------------------------------------------------33 SMEDKAQQRAVIDVVNANIV 33------------------ >MUSCLE-SPECIFIC TYROSINE ; SWP:Q62838; PDB:1LUFA; LNPKLLSLEYPRNNIEYVRDIGEGAFGRVFQARAPGLLPYEPFTMVAVKMLKEEASADMQ -----1111-1111---------3333----------1111----------1111----- ADFQREAALMAEFDNPNIVKLLGVCAVGKPMCLLFEYMAYGDLNEFLRSMSPPPLSCAEQ --------3333--1111-------------------1111------------------- LCIARQVAAGMAYLSERKFVHRDLATRNCLVGENMVVKIADFGLSRNIYSADYYKDAIPI --------------1111------3333----%%%%-------3333-3333------33 RWMPPESIFYNRYTTESDVWAYGVVLWEIFSYGLQPYYGMAHEEVIYYVRDGNILACPEN 33-----------3333-----------1111-------------------------222 CPLELYNLMRLCWSKLPADRPSFCSIHRILQRMCE 2---------1111-3333---------------- >CARBONIC ANHYDRASE 
II; SWP:P00918; PDB:1LUGA; SHHWGYGKHNGPEHWHKDFPIAKGERQSPVDIDTHTAKYDPSLKPLSVSYDQATSLRILN ------111133333333--1111--------3333---1111------1111------- NGHAFNVEFDDSQDKAVLKGGPLDGTYRLIQFHFHWGSLDGQGSEHTVDKKKYAAELHLV -------------------!!!!---------------1111-----iiii--------- HWNTKYGDFGKAVQQPDGLAVLGIFLKVGSAKPGLQKVVDVLDSIKTKGKSADFTNFDPR --3333-3333--------------------3333-----3333--2222-------333 GLLPESLDYWTYPGSLTTPPLLECVTWIVLKEPISVSSEQVLKFRKLNFNGEGEPEELMV 3-------------------------------------------------2222------ DNWRPAQPLKNRQIKASFK ------------------- >TYROSINE-PROTEIN KINASE I; SWP:Q03526; PDB:1LUIA; NNLETYEWYNKSISRDKAEKLLLDTGKEGAFMVRDSRTPGTYTVSVFTKAIISENPCIKH -------------3333---------------------------------3333------ YHIKETNDSPKRYYVAEKYVFDSIPLLIQYHQYNGGGLVTRLRYPVCG -----------------------------1111--------------- >Beta-catenin-interacting ; SWP:Q9NSA3; PDB:1LUJB; GAPAKSPEEMYIQQKVRVLLMLRKMGSNLTASEEEFLRTYAGVVSSQLSQLPQHSIDQAA ----------------------1111---------------------------------- EDVVMAFSRSE ----------- ------------------------------- >ALDOSE 1-EPIMERASE; SWP:Q966D4; PDB:1LURA; ASGFIEIANKQGLTATLLPFGATLAKLTFPDKNGKNQDLVLGFDTIDEFEKDAASIGKTV ------------------------------1111----------33331111--2222-- GRVANRIKNSTLHFDGKQYTTPNNGPHYLHGGPNGLGYRKWEVVRHAPESVSFSVRANEQ ------2222---iiii------!!!!-iiii--3333-------------------333 DDGLPGDAKIDVTYTVNDRNQLIIEHHATCDTPGLLALTNHAYWNLDGSDTVAEHFLEEA 3---------------1111-------------------------------1111----- DEFVEVDDTFCPTGAIRSVTDTGFDFRSGKQLKESGKDAEELLDLDNDLVITKKTPSTYL ------1111--------2222---1111-3333-------------------------- RFWSEKSGIELSITTSYPVIHLYASKFLDCKGKKGEHYKANKALAIEPQFHSAAPNFDHF -------------------------------2222---2222--------2222--1111 PDVSLRPGDHYCQEIVYTFSHVN -----2222-------------- >PROTEIN K3; SWP:P18378; PDB:1LUZA; FCYSLPNAGDVIKGRVYEKDYALYIYLFDYPHFEAILAESVKHDRYVEYRDKLVGKTVKV ------2222--------%%%%----1111------3333------------2222---- KVIRVDYTKGYIDVNYKRCRHQ ------------------1111 >HEPATOCYTE NUCLEAR FACTOR; 
SWP:Q14541; PDB:1LV2A; AAGSINTLAQAEVRSRQISVSSTDINVKKIASIGDVCESMKQQLLVLVEWAKYIPAFCEL ------------1111---------------------------------33333333--- PLDDQVALLRAHAGEHLLLGATKRSMMYKDILLLGNNYVIHRNSCEVEISRVANRVLDEL ------------------------1111-----1111--------3333----------- VRPFQEIQIDDNEYACLKAIVFFDPDAKGLSDPVKIKNMRFQVQIGLEDYINDRQYDSRG -----------------------1111----3333------------------------- RFGELLLLLPTLQSITWQMIEQIQFVKLFGMVKIDNLLQEMLLGG -------------------------------------3333---- >HYPOTHETICAL PROTEIN YACG; SWP:P36681; PDB:1LV3A; MSETITVNCPTCGKTVVWGEISPFRPFCSKRCQLIDLGEWAAEEKRIPSSGDLSESDDWS ---------------------------------3333----------3333-------33 EEPKQ 33--- >FTSH; SWP:P28691; PDB:1LV7A; MLTEDQIKTTFADVAGCDEAKEEVAELVEYLREPSRFKIPKGVLMVGPPGTGKTLLAKAI ---------3333-------------------3333-----------2222--------- AGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCIIFIDEIDAVGRQRGAGLG ---------------1111---------------3333--------3333-----iiii- GGHDEREQTLNQMLVEMDGFEGNEGIIVIAATNRPDVLDPALLRPGRFDRQVVVGLPDVR !!!!------------1111-------------1111-3333-2222------------- GREQILKVHMRRVPLAPDIDAAIIARGTPGFSGADLANLVNEAALFAARGNKRVVSMVEF ---------1111--1111--------2222----------------1111----3333- EKAKDKIMMGL ----------- >SELENOCYSTEINE-SPECIFIC E; SWP:Q46455; PDB:1LVAA; GSPEKILAQIIQEHREGLDWQEAATRASLSLEETRKLLQSAAAGQVTLLRVENDLYAIST -------------1111-----------------------1111------!!!!------ ERYQAWWQAVTRALEEFHSRYPLRPGLAREELRSRYFSRLPARVYQALLEEWSREGRLQL --------------------1111------------1111------------1111---- AANTVALAGFTPSFSETQKKLLKDLEDKYRVSRWQPPSFKEVAGSFNLDPSELEELLHYL ------1111---------------------!!!!--------1111------------- VREGVLVKINDEFYWHRQALGEAREVIKNLASTGPFGLAEARDALGSSRKYVLPLLEYLD ------------------------------1111-------------3333--------1 QVKFTRRVGDKRVVVGN 111-------------- >SYNTAXIN 6; SWP:Q63635; PDB:1LVFA; EDPFFVVKGEVQKAVNTAQGLFQRWTELLQGPSAATREEIDWTTNELRNNLRSIEWDLED ------------------------------1111-------------------------- LDETISIVEANPRKFNLDATELSIRKAFITSTRQIVRDMKDQMSAS 
-------33333333------------------------------- >GUANYLATE KINASE; SWP:Q64520; PDB:1LVGA; RPVVLSGPSGAGKSTLLKKLFQEHSSIFGFSVSHTTRNPRPGEEDGKDYYFVTREMMQRD -------2222----------------------------22222222------------- IAAGDFIEHAEFSGNLYGTSKEAVRAVQAMNRICVLDVDLQGVRSIKKTDLCPIYIFVQP -----------iiii----3333----1111--------------1111----------- PSLDVLEQRLRLRNTETEESLAKRLAAARTDMESSKEPGLFDLVIINDDLDKAYATLKQA --------------------------------33332222-------------------- LSEEIKKAQG ---------- >DIHYDROLIPOAMIDE DEHYDROG; SWP:P09063; PDB:1LVL; QQTIQTTLLIIGGGPGGYVAAIRAGQLGIPTVLVEGQALGGTCLNIGCIPSKALIHVAEQ -------------3333------------------------3333--------------- FHQASRFTEPSPLGISVASPRLDIGQSVAWKDGIVDRLTTGVAALLKKHGVKVVHGWAKV ----------1111--------3333---------------------------------- LDGKQVEVDGQRIQCEHLLLATGSSSVELPMLPLGGPVISSTEALAPKALPQHLVVVGGG ----------------------------1111-------3333----------------3 YIGLELGIAYRKLGAQVSVVEARERILPTYDSELTAPVAESLKKLGIALHLGHSVEGYEN 333-----------------------33333333-------------------------- GCLLANDGKGGQLRLEADRVLVAVGRRPRTKGFNLECLDLKMNGAAIAIDERCQTSMHNV ---------------------------------3333-----!!!!---1111------- WAIGDVAGEPMLAHRAMAQGEMVAEIIAGKARRFEPAAIAAVCFTDPEVVVVGKTPEQAS ---3333----3333--------------------------------------------- QQGLDCIVAQFPFAANGRAMSLESKSGFVRVVARRDNHLILGWQAVGVAVSELSTAFAQS -----------3333--3333----------------------------1111------- LEMGACLEDVAGTIHAHPTLGEAVQEAALRALGHALHI -----3333--------------------1111----- >OLIGOPEPTIDE SUBSTRATE FO; SWP:P04517; PDB:1LVMA; GELFKGPRDYNPISSTICHLTNESDGHTTSLYGIGFGPFIITNKHLFRRNNGTLLVQSLH 111------33331111------iiii--------!!!!---1111-----------111 GVFKVKNTTTLQQHLIDGRDMIIIRMPKDFPPFPQKLKFREPQREERICLVTTNFQTKSM 1------1111----2222-------1111------------2222-------------- SSMVSDTSCTFPSSDGIFWKHWIQTKDGQCGSPLVSTRDGFIVGIHSASNFTNTNNYFTS ------------!!!!---------2222--------------------1111------- VPKNFMELLTNQEAQQWVSGWRLNADSVLWGGHKVFMDKP ----------3333---------------iiii------- >REPLICASE, HYDROLASE DOMA; SWP:Q9IW06; 
PDB:1LVOA; SGLRKMAQPSGLVEPCIVRVSYGNNVLNGLWLGDEVICPRHVIASDTTRVINYENEMSSV ---------33331111----!!!!------!!!!-----1111------------1111 RLHNFSVSKNNVFLGVVSARYKGVNLVLKVNQVNPNTPEHKFKSIKAGESFNILACYEGC 3333----!!!!---------!!!!--------1111--------2222----------- PGSVYGVNMRSQGTIKGSFIAGTCGSVGYVLENGILYFVYMHHLELGNGSHVGSNFEGEM ---------1111------2222--------iiii----------1111----------2 YGGYEDQPSMQLEGTNVMSSDNVVAFLYAALINGERWFVTNTSMSLESYNTWAKTNSFTE 222--------------------------------1111-------------1111---- LSSTDAFSMLAAKTGQSVEKLLDSIVRLNKGFGGRTILSYGSLCDEFTPTEVIRQMYGV ---3333---------3333-------1111iiii-iiii------------3333--- >GLUCOSE-1-PHOSPHATE THYMI; SWP:NA; PDB:1LVWA; HMKGIVLAGGSGTRLYPITRAVSKQLLPIYDKPMIYYPLSVLMLAGIRDILIISTPRDLP ------------11111111--1111------3333------1111--------3333-- LYRDLLGDGSQFGVRFSYRVQEEPRGIADAFIVGKDFIGDSKVALVLGDNVFYGHRFSEI -----!!!!1111------------3333--11113333-------1111---------- LRRAASLEDGAVIFGYYVRDPRPFGVVEFDSEGRVISIEEKPSRPKSNYVVPGLYFYDNQ ---1111-------------1111-----1111--------------------------- VVEIARRIEPSDRGELEITSVNEEYLRMGKLRVELMGRGMAWLDTGTHDGLLEASSFIET ----1111--1111--3333-----1111-------2222-------------------- IQKRQGFYIACLEEIAYNNGWITREDVLEMAEKLEKTDYGKYLRDLAEGNFHG ----------------1111-------------1111---------------- >TRANSCRIPTIONAL REGULATOR; SWP:P44308; PDB:1LW7A; EKKVGVIFGKFYPVHTGHINIYEAFSKVDELHVIVCSDTVRDLKLFYDSKKRPTVQDRLR --------------3333------1111-------------------------------- WQQIFKYQKNQIFIHHLVEDGIPSYPNGWQSWSEAVKTLFHEKHFEPSIVFSSEPQDKAP ----3333--------------------------------1111---------3333--- YEKYLGLEVSLVDPDRTFFNVSATKIRTTPFQYWKFIPKEARPFFAKTVAILGGESSGKS ------------1111-----3333---33333333-33331111--------------- VLVNKLAAVFNTTSAWEYGREFVFEKLGGDEQAMQYSDYPQALGHQRYIDYAVRHSHKIA -------1111-----2222---------1111----3333------------------- FIDTDFITTQAFCIQYEGKAHPFLDSIKEYPFDVTILLKNNTEQKQRQQFQQLLKKLLDK ---------------------------------------------3333----------- YKVPYIEIESPSYLDRYNQVKAVIEKVLNEEEISELQN 
-------------------------3333--------- >PUTATIVE SECRETED PROTEIN; SWP:Q9Z4W2; PDB:1LWBA; APADKPQVLASFTQTSASSQNAWLAANRNQSAWAAYEFDWSTDLCTQAPDNPFGFPFNTA -1111----1111-------------------3333------!!!!----1111------ CARHDFGYRNYKAAGSFDANKSRIDSAFYEDMKRVCTGYTGEKNTACNSTAWTYYQAVKI -----------1111-3333---------------1111-------------------11 FG 11 >ISOCITRATE DEHYDROGENASE; SWP:P33198; PDB:1LWDA; ADQRIKVAKPVVEMDGDEMTRIIWQFIKEKLILPHVDVQLKYFDLGLPNRDQTNDQVTID -------------------------------3333------------------------- SALATQKYSVAVKCATITPDEARVEEFKLKKMWKSPNGTIRNILGGTVFREPIICKNIPR ------------------------1111--------------------------1111-- LVPGWTKPITIGRHAHGDQYKATDFVVDRAGTFKIVFTPKDGSSAKQWEVYNFPAGGVGM -3333-----------!!!!------------------1111------------------ GMYNTDESISGFAHSCFQYAIQKKWPLYMSTKNTILKAYDGRFKDIFQEIFEKHYKTDFD -------------------------------3333--3333------------------1 KYKIWYEHRLIDDMVAQVLKSSGGFVWACKNYDGDVQSDILAQGFGSLGLMTSVLVCPDG 111-------------------------------------------1111------1111 KTIEAEAAHGTVTRHYREHQKGRPTSTNPIASIFAWTRGLEHRGKLDGNQDLIRFAQTLE -----------3333---1111-------------------------------------- KVCVETVESGAMTKDLAGCIHGLSNVKLNEHFLNTSDFLDTIKSNLDRALGRQ ------1111----------------2222-------------------1111 >4-ALPHA-GLUCANOTRANSFERAS; SWP:P80099; PDB:1LWJA; MIGYQIYVRSFRDGNLDGVGDFRGLKNAVSYLKELGIDFVWLMPVFSSISFHGYDVVDFY ------3333----------------------1111------------------------ SFKAEYGSEREFKEMIEAFHDSGIKVVLDLPIHHTGFLHTWFQKALKGDPHYRDYYVWAN --3333-------------1111------------1111----------1111------- KETDLDERREWDGEKIWHPLEDGRFYRGLFGPFSPDLNYDNPQVFDEMKRLVLHLLDMGV ---1111-----------------------1111---------------------1111- DGFRFDAAKHMRDTIEQNVRFWKYFLSDLKGIFLAEIWAEARMVDEHGRIFGYMLNFDTS ----2222-------------------------------3333-3333------------ HCIKEAVWKENTRVLIESIERAVIAKDYLPVNFTSNHDMSRLASFEGGFSKEKIKLSISI -----------------------------------1111-3333---------------- LFTLPGVPLVFYGDELGMKGVYQKPNTEVVLDPFPWNESMCVEGQTFWKWPAYNGPFSGI 
----------2222-------------1111-----1111-------------------- SVEYQKRDPDSILSHTLGWTRFRKENQWIDRAKLEFLCKEDKFLVYRLYDDQHSLKVFHN 3333---1111-----------------1111-------1111----------------- LSGEEVVFEGVKMKPYKTEVV -------iiii---------- ------------------------------------------------------------ ------------------------------------ >FIBRINOGEN ALPHA-1 CHAIN; SWP:P02674; PDB:1LWUA; NELEVRYSEVLRELERRIIHLQRRINMQLQQLTLLQHNIKTQVSQILRVEVDIDVALRAC 1111---------3333-----------------------------------------33 KGSCARYLEYRLDKEKNLQLEKAASYIANLKFERFEEVV 33-------------------------3333---3333- >Fibrinogen beta chain [Fr; SWP:P02678; PDB:1LWUB; AQKEIENRYKEVKIRIESTVAGSLRSMKSVLEHLRAKMQRMEEAIKTQKELCSAPCTVNC 3333-------------------------------------------------------- RVPVVSGMHCEDIYRNGGRTSEAYYIQPDLFSEPYKVFCDMESHGGGWTVVQNRVDGSSN -------------1111-----------1111---------------------------- FARDWNTYKAEFGNIAFGNGKSICNIPGEYWLGTKTVHQLTKQHTQQVLFDMSDWEGSSV ------------------------------------------------------------ YAQYASFRPENEAQGYRLWVEDYSGNAGNALLEGATQLMGDNRTMTIHNGMQFSTFDRDN ----------3333--------------------1111!!!!-----2222--------- DNWNPGDPTKHCSREDAGGWWYNRCHAANPNGRYYWGGIYTKEQADYGTDDGVVWMNWKG ------33333333--------------------------3333----------3333-- SWYSMRQMAMKLRPK --------------- >Fibrinogen gamma chain [P; SWP:P04115; PDB:1LWUC; KTVQKILEEVRILEQIGVSHDAQIQELSEMWRVNQQFVTRLQQQLVDIRQTCSRPCQDTT 3333---------------------------------------------1111----333 ANKISPITGKDCQQVVDNGGKDSGLYYIKPLKAKQPFLVFCEIENGNGWTVIQHRHDGSV 3--------------1111----------------------------------------- NFTRDWVSYREGFGYLAPTLTTEFWLGNEKIHLLTGQQAYRLRIDLTDWENTHRYADYGH -----------------------------------------------1111--------- FKLTPESDEYRLFYSMYLDGDAGNAFDGFDFGDDPQDKFYTTHLGMLFSTPERDNDKYEG ----3333---------------3333--------3333---2222---1111------- SCAEQDGSGWWMNRCHAGHLNGKYYFGGNYRKTDVEFPYDDGIIWATWHDRWYSLKMTTM 3333-------------------------------------------------------- KLLPMGRDLSGHGGQQQ ----------------- >BONE MORPHOGENETIC PROTEI; 
SWP:P18075; PDB:1LXIA; QACKKHELYVSFRDLGWQDWIIAPEGYAAYYCEGECAFPLNSYMNATNHAIVQTLVHFIN ----------3333--------------------------3333---------------1 PETVPKPCCAPTQLNAISVLYFDDSSNVILKKYRNMVVRACGCH 111-------------------1111------------------ >HYPOTHETICAL 11.5KDA PROT; SWP:P35195; PDB:1LXJA; PKIFCLADVCVPIGTDSASISDFVALIEKKIRESPLKSTLHSAGTTIEGPWDDVGLIGEI ------------------------------3333------1111---------------- HEYGHEKGYVRVHTDIRVGTRTDKHQTAQDKIDVVLKKISQ ----1111--------------------------------- >HYPOTHETICAL PROTEIN MTH1; SWP:O27255; PDB:1LXNA; ITAELTVIPLGTCSTSLSSYVAAAVEALKKLNVRYEISGGTLLEAEDLDELEAVKAAHEA ---------------------------1111---------------3333---------- VLQAGSDRVYTTLKIDDRRDADRGLRDKVESVKEKI --------------------------------1111 >POLYNUCLEOTIDE KINASE; SWP:P06855; PDB:1LY1A; MKKIILTIGCPGSGKSTWAREFIAKNPGFYNINRDDYRQSIMAHEERDEYKYTKKKEGIV ---------2222--------------------------1111--3333---3333---- TGMQFDTAKSILYGGDSVKGVIISDTNLNPERRLAWETFAKEYGWKVEHKVFDVPWTELV --------------3333----------3333---------------------------- KRNSKRGTKAVPIDVLRSMYKSMREYLGLPVY ------1111---------------------- >COMPLEMENT RECEPTOR TYPE ; SWP:P20023; PDB:1LY2A; EASCGSPPPILNGRISYYSTPIAVGTVIRYSCSGTFRLIGEKSLLCITKDKVDGTWDKPA ---------2222-------------------1111------------------------ PKCEYFNKYSSCPEPIVPGGYKIRGSTPYRHGDSVTFACKTNFSMNGNKSVWCQANNMWG ------1111------2222---------2222------2222----------1111--- PTRLPTCVSI ---------- >n/a; SWP:P07339; PDB:1LYBA; GPIPEVLKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYN ------------------------------------------111133333333-----3 SDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQ 333--------------3333---------------- >Cathepsin D [Precursor]; SWP:P07339; PDB:1LYBB; GGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRISVNNVLPVFDNLMQQKLVDQNIF -----------------3333------------3333-%%%%-------1111------- SFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEG ---------------------3333------------------------1111------- CEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKL 
-----------------------------------------1111-------iiii---- SPEDYTLKVSQAGKTLCLSGFMGMDIPPPSGPLWILGDVFIGRYYTVFDRDNNRVGFAEA 3333-------------------------------------------------------- A - >CAP18; SWP:P25230; PDB:1LYP; GLRKRLRKFRNKIKEKLKKIGQKIQGLLPKLA -----------3333-------33333333-- >PCOC COPPER RESISTANCE PR; SWP:Q47454; PDB:1LYQA; AHPELKSSVPQADSAVAAPEKIQLNFSENLTVKFSGAKLTMTGMKSHSPMPVAAKVAPGA ----------2222----------------3333-------------------------- DPKSMVIIPREPLPAGTYRVDWRAVSSDTHPITGNYTFTVK 2222---------------------1111------------ >PROTEIN-TYROSINE PHOSPHAT; SWP:P15273; PDB:1LYVA; VSPYGPEARAELSSRLTTLRNTLAPATNDPRYLQACGGEKLNRFRDIQCRRQTAVRADLN -1111---------------1111----1111---iiii----1111--3333--1111- ANYIQVGNTRTIACQYPLQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQS -----!!!!--------3333--------------------------3333--------- GTYGSITVESKMTQQVGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVT --!!!!------------iiii----------2222-------------2222------- KALASLVDQTAETKRNMYESKGSSAVADDSKLRPVIHSRAGVGRTAQLIGAMCMNDSRNS ------------------1111-33331111--------------------33333333- QLSVEDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQGRPLLNS ---------------1111--3333--------1111------ >GLYCOSYLTRANSFERASE B; SWP:P16442; PDB:1LZJA; MVSLPRMVYPQPKVLTPCRKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTVFA ------------1111---------1111----2222----------1111--------- IKKYVAFLKLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVGERRFLS !!!!1111-----------2222---------1111------2222---------3333- EVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPSFYGSSREAFTYERRPQSQAYIPKDEGD ------------------3333--------1111---3333-----1111----1111-- FYYMGAFFGGSVQEVQRLTRACHQAMMVDQANGIEAVWHDESHLNKYLLRHKPTKVLSPE ---1111----------------------1111------------------------333 YLWDQQLLGWPAVLRKLRFTAVP 3--3333---3333--------- >HEROIN ESTERASE; SWP:O06441; PDB:1LZLA; TTFPTLDPELAAALTMLPKVDFADLPNARATYDALIGAMLADLSFDGVSLRELSAPGLDG --1111-3333---------3333-----------33331111-2222---------!!! 
DPEVKIRFVTPDNTAGPVPVLLWIHGGGFAIGTAESSDPFCVEVARELGFAVANVEYRLA !-------------------------iiii--3333------------------------ PETTFPGPVNDCYAALLYIHAHAEELGIDPSRIAVGGQSAGGGLAAGTVLKARDEGVVPV -----------------------1111-1111---------------------------- AFQFLEIPELDDRLETVSMTNFVDTPLWHRPNAILSWKYYLGESYSGPEDPDVSIYAAPS ---------------3333----------------------1111-1111---1111-11 RATDLTGLPPTYLSTMELDPLRDEGIEYALRLLQAGVSVELHSFPGTFHGSALVATAAVS 11-------------1111-------------1111-----------2222--3333--- ERGAAEALTAIRRGLRS ------------1111- >BETA-2-MICROGLOBULIN; SWP:P30460; PDB:1M05A; GSHSMRYFDTAMSRPGRGEPRFISVGYVDDTQFVRFDSDAAPWIEQEGPEYWDRNTQIFK -------------2222----------!!!!-----11111111---3333--------- TNTQTDRESLRNLRGYYNQSEAGSHTLQSMYGCDVGPDGRLLRGHNQYAYDGKDYIALNE -----------------------------------1111----------iiii-----11 DLRSWTAADTAAQITQRKWEAARVAEQDRAYLEGTCVEWLRRYLENGKDTLERADPPKTH 11------3333----------------------------------3333---------- VTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRTFQKWAAVV ------------------------------iiii-1111--------------------- VPSGEEQRYTCHVQHEGLPKPLTLRWEPS ----1111------1111----------- >Capsid protein; SWP:P08767; PDB:1M06F; REIVDLSHLAFDCGMLGRLKTVSWTPVIAGDSFELDAVGALRLSPLRRGLAIDSKVDFFT ---------------------------2222----------------------------- FYIPHRHVYGDQWIQFMRDGVNAQPLPSVTCNRYPDHAGYVGTIVPANNRIPKFLHQSYL ---3333-----------!!!!-----------33333333----1111--3333----- NIYNNYFRAPWMPERTEANPSNLNEDDARYGFRCCHLKNIWSAPLPPETKLAEEMGIESN --------1111------1111----------------1111---1111----------- SIDIMGLQAAYAQLHTEQERTYFMQRYRDVISSFGGSTSYDADNRPLLVMHTDFWASGYD -------------------------3333-3333----3333------------------ VDGTDQSSLGQFSGRVQQTFKHSVPRFFVPEHGVMMTLALIRFPPISPLEHHYLAGKSQL ----1111--------------------------------------1111-3333----- TYTDLAGDPALIGNLPPREISYRDLFRDGRSGIKIKVAESIWYRTHPDYVNFKYHDLHGF 3333---3333---------3333-11111111------3333-------3333------ PFLDDAPGTSTGDNLQEAILVRHQDYDACFQSQQLLQWNKQARYNVSVYRHMPTVRDSIM 
-----2222----3333----33333333------------------------------- TS -- >Major spike protein; SWP:P31281; PDB:1M06G; MYQNFVTKHDTAIQTSRFSVTGNVIPAAPTGNIPVINGGSITAERAVVNLYANMNVSTSS ---------------------------3333----------------------------- DGSFIVAMKVDTSPTDPNCVISAGVNLSFAGTSYPIVGIVRFESASEQPTSIAGSEVEHY ----------------------------------------------------3333---- PIEMSVGSGGVCSARDCATVDIHPRTSGNNVFVGVICSSAKWTSGRVIGTIATTQVIHEY -------1111--------------2222------------------------------- QVLQPLK ---1111 >RIBONUCLEASE; SWP:P11916; PDB:1M07A; NWATFQQKHIINTPIINCNTIMDNNIYIVGGQCKRVNTFIISSATTVKAICTGVINMNVL -----------------------3333-iiii----------------1111-------- STTRFQLNTCTRTSITPRPCPYSSRTETNYICVKCENQYPVHFAGIGRCP -----------------------------------iiii----------- >ENDODEOXYRIBONUCLEASE I; SWP:P00641; PDB:1M0DA; SGLEDKVSKQLESKGIKFEYEEWKVPYVIPASNHTYTPDFLLPNGIFVETKGLWESDDRK -----------1111--------------------------1111--------------- KHLLIREQHPELDIRIVFSSSRTKLYKGSPTSYGEFCEKHGIKFADKLIPAEWIKEPKKE --------1111-------1111--------------1111--------3333------- VPFDRLKRK -3333---- >METALLOTHIONEIN MT_NC; SWP:O73914; PDB:1M0GA; SCCPCCPSGCTKCASGCVCKGKTCDTSCCQ ---------3333---1111----1111-- >RIBOSE-5-PHOSPHATE ISOMER; SWP:P44725; PDB:1M0SA; MNQLEMKKLAAQAALQYVKADRIVGVGSGSTVNCFIEALGTIKDKIQGAVAASKESEELL --------------11112222-------------------3333--------------- RKQGIEVFNANDVSSLDIYVDGADEINPQKMMIKGGGAALTREKIVAALAKKFICIVDSS 1111----3333--------------1111----1111--------1111-------333 KQVDVLGSTFPLPVEVIPMARSQVGRKLAALGGSPEYREGVVTDNGNVILDVHNFSILNP 3---2222--------3333-----------------2222-1111-------------- VEIEKELNNVAGVVTNGIFALRGADVVIVGTPEGAKVID ---------2222-----------------1111----- >GLUTATHIONE SYNTHETASE; SWP:Q08220; PDB:1M0TA; PPSKDQLNELIQEVNQWAITNGLSMYPPKFEENPSNASVSPVTIYPTPIPRKCFDEAVQI ----------------------------3333---------------------------- QPVFNELYARITQDMAQPDSYLFTGKLWSLYLATLKSAQYKKQNFRLGIFRSDYLIDKKE -------------------------------1111------------------------- 
QIKQVEFNTVSVSFAGLSEKVDRLHSYLNRANKYDPKGPIYNDQNMVISDSGYLLSKALA -------------3333-----------------1111---3333--------------- KAVESYKSQQSSSTTSDPIVAFIVQRNERNVFDQKVLELNLLEKFGTKSVRLTFDDVNDK -----------------------------3333--------------------------- LFIDDKTGKLFIRDTEQEIAVVYYRTGYTTTDYTSEKDWEARLFLEKSFAIKAPDLLTQL ----------------------------1111--3333---------------------- SGSKKIQQLLTDEGVLGKYISDAEKKSSLLKTFVKIYPLDDTKLGREGKRLALSEPSKYV ----------------1111---------1111---------------------3333-- LKPQGNNVYKENIPNFLKGIEERHWDAYILMELIEPELNENNIILRDNKSYNEPIISELG --------!!!!----11113333---------------------iiii----------- IYGCVLFNDEQVLSNEFSGSLLRSKGCLDSIILY ---------------------------------- >GST2 GENE PRODUCT; SWP:P41043; PDB:1M0UA; KHSYTLFYFNVKALAEPLRYLFAYGNQEYEDVRVTRDEWPALKPTMPMGQMPVLEVDGKR ------------1111------------------3333---33332222------iiii- VHQSISMARFLAKTVGLCGATPWEDLQIDIVVDTINDFRLKIAVVSYEPEDEIKEKKLVT --------------------------------------------1111-3333------- LNAEVIPFYLEKLEQTVKDNDGHLALGKLTWADVYFAGITDYMNYMVKRDLLEPYPALRG ------------------------%%%%---------------------1111------- VVDAVNALEPIKAWIEKRPVTEV ----------------------- >GLUTATHIONE SYNTHETASE; SWP:Q08220; PDB:1M0WA; PSKDQLNELIQEVNQWAITNGLSMYPPKFEENPSNASVSPVTIYPTPIPRKCFDEAVQIQ -------------------------2222------------------------------- PVFNELYARITQDMAQPDSYLHKTTEALALSDSEFTGKLWSLYLATLKSAQYKKQNFRLG ---------------1111-------------------------1111------------ IFRSDYLIDKKKGTEQIKQVEFNTVSVSFAGLSEKVDRLHSYLNRANKYDPKGPIYNDQN ----------iiii--------------3333-----------1111--1111---3333 MVISDSGYLLSKALAKAVESYKSQQSDPIVAFIVQRNERNVFDQKVLELNLLEKFGTKSV ---------------------1111--------------3333----------------- RLTFDDVNDKLFIDDKTGKLFIRDTEQEIAVVYYRTGYTTTDYTSEKDWEARLFLEKSFA ----------------------1111------------3333--3333------------ IKAPDLLTQLSGSKKIQQLLTDEGVLGKYISDAEKKSSLLKTFVKIYPLDDTKLGREGKR ----------------------3333-------------1111----------------- LALSEPSKYVLKPQREGGGNNVYKENIPNFLKGIEERHWDAYILMELIEPELNENNIILR 
----3333--------------!!!!----11113333---------------------% DNKSYNEPIISELGIYGCVLFNDEQVLSNEFSGSLLRSKFNTSNEGGVAAGFGCLDSIIL %%%----------------------------------------1111------------- Y - >ARGININE KINASE; SWP:P51541; PDB:1M15A; VDQATLDKLEAGFKKLQEASDCKSLLKKHLTKDVFDSIKNKKTGMGATLLDVIQSGVENL ------------------1111-3333----------1111-1111-3333---333311 DSGVGIYAPDAESYRTFGPLFDPIIDDYHGGFKLTDKHPPKQWGDINTLVGLDPAGQFII 11-------3333-------------1111--1111--------1111----1111---- STRVRCGRSLQGYPFNPCLTAEQYKEMEEKVSSTLSSMEDELKGTYYPLTGMSKATQQQL ---------2222-3333----------------1111-1111-----2222-------- IDDHFLFKEGDRFLQTANACRYWPTGRGIFHNDAKTFLVWVNEEDHLRIISMQKGGDLKT 1111----------1111-22222222----1111------------------------- VYKRLVTAVDNIESKLPFSHDDRFGFLTFCPTNLGTTMRASVHIQLPKLAKDRKVLEDIA -----------------------------3333------------3333----------- SKFNLQVRGTRGEHTESEGGVYDISNKRRLGLTEYQAVREMQDGILEMIKMEKAAA --------1111-----iiii----------------------------------- >PHOSPHOENOLPYRUVATE PHOSP; SWP:P56839; PDB:1M1BA; VKKTTQLKQMLNSKDLEFIMEAHNGLSARIVQEAGFKGIWGSGLSVSAQLGVRDSNEASW ------------------------------------------------------------ TQVVEVLEFMSDASDVPILLDADTGYGNFNNARRLVRKLEDRGVAGACLEDKLFPKTNSL ----------1111-------!!!!--------------1111--------------111 HDGRAQPLADIEEFALKIKACKDSQTDPDFCIVARVEAFIAGWGLDEALKRAEAYRNAGA 1-------------------------1111------3333-------------------- DAILMHSKKADPSDIEAFMKAWNNQGPVVIVPTKYYKTPTDHFRDMGVSMVIWANHNLRA -------------------------------3333---3333------------------ SVSAIQQTTKQIYDDQSLVNVEDKIVSVKEIFRLQRDDELVQAEDKYLPKN -----------------1111-----3333--1111--------------- >MAJOR COAT PROTEIN; SWP:P32503; PDB:1M1CA; MLRFVTKNSQDKSSDLFSICSDRGTFVAHNRVRTDFKFDNLVFNRVYGVSQKFTLVGNPT 33333333%%%%-------------------------%%%%------------------- VCFNEGSSYLEGIAKKYLTLDGGLAIDNVLNELRSTCGIPGNAVASHAYNITSWRWYDNH -------------3333------------------------------------------3 VALLMNMLRAYHLQVLTEQGQYSAGDIPMYHDGHVKIKLPVTIDDTAGPTQFAWPSDRST 
333----------------------------------------3333---------2222 DSYPDWAQFSESFPSIDVPYLDVRPLTVTEVNFVLMMMSKWHRRTNLAIDYEAPQLADKF ----------------------1111---------1111-------1111---------- AYRHALTVQDADEWIEGDRTDDQFRPPSSKVMLSALRKYVNHNRLYNQFYTAAQLLAQIM -------------------3333-----------------1111------------1111 MKPVPNCAEGYAWLMHDALVNIPKFGSIRGRYPFLLSGDAALIQATALEDWSAIMAKPEL ------33333333-------------1111-------------------------3333 VFTYAMQVSVALNTGLYLRRVKKTGFGTTIDDSYEDGAFLQPETFVQAALACCTGQDAPL --------------------------------333311113333---------------- NGMSDVYVTYPDLLEFDAVTQVPITVIEPAGYNIVDDHLVVVGVPVACSPYMIFPVAAFD ---------3333-1111----------------iiii--------------3333---- TANPYCGNFVIKAANKYLRKGAVYDKLEAWKLAWALRVAGYDTHFKVYGDTHGLTKFYAD --3333-----------1111--------------3333--------------------- NGDTWTHIPEFVTDGDVMEVFVTAIERRARHFVELPRLNSPAFFRSVEVSTTIYDTHVQA --------3333------------------------------------------------ GAHAVYHASRINLDYVKPVSTGIQVINAGELKNYWGSVRRTQQGLGVVGLT -----------3333----2222------3333-------1111------- ------------------------------------------------------------ ----- >KID TOXIN PROTEIN; SWP:P13976; PDB:1M1FA; MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSFARTAGFAVSLDG -2222---------!!!!---------------------------------!!!!----- VGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT ---------1111----3333-------------------3333--- >TRANSCRIPTION ANTITERMINA; SWP:O67757; PDB:1M1HA; QVQELEKKWYALQVEPGKENEAKENLLKVLELEGLKDLVDEVIVPAEEKVVIRAQGKEKY --------------2222----------------3333---------------%%%%--- RLSLKGNARDISVLGKKGVTTFRIENGEVKVVESVEGDTCVNAPPISKPGQKITCKENKT --------------1111----------------2222-1111------------1111- EAKIVLDNKIFPGYILIKAHMNDKLLMAIEKTPHVFRPVMVGGKPVPLKEEEVQNILNQI ----------------------------------------iiii----3333---1111- KR -- >SUPPRESSOR OF FUSED; SWP:Q9UMX1; PDB:1M1LA; ASLFPPGLHAIYGECRRLYPDQPNPLQVTAIVKYWLGGPDPLDYVSMYRNVGSPSANIPE ------------------1111----------1111------------------------ HWHYISFGLSDLYGDNRVHEFTGTDGPSGFGFELTFRLKRETGESAPPTWPAELMQGLAR 
---------------------------!!!!-----------------3333-------- YVFQSENTFCSGDHVSWHSPLDNSESRIQHMLLTEDPQMQPVQTPFGVVTFLQIVGVCTE ---------2222------1111--------------------1111------------- ELHSAQQWNGQGILELLRTVPIAGGPWLITDMRRGETIFEIDPHLQERVDKGIETD --------3333---33333333-1111--1111--3333-3333----------- >NITROGENASE MOLYBDENUM-IR; SWP:P07328; PDB:1M1NA; MSREEVESLIQEVLEVYPEKARKDRNKHLAVNDPAVTQSKKCIISNKKSQPGLMTIRGCA -------------3333--------1111---1111-3333--------2222------- YAGSKGVVWGPIKDMIHISHGPVGCGQYSRAGRRNYYIGTTGVNAFVTMNFTSDFQEKDI --------3333---------------------------2222--1111------3333- VFGGDKKLAKLIDEVETLFPLNKGISVQSECPIGLIGDDIESVSKVKGAELSKTIVPVRC ------------------1111---------3333------------------------- EGFRGVSQSLGHHIANDAVRDWVLGKRDEDTTFASTPYDVAIIGDYNIGGDAWSSRILLE 3333---3333-----------11111111-----1111-------2222---------1 EMGLRCVAQWSGDGSISEIELTPKVKLNLVHCYRSMNYISRHMEEKYGIPWMEYNFFGPT 111-------2222------1111--------3333------------------------ KTIESLRAIAAKFDESIQKKCEEVIAKYKPEWEAVVAKYRPRLEGKRVMLYIGGLRPRHV ---------1111-------------------------33332222--------3333-- IGAYEDLGMEVVGTGYEFAHNDDYDRTMKEMGDSTLLYDDVTGYEFEEFVKRIKPDLIGS -------------------3333---3333-2222------------------------- GIKEKFIFQKMGIPFREMHSWDYSGPYHGFDGFAIFARDMDMTLNNPCWKKLQAPWE 3333----1111-------%%%%------3333---------11113333---1111 >Nitrogenase molybdenum-ir; SWP:P07329; PDB:1M1NB; SQQVDKIKASYPLFLDQDYKDMLAKKRDGFEEKYPQDKIDEVFQWTTTKEYQELNFQREA --1111-----1111---------------------------------------1111-- LTVNPAKACQPLGAVLCALGFEKTMPYVHGSQGCVAYFRSYFNRHFREPVSCVSDSMTED --------3333------------------3333-----------------------333 AAVFGGQQNMKDGLQNCKATYKPDMIAVSTTCMAEVIGDDLNAFINNSKKEGFIPDEFPV 3-----------------------------------------------------1111-- PFAHTPSFVGSHVTGWDNMFEGIARYFTLKSMDDKVVGSNKKINIVPGFETYLGNFRVIK -----1111-3333-------------111111112222------------3333----- RMLSEMGVGYSLLSDPEEVLDTPADGQFRMYAGGTTQEEMKDAPNALNTVLLQPWHLEKT ---1111--------3333----------------------3333-------1111---- 
KKFVEGTWKHEVPKLNIPMGLDWTDEFLMKVSEISGQPIPASLTKERGRLVDMMTDSHTW --------------------------------------------------------3333 LHGKRFALWGDPDFVMGLVKFLLELGCEPVHILCHNGNKRWKKAVDAILAASPYGKNATV 2222-----------------------------1111--------------1111----- YIGKDLWHLRSLVFTDKPDFMIGNSYGKFIQRDTLHKGKEFEVPLIRIGFPIFDRHHLHR ---------------------------------33333333------------------- STTLGYEGAMQILTTLVNSILERLDEETRGMQATDYNHDLVR -------------------------1111-----1111---- >SMALL TETRAHEME CYTOCHROM; SWP:Q8EDL6; PDB:1M1QA; DQKLSDFHAESGGCESCHKDGTPSADGAFEFAQCQSCHGKLSEMDAVHKPHDGNLVCADC ------------1111-2222------------------3333-33331111---3333- HAVHDMNVGQKPTCESCHDDGRTSASVLKK -3333-2222---3333-----3333---- >WR4; SWP:NA; PDB:1M1SA; HSINVDPPTGNYPATGGNSTHNITSESDSRLAFKVKSSNNEHYRVRPVYGFVDAKGKSKL ------------3333-------------------------------------------- DINRLPGPPKEDKIVIQYAEVPAEETDPAPFKAGAQQGEIIVKLIAA ---------------------3333-----1111------------- >BETA-LACTAM SYNTHETASE; SWP:Q9R8E3; PDB:1M1ZA; APVLPAAFGFLASARTGGGPVFATRGSHTDIDTPQGERSLAATLVHAPSVAPDRAVARSL ----------------------------------!!!!--------1111-3333----- TGAPTTAVLAGEIYNRDELLSVLPAGPAPEGDAELVLRLLERYDLHAFRLVNGRFATVVR -------------------1111-----------------------3333---------- TGDRVLLATDHAGSVPLYTCVAPGEVRASTEAKALAAHFPLADARRVAGLTGVYQVPAGA !!!!-----1111--------2222-----33333333--------2222---------- VMDIDLGSGTAVTHRTWTPGLSRRILPEGEAVAAVRAALEKAVAQRVTPGDTPLVVLSGG -------------------------------------------1111------------- IDSSGVAACAHRAAGELDTVSMGTDTSNEFREARAVVDHLRTRHREITIPTTELLAQLPY -------------------------------------1111------------------- AVWASESVDPDIIEYLLPLTALYRALDGPERRILTGYGADIPLGGMHREDRLPALDTVLA ----------------------1111-----------3333--1111----3333----- HDMATFDGLNEMSPVLSTLAGHWTTHPYWDREVLDLLVSLEAGLKRRHGRDKWVLRAAMA -111122221111---3333-----3333-----------3333--%%%%--------11 DALPAETVNRPKSSFSRLLLDHGVAEAKRQVVRELFDLTVGGGRHPSEVDTDDVVRSVAD 11----------------1111----------------------3333------------ >PEPTIDE AMIDASE; SWP:Q8RJN5; PDB:1M22A; 
PFPYAETDVADLQARMTAGELDSTTLTQAYLQRIAALDRTGPRLRAVIELNPDALKEAAE -1111----------1111--------------------!!!!-------1111------ RDRERRDGRLRGPLHGIPLLLKDNINAAPMATSAGSLALQGFRPDDAYLVRRLRDAGAVV ----1111---1111--------------------3333--------------------- LGKTNLSEWANFRGNDSISGWSARGGQTRNPYRISHSPCGSSSGSAVAVAANLASVAIGT -------%%%%------2222--------3333---------------1111-------- ETDGSIVCPAAINGVVGLKPTVGLVSRDGIIPISFSQDTAGPMARSVADAAAVLTAIAGR ----------1111------------2222---3333----------------------- DDADPATATMPGRAVYDYTARLDPQGLRGKRIGLLQTPLLKYRGMPPLIEQAATELRRAG 11113333----------111111112222-----------2222--------------- AVVVPVELPNQGAWAEAERTLLLYEFKAGLERYFNTHRAPLRSLADLIAFNQAHSKQELG -------2222-----------------------1111----------------1111-- LFGQELLVEADATAGLADPAYIRARSDARRLAGPEGIDAALAAHQLDALVAPTTGVAWPI ---------1111-1111--------------1111----------------------11 RSDFPGESYSAAAVAGYPSLTVPMGQIDGLPVGLLFMGTAWSEPKLIEMAYAYEQRTRAR 11-2222-------------------iiii--------2222------------------ RPPHFDT ------- >[2FE-2S] FERREDOXIN; SWP:O66511; PDB:1M2DA; FKHVFVCVQDRPPGHPQGSCAQRGSREVFQAFMEKIQTDPQLFMTTVITPTGCMNASMMG -----------1111---3333-----------------3333------------3333- PVVVVYPDGVWYGQVKPEDVDEIVEKHLKGGEPVERLVISK ---------------3333--------------3333---- >KAIA; SWP:Q79PF6; PDB:1M2EA; MLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAAN -------------------------1111------------------3333-----3333 PSFRAVVQQLCFEGVVVPAIVVGDRDSEDPDEPAKEQLYHSAELHLGIHQLEQLPYQVDA -3333-----------------------3333---33331111---1111---------- ALAEFLRLAPVETMA --------------- >SILENT INFORMATION REGULA; SWP:O28597; PDB:1M2KA; MDEKLLKTIAESKYLVALTGAGVSAESGIPTFRGKDGLWNRYRPEELANPQAFAKDPEKV --------1111-----------1111------2222-----3333-------------- WKWYAWRMEKVFNAQPNKAHQAFAELERLGVLKCLITQNVDDLHERAGSRNVIHLHGSLR --------------------------------------------1111-----1111111 VVRCTSCNNSFEVESAPKIPPLPKCDKCGSLLRPGVVWAGEMLPPDVLDRAMREVERADV 1------------------------------------2222--3333------3333--- 
IIVAGTSAVVQPAASLPLIVKQRGGAIIEINPDETPLTPIADYSLRGKAGEVMDELVRHV -----------3333-----1111----------1111---------3333--------- RKALSLKLN ---1111-- >CYTOCHROME B5; SWP:P00171; PDB:1M2MA; AVKYYTLEEIQKHNNSKSTWLILHYKVYDLTKFLEEHPGGEAVLRAQAGGDATANFEAVG -----3333-----3333----iiii---11111111---33331111--------1111 HSTDARELSKTFIIGELHPDDR -------3333------3333- >PROTEIN TRANSPORT PROTEIN; SWP:P15303; PDB:1M2OA; DFETNEDINGVRFTWNVFPSTRSDANSNVVPVGCLYTPLKEYDELNVAPYNPVVCSGPHC 3333-------------------------------------------------------- KSILNPYCVIDPRNSSWSCPICNSRNHLPPQYTNENMPLELQSTTIEYITNKPVTVPPIF ----1111--------------------3333-----3333------------------- FFVVDLTSETENLDSLKESIITSLSLLPPNALIGLITYGNVVQLHDLSSETIDRCNVFRG ---------------------3333--1111----------------------------- DREYQLEALTEMLTGQKPTVTPFSLNRFFLPLEQVEFKLNQLLENLSPDQWSVPAGHRPL ----3333------------1111-1111-3333--------1111-------2222--- RATGSALNIASLLLQGCYKNIPARIILFASGPGTVAPGLIVNSELKDPLRSHHDIDSDHA -----------------2222----------------------3333------------1 QHYKKACKFYNQIAQRVAANGHTVDIFAGCYDQIGMSEMKQLTDSTGGVLLLTDAFSTAI 111-------------------------------3333----1111--------111133 FKQSYLRLFAKDEEGYLKMAFNGNMAVKTSKDLKVQGLIGHASAVKKTDANNISESEIGI 33---------1111--------------1111--------------------------- GATSTWKMASLSPYHSYAIFFEIANTHLAYTQFITTYQHSSGTNRIRVTTVANQLLPFGT --------------------------------------1111-----------------3 PAIAASFDQEAAAVLMARIAVHKAETDDGADVIRWLDRTLIKLCQKYADYNKDDPQSFRL 3331111---------------------3333------------------22221111-- APNFSLYPQFTYYLRRSQFLSVFNNSPDETAFYRHIFTREDTTNSLIMIQPTLTSFSMED 1111-------------------------------------------------------- DPQPVLLDSISVKPNTILLLDTFFFILIYHGEQIAQWRKAGYQDDPQYADFKALLEEPKL -------3333-1111------------------------3333---------------- EAAELLVDRFPLPRFIDTEAGGSQARFLLSKLNPSTIVLTDDVSLQNFMTHLQQVAVS ------------------22223333-3333--------------------------- >Small COPII coat GTPase S; SWP:P20606; PDB:1M2OB; GKLLFLGLDNAGKTTLLHMLKNDRLATLQPTWHPTSEELAIGNIKFTTFDLGGHIQARRL 
-------2222------------------------------------------3333333 WKDYFPEVNGIVFLVDAADPERFDEARVELDALFNIAELKDVPFVILGNKIDAPNAVSEA 3--------------11111111------------1111----------3333----333 ELRSALGLLNTTGIEGQRPVEVFMCSVVMRNGYLEAFQWLSQYI 3-------------------------1111---------3333- >TOXIN BMTX3; SWP:Q9NBG9; PDB:1M2SA; FGLIDVKCFASSECWTACKKVTGSGQGKCQNNQCRCY ---------3333------------------------ >MISTLETOE LECTIN I A CHAI; SWP:P81446; PDB:1M2TA; YERLRLRTDQQTTGAEYFSFITVLRDYVSSGSFSNNIPLLRQSTVPVSEGQRFVLVELTN --------1111---------------------iiii--------1111----------1 AGGDTITAAIDVTNLYVVAYEAGNQSYFLSDAPAGAETQDFSGTTSSSQPFNGSYPDLER 111------------------!!!!---11112222------------------------ YAGHRDQIPLGIDQLIQSVTALRFPGGQTKTQARSILILIQMISEAARFNPILWRARQYI ---1111----------------------------------------------------- NSGASFLPDVYMLELETSWGQQSTQVQHSTDGVFNNPIALAIAPGVIVTLTNIRDVIASL -----------------------------iiii------------------3333----- AIMLFVCG -------- >Beta-galactoside-specific; SWP:P81446; PDB:1M2TB; AVTCTASEPIVRIVGRNGMTVDVRDDDFHDGNQIQLWPSKSNNDPNQLWTIKKDGTIRSN --------------2222----2222--2222-----------1111----1111---ii GSCLTTYGYTAGVYVMIFDCNTAVREATIWQIWGNGTIINPRSNLVLAASSGIKGTTLTV ii-------2222-----1111-3333-----1111----3333--------2222---- QTLDYTLGQGWLAGNDTAPRETTIYGFRDLCMESAGGSVYVETCTAGQENQRWALYGDGS -----1111----------------2222-----!!!!------2222-------1111- IRPKQLQSQCLTNGRDSISTVINIVSCSAGSSGQRWVFTNEGAILNLKNGLAMDVAQANP --1111----------2222------33331111----1111------------222233 SLQRIIIYPATGNPNQMWLPVP 33-------------------- >Protein transport protein; SWP:P40482; PDB:1M2VB; FLTPAQEQLHQQIRPMNQLYPIDLLTELPPPITDLTLPPPPLVIPPERMLVPSELSNASP --3333----------------3333-----3333---------3333----------33 DYIRSTLNAVPKNSSLLKKSKLPFGLVIRPYQHLYDDIDPPPLNEDGLIVRCRRCRSYMN 33---------------------------------------------------------1 PFVTFIEQGRRWRCNFCRLANDVPMQMDQSDPNDPKSRYDRNEIKCAVMEYMAPKEYTLR 111---iiii-------------3333----------11111111--------3333--- 
QPPPATYCFLIDVSQSSIKSGLLATTINTLLQNLDSIPNHDERTRISILCVDNAIHYFKI ---------------3333-------------3333--1111------------------ PLDQINMMDIADLEEPFLPRPNSMVVSLKACRQNIETLLTKIPQIFQSNLITNFALGPAL ----------------------------------------3333---------------- KSAYHLIGGVGGKIIVVSGTLPNLGIGKLQRRNENTSKETAQLLSCQDSFYKNFTIDCSK -------------------------------------3333-------3333----3333 VQITVDLFLASEDYMDVASLSNLSRFTAGQTHFYPGFSGKNPNDIVKFSTEFAKHISMDF ---------------3333----1111----------3333-3333-------------- CMETVMRARGSTGLRMSRFYGHFFNRSSDLCAFSTMPRDQSYLFEVNVDESIMADYCYVQ ----------2222---------------------------------------------- VAVLLSLNNSQRRIRIITLAMPTTESLAEVYASADQLAIASFYNSKAVEKALNSSLDDAR ------------------------------------------------------------ VLINKSVQDILATYKKEIVVSNTAGGAPLRLCANLRMFPLLMHSLTKHMAFRSGIVPSDH --------------------3333-------1111------------------------- RASALNNLESLPLKYLIKNIYPDVYSLHDMADEAGLPGTIVLPQPINATSSLFERYGLYL -----------3333--------------------------------------------- IDNGNELFLWMGGDAVPALVFFNQRVRNIINQLRNHDDVITYQSLYIVRAREVATLRLWA -----------------33333333----------------------------------1 SSTLVEDKILNNESYREFLQIMKARISK 111-----%%%%---------------- >CLASS B CARBAPENEMASE BLA; SWP:O08498; PDB:1M2XA; DVKIEKLKDNLYVYTTYNTFNGTKYAANAVYLVTDKGVVVIDCPWGEDKFKSFTDEIYKK -------!!!!--------iiii----------1111--------1111----------- HGKKVIMNIATHSHDDRAGGLEYFGKIGAKTYSTKMTDSILAKENKPRAQYTFDNNKSFK -------------11111111--------------------1111--------------- VGKSEFQVYYPGKGHTADNVVVWFPKEKVLVGGCIIKSADSKDLGYIGEAYVNDWTQSVH !!!!----------------------------1111-1111---------1111------ NIQQKFSGAQYVVAGHDDWKDQRSIQHTLDLINEYQQKQ -----1111------------------------------ >PLACENTAL CALCIUM-BINDING; SWP:P26447; PDB:1M31A; MACPLEKALDVMVSTFHKYSGKEGDKFKLNKSELKELLTRELPSFLGKRTDEAAFQKLMS ------------------3333-------3333---------3333----3333----33 NLDSNRDNEVDFQEYCVFLSCIAMMCNEFFEGFPDKQPRKK 33--------------------------------------- >2-aminoethylphosphonate-p; SWP:P96060; PDB:1M32A; 
YLLLTPGPLTTSRTVKEALFDSCTWDDDYNIGVVEQIRQQLTALATASEGYTSVLLQGSG ----------------------1111--------------------------------33 SYAVEAVLGSALGPQDKVLIVSNGAYGARVEAGLGIAHHAYDCGEVARPDVQAIDAILNA 33----------1111-------3333----------------1111------------- DPTISHIAVHSETTTGLNPIDEVGALAHRYGKTYIVDASSFGGIPDIAALHIDYLISSAN 3333-------3333---3333-----------------2222--3333----------- KCIQGVPGFAFVIAREQKLAACKGHSRSLSLDLYAQWRCEDNHGKWRFTSPTHTVLAFAQ 3333-------------33332222--3333----------iiii--------------- ALKELAKEGGVAARHQRYQQNQRSLVAGRALGFNTLLDDELHSPIITAFYSPEDPQYRFS ----------------------------1111-----3333------------1111--- EFYRRLKEQGFVIYPGKVSQSDCFRIGNIGEVYAADITALLTAIRTAYWT -----------------1111-----------3333-------------- >BIOH PROTEIN; SWP:P13001; PDB:1M33A; NIWWQTKGQGNVHLVLLHGWGLNAEVWRCIDEELSSHFTLHLVDLPGFGRSRGFGALSLA ------------------2222--1111-----1111-------2222---------333 DAEAVLQQAPDKAIWLGWSLGGLVASQIALTHPERVRALVTVASSPCFSARDEWPGIKPD 3---3333-----------------------3333----------------------333 VLAGFQQQLSDDQQRTVERFLALQTGTETARQDARALKKTVLALPPEVDVLNGGLEILKT 3-------------------------1111------------------------------ VDLRQPLQNVSPFLRLYGYLDGLVPRKVVPLDKLWPHSESYIFAKAAHAPFISHPAEFCH ---3333----------1111---3333------3333----------3333-------- LLVALKQRVGS ---3333---- >MONOCYTIC LEUKEMIA ZINC F; SWP:Q92794; PDB:1M36A; GSRLPKLYLCEFCLKYMKSRTILQQHMKKCGWF ---------------------------1111-- >CALTRACTIN, ISOFORM 1; SWP:P41208; PDB:1M39A; FGDFLTVMTQKMSEKDTKEEILKAFKLFDDDETGKISFKNLKRVAKELGENLTDEELQEM ------------------3333------1111---------------------------- IDEADRDGDGEVSEQEFLRI ------------3333---- >SUCCINYL-COA:3-KETOACID-C; SWP:Q29551; PDB:1M3EA; TKFYTDAVEAVKDIPNGATVLVGGFGLCGIPENLIGALLKTGVKELTAVSNNAGVDNFGL -----3333-11112222--------------------3333------------111133 GLLLQSKQIKRISSYVGENAEFERQYLAGELEVELTPQGTLAERIRAGGAGVPAFYTSTG 33----------------3333-----------------------------------222 YGTLVQEGGSPIKYNKDGSIAIASKPREVREFNGQHFILEEAIRGDFALVKAWKADQAGN 
2--3333------------------------iiii--------------------1111- VTFRKSARNFNLPCKAAETTVVEVEEIVDIGSFAPEDIHIPKIYVHRLVKGEKYEKRIER ---!!!!--3333---------------2222-1111---3333---------------- LSVRKEEDDNVRERIIKRAALEFEDGYANLGIGIPLLASNFISPNTVHLQSENGILGLGP ------------------1111----------3333-3333---------1111------ YPLQNEVDADLINAGKETVTVLPGASYFSSDESFAIRGGHVNLTLGAQVSKYGDLANWIP --1111-1111-1111-------------------1111----------1111-----22 GKLVKGGGADLVSSAKTKVVVTEHSAKGNAHKIEKCTLPLTGKQCVNRIITEKAVFDVDR 22-------11111111------------------------------------------- KKGLTLIELWEGLTVDDIKKSTGCDFAVSPKLIPQQVTT -----------------1111------------------ >DUAL SPECIFICITY PROTEIN ; SWP:Q05923; PDB:1M3GA; QGGPVEILPYLFLGSCSHSSDLQGLQACGITAVLNVSASCPNHFEGLFRYKSIPVEDNQM -------------------------3333------------------------------- VEISAWFQEAIGFIDWVKNSGGRVLVHSQAGISRSATICLAYLMQSRRVRLDEAFDFVKQ --11113333-------------------------------------------------- RRGVISPNFSFMGQLLQFETQVLCH ------------------------- >ACETYL-COA ACETYLTRANSFER; SWP:P07097; PDB:1M3KA; STPSIVIASAARTAVGSFNGAFANTPAHELGATVISAVLERAGVAAGEVNEVILGQVLPA -------------------1111--3333---------------3333----------22 GEGQNPARQAAMKAGVPQEATAWGMNQLAGSGLRAVALGMQQIATGDASIIVAGGMESMS 22--3333---1111-3333------!!!!---------------------------333 MAPHCAHLRGGVKMGDFKMIDTMIKDGLTDAFYGYHMGTTAENVAKQWQLSRDEQDAFAV 3------3333---------3333-----------3333--------------------- ASQNKAEAAQKDGRFKDEIVPFIVKGRKGDITVDADEYIRHGATLDSMAKLRPAFDKEGT ------------1111---------1111------1111--------1111----1111- VTAGNASGLNDGAAAALLMSEAEASRRGIQPLGRIVSWATVGVDPKVMGTGPIPASRKAL -3333--------------------------------------33331111--------- ERAGWKIGDLDLVEANEAFAAQACAVNKDLGWDPSIVNVNGGAIAIGHPIGASGARILNT -----3333-----------------------3333-11113333----1111------- LLFEMKRRGARKGLATLCIGGGMGVAMCIESL -------------------------------- >8-OXOGUANINE DNA GLYCOSYL; SWP:O15527; PDB:1M3QA; GSEGHRTLASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSGVLADQVWTLTQ ------33333333------3333-33331111---------2222----%%%%------ 
TEEQLHCTVYRSQASRPTPDELEAVRKYFQLDVTLAQLYHHWGSVDSHFQEVAQKFQGVR 1111-------------3333--------3333------------------3333----- LLRQDPIECLFSFICSSNNNIARITGMVERLCQAFGPRLIQLDDVTYHGFPSLQALAGPE ------------1111-------------------------!!!!------3333--222 VEAHLRKLGLGYRARYVSASARAILEEQGGLAWLQQLRESSYEEAHKALCILPGVGTKVA 2----------------------------------3333------------2222----- DCICLMALDKPQAVPVEVHMWHIAQRDYSWHPTTSQAKGPSPQTNKELGNFFRSLWGPYA ---------1111----------------------------------------------- GWAQAVLFSADLRQ -------------- >HYPOTHETICAL PROTEIN YCKF; SWP:P42404; PDB:1M3SA; GMKTTEYVAEILNELHNSAAYISNEEADQLADHILSSHQIFTAGAGRSGLMAKSFAMRLM -----------------------------------------------------------1 HMGFNAHIVGEILTPPLAEGDLVIIGSGSGETKSLIHTAAKAKSLHGIVAALTINPESSI 111----2222------2222-----3333------------------------1111-- GKQADLIIRMPGSPKDYKTIQPMGSLFEQTLLLFYDAVILKLMEKKGLDSETMFTHHANL 1111--------------------------------------------3333-------- E - >3-METHYL-2-OXOBUTANOATE H; SWP:P31057; PDB:1M3UA; PTTISLLQKYKQEKKRFATITAYDYSFAKLFADEGLNVMLVGDSLGMTVQGHDSTLPVTV --3333-------------------------1111-------------------3333-- ADIAYHTAAVRRGAPNCLLLADLPFMAYATPEQAFENAATVMRAGANMVKIEGGEWLVET -------------1111------2222--------------1111--------3333--- VQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDQLLSDALALEAAGAQLLVLEC ----1111---------3333-3333----------------------3333-------- VPVELAKRITEALAIPVIGIGAGNVTDGQILVMHDAFGITGGHIPKFAKNFLAETGDIRA --------------------------------------------1111--3333------ AVRQYMAEVESGVYPGEEHSFH ---------------3333--- >fusion of the LIM interac; SWP:P70662; PDB:1M3VA; GSLSWKRCAGCGGKIADRFLLYAMDSYWHSRCLKCSSCQAQLGDIGTSSYTKSGMILCRN ----------------------%%%%--1111--------1111-------%%%%----- DYIRLFGNSGAGGSGGHMGSGGDVMVVGEPTLMGGEFGDEDERLITRLENTQFDAANGID ------------------------------------------------------------ DE -- ------------------------------ >BETA-LACTAMASE TEM; SWP:P00810; PDB:1M40A; HPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVD 
----------------------------------------------------------11 AGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGP 11--1111----3333------3333----------------------------111133 KELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTTPAAMATTLRKLLTGELLTLASRQ 33-----1111----------------2222----------------------------- QLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTG -----------11113333-2222---------iiii----------------------- SQATMDERNRQIAEIGASLIKHW ----------------------- >MYOSIN LIGHT CHAIN; SWP:P53141; PDB:1M45A; TRANKDIFTLFDKKGQGAIAKDSLGDYLRAIGYNPTNQLVQDIINASLASSLTLDQITGL ---%%%%3333--------1111------------------------------------- IEVNEKELDATTKAKTEDFVKAFQVFDKESTGKVSVGDLRYMLTGLGEKLTDAEVDELLK --------1111--3333----33331111---------------!!!!---------11 GVEVDSNGEIDYKKFIEDVLRQ 11--1111-------------- >INTERLEUKIN-2; SWP:P01585; PDB:1M48A; SSSTKKTQLQLEHLLLDLQMILNGINNYKNPKLTRMLTFKFYMPKKATELKHLQCLEEEL ----------------------3333---------1111---------3333---3333- KPLEEVLNLAQRPRDLISNINVIVLELKGSETTFMCEYADETATIVEFLNRWITFCQSII -----------------------------------------------------------1 STL 111 >AMINOGLYCOSIDE 2'-N-ACETY; SWP:P95219; PDB:1M4IA; MHTQVHTARLVHTADLDSETRQDIRQMVTGAFAGDFTETDWEHTLGGMHALIWHHGAIIA ---2222----3333--------------1111--------1111--------iiii--- HAAVIQRRLIYRGNALRCGYVEGVAVRADWRGQRLVSALLDAVEQVMRGAYQLGALSSSA ----------iiii------------3333---3333---------------------33 RARRLYASRGWLPWHGPTSVLAPTGPVRTPDDDGTVFVLPIDISLDTSAELMCDWRAGDV 33----1111-----------1111---3333---------------------------- W - >A6 GENE PRODUCT; SWP:Q91YR1; PDB:1M4JA; IQASEDVKEIFARARNGKYRLLKISIENEQLVVGSCSPPSDSWEQDYDSFVLPLLEDKQP --------------------------%%%%-----------3333-----3333------ CYVLFRLDSQNAQGYEWIFIAWSPDHSHVRQKMLYAATRATLKKEFGGGHIKDEVFGTVK ----------1111---------1111-------------------3333--------33 EDVSLHGYKKYLL 33----------- >KILLER CELL IMMUNOGLOBULI; SWP:P43631; PDB:1M4KA; PSLLAHPGPLVKSEETVILQCWSDVRFEHFLLHREGKYKDTLHLIGEHHDGVSKANFSIG 
-----------2222--------------------------------------------- PMMQDLAGTYRCYGSVTHSPYQLSAPSDPLDIVITGLYEKPSLSAQPGPTVLAGESVTLS --3333---------------------------------------------2222----- CSSRSSYDMYHLSREGEAHERRFSAGPKVNGTFQADFPLGPATHGGTYRCFGSFRDSPYE -------------2222------------------------------------1111--- WSNSSDPLLVSVT ------------- >CARBOXYPEPTIDASE A; SWP:P00730; PDB:1M4LA; ARSTNTFNYATYHTLDEIYDFMDLLVAEHPQLVSKLQIGRSYEGRPIYVLKFSTGGSNRP --1111----------------------1111--------1111---------------- AIWIDLGIHSREWITQATGVWFAKKFTEDYGQDPSFTAILDSMDIFLEIVTNPDGFAFTH --------1111----------------2222---------------------------- SQNRLWRKTRSVTSSSLCVGVDANRNWDAGFGKAGASSSPCSETYHGKYANSEVEVKSIV --1111------1111-----1111----2222-----1111------22223333---- DFVKDHGNFKAFLSIHSYSQLLLYPYGYTTQSIPDKTELNQVAKSAVAALKSLYGTSYKY --------------------------------1111------------------------ GSIITTIYQASGGSIDWSYNQGIKYSFTFELRDTGRYGFLLPASQIIPTAQETWLGVLTI -3333-------------1111--------------!!!!-3333--------------- MEHTVNN ------- >BACULOVIRAL IAP REPEAT-CO; SWP:O70201; PDB:1M4MA; PQIWQLYLKNYRIATFKNWPFLEDCACTPERMAEAGFIHCPTENEPDLAQCFFCFKELEG ---1111----------------------------------1111--------------- WEPDDNPIEEHRKHSPGCAFLTVKKQMEELTVSEFLKLDRQRAKNKIAKETN -----------------3333----3333-----------------3333-- >INTERLEUKIN-22; SWP:Q9GZX6; PDB:1M4RA; SHCRLDKSNFQQPYITNRTFMLAKEASLADNNTDVRLIGEKLFHGVSMSERCYLMKQVLN -------1111---------------1111-1111---333322221111---------- FTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLSTCHIEGDDLHIQRNVQKLKDTVKKLGE -------1111----3333------------2222------------------------- SGEIKAIGELDLLFMSLRNACI ---------------------- >BONE MORPHOGENETIC PROTEI; SWP:Q13253; PDB:1M4UA; MQHYLHIRPAPSDNLPLVDLIEHPDPIFDPKEKDLNETLLRSLLGGHYDPGFMATSPPED -------------------------3333--------------!!!!-1111-------- RPGGAEDLAELDQLLRQRPSGAMPSEIKGLEFSEGLAQGKKQRLSKKLRRKLQMWLWSQT --------------1111-----3333--------------------------------- FCPVLYAWNDLGSRFWPRYVKVGSCFSKRSCSVPEGMVCKPSKSVHLTVLRWRCQRRGGQ 
-----------1111-----------------------------------------%%%% RCGWIPIQYPIISECKCSC ------------------- >SET3, SUPERANTIGEN-LIKE P; SWP:Q2YVS0; PDB:1M4VA; AKYENVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKHQKFTRIQIFGKDIER ---------------------------------iiii------%%%%------!!!!--- LKTRKNPGLDIFVVKEAETVFSYGGVTKKNQGAYYDYLNAPKFVIKKEVDAGVYTHVKRH ------------------------------------------------!!!!-------- YIYKEEVSLKELDFKLRQYLIQNFDLYKKFPKDSKIKVIMKDGGYYTFELNKKLQPHRMS ------------------------2222--!!!!-----1111-----1111--1111-- DVIDGRNIEKMEANIR ---1111--------- >ENDOXYLANASE; SWP:Q8GMV7; PDB:1M4WA; DTTITQNQTGYDNGYFYSFWTDAPGTVSMTLHSGGSYSTSWRNTGNFVAGKGWSTGGRRT -----------iiii-------2222---------------------------------- VTYNASFNPSGNAYLTLYGWTRNPLVEYYIVESWGTYRPTGTYKGTVTTDGGTYDIYETW ------------------------------------------------%%%%-------- RYNAPSIEGTRTFQQFWSVRQQKRTSGTITIGNHFDAWARAGMNLGSHDYQIMATEGYQS -----------------------------3333-----1111------------------ SGSSTVSISEGGNPGNP ----------------- >ATP-DEPENDENT PROTEASE HS; SWP:Q9WYZ1; PDB:1M4YA; TTILVVRRNGQTVMGGDGQVTFGSTVLKGNARKVRKLGEGKVLAGFAGSVADAMTLFDRF -------iiii--------------------------iiii------------------- EAKLREWGGNLTKAAVELAKDWRTDRVLRRLEALLLVADKENIFIISGNGEVIQPDDDAA ------%%%%----------------3333----------------1111---------- AIGSGGPYALAAAKALLRNTDLSAREIVEKAMTIAGEICIYTNQNIVIEEV --1111--------------------------------1111--------- >ORIGIN RECOGNITION COMPLE; SWP:P54784; PDB:1M4ZA; TMAKTLKDLQGWEIITTDEQGNITEHYLKRSSDGIKLGRGDSVVMHNEAAGTYSVYMIQE ----3333-----------------------------2222-----3333---------- LRLNTLNNVVELWALTYLRWFEVNPLAHYRQFNPDANILNRPLNYYNKLFSETANKNELY ------------------1111----------3333-----3333---------1111-- LTAELAELQLFNFIRVANVMDGSKWEVLKGNVDPERDFTVRYICEPTGEKFVDINIEDVK --------1111--------------------1111--------1111------3333-- AYIKKVEPREAQEYLKDLTLP -1111---------1111--- >ISOMALTULOSE SYNTHASE; SWP:Q8KR84; PDB:1M53A; EYPAWWKEAVFYQIYPRSFKDTNDDGIGDIRGIIEKLDYLKSLGIDAIWINPHYDSPNTD 
---3333-------3333-----------------------------------------i NGYDISNYRQIMKEYGTMEDFDSLVAEMKKRNMRLMIDVVINHTSDQHPWFIQSKSDKNN iii---1111-3333-------------1111------------------------1111 PYRDYYFWRDGKDNQPPNNYPSFFGGSAWQKDAKSGQYYLHYFARQQPDLNWDNPKVRED -1111----------------1111------------------1111---3333------ LYAMLRFWLDKGVSGMRFDTVATYSKIPGFPNLTPEQQKNFAEQYTMGPNIHRYIQEMNR --------1111-------1111---2222---3333--33331111--3333------- KVLSRYDVATAGEIFGVPLDRSSQFFDRRRHELNMAFMFDLIRLDRDSNERWRHKSWSLS -1111--------222211113333-3333----------1111-----1111----333 QFRQIISKMDVTVGKYGWNTFFLDNHDNPRAVSHFGDDRPQWREASAKALATITLTQRAT 3-----------------------1111-3333-----3333----------1111---- PFIYQGSELGMTNYPFRQLNEFDDIEVKGFWQDYVQSGKVTATEFLDNVRLTSRDNSRTP ---2222----------1111-------------1111---------3333--3333--- FQWNDTLNAGFTRGKPWFHINPNYVEINAEREETREDSVLNYYKKMIQLRHHIPALVYGA -----2222-----------1111------33331111--------------3333---- YQDLNPQDNTVYAYTRTLGNERYLVVVNFKEYPVRYTLPANDAIEEVVIDTQQQAAAPHS ----1111---------!!!!-----------------1111----------------11 TSLSLSPWQAGVYKLR 11---2222------- >REP PROTEIN; SWP:Q9YJC1; PDB:1M55A; MATFYEVIVRVPFDVEEHLPGISDSFVDWVTGQIWELPPESDLNLTLVEQPQLTVADRIR ------------------2222---------------1111--3333------------- RVFLYEWNKFSKQESKFFVQFEKGSEYFHLHTLVETSGISSMVLGRYVSQIRAQLVKVVF -----------------------------------22223333----------------i QGIEPQINDWVAITKVKKGGANKVVDSGYIPAYLLPKVQPELQWAWTNLDEYKLAALNLE iii-------------2222-----3333----3333-----------33333333---- ERKRLVAQFLAES ------------- >RC-RNASE2 RIBONUCLEASE; SWP:Q9DFY8; PDB:1M58A; MQNWETFQKKHLTDTRDVKCDAEMKKALFDCKQKNTFIYARPGRVQALCKNIIVSKNVLS -----3333---------33333333------------------3333------------ TDEFYLSDCNRIKLPCHYKLKKSSNTICITCENKLPVHFVAVEECP -------------------------------%%%%----------- >Formylmethanofuran--tetra; SWP:O28076; PDB:1M5HA; MKVNGVEVEETFAEAFDIKIARVLITGYDYYWAWVAANEATGFGTSVIMCPAEAGIEIKA --iiii--------------------------------1111---3333----------- KPSETPDGRPGYYIQICHMSKKGLEEQLLARLGQCVLTAPTTAVFNGLPDAEEKDDTGFK 
33331111---------------------------1111--------1111--------- LKFFADGYQKEVEVGGRKCWAVPMMEGDFIIENDIGYTNGIAGGNFFIMAETQPSALAAA -------------iiii------3333--------------------------------- KAAVDAISDVEGVITPFPGGIVASGSKVGANKYKFLKASTNEKFAPSIRDQVEGTQIPAG ------1111------2222------------1111----11113333---------111 VKAVYEIVINGLNADAIKEATRVGILAATKIPGVVKITAGNYGGKLGKHIINLNELF 1-----------------------------2222-----------------3333-- >APC PROTEIN; SWP:P25054; PDB:1M5IA; STGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPSLQTDMTRRQLEYEAR --------------------------------------1111------------------ QIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQA --------------------------------------------- >SMALL NUCLEAR RIBONUCLEOP; SWP:Q8ZVU2; PDB:1M5QA; FVAELNNLLGREVQVVLSNGEVYKGVLHAVDNQLNIVLANASNKAGEKFNRVFIYRYIVH -------2222-----1111----------1111--------1111--------1111-- IDSTERRIDREFAKQAEKIFPGVKYIEETNVVLIGDKVRVSEIGVEGVGPVAERAKRLFE --------------3333--------1111----------1111---------------- EFL --- >Formylmethanofuran--tetra; SWP:P55301; PDB:1M5SA; MEINGVEIEDTYAEAFPIKIARVLITAATKRWALVAATEATGFATSVIMCPAEAGIERLA --iiii--------------------------------1111---1111----------- SPSETPDGRPGVYVQICTFKYEALEEQLLERIGQCVLTAPTTAVFNGLPEAEKQDNVGFK 11111111---------------------------1111--------1111--------- LKFFADGMESETQIAGRKVYKVPIMEGDFLAEENIGAIAGIAGGNFFIFGDSQMTALTAA --1111-------iiii------3333--------------------------------- EAAVDTIAELEGTITPFPGGIVASGSKSGANKYKFLKATANERFCPSIKDKIENTEIPAD ---------2222---2222------------1111----111133331111-----111 VNAVYEIVINGLDEESIKAAMKAGIKAAVTVPGVKKISAGNYGGKLGKYQFKLHELF 1-------------------------11112222-----------------3333-- >PYRIDOXAL PHOSPHATE BIOSY; SWP:P24223; PDB:1M5WA; AELLLGVNIDHIATLRNARGTAYPDPVQAAFIAEQAGADGITVHLREDRRHITDRDVRIL ------------------------3333----1111---------1111----------- RQTLDTRMNLEMAVTEEMLAIAVETKPHFCCLVPEKRQEVTTEGGLDVAGQRDKMRDACK -----------------------------------------------1111--------- RLADAGIQVSLFIDADEEQIKAAAEVGAPFIEIHTGCYADAKTDAEQAQELARIAKAATF 
--1111-----------------1111--------------------------------- AASLGLKVNAGHGLTYHNVKAIAAIPEMHELNIGHAIIGRAVMTGLKDAVAEMKRLMLEA -1111---------1111------3333-----------3333----------------- RG -- >SURVIVAL PROTEIN SURA; SWP:P21202; PDB:1M5YA; VDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMG ------------------------------------------------------------ QKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNN --------------------1111------------------------------------ EVRRRITILPQEVESLAQQVTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGAD --1111--2222------------------------------------------1111-- FGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVN ----------1111---------3333-1111-------2222------1111------- DLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQD -----------------------33333333--------3333----------------- PGSANQGGDLGWATPDIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDRAYR --1111-------3333-------3333-2222------1111------------3333- MLMNRKFSEEAASWMQEQRASAYVKILS ---------------------------- >AMPA RECEPTOR INTERACTING; SWP:P97879; PDB:1M5ZA; SPTPVELHKVTLYKDSGMEDFGFSVADGLLEKGVYVKNIRPAGPGDLGGLKPYDRLLQVN ---------------------------1111-------------------2222----ii HVRTRDFDCCLVVPLIAESGNKLDLVISRNP ii-11113333----1111------------ >HYPOTHETICAL PROTEIN YCDX; SWP:P75914; PDB:1M65A; YPVDLHMHTVASTHAYSTLSDYIAQAKQKGIKLFAITDHGPDMEDAPHHWHFINMRIWPR --------3333---------------------------1111----3333---1111-- VVDGVGILRGIEANIKNVDGEIDCSGKMFDSLDLIIAGFHEPVFAPHDKATNTQAMIATI -iiii-----------1111--------1111-------3333----------------- ASGNVHIISHPGNPKYEIDVKAVAEAAAKHQVALEINNSSNCREVAAAVRDAGGWVALGS ---------1111-----------------------3333-------------------- DSHTAFTMGEFEECLKILDAVDFPPERILNVSPRRLLNFLESRGMAPIAEFADL ---3333-----------1111-33331111---------1111---3333--- >RECEPTOR PROTEIN-TYROSINE; SWP:P21860; PDB:1M6BA; AVCPRCEVVMGNLEIVLTGHNADLSFLQWVREVTGYVLVAMNEFSTLPLPNLRVVRGTQV -----------------------3333---------------------1111-------2 
YDGKFAIFVMLNYNTNSSHALRQLRLTQLTEILSGGVYIEKNDKLCHMDTIDWRDIVRDR 222----------3333--------1111------------1111-1111-3333----- DAEIVVKDNGRSCPPCHEVCKGRCWGPGSEDCQTLTKTICAPQCNGHCFGPNPNQCCHDE ----------------1111-------1111--------------------1111--111 CAGGCSGPQDTDCFACRHFNDSGACVPRCPQPLVYNKLTFQLEPNPHTKYQYGGVCVASC 1-------1111--------------------------------1111------------ PHNFVVDQTSCVRACPPDKMEVDKNGLKMCEPCGGLCPKACEGTGSGSRFQTVDSSNIDG 3333--!!!!-------------------------------------------1111111 FVNCTKILGNLDFLITGLNGDPWHKIPALDPEKLNVFRTVREITGYLNIQSWPPHMHNFS 1------------3333----1111----3333-1111--------------1111--33 VFSNLTTIGGRSLYNRGFSLLIMKNLNVTSLGFRSLKEISAGRIYISANRQLCYHHSLNW 33----------------------1111----1111------------1111--333333 TKVLRGPTEERLDIKHNRPRRDCVAEGKVCDPLCSSGGCWGPGPGQCLSCRNYSRGGVCV 33----------------33331111----1111--------1111--------iiii-- THCNFLNGEPREFAHEAECFSCHPECQPMEGTATCNGSGSDTCAQCAHFRDGPHCVSSCP --------------%%%%----------2222------------------!!!!------ HGVLGAKGP --------- >CATHEPSIN F; SWP:Q9UBX1; PDB:1M6DA; APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD ------1111-------------3333--------------------------------- KACMGGLPSNAYSAIKNLGLETEDDYSYQ !!!!-----------------3333---- >S-adenosyl-L-methionine:s; SWP:Q9SPV4; PDB:1M6EX; MDVRQVLHMKGGAGENSYAMNSFIQRQVISITKPITEAAITALYSGDTVTTRLAIADLGC --3333------------------------------------------------------ SSGPNALFAVTELIKTVEELRKKMGRENSPEYQIFLNDLPGNDFNAIFRSLPIENDVDGV ----1111------3333--------------------------------------2222 CFINGVPGSFYGRLFPRNTLHFIHSSYSLMWLSQVPIGIESNKGNIYMANTCPQSVLNAY ---------------------------1111----------------------3333--3 YKQFQEDHALFLRCRAQEVVPGGRMVLTILGRRSEDRASTECCLIWQLLAMALNQMVSEG 333----------------1111---------------3333------------------ LIEEEKMDKFNIPQYTPSPTEVEAEILKEGSFLIDHIEASEIYWSSCTKDGDGGGSVEEE ------1111--------3333-------------------------------------- GYNVARCMRAVAEPLLLDHFGEAIIEDVFHRYKLLIIERMSKEKTKFINVIVSLIRKSD -----------3333-----3333----------------------------------- 
>PROGRAMMED CELL DEATH PRO; SWP:O95831; PDB:1M6IA; APSHVPFLLIGGGTAAFAAARSIRARDPGARVLIVSEDPELPYMRPPLSKELWFSPNVTK --------------------------1111--------------3333-3333--3333- TLRFKQWNGKERSIYFQPPSFYVSAQDLPHIENGGVAVLTGKKVVQLDVRDNMVKLNDGS -----1111--------1111--3333---2222--------------1111---1111- QITYEKCLIATGGTPRSLSAIDRAGAEVKSRTTLFRKIGDFRSLEKISREVKSITIIGGG -----------------3333---33331111----3333-------------------- FLGSELACALGRKARALGTEVIQLFPEKGNMGKILPEYLSNWTMEKVRREGVKVMPNAIV -----------------------------------------------1111--------- QSVGVSSGKLLIKLKDGRKVETDHIVAAVGLEPNVELAKTGGLEIDSDFGGFRVNAELQA -----iiii----1111-----------------33331111------------1111-- RSNIWVAGDAACFYDIKLGRRRVEHHDHAVVSGRLAGENMTGAAKPYWHQSMFWSDLGPD -------1111----------------------------------------------111 VGYEAIGLVDSSLPTVGVFAKATAQDNPKSATEQSGTGIRSESETESEASYGKGVIFYLR 1--------1111---------1111------------3333------------------ DKVVVGIVLWNIFNRMPIARKIIKDGEQHEDLNEVAKLF --------------3333-------------3333---- >TRIOSEPHOSPHATE ISOMERASE; SWP:O02611; PDB:1M6JA; GAGKFVVGGNWKCNGTLASIETLTKGVAASVDAELAKKVEVIVGVPFIYIPKVQQILAGE ---------------------------11113333----------3333----------1 ANGANILVSAENAWTKSGAYTGEVHVGMLVDCQVPYVILGHSERRQIFHESNEQVAEKVK 111----------------2222-3333-1111-------3333---------------- VAIDAGLKVIACIGETEAQRIANQTEEVVAAQLKAINNAISKEAWKNIILAYEPVWAIGT --1111------------------------------111133331111-----3333--- GKTATPDQAQEVHQYIRKWMTENISKEVAEATRIQYGGSVNPANCNELAKKADIDGFLVG ----------------------------------------3333------1111-----3 GASLDAAKFKTIINSVSEKL 333------------1111- >BETA-LACTAMASE OXA-1; SWP:P13661; PDB:1M6KA; STDISTVASPLFEGTEGCFLLYDASTNAEIAQFNKAKCATQMAPDSTFIALSLMAFDAEI --------3333------------------------1111---!!!!------------- IDQKTIFKWDKTPKGMEIWNSNHTPKTWMQFSVVWVSQEITQKIGLNKIKNYLKDFDYGN -1111----------3333---------1111-------------------------!!!
QDFSGDERNNGLTEAWLESSLKISPEEQIQFLRKIINHNLPVKNSAIENTIENMYLQDLD !--------33331111-------------------------3333-----1111---11 NSTKLYGKTGAGFTANRTLQNGWFEGFIISKSGHKYVFVSALTGNLGSNLTSSIKAKKNA 11-----------2222------------3333--------------------------- ITILNTLNL --------- >BETA-2-MICROGLOBULIN; SWP:P30481; PDB:1M6OA; GSHSMRYFYTAMSRPGRGEPRFITVGYVDDTLFVRFDSDATSPRKEPRAPWIEQEGPEYW -------------2222----------!!!!-----1111--------3333---3333- DRETQISKTNTQTYRENLRTALRYYNQSEAGSHIIQRMYGCDVGPDGRLLRGYDQDAYDG -------------------------------------------1111----------iii KDYIALNEDLSSWTAADTAAQITQRKWEAARVAEQDRAYLEGLCVESLRRYLENGKETLQ i-----1111-----------------------------------------------111 RADPPKTHVTHHPISDHEVTLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 1-------------------------------------iiii-3333------------- FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP ---------22221111-----1111---------- >CATION-DEPENDENT MANNOSE-; SWP:P11456; PDB:1M6PA; EKTCDLVGEKGKESEKELALLKRLTPLFQKSFESTMYSYVFRVCREAGQHSSGAGLVQIQ --------2222-----------3333--------------------------------- KSNGKETVVGRFNETQIFQGSNWIMLIYKGGDEYDNHCGREQRRAVVMISCNRHTLADNF ----------1111----------------------%%%%-----------1111----- NPVSEERGKVQDCFYLFEMDSSLACS --------------------3333-- >L-ALLO-THREONINE ALDOLASE; SWP:NA; PDB:1M6SA; MIDLRSDTVTKPTEEMRKAMAQAEVGDDVYGEDPTINELERLAAETFGKEAALFVPSGTM ------1111----------------3333------------------------------ GNQVSIMAHTQRGDEVILEADSHIFWYEVGAMAVLSGVMPHPVPGKNGAMDPDDVRKAIR ----------2222----11113333-%%%%--------------iiii-3333-1111- PRNIHFPRTSLIAIENTHNRSGGRVVPLENIKEICTIAKEHGINVHIDGARIFNASIASG --3333------------1111----3333------------------------------ VPVKEYAGYADSVMFCLSGLCAPVGSVVVGDRDFIERARKARKMLGGGMRQAGVLAAAGI -3333-1111-------------------------------------------------- IALTKMVDRLKEDHENARFLALKLKEIGYSVNPEDVKTNMVILRTDNLKVNAHGFIEALR ------3333---------------------3333---------1111------------ NSGVLANAVSDTEIRLVTHKDVSRNDIEEALNIFEKLFRKFS ------------------1111-------------------- >NDT80 PROTEIN; SWP:P38830; 
PDB:1M6UA; FKVGPPFELVRDYCPVVESHTGRTLDLRIIPRIDRGFDHIDEEWVGYKRNYFTLVSTFET ------------------1111-----------------%%%%----------------- ANCDLDTFLKSSFDLLVEDSSVESRLRVQYFAIKIKAKNDDDDTEINLVQHTAKRDKGPQ ------------------%%%%-------------------------------------- FCPSVCPLVPSPLPKHQIIREASNVRNITKTKKYDSTFYLHRNHVNYEEYGVDSLLFSYP --------------3333------2222-----3333---1111--111111113333-- EDSIQKVARYERVQFASSISVKKPFQQNKHFSLHVILGAVVDPDTFHGENPGIPYDELAL -----------------1111----2222------------3333--------------1 KNGSKGMFVYLQEMKTPPLIIRGRS 111---------------------- >S-ADENOSYL-METHYLTRANSFER; SWP:Q9WZX6; PDB:1M6YA; RKYSQRHIPVVREVIEFLKPEDEKIILDCTVGEGGHSRAILEHCPGCRIIGIDVDSEVLR ------------------------------!!!!---------1111------------- IAEEKLKEFSDRVSLFKVSYREADFLLKTLGIEKVDGILDLGVSTYQLKGENRGFTFERE -----1111---------3333-----1111------------3333------------- EPLDRDLESEVTAQKVLNELPEEELARIIFEYGEEKRFARRIARKIVENRPLNTTLDLVK -----1111-------------------------------------1111---3333--- AVREALPSYEIRRRKRHFATKTFQAIRIYVNRELENLKEFLKKAEDLLNPGGRIVVISFH --33333333------1111----------------------3333--2222-------- SLEDRIVKETFRNSKKLRILTEKPVRPSEEEIRENPRARSGRLRAAERI ----------------------------------3333----------- >CYTOCHROME C4; SWP:Q52369; PDB:1M70A; AGDAEAGQGKVAVCGACHGVDGNSPAPNFPKLAGQGERYLLKQLQDIKAGSTPGAPEGVG --333311113333----1111---1111--2222----------------22222222- RKVLEMTGMLDPLSDQDLEDIAAYFSSQKGSVGYADPALAKQGEKLFRGGKLDQGMPACT --1111-1111------------------------3333-----------3333------ GCHAPNGVGNDLAGFPKLGGQHAAYTAKQLTDFREGNRTNDGDTMIMRGVAAKLSNKDIE ---1111---1111---2222-------------------3333-3333-1111------ ALSSYIQGLH ----3333-- >heavy chain of the monocl; SWP:NA; PDB:1M71B; EVKVEESGGGLVQPGGSMKLSCVASGFTFSNYWMEWVRQSPEKGLEWVAEIRLKS ------------2222-----------3333------------------------ >CASPASE-1; SWP:P89116; PDB:1M72A; RVARMPVDRNAPYYNMNHKHRGMAIIFNHEHFDIHSLKSRTGTNVDSDNLSKVLKTLGFK -------1111----------------------3333--2222-----------1111-- VTVFPNLKSEEINKFIQQTAEMDHSDADCLLVAVLTHGELGMLYAKDTHYKPDNLWYYFT 
------------------1111-1111-----------2222--------3333-11113 ADKCPTLAGKPKLFFIQACQGDRLDGGITLSRSYRIPVHADFLIAFSTVPGYFSWRNTTR 3331111-----------------------------1111--------2222-------- GSWFMQALCEELRYAGTERDILTLLTFVCQKVALDFESNAPDSAMMHQQKQVPCITSMLT ---------------------------------------33331111------------- RLLVFGK ------- >RND3/RHOE SMALL GTP-BINDI; SWP:P52199; PDB:1M7BA; VKCKIVVVGDSQCGKTALLHVFAKDCFPENYVPTVFENYTASFEIDTQRIELSLWDTSGS ---------2222----------------------------------------------3 PYYDNVRPLSYPDSDAVLICFDISRPETLDSVLKKWKGEIQEFCPNTKMLLVGCKSDLRT 333-------2222-------111133333333----------2222-------3333-- DVSTLVELSNHRQTPVSYDQGANMAKQIGAATYIECSALQSENSVRDIFHVATLACVNK ------------------------------------3333------------------- >ADENYLYLSULFATE KINASE; SWP:Q12657; PDB:1M7GA; STNITFHASALTRSERTELRNQRGLTIWLTGLSASGKSTLAVELEHQLVRDRRVHAYRLD ----3333----------1111---------2222------------------------3 GDNIRFGLNKDLGFSEADRNENIRRIAEVAKLFADSNSIAITSFISPYRKDRDTARQLHE 333---1111-------------------------------------------------- VATPGEETGLPFVEVYVDVPVEVAEQRDPKGLYKKAREGVIKEFTGISAPYEAPANPEVH --2222-------------333311111111------------2222------------- VKNYELPVQDAVKQIIDYLDTKGYLPAK ------3333---------1111----- >SILENCER OF DEATH DOMAINS; SWP:O95429; PDB:1M7KA; TPPSIKKIIHVLEKVQYLEQEVEEFVGKKTDKAYWLLEEMLTKELLELDSVETGGQDSVR ---------------------------3333----------------------------- QARKEAVCKIQAILEKLEKKG -----------------1111 >CATALASE; SWP:P46206; PDB:1M7SA; TDTLTRDNGAVVGDNQNSQTAGAQGPVLLQDVQLLQKLQRFDRERIPERVVHARGTGVKG -----1111------------1111--1111----------------------------- EFTASADISDLSKATVFKSGEKTPVFVRFSSVVHGNHSPETLRDPHGFATKFYTADGNWD -------3333--33332222-------------22221111-----------1111--- LVGNNFPTFFIRDAIKFPDMVHAFKPDPRTNLDNDSRRFDFFSHVPEATRTLTLLYSNEG ------------3333----------------------------3333------------ TPAGYRFMDGNGVHAYKLVNAKGEVHYVKFHWKSLQGIKNLDPKEVAQVQSKDYSHLTND ---3333------------1111----------1111----------------------- 
LVGAIKKGDFPKWDLYVQVLKPEELAKFDFDPLDATKIWPDVPEKKIGQMVLNKNVDNFF ----1111------------11113333--1111----1111---------------333 QETEQVAMAPANLVPGIEPSEDRLLQGRVFSYADTQMYRLGANGLSLPVNQPKVAVNNGN 3-------3333-2222----3333---------------1111--3333---------- QDGALNTGHTTSGVNYEPSRLEPRPADDKARYSELPLSGTTQQAKITREQNFKQAGDLYR --------------------------3333----------------------------11 SYSAKEKTDLVQKFGESLADTLTESKNIMLSYLYKEDPNYGTRVAEVAKGDLSKVKSLAA 11---------------1111------------------------------------333 SLKD 3--- >CHIMERA OF HUMAN AND E. C; SWP:P10599; PDB:1M7TA; MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHSLSEKYSNVIFLEVDVD -----------------!!!!-------11113333--------1111---------333 DAQDVAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLV 311113333-----------iiii---------3333---------- >NITRIC OXIDE SYNTHASE; SWP:NA; PDB:1M7VA; WNEAKAFIAECYQELGKEEEVKDRLDSIKSEIDRTGSYVHTKEELEHGAKMAWRNSNRCI ----------------1111--------------------3333-----------1111- GRLFWNSLNVIDRRDVRTKEDVRDALFHHIETATNNGKIRPSITIFPPEEKGEKQVEIWN 33331111-------------------------%%%%----------------------- HQLIRYAGYEGERIGDPASRSLTAACEQLGWRGERTDFDLLPLIFRMRGDEQPVWYELPR ---------------3333-------1111----------------------------33 SLVIEVPITHPDIEAFSDLELKWYGVPIISDMKLEVGGIHYNAAPFNGWYMGTEIGARNL 33----------33333333---------------iiii-----------3333---111 ADEKRYDKLKKVASVIGISTNYNTDLWKDQALVELNKAVLYSYKKQGVSIVDHHTAASQF 11111--------1111----3333----------------------------------- KRFEEQEEEAGRKLTGDWTWLIPPISPAATHIFHRSYDNSIVKPNYFYQDKPYE -------1111------1111----11113333-----------------1111 >1,4-ALPHA-GLUCAN BRANCHIN; SWP:P07762; PDB:1M7XA; THLRPYETLGAHADTMDGVTGTRFSVWAPNARRVSVVGQFNYWDGRRHPMRLRKESGIWE 33331111-----------------------------3333--1111-----3333---- LFIPGAHNGQLYKYEMIDANGNLRLKSDPYAFEAQMRPETASLICGLPEKVVQTEERKKA -----------------1111------------------------------------333 NQFDAPISIYEVHLGSWRRHTDNNFWLSYRELADQLVPYAKWMGFTHLELLPINEHPFDG 31111-------1111-----------------------------------------333 
SWGYQPTGLYAPTRRFGTRDDFRYFIDAAHAAGLNVILDWVPGHFPTDDFALAEFDGTNL 3------------1111------------1111-------1111---%%%%--------- YEHSTLIYNYGRREVSNFLVGNALYWIERFGIDALRVDAVASMIYRGGRENLEAIEFLRN --------3333--------------------------3333--------3333------ TNRILGEQVSGAVTMAEESTDFPGVSRPQDMGGLGFWYKWNLGWMHDTLDYMKLDPVYRQ ------------------------------------------------------333311 YHHDKLTFGILYNYTENFVLPLSHDEVVHGKKSILDRMPGDAWQKFANLRAYYGWMWAFP 1133333333-1111-------3333-iiii-3333-------------------1111- GKKLLFMGNEFAQGREWNHDASLDWHLLEGGDNWHHGVQRLVRDLNLTYRHHKAMHELDF -------3333------1111--3333-----3333---------------3333--111 DPYGFEWLVVDDKERSVLIFVRRDKEGNEIIVASNFTPVPRHDYRFGINQPGKWREILNT 11111-------1111-------3333-------------------------------11 DSMHYHGSNAGNGGTVHSDEIASHGRQHSLSLTLPPLATIWLVREAE 11--------------------%%%%--------------------- >1-AMINOCYCLOPROPANE-1-CAR; SWP:P37821; PDB:1M7YA; MLSRNATSSYFLGWQEYEKNPYHEVHNTNGIIQMGLAENQLCFDLLESWLAKNPEAAAFK ---11113333------------------------------3333--------3333--- KNGESIFAELALFQDYHGLPAFKKAMVDFMAEIRGNKVTFDPNHLVLTAGATSANETFIF iiii------------------------------------3333---------------- CLADPGEAVLIPTPYYPGFDRDLKWRTGVEIVPIHCTSSNGFQITETALEEAYQEAEKRN ---2222--------3333-----------------3333----------------1111 LRVKGVLVTNPSNPLGTTMTRNELYLLLSFVEDKGIHLISDEIYSGTAFSSPSFISVMEV -----------------------------------------1111----------3333- LKDRNCDENSEVWQRVHVVYSLSKDLGLPGFRVGAIYSNDDMVVAAATKMSSFGLVSSQT ----------3333-------------3333------------------3333------- QHLLSAMLSDKKLTKNYIAENHKRLKQRQKKLVSGLQKSGISCLNGNAGLFCWVDMRHLL -----1111---------------------------1111---------------1111- RSNTFEAEMELWKKIVYEVHLNISPGSSCHCTEPGWFRVCFANLPERTLDLAMQRLKAFV ------------------------3333-------------------------------- GEYY ---- >CATALASE; SWP:P42321; PDB:1M85A; KKLTTAAGAPVVDNNNVITAGPRGPMLLQDVWFLEKLAHFDREVIPERRHAKGSGAFGTF ----1111------------------1111---------1111----------------- TVTHDITKYTRAKIFSEVGKKTEMFARFSTVAGERGAADAERDIRGFALKFYTEEGNWDM 
-----3333--3333-2222--------------------------------1111---- VGNNTPVFYLRDPLKFPDLNHIVKRDPRTNMRNMAYKWDFFSHLPESLHQLTIDMSDRGL -----------3333-----------------3333---33333333--------1111- PLSYRFVHGFGSHTYSFINKDNERFWVKFHFRCQQGIKNLMDDEAEALVGKDRESSQRDL --1111------------1111-------------------------------------- FEAIERGDYPRWKLQIQIMPEKEASTVPYNPFDLTKVWPHADYPLMDVGYFELNRNPDNY ---1111------------3333------1111-----3333----------------33 FSDVEQAAFSPANIVPGISFSPDKMLQGRLFSYGDAHRYRLGVNHHQIPVNAPKCPFHNY 33-1111--3333-2222--------3333-------------33333333--------- HRDGAMRVDGNSGNGITYEPNSGGVFQEQPDFKEPPLSIEGAADHWNHREDEDYFSQPRA ----------------------------3333--------------1111---------- LYELLSDDEHQRMFARIAGELSQASKETQQRQIDLFTKVHPEYGAGVEKAIKVLE -3333--------------3333----------------3333------------ >SMALL INDUCIBLE CYTOKINE ; SWP:P78556; PDB:1M8AA; DCCLGYTDRILHPKFIVGFTRQLANEGCDINAIIFHTKKKLSVCANPKQTWVKYIVRLLS -----------3333-------1111----------1111-----1111-------3333 K - >ANTIFREEZE PROTEIN ISOFOR; SWP:Q9GSA6; PDB:1M8NA; GTCVNTNSQITANSQCVKSTATNCYIDNSQLVDTSICTRSQYSDANVKKSVTTDCNIDKS ----------1111---------------------------------------------- QVYLTTCTGSQYNGIYIRSSTTTGTSISGPGCSISTCTITRGVATPAAACKISGCSLSAM ----------------------------1111-------iiii---3333---------- >SULFATE ADENYLYLTRANSFERA; SWP:Q12650; PDB:1M8PA; MANAPHGGVLKDLLARDAPRQAELAAEAESLPAVTLTERQLCDLELIMNGGFSPLEGFMN ----------------3333-----1111-------3333---------1111------- QADYDRVCEDNRLADGNVFSMPITLDASQEVIDEKKLQAASRITLRDFRDDRNLAILTID ---------------------------3333----------------------------- DIYRPDKTKEAKLVFGGDPEHPAIVYLNNTVKEFYIGGKIEAVNKLNHYDYVALRYTPAE --------------------------------------------------3333------ LRVHFDKLGWSRVVAFQTRNPMHRAHRELTVRAARSRQANVLIHPVVGLTKPGDIDHFTR ------------------------------------------------------------ VRAYQALLPRYPNGMAVLGLLGLAMRMGGPREAIWHAIIRKNHGATHFIVGRDHAGPGSN ------1111-2222-------------------------------------2222---- SKGEDFYGPYDAQHAVEKYKDELGIEVVEFQMVTYLPDTDEYRPVDQVPAGVKTLNISGT 
--------------------3333------------3333--------2222-----333 ELRRRLRSGAHIPEWFSYPEVVKILRESNPPRATQGFTIFLTGYMNSGKDAIARALQVTL 3-----------1111--3333---1111-1111-------------------------- NQQGGRSVSLLLGDTVRHELSSELGFTREDRHTNIQRIAFVATELTRAGAAVIAAPIAPY ------------------------------------------------------------ EESRKFARDAVSQAGSFFLVHVATPLEHCEQSDKRGIYAAARRGEIKGFTGVDDPYETPE -----------1111--------------------------------------------- KADLVVDFSKQSVRSIVHEIILVLESQGFLERQ ------3333-----------------1111-- >PHOSPHOLIPASE A2; SWP:P14418; PDB:1M8RA; SLVQFETLIMKVAKKSGMQWYSNYGCYCGWGGQGRPQDATDRCCFVHDCCYGKVTGCDPK ---------------3333-------------------------------1111---333 MDVYSFSEENGDIVCGGDDPCKKEICECDRAAAICFRDNLNTYNDKKYWAFGAKNCPQEE 3-------%%%%--------------------------3333-3333----3333-3333 SEPC ---- >PHOSPHOLIPASE A2; SWP:Q9DF33; PDB:1M8TA; HLVQFNGMIRCTIPGSIPWWDYSDYGCYCGSGGSGTPVDELDRCCQVHDNCYTQAQQLTE 3333--------1111----------------------3333-----------1111--- CSPYSKRYSYDCSEGTLTCKADNDECAAFVCDCDRVAAICFAGAPYNKENINIDTTTRC -3333--------------1111-----------------1111--3333---3333-- >GAMMA-E; SWP:P23005; PDB:1M8UA; GKITFYEDRGFQGRHYECSSDHSNLQPYFSRCNSIRVDSGCWMIYEQPNFQGPQYFLRRG --------%%%%------------1111-------------------%%%%--------- DYPDYQQWMGLNDSIRSCRLIPHTSSHRLRIYEREDYRGQMVEITEDCSSLHERFHFSEI ----3333--------------------------%%%%-----------1111------- HSFHVLEGWWVLYEMPNYRGRQYLLRPGDYRRYHEWGAVDARVGSLRRAVDFY -------------------------------3333------------------ >SERINE PROTEINASE INHIBIT; SWP:P07385; PDB:1M93A; MDIFREIASSMKGENVFISPPSISSVLTILYYGANGSTAEQLSKYV ----------2222---------------3333-!!!!---3333- >Serine proteinase inhibit; SWP:P07385; PDB:1M93B; DISFKSMNKVYGRYSAVFKDSFLRKIGDNFQTVDFTDSRTVDAINKSVDIFTEGKINPLL ------------1111-----------------1111------------1111------- DEPLSPDTSLLAISAVYFKAKWLMPFEKEFTSDYPFYVSPTEMVDVSMMSMYGEAFNHAS ----1111------------------3333------------------------------ VKESFGNFSIIELPYVGDTSMVVILPDNIDGLESIEQNLTDTNFKKWSDSMDAMFIDVHI 
--1111-------------------------3333------------1111--------- PKFKVTGSYNLVDALVKLGLTEVFGSTGDYSNMSNSDVSVDAMIHKTYIDVNEEYTEAAA ---------------1111-----1111-1111--------------------------- ATSAL ----- >Serine proteinase inhibit; SWP:P07385; PDB:1M93C; TNEFSADHPFIYVIRHVDGKILFVGRYSSPT ---------------2222------------ >PROTEIN YNR032C-A; SWP:Q6Q546; PDB:1M94A; MIEVVVNDRLGKKVRVKCLAEDSVGDFKKVLSLQIGTQPNKIVLQKGGSVLKDHISLEDY ------------------1111---------------1111----%%%%--33333333- EVHDQTNLELYYL ------------- >ORANGE CAROTENOID PROTEIN; SWP:P83689; PDB:1M98A; PFTIDTARSIFPETLAADVVPATIARFKQLSAEDQLALIWFAYLEMGKTITIAAPGAANM --333311111111------------1111---------------3333------3333- QFAENTLQEIRQMTPLQQTQAMCDLANRTDTPICRTYASWSPNIKLGFWYELGRFMDQGL 1111-----1111-----------------------1111---------------1111- VAPIPEGYKLSANANAILVTIQGIDPGQQITVLRNCVVDMGFDTSKLGSYQRVAEPVVPP -------------------------------------------1111------------- QEMSQRTKVQIEGVTNSTVLQYMDNLNANDFDNLISLFAEDGALQPPFQKPIVGKENTLR -3333-----2222-----------1111-----11113333------------------ FFREECQNLKLIPERGVSEPTEDGYTQIKVTGKVQTPWFGGNVGMNIAWRFLLNPENKVF ---------------------iiii----------33333333----------1111--- FVAIDLLASPKELLNL --------3333---- >Gag polyprotein; SWP:Q72497; PDB:1M9DC; PIVQNLQGQMVHQAISPRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVG ----1111-----------------------------------2222------------- GHQAAMQMLKETINEEAAEWDRTHPPAMGPLPPGQIREPRGSDIAGTTSTLQEQIGWMTH -------------------------------2222------------------------- NPPIPVGEIYKRWIILGLNKIVRMYS -------------------------- >ANNEXIN VI; SWP:P08133; PDB:1M9IA; YRGSIHDFPGFDPNQDAEALYTAMKGFGSDKEAILDIITSRSNRQRQEVCQSYKSLYGKD -----------3333------1111-----------1111-3333--------------- LIADLKYELTGKFERLIVGLMRPPAYCDAKEIKDAISGIGTDEKCLIEILASRTNEQMHQ -----------------3333--------------------------------------- LVAAYKDAYERDLEADIIGDTSGHFQKMLVVLLQGTREEDDVVSEDLVQQDVQDLYEAGE ------------------------------3333---------3333------------- LKWGTDEAQFIYILGNRSKQHLRLVFDEYLKTTGKPIEASIRGELSGDFEKLMLAVVKCI 
-----------------------------------3333-2222---------------- RSTPEYFAERLFKAMKGLGTRDNTLIRIMVSRSELDMLDIREIFRTKYEKSLYSMIKNDT -------------------------------1111-------1111-------------- SGEYKKTLLKLSGGDDDAAGQFFPEAAQVAYQMWELSAVARVELKGDVRPANDFNPDADA ------------------------------------------------------------ KALRKAMKGLGTDEDTIIDIITHRSNVQRQQIRQTFKSHFGRDLMTDLKSEISGDLARLI --------------------11113333-------------------------------- LGLMMPPAHYDAKQLKKAMEGAGTDEKALIEILATRTNAEIRAINEAYKEDYHKSLEDAL -----3333----------------3333------------------------------- SSDTSGHFRRILISLATGHREEGGENLDQAREDAQVAAEILEIADTPSGDKTSLETRFMT -----3333-----3333--------3333------------------------------ ILCTRSYPHLRRVFQEFIKMTNYDVEHTIKKEMSGDVRDAFVAIVQSVKNKPLFFADKLY ---------------------------------!!!!-------------3333------ KSMKGAGTDDKTLTRIMVSRSEIDLLNIRREFIEKYDKSLHQAIEGDTSGDFLKALLALC -------------------1111------------------------------------- GGED ---- >OUTER ARM DYNEIN LIGHT CH; SWP:Q9XHH2; PDB:1M9LA; MAKATTIKDAIRIFEERKSVVATEAEKVELHGMIPPIEKMDATLSTLKACKHLALSTNNI ---------------------%%%%--------1111-------3333------------ EKISSLSGMENLRILSLGRNLIKKIENLDAVADTLEELWISYNQIASLSGIEKLVNLRVL ----3333------------------3333----------------3333---------- YMSNNKITNWGEIDKLAALDKLEDLLLAGNPLYNDYKENNATSEYRIEVVKRLPNLKKLD --------3333------------------------------------------------ GMPVDVDEREQANVARGG 3333--1111-------- >ENDOTHELIAL NITRIC-OXIDE ; SWP:P29474; PDB:1M9MA; KFPRVKNWEVGSITYDTLSAQAQQDGPCTPRRCLGSLVFPEQLLSQARDFINQYYSSIKR ----------------3333--------3333-1111------------------11112 SGSQAHEQRLQEVEAEVAATGTYQLRESELVFGAKQAWRNAPRCVGRIQWGKLQVFDARD 222-------------------------------------1111-3333--------111 CRSAQEMFTYICNHIKYATNRGNLRSAITVFPQRCPGRGDFRIWNSQLVRYAGYRQQDGS 1-----------------%%%%------------1111-----------------1111- VRGDPANVEITELCIQHGWTPGNGRFDVLPLLLQAPDEPPELFLLPPELVLEVPLEHPTL ---3333-------1111---------------------------3333----------3 EWFAALGLRWYALPAVSNMLLEIGGLEFPAAPFSGWYMSTEIGTRNLCDPHRYNILEDVA 
3333333---------------iiii----------------------1111-------- VCMDLDTRTTSSLWKDKAAVEINVAVLHSYQLAKVTIVDHHAATASFMKHLENEQKARGG 1111----3333------------------------------------------------ CPADWAWIVPPISGSLTPVFHQEMVNYFLSPAFRYQPDPW ------------11113333-------------------- >TRISTETRAPROLINE; SWP:P22893; PDB:1M9OA; MTTSSRYKTELCRTYSESGRCRYGAKCQFAHGLGELRQAN ------------3333------2222-------------- >EARTHWORM FIBRINOLYTIC EN; SWP:Q8MX72; PDB:1M9UA; VIGGTNASPGEFPWQLSQQRQSGSW -------22221111---------- >Gag polyprotein; SWP:Q72497; PDB:1M9XC; PIVQNLQGQMVHQAISPRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVG ----1111-----------------------------------2222--------1111- GHQAAMQMLKETINEEAAEWDRLHPVAMAPIAPGQMREPRGSDIAGTTSTLQEQIGWMTH -------------------------------2222----3333--1111----------- NPPIPVGEIYKRWIILGLNKIVRMYS --------------------3333-- >TGF-BETA RECEPTOR TYPE II; SWP:P37173; PDB:1M9ZA; ALCKFCDVRFSTCDNQKSCMSNCSITSICEKPQEVCVAVWRKNDENITLETVCHDPKLPY ------------------------------1111--------------------1111-% HDFILEDAASPTCIMKEKKKPGETFFMCSCSSDECNDNIIFSEEY %%%-1111-----------------------2222---------- >SUPEROXIDE DISMUTASE; SWP:P18868; PDB:1MA1A; EKKFYELPELPYPYDALEPHISREQLTIHHQKHHQAYVDGANALLRKLDEARESDTDVDI ------------1111-------------------------------------------- KAALKELSFHVGGYVLHLFFWGNMGPADECGGEPSGKLAEYIEKDFGSFERFRKEFSQAA -------------------------1111------------------------------1 ISAEGSGWAVLTYCQRTDRLFIMQVEKHNVNVIPHFRILLVLDVWEHAYYIDYRNVRPDY 111-----------1111--------------2222--------3333----!!!!---- VEAFWNIVNWKEVEKRFEDIL -3333----------3333-- >TRANSCRIPTIONAL REGULATOR; SWP:O30124; PDB:1MA3A; MEDEIRKAAEILAKSKHAVVFTGAGISAEGLWRKYDPEEVASISGFKRNPRAFWEFSMEM ----------------------3333---------3333--------------------- KDKLFAEPNPAHYAIAELERMGIVKAVITQNIDMLHQRAGSRRVLELHGSMDKLDCLDCH -3333-------------------------------1111-----1111----------- ETYDWSEFVEDFNKGEIPRCRKCGSYYVKPRVVLFGEPLPQRTLFEAIEEAKHCDAFMVV ---3333----1111------------------2222----------------------- 
GSSLVVYPAAELPYIAKKAGAKMIIVNAEPTMADPIFDVKIIGKAGEVLPKIVEEVKRLR -------3333-----1111----------1111---------3333----------111 SE 1- >ATP synthase gamma chain,; SWP:P35435; PDB:1MABG; RDITRRLKSIKNIQKITKSMKMVAAAKYARAERELKPARVYGTGSLCGAIHSSVAKQMKL --------------------------333333333333!!!!---------3333----- ANIIYYSLKESTTSEQSARMTAMDNASKNASDMIDKLTLTFNRTRQAVITKELIEIISGA -3333-----------------------------------------3333---------- AALD ---- >1,3-1,4-BETA-D-GLUCAN 4-G; SWP:P23904; PDB:1MACA; GSVFWEPLSYFNPSTWEKADGYSNGGVFNCTWRANNVNFTNDGKLKLGLTSSAYNKFDCA -----------3333-----------------1111---1111----------------- EYRSTNIYGYGLYEVSMKPAKNTGIVSSFFTYTGPAHGTQWDEIDIEFLGKDTTKVQFNY ------------------------------------------------3333-------- YTNGVGGHEKVISLGFDASKGFHTYAFDWQPGYIKWYVDGVLKHTATANIPSTPGKIMMN -iiii-----------1111-----------------iiii------------------- LWNGTGVDDWLGSYNGANPLYAEYDWVKYTSN ------3333---------------------- >PHOSPHOLIPASE C DELTA-1; SWP:P10688; PDB:1MAI; GLQDDPDLQALLKGSQLLKVKSSSWRRERFYKLQEDCKTIWQESRKVMRSPESQLFSIED -1111----------------------------1111------------3333---3333 IQEVRMGHRTEGLEKFARDIPEDRCFSIVFKDQRNTLDLIAPSPADAQHWVQGLRKIIH ----------------33331111----------------------------------- >IGG2A-KAPPA 26-10 FV (LIG; SWP:P01631; PDB:1MAJ; DVVMTQTPLSLPVSLGDQASISCRSSQSLVHSNGNTYLNWYLQKAGQSPKLLIYKVSNRF ------------------------------------------------------------ SGVPDRFSGSGSGTDFTLKISRVEAEDLGIYFCSQTTHVPPTFGGGTKLEIKR -----------------------3333-------------------------- >Ig gamma-2B chain C regio; SWP:GCBM_MOUSE; PDB:1MAMH; EVKLVESGGGLVQPGGSLRLSCATSGFTFTDYYMSWVRQPPGKALEWLGFIRNKADGYTT ------------2222-----------3333----------------------------- EYSASVKGRFTISRDNSQSILYLQMNTLRAEDSATYYCTRDPYGPAAYWGQGTLVTVSAA --3333----------------------1111--------1111---------------- KTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSGL --------------------------------------------------------iiii YTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLE ---------33333333-------------------- >CELL DIVISION 
RESPONSE RE; SWP:Q9A5I4; PDB:1MB3A; TKKVLIVEDNELNMKLFHDLLEAQGYETLQTREGLSALSIARENKPDLILMDIQLPEISG ---------------------1111-------3333------------------1111-- LEVTKWLKEDDDLAHIPVVAVTDEERIREGGCEAYISKPISVVHFLETIKRLLERQP ---------1111---------3333-3333-------------------------- >ASPARTATE-SEMIALDEHYDE DE; SWP:Q9KQG2; PDB:1MB4A; MRVGLVGWRGMVGSVLMQRMVEERDFDLIEPVFFSTSQIGVPAPNFGKDAGMLHDAFDIE ------1111---------------1111-------------------------111133 SLKQLDAVITCQGGSYTEKVYPALRQAGWKGYWIDAASTLRMDKEAIITLDPVNLKQILH 33-------------------------------------1111------3333------- GIHHGTKTFVGGNCTVSLMLMALGGLYERGLVEWMSAMTYQAASGAGAQNMRELISQMGV -1111-----------------33331111------------3333-------------- INDAVSSELANPASSILDIDKKVAETMRSGSFPTDNFGVPLAGSLIPWIDVKRDNGQSKE -----1111-1111--------------1111-3333---2222---------------- EWKAGVEANKILGLQDSPVPIDGTCVRIGAMRCHSQALTIKLKQNIPLDEIEEMIATHND -------------1111------------------------------------------- WVKVIPNERDITARELTPAKVTGTLSVPVGRLRKMAMGDDFLNAFTVGDQLLWGAAEPLR ----------------33332222----------3333----------1111-------- RTLRIILAE --------- >HUWENTOXIN-IV; SWP:P83303; PDB:1MB6A; ECLEIFKACNPSNDQCCKSSKLVCSRKTRWCKYQI ---------3333----1111----1111------ >MYOGLOBIN; SWP:P02210; PDB:1MBA; SLSAAEADLAGKSWAPVFANKNANGLDFLVALFEKFPDSANFFADFKGKSVADIKASPKL ---------------3333----------------33333333--2222-------1111 RDVSSRIFTRLNEFVNNAANAGKMSAMLSQFAKEHVGFGVGSAQFENVRSMFPGFVASVA ----------------1111---------------1111--------------------- APPAGADAAWTKLFGLIIDALKAAGA --2222---------------1111- >CHYMOTRYPSIN-LIKE SERINE ; SWP:P19811; PDB:1MBMA; KARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGRANMATLKIGDAMLTLTFKKNGDF --1111---------------%%%%-----3333-3333-----!!!!--------!!!! AEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDEEGLLSGEVCLAWTTSGDSGSAVVQ -----3333-------------------------------3333-----3333------! 
GDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLSSLSKHFTGPLTSIPKDIPDNIIADV !!!---------%%%%----1111-------------1111-----------1111---- DAVPRSLAMLIDGLSNRE ---3333----------- >MYOGLOBIN; SWP:P68080; PDB:1MBS; GLSDGEWHLVLNVWGKVETDLAGHGQEVLIRLFKSHPETLEKFDKFKHLKSEDDMRRSED ---------------------------------------------------3333----- LRKHGNTVLTALGGILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISEAIIHVLHSKHP ------------------------3333-------------------------------- AEFGADAQAAMKKALELFRNDIAAKYKELGFHG --------------------------------- >SERINE/THREONINE KINASE; SWP:Q64702; PDB:1MBYA; SVFVKNVGWATQLTSGAVWVQFNDGSQLVMQAGVSSISYTSPDGQTTRYGENEKLPEYIK ---------------------1111---------------1111-----1111--3333- QKLQLLSSILLMFSN ---1111-------- >3',5'-CYCLIC NUCLEOTIDE P; SWP:Q922S4; PDB:1MC0A; YTDHDRKILQLCGELFDLDATSLQLKVLQYLQQETQATHCCLLLVSEDNLQLSCKVIGDK -----------1111-----------------1111---------1111----------- VLGEEVSFPLTMGRLGQVVEDKQCIQLKDLTSDDVQQLQNMLGCELQAMLCVPVISRATD --------3333-3333--------3333------------------------------- QVVALACAFNKLGGDFFTDEDEHVIQHCFHYTGTVLTSTLAFQKEQKLKCECQALLQVAK ------------------------------------------------------------ NLFTHLDDVSVLLQEIITEARNLSNAEICSVFLLDQNELVAKVFDGGVVDDESYEIRIPA ----1111-------------1111------------------iiii------------- DQGIAGHVATTGQILNIPDAYAHPLFYRGVDDSTGFRTRNILCFPIKNENQEVIGVAELV ------------------33331111-3333----------------1111--------- NKINGPWFSKFDEDLATAFSIYCGISIAHSLLYKKVNEAQY --------3333----------------------------- >ACUTOHAEMONLYSIN; SWP:O57385; PDB:1MC2A; SLFELGKMIWQETGKNPVKNYGLYGCNCGVGGRGEPLDATDRCCFVHKCCYKKLTDCDSK ---------------3333--------------------------------------333 KDRYSYKWKNKAIVCGKNQPCMQEMCECDKAFAICLRENLDTYNKSFRYHLKPSCKKTSE 3-------%%%%--------------------------1111-3333---3333------ QC -- >GLUCOSE-1-PHOSPHATE THYMI; SWP:P27831; PDB:1MC3A; HKGIILAGGSGTRLHPITRGVSKQLLPIYDKPIYYPLSVLLAGIREILIITTPEDKGYFQ ----------3333-------1111--------------------------3333----- RLLGDGSEFGIQLEYAEQPSPDGLAQAFIIGETFLNGEPSCLVLGDNIFFGQGFSPKLRH 
--!!!!1111------------3333--11113333------------------------ VAARTEGATVFGYQVDPERFGVVEFDDNFRAISLEEKPKQPKSNWAVTGLYFYDSKVVEY ------------------------------------------------------------ AKQVKPSERGELEITSINQYLEAGNLTVELLGRGFAWLDTGTHDSLIEASTFVQTVEKRQ ------3333----------3333------------------------------------ GFKIACLEEIAWRNGWLDDEGVKRAASSLAKTGYGQYLLELLRARP -----------------3333-------1111-------------- >Segment polarity protein ; SWP:P51141; PDB:1MC7A; TVTLNMERHHFLGISIVGQSNDRGDGGIYIGSIMKGGAVAADGRIEPGDMLLQVNDVNFE -------------------------------------3333----1111----!!!!--- NMSNDDAVRVLREIVSQTGPISLTVAKAWDPTPRS -----3333-------------------------- >FLAP ENDONUCLEASE-1; SWP:O50123; PDB:1MC8A; GVPIGDLVPRKEIDLENLYGKKIAIDALNAIYQFLSTIRQEDGTPLMDSKGRITSHLSGL ---3333----------2222------------------1111----1111--------- FYRTINLMEAGIKPAYVFDGKPPEFKRKELEKRREAREEAELKWKEALAKGNLEEARKYA ----------------------------------------------3333---------- QRATKVNEMLIEDAKKLLQLMGIPIIQAPSEGEAQAAYMASKGDVYASASQDYDSLLFGA ------3333------------------------------------------3333---- PRLIRNLTITGKRKMPGKDVYVEIKPELVVLDEVLKELKITREKLIELAILVGTDYNPGG -----1111--------------------------------------------1111--- VKGIGPKKALEIVRYSRDPLAKFQRQSDVDLYAIKEFFLNPPVTNEYSLSWKEPDEEGIL -----3333---------3333--------------------------------3333-- KFLCDEHNFSEERVKNGIERLKKAIKAGRQS ------------------------1111--- >Ig heavy chain V region M; SWP:P01789; PDB:1MCPH; EVKLVESGGGLVQPGGSLRLSCATSGFTFSDFYMEWVRQPPGKRLEWIAASRNKGNKYTT ------------2222-------------------------------------------- EYSASVKGRFIVSRDTSQSILYLQMNALRAEDTAIYYCARNYYGSTWYFDVWGAGTTVTV --3333----------------------3333---------------------------- SSESARNPTIYPLTLPPALSSDPVIIGCLIHDYFPSGTMNVTWGKSGKDITTVNFPPALA ------------------------------------------------------------ SGGRYTMSNQLTLPAVECPEGESVKCSVQHDSNPVQELDVNC -------------1111------------!!!!--------- >IGL@ protein; SWP:Q6PIK1; PDB:1MCWW; SALTQPASVSGSPGQSITVSCAGHTSDVADSNSISWFQQHPDKAPKLLIYAVTFRPSGIP 
------------------------------------------------------------ LRFSGSKSGNTASLTISGLLPDDEADYFCMSYLSDASFVFGSGTKVTVLRQPKANPTVTL ------------------------------------------------------------ FPPSSEELQANKATLVCLISDFYPGAVTVAWKADGSPVEAGVETTKPSKQSNNKYAASSY ----3333---------------------------------------------------- LSLTPEQWKSHRSYSCQVTHEGSTVEKTVAPTECS ------1111------------------------- >INTERLEUKIN 1 FAMILY, MEM; SWP:Q9QYY1; PDB:1MD6A; VLSGALCFRMKDSALKVLYLHNNQLLAGGLHAEKVIKGEEISVVPNRALDASLSPVILGV -----------1111-----%%%%----3333-------------11113333------% QGGSQCLSCGTEKGPILKLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFL %%%-------------------3333-------3333--------------3333----- CTSPEADQPVRLTQIPEDPAWDAPITDFYFQQCD -------------------1111----------- >C1R COMPLEMENT SERINE PRO; SWP:P00736; PDB:1MD8A; DCGQPRNLPNGDFRYTTTMGVNTYKARIQYYCHEPYYKMQTQGVYTCTAQGIWKNEQKGE -------2222------2222-2222---------------------1111--------- KIPRCLPVCGKPVNPVEQRIIGGQKAKMGNFPWQVFTNIHGRGGGALLGDRWILTAAHTL --------------------------22221111---------------------3333- YPKEHASLDVFLGHTNVEELMKLGNHPIRRVSVHPDYRQDESYNFEGDIALLELENSVTL ---------------------------------3333-------2222------------ GPNLLPICLPDNDTFYDLGLMGYVSGFGVMEEKIAHDLRFVRLPVANPQACENWLRGKNR 1111------------2222-----------------------------------1111- MDVFSQNMFCAGHPSLKQDACQGDSGGVFAVRDPNTDRWVATGIVSWGIGCSRGYGFYTK ----1111----1111----2222-----------------------------------3 VLNYVDWIKKEMEE 333----------- >Amicyanin [Precursor]; SWP:P22364; PDB:1MDAH; EKSKVAGSAAAASAAAASDGSSCDHGPGAISRRSHITLPAYFAGTTENWVSCAGCGVTLG -----------------------------1111-----%%%%------------------ HSLGAFLSLAVAGHSGSDFALASTSFARSAKGKRTDYVEVFDPVTFLPIADIELPDAPRF ------------1111-------------------------------------------- SVGPRVHIIGNCASSACLLFFLFGSSAAAGLSVPGASDDQLTKSASCFHIHPGAAATHYL ----2222---1111--------------------------------------1111--- GSCPASLAASDLAAAPAAAGIVGAQCTGAQNCSSQAAQANYPGMLVWAVASSILQGDIPA --------------------------3333------------------------------ 
AGATMKAAIDGNESGRKADNFRSAGFQMVAKLKNTDGIMILTVEHSRSCLAAAENTSSVT -----------------------------------------------1111--------- ASVGQTSGPISNGHDSDAIIAAQDGASDNYANSAGTEVLDIYDAASDQDQSSVELDKGPE ---------------------------------1111----------------------- SLSVQNEA -------- >AMICYANIN; SWP:NA; PDB:1MDAL; VDPRAKWQPQDNDIQACDYWRHCSIAGNICDCSAGSLTSCPPGTLVASGSVGSCYNPPDP ---------------11111111-----3333---------------------------- NKYITAYRDCCGYNVSGRCACLNTEGELPVYNKDANDIIWCFGGEDGMTYHCSISPVSGA ------------------------------------------------------------ >2,3-DIHYDROXYBENZOATE-AMP; SWP:P40871; PDB:1MDBA; MLKGFTPWPDELAETYRKNGCWAGETFGDLLRDRAAKYGDRIAITCGNTHWSYRELDTRA -2222--------------------------------3333----!!!!----------- DRLAAGFQKLGIQQKDRVVVQLPNIKEFFEVIFALFRLGALPVFALPSHRSSEITYFCEF ------------2222--------3333-----------------11113333-----11 AEAAAYIIPDAYSGFDYRSLARQVQSKLPTLKNIIVAGEAEEFLPLEDLHTEPVKLPEVK 11---------iiii3333--------3333--------!!!!-3333-----------1 SSDVAFLQLSGGSTGLSKLIPRTHDDYIYSLKRSVEVCWLDHSTVYLAALPMAHNYPLSS 111-------------------------------------1111------1111------ PGVLGVLYAGGRVVLSPSPSPDDAFPLIEREKVTITALVPPLAMVWMDAASSRRDDLSSL -------------------3333-------------------------1111----3333 QVLQVGGAKFSAEAARRVKAVFGCTLQQVFGMAEGLVNYTRLDDPEEIIVNTQGKPMSPY ----------33331111-------------3333-----1111-------------111 DESRVWDDHDRDVKPGETGHLLTRGPYTIRGYYKAEEHNAASFTEDGFYRTGDIVRLTRD 1-----1111---2222-----------------3333-----1111----------111 GYIVVEGRAKDQINRGGEKVAAEEVENHLLAHPAVHDAAMVSMPDQFLGERSCVFIIPRD 1------3333---%%%%--3333-------1111------------------------- EAPKAAELKAFLRERGLAAYKIPDRVEFVESFPQTGVGKVSKKALREAISEKLLAG ---3333-----1111-1111-------------1111------------------ >INSECT FATTY ACID BINDING; SWP:P31417; PDB:1MDC; SYLGKVYSLVKQENFDGFLKSAGLSDDKIQALVDKPTQKMEANGDSYSNTTGGGGAKTVS -2222--------------1111-------------------!!!!-----2222----- FKSGVEFDDVIGAGDSVKSMYTVDGNVVTHVVKGDAGVATFKKEYNGDDLVVTITSSNWD -2222------------------!!!!------3333--------!!!!------1111- GVARRYYKA --------- 
>MANDELATE RACEMASE; SWP:P11444; PDB:1MDL; EVLITGLRTRAVNVPLAYPVHTAVGTVGTAPLVLIDLATSAGVVGHSYLFAYTPVALKSL --------------------------------------1111----------3333---- KQLLDDMAAMIVNEPLAPVSLEAMLAKRFCLAGYTGLIRMAAAGIDMAAWDALGKVHETP ------33332222------------1111------------------------1111-- LVKLLGANARPVQAYDSHSLDGVKLATERAVTAAELGFRAVKTRIGYPALDQDLAVVRSI -3333-------------------------------------------3333-------- RQAVGDDFGIMVDYNQSLDVPAAIKRSQALQQEGVTWIEEPTLQHDYEGHQRIQSKLNVP -------------%%%%-------------3333--------1111-------1111--- VQMGENWLGPEEMFKALSIGACRLAMPDAMKIGGVTGWIRASALAQQFGIPMSSHLFQEI ---1111---------1111-------3333-------------------------3333 SAHLLAATPTAHWLERLDLAGSVIEPTLTFEGGNAVIPDLPGVGIIWREKEIGKYLV -------------------3333-------iiii--------------3333----- >ARNB AMINOTRANSFERASE; SWP:Q8ZNF3; PDB:1MDOA; DFLPFSRPAGAEELAAVKTVLDSGWITTGPKNQELEAAFCRLTGNQYAVAVSSATAGHIA --------------------3333----------------------------3333---- LALGIGEGDEVITPSTWVSTLNIVLLGANPVVDVDRDTLVTPEHIEAAITPQTKAIIPVH -----2222--------------1111-------1111-------11111111------2 YAGAPADLDAIYALGERYGIPVIEDAAHATGTSYKGRHIGARGTAIFSFHAIKNITCAEG 222------------1111------1111----iiii2222--------1111------- GIVVTDNPQFADKLRSLKFHGLGVDQAEVLAPGYKYNLPDLNAAIALAQLQKLDALNARR ------------------iiii-------------------------------------- AAIAAQYHQAADLPFQPLSLPSWEHIHAWHLFIIRVDEARCGITRDALASLKTKGIGTGL ------------------------------------3333---3333------------- HFRAAHTQKYYRERFPTLTLPDTEWNSERICSLPLFPDTESDFDRVITALHQIAG ---1111-------1111--------------------3333------------- >CALPAIN II, CATALYTIC SUB; SWP:Q07009; PDB:1MDWA; ERAIKYLNQDYETLRNECLEAGALFQDPSFPALPSSLGFKELGPYSSKTRGIEWKRPTEI ----2222------------------1111--3333------11111111-----3333- CADPQFIIGGATRTDICQGALGDSWLLAAIASLTLNEEILARVVPLDQSFQENYAGIFHF ------------1111-------------------------------------------- QFWQYGEWVEVVVDDRLPTKDGELLFVHSAEGSEFWSALLEKAYAKINGCYEALSGEGFE ---%%%%------------iiii-------1111---------------3333-----33 
DFTGGIAEWYELRKPPPNLFKIIQKALEKGSLLGCSIDITSAADSEAVTYQKLVKGHAYS 33--------3333------------1111----------3333----1111-------- VTGAEEVESSGSLQKLIRIRNPWGQVEWTGKWNDNCPSWNTVDPEVRANLTERQEDGEFW --------iiii--------3333-----2222--3333--------------------- MSFSDFLRHYSRLEICNLT ------------------- >PROTEIN (MYOD BHLH DOMAIN; SWP:P10085; PDB:1MDYA; MELKRKTTNADRRKAATMRERRRLSKVNEAFETLKRSTSSNPNQRLPKVEILRNAIRYIE ----------------------------------------3333---------------- GLQALLRD -------- >CRUZIPAIN; SWP:P25779; PDB:1ME4A; APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTD -----3333---------!!!!-3333-----------1111------------------ SGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQ !!!!--------------iiii---1111---1111------------------------ DEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMTSCVSEQLDHGVLLVGYNDSAAVPYWII ----------------------1111---------------------------------- KNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVVG ----3333-iiii-------2222----------- >INOSINE-5'-MONOPHOSPHATE ; SWP:P50097; PDB:1ME8A; AKYYNEPCHTFNEYLLIPGLSTVDCIPSNVNLSTPLVKFQKGQQSEINLKIPLVSAIMQS ---------3333--------11113333----------2222--------------111 VSGEKMAIALAREGGISFIFGSQSIESQAAMVHAVKNFKAHNELVDSQKRYLVGAGINTR 1-----------------------------------1111-----1111----------- DFRERVPALVEAGADVLCIDSSDGFSEWQKITIGWIREKYGDKVKVGAGNIVDGEGFRYL 3333-----------------------------------!!!!----------------- ADAGADFIKIGIGGGSIITREQKGIGRGQATAVIDVVAERNKYFEETGIYIPVCSDGGIV ------------------1111-------------------------------------- YDYHMTLALAMGADFIMLGRYFARFEESPTRKVTINGSVMKEYWGEGSSRARNWEGVDSY 3333--------------3333--1111------iiii------1111------------ VPYAGKLKDNVEASLNKVKSTMCNCGALTIPQLQSKAKITLVSSVSIVEGGAHDVI ----------------------1111----------------11113333------ >EGLIN C; SWP:P07518; PDB:1MEEA; AQSVPYGISQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLNVRGGASFVPSETNPYQD ----3333--------------2222---------1111-----------1111-----1 GSSHGTHVAGTIAALNNSIGVLGVAPSASLYAVKVLDSTGSGQYSWIINGIEWAISNNMD 111---------------------1111--------1111--------------1111-- 
VINMSLGGPTGSTALKTVVDKAVSSGIVVAAAAGNEGSSGSTSTVGYPAKYPSTIAVGAV --------------------------------------!!!!-----3333--------- NSANQRASFSSAGSELDVMAPGVSIQSTLPGGTYGAYNGTSMATPHVAGAAALILSKHPT ------1111--1111----------------------3333-----------3333333 WTNAQVRDRLESTATYLGSSFYYGKGLINVQAAAQ 3-----------------3333!!!!--3333--- >CARICAIN; SWP:P10056; PDB:1MEG; LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS -----3333--------------3333--------------------------------- HGCKGGYPPYALEYVAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGN !!!!-----------------3333----------1111--------------------- LLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVEHAVTAVGYGKSGGKGYILIKNS ----3333---------------------------------------------------- WGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTKN -1111-iiii-----------2222----------- >PHOSOPHORIBOSYLGLYCINAMID; SWP:P22102; PDB:1MEOA; ARVAVLISGTGSNLQALIDSTREPNSSAQIDIVISNKAAVAGLDKAERAGIPTRVINHKL ----------2222--------------------------------1111------1111 YKNRVEFDSAIDLVLEEFSIDIVCLAGFMRILSGPFVQKWNGKMLNIHPSLLPSFKGSNA ---------------1111--------------------2222----------------- HEQALETGVTVTGCTVHFVAEAGQIILQEAVPVKRGDTVATLSERVKLAEHKIFPAALQL ---------------------------------2222----------------------- VASGTVQLGENGKICWVKEEHH --------1111---------- >FAB 29G12 LIGHT CHAIN; SWP:NA; PDB:1MEXH; QVQLQQSDAELVKPGASVKISCKASGYTFTDHAIHWVKQKPGLEWIGYISPGNGDIKYNE ------------2222-----------1111---------------------------33 KFKGKATLTADKSSSTAYMQLNSLTSEDSAVYFCKMEYLDYWGQGTTLTVSSGGTTPPSV 33---------1111---------3333-------------------------------- YPLAPGSAAQTNSVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVT -----1111-----------------------iiii------------------------ VPSSTWPSQSVTCNVAHPASSTAVDKKIAPA ----------------3333----------- >Igk-C protein; SWP:Q58EU4; PDB:1MEXL; DIVMTQSQKFMSTSLGNRVSVTCKASQNVGTNVAWFQQKPGQSPKTLIYSASYRYSGVPD -------------2222-----------!!!!------2222------------222233 RFTGSGSGTDFTLTINNVQSEDLAEYFCQQYNSYPYTFGGGTKLEIKRADAAPTVSIFPP 
33----------------3333-------------------------------------- SSEQLTGGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 33331111---------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNE -33331111--------1111--------3333 >DNA; SWP:NA; PDB:1MEYC; EKPYKCPECGKSFSQSSNLQKHQRTHTGEKPYKCPECGKSFSQSSDLQKHQRTHTGEKPY --------------3333---1111-------------------------1111------ KCPECGKSFSRSDHLSRHQRTHQ ----------3333----3333- >INTEGRIN ALPHA M; SWP:P11215; PDB:1MF7A; CPQEDSDIAFLIDGSGSIIPHDFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFK --------------3333--------------------1111------------------ EFQNNPNPRSLVKPITQLLGRTHTATGIRKVVRELFNITNGARKNAFKILVVITDGEKFG ------3333-1111---------------------3333--2222-------------- DPLGYEDVIPEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKT ---3333-----1111--------3333-3333----------3333------3333333 IQNQLREKIFCIGS 3-------1111-- >COPPER,ZINC SUPEROXIDE DI; SWP:P00441; PDB:1MFMA; ATKAVAVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEEEDNTAGCTSA ------------------------------------------------------------ GPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHSIIGRTLVVH ----1111----1111---1111------1111--------------11112222----- EKADDLGKGGNEQSTKTGNAGSRLACGVIGIAQ ----iiii--3333------------------- >7S RNA OF HUMAN SRP; SWP:P13624; PDB:1MFQC; KHGQFTLRDMYEQFQNIMKMGPFSQILGMIPGFNEQESMARLKKLMTIMDSMNDQELDST ---------------------3333---------1111----------1111-3333--- DGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVKKMGGIK --------3333------------------------------------ >FOUR-HELIX BUNDLE MODEL; SWP:NA; PDB:1MFTA; DYLRELYKLEQQAMKLYREASEKARNPEKKSVLQKILEDEEKHIEWLETIN --------------------3333----------------------3333- >PROTEIN (HTLV-1 GP21 ECTO; SWP:P02928; PDB:1MG1A; GKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAH -------1111--------------------------3333-----1111--------33 DRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPN 33---------------3333----33333333-%%%%---------------1111--- 
PPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVD ---3333-------3333----------33333333----------------1111---- NAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGV --------------1111-------3333--3333--------3333---3333------ TVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVAL -----iiii-------------1111-3333----------1111--------------3 KSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDAALAAA 333---11113333--------------1111-------------------3333----- QTNAAAMSLASGKSLLHEVDKDISQLTQAIVKNHKNLLKIAQYAAQNRRGLDLLFWEQGG ------------------------------------------------------------ LCKALQEQCCFLNITNSHVSILQERPPLEN 3333-----------3333-3333------ >DOUBLECORTIN-LIKE KINASE ; SWP:O15075; PDB:1MG4A; KKAKKVRFYRNGDRYFKGIVYAISPDRFRSFEALLADLTRTLSDNVNLPQGVRTIYTIDG ---------2222----------3333--------------------3333-----1111 LKKISSLDQLVEGESYVCGSIEPFKKLEYTKNVNPNWSVNV -----3333-2222-------------1111---3333--- >ALCOHOL DEHYDROGENASE; SWP:P00334; PDB:1MG5A; SFTLTNKNVIFVAGLGGIGLDTSKELLKRDLKNLVILDRIENPAAIAELKAINPKVTVTF ---2222--------3333------3333-------------------11113333---- YPYDVTVPIAETTKLLKTIFAQLKTVDVLINGAGILDDHQIERTIAVNYTGLVNTTTAIL ---11113333----------------------------------------------333 DFWDKRKGGPGGIICNIGSVTGFNAIYQVPVYSGTKAAVVNFTSSLAKLAPITGVTAYTV 3--1111-----------3333---1111-------------------3333-------- NPGITRTTLVHKFNSWLDVEPQVAEKLLAHPTQPSLACAENFVKAIELNQNGAIWKLDLG ------3333-----%%%%-------3333-------------------2222----iii TLEAIQWTKHWDSGI i-------------- >EARLY SWITCH PROTEIN XOL-; SWP:Q23229; PDB:1MG7A; ERRVKILGIDRSENSPVLTYMSKLAAAPHTVHMMDSGFLAINRQCLVKGKAILAREPKSS --------------3333--------------%%%%----------------------11 NEHMIDDLPKHAHDQHTLSILRDFIDQLKLHNVYEINFYDPLDSSGKLAVIPMLIALWKC 11----------------------3333--------------1111-------------- MLASETDICDQEVLKSIMNSVIAKFELQIPCKNAVIDATLSGSREEVHIIAESNGTTEHF 1111-------------------------------------------------------- NKKHDLVFVKTDLHPEDFTPQMFPSQAKAKLLRDAFNNEEDEDTFPDILVPAYMTAHSKN 
----------1111---------------------1111-11113333------------ RVRQEDYTCLEVEFDSQVALEKLMNEHEQVEGFEVQQGGILVALKKDSFFDDELIEKIAI ---------3333-------------3333-----1111-----2222------------ AIATESRQSVSSVSFDLLKLGPGASLVTLANSRRFEPECRVVLQIEVKPVS ---------------------------1111-------------------- >PARKIN; SWP:NA; PDB:1MG8A; MGMIVFVRFNSSYGFPVEVDSDTSILQLKEVVAKRQGVPADQLRVIFAGKELPNHLTVQN -------------------1111---------------3333----iiii--11111111 CDLEQQSIVHIVQRPRRR ------------------ >SM-LIKE PROTEIN; SWP:O26745; PDB:1MGQA; RVNVQRPLDALGNSLNSPVIIKLKGDREFRGVLKSFDLHMNLVLNDAEELEDGEVTRRLG --1111-----1111-------2222----------1111----------%%%%------ TVLIRGDNIVYISP ----3333------ >GUANYL-SPECIFIC RIBONUCLE; SWP:P30289; PDB:1MGRA; VKAVGRVCYSALPSQAHDTLDLIDEGGPFPYSQDGVVFQNREGLLPAHSTGYYHEYTVIT -------3333-3333--------------1111-----1111-----2222-------2 PGSPTRGARRIITGQQWQEDYYTADHYASFRRVDFAC 222------------2222-----iiii--------- >HUMAN MELANOMA GROWTH STI; SWP:P09341; PDB:1MGSA; ASVATELRCQCLQTLQGIHPKNIQSVNVKSPGPHCAQTEVIATLKNGRKACLNPASPIVK ------------------1111------------------------------1111---- KIIEKMLNSDKSN ------------- >O6-METHYLGUANINE-DNA METH; SWP:O74023; PDB:1MGTA; MLSVEKFRVGERVVWIGVIFSGRVQGIAFAFDRGTLMKRIHDLAEHLGKRGVSISLDVQP --------!!!!-----------------------------------1111--------- SDYPEKVFKVLIGELDNASFLRELSFEGVTPFEKKVYEWLTKNVKRGSVITYGDLAKALN ---------------33333333--2222---------------2222--------1111 TSPRAVGGAMKRNPYPIVVPCHRVVAHDGIGYYSSGIEEKKFLLEIEGV ---------1111------3333--1111---1111--------1111- >PHOSPHOLIPASE A2; SWP:P60043; PDB:1MH2A; NTYQFQNMIQCTVPKRSWRDFADYGCYCGRGGSGTPIDDLDSCCQVHDNCYNSAREQGGC 3333--------------1111---------------3333------------------- RPKQKTYTYQCKAGGLSCSGANNSCAATTCDCDRLAAICFAGAPYNDNNYNIDLKARCQ 3333-------iiii------------------------------3333---3333--- >Phospholipase A2 isoform ; SWP:P60044; PDB:1MH2B; NTWQFKNMISCTVPSRSWWDFADYGCYCGRGGSGTPSDDLDRCCQTHDNCYNEAEKISGC 3333------------3333------------------3333--------------2222 
NPRFRTYSYACTAGTLTCTGRNNACAASVCDCDRNAAICFAGAPYNDSNYNIDLQARCN 3333-------iiii------------------------------1111---3333--- >IMMUNOGLOBULIN MS6-164; SWP:NA; PDB:1MH5B; VQLQQPGAELVKPGASVKLSCKASGYTFTSNWINWVKQRPGQGLEWIGNIYPDSYRTNYN -----------2222-----------1111--------2222-----------------3 EKFKRKATLTVDTSSSTAYMQLSSL 333--------1111---------- >PHOSPHOLIPASE A2; SWP:P60043; PDB:1MH8A; NIYQFKNMIECTVPARSWWDFADYGCYCGGGGSGTPTDDLDRCCQVHDNCYNQAQEITGC ------------------1111---------------3333---------------2222 RPKWKTYTYQCTQGTLTCKGRNNACAATTCDCDRLAAICFAGAPYNDTNYNIDLKARCQ 3333-------%%%%------------------------1111--3333---3333--- >MHC CLASS I ANTIGEN H2-M3; SWP:Q31093; PDB:1MHCA; GSHSLRYFHTAVSRPGRGEPQYISVGYVDDVQFQRCDSIEEIPRMEPRAPWMEKERPEYW ---------------------------!!!!-------iiii------3333---3333- KELKLKVKNIAQSARANLRTLLRYYNQSEGGSHILQWMVSCEVGPDMRLLGAHYQAAYDG ----------------------1111-----------------1111----------iii SDYITLNEDLSSWTAVDMVSQITKSRLESAGTAEYFRAYVEGECLELLHRFLRNGKEILQ i-----1111--------------------3333-------------------------- RADPPKAHVAHHPRPKGDVTLRCWALGFYPADITLTWQKDEEDLTQDMELVETRPSGDGT -------------------------------------------3333------------- FQKWAAVVVPSGEEQRYTCYVHHEGLTEPLALKWRS ------------3333------1111---------- >Protein L [Precursor]; SWP:Q51918; PDB:1MHHB; QIQLVQSGPELKKPGETVKISCKASGYTFTDFSMHWVNQAPGKGLNWMGWVNTETGEPTY ------------2222-----------1111--------2222----------------- ADDFKGRFAFSLETSASTAYLQINSL 1111--------3333---------- >S-ADENOSYLMETHIONINE DECA; SWP:Q04694; PDB:1MHMA; SLFVYSYKIIIKTCGTTKLLLAIPPILRLAETLSLKVQDVRYTRGSRHFSEEVAVLDGYF -------------33333333---------1111-------------------------- GKLAAGSKAVIMGSPDKTQKWHVYSASAGSVQSNDPVYTLEMCMTGLDREKASVFYKTEE --1111-------------------------------------------3333------- SSAAHMTVRSGIRKILPKSEICDFEFEPCGYSMNSIEGAAVSTIHITPEDGFTYASFESV -------333311111111-----------------!!!!--------2222-------- GYNPKTMELGPLVERVLACFEPAEFSVALHADVATKLLERICSVDVKGYSLAEWSPEEFG ---------------------------------------------2222---------!! 
EGGSIVYQKFTRT !!----------- >S-adenosylmethionine deca; SWP:Q04694; PDB:1MHMB; FEKRLEISFVEPGLFGKGLRSLSKAQLDEILGPAECTIVDNLSNDYVDSYVLSE -----------------3333-3333-----1111------------------- >SURVIVAL MOTOR NEURON PRO; SWP:Q16637; PDB:1MHNA; LQQWKVGDKCSAIWSEDGCIYPATIASIDFKRETCVVVYTGYGNREEQNLSDLLSPICE ----2222---------------------1111-----2222------3333--3333- >S-100 PROTEIN; SWP:P02638; PDB:1MHO; SELEKAVVALIDVFHQYSGREGDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMETL ----------------------1111----------------------3333-------- DSDGDGECDFQEFMAFVAMITTACHEFF 1111------------------------ >IGHG1 protein; SWP:Q569F4; PDB:1MHPH; EVQLVESGGGLVQPGGSLRLSCAASGFTFSRYTMSWVRQAPGKGLEWVATISGGGHTYYL ------------2222-----------3333--------2222--------1111----3 DSVKGRFTISRDNSKNTLYLQMNSLRAEDTAVYYCTRGFGDGGYFDVWGQGTLVTVSSAS 333--------3333----------3333---------!!!!------------------ TKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGL -----------3333-----------------------%%%%--2222-------1111- YSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK ---------3333-----------3333----------- >ADP-RIBOSYLATION FACTOR B; SWP:Q9UJY4; PDB:1MHQA; SLELWLNKATDPSMSEQDWSAIQNFCEQVNTDPNGPTHAPWLLAHKIQSPQEKEALYALT -------------------------------3333------------------------- VLEMCMNHCGEKFHSEVAKFRFLNELIKVLSPKYLGSWATGKVKGRVIEILFSWTVWFPE -------------------------3333---2222---------------------333 DIKIRDAYQMLKKQGIIKQDPKL 3---------------------- >IMMUNOGLOBULIN-BINDING PR; SWP:NA; PDB:1MHXA; MHHHHHHAMDTYKLFIVIGDRVVVVTTEAVDAATAEKVFKQYANDNGVDGEWTYDDAAKT -------------------------------------------1111--------1111- FTVTE ----- >Methane monooxygenase com; SWP:P27354; PDB:1MHYB; KRGLTDPERAAIIAAAVPDHALDTQRKYHYFIQPRWKPLSEYEQLSCYAQPNPDWIAGGL -3333-----------------------1111-------------2222---1111---- DWGDWTQKFHGGRPSWGNESTELRTTDWYRHRDPARRWHHPYVKDKSEEARYTQRFLAAY ----------------1111------1111--1111------------------------ SSEGSIRTIDPYWRDEILNKYFGALLYSEYGLFNAHSSVGRDCLSDTIRQTAVFAALDKV 
-----1111--------------------------------------------------- DNAQMIQMERLFIAKLVPGFDASTDVPKKIWTTDPIYSGARATVQEIWQGVQDWNEILWA ----------------2222-------------3333----------------------- GHAVYDATFGQFARREFFQRLATVYGDTLTPFFTAQSQTYFQTTRGAIDDLFVYCLANDS -------------------3333-----3333---------------------------- EFGAHNRTFLNAWTEHYLASSVAALKDFVGLYAKVEKVAGATDSAGVSEALQRVFGDWKI -------------------------------1111--2222------------------- DYADKIGFRVDVDQKVDAVLAGY -3333--------------1111 >Methane monooxygenase com; SWP:P27353; PDB:1MHYD; NRAPVGVEPQEVHKWLQSFNWDFKENRTKYPTKYHMANETKEQFKVIAKEYARMEAAKDE -------3333-----3333--1111----------1111-------------------- RQFGTLLDGLTRLGAGNKVHPRWGETMKVISNFLEVGEYNAIAASAMLWDSATAAEQKNG -------------3333------------------------------------------- YLAQVLDEIRHTHQCAFINHYYSKHYHDPAGHNDARRTRAIGPLWKGMKRVFADGFISGD ---------------------------------3333---------------3333---- AVECSVNLQLVGEACFTNPLIVAVTEWASANGDEITPTVFLSVETDELRHMANGYQTVVS ----------------------------1111--3333---------------------- IANDPASAKFLNTDLNNAFWTQQKYFTPVLGYLFEYGSKFKVEPWVKTWNRWVSEDWGGI 11113333-------------------------1111----------------------- WIGRLGKYGVESPRSLRDAKRDAYWAHHDLALAAYAMWPLGFARLALPDEEDQAWFEANY ----3333----1111---1111-------------3333-------------------2 PGWADHYGKIFNEWKKLGYEDPKSGFIPYQWLLANGHDVYIDRVSQVPFIPSLAKGTGSL 222--------------1111-----3333--1111-------------3333------- RVHEFNGKKHSLTDDWGERQWLIEPERYECHNVFEQYEGRELSEVIAEGHGVRSDGKTLI ----iiii---------------1111----3333-22223333--------1111---- AQPHTRGDNLWTLEDIKRAGCVFPDPLAKF ------------------------1111-- >Methane monooxygenase com; SWP:P27355; PDB:1MHYG; AKREPIHDNSIRTEWEAKIAKLTSVDQATKFIQDFRLAYTSPFRKSYDIDVDYQYIERKI ------------------1111------------------1111----1111-------- EEKLSVLKTEKLPVADLITKATTGEDAAAVEATWIAKIKAAKSKYEAEAIHIEFRQLYKP ------------3333----1111-------------1111-3333-------------- PVLPVNVFLRTDAALGTVLMEIRNTDYYGTPLEGLRKERGVKVLHLQ -------------------------1111------------------ >IMMUNOGLOBULIN-BINDING PR; SWP:NA; 
PDB:1MI0A; HHHAMDTYKLVIVLNGTTFTYTTEAVDAATAEKVFKQYANDNGVDGEWTYADATKTFTVT ---------------------------------------1111--------1111----- E - >NEUROBEACHIN; SWP:Q8NFP9; PDB:1MI1A; GPVVLSTPAQLIAPVVVAKGTLSITTTEIYFEVDEDDSAFKKIDTKVLAYTEGLHGKWFS ---------------------------------11113333--33331111-------33 EIRAVFSRRYLLQNTALEVFANRTSVFNFPDQATVKKVVYSLPRVGVGTSYGLPQARRIS 33-------iiii-----------------3333----1111--!!!!1111-------- LATPRQLYKSSNTQRWQRREISNFEYLFLNTIAGRTYNDLNQYPVFPWVLTNYESEELDL ---------------------3333----------1111--------------------- TLPGNFRDLSKPIGALNPKRAVFYAERYETWEDDQSPPYHYNTHYSTATSTLSWLVRIEP -3333------1111---------------------------------------1111-- FTTFFLNANDGKFDHPDRTFSSVARSWRTSQRDTSDVKELIPEFYYLPEFVNSNGYNLGV --------iiii--1111--------------1111----3333---------------- REDEVVVNDVDLPPWAKKPEDFVRINRALESEFVSCQLHQWIDLIFGYKQRGPEAVRALN ------------1111-3333------11113333-----------1111-33331111- VFHYLTYEGSVNLDSITDPVLREAEAQIQNFGQTPSQLLIEPHPPR --11112222-3333--3333------------------------- >MACROPHAGE INFLAMMATORY P; SWP:P10889; PDB:1MI2A; AVVASELRCQCLKTLPRVDFKNIQSLSVTPPGPHCAQTEVIATLKGGQKVCLDPEAPLVQ -------------------3333---------------------------------1111 KIIQKILNKGKAN ------------- >XYLOSE REDUCTASE; SWP:O74237; PDB:1MI3A; SIPDIKLSSGHLMPSIGFGCWKLANATAGEQVYQAIKAGYRLFDGAEDYGNEKEVGDGVK ------1111---------22223333------------------3333----------- RAIDEGLVKREEIFLTSKLWNNYHDPKNVETALNKTLADLKVDYVDLFLIHFPIAFKFVP --1111--3333-------1111-3333-------------------------------3 IEEKYPPGFYCGDGNNFVYEDVPILETWKALEKLVAAGKIKSIGVSNFPGALLLDLLRGA 333---!!!!--!!!!------3333-----------------------------3333- TIKPAVLQVEHHPYLQQPKLIEFAQKAGVTITAYSSFGPQSFVEMNQGRALNTPTLFAHD -----------1111---------1111------11113333-----3333---3333-- TIKAIAAKYNKTPAEVLLRWAAQRGIAVIPKSNLPERLVQNRSFNTFDLTKEDFEEIAKL --------------------3333-------------------------------3333- DIGLRFNDPWDWDNIPIFV -------3333-------- >DNAB INTEIN; SWP:Q55418; PDB:1MI8A; AISGDSLISLASTGKRVSIKDLLDEKDFEIWAINEQTMKLESAKVSRVFMTGKKLVYILK 
--3333-----------3333--------------------------------------- TRLGRTIKATANHRFLTIDGWKRLDELSLKEHIALPRKSDISWDSIVSITETGVEEVFDL 3333-----1111---------1111-1111----------------------------- TVPGPHNFVANDIIVHASI ---------%%%%------ >NONSPECIFIC LIPID-TRANSFE; SWP:P07597; PDB:1MIDA; LNCGQVDSKMKPCLTYVQGGPGPSGECCNGVRDLHNQAQSSGDRQTVCNCLKGIARGIHN ---------3333--1111-----------------------------------1111-- LNLNNAASIPSKCNVNVPYTISPDIDCSRIY ---------------------11113333-- >PROTEIN PROSPERO; SWP:P29617; PDB:1MIJA; SSTLTPHLRKAKLFFWVRYPSSAVLKYFPDIKFNKNNTAQLVKWFSNFREFYYIQEKYAR --------------------3333---1111----------------------------- QAVTESELYRVLNLHYNRNNHIEVPQNFRFVVESTLREFFRAIQGGKDTEQSWKKSIYKI ------------------------3333-----------------1111--1111----- ISRDDPVPEYFKSP -------3333--- >SHC ADAPTOR PROTEIN; SWP:P29353; PDB:1MIL; GSQLRGEPWFHGKLSRREAEALLQLNGDFLVRESTTTPGQYVLTGLQSGQPKHLLLVDPE ------1111----33331111------------------------%%%%---------- GVVRTKDHRFESVSHLISYHMDNHLPIISAGSELCLQQPVERKL --------------------3333----iiii------------ >CHIMERIC SDZ CHI621; SWP:NA; PDB:1MIMH; QLQQSGTVLARPGASVKMSCKASGYSFTRYWMHWIKQRPGQGLEWIGAIYPGNSDTSYNQ -------------------------1111--------2222-----------------33 KFEGKAKLTAVTSASTAYMELSSLTHEDSAVYYCSRDYGYYFDFWGQGTTLTVSSASTKG 33--------3333----------1111-------------------------------- PSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSL -----------------------------------%%%%--2222-------1111---- SSVVTVPSSSLGTQTYICNVNHKPSNTKVDKRVEP ------3333------------------------- >CHIMERIC SDZ CHI621; SWP:NA; PDB:1MIML; QIVSTQSPAIMSASPGEKVTMTCSASSSRSYMQWYQQKPGTSPKRWIYDTSKLASGVPAR -------------2222--------------------2222----------------333 FSGSGSGTSYSLTISSMEAEDAATYYCHQRSSYTFGGGTKLEIKRTVAAPSVFIFPPSDE 3----------------3333--------------------------------------- QLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSK --------------------------%%%%----------------------------33 ADYEKHKVYACEVTHQGLSSPVTKSFNRGE 331111------------------------ >NITROGENASE 
MOLYBDENUM IR; SWP:P00467; PDB:1MIOA; SENLKDEILEKYIPKTKKTRSGHIVIKTEETPNPEIVANTRTVPGIITARGCAYAGCKGV ------------3333---3333----------------------------3333----- VMGPIKDMVHITHGPIGCSFYTWGGRRFKSKPENGTGLNFNEYVFSTDMQESDIVFGGVN -3333---------33331111-----------------1111----------------- KLKDAIHEAYEMFHPAAIGVYATCPVGLIGDDILAVAATASKEIGIPVHAFSCEGYKGVS ---33333333-------------3333---3333------------------1111--3 QSAGHHIANNTVMTDIIGKGNKEQKKYSINVLGEYNIGGDAWEMDRVLEKIGYHVNATLT 333---------------------------------%%%%-------------------- GDATYEKVQNADKADLNLVQCHRSINYIAEMMETKYGIPWIKCNFIGVDGIVETLRDMAK ---3333--1111---------------------------------3333---------- CFDDPELTKRTEEVIAEEIAAIQDDLDYFKEKLQGKTACLYVGGSRSHTYMNMLKSFGVD ------------------------1111----------------3333------------ SLVAGFEFAHRDDYEGREVIPTIKIDADSKNIPEITVTPDEQKYRVVIPEDKVEELKKAG ---------3333-----3333---------------------------3333------- VPLSSYGGMMKEMHDGTILIDDMNHHDMEVVLEKLKPDMFFAGIKEKFVIQKGGVLSKQL --------1111-2222------3333---------------3333----1111------ HSYDYNGPYAGFRGVVNFGHELVNGIYTPAWKMITPPWKKASSES -%%%%------3333------------3333-------------- >Nitrogenase molybdenum-ir; SWP:P11347; PDB:1MIOB; LDATPKEIVERKALRINPAKTCQPVGAMYAALGIHNCLPHSHGSQGCCSYHRTVLSRHFK ------------------------------1111---------3333------------- EPAMASTSSFTEGASVFGGGSNIKTAVKNIFSLYNPDIIAVHTTCLSETLGDDLPTYISQ ----------3333---------------------------------------------- MEDAGSIPEGKLVIHTNTPSYVGSHVTGFANMVQGIVNYLSENTGAKNGKINVIPGFVGP -------2222------------------------------------------------- ADMREIKRLFEAMDIPYIMFPDTSGVLDGPTTGEYKMYPEGGTKIEDLKDTGNSDLTLSL -------------------------------------------33331111--------- GSYASDLGAKTLEKKCKVPFKTLRTPIGVSATDEFIMALSEATGKEVPASIEEERGQLID ----------------------------3333---------------------------- LMIDAQQYLQGKKVALLGDPDEIIALSKFIIELGAIPKYVVTGTPGMKFQKEIDAMLAEA ----33332222------3333------------------------3333---3333--- GIEGSKVKVEGDFFDVHQWIKNEGVDLLISNTYGKFIAREENIPFVRFGFPIMDRYGHYY 
------------------------------1111----------------------3333 NPKVGYKGAIRLVEEITNVILDKIERECTEEDFEVVR ----------------------------1111----- >PLASMEPSIN; SWP:O60989; PDB:1MIQA; HLTLAFKIERPYDKVLKTISKKNLKNYIKETFNFFKSGYMKQNYLG --------------------------------3333---------- >TRYPSIN INHIBITOR V; SWP:P19873; PDB:1MIT; GSSCPGKSSWPHLVGVGGSVAKAIIERQNPNVKAVILEEGTPVTKDFRCNRVRIWVNKRG ------------2222------------1111---------------1111-----3333 LVVSPPRIG --------- >TRNA CCA-ADDING ENZYME; SWP:Q7SIB1; PDB:1MIWA; KPPFQEALGIIQQLKQHGYDAYFVGGAVRDLLLGRPIGDVDIATSALPEDVAIFPKTIDV 3333--------------------3333------------------3333---------- GSKHGTVVVVHKGKAYEVTTFKTDGSVTFVRSLEEDLKRRDFTNAIADEYGTIIDPFGGR 3333------iiii----------------------1111-------1111--------- EAIRRRIIRTVGEAEKRFREDALRRAVRFVSELGFALAPDTEQAIVQNAPLLAHISVERT --------------------3333------------------------3333---3333- EEKLLGGPFAARALPLLAETGLNAYLPGLAGKEKQLRLAAAYRWPWLAAREERWALLCHA --33331111---3333----22222222---------11113333------------11 LGVQESRPFLRAWKLPNKVVDEAGAILTALADIPRPEAWTNEQLFSAGLERALSVETVRA 11---3333-1111--------------------3333---------------------- AFTGAPPGPWHEKLRRRFASLPIKTKGELAVNGKDVIEWVGKPAGPWVKEALDAIWRAVV ------------------------3333---3333---------3333------------ NGEVENEKERIYAWLERNRTREKNC -------3333-------------- >TALIN; SWP:P54939; PDB:1MIXA; MKFFYSDQNVDSRDPVQLNLLYVQARDDILNGSHPVSFDKACEFAGYQCQIQFGPHNEQK -----33331111-------------------------------------------3333 HKPGFLELKDFLPKEYIKQKGERKIFMAHKNCGNMSEIEAKVRYVKLARSLKTYGVSFFL -2222-3333--33331111---------1111-----------------1111------ VKEKMKGKNKLVPRLLGITKECVMRVDEKTKEVIQEWSLTNIKRWAASPKSFTLDFGDYQ ------------------1111---------------1111--------------!!!!- DGYYSVQTTEGEQIAQLIAGYIDIIL -----------------1111----- >SANK E3_5 PROTEIN; SWP:NA; PDB:1MJ0A; GSDLGKKLLEAARAGQDDEVRILMANGADVNATDNDGYTPLHLAASNGHLEIVEVLLKNG -----------------------1111-1111-1111-------------------1111 ADVNASDLTGITPLHLAAATGHLEIVEVLLKHGADVNAYDNDGHTPLHLAAKYGHLEIVE 
-1111-1111-----------------------------1111-------1111------ VLLKHGADVNAQDKFGKTAFDISIDNGNEDLAEILQ --1111-1111-1111-------1111----1111- >ENOYL-COA HYDRATASE, MITO; SWP:P14604; PDB:1MJ3A; ANFQYIITEKKGKNSSVGLIQLNRPKALNALCNGLIEELNQALETFEEDPAVGAIVLTGG -----------2222--------3333---------------------1111-------1 EKAFAAGADIKEMQNRTFQDCYSGLSHWDHITRIKKPVIAAVNGYALGGGCELAMMCDII 111-----333311113333------11111111-------------------3333--- YAGEKAQFGQPEILLGTIPGAGGTQRLTRAVGKSLAMEMVLTGDRISAQDAKQAGLVSKI --1111----3333-------1111----------------------------------- FPVETLVEEAIQCAEKIANNSKIIVAMAKESVNAAFEMTLTEGNKLEKKLFYSTFATDDR -1111-----------3333-------------1111---------------1111---- REGMSAFVEKRKANFKDH ------------------ >SULFITE OXIDASE; SWP:P51687; PDB:1MJ4A; STHIYTKEEVSSHTSPETGIWVTLGSEVFDVTEFVDLHPGGPSKLMLAAGGPLEPFWALY -----3333-----3333-----!!!!---33331111---33331111------33333 AVHNQSHVRELLAQYKIGEL 333---------1111---- >1,3,4,6-TETRACHLORO-1,4-C; SWP:P51698; PDB:1MJ5A; SLGAKPFGEKKFIEIKGRRMAYIDEGTGDPILFQHGNPTSSYLWRNIMPHCAGLGRLIAC --------------iiii-----------------------1111-33332222------ DLIGMGDSDKLDPSGPERYAYAEHRDYLDALWEALDLGDRVVLVVHDWGSALGFDWARRH -2222---------1111--------------1111-----------------------1 RERVQGIAYMEAIAMPIEWADFPEQDRDLFQAFRSQAGEELVLQDNVFVEQVLPGLILRP 111--------------3333-3333--------3333---------------1111--- LSEAEMAAYREPFLAAGEARRPTLSWPRQIPIAGTPADVVAIARDYAGWLSESPIPKLFI ---------3333---3333------1111-iiii------------------------- NAEPGALTTGRMRDFCRTWPNQTEITVAGAHFIQEDSPDEIGAAIAAFVRRLRPAHHH ---------------1111------------3333----------------------- >IMMUNOGLOBULIN MS6-126; SWP:NA; PDB:1MJ8H; VQLQQPGAELVKPGASVKLSCKASGYTFTSNWINWVKQRPGQGLEWIGNIYPGSGYTNYN -----------2222-----------3333--------2222---------1111----3 ERFKSKATLTVDTSSSTAYMQLSSLTSDDSAVYYCARKHYFYDGVVYWGQGTLVTVSAAK 333----------------------1111------------------------------- TTAPSVYPLAPVCSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSS ----------------------------------iiii---------------------- 
VTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRGPT ------------------3333-------------- >MAJOR COLD-SHOCK PROTEIN ; SWP:P0A9X9; PDB:1MJC; SGKMTGIVKWFNADKGFGFITPDDGSKDVFVHFSAIQNDGYKSLDEGQKVSFTIESGAKG ---------------------1111------3333--%%%%---2222------------ PAAGNVTSL --------- >DELETED IN SPLIT HAND/SPL; SWP:NA; PDB:1MJEA; KDLMSSLQSARDLQDMRIKNKERRHLRLQPGSLYLTKSSTLPRISLQAAVGDRAPSACSP ----------1111----1111-------------3333--------------------- KQLYIYGVSKECINVNSKNAEYFQFDIQDHFGKEDLCAGKGFQLADGGWLIPSNDGKAGK ------------------------------------------------------------ EEFYRALCDTPGVDPKLISSIWVANHYRWIVWKLAAMEFAFPKEFANRCLNPERVLLQLK ------1111----------1111------------------------------------ YRYDVEIDNSRRSALKKILERDDTAAKTLVLCISDIVDTIELTDGWYAVRAQLDPPLMAL ------1111----3333------------------------------------------ VKSGKLTVGQKIITQGAELVGSPDACAPLEAPDSLRLKISANSTRPARWHSRLGFFRDPR 3333--2222-------------------------------------------------- PFPLPLSSLFSDGGNVGCVDIIVQRVYPLQWVEKTVSGLYIFRSEREEEKEALRFAEAQQ ----3333-3333----------------------------------------------- KKLEALFTKVHTLSRDVTTVWKLRVTSYKKKEKSALLSIWRPSSDLSSLLTEGKRYRIYH --------------------------------------------3333------------ LAVSKSKSKFERPSIQLTATKRTQYQQLPVSSETLLQVYQPRESLHFSRLSDPAFQPPCS -------------------3333------------------------3333--------- EVDVVGVVVSVVKPIGLAPLVYLSDECLNLLVVKFGIDLNEDIKPRVLIAASNLQCQPES ------------------------1111-------------------------------- TSGVPTLFACHFSIFSASPKEAYFQEKVNNLKHAIENIDTFYKEAEKKLIHVLEGDSPKW ---------1111-------33331111-----------------------1111----- >SPERMIDINE SYNTHASE; SWP:Q8U4G1; PDB:1MJFA; AFIEWYPRGYGVAFKIKKKIYEKLSKYQKIEVYETEGFGRLLALDGTVQLVTLGERSYHE -----2222---------------1111---------------iiii-----------33 PLVHPAMLAHPKPKRVLVIGGGDGGTVREVLQHDVDEVIMVEIDEDVIMVSKDLIKIDNG 33------------------3333-----3333----------3333--------1111- LLEAMLNGKHEKAKLTIGDGFEFIKNNRGFDVIIADSTDPVLFSEEFYRYVYDALNNPGI ------------------------------------------------------------ 
YVTQAGSVYLFTDELISAYKEMKKVFDRVYYYSFPVIGYASPWAFLVGVKGDIDFTKIDR ------3333-------------------------2222--------------1111--3 ERAKKLQLEYYDPLMHETLFQMPKYIRETLQ 333--------33333333---3333----- >ATP-BINDING DOMAIN OF PRO; SWP:Q57997; PDB:1MJHA; VMYKKILYPTDFSETAEIALKHVKAFKTLKAEEVILLHVIDEREIKVEEFENELKNKLTE ----------------------------------------3333--3333---------- EAKNKMENIKKELEDVGFKVKDIIVVGIPHEEIVKIAEDEGVDIIIMGSHGKTNLKEILL -------------1111-------------------------------------3333-- GSVTENVIKKSNKPVLVVKRKNS ----------------------- >50S RIBOSOMAL PROTEIN L5; SWP:P41201; PDB:1MJIA; LDVALKRKYYEEVRPELIRRFGYQNVWEVPRLEKVVINQGLGEAKEDARILEKAAQELAL ------------------1111--1111------------------1111---------- ITGQKPAVTRAKKSISNFKLRKGMPIGLRVTLRRDRWIFLEKLLNVALPRIRDFRGLNPN ----------------------------------------------3333-------111 SFDGRGNYNLGLREQLIFPEITYDMVDALRGDIAVVTTAETDEEARALLELLGFPFRK 1-----------------------3333------------3333-----3333----- >INTEGRIN ALPHA-L; SWP:P20701; PDB:1MJNA; GNVDLVFLFDGSMSLQPDEFQKILDFMKDVMKKCSNTSYQFAAVQFSTSYKTEFDFSDYV -----------1111----------------1111------------------------- KRKDPDALLKHVKHMLLLTNTFGAINYVATEVFREELGARPDATKVLIIITDGEATDSGN --------1111---------------------3333--1111----------------- IDAAKDIIRYIIGIGKHFQTKESQETLHKFASKPASEFVKILDTFEKLKDLCTELQKKI 3333----------1111--------3333---3333------3333------------ >NITRIC-OXIDE SYNTHASE HOM; SWP:P0A004; PDB:1MJTA; HLFKEAQAFIENMYKECHYETQIINKRLHDIELEIKETGTYTHTEEELIYGAKMAWRNSN --------------1111----------------------------------------11 RCIGRLFWDSLNVIDARDVTDEASFLSSITYHITQATNEGKLKPYITIYAPKDGPKIFNN 11-33331111----1111-----------------%%%%-------------------- QLIRYAGYDNCGDPAEKEVTRLANHLGWKGKGTNFDVLPLIYQLPNESVKFYEYPTSLIK -------1111-3333-------1111----------------2222-------3333-- EVPIEHNHYPKLRKLNLKWYAVPIISNMDLKIGGIVYPTAPFNGWYMVTEIGVRNFIDDY --------33331111------------------------------3333-------111 RYNLLEKVADAFEFDTLKNNSFNKDRALVELNYAVYHSFKKEGVSIVDHLTAAKQFELFE 
1--------1111----3333------------------1111----------------- RNEAQQGRQVTGKWSWLAPPLSPTLTSNYHHGYDNTVKDPNFFYKK ---1111-----3333-----33333333----------------- >IMMUNOGLOBULIN MS6-12; SWP:NA; PDB:1MJUH; VQLQQPGAELVKPGASVKLSCKASGYTFTNYWINWVKQRPGQGLEWIGNIYPGSSYTHYN -----------2222-----------3333--------2222-----------------3 EKFKNKATLTVDTSSSTAYMQLSSLTSDDSAVYYCANKLGWFPYWGQGTLVTVSAAKTTA 333----------------------3333---------!!!!------------------ PSVYPLAPVCSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVT ------------------------------iiii-------------------------- SSTWPSQSITCNVAHPASSTKVDKKIEPRGP 3333----------3333------------- >Igk-V28 protein [Fragment; SWP:Q5XKG4; PDB:1MJUL; DIVMTQAAPSVPVTPGESVSISCRSSKSLLHSNGNTYLYWFLQRPGQSPQLLIYRMSNLA -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTAFTLRISRVEAEDVGVYYCLQHLEYPFTFGAGTKLELKRADAAPTV 2223333----------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----3333-------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ----------1111--------1111--------1111- >VASCULAR ENDOTHELIAL GROW; SWP:P15692; PDB:1MJVA; MVVKFMDVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSAVPLMRCGGACNDEGLECVPTE ---------1111--------3333-1111--------------------1111------ ESNITMQIMRIKPHQGQHIGEMSFLQHNKCECRPKK -----------2222--------------------- ------------------------------------------------------------ ------------------------------------- >HYPOTHETICAL PROTEIN YQJY; SWP:P54562; PDB:1MK4A; HMDIRTITSSDYEMVTSVLNEWWGGRQLKEKLPRLFFEHFQDTSFITSEHNSMTGFLIGF -------3333------1111-iiii------3333---3333-----%%%%-------- QSQSDPETAYIHFSGVHPDFRKMQIGKQLYDVFIETVKQRGCTRVKCVTSPVNKVSIAYH ----1111--------11113333-------------1111--------1111------- TKLGFDIEKGTKTVNGISVFANYDGPGQDRVLFVKNI -------------iiii----1111------------ >BETA-HYDROXYDECANOYL THIO; SWP:P18391; PDB:1MKAA; VDKRESYTKEDLLASGRGELFGAKGPQLPAPNMLMMDRVVKMTETGGNFDKGYVEAELDI 
--------------1111---1111---------------------1111---------- NPDLWFFGCHFIGDPVMPGCLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPT 11113333--2222---3333------------------------------------111 AKKVTYRIHFKRIVNRRLIMGLADGEVLVDGRLIYTASDLKVGLFQDTSAF 1---------------------------iiii---------------1111 ------------------------------------------- >PHOSPHODIESTERASE 4D; SWP:Q08499; PDB:1MKDA; TEQEDVLAKELEDVNKWGLHVFRIAELSGNRPLTVIMHTIFQERDLLKTFKIPVDTLITY -3333-----1111-----------1111------------------------------- LMTLEDHYHADVAYHNNIHAADVVQSTHVLLSTPALEAVFTDLEILAAIFASAIHDVDHP --------1111--------------------3333-------------------2222- GVSNQFLINTNSELALMYNDSSVLENHHLAVGFKLLQEENCDIFQNLTKKQRQSLRKMVI -------1111------%%%%--------------------1111--------------- DIVLATDMSKHMNLLADLKTMVETKKVTSSGVLLLDNYSDRIQVLQNMVHCADLSNPTKP --11111111-----------------3333-----3333--------------3333-3 LQLYRQWTDRIMEEFFPQGDRERERGMEISPMCDKHNASVEKSQVGFIDYIVHPLWETWA 333--------------------------22221111----------------------- DLVHPDAQDILDTLEDNREWYQSTIPQS ----1111-------------------- >Fusion protein consisting; SWP:O43516; PDB:1MKEA; DLPPPEPYVQTTKSYPSKLARNESRGSGSGSLFSFLGKKCVTMSSAVVQLYAADRNCMWS --------------3333------------------------------------------ KKCSGVACLVKDNPQRSYFLRIFDIKDGKLLWEQELYNNFVYNSPRGYFHTFAGDTCQVA ------------3333-------------------------------------------- LNFANEEEAKKFRKAVTDLLGRRQ -----3333--------------- >M3; SWP:O41925; PDB:1MKFA; HSSGVSTQSVDLSQIKRGDEIQAHCLTPAETEVTECAGILKDVLSKNLHELQGLCNVKNK -------------------------------------------3333----1111----- MGVPWVSVEELGQEIITGRLPFPSVGGTPVNDLVRVLVVAESNTPEETPEEEFYAYVELQ -------3333----------------1111----------------------------- TELYTFGLSDDNVVFTSDYMTVWMIDIPKSYVDVGMLTRATFLEQWPGAKVTVMIPYSST --------1111---------------3333-2222---------2222----------- FTWCGELGAISEESAPQPSLSARSPVCKNSARYSTSKFCEVDGCTAETGMEKMSLLTPFG ----------3333--------------3333---11113333-3333---------111 GPPQQAKMNTCPCYYKYSVSPLPAMDHLILADLAGLDSLTSPVYVMAAYFDSTHENPVRP 
1----------3333-------------------3333-----------------3333- SSKLYHCALQMTSHDGVWTSTSSEQCPIRLVEGQSQNVLQVRVAPTSMPNLVGVSLMLEG -------------iiii--------------------------33331111------222 QQYRLEYFGDH 2---------- >VASCULAR ENDOTHELIAL GROW; SWP:P15692; PDB:1MKGA; VVKFMDVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRAGGCCNDEGLECVPTEE --3333--1111--------3333-------------------------1111------- SNITMQIMRIKPHQGQHIGEMSFLQHNKAECRPK ----------2222-------------------- >C-TERMINAL DOMAIN OF METH; SWP:Q9V011; PDB:1MKHA; MYVKFDDFAKLDLRVGKIIEVKDHPNADKLYVVKVDLGDEVRTLVAGLKKYYKPEELLNR ---3333----------------------------------------3333-33332222 YVVVVANLEPKKLGIGSQGMLLAADDGERVALLMPDKEVKLGAKVR ----1111-------------------------------2222--- >PROBABLE GLUTAMINASE YBGJ; SWP:O31465; PDB:1MKIA; AKELINPALQLHDWVEYYRPFAANGQSANDSQLGICVLEPDGTIHAGDWNVSFTQISKVI ---------------------1111---1111------1111------------------ SFIAACSRGIPYVLDRVDVEPTGDAFNSIIRLEINKPGKPFNPINAGALTIASILPGESA ------------3333--------1111-----------------------1111----- YEKLEFLYSVETLIGKRPRIHEEVFRSEWETAHRNRALAYYLKETNFLEAEVEETLEVYL ------------------------------------------1111-------------- KQCAESTTEDIALIGLILAHDGYHPIRHEQVIPKDVAKLAKALLTCGYNASGKYAAFVGV -------------------iiii------------------------------------- PAKSGVSGGIALVPPSARREQPFQSGCGIGIYGPAIDEYGNSLTGGLLKHAQEWELSIF ----1111--------------1111----------1111--3333-----1111---- >VASCULAR ENDOTHELIAL GROW; SWP:P15692; PDB:1MKKA; VVKFMDVYQRSYCHPIETLVDIFQEYPDEIEYIFKPSCVPLMRCGGCANDEGLECVPTEE --------1111--------3333-1111------------------------------- SNITMQIMRIKPHQGQHIGEMSFLQHNKCEARP ----------2222------------------- >ICLR TRANSCRIPTIONAL REGU; SWP:Q9WXS0; PDB:1MKMA; MNTLKKAFEILDFIVKNPGDVSVSEIAEKFNMSVSNAYKYMVVLEEKGFVLRKKDKRYVP --------------------------------3333----------------1111---- GYKLIEYGSFVLRRFNIRDIAHDHLVDIMKRTGETVHLILKDGFEGVYIDKVEGEQSIPM 3333------3333-3333-------1111-----------!!!!--------1111--- VSRLGMKVDLYSTASGKSILAFVPEKELKEYLKIVELKPKTPNTITNPRVLKRELEKIRK 
--------1111-------1111-----------------1111---------------- RGYAVDNEENEIGIMCVGVPIFDHNGYPVAGVSISGVARKFTEEKIEEYSDVLKEKAEEI ----------2222--------1111----------3333-3333--------------- SRKLGY -1111- >MIDKINE; SWP:P21741; PDB:1MKNA; KKKDKVKKGGPGSECAEWAWGPCTPSSKDCGVGFREGTCGAQTQRIRCRVPCNWKKEFG ----------------------------------------------------1111--- >PYST1; SWP:Q16828; PDB:1MKP; ASFPVEILPFLYLGCAKDSTNLDVLEEFGIKYILNVTPNLPNLFENAGEFKYKQIPISDH -------2222---1111----------------------------!!!!---------1 WSQNLSQFFPEAISFIDEARGKNCGVLVHSLAGISRSVTVTVAYLMQKLNLSMNDAYDIV 111-3333--------------------------------------1111---------- KMKKSNISPNFNFMGQLLDFERTL ---3333---1111-----3333- >PROBABLE GTP-BINDING PROT; SWP:Q9X1F8; PDB:1MKYA; ATVLIVGRPNVGKSTLFNKLVKDPVQDTVEWYGKTFKLVDTCGVFDNPQDIISQKKEVTL ------------------------------iiii--------11113333---------- NIREADLVLFVVDGKRGITKEDESLADFLRKSTVDTILVANKAENLREFEREVKPELYSL -3333------------------------------------------------3333333 GFGEPIPVSAEHNINLDTLETIIKKLEEKGLDLESKPEITDAIKVAIVGRPNVGKSTLFN 3--------1111-3333--------1111-------------------2222------- AILNKERALVSPIPVDDEVFIDGRKYVFVDTAGLEKYSNYRVVDSIEKADVVVIVLDATQ ----1111------------iiii------3333----3333--------------1111 GITRQDQRAGLERRGRASVVVFNKWDLVVHREKRYDEFTKLFREKLYFIDYSPLIFTSAD --3333-----1111--------3333--3333------------3333---------11 KGWNIDRIDANLAYASYTTKVPSSAINSALQKVLAFTNLPRGLKIFFGVQVDIKPPTFLF 11-3333--------1111--3333-------3333------------------------ FVNSIEKVKNPQKIFLRKLIRDYVFPFEGSPIFLKFKRSR ---3333-3333--------------2222---------- >MOLYBDENUM COFACTOR BIOSY; SWP:P30746; PDB:1MKZA; QVSTEFIPTRIAILTVSNRRGEEDDTSGHYLRDSAQEAGHHVVDKAIVKENRYAIRAQVS --------------------3333------------------------------------ AWIASDDVQVVLITGGTGLTEGDQAPEALLPLFDREVEGFGEVFRLSFEEIGTSTLQSRA -------------------11113333-3333----------------------1111-- VAGVANKTLILAPGSTKACRTAWENIIAPQLDARTRPCNFHPHLKKGS ----%%%%-------------------11111111----3333----- >TRIOSEPHOSPHATE ISOMERASE; SWP:P04789; PDB:1ML1A; 
SKPQPIAAANWKSGSPDSLSELIDLFNSTSINHDVQCVVASTFVHLAMTKERLSHPKFVI -------------------------1111------------3333---------1111-- AAQNAGNADALASLKDFGVNWIVLGHSERRWYYGETNEIVADKVAAAVASGFMVIACIGE ------------------------------------------------------------ TLQERESGRTAVVVLTQIAAIAKKLKKADWAKVVIAYEPVWAIGTGKVATPQQAQEAHAL ---------3333--------111133331111-----1111-------3333------- IRSWVSSKIGADVAGELRILYGGSVNGKNARTLYQQRDVNGFLVGGASLKPEFVDIIKAT -------------------------3333------1111-----3333---------111 Q 1 >ASPARTATE TRANSCARBAMOYLA; SWP:P77918; PDB:1ML4A; DWKGRDVISIRDFSKEDIETVLATAERLERELKEKGQLEYAKGKILATLFFEPSTRTRLS -2222---1111--------------------------1111-------------3333- FESAMHRLGGAVIGFAEASTSSVKKGESLRDTIKTVEQYCDVIVIRHPKEGAARLAAEVA -----1111-------3333-3333----------1111---------2222----1111 EVPVINAGDGSNQHPTQTLLDLYTIKKEFGRIDGLKIGLLGDLKYGRTVHSLAEALTFYD --------!!!!------------------------------------------3333-- VELYLISPELLRMPRHIVEELREKGMKVVETTTLEDVIGKLDVLYVTRIQKERFPDEQEY -------1111--3333----1111-------33333333---------1111--33331 LKVKGSYQVNLKVLEKAKDELRIMHPLPRVDEIHPEVDNTKHAIYFRQVFNGVPVRMALL 111------3333----1111------------3333--3333----------------- ALVLGVI ------- >HYPOTHETICAL PROTEIN (CRP; SWP:P0ADX1; PDB:1ML8A; MQARVKWVEGLTFLGESASGHQILMDGNSGDKAPSPMEMVLMAAGGCSAIDVVSILQKGR ----------------1111-------%%%%----------------------------- QDVVDCEVKLTSERRERLFTHINLHFIVTGRDLKDAAVARAVDLSAEKYCSVALMLEKAV ------------------------------------------------------------ NITHSYEVVAA ----------- >HISTONE H3 METHYLTRANSFER; SWP:Q8X225; PDB:1ML9A; QLPISIVNREDDAFLNPNFRFIDHSIIGKNVPVADQSFRVGCSCASDEECMYSTCQCLDE ---------------1111--------1111---3333-------3333--11111111- MAPDKRFAYYSQGAKKGLLRDRVLQSQEPIYECHQGCACSKDCPNRVVERGRTVPLQIFR --------------2222---3333--------------1111---3333---------- TKDRGWGVKCPVNIKRGQFVDRYLGEIITSEEADRRRAESTIARRKDVYLFALDKFSDPD --------------2222----------------------33333333-----1111--- SLDPLLAGQPLEVDGEYMSGPTRFINHSCDPNMAIFARVGDHADKHIHDLALFAIKDIPK 
--3333---------------1111---------------1111--------------22 GTELTFDYVN 22----1111 >MALONYL-COENZYME A ACYL C; SWP:P0AAI9; PDB:1MLA; QFAFVFPGQGSQTVGMLADMAASYPIVEETFAEASAALGYDLWALTQQGPAEELNKTWQT -------2222-2222---------------------------------3333--1111- QPALLTASVALYRVWQQQGGKAPAMMAGHSLGEYSALVCAGVIDFADAVRLVEMRGKFMQ -----------------------------3333----1111------------------- EAVPEGTGAMAAIIGLDDASIAKACEEAAEGQVVSPVNFNSPGQVVIAGHKEAVERAGAA ---2222---------------------iiii--------2222---------------- CKAAGAKRALPLPVSVPSHCALMKPAADKLAVELAKITFNAPTVPVVNNVDVKCETNGDA -1111--------------3333----------1111----------------------- IRDALVRQLYNPVQWTKSVEYMAAQGVEHLYEVGPGKVLTGLTKRIVDTLTASALNEPSA ------3333------------1111---------------3333--------------- MAAAL -3333 >IGG1-KAPPA D44.1 FAB (HEA; SWP:NA; PDB:1MLBB; QVQLQESGAEVMKPGASVKISCKATGYTFSTYWIEWVKQRPGHGLEWIGEILPGSGSTYY ------------2222-----------3333----------------------------- NEKFKGKATFTADTSSNTAYMQLSSLTSEDSAVYYCARGDGNYGYWGQGTTLTVSSASTT 3333----------------------1111------------------------------ PPSVFPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTL ------------------------------------%%%%-------------iiii--- SSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC ----------------------1111------------ >Malate dehydrogenase, mit; SWP:P00346; PDB:1MLDA; AKVAVLGASGGIGQPLSLLLKNSPLVSRLTLYDIAHTPGVAADLSHIETRATVKGYLGPE ------1111------------1111-----------------1111----------333 QLPDCLKGCDVVVIPAGVPRKPGMTRDDLFNTNATIVATLTAACAQHCPDAMICIISNPV 3-3333--------------22223333-------------------1111-------33 NSTIPITAEVFKKHGVYNPNKIFGVTTLDIVRANAFVAELKGLDPARVSVPVIGGHAGKT 33---------1111--1111-----------------1111-3333---------!!!! 
IIPLISQCTPKVDFPQDQLSTLTGRIQEAGTEVVKAKAGAGSATLSMAYAGARFVFSLVD ---3333-----------------------------iiii-------------------- AMNGKEGVVECSFVKSQETDCPYFSTPLLLGKKGIEKNLGIGKISPFEEKMIAEAIPELK ------------------------------1111-------------------------- ASIKKGEEFVKNM ------------- >TRYPTOPHAN 5-MONOOXYGENAS; SWP:P17752; PDB:1MLWA; SVPWFPKKISDLDHCANRVLMYGSELDADHPGFKDNVYRKRRKYFADLAMNYKHGDPIPK -------33331111-----------1111-1111-----------------2222---- VEFTEEEIKTWGTVFRELNKLYPTHACREYLKNLPLLSKYCGYREDNIPQLEDVSNFLKE ---------------------1111------------------1111------------- RTGFSIRPVAGYLSPRDFLSGLAFRVFHCTQYVRHSSDPFYTPEPDTCHELLGHVPLLAE ----------------------------------3333---------------------- PSFAQFSQEIGLASLGASEEAVQKLATCYFFTVEFGLCKQDGQLRVFGAGLLSSISELKH -------------2222----------------------iiii----3333--------- ALSGHAKVKPFDPKITCKQECLITTFQDVYFVSESFEDAKEKMREFTKTI -----------33331111---------------------------1111 ------------------------------------ >MI2-BETA; SWP:Q14839; PDB:1MM2A; GPLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTCPALKG 2222-----------------------------------------------1111----- K - >Mi2-beta(Chromodomain hel; SWP:Q14839; PDB:1MM3A; GPLGSDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLRPALYEVPDGEWQCPRCTCPALKG --!!!!------------------------------------------------------ K - >ANTIMICROBIAL PEPTIDE 2; SWP:P27275; PDB:1MMC; VGECVRGRCPSGMCCSQFGYCGKGPKYCGR ----%%%%-%%%%--1111----3333--- >MMLV REVERSE TRANSCRIPTAS; SWP:P03355; PDB:1MML; TWLSDFPQAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLD 3333-11113333-----1111-------1111----------3333------------- QGILVPCQSPWNTPLLPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSH -------------------------------33331111-------------11113333 QWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDPEMGISGQLTWTRLPQGFKNSPTLFDEAL ---------3333----33333333-----3333-----------2222----------- HRDLADFRIQHPDLILLQYVDDLLLAATSELDCQQGTRALLQTLGNLGYRASAKKAQICQ ----------1111-------------------------------------3333----- KQVKYLGYLLK ----iiii--- >MATRILYSIN; SWP:P09237; PDB:1MMQ; 
YSLFPNSPKWTSKVVTYRIVSYTRDLPHITVDRLVSKALNMWGKEIPLHFRKVVWGTADI ---2222---------------33333333-----------3333--------------- MIGFARGAHGDSYPFDGPGNTLAHAFAPGTGLGGDAHFDEDERWTDGSSLGINFLYAATH -----------------------------!!!!-----1111------------------ ELGHSLGMGHSSDPNAVMYPTYGNGDPQNFKLSQDDIKGIQKLYGK ------------1111---------1111----------------- >RIBOSOMAL PROTEIN L11; SWP:P29395; PDB:1MMSA; QIKLQLPAGKATPAPPVGPALGQHGVNIMEFCKRFNAETADKAGMILPVVITVYEDKSFT --------------------1111---3333----------------------1111--- FIIKTPPASFLLKKAAGIEKGSSEPKRKIVGKVTRKQIEEIAKTKMPDLNANSLEAAMKI ---------------------------------3333-------3333------------ IEGTAKSMGIEVV ------------- ---------------------------------------------------- >CORE PROTEIN P15; SWP:P03332; PDB:1MN8A; QTVTTPLSLTLGHWKDVERIAHNQSVDVKKRRWVTFCSAEWPTFNVGWPRDGTFNRDLIT ---------------------1111-----------------------1111-------- QVKIKVFSPGPHGHPDQVPYIVTWEALAFDPPPWV -------------3333--------------1111 >MANGANESE SUPEROXIDE DISM; SWP:P61503; PDB:1MNGA; PYPFKLPDLGYPYEALEPHIDAKTMEIHQKHHGAVTNLNAALEKYPYLHGVEVEVLLRHL -----------1111---------------3333-------11111111--3333---33 AALPQDIQTAVRNNGGGHLNSLFWRLLTPGGAKEPVGELKKAIDEQFGGFQALKEKLTQA 33-3333-----------------1111-------------------------------- AMGRFGSGWAWLVKDPFGKLHVLSTPNQDNPVMEGFTPIVGIVWEAYYLKYQNRRADYLQ --------------1111-------!!!!3333-----------------!!!!------ AIWNVLNWDVAEEFFKKA 3333-------------- >MCM1 TRANSCRIPTIONAL REGU; SWP:P11746; PDB:1MNMA; QKERRKIEIKFIENKTRRHVTFSKRKHGIMKKAFELSVLTGTQVLLLVVSETGLVYTFST -------------------------------------1111--------1111------3 PKFEPIVTQQEGRNLIQACLNAPDD 3333333------------------ >MCM1 TRANSCRIPTIONAL REGU; SWP:P01367; PDB:1MNMC; GLVFNVVTQDMINKSTKPYRGHRFTKENVRILESWFAKNIENPYLDTKGLENLMKNTSLS -------1111-------2222--3333----------3333---3333----------- RIQIKNWVSNRRRKEKT ----------------- >NDT80 PROTEIN; SWP:P38830; PDB:1MNNA; VILTQLNEDGTTSNYFDKRKLKIAPRSTLQFKVGPPFELVRDYCPVVESHTGRTLDLRII ------1111------3333---11113333----------------------------- 
PRIDRGFDHIDEEWVGYKRNYFTLVSTFETANCDLDTFLKSSFDLLVGRLRVQYFAIKIK ----------------1111---------1111--------------------------- AKNDDDDTEINLVQHTAKRDKGPQFCPSVCPLVPSPLPKHQTIREASNVRNITKMKKYDS ---------------1111------------------------1111----------333 TFYLHRDHVNYEEYGVDSLLFSYPEDSIQKVARYERVQFASSISVKKPSQQNKHFSLHVI 3---1111-3333-11111111-----------------1111---1111---------- LGAVVDPDGIPYDELALKNGSKGMFVYLQEMKTPPLIIRGRSPSNYASSQ -----------------------------------------1111----- >MNT REPRESSOR; SWP:P03049; PDB:1MNTA; ARDDPHFNFRMPMEVREKLKFRAEANGRSMNSELLQIVQDALSKPSPVTGYRNDAERLAD --------------------------------------------------------3333 EQSELV --1111 >IGG2A-KAPPA ANTIBODY MN12; SWP:NA; PDB:1MNUH; EVNLQQSGTVLARPGASVRMSCKASGYSFTSYWLHWIKQRPGQGLEWIGGIYPGNRDTRY ------------2222-----------1111----------------------------- TQRFKDKAKLTAVTSANTAYMELSSLTNEDSAVYYCSIIYFDYADFIMDYWGQGTTVTVS 3333--------3333----------3333------------------------------ SAKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQS --------------2222-----------------------%%%%-------------!! 
DLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRG !!---------3333------------1111----------- >TRIOSEPHOSPHATE ISOMERASE; SWP:Q10657; PDB:1MO0A; GLTRKFFVGGNWKMNGDYASVDGIVTFLNASADNSSVDVVVAPPAPYLAYAKSKLKAGVL ---------------------------------1111------3333--------3333- VAAQNCYKVPKGAFTGEISPAMIKDLGLEWVILGHSERRHVFGESDALIAEKTVHALEAG -------------2222-----------------3333---------------------- IKVVFCIGEKLEEREAGHTKDVNFRQLQAIVDKGVSWENIVIAYEPVWAIGTGKTASGEQ -------------1111------------3333------------3333----------- AQEVHEWIRAFLKEKVSPAVADATRIIYGGSVTADNAAELGKKPDIDGFLVGGASLKPDF --------------------------------1111-3333-1111-----1111----- VKIINARSTA ---------- >HPR-LIKE PROTEIN CRH; SWP:O06976; PDB:1MO1A; MVQQKVEVRLKTGLQARPAALFVQEANRFTSDVFLEKDGKKVNAKSIGLSLAVSTGTEVT ------------------------------------iiii--1111-------2222--- LIAQGEDEQEALEKLAAYVQEEVLQ ----1111----------------- >Sodium/Potassium-transpor; SWP:P06685; PDB:1MO7A; QNPMTVAHMWFDNQIHEADTTENQSGVSFDKTSATWFALSRIAGLCNRAVFQANQENLPI ------------------------------------------------------------ LKRAVAGDASESALLKCIEVCCGSVMEMREKYTKIVEIPFNSTNKYQLSIHKNPNASEPK %%%%--------------------3333-------------------------------- HLLVMKGAPERILDRCSSILLHGKEQPLDEELKDAFQNAYLELGGLGERVLGFCHLLLPD ------------1111----%%%%------------------------------------ EQFPEGFQFDTDEVNFPVDNLCFVGLISMIDPP --------------------------------- >ORF3; SWP:Q56839; PDB:1MO9A; KVWNARNDHLTINQWATRIDEILEAPDGGEVIYNVDENDPREYDAIFIGGGAAGRFGSAY ----1111----------------1111-------1111--------------------- LRAMGGRQLIVDRWPFLGGSCPHNACVPHHLFSDCAAELMLARTFSGQYWFPDMTEKVVG -1111--------------3333------------------------!!!!--------- IKEVVDLFRAGRNGPHGIMNFQSKEQLNLEYILNCPAKVIDNHTVEAAGKVFKAKNLILA ----------------------------------------------iiii---------- VGAGPGTLDVPGVNAKGVFDHATLVEELDYEPGSTVVVVGGSKTAVEYGCFFNATGRRTV ---------2222-2222-3333------------------------------------- MLVRTEPLKLIKDNETRAYVLDRMKEQGMEIISGSNVTRIEEDANGRVQAVVAMTPNGEM -------3333-------------------------------1111-------------- 
RIETDFVFLGLGEQPRSAELAKILGLDLGPKGEVLVNEYLQTSVPNVYAVGDLIGGPMEM ----------------------------1111----1111---2222----1111----- FKARKSGCYAARNVMGEKISYTPKNYPDFLHTHYEVSFLGMGEEEARAAGHEIVTIKMPP ------------1111-------------------------------------------- DTENGLNVALPASDRTMLYAFGKGTAHMSGFQKIVIDAKTRKVLGAHHVGYGAKDAFQYL -1111-------2222-----22221111-----------------------3333---- NVLIKQGLTVDELGDMDELFLNPTHFIQLSRLRAGSKNLVSL ---1111-----1111----------------3333------ >MOLONEY MURINE LEUKEMIA V; SWP:P03385; PDB:1MOF; DLREVEKSISNLEKSLTSLSEVVLQNRRGLDLLFLKEGGLCAALKEECAFYAD ----------------------------------1111--------------- >GLUCOSAMINE 6-PHOSPHATE S; SWP:P17169; PDB:1MOQ; GDKGIYRHYMQKEIYEQPNAIKNTLTGRISHGQVDLSELGPNADELLSKVEHIQILACGT --!!!!----------------1111---iiii--33331111---1111-------!!! SYNSGMVSRYWFESLAGIPCDVEIASEFRYRKSAVRRNSLMITLSQSGETADTLAGLRLS !--------------------------1111----------------------------1 KELGYLGSLAICNVPGSSLVRESDLALMTNAGTEIGVASTKAFTTQLTVLLMLVAKLSRL 111----------2222----------------------3333---------------11 KGLDASIEHDIVHGLQALPSRIEQMLSQDKRIEALAEDFSDKHHALFLGRGDQYPIALEG 11----------------------1111----------1111-------!!!!------- ALKLKEISYIHAEAYAAGELKHGPLALIDADMPVIVVAPNNELLEKLKSNIEEVRARGGQ ---------------1111---3333--1111---------------------3333--- LYVFADQDAGFVSSDNMHIIEMPHVEEVIAPIFYTVPLQLLAYHVALIKGTDVDQPRNLA -----3333----1111--------3333----------------------33332222- KSVTVE ------ >Transforming growth facto; SWP:P01135; PDB:1MOXC; VSHFNDCPDSHTQFCFHGTCRFLVQEDKPACVCHSGYVGARCEHADLLA -----------------------1111------2222-1111------- >ADP-RIBOSYLATION FACTOR-L; SWP:P38116; PDB:1MOZA; GNIFSSMFDKLWGSNKELRILILGLDGAGKTTILYRLQIGEVVTTKPTIGFNVETLSYKN --------1111----------------------3333---------------------- LKLNVWDLGIRPYWRCYYADTAAVIFVVDSTDKDRMSTASKELHLMLQEEELQDAALLVF ----------3333----------------------------------3333-------- ANKQDQPGALSASEVSKELNLVELKDRSWSIVASSAIKGEGITEGLDWLIDVIKEEQL --1111-------------3333------------1111---------------3333 >SER/ARG-RELATED NUCLEAR M; 
SWP:Q8IYB3; PDB:1MP1A; SHMQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEILGFEDDVVIEFIFNQLEVKNPDSK --------3333---1111--1111-------------------------1111------ MMQINLTGFLNGKNAREFMGELWPLLLSAQENIAGIPSAFLELKKEEIKQR ---------------------------3333-----3333----------- >FOCAL ADHESION KINASE 1; SWP:Q05397; PDB:1MP8A; DYEIQRERIELGRCIGEGQFGDVHQGIYMSPPALAVAIKTCKNCTSDSVREKFLQEALTM ----3333---------1111-------------------1111--3333---------- RQFDHPHIVKLIGVITENPVWIIMELCTLGELRSFLQVRKYSLDLASLILYAYQLSTALA ----1111------------------1111------1111---3333------------- YLESKRFVHRDIAARNVLVSSNDCVKLGDLPIKWMAPESINFRRFTSASDVWMFGVCMWE --1111------3333----1111------3333-------------------------- ILMHGVKPFQGVKNNDVIGRIENGERLPMPPNCPPTLYSLMTKCWAYDPSRRPRFTELKA 1111----22223333----1111-----2222---------1111-3333--------- QLSTILEEEKAQ --------1111 >TATA-BINDING PROTEIN; SWP:Q9UWN7; PDB:1MP9A; DEIPYKAVVNIENIVATVTLDQTLDLYAMERSVPNVEYDPDQFPGLIFRLESPKITSLIF --------------------------------------1111-----------------1 KSGKMVVTGAKSTDELIKAVKRIIKTLKKYGMQLTGKPKIQIQNIVASANLHVIVNLDKA 111------------------------1111----------------------------- AFLLENNMYEPEQFPGLIYRMDEPRVVLLIFSSGKMVITGAKREDEVHKAVKKIFDKLVE ---------3333-----------------1111--------3333-------------- LDCVKPVEEEELE -----1111---- >3-METHYLADENINE DNA GLYCO; SWP:P04395; PDB:1MPGA; MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH -----------------------2222---1111------!!!!---------1111--- INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA ---3333---------------33333333-----3333--1111--------------- ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE -------------------------3333-----------11113333------------ ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL -------------------------------2222-------------------1111-- IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA ----2222---------1111------------1111----- >PEPSIN; SWP:P09177; PDB:1MPP; SVDTPGLYDFDLEEYAIPVSIGTPGQDFYLLFDTGSSDTWVPHKGCDNSEGCVGKRFFDP 
----------------------------------------------1111--------33 SSSSTFKETDYNLNITYGTGGANGIYFRDSITVGGATVKQQTLAYVDNVSGPTAEQSPDS 331111--------------------------iiii--------------3333--1111 ELFLDGIFGAAYPDNTAMEAEYGDTYNTVHVNLYKQGLISPVFSVYMNTNDGGGQVVFGG -----------3333-3333---------------------------------------- VNNTLLGGDIQYTDVLKSRGGYFFWDAPVTGVKIDGSDAVSFDGAQAFTIDTGTNFFIAP -3333------------iiii------------iiii-------------1111------ SSFAEKVVKAALPDATESQQGYTVPCSKYQDSKTTFSLVLQKSGSSSDTIDVSVPISKML -----------1111-------------1111---------2222-----------1111 LPVDKSGETCMFIVLPDGGNQFIVGNLFLRFFVNVYDFGKNRIGFAPLASGYEND ---3333--------------------3333------------------------ >NKG2-D TYPE II INTEGRAL M; SWP:P26718; PDB:1MPUA; QIPLTESYCGPCPKNWICYKNNCYQFFDESKNWYESQASCMSQNASLLKVYSKEDQDLLK ------------2222------------------------1111-----------3333- LVKSYHWMGLVHIPTNGSWQWEDGSILSPNLLTIIEMQKGDCALYASSFKGYIENCSTPN --------------------1111------------------------------1111-- TYICMQRT -------- >ALPHA-AMINO ACID ESTER HY; SWP:Q8PK36; PDB:1MPXA; TSPTPDITGKPFVAADASNDYIKREVIPRDGVKLHTVIVLPKGAKNAPIVLTRTPYDASG ---------------1111---------------------2222------------3333 RTERLASPHKDLLSAGDDVFVEGGYIRVFQDVRGKYGSEGDYVTRPLRGPLNPSEVDHAT -------------3333---1111-------2222-------------1111----3333 DAWDTIDWLVKNVSESNGKVGIGSSYEGFTVVALTNPHPALKVAVPESPIDGWGDDWFNY ------------1111----------------3333-3333-----------------ii GAFRQVNFDYFTGQLSKRGKGAGIARQGHDDYSNFLQAGSAGDFAKAAGLEQLPWWHKLT ii-3333----------------------------3333---------1111-------- EHAAYDAFWQEQALDKVARTPLKVPTWLQGLWDQEDWGAIHSYAAEPRDKRNTLNYLVGP -----333311113333----------------------------1111----------- WRHSQVNYDGSALGALNFEGDTARQFRHDVLRPFFDQYLVDGAPKADTPPVFIYNTGENH -2222-------!!!!-----------------------2222----------------- WDRLKAWPRSCDKGCAATSKPLYLQAGGKLSFQPPVAGQAGFEEYVSDPAKPVPFVPRPV -----------------------------------------------3333--------- DFADRAWTTWLVHDQRFVDGRPDVLTFVTEPLTEPLQIAGAPDVHLQASTSGSDSDWVVK 
3333--1111----3333--1111------------------------------------ LIDVYPEEASNPKGGYELPVSLAIFRGRYRESFSTPKPLTSNQPLAFQFGLPTANHTFQP --------------------------1111---------2222---------------22 GHRVVQVQSSLFPLYDRNPQTYVPNIFFAKPGDYQKATQRVYVSPEQPSYISLPVR 22----------------------3333-1111----------3333--------- >CATECHOL 2,3-DIOXYGENASE; SWP:P06622; PDB:1MPYA; MNKGVMRPGHVQLRVLDMSKALEHYVELLGLIEMDRDDQGRVYLKAWTEVDKFSLVLREA ------------------------------------1111-----1111----------- DEPGMDFMGFKVVDEDALRQLERDLMAYGCAVEQLPAGELNSCGRRVRFQAPSGHHFELY -----------------------------------22222222-------1111------ ADKEYTGKWGLNDVNPEAWPRDLKGMAAVRFDHALMYGDELPATYDLFTKVLGFYLAEQV -----------------------!!!!--------------------------------- LDENGTRVAQFLSLSTKAHDVAFIHHPEKGRLHHVSFHLETWEDLLRAADLISMTDTSID -1111-----------------------------------3333---------------- IGPTRHGLTHGKTIYFFDPSGNRNEVFCGGDYNYPDHKPVTWTTDQLGKAIFYHDRILNE -----------------1111------------1111-----3333------------33 RFMTVLT 33----- ----------------------------------------- >CYTIDINE DEAMINASE; SWP:P32320; PDB:1MQ0A; ECVQQLLVCSQEAKQSAYCPYSHFPVGAALLTQEGRIFKGCNIENACYPLGICAERTAIQ ------------3333---------------3333-----------3333---------- KAVSEGYKDFRAIAIASDMQDDFISPCGACRQVMREFGTNWPVYMTKPDGTYIVMTVQEL --1111----------------------------------------3333-----3333- LPSSFGPEDL -----3333- >AURORA-RELATED KINASE 1; SWP:O14965; PDB:1MQ4A; RQWALEDFEIGRPLGKGKFGNVYLAREKQSKFILALKVLFKAQLEKAGVEHQLRREVEIQ ---3333---------1111----------------------------3333------33 SHLRHPNILRLYGYFHDATRVYLILEYAPLGTVYRELQKLSKFDEQRTATYITELANALS 33--1111--------1111-------1111----------------------------- YCHSKRVIHRDIKPENLLLGSAGELKIADFGWSVHAPSSRTLCGTLDYLPPEMIEGRMHD ------------3333---1111------1111------------1111----------- EKVDLWSLGVLCYEFLVGKPPFEANTYQETYKRISRVEFTFPDFVTEGARDLISRLLKHN -------------------1111------------------3333--------------3 PSQRPMLREVLEHPWITANSS 333--3333------------ >EPHRIN TYPE-A RECEPTOR 2; SWP:P29317; PDB:1MQBA; 
TTEIHPSCVTRQKVIGAGEFGEVYKGMLKTKKEVPVAIKTLKAGYTEKQRVDFLGEAGIM ----3333---------1111--------------------2222-------------33 GQFSHHNIIRLEGVISKYKPMMIITEYMENGALDKFLREKDGEFSVLQLVGMLRGIAAGM 33-------------------------1111--------2222----------------- KYLANMNYVHRDLAARNILVNSNLVCKVSDFGLKIPIRWTAPEAISYRKFTSASDVWSFG -------------3333---1111-----------1111--------------------- IVMWEVMTYGERPYWELSNHEVMKAINDGFRLPTPMDCPSAIYQLMMQCWQQERARRPKF -----1111--2222----------1111-----2222--------------3333--33 ADIVSILDKLIRAPDSLKTLADF 33----------3333------- >ADPR PYROPHOSPHATASE; SWP:O33199; PDB:1MQEA; FETISSETLHTGAIFALRRDQVRIVTREVVEHFGAVAIVAMDDNGNIPMVYQYRHTYGRR -----------1111--------------------------1111---------1111-- LWELPAGLLDVAGEPPHLTAARELREEVGLQASTWQVLVDLDTAPGFSDESVRVYLATGL ----------2222-----------------------------3333------------- REVGRTMGWYPIAEAARRVLRGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAF ------------------1111----------------1111-----1111-1111-333 AARRAER 3------ >GLUTAMATE RECEPTOR 2; SWP:P19491; PDB:1MQIA; NKTVVVTTILESPYVMMKKNHEMLEGNERYEGYCVDLAAEIAKHCGFKYKLTIVGDGKYG -------------------3333-!!!!-------------------------1111--- ARDADTKIWNGMVGELVYGKADIAIAPLTITLVREEVIDFSKPFMSLGISIMIKKGTPIE ---------------1111-----------3333-------------------2222--- SAEDLSKQTEIAYGTLDSGSTKEFFRRSKIAVFDKMWTYMRSAEPSVFVRTTAEGVARVR ----1111------------------------------3333------------------ KSKGKYAYLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGIATPKGSSLGNAVNLAVLKLN -iiii-------------------------------------2222-------------- EQGLLDKLKNKWWYDKGECG -------------------- >Ig heavy chain V region 5; SWP:P18525; PDB:1MQKH; EVKLQESGGDLVQPGGSLKLSCAASGFTFSSYTMSWVRQTPEKRLEWVASINNGGGRTYY ------------2222-----------1111--------1111--------1111----- PDTVKGRFTISRDNAKNTLYLQMSSLKSEDTAMYYCVRHEYYYAMDYWGQGTTVTVSSAW 3333--------3333----------3333------------------------------ RHP --- >Ig heavy chain V region 5; SWP:P18525; PDB:1MQKL; DIELTQTPVSLSASVGETVTITCRASENIYSYLAWYQQKQGKSPQFLVYNAKTLGEGVPS 
-------------2222---------------------2222------------222233 RFSGSGSGTQFSLKINSLLPEDFGSYYCQHHYGTPPLTFGGGTKLEIKR 33----------------1111--------------------------- >HEMAGGLUTININ HA1 CHAIN; SWP:P03442; PDB:1MQMA; STATLCLGHHAVPNGTIVKTITDDQIEVTNATELVQSSSTGKICNNPHRILDGRACTLID ------------------------------------------------------------ ALLGDPHCDVFQNETWDLFVERSNAFSNCYPYDIPDYASLRSLVASSGTLEFITEGFTWT ----11111111---------1111----------------------------------- GVTQNGGSSACKRGPANGFFSRLNWLTKSESAYPVLNVTMPNNDNFDKLYIWGVHHPSTN -------1111--------1111-----------------------------------33 QEQTNLYVQASGRVTVSTRRSQQTIIPNIGSRPWVRGQPGRISIYWTIVKPGDVLVINSN 33---------------1111-------------iiii-----------2222------- GNLIAPRGYFKMRTGKSSIMRSDAPIDTCISECITPNGSIPNDKPFQNVNKITYGACPKY ----------------------------------1111---------------------- VKQNTLKLATGMRNVPEK ------------------ >BETA-LACTAMASE II; SWP:P04190; PDB:1MQOA; TVIKNETGTISISQLNKNVWVHTELGSFNGEAVPSNGLVLNTSKGLVLVDSSWDDKLTKE ----1111-------------------iiii----------1111--------------- LIEMVEKKFQKRVTDVIITHAHADRIGGIKTLKERGIKAHSTALTAELAKKNGYEEPLGD ------1111-----------33331111--------------------1111------- LQTVTNLKFGNMKVETFYPGKGHTEDNIVVWLPQYNILVGGCLVKSTSAKDLGNVADAYV --------!!!!----------------------------3333-3333-----1111-3 NEWSTSIENVLKRYRNINAVVPGHGEVGDKGLLLHTLDLLK 333----------------------------------1111 >SLY1 PROTEIN; SWP:P22213; PDB:1MQSA; KDISLRDQISAILKLFLNKDLNNNDNITTITDDIFNQQEIIWKVLILDIKSTATISSVLR ---3333--------%%%%2222-------3333--------------3333-3333--3 VNDLLKAGITVHSLIKQDRSPLPDVPAIYFVSPTKENIDIIVNDLKSDKYSEFYINFTSS 333----------3333----1111----------------------------------- LPRNLLEDLAQQVSITGKSDKIKQVYDQYLDFIVTEPELFSLEISNAYLTLNDPKTTEEE -3333------3333--3333-------------------------------3333---- ITGLCANIADGLFNTVLTINSIPIIRAAKGGPAEIIAEKLGTKLRDFVINTNSERGVLII ----------------3333----------3333-------------------------- LDRNIDFASFSHSWIYQCVFDIFKLSRNTVTIPLATKKYDIEPNDFFWENSHLPFPEAAE -------------------------%%%%------------------------------- 
NVEAALNTYKEEAAEITRKTEVVKKLPELTAKKNTIDTHNIFAALLSQLESKSLDTFFEV -------------------------------------------------------3333- EQDPGSTKTRSRFLDILKDGKTNNLEDKLRSFIVLYLTSTTGLPKDFVQNVENYFKENDY ------------------------------------------------------------ DINALKYVYKLREFQLSNSLQNKSLYGLTEGKLQGGVGSLISGIKKLLPEKKTIPITNVV -3333-3333----1111-------3333------------------------------- DAIDPLNSSQKNLETTDSYLYIDPKITRGSHTRKPKRQSYNKSLVFVVGGGNYLEYQNLQ ---3333--------1111---3333---------------------------------- EWAHSQLHNPKKVYGSTAITTPAEFLNEISRLGASNSS --------------------3333-------------- >POLYPROTEIN; SWP:Q8B8X4; PDB:1MQTA; RVADTVGSGPVNSESIPALTAAETGHTSQVVPSDTMQTRHVKNYHSRSESTVENFLCRSA ---------------3333-3333------3333------------11113333------ CVFYTTYKNHDSDGDNFAYWVINARQVAQLRRKLEMFTYARFDLELTFVITSTQEQSTTQ ------------------------------------------------------------ GQDTPVLTHQIMYVPPGGPVPTKVNSYSWQTSTNPSVFWTEGNAPPRMSIPFIGIGNAYS ---------------------------------------2222----------------- MFYDGWARFDKQGTYGISTLNSMGTLYMRHVNGGGPGPIVSTVRIYFKPKHVKTWVPRPP ---------------3333----------------------------------------- RLCQYKKAGNVNFIPTSVTEGRTDITTMKTT ------------------------------- >CYTOCHROME C'; SWP:P00149; PDB:1MQVA; ATDVIAQRKAILKQMGEATKPIAAMLKGEAKWDQAVVQKSLAAIADDSKKLPALFPADSK -------------------------------------------------3333--1111- TGGDTAALPKIFEDKAKFDDLFAKLAAAATAAQGTIKDEASLKANIGGVLGNCKSCHDDF -------3333---------------------------------3333-------3333- RAK --- >Ski oncogene; SWP:P12755; PDB:1MR1C; SHMRVYHECFGKCKGLLVPELYSSPSAACIQCLDCRLMYPPHKFVVHSHKALENRTCHWG --------!!!!-----3333--1111-------------3333---------------- FDSANWRAYILLSQDYTGKEEQARLGRCLDDVKEKFD -33331111---1111---3333-------------- >ADP-ribosylation factor 2; SWP:P19146; PDB:1MR3F; ASKLFSNLFGNKEMRILMVGLDGAGKTTVLYKLKLGEVITTIPTIGFNVETVQYKNISFT ----3333------------2222-------------------2222------!!!!--- VWDVGGQDRIRSLWRHYYRNTEGVIFVIDSNDRSRIGEAREVMQRMLNEDELRNAVWLVF ----1111--3333---1111-------11111111------------1111-------- 
ANKQDLPEAMSAAEITEKLGLHSIRNRPWFIQSTCATSGEGLYEGLEWLSNNLKNQS --3333-------------3333------------1111------------------ >NICOTIANA ALATA PLANT DEF; SWP:P32026; PDB:1MR4A; RECKTESNTFPGICITKPPCRKACISEKFTDGHCSKILRRCLCTKPC ------3333------------------------------------- ------------------------------------------------------------ -------- >STREPTOGRAMIN A ACETYLTRA; SWP:P50870; PDB:1MR7A; MGPNPMKMYPIEGNKSVQFIKPILEKLENVEVGEYSYYDSKNGETFDKQILYHYPILNDK ---1111---2222------3333--------------------3333-----3333--- LKIGKFCSIGPGVTIIMNGANHRMDGSTYPFNLFGNGWEKHMPKLDQLPIKGDTIIGNDV ---------2222----1111--------3333-iiii1111-1111------------- WIGKDVVIMPGVKIGDGAIVAANSVVVKDIAPYMLAGGNPANEIKQRFDQDTINQLLDIK --2222--2222--------2222------2222-------------------------3 WWNWPIDIINENIDKILDNSIIR 333---------------3333- ------------------------------- >IGG2B-KAPPA JEL103 FAB (H; SWP:NA; PDB:1MRCH; QVQLQQSGAELVKPGASVKLSCKASGYTFTSYWMQWVKQRPGQGLEWIGEISYTNYNQKF ------------2222-----------3333-------------------------3333 KGKATLTVDSTAYMQLSSL ------------------- >IGG2B-KAPPA JEL103 FAB (H; SWP:NA; PDB:1MREH; QVQLQQSGAELVKPGASVKLSCKASGYTFTSYWMQWVKQRPGQGLEWIGEIDPSDSYTNY ---------------------------3333----------------------------- NQKFKGKATLTVDSTAYMQLSSLTSEDSAVYYCANLRGYFDYWGQGTTLTVSSAKTTPPS 3333-------------------3333---------!!!!-------------------- VYPLAPGTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSGLYTMSSSVTVP ------------------------------iiii-------------------------- SSTWPSQTVTCSVAHPASSTTVDKKLEP --------------3333---------- >ALPHA-MOMORCHARIN; SWP:P16094; PDB:1MRG; DVSFRLSGADPRSYGMFIKDLRNALPFREKVYNIPLLLPSVSGAGRYLLMHLFNRDGKTI -----22223333--------1111-----%%%%--------3333-------1111--- TVAVDVTNIYIMGYLADTTSYFFNEPAAELASQYVFRDARRKITLPYSGDYERLQIAAGK ------------------------------3333-1111--------------------- PREKIPIGLPALDSAISTLLHYDSTAAAGALLVLIQTTAEAARFKYIEQQIQERAYRDEV 3333------------------3333------------------------3333------ PSLATISLENSWSGLSKQIQLAQGNNGIFRTPIVLVDNKGNRVQITNVTSKVVTSNIQLL 
---------------------2222---------------------11113333------ LNTRNI -3333- >ALPHA-TRICHOSANTHIN; SWP:P09989; PDB:1MRJ; DVSFRLSGATSSSYGVFISNLRKALPNERKLYDIPLLRSSLPGSQRYALIHLTNYADETI -----2222---------------------iiii-------1111--------1111--- SVAIDVTNVYIMGYRAGDTSYFFNEASATEAAKYVFKDAMRKVTLPYSGNYERLQTAAGK ---------------!!!!-----------3333-1111--------------------- IRENIPLGLPALDSAITTLFYYNANSAASALMVLIQSTSEAARYKFIEQQIGKRVDKTFL 3333------------------1111---------------------------------- PSLAIISLENSWSALSKQIQIASTNNGQFESPVVLINAQNQRVTITNVDAGVVTSNIALL ------------------------iiii--------1111------1111---------- LNRNNMA -1111-- >POL POLYPROTEIN; SWP:P03367; PDB:1MRXA; PQITLWKRPLVTIKIGGQLKEALLDTGADDTVIEEMSLPGRWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------2222-------- QIIIEICGHKAIGTVLVGPTPFNVIGRNLLTQIGCTLNF -----iiii----------------3333-1111----- >RIBOFLAVIN KINASE/FMN ADE; SWP:Q9WZW1; PDB:1MRZA; VVSIGVFDGVHIGHQKVLRTMKEIAFFRKDDSLIYTISYPPEYFLPDFPGLLMTVESRVE ------2222-----------------------------3333-1111------------ MLSRYARTVVLDFFRIKDLTPEGFVERYLSGVSAVVVGRDFRFGKNASGNASFLRKKGVE -3333------33331111---------2222-----------2222-------1111-- VYEIEDVVVQGKRVSSSLIRNLVQEGRVEEIPAYLGRYFEIEGIVFPTANIDRGNEKLVD ----------------------111133333333-------------------------- LKRGVYLVRVHLPDGKKKFGVMNVGFRRNVKYEVYILDFEGDLYGQRLKLEVLKFMRDEK ------------------------------------------------------------ KEELKAAIDQDVKSARNMIDDIINSK --3333-------------------- >TRANS-SIALIDASE; SWP:Q26966; PDB:1MS9A; APGSSRVELFKRQSSKVPFEKDGKVTERVVHSFRLPALVNVDGVMVAIADARYETSFDNS 2222------2222------iiii----------------iiii---------------- LIDTVAKYSVDDGETWETQIAIKNSRASSVSRVVDPTVIVKGNKLYVLVGSYNSSRSYWT ----------iiii-------------1111---------!!!!-------------333 SHGDARDWDILLAVGEVTKSTAGGKITASIKWGSPVSLKEFFPAEMEGMHTNQFLGGAGV 3---1111------------2222-------------3333------------------- AIVASNGNLVYPVQVTNKKKQVFSKIFYSEDEGKTWKFGKGRSAFGCSEPVALEWEGKLI ---1111---------1111----------iiii---------2222-------iiii-- 
INTRVDYRRRLVYESSDMGNTWLEAVGTLSRVWGPSPKSNQPGSQSSFTAVTIEGMRVML ----2222--------iiii--------2222---1111-------------iiii---- FTHPLNFKGRWLRDRLNLWLTDNQRIYNVGQVSIGDENSAYSSVLYKDDKLYCLHEINSN -----3333------------------------!!!!---------%%%%--------%% EVYSLVFARLVGELRIIKSVLQSWKNWDSHLSSICTPAGCGPAVTTVGLVGFLSHSATKT %%----------------------------1111-----------2222----------- EWEDAYRCVNASTANAERVPNGLKFAGVGGGALWPVSQQGQNQRYHFANHAFTLVASVTI ---1111-----------2222----2222------------1111-------------- HEVPKGASPLLGASLDSSGGKKLLGLSYDKRHQWQPIYGSTPVTPTGSWEMGKRYHVVLT ---------------3333---------1111-----!!!!--------2222------- MANKIGSVYIDGEPLEGSGQTVVPDERTPDISHFYVGGYKRSGMPTDSRVTVNNVLLYNR -%%%%----iiii-2222----------------------1111---------------- QLNAEEIRTLFLSQDLIGTEAH ----------1111----1111 >BACTERIOPHAGE MS2 COAT PR; SWP:P03612; PDB:1MSC; ASNFTQFVLVDNGGTGDVTVAPSNFANGVAEWISSNSRSQAYKVTCSVRQSSAQNRKYTI ------------------------------------------------------------ KVEVPKVATQTVGGVELPVAARRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPS --------------------------------11113333-------------------- AIAANSGIY -1111---- >COBALAMIN-DEPENDENT METHI; SWP:P13009; PDB:1MSK; TPPVTLEAARDNDFAFDWQAYTPPVAHRLGVQEVEASIETLRNYIDWTPFFMTWSLAGKY ----------------3333----------------33333333--33333333------ PRILEDEVVGVEAQRLFKDANDMLDKLSAEKTLNPRGVVGLFPANRVGDDIEIYRDETRT 3333------------------------------------------!!!!-----3333- HVINVSHHLRQQTEKTGFANYCLADFVAPKLSGKADYIGAFAVTGGLEEDALADAFEAQH -----------------------1111-3333-------------1111-------1111 DDYNKIMVKALADRLAEAFAEYLHERVRKVYWGYAPNENLSNEELIRENYQGIRPAPGYP ----------------------------------1111--3333--------------11 ACPEHTEKATIWELLEVEKHTGMKLTESFAMWPGASVSGWYFSHPDSKYYAVAQIQRDQV 11-3333-----1111-1111----1111--------------1111------------- EDYARRKGMSVTEVERWLAPNLGYDAD -----------------3333------ >MAJOR SPERM PROTEIN; SWP:P27441; PDB:1MSPA; SVPPGDINTQPSQKIVFNAPYDDKHTYHIKITNAGGRRIGWAIKTTNMRRLSVDPPCGVL ------------------------------------------------------------ 
DPKEKVLMAVSCDTFNAATEDLNNDRITIEWTNTPDGAAKQFRREWFQGDGMVRRKNLPI 2222-----------3333---------------2222----3333-------------- EYNL ---- >DNA-directed RNA polymera; SWP:P00573; PDB:1MSWD; NTINIAKNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEARFRKMFERQLKA ---------3333--3333----------------------------------------- GEVADNAAAKPLITTLLPKMIARINDWFEEVKAKRGKRPTAFQFLQEIKPEAVAYITIKT -333311113333-------------------------33331111-------------- TLACLTSADNTTVQAVASAIGRAIEDEARFGRIRDLEAKHFKKNVEEQLNKRVGHVYKKA --3333------------------------1111----3333---3333----3333--- FMQVVEADMLSKGLLGGEAWSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQSETIELAPE ------------------------3333------------------------------33 YAEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTHSKKALMRYE 33---------------------------------------------------3333--- DVYMPEVYKAINIAQNTAWKINKKVLAVANVITKWKHCPVEDIPAIEREELPMKTAWKRA -------------1111-------------1111---------------------3333- AAAVYRKDKARKSRRISLEFMLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMT -----------------------33331111---------1111--------1111---- KGLLTLAKGKPIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENT -------------------------1111----3333----------------------- WWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAMLRDEVGGRAVN -1111-------------------1111-------------------------------- LLPSETVQDIYGIVAKKVNEILQADAINGTDNEVVTVTDENTGEISEKVKLGTKALAGQW ---------------------------------------------------3333----- LAYGVTRSVTKRSVMTLAYGSKEFGFRQQVLEDTIQPAIDSGKGLMFTQPNQAAGYMAKL -----3333------3333--3333----------------------------------- IWESVSVTVVAAVEAMNWLKSAAKLLAAEVKDKKTGEILRKRCAVHWVTPDGFPVWQEYK ------------------------------------------------1111-------- KPIQTRLNLMFLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHE ------------------------------------------------------------ KYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESCDVLADFYDQFADQLHESQLDK ----------------3333---------------------------------------- MPALPAKGNLNLRDILESDFAFA ----------33331111----- >DNA-BINDING PROTEIN SMUBP; SWP:P38935; PDB:1MSZA; 
GVDHFRAMIVEFMASKKMQLEFPPSLNSHDRLRVHQIAEEHGLRHDSSGEGKRRFITVSK -3333-----------------3333-3333----------------------------- RA -- >FATTY-ACID AMIDE HYDROLAS; SWP:P97612; PDB:1MT5A; ARGAATRARQKQRASLETMDKAVQRFRLQNPDLDSEALLTLPLLQLVQKLQSGELSPEAV 33333333--------------1111--------1111---------------------- FFTYLGKAWEVNKGTNCVTSYLTDCETQLSQAPRQGLLYGVPVSLKECFSYKGHDSTLGL ----------3333-------11113333---1111-1111----3333-2222-----3 SLNEGMPSESDCVVVQVLKLQGAVPFVHTNVPQSMLSFDCSNPLFGQTMNPWKSSKSPGG 333---------------1111-------------------3333----3333------- SSGGEGALIGSGGSPLGLGTDIGGSIRFPSAFCGICGLKPTGNRLSKSGLKGCVYGQTAV -----------------------1111-------------1111---------------- QLSLGPMARDVESLALCLKALLCEHLFTLDPTVPPLPFREEVYRSSRPLRVGYYETDNYT ---------3333----------3333-----------3333------------------ MPSPAMRRALIETKQRLEAAGHTLIPFLPNNIPYALEVLSAGGLFSDGGRSFLQNFKGDF -----------------1111-----------------------111133333333---- VDPCLGDLILILRLPSWFKRLLSLLLKPLFPRLAAFLNSMRPRSAEKLWKLQHEIEMYRQ -3333-----11113333-------3333-------3333-------------------- SVIAQWKAMNLDVLLTPMLGPALDLNTPGRATGAISYTVLYNCLDFPAGVVPVTTVTAED --------------------------33333333-------3333-----------1111 DAQMELYKGYFGDIWDIILKKAMKNSVGLPVAVQCVALPWQEELCLRFMREVEQLMT -3333-------3333---------2222--------2222---------------- >6-PHOSPHOFRUCTOKINASE; SWP:P00512; PDB:1MTOA; MKRIGVLTSGGDSPGMNAAIRSVVRKAIYHGVEVYGVYHGYAGLIAGNIKKLEVGDVGDI ----------------------------------------3333--------1111---1 IHRGGTILYTARCPEFKTEEGQKKGIEQLKKHGIEGLVVIGGDGSYQGAKKLTEHGFPCV 111--1111----3333-3333-------1111--------33331111-3333------ GVPGTIDNDIPGTDFTIGFDTALNTVIDAIDKIRDTATSHERTWVIEVMGRHAGDIALYS ----3333-----------------------------------------!!!!------- GLAGGAETILIPEADYDMNDVIARLKRGHERGKKHSIIIVAEGVGSGVDFGRQIQEATGF ----------1111--3333---------------------------------------- ETRVTVLGHVQRGGSPTAFDRVLASRLGARAVELLLEGKGGRCVGIQNNQLVDHDIAEAL -------3333-----3333-----------3333-----------%%%%--------11 ANKHTIDQRMYALSKELSI 11----3333--------- >SERINE PROTEINASE INHIBIT; 
SWP:Q47NK3; PDB:1MTPA; GGFLRDDHLEFALHLHRRLAEAVPDGEVIWSPYSVACALGVLAAGARATTRTELTTLLGT ----------------------1111----------------1111-------------- DPAPLLAALDRAVTDSPDLASRTVLWVSADVPVRSSFRATMHDRPDSDVRTADFRTNPEG ----------1111-1111--------1111---------3333---------1111--- VRATVNADIADATRGMIRELLPQGAVTPDLRAILTNALWAKARWTTPFEAHLTREGTFRT ----------1111-------2222-1111------------------3333-------1 PRGPKRVPFMHRTKTMPYATARGWRMVTLHAHDELAVDVLLPPGTNAAAVPTAPLLTALH 111-----------------iiii-------%%%%-------1111-------------- RRSASTSVELALPRFELTQPHQLVEVLAEAGVRTLFTASADLSGISTVPLYVDTVIHQAR ---------------------------1111-33331111-1111--------------- LRVDERGAEGAAATAAMMLL ---1111------------- >MARGATOXIN; SWP:P40755; PDB:1MTX; TIINVKCTSPKQCLPPCKAQFGQSAGAKCMNGKCKCYPH --------3333--1111--------------------- >Methane monooxygenase com; SWP:P18798; PDB:1MTYB; ERRRGLTDPEMAAVILKALPEAPLDGNNKMGYFVTPRWKRLTEYEALTVYAQPNADWIAG ---3333--------1111---------2222---------------2222---1111-- GLDWGDWTQKFHGGRPSWGNETTELRTVDWFKHRDPLRRWHAPYVKDKAEEWRYTDRFLQ ------------------3333------1111--1111---------------------- GYSADGQIRAMNPTWRDEFINRYWGAFLFNEYGLFNAHSQGAREALSDVTRVSLAFWGFD ------3333-------------------------------------------------- KIDIAQMIQLERGFLAKIVPGFDESTAVPKAEWTNGEVYKSARLAVEGLWQEVFDWNESA ------------------2222--------------1111-------------------- FSVHAVYDALFGQFVRREFFQRLAPRFGDNLTPFFINQAQTYFQIAKQGVQDLYYNCLGD ----------------------3333----3333-------------------------- DPEFSDYNRTVMRNWTGKWLEPTIAALRDFMGLFAKLPAGTTDKEEITASLYRVVDDWIE -------------------------------3333--2222------------------- DYASRIDFKADRDQIVKAVLAGLK -------------------1111- >Methane monooxygenase com; SWP:P22869; PDB:1MTYD; AANRAPTSVNAQEVHRWLQSFNWDFKNNRTKYATKYKMANETKEQFKLIAKEYARMEAVK ---------333333331111---1111----------1111------------------ DERQFGSLQVALTRLNAGVRVHPKWNETMKVVSNFLEVGEYNAIAATGMLWDSAQAAEQK ---------------3333----------------------------------------- NGYLAQVLDEIRHTHQCAYVNYYFAKNGQDPAGHNDARRTRTIGPLWKGMKRVFSDGFIS 
--------------------------------1111-------3333-------3333-- GDAVECSLNLQLVGEACFTNPLIVAVTEWAAANGDEITPTVFLSIETDELRHMANGYQTV ------------------------------1111--3333-------------------- VSIANDPASAKYLNTDLNNAFWTQQKYFTPVLGMLFEYGSKFKVEPWVKTWDRWVYEDWG --11113333-------------------------------------------------- GIWIGRLGKYGVESPRSLKDAKQDAYWAHHDLYLLAYALWPTGFFRLALPDQEEMEWFEA ------3333----1111--------------------3333------------------ NYPGWYDHYGKIYEEWRARGCEDPSSGFIPLMWFIENNHPIYIDRVSQVPFCPSLAKGAS -2222--------------1111-----3333--1111-------------3333----- TLRVHEYNGEMHTFSDQWGERMWLAEPERYECQNIFEQYEGRELSEVIAELHGLRSDGKT ------iiii---------------1111----3333-22223333--------1111-- LIAQPHVRGDKLWTLDDIKRLNCVFKNPVKAF -----------------3333-------1111 >Methane monooxygenase com; SWP:P11987; PDB:1MTYG; LGIHSNDTRDAWVNKIAHVNTLEKAAEMLKQFRMDHTTPFRNSYELDNDYLWIEAKLEEK ---------------1111------------------1111--1111------------- VAVLKARAFNEVDFRHKTAFGEDAKSVLDGTVAKMNAAKDKWEAEKIHIGFRQAYKPPIM -----------------1111-------------1111---------------------- PVNYFLDGERQLGTRLMELRNLNYYDTPLEELRKQRGVRVVH ------------------11111111---------------- >PROLINE IMINOPEPTIDASE; SWP:P96084; PDB:1MTZA; ECIENYAKVNGIYIYYKLCKAPEEKAKLMTMHGGPGMSHDYLLSLRDMTKEGITVLFYDQ --------iiii--------------------------3333----3333---------2 FGCGRSEEPDQSKFTIDYGVEEAEALRSKLFGNEKVFLMGSSYGGALALAYAVKYQDHLK 222------3333-----------------!!!!--------------------3333-- GLIVSGGLSSVPLTVKEMNRLIDELPAKYRDAIKKYGSSGSYENPEYQEAVNYFYHQHLL ------------------------------------11111111---------------- RSEDWPPEVLKSLEYAERRNVYRIMNGPNEFTITGTIKDWDITDKISAIKIPTLITVGEY -------------------3333-----1111--1111---1111------------111 DEVTPNVARVIHEKIAGSELHVFRDCSHLTMWEDREGYNKLLSDFILKHL 1--3333-------2222----------1111-------------1111- >HIV-2 RT; SWP:P04584; PDB:1MU2A; AKVEPIKIMLKPGKDGPKLRQWPLTKEKIEALKEICEKMEKEGQLEEAPPTNPYNTPTFA ----------2222----------3333--------------------1111-------- IKKKDRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKRRITVLDVGDAYFSIPLHEDFRP 
----------------------------------------------3333----333333 YTAFTLKRYIYKVLPQGWKGSPAIFQHTMRQVLEPFRKANKDVIIIQYMDDILIASDRTD 33------------2222---------------------1111----------------- LEHDRVVLQLKELLNGLGFSTPDEKFQKDPPYHWMGYELWPTKWKLQKIQLPQKEIWTVN --------------1111---3333--------iiii----------------------- DIQKLVGVLNWAAQLYPGIKTKHLCRLISGKMTLTEEVQWTELAEAELEENRIILSQEQE -----------1111------3333-------1111------------------------ GHYYQEEKELEATVQKDQDNQWTYKIHQEEKILKVGKYAKVTHTNGIRLLAQVVQKIGKE ----3333-------------------!!!!----------------------------- ALVIWGRIPKFHLPVEREIWEQWWDNYWQVTWIPDWDFVSTPPLVRLAFNLVGDPIPGAE ---------------3333------------------------------------2222- TFYTDGSCNRQSKEGKAGYVTDRGKDKVKKLEQTTNQQAELEAFAMALTDSGPKVNIIVD -----------------------------------------------1111--------- SQYVMGIVASQPTESESKIVNQIIEEMIKKEAIYVAWVPAHKGIGGNQEVDHLVSQGI -------------------------3333-------------------------2222 >MUCONATE LACTONIZING ENZY; SWP:P08310; PDB:1MUCA; ALIERIDAIIVDLPTIRQQQTLVVLRVRCSDGVEGIGEATTIGGLAYGYESPEGIKANID ----------------------------1111---------------------------- AHLAPALIGLAADNINAAMLKLDKLAKGNTFAKSGIESALLDAQGKRLGLPVSELLGGRV --333322221111------------------------------------3333------ RDSLEVAWTLASGDTARDIAEARHMLEIRRHRVFKLKIGANPVEQDLKHVVTIKRELGDS -----------------------------------------3333-----------!!!! 
ASVRVDVNQYWDESQAIRACQVLGDNGIDLIEQPISRINRGGQVRLNQRTPAPIMADESI ------%%%%-3333--------------------3333-----------------3333 ESVEDAFSLAADGAASIFALKIAKNGGPRAVLRTAQIAEAAGIGLYGGTMLEGSIGTLAS -3333---------------3333--3333------------------------------ AHAFLTLRQLTWGTELFGPLLLTEEIVNEPPQYRDFQLHIPRTPGLGLTLDEQRLARFAR ---1111--1111---3333-----------------------!!!!------------- >G:T/U SPECIFIC DNA GLYCOS; SWP:P43342; PDB:1MUGA; MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL -------------------------------1111----------------3333--333 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 3--------------3333----------------------------------------- AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV ---------!!!!--------1111--------------3333-- >DNA BINDING PROTEIN HU-AL; SWP:P02342; PDB:1MULA; MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAEAAA ----------------3333-----------------------2222------------- NVPAFVSGKALKDAVK ------------1111 >ADENINE GLYCOSYLASE; SWP:P17802; PDB:1MUN; MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP ---------------------1111------------------3333------------- TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS --------3333----2222-------------------iiii----------2222--- TAGAILSLSLGKHFPILNGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF ---------------------------------3333----------------2222--- NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK -----------------33331111---------3333------- >TN5 TRANSPOSASE; SWP:Q46731; PDB:1MUSA; SALHRAADWAKSVFSSAALGDPRRTARLVNVAAQLAKYSGKSITISSEGSKAAQEGAYRF -------------1111---3333-------------22223333-iiii---------- IRNPNVSAEAIRKAGAMQTVKLAQEFPELLAIEDTTSLSYRHQVAEELGKLGSIQKASRG --1111---------------3333----------------3333-------1111---- WWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADADEKESGKWLAAAATSRLRMGSMMSNV -----------------------------3333---3333------------!!!!1111 IAVCDREADIHAYLQDKLAHNERFVVRSKHPRKDVESGLYLYDHLKNQPELGGYQISIPQ ----3333---------1111-------------1111------1111------------ 
KGVVDKRGKRKNRPARKASLSLRSGRITLKQGNITLNAVLAEEINPPKGETPLKWLLLTS ----1111---------------------2222--------------------------- EPVESLAQALRVIDIYTHRWRIEEFHKAWKTGAGAERQRMEKPDNLERMVSILSFVAVRL ---------------------------------1111----------------------- LQLRESFTPPSQSAETVLTPDECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSK ----1111----3333-------------2222-33332222---------1111--111 RTGIASWGALWEGWEALQSKLDGFLAAKDLMAQGIKIG 1-----------------------------1111---- >NUCLEOSIDE TRIPHOSPHATE P; SWP:P08337; PDB:1MUT; MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAVVRELQEEVGI ------------------------------------------------------------ TPQHFSLFEKLEYEFPDRHITLWFWLVERWEGEPWGKEGQPGEWMSLVGLNADDFPPANE ---------------------------------------------------1111---33 PVIAKLKRL 33--3333- >XYLOSE ISOMERASE; SWP:P15587; PDB:1MUWA; SYQPTPEDRFTFGLWTVGWQGRDPFGDATRPALDPVETVQRLAELGAHGVTFHDDDLIPF ----3333----1111--------------------------1111------1111--22 GSSDTERESHIKRFRQALDATGMTVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIR 22--------------------------------3333---1111--------------- NIDLAVELGAKTYVAWGGREGAESGAAKDVRVALDRMKEAFDLLGEYVTSQGYDIRFAIE ----------------1111---1111--------------------------------- PKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGKL ---------------------1111-3333-----3333-1111----------1111-- FHIDLNGQSGIKYDQDLRFGAGDLRAAFWLVDLLESAGYEGPRHFDFKPPRTEDIDGVWA -------------------------------------------------3333------- SAAGCMRNYLILKERAAAFRADPEVQEALRASRLDELAQPTAADGVQELLADRTAFEDFD ---------------------------------3333----3333------33331111- VDAAAARGMAFERLDQLAMDHLLGAR -------------------------- >MYC BOX DEPENDENT INTERAC; SWP:O00499; PDB:1MV3A; DGSPAATPEIRVNHEPEPAGGATPGATLPKSPSQLRKGPPVPPPPKHTPSKEVKQEQILS ------------------------------------------------------------ LFEDTFVPEISVTTPSQPAEASEVAGGTQPAAGAQEPGETAASEAASSSLPAVVVETFPA ------------------------------------------------------------ TVNGTVEGGSGAGRLDLPPGFMFKVQAQHDYTATDTDELQLKAGDVVLVIPFQNPEEQDE 
-----------------2222--------------------------------3333-22 GWLMGVKESDWNQHKKLEKCRGVFPENFTERVP 22----33331111-3333-----3333----- ------------------------------------- >Multidrug resistance ABC ; SWP:Q9CHL8; PDB:1MV5A; MLSARHVDFAYDDSEQILRDISFEAQPNSIIAFAGPSGGGKSTIFSLLERFYQPTAGEIT -----------------------------------22223333---1111---------- IDGQPIDNISLENWRSQIGFVSQDSAIMAGTIRENLTYGLEGDYTDEDLWQVLDLAFARS -------------1111---------------------3333--3333------------ FVENMPDQLNTEVGERGVKISGGQRQRLAIARAFLRNPKILMLDEATASLDSESESMVQK ------!!!!-------------------------------------------------- ALDSLMKGRTTLVIAHRLSTIVDADKIYFIEKGQITGSGKHNELVATHPLYAKYVSEQLT -----1111-------33331111------------------------3333-------- VG -- >GDP-MANNOSE 6-DEHYDROGENA; SWP:P11759; PDB:1MV8A; MRISIFGLGYVGAVCAGCLSARGHEVIGVDVSSTKIDLINQGKSPIVEPGLEALLQQGRQ --------3333-------1111---------------1111-----2222--------- TGRLSGTTDFKKAVLDSDVSFICVGTPSKKNGDLDLGYIETVCREIGFAIREKSERHTVV -------------1111-----------1111-----------------1111------- VRSTVLPGTVNNVVIPLIEDCSGKKAGVDFGVGTNPEFLRESTAIKDYDFPPMTVIGELD -----2222------------------------------2222----------------- KQTGDLLEEIYRELDAPIIRKTVEVAEMIKYTCNVWHAAKVTFANEIGNIAKAVGVDGRE ----------1111-------------------------------------1111----- VMDVICQDHKLNLSRYYMRPGFAFGGSCLPKDVRALTYRASQLDVEHPMLGSLMRSNSNQ --------------2222----------------------1111--3333---------- VQKAFDLITSHDTRKVGLLGLSFKAGTDDLRESPLVELAEMLIGKGYELRIFDRNVEYAR -----------------------2222---------------1111-------------- VHGANKEYIESKIPHVSSLLVSDLDEVVASSDVLVLGNGDELFVDLVNKTPSGKKLVDLV ------------33331111-------------------3333-3333------------ GFMPHTTTAQAEGICW -------1111----- >RXR RETINOID X RECEPTOR; SWP:P19793; PDB:1MVCA; DMPVERILEAELAVEPDPVTNICQAADKQLFTLVEWAKRIPHFSELPLDDQVILLRAGWN --3333-----1111-3333-------------------2222----------------- ELLIASFSHRSIAVKDGILLATGLHVHRNSAHSAGVGAIFDRVLTELVSKMRDMQMDKTE --------1111-------1111---3333-1111------------------------- 
LGCLRAIVLFNPDSKGLSNPAEVEALREKVYASLEAYCKHKYPEQPGRFAKLLLRLPALR ----------1111----3333-------------------1111--------------- SIGLKCLEHLFFFKLIGDTPIDTFLMEMLEAP --------------------------1111-- >TRUNCATED 1,3-1,4-BETA-D-; SWP:P17989; PDB:1MVEA; VSAKDFSGAELYTLEEVQYGKFEARKAAASGTVSSFLYQNGSEIADGRPWVEVDIEVLGK ----------------------------2222------2222---------------333 NPGSFQSNIITGKAGAQKTSEKHHAVSPAADQAFHTYGLEWTPNYVRWTVDGQEVRKTEG 3-----------2222------------1111---------1111----iiii------! GQVSNLTGTQGLRFNLWSSESAAWVGQFDESKLPLFQFINWVKVYKYTPGQGEGGSDFTL !!!-----------------3333----3333--------------------iiii---- DWTDNFDTFDGSRWGKGDWTFDGNRVDLTDKNIYSRDGLILALTRKGQESFNGQVPRD ---------1111-------2222----1111------------2222---------- >IMMUNOGLOBULIN HEAVY CHAI; SWP:P0AE72; PDB:1MVFA; QVQLVESGGGSVQAGGSLRLSCAASGFTYSRKYMGWFRQAPGKEREGVAAIFIDNGNTIY ------------2222----------1111---------2222----------------- ADSVQGRFTISQDNAKNTVYLQMNSLKPEDTAMYYCAASSRWMDYSALTAKAYNSWGQGT 3333---------1111---------3333---------------11111111------- QVTVSSR ------- >IMMUNOGLOBULIN HEAVY CHAI; SWP:P18534; PDB:1MVFD; SSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPV -----!!!!-----3333------2222------iiii------ >CRYPTIC LOCI REGULATOR 4; SWP:O60016; PDB:1MVHA; KLDSYTHLSFYEKRELFRKKLREIEGPEVTLVNEVDDEPCPSLDFQFISQYRLTQGVIPP --------------------1111-----------------------------2222--- DPNFQSGCNCSSLGGCDLNNPSRCECLDDLDEPTHFAYDAQGRVRADTGAVIYECNSFCS 3333---------------1111---1111--------3333--1111-------1111- CSMECPNRVVQRGRTLPLEIFKTKEKGWGVRSLRFAPAGTFITCYLGEVITSAEAAKRDK -1111---3333------------------------2222-----------------111 NYDDDGITYLFDLDMFDDASEYTVDAQNYGDVSRFFNHSCSPNIAIYSAVRNHGFRTIYD 1-----------------------------3333-----------------3333----- LAFFAIKDIQPLEELTFDYAGAKDFSPVQ ---------2222------!!!!------ >IMMUNOGLOBULIN G BINDING ; SWP:P06654; PDB:1MVKA; MQYKVILNEAVDAATFEKVVKQFFNDNGVDGEWTYDDATKTFTVTE ------------------------1111------------------ >PPC DECARBOXYLASE ATHAL3A; SWP:Q9SWE5; PDB:1MVLA; 
KPRVLLAASGSVAAIKFGNLCHCFTEWAEVRAVVTKSSLHFLDKLSLPQEVTLYTDEDEW ----------3333--------------------3333----1111-1111---3333-- SSWNKIGDPVLHIELRRWADVLVIAPLSANTLGKIAGGLCDNLLTCIIRAWDYTKPLFVA ----------------------------------1111---------1111--------- PAMNTLMWNNPFTERHLLSLDELGITLIPPIKNGAMAEPSLIYSTVRLFWESQ --------------------3333-------------------------1111 >MURINE MINUTE VIRUS COAT ; SWP:P07302; PDB:1MVMA; GVGVSTGSYDNQTHYRFLGDGWVEITALATRLVHLNMPKSENYCRIRVHNTTDTSVKGNM ------------------------------------------------------------ AKDDAHEQIWTPWSLVDANAWGVWLQPSDWQYICNTMSQLNLVSLDQEIFNVVLKTVTEQ --------------------3333------------------------------------ DSGGQAIKIYNNDLTACMMVAVDSNNILPYTPAANSMETLGFYPWKPTIASPYRYYFCVD -------------------------------3333-------1111-------------- RDLSVTYENQEGTIEHNVMGTPKGMNSQFFTIENTQQITLLRTGDEFATGTYYFDTNPVK ------------------------------------------------------------ LTHTWQTNRQLGQPPLLSTFPEADTDAGTLTAQGSRHGATQMEVNWVSEAIRTRPAQVGF ------------------------------------------------------------ CQPHNDFEASRAGPFAAPKVPADVTQGMDREANGSVRYSYGKQHGENWAAHGPAPERYTW ---------1111----------------------------------------------- DETNFGSGRDTRDGFIQSAPLVVPPPLNGILTNANPIGTKNDIHFSNVFNSYGPLTTFSH -------------------------------1111---------1111------------ PSPVYPQGQIWDKELDLEHKPRLHITAPFVCKNNAPGQMLVRLGPNLTDQYDPNGATLSR -----------------------1111--------------------------------- IVTYGTFFWKGKLTMRAKLRANTTWNPVYQVSVEDNGNSYMSVTKWLPTATGNMQSVPLI ------------------------------------------------------------ TRPVARNTY --------- >PHOP RESPONSE REGULATOR; SWP:P13792; PDB:1MVOA; MNKKILVVDDEESIVTLLQYNLERSGYDVITASDGEEALKKAETEKPDLIVLDVMLPKLD ----------------------1111---------------------------------- GIEVCKQLRQQKLMFPILMLTAKDEEFDKVLGLELGADDYMTKPFSPREVNARVKAILRR --------1111--------------------1111------------------------ S - >Myosin regulatory light c; SWP:P02609; PDB:1MVWB; FDETEIEDFKEAFTVIDQNADGIIDKDDLRETFAAMGRLNVKNEELDAMIKEASGPINFT ----------------1111-------------1111----3333----1111------- 
VFLTMFGEKLKGADPEDVIMGAFKVLDPDGKGSIKKSFLEELLTTGGGRFTPEEIKNMWA --------------3333--------1111----------------1111---------- AFPPDVAGNVDYKNICYVITHGEDA ---3333------------------ >Myosin light chain 3, ske; SWP:P02605; PDB:1MVWC; SKAAADDFKEAFLLFDRTGDAKITASQVGDIARALGQNPTNAEINKILGNPSKEEMNAAA ----------1111-1111----1111-----1111------------1111-------- ITFEEFLPMLQAAANNKDQGTFEDFVEGLRVFDKEGNGTVMGAELRHVLATLGEKMTEEE -1111-----------1111--------1111---------------------------- VEELMKGQEDSNGCINYEAFVKHIMSV ----------------------1111- ------------------------------------------------------------ -- >GROWTH FACTOR RECEPTOR-BO; SWP:Q14451; PDB:1MW4A; GSPASGTSLSAAIHRTQLWFHGRISREESQRLIGQQGLVDGLFLVRESQRNPQGFVLSLC ------------------------3333---1111------------------------- HLQKVKHYLILPSEEEGRLYFSMDDGQTRFTDLLQLVEFHQLNRGILPCLLRHCCTRVAL -------------------------------------3333------------------- >HYPOTHETICAL PROTEIN HI14; SWP:P44209; PDB:1MW5A; ETDLLKVRQPVKLYSVATLFHEFSEVITKLEHSVQKEPTSLLSEENWHKQFLKFAQALPA 3333--------------------------1111--------3333--------1111-- HGSASWLNLDDALQAVVGNSRSAFLHQLIAKLKSRHLQVLELNKIGSEPLDLSNLPAPFY ----------------3333-----------------------3333----%%%%----- VLLPESFAARITLLVQDKALPYVRVSEYWHALEYKGELN ---------------------------3333-------- >HYPOTHETICAL PROTEIN HP01; SWP:O24970; PDB:1MW7A; KVFPKLAKAITLAAKDGGSEPDTNAKLRTAILNAKAQNMPKDNIDAAIKRASSKEGNLSE -------------------3333------------------------3333-1111---- ITYEGKANFGVLIIMECMTDNPTRTIANLKSYFNKTQGASIVPNGSLEFMFNRKSVFECL -------%%%%------------------------2222---22222222---------3 KNEVENLKLSLEDLEFALIDYGLEELEEVEDKIIIRGDYNSFKLLNEGFESLKLPILKAS 3333333---------3333-----------------3333------------------- LQRIATTPIELNDEQMELTEKLLDRIEDDDDVVALYTNIE -------------------------1111----------- >DNA topoisomerase 1; SWP:P06612; PDB:1MW9X; GKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTERGALVNRMGVDPWHNWEAHYE ----------------11111111-------------------------1111------- VLPGKEKVVSELKQLAEKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITK 
-2222----------1111--------------------------3333----------- NAIRQAFNKPGELNIDRVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVAVRLV -------------------------------------------2222------------- VEREREIKAFVPEEFWEVDASTTTPSGEALALQVTHQNDKPFRPVNKEQTQAAVSLLEKA -----------------------1111---------%%%%-------------------- RYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFGVKKTMMMAQRLYEAGYITYMRTDS -----------------------------------------------1111--------- TNLSQDAVNMVRGYISDNFGKKYLPESPNQYAREAIRPSDVNVMAESLKDMEADAQKLYQ -------------------3333----------------11113333------------- LIWRQFVACQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALEDRILPAVNKG -------1111-------------!!!!-----------!!!!--------------222 DALTLVELTPAQHFTKPPARFSEASLVKELEKRGIGRPSTYASIISTIQDRGYVRVENRR 2-----------------------------------1111--------1111----%%%% FYAEKMGEIVTDRLEENFRELMNYDFTAQMENNLDQVANHEAEWKAVLDHFFSDFTQQLD ------------------3333--------------1111-------------------- KAEKDPEEGGMRPNQM ----3333-------- >2C T CELL RECEPTOR ALPHA ; SWP:P01738; PDB:1MWAA; QSVTQPDRTSSQRKSYSATPLWVYPRQGLQLLLKYYSGDPVVQGVNGFAF ---------2----------------------------------iiii-- >MYOGLOBIN; SWP:P02189; PDB:1MWCA; GLSDGEWQLVLNVWGKVEADVAGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASED -----------------------------------333311111111------------- LKKHGNTVLTALGGILKKKGHHEAELTPLAQSHATKHKIPVKYLEFISEAIIQVLQSKHP ---------------1111--3333---------1111-3333----------------1 GDFGADAQGAMSKALELFRNDMAAKYKELGFQG 111-----------------------1111--- >PARM; SWP:P11904; PDB:1MWMA; MLVFIDDGSTNIKLQWQESDGTIKQHISPNSFKREWAVSFGDKKVFNYTLNGEQYSFDPI -----------------1111--------------------------------------- SPDTNIAWQYSDVNVVAVHHALLTSGLPVSEVDIVCTLPLTEYYDRNNQPNTENIERKKA ----3333------------------------------1111--1111----------33 NFRKKITLNGGDTFTIKDVKVMPESIPAGYEVLQELDELDSLLIIDLGGTTLDISQVMGK 33----------------------3333-3333---1111------------------%% LSGISKIYGDSSLGVSLVTSAVKDALSLARTKGSSYLADDIIIHRKDNNYLKQRINDENK %%-----------3333--------------------------1111---------3333 
ISIVTEAMNEALRKLEQRVLNTLNEFSGYTHVMVIGGGAELICDAVKKHTQIRDERFFKT -----------------------------------1111-------------1111---- NNSQYDLVNGMYLIGN -3333----------- >AMYLOID A4 PROTEIN; SWP:P05067; PDB:1MWPA; LLAEPQIAMFCGRLNMHMNVQNGKWDSDPSGTKTCIDTKEGILQYCQEVYPELQITNVVE ---------2222--------------1111------------------1111------- ANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFV ------------%%%%-------------------- >HYPOTHETICAL PROTEIN HI08; SWP:P44887; PDB:1MWQA; HYYVIFAQDIPNTLEKRLAVREQHLARLKQLQAENRLLTAGPNPAIDDENPSEAGFTGST -------------------------------1111---------------!!!!------ VIAQFENLQAAKDWAAQDPYVEAGVYADVIVKPFKKVF --------------------1111-------------- >CATALASE-PEROXIDASE PROTE; SWP:Q939D2; PDB:1MWVA; NGTSNRDWWPNQLDLSILHRHSSLSDPMGKDFNYAQAFEKLDLAAVKRDLHALMTTSQDW ---3333-1111--3333---3333---1111-----1111----------3333--111 WPADFGHYGGLFIRMAWHSAGTYRTADGRGGAGEGQQRFAPLNSWPDNANLDKARRLLWP 1-2222-------------------------11111111-33333333-3333------- IKQKYGRAISWADLLILTGNVALESMGFKTFGFAGGRADTWEPEDVYWGSEKIWLELSGG ----!!!!---------------1111-------------------------2222---- PNSRYSGDRQLENPLAAVQMGLIYVNPEGPDGNPDPVAAARDIRDTFARMAMNDEETVAL ------------------2222---1111iiii-3333---------1111--------- IAGGHTFGKTHGAGPASNVGAEPEAAGIEAQGLGWKSAYRTGKGADAITSGLEVTWTTTP ---------------------3333-3333--------!!!!!!!!------------11 TQWSHNFFENLFGYEWELTKSPAGAHQWVAKGADAVIPDAFDPSKKHRPTMLTTDLSLRF 11------------------1111-----2222--------1111------33333333- DPAYEKISRRFHENPEQFADAFARAWFKLTHRDMGPRARYLGPEVPAEVLLWQDPIPAVD ------------------------------1111-3333----------3333------- HPLIDAADAAELKAKVLASGLTVSQLVSTAWAAASTFRGSDKRGGANGARIRLAPQKDWE ---------------------------------------------22221111-333333 ANQPEQLAAVLETLEAIRTAFNGAQRGGKQVSLADLIVLAGCAGVEQAAKNAGHAVTVPF 33-------------------1111iiii--------------------1111------- APGRADASQEQTDVESMAVLEPVADGFRNYLKGKYRVPAEVLLVDKAQLLTLSAPEMTVL -------3333-33333333-----1111--------3333------1111--------- LGGLRVLGANVGQSRHGVFTAREQALTNDFFVNLLDMGTEWKPTAADADVFEGRDRATGE 
----1111-2222-2222---2222----------1111----3333------------- LKWTGTRVDLVFGSHSQLRALAEVYGSADAQEKFVRDFVAVWNKVMNLDRFDLA -----33333333---------11111111--------------1111-3333- >HYPOTHETICAL PROTEIN HI13; SWP:O86237; PDB:1MWWA; MITVFGLKSKLAPRREKLAEVIYNSLHLGLDIPKGKHAIRFLCLEKEDFYYPFDRSDDYT ------33333333------------------------------3333---11111111- VIEINLMAGRMEGTKKRLIKMLFSELEYKLGIRAHDVEITIKEQPAHCWGFRGMTGDEAR ----------3333------------------3333--------1111--iiii3333-- >ZNTA; SWP:P37617; PDB:1MWYA; SGTRYSWKVSGMDCAACARKVENAVRQLAGVNQVQVLFATEKLVVDADNDIRAQVESALQ --------------------------------------------------3333------ KAGYSLRDEQAAE ------------- >C-TERMINAL BINDING PROTEI; SWP:Q13363; PDB:1MX3A; MPLVALLDGRDCTVEMPILKDVATVAFCDAQSTQEIHEKVLNEAVGALMYHTITLTREDL -----------3333---1111---------3333-3333---------------33333 EKFKALRIIVRIGSGFDNIDIKSAGDLGIAVCNVPAASVEETADSTLCHILNLYRRATWL 333------------1111----------------1111--------------------- HQALREGTRVQSVEQIREVASGAARIRGETLGIIGLGRVGQAVALRAKAFGFNVLFYDPY ---1111------------2222--2222----------------3333--------111 LSDGVERALGLQRVSTLQDLLFHSDCVTLHCGLNEHNHHLINDFTVKQMRQGAFLVNTAR 12222-1111-----------------------1111----333311112222------- GGLVDEKALAQALKEGRIRGAALDVHESEPFSFSQGPLKDAPNLICTPHAAWYSEQASIE -------------------------------11111111----------3333------- MREEAAREIRRAITGRIPDSLKNCVN -----------------1111----- >ALPHA AMYLASE; SWP:O08452; PDB:1MXGA; AKYLELEEGGVIMQAFYWDVPGGGIWWDHIRSKIPEWYEAGISAIWLPPPSKGMSGGYSM ----1111-----------------------------1111-----------1111---- GYDPYDYFDLGEYYQKGTVETRFGSKEELVRLIQTAHAYGIKVIADVVINHRAGGDLEWN -----1111-----iiii--3333------------1111-------------------- PFVGDYTWTDFSKVASGKYTANYLDFHPNELHCCDEGTFGGFPDICHHKEWDQYWLWKSN ----------1111-------3333---3333------!!!!---1111----------- ESYAAYLRSIGFDGWRFDYVKGYGAWVVRDWLNWWGGWAVGEYWDTNVDALLSWAYESGA -------1111-------3333-3333--------------------------------- KVFDFPLYYKMDEAFDNNNIPALVYALQNGQTVVSRDPFKAVTFVANHDTDIIWNKYPAY 
--------------------------1111-3333-1111-------------------- AFILTYEGQPVIFYRDFEEWLNKDKLINLIWIHDHLAGGSTTIVYYDNDELIFVRNGDSR ---------------------3333---------------------1111-------111 RPGLITYINLSPNWVGRWVYVPKFAGACIHEYTGNLGGWVDKRVDSSGWVYLEAPPHDPA 1-------------------3333---------1111--------------------333 NGYYGYSVWSYCGVG 3-------------- >PTERIDINE REDUCTASE 2; SWP:Q8I814; PDB:1MXHA; CPAAVITGGARRIGHSIAVRLHQQGFRVVVHYRHSEGAAQRLVAELNAARAGSAVLCKGD ---------------------1111--------------------33332222------- LSLSSSLLDCCEDIIDCSFRAFGRCDVLVNNASAYYPTPLLPPIDAQVAELFGSNAVAPL ---1111---------------------------------------------3333---- FLIRAFARRQSRNLSVVNLCDAMTDLPLPGFCVYTMAKHALGGLTRAAALELAPRHIRVN ------1111----------1111---2222--------------------3333----- AVAPGLSLLPPAMPQETQEEYRRKVPLGQSEASAAQIADAIAFLVSKDAGYITGTTLKVD ---------3333-------3333---------------------1111----------i GGLILARA iii----- >HYPOTHETICAL TRNA/RRNA ME; SWP:P44868; PDB:1MXIA; MLDIVLYEPEIPQNTGNIIRLCANTGFRLHLIEPLGFTWDDKRLRRSGLDYHEFAEIKRH ----------3333-----------------------1111---3333-3333------- KTFEAFLESEKPKRLFALTTKGCPAHSQVKFKLGDYLMFGPETRGIPMSILNEMPMEQKI ------------------1111--1111---2222-----1111--333311113333-- RIPMTANSRSMNLSNSVAVTVYEAWRQLGYKGAVNL -------------------------11112222--- >RIBONUCLEOTIDE REDUCTASE ; SWP:P00453; PDB:1MXRA; AYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRI ----------3333-----------------------------1111-3333--1111-- DYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSR -1111-----------------------3333--1111--3333---------------- SYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGK ----3333---3333--------------2222-----------------------iiii TVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEAL --------------------------------------1111------------------ HLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLN -----------------3333-----------------------------1111-2222- KDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWL ----------------1111-------------3333-- 
>KDPG ALDOLASE; SWP:P00885; PDB:1MXSA; LSMADKAARIDAICEKARILPVITIAREEDILPLADALAAGGIRTLEVTLRSQHGLKAIQ ---------------------------------------------------1111----- VLREQRPELCVGAGTVLDRSMFAAVEAAGAQFVVTPGITEDILEAGVDSEIPLLPGISTP -----3333----------------------------------3333------------- SEIMMGYALGYRRFKLFPAEISGGVAAIKAFGGPFGDIRFCPTGGVNPANVRNYMALPNV ------1111------------------------------------3333------1111 MCVGTTWMLDSSWIKNGDWARIEACSAEAIALLDAN -----1111--------------------3333--- >IRON (III) SUPEROXIDE DIS; SWP:Q8DIR2; PDB:1MY6A; AFVQEPLPFDPGALEPYGMSAKTLEFHYGKHHKGYVDNLNKLTQDTELADKSLEDVIRTT ---------11113333-------------------------11113333---------2 YGDAAKVGIFNNAAQVWNHTFFWNSLKPGGGGVPTGDVAARINSAFGSYDEFKAQFKNAA 222--------------------------------------------------------- ATQFGSGWAWLVLEAGTLKVTKTANAENPLVHGQVPLLTIDVWEHAYYLDYQNRRPDFID -------------iiii------!!!!3333-----------3333----!!!!------ NFLNQLVNWDFVAKNLAA ------------------ >NF-KAPPAB P65 (RELA) SUBU; SWP:Q04207; PDB:1MY7A; TAELKICRVNRRSGSCLGGDEIFLLCDKVQKEDIEVYFTGPGWEARGSFSQADVHRQVAI --------------1111-----------1111------2222------3333-iiii-- VFRTPPYADPSLQAPVRVSMQLRRPSDRELSEPMEFQYLPDTDDRHR --------1111------------1111--------------1111- >ARC REPRESSOR; SWP:P03050; PDB:1MYLA; KMPQFNLRWPREVLDLVRKVAEENGMSVNSYIYQLVMESFKKEGR ----------------------------------------1111- >DROSOMYCIN; SWP:P41964; PDB:1MYN; DCLSGRYKGPCAVWDNETCRRVCKEEGRSSGHCSPSLKCWCEGC -----------1111----------------------------- >MYROSINASE; SWP:P29736; PDB:1MYR; EITCQENNPFTCGNTDGLNSSSFEADFIFGVASSAYQIEGTIGRGLNIWDGFTHRYPDKS -------------3333-3333-1111------3333---2222-----------1111- GPDHGNGDTTCDSFSYWQKDIDVLDELNATGYRFSIAWSRIIPRGKRSRGVNQKGIDYYH 1111----!!!!------------------------3333-11113333----------- GLIDGLIKKGITPFVTLFHWDLPQTLQDEYEGFLDPQIIDDFKDYADLCFEEFGDSVKYW ------1111------------3333----!!!!-------------------------- LTINQLYSVPTRGYGSALDAPGRCSPTVDPSCYAGNSSTEPYIVAHHQLLAHAKVVDLYR ----3333----------------11111111---3333--------------------- 
KNYTHQGGKIGPTMITRWFLPYNDTDRHSIAATERMKQFFLGWFMGPLTNGTYPQIMIDT --3333------------------------------------------------------ VGARLPTFSPEETNLVKGSYDFLGLNYYFTQYAQPSPNPVNATNHTAMMDAGAKLTYINA !!!!-----------2222-------------------1111---3333---------11 SGHYIGPLFESDGGDGSSNIYYYPKGIYSVMDYFKNKYYNPLIYVTENGISTPGSENRKE 11------------1111----3333--------------------------3333---- SMLDYTRIDYLCSHLCFLNKVIKEKDVNVKGYLAWALGDNYEFNNGFTVRFGLSYINWNN ---3333----------------------------------2222-----------1111 VTDRDLKKSGQWYQKFISP ------------------- >MYOGLOBIN; SWP:P02205; PDB:1MYT; ADFDAVLKCWGPVEADYTTMGGLVLTRLFKEHPETQKLFPKFAGIAQADIAGNAAISAHG 3333-----3333------------------333311111111--33332222------- ATVLKKLGELLKAKGSHAAILKPLANSHATKHKIPINNFKLISEVLVKVMHEKAGLDAGG ----------3333--3333--------------3333---------------------- QTALRNVMGIIIADLEANYKELGFSG -------------------1111--- >CYTOCHROME C550; SWP:P56150; PDB:1MZ4A; AELTPEVLTVPLNSEGKTITLTEKQYLEGKRLFQYACASCHVGGITKTNPSLDLRTETLA ---3333-----1111---------------------1111iiii2222----------- LATPPRDNIEGLVDYMKNPTTYDGEQEIAEVHPSLRSADIFPKMRNLTEKDLVAIAGHIL --------------------1111---------333311111111--------------- VEPKILGDKWG 3333-!!!!-- >Colicin-E7; SWP:Q47112; PDB:1MZ8B; KRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEVSK 1111-------------11111111-------------2222------------------ DPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDNISVVT 33331111-------1111-----3333-!!!!---------1111--1111-------- PKRHIDIHRGK ------1111- >CARTILAGE OLIGOMERIC MATR; SWP:Q9R0G6; PDB:1MZ9A; MDLAPQMLRELQETNAALQDVRELLRQQVKEITFLKNTVMECDAC -----------------------------------------3333 >PRO-GRANZYME K; SWP:P49863; PDB:1MZAA; MEIIGGKEVSPHSRPFMASIQYGGHHVCGGVLIDPQWVLTAAHCQYRFTK ---------22221111----------------1111---3333------ >FERRIC UPTAKE REGULATION ; SWP:Q03456; PDB:1MZBA; MVENSELRKAGLKVTLPRVKILQMLDSAQRHMSAEDVYKALMEAGEDVGLATVYRVLTQF -------1111------------------------------------------------- EAAGLVVRHNFDGGHAVFELADSGHHDHMVCVDTGEVIEFMDAEIEKRQKEIVRERGFEL 
-----------------------------------------------------1111--- VDHNLVLYVRKKK ------------- >SUFE PROTEIN; SWP:P76194; PDB:1MZGA; LLPDKEKLLRNFLRCANWEEKYLYIIELGQRLPELRDEDRSPQNSIQGCQSQVWIVRQNA -----------3333-------------1111---3333-3333---------------- QGIIELQGDSDAAIVKGLIAVVFILYDQTPQDIVNFDVRPWFEKALTQHLTPSRSQGLEA -------------------------------------3333---3333------------ IRAIRAKAAALSLEHHHHHH ---------------1111- >DEOXYRIBOSE-PHOSPHATE ALD; SWP:O66540; PDB:1MZHA; MIDVRKYIDNAALKPHLSEKEIEEFVLKSEELGIYAVCVNPYHVKLASSIAKKVKVCCVI --3333-------1111------------1111------3333----------------- GFPLGLNKTSVKVKEAVEAVRDGAQELDIVWNLSAFKSEKYDFVVEELKEIFRETPSAVH -------------------1111------------1111---------------1111-- KVIVETPYLNEEEIKKAVEICIEAGADFIKTSTGFAPRGTTLEEVRLIKSSAKGRIKVKA ----3333-------------------------------------------iiii----- SGGIRDLETAISMIEAGADRIGTSSGISIAEEFLKRHLILEHHHH -------------1111---------------------------- >BETA-KETOACYLSYNTHASE III; SWP:Q9F6D4; PDB:1MZJA; GLRVPERRFSRVLGVGSYRPRREVSNKEVCTWIDSTEEWIETRTGIRSRRIAEPDETIQV ------------------------33331111--------------------33333333 MGVAASRRALEHAGVDPAEIDLVVVSTMTNFVHTPPLSVAIAHELGADNAGGFDLSAACA ---------------3333---------------------------1111------!!!! 
GFCHALSIAADAVESGGSRHVLVVATERMTDVIDLADRSLSFLFGDGAGAAVVGPSDVPG ------------3333-----------3333--------1111----------------- IGPVVRGIDGTGLGSLHMSSSWDQYVEDPSVGRPALVMDGKRVFRWAVADVVPAAREALE --------33331111----33331111------------------------------11 VAGLTVGDLVAFVPHQANLRIIDVLVDRLGVPEHVVVSRDAEDTGNTSSASVALALDRLV 11--3333-----------------------1111----3333---!!!!---------3 RSGAVPGGGPALMIGFGAGLSYAGQALLLPDPPS 333------------------------------- >KINASE ASSOCIATED PROTEIN; SWP:P46014; PDB:1MZKA; LGSSWLFLEVIAGPAIGLQHAVNSTSSSKLPVKLGRVSPSDLALKDSEVSGKHAQITWNS ----------------------1111-------------------3333----------- TKFKWELVDMGSLNGTLVNSHSISHPDLGSRKWGNPVELASDDIITLGTTTKVYVRISSQ -----------------%%%%------3333--------2222----------------- NE -- >50S RIBOSOMAL PROTEIN L1P; SWP:P35024; PDB:1MZPA; MLADKESLIEALKLALSTEYNVKRNFTQSVEIILTFKGIDKKGDLKLREIVPLPKQPSKA ----------------3333---------------------------------------- KRVLVVPSSEQLEYAKKASPKVVITREELQKLQGQKRPVKKLARQNEWFLINQESALAGR -----------------------------3333------------------3333----- ILGPALGPRGKFPTPLPNTADISEYINRFKRSVLVKTKDQPQVQVFIGTEDKPEDLAENA --11111111-----------------------------------------3333----- IAVLNAIENKAKVETNLRNIYVKTTGKAVKVKR ------3333--3333----------------- >2,5-DIKETO-D-GLUCONATE RE; SWP:Q46857; PDB:1MZRA; ANPTVIKLQDGNVMPQLGLGVWQASNEEVITAIQKALEVGYRSIDTAAAYKNEEGVGKAL -------1111-----------------------------------3333---------- KNASVNREELFITTKLWNDDHKRPREALLDSLKKLQLDYIDLYLMHWPVPAIDHYVEAWK -----3333-------1111----------------------------3333-------- GMIELQKEGLIKSIGVCNFQIHHLQRLIDETGVTPVINQIELHPLMQQRQLHAWNATHKI ------------------------------------------1111---------1111- QTESWSPLAQGGKGVFDQKVIRDLADKYGKTPAQIVIRWHLDSGLVVIPKSVTPSRIAEN -----1111--2222-------------------------1111--------------11 FDVWDFRLDKDELGEIAKLDQGKRLGPDPDQFGG 11-------------3333--------1111--- >PPR; SWP:Q9X2W8; PDB:1MZUA; IDTAEFDALPVGAIQVDGSGVIHRYNRTESRLSGRIPERVIGRNFFTEVAPCTNIPAFSG --3333----------1111---------------33332222------3333-3333-- 
RFMDGVTSGTLDARFDFVFDFQMAPVRVQIRMQNAGVPDRYWIFVRK -----1111-------------------------------------- >ADENINE PHOSPHORIBOSYLTRA; SWP:O77103; PDB:1MZVA; SLKEIGPNSLLLEDSHSLSQLLKKNYRWYSPIFSPRNVPRFADVSSITESPETLKAIRDF ------------1111-------------3333--------------------------- LVERYRTMSPAPTHILGFDARGFLFGPMIAVELGIPFVLMRKADKNAGLLIRSEPYEKEY -------------------3333---33331111-------1111--------------- KEAAPEVMTIRHGSIGKNSRVVLIDDVLATGGTALSGLQLVEASGAEVVEMVSILTIPFL ----------2222------------------------------------------3333 KAAERIHSTAGGRYKNVRFIGLLSEDVLTEANCGDL --------%%%%-1111------3333-3333---- >U4/U6 small nuclear ribon; SWP:O43172; PDB:1MZWB; EVKASLRALGEPITLFGEGPAERRERLRNIL ------1111----2222--------3333- >COPPER-CONTAINING NITRITE; SWP:Q53239; PDB:1MZYA; LSNLPRVKHTLVPPPFAHAHEQVAASGPVINEFEMRIIEKEVQLDEDAYLQAMTFDGSIP 1111--------------------------------------------------iiii-- GPLMIVHEGDYVELTLINPPENTMPHNIDFHAATGALGGGGLTLINPGEKVVLRFKATRA ------2222--------1111-------1111-%%%%1111---2222----------- GAFVYHCAPGGPMIPWHVVSGMAGCIMVLPRDGLKDHEGKPVRYDTVYYIGESDHYIPKD -----------------1111--------1111--1111--------------------1 EDGTYMRFSDPSEGYEDMVAVMDTLIPSHIVFNGAVGALTGEGALKAKVGDNVLFVHSQP 111------3333--------1111------iiii----!!!!----2222--------- NRDSRPHLIGGHGDLVWETGKFHNAPERDLETWFIRGGTAGAALYKFLQPGVYAYVNHNL -------2222-----11111111-----------2222--------------------- IEAVHKGATAHVLVEGEWDNDLMEQVVAPVG ------------------3333--------- >ANNEXIN GH1; SWP:NA; PDB:1N00A; HHHATLTVPTTVPSVSEDCEQLRKAFSGWGTNEGLIIDILGHRNAEQRNLIRKTYAETYG -------------3333---------------------1111------------------ EDLLKALDKELSNDFERLVLLWALDPAERDALLANEATKRWTSSNQVLMEIACTRSANQL ------------------------------------------------------------ LHARQAYHARYKKSLEEDVAHHTTGDFHKLLLPLVSSYRYEGEEVNMTLAKTEAKLLHEK -------------3333----------------1111----------------------- ISNKAYSDDDVIRVLATRSKAQINATLNHYKNEYGNDINKDLKADPKDEFLALLRSTVKC 1111---3333--------------------1111-3333----1111------------ 
LVYPEKYFEKVLRLAINRRGTDEGALTRVVCTRAEVDLKVIADEYQRRNSVPLTRAIVKD --3333---------------1111-------1111---------------33331111- THGDYEKLLLVLAGHVEN ----------1111---- - >PUTATIVE RIBOFLAVIN KINAS; SWP:O74866; PDB:1N05A; PEIVGPEKVQSPYPIRFEGKVVHGFGRGSKELGIPTANISEDAIQELLRYRDSGVYFGYA --------------------------3333--------------3333------------ MVQKRVFPMVMSVSAEVHLIERQGEDFYEEIMRVIVLGYIRPELNYAGLDKLIEDIHTDI -%%%%------------------------------------------------------- RVALNSMDRPSYSSYKKDPFFK ----11113333-33333333- >PUTATIVE RIBOFLAVIN KINAS; SWP:NA; PDB:1N08A; RPEIVGPEKVQSPYPIRFEGKVVHGFGRGSKELGIPTANISEDAIQELLRYRDSGVYFGY ----------------------------3333--------3333--1111---------- AMVQKRVFPMVMSVGWNPYYKNKLRSAEVHLIERQGEDFYEEIMRVIVLGYIRPELNYAG --%%%%--------------------------------2222------------------ LDKLIEDIHTDIRVALNSMDRPSYSSYKKDPFFK --------------------3333-----3333- >PROTEIN MRAZ; SWP:P75467; PDB:1N0EA; FQGHMLLGTFNITLDAKNRISLPAKLRAFFEGSIVINRGFENCLEVRKPQDFQKYFEQFN --------------1111-----3333------------------------------111 SFPSTQKDTRTLKRLIFANANFVDVDTAGRVLIPNNLINDAKLDKEIVLIGQFDHLEIWD 1--------------3333------1111------------------------------- KKLYEDYLANSESLETVAERM -----------------1111 >Fimbrial protein papE [Pr; SWP:P08407; PDB:1N0LB; VPACTVSNTTVDWQDVENGNHEKEFTVNRCPYNLGTKVTITATNTYNNAILVQSSDGLLV ------------------------------1111-----------%%%%----------- YLYNSNAGNIGTAITLGTPFTPGKITGNNADKTISLHAKLGYKPFSATATLVASYS -----iiii-----2222---------3333------------------------- >3 ANKYRIN REPEATS; SWP:NA; PDB:1N0QA; GRTPLHLAARNGHLEVVKLLLEAGADVNAKDKNGRTPLHLAARNGHLEVVKLLLEAGADV --3333-------------------1111-1111-3333--------------1111-11 NAKDKNGRTPLHLAARNGHLEVVKLLLEAGAY 11-1111-------------------1111-- >BILIN-BINDING PROTEIN; SWP:P09464; PDB:1N0SA; DVYHDGACPEVKPVDNFDWSQYHGKWWEVAKYPSPNGKYGKCGWAEYTPEGKSVKVSRYD -----------------3333-------------%%%%-----------!!!!------- VIHGKEYFMEGTAYPVGDSKIGKIYHSRTVGGYTRKTVFNVLSTDNKNYIIGYSCRYDED -iiii------------3333--------!!!!--------------------------- 
KKGHWDHVWVLSRSMVLTGEAKTAVENYLIGSPVVDSQKLVYSDFSEAACKVN -------------------------------33331111------3333---- >ELONGATION FACTOR 2; SWP:P32324; PDB:1N0UA; AFTVDQMRSLMDKVTNVRNMSVIAHVDHGKSTLTDSLVQRAGIISAGITIKSTAISLYSE ------------3333--------1111-------------------------------- MSDEDVKEIKQKTDGNSFLINLIDSPGHVDFSSEVTAALRVTDGALVVVDTIEGVCVQTE -33333333----------------------3333---1111------------------ TVLRQALGERIKPVVVINKVDRALLELQVSKEDLYQTFARTVESVNVIVSTYADEVLGDV ------1111-------------------------------------------3333--- QVYPARGTVAFGSGLHGWAFTIRQFATRYAKKFGVDKAKMMDRLWGDSFFNPKTKKWTNK --3333-------3333---3333------1111-----------------1111----- DTDAEGKPLERAFNMFILDPIFRLFTAIMNFKKDEIPVLLEKLEIVLKGDEKDLEGKALL --1111---------------------1111---------1111---!!!!--------- KVVMRKFLPAADALLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIAIKNCDPKADLML --------3333-----------3333------3333--1111----------------- YVSKMVPTSDKGRFYAFGRVFAGTVKSGQKVRIQGPNYVPGKKDDLFIKAIQRVVLMMGR ----------------------------------1111----2222-----------!!! 
FVEPIDDCPAGNIIGLVGIDQFLLKTGTLTTSETAHNMKVMKFSVSPVVQVAVEVKNAND !-------2222-------3333--------1111---------------------3333 LPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPPV ----------------------1111---------------------------------- VAYRETVESESSQTALSKSPNKHNRIYLKAEPIDEEVSLAIENGIINPRDDFKARARIMA ------------------3333------------------------1111---------- DDYGWDVTDARKIWCFGPDGNGPNLVIDQTKAVQYLHEIKDSVVAAFQWATKEGPIFGEE ---------1111----iiii-------------3333---------------------- MRSVRVNILDVTLHADAIHRGGGQIIPTMRRATYAGFLLADPKIQEPVFLVEIQCPEQAV ---------------3333-3333-------------------------------1111- GGIYSVLNKKRGQVVSEEQTPLFTVKAYLPVNESFGFTGELRQATGGQAFPQMVFDHWST -----------------------------11112222-------%%%%------------ LGSDPLDPTSKAGEIVLAARKRHGMKEEVPGWQEYYDKL ---1111-------------1111------3333----- >DNA REPAIR PROTEIN RAD51 ; SWP:Q06609; PDB:1N0WA; EIIQITTGSKELDKLLQGGIETGSITEFGEFRTGKTQICHTLAVTCQLPIDRGGGEGKAY -------------1111------------2222---------------3333-------- IDTEGTFRPERLLAVAERYGLSGSDVLDNVAYARAFNTDHQTQLLYQASAVESRYALLIV -------3333-----1111------1111------------------------------ DSATALYRELSARQHLARFLRLLRLADEFGVAVVITNAHASTTRLYLRKGRGETRICKIY ---3333-3333--------------------------1111--------!!!!------ DSPCLPEAEAFAINADGVGDAKD ----------------------- >Breast cancer type 2 susc; SWP:P51587; PDB:1N0WB; PTLLGFHTASGKKVKIAKESLDKVKNLFDEKEQ 3333---1111-----3333-11111111---- --------------------------------------------- >POLLEN ALLERGEN PHL P 1; SWP:P43213; PDB:1N10A; PKVPPGPNITATYGDKWLDAKSTWYGGGACGYKDVDKPPFSGMTGCGNTPIFKSGRGCGS ---------------------------1111--3333--iiii--------%%%%-1111 CFEIKCTKPEACSGEPVVVHITDDNEEPIAPYHFDLSGHAFGAMAKKGDEQKLRSAGELE --------3333-----------------------------11112222---1111---- LQFRRVKCKYPEGTKVTFHVEKGSNPNYLALLVKYVNGDGDVVAVDIKEKGKDKWIELKE ----------2222---------------------------------------------- SWGAIWRIDTPDKLTGPFTVRYTTEGGTKTEAEDVIPEGWKADTSYES 2222-------------------------------------------- >ANKYRIN; SWP:P16157; PDB:1N11A; 
LTPLHVASFMGHLPIVKNLLQRGASPNVSNVKVETPLHMAARAGHTEVAKYLLQNKAKVN -3333---------------------------------------------3333------ AKAKDDQTPLHCAARIGHTNMVKLLLENNANPNLATTAGHTPLHIAAREGHVETVLALLE -------3333------------------------1111-3333---------------- KEASQACMTKKGFTPLHVAAKYGKVRVAELLLERDAHPNAAGKNGLTPLHVAVHHNNLDI --------1111-3333-------3333------------------3333------3333 VKLLLPRGGSPHSPAWNGYTPLHIAAKQNQVEVARSLLQYGGSANAESVQGVTPLHLAAQ ---3333-------1111-------------------1111------1111-------11 EGHAEMVALLLSKQANGNLGNKSGLTPLHLVAQEGHVPVADVLIKHGVMVDATTRMGYTP 11-------------1111-1111-3333------------------------3333--- LHVASHYGNIKLVKFLLQHQADVNAKTKLGYSPLHQAAQQGHTDIVTLLLKNGASPNEVS ---------3333--------1111-1111-------1111------------------- SDGTTPLAIAKRLGYISVTDVLKVVTDETSFVLHRMSFPETVDE ----------1111------------------------------ >MATURE FIMBRIAL PROTEIN P; SWP:P08407; PDB:1N12A; VPACTVSNTTVDWQDVEIQTLSQNGNHEKEFTVNRCPYNLGTKVTITATNTYNNAILVQN ----------------3333-1111--------------------------%%%%----- TSNTSSDGLLVYLYNSNAGNIGTAITLGTPFTPGKITGNNADKTISLHAKLGYKGNQNLI ---1111---------iiii-----2222------------------------------- AGPFSATATLVASYS --------------- ---------------------------------------------- >Pyruvoyl-dependent argini; SWP:Q57764; PDB:1N13B; IMPPEAEIVPLPKLPMGALVPTAYGYIISDVPGETISAAISVAIPKDKSLCGLIMEYEGK --2222--------2222------------2222------------1111---------- CSKKEAEKTVREMAKIGFEMRGWELDRIESIAVEHTVEKLGCAFAAAALWYK ---------------------------------------------------- >FKBP52; SWP:Q02790; PDB:1N1AA; APLPMEGVDISPKQDEGVLKVIKREGTGTEMPMIGDRVFVHYTGWLLDGTKFDSSLKFSF -3333-----1111------------------2222---------1111----------- DLGKGEVIKAWDIAIATMKVGEVCHITCKPEYAYGSAGSPPKIPPNATLVFEVELFEFKG -------3333---11112222------3333-!!!!----------------------- E - >(+)-BORNYL DIPHOSPHATE SY; SWP:O81192; PDB:1N1BA; LWDSNYIQSLNTPYTEERHLDRKAELIVQVRILLKEKMEPVQQLELIHDLKYLGLSDFFQ ------1111----------------------1111--3333------------3333-- DEIKEILGVIYNEHKCFHNNEVEKMDLYFTALGFRLLRQHGFNISQDVFNCFKNEKGIDF 
-------------3333--------------------1111-------1111-3333--- KASLAQDTKGMLQLYEASFLLRKGEDTLELAREFATKCLQKKLDDENLLLWIRHSLDLPL -3333------------11112222---------------1111--------------33 HWRIQSVEARWFIDAYARRPDMNPLIFELAKLNFNIIQATHQQELKDLSRWWSRLCFPEK 33-3333-----------1111------------------------------33333333 LPFVRDRLVESFFWAVGMFEPHQHGYQRKMAATIIVLATVIDDIYDVYGTLDELELFTDT 1111----------------1111------------------------------------ FKRWDTESITRLPYYMQLCYWGVHNYISDAAYDILKEHGFFCLQYLRKSVVDLVEAYFHE ------3333-------------------------------3333--------------- AKWYHSGYTPSLDEYLNIAKISVASPAIISPTYFTFANASHDTAVIDSLYQYHDILCLAG ----------3333------11113333--3333-1111--------------------- IILRLPDDLGDVPKTIQCYMKETNASEEEAVEHVKFLIREAWKDMNTAIAAGYPFPDGMV -------------------------------------------------------3333- AGAANIGRVAQFIYLHGDGFSKTYEHIAGLLFEPYA -----------1111--------------------- >TORA SPECIFIC CHAPERONE; SWP:O87949; PDB:1N1CA; VDINPARALVYQLLSSLFAREVDEQRLKELTSEAAQQFWEQLSLEANFTQSVDKIRSTLN --------------------------------------------3333----------33 GIKDDEALLELAADYCGLFLVGTSASPYASLYLLLFGEQHQQSEFLHQSKLQVQSHFPEP 33-------------------------3333----1111-------------------33 ADHLAVLAYAHLCCHSENSVQLSFLQTCVNSWLAKFINHLTQCNKNGFYSAVATLTLAWV 333333---3333---3333--------3333-----------3333------------- KQDIAQLEPAVAIISL ------------1111 >INTERLEUKIN-19; SWP:Q9UHD0; PDB:1N1FA; NHGLRRCLISTDMHHIEESFQEIKRAIQAKDTFPNVTILSTLETLQIIKPLDVCCVTKNL -3333--11113333------------1111-1111--3333------------------ LAFYVDRVFKDHQEPNPKILRKISSIANSFLYMQKTLRQCQQCHCRQEATNATRVIHDNY -------3333------------------------------------------------3 DQLEVHAAAIKSLGELDVFLAWINKNHEVMSSA 333------------------------------ >MEROZOITE SURFACE PROTEIN; SWP:Q9GSQ9; PDB:1N1IA; SSAHKCIDTNVPENAACYRYLDGTEEWRCLLGFKEVGGKCVPASITCEENNGGCAPEAEC 3333---------------1111------2222--%%%%------1111%%%%-1111-- TMDDKKEVECKCTKEGSEPLFEGVFCSSSSG --1111-------------%%%%-------- >NF-YB; SWP:P25208; PDB:1N1JA; IYLPIANVARIMKNAIPQTGKIAKDAKECVQECVSEFISFITSEASERCHQEKRKTINGE 
------------11111111--3333-----------------------1111------- DILFAMSTLGFDSYVEPLKLYLQKFRE ------11113333------------- >Nuclear transcription fac; SWP:Q13952; PDB:1N1JB; LPLARIKKIMKLDEDVKMISAEAPVLFAKAAQIFITELTLRAWIHTEDNKRRTLQRNDIA ------------1111---3333-----------------------1111----3333-- MAITKFDQFDFLIDIVPR -----33331111----- >DPS PROTEIN; SWP:P83695; PDB:1N1QA; MKTSIQQLVAVLLNRQVANWVVLYVKLHNFHWNVNGPNFFTLHEKFEELYTEASGHIDTL --3333-----------------------------1111--------------------- AERVLSIGGSPIATLAASLEEASIKEATGGESAAEMVSSVVNDFVDLVGELKVARDVADE ----1111--------------------------------------------------11 ADDEATADMLDAIEAGLEKHVWMLEAFLE 11-----------------------1111 >SIALIDASE; SWP:O44049; PDB:1N1TA; AASLAPGSSRVELFKRKNSTVPFEESNGTIRERVVHSFRIPTIVNVDGVMVAIADARYET ----2222----------------1111-----------------iiii----------- SFDNSFIETAVKYSVDDGATWNTQIAIKNSRASSVSRVMDATVIVKGNKLYILVGSFNKT ---------------iiii--------------------------!!!!----------- RNSWTQHRDGSDWEPLLVVGEVTKSAANGKTTATISWGKPVSLKPLFPAEFDGILTKEFV --3333---1111-----------------------------3333-------------- GGVGAAIVASNGNLVYPVQIADMGGRVFTKIMYSEDDGNTWKFAEGRSKFGCSEPAVLEW --------1111---------1111----------iiii---------2222-------i EGKLIINNRVDGNRRLVYESSDMGKTWVEALGTLSHVWTNSPTSNQQDCQSSFVAVTIEG iii------2222--------iiii--------2222---1111-------------iii KRVMLFTHPLNLKGRWMRDRLHLWMTDNQRIFDVGQISIGDENSGYSSVLYKDDKLYSLH i---------1111------------------------!!!!---------%%%%----- EINTNDVYSLVFVRLIGELQLMKSVVRTWKEEDNHLASICTPVVPAGCGAAVPTAGLVGF ---%%%%----------------------------1111--------------2222--- LSHSANGSVWEDVYRCVDANVANAERVPNGLKFNGVGGGAVWPVARQGQTRRYQFANYRF -----------1111-----------2222----2222-----3333---1111------ TLVATVTIDELPKGTSPLLGAGLEGPGDAKLLGLSYDKNRQWRPLYGAAPASPTGSWELH ------------------------------------1111-----!!!!--------222 KKYHVVLTMADRQGSVYVDGQPLAGSGNTVVRGATLPDISHFYIGGPRSKGAPTDSRVTV 2--------%%%%----iiii-2222---------------------------------- TNVVLYNRRLNSSEIRTLFLSQDMIGTD --------------------1111---- >RIBONUCLEASE, 
SEMINAL; SWP:P00669; PDB:1N1XA; KESAAAKFERQHMDSGSSNYCNLMMRKMTQGKCKPVNTFVHESLADVKAVCSQKKVTCKN ----------------11113333-----------------------3333------111 GQTNCYQSKSTMRITDCRETGSSKYPNCAYKTTQVEKHIIVACGGKPSVPVHFDASV 1------------------1111---------------------------------- >IL-6 RECEPTOR ALPHA CHAIN; SWP:P08887; PDB:1N26A; LAPRRCPAQEVARGVLTSLPGDSVTLTCPGVEPEDNATVHWVLRKPAAGSHPSRWAGMGR -----------2222---2222-----1111----------------------------- RLLLRSVQLHDSGNYSCYRAGRPAGTVHLLVDVPPEEPQLSCFRKSPLSNVVCEWGPRST -------1111----------------------------------1111----------- PSLTTKAVLLVRKFQNSPAEDFQEPCQYSQESQKFSCQLAVPEGDSSFYIVSMCVASSVG -1111------------------------------------2222-----------1111 SKFSKTQTFQGCGILQPDPPANITVTAVARNPRWLSVTWQDPHSWNSSFYRLRFELRYRA ---------------------------2222----------3333--------------3 ERSKTFTTWMVKDLQHHCVIHDAWSGLRHVVQLRAQEEFGQGEWSEWSPEAMGTPWTES 333--------%%%%--------2222---------1111------------------- >Hepatoma-derived growth f; SWP:Q9JMG7; PDB:1N27A; GSSGSSGEYKAGDLVFAKMKGYPHWPARIDELPEGAVKPPANKYPIFFFGTHETAFLGPK ---------2222-----2222-----------------2222--------------333 DLFPYKEYKDKFGKSNKRKGFNEGLWEIENSGPSSG 3--3333------------3333----1111----- >PHOSPHOLIPASE A2, MEMBRAN; SWP:P14555; PDB:1N28A; ALVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDATDRCCVTQDCCYKRLEKRGCG ----------------3333-----------------3333------------------- TKFLSYKFSNSGSRITCAKQDSCRSQLCECDKAAATCFARNKTTYNKKYQYYSNKHCRGS 1111------------------------------------3333-3333---3333---- TPRC ---- >GLUTATHIONE S-TRANSFERASE; SWP:P39100; PDB:1N2AA; MKLFYKPGACSLASHITLRESGKDFTLVSVDLMKKRLENGDDYFAVNPKGQVPALLLDDG -----2222---------1111---------1111-1111-3333-1111------1111 TLLTEGVAIMQYLADSVPDRQLLAPVNSISRYKTIEWLNYIATELHKGFTPLFRPDTPEE ----------------3333----2222--------------------3333-1111333 YKPTVRAQLEKKLQYVNEALKDEHWICGQRFTIADAYLFTVLRWAYAVKLNLEGLEHIAA 3------------------1111----------------------1111--2222----- FMQRMAERPEVQDALSAEGLK ---------------1111-- ------------------------------------------------ >ORGANIC 
HYDROPEROXIDE RES; SWP:Q9HZZ3; PDB:1N2FA; MQTIKALYTATATATGGRDGRAVSSDGVLDVKLSTPREMGGQGGAATNPEQLFAAGYSAC -----------------------1111--------3333--------------------- FIGAMKFVAGQRKQTLPADASITGKVGIGQIPGGFGLEVELHINLPGMEREAAEALVAAA ------------------------------2222----------2222------------ HQVCPYSNATRGNIDVRLNVSV ---------2222--------- >DTDP-GLUCOSE OXIDOREDUCTA; SWP:P26392; PDB:1N2SA; MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVN ------1111---------1111------1111--------------------------- AAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETD -----33331111-------------------------------------------1111 ATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVIN ------------------------------------------------------------ DQYGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGIT ------------------------------------------------------1111-- LALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTT ----------3333-------------------------------------------- >VITAMIN B12 TRANSPORT PRO; SWP:P37028; PDB:1N2ZA; AAPRVITLSPANTELAFAAGITPVGVSSYSDYPPQAQKIEQVSTWQGMNLERIVALKPDL ----------------1111------------3333-------3333------------- VIAWRGGNAERQVDQLASLGIKVMWVDATSIEQIANALRQLAPWSPQPDKAEQAAQSLLD ---3333-3333----1111--------------------3333---------------- QYAQLKAQYADKPKKRVFLQFGINPPFTSGKESIQNQVLEVCGGENIFKDSRVPWPQVSR --------1111---------------------------------1111----------- EQVLARSPQAIVITGGPDQIPKIKQYWGEQLKIPVIPLTSDWFERASPRIILAAQQLCNA ---1111--------3333-------------------3333------------------ LSQVD 1111- >MINOR CORE PROTEIN LAMBDA; SWP:P17378; PDB:1N35A; SSMILTQFGPFIESISGITDQSNDVFEDAAKAFSMFTRSDVYKALDEIPFSDDAMLPIPP -----------3333---------------------3333----------3333----11 TIYTKPSHDSYYYIDALNRVRRKTYQGPDDVYVPNCSIVELLEPHETLTSYGRLSEAIEN 11-----1111---1111--------3333-------3333------1111--------- RAKDGDSQARIATTYGRIAESQARQIKAPLEKFVLALLVAEAGGSLYDPVLQKYDEIPDL --------------------1111----3333--------1111---------------- 
SHNCPLWCFREICRHISGPLPDRAPYLYLSAGVFWLMSPRMTSAIPPLLSDLVNLAILQQ ----------------!!!!-----------------11111111--------------- TAGLDPSLVKLGVQICLHAAASSSYSWFILKTKSIFPQNTLHSMYESLEGGYCPNLEWLE ---------------------------------------1111----------------- PRSDYKFMYMGVMPLSAKYARSAPSNDKKARELGEKYGLSSVVGELRKRTKTYVKHDFAS 3333-----------3333---------------1111----------3333-------- VRYIRDAMACTSGIFLVRTPTETVLQEYTQSPEIKVPIPQKDWTGPIGEIRILKDTTSSI ------1111----------------------------3333----!!!!---------- ARYLYRTWYLAAARMAAQPRTWDPLFQAIMRSQYVTARGGSGAALRESLYAINVSLPDFK -----------------3333----------1111-----------------------22 GLPVKAATKIFQAAQLANLPFSHTSVAILADTSMGLRNQVQRRPRSIMPLNVPQQQVSAP 22-----------------3333---1111------------------------3333-- HTLTADYINYHMNLSPTSGSAVIEKVIPLGVYASSPPNQSINIDISACDASITWDFFLSV -------------------------3333------------------1111--------- IMAAIHEGVASSSIGKPFMGVPASIVNDESVVGVRAARPISGMQNMIQHLSKLYKRGFSY -----------------iiii--------------------------------------- RVNDSFSPGNDFTHMTTTFPSGSTATSTEHTANNSTMMETFLTVWGPEHTDDPDVLRLMK ---1111---------------3333-------------------1111---------11 SLTIQRNYVCQGDDGLMIIDGTTAGKVNSETIQNDLELISKYGEEFGWKYDIAYDGTAEY 111111----!!!!-------1111-----------------3333-------------% LKLYFIFGCRIPNLSRHPIVGKERANSSAEEPWPAILDQIMGVFFNGVHDGLQWQRWIRY %%%--iiii---3333-1111------------3333----------------------- SWALCCAFSRQRTMIGESVGYLQYPMWSFVYWGLPLVKAFGSDPWIFSWYMPTGDLGMYS ------------------------3333----------%%%%-----1111--------- WISLIRPLMTRWMVANGYVTDRCSTVFGNADYRRCFNELKLYQGYYMAQLPRNPKKSGRA -------------1111---------!!!!-3333-1111------1111---------- ASREVREQFTQALSDYLMQNPELKSRVLRGRSEWEKYGAGIIHNPPSLFDVPHKWYQGAQ ----------------------------------------------3333---------1 EAAIATREELAEMDETLMRARRHSYSSFSKLLEAYLLVKWRMCEAREPSVDLRLPLCAGI 111-------------------------------3333------------1111--2222 DPLNSDPFLKMVSVGPMLQSTRKYFAQTLFMAKTVSGLDVNAIDSALLRLRTLGADKKAL 1111--------------------1111----------------------1111-3333- 
TAQLLMVGLQESEADALAGKIMLQDVNTVQLARVVNLAVPDTWMSLDFDSMFKHHVKLLP ----1111------------1111--3333---------3333----------------- KDGRHLNTDIPPRMGWLRAILRFLGAGMVMTATGVAVDIYLEDIHGGGRSLGQRFMTWMR ----3333--1111--------------------------------------------11 QEGR 11-- >DEPHOSPHO-COA KINASE; SWP:P36679; PDB:1N3BA; RYIVALTGGIGSGKSTVANAFADLGINVIDADIIARQVVEPGAPALHAPEEKNWLNALLH --------22223333-----1111--------------22223333------------- PLIQQETQHQIQQATSPYVLWVVPLLVENSLYKKANRVLVVDVSPETQLKRTQRDDVTRE ----------------------11111111-1111-----------------1111---- HVEQILAAQATREARLAVADDVIDNNGAPDAIASDVARLHAHYLQLASQFVSQE ----3333-------1111--------3333------------------1111- >PROTEIN YFIA; SWP:P11285; PDB:1N3GA; MTMNITSKQMEITPAIRQHVADRLAKLEKWQTHLINPHIILSKEPQGFVADATINTPNGV ------------3333----------1111-------------------------1111- LVASGKHEDMYTAINELINKLERQLNKLQHKGEARRAATSVKDANFVEEVEEE ----------------------------------------------------- >SHC TRANSFORMING PROTEIN; SWP:P29353; PDB:1N3HA; MNKLSGGGGRRTRVEGGQLGGEEWTRHGSFVNKPTRGWLHPNDKVMGPGVSYLVRYMGCV ----------------------------------------3333---------------- EVLQSMRALDFNTRTQVTREAISLVCEAVPGAKGATRRRKPCSRPLSSILGRSNLKFAGM ----------------------------3333---------------------------- PITLTVSTSSLNLMAADCKQIIANHHMQSISFASGGDPDTAEYVAYVAKDPVNQRACHIL ---------------3333----------------------------------------- ECPEGLAQDVISTIGQAFELRFKQYLR --------------------------- >HISTONE H3 LYSINE METHYLT; SWP:O41094; PDB:1N3JA; MFNDRVIVKKSPLGGYGVFARKSFEKGELVEECLCIVRHNDDWGTALEDYLFSRKNMSAM --3333---------------------------------3333----------------- ALGFGAIFNHSKDPNARHELTAGLKRMRIFTIKPIAIGEEITISYGDDYWLSRPRLTQN ---3333---------------------------------------------------- >ASTROCYTIC PHOSPHOPROTEIN; SWP:Q9Z297; PDB:1N3KA; MAEYGTLLQDLTNNITLEDLEQLKSACKEDIPSEKSEEITTGSAWFSFLESHNKLDKDNL -----------1111----------------33331111--------------------3 SYIEHIFEISRRPDLLTMVVDYRTRVLKISEEDELDTKLTRIPSAKKYKDIIRQPSEEEI 333------------------------------3333----1111--------------1 IKLAPPPKKA 
111------- >TYROSYL-TRNA SYNTHETASE; SWP:P54577; PDB:1N3LA; APSPEEKLHLITRNLQEVLGEEKLKEILKERELKIYWGTATTGKPHVAYFVPMSKIADFL -----------2222--------------------------------------------1 KAGCEVTILFADLHAYLDNMKAPWELLELRVSYYENVIKAMLESIGVPLEKLKFIKGTDY 111---------------3333--------------------------1111---33331 QLSKEYTLDVYRLSSVVTQHDSKKAGAEVVKQVEHPLLSGLLYPGLQALDEEYLKVDAQF 111--------------------1111---------3333-----------1111----- GGIDQRKIFTFAEKYLPALGYSKRVHLMNPMVPGLTGSESKIDLLDRKEDVKKKLKKAFC -3333---------3333------------------------1111--------1111-- EPGNVENNGVLSFIKHVLFPLKSEFVILRDEKWGGNKTYTAYVDLEKDFAAEVVHPGDLK 2222-------------3333--------3333-------3333---------------- NSVEVALNKLLDPIREKFNTPALKKLASAAYP --------------------3333-------- >INTEGRIN ALPHA-X; SWP:P20702; PDB:1N3YA; QEQDIVFLIDGSGSISSRNFATMMNFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFR -----------3333---------------1111-------------------------- RSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHASYGARRDAAKILIVITDGKKEGDSL ---3333-----------------------11111111-1111----------------- DYKDVIPMADAAGIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQN 3333-----1111--------------1111---------3333------3333------ QLKEKIFAI -----1111 >CYTOCHROME P450 121; SWP:Q59571; PDB:1N40A; TATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYALCTQVLEDRRFS ----------------------------------1111-----------------3333- MKETAAAGAPRLNALTVPPEVVNNMGNIADAGLRKAVMKAITPKAPGLEQFLRDTANSLL 3333-2222--------3333-------1111------1111--2222------------ DNLITEGAPADLRNDFADPLATALHCKVLGIPQEDGPKLFRSLSIAFMSSADPIPAAKIN ------------1111---------------3333---------1111-----3333--- WDRDIEYMAGILENPNITTGLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGSF -------------3333-------------3333-------------------------- LTTALISLIQRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVRK --------------------3333----------------------------!!!!--22 GELVLVLLEGANFDPEHFPNPGSIELDRPNPTSHLAFGRGQHFCPGSALGRRHAQIGIEA 22-----------3333--1111------1111-1111-11111111------------- LLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW 
----1111----3333------------------- >HEME OXYGENASE 1; SWP:P09601; PDB:1N45A; PQDLSEALKEATKEVHTQAENAEFMRNFQKGQVTRDGFKLVMASLYHIYVALEEEIERNK --------------------------------------------------------1111 ESPVFAPVYFPEELHRKAALEQDLAFWYGPRWQEVIPYTPAMQRYVKRLHEVGRTEPELL ----3333-3333---------------1111-----------------------3333- VAHAYTRYLGDLSGGQVLKKIAQKALDLPSSGEGLAFFTFPNIASATKFKQLYRSRMNSL ----------------------------------3333-1111-------------1111 EMTPAVRQRVIEEAKTAFLLNIQLFEELQELLTH -----------------------------1111- >THYROID HORMONE RECEPTOR ; SWP:P10828; PDB:1N46A; KPEPTDEEWELIKTVTEAHVATNAQWKQKRKFLPEDIGQAPIVNAPEGGKVDLEAFSHFT ------------------3333-----------3333----------------------- KIITPAITRVVDFAKKLPMFCELPCEDQIILLKGCCMEIMSLRAAVRYDPESETLTLNGE -----------------3333--3333-----------------1111-1111------- MAVTRGQLKNGGLGVVSDAIFDLGMSLSSFNLDDTEVALLQAVLLMSSDRPGLACVERIE --------------------------3333----------------1111---------- KYQDSFLLAFEHYINYRKHHVTHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTELFPP -----------------------------------------------------3333--- LFLEVFED -------- >ISOLECTIN B4; SWP:P56625; PDB:1N47A; TESTSFSFTNFNPNQNNLILQEDALVNSAGTLELTAVAAGAPVPDSLGRALYAAPIHIHD --------------1111--------1111------------------------------ NTTLASFTTSFSFVMAAPAAAAVADGLAFFLAPPDTQPQARGGFLGLFADRAHDASYQTV --------------------------------1111----!!!!---------1111--- AVEFDTYSNAWDPNYTHIGIDTNGIESKKTTPFDMVYGEKANIVITYQASTKALAASLVF --------1111-----------------------2222---------1111-------- PVSQTSYAVSARVDLRDILPEYVRVGFSATTGLNAGVVETHDIVSWSFAVSLA 1111---------3333----------------2222---------------- >AUXILIN; SWP:Q27974; PDB:1N4CA; GPLGSPEFSMPHSSPQNRPNYNVSFSSMPGGQNERGKAAANLEGKQKAADFEDLLSGQGF ------------------------------------------------------------ NAHKDKKGPRTIAEMRKEEMAKEMDPEKLKILEWIEGKERNIRALLSTMHTVLWAGETKW ----------3333---3333-------------3333--33331111------------ KPVGMADLVTPEQVKKVYRKAVLVVHPDKATGQPYEQYAKMIFMELNDAWSEFENQGQKP ---1111--3333-------3333-3333---1111------------------------ LY -- >INOSITOL 
1,4,5-TRISPHOSPH; SWP:P11881; PDB:1N4KA; GGDVVRLFHAEQEKFLTCDEHRKKQHVFLRTTGRQSATSATSSKALWEVEVVQLFRFKHL ---------3333-------%%%%-----------1111--------------------- ATGHYLAAEVDMVYSLVSVPEGNDISSIFELDPYVRLRHLCTNTWVHSTNIPIDKEEEKP ------------------------1111-----------1111----------------- VMLKIGTSPLKEDKEAFAIVPVSPAEVRDLDFANDASKVLGSIAGKLEKGTITQNERRSV ----------------------------------------------1111---------- TKLLEDLVYFVTGGTNSGQDVLEVVFSKPNRERQKLMREQNILKQIFKLLQAPFTHAPFR ---------1111-------3333--------------------------1111------ HICRLCYRVLRHSQQDYRKNQEYIAKQFGFMQKQIGYDVLAEDTITALLHNN -----------1111--------------------1111--------1111- >FLORAL DEFENSIN-LIKE PROT; SWP:Q8H6Q1; PDB:1N4NA; ATCKAECPTWDSVCINKKPCVACCKKAKFSDGHCSKILRRCLCTKEC ------------------------1111------------------- >Geranylgeranyl transferas; SWP:P53610; PDB:1N4QB; LDFLRDRHVRFFQRCLQVLPERYSSLETSRLTIAFFALSGLDMLDSLDVVNKDDIIEWIY -------------------11111111--------------11113333-3333-----1 SLQVLPTEDRSNLDRCGFRGSSYLGIPFNPSKNPGTAHPYDSGHIAMTYTGLSCLIILGD 111---1111-1111------1111------------------3333------------- DLSRVDKEACLAGLRALQLEDGSFCAVPEGSENDMRFVYCASCICYMLNNWSGMDMKKAI -1111---------11111111----3333--------------------1111------ SYIRRSMSYDNGLAQGAGLESHGGSTFCGIASLCLMGKLEEVFSEKELNRIKRWCIMRQQ ---11111111----2222-------------------3333-------------3333- NGYHGRPNKPVDTCYSFWVGATLKLLKIFQYTNFEKNRNYILSTQDRLVGGFAKWPDSHP -----2222--------------11113333----------1111--------------- DALHAYFGICGLSLMEESGICKVHPALNVSTRTSERLRDLHQSWKT -----------3333------------------------------- >CHOLESTEROL OXIDASE; SWP:P12676; PDB:1N4WA; GYVPAVVIGTGYGAAVSALRLGEAGVQTLMLEMGQLWNQPGPDGNIFCGMLNPDKRSSWF ---------------------1111---------------1111----1111--3333-- KNRTEAPLGSFLWLDVVNRNIDPYAGVLDRVNYDQMSVYVGRGVGGGSLVNGGMAVEPKR ------222211111111----------------------------3333---------- SYFEEILPRVDSSEMYDRYFPRANSMLRVNHIDTKWFEDTEWYKFARVSREQAGKAGLGT ------1111-------------------------------------------1111--- 
VFVPNVYDFGYMQREAAGEVPKSALATEVIYGNNHGKQSLDKTYLAAALGTGKVTIQTLH --------------1111----1111--1111------3333------3333-------- QVKTIRQTKDGGYALTVEQKDTDGKLLATKEISCRYLFLGAGSLGSTELLVRARDTGTLP -------1111---------1111-----------------3333--------1111-11 NLNSEVGAGWGPNGNIMTARANHMWNPTGAHQSSIPALGIDAWDNSDSSVFAEIAPMPAG 113333----------------1111-------------------1111----------- LETWVSLYLAITKNPQRGTFVYDAATDRAKLNWTRDQNAPAVNAAKALFDRINKANGTIY -----------------------1111--------------------------------- RYDLFGTQLKAFADDFCYHPLGGCVLGKATDDYGRVAGYKNLYVTDGSLIPGSVGVNPFV ------------------------2222-------2222------3333----------- TITALAERNVERIIKQ ---------------- >IMMUNOGLOBULIN KAPPA CHAI; SWP:NA; PDB:1N4XH; EVQLQQSGPELKKPGETVKISCKATNYAFTDYSMHWVKQAPGGDLKYVGWINTETDEPTF ------------2222-----------1111--------2222----------------- ADDFKGRFAFSLDTSTSTAFLQINNLKNEDTATYFCVRDRHDYGEIFTYWGQGTTVTVSA --3333-------1111---------3333------------------------------ >IMMUNOGLOBULIN KAPPA CHAI; SWP:NA; PDB:1N4XL; MDILMTQTPLYLPVSLGDQASISCRSSQTIVHNNGNTYLEWYLQKPGQSPQLLIYKVSNR --------------2222-------------1111---------2222------------ FSGVPDRFSGSGSGTDFTLKISRVEAEDLGIYYCFQGSHFPPTFGGGTKLEIA 22223333----------------1111------------------------- >TRIOSEPHOSPHATE ISOMERASE; SWP:P48499; PDB:1N55A; AKPQPIAAANWKCNGTTASIEKLVQVFNEHTISHDVQCVVAPTFVHIPLVQAKLRNPKYV ------------------------------------------3333---------1111- ISAQNAIAKSGAFTGEVSMPILKDIGVHWVILGHSERRTYYGETDEIVAQKVSEACKQGF ------------2222------1111-----------------------------1111- MVIACIGETLQQREANQTAKVVLSQTSAIAAKLTKDAWNQVVLAYEPVWAIGTGKVATPE ------------1111-------------111133331111-----3333---------- QAQEVHLLLRKWVSENIGTDVAAKLRILYGGSVNAANAATLYAKPDINGFLVGGASLKPE ---------------------------------1111------1111-----3333---- FRDIIDATR -----1111 >CHAPERONE HSP31; SWP:P31658; PDB:1N57A; TSKNPQVDIAEDNAFFPSEYSLSQYTSPVSDLDGVDYPKPYRGKHKILVIAADERYLPTD -------3333-----------------------------------------------11 NGKLFSTGNHPIETLLPLYHLHAAGFEFEVATISGLTKFEYWAPHKDEKVPFFEQHKSLF 
11--------3333-------1111--------------11111111--------3333- RNPKKLADVVASLNADSEYAAIFVPGGHGALIGLPESQDVAAALQWAIKNDRFVISLCHG ----3333-11111111-----------11113333-----------1111-----!!!! PAAFLALRHGDNPLNGYSICAFPDAADKQTPEIGYPGHLTWYFGEELKKGNIINDDITGR ----1111---1111-------3333--3333---------------------------- VHKDRKLLTGDSPFAANALGKLAAQELAAYAG ---!!!!----3333------------3333- >CARBONYL REDUCTASE/20BETA; SWP:Q28960; PDB:1N5DA; SSNTRVALVTGANKGIGFAIVRDLCRQFAGDVVLTARDVARGQAAVKQLQAEGLSPRFHQ -----------------------------------------------3333--------- LDIIDLQSIRALCDFLRKEYGGLDVLVNNAAIAFQLDNPTPFHIQAELTMKTNFMGTRNV ----------------------------------2222---------------------- CTELLPLIKPQGRVVNVSSTEGVRALNECSPELQQKFKSETITEEELVGLMNKFVEDTKN ---3333-2222-------------1111----------------------------111 GVHRKEGWSDSTYGVTKIGVSVLSRIYARKLREQRAGDKILLNACCPGWVRTDMGGPKAP 13333-----------------------------3333-------------33333333- KSPEVGAETPVYLALLPSDAEGPHGQFVTDKKVVEWGVPPESYPWVNA -3333-----------------------iiii---------------- >PEPTIDE DEFORMYLASE; SWP:Q9I7A8; PDB:1N5NA; MAILNILEFPDPRLRTIAKPVEVVDDAVRQLIDDMFETMYEAPGIGLAATQVNVHKRIVV ------------1111-------------------------------3333--------- MDLSEDKSEPRVFINPEFEPLTEEMDQYQEGCLSVPGFYENVDRPQKVRIKALDRDGNPF -------------------------------1111------------------1111--- EEVAEGLLAVCIQHECDHLNGKLFVDYLSTLKRDRIRKKLEKQHRQQ -----------------1111-3333--------------------- >SERUM ALBUMIN; SWP:P02768; PDB:1N5UA; AHKSEVAHRFKDLGEENFKALVLIAFAQYLQQCPFEDHVKLVNEVTEFAKTCVADESAEN -----------------------------1111---------------------1111-1 CDKSLHTLFGDKLCTVATLRETYGEMADCCAKQEPERNECFLQHKDDNPNLPRLVRPEVD 111------------1111---!!!!--1111---------1111------------333 VMCTAFHDNEETFLKKYLYEIARRHPYFYAPELLFFAKRYKAAFTECCQAADKAACLLPK 3--------------------1111---3333-------------1111--3333----- LDELRDEGKASSAKQRLKCASLQKFGERAFKAWAVARLSQRFPKAEFAEVSKLVTDLTKV -----------------------------------------1111--------------- HTECCHGDLLECADDRADLAKYICENQDSISSKLKECCEKPLLEKSHCIAEVENDEMPAD 
-------------------------3333----3333---3333----1111-------- LPSLAADFVESKDVCKNYAEAKDVFLGMFLYEYARRHPDYSVVLLLRLAKTYETTLEKCC ---3333---1111-------------------1111--------------------333 AAADPHECYAKVFDEFKPLVEEPQNLIKQNCELFEQLGEYKFQNALLVRYTKKVPQVSTP 3--3333-11113333-------------------------------------1111--- TLVEVSRNLGKVGSKCCKHPEAKRMPCAEDYLSVVLNQLCVLHEKTPVSDRVTKCCTESL --------------1111-3333-----------------------------------33 VNRRPCFSALEVDETYVPKEFNAETFTFHADICTLSEKERQIKKQTALVELVKHKPKATK 33----1111--1111-----3333---3333----------------------1111-- EQLKAVMDDFAAFVEKCCKADDKETCFAEEGKKLVAASQAALG ---------------------------------------1111 >CARBON MONOXIDE DEHYDROGE; SWP:P19921; PDB:1N62A; KAHIELTINGHPVEALVEPRTLLIHFIREQQNLTGAHIGCDTSHCGACTVDLDGMSVKSC -------iiii------1111------------------------1111--iiii--111 TMFAVQANGASITTIEGMAAPDGTLSALQEGFRMMHGLQCGYCTPGMIMRSHRLLQENPS 1-33332222---3333--1111---------1111----1111---------------- PTEAEIRFGIGGNLCRCTGYQNIVKAIQYAAAKINGVPFEE -------1111------------------------------ >Carbon monoxide dehydroge; SWP:P19919; PDB:1N62B; TVEPTSAERAEKLQGMGCKRKRVEDIRFTQGKGNYVDDVKLPGMLFGDFVRSSHAHARIK ----------33332222---3333-3333----1111--2222---------------- SIDTSKAKALPGVFAVLTAADLKPLNLHYMPTLAGDVQAVLADEKVLFQNQEVAFVVAKD ---3333--2222----33333333------1111------------2222--------- RYVAADAIELVEVDYEPLPVLVDPFKAMEPDAPLLREDIKDKMTGAHGARKHHNHIFRWE -------1111-----------3333--1111---3333------------1111----- IGDKEGTDATFAKAEVVSKDMFTYHRVHPSPLETCQCVASMDKIKGELTLWGTFQAPHVI ------------------------------------------------------------ RTVVSLISGLPEHKIHVIAPDIGGGFGNKVGAYSGYVCAVVASIVLGVPVKWVEDRMENL ----------1111----------iiii----3333------------------------ STTSFARDYHMTTELAATKDGKILAMRCHVLADHGAFDACADPSKWPAGFMNICTGSYDM -----------------1111---------------------1111---1111-!!!!-- PVAHLAVDGVYTNKASGGVAYRCSFRVTEAVYAIERAIETLAQRLEMDSADLRIKNFIQP -------------------%%%%------------------------------1111-11 EQFPYMAPLGWEYDSGNYPLAMKKAMDTVGYHQLRAEQKAKQEAFKRGETREIMGIGISF 
11----1111-------------------------------------------------- FTEIVGAGPSKNCDILGVSMFDSAEIRIHPTGSVIARMGTKSQGQGHETTYAQIIATELG --------3333--iiii----------1111-------------3333----------- IPADDIMIEEGNTDTAPYGLGTYGSRSTPTAGAATAVAARKIKAKAQMIAAHMLEVHEGD -3333------3333-------%%%%------------------------------3333 LEWDVDRFRVKGLPEKFKTMKELAWASYNSPPPNLEPGLEAVNYYDPPNMTYPFGAYFCI ---------2222------------------1111------------------------- MDIDVDTGVAKTRRFYALDDCGTRINPMIIEGQVHGGLTEAFAVAMGQEIRYDEQGNVLG ----------------------------------------------------1111---- ASFMDFFLPTAVETPKWETDYTVTPSPHHPIGAKGVGESPHVGGVPCFSNAVNDAYAFLN ---------3333------------1111------1111----------------3333- AGHIQMPHDAWRLWKVGEQLGLHV --------------------1111 >Carbon monoxide dehydroge; SWP:P19920; PDB:1N62C; MIPGSFDYHRPKSIADAVALLTKLGEDARPLAGGHSLIPIMKTRLATPEHLVDLRDIGDL -----------------------!!!!-------------1111---------1111111 VGIREEGTDVVIGAMTTQHALIGSDFLAAKLPIIRETSLLIADPQIRYMGTIGGNAANGD 1----!!!!----------------------------1111-3333-------------3 PGNDMPALMQCLGAAYELTGPEGARIVAARDYYQGAYFTAIEPGELLTAIRIPVPPTGHG 333----------------1111----3333---2222---2222----------2222- YAYEKLKRKIGDYATAAAAVVLTMSGGKCVTASIGLTNVANTPLWAEEAGKVLVGTALDK --------2222--------------------------------------3333------ PALDKAVALAEAITAPASDGRGPAEYRTKMAGVMLRRAVERAKARA ----------1111----1111------------------------ >CLUMPING FACTOR; SWP:Q53653; PDB:1N67A; GTDITNQLTNVTVGIDSGTTVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKELNLNGV ---1111--------------1111------------33332222------1111----- TSTAKVPPIMAGDQVLANGVIDSDGNVIYTFTDYVNTKDDVKATLTMPAYIDPENVKKTG 1111------!!!!-------1111-------1111---------------3333----- NVTLATGIGSTTANKTVLVDYEKYGKFYNLSIKGTIDQIDKTNNTYRQTIYVNPSGDNVI -------!!!!---------------!!!!----------1111--------1111---- APVLTGNLKPNTDSNALIDQQNTSIKVYKVDNAADLSESYFVNPENFEDVTNSVNITFPN ------------------3333---------3333-1111--1111---3333------- PNQYKVEFNTPDDQITTPYIVVVNGHIDPNSKGDLALRSTLYGYNSNIIWRSMSWDNEVA 
---------1111--------------1111-------------1111------------ FNNGSGSGDGIDKPVVPEQPDEPGEIEPIPEK ---------------------2222------- >SAPOSIN B; SWP:Q92740; PDB:1N69A; GDVCQDCIQMVTDIQTAVRTNSTFVQALVEHVKEECDRLGPGMADICKNYISQYSEIAIQ --------------------1111------------------------------------ MMMHMQPKEICALVGFCD -----------1111--- >MYOCYTE-SPECIFIC ENHANCER; SWP:Q02080; PDB:1N6JA; GRKKIQISRILDQRNRQVTFTKRKFGLMKKAYELSVLCDCEIALIIFNSANRLFQYASTD -----------------------------------1111--------1111--------- MDRVLLKYTEYSEPHESRTNTDILETLKRRGIG -------3333-----------------3333- >INTERFERON-ALPHA/BETA REC; SWP:P48551; PDB:1N6UA; SYDSPDYTDESCTFKISLRNFRSILSWELKNHSIVPTHYTLLYTIMSKPEDLKVVKNCAN ------------------%%%%----------------------3333------1111-- TTRSFCDLTDEWRSTHEAYVTVLEGFSGNTTLFSCSHNFWLAIDMSFEPPEFEIVGFTNH -------------3333-----------------------1111---------------- INVMVKFPSIVEEELQFDLSLVIEEQSEGIVKKHKPEIKGNMSGNFTYIIDKLIPNTNYC ----------3333------------iiii------------------------------ VSVYLEHSDEQAVIKSPLKCTLLPPGQESEFS ---------------------------%%%%- >Hypothetical 12.3 kDa pro; SWP:Q03759; PDB:1N6ZA; MSKSNTYRMLVLLEDDTKINKEDEKFLKGKPGKMHEFVDELILPFNVDELDELNTWFDKF -------------------3333------------------------------------- DAEICIPNEGHIKYEISSDGLIVLMLDKEIEEVVEKVKKFVEENN ----1111--------3333------3333--------------- >AAC(6')-II; SWP:Q47764; PDB:1N71A; MIISEFDRNNPVLKDQLSDLLRLTWPEEYGDSSAEEVEEMMNPERIAVAAVDQDELVGFI ------1111---------------------------11113333--------------- GAIPQYGITGWELHPLVVESSRRKNQIGTRLVNYLEKEVASRGGITIYLGTDDLDHGTTL -----!!!!---------1111-----------------1111--------------111 SQTDLYEHTFDKVASIQNLREHPYEFYEKLGYKIVGVLPNANGWDKPDIWMAKTIIPRPD 1--1111-----1111-----3333--1111--------1111----------------- >AMPA RECEPTOR INTERACTING; SWP:P97879; PDB:1N7EA; GAIIYTVELKRYGGPLGITISGTEEPFDPIIISSLTKGGLAERTGAIHIGDRILAINSSS ------------------------1111-------2222--------2222----iiii- LKGKPLSEAIHLLQMAGETVTLKIKKQTDAQPASS 22223333----1111------------------- >GDP-D-MANNOSE-4,6-DEHYDRA; SWP:P93031; 
PDB:1N7HA; RKIALITGITGQDGSYLTEFLLGKGYEVHGLIRRSSNFNTQRINHIYALMKLHYADLTDA -------1111----------1111--------------33333333--------1111- SSLRRWIDVIKPDEVYNLAAQSHVAVSFEIPDYTADVVATGALRLLEAVRSHTIDSGRTV ----------------------3333---------------------------------- KYYQAGSSEMFGSTPPPQSETTPFHPRSPYAASKCAAHWYTVNYREAYGLFACNGILFNH --------3333------1111-------------------------------------- ESPRRGENFVTRKITRALGRIKVGLQTKLFLGNLQASRDWGFAGDYVEAMWLMLQQEKPD -11111111--------------------------------3333-------1111---- DYVVATEEGHTVEEFLDVSFGYLGLNWKDYVEIDQRYFRPAEVDNLQGDASKAKEVLGWK --------------------1111-3333----3333----------------------- PQVGFEKLVKMMVDEDLELAKREKVLVDAGY ------------------------------- >DEOXYRIBOSE-PHOSPHATE ALD; SWP:Q9Y948; PDB:1N7KA; PSARDILQQGLDRLGSPEDLASRIDSTLLSPRATEEDVRNLVREASDYGFRCAVLTPVYT -----------------------------1111--------------------------- VKISGLAEKLGVKLCSVIGFPLGQAPLEVKLVEAQTVLEAGATELDVVPHLSLGPEAVYR -------------------------------------1111--------1111------- EVSGIVKLAKSYGAVVKVILEAPLWDDKTLSLLVDSSRRAGADIVKTSTGVYTKGGDPVT ---------1111-------3333------------------------------------ VFRLASLAKPLGMGVKASGGIRSGIDAVLAVGAGADIIGTSSAVKVLESFKSLV ---33333333-------------------1111-------------------- >HYALURONIDASE; SWP:Q54873; PDB:1N7OA; VKDTYTDRLDDWNGIIAGNQYYDSKNDQMAKLNQELEGKVADSLSSISSQADRIYLWEKF -----------------3333-1111------------------------------3333 SNYKTSANLTATYRKLEEMAKQVTNPSSRYYQDETVVRTVRDSMEWMHKHVYNSEKSIVG -3333-------------------1111-2222-------------------1111---- NWWDYEIGTPRAINNTLSLMKEYFSDEEIKKYTDVIEKFVPDPEHFRKTTDNPVKALGGN 3333---------------1111-----------3333---1111-1111-------333 LVDMGRVKVIAGLLRKDDQEISSTIRSIEQVFKLVDQGEGFYQDGSYIDHTNVAYTGAYG 3----------------------------1111--------1111--------------- NVLIDGLSQLLPVIQKTKNPIDKDKMQTMYHWIDKSFAPLLVNGELMDMSRGRSISRANS ------------1111-----3333-----------3333-iiii-3333------1111 EGHVAAVEVLRGIHRIADMSEGETKQRLQSLVKTIVQSDSYYDVFKNLKTYKDISLMQSL 3333------------1111---------------3333---3333-------------- 
LSDAGVASVPRTSYLSAFNKMDKTAMYNAEKGFGFGLSLFSSRTLNYEHMNKENKRGWYT --3333-----------1111-------------------1111-----%%%%---1111 SDGMFYLYNGDLSHYSDGYWPTVNPYKMPGTTETDAKRADSDTGKVLPSAFVGTSKLDDA ----------1111-%%%%----11112222-------1111------------------ NATATMDFTNWNQTLTAHKSWFMLKDKIAFLGSNIQNTSTDTAATTIDQRKLESSNPYKV ---------1111---------------------------------------3333---- YVNDKEASLTEQEKDYPETQSVFLESSDSKKNIGYFFFKKSSISMSKALQKGAWKDINEG -%%%%-----------------------------------------------3333-111 QSDKEVENEFLTISQAHKQNGDSYGYMLIPNVDRATFNQMIKELESSLIENNETLQSVYD 1-----------------2222-------------------1111--------------- AKQGVWGIVKYDDSVSTISNQFQVLKRGVYTIRKEGDEYKIAYYNPETQESAPDQEVFKK 3333-------------%%%%-------------!!!!--------------3333---- L - ------------------------------------------------------------ --- >Syntaxin-1A; SWP:P32851; PDB:1N7SB; GSALSEIETRHSEIIKLENSIRELHDMFMDMAMLVESQGEMIDRIEYNVEHAVDYVERAV --3333------------------------------------------------------ SDTKKAVK ----1111 ------------------------------------------------------------ ------------------- >VESICLE-ASSOCIATED MEMBRA; SWP:NA; PDB:1N7SD; GSARENEMDENLEQVSGIIGNLRHMALDMGNEIDTQNRQIDRIMEKADSNKTRIDEANQR ------------------------------------------------------------ ATKMLG -1111- >BASEPLATE STRUCTURAL PROT; SWP:P19062; PDB:1N7ZA; IYRAIVTSKFRTEKLNFYNSIGSGPDKNTIFITFGRSEPWSSNENEVGFAPPYPTDSVLG -----------------3333--1111-------------1111-2222----------- VTDWTHGTVKVLPSLDAVIPRRDWGDTRYPDPYTFRINDIVVCNSAPYNATESGAGWLVY ----------------------2222----1111-2222------1111--2222----- RCLDVPDTGCSIASLTDKDECLKLGGKWTPSARSTPPEGRGDAEGTIEPGDGYVWEYLFE -----------------------------------------1111--------------- IPPDVSINRCTNEYIVVPWPEELKEDPTRWGYEDNLTWQQDDFGLIYRVKANTIRFKAYL ----------1111-----------3333--1111-----%%%%3333------------ DSVYFPEAALPGNKGFRQISIITNPLEAKAHPNDPNVKAEKDYYDPEDLRHSGEIYENRP 33333333-2222-----------------1111----------3333------------ PIIADQTEEINILFTF ---------------- >PLASMODIUM FALCIPARUM GAM; SWP:Q8IEU2; PDB:1N81A; 
YHYEHETHAPLSPRIRKVGDIEFHACSDYIYLLMTLSKDPEKFNYALKDRVSIRRYVRKN ---------------------------------1111----33331111----------- QNRYNYFLIEERVQDNIVNRISDRLISYCTDKEVTEDYIKKIDDYLWVEQRVIEEVSINV ------------------------------------------------------------ DHAREVKEKKRIMNDKKLIRMLFDTYEYVKDVKFTDDQYKDAAARISQFLIDVVDSYIIK ------------------------------------------------------1111-- PIPALP ------ >INTRA-CELLULAR XYLANASE; SWP:Q9ZFM8; PDB:1N82A; NSSLPSLRDVFANDFRIGAAVNPVTIEMQKQLLIDHVNSITAENHMKFEHLQPEEGKFTF 1111------1111--------------------------------3333---2222--- QEADRIVDFACSHRMAVRGHTLVWHNQTPDWVFQDGQGHFVSRDVLLERMKCHISTVVRR ----------1111--------------3333--3333---------------------- YKGKIYCWDVINEAVADEGDELLRPSKWRQIIGDDFMEQAFLYAYEADPDALLFYNDYNE 2222----------------------------1111-----------1111-------11 CFPEKREKIFALVKSLRDKGIPIHGIGMQAHWSLTRPSLDEIRAAIERYASLGVVLHITE 11--------------1111------------1111-------------1111------- LDVSMFEFHDRRTDLAAPTSEMIERQAERYGQIFALFKEYRDVIQSVTFWGIADDHTWLD ------1111-----------------------------3333---------1111-333 NFPVHGRKNWPLLFDEQHKPKPAFWRAVSV 3-------------1111------------ >NUCLEAR RECEPTOR ROR-ALPH; SWP:P35398; PDB:1N83A; HHLEVLFQGPAELEHLAQNISKSHLETCQYLREELQQITWQTFLQEEIENYQNKQREVMW -------------------------------------3333---------1111------ QLCAIKITEAIQYVVEFAKRIDGFMELCQNDQIVLLKAGSLEVVFIRMCRAFDSQNNTVY --------------------2222---------------------3333----1111--- FDGKYASPDVFKSLGCEDFISFVFEFGKSLCSMHLTEDEIALFSAFVLMSADRSWLQEKV ------33333333----------------1111---------------1111------- KIEKLQQKIQLALQHVLQKNHREDGILTKLICKVSTLRALCGRHTEKLMAFKAIYPDIVR ---------------1111----------------------------------------- LHFPPLYKELF ----------- >DAHP SYNTHETASE; SWP:P0AB91; PDB:1N8FA; LRIKEIKELLPPVALLQKFPATENAANTVAHARKAIHKILKGNDDRLLVVIGPCSIHDPV ----------3333---------------------------------------------- AAKEYATRLLALREELKDELEIVMRVYFEKPRTTVGWKGLINDPHMDNSFQINDGLRIAR --------------------------------------33331111-------------- 
KLLLDINDSGLPAAGEFLDMITPQYLADLMSWGAIGARTTESQVHRELASGLSCPVGFKN --------------------3333-3333------1111---------1111-------- GTDGTIKVAIDAINAAGAPHCFLSVTKWGHSAIVNTSGNGDCHIILRGGKEPNYSAKHVA 1111-3333-----1111-------1111------------------------------- EVKEGLNKAGLPAQVMIDFSHANSSKQFKKQMDVCADVCQQIAGGEKAIIGVMVESHLVE ------1111---------!!!!%%%%--------------------------------- GNQSLESGEPLAYGKSITDACIGWEDTDALLRQLANAVKARRG ---3333------------------------------------ >PROBABLE MALATE SYNTHASE ; SWP:Q50596; PDB:1N8IA; TDRVSVGNLRIARVLYDFVNNEALPGTDIDPDSFWAGVDKVVADLTPQNQALLNARDELQ -----!!!!--------------2222--3333--------------------------- AQIDKWHRRDMDAYRQFLTEIGYLLPEPDDFTITTSGVDAEITTTAGPQLVVPVLNARFA --------------------------------------3333----------1111---- LNAANARWGSLYDALYGTDVIPETDGAEKGPTYNKVRGDKVIAYARKFLDDSVPLSSGSF -----------------------2222-------------------------------33 GDATGFTVQDGQLVVALPDKSTGLANPGQFAGYTGAAESPTSVLLINHGLHIEILIDPES 33------%%%%----3333-----3333------3333-------iiii------1111 QVGTTDRAGVKDVILESAITTIMDFEDSVAAVDAADKVLGYRNWLGLNKGDLAARVLNRD -33331111----------------1111------------------------------- RNYTAPGGGQFTLPGRSLMFVRNVGHLMTNDAIVDTDGSEVFEGIMDALFTGLIAIHGLK ----1111--------------------------1111------------------1111 ASDINSRTGSIYIVKPKMHGPAEVAFTCELFSRVEDVLGLPQNTMKIGIMDEERRTTVNL ----------------------------------------2222--------3333---- KACIKAAADRVVFINTGFLDRTGDEIHTSMEAGPMVRKGTMKSQPWILAYEDHNVDAGLA ------1111------------------1111----3333------------------11 AGFSGRAQVGKGMWTMTELMADMVETKIAQPRAGASTAWVPSPTAATLHALHYHQVDVAA 112222---------1111-----------1111---------------3333------- VQQGLAGKRRATIEQLLTIPLAKELAWAPDEIREEVDNNCQSILGYVVRWVDQGVGCSKV ----2222---3333------------3333----------------------------- PDIHDVALMEDRATLRISSQLLANWLRHGVITSADVRASLERMAPLVDRQNAYRPMAPNF -1111--------------------1111-----------------------------11 DDSIAFLAAQELILSGAQQPNGYTEPILHRRRREFKARAAE 11------------33332222-3333----------3333 >ALCOHOL DEHYDROGENASE E C; SWP:P00327; PDB:1N8KA; 
STAGKVIKCKAAVLWEEKKPFSIEEVEVAPPKAHEVRIKMVATGICRSDDHVVSGTLVTP -2222----------------------------------------3333----------- LPVIAGHEAAGIVESIGEGVTTVRPGDKVIPLFTPQCGKCRVCKHPEGNFCLKNDLSMPR ----------------2222---2222------------3333-------1111------ GTMQDGTSRFTCRGKPIHHFLGTSTFSQYTVVDEISVAKIDAASPLEKVCLIGCGFSTGY --1111-----iiii----%%%%---------1111----111111113333-------- GSAVKVAKVTQGSTCAVFGLGGVGLSVIMGCKAAGAARIIGVDINKDKFAKAKEVGATEC ---------2222-------------------------------3333----1111---- VNPQDYKKPIQEVLTEMSNGGVDFSFEVIGRLDTMVTALSCCQEAYGVSVITGVPPDSQN -3333---3333---1111-------------------1111-------------2222- LSMNPMLLLSGRTWKGAIFGGFKSKDSVPKLVADFMAKKFALDPLITHVLPFEKINEGFD -------3333------%%%%-3333--------1111---3333-----3333------ LLRSGESIRTILTF -1111--------- >FATTY ACID SYNTHASE; SWP:P12785; PDB:1N8LA; GDGEAQRDLVKAVAHILGIRDLAGINLDSSLADLGLDSLMGVEVRQILEREHDLVLPIRE --%%%%-1111--------------1111--------------3333---------3333 VRQLTLRKLQEMSSKA -------1111----- >POTASSIUM CHANNEL BLOCKIN; SWP:P58498; PDB:1N8MA; IEAIRCGGSRDCYRPCQKRTGCPNAKCINKTCKCYGCS -------3333----------------%%%%------- >CYSTATHIONINE GAMMA-LYASE; SWP:P31373; PDB:1N8PA; TLQESDKFATKAIHAGEHVDVHGSVIEPISLSTTFKQSSPANPIGTYEYSRSQNPNRENL ------3333---2222--3333-------------------------3333-3333--- ERAVAALENAQYGLAFSSGSATTATILQSLPQGSHAVSIGDVYGGTHRYFTKVANAHGVE -----1111--------3333-----1111------------------------------ TSFTNDLLNDLPQLIKENTKLVWIETPTNPTLKVTDIQKVADLIKKHAAGQDVILVVDNT -----3333--1111--------------------------------2222--------- FLSPYISNPLNFGADIVVHSATKYINGHSDVVLGVLATNNKPLYERLQFLQNAIGAIPSP ---33333333--------3333------------------------------------- FDAWLTHRGLKTLHLRVRQAALSANKIAEFLAADKENVVAVNYPGLKTHPNYDVVLKQHR --------3333------------------------------1111--------1111-% DALGGGMISFRIKGGAEAASKFASSTRLFTLAESLGGIESLLEVPAVMTHGGIPKEAREA %%%----------------------------------------3333------------- SGVFDDLVRISVGIEDTDDLLEDIKQALKQATN --------------------------------- >CHEMOSENSORY PROTEIN; SWP:Q9U3Y0; PDB:1N8VA; 
KYTDKYDNINLDEILANKRLLVAYVNCVMERGKCSPEGKELKEHLQDAIENGCKKCTENQ ---1111-------------------1111--------------------%%%%------ EKGAYRVIEHLIKNEIEIWRELTAKYDPTGNWRKKYEDRAK --------------------------1111----------- >Receptor tyrosine-protein; SWP:P06494; PDB:1N8YC; TQVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYVPANASLSFLQDIQEVQG --------!!!!---3333--------2222-----------------1111-------- YMLIAHNQVKRVPLQRLRIVRGTQLFEDKYALAVLDNRDPQPEGLRELQLRSLTEILKGG -------------1111--------%%%%--------------------1111------- VLIRGNPQLCYQDMVLWKDVFRKNNQLAPVDIDTNRSRACPPCAPACKDNHCWGESPEDC -----1111-1111-3333--1111------------------3333--------3333- QILTGTICTSGCARCKGRLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICELHCPALV ----1111----------3333--1111-------------------iiii--------- TYESMHNPEGRYTFGASCVTTCPYNYLSTEVGSCTLVCPPNNQEVTAEDGTQRCEKCSKP ------1111------------2222--1111--------------1111---------- CARVCYGLGMEHLRGARAITSDNVQEFDGCKKIFGSLAFLPESFDGDPSSGIAPLRPEQL ------22221111-----3333---2222---------3333------------3333- QVFETLEEITGYLYISAWPDSLRDLSVFQNLRIIRGRILHDGAYSLTLQGLGIHSLGLRS 1111--------------3333--3333-----------%%%%--------------111 LRELGSGLALIHRNAHLCFVHTVPWDQLFRNPHQALLHSGNRPEEDCGLEGLVCNSLCAH 1------------1111--11113333---------------1111--------1111%% GHCWGPGPTQCVNCSHFLRGQECVEECRVWKGLPREYVSDKRCLPCHPECQPQNSSETCF %%----1111----------------------------%%%%----1111---------- GSEADQCAACAHYKDSSSCVARCPSGVKPDLSYMPIWKYPDEEGICQPCPIN --1111--------!!!!----------------------1111-------- >ORF, HYPOTHETICAL PROTEIN; SWP:NA; PDB:1N91A; MDGVMSAVTVNDDGLVLRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLG ---------------------------------1111----------3333--------- KQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALINLEHHHHHH -----1111----------------------33331111--------- >Nucleoprotein; SWP:Q01552; PDB:1N93X; KLPGKFLQYTVGGSDPHPGIGHEKDIRQNAVALLDQSRRDMFHTVTPSLVFLCLLIPGLH ---------------------3333----------3333---3333---------2222- AAFVHGGVPRESYLSTPVTRGEQTVVKTAKFYGEKTTQRDLTELEISSIFSHCCSLLIGV 
------------------------------------------------------------ VIGSSSKIKAGAEQIKKRFKTMMAALNRPSHGETATLLQMFNPHEAIDWINGQPWVGSFV ----3333---------------11113333----1111-------------1111---- LSLLTTDFESPGKEFMDQIKLVASYAQMTTYTTIKEYLAECMDATLTIPVVAYEIRDFLE ------------------------2222---------------33333333--------- VSAKLKEDHADLFPFLGAIRHPDAIKLAPRSFPNLASAAFYWSKKENSTIQPGASVKETQ --------!!!!111111113333---3333-------------------2222------ LARYRRREISRGEDGAELSGEISAIMKMIGVTGLN --------------------------1111----- >CYP175A1; SWP:Q746J6; PDB:1N97A; MKRLSLREAWPYLKDLQQDPLAVLLAWGRAHPRLFLPLPRFPLALIFDPEGVEGALLAEG ----3333-----------------------------2222------------------- TTKATFQYRALSRLTGRGLLTDWGESWKEARKALKDPFLPKNVRGYREAMEEEARAFFGE ----3333--3333---3333-------------3333-------------------111 WRGEERDLDHEMLALSLRLLGRALFGKPLSPSLAEHALKALDRIMAQTRSPLALLDLAAE 1------------------------------------------------1111------- ARFRKDRGALYREAEALIVHPPLSHLPRERALSEAVTLLVAGHETVASALTWSFLLLSHR ------------3333-------------------------------------------- PDWQKRVAESEEAALAAFQEALRLYPPAWILTRRLERPLLLGEDRLPPGTTLVLSPYVTQ ----------------------------------------!!!!--2222----3333-- RLHFPDGEAFRPERFLEERGTPSGRYFPFGLGQRLCLGRDFALLEGPIVLRAFFRRFRLD -----1111--3333-------11111111-11111111-------------3333---- PLPFPRVLAQVTLRPEGGLPARPRE --------------2222------- >BETA-LACTAMASE SHV-2; SWP:P30896; PDB:1N9BA; SPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVD -----------------------------------1111-------------------11 AGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGP 11--3333----3333------3333---------------------------------- AGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQR -------1111----------------2222----------------------------- QLLQWMVDDRVAGPLIRSVLPAGWFIADKTGASERGARGIVALLGPNNKAERIVVIYLRD --------------------2222---------------------%%%%----------- TPASMAERNQQIAGIGAALIEHWQR ------------------------- >PROLACTIN; SWP:P01236; PDB:1N9DA; LPICPGGAARCQVTLRDLFDRAVVLSHYIHNLSSEMFSEFDKRYTHGRGFITKAINSCHT 
-------1111--3333-----------------------3333-------------333 SSLATPEDKEQAQQMNQKDFLSLIVSILRSWNEPLYHLVTEVRGMQEAPEAILSKAVEIE 3-------3333------------------------------------------------ EQTKRLLEGMELIVSQVHPETKENEIYPVWSGLPSLQMADEESRLSAYYNLLHCLRRDSH --------------------------------3333------------------------ KIDNYLKLLKCRIIHNNNC -3333-------------- >PUTATIVE BLUE LIGHT RECEP; SWP:Q8LPE0; PDB:1N9LA; GLRHTFVVADATLPDCPLVYASEGFYAMTGYGPDEVLGHNCRFLQGEGTDPKEVQKIRDA ---------1111------------------11112222-3333-1111----------- IKKGEACSVRLLNYRKDGTPFWNLLTVTPIKTPDGRVSKFVGVQVDVTS --------------1111-------------1111-------------- >G protein-activated inwar; SWP:P35562; PDB:1N9PA; RQRFVDKNGRCNVQHERAETLFSEHAVISRDGKLTLFRVGNLRNSHVSAQIRCKLLKSRQ -----1111--------------------iiii--------1111--------------- TPEGEFLPLDQLELDVGFSTGADQLFLVSPLTICHVIDAKSPFYDLSQRSQTEQFEVVVI 1111------------11111111-------------1111-----3333---------- LEGIVETTGTCQARTSYTEDEVLWGHRFFPVISLEEGFFKVDYSQFHATFEVPTPPYSVK ----3333---------1111--------------------3333--------------- EQEELLSSP --------- >SMALL NUCLEAR RIBONUCLEOP; SWP:P54999; PDB:1N9RA; LKGLVNHRVGVKLKFNSTEYRGTLVSTDNYFNLQLNEAEEFVAGVSHGTLGEIFIRCNNV 1111-------------------------------------iiii-----------1111 LYIRELPN -------- >ASPARTYL-TRNA SYNTHETASE ; SWP:NA; PDB:1N9WA; MRVLVRDLKAHVGQEVELLGFLHWRRDLGRIQFLLLRDRSGVVQVVTGGLKLPLPESALR ---33331111--------------------------1111------------------- VRGLVVENAKAPGGLEVQAKEVEVLSPALEPTPYRYVTLRGEKARAPLKVQAALVRGFRR -------1111----------------------33333333-3333-------------- YLDRQDFTEIFTPPQLYKQIMVGVFERVYEVAPVEYLSLDVEMGFIADEEDLMRLEEALL --1111-------------3333------------------------3333--------- AEMLEEALNTAGDEIRLLGATWPSFPQDIPRLTHAEAKRILKEELGYPVGQDLSEEAERL ---------------1111----------------------------------------- LGEYAKERWGSDWLFVTRYPRSVRPFYTYPEEDGTTRSFDLLFRGLEITSGGQRIHRYEE -------------------3333-1111--3333--------iiii-------------- LLESLKAKGMDPEAFHGYLEVFKYGMPPHGGFAIGAERLTQKLLGLPNVRYARAFP 
-----1111-3333-3333-1111------------------------3333---- >DESIGNED PROTEIN CTPR2; SWP:NA; PDB:1NA3A; GNSAEAWYNLGNAYYKQGDYDEAIEYYQKALELDPNNAEAWYNLGNAYYKQGDYDEAIEY --------------1111---------------1111-----------1111-------- YQKALELDPNNAEAKQNLGNAKQKQG -------1111--------------- >RESTRICTION ENDONUCLEASE ; SWP:Q6IVW7; PDB:1NA6A; SVFHNWLLEIACENYFVYIKRLSANDTGATQVGLYIPSGIVEKLFPSINHTRELNPSVFL ----------------------3333----------3333----3333------------ TAHVSSHDCPDSEARAIYYNSAHFGKTRNEKRITRWGRGSPLQDPENTGALTLLAFKLDE --------------------1111---------------33331111-----------33 QGGDCKEVNIWVCASTDEEDVIETAIGEVIPGALISGPAGQILGGLSLQQAPVNHKYILP 33---------------------------2222----1111------------------3 EDWHLRFPSGSEIIQYAASHYDPDEQLLDRRRVEYDIFLLVEELHVLDIIRKGFGSVDEF 333------------1111------------------------------------3333- IALANSVSNRRKSRAGKSLELHLEHLFIEHGLRHFATQAPDFLFPSAGAYHPLRMLAVKT -----------------3333-----3333---------------3333----------- TCKDRWRQILNHLFTLQEGVSLAQYREMRESGVRLVVPSSLHKKYPEAVRAELMTLGAFI -22223333----------------------------33333333----1111------- AELTG ----- >ADP-RIBOSYLATION FACTOR B; SWP:Q9UJY5; PDB:1NA8A; HHHHMELSLASITVPLESIKPSNILPVTVYDQHGFRILFHFARDPLPGRSDVLVVVVSML --------------3333-------------iiii-------------1111-------- STAPQPIRNIVFQSAVPKVMKVKLQPPSGMELPAFNPIVHPSAITQVLLLANPQKEKVRL ----------------3333---------------3333------------1111----- RYKLTFTMGDQTYNEMGDVDQFPPPETWGSL -----------------------3333---- >Envelope glycoprotein [Fr; SWP:O36230; PDB:1NAKH; EVQLQESGPSLVKPSQTLSLTCSVTGVSITSGYWNWIRKFPGNKFEYMGYISKSGSAYYN ------------2222-----------1111--------------------1111----1 PSLKSRISFTRDTSKNQFYLKLNSV 111---------1111--------- >Periplasmic divalent cati; SWP:P36654; PDB:1NAQA; SNTASVVVLCTAPDEATAQDLAAKVLAEKLAACATLIPGATSLYYWEGKLEQEYEVQMIL -------------------------1111------------------------------- KTTVSHQQALLECLKSHHPYQTPELLVLPVTHGDTDYLSWLNASLR --3333-----------1111--------------------3333- >NARBONIN; SWP:Q08884; PDB:1NAR; 
PKPIFREYIGVKPNSTTLHDFPTEIINTETLEFHYILGFAIESYYESGKGTGTFEESWDV -----------1111-------1111------------------1111----------33 ELFGPEKVKNLKRRHPEVKVVISIGGRGVNTPFDPAEENVWVSNAKESLKLIIQKYSDDS 33------------1111---------1111-----1111-----------------111 GNLIDGIDIHYEHIRSDEPFATLMGQLITELKKDDDLNINVVSIAPSENNSSHYQKLYNA 1--------------------------------1111---------3333---------- KKDYINWVDYQFSNQQKPVSTDDAFVEIFKSLEKDYHPHKVLPGFSTDPLDTKHNKITRD 3333------3333----------------------2222-------33333333----- IFIGGCTRLVQTFSLPGVFFWNANDSVIPKRDGDKPFIVELTLQQLLAA ---------1111--------3333-----22222222----------- >HORMONE RECEPTOR ALPHA 1,; SWP:P10827; PDB:1NAVA; HMEEMIRSLQQRPEPTPEEWDLIHIATEAHRSTNAKQRRKFLPDDIGQSPDGDKVDLEAF ------1111-------3333--------1111---------1111-------------- SEFTKIITPAITRVVDFAKKLPMFSELPCEDQIILLKGCCMEIMSLRAAVRYDPESDTLT -----------------------1111--------------------------------- LSGEMAVKREQLKNGGLGVVSDAIFELGKSLSAFNLDDTEVALLQAVLLMSTDRSGLLCV %%%%--------------------------3333--3333-------------------- DKIEKSQEAYLLAFEHYVNHRKHNIPHFWPKLLMKVTDLRMIGACHASRFLHMKVECPTE ------------------3333-----------------------------------333 LFPPLFLEVFEDQ 3------------ >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q7SIA9; PDB:1NB2A; KERTFLMVKPDGVQRNLVGEVVKRFESKGLKLAGAKLMVISKDGAAAHYAELGGGPFFGG -----------------------------------------3333-1111-11113333- LVGGATSGPVFAMVWEGLNAAATARQILGATNPSDAAPGTIRGDFGVSAGRNAIHGSDSA ---1111---------2222-----------1111------1111-------------33 GSAAKEIGAFFGGGEAASGTPAAAADIYG 33-----3333-----------3333--- >CATHEPSIN H; SWP:O46427; PDB:1NB3A; YPPSMDWRKK -----3333- >POLYPROTEIN; SWP:O92972; PDB:1NB4A; SMSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRSASLRQKKVTFDRLQVL -------------------------3333---3333----3333---------------- DDHYRDVLKEMKAKASTVKAKLLSIEEACKLTPPHSAKSKFGYGAKDVRNLSSRAVNHIR --------------------------------1111--1111-----1111--------- SVWEDLLEDTETPIDTTIMAKSEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVS ------------------------------------------------------------ 
TLPQAVMGSSYGFQYSPKQRVEFLVNTWKSKKCPMGFSYDTRCFDSTVTESDIRVEESIY ------!!!!3333----------------------------3333-------------1 QCCDLAPEARQAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAT 111----------------3333----1111------------1111------------- AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPPGDPPQPEYDL ---------------!!!!------------------------1111-----------33 ELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPINSWLGNIIMYAPTLW 33--%%%%----------------------------------------------111133 ARMILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIERLHGLSAFTLHSYSPG 33------------------------iiii----1111---------3333--------- EINRVASCLRKLGVPPLRTWRHRARSVRAKLLSQGGRAATCGRYLFNWAVRTKLKLTPIP ---------------3333--------------------------3333----------- AASQLDLSGWFVAGYSGGDIYHSLSR --------------2222-------- >Cystatin-A; SWP:P01040; PDB:1NB5I; MIPGGLSEAKPATPEIQEIVDKVKPQLEEKTNETYGKLEAVQYKTQVVAGTNYYIKVRAG ------------3333-------------------------------------------- DNKYMHLKVFKSLPGQNEDLVLTGYQVDKNKDDELTGF -----------------------------3333----- >UBIQUITIN CARBOXYL-TERMIN; SWP:Q93009; PDB:1NB8A; KKHTGYVGLKNQGATCYNSLLQTLFFTNQLRKAVYPTEGDDSSKSVPLALQRVFYELQHS ------------------------------------11111111---------------- DKPVGTKKLTKSFGWETLDSFQHDVQELCRVLLDNVENKKGTCVEGTIPKLFRGKVSYIQ -----3333-------1111---3333--------------1111--------------- CKEVDYRSDRREDYYDIQLSIKGKKNIFESFVDYVAVEQLDGDNKYDAGEHGLQEAEKGV --------------------2222-----------------------!!!!--------- KFLTLPPVLHLQLRFIKINDRFEFPEQLPLDEFLQKTDPKDPANYILHAVLVHSGDNHGG -----------------------------1111--------------------------- HYVVYLNPKGDGKWCKFDDDVVSRCTKEEAIEHNYGGHHCTNAYLVYIRESKLSEVLQAV ------1111-------!!!!----3333-3333--------------1111-3333--- TDHDIPQQLVERLQEEKRIEAQK 3333-3333-------------- >HYPOTHETICAL PROTEIN FLJ1; SWP:Q969G6; PDB:1NB9A; RHLPYFCRGQVVRGFGRGSKQLGIPTANFPEQVVDNLPADISTGIYYGWASVGSGDVHKM -----------------3333------------11111111----------!!!!----- VVSIGWNPYYKNTKKSMETHIMHTFKEDFYGEILNVAIVGYLRPEKNFDSLESLISAIQG 
----------------------------2222---------------------------- DIEEAKKRLELPEYLKIKEDNFFQVSK ----------3333--------3333- >N-CARBAMOYLSARCOSINE AMID; SWP:P32400; PDB:1NBAA; TFNDIEARLAAVLEEAFEAGTSIYNERGFKRRIGYGNRPAVIHIDLANAWTQPGHPFSCP ----------------------------------------------3333----1111-- GMETIIPNVQRINEAARAKGVPVFYTTNVYRNRDASSGTNDMGLWYSKIPTETLPADSYW 3333------------1111-------------1111----!!!!----3333----333 AQIDDRIAPADGEVVIEKNRASAFPGTNLELFLTSNRIDTLIVTGATAAGCVRHTVEDAI 3--3333--2222----------22223333--1111---------1111---------- AKGFRPIIPRETIGDRVPGVVQWNLYDIDNKFGDVESTDSVVQYLDALPQFEDTVPKTLS --------1111--------------------------------1111-3333------- DPQPEVEAPADPV ------------- >CELLULOSOMAL SCAFFOLDING ; SWP:Q06851; PDB:1NBCA; NLKVEFYNSNPSDTTNSINPQFKVTNTGSSAIDLSKLTLRYYYTVDGQKDQTFWCDHAAI --------------------------------1111------------------------ IGSNGSYNGITSNVKGTFVKMSSSTNNADTYLEISFTGGTLEPGAHVQIQGRFAKNDWSN -1111----1111----------------------------2222---------1111-- YTQSNDYSFKSASQFVEWDQVTAYLNGVLVWGKEP -----1111---------------iiii------- >HELLETHIONIN D; SWP:P60057; PDB:1NBLA; KSCCRNTLARNCYNACRFTGGSQPTCGILCDCIHVTTTTCPSSHPS -----3333------------------1111---------3333-- >JUNCTIONAL ADHESION MOLEC; SWP:Q9Y624; PDB:1NBQA; AMGSVTVHSSEPEVRIPENNPVKLSCAYSGFSSPRVEWKFDQGDTTRLVCYNNKITASYE ----------------2222------------------------------%%%%-3333- DRVTFLPTGITFKSVTREDTGTYTCMVSEEGGNSYGEVKVKLIVLVPPSKPTVNIPSSAT -----1111--------------------------------------------------- IGNRAVLTCSEQDGSPPSEYTWFKDGIVMPTNPKSTRAFSNSSYVLNPTTGELVFDPLSA -----------------------iiii-------------------------------33 SDTGEYSCEARNGYGTPMTSNAVRMEAVE 33--------------------------- >PROBABLE DIHYDRONEOPTERIN; SWP:O06275; PDB:1NBUA; ADRIELRGLTVHGRHGVYDHERVAGQRFVIDVTVWIDLAEAANSDDLADTYDYVRLASRA -----------------3333------------------------3333----------- AEIVAGPPRKLIETVGAEIADHVMDDQRVHAVEVAVHKPQAPIPQTFDDVAVVIRRSR ----------3333-----------3333--------1111----------------- >IGG2B-KAPPA BV04-01 FAB (; SWP:NA; PDB:1NBVH; 
EVQPVETGGGLVQPKGSLKLSCAASGFSFNTNAMNWVRQAPGKGLEWVARIRSKSNNYAT ------------2222-----------1111---------------------1111---- YYADSVKDRFTISRDDSQNMLYLQMNNLKTEDTAMYYCVRDQTGTAWFAYWGQGTLVTVS --3333------------------------------------------------------ AAKTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQS ------------------------------------------------------------ GLYTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLE --------------------------------------- >GLYCEROL DEHYDRATASE REAC; SWP:Q59474; PDB:1NBWA; PLIAGIDIGNATTEVALASDYPQARAFVASGIVATTGMKGTRDNIAGTLAALEQALAKTP -------------------------------------2222--------------1111- WSMSDVSRIYLNEAAPVIGDVAMETITETIITESTMIGHNPQTPGGVGVGVGTTIALGRL -1111--------------------------%%%%--------------------11111 ATLPAAQYAEGWIVLIDDAVDFLDAVWWLNEALDRGINVVAAILKKDDGVLVNNRLRKTL 1113333---------------------------------------------1111---- PVVDEVTLLEQVPEGVMAAVEVAAPGQVVRILSNPYGIATFFGLSPEETQAIVPIARALI -------3333------------2222---1111----------------------1111 GNRSAVVLKTPQGDVQSRVIPAGNLYISGEKRRGEADVAEGAEAIMQAMSACAPVRDIRG ------------------------------------1111---------1111------- EPGTHAGGMLERVRKVMASLTGHEMSAIYIQDLLAVDTFIPRKVQGGMAGECAMENAVGM ------------------1111-3333----------------2222------------- AAMVKADRLQMQVIARELSARLQTEVVVGGVEANMAIAGALTTPGCAAPLAILDLGAGST ------------------------------3333--------2222-------------- DAAIVNAEGQITAVHLAGAGNMVSLLIKTELGLEDLSLAEAIKKYPLAKVESLFSIRHEN -----1111-------------------1111-------------------------111 GAVEFFREALSPAVFAKVVYIKEGELVPIDNASPLEKIRLVRRQAKEKVFVTNCLRALRQ 1---------3333-------------------3333----------------------- VSPGGSIRDIAFVVLVGGSSLDFEIPQLITEALSHYGVVAGQGNIRGTEGPRNAVATGLL ------3333---------------------3333--------2222------------- LAGQAN ------ >Putative uncharacterized ; SWP:Q48423; PDB:1NBWB; PPGVRLFYDPRGHHAGAINELCWGLEEQGVPCQTITYDGGGDAAALGALAARSSPLRVGI --------3333-------------1111------------------------1111--- GLSASGEIALTHAQLPADAPLATGHVTDSDDQLRTLGANAGQLVKVLPLSERN 
--3333-----11111111-----1111------------------------- >MONOCLONAL ANTIBODY 2D12.; SWP:NA; PDB:1NC2B; VKLQESGPGLVQPSQSLSITCTVSGFSLTDYGVHWVRQSPGKGLEWLGVIWSGGGTAYTA -----------2222-----------3333--------------------1111----33 AFISRLNIYKDNSKNQVFFEMNSLQANDTAMYYCARRGSYPYNYFDVWGQGTTVTVSSAK 33--------3333----------1111-------------------------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%--1111------------ TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR --------1111------------1111---------- >HYPOTHETICAL PROTEIN YTER; SWP:O34559; PDB:1NC5A; KSPLTYAEALANTIMNTYTVEELPPANRWHYHQGVFLCGVLRLWEATGEKRYFEYAKAYA -3333-------------3333-------------------------------------- DLLIDDNGNLLFRRDELDAIQAGLILFPLYEQTKDERYVKAAKRLRSLYGTLNRTSEGGF ----1111----11113333--------------------------3333----1111-- WHKDGYPYQMWLDGLYMGGPFALKYANLKQETELFDQVVLQESLMRKHTKDAKTGLFYHA --1111----3333----------------3333-------------------------- WDEAKKMPWANEETGCSPEFWARSIGWYVMSLADMIEELPKKHPNRHVWKNTLQDMIKSI -1111-1111------------------------3333-2222----------------- CRYQDKETGLWYQIVDKGDRSDNWLESSGSCLYMYAIAKGINKGYLDRAYETTLLKAYQG 1111---------1111--1111------------------------------------- LIQHKTETSEDGAFLVKDICVGTSAGFYDYYVSRERSTNDLHGAGAFILAMTELEPLFRS --------1111--------------33331111-------------------------2 AGK 222 >HYPOTHETICAL PROTEIN TM10; SWP:Q9X0F9; PDB:1NC7A; HNGARKWFFPDGYIPNGKRGYLVSHESLCINTGDETAKIRITFLFEDSKPVVHEVEISPK ------------------!!!!-------------------------------------- SLHLRLDKLGIPKCKPYSIAESNVPVVQLSRLDVGKNHYTLTTIGYWEEG ----1111------------------------------------------ >NUCLEOCAPSID PROTEIN; SWP:P18042; PDB:1NC8; AQQRKVIRCWNCGKEGHSARQCRAPRRQG -----------------3333-------- >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1NCBH; QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMNWVKQAPGKGLEWMGWINTNTGEPTY ---------------------------3333----------------------------- GEEFKGRFAFSLETSASTANLQINNL ------------3333---------- >N-CADHERIN; SWP:P15116; PDB:1NCIA; 
WVIPPINLPENSRGPFPQELVRIRSGRDKNLSLRYSVTGPGADQPPTGIFIINPISGQLS --------------------------1111------------------------------ VTKPLDRELIARFHLRAHAVDINGNQVENPIDIVINVID --------------------1111--------------- >T LYMPHOCYTE ACTIVATION A; SWP:P42081; PDB:1NCNA; MLKIQAYFNETADLPCQFANSQNQSLSELVVFWQDQENLVLNEVYLGKEKFDSVHSKYMG ------2222---------3333-3333------1111------iiii------3333-- RTSFDSDSWTLRLHNLQIKDKGLYQCIIHHKKPTGMIRIHQMNSELSVLA -----1111-------3333------------------------------ >COAT PROTEIN VP1; SWP:P03303; PDB:1NCQA; TVASISSGPKHTQKVPILTANETGATMPVLPSDSIETRTTYMHFNGSETDVECFLGRAAC --------------1111-----------1111------------33333333------- VHVTEIQNKDATGIDNHREAKLFNDWKINLSSLVQLRKKLELFTYVRFDSEYTILATASQ ----------2222-3333----------------------------------------- PDSANYSSNLVVQAMYVPPGAPNPKEWDDYTWQSASNPSVFFKVGDTSRFSVPYVGLASA -----------------2222-------3333----------2222-------------- YNCFYDGYSHDDAETQYGITVLNHMGSMAFRIVNEHDEHKTLVKIRVYHRAKHVEAWIPR -----------1111--3333--------------------------------------- APRALPYTSIGRTNYPKNTEPVIKKRKGDIKSY ----------------------------1111- >Genome polyprotein; SWP:P03303; PDB:1NCQB; GYSDRVQQITLGNSTITTQEAANAVVCYAEWPEYLPDVDASDVNKTSKPDTSVCRFYTLD --3333----!!!!------------%%%%-----3333---------!!!!-------- SKTWTTGSKGWCWKLPDALKDMGVFGQNMFFHSLGRSGYTVHVQCNATKFHSGCLLVVVI ----1111-------3333----------------------------------------- PEHQLASHEGGNVSVKYTFTHPGERGIDLSSANEVGGPVKDVLYNMNGTLLGNLLIFPHQ ------1111-----3333---3333-1111--2222---3333-----33331111--- FINLRTNNTATIVIPYINSVPIDSMTRHNNVSLMVIPIAPLTVPTGATPSLPITVTIAPM --3333-------------------------------------2222------------- CTEFSGIRSKSIVPQ --------------- >Genome polyprotein; SWP:P03303; PDB:1NCQC; GLPTTTLPGSGQFLTTDDRQSPSALPNYEPTPRIHIPGKVHNLLEIIQVDTLIPMNNTHT ------2222---1111-------2222-------------33331111----------- KDEVNSYLIPLNANRQNEQVFGTNLFIGDGVFKTTLLGEIVQYYTHWSGSLRFSLMYTGP --3333-------------------11113333-------1111---------------1 ALSSAKLILAYTPPGARGPQDRREAMLGTHVVWDIGLQSTIVMTIPWTSGVQFRYTDPDT 
111-------------------------------------------------------33 YTSAGFLSCWYQTSLILPPETTGQVYLLSFISACPDFKLRLMKDTQTISQTVALTE 33-------------------------------3333------------------- >Genome polyprotein; SWP:P03303; PDB:1NCQD; INYYKDAASTSSAGQSLSMDPSKFTEPVKDLMLKGAPALN -----3333-----------3333--------2222---- ------------------------------------------------------------ ------------------------------- >IMMUNOGLOBULIN IGG2A; SWP:NA; PDB:1NCWH; RVQLQQSGPGLVKPSQSLSLTCTVTGYSITSDFAWNWIRQFPGNKLEWMGYINYSGFTSH ------------2222-----------1111---------------------1111---- NPSLKSRISITRDTSKNQFFLQLNSVTTEDTATYYCAGLLWYDGGAGSWGQGTLVTVSAA 3333--------3333----------1111------------------------------ KTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDL ------------3333------------------------iiii---------------- YTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRGPT ------------------------3333-------------- >BAP1; SWP:P83512; PDB:1ND1A; RFSPRYIELAVVADHGIFTKYNSNLNTIRTRVHEMLNTVNGFYRSVDVHAPLANLEVWSK --------------------%%%%------------------3333-------------- QDLIKVQKDSSKTLKSFGEWRERDLLPRISHDHAQLLTAVVFDGNTIGRAYTGGMCDPRH ------------------------3333--------------%%%%-------2222--- SVGVVRDHSKNNLWVAVTMAHELGHNLGIDHDTGSCSCGAKSCIMASVLSKVLSYEFSDC ------------------------------------------1111-------------- SQNQYETYLTNHNPQCILNKP -------------3333---- >AMINOGLYCOSIDE 3'-PHOSPHO; SWP:P00552; PDB:1ND4A; GSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSW --3333-1111--------------------2222----------1111----------3 LATTGVPCAAVLDVVTEAGRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPA 333-------------%%%%-----------3333---3333-----------1111333 TCPFDHQAKHRIERARTRMEAGLVDQDDLDEEHQGLAPAELFARLKARMPDGEDLVVTHG 3-----------------------1111-1111---3333-----1111----------- DACLPNIMVENGRFSGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRFLVLYGIAAP --3333---%%%%------1111---3333------------------------------ DSQRIAFYRLLDEFF -----------1111 >PROSTATIC ACID PHOSPHATAS; SWP:P15309; PDB:1ND6A; 
KELKFVTLVFRHGDRSPIDTFPTDPIKESSWPQGFGQLTQLGMEQHYELGEYIRKRYRKF --------------------1111--33331111-------------------------- LNESYKHEQVYIRSTDVDRTLMSAMTNLAALFPPEGVSIWNPILLWQPIPVHTVPLSEDQ -----3333-------3333---------------1111-1111-----------1111- LLYLPFRNCPRFQELESETLKSEEFQKRLHPYKDFIATLGKLSGLHGQDLFGIWSKVYDP ------------------1111------3333-----3333------------------- LYCESVHNFTLPSWATEDTMTKLRELSELSLLSLYGIHKQKEKSRLQGGVLVNEILNHMK ----1111---1111-------------------------------3333---------- RATQIPSYKKLIMYSAHDTTVSGLQMALDVYNGLLPPYASCHLTELYFEKGEYFVEMYYR ----------------3333----------------2222--------iiii-------- NETQHEPYPLMLPGCSPSCPLERFAELVGPVIPQDWSTECMT -3333------2222------------3333---3333---- >WW DOMAIN-CONTAINING PROT; SWP:Q9H0M0; PDB:1ND7A; HMGFRWKLAHFRYLCQSNALPSHVKINVSRQTLFEDSFQQIMALKPYDLRRRLYVIFRGE ---------------1111---------------------11111111------------ EGLDYGGLAREWFFLLSHEVLNPMYCLFEYAGKNNYCLQINPASTINPDHLSYFCFIGRF ------------------11111111--------------1111--1111---------- IAMALFHGKFIDTGFSLPFYKRMLSKKLTIKDLESIDTEFYNSLIWIRDNNIEECGLEMY ----1111-------------1111------3333------------------------- FSVDMEILGKVTSHDLKLGGSNILVTEENKDEYIGLMTEWRFSRGVQEQTKAFLDGFNEV ----------------2222----------------------2222-------------- VPLQWLQYFDEKELEVMLCGMQEVDLADWQRNTVYRHYTRNSKQIIWFWQFVKETDNEVR -33331111--------------------1111-----1111------------------ MRLLQFVTGTCRLPLGGFAELMGSNGPQKFCIEKVGKDTWLPRSHTCFNRLDLPPYKSYE -------------22221111---------------1111----3333------------ QLKEKLLFAIEETE -------------- >TRANSLATION INITIATION FA; SWP:P02995; PDB:1ND9A; TDVTIKTLAAERQTSVERLVQQFADAGIRKSADDSVSAQEKQTLIDHLN ---------------------3333-----------2222--------- >CARNITINE ACETYLTRANSFERA; SWP:P47934; PDB:1NDBA; AHQDALPRLPVPPLQQSLDYYLKALQPIVSEEEWAHTKQLVDEFQTSGGVGERLQKGLER 3333--------------------1111-----------------2222----------- RAKKENWLSEWWLKTAYLQFRQPVVIYSSPGVILPKQDFVDLQGQLRFAAKLIEGVLDFK 3333------------1111---------------------------------------- 
SIDNETLPVEFLGGQPLCNQYYQILSSCRVPGPKQDSVVNFLKSKRPPTHITVVHNYQFF -1111-------------3333------------------1111----------iiii-- ELDVYHSDGTPLTSDQIFVQLEKIWNSSLQSNKEPVGILTSNHRNTWAKAYNNLIKDKVN -----1111---------------1111------33331111----------1111---- RESVNSIQKSIFTVCLDKQVPRVSDDVYRNHVAGQLHGGGSKFNSGNRWFDKTLQFIVAE -----------------------3333------------11111111-3333------11 DGSCGVYEHAAAEGPPIVALVDHVEYTKKPELVRSPVPLPPKKLRFNITPEIKNDIEKAK 11-----3333-------------1111-------------------------------- QNLSIIQDLDILTFHHFGKDFPKSEKLSPDAFIQVALQLAYYRIYGQACATYESASLRFH -----1111-------------1111--------------------------------22 LGRTDTIRSASIDSLAFVKGGDSTVPEQQKVELLRKAVQAHRAYTDRAIRGEAFDRHLLG 22---------------------------------------------1111--------- LKLQAIEDLVSPDIFDTSYAIAHFNLSTSQVPAKTDCVFFGPVVPDGYGICYNPEAHINF -----------3333--------------------------------------------- SVSAYNSCAETNAARAHYLEKALLDRTLLQNHPRAK ----3333---3333-------------1111---- >UBIQUITIN-LIKE PROTEIN NE; SWP:Q15843; PDB:1NDDA; MLIKVKTLTGKEIEIDIEPTDKVERIKERVEEKEGIPPQQQRLIYSGKQMNDEKTAADYK ------1111-------11113333-----------3333----iiii--33333333-- ILGGSVLHLVLALR -2222--------- >Lysozyme C [Precursor]; SWP:P00698; PDB:1NDGB; EVQLQESGPSLVKPSQTLSLTCSVTGDSIIRDYWSWIRKFPGNKLEYMGYISFSGNTFYH ------------2222-----------1111--------1111--------1111----3 PSLKSRISITRDTSKNQHYLQLSSVTTEDTATYYCANWDGTYWGEGTLVTVSAAKTTAPS 333----------------------1111-------3333-------------------- VYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSS ----------------------------------iiii---------------------- VTVTSSTWPSQSITCNVAHPASSTAVDKKI ------------------3333-------- >CYTOCHROME B5 REDUCTASE; SWP:P83686; PDB:1NDH; PAITLENPDIKYPLRLIDKEVVNHDTRRFRFALPSPEHILGLPVGQHIYLSARIDGNLVI ------1111------------------------1111---------------iiii--- RPYTPVSSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMKIGDTIEFRGPNGLLVY -------3333----------------------------11112222------------- QGKGKFAIRPDKKSSPVIKTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFANQ 
-iiii-----------------------!!!!-------------1111----------- TEKDILLRPELEELRNEHSARFKLWYTVDRAPEAWDYSQGFVNEEMIRDHLPPPEEEPLV 3333----------11111111------------------------------3333---- LMCGPPPMIQYACLPNLERVGHPKERCFAF -----------------1111-3333---- >HYPOTHETICAL PROTEIN TA13; SWP:Q9HIL9_THEAC; PDB:1NE2A; GIKNDLEIRLQKLQQQGYPTDASTAAYFLIEIYNDGNIGGRSVIDAGTGNGILACGSYLL -------------------------------------2222------!!!!--------- GAESVTAFDIDPDAIETAKRNCGGVNFVADVSEISGKYDTWINPPFDRAFIDKAFETSWI ---------------------1111----3333-------------3333---------- YSIGNAKARDFLRREFSARGDVFREEKVYITVPRIYRARIEAVIFGVRNHSF ----3333-------------------------------------------- ------------------------------------------------------------ -------- >ERGTOXIN; SWP:Q86QT3; PDB:1NE5A; DRDSCVDKSRCAKYGYYQECQDCCKNAGHNGGTCMFFKCKCA ---3333---------3333----3333-------------- >GLUCOSAMINE-6-PHOSPHATE I; SWP:P46926; PDB:1NE7A; MKLIILEHYSQASEWAAKYIRNRIIQFNPGPEKYFTLGLPTGSTPLGCYKKLIEYYKNGD ------------------------3333-1111--------3333--------------- LSFKYVKTFNMDEYVGLPRDHPESYHSFMWNNFFKHIDIHPENTHILDGNAVDLQAECDA -----------------1111-----------3333---3333----------------- FEEKIKAAGGIELFVGGIGPDGHIAFNEPGSSLVSRTRVKTLAMDTILANARFFDGELTK -----1111---------1111-----2222------------------3333%%%%111 VPTMALTVGVGTVMDAREVMILITGAHKAFALYKAIEEGVNHMWTVSAFQQHPRTVFVCD 1-----------------------1111------------1111----1111-------3 EDATLELKVKTVKYFKGLMLVHNKLVDPLYSIKEKETEKSQ 33311113333--------3333------------------ >CONSERVED HYPOTHETICAL PR; SWP:P96622; PDB:1NE8A; LIVKRGDVYFADLSPVVGSEQGGVRPVLVIQNDIGNRFSPTAIVAAITAQIQKAKLPTHV ---2222---------!!!!-----------------------------------1111- EIDAKRYGFERDSVILLEQIRTIDKQRLTDKITHLDDEMMDKVDEALQISLALIDF --3333---------1111----3333----------------------------- >FEMX; SWP:Q9EY50; PDB:1NE9A; PVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVKNNWEPVDVYLEDDQGAIIAAMSML ---1111-------------11111111--3333-1111--------1111--------- LGDTPTDKKFAYASKGPVMDVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTT 
------------2222---1111--------------iiii------------------- LQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAKTTLDLYPSKTKSKIKRPFRDG -1111----1111---1111----------33331111-3333----------------- VEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKLL -----------------------1111----3333--------1111-------iiii-- STGIALKYGRKIWYMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDS -------!!!!------------%%%%------------1111------------1111- LYVFKHVFVKDAPREYIGEIDKVLDPEVYAELVKD -----1111-------------------------- >Probable translation init; SWP:O27797; PDB:1NEEA; MDDYEKLLERAIDQLPPEVFETKRFEVPKAYSVIQGNRTFIQNFREVADALNRDPQHLLK ------------------------------------------3333-------3333--- FLLRELGTAGNLEGGRAILQGKFTHFLINERIEDYVNKFVICHECNRPDTRIIREGRISL ------------------------------------------------------------ LKCEACGAKAPLKNV --------------- >HYPOTHETICAL PROTEIN YOAG; SWP:P76247; PDB:1NEIA; MGKATYTVTVTNNSNGVSVDYETETPMTLLVPEVAAEVIKDLVNTVRSYDTENEHDVCGW -------------------------3333--3333------------------------- >SUCCINATE DEHYDROGENASE F; SWP:P10444; PDB:1NEKA; MKLPVREFDAVVIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNT --------------------------1111---------11113333------------- HEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG ---3333------1111----------------------1111----------------- GQSKNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTTIFSEWYALDLVKNQDGAVVGC ---------------------------------------------------1111----- TALCIETGEVVYFKARATVLATGGAGRIYQSTTNAHINTGDGVGMAIRAGVPVQDMEMWQ ----------------------------------3333---------------------- FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREG -----2222--------1111----1111-3333------1111----------1111-- RGCDGPWGPHAKLKLDHLGKEVLESRLPGILELSRTFAHVDPVKEPIPVIPTCHYMMGGI ----3333------3333----------------------3333---------------- PTKVTGQALTVNEKGEDVVVPGLFAVGEIACVSVHGANRLGGNSLLDLVVFGRAAGLHLQ --1111-----3333----2222---1111---------2222----------------- ESIAEQGALRDASESDVEASLDRLNRWNNNRNGEDPVAIRKALQECMQHNFSVFREGDAM 
-------------------------------------------------------3333- AKGLEQLKVIRERLKNARLDDTSSEFNTQRVECLELDNLMETAYATAVSANFRTESRGAH -------------1111------------------------------------------- SRFDFPDRDDENWLCHSLYLPESESMTRRSVNMEPKLRPAFPPKIRTY -3333---3333---------!!!!----------------------- >Succinate dehydrogenase i; SWP:P07014; PDB:1NEKB; MRLEFSIYRYNPDVDDAPRMQDYTLEADEGRDMMLLDALIQLKEKDPSLSFRRSCREGVC ----------1111--------------------------3333-3333----------- GSDGLNMNGKNGLACITPISALNQPGKKIVIRPLPGLPVIRDLVVDMGQFYAQYEKIKPY ------iiii--1111-3333-------------------!!!!----------1111-- LLNNGQNPPAREHLQMPEQREKLDGLYECILCACCSTSCPSFWWNPDKFIGPAGLLAAYR ---------------33333333----------3333-3333--1111------------ FLIDSRDTETDSRLDGLSDAFSVFRCHSIMNCVSVCPKGLNPTRAIGHIKSMLLQRNA 1111----------1111------------3333-1111------------------- >EPIDIDYMAL SECRETORY PROT; SWP:P79345; PDB:1NEPA; EPVKFKDCGSWVGVIKEVNVSPCPTQPCKLHRGQSYSVNVTFTSNTQSQSSKAVVHGIVM ------------------------------2222------------------------ii GIPVPFPIPESDGCKSGIRCPIEKDKTYNYVNKLPVKNEYPSIKVVVEWELTDDKNQRFF ii---------1111-------2222----------3333------------1111---- CWQIPIEVEA ---------- >DNA-BINDING PROTEIN NER; SWP:P06020; PDB:1NEQ; CSNEKARDWHRADVIAGLKKRKLSLSALSRQFGYAPTTLANALERHWPKGEQIIANALET -----------------------3333----------------------------1111- KPEVIWPSRYQAGE 3333-1111----- >MYELIN P0 PROTEIN; SWP:P06907; PDB:1NEU; IVVYTDREVYGAVGSQVTLHCSFWSSEWVSDDISFTWRYQPEGGRDAISIFHYAKGQPYI -----------2222-------------------------2222---------iiii--- DEVGTFKERIQWVGDPSWKDGSIVIHNLDYSDNGTFTCDVKNVGKTSQVTLYVFE ---1111-------3333----------1111----------------------- >CDC4 PROTEIN; SWP:P52286; PDB:1NEXA; SNVVLVSGEGERFTVDKKIAERSLLLKNYLIVPVPNVRSSVLQKVIEWAEHHRDSNFPVD ------1111-----3333------------------3333----------1111----- SWDREFLKVDQELYEIILAANYLNIKPLLDAGCKVVAEIRGRSPEEIRRTFNIVNDFTPE ----1111------------------------------2222------------------ EEAAIR ------ >Cell division control pro; SWP:P07834; PDB:1NEXB; 
LKRDLITSLPFEISLKIFNYLQFEDIINSLGVSQNWNKIIRKSTSLWKKLLISENFVSPK ---3333--3333----11113333---1111-------1111--------------333 GFNSLNLKLSQKYPKLSQQDRLRLSFLENIFILKNWYNPKFVPQRTTLRGHTSVITCLQF 3-----------1111---------------------1111------------------- EDNYVITGADDKIRVYDSINKKFLLQLSGHDGGVWALKYAHGGILVSGSTDRTVRVWDIK %%%%------------------------------------------------------33 KGCCTHVFEGHNSTVRCLDIVEYKNIKYIVTGSRDNTLHVWKLPKDYPLVFHTPEENPYF 33--------------------------------------------------33333333 VGVLRGHASVRTVSGHGNIVVSGSYDNTLIVWDVAQKCLYILSGHTDRIYSTIYDHERKR ---------------!!!!-------------3333-------------------1111- CISASDTTIRIWDLENGELYTLQGHTALVGLLRLSDKFLVSAAADGSIRGWDANDYSRKF ----iiii--------------------------1111----1111-------------- SYHHTNLSAITTFYVSDNILVSGSENQFNIYNLRSGKLVHANILKDADQIWSVNFKGKTL ---1111--------1111----2222--------------1111----------!!!!- VAAVEKDGQSFLEILDFS ------------------ >TRIOSEPHOSPHATE ISOMERASE; SWP:P00942; PDB:1NEYA; ARTFFVGGNFKLNGSKQSIKEIVERLNTASIPENVEVVICPPATYLDYSVSLVKKPQVTV -------------------------1111------------3333----1111-1111-- GAQNAYLKASGAFTGENSVDQIKDVGAKYVILGHSERRSYFHEDDKFIADKTKFALGQGV ------------2222------1111-------3333-1111-------------1111- GVILCIGETLEEKKAGKTLDVVERQLNAVLEEVKDFTNVVVAYEPVAIGTGLAATPEDAQ ------------1111-------------------------------------------- DIHASIRKFLASKLGDKAASELRILYGGSANGSNAVTFKDKADVDGFLVGGASLKPEFVD ------------------1111--------3333-1111-1111-----3333------- IINSRN 1111-- >BETA-2-MICROGLOBULIN; SWP:P14433; PDB:1NEZA; GSHSLRYFYTALSRPAISEPWYIAVGYLDDTQFARFDSAGETGTYKLSAPWVEQEGPEYW ---------------------------!!!!-----------------11113333---- ARETEIVTSNAQFFRENLQTMLDYYNLSQNGSHTIQVMYGCEVEFFGSLFRAYEQHGYDG ----------------------1111-1111-------------2222---------iii QDYIALNEDLKTWTAADMAAEITRSKWEQAGYTELRRTYLEGPCKDSLLRYLENRKKTQE i-----1111------3333---------------------------------3333--- CTDPPKTHVTHHARPEGDVTLRCWALGFYPAHITLTWQLNGEELIQDTELVETRPAGDGT -------------1111---------------------iiii-1111------------- 
FQKWAAVVVPSGEEQKYTCHVYHEGLPEPLTLRW ---------22221111-----1111-------- >NEUROFIBROMIN; SWP:P21359; PDB:1NF1A; ERLVELVTMMGDQGELPIAMALANVVPCSQWDELARVLVTLFDSRHLLYQLLWNMFSKEV ---------1111-------------3333------------11111111---------- ELADSMQTLFRGNSLASKIMTFCFKVYGATYLQKLLDPLSLEENQRNLLQMTEKFFHAII -------2222-------------3333-1111-------------1111---------- SSSSEFPPQLRSVCHCLYQVVSQRFPQNSIGAVGSAMFLRFINPAIVSPYEAPIIERGLK -3333--------------3333-----3333----3333-------------------- LMSKILQSIANHVLFTKEEHMRPFNDFVKSNFDAARRFFLDIASALHRLLWNNQEKIGQY --------------------3333--1111------------------------------ LSSRPFDKMATLLAYLGPPE ------------1111---- >PHOSPHATASE; SWP:Q9WZB9; PDB:1NF2A; MYRVFVFDLDGTLLNDNLEISEKDRRNIEKLSRKCYVVFASGRMLVSTLNVEKKYFKRTF --------2222--1111------------------------------------------ PTIAYNGAIVYLPEEGVILNEKIPPEVAKDIIEYIKPLNVHWQAYIDDVLYSEKDNEEIK ---%%%%---------------------------3333-------%%%%----------- SYARHSNVDYRVEPNLSELVSKMGTTKLLLIDTPERLDELKEILSERFKDVVKVFKSFPT ---1111-------3333----------------3333---------3333------111 YLEIVPKNVDKGKALRFLRERMNWKKEEIVVFGDNENDLFMFEEAGLRVAMENAIEKVKE 1-----------------------3333------3333-3333-------1111----11 ASDIVTLTNNDSGVSYVLERISTDCLD 11-----1111-----3333------- >Partitioning defective 6 ; SWP:Q9JK83; PDB:1NF3C; IVISMPQDFRPVSSIIDVDILPETHRRVRLCKYGTEKPLGFYIRDGSSVRVTPHGLEKVP ----------------3333-1111--------3333--------------1111----- GIFISRLVPGGLAQSTGLLAVNDEVLEVNGIEVSGKSLDQVTDMMIANSRNLIITVRPAN -------2222--3333--2222----iiii-1111-----------1111--------- QRN --- >PHENAZINE BIOSYNTHESIS PR; SWP:Q7DC80; PDB:1NF9A; MSGIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRAGLVANAA -------------3333----------3333--------33333333------------- RLRRWCVEQGVQIAYTAQPGSMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLL ------1111------------3333!!!!----------3333---3333--1111--- TKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLISTVDAYSNDIQPFLVADAIADF -----1111--------1111---------1111---------1111-----1111---- SEAHHRMALEYAASRCAMVVTTDEVLE --------------------------- 
>N15 ALPHA-BETA T-CELL REC; SWP:NA; PDB:1NFDA; DSVTQTEGLVTVTEGLPVKLNCTYQTTYLTIAFFWYVQYLNEAPQVLLKSSTDNKRTEHQ ------------------------------------------------------------ GFHATLHKSSSSFHLQKSSAQLSDSALYYCALSEGGNYKYVFGAGTRLKVIAHIQNPEPA --------------------3333------------------------------------ VYQLKDPRSQDSTLCLFTDFDSQINVPKTMESGTFITDKTVLDMKAMDSKSNGAIAWSNQ ------------------------------------------------------------ TSFTCQDIFKETNATYPSSDVPC ---3333---------------- >N15 ALPHA-BETA T-CELL REC; SWP:NA; PDB:1NFDB; DSGVVQSPRHIIKEKGGRSVLTCIPISGHSNVVWYQQTLGKELKFLIQHYEKVERDKGFL ------------------------------------------------------------ PSRFSVQQFDDYHSEMNMSALELEDSAMYFCASSLRWGDEQYFGPGTRLTVLEDLRNVTP --------3333---------3333----------------------------3333--- PKVSLFEPSKAEIANKQKATLVCLARGFFPDHVELSWWVNGKEVHSGVSTDPQAYKESNY --------------------------------------iiii--2222------------ SYSLSSRLRVSATFWHNPRNHFRCQVQFHGLSEEDKWPEGSPKPVTQNISAEAWGRADC ----------3333--1111--------------------------------------- >N15 ALPHA-BETA T-CELL REC; SWP:NA; PDB:1NFDE; YELIQPSSASVTVGETVKITCSGDQLPKNFAYWFQQKSDKNILLLIYMDNKRPSGIPERF ----------------------1111----------1111------------22221111 SGSTSGTTATLTISGAQPEDEAAYYCLSSYGDNND ----!!!!--------1111--------------- >N15 ALPHA-BETA T-CELL REC; SWP:NA; PDB:1NFDF; EVYLVESGGDLVQPGSSLKVSCAASGFTFSDFWMYWVRQAPGKGLEWVGRIKNIP ------------2222-----------3333------------------------ >PUTATIVE OXIDOREDUCTASE R; SWP:Q10855; PDB:1NFFA; SGRLTGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAMAAELADAARYVHLD ---2222-------------------1111-------------------!!!!------1 VTQPAQWKAAVDTAVTAFGGLHVLVNNAGILNIGTIEDYALTEWQRILDVNLTGVFLGIR 111-------------------------------3333---------------------- AVVKPMKEAGRGSIINISSIEGLAGTVACHGYTATKFAVRGLTKSTALELGPSGIRVNSI ------------------1111---------------------------3333------- HPGLVKTPMTDWVPEDIFQTALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVAG ------3333---1111--1111---3333---------3333----------iiii--- LAHN ---- >D-HYDANTOINASE; SWP:Q8VTT5; PDB:1NFGA; 
MDIIIKNGTIVTADGISRADLGIKDGKITQIGGALGPAERTIDAAGRYVFPGGIDVHTHV -----------------------iiii----------------2222------------- ETVSFNTQSADTFATATVAAACGGTTTIVDFCQQDRGHSLAEAVAKWDGMAGGKSAIDYG -----------3333-------------------2222------------2222------ YHIIVLDPTDSVIEELEVLPDLGITSFVFMAYRGMNMIDDVTLLKTLDKAVKTGSLVMVH ------------3333-3333--------------------------------------- AENGDAADYLRDKFVAEGKTAPIYHALSRPPRVEAEATARALALAEIVNAPIYIVHVTCE --------------------3333-----3333--------------------------- ESLEEVMRAKSRGVRALAETCTHYLYLTKEDLERPDFEGAKYVFTPPARAKKDHDVLWNA ----------------------1111--------%%%%3333-------3333------- LRNGVFETVSSDHCSWLFKGHKDRGRNDFRAIPNGAPGVEERLMMVYQGVNEGRISLTQF 1111----------------3333---3333------3333--------1111--3333- VELVATRPAKVFGMFPQKGTIAVGSDADIVLWDPEAEMVIEQTAMHNAMDYSSYEGHKVK ----------------------------------------3333-------1111----- GVPKTVLLRGKVIVDEGSYVGEPTDGKFLKRRKYKQ -------iiii---iiii---1111----------- >CONSERVED HYPOTHETICAL PR; SWP:O28323; PDB:1NFJA; EHVVYVGNKPVMNYVLATLTQLNEGADEVVIKARGRAISRAVDVAEIVRNRFMPGVKVKE ---------3333--------------------!!!!----------------------- IKIDTEELESEQGRRSNVSTIEIVLAK ----------%%%%------------- >LUXF GENE PRODUCT; SWP:P09142; PDB:1NFP; MTKWNYGVFFLNFYHVGQQEPSLTMSNALETLRIIDEDTSIYDVVAFSEHHIDKSYNDET ---------------%%%%3333--------------------------------1111- KLAPFVSLGKQIHVLATSPETVVKAAKYGMPLLFKWDDSQQKRIELLNHYQAAAAKFNVD -------!!!!--------------1111-----1111---------------------- IANVRHRLMLFVNVNDNPTQAKAELSIYLEDYLSYTQAETSIDEIINSNAAGNFDTCLHH 1111------------3333-------------------------1111----------- VAEMAQGLNNKVDFLFCFESMKDQENKKSLMINFDKRVINYRKEHNLN -------%%%%------1111--------------------------- >BACTERIOFERRITIN; SWP:Q93PP9; PDB:1NFVA; NREDRKAKVIEVLNKARAMELHAIHQYMNQHYSLDDMDYGELAANMKLIAIDEMRHAENF ------------------------------------------------------------ AERIKELGGEPTTQKEGKVVTGQAVPVIYESDADQEDATIEAYSQFLKVCKEQGDIVTAR ----1111------------------------------------------1111------ LFERIIEEEQAHLTYYENIGSHIKNLGDTYLAKIAGTPSSTGTASKGFV 
---------------------------------2222------------ >COAT PROTEIN; SWP:Q9PWT2; PDB:1NG0A; DWFDTGMITSYLGGFQRTAGTTDSQVFIVSPAALDRVGTIAKAYALWRPKHWEIVYLPRC ----------------------------------------1111---------------- STQTDGSIEMGFLLDYADSVPTNTRTMASSTSFTTSNVWGGGDGSSLLHTSMKSMGNAVT --------------1111----33331111------11113333-1111-----!!!!-- SALPCDEFSNKWFKLSWSTPEESENAHLTDTYVPARFVVRSDFPVVTADQPGHLWLRSRI ---33331111---------111133333333---------------------------- LLKGSVSPSTNL ------3333-- >NEUTROPHIL CYTOSOLIC FACT; SWP:P14598; PDB:1NG2A; ILQTYRAIADYEKTSGSEMALSTGDVVEVVEKSESGWWFCQMKAKRGWIPASFLEPLDSP --------------1111---2222-------3333-------------3333------- DETEDPEPNYAGEPYVAIKAYTAVEGDEVSLLEGEAVEVIHKLLDGWWVIRKDDVTGYFP --------3333------------1111--------------1111-------------3 SMYLQKSGQDVSQAQRQIKRGAPPRRSSIRNAHSIHQRSRKRLSQDAYRRNSVRFL 333--22223333-----------3333---------------------------- >HYPOTHETICAL PROTEIN YQEY; SWP:P54464; PDB:1NG6A; MSLLERLNQDMKLYMKNREKDKLTVVRMVKASLQNEAIKLKKDSLTEDEELTVLSRELKQ ------------------------------------------------------------ RKDSLQEFSNANRLDLVDKVQKELDILEVYLPEQLSEEELRTIVNETIAEVGASSKADMG ---------------------------1111-----------------1111--3333-- KVMGAIMPKVKGKADGSLINKLVSSQLS -----33332222--------------- >GENOME POLYPROTEIN [CORE ; SWP:P03300; PDB:1NG7A; MGPLQYKDLKIDIKTSPPPECINDLLQAVDSQEVRDYCEKKGWIVNITSQVQTERNINRA ---------------------33331111-------1111-------------------- >HEMOGLOBIN-LIKE PROTEIN H; SWP:O53197; PDB:1NGKA; KSFYDAVGGAKTFDAIVSRFYAQVAEDEVLRRVYPEDDLAGAEERLRMFLEQYWGGPRTY ----1111--------------3333-3333--------------------1111----- SEQRGHPRLRMRHAPFRISLIERDAWLRCMHTAVASIDSETLDDEHRRELLDYLEMAAHS ------------3333---------------------3333-----------------11 LVNSPF 11---- >Transcription factor IIIB; SWP:P29056; PDB:1NGMB; GSYCPRNLHLLPTTDTYLSKVSDDPDNLEDVDDEELNAHLLNEEASKLKERIWIGLNADF ------------3333-1111------------3333-----3333-------------- LLEQESKRLKQE --------1111 >METHYL-CPG BINDING PROTEI; SWP:Q9Z2D7; PDB:1NGNA; 
KWTPPRSPFNLVQEILFHDPWKLLIATIFLNRTSGKMAIPVLWEFLEKYPSAEVARAADW ----------3333-3333----------22223333-----------------1111-- RDVSELLKPLGLYDLRAKTIIKFSDEYLTKQWRYPIELHGIGKYGNDSYRIFCVNEWKQV ------3333-3333------------------33332222--------------3333- HPEDHKLNKYHDWLWENHEKLSLS ------------------------ >P75 LOW AFFINITY NEUROTRO; SWP:P07174; PDB:1NGR; GNLYSSLPLTKREEVEKLLNGDTWRHLAGELGYQPEHIDSFTHEACPVRALLASWGAQDS ---3333-----3333-----3333---1111-3333----------------1111--- ATLDALLAALRRIQRADIVESLCSE -3333-------------------- >PROTEASE RETROPEPSIN; SWP:P03367; PDB:1NH0A; PQITLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMSLPGRWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------2222-------- QILIEICGHKAIGTVLVGPTPVNIIGRNLLTQIGCTLNF -----iiii----------------3333-1111----- >AVIRULENCE B PROTEIN; SWP:P13835; PDB:1NH1A; ALPGPSQRQLEVYDQCLIGAARWPDDSSKSNTPENRAYCQSMYNSIRSAGDEISRGGITS ----------------2222--------------------------------1111---- FEELWGRATEWRLSKLQRGEPLYSAFASERTSDTDAVTPLVKPYKSVLARVVDHEDAHDE -----------3333----11113333-----------------------1111------ IMQDNLFGDLNVKVYRQTAYLHGNVIPLNTFRVATDTEYLRDRVAHLRTELGAKALKQHL --------------------iiii------------------------------------ QRYNPDRIDHTNASYLPIIKDHLNDLYRQAISSDLSQAELISLIARTHWWAASAMPDQRG -----------3333--------------------------------------------- SAAKAEFAARAIASAHGIELPPFRNGNVSDIEAMLSGEEEFVEKYRSLLD -------------1111------2222-----1111--------3333-- >Transcription initiation ; SWP:P32773; PDB:1NH2B; NAEASRVYEIIVESVVNEVREDFENAGIDEQTLQDLKNIWQKKLTE -----------------------1111------------------- >Transcription initiation ; SWP:P32774; PDB:1NH2D; GYYELYRRSTIGNSLVDALDTLISDGRIEASLAMRVLETFDKVVAETLKDNTQSKLTVKG ----------------------1111---------------------------------- NLDTYGFCDDVWTFIVKNCQVTVEDQSVISVDKLRIVACNSKKS -------%%%%--------------------------------- >ATP PHOSPHORIBOSYLTRANSFE; SWP:O33256; PDB:1NH8A; MLRVAVPNKGALSEPATEILAEAGYRRRTDSKDLTVIDPVNNVEFFFLRPKDIAIYVGSG ---------1111----------------1111-----1111------3333-------- 
ELDFGITGRDLVCDSGAQVRERLALGFGSSSFRYAAPAGRNWTTADLAGMRIATAYPNLV ------------3333--------------------------33332222---------- RKDLATKGIEATVIRLDGAVEISVQLGVADAIADVVGSGRTLSQHDLVAFGEPLCDSEAV -------------------33331111--------------------------------- LIERAEARDQLVARVQGVVFGQQYLMLDYDCPRSALKKATAITPGLESPTIAPLADPDWV --------------------1111-------3333--------------------1111- AIRALVPRRDVNGIMDELAAIGAKAILASDIRFCRF ------1111--------1111-------------- >DNA-BINDING PROTEIN ALBA; SWP:Q57665; PDB:1NH9A; MDNVVLIGKKPVMNYVVAVLTQLTSNDEVIIKARGKAINKAVDVAEMIRNRFIKDIKIKK ----------3333-------------------!!!!---------------1111---- IEIGTDKEVNVSTIEIVLAK -------------------- >POLYGALACTURONASE I; SWP:P26213; PDB:1NHCA; STCTFTSASEASESISSCSDVVLSSIEVPAGETLDLSDAADGSTITFEGTTSFGYKEWKG -------------3333-----------2222---11112222----------------- PLIRFGGKDLTVTMADGAVIDGDGSRWWDSKGTNGGKTKPKFMYIHDVEDSTFKGINIKN --------------2222-----3333---!!!!-------------------------- TPVQAISVQATNVHLNDFTIDNSDGDDNGGHNTDGFDISESTGVYISGATVKNQDDCIAI -----------------------3333--------------------------------- NSGESISFTGGTCSGGHGLSIGSVGGRDDNTVKNVTISDSTVSNSANGVRIKTIYKETGD -----------------------------------------------------2222--- VSEITYSNIQLSGITDYGIVIEQDYENGSPTGTPSTGIPITDVTVDGVTGTLEDDATQVY -------------------------iiii-----------------------1111---- ILCGDGSCSDWTWSGVDLSGGKTSDKCENVPSGASC ---2222-----------------------1111-- >Nucleoside diphosphate ki; SWP:P15266; PDB:1NHKR; AIERTLSIIKPDGLEKGVIGKIISRFEEKGLKPVAIRLQHLSQAQAEGFYAVHKARPFFK -------------1111--------3333--------------------3333--1111- DLVQFMISGPVVLMVLEGENAVLANRDIMGATNPAQAAEGTIRKDFATSIDKNTVHGSDS ----1111------------------------3333-2222-------3333-------- LENAKIEIAYFFRETEIHSYPYQK ------------1111-------- >SYNAPTOSOMAL-ASSOCIATED P; SWP:O00161; PDB:1NHLA; STRRILGLAIESQDAGIKTITLDEQKEQLNRIEEGLDQINKDRETEKTLTEL -------------------------------------------3333----- >PROBABLE THIOREDOXIN; SWP:O26898; PDB:1NHOA; MVVNIEVFTSPTCPYCPMAIEVVDEAKKEFGDKIDVEKIDIMVDREKAIEYGLMAVPAIA 
---------1111--------------------------1111----------------- INGVVRFVGAPSREELFEAINDEME -----------3333---------- >NADH PEROXIDASE; SWP:P37062; PDB:1NHS; MKVIVLGSSHGGYEAVEELLNLHPDAEIQWYEKGDFISFLCCGMQLYLEGKVKDVNSVRY --------3333----------3333-------------3333----------1111--- MTGEKMESRGVNVFSNTEITAIQPKEHQVTVKDLVSGEERVENYDKLIISPGAVPFELDI -----3333--------------1111--------------------------------2 PGKDLDNIYLMRGRQWAIKLKQKTVDPEVNNVVVIGSGYIGIEAAEAFAKAGKKVTVIDI 222------------------33333333-------------------1111-------- LDRPLGVYLDKEFTDVLTEEMEANNITIATGETVERYEGDGRVQKVVTDKNAYDADLVVV ----3333-3333----------------------------------1111--------- AVGVRPNTAWLKGTLELHPNGLIKTDEYMRTSEPDVFAVGDATLIKYNPADTEVNIALAT -------3333------1111----1111---2222---1111----------------- NARKQGRFAVKNLEEPVKPFPGVQGSSGLAVFDYKFASTGINEVMAQKLGKETKAVTVVE ---------1111-----------------!!!!-------------------------- DYLMDFNPDKQKAWFKLVYDPETTQILGAQLMSKADLTANINAISLAIQAKMTIEDLAYA ---1111-----------------------------3333-------------------- DFFFQPAFDKPWNIINTAALEAVKQER ----1111----3333-------1111 >ELONGATION FACTOR 1-GAMMA; SWP:P29547; PDB:1NHYA; SQGTLYANFRIRTWVPRGLVKALKLDVKVVTPDAAAEQFARDFPLKKVPAFVGPKGYKLT ---------3333-------1111------3333--------1111-------%%%%--- EAAINYYLVKLSQDDKKTQLLGADDDLNAQAQIIRWQSLANSDLCIQIANTIVPLKGGAP ----------------------11113333---------------3333----------- YNKKSVDSADAVDKIVDIFENRLKNYTYLATENISLADLVAASIFTRYFESLFGTEWRAQ -3333----------------3333--------------------3333----------- HPAIVRWFNTVRASPFLKDEYKDFKFADKPLSPPQ -------------33331111-------------- >GLUCOCORTICOID RECEPTOR; SWP:P04150; PDB:1NHZA; PTLVSLLEVIEPEVLYAGYDSSVPDSTWRIMTTLNMLGGRQVIAAVKWAKAIPGFRNLHL -----------------------------------------------------3333--- DDQMTLLQYSWMSLMAFALGWRSYRQSSANLLCFAPDLIINEQRMTLPDMYDQCKHMLYV --------------------------%%%%----1111--3333--22223333------ SSELHRLQVSYEEYLCMKTLLLLSSVPKDGLKSQELFDEIRMTYIKELGKAIVKREGNSS ------------------3333----1111------------------------------ 
QNWQRFYQLTKLLDSMHEVVENLLNYCFQTFLDKTMSIEFPEMLAEIITNNIKKLLFHQ ----------------------------------------------------------- >EZRIN; SWP:P15311; PDB:1NI2A; PKPINVRVTTMDAELEFAIQPNTTGKQLFDQVVKTIGLREVWYFGLHYVDNKGFPTWLKL -------------------1111----------------3333------1111------- DKKVSAQEVRKENPLQFKFRAKFYPEDVAEELIQDITQKLFFLQVKEGILSDEIYCPPET --3333--------------------3333------------------1111----3333 AVLLGSYAVQAKFGDYNKEVHKSGYLSSERLIPQRVMDQHKLTRDQWEDRIQVWHAEHRG -----------------------1111-----33331111--3333--------3333-- MLKDNAMLEYLKIAQDLEMYGINYFEIKNKKGTDLWLGVDALGLNIYEKDDKLTPKIGFP -3333-------33331111--------1111-------3333----1111--------3 WSEIRNISFNDKKFVIKPIDKKAPDFVFYAPRLRINKRILQLCMGNHELYMRRRKP 333----------------------------3333----------------1111- >YCHF GTP-BINDING PROTEIN; SWP:O13998; PDB:1NI3A; KVQWGRPGNNLKTGIVGPNVGKSTFFRAITKSVLGNPANYPYATIDPEEAKVAVPDERFD ---------------------------------------------1111----------- WLCEAYKPKSRVPAFLTVFDIAGLTKGASTGVGLGNAFLSHVRAVDAIYQVVRAFDDAEI -----------------------------------3333-3333-----------1111- IHVEGDVDPIRDLSIIVDELLIKDAEFVEKHLEGLRKITSRGANTLEKAKKEEQAIIEKV ------------------------------------------------------------ YQYLTETKQPIRKGDWSNREVEIINSLYLLTAKPVIYLVNSERDFLRQKNKYLPKIKKWI ---------3333-----------11111111--------3333---------------- DENSPGDTLIPSVAFEERLTNFTEEEAIEECKKLNTKSLPKIIVTGYNALNLINYFTCGE ----------------3333----3333------------------------------33 DEVRSWTIRKGTKAPQAAGVIHTDFEKAFVVGEIHYQDLFDYKTENACRAAGKYLTKGKE 33--------------3333-3333---------33331111------------------ YVESGDIAHWK --2222----- >Pyruvate dehydrogenase E1; SWP:P08559; PDB:1NI4A; SFANDATFEIKKCDLHRLEEGPPVTTVLTREDGLKYYRQTVRRELKADQLYKQKIIRGFC ---------------------------------------3333----------------- HLCDGQEACCVGLEAGINPTDHLITAYRAHGFTFTRGLSVREILAELTGRKGGCAKGKGG --2222-------11111111------------1111------------1111-%%%%-- SHYAKNFYGGNGIVGAQVPLGAGIALACKYNGKDEVCLTLYGDGAANQGQIFEAYNAALW ---2222-----2222-----------------------------------------111 
KLPCIFICENNRYGGTSVERAAASTDYYKRGDFIPGLRVDGDILCVREATRFAAAYCRSG 1---------------3333-----1111-!!!!-------------------------- KGPILELQTYRYHGHSSDPGVSYRTREEIQEVRSKSDPILLKDRVNSNLASVEELKEIDV ------------------------------------------------------------ EVRKEIEDAAQFATADPEPPLEELGYHIYSSDPPFEVRGANQWIKFKSVS -------------------3333-----------------1111------ >Pyruvate dehydrogenase E1; SWP:P11177; PDB:1NI4B; SLQVTVRDAINQGDEELERDEKVFLLGEEVAQYDGAYKVSRGLWKKYGDKRIIDTPISEG -------------------1111-------11111111-22223333------------- FAGIAVGAAAGLRPICEFTFNFSQAIDQVINSAAKTYYSGGLQPVPIVFRGPNGASAGVA ------------------3333----------------iiii-----------------1 AQHSQCFAAWYGHCPGLKVVSPWNSEDAKGLIKSAIRDNNPVVVLENELYGVPFEFPPEA 111----------2222---------------------------------------3333 QSKDFLIPIGKAKIERQGTHITVVSHSRPVGHCLEAAAVLSKEGVECEVINRTIRPDETI -1111--2222---------------3333----------1111---------------- EASVKTNHLVTVEGGWPQFGVGAEICARIEGPAFNFLDAPAVRVTGADVPPYAKILEDNS ----------------2222------------3333------------------------ IPQVKDIIFAIKKTLNI --3333----------- >PUTATIVE CELL CYCLE PROTE; SWP:P52097; PDB:1NI5A; STLTLNRQLLTSRQILVAFSGGLDSTVLLHQLVQWRTENPGVALRAIHVHHGLSANADAW --------1111--------------------------2222-----------1111--- VTHCENVCQQWQVPLVVERVQLAQEGLGIEAQARQARYQAFARTLLPGEVLVTAQHLDDQ --------1111---------------------------------2222------3333- CETFLLALKRGSGPAGLSAAEVSEFAGTRLIRPLLARTRGELVQWARQYDLRWIEDESNQ ------------3333-----------------1111---------1111---------- DDSYDRNFLRLRVVPLLQQRWPHFAEATARSAALCAEQESLLDELLADDLAHCQSPQGTL 11113333------------1111------------------------------3333-- QIVPLASDARRAAIIRRWLAGQNAPPSRDALVRIWQEVALAREDASPCLRLGAFEIRRYQ -------------------1111---3333-------11113333-----!!!!----%% SQLWWIKSVTGQSENIVPWQTWLQPLELPAGLGSVQLNAGGDIRPPRADEAVSVRFKAPG %%---------1111-----3333----%%%%--------------1111---------- LLHIVGRNGGRKLKKIWQELGVPPWLRDTTPLLFYGETLIAAAGVFVTQEGVAEGENGVS ---2222---------------3333--------!!!!---2222--1111--------- FVWQKTLS -------- >HYPOTHETICAL PROTEIN YGDK; 
SWP:Q46926; PDB:1NI7A; MTNPQFAGHPFGTTVTAETLRNTFAPLTQWEDKYRQLIMLGKQLPALPDELKAQAKEIAG ---------------3333----3333-3333--------3333---------------- CENRVWLGYTVAENGKMHFFGDSEGRIVRGLLAVLLTAVEGKTAAELQAQSPLALFDELG -----------3333---------------------1111------33333333------ LRAQLSASRSQGLNALSEAIIAATKQVLE -3333--3333------------------ >PROTEIN GLPX; SWP:P28860; PDB:1NI9A; HMRRELAIEFSRVTESAALAGYKWLGRGDKNTADGAAVNAMRIMLNQVNIDGTIVIGEGE --3333------------------------------------------------------ APMLYIGEKVGTGRGDAVDIAVDPIEGTQANALAVLAVGDKGCFLNAPDMYMEKLIVGPG ----2222-------------------------------2222--------------333 AKGTIDLNLPLADNLRNVAAALGKPLSELTVTILAKHDAVIAEMQQLGVRVFAIPDGDVA 3----1111----------1111-3333-------------------------------- ASILTCMPDSEVDVLYGIGGAPEGVVSAAVIRALDGDMNGRLLAGIEAGKVLRLGDMARS -------------------------------1111-----------------3333---- DNVIFSATGITKGDLLEGISRKGNIATTETLLIRGKSRTIRRIQSIHYLDR ---------------------!!!!---------1111------------- >HYPOTHETICAL PROTEIN TA12; SWP:Q9HIT9; PDB:1NIGA; MDIKRYCPVTDSELPADHVYFKFRSEIEAAEAYLGLAISEGIKVRETREILDIIDTVYNS --------------1111-------------------3333------------------- LSDSKLNDFQEKRLNFTEEDWYDIKEKANNGNRWSLYMFLARSAVDSAVYWSYRMKETEE ----------------------3333---------------------------------- FKEIVKEEMISKLLKAGYVILRESLG -----3333-------------1111 >HYPOTHETICAL PROTEIN YJIA; SWP:P24203; PDB:1NIJA; NPIAVTLLTGFLGAGKTTLLRHILNEQHGYKIAVIENEFGEVSVDDQLIGDRATQIKTLT ---------------3333------------------------------1111-----11 NGCICCSRSNELEDALLDLLDNLDKGNIQFDRLVIECTGMADPGPIIQTFFSHEVLCQRY 11--------------------3333-----------!!!!------------------- LLDGVIALVDAVHADEQMNQFTIAQSQVGYADRILLTKTDVAGEAEKLHERLARINARAP ---------3333---33331111---1111------1111------------------- VYTVTHGDIDLGLLFNTNGFMLEENVVSTKPRFHFIADKQNDISSIVVELDYPVDISEVS ---------3333----1111--------------3333--------------------- RVMENLLLESADKLLRYKGMLWIDGEPNRLLFQGVQRLYSADWDRPWGDEKPHSTMVFIG ---------1111---------2222--------!!!!--------!!!!---------- IQLPEEEIRAAFAGLRK 
-----------1111-- >B-LUFFIN; SWP:P22851; PDB:1NIOA; ANVSFSLSGADSKSYSKFITALRKALPSKEKVSNIPLLLPSASGASRYILMQLSNYDAKA ------2222------------3333-----iiii--------3333-------1111-- ITMAIDVTNVYIMGYLVNSTSYFFNESDAKLASQYVFKGSTIVTLPYSGNYERLQNAAGK -------------------------------1111-2222-------------------- VREKIPLGFRAFDSAITSLFHYDSTAAAGAFLVIIQTTAEASRFKYIEGQIIKRIPKNEV 3333------------------1111-------------------------1111----- PSPAALSLENEWSALSKQIQLAQTNNGAFRTPVVIIDNKGQRVEIKDVNSKVVTNNIKLL -3333---------------3333iiii--------1111------11113333------ LNKQNIA -3333-- >NITRITE REDUCTASE; SWP:P24474; PDB:1NIRA; AAEQYQGAASAVDPAHVVRTNGAPDMSESEFNEAKQIYFQRCAGCHGVLRKGATGKPLTP 3333--------3333------------------------------1111--------33 DITQQRGQQYLEALITYGTPLGMPNWGSSGELSKEQITLMAKYIQHTPPQPPEWGMPEMR 33------------------------3333-----------1111--------------- ESWKVLVKPEDRPKKQLNDLDLPNLFSVTLRDAGQIALVDGDSKKIVKVIDTGYAVHISR -------3333---------3333------1111-------------------------- MSASGRYLLVIGRDARIDMIDLWAKEPTKVAEIKIGIEARSVESSKFKGYEDRYTIAGAY -1111---------------1111----------------------2222---------- WPPQFAIMDGETLEPKQIVSTRGMTVDTQTYHPEPRVAAIIASHEHPEFIVNVKETGKVL -----------------------------------------------------1111--- LVNYKDIDNLTVTSIGAAPFLHDGGWDSSHRYFMTAANNSNKVAVIDSKDRRLSALVDVG -----3333-----------------1111-------1111------------------- KTPHPGRGANFVHPKYGPVWSTSHLGDGSISLIGTDPKNHPQYAWKKVAELQGQGGGSLF ----!!!!-------------------------------3333----------------- IKTHPKSSHLYVDTTFNPDARISQSVAVFDLKNLDAKYQVLPIAEWADLGEGAKRVVQPE ---1111------1111-3333-------1111-----------3333------------ YNKRGDEVWFSVWNGKNDSSALVVVDDKTLKLKAVVKDPRLITPTGKFNVYNTQHDVY -3333---------1111-------------------1111-----------1111-- --------------------------------- >HAINANTOXIN-IV; SWP:P83471; PDB:1NIYA; ECLGFGKGCNPSNDQCCKSSNLVCSRKHRWCKYEI ---------3333----1111-------------- >NPL4; SWP:Q9ES54; PDB:1NJ3A; GSTSAMWACQHCTFMNQPGTGHCEMCSLPRT ----------------1111----------- >PROLINE-TRNA SYNTHETASE; SWP:Q58635; PDB:1NJ8A; 
MLEFSEWYSDILEKAEIYDVRYPIKGCGVYLPYGFKIRRYTFEIIRNLLDESGHDEALFP ------------1111-------------------------------------------- MLIPEDLLAKEAEHIKGFEDEVYWVTHGGKTQLDVKLALRPTSETPIYYMMKLWVKVHTD -----3333--------3333---------------------3333---3333---1111 LPIKIYQIVNTFRYETKHTRPLIRLREIMTFKEAHTAHSTKEEAENQVKEAISIYKKFFD -----------------------------------------------------------1 TLGIPYLISKRPEWDKFPGAEYTMAFDTIFPDGRTMQIATVHNLGQNFSKTFEIIFETPT 111--------------------------1111----------------1111------- GDKDYAYQTCYGISDRVIASIIAIHGDEKGLILPPIVAPIQVVIVPLIFKGKEDIVMEKA --------------------------1111---3333----------------------- KEIYEKLKGKFRVHIDDRDIRPGRKFNDWEIKGVPLRIEVGPKDIENKKITLFRRDTMEK -----------------------------------------3333--------------- FQVDETQLMEVVEKTLNNIMENIKNRAWEKFENFITILEDINPDEIKNILSEKRGVILVP ---1111------------------------1111------3333--1111--------- FKEEIYNEELEEKVEATILGETEYKGNKYIAIAKTY ------------------------------------ >IMMUNOGLOBULIN VARIABLE C; SWP:NA; PDB:1NJ9H; EVQLQQSGPELVKPGASVKVSCKASGYSFTDYNMYWVKQNHGESLEWIAYIDPSNGDTFY ---------------------------1111----------------------------- NQKFQGKATVTLDKSSSTAFMHLNSL 1111---------1111--------- >THYMIDYLATE SYNTHASE; SWP:P00469; PDB:1NJD; MLEQPYLDLAKKVLDEGHFKPDRTHTGTYSIFGHQMRFDLSKGFPLLTTKKVPFGLIKSE 1111----------------------------------1111------------------ LLWFLHGDTNIRFLLQHRNHIWDEWAFEKWVKSDEYHGPDMTDFGHRSQKDPEFAAVYHE ---1111--3333------1111---------1111------------------------ EMAKFDDRVLHDDAFAAKYGDLGLVYGSQWRAWHTSKGDTIDQLGDVIEQIKTHPYSRRL -----------------------------------------------------1111--- IVSAWNPEDVPTMALPPCHTLYQFYVNDGKLSLQLYQRSADIFLGVPFDIASYALLTHLV -----3333-----------------iiii------------------------------ AHECGLEVGEFIHTFGDAHLYVNHLDQIKEQLSRTPRPAPTLQLNPDKHDIFDFDMKDIK --------------------1111------1111---------------3333-3333-- LLNYDPYPAIKAPVAV ---------------- >DNA POLYMERASE III SUBUNI; SWP:P06710; PDB:1NJGA; QVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVGKTSIARLLAKGLN 
-3333-----3333--------------1111----------2222-------------- CETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTKVEDTRDLLDNVQYAPARGRFKVY 1111--------------1111-1111---1111-------------------------- LIDEVHMLSRHSFNALLKTLEEPPEHVKFLLATTDPQKLPVTILSRCLQFHLKALDVEQI ---3333----------------1111-------3333-33331111------------- RHQLEHILNEEHIAHEPRALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAMLGT ----------------3333------iiii-------------------------1111- >PROTEIN YOJF; SWP:O31858_BACSU; PDB:1NJHA; AKAIIKEDVQASLERYADRPVYIHLETTTGTVVAYIRNAKVTYHQAKIKGNGPYRVGLKT ------------3333-------------------------------------------- EEGWIYAEGLTEYTVDEENRLLAGHLPGGKLAISLQISEKPFTV ---------------1111-------1111-------------- >HYPOTHETICAL PROTEIN YBAW; SWP:P77712; PDB:1NJKA; HQTQIKVRGYHLDVYQHVNNARYLEFLEEARWDGLENSDSFQWTAHNIAFVVVNININYR -------1111-1111--3333------------3333-----1111------------- RPAVLSDLLTITSQLQQLNGKSGILSQVITLEPEGQVVADALITFVCIDLKTQKALALEG ---2222-----------------------------------------3333-------- ELREKLEQVK --1111---- >SUPERMAN PROTEIN; SWP:Q38895; PDB:1NJQA; WPPRSYTCSFCKREFRSAQALGGHMNVHRRDRARLRL ----------------3333----------------- >32.1 KDA PROTEIN IN ADH3-; SWP:Q04299; PDB:1NJRA; KRIILCDTNEVVTNLWQESIPKYLCIHHGHLQSLDSRKGDAHSYAIVSPGNSYGYLGGGF -----------------------------3333-----------------1111----33 DKALYNYFGGKPFETWFRNQLGGRYHTVGSATVVDLQRCLEECRDGIRYIIHVPTVVAPS 33------------------------2222-----3333--------------------- APIFNPQNPLKTGFEPVFNAWNALHSPKDIDGLIIPGLCTGYAGVPPIISCKSAFALRLY ----3333-1111-------------1111------22223333-3333----------- AGDHISKELKNVLIYYLQYPFEPFFPESCKIECQKLGIDIELKSFNVEKDAIELLIPRRI !!!!----------1111--3333-3333------------11111111--3333-3333 ----------------------------------------------------------- >RHAMNOGALACTURONASE B; SWP:Q00019; PDB:1NKGA; AFGITTSSSAYVIDTNAPNQLKFTVSRSSCDITSIIHYGTELQYSSQGSHIGSGLGSATV ------1111------1111----------------iiii-------------------- TATQSGDYIKVTCVTDTLTQYMVVHNGDPIIHMATYITAEPSIGELRFIARLNSDLLPNE 
----!!!!----------------2222------------3333---------------- EPFGDVSTTADGTAIEGSDVFLVGSETRSKFYSSERFIDDQRHCIAGDAHRVCMILNQYE ----11112222----------!!!!---------3333-------------------11 SSSGGPFHRDINSNNGGSYNALYWYMNSGHVQTESYRMGLHGPYSMYFSRSGTPSTSIDT 11--1111---------------------------------------------------- SFFADLDIKGYVAASGRGKVAGTASGADSSMDWVVHWYNDAAQYWTYTSSSGSFTSPAMK --1111-2222-3333-----------1111-------1111------1111-------- PGTYTMVYYQGEYAVATSSVTVSAGSTTTKNISGSVKTGTTIFKIGEWDGQPTGFRNAAN ----------------------2222-------------------------2222-3333 QLRMHPSDSRMSSWGPLTYTVGSSALTDFPMAVFKSVNNPVTIKFTATSAQTGAATLRIG ----1111-----------2222-1111-----1111----------1111--------- TTLSFAGGRPQATINSYTGSAPAAPTNLDSRGVTRGAYRGLGEVYDVSIPSGTIVAGTNT ----iiii-----------------------1111--------------2222------- ITINVISGSSGDTYLSPNFIFDCVELFQ -----------!!!!------------- >PROBABLE FOSFOMYCIN RESIS; SWP:Q9I4K6; PDB:1NKIA; MLTGLNHLTLAVADLPASIAFYRDLLGFRLEARWDQGAYLELGSLWLCLSREPQYGGPAA ---------------------------------1111----!!!!------1111----- DYTHYAFGIAAADFARFAAQLRAHGVREWKQNRSEGDSFYFLDPDGHRLEAHVGDLRSRL ---------3333--------1111-----------------1111-------------- AACRQAPYAGMRFA -------2222--- >NK-LYSIN; SWP:Q29075; PDB:1NKL; GYFCESCRKIIQKLEDMVGPQPNEDTVTQAASQVCDKLKILRGLCKKIMRSFLRRISWDI ----------------------3333---------------------------------1 LTGKKPQAICVDIKICKE 111-------1111---- >SIALIC ACID BINDING IG-LI; SWP:Q9Y286; PDB:1NKOA; SNRKDYSLTMQSSVTVQEGMCVHVRCSFSYPVDSDTDSDPVHGYWFRAWKAPVATNNPAW ----------------2222---------------1111-----------------1111 AVQEETRDRFHLLGDPQTKNCTLSIRDARMSDAGRYFFRMEKGNIKWNYKYDQLSVNVTA --3333--------3333----------1111---------!!!!---1111-------- LT -- >MYC PROTO-ONCOGENE PROTEI; SWP:P01106; PDB:1NKPA; GHMNVKRRTHNVLERQRRNELKRSFFALRDQIPELENNEKAPKVVILKKATAYILSVQAE -------------------------------3333--1111------------------- EQKLISEEDLLRKRREQLKHKLEQLGGC ---------------------------- >Hypothetical 28.8 kDa pro; SWP:P53889; PDB:1NKQA; 
SYNYLKAARKIICIGRNYAAHIKELNNQPFFFLKPTSSIVTPLSSSPANSTFNGLNEDGT -3333---------------3333----------1111---1111----------3333- NPGPIFIPRGVKVHHEIELALIVSKHLSNVTKKPEEVYDSISGVALALDLTARNVQDEAK -------2222---------------------33331111-------------------- KKGLPWTISKGFDTFPISAIVSREKFSSYKSNLQDIFRVKCSVNGQLRQDGGTNLLHPLH -----------2222------33333333---1111------iiii-----------333 KILQHISTISLEPGDIILTGTPAGVGELKPGDRVHCELLQNNDNIVDNFECENRPGPYEF 3----------2222-------------2222-------%%%%----------------- RE -- >P58-CL42 KIR; SWP:P43626; PDB:1NKR; RKPSLLAHPGPLVKSEETVILQCWSDVMFEHFLLHREGMFNDTLRLIGEHHDGVSKANFS -------------2222------------------------------------------- ISRMTQDLAGTYRCYGSVTHSPYQVSAPSDPLDIVIIGLYEKPSLSAQPGPTVLAGENVT ----3333---------1111--------------------------------2222--- LSCSSRSSYDMYHLSREGEAHERRLPAGPKVNGTFQADFPLGPATHGGTYRCFGSFHDSP ---------------2222------------------------------------3333- YEWSKSSDPLLVSVT --------------- >ADENYLATE KINASE; SWP:P35028; PDB:1NKSA; MKIGIVTGIPGVGKSTVLAKVKEILDNQGINNKIINYGDFMLATALKLGYAKDRDEMRKL --------2222----------------------------------------3333---- SVEKQKKLQIDAAKGIAEEARAGGEGYLFIDTHAVIRTPSGYLPGLPSYVITEINPSVIF -------------------------------------1111-----33331111------ LLEADPKIILSRQKRDTTRNRNDYSDESVILETINFARYAATASAVLAGSTVKVIVNVEG ---------1111------------3333------------------------------- DPSIAANEIIRSMK 3333------1111 >PREPROTEIN TRANSLOCASE SE; SWP:O05885; PDB:1NKTA; KLLRLGEGRMVKRLKKVADYVGTLSDDVEKLTDAELRAKTDEFKRRLADQKNPETLDDLL --1111-------------3333--3333---------------11113333--3333-- PEAFAVAREAAWRVLDQRPFDVQVMGAAALHLGNVAEMKTGEGKTLTCVLPAYLNALAGN --------------------------------------22223333-------------- GVHIVTVNDYLAKRDSEWMGRVHRFLGLQVGVILATMTPDERRVAYNADITYGTNNEFGF -----------------------1111------1111-------1111------------ DYLRDNMAHSLDDLVQRGHHYAIVDEVDSILIDEARTPLIISGPADGASNWYTEFARLAP ---------3333----------------------------------3333-------33 LMEKDVHYEVDLRKRTVGVHEKGVEFVEDQLGIDNLYEAANSPLVSYLNNALKAKELFSR 
332222----------------------------11112222----------------22 DKDYIVRDGEVLIVDEFTGRVLIGRRYNEGMHQAIEAKEHVEIKAENQTLATITLQNYFR 22-------------------------iiii-----1111------------------33 LYDKLAGMTGTAQTEAAELHEIYKLGVVSIPTNMPMIREDQSDLIYKTEEAKYIAVVDDV 33---------3333--------------------------------------------- AERYAKGQPVLIGTTSVERSEYLSRQFTKRRIPHNVLNAKYHEQEATIIAVAGRRGGVTV ---------------------------1111-----------------1111-2222--- ATNMAGRGTDIVLGGNVDFLTDQRLRERGLDPVETPEEYEAAWHSELPIVKEEASKEAKE ---2222----2222----------1111------------------------------- VIEAGGLYVLGTERHESRRIDNQLRGRSGRQGDPGESRFYLSLGDELMRRFNGAALETLL -----------------------------iiii--------11113333----------- TRLNLPDDVPIEAKMVTRAIKSAQTQVEQQNFEVRKNVLKYDEVMNQQRKVIYAERRRIL 1111-1111---3333------------------------------------------11 EGENLKDQALDMVRDVITAYVDGATGEGYAEDWDLDALWTALKTLYPVGITADSLTLLEA 11--------------------------------3333---3333-----3333------ LLKDAERAYAAREAELEEIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAM ----------------------3333---------------------------3333--- AQRDPLVEYQREGYDMFMAMLDGMKEESVGFLFNVTV ------------------------------------- >HYPOTHETICAL PROTEIN YJHP; SWP:P39367; PDB:1NKVA; PRIFTISESEHRIHNPFTEEKYATLGRVLRKPGTRILDLGSGSGELCTWARDHGITGTGI ----3333------------------3333----------!!!!----1111-------- DSSLFTAQAKRRAEELGVSERVHFIHNDAAGYVANEKCDVAACVGATWIAGGFAGAEELL -----------------3333------------------------1111---1111--33 AQSLKPGGILIGEPYWRQLPATEEIAQACGVSSTSDFLTLPGLVGAFDDLGYDVVEVLAD 33--2222-----------------1111---1111--3333-----------------3 QEGWDRYEAAKWLTRRWLEANPDDDFAAEVRAELNIAPKRYVTYARECFGWGVFALIAR 333-------------1111---1111-------------------------------- >LIGHT-HARVESTING PROTEIN ; SWP:P26789; PDB:1NKZA; NQGKIWTVVNPAIGIPALLGSVTVIAILVHLAILSHTTWFPAYWQGGVKKAA -1111----3333-------------------------3333---------- >Light-harvesting protein ; SWP:P26790; PDB:1NKZB; ATLTAEQSEELHKYVIDGTRVFLGLALVAHFLAFSATPWLH -------------------------------------2222 >Lambda-chain [Precursor]; SWP:A2NUT2; PDB:1NL0L; 
QSVLTQPPSVSAAPGQKVTISCSGSTSN ------------2222------------ >PROTHROMBIN; SWP:P00735; PDB:1NL1A; ANKGFLVRKGNLRCLPCSRAFALSLSATDAFWAKYTACESARNPREKLNECLEGNCAEGV -----------------------------------1111------------------!!! GMNYRGNVSVTRSGIECQLWRSRYPHKPEINSTTHPGADLRENFCRNPDGSITGPWCYTT !---------1111----1111--------33332222--------11111111------ SPTLRREECSVPVCGQD 1111--------2222- >TRANSCRIPTIONAL REPRESSOR; SWP:P03050; PDB:1NLAA; MKGMSKMPQFLNRWPREVLDLVRKVAEENGRSVNSEIYQRVMESFKKEGRIGA -------3333---3333-------------3333---1111--3333----- >ANTIBODY 19D9D6 LIGHT CHA; SWP:NA; PDB:1NLBH; QIQLVQSGPELKKPGETVKISCKASGYTFTDFSMHWVNQAPGKGLNWMGWVNTETGEPTY ------------2222-----------1111--------2222----------------- ADDFKGRFAFSLETSASTAYLQINSLKNEDTATYFCARFLLRQYFDVWGAGTTVTVSSAK 3333--------3333----------3333---------1111----------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%-------------iiii- TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR --------1111-----------3333----------- >FAB1583; SWP:NA; PDB:1NLDH; QVKLQQSGPGLVQPSQSLSITCTVSGFSLTCYGVHWVRQSPGKGLEWLGVIWSGGDTDYN ---------------------------1111--------------------1111----3 AAFISRLSITKDNSKSQVFFKMNSL 333---------1111--------- >REGULATORY PROTEIN REPA; SWP:P20356; PDB:1NLFA; ATHKPINILEAFAAAPPPLDYVLPNMVAGTVGALVSPGGAGKSMLALQLAAQIAGGPDLL ----------------------22222222------2222-----------------111 EVGELPTGPVIYLPAEDPPTAIHHRLHALGAHLSAEERQAVADGLLIQPLIGSLPNIMAP 1----------------3333--------1111----------------2222--11113 EWFDGLKRAAEGRRLMVLDTLRRFHIEEENASGPMAQVIGRMEAIAADTGCSIVFLHHAV 333------2222------3333----1111-3333-----------------------3 LVDNIRWQSYLSSMTSAEAEEWGVDDDQRRFFVRFGVSKANYGAPFADRWFRRHDGGVLK 333----------------1111----3333----------------------2222--- PAVLERQRKSKGVP -------------- >ADENAIN; SWP:P03252; PDB:1NLNA; GSSEQELKAIVKDLGCGPYFLGTYDKRFPGFVSPHKLACAIVNTAGRETGGVHWMAFAWN ---------------3333-----1111-----------------3333----------- 
PRSKTCYLFEPFGFSDQRLKQVYQFEYESLLRRSAIASSPDRCITLEKSTQSVQGPNSAA ---------1111-------------------------1111------------1111-- CGLFCCMFLHAFANWPQTPMDHNPTMNLITGVPNSMLNSPQVQPTLRRNQEQLYSFLERH --------------1111----3333------3333--3333------------------ SPYFRSHSAQIRSATSFCHLKNM -----------------1111-- >Tyrosine-protein kinase t; SWP:P00525; PDB:1NLOC; TFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLTTGQTGYIPSNYVAPS ------------------2222--------------------------3333---- >NUCLEOPLASMIN-LIKE PROTEI; SWP:Q27415; PDB:1NLQA; EESFYGVTLTAESDSVTWDVDEDYARGQKLVIKQILLGAEAKENEFNVVEVNTPKDSVQI ---------3333------------------------11112222--------------- PIAVLKAGETRAVNPDVEFYESKVTFKLIKGSGPVYIHGHNIKDD -----2222------------------------------------ >CONCANAVALIN A; SWP:P02866; PDB:1NLS; ADTIVAVELDTYPNTDIGDPSYPHIGIDIKSVRSKKTAKWNMQNGKVGTAHIIYNSVDKR -------------1111-------------------------2222--------3333-- LSAVVSYPNADSATVSYDVDLDNVLPEWVRVGLSASTGLYKETNTILSWSFTSKLKSNST ------2222---------3333------------------------------------- HETNALHFMFNQFSKDQKDLILQGDATTGTDGNLELTRVSSNGSPQGSSVGRALFYAPVH ----------------1111--------2222-------1111----------------- IWESSAVVASFEATFTFLIKSPDSHPADGIAFFISNIDSSIPSGSTGRLLGLFPDAN --1111-----------------------------1111--2222!!!!-------- >MITOCHONDRIAL PROTEIN IMP; SWP:P25491; PDB:1NLTA; PQRGKDIKHEISASLEELYKGRTAKLALNKQILCKECEGRGGKKGAVKKCTSCNGQGIKF -------------3333-------------------iiii---------1111------- VTRQMGPMIQRFQTECDVCHGTGDIIDPKDRCKSCNGKKVENERKILEVHVEPGMKDGQR ----------------------------------iiii-------------22222222- IVFKGEADQAPDVIPGDVVFIVSERPHKSFKRDGDDLVYEAEIDLLTAIAGGEFALEHVS --------------------------------!!!!------------------------ GDWLKVGIVPGEVIAPGMRKVIEGKGMPIPKYGGYGNLIIKFTIKDPE --------2222--2222------------------------------ >MAD PROTEIN; SWP:Q05195; PDB:1NLWA; SRSTHNEMEKNRRAHLRLSLEKLKGLVPLGPDSSRHTTLSLLTKAKLHIKKLEDSDRKAV 3333------------------3333---------------------------------- HQIDQLQREQRHLKRQLEK ---------------3333 >POLLEN ALLERGEN PHL P 6; SWP:P43215; 
PDB:1NLXA; ATTEEQKLIEDVNASFRAAMATTANVPPADKYKTFEAAFTVSSKRNLADAVSKAPQLVPK --------------------------3333-----------------------3333--- LDEVYNAAYNAADHAAPEDKYEAFVLHFSEALRIIAGTPEVHAV -----------11113333------------------------- >ACTIN; SWP:P02577; PDB:1NM1A; DVQALVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHTKDSYVGDEAQSKRGILTLKYPI -----------------2222----------------------------3333------- EHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPLNPKANREKMTQIMFETFNTPA iiii---------------------3333-------2222-------------------- MYVAIQAVLSLYASGRTTGIVMDSGDGVSHTVPIYEGYALPHAILRLDLAGRDLTDYMMK -----------1111-------------------iiii-3333----------------- ILTERGYSFTTTAEREIVRDIKEKLAYVALDFEQEMATAASSSALEKSYELPDGQVITIG 3333--------------------------------------1111-------------- NERFRCPEALFQPSFLGMESAGIHETTYNSIMKCDVDIRKDLYGNVVLSGGTTMFPGIAD -------3333--1111-------------11113333-----------1111-2222-- RMNKELTALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKEEYDESGPSIVH ----------1111------1111---------33331111---------------3333 RKCF ---- >malonyl CoA:acyl carrier ; SWP:NA; PDB:1NM2A; HMLVLVAPGQGAQTPGFLTDWLALPGAADRVAAWSDAIGLDLAHFGTKADADEIRDTSVA --------2222-2222-3333-2222----------------------3333--3333- QPLLVAAGILSAAALGTGFTPGAVAGHSVGEITAAVFAGVLDDTAALSLVRRRGLAMAEA --------------------------!!!!-----1111--------------------3 AAVTETGMSALLGGDPEVSVAHLERLGLTPANVNGAGQIVAAGTMEQLAALNEDKPEGVR 333--------------------1111-------iiii-----------------2222- KVVPLKVAGAFHTRHMAPAVDKLAEAAKALTPADPKVTYVSNKDGRAVASGTEVLDRLVG ------------3333---------3333------------------------------- QVANPVRWDLCMETFKELGVTAIIEVCPGGTLTGLAKRALPGVKTLALKTPDDLDAAREL 1111-----------1111--------------------2222------3333------- VAEHT ----- >PROTEIN HI0572; SWP:P44758; PDB:1NM3A; SEGKKVPQVTFRTRQGDKWVDVTTSELFDNKTVIVFSLPGAFTPTCSSSHLPRYNELAPV --------------!!!!----3333-----------------3333------------- FKKYGVDDILVVSVNDTFVNAWKEDEKSENISFIPDGNGEFTEGGLVGKEDLGFGKRSWR -1111----------3333--3333----------11113333-----1111-------- 
YSLVKNGVVEKFIEPNEPGDPFKVSDADTLKYLAPQHQVQESISIFTKPGCPFCAKAKQL ----iiii-------------------------3333--------------3333----- LHDKGLSFEEIILGHDATIVSVRAVSGRTTVPQVFIGGKHIGGSDDLEKY -------------------------------------------3333--- >CARNITINE O-ACETYLTRANSFE; SWP:P43155; PDB:1NM8A; HHTDPLPRLPVPPLQQSLDHYLKALQPIVSEEEWAHTKQLVDEFQASGGVGERLQKGLER ------------------------1111-----------------2222----------- RARKTENWLSEWWLKTAYLQYRQPVVIYSSPGVMLPKQDFVDLQGQLRFAAKLIEGVLDF 1111--1111-------1111--------------------------------------- KVMIDNETLPVEYLGGKPLCMNQYYQILSSCRVPGPKQDTVSNFSKTKKPPTHITVVHNY -------------iiii---3333-------------------1111----------iii QFFELDVYHSDGTPLTADQIFVQLEKIWNSSLQTNKEPVGILTSNHRNSWAKAYNTLIKD i-------1111---------------1111------33331111----------1111- KVNRDSVRSIQKSIFTVCLDATMPRVSEDVYRSHVAGQMLHGGGSRLNSGNRWFDKTLQF ---------------------------1111------------11111111-3333---- IVAEDGSCGLVYEHAAAEGPPIVTLLDYVIEYTKKPELVRSPMVPLPMPKKLRFNITPEI --1111------3333---------------1111------------------------- KSDIEKAKQNLSIMIQDLDITVMVFHHFGKDFPKSEKLSPDAFIQMALQLAYYRIYGQAC --------------1111-------------3333------------------------- ATYESASLRMFHLGRTDTIRSASMDSLTFVKAMDDSSVTEHQKVELLRKAVQAHRGYTDR -------3333--------------------1111------------------------- AIRGEAFDRHLLGLKLQAIEDLVSTPDIFMDTSYAIAMHFHLSTSQVPAKTDCVMFFGPV 1111--------------1111---3333-3333-------------------------- VPDGYGVCYNPMEAHINFSLSAYNSCAETNAARLAHYLEKALLDMRALLQS -----------1111-------3333------------------------- >N9 NEURAMINIDASE; SWP:NA; PDB:1NMBH; QVQLQQPGAELVKPGASVRMSCKASGYTFTNYNMYWVKQSPGQGLEWIGIFYPGNGDTSY ------------2222-----------3333----------------------------- NQKFKDKATLTADKSSNTAYMQLSSLTSEDSAVYYCARSGGSYRYDGGFDYWGQGTTLTV 1111---------1111---------3333----------!!!!---------------- SS -- >Immnuoglobulin kappa ligh; SWP:A2NVF0; PDB:1NMBL; DIQMTQTTSSLSASLGDRVTISCRASQDISNYLNWYQQNPDGTVKLLIYYTSNLHSEVPS ----------------------------iiii--------------------------33 RFSGSGSGTDYSLTISNLEQEDIATYFCQQDFTLPFTFGGGTKLEIRRA 
33----!!!!--------1111--------------------------- >DI-HAEM CYTOCHROME C PERO; SWP:P83787; PDB:1NMLA; DNLMERANSMFEPIPKYPPVIDGNELTQAKVELGKMEFFEPRLSSSHLISCNTCHNVGLG --------------------%%%%------------1111---1111--3333--1111- GDDELPTSIGHGWQKGPRNSPTVFNAVFNAAQFWDGRAADLAEQAKGPVQAGVEMSSTPD ---------%%%%---------2222----------3333------33331111------ RVVATLKSMPEYIERFEDAFPGQENPVTFDNMAVAIEAYEATLITPEAPFDKYLRGDTSA -------------------2222---------------------------------1111 LNESEKEGLALFMDRGCTACHSGVNLGGQNYYPFGLVAKKGRFSVTETASDEYVFRASPL ------------11113333--1111-------------1111----2222--------2 RNIELTAPYFHSGAVWSLEEAVAVMGTAQLGTELNNDEVKSIVAFLKTLTGNVPEVTYPV 222------1111--------3333--1111----------------------------- LPPSTANTPKPVDMIP ----1111-------- >HYPOTHETICAL PROTEIN YQGF; SWP:P52050; PDB:1NMNA; SGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNIIERLLKEWQPDEIIVGL ---------1111-------1111----------iiii---------------------- PLNMDGTEQPLTARARKFANRIHGRFGVEVKLHDERLSTVGKVDSASAVIILESYFEQGY --1111--1111-----------------------------1111----------1111- >HYPOTHETICAL PROTEIN YBGI; SWP:P75743; PDB:1NMOA; KNTELEQLINEKLNSAAISDYAPNGLQVEGKETVQKIVTGVTASQALLDEAVRLGADAVI 3333----------3333------------------------------------------ VHHGYFWKGESPVIRGKRNRLKTLLANDINLYGWHLPLDAHPELGNNAQLAALLGITVGE ------2222--------------1111------3333---------------------- IEPLVPWGELTPVPGLELASWIEARLGRKPLWCGDTGPEVVQRVAWCTGGGQSFIDSAAR -1111----------------------------1111-------------1111------ FGVDAFITGEVSEQTIHSAREQGLHFYAAGHHATERGGIRALSEWLNENTDLDVTFIDIP -----------3333----------------1111------------------------- NPA --- >POLY(A)-BINDING PROTEIN; SWP:Q27335; PDB:1NMRA; GSSLASQGQNLSTVLANLTPEQQKNVLGERLYNHIVAINPAAAAKVTGMLLEMDNGEILN --------3333-3333-3333------------3333---3333--------------- LLDTPGLLDAKVQEALEVLNRHMNV ------------------------- >60S ribosomal protein L30; SWP:P14120; PDB:1NMUB; APVKSQESINQKLALVIKSGKYTLGYKSTVKSLRQGKSKLIIIAANTPVLRKSELEYYAM ----3333-----------------------------------1111------------- 
LSKTKVYYFQGGNNELGTAVGKLFRVGVVSILEAGDSDILTTLA ---------------------------------!!!!3333--- >RIBOSE 5-PHOSPHATE ISOMER; SWP:P37351; PDB:1NN4A; QMKKIAFGCDHVGFILKHEIVAHLVERGVEVIDKGTWSSERTDYPHYASQVALAVAGGEV ------------------------1111--------------3333-------------- DGGILICGTGVGISIAANKFAGIRAVVCSEPYSAQLSRQHNDTNVLAFGSRVVGLELAKM -------------------2222-------------------------1111-------- IVDAWLGAQYEGGRHQQRVEAITAIEQ -----------!!!!------------ >Similar to deoxythymidyla; SWP:P23919; PDB:1NN5A; RRGALIVLEGVDRAGKSTQSRKLVEALCAAGHRAELLRFPERSTEIGKLLSSYLQKKSDV ----------2222-------------1111---------3333--------1111---- EDHSVHLLFSANRWEQVPLIKEKLSQGVTLVVDRYAFSGVAFTGAKENFSLDWCKQPDVG ------------3333--------------------------1111--------3333-- LPKPDLVLFLQLQLADAAKRERYENGAFQERALRCFHQLMKDTTLNWKMVDASKSIEAVH ------------33333333-----------------11111111--------------- EDIRVLSEDAIATATEKPLGELWK ------------1111-------- >CHYMASE; SWP:P23946; PDB:1NN6A; GGTECKPHSRPYMAYLEIVTSNGPSKFCGGFLIRRNFVLTAAHCAGRSITVTLGAHNITE -----22221111------1111----------1111---1111------------1111 EEDTWQKLEVIKQFRHPKYNTSTLHHDIMLLKLKEKASLTLAVGTLPFPSQFNFVPPGRM -1111----------1111--------------------1111----------------- CRVAGWGRTGVLKPGSDTLQEVKLRLMDPQACSHFRDFDHNLQLCVGNPRKTAFKGDSGG -------------------------------3333---3333-----1111-----2222 PLLCAGVAQGIVSYGRSDAKPPAVFTRISHYRPWINQILQAN ---iiii--------1111-------3333--------1111 >POTASSIUM CHANNEL KV4.2; SWP:Q63881; PDB:1NN7A; LIVLNVSGTRFQTWQDTLERYPDTLLGSSERDFFYHPETQQYFFDRDPDIFRHILNFYRT -----iiii--------3333---11113333----1111--------3333--3333-- GKLHYPRHECISAYDEELAFFGLIPEIIGDCCYEEYKDRRRENAE -----3333--------------------1111--------1111 >IRON-UTILIZATION PERIPLAS; SWP:P35755; PDB:1NNFA; DITVYNGQQKEAATAVAKAFEQETGIKVTLNSGKSEQLAGQLKEEGDKTPADVFYTEQTA --------------------------------------------!!!!------------ TFADLSEAGLLAPISEQTIQQTAQKGVPLAPKKDWIALSGRSRVVVYDHTKLSEKDMEKS -----1111----------11112222--1111--------------1111-3333---3 
VLDYATPKWKGKIGYVSTSGAFLEQVVALSKMKGDKVALNWLKGLKENGKLYAKNSVALQ 333--3333------1111----------------------------------------- AVENGEVPAALINNYYWYNLAKEKGVENLKSRLYFVRHQDPGALVSYSGAAVLKASKNQA ------------3333--------3333--------%%%%1111--------1111---- EAQKFVDFLASKKGQEALVAARAEYPLRADVVSPFNLEPYEKLEAPVVSATTAQDKEHAI --------------------------------1111--3333------------------ KLIEEAGL -------- >ASPARAGINYL-TRNA SYNTHETA; SWP:Q8TZN6; PDB:1NNHA; NAVEIISREISPTLDIQTKILEYMTDFFVKEGFKWLLPVIISPITDPLWPDPAGEGMEPA ----1111--------------------1111------------------1111------ EVEIYGVKMRLTHSMILHKQLAIAMGLKKIFVLSPNIRLESRQKDDGRHAYEFTQLDFEV ---iiii---------------1111--------------3333---------------- ERAKMEDIMRLIERLVYGLFRKAEEWTGREFPKTKRFEVFEYSEVLEEFGSDEKASQEME ------------------------------------------------------------ EPFWIINIPREFYDREVDGFWRNYDLILPYGYGEVASGGEREWEYEKIVAKIRKAGLNED ----------3333--iiii--------iiii-----------3333-----1111-333 SFRPYLEIAKAGKLKPSAGAGIGVERLVRFIVGAKHIAEVQPFPRIPGIPAVI 3-------1111-----------------------3333------2222---- >Putative uncharacterized ; SWP:O07529; PDB:1NNI1; NLVINGTPRKHGRTRIAASYIAALYHTDLIDLSEFVLPVFNGEAEQSELLKVQELKQRVT --------1111------------------------------1111-------------- KADAIVLLSPEYHSGSGALKNALDFLSSEQFKYKPVALLAVAGGGKGGINALNNRTVRGV -----------%%%%------3333-33332222---------!!!!----------111 YANVIPKQLVLDPVHIDVENATVAENIKESIKELVEELSFAKAGN 1----------1111-3333---3333--------------3333 >L-3-PHOSPHOSERINE PHOSPHA; SWP:P78330; PDB:1NNLA; SELRKLFYSADAVCFDVDSTVIREEGIDELAKICGVEDAVPFKAALTERLALIQPSREQV ------1111------2222----------------3333-------------------- QRLIAEQPPHLTPGIRELVSRLQERNVQVFLISGGFRSIVEHVASKLNIPATNVFANRLK -----------2222-------1111---------3333-----1111-3333------- FYFNGEYAGFDETQPTAESGGKGKVIKLLKEKFHFKKIIMIGDGATDMEACPPADAFIGF -1111-----11111111-------------------------33331111--------- GGNVIRQQVKDNAKWYITDFVELLG ------------------3333--- >RUBRERYTHRIN; SWP:Q9UWP7; PDB:1NNQA; VVKRTMTKKFLEEAFAGESMAHMRYLIFAEKAEQEGFPNIAKLFRAIAYAEFVHAKNHFI 
--------------------------------1111-----------------------1 ALGKLGKTPENLQMGIEGETFEVEEMYPVYNKAAEFQGEKEAVRTTHYALEAEKIHAELY 111-------------------------------1111---------------------- RKAKEKAEKGEDIEIKKVYICPICGYTAVDEAPEYCPVCGAPKEKFVVFE ------1111-------------------------------3333----- >L-ASPARAGINASE II; SWP:P00805; PDB:1NNSA; LPNITILATGGTIAGGGDSATKSNYTVGKVGVENLVNAVPQLKDIANVKGEQVVNIGSQD -----------1111-------------------3333-3333-------------3333 MNDNVWLTLAKKINTDCDKTDGFVITHGTDTMEETAYFLDLTVKCDKPVVMVGAMRPSTS ---------------1111---------1111------------------------1111 MSADGPFNLYNAVVTAADKASANRGVLVVMNDTVLDGRDVTKTNTTDVATFKSVNYGPLG -----------------3333--------%%%%--3333-------1111---------- YIHNGKIDYQRTPARKHTSDTPFDVSKLNELPKVGIVYNYANASDLPAKALVDAGYDGIV --iiii----------!!!!----1111-----------------------1111----- SAGVGNGNLYKSVFDTLATAAKTGTAVVRSSRVPTGATTQDAEVDDAKYGFVASGTLNPQ --------------------1111--------------------3333-----!!!!--- KARVLLQLALTQTKDPQQIQQIFNQY --------1111-------------- >HYPOTHETICAL PROTEIN HI14; SWP:P44199; PDB:1NNVA; MTTEIKKLDPDTAIDIAYDIFLEMAGENLDPADILLFNLQFEERGGVEFVETADDWEEEI -------------------------3333-3333-------------------------- GVLIDPEEYAEVWVGLVNEQDEMDDVFAKFLISHREEDREFHVIWKK -----------------1111-------------------------- >HYPOTHETICAL PROTEIN; SWP:Q8U1C6; PDB:1NNWA; VYVAVLANIAGNLPALTAALSRIEEMREEGYEIEKYYILGNIVGLFPYPKEVIEVIKDLT --------------------------1111------------------------------ KKENVKIIRGKYDQIIAMSDPHATDPGYIDKLELPGHVKKALKFTWEKLGHEGREYLRDL ---------------3333------3333---------------------------1111 PIYLVDKIGGNEVFGVYGSPINPFDGEVLAEQPTSYYEAIMRPVKDYEMLIVASPMYPVD ------------------3333----------3333----1111--------3333---- AMTRYGRVVCPGSVGFPPGKEHKATFALVDVDTLKPKFIEVEYDKKIIEERIRAEGLPEE --1111----------------------------------------------1111---- IIKILYHGGRP ----------- >PROTEIN YGIW; SWP:P52083; PDB:1NNXA; QGGFSGPSGSVTTVESAKSLRDDTWVTLRGNIVERISDDLYVFKDASGTINVDIDHKRWN ------------33331111----------------2222----1111------3333ii 
GVTVTPKDTVEIQGEVDKDWNSVEIDVKQIRKV ii--1111----------1111----------- >REPLISOME ORGANIZER; SWP:Q38151; PDB:1NO1A; IEKDVVQILKAVSEFYPGRFQPDDLKGTVKAWHRVLAEYELEEINNLTDYAKVNKFPPTV 3333-----------2222----------------11113333------3333-----33 SDLLK 33--- >HEAD MORPHOGENESIS PROTEI; SWP:P13848; PDB:1NO4A; PLKPEEHEDILNKLLDPELAQSERTEALQQLRVNYGSFVSEYNDLTKSHEKLAAEKDDLI --3333---------1111----------------------------------------- VSNSKLFRQIGLTEKQE ------------1111- >HYPOTHETICAL PROTEIN HI00; SWP:P43933; PDB:1NO5A; QLDIKSEELAIVKTILQQLVPDYTVWAFGSRVKGKAKKYSDLDLAIISEEPLDFLARDRL -------------------1111-------------1111-------------------- KEAFSESDLPWRVDLLDWATTSEDFREIIRKVYVVIQEKE ----------------3333--------3333-------- >MAJOR CAPSID PROTEIN; SWP:P06491; PDB:1NO7A; ANPYGAYVAAPAGPAADMQQLFLNAWGQRLAHGRVRWVAALELHPAFDFFVGVADVELPG ---3333------1111--------------------------1111------------- GDVPPAGPGEIQATWRVVNGNLPLALCPAAFRDARGLELGVGRHAMAPATIAAVRGAFDD -----------------3333------3333--------2222-------------1111 RNYPAVFYLLQAAIHGSEHVFCALARLVVQCITSYWNNTRCAAFVNDYSLVSYVVTYLGG ---3333------------------------------------1111------------- DLPEECMAVYRDLVAHVEALAQLVDDFTLTGPELGGQAQAELNHLMRDPALLPPLVWDCD ---3333------------33333333------%%%%3333---1111-----------3 ALMRRAALDRHRDCRVSAGGHDPVYAAACNVATADFNRNDGQLLHNTQARAADAADDRPH 333--------------iiii----------------------------3333------- RGADWTVHHKIYYYVMVPAFSRGRCCTAGVRFDRVYATLQNMVVPEIAPGEECPSDPVTD --------------------iiii-----------------------2222--------- PAHPLHPANLVANTVNAMFHNGRVVVDGPAMLTLQVLAHNMAERTTALLCSAAPDAGANT --11111111-------------------------3333--------------------- ANMRIFDGALHAGILLMAPQHLDHTIQNGDYFYPLPVHALFAGADHVANAPNFPPALRDL -------------------------------------3333--3333------3333-33 SRQVPLVPPALGANYFSSIRQPVVQHVRESAAGENALTYALMAGYFKISPVALHHQLKTG 33-----3333-1111---3333---------3333--------------------3333 LH -- >ALY; SWP:O08583; PDB:1NO8A; GKLLVSNLDFGVSDADIQELFAEFGTLKKAAVHYDRSGRSLGTADVHFERKADALKAMKQ 
--------1111------------------------------------------------ YNGVPLDGRPMNIQLVTS 2222-------------- >NEOCARZINOSTATIN; SWP:P0A3R9; PDB:1NOA; AAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYDVGQCAWVDTGVLACNPADFSSVTADAN -------------2222---------2222----------2222---3333------111 GSASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISFN 1---------------1111------1111--------1111----------- >XYLANASE; SWP:Q46961; PDB:1NOFA; DTVKIDANVNYQIIQGFGGMSGVGWINDLTTEQINTAYGSGVGQIGLSIMRVRIDPDSSK -----1111-------------------------------2222------------3333 WNIQLPSARQAVSLGAKIMATPWSPPAYMKSNNSLINGGRLLPANYSAYTSHLLDFSKYM 3333-------1111----------11111111--------3333--------------- QTNGAPLYAISIQNEPDWKPDYESCEWSGDEFKSYLKSQGSKFGSLKVIVAESLGFNPAL 1111----------1111------------------------!!!!----------3333 TDPVLKDSDASKYVSIIGGHLYGTTPKPYPLAQNAGKQLWMTEHYVDSKQSANNWTSAIE --333333331111------2222--------1111----------111111113333-- VGTELNASMVSNYSAYVWWYIRRSYGLLTEDGKVSKRGYVMSQYARFVRPGALRIQATEN --------1111----------1111--1111----------------2222-------- PQSNVHLTAYKNTDGKMVIVAVNTNDSDQMLSLNISNANVTKFEKYSTSASLNVEYGGSS -2222------1111---------------------------------1111-------- QVDSSGKATVWLNPLSVTTFVSK --1111----------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9HIA7; PDB:1NOGA; SPVVEVQGTIDELNSFIGYALVLSRWDDIRNDLFRIQNDLFVLGEDVSTGGKGRTVTREM ------------------------------------------------iiii-------- IDYLEARVKEMKAEIGKIELFVVPGGSVESASLHMARAVSRRLERRIVAASKLTEINKNV --------------------------------------------------------3333 LIYANRLSSILFMHALISNKRLNIPEKIW -------------------1111------ >INDUCIBLE NITRIC OXIDE SY; SWP:P29477; PDB:1NOS; NPKSLTRGPRDKPTPLEELLPHAIEFINQYYGSFKEAKIEEHLARLEAVTKEIETTGTYQ --------------3333-----------3333--------------------------- LTLDELIFATKMAWRNAPRCIGRIQWSNLQVFDARNCSTAQEMFQHICRHILYATNNGNI ----------------111133333333-----1111-----------------%%%%-- RSAITVFPQRSDGKHDFRLWNSQLIRYAGYQMPDGTIRGDAATLEFTQLCIDLGWKPRYG -------------------------------1111----3333-------1111------ 
RFDVLPLVLQADGQDPEVFEIPPDLVLEVTMELGLKWYALPAVANMLLEVGGLEFPACPF ----------iiii-------3333------------------------iiii------- NGWYMGTEIGVRDFCDTQDRAVTEINVAVLHSFQKQNVTIMDHHTASESFMKHMQNEYVL --------------------------3333---1111----------------------- SPFYYYQIEPWKTHIWQNEHHHH --------3333----------- >NODAMURA VIRUS COAT PROTE; SWP:P12871; PDB:1NOVA; NMLKMSAPGLDFLKCAFASPDFSTDPGKGIPDKFQGLVLPKKHCLTQSITFTPGKQTMLL -----------------1111------------------------------2222----- VAPIPGIACLKAEANVGASFSGVPLASVEFPGFDQLFGTSATDTAANVTAFRYASMAAGV --------------2222--------------3333------------------------ YPTSNLMQFAGSIQVYKIPLKQVLNSYSQTVA ----1111------------------------ >BETA-HEXOSAMINIDASE BETA ; SWP:P07686; PDB:1NOWA; ALWPLPLSVKMTPNLLHLAPENFYISHSPNSTAGPSCTLLEEAFRRYHGYIFGTQVQQLL ------------------1111-----1111--3333----------------------- VSITLQSECDAFPNISSDESYTLLVKEPVAVLKANRVWGALRGLETFSQLVYQDSYGTFT -------1111--1111-----------------------------1111---1111--- INESTIIDSPRFSHRGILIDTSRHYLPVKIILKTLDAMAFNKFNVLHWHIVDDQSFPYQS --------------------------3333--------1111---------3333----3 ITFPELSNKGSYSLSHVYTPNDVRMVIEYARLRGIRVLPEFDTPGHTLSWGKGQKDLLTP 333---1111--1111--------------1111--------------3333-2222--- CYSLDSFGPINPTLNTTYSFLTTFFKEISEVFPDQFIHLGGDEVEFKCWESNPKIQDFMR -------------3333------------------------------------------- QKGFGTDFKKLESFYIQKVLDIIATINKGSIVWQEVFDDKAKLAPGTIVEVWKDSAYPEE -----------------------1111---------1111---2222------------- LSRVTASGFPVILSAPWYLDLISYGQDWRKYYKVEPLDFGGTQKQKQLFIGGEACLWGEY ----1111-----11113333---------33331111---33331111--------111 VDATNLTPRLWPRASAVGERLWSSKDVRDMDDAYDRLTRHRCRMVERGIAAQPLYAGYCN 13333-3333-3333--------1111-----------------1111------------ >NADH OXIDASE; SWP:Q60049; PDB:1NOX; PVLDAKTAALKRRSIRRYRKDPVPEGLLREILEAALRAPSAWNLQPWRIVVVRDPATKRA ----------------------------------1111-2222----------------- LREAAFGQAHVEEAPVVLVLYADLEDALAHLDEVIHPGVQGERREAQKQAIQRAFAAMGQ ----iiii3333-----------------3333--3333!!!!-----------1111-- 
EARKAWASGQSYILLGYLLLLLEAYGLGSVPMLGFDPERVRAILGLPSRAAIPALVALGY ----------------------1111--------------------1111---------- PAEEGYPSHRLPLERVVLWR -----------1111----- >DNA POLYMERASE; SWP:P04415; PDB:1NOYA; DEFYISIETVGNNIVERYIDENGKERTREVEYLPTMFRHCKEESKYKDIYGKNCAPQKFP -------------------------------------3333------3333--------- SMKDARDWMKRMEDIGLEALGMNDFKLAYISDTYGSEIVYDRKFVRVANCDIEVTGDKFP ------------------3333------------------3333---------------- DPMKAEYEIDAITHYDSIDDRFYVFDLLNSMYGSVSKWDAKLAAKLDCEGGDEVPQEILD 3333-------------------------1111-----3333---3333-----333311 RVIYMPFDNERDMLMEYINLWEQKRPAIFTGWNIEGFDVPYIMNRVKMILGERSMKRFSP 11------------------------------3333--------------3333------ IGRVKSKLLQNMYGSKEIYSIDGVSILDYLDLYKKFAFTNLPSFSLESVAQHETKKGKLP ---------!!!!----------------------------------------------- YDGPINKLRETNHQRYISYNIIDVESVQAIDKIRGFIDLVLSMSYYAKMPFSGVMSPIKT ---1111-------------------------------------3333-3333------- WDAIIFNSLKGE -----3333--- >KETOL-ACID REDUCTOISOMERA; SWP:Q9HVA2; PDB:1NP3A; MRVFYDKDCDLSIIQGKKVAIIGYGSQGHAHACNLKDSGVDVTVGLRSGSATVAKAEAHG ----3333-----1111--------------------------------3333--3333- LKVADVKTAVAAADVVMILTPDEFQGRLYKEEIEPNLKKGATLAFAHGFSIHYNQVVPRA ---------1111-------3333--------3333-2222-----------------33 DLDVIMIAPKAPGHTVRSEFVKGGGIPDLIAIYQDASGNAKNVALSYACGVGGGRTGIIE 33-----------------1111-----------3333-------------3333----- TTFKDETETDLFGEQAVLCGGCVELVKAGFETLVEAGYAPEMAYFECLHELKLIVDLMYE ---------------------------------1111----------------------- GGIANMNYSISNNAEYGEYVTGPEVINAESRAAMRNALKRIQDGEYAKMFITEGAANYPS ------3333----------3333-----------------------------1111--- MTAYRRNNAAHPIEQIGEKLRAMMPWI -----------------------1111 >Molybdopterin-guanine din; SWP:P32125; PDB:1NP6A; MIPLLAFAAWSGTGKTTLLKKLIPALCARGIRPGLIKHTHHELRKAGAAQTIVASQQRWA ---------2222-------------1111-----------3333---------3333-- LMTETPDEEELDLQFLASRMDTSKLDLILVEGFKHEEIAKIVLFRDGAGHRPEELVIDRH ----1111--------33333333--------------------1111--3333---111 
VIAVASDVPLNLDVALLDINDVEGLADFVVEWMQKQNG 1----------------1111----------------- >DNA PHOTOLYASE; SWP:P77967; PDB:1NP7A; MKHVPPTVLVWFRNDLRLHDHEPLHRALKSGLAITAVYCYDPRQFAQTHQGFAKTGPWRS ----------------------------------------3333---1111--------- NFLQQSVQNLAESLQKVGNKLLVTTGLPEQVIPQIAKQINAKTIYYHREVTQEELDVERN --------------1111--------3333------------------------------ LVKQLTILGIEAKGYWGSTLCHPEDLPFSIQDLPDLFTKFRKDIEKKKISIRPCFFAPSQ -----1111------------1111---3333---------------------------- LLPSPNIKLELTAPPPEFFPQINFDHRSVLAFQGGETAGLARLQDYFWHGDRLKDYKETR --------------3333------1111-------------------11113333---11 NGMVGADYSSKFSPWLALGCLSPRFIYQEVKRYEQERVSNDSTHWLIFELLWRDFFRFVA 11---------------------------------------------------------- QKYGNKLFNRGGLLNKNFPWQEDQVRFELWRSGQTGYPLVDANMRELNLTGFMSNRGRQN --!!!!--1111-----------------1111---3333-------------------- VASFLCKNLGIDWRWGAEWFESCLIDYDVCSNWGNWNYTAGIGNDARDFRYFNIPKQSQQ -----------------------11113333----------------------------- YDPQGTYLRHWLPELKNLPGDKIHQPWLLSATEQKQWGVQLGVDYPRPCVNFHQSVEARR -1111------3333-----33333333------1111-2222----------------- KIE --- >CALCIUM-DEPENDENT PROTEAS; SWP:Q64537; PDB:1NP8A; SEEERQFRKLFVQLAGDDMEVSATELMNILNKVVTRHPDLKTDGFGIDTCRSMVAVMDSD -3333----------1111-----------------1111-----------------111 TTGKLGFEEFKYLWNNIKKWQGIYKRFDTDRSGTIGSNELPGAFEAAGFHLNQHIYSMII 1--------------------------1111----1111-----1111------------ RRYSDETGNMDFDNFISCLVRLDAMFRAF ----1111--------------------- >FOSFOMYCIN-RESISTANCE PRO; SWP:Q56415; PDB:1NPBA; MLQSLNHLTLAVSDLQKSVTFWHELLGLTLHARWNTGAYLTCGDLWVCLSYDEARQYVPP -----------------------------------------!!!!------1111---11 QESDYTHYAFTVAEEDFEPLSQRLEQAGVTIWKQNKSEGASFYFLDPDGHKLELHVGSLA 11----------3333-----------------------------1111----------- ARLAACREKPYAGMVFTSDE ----------2222------ >NEUTRAL PROTEASE; SWP:P05806; PDB:1NPC; VTGTNKVGTGKGVLGDTKSLNTTLSGSSYYLQDNTRGATIFTYDAKNRSTLPGTLWADAD -----------1111---------!!!!----------------%%%%------------ 
NVFNAAYDAAAVDAHYYAGKTYDYYKATFNRNSINDAGAPLKSTVHYGSNYNNAFWNGSQ ----3333------------------------1111------------------------ MVYGDGDGVTFTSLSGGIDVIGHELTHAVTENSSNLIYQNESGALNEAISDIFGTLVEFY ------------1111-------------------------------------------- DNRNPDWEIGEDIYTPGKAGDALRSMSDPTKYGDPDHYSKRYTGSSDNGGVHTNSGIINK ---------1111---------------3333----3333-----%%%%----------- QAYLLANGGTHYGVTVTGIGKDKLGAIYYRANTQYFTQSTTFSQARAGAVQAAADLYGAN ----------iiii----------------------1111-----------------111 SAEVAAVKQSFSAVGVN 1----------1111-- >NIDOGEN; SWP:P10493; PDB:1NPEA; GTHLLFAQTGKIERLPLERNTMKKTEAKAFLHIPAKVIIGLAFDCVDKVVYWTDISEPSI -------2222-----------3333------1111------------------------ GRASLHGGEPTTIIRQDLGSPEGIALDHLGRTIFWTDSQLDRIEVAKMDGTQRRVLFDTG ----------------------------------------------1111---------- LVNPRGIVTDPVRGNLYWTDWNRDNPKIETSHMDGTNRRILAQDNLGLPNGLTFDAFSSQ ----------1111-------3333------1111--------------------1111- LCWVDAGTHRAECLNPAQPGRRKVLEGLQYPFAVTSYGKNLYYTDWKTNSVIAMDLAISK --------------3333------------------!!!!-------------------- EMDTFHPHKQTRLYGITIALSQC ----------------------- >GELSOLIN; SWP:P13020; PDB:1NPHA; DDGTGQKQIWRIEGSNKVPVDPATYGQFYGGDSYIILYNYGQIIYNWQGAQSTQDEVAAS --------------------1111----1111--------------------3333---- AILTAQLDEELGGTPVQSRVVQGKEPAHLMSLFGGKPMIIYKGGTSRDGGQTAPASIRLF --------------------2222-3333------------------------------- QVRASSSGATRAVEVMPKSGALNSNDAFVLKTPSAAYLWVGAGASEAEKTAAQELLKVLR ----3333---------3333-1111--------------1111---------------- SQHVQVEEGSEPDGFWEALGGKTSYRTSPRLKDKKMDAHPPRLFACSNRIGRFVIEEVPG -----------11113333--------3333---3333---------------------- ELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEKTEALTSAKRYIETDPANRDRRTPITV --3333-1111--------------1111------------------3333--------- VRQGFEPPSFVGWFLGWDNNYWS -2222-33331111-----1111 >TOXIN VII; SWP:P15226; PDB:1NPIA; KEGYLMDHEGCKLSCFIRPSGYCGRECGIKKGSSGYCAWPACYCYGLPNWVKVWDRATNK ------1111--------2222-----1111----------------1111---3333-- C - >NUCLEOSIDE DIPHOSPHATE KI; SWP:P22887; PDB:1NPK; 
VNKERTFLAVKPDGVARGLVGEIIARYEKKGFVLVGLKQLVPTKDLAESHYAEHKERPFF --------------1111--------------------------------3333--1111 GGLVSFITSGPVVAMVFEGKGVVASARLMIGVTNPLASAPGSIRGDFGVDVGRNIIHGSD -----1111---------2222-----------3333------------1111------- SVESANREIALWFKPEELLTEVKPNPNLYE -------------3333-------1111-- >AGGLUTININ; SWP:NA; PDB:1NPLA; DNILYSGETLSPGEFLNNGRYVFIMQEDCNLVLYDVDKPIWATNTGGLDRRCHLSMQSDG ----2222--2222---!!!!----1111-----!!!!------2222--------1111 NLVVYSPRNNPIWASNTGGENGNYVCVLQKDRNVVIYGTARWATGTNIH -----1111-------------------1111----------------- >NEUROPSIN; SWP:Q61955; PDB:1NPMA; ILEGRECIPHSQPWQAALFQGERLICGGVLVGDRWVLTAAHCKKQKYSVRLGDHSLQSQP -------22221111-----------------------1111------------------ EQEIQVAQSIQHPCYNNSNP -----------1111----- >DEVELOPMENT-SPECIFIC PROT; SWP:P02966; PDB:1NPSA; ANITVFYNEDFQGKQVDLPPGNYTRAQLAALGIENNTISSVKVPPGVKAILYQNDGFAGD --------%%%%----------------1111-----------2222------------- QIEVVANAEELGPLNNNVSSIRVISVPV ----------!!!!-------------- >PROGRAMMED CELL DEATH PRO; SWP:Q02242; PDB:1NPUA; SLTFYPAWLTVSEGANATFTCSLSNWSEDLMLNWNRLSPSNQTEKQAAFSNGLSQPVQDA -----------2222-----------1111-------1111--------iiii-----33 RFQIIQLPNRHDFHMNILDTRRNDSGIYLCGAISLHPKAKIEESPGAELVVTERIL 33----1111----------1111-------------------------------- >Hypothetical shikimate 5-; SWP:P44774; PDB:1NPYA; MINKDTQLCMSLSGRPSNFGTTFHNYLYDKLGLNFIYKAFTTQDIEHAIKGVRALGIRGC --1111------------------------------------------------------ AVSMPFKETCMPFLDEIHPSAQAIESVNTIVNDNGFLRAYNTDYIAIVKLIEKYHLNKNA -------3333-------3333----------iiii--------------------1111 KVIVHGSGGMAKAVVAAFKNSGFEKLKIYARNVKTGQYLAALYGYAYINSLENQQADILV -------!!!!-------1111----------------------------2222------ NVTSIGMKGGKEEMDLAFPKAFIDNASVAFDVVAMPVETPFIRYAQARGKQTISGAAVIV ---2222--1111---------1111-------------------1111----------- LQAVEQFELYTHQRPSDELIAEAAAFART ----------------------------- >14.3 KDA PERCHLORIC ACID ; SWP:P80601; PDB:1NQ3A; SLVRRIISTAKAPAAIGPYSQAVLVDRTIYISGQLGMDPASGQLVPGGVVEEAKQALTNI 
--------1111------------------------------------------------ GEILKAAGCDFTNVVKATVLLADINDFSAVNDVYKQYFQSSFPARAAYQVAALPKGGRVE ----1111-1111---------1111--------1111--------------2222---- IEAIAVQGPLTTA ------------- >Oxytetracycline polyketid; SWP:P43677; PDB:1NQ4A; MTLLTLSDLLTLLRECAGEEESIDLGGDVEDVAFDALGYDSLALLNTVGRIERDYGVQLG ----------------------3333------3333------------------------ DDAVEKATTPRALIEMTNASLTGASPSAGGAARDK -3333---3333----------------------- >XYS1; SWP:Q59922; PDB:1NQ6A; AGALGDAAAAKGRYFGAAVAANHLGEAAYASTLDAQFGSVTPENEMKWDAVESSRNSFSF --------1111-------1111--------------------11113333--2222--3 SAADRIVSHAQSKGMKVRGHTLVWHSQLPGWVSPLAATDLRSAMNNHITQVMTHYKGKIH 333-------1111--------------1111----------------------2222-- SWDVVNEAFQDGGSGARRSSPFQDKLGNGFIEEAFRTARTVDADAKLCYNDYNTDGQNAK --------------------------1111-----------1111--------------- SNAVYEMVKDFKQRGVPIDCVGFQSHFNSNSPVPSDFQANLQRFADLGVDVQITELDIEG ---------------------------1111--1111-------1111------------ SGSAQAANYTKVVNACLAVTRCTGITVWGVTDKYSWRSGGTPLLFDGDYNKKPAYDAVLA ------------------1111--------3333--3333-----1111----------- AL -- >NUCLEAR RECEPTOR ROR-BETA; SWP:P45446; PDB:1NQ7A; TMSEIDRIAQNIIKSHLETCQYTMEELHQLAWQTHTYEEIKAYQSKSREALWQQCAIQIT -----------------------------%%%%---------1111-------------- HAIQYVVEFAKRITGFMELCQNDQILLLKSGCLEVVLVRMCRAFNPLNNTVLFEGKYGGM -------------3333--------------------3333-----------iiii---- QMFKALGSDDLVNEAFDFAKNLCSLQLTEEEIALFSSAVLISPDRAWLLEPRKVQKLQEK --3333----------------1111---------------1111----3333------- IYFALQHVIQKNHLDDETLAKLIAKIPTITAVCNLHGEKLQVFKQSHPDIVNTLFPPLYK ------------------------------------------------------------ ELFN ---- >CLASS 1 COLLAGENASE; SWP:Q9X721; PDB:1NQJA; KATVIPNFNTTMQGSLLGDDSRDYYSFEVKEEGEVNIELDKKDEFGVTWTLHPESDRITY ----------------!!!!---------------------------------------- GQVDGNKVSNKVKLRPGKYYLLVYKYSGSGNYELRVNK ---!!!!------------------------------- >ALKANESULFONATE MONOOXYGE; SWP:P80645; PDB:1NQKA; 
MSLNMFWFLPTHGDGHYLGTEEGSRPVDHGYLQQIAQAADRLGYTGVLIPTGRSCEDAWL --------------------2222---------------------------1111----- VAASMIPVTQRLKFLVALRPSVTSPTVAARQAATLDRLSNGRALFNLVTGSDPQELAGDG ----3333-----------3333---------------------------------1111 VFLDHSERYEASAEFTQVWRRLLQRETVDFNGKHIHVRGAKLLFPAIQQPYPPLYFGGSS ---------------------1111----------------------------------- DVAQELAAEQVDLYLTWGEPPELVKEKIEQVRAKAAAHGRKIRFGIRLHVIVRETNDEAW -----------------------------------1111--------------------- QAAERLISHLDDETIAKAQAAFARDNLEISPNLWAGVGLVRGGAGTALVGDGPTVAARIN ----1111----------------3333-2222-1111---------------------- EYAALGIDSFVLSGYPHLEEAYRVGELLFPLLDVAIPEIPQPQPL --1111---------3333--------3333-------------- >Pro-epidermal growth fact; SWP:P01133; PDB:1NQLB; DSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWW -----1111---iiii---------------2222-1111-------- >COA PYROPHOSPHATASE (MUTT; SWP:Q9RV46; PDB:1NQZA; PHDPLDDIQADPWALWLSGYRRAAVLVALTREADPRVLLTVRSKGQIAFPGGSLDAGETP --3333----1111----------------------------------------2222-- TQAALREAQEEVALDPAAVTLLGELDDVFTPVGFHVTPVLGRIAPEALDTLRVTPEVAQI --------------3333-----------3333----------33331111--3333--- ITPTLAELRAVPLVRERRTLPDGTEVPLYRYPWRGLDIWGMTARVLHDLLE -------------------1111---------iiii--------------- >ACTIN INTERACTING PROTEIN; SWP:Q11176; PDB:1NR0A; SEFSQTALFPSLPRTARGTAVVLGNTPAGDKIQYCNGTSVYTVPVGSLTDTEIYTEHSHQ ---------------2222------1111------!!!!--------------------- TTVAKTSPSGYYCASGDVHGNVRIWDTTQTTHILKTTIPVFSGPVKDISWDSESKRIAAV ------3333------1111--------3333------------------1111------ GEGRERFGHVFLFDTGTSNGNLTGQARAMNSVDFKPSRPFRIISGSDDNTVAIFEGPPFK --1111---------------------------------------1111----------- FKSTFGEHTKFVHSVRYNPDGSLFASTGGDGTIVLYNGVDGTKTGVFEDDSLKNVAHSGS -----------------1111---------------------------1111-------- VFGLTWSPDGTKIASASADKTIKIWNVATLKVEKTIPVGTRIEDQQLGIIWTKQALVSIS ------1111------1111------1111----------1111---------------1 ANGFINFVNPELGSIDQVRYGHNKAITALSSSADGKTLFSADAEGHINSWDISTGISNRV 
111----------------------------1111------1111--------------- FPDVHATMITGIKTTSKGDLFTVSWDDHLKVVPAGGSGVDSSKAVANKLSSQPLGLAVSA --------------1111-----1111-------------------------------11 DGDIAVAACYKHIAIYSHGKLTEVPISYNSSCVALSNDKQFVAVGGQDSKVHVYKLSGAS 11---------------------------------1111--------------------- VSEVKTIVHPAEITSVAFSNNGAFLVATDQSRKVIPYSVANNFELAHTNSWTFHTAKVAC ------------------1111------1111--------%%%%---------------- VSWSPDNVRLATGSLDNSVIVWNMNKPSDHPIIIKGAHAMSSVNSVIWLNETTIVSAGQD ---1111------1111-----1111-----------2222----------------111 SNIKFWNVPF 1--------- >DNA-BINDING PROTEIN TFX; SWP:O27001; PDB:1NR3A; MRERGWSQKKIARELKTTRQNVSAIERKAMENIEKSRNTLDFVKSLKSPVRILCRRGDTL ----------3333---------3333---------------3333---------3333- DEIIKRLLEESNKEGIHVIHDSITLAFLIREKASHRIVHRVVKSDFEIGVTRDGEIIVDL 3333------3333---------------------------------------------- NS -- >THYMUS AND ACTIVATION-REG; SWP:Q92583; PDB:1NR4A; RGTNVGRECCLEYFKGAIPLRKLKTWYQTSEDCSRDAIVFVTVQGRAICSDPNNKRVKNA ---2222-----------3333-------1111--------1111-----1111------ VKYLQSL ---3333 >CYTOCHROME P450 2C5; SWP:P00179; PDB:1NR6A; GKLPPGPTPFPIIGNILQIDAKDISKSLTKFSECYGPVFTVYLGMKPTVVLHGYEAVKEA --------------3333-1111------------------------------------- LVDLGEEFAGRGSVPILEKVSKGLGIAFSNAKTWKEMRRFSLMTLRNFGMGKRSIEDRIQ -----1111-----------iiii-1111----------------1111----------- EEARCLVEELRKTNASPCDPTFILGCAPCNVICSVIFHNRFDYKDEEFLKLMESLHENVE ------------%%%%---3333------------------------------------- LLGTPWLQVYNNFPALLDYFPGIHKTLLKNADYIKNFIMEKVKEHQKLLDVNNPRDFIDC ------3333--333333333333--------------------3333-1111------- FLIKMEQENNLEFTLESLVIAVSDLFGAGTETTSTTLRYSLLLLLKHPEVAARVQEEIER ---------1111----------------------------------------------- VIGRHRSPCMQDRSRMPYTDAVIHEIQRFIDLLPTNLPHAVTRDVRFRNYFIPKGTDIIT ----------3333------------------1111----------!!!!--2222---- SLTSVLHDEKAFPNPKVFDPGHFLDESGNFKKSDYFMPFSAGKRMCVGEGLARMELFLFL 33331111-----1111-------1111----11111111-11111111----------- 
TSILQNFKLQSLVEPKDLDITAVVNGFVSVPPSYQLCFIPIH -------------3333------------------------- >PROTEIN YCGM; SWP:P76004; PDB:1NR9A; HYQHHNWQGALLDYPVSKVVCVGSNYAPEEPVLFIKPETALCDLRQPLAIPSDFGSVHHE -----3333---------------------------3333--1111-------------- VELAVLIGATLRQATEEHVRKAIAGYGVALDLTLRDVQGKKKAGQPWEKAKAFDNSCPLS --------------3333----------------------1111--3333---------- GFIPAAEFTGDPQNTTLSLSVNGEQRQQGTTADIHKIVPLIAYSKFFTLKAGDVVLTGTP ----------1111------iiii-----3333----------------2222------- DGVGPLQSGDELTVTFDGHSLTTRVLG ------2222----------------- >NEUROTOXIN V, CSE-V; SWP:P46066; PDB:1NRA; KKDGYPVDSGNCKYECLKDDYCNDLCLERKADKGYCYWGKVSCYCYGLPDNSPTKTSGKC -------1111-------3333---------------%%%%------------------- NPA --- >REGULATORY PROTEIN BLAR1; SWP:P12287; PDB:1NRFA; FLPGTNVEYEDYSTFFDKFSASGGFVLFNSNRKKYTIYNRKESTSRFAPASTYKVFSALL -----------3333--------------1111-----3333------!!!!-------- ALESGIITKNDSHMTWDGTQYPYKEWNQDQDLFSAMSSSTTWYFQKLDRQIGEDHLRHYL -1111--3333-----------3333----3333--------------3333-------- KSIHYGNEDFSVPADYWLDGSLQISPLEQVNILKKFYDNEFDFKQSNIETVKDSIRLEES ----!!!!---3333--------------------3333----3333-----1111---- NGRVLSGKTGTSVINGELHAGWFIGYVETADNTFFFAVHIQGEKRAAGSSAAEIALSILD -----------------------------------------------------------1 KKGIYP 111--- >PYRIDOXINE 5'-PHOSPHATE O; SWP:Q9NVS9; PDB:1NRGA; EETHLTSLDPVKQFAAWFEEAVQCPDIGEANAMCLATCTRDGKPSARMLLLKGFGKDGFR -----------------------1111-1111------1111------------3333-- FFTNFESRKGKELDSNPFASLVFYWEPLNRQVRVEGPVKKLPEEEAECYFHSRPKSSQIG ---1111-----------------3333---------------------11113333--- AVVSHQSSVIPDREYLRKKNEELEQLYQDQEVPKPKSWGGYVLYPQVMEFWQGQTNRLHD ----2222---3333-----------2222----1111---------------1111--- RIVFRRGLPTGDSPLGPMTHRGEEDWLYERLAP ---------------1111---!!!!------- >HYPOTHETICAL PROTEIN HI07; SWP:P44862; PDB:1NRIA; LSTLITEQRNPNSVDIDRQSTLEIVRLNEEDKLVPLAIESCLPQISLAVEQIVQAFQQGG 11111111-1111-1111-------------------------------------1111- RLIYIGAGTSGRLGVLDASECPPTFGVSTEVKGIIAGGECAIRHPVEGAEDNTKAVLNDL 
----------------------------------22223333---2222----------- QSIHFSKNDVLVGIAASGRTPYVIAGLQYAKSLGALTISIASNPKSEAEIADIAIETIVG 1111-1111-----3333------------------------------------------ PEILTGSSRLKSGTAQKVLNLTTASILLGKCYENLVDVQASNEKLKARAVRIV ---2222------------------1111------------------------ >Signal recognition partic; SWP:P32916; PDB:1NRJA; MFDQLAVFTPQGQVLYQYNCLGKKFSEIQINSFISQLITSPVTRKESVANANTDGFDFNL --------1111------1111---3333-------------3333---3333------- LTINFNALFYLNKQPELYFVVTFAEQTLELNQETQQTLALVLKLWNSLHLSESILKNRQG --------------------------------------------3333-------1111- QNEKNKHNYVDILQGIEDDLKKFEQYF 11111111--1111------------- >Signal recognition partic; SWP:P36057; PDB:1NRJB; SYQPSIIIAGPQNSGKTSLLTLLTTDSVRPTVVSQEPLSAADYDGSGVTLVDFPGHVKLR ----------2222----------------------------%%%%---------1111- YKLSDYLKTRAKFVKGLIFMVDSTVDPKKLTTTAEFLVDILSITESSCENGIDILIACNK ---------3333--------11111111------------------1111--------1 SELFTARPPSKIKDALESEIQKVIERRKKSLNELDVLGFKFANLEASVVAFEGSINKRKI 111------------------------------3333--3333-----------1111-- SQWREWIDEKL ----------- >ORPHAN NUCLEAR RECEPTOR P; SWP:O75469; PDB:1NRLA; GLTEEQRMMIRELMDAQMKTFDTTFSHFKNFRLPGVSREEAAKWSQVRKDLCSLKVSLQL ---------------------1111-------------------------1111------ RGEDGSVWNYKPPADSGGKEIFSLLPHMADMSTYMFKGIISFAKVISYFRDLPIEDQISL -1111--------------1111----------------------3333----------- LKGAAFELCQLRFNTVFNAETGTWECGRLSYCLEDTAGGFQQLLLEPMLKFHYMLKKLQL -----------3333----------!!!!--------------------------1111- HEEEYVLMQAISLFSPDRPGVLQHRVVDQLQEQFAITLKSYIECNRPQPAHRFLFLKIMA -----------------------------------------------3333--------- MLTELRSINAQHTQRLLRIQDIHPFATPLMQELFGITG -------------------------------------- >GROWTH FACTOR RECEPTOR-BO; SWP:Q13322; PDB:1NRVA; IHRTQHWFHGRISREESHRIIKQQGLVDGLFLLRDSQSNPKAFVLTLCHHQKIKNFQILP -1111---!!!!---------1111-2222--------1111------%%%%-------- CTFFSLDDGNTKFSDLIQLVDFYQLNKGVLPCKLKHHCIR ------iiii----------------!!!!---------- >hypothetical protein, hal; SWP:P94592; 
PDB:1NRWA; KLIAIDLDGTLLNSKHQVSLENENALRQAQRDGIEVVVSTGRAHFDVSIFEPLGIKTWVI ------2222--1111--------------------------3333---3333------- SANGAVIHDPEGRLYHHETIDKKRAYDILSWLESENYYYEVFTGSAIYTPQNGRELLDVE -iiii---1111--------------------1111------3333-------------- LDRFRSANPEADLSVLKQAAEVQYSQSGFAYINSFQELFEADEPIDFYNILGFSFFKEKL -------1111------------1111------3333----------------------- EAGWKRYEHAEDLTLVSSAEHNFELSSRKASKGQALKRLAKQLNIPLEETAAVGDSLNDK ------1111--------1111----1111----------1111-3333------1111- SLEAAGKGVAGNAREDIKSIADAVTLTNDEHGVAHKHLL -3333---------------------3333-3333---- >PTS SYSTEM, SORBOSE-SPECI; SWP:P37081; PDB:1NRZA; MQITLARIDDRLIHGQVTTVWSKVANAQRIIICNDDVFNDEVRRTLLRQAAPPGMKVNVV --------1111-----------------------3333--------11112222----- SLEKAVAVYHNPQYQDETVFYLFTNPHDVLTMVRQGVQIATLNIGGMAWRPGKKQLTKAV ----------3333------------------1111-------------2222---2222 SLDPQDIQAFRELDKLGVKLDLRVVASDPSVNILDKINETAFC -------------1111-------1111--------3333--- >HYPOTHETICAL PROTEIN YBEA; SWP:P05850; PDB:1NS5A; KLQLVAVGTKPDWVQTGFTEYLRRFPKDPFELIEIPAGKRGKNADIKRILDKEGEQLAAA --------------------3333----------------1111-------------333 GKNRIVTLDIPGKPWDTPQLAAELERWKLDGRDVSLLIGGPEGLSPACKAAAEQSWSLSA 3-------1111--------------3333---------1111-----1111-------- LTLPHPLVRVLVAESLYRAWSITTNHPYH ----------------------------- >PROCARBOXYPEPTIDASE B; SWP:P09955; PDB:1NSA; FEGEKVFRVNVEDENDISELHELASTRQIDFWKPDSVTQIKPHSTVDFRVKAEDILAVED 2222-----------------3333----------1111-----------3333------ FLEQNELQYEVLINNLRSVLEAQFDSVSR --1111----------------------- >NEURAMINIDASE; SWP:P27907; PDB:1NSCA; EPEWTYPRLSCQGSTFQKALLISPHRFGEARGNSAPLIIREPFIACGPKECKHFALTHYA ----------------------------1111--------------1111---------- AQPGGYYNGTREDRNKLRHLISVKLGKIPTVENSIFHMAAWSGSACHDGREWTYIGVDGP ------2222-------------2222--3333--------------------------1 DSNALIKIKYGEAYTDTYHSYANNILRTQESACNCIGGDCYLMITDGSASGISKCRFLKI 111------!!!!----------------------iiii--------1111--------- 
REGRIIKEIFPTGRVEHTEECTCGFASNKTIECACRDNSYTAKRPFVKLNVETDTAEIRL iiii----------------------1111------------------------------ MCTETYLDTPRPDDGSITGPCESNGDKGRGGIKGGFVHQRMASKIGRWYSRTMSKTERMG ------------2222---1111-----------------1111---------------- MELYVRYDGDPWTDSDALAHSGVMVSMKEPGWYSFGFEIKDKKCDVPCIGIEMVHDGGKK ---------3333------------1111----------------------------111 TWHSAATAIYCLMGSGQLLWDTVTGVDMAL 1-------------------------3333 >CALGIZZARIN; SWP:P24480; PDB:1NSHA; SRPTETERCIESLIAVFQKYAGKDGHSVTLSKTEFLSFMNTELAAFTKNQKDPGVLDRMM ----------------3333----------3333--------33332222-----3333- KKLDLNSDGQLDFQEFLNLIGGLAVACHESFVKAAPPQKRF 3333--iiii------------------------1111--- >PHOSPHORIBOSYL ANTHRANILA; SWP:Q56320; PDB:1NSJ; MVRVKICGITNLEDALFSVESGADAVGFVFYPKSKRYISPEDARRISVELPPFVFRVGVF ------------------1111--------1111-----------3333----------- VNEEPEKILDVASYVQLNAVQLHGEEPIELCRKIAERILVIKAVGVSNERDMERALNYRE ---3333-----1111-----------------3333----------3333----1111- FPILLDTKTPEYGGSGKTFDWSLILPYRDRFRYLVLSGGLNPENVRSAIDVVRPFAVDVS -------------------33333333-------------3333---------------3 SGVEAFPGKKDHDSIKMFIKNAKGL 333--2222---------------- >PROBABLE ACETYLTRANSFERAS; SWP:P96579; PDB:1NSLA; GFTCKVNEHITIRLLEPKDAERLAELIIQNQQRLGKWLFFSSADTYRETIIPDWRRQYAD ---------------3333---------------1111---3333--------------- LNGIEAGLLYDGSLCGISLHNLDQVNRKAEIGYWIAKEFEGKGIITAACRKLITYAFEEL -----------------------1111--------------------------------- ELNRVAICAAVGNEKSRAVPERIGFLEEGKARDGLYVNGHHDLVYYSLLKREW ---------1111-----------------2222--------------3333- >Thermonuclease [Precursor; SWP:P00644; PDB:1NSNH; DVQLQESGPGLVKPSQSLSLTCTVTGYSITSDYAWNWIRQFPGNKLEWMGYITYSGTTSY ----------------------------------------------------3333---- NPSLKSRISISRDTSKNQFFMQLNSVTTEDTGTFYCTRGNGDWGQGTTLTVSSAKTTPPS ----2222------------------1111------------------------------ VYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSS ----------------------------------iiii---------------------- VTVPSSPRPSETVTCNVAHPASSTKVDKKI 
------------------------------ >Igk protein; SWP:Q66JS7; PDB:1NSNL; DIVLTQSPSSLAVSLGQRATISCRASQSVSTSSFRYMHWYQQKPGQPPRLLIKYASNLES ------------------------------------------2222------------22 GVPARFSGSGSGTDFTLNIHPVEEEDTATYYCQHSWEIPYTFGGGTKLEIKRADAAPTVS 22--------------------1111---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----3333---------------------------------------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNE ---------3333------------------------ >PROTEASE; SWP:P04024; PDB:1NSOA; WVQPITAQKPSLTLWLDDKMFTGLINTGADVTIIKLEDWPPNWPITDTLTNLRGIGQSNN ----------------------------------3333-3333----------------- PKQSSKYLTWRDKENNSGLIKPFVIPNLPVNLWGRDLLSQMKIMMAS -----------1111------------------3333---------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P08879; PDB:1NSQA; AANKERTFIMVKPDGVQRGLVGKIIERFEQKGFKLVALKFTWASKELLEKHYADLSARPF 1111------------------------3333-------------------3333--111 FPGLVNYMNSGPVVPMVWEGLNVVKTGRQMLGATNPADSLPGTIRGDFCIQVGRNIIGSD 1-----1111---------2222-----------3333-2222-------1111------ AVESAEKEIALWFNEKELVTWTPAAKDWIYE -------------3333-----1111----- >HEPARAN SULFATE N-DEACETY; SWP:P52848; PDB:1NSTA; DPLWQDPCCDRFPKLLIIGPQKTGTTALYLFLGMHPDLSSNYPSSETFEEIQFFNGHNYH ----------------------------------1111--------!!!!-----3333- KGIDWYMEFFPISDFYFEKSANYFDSEVAPRRAAALLPKAKVLTILINPADRAYSWYQHQ -------------------1111-------------1111-------------------- RAHDDPVALKYTFHEVITAGSDASSKLRALQNRCLVPGWYATHIERWLSAYHANQILVLD 1111-3333----------1111-----------3333--------1111-1111----- GKLLRTEPAKVMDMVQKFLGVTNTIDYHKTLAFDPKKGFWCQLLEGGKTKCLGKSKGRKY ------3333------1111-----3333-----3333--------------1111---- PEMDLDSRAFLKDYYRDHNIELSKLLYKMGQTLPTWLREDLQ --------------------------1111---3333----- >GALACTOSE MUTAROTASE; SWP:Q9ZB17; PDB:1NSZA; SIKIRDFGLGSDLISLTNKAGVTISFTNLGARIVDWQKDGKHLILGFDSAKEYLEKDAYP -------iiii------1111------2222------iiii-------3333------22 GATVGPTAGRIKDGLVKISGKDYILNQNEGPQTLHGGEESIHTKLWTYEVTDLGAEVQVK 
22---------------iiii-------!!!!-iiii--3333----------------- FSLVSNDGTNGYPGKIEMSVTHSFDDDNKWKIHYEAISDKDTVFNPTGNVYFNLNGDASE -----2222---------------1111------------------------33331111 SVENHGLRLAASRFVPLKDQTEIVRGDIVDIKNTDLDFRQEKQLSNAFNSNMEQVQLVKG -1111-------------1111-----------1111-----33331111---------- IDHPFLLDQLGLDKEQARLTLDDTSISVFTDQPSIVIFTANFGDLGTLYHEKKQVHHGGI ----------3333------!!!!-----------------!!!!---iiii--2222-- TFECQVSPGSEQIPELGDISLKAGEKYQATTIYSLHTKLE --------33333333-----2222--------------- >mannose-binding protein a; SWP:Q9JJS8; PDB:1NT0A; EPVFGRLVSPGFPEKYGNHQDRSWTLTAPPGFRLRLYFTHFNLELSYRCEYDFVKLTSGT --------2222----------------2222-------------2222--------!!! KVLATLCGQESTDTERAPGNDTFYSLGPSLKVTFHSDYPFTGFEAFYAAEDVDECRPCDH !----------------------------------------------------------- YCHYLGGYYCSCRVGYILHQNKHTCSALCSGQVFTGRSGFLSSPEYPQPYPKLSSCAYNI ---2222-----2222--1111-------------------------------------- RLEEGFSITLDFVESFDVEMHCDSLKIQTDKREYGPFCGKTLPPRIETDSNKVTITFTTD ------------------------------------------------------------ ESGNHTGWKIHYTSTA ---------------- >FIBRILLARIN-LIKE PRE-RRNA; SWP:O28192; PDB:1NT2A; KELMRNVYLLDDTLVTKSKYGSHYGEKVFDGYREWVPWRSKLAAMILKGHRLKLRGDERV -----------------------------------1111--------------------- LYLGAASGTTVSHLADIVDEGIIYAVEYSAKPFEKLLELVRERNNIIPLLFDASKPWKYS ----!!!!----------------------3333--3333-----------33333333- GIVEKVDLIYQDIAQKNQIEILKANAEFFLKEKGEVVIMVKARSIDSTAEPEEVFKSVLK --------------1111-----------------------3333--------------- EMEGDFKIVKHGSLMPYHRDHIFIHAYRF ----------------------------- >Putative uncharacterized ; SWP:O28191; PDB:1NT2B; LRYNLWFGVYDGKEIKLSENFEESFLKAENPSPLPFNVSEVGAKALGKDYYRILRKTALA ----1111---------------1111--------------------------------- VSEKMVEKELRREDRYVVALVKALEEIDESINMLNEKLEDIRAVKESEITEKFEKKIREL ------3333-----------------------------3333----------------- RELRRDVEREIEEVMEKIAPNMTELVGAKVAAKLLERAGSMERLVRLPASKIQVIGAEHG -------------3333----3333---------------3333---------------- 
IIFLHPFIRTLPKAKRGKMARFLAAKLAIAAKIDYFRGEIDESLYESIRRRYEELR 33333333------3333----------------------3333------------ >GLUCOSE-1-PHOSPHATASE; SWP:P19926; PDB:1NT4A; QTVPEGYQLQQVLMMSRANLRAPLANNGSVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYM ---2222----------------3333--------------------------------- GHYMREWLAEQGMVKSGECPPPYTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKM --------1111------------------------------------------------ GTMDPTFNPVITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQQC ---3333---------------------3333---------------1111--------- SLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAWGEIKSDQQWKVL 3333-------------------------------------1111%%%%----------- SKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRTSAPKITVLVGHDSNIASLLTAL -------------3333--1111-------------1111-------------------- DFKPYQLHDQNERTPIGGKIVFQRWHDSKANRDLMKIEYVYQSAEQLRNADALTLQAPAQ ------2222----1111-----------------------------------3333--- RVTLELSGCPIDADGFCPMDKFDSVLNEAVK -----1111--1111---------------- >NITROGEN REGULATION PROTE; SWP:P41789; PDB:1NTCA; MDLPGELFEASTPDSPSHLPPDSWATLLAQWADRALRSGHQNLLSEAQPELERTLLTTAL ----------------------------------------------------------33 RHTQGHKQEAARLLGWGAATLTAKLKELGME 33---3333--1111---------------- >TYROSYL-TRNA SYNTHETASE; SWP:P54577; PDB:1NTGA; PEEVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRL ----------------------------------------------1111-33332222- VVVLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDE --------------------------------------22222222---2222------- ELKPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNISLE --3333------------1111---%%%%---1111------2222----- >MONOMETHYLAMINE METHYLTRA; SWP:O30642; PDB:1NTHA; TFRKSFDCYDFYDRAKVGEKCTQDDWDLMKIPMKAMELKQKYGLDFKGEFIPTDKDMMEK ---------------------------------------------iiii----------- LFKAGFEMLLECGIYCTDTHRIVKYTEDEIWDAINNVQKEFVLGTGRDAVNVRKRSVGDK --------------------------------------------!!!!-------2222- AKPIVQGGPTGSPISEDVFMPVHMSYALEKEVDTIVNGVMTSVRGKSPIPKSPYEVLAAK 
--------iiii--3333-------3333-------------iiii--2222-------- TETRLIKNACAMAGRPGMGVKGPETSLSAQGNISADCTGGMTCTDSHEVSQLNELKIDLD ----------11111111------------------2222-1111------------333 AISVIAHYKGNSDIIMDEQMPIFGGYAGGIEETTIVDVATHINAVLMSSASWHLDGPVHI 3-------1111---------2222----------------3333--------------- RWGSTNTRETLMIAGWACATISEFTDILSGNQYYPCAGPCTEMCLLEASAQSITDTASGR ------------------------------------------------------------ EILSGVASAKGVVTDKTTGMEARMMGEVARATAGVEISEVNVILDKLVSLYEKNYASAPA -------%%%%-2222---------------2222---------------11111111-- GKTFQECYDVKTVTPTEEYMQVYDGARKKLEDLGLVF --3333---1111-----------------1111--- >NEUROTOXIN I; SWP:P01382; PDB:1NTN; ITCYKTPIITSETCAPGQNLCYTKTWCDAWCGSRGKVIELGCAATCPTVESYQDIKCCST ---------------------------1111------------------1111------- DNCNPHPKQKRP ------------ >DISABLED HOMOLOG 1; SWP:P97318; PDB:1NTVA; GQDRSEATLIKRFKGEGVRYKAKLIGIDEVSAARGDKLCQDSMMKLKGVVAGARSKGEHK -------------!!!!-----------------------------------1111---- QKIFLTISFGGIKIFDEKTGALQHHHAVHEISYIAKDITDHRAFGYVCGKEGNHRFVAIK -------1111-----1111------1111------1111---------2222------- TAQAAEPVILDLRDLFQLIYELKQREELEKKA ----------------------11113333-- >ALPHA-NEUROTOXIN; SWP:P01416; PDB:1NTX; RICYNHQSTTRATTKSCEENSCYKKYWRDHRGTIIERGCGCPKVKPGVGIHCCQSDKCNY ------------------------------------------------------2222-- >TRIPLE FUNCTIONAL DOMAIN ; SWP:O75962; PDB:1NTYA; ARRKEFIMAELIQTEKAYVRDLRECMDTYLWEMTSGVEEIPPGIVNKELIIFGNMQEIYE ----------------------------------------3333--3333---------- FHNNIFLKELEKYEQLPEDVGHCFVTWADKFQMYVTYCKNKPDSTQLILEHAGSYFDEIQ ----------1111-3333-------3333---------------------!!!!----- QRHGLANSISSYLIKPVQRITKYQLLLKELLTCCEEGKGEIKDGLEVMLSVPKRANDAMH ----------------------------------2222---------------------- LSMLEGFDENIESQGELILQESFQVWDPKTLIRKGRERHLFLFEMSLVFSKEVKDSSGRS 1111-----3333-----------------------------------------1111-- KYLYKSKLFTSELGVTEHVEGDPCKFALWVGRTPTSDNKIVLKASSIENKQDWIKHIREV --------3333------2222------------1111---------------------- IQERT -1111 
>HYPOTHETICAL PROTEIN YQGF; SWP:P52050; PDB:1NU0A; SGTLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNIIERLLKEWQPDEIIVGLP --------1111-------1111----------iiii----------------------- LNDGTEQPLTARARKFANRIHGRFGVEVKLHDERLSTVEAGGYRALNKGKVDSASAVIIL ------3333------------------------------------33331111------ ESYEQGY ------- >U1A RNA BINDING DOMAIN; SWP:P09012; PDB:1NU4A; RPNHTIYINNLNEKIKKDELKKSLHAIFSRFGQILDILVSRSLKMRGQAFVIFKEVSSAT -----------1111-------------1111----------------------3333-- NALRSMQGFPFYDKPMRIQYAKTDSDIIAKM -----2222-iiii-----------3333-- >CHLOROMUCONATE CYCLOISOME; SWP:P27099; PDB:1NU5A; MKIEAISTTIVDVPTRRPLQMSFTTVHKQSYVIVQVKAGGLVGIGEGGSVGGPTWGSESA -------------------------------------iiii------------------- ETIKVIIDNYLAPLLVGKDASNLSQARVLMDRAVTGNLSAKAAIDIALHDLKARALNLSI ----------333322221111-------------------------------1111--- ADLIGGTMRTSIPIAWTLASGDTARDIDSALEMIETRRHNRFKVKLGARTPAQDLEHIRS ------------------------------------------------------------ IVKAVGDRASVRVDVNQGWDEQTASIWIPRLEEAGVELVEQPVPRANFGALRRLTEQNGV ----!!!!------%%%%-------------------------1111------------- AILADESLSSLSSAFELARDHAVDAFSLKLCNMGGIANTLKVAAVAEAAGISSYGGTMLD ----3333---------1111-------3333---------------------------- STVGTAAALHVYATLPSLPYGCELIGPWVLGDRLTQQDLEIKDFEVHLPLGSGLGVDLDH -----------1111---------3333-----------------------!!!!----- DKVRHYTRA --------- >PROTHROMBIN; SWP:NA; PDB:1NU7D; MIVTKDYSKESRVNENSKYGTLISDWYLKGRLTSLESQFINALGILETYHYGEKEYKDAK -------------1111------3333--------------------3333-3333---- DKLMTRILGEDQYLLERKKVQYEEYKKLYKKYKEENPTSKVKMKTFDQYTIEDLTMREYN -----------------------------------1111-----3333------------ ELTESLKSAVKDFEKDVEIIENQHHDLKPFTDEMEEKATARVDDLANKAYSVYFAFVRDT --------------------11111111---------------------------1111- QHKTEALELKAKVDLVLGDEDKPHRISNERIEKEMIKDLESIIEDFFIETGLNKPDNITS ------------------1111-------------------------------------- YDSSKHHYKNHSEGFEALVKETREAVTNANDSWKTKTVKKYG -3333-------------------------3333-------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P22392; 
PDB:1NUEA; ANLERTFIAIKPDGVQRGLVGEIIKRFEQKGFRLVAMKFLRASEEHLKQHYIDLKDRPFF --------------1111---------3333-------------------1111--1111 PGLVKYMNSGPVVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSD -----1111---------2222-----------3333------------1111------- SVKSAEKEISLWFKPEELVDYKSCAHDWVYE -------------1111-----1111----- -------------------------------- >XANTHINE-GUANINE PHOSPHOR; SWP:P00501; PDB:1NULA; EKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISS -----------------3333-3333--------1111---------------------- LKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDDYVVDIP ----------2222-------------------1111-------33331111-------1 QDTWIEQPWDMGVVFVPPISGR 111---3333------------ >FIBROBLAST GROWTH FACTOR-; SWP:O15520; PDB:1NUNA; SYNHLQGDVRWRKLFSFTKYFLKIEKNGKVSGTKKENCPYSILEITSVEIGVVAVKAINS -1111----------1111-----1111--------------------2222-------- NYYLAMNKKGKLYGSKEFNNDCKLKERIEENGYNTYASFNWQHNGRQMYVALNGKGAPRR ------3333--------1111-------------------------------------3 GQKTRRKNTSAHFLPMVVH 333-33331111------- >Fibroblast growth factor ; SWP:P21802; PDB:1NUNB; KRAPYWTNTEKEKRLHAVPAANTVKFRCPAGGNPPTRWLKNGKEFKQEHRIGGYKVRNQH -------3333----------------------------------11112222---3333 WSLIESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVVGGDV -------1111---------1111------------------------------2222-- EFVCKVYSDAQPHIQWIKHVEKNGSKYGPDGLPYLKVLKHSGINSSNAEVLALFNVTEAD ---------------------iiii--1111----------1111-3333------3333 AGEYICKVSNYIGQANQSAWLTVLP ------------------------- >FKSG76; SWP:Q96T66; PDB:1NUUA; SRIPVVLLACGSFNPITNMHLRMFEVARDHLHQTGMYQVIQGIISPVNDTYGKKDLAASH -----------------------------------------------1111--------- HRVAMARLALQTSDWIRVDPWESEQAQWMETVKVLRHHHSKLLAVPELKLLCGADVLKTF -------1111--------3333------3333--------------------------- QTPNLWKDAHIQEIVEKFGLVCVGRVSHDPKGYIAESPILRMHQHNIHLAKEPVQNEISA -2222-3333--------------2222--------------3333-------------- TYIRRALGQGQSVKYLIPDAVITYIKDHGLYTK ------1111--2222----------------- >FRUCTOSE-1,6-BISPHOSPHATA; 
SWP:P00636; PDB:1NUYA; TNIVTLTRFVMEEGRKARGTGEMTQLLNSLCTAVKAISTAVRKAGIAHLYGIAGSTNVTG ------------3333----------------------------1111--------1111 DQVKKLDVLSNDLVINVLKSSFATCVLVSEEDKNAIIVEPEKRGKYVVCFDPLDGSSNID ---3333-----------3333------1111------3333------------333311 CLVSIGTIFGIYRKNSTDEPSEKDALQPGRNLVAAGYALYGSATMLVLAMVNGVNCFMLD 11------------------3333----1111---------------------------- PAIGEFILVDRDVKIKKKGSIYSINEGYAKEFDPAITEYIQRKKFPPDNSAPYGARYVGS ------------------------33331111-------------1111----------- MVADVHRTLVYGGIFMYPANKKSPKGKLRLLYECNPMAYVMEKAGGLATTGKEAVLDIVP -------------------1111------------------1111--------1111--- TDIHQRAPIILGSPEDVTELLEIYQKHA -1111-------------------1111 >HEMK PROTEIN; SWP:Q9WYV8; PDB:1NV8A; RKIWSLIRDCSGKLEGVTETSVLEVLLIVSRVLGIRKEDLFLKDLGVSPTEEKRILELVE -----------1111---------------------1111-------------------- KRASGYPLHYILGEKEFMGLSFLVEEGVFVPRPETEELVELALELIRKYGIKTVADIGTG ------1111------%%%%----2222---3333-----------------------!! SGAIGVSVAKFSDAIVFATDVSSKAVEIARKNAERHGVSDRFFVRKGEFLEPFKEKFASI !!-------------------------------11111111--------11111111--- EMILSNPPYVKSSAHLPKDVLFEPPEALFGGEDGLDFYREFFGRYDTSGKIVLMEIGEDQ ----------1111-3333----3333-------------------2222---------- VEELKKIVSDTVFLKDSAGKYRFLLLNRRSS ---1111--------1111------------ >4-HYDROXY-2-OXOVALERATE A; SWP:P51016; PDB:1NVMA; TFNPSKKLYISDVTLRDGSHAIRHQYTLDDVRAIARALDKAKVDSIEVAHGDGLQGSSFN ------------1111-3333----------------------------1111-----11 YGFGRHTDLEYIEAVAGEISHAQIATLLLPGIGSVHDLKNAYQAGARVVRVATHCTEADV 11-------------1111---------2222---------------------1111--- SKQHIEYARNLGMDTVGFLMMSHMIPAEKLAEQGKLMESYGATCIYMADSGGAMSMNDIR --------1111--------1111------------------------1111--3333-- DRMRAFKAVLKPETQVGMHAHHNLSLGVANSIVAVEEGCDRVDASLAGMGAGAGNAPLEV ----------1111-------11113333-----1111------2222--!!!!------ FIAVAERLGWNHGTDLYTLMDAADDIVRPLQDRPVRVDRETLGLGYAGVYSSFLRHAEIA -----1111-----------------3333-------------------3333------- AAKYNLKTLDILVELGHRRMVGGQEDMIVDVALDLLAAHK 
---------------1111-22221111------------ >Acetaldehyde dehydrogenas; SWP:Q52060; PDB:1NVMB; MNQKLKVAIIGSGNIGTDLMIKVLRNAKYLEMGAMVGIDAASDGLARAQRMGVTTTYAGV --------------------------------------1111------1111-------- EGLIKLPEFADIDFVFDATSASAHVQNEALLRQAKPGIRLIDLTPAAIGPYCVPVVNLEE -----3333----------3333-----------1111--------------33333333 HLGKLNVNMVTCGGQATIPMVAAVSRVAKVHYAEIVASISSKSAGPGTRANIDEFTETTS 1111-------------------1111------------3333-3333------------ KAIEVIGGAAKGKAIIIMNPAEPPLIMRDTVYVLSAAADQAAVAASVAEMVQAVQAYVPG ---------------------------------------------------------111 YRLKQQVQFDVIPESAPLNIPGLGRFSGLKTSVFLEVEGAAHYLPAYAGNLDIMTSAALA 1-----------1111---2222---------------------3333------------ TAERMAQSMLNA ------------ >Transcription initiation ; SWP:P52655; PDB:1NVPB; TVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLM ---------------------1111------------------ >Transcription initiation ; SWP:P52655; PDB:1NVPC; DTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSKAIGDAEW ---------------!!!!-----------iiii------------- >Transcription initiation ; SWP:P52657; PDB:1NVPD; YQLYRNTTLGNSLQESLDELIQSQQITPQLALQVLLQFDKAINAALAQRVRNRVNFRGSL ------------------------------------------------------------ NTYRFCDNVWTFVLNDVEFREVTELIKVDKVKIVACD -----%%%%---------------------------- >SHIKIMATE 5'-DEHYDROGENAS; SWP:Q58484; PDB:1NVTA; GPLGSMINAKTKVIGLIGHPVEHSFSPIMHNAAFKDKGLNYVYVAFDVLPENLKYVIDGA -------1111---------1111----------1111----------33331111---- KALGIVGFNVTIPHKIEIMKYLDEIDKDAQLIGAVNTIKIEDGKAIGYNTDGIGARMALE ---------------3333-------3333----------iiii-----------33331 EEIGRVKDKNIVIYGAGGAARAVAFELAKDNNIIIANRTVEKAEALAKEIAEKLNKKFGE 111----------------------1111---------3333---------1111----- EVKFSGLDVDLDGVDIIINATPIGMYPNIDVEPIVKAEKLREDMVVMDLIYNPLETVLLK -----1111-2222-------2222----------3333--------------------- EAKKVNAKTINGLGMLIYQGAVAFKIWTGVEPNIEVMKNAIIDKITK ----------------------------------------3333--- >CHOLINE KINASE (49.2 KD); SWP:Q22942; PDB:1NW1A; 
GMKELLSTMDLDTDANTIPELKERAHMLCARFLGGAWKTVPLEHLRISRIKGGMSNMLFL -----111111113333------------------3333-3333--------1111---- CRLSEVYPPIRNEPNKVLLRVYFNPETESHLVAESVIFTLLSERHLGPKLYGIFSGGRLE ---3333----------------------------------------------2222--- EYIPSRPLSCHEISLAHMSTKIAKRVAKVHQLEVPIWKEPDYLCEALQRWLKQLTGTVDA --------3333---3333---------1111---------------------1111-11 EHRFDLPEECGVSSVNCLDLARELEFLRAHISLSKSPVTFCHNDLQEGNILLPKRLVLID 11----3333-------------------3333------------3333----------- FEYASYNYRAFDFANHFIEWTIDYDIDEAPFYKIQTENFPENDQMLEFFLNYLREQGNTR 1111---3333-------1111------------3333---------------------1 ENELYKKSEDLVQETLPFVPVSHFFWGVWGLLQVELSPVGFGFADYGRDRLSLYFKHKQL 111-----------3333-------------3333---------------------3333 LKNLA ----- >THIOREDOXIN; SWP:P80579; PDB:1NW2A; ATMTLTDANFQQAIQGDKPVLVDFWAAWCGPCRMMAPVLEEFAEAHADKVTVAKLNVDEN -----3333-3333-----------1111-3333-----------1111------3333- PETTSQFGIMSIPTLILFKGGEPVKQLIGYQPKEQLEAQLADVLQ ----1111----------iiii---------3333-----3333- >HISTONE METHYLTRANSFERASE; SWP:Q8TEK3; PDB:1NW3A; LELRLKSPVGAEPAVYPWPLPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYD -----------------------------------------------11112222----3 TKSFESMQRLCDKYNRAIDSIHQLWKGTTQPMKLNTRPSTGLLRHILQQVYNHSVTDPEK 333--------------------3333-----------------------------3333 LNNYEPFSPEVYGETSFDLVAQMIDEIKMTDDDLFVDLGSGVGQVVLQVAAATNCKHHYG -------1111------------------3333------!!!!----------------- VEKADIPAKYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFA ----------------------------------------3333---1111------111 FGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINSRNLSDIGTIMRVVELSPLKGSVS 1-----------11112222---------------1111--3333--------------1 WTGKPVSYYLHTIDRTILENYFSSLKNP 111-------------------3333-- >Caspase-9 [Precursor]; SWP:P55211; PDB:1NW9B; GALESLRGNADLAYILSMEPCGHCLIINNVNFCRESGLRTRTGSNIDCEKLRRRFSSLHF 1111----1111--------------------3333------------------------ MVEVKGDLTAKKMVLALLELARQDHGALDCCVVVILSHGCQASHLQFPGAVYGTDGCPVS 
-------------------1111-1111------------------------1111---- VEKIVNIFNGTSCPSLGGKPKLFFIQACGATPFQSSLPTPSDIFVSYSTFPGFVSWRDPK ----3333----3333--------------------------------------1111-- SGSWYVETLDDIFEQWAHSEDLQSLLLRVANAVSVKGIYKQMPGCFNFLRKKLFFKTS --------------------------------1111---------------------- >PEPTIDE METHIONINE SULFOX; SWP:P96814; PDB:1NWAA; HMTSNQKAILAGGCFWGLQDLIRNQPGVVSTRVGYSGGNIPNATYRNHGTHAEAVEIIFD 1111----------------33332222---------------3333!!!!--------- PTVTDYRTLLEFFFQIHDPTTKDRQGNDRGTSYRSAIFYFDEQQKRIALDTIADVEASGL ------------------------!!!!--1111---------------------3333- WPGKVVTEVSPAGDFWEAEPEHQDYLQRYPNGYTCHFVRPGWRLPRR ------------------3333-1111-1111------1111----- >HYPOTHETICAL PROTEIN AQ_1; SWP:O67709; PDB:1NWBA; MQEQAQQFIFKVTDKAVEEIKKVAQENNIENPILRIRVVPGGCSGFQYAMGFDDTVEEGD -------------3333------1111--------------3333--------------- HVFEYDGVKVVIDPFSMPYVNGAELDYVVDFMGGGFTIRNP ---------------11112222------!!!!-------- >AZURIN; SWP:P34097; PDB:1NWPA; AECKVTVDSTDQMSFNTKDIAIDKSCKTFTVELTHSGSLPKNVMGHNLVISKEADMQPIA ---------1111---------3333-------------1111--------3333----- TDGLSAGIDKQYLKDGDARVIAHTKVIGAGEKDSVTFDVSKLAAGEKYGFFCSFPGHISM --11113333---2222----------2222------3333-2222-------2222111 MKGTVTLK 1------- >CCAAT/ENHANCER BINDING PR; SWP:P05554; PDB:1NWQA; NSNEYRVRRERNNIAVRKSRDKAKQRNVETQQKVLELTSDNDRLRKRVEQLSRELDTLRG 1111-------------------------------------------------------- >COMPLEMENT DECAY-ACCELERA; SWP:P08174; PDB:1NWVA; FRSCEVPTRLNSASLKQPYITQNYFPVGTVVEYECRPGYRREPSLSPKLTCLQNLKWSTA ------------------------------------------------------------ VEFCKKKSCPNPGEIRNGQIDVPGGILFGATISFSCNTGYKLFGSTSSFCLISGSSVQWS ------------------------------------2222-------------------- DPLPECREH --------- >LIMONENE-1,2-EPOXIDE HYDR; SWP:Q9ZAG3; PDB:1NWWA; IEQPRWASKDSAAGAASTPDEKIVLEFMDALTSNDAAKLIEYFAEDTMYQNMPLPPAYGR ---1111--1111----------------1111-3333-1111-------3333------ DAVEQTLAGLFTVMSIDAVETFHIGSSNGLVYTERVDVLRALPTGKSYNLSILGVFQLTE 
--------------------------iiii-----------1111-------------ii GKITGWRDYFDLREFEEAVDLPLRG ii-------------------1111 >PHOTOACTIVE YELLOW PROTEI; SWP:P16113; PDB:1NWZA; MEHVAFGSEDIENTLAKMDDGQLDGLAFGAIQLDGDGNILQYNAAEGDITGRDPKQVIGK ----2222-3333-222233331111-------1111---------------33332222 NFFKDVAPCTDSPEFYGKFKEGVASGNLNTMFEYTFDYQMTPTKVKVHMKKALSGDSYWV 3333--3333-1111--------------------------------------------- FVKRV ----- >CARBAPENEM SYNTHASE; SWP:Q9XB59; PDB:1NX8A; SEIVKFNPVMASGFGAYIDHRDFLEAKTETIKNLLMRQGFVVVKNLDIDSDTFRDIYSAY ---------3333-------------3333--------------------------3333 GTIVEYRDTLKLEGEKGKIVTGRGQLPFHADGGLLLSQVDQVFLYAAEIKNVKFRGATTV ------------------1111-------------------------------------- CDHALACQEMPAHLLRVLEEETFEVRVLWFKVPVFTDLGWVRKMLIYFPFDEGQPASWEP ----------1111------------------------------------2222------ RIVGFTDHETQAFFQELGAFLKQPRYYYKHFWEDGDLLIMDNRRVIHEREEFNDDDIVRR -22223333-------------3333------2222----1111---------3333--- LYRGQTAD -------- >ALPHA-AMINO ACID ESTER HY; SWP:Q8VRK8; PDB:1NX9A; HDPLSVQTGSDIPASVHMPTDQQRDYIKREVMVPMRDGVKLYTVIVIPKNARNAPILLTR -1111-------------3333------------1111---------2222--------- TPYNAKGRANRVPNALTMREVLPQGDDVFVEGGYIRVFQDIRGKYGSQGDYVMTRPPHGP ---3333---------3333--3333---1111-------2222--------------11 LNPTKTDETTDAWDTVDWLVHNVPESNGRVGMTGSAYEGFTVVMALLDPHPALKVAAPES 11----3333------------1111------------------3333-1111------- PMVDGWMGDDWFHYGAFRQGAFDYFVSQMTARGGGNDIPRRDADDYTNFLKAGSAGSFAT ------------iiii---------------------------3333------------- QAGLDQYPFWQRMHAHPAYDAFWQGQALDKILAQRKPTVPMLWEQGLWDQEDMWGAIHAW ---1111-----1111---33331111----3333------------------------- QALKDADVKAPNTLVMGPWRHSGVNYNGSTLGPLEFEGDTAHQYRRDVFRPFFDEYLKPG ---1111------------2222-------!!!!-----------------------222 SASVHLPDAIIYNTGDQKWDYYRSWPSVCESNCTGGLTPLYLADGHGLSFTHPAADGADS 2------------------------------------------%%%%------------- YVSDPAHPVPFISRPFAFAQSSRWKPWLVQDQREAESRPDVVTYETEVLDEPVRVSGVPV ---1111---------33331111-3333---3333-1111------------------- 
ADLFAATSGTDSDWVVKLIDVQPAMTPDDPKMGGYELPVSMDIFRGRYRKDFAKPEALQP -------------------------11111111------------1111-1111----22 DATLHYHFTLPAVNHVFAKGHRIMVQIQSSWFPLYDRNPQKFVPNIFDAKPADYTVATQS 22---------------2222-----------------------3333-3333------- IHHGGKEATSILLPVVK ---!!!!---------- >NEUROTOXIN B; SWP:Q90VW1; PDB:1NXB; RICFNQHSSQPQTTKTCSPGESSCYHKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVC --------------------------------------------------------2222 NN -- >Mannosyl-oligosaccharide ; SWP:P45700; PDB:1NXCA; FLPPVGVENREPADATIREKRAKIKEMMTHAWNNYKRYAWGLNELKPISKEGHSSSLFGN --------------------------------------2222-------------1111- IKGATIVDALDTLFIMGMKTEFQEAKSWIKKYLDFNVNAEVSVFEVNIRFVGGLLSAYYL -------------1111------------------------------------------- SGEEIFRKKAVELGVKLLPAFHTPSGIPWALLNMKSGIGRNWPWASGGSSILAEFGTLHL ------------------11111111---------------1111%%%%----------- EFMHLSHLSGDPVFAEKVMKIRTVLNKLDKPEGLYPNYLNPSSGQWGQHHVSVGGLGDSF ------------------------3333-2222--------------------2222--- YEYLLKAWLMSDKTDLEAKKMYFDAVQAIETHLIRKSSGGLTYIAEWKGGLLEHKMGHLT --------1111------------------------1111-------iiii-----3333 CFAGGMFALGADGAPEARAQHYLELGAEIARTCHESYNRTYVKLGPEAFRFDGGVEAIAT ---------3333-2222------------------1111-----------iiii----- RQNEKYYILRPEVIETYMYMWRLTHDPKYRTWAWEAVEALESHCRVNGGYSGLRDVYIAR 1111-----------------------------------------1111-----1111-- ESYDDVQQSFFLAETLKYLYLIFSDDDLLPLEHWIFNTEAHPFPILR -------3333---------11111111-3333---1111------- >MTH396 PROTEIN; SWP:O26496; PDB:1NXHA; EGELRLKRRILESYRWQEDVVKPLSRELEIDVEEFQDILDKLDSSLEALHPRFESARPRC ---------------3333------------------------3333------------- IREKLHSDLQLCWLVDVEIISVDDAEALKDEITELVLAGREYSEALSEGRRRLHEILRS -----------------------------------1111-------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9KUU1; PDB:1NXIA; MSHQDDYLSVEELIEIQKEETRDIIQALLEDGSDPDALYEIEHHLFAEDFDKLEKAAVEA --------3333----------------3333-1111----------------------3 FKMGFEVLEAEETEDEDGNKLLCFDATMQSALDAKLIDEQVEKLVNLAEKFDIIYDGWGT 
333-----------------------------3333------------------------ YYEGLEHHHHHH ------------ >Probable S-adenosylmethio; SWP:P96224; PDB:1NXJA; AISFRPTADLVDDIGPDVRSCDLQFRQFGGRSQFAGPISTVRCFQDNALLKSVLSQPSAG --------------1111----------------------------------1111---- GVLVIDGAGSLHTALVGDVIAELARSTGWTGLIVHGAVRDAAALRGIDIGIKALGTNPRK ------%%%%------3333-------------------33331111------------- STKTGAGERDVEITLGGVTFVPGDIAYSDDDGIIVV --------------iiii--2222----3333---- >MAP KINASE-ACTIVATED PROT; SWP:P49137; PDB:1NXKA; QFPQFHVKSGLQIKKNAIIDDYKVTSQVLGLGINGKVLQIFNKRTQEKFALKLQDCPKAR --1111----------3333-----------2222------------------------- REVELHWRASQCPHIVRIVDVYENLYAGRKCLLIVECLDGGELFSRIQDRAFTEREASEI -----------1111--------------------------------------------- KSIGEAIQYLHSINIAHRDVKPENLLYTSKRPNAILKLTDFGFAKETTPYYVAPEVLGPE --------------------3333----------------1111--------3333---- KYDKSCDWSLGVIYILLCGYPPFYSNHGLAISPGKTRIRGQYEFPNPEWSEVSEEVKLIR -3333--------3333--------3333-----3333--------1111--3333--11 NLLKTEPTQRTITEFNHPWIQSTKVPQTPLHTSRVLKED 11----11113333---------------------3333 >DTDP-6-DEOXY-D-XYLO-4-HEX; SWP:Q8GIQ0; PDB:1NXMA; NFFGKTLAARPVEAIPGMLEFDIPVHGDNRGWFKENFQKEKMLPLGFPESFFAEGKLQNN -----------3333------------1111------3333-1111-33331111----- VSFSRKNVLRGLHAEPWDKYISVADGGKVLGTWVDLREGETFGNTYQTVIDASKSIFVPR ----2222------------------------------1111--------1111----22 GVANGFQVLSDFVAYSYLVNDYWALELKPKYAFVNYADPSLDIKWENLEEAEVSEADENH 22---------------------33331111---1111--------3333---3333--- PFLKDVKPLRKEDL -3333----3333- >HYPOTHETICAL OXIDOREDUCTA; SWP:P37672; PDB:1NXUA; KVTFEQLKAAFNRVLISRGVDSETADACAEFARTTESGVYSHGVNRFPRFIQQLENGDII ---------------1111---------------1111-11111111------1111--- PDAQPKRITSLGAIEQWDAQRSIGNLTAKKDRAIELAADHGIGLVALRNANHWRGGSYGW ----------!!!!----%%%%-------------------------------------- QAAEKGYIGICWTNSIAVPPWGAKECRIGTNPLIVAIPSTPITVDSSFSYGLEVNRLAGR --1111-------------2222--------------------------------1111- QLPVDGGFDDEGNLTKEPGVIEKNRRILPGYWKGSGSIVLDIATLLSDGASVAEVTQDNS 
--------------------------------------------1111------------ DEYGISQIFIAIEVDKLIDGPTRDAKLQRIDYVTSAERADENQAIRLPGHEFTTLLAENR -------------1111---------------1111---1111---2222---------- RNGITVDDSVWAKIQAL ----------------- >PROBABLE POLYSACCHARIDE D; SWP:O34928; PDB:1NY1A; VPNEPINWGFKRSVNHQPPDAGKQLNSLIEKYDAFYLGNTKEKTIYLTFDNGYENGYTPK -------------%%%%------------------------------------------- VLDVLKKHRVTGTFFVTGHFVKDQPQLIKRSDEGHIIGNHSFHHPDLTTKTADQIQDELD ------------------------------1111-----------3333----------- SVNEEVYKITGKQDNLYLRPPRGVFSEYVLKETKRLGYQTVFWSVAFVDWKINNQKGKKY -------------------2222----------1111-------------1111------ AYDHIKQAHPGAIYLLHTVSRDNAEALDDAITDLKKQGYTFKSIDDLFEKE ----11112222-------2222-------------------3333-3333 ------------------------------------------------------------ ----------- >TRANSCRIPTIONAL REGULATOR; SWP:O67198; PDB:1NY5A; MNVLVIEDDKVFRGLLEEYLSMKGIKVESAERGKEAYKLLSEKHFNVVLLDLLLPDVNGL ----------3333-----------------3333---1111-----------3333333 EILKWIKERSPETEVIVITGHGTIKTAVEAMKMGAYDFLTKPCMLEEIELTINKAIEHRK 3--------3333------2222-------1111-------------------------- LRKENELLRREKDLKEEEYVFESPKMKEILEKIKKISCAECPVLITGESGVGKEVVARLI -----------3333--------------------1111--------2222--------- HKLSDRSKEPFVALNVASIPRDIFEAELFGYEKGAFTGAVSSKEGFFELADGGTLFLDEI ---1111-------3333-3333--------22222222----------2222-----33 GELSLEAQAKLLRVIESGKFYRLGGRKEIEVNVRILAATNRNIKELVKEGKFREDLYYRL 33-------------------2222----------------------------------- GVIEIEIPPLRERKEDIIPLANHFLKKFSRKYAKEVEGFTKSAQELLLSYPWYGNVRELK --------33333333-----------------------------------1111----- NVIERAVLFSEGKFIDRGELSCLV -------------------3333- >Genome polyprotein M; SWP:P03599; PDB:1NY71; GPVCAEASDVYSPCMIASTPPAPFSDVTAVTFDLINGKITPVGDDNWNTHIYNPPIMNVL -----------------------------------------------------3333--- RTAAWKSGTIHVQLNVRGAGVKRADWDGQVFVYLRQSMNPESYDARTFVISQPGSAMLNF ---------------------3333-------------3333------------------ SFDIIGPNSGFEFAESPWANQTTWYLECVATNPRQIQQFEVNMRFDPNFRVAGNILMPPF 
-------------------------------1111----------1111----------- PLSTETPPL --------- >Genome polyprotein M; SWP:P03599; PDB:1NY72; MEQNLFALSLDDTSSVRGSLLDTKFAQTRVLLSKAMAGGDVLLDEYLYDVVNGQDFRATV ---3333-----------3333-----------------------3333------1111- AFLRTHVITGKIKVTATTNISDNSGCCLMLAINSGVRGKYSTDVYTICSQDSMTWNPGCK ------------------------------------------3333---------1111- KNFSFTFNPNPCGDSWSAEMISRSRVRMTVICVSGWTLSPTTDVIAKLDWSIVNEKCEPT ---------1111--------1111----------------------------------- IYHLADCQNWLPLNRWMGKLTFPQGVTSEVRRMPLSIGGGAGATQAFLANMPNSWISMWR -------------------------------------------------------3333- YFRGELHFEVTKMSSPYIKATVTFLIAFGNLSDAFGFYESFPHRIVQFAEVEEKCTLVFS --------------1111---------11111111-1111--------1111-------1 QQEFVTAWSTQVNPRTTLEADGCPYLYAIIHDSTTGTISGDFNLGVKLVGIKDFCGIGSN 111---------11113333---------------------------------------- PGIDGSRLL --------- >PROTEIN YRBA; SWP:NA; PDB:1NY8A; MIEDPMENNEIQSVLMNALSLQEVHVSGDGSHFQVIAVGELFDGMSRVKKQQTVYGPLME ------------------------------------------------------------ YIADNRIHAVSIKAYTPAEWARDRKLNGFLEHHHHHH ---3333--------3333------------------ >TRANSCRIPTIONAL ACTIVATOR; SWP:P32184; PDB:1NY9A; WQRIQDEADELTRRFVALMDAGEPADSEGAMDAAEDHRQGIARNHYDCGYEMHTCLGEMY -----3333--------------1111--------------3333--------------- VSDERFTRNIDAAKPGLAAYMRDAILANAVRHTP --3333---33332222----------------- >CALERYTHRIN; SWP:P06495; PDB:1NYAA; TTAIASDRLKKRFDRWDFDGNGALERADFEKEAQHIAEAFGKDAGAAEVQTLKNAFGGLF --3333------33331111----3333---------1111------------------- DYLAKEAGVGSDGSLTEEQFIRVTENLIFEQGEASFNRVLGPVVKGIVGMCDKNADGQIN ---------1111--3333------------3333----3333----3333--------3 ADEFAAWLTALGMSKAEAAEAFNQVDTNGNGELSLDELLTAVRDFHFGRLDVELLG 333-----1111-3333--------1111--------------------------- >CYSTEINE PROTEASE INHIBIT; SWP:NA; PDB:1NYCA; SMYQLQFINLVYDTTKLTHLEQTNINLFIGNWSNHQLQKSICIRHGDDTSHNQYHILFID ------------3333---------1111-----1111--------3333---------- TAHQRIKFSSFDNEEIIYILDYDDTQHILMQTSSKQGIGTSRPIVYERLV 
---------1111----------1111----------------------- >RIESKE IRON-SULFUR PROTEI; SWP:O52396; PDB:1NYKA; TPEKEPLKPGDILVYAQGGGEPKPIRLEELKPGDPFVLAYPMDPKTKVVKSGEAKNTLLV 3333---2222--------------3333-2222---------------1111------- ARFDPEELAPEVAQHAAEGVVAYSAVCTHLGCIVSQWVADEEAALCPCHGGVYDLRHGAQ ---3333--3333---iiii------------------1111--------------%%%% VIAGPPPRPVPQLPVRVEDGVLVAAGEFLGPVGVQA -----------------iiii--------------- >Hypothetical 12.0 kDa pro; SWP:P38804; PDB:1NYNA; MSTVTKYFYKGENTDLIVFAASEELVDEYLKNPSIGKLSEVVELFEVFTPQDGRGAEGEL ---------------------------------1111-----------------1111-- GAASKAQVENEFGKGKKIEEVIDLILRNGKPNSTTSSLKTKGGNAGTKAYN ---3333-----------3333-3333------------------------ >IMMUNOGENIC PROTEIN MPT70; SWP:Q50769; PDB:1NYOA; GDLVGPGCAEYAAANPTGPASVQGMSQDPVAVAASNNPELTTLTAALSGQLNPQVNLVDT -----------------1111-3333----3333--------------3333-------- LNSGQYTVFAPTNAAFSKLPASTIDELKTNSSLLTSILTYHVVAGQTSPANVVGTRQTLQ ---------------3333--------------------------------------333 GASVTVTGQGNSLKVGNADVVCGGVSTANATVYMIDSVLMPPA 3-------------iiii------------------------- >PINCH PROTEIN; SWP:P48059; PDB:1NYPA; GSMGVPICGACRRPIEGRVVNAMGKQWHVEHFVCAKCEKPFLGHRHYERKGLAYCETHYN ---------------------%%%%--1111-----------------%%%%--3333-- QLFGDV ------ >THREONYL-TRNA SYNTHETASE ; SWP:Q8NW68; PDB:1NYRA; INIQFPDGNKKAFDKGTTTEDIAQSISPGLRKKAVAGKFNGQLVDLTKPLETDGSIEIVT --------------------------------------------1111------------ PGSEEALEVLRHSTAHLMAHAIKRLYGNVKFGVGPVIEGGFYYDFDIDQNISSDDFEQIE ----------------------------------------------------1111---- KTMKQIVNENMKIERKVVSRDEAKELFSNDEYKLELIDAIPEDENVTLYSQGDFTDLCRG ------3333----------------------------------------!!!!------ VHVPSTAKIKEFKLLSTAGAYWRGDSNNKMLQRIYGTAFFDKKELKAHLQMLEERKERDH ----1111------------2222------------------------------333333 RKIGKELELFTNSQLVGAGLPLWLPNGATIRREIERYIVDKEVSMGYDHVYTPVLANVDL 33--1111----3333-------------------------1111--------------- YKTSGHWDHYQEDMFPPMQLDETESMVLRPMNCPHHMMIYANKPHSYRELPIRIAELGTM 
-----33333333--------------------------------3333----------- HRYEASGAVSGLQRVRGMTLNDSHIFVRPDQIKEEFKRVVNMIIDVYKDFGFEDYSFRLS ----3333-------------------1111----------------1111--------- YRDPEDKEKYFDDDDMWNKAENMLKEAADELGLSYEEAIGEAAFYGPKLDVQVKTAMGKE ----------------------------1111----------1111-------------- ETLSTAQLDFLLPERFDLTYIGQDGEHHRPVVIHRGVVSTMERFVAFLTEETKGAFPTWL ----------3333-------1111--------------------------iiii-3333 APKQVQIIPVNVDLHYDYARQLQDELKSQGVRVSIDDRNEKMGYKIREAQMQKIPYQIVV --------------------------1111------------------------------ GDKEVENNQVNVRQYGSQDQETVEKDEFIWNLVDEIRLKKHR --3333------------------------------------ >Inhibin beta A chain [Pre; SWP:P08476; PDB:1NYSB; LECDGNICCKKQFFVSFKDIGWNDWIIAPSGYHANYCEGECPSHILKSCCVPTKLRPMSM ---------------3333---1111---------------------------------- LYYDDGQNIIKKDIQNMIVEECGCS ------------------------- >SHIKIMATE 5-DEHYDROGENASE; SWP:P15770; PDB:1NYTA; METYAVFGNPIAHSKSPFIHQQFAQQLNIEHPYGRVLAPINDFINTLNAFFSAGGKGANV ----------1111------------------------1111--------1111------ TVPFKEEAFARADELTERAALAGAVNTLMRLEDGRLLGDNTDGVGLLSDLERLSFIRPGL ------------------------------1111----------------------2222 RILLIGAGGASRGVLLPLLSLDCAVTITNRTVSRAEELAKLFAHTGSIQALSMDELEGHE ------------------------------3333-------3333------33332222- FDLIINATSSGISGDIPAIPSSLIHPGIYCYDMFYQKGKTPFLAWCEQRGSKRNADGLGM ---------3333------3333-1111------------------1111---------- LVAQAAHAFLLWHGVLPDVEPVIKQLQEELS ------------------------------- >RIBONUCLEASE P PROTEIN CO; SWP:Q9X1H4; PDB:1NZ0A; ERLRLRRDFLLIFKEGKSLQNEYFVVLFRKNGDYSRLGIVVKRKFGKATRRNKLKRWVRE ---3333-------------1111-----------------3333--------------- IFRRNKGVIPKGFDIVVIPRKKLSEEFERVDFWTVREKLLNLLKRIEG ----3333-----------------3333------------1111--- >AUXILIN; SWP:Q27974; PDB:1NZ6A; DPEKLKILEWIEGKERNIRALLSTHTVLWAGETKWKPVGADLVTPEQVKKVYRKAVLVVH 3333------2222------1111----2222---------------------------3 PDKATGQPYEQYAKIFELNDAWSEFENQGQKPLY 3332222-3333-------------1111----- >TRANSCRIPTION ANTITERMINA; SWP:P35872; 
PDB:1NZ8A; SIEWYAVHTLVGQEEKAKANLEKRIKAFGLQDKIFQVLIPTEEVVELREGGKKEVVRKKL ---------22223333------------------------------------------- FPGYLFIQMDLGDEEEPNEAWEVVRGTPGITGFVGAGMRPVPLSPDEVRHILEVSGLLG 2222---------------------------------------3333------------ >TRANSCRIPTION ANTITERMINA; SWP:P35872; PDB:1NZ9A; AQVAFREGDQVRVVSGPFADFTGTVTEINPERGKVKVMVTIFGRETPVELDFSQVVKA -----2222------1111---------------------%%%%------1111---- >DIVALENT CATION TOLERANCE; SWP:Q7SIA8; PDB:1NZAA; MEEVVLITVPSEEVARTIAKALVEERLAACVNIVPGLTSIYRWQGEVVEDQELLLLVKTT ------------------------------------------%%%%-------------3 THAFPKLKERVKALHPYTVPEIVALPIAEGNREYLDWLRENTG 333-----------------------------------1111- >COMPLEMENT C1S COMPONENT; SWP:P09871; PDB:1NZIA; EPTMYGEILSPNYPQAYPSEVEKSWDIEVPEGYGIHLYFTHLDIELSENCAYDSVQIISE ---------2222----------------2222-------------2222---------- EGRLCGQRSSNNPHSPIVEEFQVPYNKLQVIFKSDFSNEERFTGFAAYYVATDINECTDV -----------1111----------------------1111-------------3333-- DVPCSHFCNNFIGGYFCSCPPEYFLHDDMKNCGVN ----------2222-----2222--1111------ >HYPOTHETICAL PROTEIN YADB; SWP:P27305; PDB:1NZJA; TQYIGRFAPSPSGELHFGSLIAALGSYLQARARQGRWLVRIEDIDPPREVPGAAETILRQ ------------------------------1111----------3333-2222------- LEHYGLHWDGDVLWQSQRHDAYREALAWLHEQGLSYYCTCTRARIQSIGGIYDGHCRVLH -1111--------3333------------1111------------1111----1111--- HGPDNAAVRIRQQHPVTQFTDQLRGIIHADEKLAREDFIIHRRDGLFAYNLAVVVDDHFQ -----------------------------3333--------1111---3333-------- GVTEIVRGADLIEPTVRQISLYQLFGWKVPDYIHLPLALNALPKGDPRPVLIAALQFLGQ -------3333--------------------------------------------1111- QAEAHWQDFSVEQILQSAVKNWRLTAVPESAIV ----3333--------------3333------- >FISSION PROTEIN FIS1P; SWP:Q9Y3D6; PDB:1NZNA; HEAVLNELVSVEDLLKFEKKFQSEKAAGSVSKSTQFEYAWCLVRTRYNDDIRKGIVLLEE ------------------------1111-------------1111--------------- LLPKGSKEEQRDYVFYLAVGNYRLKEYEKALKYVRGLLQTEPQNNQAKELERLIDKAKKD 1111-----------------1111---------------1111---------------- >4-CHLOROBENZOYL COENZYME ; SWP:Q9R4B0; PDB:1NZYA; 
MYEAIGHRVEDGVAEITIKLPRHRNALSVKAMQEVTDALNRAEEDDSVGAVMITGAEDAF ---------iiii------3333---------------------1111-------!!!!- CAGFYLREIPLDKGVAGVRDHFRIAALWWHQMIHKIIRVKRPVLAAINGVAAGGGLGISL ----1111---------------------------------------------------- ASDMAICADSAKFVCAWHTIGIGNDTATSYSLARIVGMRRAMELMLTNRTLYPEEAKDWG -------1111------1111---%%%%-------------------------------- LVSRVYPKDEFREVAWKVARELAAAPTHLQVMAKERFHAGWMQPVEECTEFEIQNVIASV ------3333----------------------------3333-3333------------- THPHFMPCLTRFLDGHRADRPQVELPAGV -3333------1111-------------- >ALDEHYDE DEHYDROGENASE, M; SWP:P05091; PDB:1O04A; AVPAPNQQPEVFCNQIFINNEWHDAVSRKTFPTVNPSTGEVICQVAEGDKEDVDKAVKAA -----------------%%%%---3333-------------------------------- RAAFQLGSPWRRMDASHRGRLLNRLADLIERDRTYLAALETLDNGKPYVISYLVDLDMVL ----22223333-3333------------------------------------------- KCLRYYAGWADKYHGKTIPIDGDFFSYTRHEPVGVCGQIIPWNFPLLMQAWKLGPALATG ------1111----------------------------------------------1111 NVVVMKVAEQTPLTALYVANLIKEAGFPPGVVNIVPGFGPTAGAAIASHEDVDKVAFTGS -------3333----------------2222----------------------------3 TEIGRVIQVAAGSSNLKRVTLELGGKSPNIIMSDADMDWAVEQAHFALFFNQGQCSCAGS 333----------------------------1111-------------------1111-- RTFVQEDIYDEFVERSVARAKSRVVGNPFDSKTEQGPQVDETQFKKILGYINTGKQEGAK ----3333-----------1111---1111------------------------1111-- LLCGGGIAADRGYFIQPTVFGDVQDGMTIAKEEIFGPVMQILKFKTIEEVVGRANNSTYG -----------------------11111111----------------------------- LAAAVFTKDLDKANYLSQALQAGTVWVNCYDVFGAQSPFGGYKMSGSGRELGEYGLQAYT ---------------------------------3333----!!!!------33331111- EVKTVTVKVPQKNS -------------- >BETA-PHOSPHOGLUCOMUTASE; SWP:P71447; PDB:1O08A; MFKAVLFDLDGVITDTAEYHFRAWKALAEEIGINGVDRQFNEQLKGVSREDSLQKILDLA --------2222--3333----------1111----33331111------------1111 DKKVSAEEFKELAKRKNDNYVKMIQDVSPADVYPGILQLLKDLRSNKIKIALASASKNGP ---------------------1111--3333-2222-------1111--------1111- FLLERMNLTGYFDAIADPAEVAASKPAPDIFIAAAHAVGVAPSESIGLEDSQAGIQAIKD 
---11113333-----1111---------------1111-3333--------------11 SGALPIGVGRPEDLGDDIVIVPDTSHYTLEFLKEVWLQKQK 11-------3333---------3333--------------- >ERVATAMIN C; SWP:P83654; PDB:1O0EA; LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN -----3333-----------------------------------------------3333 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASKVVSIDGYNGVPFCNEALKQ !!!!------------------3333---------------------------------- AVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQANYWIVRNSWGRYWGE -------------------------------------------1111-------1111-i KGYIRMLRVGGCGLCGIARLPYYPTKA iii------!!!!%%%%---------- >HYPOTHETICAL PROTEIN HI11; SWP:P45083; PDB:1O0IA; LWKKTFTLENLNQLCSNSAVSHLGIEISAFGEDWIEATPVDHRTQPFGVLHGGVSVALAE ------------1111----1111----------------33331111------------ TIGSLAGSLCLEEGKTVVGLDINANHLRPVRSGKVTARATPINLGRNIQVWQIDIRTEEN ------1111----------------------------------1111--------1111 KLCCVSRLTLSVINL --------------- >APOPTOSIS REGULATOR BCL-W; SWP:Q92843; PDB:1O0LA; STPASAPDTRALVADFVGYKLRQKGYVCGAGPGEGPAADPLHQAMRAAGDEFETRFRRTF 1------------------------3333------------------------------- SDLAAQLHVTPGSAQQRFTQVSDELFQGGPNWGRLVAFFVFGAALCAESVNKEMEPLVGQ ------------------------------3333-------------3333--------- VQEWMVEYLETRLADWIHSSGGWAEFTALYGDGALEEARRLREGNWASVRTVLTGAVALG -------------------------------------------------3333---1111 AL -- >SPLICING FACTOR U2AF 65 K; SWP:P26368; PDB:1O0PA; GHPTEVLCLMNMVLPEELLDDEEYEEIVEDVRDECSKYGLVKSIEIPRPVDGVEVPGCGK -------------3333----3333--------1111-----------2222--2222-- IFVEFTSVFDCQKAMQGLTGRKFANRVVVTKYCDPDSYHRRDFW -------------------------------------------- >NAD-DEPENDENT MALIC ENZYM; SWP:P27443; PDB:1O0SA; SVAHHEDVYSHNLPPMDEKEMALYKLYRPERVTPKKRSAELLKEPRLNKGMGFSLYERQY --1111----------3333----------------------------!!!!-------- LGLHGLLPPAFMTQEQQAYRVITKLREQPNDLARYIQLDGLQDRNEKLFYRVVCDHVKEL --2222------------------1111----------------------------3333 MPIVYTPTVGLACQNFGYIYRKPKGLYITINDNSVSKIYQILSNWHEEDVRAIVVTDGER 
----------------------------1111---------1111--------------- ILGLGDLGAYGIGIPVGKLALYVALGGVQPKWCLPVLLDVGTNNMDLLNDPFYIGLRHKR 2222--!!!!------------------3333-----------------1111------- VRGKDYDTLLDNFMKACTKKYGQKTLIQFEDFANPNAFRLLDKYQDKYTMFNDDIQGTAS ---------------------1111------------------1111----3333----- VIVAGLLTCTRVTKKLVSQEKYLFFGAGAASTGIAEMIVHQMQNEGISKEEACNRIYLMD -------3333----1111-----------------------1111------1111---1 IDGLVTKNRKEMNPRHVQFAKDMPETTSILEVIRAARPGALIGASTVRGAFNEEVIRAMA 111---------33331111---------------------------------------- EINERPIIFALSNPTSKAECTAEEAYTFTNGAALYASGSPFPNFELNGHTYKPGQGNNAY -------------3333-----------%%%%------------------------3333 IFPGVALGTILFQIRHVDNDLFLLAAKKVASCVTEDSLKVGRVYPQLKEIREISIQIAVE ---------1111----3333--------111133331111----3333----------- MAKYCYKNGTANLYPQPEDLEKYVRAQVYNTEYEELINATYDWPEQDMRHGFPVPVVRHD -------------------------------------------3333------------- SM -- >RIBONUCLEASE III; SWP:Q9X0I6; PDB:1O0WA; HMNESERKIVEEFQKETGINFKNEELLFRALCHSSYANEQNQAGRKDVESNEKLEFLGDA --------------------------------3333----11111111------------ VLELFVCEILYKKYPEAEVGDLARVKSAAASEEVLAMVSRKMNLGKFLFLGKGEEKTGGR -------------11113333---------------------3333-----------333 DRDSILADAFEALLAAIYLDQGYEKIKELFEQEFEFYIEKIMKGEMLFDYKTALQEIVQS 3----------------------------------------------------------- EHKVPPEYILVRTEKNDGDRIFVVEVRVNGKTIATGKGRTKKEAEKEAARIAYEKLL ---------------------------iiii-------------------------- >METHIONINE AMINOPEPTIDASE; SWP:Q9X1I7; PDB:1O0XA; MIRIKTPSEIEKMKKAGKAVAVALREVRKVIVPGKTAWDVETLVLEIFKKLRVKPAFKGY -------------------------3333--22223333---------------3333-i GGYKYATCVSVNEEVVHGLPLKEKVFKEGDIVSVDVGAVYQGLYGDAAVTYIVGETDERG iii-------!!!!------3333--2222---------iiii----------------- KELVRVTREVLEKAIKMIKPGIRLGDVSHCIQETVESVGFNVIRDYVGHGVGRELHEDPQ --------------111122223333---------1111--------------------- IPNYGTPGTGVVLRKGMTLAIEPMVSEGDWRVVVKEDGWTAVTVDGSRCAHFEHTILITE -------------2222-----------------1111----3333-------------- NGAEILTKE 
--------- >DEOXYRIBOSE-PHOSPHATE ALD; SWP:Q9X1P5; PDB:1O0YA; YRIEEAVAKYREFYEFKPVRESAGIEDVKSAIEHTNLKPFATPDDIKKLCLEARENRFHG -------------------------------------11113333--------------- VCVNPCYVKLAREELEGTDVKVVTVVGFPLGANETRTKAHEAIFAVESGADEIDMVINVG ---3333-----1111-----------------------------1111----------- MLKAKEWEYVYEDIRSVVESVKGKVVKVIIETCYLDTEEKIAACVISKLAGAHFVKTSTG -1111---------------2222------3333-------------------------- FGTGGATAEDVHLMKWIVGDEMGVKASGGIRTFEDAVKMIMYGADRIGTSSGVKIVQGGE ------------------1111-----------------1111----------------- ERYG ---- >N-ACETYLGLUCOSAMINE-6-PHO; SWP:Q9WZS1; PDB:1O12A; IVEKVLIVDPIDGEFTGDVEIEEGKIVKVEKRECIPRGVLPGFVDPHIHGVVGADTNCDF --------------------------------------------------iiii----33 SEEEFLYSQGVTTFLATTVSTSLEKKEILRKARDYILENPSTSLLGVHLEGPYISKEKKG 33--3333-------------3333-------------1111------------3333-- AHSEKHIRPPSERELSEIDSPAKLTFAPEIESSELLLRLVKRDIVLSAGHSIATFEEFKF --3333----33331111--------1111-3333---1111-----------3333--- YKEGVKRITHFPNGLKPLHHREIGITGAGLLLDDVKLELICDGVHLSREVKLVYKVKKAN 1111---------------------------1111----------------------333 GIVLVTDSISAAGLKDGTTTLGDLVVKVKDGVPRLEDGTLAGSTLFFSQAVKNFRKFTGC 3-------3333----------------iiii--1111---------------------- SITELAKVSSYNSCVELGLDDRGRIAEGTRADLVLLDEDLNVVTIKEGEVVFRSR -------------------------2222-------1111--------------- >PROBABLE NIFB PROTEIN; SWP:Q9X2D6; PDB:1O13A; IIAIPVSENRGKDSPISEHFGRAPYFAFVKVKNNAIADISVEENPLAQDHVHGAVPNFVK ---------!!!!------1111--------%%%%--------11112222--------- EKGAELVIVRGIGRRAIAAFEAGVKVIKGASGTVEEVVNQYLSGQ ----------------------------------------1111- >ANTHRANILATE PHOSPHORIBOS; SWP:P50384; PDB:1O17A; MNINEILKKLINKSDLEINEAEELAKAIIRGEVPEILVSAILVALRMKGESKNEIVGFAR ---------1111----------------------------------------------- AMRELAIKIDVPNAIDTAGGLGTVNVSTASAILLSLVNPVAKHGNRAVSGKSGSADVLEA ----------1111-------------------3333----------------------- LGYNIIVPPERAKELVNKTNFVFLFAQYYHPAMKNVANVRKTLGIRTIFNILGPLTNPAN 
-------3333-------------3333------------------3333-1111-1111 AKYQLMGVFSKDHLDLLSKSAYELDFNKIILVYGEPGIDEVSPIGNTFMKIVSKRGIEEV ---------3333-------1111----------------------------1111---- KLNVTDFGISPIPIEKLIVNSAEDSAIKIVRAFLGKDEHVAEFIKINTAVALFALDRVGD --3333------3333---------------1111------------------------- FREGYEYADHLIEKSLDKLNEIISMNGDVTKLKTIVVKS --------------------------------------- >GASTROTROPIN; SWP:P51161; PDB:1O1UA; AFTGKFEMESEKNYDEFMKLLGISSDVIEKARNFKIVTEVQQDGQDFTWSQHYSGGHTMT -----------------------------3333--------------------------- NKFTVGKESNIQTMGGKTFKATVQMEGGKLVVNFPNYHQTSEIVGDKLVEVSTIGGVTYE ------------------------------------------------------------ RVSKRLA ------- >RIBOSE-5-PHOSPHATE ISOMER; SWP:NA; PDB:1O1XA; HVKIAIASDHAAFELKEKVKNYLLGKGIEVEDHGTYSEESVDYPDYAKKVVQSILSNEAD --------3333-----------1111--------------3333--------1111--- FGILLGTGLGSIAANRYRGIRAALCLFPDARLARSHNNANILVLPGRLIGAELAFWIVDT ----------------2222------------------------1111------------ FLSTPFDGGRHERRIRKIDEV -------!!!!----3333-- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1O1YA; HVRVLAIRHVEIEDLGEDIFREKNWSFDYLDTPKGEKLERPLEEYSLVVLLGGYGAYEEE ---------3333-------1111------3333------3333----------111133 KYPFLKYEFQLIEEILKKEIPFLGILGSQLAKVLGASVYRGKNGEEIGWYFVEKVSDNKF 33-----------------------------1111----------------------333 FREFPDRLRVFQWHGDTFDLPRRATRVFTSEKYENQGFVYGKAVGLQFHIEVGARTKRWI 3-------------------1111-----3333------!!!!---------3333---- EAYKDELEKKKIDPRLLLETAEREEKVLKGLLRSLLERVES ----------------------------------------- >GLYCEROPHOSPHODIESTER PHO; SWP:NA; PDB:1O1ZA; HVIVLGHRGYSAKYLENTLEAFMKAIEAGANGVELDVRLSKDGKVVVSHDEDLKRLFGLD --------------2222-------1111----------1111----------1111--- VKIRDATVSELKELTDGKITTLKEVFENVSDDKIINIEIKEREAADAVLEISKKRKNLIF -1111---------iiii----------------------3333-----1111------- SSFDLDLLDEKFKGTKYGYLIDEENYGSIENFVERVEKERPYSLHVPYQAFELEYAVEVL -----------2222--------1111--------------------3333--------- RSFRKKGIVIFVWTLNDPEIYRKIRREIDGVITDEVELFVKLR 
---1111----------------3333------------1111 >GAMMA-GLUTAMYL PHOSPHATE ; SWP:Q9WYC9; PDB:1O20A; DELLEKAKKVREAWDVLRNATTREKNKAIKKIAEKLDERRKEILEANRIDVEKARERGVK -----------33333333-----------------------------------1111-- ESLVDRLALNDKRIDEIKACETVIGLKDPVGEVIDSWVREDGLRIARVRVPIGPIGIIYE ---3333---------------3333--2222------1111------------------ SRPNVTVETTILALKSGNTILLRGGSDALNSNKAIVSAIREALKETEIPESSVEFIENTD -3333--------1111-------3333--------------1111--1111-------3 RSLVLEIRLREYLSLVIPRGGYGLISFVRDNATVPVLETGVGNCHIFVDESADLKKAVPV 333-----1111------------------------------------11113333---- IINAKTQRPGTCNAAEKLLVHEKIAKEFLPVIVEELRKHGVEVRGCEKTREIVPDVVPAT -------1111---------3333------------1111------------1111---3 EDDWPTEYLDLIIAIKVVKNVDEAIEHIKKYSTGHSESILTENYSNAKKFVSEIDAAAVY 333--------------------------------------------------------- VNASTRFTDGGQFGFGAEIGISTQRFHARGPVGLRELTTYKFVVLGEYHVRE ---3333-3333--------------------3333---------------- >ORPHAN PROTEIN TM0875; SWP:Q9WZX8; PDB:1O22A; ILEILYYKKGKEFGILEKKKEIFNETGVSLEPVNSELIGRIFLKISVLEEGEEVPSFAIK -------2222-------------------------------------2222-------- ALTPKENAVDLPLGDWTDLKNVFVEEIDYLDSYGDKILSEKNWYKIYVPYSSVKKKNRNE -------1111----------------------------!!!!-----33331111---- LVEEFKYFFESKGWNPGEYTFSVQEI ---------1111-1111-------- >THYMIDYLATE SYNTHASE THYX; SWP:THYX_THEMA; PDB:1O26A; HMKIDILDKGFVELVDVMGNDLSAVRAARVSFDMGLKDEERDRHLIEYLMKHGHETPFEH -------------------3333----3333------3333-------------3333-- IVFTFHVKAPIFVARQWFRHRIASYNELSGRYSKLSYEFYIPSPERLEGYKTTIPPERVT ----------------3333--------3333----------3333--------3333-- EKISEIVDKAYRTYLELIESGVPREVARIVLPLNLYTRFFWTVNARSLMNFLNLRADSHA -----------------1111-3333-1111-------------------------3333 QWEIQQYALAIARIFKEKCPWTFEAFLKYAYKGDILKE ---------------------------------3333- >ALCOHOL DEHYDROGENASE, IR; SWP:Q9X022; PDB:1O2DA; VWEFYPTDVFFGEKILEKRGNIIDLLGKRALVVTGKSSSKKNGSLDDLKKLLDETEISYE -------------3333-----3333----------3333-------------------- 
IFDEVEENPSFDNVKAVERYRNDSFDFVVGLGGGSPDFAKAVAVLLKEKDLSVEDLYDRE ---------3333------1111---------3333-------333311113333--333 KVKHWLPVVEIPTTAGTGSEVTPYSILTDPEGNKRGCTLFPVYAFLDPRYTYSSDELTLS 3----------------3333-------1111--------------3333---------- TGVDALSHAVEGYLSRKSTPPSDALAIEAKIIHRNLPKAIEGNREARKKFVASCLAGVIA --------------1111------------------------------------------ QTGTTLAHALGYPLTTEKGIKHGKATGVLPFVEVKEEIPEKVDTVNHIFGGSLLKFLKEL ----3333-------------------3333--------------------------111 GLYEKVAVSSEELEKWVEKGSRAKHLKNTPGTFTPEKIRNIYREALGV 1---------------------3333---------------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1O3UA; HAKDDLEHAKHDLEHGFYNWACFSSQQAAEKAVKAVFQRGAQAWGYSVPDFLGELSSRFE ------------1111-------------------------------------------- IPEELMDHALELDKACDALPSGSPRNRYSRIEAERLVNYAEKIIRFCEDLLSRI -3333-----3333------------------------------------1111 >PROTO-ONCOGENE TYROSINE-P; SWP:P12931; PDB:1O4RA; SIQAEEWYFGKITRRESERLLLNAENPRGTFLVRESETTKGAYCLSVSDFDNAKGLNVKH 33331111--------------33332222--------2222------------------ YKIRKLDSGGFYITSRTQFNSLQQLVAYYSKHADGLCHRLTTVCP -----3333----3333----------1111-!!!!--------- >ASPARTATE AMINOTRANSFERAS; SWP:Q9X0Y2; PDB:1O4SA; VSRRISEIPISKTMELDAKAKALIKKGEDVINLTAGEPDFPTPEPVVEEAVRFLQKGEVK -3333------------------1111--------------------------1111--- YTDPRGIYELREGIAKRIGERYKKDISPDQVVVTNGAKQALFNAFMALLDPGDEVIVFSP --3333--------------------1111-------------------2222------- VWVSYIPQIILAGGTVNVVETFMSKNFQPSLEEVEGLLVGKTKAVLINSPNNPTGVVYRR -1111----------------3333---------11111111------------------ EFLEGLVRLAKKRNFYIISDEVYDSLVYTDEFTSILDVSEGFDRIVYINGFSKSHSMTGW --------------------1111---------1111----1111-------11111111 RVGYLISSEKVATAVSKIQSHTTSCINTVAQYAALKALEVDNSYMVQTFKERKNFVVERL ------------------------------------1111-------------------- KKMGVKFVEPEGAFYLFFKVRGDDVKFCERLLEEKKVALVPGSAFLKPGFVRLSFATSIE 1111------------------------------------3333---------------- RLTEALDRIEDFLNS -----------1111 >PUTATIVE OXALATE DECARBOX; SWP:Q9X113; PDB:1O4TA; 
MVVRSSEITPERISNMRGGKGEVEMAHLLSKEAMHNKARLFARMKLPPGSSVGLHKHEGE ---3333--------%%%%----------3333%%%%---------2222---------- FEIYYILLGEGVFHDNGKDVPIKAGDVCFTDSGESHSIENTGNTDLEFLAVIILL --------------iiii----2222----2222--------------------- >TYPE II QUINOLIC ACID PHO; SWP:Q9X1X8; PDB:1O4UA; MEKILDLLMSFVKEDEGKLDLASFPLRNTTAGAHLLLKTENVVASGIEVSRMFLEKMGLL -------------------33331111---------------------------1111-- SKFNVEDGEYLEGTGVIGEIEGNTYKLLVAERTLLNVLSVMFSVATTTRRFAEKLKHAKI -----2222------------------------------------------1111----- AATRKILPGLGVLQKIAVVHGGGDCVMIKDNHLKMYGSAERAVQEVRKIIPFTTKIEVEV ---------3333-------------------------------------1111------ ENLEDALRAVEAGADIVMLDNLSPEEVKDISRRIKDINPNVIVEVSGGITEENVSLYDFE -------------------------------------1111--------33333333-11 TVDVISSSRLTLQEVFVDLSLEIQR 11----3333--------------- >PHOSPHORIBOSYLAMINOIMIDAZ; SWP:Q9WYS7; PDB:1O4VA; PRVGIIMGSDSDLPVMKQAAEILEEFGIDYEITIVSAHRTPDRMFEYAKNAEERGIEVII --------3333-----------1111--------1111-----------3333------ AGAGGAAHLPGMVASITHLPVIGVPVKTSTLNGLDSLFSIVQMPGGVPVATVAINNAKNA ------------------------------iiii------------------2222---- GILAASILGIKYPEIARKVKEYKERMKREVLEKAQRLEQIGYKEYLNQK ------------------------------------------------- >PIN (PILT N-TERMINUS) DOM; SWP:O29664; PDB:1O4WA; KVRCAVVDTNVLYVYLNKADVVGQLREFGFSRFLITASVKRELEKLESLRGKEKVAARFA -------3333--------------1111------------------------------- LKLLEHFEVVETESEGDPSLIEAAEKYGCILITNDKELKRKAKQRGIPVGYLKEDKRVFV ---1111--------3333-----------------------1111-------------- ELL --- >BETA-AGARASE A; SWP:Q9RGX9; PDB:1O4YA; AQDWNGIPVPANPGNGMTWQLQDNVSDSFNYTSSEGNRPTAFTSKWKPSYINGWTGPGST -1111--------2222----1111--------2222-3333--------------!!!! 
IFNAPQAWTNGSQLAIQAQPAGNGKSYNGIITSKNKIQYPVYMEIKAKIMDQVLANAFWT --3333---------------iiii----------------------------------- LTDDETQEIDIMEGYGSDRGGTWFAQRMHLSHHTFIRNPFTDYQPMGDATWYYNGGTPWR -1111-----------3333---1111-------------------1111---iiii333 SAYHRYGCYWKDPFTLEYYIDGVKVRTVTRAEIDPNNHLGGTGLNQATNIIIDCENQTDW 3------------------iiii-----3333-1111-iiii--------------3333 RPAATQEELADDSKNIFWVDWIRVYKPVAV ----3333--1111---------------- >BETA-AGARASE B; SWP:Q9RGX8; PDB:1O4ZA; DWKDIPVPADAGPNMKWEFQEISDNFEYEAPADNKGSEFLEKWDDFYHNAWAGPGLTEWK 1111---------------3333-------1111-3333--------------!!!!--1 RDRSYVADGELKMWATRKPGSDKINMGCITSKTRVVYPVYIEARAKVMNSTLASDVWLLS 111---iiii-------2222--------------------------------------1 ADDTQEIDILEAYGADYSESAGKDHSYFSKKVHISHHVFIRDPFQDYQPKDAGSWFEDGT 111---------------1111-----1111-------------------3333------ VWNKEFHRFGVYWRDPWHLEYYIDGVLVRTVSGKDIIDPKHFTNTTDPGNTEIDTRTGLN 3333------------------iiii-------11111111-----2222---------- KEMDIIINTEDQTWRSSPASGLQSNTYTPTDNELSNIENNTFGVDWIRIYKPVEK -----------3333--33333333----3333---3333--------------- >CBS DOMAIN-CONTAINING PRE; SWP:NA; PDB:1O50A; MKVKDVCKLISLKPTVVEEDTPIEEIVDRILEDPVTRTVYVARDNKLVGMIPVMHLLKVS -33331111--------1111-----------3333------%%%%-------------- GFHFFGFIPSMKRLIAKNASEIMLDPVYVHMDTPLEEALKLMIDNNIQEMPVVDEKGEIV -----------------3333--------1111--------------------------- GDLNSLEILLALWKGREK ---3333----------- >HYPOTHETICAL PROTEIN TM00; SWP:Y021_THEMA; PDB:1O51A; HKLLKIYLGEKDKHSGKPLFEYLVKRAYELGKGVTVYRGIGFGHPDLPIVLEIVDEEERI --------1111-iiii----------1111----------------------------- NLFLKEIDNIDFDGLVFTADVNVK -----3333--------------- >SAM-DEPENDENT O-METHYLTRA; SWP:NA; PDB:1O54A; HVGKVADTLKPGDRVLLSFEDESEFLVDLEKDKKLHTHLGIIDLNEVFEKGPGEIIRTSA ---3333--2222-----1111-------2222---1111----3333--2222---111 GKKGYILIPSLIDEIMNMKRTQIVYPKDSSFIAMMLDVKEGDRIIDTGVGSGAMCAVLAR 1-----------------------3333----------2222------!!!!-------- AVGSSGKVFAYEKREEFAKLAESNLTKWGLIERVTIKVRDISEGFDEKDVDALFLDVPDP 
--1111-------------------11113333------3333---------------33 WNYIDKCWEALKGGGRFATVCPTTNQVQETLKKLQELPFIRIEVWESLFRPYKPVPERLR 33-----33332222------------------1111-----------------1111-- PVDRMVAHTAYMIFATKVCRREE ----------------------- >PUR OPERON REPRESSOR; SWP:P37551; PDB:1O57A; KFRRSGRLVDLTNYLLTHPHELIPLTFFSERYESAKSSISEDLTIIKQTFEQQGIGTLLT --------------1111-----3333--------------------------------- VPGAAGGVKYIPKMKQAEAEEFVQTLGQSLANPERILPGGYVYLTDILGKPSVLSKVGKL --1111-------------------------3333--------3333------------- FASVFAEREIDVVMTVATKGIPLAYAAASYLNVPVVIVRKDGSTVSINYVSGSSNRIQTM ----1111---------1111--------------------------------------- SLAKRSMKTGSNVLIIDDFMKAGGTINGMINLLDEFNANVAGIGVLVEAEGVDERLVDEY --1111-2222--------------------3333------------------------- MSLLTLSTINMKEKSIEIQNGNFLRFFKDN ---------------------3333----- >O-ACETYLSERINE SULFHYDRYL; SWP:NA; PDB:1O58A; HMMERLIGSTPIVRLDSIDSRIFLKLEKNNPGGSVKDRPALFMILDAEKRGLLKNGIVEP 3333--------------1111-------1111-------------------1111---- TSGNMGIAIAMIGAKRGHRVILTMPETMSVERRKVLKMLGAELVLTPGELGMKGAVEKAL ------------------------1111--------1111------3333---------- EISRETGAHMLNQFENPYNVYSHQFTTGPEILKQMDYQIDAFVAGVGTGGTISGVGRVLK -----------1111-------------------%%%%---------------------- GFFGNGVKIVAVEPAKSPVLSGGQPGKHAIQGIGAGFVPKILDRSVIDEVITVEDEEAYE --!!!!-------11113333--------2222-----11113333-------------- MARYLAKKEGLLVGISSGANVAAALKVAQKLGPDARVVTVAPDHAERYLSIL ---------------------------11111111--------11113333- >ALLANTOICASE; SWP:P25335; PDB:1O59A; HKFFSLADEAEFKSIIISKNKAVDVIGSKLGGQVVSFSDEWFASAENLIQPTAPIRDWYD ----3333---------------11111111--------11113333------------- GWETRRHNEEYDWVIIKGVAAAHIIGGEIDTAFFNGNHAPFVSIEALYDEGEEGNIVEDD ------------------------------2222----------------------1111 SRWVEIVEKFECGPSQRHLFVRGNGLTKERFTHIKLKYPDGGIARFRLYGRVVPPHIIDL ---------------------1111---------------------------------11 AYVCNGAVALKYSDQHFGSVDNLLLPGRGHDSDGWETKRSRQPGHTDWAVIQLGRESSFI 111111---------------1111----------------2222--------------- 
EKIIVDTAHFRGNFPQFITVEGCLKTWVELVGKSKTGPDKEHVYEIRKSIRVSHVKLTII ------2222-------------------------------------------------- PDGGVKRIRVWGY ------------- >FORMIMINOTETRAHYDROFOLATE; SWP:A1GJS1; PDB:1O5HA; EVERLSLKEFCDVAERKPTPGGGAVGSVVGAACALAEVANFTRKKKGYEDVEPEERIVEA 3333---------------------------------3333---2222--3333------ EEARLKLFDLAKKDEAFEKVKAYKSSEGELQNALKEAASVPDVIRVKDLAHELEKLAEFG ---------------------1111----------------------------------- NKNLASDTLNAADLCHAVFQVEKVNVLINLKEISDETFRKNLEELEEQEAQIEGCYQRVK 1111------------------------3333---------------------------- KLEGIVWSS --------- >3-OXOACYL-(ACYL CARRIER P; SWP:Q9X0Q1; PDB:1O5IA; GIRDKGVLVLAASRGIGRAVADVLSQEGAEVTICARNEELLKRSGHRYVVCDLRKDLDLL -2222-------------------1111-----------------------3333----- FEKVKEVDILVLNAGGPKAGFFDELTNEDFKEAIDSLFLNMIKIVRNYLPAMKEKGWGRI --------------------3333-3333------------------------------- VAITSFSVISPIENLYTSNSARMALTGFLKTLSFEVAPYGITVNCVAPGWTETERVKELL -----3333--1111----------------33333333-------------3333---- SEEKKKQVESQIPMRRMAKPEEIASVVAFLCSEKASYLTGQTIVVDGGLSKFPL -------11111111---3333---------3333----------iiii----- >PERIPLASMIC DIVALENT CATI; SWP:NA; PDB:1O5JA; HILVYSTFPNEEKALEIGRKLLEKRLIACFNAFEIRSGYWWKGEIVQDKEWAAIFKTTEE ----------------------------------------iiii-------------333 KEKELYEELRKLHPYETPAIFTLKVENVLTEYNWLRESVL 3---------------------------3333----1111 >DIHYDRODIPICOLINATE SYNTH; SWP:Q9X1K9; PDB:1O5KA; HMFRGVGTAIVTPFKNGELDLESYERLVRYQLENGVNALIVLGTTGESPTVNEDEREKLV --------------iiii-------------1111-------33333333---------- SRTLEIVDGKIPVIVGAGTNSTEKTLKLVKQAEKLGANGVLVVTPYYNKPTQEGLYQHYK ------iiii----------------------1111------------------------ YISERTDLGIVVYNVPGRTGVNVLPETAARIAADLKNVVGIEANPDIDQIDRTVSLTKQA --------------3333----------------3333---------------------- RSDFMVWSGNDDRTFYLLCAGGDGVISVVSNVAPKQMVELCAEYFSGNLEKSREVHRKLR 1111-----1111----3333------3333------------1111------------- PLMKALFVETNPIPVKAALNLMGFIENELRLPLVPASEKTVELLRNVLKESGLL 
------------------------------------------------1111-- >TRANSCRIPTIONAL REGULATOR; SWP:Q9X0Q3; PDB:1O5LA; MDLKKLLPCGKVIVFRKGEIVKHQDDPIEDVLILLEGTLKTEHVSENGKTLEIDEIKPVQ -----3333------2222---2222------------------1111------------ IIASGFIFSSEPRFPVNVVAGENSKILSIPKEVFLDLLMKDRELLLFFLKDVSEHFRVVS --3333------------------------------------------------------ EKLFFLTTK --------- >URACIL PHOSPHORIBOSYLTRAN; SWP:Q9WZI0; PDB:1O5OA; HMKNLVVVDHPLIKHKLTIMRDKNTGPKEFRELLREITLLLAYEATRHLKCEEVEVETPI -1111----------------1111------------------1111----------333 TKTIGYRINDKDIVVVPILRAGLVMADGILELLPNASVGHIGIYRDPETLQAVEYYAKLP 3-------3333--------------------1111------------------------ PLNDDKEVFLLDPMLATGVSSIKAIEILKENGAKKITLVALIAAPEGVEAVEKKYEDVKI --3333----------------------1111----------------------1111-- YVAALDERLNDHGYIIPGLGDAGDRLFRTK ---------1111----------------- >NOVEL THERMOTOGA MARITIMA; SWP:Q9X0J6; PDB:1O5UA; EVKIEKPTPEKLKELSVEKWPIWEKEVSEFDWYYDTNETCYILEGKVEVTTEDGKKYVIE ---------------3333-------------------------------1111-----2 KGDLVTFPKGLRCRWKVLEPVRKHYNLF 222----2222----------------- >AMINE OXIDASE [FLAVIN-CON; SWP:P21396; PDB:1O5WA; AGHMFDVVVIGGGISGLAAAKLLSEYKINVLVLEARDRVGGRTYTVRNEHVKWVDVGGAY ------------3333-------1111------------!!!!----3333--------- VGPTQNRILRLSKELGIETYKVNVNERLVQYVKGKTYPFRGAFPPVWNPLAYLDYNNLWR ------------1111-------------------------------------------- TMDEMGKEIPVDAPWQARHAQEWDKMTMKDLIDKICWTKTAREFAYLFVNINVTSEPHEV -----111111111111-3333---------------------------------1111- SALWFLWYVRQCGGTARIFSVTNGGQERKFVGGSGQVSEQIMGLLGDKVKLSSPVTYIDQ ---------1111-------------------3333-----33331111----------- TDDNIIVETLNHEHYECKYVISAIPPILTAKIHFKPELPPERNQLIQRLPMGAVIKCMVY ------------------------33333333------------1111------------ YKEAFWKKKDYCGCMIIEDEEAPIAITLDDTKPDGSLPAIMGFILARKADRLAKLHKDIR ---3333------------------------1111------------------------- KRKICELYAKVLGSQEALYPVHYEEKNWCEEQYSGGCYTAYFPPGIMTQYGRVIRQPVGR --------------3333--------3333------------2222---3333------- 
IYFAGTETATQWSGYMEGAVEAGERAAREVLNALGKVAKKDIWVEEPESKDVPAIEITHT ----3333---22223333------------1111-------------1111-------3 FLERNLPSVPGLLKITGVSTSVALLCFVLYK 333-----------------------3333- >TRIOSEPHOSPHATE ISOMERASE; SWP:Q07412; PDB:1O5XA; RKYFVAANWKCNGTLESIKSLTNSFNNLDFDPSKLDVVVFPVSVHYDHTRKLLQSKFSTG ------------------------------1111-------1111--------3333--- IQNVSKFGNGSYTGEVSAEIAKDLNIEYVIIGHFERRKYFHETDEDVREKLQASLKNNLK -----------2222------1111-------3333------------------1111-- AVVCFGESLEQREQNKTIEVITKQVKAFVDLIDNFDNVILVYEPLWAIGTGKTATPEQAQ -----------1111------------3333------------3333------------- LVHKEIRKIVKDTCGEKQANQIRILYGGSVNTENCSSLIQQEDIDGFLVGNASLKESFVD ------------------------------1111-3333-1111-----3333-1111-- IIKSAM --1111 >folylpolyglutamate syntha; SWP:NA; PDB:1O5ZA; HMALEVLRYLYHKVKPGLERISMLLSKLGNPHLEYKTIHIGGTNGKGSVANMVSNILVSQ ---------3333------------11113333--------------------------- GYRVGSYYSPHLSTFRERIRLNEEYISEEDVVKIYETMEPILNELDKEEIFSPSFFEVVT -------------3333---%%%%-----------------------3333--------- AMAFLYFAEKNVDIAVLEVGLGGRLDATNVVFPLCSTIVTVDRYTIEQIAWEKSGIIKER -------1111--------------3333------------------------1111222 VPLVTGERKREALKVMEDVARKKSSRMYVIDKDFSVKVKSLKLHENRFDYCGENTFEDLV 2---------------------------2222---------2222--------------- LTMNGPHQIENAGVALKTLEATGLPLSEKAIREGLKNAKNLGRFEILEKNGKMYILDGAH ----3333-----------1111----------------2222-----iiii-------- NPHGAESLVRSLKLYFNGEPLSLVIGILDDKNREDILRKYTGIFERVIVTRVPSPRMKDM ----------3333-1111--------1111------1111------------3333--- NSLVDMAKKFFKNVEVIEDPLEAIESTERATVVTGSLFLVGYVREFLTTGKINEEWKL ------3333-------------1111-------------------------3333-- >2-DEHYDRO-3-DEOXYPHOSPHOO; SWP:P45251; PDB:1O60A; QNKIVKIGNIDVANDKPFVLFGGNVLESRDAQVCEAYVKVTEKLGVPYVFKASFDKANRS ------!!!!--1111-------------------------------------------- SIHSYRGPGEEGLKIFQELKDTFGVKIITDVHEIYQCQPVADVVDIIQLPAFLARQTDLV 1111----------------------------3333-------------3333------- EAAKTGAVINVKKPQFLSPSQGNIVEKIEECGNDKIILCDRGTNFGYDNLIVDLGFSVKK 
--3333-------11113333-------1111---------------------3333--1 ASKGSPVIFDVTHSLQRAQVTELARSGLAVGIAGLFLEAHPNPNQAKCDGPSALPLSALE 111-------1111--1111-------3333----------1111----1111-3333-- GFVSQKAIDDLVKSFPELDT -----------1111----- >ATP PHOSPHORIBOSYLTRANSFE; SWP:Q9X0D2; PDB:1O63A; LKLAIPKGRLEEKVTYLKKTGVIFERESSILREGKDIVCFVRPFDVPTYLVHGVADIGFC -------1111------1111------1111--2222----3333----1111------- GTDVLLEKETSLIQPFFIPTNISRVLAGPKGRGIPEGEKRIATKFPNVTQRYCESKGWHC ----------------------------2222---------------------1111--- RIIPLKGSVELAPIAGLSDLIVDITETGRTLKENNLEILDEIFVIRTHVVVNPVSYRTKR -------3333-1111------------------------------------3333---- EEVVSFLEKLQEVIEHDSNE -------------------- >HYPOTHETICAL PROTEIN YIIM; SWP:NA; PDB:1O65A; KFLVEREQMRYPVDVYTGKAKIQVDGELMLTELGLEGDEQAVHGGPDRALCHYPREHYLY -3333-------------------------11112222-----3333------------- WAREFPEQAELFVAPAFGENLSTDGLTESNVYMGDIFRWGEALIQVSQPRSPCYKLNYHF ----33333333--1111--------1111-2222---!!!!------------------ DISDIAQLMQNTGKVGWLYSVIAPGKVSADAPLELVSRVSDVTVQEAAAIAWHMPFDDDQ -1111----------------------3333----------------------------- YHRLLSAAGLSKSWTRTMQKRRLSGKIEDFSRRLWGKE -------------------------------------- >3-METHYL-2-OXOBUTANOATE H; SWP:Q9JZW6; PDB:1O66A; LITVNTLQKKAAGEKIALTAYESSFAALDDAGVELLVGDSLGAVQGRKSTLPVSLRDCYH ---------1111---------------1111-----------------11113333--- TECVARGAKNAIVSDLPFGAYQQSKEQAFAAAAELAAGAHVKLEGGVWAETTEFLQRGIP ----------------2222---------------------------------------- VCAHIGLTPQSVFAKAQALLNDAKAHDDAGAAVVLECVLAELAKKVTETVSCPTIGIGAG -------3333---------------1111------------------------------ ADCDGQVLVHDLGIFPGKTAKFVKNFQGHDSVQAAVRAYVAEVKAKTFPAAEHI -------------------1111--------------------------3333- >AMINOTRANSFERASE; SWP:Q9S5Y7; PDB:1O69A; GNELKYIEEVFKSNYIAPLGEFVNRFEQSVKDYSKSENALALNSATAALHLALRVAGVKQ -----------------2222--------------------------------3333-22 DDIVLASSFTFIASVAPICYLKAKPVFIDCDETYNIDVDLLKLAIKECEKKPKALILTHL 22--------3333----------------1111------------------------22 
YGNAAKMDEIVEICKENDIVLIEDAAEALGSFYKNKALGTFGEFGVYSYNGNKIITTSGG 22---3333-----1111------1111----%%%%2222---------1111------- GMLIGKNKEKIEKARFYSTQARENCLHYEHLDYGYNYRLSNVLGAIGVAQMEVLEQRVLK ------------------------------------------------------------ KREIYEWYKEFLGEYFSFLDELENSRSNRWLSTALINFDKNELNACQKDINISQKNITLH ---------------------2222-------------3333------------------ PKISKLIEDLKNKQIETRPLWKAMHTQEVFKGAKAYLNGNSELFFQKGICLPSGTAMSKD ----------1111--------11113333-----------------------1111--- DVYEISKLILKSIK -------------- >PUTATIVE FLAGELAR MOTOR S; SWP:A1GGY7; PDB:1O6AA; SDKLELLLDIPLKVTVELGRTRTLKRVLEIHGSIIELDKLTGEPVDILVNGKLIARGEVV 3333--1111-------------------2222------2222-----iiii-------- VIDENFGVRITEIVSPKERLELLNE -!!!!----------------1111 >PHOSPHOPANTETHEINE ADENYL; SWP:O34797; PDB:1O6BA; ASIAVCPGSFDPVTYGHLDIIKRGAHIFEQVYVCVLNNSSKKPLFSVEERCELLREVTKD -------------------------------------------------------1111- IPNITVETSQGLLIDYARRKNAKAILRGLRAVSDFEYEQGTSVNRVLDESIETFFANNQY 1111-------3333--1111---------3333-------------1111------111 SFLSSSIVKEVARYDGSVSEFVPPEVELALQQKFRQGGSH 1----------1111--1111-----------1111---- >UDP-N-ACETYLGLUCOSAMINE 2; SWP:P39131; PDB:1O6CA; KKLKVTVFGTRPEAIKAPLVLELKKYPEIDSYVTVTAQHRQLDQVLDAFHIKPDFDLNIK ---------3333-------3333-3333------------------------------- ERQTLAEITSNALVRLDELFKDIKPDIVLVHGDTTTTFAGSLAAFYHQIAVGHVEAGLRT ---------------------------------3333-------1111------------ GNKYSPFPEELNRQTGAIADLHFAPTGQAKDNLLKENKKADSIFVTGNTAIDALNTTVRD -------------------------3333----1111-----------33331111---- GYSHPVLDQVGEDKILLTENFKAIRRIVGEFEDVQVVYPPVVREAAHKHDSDRVHLIEPL ---33333333-------------------1111-------------------------- EVIDFHNFAAKSHFILTDGVQEEAPSLGKPVLVLRDTTEGVEAGTLKLAGTDEENIYQLA ------------------11113333---------------------------------- KQLLTDPDEYKKSQASNPYGDGEASRRIVEELLFHYGYRKEQPDSFT ------3333------1111-------------1111---------- >HYPOTHETICAL UPF0247 PROT; SWP:Q9WZU8; PDB:1O6DA; LRVRIAVIGKLDGFIKEGIKHYEKFLRRFCKPEVLEIKRVHRGSIEEIVRKETEDLTNRI 
-------------------------3333---------------------------1111 LPGSFVMVMDKRGEEVSSEEFADFLKDLEMKGKDITILIGGPYGLNEEIFAKAHRVFSLS 2222-----1111---------------------------1111-33331111------- KMTFTHGMTVLIVLEQIFRAFKIIHGE --------------------------- >CAPSID PROTEIN P40; SWP:P03234; PDB:1O6EA; APSVYVCGFVERPDAPPKDACLHLDPLTVKSQLPLKKPLPLTVEHLPDAPVGSVFGLYQS -----------1111----1111-------------------%%%%-------------- SAGLFSAASITSGDFLSLLDSIYHDCDIAQSQRLPLPREPKVEALHAWLPSLSLASLHPD -----------3333-----3333-3333------------------------------- IPQTTADGGKLSFFDHVSICALGRRRGTTAVYGTDLAWVLKHFSDLEPSIAAQIENDANA ---------------------------------------3333----------------- AKRHPLPLTKLIAKAIDAGFLRNRVETLRQDRGVANIPAESYLKA -----------------------3333-----1111--------- >RAC-BETA SERINE/THREONINE; SWP:P31751; PDB:1O6LA; KVTMNDFDYLKLLGKGTFGKVILVREKATGRYYAMKILRKEVIIAKDEVAHTVTESRVLQ --1111---------1111----------------------------------------- NTRHPFLTALKYAFQTHDRLCFVMEYANGGELFFHLSRERVFTEERARFYGAEIVSALEY ---1111-------------------11113333-------------------------- LHSRDVVYRDIKLENLMLDKDGHIKITDFGLCKEGISDGATMKFCGTPEYLAPEVLEDND -----------3333---1111------1111----!!!!------3333-3333----- YGRAVDWWGLGVVMYEMMCGRLPFYNQDHERLFELILMEEIRFPRTLSPEAKSLLAGLLK -------------------------------------------1111------------- KDPKQRLGGGPSDAKEVMEHRFFLSINWQDVVQKKLLPPFKPQVTSEVDTRYFDDEFTAQ -333322221111------3333---33331111-----------111111113333--- SITQEMFEDFDYIADW ---------------- >Epithelial-cadherin [Prec; SWP:P12830; PDB:1O6SB; SVIPPISCPENEKGPFPKNLVQIKSNKDKEGKVFYSITGQGADTPPVGVFIIERETGWLK --------------------------3333--------2222------------------ VTEPLDRERIATYTLFSHAVSSNGNAVEDPMEILITVTDQ -----3333-----------1111---------------- >INTERNALIN A; SWP:P25146; PDB:1O6VA; LGSATITQDTPINQIFTDTALAEKMKTVLGKTNVTDTVSQTDLDQVTTLQADRLGIKSID ----------1111------------------1111--33331111------------22 GVEYLNNLTQINFSNNQLTDITPLKNLTKLVDILMNNNQIADITPLANLTNLTGLTLFNN 22--1111------------3333--1111------------3333--1111-------- 
QITDIDPLKNLTNLNRLELSSNTISDISALSGLTSLQQLSFGNQVTDLKPLANLTTLERL ----3333--1111------------3333-----------------3333--3333--- DISSNKVSDISVLAKLTNLESLIATNNQISDITPLGILTNLDELSLNGNQLKDIGTLASL ---------3333--1111------------3333--1111------------3333--1 TNLTDLDLANNQISNLAPLSGLTKLTELKLGANQISNISPLAGLTALTNLELNENQLEDI 111------------3333--1111------------3333--1111------------- SPISNLKNLTYLTLYFNNISDISPVSSLTKLQRLFFYNNKVSDVSSLANLTNINWLSAGH -33331111------------------1111------------1111--1111------- NQISDLTPLANLTRITQLGLNDQAWTNAPVNYKANVSIPNTVKNVTGALIAPATISDGGS -----3333--1111----------------------------1111--------%%%%- YTEPDITWNLPSYTNEVSYTFSQPVTIGKGTTTFSGTVTQPLKA -------------------------------------------- >PRE-MRNA PROCESSING PROTE; SWP:P33203; PDB:1O6WA; MSIWKEAKDASGRIYYYNTLTKKSTWEKPKELISQEELLLRENGWKAAKTADGKVYYYNP --------1111------1111------3333-----------------1111------- TTRETSWTIPAFEKK --------------- >PROCARBOXYPEPTIDASE A2; SWP:P48052; PDB:1O6XA; MRSLETFVGDQVLEIVPSNEEQIKNLLQLEAQEHLQLDFWKSPTTPGETAHVRVPFVNVQ ------------------------------------------------------3333-- AVKVFLESQGIAYSIMIEDVQ --------------------- >PROBABLE SERINE/THREONINE; SWP:P71584; PDB:1O6YA; TPSHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRADLARDPSFYLRFRREAQN ----%%%%---------------------------------3333--3333--------- AAALNHPAIVAVYDTGEAETPAGPLPYIVMEYVDGVTLRDIVHTEGPMTPKRAIEVIADA -----1111----------3333------------------------------------- CQALNFSHQNGIIHRDVKPANIMISATNAVKVMDFGIARAIAQYLSPEQARGDSVDARSD -----------------3333---1111-------------1111----------3333- VYSLGCVLYEVLTGEPPFTGDSPVSVAYQHVREDPIPPSARHEGLSADLDAVVLKALAKN ------------------------------------3333--------------1111-3 PENRYQTAAEMRADLVRVHNG 333------------------ >MALATE DEHYDROGENASE; SWP:Q07841; PDB:1O6ZA; TKVSVVGAAGTVGAAAGYNIALRDIADEVVFVDIPDKEDDTVGQAADTNHGIAYDSNTRV ------1111-3333------------------1111------------1111------- RQGGYEDTAGSDVVVITAGIPRQPGQTRIDLAGDNAPIMEDIQSSLDEHNDDYISLTTSN ---33332222------------------------------------------------- 
PVDLLNRHLYEAGDRSREQVIGFGGRLDSARFRYVLSEEFDAPVQNVEGTILGEHGDAQV 3333-----3333---1111----------------------3333-------------- PVFSKVSVDGTDPEFSGDEKEQLLGDLQESAMDVIERKGATEWGPARGVAHMVEAILHDT -3333--iiii--------------------------------------------1111- GEVLPASVKLEGEFGHEDTAFGVPVSLGSNGVEEIVEWDLDDYEQDLMADAAEKLSDQYD -----------2222------------1111----------------------------- KIS --- >FASCICLIN I; SWP:P10674; PDB:1O70A; AENGALRKFYEVIMDNGGAVLDDINSLTEVTILAPSNEAWNSSNINNVLRDRNKMRQILN -----------------3333--1111--------3333-------3333---------- MHIIKDRLNVDKIRQKNANLIAQVPTVNNNTFLYFNVRGEGSDTVITVEGGGVNATVIQA --------------1111---------------------!!!!------iiii------- DVAQTNGYVHIIDHVLGVPYTTVLGKLESDPMMSDTYKMGKFSHFNDQLNNTQRRFTYFV ---1111---------------------------------1111---------------- PRDKGWQKTELDYPSAHKKLFMADFSYHSKSILERHLAISDKEYTMKDLVKFSQESGSVI ---------------------3333--------1111-------3333------------ LPTFRDSLSIRVEEEAGRYVIIWNYKKINVYRPDVECTNGIIHVIDYPLLEEKDVVV --1111--------iiii----!!!!-------------------------1111-- >TRYPAREDOXIN; SWP:O77404; PDB:1O73A; MSGLAKYLPGATNLLSKSGEVSLGSLVGKTVFLYFSASWCPPCRGFTPVLAEFYEKHHVA -3333---1111---------33332222----------3333---------------11 KNFEVVLISWDENESDFHDYYGKMPWLALPFDQRSTVSELGKTFGVESIPTLITINADTG 11------------------1111-----33333333----1111--------------- AIIGTQARTRVIEDPDGANFPWPN -------------1111------- >47 KDA MEMBRANE ANTIGEN; SWP:P29723; PDB:1O75A; ETSYGYATLSYADYWAGELGQSRDVLLADLDAGMFDAVSRATHGHGAFRQQFQYAVEVLG ---------3333---1111-3333----------------------------------- EKVLSKQETEDSRGRKKWEYETDPSVTKMVRASASFQDLGEDGEIKFEAVEGAVALADRA ----------1111--------1111-------------1111------2222---2222 SSFMVDSEEYKITNVKVHGMKFVPVAVPHELKGIAKEKFHFVEDSRVTENTNGLKTMLTE ----%%%%-------------------3333------------11111111--------- DSFSARKVSSMESPHDLVVDTVGTGYHSRFGSDAEASVMLKRADGSELSHREFIDYVMNF ------------1111-------------------------1111--------------- NTVRYDYYGDDASYTNLMASYGTKHSADSWWKTGRVPRISCGINYGFDRFKGSGPGYYRL 
-------!!!!--------------1111--------------11111111--------- TLIANGYRDVVADVRFLPKYEGNIDIGLKGKVLTIGGADAETLMDAAVDVFADGQPKLVS ---2222-------------------------------33332222-----2222----- DQAVSLGQNVLSADFTPGTEYTVEVRFKEFGSVRAKVVA --------------------------------------- ------------------------------------------------------------ ------------------------ >Tumor necrosis factor-ind; SWP:P98066; PDB:1O7BT; GVYHREARSGKYKLTYAEAKAVCEFEGGHLATYKQLEAARKIGFHVCAAGWMAKGRVGYP -------------------------------3333--3333------------------- IVKPGPNCGFGKTGIIDYGIRLNRSERWDAYCYNPHAK -------------------------------------- >LYSOSOMAL ALPHA-MANNOSIDA; SWP:Q29451; PDB:1O7DA; GYKTCPKVKPDMLNVHLVPHTHDDVGWLKTVDQYFYGIYNNIQPAGVQYILDSVISSLLA 1111----------------------------------3333------------------ NPTRRFIYVEIAFFSRWWRQQTNATQKIVRELVRQGRLEFANGGWVMNDEATTHYGAIID 3333-------------1111--------------------------------------- QMTLGLRFLEETFGSDGRPRVAWHIDPFGHSREQASLFAQMGFDGFFFGRLDYQDKKVRK -------------3333-------------3333----1111-----------------1 KTLQMEQVWRASTSLKPPTADLFTSVLPNMYNPPEGLCWDMLCADKPVVEDTRSPEYNAK 111------------------------------------1111-------1111------ ELVRYFLKLATDQGKLYRTKHTVMTMGSDFQYENANTWFKNLDKLIQLVNAQ -------------1111----------------------------------- >Lysosomal alpha-mannosida; SWP:Q29451; PDB:1O7DB; IRVNVLYSTPACYLWELNKANLSWSVKKDDFFPYADGPYMFWTGYFSSRPALKRYERLSY ------------------------------------2222--1111-------------- NFLQVCNQLEALAGPA ---------------- >Lysosomal alpha-mannosida; SWP:Q29451; PDB:1O7DC; GDSAPLNEAMAVLQHHDAVSGTSRQHVANDYARQLSEGWRPCEVLMSNALAHLSGLKEDF -----------------3333--------------------------------------- AFCRKLNISICPLTQTAERFQVIVYNPLGRKVDWMVRLPVSKHVYLVKDPGGKIVPSDVV ----3333--3333----------------------------------3333-------- TIPSSDSQELLFSALVPAVGFSIYSVSQMPN -2222-----------2222----------- >Lysosomal alpha-mannosida; SWP:Q29451; PDB:1O7DD; RDLVIQNEYLRARFDPNTGLLMELENLLLLPVRQAFYWYNASTGNNLSSQASGAYIFRPN ------1111----------------------------------3333------------ 
QNKPLFVSHWAQTHLVKASLVQEVHQNFSAWCSQVVRLYPRQRHLELEWTVGPIPVGDGW -----------------1111-------1111------2222-------------1111- GKEVISRFDTALATRGLFYTDSNGREILERRRNYRPTWKLNQTEPVAGNYYPVNSRIYIT -------------iiii----%%%%---------3333-------1111----------- DGNMQLTVLTDRSQGGSSLRDGSLELMVHRRLLKDDARGVGEPLNKEGSGLWVRGRHLVL ----------------------------------------------!!!!---------- LDKKETAAARHRLQAEMEVLAPQVVLAQGG --3333------------------------ >Lysosomal alpha-mannosida; SWP:Q29451; PDB:1O7DE; PRTQFSGLRRELPPSVRLLTLARWGPETLLLRLEHQFAVGEDSGRNLSSPVTLDLTNLFS ------------1111---------------------2222iiii---------2222-- AFTITNLRETTLAANQLLAYASRLQWTTDATITLQPMEIRTFLASVQWE ----------1111--3333--------------2222----------- >L2 BETA LACTAMASE; SWP:Q9RBQ1; PDB:1O7EA; TDAAITAASDFAALEKACAGRLGVTLLDTASGRRIGHRQDERFPMCSTFKSMLAATVLSQ -------------------------------------1111------------------- AERMPALLDRRVPVGEADLLSHAPVTRRHAGKDMTVRDLCRATIITSDNTAANLLFGVVG ---1111-------3333------33332222---------------------------- GPPAVTAFLRASGDTVSRSDRLEPELNSFAKGDPRDTTTPAAMAATLQRVVLGEVLQPAS ---------1111----------------2222--------------------------- RQQLADWLIDNETGDACLRAGLGKRWRVGDKTGSNGEDARNDIAVLWPVAGGAPWVLTAY -------------11113333-3333---------------------1111--------- LQAGAISYEQRASVLAQVGRIADRLIG --1111--------------------- >CAMP-DEPENDENT RAP1 GUANI; SWP:Q9Z1P0; PDB:1O7FA; AEWIACLDKRPLERSSEDVDIIFTRLKGVKAFEKFHPNLLRQICLCGYYENLEKGITLFR ---------3333------------11111111--3333--------------------2 QGDIGTNWYAVLAGSLDVKVSETSSHQDAVTICTLGIGTAFGESILDNTPRHATIVTRES 222---------------------3333-------2222----3333------------- SELLRIEQEDFKALWEKYRQYMAGLLAPPYGVMETVPSEKILRAGKILRIAILSRAPHMI ------------------1111---------------------------------3333- RDRKYHLKTYRQCCVGTELVDWMIQQTSCVHSRTQAVGMWQVLLEDGVLNHVDQERHFQD -------------------------------------------1111------------- KYLFYRFLDDEREDAPLPTEEEKKECDEELQDTMLLLSQMGPDAHMRMILRKPPGQRTVD ------3333------------------------------------------1111-333 
DLEIIYDELLHIKALSHLSTTVKRELAGVLIFESHAKGGTVLFNQGEEGTSWYIILKGSV 3-----3333-3333---------3333---------------2222------------- NVVIYGKGVVCTLHEGDDFGKLALVNDAPRAASIVLREDNCHFLRVDKEDFNRILRDVEA ---2222------2222----1111------------------------------1111- N - >SINGLE STRANDED DNA BINDI; SWP:Q97W73; PDB:1O7IA; MEEKVGNLKPNMESVNVTVRVLEASEARQIQTKNGVRTISEAIVGDETGRVKLTLWGKHA ---3333------------------------1111----------1111------!!!!- GSIKEGQVVKIENAWTTAFKGQVQLNAGSKTKIAEASEDGFPESSQIPENTPTAP ---2222-----------iiii-----1111------2222-3333--------- >L-ASPARAGINASE; SWP:P06608; PDB:1O7JA; KLPNIVILATGGTIAGSAATGTQTTGYKAGALGVDTLINAVPEVKKLANVKGEQFSNMAS -----------3333----1111--------------------1111-----------33 ENMTGDVVLKLSQRVNELLARDDVDGVVITHGTDTVEESAYFLHLTVKSDKPVVFVAAMR 33------------------1111----------3333---------------------- PATAISADGPMNLLEAVRVAGDKQSRGRGVMVVINDRIGSARYITKTNASTLDTFRANEE 1111-----------------1111--------iiii--3333-------1111--1111 GYLGVIIGNRIYYQNRIDKLHTTRSVFDVRGLTSLPKVDILYGYQDDPEYLYDAAIQHGV ------%%%%----------!!!!----1111-----------22223333----1111- KGIVYAGMGAGSVSVRGIAGMRKALEKGVVVMRSTRTGNGIVPPDEELPGLVSDSLNPAH ------------------------1111----------------1111----!!!!---- ARILLMLALTRTSDPKVIQEYFHTY -------3333-------------- >NAPHTHALENE 1,2-DIOXYGENA; SWP:P23094; PDB:1O7NA; MNYNNKILVSESGLSQKHLIHGDEELFQHELKTIFARNWLFLTHDSLIPAPGDYVTAKMG -3333----2222----3333----------------------3333--2222-----!! 
IDEVIVSRQNDGSIRAFLNVCRHRGKTLVSVEAGNAKGFVCSYHGWGFGSNGELQSVPFE !!------1111------------------------------------1111------33 KDLYGESLNKKCLGLKEVARVESFHGFIYGCFDQEAPPLMDYLGDAAWYLEPMFKHSGGL 33-!!!!-3333-----------iiii-----1111-3333-!!!!-3333--------- ELVGPPGKVVIKANWKAPAENFVGDAYHVGWTHASSLRSGESIFSSLAGNAALPPEGAGL -------------3333--------3333-1111-------1111-2222---------- QMTSKYGSGMGVLWDGYSGVHSADLVPELMAFGGAKQERLNKEIGDVRARIYRSHLNCTV ---1111--------1111--3333-------------------------1111------ FPNNSMLTCSGVFKVWNPIDANTTEVWTYAIVEKDMPEDLKRRLADSVQRTFGPAGFWES --------------------------------1111------------------------ DDNDNMETASQNGKKYQSRDSDLLSNLGFGEDVYGDAVYPGVVGKSAIGETSYRGFYRAY --------------1111-------2222------------------------------- QAHVSSSNWAEFEHASSTWHTELTKTTD -----------------------2222- >NAPHTHALENE 1,2-DIOXYGENA; SWP:P23095; PDB:1O7NB; MINIQEDKLVSAHDAEEILRFFNCHDSALQQEATTLLTQEAHLLDIQAYRAWLEHCVGSE --33331111---------------------------------1111----------111 VQYQVISRELRAASERRYKLNEAMNVYNENFQQLKVRVEHQLDPQNWGNSPKLRFTRFIT 1----------1111---------------------------11111111---------- NVQAAMDVNDKELLHIRSNVILHRARRGNQVDVFYAAREDKWKRGEGGVRKLVQRFVDYP ---------------------------------------------iiii----------- ERILQTHNLMVFL ------------- >N-ACETYLLACTOSAMINIDE ALP; SWP:P14769; PDB:1O7QA; KLKLSDWFNPFKRPEVVTMTKWKAPVVWEGTYNRAVLDNYYAKQKITVGLTVFAVGRYIE --1111--33331111---1111----2222-------------------------3333 HYLEEFLTSANKHFMVGHPVIFYIMVDDVSRMPLIELGPLRSFKVFKIKPEKRWQDISMM --------------2222---------3333------2222------------------- RMKTIGEHIVAHIQHEVDFLFCMDVDQVFQDKFGVETLGESVAQLQAWWYKADPNDFTYE -----------3333------------------3333--------1111---3333---- RRKESAAYIPFGEGDFYYHAAIFGGTPTQVLNITQECFKGILKDKKNDIEAQWHDESHLN -3333----2222-----3333----------------------1111------------ KYFLLNKPTKILSPEYCWDYHIGLPADIKLVKMSWQTKEYNVVRNNV -------------1111-3333--3333----------3333----- >CITRATE SYNTHASE; SWP:P80148; PDB:1O7XA; VVSKGLENVIIKVTNLTFIDGEKGILRYRGYNIEDLVNYGSYEETIYLMLYGKLPTKKEL 
--2222---------------------iiii------------------------1111- NDLKAKLNEEYEVPQEVLDTIYLMPKEADAIGLLEVGTAALASIDKNFKWKENDKEKAIS ------1111----------33331111-------------------------------- IIAKMATLVANVYRRKEGNKPRIPEPSDSFAKSFLLASFAREPTTDEINAMDKALILYTD --------------1111------------------------------------------ HEVPASTTAALVAASTLSDMYSSLTAALAALKGPLHGGAAEEAFKQFIEIGDPNRVQNWF -------------1111---------------1111--3333---------1111----- NDKVVNQKNRLMGFGHRVYKTYDPRAKIFKKLALTLIERNADARRYFEIAQKLEELGIKQ ----------------------1111----------3333-------------------- FSSKGIYPNTDFYSGIVFYALGFPVYMFTALFALSRTLGWLAHIIEYVEEQHRLIRPRAL -1111---11113333-------3333--------------------------------- YVGPEYQ ------- >SMALL INDUCIBLE CYTOKINE ; SWP:P02778; PDB:1O7ZA; CTCISISNQPVNPRSLEKLEIIPASQFCPRVEIIATMKKKGEKRCLNPESKAIKNLLKAV -----------1111-------------------------------11113333------ S - >PEPTIDE ANTIBIOTIC AS-48; SWP:Q47765; PDB:1O82A; MAKEFGIPAAVAGTVLNVVEAGGWVTTIVSILTAVGSGGLSLLAAAGRESIKAYLKKEIK -------3333-------1111-3333------------------!!!!----------- KKGKRAVIAW ---------- >PECTATE LYASE C; SWP:P11073; PDB:1O88A; ATDTGGYAATAGGNVTGAVSKTATSMQDIVNIIDAARLDANGKKVKGGAYPLVITYTGNE ----------!!!!2222----------------11111111--2222------------ DSLINAAAANICGQWSKDPRGVEIKEFTKGITIIGANGSSANFGIWIKKSSDVVVQNMRI ---------11111111------------------2222--------------------- GYLPGGAKDGDMIRVDDSPNVWVDHNELFAANHECDGTPDNDTTFESAVDIKGASNTVTV ----1111-----------------------------2222------------------- SYNYIHGVKKVGLDGSSSSDTGRNITYHHNYYNDVNARLPLQRGGLVHAYNNLYTNITGS ----------------1111---------------------------------------- GLNVRQNGQALIENNWFEKAINPVTSRYDGKNFGTWVLKGNNITKPADFSTYSITWTADT --------------------------------------------3333-1111------- KPYVNADSWTSTGTFPTVAYNYSPVSAQCVKDKLPGYAGVGKNLATLTSTACK -----1111-----------------------3333-----%%%%--3333-- >YHDH; SWP:P26646; PDB:1O89A; LQALLLEQQTLASVQTLDESRLPEGDVTVDVHWSSLNYKDALAITGKGKIIRNFPMIPGI -----------------3333--------------------------------------- 
DFAGTVRTSEDPRFHAGQEVLLTGWGVGENHWGGLAEQARVKGDWLVAMPQGLDARKAMI ----------11112222-----iiii--------------3333----2222------- IGTAGFTAMLCVMALEDAGVRPQDGEIVVTGASGGVGSTAVALLHKLGYQVVAVSGREST ---------------1111-1111------1111----------------------1111 HEYLKSLGASRVLPRDEFAESRPLEKQVWAGAIDTVGDKVLAKVLAQMNYGGCVAACGLA -------------3333---------------------------11112222------11 GGFTLPTTVMPFILRNVRLQGVDSVMTPPERRAQAWQRLVADLPESFYTQAAKEISLSEA 11------3333-------------------------------3333--------3333- PNFAEAIINNQIQGRTLVKV -------------------- >RIBOSE 5-PHOSPHATE ISOMER; SWP:P27252; PDB:1O8BA; IVGVGTGSTEGAVSSSDAFDLNEVDSLGIYVDGADEINGHQIKGGGALTREKIIASVAEK -------------------3333-------------------------------1111-- FICIADASKQVDILGKFPLPVEVIPARSAVARQLVKLGGRPEYRQGVVTDNGNVILDVHG -----3333----------------------------------2222-1111-------- EILDPIAENAINAIPGVVTVGLFANRGADVALIGTPDGVKTIV ---3333------2222-----------------1111----- >GUANYLIN; SWP:Q02747; PDB:1O8RA; VTVQDGNFSFSLESVKKLKDLQEPQEPRVGKLRNFAPIPGEPVVPILCSNPNFPEELKPL ------------33333333---------------------------------3333-33 CKEPNAQEILQRLEEIAEDPGTCEICAYAACTGC 331111---------1111--3333--------- >FATTY ACID BINDING PROTEI; SWP:Q02970; PDB:1O8VA; MEAFLGTWKMEKSEGFDKIMERLGVDFVTRKMGNLVKPNLIVTDLGGGKYKMRSESTFKT 3333----------------1111---------------------iiii------3333- TESFKLGEKFKEVTPDSREVASLITVENGVMKHEQDDKTKVTYIERVVEGNELKATVKVD ----2222-----1111---------iiii------------------!!!!------!! 
EVVCVRTYSKVA !!---------- >TRYPAREDOXIN; SWP:O96438; PDB:1O8XA; GLDKYLPGIEKLRRGDGEVEVKSLAGKLVFFYFSASWCPPARGFTPQLIEFYDKFHESKN 3333-2222----!!!!--33332222-------1111------------------1111 FEVVFCTWDEEEDGFAGYFAKMPWLAVPFAQSEAVQKLSKHFNVESIPTLIGVDADSGDV ------------------1111-----3333----------------------------- VTTRARATLVKDPEGEQFPWKDAP ---333333331111--------- >COLLAGEN ALPHA 1(VIII) CH; SWP:Q00780; PDB:1O91A; EMPAFTAELTVPFPPVGAPVKFDKLLYNGRQNYNPQTGIFTCEVPGVYYFAYHVHCKGGN ---------------------------2222--3333----------------------- VWVALFKNNEPMMYTYDEYKKGFLDQASGSAVLLLRPGDQVFLQMPSEQAAGLYAGQYVH ------!!!!-------------------------2222-------1111-----1111- SSFSGYLLYPM ----------- >Electron transfer flavopr; SWP:P53570; PDB:1O94C; MKILVAVKQTAALEEDFEIRDGMDVDEDFMMYDLNEWDDFSLEEAMKIKESSDDVEVVVV -------------------------3333-----3333----------1111-------- SVGPDRVDESLRKCLAKGADRAVRVWDDAAEGSDAIVVGRILTEVIKKEAPDMVFAGVQS ---3333-------1111--------3333------------------------------ SDQAYASTGISVASYLNWPHAAVVADLQYKPGDNKAVIRRELEGGMLQEVEINCPAVLTI -----------------------------2222--------2222--------------- QLGINKPRYASLRGIKQAATKPIEEVSLADIGLSANDVGAAQSMSRVRRMYIP 2222------------3333------3333---3333-3333----------- >Electron transfer flavopr; SWP:P53570; PDB:1O97C; MKILVAVKQTAALEEDFEIREDGMDVDEDFMMYDLNEWDDFSLEEAMKIKESSDTDVEVV -------------2222--1111---3333-----3333--------------------- VVSVGPDRVDESLRKCLAKGADRAVRVWDDAAEGSDAIVVGRILTEVIKKEAPDMVFAGV ----------------1111--------3333---------------------------- QSSDQAYASTGISVASYLNWPHAAVVADLQYKPGDNKAVIRRELEGGMLQEVEINCPAVL -------------------------------2222--------2222------------- TIQLGINKPRYASPIEEVSLADIGLSANDVGAAQSMSRVRRMYIPEKGRATMIEGTISEQ --2222------------3333---1111-3333-------------------------- AAKIIQIINEF ----------- >Electron transfer flavopr; SWP:P53571; PDB:1O97D; SKILVIAEHRRNDLRPVSLELIGAANGLKKSGEDKVVVAVIGSQADAFVPALSVNGVDEL ---------iiii-3333------------1111-------1111111111112222--- VVVKGSSIDFDPDVFEASVSALIAAHNPSVVLLPHSVDSLGYASSLASKTGYGFATDVYI 
-----------------------------------3333-------3333---------- VEYQGDELVATRGGYNQKVNVEVDFPGKSTVVLTIRPSVFKPLEGAGSPVVSNVDAPSVQ ---!!!!-------%%%%------2222-------------------------------- SRSQNKDYVEVGDIDITTVDFIMSIGRGIGEETNVEQFRELADEAGATLCCSRPIADAGW ------------------------------3333-------------------------- LPKSRQVGQSGKVVGSCKLYVAMGISGSIQHMAGMKHVPTIIAVNTDPGASIFTIAKYGI -3333--1111--1111----------3333---3333--------1111-1111----- VADIFDIEEELKAQL --------------- >2,3-BISPHOSPHOGLYCERATE-I; SWP:Q9X519; PDB:1O98A; SKKPVALIILDGFALRDETYGNAVAQANKPNFDRYWNEYPHTTLKACGEAVGLPEGQMGN ---------2222-----2222-3333-------------------!!!!---2222--3 SEVGHLNIGAGRIVYQSLTRINIAIREGEFDRNETFLAAMNHVKQHGTSLHLFGLLSDGG 333-------------------------3333---------------------------1 VHSHIHHLYALLRLAAKEGVKRVYIHGFLDGRDVGPQTAPQYIKELQEKIKEYGVGEIAT 111----------------------------------3333------------------- LSGRYYSMDRDKRWDRVEKAYRAMVYGEGPTYRDPLECIEDSYKHGIYDEFVLPSVIVRE --3333------------------------------------1111--1111------11 DGRPVATIQDNDAIIFYNFRPDRAIQISNTFTNEDFREFDRGPKHPKHLFFVCLTHFSET 11------2222--------1111-----------------1111------------333 VAGYVAFKPTNLDNTIGEVLSQHGLRQLRIAETEKYPHVTFFMSGGREEEFPGEDRILIN 3-------------------1111-------3333---------------2222------ SPKVPTYDLKPEMSAYEVTDALLKEIEADKYDAIILNYANPDMVGHSGKLEPTIKAVEAV -----33331111------------1111---------3333--3333------------ DECLGKVVDAILAKGGIAIITADHGNADEVLTPDGKPQTAHTTNPVPVIVTKKGIKLRDG -----------1111----------3333--1111------------------------- GILGDLAPTMLDLLGLPQPKEMTGKSLIV -1111-------------3333------- >14-3-3-LIKE PROTEIN C; SWP:P93343; PDB:1O9DA; PTAREENVYMAKLAEQAERYEEMVEFMEKVSNSLGSEELTVEERNLLSVAYKNVIGARRA ------------------------------------------------------------ SWRIISSIEQKEESRGNEEHVNSIREYRSKIENELSKICDGILKLLDAKLIPSAASGDSK ------------1111---------------------------------3333------- VFYLKMKGDYHRYLAEFKTGAERKEAAESTLTAYKAAQDIATTELAPTHPIRLGLALNFS ---------------------------------------------1111----------- 
VFYYEILNSPDRACNLAKQAFDEAIAELDTLGDSTLIMQLLRDNLTLWTSD --------------------------------------------------- >RRNA METHYLTRANSFERASE; SWP:Q9F5K5; PDB:1O9GA; SAYRHAVERDSSDLACGVVLHSAPGYPAFPVRLATEIFQRALARLPGDGPVTLWDPCCGS ----------33332222----1111---------------1111------------!!! GYLLTVLGLLHRRSLRQVIASDVDPAPLELAAKNLALLSPAGLTARELERREQSERFGKP !---------3333----------3333------------------------------33 SYLEAAQAARRLRERLTAEGGALPCAIRTADVFDPRALSAVLAGSAPDVVLTDLPYGERT 33----------------------------11111111---iiii---------3333-- HWEGQVPGQPVAGLLRSLASALPAHAVIAVTDRSRKIPVAPVKALERLKIGTRSAVVRAA 3333--------------11111111-----------------------!!!!------- DVLEAGP --3333- >PSEUDOCATALASE; SWP:BAA13239; PDB:1O9IA; MFKHTRKLQYNAKPDRSDPIMARRLQESLGGQWGETTGMMSFLSQGWASTGAEKYKDLLL ------------------------------1111-----------------3333----- DTGTEEMAHVEMISTMIGYLLEDAPFGPEDLKRDPSLATTMAGMDPEHSLVHGLNASLNN --------------------2222--3333----------11111111---%%%%----1 PNGAAWNAGYVTSSGNLVADMRFNVVRESEARLQVSRLYSMTEDEGVRDMLKFLLARETQ 111---3333--------------------------3333-------------------- HQLQFMKAQEELEEKYGIIVPGDMKEIEHSEFSHVLMNFSDGDGSKAFEGQVAKDGEKFT -----------------------1111-3333----------3333-2222-1111---- YQENPEAMGGIPHIKPGDPRLHNHQG -----------------3333----- >ALDEHYDE DEHYDROGENASE, C; SWP:Q28399; PDB:1O9JA; DLPAPLTNIKIQHTKLFINNEWHESVSGKTFPVFNPATEEKICEVEEADKEDVDKAVKAA -----------------%%%%---3333-------------------------------- REAFQMGSPWRTMDASERGQLIYKLADLIERDRLLLATLESINAGKVFASAYLMDLDYCI ----22223333-3333------------------------------------------- KALRYCAGWADKIQGRTIPVDGEFFSYTRHEPIGVCGLIFPWNAPMILLACKIGPALCCG --------1111-------------------------------3333---------1111 NTVIVKPAEQTPLTALHVASLIKEAGFPPGVVNIVPGYGPTAGAAISSHMDVDKVAFTGS -------3333----------------2222------3333-------1111-------- TEVGKMIQEAAAKSNLKRVTLELGAKNPCIVFADADLDSAVEFAHQGVFTNQGQSCIAAS -------------------------------1111--------------%%%%-1111-- KLFVEEAIYDEFVQRSVERAKKYVFGNPLTPGVNHGPQINKAQHNKIMELIESGKKEGAK 
----3333----------1111----1111---------3333----------------- LECGGGPWGNKGYFIQPTVFSNVTDDMRIAKEEIFGPVQQIMKFKSLDEVIKRANNTYYG -----------------------11111111----------------------------- LVAGVFTKDLDKAVTVSSALQAGTVWVNCYLAASAQSPAGGFKMSGHGREMGEYGIHEYT ---------------------------------1111----!!!!-------3333---- EVKTVTMKISEKNS -------------- >AGROBACTERIUM TUMEFACIENS; SWP:Q8UCK6; PDB:1O9RA; MKTHKTKNDLPSNAKSTVIGILNESLASVIDLALVTKQAHWNLKGPQFIAVHELLDTFRT --------------------------------------------1111------------ QLDNHGDTIAERVVQLGGTALGSLQAVSSTTKLKAYPTDIYKIHDHLDALIERYGEVANM ------------------------------------------------------------ IRKAIDDSDEAGDPTTADIFTAASRDLDKSLWFLEAHVQEKS --------1111----------------------1111---- >HRCQ2; SWP:O85094; PDB:1O9YA; ALDSLALDLTLRCGELRLTLAELRRLDAGTILEVTGISPGHATLCHGEQVVAEGELVDVE -1111-----------------11112222-------2222----!!!!---------ii GRLGLQITRLV ii--------- >ENDO-BETA-1,4-GLUCANASE; SWP:O00095; PDB:1OA2A; TSCDQWATFTGNGYTVSNNLWGASAGSGFGCVTVVSLSGGASWHADWQWSGGQNNVKSYQ ----------iiii-----1111----------------------------1111----- NSQIAIPQKRTVNSISSMPTTASWSYSGSNIRANVAYDLFTAANPNHVTYSGDYELMIWL ----------3333-----------------------------1111------------- GKYGDIGPIGSSQGTVNVGGQSWTLYYGYNGAMQVYSFVAQTNTTNYSGDVKNFFNYLRD -----------------iiii--------!!!!--------------------------- NKGYNAAGQYVLSYQFGTEPFTGSGTLNVASWTASIN ----3333----------------------------- >ENDO-BETA-1-4-GLUCANASE; SWP:Q8NJY6; PDB:1OA3A; TSCDQYATFSGNGYIVSNNLWGASAGSGFGCVTSVSLNGAASWHADWQWSGGQNNVKSYQ ----------!!!!-----1111----------------------------1111----- NVQINIPQKRTVNSIGSMPTTASWSYSGSDIRANVAYDLFTAANPNHVTYSGDYELMIWL ----------3333-----------------------------1111------------- GKYGDIGPIGSSQGTVNVGGQTWTLYYGYNGAMQVYSFVAQSNTTSYSGDVKNFFNYLRD -----------------iiii--------!!!!--------------------------- NKGYNAGGQYVLSYQFGTEPFTGSGTLNVASWTASIN ----3333----------------------------- >ENDO-BETA-1,4-GLUCANASE; SWP:Q9KIH1; PDB:1OA4A; NQQICDRYGTTTIQDRYVVQNNRWGTSATQCINVTGNGFEITQADGSVPTNGAPKSYPSV 
------------%%%%--------------------------------1111-------- YDGCHYGNCAPRTTLPMRISSIGSAPSSVSYRYTGNGVYNAAYDIWLDPTPRTNGVNRTE ----iiii-2222----3333--------------------------------------- IMIWFNRVGPVQPIGSPVGTAHVGGRSWEVWTGSNGSNDVISFLAPSAISSWSFDVKDFV ----------------------iiii----------------------------3333-- DQAVSHGLATPDWYLTSIQAGFEPWEGGTGLAVNSFSSAVNA ---1111--1111--------------2222----------- >ATAXIN-1; SWP:P54253; PDB:1OA8A; GSPAAAPPTLPPYFMKGSIIQLANGELKKVEDLKTEDFIQSAEISNDLKIDSSTVERIED ----------11112222----------3333-3333----------------------- SHSPGVAVIQFAVGEHRAQVSVEVLVEYPFFVFGQGWSSCCPERTSQLFDLPCSKLSVGD --2222-------1111-------1111---2222---------------------2222 VCISLTLK -------- >SEPIAPTERIN REDUCTASE; SWP:Q64105; PDB:1OAA; ADGLGCAVCVLTGASRGFGRALAPQLARLLSPGSVMLVSARSESMLRQLKEELGAQQPDL --------------------------11112222----------------------1111 KVVLAAADLGTEAGVQRLLSAVRELPRPEGLQRLLLINNAATLGDVSKGFLNVNDLAEVN -------1111----------1111-----------------------3333-------- NYWALNLTSMLCLTSGTLNAFQDSPGLSKTVVNISSLCALQPYKGWGLYCAGKAARDMLY -----------------------1111--------3333---2222-------------- QVLAAEEPSVRVLSYAPGPLDNDMQQLARETSKDPELRSKLQKLKSDGALVDCGTSAQKL ------1111-------------------------------------------------- LGLLQKDTFQSGAHVDFYD ---------2222--1111 >COPPER AMINE OXIDASE; SWP:P46883; PDB:1OACA; AHMVPMDKTLKEFGADVQWDDYAQLFTLIKDGAYVKVKPGAQTAIVNGQPLALQVPVVMK ----------1111------1111-----!!!!----2222----iiii----------- DNKAWVSDTFINDVFQSGLDQTFQVEKRPHPLNALTADEIKQAVEIVKASADFKPNTRFT ------1111---1111------------1111----------------33331111--- EISLLPPDKEAVWAFALENKPVDQPRKADVIMLDGKHIIEAVVDLQNNKLLSWQPIKDAH -------------------------------------------------------2222- GMVLLDDFASVQNIINNSEEFAAAVKKRGITDAKKVITTPLTVGYFDGKDGLKQDARLLK ---3333------------------1111----------------iiii---3333---- VISYLDVGDGNYWAHPIENLVAVVDLEQKKIVKIEEGPVVPVPMTARPFDGRDRVAPAVK ----------3333------------------------------------1111------ PMQIIEPEGKNYTITGDMIHWRNWDFHLSMNSRVGPMISTVTYNDNGTKRKVMYEGSLGG 
-----1111-----!!!!--!!!!--------------------iiii------------ MIVPYGDPDIGWYFKAYLDSGDYGMGTLTSPIARGKDAPSNAVLLNETIADYTGVPMEIP --------1111-----3333----1111---2222--1111--------1111------ RAIAVFERYAGPEYKHQEMGQPNVSTERRELVVRWISTVGNDYIFDWIFHENGTIGIDAG -----------------2222----------------------------1111------- ATGIEAVKGVKAKTMHDETAKDDTRYGTLIDHNIVGTTHQHIYNFRLDLDVDGENNSLVA -------------1111-3333-1111---2222-------------------------- MDPVVKPNTAGGPRTSTMQVNQYNIGNEQDAAQKFDPGTIRLLSNPNKENRMGNPVSYQI --------------------------3333-----1111-----1111-1111------- IPYAGGTHPVAKGAQFAPDEWIYHRLSFMDKQLWVTRYHPGERFPEGKYPNRSTHDTGLG ----------------1111-----3333---------1111-1111------------- QYSKDNESLDNTDAVVWMTTGTTHVARAEEWPIMPTEWVHTLLKPWNFFDETPTLGALK --1111--------------------3333---------------------1111---- >CYTOCHROME C NITRITE REDU; SWP:Q8VNU2; PDB:1OAHA; TGIAETETKMSAFKGQFPQQYASYMKNNEDRIMTDYKGSVPYHKNDNVNPLPKGFKHAQP --------3333-1111--------1111-------------1111------------11 YLKNLWLGYPFMYEYNETRGHTYAIDDFLNIDRINRFAADGKGNLPATCWNCKTPKMMEW 113333--3333-------3333-------3333---1111------1111--3333--- VSQYGDKFWSMDVNEFRAKDKINAHDETIGCANCHDPATMELRLYSEPLKDWLKRSGKDW -------11111111--1111--------3333-------------------------33 QKMSRNEKRTLVCAQCHVEYYFTHKDNGPAAKPVFPWDNGFNPEDMYQYYKGHGAKGPDG 33-3333---3333---------3333-2222----1111----------------1111 KPGPFVDWVHAASKVPMIKMQHPEYETFQDGPHGAAGVSCADCHMQYISSHWMTSPMKDP ----------------------3333-1111-3333--3333------------111111 EMRACRQCHADKTGEYLRQRVLYTQQKTFDQLLKAQEMSVKAHEAVRLANAYEGHRAANY 111111--1111------------------------------------------------ EALMAEAREMVRKGQLFWDYVSAENSVGFHNPAKALDTLMTSMECSQKAVDLATEATDFG --------------------3333-iiii-------------------------111111 IAPALAGDIKKLVPPILTLSRKLQQDPEFLKQNPWTRLLPALPKAEQVWEGQDRA 111111-3333--------3333--3333-----3333-----------!!!!-- >NUCLEAR RNA EXPORT FACTOR; SWP:Q9UBU9; PDB:1OAIA; PTLSPEQQEMLQAFSTQSGMNLEWSQKCLQDNNWDYTRSAQAFTHLKAKGEIPEVAFMK -------------------------------%%%%-----------1111--3333--- >SUPEROXIDE 
DISMUTASE; SWP:P00446; PDB:1OALA; QDLTVKMTDLQTGKPVGTIELSQNKYGVVFIPELADLTPGEHGFHIHQNGSCASSEKDGK -----------------------1111-----------------------------iiii VVLGGAAGGHYDPEHTNKHGFPWTDDNHKGDLPALFVSANGLATNPVLAPRLTLKELKGH -2222------1111-----1111-------------1111-----------33332222 AIMIHAGGDNHSDMPKALGGGGARVACGVIQ ----------------iiii----------- >CARBON MONOXIDE DEHYDROGE; SWP:P27989; PDB:1OAOA; PRFRDLSHNCRPSEAPRVMEPKNRDRTVDPAVLEMLVKSKDDKVITAFDRFVAQQPQCKI ----3333-----------1111----------------------------3333----- GYEGICCRFCMAGPCRIKATDGPGSRGICGASAWTIVARNVGLMILTGAAAHCEHGNHIA 1111-----3333--------1111-1111------------------------------ HALVEMAEGKAPDYSVKDEAKLKEVCRRVGIEVEGKSVLELAQEVGEKALEDFRRLKGEG -----1111-1111------------1111--2222-------------------2222- EATWLMTTINEGRKEKFRTHNVVPFGIHASISELVNQAHMGMDNDPVNLVFSAIRVALAD ----1111------------------------------2222------------------ YTGEHIATDFSDILFGTPQPVVSEANMGVLDPDQVNFVLHGHNPLLSEIIVQAAREMEGE --------------------------11111111--------3333------3333---- AKAAGAKGINLVGICCTGNEVLMRQGIPLVTSFASQELAICTGAIDAMCVDVQCIMPSIS -1111---------------------------11113333-------------------- AVAECYHTRIITTADNAKIPGAYHIDYQTATAIESAKTAIRMAIEAFKERKESNRPVYIP --1111-------1111-2222-----3333------------------1111------- QIKNRVVAGWSLEALTKLLATQNAQNPIRVLNQAILDGELAGVALICGCNNLKGFQDNSH ----------------------3333----------------------------2222-- LTVMKELLKNNVFVVATGCSAQAAGKLGLLDPANVETYCGDGLKGFLKRLGEGANIEIGL ---------------------------11111111----------------11111111- PPVFHMGSCVDNSRAVDLLMAMANDLGVDTPKVPFVASAPEAMSGKAAAIGTWWVSLGVP -------3333-----------------3333---------------------------- THVGTMPPVEGSDLIYSILTQIASDVYGGYFIFEMDPQVAARKILDALEYRTWKLGVHKE --------3333---------3333----------------------------------- VAERYETKLCQGY ------------- >Carbon monoxide dehydroge; SWP:P27988; PDB:1OAOC; MTDFDKIFEGAIPEGKEPVALFREVYHGAITATSYAEILLNQAIRTYGPDHPVGYPDTAY -3333--2222-2222-3333--------------------------1111--------- YLPVIRCFSGEEVKKLGDLPPILNRKRAQVSPVLNFENARLAGEATWYAAEIIEALRYLK 
--------------3333--------1111--------------------------3333 YKPDEPLLPPPWTGFIGDPVVRRFGIKMVDWTIPGEAIILGRAKDSKALAKIVKELMGMG -1111-----------3333---33331111-------------------------1111 FMLFICDEAVEQLLEENVKLGIDYIAYPLGNFTQIVHAANYALRAGMMFGGVTPGAREEQ -------------1111---3333------!!!!------------------2222---- RDYQRRRIRAFVLYLGEHDMVKTAAAFGAIFTGFPVITDQPLPEDKQIPDWFFSVEDYDK -----------------------------1111---------1111----------1111 IVQIAMETRGIKLTKIKLDLPINFGPAFEGESIRKGDMYVEMGGNRTPAFELVRTVSESE ------------------------3333------1111----iiii----------3333 ITDGKIEVIGPDIDQIPEGSKLPLGILVDIYGRKMQADFEGVLERRIHDFINYGEGLWHT -2222------1111-2222-----------11111111--------------2222--- GQRNINWLRVSKDAVAKGFRFKNYGEILVAKMKEEFPAIVDRVQVTIFTDEAKVKEYMEV -!!!!---------1111-3333------------------------------------- AREKYKERDDRMRGLTDETVDTFYSCVLCQSFAPNHVCIVTPERVGLCGAVSWLDAKASY ----------3333-3333---------33331111----1111-3333----------- EINHAGPNQPIPKEGEIDPIKGIWKSVNDYLYTASNRNLEQVCLYTLMENPMTSCGCFEA --1111----------------------------iiii---------------------- IMAILPECNGIMITTRDHAGMTPSGMTFSTLAGMIGGGTQTPGFMGIGRTYIVSKKFISA ----3333------1111---1111------1111iiii-2222---3333--1111333 DGGIARIVWMPKSLKDFLHDEFVRRSVEEGLGEDFIDKIADETIGTTVDEILPYLEEKGH 3--1111-------------------1111-11111111-1111--3333---------3 PALTMDPIM 333------ >PEPTIDOGLYCAN-ASSOCIATED ; SWP:P07176; PDB:1OAPA; QQNNIVYFDLDKYDIRSDFAQMLDAHANFLRSNPSYKVTVEGHADERGTPEYNISLGERR --------2222---3333-------------1111------------------------ ANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYSKNRRAVLVY --------1111-3333-----!!!!-----------1111------- >Ig lambda-1 chain V regio; SWP:P01724; PDB:1OAQL; QAVVTQESALTTSPGETVTLTCRSSTGAVTTSNYANWVQEKPDHLFTGLIGGTNNRAPGV ------------2222-------3333--3333-----------------------2222 PARFSGSLIGNKAALTITGAQTEDEAIYFCALWYSNHLVFGGGTKLTVLG 3333----!!!!--------1111---------%%%%------------- >Major merozoite surface p; SWP:Q25976; PDB:1OB1B; ELQLVQSGPQLKKPGETVRISCKASGYTFTTAGIQWVQRLPGKDLKWIGWINTHSGVPQY 
---------------------------1111----------------------------- ADDFKGRFAFSLETSASTAFLQIINL ------------1111---------- >CELL DIVISION CONTROL PRO; SWP:Q07785; PDB:1OB3A; MEKYHGLEKIGEGTYGVVYKAQNNYGETFALKKIRLEKEDEGIPSTTIREISILKELKHS ------------1111---------------------3333------------3333-11 NIVKLYDVIHTKKRLVLVFEHLDQDLKKLLDVCEGGLESVTAKSFLLQLLNGIAYCHDRR 11------------------------------2222--------------------1111 VLHRDLKPQNLLINREGELKIADFGLARAFGIVTLWYRAPDVLMGSKKYSTTIDIWSVGC ------3333---1111-----22223333----1111---1111----3333------- IFAEMVNGTPLFPGVSEADQLMRIFRILGTPNSKNWPNVTELPKYDPNFTVYEPLPWESF -------------------------------33332222--11111111------3333- LKGLDESGIDLLSKMLKLDPNQRITAKQALEHAYFKE ------------------3333--3333---3333-- >HOLLIDAY-JUNCTION RESOLVA; SWP:Q97YX6; PDB:1OB8A; GKNAERELVSILRGEGFNAVRIPTNPLPDIFATKGNTLLSIECKSTWENKVKVKEHQVRK ------------1111-----------------!!!!----------------3333--- LLDFLSMFTMKGVPLIAIKFKQVHEWRVLVPEKAEDIIVTIDNSIPIEDLFKILEKRIE ----3333-----------3333----------------3333--3333---------- >ALPHA-GLUCOSIDASE; SWP:O33830; PDB:1OBBA; PSVKIGIIGAGSAVFSLRLVSDLCKTPGLSGSTVTLMDIDEERLDAILTIAKKYVEEVGA --------3333-------------3333------------------------------- DLKFEKTMNLDDVIIDADFVINTAMVGGHTYLEKVRQIGEKYGYYRGIDAQEFNMVSDYY --------3333-2222-------2222-----------11112222---2222-1111- TFSNYNQLKYFVDIARKIEKLSPKAWYLQAANPIFEGTTLVTRTVPIKAVGFHGHYGVME ---------------------1111---------------------------3333---- IVEKLGLEEEKVDWQVAGVNHGIWLNRFRYNGGNAYPLLDKWIEEKSKDWKPENPFNDQL --1111-3333-------2222-------%%%%------------3333----1111111 SPAAIDMYRFYGVMPIGDTVRNSSWRYHRDLETKKKWYGEPWGGADSEIGWKWYQDTLGK 1---------------3333---3333----------------1111------------- VTEITKKVAKFIKENPSVRLSDLGSVLGKDLSEKQFVLEVEKILDPERKSGEQHIPFIDA --------------11111111----------------------1111------------ LLNDNKARFVVNIPNKGIIHGIDDDVVVEVPALVDKNGIHPEKIEPPLPDRVVKYYLRPR --------------iiii22221111--------1111----------3333-------- IMRMEMALEAFLTGDIRIIKELLYRDPRTKSDEQVEKVIEEILALPENEEMRKHYLKR 
-------------------------1111---------------1111---------- >Glyceraldehyde 3-phosphat; SWP:P83696; PDB:1OBFO; TIRVAINGYGRIGRNILRAHYEGGKSHDIEIVAINDLGDPKTNAHLTRYDTAHGKFPGTV --------------------1111------------------------------------ SVNGSYMVVNGDKIRVDANRNPAQLPWGALKVDVVLECTGFFTTKEKAGAHIKGGAKKVI --!!!!--iiii--------3333-3333--------------3333------------- ISAPGGADVDATVVYGVNHGTLKSTDTVISNASTTNCLAPLVKPLNDKLGLQDGLMTTVH -----1111----22223333-1111-------3333----------------------- AYTNNQVLTDVYHEDLRRARSATMSMIPTKTGAAAAVGDVLPELDGKLNGYAIRVPTINV --3333--------3333--1111-------33333333-3333---------------- SIVDLSFVAKRNTTVEEVNGILKAASEGELKGILDYNTEPLVSVDYNHDPASSTVDASLT -------------3333----------1111----------33332222------3333- KVSGRLVKVSSWYDNEWGFSNRMLDTTVALMSAA --!!!!-----------------------1111- >FLAVODOXIN; SWP:P11241; PDB:1OBOA; AKKIGLFYGTQTGKTESVAEIIRDEFGNDVVTLHDVSQAEVTDLNDYQYLIIGCPTLNIG --------------------------1111----3333-33331111------------- ELQSDWEGLYSELDDVDFNGKLVAYFGTGDQIGYADNFQDAIGILEEKISQRGGKTVGYW -----------3333--2222--------33331111-3333-------1111------- STDGYDFNDSKALRNGKFVGLALDEDNQSDLTDDRIKSWVAQLKSEFGL -2222--------iiii------33331111------------------ >CARBOXYPEPTIDASE T; SWP:P29068; PDB:1OBR; DFPSYDSGYHNYNEMVNKINTVASNYPNIVKKFSIGKSYEGRELWAVKISDNVGTDENEP --1111-------------------3333--------1111----------3333----- EVLYTALHHAREHLTVEMALYTLDLFTQNYNLDSRITNLVNNREIYIVFNINPDGGEYDI --------11113333----------1111---------------------------111 SSGSYKSWRKNRQPNSGSSYVGTDLNRNYGYKWGCCGGSSGSPSSETYRGRSAFSAPETA 1-------------2222-----1111----2222------1111------2222----- AMRDFINSRVVGGKQQIKTLITFHTYSELILYPYGYTYTDVPSDMTQDDFNVFKTMANTM ------1111iiii---------------------------1111--------------- AQTNGYTPQQASDLYITDGDMTDWAYGQHKIFAFTFEMYPTSYNPGFYPPDEVIGRETSR ----------3333------------------------------!!!!-1111------- NKEAVLYVAEKADCPYSVIGKSC -------------3333------ >Vitronectin [Precursor]; SWP:P04004; PDB:1OC0B; ESCKGRCTEGFNVDKKCQCDELCSYYQSCCTDYTAEC --2222-----3333----11111111--1111---- 
>DTDP-GLUCOSE 4,6-DEHYDRAT; SWP:Q8GIP9; PDB:1OC2A; QFKNIIVTGGAGFIGSNFVHYVYNNHPDVHVTVLDKLTYAGNKANLEAILGDRVELVVGD -------------------------1111--------111133333333-3333-----1 IADAELVDKLAAKADAIVHYAAESHNDNSLNDPSPFIHTNFIGTYTLLEAARKYDIRFHH 111-------1111----------3333----3333------------------------ VSTDEVYGDLPLREDLPGHGEGPGEKFTAETNYNPSSPYSSTKAASDLIVKAWVRSFGVK ----1111---33331111--2222--1111----------------------------- ATISNCSNNYGPYQHIEKFIPRQITNILAGIKPKLYGEGKNVRDWIHTNDHSTGVWAILT --------------------------1111-----!!!!-------3333---------- KGRMGETYLIGADGEKNNKEVLELILEKMGQPKDAYDHVTDRAGHDLRYAIDASKLRDEL --2222--------------------1111-1111------2222--------------- GWTPQFTDFSEGLEETIQWYTDNQDWWKAEKEAVEANYAKTQEVIK ----------------------33333333-------3333----- >L-LACTATE DEHYDROGENASE; SWP:Q7SI97; PDB:1OC4A; APKAKIVLVGSGMIGGVMATLIVQKNLGDVVMFDIVKNMPHGKALDTSHTNVMAYSNCKV ---------------------------------------------------1111----- SGSNTYDDLKDADVVIVTAGFTKAPGKSDKEWNRDDLLPLNNKIMIEIGGHIKNNCPNAF --------2222-----------2222-----3333----3333-----------1111- IIVVTNPVDVMVQLLHQHSGVPKNKIVGLGGVLDTSRLKYYISQKLNVCPRDVNAHIVGA ------3333-----------1111-----------------------3333-------- HGNKMVLLKRYITVGGIPLQEFINNKKITDQELDAIFDRTINTALEIVNLHASPYVAPAA -1111--3333--iiii3333-1111---------------------------------- AIIEMAESYIRDLRKVLICSTLLEGQYGHKDIFAGTPLVIGGNGVEQVIELQLNADEKKK --------1111------------2222------------1111---------------- FDEAVAETSRMKALI --------------- >CELLOBIOHYDROLASE II; SWP:Q9C1S9; PDB:1OC7A; APYNGNPFEGVQLWANNYYRSEVHTLAIPQITDPALRAAASAVAEVPSFQWLDRNVTVDT -----1111-----------------3333------------1111-------3333--- LLVQTLSEIREANQAGANPQYAAQIVVYDLPDRDCAAAASNGEWAIANNGVNNYKAYINR ------------1111-----------------1111-------3333------------ IREILISFSDVRTILVIEPDSLANMVTNMNVPKCSGAASTYRELTIYALKQLDLPHVAMY -------1111---------3333---3333------------------11111111--- MDAGHAGWLGWPANIQPAAELFAKIYEDAGKPRAVRGLATNVANYNAWSVSSPPPYTSPN ----1111--3333------------1111-3333-----2222---------1111--- 
PNYDEKHYIEAFRPLLEARGFPAQFIVDQGRSGKQPTGQKEWGHWCNAIGTGFGMRPTAN ----------------1111--------------------1111---------------- TGHQYVDAFVWVKPGGECNGTSDTTAARYDYHCGLEDALKPAPEAGQWFNEYFIQLLRNA --1111-------2222-----1111---3333-1111-----2222---------1111 NPPF ---- >MALONAMIDASE E2; SWP:Q9ZIV5; PDB:1OCKA; MISLADLQRRIETGELSPNAAIAQSHAAIEAREKEVHAFVRHDKSARAQASGPLRGIAVG -------------------------------1111-------1111-----1111----- IKDIIDTANMPTEMGSEIYRGWQPRSDAPVVMMLKRAGATIIGKTTTTAFASRDPTATLN ------------%%%%1111-------------3333----------2222--------3 PHNTGHSPGGSSSGSAAAVGAGMIPLALGTQTGGSVIRPAAYCGTAAIKPSFRMLPTVGV 333---------------1111----------------------------2222--2222 KCYSWALDTVGLFGARAEDLARGLLAMTGRSEFSGIVPAKAPRIGVVRQEFAGAVEPAAE ---1111--------3333----------3333---------------3333-------- QGLQAAIKAAERAGASVQAIDLPEAVHEAWRIHPIIQDFEAHRALAWEFSEHHDEIAPML ----------1111--------3333-------------------------3333----- RASLDATVGLTPKEYDEARRIGRRGRRELGEVFEGVDVLLTYSAPGTAPAKALASTGDPR ------1111-------------------3333----------------3333-----11 YNRLWTLMGNPCVNVPVLKVGGLPIGVQVIARFGNDAHALATAWFLEDALAK 11-----------------iiii--------2222----------------- >OCT-3; SWP:P20263; PDB:1OCP; METLVQARKRKRTSIENRVRWSLETMFLKCPKPSLQQITHIANQLGLEKDVVRVWFCNRR ---------------3333----------------------------------------- QKGKRSS ------- >SORTING NEXIN GRD19; SWP:Q08826; PDB:1OCSA; AEPENFLEIEVHNPKTHIPNGMDSKGMFTDYEIICRTNLPSFHKRVSKVRRRYSDFEFFR --------------------1111--------------1111---------3333----- KLIKEISMLNHPKVMVPHLPGKILLSNRFSNEVIEERRQGLNTWMQSVAGHPLLQSGSKV ----------1111------------1111-----------------1111--------- LVRFIEAEKFV ----------- >MALTOSE O-ACETYLTRANSFERA; SWP:P77791; PDB:1OCXA; STEKEKMIAGELYRSADETLSRDRLRARQLIHRYNHSLAEEHTLRQQILADLFGQVTEAY -3333--------3333------------------------------------------- IEPTFRCDYGYNIFLGNNFFANFDCVMLDVCPIRIGDNCMLAPGVHIYTATHPIDPVARN ---------1111----------------------------------------------- SGAELGKPVTIGNNVWIGGRAVINPGVTIGDNVVVASGAVVTKDVPDNVVVGGNPARIIK 
-----------------2222--2222--------2222--------------------- KL -- >BACTERIOPHAGE T4 SHORT TA; SWP:Q38160; PDB:1OCYA; RVVTQNEIDRTIPVGAIMMWAADSLPSDAWRFCHGGTVSASDCPLYASRIGTRYGGSSSN --------3333--------------------------3333-------!!!!---3333 PGLPDMRGLFVRGSGRGSHLTNPNVNGNDQFGKPRLGVGCTGGYVGEVQKQQMSYHKHAG -----2222-------3333-1111---1111--3333-----2222------------- GFGEYDDSGAFGNTRRSNFVGTRKGLDWDNRSYFTNDGYEIDPASQRNSRYTLNRPELIG ------------------------------------------3333-1111---2222-- NETRPWNISLNYIIKVKE ------------------ >ACETYL-COENZYME A CARBOXY; SWP:Q00955; PDB:1OD2A; ATPYPVKEWLQPKRYKAHLGTTYVYDFPELFRQASSSQWKNFSADVKLTDDFFISNELIE -----1111-3333--------1111-3333-------11111111-------------- DENGELTEVEREPGANAIGVAFKITVKTPEYPRGRQFVVVANDITFKIGSFGPQEDEFFN 1111-------2222------------1111-----------3333%%%%---------- KVTEYARKRGIPRIYLAANSGARIGAEEIVPLFQVAWNDAANPDKGFQYLYLTSEGETLK -------------------------3333---------11113333-------------- KFDKENSVLTERTVINGEERFVIKTIIGSEDGLGVECLRGSGLIAGATSRAYHDIFTITL ---3333------------------------------------------3333------- VTCRSVGIGAYLVRLGQRAIQVEGQPIILTGAPAINKLGREVYTSNLQLGGTQIYNNGVS -----!!!!------------2222-------------------3333--11113333-- HLTAVDDLAGVEKIVEWSYVPAKRNPVPILETKDTWDRPVDFTPTNDETYDVRWIEGRET --------------------------------------------------3333------ ESGFEYGLFDKGSFFETLSGWAKGVVVGRARLGGIPLGVIGVETRTVENLIPADPANPNS ---------2222----11113333------iiii------------------1111--- AETLIQEPGQVWHPNSAFKTAQAINDFNNGEQLPILANWRGFSGGQRDFNEVLKYGSFIV --------------------------------------------3333------------ DALVDYKQPIIIYIPPTGELRGGSWVVVDPTINADQEYADVNARAGVLEPQGVGIKFRRE ----------------------3333--33333333---1111-----3333------33 KLLDTNRLDDKYRELRSQLSNKSLAPEVHQQISKQLADRERELLPIYGQISLQFADLHDR 33------3333----1111----3333--------------------------1111-3 SSRVAKGVISKELEWTEARRFFFWRLRRRLNEEYLIKRLSHQVGEASRLEKIARIRSWYP 3331111-------1111------------------------------------1111-1 ASVDHEDDRQVATWIEENYKTLDDKLKGLKLESFAQDLAKKIR 
11133333333-------------------------------- >PUTATIVE XYLANASE; SWP:Q93AQ5; PDB:1OD3A; VGGTRSAFSNIQAEDYDSSYGPNLQIFSLPGGGSAIGYIENGYSTTYKNIDFGDGATSVT -----1111--1111-----1111----1111-------2222--------!!!!----- ARVATQNATTIQVRLGSPSGTLLGTIYVGSTGSFDTYRDVSATISNTAGVKDIVLVFSGP ----------------1111------------1111------------------------ VNVDWFVFSKSG ------------ >PHOSPHOPANTETHEINE ADENYL; SWP:Q7SIA7; PDB:1OD6A; MHVVYPGSFDPLTNGHLDVIQRASRLFEKVTVAVLENQYLFSAEERLAIIREATAHLANV -----------------------------------------------------1111--- EAATFSGLLVDFVRRVGAQAIVKGLRAVSDYEYELQMAHLNRQLYPGLETLFILAATRYS -------3333--------------1111---------------2222-------3333- FVSSTMVKEIARYGGDVSKLVPPATLRALKAKLGQ ----------1111--1111----------1111- >TRANSCRIPTIONAL REGULATOR; SWP:P0AA16; PDB:1ODD; AVIAFGKFKLNLGTREMFREDEPMPLTSGEFAVLKALVSHPREPLSRDKLMNLARGREYS ----!!!!----------iiii-----------------2222----------------1 AMERSIDVQISRLRRMVEEDPAHPRYIQTVWGLGYVFVPD 111-3333-----------3333------2222------- >HYPOTHETICAL 33.3 KDA PRO; SWP:P42938; PDB:1ODFA; SKTVLDYTIEFLDKYIPEWFETGNKCPLFIFFSGPQGSGKSFTSIQIYNHLMEKYGGEKS -------------------1111-----------2222-----------------1111- IGYASIDDFYLTHEDQLKLNEQFKNNKLLQGRGLPGTHDMKLLQEVLNTIFNQDTVVLPK ----3333--------------11111111------------------------------ YDKSQFKGEGDRCPTGQKIKLPVDIFILEGWFLGFNPILQGIENNDLLTGDMVDVNAKLF ----%%%%-----------------------2222-----11113333!!!!-------- FYSDLLWRNPEIKSLGIVFTTDNINNVYGWRLQQEHELISKVGKGMTDEQVHAFVDRYMP -----1111--------------------------------------------------- SYKLYLNDFVRSESLGSIATLTLGIDSNRNVYSTKTRCIE -------------------------1111----------- >MGCM1; SWP:P70348; PDB:1ODHA; LSWDINDVKLPQNVKTTDWFQEWPDSYVKHIYSSDDRNAQRHLSSWAMRNTNNHNSRILK ------------------------%%%%--------------------------1111-- KSCLGVVVCSRDCSTEEGRKIYLRPAICDKARQKQQRKSCPNCNGPLKLIPCRGHGGFPV ---------------------------3333-----------------------iiii-- TNFWRHDGRFIFFQSKGEHDHPRPETKLEAEARRAMKK -------------------------3333--------- >PURINE NUCLEOSIDE PHOSPHO; SWP:Q72IR2; 
PDB:1ODKA; SPIHVRAHPGDVAERVLLPGDPGRAEWIAKTFLQNPRRYNDHRGLWGYTGLYKGVPVSVQ -------3333-----------------------------2222-------iiii----- TTGMGTPSAAIVVEELVRLGARVLVRVGTAGAASSDLAPGELIVAQGAVPLDGTTRQYLE ----------------1111-------------33332222-----------------ii GRPYAPVPDPEVFRALWRRAEALGYPHRVGLVASEDAFYATTPEEARAWARYGVLAFEME ii----------------------------------1111-------3333--------- ASALFLLGRMRGVRTGAILAVSNRIPPEVLQEGVRRMVEVALEAVLEV -------------------------------------------1111- >ISOPENICILLIN N SYNTHASE; SWP:P05326; PDB:1ODMA; SVSKANVPKIDVSPLFGDDQAAKMRVAQQIDAASRDTGFFYAVNHGINVQRLSQKTKEFH -------------1111------------------------------------------- MSITPEEKWDLAIRAYNKEHQDQVRAGYYLSIPGKKAVESFCYLNPNFTPDHPRIQAKTP ------------33333333-----------2222---------11111111------22 THEVNVWPDETKHPGFQDFAEQYYWDVFGLSSALLKGYALALGKEENFFARHFKPDDTLA 22------33332222-----------------------1111-1111-11113333--- SVVLIRYPYLDPYPEAAIKTAADGTKLSFEWHEDVSLITVLYQSNVQNLQVETAAGYQDI -------------3333---1111----------------------------1111---- EADDTGYLINCGSYMAHLTNNYYKAPIHRVKWVNAERQSLPFFVNLGYDSVIDPFDPREP ----------------1111--------------------------1111--------11 NGKSDREPLSYGDYLQNGLVSLINKNGQT 11--------------------------- >PUTATIVE CYTOCHROME P450 ; SWP:Q9KZR7; PDB:1ODOA; ALVLDPTGADHHTEHRTLREGGPATWVDVLGVQAWSVSDPVLLKQLLTSSDVSKDARAHW ----1111----------1111------iiii----------------1111--3333-1 PAFGEVVGTWPLALWVAVENMFTAYGPNHRKLRRLVAPAFSARRVDAMRPAVEAMVTGLV 111-3333-1111------3333--3333------3333--------------------- DRLAELPAGEPVDLRQELAYPLPIAVIGHLMGVPQDRRDGFRALVDGVFDTTLDQAEAQA --11112222--3333-----------------11113333----3333----------- NTARLYEVLDQLIAAKRATPGDDMTSLLIAARDDRLSPEELRDTLLLMISAGYETTVNVI ------------------------------------------------------3333-- DQAVHTLLTRPDQLALVRKGEVTWADVVEETLRHEPAVKHLPLRYAVTDIALPDGRTIAR ---------------------------------------------------1111---22 GEPILASYAAANRHPDWHEDADTFDATRTVKEHLAFGHGVHFCLGAPLARMEVTLALESL 22-----------1111--1111-1111-----1111-11111111-------------- 
FGRFPDLRLADPAEELPPVPSLISNGHQRLPVLLHA ---1111---1111---------------------- >MANNANASE A; SWP:CAA57670; PDB:1ODZA; VKPVTVKLVDSQATMETRSLFAFMQEQRRHSIMFGHQHETTQGLTITRTDGTQSDTFNAV ---------1111------------3333------------------------------- GDFAAVYGWDTLSIVAPKAEGDIVAQVKKAYARGGIITVSSHFDNPKTDTQKGVWPVGTS ----------1111----------------1111-------------1111-------11 WDQTPAVVDSLPGGAYNPVLNGYLDQVAEWANNLKDEQGRLIPVIFRLYHENTGSWFWWG 11----11112222---------------------1111-----------1111--1111 DKQSTPEQYKQLFRYSVEYLRDVKGVRNFLYAYSPNNFWDVTEANYLERYPGDEWVDVLG ----------------------------------------------1111-1111----- FDTYGPVADNADWFRNVVANAALVARMAEARGKIPVISGIGIRAPDIEAGLYDNQWYRKL ----------------------------------------------1111---------- ISGLKADPDAREIAFLLVWRNAPQGVPGGTQVPHYWVPANRPENINNGTLEDFQAFYADE --------3333--------------------------------1111----------11 FTAFNRDIEQVYQRPTLIV 11-3333------------ >DISSIMILATORY COPPER-CONT; SWP:O68601; PDB:1OE1A; DADKLPHTKVTLVAPPQVHPHEQATKSGPKVVEFTMTIEEKKMVIDDKGTTLQAMTFNGS 3333----------------------------------------------------iiii MPGPTLVVHEGDYVQLTLVNPATNAMPHNVDFHGATGALGGAKLTNVNPGEQATLRFKAD --------2222--------1111-------1111-%%%%3333---2222--------- RSGTFVYHCAPEGMVPWHVVSGMSGTLMVLPRDGLKDPQGKPLHYDRAYTIGEFDLYIPK ----------2222----1111--------1111--1111-------------------- GPDGKYKDYATLAESYGDTVQVMRTLTPSHIVFNGKVGALTGANALTAKVGETVLLIHSQ 1111------3333--------1111------iiii----!!!!----2222-------- ANRDTRPHLIGGHGDWVWETGKFANPPQRDLETWFIRGGSAGAALYTFKQPGVYAYLNHN --------2222-----11111111-----------2222-------------------- LIEAFELGAAGHIKVEGKWNDDLMKQIKAPAPIPR -------------------3333------------ >SINGLE-STRAND SELECTIVE M; SWP:Q9YGN6; PDB:1OE4A; ESPADSFLKVELELNLKLSNLVFQDPVQYVYNPLVYAWAPHENYVQTYCKSKKEVLFLGM -----------------1111----------1111------------------------- NPGPFGMAQTGVPFGEVNHVRDWLQIEGPVSKPEVEHPKRRIRGFECPQSEVSGARFWSL --1111------------------------------1111--!!!!-------------- FKSLCGQPETFFKHCFVHNHCPLIFMNHSGKNLTPTDLPKAQRDTLLEICDEALCQAVRV 
------3333----------------1111---3333----------------------- LGVKLVIGVGRFSEQRARKALMAEGIDVTVKGIMHPSPRNPQANKGWEGIVRGQLLELGV ------------------------------------11111111-----------1111- LSLLT 3333- >GLUTATHIONE S-TRANSFERASE; SWP:P30113; PDB:1OE8A; DHIKVIYFNGRGRAESIRMTLVAAGVNYEDERISFQDWPKIKPTIPGGRLPAVKITDNHG -----------1111------------------333333331111--------------- HVKWMVESLAIARYMAKKHHMMGGTEEEYYNVEKLIGQAEDLEHEYYKTLMKPEEEKQKI ---------------------------------------------3333----------- IKEILNGKVPVLLDIICESLKASTGKLAVGDKVTLADLVLIAVIDHVTDLDKEFLTGKYP ---1111------------1111----------3333---------33331111222233 EIHKHRENLLASSPRLAKYLSDRA 33---------------------- >FIBROBLAST GROWTH FACTOR ; SWP:P21802; PDB:1OECA; ELPEDPKWEFPRDKLTLGKPLGEGCFGQVVMAEAVGIKPKEAVTVAVKMLKDDATEKDLS ----3333--3333------------------------------------1111------ DLVSEMEMMKMIGKHKNIINLLGACTQDGPLYVIVEYASKGNLREYLRARREQMTFKDLV --------------1111-------------------3333-------------3333-- SCTYQLARGMEYLASQKCIHRDLAARNVLVTENNVMKIADFGLARDINNIDYYKKTTNGR -----------------------3333---2222------1111-------2222--111 LPVKWMAPEALFDRVYTHQSDVWSFGVLMWEIFTLGGSPYPGIPVEELFKLLKEGHRMDK 11111-3333------3333-----------1111----22223333------------- PANCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRILTLTT ----3333----3333--3333--------------1111- >ENOLASE; SWP:Q9NDH8; PDB:1OEPA; MTIQKVHGREVLDSRGNPTVEVEVTTEKGVFRSAVPSGASVYEACELRDGDKKRYVGKGC ------------1111---------1111---------------------3333iiii-- LQAVKNVNEVIGPALIGRDELKQEELDTLMLRLDGTPNKGKLGANAILGCSMAISKAAAA ----------333322223333-------------1111---1111-------------- AKGVPLYRYLASLAGTKELRLPVPCFNVINGGKHAGNALPFQEFMIAPVKATSFSEALRM ------------------------------!!!!-------------1111--------- GSEVYHSLKGIIKKKYGQDAVNVGDEGGFAPPIKDINEPLPILMEAIEEAGHRGKFAICM -----------------------1111----------3333----------2222----- DCAASETYDEKKQQYNLTWVTAEQLRETYCKWAHDYPIVSIEDPYDQDDFAGFAGITEAL --3333--3333---------1111------------------------3333----111 KGKTQIVGDDLTVTNTERIKMAIEKKACNSLLLKINQIGTISEAIASSKLCMENGWSVMV 
1----------%%%%------------------1111--------------1111----- SHRSGETEDTYIADLVVALGSGQIKTGAPCRGERTAKLNQLLRIEEELGAHAKFGFPGWS ---------3333-----------------3333--------------1111---1111- >ENDOTHIAPEPSIN; SWP:P11838; PDB:1OEWA; STGSATTTPIDSLDDAYITPVQIGTPAQTLNLDFDTGSSDLWVFSSETTASEVQTIYTPS ----------1111------------------------------11113333-----111 KSTTAKLLSGATWSISYGDGSSSSGDVYTDTVSVGGLTVTGQAVESAKKVSSSFTEDSTI 11111--2222-----1111-------------iiii-------------------1111 DGLLGLAFSTLNTVSPTQQKTFFDNAKASLDSPVFTADLGYHAPGTYNFGFIDTTAYTGS -------3333---------3333-3333-----------------------1111---- ITYTAVSTKQGFWEWTSTGYAVGSGTFKSTSIDGIADTGTTLLYLPATVVSAYWAQVSGA -------1111----------!!!!-----------1111----------------1111 KSSSSVGGYVFPCSATLPSFTFGVGSARIVIPGDYIDFGPISTGSSSCFGGIQSSAGIGI ---3333----1111--------!!!!----3333------2222----------3333- NIFGDVALKAAFVVFNGATTPTLGFASK ---33331111----------------- >NEUTROPHIL CYTOSOL FACTOR; SWP:P19878; PDB:1OEYA; GSHAYTLKVHYKYTVVKTQPGLPYSQVRDVSKKLELRLEHTKLSYRPRDSNELVPLSEDS ------------------22223333-----1111-3333------2222------3333 KDAWGQVKNYCLTLWCEN ---1111%%%%------- >Neutrophil cytosol factor; SWP:Q15080; PDB:1OEYJ; HTNWLRVYYYEDTISTIKDIAVEEDLSSTPLLKDLLELTRREFQREDIALNYRDAEGDLV ----------!!!!----------1111-------------------------1111--- RLLSDEDVALVRQARGLPSQKRLFPWKLHITQKDNYRVYNTP ----------1111-----------------11111111--- >MRNA EXPORT FACTOR MEX67; SWP:Q99257; PDB:1OF5A; QQFFFENDALGQSSTDFATNFLNLWDNNREQLLNLYSPQSQFSVSVDSTSIGQESINSIF 1111---1111----------------3333-----1111------------------33 KTLPKTKHHLQEQPNEYSMETISYPQINGFVITLHGFFEETSNNKLSKKSFDRTWVIVPM 33------33333333-------------------------------------------% NNSVIIASDLLTVRAYSTGAWKTASIAIAQAAGS %%%------------------------------- >mRNA transport regulator ; SWP:P34232; PDB:1OF5B; NQAQITATFTKKILAHLDDPDSNLAQFVQLFNPNCRIIFNATPFAQATVFLQMWQNQVVQ --3333---------1111----33331111-------iiii------------------ TQHALTGVDYHAIPGSGTLICNVNCKVRFDWGPYFGISLQLIIDDRIFRNDFNGVISGFN 
---------------------------------------------3333----------- YNMVYKPE -------- >PHOSPHO-2-DEHYDRO-3-DEOXY; SWP:P32449; PDB:1OF8A; VRILGYDPLASPALLQVQIPATPTSLETAKRGRREAIDIITGKDDRVLVIVGPCSIHDLE --------------------------------------1111------------------ AAQEYALRLKKLSDELKGDLSIIMRAYLEKPRTTVGWKGLINDPDVNNTFNINKGLQSAR ---------------3333-------------------33331111-------------- QLFVNLTNIGLPIGSEMLDTISPQYLADLVSFGAIGARTTESQLHRELASGLSFPVGFKN ------1111----------3333-3333------1111---------1111-------- GTDGTLNVAVDACQAAAHSHHFMGVTKHGVAAITTTKGNEHCFVILRGGKKGTNYDAKSV 1111----------3333-------1111-------------------1111-------- AEAKAQLPAGSNGLMIDYSHGNSNKDFRNQPKVNDVVCEQIANGENAITGVMIESNINEG ---11112222-------!!!!%%%%1111----------1111---------------- NQGILKYGVSITDACIGWETTEDVLRKLAAAVRQRREVN -----2222------------------------------ >PORE-FORMING PEPTIDE AMEO; SWP:P34095; PDB:1OF9A; GEILCNLCTGLINTLENLLTTKGADKVKDYISSLCNKASGFIATLCTKVLDFGIDKLIQL ---333333333333--------3333------------1111----------------- IEDKVDANAICAKIHAC -------3333------ >Chromatin-remodeling comp; SWP:Q24368; PDB:1OFCX; AVDAYFREALKAPRPPKQPIVQDFQFFPPRLFELLDQEIYYFRKTVGYKVPKNTKVQREE 3333----------1111---1111--3333----------------------1111--- QRKIDEAEPLTEEEIQEKENLLSQGFTAWTKRDFNQFIKANEKYGRDDIDNIAKDVEGKT ---1111-------------1111-1111---------------3333---11112222- PEEVIEYNAVFWERCTELQDIERIMGQIERGEGKIQRRLSIKKALDQKMSRYRAPFHQLR -------------33331111---------------------------1111-3333--- LQYGNNKGKNYTEIEDRFLVCMLHKLGFDKENVYEELRAAIRASPQFRFDWFIKSRTALE --!!!!--------------------1111-------------3333---3333------ LQRRCNTLITLIERENIELEEKERAEK ---------------------3333-- >FERREDOXIN-DEPENDENT GLUT; SWP:P55038; PDB:1OFDA; CGVGFIANLRGKPDHTLVEQALKALGCMEHRGGCSADNDSGDGAGVMTAIPRELLAQWFN ---------------------------3333---1111------------3333-----1 TRNLPMPDGDRLGVGMVFLPQEPSAREVARAYVEEVVRLEKLTVLGWREVPVNSDVLGIQ 111----1111--------------------------1111-----------1111---- AKNNQPHIEQILVTCPEGCAGDELDRRLYIARSIIGKKLAEDFYVCSFSCRTIVYKGMVR 
--------------1111----------------3333-1111----------------1 SIILGEFYLDLKNPGYTSNFAVYHRRFSTNTMPKWPLAQPMRLLGHNGEINTLLGNINWM 111----3333-1111-----------------3333----------------------- AAREKELEVSGWTKAELEALTPIVNQANSDSYNLDSALELLVRTGRSPLEAAMILVPEAY --3333--2222----3333----3333-------------1111--------------- KNQPALKDYPEISDFHDYYSGLQEPWDGPALLVFSDGKIVGAGLDRNGLRPARYCITKDD --3333-----------3333-----------------------1111--------1111 YIVLGSEAGVVDLPEVDIVEKGRLAPGQMIAVDLAEQKILKNYQIKQQAAQKYPYGEWIK -------------3333-------2222-----1111-----------1111-------- IQRQTVASDSFAEKTLFNDAQTVLQQQAAFGYTAEDVEMVVVPMASQGKEPTFCMGDDTP ---------------------------1111-3333------------------------ LAVLSHKPRLLYDYFKQRFAQVTNPPIDPLRENLVMSLAMFLGKRGNLLEPKAESARTIK 3333-----3333----------------1111-------------3333-3333----- LRSPLVNEVELQAIKTGQLQVAEVSTLYDLDGVNSLEDALTNLVKTAIATVQAGAEILVL --------------------------------------------------1111------ TDRPNGAILTENQSFIPPLLAVGAVHHHLIRAGLRLKASLIVDTAQCWSTHHFACLVGYG --2222---1111----------------11111111-------------------1111 ASAICPYLALESVRQWWLDEKTQKLMENGRLDRIDLPTALKNYRQSVEAGLFKILSKMGI ------------------------------------------------------------ SLLASYHGAQIFEAIGLGAELVEYAFAGTTSRVGGLTIADVAGEVMVFHGMAFKKLENFG -33333333----------------2222-1111-------------------------- FVNYRPGGEYHMNSPEMSKSLHKAVAAYDHYELYRQYLKDRPVTALRDLLDFNADQPAIS -------------------------------------1111---3333-----------3 LEEVESVESIVKRFCTGGMSLGALSREAHETLAIAMNRLGAKSNSGEGGEDVVRYLTLDD 333--3333----------3333--3333----------------------1111----- VDSEGNSPTLPHLHGLQNGDTANSAIKQIASGRFGVTPEYLMSGKQLEIKMAQGAKPGEG -1111-3333------3333---------3333-------------------3333---- GQLPGKKVSEYIAMLRRSKPGVTLISPPPHHDIYSIEDLAQLIYDLHQINPEAQVSVKLV ---3333-----------1111---------------------------1111------- AEIGIGTIAAGVAKANADIIQISGHDGGTGASPLSSIKHAGSPWELGVTEVHRVLMENQL -2222-------1111-------3333---------------3333---------11113 RDRVLLRADGGLKTGWDVVMAALMGAEEYGFGSIAMIAEGCIMARVCHTNNCPVGVATQQ 
333------------------1111-----------1111-----1111----------1 ERLRQRFKGVPGQVVNFFYFIAEEVRSLLAHLGYRSLDDIIGRTDLLKVRSDVQLSKTQN 1111111--3333----------------------333322221111------------- LTLDCLLNLPDTKQNRQWLNHEPVHSNGPVLDDDILADPDIQEAINHQTTATKTYRLVNT --3333---------3333----------3333------------------------111 DRTVGTRLSGAIAKKYGNNGFEGNITLNFQGAAGQSFGAFNLDGMTLHLQGEANDYVGKG 1-2222-----------------------------2222--2222------------222 MNGGEIVIVPHPQASFAPEDNVIIGNTCLYGATGGNLYANGRAGERFAVRNSVGKAVIEG 2---------1111--3333--------2222---------------------------- AGDHCCEYMTGGVIVVLGPVGRNVGAGMTGGLAYFLDEVGDLPEKINPEIITLQRITASK ------------------------2222------------3333---------------- GEEQLKSLITAHVEHTGSPKGKAILANWSDYLGKFWQAVPPSEKDSPEANN --------------------------33333333--------11111111- >FERREDOXIN I; SWP:P27320; PDB:1OFFA; ASYTVKLITPDGESSIECSDDTYILDAAEEAGLDLPYSCRAGACSTCAGKITAGSVDQSD --------3333------1111----------------------1111---------111 QSFLDDDQIEAGYVLTCVAYPTSDCTIETHKEEDL 1-------------3333------------3333- >ATP-DEPENDENT PROTEASE HS; SWP:P43773; PDB:1OFHA; SEMTPREIVSELDQHIIGQADAKRAVAIALRNRWRRMQLQEPLRHEVTPKNILMIGPTGV -----------1111--------------------------3333--------------- GKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSIIRDLTDSAGGAIDAVEQNGIVF 3333----------------3333------11113333-----1111------------- IDEIDKICKKGEYSGADVSREGVQRDLLPLVEGSTVSTKHGMVKTDHILFIASGAFQVAR --3333---------3333------------------1111------------------3 PSDLIPELQGRLPIRVELTALSAADFERILTEPHASLTEQYKALMATEGVNIAFTTDAVK 333-----1111--------------------------------3333------------ KIAEAAFRVNEKTENIGARRLHTVMERLMDKISFSASDMNGQTVNIDAAYVADALGEVVE ----------------------------------33332222------------------ NEDLSRFIL ---3333-- >CHONDROITINASE B; SWP:Q46079; PDB:1OFLA; VVASNETLYQVVKEVKPGGLVQIADGTYKDVQLIVSNSGKSGLPITIKALNPGKVFFTGD ---------------2222--------------------2222-------2222------ AKVELRGEHLILEGIWFKDGNRAIQAWKSHGPGLVAIYGSYNRITACVFDCFDEANSAYI ----------------------1111-2222----------------------------- 
TTSLTEDGKVPQHCRIDHCSFTDKITFDQVINLNNTARAIKDGSVGGPAMYHRVDHCFFS ----1111---------------------------------------------------- NPQKPGNAGGGIRIGYYRNDIGRCLVDSNLFMRQDSEAEIITSKSQENVYYGNTYLNCQG ----------------1111---------------------------------------- TMNFRHGDHQVAINNFYIGNDQRFGYGGMFVWGSRHVIACNYFELSETIKSRGNAALYLN ------------------------------------------------3333-------- PGAMASEHALAFDMLIANNAFINVNGYAIHFNPLDERRKEYCAANRLKFETPHQLMLKGN --2222------------------------------------------------------ LFFKDKPYVYPFFKDDYFIAGKNSWTGNVALGVEKGIPVNISANRSAYKPVKIKDIQPIE ------------------2222---------------------3333-----------22 GIALDLNALISKGITGKPLSWDEVRPYWLKEMPGTYALTARLSADRAAKFKAVIKRNKEH 22-------3333------3333--1111------1111---------------1111-- >HYPOTHETICAL PROTEIN PA30; SWP:Q9HZJ8; PDB:1OFTA; PAAFSELSLSGLPGHCLTLLAPILRELSEEQDARWLTLIAPPASLTHEWLRRAGLNRERI -----------------------------------------3333----------1111- LLLQAKDNAAALALSCEALRLGRSHTVVSWLEPLSRAARKQLSRAAQLGQAQSLNIRLG ----------------------------------------------------------- >HYPOTHETICAL PROTEIN PA30; SWP:P47204; PDB:1OFUA; TAVIKVIGVGGGGGNAVNHMAKNNVEGVEFICANTDAQALKNIAARTVLQLGPGVTKGLG -----------------------------------3333------------3333%%%%- AGANPEVGRQAALEDRERISEVLEGADMVFITTGMGGGTGTGAAPIIAEVAKEMGILTVA ----------------------2222------------3333---------1111----- VVTRPFPFEGRKRMQIADEGIRALAESVDSLITIPNEKLLTILGKDASLLAAFAKADDVL -----3333--------------1111---------------!!!!-------------- AGAVRGISDIIKRPGMINVDFADVKTVMSEMGMAMMGTGCASGPNRAREATEAAIRNPLL -------------------3333-------------------1111----------1111 EDVNLQGARGILVNITAGPDLSLGEYSDVGNIIEQFASEHATVKVGTVIDADMRDELHVT ---3333----------1111-----------3333-1111--------1111------- VVATGLG ------- >NINE-HEME CYTOCHROME C; SWP:Q9RN68; PDB:1OFWA; AALEPTDSGAPSAIVMFPVGEKPNPKGAAMKPVVFNHLIHEKKIDNCETCHHTGDPVSCS -----3333--------------1111------------------1111-1111---333 TCHTVEGKAEGNYITLDRAMHATNIAKRAKGNTPVSCVSCHEQQTKERRECAGCHAIVTP 3--11113333------------------------------------3333-1111---- 
KRDEAWCATCHNITPSMTPEQMQKGINGTLLPGDNEALAAETVLAQKTVEPVSPMLAPYK --3333-------3333-------------3333--------1111------3333---- VVIDALADKYEPSNFTHRRHLTSLMERIKDDKLAQAFHNKPEILCATCHHRSPLSLTPPK ---1111--------------------1111--------11111111------------1 CGSCHTKEIDKANPGRPNLMAAYHLQCMGCHKGMDVARPRDTDCTTCHKAAPK 111------3333--------------------------1111-1111----- >FUCOSE-SPECIFIC LECTIN; SWP:P18891; PDB:1OFZA; PTEFLYTSKIAAISWAATGGRQQRVYFQDLNGKIREAQRGGDNPWTGGSSQNVIGEAKLF ----2222--------------------1111-------!!!!-----1111-----222 SPLAAVTWKSAQGIQIRVYCVNKDNILSEFVYDGSKWITGQLGSVGVKVGSNSKLAALQW 2--------1111--------1111---------------3333-----1111------- GGSESAPPNIRVYYQKSNLSGSSIHEYVWSGKWTAGASFGSTAPGTGIGATAIGPGRLRI ------------------2222-------------------------------2222--- YYQATDNKIREHCWDSNSWYVGGFSASASAGVSIAAISWGSTPNIRVYWQKGREELYEAA ---1111-------------------------------------------2222------ YGGSWNTPGQIKDASRPTPSLPDTFIAANSSGNIDISVFFQASGVSLQQWQWISGKGWSI ------------1111-------------------------------------------- GAVVPTGTPAGW --------2222 >HYPOTHETICAL OXIDOREDUCTA; SWP:P76187; PDB:1OG6A; LVQRITIAPQGPEFSRFVMGYWRLMDWNMSARQLVSFIEEHLDLGVTTVDHADIYGGYQC ----------------------3333---3333--------1111---------%%%%-3 EAAFGEALKLAPHLRERMEIVSKCGIATTAREENVIGHYITDRDHIIKSAEQSLINLATD 333----1111--1111----------3333----------------------------- HLDLLLIHRPDPLMDADEVADAFKHLHQSGKVRHFGVSNFTPAQFALLQSRLPFTLATNQ ----------1111----------------------------------1111-------- VEISPVHQPLLLDGTLDQLQQLRVRPMAWSCLGGGRLFNDDYFQPLRDELAVVAEELNAG ---11111111------------------1111------3333----------3333--- SIEQVVNAWVLRLPSQPLPIIGSGKIERVRAAVEAETLKMTRQQWFRIRKAALGYDVP 3333--------1111------------------1111--3333-------------- >T-cell receptor beta chai; SWP:P01850; PDB:1OGAD; QLLEQSPQFLSIQEGENLTVYCNSSSVFSSLQWYRQEPGEGPVLLVTVVTGGEVKKLKRL ------------2222--------------------2222---------2222---!!!! 
TFQFGDARKDSSLHITAAQPGDTGLYLCAGAGSQGNLIFGKGTKLSVKPNIQNPDPAVYQ ----1111----------3333---------1111------------------------- LRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKTVLDMRSMDFKSNSAVAWSNKSDF --1111-----------3333------1111----------1111--------------- ACANAFNNSIIPEDTFFPSP 3333-1111----------- >T-cell receptor beta chai; SWP:P01850; PDB:1OGAE; GITQSPKYLFRKEGQNVTLSCEQNLNHDAMYWYRQDPGQGLRLIYYSQIVNDFQKGDIAE -----------2222--------------------2222---------2222------22 GYSVSREKKESFPLTVTSAQKNPTAFYLCASSSRSSYEQYFGPGTRLTVTEDLKNVFPPE 22-----1111----------------------2222--------------1111----- VAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLKEQPALN ------------------------------------iiii---------------3333- DSRYSLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRAD ------------3333--3333-----------3333----------------------- Q - >HIGH AFFINITY RIBOSE TRAN; SWP:P36946; PDB:1OGDA; MKKHGILNSHLAKILADLGHTDKIVIADAGLPVPDGVLKIDLSLKPGLPAFQDTAAVLAE --------------11112222-----1111--2222-------2222------------ EMAVEKVIAAAEIKASNQENAKFLENLFSEQEIEYLSHEEFKLLTKDAKAVIRTGEFTPY ---------3333--------------3333-------------1111------------ ANCILQAGVLF ----------- >DEOXYURIDINE TRIPHOSPHATA; SWP:O15923; PDB:1OGKA; VPARVLNSLAHLQDGLNIFMDPDWRQIRHVDDWALAITMESAELIDSYPWKWWKNVKAQT ----------------------3333--1111------------3333------1111-- DMHNVRIEIADILHFSLSGEIQKRDDVALKSLKEMGFFCRPPADDELLELMFFPLTEVAS -------------------------------------------3333------------- AVATFRNIIQLASIYRFDLITKGLLLAAQDLDFNLVGYYVAKYTLNQIRQLEDNELLHEC -----------11113333--------------------------------3333----- VQSVSVEDVLNEGTYLKAWEKIACSVFDAFGMPEEERRHAYDWLKSAALD 3333---------3333---------------33331111---------- >DEOXYURIDINE TRIPHOSPHATA; SWP:O15923; PDB:1OGLA; RVPARVLNSLAHLQDGLNIFMDPDWRQIRHVDDWALAITMESAELIDSYPWKWWKNVKAQ --3333---------------1111----3333------------------1111----- TDMHNVRIEIADILHFSLSGEIQKRTQDDDVALKSLKEMGFFCRPPADELLELMFFPLTE ----------------------1111----------3333-------3333-----3333 
VASAVATFRNIIQLASIYRFDLITKGLLLAAQDLDFNLVGYYVAKYTLNQIRQLKGYKEG --------------1111-------------1111-------------------3333-- VYVKVREGVEDNELLHECVQSVSVEDVLNEGTYLKAWEKIACSVFDAFGMPEEERRHAYD ---------3333-----33333333--3333------------------3333------ WLKSAA 1111-- >Dextranase [Precursor]; SWP:P48845; PDB:1OGOX; TTANTHCGADFCTWWHDSGEINTQTPVQPGNVRQSHKYSVQVSLAGTNNFHDSFVYESIP -------3333----------------1111------------2222-----------22 RNGNGRIYAPTDPPNSNTLDSSVDDGISIEPSIGLNMAWSQFEYSHDVDVKILATDGSSL 22------1111-------1111-----3333---------------------1111--- GSPSDVVIRPVSISYAISQSDDGGIVIRVPADANGRKFSVEFKTDLYTFLSDGNEYVTSG -3333----3333------1111--------1111------1111--------------- GSVVGVEPTNALVIFASPFLPSGMIPHMTPDNTQTMTPGPINNGDWGAKSILYFPPGVYW --------------------3333----3333---------2222--------------- MNQDQSGNSGKLGSNHIRLNSNTYWVYLAPGAYVKGAIEYFTKQNFYATGHGILSGENYV ---1111------------3333-----2222-----------------------11112 YQANAGDNYIAVKSDSTSLRMWWHNNLGGGQTWYCVGPTINAPPFNTMDFNGNSGISSQI 222---%%%%---3333------------------------------------------- SDYKQVGAFFFQTDGPEIYPNSVVHDVFWHVNDDAIKIYYSGASVSRATIWKCHNDPIIQ ------------------2222-------------------------------------- MGWTSRDISGVTIDTLNVIHTRYIKSETVVPSAIIGASPFYASGMSPDSRKSISMTVSNV -----------------------------------------------1111--------- VCEGLCPSLFRITPLQNYKNFVVKNVAFPDGLQTNSIGTGESIIPAASGLTMGLAISAWT ---------------------------1111---3333--------2222---------- IGGQKVTMENFQANSLGQFNIDGSYWGEWQIS iiii--1111-1111------3333------- >SULFITE OXIDASE; SWP:Q9S850; PDB:1OGPA; PGIRGPSEYSQEPPRHPSLKVNAKEPFNAEPPRSALVSSYVTPVDLFYKRNHGPIPIVDH ---------------1111-------------1111------3333-------------- LQSYSVTLTGLIQNPRKLFIKDIRSLPKYNVTATLQCAGNRRTAMSKVRNVRGVGWDVSA 2222------------------1111----------3333-----3333----------- IGNAVWGGAKLADVLELVGIPKLTASTNLGARHVEFVSVDRCKEENGGPYKASITLSQAT ---------------1111-------1111-----------3333---------3333-- NPEADVLLAYEMNGETLNRDHGFPLRVVVPGVIGARSVKWLDSINVIAEESQGFFMQKDY 3333-------iiii--3333-------22223333----------------1111---- 
KMFPPSVNWDNINWSSRRPQMDFPVQSAICSVEDVQMVKPGKVSIKGYAVSGGGRGIERV ---33331111-1111-----------------------------------iiii----- DISLDGGKNWVEASRTQEPGKQYISEHSSSDKWAWVLFEATIDVSQTTEVIAKAVDSAAN ----%%%%------------------33331111---------------------1111- VQPENVESVWNLRGVLNTSWHRVLLRLG ----3333--1111-------------- >POLYGALACTURONASE INHIBIT; SWP:P58822; PDB:1OGQA; ELCNPQDKQALLQIKKDLGNPTTLSSWLPTTDCCNRTWLGVLCDTDTQTYRVNNLDLSGL ---------------1111-3333---11111111--2222---1111------------ NLPKPYPIPSSLANLPYLNFLYIGGINNLVGPIPPAIAKLTQLHYLYITHTNVSGAIPDF ---------3333-1111-------1111----3333--1111--------------333 LSQIKTLVTLDFSYNALSGTLPPSISSLPNLVGITFDGNRISGAIPDSYGSFSKLFTSMT 3--1111--------------1111--1111---------------------1111---- ISRNRLTGKIPPTFANLNLAFVDLSRNMLEGDASVLFGSDKNTQKIHLAKNSLAFDLGKV -------------3333---------------3333-1111--------------3333- GLSKNLNGLDLRNNRIYGTLPQGLTQLKFLHSLNVSFNNLCGEIPQGGNLQRFDVSAYAN --1111--------------3333--1111----------------!!!!---3333--- NKCLCGSPLPACT ------------- >UBIQUITIN; SWP:P02248; PDB:1OGWA; MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQEDGRTLSDYNI ------1111-------1111---------------1111----iiii-11113333--- QKESTHLVLRLRGG 2222---------- >PERIPLASMIC NITRATE REDUC; SWP:Q53176; PDB:1OGYA; IRWSKAPCRFCGTGCGVMVGTRDGQVVATHGDTQAEVNRGLNCVKGYFLSKIMYGEDRLT ---------------------%%%%------1111-------3333-3333---3333-- TPLLRMKDGVYHKEGEFAPVSWDEAFDVMAAQAKLVLKEKAPEAVGMFGSGQWTIWEGYA ----------------------------------------1111---------1111--- ASKLMRAGFRSNNLDPNARHCMASAATAFMRTFGMDEPMGCYDDFEAADAFVLWGSNMAE ----------------3333---------------------3333-----------3333 MHPILWSRLTDRRLSHEHVRVAVLSTFTHRSSDLSDTPIIFRPGTDRAILNYIAHHIIST ------------3333------------3333---------2222------------111 GRVNRDFVDRHTNFALGATDIGYGLRPEHQLQLAAKGAADAGAMTPTDFETFAALVSEYT 1---3333--------------------3333--2222---------3333----33333 LEKAAEISGVEPALLEELAELYADPDRKWMSLWTMGFNQHVRGVWANHMVYNLHLLTGKI 333-------3333-----3333----------3333----------------------- 
SEPGNSPFSLTGQPFACGTAREVGTFAHRLPADMVVTNPEHRAHAEEIWKLPAGLLPDWV --------------------3333-1111-%%%%-----------------2222----- GAHAVEQDRKLHDGEINFYWVQVNNNMQAAPNIDQETYPGYRNPENFIVVSDAYPTVTGR ---3333-----------------3333---3333-------1111---------3333- AADLVLPAAMWVEKEGAYGNAERRTHFWHQLVEAPGEARSDLWQLMEFSKRFTTDEVWPE -------------------1111-----------!!!!--------3333---------- EILSAAPAYRGKTLFEVLFANGSVDRFPASDVNPDHANHEAALFGFYPQKGLFEEYAAFG --------------3333----1111-----------1111---------------3333 RGHGHDLAPFDTYHEVRGLHWPVVEGEETRWRYREGFDPYVKPGEGLRFYGKPDGRAVIL ---------3333----------%%%%----------1111----------1111----- GVPYEPPAESPDEEFGFWLVTGRVLEHWHSGSMTLRWPELYKAFPGAVCFMHPEDARSRG -----------------------3333!!!!-1111-3333------------------- LNRGSEVRVISRRGEIRTRLETRGRNRMPRGVVFVPWFDASQLINKVTLDANDPISRQTD -2222-----1111--------------2222------3333-3333------------- FKKCAVKIEA ---------- >Diheme cytochrome c napB ; SWP:Q53177; PDB:1OGYB; DAPRLTGADRPMSEVAAPPLPETITDDRRVGRNYPEQPPVIPHSIEGYQLSVNANRCLEC --------------------------------------------------11113333-- HRRQYSGLVAAPMISITHFQDREGQMLADVSPRRYFCTACHVPQTNAQPLVTNEFRDMLT --------------------2222-------22223333--------------------- LMPASNE ------- >STEROID DELTA-ISOMERASE; SWP:P07445; PDB:1OH0A; LPTAQEVQGLMARYIELVDVGDIEAIVQMYADDATVEDPFGQPPIHGREQIAAFYRQGLG --------------------------33331111-------------------------- GGKVRACLTGPVRASHNGCGAMPFRVEMVWNGQPCALDVIDVMRFDEHGRIQTMQAYWSE -----------------------------%%%%------------1111---------33 VNLSV 33--- >STAPHOSTATIN A; SWP:Q99SX7; PDB:1OH1A; GSMEQFELFSIDKFKCNSEAKYYLNIIEGEWHPQDLNDSPLKFILSTSDDSDYICKYINT -----------------------------------%%%%--------------------- EHKQLTLYNKNNSSIVIEIFIPNDNKILLTIMNTEALGTSPRMTFIKHK ------------------------------------------------- >BETA-MANNOSIDASE; SWP:Q9RIK9; PDB:1OH4A; ARYVLAEEVDFSSPEEVKNWWNSGTWQAEFGSPDIEWNGEVGNGALQLNVKLPGKSDWEE ------------33331111-------------------2222----------------- VRVARKFERLSECEILEYDIYIPNVEGLKGRLRPYAVLNPGWVKIGLDMNNANVESAEII 
------1111--------------2222-----------------2222---1111---- TFGGKEYRRFHVRIEFDRTAGVKELHIGVVGDHLRYDGPIFIDNVRLYKRTGGM -iiii-------------2222----------------------------iiii >CDC14B2 PHOSPHATASE; SWP:O60729; PDB:1OHEA; RDPQDDVYLDITDRLCFAILYSRPKSASNVHYFSIDNELEYENFYADFGPLNLAMVYRYC -3333---------------------1111----1111---------------------- CKINKKLKSITMLRKKIVHFTGSDQRKQANAAFLVGCYMVIYLGRTPEEAYRILIFGETS --------3333------------------------------------------------ YIPFRDAAYGSCNFYITLLDCFHAVKKAMQYGFLNFNSFNLDEYEHYEKAENGDLNWIIP ---------------------------------------------33331111-----22 DRFIAFCGPHSRARLESGYHQHSPETYIQYFKNHNVTTIIRLNKRMYDAKRFTDAGFDHH 22-------------iiii---3333---------------------33333333----- DLFFADGSTPTDAIVKEFLDICENAEGAIAVHSKAGLGRTGTLIACYIMKHYRMTAAETI ----2222--3333---------------------------------------------- AWVRICRPGSVIGPQQQFLVMKQTNLWLEGDYFRQKLKG ------2222-!!!!------------------------ >NUDAURELIA CAPENSIS OMEGA; SWP:Q90063; PDB:1OHFA; VITFPTNVATMPEFRSWARGKLDIDQDSIGWYFKYLDPAGATESARAVGEYSKIPDGLVK -------11113333--------------------------1111--------------- FSVDAEIREIYNEECPTVSDASIPLDGAQWSLSIISYPMFRTAYFAVANVDNKEISLDVT ------------------3333--------------------------1111-------- NDLIVWLNNLASWRDVVDSGQWFTFSDDPTWFVRIRVLHPTYDLPDPTEGLLRTVSDYRL -----------3333------------1111---------3333-3333----------- TYKSITCEANMPTLVDQGFWIGGHYALTPIATTQNAVEGSGFVHPFNVTRPGIAAGVTLT -----------3333--------------------------------------------- WASMPPGGSAPSGDPAWIPDSTTQFQWRHGGFDAPTGVITYTIPRGYTMQYFDTTTNEWN 2222-------------------------------------------------1111--- GFANPDDVVTFGQTGGAAGTNATITITAPTVTLTILATTTSAANVINFRNLDAETTAASN ---2222---------%%%%---------------------------------------- RSEVPLPPLTFGQTAPNNPKIEQTLVKDTLGSYLVHSKMRNPVFQLTPASSFGAISFTNP ---------33333333-------3333------------------------------22 GFDRNLDLPGFGGIRDSLDVNMSTAVCHFRSLSKSCSIVTKTYQGWEGVTNVNTPFGQFA 221111--------------------------1111-----------------1111--- HSGLLKNDEILCLADDLATRLTGVYGATDNFAAAVLAFAANMLTSVLKSEATTSVIKEL 
-------------------------1111------------------------------ >DELTA-AMINOLEVULINIC ACID; SWP:P05373; PDB:1OHLA; MHTAEFLETEPTEISSVLAGGYNHPLLRQWQSERQLTKNMLIFPLFISDNPDDFTEIDSL ------------3333-3333--3333-1111----1111---------1111------2 PNINRIGVNRLKDYLKPLVAKGLRSVILFGVPLIPGTKDPVGTAADDPAGPVIQGIKFIR 222---3333-3333---1111-----------2222----3333-1111---------- EYFPELYIICDVCLCEYTSHGHCGVLYDDGTINRERSVSRLAAVAVNYAKAGAHCVAPSD --1111-------11111111-----1111------------------------------ MIDGRIRDIKRGLINANLAHKTFVLSYAAKFSGNLYGPFRDAACSAPSNGDRKCYQLPPA -2222--------11111111---------------3333-----------1111--111 GRGLARRALERDMSEGADGIIVKPSTFYLDIMRDASEICKDLPICAYHVSGEYAMLHAAA 1-----------1111--------1111----------1111------------------ EKGVVDLKTIAFESHQGFLRAGARLIITYLAPEFLDWLDE ---------------------------111133333333- >BACTERIOCIN SAKACIN P; SWP:P35618; PDB:1OHMA; KYYGNGVHCGKHSCTVDWGTAIGCIGNNAAANWATGGNAGWNKC ---------3333----3333----------------------- >STEROID DELTA-ISOMERASE; SWP:P00947; PDB:1OHPA; MNTPEHMTAVVQRYVAALNAGDLDGIVALFADDATVENPVGSEPRSGTAAIREFYANSLK --------------------------11111111----2222--------------1111 LPLAVELTQEVRAVANEAAFAFIVSFEYQGRKTVVAPIDHFRFNGAGKVVSMRALFGEKN ---------------------------iiii------------1111---------3333 IHAGA ----- >IMMUNOGLOBULIN; SWP:NA; PDB:1OHQA; EVQLLESGGGLVQPGGSLRLSCAASGFRISDEDMGWVRQAPGKGLEWVSSIYGPSGSTYY ------------2222-----------3333--------2222----------------- ADSVKGRFTISRDNSKNTLYLQMNSLRAEDTAVYYCASALEPLSEPLGFWGQGTLVTVSS 3333---------1111---------1111------------------------------ >CG14704 PROTEIN; SWP:Q9VGN3; PDB:1OHTA; TARLLSRSDWGARLPKSVEHFQGPAPYVIIHHSYPAVCYSTPDCKSRDQDFHQLERGWND -----3333------------------------------3333----------------- IGYSFGIGGDGIYTGRGFNVIGAHAPKYNDKSVGIVLIGDWRTELPPKQLDAAKNLIAFG -------1111-------------2222-------------------------------- VFKGYIDPAYKLLGHRQVRDTECPGGRLFAEISSWPHFTHINDTEGVS -------------3333-----------------2222-1111----- >APOPTOSIS REGULATOR CED-9; SWP:P41958; PDB:1OHUA; 
NDWEEPRLDIEGFVVDYFTHRIRQNGEWFGAPGLPSGVQPEHERVGTIFEKKHAENFETF -11111111-------------1111-1111--1111-3333------------------ SEQLLAVPRISFSLYQDVVRTVGNPSYGRLIGLISFGGFVAAKESVELQGQVRNLFVYTS ------------------1111---3333---------------3333------------ LFIKTRIRNNWKEHNRSWDDFTLGKQKEDYERAEAEK ------1111-1111-3333------------1111- >4-AMINOBUTYRATE AMINOTRAN; SWP:P80147; PDB:1OHVA; FDYDGPLMKTEVPGPRSRELMKQLNIIQNAEAVHFFCNYEESRGNYLVDVDGNRMLDLYS -----------------------3333--3333----3333-!!!!--1111------%% QISSIPIGYSHPALVKLVQQPQNVSTFINRPALGILPPENFVEKLRESLLSVAPKGMSQL %%-----------------33333333----3333--1111-------1111-2222--- ITMACGSCSNENAFKTIFMWYRSKERGQSAFSKEELETCMINQAPGCPDYSILSFMGAFH -------------------------!!!!-------------------------2222-- GRTMGCLATTHSKAIHKIDIPSFDWPIAPFPRLKYPLEEFVKENQQEEARCLEEVEDLIV --33331111--33332222---------------1111--------------------- KYRKKKKTVAGIIVEPIQSEGGDNHASDDFFRKLRDISRKHGCAFLVDEVQTGGGSTGKF --1111------------1111----------------1111------------1111-- WAHEHWGLDDPADVMTFSKKMMTGGFFHKEEFRPNAPYRIFNTWLGDPSKNLLLAEVINI 3333--------------1111------3333--------------3333---------- IKREDLLSNAAHAGKVLLTGLLDLQARYPQFISRVRGRGTFCSFDTPDESIRNKLISIAR -1111----------------------3333------!!!!------------------- NKGVMLGGCGDKSIRFRPTLVFRDHHAHLFLNIFSDILADF ------------------11113333-----------1111 >HYPOTHETICAL PROTEIN AF21; SWP:O28085; PDB:1OI0A; GSSMKISRGLLKTILEAAKSAHPDEFIALLSGSKDVMDELIFLPFVSIGMKVFGTVHSHP ------------------------------------------------------------ SPSCRPSEEDLSLFTRFGKYHIIVCYPYDENSWKCYNRKGEEVELEVV ----------------------------1111----1111-------- >HYPOTHETICAL PROTEIN YCGT; SWP:P76015; PDB:1OI2A; DVLDEQLAGLAKAHPSLTLHQDPVYVTRADAPVAGKVALLSGGGSGHEPMHCGYIGQGML -------------1111----------1111-2222--------------1111-2222- SGACPGEIFTSPTPDKIFECAMQVDGGEGVLLIIKNYTGDILNFETATELLHDSGVKVTT ------------3333-----------------------------------1111----- VVIDDDVAVKDSLYTAGRRGVANTVLIEKLVGAAAERGDSLDACAELGRKLNNQGHSIGI -----------1111--------------------------------------------- 
ALGACLADNEMEFGVGIHGEPGIDRRPFSSLDQTVDEMFDTLLVNGSYHRTLRFWDYQQG ---------------1111-------------------------------------1111 SWQEEQQTKQPLQSGDRVIALVNNLGATPLSELYGVYNRLTTRCQQAGLTIERNLIGAYC ------------2222------------3333---------------------------- TSLDMTGFSITLLKVDDETLALWDAPVHTPALNWGK ----------------------------1111---- >HYPOTHETICAL PROTEIN YHBO; SWP:P45470; PDB:1OI4A; SYYHHHHHHLESTSLYKKAGLSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAG ----------1111--3333---------2222------------1111--------222 KTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPV 2---1111--------3333-3333--------------1111----------1111--- FAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLP --!!!!---------2222----3333------------------%%%%-----3333-- AFNREALRLLGA -------1111- >PCZA361.16; SWP:O52806; PDB:1OI6A; MQARKLAVDGAIEFTPRVFADDRGLLILPYQEEAFVEAHGGPLFRVAQTIHSMSKRGVVR -------2222---------3333------------------------------2222-- GIHYTVTPPGTAKYVYCARGKAMDIVIDIRVGSPTFGQWDSVLMDQQDPRAVYLPVGVGH -----------------------------2222-2222----------------2222-- AFVALEDDTVMSYMLSRSYVTQDELALSALDPALGLPIDIGVEPIVSDRDRVAITLAEAQ -------------------3333----1111---------------3333---------- RQGLLPDYTTSQEIERRLTAVP ---------------------- >SUCCINYL-COA SYNTHETASE A; SWP:P09143; PDB:1OI7A; MILVNRETRVLVQGITGREGQFHTKQMLTYGTKIVAGVTPGKGGMEVLGVPVYDTVKEAV ----3333-----1111---------------------2222----iiii--------33 AHHEVDASIIFVPAPAAADAALEAAHAGIPLIVLITEGIPTLDMVRAVEEIKALGSRLIG 33----------3333--------1111-------------------------------- GNCPGIISAEETKIGIMPGHVFKRGRVGIISRSGTLTYEAAAALSQAGLGTTTTVGIGGD -------2222------3333-----------------------1111------------ PVIGTTFKDLLPLFNEDPETEAVVLIGEIGGSDEEEAAAWVKDHMKKPVVGFIGGRVGTP -----3333-------3333---------------------------------------- ESKLRAFAEAGIPVADTIDEIVELVKKALG -------1111-----3333---------- >PUTATIVE ALKYLSULFATASE A; SWP:Q9WWU5; PDB:1OIHA; LELDVHPVAGRIGAEIRGVKLSPDLDAATVEAIQAALVRHKVIFFRGQTHLDDQSQEGFA --------1111---------1111----------------------3333--------- 
KLLGEPVAPVVDGTRYLLQLDRANSWHTDVTFVEAYPKASILRSVVAPASGGDTVWANTA ----------2222---------------1111--------------------------- AAYQELPEPLRELADKLWAVHSNEVYETEHPVVRVHPISGERALQLGHFVKRIKGYSLAD --1111----------------------------------------1111--2222---- SQHLFAVLQGHVTRLENTVRWRWEAGDVAIWDNRATQHYAVDDYGTQPRIVRRVTLAGEV -------------3333------2222----3333--------!!!!------------- PVGVDGQLSRTTRK --1111-------- >DNA TOPOISOMERASE I; SWP:P04786; PDB:1OIS; DTIKWVTLKHNGVIFPPPYQPLPSHIKLYYDGKPVDLPPQAEEVAGFFAALLESDHAKNP ----------------------1111---iiii---------------1111-3333--- VFQKNFFNDFLQVLKESGGPLNGIEIKEFSRCDFTKMFDYFQLQKEQKKQLTEKKQIRLE --------------1111---------3333----------------1111--------- REKFEEDYKFCELDGRREQVGNFKVEPPDLFRGRGAHPKTGKLKRRVNPEDIVLNLSKDA ------------iiii--------------------1111-------3333-----3333 PVPPAPEGHKWGEIRHDNTVQWLAMWRENIFNSFKYVRLAA ----------------1111--------------------- >RAS-RELATED PROTEIN RAB-1; SWP:P24410; PDB:1OIVA; DEYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLESKSTIGVEFATRSIQVDGKTIKAQIW -------------2222--------------------------------iiii------- DTAGQERYRAITSAYYRGAVGALLVYDIAKHLTYENVERWLKELRDHADSNIVIMLVGNK ---------------2222-------1111------------------1111-------3 SDLRHLRAVPTDEARAFAEKNGLSFIETSALDSTNVEAAFQTILTEIY 333---------------1111----------2222------------ >4-DIPHOSPHOCYTIDYL-2-C-ME; SWP:Q8FI04; PDB:1OJ4A; RTQWPSPAKLNLFLYITGQRADGYHTLQTLFQFLDYGDTISIELRDDGDIRLLTPVEGVE -------------------1111--------------------------------22223 HEDNLIVRAARLLKTAADSGRLPTGSGANISIDKRLPGGGLGGGSSNAATVLVALNHLWQ 333-------------1111--2222------------------------------1111 CGLSDELAEGLTLGADVPVFVRGHAAFAEGVGEILTPVDPPEKWYLVAHPGVSIPTPVIF ---------33333333-----------!!!!----------------------3333-- KDPELPRNTPKRSIETLLKCEFSNDCEVIARKRFREVDAVLSWLLEYAPSRLTGTGACVF -1111------------------1111----------------3333-----!!!!---- AEFDTESEARQVLEQAPEWLNGFVAKGVNLSPLHRAL ----------------1111-----------3333-- >STEROID RECEPTOR COACTIVA; SWP:O61202; PDB:1OJ5A; 
VESFMTKQDTTGKIISIDTSSLRAAGRTGWEDLVRKCIYAFFQPQGREPSYARQLFQEVM ------------------3333-----------------1111-!!!!------------ TRGTASSPSYRFILNDGTMLSAHTRCKLCYPMQPFIMGIHIIDRE -------------1111---------------------------- >NEUROGLOBIN; SWP:Q9NPG2; PDB:1OJ6A; RPEPELIRQSWRAVSRSPLEHGTVLFARLFALEPDLLPLFQYNGRQFSSPEDSLSSPEFL -------------3333---------------33331111--------3333-------- DHIRKVMLVIDAAVTNVEDLSSLEEYLASLGRKHRAVGVKLSSFSTVGESLLYMLEKSLG -------------1111-3333------------1111-3333---------------!! PAFTPATRAAWSQLYGAVVQAMSRGWD !!--------------------1111- >HYPOTHETICAL OXIDOREDUCTA; SWP:NA; PDB:1OJ7A; GLNNFNLHTPTRILFGKGAIAGLREQIPHDARVLITYGGGSVKKTGVLDQVLDALKGMDV ---------------2222---3333-1111--------3333---------1111---- LEFGGIEPNPAYETLMNAVKLVREQKVTFLLAVGGGSVLDGTKFIAAAANYPENIDPWHI ----------3333--------------------------------3333-1111----- LQTGGKEIKSAIPMGCVLTLPATGSESNAGAVISRKTTGDKQAFHSAHVQPVFAVLDPVY 11111111---------------3333--------1111------3333-------3333 TYTLPPRQVANGVVDAFVHTVEQYVTKVAKIHDRFAEGILLTLIEDGPKALKEPENYDVR 1111------------------------------------------------1111---- ANVMWAATQALNGLIGAGVPQDWATHMLGHELTAMHGLDHAQTLAIVLPALWNEKRDTKR -------3333-1111----------------------------------------1111 AKLLQYAERVWNITEGSDDERIDAAIAATRNFFEQLGVPTHLSDYGLDGSSIPALLKKLE ---------------------------------1111---3333----1111-------1 EHGMTQLGENHDITLDVSRRIYEAAR 111----1111-----------1111 >RC-RNASE6 RIBONUCLEASE; SWP:Q9DFY5; PDB:1OJ8A; DWDTFQKKHLTDTKKVKCDVEMKKALFDCKKTNTFIFARPPRVQALCKNIKNNTNVLSRD ----------------3333---3333-----------------1111--2222------ VFYLPQCNRKKLPCHYRLDGSTNTICLTCMKELPIHFAGVGKCP -----------------------------%%%%----------- >SENSOR PROTEIN DCUS; SWP:P39272; PDB:1OJGA; SDMTRDGLANKALAVARTLADSPEIRQGLQKKPQESGIQAIAEAVRKRNDLLFIVVTDMQ -3333------------3333---1111-------------------------------- SLRYSHPEAQRIGQPFKGDDILKALNGEENVAINRGFLAQALRVFTPIYDENHKQIGVVA --------3333------3333-1111--------2222--------------------- IGLELSRVTQQINDSR ---------------- 
-------------------------------------------------- >ENDOGLUCANASE I; SWP:P56680; PDB:1OJJA; KPGETKEVHPQLTTFRCTKRGGCKPATNFIVLDSLSHPIHRAEGLGPGGCGDWGNPPPKD --------------------------------1111-----2222------2222--333 VCPDVESCAKNCIMEGIPDYSQYGVTTNGTSLRLQHILPDGRVPSPRVYLLDKTKRRYEM 3--3333--------------------!!!!------1111----------1111----- LHLTGFEFTFDVDATKLPCGMNSALYLSEMHPTGAKSKYNPGGAYYGTGYCDAQCFVTPF --2222-----------2222---------1111--1111--3333-------------- INGLGNIEGKGSCCNSMDIWEANSRASHVAPHTCNKKGLYLCEGEECAFEGVCDKNGCGW %%%%-1111-------------1111----------------!!!!-1111--3333--- NNYRVNVTDYYGRGEEFKVNTLKPFTVVTQFLANRRGKLEKIHRFYVQDGKVIESFYTNK 3333---------1111----------------1111----------iiii--------2 EGVPYTNMIDDEFCEATGSRKYMELGATQGMGEALTRGMVLAMSIWWDQGGNMEWLDHGE 222-----------1111----------------------------------3333-!!! AGPCAKGEGAPSNIVQVEPFPEVTYTNLRWGEIGSTYQ !---2222-3333------------------2222--- >TRANSCRIPTIONAL REGULATOR; SWP:P25852; PDB:1OJLA; HMIGSSPAMQHLLNEIAMVAPSDATVLIHGDSGTGKELVARALHACSARSDRPLVTLNCA ------------------------------------------------------------ ALNESLLESELFGHEKGAFTKRREGRFVEADGGTLFLDEIGDISPLMQVRLLRAIQEREV --------------------------3333--------------3333------------ QRVGSNQTISVDVRLIAATHRDLAEEVSAGRFRQDLYYRLNVVAIEMPSLRQRREDIPLL -2222----------------3333-3333------------------3333-3333--- ADHFLRRFAERNRKVVKGFTPQAMDLLIHYDWPGNIRELENAIERAVVLLTGEYISEREL --------------------------1111------------------------------ PLAIAYSGEIQPLVDVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR -1111------3333------------%%%%----------3333------- >ADP-RIBOSYLTRANSFERASE; SWP:Q9ADS9; PDB:1OJQA; AETKNFTDLVEATKWGNSLIKSAKYSSKDKMAIYNYTKNSSPINTPLRSANGDVNKLSEN --------------------3333---------------3333----1111-1111---- IQEQVRQLDSTISKSVTPDSVYVYRLLNLDYLSSITGFTREDLHMLQQTNNGQYNEALVS -----------------------------1111-2222-----------iiii------- KLNNLMNSRIYRENGYSSTQLVSGAALAGRPIELKLELPKGTKAAYIDSKELTAYPGQQE -----2222------------2222-------------2222------1111-------- VLLPRGTEYAVGSVKLSDNKRKIIITAVVFKK 
-------------------------------- >RHAMNULOSE-1-PHOSPHATE AL; SWP:P32169; PDB:1OJRA; MQNITQSWFVQGMIKATTDAWLKGWDERNGGNLTLRLDDADIAPYHDNFHQQPRYIPLSQ --3333--------------3333--!!!!-------333333331111----------- PMPLLANTPFIVTGSGKFFRNVQLDPAANLGIVKVDSDGAGYHILWGLTNEAVPTSELPA -3333--------2222333333333333-------------------%%%%--1111-- HFLSHCERIKATNGKDRVIMHCHATNLIALTYVLENDTAVFTRQLWEGSTECLVVFPDGV -----------iiii---------------------------------3333---1111- GILPWMVPGTDAIGQATAQEMQKHSLVLWPFHGVFGSGPTLDETFGLIDTAEKSAQVLVK -------------------3333------------------------------------- VYSMGGMKQTISREELIALGKRFGVTPLASALAL -1111-----------------------3333-- >SURFACE PROTEIN; SWP:Q51225; PDB:1OJT; GSADAEYDVVVLGGGPGGYSAAFAAADEGLKVAIVERYKTLGGVCLNVGCIPSKALLHNA ------------------------------------------------------------ AVIDEVRHLAANGIKYPEPELDIDMLRAYKDGVVSRLTGGLAGMAKSRKVDVIQGDGQFL --------3333---------3333----------------------------------- DPHHLEVSLTAGDAYEQAAPTGEKKIVAFKNCIIAAGSRVTKLPFIPEDPRIIDSSGALA -------------2222-------------------------1111--1111-3333333 LKEVPGKLLIIGGGIIGLEMGTVYSTLGSRLDVVEMMDGLMQGADRDLVKVWQKQNEYRF 3-----------------------1111------------22223333-------3333- DNIMVNTKTVAVEPKEDGVYVTFEGANAPKEPQRYDAVLVAAGRAPNGKLISAEKAGVAV ----------------------------------------------1111-3333----- TDRGFIEVDKQMRTNVPHIYAIGDIVGQPMLAHKAVHEGHVAAENCAGHKAYFDARVIPG 3333----1111---1111---3333------------------1111------------ VAYTSPEVAWVGETELSAKASARKITKANFPWAASGRAIANGCDKPFTKLIFDAETGRII ------------------------------3333-------------------------- GGGIVGPNGGDMIGEVCLAIEMGCDAADIGKTIHPHPTLGESIGMAAEVALGTCTDLPPQ -----2222---------------33331111------3333------------------ KK -- >MALATE DEHYDROGENASE; SWP:O08349; PDB:1OJUA; MKLGFVGAGRVGSTSAFTCLLNLDVDEIALVDIAEDLAVGEAMDLAHAAAGIDKYPKIVG ------------------------------------------------------------ GADYSLLKGSEIIVVTAGLARKPGMTRLDLAHKNAGIIKDIAKKIVENAPESKILVVTNP --3333-------------------------------------3333-1111-------3 
MDVMTYIMWKESGKPRNEVFGMGNQLDSQRLKERLYNAGARNIRRAWIIGEHGDSMFVAK 333--------------------------------1111-------------1111--33 SLADFDGEVDWEAVENDVRFVAAEVIKRKGATIFGPAVAIYRMVKAVVEDTGEIIPTSMI 33------------3333----------------------------1111---------- LQGEYGIENVAVGVPAKLGKNGAEVADIKLSDEEIEKLRNSAKILRERLEELGY --2222------------1111--------3333---------------1111- >ALPHA-AMYLASE INHIBITOR H; SWP:P01092; PDB:1OK0A; DTTVSEPAPSCVTLYQSWRYSQADNGCAETVTVKVVYEDDTEGLCYAVAPGQITTVGDGY --------1111----1111----------------1111--------2222-------- IGSHGHARYLARCL -1111--------- >DNA POLYMERASE III; SWP:P00583; PDB:1OK7A; MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLEMEMVARVALV -----3333-----------------3333-------%%%%------------------- QPHEPGATTVPARKFFDICRGLPEGAEIAVQLEGERMLVRSGRSRFSLSTLPAADFPNLD ----------------------2222------!!!!---------------3333----- DWQSEVEFTLPQATMKRLIEATQFSMAHQDVRYYLNGMLFETEGEELRTVATDGHRLAVC ----------3333-------3333------3333------------------------- SMPIGQSLPSHSVIVPRKGVIELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGR -----------------------1111--------------------------------- FPDYRRVLPKNPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQE --3333-----------------------3333-1111-----------------1111- EAEEILDVTYSGAEMEIGFNVSYVLDVLNALKCENVRMMLTDSVSSVQIEDAASQSAAYV -----------------------------------------1111-----1111------ VMP --- >MAJOR ENVELOPE PROTEIN E; SWP:P12823; PDB:1OK8A; MRCIGISNRDFVEGVSGGSWVDIVLEHGSCVTTMAKNKPTLDFELIKTEAKQPATLRKYC ------1111---------------2222-----2222---------------------- IEAKLTNTTTESRCPTQGEPTLNEEQDKRFVCKHSMVDRGWGNGCGLFGKGGIVTCAMFT ---------------------3333-1111---------3333----------------- CKKNMEGKIVQPENLEYTVVITPHGKEVKITPQSSITEAELTGYGTVTMECSPRTGLDFN ----------3333----------------3333-------------------------- EMVLLQMKDKAWLVHRQWFLDLPLPWLPGADTQGSNWIQKETLVTFKNPHAKKQDVVVLG ------!!!!--------1111-----1111-------3333------------------ SQEGAMHTALTGATEIQMSSGNLLFTGHLKCRLRMDKLQLKGMSYSMCTGKFKVVKEIAE 
-------1111------------------------------------------------- TQHGTIVIRVQYEGDGSPCKIPFEIMDLEKRHVLGRLITVNPIVTEKDSPVNIEAEPPFG ------------------------------------------------------------ DSYIIIGVEPGQLKLNWFKK -------------------- >URACIL-DNA GLYCOSYLASE; SWP:Q9I983; PDB:1OKBA; MEFFGETWRRELAAEFEKPYFKQLMSFVADERSRHTVYPPADQVYSWTEMCDIQDVKVVI -----------3333------------------------1111-3333---3333----- LGQDPYHGPNQAHGLCFSVQKPVPPPPSLVNIYKELCTDIDGFKHPGHGDLSGWAKQGVL -------2222---2222---------------------2222-------33331111-- LLNAVLTVRAHQANSHKDRGWETFTDAVIKWLSVNREGVVFLLWGSYAHKKGATIDRKRH --------2222-1111----------------------------------33333333- HVLQAVHPSPLSAHRGFLGCKHFSKANGLLKLSGTEPINWRAL --------3333----2222------------------3333- >POSSIBLE 3-MERCAPTOPYRUVA; SWP:Q9NE49; PDB:1OKGA; APKHPGKVFLDPSEVADHLAEYRIVDCRYSLKIKDHGSIQYAKEHVKSAIRADVDTNLSK ---2222---333311111111----------2222---------2222---3333---- LVPTSTARHPLPPAEFIDWCANGAGELPVLCYDDECGAGGCRLWWLNSLGADAYVINGGF -1111-----------------------------iiii--------1111-----2222- QACKAAGLEESGEPSSLPRPATHWPFKTAFQHHYLVDEIPPQAIITDARSADRFASTVRP --1111----------------------------1111-----------3333------- YAADKPGHIEGARNLPYTSHLVTRGDGKVLRSEEEIRHNITVVQADLSSFVFSGSGVTAC !!!!----2222---3333---------------------------1111------3333 INIALVHHLGLGHPYLYCGSWSEYSGLFRPPIRSIIDDYGCQQTPSLGDNPKANLDTTLK ----------------3333--3333--3333--------------1111---3333--- VDGAPERPDAEVQSAATHLHAGEAATVYFKSGRVVTIEVPVVPNLEA iiii---------------2222-----3333--------------- >VISCOTOXIN A3; SWP:P01538; PDB:1OKHA; KSCCPNTTGRNIYNACRLTGAPRPTCAKLSGCKIISGSTCPSDYPK ----------------------------------------1111-- >BETA CRYSTALLIN B1; SWP:P53674; PDB:1OKIA; PPGNYRLVVFELENFQGRRAEFSGESNLADRGFDRVRSIIVSAGPWVAFEQSNFRGEMFI ------------%%%%----------3333---------------------%%%%----- LEKGEYPRWNTWSSSYRSDRLMSFRPIKMDAQEHKISLFEGANFKGNTIEIQGDDAPSLW -------1111------------------------------%%%%------------333 VYGFSDRVGSVKVSSGTWVGYQYPGYRGYQYLLEPGDFRHWNEWGAFQPQMQSLRRLRDK 
3--------------------------------------3333---------------33 QW 33 >HYPOTHETICAL PROTEASE YEA; SWP:P76256; PDB:1OKJA; LRILAIDTATEACSVALWNDGTVNAHFELCPREHTQRILPMVQDILTTSGTSLTDINALA ----------------------------------1111-------------3333----- YGRGPGSFTGVRIGIGIAQGLALGAELPMIGVSTLMTMAQGAWRKNGATRVLAAIDARMG ------------------------------------------------------------ EVYWAEYQRDENGIWHGEETEAVLKPEIVHERMQQLSGEWVTVGTGWQAWPDLGKESGLV ---------1111---3333------------------------3333-11112222--- LRDGEVLLPAAEDMLPIACQMFAEGKTVAVEHAEPVYLR ---------3333---------------3333------- >Cell division protein fts; SWP:P83749; PDB:1OKKD; AIPWGGNLEEVLEELEMALLAADVGLSATEEILQEVRASGRKDLKEAVKEKLVGMLEPPV ------3333---------1111-----------------------------3333---- EPKGRVVLVVGVNGVGKTTTIAKLGRYYQNLGKKVMFCAGDTFRAAGGTQLSEWGKRLSI -----------2222-------------1111-----------2222------------- PVIQGPEGTDPAALAYDAVQAMKARGYDLLFVDTAGRLHTKHNLMEELKKVKRAIAKADP -----2222----------------------------1111-----------------33 EEPKEVWLVLDAVTGQNGLEQAKKFHEAVGLTGVIVTKLDGTAKGGVLIPIVRTLKVPIK 33--------1111-----------------------3333------------------- FVGVGEGPDDLQPFDPEAFVEALLE ------1111-----------1111 >METHICILLIN RESISTANCE RE; SWP:P26598; PDB:1OKRA; KTYEISSAEWEVMNIIWMKKYASANNIIEEIQMQKDWSPKTIRTLITRLYKKGFIDRKKD -----3333--------------------------------------------------! 
NKIFQYYSLVEESDIKYKTSKNFINKVYKGGFNSLVLNFVEKEDLSQDEIEELRNILNKK !!!------------------------%%%%----------------------------- >RNA POLYMERASE ALPHA SUBU; SWP:P03422; PDB:1OKSA; ASRSVIRSIIKSSRLEEDRKRYLTLLDDIKGANDLAKFHQLVKIIKHHHHH ------------------------3333-----------------3333-- >GLUTATHIONE S-TRANSFERASE; SWP:Q8MU52; PDB:1OKTA; MGDNIVLYYFDARGKAELIRLIFAYLGIEYTDKRFGVNGDAFVEFKNFKKEKDTPFEQVP ------------------------------------------------------------ ILQIGDLILAQSQAIVRYLSKKYNICGESELNEFYADMIFCGVQDIHYKFNNTNLFKQNE ---!!!!----------------------------------------------3333--- TTFLNEDLPKWSGYFEKLLKKNHTNNNNDKYYFVGNNLTYADLAVFNLYDDIETKYPSSL ---------------------------------------------------------111 KNFPLLKAHNEFISNLPNIKNYITNRKESVY 1------------------------------ >IMMUNOGLOBULIN G; SWP:NA; PDB:1OL0A; QVQLVESGGGLVQPGGSLRLSCAASGFTFSSYASWFRQAPGKEREIVSAVSGSGGSTYYA ------------2222-----------3333-------2222--------3333-----3 DSVRGRFTISRDNSKNTLYLQNSLRAEDTAVYYCAREPRIPRPPSFDYWGQGTLVTVSS 333---------------------3333--------------3333------------- ------------------------------ ------------------------------------------ >NK RECEPTOR; SWP:O76016; PDB:1OLLA; TLPKPFIWAEPHFMVPKEKQVTICCQGNYGAVEYQLHFEGSLFAVDRPKPPERINKVKFY ---------------2222--------2222------iiii------------------- IPDMNSRMAGQYSCIYRVGELWSEPSNLLDLVVTEMYDTPTLSVHPGPEVISGEKVTFYC ----1111---------!!!!-----------------------------2222------ RLDTATSMFLLLKEGRSSHVQRGYGKVQAEFPLGPVTTAHRGTYRCFGSYNNHAWSFPSE ------------------------------------3333----------1111------ PVKLLVTG -------- >SEC14-LIKE PROTEIN 2; SWP:O76054; PDB:1OLMA; MSGRVGDLSPRQKEALAKFRENVQDVLPALPNPDDYFLLRWLRARSFDLQKSEAMLRKHV ---2222---------------33331111--------------%%%%------------ EFRKQKDIDNIISWQPPEVIQQYLSGGMCGYDLDGCPVWYDIIGPLDAKGLLFSASKQDL ------33331111-----------------1111-------1111----1111------ LRTKMRECELLLQECAHQTTKLGRKVETITIIYDCEGLGLKHLWKPAVEAYGEFLCMFEE ----------------------------------22223333------------------ 
NYPETLKRLFVVKAPKLFPVAYNLIKPFLSEDTRKKIMVLGANWKEVLLKHISPDQVPVE --------------1111------3333-----1111---1111--------1111-333 YGGTMTDPDGNPKCKSKINYGGDIPRKYYVRDQVKQQYEHSVQISRGSSHQVEYEILFPG 3-----1111---3333-------3333----------------2222------------ CVLRWQFMSDGADVGFGIFLKTERQRAGEMTEVLPNQRYNSHLVPEDGTLTCSDPGIYVL -------------------------3333------------------------------- RFDNTYSFIHAKKVNFTVEVLLPDKASEEKMKQLG ---1111---------------------------- >ALPHA-TOXIN; SWP:Q8GCY3; PDB:1OLPA; WDGKEDGTGTHSVIVTQAIEMLKHDLSKDEPEAIRNDLSILEKNLHKFQLGSTFPDYDPN ---1111-------------------1111----------------------3333-111 AYSLYQDHFWDPDTDHNFTQDNKWYLSYAVPDNAESQTRKFATLAKNEWDKGNYEKAAWY 111111111-------3333-33331111---3333------------------------ LGQGMHYFGDLNTPYHAANVTAVDSPGHVKFETYAEERKDTYRLDTTGYNTDDAFYKDTL -------------3333---33333333---------3333------------3333111 KNDNFNEWSKGYCKYWAKKAKNLYYSHATMSNSWDDWEYAASHGVGNAQKGVAGYLYRFL 1---------------------------1111---------------------------- NDVSNKDAVDKDYDLNEIVVMIKTADVQDAGTDNYIYFGIETKDGVKEEWALDNPGNDFT ----1111------------------2222-----------1111--------------2 RNQEGTYTLKLKNKNTKYSDIKNMWIRDEKLTVATDGWKPSYVKVIAGDKVRLEKNINEW 222-------------3333--------------------------!!!!---------- ISGGTTYTLK -2222----- >ENDO-BETA-1,4-GLUCANASE; SWP:Q8NJY3; PDB:1OLRA; IRSLCELYGYWSGNGYELLNNLWGKDTATSGWQCTYLDGTNNGGIQWSTAWEWQGAPDNV -----2222---iiii-----1111---------------iiii-----------1111- KSYPYVGKQIQRGRKISDINSMRTSVSWTYDRTDIRANVAYDVFTARDPDHPNWGGDYEL --------------3333-----------------------------11111111----- MIWLARYGGIYPIGTFHSQVNLAGRTWDLWTGYNGNMRVYSFLPPSGDIRDFSCDIKDFF ---------------------iiii--------!!!!-----------------3333-- NYLERNHGYPAREQNLIVYQVGTECFTGGPARFTCRDFRADLW ---------3333------------------------------ >OXYGEN-INDEPENDENT COPROP; SWP:P32131; PDB:1OLTA; QQIDWDLALIKYYTSYPTALEFSEDFGEQAFLQAVARYPERPLSLYVHIPFCHKLCYFCG -----3333--------3333------------11113333--------------1111- CNKIVTRQQHKADQYLDALEQEIVHRAPLFAGRHVSQLHWGGGTPTYLNKAQISRLMKLL 
-------3333----------------1111------------1111------------- RENFQFNADAEISIEVDPREIELDVLDHLRAEGFNRLSMGVQDFNKEVQRLVNREQDEEF ------1111-----------3333----1111----------------1111------- IFALLNHAREIGFTSTNIDLIYGLPKQTPESFAFTLKRVAELNPDRLSVFNYAHLPTIFA --------1111-----------2222---------------------------333333 AQRKIKDADLPSPQQKLDILQETIAFLTQSGYQFIGMDHFARPDDELAVAQREGVLHRNF 33---3333------------------1111----!!!!--1111-------------33 QGYTTQGDTDLLGMGVSAISMIGDCYAQNQKELKQYYQQVDEQGNALWRGIALTRDDCIR 33------------2222---%%%%----------------------------------- RDVIKSLICNFRLDYSPIEQQWDLLFADYFAEDLKLLAPLAKDGLVDVDEKGIQVTAKGR -------------------1111-3333------------1111----1111---3333- LLIRNICMC -------11 >SEMAPHORIN 4D; SWP:Q92854; PDB:1OLZA; FAPIPRITWEHREVHLVQFHEPDIYNYSALLLSEDKDTLYIGAREAVFAVNALNISEKQH --------------------2222--------1111------2222----1111------ EVYWKVSEDKKAKCAEKGKSKQTECLNYIRVLQPLSATSLYVCGTNAFQPACDHLNLTSF ------3333---3333------------------1111------%%%%-------1111 KFLGKNEDGKGRCPFDPAHSYTSVMVDGELYSGTSYNFLGSEPIISRNSSHSPLRTEYAI --------2222---1111------iiii-------3333------------------33 PWLNEPSFVFADVIRKSPGEDDRVYFFFTEVSVEYEFVFRVLIPRIARVCKGDQGGLRTL 33-----------------------------------------------1111--1111- QKKWTSFLKARLICSRPDSGLVFNVLRDVFVLRSPGLKVPVFYALFTPQLNNVGLSAVCA ---------------3333--------------1111----------------------- YNLSTAEEVFSHGKYMQSTTVEQSHTKWVRYNGPVPKPRPGACIDSEARAANYTSSLNLP -3333----------------1111-------------2222--3333------3333-- DKTLQFVKDHPLMDDSVTPIDNRPRLIKKDVNYTQIVVDRTQALDGTVYDVMFVSTDRGA ------------------2222--------------------1111-------------- LHKAISLEHAVHIIEETQLFQDFEPVQTLLLSSKKGNRFVYAGSNSGVVQAPLAFCGKHG -------------------1111--------------------1111-------3333-- TCEDCVLARDPYCAWSPPTATCVALHQTESPSRGLIQEMSGDASVCPDKSKGSYRQHFFK -----33331111---1111-----------1111------3333--------------- HGGTAELKCSQKSNLARVFWKFQNGVLKAESPKYGLMGRKNLLIFNLSEGDSGVYQCLSE -----------------------------------2222--------3333--------- ERVKNKTVFQVVAKHVLEVKVV ---------------------- 
>XYLANASE INHIBITOR PROTEI; SWP:Q8L5C6; PDB:1OM0A; AGGKTGQVTVFWGRNKAEGSLREACDSGMYTMVTMSFLDVFGANGKYHLDLSGHDLSSVG --------------1111------3333-------------1111-----iiii3333-- ADIKHCQSKGVPVSLSIGGYGTGYSLPSNRSALDLFDHLWNSYFGGSKPSVPRPFGDAWL ------1111-------------------------------------1111-1111---- DGVDLFLEHGTPADRYDVLALELAKHNIRGGPGKPLHLTATVRCGYPPAAHVGRALATGI ----------1111---------1111--------------------------------- FERVHVRTYESDKWCNQNLGWEGSWDKWTAAYPATRFYVGLTADDKSHQWVHPKNVYYGV -----------22223333------------1111-----------1111-3333----- APVAQKKDNYGGIMLWDRYFDKQTNYSSLIKYYA ------1111-------------------1111- >CASEIN KINASE II, ALPHA C; SWP:P28523; PDB:1OM1A; SKARVYADVNVLRPKEYWDYEALTVQWGEQDDYEVVRKVGRGKYSEVFEGINVNNNEKCI -----1111----3333-3333------3333---------1111--------------- IKILKPVKKKKIKREIKILQNLCGGPNIVKLLDIVRDQHSKTPSLIFEYVNNTDFKVLYP ---------------------2222----------------------------3333333 TLTDYDIRYYIYELLKALDYCHSQGIMHRDVKPHNVMIDHELRKLRLIDWGLAEFYHPGK 3------------------------------3333----1111------1111---2222 EYNVRVASRYFKGPELLVDLQDYDYSLDMWSLGCMFAGMIFRKEPFFYGHDNHDQLVKIA --------11113333-------3333--------------------------------- KVLGTDGLNVYLNKYRIELDPQLEALVGRHSRKPWLKFMNADNQHLVSPEAIDFLDKLLR ---------------------------------3333--33331111-------1111-- YDHQERLTALEAMTHPYFQQVRAAE -3333---------3333------- >NITRIC-OXIDE SYNTHASE, BR; SWP:P29476; PDB:1OM4A; RFLKVKNWETDVVLTDTLHLKSTLETGCTEHICMGSIMLPTKDQLFPLAKEFLDQYYSSI ------------------1111------1111-1111---3333-------------111 KRFGSKAHMDRLEEVNKEIESTSTYQLKDTELIYGAKHAWRNASRCVGRIQWSKLQVFDA 12222-------------------------------------1111-3333--------1 RDCTTAHGMFNYICNHVKYATNKGNLRSAITIFPQRTDGKHDFRVWNSQLIRYAGYKQPD 111-----------------%%%%---------------------------------333 GSTLGDPANVQFTEICIQQGWKAPRGRFDVLPLLLQANGNDPELFQIPPELVLEVPIRHP 3----3333-------1111----------------iiii-------3333--------- KFDWFKDLGLKWYGLPAVSNMLLEIGGLEFSACPFSGWYMGTEIGVRDYCDNSRYNILEE -1111-------------------iiii-----------3333-------1111------ 
VAKKMDLDMRKTSSLWKDQALVEINIAVLYSFQSDKVTIVDHHSATESFIKHMENEYRCR --1111----3333------------------1111----3333---------------- GGCPADWVWIVPPMSGSITPVFHQEMLNYRLTPSFEYQPDPWNTHVW ------1111----11113333-----------------1111---- >ONCOMODULIN; SWP:P02631; PDB:1OMD; ITDILSAEDIAAALQECQDPDTFEPQKFFQTSGLSKMSASQVKDIFRFIDNDQSGYLDGD 3333---------3333-2222-------33331111------------1111-----33 ELKYFLQKFQSDARELTESETKSLMDAADNDGDGKIGADEFQEMVHS 33-3333--1111---------------------------------- >TRWC PROTEIN; SWP:Q47673; PDB:1OMHA; LSHVLTRQDIGRAASYYGDASEWQGKGAEELGLSGEVDSKRFRELLAGNIGEGHRIRSAT -----3333-------------------1111------------1111--------3333 RQDSKERIGLDLTFSAPKSVSLQALVAGDAEIIKAHDRAVARTLEQAEARAQARQKIQGK 1111----------------------------------------------------iiii TRIETTGNLVIGKFRHETSRERDPQLHTHAVILNTKRSDGQWRALKNDEIVKATRYLGAV ------------------1111--------------1111-------3333--------- YNAELAHELQKLGYQLRYGKDGNFDLAHIDRQQIEGFSKRTEQIAEWYAARGLDPNSVSL ---------1111------------1111------------------------3333--- EQKQAAKVLSRAKKTSVDREALRAEWQATAKELGIDFS -------1111--------------------------- >ALANINE DEHYDROGENASE; SWP:O28608; PDB:1OMOA; METLILTQEEVESLISMDEAMNAVEEAFRLYALGKAQMPPKVYLEFEKGDLRAMPAHLMG ----------1111----------------1111-----------1111--------iii YAGLKWVNSHPGNPDKGLPTVMALMILNSPETGFPLAVMDATYTTSLRTGAAGGIAAKYL i--------1111----------------------------3333--------------- ARKNSSVFGFIGCGTQAYFQLEALRRVFDIGEVKAYDVREKAAKKFVSYCEDRGISASVQ -1111--------3333------------------------------------------- PAEEASRCDVLVTTTPSRKPVVKAEWVEEGTHINAIGADGPGKQELDVEILKKAKIVVDD 3333------------------3333-------------2222-------1111------ LEQAKHGGEINVAVSKGVIGVEDVHATIGEVIAGLKDGRESDEEITIFDSTGLAIQDVAV -------1111--------3333-----------------------------3333---- AKVVYENALSKNVGSKIKFF -------------------- >RECOVERIN; SWP:P21457; PDB:1OMRA; GNSKSGALSKEILEELQLNTKFTEEELSSWYQSFLKECPSGRITRQEFQTIYSKFFPEAD ------1111-----!!!!------------------3333--------------1111- PKAYAQHVFRSFDANSDGTLDFKEYVIALHMTSAGKTNQKLEWAFSLYDVDGNGTISKNE 
-----------------------------------3333---------1111-------- VLEIVTAIFKMISPEDTKHLPEDENTPEKRAEKIWGFFGKKDDDKLTEKEFIEGTLANKE --------1111333311111111-----------1111-1111---------------- ILRLIQFEPQKVKEKLKEKKL --------------------- >G-PROTEIN COUPLED RECEPTO; SWP:P21146; PDB:1OMWA; SKKILLPEPSIRSVMQKYLEDRGEVTFEKIFSQKLGYLLFRDFCLKHLEEAKPLVEFYEE -------------------1111--3333----3333----------33333333----- IKKYEKLETEEERLVCSREIFDTYIMKELLACSHPFSKSAIEHVQGHLVKKQVPPDLFQP --3333---------------------1111-----3333-------1111--1111333 YIEEICQNLRGDVFQKFIESDKFTRFCQWKNVELNIHLTMNDFSVHRIIGRGGFGEVYGC 3--------------------------------------1111--------1111----- RKADTGKMYAMKCLDKKRIKMKQGETLALNERIMLSLVSTGDCPFIVCMSYAFHTPDKLS -----------------3333---3333--------------1111--------1111-- FILDLMNGGDLHYHLSQHGVFSEADMRFYAAEIILGLEHMHNRFVVYRDLKPANILLDEH ---------3333--------3333-------------------------3333---111 GHVRISDLGLACDFSKKKPHASVGTHGYMAPEVLQKGVAYDSSADWFSLGCMLFKLLRGH 1------1111-------------1111-3333-2222---3333--------------- SPFRQHKTKDKHEIDRMTLTMAVELPDSFSPELRSLLEGLLQRDVNRRLGCLGRGAQEVK 1111---------------------33333333----------33332222--!!!!--- ESPFFRSLDWQMVFLQKYPPPLIPPRGIKLLDSDQELYRNFPLTISERWQQEVAETVFDT -3333-------1111-----------------3333-------3333------------ INAETDRLEARKKTKNKQLGHEEDYALGKDCIMHGYMSKMGWQRRYFYLFPNRLEWRGEG --------------3333-----1111----------------------1111------- EAPQSLLTMEEIQSVEETQIKERKCLLLKIRGGKQFVLQCDSDPELVQWKKELRDAYREA -------3333------------------1111--------3333--------------- QQLVQRVPKMKNKP --3333-1111--- >ALPHA-NEUROTOXIN TX12; SWP:Q9GQW3; PDB:1OMYA; VRDAYIAQNYNCVYHCARDAYCNELCTKNGAKSGSCPYLGEHKFACYCKDLPDNVPIRVP --------------------------1111---------------------1111----- GKCH ---- >ALPHA-1,4-N-ACETYLHEXOSAM; SWP:Q9ES89; PDB:1OMZA; ALDSFTLIMQTYNRTDLLLRLLNHYQAVPSLHKVIVVWNNVGEKGPEELWNSLGPHPIPV --------------------------------------------------1111------ IFKPQTANKMRNRLQVFPEVETNAVLMVDDDTLISAQDLVFAFSIWQQFPDQIIGFVPRK 
--------1111----3333----------------------------1111-------- HVSTSSGIYSYGGFELQTPGPGNGDQYSMVLIGASFFNSKYLELFQKQPAAVHALIDETQ ----2222----1111--------------1111----------11113333-------- NCDDIAMNFLVTRHTGKPSGIFVKPINMVNLEAEHFLQRSYCINKLVNIYDGMPLKYSNI ------------------------------------------------------------ MISQFGFPYANHK ---22222222-- >YYCN PROTEIN; SWP:O32293; PDB:1ON0A; TILTPQTEEFRSYLTYTTKHYAEEKVKAGTWLPEDAQLLSKQVFTDLLPRGLETPHHHLW -------------------------------1111------------1111--2222--- SLKLNEKDIVGWLWIHAEPEHPQQEAFIYDFGLYEPYRGKGYAKQALAALDQAARSGIRK ------------------------------------------------------------ LSLHVFAHNQTARKLYEQTGFQETDVVSKKLL -----1111-------1111------------ >TRANSCRIPTIONAL REGULATOR; SWP:P54512; PDB:1ON2A; TTPSMEMYIEQIYMLIEEKGYARVSDIAEALAVHPSSVTKMVQKLDKDEYLIYEKYRGLV ---------------------------------3333--------1111----------- LTSKGKKIGKRLVYRHELLEQFLRIIGVDEEKIYNDVEGIEHHLSWNSIDRIGDLVQYFE -----------------------1111-3333-------1111----------------- EDDARKKDLKSIQKK -----------3333 >Methylmalonyl-CoA carboxy; SWP:Q8GBW6; PDB:1ON3A; KLASTMEGRVEQLAEQRQVIEAGGGERRVEKQHSQGKQTARERLNNLLDPHSFDEVGAFR ---------------------!!!!-------1111------------2222----1111 KHRTTLFGMDKAVVPADGVVTGRGTILGRPVHAASQDFTVMGGSAGETQSTKVVETMEQA -----iiii----2222--------iiii-------3333-------------------- LLTGTPFLFFYDSGGARIQEGIDSLSGYGKMFFANVKLSGVVPQIAIIAGPCAGGASYSP ----------------3333-----------------2222------------3333--1 ALTDFIIMTKKAHMFITGPQVIKSVTGEDVTADELGGAEAHMAISGNIHFVAEDDDAAEL 111-----1111------------------3333-------------------------- IAKKLLSFLPQNNTEEASFVNPNNDVSPNTELRDIVPIDGKKGYDVRDVIAKIVDWGDYL ----3333----------------------3333----3333------3333-2222--- EVKAGYATNLVTAFARVNGRSVGIVANQPSVMSGCLDINASDKAAEFVNFCDSFNIPLVQ --1111----------iiii-------1111iiii----------------1111----- LVDVPGFLPGVQQEYGGIIRHGAKMLYAYSEATVPKITVVLRKAYGGSYLAMCNRDLGAD ---------3333------------------------------------11113333--- AVYAWPSAEIAVMGAEGAANVIFRKEIKAADDPDAMRAEKIEEYQNAFNTPYVAAARGQV 
----1111------------1111--1111-------------------3333------- DDVIDPADTRRKIASALEMYATKRQTRPAKKHGNFPC -------------------1111-------------- >GLUTATHIONE REDUCTASE; SWP:Q94655; PDB:1ONFA; VYDLIVIGGGSGGMAAARRAARHNAKVALVEKSRLGGTCVNVGCVPKKIMFNAASVHDIL --------------------1111------------3333-------------------- ENSRHYGFDTKFSFNLPLLVERRDKYIQRLNNIYRQNLSKDKVDLYEGTASFLEGRNILI -3333---------3333--------------------1111------------------ AVGNKPVFPPVKGIENTISSDEFFNIKESKKIGIVGSGYIAVELINVIKRLGIDSYIFAR ----------2222-----3333--------------3333------------------- GNRILRKFDESVINVLENDMKKNNINIVTFADVVEIKKVSDKNLSIHLSDGRIYEHFDHV ----1111------------1111-----------------------1111--------- IYCVGRSPDTENLKLEKLNVETNNNYIVVDENQRTSVNNIYAVGDCCMVKFYNVQLTPVA -----------------------------1111----------3333------------- INAGRLLADRLFLKKTRKTNYKLIPTVIFSHPPIGTIGLSEEAAIQIYGKENVKIYESKF --------3333-------------------------------1111-3333-------- TNLFFSVYDIEPELKEKTYLKLVCVGKDELIKGLHIIGLNADEIVQGFAVALKMNATKKD -3333-----3333-----------------------2222---3333---1111-3333 FDETIPIHPTAAEEFLTLQ ---------33331111-- >14.5 KDA TRANSLATIONAL IN; SWP:P52758; PDB:1ONIA; SSLIRRVISTAKAPGAIGPYSQAVLVDRTIYISGQIGMDPSSGQLVSGGVAEEAKQALKN ---------1111----------------------------------------------- MGEILKAAGCDFTNVVKTTVLLADINDFNTVNEIYKQYFKSNFPARAAYQVAALPKGSRI -----1111-3333---------1111--------------------------2222--- EIEAVAIQGPLTTASL ---------------- >BETA-GALACTOSIDE SPECIFIC; SWP:P81446; PDB:1ONKA; YERLRLRVTHQTTGEEYFRFITLLRDYVSSGSFSNEIPLLRQSTIPVSDAQRFVLVELTN ---------------------------------iiii----3333--------------1 QGQDSVTAAIDVTNAYVVAYQAGDQSYFLRDAPRGAETHLFTGTTRSSLPFNGSYPDLER 111------------------!!!!---22222222------------------------ YAGHRDQIPLGIDQLIQSVTALRFPGGSTRTQARSILILIQMISEAARFNPILWRYRQYI ---1111----------------------------------------------------- NSGASFLPDVYMLELETSWGQQSTQVQHSTDGVFNNPIRLAIPPGNFVTLTNVRDVIASL -----------------------------iiii------------------3333----- AIMLFVCGE --------- >GLYCINE CLEAVAGE SYSTEM H; 
SWP:P83697; PDB:1ONLA; DIPKDRFYTKTHEWALPEGDTVLVGITDYAQDALGDVVYVELPEVGRVVEKGEAVAVVES --------1111-----!!!!----------------------2222--2222------- VKTASDIYAPVAGEIVEVNLALEKTPELVNQDPYGEGWIFRLKPRDMGDLDELLDAGGYQ --------------------3333-3333-----1111-------3333-----3333-- EVLESEA ------- >T-CELL SURFACE GLYCOPROTE; SWP:P06126; PDB:1ONQA; PLSFHVIWIASFYNHSWKQNLVSGWLSDLQTHTWDSNSSTIVFLWPWSRGNFSNEEWKEL -------------------------!!!!------1111-----1111!!!!-------- ETLFRIRTIRSFEGIRRYAHELQFEYPFEIQVTGGCELHSGKVSGSFLQLAYQGSDFVSF -------------------1111---------------%%%%---------iiii----- QNNSWLPYPVAGNMAKHFCKVLNQNQHENDITHNLLSDTCPRFILGLLDAGKAHLQRQVK %%%%---1111---------1111------------------------1111-1111--- PEAWLSHGPSPGPGHLQLVCHVSGFYPKPVWVMWMRGEQEQQGTQRGDILPSADGTWYLR -----------2222--------------------!!!!-1111-------1111----- ATLEVAAGEAADLSCRVKHSSLEGQDIVLYWHHH -----11112222-----1111------------ >TRANSALDOLASE B; SWP:P30148; PDB:1ONRA; TDKLTSLRQYTTVVADTGDIAAMKLYQPQDATTNPSLILNAAQIPEYRKLIDDAVAWAKQ -------------------------------------------3333----------111 QSNDRAQQIVDATDKLAVNIGLEILKLVPGRISTEVDARLSYDTEASIAKAKRLIKLYND 1----------------------3333---------3333------------------11 AGISNDRILIKLASTWQGIRAAEQLEKEGINCNLTLLFSFAQARACAEAGVFLISPFVGR 11-1111------------------1111-----------------1111---------- ILDWYKANTDKKEYAPAEDPGVVSVSEIYQYYKEHGYETVVMGASFRNIGEILELAGCDR --------------3333--------------1111------------------------ LTIAPALLKELAESEGAIERKLSYTGEVKARPARITESEFLWQHNQDPMAVDKLAEGIRK ------------------------------------------------------------ FAIDQEKLEKMIGDLL ------------1111 >ISOASPARTYL DIPEPTIDASE; SWP:P39377; PDB:1ONWA; MIDYTAAGFTLLQGAHLYAPEDRGICDVLVANGKIIAVASNIPSDIVPNCTVVDLSGQIL ---3333-----------------------iiii--------1111--------2222-- CPGFIDQHVHLIGGGGEAGPTTRTPEVALSRLTEAGVTSVVGLLGTDSISRHPESLLAKT ---------1111-----3333-----3333-1111------------------------ RALNEEGISAWMLTGAYHVPSRTITGSVEKDVAIIDRVIGVCAISDHRSAAPDVYHLANM ----------------------------------1111-------1111----------- 
AAESRVGGLLGGKPGVTVFHMGDSKKALQPIYDLLENCDVPISKLLPTHVNRNVPLFEQA -----------------------33333333---------1111----1111-------- LEFARKGGTIDITSSIDEPVAPAEGIARAVQAGIPLARVTLSSDGNGSGVAGFETLLETV ------------1111------------------3333-----2222------------- QVLVKDYDFSISDALRPLTSSVAGFLNLTGKGEILPGNDADLLVMTPELRIEQVYARGKL ---------3333-3333---------2222---2222-------1111------iiii- MVKDGKACVKGTFET --iiii----1111- >MAGO NASHI PROTEIN; SWP:P49028; PDB:1OO0A; EDFYLRYYVGHKGKFGHEFLEFEFRPDGKLRYANNSNYKNDTMIRKEAFVHQSVMEELKR ------------1111--------1111-------------------------------- IIIDSEIMQEDDLPWPPPDRVGRQELEIVIGDEHISFTTSKTGSLVDVNRSKDPEGLRCF -----3333--1111--------------------------------1111--------- YYLVQDLKCLVFSLIGLHFKIKPI ------------------------ >RNA-binding protein 8A; SWP:Q9V535; PDB:1OO0B; PGPQRSVEGWILFVTSIHEEAQEDEIQEKFCDYGEIKNIHLNLDRRTGFSKGYALVEYET -----------------11113333----3333--------------------------- HKQALAAKEALNGAEIMGQTIQVDWCFVKGPK ----------2222-iiii------------- >TRANSTHYRETIN; SWP:O93330; PDB:1OO2A; RCPLMVKILDAVKGTPAGSVALKVSQKTADGGWTQIATGVTDATGEIHNLITEQQFPAGV ---------------------------1111----------1111------3333----- YRVEFDTKAYWTNQGSTPFHEVAEVVFDAHPEGHGHYTLALLLSPFSYTTTAVVSS -----------1111----------------------------------------- >DIHYDROPTERIDINE REDUCTAS; SWP:Q9XVJ3; PDB:1OOEA; SSGKVIVYGGKGALGSAILEFFKKNGYTVLNIDLSANDQADSNILVDGNKNWTEQEQSIL --------1111------------------------1111------1111---------- EQTASSLQGSQVDGVFCVAGGWAGGSASSKDFVKNADLMIKQSVWSSAIAAKLATTHLKP ------!!!!---------------1111-----------------------------22 GGLLQLTGAAAAMGPTPSMIGYGMAKAAVHHLTSSLAAKDSGLPDNSAVLTIMPVTLDTP 22------3333---1111------------------------2222------------- MNRKWMPNADHSSWTPLSFISEHLLKWTTETSSRPSSGALLKITTENGTSTITPQ -----1111------3333----------1111--2222------iiii------ >ODORANT BINDING PROTEIN L; SWP:O02372; PDB:1OOHA; HMTMEQFLTSLDMIRSGCAPKFKLKTEDLDRLRVGDFNFPPSQDLMCYTKCVSLMAGTVN -----------------3333----------1111------------------------1 
KKGEFNAPKALAQLPHLVPPEMMEMSRKSVEACRDTHKQFKESCERVYQTAKCFSENADG 111---------3333--3333----------11111111-------------------- QFMWP ----- >COAT PROTEIN VP1; SWP:P13900; PDB:1OOPA; RVADTIGSGPVNSESIPALTAAETGHTSQVVPSDTMQTRHVKNYHSRSESTVENFLCRSA ---------------3333-3333------3333------------11113333------ CVFYTTYENHDSDGDNFAYWVINTRQVAQLRRKLEMFTYARFDLELTFVITSTQEQPTVR ----------------------------3333---------------------------- GQDAPVLTHQIMYVPPGGPVPTKVNSYSWQTSTNPSVFWTEGSAPPRMSVPFIGIGNAYS --------------2222---------------------2222----------------- MFYDGWARFDKQGTYGISTLNNMGTLYMRHVNDGGPGPIVSTVRIYFKPKHVKTWVPRPP -------1111-----3333---------------------------------------- RLCQYQKAGNVNFEPTGVTEGRTDITTMKTT ------------------------------- >Genome polyprotein; SWP:P13900; PDB:1OOPB; SDRVRSITLGNSTITTQECANVVVGYGVWPTYLKDEEATAEDQPTQPDVATCRFYTLESV --------!!!!---------------------1111----------1111--------- MWQQSSPGWWWKFPDALSNMGLFGQNMQYHYLGRAGYTIHVQCNASKFHQGCLLVVCVPE -------------3333------------------------------------------- AEMGCATLANKPDPKSLSKGEIANMFESQNSTGETAVQANVINAGMGVGVGNLTIFPHQW ------1111--3333--!!!!-----------------3333-----33331111---- INLRTNNSATIVMPYINSVPMDNMFRHNNFTLMVIPFAPLSYSTGATTYVPITVTVAPMC -3333------------------------------------------------------- AEYNGLRLAGKQ ------------ >Genome polyprotein; SWP:P13900; PDB:1OOPC; GLPTLSTPGSNQFLTSDDFQSPSAMPQFDVTPEMDIPGQVNNLMEIAEVDSVVPVNNTEG --------2222-1111-------2222-------------33331111----------3 KVMSIEAYQIPVQSNPTNGSQVFGFPLTPGANSVLNRTLLGEILNYYAHWSGSIKLTFMF 333-3333-------------------3333---1111-----1111------------- CGSAMATGKFLLAYSPPGAGAPTTRKEAMLGTHVIWDVGLQSSCVLCIPWISQTHYRYVV --1111-----------------3333--------------------------------- MDEYTAGGYITCWYQTNIVVPADAQSDCKILCFVSACNDFSVRMLKDTPFIKQDNFFQ -3333---------------2222------------1111------------------ >Genome polyprotein; SWP:P13900; PDB:1OOPD; GAQVSTQKTGAHEIHYTNINYYKDAASNSANRQDFTQDPGKFTEPVKDIMVKSMPALN -----------------------3333----------3333---------1111---- >Hypothetical 40.4 kDa 
pro; SWP:P43603; PDB:1OOTA; SPKAVALYSFAGEESGDLPFRKGDVITILKKSDSQNDWWTGRVNGREGIFPANYVELV -------------2222---2222------------------iiii------------ >Succinyl-CoA:3-ketoacid-c; SWP:Q29551; PDB:1OOYA; TKFYTDAVEAVKDIPNGATVLVGGFGLCGIPENLIGALLKTGVKELTAVSNNAGVDNFGL -----3333-11112222------!!!!------------------------------33 GLLLQSKQIKRMISSYVGENAEFERQYLAGELEVELTPQGTLAERIRAGGAGVPAFYTST 33-1111----------------------------------------1111-------22 GYGTLVQEGGSPIKYNKDGSIAIASKPREVREFNGQHFILEEAIRGDFALVKAWKADQAG 22--3333-------1111-------------iiii--------------------1111 NVTFRKSARNFNLPMCKAAETTVVEVEEIVDIGSFAPEDIHIPKIYVHRLVKGEKYEKRI ----!!!!--3333----------------2222-3333---3333-------------- ERNVRERIIKRAALEFEDGMYANLGIGIPLLASNFISPNMTVHLQSENGILGLGPYPLQN -----------1111-2222------3333--11111111-----1111--------111 EVDADLINAGKETVTVLPGASYFSSDESFAMIRGGHVNLTMLGAMQVSKYGDLANWMIPG 1-1111-1111-----2222---------------------------1111--------- KLVKGMGGAMDLVSSAKTKVVVTMEHSAKGNAHKIMEKCTLPLTGKQCVNRIITEKAVFD -----!!!!3333-3333---------2222----------------------------- VDRKKGLTLIELWEGLTVDDIKKSTGCDFAVSPKLIPMQQVTT ------------2222---------------1111-------- >VENOM SERINE PROTEINASE; SWP:Q9I8X1; PDB:1OP0A; VIGGNECDINEHRFLVAFFNTTGFFCGGTLINPEWVVTAAHCDSTDFQMQLGVHSKKVLN -------11111111----------------1111---1111-----------------1 EDEQTRNPKEKFICPNKNNNEVLDKDIMLIKLDKPISNSKHIAPLSLPSSPPSVGSVCRI 111-----------------1111--------------1111----------2222---- MGWGSITPVKETFPDVPYCANINLLDHAVCQAGYPELLAEYRTLCAGIVQGKDTCGGDSG -------------------------------------3333-------------2222-- GPLICNGQFQGIVSYGAHPCGQGPKPGIYTNVFDYTDWIQRNIAGNTDATCPP ------------------------------3333------------------- >FAB 2G12, LIGHT CHAIN; SWP:NA; PDB:1OP3H; EVQLVESGGGLVKAGGSLILSCGVSNFRISAHTMNWVRRVPGGGLEWVASISTSSTYRDY ------------2222------------1111-------3333--------2222----- ADAVKGRFTVSRDDLEDFVYLQMHKMRVEDTAIYYCARKGSDRLSDNDPFDAWGPGTVVT 1111----------------------1111------------------------------ 
VSPASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVL --------------------------------------------iiii------------ QSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK --------------3333------------1111---------- >FAB 2G12, LIGHT CHAIN; SWP:NA; PDB:1OP3K; VVMTQSPSTLSASVGDTITITCRASQSIETWLAWYQQKPGKAPKLLIYKASTLKTGVPSR ------------2222---------------------2222------------2222333 FSGSGSGTEFTLTISGLQFDDFATYHCQHYAGYSATFGQGTRVEIKRTVAAPSVFIFPPS 3----------------3333--------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTL 3331111---------------------iiii---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNRG 33333333--------1111----------- >NEURAL-CADHERIN; SWP:P15116; PDB:1OP4A; EASGEIALCKTGFPEDVYSAVLPKDVHEGQPLLNVKFSNCNRKRKVQYESSEPADFKVDE ----------------------------------------------------------33 DGTVYAVRSFPLTAEQAKFLIYAQDKETQEKWQVAVNLSREPTLTEEPMKEPHEIEEIVF 33-----------------------1111------------------------------- PRQLAKHSGALQRQKR ---------------- >HL6 CAMEL VHH FRAGMENT; SWP:P61626; PDB:1OP9A; QVQLQESGGGSVQAGGSLRLSCSASGYTYISGWFRQAPGKEREGVAAIRSSDGTTYYADS ------------2222--------------------2222---------1111----333 VKGRFTISQDNAKNTVYLQMNSLKPEDTAMYYCAATEVAGWPLDIGIYDYWGQGTEVTVS 3---------1111---------3333-------------11111111------------ S - >CELLULAR RETINOL BINDING ; SWP:P06768; PDB:1OPBA; TKDQNGTWEMESNENFEGYMKALDIDFATRKIAVRLTQTKIIVQDGDNFKTKTNSTFRNY --------------------1111-------3333---------!!!!------------ DLDFTVGVEFDEHTKGLDGRNVKTLVTWEGNTLVCVQKGEKENRGWKQWVEGDKLYLELT ----2222---------------------------------------------------- CGDQVCRQVFKKK !!!!--------- >OMPR; SWP:P41405; PDB:1OPC; VIAFGKFKLNLGTREMFREDEPMPLTSGEFAVLKALVSHPREPLSRDKLMNLARGREYSA ---!!!!----------%%%%-----------------2222--------3333----11 MERSIDVQISRLRRMVEEDPAHPRYIQTVWGLGYVFVPD 11----------------3333------2222------- >HISTIDINE-CONTAINING PROT; SWP:P07006; PDB:1OPD; MFEQEVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSNGKSASAKDLFKLQTLGLTQGTV 
---------2222------------3333-------iiii--1111----3333-2222- VTISAEGEDEQKAVEHLVKLMAELE --------------------1111- >COAT PROTEIN; SWP:P04383; PDB:1OPOA; SMTMSKTELLSTVKGTTGVIPSFEDWVVSPRNVAVFPQLSLLATNFNKYRITALTVKYSP ----------------------------1111-----3333------------------- ACSFETNGRVALGFNDDASDTPPTTKVGFYDLGKHVETAAQTAKDLVIPVDGKTRFIRDS ----------------3333--------1111------1111------------------ ASDDAKLVDFGRIVLSTYGFDKADTVVGELFIQYTIVLSDPTKTAKISQASNDKVSDGPT ---3333----------------------------------------------------- YVVPSVNGNELQLRVVAAGKWCIIVRGTVEGGFTKPTLIGPGISGDVDYESARPIAVCEL ------!!!!-------------------------------------------------- VTQMEGQILKITKTSAEQPLQWVVYRM -------------3333---------- >TYPE III ANTIFREEZE PROTE; SWP:P19608; PDB:1OPS; SQSVVATQLIPMNTALTPAMMEGKVTNPIGIPFAEMSQLVGKQVNTPVAKGQTLMPNMVK ----------2222--3333-----------33331111---------2222--111122 TYAA 22-- >PROTEIN YESU; SWP:YESU_BACSU; PDB:1OQ1A; AYKEGACLYRNPLRSKSDVKDWREGGGQISFDDHSLHLSHVQDEAHFVFWCPETFPDGII --------------33331111--------2222-------------------------- VTWDFSPIEQPGLCLFFAAAGIRGEDLFDPSLRKRTGTYPEYHSGDINALHLSYFRRKYA --------------------1111-1111--------3333-----------------33 EERAFRTCNLRKSRGFHLAAGADPLPSPDDADSPYRKLIKDKGYVHFSINGLPILEWDDG 33------------------------3333----------!!!!----iiii-------- STYGPVLTKGKIGFRQAPKAVYRDFAVHQAVRR --------------------------------- >ACYL-[ACYL-CARRIER PROTEI; SWP:P22337; PDB:1OQ9A; FMPPREVHVQVTHSMPPQKIEIFKSLDNWAEENILVHLKPVEKCWQPQDFLPDPASDGFD ---------------------------------1111--3333--3333---1111---- EQVRELRERAKEIPDDYFVVLVGDMITEEALPTYQTMLNTLDGVRDETGASPTSWAIWTR --------3333------------------------11111111--------3333---- AWTAEENRHGDLLNKYLYLSGRVDMRQIEKTIQYLIGSGMDPRTENSPYLGFIYTSFQER -----------------3333----------------------%%%%3333--------- ATFISHGNTARQAKEHGDIKLAQICGTIAADEKRHETAYTKIVEKLFEIDPDGTVLAFAD --------------------------------------------------3333------ MMRKKISMPAHLMYDGRDDNLFDHFSAVAQRLGVYTAKDYADILEFLVGRWKVDKLTGLS 
-------1111-------------------------------------1111-------3 AEGQKAQDYVCRLPPRIRRLKEAPTMPFSWIFDRQVKL 333------------------------3333------- >AUGMENTER OF LIVER REGENE; SWP:Q63042; PDB:1OQCA; EDCPQDREELGRNTWAFLHTLAAYYPDMPTPEQQQDMAQFIHIFSKFYPCEECAEDIRKR ---------------------1111----------------------------------- IDRSQPDTSTRVSFSQWLCRLHNEVNRKLGKPDFDCSRVDERWRDGWKDGSC ----------------------------------1111--------1111-- >Glucocorticoid Modulatory; SWP:Q9Y692; PDB:1OQJA; DMEIAYPITCGESKAILLWKKFVCPGINVKCVKFNDQLISPKHFVHLAGKSTLKDWKRAI ---------!!!!----3333------------%%%%--------11113333-3333-- RLGGIMLRKMMDSGQIDFYQHDKVCSNTCR -iiii3333----------3333------- >CONSERVED PROTEIN MTH11; SWP:O26119; PDB:1OQKA; RHELIGLSVRIARSVHRDIQGISGRVVDETRNTLRIEMDDGREITVPKGIAVFHFRTPQG ---------------3333------------------3333---------------3333 ELVEIDGRALVARPEERI -------------1111- >Beta-galactoside-specific; SWP:P81446; PDB:1OQLB; DDVTCSASEPTVRIVGRNGMCVDVRDDDFRDGNQIQLWPSKSNNDPNQLWTIKRDGTIRS ---------------2222----2222--2222-------------------1111---i NGSCLTTYGYTAGVYVMIFDCNTAVREATLWQIWGNGTIINPRSNLVLAASSGIKGTTLT iii----------------1111-3333-----1111----1111--------2222--- VQTLDYTLGQGWLAGNDTAPREVTIYGFRDLCMESNGGSVWVETCVSSQKNQRWALYGDG ------3333-----------------%%%%--------------2222-------1111 SIRPKQNQDQCLTCGRDSVSTVINIVSCSAGSSGQRWVFTNEGAILNLKNGLAMDVAQAN ---3333----------2222----------1111----1111-------------%%%% PKLRRIIIYPATGKPNQMWLPVP 3333---------1111------ >CALTRACTIN; SWP:P05434; PDB:1OQPA; GSGERDSREEILKAFRLFDDDNSGTITIKDLRRVAKELGENLTEEELQEMIAEADRNDDN ------------------1111-------------------------------------- EIDEDEFIRIMKKTSLF -----------1111-- >Phospholipase A2 RV-4 [Pr; SWP:Q02471; PDB:1OQSB; NLFQFARMINGKLGAFSVWNYISYGCYCGWGGQGTPKDATDRCCFVHDCCYGGVKGCNPK ---------------3333-------------------------------1111----11 LAIYSYSFQRGNIVCGRNNGCLRTICECDRVAANCFHQNKNTYNKEYKFLSSSKCRQRSE 11------iiii------!!!!--------------1111---1111---3333------ QC -- >TOXIN-COREGULATED PILUS S; SWP:P23024; PDB:1OQVA; 
DSQNMTKAAQSLNSIQVALTQTYRGLGNYPATADATAASKLTSGLVSLGKISSDEAKNPF ----------------------1111-------------------1111--3333----- IGTNMNIFSFPRNAAANKAFAISVDGLTQAQCKTLITSVGDMFPYIAIKAGGAVALADLG -----------iiii-----------------------3333------------3333-- DFENSAAAAETGVGVIKSIAPASKNLDLTNITHVEKLCKGTAPFGVAFGNS 1111---3333--------1111---1111--------------------- >UV EXCISION REPAIR PROTEI; SWP:P54725; PDB:1OQYA; SAVTITLKTLQQQTFKIRMEPDETVKVLKEKIEAEKGRDAFPVAGQKLIYAGKILSDDVP --------1111------------------------------------------------ IRDYRIDEKNFVVVMVTKTKAGQGTSAPPEASPTAAPESSTSFPPAPTSGMSHPPPAARE -1111-3333------------------------3333---------------------- DKSPSEESAPTTSPESVSGSVPSSGSSGREEDAASTLVTGSEYETMLTEIMSMGYERERV --------------------------------------1111--------------3333 VAALRASYNNPHRAVEYLLTGIPGSPEPEHGSVQESQVSEQPATEAAGENPLEFLRDQPQ -1111------------------------------------------------------- FQNMRQVIQQNPALLPALLQQLGQENPQLLQQISRHQEQFIQMLNEPPGELADISDVEGE ---------------------11113333---1111------1111-------------- VGAIGEEAPQMNYIQVTPQEKEAIERLKALGFPESLVIQAYFACEKNENLAANFLLSQNF -------------------3333----1111-------3333------3333-------- DDE --- >GLUTARYL 7-AMINOCEPHALOSP; SWP:Q9L5D6; PDB:1OR0A; APIAAYKPRSNEILWDGYGVPHIYGVDAPSAFYGYGWAQARSHGDNILRLYGEARGKGAE ---------------1111---------------------------------1111-333 YWGPDYEQTTVWLLTNGVPERAQQWYAQQSPDFRANLDAFAAGINAYAQQNPDDISPDVR 3-3333-------------------1111---------------------1111-33333 QVLPVSGADVVAHAHRLNFLYVASPGRTLGEG 333----------------------------- >Glutaryl-7-aminocephalosp; SWP:P07662; PDB:1OR0B; SNSWAVAPGKTANGNALLLQNPHLSWTTDYFTYYEAHLVTPDFEIYGATQIGLPVIRFAF ------3333-----------------1111--------1111------2222------- NQRGITNTVNGVGATNYRLTLQDGGYLYDGQVRPFERPQASYRLRQADGTTVDKPLEIRS ---------------------iiii--iiii--------------1111----------- SVHGPVFERADGTAVAVRVAGLDRPGLEQYFDITADSFDDYEAALARQVPTFNIVYADRE 3333----1111--------1111---------------------------------333 GTINYSFNGVAPKRAEGDIAFWQGLVPGDSSRYLWTETHPLDDLPRVTNPPGGFVQNSND 
3----------------3333--------3333------3333------3333------- PPWTPTWPVTYTPKDFPSYLAPQTPHSLRAQQSVRLSENDDLTLERFALQLSHRAVADRT -----------3333-1111----------------------3333-3333-----1111 LPDLIPAALIDPDPEVQAAARLLAAWDREFTSDSRAALLFEEWARLFAGQNFAGQAGFAT -----3333---------------------1111--------------1111--1111-- PWSLDKPVSTPYGVRDPKAAVDQLRTAIANTKRKYGAIDRPFGDASRILNDVNVPGAAGY --3333----------------------------------1111----!!!!-------3 GNLGSFRVFTWSDPDENGVRTPVHGETWVAIEFSTPVRAYGLSYGNSRQPGTTHYSDQIE 333-----------1111------------------------------2222-----333 RVSRADFRELLLRREQVEAAVQERTPFNFK 31111------------------------- >HEME-BASED AEROTACTIC TRA; SWP:O07621; PDB:1OR4A; ETAYFSDSNGQQKNRIQLTNKHADVKKQLKMVRLGDAELYVLEQLQPLIQENIVNIVDAF -------3333-------33333333---------------------------------- YKNLDHESSLMDIINDHSSVDRLKQTLKRHIQEMFAGVIDDEFIEKRNRIASIHLRIGLL ------------------3333---------3333------------------------- PKWYMGAFQELLLSMIDIYEASITNQQELLKAIKATTKILNLEQQLVLE ------------------------------------------------- >ACYL CARRIER PROTEIN; SWP:Q54996; PDB:1OR5A; SALTVDDLKKLLAETAGEDDSVDLAGELDTPFVDLGYDSLALLETAAVLQQRYGIALTDE ---3333---------------3333----3333-------------------------3 TVGRLGTPRELLDEVNTTPATA 333---3333------------ >RNA POLYMERASE SIGMA-E FA; SWP:P38106; PDB:1OR7C; MQKEQLSALMDGETLDSELLNELAHNPEMQKTWESYHLIRDSMRGDTPEVLHFDISSRVM --------1111------------------------------------------------ AAIEEE --1111 >PROTEIN ARGININE N-METHYL; SWP:Q63009; PDB:1OR8A; HFGIHEEMLKDEVRTLTYRNSMFHNRHLFKDKVVLDVGSGTGILCMFAAKAGARKVIGIE ---------------------11113333---------!!!!------------------ CSSISDYAVKIVKANKLDHVVTIIKGKVEEVELPVEKVDIIISEWMGYCLFYESMLNTVL -3333-------1111----------1111----------------22222222------ HARDKWLAPDGLIFPDRATLYVTAIEDRQYKDYKIHWWENVYGFDMSCIKDVAIKEPLVD -------2222-------------------------3333iiii-3333----------- VVDPKQLVTNACLIKEVDIYTVKVEDLTFTSPFCLQVKRNDYVHALVAYFNIEFTRCHKR --3333-----------3333-3333---------------------------3333--- 
TGFSTSPESPYTHWKQTVFYMEDYLTVKTGEEIFGTIGMRPNAKNNRDLDFTIDLDFKGQ -----1111--3333------------2222---------------------------11 LCELSCSTDYRMR 11----------- >CRO REPRESSOR INSERTION M; SWP:P03040; PDB:1ORC; QRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSVYAEEVKDGEVKP -----------------------3333---------------1111-------iiii--- FPSN ---- >GRANZYME A PRECURSOR; SWP:P12544; PDB:1ORFA; IIGGNEVTPHSRPYMVLLSLDRKTICAGALIAKDWVLTAAHCNLNKRSQVILGAHSITRE -------22221111-----1111--------------1111--1111------------ EPTKQIMLVKKEFPYPCYDPATREGDLKLLQLTEKAKINKYVTILHLPKKGDDVKPGTMC 3333----------1111--------------------1111------------2222-- QVAGWGRTHNSASWSDTLREVEITIIDRKVCNDRNHYNFNPVIGMNMVCAGSLRGGRDSC --------------------------3333------%%%%---1111----1111----2 NGDSGSPLLCEGVFRGVTSFGLENKCGDPRGPGVYILLSKKHLNWIIMTIKG 222------------------2222--3333--------------------- >FLAGELLAR PROTEIN FLIS; SWP:O67806; PDB:1ORJA; RNIAEAYFQNMVETATPLEQIILLYDKAIECLERAIEIYDQVNELEKRKEFVENIDRVYD ---3333------------------------------1111--3333------------- IISALKSFLDHEKGKEIAKNLDTIYTIILNTLVKVDKTKEELQKILEILKDLREAWEEVK -----3333--------------------------------------------------- KKVHHH --1111 ---------------------------------------------- >ENDONUCLEASE III; SWP:Q5KXY2; PDB:1ORNA; MLTKQQIRYCLDEMAKMFPDAHCELVHRNPFELLIAVVLSAQCTDALVNKVTKRLFEKYR -----------------1111--------------------------------1111--- TPHDYIAVPLEELEQDIRSIGLYRNKARNIQKLCAMLIDKYNGEVPRDRDELMKLPGVGR -3333---3333----3333-3333---------------%%%%---3333---2222-- KTANVVVSVAFGVPAIAVDTHVERVSKRLGFCRWDDSVLEVEKTLMKIIPKEEWSITHHR --------------------------------1111-------------3333------- MIFFGRYHCKAQSPQCPSCPLLHLCREGKKRMRK --------------33331111--------3333 >Voltage-gated potassium c; SWP:Q9YDF8; PDB:1ORQB; DVQLQESGPGLVKPSQSLSLTCTVTGYSITSLYAWNWIRQFPGNKLEWMGYINYSGYTSY ---------------------------1111---------1111--------1111---- NPSLKSRISITRDTSKNQFFLQLHSVTTEDTATYSCTRGVDYFAMDYWGQGASVTVSSAK -1111---------------------3333---------2222----------------- 
TTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSDLY --------------------------------------iiii------------------ TLSSSVTVTSNTWPSQSITCNVAHPASSTKVDKKIVPRD --------3333------------1111----------- >CDP-TYVELOSE-2-EPIMERASE; SWP:P14169; PDB:1ORRA; AKLLITGGCGFLGSNLASFALSQGIDLIVFDNLSRKGATDNLHWLSSLGNFEFVHGDIRN ------1111----------1111----------2222------1111--------3333 KNDVTRLITKYMPDSCFHLAGQVAMTTSIDNPCMDFEINVGGTLNLLEAVRQYNSNCNII -----------------------------------------------------1111--- YSSTNKVYGDLEQYKYNETETRYTCVDKPNGYDESTQLDFHSPYGCSKGAADQYMLDYAR -----1111-3333----1111--1111----1111------------------------ IFGLNTVVFRHSSMYGGRQFATYDQGWVGWFCQKAVEIKNGINKPFTISGNGKQVRDVLH ---------------------1111------------1111------------------3 AEDMISLYFTALANVSKIRGNAFNIGGTIVNSLSLLELFKLLEDYCNIDMRFTNLPVRES 333----------33332222------3333----------------------------- DQRVFVADIKKITNAIDWSPKVSAKDGVQKMYDWTSSI ----------------------------------3333 >33H1 FAB LIGHT CHAIN; SWP:Q9YDF8; PDB:1ORSA; QIVLTQSPAIMSASLGDRVTMTCTASSSVSSSYLHWYQQKPGSSPKLWIYSTSNLASGVP -------------2222------------3333------2222----------------3 ARFSGSGSGTSYSLTISSMEAEDAATYYCHQFHRSLTFGSGTKLEIKRADAAPTVSIFPP 333----------------3333------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT -----------------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -33331111--------3333--------1111- >Voltage-gated potassium c; SWP:Q9YDF8; PDB:1ORSB; DVQLQESGPGLVKPSQSLSLTCTVTGYSITNNYAWNWIRQFPGNKLEWMGYINYSGTTSY ------------2222-----------1111---------1111--------1111---- NPSLKSRISITRDTSKNQFFLQLNSVTTEDTATYFCVRGYDYFAMDYWGQGTSVTVSSAK 1111--------3333----------1111---------2222----------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%------------------ TLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDCG --------1111------------1111------------- >Voltage-gated potassium c; SWP:Q9YDF8; PDB:1ORSC; 
DVMEHPLVELGVSYAALLSVIVVVVEYTMQLSGEYLVRLYLVDLILVIILWADYAYRAYK ------------------------------------------------------------ SGDPAGYVKKTLYEIPALVPAGLLALIEGHLAGLGLFRLVRLLRFLRILLIISRGSKFLS ----3333--33333333-3333--------1111------------------------- AIADAADKLVPR ------------ >YUAD PROTEIN; SWP:O32079; PDB:1ORUA; WKRTAKAEGLYIADTKSFVTKQDKLDFDYGGIPGDLHFGLTKKAGAREPFSRGTEIFNRR ---------------------------22222222---------3333--2222------ QISIVSIEECNEIALKGVPRILPEWLGANVAVSGPDLTSLKEGSRIIFPSGAALLCEGEN ---------------------3333----------3333-2222---3333--------- DPCIQPGEVIQSYYPDQPKLASAFVRHALGIRGIVCIVERPGAVYTGDEIEVHSYQ -------------1111----------2222-------------2222-------- >DIPEPTIDYL PEPTIDASE IV; SWP:P22411; PDB:1ORVA; SRRTYTLTDYLKSTFRVKFYTLQWISDHEYLYKQENNILLFNAEYGNSSIFLENSTFDEL ---------------------------------iiii----------------------- GYSTNDYSVSPDRQFILFEYNYVKQWRHSYTASYDIYDLNKRQLITEERIPNNTQWITWS ---------1111----------------------------------------------- PVGHKLAYVWNNDIYVKNEPNLSSQRITWTGKENVIYNGVTDWVYEEEVFSAYSALWWSP ---------%%%%-----1111---------2222-----------------------11 NGTFLAYAQFNDTEVPLIEYSFYSDESLQYPKTVRIPYPKAGAENPTVKFFVVDTRTLSP 11---------1111---------3333-----------2222----------3333-11 NASVTSYQIVPPASVLIGDHYLCGVTWVTEERISLQWIRRAQNYSIIDICDYDESTGRWI 11---------3333-----------------------3333------------------ SSVARQHIEISTTGWVGRFRPAEPHFTSDGNSFYKIISNEEGYKHICHFQTDKSNCTFIT -3333---------------------3333--------1111-------1111------- KGAWEVIGIEALTSDYLYYISNEHKGMPGGRNLYRIQLNDYTKVTCLSCELNPERCQYYS ----------------------22221111------1111-------11113333----- ASFSNKAKYYQLRCFGPGLPLYTLHSSSSDKELRVLEDNSALDKMLQDVQMPSKKLDVIN ---------------------------------------------1111----------- LHGTKFWYQMILPPHFDKSKKYPLLIEVYAGPCSQKVDTVFRLSWATYLASTENIIVASF iiii------------1111----------2222---------3333------------- DGRGSGYQGDKIMHAINRRLGTFEVEDQIEATRQFSKMGFVDDKRIAIWGWSYGGYVTSM -2222---3333-1111-2222---------------11111111--------------- VLGAGSGVFKCGIAVAPVSKWEYYDSVYTERYMGLPTPEDNLDYYRNSTVMSRAENFKQV 
1111---------------3333-3333--------3333333311113333----1111 EYLLIHGTADDNVHFQQSAQLSKALVDAGVDFQTMWYTDEDHGIASNMAHQHIYTHMSHF ------1111---3333--------1111-----------1111---------------- LKQCFSLP -------- >Flagellin; SWP:O67803; PDB:1ORYB; NVDFAKEMTEFTKYQIRMQSGVAMLAQANALPQLVLQLLR -------------------3333-------3333--1111 >PPCA; SWP:Q8GGK7; PDB:1OS6A; ADDIVLKAKNGDVKFPHKAHQKAVPDCKKCHEKGPGKIEGFGKEMAHGKGCKGCHEEMKK -------1111--------------1111------3333--------------------- GPTKCGECHKK ---1111---- >TRYPSIN; SWP:P00775; PDB:1OS8A; VVGGTRAAQGEFPFMVRLSMGCGGALYAQDIVLTAAHCVSSN -------22221111------------1111---1111---- >HYPOTHETICAL PROTEIN MERP; SWP:Q5NUU9; PDB:1OSDA; ATQTVTLSVPGMTCSACPITVKKAISKVEGVSKVDVTFETRQAVVTFDDAKTSVQKLTKA --------1111-1111--------------------1111------1111--------- TADAGYPSSVKQ -1111------- >BILE ACID RECEPTOR; SWP:Q96RI1; PDB:1OSHA; SHGELTPDQQTLLHFIMDSYNKQRMPENFLILTEMATNHVQVLVEFTKKLPGFQTLDHED -1111-------------3333---------------------------2222------- QIALLKGSAVEAMFLRSAEIFNKKLPSGHSDLLEERIRNSGISDEYITPMFSFYKSIGEL -------------------------3333----------------------------111 KMTQEEYALLTAIVILSPDRQYIKDREAVEKLQEPLLDVLQKLCKIHQPENPQHFACLLG 1---------------1111---------------------------1111--------- RLTELRTFNHHHAEMLMSWRVNDHKFTPLLCEIWDV -------------------1111------------- >THYMIDINE KINASE; SWP:P14344; PDB:1OSNA; VKMGVLRIYLDGAYGIGKTTAAEEFLHHFAITPNRILLIGEPLSYWRNLAGEDAICGIYG ------------------11111111-----1111------3333--------------- TQTRRLNGDVSPEDAQRLTAHFQSLFCSPHAIMHAKISALMDTSTSDLVQVNKEPYKIML ----1111--------------3333----------3333-------------------- SDRHPIASTICFPLSRYLVGDMSPAALPGLLFTLPAEPPGTNLVVCTVSLPSHLSRVSET ----3333--------------3333----------------------3333-------- VNLPFVMVLRNVYIMLINTIIFLKTNNWHAGWNTLSFCNDVFKQKLQKSECIKLREVPGI ------------------------------1111----------1111----------11 EDTLFAVLKLPELCGEFGNILPLWAWGMETLSNCLRSMSPFVLSLEQTPQHAAQELKTLL 11-3333--3333--------3333---------3333---------3333---333333 PQMTPANMSSGAWNILKELVNAVQD 33----------------------- 
>Ig gamma-2B chain C regio; SWP:GCBM_MOUSE; PDB:1OSPH; EVQLQESGPSLVKPSQTLSLTCSVTGEPITSGFWDWIRKFPGNKLEFMGYIRYGGGTYYN ---------------------------------------1111--------2222----1 PSLKSPISITRDTSKNHYYLQLNSVVTEDTATYYCARSRDYYGSSGFAFWGEGTLVTVSA 111---------1111---------1111------------------------------- AKTTPPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSG ---------------------------------------------------------iii LYTMSSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLE i---------11113333-------3333--------- >BILE ACID RECEPTOR; SWP:Q62735; PDB:1OSVA; AELTVDQQTLLDYIMDSYSKQRMPQEITNKILKEEFSAEENFLILTEMATSHVQILVEFT ----------------1111--------1111----3333------------------33 KRLPGFQTLDHEDQIALLKGSAVEAMFLRSAEIFNKKLPAGHADLLEERIRKSGISDEYI 332222------------------------------------------------------ TPMFSFYKSVGELKMTQEEYALLTAIVILSPDRQYIKDREAVEKLQEPLLDVLQKLCKIY ---------3333-------------------1111------------------------ QPENPQHFACLLGRLTELRTFNHHHAEMLMSWRVNDHKFTPLLCEIWDV ---------------------------3333------------------ >IMMUNOMODULATORY PROTEIN ; SWP:P80412; PDB:1OSYA; SATSLTFQLAYLVKKIDFDYTPNWGRGTPSSYIDNLTFPKVLTDKKYSYRVVVNGSDLGV ----------------------------1111--------------------iiii---- ESNFAVTPSGGQTINFLQYNKGYGVADTKTIQVFVVIPDTGNSEEYIIAEWKKT ------3333----3333-iiii--1111----------%%%%----------- >NEUROGENIC LOCUS NOTCH PR; SWP:P07207; PDB:1OT8A; TAQVISDLLAQGAELNATMDKTGETSLHLAARFARADAAKRLLDAGADANSQDNTGRTPL ----------------3333----3333--------------1111-1111-1111---- HAAVAADAMGVFQILLRNRATNLNARMHDGTTPLILAARLAIEGMVEDLITADADINAAD -----------------33331111-1111------------------------1111-1 NSGKTALHWAAAVNNTEAVNILLMHHANRDAQDDKDETPLFLAAREGSYEASKALLDNFA 111-------------------1111-1111-1111-------------------1111- NREITDHMDRLPRDVASERLHHDIVRLLD 1111-1111-3333--1111--------- >PHOTOACTIVE YELLOW PROTEI; SWP:P16113; PDB:1OTDA; SQLDNLAFGAIQLDGDGTILQYNAAEGDITGRNPKEVIGKNFFKDVAPCTDSPEFSGKFK -1111--------1111---------------33332222------3333-1111----- EGVASGNLNTMFEYTFDYQMTPTKVKVHMKKALSGDSYWVFVKRV 
--------------------------------------------- >4-OXALOCROTONATE TAUTOMER; SWP:P49172; PDB:1OTFA; PIAQLYIIEGRTDEQKETLIRQVSEAMANSLDAPLERVRVLITEMPKNHFGIGGEPASK ----------------------------1111-3333--------1111--iiii3333 >5-CARBOXYMETHYL-2-HYDROXY; SWP:Q05354; PDB:1OTGA; PHFIVECSDNIREEADLPGLFAKVNPTLAATGIFPLAGIRSRVHWVDTWQMADGQHDYAF -------33333333-------------------1111------------!!!!------ VHMTLKIGAGRSLESRQQAGEMLFELIKTHFAALMESRLLALSFEIEELHPTLNFKQNNV --------------------------------3333----------------------33 HALFK 33--- >ORNITHINE TRANSCARBAMOYLA; SWP:P00480; PDB:1OTHA; KVQLKGRDLLTLKNFTGEEIKYMLWLSADLKFRIKQKGEYLPLLQGKSLGMIFEKRSTRT ---2222---3333---------------------------1111------------333 RLSTETGFALLGGHPCFLTTQDIHLGVNESLTDTARVLSSMADAVLARVYKQSDLDTLAK 3-------1111------3333----------------------------3333------ EASIPIINGLSDLYHPIQILADYLTLQEHYSSLKGLTLSWIGDGNNILHSIMMSAAKFGM --------------3333--------------2222-----------------3333--- HLQAATPKGYEPDASVTKLAEQYAKENGTKLLLTNDPLEAAHGGNVLITDTWISMGREEE ------2222------------------------------2222---------2222111 KKKRLQAFQGYQVTMKTAKVAASDWTFLHCLPRKPEEVDDEVFYSPRSLVFPEAENRKWT 1----1111--------11111111---------3333-3333-1111------------ IMAVMVSLLTDYSPQLQKPKF --------------------- >Alpha-Ketoglutarate-Depen; SWP:P37610; PDB:1OTJA; SERLSITPLGPYIGAQISGADLTRPLSDNQFEQLYHAVLRHQVVFLRDQAITPQQQRALA --------------------3333-----------------------------------3 QRFGELHIHPVYPHAEGVDEIIVLDTHNDNPPDNDNWHTDVTFIETPPAGAILAAKELPS 333-----------2222--------1111------------------------------ TGGDTLWTSGIAAYEALSVPFRQLLSGLRAEHDFRKSFPEYKYRKTEEEHQRWREAVAKN -------------1111-------2222----3333--3333------------------ PPLLHPVVRTHPVSGKQALFVNEGFTTRIVDVSEKESEALLSFLFAHITKPEFQVRWRWQ ---------------------3333--------------------11113333------2 PNDIAIWDNRVTQHYANADYLPQRRIMHRATILGDKPFYRA 222----1111------------------------------ >PHENYLACETIC ACID DEGRADA; SWP:P76079; PDB:1OTKA; HGNQLTAYTLRLGDNCLVLSQRLGEWCGHAPELEIDLALANIGLDLLGQARNFLSYAAEL 
-----------------------1111--------------------------------- AGEGDEDTLAFTRDERQFSNLLLVEQPNGNFADTIARQYFIDAWHVALFTRLMESRDPQL -------------3333---3333------------------------------------ AAISAKAIKEARYHLRFSRGWLERLGNGTDVSGQKMQQAINKLWRFTAELFDADEIDIAL ------------------------11113333----------1111--1111-------- SEEGIAVDPRTLRAAWEAEVFAGINEATLNVPQEQAYRTGGKKGLHTEHLGPMLAEMQYL -------3333----------------------------3333---3333---------- QRVL ---- >PROTEIN CUE2; SWP:P36075; PDB:1OTRA; NDDHESKLSILMDMFPAISKSKLQVHLLENNNDLDLTIGLLLKENDDKS --3333-----------------------%%%%---------------- >Igk-C protein; SWP:Q58EV6; PDB:1OTSC; VRLLESGGGLVQPGGSLKLSCAASGFDYSRYWMSWVRQAPGKGLKWIGEINPVSSTINYT -----------2222-----------------------2222--------1111------ PSLKDKFIISRDNAKDTLYLQISKVRSEDTALYYCARLYYGYGYWYFDVWGAGTTVTVSS -----------3333----------3333------------------------------- AKTTPPSVYPLAPGSAAAAASMVTLGCLVKGYFPEPVTVTWNSGSLAAGVHTFPAVLQAA ----------------------------------------%%%%-------------%%% LYTLSSSVTVPSSSWPSETVTCNVAHPASSTKVDKKIVPRA %---------1111------------1111----------- >COENZYME PQQ SYNTHESIS PR; SWP:P27505; PDB:1OTVA; LITDTLSPQAFEEALRAKGDFYHIHHPYHIAMHNGDATRKQIQGWVANRFYYQTTIPLKD ---------------------3333----------------------------------- AAIMANCPDAQTRRKWVQRILDHDGSHGEDGGIEAWLRLGEAVGLSRDDLLSERHVLPGV ---1111------------------iiii-3333------1111-3333---11113333 RFAVDAYLNFARRACWQEAACSSLTELFAPQIHQSRLDSWPQHYPWIKEEGYFYFRSRLS -------------------------1111-%%%%-----33331111-----------11 QANRDVEHGLALAKAYCDSAEKQNRMLEILQFKLDILWSMLDAMTMAYALQRPPYHTVTD 11--1111--------------------------------------------2222---- KAAWHTTRLVLEHH -------1111--- >PRECORRIN-8X METHYLMUTASE; SWP:Q9HKE7; PDB:1OU0A; SLAAIDSIDPDISGPRHIVVKAIHAAGDFAIAPLIRYSDGFFKSLAKLKEGCTIICDSEV ---3333-3333---------------33331111--1111------------------3 RAGIYSRPVLERNRVVCYLNDVRSKEADVNGITRSAAGIRIAQDHRNSVIVIGNAPTALL 333--------------11113333--1111-----------------------3333-- EARIEENGWYDIPIVGIPVGFINASKAKEGLVSSHIEYISVEGHRGGSPIAASIVNGFGR 
---------------------------------------------------------333 FL 3- >TRNA CCA-ADDING ENZYME; SWP:Q96Q11; PDB:1OU5A; KMKLQSPEFQSLFTEGLKSLTELFVKENHELRIAGGAVRDLLNGVKPQDIDFATTATPTQ -------3333--1111-------1111------22223333----------------33 MKEMFQSAGIRMINNRGEKHGTITARLHEENFEITTLRIFTTDWQKDAERRDLTINSMFL 33-------------------------------------------3333----1111--- GFDGTLFDYFNGYEDLKNKKVRFVGHAKQRIQEDYLRILRYFRFYGRIVDKPGDHDPETL 3333------3333--------------3333---------------------------- EAIAENAKGLAGISGERIWVELKKILVGNHVNHLIHLIYDLDVAPYIGLPANASLEEFDK --33333333----1111--------------------11113333-------------- VSKNVDGFSPKPVTLLASLFKVQDDVTKLDLRLKIAKEEKNLGLFIVKNRKDLIKATDSS ----------3333-3333----------------------------------------- DPLKPYQDFIIDSREPDATTRVCELLKYQGEHCLLKEMQQWSI --3333------------------------------------- >NUCLEASE; SWP:Q8DCA6; PDB:1OUOA; APPSSFSAAKQQAVKIYQDHPISFYCGCDIEWQGKKGIPNLETCGYQVRKQQTRASRIEW ----3333------1111--------------!!!!---3333-------3333------ EHVVPAWQFGHHRQCWQKGGRKNCSKNDQQFRLEADLHNLTPAIGEVNGDRSNFNFSQWN ----3333-11113333-----------3333---3333--------------------- GVDGVSYGRCEQVNFKQRKVPPDRARGSIARTYLYSQEYGFQLSKQQQQLQAWNKSYPVD --------------1111---3333-----------1111-------------------- EWECTRDDRIAKIQGNHNPFVQQSC -----------------33331111 >HEMOGLOBIN I; SWP:P02019; PDB:1OUTA; SLTAKDKSVVKAFWGKISGKADVVGAEALGRMLTAYPQTKTYFSHWADLSPGSGPVKKHG ----------------3333------------------33333333---2222------- GIIMGAIGKAVGLMDDLVGGMSALSDLHAFKLRVDPGNFKILSHNILVTLAIHFPSDFTP --------333333333333--------------------------------------33 EVHIAVDKFLAAVSAALADKYR 33---------------1111- >Hemoglobin subunit beta-1; SWP:P02142; PDB:1OUTB; VEWTDAEKSTISAVWGKVNIDEIGPLALARVLIVYPWTQRYFGSFGNVSTPAAIMGNPKV --------------11113333-------------------3333----33331111--- AAHGKVVCGALDKAVKNMGNILATYKSLSETHANKLFVDPDNFRVLADVLTIVIAAKFGA ----------------33333333---------------------------------!!! 
SFTPEIQATWQKFMKVVVAAMGSRYF !------------------1111--- >CONSERVED HYPOTHETICAL SE; SWP:O25728; PDB:1OUVA; DPKELVGLGAKSYKEKDFTQAKKYFEKACDLKENSGCFNLGVLYYQGQGVEKNLKKAASF ----------------------------1111---------------------------- YAKACDLNYSNGCHLLGNLYYSGQGVSQNTNKALQYYSKACDLKYAEGCASLGGIYHDGK ----1111--------------------------------1111---------------- VVTRDFKKAVEYFTKACDLNDGDGCTILGSLYDAGRGTPKDLKKALASYDKACDLKDSPG ----------------1111---------------------------------------- CFNAGNMYHHGEGATKNFKEALARYSKACELENGGGCFNLGAMQYNGEGVTRNEKQAIEN -------------------------------------------1111------1111--- FKKGCKLGAKGACDILKQLKIKVHH --------3333-3333-------- >LECTIN; SWP:P93114; PDB:1OUWA; VPMDTISGPWGNNGGNFWSFRPVNKINQIVISYGGGGNNPIALTFSSTKADGSKDTITVG ---------------------------------%%%%-----------1111-------- GGGPDSITGTEMVNIGTDEYLTGISGTFGIYLDNNVLRSITFTTNLKAHGPYGQKVGTPF ---------------1111-----------%%%%-------------------------- SSANVNEIVGFLGRSGYYVDAIGTYNRH ---------------------------- >Alpha-2-macroglobulin rec; SWP:P30533; PDB:1OV2A; GEEFRMEKLNQLWEKAQRLHLPPVRLAELHADLKIQERDELAWKKLKLDGLDEDGEKEAR ----------------------------------------------1111----3333-- LIRNLNVILAKYGLDGKKDARQ ---------------------- >VICH PROTEIN; SWP:Q9Z4R1; PDB:1OV9A; EITKTLLNIRSLRAYARELTIEQLEEALDKLTTVVQERKEAEAEE 33331111---------------------------------1111 >ORPHAN NUCLEAR RECEPTOR N; SWP:P43354; PDB:1OVLA; SLISALVRAHVDSNPAMTSLDYSRFQANDTQHIQQFYDLLTGSMEIIRGWAEKIPGFADL ----------1111-------1111----------------------------2222--- PKADQDLLFESAFLELFVLRLAYRSNPVEGKLIFCNGVVLHRLQCVRGFGEWIDSIVEFS --------------------------1111---1111---33333333-3333------- SNLQNNIDISAFSCIAALAVTERHGLKEPKRVEELQNKIVNCLKDHVTFNNGGLNRPNYL ----------------------2222------------------------1111------ SKLLGKLPELRTLCTQGLQRIFYLKLEDLVPPPAIIDKLFLDTLPF ---------------------------------------------- >INDOLE-3-PYRUVATE DECARBO; SWP:P23234; PDB:1OVMA; TPYCVADYLLDRLTDCGADHLFGVPGDYNLQFLDHVIDSPDICWVGCANELNASYAADGY 
-------------1111--------11113333--------------------------- ARCKGFAALLTTFGVGELSAMNGIAGSYAEHVPVLHIVGAPGTAAQQRGELLHHTLGDGE -------------3333------------------------------------------- FRHFYHMSEPITVAQAVLTEQNACYEIDRVLTTMLRERRPGYLMLPADVAKKAATPPVNA --------1111------1111-----------------------3333----------- LTHKQAHADSACLKAFRDAAENKLAMSKRTALLADFLVLRHGLKHALQKWVKEVPMAHAT -----------------------1111-----------1111------------------ MLMGKGIFDERQAGFYGTYSGSASTGAVKEAIEGADTVLCVGTRFTDTLTAGFTHQLTPA 3333-------2222----!!!!--------3333----------3333--------333 QTIEVQPHAARVGDVWFTGIPMNQAIETLVELCKQHVHAPDGSLTQENFWRTLQTFIRPG 3----------!!!!-----------------1111---------------3333--222 DIILADQGTSAFGAIDLRLPADVNFIVQPLWGSIGYTLAAAFGAQTACPNRRVIVLTGDG 2------------1111---------------2222-----------1111------333 AAQLTIQELGSMLRDKQHPIILVLNNEGYTVERAIHGAEQRYNDIALWNWTHIPQALSLD 3-----------------------------------11111111----33331111---- PQSECWRVSEAEQLADVLEKVAHHERLSLIEVMLPKADIPPLLGALTKALEACNN --------------------1111----------1111--3333------3333- >50S RIBOSOMAL PROTEIN L18; SWP:P09415; PDB:1OVYA; GTTERPRLSVFRSNKHIYAQIIDDTKSATIVSASTLDKEFGLDSTNNIEAAKKVGELVAK ---------------------------------11113333--3333------------- RALEKGIKQVVFDRGGYLYHGRVKALADAAREAGLEF ---------------------3333------------ >SMART/HDAC1 ASSOCIATED RE; SWP:Q96T58; PDB:1OW1A; PVDMVQLLKKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHRSLPLSPPLRIAQRMRLEA -----3333-----------!!!!-----------3333--------------------- TQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFITYLQAKQAAGIINVPNP -------11111111-------------------------------------------22 GSNQPAYVLQIFPPCEFSESHLSRLAPDLLASISNISPHLMIVIASV 22------------------------------2222----------- >PHEROMONE BINDING PROTEIN; SWP:Q8MTC1; PDB:1OW4A; SSTQSYKDAMGPLVRECMGSVSATEDDFKTVLNRNPLESRTAQCLLACALDKVGLISPEG ----------------1111------------------------------------1111 AIYTGDDLMPVMNRLYGFNDFKTVMKAKAVNDCANQVNGAYPDRCDLIKNFTDCVRNSY ----3333----------------------------2222------------------- >SERINE/THREONINE-PROTEIN ; SWP:P23561; 
PDB:1OW5A; PFVQLFLEEIGCTQYLDSFIQCNLVTEEEIKYLDKDILIALGVNKIGDRLKILRKSKSFQ 3333-3333--3333---3333---333333333333-------3333------3333-- >SPECTRIN ALPHA CHAIN, ERY; SWP:P02549; PDB:1OWAA; MEQFPKETVVESSGPKVLETAEEIQERRQEVLTRYQSFKERVAERGQKLEDSYHLQVFKR ----------1111---------------------------------------------- DADDLGKWIMEKVNILTDKSYEDPTNIQGKYQKHQSLEAEVQTKSRLMSELEKTREERFT -----------------3333----3333------------------------------- MGHSAHEETKAHIEELRHLWDLLLELTLEKGDQLLR -3333--------------------------3333- >CITRATE SYNTHASE; SWP:P00891; PDB:1OWCA; ADTKAKLTLNGDTAVELDVLKGTLGQDVIDIRTLGSKGVFTFDPGFTSTASCESKITFID ------------------------------1111--------2222-------------- GDEGILLHRGFPIDQLATDSNYLEVCYILLNGEKPTQEQYDEFKTTVTLHTMIHEQITRL 1111---iiii-----------------------------------1111---3333-33 FHAFRRDSHPMAVMCGITGALAAFYHDSLDVNNPRHREIAAFRLLSKMPTMAAMCYKYSI 33--1111-----------3333-1111-33333333----------------------- GQPFVYPRNDLSYAGNFLNMMFSTPCEPYEVNPILERAMDRILILHADHEQNASTSTVRT -------1111------------3333--------------------------------- AGSSGANPFACIAAGIASLWGPAHGGANEAALKMLEEISSVKHIPEFFRRAKDKNDSFRL -1111-3333---------3333--3333-------------3333-3333-----3333 MGFGHRVYKNYDPRATVMRETCHEVLKELGTKDDLLEVAMELENIALNDPYFIEKKLYPN ----3333---1111---------------------3333------------1111---1 VDFYSGIILKAMGIPSSMFTVIFAMARTVGWIAHWSEMHSDGMKIARPRQLYTGYEKRDF 111------1111-3333------------------------------------------ KSDIKR ------ >INTEGRATION HOST FACTOR A; SWP:P06984; PDB:1OWFA; ALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRP -----------------3333---------------1111----2222------------ GRNPKTGEDIPITARRVVTFRPGQKLKSRVENASPK -----------------------------1111--- >INTEGRATION HOST FACTOR A; SWP:P08756; PDB:1OWFB; MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIAIRGFGSFSLHYRAPRT ------------------------------------1111----2222------------ GRNPKTGDKVELEGKYVPHFKPGKELRDRANIYG -----------------------3333------- >DEOXYRIBODIPYRIMIDINE PHO; SWP:P05327; PDB:1OWLA; 
APILFWHRRDLRLSDNIGLAAARAQSAQLIGLFCLDPQILQSADMAPARVAYLQGCLQEL -----------------------------------3333--1111--------------- QQRYQQAGSRLLLLQGDPQHLIPQLAQQLQAEAVYWNQDIEPYGRDRDGQVAAALKTAGI ----------------3333------1111-------------------------1111- RAVQLWDQLLHSPDQILSGSGNPYSVYGPFWKNWQAQPKPTPVATPTELVDLSPEQLTAI -----------1111--1111-----------3333-----------------------3 APLLLSELPTLKQLGFDWDGGFPVEPGETAAIARLQEFCDRAIADYDPQRNFPAEAGTSG 333------3333---------------------------3333-------3333----- LSPALKFGAIGIRQAWQAASAAHALSRSDEARNSIRVWQQELAWREFYQHALYHFPSLAD -------------------------------------------------------3333- GPYRSLWQQFPWENREALFTAWTQAQTGYPIVDAAMRQLTETGWMHNRCRMIVASFLTKD ----3333------3333---1111---3333---------------------------- LIIDWRRGEQFFMQHLVDGDLAANNGGWQWSASSGMDPKPLRIFNPASQAKKFDATATYI ---------------1111----------------------------------1111--- KRWLPELRHVHPKDLISGEITPIERRGYPAPIVNHNLRQKQFKALYNQLKAAI ---1111---3333------3333!!!!------------------------- >SIGNAL PROCESSING PROTEIN; SWP:P30922; PDB:1OWQA; YKLICYYTSWSQYREGDGSCFPDAIDPFLCTHVIYSFANISNNEIDTWEWNDVTLYDTLN -------1111---!!!!--3333-1111-----------%%%%----1111-------- TLKNRNPNLKTLLSVGGWNFGSERFSKIASKTQSRRTFIKSVPPFLRTHGFDGLDLAWLY -----1111-------11113333------------------------------------ PGWRDKRHLTTLVKEMKAEFVREAQAGTEQLLLSAAVPAGKIAIDRGYDIAQISRHLDFI -3333-------------------------------------------33331111---- SLLTYDFHGGWRGTVGHHSPLFRGNSDGSSRFSNADYAVSYMLRLGAPANKLVMGIPTFG -----------------------------1111--------------1111--------- RSYTLASSKTDVGAPISGPGIPGQFTKEKGTLAYYEICDFLHGATTHRFRDQQVPYATKG ----------2222-------------2222-------3333-------3333-----!! 
NQWVAYDDQESVKNKARYLKNRQLAGAMVWALDLDDFRGTFCGQNLTFPLTSAIKDVLAR !!-----------------1111-------1111-3333--------------------- V - >Phospholipase A2 isoform ; SWP:Q5G290; PDB:1OWSB; NIKQFNNMIECTVPARSWWDFADYGCYCGSGSGSPTDDLDRCCQTHDNCYGAGGGSTGCA ----------------3333---!!!!---------3333---------------2222- PKSRTYTYQCSQGTLTCSGENSACAATTCDCDRLAAICFAGAPYNDTNYNIDLKSRCQ 1111-------------11113333-------------------3333---3333--- >FIBRONECTIN FIRST TYPE II; SWP:P02751; PDB:1OWWA; SGPVEVFITETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIPGHLNSYTI -------------1111----------------------------------3333----- KGLKPGVVYEGQLISIQQYGHQEVTRFDFTTTS --------------------------------- >LUPUS LA PROTEIN; SWP:P05455; PDB:1OWXA; HHGSLEEKIGCLLKFSGDLDDQTCREDLHILFSNHGEIKWIDFVRGAKEGIILFKEKAKE -----------------------3333-3333-----------2222------------- ALGKAKDANNGNLQLRNKEVTWEVLEGEVEKEALKKIIEDQQESLNKWKSKGR -------------------------3333------------------------ >BETA KETOACYL-ACYL CARRIE; SWP:NA; PDB:1OX0A; MKLNRVVVTGYGVTSPIGNTPEEFWNSLATGKIGIGGITKFDHSDFDVHNAAEIQDFPFD --------------1111------------------------1111-----------333 KYFVKKDTNRFDNYSLYALYAAQEAVNHANLDVEALNRDRFGVIVASGIGGIKEIEDQVL 3--3333----3333----------------1111-3333-------------------- RLHEKGPKRVKPMTLPKALPNMASGNVAMRFGANGVCKSINTACSSSNDAIGDAFRSIKF -----3333-111133331111-------------------!!!!--------------- GFQDVMLVGGTEASITPFAIAGFQALTALSTTEDPTRASIPFDKDRNGFVMGEGSGMLVL -----------------------1111------1111--2222----------------- ESLEHAEKRGATILAEVVGYGNTCDAYHMTSPHPEGQGAIKAIKLALEEAEISPEQVAYV --------------------------------1111----------------3333---- NAHGTSTPANEKGESGAIVAVLGKEVPVSSTKSFTGHLLGAAGAVEAIVTIEAMRHNFVP ------------------------------3333---!!!!------------------- MTAGTSEVSDYIEANVVYGQGLEKEIPYAISNTFGFGGHNAVLAFKRWE --------1111----------------------2222----------- >STRINGENT STARVATION PROT; SWP:P25663; PDB:1OX8A; QLTPRRPYLLRAFYEWLLDNQLTPHLVVDVTLPGVQVPEYARDGQIVLNIAPRAVGNLEL -----------------1111----------2222------iiii--------------- 
ANDEVRFNARFGGIPRQVSVPLAAVLAIYARENGAGTFEPEAAYD 3333------iiii------3333-----------------1111 >RNA-BINDING PROTEIN SMAUG; SWP:Q23972; PDB:1OXJA; HMVGMSGIGLWLKSLRLHKYIELFKNMTYEEMLLITEDFLQSVGVTKGASHKLALCIDKL -2222-------111133333333---33331111-----1111---------------- KERANILNRVEQELLSGQMELSTAVEELTNIVLTPMKPLESPGPPEENIGLRFLKVIDIV --------------------------33331111---1111--3333------------- TNTLQQDPYAVQDDETLGVLMWILDRSIHNEAFMNHASQLKDLKFKLSKM ------1111---------------3333---1111----------1111 >Osmosensing histidine pro; SWP:P39928; PDB:1OXKB; SVKILVVEDNHVNQEVIKRMLNLEGIENIELACDGQEAFDKVKELTSKGENYNMIFMDVQ ---------------------1111--------------------1111----------- MPKVDGLLSTKMIRRDLGYTSPIVALTAFADDSNIKECLESGMNGFLSKPIKRPKLKTIL --------------------------------------1111------------------ TEFCAAYQ ---2222- >BACULOVIRAL IAP REPEAT-CO; SWP:Q96CA5; PDB:1OXNA; AGATLSRGPAFPGMGSEELRLASFYDWPLTAEVPPELLAAAGFFHTGHQDKVRCFFCYGG ----------3333-------1111--3333--3333----------------------- LQSWKRGDDPWTEHAKWFPSCQFLLRSKGRDFVHSVQET -----------------1111------------------ >PATATIN; SWP:Q8LPW4; PDB:1OXWA; LGEVTVLSIDGGGIRGIIPATILEFLEGQLQEDNNADARLADYFDVIGGTSTGGLLTAIS -----------!!!!-------------------11113333------------------ TPNENNRPFAAAKEIVPFYFEHGPQIFNPSGQILGPKYDGKYLQVLQEKLGETRVHQALT --1111----3333-------------------------3333------!!!!3333--- EVVISSFDIKTNKPVIFTKSNLANSPELDAKYDISYSTAAAPTYFPPHYFVTNTSNGDEY ---------------------33331111-----------2222---------1111--- EFNLVDGAVATVADPALLSISVATRLAQKDPAFASIRSLNYKKLLLSLGTGTTSEFDKTY -------------------------11113333------3333----------1111--- TAKEAATWTAVHWLVIQKTDAASSYTDYYLSTAFQALDSKNNYLRVQENALTGTTTEDDA 333311113333--3333----------------11111111---------!!!!----- SEANELLVQVGENLLKKPVSEDNPETYEEALKRFAKLLSDRKKLRANKA 3333--------3333--------------------------------- >ABC transporter, ATP bind; SWP:Q97UY8; PDB:1OXXK; MVRIIVKNVSKVFKKGKVVALDNVNINIENGERFGILGPSGAGKTTFMRIIAGLDVPSTG ------------%%%%------------2222---------------------------- 
ELYFDDRLVASNGKLIVPPEDRKIGMVFQTWALYPNLTAFENIAFPLTNMKMSKEEIRKR ---!!!!---iiii---3333------1111--1111------3333------------- VEEVAKILDIHHVLNHFPRELSGAQQQRVALARALVKDPSLLLLDEPFSNLDARMRDSAR ---------1111---3333--------------1111-------1111--3333----- ALVKEVQSRLGVTLLVVSHDPADIFAIADRVGVLVKGKLVQVGKPEDLYDNPVSIQVASL ----------------------------------iiii---------------------- IGEINELEGKVTNEGVVIGSLRFPVSVSSDRAIIGIRPEDVKLSKDVIKDDSWILVGKGK -----------1111--!!!!---------------3333---------1111------- VKVIGYQGGLFRITITPLDSEEEIFTYSDHPIHSGEEVLVYVRKDKIKVFEK ------iiii----------------------2222------1111------ >KETOPANTOATE HYDROXYMETHY; SWP:Q10505; PDB:1OY0A; RTKIRTHHLQRWKADGHKWAMLTAYDYSTARIFDEAGIPVLLVGDSAANVVYGYDTTVPI ----3333---------------------------------------------------- SIDELIPLVRGVVRGAPHALVVADLPFGSYEAGPTAALAAATRFLKDGGAHAVKLEGGER 3333-----------1111------22223333------------------------333 VAEQIACLTAAGIPVMAHIGFTPGDAAEQTIADAIAVAEAGAFAVVMEMVPAELATQITG 3-------1111------------------------------------------------ KLTIPTVGIGAGPNCDGQVLVWQDMAGFSGAKTARFVKRYADVGGELRRAAMQYAQEVAG --------------------3333---------1111--------------------111 GVFPADEH 1------- >NF-kappa-B inhibitor beta; SWP:Q60778; PDB:1OY3D; VFGYVTEDGDTALHLAVIHQHEPFLDFLLGFSAGHEYLDLQNDLGQTALHLAAILGEAST -----1111-3333-------------3333---3333---1111--------------- VEKLYAAGAGVLVAERGGHTALHLACRVRAHTCACVLLQPRPSHPRDADEDWRLQLEAEN --------------1111----------------------------------3333---1 YDGHTPLHVAVIHKDAEMVRLLRDAGADLNKPEPTCGRTPLHLAVEAQAASVLELLLKAG 111-------1111-------------1111----------------------------- ADPTARMYGGRTPLGSALLRPNPILARLLRAHGAPEPEDG ------3333-----------3333----1111------- >TRNA (GUANINE-N(1)-)-METH; SWP:O67463; PDB:1OY5A; NPLRFFVLTIFPHIISCYSEYGIVKQAIKKGKVEVYPIDLREFAPKGQVDDVPYGGLPGM ----------3333------------------------3333------------------ VLKPEPIYEAYDYVVENYGKPFVLITEPWGEKLNQKLVNELSKKERIMIICGRYEGVDER --------------------------1111----------1111--------!!!!-333 
VKKIVDMEISLGDFILSGGEIVALAVIDAVSRVLPGVLSEPYPVYTRPREYRGMKVPEEL 31111----------------------------2222-------------iiii--3333 LSGHHKLIELWKLWHRIENTVKKRPDLIPKDLTELEKD --------------11111111-----1111--1111- >OLD YELLOW ENZYME; SWP:Q02899; PDB:1OYC; SFVKDFKPQALGDTNLFKPIKIGNNELLHRAVIPPLTRMRALHPGNIPNRDWAVEYYTQR ----------1111-------!!!!-----------------------1111-------- AQRPGTMIITEGAFISPQAGGYDNAPGVWSEEQMVEWTKIFNAIHEKKSFVWVQLWVLGW ---------------3333--1111--------------------------------!!! AAFPDNLARDGLRYDSASDNVFMDAEQEAKAKKANNPQHSLTKDEIKQYIKEYVQAAKNS !------1111------------------------------------------------- IAAGADGVEIHSANGYLLNQFLDPHSNTRTDEYGGSIENRARFTLEVVDALVEAIGHEKV 1111--------iiii-------1111--------3333----------------1111- GLRLSPYGVFNSMSGGAETGIVAQYAYVAGELEKRAKAGKRLAFVHLVEPRVTNPFLTEG ----1111-%%%%-1111-----------------1111---------3333-1111222 EGEYEGGSNDFVYSIWKGPVIRAGNFALHPEVVREEVKDKRTLIGYGRFFISNPDLVDRL 2-------3333-------------1111-----1111--------1111---------- EKGLPLNKYDRDTFYQMSAHGYIDYPTYEEALKLGWDKK ---------1111-----2222---------3333---- >LEVANSUCRASE; SWP:P05655; PDB:1OYGA; QKPYKETYGISHITRHDMLQIPEQQKNEKYQVPEFDSSTIKNISSAKGLDVWDSWPLQNA ------%%%%-----------------1111----1111---1111------------11 DGTVANYHGYHIVFALAGDPKNADDTSIYMFYQKVGETSIDSWKNAGRVFKDSDKFDAND 11----iiii--------1111----------------3333--------3333-----3 SILKDQTQEWSGSATFTSDGKIRLFYTDFSGKHYGKQTLTTAQVNVSASDSSLNINGVED 333-------------1111------------%%%%------------------------ YKSIFDGDGKTYQNVQQFIDEGNYSSGDNHTLRDPHYVEDKGHKYLVFEANTGTEDGYQG -------------3333-11111111-------------iiii---------1111---- EESLFNKAYYGKSTSFFRQESQKLLQSDKKRTAELANGALGMIELNDDYTLKKVMKPLIA -----3333------------------------------------1111----------- SNTVTDEIERANVFKMNGKWYLFTDSRGSKMTIDGITSNDIYMLGYVSNSLTGPYKPLNK 2222-----------iiii-------3333--22223333---------1111------- TGLVLKMDLDPNDVTFTYSHFAVPQAKGNNVVITSYMTNRGFYADKQSTFAPSFLLNIKG ---------1111-------------------------22221111------------!! 
KKTSVVKDSILEQGQLTVNK !!------------------ ------------------------------------------------------------ -- >GLUTATHIONE S-TRANSFERASE; SWP:O65032; PDB:1OYJA; AEEKELVLLDFWVSPFGQRCRIAMAEKGLEFEYREEDLGNKSDLLLRSNPVHRKIPVLLH ----------11113333------1111--------1111-------------------i AGRPVSESLVILQYLDDAFPGTPHLLPPANSGADAAYARATARFWADYVDRKLYDCGSRL iii------------1111-------------------------------------3333 WRLKGEPQAAAGREMAEILRTLEAELGDREFFGGGGGGRLGFVDVALVPFTAWFYSYERC -------------------------!!!!-------------------3333-------- GGFSVEEVAPRLAAWARRCGRIDSVVKHLPSPEKVYDFVGVLKKKYGV ---3333----------------------------------------- >RIBONUCLEASE PH; SWP:P28619; PDB:1OYSA; MRHDGRQHDELRPITFDLDEGSVLITAGNTKVICNASVEDRVPPFLRGGGKGWITAEYSM -3333-1111----------------!!!!------------1111-------------- GRTMEIQRLIGRALRAVVDLEKLGERTIWIDCDVIQADGGTRTASITGAFLAMAIAIGKL --------------11113333-------------------------------------- IKAGTIKTNPITDFLAAISVGIDKEQGILLDLNYEEDSSAEVDMNVIMTGSGRFVELQGT 1111--------------------------------------------1111-------- GEEATFSREDLNGLLGLAEKGIQELIDKQKEVL --------------------------------- >Wound-induced proteinase ; SWP:P05119; PDB:1OYVI; KACTRECGNLGFGICPRSEGSPLNPICINCCSGYKGCNYYNSFGKFICEGESDPKRPNAC --------------------1111----3333-2222---1111---------------- TFNCDPNIAYSRCGCTTCCTGYKGCYYFGKDGKFVCEGESDEPK ----1111--------3333------------------------ >ATP-DEPENDENT DNA HELICAS; SWP:P15043; PDB:1OYWA; AQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVPTGGGKSLCYQIPAL ------3333---------------2222------1111-----------------3333 LLNGLTVVVSPLISLKDQVDQLQANGVAAACLNSTQTREQQLEVTGCRTGQIRLLYIAPE ----------------------1111------3333----------1111-------333 RLLDNFLEHLAHWNPVLLAVDEAHCISQWGHDFRPEYAALGQLRQRFPTLPFALTATADD 3-%%%%---1111-------------1111---3333------3333------------- TTRQDIVRLLGLNDPLIQISSFDRPNIRYLEKFKPLDQLRYVQEQRGKSGIIYCNSRAKV -----------------------1111---------------3333-------------- EDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGGINKPNVRFVVH 
-------1111------1111----------------------3333------------- FDIPRNIESYYQETGRAGRDGLPAEALFYDPADAWLRRCLEEKPQGQLQDIERHKLNAGA -------------33331111--------3333------1111----------------- FAEAQTCRRLVLLNYFGEGRQEPCGNCDICLDPPKQYDGSTDAQIALSTIGRVNQRFGGY 1111---------1111---------3333------------------------------ VVEVIRGANNQRIRDYGHDKLKVYGGRDKSHEHWVSVIRQLIHLGLVTQNIAQHSALQLT ----------------33333333-1111------------1111----1111------3 EAARPVLRGESSLQLAVPRIV 333------------------ >HYPOTHETICAL PROTEIN YIBA; SWP:P24172; PDB:1OYZA; YQKRKASKEYGLYNQCKKLNDDELFRLLDDHNSLKRISSARVLQLRGGQDAVRLAIEFCS ------1111----------------1111---------------------------111 DKNYIRRDIGAFILGQIKICKKCEDNVFNILNNALNDKSACVRATAIESTAQRCKKNPIY 1------------------1111----------------------------------111 SPKIVEQSQITAFDKSTNVRRATAFAISVIATIPLLINLLKDPNGDVRNWAAFAININKY 1--------3333------------------------3333------------------- DNSDIRDCFVELQDKNEEVRIEAIIGLSYRKDKRVLSVLCDELKKNTVYDDIIEAAGELG -------------------------------3333------1111---3333-------- DKTLLPVLDTLYKFDDNEIITSAIDKLKRS 3333-------------------------- >LETHAL(3)MALIGNANT BRAIN ; SWP:Q9Y468; PDB:1OZ2A; ECWSWESYLEEQKAITAPVSLFQDSQAVTHNKNGFKLGMKLEGIDPQHPSMYFILTVAEV -----------------3333-3333---------2222-----1111------------ CGYRLRLHFDGYSECHDFWVNANSPDIHPAGWFEKTGHKLQPPKGYKEEEFSWSQYLRST !!!!----22223333----1111----2222----------22221111---------- RAQAAPKHLFVSQSHSPPPLGFQVGMKLEAVDRMNPSLVCVASVTDVVDSRFLVHFDNWD -----1111---------22222222-----1111------------!!!!----22221 DTYDYWCDPSSPYIHPVGWCQKQGKPLTPPQDYPDPDNFCWEKYLEETGASAVPTWAFKV 111----1111----2222-1111-----2222-3333---------------3333--- RPPHSFLVNMKLEAVDRRNPALIRVASVEDVEDHRIKIHFDGWSHGYDFWIDADHPDIHP ------2222-----1111--------------------22223333----1111----2 AGWCSKTGHPLQPPLGPREPSSAS 222------------2222----- >PHOSPHOLIPASE A2; SWP:Q7T3S7; PDB:1OZ6A; NLYQFGRMIWNRTGKLPILSYGSYGCYCGWGGQGPPKDATDRCCLVHDCCYTRVGDCSPK ----------------1111-----------------3333----------------111 
MTLYSYRFENGDIICDNKDPCKRAVCECDREAAICLGENVNTYDKKYKSYEDCTEEVQEC 1-------------------------------------3333--1111------------ >ECHICETIN A-CHAIN; SWP:Q7T248; PDB:1OZ7A; MCPPGWSSNGVYCYMLFKEPKTWDEAEKFCNKQGKDGHLLSIESKKEEILVDIVVSENIG --2222--%%%%---------------------2222----------------------- KMYKIWTGLSERSKEQHCSSRWSDGSFFRSYEIAIRYSECFVLEKQSVFRTWVATPCENT -------------3333--------------------------3333--------3333- FPFMCKYPVPR -------1111 >ECHICETIN A-CHAIN; SWP:NA; PDB:1OZ7B; NCLPDWSVYEGYCYKVFKERMNWADAEKFCTKQHKDGHLVSFRNSKEVDFVISLAFPMLK --2222--%%%%-----------------33332222------------------1111- NDLVWIGLTDYWRDCNWEWSDGAQLDYKAWDNERHCFIYKNTDNQWTRRDCTWTFSFVCK ------------3333--1111------------------1111-----1111------- CPA --- >HYPOTHETICAL PROTEIN AQ_1; SWP:O67367; PDB:1OZ9A; KNRVLVKLKKRKVRKDKIEKWAELALSALGLNNVELSVYITDDQEIRELNKTYRKKDKPT --------------------------1111------------------------------ DVLSFPMGEEFGGYKILGDVVISQDTAERQARELGHSLEEEVKRLIVHGIVHLLGYDHEK ----------iiii-------------------------------------1111--333 GGEEEKKFRELENYVLSKLSK 3-------------------- >ACETOLACTATE SYNTHASE, CA; SWP:P27696; PDB:1OZHA; VRQWAHGADLVVSQLEAQGVRQVFGIPGAKIDKVFDSLLDSSIRIIPVRHEANAAFMAAA -----3333------1111--------3333-----3333---------3333------- VGRITGKAGVALVTSGPGCSNLITGMATANSEGDPVVALGGAVKRADKSMDTVAMFSPVT --------------!!!!-------------------------3333--------3333- KYAIEVTAPDALAEVVSNAFRAAEQGRPGSAFVSLPQDVVDGPVSGKVLPASGAPQMGAA -------1111------------------------3333--------------------- PDDAIDQVAKLIAQAKNPIFLLGLMASQPENSKALRRLLETSHIPVTSTYQAAGAVNQDN ---------------------------3333-----------------3333----1111 FSRFAGRVGLFNNQAGDRLLQLADLVICIGYSPVEYEPAMWNSGNATLVHIDVLPAYEER 1111---------------------------3333-1111------------------11 NYTPDVELVGDIAGTLNKLAQNIDHRLVLSPQAAEILRDRQHQRELLDRGAQLNQFALHP 11-----------------1111-----------------------------------33 LRIVRAMQDIVNSDVTLTVDMGSFHIWIARYLYTFRARQVMISNGQQTMGVALPWAIGAW 33-------------------3333-----3333---------1111------------- 
LVNPERKVVSVSGDGGFLQSSMELETAVRLKANVLHLIWVDNGYNMVAIQEEKKYQRLSG --1111--------------------------------------3333------------ VEFGPMDFKAYAESFGAKGFAVESAEALEPTLRAAMDVDGPAVVAIPVDYRDNPLLMGQL ------------1111-------3333----------------------1111------3 HLSQI 333-- >SMAD 3; SWP:Q92940; PDB:1OZJA; FTPPIVKRLLGWKKGEQNGQEEKWCEKAVKSLVKKLKKTGQLDELEKAITTQNVNTKCIT --3333--3333------------------------1111-3333-------1111---- IPRSLDGRLQVSHRKGLPHVIYCRLWRWPDLHSHHELRAMELCEFAFNMKKDEVCVNPYH ---1111--------------------3333-3333---------3333-------1111 YQRVET ------ >RETICULON 4 RECEPTOR; SWP:Q9BZR6; PDB:1OZNA; PCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRIFLHGNRISHVPAASFRACRNLTI --2222------------------------1111-----------------3333----- LWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPATFHGLGRLHTLHLDRCGLQEL -----------11112222----------1111---11112222---------------- GPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLHGNRISSVPERAFRGLHSLD 22222222----------------22221111----------------11112222---- RLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAPLRALQYLRLNDNPWVCD ------------11111111----------------33331111---------------3 CRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGC 333---------------------3333---3333-3333---- >PHOSPHOLIPASE A2; SWP:NA; PDB:1OZYA; NLLQFRKMIKCTIPGIEPLLAFSNYGCYCGKGGSGTPVDELDRCCQTHDYCYDKAKIHPE ---------1111---------------------------3333-------------333 CRGILSGPSFNTYAYDCTDGKLTCNDQKDKCKLFICNCDRTAAMCFAKAPYKEENNRIDA 3-----3333------------------------------------------1111---- S - >DEFENSIN ARD1; SWP:P84156; PDB:1OZZA; DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFANVNCWCET ---------------------------------2222------- >NADP-DEPENDENT ALCOHOL DE; SWP:O57380; PDB:1P0FA; CTAGKDITCKAAVAWEPHKPLSLETITVAPPKAHEVRIKILASGICGSDSSVLKEIIPSK -2222----------------------------------------3333--1111----- FPVILGHEAVGVVESIGAGVTCVKPGDKVIPLFVPQCGSCRACKSSNSNFCEKNDMGAKT ----------------2222---2222------------3333-1111--1111------ GLMADMTSRFTCRGKPIYNLMGTSTFTEYTVVADIAVAKIDPKAPLESCLIGCGFATGYG 
--1111-----iiii----%%%%---------1111----11113333------------ AAVNTAKVTPGSTCAVFGLGGVGFSAIVGCKAAGASRIIGVGTHKDKFPKAIELGATECL --------2222-------------------------------3333----1111----- NPKDYDKPIYEVICEKTNGGVDYAVECAGRIETMMNALQSTYCGSGVTVVLGLASPNERL 3333---3333--------------------------1111-------------1111-- PLDPLLLLTGRSLKGSVFGGFKGEEVSRLVDDYMKKKINVNFLVSTKLTLDQINKAFELL --33333333------%%%%-1111-------1111--3333------3333------11 SSGQGVRSIMIY 11---------- >HYPOTHETICAL PROTEIN RV08; SWP:O53831; PDB:1P0HA; ALDWRSALTADEQRSVRALVTATTAVDGVAPVGEQVLRELGQQRTEHLLVAGSRPGGPII ------------------------------------1111-------------2222--- GYLNLSPPGGAMAELVVHPQSRRRGIGTAMARAALAKTAGRNQFWAHGTLDPARATASAL -----------------1111----------------iiii----2222--------111 GLVGVRELIQMRRPLRDIPEPTIPDGVVIRTYAGTSDDAELLRVNNAAFAGHPEQGGWTA 1--------------------------------3333-----------2222-------- VQLAERRGEAWFDPDGLILAFGDGRLLGFHWTKVHPDHPGLGEVYVLGVDPAAQRRGLGQ --------11111111------------------1111-----------1111------- MLTSIGIVSLARRLVEPAVLLYVESDNVAAVRTYQSLGFTTYSVDTAYAL -----------------------1111-------1111------------ >CHOLINESTERASE; SWP:P06276; PDB:1P0IA; IIIATKNGKVRGMQLTVFGGTVTAFLGIPYAQPPLGRLRFKKPQSLTKWSDIWNATKYAN ----1111--------%%%%--------------!!!!---------------------- SCCQNIDQSFPGFHGSEMWNPNTDLSEDCLYLNVWIPAPKPKNATVLIWIYGGGFQTGTS ---------2222---1111--------------------------------%%%%--11 SLHVYDGKFLARVERVIVVSMNYRVGALGFLALPGNPEAPGNMGLFDQQLALQWVQKNIA 111111--------------------1111--2222---------------------333 AFGGNPKSVTLFGESAGAASVSLHLLSPGSHSLFTRAILQSGSFNAPWAVTSLYEARNRT 3---1111------------------3333------------1111-------------- LNLAKLTGCSRENETEIIKCLRNKDPQEILLNEAFVVPYGTPLSVNFGPTVDGDFLTDMP ----1111----3333----11113333----1111----1111--------------33 DILLELGQFKKTQILVGVNKDEGTAFLVYGAPGFSKDNNSIITRKEFQEGLKIFFPGVSE 33-1111-------------3333-1111-2222--------------------1111-- FGKESILFHYTDWVQRPENYREALGDVVGDYNFICPALEFTKKFSEWGNNAFFYYFEHRS ---------------1111-------------------------1111-----------1 
SKLPWPEWMGVMHGYEIEFVFGLPLERRDYTKAEEILSRSIVKRWANFAKYGNPQETQNQ 111--3333--2222------11113333------------------------------- STSWPVFKSTEQKYLTLNTESTRIMTKLRAQQCRFWTSFFPKV --------------------------------------3333- >ISOPENTENYL-DIPHOSPHATE D; SWP:P50740; PDB:1P0KA; RETGLDDITFVHVSLPDLALEQVDISTKIGELSSSSPIFINAMTGGGGKLTYEINKSLAR ---3333-----------1111------!!!!---------------------------- AASQAGIPLAVGSQMSALKDPSERLSYEIVRKENPNGLIFANLGSEATAAQAKEAVEMIG --------------3333-3333--------------------1111------------- ANALQIHLNVIQEIFSGALKRIEQICSRVSVPVIVKEVGFGMSKASAGKLYEAGAAAVDI --------3333--2222------------------------------------------ GGRQISFFNSWGISTAASLAEIRSEFPASTMIASGGLQDALDVAKAIALGASCTGMAGHF --3333-------------------1111-----------------1111---------- LKALTDSGEEGLLEEIQLILEELKLIMTVLGARTIADLQKAPLVIKGETHHWLTERGVNT ---------------------------------3333----------------------3 SSYSVR 333--- >UBIQUITIN-LIKE 5; SWP:Q9BZL1; PDB:1P0RA; MIEVVCNDRLGKKVRVKCNTDDTIGDLKKLIAAQTGTRWNKIVLKKWYTIFKDHVSLGDY -------3333-------3333-----------------------------------111 EIHDGMNLELYYQ 1------------ >SENSOR KINASE CITA; SWP:P52687; PDB:1P0ZA; EERLHYQVGQRALIQAMQISAMPELVEAVQKRDLARIKALIDPMRSFSDATYITVGDASG --------------------------------------------------------1111 QRLYHVNPDEIGKSMEGGDSDEALINAKSYVSVRKGSLGSSLRGKSPIQDATGKVIGIVS ------3333-------------------------1111----------1111------- VGYTIEQLEHH ---3333---- >PROTEIN-TYROSINE PHOSPHAT; SWP:P18052; PDB:1P15A; MRTGNLPANMKKNRVLQIIPYEFNRVIIPVKRGEENTDYVNASFIDGYRQKDSYIASQGP -333333331111-3333--3333----------------------2222---------- LLHTIEDFWRMIWEWKSCSIVMLTELEERGQEKCAQYWPSDGLVSYGDITVELKKEEECE ---3333-----1111-----------------------------!!!!----------- SYTVRDLLVTNTRENKSRQIRQFHFHGWPEVGIPSDGKGMINIIAAVQKQQQQSGNHPIT --------------------------------------3333------------------ VHCSAGAGRTGTFCALSTVLERVKAEGILDVFQTVKSLRLQRPHMVQTLEQYEFCYKVVQ ---------------------------------------------------3333----3 EYIDA 333-- >MRNA CAPPING ENZYME ALPHA; SWP:P78587; PDB:1P16A; 
VQLEEREIPVIPGNKLDEEETKELRLVAELLGRRNTGFPGSQPVSFERRHLEETLQKDYF ----------------------------1111--------------3333---------- VCEKTDGLRCLLFLINDPDKGEGVFLVTRENDYYFIPNIHFPLSVNETREKPTYHHGTLL ---------------------------1111----------------------------- DGELVLENRNVSEPVLRYVIFDALAIHGKCIIDRPLPKRLGYITENVKPFDNFKKHNPDI -------------------------iiii-3333-------------------------1 VNSPEFPFKVGFKTLTSYHADDVLSKDKLFHASDGLIYTCAETPYVFGTDQTLLKWKPAE 111------------11113333--------------------------1111----333 ENTVDFQLEFVFNEVQDPDLDERDPTSTYLDYDAKPNLIKLRVWQGSNVHTDFAKLDLSD 3---------------11111111------------------------------------ DDWERLKALEQPLQGRIAECRQSTTKKGYWELRFRNDKSNGNHISVVEKILVSIKDGVKE ----------------------------------------------------------33 KEVIEWCPKISRAWKKRENDRRQ 33--------------------- >GLUTAMATE RECEPTOR INTERA; SWP:P97879; PDB:1P1DA; QVVHTETTEVVLTADPVTGFGIQLQGSVFATETLSSPPLISYIEADSPAERCGVLQIGDR -----------------------------------------------3333--------- VMAINGIPTEDSTFEEANQLLRDSSITSKVTLEIEFDVAESVIPSSGTFHVKLPKKHSVE ---iiii-----3333--------1111-------------------------------- LGITISSPSSRKPGDPLVISDIKKGSVAHRTGTLELGDKLLAIDNIRLDSCSMEDAVQIL ---------------------------3333----------------33333333----- QQCEDLVKLKIRKDED ---------------- >INOSITOL-3-PHOSPHATE SYNT; SWP:P11986; PDB:1P1JA; ITSVKVVTDKCTYKDNELLTKYSYENAVVTKTASGRFDVTPTVQDYVFKLDLKKPEKLGI -------------------------------1111------------------------- MLIGLGGNNGSTLVASVLANKHNVEFQTKEGVKQPNYFGSMTQCSTLKLGIDAEGNDVYA ---1111--------------------1111-----2222-----------1111----- PFNSLLPMVSPNDFVVSGWDINNADLYEAMQRSQVLEYDLQQRLKAKMSLVKPLPSIYYP 1111-----3333--------------------------------3333---------33 DFIAANQDERANNCINLDEKGNVTTRGKWTHLQRIRRDIQNFKEENALDKVIVLWTANTE 33-11113333------1111----------------------1111------------- RYVEVSPGVNDTMENLLQSIKNDHEEIAPSTIFAAASILEGVPYINGSPQNTFVPGLVQL -----2222----------11111111----------1111------------3333--- AEHEGTFIAGDDLKSGQTKLKSVLAQFLVDAGIKPVSIASYNHLGNNDGYNLSAPKQFRS 
----------------------------1111---------------------------- KEISKSSVIDDIIASNDILYNDKLGKKVDHCIVIKYMKPVGDSKVAMDEYYSELMLGGHN ------1111--------------------------3333--------------iiii-- RISIHNVCEDSLLATPLIIDLLVMTEFCTRVSYKKVDPVKEDAGKFENFYPVLTFLSYWL --------3333---------------1111-----1111------------33331111 KAPLTRPGFHPVNGLNKQRTALENFLRLLIGLPSQNELRFEERLL -----2222------------------1111-------3333--- >Periplasmic divalent cati; SWP:O28301; PDB:1P1LA; MHNFIYITAPSLEEAERIAKRLLEKKLAACVNIFPIKSFFWWEGKIEAATEFAMIVKTRS ----------------------1111---------------%%%%-------------11 EKFAEVRDEVKAMHSYTTPCICAIPIERGLKEFLDWIDETVE 11---------------------------------------- >HYPOTHETICAL PROTEIN TM09; SWP:Q9X034; PDB:1P1MA; MIIGNCLILKDFSSEPFWGAVEIENGTIKRVLQGEVKVDLDLSGKLVMPALFNTHTHAPM ----------1111---------iiii--------------2222-----------3333 TLLRGVAEDLSFEEWLFSKVLPIEDRLTEKMAYYGTILAQMEMARHGIAGFVDMYFHEEW 1111------3333-------3333----------------------------------- IAKAVRDFGMRALLTRGLVDSNGDDGGRLEENLKLYNEWNGFEGRIFVGFGPHSPYLCSE --------------------iiiiiiii----------2222-----------1111--- EYLKRVFDTAKSLNAPVTIHLYETSKEEYDLEDILNIGLKEVKTIAAHCVHLPERYFGVL -----------------------3333--333311113333-------111133333333 KDIPFFVSHNPASNLKLGNGIAPVQRMIEHGMKVTLGTDGAASNNSLNLFFEMRLASLLQ --------------1111---------1111--------1111----------------3 KAQNPRNLDVNTCLKMVTYDGAQAMGFKSGKIEEGWNADLVVIDLDLPEMFPVQNIKNHL 3331111-------------------------2222----------1111-11113333- VHAFSGEVFATMVAGKWIYFDGEYPTIDSEEVKRELARIEKELY ------------iiii---iiii1111----------------- >CLEAVAGE STIMULATION FACT; SWP:P33240; PDB:1P1TA; DPAVDRSLRSVFVGNIPYEATEEQLKDIFSEVGPVVSFRLVYDRETGKPKGYGFCEYQDQ ---3333---------11113333-----1111----------1111------------- ETALSAMRNLNGREFSGRALRVDNAASEKNKEELKSLGTGAPVI -----------------------11113333--3333------- >DEOXYRIBOSE-PHOSPHATE ALD; SWP:P00882; PDB:1P1XA; MTDLKASSLRALKLMDLTTLNDDDTDEKVIALCHQAKTPVGNTAAICIYPRFIPIARKTL --------------------1111-------------1111-------3333-------- 
KEQGTPEIRIATVTNFPHGNDDIDIALAETRAAIAYGADEVDVVFPYRALMAGNEQVGFD 11111111-----------------------------------------1111------- LVKACKEACAAANVLLKVIIETGELKDEALIRKASEISIKAGADFIKTSTGKVAVNATPE ---------1111-------3333--------------1111------------------ SARIMMEVIRDMGVEKTVGFKPAGGVRTAEDAQKYLAIADELFGADWADARHYRFGASSL -------------------------------------------1111-3333-------- LASLLKALGH -----1111- >F-BOX/WD-REPEAT PROTEIN 1; SWP:Q9Y297; PDB:1P22A; SPAIMLQRDFITALPARGLDHIAENILSYLDAKSLCAAELVCKEWYRVTSDGMLWKKLIE --------1111--1111------------------------------------------ RMVRTDSLWRGLAERRGWGQYLFPPNSFYRALYPKIIQDIETIESNWRCGRHSLQRIHCR ----------3333----1111----------------------1111------------ SETSKGVYCLQYDDQKIVSGLRDNTIKIWDKNTLECKRILTGHTGSVLCLQYDERVIITG ------------------------------------------------------------ SSDSTVRVWDVNTGEMLNTLIHHCEAVLHLRFNNGMMVTCSKDRSIAVWDMASPTDITLR ---------3333-------------------!!!!----1111--------3333---- RVLVGHRAAVNVVDFDDKYIVSASGDRTIKVWNTSTCEFVRTLNGHKRGIACLQYRDRLV ---------------1111------------------------------------!!!!- VSGSSDNTIRLWDIECGACLRVLEGHEELVRCIRFDNKRIVSGAYDGKIKVWDLVAALDP ---1111-----------------------------------------------333333 RAPAGTLCLRTLVEHSGRVFRLQFDEFQIVSSSHDDTILIWD 333333------------------------------------ >RNA-binding protein 8A; SWP:Q9Y5S9; PDB:1P27B; PGPQRSVEGWILFVTGVHEEATEEDIHDKFAEYGEIKNIHLNLDRRTGYLKGYTLVEYET -----------------11113333----3333--------------------------3 YKEAQAAMEGLNGQDLMGQPISVDWCFVRGPP 333-------2222-iiii------------- >LIGHT CHAIN ANTI-LYSOZYME; SWP:P00698; PDB:1P2CA; DIELTQSPATLSVTPGDSVSLSCRASQSISNNLHWYQQKSHESPRLLIKYTSQSMSGIPS -------------2222-----------!!!!------2222------------222233 RFSGSGSGTDFTLSINSVETEDFGVYFCQQSGSWPRTFGGGTKLDIKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT -----------------------------iiii--------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRN -----1111--------1111----------- >Lysozyme C 
[Precursor]; SWP:P00698; PDB:1P2CB; EVQLQESGAELMKPGASVKISCKATGYTFTTYWIEWIKQRPGHSLEWIGEILPGSDSTYY ---------------------------1111--------2222--------2222----- NEKVKGKVTFTADASSNTAYMQLSSLTSEDSAVYYCARGDGFYVYWGQGTTLTVSSASTT ----1111----1111----------3333------------------------------ PPSVYPLAPGSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTV ------------------------------%%%%-------------%%%%--------- PSSPWPSETVTCNVAHPASSTKVDKKIVPR 3333------------1111---------- >RESPONSE REGULATOR; SWP:Q9WXY0; PDB:1P2FA; WKIAVVDDDKNILKKVSEKLQQLGRVKTFLTGEDFLNDEEAFHVVVLDVLPDYSGYEICR --------------------1111----------3333---------------------- IKETRPETWVILLTLLSDDESVLKGFEAGADDYVTKPFNPEILLARVKRFLEREKKGLYD ----3333---------------------------------------------------- FGDLKIDATGFTVFLKGKRIHLPKKEFEILLFLAENAGKVVTREKLLETFWEDPVSPRVV !!!!----------iiii-----------------2222----------------3333- DTVIKRIRKAIEDDPNRPRYIKTIWGVGYFTG -----------------------%%%%----- >RAS GTPASE-ACTIVATING-LIK; SWP:O14188; PDB:1P2XA; RETLQAYDYLCRVDEAKKWIEECLGTDLGPTSTFEQSLRNGVVLALLVQKFQPDKLIKIF -----------------------------333333331111----------1111----- YSNELQFRHSDNINKFLDFIHGIGLPEIFHFELTDIYEGKNLPKVIYCIHALSYFLSMQD -----3333-----------1111-3333--3333---------------------1111 LAPPLIKSDENLSFTDEDVSIIVRRLRQSNVILPNFKAL --------1111----------------------3333- >HEXON PROTEIN; SWP:P03277; PDB:1P2ZA; MMPQWSYMHISGQDASEYLSPGLVQFARATETYFSLNNKFRNPTVAPTHDVTTDRSQRLT -------------3333------------------1111--------------------- LRFIPVDREDTAYSYKARFTLAVGDNRVLDMASTYFDIRGVLDRGPTFKPYSGTAYNALA -----------------------------3333-----------1111--------1111 PKGAPNSCEWEQTHVYAQAPLSGETITKSGLQIGPVYADPSYQPEPQIGESQWNEADANA --------------------------1111--------3333--1111------------ AGGRVLKKTTPMKPCYGSYARPTNPFGGQSVLVLPKVDLQFFSNATKPKVVLYSEDVNME ------1111----2222-----1111--------------------------------- TPDTHLSYKPGKGDENSKAMLGQQSMPNRPNYIAFRDNFIGLMYYNSTGNMGVLAGQASQ 1111------------3333----------------%%%%------3333---------- 
LNAVVDLQDRNTELSYQLLLDSIGDRTRYFSMWNQAVDSYDPDVRIIENHGTEDELPNYC ------3333---------3333-------1111------1111---------------- FPLGGIGVTDTYQAIKATTWTKDETFATRNEIGVGNNFAMEINLNANLWRNFLYSNIALY -1111-----------------1111------------------------------3333 LPDKLKYNPTNVEISDNPNTYDYMNKRVVAPGLVDCYINLGARWSLDYMDNVNPFNHHRN -3333-----------1111---------3333-11112222---3333---11111111 AGLRYRSMLLGNGRYVPFHIQVPQKFFAIKNLLLLPGSYTYEWNFRKDVNMVLQSSLGND ------3333---------------3333------------------3333---3333-- LRVDGASIKFDSICLYATFFPMAHNTASTLEAMLRNDTNDQSFNDYLSAANMLYPIPANA -----------------------3333--------3333-----3333--------2222 TNVPISIPSRNWAAFRGWAFTRLKTKETPSLGSGYDPYYTYSGSIPYLDGTFYLNHTFKK -----------2222--------3333--------1111-----3333-----1111--- VAITFDSSVSWPGNDRLLTPNEFEIKRSVDGEGYNVAQCNMTKDWFLVQMLANYNIGYQG ------------%%%%--1111-----33331111%%%%--------------------- FYIPESYKDRMYSFFRNFQPMSRQVVDDTKYKEYQQVGILHQHNNSGFVGYLAPTMREGQ ----3333-11113333-------------1111---3333---2222------------ AYPANVPYPLIGKTAVDSITQKKFLCDRTLWRIPFSSNFMSMGALTDLGQNLLYANSAHA -----------1111------------------------------3333-3333------ LDMTFEVDPMDEPTLLYVLFEVFDVVRVHQPHRGVIETVYLRTPFSA -------------------------------2222------------ >HEXON PROTEIN; SWP:P04133; PDB:1P30A; MMPQWSYMHISGQDASEYLSPGLVQFARATETYFSLNNKFRNPTVAPTHDVTTDRSQRLT -------------3333------------3333--1111--------------------- LRFIPVDREDTAYSYKARFTLAVGDNRVLDMASTYFDIRGVLDRGPTFKPYSGTAYNALA -----------------------------3333-----------1111--------1111 PKGAPNPCEWDTHVFGQAPYSGINITKEGIQIGVEGQTPKYADKTFQPEPQIGESQWYET 1111---------------------1111-------------3333--3333-------- EINHAAGRVLKKTTPMKPCYGSYAKPTNENGGQGILVLESQVEMQFFSTTELTPKVVLYS ----------1111----2222-----1111----------------------------- EDVDIETPDTHISYMPTIKEGNSRELMGQQSMPNRPNYIAFRDNFIGLMYYNSTGNMGVL ----------------------3333----------------%%%%------3333---- AGQASQLNAVVDLQDRNTELSYQLLLDSIGDRTRYFSMWNQAVDSYDPDVRIIENHGTED -1111--------------------3333-1111--1111------1111---------- 
ELPNYCFPLGGVINTETLTKVKPGWEKDAFSDKNEIRVGNNFAMEINLNANLWRNFLYSN -------1111------------------------------------------------- IALYLPDKLKYSPSNVKISDNPNTYDYMNKRVVAPGLVDCYINLGARWSLDYMDNVNPFN -11113333-----------1111---------3333-11112222---3333---3333 HHRNAGLRYRSMLLGNGRYVPFHIQVPQKFFAIKNLLLLPGSYTYEWNFRKDVNMVLQSS 1111------3333---------------3333------------------3333---11 LGNDLRVDGASIKFDSICLYATFFPMAHNTASTLEAMLRNDTNDQSFNDYLSAANMLYPI 11--3333-------------------------------3333-----3333-------- PANATNVPISIPSRNWAAFRGWAFTRLKTKETPSLGSGYDPYYTYSGSIPYLDGTFYLNH 2222-----------2222--------3333--------1111-----3333-----111 TFKKVAITFDSSVSWPGNDRLLTPNEFEIKRSVDGEGYNVAQCNMTKDWFLVQMLANYNI 1---------------%%%%--1111--------3333-%%%%----------------- GYQGFYIPESYKDRMYSFFRNFQPMSRQVVDDTKYKDYQQVGILHQHNNSGFVGYLAPTM --------3333-11113333-------------1111---3333---2222-------- REGQAYPANFPYPLIGKTAVDSITQKKFLCDRTLWRIPFSSNFMSMGALTDLGQNLLYAN ---------------1111------------------------------3333-3333-- SAHALDMTFEVDPMDEPTLLYVLFEVFDVVRVHRPHRGVIETVYLRTPFSA -----------------------------------2222------------ >MITOCHONDRIAL MATRIX PROT; SWP:Q07021; PDB:1P32A; MHTDGDKAFVDFLSDEIKEERKIQKHKTLPKMSGGWELELNGTEAKLVRKVAGEKITVTF ----------------------------------------!!!!------2222------ NINNSIPLTSTPNFVVEVIKNDDGKKALVLDCHYPEDEAESDIFSIREVSFQSTGESEWK -------------------1111--------------1111------------------1 DTNYTLNTDSLDWALYDHLMDFLADRGVDNTFADELVELSTALEHQEYITFLEDLKSFVK 111--------------------1111--------------------------------- SQ -- >PTERIDINE REDUCTASE 1; SWP:P42556; PDB:1P33A; TAPVALVTGAAKRLGSSIAEALHAEGYTVCLHYHRSAADASTLAATLNARRPNSAITVQA ----------------------------------------------1111---------- DLSNVATASSVPVTLFSRCSALVDACYMHWGRCDVLVNNASSFYPTPLLRDKESLEVAAA ------------------------------------------------------------ DLFGSNAIAPYFLIKAFAQRVADTRAEQRGTSYSIVNMVDAMTSQPLLGYTMYTMAKEAL -------------------------1111------------3333-2222---------- EGLTRSAALELASLQIRVNGVSPGLSVLPDDMPFSVQEDYRRKVPLYQRNSSAEEVSDVV 
-----------1111------------------------1111--------3333----- IFLCSPKAKYITGTCIKVDGGYSLTRA -------1111-------iiii----- >P35; SWP:P08160; PDB:1P35A; CVIFPVEIDVSQTIIRDCQVDKQTRELVYINKIMNTQLTKPVLMMFNISGPIRSVTRKNN ----------------------------------------------------------11 NLRDRIKSKVDEQFDQLERDYSDQMDGFHDSIKYFKDEHYSVSCQNGSVLKSKFAKILKS 11----------3333-----------------------------33331111-----11 HDYTDKKSIEAYEKYCLPKLVDERNDYYVAVCVLKPGFENGSNQVLSFEYNPIGNKVIVP 11-------------3333---------------1111--------------%%%%---- FAHEINDTGLYEYDVVAYVDSVQFDGEQFEEFVQSLILPSSFKNSEKVLYYNEASKNKSM -----3333-----------------------1111------------------------ IYKALEFTTESSWGKSEKYNWKIFCNGFIYDKKSKVLYVKLHNVTSALNKNVILNTIKA ----------------------------------------------------------- >GLUTAMYL-ENDOPEPTIDASE; SWP:Q9EXR9; PDB:1P3CA; VVIGDDGRTKVANTRVAPYNSIAYITFGGSSCTGTLIAPNKILTNGHCVYNTASRSYSAK --------------------------------------------3333---1111----- GSVYPGMNDSTAVNGSANMTEFYVPSGYINTGASQYDFAVIKTDTNIGNTVGYRSIRQVT -------%%%%1111---------3333----3333---------3333----------- NLTGTTIKISGYPGDKMRSTGKVSQWEMSGSVTREDTNLAYYTIDTFSGNSGSAMLDQNQ -2222------------------------------1111----------2222---1111 QIVGVHNAGYSNGTINGGPKATAAFVEFINYAKAQ ---------%%%%------------------1111 >UDP-N-ACETYLMURAMATE--ALA; SWP:P45066; PDB:1P3DA; IIPERRVQQIHFIGIGGAGSGIAEILLNEGYQISGSDIADGVVTQRLAQAGAKIYIGHAE -------------1111---------1111-----------------1111--------- EHIEGASVVVVSSAIKDDNPELVTSKQKRIPVIQRAQLAEIRFRHGIAVAGTHGKTTTTA 1111-------33331111--------------33333333------------------- ISIYTQAKLDPTFVNGGLVKSAGKNAHLGASRYLIAEADESDASFLHLQPVSVVTNEPDH ------------------1111-----------------1111-3333------------ DTYEGDFEKKATYVKFLHNLPFYGLAVCADDPVLELVPKVGRQVITYGFSEQADYRIEDY 1111-3333-------33331111---1111----3333----------1111------- EQTGFQGHYTVICPNNERINVLLNVPGKHNALNATAALAVAKEEGIANEAILEALADFQG ------------1111-------------------------1111--------------- AGRRFDQLGEFIRPNGKVRLVDDYGHHPTEVGVTIKAAREGWGDKRIVIFQPHRYSRTRD 
2222--------1111-------------------------------------------- LFDDFVQVLSQVDALILDVYAAGEAPIVGADSKSLCRSIRNLGKVDPILVSDTSQLGDVL -------1111---------iiii--2222---------3333--------3333----- DQIIQDGDLILAQGAGSVSKISRGLAESW ----2222--------------------- >10 KDA CHAPERONIN; SWP:P09621; PDB:1P3HA; AKVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRWDEDGEKRIPL -------!!!!-----------1111----3333-------------------------- DVAEGDTVIYSKYGGTEIKYNGEEYLILSARDVLAVVSK --2222-----2222----%%%%-----3333------- >LYSOZYME; SWP:P00720; PDB:1P3NA; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPIYDSLDAVRRAALVNLVFQIGETGAAGFTNSLRY ---------------------------1111----------------------------- LQQKRWDEAAVNFAKSRWYNQTPNRAKRIITVFRTGTWDAYKNL 1111---------------------------------3333--- >Vacuolar protein sorting-; SWP:P54787; PDB:1P3QQ; SSLIKKIEENERKDTLNTLQNFPDDPSLIEDVCIAAASGPCVD --------------------------------------3333- >DISABLED HOMOLOG 2; SWP:P98078; PDB:1P3RA; MEKTDEYLLARFKGDGVKYKAKLIGIDDVPDARGDKMSQDSMMKLKGMAAAGRSQGQHKQ ---3333-----!!!!------------------------------------1111---- RIWVNISLSGIKIIDEKTGVIEHEHPVNKISFIARDVTDNRAFGYVCGGEGQHQFFAIKT ------1111---------------1111------1111---------2222-------- GQQAEPLVVDLKDLFQVIYNVKKKEEDK ---------------------------- >CYSTEINE DESULFURASE; SWP:P39171; PDB:1P3WB; KLPIYLDYSATTPVDPRVAEKMMQFMTMDGTFGNPASRSHRFGWQAEEAVDIARNQIADL ------3333-----------1111-1111---1111--3333----------------- VGADPREIVFTSGATESDNLAIKGAANFYQKKGKHIITSKTEHKAVLDTCRQLEREGFEV ---3333---------------------3333------11113333-------1111--- TYLAPQRNGIIDLKELEAAMRDDTILVSIMHVNNEIGVVQDIAAIGEMCRARGIIYHVDA -----1111-------33331111-------------------------1111------- TQSVGKLPIDLSQLKVDLMSFSGHKIYGPKGIGALYVRRKPRVRIEAQMHGGGHERGMRS --2222-----------------------------------------------%%%%--- GTLPVHQIVGMGEAYRIAKEEMATEMERLRGLRNRLWNGIKDIEEVYLNGDLEHGAPNIL ---------------------------------------1111------------1111- 
NVSFNYVEGESLIMALKDLAVSSGSAEPSYVLRALGLNDELAHSSIRFSLGRFTTEEEID -------3333-3333-------------3333-----------------1111------ YTIELVRKSIGRLRDLSPLWEMYKQ ------------3333--------- >Mersacidin decarboxylase; SWP:Q9RC23; PDB:1P3Y1; ISILKDKKLLIGICGSISSVGISSYLLYFKSFFKEIRVVMTKTAEDLIPAHTVSYFCDHV 3333-----------3333---------1111----------------33331111---- YSEHGENGKRHSHVEIGRWADIYCIIPATANILGQTANGVAMNLVATTVLAHPHNTIFFP -1111------------------------------1111---------1111-------- NMNDLMWNKTVVSRNIEQLRKDGHIVIEPVEIMRGLITPDKALLAIEKGFK --3333--------------------------------------------- >ANTIBODY VARIABLE LIGHT C; SWP:NA; PDB:1P4BH; DVQLQQSGPGLVAPSQSLSITCTVSGFSLTDYGVNWVRQSPGKGLEWLGVIWGDGITDYN ---------------------------3333--------2222----------------1 SALKSRLSVTKDNSKSQVFLKMNSLQSGDSARYYCVTGLFDYWGQGTTLTVS 1111111-----1111---------3333----------------------- >L(+)-MANDELATE DEHYDROGEN; SWP:P20932; PDB:1P4CA; NLFNVEDYRKLAQKRLPKMVYDYLEGGAEDEYGVKHNRDVFQQWRFKPKRLVDVSRRSLQ ---3333--------------------!!!!----------------------------- AEVLGKRQSMPLLIGPTGLNGALWPKGDLALARAATKAGIPFVLSTASNMSIEDLARQCD --iiii------------3333-------------------------------------- GDLWFQLYVIHREIAQGMVLKALHTGYTTLVLTTDVAVNGYRERDLHNRFKIPPFLTLKN ----------------------1111--------------------------1111-333 FEGIDLGKMDKANLEMQAALMSRQMDASFNWEALRWLRDLWPHKLLVKGLLSAEDADRCI 3--------------3333------1111------------------------------1 AEGADGVILSNHGGRQLDCAISPMEVLAQSVAKTGKPVLIDSGFRRGSDIVKALALGAEA 111-------%%%%--1111-------------------------3333----------- VLLGRATLYGLAARGETGVDEVLTLLKADIDRTLAQIGCPDITSLSPDYLQNE ---3333---------------------------------1111-3333---- >TRAI PROTEIN; SWP:P14565; PDB:1P4DA; MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGRL ------------------3333-3333-----------1111-----------3333--1 PDGADLSRMQDGSNRHRPGYDLTFSAPKSVSMMAMLGGDKRLIDAHNQAVDFAVRQVEAL 111------iiii----------------------------------------------- ASTRVMTDGQSETVLTGNLVMALFNHDTSRDQEPQLHTHAVVANVTQHNGEWKTLSSDKV 
------iiii-------------------------------------iiii--------- GKTGFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEAFSVDPEI ---------------------------3333--------%%%%--22223333------- KMAEWMQTLKETGFDIRAYRDAADQRADLRTLTPG ---------3333---------------------- >N(4)-(BETA-N-ACETYLGLUCOS; SWP:Q47898; PDB:1P4KA; TTNKPIVLSTWNFGLHANVEAWKVLSKGGKALDAVEKGVRLVEDDPTERSVGYGGRPDRD -----------------------3333-----------------1111---2222--111 GRVTLDACIMDENYNIGSVACMEHIKNPISVARAVMEKTPHVMLVGDGALEFALSQGFKK 1-------------------------3333-----------------------1111--- ENLLTAESEKEWKEWLKTSQYKPIVNIENHNTIGMIALDAQGNLSGACTTSGMAYKMHGR --------------------------------------1111----------22222222 VGDSPIIGAGLFVDNEIGAATATGHGEEVIRTVGTHLVVELMNQGRTPQQACKEAVERIV --1111-------1111------------------------1111--------------- KIVNRRGKNLKDIQVGFIALNKKGEYGAYCIQDGFNFAVHDQKGNRLETPGFALK ---1111-3333--------1111----------------1111----------- >Killer cell lectin-like r; SWP:Q64329; PDB:1P4LD; RGVKYWFCYSTKCYYFIMNKTTWSGCKANCQHYGVPILKIEDEDELKFLQRHVIPGNYWI ---------------------3333-----1111-------------------------- GLSYDKKKKEWAWIDNGPSKLDMKIKKMNFKSRGCVFLSKARIEDIDCNIPYYCICGKKL ------------1111------------------------------------------33 DK 33 >INSULIN-LIKE GROWTH FACTO; SWP:P08069; PDB:1P4OA; PEYFSAADVYVPDEWEVAREKITMSRELGQGSFGMVYEGVAKGVVKDEPETRVAIKTVNE ------------1111-3333---------1111----------2222----------11 AASMRERIEFLNEASVMKEFNCHHVVRLLGVVSQGQPTLVIMELMTRGDLKSYLRSLRPA 11------------1111---1111-------------------1111------------ MANNPVLAPPSLSKMIQMAGEIADGMAYLNANKFVHRDLAARNCMVAEDFTVKIGDFGMT ---3333----------------------1111------3333---1111---------- RDIYETDYYRKGGKGLLPVRWMSPESLKDGVFTTYSDVWSFGVVLWEIATLAEQPYQGLS 33331111-2222----1111--------------------------1111----1111- NEQVLRFVMEGGLLDKPDNCPDMLFELMRMCWQYNPKMRPSFLEIISSIKEEMEPGFREV -------1111-----2222--------------3333----------3333-3333--- SFYYSEEN -1111--- >OUTER SURFACE PROTEIN B; SWP:Q44975; PDB:1P4PA; SKKLTRSNGTTLEYSQITDADNATKAVETLKNSIKLEGSLVVGKTTVEIKEGTVTLKREI 
-----1111---------------------%%%%------iiii------!!!!------ EKDGKVKVFLNDTAGSNKKTGKWEDSTSTLTISADSKKTKDLVFLTDGTITVQQYNTAGT ------------------------1111-----!!!!-------1111-------3333- SLEGSASEIKNLSELKNALK ----------------1111 >CBP/P300-INTERACTING TRAN; SWP:Q99967; PDB:1P4QA; GSGSGSGSNVIDTDFIDEEVLMSLVIEMGLDRIKELPELWLGQNEFDFMTDF ------------11113333--------1111-------------------- >ADP-RIBOSYLATION FACTOR B; SWP:Q9NZ52; PDB:1P4UA; LSLASIHVPLESIKPSSALPVTAYDKNGFRILFHFAKECPPGRPDVLVVVVSMLNMAPLP --1111--3333-------------iiii----------2222----------------- VKSIVLQAAAPKSMKVKLQPPSGTELSPFSPIQPPAAITQVMLLANPLKEKVRLRYKLTF ----------1111---------------3333------------1111----------- ALGEQLSTEVGEVDQFPPVEQWGNL -!!!!------------3333---- >RCSB; SWP:P96320; PDB:1P4WA; YTPESVAKLLEKISAGGYGDKRLSPKESEVLRLFAEGFLVTEIAKKLNRSIKTISSQKKS ----3333-------------------------3333----------------------- AMMKLGVDNDIALLNYLSSVSMTPVDK --------3333--------------- >STAPHYLOCOCCAL ACCESSORY ; SWP:Q2G1N7; PDB:1P4XA; MKYNNHDKIRDFIIIEAYMFRFKKKVKPEVDMTIKEFILLTYLFHQQENTLPFKKIVSDL -------------------------3333----------------------3333----- CYKQSDLVQHIKVLVKHSYISKVRSKIDERNTYISISEEQREKIAERVTLFDQIIKQFNL --3333--------1111------------------------------------------ ADQSESQMIPKDSKEFLNLMMYTMYFKNIIKKHLTLSFVEFTILAIITSQNKNIVLLKDL -----------------------------------------------1111----3333- IETIHHKYPQTVRALNNLKKQGYLIKERSTEDERKILIHMDDAQQDHAEQLLAQVNQLLA -------------------------------3333---------------------1111 DKDHLHLVFE --1111---- >Phosphomannomutase/phosph; SWP:P26276; PDB:1P5DX; LPASIFRAYDIRGVVGDTLTAETAYWIGRAIGSESLARGEPCVAVGRDGRLSGPELVKQL -3333-1111---2222------------------1111----------1111------- IQGLVDCGCQVSDVGMVPTPVLYYAANVLEGKSGVMLTGHNPPDYNGFKIVVAGETLANE ----1111---------3333--------------------1111------iiii--!!! 
QIQALRERIEKNDLASGVGSVEQVDILPRYFKQIRDDIAMAKPMKVVVDCGNGVAGVIAP !---------------------------------1111-----------%%%%------- QLIEALGCSVIPLYCEVDGNFPNHHPDPGKPENLKDLIAKVKAENADLGLAFDGDGDRVG -----------------1111-----11111111------------------1111---- VVTNTGTIIYPDRLLMLFAKDVVSRNPGADIIFDVKCTRRLIALISGYGGRPVMWKTGHS --1111-------------------2222----1111----------------------- LIKKKMKETGALLAGEMSGHVFFKERWFGFDDGIYSAARLLEILSQDQRDSEHVFSAFPS ---------------3333----2222----------------1111-------3333-- DISTPEINITVTEDSKFAIIEALQRDAQWGEGNITTLDGVRVDYPKGWGLVRASNTTPVL -----------1111----------------------------1111------------- VLRFEADTEEELERIKTVFRNQLKAVDSSLPVPF --------------------------1111---- >RNA-BINDING PROTEIN REGUL; SWP:Q99497; PDB:1P5FA; SKRALVILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLEDAK --------2222-3333-------1111---------------1111------------1 KEGPYDVVVLPGGNLGAQNLSESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSK 111------------------------------1111-----!!!!---------2222- VTTHPLAKDKMMNGGHYTYSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQVK ---1111-3333--------------!!!!----1111---------------------3 APLVLK 333--- >L-SERINE DEHYDRATASE; SWP:P20132; PDB:1P5JA; GEPLHVKTPIRDSMALSKMAGTSVYLKMDSAQPSGSFKIRGIGHFCKRWAKQGCAHFVCS ---------------------------3333----------------------------- SAGNAGMAAAYAARQLGVPATIVVPGTTPALTIERLKNEGATCKVVGELLDEAFELAKAL ------------------------------------------------------------ AKNNPGWVYIPPFDDPLIWEGHASIVKELKETLWEKPGAIALSVGGGGLLCGVVQGLQEC ---2222-------3333---------------------------------------111 GWGDVPVIAMETFGAHSFHAATTAGKLVSLPKITSVAKALGVKTVGSQALKLFQEHPIFS 11111-------------------------------3333----------3333------ EVISDQEAVAAIEKFVDDEKILVEPACGAALAAVYSHVIQKLQLEGNLRTPLPSLVVIVC -----------------------3333------1111----------------------- GGSNISLAQLRALKEQLGM --------------1111- >FK506-BINDING PROTEIN 4; SWP:Q02790; PDB:1P5QA; EEDGGIIRRIQTRGEGYAKPNEGAIVEVALEGYYKDKLFDQRELRFEIGEGENLDLPYGL -%%%%---------------2222-------------------------3333---3333 
ERAIQREKGEHSIVYLKPSYAFGSVGKEKFQIPPNAELKYELHLKSFEKAKESWENSEEK ------2222-------1111------1111------------------------1111- LEQSTIVKERGTVYFKEGKYKQALLQYKKIVSWLEYESSFSNEEAQKAQALRLASHLNLA -------------------3333------------------------------------- MCHLKLQAFSAAIESCNKALELDSNNEKGLSRRGEAHLAVNDFELARADFQKVLQLYPNN ---1111---------------1111-----------1111------------------- KAAKTQLAVCQQRIRRQLAREKKLYANMFERLAEEENKAKA ----------------------------------------- >DOCKING PROTEIN 1; SWP:P97465; PDB:1P5TA; GSQFWVTSQKTEASERCGLQGSYILRVEAEKLTLLTLGAQSQILEPLLFWPYTLLRRYGR --------------------------------------------------3333------ DKVFSFEAGRRCPSGPGTFTFQTSQGNDIFQAVEAAIQQQKAQGK --------1111--------------------------------- >CHAPERONE PROTEIN CAF1M; SWP:P26926; PDB:1P5UA; SKEYGVTIGESRIIYPLDAAGVMVSVKNTQDYPVLIQSRIYDENKEPFVVTPPLFRLDAK ---------------2222----------------------1111------------222 QQNSLRIAQAGGVFPRDKESLKWLCVKGIPPKDPDKDVGVFVQFAINNCIKLLVRPNELK 2------------------------------------------------------3333- GTPIQFAENLSWKVDGGKLIAENPSPFYMNIGELTFGGKSIPSHYIPPKSTWAFDLPKGL -33333333-----%%%%-----------------iiii--------------------2 AGARNVSWRIINDQGGLDRLYSKNVT 222--------1111----------- >CHAPERONE PROTEIN CAF1M; SWP:P26926; PDB:1P5VA; FASKEYGVTIGESRIIYPLDAAGVMVSVKNTQDYPVLIQSRIYDPFVVTPPLFRLDAKQQ -----------------2222----------------------------------2222- NSLRIAQAGGVFPRDKESLKWLCVKGIPKDVGVFVQFAINNCIKLLVRPNELKGTPIQFA ------------------------------------------------------333333 ENLSWKVDGGKLIAENPSPFYMNIGELTFGGKSIPSHYIPPKSTWAFDLPNVSWRIINDQ 33-----%%%%-----------------%%%%-------------------------111 GGLDRLYSKNV 1---------- >F1 capsule antigen [Precu; SWP:P26948; PDB:1P5VB; VEPARITLTYKEGAPITIMDNGNIDTELLVGTLTLGGYKTGTTSTSVNFTDAAGDPMYLT ------------------1111----------------22221111----33332222-- FTSQDGNNHQFTTKVIGKDSRDFDISPKVNGENLVGDDVVLATGSQDFFVRSIGSKGGKL ------------------1111------iiii----------------------3333-- AAGKYTDAVTVTVSNQ ---------------- >Deoxycytidine kinase; SWP:P27707; PDB:1P5ZB; 
RIKKISIEGNIAAGKSTFVNILKQLCEDWEVVPEPVARWCNVQSTNGGNVLQMMYEKPER ---------2222--------33331111-----3333------------------3333 WSFTFQTYACLSRIRAQLASLNGKLKDAEKPVLFFERSVYSDRYIFASNLYESECMNETE ----------------------3333------------------------1111------ WTIYQDWHDWMNNQFGQSLELDGIIYLQATPETCLHRIYLRGRNEEQGIPLEYLEKLHYK ----------------1111----------------------3333---3333------- HESWLLHRTLKTNFDYLQEVPILTLDVNEDFKDKYESLVEKVKEFLSTL -------------3333------------3333-----------3333- >NUCLEOCAPSID PROTEIN; SWP:Q9YJI1; PDB:1P65A; DVRHHFTPSERQLCLSSIQTAFNQGAGTCTLSDSGRISYTVEFSLPTHHTVRLIRVT 3333---------------------------1111----------3333-------- >DE NOVO DESIGNED PROTEIN ; SWP:NA; PDB:1P68A; MYGKLNDLLEDLQEVLKNLHKNWHGGKDNLHDVDNHLQNVIEDIHDFMQGGGSGGKLQEM ---3333----------3333-----3333----------------1111---------- MKEFQQVLDELNNHLQGGKHTVHHIEQNIKEIFHHLEELVHR ----------1111---3333------3333----------- >CYTOSINE DEAMINASE; SWP:Q12178; PDB:1P6OA; TGGMASKWDQKGMDIAYEEAALGYKEGGVPIGGCLINNKDGSVLGRGHNMRFQKGSATLH iiii-1111--------------1111------------------------1111----- GEISTLENCGRLEGKVYKDTTLYTTLSPCDMCTGAIIMYGIPRCVVGENVNFKSKGEKYL ------3333--3333-------------------------------------------- QTRGHEVVVVDDERCKKIMKQFIDERPQDWFEDIGE 1111---------------------------1111- >FATTY ACID-BINDING PROTEI; SWP:P83409; PDB:1P6PA; AFNGTWNVYAQENYENFLRTVGLPEDIIKVAKDVNPVIEIEQNGNEFVVTSKTPKQTHSN -----------------------3333--3333-------------------2222---- SFTVGKESEITSMDGKKIKVTVQLEGGKLICKSDKFSHIQEVNGDEMVEKITIGSSTLTR ------------------------------------------!!!!------!!!!---- KSKRV ----- >CHEY2; SWP:Q52884; PDB:1P6QA; MSLAEKIKVLIVDDQVTSRLLLGDALQQLGFKQITAAGDGEQGMKIMAQNPHHLVISDFN --------------------------1111----------------3333---------- MPKMDGLGLLQAVRANPATKKAAFIILTAQGDRALVQKAAALGANNVLAKPFTIEKMKAA ---------------3333-------------------1111----------3333---- IEAVFGALK --------- >PENICILLINASE REPRESSOR; SWP:P06555; PDB:1P6RA; MKKIPQISDAELEVMKVIWKHSSINTNEVIKELSKTSTWSPKTIQTMLLRLIKKGALNHH 
-------3333------3333------------------3333----------------- KEGRVFVYTPNIDESDYIEVKS ---------------------- >RAC-BETA SERINE/THREONINE; SWP:P31751; PDB:1P6SA; MNEVSVIKEGWLHKRGEYIKTWRPRYFLLKSDGSFIGYKERPEAPDQTLPPLNNFSVAEC -------------------------------------------1111------------- QLMKTERPRPNTFVIRCLQWTTVIERTFHVDSPDEREEWMRAIQMVANSLK --------------------------------------------------- >POTENTIAL COPPER-TRANSPOR; SWP:O32220; PDB:1P6TA; MLSEQKEIAMQVSGMTCAACAARIEKGLKRMPGVTDANVNLATETVNVIYDPAETGTAAI ------------------------------3333-----3333------------3333- QEKIEKLGYHVVTEKAEFDIEGMTCAACANRIEKRLNKIEGVANAPVNFALETVTVEYNP ------------------------------------------------------------ KEASVSDLKEAVDKLGYKLKLKGEQDSIEGR ---3333-----------------%%%%--- >THYMIDINE KINASE; SWP:P24425; PDB:1P6XA; SHMVTIVRIYLDGVYGIGKSTTGRVMASAASGGSPTLYFPEPMAYWRTLFETDVISGIYD -------------2222----------3333----------3333--------------- TQNRKQQGNLAVDDAALITAHYQSRFTTPYLILHDHTCTLFGGNSLQRGTQPDLTLVFDR ------------------------------------1111-------------------- HPVASTVCFPAARYLLGDMSMCALMAMVATLPREPQGGNIVVTTLNVEEHIRRLRTRARI -3333---------------------3333----2222---------------------- GEQIDITLIATLRNVYFMLVNTCHFLRSGRVWRDGWGELPTSCGAYKHRATQMDAFQERV ------------------------------33333333-------------2222----- SPELGDTLFALFKTQELLDDRGVILEVHAWALDALMLKLRNLNVFSADLSGTPRQCAAVV --33333333---3333-1111----------------1111------------------ ESLLPLMSSTLSDFDSASALERAARTFNAEMGV --3333--------------------------- >DNA-BINDING PROTEIN HU; SWP:P05514; PDB:1P71A; MNKGELVDAVAEKASVTKKQADAVLTAALETIIEAVSSGDKVTLVGFGSFESRERKAREG -----------1111-3333---------------1111--------------------- RNPKTNEKMEIPATRVPAFSAGKLFREKVAPPKA ---------------------------------- >SHIKIMATE 5-DEHYDROGENASE; SWP:P43876; PDB:1P77A; MDLYAVWGNPIAQSKSPLIQNKLAAQTHQTMEYIAKLGDLDAFEQQLLAFFEEGAKGCNI ----------1111---------------------------------------------- TSPFKERAYQLADEYSQRAKLAEACNTLKKLDDGKLYADNTDGIGLVTDLQRLNWLRPNQ --------3333------------------1111----------------------2222 
HVLILGAGGATKGVLLPLLQAQQNIVLANRTFSKTKELAERFQPYGNIQAVSMDSIPLQT -------3333-------------------3333-------3333------1111----- YDLVINATSASVDAEILKLGSAFYDMQYAKGTDTPFIALCKSLGLTNVSDGFGMLVAQAA ----------------1111--------2222--------1111---------------- HSFHLWRGVMPDFVSVYEQLKKAML ------------------------- >KRUPPEL-LIKE FACTOR 3; SWP:Q60980; PDB:1P7AA; GSTRGSTGIKPFQCPDCDRSFSRSDHLALHRKRHMLV ------------------------------1111--- >Nuclear factor of activat; SWP:Q13469; PDB:1P7HL; SSVPLEWPLSSQSGSYELRIEVQPKPHHRAHYETEGSRGAVKAPTGGHPVVQLHGYMENK ---3333-----!!!!---------------1111------------------------- PLGLQIFIGTADERILKPHAFYQVHRITGKTVTTTSYEKIVGNTKVLEIPLEPKNNMRAT ---------------------------------------------------3333----- IDCAGILKLRNADIELRKGETDIGRKNTRVRLVFRVHIPESSGRIVSLQTASNPIECSQR ---------3333----------2222------------1111----------------- SAHELPMVERQDTDSCLVYGGQQMILTGQNFTSESKVVFTEKTTDGQQIWEMEATVDKDK ----------------3333-----------1111-------3333----------3333 SQPNMLFVEIPEYRNKHIRTPVKVNFYVINGKRKRSQPQHFTYHPV --------------1111---------------------------- >ANTIBODY LIGHT CHAIN FAB; SWP:NA; PDB:1P7KA; ELQMTQSPASLSASVGETVTITCRASENIYSYLAWYQQKQGKSPQLLVYNAKTLAEGVPS -------------2222---------------------2222------------222233 RFSGSGSGTQFSLKINSLQPEDFGSYYCQHHYGTPLTFGAGTKLELKRADAAPTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -----1111--------1111------------- >ANTIBODY LIGHT CHAIN FAB; SWP:NA; PDB:1P7KB; QVKLLESGPELVKPGASVKMSCKASGYTFTSYVMHWVKQKPGQGLEWIGYINPYNDGTKY ------------2222-----------1111--------2222----------------- NEKFKGKATLTSDKSSSTAYMELSSLTSEDSAVYYCVRGGYRPYYAMDYWGQGTSVTVSS 3333---------1111---------3333------------1111-------------- AKTTPPSVYPLAPGSNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTL ------------------------------------%%%%-------------iiii--- SSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD 
------1111------------1111----------- >PHOSPHOLIPASE A2; SWP:NA; PDB:1P7OA; NLLQFRNMIKCTIPGREPLLAFSNYGCYCGKGGSGTPVDELDRCCQTHDNCYDKAEKLPE ----------------3333------------------3333-----------1111333 CKGILSGPYFNTYSYDCTDGKLTCNDQNDKCKLFICNCDRTAAMCFAKAPYNEAYNHFNR 3-----3333-----------------------------------------3333---11 QLCK 11-- >MALATE SYNTHASE G; SWP:P37330; PDB:1P7TA; TITQSRLRIDANFKRFVDEEVLPGTGLDAAAFWRNFDEIVHDLAPENRQLLAERDRIQAA ---!!!!-------------3333------------------------------------ LDEWHRSNPGPVKDKAAYKSFLRELGYLVPQPERVTVETTGIDSEITSQAGPQLVVPAMN ------------------------------------------3333----------3333 ARYALNAANARWGSLYDALYGSDIIPQDPQRGEQVIAWVRRFLDESLPLENGSYQDVVAF 3333------------------------------------------------3333---- KVVDKQLRIQLKNGKETTLRTPAQFVGYRGDAAAPTCILLKNNGLHIELQIDANGRIGKD ----------1111------3333------1111-------iiii------1111----- DPAHINDVIVEAAISTILDCEDSVAAVDAEDKILLYRNLLGLMQGTLQEKMQIVRKLNDD 1111----------------1111------------------------------------ RHYTAADGSEISLHGRSLLFIRNVGHLMTIPVIWDSEGNEIPEGILDGVMTGAIALYDLK ----1111--------------------------1111---------------------- VQKNSRTGSVYIVKPKMHGPQEVAFANKLFTRIETMLGMAPNTLKMGIMDEERRTSLNLR ---------------------------------------2222--------3333----- SCIAQARNRVAFINTGFLDRTGDEMHSVMEAGPMLRKNQMKSTPWIKAYERNNVLSGLFC -----1111------------------1111----11111111----------------- GLRGKAQIGKGMWAMPDLMADMYSQKGDQLRAGANTAWVPSPTAATLHALHYHQTNVQSV -2222---------1111------------------------------3333-------- QANIAQTEFNAEFEPLLDDLLTIPVAENANWSAQEIQQELDNNVQGILGYVVRWVEQGIG -------3333-------1111---------3333------------------------- SKVPDIHNVALMEDRATLRISSQHIANWLRHGILTKEQVQASLENMAKVVDQQNAGDPAY ----1111--------------------1111-------------------1111-1111 RPMAGNFANSCAFKAASDLIFLGVKQPNGYTEPLLHAWRLREKES --2222---------------33332222-3333----------- >CATALASE HPII; SWP:P21179; PDB:1P80A; DSLAPEDGSHRPAAEPTPPGAQPTAPGSLKAPDTRNEKLNSLEDVRKGSENYALTTNQGV ----1111---------2222----3333-1111-------3333---2222---1111- 
RIADDQNSLRAGSRGPTLLEDFILREKITHFDHERIPERIVHARGSAAHGYFQPYKSLSD -----------1111--1111---------1111-----------------------333 ITKADFLSDPNKITPVFVRFSTVQGGAGSADTVRQIRGFATKFYTEEGIFDLVGNNTPIF 3--1111-1111-------------1111---------------1111------------ FIQDAHKFPDFVHAVKPEPHWAIPQGQSAHDTFWDYVSLQPETLHNVMWAMSDRGIPRSY ---3333--------------------------------3333--------3333---33 RTMEGFGIHTFRLINAEGKATFVRFHWKPLAGKASLVWDEAQKLTGRDPDFHRRELWEAI 33------------1111----------1111---------------1111--------1 EAGDFPEYELGFQLIPEEDEFKFDFDLLDPTKLIPEELVPVQRVGKMVLNRNPDNFFAEN 111------------3333------1111-----3333----------------3333-1 EQAAFHPGHIVPGLDFTNDPLLQGRLFSYTDTQISRLGGPNFHEIPINRPTCPYHNFQRD 111--1111-2222----3333------------1111--11113333------------ GMHRMGIDTNPANYEPNSINDNWPRETPPGPKRGGFESYQERVEGNKVRERSPSFGEYYS -------------------%%%%----------------------------3333----- HPRLFWLSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLAHIDLTLAQAVAKNLGIEL -----1111---------------1111-3333-------1111---------1111--- TDDQLNITPPPDVNGLKKDPSLSLYAIPDGDVKGRVVAILLNDEVRSADLLAILKALKAK 3333--------iiii--3333---------2222------------------------- GVHAKLLYSRMGEVTADDGTVLPIAATFAGAPSLTVDAVIVPCGNIADIADNGDANYYLM ---------------1111-------3333-3333---------33331111-------- EAYKHLKPIALAGDARKFKATIKIADQGEEGIVEADSADGSFMDELLTLMAAHRVWSRIP -----------------3333-------2222-----------------1111-333311 KIDKIPA 11----- >PROTEIN TYROSINE PHOSPHAT; SWP:O00810; PDB:1P8AA; AAEKKAVLFVCLGNICRSPACEGICRDMVGDKLIIDSAATSGFHVGQSPDTRSQKVCKSN ------------------------------------------------------------ GVDISKQRARQITKADFSKFDVIAALDQSILSDINSMKPSNCRAKVVLFNPPNGVDDPYY -------------3333---------3333------------------------------ SSDGFPTMFASISKEMKPFLTEHGLI ------------------3333---- >PEA ALBUMIN 1, SUBUNIT B; SWP:P08687; PDB:1P8BA; ASCNGVCSPFEMPPCGTSACRCIPVGLVIGYCRNPSG -------1111--%%%%-------------------- >OXYSTEROLS RECEPTOR LXR-B; SWP:P55055; PDB:1P8DA; VQLTAAQELMIQQLVAAQLQCNKRSFSDQPKVTPWPLQSRDARQQRFAHFTELAIISVQE 
------------------------------------------------------------ IVDFAKQVPGFLQLGREDQIALLKASTIEIMLLETARRYNHETECITFLKDFTYSKDDFH ---------1111----------------------1111-1111----------3333-- RAGLQVEFINPIFEFSRAMRRLGLDDAEYALLIAINIFSADRPNVQEPGRVEALQQPYVE ----3333-----------1111-----------33331111----3333----3333-- ALLSYTRIKRPQDQLRFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDVHE ---------1111--------------------------1111-----3333-------- >FURIN PRECURSOR; SWP:P23188; PDB:1P8JA; VYQEPTDPKFPQQWYLSGVTQRDLNVKEAWAQGFTGHGIVVSILDDGIEKNHPDLAGNYD ------1111--3333-1111--------------2222---------1111--3333-3 PGASFDVNDQDPDPQPRYTQMNDNRHGTRCAGEVAAVANNGVCGVGVAYNARIGGVRMLD 333---1111--------1111-------------------------1111--------- GEVTDAVEARSLGLNPNHIHIYSASWGPEDDGKTVDGPARLAEEAFFRGVSQGRGGLGSI ----------11111111----------------------------------%%%%---- FVWASGNGGREHDSCNCDGYTNSIYTLSISSATQFGNVPWYSEACSSTLATTYSSGNQNE -------3333--1111--1111---------1111--1111--1111--------1111 KQIVTTDLRQKCTESHTGTSASAPLAAGIIALTLEANKNLTWRDMQHLVVQTSKPAHLNA -------%%%%------3333---------------1111--------------2222-- DDWATNGVGRKVSHSYGYGLLDAGAMVALAQNWTTVAPQRKCIVEILVEPKDIGKRLEVR -----1111------!!!!----------------------------------------- KAVTACLGEPNHITRLEHVQARLTLSYNRRGDLAIHLISPMGTRSTLLAARPHDYSADGF ---%%%%-1111----------------3333------1111--------1111------ NDWAFMTTHSWDEDPAGEWVLEIENTSEANNYGTLTKFTLVLYGTAPEGL -------1111--------------------------------------- >Intron-encoded DNA endonu; SWP:P03880; PDB:1P8KZ; GSDLTYAYLVGLFEGDGYFSITKKGKYLTYELGIELSIKDVQLIYKIKKILGIGIVSFRK -1111-------------------------------3333-------------------- RNEIEVALRIRDKNHLKSFILPIFEKYPFSNKQYDYLRFRNALLSGIISLEDLPDYTRSD ------------------------------3333--------------3333-------- EPLNSIESIINTSYFSAWLVGFIEAEGCFSVYKLNKDDDYLIASFDIAQRDGDILISAIR -----3333--------------------------------------------------- KYLSFTTKVYLDKTNCSKLKVTSVRSVENIIKFLQNAPVKLLGNKKLQYLLWLKQLRKIS ----------------------3333----------------3333-------------3 RYSEKIKIPSNY 333--------- 
>HYPOTHETICAL PROTEIN; SWP:Q9F5X9; PDB:1P90A; ERVPEGSIRVAIASNNGEQLDGHFGSCLRFLVYQVSAKDASLVDIRSTLDVALAEDKNAW ---2222---------------1111---------1111--------1111--------- RVEQIQDCQVLYVVSIGGPAAAKVVRAGIHPLKKPKGCAAQEAIAELQTVMAGSPPPWLA ----1111----------------1111-----1111-3333-------3333--33333 KLV 333 >RIBOSOMAL RNA LARGE SUBUN; SWP:P36999; PDB:1P91A; SFSCPLCHQPLSREKNSYICPQRHQFDAKEGYVNLLPVQHKRSRDPGDSAEQARRAFLDA ---------------------------3333----------------------------- GHYQPLRDAIVAQLRERLDDKATAVLDIGCGEGYYTHAFADALPEITTFGLDVSKVAIKA 1111------------------------------3333----3333-------------- AAKRYPQVTFCVASSHRLPFSDTSDAIIRIYAPCKAEELARVVKPGGWVITATPGPRHLE ----1111-----------------------------------2222-------1111-3 LKGLIYNEVHLHAPHAEQLEGFTLQQSAELCYPRLRGDEAVALLQTPFAWRAKPEVWQTL 333---------------2222-----------------------1111----------- AAKEVFDCQTDFNIHLWQRSY --------------------- >PLASMID PARTITION PROTEIN; SWP:Q9KJ82; PDB:1P94A; MSLEKAHTSVKKMTFGENRDLERVVTAPVSSGKIKRVNVNFDEEKHTRFKAACARKGTSI ------1111-%%%%3333----------2222--------------------------- TDVVNQLVDNWLKENE ---------------- >ENDOTHELIAL PAS DOMAIN PR; SWP:Q99814; PDB:1P97A; GAMDSKTFLSRHSMDMKFTYCDDRITELIGYHPEELLGRSAYEFYHALDSENMTKSHQNL -----------------------3333----1111----3333----------------- CTKGQVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKN --------------1111------------------------------------ >HYPOTHETICAL PROTEIN PG11; SWP:Q99WD9; PDB:1P99A; KVTIGVASNDTKAWEKVKELAKKDDIDVEIKHFSDYNLPNKALNDGDIDMNAFQHFAFLD -------------------3333-----------1111---------------------- QYKKAHKGTKISALSTTVLAPLGIYSDKIKDVKKVKDGAKVVIPNDVSNQARALKLLEAA -----1111---------------------1111-2222------------------111 GLIKLKKDFGLAGTVKDITSNPKHLKITAVDAQQTARALSDVDIAVINNGVATKAGKDPK 1----1111----3333---1111------111133331111----------1111---- NDPIFLEKSNSDAVKPYINIVAVNDKDLDNKTYAKIVELYHSKEAQKALQEDVKDGEKPV ----------3333---------3333-------------------------!!!!---- NLSKDEIKAIETSLA --------------- >Platelet glycoprotein Ib ; SWP:P07359; PDB:1P9AG; 
HPICEVSKVASHLEVNCDKRNLTALPPDLPKDTTILHLSENLLYTFSLATLMPYTRLTQL -----------------------------1111-------------33333333------ NLDRAELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTVLDVSFNRLTSLPLGAL ---------------1111---------------11111111-------------11112 RGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNNLTELPAGLLNGLENLDTLLLQ 222----------------22221111----------------22222222--------- ENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMT -------2222!!!!--------------1111---------1111----2222------ SNVASVQCDNSDKFPVYKYPGKGCPT -3333-222211111111-------- >ADENYLOSUCCINATE SYNTHETA; SWP:Q9U8D3; PDB:1P9BA; GNVVAILGAQWGDEGKGKIIDMLSEYSDITCRFNGGANAGHTISVNDKKYALHLLPCGVL ----------------------3333------------------iiii---------111 YDNNISVLGNGMVIHVKSLMEEIESVGGKLLDRLYLSNKAHILFDIHQIIDSIQETKKLK 1---------------------------3333----1111---3333-----------11 EGKQIGTTKRGIGPCYSTKASRIGIRLGTLKNFENFKNMYSKLIDHLMDLYNITEYDKEK 11-----------------------3333------------------------------- ELNLFYNYHIKLRDRIVDVISFMNTNLENNKKVLIEGANAAMLDIDFGTYPYVTSSCTTV -----------1111------------------------1111---------------33 GGVFSGLGIHHKKLNLVVGVVKSYLTRVGCGPFLTELNNDVGQYLREKGHEYGTTTKRPR 33-------1111-------------------1111----------1111---------- RCGWLDIPMLLYVKCINSIDMINLTKLDVLSGLEEILLCVNFKNKKTGELLEKGCYPVEE -------------------------33332222------------------2222---33 EISEEYEPVYEKFSGWKEDISTCNEFDELPENAKKYILAIEKYLKTPIVWIGVGPNRKNM 331111-------------1111-3333-3333----------------------3333- IVKK ---- >METHYL PARATHION HYDROLAS; SWP:Q841S6; PDB:1P9EA; AAPQVRTSAPGYYRMLLGDFEITALSDGTVALPVDKRLNQPAPKTQSALAKSFQKAPLET ----------------!!!!------------3333----3333-----1111------- SVTGYLVNTGSKLVLVDTGAAGLFGPTLGRLAANLKAAGYQPEQVDEIYITHMHPDHVGG -------------------!!!!-1111------------3333---------1111333 LMVGEQLAFPNAVVRADQKEADFWLSQTNLDKAPDDESKGFFKGAMASLNPYVKAGKFKP 3-------1111-----------------1111-------------------1111---- FSGNTDLVPGIKALASHGHTPGHTTYVVESQGQKLALLGDLILVAAVQFDDPSVTTQLDS -------2222------------------iiii----!!!!--3333---1111-1111- 
DSKSVAVERKKAFADAAKGGYLIAASHLSFPGIGHIRAEGKGYRFVPVNYSVVN ------------------------------------------------------ >EAFP 2; SWP:P83597; PDB:1P9GA; TCASRCPRPCNAGLCCSIYGYCGSGAAYCGAGNCRCQCRG --1111----2222--1111----3333-1111---1111 >INVASIN; SWP:P31489; PDB:1P9HA; PNLTAVQISPNADPALGLEYGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSV ------------1111-----------2222----------2222---2222---2222- AIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSS --2222---2222---2222--2222---2222---------2222---2222---2222 HVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEK --1111------2222-----------1111-----------1111------------- ----------------------------- >chimera of Epidermal grow; SWP:P01135; PDB:1P9JA; VVSHFNDCPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWEL -------------------------1111------2222-1111----3333-- >ORF, HYPOTHETICAL PROTEIN; SWP:NA; PDB:1P9KA; GSMIHRMSNMATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRC ---------------------3333----------------------------------- KIVAGQTVSFAGHSVQVVA ------------------- >DIHYDRODIPICOLINATE REDUC; SWP:P72024; PDB:1P9LA; MRVGVLGAKGKVGTTMVRAVAAADDLTLSAELDAGDPLSLLTDGNTEVVIDFTHPDVVMG ------1111----------------------2222-----1111--------1111--- NLEFLIDNGIHAVVGTTGFTAERFQQVESWLVAKPNTSVLIAPNFAIGAVLSMHFAKQAA -----1111--------------------3333-------------------------33 RFFDSAEVIELHHPHKADAPSGTAARTAKLIAEARKGLPPNPDATSTSLPGARGADVDGI 33------------------------------1111------------2222----iiii PVHAVRLAGLVAHQEVLFGTEGETLTIRHDSLDRTSFVPGVLLAVRRIAERPGLTVGLEP ------2222---------2222-----------------------1111-------333 LLDLH 3---- >PHOSPHOPANTOTHENOYLCYSTEI; SWP:Q5VVM3; PDB:1P9OA; RWAEVMARFAARLGAQGRRVVLVTSGGTKVPLEARPVRFLDNFSSGRRGATSAEAFLAAG -------------1111------------------------------------------- YGVLFLYRARSAFPYAHRFPPQTWLSALRPSGPLSGLLSLEAEENALPGFAEALRSYQEA -------2222---3333-3333-------------------33332222---------- AAAGTFLVVEFTTLADYLHLLQAAAQALNPLGPSAMFYLAAAVSDFYVPPLQITMKMVPK ---------------------------33331111------------------------1 
LLSPLVKDWAPKAFIISFKLETDPAIVINRARKALEIYQHQVVVANIFVLIVTKDSETKL 111-3333-1111----------------------------------------------- LLSEEEIEKGVEIEEKIVDNLQSRHTAFI ----------------------------- >TRNA (GUANINE-N(1)-)-METH; SWP:P07020; PDB:1P9PA; MWIGIISLFPEMFRAITDYGVTGRAVKNGLLSIQSWSPRDFTHDRHRTVDDRPYGGGPGM --------3333------------------------3333---1111------------- LMMVQPLRDAIHAAKAAAGEGAKVIYLSPQGRKLDQAGVSELATNQKLILVCGRYEGIDE ------------------2222-----1111----------1111--------!!!!-33 RVIQTEIDEEWSIGDYVLSGGELPAMTLIDSVSRFIPGVLGEGLLDCPHYTRPEVLEGME 33---------------------------------1111----------------iiii- VPPVLLSGNHAEIRRWRLKQSLGRTWLRRPELLENLALTEEQARLLAEFKTEHAQ -3333-----------------------33331111------------------- >REPLICASE POLYPROTEIN 1AB; SWP:Q05002; PDB:1P9SA; AGLRKAQPSGFVEKCVVRVCYGNTVLNGLWLGDIVYCPRHVIASNTTSAIDYDHEYSIRL --------33331111----!!!!-------------3333-----------------33 HNFSIISGTAFLGVVGATHGVTLKIKVSQTNHTPRHSFRTLKSGEGFNILACYDGCAQGV 33----------------!!!!-------------------2222--------------- FGVNRTNWTIRGSFINGACGSPGYNLKNGEVEFVYHQIELGSGSHVGSSFDGVYGGFEDQ ----1111------2222--------%%%%---------3333-----1111-%%%%--- PNLQVESANQLTVNVVAFLYAAILNGCTWWLKGEKLFVEHYNEWAQANGFTANGEDAFSI ---------------------------1111-----3333--------------111133 LAAKTGVCVERLLHAIQVLNNGFGGKQILGYSSLNDEFSINEVVKQFGVN 33-----1111------3333-iiii-iiii------------------- >TRIGGER FACTOR; SWP:P22257; PDB:1P9YA; MQVSVETTQGLGRRVTITIAADSIETAVKSELVNVAKKVRIDGLRKGKVPMNIVAQRYGA --------!!!!-------3333-----------------!!!!2222-3333------- SVRQDVLGDLMSRNFIDAIIKEKINPAGAPTYVPGEYKLGEDFTYSVEFEVYPEVEL -------------------1111---------------------------------- >PEROXIDASE; SWP:P80679; PDB:1PA2A; MQLNATFYSGTCPNASAIVRSTIQQALQSDTRIGASLIRLHFHDCFVNGCDASILLDDTG -----1111--1111--------------1111------------------3333---11 SIQSEKNAGPNVNSARGFNVVDNIKTALENACPGVVSCSDVLALASEASVSLAGGPSWTV 11-3333------------------------2222---------------1111------ LLGRRDSLTANLAGANSSIPSPIESLSNITFKFSAVGLNTNDLVALSGAHTFGRARCGVF 
--------------------1111---------1111------------------33333 NNRLFNFSGTGNPDPTLNSTLLSTLQQLCPQNGSASTITNLDLSTPDAFDNNYFANLQSN 333---%%%%---1111------------11111111-------1111--33333333-- DGLLQSDQELFSTTGSSTIAIVTSFASNQTLFFQAFAQSMINMGNISPLTGSNGEIRLDC ---3333-----2222---------------------------------!!!!-----11 KKVNGS 112222 >PROBABLE RIBOSOME-BINDING; SWP:P75589; PDB:1PA4A; KERLENDIIRLINRTVIHEIYNETVKTGHVTHVKLSDDLLHVTVYLDCYNREQIDRVVGA ---------------2222----3333-------------------------3333---- FNQAKGVFSRVLAHNLYLAKAVQIHFVKDKAIDNAM ------3333-------------------------- >Microtubule-associated pr; SWP:Q15691; PDB:1PA7A; MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKK -----1111------------------------3333--------------2222-3333 VKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYD -1111--------------------------33333333--------------------- GKDYDPVAAR ----3333-- >CYCLODEXTRIN GLUCANOTRANS; SWP:P05618; PDB:1PAMA; APDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGSCTNLRLYCGGDWQGIINK -1111------1111-----3333----3333--!!!!-1111----------------- INDGYLTGMGITAIWISQPVENIYSVINYSGVNNTAYHGYWARDFKKTNPAYGTMQDFKN 1111-3333--------------------------1111----1111-3333-------- LIDTAHAHNIKVIIDFAPNHTSPASSDDPSFAENGRLYDNGNLLGGYTNDTQNLFHHYGG -----1111---------------3333----%%%%--iiii-------1111------- TDFSTIENGIYKNLYDLADLNHNNSSVDVYLKDAIKMWLDLGVDGIRVDAVKHMPFGWQK ----3333------------11113333----------1111-------1111------- SFMATINNYKPVFTFGEWFLGVNEISPEYHQFANESGMSLLDFRFAQKARQVFRDNTDNM --------------------2222----------------------------------33 YGLKAMLEGSEVDYAQVNDQVTFIDNHDMERFHTSNGDRRKLEQALAFTLTSRGVPAIYY 33-------------1111------1111----1111-----------1111------22 GSEQYMSGGNDPDNRARLPSFSTTTTAYQVIQKLAPLRKSNPAIAYGSTHERWINNDVII 22---------------------------------3333-3333----------1111-- YERKFGNNVAVVAINRNMNTPASITGLVTSLPRGSYNDVLGGILNGNTLTVGAGGAASNF ----!!!!-----------------------------1111-----------iiii---- TLAPGGTAVWQYTTDATTPIIGNVGPMMAKPGVTITIDGRGFGSGKGTVYFGTTAVTGAD 
--2222-----------------------2222-----------------!!!!--!!!! IVAWEDTQIQVKIPAVPGGIYDIRVANAAGAASNIYDNFEVLTGDQVTVRFVINNATTAL ----1111------------------1111----------------------------22 GQNVFLTGNVSELGNWDPNNAIGPMYNQVVYQYPTWYYDVSVPAGQTIEFKFLKKQGSTV 22-------3333%%%%1111----------------------------------!!!!- TWEGGANRTFTTPTSGTATVNVNWQP -------------------------- >Translation initiation fa; SWP:P32501; PDB:1PAQA; DFEKEGIATVERAENNHDLDTALLELNTLRSNVTYHEVRIATITALLRRVYHFIATQTLG --3333------------3333-------------------------------1111--- PKDAVVKVFNQWGLLFKRQAFDEEEYIDLNIIEKIVEQSFDKPDLILFSALVSLYDNDII -------------3333------------------------3333---------1111-- EEDVIYKWWDNVSTDPRYDEVKKLTVKWVEWLQNAD --------1111--33333333-------------- >HYPOTHETICAL PROTEIN TA11; SWP:Q9HI35; PDB:1PAVA; MDVKPDRVIDARGSYCPGPLMELIKAYKQAKVGEVISVYSTDAGTKKDAPAWIQKSGQEL -------------------------3333-2222---------333311113333----- VGVFDRNGYYEIVMKKVK ------------------ >PSEUDOAZURIN PRECURSOR; SWP:P04377; PDB:1PAZ; ENIEVHMLNKGAEGAMVFEPAYIKANPGDTVTFIPVDKGHNVESIKDMIPEGAEKFKSKI ----------1111-----------2222---------------2222-2222-----22 NENYVLTVTQPGAYLVKCTPHYAMGMIALIAVGDSPANLDQIVSAKKPKIVQERLEKVIA 22----------------1111-------------1111--------------------- >NEUROGENIC LOCUS NOTCH HO; SWP:P46531; PDB:1PB5A; EEACELPECQEDAGNKVCSLQCNNHACGWDGGDCS -----3333---------1111-----%%%%---- >HYPOTHETICAL TRANSCRIPTIO; SWP:P75899; PDB:1PB6A; AVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQIL ---------------------33333333-------33333333--3333---------- DIWLAPLKAFREDFAPLAAIKEYIRLKLEVSRDYPQASRLFCMEMLAGAPLLMDELTGDL -11113333-1111------------------------------11113333-------- KALIDEKSALIAGWVKSGKLAPIDPQHLIFMIWASTQHYADFAPQVEAVTGATLRDEVFF -----------------------------------33331111---------1111---- NQTVENVQRIIIEGIRPR ----------1111---- >N-METHYL-D-ASPARTATE RECE; SWP:P35439; PDB:1PB7A; TRLKIVTIHQEPFVYVKPTMSDGTCKEEFTVNGDPVKKVICTGPNHTVPQCCYGFCIDLL -------------------1111------1111--------------------------- 
IKLARTMNFTYEVHLVADGKFGTQERVNNSNKKEWNGMMGELLSGQADMIVAPLTINNER ---------------1111-------%%%%--------------------------3333 AQYIEFSKPFKYQGLTILVKKGTRITGINDPRLRNPSDKFIYATVKQSSVDIYFRRQVEL --------------------------1111------3333-------------1111111 STMYRHMEKHNYESAAEAIQAVRDNKLHAFIWDSAVLEFEASQKCDLVTTGELFFRSGFG 1----3333------------1111-------3333-------1111------------- IGMRKDSPWKQNVSLSILKSHENGFMEDLDKTWVRYQECDS ----------------------------------------- >6-PHOSPHO-BETA-D-GALACTOS; SWP:P11546; PDB:1PBGA; MTKTLPKDFIFGGATAAYQAEGATHTDGKGPVAWDKYLEDNYWYTAEPASDFYHKYPVDL -----1111-------3333-----%%%%--3333------------!!!!--------- ELAEEYGVNGIRISIAWSRIFPTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTLHHFDTP ---1111--------1111-1111-------------------1111------------3 EALHSNGDFLNRENIEHFIDYAAFCFEEFPEVNYWTTFNEIGPIGDGQYLVGKFPPGIKY 333---!!!!3333--------------3333---------------------------- DLAKVFQSHHNMMVSHARAVKLYKDKGYKGEIGVVHALPTKYPYDPENPADVRAAELEDI --------------------------------------------1111------------ IHNKFILDATYLGHYSDKTMEGVNHILAENGGELDLRDEDFQALDAAKDLNDFLGINYYM ------------------------------------3333------1111---------- SDWMQAFDGETEIIKYQIKGVGRRVAPDYVPRWIIYPEGLYDQIMRVKNDYPNYKKIYIT -----------------2222--------------3333-----------1111------ ENGLGYKDEFVDNTVYDDGRIDYVKQHLEVLSDAIADGANVKGYFIWSLMDVFSWSNGYE ----------%%%%--------------------1111------------------!!!! 
KRYGLFYVDFDTQERYPKKSAHWYKKLAETQVIE ---------------------------------- >BOWMAN-BIRK PROTEINASE IN; SWP:P56679; PDB:1PBIA; KSACCDTCLCTKSNPPTCRCVDVGETCHSACLSCICAYSNPPKCQCFDTQKFCYKQCHNS ---------------------------1111----------------------------- ELEEVIKN -------- >HYPOTHETICAL PROTEIN; SWP:O27659; PDB:1PBJA; RVEDVVTDVDTIDITASLEDVLRNYVENAKGSSVVVKEGVRVGIVTTWDVLEAIAEGDDL 3333--------1111--------------------iiii------------------33 AEVKVWEVERDLVTISPRATIKEAAEKVKNVVWRLLVEEDDEIIGVISATDILRAK 33-3333--1111--1111--------1111-------iiii-----3333----- >FKBP25; SWP:Q00688; PDB:1PBK; PKYTKSVLKKGDKTNFPKKGDVVHCWYTGTLQDGTVFDTNIQTSAKKKKNAKPLSFKVGV -----------------3333---------1111-------------------------- GKVIRGWDEALLTMSKGEKARLEIEPEWAYGKKGQPDAKIPPNAKLTFEVELVDID --------------2222------3333------3333------------------ >ELONGATION FACTOR 1-GAMMA; SWP:P26641; PDB:1PBUA; AKDPFAHLPKSTFALDEFKRKYSNEDTLSVALPYFWEHFDKDGWSLWYSEYRFPEELTQT -------------3333--------3333----------1111----------------3 FMSCNLITGMFQRLDKLRKNAFASVILFGTNNSSSISGVWVFRGQELAFPLSPDWQVDYE 333------3333---3333---------3333--------------3333------333 SYTWRKLDPGSEETQTLVREYFSWEGAFQHVGKAFNQGKIFK 3-----------3333-----------3333----------- >PHOSPHATIDYLINOSITOL 3-KI; SWP:P27986; PDB:1PBWA; LPDLAEQFAPPDIAPPLLIKLVEAIEKKGLECSTLYRTQSSSNLAELRQLLDCDTPSVDL --3333----------------------1111-2222----------3333-------33 EMIDVHVLADAFKRYLLDLPNPVIPAAVYSEMISLAPEVQSSEEYIQLLKKLIRSPSIPH 33-------------1111---------------3333------------33331111-- QYWLTLQYLLKHFFKLSQTSSKNLLNARVLSEIFSPMLFRFSAASSDNTENLIKVIEILI -----------------------------------------------------------1 STEW 111- >quinohemoprotein amine de; SWP:Q8VUT0; PDB:1PBYA; VTGEEVLQNACAACHVQHEDGRWERIDAARKTPEGWDMTVTRMMRNHGVALEPEERAAIV -----------------1111---1111-------------------------------- RHLSDTRGLSLAETEERRYILEREPVAWDEGPDTSMTQTCGRCHSYARVALQRRTPEDWK ---------333322223333-----------------------33331111-------- HLVNFHLGQFPTLEYQALARDRDWWGIAQAEIIPFLARTYPLGEAPDAYADDASGAYVLA 
---------1111--2222----------------------------------------- GRQPGRGDYTGRLVLKKAGEDYEVTMTLDFADGSRSFSGTGRILGAGEWRATLSDGTVTI -----------------!!!!--------1111---------------------!!!!-- RQIFALQDGRFSGRWHDADSDVIGGRLAAVKADAAPQVLAVAPARLKIGEETQLRVAGTG ------%%%%------1111----------1111------------2222---------- LGSDLTLPEGVAGSVESAGNGVTVLKLTATGTPGPVSLELGGQKVDLVAYDRPDRISIVP ------------------iiii-----------------iiii----------------- DLTIARIGGNGGPIPKVPAQFEAMGWLNGPDGQPGTGDDIALGAFPASWATDNFDEEAEK --------iiii------------------------------------------------ MQDAKYAGSIDDTGLFTPAEAGPNPERPMQTNNAGNLKVIATVDAEGEPLSAEAHLYATV -3333-----1111---------3333%%%%----------------------------- QRFVDAPIR --------- >Quinohemoprotein amine de; SWP:Q8VUS7; PDB:1PBYB; RDYILAPARPDKLVVIDTEKMAVDKVITIADAGPTPMVPMVAPGGRIAYATVNKSESLVK -----------------1111--------------------2222--------------- IDLVTGETLGRIDLSTPEERVKSLFGAALSPDGKTLAIYESPVRLELTHFEVQPTRVALY ---------------1111----------1111--------------------------- DAETLSRRKAFEAPRQITMLAWARDGSKLYGLGRDLHVMDPEAGTLVEDKPIQSWEAETY -1111-----------------1111-------------------------11113333- AQPDVLAVWNQHESSGVMATPFYTARKDIDPADPTAYRTGLLTMDLETGEMAMREVRIMD -----------3333----------111111113333----------------------- VFYFSTAVNPAKTRAFGAYNVLESFDLEKNASIKRVPLPHSYYSVNVSTDGSTVWLGGAL --------3333-----------------------------------1111--------- GDLAAYDAETLEKKGQVDLPGNASMSLASVRLFTRDE ------------------2222--!!!!--------- >Quinohemoprotein amine de; SWP:Q8VUS8; PDB:1PBYC; MNALVGCTTSFDPGWEVDAFGAVSNLCQPMEADLYGCADPCWPAQVADTLNTYPNWSAGA 3333-------------1111-1111--3333-3333----------3333-11112222 DDVMQDWRKLQSVFPETK -33333333--------- ------------------------------------------------------------ - >PHOSPHATE-BINDING PROTEIN; SWP:P15712; PDB:1PC3A; VATTPASSPVTLAETGSTLLYPLFNLWGPAFHERYPNVTITAQGTGSGAGIAQAAAGTVN -----------------11113333---------1111---------------1111--- IGASDAYLSEGDMAAHKGLMNIALAISAQQVNYNLPGVSEHLKLNGKVLAAMYQGTIKTW ---------------2222---------------2222-------------1111---11 
DDPQIAALNPGVNLPGTAVVPLHRSDGSGDTFLFTQYLSKQDPEGWGKSPGFGTTVDFPA 113333--2222--------------------------------3333------------ VPGALGENGNGGMVTGCAETPGCVAYIGISFLDQASQRGLGEAQLGNSSGNFLLPDAQSI 2222---------------2222----1111----1111-------1111---------- QAAAAGFASKTPANQAISMIDGPAPDGYPIINYEYAIVNNRQKDAATAQTLQAFLHWAIT ------1111-1111--------1111--------------------------------- DGNKASFLDQVHFQPLPPAVVKLSDALIATISS ---33333333-----3333-------1111-- >PROTEIN NINB; SWP:P03765; PDB:1PC6A; MKKLTFEIRSPAHQQNAIHAVQQILPDPTKPIVVTIQERNRSLDQNRKLWACLGDVSRQV -----------------------------------------3333--------------- EWHGRWLDAESWKCVFTAALKQQDVVPNLAGNGFVVIGQSTSRMRVGEFAELLELIQAFG -------3333----------------3333--------3333-3333------------ TERGVKWSDEARLALEWKARW 1111----------------- >PROCARBOXYPEPTIDASE A; SWP:P09954; PDB:1PCA; KEDFVGHQVLRISVDDEAQVQKVKELEDLEHLQLDFWRGPARPGFPIDVRVPFPSIQAVK ---2222---------------------1111---------2222------3333----- VFLEAHGIRYTIMIEDVQLLLDEEQEQMFASQGR ---1111--------3333---------1111-- >PEC-60; SWP:P37109; PDB:1PCE; EKQVFSRMPICEHMTESPDCSRIYDPVCGTDGVTYESECKLCLARIENKQDIQIVKDGEC ----------------------------3333---------------------------- >TRANSCRIPTIONAL COACTIVAT; SWP:P53999; PDB:1PCFA; AMFQIGKMRYVSVRDFKGKVLIDIREYWMDPEGEMKPGRKGISLNPEQWSQLKEQISDID -----2222------iiii----------1111--------------------------- DAVRKL ------ >PHOSPHOCARRIER PROTEIN; SWP:P45611; PDB:1PCH; AKFSAIITDKVGLHARPASVLAKEASKFSSNITIIANEKQGNLKSIMNVMAMAIKTGTEI --------1111------------3333-------!!!!--1111----3333-2222-- TIQADGNDADQAIQAIKQTMIDTALIQG -----1111-----------1111---- >PLASTOCYANIN; SWP:P21697; PDB:1PCS; ATVKMGSDSGALVFEPSTVTIKAGEEVKWVNNKLSPHNIVFDADGVPADTAAKLSHKGLL ------1111-----------2222----------------------------------- FAAGESFTSTFTEPGTYTYYCEPHRGAGMVGKVVVE -2222---------------33331111-------- >OSMOTIN; SWP:P14170; PDB:1PCVA; ATIEVRNNCPYTVWAASTPIGGGRRLDRGQTWVINAPRGTKMARVWGRTNCNFNAAGRGT --------------------------2222-----------------------1111--- 
CQTGDCGGVLQCTGWGKPPNTLAEYALDQFSGLDFWDISLVDGFNIPMTFAPTNPSGGKC -----%%%%------------------------------1111----------------- HAIHCTANINGECPRELRVPGGCNNPCTTFGGQQYCCTQGPCGPTFFSKFFKQRCPDAYS -------3333--3333-2222--3333---3333---------3333---3333----- YPQDDPTSTFTCPGGSTNYRVIFCP ----1111----2222--------- >PROTEIN TRANSPORT PROTEIN; SWP:P40482; PDB:1PCXA; RPMNQLYPIDLLTELPPPITDLTLPPPPLVIPPERMLVPSELSNASPDYIRSTLNAVPKN ---------3333-----3333---------3333----3333--3333----------3 SSLLKKSKLPFGLVIRPYQHLYDDIDPPPLNEDGLIVRCRRCRSYMNPFVTFIEQGRRWR 333------------------1111---------------------1111---%%%%--- CNFCRLANDVPMQMDQPKSRYDRNEIKCAVMEYMAPKEYTLRQPPPATYCFLIDVSQSSI ------------------11113333---------1111----------------33333 KSGLLATTINTLLQNLDSIPNHDERTRISILCVDNAIHYFKIPLDSENINMMDIADLEEP 333-----------1111-------------------------3333--------11112 NSMVVSLKACRQNIETLLTKIPQIFQSNLITNFALGPALKSAYHLIGGVGGKIIVVSGTL 222-----------------33331111-------------------------------- PNLGIGKLQRDSFYKNFTIDCSKVQITVDLFLASEDYMDVASLSNLSRFTAGQTHFYPGF ---2222-------------3333--------------3333----1111------2222 SGKNPNDIVKFSTEFAKHISMDFCMETVMRARGSTGLRMSRFYGHFFNRSSDLCAFSTMP 3333-------------1111------------2222----------------------- RDQSYLFEVNVDESIMADYCYVQVAVLLSLNNSQRRIRIITLAMPTTESLAEVYASADQL -----------------------------------------------------1111--- AIASFYNSKAVEKALNSSLDDARVLINKSVQDILATYKKEIVAGGAPLRLCANLRMFPLL --------------------------------------------------1111------ MHSLTKHMAFRSGIVPSDHRASALNNLESLPLKYLIKNIYPDVYSLHDMADEAGLPVGTI ------3333--------------------3333---------------3333------- VLPQPINATSSLFERYGLYLIDNGNELFLWMGGDAVPALVFDVFGTQDIFDIPIGKQEIP ---------1111----------------------------------1111--------- VVENSEFNQRVRNIINQLRNHDDVITYQSLYIVRGASLSEPVNHASAREVATLRLWASST ---------------3333----------------------------------------- LVEDKILNNESYREFLQIMKARISK -----!!!!---------------- >Glutathione-requiring pro; SWP:O35543; PDB:1PD21; MPNYKLLYFNMRGRAEIIRYIFAYLDIKYEDHRIEQADWPKIKPTLPFGKIPVLEVEGLT 
------------1111------------------333333331111---------%%%%- LHQSLAIARYLTKNTDLAGKTELEQCQVDAVVDTLDDFMSLFPWAEENQDLKERTFNDLL --------------3333-------------------3333-1111-------------- TRQAPHLLKDLDTYLGDKEWFIGNYVTWADFYWDICSTTLLVLKPDLLGIYPRLVSLRNK --3333--------!!!!-3333--------------------11111111--------- VQAIPAISAWILKRPQTKL ---3333------------ >NONSTRUCTURAL PROTEIN NS2; SWP:Q67248; PDB:1PD3A; GKWREQLGQKFEEIRWLIEEVRHRLKITENSFEQITFMQALQLLLEVEQEIRTF ---------------------------1111----------------------- >Myosin-binding protein C,; SWP:Q14896; PDB:1PD6A; DEKKSTAFQKKLEPAYQVSKGHKIRLTVELADHDAEVKWLKNGQEIQMSGSKYIFESIGA ------------------2222-------------------------------------- KRTLTISQCSLADDAAYQCVVGGEKCSTELFVKE ---------------------------------- ------------------------------------------------------------ --------------------------- >CHAPERONE PROTEIN PAPD; SWP:P42190; PDB:1PDKB; LLDRPCHVSGDSLNKHVVFKTRASRDFWYPPGRSPTESFVIRLENCHATAVGKIVTLTFK ---------1111---------1111---------------------1111--------- GTEEAALPGHLKVTGVNAGRLGIALLDTDGSSLLKPGTSHNKGQGEKVTGNSLELPFGAY ---1111-------1111--------1111----2222----------!!!!-------- VVATPEALRTKSVVPGDYEATATFELTYR -------1111------------------ >PRD PAIRED DOMAIN; SWP:P06601; PDB:1PDNC; QGRVNQLGGVFINGRPLPNNIRLKIVEMAADGIRPCVISRQLRVSHGCVSKILNRYQETG ----1111--------------------1111-3333----------------------- SIRPGVIGGSKPRIATPEIENRIEEYKRSSPGMFSWEIREKLIREGVCDRSTAPSVSAIS -----------------------------2222---------3333--3333-------- RLV --- >MANNOSE PERMEASE; SWP:P69797; PDB:1PDO; TIAIVIGTHGWAAEQLLKTAEMLLGEQENVGWIDFVPGENAETLIEKYNAQLAKLDTTKG -----------------------------------2222------------1111-1111 VLFLVDTWGGSPFNAASRIVVDKEHYEVIAGVNIPMLVETLMARDDDPSFDELVALAVET ------2222-------------------------------------------------- GREGVKALK -3333---- >HUMAN DISCS LARGE PROTEIN; SWP:Q12959; PDB:1PDR; ITREPRKVVLHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDRIISVNSVD -----------------------------------2222--------2222--------- LRAASHEQAAAALKNAGQAVTIVAQYRPEEYSRQHA 
------------1111-------------------- >NUCLEAR HORMONE RECEPTOR ; SWP:P49869; PDB:1PDUA; AISLITALVRSHVDTTPDPSCLDYSHYEEQSMSEADKVQQFYQLLTSSVDVIKQFAEKIP ------------1111-3333--1111-------------------------------22 GYFDLLPEDQELLFQSASLELFVLRLAYRARIDDTKLIFCNGTVLHRTQCLRSFGEWLND 22------------------------------------1111---3333-----3333-- IMEFSRSLHNLEIDISAFACLCALTLITERHGLREPKKVEQLQMKIIGSLRDHVTYNAEA ---------------------------------------------------------333 QKKQHYFSRLLGKLPELRSLSVQGLQRIFYLKLEDLVPAPALIENMFVTT 3--------------------------------------33333333--- >ENOLASE; SWP:P56252; PDB:1PDZ; SITKVFARTIFDSRGNPTVEVDLYTSKGLFRAAVPSGASTGVHEALEMRDGDKSKYHGKS -----------1111---------1111-----------------------3333iiii- VFNAVKNVNDVIVPEIIKSGLKVTQQKECDEFMCKLDGTENKSSLGANAILGVSLAICKA ---------------------1111----------------------------------- GAAELGIPLYRHIANLANYDEVILPVPAFNVINGGSHAGNKLAMQEFMILPTGATSFTEA --------------1111--------------------------------1111-3333- MRMGTEVYHHLKAVIKARFGLDATAVGDEGGFAPNILNNKDALDLIQEAIKKAGYTGKIE -------------------3333---1111-------3333-------------2222-- IGMDVAASEFYKQNNIYDLDFKTANNDGSQKISGDQLRDMYMEFCKDFPIVSIEDPFDQD -----3333---------%%%%----------3333------------------------ DWETWSKMTSGTTIQIVGDDLTVTNPKRITTAVEKKACKCLLLKVNQIGSVTESIDAHLL ------------------3333---------------------1111------------- AKKNGWGTMVSHRSGETEDCFIADLVVGLCTGQIKTGAPCRSERLAKYNQILRIEEELGS -1111--------------3333---1111----------3333-----------3333- GAKFAGKNFRAPS ----!!!!----- >2-DEHYDRO-3-DEOXYPHOSPHOO; SWP:O66496; PDB:1PE1A; MEKFLVIAGPCAIESEELLLKVGEEIKRLSEKFKEVEFVFKSSFDKANRSSIHSFRGHGL --------------------------------1111--------------1111------ EYGVKALRKVKEEFGLKITTDIHESWQAEPVAEVADIIQIPAFLCRQTDLLLAAAKTGRA -----------------------3333-3333--------3333----------3333-- VNVKKGQFLAPWDTKNVVEKLKFGGAKEIYLTERGTTFGYNNLVVDFRSLPIMKQWAKVI -----11113333--------1111----------------------------------- YDATHSVQLPGGGMREFIFPLIRAAVAVGCDGVFMETHPEPEKALSDASTQLPLSQLEGI 
---3333------3333----------------------3333---1111--3333---- IEAILEIREVASKYYETI ----------3333---- >NEUROTOXIN CN11; SWP:P63019; PDB:1PE4A; RDGYPLASNGCKFGCSGLGENNPTCNHVCEKKAGSDYGYCYAWTCYCEHVAEGTVLWGDS -------3333------------------------------------------------- GTGPCRS ------- >PECTATE LYASE A; SWP:P29155; PDB:1PE9A; AELVSDKALESAPTVGWASQNGFTTGGAAATSDNIYIVTNISEFTSALSAGAEAKIIQIK ------1111------1111------11113333---------------!!!!------- GTIDISGGTPYTDFADQKARSQINIPANTTVIGLGTDAKFINGSLIIDGTDGTNNVIIRN ---1111-----------------------------------------1111-------- VYIQTPIDVEPHYEKGDGWNAEWDAMNITNGAHHVWIDHVTISDGNFTDDMYTTKDGETY -------------2222--------------------------!!!!1111---iiii-- VQHDGALDIKRGSDYVTISNSLIDQHDKTMLIGHSDSNGSQDKGKLHVTLFNNVFNRVTE ----------------------------------1111---2222--------------- RAPRVRYGSIHSFNNVFKGDAKDPVYRYQYSFGIGTSGSVLSEGNSFTIANLSASKACKV ----------------------------------2222--------------3333---- VKKFNGSIFSDNGSVLNGSAVDLSGCGFSAYTSKIPYIYDVQPMTTELAQSITDNAGSGK ---------------iiii---1111---------------------------------- L - >AMIDASE OPERON; SWP:P27017; PDB:1PEA; PLIGLLFSETGVTADIERSQRYGALLAVEQLNREGGVGGRPIETLSQDPGGDPDRYRLCA ------------------------------------iiii--------%%%%-------- EDFIRNRGVRFLVGCYMSHTRKAVMPVVERADALLCYPTPYEGFEYSPNIVYGGPAPNQN -------------------------------------------------------3333- SAPLAAYLIRHYGERVVFIGSDYIYPRESNHVMRHLYRQHGGTVLEEIYIPLYPSDDDLQ -------------------------------------1111------------------- RAVERIYQARADVVFSTVVGTGTAELYRAIARRYGDGRRPPIASLTTSEAEVAKMESDVA -----3333----------3333----------------------------111111112 EGQVVVAPYFSSIDTPASRAFVQACHGFFPENATITAWAEAAYWQTLLLGRAAQAAGNWR 222------1111-3333-----------1111--------------------3333--- VEDVQRHLYDIDIDAPQGPVRVERQNNHSRLSSRIAEIDARGVFQVRWQSPEPIRPDPYV ----1111------3333--------------------1111------------------ VVHNLDDW 3333---- >PEPNH1; SWP:P19836; PDB:1PEH; NEKKYHLQERVDKVKKKVKDVEEKSKEWVQKVE -3333-------3333--1111--3333----- >Ribonucleoside-diphosphat; SWP:Q08698; PDB:1PEQA; 
RVMQETMDYHALNAMLNLYDKAGHIQFDKDQQAIDAFFATHVRPHSVTFASQHERLGTLV --------------1111-1111--3333------------3333--------------1 REGYYDDAVLARYDRAFVLRLFEHAHASGFRFQTFLGAWKFYTSYTLKTFDGKRYLEHFE 111------1111-----------1111-----3333-----------1111-------- DRVTMVALTLAQGDETLATQLTDEMLSGRFQPATPTFLNCGKQQRGELVSCFLLRIEDNM ----------iiii------------------------2222------------------ ESIGRAVNSALQLSKRGGGVAFLLSNLREAGAPIKRIENQSSGVIPVMKMLEDAFSYANQ ------------3333-------1111------iiii------3333------------- GAGAVYLHAHHPDILRFLDTKRIKTLSLGVVIPDITFRLAKENAQMALFSPYDIQRRYGK -------1111-------1111----------3333------------------------ PFGDIAISERYDELIADPHVRKTYINARDFFQTLAEIQFESGYPYIMFEDTVNRANPIAG 1111-3333-------1111---------------------------------------- RINMSNLCSEILQVNSASRYDDNLDYTHIGHDISCNLGSLNIAHVMDSPDIGRTVETAIR --------------------1111------------------------------------ GLTAVSDMSHIRSVPSIAAGNAASHAIGLGQMNLHGYLAREGIAYGSPEALDFTNLYFYT -----1111-33333333-------------------------2222------------- ITWHAVHTSMRLARERGKTFAGFAQSRYASGDYFTQYLQDDWQPKTAKVRALFARSGITL -------------------2222----3333--3333----------------1111--- PTREMWLKLRDDVMRYGIYNQNLQAVPPTGSISYINHATSSIHPIVAKIEIRKEGKTGRV -----------------------------3333---------------------1111-- YYPAPFMTNENLDMYQDAYDIGPEKIIDTYAEATRHVDQGLSLTLFFPDTATTRDINKAQ ---222211111111-3333-------------------------------3333----- IYAWRKGIKSLYYIRLRQL ---1111------------ >COLLAGENASE-3; SWP:P45452; PDB:1PEX; TPDKCDPSLSLDAITSLRGETMIFKDRFFWRLHPQQVDAELFLTKSFWPELPNRIDAAYE ------------------------!!!!--------------1111-3333--------- HPSHDLIFIFRGRKFWALNGYDILEGYPKKISELGLPKEVKKISAAVHFEDTGKTLLFSG ----------!!!!----!!!!---------1111------------------------- NQVWRYDDTNHIMDKDYPRLIEEDFPGIGDKVDAVYEKNGYIYFFNGPIQFEYSIWSNRI -------------------3333----------------------!!!!----------- VRVMPANSILWC ----1111---- >HYPOTHETICAL PROTEIN YJGH; SWP:P39332; PDB:1PF5A; VERTAVFPAGRHSLYAEHRYSAAIRSGDLLFVSGQVGSREDGTPEPDFQQQVRLAFDNLH -------------3333--------!!!!---------1111------------------ 
ATLAAAGCTFDDIIDVTSFHTDPENQFEDIMTVKNEIFSAPPYPNWTAVGVTWLAGFDFE ---1111-3333---------3333----------------------------iiii--- IKVIARIPEQ ---------- >POLYCOMB PROTEIN; SWP:P26017; PDB:1PFBA; DLVYAAEKIIQKRVKKGVVEYRVKWKGWNQRYNTWEPEVNILDRRLIDIYEQTNK --------------iiii------22223333----3333--------------- >IGG1 PFC' FC; SWP:NA; PDB:1PFC; RTISKAKGPPRIPEVYLLPPPRNELSKKKVSLTCMITGFYPADINVEWDSSEPSDYKNTP ------------------------------------------------------------ PVFDTDGSFFLYSRLKVDTDAWNNGESFTCSVMHEALPNHVIQKSISRSPG ----------------------3333------------------------- >FERREDOXIN; SWP:Q7M1S1; PDB:1PFD; ATYNVKLITPDGEVEFKCDDDVYVLDQAEEEGIDIPYSCRAGSCSSCAGKVVSGSIDQSD --------3333------1111------3333---------------------------- QSFLDDEQMDAGYVLTCHAYPTSDVVIETHKEEEIV ------------------------------------ >METHIONINE GAMMA-LYASE; SWP:O15565; PDB:1PFFA; SALEGKIAKLEHAEACAATASGMGAIAASVWTFLKAGDHLISDDCLYGCTHALFEHQLRK ------------------------------11112222-------------------333 FGVEVDFIDMAVPGNIEKHLKPNTRIVYFETPANPTLKVIDIEDAVKQARKQKDILVIVD 3----------22223333-1111------------------------------------ NTFASPILTNPLDLGVDIVVHSATKYINGHTDVVAGLVCSRADIIAKVKSQGIKDITGAI -3333----3333--------3333----------------------------------- ISPHDAWLITRGTLTLDMRVKRAAENAQKVAEFLHEHKAVKKVYYPGLPDHPGHEIAKKQ ------------------------------------3333----1111--1111------ MKMFGSMIAFDVDGLEKAKKVLDNCHVVSLAVSLGGPESLIQHPASMTHAGVPKEEREAA ---------------------1111-----------------3333--11113333-111 GLTDNLIRLSVGCENVQDIIDDLKQALDLVL 1-1111--------3333--------3333- ---------------------------------------------- >TFIIH basal transcription; SWP:P32780; PDB:1PFJA; MATSSEEVLLIVKKVRQKKQDGALYLMAERIAWAPEGKDRFTISHMYADIKCQKISPEGK ----------------3333------1111---------------3333----------- AKIQLQLVLHAGDTTNFHFSNESTAVKERDAVKDLLQQLLPKFKRKAN --------------------------------------1111------ >PHOSPHOFRUCTOKINASE; SWP:P0A796; PDB:1PFKA; MIKKIGVLTSGGDAPGMNAAIRGVVRSALTEGLEVMGIYDGYLGLYEDRMVQLDRYSVSD ----------------------------1111---------------------3333--- 
MINRGGTFLGSARFPEFRDENIRAVAIENLKKRGIDALVVIGGDGSYMGAMRLTEMGFPC 1111--1111--------------------1111--------3333-------1111--- IGLPGTIDNDIKGTDYTIGFFTALSTVVEAIDRLRDTSSSHQRISVVEVMGRYCGDLTLA ----------------2222---------------------------------------- AAIAGGCEFVVVPEVEFSREDLVNEIKAGIAKGKKHAIVAITEHMCDVDELAHFIEKETG -----------1111--------------1111--------------------------- RETRATVLGHIQRGGSPVPYDRILASRMGAYAIDLLLAGYGGRCVGIQNEQLVHHDIIDA -------!!!!------------------------1111--------iiii----3333- IENMKRPFKGDWLDCAKKLY --------3333---3333- >PF4-M2 CHIMERA; SWP:P02776; PDB:1PFMA; MSAKELRCQCVKTTSQVRPRHITSLEVIKAGPHCPTAQLIATLKNGRKICLDLQAPLYKK -------------3333-------------1111-------------------------- IIKKLLES ----3333 >PERFRINGOLYSIN O; SWP:P19995; PDB:1PFO; DITDKNQSIDSGISSLSYNRNEVLASNGDKIESFVPKEGKKAGNKFIVVERQKRSLTTSP ------------1111--1111-------------------------------------- VDISIIDSVNDRTYPGALQLADKALVENRPTILMVKRKPININIDLPGLKGENSIKVDDP -------------2222-------1111--------------------2222-------- TYGKVSGAIDELVSKWNEKYSSTHTLPARTQYSESMVYSKSQISSALNVNAKVLENSLGV -------------------1111---------------3333-------3333------- DFNAVANNEKKVMILAYKQIFYTVSADLPKNPSDLFDDSVTFNDLKQKGVSNEAPPLMVS 3333--------------------------3333--1111-----1111-1111------ NVAYGRTIYVKLETTSSSKDVQAAFKALIKNTDIKNSQQYKDIYENSSFTAVVLGGDAQE -----------------1111----------------------1111-----------22 HNKVVTKDFDEIRKVIKDNATFSTKNPAYPISYTSVFLKDNSVAAVHNKTDYIETTSTEY 22--------------------1111---------------------------------- SKGKINLDHSGAYVAQFEVAWDEVSYDKEGNEVLTHKTWDGNYQDKTAHYSTVIPLEANA --------------------------1111--------1111--------------1111 RNIRIKARECTGLAWEWWRDVISEYDVPLTNNINVSIWGTTLYPGSSITYN -------------3333----------------------3333-------- >DIPEPTIDYL PEPTIDASE IV S; SWP:P27487; PDB:1PFQA; SRKTYTLTDYLKNTYRLKLYSLRWISDHEYLYKQENNILVFNAEYGNSSVFLENSTFDEF ------------------------------------------------------------ GHSINDYSISPDGQFILLEYNYVKQWRHSYTASYDIYDLNKRQLITEERIPNNTQWVTWS 
---------1111------------1111--------1111------------------- PVGHKLAYVWNNDIYVKIEPNLPSYRITWTGKEDIIYNGITDWVYEEEVFSAYSALWWSP ---------%%%%-----1111---------2222------------------------- NGTFLAYAQFNDTEVPLIEYSFYSDESLQYPKTVRVPYPKAGAVNPTVKFFVVNTDSLSS -----------1111---------3333-------------------------1111--- VTNATSIQITAPASMLIGDHYLCDVTWATQERISLQWLRRIQNYSVMDICDYDESSGRWN -----------3333--------------------------------------------- CLVARQHIEMSTTGWVGRFRPSEPHFTLDGNSFYKIISNEEGYRHICYFQIDKKDCTFIT -3333---------------------3333--------1111-------1111------- KGTWEVIGIEALTSDYLYYISNEYKGMPGGRNLYKIQLSDYTKVTCLSCELNPERCQYYS ----------------------22221111---------3333----11113333----- VSFSKEAKYYQLRCSGPGLPLYTLHSSVNDKGLRVLEDNSALDKMLQNVQMPSKKLDFII ---2222--------------------------------------1111----------- LNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWATYLASTENIIVASF !!!!--------------------------2222---------3333------------- DGRGSGYQGDKIMHAINRRLGTFEVEDQIEAARQFSKMGFVDNKRIAIWGWSYGGYVTSM --------3333-1111-----------------1111---3333--------------- VLGSGSGVFKCGIAVAPVSRWEYYDSVYTERYMGLPTPEDNLDHYRNSTVMSRAENFKQV 1111-----------------------3333-----3333--------3333---3333- EYLLIHGTADDNVHFQQSAQISKALVDVGVDFQAMWYTDEDHGIASSTAHQHIYTHMSHF ------1111---3333--------1111-----------1111---------------- IKQCFSLP --1111-- >PF3 SINGLE-STRANDED DNA B; SWP:P03672; PDB:1PFSA; MNIQITFTDSVRQGTSAKGNPYTFQEGFLHLEDKPHPLQCQFFVESVIPAGSYQVPYRIN ---------------1111-----------1111-------------------------- VNNGRPELAFDFKAMKRA -iiii-----3333---- -------------------------------------------------- >METHIONYL-TRNA SYNTHETASE; SWP:P00959; PDB:1PFVA; AKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQ -----------------------------------1111--------------------- QLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSEENRQLSELIYSRLKENGFIKNRT ---------------------1111-------------------------1111------ ISQLYDPEKGMFLPDRFVKGTCPKCKSPDQYGDNCEVCGATYSPTELIEPKSVVSGATPV -------------3333-------------!!!!--------3333-------------- 
MRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIP --------------------1111----------------------------------22 NAPGKYFYVWLDAPIGYMGSFKNLCDKRGDSVSFDEYWKKDSTAELYHFIGKDIVYFHSL 22-----3333---------------------------1111--------1111------ FWPAMLEGSNFRKPSNLFVHGYVTVNGAKMSKSRGTFIKASTWLNHFDADSLRYYYTAKL ------1111--------------iiii---1111---3333-----3333--------- SSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLASELADPQLYKTFTD ----------------------------------------iiii---------------- AAEVIGEAWESREFGKAVREIMALADLANRYVDEQAPWVVAKQEGRDADLQAICSMGINL -----------------------------------33331111----------------- FRVLMTYLKPVLPKLTERAEAFLNTELTWDGIQQPLLGHKVNPFKALYNRIDMRQVEALV -------3333----------------3333----------------------------- EASKEEV ------- >Coagulation factor IX; SWP:P00741; PDB:1PFXL; YNSGKLFVRGNLRCIKCSFARVFNTKTNFWKQYVDGDQCEPNPCLNGGLCKDINSYECWC -----------------------------1111---1111----%%%%------------ QVGFEGKNCELDATCNIKNGRCKQFCKTGADSKVLCSCTTGYRLAPDQKSCKPAVPFPCG ------------------iiii----------------------1111------------ RVSVSHSPTTLTR --3333------- >ACETYL-COA SYNTHETASE; SWP:Q8ZKF6; PDB:1PG4A; HKHAIPANIADRCLINPEQYETKYKQSINDPDTFWGEQGKILDWITPYQKVKNTSFAPGN ------------------------------3333----------------------2222 VSIKWYEDGTLNLAANCLDRHLQENGDRTAIIWEGDDTSQSKHISYRELHRDVCRFANTL -----1111--------33331111-----------3333-------------------- LDLGIKKGDVVAIYMPMVPEAAVAMLACARIGAVHSVIFGGFSPEAVAGCIIDSSSRLVI 1111-2222--------3333-----------------1111------------------ TADEGVRAGRSIPLKKNVDDALKNPNVTSVEHVIVLKRTGSDIDWQEGRDLWWRDLIEKA ------iiii-------------1111------------------------33333333- SPEHQPEAMNAEDPLFILYTSGSTGKPKGVLHTTGGYLVYAATTFKYVFDYHPGDIYWCT ---------1111---------------------------------1111-2222----- ADVGWVTGHSYLLYGPLACGATTLMFEGVPNWPTPARMCQVVDKHQVNILYTAPTAIRAL -11113333-------1111--------1111-1111----------------------- MAEGDKAIEGTDRSSLRILGSVGEPINPEAWEWYWKKIGKEKCPVVDTWWQTETGGFMIT -------2222-3333----------------------%%%%--------1111------ 
PLPGAIELKAGSATRPFFGVQPALVDNEGHPQEGATEGNLVITDSWPGQARTLFGDHERF -1111---2222----2222-----1111----------------1111---2222---- EQTYFSTFKNMYFSGDGARRDEDGYYWITGRVDDVLNVSGHRLGTAEIESALVAHPKIAE -------2222---------1111-------------iiii-------------1111-- AAVVGIPHAIKGQAIYAYVTLNHGEEPSPELYAEVRNWVRKEIGPLATPDVLHWTDSLPK ---------------------2222------------------3333------------- TRSGKIMRRILRKIAAGDADPGVVEKLLEEKQA 1111---------------3333---------- >ASPARTATE CARBAMOYLTRANSF; SWP:Q55338; PDB:1PG5A; LKHIISAYNFSRDELEDIFALTDKYSKNLNDTRKILSGKTISIAFFEPSTRTYLSFQKAI -----3333------------------------1111----------------------- INLGGDVIGFSGEGENLADTIRMLNNYSDGIVMRHKYDGASRFASEISDIPVINAGDGKH 1111--------------------------------2222----------------!!!! EHPTQAVIDIYTINKHFNTIDGLVFALLGDLKYARTVNSLLRILTRFRPKLVYLISPQLL -------------------2222--------------------1111---------3333 RARKEILDELNYPVKEVENPFEVINEVDVLYVTRIQKERFVDEMEYEKIKGSYIVSLDLA ------3333--------33333333---------------333333331111--33331 NKMKKDSIILHPLPRVNEIDRKVDKTTKAKYFEQASYGVPVRMSILTKIYGE 1111111------------3333--3333----------------------- >Aspartate carbamoyltransf; SWP:P74766; PDB:1PG5B; MVSKIKNGTVIDHIPAGRAFAVLNVLGIKEGFRIALVINVDSKKMGKKDIVKIEDKEISD --------------2222-----1111-------------------------------33 TEANLITLIAPTATINIVREYEVVKKTKLEVPKVVKGILKCPNPYCITSNDVEAIPTFKT 33-3333--1111-----iiii--------------------111111111111------ LTEKPLKMRCEYCETIIDENEIMSQILG ---------------------------- >HYPOTHETICAL PROTEIN SPYM; SWP:Q8P2Q3; PDB:1PG6A; RRFTIDQNQFPLVEIDLEHGGSVYLQQGSVYHTENVTLNTKLNGLGKLVGAIGRSVSGES -----------------2222----2222---1111------------------------ FITQASNGDGKLALAPNTPGQIVALELGEKQYRLNDGAFLALDGSAQYKERQNIGGGLFV ----------------------------------2222----3333-------------- TTEGLGTLLANSFGSIKKITLDGGTTIDNAHVVAWSRELDYDIHLENGFQSIGTGEGVVN ---------------------------3333----3333----------3333------- TFRGHGEIYIQSLNLEQFAGTLKRYL ---------------------3333- >HUMANIZED ANTIBODY D3H44; SWP:NA; PDB:1PG7X; 
EVQLQQSGPELVKPGASVKISCKASGYSFTGHLLNWVKQSHGKNLEWIGLVHPHNGAITY ------------2222-----------1111----------------------------- NQKFKDKATLTVDRSSTTAYIELVRL 3333---------1111--------- >6-PHOSPHOGLUCONATE DEHYDR; SWP:P31072; PDB:1PGJA; SMDVGVVGLGVMGANLALNIAEKGFKVAVFNRTYSKSEEFMKANASAPFAGNLKAFETME --------------------------------3333-------1111-1111-----333 AFAASLKKPRKALILVQAGAATDSTIEQLKKVFEKGDILVDTGNAHFKDQGRRAQQLEAA 3-3333-----------------------1111------------3333----------- GLRFLGMGISGGEEGARKGPAFFPGGTLSVWEEIRPIVEAAAAKADDGRPCVTMNGSGGA --------------------------33333333----------------------!!!! GSCVKMYHNSGEYAILQIWGEVFDILRAMGLNNDEVAAVLEDWKSKNFLKSYMLDISIAA ----------------------------------------------1111---------- ARAKDKDGSYLTEHVMDRIGSKGTGLWSAQEALEIGVPAPSLNMAVVSRQFTMYKTERQA ----1111-3333----------------------------------------------- NASNAPGITQSPGYTLKNKSPSGPEIKQLYDSVCIAIISCYAQMFQCLREMDKVHNFGLN ------1111---------1111------------------------------------3 LPATIATFRAGCILQGYLLKPMTEAFEKNPNISNLMCAFQTEIRAGLQNYRDMVALITSK 333--------11113333--------------3333----------------------- LEVSIPVLSASLNYVTAMFTPTLKYGQLVSLQRDVFGRHGYERVDKDGRESFQWPELQ ----------------1111--3333-----3333----------------------- >Genome polyprotein M; SWP:P23009; PDB:1PGL1; SISQQTVWNQMATVRTPLNFDSSKQSFCQFSVDLLGGGISVDKTGDWITLVQNSPISNLL ----------------11113333------------------------------------ RVAAWKKGCLMVKVVMSGNAAVKRSDWASLVQVFLTNSNSTEHFDACRWTKSEPHSWELI ------------------33333333-------------1111----------------- FPIEVCGPNNGFEMWSSEWANQTSWHLSFLVDNPKQSTTFDVLLGISQNFEIAGNTLMPA --------------------------------1111----------1111---------- FSVPQ ----- >Genome polyprotein M; SWP:P23009; PDB:1PGL2; METNLFKLSLDDVETPKGSMLDLKISQSKIALPKNTVGGTILRSDLLANFLTEGNFRASV ---3333-----------3333----------1111---------3333------1111- DLQRTHRIKGMIKMVATVGIPENTGIALACAMNSSIRGRASSDIYTICSQDCELWNPACT --------------------1111------------------3333---------3333- KAMTMSFNPNPCSDAWSLEFLKRTGFHCDIICVTGWTATPMQDVQVTIDWFISSQECVPR 
---------1111----------------------------------------------- TYCVLNPQNPFVLNRWMGKLTFPQGTSRSVKRMPLSIGGGAGAKSAILMNMPNAVLSMWR --1111-------------------------------------------------3333- YFVGDLVFEVSKMTSPYIKCTVSFFIAFGNLADDTINFEAFPHKLVQFGEIQEKVVLKFS --------------1111---------11111111-------------1111-------- QEEFLTAWSTQVRPATTLLADGCPYLYAMVHDSSVSTIPGDFVIGVKLTIIENMCAYGLN ------------11113333---------------------------------------- PGISGSRLLG ---------- >PEPTIDE-N(4)-(N-ACETYL-BE; SWP:P21163; PDB:1PGS; DNTVNIKTFDKVKNAFGDGLSQSAEGTFTFPADVTTVKTIKMFIKNECPNKTCDEWDRYA ---------------------------------1111-----------%%%%-------- NVYVKNKTTGEWYEIGRFITPYWVGTEKLPRGLEIDVTDFKSLLSGNTELKIYTETWLAK -------------------------3333-------33331111-------------333 GREYSVDFDIVYGTPDYKYSAVVPVIQYNKSSIDGVPYGKAHTLGLKKNIQLPTNTEKAY 3-----------------------------1111------------------1111---- LRTTISGWGHAKPYDAGSRGCAEWCFRTHTIAINNANTFQHQLGALGCSANPINNQSPGN --------------------------------iiii----------3333---------- WTPDRAGWCPGMAVPTRIDVLNNSLTGSTFSYEYKFQSWTNNGTNGDAFYAISSFVIAKS ----22222222---------3333----------------------------------- NTPISAPVVTN ----------- >ACTIN INTERACTING PROTEIN; SWP:P46680; PDB:1PGUA; SSISLKEIIPPQPSTQRNFTTHLSYDPTTNAIAYPCGKSAFVRCLDDGDSKVPPVVQFTG ---------------2222-------1111----------------------------11 HGSSVVTTVKFSPIKGSQYLCSGDESGKVIVWGWTFDKESNSVEVNVKSEFQVLAGPISD 11-----------2222------1111----------1111------------------- ISWDFEGRRLCVVGEGRDNFGVFISWDSGNSLGEVSGHSQRINACHLKQSRPRSTVGDDG ---1111--------------------------------------------------%%% SVVFYQGPPFKFSASDRTHHKQGSFVRDVEFSPDSGEFVITVGSDRKISCFDGKSGEFLK %-------------------2222--------!!!!------------------------ YIEDDQEPVQGGIFALSWLDSQKFATVGADATIRVWDVTTSKCVQKWTLDKQQLGNQQVG ---1111--------------------1111------------------1111------- VVATGNGRIISLSLDGTLNFYELGHDEVLKTISGHNKGITALTVNPLISGSYDGRIEWSS ----iiii----1111-----2222-------------------------1111--3333 SSHQDHSNLIVSLDNSKAQEYSSISWDDTLKVNGITKHEFGSQPKVASANNDGFTAVLTN 
------------------------1111---iiii-----------------------11 DDDLLILQSFTGDIIKSVRLNSPGSAVSLSQNYVAVGLEEGNTIQVFKLSDLEVSFDLKT 11-----1111------------------------------------3333--------- PLRAKPSYISISPSETYIAAGDVGKILLYDLQSREVKTSRWAFRTSKINAISWKPAEEIE -----------1111---------------1111-------------------------- EDLVATGSLDTNIFIYSVKRPKIIKALNAHKDGVNNLLWETPSTLVSSGADACIKRWNVV -------1111------------------2222-------1111----1111-------- >TROPOMODULIN TMD-1; SWP:O01479; PDB:1PGVA; TDVESCINRLREDDTDLKEVNINNMKRVSKERIRSLIEAACNSKHIEKFSLANTAISDSE ---------11111111----2222--------------1111-------------3333 ARGLIELIETSPSLRVLNVESNFLTPELLARLLRSTLVTQSIVEFKADNQRQSVLGNQVE ----------------------------------3333---------------------- MDMMMAIEENESLLRVGISFASMEARHRVSEALERNYERVRLRRLGK ------------------------------------------1111- >SWA2P; SWP:Q06677; PDB:1PGYA; ALVDEVKDMEIARLMSLGLSIEEATEFYENDVTYERYLEILKSKQKE --------------------3333----------------------- >PHOSPHORYLASE KINASE; SWP:P00518; PDB:1PHK; FYENYEPKEILGRGVSSVVRRCIHKPTCKEYAVKIIDVTGGGSFSAEEVQELREATLKEV 1111--------------------1111--------333333333333------------ DILRKVSGHPNIIQLKDTYETNTFFFLVFDLMKKGELFDYLTEKVTLSEKETRKIMRALL --------1111-------------------1111------------3333--------- EVICALHKLNIVHRDLKPENILLDDDMNIKLTDFGFSCQLDPGEKLREVCGTPSYLAPEI ----------------3333---1111------1111---2222-------3333-3333 IECSMNDNHPGYGKEVDMWSTGVIMYTLLAGSPPFWHRKQMLMLRMIMSGNYQFGSPEWD -----1111---------------------------------------------333311 DYSDTVKDLVSRFLVVQPQKRYTAEEALAHPFFQQYV 11-------3333---3333--333311111111--- >PHYCOCYANIN; SWP:P00306; PDB:1PHNA; MKTPITEAIAAADNQGRFLSNTELQAVNGRYQRAAASLEAARSLTSNAERLINGAAQAVY --3333------1111-------------------------------------------- SKFPYTSQMPGPQYASSAVGKAKCARDIGYYLRMVTYCLVVGGTGPMDEYLIAGLEEINR ----1111--1111-------------------------------------2222----1 TFDLSPSWYVEALNYIKANHGLSGQAANEANTYIDYAINALS 111-3333------------------------------1111 >C-phycocyanin beta chain; SWP:P00311; PDB:1PHNB; 
MLDAFAKVVAQADARGEFLSNTQLDALSKMVSEGNKRLDVVNRITSNASAIVTNAARALF ------------1111-------------------------------------------- SEQPQLIQPGGAYTNRRMAACLRDMEIILRYVSYAIIAGDSSILDDRCLNGLRETYQALG ---33332222-----------------------------3333--11113333------ VPGASVAVGIEKMKDSAIAIANDPSGITTGDCSALMAEVGTYFDRAATAVQ -3333------------------------------------------1111 >3-PHOSPHOGLYCERATE KINASE; SWP:P18912; PDB:1PHP; MNKKTIRDVDVRGKRVFCRVDFNVPMEQGAITDDTRIRAALPTIRYLIEHGAKVILASHL ----3333--2222------------iiii------------------------------ GRPKGKVVEELRLDAVAKRLGELLERPVAKTNEAVGDEVKAAVDRLNEGDVLLLENVRFY --%%%%-3333-------------------------------11112222---------1 PGEEKNDPELAKAFAELADLYVNDAFGAAHRAHASTEGIAHYLPAVAGFLMEKELEVLGK 111-----------1111------3333----11113333-------------------- ALSNPDRPFTAIIGGAKVKDKIGVIDNLLEKVDNLIIGGGLAYTFVKALGHDVGKSLLEE ----------------3333------------------3333----1111--!!!!--11 DKIELAKSFMEKAKEKGVRFYMPVDVVVADRFANDANTKVVPIDAIPADWSALDIGPKTR 11------------------------------1111-----1111-1111---------- ELYRDVIRESKLVVWNGPMGVFEMDAFAHGTKAIAEALAEALDTYSVIGGGDSAAAVEKF ------1111----------33331111-------------------------------- GLADKMDHISTGGGASLEFMEGKQLPGVVALEDK -1111-------3333--1111--3333------ >PHOSPHATIDYLINOSITOL 3-KI; SWP:P27986; PDB:1PHT; AEGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEARPEEIGWLNGYNET ---------------1111---2222----3333-1111-------3333---------- TGERGDFPGTYVEYIGRKKISPP -------1111------------ >PHENYLALANINE HYDROXYLASE; SWP:P04176; PDB:1PHZA; GQETSYIEDNSNQNGAISLIFSLKEEVGALAKVLRLFEENDINLTHIESRPSRLNKDEYE -------------------------2222-------3333--------------1111-- FFTYLDKRTKPVLGSIIKSLRNDIGATVHELSRDKEKNTVPWFPRTIQELDRFANQILDA -----33331111----------------------2222------3333---1111--11 DHPGFKDPVYRARRKQFADIAYNYRHGQPIPRVEYTEEEKQTWGTVFRTLKALYKTHACY 11-1111-----------------2222-------------------------1111--- EHNHIFPLLEKYCGFREDNIPQLEDVSQFLQTCTGFRLRPVAGLLSSRDFLGGLAFRVFH ---------------1111--------------------------------3333----- 
CTQYIRHGSKPMYTPEPDICHELLGHVPLFSDRSFAQFSQEIGLASLGAPDEYIEKLATI ------3333-------3333-----3333-------------1111--3333------- YWFTVEFGLCKEGDSIKAYGAGLLSSFGELQYCLSDKPKLLPLELEKTACQEYSVTEFQP -------------------------------1111--------33331111--------- LYYVAESFSDAKEKVRTFAATIPRPFSVRYDPYTQRVEVLDNT ------------------1111--------------------- >MOB1A; SWP:Q9H8S9; PDB:1PI1A; MEATLGSGNLRQAVMLPEGEDLNEWIAVNTVDFFNQINMLYGTITEFCTEASCPVMSAGP ---1111-3333----2222-----------------------1111-3333------11 RYEYHWADGTNIKKPIKCSAPKYIDYLMTWVQDQLDDETLFPSKIGVPFPKNFMSVAKTI 11-----------------------------------------1111--1111------- LKRLFRVYAHIYHQHFDSVMQLQEEAHLNTSFKHFIFFVQEFNLIDRRELAPLQELIEKL -------------------1111----------------------33333333---1111 GSKDR ----- >BOWMAN-BIRK INHIBITOR (PI; SWP:P01064; PDB:1PI2; YSKPCCDLCMCTRSMPPQCSCEDRINSCHSDCKSCMCTRSQPGQCRCLDTNDFCYKPCKS ----------------------------1111---------------------------- R - >GALACTOKINASE; SWP:Q9R7D7; PDB:1PIEA; TVLSALTEKFAEVFGDTKEVEYFFSPGRINLIGEHTDYNGGYVFPASITIGTTGLARLRE -----------1111--------------------1111--------------------- DKKVKLYSENFPKLGVIEFDLDEVEKKDGELWSNYVKGMIVMLKGAGYEIDKGFELLIKG -------11111111-------3333-----------------1111------------- EIPTASGLSSSASLELLVGVVLDDLFNLNVPRLELVQLGQKTENDYIGVNSGILDQFAIG -----------------------1111--------------------------------- FGEVKKAIELDCNTLKYEMVPVELRDYDIVIMNTNKPRALTESKYNERFAETREALKRMQ --2222-----------------!!!!--------------------------------- TRLDIQSLGELSNEEFDANTDLIGDETLIKRARHAVYENNRTKIAQKAFVAGNLTKFGEL ------------------3333-------------------------------------- LNASHASLKDDYEVTGLELDTLAETAQKQAGVLGARMTGAGFGGCAIALVAHDNVSAFRK --------------------------------------------------3333------ AVGQVYEEVVGYPASFYVAQIGSGSTKL ---------------------------- >N-(5'PHOSPHORIBOSYL)ANTHR; SWP:P00909; PDB:1PII; MQTVLAKIVADKAIWVEARKQQQPLASFQNEVQPSTRHFYDALQGARTAFILECKKASPS -----------------------33333333------3333------------------- KGVIRDDFDPARIAAIYKHYASAISVLTDEKYFQGSFNFLPIVSQIAPQPILCKDFIIDP 
-----------------------------------3333-------------------33 YQIYLARYYQADACLLMLSVLDDDQYRQLAAVAHSLEMGVLTEVSNEEEQERAIALGAKV 33----1111------3333-------------1111----------------------- VGINNRDLRDLSIDLNRTRELAPKLGHNVTVISESGINTYAQVRELSHFANGFLIGSALM --------------------3333-1111----------------1111------3333- AHDDLHAAVRRVLLGENKVCGLTRGQDAKAAYDAGAIYGGLIFVATSPRCVNVEQAQEVM -------------------------------------------3333-----------11 AAAPLQYVGVFRNHDIADVVDKAKVLSLAAVQLHGNEEQLYIDTLREALPAHVAIWKALS 11-----------------------------------------------3333------- VGETLPAREFQHVDKYVLDNGQGGSGQRFDWSLLNGQSLGNVLLAGGLGADNCVEAAQTG ---------------------------------2222-----------3333---1111- CAGLDFNSAVESQPGIKDARLLASVFQTLRAY ------3333--2222-----------1111- >SIGNAL TRANSDUCING PROTEI; SWP:P0A9Z1; PDB:1PIL; MKKIDAIIKPFKLDDVREALAEVGITGMTVTEVKGFGRQKGHTELYRGAEYMVDFLPKVK --------1111--------------------------2222------------------ IEIVVPDDIVDTCVDTIIRTAQTGKIGDGKIFVFDVARVIRIRTGEEDDAAI -----3333--------------------------------1111---1111 >PEPTIDYL-PROLYL CIS-TRANS; SWP:Q13526; PDB:1PINA; KLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGGKNGQGEPARVRCSHLLVKHSQSRRP --2222----2222---------------------------------------1111--- SSWRQEKITRTKEEALELINGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQM -3333--------------------------------------3333iiii----2222- QKPFEDASFALRTGEMSGPVFTDSGIHIILRTE -------11112222------3333-------- >Hypothetical zinc-type al; SWP:Q04894; PDB:1PIWA; MSYPEKFEGIAIQSHEDWKNPKKTKYDPKPFYDHDIDIKIEACGVCGSDIHCAAGHWGNM ----------------1111-----------1111----------3333-----1111-- KMPLVVGHEIVGKVVKLGPKSNSGLKVGQRVGVGAQVFSCLECDRCKNDNEPYCTKFVTT -------------------------2222----------------111133331111--- YSQPYEDGYVSQGGYANYVRVHEHFVVPIPENIPSHLAAPLLCGGLTVYSPLVRNGCGPG ----1111-------------3333----11113333-3333----------1111-222 KKVGIVGLGGIGSMGTLISKAMGAETYVISRSSRKREDAMKMGADHYIATLEEGDWGEKY 2------------------1111---------1111---1111-----3333-------- FDTFDLIVVCASSLTDIDFNIMPKAMKVGGRIVSISIPEQHEMLSLKPYGLKAVSISYSA 
--------------------3333--2222----------------33332222------ LGSIKELNQLLKLVSEKDIKIWVETLPVGEAGVHEAFERMEKGDVRYRFTLVGYDKEFSD ----------------------------------------------------3333---- >GLUTACONYL-COA DECARBOXYL; SWP:Q06700; PDB:1PIXA; GFYSMPRYFQNMPQVGKPLKKADAANEEQLKKIEEEIHQLIKEAQEAGKADADVNKRGEL ----3333-----------------------------------------3333-1111-- TALQRIEKLVEPGSWRPLNTLFNPQGNKNGSVAIVKGLGRVNGKWCVVVASDNKKLAGAW ----------2222----111111111111----------iiii-------3333%%%%- VPGQAECLLRASDTAKTLHVPLVYVLNCSGVKFDEQEKVYPNRRGGGTPFFRNAELNQLG 2222---------------------------33331111-----------------1111 IPVIVGIYGTNPAGGGYHSISPTVIIAHEKANMAVGGAGIMGGMNPKGHVDLEYANEIAD ---------------------------1111-------------------3333------ MVDRTGKTEPPGAVDIHYTETGFMREVYASEEGVLEGIKKYVGMLPKYDPEFFRVDDPKA ------------3333-------------------------1111---3333-------- PAFPADDLYSMVPLNDKRAYDIYNVIARLFDNSELHEYKKGYGPEMVTGLAKVNGLLVGV ---3333-------------33333333----------22221111------iiii---- VANVQGLLMNYPEYKAAGSVGIGGKLYRQGLVKMNEFVTLCARDRLPIVWIQDTTGIDVG -----------1111--------------------------------------------- NDAEKAELLGLGQSLIYSIQTSHIPQFEITLRKGTAAAHYVLGGPQGNDTNAFSIGTAAT ------------------1111---------------------3333---------1111 EIAVMNGETAATAMYSRRLAKDRKAGKDLQPTIDKMNNLIQAFYTKSRPKVCAELGLVDE ----------------------1111-----------------1111------------- IVDMNKIRGYVEAFTEAAYQNPESICPFHQMILPRAIREFETFVKK --1111--------------------1111---------------- >NAD-DEPENDENT MALIC ENZYM; SWP:P23368; PDB:1PJ3A; IKEKGKPLLNPRTNKGAFTLQERQLGLQGLLPPKIETQDIQALRFHRNLKKTSPLEKYIY ----3333----------3333----2222------------------------------ IGIQERNEKLFYRILQDDIESLPIVYTPTVGLACSQYGHIFRRPKGLFISISDRGHVRSI -3333-------------3333-----3333----3333----------3333------- VDNWPENHVKAVVVTDGERILGLGDLGVYGGIPVGKLCLYTACAGIRPDRCLPVCIDVGT 1111---------------!!!!--!!!!-----------------1111---------- DNIALLKDPFYGLYQKRDRTQQYDDLIDEFKAITDRYGRNTLIQFEDFGNHNAFRFLRKY -3333--1111--------------------------1111------------------1 
REKYCTFNDDIQGTAAVALAGLLAAQKVISKPISEHKILFLGAGEAALGIANLIVSVENG 111----3333------------3333----3333---------------------1111 LSEQEAQKKIWFDKYGLLVKGRKAKIDSYQEPFTHSAPESIPDTFEDAVNILKPSTIIGV ------1111--1111--2222----33331111-------------------------- AGAGRLFTPDVIRAASINERPVIFALSNPTAQAECTAEEAYTLTEGRCLFASGSPFGPVK ----------------------------3333---------1111--------------- LTDGRVFTPGQGNNVYIFPGVALAVILCNTRHISDSVFLEAAKALTSQLTDEELAQGRLY 1111-----------------------------3333--------111133331111--- PPLANIQEVSINIAIKVTEYLYANKAFRYPEPEDKAKYVKERTWRSEYDSLLPDVYEWP -3333----------------1111---------------------------------- >N,N-DIMETHYLGLYCINE OXIDA; SWP:Q9AGP8; PDB:1PJ5A; TPRIVIIGAGIVGTNLADELVTRGWNNITVLDQGPLNMPGGSTSHAPGLVFQTNPSKTMA ---------3333-------1111----------3333--3333---------------- SFAKYTVEKLLSLTEDGVSCFNQVGGLEVATTETRLADLKRKLGYAAAWGIEGRLLSPAE --------------%%%%------------------------------------------ CQELYPLLDGENILGGLHVPSDGLASAARAVQLLIKRTESAGVTYRGSTTVTGIEQSGGR ----1111-1111-------------------------1111--------------iiii VTGVQTADGVIPADIVVSCAGFWGAKIGAMIGMAVPLLPLAHQYVKTTPVPAQQGRNDQP -----1111----------!!!!--------------------------3333------- NGARLPILRHQDQDLYYREHGDRYGIGSYAHRPMPVDVDTLGAYAPETVSEHHMPSRLDF ---------3333------!!!!-------------1111----3333-11111111--- TLEDFLPAWEATKQLLPALADSEIEDGFNGIFSFTPDGGPLLGESKELDGFYVAEAVWVT 3333-----------3333---------------1111---------2222------333 HSAGVAKAMAELLTTGRSETDLGECDITRFEDVQLTPEYVSETSQQNFVEIYDVLHPLQP 3--------------------11111111-3333---------------1111--1111- RLSPRNLRVSPFHARHKELGAFFLEAGGWERPYWFEANAALLKEMPAEWLPPARDAWSGM ---------1111------------iiii-----33331111---3333-----3333-- FSSPIAAAEAWKTRTAVAMYDMTPLKRLEVSGPGALKLLQELTTADLAKKPGAVTYTLLL --3333---------------3333------1111------------------------- DHAGGVRSDITVARLSEDTFQLGANGNIDTAYFERAARHQTQSGSATDWVQVRDTTGGTC 1111-----------1111-------------------------1111------3333-- CIGLWGPLARDLVSKVSDDDFTNDGLKYFRAKNVVIGGIPVTAMRLSYVGELGWELYTSA 
-----1111------------3333----------iiii-------1111--------33 DNGQRLWDALWQAGQPFGVIAAGRAAFSSLRLEKGYRSWGTDMTTEHDPFEAGLGFAVKM 33-----------3333--------------1111--2222--11113333--3333-33 AKESFIGKGALEGRTEEASARRLRCLTIDDGRSIVLGKEPVFYKEQAVGYVTSAAYGYTV 33--22221111--3333---------1111-----------iiii-------------- AKPIAYSYLPGTVSVGDSVDIEYFGRRITATVTEDPLYDPKMTRLRG ---------11112222-----iiii------------1111----- >PALMITOYL-PROTEIN THIOEST; SWP:Q9UMR5; PDB:1PJAA; SYKPVIVVHGLFDSSYSFRHLLEYINETHPGTVVTVLDLFDGRESLRPLWEQVQGFREAV -------------3333-----------2222---------3333--------------- VPIMAKAPQGVHLICYSQGGLVCRALLSVMDDHNVDSFISLSSPQMGQYGDTDYLKWLFP ------1111------3333-----------------------1111-----------11 TSMRSNLYRICYSPWGQEFSICNYWHDPHHDDLYLNASSFLALINGERDHPNATVWRKNF 1111113333--3333---3333---1111-------------------1111------3 LRVGHLVLIGGPDDGVITPWQSSFFGFYDANETVLEMEEQLVYLRDSFGLKTLLARGAIV 333-------1111----3333------1111---3333-3333---------------- RCPMAGISHTAWHSNRTLYETCIEPWLS -------1111-----------3333-- >L-ALANINE DEHYDROGENASE; SWP:O52942; PDB:1PJCA; MEIGVPKEIKNQEFRVGLSPSSVRTLVEAGHTVFIETQAGIGAGFADQDYVQAGAQVVPS ---------2222-------------1111-----22223333--3333-1111-----3 AKDAWSREMVVKVKEPLPAEYDLMQKDQLLFTYLHLAAARELTEQLMRVGLTAIAYETVE 333-------------333333331111------1111----------------1111-- LPNRSLPLLTPMSIIAGRLSVQFGARFLERQQGGRGVLLGGVPGVKPGKVVILGGGVVGT 3333-1111-------------------3333-----11112222--------------- EAAKMAVGLGAQVQIFDINVERLSYLETLFGSRVELLYSNSAEIETAVAEADLLIGAVLV ------1111--------------3333-!!!!--------------1111--------- PGRRAPILVPASLVEQMRTGSVIVDVAVDQGGCVETLHPTSHTQPTYEVFGVVHYGVPNM ---------333311112222------------1111-----------iiii------33 PGAVPWTATQALNNSTLPYVVKLANQGLKALETDDALAKGLNVQAHRLVHPAVQQVFPDL 33-------------------------3333------3333--iiii--3333---1111 A - >ENOYL-COA ISOMERASE; SWP:Q05871; PDB:1PJHA; IRQNEKISYRIEGPFFIIHLINPDNLNALEGEDYIYLGELLELADRNRDVYFTIIQSSGR ---1111----!!!!------3333----3333-------------1111---------- 
FFSSGADFKGIAKKYPSETSKWVSNFVARNVYVTDAFIKHSKVLICCLNGPAIGLSAALV ------------------------------------1111-------------------1 ALCDIVYSINDKVYLLYPFANLGLITEGGTTVSLPLKFGTNTTYECLMFNKPFKYDIMCE 111--------------3333-----iiii----------------1111--------11 NGFISKNFNMPSSNAEAFNAKVLEELREKVKGLYLPSCLGMKKLLKSNHIDAFNKANSVE 11---------------------------22223333----------------------- VNESLKYWVDGEPLKRFRQ ------------------- >SIROHEME SYNTHASE; SWP:P25924; PDB:1PJQA; MDHLPIFCQLRDRDCLIVGGGDVAERKARLLLEAGARLTVNALTFIPQFTVWANEGMLTL ---------2222----------------------------------------------- VEGPFDETLLDSCWLAIAATDDDTVNQRVSDAAESRRIFCNVVDAPKAASFIMPSIIDRS -----1111------------3333--------1111----1111--------------- PLMVAVSGGTSPVLARLLREKLESLLPQHLGQVARYAGQLRARVKKQFATMGERRRFWEK --------------------------1111-------------------3333------- FFVNDRLAQSLANADEKAVNATTERLFSEPLDHRGEVVLVGAGPGDAGLLTLKGLQQIQQ 1111------1111-----------1111----------------3333----------- ADIVVYDRLVSDDIMNLVRRDADRVFVGKRHCVPQEEINQILLREAQKGKRVVRLKGGDP ------1111---1111-1111------------------------------------11 FIFGRGGEELETLCHAGIPFSVVPGITAASGCSAYSGIPLTHRDYAQSVRLVTGELDWEN 11-----------1111---------3333---1111-------------------3333 LAAEKQTLVFYMGLNQAATIQEKLIAFGMQADMPVALVENGTSVKQRVVHGVLTQLGELA ------------------------1111-----------2222--------3333--333 QQVESPALIIVGRVVALRDKLNWFSNH 3-------------------------- >PCRA; SWP:P56255; PDB:1PJR; MNFLSEQLLAHLNKEQQEAVRTTEGPLLIMAGAGSGKTRVLTHRIAYLMAEKHVAPWNIL -3333-------3333---------------2222-------------------1111-- AITFTNKAAREMRERVQSLLGGAAEDVWISTFHSMCVRILRRDIDRIGINRNFSILDPTD -----------------------1111-----------11113333---1111------- QLSVMKTILKEKNIDPKKFEPRTILGTISAAKNELLPPEQFAKRYYEKVVSDVYQEYQQR -------------------3333--------1111-3333-------------------- LLRNHSLDFDDLIMTTIQLFDRVPDVLHYYQYKFQYIHIDEYQDTNRAQYTLVKKLAERF -------3333-----------------------------3333---------------- QNICAVGDADQSIYRWRGADIQNILSFERDYPNAKVILLEQNYRSTKRILQAANEVIEHN -------1111--3333--------3333-1111----------------------1111 
VNRKPKRIWTENPEGKPILYYEAMNEADEAQFVAGRIREAVERGERRYRDFAVLYRTNAQ ----------------------------------------------1111------3333 SRVMEEMLLKANIPYQIVGGLKFYDRKEIKDILAYLRVIANPDDDLSLLRIINVPKRGIG --------1111---------3333---------------1111---------------- ASTIDLFEALGELEMIGLGAKAAGALAAFRSQLEQWTQLQEYVSVTELVEEVLDKSGYRE -----3333----------3333--------------3333------------1111--- MLKAERTIEAQSRLENLDEFLSVTKHFENVSDDKSLIAFLTDLALISGDAVMLMTLHAAK ------------------------------------------------------333322 GLEFPVVFLIGMEEGIFPHNRSLEDDDEMEEERRLAYVGITRAEEELVLTSAQMRTLFGN 22----------------3333----------------1111--------------iiii IQMDPPSRFLNEIPAHLLETASR ------3333---3333------ >WOUND-INDUCED PROTEINASE ; SWP:P05119; PDB:1PJUA; ACTRECGNLGFGICPRSEGSPLNPICINCCSGYKGCNYYNSFGKFICEGESDPKRPNACT -------------------3333----3333-2222---1111--------1111----- FNCDPNIAYSRCPRSQGKSLIYPTGCTTCCTGYKGCYYFGKDGKFVCEGESDEPK ---1111----------------!!!!3333-2222---1111------------ >COBATOXIN 1; SWP:O46028; PDB:1PJVA; AVCVYRTCDKDCKRRGYRSGKCINNACKCYPY -----------1111-------%%%%------ >ENVELOPE PROTEIN; SWP:Q9J0X3; PDB:1PJWA; DKLALKGTTYGMCTEKFSFAKNPADTGHGTVVIELSYSGSDGPCKIPIVSVASLNDMTPV ---------------------------------------------------2222----- GRLVTVNPFVATSSANSKVLVEMEPPFGDSYIVVGMGDKQINHHWHKAGST --------------------------------------------------- >DIISOPROPYLFLUOROPHOSPHAT; SWP:Q7SIG4; PDB:1PJXA; MEIPVIEPLFTKVTEDIPGAEGPVFDKNGDFYIVAPEVEVNGKPAGEILRIDLKTGKKTV ----------------2222-----1111----------iiii----------------- ICKPEVNGYGGIPAGCQCDRDANQLFVADMRLGLLVVQTDGTFEEIAKKDSEGRRMQGCN -----iiii----------------------------1111--------1111------- DCAFDYEGNLWITAPAGEVAPADYTRSMQEKFGSIYCFTTDGQMIQVDTAFQFPNGIAVR ----1111------------------------------1111------------------ HMNDGRPYQLIVAETPTKKLWSYDIKGPAKIENKKVWGHIPGTHEGGADGMDFDEDNNLL -1111---------1111--------2222-----------------------1111--- VANWGSSHIEVFGPDGGQPKMRIRCPFEKPSNLHFKPQTKTIFVTEHENNAVWKFEWQRN --2222------1111-------------------2222--------------------- GKKQYCETLKFGIF 
---3333-1111-- >THIOPURINE S-METHYLTRANSF; SWP:O86262; PDB:1PJZA; HQSEVNKDLQQYWSSLNVVPGARVLVPLCGKSQDMSWLSGQGYHVVGAELSEAAVERYFT -----3333--------------------------------------------------- ERGEQPHITSQGDFKVYAAPGIEIWCGDFFALTARDIGHCAAFYDRAAMIALPADMRERY ----------!!!!----3333----------3333------------1111-------- VQHLEALMPQACSGLLITLEYDQALLEGPPFSVPQTWLHRVMSGNWEVTKVGGQDTLHSS ------------------------------------------------------------ ARGLKAGLERMDEHVYVLERV 3333----------------- >Polycomb protein Scm; SWP:Q9VHA0; PDB:1PK1B; RSQPIDWTIEEVIQYIESNDNSLAVHGDLFRKHEIDGKALLRLNSERMMKYMGLKLGPAL ---1111------------11111111------------1111----------------- KICNLVNKVN -----3333- >TISSUE-TYPE PLASMINOGEN A; SWP:P00750; PDB:1PK2; SEGNSDCYFGNGSAYRGTHSLTESGASCLPWNSMILIGKVYTAQNPSAQALGLGKHNYCR -----------------------------11111111----1111-3333---------- NPDGDAKPWCHVLKNRRLTWEYCDVPSCST 1111---------2222------------- >ORPHAN NUCLEAR RECEPTOR N; SWP:P45448; PDB:1PK5A; ASIPHLILELLKCEPDEPQVQAKIMAYLQQEQSNRNRQEKLSAFGLLCKMADQTLFSIVE ----------1111-3333----------------------------------------- WARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVAHGKEGTIFLVTGEHVDYSTIIS ------3333------------------------------------1111---------- HTEVAFNNLLSLAQELVVRLRSLQFDQREFVCLKFLVLFSSDVKNLENLQLVEGVQEQVN --------------------1111------------------------------------ AALLDYTVCNYPQQTEKFGQLLLRLPELRAISKQAEDYLYYKHVNGDVPYNNLLIEMLHA ----------3333------------------------------------3333---111 KR 1- >COMPLEMENT C1Q SUBCOMPONE; SWP:P02745; PDB:1PK6A; QPRPAFSAIRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEI ---------------------------2222----------------------------- CLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSE -------iiii---------------------------2222------1111-------- ADSVFSGFLIFPS ------------- >Complement C1q subcompone; SWP:P02746; PDB:1PK6B; TQKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRG ----------------2222---------2222--------------------------- 
NLCVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATDKNSLLGMEGA --------------------------------------2222--------------2222 NSIFSGFLLFPD ------------ >Complement C1q subcompone; SWP:P02747; PDB:1PK6C; KFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHTA -----------------------------1111--------------------------- NLCVLLYRSGVKVVTFCGHTSKTNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGIQGSDSV -------iiii------------------------2222--------------2222--- FSGFLLFPD --------- >RAT SYNAPSIN I; SWP:P09951; PDB:1PK8A; AARVLLVIDEPHTDWAKYFKGKKIHGEIDIKVEQAEFSDLNLVAHANGGFSVDMEVLRNG ---------33333333-2222-------------3333-----1111------------ VKVVRSLKPDFVLIRQHAFSMARNGDYRSLVIGLQYAGIPSVNSLHSVYNFCDKPWVFAQ ---------------------2222---------1111----------1111-------- MVRLHKKLGTEEFPLIDQTFYPNHKEMLSSTTYPVVVKMGHAHSGMGKVKVDNQHDFQDI ---------1111---------3333----------------iiii-------------- ASVVALTKTYATAEPFIDAKYDVRVQKIGQNYKAYMRTSVSGNWKTNTGSAMLEQIAMSD --3333---------------------!!!!-------------1111------------ RYKLWVDTCSEIFGGLDICAVEALHGKDGRDHIIEVVGSSMPLIGDHQDEDKQLIVELVV -----------iiii----------1111--------1111------------------- NKMTQA ------ >PURINE NUCLEOSIDE PHOSPHO; SWP:P09743; PDB:1PK9A; ATPHINAEMGDFADVVLMPGDPLRAKYIAETFLEDAREVNNVRGMLGFTGTYKGRKISVM -1111--2222---------3333----------------2222-------iiii----- GHGMGIPSCSIYTKELITDFGVKKIIRVGSCGAVLPHVKLRDVVIGMGACTDSKVNRIRF ----------------------------------11112222-----------------% KDHDFAAIADFDMVRNAVDAAKALGIDARVGNLFSADLFYSPDGEMFDVMEKYGILGVEM %%%------------------1111------------------3333---1111------ EAAGIYGVAAEFGAKALTICTVSDHIRTHEQTTAAERQTTFNDMIKIALESVLLGDK 3333-------------------------------------------------1111 >BIFUNCTIONAL DEAMINASE/DI; SWP:Q57872; PDB:1PKHA; MILSDKDIIDYVTSKRIIIKPFNKDFVGPCSYDVTLGDEFIIYDDEVYDLSKELNYKRIK ----------------------1111----------------------3333-------- IKNSILVCPLNYNLTEEKINYFKEKYNVDYVVEGGVLGTTNEYIELPNDISAQYQGRSSL ---------------------------------------------------------333 
GRVFLTSHQTAGWIDAGFKGKITLEIVAFDKPVILYKNQRIGQLIFSKLLSPADVGYSER 3-------------2222-----------------2222--------------------- KT -- >PYRUVATE KINASE; SWP:Q27686; PDB:1PKLA; SQLAHNLTLSIFDPVANYRAARIICTIGPSTQSVEALKGLIQSGMSVARMNFSHGSHEYH ----3333-------------------3333-3333------------------------ QTTINNVRQAAAELGVNIAIALDTKGPEIRTGQFVGGDAVMERGATCYVTTDPAFADKGT ---------------------------------2222--------------3333----1 KDKFYIDYQNLSKVVRPGNYIYIDDGILILQVQSHEDEQTLECTVTNSHTISDRRGVNLP 111----1111----2222----iiii-------------------------------22 GCDVDLPAVSAKDRVDLQFGVEQGVDMIFASFIRSAEQVGDVRKALGPKGRDIMIICKIE 22--------------------------------3333--------1111---------- NHQGVQNIDSIIEESDGIMVARGDLGVEIPAEKVVVAQKILISKCNVAGKPVICATQMLE ---------------------3333----3333-------------------------33 SMTYNPRPTRAEVSDVANAVFNGADCVMLSGETAKGKYPNEVVQYMARICLEAQSALNEY 33---------------------------3333-------------------3333-333 VFFNSIKKLQHIPMSADEAVCSSAVNSVYETKAKAMVVLSNTGRSARLVAKYRPNCPIVC 3-----------------------------------------------3333-------- VTTRLQTCRQLNITQGVESVFFDADKLGHDEGKEHRVAAGVEFAKSKGYVQTGDYCVVIH ---3333---1111--------1111------------------1111--2222------ AANQTRILLVE ----------- >M1 PYRUVATE KINASE; SWP:P11979; PDB:1PKM; IQTQQLHAAMADTFLEHMCRLDIDSPPITARNTGIICTIGPASRSVEILKEMIKSGMNVA --%%%%3333-------1111------------------1111---------1111---- RLNFSHGTHEYHAETIKNVRAATESFASDPIRYRPVAVALDTKGPEIRTGLIKGSGTAEV -----------------------1111-3333---------------------3333--- ELKKGATLKITLDNAYMEKCDENVLWLDYKNICKVVEVGSKVYVDDGLISLLVKEKGADF --2222---------1111---------1111------------%%%%------------ LVTEVENGGSLGSKKGVNLPGAAVDLPAVSEKDIQDLKFGVEQDVDMVFASFIRKASDVH ------------------2222------------------1111---------------- EVRKVLGEKGKNIKIISKIENHEGVRRFDEILEASDGIMVARGDLGIEIPAEKVFLAQKM ------3333-----------3333-----------------3333---3333------- MIGRCNRAGKPVICATQMLESMIKKPRPTRAEGSDVANAVLDGADCIMLSGETAKGDYPL ------------------3333-----------------3333------3333------- 
EAVRMQHLIAREAEAAMFHRKLFEELVRGSSHSTDLMEAMAMGSVEASYKCLAAALIVLT -------------1111------------------------------------------- ESGRSAHQVARYRPRAPIIAVTRNHQTARQAHLYRGIFPVVCKDPVQEAWAEDVDLRVNL --------3333-----------------33332222----------------------- AMNVGKARGFFKHGDVVIVLTGWRPGSGFTNTMRVVPVP -----------2222------------------------ >MYELIN OLIGODENDROCYTE GL; SWP:Q63345; PDB:1PKOA; GQFRVIGPGHPIRALVGDEAELPCRISPGKNATGEVGWYRSSRVVHLYRNGKDQDAEQAP --------------2222------------------------------iiii-3333-11 EYRGRTELLKESIGEGKVALRIQNVRFSDEGGYTCFFRDHSYQEEAAVELKVEDPFYWIN 11-------1111------------3333---------!!!!-------------1111- PGR --- >RIBOSOMAL PROTEIN S5; SWP:P02357; PDB:1PKP; INPNKLELEERVVAVNRVAKVVKGGRRLRFSALVVVGDKNGHVGFGTGKAQEVPEAIRKA -3333------------------------------------------------------- IEDAKKNLIEVPIVGTTIPHEVIGHFGAGEIILKPASEGTGVIAGGPARAVLELAGISDI ---1111------------------!!!!------------------------------- LSKSIGSNTPINMVRATFDGLKQLK --------------------1111- >(8-18C5) CHIMERIC FAB, LI; SWP:Q63345; PDB:1PKQA; DIELTQSPSSLAVSAGEKVTMSCKSSQSLLNSNQK -------------2222------------------ >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q07661; PDB:1PKUA; RMEQSFIMIKPDGVQRGLIGDIISRFEKKGFYLRGMKFMNVERSFAQQHYADLSDKPFFP -------------1111---------3333-------------------3333--1111- GLVEYIISGPVVAMVWEGKDVVATGRRIIGATRPWEAAPGTIRADYAVEVGRNVIHGSDS ----1111------------------------1111-2222-------1111-------- VDNGKKEIALWFPEGLAEWRSNLHPWIYE -----------1111-----1111----- >BIFUNCTIONAL PURINE BIOSY; SWP:P31939; PDB:1PKXA; PGQLALFSVSDKTGLVEFARNLTALGLNLVASGGTAKALRDAGLAVRDVSELTGFPEMLG -----------2222------------------------1111----3333------%%% GRVKTLHPAVHAGILARNIPEDNADMARLDFNLIRVVACNLYPFVKTVASPGVTVEEAVE %-----3333------------------------------------3333--------11 QIDIGGVTLLRAAAKNHARVTVVCEPEDYVVVSTEMQSSESKDTSLETRRQLALKAFTHT 11-------------3333-----3333----------1111------------------ AQYDEAISDYFRKQYSKGVSQMPLRYGMNPHQTPAQLYTLQPKLPITVLNGAPGFINLCD ---------------2222---------1111---------------------------- 
ALNAWQLVKELKEALGIPAAASFKHVSPAGAAVGIPLSEDEAKVCMVYDLYKTLTPISAA -----------------------%%%%-------------------33331111------ YARARGADRMSSFGDFVALSDVCDVPTAKIISREVSDGIIAPGYEEEALTILSKKKNGNY -------33332222---------------1111---------------------iiii- CVLQMDQSYKPDENEVRTLFGLHLSQKRNNGVVDKSLFSNVVTKNKDLPESALRDLIVAT -----1111---------iiii-----------3333----------------------- IAVKYTQSNSVCYAKNGQVIGIGAGQQSRIHCTRLAGDKANYWWLRHHPQVLSMKFKTGV --1111--------%%%%-----------------------------3333-----1111 AEISNAIDQYVTGTIGEDEDLIKWKALFEEVPELLTEAEKKEWVEKLTEVSISSDAFFPF 3333---------------------------------------1111------------- RDNVDRAKRSGVAYIAAPSGSAADKVVIEACDELGIILAHTNLRLFHH --------------------1111------------------------ >CELLOBIOSE DEHYDROGENASE; SWP:Q01738; PDB:1PL3A; SASQFTDPTTGFQFTGITDPVHDVTYGFVFPPLATSGAQSTEFIGEVVAPIASKWIGIAL -------------------------------------------------3333-----11 GGAHNNDLLLVAWANGNQIVSSTRWATGYVQPTAYTGTATLTTLPETTINSTHWKWVFRC 11------------!!!!--------%%%%-------------3333--1111------- QGCTEWNNGGGIDVTSQGVLAWAFSNVAVDDPSDPQSTFSEHTDFGFFGIDYSTAHSANY -----1111---1111--------------1111----------------3333--1111 QNYLN 3333- >SUPEROXIDE DISMUTASE [MN]; SWP:P04179; PDB:1PL4A; KHSLPDLPYDYGALEPHINAQIMQLHHSKHHAAYVNNLNVTEEKYQEALAKGDVTAQIAL ---------1111-----------------------------------1111-------- QPALKFNGGGHINHSIFWTNLSPNGGGEPKGELLEAIKRDFGSFDKFKEKLTAASVGVQG -----------------11111111-----------------------------3333-- SGWGWLGFNKERGHLQIAACPNQDPLQGTTGLIPLLGIDVWEHAYFLQYKNVRPDYLKAI ---------1111-------!!!!3333------------3333----!!!!------33 WNVINWENVTERYMACKK 33-----------3333- >REGULATORY PROTEIN SIR4; SWP:P11978; PDB:1PL5A; SFVDIVLSKAASALDEKEKQLAVANEIIRSLSDEVMRNEIRITSLQGDLTFTKKCLENAR 3333-------------------------------------------------------- SQISEKDAKINKLME ---------1111-- >HUMAN SORBITOL DEHYDROGEN; SWP:Q00796; PDB:1PL8A; AAAAKPNNLSLVVHGPGDLRLENYPIPEPGPNEVLLRMHSVGICGSDVHYWEYGRIGNFI --------------2222-----------1111----------------------!!!!- 
VKKPMVLGHEASGTVEKVGSSVKHLKPGDRVAIEPGAPRENDEFCKMGRYNLSPSIFFCA ------------------1111---2222---------------111133331111-222 TPPDDGNLCRFYKHNAAFCYKLPDNVTFEEGALIEPLSVGIHACRRGGVTLGHKVLVCGA 2-------------3333----1111-----------------------2222------- GPIGMVTLLVAKAMGAAQVVVTDLSATRLSKAKEIGADLVLQISKESPQEIARKVEGQLG --------------------------------1111------------------------ CKPEVTIECTGAEASIQAGIYATRSGGTLVLVGLGSEMTTVPLLHAAIREVDIKGVFRYC -----------------------2222--------------------------------- NTWPVAISMLASKSVNVKPLVTHRFPLEKALEAFETFKKGLGLKIMLKCDPSDQNP ----------------3333-----3333--------------------1111--- >PLASTOCYANIN; SWP:P00299; PDB:1PLC; IDVLLGADDGSLAFVPSEFSISPGEKIVFKNNAGFPHNIVFDEDSIPSGVDASKISMSEE ------1111-----------2222----------------1111-22223333---111 DLLNAKGETFEVALSNKGEYSFYCSPHQGAGMVGKVTVN 1---2222---------------33331111-------- >PLATELET FACTOR 4; SWP:P30035; PDB:1PLFA; LQCVCLKTTSGINPRHISSLEVIGAGLHCPSPQLIATLKTGRKICLDQQNPLYKKIIKRL ------------3333---------3333--------1111-----33333333------ LKS --- >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1PLGH; QIQLQQSGPELVRPGASVKISCKASGYTFTDYYIHWVKQRPGEGLEWIGWIYPGSGNTKY ---------------------------3333----------------------------- NEKFKGKATLTVDTSSSTAYMQLSSLTSEDSAVYFCARGGKFAMDYWGQGTSVTVSSAKT 3333--------3333----------3333---------1111----------------- TAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYT ------------------------------------------------------iiii-- LSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIE -------1111-----------1111--------- >PROLIFERATING CELL NUCLEA; SWP:P15873; PDB:1PLQ; MLEAKFEEASLFKRIIDGFKDCVQLVNFQCKEDGIIAQAVDDSRVLLVSLEIGVEAFQEY -------3333-------------------1111------1111--------3333---- RCDHPVTLGMDLTSLSKILRCGNNTDTLTLIADNTPDSIILLFEDTKKDRIAEYSLKLMD ------------------------------------------------------------ IDADFLKIEELQYDSTLSLPSSEFSKIVRDLSQLSDSINIMITKETIKFVADGDIGSGSV -----------------------------3333-------------------3333---- IIKPFVDMEHPETSIKLEMDQPVDLTFGAKYLLDIIKGSSLSDRVGIRLSSEAPALFQFD 
------33331111----------------------3333-------------------- LKSGFLQFFLAPKFNDEE ------------------ >Nitrophorin-2 [Precursor]; SWP:Q26241; PDB:1PM1X; MDCSTNISPKQGLDKAKYFSGKWYVTHFLDKDPQVTDQYCSSFTPRESDGTVKEALYHYN -------------3333--------------1111------------iiii--------- ANKKTSFYNIGEGKLESSGLQYTAKYKTVDKKKAVLKEADEKNSYTLTVLEADDSSALVH -----------------------------1111------1111---------1111---- ICVREGSKDLGDVYTVLTHQKDAEPSAKVKSAVTQAGLQLSQFVGTKDLGCQYDDQFTSL ----!!!!-----------2222----------1111-3333---1111----3333--- >MTH1895; SWP:NA; PDB:1PM3A; HMRIVEEMVGKEVLDSSAKVIGKVKDVEVDIESQAIESLVLGKGGGETIVPYEMVKKIGD --------------1111--------------------------------1111------ KILLKGPEE -----1111 >YPM; SWP:Q57221; PDB:1PM4A; IPNIATYTGTIQGKGEVCIIGNKEGKTRGGELYAVLHSTNVNADMTLILLRNVGGNGWGE -----------2222------1111--------------1111----------------- IKRNDIDKPLKYEDYYTSGLSWIWKIKNNSSETSNYSLDATVHDDKEDSDVLTKCPV ----2222--------------------------------------1111------- >PROTEASOME; SWP:P25156; PDB:1PMAA; TVFSPDGRLFQVEYAREAVKKGSTALGMKFANGVLLISDKKVRSRLIEQNSIEKIQLIDD ---1111-----------1111---------------------1111-3333------11 YVAAVTSGLVADARVLVDFARISAQQEKVTYGSLVNIENLVKRVADQMQQYTQYGGVRPY 11---------------------------------3333---------1111-------- GVSLIFAGIDQIGPRLFDCDPAGTINEYKATAIGSGKDAVVSFLEREYKENLPEKEAVTL ---------3333------1111------------------------------------- GIKALKSSLEEGEELKAPEIASITVGNKYRIYDQEEVKKFL -----------------------2222---------3333- >Proteasome subunit beta [; SWP:P28061; PDB:1PMAB; TTTVGITLKDAVIMATERRVTMENFIMHKNGKKLFQIDTYTGMTIAGLVGDAQVLVRYMK ---------------------!!!!----------------------------------- AELELYRLQRRVNMPIEAVATLLSNMLNQVKYMPYMVQLLVGGIDTAPHVFSIDAAGGSV -----------------------------1111--------------------1111--- EDIYASTGSGSPFVYGVLESQYSEKMTVDEGVDLVIRAISAAKQRDSASGGMIDVAVITR -------1111-----------1111-------------------1111----------- KDGYVQLPTDQIESRIRKLGLIL ----------------------- >Beta-1,4-mannanase; SWP:P77847; PDB:1PMHX; 
SVNPVVLDFEDGTVSFGEAWGDSLKCIKKVSVSQDLQRPGNKYALRLDVEFNPNNGWDQG --------1111-------2222!!!!------11112222----------1111----- DLGTWIGGVVEGQFDFTGYKSVEFEFIPYDEFSKSQGGFAYKVVINDGWKELGSEFNITA ----22222222---2222---------------------------------------11 NAGKKVKINGKDYTVIHKAFAIPEDFRTKKRAQLVFQFAGQNSNYKGPIYLDNVRIRPED 11-----iiii-----------3333---------------------------------- A - >PHOSPHOMANNOSE ISOMERASE; SWP:P34948; PDB:1PMI; SSEKLFRIQCGYQNYDWGKIGSSSAVAQFVHNSDPSITIDETKPYAELWMGTHPSVPSKA --------------1111-!!!!----------3333--1111---------3333---- IDLNNQTLRDLVTAKPQEYLGESIITKFGSSKELPFLFKVLSIEKVLSIQAHPDKKLGAQ --%%%%---------3333-3333------------------------------------ LHAADPKNYPDDNHKPEMAIAVTDFEGFCGFKPLDQLAKTLATVPELNEIIGQELVDEFI --------------------------------3333--------3333------------ SGIKLPAEVGSQDDVNNRKLLQKVFGKLMNTDDDVIKQQTAKLLERTDREPQVFKDIDSR -------2222----------------1111----------------------3333111 LPELIQRLNKQFPNDIGLFCGCLLLNHVGLNKGEAMFLQAKDPHAYISGDIIECMAASDN 1-----------------------------2222-------------------------- VVRAGFTPKFKDVKNLVEMLTYSYESVEKQKMPLQEFPRSKGDAVKSVLYDPPIAEFSVL -------------------------3333-------1111-------------------- QTIFDKSKGGKQVIEGLNGPSIVIATNGKGTIQITGDDSTKQKIDTGYVFFVAPGSSIEL ---------------------------------22221111---2222----2222---- TADSANQDQDFTTYRAFVEA -------------------- >TISSUE PLASMINOGEN ACTIVA; SWP:P00750; PDB:1PMLA; DCYFGNGSAYRGTHSLTESGASCLPWNSMILIGKVYTAQNPSAQALGLGKHNYCRNPDGD ---!!!!---------1111----11111111----1111-3333--------------- AKPWCHVLKNRRLTWEYCDVPSCST ---------------------1111 >P2 MYELIN PROTEIN; SWP:P02690; PDB:1PMPA; SNKFLGTWKLVSSENFDEYMKALGVGLATRKLGNLAKPRVIISKKGDIITIRTESPFKNT --------------------1111-------3333-------------------3333-- EISFKLGQEFEETTADNRKTKSTVTLARGSLNQVQKWNGNETTIKRKLVDGKMVVECKMK ------------------------------------------------%%%%-------- DVVCTRIYEKV ----------- ------------------------------------------------------------ -------------------- >GLUTATHIONE TRANSFERASE; SWP:P15214; PDB:1PMT; 
MKLYYTPGSCSLSPHIVLRETGLDFSIERIDLRTKKTESGKDFLAINPKGQVPVLQLDNG -----2222---------1111--------------1111-3333-1111------1111 DILTEGVAIVQYLADLKPDRNLIAPPKALERYHQIEWLNFLASEVHKGYSPLFSSDTPES ----3333----11113333--------3333-----------------1111----333 YLPVVKNKLKSKFVYINDVLSKQKCVCGDHFTVADAYLFTLSQWAPHVALDLTDLSHLQD 3-----------------------1111-----------11113333----33333333- YLARIAQRPNVHSALVTEGLI ----33333333---1111-- >PSEUDOAZURIN; SWP:P04171; PDB:1PMY; DEVAVKMLNSGPGGMMVFDPALVRLKPGDSIKFLPTDKGHNVETIKGMAPDGADYVKTTV ----------2222-----------2222---------------2222-2222-----22 GQEAVVKFDKEGVYGFKCAPHYMMGMVALVVVGDKRDNLEAAKSVQHNKLTQKRLDPLFA 22---------------33331111----------1111--1111-------------11 QIQ 11- >PHENOL 2-MONOOXYGENASE; SWP:P15245; PDB:1PN0A; TKYSESYCDVLIVGAGPAGLMAARVLSEYVRQKPDLKVRIIDKRSTKVYNGQADGLQCRT --------------------------------3333------------------------ LESLKNLGLADKILSEANDMSTIALYNPDENGHIRRTDRIPDTLPGISRYHQVVLHQGRI ----1111--------------------1111-----------2222--------3333- ERRILDSIAEISDTRIKVERPLIPEKMEIDSSKAEDPEAYPVTMTLRYMSEDESTPLQFG -----------------------------3333--1111----------3333------- HKTENGLFRSNLQTQEEEDANYRLPEGKEAGEIETVHCKYVIGCDGGHSWVRRTLGFEMI -----------------1111---22222222-------------1111----------- GEQTDYIWGVLDAVPASNFPDIRSRCAIHSAESGSIMIIPRENNLVRFYVQLQATKFTPE --------------------1111-----------------iiii--------------- VVIANAKKIFHPYTFDVQQLDWFTAYHIGQRVTEKFSKDERVFIAGDACHTHSPKAGQGM -------------------------------------%%%%----3333----1111--- NTSMMDTYNLGWKLGLVLTGRAKRDILKTYEEERQPFAQALIDFDHQFSRLFSGRPAKDV ----------------1111--------------------------------------11 ADEMGVSMDVFKEAFVKGNEFASGTAINYDENLVTDKKSSKQELAKNCVVGTRFKSQPVV 11-----------------------------1111-1111333311112222-------- RHSEGLWMHFGDRLVTDGRFRIIVFAGKATDATQMSRIKKFAAYLDSENSVISRYTPKGA --------3333---------------3333---------------1111------2222 DRNSRIDVITIHSCHRDDIEMHDFPAPALHPKWQYDFIYADCDSWHHPHPKSYQAWGVDE --------------3333-1111--------------------1111------------- 
TKGAVVVVRPDGYTSLVTDLEGTAEIDRYFSGILVEPKEKSGAQTEADWTKS --------1111------1111-------------------------1111- >PEROXISOMAL HYDRATASE-DEH; SWP:P22414; PDB:1PN2A; DPVWRFDDRDVILYNIALGATTKQLKYVYENDSDFQVIPTFGHLITFNSGKSQNSFAKLL ---------------1111-333311111111------33333333--1111-1111--- RNFNPLLLHGEHYLKVHSWPPPTEGEIKTTFEPIATTPKGTNVVIVHGSKSVDNKSGELI --------------------------------------!!!!------------------ YSNEATYFIRNCQADNKVYADRPAFATNQFLAPKRAPDYQVDVPVSEDLAALYRLSGDRN ----------------------3333-------------------111133333333--- PLHIDPNFAKGAKFPKPILHGCTYGLSAKALIDKFGFNEIKARFTGIVFPGETLRVLAWK 1111-----1111-----------------------------------2222-------- ESDDTIVFQTHVVDRGTIAINNAAIKLVG ------------1111------------- >GLYCOSYLTRANSFERASE GTFA; SWP:P96558; PDB:1PN3A; MRVLITGCGSRGDTEPLVALAARLRELGADARMCLPPDYVERCAEVGVPMVPVGRAVRAG ---------3333-----------1111-------3333--------------------- AREPGELPPGAAEVVTEVVAEWFDKVPAAIEGCDAVVTTGLLPAAVAVRSMAEKLGIPYR --1111-1111---------------3333------------------------------ YTVLSPDHLPSEQSQAERDMYNQGADRLFGDAVNSHRASIGLPPVEHLYDYGYTDQPWLA --------3333-3333---------------------------------1111------ ADPVLSPLRPTDLGTVQTGAWILPDERPLSAELEAFLAAGSTPVYVGFGSSSRPATADAA -3333------------------------------------------!!!!-3333---- KMAIKAVRASGRRIVLSRGWADLVLPDDGADCFVVGEVNLQELFGRVAAAIHHDSAGTTL -------3333-------1111------1111---------------------------- LAMRAGIPQIVVRRVVDNVVEQAYHADRVAELGVGVAVDGPVPTIDSLSAALDTALAPEI -----------------1111--------------------------------1111--- RARATTVADTIRADGTTVAAQLLFDAVSLEK -----3333---------------------- >NACHT-, LRR- AND PYD-CONT; SWP:P06654; PDB:1PN5A; MAGGAWGRLACYLEFLKKEELKEFQLLLANKAHSRSSSGETPAQPEKTSGMEVASYLVAQ ------3333------3333-----------------------------------3333- YGEQRAWDLALHTWEQMGLRSLCAQAQEGAGHS -----------3333--------3333------ >GLUTATHIONE S-TRANSFERASE; SWP:Q93113; PDB:1PN9A; MDFYYLPGSAPCRAVQMTAAAVGVELNLKLTDLMKGEHMKPEFLKLNPQHCIPTLVDNGF -----3333----------------------111111113333---1111------iiii 
ALWESRAIQIYLAEKYGKDDKLYPKDPQKRAVVNQRLYFDMGTLYQRFADYHYPQIFAKQ ------------------3333-------------------------------------- PANPENEKKMKDAVGFLNTFLEGQEYAAGNDLTIADLSLAATIATYEVAGFDFAPYPNVA --------------------2222-1111-----------------------1111---- AWFARCKANAPGYALNQAGADEFKAKFLS ---------2222---------------- >PROFILIN; SWP:P02584; PDB:1PNE; AGWNAYIDNLMADGTCQDAAIVGYKDSPSVWAAVPGKTFVNITPAEVGILVGKDRSSFFV ----------1111-------------------22223333--------------3333- NGLTLGGQKCSVIRDSLLQDGEFTMDLRTKSTGGAPTFNITVTMTAKTLVLLMGKEGVHG ----iiii-------1111------------iiii-------------------2222-- GMINKKCYEMASHLRRSQY --------------1111- >SCORPION TOXIN; SWP:P31719; PDB:1PNH; TVCNLRRCQLSCRSLGLLGKCIGVKCECVKH -----------3333---------------- >NAD(P) TRANSHYDROGENASE S; SWP:Q59765; PDB:1PNOA; SGHIEGRHMAGSAEDAAFIMKNASKVIIVPGYGMAVAQAQHALREMADVLKKEGVEVSYA ----!!!!-----------1111-------33331111---------------------- IHPVAGRMPGHMNVLLAEANVPYDEVFELEEINSSFQTADVAFVIGANDVTNPAAKTDPS -------2222-----1111-3333--333311111111--------11111111--111 SPIYGMPILDVEKAGTVLFIKRSMASGYAGVENELFFRNNTMMLFGDAKKMTEQIVQAMN 1-2222---3333-------------3333-------1111-----3333------3333 >CYTOCHROME P450 2B4; SWP:P00178; PDB:1PO5A; GKLPPGPSPLPVLGNLLQMDRKGLLRSFLRLREKYGDVFTVYLGSRPVVVLCGTDAIREA ------------!!!!---3333-------------------!!!!-------------- LVDQAEAFSGRGKIAVVDPIFQGYGVIFANGERWRALRRFSLATMRDFGMGKRSVEERIQ ----3333-----3333------------------------------------------- EEARCLVEELRKSKGALLDNTLLFHSITSNIICSIVFGKRFDYKDPVFLRLLDLFFQSFS ------------%%%%-------------------------1111--------------- LISSFSSQVFELFSGFLKHFPGTHRQIYRNLQEINTFIGQSVEKHRATLDPSNPRDFIDV --1111-------------------------------------------1111------- YLLRMEKDKSDPSSEFHHQNLILTVLSLFFAGTETTSTTLRYGFLLMLKYPHVTERVQKE ----------1111---------------------------------------------- IEQVIGSHRPPALDDRAKMPYTDAVIHEIQRLGDLIPFGVPHTVTKDTQFRGYVIPKNTE -----------33331111----------------1111----------iiii--2222- VFPVLSSALHDPRYFETPNTFNPGHFLDANGALKRNEGFMPFSLGKRICLGEGIARTELF 
---3333---3333--1111---11111111----11111111-----1111-------- LFFTTILQNFSIASPVPPEDIDLTPRESGVGNVPPSYQIRFLARH ----------------3333------iiii--------------- >PHOSPHOLIPASE A2; SWP:P00598; PDB:1POA; NLYQFKNMIQCTVPSRSWWDFADYGCYCGRGGSGTPVDDLDRCCQVHDNCYNEAEKISGC 3333--------11111111------------------------------------2222 WPYFKTYSYECSQGTLTCKGGNNACAAAVCDCDRLAAICFAGAPYNDNDYNINLKARC 3333-------!!!!-----------------------3333---1111---3333-- >PHOSPHOLIPASE A2; SWP:P00630; PDB:1POC; IIYPGTLWCGHGNKSSGPNELGRFKHTDACCRTHDMCPDVMSAGESKHGLTNTASHTRLS --2222----------1111---------------------2222-iiii---------- CDCDDKFYDCLKNSADTISSYFVGKMYFNLIDTKCYKLEHPVTGCGERTEGRCLHYTVDK ----------1111--------------------------------------------11 SKPKVYQWFDLRKY 11------------ >OCT-1 POU HOMEODOMAIN DNA; SWP:P14859; PDB:1POG; RRRKKRTSIETNIRVALEKSFLENQKPTSEEITMIADQLNMEKEVIRVWFCNRRQKEKRI ---------------------------3333---------------------1111---- DI -- >GLUTACONATE COENZYME A-TR; SWP:Q59111; PDB:1POIA; SKVMTLKDAIAKYVHSGDHIALGGFTTDRKPYAAVFEILRQGITDLTGLGGAAGGDWDML ------------------------!!!!----------1111------------------ IGNGRVKAYINCYTANSGVTNVSRRFRKWFEAGKLTMEDYSQDVIYMMWHAAALGLPFLP ------------------------------------------------------------ VTLMQGSGLTDEWGISKEVRKTLDKVPDDKFKYIDNPFKPGEKVVAVPVPQVDVAIIHAQ ------3333-----33331111------------1111--------------------- QASPDGTVRIWGGKFQDVDIAEAAKYTIVTCEEIISDEEIRRDPTKNDIPGMCVDAVVLA --1111-------!!!!3333---------------3333--3333---3333------2 PYGAHPSQCYGLYDYDNPFLKVYDKVSKTQEDFDAFCKEWVFDLKDHDEYLNKLGATRLI 222-------------3333----1111------------1111-3333------3333- NLKVVPGLGYHIDMTKE ------------1111- >Glutaconate CoA-transfera; SWP:Q59112; PDB:1POIB; DYTNYTNKEMQAVTIAKQIKNGQVVTVGTGLPLIGASVAKRVYAPDCHIIVESGLMDCSP ---------------11112222---------------------------1111------ VEVPRSVGDLRFMAHCGCIWPNVRFVGFEINEYLHKANRLIAFIGGAQIDPYGNVNSTSI --------3333--------3333--------3333-------------1111------- GDYHHPKTRFTGSGGANGIATYSNTIIMMQHEKRRFMNKIDYVTSPGWIDGPGGRERLGL 
-3333-------!!!!---------------1111---------------2222------ PGDVGPQLVVTDKGILKFDEKTKRMYLAAYYPTSSPEDVLENTGFDLDVSKAVELEAPDP ----------1111----------------1111-------------------------- AVIKLIREEIDPGQAFIQVP ----------1111------ >SPERMIDINE/PUTRESCINE-BIN; SWP:P0AFK9; PDB:1POT; NNTLYFYNWTEYVPPGLLEQFTKETGIKVIYSTYESNETMYAKLKTYKDGAYDLVVPSTY --------1111------------------------------1111-----------111 YVDKMRKEGMIQKIDKSKLTNFSNLDPDMLNKPFDPNNDYSIPYIWGATAIGVNGDAVDP 1-------------33331111---1111-----1111--------------------11 KSVTSWADLWKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPKEIEAAYNELKKLMPN 11--3333--3333--------3333------1111-1111-----------33333333 VAAFNSDNPANPYMEGEVNLGMIWNGSAFVARQAGTPIDVVWPKEGGIFWMDSLAIPANA -------3333-------------------3333--------1111----------1111 KNKEGALKLINFLLRPDVAKQVAETIGYPTPNLAARKLLSPEVANDKTLYPDAETIKNGE ---------------------------------3333--1111--------33331111- WQNDVGAASSIYEEYYQKLKAG ---------------------- >VOLVATOXIN A2; SWP:Q6USC4; PDB:1PP0A; NVFQPVDQLPEDLIPSSIQVLKFSGKYLKLEQDKAYFDWPGFKTAIDNYTGEDLSFDKYD ---------3333-----------1111--%%%%-------------------------- QSTINQQSQEVGAMVDKIAKFLHDAFAAVVDLSKLAAIILNTFTNLEEESSSGFLQFNTN -------------------------1111----------------3333--1111----- NVKKNSSWEYRVLFSVPFAPSYFYSLVTTILITADIEEKTGWWGLTSSTKKNFAVQIDAL -------------------------------------3333----1111----------- ELVVKKGFKAP ----------- >Phospholipase A2 [Precurs; SWP:P00624; PDB:1PP2R; SLVQFETLIMKIAGRSGLLWYSAYGCYCGWGGHGLPQDATDRCCFVHDCCYGKATDCNPK 3333-----------3333-------------------3333-------3333----333 TVSYTYSEENGEIICGGDDPCGTQICECDKAAAICFRDNIPSYDNKYWLFPPKDCREEPE 3-------iiii--------------------------3333-3333---1111------ PC -- >39 KDA INITIATOR BINDING ; SWP:NA; PDB:1PP7U; DLEASFTSRLPPEIVAALKRKSSRDPNSRFPRKLHMLLTYLASNPQLEEEIGLSWISDTE -33333333---------------1111-------------------------------- FKMKKKNVALVMGIKLNTLNVNLRDLAFEQLQHDKGGWTQWKRSGFTRNSVFED ---------1111----------1111-------iiii----2222-------- >Leukocyte elastase [Precu; SWP:P08246; PDB:1PPFE; 
IVGGRRARPHAWPFMVSLQLRGGHFCGATLIAPNFVMSAAHCVANVNVRAVRVVLGAHNL -------2222--------------------1111---3333----3333--------11 SRREPTRQVFAVQRIFENGYDPVNLLNDIVILQLNGSATINANVQVAQLPAQGRRLGNGV 11-3333---------------------------------1111------------2222 QCLAMGWGLLGRNRGIASVLQELNVTVVTSLCRRSNVCTLVRGRQAGVCFGDSGSPLVCN --------------------------------1111------------2222-------- GLIHGIASFVRGGCASGLYPDAFAPVAQFVNWIDSIIQ ------------------------3333---------- >CYTOCHROME B; SWP:P31800; PDB:1PPJA; ATYAQALQSVPETQVSQLDNGLRVASEQSSQPTCTVGVWIDAGSRYESEKNNGAGYFVEH -33331111--------1111----------------------11113333--------- LAFKGTKNRPGNALEKEVESMGAHLNAYSTREHTAYYIKALSKDLPKAVELLADIVQNCS 1111-3333!!!!-----1111------------------3333---------------- LEDSQIEKERDVILQELQENDTSMRDVVFNYLHATAFQGTPLAQSVEGPSENVRKLSRAD ------------------------------------2222333333333333-------- LTEYLSRHYKAPRMVLAAAGGLEHRQLLDLAQKHFSGLSGTYDEDAVPTLSPCRFTGSQI ---------3333---------------------3333---------------------- CHREDGLPLAHVAIAVEGPGWAHPDNVALQVANAIIGHYDCTYGGGAHLSSPLASIAATN ---1111------------3333----------------1111-!!!!------------ KLCQSFQTFNICYADTGLLGAHFVCDHMSIDDMMFVLQGQWMRLCTSATESEVLRGKNLL -------------------------1111------------------------------- RNALVSHLDGTTPVCEDIGRSLLTYGRRIPLAEWESRIAEVDARVVREVCSKYFYDQCPA ----3333---------------------3333--------------------------- VAGFGPIEQLPDYNRIRSGMFW ------1111------------ >Ubiquinol-cytochrome-c re; SWP:P23004; PDB:1PPJB; EVPPHPQDLEFTRLPNGLVIASLENYAPASRIGLFIKAGSRYENSNNLGTSHLLRLASSL -------------1111----------------------11111111---------1111 TTKGASSFKITRGIEAVGGKLSVTSTRENMAYTVECLRDDVDILMEFLLNVTTAPEFRRW -1111---------1111------------------1111-------------------- EVAALQPQLRIDKAVALQNPQAHVIENLHAAAYRNALANSLYCPDYRIGKVTPVELHDYV -----------------------------------3333----3333------------- QNHFTSARMALIGLGVSHPVLKQVAEQFLNIRGGLGLSGAKAKYHGGEIREQNGDSLVHA ----1111---------------------------------------------------- ALVAESAAIGSAEANAFSVLQHVLGAGPHVKRGSNATSSLYQAVAKGVHQPFDVSAFNAS 
-------22223333--------------2222-1111---------------------- YSDSGLFGFYTISQAASAGDVIKAAYNQVKTIAQGNLSNPDVQAAKNKLKAGYLMSVESS -------------3333--------------1111------------------1111--- EGFLDEVGSQALAAGSYTPPSTVLQQIDAVADADVINAAKKFVSGRKSMAASGNLGHTPF ------------------------------3333-------1111---------1111-1 IDEL 111- >Cytochrome b; SWP:P00157; PDB:1PPJC; NNAFIDLPAPSNISSWWNFGSLLGICLILQILTGLFLAMHYTSDTTTAFSSVTHICRDVN -------------3333--------------------------3333-----------22 YGWIIRYMHANGASMFFICLYMHVGRGLYYGSYTFLETWNIGVILLLTVMATAFMGYVLP 22-------------------------------------------------------333 WGQMSFWGATVITNLLSAIPYIGTNLVEWIWGGFSVDKATLTRFFAFHFILPFIIMAIAM 3------------3333-------------------3333-------------------- VHLLFLHETGSNNPTGISSDVDKIPFHPYYTIKDILGALLLILALMLLVLFAPDLLGDPD -----3333---1111--1111-----1111--------------------1111--333 NYTPANPLNTPPHIKPEWYFLFAYAILRSIPNKLGGVLALAFSILILALIPLLHTSKQRS 3----1111-------1111------1111--------------33333333-------- MMFRPLSQCLFWALVADLLTLTWIGGQPVEHPYITIGQLASVLYFLLILVLMPTAGTIEN 1111-------------------1111--------------------------------- KLLKW 1111- >Cytochrome c1 heme protei; SWP:P00125; PDB:1PPJD; SDLELHPPSYPWSHRGLLSSLDHTSIRRGFQVYKQVCSSCHSMDYVAYRHLVGVCYTEDE -----------11111111-----------------------111133332222------ AKALAEEVEVQDGPNEDGEMFMRPGKLSDYFPKPYPNPEAARAANNGALPPDLSYIVRAR --------------1111-------3333---------------iiii------3333-2 HGGEDYVFSLLTGYCEPPTGVSLREGLYFNPYFPGQAIGMAPPIYNEVLEFDDGTPATMS 222--------------2222--2222--1111-----------2222--1111------ QVAKDVCTFLRWAAEPEHDHRKRMGLKMLLMMGLLLPLVYAMKRHKWSVLKSRKLAYRPP --------------1111------------------------------------------ K - >Ubiquinol-cytochrome c re; SWP:P00129; PDB:1PPJF; WLEGIRKWYYNAAGFNKLGLMRDDTIHENDDVKEAIRRLPENLYDDRVFRIKRALDLSMR 3333---------3333---3333-----------1111------------------111 QQILPKEQWTKYEEDKSYLEPYLKEVIRERKEREEWAKK 1---3333--3333----3333-------------3333 >Ubiquinol-cytochrome c re; SWP:P13271; PDB:1PPJG; 
GRQFGHLTRVRHVITYSLSPFEQRAFPHYFSKGIPNVLRRTRACILRVAPPFVAFYLVYT ------------------1111---------------------3333------------- WGTQEFEKSKRKNPA -------1111---- >Ubiquinol-cytochrome c re; SWP:P00126; PDB:1PPJH; LVDPLTTVREQCEQLEKCVKARERLELCDERVSSRSQTEEDCTEELLDFLHARDHCVAHK --3333------------------------------------------------------ LFNSLK -1111- >Ubiquinol-cytochrome c re; SWP:P13272; PDB:1PPJI; AAVPATSESPVLSVLCRESLRGQAAGRPLVASVSLNVPASVRY ---------------33332222-------------------- >Ubiquinol-cytochrome c re; SWP:P00130; PDB:1PPJJ; FFERAFDQGADAIYEHINEGKLWKHIKHKYENK -----------------22223333--1111-- >PROTEASE OMEGA; SWP:P10056; PDB:1PPO; LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS -----3333-----------------------------------------------3333 HGCKGGYPPYALEYVAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGN !!!!-----------------3333----------3333--------------------- LLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNS ------------------------------------------------------------ WGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTKN ------iiii-----------2222----------- >Peridinin-chlorophyll a-b; SWP:P80484; PDB:1PPRM; DEIGDAAKKLGDASYAFAKEVDWNNGIFLQAPGKLQPLEALKAIDKMIVMGAAADPKLLK -----------------111111113333------------------------------- AAAEAHHKAIGSISGPNGVTSRADWDNVNAALGRVIASVPENMVMDVYDSVSKITDPKVP ---------1111-1111---------------------3333--------11111111- AYMKSLVNGADAEKAYEGFLAFKDVVKKSQVTSAAGPATVPSGDKIGVAAQQLSEASYPF ---1111------------------3333------------------------------- LKEIDWLSDVYMKPLPGVSAQQSLKAIDKMIVMGAQADGNALKAAAEAHHKAIGSIDATG ----11113333--2222---------------------------------3333-1111 VTSAADYAAVNAALGRVIASVPKSTVMDVYNAMAGVTDTSIPLNMFSKVNPLDANAAAKA -----------------11113333-------3333-3333----1111----------- FYTFKDVVQAAQ ------------ >AVIAN PANCREATIC POLYPEPT; SWP:P68249; PDB:1PPT; GPSQPTYPGDDAPVEDLIRFYDNLQQYLNVVTRHRY --------11113333-------------------- >Bcl-2-like protein 11; SWP:O54918; PDB:1PQ1B; DLRPEIRIAQELRRIGDEFNETYTRRVFANDYR 
--3333--------------------------- >CYTOCHROME P450 2C8; SWP:P10632; PDB:1PQ2A; KLPPGPTPLPIIGNMLQIDVKDICKSFTNFSKVYGPVFTVYFGMNPIVVFHGYEAVKEAL -----------!!!!--------------3333--------!!!!--------------- IDNGEEFSGRGNSPISQRITKGLGIISSNGKRWKEIRRFSLTTLRNFGMGKRSIEDRVQE ---3333------------iiii------3333--------------------------- EAHCLVEELRKTKASPCDPTFILGCAPCNVICSVVFQKRFDYKDQNFLTLMKRFNENFRI -----------iiii-------------------------11113333------------ LNSPWIQVCNNFPLLIDCFPGTHNKVLKNVALTRSYIREKVKEHQASLDVNNPRDFIDCF ---3333----33331111------------------------1111-3333-------- LIKMEQEKDNQKSEFNIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVTAKVQEEI ---1111--------3333----------------------------------------- DHVIGRHRSPCMQDRSHMPYTDAVVHEIQRYSDLVPTGVPHAVTTDTKFRNYLIPKGTTI ------------3333------------------1111----------------2222-- MALLTSVLHDDKEFPNPNIFDPGHFLDKNGNFKKSDYFMPFSAGKRICAGEGLARMELFL --3333---------1111---11111111----11111111-----1111--------- FLTTILQNFNLKSVDDLKNLNTTAVTKGIVSLPPSYQICFIPV ---------------3333------------------------ >ARGINASE II, MITOCHONDRIA; SWP:P78540; PDB:1PQ3A; HSVAVIGAPFSQGQKRKGVEHGPAAIREAGLMKRLSSLGCHLKDFGDLSFTPVPKDDLYN -----------------3333--------------3333-------------------%% NLIVNPRSVGLANQELAEVVSRAVSDGYSCVTLGGDHSLAIGTISGHARHCPDLCVVWVD %%------------------------------------------------1111------ AHADINTPLTTSSGNLHGQPVSFLLRELQDKVPQLPGFSWIKPCISSASIVYIGLRDVDP ------3333----1111-3333-3333------2222-------1111----------- PEHFILKNYDIQYFSMRDIDRLGIQKVMERTFDLLIGKRQRPIHLSFDIDAFDPTLAPAT -----------------------------------1111--------1111-3333---- GTPVVGGLTYREGMYIAEEIHNTGLLSALDLVEVNPQLATSEEEAKTTANLAVDVIASSF --------------------3333----------3333-------------------111 GQTREG 1----- >periplasmic binding prote; SWP:NA; PDB:1PQ4A; DAMDITVSIPPQQYFLEKIGGDLVRVSVLVPGNNDPHTYEPKPQQLAALSEAEAYVLIGL --------3333-------!!!!-------11113333---3333---3333------ii GFEQPWLEKLKAANANMKLIDSAQGITPLEMEKMVADPHIWLSPTLVKRQATTIAKELAE ii3333--------------1111--------------1111-3333------------- 
LDPDNRDQYEANLAAFLAELERLNQELGQILQPLPQRKFIVFHPSWAYFARDYNLVQIPI -3333-------------------------1111-------------------------- EVEGQEPSAQELKQLIDTAKENNLTMVFGETQFSTKSSEAIAAEIGAGVELLDPLAADWS -%%%%--------------1111------1111-------------------1111---- SNLKAVAQKIANANS -----------1111 >TRYPSIN; SWP:P35049; PDB:1PQ7A; IVGGTSASAGDFPFIVSISRNGGPWCGGSLLNANTVLTAAHCVSGYAQSGFQIRAGSLSR -------22221111----iiii---------------33332222-------------- TSGGITSSLSSVRVHPSYSGNNNDLAILKLSTSIPSGGNIGYARLAASGSDPVAGSSATV --------------1111!!!!--------------!!!!------2222--2222---- AGWGATSEGGSSTPVNLLKVTVPIVSRATCRAQYGTSAITNQMFCAGVSSGGKDSCQGDS ------2222------------------------3333-1111----1111----2222- GGPIVDSSNTLIGAVSWGNGCARPNYSGVYASVGALRSFIDTYA -----1111-------------2222-----3333--------- >ASPARTATE 1-DECARBOXYLASE; SWP:P31664; PDB:1PQHA; SMIRTMLQGKLHRVKVTHADLHYEGTCAIDQDFLDAAGILENEAIDIWNVTNGKRFSTYA ---------------------------------------2222----------------- IAAERGSRIISVNGAAAHASVGDIVIIASFVTMPDEEARTWRPNVAYFEGDNEMKR ---2222------------2222----------3333------------------- >LYSOZYME; SWP:P00720; PDB:1PQKA; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVDAAVRAVLRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSIRY ---------------------------1111----------------------------- LQQKRWDEAAVNFAKSRWYNQTPNRAKRIITVFRTGTWDAYKNL 1111---------------------------------3333--- >ASPARTATE-SEMIALDEHYDE DE; SWP:P44801; PDB:1PQUA; MKNVGFIGWRGMVGSVLMDRMSQENDFENLNPVFFTTSQAGQKAPVFGGKDAGDLKSAFD -------1111-------------1111----------2222----iiii------1111 IEELKKLDIIVTCQGGDYTNEVYPKLKATGWDGYWVDAASALRMKDDAIIVLDPVNQHVI 3333----------------------1111---------1111-1111---3333----- SEGLKKGIKTFVGGNCTVSLMLMAIGGLFEKDLVEWISVATYQAASGAGAKNMRELLSQM ----------------------------1111------------3333------------ GLLEQAVSSELKDPASSILDIERKVTAKMRADNFPTDNFGAALGGSLIPWIDKLLPETGQ --------33331111--------------3333-3333---2222-------------- 
TKEEWKGYAETNKILGLSDNPIPVDGLCVRIGALRCNSQAFTIKLKKDLPLEEIEQIIAS ----------------3333---------------------------------------- HNEWVKVIPNDKEITLRELTPAKVTGTLSVPVGRLRKLAMGPEYLAAFTVGDQLLWGAAE -------------------33332222----------3333----------1111----- PVRRILKQLVA ----------- >POLYKETIDE SYNTHASE; SWP:P96202; PDB:1PQWA; NEAATFGVAYLTAWHSLCEVGRLSPGERVLIHSATGGVGMAAVSIAKMIGARIYTTAGSD 3333-------------------2222-----1111------------------------ AKREMLSRLGVEYVGDSRSVDFADEILELTDGYGVDVVLNSLAGEAIQRGVQILAPGGRF -----1111------1111----------iiii-----------------11112222-- IELGKKDVYADASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVL ----1111------3333------------------------------------------ PVT --- >MCMV M144; SWP:Q69G19; PDB:1PQZA; GSESGLRYAYTLVVDGTANTARCFGTGHVDGEAFVGYSNNKTHGIGRWVNASHVEEENKE --------------3333----------iiii-----%%%%------------------- FVRQCKELQAELDKMQNNSAVIGVKTVQLDVGCTSKIEKHYAYDGNETEDDTATSASERA -----------3333---------------------------iiii---------2222- RDCQKKLTEYRKLVLASAVSPQLEVERRSSGREGGMRLRCFARDYYPADLEIRWWKDDGG -------------1111------------------------------------------- GGALPQTSKQHHDPLPSGQGLYQKHIDVYVDGGLEHVYSCRVKGIATGLELQIVRWKG ------------------------------22221111-----3333----------- >L-XYLULOSE REDUCTASE; SWP:Q7Z4W1; PDB:1PR9A; MELFLAGRRVLVTGAGKGIGRGTVQALHATGARVVAVSRTQADLDSLVRECPGIEPVCVD ----2222-------------------------------3333-------2222-----1 LGDWEATERALGSVGPVDLLVNNAAVALLQPFLEVTKEAFDRSFEVNLRAVIQVSQIVAR 111---------------------------1111-------------------------- GLIARGVPGAIVNVSSQCSQRAVTNHSVYCSTKGALDMLTKVMALELGPHKIRVNAVNPT ---------------1111---2222--------------------3333---------- VVMTSMGQATWSDPHKAKTMLNRIPLGKFAEVEHVVNAILFLLSDRSGMTTGSTLPVEGG ----3333----3333---33331111---3333---------3333------------3 FWAC 333- -------------------------------------------------------- >PURINE REPRESSOR; SWP:P15039; PDB:1PRU; MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKV --------------------3333----------------1111------------ >HORF6; SWP:P30041; PDB:1PRXA; 
LLLGDVAPNFEANTTVGRIRFHDFLGDSWGILFSHPRDFTPVTTELGRAAKLAPEFAKRN -2222--------1111----------------------------------33333333- VKLIALSIDSVEDHLAWSKDINAYNSEEPTEKLPFPIIDDRNRELAILLGMLDPAEKDEK ----------------------1111-------------1111----------------- GMPVTARVVFVFGPDKKLKLSILYPATTGRNFDEILRVVISLQLTAEKRVATPVDWKDGD --1111------1111--------1111-------------------------------- SVMVLPTIPEEEAKKLFPKGVFTKELPSGKKYLRYTPQP ----3333--------1111-----1111---------- >FIBRILLARIN-LIKE PRE-RRNA; SWP:Q8U4M2; PDB:1PRYA; VEVKKHKFPGVYVVIDDDGSEKIATKNLVPGQRVYGERVIKWEGEEYRIWNPHRSKLGAA -------2222----1111---------22221111-----iiii-----1111------ IVNGLKNFPIKPGKSVLYLGIASGTTASHVSDIVGWEGKIYGIEFSPRVLRELVPIVEER ----------2222-----1111-----------1111----------------1111-1 RNIIPILGDATKPEEYRALVTKVDVIFEDVAQPTQAKILIDNAKAYLKRGGYGMIAVKSR 111-----3333---3333------------1111------------2222------333 SIDVTKEPEQVFKEVERLLSEYFEVIERLNLEPYEKDHALFVVRKP 3--------------------------------------------- >PENTALENENE SYNTHASE; SWP:Q55012; PDB:1PS1A; QDVDFHIPLPGRQSPDHARAEAEQLAWPRSLGLIRSDAAAERHLRGGYADLASRFYPHAT -------------1111-----------1111---3333----------------1111- GADLDLGVDLMSWFFLFDDLFDGPRGENPEDTKQLTDQVAAALDGPLPDTAPPIAHGFAD ----------------3333--3333---------------1111--11113333----- IWRRTCEGMTPAWCARSARHWRNYFDGYVDEAESRSAAQYLAMRRHTIGVQPTVDLAERA -----22223333-----------------3333-3333---3333-------------- GRFEVPHRVFDSAVMSAMLQIAVDVNLLLNDIASLEKEEARGEQNNMVMILRREHGWSKS -----3333-----------------------------1111---------------333 RSVSHMQNEVRARLEQYLLLESCLPKVGEIYQLDTAEREALERYRTDAVRTVIRGSYDWH 3---------------------------1111---------------------------- RSSG ---- >2,4-DIENOYL-COA REDUCTASE; SWP:P42593; PDB:1PS9A; SYPSLFAPLDLGFTTLKNRVLMGSMHTGLEEYPDGAERLAAFYAERARHGVALIVSGGIA -3333------------------------------------------------------- PDLTGVGMEGGAMLNDASQIPHHRTITEAVHQEGGKIALQILHTGRYSYQPHLVAPSALQ --1111-2222----3333-----------1111---------!!!!--1111------- APINRFVPHELSHEEILQLIDNFARCAQLAREAGYDGVEVMGSEGYLINEFLTLRTNQRS 
1111--------------------------------------iiii3333--3333---- DQWGGDYRNRMRFAVEVVRAVRERVGNDFIIIYRLSMLDLVEDGGTFAETVELAQAIEAA 1111-3333-------------------------------2222-------------333 GATIINTGIGWHEARIPTIATPVPRGAFSWVTRKLKGHVSLPLVTTNRINDPQVADDILS 3---------3333-----3333--1111---1111------------------------ RGDADMVSMARPFLADAELLSKAQSGRADEINTCIGCNQACLDQIFVGKVTSCLVNPRAC ---------3333--1111---11113333-------1111---1111----1111-222 HETKMPILPAVQKKNLAVVGAGPAGLAFAINAAARGHQVTLFDAHSEIGGQFNIAKQIPG 21111----------------3333-------1111-----------------3333222 KEEFYETLRYYRRMIEVTGVTLKLNHTVTADQLQAFDETILASGIVPRTPPIDGIDHPKV 2----------------------------------------------------1111--- LSYLDVLRDKAPVGNKVAIIGCGGIGFDTAMYLSQPGESTSQNIAGFCNEWGIDSSLQQA -----------------------------------------------------1111-22 GGLSPQGMQIPRSPRQIVMLQRKASKPGQGLGKTTGWIHRTTLLSRGVKMIPGVSYQKID 22-1111------------------2222--1111--------1111------------1 DDGLHVVINGETQVLAVDNVVICAGQEPNRALAQPLIDSGKTVHLIGGCDVAMELDARRA 111----iiii------------------1111-------------1111---------- IAQGTRLALEI ----------- >D-3-PHOSPHOGLYCERATE DEHY; SWP:P0A9T0; PDB:1PSDA; EKDKIKFLLVEGVHQKALESLRAAGYTNIEFHKGALDDEQLKESIRDAHFIGLRSRTHLT ------------------------------------------------------------ EDVINAAEKLVAIGCFCIGTNQVDLDAAAKRGIPVFNAPFSNTRSVAELVIGELLLLLRG ---1111------------1111-33331111---------------------------- VPEANAKAHRGVWNKLAAGSFEARGKKLGIIGYGHIGTQLGILAESLGMYVYFYDIENKL -------1111-----1111--2222-------3333-------1111------------ PLGNATQVQHLSDLLNMSDVVSLHVPENPSTKNMMGAKEISLMKPGSLLINASRGTVVDI ---------------------------3333------------2222------1111--- PALCDALASKHLAGAAIDVFPTEPATNSDPFTSPLCEFDNVLLTPHIGGSTQEAQENIGL ------1111---------------1111---3333------------------------ EVAGKLIKYSDNGSTLSAVNFPEVSLPLHGGRRLMHIHENRPGVLTALNKIFAEQGVNIA --------------1111----------------------2222--------1111---- AQYLQTSAQMGYVVIDIEADEDVAEKALQAMKAIPGTIRARLLY ---------------------------------2222------- >PHOTOSYSTEM I ACCESSORY P; SWP:P31969; PDB:1PSE; 
AIERGSKVKILRKESYWYGDVGTVASIDKSGIIYPVIVRFNKVNYNGFSGSAGGLNTNNF ----------------2222---------------------------------------- AEHELEVVG 3333----- >ANTIBODY; SWP:NA; PDB:1PSKH; EVQLQQSGPELVKPGASVKISCKTSGYTFTKYTMHWVKQSHGKSLEWIGDINPNNGGTNY ---------------------------3333----------------------------- NQKFKGTATLTVHKSSTTAYMELRSLTSEDSAVYYCTSKSFDYWGQGTTLTVSSAKTTAP 3333---------1111---------1111--------%%%%------------------ SVYPLAPVAVTLGCLVKGYFPEPVTLTWNSSGVHTFPAVLQSDLYTLSSSVTVTSSTWPS -----------------------------------------------------1111--- QSITCNVAHPASSTKVDK ------------------ >SPAM-H1; SWP:Q26019; PDB:1PSM; EAYKKAKQASQDAEQAAKDAENASKEAEEAAKEAVNLK -----33331111-1111--3333---1111------- >PROBABLE THIOL PEROXIDASE; SWP:P72500; PDB:1PSQA; VTFLGNPVSFTGKQLQVGDKALDFSLTTTDLSKKSLADFDGKKKVLSVVPSIDTGICSTQ --%%%%---------------------1111---3333---------------------- TRRFNEELAGLDNTVVLTVSDLPFAQKRWCGAEGLDNAILSDYFDHSFGRDYALLINEWH -------1111----------3333---------1111---------------------- LLARAVFVLDTDNTIRYVEYVDNINSEPNFEAAIAAAKAL ---------1111---------1111-------------- >PSORIASIN; SWP:P31151; PDB:1PSRA; SNTQAERSIIGMIDMFHKYTRRDDKIDKPSLLTMMKENFPNFLSACDKKGTNYLADVFEK ----------------11111111----------------------1111-1111----- KDKNEDKKIDFSEFLSLLGDIATDYHKQSHGAAPCSGGSQ -1111----------------------1111--------- >ADP-HEPTOSE LPS HEPTOSYLT; SWP:P37692; PDB:1PSWA; KILVIGPSWVGDSQSLYRTLQARYPQAIIDVAPAWCRPLLSRPEVNEAIPEIGERRKLGH --------3333--------------------3333-3333------------------- SLREKRYDRAYVLPNSFKSALVPLFAGIPHRTGWRGERYGLLNDVRVLDKEAWPLVERYI -3333----------3333-----------------------------3333-------- ALAYDKGIRTAQDLPQPLLWPQLQVSEGEKSYTCNQFSLSSERPIGFCPGAEFGPAKRWP ---------3333---------------------1111------------11111111-3 HYHYAELAKQLIDEGYQVVLFGSAKDHEAGNEILAALNTEQQAWCRNLAGETQLDQAVIL 333--------1111-------3333-------111133331111--2222--------- IAACKAIVTNDSGLHVAAALNRPLVALYGPSSPDFTPPLSHKARVIRLITGEGYHQSLID 1111-------------1111------------------1111-----------3333-- ITPQRVLEELNALLLQEEA 
------------------- >2S ALBUMIN; SWP:P01089; PDB:1PSYA; AEFMESKGEREGSSSQQCRQEVQRKDLSSCERYLRQSSSRRSTGEEVLRMPGDENQQQES ----------------------------3333---------------------------- QQLQQCCNQVKQVRDECQCEAIKYIAEDQIQQGQLHGEESERVAQRAGEIVSSCGVRCMR --------1111-----3333------------------3333----------------- QTRTN ----- >SURFACE ANTIGEN PSAA; SWP:P0A4G2; PDB:1PSZA; KKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVKKTSEAD --3333--------------------!!!!-------2222------------------- LIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLN -----%%%%-----------------2222---1111------2222-2222---3333- LENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVT ------------------3333---------------------111111111111----- SEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMK ---------------------3333--------------1111----------------- TVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLAK --------------------2222------------------1111 >KALATA B2; SWP:P58454; PDB:1PT4A; CGETCFGGTCNTPGCSCTWPICTRDGLPV ----1111---------------iiii-- >INTEGRIN ALPHA-1; SWP:P56199; PDB:1PT6A; QLDIVIVLDGSNSIYPWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEFNLNKYSS ----------3333-3333-------3333---1111----------------1111--- TEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFTEARGARRGVKKVMVIVTDGESHDNHR -------1111----------------------3333--2222----------------3 LKKVIQDCEDENIQRFSIAILGSYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALV 333-----1111------------1111----------3333--3333------333333 TIVKTLGERIFA 33------1111 >HISTIDINE-CONTAINING PHOS; SWP:P07515; PDB:1PTF; MEKKEFHIVAETGIHARPATLLVQTASKFNSDINLEYKGKSVNLKSIMGVMSLGVGQGSD ---------1111------------1111-------iiii--1111----3333-2222- VTITVDGADEAEGMAAIVETLQKEGLA --------------------------- >4-HYDROXYTHREONINE-4-PHOS; SWP:P19624; PDB:1PTMA; VKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTNRAALGLPLTLRPYSPN -----------1111--------------------------------------------- SPAQPQTAGTLTLLPVALRAPVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVH ------2222------------2222-3333----------------------------- 
KGVINDAGIPFTGHTEFFEERSQAKKVVLATEELRVALATTHLPLRDIADAITPALLHEV ----1111----------------------2222---------33333333--------- IAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHGTEEIDTIIPVLNELRAQGKLNGPLPA -------------------------%%%%iiii-3333---------3333-------11 DTLFQPKYLDNADAVLAYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRG 11--33331111------3333-3333-------------------------3333---- KADVGSFITALNLAIKIVNTQ ----------------3333- >PROTEIN KINASE C DELTA TY; SWP:P28867; PDB:1PTQ; HRFKVYNYMSPTFCDHCGSLLWGLVKQGLKCEDCGMNVHHKCREKVANLC --------------------------------------33331111---- >HYPOTHETICAL PROTEIN MTH6; SWP:O26773; PDB:1PU1A; MSLRKLTEGDLDEISSFLHNTISDFILKRVSAKEIVDIDITVLVEYTDELKVDISAELYL ------------------------------1111-------------------------- DELSDADPGIVDEAVDAAYRSLESFLDGFRE ------------------------3333--- >3-METHYLADENINE DNA GLYCO; SWP:O25323; PDB:1PU6A; LDSFEILKALKSLDLLKNAPAWWWPNALKFEALLGAVLTQNTKFEAVLKSLENLKNAFIL -------------1111--1111--2222---------22223333--------1111-- ENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSGNILKDFQSFENFKQEVTRE ----------------------3333---------------------------------- WLLDQKGIGKESADAILCYACAKEVMVVDKYSYLFLKKLGIEIEDYDELQHFFEKGVQEN ----2222----------------------------1111-------------------- LNSALALYENTISLAQLYARFHGIVEFSKQKLELKL -------%%%%------------------------- >P13SUC1; SWP:P08463; PDB:1PUC; SKSGVPRLLTASERERLEPFIDQIHYSPRYADDEYEYRHVMLPKAMLKAIPTDYFNPETG ----------------33331111-------1111-------333311111111------ TLRILQEEEWRGLGITQSLGWEMYEVHVPEPHILLFKREKD -----3333-1111---------------1111-------- >Transcription factor PU.1; SWP:P17947; PDB:1PUEE; KIRLYQFLLDLLRSGDMKDSIWWVDKDKGTFQFSSKHKEALAHRWGIQKGNRKKMTYEKM ----------------1111-----1111------------------------------- ARALRNYGKTGEVKKVKKKLTYQFSGEV ----3333---------2222------- >HOMEOBOX PROTEIN HOX-A9; SWP:P09631; PDB:1PUFA; NNPAANWLHARSTRKKRCPYTKHQTLELEKEFLFNMYLTRDRRYEVARLLNLTERQVKIW -1111-----1111---------------------------------------------- FQNRRMKMKKINKDRAK ----------------- >HYPOTHETICAL UPF0133 PROT; 
SWP:P17577; PDB:1PUGA; MKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPSLLEDDKEMLE --------------------------%%%%------1111-------3333--------- DLVAAAFNDAARRIEETQKEKMASVSSGMQLPPG ---------------------------------- >PROBABLE GTP-BINDING PROT; SWP:P24253; PDB:1PUIA; FVMSAPDIRHLPSDTGIEVAFAGRSNAGKSSALNTLTNQQLINLFEVADGKRLVDLPGYG ------3333----------------------1111------------------------ EMKRKWQRALGEYLEKRQSLQGLVVLMDIRHPLKDLDQQMIEWAVDSNIAVLVLLTKADK ----------------1111-------1111-------------1111--------3333 LASGARKAQLNMVREAVLAFNGDVQVETFSSLKKQGVDKLRQKLDTWFS -----------------1111---------1111--------------- >CONSERVED HYPOTHETICAL PR; SWP:O31743; PDB:1PUJA; HMAKARREVTEKLKLIDIVYELVDARIPMSSRNPMIEDILKNKPRIMLLNKADKADAAVT ----------3333---------3333-111133333333----------3333------ QQWKEHFENQGIRSLSINSVNGQGLNQIVPASKEILQEKFDRMRAKGVKPRAIRALIIGI -------1111----------2222------------------1111------------2 PNVGKSTLINRLAKKNIAQWVKVGKELELLDTPGILWPKFEDELVGLRLAVTGAIKDSII 222----------------------------------------------------3333- NLQDVAVFGLRFLEEHYPERLKERYGLDEIPEDIAELFDAIGEKRGCLMSGGLINYDKTT 3333-------------------------------------------------------- EVIIRDIRTEKFGRLSFEQPT ---------1111-------- >HYPOTHETICAL PROTEIN C32E; SWP:P91127; PDB:1PULA; NWDDADVKKRWDAFTKFGAATATEMTGKNFDKWLKDAGVLDNKAITGTMTGIAFSKVTGP --------------3333----------------3333---------------------- KKKATFDETKKVLAFVAEDRARQSKKPIQDELDAITEKLAKLE -------------------3333---3333------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9JR91; PDB:1PUZA; MMVFDDIAKRKIRFQTRRGLLELDLIFGRFMEKEFEHLSDKELSEFSEILEFQDQELLAL ----3333------------3333--------------3333------3333-------- INGHSETDKGHLIPMLEKIRRA --------1111---------- >SDA; SWP:Q7WY62; PDB:1PV0A; MRKLSDELLIESYFKATEMNLNRDFIELIENEIKRRSLGHIISVSS ----3333-------------------------11113333----- >Hypothetical 33.9 kDa est; SWP:P40363; PDB:1PV1A; MKVVKEFSVCGGRLIKLSHNSNSTKTSMNVNIYLPKHYYAQDFPRNKRIPTVFYLSGLTC --------!!!!---------1111---------3333------------------2222 
TPDNASEKAFWQFQADKYGFAIVFPDTSPRGDEVANDPEGSWDFGQGAGFYLNATQEPYA 3333-----3333-----------------1111--1111--------%%%%---3333- QHYQMYDYIHKELPQTLDSHFNKLDFLDNVAITGHSMGGYGAICGYLKGYSGKRYKSCSA -------------------------------------------------1111------- FAPIVNPSNVPWGQKAFKGYLGEWEAYDPCLLIKNIRHVGDDRILIHVGDSDPFLEEHLK -----------------------111133331111---!!!!------1111-------3 PELLLEAVKATSWQDYVEIKKVHGFDHSYYFVSTFVPEHAEFHARNLGLI 333----2222-2222--------------3333----------1111-- >FOCAL ADHESION KINASE 1; SWP:Q00944; PDB:1PV3A; GSPGISGGGGGIRSNDKVYENVTGLVKAVIEMSSKIQPAPPEEYVPMVKEVGLALRTLLA ---------------------------------1111------3333------------- TVDESLPVLPASTHREIEMAQKLLNSDLAELINKMKLAQQYVMTSLQQEYKKQMLTAAHA ---------3333---------------------------3333---------------- LAVDAKNLLDVIDQARLKMISQSRPH -------------------------- >HYPOTHETICAL PROTEIN YWQG; SWP:P96719; PDB:1PV5A; NHLPEKRPYRDLLEKSAKEYVKLNVRKGKTGRYDSKIAGDPYFPKHETYPTDENGQPKLL ------1111---1111-------------1111---------1111----1111----- AQINFSHIPQLDGYPSSGILQFYISVHDDVYGLNFDDRCEQKNFRVIYFENIVENDDELV ---1111---2222---------------iiii---1111--------------1111-- SDFSFIGTGECDFPILSEAAVEPVKSSEWVLPTDFQFEQYTGETEFFGQFGEDEEDIYNE --3333------------------------1111---------------!!!!------- LAENGFGHKIGGYASFTQHDPREYAYKEHTILLQIDSDDDIDSWGDVGIANFFITPEDLR -------------------------1111------------------------------- KKDFSNVLYNWDCS -------------- >DELTA-AMINOLEVULINIC ACID; SWP:P13716; PDB:1PV8A; YLHPLLRAWQTATTTLNASNLIYPIFVTDVPDDIQPITSLPGVARYGVKRLEEMLRPLVE ----------1111--3333---------1111---3333------11113333------ EGLRCVLIFGVPEESPAIEAIHLLRKTFPNLLVACDVCLCAFRAEESRQRLAEVALAYAK ---------------------------1111---------------------------33 AGCQVVAPSDDGRVEAIKEALMAHGLGNRVSVMSYSAKFASCFYGPFRDAALPPGARGLA 33---------3333----------1111---------------3333----1111---- LRAVDRDVREGADMLMVKPGMPYLDIVREVKDKHPDLPLAVYHVSGEFAMLWHGAQAGAF -------------------1111--------------------3333------------- DLKAAVLEAMTAFRRAGADIIITYYTPQLLQWLKEE ----------------------1111-----1111- 
>XAA-PRO DIPEPTIDASE; SWP:P81535; PDB:1PV9A; LVKFMDENSIDRVFIAKPVNVYYFSGTSPLGGGYIIVDGDEATLYVPELEYEMAKEESKL -------------------------------------!!!!-----1111---------- PVVKFKKFDEIYEILKNTETLGIEGTLSYSMVENFKEKSVKEFKKIDDVIKDLRIIKTKE ------3333-------------1111-------------------------3333---- EIEIIEKACEIADKAVMAAIEEITEGKREREVAAKVEYLMKMNGAEKPAFDTIIASGHRS -----------------------22223333---------1111-----------!!!!- ALPHGVASDKRIERGDLVVIDLGALYNHYNSDITRTIVVGSPNEKQREIYEIVLEAQKRA ------------2222---------iiii------------------------------- VEAAKPGMTAKELDSIAREIIKEYGYGDYFIHSLGHGVGLEIHEWPRISQYDETVLKEGM ---------------------11111111-------------------1111----2222 VITIEPGIYIPKLGGVRIEDTVLITENGAKRLTKTER ------------------------------------- >PARVALBUMIN; SWP:PRVA_ESOLU; PDB:1PVAA; AAKDLLKADDIKKALDAVKAEGSFNHKKFFALVGLKAMSANDVKKVFKAIDADASGFIEE 3333----------3333-2222-----------1111------------1111----33 EELKFVLKSFAADGRDLTDAETKAFLKAADKDGDGKIGIDEFETLVHEA 33----11111111---------------1111---------------- >Genome polyprotein; SWP:P03302; PDB:1PVC1; QDSLPDTKASGPAHSKEVPALTAVETGATNPLAPSDTVQTRHVVQRRSRSESTIESFFAR -----------------3333-3333------1111------------1111-------- GACVAIIEVDNEQPTTRAQKLFAMWRITYKDTVQLRRKLEFFTYSRFDMEFTFVVTANFT ------------------------------------------------------------ NANNGHALNQVYQIMYIPPGAPTPKSWDDYTWQTSSNPSIFYTYGAAPARISVPYVGLAN ----------------------------3333----------2222-------------- AYSHFYDGFAKVPLKTDANDQIGDSLYSAMTVDDFGVLAVRVVNDHNPTKVTSKVRIYMK -------------1111-3333---2222---!!!!------------------------ PKHVRVWCPRPPRAVPYYGPGVDYRNNLDPLSEKGLTTY --------------------------------------- >Genome polyprotein; SWP:P03302; PDB:1PVC2; ACGYSDRVLQLTLGNSTITTQEAANSVVAYGRWPEFIRDDEANPVDQPTEPDVATCRFYT ------------!!!!-----------2222------1111---------!!!!------ LDTVMWGKESKGWWWKLPDALRDMGLFGQNMYYHYLGRSGYTVHVQCNASKFHQGALGVF ------1111----------1111-------------------------1111------- AIPEYCLAGDSDKQRYTSYANANPGERGGKFYSQFNKDNAVTSPKREFCPVDYLLGCGVL 
------------------3333--3333----------------------1111-----3 LGNAFVYPHQIINLRTNNSATIVLPYVNALAIDSMVKHNNWGIAILPLSPLDFAQDSSVE 333---------3333-----------------3333----------------------- IPITVTIAPMCSEFNGLRNVTAPKFQ -------------------------- >Genome polyprotein; SWP:P03302; PDB:1PVC3; GLPVLNTPGSNQYLTSDNHQSPCAIPEFDVTPPIDIPGEVKNMMELAEIDTMIPLNLEST ------2222---1111------------------------33331111----------- KRNTMDMYRVTLSDSADLSQPILCLSLSPAFDPRLSHTMLGEVLNYYTHWAGSLKFTFLF -----1111---11111111-------1111---1111---------------------- CGSMMATGKILVAYAPPGAQPPTSRKEAMLGTHVIWDLGLQSSCTMVVPWISNVTYRQTT --1111-----------------33331111----------------------------- QDSFTEGGYISMFYQTRIVVPLSTPKSMSMLGFVSACNDFSVRLLRDTTHISQSA -3333-------------------------------1111--------------- >Genome polyprotein; SWP:P03302; PDB:1PVC4; GAQVSSQKVGAHENSSTINYTTINYYKDSASNAASKQDYSQDPSKFTEPLKDVLIKTAPA ---------------------------3333-----------3333--------1111-- LN -- >PYRUVATE DECARBOXYLASE; SWP:P06169; PDB:1PVDA; SEITLGKYLFERLKQVNVNTVFGLPGDFNLSLLDKIYEVEGMRWAGNANELNAAYAADGY -------------1111--------111133333333-2222------------------ ARIKGMSCIITTFGVGELSALNGIAGSYAEHVGVLHVVGVPSISHHTLGNGDFTVFHRMS ---------------------------1111----------------------------1 ANISETTAMITDIATAPAEIDRCIRTTYVTQRPVYLGLPANLVDLNVPAKLLQTPIDMSL 111--------3333-----------------------3333------3333-------- KPNDAESEKEVIDTILALVKDAKNPVILADACCSRHDVKAETKKLIDLTQFPAFVTPMGK ------------------1111-----------1111------------------3333- GSISEQHPRYGGVYVGTLSKPEVKEAVESADLILSVGALLSDKTKNIVEFHSDHMKIRNA ---1111--------11113333---1111--------------------1111--!!!! 
TFPGVQMKFVLQKLLTNIADAAKGYKPVAVPARTPANAAVPASTPLKQEWMWNQLGNFLQ ---------------------1111---------------3333--3333----3333-2 EGDVVIAETGTSAFGINQTTFPNNTYGISQVLWGSIGFTTGATLGAAFAAEEIDPKKRVI 222------33333333-----------------1111---------------1111--- LFIGDGSLQLTVQEISTMIRWGLKPYLFVLNNDGYTIEKLIHGPKAQYNEIQGWDHLSLL ---3333-----------1111-------------3333------3333-----3333-- PTFGAKDYETHRVATTGEWDKLTQDKSFNDNSKIRMIEIMLPVFDAPQNLVKQAKLT 1111-----------------11113333------------1111------------ >UV EXCISION REPAIR PROTEI; SWP:P54727; PDB:1PVEA; GSHMPLEFLRNQPQFQQMRQIIQQNPSLLPALLQQIGRENPQLLQQISQHQEHFIQMLNE -----3333----3333-------3333---------------------3333------- PVQEAGGQGGGG ------------ >DNA TOPOISOMERASE II; SWP:P06786; PDB:1PVGA; SASDKYQKISQLEHILKRPDTYIGSVETQEQLQWIYDEETDCIEKNVTIVPGLFKIFDEI 3333--------------1111---------------1111------------------- LVNAADNKVRDPSKRIDVNIHAEEHTIEVKNDGKGIPIEIHNKENIYIPEIFGHLLTSSN -----3333------------1111---------------------3333---------- YDDDEKKVTGGRNGYGAKLCNIFSTEFILETADLNVGQKYVQKWENNSICHPPKITSYKK ---------------------------------1111----------------------- GPSYTKVTFKPDLTRFGKELDNDILGVRRRVYDINGSVRDINVYLNGKSLKIRNFKNYVE -----------3333-----------------------------iiii------------ LYLKSLIPTILYERINNRWEVAFAVSDISFQQISFVNSIATTGGTHVNYITDQIVKKISE --3333---------1111----------------iiii--------------------- ILKKVKSFQIKNNFIFINCLIENPAFTSQTKEQLTTRVKDFGSRCEIPLEYINKIKTDLA -----33331111--------------3333-----3333-------------------- TRFEIADA ----3333 >Leukemia inhibitory facto; SWP:P15018; PDB:1PVHB; CAIRHPCHNNLMNQIRSQLAQLNGSANALFILYYTAQGEPFPNNLDKLCGPNVTDFPPFH ---------------------------------1111-------1111-----------1 ANGTEKAKLVELYRIVVYLGTSLGNITRDQKILNPSALSLHSKLNATADILRGLLSNVLC 111------------------------------1111----------------------- RLCSKYHVGHVDVTYGPDTSGKDVFQKKKLGCQLLGKYKQIIAVLAQAF ----------------------3333----------------------- >LEUCOCIDIN; SWP:O50604; PDB:1PVL; AQHITPVSEKKVDDKITLYKTTATSDSDKLKISQILTFNFIKDKSYDKDTLILKAAGNIY 
------------1111-----------1111-----------1111-------------- SGYTKPNPKDTISSQFYWGSKYNISINSDSNDSVNVVDYAPKNQNEEFQVQQTVGYSYGG ------1111--------------------1111-------------------------- DINISNGLSGGGKSFSETINYKQESYRTSLDKRTNFKKIGWDVEAHKIMNNGWGPYGRDS ----------------------2222----1111---------------iiii---1111 YHSTYGNEMFLGSRQSNLNAGQNFLEYHKMPVLSRGNFNPEFIGVLSRKQNAAKKSKITV -------1111-1111--3333---1111-3333--------------1111-------- TYQREMDRYTNFWNQLHWIGNNYKDENRATHTSIYEVDWENHTVKLIDTQSKEKNPMS ---------------------------------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9HLD9; PDB:1PVMA; MFMRVEKIMNSNFKTVNWNTTVFDAVKIMNENHLYGLVVKDDNGNDVGLLSERSIIKRFI ---3333---------1111--------------------1111--------------33 PRNKKPDEVPIRLVMRKPIPKVKSDYDVKDVAAYLSENGLERCAVVDDPGRVVGIVTLTD 33--1111-3333---------11113333-----1111-------3333------3333 LSRYLSRASITDILLSHRTKDYQHLCPKCGVGVLEPVYNEKGEIKVFRCSNPACDYEE 1111----------------------------------1111--------1111---- >ORNITHINE CARBAMOYLTRANSF; SWP:Q51742; PDB:1PVVA; VVSLAGRDLLCLQDYTAEEIWTILETAKMFKIWQKIGKPHRLLEGKTLAMIFQKPSTRTR ---2222---3333----------------------------2222-------------- VSFEVAMAHLGGHALYLNAQDLQLRRGETIADTARVLSRYVDAIMARVYDHKDVEDLAKY -----------------33333333-----------1111---------3333------- ATVPVINGLSDFSHPCQALADYMTIWEKKGTIKGVKVVYVGDGNNVAHSLMIAGTKLGAD ---------3333------------------2222------------------------- VVVATPEGYEPDEKVIKWAEQNAAESGGSFELLHDPVKAVKDADVIYTDVWASMGQEAEA -----1111-------------------------3333-2222---------2222---- EERRKIFRPFQVNKDLVKHAKPDYMFMHCLPAHRGEEVTDDVIDSPNSVVWDQAENRLHA ------3333--333333331111--------2222--3333--1111------------ QKAVLALVMGGIK ------------- >XYLANASE; SWP:P81536; PDB:1PVXA; GTTPNSEGWHDGYYYSWWSDGGGDSTYTNNSGGTYEITWGNGGNLVGGKGWNPGLNARAI ---------iiii-----------------!!!!-------------------------- HFTGVYQPNGTSYLSVYGWTRNPLVEYYIVENFGSSNPSSGSTDLGTVSCDGSTYTLGQS ------------------------------------1111---------iiii------- TRYNAPSIDGTQTFNQYWSVRQDKRSSGTVQTGCHFDAWASAGLNVTGDHYYQIVATEGY 
------------------------------3333-----1111----------------- FSSGYARITVADVG -------------- >K+ TOXIN-LIKE PEPTIDE; SWP:NA; PDB:1PVZA; TPFAIKCATDADCSRKCPGNPPCRNGFCACT -----------------------%%%%---- >FIBROBLAST GROWTH FACTOR-; SWP:O95750; PDB:1PWAA; PIRLRHLYTSGPHGLSSCFLRIRADGVVDCARGQSAHSLLEIKAVALRTVAIKGVHSVRY ----------1111--------1111--------1111-------!!!!-----3333-- LCMGADGKMQGLLQYSEEDCAFEEEIRPDGYNVYRSEKHRLPVSLSLPLSHFLPMLPMVP ---2222--------3333-------1111------1111-------------------- EEP --- >PULMONARY SURFACTANT-ASSO; SWP:P35247; PDB:1PWBA; ASLRQQVEALQGQVQHLQAAFSQYKKVELFPNGQSVGEKIFKTAGFVKPFTEAQLLCTQA 3333-------------------------------!!!!------------------111 GGQLASPRSAAENAALQQLVVAKNEAAFLSMTDSKTEGKFTYPTGESLVYSNWAPGEPND 1-------------------------------3333-----1111--------2222--2 DGGSEDCVEIFTNGKWNDRACGEKRLVVCEF 222-------1111-----1111-------- >PHOSPHOLIPASE A2; SWP:NA; PDB:1PWOA; NLYQFRKMIKCTIPGREPLLAFTDYGCYCGKGGSGTPVDELDRCCQTHDNCYDKAEKLPE ------------11113333-------------------------------------111 CKGILSGPYVNTYSYDCTDGKLTCNDQKDKCKLFICNCDRTAAMCFAKAPYIEANNHIDP 1-22223333-------iiii------------------------------3333----- NRCK ---- >2'-5'-OLIGOADENYLATE SYNT; SWP:Q29599; PDB:1PX5A; MELRHTPARDLDKFIEDHLLPNTFRTQVKEAIDIVRFLKERCFQGPVRVSKVVKGGSSRS -3333-3333-------------------------------------------------- DADLVVFLTKLTSFEDQLRRRGEFIQEIRRQLEACQREQKFKVTFEVQSPALSFVLSSPQ ------------3333-1111------------------------------------111 LQQEVEFDVLPAFDALGQWTPGYKPNPEIYVQLIKECKSRGKEGEFSTCFTELQRDFLRN 1------------3333--2222------------------22223333----------- RPTKLKSLIRLVKHWYQTCKKTHGNKLPPQYALELLTVYAWEQGSRKTDFSTAQGFQTVL -------------------1111-----3333---------------------------- ELVLKHQKLCIFWEAYYDFTNPVVGRCMLQQLKKPRPVILDPADPTGNVGGGDTHSWQRL ----3333----------------------1111------1111---2222-3333---- AQEARVWLGYPCCKNLDGSLVGAWTML ----------11111111--------- ------------------------------------------------ >HYPOTHETICAL PROTEIN YGJH; SWP:P42589; PDB:1PXFA; 
METVAYADFARLEMRVGKIVEVKRHENADKLYIVQVDVGQKTLQTVTSLVPYYSEEELMG -----3333----------------------------!!!!------------3333222 KTVVVLCNLQKAKMRGETSECMLLCAETDDGSESVLLTPERMMPAGVRVVA 2------------iiii----------1111-------------------- >SUBTILOSIN A; SWP:O07623; PDB:1PXQA; NKGCATCSIGAACLVDGPIPDFEIAGATGLFGLWG -------------1111------------1111-- >FIMBRIN-LIKE PROTEIN; SWP:Q7G188; PDB:1PXYA; SEKGPFVQHINRYLGDDPFLKQFLPLDPHSNQLYELVKDGVLLCKLINVAVPGTIDERAI -------------1111--3333---111133331111------------2222-3333- NTKRVLNPWERNENHTLCLNSAKAVGCSVVNIGTQDLAEGRPHLVLGLISQLIKIQLLAD ----------------------1111--111133333333-------------------- LNLKKLRLPPEKVLLKWMNFHLKKGGYKKTVSNFSADLKDAQAYAFLLNVLAPEHCDPAT ----------------------1111--------1111-------------1111--333 LDAKDPLERAELVLSHAERMNCKRYLTAEEIVEGSSTLNLAFVAQIFHERNGLNDVETCR 3-------------------------33331111--------------------3333-- DERCYRLWINSLGIDSYVNNVFEDVRNGWILLEVLDKVSPSSVNWKHASKPPIKMPFRKV --------1111-------3333-1111----------2222-3333-------3333-- ENCNQVIKIGKQLKFSLVNVAGNDIVQGNKKLILGLLWQLMRFHMLQLLKSLRSEMTDAD --------------------3333-----3333----------------1111---3333 ILSWANRKVRTMGRKLQIESFKDKSLSSGLFFLNLLWAVEPRVVNWNLVTKGETDDEKRL --------1111----------3333-------------3333-3333------------ NATYIVSVARKLGCSVFLLPEDIVEVNQKMILILTASIMYWSLQR ---------3333-----33331111------------------- >MAJOR POLLEN ALLERGEN JUN; SWP:P81294; PDB:1PXZA; DNPIDSCWRGDSNWDQNRMKLADCAVGFGSSTMGGKGGDFYTVTSTDDNPVNPTPGTLRY ---3333-----333311111111--1111--!!!!------------1111-2222--- GATREKALWIIFSQNMNIKLKMPLYVAGHKTIDGRGADVHLGNGGPCLFMRKVSHVILHS 1111-----------------------------2222----iiii--------------- LHIHGCNTSVLGDVLVSESIGVEPVHAQDGDAITMRNVTNAWIDHNSLSDCSDGLIDVTL ----------------3333---------------------------------------- GSTGITISNNHFFNHHKVMLLGHDDTYDDDKSMKVTVAFNQFGPNAGQRMPRARYGLVHV -----------------------3333--------------------------------- ANNNYDPWNIYAIGGSSNPTILSEGNSFTAPSESYKKEVTKRIGCESPSACANWVWRSTR --------------------------------1111----------33331111------ 
DAFINGAYFVSSGKTEETNIYNSNEAFKVENGNAAPQLTKNAGVVT ---------------------1111-----3333-3333------- >MYELIN-OLIGODENDROCYTE GL; SWP:Q61885; PDB:1PY9A; QFRVIGPGYPIRALVGDEAELPCRISPGKNATGMEVGWYRSPFSRVVHLYRNGKDQDAEQ -------------2222-------------1111----------------iiii-3333- APEYRGRTELLKETISEGKVTLRIQNVRFSDEGGYTCFFRDHSYQEEAAMELKVED 3333---------3333----------3333---------!!!!------------ >PYRUVOYL-DEPENDENT HISTID; SWP:P00862; PDB:1PYAA; SELDAKLNKLGVDRIAISPYKQWTRGYMEPGNIGNGYVTGLKVDAGVRDKSDDDVLDGIV 3333---1111--------------2222------------------------------- SYDRAETKNAYIGQINMTTAS --------------------- >Histidine decarboxylase p; SWP:P00862; PDB:1PYAB; FTGVQGRVIGYDILRSPEVDKAKPLFTETQWDGSELPIYDAKPLQDALVEYFGTEQDRRH --1111-2222-----3333---------1111--------------------3333--- YPAPGSFIVCANKGVTAERPKNDADMKPGQGYGVWSAIAISFAKDPTKDSSMFVEDAGVW --2222--------------------1111--------------1111------------ ETPNEDELLEYLEGRRKAMAKSIAECGQDAHASFESSWIGFAYTMMEPGQIGNAITVAPY ---------------------------1111---------------2222---------- VSLPIDSIPGGSILTPDKDMEIMENLTMPEWLEKMGYKSLSANNALKY ---111122221111--------------------------------- >METHIONYL-TRNA SYNTHETASE; SWP:O66738; PDB:1PYBA; ALIGIEDFLKVDLRVAKVLSAERVEGSEKLLKLTLSLGDEERTVVAGIAKYYTPEELVGK ----------------------------------------------------33332222 KIVIVANLKPRKIFGIESQGMILAASDGENLSVIVPDRDVKEGAKLS -----------1111-------------------------------- >IOLS PROTEIN; SWP:P46336; PDB:1PYFA; KKAKLGKSDLQVFPIGLGTNAVGGHNLYPNLNEETGKELVREAIRNGVTLDTAYIYGIGR ----!!!!----------1111-1111----------------1111-----3333iiii SEELIGEVLREFNREDVVIATKAAHRKQGNDFVFDNSPDFLKKSVDESLKRLNTDYIDLF --------11111111-----------!!!!----------------------------- YIHFPDEHTPKDEAVNALNEKKAGKIRSIGVSNFSLEQLKEANKDGLVDVLQGEYNLLNR --------------------1111----------------1111-----------11113 EAEKTFFPYTKEHNISFIPYFPLVSGLLAGKYTEDTTFPEGDLRNEQEHFKGERFKENIR 333-----------------1111-1111---1111--222233333333---------- KVNKLAPIAEKHNVDIPHIVLAWYLARPEIDILIPGAKRADQLIDNIKTADVTLSQEDIS 
---------1111-3333--------1111--------3333----3333---------- FIDKLFAPG ---1111-- >PROTEIN (PYRIMIDINE PATHW; SWP:P07272; PDB:1PYIA; SRTACKRCRLKKIKCDQEFPSCKRCAKLEVPCVSLDPATGKDVPRSYVFFLEDRLAVMMR ------3333-------------------------------------------------- VLKEYGVDPTKIRGNIPATSDDEPFDLK ---------------------------- >RIBONUCLEASE; SWP:Q53752; PDB:1PYLA; DPALADVCRTKLPSQAQDTLALIAKNGPYPYNRDGVVFENRESRLPKKGNGYYHEFTVVT -------1111-3333------1111----1111-----1111-----2222-------1 PGSNDRGTRRVVTGGYGEQYWSPDHYATFQEIDPRC 111----------1111------iiii--------- >CASPASE-2; SWP:P42575; PDB:1PYOA; LQVKPCTPEFYQTHFQLAYRLQSRPRGLALVLSNVHFTGEKELEFRSGGDVDHSTLVTLF --------------1111---------------------------2222----------- KLLGYDVHVLCDQTAQEMQEKLQNFAQLPAHRVTDSCIVALLSHGVEGAIYGVDGKLLQL ---------------------------3333--------------2222--1111---33 QEVFQLFDNANCPSLQNKPKMFFIQACRGDETDRGVDQQ 33---------3333------------------------ >Caspase-2 [Precursor]; SWP:P42575; PDB:1PYOB; PKMRLPTRSDMICGYACLKGTAAMRNTKRGSWYIEALAQVFSERACDMHVADMLVKVNAL -----------------2222----------------------3333------------- IKDREGYAPGTEFHRCKEMSEYCSTLCRHLYLFPGHPP 1111---2222-2222----------------2222-- >PROCARBOXYPEPTIDASE A; SWP:P00730; PDB:1PYTA; KEDFVGHQVLRITAADEAEVQTVKELEDLEHLQ ---2222--------3333----1111------ >PROCARBOXYPEPTIDASE A; SWP:NA; PDB:1PYTD; CGAPIFQPNLSARVVGGEDAIPHSWPWQISLQYLRDN --------------------2222------------- >ATP SYNTHASE BETA CHAIN, ; SWP:P17614; PDB:1PYVA; ASRRLLASLLRQSAQRGGGLISRSLGNSIPKSASRASSRASPKGFLLNRAVQY --3333-------1111------1111-1111--------1111----3333- >GENERAL STRESS PROTEIN 69; SWP:P80874; PDB:1PZ1A; EYTSIADTGIEASRIGLGTWAIGGTWGGTDEKTSIETIRAALDQGITLIDTAPAYGFGQS -------------------3333------------------1111------3333iiii- EEIVGKAIKEYKRDQVILATKTALDWKNNQLFRHANRARIVEEVENSLKRLQTDYIDLYQ -----------1111-----------%%%%-----3333--------------------- VHWPDPLVPIEETAEVKELYDAGKIRAIGVSNFSIEQDTFRAVAPLHTIQPPYNLFEREE ----1111-----------1111----------3333-3333-----------1111--- 
ESVLPYAKDNKITTLLYGSLCRGLLTGKTEEYTFEGDDLRNHDPKFQKPRFKEYLSAVNQ -------1111------1111-1111--1111--!!!!----3333-------------- LDKLAKTRYGKSVIHLAVRWILDQPGADIALWGARKPGQLEALSEITGWTLNSEDQKDIN -----------3333--------2222--------1111---1111-------------- TILENTISDPVGPEFAPPTREEIPG ------------------1111--- >STEROL CARRIER PROTEIN 2; SWP:NA; PDB:1PZ4A; RMSLKSDEVFAKIAKRLESIDPANRQVEHVYKFRITQGGKVVKNWVMDLKNVKLVESDDA ---3333---------11111111------------------------------------ AEATLTMEDDIMFAIGTGALPAKEAMAQDKMEVDGQVELIFLLEPFIASLK --------------1111-------1111-------------11111111- >LIGHT CHAIN OF FAB (SYA/J; SWP:A0A5D7; PDB:1PZ5A; DVVLTQTPLSLPVRLGDQASISCRSSQSLLHSDGNTYLHWYLQKPGQSPKLLIYKVSNRF -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDLGVYFCSQTTHVPTFGGGTKLEIKRADAAPTVS 2223333----!!!!--------3333--------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----33333333---------------------iiii----------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR ---------1111--------3333---------- >LIGHT CHAIN OF FAB (SYA/J; SWP:NA; PDB:1PZ5B; EVKVEESGGGLVQPGGSMKLSCVASGFTFSNYWMEWVRQSPEKGLEWVAEIRLKSNNYAT ------------2222-----------3333---------------------3333---- HYAESVKGRFTISRDDSKSSVYLQMNNLRAEDTGIYYCTRGGAVGAMDYWGQGTSVTVSS --3333---------1111---------3333---------1111--------------- ATTTAPSVYPLVPGCSDTSGSSVTLGCLVKGYFPEPVTVKWNYGALSSGVRTVSSVLQSG ----------------------------------------%%%%--2222-------iii FYSLSSLVTVPSSTWPSQTVICNVAHPASKVDLIKEISGP i---------1111-----------3333----------- >AGRIN; SWP:P31696; PDB:1PZ7A; KAAGDAEAIAFDGRTYMEYHNAVTKSAEPSEKALQSNHFELSIKTEATQGLILWSGKGLE ----------------------------------------------------------11 RSDYIALAIVDGFVQMMYDLGSKPVVLRSTVPINTNHWTHIKAYRVQREGSLQVGNEAPI 11-------iiii--------------------------------!!!!----!!!!--- TGSSPLGATQLDTDGALWLGGMERLSVAHKLPKAYSTGFIGCIRDVIVDRQELHLVEDAL -----------------------333311113333------------iiii--3333--- NNPTILHC -------- >LACTATE DEHYDROGENASE; 
SWP:P90613; PDB:1PZEA; PALVQRRKKVAMIGSGMIGGTMGYLCALRELADVVLYDVVKGMPEGKALDLSHVTSVVDT ------------------------------------------------------3333-- NVSVRAEYSYEAALTGADCVIVTAGLTKVPSEWSRNDLLPFNSKIIREIGQNIKKYCPKT -------------2222--------------3333------------------------- FIIVVTNPLDCMVKVMCEASGVPTNMICGMACMLDSGRFRRYVADALSVSPRDVQATVIG ----------------------1111-----------------------3333------- THGDCMVPLVRYITVNYP --1111--3333------ >LACTATE DEHYDROGENASE; SWP:P90613; PDB:1PZGA; PALVQRRKKVAMIGSGMIGGTMGYLCALRELADVVLYDVVKGMPEGKALDLSHVTSVVDT ------------------------------------------------------------ NVSVRAEYSYEAALTGADCVIVTAGLTKVPGKPDSEWSRNDLLPFNSKIIREIGQNIKKY --------3333-2222-----------22223333-3333------------------- CPKTFIIVVTNPLDCMVKVMEASGVPTNMICGMACMLDSGRFRRYVADALSVSPRDVQAT 1111----------------3333-1111-----------------------3333---- VIGTHGDCMVPLVRYITVNGYPIQKFIKDGVVTEKQLEEIAEHTKVSGGEIVRFLGQGSA -----1111--3333--iiii3333-1111------------------------------ YYAPAASAVAMATSFLNDEKRVIPCSVYCNGEYGLKDMFIGLPAVIGGAGIERVIELELN --------------1111------------2222------------1111---------- EEEKKQFQKSVDDVMALNKAVAALQAP --------------------------- >HEPATOCYTE NUCLEAR FACTOR; SWP:Q9NQH0; PDB:1PZLA; LASLPSINALLQAEVLSRQITNGDIRAKKIASIADVCESMKEQLLVLVEWAKYIPAFCEL ----------------3333---1111---------------------------3333-- PLDDQVALLRAHAGEHLLLGATKRSMVFKDVLLLGNDYIVPRHCPELAEMSRVSIRILDE -----------3333---------1111-----1111------33333333--------- LVLPFQELQIDDNEYAYLKAIIFFDPDAKGLSDPGKIKRLRSQVQVSLEDYINDRQYDSR ------------------------1111----3333----------------------22 GRFGELLLLLPTLQSITWQMIEQIQFIKLFGMAKIDNLLQEMLLGGS 22--------------------------------------------- >HYPOXANTHINE-GUANINE PHOS; SWP:NA; PDB:1PZMA; YPMSARTLVTQEQVWAATAKCAKKIAADYKDFHLTADNPLYLLCVLKGSFIFTADLARFL 1111------------------------1111--1111--------1111---------- ADEGVPVKVEFICASMLLDVRDSVENRHIMLVEDIVDSAITLQYLMRFMLAKKPASLKTV 1111-------------------2222--------------------------------- VLLDKPSGRKVDVLVDYPVITIPRAFVIGYGMDFAESYRELRDICVLKKE 
----3333--------------------iiii-iiii1111--------- ------------------------------------------------------------ >SUPEROXIDE DISMUTASE [CU-; SWP:P0A608; PDB:1PZSA; QSLTSTLTAPDGTKVATAKFEFANGYATVTIATTGVGKLTPGFHGLHIHQVGKCEPNSVA --------1111----------iiii---------------------------------1 PTGGAPGNFLSAGGHYHVPGHTGTPASGDLASLQVRGDGSAMLVTTTDAFTMDDLLSGAK 111-----1111-----2222----1111------1111-----------------!!!! TAIIIHAGADNFANIPPERYVQVNGTPGPDETTLTTGDAGKRVACGVIGSG ----------%%%%-3333--1111-------------------------- >PROBABLE UBIQUITIN-CONJUG; SWP:P34477; PDB:1PZVA; SSLLLKKQLADMRRVPVDGFSAGLVDDNDIYKWEVLVIGPPDTLYEGGFFKAILDFPRDY 3333---------------------1111------------------------------- PQKPPKMKFISEIWHPNIDKEGNVCISILHDPPEERWLPVHTVETILLSVISMLTDPNFE --------------11111111---3333---3333-1111------------------- SPANVDAAKMQRENYAEFKKKVAQCVRRSQEE ----3333-------------------1111- >TRANSCRIPTION FACTOR GRAU; SWP:Q9U405; PDB:1PZWA; DICRLCLRGVSGAQMCLQIFDVDSGESKVAEVLRQHFWFEVLPNDEISKVICNVCWTQVS ------------------------------------------------------------ EFHQFYVSIQEAQVIYATTS ---------------1111- >HYPOTHETICAL PROTEIN APC3; SWP:P83812; PDB:1PZXA; MPIEIITDSGADLPQSYIREHRIAFLPLVVHWNGQDYKDGITIEPKQVYDAMRQGHTVKT --------1111-------------------iiii--2222--3333----1111----- AQPSPLAMKELFLPYAKENRPCLYIAFSSKLSGTYQTAMAVRSELLDEYPEFRLTIIDSK ---------------------------1111----------------------------- CASLGQGLAVMKAVELAKQNTPYNLLCETIESYCRHMEHIFTVDNLDYLARGGRISNIKP -!!!!-----------1111-------------1111----------------------- LLHVEDGALIPLEKWRGRKKVLKRMVELMGERGDDLQKQTIGISHADDEETALELKQMIE ----iiii--------------------------3333---------3333--------- ETHGCTRFFLSDIGSAIGAHAGPGTIALFFLNKYIEI -------------3333-------------------- -------------------------------------- >SEQUESTOSOME 1; SWP:NA; PDB:1Q02A; GSPPEADPRLIESLSQMLSMGFSDEGGWLTRLLQTKNYDIGAALDTIQYSKH ------1111------1111------------3333---------1111--- >TRANSCRIPTIONAL REGULATOR; SWP:P77565; PDB:1Q06A; MNISDVAKITGLTSKAIRFYEEKGLVTPPMRSENGYRTYTQQHLNELTLLRQARQVGFNL 
-------------------------------1111------------------------- EESGELVNLFNDPQRHSADVKRRTLEKVAEIERHIEELQSMRDQLLALANACPGCPIIEN ---------------3333-----------------------------------3333-- LS -- >ZN(II)-RESPONSIVE REGULAT; SWP:P36676; PDB:1Q08A; SDLQRLKFIRHARQLGFSLESIRELLSIRIDPEHHTCQESKGIVQERLQEVEARIAELQS ------------1111--------------3333-------------------------- MQRSLQRLNDACCGTAHSSVYCSILEALEQGASG --------1111-----3333------------- >SUPEROXIDE DISMUTASE [CU-; SWP:P00442; PDB:1Q0EA; ATKAVCVLKGDGPVQGTIHFEAKGDTVVVTGSITGLTEGDHGFHVHQFGDNTQGCTSAGP ----------------------!!!!--------------------------!!!!---- HFNPLSKKHGGPKDEERHVGDLGNVTADKNGVAIVDIVDPLISLSGEYSIIGRTMVVHEK --1111----1111---1111------1111--------------11112222------- PDDLGRGGNEESTKTGNAGSRLACGVIGIAK --iiii--3333------------------- >SUPEROXIDE DISMUTASE [NI]; SWP:P80734; PDB:1Q0GA; HCDLPCGVYDPAQARIEAESVKAIQEKMAANDDLHFQIRATVIKEQRAELAKHHLDVLWS ------------------------------------------------------------ DYFKPPHFESYPELHTLVNEAVKALSAAKASTDPATGQKALDYIAQIDKIFWETKKA ---3333---1111-------------1111---------------------1111- >COMPLEMENT FACTOR B; SWP:P00751; PDB:1Q0PA; SMNIYLVLDGSDSIGASNFTGAKKSLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEA ----------3333-------------------1111------------------3333- DSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWNRTRHVIILM 1111-------11111111-----------------1111------2222---------- TDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKD ---------3333-------------1111-3333-----------------3333---- NEQHVFKVKDLS ------3333-- >1-DEOXY-D-XYLULOSE 5-PHOS; SWP:P45568; PDB:1Q0QA; MKQLTILGSTGSIGCSTLDVVRHNPEHFRVVALVAGKNVTRMVEQCLEFSPRYAVMDDEA -------1111------------1111--------------------------------- SAKLLKTMLQQQGSRTEVLSGQQAACDMAALEDVDQVMAAIVGAAGLLPTLAAIRAGKTI ---------1111-------------33331111--------3333-------1111--- LLANKESLVTCGRLFMDAVKQSKAQLLPVDSEHNAIFQSLPQPIQHNLGYADLEQNGVVS ------------------------------------11113333--2222-3333----- ILLTGSGGPFRETPLRDLATMTPDQACRHPNWSMGRKISVDSATMMNKGLEYIEARWLFN 
-------1111--111111113333-------------------------------1111 ASASQMEVLIHPQSVIHSMVRYQDGSVLAQLGEPDMRTPIAHTMAWPNRVNSGVKPLDFC -1111-----1111-------1111--------------------------------333 KLSALTFAAPDYDRYPCLKLAMEAFEQGQAATTALNAANEITVAAFLAQQIRFTDIAALN 3---------3333-------------------------------1111--3333----- LSVLEKMDMREPQCVDDVLSVDANAREVARKEVMRLAS --------------------------------3333-- >ACLACINOMYCIN METHYLESTER; SWP:Q54528; PDB:1Q0RA; SERIVPSGDVELWSDDFGDPADPALLLVMGGNLSALGWPDEFARRLADGGLHVIRYDHRD ------!!!!--------1111-------22221111---------1111-------222 TGRSTTRDFAAHPYGFGELAADAVAVLDGWGVDRAHVVGLSMGATITQVIALDHHDRLSS 2------3333----------------1111----------------------3333--- LTMLLGGGLDIDFDANIERVMRGEPTLDGLPGPQQPFLDALALMNQPAEGRAAEVAKRVS -------1111--------1111--1111-------------3333-------------- KWRILSGTGVPFDDAEYARWEERAIDHAGGVLAEPYAHYSLTLPPPSRAAELREVTVPTL -------------------------1111------3333------------1111----- VIQAEHDPIAPAPHGKHLAGLIPTARLAEIPGMGHALPSSVHGPLAEVILAHTRSAA ---------------------1111------------3333------------1111 >BSTDEAD; SWP:P83699; PDB:1Q0UA; TQFTRFPFQPFIIEAIKTLRFYKPTEIQERIIPGALRGESVGQSQTGTGKTHAYLLPIEK ----------------------------------1111---------------------- IKPERAEVQAVITAPTRELATQIYHETLKITKFCPKDRIVARCLIGGTDKQKALEKLNVQ -3333-------------------------11111111---------33331111----- PHIVIGTPGRINDFIREQALDVHTAHILVVDEADLLDGFITDVDQIAARPKDLQLVFSAT ---------------------1111----------------------------------- IPEKLKPFLKKYENPTFVHVL -1111---------------- ------------------------------------------------------------ --------------------- >FAB 9B1, LIGHT CHAIN; SWP:NA; PDB:1Q0XH; EVQLQQSGAELMKPGASVKISCKATGYTFSSYWIEWVKQRPGHGLEWIGEILPGSGDTIF ------------2222-----------3333--------2222----------------- NEKFKGKATFTADTSSNTAYMQLSSL 3333---------1111--------- >Maltose/maltodextrin tran; SWP:P02914; PDB:1Q12A; VQLQNVTKAWGEVVVSKDINLDIHEGEFVVFVGPSGCGKSTLLRMIAGLETITSGDLFIG ---------!!!!----------2222-----------3333------------------ 
EKRMNDTPPAERGVGMVFQSYALYPHLSVAENMSFGLKLAGAKKEVINQRVNQVAEVLQL ---11113333------3333--11113333--33331111-3333---------11111 AHLLDRKPKALSGGQRQRVAIGRTLVAEPSVFLLDEPLSNLDAALRVQMRIEISRLHKRL 111---3333-------------------------1111--------------------- GRTMIYVTHDQVEAMTLADKIVVLDAGRVAQVGKPLELYHYPADRFVAGFIGSPKMNFLP ---------3333-----------iiii-----3333----------------------- VKVTATAIDQVQVELPMPNRQQVWLPVESRDVQVGANMSLGIRPEHLLPSDIADVILEGE ----------------1111------------2222------1111--3333-------- VQVVEQLGNETQIHIQIPSIRQNLVYRQNDVVLVEEGATFAIGLPPERCHLFREDGTACR --------------------------------------------3333----1111---- RLHKEPG ------- >PROSTAGLANDIN-E2 9-REDUCT; SWP:P80508; PDB:1Q13A; DPKFQRVALSDGHFIPVLGFGTYAPEEVPKSKAMEATKIAIDAGFRHIDSAYFYKNEKEV 1111----1111------------33333333--------3333------3333------ GLAIRSKIADGTVKREDIFYTSKLWCTFHRPELVRPSLEDSLKNLQLDYVDLYIIHFPTA -------1111--1111-------1111-3333--------------------------- LKPGVEIIPTDEHGKAIFDTVDICATWEAMEKCKDAGLAKSIGVSNFNRRQLEMILNKPG ---------------------------------1111--------------------222 LKYKPVCNQVECHPYLNQGKLLEFCKSKGIVLVAYSALGSHREPEWVDQSAPVLLEDPLI 2-----------1111---------1111------1111-----3333----3333---- GALAKKHQQTPALIALRYQLQRGIVVLAKSFTEKRIKENIQVFEFQLPSEDMKVIDSLNR -------------------1111---------------1111-------------3333- NFRYVTADFAIGHPNYPFSDEY ------3333--1111------ >HST2 PROTEIN; SWP:P53686; PDB:1Q14A; MSVSTASTEMSVRKIAAHMKSNPNAKVIFMVGAGISTSCGLARLKLPYPEAVFDVDFFQS -----------------------------------1111--------1111----3333- DPLPFYTLAKELYPGNFRPSKFHYLLKLFQDKDVLKRVYTQNIDTLERQAGVKDDLIIEA -------3333--------3333------------------------1111-3333--11 HGSFAHCHCIGCGKVYPPQVFKSKLAEHPIKDFVKCDVCGELVKPAIVFFGEDLPDSFSE 11--------------3333----------------------------2222--3333-- TWLNDSEWLREKQQPLVIVVGTSLAVYPFASLPEEIPRKVKRVLCNLETVGDFKANKRPT ---------------------------33333333-1111---------!!!!----111 DLIVHQYSDEFAEQLVEELGWQEDFEKILTAQKEQLLEIVHDLEN 1-------------------------------------------- >CARA; SWP:Q9XB61; PDB:1Q15A; 
SNSFCVVYKGSDTDINNIQRDFDGKGEALSNGYLFIEQNGHYQKCEMERGTAYLIGSLYN ----------3333---3333---------------1111------1111---------- RTFLIGLAGVWEGEAYLANDAELLALLFTRLGANALALAEGDFCFFIDEPNGELTVITES ------------3333---------------33333333---------1111-------- RGFSPVHVVQGKKAWMTNSLKLVTAAEGEGALWFEEEALVCQSLMRADTYTPVKNAQRLK ---------------------------1111----3333-------------1111---- PGAVHVLTHDSEGYSFVESRTLTTPASNQLLALPREPLLALIDRYLNAPLEDLAPRFDTV ---------1111---------------------------------------3333---- GIPLSGGLDSSLVTALASRHFKKLNTYSIGTELSNEFEFSQQVADALGTHHQMKILSETE ----------------1111----------%%%%-------------------------- VINGIIESIYYNEIFDGLSAEIQSGLFNVYRQAQGQVSCMLTGYGSDLLFGGILKPGAQY ---------1111-------------------2222------2222----11112222-- DNPNQLLAEQVYRTRWTGEFATHGASCYGIDIRHPFWSHSLISLCHALHPDYKIFDNEVK ------------3333-111133331111----3333-----------3333--%%%%-- NILREYADSLQLLPKDIVWRSVNQAFANVLGSTVDNYQTKSRFTYRVYQAFLRGRLSITD -------1111--3333---3333--------1111--------------1111--1111 VTPSQLKDLIK ----------- >HST2 PROTEIN; SWP:P53686; PDB:1Q1AA; TASTEMSVRKIAAHMKSNPNAKVIFMVGAGISTSCGIPDFRSPGTGLYHNLARLKLPYPE ---------------------------33333333----------3333-3333---333 AVFDVDFFQSDPLPFYTLAKELYPGNFRPSKFHYLLKLFQDKDVLKRVYTQNIDTLERQA 3----------------3333--------3333------------------------111 GVKDDLIIEAHGSFAHCHCIGCGKVYPPQVFKSKLAEHPIKDFVKCDVCGELVKPAIVFF 1-3333--1111--------------3333---1111---------------------22 GEDLPDSFSETWLNDSEWLREKITTPQQPLVIVVGTSLAVYPFASLPEEIPRKVKRVLCN 22--3333---------------------------------33333333-1111------ LETVGDFKANKRPTDLIVHQYSDEFAEQLVEELGWQEDFEKILTA ---!!!!----1111------------------------------ >FK506-BINDING PROTEIN 4; SWP:Q02790; PDB:1Q1CA; EGVDISPKQDEGVLKVIKREGTGTEMPMIGDRVFVHYTGWLLDGTKFDSSLDRKDKFSFD -----1111------------------2222---------1111-----1111------- LGKGEVIKAWDIAIATMKVGEVCHITCKPEYAYGSAGSPPKIPPNATLVFEVELFEFKGE ------3333---11112222------3333----------------------------- DLTEEEDGGIIRRIQTRGEGYAKPNEGAIVEVALEGYYKDKLFDQRELRFEIGEGENLDL 
--1111------------------2222---------%%%%---------22221111-- PYGLERAIQRMEKGEHSIVYLKPSYAFGSVGKEKFQIPPNAELKYELHLKSFEKAKE 3333---11112222------3333-!!!!-3333---------------------- >NEUROGLOBIN; SWP:Q9ER97; PDB:1Q1FA; RPESELIRQSWRVVSRSPLEHGTVLFARLFALEPSLLPLFQYNGRQFSSPEDSLSSPEFL --------------------------------11113333-%%%%---33331111---- DHIRKVMLVIDAAVTNVEDLSSLEEYLTSLGRKHRAVGVRLSSFSTVGESLLYMLEKSLG -------------1111---1111---------------3333---------------!! PDFTPATRTAWSRLYGAVVQAMSRGWDG !!--------------------3333-- >TRANSCRIPTION FACTOR E; SWP:Q980M5; PDB:1Q1HA; MVNAEDLFINLAKSLLGDDVIDVLRILLDKGTEMTDEEIANQLNIKVNDVRKKLNLLEEQ -----------------3333--------------------------------------- GFVSYRKTRSGWFIYYWKPNIDQIN ------------------------- >Envelope glycoprotein [Fr; SWP:O36230; PDB:1Q1JH; EVQLVESGGGLVKPGGSLRLTCVASGFTFSDVWLNWVRQAPGKGLEWVGRIKSRT ------------2222-----------3333------------------------ >Lambda-chain [Precursor]; SWP:A2NUT2; PDB:1Q1JL; QSVLTQPPSVSAAPGQKVTISCSGSSSN ------------2222------------ >CHORISMATE SYNTHASE; SWP:O66493; PDB:1Q1LA; SLRYLRFLTAGESHGKGLTAILEGIPANLPLSEEEINHELRRRQRGYGIEKDTAEILSGV ------------------------------------------------------------ RFGKTLGSPIALFIRNRDWGGIKYNQRDLRNILERASARETAARVAVGAVCKKFLSEFGI iiii---------------3333--------------3333--------------1111- KIGSFVVSIGQKEVEELKDKSYFANPEKLLSYHEKAEDSELRIPFPEKDEEFKTYIDEVK --------!!!!-3333--3333---------------------3333----------11 EKGESLGGVFEVFALNVPPGLGSHIQWDRRIDGRIAQASIQAIKGVEIGLGFEAARRFGS 11-----------------------1111----------2222-------3333---333 QVHDEIGWSEGKGYFRHSNNLGGTEGGITNGPIVVRVAKPIPTIVAVPAASVVGEALAIV 3-----------------1111--iiii-------------------------------- LADALLEKLGGDFEEVKKRFEDYVNHVKSF --------------------------1111 >CELL DIVISION CONTROL PRO; SWP:P11433; PDB:1Q1OA; GPLGSILFRISYNNNSNNTSSSEIFTLLVEKVWNFDDLIMAINSKISNTHNNNISPITKI ------------------------------------------------------------ KYQDEDGDFVVLGSDEDWNVAKEMLAENNEKFLNIRLY ---3333------3333--------------------- >PUTIDAREDOXIN 
REDUCTASE; SWP:P16640; PDB:1Q1RA; NANDNVVIVGTGLAGVEVAFGLRASGWEGNIRLVGDATVIPHHLPPLSKAYLAGKATAES -------------------------------------------3333--3333---3333 LYLRTPDAYAAQNIQLLGGTQVTAINRDRQQVILSDGRALDYDRLVLATGGRPRPLPVAS ----3333-------------------------1111------------------1111- GAVGKANNFRYLRTLEDAECIRRQLIADNRLVVIGGGYIGLEVAATAIKANMHVTLLDTA ---------------------11112222------------------------------- ARVLERVTAPPVSAFYEHLHREAGVDIRTGTQVCGFEMSTDQQKVTAVLCEDGTRLPADL -2222---3333--------1111-------------------------1111------- VIAGIGLIPNCELASAAGLQVDNGIVINEHMQTSDPLIMAVGDCARFHSQLYDRWVRIES --------------1111---------1111---1111---1111----1111------- VPNALEQARKIAAILCGKVPRDEAAPWFWSDQYEIGLKMVGLSEGYDRIIVRGSLAQPDF -------------1111--------------%%%%-------2222-------1111--- SVFYLQGDRVLAVDTVNRPVEFNQSKQIITDRLPVEPNLLGDESVPLKEIIAAAKAELSS ----------------------------1111---3333--1111--------------- A - >FIBROBAST GROWTH FACTOR H; SWP:P61328; PDB:1Q1UA; PQLKGIVTRLFSQQGYFLQMHPDGTIDGTKDENSDYTLFNLIPVGLRVVAIQGVKASLYV -----------1111-----1111------3333-------------------3333--- AMNGEGYLYSSDVFTPECKFKESVFENYYVIYSSTLYRQQESGRAWFLGLNKEGQIMKGN --1111------------------%%%%----------------------1111---111 RVKKTKPSSHFVPKPIEV 1-11111111-------- >DEK PROTEIN; SWP:P35659; PDB:1Q1VA; DEPLIKKLKKPPTDEELKETIKKLLASANLEEVTMKQICKKVYENYPTYDLTERKDFIKT ------------3333--------11113333---------3333--------------- TVKELISLEH ---------- >sulfotransferase family, ; SWP:O00204; PDB:1Q20A; SDISEISQKLPGEYFRYKGVPFPVGLYSLESISLAENTQDVRDDDIFIITYPKSGTTWMI -----1111-------iiii---------------------1111-----2222------ EIICLILKEGDPSWIRSVPIWERAPWCETIVGAFSLPDQYSPRLMSSHLPIQIFTKAFFS -----1111--3333---1111---------1111--------------1111-3333-- SKAKVIYMGRNPRDVVVSLYHYSKIAGQLKDPGTPDQFLRDFLKGEVQFGSWFDHIKGWL --------------------3333-3333------------1111-2222---------1 RMKGKDNFLFITYEELQQDLQGSVERICGFLGRPLGKEALGSVVAHSTFSAMKANTMSNY 111--------3333---------------------------------------3333-1 TLLPPSLLDHRRGAFLRKGVCGDWKNHFTVAQSEAFDRAYRKQMRGMPTFPWDE 
1113333-3333---------3333------------------1111--1111- >CHLORAMPHENICOL ACETYLTRA; SWP:P00483; PDB:1Q23A; ITGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILA -------33331111--------------------------------------------- RLMNAHPEFRMAMKDGELVIWDSVHPCYTVFHEQTETFSSLWSEYHDDFRQFLHIYSQDV -11113333----iiii------------------------------------------- ACYGENLAYFPKGFIENMFFVSANPWVSFTSFDLNVANMDNFFAPVFTMGKYYTQGDKVL ---------1111----------1111---------------------------!!!!-- MPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGG -------3333-3333------------------ >CATION-INDEPENDENT MANNOS; SWP:P08169; PDB:1Q25A; QGAEFPELCSYTWEAVDTKNNMLYKINICGNMGVAQCGPSSAVCMHDLKTDSFHSVGDSL ----3333---------1111-----1111---33331111------1111------111 LKTASRSLLEFNTTVNCKQQNHKIQSSITFLCGKTLGTPEFVTATDCVHYFEWRTTAACK 1-----------------2222--------------------------------3333-- KNIFKANKEVPCYAFDRELKKHDLNPLIKTSGAYLVDDSDPDTSLFINVCRDIEVLRASS -1111----------1111----3333--------------------------------3 PQVRVCPTGAAACLVRGDRAFDVGRPQEGLKLVSNDRLVLSYVKEGAGQPDFCDGHSPAV 333---2222----------------------------------------1111------ TITFVCPSERREGTIPKLTAKSNCRFEIEWVTEYACHRDYLESRSCSLSSAQHDVAVDLQ -------------------------------3333-3333--------3333------33 PLSRVEASDSLFYTSEADEYTYYLSICGGSQAPICNKKDAAVCQVKKADSTQVKVAGRPQ 33--------------1111-----iiii--3333----------3333--------111 NLTLRYSDGDLTLIYFGGEECSSGFQRMSVINFECNQTAGNNGRGAPVFTGEVDCTYFFT 1-----iiii----------3333-----------1111%%%%---------%%%%---- WDTKYACV ---1111- >PUTATIVE NUDIX HYDROLASE ; SWP:Q9RY71; PDB:1Q27A; MGGVSDERLDLVNERDEVVGQILRTDPALRWERVRVVNAFLRNSQGQLWIPRRSPSKSLF ----------------------3333---------------------------------- PNALDVSVGGAVQSGETYEEAFRREAREELNVEIDALSWRPLASFSPFQTTLSSFMCVYE -------------------------------%%%%------------------------- LRSDATPIFNPNDISGGEWLTPEHLLARIAAGEAAKGDLAELVRRCYREEE --------------------3333---3333-------------------- >EXOCELLOBIOHYDROLASE I; SWP:P00725; PDB:1Q2BA; SACTLQSETHPPLTWQKCSSGGTCTQQTGSVVIDANWRWTHATNSSTNCYDGNTWSSTLC 
------------------3333-----------3333----1111--------------- PDNETCAKNCCLDGAAYASTYGVTTSGNSLSIDFVTQSAQKNVGARLYLMASDTTYQEFT -------------------------!!!!------------------------------- LLGNEFSFDVDVSQLPCGLNGALYFVSMDADGGVSKYPTNTAGAKYGTGYCDSQCPRDLK 2222-------11112222---------111133331111--3333-----1111----- FINGQANVEGWEPSSNNANTGIGGHGSCCSEMDIWEANSISEALTPHPCTTVGQEICEGC -%%%%--2222-----1111-------------------------------------!!! GCGGTYSCNRYGGTCDPDGCDWNPYRLGNTSFYGPGSSFTLDTTKKLTVVTQFETSGAIN !--3333---------------3333---------1111--------------3333--- RYYVQNGVTFQQPNAELGSYSGNELNDDYCTAEEAEFGGSSFSDKGGLTQFKKATSGGMV ----iiii--------!!!!---------------------------------------- LVMSLWDDYYANMLWLDSTYPTNETSSTPGAVRGSCSTSSGVPAQVESQSPNAKVTFSNI ------------3333----11113333--------1111---------1111------- KFGPIGSTGNPSG ---2222------ >PNC27; SWP:Q6IT77; PDB:1Q2FA; PPLSQETFSDLWKLLKKWKMRRNQFWVKVQRG -----------------3333--3333----- >adaptor protein with plec; SWP:O14492; PDB:1Q2HA; PDWRQFCELHAQAAAVDFAHKFCRFLRDNPAYDTPDAGASFSRHFAANFLDVFGEEVRRV ----------------------------3333-1111----------------------- LVA --- >NEUROTOXIN BMK37; SWP:P83407; PDB:1Q2KA; AACYSSDCRVKCVAMGFSSGKCINSKCKCYK ---3333-----1111--------------- >PROTEASE III; SWP:P05458; PDB:1Q2LA; ETGWQPIQETIRKSDKDNRQYQAIRLDNGMVVLLVSDPQAVKSLSALVVPVGSLEDPEAY -------------------------1111-------1111-----------3333-3333 QGLAHYLEHMSLMGSKKYPQADSLAEYLKMHGGSHNASTAPYRTAFYLEVENDALPGAVD ---------1111-3333---------------------1111-------1111------ RLADAIAEPLLDKKYAERERNAVNAELTMARTRDGMRMAQVSAETINPAHPGSKFSGGNL ------------1111-------------1111--------1111-11111111----33 ETLSDKPGNPVQQALKDFHEKYYSANLMKAVIYSNKPLPELAKMAADTFGRVPNKESKKP 33---2222--------------3333---------------------3333-------- EITVPVVTDAQKGIIIHYVPALPRKVLRVEFRIDNNSAKFRSKTDELITYLIGNRSPGTL -------3333------------------------3333----------------2222- SDWLQKQGLVEGISANSDPIVNGNSGVLAISASLTDKGLANRDQVVAAIFSYLNLLREKG -------------------1111------------------------------------- 
IDKQYFDELANVLDIDFRYPSITRDMDYVEWLADTMIRVPVEHTLDAVNIADRYDAKAVK ----------------------------------1111--1111-1111----------- ERLAMMTPQNARIWYISPKEPHNKTAYFVDAPYQVDKISAQTFADWQKKAADIALSLPEL --11113333------1111-----------------------------1111------- NPYIPDDFSLIKSEKKYDHPELIVDESNLRVVYAPSRYFASEPKADVSLILRNPKAMDSA --------------------------------------1111------------------ RNQVMFALNDYLAGLALDQLSNQASVGGISFSTNANNGLMVNANGYTQRLPQLFQALLEG ----------------------------------------------1111---------- YFSYTATEDQLEQAKSWYNQMMDSAEKGKAFEQAIMPAQMLSQVPYFSRDERRKILPSIT ----------------------3333-------------1111-----------3333-- LKEVLAYRDALKSGARPEFMVIGNMTEAQATTLARDVQKQLGADGSEWCRNKDVVVDKKQ -----------2222--------------------------------------------- SVIFEKAGNSTDSALAAVFVPTGYDEYTSSAYSSLLGQIVQPWFYNQLRTEEQLGYAVFA ------------------------3333-------------3333--------------- FPMSVGRQWGMGFLLQSNDKQPSFLWERYKAFFPTAEAKLRAMKPDEFAQIQQAVITQML ---------------------------------------1111--------------111 QAPQTLGEEASKLSKDFDRGNMRFDSRDKIVAQIKLLTPQKLADFFHQAVVEPQGMAILS 1-------------------1111--------3333------------------------ QISGSQNGKAEYVHPEGWKVWENVSALQQTMPLMSEK ---------------------------1111------ >SIMILAR TO HYPOTHETICAL P; SWP:O31628; PDB:1Q2YA; MKAVIAKNEEQLKDAFYVREEVFVKEQNVPAEEEIDELENESEHIVVYDGEKPVGAGRWR -------------------------------3333--3333-------!!!!-------- MKDGYGKLERICVLKSHRSAGVGGIIMKALEKAAADGGASGFILNAQTQAVPFYKKHGYR -iiii--------3333-----------------------------3333----1111-- VLSEKEFLDAGIPHLQMMKD -------------------- >TYROSYL-DNA PHOSPHODIESTE; SWP:P38319; PDB:1Q32A; GAVFKLMKSDFYEREDMITLKDIFGTETLKRSILFSFQYELDFLLRQFHQNVENITIVGQ ------------------3333---1111---------------11113333-------2 KGTIMPIEARAMDATLAVILKKVKLIEITMPPFASHHTKLIINFYDNGECKIFLPSNNFT 222----3333---3333-1111-------2222-------------------------3 SMETNLPQQVCWCSPLLKIGKEGLPVPFKRSLIEYLNSYHLKDIDELITKSVEEVNFAPL 333---------------------------------3333----------------3333 SELEFVYSTPSKFQSSGLLSFYNKLEKLSDTAKHYLCQTSSIGTSLSRARDENLWTHLMI 
----------1111--------------------------------------3333---- PLFTGIMSPPILPTNSLINEYSQRKIKPYIIFPTEQEFVTSPLKWSSSGWFHFQYLQKKS ---------------------------------3333---11111111------1111-- YYEMLRNKFKVFYKQDPAMVTRRRGTTPAHSKFYMHCATNSQVFKELEWCLYTSANLSQT ---------------1111-3333-----------------2222------------333 AWGTVSRKPRNYEAGVLYHSRRLRKVTCRTFTRDPTHVAVPFTLPVIPYDLAEDECFCL 3-----------------1111-------3333----------------1111------ >ADP-RIBOSE PYROPHOSPHATAS; SWP:Q9BW91; PDB:1Q33A; ENSHNKARTSPYPGSKVERSQVPNEKVGWLVEWQDYKPVEYTAVSVLAGPRWADPQISES ----1111---2222-------3333-3333-1111-------3333--1111--3333- NFSPKFNEKDGHVERKSKNGLYEIENGRPRNPAGRTGLVGRGLLGRWGPNHAADPIITRW ---------!!!!-----------iiii--1111------!!!!---------------- KRDSSGNKIHPVSGKHILQFVAIKRKDCGEWAIPGGVDPGEKISATLKREFGEEALNSLQ --1111---3333-----------3333---------2222---------------3333 KTSAEKREIEEKLHKLFSQDHLVIYKGYVDDPRNTDNAWETEAVNYHDETGEIDNLLEAG ---3333-----------------------11111111---------1111--------1 DDAGKVKWVDINDKLKLYASHSQFIKLVAEKRDAHWSEDSEADCHAL 111--------1111--!!!!-------------------1111--- >IRON BINDING PROTEIN FBPA; SWP:Q9Z4N6; PDB:1Q35A; ANEVNVYSYRQPYLIEPMLKNFEKDTGIKVNIIFADGLVDRVKQEGELSPADVLLTVDIS ----------3333------------------------------!!!!------------ RVMEIVNADLAQKIDSKVLEKNIPAQFRDSNDQWFGLTTRARVIYTSKDRVGKLPAGFDY -----1111--------------3333-1111----------------------222233 LDLAKPEYKGKVCVRSGKNSYNVSLFAAMIEHYGIEKTKAFLEGLKANLARKPQGGDRDQ 33--3333-------1111--------------------------1111------3333- VKAIKEGICDYSIGNSYYYGKMLDDEKQKSWAEAAIINFPSGEHGTHKNISGVVIAKHSP --------------3333-------------1111------1111----------1111- NKANAVKLIEYLSGEKAQGLYAELNHEYPVKEGIEPSAIVKGWGTFKSDTIKLEDIAKNY ------------------------------2222--33333333-------33333333- EAALKLVDEVKFDDFSE ----------1111--- >STROMELYSIN-2; SWP:P09238; PDB:1Q3AA; MPKWRKTHLTYRIVNYTPDLPRDAVDSAIEKALKVWEEVTPLTFSRLYEGEADIMISFAV ----------------3333---------------------------------------- KEHGDNYSFDGPGHSLAHAYPPGPGLYGDIHFDDDEKWTEDASGTNLFLVAAHELGHSLG 
-------------------------2222---------------------------1111 LFHSANTEALMYPLYNSLAQFRLSQDDVNGIQSLYG --------1111------------3333--3333-- >Potassium/sodium hyperpol; SWP:O88703; PDB:1Q3EA; DSSRRQYQEKYKQVEQYMSFHKLPADFRQKIHDYYEHRYQGKMFDEDSILGELNGPLREE ------------------1111----------------%%%%-------1111------- IVNFNCRKLVASMPLFANADPNFVTAMLTKLKFEVFQPGDYIIREGTIGKKMYFIQHGVV ------------3333-----------1111-----2222---2222------------- SVLTGNKEMKLSDGSYFGEICLLTRGRRTASVRADTYCRLYSLSVDNFNEVLEEYPMMRR -----------2222------------------------------------1111----- AFETVAIDRLDR --------1111 >NA,K-ATPASE; SWP:Q4RA55; PDB:1Q3IA; MMTVAHMWFDNQIHEADTTTFDKRSPTWTALSRIAGLCNRAVFKRDTAGDASESALLKCI --------%%%%------------------------------------------------ ELSCGSVRKMRDRNPKVAEISYQLSIHEREDNPQSHVLVMKGAPERILDRCSSILVQGKE ----------1111-----------------3333------------1111----iiii- IPLDKEMQDAFQNAYLELGGLGERVLGFCQLNLPSGKFPRGFKFDTDELNFPTEKLCFVG ------------------1111----------------2222------------------ LMSMIDHHHHHH ------------ >ALO3; SWP:P83653; PDB:1Q3JA; CIKNGNGCQPNGSQGNCCSGYCHKQPGWVAGYCRRK ----------------3333---------------- >HETEROCHROMATIN PROTEIN 1; SWP:P05205; PDB:1Q3LA; EYAVEKIIDRRVRKGMVEYYLKWKGYPETENTWEPENNLDCQDLIQQYEASR ------------iiii------22223333----3333-------------- >SHANK1; SWP:Q9WV48; PDB:1Q3OA; DYIIKEKTVLLQKKDSEGFGFVLRGAKAQTPIEEFTPTPAFPALQYLESVDEGGVAWRAG -------------1111------------1111----3333---------2222--1111 LRMGDFLIEVNGQNVVKVGHRQVVNMIRQGGNTLMVKVVMVTRH -2222----iiii-1111-----------!!!!----------- >THERMOSOME ALPHA SUBUNIT; SWP:O24729; PDB:1Q3QA; VVILPEGTQRYVGRDAQRLNILAARIIAETVRTTLGPKGMDKMLVDSLGDIVVTNDCATI ----2222-----------------------11111111------1111------3333- LDKIDLQHPAAKMMVEVAKTQDKEAGDGTTTAVVIAGELLRKAEELLDQNIHPSIITKGY ---------------------------3333---------------1111-3333----- ALAAEKAQEILDEIAIRVDPDDEETLLKIAATSITGKNAESHKELLAKLAVEAVKQVAEK ------------------1111----------------3333------------------ KDGKYVVDLDNIKFEKKAGEGVEESELVRGVVIDKEVVHPRMPKRVENAKIALINEALEV 
-------1111---------3333--------------1111------------------ KKTETDAKINITSPDQLMSFLEQEEKMLKDMVDHIAQTGANVVFVQKGIDDLAQHYLAKY ------------3333-------------------3333------------------111 GIMAVRRVKKSDMEKLAKATGAKIVTNVKDLTPEDLGYAEVVEERKLAGENMIFVEGCKN 1-------------------------3333-3333-----------%%%%---------- PKAVTILIRGGTEHVIDEVERALEDAVKVVKDVMEDGAVLPAGGAPEIELAIRLDEYAKQ -----------------------------------------iiii-----------3333 VGGKEALAIENFADALKIIPKTLAENAGLDTVEMLVKVISEHKNRGLGIGIDVFEGKPAD --3333------------------1111-----------------3333----------3 MLEKGIIEPLRVKKQAIKSASEAAIMILRIDDVIAAKA 333-----3333--------------1111-------- >MRNA TRANSPORT REGULATOR ; SWP:P84148; PDB:1Q40A; QDPTQQLEPFLKRFLASLDLLYTQSQPFPNVESYATQLGSNLKRSSAIIVNGQPIIPSPQ -1111----------------------------3333-1111-------iiii----333 EDCKLQFQKKWLQTPLSSHQLTSYDGHLIPGTGTFVVHFSAKVRFDQSGRNRLGESADLF 3--------3333---------------2222------------------1111---111 QENNQRPIWGSWFGVDVNLVVDENVQDGEIINSDYRFTYVPND 1------------------------------------------ >mRNA export factor MEX67; SWP:P84149; PDB:1Q40B; SRNLATNFIANYLKLWDANRSELILYQNESQFSQVDSSHPHLIESGSTDFGYYLNNSRNL 3333--------------3333----1111-----1111----------33331111-11 TRVSSIKARAKLSIGQEQIYKSFQQLPKTRHDIIATPELFSEVYKFPTLNGIITLHGSFD 11--3333-------------3333----------1111--------%%%%--------- EVAQPEVDGSASRYHSGPKHKRIPLSKKSFDRTFVVIPGSIVASDTLLIRPYTSDFPWKV -------------------------------------------------------1111- >MRNA TRANSPORT REGULATOR ; SWP:P84148; PDB:1Q42A; QDPTQQLEPFLKRFLASLDLLYTQPTSQPFPNVESYATQLGSNLKRSSAIIVNGQPIIPS 3333----------3333------1111---33333333-11111111---iiii----1 PQEDCKLQFQKKWLQTPLSSHQLTSYDGHLIPGTGTFVVHFSAKVRFDQSGRNRLGESAR 111---------1111--------------2222------------------1111---- PIWGSWFGVDVNLVVDENVMQDGEIINSMDYRFTYVPND --------------------------------------- >STEROID SULPHOTRANSFERASE; SWP:P52839; PDB:1Q44A; SVPAYLGDEDLTQETRALISSLPKEKGWLVSEIYEFQGLWHTQAILQGILICQKRFEAKD ------------------1111-------------iiii-------------------11 
SDIILVTNPKSGTTWLKALVFALLNRHKFPVSSSGNHPLLVTNPHLLVPFLEGVYYESPD 11-----2222-----------1111---33331111-----3333-----------111 FDFSSLPSPRLMNTHISHLSLPESVKSSSCKIVYCCRNPKDMFVSLWHFGKKLYPIEKAV 11111-----------1111-3333------------------------3333-3333-- EAFCEGKFIGGPFWDHILEYWYASRENPNKVLFVTYEELKKQTEVEMKRIAEFLECGFIE -------2222---------------1111------------------------------ EEEVREIVKLCSFEGWRDTLSESLAEEIDRTIEEKFKGSGLKFS ----------33333333-------------------------- >12-OXOPHYTODIENOATE-10,11; SWP:Q9FUP0; PDB:1Q45A; SNETLFSSYKMGRFDLSHRVVLAPMTRCRALNGVPNAALAEYYAQRTTPGGFLISEGTMV ---1111---!!!!---------------2222----------11112222--------- SPGSAGFPHVPGIYSDEQVEAWKQVVEAVHAKGGFIFCQLWHVGRASHAVYQPNGGSPIS 2222--------------------------------------!!!!-33332222----- STNKPISENRWRVLLPDGSHVKYPKPRALEASEIPRVVEDYCLSALNAIRAGFDGIEIHG --------------1111-----------3333---------------1111-------- AHGYLIDQFLKDGINDRTDQYGGSIANRCRFLKQVVEGVVSAIGASKVGVRVSPAIDHLD iiii--------------1111---------------------3333-----1111-%%% ATDSDPLSLGLAVVGMLNKLQGVNGSKLAYLHVTQPREEEAKLMKSLRMAYNGTFMSSGG %------------------------------------3333------------------- FNKELGMQAVQQGDADLVSYGRLFIANPDLVSRFKIDGELNKYNRKTFYTQDPVVGYTDY --------------------3333-------------------3333------2222--- PFLAP ----- >TRANSLATION INITIATION FA; SWP:P20459; PDB:1Q46A; TSHCRFYENKYPEIDDIVMVNVQQIAEMGAYVKLLEYDNIEGMILLSELSRRRIRSIQKL ------------2222---------3333----1111-------3333-------3333- IRVGKNDVAVVLRVDKEKGYIDLSKRRVSSEDIIKCEEKYQKSKTVHSILRYCAEKFQIP -2222-------------------3333-------------------------------3 LEELYKTIAWPLSRKFGHAYEAFKLSIIDETVWEGIEPPSKDVLDELKNYISKR 333--------------3333---33333333---------------------- >SEMAPHORIN 3A; SWP:O08665; PDB:1Q47A; KNNVPRLKLSYKEMLESNNVITFNGLANSSSYHTFLLDEERSRLYVGAKDHIFSFNLVNI ---------3333------------1111---------1111------------------ KDFQKIVWPVSYTRRDECKWAGKDILKECANFIKVLEAYNQTHLYACGTGAFHPICTYIE -----------------------3333----------------------%%%%------- VGHHPEDNIFKLQDSHFENGRGKSPYDPKLLTASLLIDGELYSGTAADFMGRDFAIFRTL 
--------------------------3333------iiii-------1111--------- GHHHPIRTEQHDSRWLNDPRFISAHLIPESDNPEDDKVYFFFRENAIDGEHSGKATHARI -----------3333----------------3333------------------------- GQICKNDFGGHRSLVNKWTTFLKARLICSVPGPNGIDTHFDELQDVFLMNSKDPKNPIVY ---1111---------------------------------------------1111---- GVFTTSSNIFKGSAVCMYSMSDVRRVFLGPYAHRDGPNYQWVPYQGRVPYPRPGTCPSKT ------------------3333---1111------1111--------------------- FGGFDSTKDLPDDVITFARSHPAMYNPVFPINNRPIMIKTDVNYQFTQIVVDRVDAEDGQ -----3333--------------------2222----------------------3333- YDVMFIGTDVGTVLKVVSVLLEEMTVFREPTTISAMELSTKQQQLYIGSTAGVAQLPLHR -----------------------------------------------------------3 CDIY 333- >NIFU-LIKE PROTEIN; SWP:Q57074; PDB:1Q48A; MAYSEKVIDHYENPRNVGSLDKKDSNVGTGMVGAPACGDVMQLQIKVDDNGIIEDAKFKT -----------------------1111------1111-----------iiii-------- YGCGSAIASSSLITEWVKGKSLEEAGAIKNSQIAEELELPPVKVHCSILAEDAIKAAIAD -------3333---------3333----3333---------------------------- YKAKQGLEHHHHHH -------------- >PROSTAGLANDIN G/H SYNTHAS; SWP:P05979; PDB:1Q4GA; PVNPCCYYPCQHQGICVRFGLDRYQCDCTRTGYSGPNCTIPEIWTWLRTTLRPSPSFIHF ----1111--iiii-------------2222---1111---------------------- LLTHGRWLWDFVNATFIRDTLMRLVLTVRSNLIPSPPTYNIAHDYISWESFSNVSYYTRI ----------3333---------------1111-------------3333---------- LPSVPRDCPTPMGTKGKKQLPDAEFLSRRFLLRRKFIPDPQGTNLMFAFFAQHFTHQFFK ----1111-1111-------------------------1111------------------ TSGKMGPGFTKALGHGVDLGHIYGDNLERQYQLRLFKDGKLKYQMLNGEVYPPSVEEAPV --3333-----3333---1111--------------%%%%-----iiii----3333--- LMHYPRGIPPQSQMAVGQEVFGLLPGLMLYATIWLREHNRVCDLLKAEHPTWGDEQLFQT --------3333-----1111---------------------------1111-------- ARLILIGETIKIVIEEYVQQLSGYFLQLKFDPELLFGAQFQYRNRIAMEFNQLYHWHPLM ------------------------------33331111-------------11113333- PDSFRVGPQDYSYEQFLFNTSMLVDYGVEALVDAFSRQPAGRIGGGRNIDHHILHVAVDV -----!!!!--33332222------------------------------3333------- IKESRVLRLQPFNEYRKRFGMKPYTSFQELTGEKEMAAELEELYGDIDALEFYPGLLLEK 
----------3333--1111-----3333----------------1111----------- CHPNSIFGESMIEMGAPFSLKGLLGNPICSPEYWKASTFGGEVGFNLVKTATLKKLVCLN -2222-----------------11111111----3333------------------1111 TKTCPYVSFHVPD ------------- >PROTEIN AT3G17210; SWP:Q9LUV2; PDB:1Q4RA; PVKHVLLASFKDGVSPEKIEELIKGYANLVNLIEPKAFHWGKDVSIENLHQGYTHIFEST ----------2222-----------------------------------iiii------- FESKEAVAEYIAHPAHVEFATIFLGSLDKVLVIDYKPTSVSL -----------------------1111--------------- >THIOESTERASE; SWP:Q04416; PDB:1Q4UA; TGGNLPDVASHYPVAYEQTLDGTVGFVIDEMTPERATASVEVTDTLRQRWGLVHGGAYCA 1111-----------1111------------1111-------3333-1111--------- LAEMLATEATVAVVHEKGMMAVGQSNHTSFFRPVKEGHVRAEAVRIHAGSTTWFWDVSLR ---------33333333------------------------------------------- DDAGRLCAVSSMSIAVRPRR 1111---------------- >MENB; SWP:Q7U1T0; PDB:1Q52A; NPFDAKAWRLVDGFDDLTDITYHRHVDDATVRVAFNRPEVRNAFRPHTVDELYRVLDHAR ---3333------1111-------------------3333----3333------------ MSPDVGVVLLTGNGPSPKDGGWAFCSGGDHILEVQRLIRFMPKVVICLVNGWAAGGGHSL -1111------------------------3333--------------------------- HVVCDLTLASREYARFKQTDADVGSFDGGYGSAYLARQVGQKFAREIFFLGRTYTAEQMH ------------------3333------3333--1111---------------------- QMGAVNAVAEHAELETVGLQWAAEINAKSPQAQRMLKFAFNLLDDGLVGQQLFAGEATRL ---------3333-----------1111-------------3333--------------- AYMTDEAVEGRDAFLQKRPPDWSPFPRYF ---------------------1111---- >P450 EPOXIDASE; SWP:Q9KIZ4; PDB:1Q5DA; DFKPFAPGYAEDPFPAIERLREATPIFYWDEGRSWVLTRYHDVSAVFRDERFAVSREEWE --1111-3333------------------1111-----3333--33333333--111111 SSAEYSSAIPELSDMKKYGLFGLPPEDHARVRKLVNPSFTSRAIDLLRAEIQRTVDQLLD 11------3333------1111------------3333-3333----------------- ARSGQEEFDVVRDYAEGIPMRAISALLKVPAECDEKFRRFGSATARALGVGLVPRVDEET -3333-----11111111-----------3333-------------1111---------- KTLVASVTEGLALLHGVLDERRRNPLENDVLTMLLQAEADGSRLSTKELVALVGAIIAAG ------------------------------------------------------------ TDTTIYLIAFAVLNLLRSPEALELVKAEPGLMRNALDEVLRFDNILRIGTVRFARQDLEY 
---------------------------3333----------------------------i CGASIKKGEMVFLLIPSALRDGTVFSRPDVFDVRRDTSASLAYGRGPHVCPGVSLARLEA iii--2222-------11111111--1111-1111-11111111-11111111------- EIAVGTIFRRFPEMKLKETPVFGYHPAFRNIESLNVILKPS ----------1111----------1111------------- >PILS; SWP:Q9ZIU9; PDB:1Q5FA; MWGKKDAGTELTNYQTLATNTIGMMKGVDGYAFTSGAKMTDTLIQAGAAKGMTVSGDPAS -------------------3333---------------------------------1111 GSATLWNSWGGQIVVAPDTAGGTGFNNGFTITTNKVPQSACVSISTGMSRSGGTSGIKIN ------------------------------------------------------------ GNNHTDAKVTAEIASSECTADNGRTGTNTLVFNYNG ------------------------------------ >DUTP PYROPHOSPHATASE; SWP:Q6FHN1; PDB:1Q5HA; MQLRFARLSEHATAPTRGSARAAGYDLYSAYDYTIPPMEKAVVKTDIQIALPSGCYGRVA --------1111------------------------------------------------ PRSGLAAKHFIDVGAGVIDEDYRGNVGVVLFNFGKEKFEVKKGDRIAQLICERIFYPEIE --3333------------1111------------------2222---------------- EVQALDD ---1111 >3-CARBOXY-CIS,CIS-MUCONAT; SWP:Q59092; PDB:1Q5NA; QLYASLFYQRDVTEIFSDRALVSYMVEAEVALAQAQAQVGVIPQSAATVIQRAAKTAIDK 1111---------------------------------------------------3333- IDFDALATATGLAGNIAIPFVKQLTAIVKDADEDAARYVHWGATSQDILDTACILQCRDA -------------------------------3333111122223333------------- LAIVQNQVQQCYETALSQAQTYRHQVMMGRTWLQQALPITLGHKLARWASAFKRDLDRIN ---------------------1111-----%%%%-----3333----------------- AIKARVLVAQLGGAVGSLASLQDQGSIVVEAYAKQLKLGQTACTWHGERDRIVEIASVLG --3333------1111-3333------------1111-------11113333-------- IITGNVGKMARDWSLMMQTEIAEVFEPTRNPVAAASVLAAANRVPALMSSIYQSMVQEHE -----------------1111-------------------------------3333-!!! 
RSLGAWHAEWLSLPEIFQLTAGALERTLDVLKGMEVNAENMHQNIECTHGLIMAEAVMMA !--33333333--------------------------------1111iiii--------- LAPHMGRLNAHHVVEAACKTAVAEQKHLKDIISQVDEVKQYFNPSQLDEIFKPESYLGNI --------------------------33333333-3333------------3333----- QDQIDAVLQEA ----------- >PROTEASOME ALPHA-TYPE SUB; SWP:Q53080; PDB:1Q5QA; AEQIMRDRSELARKGARRSVLFGLVENPSTALHKVSELYLFAGYNEFE 3333-----------------11---------------11---3333- >Proteasome beta-type subu; SWP:Q53079; PDB:1Q5QH; TTIVALTYKGGVLLAGDRRATQGNLIASRDVEKVYVTDEYSAAGIAGTAGIAIELVRLFA ---------------------!!!!-----------------------3333-------- VELEHYEKIEGVPLTFDGKANRLASMVRGNLGAAMQGLAVVPLLVGYDLDADDESRAGRI -------------------------------3333------------1111-1111---- VSYDVVGGRYEERAGYHAVGSGSLFAKSALKKIYSPDSDEETALRAAIESLYDAADDDSA ---1111------------1111-----------2222-------------------111 TGGPDLTRGIYPTAVTITQAGAVHVSEETTSELARRIVAERTEQ 1----------------1111----3333--------------- >REGULATOR OF RNASE E ACTI; SWP:P32165; PDB:1Q5XA; KYDTSELCDIYQEDVNVVEPLFSNFGGRASFGGQIITVKCFEDNGLLYDLLEQNGRGRVL ----------!!!!-----------------------------------3333-2222-- VVDGGGSVRRALVDAELARLAVQNEWEGLVIYGAVRQVDDLEELDIGIQAMAAIPVGAAG ---iiii--------------1111-----------33331111---------------- EGIGESDVRVNFGGVTFFSGDHLYADNTGIILSEDPLDIE -----------%%%%--2222------------------- >NICKEL RESPONSIVE REGULAT; SWP:P28910; PDB:1Q5YA; TQGFAVLSYVYEHEKRDLASRIVSTQHHHHDLSVATLHVHINHDDCLEIAVLKGDMGDVQ -----------3333-------------3333---------------------------- HFADDVIAQRGVRHGHLQCLPKED --------2222------------ >SIPA; SWP:Q56027; PDB:1Q5ZA; PFSGLKFKQNSFLSTVPSVTNMHSMHFDARETFLGVIRKALEPDTSTPFPVRRAFDGLRA -2222--22221111--3333--1111----------------1111------------- EILPNDTIKSAALKAQCSDIDKHPELKAKMETLKEVITHHPQKEKLAEIALQFAREAGLT ------------------3333-----------------1111--------------111 RLKGETDYVLSNVLDGLIGDGSWRA 1!!!!------------11113333 >GENERAL TRANSCRIPTION FAC; SWP:Q9ESZ8; PDB:1Q60A; GSSGSSGLKQKVENLFNEKCGEALGLKQAVKVPFALFESFPEDFYVEGLPEGVPFRRPST 
---------------------1111-------3333-------------2222------- FGIPRLEKILRNKAKIKFIIKKPEMFETAIKESSGPSSG --------3333---------3333-3333--------- >Decapping protein involve; SWP:NA; PDB:1Q67A; LNFNVIGRYDPKIKQLLFHTPHASLYKWDFKKDEWNKLEYQGVLAIYLRDVSKDIYNYGL ---------1111----------------------------------------------- IILNRINPDNFSMGIVPNSVVNKRKVFNAEEDTLNPLECMGVEVKDELVIIKNLKHEVYG --------------------------------------------!!!!----1111---- IWIHTVSDRQNIYELIKYLLENEPKD -------------------------- >T-CELL SURFACE GLYCOPROTE; SWP:P01730; PDB:1Q68A; RCRHRRRQAERLSQIKRLLSEKKTCQCPHRFQKTCSPI --------11113333--1111---------------- >Proto-oncogene tyrosine-p; SWP:P06239; PDB:1Q68B; SHPEDDWLENIDVCENCHYPIVPLDGKGT -----3333-------------------- >FKBP-type peptidyl-prolyl; SWP:P45523; PDB:1Q6HA; AFKNDDQKSAYALGASLGRYENSLKEQEKLGIKLDKDQLIAGVQDAFADKSKLSDQEIEQ ---3333--------------------1111--------------1111----------- TLQAFEARVKSSAQAKEKDAADNEAKGKEYREKFAKEKGVKTSSTGLVYQVVEAGKGEAP ------------------------------------2222--1111-------------- KDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGLKNIKKGGKIKLVIPP 1111---------1111----3333-------1111--------11112222------33 ELAYGKAGVPGIPPNSTLVFDVELLDVK 33------2222---------------- >3-KETO-L-GULONATE 6-PHOSP; SWP:P39304; PDB:1Q6OA; LPMLQVALDNQTMDSAYETTRLIAEEVDIIEVGTILCVGEGVRAVRDLKALYPHKIVLAD ------------------33331111--------------3333-------1111----- AKIADAGKILSRMCFEANADWVTVICCADINTAKGALDVAKEFNGDVQIELTGYWTWEQA -----3333-----1111------11113333--------1111---------------- QQWRDAGIGQVVYHRSRDAQAAGVAWGEADITAIKRLSDMGFKVTVTGGLALEDLPLFKG ---1111------------1111---3333--------------------33333333-- IPIHVFIAGRSIRDAASPVEAARQFKRSIAELW --------3333--------------------- >PHOSPHOLIPASE A2 VRV-PL-V; SWP:P59071; PDB:1Q6VA; SLLEFGKMILEETGKLAIPSYSSYGCYCGGCGSGTPKDATDRCCFVHCCCYGNLPDCNPK ---------------------------------------------------------333 SDRYKYKRVNGAIVCEKGTSCENRICECDKAAAICFRQNLNTYSGKYMLYPDFLCKGELK 3-------------------------------------3333-3333----1111----- C - >MONOAMINE 
OXIDASE REGULAT; SWP:O28346; PDB:1Q6WA; ARNPIYFESIQIGEKIEGLPRTVTETDIWTFAYLTADFFPLHTDVEFAKKTIFGKPIAQG -----3333-----------------------------------3333--1111------ LVLSIALGVDQVILSNYDVSSVIAFFGIKDVRFLRPVFIGDTIAASAEVVEKQDFDEKSG ------------------1111---------------1111--------------1111- VVTYKLEVKNQRGELVLTALYSALIRKTP ----------------------------- >BENZOYLFORMATE DECARBOXYL; SWP:P20906; PDB:1Q6ZA; ASVHGTTYELLRRQGIDTVFGNPGSNALPFLKDFPEDFRYILALQEACVVGIADGYAQAS -----------1111--------3333-------1111---------------------- RKPAFINLHSAAGTGNAMGALSNAWNSHSPLIVTAGQQTRAMIGVEALLTNVDAANLPRP ------------------3333----------------333311112222--3333---- LVKWSYEPASAAEVPHAMSRAIHMASMAPQGPVYLSVPYDDWDKDADPQSHHLFDRHVSS ---------3333------------------------1111-----11113333------ SVRLNDQDLDILVKALNSASNPAIVLGPDVDAANANADCVMLAERLKAPVWVAPSAPRCP ------------------------------1111-------------------------- FPTRHPCFRGLMPAGIAAISQLLEGHDVVLVIGAPVFRYHQYDPGQYLKPGTRLISVTCD -1111-----------------2222----------------------2222-------- PLEAARAPMGDAIVADIGAMASALANLVEESSRQLPTAAPEPAKVDQDAGRLHPETVFDT ----------------------------------------------------3333---- LNDMAPENAIYLNESTSTTAQMWQRLNMRNPGSYYFCAAGGLGFALPAAIGVQLAEPERQ -----1111-----1111------------------1111----3333-------1111- VIAVIGDGSANYSISALWTAAQYNIPTIFVIMNNGTYGALRWFAGVLEAENVPGLDVPGI ------3333--3333-------------------------------------------- DFRALAKGYGVQALKADNLEQLKGSLQEALSAKGPVLIEVSTVS ------1111------------------1111------------ >FAB M82G2, LIGHT CHAIN; SWP:NA; PDB:1Q72H; EVTLQESGGGLVQPGGSMKLSCAASGFTFSDAWVDWVRQSPGKGLEWVAEIRNKA ------------2222-----------3333------------------------ >1D-myo-inosityl 2-acetami; SWP:O50426; PDB:1Q74A; ETPRLLFVHAHPDDESLSNGATIAHYTSRGAQVHVVTCTLGEEGEVIGDRWAQLTADHAD ------------3333----------1111----------1111----1111--1111-- QLGGYRIGELTAALRALGVSAPIYLGGAGRWRDSGMARSQRRFVDADPRQTVGALVAIIR --------------1111-----2222--------------3333-3333---------- ELRPHVVVTYDPNGGYGHPDHVHTHTVTTAAVAAAGVHPGDPWTVPKFYWTVLGLSALIS 
----------1111-------------------3333----------------------- GARALVPDDLRPEWVLPRADEIAFGYSDDGIDAVVEADEQARAAKVAALAAHATQVVVGP -11113333-1111---3333-----3333---------------------3333---11 TGRAAALSNNLALPILADEHYVLAGGSAGARDERGWETDLLAGLGFT 11----1111---------------------1111---1111----- >HYPOTHETICAL PROTEIN AQ_1; SWP:O66565; PDB:1Q77A; AKVLLVLTDAYSDCEKAITYAVNFSEKLGAELDILAVLEDVYNLERANVTFGLPFPPEIK -------3333--------------1111------------------------------- EESKKRIERRLREVWEKLTGSTEIPGVEYRIGPLSEEVKKFVEGKGYELVVWACYPSAYL -------------------------------------------------------3333- CKVIDGLNLASLIVK --------------- >POLY(A) POLYMERASE ALPHA; SWP:P25500; PDB:1Q79A; HYGITSPISLAAPKETDLLTQKLVETLKPFGVFEEEEELQRRILILGKLNNLVKEWIREI -------------3333---------3333------------------------------ SESKNLPQSVIENVGGKIFTFGSYRLGVHTKGADIDALCVAPRHVDRSDFFTSFYDKLKL -1111-3333-----------3333----2222--------11113333----------- QEEVKDLRAVEEAFVPVIKLFDGIEIDILFARLALQTIPEDLDLRDDSLLKNLDIRIRSL 1111----------------iiii--------------11113333-1111-----3333 NGCRVTDEILHLVPNIDNFRLTLRAIKLWAKRHNIYSNILGFLGGVSWAMLVARTCQLYP --------1111------------------1111---1111-----------------11 NAIASTLVHKFFLVFSKWEWPNPVLLKQPEECNLNLPVWDPRVNPSDRYHLMPIITPAYP 11------------1111---------------------33333333------------- QQNSTYNVSVSTRMVMVEEFKQGLAITDEILLSKAEWSKLFEAPNFFQKYKHYIVLLASA ----1111---------------------1111---3333----1111------------ PTEKQRLEWVGLVESKIRILVGSLEKNEFITLAHVNPQSFPAPKENPDKEEFRTMWVIGL ----------------------33331111---------------3333----------- VFKDLTYDIQSFTDTVYRQAINSKMFEVDMKIAAMHVKRKQLHQLLP --------------------1111--2222-------3333------ >3-OXOACYL-[ACYL-CARRIER P; SWP:P25716; PDB:1Q7BA; NFEGKIALVTGASRGIGRAIAETLAARGAKVIGTATSENGAQAISDYLGANGKGLMLNVT -2222-------------------1111-------------------!!!!------111 DPASIESVLEKIRAEFGEVDILVNNAGITRDNLLMRMKDEEWNDIIETNLSSVFRLSKAV 1-------------------------------3333-3333------------------- MRAMMKKRHGRIITIGSVVGTMGNGGQANYAAAKAGLIGFSKSLAREVASRGITVNVVAP 
----------------3333---2222--------------------1111--------- GFIETDMTRALSDDQRAGILAQVPAGRLGGAQEIANAVAFLASDEAAYITGETLHVNGGM ------3333-3333---33333333---3333------------1111-------iiii YMV --- >HYPOTHETICAL PROTEIN YFDW; SWP:P77407; PDB:1Q7EA; LSTPLQGIKVLDFTGVQSGPSCTQLAWFGADVIKIERPGVGDVTRHQLRDIPDIDALYFT --1111-------------------1111-------2222-3333-----2222--3333 LNSNKRSIELNTKTAEGKEVEKLIREADILVENFHPFTWEHIQEINPRLIFGSIKGFDEC -2222-----1111---------1111------------------1111--------111 SPYVNVKAYENVAQAAGGAASTTGFWDGPPLVSAAALGDSNTGHLLIGLLAALLHREKTG 1-1111------------3333--1111-------11113333-----------3333-- RGQRVTSQDAVLNLCRVKLRDQQRLDKLGYLEEYPQYPNGTFGDAVPRGGNAGGGGQPGW ------------------------------1111--------------!!!!-------- ILKCKGWETDPNAYIYFTIQEQNWENTCKAIGKPEWITDPAYSTAHARQPHIFDIFAEIE ---2222--1111------1111---------3333--1111-33333333--------- KYTVTIDKHEAVAYLTQFDIPCAPVLSKEISLDPSLRQSGSVVEVEQPLRGKYLTVGCPK -3333----------1111--------3333----------------------------- FSAFTPDIKAAPLLGEHTAAVLQELGYSDDEIAAKQNHAIE ------------2222------1111--------------- >BRAIN TUMOR CG10719-PA; SWP:Q8MQJ9; PDB:1Q7FA; IKRQRMIYHCKFGEFGVMEGQFTEPSGVAVNAQNDIIVADTNNHRIQIFDKEGRFKFQFG -----------------2222---------1111---------------1111------- ECGKRDSQLLYPNRVAVVRNSGDIIVTERSPTHQIQIYNQYGQFVRKFGATILQHPRGVT ----2222------------------------------1111------1111-------- VDNKGRIIVVECKVMRVIIFDQNGNVLHKFGCSKHLEFPNGVVVNDKQEIFISDNRAHCV -1111---------------1111-------1111------------------1111--- KVFNYEGQYLRQIGGEGITNYPIGVGINSNGEILIADNHNNFNLTIFTQDGQLISALESK ---1111-------2222---------1111----------------1111--------- VKHAQCFDVALMDDGSVVLASKDYRLYIYRYVQLAPVGM -----------------------------------2222 >CONSERVED HYPOTHETICAL PR; SWP:Q9HIB8; PDB:1Q7HA; SKHFISKKEAKRIWEQSRYGIDITGESLEVAAQKSASAYYIGGKPVFQAGDLIPSVYLLN ----------------1111--2222--------------iiii---------------- YRNPSRNIVTVDEGAEPHILNGSDLFAPGIVSDDSIRKGDIFVKSSKGYFIAVGAEDAGE ------------3333--1111---3333-------2222----1111--------3333 VATKRGKAARIIHFPGDELIRAFP 
-------------2222------- >HEMORRHAGIC PROTEIN-RHODO; SWP:P30403; PDB:1Q7IA; GKECDCSSPENPCCDAATCKLRPGAQCGEGLCCEQCKFSRAGKICRIARGDWNDDRCTGQ ----------3333---------------------------------------------- SADCPRYH -------- >AMINOACYLASE-1; SWP:Q03154; PDB:1Q7LA; EEEHPSVTLFRQYLRIRTVQPKPDYGAAVAFFEETARQLGLGCQKVEVAPGYVVTVLTWP ------------1111--------------------------------2222-------- GTNPTLSSILLNSHTDVVPVFKEHWSHDPFEAFKDSEGYIYARGAQDMKCVSIQYLEAVR --1111--------------3333---1111---1111---2222--3333--------- RLKVEGHRFPRTIHMTFVPDEEVGGHQGMELFVQRPEFHALRAGFALDEGIANPTDAFTV --1111-------------1111-11113333---------------------------- FYSERSPWWVRV ------1111-- >Aminoacylase-1; SWP:Q03154; PDB:1Q7LB; NPWWAAFSRVCKDMNLTLEPEIMPAAGDNRYIRAVGVPALGFSPMNRTPVLLHDHDERLH --------------------------3333--1111--------------2222------ EAVFLRGVDIYTRLLPALASVPALPSDS -----------------1111--3333- >PREDICTED AMIDOTRANSFERAS; SWP:NA; PDB:1Q7RA; NKIGVLGLQGAVREHVRAIEACGAEAVIVKKSEQLEGLDGLVLPGGESTTRRLIDRYGLE --------1111-------1111-------33331111--------3333---------- PLKQFAAAGKPFGTCAGLILLAKRIVGYDEPHLGLDITVERNSFGRQRESFEAELSIKGV ----------------------------------------3333---------------- GDGFVGVFIRAPHIVEAGDGVDVLATYNDRIVAARQGQFLGCSFHPELTDDHRLQYFLNV -----------------1111-----%%%%-----!!!!-----3333------------ KEAKASSLK ----3333- >BIT1; SWP:Q9Y3E5; PDB:1Q7SA; GEYKMILVVRNDLKMGKGKVAAQCSHAAVSAYKQIQRRNPEMLKQWEYCGQPKVVVKAPD ---------1111----------------------------------------------- EETLIALLAHAKMLGLTVSLIQDAGRTQIAPGSQTVLGIGPGPADLIDKVTGHLKLY -----------1111-----------------------------------1111--- >PDZ2B DOMAIN OF PTP-BAS (; SWP:Q12923; PDB:1Q7XA; GSSPPKPGDIFEVELAKNDNSLGISVTVLFDKGGVNTSVRHGGIYVKAVIPQGAAESDGR -----------------------------------------------------3333--- IHKGDRVLAVNGVSLEGATHKQAVETLRNTGQVVHLLLEKGQSPTSKE ---------iiii----------------------------------- >5-methyltetrahydrofolate ; SWP:Q9WYA5; PDB:1Q7ZA; MRNRREVSKLLSERVLLLDGAYGTEFMKYGYDDLPEELNIKAPDVVLKVHRSYIESGSDV 
--------------------------1111---3333----------------1111--- ILTNTFGATRMKLRKHGLEDKLDPIVRNAVRIARRAAGEKLVFGDIGPTGELPYPLGSTL ----111133331111-1111---------------!!!!-------------------3 FEEFYENFRETVEIMVEEGVDGIIFETFSDILELKAAVLAAREVSRDVFLIAHMTFDEKG 333------------1111-------------------------------------1111 RSLTGTDPANFAITFDELDIDALGINCSLGPEEILPIFQELSQYTDKFLVVEPNAGKPIV -1111---------1111-----------3333-------3333---------------- ENGKTVYPLKPHDFAVHIDSYYELGVNIFGGCCGTTPEHVKLFRKVLGNRKPLQRKKKRI %%%%-----3333--------1111------2222------------------------- FAVSSPSKLVTFDHFVVIGERINPAGRKKLWAEMQKGNEEIVIKEAKTQVEKGAEVLDVN ----1111---------------2222------1111------------1111------- FGIESQIDVRYVEKIVQTLPYVSNVPLSLDIQNVDLTERALRAYPGRSLFNSAKVDEEEL --3333-3333------------------------------------------------- EMKINLLKKYGGTLIVLLMGKDVPKSFEERKEYFEKALKILERHDFSDRVIFDPGVLPLG -----------------------------------------11111111--------333 AEGKPVEVLKTIEFISSKGFNTTVGLSNLSFGLPDRSYYNTAFLVLGISKGLSSAIMNPL 3--------------1111-----3333------3333---------1111------111 DETLMKTLNATLVILEKKE 1------------------ >39 KDA INITIATOR BINDING ; SWP:A2FMX0; PDB:1Q87A; SMCIGNSTPNEQETFRAKVDEIWFRLTQKTDGTVMRDFLIEKAAEYFKQPEQPKQNAIEV ------------------------------------------------1111-------- ISAIMAPQEEQTKSKADLYKFLAMFGPYETIMLKIASLLLISNNKGHWLTFDPQDSISGW ------3333---3333---------3333-----------1111--------------- FDQNEPNCLILKTPTGIRKIWNKPLIEATGQYLMDENGEKYDSWDKYFEMKPIAYPTFAP -1111-------1111------11113333----1111---------------------1 MHHHH 111-- >PROTEIN YJCS; SWP:O31641; PDB:1Q8BA; SHYITACLKIISDKDLNEIKEFKKLEEETNKEEGCITFHAYPLEPSERKILWEIWENEEA --------------3333-------------1111--------3333---------3333 VKIHFTKKHTIDVQKQELTEVEWLKSNVN -------3333--1111------------ >HYPOTHETICAL PROTEIN MG02; SWP:P47273; PDB:1Q8CA; LTRTQRRIAVVEFIFSLLFFLPKEAEVIQADFLEYDTKERQLNEWQKLIVKAFSENIFSF --------------3333--------------11113333--3333-------------- QKKIEEQQLKNQLEIQTKIDLLTTAVVLCALSEQKAHNTDKPLLISEALLIMDHYSQGAE 
-------3333-----------------------------------------1111--33 KKQTHALLDKLL 33--3333---- >GDNF FAMILY RECEPTOR ALPH; SWP:Q62997; PDB:1Q8DA; ERPNCLSLQDSCKTNYICRSRLADFFTNCQPESRSVSNCLKENYADCLLAYSGLIGTVTP ---------------------------------------3333--------3333----- NVAPWCDCSNSGNDLEDCLKFLNFFKDNTCLKNAIQAFG ----------!!!!------------------------- >PYRIMIDINE NUCLEOSIDE HYD; SWP:P33022; PDB:1Q8FA; KRKIILDCDPGHDDAIAIMMAAKHPAIDLLGITIVAGNQTLDKTLINGLNVCQKLEINVP ----------3333---------1111------------3333----------------- VYAGMPQPIMRQQIVADNIHGDTGLDGPVFEPLTRQAESTHAVKYIIDTLMASDGDITLV --------------------1111------------------------------------ PVGPLSNIAVAMRMQPAILPKIREIVLMGGAYGTGNFTPSAEFNIFADPEAARVVFTSGV --------------33331111---------------11113333---------1111-- PLVMMGLDLTNQTVCTPDVIARMERAGGPAGELFSDIMNFTLKTQFENYGLAGGPVHDAT -----3333------3333------------------------------------3333- CIGYLINPDGIKTQEMYVEVDVNSGPCYGRTVCDELGVLGKPANTKVGITIDTDWFWGLV --------------------------2222---1111----------------------- EECVRGYI ----1111 >COFILIN, NON-MUSCLE ISOFO; SWP:P23528; PDB:1Q8GA; MASGVAVSDGVIKVFNDMKVRKSSTPEEVKKRKKAVLFCLSEDKKNIILEEGKEILVGDV --------3333--3333------1111------------3333---------------- GQTVDDPYATFVKMLPDKDCRYALYDATYETKESKKEDLVFIFWAPESAPLKSKMIYASS -----3333-3333-1111--------------------------11111111------- KDAIKKKLTGIKHELQANCYEEVKDRCTLAEKLGGSAVISLEGKPL ------------------3333--3333-----3333--------- >OSTEOCALCIN; SWP:Q8HYY9; PDB:1Q8HA; PDPLPRRVCLNPDCDELADHIGFQEAYRRFYGIA ----------3333-3333---3333-------- >DNA POLYMERASE II; SWP:P21189; PDB:1Q8IA; AQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQ -------------1111--------1111---------------1111-------1111- GFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFIT ----------1111---------------------1111----1111-------1111-- SPVWVEGDMHNGTIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYML ---------iiii--------1111-------------1111--------iiii------ GPENGDASSLDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYR 
------1111--------3333-------------------------------------- LPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELL ---------------------------2222---------1111---------------- MDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAF ------------------------------------------------------------ GHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIVRTFLI ------------------------------------------------------------ DPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALK -------3333-3333----%%%%-----------------------1111--------- IIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWL -----------3333---3333-------------------1111--------------- KGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGS ----------------------------1111---------------------------- KKRYAGLIQEGDKQRMVFKGLETTDWTPLAQQFQQELYLRIFRNEPYQEYVRETIDKLMA ---------!!!!---------------------------1111-------------111 GELDARLVYSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF 1-3333------3333------------3333---------------- >COPPER-TRANSPORTING ATPAS; SWP:Q04656; PDB:1Q8LA; GSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQPHLISV ------------------------------1111-------------------------- EEMKKQIEAMGFPAFVKKQPKYLK ------------------3333-- >CROSSOVER JUNCTION ENDODE; SWP:P40116; PDB:1Q8RA; MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIE ------------3333--------------------------1111-------------- CHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMG ---------1111----------------3333-----------2222---------- >SR PROTEIN KINASE; SWP:Q03656; PDB:1Q8YA; YHPAFKGEPYKDARYILVRKLGWGHFSTVWLAKDMVNNTHVAMKIVRGDKVYTEAAEDEI ----2222--%%%%---------1111--------------------------------- KLLQRVNDADNTKEDSMGANHILKLLDHFNHKGPNGVHVVMVFEVLGENLLALIKKYEHR ------1111-3333--1111-----------1111--------------------%%%% GIPLIYVKQISKQLLLGLDYMHRRCGIIHTDIKPENVLMEIVDSPENLIQIKIADLGNAC --3333--------------------------3333-------1111--------1111- WYDEHYTNSIQTREYRSPEVLLGAPWGCGADIWSTACLIFELITGDFLFKDDDHIAQIIE 
1111--------1111----------3333------------------------------ LLGELPSYLLRNGKYTRTFFNSLLRNISKLKFWPLEDVLTEKYKFSKDEAKEISDFLSPM -----3333-----3333--------------------------------------3333 LQLDPRKRADAGGLVNHPWLKDTLGMEEIRVPDRELYGSGSDIPGWFEEVR ---3333---3333--3333--2222----11112222-1111-------- >Cytochrome b6-f complex i; SWP:P49728; PDB:1Q90C; QAAKDALGNDIKAGEWLKTHLAGDRSLSQGLKGDPTYLIVTADSTIEKYGLNAVCTHLGC -----------3333--------------2222--------------------------- VVPWVAAENKFKCPCHGSQYNAEGKVVRGPAPLSLALAHCDVAEGLVTFSTWTETDFRTG ----3333------------1111------------------------------------ LEPWWA ------ >5(3)-DEOXYRIBONUCLEOTIDAS; SWP:Q9NPB1; PDB:1Q92A; GRALRVLVDMDGVLADFEGGFLRKFRARFPDQPFIALEDRRGFWVSEQYGRLRPGLSEKA ---------2222---------------3333---3333----3333-----2222---- ISIWESKNFFFELEPLPGAVEAVKEMASLQNTDVFICTSPIKMFKYCPYEKYAWVEKYFG --1111-3333----2222------1111------------------------------3 PDFLEQIVLTRDKTVVSADLLIDDRPDITGAEPTPSWEHVLFTACHNQHLQLQPPRRRLH 3331111-----1111---------------------------3333------------- SWADDWKAILDSKRP 33333333--1111- >THIOL PEROXIDASE; SWP:Q57549; PDB:1Q98A; TVTLAGNPIEVGGHFPQVGEIVENFILVGNDLADVALNDFASKRKVLNIFPSIDTGVCAT ----------------2222--------1111---33332222---------------33 SVRKFNQQAAKLSNTIVLCISADLPFAQARFCGAEGIENAKTVSTFRNHALHSQLGVDIQ 33------1111-----------3333------2222-------1111----1111---- TGPLAGLTSRAVIVLDEQNNVLHSQLVEEIKEEPNYEAALAVLA -1111----------1111---------1111--------1111 >HEVEIN; SWP:P02877; PDB:1Q9BA; EQCGRQAGGKLCPNNLCCSQWGWCGSTDEYCSPDHNCQSNCKD ---1111-----%%%%--1111----3333-3333-------- >SON OF SEVENLESS PROTEIN; SWP:Q07889; PDB:1Q9CA; LPYEFFSEENAPKWRGLLVPALKKVQGQVHPTLESNDDALQYVEELILQLLNLCQAQPRS ---------33332222-------1111-3333--3333--------------1111--- ASDVEERVQKSFPHPIDKWAIADAQSAIESLPVEKIHPLLKEVLGYKIDHQVSVYIVAVL -------1111--------------------------3333-------3333-------- EYISADILKLAGNYVRNIRHYEITKQDIKVACADKVLDFH --------------3333---------------------- >GUANYL-SPECIFIC RIBONUCLE; SWP:P00651; PDB:1Q9EA; 
ACDYTCGSNCYSSSDVSTAQAAGYKLHEDGETVGSNSYPHEFRNWNGFDFSVSSPYYEWP -----!!!!------------------------1111------1111------------- ILSSGDVYSGGSPGADRVVFNENNQLAGVITHTGASGNNFVECT -3333---------------1111-------2222!!!!----- >CELLOBIOHYDROLASE I CATAL; SWP:Q8TFL9; PDB:1Q9HA; QAGTATAENHPPLTWQECTAPGSCTTQNGAVVLDANWRWVHDVNGYTNCYTGNTWDPTYC -------------------2222----------3333-------------------1111 PDDETCAQNCALDGADYEGTYGVTSSGSSLKLNFVTGSNVGSRLYLLQDDSTYQIFKLLN -----------------------------------!!!!--------------------- REFSFDVDVSNLPCGLNGALYFVAMDADGGVSKYPNNKAGAKYGTGYCDSQCPRDLKFID ------------2222---------111133331111--3333-----1111------%% GEANVEGWQPSTGIGDHGSCCAEMDVWEANSISNAVTPHPCDTPGQTMCSGDDCGGTYSN %%--2222-------------------------------------------33331111- DRYAGTCDPDGCDFNPYRMGNTSFYGPGKIIDTTKPFTVVTQFLTDDGTDTGTLSEIKRF 1111----------3333--------------------------11111111-------- YIQNSNVIPQPNSDISGVTGNSITTEFCTAQKQAFGDTDDFSQHGGLAKMGAAMQQGMVL --%%%%--------2222-----------------------------------3333--- VMSLWDDYAAQMLWLDSDYPTDADPTTPGIARGTCPTDSGVPSDVESQSPNSYVTYSNIK -----------3333--------1111--------1111-3333----1111-------- FGPINSTFT --2222--- >POLYKETIDE SYNTHASE ASSOC; SWP:P96208; PDB:1Q9JA; MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETHPVLASHLEQS -2222-----3333--------------------3333-----------3333------- SDGGWNLVADDLLHSGICVIDAELRLDQSVSLLHLQLILREGGAELTLYLHHCMADGHHG --------------------------3333--------------------1111------ AVLVDELFSRYTDAVTTGDPGPITPQPTPLSMEAVLAQRGIRKAERFMSVMYAYPGLPQA ------------------------------3333------------3333---------- VPVTRLWLSKQQTSDLMAFGREHRLSLNAVVAAAILLTEWQLRNTPHVPIPYVYPVDLRF ---------------------------------------1111-------------3333 VLAPPVAPTEATNLLGAASYLAEIGPNTDIVDLASDIVATLRADLANGVIQQSGLHFGTA ------1111--------------1111---------------------1111----333 FEGTPPGLPPLVFCTDATSFPTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLII 3------------------------2222------------------------%%%%--- EHHGHIAEPGKSLEAIRSLLCTVPSEYG -------11113333--11113333--- >S25-2 FAB (IGG1K) 
LIGHT C; SWP:Q52L64; PDB:1Q9RA; DIVMSQSPSSLAVSAGEKVTMSCKSSQSLLNSRTRKNYLAWYQQKPGQSPKLLIYWASTR -------------2222---------------------------2222------------ ESGVPDRFTGSGSGTDFTLTITSVQAEDLAVYYCKQSYNLRTFGGGTKLEIKRADAAPTV 22223333----!!!!--------1111-------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----33331111---------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ------33331111--------3333--------3333- >Putative uncharacterized ; SWP:Q52L64; PDB:1Q9RB; EVKLVESGGGLVQSGGSLRLSCATSGFTFTDYYMSWVRQPPGKALEWLGFIRNKANGYTT ------------2222-----------3333--------2222---------3333---- EYSPSVKGRFTISRDNSQSILYLQMNTLRAEDSATYYCARDHDGYYERFSYWGQGTLVTV -----2222-------------------3333---------------------------- SAAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQ ------------------------------------------%%%%-------------% SDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR %%%---------1111------------1111---------- >HYPOTHETICAL PROTEIN APC3; SWP:NA; PDB:1Q9UA; AMFHYTVDVSTGMNETIERLEESLKQEGFGVLWQFSVTEKLQEKGLDFSTPMVILEVNPQ ------------------------1111-------------1111--------------- EAARVLNENLLVGYFLPKLVVYQENGTTKIGMPKPTMLVGMMNDPALKEIAADIEKRLAA -----------------------iiii-------33333333------------------ CLDRCR --1111 >S45-18 FAB (IGG1K) LIGHT ; SWP:NA; PDB:1Q9WB; EVILVESGGGLVQPGGSLRLSCSTSGFTFTDYYMSWVRQPPGKALEWLGFIRNKPKGYTT ------------2222-----------3333--------2222---------3333---- EYSASVKGRFTISRDNSQSILYLQMNTLRAEDSATYYCVRDIYSFGSRDGMDYWGQGTSV -----2222-------------------3333-----------1111------------- TVSSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAV ---------------------------------------------iiii----------- LQSDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC --------------1111------------1111------------ >PROTEIN (MYRISTOYLATED HI; SWP:P04324; PDB:1QA5A; GGKWSKSSVVGWPAVRERMRRAEPAADGVGAASRDLEKHGAITSSNTAANNAACAW ------------33333333--------------3333------------------ >PERCHLORIC ACID SOLUBLE P; SWP:P52759; 
PDB:1QAHA; SSIIRKVISTSKAPAAIGAYSQAVLVDRTIYVSGQIGMDPSSGQLVPGGVAEEAKQALKN ---------3333----------------------------------------------- LGEILKAAGCDFTNVVKTTVLLADINDFGTVNEIYKTYFQGNLPARAAYQVAALPKGSRI -----1111-3333---------1111------3333----------------2222--- EIEAIAVQGPFT ------------ >ERMC' METHYLTRANSFERASE; SWP:P13956; PDB:1QAMA; QNFITSKHNIDKIMTNIRLNEHDNIFEIGSGKGHFTLELVQRCNFVTAIEIDHKLCKTTE -------------1111--1111------!!!!--------------------------- NKLVDHDNFQVLNKDILQFKFPKNQSYKIFGNIPYNISTDIIRKIVFDSIADEIYLIVEY 1111----------3333---------------3333----------------------- GFAKRLLNTKRSLALFLMAEVDISILSMVPREYFHPKPKVNSSLIRLNRKKSRISHKDKQ ---------------3333-----------------------------------3333-- KYNYFVMKWVNKEYKKIFTKNQFNNSLKHAGIDDLNNISFEQFLSLFNSYKLFNK ------------3333-----------------1111------------------ >QUINOLINIC ACID PHOSPHORI; SWP:P30012; PDB:1QAPA; DDRRDALLERINLDIPAAVAQALREDLGGEVDAGNDITAQLLPADTQAHATVITREDGVF -------------------------------33333333--------------------- CGKRWVEEVFIQLAGDDVRLTWHVDDGDAIHANQTVFELQGPARVLLTGERTALNFVQTL -3333-------------------------2222-------------------------- SGVASEVRRYVGLLAGTQTQLLDTRKTLPGLRTALKYAVLCGGGANHRLGLTDAFLIKEN -------------2222------------------------------------------- HIIASGSVRQAVEKAFWLHPDVPVEVEVENLDELDDALKAGADIIMLDNFNTDQMREAVK ------------------3333-------3333-------------------------11 RVNGQARLEVSGNVTAETLREFAETGVDFISVGALTKHVRALDLSMRFC 112222--------3333--------------3333------------- >NEURONAL NITRIC OXIDE SYN; SWP:P29476; PDB:1QAUA; NVISVRLFKRKVGGLGFLVKERVSKPPVIISDLIRGGAAEQSGLIQAGDIILAVNDRPLV ------------!!!!-----------------2222--3333--2222----!!!!-11 DLSYDSALEVLRGIASETHVVLILRGPEGFTTHLETTFTGDGTPKTIRVTQP 11--------1111------------2222--------1111---------- >ALPHA-1 SYNTROPHIN (RESID; SWP:Q61234; PDB:1QAVA; GSLQRRRVTVRKADAGGLGISIKGGRENKMPILISKIFKGLAADQTEALFVGDAILSVNG ------------3333---------------------22223333----2222----iii EDLSSATHDEAVQALKKTGKEVVLEVKYMK i-------------1111------------ >ALGINATE LYASE A1-III; SWP:Q9KWU1; 
PDB:1QAZA; GSHPFDQAVVKDPTASYVDVKARRTFLQSGQLDDRLKAALPKEYDCTTEATPNPQQGEMV --1111-----1111---------------------1111----3333------------ IPRRYLSGNHGPVNPDYEPVVTLYRDFEKISATLGNLYVATGKPVYATCLLNMLDKWAKA -------------11113333------------------------------------111 DALLNYDPKSQSWYQVEWSAATAAFALSTMMAEPNVDTAQRERVVKWLNRVARHQTSFPG 1-----1111-------------------1111---------------------3333-- GDTSCCNNHSYWRGQEATIIGVISKDDELFRWGLGRYVQAMGLINEDGSFVHEMTRHEQS --3333--------------------------------------1111-3333--!!!!- LHYQNYAMLPLTMIAETASRQGIDLYAYKENGRDIHSARKFVFAAVKNPDLIKKYASEPQ ------------------1111-3333--%%%%----------------3333------- DTRAFKPGRGDLNWIEYQRARFGFADELGFMTVPIFDPRTGGSATLLAYKP -333322221111------------1111-------------3333----- >HUMAN SIGNAL RECOGNITION ; SWP:P13624; PDB:1QB2A; QFTLRDMYEQFQNIMKMGPFSQILGMIPGFGTDFMSKGNEQESMARLKKLMTIMDSMNDQ --------------------------2222--2222----------------3333-333 ELDSTDGAKVFSKQPGRIQRVARGSGVSTRDVQELLTQYTKFAQMVK 3----3333-3333--------1111---------------3333-- >CYCLIN-DEPENDENT KINASES ; SWP:P20486; PDB:1QB3A; HAFQGRKLTDQERARVLEFQDSIHYSPRYSDDNYEYRHVMLPKAMLKVIPSDYFNSEVGT --------3333------3333-------------------333311111111------- LRILTEDEWRGLGITQSLGWEHYECHAPEPHILLFKRPLNYEAELRAATAAAQ ---------3333---------------1111--------------------- >Heat-labile enterotoxin I; SWP:P43529; PDB:1QB5D; GASQFFKDNCNRTTASLVEGVELTKYISDINNNTDGMYVVSSTGGVWRISRAKDYPDNVM ---------1111-----------------1111------1111---------------- TAEMRKIAMAAVLSGMRVNMCASPASSPNVIWAIELEAE --------------------------------------- >ADENINE PHOSPHORIBOSYLTRA; SWP:Q27679; PDB:1QB7A; PFKEVSPNSFLLDDSHALSQLLKKSYRWYSPVFSPRNVPRFADVSSITESPETLKAIRDF ------------1111-------------3333---------33331111---------- LVQRYRAMSPAPTHILGFDARGFLFGPMIAVELEIPFVLMRKADKNAGLLIRSEPYEKEY -------------------3333------------------1111--------------- KEAAPEVMTIRYGSIGKGSRVVLIDDVLATGGTALSGLQLVEASDAVVVEMVSILSIPFL ----------2222-2222-------------------------------------3333 
KAAEKIHSTANSRYKDIKFISLLSDDALTEENCGDSKNYTGPRVLSCGDVLAEHPH ---------%%%%1111------3333-3333------------------1111-- >BACTERIOPHAGE Q BETA CAPS; SWP:P03615; PDB:1QBEA; AKLETVTLGNIGKDGKQTLVLNPRGVNPTNGVASLSQAGAVPALEKRVTVSVSQPNYKVQ -----------1111---------------------22223333---------------- VKIQNPTACTCDPSVTRQAYADVTFSFTQYSTDEERAFVRTELAALLASPLLIDAIDQLN ---------------------------11113333--------------------1111- PAY --- >PEPTIDE YY; SWP:P01305; PDB:1QBFA; YPAKPEAPGEDASPEELSRYYASLRHYLNLVTRQRY -----------------%%%%--------------- >INHIBITOR OF APOPTOSIS PR; SWP:Q13490; PDB:1QBHA; GSHMQTHAARMRTFMYWPSSVPVQPEQLASAGFYYVGRNDDVKCFCCDGGLRCWESGDDP -------3333------------3333--------------------------------- WVEHAKWFPRCEFLIRMKGQEFVDEIQGRYPHLLEQLLSTS 3333------3333---11113333---------------- >DNA (5'-D(*TP*CP*GP*CP*GP; SWP:P55265; PDB:1QBJA; SIYQDQEQRILKFLEELGEGKATTAHDLSGKLGTPKKEINRVLYSLAKKGKLQKEAGTPP -----------------1111--------1111-3333--------1111---------- LWKIA ----- >Transportin-1; SWP:Q92973; PDB:1QBKB; YEWKPDEQGLQQILQLLKESQSPDTTIQRTVQQKLEQLNQYPDFNNYLIFVLTKLKSEDE -----------1111--------------3333-------3333-------------333 PTRSLSGLILKNNVKAHFQNFPNGVTDFIKSECLNNIGDSSPLIRATVGILITTIASKGE 3----------3333-----------3333----3333-----------1111------- LQNWPDLLPKLCSLLDSEDYNTCEGAFGALQKICEDSAEILDSDRPLNIIPKFLQFFKHS ----------------3333----------------------------------3333-- SPKIRSHAVACVNQFIISRTQALLHIDSFTENLFALAGDEEPEVRKNVCRALVLLEVRMD ------------3333---------3333----------------3333---------11 RLLPHHNIVEYLQRTQDQDENVALEACEFWLTLAEQPICKDVLVRHLPKLIPVLVNGKYS 11----3333--1111------------------------33331111---3333----- DIDIILLKGDVEEDETIPDSEQDIRPRFHRSRTVAQQHDEDDDDDDEIDDDDTISDWNLR 3333---------------1111--------------!!!!11111111----------- KCSAAALDVLANVYRDELLPHILPLLKELLFHHEWVVKESGILVLGAIAEGCQGIPYLPE ---1111--------3333-----------------------------------3333-- LIPHLIQCLSDKKALVRSITCWTLSRYAHWVVSQPPDTYLKPLTELLKRILDSNKRVQEA 33333333---------------3333--3333--------------3333--------- 
ACSAFATLEEEACTELVPYLAYILDTLVFAFSKYQHKNLLILYDAIGTLADSVGHHLNKP ---------------3333--------3333---3333---------------------- EYIQLPPLIQKWNLKDEDKDLFPLLECLSSVATALQSGFLPYCEPVYQRCVNLVQKTLAQ ------------------------------3333-------------------------- ALNNAQPDQYEAPDKDFIVALDLLSGLAEGLGGNIEQLVARSNILTLYQCQDKPEVRQSS --------------------------------1111--11113333-------------- FALLGDLTKACFQHVKPCIADFPILGTNLNPEFISVCNNATWAIGEISIQGIEQPYIPVL ----------3333-1111---3333---------------------------1111--- HQLVEIINRPNTPKTLLENTAITIGRLGYVCPQEVAPLQQFIRPWCTSLRNIRDNEEKDS ----3333-----------------------3333--1111-------1111-------- AFRGICTISVNPSGVIQDFIFFCDAVASWINPKDDLRDFCKILHGFKNQVGDENWRRFSD -------33331111-----------------3333---------------------333 QFPLPLKERLAAFYGV 3--------------- >FABE8A; SWP:NA; PDB:1QBLH; EVQLQQSGAELVKPGASVKLSCTASGFNIKDTYMHWVKQRPEKGLEWIGRIDPASGNTKY ------------2222-----------3333---------------------1111---- DPKFQDKATITADTSSNTAYLQLSSLTSEDTAVYYCAGYDYGNFDYWGQGTTLTVSSAET 3333---------1111---------3333------------------------------ TPPSVYPLAPGTAALKSSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYT ----------1111----------------------------------------iiii-- LTSSVTVPSSTWPSQTVTCNVAHPASSTKVDKKIVPRNC -------1111------------1111------------ ------------------------------------------------------------ ------------------------------------------ >EVH1 DOMAIN FROM ENA/VASP; SWP:P70429; PDB:1QC6A; SEQSICQARASVVYDDTSKKWVPIKFSRINIYHNTASSTFRVVGVKLQDQQVVINYSIVK ----------------------------------------------------------11 GLKYNQATPTFHQWRDARQVYGLNFASKEEATTFSNALFALNIN 11------------------------------------------ >FLIG; SWP:Q9WY63; PDB:1QC7A; MFVFEDILKLDDRSIQLVLREVDTRDLALALKGASDELKEKIFKNMSKRAAALLKDELEY --33331111-------3333--------------------3333-3333---------- MGPVRLKDVEEAQQKIINIIRRLEEAGEIVIARGGGEELIM ----3333-----------------------1111------ >POKEWEED ANTIVIRAL PROTEI; SWP:P10297; PDB:1QCIA; VNTIIYNVGSTTISKYATFLNDLRNEAKDPSLKCYGIPMLPNTNTNPKYVLVELQGSNKK 
---------------------------------iiii----1111-----------%%%% TITLMLRRNNLYVMGYSDPFETNKCRYHIFNDISGTERQDVETTLCPNANSRVSKNINFD -----------------------------1111!!!!----------3333--------- SRYPTLESKAGVKSRSQVQLGIQILDSNIGKISGVMSFTEKTEAEFLLVAIQMVSEAARF -------------3333--------------2222------------------------- KYIENQVKTNFNRAFNPNPKVLNLQETWGKISTAIHDAKNGVLPKPLELVDASGAKWIVL -------1111------3333-----------------iiii--------3333------ RVDEIKPDVALLNYVGGSCQTT 33333333-------------- >UBIQUITIN CONJUGATING ENZ; SWP:P15731; PDB:1QCQA; MSSSKRIAKELSDLERDPPTSCSAGPVGDDLYHWQASIMGPADSPYAGGVFFLSIHFPTD -----------------------------1111------------2222--------111 YPFKPPKISFTTKIYHPNINANGNICLDILKDQWSPALTLSKVLLSICSLLTDANPDDPL 1--------------11111111---111111113333----------------1111-- VPEIAHIYKTDRPKYEATAREWTKKYAV ---------------------------- >N-ETHYLMALEIMIDE SENSITIV; SWP:P18708; PDB:1QCSA; NAGRSQAARCPTDELSLSNCAVVSEKDYQSGQHVIVRTSPNHKYIFTLRTHPSVVPGSVA -----------3333--------3333-2222------1111--------11112222-- FSLPQRKWAGLSIGQEIEVALYSFDKAKQCIGTTIEIDFLQKKNIDSNPYDTDKAAEFIQ -----------2222---------1111------------3333---------------- QFNNQAFSVGQQLVFSFNDKLFGLLVKDIEADPSILKRQKIEVGLVVGNSQVAFEKAENS -------2222-----%%%%-------------1111---------1111------2222 SLNLIGKAKT ---------- >RUBREDOXIN VARIANT PFRD-X; SWP:P24297; PDB:1QCVA; AKWVLKITGYIYDEDAGDPDNGISPGTKFEELPDDWVAPITGAPKSEFEKLED ------------3333-3333------1111-3333--------1111----- >PECTIN LYASE B; SWP:Q00205; PDB:1QCXA; AGVVGAAEGFAHGVTGGGSASPVYPTTTDELVSYLGDNEPRVIILDQTFDFTGTEGTETT --------1111--!!!!--------------------------------2222------ TGCAPWGTASQCQVAINLHSWCDNYQASAPKVSVTYDKAGILPITVNSNKSIVGQGTKGV ---1111-3333-----%%%%----1111-------3333--------------!!!!-- IKGKGLRVVSGAKNVIIQNIAVTDINPKYVWGGDAITVDDSDLVWIDHVTTARIGRQHIV --------iiii-------------3333------------------------------- LGTSADNRVTISYSLIDGRSDYSATCNGHHYWGVYLDGSNDMVTLKGNYFYNLSGRMPKV ----------------------1111---------------------------------- 
QGNTLLHAVNNLFHNFDGHAFEIGTGGYVLAEGNVFQDVNVVVETPISGQLFSSPDANTN -----------------------2222----------------------------3333- QQCASVFGRSCQLNAFGNSGSMSGSDTSIISKFAGKTIAAAHPPGAIAQWTMKNAGQGK ------------------------------1111---------1111--------2222 >N5-CARBOXYAMINOIMIDAZOLE ; SWP:P09028; PDB:1QCZA; PARVAIVGSKSDWATQFAAEIFEILNVPHHVEVVSAHRTPDKLFSFAESAEENGYQVIIA --------3333--------------------------------------1111------ GAGGAAHLPGIAAKTLVPVLGVPVQSAALSGVDSLYSIVQPRGIPVGTLAIGKAGAANAA ------33333333------------1111------------------------------ LLAAQILATHDKELHQRLNDWRKAQTDEVLENPDPRGAA ------3333------------------1111------- >VHH-R2 ANTI-RR6 ANTIBODY; SWP:A2KD59; PDB:1QD0A; QVQLQESGGGLVQAGGSLRLSCAASGRAASGHGHYGMGWFRQVPGKEREFVAAIRWSGKE ------------------------------------------2222--------3333-- TWYKDSVKGRFTISRDNAKTTVYLQMNSLKGEDTAVYYCAARPVRVADISLPVGFDYWGQ ---1111-----------------------1111-------------1111--------- GTQVTVSS -------- >FORMIMINOTRANSFERASE-CYCL; SWP:P53603; PDB:1QD1A; SQLVECVPNFSEGKNQEVIDAISRAVAQTPGCVLLDVDSGPSTNRTVYTFVGRPEDVVEG -------------------------1111-----------3333---------------- ALNAARAAYQLIDMSRHHGEHPRMGALDVCPFIPVRGVTMDECVRCAQAFGQRLAEELGV -------------1111------------------------------------------- PVYLYGEAARTAGRQSLPALRAGEYEALPEKLKQAEWAPDFGPSAFVPSWGATVAGARKF ----!!!!--3333------22221111-33333333---------3333---------- LLAFNINLLSTREQAHRIALDLREQGGRLKKVQAIGWYLDEKNLAQVSTNLLDFEVTGLH ----------------------3333------------3333----------3333---- TVFEETCREAQELSLPVVGSQLVGLVPLKALLDAAAFYCEKENLFLLQDEHRIRLVVNRL ----------1111------------3333------------------------------ GLDSLAPFKPKERIIEYLV --1111--3333-3333-- >PURINE REGULATORY PROTEIN; SWP:P37552; PDB:1QD9A; TKAVHTKHAPAAIGPYSQGIIVNNMFYSSGQIPLTPSGEMVNGDIKEQTHQVFSNLKAVL ----------------------------------1111---------------------- EEAGASFETVVKATVFIADMEQFAEVNEVYGQYFDTHKPARSCVEVARLPKDALVEIEVI -----3333---------3333-------3333---------------2222-------- ALVK ---- >CYTOCHROME C NITRITE REDU; SWP:Q9Z4P4; PDB:1QDBA; 
GIAGKEKSEEWAKYYPRQFDSWKKTKEYDSFTDMLAKDPALVIAWSGYAFSKDYNSPRGH ---333333333333--------3333-----3333-3333---22221111------33 YYALQDNVNSLRTGAPVDAKTGPLPTACWTCKSPDVPRLIEEDGELEYFTGKWAKYGSQI 33-------3333-----------3333----------------3333---33331111- VNVIGCANCHDDKTAELKVRVPHLNRGLQAAGLKTFEESTHQDKRTLVCAQCHVEYYFKK ----3333------------3333----1111--3333-3333----------------- TEWKDAKGADKTAMVVTLPWANGVGKDGNAGVEGMIKYYDEINFSDWTHNISKTPMLKAQ ----1111-----------1111--iiii----------1111----------------- HPGFEFWKSGIHGQKGVSCADCHMPYTQEGSVKYSDHQVKENPLDSMDQSCMNCHRESES ------------1111-3333-------!!!!---------3333-11113333------ KLRGIVHQKYERKEFLNKVAFDNIGKAHLETGKAIEAGASDEELKEVRKLIRHGQFKADM ----------------------------------1111-3333----------------- AIAAHGNYFHAPEETLRLLAAGSDDAQKARLLLVKILAKHGVMDYIAPDFDTKDKAQKLA ---1111------------------------------1111------------------- KVDIAALAAEKMKFKQTLEQEWKKEAKAKGRANPELYKDVDTINDGKSSWNKK --------------------------1111--3333-----1111-------- >LITHOSTATHINE; SWP:P05451; PDB:1QDDA; QEAQTELPQARISCPEGTNAYRSYCYYFNEDRETWVDADLYCQNMNSGNLVSVLTQAEGA -------3333---2222------------------------------------------ FVASLIKESGTDDFNVWIGLHDPKKNRAWHWSSGSLVSYKSWGIGAPSSVNPGYCVSLTS ------3333-----------1111-----3333--------2222------------33 STGFQKWKDVPCEDKFSFVCKFKN 33--------1111---------- >TRANSLATION INITIATION FA; SWP:P10081; PDB:1QDEA; IQTNYDKVVYKFDDMELDENLLRGVFGYGFEEPSAIQQRAIMPIIEGHDVLAQAQSGTGK ----------3333-----------3333----3333----------------------- TGTFSIAALQRIDTSVKAPQALMLAPTRELALQIQKVVMALAFHMDIKVHACIGGLRDAQ --------11113333----------3333---------1111----------------- IVVGTPGRVFDNIQRRRFRTDKIKMFILDEADEMLSSGFKEQIYQIFTLLPPTTQVVLLS -------------------1111---------3333----------11111111------ ATMPNDVLEVTTKFMRNPVRILV ---3333---------------- >ANTHRANILATE SYNTHASE (TR; SWP:Q06128; PDB:1QDLA; MEVHPISEFASPFEVFKCIERDFKVAGLLESIGRYSVIAWSTNGYLKIHDDPVNILNGYL ----3333----------------------------------------------3333-1 KDLKLADIPGLFKGGMIGYISYDAVRFWEKIRDLKPAAEDWPYAEFFTPDNIIIYDHNEG 
111---------------------3333---------------------------3333- KVYVNADLSSVGGCGDIGEFKVSFYDESLNKNSYERIVSESLEYIRSGYIFQVVLSRFYR ------------------------------------------------------------ YIFSGDPLRIYYNLRRINPSPYMFYLKFDEKYLIGSSPELLFRVQDNIVETYPIAGTRPR -----3333------------------!!!!-------------%%%%------------ GADQEEDLKLELELMNSEKDKAEHLMLVDLARNDLGKVCVPGTVKVPELMYVEKYSHVQH ---------------------------------------2222----------------- IVSKVIGTLKKKYNALNVLSATFPAGTVSGAPKPMAMNIIETLEEYKRGPYAGAVGFISA ---------11113333-------3333-------------------!!!!-------11 DGNAEFAIAIRTAFLNKELLRIHAGAGIVYDSNPESEYFETEHKLKALKTAIGVR 11-------------!!!!------------------------------------ >Anthranilate synthase com; SWP:Q06129; PDB:1QDLB; MDLTLIIDNYDSFVYNIAQIVGELGSYPIVIRNDEISIKGIERIDPDRLIISPGPGTPEK ------------1111-------------------------3333-----------3333 REDIGVSLDVIKYLGKRTPILGVCLGHQAIGYAFGAKIRRARKVFHGKISNIILVNNSPL 3333-------------------------------------------------------3 SLYYGIAKEFKATRYHSLVVDEVHRPLIVDAISAEDNEIMAIHHEEYPIYGVQFHPESVG 333---------------------------------------------------1111-- TSLGYKILYNFLNRV --------------- ------------------------------------------ >HISTIDYL-TRNA SYNTHETASE; SWP:O32422; PDB:1QE0A; MIKIPRGTQDILPEDSKKWRYIENQLDELMTFYNYKEIRTPIFESTDLFAREMYTFKDKG ----2222---3333---------------1111----------3333------------ DRSITLRPEGTAAVVRSYIEHKMQGNPNQPIKLYYNGPMFRYYRQFNQFGVEAIGAENPS ---------------------3333--------------------------------333 VDAEVLAMVMHIYQSFGLKHLKLVINSVGDMASRKEYNEALVKHFEPVIHEFCSDCQSRL 3------------1111--------------------------33331111---333311 HTDPMRILTAPRITDFLNEESKAYYEQVKAYLDDLGIPYTEDPNLVRGLDYYTHTAFELM 11--3333---1111---3333-------------------1111---1111-------- MDNPNYDGAITTLCGGGRYNGLLELLDGPSETGIGFALSIERLLLALEEEGIELDIEENL --1111--------------3333------------------------------------ DLFIVTMGDQADRYAVKLLNHLRHNGIKADKDYLQRKIKGQMKQADRLGAKFTIVIGDQE ------------------------------------------------------------ LENNKIDVKNMTTGESETIELDALVEYFKK -------------------3333------- 
>PARA-NITROBENZYL ESTERASE; SWP:P37967; PDB:1QE3A; THQIVTTQYGKVKGTTENGVHKWKGIPYAKPPVGQWRFKAPEPPEVWEDVLDATAYGPIC ------1111------%%%%------------!!!!------------------------ PQPSLPRQSEDCLYVNVFAPDTPSQNLPVMVWIHGGAFYLGAGSEPLYDGSKLAAQGEVI -----------------------------------%%%%--11111111-----1111-- VVTLNYRLGPFGFLHLSSFDEAYSDNLGLLDQAAALKWVRENISAFGGDPDNVTVFGESA -------!!!!----33333333------------------3333---1111-------- GGMSIAALLAMPAAKGLFQKAIMESGASRTMTKEQAASTAAAFLQVLGINESQLDRLHTV ----------3333-----------------------------------11113333--- AAEDLLKAADQLRIAEKENIFQLFFQPALDPKTLPEEPEKSIAEGAASGIPLLIGTTRDE 3333-------1111---3333------------------------2222---------- GYLFFTPDSDVHSQETLDAALEYLLGKPLAEKAADLYPRSLESQIHMMTDLLFWRPAVAY 3333----------------------------3333------------------------ ASAQSHYAPVWMYRFDWHPEKPPYNKAFHALELPFVFGNLDGLITDEVKQLSHTIQSAWI ---------------------------2222--------3333----------------- TFAKTGNPSTEAVNWPAYHEETRETVILDSEITIENDPESEKRQKLF ---------1111-----3333------------------------- >PENTOSYLTRANSFERASE; SWP:P81989; PDB:1QE5A; PPLDDPATDPFLVARAAADHIAQATGVEGHDMALVLGSGWGGAAELLGEVVAEVPTHEIP -3333---3333------------------------2222-1111---------333322 GFSSVTRSIRVERADGSVRHALVLGSRTHLYEGKGVRAVVHGVRTAAATGAETLILTNGC 22----------1111------------3333--3333--------1111---------- GGLNQEWGAGTPVLLSDHINLTARSPLEGPTFVDLTDVYSPRLRELAHRVDPTLPEGVYA ---33332222----------------------------------3333-1111------ QFPGPHYETPAEVRMAGILGADLVGMSTTLEAIAARHCGLEVLGVSLVTNLAAGISPTPL -----------------------------------1111------------2222----- SHAEVIEAGQAAGPRISALLADIAKR ----------------------1111 >BACTERIOPHAGE T4 GENE PRO; SWP:P10927; PDB:1QEXA; MFIQEPKKLIDTGEIGNASTGDILFDGGNKINSDFNAIYNAFGDQRKMAVANGTGADGQI ----------3333----3333---------------------3333---%%%%------ IHATGYYQKHSITEYATPVKVGTRHDIDTSTVGVKVIIERGELGDCVEFINSNGSISVTN -1111-----3333-----2222-----3333---------2222-----1111--3333 PLTIQAIDSIKGVSGNLVVTSPYSKVTLRCISSDNSTSVWNYSIESMFGQKESPAEGTWN 
---------2222----------------------------------------------- ISTSGSVDIPLFHRTEYNMAKLLVTCQSVDGRKIKTAEINILVDTVNSEVISSEYAVMRV -3333-------1111-----------1111-------------1111------------ GNETEEDEIANIAFSIKENYVTATISSSTVGMRAAVKVIATQKIGVAQ ----------------%%%%---------------------------- ------------------------------- >INORGANIC PYROPHOSPHATASE; SWP:P50308; PDB:1QEZA; KLSPGKNAPDVVNVLVEIPQGSNIKYEYDDEEGVIKVDRVLYTSMNYPFNYGFIPGTLEE ----1111----------2222------------------------------------11 DGDPLDVLVITNYQLYPGSVIEVRPIGILYMKDEEGEDAKIVAVPKDKTDPSFSNIKDIN 11-------------2222-------------1111---------33331111----111 DLPQATKNKIVHFFEHYKELEPGKYVKISGWGSATEAKNRIQLAIKRVSG 1--------------1111-2222-------------------------- >ADENYLOSUCCINATE SYNTHETA; SWP:P12283; PDB:1QF5A; GNNVVVLGTQWGDEGKGKIVDLLTERAKYVVRYQGGHNAGHTLVINGEKTVLHLIPSGIL -----------------------1111-----------------iiii---------111 RENVTSIIGNGVVLSPAALMKEMKELEDRGIPVRERLLLSEACPLILDYHVALDNAREKA 1-------3333-------------3333--3333----1111---3333---------- RGAKAIGTTGRGIGPAYEDKVARRGLRVGDLFDKETFAEKLKEVMEYHNFQLVNYYKAEA -1111---------------------3333------------------------------ VDYQKVLDDTMAVADILTSMVVDVSDLLDQARQRGDFVMFEGAQGTLLDIDHGTYPYVTS ------------3333-1111-----------------------1111------------ SNTTAGGVATGSGLGPRYVDYVLGILKAYSTRVGAGPFPTELFDETGEFLCKQGNEFGAT --------1111--1111-------------------1111------------------- TGRRRRTGWLDTVAVRRAVQLNSLSGFCLTKLDVLDGLKEVKLCVAYRMPDGREVTTTPL -------------------1111-------33332222----------1111-------- AADDWKGVEPIYETMPGWSESTFGVKDRSGLPQAALNYIKRIEELTGVPIDIISTGPDRT 3333-----------------2222-3333-3333----------------------111 ETMILRDPFDA 1-----3333- >CASEIN KINASE II; SWP:P13862; PDB:1QF8A; VSWISWFCGLRGNEFFCEVDEDYIQDKFNLTGLNEQVPHYRQALDILDLEPDPNQSDLIE ---------2222------------33332222--------------------------- QAAELYGLIHARYILTNRGIAQLEKYQQGDFGYCPRVYCENQPLPIGLSDIPGEAVKLYC -----------1111-------------1111---1111-----------2222------ PKCDVYTPKSSRHHHTDGAYFGTGFPHLFVHPEYRPKRP 
---------3333---3333---3333---3333----- >URIDYLMONOPHOSPHATE/CYTID; SWP:P20425; PDB:1QF9A; MEKSKPNVVFVLGGPGSGKGTQCANIVRDFGWVHLSAGDLLRQEQQSGSKDGEMIATMIK -------------2222-----------------------------------------11 NGEIVPSIVTVKLLKNAIDANQGKNFLVDGFPRNEENNNSWEENMKDFVDTKFVLFFDCP 11------------------2222--------------------1111------------ EEVMTQRLLKRGESSGRSDDNIESIKKRFNTFNVQTKLVIDHYNKFDKVKIIPANRDVNE ----------------1111-----------------------1111------------- VYNDVENLFKSMGF ---------1111- >PURPLE ACID PHOSPHATASE; SWP:P29288; PDB:1QFCA; TAPASTLRFVAVGDWGGVPNAPFHTAREMANAKEIARTVQIMGADFIMSLGDNFYFTGVH ------------------------------------------------------------ DANDKRFQETFEDVFSDRALRNIPWYVLAGNHDHLGNVSAQIAYSKISKRWNFPSPYYRL ----3333--3333--3333---------3333--------------3333--------- RFKVPRSNITVAIFMLDTVMLCGNSVARTQLSWLKKQLAAAKEDYVLVAGHYPIWSIAEH ----------------3333------1111---------------------------333 GPTRCLVKNLRPLLAAYGVTAYLCGHDHNLQYLQDENGVGYVLSGAGNFMDPSVRHQRKV 3--------3333-1111-----------------------------------1111--- PNGYLRFHYGSEDSLGGFTYVEIGSKEMSITYVEASGKSLFKTSLPR 2222------3333-------------------3333---------- >PROTEIN (GELATION FACTOR); SWP:P13466; PDB:1QFHA; KPAPSAEHSYAEGEGLVKVFDNAPAEFTIFAVDTKGVARTDGGDPFEVAINGPDGLVVDA -----1111----1111---------------1111----------------iiii---- KVTDNNDGTYGVVYDAPVEGNYNVNVTLRGNPIKNMPIDVKCIEGANGEDSSFGSFTFTV ---------------------------iiii-2222----------3333---------- AAKNKKGEVKTYGGDKFEVSITGPAEEITLDAIDNQDGTYTAAYSLVGNGRFSTGVKLNG ---1111--------------------------------------------------iii KHIEGSPFKQVLGNPGKKNPEVKSFTTTRTAN i-2222-------3333-3333---------- >FLAVIN REDUCTASE; SWP:P23486; PDB:1QFJA; TTLSCKVTSVEAITDTVYRVRIVPDAAFSFRAGQYLMVVMDERDKRPFSMASTPDEKGFI ------------------------------2222-------------------------- ELHIGYAKAVMDRILKDHQIVVDIPHGEAWLRDDEERPMILIAGGTGFSYARSILLTALA ------------------------------------------------------------ RNPNRDITIYWGGREEQHLYDLCELEALSLKHPGLQVVPVVEQPEAGWRGRTGTVLTAVL 
-1111---------33331111---------1111------------------------1 QDHGTLAEHDIYIAGRFEMAKIARDLFCSERNAREDRLFGDAFAFI 111--1111------------------------3333--3333--- >SIALOADHESIN; SWP:Q62230; PDB:1QFOA; TWGVSSPKNVQGLSGSCLLIPCIFSYPADVPVGITAIWYYDYSGKRQVVIHSGDPKLVDK ------------2222----------1111--------------------3333111133 RFRGRAELMGNMDHKVCNLLLKDLKPEDSGTYNFRFEISSNRWLDVKGTTVTVTT 33--------3333----------3333--------------------------- >Antitermination protein N; SWP:P03045; PDB:1QFQB; DAQTRRRERRAEKQAQWKAANPLLVGVSAKPVNRP -33333333-----------3333----------- >FEMALE-SPECIFIC HISTAMINE; SWP:O77421; PDB:1QFTA; NQPDWADEAANGAHQDAWKSLKADVENVYYMVKATYKNDPVWGNDFTCVGVMANDVNEDE --111133333333-------------------------------------------111 KSIQAEFLFMNNADTNMQFATEKVTAVKMYGYNRENAFRYETEDGQVFTDVIAYSDDNCD 1---------------------------iiii---------1111--------------- VIYVPGTDGNEEGYELWTTDYDNIPANCLNKFNEYAVGRETRDVFTSACLEIAAA -------------------1111------------2222------3333------ >HEMAGGLUTININ (HA1 CHAIN); SWP:NA; PDB:1QFUH; QVQLQQPGAELVRPGASVKLSCKASGYTLTTYWMNWFKQRPDQGLEWIGRIDPYDSETHY ------------2222------------3333-------3333----------------- NQKFKDKAILTVDRSSSTAYMQLSSL 3333--------3333---------- >GONADOTROPIN ALPHA SUBUNI; SWP:NA; PDB:1QFWH; QLQQSGAELVKPGASVKLSCKASDYTFTSYWMHWVKQRPGQGLEWIGEINPTNGRTYYNE -------------------------------------2222------------------- KKATLTVAASASTAAMQASSLTSEDSAVYYCARRYGNSFDYWGQGTTVTVSS ---------------------3333--------------------------- >GONADOTROPIN ALPHA SUBUNI; SWP:NA; PDB:1QFWI; QVQLQESGGHLVKPGGSLKLSCAASGFAFSSFDMSWIRQTPEKRLEWVASITNVGTYTYY ------------2222-----------3333--------1111--------2222----- PGSVKGRFSISRDNARNTLNLQMSSLRSEDTALYFCARQGTAAQPYWYFDVWGAGTTVTV ---2222-------------------3333----------3333---------------- S - >Ig kappa chain V-III regi; SWP:P01660; PDB:1QFWL; DIELTQSPDSLAVSLGQRATISCRASEDSYGNSFMQWYQQKPGQPPKLLIYRASNLESGI -------------2222---------------------------------------2222 PARFSGTGSRTDFTLTINPVEADDVATYYCQQSDEYPYMYTFGGGTKLEIKR 
--------------------3333---------------------------- >GONADOTROPIN ALPHA SUBUNI; SWP:NA; PDB:1QFWM; DIELTQSPKSMSMSVGERVTLSCKASETVDSFVSWYQQKPEQSPKLLIFGASNRFSGVPD -------------2222-------------------------------------222233 RFTGSGSATDFTLTISSVQAEDFADYHCGQTYNHPYTFGGGTKLEIKR 33----------------1111-------------------------- >PH 2.5 ACID PHOSPHATASE; SWP:P34755; PDB:1QFXA; KQFSQEFRDGYSILKHYGGNGPYSERVSYGIARDPPTSCEVDQVIMVKRHGERYPSPSAG -------33333333--------------------2222--------------------- KDIEEALAKVYSITEYKGDLAFLNDWTYYVPNECYYNAETTSGPYAGLLDAYNHGNDYKA ----------------!!!!------------1111------1111-------------- RYGHLWNGETVVPFFSSGYGRVIETARKFGEGFFGYNYSTNAALNIISESEVMGADSLTP -3333----------------------------!!!!------------3333------- TCDTTTCDNLTYQLPQFKVAAARLNSQNPGMNLTASDVYNLMVMASFELNARPFSNWINA ----3333-----3333-----------------------------3333----3333-- FTQDEWVSFGYVEDLNYYYCAGPGDKNMAAVGAVYANASLTLLNQGPKEAGSLFFNFAHD ---------------------3333---------------------------------11 TNITPILAALGVLIPNEDLPLDRVAFGNPYSIGNIVPMGGHLTIERLSCQATALSDEGTY 11----------------------2222--3333--2222-------------------- VRLVLNEAVLPFNDCTSGPGYSCPLANYTSILNKNLPDYTTTCNVSASYPQYLSFWWNYN ----iiii---!!!!---%%%%------------------1111-1111----3333--- TTTELNYRSSPIACQEGDAMD --1111--------------- >FERREDOXIN:NADP+ REDUCTAS; SWP:P10933; PDB:1QFZA; QVTTEAPAKVVKHSKKQDENIVVNKFKPKEPYVGRCLLNTKITGDDAPGETWHMVFSTEG --------------------------3333-------------1111----------iii EVPYREGQSIGIVPDGIDKNGKPHKLRLYSIASSAIGDFGDSKTVSLCVKRLVYTNDAGE i---2222---------1111------------3333------------------1111- VVKGVCSNFLCDLKPGSEVKITGPVGKEMLMPKDPNATVIMLGTGTGIAPFRSFLWKMFF -------------2222---------1111---1111----------------------- EKHEDYQFNGLAWLFLGVPTSSSLLYKEEFEKMKEKAPENFRLDFAVSREQVNDKGEKMY --1111-------------11112222-------------------1111--1111---- IQTRMAQYAEELWELLKKDNTFVYMCGLKGMEKGIDDIMVSLAAKDGIDWIEYKRTLKKA ---3333-------1111---------3333------------1111-3333-------- EQWNVEVS -------- >INTEGRIN BETA-4 SUBUNIT; SWP:P16144; PDB:1QG3A; 
DLGAPQNPNAKAAGSRKIHFNWLPPSGKPMGYRVKYWIQGDSESEAHLLDSKVPSVELTN -------------------------------------22223333--------------- LYPYCDYEMKVCAYGAQGEGPYSSLVSCRTHQEVPSEPGRLAFNVVSSTVTQLSWAEPAE --------------3333------------------------------------------ TNGEITAYEVCYGLVNDDNRPIGPMKKVLVDNPKNRMLLIENLRESQPYRYTVKARNGAG ---------------1111------------1111--------2222---------1111 WGPEREAIINLATQP ---------3333-- >PROTEIN (SPORE COAT POLYS; SWP:P39621; PDB:1QG8A; PKVSVIMTSYNKSDYVAKSISSILSQTFSDFELFIMDDNSNEETLNVIRPFLNDNRVRFY -----------1111-------1111---------------------------1111--- QSDISGVKERTEKTRYAALINQAIEMAEGEYITYATDDNIYMPDRLLKMVRELDTHPEKA -----3333--------------1111--------------1111----------1111- VIYSASKTYHLNDIVKETVRPAAQVTWNAPCAIDHCSVMHRYSVLEKVKEKFGSYWDESP ----------------------------2222-1111---3333--------------11 AFYRIGDARFFWRVNHFYPFYPLDEELDLNYITEFVRNLPPQRNCRELRESLKKLGMG 11-------------------------------3333--------------------- >VIRUS CAPSID PROTEIN; SWP:NA; PDB:1QGC5; TTAYTASARGDLAHLTTTAARTLP ---------1111----------- >TRANSKETOLASE; SWP:P27302; PDB:1QGDA; SSRKELANAIRALSMDAVQKAKSGHPGAPMGMADIAEVLWRDFLKHNPQNPSWADRDRFV ----------------------------------------------1111--1111---- LSNGHGSMLIYSLLHLTGYDLPMEELKNFRQLHSKTPGHPEVGKTAGVETTTGPLGQGIA --3333---------------3333--2222---------22222222-----2222--- NAVGMAIAEKTLAAQFNRPGHDIVDHYTYAFMGDGCMMEGISHEVCSLAGTLKLGKLIAF -----------------2222--------------------------------1111--- YDDNGISIDGHVEGWFTDDTAMRFEAYGWHVIRDIDGHDAASIKRAVEEARAVTDKPSLL ------3333--------------1111-------1111----------1111------- MCKTIIGFGSPNKAGTHDSHGAPLGDAEIALTREQLGWKYAPFEIPSEIYAQWDAKEAGQ ----2222----22221111--------------------2222-3333----------- AKESAWNEKFAAYAKAYPQEAAEFTRRMKGEMPSDFDAKAKEFIAKLQANPAKIASRKAS --------------------------------1111------------------------ QNAIEAFGPLLPEFLGGSADLAPSNLTLWSGSKAINEDAAGNYIHYGVREFGMTAIANGI ------33331111------1111----1111-33331111------------------- SLHGGFLPYTSTFLMFVEYARNAVRMAALMKQRQVMVYTHDSIGLGEDGPTHQPVEQVAS 
3333--------33333333-------------------------33331111------- LRVTPNMSTWRPCDQVESAVAWKYGVERQDGPTALILSRQNLAQQERTEEQLANIARGGY ---2222---------------------------------------------3333---- VLKDCAGQPELIFIATGSEVELAVAAYEKLTAEGVKARVVSMPSTDAFDKQDAAYRESVL ---------------!!!!-----------1111-------------1111--------- PKAVTARVAVEAGIADYWYKYVGLNGAIVGMTTFGESAPAELLFEEFGFTVDNVVAKAKE 3333---------33333333-----------------------1111----------11 LL 11 >CHITOSANASE; SWP:P33673; PDB:1QGIA; ASPDDNFSPETLQFLRNNTGLDGEQWNNIMKLINKPEQDDLNWIKYYGYCEDIEDERGYT -3333------------------------------------33331111----------- IGLFGATTGGSRDTHPDGPDLFKAYDAAKGASNPSADGALKRLGINGKMKGSILEIKDSE ---------1111-------------1111----------------------------33 KVFCGKIKKLQNDAAWRKAMWETFYNVYIRYSVEQARQRGFTSAVTIGSFVDTALNQGAT 33-----1111-------------------------1111-------------------- GGSDTLQGLLARSGSSSNEKTFMKNFHAKRTLVVDTNKYNKPPNGKNRVKQWDTLVDMGK -1111----1111---------------33331111-----------------------1 MNLKNVDSEIAQVTDWEMK 111--------1111---- >PEROXIDASE N; SWP:Q39034; PDB:1QGJA; QLSPDIYAKSCPNLVQIVRKQVAIALKAEIRMAASLIRLHFHDCFVNGCDASLLLDGADS ----1111--1111----------------------------1111----3333--1111 EKLAIPNINSARGFEVIDTIKAAVENACPGVVSCADILTLAARDSVVLSGGPGWRVALGR 1111-----------------------2222--------------1111----------- KDGLVANQNSANNLPSPFEPLDAIIAKFVAVNLNITDVVALSGAHTFGQAKCAVFSNRLF ------33331111-1111---------1111------------------33333333-- NFTGAGNPDATLETSLLSNLQTVCPLGGNSNITAPLDRSTTDTFDNNYFKNLLEGKGLLS --------1111------------22221111-------1111---------------33 SDQILFSSDLAVNTTKKLVEAYSRSQSLFFRDFTCAMIRMGNISNGASGEVRTNCRVINN 33-----3333------------------------------------------1111--- >Importin subunit alpha-2; SWP:P52292; PDB:1QGKB; AARLHRFKNKGKDSTEMRRRRIEVNVELRKAKKDDQMLKRRNVS --3333--1111------------------------1111---- >CYSTATHIONINE GAMMA-SYNTH; SWP:Q9ZPL5; PDB:1QGNA; MKYASFLNSDGSVAIHAGERLGRGIVTDAITTPVVNTSAYFFNKTSELIDFKEKRRASFE ---1111--------2222------------------------3333---1111------ 
YGRYGNPTTVVLEEKISALEGAESTLLMASGMCASTVMLLALVPAGGHIVTTTDCYRKTR 1111-1111------------------------------------------1111----- IFIETILPKMGITATVIDPADVGALELALNQKKVNLFFTESPTNPFLRCVDIELVSKLCH -----3333--------3333-----------------------------3333-----1 EKGALVCIDGTFATPLNQKALALGADLVLHSATKFLGGHNDVLAGCISGPLKLVSEIRNL 111------3333-----3333--------33333333-----------3333------- HHILGGALNPNAAYLIIRGMKTLHLRVQQQNSTALRMAEILEAHPKVRHVYYPGLQSHPE --------3333-----1111----------------------1111----3333--111 HHIAKKQMTGFGGAVSFEVDGDLLTTAKFVDALKIPYIAPSFGGCESIVDQPAIMSYWDL 1--3333----------------------3333-----------------3333--3333 SQSDRAKYGIMDNLVRFSFGVEDFDDLKADILQALDSI 3333------3333--------3333-------3333- >ANAEROBIC COBALAMINE BIOS; SWP:Q05592; PDB:1QGOA; KKALLVVSFGTSYHDTCEKNIVACERDLAASCPDRDLFRAFTSGMIIRKLRQRDGIDIDT -------------------------------1111------------------------- PLQALQKLAAQGYQDVAIQSLHIINGDEYEKIVREVQLLRPLFTRLTLGVPLLSSHNDYV --------1111--------------------------3333------------3333-- QLMQALRQQMPSLRQTEKVVFMGHGASHHAFAAYACLDHMMTAQRFPARVGAVESYPEVD ----3333-----1111---------3333------------------------------ ILIDSLRDEGVTGVHLMPLMLVAGDHAINDMASDDGDSWKMRFNAAGIPATPWLSGLGEN ------1111------------------------1111-----------------3333- PAIRAMFVAHLHQALNM ----------------- >IMPORTIN BETA SUBUNIT; SWP:Q14974; PDB:1QGRA; MELITILEKTVSPDRLELEAAQKFLERAAVENLPTFLVELSRVLANPGNSQVARVAAGLQ ------1111-----------------------------------3333----------- IKNSLTSKDPDIKAQYQQRWLAIDANARREVKNYVLHTLGTETYRPSSASQCVAGIACAE -1111--------------11113333--------3333--------------------3 IPVNQWPELIPQLVANVTNPNSTEHMKESTLEAIGYICQDIDPEQLQDKSNEILTAIIQG 333--3333---------1111-------------------33331111----------- MRKEEPSNNVKLAATNALLNSLEFTKANFDKESERHFIMQVVCEATQCPDTRVRVAALQN -3333---------------33333333-------------------------------- LVKIMSLYYQYMETYMGPALFAITIEAMKSDIDEVALQGIEFWSNVCDEEMDLAIEASEA -------33333333-----------1111------------------------------ AEQGRPPEHTSKFYAKGALQYLVPILTQTLTKQDENDDDDDWNPCKAAGVCLMLLATCCE 
-----------------3333----------------1111------------------3 DDIVPHVLPFIKEHIKNPDWRYRDAAVMAFGCILEGPEPSQLKPLVIQAMPTLIELMKDP 333---------------------------1111---3333-------------3333-- SVVVRDTAAWTVGRICELLPEAAINDVYLAPLLQCLIEGLSAEPRVASNVCWAFSSLAEA ------------------3333--------------3333--3333-------------- AYEAADVADDQEEPATYCLSSSFELIVQKLLETTDRPDGHQNNLRSSAYESLMEIVKNSA ----------------1111---------------------------------------- KDCYPAVQKTTLVIMERLQQVLQMESHIQSTSDRIQFNDLQSLLCATLQNVLRKVQHQDA --------------------1111--------------------------1111-33331 LQISDVVMASLLRMFQSGGVQEDALMAVSTLVEVLGGEFLKYMEAFKPFLGIGLKNYAEY 111--------------3333--------------111111111111------------- QVCLAAVGLVGDLCRALQSNIIPFCDEVMQLLLENLGNENVHRSVKPQILSVFGDIALAI --------------------3333-----------------1111--------------- GGEFKKYLEVVLNTLQQASQAQVDKSDYDMVDYLNELRESCLEAYTGIVQGLKGDQENVH 33331111---------1111--------------------------------------3 PDVMLVQPRVEFILSFIDHIAGDEDHTDGVVACAAGLIGDLCTAFGKDVLKLVEARPMIH 333--1111--------------------------------------------------- ELLTEGRRSKTNKAKTLARWATKELRKLKNQA -------------------------------- >Precore/core protein [Fra; SWP:Q6TYG3; PDB:1QGTC; MDIDPYKEFGATVELLSFLPSDFFPSVRDLLDTASALYREALESPEHCSPHHTALRQAIL ---1111----333333333333------------------------------------- CWGELMTLATWVGNNLEDPASRDLVVNYVNTNMGLKIRQLLWFHISCLTFGRETVLEYLV -----------1111--------------------------------------------- SFGVWIRTPPAYRPPNAPILST ----11113333---------- >SPLICEOSOMAL PROTEIN U5-1; SWP:P83876; PDB:1QGVA; SYMLPHLHNGWQVDQAILSEEDRVVVIRFGHDWDPTCMKMDEVLYSIAEKVKNFAVIYLV ----------------3333----------1111--------------1111-------- DITEVPDFNKMYELYDPCTVMFFFRNKHIMIDLGINWAMEDKQEMVDIIETVYRGARKGR 3333-1111--%%%%--------iiii----------------------------1111- GLVVSPKDYST ----------- >CREATINE KINASE, B CHAIN; SWP:P05122; PDB:1QH4A; PFSNSHNLLKMKYSVDDEYPDLSVHNNHMAKVLTLDLYKKLRDRQTSSGFTLDDVIQTGV ---3333--11113333----1111-3333----------1111-1111------3333- DNPGHPFIMTVGCVAGDEESYEVFKELFDPVIEDRHGGYKPTDEHKTDLNADNLQGGDDL 
----------------3333---3333--------iiii1111------3333------- DPNYVLSSRVRTGRSIRGFCLPPHCSRGERRAIEKLSVEALGSLGGDLKGKYYALRNMTD 1111-----------2222---------------------1111!!!!-----3333--- AEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDNKTFLVWINEEDHLRVISMQ --------------------33331111----------3333------------------ KGGNMKEVFTRFCTGLTQIETLFKSKNYEFMWNPHLGYILTCPSNLGTGLRAGVHIKLPN -----------------------1111--------------3333-------------33 LGKHEKFGEVLKRLRLQKRGTGGVDTAAVGGVFDVSNADRLGFSEVELVQMVVDGVKLLI 33-1111----------------------------------------------------- EMEKRLEKGQSIDDLMPAQK -----1111--1111----- >HYDROXYACYLGLUTATHIONE HY; SWP:Q16775; PDB:1QH5A; MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHA -------------------------------3333--------------------11111 GGNEKLVKLESGLKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVS 111---------------1111-------2222---!!!!-------------------- KPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNL -----------!!!!-2222------------------3333-1111------------- KFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD ------1111-------------------------------33331111----------- PVTTMRAVRREKDQFKMPRD ----------3333------ >NITROGENASE MOLYBDENUM IR; SWP:P00466; PDB:1QH8A; TNATGERNLALIQEVLEVFPETARKERRKHMMVSDPKMKSVGKCIISNRKSQPGVMTVRG --------------3333----------------1111--!!!!-------2222----- CAYAGSKGVVFGPIKDMAHISHGPVGCGQYSRAGRRNYYTGVSGVDSFGTLNFTSDFQER ----------3333---------------------------2222--1111------333 DIVFGGDKKLSKLIEEMELLFPLTKGITIQSECPVGLIGDDISAVANASSKALDKPVIPV 3-------------------1111---------3333----------------------- RCEGFRGVSQSLGHHIANDVVRDWILNNREGQPFETTPYDVAIIGDYNIGGDAWASRILL --3333--3333------------11112222----1111-------2222--------- EEMGLRVVAQWSGDGTLVEMENTPFVKLNLVHCYRSMNYIARHMEEKHQIPWMEYNFFGP 1111-------2222------3333--------3333----------------------- TKIAESLRKIADQFDDTIRANAEAVIARYEGQMAAIIAKYRPRLEGRKVLLYMGGLRPRH ----------1111-------------------------33332222--------3333- 
VIGAYEDLGMEIIAAGYEFAHNDDYDRTLPDLKEGTLLFDDASSYELEAFVKALKPDLIG --------------------3333---3333-2222------------------------ SGIKEKYIFQKMGVPFRQMHSWDYSGPYHGYDGFAIFARDMDMTLNNPAWNELTAPWL -3333----1111-------%%%%------3333---------1111-1111--1111 >Nitrogenase molybdenum-ir; SWP:P09772; PDB:1QH8B; SQTIDKINSCYPLFEQDEYQELFRNKRQLEEAHDAQRVQEVFAWTTTAEYEALNFRREAL --3333-----1111-----------1111------------------------------ TVDPAKACQPLGAVLCSLGFANTLPYVHGSQGCVAYFRTYFNRHFKEPIACVSDSMTEDA -------3333-----1111---------3333-----------------------3333 AVFGGNNNMNLGLQNASALYKPEIIAVSTTCMAEVIGDDLQAFIANAKKDGFVDSSIAVP -----------------------------------------------1111--3333--- HAHTPSFIGSHVTGWDNMFEGFAKTFTADYQGQPGKLPKLNLVTGFETYLGNFRVLKRMM ----1111-3333-------------1111--2222------------3333-------- EQMAVPCSLLSDPSEVLDTPADGHYRMYSGGTTQQEMKEAPDAIDTLLLQPWQLLKSKKV ------------3333----------------------3333-------3333------- VQEMWNQPATEVAIPLGLAATDELLMTVSQLSGKPIADALTLERGRLVDMMLDSHTWLHG -----------------------------------------------------3333222 KKFGLYGDPDFVMGLTRFLLELGCEPTVILSHNANKRWQKAMNKMLDASPYGRDSEVFIN 2-----------------------------1111--------------1111-------- CDLWHFRSLMFTRQPDFMIGNSYGKFIQRDTLAKGKAFEVPLIRLGFPLFDRHHLHRQTT ------------------------------33333333---------------3333--- WGYEGAMNIVTTLVNAVLEKLDSDTSQLGKTDYSFDLVR ----------------------11112222-1111---- >HALOPEROXIDASE; SWP:Q8LLW7; PDB:1QHBA; GIPADNLQSRAKASFDTRVAAAELALARGAVPSFANGEELLYRNSETGDPSFIGSFTKGL -------------------------3333--------1111----------1111----- PHDDNGAIIDPDDFLAFVRAINSGDEKEIAALTLGPARDPETGLPIWRSDLANSLDLEVR --1111---3333---------------1111--------------------1111---- GWENSSAGLTFDLEGPDAQSVAMPPAPVLTSPELIAEMAELYLMALGRDIEFSEFDSPKN -----1111-------1111-------1111---------------11111111--3333 AAFIRSAIERLNGLEWFNTPAKLGDPPAEIRRRRGEVTVGNLFRGILPGSEVGPYLSQFI --------------3333---2222------------3333-------1111----3333 IVGSKQIGSATVGNKTLVSPNAADEFDGEIAYGSITISQRVRIATPGRDFMTDLKVFLDV -----------!!!!---11113333-----!!!!---------2222------------ 
QDAADFRGFESYEPGARLIRTIRDLATWVHFDSLYEAYLNACLILLANGVPFDPNLPFQQ -----2222-----------3333--1111--1111----------------1111---- EDKLDNQDVFVNFGSAHVLSLVTEVATRALKAVRYQKFNIHRRLRPEATGGLISVNKNAF 3333----------------------------------------3333-----------1 LKSESVFPEVDVLVEELSSILDDSASSNEKQNIADGDVSPGKSFLLPMAFAEGSPFHPSY 111---3333--------3333---------------------------1111------- GSGHAVVAGACVTILKAFFDANFQIDQVFEVDTDEDKLVKSSFPGPLTVAGELNKLADNV -------------------1111------------------------------------- AIGRNMAGVHYFSDQFESLLLGEQIAIGILEEQSLTYGENFFFNLPKFDGTTIQI ----1111--3333--------------------------------1111----- >VIRAL CAPSID VP6; SWP:P04509; PDB:1QHDA; MDVLYSLSKTLKDARDKIVEGTLYSNVSDLIQQFNQMIITMNGNEFQTGGIGNLPIRNWN ------------------22223333--------------2222------!!!!------ FDFGLLGTTLLNLDANYVETARNTIDYFVDFVDNVCMDEMVRESQRNGIAPQSDSLIKLS --------------------------------------1111----1111---------- GIKFKRINFDNSSEYIENWNLQNRRQRTGFTFHKPNIFPYSASFTLNRSQPAHDNLMGTM 33333333-------------1111------------------------3333------- WLNAGSEIQVAGFDYSCAINAPANTQQFEHIVQLRRVLTTATITLLPDAERFSFPRVITS -------------11111111%%%%-----------------------3333-------1 ADGATTWYFNPVILRPNNVEIEFLLNGQIINTYQARFGTIIARNFDTIRLSFQLMRPPNM 111---------------------iiii-------------------------------- TPAVAALFPNAQPFEHHATVGLTLRIESAVCESVLADASETMLANVTSVRQEYAIPVGPV 33333333------------------------------------------1111------ FPPGMNWTDLITNYSPSREDNLQRVFTVASIRSMLVK -2222-------------------------------- >PHOSPHOGLYCERATE MUTASE; SWP:P00950; PDB:1QHFA; PKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKKVYPDVLYTSKLSRA -------------------!!!!---------------------------------3333 IQTANIALEKADRLWIPVNRSWRLNERHYGDLQGKDKAETLKKFGEEKFNTYRRSFDVPP ------------1111----3333----!!!!---------------------------- PPIDASSPFSQKGDERYKYVDPNVLPETESLALVIDRLLPYWQDVIAKDLLSGKTVMIAA ---1111---22221111--3333-------------------------1111------- HGNSLRGLVKHLEGISDADIAKLNIPTGIPLVFELDENLKPSKPSYYLDPEAAAAGAAAV ---------------33331111------------1111--------------------- 
>RIBONUCLEASE HI; SWP:Q04740; PDB:1QHKA; GNFYAVRKGRETGIYNTWNECKNQVDGYGGAIYKKFNSYEQAKSFLG -----------------3333-------------------------- >CELL DIVISION PROTEIN MUK; SWP:P22523; PDB:1QHLA; RGKFRSLTLINWNGFFARTFDLDELVTTLSGGNGAGKSTTMAAFVTALIPDLTLLLHGKL --------------------------------------------------1111-1111- KAGVCYSMLDTINSRHQRVVVGVRLQQVAGRDRKVDIKPFAIQGLPMSVQPTQLVTETLN ------------1111-----------------------------3333----------- ERQARVLPLNELKDKLEAMEGVQFKQFNSITDYHSLMFDLGIIARRLRSASDRSKFYRLI ------------------2222------3333-----1111------------------- EASLYGGISSAITRSLRDYLLPEN -------3333------------- >ALPHA-AMYLASE; SWP:P19531; PDB:1QHOA; SSSASVKGDVIYQIIIDRFYDGDTTNNNPAKSYGLYDPTKSKWKMYWGGDLEGVRQKLPY 3333-1111-----3333----1111--3333----1111-1111--------------- LKQLGVTTIWLSPVLDNLDTLAGTDNTGYHGYWTRDFKQIEEHFGNWTTFDTLVNDAHQN ---------------------------3333----1111-3333-------------111 GIKVIVDFVPNHSTPFKANDSTFAEGGALYNNGTYMGNYFDDATKGYFHHNGDISNWDDR 1---------------1111--!!!!----iiii---11111111----------1111- YEAQWKNFTDPAGFSLADLSQENGTIAQYLTDAAVQLVAHGADGLRIDAVKHFNSGFSKS ---------3333------1111--------------1111-------1111-------- LADKLYQKKDIFLVGEWYGDDPGTANHLEKVRYANNSGVNVLDFDLNTVIRNVFGTFTQT --------------------2222------------------------------------ MYDLNNMVNQTGNEYKYKENLITFIDNHDMSRFLSVNSNKANLHQALAFILTSRGTPSIY ----------------1111-----------3333------------------------2 YGTEQYMAGGNDPYNRGMMPAFDTTTTAFKEVSTLAGLRRNNAAIQYGTTTQRWINNDVY 222--------------------------------------3333----------1111- IYERKFFNDVVLVAINRNTQSSYSISGLQTALPNGSYADYLSGLLGGNGISVSNGSVASF -----!!!!--------1111-----------------1111----------iiii---- TLAPGAVSVWQYSTSASAPQIGSVAPNMGIPGNVVTIDGKGFGTTQGTVTFGGVTATVKS --2222-----------------------2222-----------------iiii------ WTSNRIEVYVPNMAAGLTDVKVTAGGVSSNLYSYNILSGTQTSVVFTVKSAPPTNLGDKI -1111------------------iiii---------------------------2222-- YLTGNIPELGNWSTDTSGAVNNAQGPLLAPNYPDWFYVFSVPAGKTIQFKFFIKRADGTI -----3333%%%%-----------------------------------------1111-- 
QWENGSNHVATTPTGATGNITVTWQN -------------------------- >AURACYANIN; SWP:P94610; PDB:1QHQA; ANAPGGSNVVNETPAQTVEVRAAPDALAFAQTSLSLPANTVVRLDFVNQNNLGVQHNWVL -----1111--------------------------------------------------- VNGGDDVAAAVNTAAQNNADALFVPPPDTPNALAWTAMLNAGESGSVTFRTPAPGTYLYI -------------3333-1111---2222----------2222----------------- CTFPGHYLAGMKGTLTVTP --22221111--------- >DNA POLYMERASE; SWP:Q56366; PDB:1QHTA; MILDTDYITENGKPVIRVFKKENGEFKIEYDRTFEPYFYALLKDDSAIEDVKKVTAKRHG ---------iiii--------iiii------------------33333333------iii TVVKVKRAEKVQKKFLGRPIEVWKLYFNHPQDVPAIRDRIRAHPAVVDIYEYDIPFAKRY i-------------iiii----------3333----------1111-------------- LIDKGLIPMEGDEELTMLAFAIATLYHEGEEFGTGPILMISYADGSEARVITWKKIDLPY -1111---------------------22222222-----------------------111 VDVVSTEKEMIKRFLRVVREKDPDVLITYNGDNFDFAYLKKRCEELGIKFTLGRDGSEPK 1----------------------------3333-------------------1111---- IQRMGDRFAVEVKGRIHFDLYPVIRRTINLPTYTLEAVYEAVFGKPKEKVYAEEIAQAWE -----------2222--------------------------------------------- SGEGLERVARYSMEDAKVTYELGREFFPMEAQLSRLIGQSLWDVSRSSTGNLVEWFLLRK ---3333--------------------------------33331111------------- AYKRNELAPNKPDERELARRRGGYAGGYVKEPERGLWDNIVYLDFRSLYPSIIITHNVSP --------------3333-iiii---------------------1111-----1111-11 DTLNREGCKEYDVAPEVGHKFCKDFPGFIPSLLGDLLEERQKIKRKMKATVDPLEKKLLD 11---------------------------------------------------------- YRQRAIKILANSFYGYYGYAKARWYCKECAESVTAWGREYIEMVIRELEEKFGFKVLYAD -------------3333-1111-------------------------------------- TDGLHATIPGADAETVKKKAKEFLKYINLELEYEGFYVRGFFVTKKKYAVIDEEGKITTR -------22223333------------------------------------3333----- GLEIVRRDWSEIAKETQARVLEAILKHGDVEEAVRIVKEVTLVIHEQIATGPHVAVAKRL -----------------------------------------------------------3 AARGVKIRPGTVISYIVLKGDRAIPADEFEYYIENQVLPAVERILKAFGY 333---------------------3333-3333----3333-1111---- >ADENOVIRUS FIBRE; SWP:Q96590; PDB:1QHVA; AITIGNKNDDKLTLWTTPDPSPNCRIHSDNDCKFTLVLTKCGSQVLATVAALAVSGDLSS 
--------1111----------------------------!!!!------------3333 MTGTVASVSIFLRFDQNGVLMENSSLKKHYWNFRNGNSTNANPYTNAVGFMPNLLAYPKT --------------1111----------------!!!!--------3333---------- QSQTAKNNIVSQVYLHGDKTKPMILTITLNGTSESTETSEVSTYSMSFTWSWESGKYTTE ---3333------22221111--------!!!!---2222------------22221111 TFATNSYTFSYIAQE --------------- >PURPLE ACID PHOSPHATASE; SWP:P29288; PDB:1QHWA; STLRFVAVGDWGGVPNAPFHTAREMANAKEIARTVQIMGADFIMSLGDNFYFTGVHDAND --------------------------------------------------------1111 KRFQETFEDVFSDRALRNIPWYVLAGNHDHLGNVSAQIAYSKISKRWNFPSPYYRLRFKV ------1111--3333---------3333----------11113333------------- PRSNITVAIFMLDTVMLCGNSDDFVSQQPEMPRDLGVARTQLSWLKKQLAAAKEDYVLVA ------------3333---11113333--------------------------------- GHYPIWSIAEHGPTRCLVKNLRPLLAAYGVTAYLCGHDHNLQYLQDENGVGYVLSGAGNF --------3333--3333--3333-1111----------------1111----------- MDPSVRHQRKVPNGYLRFHYGSEDSLGGFTYVEIGSKEMSITYVEASGKSLFKTSLPRRP ----1111---2222------1111-------------------3333------------ >CHLORAMPHENICOL PHOSPHOTR; SWP:Q56148; PDB:1QHXA; MTTRMIILNGGSSAGKSGIVRCLQSVLPEPWLAFGVDSLIEAMPLKMQSAEGGIEFDADG ----------2222-----------------------------3333---------1111 GVSIGPEFRALEGAWAEGVVAMARAGARIIIDDVFLGGAAAQERWRSFVGDLDVLWVGVR ----------------------1111--------1111----------!!!!-------- CDGAVAEGRETARGDRVAGMAAKQAYVVHEGVEYDVEVDTTHKESIECAWAIAAHVVP -------------------------3333-----------------------1111-- >N-GLYCOSIDASE; SWP:P27559; PDB:1QI7A; VTSITLDLVNPTAGQYSSFVDKIRNNVKDPNLKYGGTDIAVIGPPSKEKFLRINFQSSRG ----------------------------1111-%%%%-------------------1111 TVSLGLKRDNLYVVAYLAMDNTNVNRAYYFKSEITSAELTALFPEATTANQKALEYTEDY -------------------1111------3333-3333----33333333---------- QSIEKNAQITQGDKSRKELGLGIDLLLTFMEAVNKKARVVKNEARFLLIAIQMTAEVARF ----1111--!!!!3333------------1111-------------------------- RYIQNLVTKNFPNKFDSDNKVIQFEVSWRKISTAIYGDAKNGVFNKDYDFGFGKVRQVKD -------1111---------------------------------------------3333 LQMGLLMYLGKPK ------------- >VANADIUM BROMOPEROXIDASE; 
SWP:P81701; PDB:1QI9A; TCSTSDDADDPTPPNERDDEAFASRVAAAKRELEGTGTVCQINNGETDLAAKFHKSLPHD ------------1111-------------------------------3333--2222--1 DLGQVDADAFAALEDCILNGDLSICEDVPVGNSEGDPVGRLVNPTAAFAIDISGPAFSAT 111-----------------33331111-------3333---1111---------1111- TIPPVPTLPSPELAAQLAEVYWMALARDVPFMQYGTDDITVTAAANLAGMEGFPNLDAVS ------1111---------------11111111----------------1111------- IGSDGTVDPLSQLFRATFVGVETGPFISQLLVNSFTIDSITVEPKQETFAPDVNYMVDFD -1111--3333------2222------1111-----%%%%-------------------- EWLNIQNGGPPAGPELLDDELRFVRNARDLARVTFTDNINTEAYRGALILLGLDAFNRAG ----1111------------------------3333--------------1111--1111 VNGPFIDIDRQAGFVNFGISHYFRLIGAAELAQRSSWYQKWQVHRFARPEALGGTLHLTI -!!!!------------------------------------------3333--------- KGELNADFDLSLLENAELLKRVAAINAAQNPNNEVTYLLPQAIQEGSPTHPSYPSGHATQ --------3333---3333----------2222---------1111-------------- NGAFATVLKALIGLDRGGDCYPDPVYPDDDGLKLIDFRGSCLTFEGEINKLAVNVAFGRQ --------------3333---------1111----------------------------3 MLGIHYRFDGIQGLLLGETITVRTLHQELMTFAEESTFEFRLFTGEVIKLFQDGTFTIDG 333--3333------------------3333----------1111-----1111---iii FKCPGLVYTGVENCV i--------3333-- >LACTOYLGLUTATHIONE LYASE; SWP:Q04760; PDB:1QIPA; GGLTDEAALSCCSDADPSTKDFLLQQTMLRVKDPKKSLDFYTRVLGMTLIQKCDFPIMKF --------1111---3333------------------------------------1111- SLYFLAYEDKNDIPKEKDEKIAWALSRKATLELTHNWGTEDDETQSYHNGNSDPRGFGHI --------1111----------1111---------2222--1111--------------- GIAVPDVYSACKRFEELGVKFVKKPDDGKMKGLAFIQDPDGYWIEILNPNKMATLM --------------1111-----1111----------1111------11111111- >HYDROXYNITRILE LYASE; SWP:P52704; PDB:1QJ4A; AFAHFVLIHTICHGAWIWHKLKPLLEALGHKVTALDLAASGVDPRQIEEIGSFDEYSEPL ---------2222--1111------1111-------2222-----3333--3333----- LTFLEALPPGEKVILVGESCGGLNIAIAADKYCEKIAAAVFHNSVLPDTEHCPSYVVDKL ---11112222--------------------3333----------------1111----- MEVFPDWKDTTYFTYTKDGKEITGLKLGFTLLRENLYTLCGPEEYELAKMLTRKGSLFQN ------!!!!------iiii----------------1111-------------------- 
ILAKRPFFTKEGYGSIKKIYVWTDQDEIFLPEFQLWQIENYKPDKVYKVEGGDHKLQLTK -1111---33331111-------------3333---------------------1111-- TKEIAEILQEVADTYN ---------------- >14-3-3 PROTEIN ZETA/DELTA; SWP:P29312; PDB:1QJBA; MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVVGARRSSWR --3333------------------------------------------------------ VVSSIEQKEKKQQMAREYREKIETELRDICNDVLSLLEKFLIPNASQAESKVFYLKMKGD ---3333---------------------------------3333---------------- YYRYLAEVAAGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYYEILNS ------------------------------------1111-------------------- PEKACSLAKTAFDEAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDT -----------------3333-3333---------------------- >PHOSPHOPANTETHEINE ADENYL; SWP:P23875; PDB:1QJCA; KRAIYPGTFDPITNGHIDIVTRATQMFDHVILAIAASPSKKPMFTLEERVALAQQATAHL ----------------------1111----------------------------1111-1 GNVEVVGFSDLMANFARNQHATVLIRGLRAVADFEYEMQLAHMNRHLMPELESVFLMPSK 111-------------1111--------1111---------------1111-------33 EWSFISSSLVKEVARHQGDVTHFLPENVHQALMAKLA 33-----------1111--3333-------------- >METALLOTHIONEIN; SWP:P04734; PDB:1QJKA; PDVKCVCCTEGKECACFGQDCCVTGECCKDGTCCGI -------------------3333-1111-------- ------------------------------------------------------------ -------------------- >OUTER MEMBRANE PROTEIN A; SWP:P02934; PDB:1QJPA; APKDNTWYTGAKLGWSQHENKLGAGAFGGYQVNPYVGFEMGYDWLGRMPYAYKAQGVQLT --------------------------------1111------------------------ AKLGYPITDDLDIYTRLGGMVWRADTYSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLE -------1111----------------------------------------1111----- YQWTNGMLSLGVSYRFG ----------------- >EPIDERMAL GROWTH FACTOR R; SWP:P42567; PDB:1QJTA; LSLTQLSSGNPVYEKYYRQVEAGNTGRVLALDAAAFLKKSGLPDLILGKIWDLADTDGKG ---3333--3333----------------3333-----------3333------------ VLSKQEFFVALRLVACAQNGLEVSLSSLSLAVPPPRFHD -----3333-------1111---3333------------ >PECTIN METHYLESTERASE; SWP:P07863; PDB:1QJVA; ATTYNAVVSKSSSDGKTFKTIADAIASAPAGSTPFVILIKNGVYNERLTITRNNLHLKGE ------------------------1111-------------------------------- 
SRNGAVIAAATAAGTLKSDGSKWGTAGSSTITISAKDFSAQSLTIRNDFDFPANQAKSDS 3333-------1111-1111----3333-------------------------1111111 DSSKIKDTQAVALYVTKSGDRAYFKDVSLVGYQDTLYVSGGRSFFSDCRISGTVDFIFGD 1--------------1111----------------------------------------- GTALFNNCDLVSRYRADVKSGNVSGYLTAPSTNINQKYGLVITNSRVIRESDSVPAKSYG --------------11111111----------1111--------------1111------ LGRPWHPTTTFSDGRYADPNAIGQTVFLNTSMDNHIYGWDKMSGKDKNGNTIWFNPEDSR ----------1111---1111-----------3333---------1111-----3333-- FFEYKSYGAGAAVSKDRRQLTDAQAAEYTQSKVLGDWTPTLP -------1111---------333311113333-!!!!----- >CELLOBIOHYDROLASE CEL6A (; SWP:P07987; PDB:1QJWA; ATYSGNPFVGVTPWANAYYASEVSSLAIPSLTGAMATAAAAVAKVPSFMWLDTLDKTPLM -----1111-----------------3333-----------1111-------3333---- EQTLADIRTANKNGGNYAGQFVVFDLPDRDCAALASNGEYSIADGGVAKYKNYIDTIRQI ----------1111---------------1111-------3333---------------- VVEYSDIRTLLVIEPDSLANLVTNLGTPKCANAQSAYLECINYAVTQLNLPNVAMYLDAG ---1111---------3333---11113333--------------11111111------- HAGWLGWPANQDPAAQLFANVYKNASSPRALRGLATNVANYNGWNITSPPSYTQGNAVYN 1111--3333------------1111-3333-----2222---------1111------- EKLYIHAIGPLLANHGWSNAFFITDQGRSGKQPTGQQQWGDWCNVIGTGFGIRPSANTGD ------------1111---------------------1111------------------1 SLLDSFVWVKPGGECDGTSDSSAPRFDSHCALPDALQPAPQAGAWFQAYFVQLLTNANPS 111----------------1111---3333-1111-----2222---------1111--- FL -- >CREATINE KINASE, UBIQUITO; SWP:P12532; PDB:1QK1A; AASERRRLYPPSAEYPDLRKHNNCMASHLTPAVYARLCDKTTPTGWTLDQCIQTGVDNPG --1111---3333----1111-3333----------1111-1111------3333----- HPFIKTVGMVAGDEETYEVFADLFDPVIQERHNGYDPRTMKHTTDLDASKIRSGYFDERY ------------3333-1111--------------1111-------3333---------- VLSSRVRTGRSIRGLSLPPACTRAERREVERVVVDALSGLKGDLAGRYYRLSEMTEAEQQ -----------2222---------------------1111!!!!-----3333------- QLIDDHFLFDKPVSPLLTAAGMARDWPDARGIWHNNEKSFLIWVNEEDHTRVISMEKGGN -------------33331111-22222222----1111---------------------- MKRVFERFCRGLKEVERLIQERGWEFMWNERLGYILTCPSNLGTGLRAGVHIKLPLLSKD 
-------------------1111--------------3333--------------33331 SRFPKILENLRLQKRGTGGVDTAATGGVFDISNLDRLGKSEVELVQLVIDGVNYLIDCER 111--------------------------------------------------------- RLERGQDIRIPTPVIHTKH 3333--------------- >HUWENTOXIN-I; SWP:P56676; PDB:1QK6A; ACKGVFDACTPGKNECCPNRVCSDKHKWCKWKL ---------2222---3333---1111------ >SELENOCOSMIA HUWENA LECTI; SWP:Q86C51; PDB:1QK7A; GCLGDKCDYNNGCCSGYVCSRTWKWCVLAGPW -------------2222--------------- >ERABUTOXIN A; SWP:P01435; PDB:1QKDA; RICFNHQSSQPQTTKTCSPGESSCYNKQWSDFRGTIIERGCGCPTVKPGIKLSCCESEVC ------!!!!-------2222---------3333------------2222---------- NN -- >C4-DICARBOXYLATE TRANSPOR; SWP:P13632; PDB:1QKKA; PSVFLIDDDRDLRKAMQQTLELAGFTVSSFASATEALAGLSADFAGIVISDIRMPGMDGL --------------------1111------------11111111---------------- ALFRKILALDPDLPMILVTGHGDIPMAVQAIQDGAYDFIAKPFAADRLVQSARRAEEKRR ---------1111------3333-------1111-------------------------- LVMENRSLRRAAEAASEGLK -------------------- >DNA-DIRECTED RNA POLYMERA; SWP:P41584; PDB:1QKLA; MSDNEDNFDGDDFDDVEEDEGLDDLENAEEEGQENVEILPSGERPQANQKRITTPYMTKY ------------------------------------------------------------ ERARVLGTRALQIAMCAPVMVELEGETDPLLIAMKELKARKIPIIIRRYLPDGSYEDWGV -------------------------------------------------3333------- DELIITD ------- >ESTROGEN RECEPTOR BETA; SWP:Q92731; PDB:1QKMA; LDALSPEQLVLTLLEAEPPHVLISRPASMMMSLTKLADKELVHMISWAKKIPGFVELSLF 3333---------1111---------------------------------2222------ DQVRLLESCWMEVLMMGLMWRSIDHPGKLIFAPDLVLDRDEGKCVEGILEIFDMLLATTS -------------------1111-2222---------33333333--------------- RFRELKLQHKEYLCVKAMILLNSSMYPLVADSSRKLAHLLNAVTDALVWVIAKSGISSQQ -------------------1111-3333-------------------------------- QSMRLANLLMLLSHVRHASNKGMEHLLNMKCKNVVPVYDLLLEMLNAHVL -------------------------11113333------------1111- >VINCULIN; SWP:P12003; PDB:1QKRA; KDEEFPEQKAGEAINQPAARQLHDEARKWSSKGNDIIAAAKRALLAESRLVRGGSGNKRA --------2222------------3333--2222-------------3333--3333--- LIQCAKDIAKASDEVTRLAKEVAKQCTDKRIRTNLLQVCERIPTISTQLKILSTVKATLG 
-------------------------------------3333------------------- RTNISDEESEQATELVHNAQNLQSVKETVREAEAASIKIRTDAGFTLRWVRK -----------------------------------------1111------- >CYTOCHROME CD1 NITRITE RE; SWP:P72181; PDB:1QKSA; DPAAALEDHKTRTDNRYEPSLDNLAQQDVAAPGAPEGVTALSDAQYNEANKIYFERCAGC ----------------------3333--------2222---------------------- HGVLRKGATGKALTPDLTRDLGFDYLQSFITYASPAGMPNWGTSGELSAEQVDLMANYLL -1111--------3333-----------3333-------------------------111 LDPAAPPEFGMKEMRESWKVHVAPEDRPTQQMNDWDLENLFSVTLRDAGQIALIDGSTYE 1---------------------3333---------3333------1111----------- IKTVLDTGYAVHISRLSASGRYLFVIGRDGKVNMIDLWMKEPTTVAEIKIGSEARSIETS ----------------1111------1111-----1111--------------------- KMEGWEDKYAIAGAYWPPQYVIMDGETLEPKKIQSTRGMTYDEQEYHPEPRVAAILASHY -2222------------------------------------------------------- RPEFIVNVKETGKILLVDYTDLNNLKTTEISAERFLHDGGLDGSHRYFITAANARNKLVV -----------------------------------------1111------3333----- IDTKEGKLVAIEDTGGQTPHPGRGANFVHPTFGPVWATSHMGDDSVALIGTDPEGHPDNA --------------------!!!!---------------------------33333333- WKILDSFPALGGGSLFIKTHPNSQYLYVDATLNPEAEISGSVAVFDIKAMTGDGSDPEFK -------------------1111------1111-3333-------3333----------- TLPIAEWAGITEGQPRVVQGEFNKDGTEVWFSVWNGKDQESALVVVDDKTLELKHVIKDE -----3333-------------1111---------1111-------------------11 RLVTPTGKFNVYNTMTDTY 11-----------1111-- >TOXIN 7 FROM PANDINUS IMP; SWP:P58490; PDB:1QKYA; DEAIRCTGTKDCYIPCRYITGCFNSRCINKSCKCYGCT -------3333----------------%%%%------- >ANTIBODY; SWP:Q54181; PDB:1QKZA; VTTYKLVINGKTLKGETTTKAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTEK ---------1111----------------------1111--------1111------- >Protein G'; SWP:Q54181; PDB:1QKZH; DVKLVESGGGLVKPGRSLKLSCAASGFTFSDYYMFWVRQTPEQRLEWVATISDGGAYTYY ------------2222-----------3333--------1111--------1111----- PDSVKGRFTISRDNAKNNLYLQMNSLKSEDTGMYYCARDPLEYYGMDYWGQGTSVAVSSA 3333----------------------3333------------------------------ KTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL 
---------------------------------------%%%%----------------- YTLSSSVNVTSSTWPSQSITCNVAHPASSTKVDKKIVPR ---------1111------------1111---------- >NUCLEASE; SWP:P13717; PDB:1QL0A; SIDNCAVGCPTGGSSKVSIVRHAYTLNNNSTTKFANWVAYHITKDTPASGKTRNWKTDPA --------------------3333------------------1111-----------111 LNPADTLAPADYTGANAALKVDRGHQAPLASLAGVSDWESLNYLSNITPQKSDLNQGAWA 11111--3333--3333----------33331111-------3333----3333------ RLEDQERKLIDRADISSVYTVTGPLYERDMGKLPGTQKAHTIPSAYWKVIFINNSPAVNH ------3333-1111-----------------1111------------------3333-- YAAFLFDQNTPKGADFCQFRVTVDEIEKRTGLIIWAGLPDDVQASLKSKPGVLPELMGCK ------111111113333----------------1111-------1111--3333----- N - >CYTOCHROME C552; SWP:P54820; PDB:1QL3A; ADPAAGEKVFGKCKACHKLDGNDGVGPHLNGVVGRTVAGVDGFNYSDPMKAHGGDWTPEA -3333-----------------------2222-------2222----------------- LQEFLTNPKAVVKGTKMAFAGLPKIEDRANLIAYLEGQQ ------1111-2222--------------------1111 >POSTSYNAPTIC DENSITY PROT; SWP:P31016; PDB:1QLCA; AEKVMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGRLQIGDKILA -----------1111-------------------------22223333------------ VNSVGLEDVMHEDAVAALKNTYDVVYLKVAKPSNA ----------------------------------- >PHOSPHOLIPASE A2; SWP:P82287; PDB:1QLLA; SLFELGKMILQETGKNPAKSYGAYGCNCGVLGRGKPKDATDRCCYVHKCCYKKLTGCNPK ---------------3333------------------3333---------1111---333 KDRYSYSWKDKTIVCGENNPCLKELCECDKAVAICLRENLGTYNKKYRYHLKPFCKKA 3-------%%%%--------------------------3333-3333---3333---- >METHENYLTETRAHYDROMETHANO; SWP:P94954; PDB:1QLMA; MVSVNENALPLVERMIERAELLNVEVQELENGTTVIDCGVEAAGGFEAGLLFSEVCMGGL --3333-------------1111-----1111------------------------%%%% ATVELTEFEHDGLCLPAVQVTTDHPAVSTLAAQKAGWQVQVGDYFAMGSGPARALALKPK ---------iiii----------3333-------------!!!!-----3333-----33 ETYEEIDYEDDADVAILCLESSELPDEDVAEHVADECGVDPENLYLLVAPTASIVGSVQV 33-------------------------------------3333----------------- SARVVETGLYKLLEVLEYDVTRVKYATGTAPIAPVADDDGEAMGRTNDCILYGGTVYLYV ------------------1111-------------------------------------- 
EGDDELPEVVEELPSEASEDYGKPFMKIFEEADYDFYKIDPGVFAPARVVVNDLSTGKTY --1111---111133331111--3333----%%%%11111111----------------- TAGEINVDVLKESFSL ---------------- >HERPES SIMPLEX VIRUS PROT; SWP:P03170; PDB:1QLOA; MSWALEMADTFLDNMRVGPRTYADVRDEINKRGR ---3333----------------3333------- >ALPHA-1-ANTITRYPSIN; SWP:P01009; PDB:1QLPA; FNKITPNLAEFAFSLYRQLAHQSNSTNIFFSPVSIATAFAMLSLGTKADTHDEILEGLNF -1111-----------------------------------3333-----------1111- NLTEIPEAQIHEGFQELLRTLNQPDSQLQLTTGNGLFLSEGLKLVDKFLEDVKKLYHSEA 1111-3333--------------------------------------------------- FTVNFGDTEEAKKQINDYVEKGTQGKIVDLVKELDRDTVFALVNYIFFKGKWERPFEVKD ---3333-------------1111----------1111------------------3333 TEEEDFHVDQVTTVKVPMMKRLGMFNIQHCKKLSSWVLLMKYLGNATAIFFLPDEGKLQH ------------------------------1111-------------------2222--- LENELTHDIITKFLENEDRRSASLHLPKLSITGTYDLKSVLGQLGITKVFSNGADLSGVT ------------3333-------------------------1111-33331111-1111- EEAPLKLSKAVHKAVLTIDEKGTEAAGAMFLEAIPMSIPPEVKFNKPFVFLMIEQNTKSP ------------------------------------------------------------ LFMGKVVNPTQK ------------ >S100C PROTEIN; SWP:P31950; PDB:1QLSA; PTETERCIESLIAIFQKHAGRDGNNTKISKTEFLIFMNTELAAFTQNQKDPGVLDRMMKK ----------------------------------------3333-----1111------- LDLDSDGQLDFQEFLNLIGGLAIACHDSFIKSTQK -1111------------------------------ >ESTERASE; SWP:Q7SIA5; PDB:1QLWA; VPKTPAGPLTLSGQGSFFVGGRDVTSETLSLSPKYDAHGTVTVDQMYVRYQIPQRAKRYP -------------------------------3333------------------------- ITLIHGCCLTGMTWETTPDGRMGWDEYFLRKGYSTYVIDQSGRGRSATDISAINAVKLGK -----------1111-1111-------------------2222------------1111- APASSLPDLFAAGHEAAWAIFRFGPRYPDAFKDTQFPVQAQAELWQQMVPDWLGSMPTPN -3333-------------------------1111--3333---3333----3333----- PTVANLSKLAIKLDGTVLLSHSQSGIYPFQTAAMNPKGITAIVSVEPGECPKPEDVKPLT ---------------------1111----------2222------------11113333- SIPVLVVFGDHIEEFPRWAPRLKACHAFIDALNAAGGKGQLMSLPALGVHGNSHMMMQDR -----------1111-----------------1111------3333--------3333-- NNLQVADLILDWIGRNTA ------------------ 
>METHIONINE ADENOSYLTRANSF; SWP:P13444; PDB:1QM4A; GAFMFTSESVGEGHPDKICDQISDAVLDAHLKQDPNAKVACETVCKTGMVLLCGEITSMA ----------1111---------------33331111----------------------- MIDYQRVVRDTIKHIGYDDSAKGFDFKTCNVLVALEQQSPEDVGAGDQGLMFGYATDETE -------------------1111-3333------------------------------33 ECMPLTIVLAHKLNTRMADLRRSGVLPWLRPDSKTQVTVQYVQDNGAVIPVRVHTIVISV 33-3333------------------1111------------------------------- QHNEDITLEAMREALKEQVIKAVVPAKYLDEDTIYHLQPSGRFVIGGPQGDAGVTGRKII ------3333--------3333--3333---------1111----------------333 VDTYGGWGAHGGGAFSGKDYTKVDRSAAYAARWVAKSLVKAGLCRRVLVQVSYAIGVAEP 3--iiii-------22221111-------------------------------------- LSISIFTYGTSKKTERELLEVVNKNFDLRPGVIVRDLDLKKPIYQKTACYGHFGRSEFPW ------------------------------------------3333--------111111 EVPKKLVF 11------ >R-CHII; SWP:NA; PDB:1QM7A; TMCYSHTTTSRAILTNCPGETNCYKKSRRHPPKMVLGRGCGCPTVAPGIKLNCCTTDKCN ----------------2222-----------------------------------2222- Y - >Alpha-1-antitrypsin [Prec; SWP:P01009; PDB:1QMBB; LEAIPRSIPPEVKFNAPFVFLMIEQNTKSPLFMGKVVNPTQK -------------------------------------1111- >PENICILLIN-BINDING PROTEI; SWP:P14677; PDB:1QMEA; TVPAKRGTIYDRNGVPIAEDATSPNRSYPNGQFASSFIGLAQLHENEDGSKSLLGTSGME ----------1111-------------1111--3333--------1111--------333 SSLNSILAGTDGRTMDGKDVYTTISSPLQSFMETQMDAFQEKVKGKYMTATLVSAKTGEI 3----------------------------------------------------------- LATTQRPTFDADTKEGITEDFVWRDILYQSNYEPGSTMKVMMLAAAIDNNTFPGGEVFNS -------------22221111---3333------3333--------1111--1111---- SELKIADATIRDWDVNEGLTGGRMMTFSQGFAHSSNVGMTLLEQKMGDATWLDYLNRFKF ------------------------------------------------------------ GVPTRFGLTDEYAGQLPADNIVNIAQSSFGQGISVTQTQMIRAFTAIANDGVMLEPKFIS -------------------3333--3333----------------1111----------- AIYDPNDQTARKSQKEIVGNPVSKDAASLTRTNMVLVGTDPVYGTMYNHSTGKPTVTVPG ---------------------------------------------------------222 QNVALKSGTAQIADEKNGGYLVGLTDYIFSAVSMSPAENPDFILYVTVQQPEHYSGIQLG 2------------1111------------------3333--------------------- 
EFANPILERASAMKDSLNLQQSPYPMPSVKDISPGDLAEELRRNLVQPIVVGTGTKIKNS ------------------------------------------------------------ SAEEGKNLAPNQQVLILSDKAEEVPDMYGWTKETAETLAKWLNIELEFQGSGSTVQKQDV --2222--2222--------------2222------------------------------ RANTAIKDIKKITLTLGD 2222-------------- >ACETOHYDROXY-ACID ISOMERO; SWP:Q01292; PDB:1QMGA; SATTFDFDSSVFKKEKVTLSGHDEYIVRGGRNLFPLLPDAFKGIKQIGVIGWGSQAPAQA ------------------iiii-------333311113333-------------3333-- QNLKDSLTEAKSDVVVKIGLRKGSNSFAEARAAGFSEENGTLGDMWETISGSDLVLLLIS --------------------1111------1111-3333---------1111-------- DSAQADNYEKVFSHMKPNSILGLSHGFLLGHLQSLGQDFPKNISVIAVCPKGMGPSVRRL -----------11112222------3333---1111----------------3333---- YVQGKEVNGAGINSSFAVHQDVDGRATDVALGWSIALGSPFTFATTLEQEYKSDIFGERG -3333-----------------------------1111---------------------- ILLGAVHGIVECLFRRYTESGMSEDLAYKNTVECITGVISKTISTKGMLALYNSLSEEGK -----------------1111------------------------------1111----- KDFQAAYSASYYPSMDILYECYEDVASGSEIRSVVLAGRRFYEKEGLPAFPMGKIDQTRM -------------------------------------------iiii------------- WKVGEKVRSVRPAGDLGPLYPFTAGVYVALMMAQIEILRKKGHSYSEIINESVIEAVDSL ------3333-2222-----------------------1111------------------ NPFMHARGVSFMVDNCSTTARLGSRKWAPRFDYILSQQALVAVDNGAPINQDLISNFLSD -------33331111---------------------------1111-------------- PVHEAIGVCAQLRPSVDISVTADADFVRPELRQA ---------1111-------1111---3333--- >RNA 3'-TERMINAL PHOSPHATE; SWP:P46849; PDB:1QMHA; MIALDGAQGEGGGQILRSALSLSMITGQPFTITSIRAGRAKPGLLRQHLTAVKAATEICG ----1111---3333--------------------1111--------------------- ATVEGAELGSQRLLFRPGTVRGGDYRFAIGSAGSCTLVLQTVLPALWFADGPSRVEVSGG ------2222-----------------------3333----33331111----------- TDNPSAPPADFIRRVLEPLLAKIGIHQQTTLLRHGFYPAGGGVVATEVSPVASFNTLQLG --1111--------------1111------------------------------------ ERGNIVQMRGEVLLAGVPRHVAEREIATLAGSFSLHEQNIHNLPRDQGPGNTVSLEVESE -----------------3333----------------------3333-----------11 NITERFFVVGEKRVSAEVVAAQLVKEVKRYLASTAAVGEYLADQLVLPMALAGAGEFTVA 
11------------3333-----------3333----------------1111------- HPSCHLLTNIAVVERFLPVRFSLIETDGVTRVSI ---------------------------------- >BETA-GALACTOSIDE-BINDING ; SWP:P23668; PDB:1QMJA; QGLVVTQLDVQPGECVKVKGKILSDAKGFSVNVGKDSSTLMLHFNPRFDCHGDVNTVVCN -----------------------------------1111----------iiii------- SKEDGTWGEEDRKADFPFQQGDKVEICISFDAAEVKVKVPEVEFEFPNRLGMEKIQYLAV --iiii------------2222-------------------------1111--------- EGDFKVKAIKFS ------------ >MANNOSE BINDING LECTIN, F; SWP:Q9ZTA9; PDB:1QMOA; AQSLSFSFTKFDPNQEDLIFQGHATSTNNVLQVTKLDSAGNPVSSSAGRVLYSAPLRLWE --------------1111--------%%%%------------------------------ DSAVLTSFDTIINFEISTPYTSRIADGLAFFIAPPDSVISYHGGFLGLFPNAN ---------------------------------1111----!!!!-------- >Mannose lectin; SWP:Q9ZTA9; PDB:1QMOE; SNVVAVEFDTYLNPDYGDPNYIHIGIDVNSIRSKVTAKWDWQNGKIATAHISYNSVSKRL ------------3333-------------------------------------------- SVTSYYAGSKPATLSYDIELHTVLPEWVRVGLSASTGQDKERNTVHSWSFTSSLWTN --------------------------------------------------------- >HUMAN THIOREDOXIN PEROXID; SWP:P32119; PDB:1QMVA; SGNARIGKPAPDFKATAVVDGAFKEVKLSDYKGKYVVLFFYPLDFTFVPTEIIAFSNRAE !!!!2222----------iiii----33332222-----------------------333 DFRKLGCEVLGVSVDSQFTHLAWINTPRKEGGLGPLNIPLLADVTRRLSEDYGVLKTDEG 33333---------------------3333------------1111----------1111 IAYRGLFIIDGKGVLRQITVNDLPVGRSVDEALRLVQAFQYTDEHGEVCPAGWKPGSDTI ---------1111--------1111------------------------22222222--- KPNVDDSKEYFSKHN --3333--------- >PROTEASE; SWP:P03305; PDB:1QMYA; MELTLYNGEKKTFYSRPNNHDNAWLNAILQLFRYVEEPFFDWVYSSPENLTLEAIKQLED ----1111-------------------------------3333----------------- LTGLELHEGGPPALVIWNIKHLLHTGIGTASRPSEVCVVDGTDMSLADFHAGIFLKGQEH ------------------3333------3333------------3333------------ AVFACVTSNGWYAIDDEDFYPWTPDPSDVLVFVPYD ------1111----!!!!------3333-------- >CYTOCHROME CH; SWP:Q7SIA4; PDB:1QN2A; EGDAAAGEKAFAPCKACHNFEKNGVGPTLKGVVGAKAGEGADGYAFSDALKKSGLTWDQA ----------3333--------------2222---------------------------- 
DLKQWLADPKKKVPGTKMVFPGISDPKKVDDIIAYLKTK --------33332222----------------------- >TRANSCRIPTION INITIATION ; SWP:P28147; PDB:1QNAA; HPSGIVPTLQNIVSTVNLDCKLDLKAIALQARNAEYNPKRFAAVIMRIREPKTTALIFAS 3333--------------------------------3333-----------------333 GKMVCTGAKSEDFSKMAARKYARIVQKLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYS 3------------------------1111------------------------------- HAAFSSYEPELFPGLIYRMKVPKIVLLIFVSGKIVITGAKMRDETYKAFENIYPVLSEFR 3333---3333-----------------3333------------------------1111 KI -- >CYCLOPHILIN; SWP:Q25756; PDB:1QNGA; SKRSKVFFDISIDNSNAGRIIFELFSDITPRTCENFRALCTGEKIGSRGKNLHYKNSIFH -----------%%%%---------1111-----------------1111----2222--- RIIPQFMCQGGDITNGNGSGGESIYGRSFTDENFNMKHDQPGLLSMANAGPNTNSSQFFI ----------------------1111---------------------------------- TLVPCPWLDGKHVVFGKVIEGMNVVREMEKEGAKSGYVKRSVVITDCGEL ----1111--------------------11113333-------------- >NITROUS-OXIDE REDUCTASE; SWP:Q7SIA3; PDB:1QNIA; AHVAPGELDEYYGFWSGGHQGEVRVLGVPSMRELMRIPVFNVDSATGWGITNESKEILGG ---2222---------!!!!--------------------------2222---------- DQQYLNGDCHHPHISMTDGRYDGKYLFINDKANTRVARIRLDIMKTDKITHIPNVQAIHG ----------------iiii---------------------------------------- LRLQKVPKTNYVFCNAEFVIPQPNDGTDFSLDNSYTMFTAIDAETMDVAWQVIVDGNLDN -----------------------------3333--------------------------- TDADYTGKYATSTCYNSERAVDLAGTMRNDRDWVVVFNVERIAAAVKAGNFKTIGDSKVP ---------------1111--3333----------------------------!!!!--- VVDGRGESEFTRYIPVPKNPHGLNTSPDGKYFIANGKLSPTVSVIAIDKLDDLFEDKIEL -------3333--------------3333------!!!!------3333-3333----11 RDTIVAEPELGLGPLHTTFDGRGNAYTTLFIDSQVCKWNIADAIKHYNGDRVNYIRQKLD 11-------------------------------------------1111----------- VQYQPGHNHASLTESRDADGKWLVVLSKFSKDRFLPVGPLHPENDQLIDISGEEMKLVHD ---------2222----------------!!!!--------------------------- GPTYAEPHDCILVRRDQIKTKKIYERNDPYFASCRAQAEKDGVTLESDNKVIRDGNKVRV -------------1111-------1111----------1111-3333------!!!!--- YMTSVAPQYGMTDFKVKEGDEVTVYITNLDMVEDVTHGFCMVNHGVSMEISPQQTASVTF 
----------------2222-----------2222-----2222------2222------ TAGKPGVYWYYCNWFCHALHMEMVGRMLVEAA ----------------1111------------ >GROB[5-73]; SWP:AAC03540; PDB:1QNKA; TELRCQCLQTLQGIHLKNIQSVKVKSPGPHCAQTEVIATLKNGQKACLNPASPMVKKIIE ---------------------------------------1111--------1111---33 KMLKNGKSN 33------- >ENDO-1,4-B-D-MANNANASE; SWP:Q99036; PDB:1QNRA; ASSFVTISGTQFNIDGKVGYFAGTNCYWCSFLTNHADVDSTFSHISSSGLKVVRVWGFND -------------iiii--------1111----3333--------1111----------- VNTQPSPGQIWFQKLSATGSTINTGADGLQTLDYVVQSAEQHNLKLIIPFVNNWSDYGGI -----2222------1111-----1111------------------------------33 NAYVNAFGGNATTWYTNTAAQTQYRKYVQAVVSRYANSTAIFAWELGNEPRCNGCSTDVI 33-------1111-----------------33331111-------------2222----- VQWATSVSQYVKSLDSNHLVTLGDEGLGLSTGDGAYPYTYGEGTDFAKNVQIKSLDFGTF ----------3333--------------------3333-------------1111----- HLYPDSWGTNYTWGNGWIQTHAAACLAAGKPCVFEEYGAQQNPCTNEAPWQTTSLTTRGM --3333---3333------------1111------------3333-----------2222 GGDMFWQWGDTFANGAQSNSDPYTVWYNSSNWQCLVKNHVDAIN -----------1111-----1111-2222--------------- >METHYLATED-DNA--PROTEIN-C; SWP:P16455; PDB:1QNTA; EMKRTTLDSPLGKLELSGCEQGLHEIKLLGKDAVEVPAPAAVLGGPEPLMQCTAWLNAYF --------1111------1111-----------------------3333----------- HQPEAIEEFPVPALHHPVFQQESFTRQVLWKLLKVVKFGEVISYQQLAALAGNPKAARAV -33331111------3333-----------------2222--------11111111---- GGAMRGNPVPILIPCHRVVCSSGAVGNYSGGLAVKEWLLAHEGHRL -3333--------3333--1111----1111--------1111--- >CHITIN BINDING LECTIN, UE; SWP:Q9FVF8; PDB:1QNWA; SDDLSFNFDKFVPNQKNIIFQGDASVSTTGVLQVTKVSTTTSIGRALYAAPIQIWDSITG --------------1111--------1111------------------------------ KVASFATSFSFVVKADKSDGVDGLAFFLAPANSQIPSGSSAGMFGLFSSSDSKSSNQIIA -----------------------------------22223333---------1111---- VEFDTYFGKAYNPWDPDFKHIGIDVNSIKSIKTVKWDWRNGEVADVVITYRAPTKSLTVC -------33333333-----------------------2222---------1111----- LSYPSDGTSNIITASVDLKAILPEWVSVGFSGGVGNAAEFETHDVLSWYFTSNLE ----------------3333---------------3333---------------- >VES V 5; SWP:Q05110; 
PDB:1QNXA; NYCKIKCLKGGVHTACKYGSLKPNCGNKVVVSYGLTKQEKQDILKEHNDFRQKIARGLET 3333--1111--1111-------------------------------------1111--- RGNPGPQPPAKNMKNLVWNDELAYVAQVWANQCQYGHDTCRDVAKYQVGQNVALTGSTAA ------------------------------------------3333-------------- KYDDPVKLVKMWEDEVKDYNPKKKFSGNDFLKTGHYTQMVWANTKEVGCGSIKYIQEKWH ---3333------------11113333-3333--------3333-----------%%%%- KHYLVCNYGPSGNFKNEELYQTK -------------1111------ >Ig heavy chain V region 1; SWP:P01750; PDB:1QNZH; QVQLQQSGAELVKPGASVKMSCKASGYTFTTYPIEWMKQNHGKSLEWIGNFHPYSDDTNY ------------2222------------3333---------------------------- NEKFKGKAKLTVEKSSSTVYLEFSRLTSDDSAVYYCAIHYGSAYAMDYWGQGTSVTVSS 3333--------3333------------------------------------------- >Aliphatic amidase regulat; SWP:P10932; PDB:1QO0D; SANSLLGSLRELQVLVLNPPGEVSDALVLQLIRIGCSVRQCWPPPEAFDVPVDVVFTSIF -------1111------------------------------------------------- QNRHHDEIAALLAAGTPRTTLVALVEYESPAVLSQIIELECHGVITQPLDAHRVLPVLVS ---------------1111------------------------------3333------- ARRISEEMAKLKQKTEQLQDRIAGQARINQAKVLLMQRHGWDEREAHQHLSREAMKRREP -----------------------------------------------------------3 ILKIAQELL 333------ >N-((5-PHOSPHORIBOSYL)-FOR; SWP:Q9X0C7; PDB:1QO2A; LVVPAIDLFRGKVARIKGRKENTIFYEKDPVELVEKLIEEGFTLIHVVDLSNAIENSGEN --------iiii---%%%%--------------------------------------111 LPVLEKLSEFAEHIQIGGGIRSLDYAEKLRKLGYRRQIVSSKVLEDPSFLKSLREIDVEP 1-----33331111-------------------------3333----------1111--- VFSLDTRGGRVAFKGWLAEEEIDPVSLLKRLKEYGLEEIVHTEIEKDGTLQEHDFSLTKK ------iiii--1111-------------3333---------1111-------3333--- IAIEAEVKVLAAGGISSENSLKTAQKVHTETNGLLKGVIVGRAFLEGILTVEVKRYAR --1111----------3333--------1111--------3333-----33331111- >T-cell surface glycoprote; SWP:P20937; PDB:1QO3C; STVLDSLQHKVYWFCYGMKCYYFVMDRKTWSGCKQTCQSSSLSLLKIDDEDELKFLQLVV -1111--------------------------------1111------------------- PSDSCWVGLSYDNKKKDWAWIDNRPSKLALNTRKYNIRDGGCMLLSKTRLDNGNCDQVFI -------------------1111------------3333------1111----1111--- CICGKRLD 
-------- >EPOXIDE HYDROLASE; SWP:Q9UR30; PDB:1QO7A; KAFAKFPSSASISPNPFTVSIPDEQLDDLKTLVRLSKIAPPTYESLQADGRFGITSEWLT 2222--1111-----------3333----------------3333-1111---------- TMREKWLSEFDWRPFEARLNSFPQFTTEIEGLTIHFAALFSEREDAVPIALLHGWPGSFV -----------------1111-------iiii----------1111-----------333 EFYPILQLFREEYTPETLPFHLVVPSLPGYTFSSGPPLDKDFGLMDNARVVDQLMKDLGF 3------------3333---------2222-------------------------11111 GSGYIIQGGDIGSFVGRLLGVGFDACKAVHLNLCAMRAPPEGPSIESLSAAEKEGIARME 111------3333---------3333-----------------3333------------- KFMTDGLAYAMEHSTRPSTIGHVLSSSPIALLAWIGEKYLQWVDKPLPSETILEMVSLYW ----------------------1111---------------------3333--------1 LTESFPRAIHTYRETTPMLQKELYIHKPFGFSFFPKDLCPVPRSWIATTGNLVFFRDHAE 111-1111---------1111------------1111----3333-1111---------- GGHFAALERPRELKTDLTAFVEQVW ---3333------------------ >FLAVOCYTOCHROME C3 FUMARA; SWP:Q9Z4P0; PDB:1QO8A; TPDMGSFHADMGSCQSCHAKPIKVTDSETHENAQCKSCHGEYAELANDKLQFDPHNSHLG ------------3333--------1111------------3333--------11113333 DINCTSCHKGHEEPKFYCNECHSFDIKPMPFSDAKKKKSWDDGWDQDKIQKAIAAGPSET --1111---------3333-----------1111------------------3333---- TQVLVVGAGSAGFNASLAAKKAGANVILVDKAPFSGGNSMISAGGMNAVGTKQQTAHGVE -------------------3333--------------1111------------------- DKVEWFIEDAMKGGRQQNDIKLVTILAEQSADGVQWLESLGANLDDLKRSGGARVDRTHR -3333------1111----------------------1111--------2222------- PHGGKSSGPEIIDTLRKAAKEQGIDTRLNSRVVKLVVNDDHSVVGAVVHGKHTGYYMIGA -------------------1111------------------------------------- KSVVLATGGYGMNKEMIAYYRPTMKDMTSSNNITATGDGVLMAKEIGASMTDIDWVQAHP ---------1111-------1111-------1111--------1111----1111----- TVGKDSRILISETVRGVGAVMVNKDGNRFISELTTRDKASDAILKQPGQFAWIIFDNQLY ----------33331111----3333----1111-----------2222----------- KKAKMVRGYDHLEMLYKGDTVEQLAKSTGMKVADLAKTVSDYNGYVASGKDTAFGRADMP --------------------------------------------------3333------ LNMTQSPYYAVKVAPGIHHTMGGVAINTTASVLDLQSKPIDGLFAAGEVTGGVHGYNRLG --------------------------1111---1111--2222---3333---!!!!-22 
GNAIADTVVFGRIAGDNAAKHALD 22---------------------- >SNUCYP-20; SWP:O43447; PDB:1QOIA; NSSPVNPVVFFDVSIGGQEVGRMKIELFADVVPKTAENFRQFCTGEFRKDGVPIGYKGST --------------iiii---------------------1111-----iiii---2222- FHRVIKDFMIQGGDFVNGDGTGVASIYRGPFADENFKLRHSAPGLLSMANSGPSTNGCQF ----2222----------------1111-------------------------------- FITCSKCDWLDGKHVVFGKIIDGLLVMRKIENVPTGPNNKPKLPVVISQCGEM ------1111-------------------1111--2222-------------- >MFE-23 RECOMBINANT ANTIBO; SWP:NA; PDB:1QOKA; QVKLQQSGAELVRSGTSVKLSCTASGFNIKDSYMHWLRQGPEQGLEWIGWIDPENGDTEY ------------2222-----------------------1111----------------- APKFQGKATFTTDTSSNTAYLQLSSLTSEDTAVYYCNEGTPTGPYYFDYWGQGTTVTVSS 3333---------1111---------3333------------------------------ GENVLTQSPAIMSASPGEKVTITCSASSSVSYMHWFQQKPGTSPKLWIYSTSNLASGVPA --------------2222--------------------2222------------222233 RFSGSGSGTSYSLTISRMEAEDAATYYCQQRSSYPLTFGAGTKLELK 33----------------1111------------------------- >TRYPTOPHAN SYNTHASE ALPHA; SWP:P00929; PDB:1QOPA; MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVPFSDPLAD -----------1111--------2222-------------1111---------------- GPTIQNANLRAFAAGVTPAQCFEMLAIIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ -----------1111--------------------------3333--------------- VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYLLSRS -------11113333--------1111--------------------------------- GVTGAENRGPLHHLIEKLKEYHAAPALQGFGISSPEQVSAAVRAGAAGAISGSAIVKIIE ---3333-----------1111-----------3333----------------------- KNLASPKQMLAELRSFVSAMKAASR -1111----------------1111 >TRYPTOPHAN SYNTHASE ALPHA; SWP:P00933; PDB:1QOPB; TTLLNPYFGEFGGMYVPQILMPALNQLEEAFVSAQKDPEFQAQFADLLKNYAGRPTALTK -------!!!!-----3333---------------------------------------- CQNITAGTRTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKSEIIAETGAGQHGVASAL -3333----------33332222--------------1111------------------- ASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSY ----------------------------1111-------!!!!---------------33 
ETAHYMLGTAAGPHPYPTIVREFQRMIGEETKAQILDKEGRLPDAVIACVGGGSNAIGMF 33---------------------------------------------------------3 ADFINDTSVGLIGVEPGGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTADGQIEESYSIS 333--1111-------!!!!1111---3333------iiii------1111--------1 AGLDFPSVGPQHAYLNSIGRADYVSITDDEALEAFKTLCRHEGIIPALESSHALAHALKM 111--------------------------------------------3333--------- MREQPEKEQLLVVNLSGRGDKDIFTVHDIL 3333--------------3333-------- >QUINONE OXIDOREDUCTASE; SWP:P28304; PDB:1QORA; ATRIEFHKHGGPEVLQAVEFTPADPAENEIQVENKAIGINFIDTYIRSGLYPPPSLPSGL -------------------------1111----------3333----------------- GTEAAGIVSKVGSGVKHIKAGDRVVYAQSALGAYSSVHNIIADKAAILPAAISFEQAAAS -----------1111---2222------------------3333----33333333---- FLKGLTVYYLLRKTYEIKPDEQFLFHAAAGGVGLIACQWAKALGAKLIGTVGTAQKAQSA -----------------2222-----1111------------------------------ LKAGAWQVINYREEDLVERLKEITGGKKVRVVYDSVGRDTWERSLDCLQRRGLMVSFGNS ---------1111--------1111-----------3333----11112222------11 SGAVTGVNLGILNQKGSLYVTRPSLQGYITTREELTEASNELFSLIASGVIKVDVAEQQK 11-----3333-1111-------3333------------------1111------3333- YPLKDAQRAHEILESRATQGSSLLIP -3333--------------------- >CEN; SWP:Q41261; PDB:1QOUA; GRVIGDVVDHFTSTVKMSVIYNSIKHVYNGHELFPSAVTSTPRVEVHGGDMRSFFTLIMT 1111-----------------------2222--3333----------------------- DPDVPGPSDPYLREHLHWIVTDIPGTTDSSFGKEVVSYEMPRPNIGIHRFVFLLFKQKKR -----33331111--------------3333----------------------------- GVVCRDGFNTRKFTQENELGLPVAAVFFNCQRET --------------1111---------------- >BETA-GLUCOSIDASE; SWP:Q03506; PDB:1QOXA; SIHMFPSDFKWGVATAAYQIEGAYNEDGRGMSIWDTFAHTPGKVKNGDNGNVACDSYHRV -----1111-------3333-----%%%%----------22222222----!!!!1111- EEDVQLLKDLGVKVYRFSISWPRVLPQGTGEVNRAGLDYYHRLVDELLANGIEPFCTLYH -------------------3333-1111-------------------1111--------- WDLPQALQDQGGWGSRITIDAFAEYAELMFKELGGKIKQWITFNEPWCMAFLSNYLGVHA ---3333---!!!!3333------------------------------------------ PGNKDLQLAIDVSHHLLVAHGRAVTLFRELGISGEIGIAPNTSWAVPYRRTKEDMEACLR 
------------------------------------------------------------ VNGWSGDWYLDPIYFGEYPKFMLDWYENLGYKPPIVDGDMELIHQPIDFIGINYYTSSMN -1111-------------3333----1111-----2222--------------------- RYNPGEAGGMLSSEAISMGAPKTDIGWEIYAEGLYDLLRYTADKYGNPTLYITENGACYN ----33331111----------1111---3333--------------------------- DGLSLDGRIHDQRRIDYLAMHLIQASRAIEDGINLKGYMEWSLMDNFEWAEGYGMRFGLV ---3333---3333--------------1111---------------!!!!--------- HVDYDTLVRTPKDSFYWYKGVISRGWLDL ----------------------------- >HEMOLYSIN E; SWP:P77335; PDB:1QOYA; IVADKTVEVVKNAIETADGALDLYNKYLDQVIPWQTFDETIKELSRFKQEYSQAASVLVG --------------------------3333----------11112222------------ DIKTLLMDSQDKYFEATQTVYEWCGVATQLLAAYILLFDEYNEKKASAQKDILIKVLDDG -----------------------------------3333--------------------- ITKLNEAQKSLLVSSQSFNNASGKLLALDSQLTNDFSEKSSYFQSQVDKIRKEAYAGAAA ------------------------------------1111--------------1111-- GVVVGPFGLIISYSIAAGVVEGKLIPELKNKLKSVQNFFTTLSNTVKQANKDIDAAKLKL -----iiii--------------------------------------------------- TTEIAAIGEIKTETETTRFYVDYDDLMLSLLKEAAKKMINTCNEYQKRHGKKTLF ------------3333--------------------------------------- >ACETYL XYLAN ESTERASE; SWP:Q99034; PDB:1QOZA; CPAIHVFGARETTVSQGYGSSATVVNLVIQAHPGTTSEAIVYPACGGQASCGGISYANSV ----------2222---!!!!-------3333---------------3333--------- VNGTNAAAAAINNFHNSCPDTQLVLVGYSQGAQIFDNALCGGGDPGEGITNTAVPLTAGA -----------------1111----------------------3333------------- VSAVKAAIFMGDPRNIHGLPYNVGTCTTQGFDARPAGFVCPSASKIKSYCDAADPYCCTG -----------1111---1111-------------------1111-----1111------ NDPNVHQGYGQEYGQQALAFINSQLS -3333---3333-------------- >PSAE PROTEIN; SWP:Q9WWP1; PDB:1QP2A; MVQRGSKVRILRPESYWFQDVGTVASVDQSGIKYPVIVRFEKVNYSGINTNNFAEDELVE -----------3333----------------------------3333------3333--- VEAPKAKPKK ---------- >ALPHA2D; SWP:NA; PDB:1QP6A; GEVEELEKKFKELWKGPRRGEIEELHKKFHELIKG ------------------3333------------- >FORMATE DEHYDROGENASE; SWP:Q8ZXP5; PDB:1QP8A; ELYVNFELPPEAEEELRKYFKIVRGGDLGNVEAALVSRITAEELAKPRLKFIQVVTAGLD 
--------3333---------------1111--------3333---------------33 HLPWESIPPHVTVAGNAGSNADAVAEFALALLLAPYKRIIQYGEKKRGDYGRDVEIPLIQ 33-11111111-----------------------------------------------22 GEKVAVLGLGEIGTRVGKILAALGAQVRGFSRTPKEGPWRFTNSLEEALREARAAVCALP 22---------------------------------------------------------- LNKHTRGLVKYQHLALAEDAVFVNVGRAEVLDRDGVLRILKERPQFIFASDVWWGRNDFA -1111----3333---1111------3333------------1111------3333--33 KDAEFFSLPNVVATPWVAGGYGNERVWRQVEAVRNLITYATGGRPRNIAKREDYIG 33--1111-----------1111--------------------------3333--- >LIGNIN PEROXIDASE; SWP:P11542; PDB:1QPAA; VACPDGVHTASNAACCAWFPVLDDIQQNLFHGGQCGAEAHEALRMVFHDSIAISPKLQSQ --3333-----33331111----------------------------------3333111 GKFGGGGADGSIITFSSIETTYHPNIGLDEVVAIQKPFIAKHGVTPGDFIAFAGAVGVSN 1--------3333----11113333----------------------------------- CPGAPQMQFFLGRPEATQAAPDGLVPEPFHTIDQVLARMLDAGGFDEIETVLLSAHSIAA 2222----------------------1111---------------3333-----3333-- ANDVDPTISGLPFDSTPGQFDSQFFVETQLRGTAFPGKTGIQGTVMSPLKGEMRLQTDHL ----1111-------1111--33333333-----------2222----2222-------- FARDSRTACEWQSFVNNQTKLQEDFQFIFTALSTLGHDMNAMIDCSEVIPAPKPVNFGPS -------------2222----------------22223333---3333------------ FFPAGKTHADIEQACASTPFPTLITAPGPSASVARIPPPPSPN --22223333----1111------------------------- >LCK KINASE; SWP:P06239; PDB:1QPCA; KPWWEDEWEVPRETLKLVERLGAGQFGEVWMGYYNGHTKVAVKSLKQGSMSPDAFLAEAN -11111111-3333---------1111------------------2222----------- LMKQLQHQRLVRLYAVVTQEPIYIITEYMENGSLVDFLKTPSGIKLTINKLLDMAAQIAE 3333--1111------------------1111333311113333---------------- GMAFIEERNYIHRDLRAANILVSDTLSCKIADFGLARLIEDNETAREGAKFPIKWTAPEA ---------------3333---1111------1111---------3333--3333-3333 INYGTFTIKSDVWSFGILLTEIVTHGRIPYPGMTNPEVIQNLERGYRMVRPDNCPEELYQ ------3333-----------1111----2222--------1111-----2222------ LMRLCWKERPEDRPTFDYLRSVLEDFFTATE ---1111-3333------------------- >3-PHOSPHOGLYCERATE KINASE; SWP:P00560; PDB:1QPG; SLSSKLSVQDLDLKDKRVFIRVDFNVPLDGKKITSNQRIVAALPTIKYVLEHHPRYVVLA 
1111--3333--2222------------%%%%---------------------------- SHLGQPNGERNEKYSLAPVAKELQSLLGKDVTFLNDCVGPEVEAAVKASAPGSVILLENL ----------3333-3333--------------------------1111----------- RYHIEEEGSRKVDGQKVKASKEDVQKFRHELSSLADVYINDAFGTAHRAHSSMVGFDLPQ --1111-----iiii---------------3333-------3333----1111------- RAAGFLLEKELKYFGKALENPTRPFLAILGGAKVADKIQLIDNLLDKVDSIIIGGGMAFT ---------------1111--------------11113333--1111-------3333-- FKKVLENTEIGDSIFDKAGAEIVPKLMEKAKAKGVEVVLPVDFIIADAFSADANTKTVTD ---------------3333----------------------------------------3 KEGIPAGWQGLDNGPESRKLFAATVAKAKTIVWNGPPGVFEFEKFAAGTKALLDEVVKSS 333---------------------1111----------3333------------------ AAGNTVIIGGGDTATVAKKYGVTDKISHVSTGGGASLELLEGKELPGVAFLSEKK ---------------------3333-------------1111--3333------- >QUINOLINATE ACID PHOSPHOR; SWP:O06594; PDB:1QPOA; GLSDWELAAARAAIARGLDEDLRYGPDVTTLATVPASATTTASLVTREAGVVAGLDVALL ---------------------1111---------1111---------------3333--- TLNEVLGTNGYRVLDRVEDGARVPPGEALMTLEAQTRGLLTAERTMLNLVGHLSGIATAT ------1111-------2222--------------------------------------- AAWVDAVRGTKAKIRDTRKTLPGLRALQKYAVRTGGGVNHRLGLGDAALIKDNHVAAAGS ----1111------------2222------------------1111-------------- VVDALRAVRNAAPDLPCEVEVDSLEQLDAVLPEKPELILLDNFAVWQTQTAVQRRDSRAP -----------1111--------------3333-------------------------11 TVMLESSGGLSLQTAATYAETGVDYLAVGALTHSVRVLDIGLDM 11--------3333-------------3333------------- >PORICINE HEMOGLOBIN (ALPH; SWP:P01965; PDB:1QPWA; VLSAADKANVKAAWGKVGGQAGAHGAEALERMFLGFPTTKTYFPHFNLSHGSDQVKAHGQ ----------------!!!!---------------3333---1111-------------- KVADALTKAVGHLDDLPGALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHHPDDFNPS -----------1111------------------3333---------------1111---- VHASLDKFLANVSTVLTSKYR --------------1111--- >Hemoglobin subunit beta; SWP:P02067; PDB:1QPWB; VHLSAEEKEAVLGLWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSNADAVMGNPKV --------------11113333---------------33333333----33332222--- KAHGKKVLQSFSDGLKHLDNLKGTFAKLSELHCDQLHVDPENFRLLGNVIVVVLARRLGH 
----------------33331111--------------3333----------------33 DFNPDVQAAFQKVVAGVANALAHKYH 333333---------------1111- >THIOREDOXIN PEROXIDASE 2; SWP:Q63716; PDB:1QQ2A; SGNAKIGHPAPSFKATAVMPDGQFKDISLSDYKGKYVVFFFYPLDFTFVCPTEIIAFSDR ----2222----------1111-----3333--------------------3333-3333 AEEFKKLNCQVIGASVDSHFSHLAWINTPKKQGGLGPMNIPLVSDPKRTIAQDYGVLKAD ----------------------------1111------------1111---1111----- EGISFRGLFIIDDKGILRQITINDLPVGRSVDEILRLVQAFQFTDKHGEVCPA -----------1111----------------------------1111------ >L-2-HALOACID DEHALOGENASE; SWP:Q60099; PDB:1QQ5A; MIKAVVFDAYGTLFDVQSVADATERAYPGRGEYITQVWRQKQLEYSWLRALMGRYADFWS --------2222--1111--------2222------------------------------ VTREALAYTLGTLGLEPDESFLADMAQAYNRLTPYPDAAQCLAELAPLKRAILSNGAPDM ----------1111-----------3333-----1111----1111-------------- LQALVANAGLTDSFDAVISVDAKRVFKPHPDSYALVEEVLGVTPAEVLFVSSNGFDVGGA -----11111111------3333------3333---------3333-------------- KNFGFSVARVARLSQEALARELVSGTIAPLTMFKALRMREETYAEAPDFVVPALGDLPRL --------------------1111---3333---------1111--------3333---- VRGMA ----- >BETA-2 MICROGLOBULIN; SWP:P30504; PDB:1QQDA; SHSMRYFSTSVSWPGRGEPRFIAVGYVDDTQFVRFDSDAASPRGEPREPWVEQEGPEYWD ------------2222----------!!!!-----------------1111---3333-- RETQKYKRQAQADRVNLRKLRGYYNQSEDGSHTLQRMFGCDLGPDGRLLRGYNQFAYDGK --------------------------------------------------------iiii DYIALNEDLRSWTAADTAAQITQRKWEAAREAEQRRAYLEGTCVEWLRRYLENGKETLQR -----3333------3333--------------------------------1111----- AEHPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQWDGEDQTQDTELVETRPAGDGTF -------------------------------------iiii-1111-------------- QKWAAVVVPSGEEQRYTCHVQHEGLPEPLTLRW -----------3333------1111-------- >VESICULAR TRANSPORT PROTE; SWP:NA; PDB:1QQEA; ISDPVELLKRAEKKGVPSSGFMKLFSGSDSYKFEEAADLCVQAATIYRLRKELNLAGDSF ---------------------------3333----------------------------- LKAADYQKKAGNEDEAGNTYVEAYKCFKSGGNSVNAVDSLENAIQIFTHRGQFRRGANFK ------------3333-----------1111----------------1111-3333---- 
FELGEILENDLHDYAKAIDCYELAGEWYAQDQSVALSNKCFIKCADLKALDGQYIEASDI ------------3333-------------------------------------------- YSKLIKSSMGNRLSQWSLKDYFLKKGLCQLAATDAVAAARTLQEGQSESNFLKSLIDAVN --------------3333-----------1111--------3333--3333--------- EGDSEQLSEHCKEFDNFMRLDKWKITILNKIKESIQQQEDD -----------------------------------3333-- >COMPLEMENT C3DG; SWP:P01026; PDB:1QQFA; GEQNMIGMTPTVIAVHYLDQTEQWEKFGLEKRQEALELIKKGYTQQLAFKQPISAYAAFN -----------------------33333333---------------11113333----11 NRPPSTWLTAYVSRVFSLAANLIAIDSQVLCGAVKWLILEKQKPDGVFQEDGPVIHQEMI 11----------------1111--------------------1111---------3333- GGFRNTKEADVSLTAFVLIALQEARDICEGQVNSLPGSINKAGEYLEASYLNLQRPYTVA -------3333-----------------1111-----------------1111------- IAGYALALMNKLELTKFLNTAKDRNRWEEPGQQLYNVEATSYALLALLLLKDFDSVPPVV ------1111-------3333%%%%--------------------------3333----- RWLNDERYYGGGYGSTQATFMVFQALAQYRADV --1111--2222--------------------- >INSULIN RECEPTOR SUBSTRAT; SWP:P35568; PDB:1QQGA; DVRKVGYLRKPKSMHKRFFVLRAASEAGGPARLEYYENEKKWRHKSSAPKRSIPLESCFN -----------------------------------------1111--------3333--- INKRADSKNKHLVALYTRDEHFAIAADSEAEQDSWYQALLQLHAFKEVWQVILKPKGLGQ ----------------1111---------------------------------------1 TKNLIGIYRLCLTSKTISFVKLNSEAAAVVLQLMNIRRCGHSENFFFIEVGRSAVTGPGE 111-----------------2222-------3333---------------1111------ FWMQVDDSVVAQNMHETILEAMRAMSD ------3333------------3333- >PAPILLOMAVIRUS TRANSCRIPT; SWP:P06790; PDB:1QQHA; KSKAHKAIELQMALQGLAQSAYKTEDWTLQDTCEELWNTEPTHCFKKGGQTVQVYFDGNK -------------------1111----3333--3333----------------------- DNCMTYVAWDSVYYMTDAGTWDKTATCVSHRGLYYVKEGYNTFYIEFKSECEKYGNTGTW ---------------3333---------3333----iiii--------------3333-- EVHFGNNVIDCNDSMCSTSDDTVS ---!!!!----------------- >FIBROBLAST GROWTH FACTOR ; SWP:Q02195; PDB:1QQKA; DIRVRRLFCRTQWYLRIDKRGKVKGTQEMRNSYNIMEIRTVAVGIVAIKGVESEYYLAMN --------1111-----------------------------------------------1 KEGKLYAKKECNEDCNFKELILENHYNTYASAKWGGEMFVALNQKGLPVKGKKTKKEQKT 
111---------1111-----3333------------------------1111----333 AHFLPMAIT 3-------- >FIBROBLAST GROWTH FACTOR ; SWP:Q02195; PDB:1QQLA; DIRVRRLFCRTQWYLRIDKRGKVKGTQEMRNSYNIMEIRTVAVGIVAIKGVESEYYLAMN --------1111-----1111--------------------2222--------------- KEGKLYAKQTPNEECLFLERLEENHYNTYISKKHAEKNWFVGLKKNGSCKRGPRTHYGQK -----------1111---------------3333---------1111---3333-22221 AILFLPLPVSS 111-------- >Genome polyprotein; SWP:P03305; PDB:1QQP1; TTSAGESADPVTTTVENYGGETQIQRRQHTDVSFIMDRFVKVTPQNQINILDLMQVPSHT ---3333-------3333--------1111---------------------1111-1111 LVGALLRASTYYFSDLEIAVKHEGDLTWVPNGAPEKALDNTTNPTAYHKAPLTRLALPYT -----1111--------------------22223333----------------------- APHRVLATVYNGECRTLPTSFNYGAIKATRVTELLYRMKRAETYCPRPLLAIHPTEARHK -----------------1111--------------------------------------- QKIVAPVK -------- >Genome polyprotein; SWP:P03305; PDB:1QQP2; DKKTTTLLEDRILTTRNGHTTSTTQSSVGVTYGYATAEDFVSGPNTSGLETRVVQAERFF -------!!!!-------------------------------3333------3333---- KTHLFDWVTSDSFGRCHLLELPTDHKGVYGSLTDSYAYMRNGWDVEVTAVGNQFNGGCLL -------11112222-----------33333333-------------------------- VAMVPELCSIQKRELYQLTLFPHQFINPRTNMTAHITVPFVGVNRYDQYKVHKPWTLVVM ----------3333------------3333-----------------3333--------- VVAPLTVNTEGAPQIKVYANIAPTNVHVAGEFPSKE ------------------------------------ >Genome polyprotein; SWP:P03305; PDB:1QQP3; GIFPVACSDGYGGLVTTDPKTADPVYGKVFNPPRNQLPGRFTNLLDVAEACPTFLRFEGG -------2222----------------------2222-------------------2222 VPYVTTKTDSDRVLAQFDMSLAAKHMSNTFLAGLAQYYTQYSGTINLHFMFTGPTDAKAR -------------------11111111-------1111---------------1111--- YMVAYAPPGMEPPKTPEAAAHCIHAEWDTGLNSKFTFSIPYLSAADYTYTASDVAETTNV --------------33331111------------------------------1111--11 QGWVCLFQITHGKADGDALVVLASAGKDFELRLPVDARAE 11-----------2222--------1111----------- >Genome polyprotein; SWP:P03305; PDB:1QQP4; SGNTGSIINNYYMQQYQNSMDTQLGNDWFSKLASSAFSGLFGALLA ------------3333----------------1111---------- >STREPTOKINASE DOMAIN B; SWP:P00779; PDB:1QQRA; 
IQNQAKSVDVEYTVQFTPLNPDDDFRPGLKLTKLLKTLAIGDTITSQELLAQAQSILNKN ---------------------3333-------------2222------------------ HPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLI 2222-----------!!!!----------------------------------------- SEKYYVLKKGEKPYDPFD ------------------ >LYSOZYME C; SWP:NA; PDB:1QQYA; KIFSKCELARKLKSMGMDGFHGYSLANWVCMAEYESNFNTQAFNGRNSNGSSDYGIFQLN ------------11112222---3333--------%%%%---------------1111-1 SKWWCKSNSHSSANACNIMCSKFLDDNIDDDIACAKRVVKDPNGMSAWVAWVKHCKGKDL 111---3333---1111-3333------------------3333----3333--1111-1 SKYLASCNL 1111111-- >4'-PHOSPHOPANTETHEINYL TR; SWP:P39135; PDB:1QR0A; MKIYGIYMDRPLSQEENERFMTFISPEKREKCRRFYHKEDAHRTLLGDVLVRSVISRQYQ --------------------1111-------3333-3333----------------1111 LDKSDIRFSTQEYGKPCIPDLPDAHFNISHSGRWVIGAFDSQPIGIDIEKTKPISLEIAK -3333-----1111---1111---------!!!!-------------------------- RFFSKTEYSDLLAKDKDEQTDYFYHLWSMKESFIKQEGKGLSLPLDSFSVRLHQDGQVSI ---3333---33331111--------------------!!!!-----------iiii--- ELPDSHSPCYIKTYEVDPGYKMAVCAAHPDFPEDITMVSYEELLRAAA ---1111---------1111---------------------------- >TENASCIN; SWP:P10039; PDB:1QR4A; DNPKDLEVSDPTETTLSLRWRRPVAKFDRYRLTYVSPSGKKNEMEIPVDSTSFILRGLDA -----------------------------------1111-------1111---------- GTEYTISLVAEKGRHKSKPTTIKGSTVVGSPKGISFSDITENSATVSWTPPRSRVDSYRV ------------------------------------------------------------ SYVPITGGTPNVVTVDGSKTRTKLVKLVPGVDYNVNIISVKGFEESEPISGILKT ---1111--------3333------------------------------------ >PHOSPHOCARRIER PROTEIN HP; SWP:P23534; PDB:1QR5A; MEQQSYTIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAE ---------3333------------------------------------------2222- ITIYADGSDEADAIQAITDVLSKEGLTE ----------3333-------------- >QUINONE-REDUCTASE; SWP:P05982; PDB:1QRDA; AVRRALIVLAHAERTSFNYAMKEAAVEALKKKGWEVVESDLYAMNFNPLISRNDITGEPK ------------1111-------------1111------3333-------1111------ DSENFQYPVESSLAYKEGRLSPDIVAEQKKLEAADLVIFQFPLYWFGVPAILKGWFERVL 
-----3333---------------------1111---------%%%%-3333-------- VAGFAYTYATMYDKGPFQNKKTLLSITTGGSGSMYSLQGVHGDMNVILWPIQSGILRFCG --11111111!!!!1111------------3333-1111------3333-------1111 FQVLEPQLVYSIGHTPPDARVQVLEGWKKRLETVWEESPLYFAPSSLFDLNFQAGFLLKK ----------1111------------------3333-------1111---3333------ EVQEEQKKNKFGLSVGHHLGKSIPADNQIKARK -----1111----3333iiii----1111---- >CARBONIC ANHYDRASE; SWP:P40881; PDB:1QREA; TVDEFSNIRENPVTPWNPEPSAPVIDPTAYIDPQASVIGEVTIGANVMVSPMASIRSDEG -------------1111--------1111--1111--------------2222------- MPIFVGDRSNVQDGVVLHALETINEEGEPIEDNIVEVDGKEYAVYIGNNVSLAHQSQVHG -----------2222--------1111--3333---iiii------------2222---- PAAVGDDTFIGMQAFVFKSKVGNNCVLEPRSAAIGVTIPDGRYIPAGMVVTSQAEADKLP ----------2222------------------------2222--2222-------1111- EVTDDYAYSHTNEAVVYVNVHLAEGYKETS --1111-1111---------------1111 >Gag polyprotein; SWP:P03345; PDB:1QRJB; QMKDLQAIKQEVSQAAPGSPQFMQTIRLAVQQFDPTAKDLQDLLQYLCSSLVASLHHQQL 3333-----------------3333----------------------------------- DSLISEAETRGITGYNPLAGPLRVQANNPQQQGLRREYQQLWLAAFAALPGSAKDPSWAS -------1111----3333----3333-----------------1111-------1111- ILQGLEEPYHAFVERLNIALDNGLPEGTPKDPILRSLAYSNANKECQKLLQARGHTNSPL -------------------3333------3333--------------------------3 GDMLRACQTWTPKDKTKVL 333---------------- >Pepsin A [Precursor]; SWP:P00790; PDB:1QRPE; VDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNRFNPED --------%%%%-----------------------------1111-3333------3333 SSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYAPFDGI 1111--------------------------%%%%---------------3333------- LGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLSADDQSGSVVIFGGIDSSYYTGSLN ----33332222--------1111------------%%%%----------3333------ WVPVTVEGYWQITVDSITMNGEAIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASEN ------------------%%%%---1111-----1111--------------1111---1 SDGDMVVSCSAISSLPDIVFTINGVQYPVPPSAYILQSEGSCISGFQGMNLPTESGELWI 111----33331111------%%%%----3333-------------------1111---- LGDVFIRQYFTVFDRANNQVGLAPVA 
----3333------1111-------- >DNA (5'-D(*GP*CP*GP*AP*TP; SWP:Q05783; PDB:1QRVA; SDKPKRPLSAYMLWLNSARESIKRENPGIKVTEVAKRGGELWRAMKDKSEWEAKAAKAKD -------------------------2222------------------------------- DYDRAVKEFEANG ---------1111 >PROTEIN (HOMEOBOX VENTRAL; SWP:P22808; PDB:1QRYA; GSHMSDGLPNKKRKRRVLFTKAQTYELERRFRQQRYLSAPEREHLTSLIRLTPTQVKIWF -------3333-------------------------------3333------3333---- QNHRYKTKRAQNEKGYEGHP ---------3333------- >2-OXOISOVALERATE DEHYDROG; SWP:P09060; PDB:1QS0A; NEYAPLRLHVPEPTGRPGCQTDFSYLRLNDAGQARKPPVDVDAADTADLSYSLVRVLDEQ ---------------2222---3333---2222----11113333-3333-------111 GDAQGPWAEDIDPQILRQGRALKTRIFDSRVVAQRQKKSFYQSLGEEAIGSGQALALNRT 1---1111----------------------------------2222-------1111333 DCFPTYRQQSILARDVSLVEICQLLSNERDPLKGRQLPIYSVREAGFFTISGNLATQFVQ 3------3333-----33333333--1111-iiii------3333-------2222---- AVGWAASAIKGDTKIASAWIGDGATAESDFHTALTFAHVYRAPVILNVVNNQWAISTFQA -------1111------------------------------------------!!!!333 IAGGESTTFAGRGVGCGIASLRVDGNDFVAVYAASRWAAERARRGLGPSLIEWVTYRAGP 3--22223333--1111------1111--------------1111--------------- HSTSDDPSKYRPADDWSHFPLGDPIARLKQHLIKIGHWSEEEHQATTAEFEAAVIAAQKE -11113333--11111111-------------1111------------------------ AEQYGTLANGHIPSAASFEDVYKEPDHLRRQRQEL -1111------------------------------ >2-oxoisovalerate dehydrog; SWP:P09061; PDB:1QS0B; ATTTTIQALRSADVLERDDNVVVYGQDVGYFGGVFRCTEGLQTKYGKSRVFDAPISESGI -----------------1111---222233331111---3333----------------- VGTAVGGAYGLRPVVEIQFADYFYPASDQIVSEARLRYRSAGEFIAPLTLRPCGGGIYGG ------3333--------33333333-----------1111------------------1 QTHSQSPEAFTQVCGLRTVPSNPYDAKGLLIASIECDDPVIFLEPKRLYNGPFDGHHDRP 111------1111-------------------------------3333------------ VTPWSKHPHSAVPDGYYTVPLDKAAITRPGNDVSVLTYGTTVYVAQVAAEESGVDAEVID ---1111----------------------------------------------------- LRSLWPLDLDTIVESVKKTGRCVVVHEATRTCGFGAELVSLVQEHCFHHLEAPIERVTGW --------3333-----------------2222-------3333--1111---------- DTPYPHAQEWAYFPGPSRVGAALKKVEV 
-----1111----------3333----- >ADP-RIBOSYLTRANSFERASE; SWP:Q844J9; PDB:1QS1A; TDKVEDFKEDKEKAKEWGKEKEKEWKLTATEKGKMNNFLDNKNDIKTNYKEITFSMAGSF --------------------3333----------------2222-11113333--22223 EDEIKDLKEIDKMFDKTNLSNSIITYKNVEPTTIGFNKSLTEGNTINSDAMAQFKEQFLD 333-------3333---------------3333--------!!!!------------222 RDIKFDSYLDTHLTAQQVSSKERVILKVTVPSGKGSTTPTKAGVILNNSEYKMLIDNGYM 2---------------------------------------------%%%%---------- VHVDKVSKVVKKGVECLQIEGTLKKSLDFKNDINAEAHSWGMKNYEEWAKDLTDSQREAL ----------iiii--------------!!!!----------------1111-------- DGYARQDYKEINNYLRNQGGSGNEKLDAQIKNISDALGKKPIPENITVYRWCGMPEFGYQ -----------------------------------1111-------------3333---1 ISDPLPSLKDFEEQFLNTIKEDKGYMSTSLSSERLAAFGSRKIILRLQVPKGSTGAYLSA 111-------------------------------3333-----------2222---3333 IGGFASEKEILLDKDSKYHIDKVTEVIIKGVKRYVVDATLLT -!!!!----------------------iiii----------- >PLASMEPSIN; SWP:O60989; PDB:1QS8A; SENDVIELDDVANIMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCNSSGCSIKNLYDS ----------%%%%--------1111-----------------1111-3333------33 SKSKSYEKDGTKVDITYGSGTVKGFFSKDLVTLGHLSMPYKFIEVIDTDDLEPIYSSVEF 331111----------1111------------!!!!-----------1111-3333---- DGILGLGWKDLSIGSIDPIVVELKNQNKIDNALFTFYLPVHDVHAGYLTIGGIEEKFYEG -------3333------------------------------------------1111--- NITYEKLNHDLYWQIDLDVHFGKQTMEKANVIVDSGTTTITAPSEFLNKFFANLNVIKVP --------------------!!!!---------1111-----3333-------------- FLPFYVTTCDNKEMPTLEFKSANNTYTLEPEYYMNPILEVDDTLCMITMLPVDIDSNTFI -------1111---------1111----3333---------------------------- LGDPFMRKYFTVFDYDKESVGFAIAKN ---3333-------1111--------- >SOLUBLE LYTIC TRANSGLYCOS; SWP:P03810; PDB:1QSAA; DSLDEQRSRYAQIKQAWDNRQMDVVEQMMPGLKDYPLYPYLEYRQITDDLMNQPAVTVTN ----------------1111-------33331111-3333--------3333-3333--- FVRANPTLPPARTLQSRFVNELARREDWRGLLAFSPEKPGTTEAQCNYYYAKWNTGQSEE ----1111--------------1111--------------------------1111---- AWQGAKELWLTGKSQPNACDKLFSVWRASGKQDPLAYLERIRLAMKAGNTGLVTVLAGQM 
---------------3333-------3333--3333--------1111--------1111 PADYQTIASAIISLANNPNTVLTFARTTGATDFTRQMAAVAFASVARQDAENARLMIPSL 33331111--------1111---------------------------------------- AQAQQLNEDQIQELRDIVAWRLMGNDVTDEQAKWRDDAIMRSQSTSLIERRVRMALGTGD ------------------1111----------------1111------------------ RRGLNTWLARLPMEAKEKDEWRYWQADLLLERGREAEAKEILHQLMQQRGFYPMVAAQRI -------11113333--------------1111-----------3333------------ GEEYELKIDKAPQNVDSALTQGPEMARVRELMYWNLDNTARSEWANLVKSKSKTEQAQLA ----------------3333-----------1111------------2222--------- RYAFNNQWWDLSVQATIAGKLWDHLEERFPLAYNDLFKRYTSGKEIPQSYAMAIARQESA ---1111-------------11113333------------1111--------------%% WNPKVKSPVGASGLMQIMPGTATHTVKMFSIPGYSSPGQLLDPETNINIGTSYLQYVYQQ %%----1111--1111-------------------3333-------------------11 FGNNRIFSSAAYNAGPGRVRTWLGNSAGRIDAVAFVESIPFSETRGYVKNVLAYDAYYRY 11-3333------------------iiii------1111--------------------1 FMGDKPTLMSATEWGRRY 111--------------- >BETA-TUBULIN BINDING POST; SWP:P48606; PDB:1QSDA; TQLDIKVKALKRLTKEEGYYQQELKDQEAHVAKLKEDKSVDPYDLKKQEEVLDDTKRLLP ------------------------------------33333333---------------- TLYEKIREFKEDLEQFLKTYQGTEDVSDARSAITSAQELLDS ------------------------------------------ >ENOYL-[ACYL-CARRIER-PROTE; SWP:P29132; PDB:1QSGA; GFLSGKRILVTGVASKLSIAYGIAQAMHREGAELAFTYQNDKLKGRVEEFAAQLGSDIVL 1111----------1111---------1111--------3333--------1111----- QCDVAEDASIDTMFAELGKVWPKFDGFVHSIGFAPGDQLDGDYVNAVTREGFKIAHDISS ----------------------------------3333---3333--------------- YSFVAMAKACRSMLNPGSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMGPEG ---------1111-2222------3333----------------------------1111 VRVNAISAGPIRTLAASGIKDFRKMLAHCEAVTPIRRTVTIEDVGNSAAFLCSDLSAGIS ------------3333----3333--------1111----------------3333---- GEVVHVDGGFSIAAMNEL ------iiii-------- >HPA2 HISTONE ACETYLTRANSF; SWP:Q06592; PDB:1QSMA; DNITVRFVTENDKEGWQRLWKSYQDFYEVSFPDDLDDFNFGRFLDPNIKMWAAVAVESSS --------1111--------------------------------1111------------ 
EKIIGMINFFNHMTTWDFKDKIYINDLYVDENSRVKGAGGKLIQFVYDEADKLGTPSVYW -------------1111------------1111-----------------1111------ CTDESNHRAQLLYVKVGYKAPKILYKRKGY --11113333----------------2222 >TGCN5 HISTONE ACETYL TRAN; SWP:Q27198; PDB:1QSTA; LDFDILTNDGTHRNMKLLIDLKNIFSRQLPKMPKEYIVKLVFDRHHESMVILKNKQKVIG ----------------------------11113333------1111-------------- GICFRQYKPQRFAEVAFLAVTANEQVRGYGTRLMNKFKDHMQKQNIEYLLTYADNFAIGY -------1111---------1111-----------------1111--------------- FKKQGFTKEHRMPQEKWKGYIKDYDGGTLMECYIHPYVDY -1111-------33332222-------------------- >XYLOSE ISOMERASE; SWP:P50910; PDB:1QT1A; SYQPTPEDKFTFGLWTVGWQGRDPFGDATRGALDPAESVRRLAELGAHGVTFHDDDLIPF ------------1111------1111----------------1111------3333--22 GATDSERAEHIKRFRQGLDETGMKVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIR 22--------------------------------3333---------------------- NIDLAVELGAQTYVAWGGREGAESGAAKDVRVALDRMKEAFDLLGEYVTSQGYDTPFAIE -----1111-------1111---1111--------------------------------- PKPNEPRGDILLPTIGHALAFIDGLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGKL ---------------------1111-3333----------1111---------------- FHIDLNGQSGIKYDQDLRFGPGDLRAAFWLVDLLESAGYEGPRHFDFKPPRTEDFDGVWA -------------------------------------------------1111------- SAAGCMRNYLILKERAAAFRADPEVQEALRAARLDELAQPTAGDGLQALLPDRSAFEDFD ---------------------------------3333----1111------111111113 PDAAAARGMAFERLDQLAMDHLLGARG 3333333-------------------- >EXFOLIATIVE TOXIN B; SWP:P09332; PDB:1QTFA; KEYSAEEIRKLKQKFEVPPTDKELYTHITDNARSPYNSVGTVFVKGSTLATGVLIGKNTI ---3333-----1111----1111------------------------------------ VTNYHVAREAAKNPSNIIFTPAQNRDAEKNEFPTPYGKFEAEEIKESPYGQGLDLAIIKL --33331111--1111------------------------------1111---------- KPNEKGESAGDLIQPANIPDHIDIAKGDKYSLLGYPYNYSAYSLYQSQIEMFNDSQYFGY --1111-1111-------------2222-----------2222----------------- TEVGNSGSGIFNLKGELIGIHSGKGGQHNLPIGVFFNRKISSLYSVDNTFGDTLGNDLKK -3333------1111-------------------1111--3333---------------- RAKLDK -1111- >CASPASE-8; SWP:Q14790; PDB:1QTNA; 
DKVYQMKSKPRGYCLIINNHNFAKAREKVPKLHSIRDRNGTHLDAGALTTTFEELHFEIK ----------------------------3333-----2222------------------- PHDDCTVEQIYEILKIYQLMDHSNMDCFICCILSHGDKGIIYGTDGQEAPIYELTSQFTG ----------------1111-1111-----------2222--1111---3333-----33 LKCPSLAGKPKVFFIQACQGDNYQKGIPVETD 331111-------------------------- >Caspase-8 [Precursor]; SWP:Q14790; PDB:1QTNB; TRYIPDEADFLLGMATVNNCVSYRNPAEGTWYIQSLCQSLRERCPRGDDILTILTEVNYE ----1111----------------------------------3333-------------- VSNKDDKKNMGKQMPQPTFTLRKKLVFPSD 1111-------------------------- >BLEOMYCIN-BINDING PROTEIN; SWP:Q53793; PDB:1QTOA; MVKFLGAVPVLTAVDVPANVSFWVDTLGFEKDFGDRDFAGVRRGDIRLHISRTEHQIVAD ------------------------------------------!!!!--------3333-- NTSAWIEVTDPDALHEEWARAVSTDYADTSGPAMTPVGESPAGREFAVRDPAGNCVHFTA -----------------1111---3333-----------1111------1111------- GE -- >GLUTAMINYL-TRNA SYNTHETAS; SWP:P00962; PDB:1QTQA; TNFIRQIIDEDLASGKHTTVHTRFPPEPNGYLHIGHAKSICLNFGIAQDYKGQCNLRFDD -----------------------------------------------1111--------- TNPVKEDIEYVESIKNDVEWLGFHWSGNVRYSSDYFDQLHAYAIELINKGLAYVDELTPE -3333-3333--------------------3333------------1111-------333 QIREYRGTLTQPGKNSPYRDRSVEENLALFEKMRAGGFEEGKACLRAKIDMASPFIVMRD 3--------------1111-------------------2222-------1111-3333-- PVLYRIKFAEHHQTGNKWCIYPMYDFTHCISDALEGITHSLCTLEFQDNRRLYDWVLDNI -----------------------3333-----------------------------1111 TIPVHPRQYEFSRLNLEYTVMSKRKLNLLVTDKHVEGWDDPRMPTISGLRRRGYTAASIR ---------------2222----------1111---1111-------------------- EFCKRIGVTKQDNTIEMASLESCIREDLNENAPRAMAVIDPVKLVIENYQGEGEMVTMPN ---------------3333--------------------------1111----------- HPNKPEMGSRQVPFSGEIWIDRADFREEANKQYKRLVLGKEVRLRNAYVIKAERVEKDAE 11111111------------1111-----3333----------2222----------111 GNITTIFCTYDADTLGVIHWVSAAHALPVEIRLYDRLFSVPNPGAADDFLSVINPESLVI 1--------------------3333----------------3333--3333--1111--- KQGFAEPSLKDAVAGKAFQFEREGYFCLDSRHSTAEKPVFNRTVGLRDT -----3333----------2222----------3333------------ >ENDONUCLEASE IV; SWP:P12638; 
PDB:1QTWA; MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTIDEFKAACEKYH ---------22223333----1111----------------------------------- YTSAQILPHDSYLINLGHPVTEALEKSRDAFIDEMQRCEQLGLSLLNFHPGSHLMQISEE -3333-----33331111--------------------1111----------%%%%---- DCLARIAESINIALDKTQGVTAVIENTAGQGSNLGFKFEHLAAIIDGVEDKSRVGVCIDT ------------3333-------------!!!!---3333----1111-3333------- CHAFAAGYDLRTPAECEKTFADFARTVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIG ---------------------------3333-----------2222------22223333 HDAFRWIMQDDRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA ---------3333----------3333------------------ >CALMODULIN; SWP:Q9SDJ0; PDB:1QTXA; ADQLTDEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGN -------------------1111-------------1111------------1111---- GTIDFPEFLNLMARKMKDTDSEEELKEAFRVFDKDGNGFISAAELRHVMTNLGEKLTDEE -------------------1111---------1111------------------------ VDEMIREADVDGDGQVNYEEFVQVMMAK ----------------------1111-- >INFLUENZA RECOMBINANT HA2; SWP:A2V839; PDB:1QU1A; QAADLKSTQAAIDQINGKLNRVIEKTNEKFHQIEKEFSEVEGRIQDLEKYVEDTKIDLWS ------------------------------------------------------------ YNAELLVALENQHTIDLTDSEMNKLFEKTRRQLGSFKIYHKCDNACIESIRNGTYDHDVY ---------1111--11113333------------------------------------- RDEALNNRFQIKG ------------- >PROTEIN KINASE SPK1; SWP:P22216; PDB:1QU5A; EAETREQKLLHSNNTENVKSSKKKGNGRFLTLKPLPDSIIQESLEIQQGVNPFFIGRSED -------------%%%%-----------------3333-------------------333 CNCKIEDNRLSRVHCFIFKKRHAVGKSMYESPAQGLDDIWYCHTGTNVSYLNNNRMIQGT 3-----3333-------------------------------------------------- KFLLQDGDEIKIIWDKNNKFVIGFKVEINDTTGLFNEGLGMLQEQRVVLKQTAEEKDLVK --------------3333---------------2222--------------------111 KL 1- >PROTEIN KINASE PKR; SWP:NA; PDB:1QU6A; GSHMEMAGDLSAGFFMEELNTYRQKQGVVLKYQELPNSGPPHDRRFTFQVIIDGREFPEG ---------------------3333----------------------------------- EGRSKKEAKNAAAKLAVEILNKEKKAVSPLLLTTTNSSEGLSMGNYIGLINRIAQKKRLT ----3333----------3333-------------------------3333--------- 
VNYEQCASGVHGPEGFHYKCKMGQKEYSIGTGSTKQEAKQLAAKLAYLQILSEETGSGC ----------------------------------------------------------- >METHYL-ACCEPTING CHEMOTAX; SWP:P02942; PDB:1QU7A; RTEQQAASLEQTAASMEQLTATVKQNAENARQASHLALSASETAQRGGKVVDNVVQTMRD ----3333------------1111---3333----------------------------- ISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRNLAQRSAQA -3333-------------------------------3333-------------------- AREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQV ----------------------------3333-----3333---3333-3333------- GLAVAEMDRVTQQNAALVEQSAAAAAALEEQASRLTEAVAVFRIQQQ ----------------------------------------------- >YJGF PROTEIN; SWP:P39330; PDB:1QU9A; SKTIATENAPAAIGPYVQGVDLGNMIITSGQIPVNPKTGEVPADVAAQARQSLDNVKAIV -----1111--------------------------------------------------- EAAGLKVGDIVKTTVFVKDLNDFATVNATYEAFFTEHNATFPARSVEVARLPKDVKIEIE 1111-3333---------3333------------1111------------2222------ AIAVRR ------ >ACUTOLYSIN-C; SWP:P60244; PDB:1QUAA; PAPQTSIELFLIVDHSMYAKYNSNSSKITTTLKARVNIMNAIYSSLNLVITLSGIEMWSA ------------------1111--------------------3333-------------- ADLITVQSSSRNTLKLFASWRETDLLKRTSNDNAQLLTATNFNGNTVGLAYLKTMCNSKY -----------------------3333--------------------------2222--- SVGLIQDHSAIPLLMAVTMAHELGHNLGMNHDGAGCSCATCIMAPVLSSGPAKSFSDCSK ----------3333--------------------------1111-----------3333- HDYQSFLTIHKPQCLLN ------------1111- >HUMAN BETA2-GLYCOPROTEIN ; SWP:P02749; PDB:1QUBA; GRTCPKPDDLPFSTVVPLKTFYEPGEEITYSCKPGYVSRGGMRKFICPLTGLWPINTLKC ----------------------2222------2222-2222------3333--------- TPRVCPFAGILENGAVRYTTFEYPNTISFSCNTGFYLNGADSAKCTEEGKWSPELPVCAP ----------2222-----------------2222----------1111----------- IICPPPSIPTFATLRVYKPSAGNNSLYRDTAVFECLPQHAMFGNDTITCTTHGNWTKLPE --------------------!!!!-2222------2222----------1111------- CREVKCPFPSRPDNGFVNYPAKPTLYYKDKATFGCHDGYSLDGPEEIECTKLGNWSAMPS -----------2222----------2222------2222----------1111------- CKASCKVPVKKATVVYQGERVKIQEKFKNGMLHGDKVSFFCKNKEKKCSYTEDAQCIDGT 
---------------%%%%--1111-------------------------------iiii IEVPKCFKEHTDASDVKPC ---3333----3333---- >REPLICATION PROTEIN A 32 ; SWP:P35244; PDB:1QUQA; HIVPCTISQLLSATLVDEVFRIGNVEISQVTIVGIIRHAEKAPTNIVYKIDDMTAAPMDV -----33331111----------------------------------------------- RQWVDTNTVVPPETYVKVAGHLRSFQNKKSLVAFKIMPLEDMNEFTTHILEVINAHMVLS --------------------------------------------------------1111 K - >Replication protein A 32 ; SWP:P15927; PDB:1QUQB; DMMDLPRSRINAGMLAQFIDKPVCFVGRLEKIHPTGKMFILSDGEGKNGTIELMEPLDEE ----------33331111--------------3333------------------------ ISGIVEVVGRVTAKATILCTSYVQFKEDSHPFDLGLYNEAVKIIHDFPQFYPLG -----------1111-------------------------------3333---- >LYTIC MUREIN TRANSGLYCOSY; SWP:P41052; PDB:1QUSA; MVEPQHNVMQMGGDFANNPNAQQFIDKMVNKHGFDRQQLQEILSQAKRLDSVLRLMDNQG ----1111----1111--------------------------1111--3333-------- PNGAWLRYRKKFITPDNVQNGVVFWNQYEDALNRAWQVYGVPPEIIVGIIGVETRWGRVM -----------------------------------------3333---------iiii-- GKTRILDALATLSFNYPRRAEYFSGELETFLLMARDEQDDPLNLKGSFAGAMGYGQFMPS ---------------3333--------------------1111---1111--1111---- SYKQYAVDFSGDGHINLWDPVDAIGSVANYFKAHGWVKGDQVAVMANGQAPGLPNGFKTK ---------------1111------------1111-2222---------1111--1111- YSISQLAAAGLTPQQPLGNHQQASLLRLDVGTGYQYWYGLPNFYTITRYNHSTHYAMAVW ------1111------!!!!---------------------------------------- QLGQAVALARVQ -------3333- >HUMAN SKELETAL MUSCLE ALP; SWP:P35609; PDB:1QUUA; GSSNEIRRLERLEHLAEKFRQKASTHETWAYGKEQILLQKDYESASLTEVRALLRKHEAF --1111--------------------3333-----------1111--------------- ESDLAAHQDRVEQIAAIAQELNELDYHDAVNVNDRCQKICDQWDRLGTLTQKRREALERM ---3333--------------1111----------------------------------- EKLLETIDQLHLEFAKRAAPFNNWMEGAMEDLQDMFIVHSIEEIQSLITAHEQFKATLPE -----------------------------3333--------3333-------------33 ADGERQSIMAIQNEVEKVIQSYNIRISSSNPYSTVTMDELRTKWDKVKQLVPIRDQSLQE 33---------------------------1111--------------------------- ELARQHAN -------- >HSTX1 TOXIN; SWP:P59867; PDB:1QUZA; ASCRTPKDCADPCRKETGCPYGKCMNRKCKCNRC 
----1111----------------%%%%------ >OBELIN; SWP:Q27709; PDB:1QV1A; YAVKLKTDFDNPRWIKRHKHMFDFLDINGNGKITLDEIVSKASDDICAKLEATPEQTKRH -------1111--------------1111------------------------------- QVCVEAFFRGCGMEYGKEIAFPQFLDGWKQLATSELKKWARNEPTLIREWGDAVFDIFDG --------1111-2222---------------------1111------------3333-- TITLDEWKAYGKISGISPSQEDCEATFRHCDLDNAGDLDVDEMTRQHLGFWYTLDPEADG --------------------------------1111------------------3333-1 LYGNGVP 111---- >F420-dependent methylenet; SWP:P94951; PDB:1QV9A; TVAKAIFIKCGNLGTSDLLDERADREDVEFRVVGTSVKDPECVEAAVEALDIAEDFEPDF ------------3333---1111-----------!!!!-------------3333----- IVYGGPNPAAPGPSKARELADSEYPAVIIGDAPGLKVKDEEEQGLGYILVKPDALGARRE ------1111-3333---1111--------33331111--1111-----1111----333 FLDPVEAIYNADLKVLAATGVFRVVQEAFDELIEKAKEDEISENDLPKLVIDRNTLLERE 3-3333----------1111--------------3333---3333------33331111- EFENPYAVKAAALEIAENVADVSVEGCFVEQDKERYVPIVASAHERKAAELADEARELEK ---3333------------------------3333------------------------1 SNDAVLRTPHAPDGKVLSKRKFEDPE 111-------1111------------ >BETA-GLYCOSIDASE; SWP:Q9YGA8; PDB:1QVBA; MKFPKDFMIGYSSSPFQFEAGIPGSEDPNSDWWVWVHDPENTAAGLVSGDFPENGPGYWN --------------1111---2222-------------------------3333--3333 LNQNDHDLAEKLGVNTIRVGVEWSRIFPKPTFNVKVPVERDENGSIVHVDVDDKAVERLD ---------1111--------3333-----3333------1111---------------- ELANKEAVNHYVEMYKDWVERGRKLILNLYHWPLPLWLHNPIMVRRMGPDRAPSGWLNEE -----------------1111-------------3333---------1111--!!!!--- SVVEFAKYAAYIAWKMGELPVMWSTMNEPNVVYEQGYMFVKGGFPPGYLSLEAADKARRN ---------------3333--------3333-------1111------------------ MIQAHARAYDNIKRFSKKPVGLIYAFQWFELLEGPAEVFDKFKSSKLYYFTDIVSKGSSI -----------3333--------------------------------------------- INVEYRRDLANRLDWLGVNYYSRLVYKIVDDKPIILHGYGFLCTPGGISPAENPCSDFGW --------------------------------------!!!!-2222-1111---1111- EVYPEGLYLLLKELYNRYGVDLIVTENGVSDSRDALRPAYLVSHVYSVWKAANEGIPVKG --3333------------------------1111-----------------1111----- YLHWSLTDNYEWAQGFRQKFGLVMVDFKTKKRYLRPSALVFREIATHNGIPDELQHLTLI 
----------!!!!------------------------------------3333------ Q - >SINGLE STRANDED DNA BINDI; SWP:P02339; PDB:1QVCA; ASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLF ------------------------------------------------------------ GKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDRYTTEVVVNVGGTMQMLGGRQGGGA -----------2222--------------------------------------------- PAGGNIGGGQPQGGWGQPQQPQGGN ------------------------- >FIMBRIAL PROTEIN; SWP:P17838; PDB:1QVEA; ISEFARAQLSEAMTLASGLKTKVSDIFSQDGSCPANTAATAGIEKDTDINGKYVAKVTTG ---------------------------------------2222-3333--1111------ GTAAASGGCTIVATMKASDVATPLRGKTLTLTLGNADKGSYTWACTSNADNKYLPKTCQT ---1111-------------1111----------1111-----------3333-1111-- ATTTTP ------ >CLPB PROTEIN; SWP:Q9RA63; PDB:1QVRA; ERWTQAAREALAQAQVLAQRMKHQAIDLPHLWAVLLKDERSLAWRLLEKAGADPKALKEL --------------------------3333-------33333333--------------- QERELARLPKVEGAEVGQYLTSRLSGALNRAEGLMEELKDRYVAVDTLVLALAEATPGLP ----1111---------------------------------------------------- GLEALKGALKELRGGRTVQTEHAESTYNALEQYGIDLTRLAAEGKLDPVIGRDEEIRRVI 3333-----------------------3333----33333333----------------- QILLRRTKNNPVLIGEPGVGKTAIVEGLAQRIVKGDVPEGLKGKRIVSLQMEFEERLKAV -1111----------22223333-----------------1111-------3333----- IQEVVQSQGEVILFIDELKPALARGELRLIGATTLDEYREIEKDPALERRFQPVYVDEPT -----------------3333-------------3333-----3333------------- VEETISILRGLKEKYEVHHGVRISDSAIIAAATLSHRYITERRLPDKAIDLIDEAAARLR ------------------------------------------------------------ MALESAPEEIDALERKKLQLEIEREALKKEKDPDSQERLKAIEAEIAKLTEEIAKLRAEW ------------------------------------------------------------ EREREILRKLREAQHRLDEVRREIELAERQYDLNRAAELRYGELPKLEAEVEALSEKLRG ------------------------------------------3333--------3333-- ARFVRLEVTEEDIAEIVSRWTGIPVSKLLEGEREKLLRLEEELHKRVVGQDEAIRAVADA -----------------1111-3333-----------33333333----3333------- IRRARAGLKDPNRPIGSFLFLGPTGVGKTELAKTLAATLFDTEEAMIRIDMTEYMEKHAV -------------------------------------------------3333----333 
SRLQLTEAVRRRPYSVILFDEIEKAHPDVFNILLQILDDGRLTDSHGRTVDFRNTVIILT 3-------------------3333---------------------------1111----- SNLGSPLILEGLQKGWPYERIRDEVFKVLQQHFRPEFLNRLDEIVVFRPLTKEQIRQIVE -1111------1111-3333--------1111-33331111---------3333------ IQLSYLRARLAEKRISLELTEAAKDFLAERGYDPVFGARPLRRVIQRELETPLAQKILAG --------3333-----------------------!!!!--------------------- EVKEGDRVQVDVGPAGLVFAVPA --2222----------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1QW2A; NYFQGHMQIDSIEIGGKVYQFFKSDLGNAPLLFIKGSKGYACGYLNETSNKVGDIAVRVG -------------iiii------------------1111--3333--------------- VKTLDDLSAKVVEASQEAQKVGINPGDVLRNVIDKLG --3333---------3333----222233333333-- >ALPHA-L-ARABINOFURANOSIDA; SWP:Q9XBQ3; PDB:1QW9A; KATMIIEKDFKIAEIDKRIYGSFIEHLGRAVYGGIYEPGHPQADENGFRQDVIELVKELQ ------1111----------------!!!!2222--3333---1111---------3333 VPIIRYPGGNFVSGYNWEDGVGPKEQRPRRLDLAWKSVETNEIGLNEFMDWAKMVGAEVN --------3333---3333---3333--------------------------1111---- MAVNLGTRGIDAARNLVEYCNHPSGSYYSDLRIAHGYKEPHKIKTWCLGNAMDGPWQIGH -----------------------------------------------------1111--- KTAVEYGRIACEAAKVMKWVDPTIELVVCGSSNRNMPTFAEWEATVLDHTYDHVDYISLH --------------------1111--------1111-------------3333------- QYYGNRDNDTANYLALSLEMDDFIRSVVAIADYVKAKKRSKKTIHLSFDEWNVWYHSNEA --------33331111--------------------------------------111133 DKLIEPWTVAPPLLEDIYNFEDALLVGCMLITLMKHADRVKIACLAQLVNVIAPIMTEKN 33-------------------------------1111--------------------222 GPAWKQTIYYPFMHASVYGRGVALHPVISSPKYDSKDFTDVPYLESIAVYNEEKEEVTIF 2----3333-------------------------1111-------------1111----- AVNRDMEDALLLECDVRSFEDYRVIEHIVLEHDNVKQTNSAQSSPVVPHRNGDAQLSDRK ---------------3333--------------1111--3333----------------- VSATLPKLSWNVIRLGK ----------------- >OUTER MEMBRANE LIPOPROTEI; SWP:P39281; PDB:1QWDA; HLESTSLYKKSSSTPPRGVTVVNNFDAKRYLGTWYEIARFDHRFERGLEKVTATYSLRDD ---------2222-2222-------3333------------1111------------111 GGLNVINKGYNPDRGMWQQSEGKAYFTGAPTRAALKVSFFGPFYGGYNVIALDREYRHAL 
1----------1111-------------1111-------!!!!---------1111---- VCGPDRDYLWILSRTPTISDEVKQEMLAVATREGFDVSKFIWVQQPG ------------------------------1111-1111-------- >(2R)-PHOSPHO-3-SULFOLACTA; SWP:Q57703; PDB:1QWGA; MKAFEFLYEDFQRGLTVVLDKGLPPKFVEDYLKVCGDYIDFVKFGWGTSAVIDRDVVKEK ---3333---------------------------3333------!!!!1111-------- INYYKDWGIKVYPGGTLFEYAYSKGKFDEFLNECEKLGFEAVEISDGSSDISLEERNNAI ----1111-------------1111----------------------------------- KRAKDNGFMVLTEVGKKMPDKDKQLTIDDRIKLINFDLDAGADYVIIEGRESGKGKGLFD ---1111----------33333333------------3333--------3333--!!!!- KEGKVKENELDVLAKNVDINKVIFEAPQKSQQVAFILKFGSSVNLANIAFDEVISLETLR -----------------3333------------------1111-----1111------11 RGLRGDTFGKV 11-3333---- >OSMOTICALLY INDUCIBLE PRO; SWP:P0C0L2; PDB:1QWIA; TIHKKGQAHWESDIKRGKGTVSTESGVLNQQPYGFNTRFEGEKGTNPEELIGAAHAACFS ----------------------3333-------3333----------------------- ALSLLGEAGFTPTSIDTTADVSLDKVDAGFAITKIALKSEVAVPGIDASTFDGIIQKAKA -----1111----------------3333-------------2222-------------- GCPVSQVLKAEITLDYQLKS --3333-------------- >cytidine monophospho-N-ac; SWP:Q5M963; PDB:1QWJA; PPHLAALVLARGGSKGIPLKNIKRLAGVPLIGWVLRAALDAGVFQSVWVSTDHDEIENVA ------------------1111--iiii----------3333----------3333---- KQFGAQVHRRSSETSKDSSTSLDAIVEFLNYHNEVDIVGNIQATSPCLHPTDLQKVAEMI 1111------3333-1111------------1111------1111--------------- REEGYDSVFSVVRRHQFRWSEIQKGVREVTEPLNLNPAKRPRRQDWDGELYENGSFYFAK -----------------------------------3333--3333--------------3 RHLIEMGYLQGGKMAYYEMRAEHSVDIDVDIDWPIAEQRVLRFGYFGK 333----------------3333--3333--3333------------- >ALDO-KETO REDUCTASE FAMIL; SWP:P91020; PDB:1QWKA; TASIKLSNGVEMPVIGLGTWQSSPAEVITAVKTAVKAGYRLIDTASVYQNEEAIGTAIKE -----1111-------------3333------------------3333------------ LLEEGVVKREELFITTKAWTHELAPGKLEGGLRESLKKLQLEYVDLYLAHMPAAFNDDMS -------3333-------3333-2222----------------------------1111- EHIASPVEDVWRQFDAVYKAGLAKAVGVSNWNNDQISRALALGLTPVHNSQVELHLYFPQ ------------------------------------------------------1111-- 
HDHVDFCKKHNISVTSYATLGSPGRVNFTLPTGQKLDWAPAPSDLQDQNVLALAEKTHKT -------1111------1111--------3333---------3333-------------- PAQVLLRYALDRGCAILPKSIQENRIKENFEVFDFSLTEEDIAKLEESKNSQRLFLQDFM ---------1111--------------1111--------------3333-------3333 TGHPEDAFAAER -----1111--- >KATA CATALASE; SWP:P77872; PDB:1QWLA; MVNKDVKQTTAFGAPVWDDNNVITAGPRGPVLLQSTWFLEKLAAFDRERIPERVVHAKGS ---------1111------------1111--1111------------------------- GAYGTFTVTKDITKYTKAKIFSKVGKKTECFFRFSTVAGERGSADAVRDPRGFAMKYYTE -----------3333--3333-2222-------------1111----------------- EGNWDLVGNNTPVFFIRDAIKFPDFIHTQKRDPQTNLPNHDMVWDFWSNVPESLYQVTWV -----------------3333-----1111------------------------------ MSDRGIPKSFRHMDGFGSHTFSLINAKGERFWVKFHFHTMQGVKHLTNEEAAEIRKHDPD -3333---1111------------1111----------1111------------------ SNQRDLFDAIARGDYPKWKLSIQVMPEEDAKKYRFHPFDVTKIWYTQDYPLMEVGIVELN ---------1111------------3333---------1111--3333------------ KNPENYFAEVEQAAFTPANVVPGIGYSPDRMLQGRLFSYGDTHRYRLGVNYPQIPVNKPR ----3333-1111--3333-2222-------------------------33333333--- CPFHSSSRDGYMQNGYYGSLQNYTPSSLPGYKEDKSARDPKFNLAHIEKEFEVWNWDYRA --------------1111------------------------3333----------3333 DDSDYYTQPGDYYRSLPADEKERLHDTIGESLAHVTHKEIVDKQLEHFKKADPKYAEGVK ------------1111---------------1111-3333-------------------- KALEKHQKMMK ----------- >ALPHA-MANNOSIDASE II; SWP:Q24451; PDB:1QWNA; CQDVVQDVPNVDVQMLELYDRMSFKDIDGGVWKQGWNIKYDPLKYNAHHKLKVFVVPHSH -------------------------------1111-----1111-1111----------- NDPGWIQTFEEYYQHDTKHILSNALRHLHDNPEMKFIWAEISYFARFYHDLGENKKLQMK ------------------------------1111------------3333---------- SIVKNGQLEFVTGGWVMPDEANSHWRNVLLQLTEGQTWLKQFMNVTPTASWAIDPFGHSP --1111------------------------------------------------------ TMPYILQKSGFKNMLIQRTHYSVKKELAQQRQLEFLWRQIWDNKGDTALFTHMMPFYSYD -----1111------------------1111-------1111--1111-----------3 IPHTCGPDPKVCCQFDFKRMGSFGLSCPWKVPPRTISDQNVAARSDLLVDQWKKKAELYR 333----333311111111-1111--1111------1111---------------1111- 
TNVLLIPLGDDFRFKQNTEWDVQRVNYERLFEHINSQAHFNVQAQFGTLQEYFDAVHQAE --------------------------------33333333-------------------- RAGQAEFPTLSGDFFTYADRSDNYWSGYYTSRPYHKRMDRVLMHYVRAAEMLSAWHSWDG -------------------!!!!--1111-----------------------3333--33 MARIEERLEQARRELSLFQHHDGITGTAKTHVVVDYEQRMQEALKACQMVMQQSVYRLLT 33--------------------3333---------------------------------- KPSIYSPDFSFSYFTLDDSRWPGSGVEDSRTTIILGEDILPSKHVVMHNTLPHWREQLVD 1111---1111-----------2222---------------------------------- FYVSSPFVSVTDLANNPVEAQVSPVWSWHHDTLTKTIHPQGSTTKYRIIFKARVPPMGLA -----------1111--------------------------------------------- TYVLTISDSKPEHTSYASNLLLRKNPTSLPLGQYPEDVKFGDPREISLRVGNGPTLAFSE ----------1111----------------!!!!---------------!!!!-----11 QGLLKSIQLTQDSPHVPVHFKFLKYGVRSHGDRSGAYLFLPNGPASPVELGQPVVLVTKG 11-------3333----------------------------------------------1 KLESSVSVGLPSVVHQTIMRGGAPEIRNLVDIGSLDNTEIVMRLETHIDSGDIFYTDLNG 111------2222------------------!!!!----------------------%%% LQFIKRRRLDKLPLQANYYPIPSGMFIEDANTRLTLLTGQPLGGSSLASGELEIMQDRRL %-------11113333------------1111---------------2222--------- ASDDERGLGQGVLDNKPVLHIYRLVLEKVNNCVRPSKLHPAGYLTSAAHKASQSLLDPLD ----------------------------1111---1111--------------------- KFIFAENEWIGAQGQFGGDHPSAREDLDVSVMRRLTKSSAKTQRVGYVLHRTNLMQCGTP --------2222----1111---1111---------3333-------------------- EEHTQKLDVCHLLPNVARCERTTLTFLQNLEHLDGMVAPEVCPMETAAYVSSHS -------3333-----------1111------2222-----2222--------- >PHYTASE; SWP:O00092; PDB:1QWOA; CDTVDLGYQCSPATSHLWGQYSPFFSLEDELSVSSKLPKDCRITLVQVLSRHGARYPTSS ----------3333---!!!!-----2222-------2222------------------- KSKKYKKLVTAIQANATDFKGKFAFLKTYNYTLGADDLTPFGEQQLVNSGIKFYQRYKAL -------------------!!!!-3333-----------------------------333 ARSVVPFIRASGSDRVIASGEKFIEGFQQAKLADPGATNRAAPAISVIIPESETFNNTLD 3-----------3333-----------------1111-------------------1111 HGVCTKFEASQLGDEVAANFTALFAPDIRARAEKHLPGVTLTDEDVVSLMDMCSFDTVAR ------1111-------------------------2222--------------------- 
TSDASQLSPFCQLFTHNEWKKYNYLQSLGKYYGYGAGNPLGPAQGIGFTNELIARLTRSP 1111------------------------------3333---------------------- VQDHTSTNSTLVSNPATFPLNATMYVDFSHDNSMVSIFFALGLYNGTEPLSRTSVESAKE -------3333--3333---------------------------------------3333 LDGYSASWVVPFGARAYFETMQCKSEKEPLVRALINDRVVPLHGCDVDKLGRCKLNDFVK iiii3333--2222--------3333--------iiii---------1111--------- GLSWARSGGNWGECF ---------3333-- >MANNOSE-6-PHOSPHATE ISOME; SWP:P39841; PDB:1QWRA; TQSPIFLTPVFKEKIWGGTALRDRFGYSIPSESTGECWAISAHPKGPSTVANGPYKGKTL -------------1111-------------------------1111------1111---- IELWEEHREVFGGVEGDRFPLLTKLLDVKEDTSIKVHPDDYYAGENEEGELGKTECWYII ------3333%%%%--------------------------------iiii---------- DCKENAEIIYGHTARSKTELVTINSGDWEGLLRRIKIKPGDFYYVPSGTLHALCKGALVL --1111----------------11113333-------2222----2222----------- ETQQNSDATYRVYDYDRLDSNGSPRELHFAKAVNAATVPHVDGYIDESTESRKGITIKTF -------------%%%%-1111-----------------------------2222----- VQGEYFSVYKWDINGEAEAQDESFLICSVIEGSGLLKYEDKTCPLKKGDHFILPAQPDFT --1111-------------------------------!!!!----2222----------- IKGTCTLIVSHI ------------ >INTERFERON REGULATORY FAC; SWP:Q14653; PDB:1QWTA; ENPLKRLLVPGEEWEFEVTAFYRGRQVFQQTISCPEGLRLVGSEVGDRTLPGWPVTLPDP -3333---2222---------iiii--------1111---------------------33 GMSLTDRGVMSYVRHVLSCLGGGLALWRAGQWLWAQRLGHCHTYWAVSEELLPNSGHGPD 33-----------------!!!!-----!!!!---------------------------- GEVPKDKEGGVFDLGPFIVDLITFTEGSGRSPRYALWFCVGESWPQDQPWTKRLVMVKVV -----------------------1111----------------33333333--------- PTCLRALVEMARVGGASSLENTVDLHISNSHPLSLTSDQYKAYLQDLVEGMDFQGPGES ------------3333-----------------------------3333---------- >PHEROMONE-BINDING PROTEIN; SWP:P20797; PDB:1QWVA; SPEIMKNLSNNFGKAMDQCKDELSLPDSVVADLYNFWKDDYVMTDRLAGCAINCLATKLD --33333333----------1111-3333---------------33333333-1111--- VVDPDGNLHHGNAKDFAMKHGADETMAQQLVDIIHGCEKSAPPNDDKCMKTIDVAMCFKK ---------------------------------------------1111----------- EIHKLNWVPNMDLVIGEVLAEV ---------------------- >PEPTIDOGLYCAN HYDROLASE; 
SWP:O33599; PDB:1QWYA; VSYGTYYTIDSNGDYHHTPDGNWNQAMFDNKEYSYTFVDAQGHTHYFYNCYPKNANANGS -iiii----1111----------33331111-------1111------------------ GQTYVNPATAGDNNDYTASQSQQHINQYGYQSNVGPDASYYSSGHAKDASWLTSRKQLQP --------2222------------------------3333--------3333-------- YGQYHGGGAHYGVDYAMPENSPVYSLTDGTVVQAGWSNYGGGNQVTIKEANSNNYQWYMH -----------------2222--------------------------------------- NNRLTVSAGDKVKAGDQIAYSGSTGNSTAPHVHFQRMSGGIGNQYAVDPTSYLQ ------2222--2222-------------------------3333---3333-- >NPQTN SPECIFIC SORTASE B; SWP:Q8NX63; PDB:1QWZA; GHHHHHHHHHHSSGHISGDAMEDKQERANYEKLQQKFQMLMSKHQAHVRPQFESLEKINK -----------3333----------------------------1111-------333311 DIVGWIKLSGTSLNYPVLQGKTNHDYLNLDFEREHRRKGSIFMDFRNELKNLNHNTILYG 11-----2222------------1111--1111--1111----3333------------- HHVGDNTMFDVLEDYLKQSFYEKHKIIEFDNKYGKYQLQVFSAYKTTTKDNYIRTDFEND ------!!!!3333------1111------1111-------------------------- QDYQQFLDETKRKSVINSDVNVTVKDRIMTLSTCEDAYSETTKRIVVVAKIIKVS ----------------------1111----------------------------- >Vitamin D-dependent calci; SWP:P02633; PDB:1QX2A; KSPEEIKGAFEVFAAKEGDPNQISKEELKLVMQTLGPSLLKGMSTLDEMIEEVDKNGDGE ------------1111--1111-3333-------!!!!-2222----------1111--- VSFEEFLVMMKKISQ -----------1111 >NADH-CYTOCHROME B5 REDUCT; SWP:P20070; PDB:1QX4A; HMITLENPDIKYPLRLIDKEILSHDTRRFRFALPSPQHILGLPIGQHIYLSTRIDGNLVI ------1111------------------------1111----2222-------iiii--- RPYTPVSSDDDKGFVDLVVKVYFKEAGGKMPQYLENMNIGDTIEFRGPNGLLVYQGKGKF ---------------------------------11112222--------------iiii- AIRADKKSNPVVRTVKSVGMIAGGTGITPMLQVIRAVLKDPNDHTVCYLLFANQSEKDIL ----1111-------------------------------1111-----------333322 LRPELEELRNEHSSRFKLWYTVDKAPDAWDYSQGFVNEEMIRDHLPPPGEETLILMCGPP 22----------------------------------------------1111-------- PMIQFACLPNLERVGHPKERCFTF ----------------3333---- ---------------------------------------------------------- >THIOL PEROXIDASE; SWP:P37901; PDB:1QXHA; SQTVHFQGNPVTVANSIPQAGSKAQTFTLVAKDLSDVTLGQFAGKRKVLNIFPSIDTGVC 
-----iiii---------2222--------1111---33332222--------------- AASVRKFNQLATEIDNTVVLCISADLPFAQSRFCGAEGLNNVITLSTFRNAEFLQAYGVA ----------3333-----------33331111--2222--------------------- IADGPLKGLAARAVVVIDENDNVIFSQLVDEITTEPDYEAALAV ---1111-----------------------1111--3333---- >HA1; SWP:P46084; PDB:1QXMA; TNANDLRNNEVFFISPSNNTNKVLDKISQSEVKLWNKLSGANQKWRLIYDTNKQAYKIKV -1111-2222-----1111-------------------------------1111------ MDNTSLILTWNAPLSSVSVKTDTNGDNQYWYLLQNYISRNVIIRNYMNPNLVLQYNIDDT ------------------------1111----------------3333-------1111- LMVSTQTSSSNQFFKFSNCIYEALNNRNCKLQTQLNSDRFLSKNLNSQIIVLWQWFDSSR -----------------------2222-----3333------------------------ QKWIIEYNETKSAYTLKCQENNRYLTWIQNSNNYVETYQSTDSLIQYWNINYLDNDASKY --------1111-----------------3333--------------------------- ILYNLQDTNRVLDVYNSQIANGTHVIVDSYHGNTNQQWIINLI ---3333-------%%%%-2222-------------------- >SULFIDE DEHYDROGENASE; SWP:Q56748; PDB:1QXNA; ADMGEKFDATFKAQVKAAKADMVMLSPKDAYKLLQENPDITLIDVRDPDELKAMGKPDVK ---------------3333----------------------------------------- NYKHMSRGKLEPLLAKSGLDPEKPVVVFCKTAARAALAGKTLREYGFKTIYNSEGGMDKW ----------11113333------------3333---------------------3333- LEEGLPSLDRSHHHHHH ----------------- >CHORISMATE SYNTHASE; SWP:Q97Q57; PDB:1QXOA; RYLTAGESHGPRLTAIIEGIPAGLPLTAEDINEDLRRRQGGYGRGGRKIENDQVVFTSGV ------1111----------------3333------1111-------------------- RHGKTTGAPITDVINKDHQKWLDISAEDIEDRLKSKRKITHPRPGHADLVGGIKYRFDDL iiii------------33333333-----3333-2222----2222------------33 RNSLERSSARETTRVAVGAVAKRLLAELDEIANHVVVFGGKEIDVPENLTVAEIKQRAAQ 33------3333-------------1111--------iiii------------------- SEVSIVNQEREQEIKDYIDQIKRDGDTIGGVVETVVGGVPVGLGSYVQWDRKLDARLAQA -------1111------------------------------------1111--------- VVSINAFKGVEFGLGFEAGYRKGSQVDEILWSKEDGYTRRTNNLGGFEGGTNGQPIVVRG ---2222----!!!!3333--3333----------------1111--------------- VKPIPTLYKPLSVDIETHEPYKATVERSDPTALPAAGVEAVVATVLAQEILEKFSSDNLE -------------------------------3333------------------------- ELKEAVAKHRDYTKNY 
------------1111 >METHIONYL AMINOPEPTIDASE; SWP:P0A078; PDB:1QXYA; MIVKTEEELQALKEIGYICAKVRNTMQAATKPGITTKELDNIAKELFEEYGAISAPIHDE --------------------------11112222-------------------------- NFPGQTCISVNEEVAHGIPSKRVIREGDLVNIDVSALKNGYYADTGISFVVGESDDPMKQ ---------!!!!-----------2222---------iiii------------------- KVCDVATMAFENAIAKVKPGTKLSNIGKAVHNTARQNDLKVIKNLTGHGVGLSLHEAPAH -------------111122223333---------1111---1111--------------- VLNYFDPKDKTLLTEGMVLAIEPFISSNASFVTEGKNEWAFETSDKSFVAQIEHTVIVTK -----1111----2222-------------------------1111------------11 DGPILTTKI 11------- >ENDOPLASMIN; SWP:P41148; PDB:1QY5A; KSEKFAFQAEVNRMMKLIINSLYKNKEIFLRELISNASDALDKIRLISLTDENALAGNEE ---------------------1111---------------------333311111111-- LTVKIKCDKEKNLLHVTDTGVGMTREELVKNLGTIAKSGTSEFLNKMTEAQEDGQSTSEL --------1111------------------------3333-------------------- IGQFGVGFYSAFLVADKVIVTSKHNNDTQHIWESDSNEFSVIADPRGNTLGRGTTITLVL -----3333--------------1111----------------1111------------- KEEASDYLELDTIKNLVKKYSQFINFPIYVWSSKTKTVWDWELMN 3333----------------------------------------- >NITROGEN REGULATORY PROTE; SWP:P80016; PDB:1QY7A; MKKIEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFGRQKGQTERYRGAEYTVEFLQKLK --------3333--------1111---------------------2222----------- LEIVVEDAQVDTVIDKIVAAARTGEIGDGKIFVSPVDQTIRIRTGEKNADAI -----3333---------------2222------------------------ >HYPOTHETICAL PROTEIN YDDE; SWP:P37757; PDB:1QYAA; LKPQVYHVDAFTSQPFRGNSAGVVFPADNLSEAQMQLIARELGHSETAFLLHSDDSDVRI -------------2222------------------------------------------- RYFTPTVEVPIHATVAAHYVRAKVLGLGNCTIWQTSLKHRVTIEKHNDDYRISLEQGTPG ---1111--------------------------------------%%%%----------- FEPPLEGETRAAIINALHLTEDDILPGLPIQVATTGHSKVMIPLKPEVDIDALSPDLNAL --------------1111-3333-2222----------------33333333-------- TAISKKIGCNGFFPFQIRPGKNETDGRMFSPAIGIVEDPVTGNANGPMGAWLVHHNVLPH -----------------2222--------3333-------1111---------------- DGNVLRVKGHQGRALGRDGMIEVTVTIRDNQPEKVTISGTAVILFHAEWAIEL -----------3333------------%%%%---------------------- 
>PHENYLCOUMARAN BENZYLIC E; SWP:Q9LL41; PDB:1QYCA; GSRSRILLIGATGYIGRHVAKASLDLGHPTFLLVRESTASSNSEKAQLLESFKASGANIV ---------1111-3333-----1111----------3333-----------1111---- HGSIDDHASLVEAVKNVDVVISTVGSLQIESQVNIIKAIKEVGTVKRFFPSEFGNDVDNV -----3333----3333-------3333---------------------------1111- HAVEPAKSVFEVKAKVRRAIEAEGIPYTYVSSNCFAGYFLRSLAQAGLTAPPRDKVVILG ----3333------------1111----------1111-1111---------------!! DGNARVVFVKEEDIGTFTIKAVDDPRTLNKTLYLRLPANTLSLNELVALWEKKIDKTLEK !!-------3333-------11111111-------3333------------1111----- AYVPEEEVLKLIADTPFPANISIAISHSIFVKGDQTNFEIGPAGVEASQLYPDVKYTTVD -----------1111-----------------1111----1111-3333-3333------ EYLSNFV -3333-- >PINORESINOL-LARICIRESINOL; SWP:Q9LD14; PDB:1QYDA; DKKSRVLIVGGTGYIGKRIVNASISLGHPTYVLFRPEVVSNIDKVQMLLYFKQLGAKLIE --------------3333-----1111----------2222------------------- ASLDDHQRLVDALKQVDVVISALAGGVLSHHILEQLKLVEAIKEAGNIKRFLPSEFGMDP ----3333----1111------------------------------------------11 DIMEHALQPGSITFIDKRKVRRAIEAASIPYTYVSSNMFAGYFAGSLAQLDGHMMPPRDK 11------------------------------------1111---%%%%----------- VLIYGDGNVKGIWVDEDDVGTYTIKSIDDPQTLNKTMYIRPPMNILSQKEVIQIWERLSE --------------3333----------3333--------3333----------3333-- QNLDKIYISSQDFLADMKDKSYEEKIVRCHLYQIFFRGDLYNFEIGPNAIEATKLYPEVK ----------------1111------------------------------3333-3333- YVTMDSYLERYV --3333------ >HYPOTHETICAL PROTEIN; SWP:Q8NW41; PDB:1QYIA; KKILFDVDGVFLSEERCFDVSALTVYELLDKCYLGLHSHIDWETLTDNDIQDIRNRIFQK ------2222---3333-----------------------1111-3333---------ii DKILNKLKSLGLNSNWDLFIVFSIHLIDILKKLSHDEIEAFYQDEPVELKLQNISTNLAD ii-----1111--3333----------------------------333311113333--- CFNLNEQLPLQFLDNVKVGKNNIYAALEEFATTELHVSDATLFSLKGALWTLAQEVYQEW -------3333---------------------1111---33332222------------- YLGSKLYEDVEKKIARTTFKTGYIYQEIILRPVDEVKVLLNDLKGAGFELGIATGRPYTE ----------------------1111-----3333--------1111---------3333 TVVPFENLGLLPYFEADFIATASDVLEAENYPQARPLGKPNPFSYIAALYGNNRDKYESY 
---------3333-1111------------1111---3333-----------33333333 INKQDNIVNKDDVFIVGDSLADLLSAQKIGATFIGTLTGLKGKDAAGELEAHHADYVINH --------1111------3333----------------11111111---1111------3 LGELRGVLDNLLEHH 333------------ >PROTEIN-EXPORT PROTEIN SE; SWP:P15040; PDB:1QYNA; MTFQIQRIYTKDISFEAPNAPHVFQKDWQPEVKLDLDTASSQLADDVYEVVLRVTVTASL --------------------3333-------------------2222------------- GEETAFLCEVQQGGIFSIAGIEGTQMAHCLGAYCPNILFPYARECITSMVSRGTFPQLNL ------------------------------------------------------------ APVNFDALFMNYLQ -------------- --------------------------------------------------------- >HIGH LEVEL KASUGAMYCIN RE; SWP:P06992; PDB:1QYRA; QNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQ -------------------2222------!!!!------1111----------------- THPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADM -----1111-----3333--------------------3333-----------3333--- HFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVP ----3333---------3333-------------------3333---------------- HATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQ ---------3333----------11113333-----------1111-11113333----- YCQMANYLAENA ------------ >TOP7; SWP:NA; PDB:1QYSA; DIQVQVNIDDNGKNFDYTYTVTTESELQKVLNELDYIKKQGAKRVRISITARTKKEAEKF ----------------------------------------------------3333---- AAILIKVFAELGYNDINVTFDGDTVTVEGQL --------------------!!!!------- >POLYPROTEIN 1AB; SWP:P59641; PDB:1QZ8A; ELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPKSDGT ----------------3333---------------------------------------- GTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ ----------------1111--------2222------------------- >KYNURENINASE; SWP:P83788; PDB:1QZ9A; TTRNDCLALDAQDSLAPLRQQFALPEGVIYLDGNSLGARPVAALARAQAVIAEEWGNGLI ------------11113333----2222---1111----3333----------------- RSWNSAGWRDLSERLGNRLATLIGARDGEVVVTDTTSINLFKVLSAALRVQATRSPERRV -----------------3333--------------------------------------- IVTETSNFPTDLYIAEGLADMLQQGYTLRLVDSPEELPQAIDQDTAVVMLTHVNYKTGYM 
---11113333---------------------33333333-1111--------------- HDMQALTALSHECGALAIWDLAHSAGAVPVDLHQAGADYAIGCTYKYLNGGPGSQAFVWV --------------------1111------3333-----------1111-2222------ SPQLCDLVPQPLSGWFGHSRQFAMEPRYEPSNGIARYLCGTQPITSLAMVECGLDVFAQT 3333----------1111--------------3333------------------3333-- DMASLRRKSLALTDLFIELVEQRCAAHELTLVTPREHAKRGSHVSFEHPEGYAVIQALID -----------------------1111--------3333--------------------- RGVIGDYREPRIMRFGFTPLYTTFTEVWDAVQILGEILDRKTWA -----------------3333-------------------3333 >PROTECTION OF TELOMERES P; SWP:O13988; PDB:1QZGA; VIDSLQLNELLNAGEYKIGELTFQSIRSSQELQKKNTIVNLFGIVKDFTPSRQSLHGTKD -----------------!!!!----3333--------------------------!!!!- WVTTVYLWDPTCDTSSIGLQIHLFSKQGNDLPVIKQVGQPLLLHQITLRSYRDRTQGLSK --------11111111-------------------2222-----------%%%%-----1 DQFRYALWPDFSSNSKDTLCPQPMPRLMKTGDKEEQFALLLNKIWDEQTN 111------1111----------1111----------------------- >ATP-DEPENDENT PROTEASE LA; SWP:P08177; PDB:1QZMA; SGYTEDEKLNIAKRHLLPKQIERNALKKGELTVDDSAIIGIIRYYTREAGVRGLEREISK ---------------------1111-1111------------------------------ LCRKAVKQLLLDKSLKHIEINGDNLHDYLGVQRF -----------1111-----11113333------ >DEMATIN; SWP:Q08495; PDB:1QZPA; PGLQIYPYEMLVVTNKGRTKLPPGVDRMRLERHLSAEDFSRVFAMSPEEFGKLALWKRNE -------3333-----------------------3333---------------------- LKKKASLF -------- >HYPOTHETICAL PROTEIN MDS0; SWP:Q96CD2; PDB:1QZUA; MERKFHVLVGVTGSVAALKLPLLVSKLLGLEVAVVTTERAKHFYSPQDIPVTLYSDADEW -------------3333--------------------3333---3333-------33331 EMWKSRSDPVLHIDLRRWADLLLVAPLDANTLGKVASGICDNLLTCVMRAWDRSKPLLFC 111-3333-----3333----------3333---1111---------11113333----- PAMNTAMWEHPITAQQVDQLKAFGYVEIPVGTIVDKVKEV -----3333---------3333------------------ >SIGNAL RECOGNITION 54 KDA; SWP:Q97ZE7; PDB:1QZWA; MLENIRDAVRKFLTGSTPYEKAVDEFIKDLQKSLISSDVNVKLVFSLTAKIKERLNKEKP --------3333-------3333----------3333----------------------- PSVLERKEWFISIVYDELSKLFGGDKEPNVNPTKLPFIIMLVGVQGSGKTTTAGKLAYFY 
------------------3333------------------------------------33 KKRGYKVGLVAADVYRPAAYDQLLQLGNQIGVQVYGEPNNQNPIEIAKKGVDIFVKNKMD 33----------------------------------------------------1111-- IIIVDTAGRHGYGEETKLLEEMKEMYDVLKPDDVILVIDASIGQKAYDLASRFHQASPIG --------------------------------------3333----------33333333 SVIITKMDGTAKGGGALSAVVATGATIKFIGTGEKIDELETFNAKRFVSRILGMGDIESI ------------------------------------------------------------ LEKVKGLEEYDKIQKKMEDVMEGKGKLTLRDVYAQIIALRKMGPLSKVLQHIPGLGIMLP --3333---3333-------------------------3333------------------ TPSEDQLKIGEEKIRRWLAALNSMTYKELENPNIIDKSRMRRIAEGSGLEVEEVRELLEW ---------------3333-1111--------------------1111--------3333 YNNMNRLLKMVK ------------ >ACLACINOMYCIN-10-HYDROXYL; SWP:Q54527; PDB:1QZZA; LEPTDQDLDVLLKNLGNLVTPMALRVAATLRLVDHLLAGADTLAGLADRTDTHPQALSRL -------------1111------------------1111--------------------- VRHLTVVGVLEGGEKGRPLRPTRLGMLLADGHPAQQRAWLDLNGAVSHADLAFTGLLDVV ----1111--------------3333--1111--------3333---------------- RTGRPAYAGRYGRPFWEDLSADVALADSFDALMSCDEDLAYEAPADAYDWSAVRHVLDVG ------3333------------3333------33331111----1111-1111------- GGNGGMLAAIALRAPHLRGTLVELAGPAERARRRFADAGLADRVTVAEGDFFKPLPVTAD !!!!---------1111------------------1111--------------------- VVLLSFVLLNWSDEDALTILRGCVRALEPGGRLLVLDRADRFFSTLLDLRMLTFMGGRVR ------3333-------------11112222----------------------------- TRDEVVDLAGSAGLALASERTSGSTTLPFDFSILEFTAVS -3333----------------------------------- >OREXIN-A; SWP:O43612; PDB:1R02A; QPLPDCCRQKTCSCRLYELLHGAGNHAAGILTL ------%%%%-------------33333333-- >MITOCHONDRIAL FERRITIN; SWP:Q8N4E7; PDB:1R03A; SRVRQNFHPDSEAAINRQINLELYASYVYLSMAYYFSRDDVALNNFSRYFLHQSREETEH 1111-------------------------------------------------------- AEKLMRLQNQRGGRIRLQDIKKPEQDDWESGLHAMECALLLEKNVNQSLLELHALASDKG --------------------------------------------------------1111 DPHLCDFLETYYLNEQVKSIKELGDHVHNLVKMGAPDAGLAEYLFDTHTLG -------------------------------------1111------1111 >REVERSE TRANSCRIPTASE; SWP:NA; PDB:1R0AH; 
QITLKESGPGIVQPSQPFRLTCTFSGFSLSTSGIGVTWIRQPSGKGLEWLATIWWDDDNR -----------------------------------------2222--------------- YNPSLKSRLTVSKDTSNNQAFLNMMTVETADTAIYYCAQSAITSVTDSAMDHWGQGTSVT -33331111----3333----------3333----------------------------- VSSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL -------------------------------------------%%%%------------- QSDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPADC %%%%---------1111-----------3333------------- >REVERSE TRANSCRIPTASE; SWP:NA; PDB:1R0AL; DIQMTQTTSSLSASLGDRVTISCSASQDISSYLNWYQQKPEGTVKLLIYYTSSLHSGVPS ----------------------------%%%%------3333------------2222-- RFSGSGSGTDYSLTISNLEPEDIATYYCQQYSKFPWTFGGGTKLEIKRADAAPTVSIFPP ------!!!!--------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT ----3333---------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNR -3333-------------------------- >HUNTINGTIN INTERACTING PR; SWP:O75146; PDB:1R0DA; DVRQEELGAVVDKEAATSAAIEDAVRRIEDNQARHASSGVKLEVNERILNSCTDLKAIRL --3333------------------------3333-------------------------- LVTTSTSLQKEIVESGRGAATQQEFYAKNSRWTEGLISASKAVGWGATQLVEAADKVVLH ----------------!!!!---------------------------------------- TGKYEELIVCSHEIAASTAQLVAASKVKANKHSPHLSRLQECSRTVNERAANVVASTKSG --3333-------------------1111------------------------------- QEQIEDRDT ---1111-- >1-DEOXY-D-XYLULOSE 5-PHOS; SWP:Q9X5F2; PDB:1R0KA; SQPRTVTVLGATGSIGHSTLDLIERNLDRYQVIALTANRNVKDLADAAKRTNAKRAVIAD -------------------------3333------------------------------3 PSLYNDLKEALAGSSVEAAAGADALVEAAMMGADWTMAAIIGCAGLKATLAAIRKGKTVA 333-------2222-------3333--1111----------3333--------------- LANKESLVSAGGLMIDAVREHGTTLLPVDSEHNAIFQCFPHHNRDYVRRIIITASGGPFR --3333-----------------------------111122221111---------1111 TTSLAEMATVTPERAVQGAKISIDSATMMNKGLELIEAFHLFQIPLEKFEILVHPQSVIH --33331111----------------------------------3333-----3333--- SMVEYLDGSILAQIGSPDMRTPIGHTLAWPKRMETPAESLDFTKLRQMDFEAPDYERFPA 
----1111--------------------------------3333---------3333--- LTLAMESIKSGGARPAVMNAANEIAVAAFLDKKIGFLDIAKIVEKTLDHYTPATPSSLED ----------!!!!--------------1111--3333---------------------- VFAIDNEARIQAAALMESLP -------------------- >N-ACYLAMINO ACID RACEMASE; SWP:Q9RYA6; PDB:1R0MA; RMFKIEAAEIVVARLPLKTHKVVPLLILHGEGVQGVAEGTMEARPMYREETIAGALDLLR -----------------------------iiii--------------------------- GTFLPAILGQTFANPEAVSDALGSYRGNRMARAMVEMAAWDLWARTLGVPLGTLLGGHKE ------2222---------1111---------------------1111----1111---- QVEVGVSLGIQADEQATVDLVRRHVEQGYRRIKLKIKPGWDVQPVRATREAFPDIRLTVD ------------------------1111--------2222-------------------- ANSAYTLADAGRLRQLDEYDLTYIEQPLAWDDLVDHAELARRIRTPLCLDESVASASDAR %%%%-3333------3333---------1111------3333-------1111------- KALALGAGGVINLKVARVGGHAESRRVHDVAQSFGAPVWCGGMLESGIGRAHNIHLSTLS -------------3333--------------1111-----------------------11 NFRLPGDTSSASRYWERDLIQEPLEAVDGLMPVPQGPGTGVTLDREFLATVTEAQEEHRA 11-------3333-------------iiii------!!!!-------------------- >Ecdysone receptor; SWP:P34021; PDB:1R0NB; ELCLVCGDRASGYHYNALTCEGCKGFFRRSVTKSAVYCCKFGRACEMDMYMRRKCQECRL -------------iiii-------------1111---------------3333------- KKCLAVGMRPECVVPENQCAMKRREK ---1111-3333--------3333-- >ULTRASPIRACLE PROTEIN; SWP:P20153; PDB:1R0OA; HLCSICGDRASGKHYGVYSCEGCKGFFKRTVRKDLTYACRENRNCIIDKRQRNRCQYCRY -------------iiii-------------1111----------------1111------ QKCLTCGMKREAVQEE ---1111--------- >HEPATOCYTE GROWTH FACTOR ; SWP:P08581; PDB:1R0PA; TVHIDLSALNPELVQAVQHVVIGPSSLIVHFNEVIGRGHFGCVYHGTLLDKIHCAVKSLN ----3333--------3333--1111---1111----3333------------------- RITDIGEVSQFLTEGIIMKDFSHPNVLSLLGICLRSEGSPLVVLPYMKHGDLRNFIRNET ----------------3333--1111---------%%%%-------1111-------111 HNPTVKDLIGFGLQVAKGMKFLASKKFVHRDLAARNCMLDEKFTVKVADFGLARDMYDKE 1---------------------1111------3333---1111------!!!!---3333 FDSVHNKTGAKLPVKWMALESLQTQKFTTKSDVWSFGVLLWELMTRGAPPYPITVYLLQG --1111------1111--------------------------1111----------3333 
RRLLQPEYCPDPLYEVMLKCWHPKAEMRPSFSELVSRISAIFSTFIGEHYVHVNATYVNV -----1111--------------3333--------------1111--------1111--- K - >Subtilisin Carlsberg [Pre; SWP:P00780; PDB:1R0RE; AQTVPYGIPLIKADKVQAQGFKGANVKVAVLDTGIQASHPDLNVVGGASFVAGEAYNTDG ----------------3333--2222---------1111-----------2222------ NGHGTHVAGTVAALDNTTGVLGVAPSVSLYAVKVLNSSGSGSYSGIVSGIEWATTNGMDV -----------------------1111--------1111--------------1111--- INMSLGGASGSTAMKQAVDNAYARGVVVVAAAGNSGNSGSTNTIGYPAKYDSVIAVGAVD ---------------------1111------------!!!!-----3333---------1 SNSNRASFSSVGAELEVMAPGAGVYSTYPTNTYATLNGTSMASPHVAGAAALILSKHPNL 111--1111--1111----------------------3333---------------1111 SASQVRNRLSSTATYLGSSFYYGKGLINVEAAAQ ---------1111----3333!!!!--3333--- >SUBTILISIN CARLSBERG; SWP:P01004; PDB:1R0RI; VDCSEYPKPACTLEYRPLCGSDNKTYGNKCNFCNAVVESNGTLTLSHFGKC --1111-------------1111-------------1111----------- >PROTEIN YWIB; SWP:NA; PDB:1R0UA; MKQETPITLHVKSVIEDDGNQEVIEFRTTGFYYVKQNKVYLSYYEEHDLGKVKTIVKVSE ----------------iiii--------------%%%%--------1111--------22 GEVLVMRSGAVKMNQRFVTGASTIAKYKMSFGELELKTSTKSIQSDLDEEKGRISIAYDM 22---------------2222-------1111---------------------------- HVGHLHNMTITYEGGT ---------------- >TRNA-INTRON ENDONUCLEASE; SWP:O29362; PDB:1R0VA; DFSTYYFVYEDLRDRGNKVKIQGEFLLTKKPYLPISERKTIRMEEIAEKARNFDELRLAV ------------1111-----!!!!----------1111----------2222------- VDEESEITYFRVYEPDMMGEQKEELPEIAGVLSDEYVITKQTEIFSRYFYGSEKGDLVTL -1111------------------------------------------------------- SLIESLYLLDLGKLNLLNADREELVKRAREVERNFDRRYEVYRNLKERGFVVKTGFKFGS ------------------------------------------------------3333-- EFRVYRKVESVDDLPHSEYLVDIADSREIRLIDLARAVRLAQNVRKRMVFAYGKNYLCFE ---------3333----------!!!!--------------1111------!!!!----- RVKV ---- >Cystic fibrosis transmemb; SWP:P26361; PDB:1R0WA; TGIIMENVTAFWEEGFGELLEKVQSFSHLCLVGNPVLKNINLNIEKGEMLAITGSTGSGK ------------2222-------------1111-----------2222------2222-- TSLLMLILGELEASEGIIKHSGRVSFCSQFSWIMPGTIKENIIFGVSYDEYRYKSVVKAC 
------------------------------------------2222-----------111 QLQQDITKFAEQDNTVLGEGGVTLSGGQRARISLARAVYKDADLYLLDSPFGYLDVFTEE 1-3333--1111-----2222--------------------------------------- QVFESCVCKLMANKTRILVTSKMEHLRKADKILILHQGSSYFYGTFSELQSLRPDFSSKL --------1111---------3333----------iiii-------------------11 MGYDTFDQFTEERRSSILTETLRRFS 11--3333------------------ >ADP-RIBOSYL CYCLASE; SWP:P29241; PDB:1R12A; IVPTRELENVFLGRCKDYEITRYLDILPRVRSDCSALWKDFFKAFSFKNPCDLDLGSYKD ----------------------1111----------------------1111-1111--- FFTSAQQQLPKNKVMFWSGVYDEAHDYANTGRKYITLEDTLPGYMLNSLVWCGQRANPGF ---------2222---------------iiii---3333------2222----------- NEKVCPDFKTCPVQARESFWGMASSSYAHSAEGEVTYMVDGSNPKVPAYRPDSFFGKYEL ------3333-3333------------1111-----------1111---11113333-33 PNLTNKVTRVKVIVLHRLGEKIIEKCGAGSLLDLEKLVKAKHFAFDCVENPRAVLFLLCS 33--------------2222----2222----------1111---------------333 DNPNARECRLA 311111111-- >PULMONARY SURFACTANT-ASSO; SWP:P08427; PDB:1R13A; DEELQTELYEIKHQILQTMGVLSLQGSMLSVGDKVFSTNGQSVNFDTIKEMCTRAGGNIA ------------------------------!!!!------------------1111---- VPRTPEENEAIASIAKKYNNYVYLGMIEDQTPGDFHYLDGASVSYTNWYPGEPRGQGKEK ------------------------------------1111--------2222-------- CVEMYTDGTWNDRGCLQYRLAVCEF ----1111----------------- >FIBRINOGEN-BINDING PROTEI; SWP:Q9KI13; PDB:1R17A; GSNVNHLIKVTDQSITEGYDDSDGIIKAHDAENLIYDVTFEVDDKVKSGDTMTVNIDKNT ---3333--------------2222-1111------------33332222------1111 VPSDLTDSFAIPKIKDNSGEIIATGTYDNTNKQITYTFTDYVDKYENIKAHLKLTSYIDK ---------------1111-------------------3333----------------33 SKVPNNNTKLDVEYKTALSSVNKTITVEYQKPNENRTANLQSMFTNIDTKNHTVEQTIYI 33-------------!!!!---------------!!!!---------------------- NPLRYSAKETNVNISGNGDEGSTIIDDSTIIKVYKVGDNQNLPDSNRIYDYSEYEDVTND 1111---------------------1111-------1111---------3333-----33 DYAQLGNNNDVNINFGNIDSPYIIKVISKYDPNKDDYTTIQQTVTMQTTINEYTGEFRTA 33----------------------------1111-1111-----------3333------ SYDNTIAFSTSSGQGQGDLPP --------------------- >Protein-L-isoaspartate(D-; 
SWP:Q27869; PDB:1R18A; HMAWRSVGANNEDLIRQLKDHGVIASDAVAQAMKETDRKHYSPRNPYMDAPQPIGGGVTI ------------------1111----------11113333--------------%%%%-- SAPHMHAFALEYLRDHLKPGARILDVGSGSGYLTACFYRYIKAKGVDADTRIVGIEHQAE -3333-----1111---2222------!!!!---------------1111---------- LVRRSKANLNTDDRSMLDSGQLLIVEGDGRKGYPPNAPYNAIHVGAAAPDTPTELINQLA ------------3333-----------3333-3333-------------------11112 SGGRLIVPVGPDGGSQYMQQYDKDANGKVEMTRLMGVMYVPL 222--------------------1111--------------- >PALICOUREIN; SWP:P84645; PDB:1R1FA; TFCGETCRVIPVCTYSAALGCTCDDRSDGLCKRNGDP -------------1111-------------------- >NEPRILYSIN; SWP:P08473; PDB:1R1HA; GICKSSDCIKSAARLIQNMDATTEPCTDFFKYACGGWLKRNVIPETSSRYGNFDILRDEL -------------------11111111----------------1111------------- EVVLKDVLQEPKTEDIVAVQKAKALYRSCINESAIDSRGGEPLLKLLPDIYGWPVATENW -----------1111----------------------!!!!-33331111--3333--33 EQKYGASWTAEKAIAQLNSKYGKKVLINLFVGTDDKNSVNHVIHIDQPRLGLPSRDYYEC 33------3333---------------------1111----------------3333--- TGIYKEACTAYVDFMISVARLIRQEERLPIDENQLALEMNKVMELEKEIANATAKPEDRN !!!!--------------------------------------------------3333-- DPMLLYNKMTLAQIQNNFSLEINGKPFSWLNFTNEIMSTVNISITNEEDVVVYAPEYLTK 3333----------------------------------------1111------------ LKPILTKYSARDLQNLMSWRFIMDLVSSLSRTYKESRNAFRKALYGTTSETATWRRCANY ---3333-------------33331111-------------------------------- VNGNMENAVGRLYVEAAFAGESKHVVEDLIAQIREVFIQTLDDLTWMDAETKKRAEEKAL --------------------------------------3333----------------11 AIKERIGYPDDIVSNDNKLNNEYLELNYKEDEYFENIIQNLKFSQSKQLKKLREKVDKDE 11------3333----------1111--1111------------------1111--1111 WISGAAVVNAFYSSGRNQIVFPAGILQPPFFSAQQSNSLNYGGIGMVIGHEITHGFDDNG -------------1111----3333------1111------------------------- RNFNKDGDLVDWWTQQSASNFKEQSQCMVYQYGNFSWDLAGGQHLNGINTLGENIADNGG ---1111------------------------------1111----3333----------- LGQAYRAYQNYIKKNGEEKLLPGLDLNHKQLFFLNFAQVWCGTYRPEYAVNSIKTDVHSP --------------------2222------------3333----------3333------ 
GNFRIIGTLQNSAEFSEAFHCRKNSYMNPEKKCRVW ---------------------2222----------- >Ecdysone receptor; SWP:O18473; PDB:1R1KD; VPPLTANQKSLIARLVWYQEGYEQPSEEDLKRVTQTWDSDMPFRQITEMTILTVQLIVEF -------------------------3333------------------------------- AKGLPGFAKISQSDQITLLKACSSEVMMLRVARRYDAATDSVLFANNQAYTRDNYRKAGM ---2222------------------------1111-1111---1111---3333-----3 AYVIEDLLHFCRCMYSMMMDNVHYALLTAIVIFSDRPGLEQPLLVEEIQRYYLNTLRVYI 3333333-------3333-3333------3333-------33333333------------ LNQNSASPRCAVIFGKILGILTEIRTLGMQNSNMCISLKLKNRKLPPFLEEIWDVA --------------------------------------1111--------1111-- >OUTER MEMBRANE PROTEIN CL; SWP:P38367; PDB:1R1MA; PQYVDETISLSAKTLFGFDKDSLRAEAQDNLKVLAQRLSRTNIQSVRVEGHTDFMGSDKY ----------3333-!!!!------------------1111------------------- NQALSERRAYVVANNLVSNGVPVSRISAVGLGESQAQMTQVCEAEVAKLGAKVSKAKKRE ----------------1111-3333-----!!!!-------------------------- ALIACIEPDRRVDVKIRSIV -----3333----------- >GRB2-RELATED ADAPTOR PROT; SWP:O89100; PDB:1R1QA; IDIEFPEWFHEGLSRHQAENLLMGKDIGFFIIRASQSSPGDFSISVRHEDDVQHFKVMRD -----11111111-------3333-2222--------2222------1111--------1 TKGNYFLWTEKFPSLNKLVDYYRTTSISKQKQVFLRD 111---------------------------------- >TRANSCRIPTIONAL REPRESSOR; SWP:P30340; PDB:1R1TA; ELQAIAPEVAQSLAEFFAVLADPNRLRLLSLLARSELCVGDLAQAIGVSESAVSHQLRSL -----------------1111--------------------------------------- RNLRLVSYRKQGRHVYYQLQDHHIVALYQNALDHLQEC 1111------!!!!------------------------ >REPRESSOR PROTEIN; SWP:Q7A2M9; PDB:1R1UA; NTDTLERVTEIFKALGDYNRIRIMELLSVSEASVGHISHQLNLSQSNVSHQLKLLKSVHL ------------1111---------3333--------------3333--------1111- VKAKRQGQSMIYSLDDIHVATMLKQAIHHANHPK -----!!!!-------------------1111-- >ANTIGEN KI-67; SWP:P46013; PDB:1R21A; MWPTRRLVTIKRSGVDGPHFPLSLSTCLFGRGIECDIRIQLPVVSKQHCKIEIHEQEAIL -------------------------------3333-----3333---------------- HNFSSTNPTQVNGSVIDEPVRLKHGDVITIIDRSFRYENE ----------%%%%--------2222-------------- >IGG3-KAPPA ANTIBODY (LIGH; SWP:NA; PDB:1R24B; 
DVQLVESGGGLVQPGGSRKLSCAASGFTFSNFGMHWVRQAPEKGLEWVAYISSGGSSINY ----------------------------1111---------------------------- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTAIYYCTRGGTGTRSLYYFDYWGQGATLIV 1111---------1111------------------------------------------- SSATTTAPSVYPLVPGCSDTSGSSVTLGCLVKGYFPGPVTVKWNYGALSSGVRTVSSVLQ -----------------------------------------------------------i SGFYSLSSLVTVPSSTWPSQTVICNVAHPASKTDLIK iii---------------------------------- >THIOREDOXIN; SWP:NA; PDB:1R26A; PSVVDVYSVEQFRNIMSEDILTVAWFTAVWCGPCKTIERPMEKIAYEFPTVKFAKVDADN -----------------------------------------------1111-----3333 NSEIVSKCRVLQLPTFIIARSGKMLGHVIGANPGMLRQKLRDIIKD -------------------iiii----------------------- >B-CELL LYMPHOMA 6 PROTEIN; SWP:P41182; PDB:1R29A; SQIQFTRHASDVLLNLNRLRSRDILTDVVIVVSREQFRAHKTVLMACSGLFYSIFTDQLK ----1111-----------1111--------!!!!------------------3333111 RNLSVINLDPEINPEGFNILLDFMYTSRLNLREGNIMAVMATAMYLQMEHVVDTCRKFIK 1-------3333-------------------1111------------------------- AS -- >RIBONUCLEOTIDE REDUCTASE ; SWP:P17424; PDB:1R2FA; ISAINWNKIQDDKDLEVWNRLTSNFWLPEKVPLSNDIPAWQTLSAAEQQLTIRVFTGLTL ----------3333-------1111-3333-3333--3333------------------- LDTIQNIAGAPSLMADAITPHEEAVLSNISFMEAVHARSYSSIFSTLCQTKEVDAAYAWS ---------33331111-3333-------------------------------------- EENPPLQRKAQIILAHYVSDEPLKKKIASVFLESFLFYSGFWLPMYFSSRGKLTNTADLI ------------------------------------3333-------1111--------- RLIIRDEAVHGYYIGYKYQIALQKLSAIEREELKLFALDLLMELYDNEIRYTEALYAETG -------------------3333--------------------------------2222- WVNDVKAFLCYNANKALMNLGYEALFPPEMADVNPAILAALSP -----------------1111-----3333------------- >PROTEIN FKBI; SWP:Q9KIE5; PDB:1R2JA; ERDALLTDLVGDRAAEWDTSGELPRDLLVRLGADGLLCAEVAAEHGGLGLGSRENGEFTA -----------------------3333----------11113333-----3333------ HVGSLCSSLRSVMTSQGMAAWTVQRLGDAGQRATFLKELTSGLAAVGFSERQAGSDLSAM -------------------------------------------------1111--1111- RTRVRLDGDTAVVDGHKVWTTAAAYADHLVVFGLQEDGSGAVVVVPADTPGVRVERVPKP 
------!!!!-----------3333--------------------1111----------- SGCRAAGHADLHLDQVRVPAGAVLAGSGASLPMLVAASLAYGRKSVAWGCVGILRACRTA --1111------------3333-2222----------3333------------------- AVAHARTREQFGRPLGDHQLVAGHIADLWTAEQIAARVCEYASDHMVPATILAKHVAAER ---------iiii3333------------------------------------------- AAAGAATAAQVLASAGAGHVVERAYRDAKLMEIIEGSSEMCRVMLAQHALALP -----------!!!!--------------3333---------------1111- >RAS-RELATED PROTEIN RAB-5; SWP:P20339; PDB:1R2QA; GNKICQFKLVLLGESAVGKSSLVLRFVKGQFHEFQESTIGAAFLTQTVCLDDTTVKFEIW -------------2222--------------------------------!!!!------- DTAGQERYHSLAPMYYRGAQAAIVVYDITNEESFARAKNWVKELQRQASPNIVIALSGNK ----333311113333----------1111------------------1111-------3 ADLANKRAVDFQEAQSYADDNSLLFMETSAKTSMNVNEIFMAIAKKLPKN 333---------------1111-------1111----------1111--- >TRIOSEPHOSPHATE ISOMERASE; SWP:P00939; PDB:1R2RA; SRKFFVGGNWKMNGRKKNLGELITTLNAAKVPADTEVVCAPPTAYIDFARQKLDPKIAVA -------------------------------1111------3333--------3333--- AQNCYKVTNGAFTGEISPGMIKDCGATWVVLGHSERRHVFGESDELIGQKVAHALSEGLG -----------2222------1111-----------------------------1111-- VIACIGEKLDEREAGITEKVVFEQTKVIADNVKDWSKVVLAYEPVWAIGTGKTATPQQAQ -----------1111-------------1111-3333------3333------------- EVHEKLRGWLKSNVSDAVAQSTRIIYGGSVTGATCKELASQPDVDGFLVGGASLKPEFVD ------------------------------3333------1111-----3333------- IINAKQ 1111-- >TROPONIN C; SWP:Q7ZZB9; PDB:1R2UA; MNDIYKAAVEQLTDEQKNEFKAAFDIFIQDAEDGCISTKELGKVMRMLGQNPTPEELQEM -------3333-------------3333--------3333------------3333---- IDEVDEDGSGTVDFDEFLVMMVRCMKDDS ------------3333-----3333---- >MUTM; SWP:P84131; PDB:1R2ZA; PQLPEVETIRRTLLPLIVGKTIEDVRIFWPNIIRHPRDSEAFAARMIGQTVRGLERRGKF ------------33332222--------3333-----------1111---------!!!! 
LKFLLDRDALISHLRMEGRYAVASALEPLEPHTHVVFCFTDGSELRYRDVRKFGTMHVYA -----------------------1111-----------1111------1111-------3 KEEADRRPPLAELGPEPLSPAFSPAVLAERAVKTKRSVKALLLDQTVVAGFGNIYVDESL 3331111--1111--1111------------------------1111----3333----- FRAGILPGRPAASLSSKEIERLHEEMVATIGEAVMKGGSTVRTYVNTQGEAGTFQHHLYV -----11111111--------------------1111--------1111---3333---2 YGRQGNPCKRCGTPIEKTVVAGRGTHYCPRCQR 222----------------%%%%---------- >BIOTIN SYNTHASE; SWP:P12996; PDB:1R30A; RPRWTLSQVTELFEKPLLDLLFEAQQVHRQHFDPRQVQVSTLLSIKTGACPEDCKYCPQS ----1111--3333---------------------------------------------1 SRYKTGLEAERLMEVEQVLESARKAKAAGSTRFCMGAAWKNPHERDMPYLEQMVQGVKAM 111----------------------3333-------------33333333---------- GLEACMTLGTLSESQAQRLANAGLDYYNHNLDTSPEFYGNIITTRTYQERLDTLEKVRDA ---------------------------------1111--------3333--------333 GIKVCSGGIVGLGETVKDRAGLLLQLANLPTPPESVPINMLVKVKGTPLADNDDVDAFDF 3---------------------------------------------1111-----3333- IRTIAVARIMMPTSYVRLSAGREQMNEQTQAMCFMAGANSIFYGCKLLTTPNPEEDKDLQ ----------3333------3333------------------------------------ LFRKLGLNPQQT --1111------ >3-HYDROXY-3-METHYLGLUTARY; SWP:P13702; PDB:1R31A; LDSRLPAFRNLSPAARLDHIGQLLGLSHDDVSLLANAGALPMDIANGMIENVIGTFELPY ----2222-----------------------------------3333------------- AVASNFQINGRDVLVPLVVEEPSIVAAASYMAKLARANGGFTTSSSAPLMHAQVQIVGIQ -------iiii----------------------3333----------------------- DPLNARLSLLRRKDEIIELANRKDQLLNSLGGGCRDIEVHTFADTPRGPMLVAHLIVDVR 3333-----------------------1111-------------1111------------ DAMGANTVNTMAEAVAPLMEAITGGQVRLRILSNLADLRLARAQVRITPQQLETAEFSGE -----------------------------------------------3333--------- AVIEGILDAYAFAAVDPYRAATHNKGIMNGIDPLIVATGNDWRAVEAGAHAYACRSGHYG ------------------------------3333-1111------------1111----- SLTTWEKDNNGHLVGTLEMPMPVGLVGGATKTHPLAQLSLRILGVKTAQALAEIAVAVGL -------1111---------------!!!!----------------3333---------- AQNLGAMRALATEGIQ ---------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1R3DA; 
SLLSNQLHFAKPTARTPLVVLVHGLLGSGADWQPVLSHLARTQCAALTLDLPGHGTNPAE ------------1111-------2222-----------3333--------2222----33 AVEIEQTVQAHVTSEVPVILVGYSLGGRLIHGLAQGAFSRLNLRGAIIEGGHFGLQENEE 33------11111111----------------1111-1111------------------- KAARWQHDQQWAQRFSQQPIEHVLSDWYQQAVFSSLNHEQRQTLIAQRSANLGSSVAHLL -------------------------11113333--------------1111--------1 ATSLAKQPYLLPALQALKLPIHYVCGEQDSKFQQLAESSGLSYSQVAQAGHNVHHEQPQA 1111111------1111--------1111----------------------3333----- FAKIVQAIHSIID -------3333-- >TRNA PSEUDOURIDINE SYNTHA; SWP:Q9WZW0; PDB:1R3EA; MKHGILVAYKPKGPTSHDVVDEVRKKLKTRKVGHGGTLDPFACGVLIIGVNQGTRILEFY --------------3333------1111----------1111---------333333331 KDLKKVYWVKMRLGLITETFDITGEVVEERECNVTEEEIREAIFSFVGEYDQVPPAYSAK 111-------------11111111-------------------1111------------- KYKGERLYKLAREGKIINLPPKRVKIFKIWDVNIEGRDVSFRVEVSPGTYIRSLCMDIGY -iiii-----------------------------!!!!-------2222----------- KLGCGATAVELVRESVGPHTIEESLNVFEAAPEEIENRIIPLEKCLEWLPRVVVHQESTK ---------------!!!!3333--3333------1111-3333-1111----------- MILNGSQIHLEMLKEWDGFKKGEVVRVFNEEGRLLALAEAERNSSFLETLRKERVLTLRK -1111--------------2222------------------------------------- VFNTR ----- >MHC H2-TL-T10-129; SWP:Q31206; PDB:1R3HA; GSHSLRYFYTAVSRPGLGEPWFIIVGYVDDMQVLRFSSKEETPRMAPWLEQEEADDWEQQ ------------------------------------2222-----3333-------3333 THIVTIQGQLSERNLMTLVHFYNKSMDDSHTLQWLQDCDVEPDRHLCLWYNQLAYDSEDL ------1111----1111-----------------------1111---------%%%%-- PTLSENPSSCTQHLEGHCSDVLQKYLEKGKERLLRSDPPKAHVTRHPRPEGDVTLRCWAL ------------1111-------3333-3333---------------------------- GFYPADITLTWQKDGEELTVEFVETRPAGDGTFQKWAAVVVPLGKVQSYTCHVDHEGLPE ------------iiii--------------------------------------1111-- PLTLRWEP -------- >BETA-ALANINE SYNTHASE; SWP:Q96W94; PDB:1R3NA; GTLNLPAAAPLSIASGRLNQTILETGSQFGGVARWGQESHEFGMRRLAGTALDGAMRDWF -------------2222--------------------1111-----2222---------- TNECESLGCKVKVDKIGNMFAVYPGKNGGKPTATGSHLDTQPEAGKYDGILGVLAGLEVL 
----1111-----1111------------------------------------------- RTFKDNNYVPNYDVCVVVWFNEEGARFARSCTGSSVWSHDLSLEEAYGLMSVGEDKPESV ------------------------------------------------------------ YDSLKNIGYIGDTPASYKENEIDAHFELHIEQGPILEDENKAIGIVTGVQAYNWQKVTVH ---------------3333----------------------------------------- GVGAHAGTTPWRLRKDALLMSSKMIVAASEIAQRHNGLFTCGIIDAKPYSVNIIPGEVSF ---------1111-------------------1111-------------1111------- TLDFRHPSDDVLATMLKEAAAEFDRLIKINDGGALSYESETLQVSPAVNFHEVCIECVSR -------------------------11111111--------------------------- SAFAQFKKDQVRQIWSGAGHDSCQTAPHVPTSMIFIPSKDGLSHNYYEYSSPEEIENGFK -3333-1111---------3333-3333---------2222---1111------------ VLLQAIINYDNYRVIRGH --------------3333 >UROPORPHYRINOGEN DECARBOX; SWP:P06132; PDB:1R3SA; FPELKNDTFLRAAWGEETDYTPVWCMRQAGRYLPEFRETRAAQDFFSTCRSPEACCELTL -----------------------------3333------3333----------------3 QPLRRFPLDAAIIFSGILVVPQALGMEVTMVPGKGPSFPEPLREEQDLERLRDPEVVASE 333------------1111--1111-----2222---------33331111-33333333 LGYVFQAITLTRQRLAGRVPLIGFAGAPWTLMTYMVEGGGSSTMAQAKRWLYQRPQASHQ --------------iiii------------------------------------------ LLRILTDALVPYLVGQVVAGAQALQLFESHAGHLGPQLFNKFALPYIRDVAKQVKARLRE -----------------------------3333-------------------------11 AGLAPVPMIIFAKDGHFALEELAQAGYEVVGLDWTVAPKKARECVGKTVTLQGNLDPCAL 11---------222211113333---------1111-------------------3333- YASEEEIGQLVKQMLDDFGPHRYIANLGHGLYPDMDPEHVGAFVDAVHKHSRLLRQ -------------------------------11113333------------3333- >ANGIOTENSIN I CONVERTING ; SWP:Q9BYF1; PDB:1R42A; STIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQST ------------------------------------------------------------ LAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNP -33333333----------------3333-3333---------------------1111- QECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYED -----------------------------------------------------1111--- YGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYISP 
-----3333----2222--3333------------------------------2222-11 IGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSV 11--1111--------11111111--3333---------1111--------------111 GLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGH 1------3333----------------------%%%%---------3333---------- IQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINF ----1111--3333----1111------------------1111---------------- LLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYC ------------------------------1111-------------------------3 DPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNML 333---3333----------------------1111---3333--2222---------33 RLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYAD 33----3333----------------------------1111----------1111- >D-ALANYL-D-ALANINE DIPEPT; SWP:Q06241; PDB:1R44A; MEIGFTFLDEIVHGVRWDAKYATWDNFTGKPVDGYEVNRIVGTYELAESLLKAKELAATQ -2222------1111---1111---1111--2222------------------------- GYGLLLWDGYRPKRAVNCFMQWAAQPENNLTKESYYPNIDRTEMISKGYVASKSSHSRGS -----------3333------1111-----3333-----3333-1111-----3333--- AIDLTLYRLDTGELVPMGSRFDFMDERSHHAANGISCNEAQNRRRLRSIMENSGFEAYSL ------------------------33331111-------------------------111 EWWHYVLRDEPYPNSYFDFPVK 1--------------------- >MONO-ADP-RIBOSYLTRANSFERA; SWP:Q00901; PDB:1R45A; DTFTEFTNVEEAKKWGNAQYKKYGLSKPEQEAIKFYTRDASKINGPLRANQGNENGLPAD -------------------1111--------------------------iiii1111--- ILQKVKLIDQSFSKMKMPQNIILFRGDDPAYLGPEFQDKILNKDGTINKTVFEQVKAKFL ----------------------------3333---------1111---3333-------- KKDRTEYGYISTSLMSAQFGGRPIVTKFKVTNGSKGGYIDPISYFPGQLEVLLPRNNSYY ---------------3333-----------2222----3333------------------ ISDMQISPNNRQIMITAMIFK ------1111----------- >ALPHA-GALACTOSIDASE A; SWP:P06280; PDB:1R46A; LDNGLARTPTMGWLHWERFMCNLDCQEEPDSCISEKLFMEMAELMVSEGWKDAGYEYLCI ---------------3333---------1111------------------3333------ DDCWMAPQRDSEGRLQADPQRFPHGIRQLANYVHSKGLKLGIYADVGNKTCAGFPGSFGY 
---------3333--------1111--------1111------------1111------- YDIDAQTFADWGVDLLKFDGCYCDSLENLADGYKHMSLALNRTGRSIVYSCEWPLYMWPF ----------------------------------------------------3333---- QKPNYTEIRQYCNHWRNFADIDDSWKSIKSILDWTSFNQERIVDVAGPGGWNDPDMLVIG -------1111--------------------------3333----------------222 NFGLSWNQQVTQMALWAIMAAPLFMSNDLRHISPQAKALLQDKDVIAINQDPLGKQGYQL 2---3333--------1111------------------1111-----1111--------- RQGDNFEVWERPLSGLAWAVAMINRQEIGGPRSYTIAVASLGKGVACNPACFITQLLPVK --------------------------------------------2222------------ RKLGFYEWTSRLRSHINPTGTVLLQLENTM ------3333------2222---------- >PROLINE/BETAINE TRANSPORT; SWP:P30848; PDB:1R48A; CGGDNIEQKIDDIDHEIADLQAKRTRLVQQHPR ---------------------------3333-- >Golgin subfamily A member; SWP:Q13439; PDB:1R4AE; PTEFEYLRKVLFEYMMGRETKTMAKVITTVLKFPDDQTQKILEREDARLMF -------------1111-----------1111---------------1111 >CYSTATIN C; SWP:P01034; PDB:1R4CA; GGPMDASVEEEGVRRALDFAVGEYNKASNDMYHSRALQVVRARKQIVAGVNYFLDVELGR ------3333-------------------------------------------------- TTCTKTQPNLDNCPFHDQPHLKRKAFCSFQIYAVPWQGTMTLSKSTCQDA ---1111-3333-----1111------------3333------------- >RNA POLYMERASE ALPHA SUBU; SWP:P04860; PDB:1R4GA; KPTMHSLRLVIESSPLSRAEKAAYVKSLSKCKTDQEVKAVMELVEEDIESLTN -----------------3333-------------------------------- >ANDROGEN RECEPTOR; SWP:P15207; PDB:1R4IA; PQKTCLICGDEASGAHYGALTCGSCKVFFKRAAEGKQKYLCASRNDCTIDKFRRKNCPSC ----------------------3333-----------------------1111------- RLRKCYEAGMTLGA -------------- >ARGONAUTE 1; SWP:Q32KD4; PDB:1R4KA; TAFYKAQPVIDFMCEVLDIRDINEQRKPLTDSQRVKFTKEIKGLKIEITHCGQMRRKYRV --------------------1111------------------------------------ CNVTRRPAQMQSFPLQLENGQTVECTVAKYFLDKYRMKLRYPHLPCLQVGQEHKHTYLPL ------3333------3333--------3333--------3333---------------- EVCNIVAGQRCIKKLTDMQTSTMIKATARSAPDREREINNLVKRADFN ------------------------------------------------ >NEDD8-activating enzyme E; SWP:Q8TBC4; PDB:1R4MB; DWEGRWNHVKKFLERSGPFTHPDFEPSTESLQFLLDTCKVLVIGAGGLGCELLKNLALSG 
-22221111-------11111111-------------------------------1111- FRQIHVIDMDTIDVSNLNRQFLFRPKDIGRPKAEVAAEFLNDRVPNCNVVPHFNKIQDFN ------------1111-------3333---------------------------1111-- DTFYRQFHIIVCGLDSIIARRWINGMLISLLNYEDGVLDPSSIVPLIDGGTEGFKGNARV 3333-----------------------1111-------1111---------!!!!----- ILPGMTACIECTLELYPPQVNFPMATIASMPRLPEHCIEYVRMLQWPKEQPFGEGVPLDG -2222--33331111--------3333-----3333---------3333---2222--11 DDPEHIQWIFQKSLERASQYNIRGVTYRLTQGVVKRIIPAVASTNAVIAAVCATEVFKIA 11---------------------------------------------------------- TSAYIPLNNYLVFNDVDGLYTYTFEAERKENCPACSQLPQNIQFSPSAKLQEVLDYLTNS ----------------------------1111---------------------------1 ASLQMKSPAITATNRTLYLQSVTSIEERTRKELGLVDGQEAVADVTTPQTVLFKLHFT 111------------------33331111--3333-----------2222-------- >SHIGA-LIKE TOXIN TYPE II ; SWP:Q9R398; PDB:1R4PA; REFTIDFSTQQSYVSSLNSIRTEISTPLEHISQGTTSVSVINHTPPGSYFAVDIRGLDVY ---------------------------1111-!!!!--------2222---------222 QARFDHLRLIIEQNNLYVAGFVNTATNTFYRFSDFTHISVPGVTTVSMTTDSSYTTLQRV 2------------------------------1111------------------------- AALERSGMQISRHSLVSSYLALMEFSGNTMTRDASRAVLRFVTVTAEALRFRQIQREFRQ ----2222-------------------------------------------------333 ALSETAPVYTMTPGDVDLTLNWGRISNVLPEYRGEDGVRVGRISFNNISAILGTVAVILN 3-1111---------------------3333--------!!!!----------------- CHECQITGDRPVIKINNTLWESNTAAAFLNRKSQFLYTTGK --------------%%%%------3333------------- >Shiga-like toxin II B sub; SWP:Q57249; PDB:1R4PB; ADCAKGKIEFSKYNEDDTFTVKVDGKEYWTSRWNLQPLLQSAQLTGMTVTIKSSTCESGS -------------1111-----iiii-----3333---------------------2222 GFAEVQFNND ---------- >SHT CYTOTOXIN A SUBUNIT; SWP:Q7AK38; PDB:1R4QA; KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGTGDNLFAVDVRGIDPE ---------------------------3333-%%%%------------------------ EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV -------------------------------1111----2222---------------33 AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT 
33--2222----------------------3333-------------------------- TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL ---1111---------------------------------!!!!---3333--------- NCHFPSMCPADGRVRGITHNKILWDSSTLGAILMRR ------------------%%%%-------1111--- >HYPOTHETICAL PROTEIN AQ_3; SWP:O66665; PDB:1R4VA; ETLRPKGFDKLDHYFRTELDIDLTDETIELLLNSVKAAFGKLFYGAEQRARWNGRDFIAL ----------------------------------------1111------1111----33 ADLNITKALEEHIKNFQKIEQDGVDELLEYIAFIPPVENVGEDLKSEYRNIGGLLLHADV 33-------------1111----------------------3333--3333--------- IKKATGERKPSREAEFVAQIVDKVF ----------3333------1111- >GLUTATHIONE S-TRANSFERASE; SWP:P24473; PDB:1R4WA; GPAPRVLELFYDVLSPYSWLGFEVLCRYQHLWNIKLKLRPALLAGIMKDSGNQPPAMVPH -----------3333------------1111----------------3333--1111333 KGQYILKEIPLLKQLFQVPMSVPKDFFGEHVKKGTVNAMRFLTAVSMEQPEMLEKVSREL 3------------3333-------3333-3333---------------1111-------- WMRIWSRDEDITESQNILSAAEKAGMATAQAQHLLNKISTELVKSKLRETTGAACKYGAF ------------3333------------------1111----------------1111-- GLPTTVAHVDGKTYMLFGSDRMELLAYLLGEKWMGPVPPTL --------iiii----------------------------- >COATOMER GAMMA SUBUNIT; SWP:Q9Y678; PDB:1R4XA; MHHHHHHMTRQEIFQEQLAAVPEFRGLGPLFKSSPEPVALTESETEYVIRCTKHTFTNHM --------------------3333-----------------2222--------------- VFQFDCTNTLNDQTLENVTVQMEPTEAYEVLYVPARSLPYNQPGTCYTLVALPKEDPTAV --------------------------------------2222-------------1111- ACTFSCMMKFTVKDCDPTTGETDDEGYEDEYVLEDLEVTVADHIQKVMKLNFEAAWDEVG --------------------------------------3333------------------ DEFEKEETFTLSTIKTLEEAVGNIVKFLGMHPCERSDKVPDNKNTHTLLLAGVFRGGHDI 1111------1111------------------%%%%-----------------2222--- LVRSRLLLLDTVTMQVTARSLEELPVDIILASV --------------------------------- >CHORISMATE SYNTHASE; SWP:P28777; PDB:1R53A; MSTFGKLFRVTTYGESHCKSVGCIVDGVPPGMSLTEADIQPQLTRRRPDRVEIQSGTEFG ----------------------------------3333-------------------iii KTLGTPIAMMIKNRETIGRVASGAIAEKFLAQNSNVEIVAFVTQIGEIKMNRDSFDPEFQ i-------------------------------------------!!!!----1111---- 
HLLNTITREKVDSMGPIRCPDASVAGLMVKEIEKYRGNKDSIGGVVTCVVRNLPTGLGEP --------------3333--3333-----------1111--------------------- CFDKLEAMLAHAMLSIPASKGFEIGSGFQGVSVPGSKHNDPFYRTKTNNSGGVQGGISNG ---------------2222----!!!!1111--3333----------------iiii--- ENIYFSVPFKSVRHDPAVTPRAIPIVEAMTALVLADALLIQKARDFS ------------------3333-------------------1111-- >ADAM 33; SWP:Q9BZ11; PDB:1R55A; TRKYLELYIVADHTLFLTRHRNLQHTKQRLLEVANYVDQLLRTLDIQVALTGLEVWTERD ----------------1111---------------------------------------- RSRVTQDANATLWAFLQWRRGLWAQRPHDSAQLLTGRAFQGATVGLAPVEGMCRAESSGG ---------------------3333--------------%%%%-------2222------ VSTDHSELPIGAAATMAHEIGHSLGLSHDPDGCCVEAAAESGGCVMAAATGHPFPRVFSA -------3333------------------1111----3333--1111------------- CSRRQLRAFFRKGGGACLSNAPS --------------1111----- >CONSERVED HYPOTHETICAL PR; SWP:Q99RB4; PDB:1R57A; MSNLEIKQGENKFYIGDDENNALAEITYRFVDNNEINIDHTGVSDELGGQGVGKKLLKAV --------2222-----3333--------------------------1111--------- VEHARENNLKIIASCSFAKHMLEKEDSYQDVYLGLEHHHHHH ----1111----------------3333-------------- >Glycerol kinase; SWP:O34153; PDB:1R59O; NYVMAIDQGTTSSRAIIFDRNGKKIGSSQKEFPQYFPKSGWVEHNANEIWNSVQSVIAGA ---------------------------------------------------------111 FIESGIRPEAIAGIGITNQRETTVVWDKTTGQPIANAIVWQSRQSSPIADQLKVDGHTEM 1------1111----------------------------------1111----------- IHEKTGLVIDAYFSATKVRWLLDNIEGAQEKADNGELLFGTIDSWLVWKLTDGQVHVTDY ------------3333----1111----3333--------------1111---------- SNASRTMLYNIHKLEWDQEILDLLNIPSSMLPEVKSNSEVYGHTRSVPIAGMAGDQQAAL -3333--------------3333------------------------------------3 FGQMAFEKGMIKNTYGTGAFIVMNTGEEPQLSDNDLLTTIGYGINGKVYYALEGSIFVAG 333---2222-------------------------------------------------3 SAIQWLRDGLRMIETSPQSEELAAKAKGDNEVYVVPAFTGLGAPYWDSEARGAVFGLTRG 333-----------3333----------%%%%---------!!!!------------111 TTKEDFVRATLQAVAYQSKDVIDTMKKDSGIDIPLLKVDGGAAKNDLLMQFQADILDIDV 13333-----------3333---------------------------------------- 
QRAANLETTALGAAYLAGLAVGFWKDLDELKSMAEEGQMFTPEMPAEERDNLYEGWKQAV -------3333---------------11111111--------------33331111---- >GLUTATHIONE TRANSFERASE; SWP:Q9GQG7; PDB:1R5AA; TTVLYYLPASPPCRSVLLLAKMIGVELDLKVLNIMEGEQLKPDFVELNPQHCIPTMDDHG ------3333----------------------333333333333---1111--------- LVLWESRVILSYLVSAYGKDENLYPKDFRSRAIVDQRLHFDLGTLYQRVVDYYFPTIHLG --------------------------3333------------------------------ AHLDQTKKAKLAEALGWFEAMLKQYQWSAANHFTIADIALCVTVSQIEAFQFDLHPYPRV ---3333----------------------------------------------3333--- RAWLLKCKDELEGHGYKEINETGAETLAGLFRSK ----------1111-------------------- >Eukaryotic peptide chain ; SWP:O74718; PDB:1R5BA; TEDATDLQNEVDQELLKDMYGKEHVNIVFIGHVDAGKSTLGGNILFLTGMVDKRTMEKIE -----3333----------------------1111----------1111----------- REAKERAYFETEHRRFSLLDAPGASQADIGVLVISARRGEFEAGFERGGQTREHAVLART 1111------1111--------------------------3333-1111---------11 QGINHLVVVINKMDEPSVQWSEERYKECVDKLSMFLRRVAGYNSKTDVKYMPVSAYTGQN 11---------3333-----3333------------------3333--------1111-- VKDRVDSSVCPWYQGPSLLEYLDSMTHLERKVNAPFIMPIASKYKDLGTILEGKIEAGSI -----3333--------------------------------------------------- KKNSNVLVMPINQTLEVTAIYDEADEEISSSICGDQVRLRVRGDDSDVQTGYVLTSTKNP -------------------------------2222---------11112222-------- VHATTRFIAQIAILELPSILTTGYSCVMHIHTAVEEVSFAKLLHKLDKTNRKSKKPPMFA ------------------------------------------------------------ TKGMKIIAELETQTPVCMERFEDYQYMGRFTLRDQGTTVAVGKVVKILD 2222---------------33333333---------------------- >AVIRULENCE PROTEIN; SWP:Q08242; PDB:1R5EA; DNVTSSQLLSVRHQLAESAGLPRDQHEFVSSQAPQSLRNRYNNLYSHTQRTLDMADMQHR ---------------3333---3333-------3333-------------3333------ YMTGASGINPGMLPHENVDDMRSAITDWSDMREALQHAMGIHADI ------------1111----------------------------- >PUTATIVE PHOSPHOTRANSACET; SWP:Q99ZQ5; PDB:1R5JA; SIRSLFGGLREKILGKNKIVFPEGNDERVVRAAARLKFEGLLEPIILGQSEEVRNLLTKL ------------2222--------------------------------3333-----111 GFADQDYTIINPNEYADFDKKEAFVEVRKGKATLEDADKLRDVNYFGVLVKGLADGVSGA 
1---------3333------3333--------3333-----3333--------------- IHSTADTVRPALQIIKTKPGISRTSGVFLNRENTSERYVFADCAINIDPTAQELAEIAVN --3333-----------1111---------3333---------------3333------- TAETAKIFDIDPKIALSFSTKGSGKAPQVDKVREATEIATGLNPDLALDGELQFDAAFVP ---3333------------iiii-------------------1111-------------- ETAAIKAPDSAVAGQANTFVFPDLQSGNIGYKIAQRLGFDAIGPILQGLNKPVNDLSRGS -----------2222--------------------------------------------- SAEDIYKLAIITAAQAIES --------------3333- >ALPHA-TOCOPHEROL TRANSFER; SWP:P49638; PDB:1R5LA; QPGLAALRRRAREAGVPLAPLPLTDSFLLRFLRARDFDLDLAWRLLKNYYKWRAECPEIS 2222------------------------------%%%%-----------------3333- ADLHPRSIIGLLKAGYHGVLRSRDPTGSKVLIYRIAHWDPKVFTAYDVFRVSLITSELIV ----3333---------------1111------3333-1111------------------ QEVETQRNGIKAIFDLEGWQFSHAFQITPSVAKKIAAVLTDSFPLKVRGIHLINEPVIFH ---------------222233333333----------------------------3333- AVFSIKPFLTEKIKERIHHGNNYKQSLLQHFPDILPLEYGGEEFSEDICQEWTNFIKSED 33333333-33331111------------------3333--------------------- YLSSISE --1111- >SIR4-INTERACTING PROTEIN ; SWP:P38262; PDB:1R5MA; GFVKILKEIVKLDNIVSSTWNPLDESILAYGEKNSVARLARIVETYWKLTIIAELRHPFA -----------------------1111--------------------------------- LSTNQVTCLAWSHDGNSIVTGVENGELRLWNKTGALLNVLNFHRAPIVSVKWNKDGTHII -----------1111------1111-----1111------------------1111---- SMDVENVTILWNVISGTVMQHFELKGSLGVDVEWVDDDKFVIPGPKGAIFVYQITEKTPT --1111--------------------------------------iiii----1111---- GKLIGHHGPISVLEFNDTNKLLLSASDDGTLRIWHGGNGNSQNCFYGHSQSIVSASWVGD -------------------------1111------------------------------- DKVISCSMDGSVRLWSLKQNTLLALSIVDGVPIFAGRISQDGQKYAVAFMDGQVNVYDLK ------1111------1111-------2222-------1111------1111-------3 KLNSPLPIPLYASYQSSQDNDYIFDLSWNCAGNKISVAYSLQEGSVVAIPG 333-------------------------1111------------------- >CIRCADIAN OSCILLATION REG; SWP:Q8YT41; PDB:1R5PA; TYVLKLYVAGNTPNSVRALKTLKNILEQEFQGIYALKVIDVLKNPQLAEEDKILATPTLS -----------------------------iiii------1111-----------333333 KILPPPVRKIIGDLSDRERVLIGLDLLYEE 
33---------------------------- >CIRCADIAN OSCILLATION REG; SWP:Q8YT42; PDB:1R5QA; EVDQQILLQQLKSDYRQILLSYFTTDLKEKIDKFINAVFCANIPVPEIIEIHMELIDEFS --3333--------------1111-------------------3333------------- KQLRLGDLMDYRLTLIDILAHLCEAYRGAIF ------------------------------- >PHEROMONE-BINDING PROTEIN; SWP:Q8WRW5; PDB:1R5RA; DWVPPEVFDLVAEDKARCMSEHGTTQAQIDDVDKGNLVNEPSITCYMYCLLEAFSLVDDE ------------------------3333---1111----3333--------1111----- ANVDEDIMLGLLPDQLQERAQSVMGKCLPTSGSDNCNKIYNLAKCVQESAPDVWFVI --------33331111----------------------------------------- >GAP JUNCTION ALPHA-1 PROT; SWP:P08050; PDB:1R5SA; GPLGSPSKDCGSPKYAYFNGCSSPTAPLSPMSPPGYKLVTGDRNNSSCRNYNKQASEQNW ------%%%%-------------------------------------------------- ANYSAEQNRMGQAGSTISNSHAQPFDFPDDNQNAKKVAAGHELQPLAIVDQRPSSRASSR -------------3333--------------1111----3333-----3333---%%%%- ASSRPRPDDLEI ------------ >CYTIDINE DEAMINASE; SWP:Q06549; PDB:1R5TA; KVGGIEDRQLEALKRAALKACELSYSPYSHFRVGCSILTNNDVIFTGANVENASYSNCIC --------------------1111--------------1111-----------3333--- AERSAMIQVLMAGHRSGWKCMVICGDSEDQCVSPCGVCRQFINEFVVKDFPIVMLNSTGS ---------1111---------------------------------1111-----1111- RSKVMTMGELLPMAFGPSHLN -----3333------3333-- >QUEUINE TRNA-RIBOSYLTRANS; SWP:P28720; PDB:1R5YA; RPRFSFSIAAREGKARTGTIEMKRGVIRTPAFMPVGTAATVKALKPETVRATGADIILGN -----------!!!!------1111---------------2222-----3333------- TYHLMLRPGAERIAKLGGLHSFMGWDRPILTDSGGYQVMLSLTKQSEEGVTFMLSPERSI -------------1111-----------------1111-------3333----------- EIQHLLGSDIVMAFDECTPYPATPSRAASSMERSMRWAKRSRDAFDSRKEQAENAALFGI ------------------------------------------------------------ QQGSVFENLRQQSADALAEIGFDGYAVGGLAVGEGQDEMFRVLDFSVPMLPDDKPHYLMG --!!!!---------------------------------------3333-1111---222 VGKPDDIVGAVERGIDMFDCVLPTRSGRNGQAFTWDGPINIRNARFSEDLKPLDSECHCA 2---------1111------3333---------1111--33331111------1111-33 VCQKWSRAYIHHLIRAGEILGAMLMTEHNIAFYQQLMQKIRDSISEGRFSQFAQDFRARY 33-----------------------------------------1111------------- F - 
>METAL-DEPENDENT HYDROLASE; SWP:P84132; PDB:1R61A; AMKVYDVTAPIYEGMPVYKNKPEKQPKRTTITNGYVTESRIDMDVHTGTHIDAPLHMVEG -----------2222-22223333--------!!!!-------1111------3333--- GATFETIPLNDLVGPCKLFDLTHVNDRITKDDIAHLDIQEGDFVLFKTKNSFEDAFHFEF --------1111--------1111----33331111--2222-----3333--------- IFVAEDAARYLADKQIRGVGIDALGIERAQEGHPTHKTLFSAGVIIIEGLRLKDVPEGRY -----------------------------2222------1111--------1111----- FMVAAPLKLVGTDAAPARVLLFDR --------2222------------ >NITROGEN REGULATION PROTE; SWP:P06712; PDB:1R62A; RVTESIHKVAERVVTLVSMELPDNVRLIRDYDPSLPELAHDPDQIEQVLLNIVRNALQAL ---------------3333--1111------1111------------------------- GPEGGEIILRTRTAFQLTLHGERYRLAARIDVEDNGPGIGLGLSIARNLIDQHSGKIEFT 3333--------------iiii-------------1111-----------1111------ SWPGHTEFSVYLPIRK -2222----------- >KEXIN; SWP:P13134; PDB:1R64A; LLPVKEAEDKLSINDPLFERQWHLVNPSFPGSDINVLDLWYNNITGAGVVAAIVDDGLDY --------1111--1111--3333----2222-------------2222---------11 ENEDLKDNFCAEGSWDFNDNTNLPKPRLSDDYHGTRCAGEIAAKKGNNFCGVGVGYNAKI 11---11113333--------------1111-----------------------1111-- SGIRILSGDITTEDEAASLIYGLDVNDIYSCSWGPADDGRHLQGPSDLVKKALVKGVTEG ---------------------3333----------------------------------- RDSKGAIYVFASGNGGTRGDNCNYDGYTNSIYSITIGAIDHKDLHPPYSEGCSAVMAVTY iiii----------1111--------1111---------------3333----------- SSGSGEYIHSSDINGRCSNSHGGTSAAAPLAAGVYTLLLEANPNLTWRDVQYLSILSAVG --iiii------%%%%------3333---------------3333--------------- LEKNADGDWRDSAMGKKYSHRYGFGKIDAHKLIEMSKTWENVNAQTWFYLPTLYVSQSTN 3333-----------------!!!!--------1111----------------------- STEETLESVITISEKSLQDANFKRIEHVTVTVDIDTEIRGTTTVDLISPAGIISNLGVVR 1111---------------------------------3333------1111--------3 PRDVSSEGFKDWTFMSVAHWGENGVGDWKIKVKTTENGHRIDFHSWRLKLFGESIDSSKT 333-------------1111---------------2222----------------3333- E - >REPRESSOR PROTEIN CI; SWP:P16117; PDB:1R69; SISSRVKSKRIQLGLNQAELAQKVGTTQQSIEQLENGKTKRPRFLPELASALGVSVDWLL ----------1111------------3333---1111----------------------- NGT --- 
>TDP-GLUCOSE-4,6-DEHYDRATA; SWP:Q9ZGH3; PDB:1R6DA; MRLLVTGGAGFIGSHFVRQLLAGAYPDVPADEVIVLDSLTYAGNRANLAPVDADPRLRFV ------1111--------------1111-----------1111----3333--1111--- HGDIRDAGLLARELRGVDAIVHFAAESHVDRSIAGASVFTETNVQGTQTLLQCAVDAGVG --1111-----1111--------------------3333---------------1111-- RVVHVSTNQVYGSIDSGSWTESSPLEPNSPYAASKAGSDLVARAYHRTYGLDVRITRCCN --------1111-------1111------------------------------------- NYGPYQHPEKLIPLFVTNLLDGGTLPLYGDGANVREWVHTDDHCRGIALVLAGGRAGEIY --2222------------1111-----!!!!-------3333------------2222-- HIGGGLELTNRELTGILLDSLGADWSSVRKVADRKGHDLRYSLDGGKIERELGYRPQVSF ------------------1111-3333------2222----------------------- ADGLARTVRWYRENRGWWEPLK -------------33333333- >VIRULENCE-ASSOCIATED V AN; SWP:P21206; PDB:1R6FA; GSSVLEELVQLVAAANIDISIKNRVITDDIELLKKILAYFLPEDAILKGGHDNQLQNGIK ---------3333------------1111------------1111--------------- RVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNHHGDARSKLREELAE ---------------------------1111------------1111------------- LTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKASAEYKILEKM ---------------------------1111----1111----3333--------1111- PQTTIQVDGSEKKIVSIKDFLGSENKRTGALGNLKNSYSYNKDSDKSRPLNDLVSQKTTQ -------------------1111-------------------------3333-------- LSDITSRFNSAIEALNRFIQKYDSVMQRLLDD -------------------------3333--- >SYNTENIN 1; SWP:O00560; PDB:1R6JA; GAMDPRTITMHKDSTGHVGFIFKNGKITSIVKDSSAARNGLLTEHNICEINGQNVIGLKD 1111--------1111------iiii----2222---------------iiii-2222-- SQIADILSTSGTVVTITIMPAF ---------------------- >RIBONUCLEASE PH; SWP:P50597; PDB:1R6LA; NRPSGRAADQLRPIRITRHYTKHAEGSVLVEFGDTKVICTVSAESGVPRFLKQGWLTAEY -1111-1111---------------------!!!!------------------------- GLPRSTGERNQREASRGKQGGRTLEIQRLIGRSLRAALDLSKLGENTLYIDCDVIQADGG -----------3333-------------------11113333------------------ TRTASITGATVALIDALAVLKKRGALKGNPLKQVAAVSVGIYQGVPVLDLDYLEDSAAET -----------------------------------------iiii--------------- DLNVVTDAGGFIEVQGTAEGAPFRPAELNALELAQQGQELFELQRAALAE 
-----3333-------------------------------------1111 >HPV11 REGULATORY PROTEIN ; SWP:P04015; PDB:1R6NA; HEAIAKRLDACQDQLLELYEENSIDIHKHIMHWKCIRLESVLLHKAKQMGLSHIGLQVVP ------------------3333--3333------------------1111---iiii--- PLTVSETKGHNAIEMQMHLESLAKTQYGVEPWTLQDTSYEMWLTPPKRCFKKQGNTVEVK 3333--------------------3333----3333--3333------------------ FDGCEDNVMEYVVWTHIYLQDNDSWVKVTSSVDAKGIYYTCGQFKTYYVNFNKEAQKYGS ---1111-------------------------3333----!!!!---------------- TNHWEVCYGSTVICSP ---------------- >ATP-DEPENDENT CLP PROTEAS; SWP:P75832; PDB:1R6OC; WLDFDQLDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAI --------------------------3333------------------------------ CGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA --------------------1111---------- >GENOME POLYPROTEIN; SWP:Q88653; PDB:1R6RA; NRVSTVQQLTKRFSLGMLQGRGPLKLFMALVAFLRFLTIPPTAGILKRWGTIKKSKAINV ----------------3333----3333-------------3333-3333---3333--- LRGFRKEIGRMLNILNRRRR -------------------- >TRYPTOPHANYL-TRNA SYNTHET; SWP:P23381; PDB:1R6UA; EDFVDPWTVQTSSAKGIDYDKLIVRFGSSKIDKELINRIERATGQRPHHFLRRGIFFSHR ------------1111-------1111--------------------------------- DNQVLDAYENKKPFYLYTGRGPSSEAHVGHLIPFIFTKWLQDVFNVPLVIQTDDEKYLWK -------1111---------------3333------------------------------ DLTLDQAYGDAVENAKDIIACGFDINKTFIFSDLDYGSSGFYKNVVKIQKHVTFNQVKGI -----------------------1111----3333------------3333--------- FGFTDSDCIGKISFPAIQAAPSFSNSFPQIFRDRTDIQCLIPCAIDQDPYFRTRDVAPRI ---11113333---333333331111-----------------11113333----3333- GYPKPALLHSTFFPALQGAQTKSASDPNSSIFLTDTAKQIKTKVNKHAFSGGRDTIEEHR --------------3333----1111---------------------------------- QFGGNCDVDVSFYLTFFLEDDDKLEQIRKDYTSGALTGELKKALIEVLQPLIAEHQARRK --------------------------------------3333---------------333 EVTDEIVKEFTPRKLSFD 3-3333------------ >SUBTILISIN-LIKE SERINE PR; SWP:Q93LQ6; PDB:1R6VA; SKAKDLASLPEIKSQGYHILFGELRDGEYTEGKILVGYNDRSEVDKIVKAVNGKVVLELP ----3333----------------2222-----------3333----------------- 
QIKVVSIKLNGMTVKQAYDKIKALALKGIRYVEPSYKRELIKPTVVKPNPDMYKIRKPGL --------------------1111------------------------1111-------- NSTARDYGEELSNELWGLEAIGVTQQLWEEASGTNIIVAVVDTGVDGTHPDLEGQVIAGY --------1111------1111-3333-----2222---------11111111------- RPAFDEELPAGTDSSYGGSAGTHVAGTIAAKKDGKGIVGVAPGAKIMPIVIFDDPALVGG --------2222--1111----------------------1111---------3333-!! NGYVGDDYVAAGIIWATDHGAKVMNHSWGGWGYSYTMKEAFDYAMEHGVVMVVSAGNNTS !!--------------1111------------------------1111------------ DSHHQYPAGYPGVIQVAALDYYGGTFRVAGFSSRSDGVSVGAPGVTILSTVPGEDSIGYE ------1111-----------%%%%---------1111--------------1111---- GHNENVPATNGGTYDYYQGTSMAAPHVTGVVAVLLQKFPNAKPWQIRKLLENTAFDFNGN --1111------------3333---------------11113333----------3333- GWDHDTGYGLVKLDAALQGPLPTQGGVEEFQVVVTDAKGNFGVPTVFVSMMRDNGSCYYA -----!!!!------3333----------------1111--------------------- KTGPDGIARFPHIDSGTYDIFVGGPDHWDRALAPYDGESIPGGYAIALRMAEERQASFVG --1111--------------------------1111------------3333-------- FGVSPDATQLNVNFNSTLQVKFSTNLSTLKDPQFVVVDPLLRGVYGRVAYARNQTYDLSL ---1111------------------1111--------1111---------2222---111 LSGQISFGIQTLLPAATDITIQGTVTLNGEDIPVYGVLKAGTTWTIIDDFGGLNLGTDSQ 1-------------------------iiii--------2222--------------3333 PIYVWWTIFGQ ----------- >O-SUCCINYLBENZOATE SYNTHA; SWP:P29208; PDB:1R6WA; MRSAQVYRWQIPMDAGVVLDRRLKTRDGLYVCLREGEREGWGEISPLPGFSQETWEEAQS -------------2222-----------------!!!!--------2222---------- VLLAWVNNWLAGDCELPQMPSVAFGVSCALAELTDTLPQAANYRAAPLCNGDPDDLILKL -------3333------------------------------------------------1 ADMPGEKVAKVRVGLYEAVRDGMVVNLLLEAIPDLHLRLDANRAWTPLKGQQFAKYVNPD 111----------------------------1111-----%%%%--------3333-333 YRDRIAFLEEPCKTRDDSRAFARETGIAIAWDESLREPDFAFVAEEGVRAVVIKPTLTGS 31111--------------------------3333-2222----2222-----3333--- LEKVREQVQAAHALGLTAVISSSIESSLGLTQLARIAAWLTPDTIPGLDTLDLMQAQQVR -----------1111-------------------------1111-----1111------- RWPGSTLPVVEVDALERLL -2222-----3333----- >ATP:SULFATE ADENYLYLTRANS; SWP:P08536; PDB:1R6XA; 
PAPHGGILQDLIARDALKKNELLSEAQSSDILVWNLTPRQLCDIELILNGGFSPLTGFLN --2222--------1111---------1111---------------1111---------- ENDYSSVVTDSRLADGTLWTIPITLDVDEAFANQIKPDTRIALFQDDEIPIAILTVQDVY ------------1111-----------333311112222--------------------- KPNKTIEAEKVFRGDPEHPAISYLFNVAGDYYVGGSLEAIQLPQHYDYPGLRKTPAQLRL --------------1111-----------------------------2222--------- EFQSRQWDRVVAFQTRNPMHRAHRELTVRAAREANAKVLIHPVVGLTKPGDIDHHTRVRV --1111-------------------------1111------------2222--------- YQEIIKRYPNGIAFLSLLPLAMRMSGDREAVWHAIIRKNYGASHFIVGRDHAGPGKNSKG ----1111-----------------------------1111--------2222---1111 VDFYGPYDAQELVESYKHELDIEVVPFRMVTYLPDEDRYAPIDQIDTTKTRTLNISGTEL ----1111---------1111-----------3333----3333---------------- RRRLRVGGEIPEWFSYPEVVKILRES ----------3333------------ >TRANSCRIPTIONAL REPRESSOR; SWP:P07674; PDB:1R71A; EADQVIENLQRNELTPREIADFIGRELAKGKKKGDIAKEIGKSPAFITQHVTLLDLPEKI --------1111--------------1111-------------------3333------- ADAFNTGRVRDVTVVNELVTAFKKRPEEVEAWLDDDTQEITRGTVKLLREFLDE ---1111---------------------------1111---------------- >50S RIBOSOMAL PROTEIN L29; SWP:P38514; PDB:1R73A; MKASELRNYTDEELKNLLEEKKRQLMELRFQLAMGQLKNTSLIKLTKRDIARIKTILRER ---3333--3333----------------------------------------------- ELGIRR ------ >GLYCINE N-METHYLTRANSFERA; SWP:Q14749; PDB:1R74A; YRTRSLGVAAEGLPDQYADGEAARVWQLYIGDTRSRTAEYKAWLLGLLRQHGCQRVLDVA ---------2222---1111-1111--3333-----------------1111-------- CGTGVDSIMLVEEGFSVTSVDASDKMLKYALKERWNRRHEPAFDKWVIEEANWMTLDKDV !!!!------1111--------3333----------11111111-------3333----- PEGGFDAVICLGNSFAHLPDCKGDQSEHRLALKNIASMVRAGGLLVIDHRNYDHILSTGC ------------3333---3333------------33332222--------3333----- APPGKNIYYKSDLTKDVTTSVLIVNNKAHMVTLDYTVGLSKFRLSYYPHCLASFTELLQA -----------------------%%%%--------------------------------1 AFGGKCQHSVLGDFKPYKPGQTYIPCYFIHVLKRT 111--------iiii-------------------- >HYPOTHETICAL PROTEIN; SWP:P84150; PDB:1R75A; VTFKNGKPTVKGTKTYPMFSNILYRIADTEARRWAFYNDSKELIIHVAVLFDYDSQIVPL 
------------------%%%%------1111-------------------1111----! GDTTAFRIGKYLCEVDVRPLETQMFVEGSVTGWRVDTLEARTAEDERGYR !!!--------------2222-----------------------1111-- >PECTATE LYASE; SWP:Q9X592; PDB:1R76A; AVIGMNEAASALTPSRVSSLPDTQRAAWQEYLARSEAQLSRDKASLAAELAPGQPLPPPP ------------------------3333------------------33332222------ AEGKGADTMPLDKPAAWYTSKAARHVADVIVSFQTPAGGWGKNQPRDGALRLPGQHYTGE ----1111-----3333-------------11113333-------------2222----- NVAKVKRDRDWHYVGTIDNDATVTEIRFLAQVVSQLAPEEAAPYRDAALKGIEYLLASQF -----------------iiii-----------11113333-------------------1 PNGGWPQVWPLEGGYHDAITYNDDALVHVAELLSDIAAGRDGFGFVPPAIRTRALEATNA 111-----------------2222---------------iiii----------------- AIHCIVETQVVQDGKRLGWGQQHDALTLRPTSARNFEPAALSSTESARILLFLMEIEAPS ---------------------------------1111---------------1111---- DAVKQAIRGGVAWLNTSVIRDQGAKPLWSRFYSLDGNKPVFGDRDKTIHDDVMGISQERR ------------1111----------------------------------1111-3333- TGYAWYTTSPQKALSAFTKWEKRS -------3333---3333------ >Cell Wall Targeting Domai; SWP:O05156; PDB:1R77A; DDDKVKLYKTNKYGTLYKSESASFTANTDIITRLTGPFRSMPQSGVLRKGLTIKYDEVMK 1111------1111-----------------------1111------2222--------- QDGHVWVGYNTNSGKRVYLPVRTWNESTGELGPLWGTIK %%%%------1111------------------------- >DIACYLGLYCEROL KINASE, DE; SWP:Q16760; PDB:1R79A; GSSGSSGTTLASIGKDIIEDADGIAMPHQWLEGNLPVSAKCTVCDKTCGSVLRLQDWRCL ------------------------------------------------------------ WCKAMVHTSCKESLLTKCSGPSSG ------33333333---------- >SUCROSE PHOSPHORYLASE; SWP:Q84HQ2; PDB:1R7AA; MKNKVQLITYADRLGDGTIKSMTDILRTRFDGVYDGVHILPFFTPFDGADAGFDPIDHTK ---------1111----------------2222----------------iiii---1111 VDERLGSWDDVAELSKTHNIMVDAIVNHMSWESKQFQDVLAKGEESEYYPMFLTMSSVFP -3333------------------------1111--------!!!!1111----3333-11 NGATEEDLAGIYRPRPGLPFTHYKFAGKTRLVWVSFTPQQVDIDTDSDKGWEYLMSIFDQ 11-----1111-------------iiii--------1111---1111------------- MAASHVSYIRLDAVGYGAKEAGTSCFMTPKTFKLISRLREEGVKRGLEILIEVHSYYKKQ 1111---------1111--2222-------------------1111---------3333- 
VEIASKVDRVYDFALPPLLLHALSTGHVEPVAHWTDIRPNNAVTVLDTHDGIGVIDIGSD -3333------------------------------------------------3333--1 QLDRSLKGLVPDEDVDNLVNTIHANTHGESQAATGAAASNLDLYQVNSTYYSALGNDQHY 111-------3333---------1111--3333!!!!-----------3333---3333- IAARAVQFFLPGVPQVYYVGALAGKNDMELLRKTNNGRDINRHYYSTAEIDENLKRPVVK ------1111---------1111--------------3333------------------- ALNALAKFRNELDAFDGTFSYTTDDDTSISFTWRGETSQATLTFEPKRGLGVDNTTPVAM -----------3333-----------------------------3333--1111------ LEWEDSAGDHRSDDLIANPPVVA ----------------------- ------------------------------- >NRDH-REDOXIN; SWP:O69271; PDB:1R7HA; MSITLYTKPACVQCTATKKALDRAGLAYNTVDISLDDEARDYVMALGYVQAPVVEVDGEH ----------3333-------1111------3333--------1111------------- WSGFRPERIKQLQA --------3333-- >CONSERVED HYPOTHETICAL PR; SWP:Q5W1E8; PDB:1R7JA; KKSKLEIIQAILEACKSGSPKTRIMYGANLSYALTGRYIKMLMDLEIIRQEGKQYMLTKK --------------1111------------------------1111----!!!!------ GEELLEDIRKFNEMRKNMDQLKEKINSVLS ------------------------------ >PHAGE PROTEIN; SWP:Q81EU2_BACCR; PDB:1R7LA; AMKPRDINKLIASKIFGYEIKDDNIIKDGRYRLGIPLYSQNIESAWQVVEKLEYDVKVTK --------------------%%%%------------1111-3333--------------- TDLKPKYQVHVFVPGGVKMVFAETAPMAICKGALASV ---------------------------------1111 >INTRON-ENCODED ENDONUCLEA; SWP:P03882; PDB:1R7MA; NIKKNQVMNLGPNSKLLKEYKSQLIELNIEQFEAGIGLILGDAYIRSRDEGKTYCMQFEW --3333--------------3333------------------------------------ KNKAYMDHVCLLYDQWVLSPPHKKERVNHLGNLVITWGAQTFKHQAFNKLANLFIVNNKK -------------1111--------------------------3333--1111--%%%%- TIPNNLVENYLTPMSLAYWFMDDGGKWDYNKNSTNKSIVLNTQSFTFEEVEYLVKGLRNK ------1111-------------------------------1111--------------- FQLNCYVKINKNKPIIYIDSMSYLIFYNLIKPYLIPQMMYKLP ---------%%%%-----3333-------3333-11113333- >MPT51/MPB51 ANTIGEN; SWP:Q48923; PDB:1R88A; AAPYENLMVPSPSMGRDIPVAFLAGGPHAVYLLDAFNAGPDVSNWVTAGNAMNTLAGKGI -----------1111---------------------------3333---3333------- SVVAPAGGAYSMYTNWEQDGSKQWDTFLSAELPDWLAANRGLAPGGHAAVGAAQGGYGAM 
----------%%%%----11113333---------------------------------- ALAAFHPDRFGFAGSMSGFLYPSNTTTNGAIAAGMQQFGGVDTNGMWGAPQLGRWKWHDP -----3333-----------1111-----------------3333---33333333---3 WVHASLLAQNNTRVWVWSPTNPGASDPAAMIGQAAEAMGNSRMFYNQYRSVGGHNGHFDF 333----------------------3333-------------------1111-------- PASGDNGWGSWAPQLGAMSGDIVGAIR ------3333----------------- >TRNA NUCLEOTIDYLTRANSFERA; SWP:O28126; PDB:1R89A; MKVEEILEKALELVIPDEEEVRKGREAEEELRRRLDELGVEYVFVGSYARNTWLKGSLEI --------3333----1111---------------1111------3333----2222--- DVFLLFPEEFSKEELRERGLEIGKAVLDSYEIRYAEHPYVHGVVKGVEVDVVPCYKLKEP ----------3333--------3333-----------------iiii------------- KNIKSAVDRTPFHHKWLEGRIKGKENEVRLLKGFLKANGIYGAEYKVRGFSGYLCELLIV ----3333------------2222-----------1111----3333------------- FYGSFLETVKNARRWTRRTVIDVAKGEVRKGEEFFVVDPVDEKRNVAANLSLDNLARFVH -----------11111111---1111-----------1111---1111------------ LCREFMEAPSLGFFKPKHPLEIEPERLRKIVEERGTAVFAVKFRKPDIVDDNLYPQLERA -----------1111-------3333----------------------3333-------- SRKIFEFLERENFMPLRSAFKASEEFCYLLFECQIKEISRVFRRMGPQFEDERNVKKFLS --------1111-----------------------------------3333--------- RNRAFRPFIENGRWWAFEMRKFTTPEEGVRSYASTHWHTLGKNVGESIREYFEIISGEKL ---------iiii----------------------3333--------------------3 FKEPVTAELCEMMGVKD 333-------------- >TRANSCRIPTION ACTIVATOR M; SWP:P71039; PDB:1R8DA; MKYQVKQVAEISGVSIRTLHHYDNIELLNPSALTDAGYRLYSDADLERLQQILFFKEIGF --------------3333----1111-------1111------------------1111- RLDEIKEMLDHPNFDRKAALQSQKEILMKKKQRMDEMIQTIDRTLLSVD ----------3333--------------------------3333----- >MULTIDRUG-EFFLUX TRANSPOR; SWP:P39075; PDB:1R8EA; ESYYSIGEVSKLANVSIKALRYYDKIDLFKPAYVDPDTSYRYYTDSQLIHLDLIKSLKYI -------------------------------------------3333------------- GTPLEEMKKAQDLEMEELFAFYTEQERQIREKLDFLSALEQTISLVKKRMKRQMEYPALG --3333---------------------------------------------1111----- EVFVLDEEEIRIIQTEAEGIGPENVLNASYSKLKKFIESADGFTNNSYGATFSFQPYTSI ----------------iiii1111-3333-----------------------------33 
DEMTYRHIFTPVLTNKQISSITPDMEITTIPKGRYACIAYNFSPEHYFLNLQKLIKYIAD 33-------------------1111----------------------------------- RQLTVVSDVYELIIPIHYSPKKQEEYRVEMKIRIA ----------------------------------- >HYPOTHETICAL PROTEIN YBDK; SWP:P77213; PDB:1R8GA; LPDFHVSEPFTLGIELEQVVNPPGYDLSQDSSLIDAVKNKITAGEVKHDITESLELATDV ------------------------------------------------3333-------- CRDINQAAGQFSAQKVVLQAATDHHLEICGGGTHPFQKWNFGYLIQQATVFGQHVHVGCA ---------------------1111---------------!!!!---------------- SGDDAIYLLHGLSRFVPHFIALSAASPYQGTDTRFASSRPNIFSAFPDNGPPWVSNWQQF --------------------1111----------------1111-1111----------- EALFRCLSYTTIDSIKDLHWDIRPSPHFGTVEVRVDTPLTLSHAVNAGLIQATAHWLLTE ------3333---3333------------------------------------------- RPFKHQEKDYLLYKFNRFQACRYGLEGVITDPHTGDRRPLTEDTLRLLEKIAPSAHKIGA -----33331111----------------------------------------------3 SSAIEALHRQVVSGLNEAQLRDFVADGGSLIGLVKKHCEIWA 333------------3333----------------------- >REGULATORY PROTEIN E2; SWP:Q84294; PDB:1R8HA; SSATPIVQFQGESNCLKCFRYRLNDKHRHLFDLISSTWHWASPKAPHKHAIVTVTYHSEE --------------------------1111-----------1111--------------- QRQQFLNVVKIPPTIRHKLGFMSMHLL -----------3333-------3333- >TRAC; SWP:Q9L6G5; PDB:1R8IA; TELIKQGEQLEQMAQQLEQLKSQLETQKNMYESMAKTTNLGDLLGTSTNTLANNLPDNWK ---------------------------------1111-1111--1111------------ EVYSDAMNSSSSVTPSVNSMMGQFNAEVDDMTPSEAIAYMNKKLAEKGAYDRVMAEKAYN 33331111-------3333---1111---------------------1111--------- NQMQELSDMQALTEQIKSTPDLKSIADLQARIQTSQGAIQGEQAKLNLMNMLQQSQDKLL -----------------------------------------------------------3 RAQKDRA 333---- >KAIA; SWP:Q79PF6; PDB:1R8JA; VLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAAN -------------------------3333-----------------1111------1111 PSFRAVVQQLCFEGVVVPAIVVGDRDPAKEQLYHSAELHLGIHQLEQLPYQVDAALAEFL ----------1111-------------------1111---1111---------------- RLAPVETMADHIMLMDPELSSQQRDLAQRLQERLGYLGVYYKRDPDRFLRNLPAYESQKL -------------------------------------------33331111-3333---- 
HQAMQTSYREIVLSYFSPNSNLNQSIDNFVNMAFFADVPVTKVVEIHMELMDEFAKKLRV -----------1111-----------------------3333------------------ EGRSEDILLDYRLTLIDVIAHLCEMYRRSIPR -----3333-----------------1111-- >4-HYDROXYTHREONINE-4-PHOS; SWP:P58717; PDB:1R8KA; SAQRVVITPGEPAGSGPDLVVQLAQRAWPIELVVCADGALLTERAALGLPLSLLPYSPDV ----------1111------------------------------------------1111 PAAPQPAGTLTLLPVSLRAPAISGQLTVENGPYVVETLARACDGCLNGEFAALITGPVHK -----2222------------2222-3333------------------------------ GVINDAGISFTGHTEFFEERSQAKKVVLATEELRVALATTHLPLRAIADAITPALLHEVI ---1111----------------------1111---------33331111---------- AILHHDLRTKFGIAEPRILVCGLNPHAGEGGHGTEEIDTIIPVLDELRAQGKLNGPLPAD ------------------------%%%%iiii-3333----------1111------111 TLFQPKYLDNADAVLAYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGK 1--3333---------------------iiii-------------------3333----- ADVGSFITALNLAIKIVNTQ ---------------3333- >KUNITZ TRYPSIN INHIBITOR; SWP:P83667; PDB:1R8NA; SDAEKVYDIEGYPVFLGSEYYIVSAIIGAGGGGVRPGRTRGSMCPMSIIQEQSDLQMGLP -------1111---2222------1111----------2222----------1111---- VRFSSPEEKQGKIYTDTELEIEFVEKPDCAESSKWVIVKDSGEARVAIGGSEDHPQGELV --------------------------1111-------------------1111-3333-- RGFFKIEKLGSLAYKLVFCPKSDSGSCSDIGINYEGRRSLVLKSSDDVPFRVVFVKPRSG -------------------3333------------------------------------- SETES ----- >KUNITZ TRYPSIN INHIBITOR; SWP:P84144; PDB:1R8OA; RLVDTDGKPIENDGAEYYILPSVRGKGGGLVLAKSGGEKCPLSVVQSPSELSNGLPVRFK ---1111---------------------------!!!!----------3333-------- ASPRSKYISVGMLLGIEVIESPECAPKPSMWSVKSG ---------------------1111----------- >KUNITZ TRYPSIN INHIBITOR; SWP:P84145; PDB:1R8OB; WKLPSVTVGNPKVSVFGGPFKIEEGKSGYKDVYSSSKGRDLDDGIEVNKKKEKRLVVKDG ----------------------------------1111---------1111------222 NPFIIRFKKSG 2---------- >ADP-RIBOSYLATION FACTOR 1; SWP:P32889; PDB:1R8SA; MRILMVGLDAAGKTTILYKLKLGEIVTTIPTIGFNVETVEYKNISFTVWDVGGQDKIRPL -------2222---------3333----------------1111---------3333--- WRHYFQNTQGLIFVVDSNDRERVNEAREELMRMLAEDELRDAVLLVFANKQDLPNAMNAA 
----2222-------11111111------------3333----------3333------- EITDKLGLHSLRHRNWYIQATCATSGDGLYEGLDWLSNQL ------3333------------1111----------1111 >Cytohesin-2; SWP:Q99418; PDB:1R8SE; NRKMAMGRKKFNMDPKKGIQFLVENELLQNTPEEIARFLYKGEGLNKTAIGDYLGEREEL ----------------------1111---------------2222-------1111---- NLAVLHAFVDLHEFTDLNLVQALRQFLWSFRLPGKAQKIDRMMEAFAQRYCLCNPGVFQS --------1111-2222--------1111------------------------2222--- TDTCYVLSYSVIMLNTDLHNPNVRDKMGLERFVAMNRGINEGGDLPEELLRNLYDSIRNE -------------------3333----------1111--iiii----------------- PFKIPED ------- >GLYCINE N-METHYLTRANSFERA; SWP:Q9QXF8; PDB:1R8XA; VDSVYRTRSLGVAAEGLPDQYADGEAARVWQLYIGDTRSRTAEYKAWLLGLLRQHGCHRV --------2222-1111---1111--------1111-----3333-------1111---- LDVACGTGVDSIMLVEEGFSVMSVDASDKMLKYALKERWNRRKEPSFDNWVIEEANWLTL ----!!!!------------------3333--------1111--3333-------1111- DKDVLSGDGFDAVICLGNSFAHLPDCKGDQSEHRLALKNIASMVRPGGLLVIDHRNYDYI ------------------1111--3333------------11112222------------ LSTGCAPPGKNIYYKSDLTKDITTSVLTVNNKAHMVTLDYTVQVGFSKFRLSYYPHCLAS ----------------------------iiii---------------------------- FTELVRAAFGGRCQHSVLGDFKPYKPGQAYVPCYFIHVLKKTD ------1111--------------2222--------------- >GLUTATHIONE TRANSFERASE; SWP:Q98GG1; PDB:1R9CA; MIEGLSHMTFIVRDLERMTRILEGVFDAREVYASDTEQFSLSREKFFLIGDIWVAIMQGE ------------------------------------------------!!!!-------- KLAERSYNHIAFKIDDADFDRYAERVGKLGLDMRPPRPGRSIYFYDDDNHMFELHTGTLT --------------3333--------1111------------------------------ ERLAR ----- >GLYCEROL DEHYDRATASE; SWP:Q8GEZ8; PDB:1R9DA; ISKGFSTQTERINILKAQILNAKPCVESERAILITESFKQTEGQPAILRRALALKHILEN --------3333------1111------------------1111---------------- IPITIRDQELIVGSLTKEPRSSQVFPEFSNKWLQDELDRLNKRTGDAFQISEESKEKLKD -----2222---------------3333-------11111111----------------- VFEYWNGKTTSELATSYMTEETREAVNCDVFTVGNYYYNGVGHVSVDYGKVLRVGFNGII ----2222------3333------1111-------------------------------- NEAKEQLEKNRSIDPDFIKKEKFLNSVIISCEAAITYVNRYAKKAKEIADNTSDAKRKAE 
-------1111------------------------------------------------- LNEIAKICSKVSGEGAKSFYEACQLFWFIHAIINIESNGHSISPARFDQYMYPYYENDKN ---------------------------------------------3333----------- ITDKFAQELIDCIWIKLNDINKVRDEISTKHFGGYPMYQNLIVGGQNSEGKDATNKVSYM ------------------------33333333--------------1111----3333-- ALEAAVHVKLPQPSLSVRIWNKTPDEFLLRAAELTREGLGLPAYYNDEVIIPALVSRGLT -------------------11113333---------------------------1111-- LEDARDYGIIGCVEPQKPGKTEGWHDSAFFNLARIVELTINSGFDKNKQIGPKTQNFEEM ---1111---------2222------------------------%%%%-------3333- KSFDEFMKAYKAQMEYFVKHMCCADNCIDIAHAERAPLPFLSSMVDNCIGKGKSLQDGGA --------------------------------------3333----3333---3333--- EYNFSGPQGVGVANIGDSLVAVKKIVFDENKITPSELKKTLNNDFKNSEEIQALLKNAPK --------------------------------------------2222------1111-- FGNDIDEVDNLAREGALVYCREVNKYTNPRGGNFQPGLYPSSINVYFGSLTGATPDGRKS ----3333-------------------1111----------------------1111-22 GQPLADGVSPSRGCDVSGPTAACNSVSKLDHFIASNGTLFNQKFHPSALKGDNGLMNLSS 22---!!!!-2222-----------1111----1111-------3333--3333------ LIRSYFDQKGFHVQFNVIDKKILLAAQKNPEKYQDLIVRVAGYSAQFISLDKSIQNDIIA -----1111---------3333------33331111---------1111-3333---111 RTEHVM 1----- >CORE PROTEIN P19; SWP:P50627; PDB:1R9FA; HTSPFKLPDESPSWTEWRLHNDETQDNPLGFKESWGFGKVVFKRYLRYDRTEASLHRVLG --------------------1111------------!!!!-------------------- SWTGDSVNYAASRFFGFDQIGCTYSIRFRGVSITVSGGSRTLQHLCEAIRSKQELQA -----------1111------------iiii------3333---------------- >FK506 BINDING PROTEIN FAM; SWP:O45418; PDB:1R9HA; KIDITPKKDGGVLKLIKKEGQGVVKPTTGTTVKVHYVGTLENGTKFDSSRDRGDQFSFNL ----1111------------------2222---------1111----3333--------- GRGNVIKGWDLGVATMTKGEVAEFTIRSDYGYGDAGSPPKIPGGATLIFEVELFEWSA -----3333-------2222------3333-----------2222------------- >TRANSKETOLASE; SWP:NA; PDB:1R9JA; HMASIEKVANCIRCLAADIVQGGKSGHPGTPMGMAPMSAVLWTEVMKYNSQDPDWVDRDR --------------------1111------------------------1111--1111-- FVMSNGHGCALQYALLHMAGYNLTMDDLKGFRQDGSRTPGHPERFVTPGVEVTTGPLGQG 
----3333---------------------2222-------------2222-----2222- IANAVGLAIAEAHLAATFNRPGYNIVDHYTYVYCGDGCLMEGVCQEALSLAGHLALEKLI -------------------2222-----------3333-----------------1111- VIYDSNYISIDGSTSLSFTEQCHQKYVAMGFHVIEVKNGDTDYEGLRKALAEAKATKGKP --------33331111----------1111------------------------------ KMIVQTTTIGFGSSKQGTEKVHGAPLGEEDIANIKAKFGRDPQKKYDVDDDVRAVFRMHI ------------1111-3333--------------1111--------------------- DKCSAEQKAWEELLAKYTAAFPAEGAAFVAQMRGELPSGWEAKLPTNSSAIATRKASENC ------------------------------1111----3333------------------ LAVLFPAIPALMGGSADLTPSNLTRPASANLVDFSSSSKEGRYIRFGVREHAMCAILNGL ----3333---------1111----3333-----1111---------------------- DAHDGIIPFGGTFLNFIGYALGAVRLAAISHHRVIYVATHDSIGVGEDGPTHQPVELVAA ------------3333-------------------------333333331111------- LRAMPNLQVIRPSDQTETSGAWAVALSSIHTPTVLCLSRQNTEPQSGSSIEGVRHGAYSV 3333----------------------------------------11113333-------- VDVPDLQLVIVASGSEVSLAVDAAKALSGELRVRVVSMPCQELFDAQPDTYRQAVLPAGV ------------!!!!--------1111---------------1111---------2222 PVVSVEAYVSFGWEKYSHAHVGMSGFGASAPAGVLYKKFGITVEEVVRTGRELAKRFPDG ---------2222----------------------------------------------- TAPLKNSSFS ----3333-- >TypeIII-secreted protein ; SWP:Q7CQD4; PDB:1R9KA; EGRAVLTSKTVKDFMLQKLNSLDIKGNASKDPAYARQTCEAILSAVYSNNKDQCCKLLIS ------------------------------------------------------------ KGVSITPFLKEIGEAAQNAGLPGEIKNGVFTPGGAGANPFVVPLIASASIKYPHMFINHN -------------------------------------3333-----------3333-111 QQVSFKAYAEKIVMKEVTPLFNKGTMPTPQQFQLTIENIANKYLQNAS 1-----------1111--1111-------------------------- >GLYCINE BETAINE-BINDING P; SWP:P14177; PDB:1R9LA; ADLPGKGITVNPVQSTITEETFQTLLVSRALEKLGYTVNKPSEVDYNVGYTSLASGDATF --1111----------3333-----------1111------------------------- TAVNWTPLHDNMYEAAGGDKKFYREGVFVNGAAQGYLIDKKTADQYKITNIAQLKDPKIA ------1111------!!!!------------------------------------3333 KLFDTNGDGKADLTGCNPGWGCEGAINHQLAAYELTNTVTHNQGNYAAMMADTISRYKEG 1111------------2222------------------------------------1111 
KPVFYYTWTPYWVSNELKPGKDVVWLQVPFSALPGDKNADTKLPNGANYGFPVSTMHIVA -----------3333--2222-----------2222------1111-------------- NKAWAEKNPAAAKLFAIMQLPVADINAQNAIMHDGKASEGDIQGHVDGWIKAHQQQFDGW --------------------3333-------1111------------------------- VNEALAAQK ---1111-- >CYTOCHROME P450 2C9; SWP:P11712; PDB:1R9OA; RGKLPPGPTPLPLQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDL -------------------------3333------------------------------- GEEFSGRGIFPLAERANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAR 3333---------3333----1111--------------1111----------------- CLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSS --------%%%%-------------------------1111------------------- PWIPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHN -----------3333-----------------3333--1111------------------ QPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSP ------------------------------------------------------------ CMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHD --3333-------------3333---------------!!!!--2222----33331111 NKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFN -----1111-------1111----11111111-11111111------------------- LKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPIHH -----3333-------------------------- >REPLICATION PROTEIN E1; SWP:P06789; PDB:1R9WA; KQGAMLAVFKDTYGLSFTDLVRTCTDWVTAIFGVNPTIAEGFKTLIQPFILYAHIQCLDC ---------------3333--------------------------3333----------- KWGVLILALLRYKCGKSRLTVAKGLSTLLHVPETCMLIQPPKLRSSVAALYWYRTGISNI -------------------------------1111------------------------- SEVMGDTPEWIQRLTIIQ -------3333------- >CYTOSINE DEAMINASE; SWP:P25524; PDB:1RA0A; ALQTIINARLPGEEGLWQIHLQDGKISAIDAQSGVMPITENSLDAEQGLVIPPFVEPHIH ---------2222--------iiii-------------2222--%%%%------------ LDTTQTAGQPNWNQSGTLFEGIERWAERKALLTHDDVKQRAWQTLKWQIANGIQHVRTHV --2222------3333------------1111---------------------------- DVSDATLTALKAMLEVKQEVAPWIDLQIVAFPQEGILSYPNGEALLEEALRLGADVVGAI ---1111------------3333--------1111---2222-------1111------3 
PHFEFTREYGVESLHKTFALAQKYDRLIDVHCDEIDDEQSRFVETVAALAHHEGMGARVT 333---------------------------------1111--------------3333-- ASHTTAMHSYNGAYTSRLFRLLKMSGINFVANPLVNIHLQGRFDTYPKRRGITRVKEMLE ------1111---------------------3333-----1111--------------11 SGINVCFGHDGVFDPWYPLGTANMLQVLHMGLHVCQLMGYGQINDGLNLITHHSARTLNL 11-----------1111---------------1111--------3333-------1111- QDYGIAAGNSANLIILPAENGFDALRRQVPVRYSVRGGKVIASTQPAQTTVYLEQPEAID -----2222--------------------------iiii--------------------- YKR --- >GENOME POLYPROTEIN; SWP:P03300; PDB:1RA6A; GEIQWMRPSKEVGYPIINAPSKTKLEPSAFHYVFEGVKEPAVLTKNDPRLKTDFEEAIFS -------3333----------------1111------------1111-----------33 KYVGNKITEVDEYMKEAVDHYAGQLMSLDINTEQMLEDAMYGTDGLEALDLSTSAGYPYV 33----------------------1111--------3333--2222---1111------- AMGKKKRDILNKQTRDTKEMQKLLDTYGINLPLVTYVKDELRSKTKVEQGKSRLIEASSL ----3333----------------------------------3333-------------- NDSVAMRMAFGNLYAAFHKNPGVITGSAVGDPDLFWSKIPVLMEEKLFAFDYTGYDASLS ------------------------------333333333333------------3333-3 PAWFEALKMVLEKIGFGDRVDYIDYLNHSHHLYKNKTYVKGGMPSGSGTSIFNSMINNLI 333--------11113333--3333-------!!!!------------------------ IRTLLLKTYKGIDLDHLKMIAYGDDVIASYPHEVDASLLAQSGKDYGLTMTPADKSATFE --------22223333-----!!!!----------------3333-------%%%%---- TVTWENVTFLKRFFRADEKYPFLIHPVMPMKEIHESIRWTKDPRNTQDHVRSLCLLAWHN --3333--iiii----3333---------------------3333--------------- GEEEYNKFLAKIRSVPIGRALDLPEYSTLYDRWLDSF ----------11113333------------------- >DIHYDROFOLATE REDUCTASE; SWP:P00379; PDB:1RA9; MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLDKPVIMGRHTWESIGRPLPGRKNI ---------%%%%----------------------------------------------- ILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVE --------1111----3333--1111----------------3333-------------- GDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR --------3333-----------1111------------ >REP A2 ISO-1-CYTOCHROME C; SWP:P00044; PDB:1RAP; GSAKKGATLFKTRCLQCHTFDQGGANKVGPNLHGIFGRHSGQAEGYSYTDANIKKNVLWD 
-------------3333---2222-------2222-------2222------3333---1 ENNMSEYLTNPKKYIPGTKMAFGGLKKEKDRNDLITYLKKACE 111------3333-2222------------------------- >PROTEIN (RA-DOMAIN OF RAL; SWP:Q12967; PDB:1RAXA; QQVGDCCIIRVSLDVDNGNMYKSILVTSQDKAPAVIRKAMDKHNLEEEEPEDYELLQILS ------------------------------3333------111133333333-------- DDRKLKIPENANVFYAMNSTANYDFVLKKRTFT -----------3333--3333------------ >RUBREDOXIN; SWP:P00269; PDB:1RB9; MKKYVCTVCGYEYDPAEGDPDNGVKPGTSFDDLPADWVCPVCGAPKSEFEAA -------------3333-3333--22223333-1111-------3333---- >RIBULOSE 1,5 BISPHOSPHATE; SWP:P00880; PDB:1RBLA; SAAGYKAGVKDYKLTYYTPDYTPKDTDLLAAFRFSPQPGVPADEAGAAIAAESSTGTWTT ----------3333---1111--1111---------2222-------------------- VWTDLLTDMDRYKGKCYHIEPVAGEENSYFAFIAYPLDLFEEGSVTNILTSIVGNVFGFK 3333---3333----------2222----------3333-2222----------333311 AIRSLRLEDIRFPVALVKTFQGPPHGIQVERDLLNKYGRPMLGCTIKPKLGLSAKNYGRA 11----------33331111---------------------------------------- VYECLRGGLDFTKDDENINSQPFQRWRDRFLFVADAIHKSQAETGEIKGHYLNVTAPTCE ---3333-------1111--1111------------------------------------ EMMKRAEFAKELGMPIIMHDFLTAGFTANTTLAKWCRDNGVLLHIHRAMHAVIDRQRNHG ---------1111------3333-----------------------2222-----1111- IHFRVLAKCLRLSGGDHLHSGTVVGKLEGDKASTLGFVDLMREDHIEADRSRGVFFTQDW ------------------------------------------------3333-------% ASMPGVLPVASGGIHVWHMPALVEIFGDDSVLQFGGGTLGHPWGNAPGATANRVALEACV %%%-----------3333----------------1111--1111---------------- QARNEGRDLYREGGDILREAGKWSPELAAALDLWKEIKFEFETMDKL --1111-3333----------------------1111---------- >Ribulose bisphosphate car; SWP:P04716; PDB:1RBLM; SMKTLPKERRFETFSYLPPLSDRQIAAQIEYMIEQGFHPLIEFNEHSNPEEFYWTMWKLP -----------2222--------------------------------------------- LFACAAPQQVLDEVRECRSEYGDCYIRVAGFDNIKECQTSSFIVHRPGR -----3333-----------1111------------------------- >HYPOTHETICAL PROTEIN YLBA; SWP:P75713; PDB:1RC6A; GYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQ ------------------------------2222---------3333-----------22 
NGGNQQGFGGEGIETFLYVISGNITAKAEGKTFALSEGGYLYCPPGSLMTFVNAQAEDSQ 22-------2222--------------iiii----2222--------------------- IFLYKRRYVPVEGYAPWLVSGNASELERIVILLDFLPKELGFDMNMHILSFAPGASHGYI ----------2222-------3333-------------3333------------------ ETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYGVAFSYIYSKDCNRDV ---------------------------2222----------------------------- EI -- >CYSTEINE-RICH SECRETORY P; SWP:P60623; PDB:1RC9A; NVDFDSESPRKPEIQNEIVDLHNSLRRSVNPTASNMLRMEWYPEAADNAERWAYRCIESH --3333---------------------------------------------3333----- SSYESRVIEGIKCGENIYMSPYPMKWTDIIHAWHDEYKDFKYGVGADPPNAVTGHYTQIV -3333--iiii-------------------------1111-------3333-3333---- WYKSYRIGCAAAYCPSSPYSYFFVCQYCPAGNFIGKTATPYTSGTPCGDCPSDCDNGLCT 1111---------1111---------------2222--------2222-1111-iiii-- NPCTRENKFTNCNTMVQQSSCQDNYMKTNCPASCFCQNKII ---------------1111-----------3333-1111-- >L FERRITIN; SWP:P07797; PDB:1RCD; SQVRQNFHQDCEAGLNRTVNLKFHSSYVYLSMASYFNRDDVALSNFAKFFRERSEEEKEH 1111---------------------------------1111------------------- AEKLIEYQNQRGGRVFLQSVEKPERDDWANGLEALQTALKLQKSVNQALLDLHAVAADKS --------1111--------------------------------------------1111 DPHMTDFLESPYLSESVETIKKLGDHITSLKKLWSSHPGMAEYLFNKHTLG --------------------------------------------------- >CATABOLIC ALANINE RACEMAS; SWP:Q9HTQ2; PDB:1RCQA; MRPARALIDLQALRHNYRLAREATGARALAVIKADAYGHGAVRCAEALAAEADGFAVACI -----------------------------------iiii--------3333--------- EEGLELREAGIRQPILLLEGFFEASELELIVAHDFWCVVHCAWQLEAIERASLARPLNVW ------1111------1111--3333---------------------------------- LMDSGMHRVGFFPEDFRAAHERLRASGKVAKIVMMSHFSRADELDCPRTEEQLAAFSAAS -----------3333------------------------1111----------------2 QGLEGEISLRNSPAVLGWPKVPSDWVRPGILLYGATPFERAHPLADRLRPVMTLESKVIS 222--------------1111-------3333---------3333--------------- VRDLPAGEPVGYGARYSTERRQRIGVVAMGYADGYPRHAADGTLVFIDGKPGRLVGRVSM ----------2222---------------3333--11112222---iiii--------11 DMLTVDLTDHPQAGLGSRVELWGPNVPVGALAAQFGSIPYQLLCNLKRVPRVYSGA 
11----1111---2222-----1111-----------3333-1111---------- >CONSERVED HYPOTHETICAL PR; SWP:Q9X0E5; PDB:1RCUA; KKVVVVGYSGPVNKSPVSELRDICLELGRTLAKKGYLVFNGGRDGVELVSQGVREAGGTV ----------1111-3333------------1111------------------1111--- VGILPDEEAGNPYLSVAVKTGLDFQRSFVLLRNADVVVSIGGEIGTAIEILGAYALGKPV ------------------------------1111-------------------1111--- ILLRGTGGWTDRISQVLIDGKYLDNRRIVEIHQAWTVEEAVQIIEQI -----------1111-------------------------------- >CT610; SWP:O84616; PDB:1RCWA; NFLDQLDLIIQNKHLEHTFYVKWSKGELTKEQLQAYAKDYYLHIKAFPKYLSAIHSRCDD ----------------3333--1111----------------------------1111-- LEARKLLLDNLDEENGYPNHIDLWKQFVFALGVTPEELEAHEPSEAAKAKVATFRWCTGD ----------------------------1111----------------------3333-- SLAAGVAALYSYESQIPRIAREKIRGLTEYFGFSNPEDYAYFTEHEEADVRHAREEKALI -----------3333--------------------------------------------- ELLKDDADKVLEASQEVTQSLYGFLDSFL -----3333----------------1111 >TRYPTOPHAN SYNTHASE ALPHA; SWP:P42390; PDB:1RD5A; SRPVSDTMAALMAKGKTAFIPYITAGDPDLATTAEALRLLDGCGADVIELGVPCSDPYID -----------1111--------2222-------------1111------------1111 GPIIQASVARALASGTTMDAVLEMLREVTPELSCPVVLLSYYKPIMFRSLAKMKEAGVHG -----------1111------------3333----------3333----33331111--- LIVPDLPYVAAHSLWSEAKNNNLELVLLTTPAIPEDRMKEITKASEGFVYLVSVNGVTGP --11113333--------1111-------3333-------------------------11 RANVNPRVESLIQEVKKVTNKPVAVGFGISKPEHVKQIAQWGADGVIIGSAMVRQLGEAA 11------------3333------------------------------------------ SPKQGLRRLEEYARGMKNALG --------------------- >Mannose-binding protein C; SWP:P08661; PDB:1RDO1; KYFMSSVRRMPLNRAKALCSELQGTVATPRNAEENRAIQNVAKDVAFLGITDQRTENVFE -------------------1111----------------------------3333----- DLTGNRVRYTNWNEGEPNNVGSGENCVVLLTNGKWNDVPCSDSFLVVCEFS ------------2222---!!!!------1111-----1111--------- >cAMP-dependent protein ki; SWP:P05132; PDB:1RDQE; GNAAASVKEFLAKAKEDFLKKWETPSQNTAQLDQFDRIKTLGTGSFGRVMLVKHKESGNH ------------------------------1111--------------------1111-- YAMKILDKQKVVKLKQIEHTLNEKRILQAVNFPFLVKLEFSFKDNSNLYMVMEYVAGGEM 
-----------1111----------3333--1111-----------------------33 FSHLRRIGRFSEPHARFYAAQIVLTFEYLHSLDLIYRDLKPENLLIDQQGYIQVTDFGFA 33-------------------------------------3333---1111------1111 KRVKGRTWLCGTPEALAPEIILSKGYNKAVDWWALGVLIYEMAAGYPPFFADQPIQIYEK -----------3333-3333--------------------------------3333---- IVSGKVRFPSHFSSDLKDLLRNLLQVDLTKRFGNLKNGVNDIKNHKWFATTDWIAIYQRK --------1111--------------111122221111---11111111----------- VEAPFIPKFKGPGDTSNFDDYEEEEIRVINEKCGKEFTEF ----------11111111--------------33331111 >RIBONUCLEASE MS; SWP:P00653; PDB:1RDS; ESCEYTCGSTCYWSSDVSAAKAKGYSLYESGDTIDDYPHEYHDYEGFDFPVSGTYYEYPI ------!!!!--3333-----------1111--%%%%-----3333-------------- MSDYDVYTGGSPGADRVIFNGDDELAGVITHTGASGDDFVACSSS 1111---------------1111-------2222----------- >CONSERVED HYPOTHETICAL PR; SWP:Q9X116; PDB:1RDUA; MARVAIPSVGKDLSSMVSDRFARAEYFIIYDTESGNVEVVENTIADAHGTGPKVVQSLVS ------------------------------------------------------------ KGVEYLIASNVGRNAFETLKAAGVKVYRFEGGTVQEAIDAFSEGRLEELTTFTREG --------------33333333----------3333-------------------- >ARF guanine-nucleotide ex; SWP:P47102; PDB:1RE0B; GSHMASDRKTEFILCVETFNEKAKKGIQMLIEKGFIDSDSNRDIASFLFLNNGRLNKKTI 1111--------------------------1111----------------3333------ GLLLCDPKKTSLLKEFIDLFDFKGLRVDEAIRILLTKFRLPGESQQIERIVEAFSSKYSA --1111----------1111-2222----------------------------------1 DQSVQPDADSVFVLSYSIIMLNTDSHNPQVKDHMTFDDYSNNLRGCYNGKDFPRWYLHKI 111-----------------------3333------------2222iiii---------- YTSIKVKEIVMPEEH ----------3333- >3-CARBOXY-CIS,CIS-MUCONAT; SWP:Q88N37; PDB:1RE5A; NQLFDAYFTAPAMREIFSDRGRLQGMLDFEAALARAEASAGLVPHSAVAAIEAACQAERY -1111----3333--------------------------------------11113333- DTGALANAIATAGNSAIPLVKALGKVIATGVPEAERYVHLGATSQDAMDTGLVLQLRDAL ------------------------------1111111122223333-------------- DLIEADLGKLADTLSQQALKHADTPLVGRTWLQHATPVTLGMKLAGVLGALTRHRQRLQE -----------------------------%%%%--------------------------- LRPRLLVLQFGGASGSLAALGSKAMPVAEALAEQLKLTLPEQPWHTQRDRLVEFASVLGL 
----------------33331111--------1111------------------------ VAGSLGKFGRDISLLMQTEAGEVFEPSAPKRNPVGAAVLIGAATRVPGLLSTLFAAMPQE ----------------1111------------3333-----------------1111--! HERSLGLWHAEWETLPDICCLVSGALRQAQVIAEGMEVDAARMRRNLDLTQGLVLAEAVS !!!--3333--------------------------------------11111111----- IVLAQRLGRDRAHHLLEQCCQRAVAEQRHLRAVLGDEPQVSAELSGEELDRLLDPAHYLG -3333--3333---------------------3333--------------11111111!! QARVWVARAVSEHQRFTA !!---------------- >DYNEIN LIGHT CHAIN 2; SWP:NA; PDB:1RE6A; SMSDRKAVIKNADMSEDMQQDAVDCATQAMEKYNIEKDIAAYIKKEFDKKYNPTWHCIVG ----------------------------------3333---------------------- RNFGSYVTHETKHFIYFYLGQVAILLFKSG ------------------------------ >T4 REGA; SWP:P04528; PDB:1REGX; MIEITLKKPEDFLKVKETLTRMGIANNKDKVLYQSCHILQKKGLYYIVHFKEMLRMDGRQ -------1111-----------------------------iiii----------1111-- VEMTEEDEVRRDSIAWLLEDWGLIEIVPGQRTFMKDLTNNFRVISFKQKHEWKLVPKYTI --------------------------------------------33331111---3333- GN -- >AHPLAAO; SWP:Q6STF1; PDB:1REOA; DRNPLEECFRETDYEEFLEIARNGLKATSNPKHVVVVGAGMSGLSAAYVLSGAGHQVTVL --11111111----------------------------------------1111------ EASERAGGRVRTYRNDKEDWYANLGPMRLPEKHRIVREYIRKFGLQLNEFSQENDNAWYF ------!!!!-----1111----------1111-------1111---------1111--- IKNIRKRVGEVKKDPGVLKYPVKPSEEGKSAGQLYEESLGKVVEELKRTNCSYILNKYDT %%%%-----------1111---1111-----------------------3333---1111 YSTKEYLLKEGNLSPGAVDMIGDLMNEDSGYYVSFPESLRHDDIFAYEKRFDEIVGGMDK --------------------------1111-----------------------2222--- LPTSMYRAIEEKVHLNAQVIKIQKNAEKVTVVYQTPAKEMASVTADYVIVCTTSRATRRI --------3333------------!!!!------------------------33331111 KFEPPLPPKKAHALRSVHYRSGTKIFLTCTKKFWEDEGIHGGKSTTDLPSRFIYYPNHNF -------------------------------3333------------3333--------3 TSGVGVIIAYGIGDDANFFQALDFKDCADIVINDLSLIHQLPREEIQTFCYPSMIQKWSL 333--------!!!!-1111---------------------3333-----------3333 DKYAMGGITTFTPYQFQHFSESLTASVDRIYFAGEHTAEAHGWIDSTIKSGLRAARDVNR -----------22221111-3333--2222---3333----------------------3 ASEQ 333- 
>Replication initiation pr; SWP:P03856; PDB:1REPC; SPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSHDGICEIHVAKYAEIFGLTSAEASK ------------------------------------------------1111-3333--- DIRQALKSFAGKEVVFYESFPWFIKPAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLS --------2222------------------2222-----33333333----------333 ETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVC 31111------------11113333------------1111-3333-3333--------- VNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDIT ---------------------------------- >METHYLMALONYL-COA MUTASE; SWP:P11653; PDB:1REQA; STLPRFDSVDLGNAPVPADAARRFEELAAKAGTGEAWETAEQIPVGTLFNEDVYKDMDWL -----1111-------1111--------3333------1111-------33331111--- DTYAGIPPFVHGPYATMYAFRPWTIRQYAGFSTAKESNAFYRRNLAAGQKGLSVAFDLPT --2222--1111-1111---------------------------1111------------ HRGYDSDNPRVAGDVGMAGVAIDSIYDMRELFAGIPLDQMSVSMTMNGAVLPILALYVVT ----11111111-2222--------------22221111--------------------- AEEQGVKPEQLAGTIQNDILKEFMVRNTYIYPPQPSMRIISEIFAYTSANMPKWNSISIS -1111-3333--------3333----------------------------1111------ GYHMQEAGATADIEMAYTLADGVDYIRAGESVGLNVDQFAPRLSFFWGIGMNFFMEVAKL ---------3333----------------1111-33333333------------------ RAARMLWAKLVHQFGPKNPKSMSLRTHSQTSGWSLTAQDVYNNVVRTCIEAMAATQGHTQ -----------1111--3333---------3333----1111------------1111-- SLHTNSLDEAIALPTDFSARIARNTQLFLQQESGTTRVIDPWSGSAYVEELTWDLARKAW -----1111------------------------------1111----------------- GHIQEVEKVGGMAKAIEKGIPKMRIEEAAARTQARIDSGRQPLIGVNKYRLEHEPPLDVL ------1111-------------------------3333---2222-------------- KVDNSTVLAEQKAKLVKLRAERDPEKVKAALDKITWAAGNPDDKDPDRNLLKLCIDAGRA -----------------------------------------11111111--------111 MATVGEMSDALEKVFGRYTAQIRTISGVYSKEVKNTPEVEEARELVEEFEQAEGRRPRIL 1--------------------------3333----------------------------- LAKMGQDGHDRGQKVIATAYADLGFDVDVGPLFQTPEETARQAVEADVHVVGVSSLAGGH --------------------1111----------3333-----1111-----------33 LTLVPALRKELDKLGRPDILITVGGVIPEQDFDELRKDGAVEIYTPGTVIPESAISLVKK 
33---------1111------------3333----1111-----2222------------ LRASLDA ------- >Methylmalonyl-CoA mutase ; SWP:P11652; PDB:1REQB; TLSLAGDFPKATEEQWEREVEKVLNRGRPPEKQLTFAECLKRLTVHTVDGIDIVPMYRPK ---1111---------------------------------1111--1111-------333 DAPKKLGYPGVAPFTRGTTVRNGDMDAWDVRALHEDPDEKFTRKAILEGLERGVTSLLLR 3------2222--3333--------------------3333------------------- VDPDAIAPEHLDEVLSDVLLEMTKVEVFSRYDQGAAAEALVSVYERSDKPAKDLALNLGL -1111-3333----11113333---------------------1111--3333------- DPIGFAALQGTEPDLTVLGDWVRRLAKFSPDSRAVTIDANIYHNAGAGDVAELAWALATG --------------1111----1111--3333----------1111-------------- AEYVRALVEQGFTATEAFDTINFRVTATHDQFLTIARLRALREAWARIGEVFGVDEDKRG -------1111-3333-1111---------------------------------1111-- ARQNAITSWRELTREDPYVNILRGSIATFSASVGGAESITTLPFTQALGLPEDDFPLRIA -------3333----3333-----------------------1111-------------- RNTGIVLAEEVNIGRVNDPAGGSYYVESLTRSLADAAWKEFQEVEKLGGMSKAVMTEHVT -----------1111--1111-----------------------1111------------ KVLDACNAERAKRLANRKQPITAVSEFPMIGARSIETKPFPAAPARKGLAWHRDSEVFEQ --------------------2222----2222--------------------1111---- LMDRSTSVSERPKVFLACLGTRRDFGGREGFSSPVWHIAGIDTPQVEGGTTAEIVEAFKK --------------------3333------------1111-------------------- SGAQVADLCSSAKVYAQQGLEVAKALKAAGAKALYLSGAFKEFGDDAAEAEKLIDGRLFM --------------------------------------3333!!!!------------22 GMDVVDTLSSTLDILGVAK 22----------------- >GAMMA DELTA-RESOLVASE; SWP:P03012; PDB:1RES; GRKRKIDRDAVLNMWQQGLGASHISKTMNIARSTVYKVINESN -------------3333-------------3333----3333- >BONE MORPHOGENETIC PROTEI; SWP:P12643; PDB:1REWA; SSCKRHPLYVDFSDVGWNDWIVAPPGYHAFYCHGECPFPLADHLNSTNHAIVQTLVNSVN ----------3333--------------------------3333---------------3 SKIPKACCVPTELSAISMLYLDENEKVVLKNYQDMVVEGCGCR 333------------------1111------------------ >Bone morphogenetic protei; SWP:P36894; PDB:1REWC; TLPFLKCYCSGHCPDDAINNTCITNGHCFAIIEEDDQGETTLASGCMKYEGSDFQCKDSP -----------------%%%%-------------1111----------2222------11 KAQLRRTIECCRTNLCNQYLQPTLPP 
11----------22221111------ >5-ENOLPYRUVYLSHIKIMATE-3-; SWP:Q9S400; PDB:1RF6A; MKLKTNIRHLHGIIRVPGDKSISHRSIIFGSLAEGETKVYDILRGEDVLSTMQVFRDLGV -------------------------------------------------------1111- EIEDKDGVITVQGVGMAGLKAPQNALNMGNSGTSIRLISGVLAGADFEVEMFGDDSLSKR ----iiii-------------------!!!!----------1111---------3333-- PMDRVTLPLKKMGVSISGQTERDLPPLRLKGTKNLRPIHYELPIASAQVKSALMFAALQA -3333-3333---------1111---------------------------------1111 KGESVIIEKEYTRNHTEDMLQQFGGHLSVDGKKITVQGPQKLTGQKVVVPGDISSAAFWL --------------------1111-----!!!!-------------------3333---- VAGLIAPNSRLVLQNVGINETRTGIIDVIRAMGGKLEITEIDPVAKSATLIVESSDLKGT ------------------1111-------------------------------------- EICGALIPRLIDELPIIALLATQAQGVTVIKDAEELKVKETDRIQVVADALNSMGADITP -----33331111-------1111---------3333--------------1111----- TADGMIIKGKSALHGARVNTFGDHRIGMMTAIAALLVADGEVELDRAEAINTSYPSFFDD -------------------%%%%------------------------------1111--- LESLIHG -1111-- >Eukaryotic initiation fac; SWP:P39935; PDB:1RF8B; GSIGLEAEIETTTDETDDGTNTVSHILNVLKDATPIEDVFSFNYPEGIEGPDIKYKKEHV ---------------------3333-3333--------3333-----------1111--- KYTYGPTFLLQFKDKLNVKADAEWVQSTASKIVIPPGMGR ----3333--3333--------33333333---------- >HYPOTHETICAL PROTEIN RV29; SWP:O53240; PDB:1RFEA; TKQRADIVSEAEIADFVNSSRTGTLATIGPDGQPHLTAWYAVIDGEIWLETKAKSQKAVN ----------------------------1111----------iiii-----1111----- LRRDPRVSFLLEDGDTYDTLRGVSFEGVAEIVEEPEALHRVGVSVWERYTGPYTDEKPVD ---------------3333----------------------------------------- QNKRVGVRIVARRTRSWDHRKLGLPHSVGGSTA -------------------3333------1111 >CALMODULIN; SWP:Q42478; PDB:1RFJA; MADQLTEDQISEFKEAFSLFDKDGDGCITTKELGTVMRSLGQNPTEAELQDMINEVDADG --------------------1111-------------1111---------------1111 NGTIDFPEFLNLMARKMKDTDSEEELKEAFRVFDKDQNGFISAAELRHVMTNLGEKLTDE ---------------------------------1111----------------------- EVDEMIREADVDGDGQINYDEFVKVMMA ---------1111--------------- >FERREDOXIN; SWP:P00248; PDB:1RFKA; 
ATYKVTLINGLNKTIEVPDDQYILDAAEEAGIDLPYSCRAGACSTCAGKLISGTVDQSDQ -----------------1111----------------------1111---------1111 SFLDDDQIEAGYVLTCVAYPTSDCVIETHKEEELY -------------3333--------------1111 >PROBABLE TRNA MODIFICATIO; SWP:P25522; PDB:1RFLA; GSLLREGMKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGTTRDVLREHIHIDGMPLHII -----------------------------------------1111--------------- DTAGLREASDEVERIGIERAWQEIEQADRVLFMVDGTTTDAVDPAEIWPEFIARLPAKLP ---------------------------------------1111----------------- ITVVRNKADITGETLGMSEVNGHALIRLSARTGEGVDVLRNHLKQSMGIHRD ---------3333----------------------3333------------- >L-SULFOLACTATE DEHYDROGEN; SWP:Q58820; PDB:1RFMA; ILKPENEKKLIIDVLKKFGVPEEDAKITADVFVDADLKGFTSHGIGRFPQYITALKLGNI ---------------1111--------------------33333333------------- NPKPDIKIVKESPATAVIDGDLGLGQVVGKKAELAIKKAKNVGVGVVATRNANHFGIAGY -----------1111----%%%%---------------------------------3333 YSELANQDIGITITNTEPAAPFGGKEKILGTNPIAIAFKGNKYKFSLDATASIARGKILE --------------------2222------------------------------111111 ALRKKIKIPEGCAVDKDGKPTTDPAKALEGCILPFGGPKGYGLALAIELSAIGGAEVGTK 11------------------------1111------------------1111----!!!! 
VKGTANPEERCTKGDLFIAINPEFFGKEEFKRKVDELLDEIKNSEPAEGFEILIPGEIEE -----1111-----------3333------------------------------------ RNKKRKDGFEIDKNLYNQLKEICNELGLNIEDYIE ----1111--------------------1111--- >COAGULATION FACTOR IX; SWP:P00740; PDB:1RFNA; VVGGEDAKPGQFPWQVVLNGKVDAFCGGSIVNEKWIVTAAHCVETGVKITVVAGEHNIEE -------22221111----------------1111---1111------------------ TEHTEQKRNVIRIIPHHNYNAA ---------------1111--- >Coagulation factor IX [Pr; SWP:P00740; PDB:1RFNB; MTCNIKNGRCEQFCKNSADNKVVCSCTEGYRLAENQKSCEPAVPFPCGRVSVSQTSK -1111%%%%-----------------2222--3333--------------------- >RIESKE PROTEIN; SWP:P08980; PDB:1RFS; TIAKDALGNDVIAAEWLKTHAPGDRTLTQGLKGDPTYLVVESDKTLATFGINAVCTHLGC ----1111------------2222-----2222-------1111---------------- VVPFNAAENKFICPCHGSQYNNQGRVVRGPAPLSLALAHCDVDDGKVVFVPWTETDFRTG -----1111-----------1111------------------iiii-------------- EAPWWSA --3333- >TRANSCRIPTIONAL REPRESSOR; SWP:Q57471; PDB:1RFYA; KKVELRPLIGLTRGLPPTDLETITIDAIRTHRRLVEKADELFQALPETYKTGQACGGPQH ----3333---2222--------------------------11113333----------- IRYIEASIEMHAQMSALNTLISILGFIPK ----------------------------- >HYPOTHETICAL PROTEIN APC3; SWP:P84133; PDB:1RFZA; AFINNLEQTARRWLEERGVTVEKIAELVYYLQSKYHPDLTEECIENVNRVISKREVQNAI --------------1111-3333---------1111-----------------3333--- LTGIQLDKLAEDGRLDEPLQSIIRRDEGLYGVDEILALSIVNVYGSIGFTNYGYIDKQKP -------3333---------------11113333------33333333------------ GILQYLNDKSTGKCNTFLDDIVGAIAAAASSRLAHRA 33331111------3333------------------- >SECOND SPLICE VARIANT P63; SWP:Q9H3D4; PDB:1RG6A; PTDCSIVSFLARLGCSSCLDYFTTQGLTTIYQIEHYSMDDLASLKIPEQFRHAIWKGILD ----------1111---3333-1111--333311113333------3333---------3 HRQLHEF 333---- >HEPARIN-BINDING GROWTH FA; SWP:P05230; PDB:1RG8A; HFNLPPGNYKKPKLLYCSNGGHFLRILPDGTVDGTRDRSDQHIQLQLSAESVGEVYIKST --------------------------1111------1111----------2222------ ETGQYLAMDTDGLLYGSQTPNEECLFLERLEENHYNTYISKKHAEKNWFVGLKKNGSCKR --------1111--------1111------%%%%-----33331111-----1111---3 GPRTHYGQKAILFLPLPV 
333-22221111------ >S-ADENOSYLMETHIONINE SYNT; SWP:P04384; PDB:1RG9A; AKHLFTSESVSEGHPDKIADQISDAVLDAILEQDPKARVACETYVKTGMVLVGGEITTSA ----------1111---------------------------------------------- WVDIEEITRNTVREIGYVHSDMGFDANSCAVLSAIGKQSPDINQGVDRADPLEQGAGDQG ------------------3333--3333----------3333-------3333------- LMFGYATNETDVLMPAPITYAHRLVQRQAEVRKNGTLPWLRPDAKSQVTFQYDDGKIVGI ---------1111------------------1111-1111------------iiii---- DAVVLSTQHSEEIDQKSLQEAVMEEIIKPILPAEWLTSATKFFINPTGRFVIGGPMGDCG --------------------------3333-3333-3333----3333-----3333--- LTGRKIIVDTYGGMARHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLADRCEIQVSY ----3333--iiii-------22221111------------------------------- AIGVAEPTSIMVETFGTEKVPSEQLTLLVREFFDLRPYGLIQMLDLLHPIYKETAAYGHF 2222---------iiii-----------3333---1111-----------3333------ GREHFPWEKTDKAQLLRDAAGLK -11111111---33331111--- >BUTYRATE RESPONSE FACTOR ; SWP:P47974; PDB:1RGOA; STRYKTELCRPFEESGTCKYGEKCQFAHGFHELRSLTRHPKYKTELCRTFHTIGFCPYGP 1111-------------1111-------3333------1111-------------1111- RCHFIHNADE ---------- >PRESYNAPTIC DENSITY PROTE; SWP:P31016; PDB:1RGRA; EYEEITLERGNSGLGFSIAGGTDNPHIGDDPSIFITKIIPGGAAAQDGRLRVNDSILFVN --------------------3333--------------2222------------------ EVDVREVTHSAAVEALKEAGSIVRLYVMRRKPP --------------------------------- >FERREDOXIN; SWP:O88151; PDB:1RGVA; ALYINDDCTACDACVEECPNEAITPGDPIYVIDPTKCSECVGAFDEPQCRLVCPADCIPD -----------1111-----------------3333-iiii-----3333--1111---- NPDYRETREELQEKYDRLHG 1111---------------- >RESISTIN; SWP:Q99P87; PDB:1RGXA; CPIDEAIDKKIKQDFNSLFPNAIKNIGLNCWTVSSRGKLASCPEGTAVLSCSCGSACGSW -3333-----------------1111----------------2222-------%%%%--- DIREEKVCHCQCARIDWTAARCCKLQVAS --%%%%----------------------- >BETA-LACTAMASE; SWP:P05193; PDB:1RGYA; AKTEQQIADIVNRTITPLMQEQAIPGMAVAIIYEGKPYYFTWGKADIANNHPVTQQTLFE --------------------------------iiii----------1111---1111--- LGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWRGISLLHLATYTAGGLPLQI ----------------1111--11113333-3333-3333-------------------- 
PDDITDKAALLRFYQNWQPQWTPGAKRLYANSSIGLFGALAVKPSGMSYEEAMTRRVLQP 3333-----------------2222----3333--------3333-------------11 LKLAHTWITVPQSEQKNYAWGYREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDAS 11--------33331111----iiii--------3333--------------------11 HVQEKTLQQGIELAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEV 11----------1111----!!!!-------------3333-----3333---------- NPPVPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEAAWRILEKLQ -----------------1111------3333------------3333--------1111- >CLASS C BETA-LACTAMASE; SWP:P05364; PDB:1RGZA; PVSEKQLAEVVANTVTPLMKAQSVPGMAVAVIYQGKPHYYTFGKADIAANKPVTPQTLFE --------------------------------iiii----------1111---1111--- LGSISKTFTGVLGGDAIARGEISLDDPVTRYWPQLTGKQWQGIRMLDLATYTAGGLPLQV ----------------3333--11113333-3333-3333---3333------------- PDEVTDNASLLRFYQNWQPQWKPGTTRLYANASIGLFGALAVKPSGMPYEQAMTTRVLKP 1111-------------------------3333--------3333-------------11 LKLDHTWINVPKAEEAHYAWGYRDGKAVRAVRVSPGMLDAQAYGVKTNVQDMANWVMANM 11--------33331111----iiii---------1111--------------------- APENVADASLKQGIALAQSRYWRIGSMYQGLGWEMLNWPVEANTVVEGSDSKVALAPLPV 3333-----------1111----!!!!-------------3333-----3333------- AEVNPPAPPVKASWVHKTGSTGGFGSYVAFIPEKQIGIVMLANTSYPNPARVEAAYHILE --------------------1111------3333------------3333---------1 ALQ 111 >RIGHT-HANDED COILED COIL ; SWP:NA; PDB:1RH4; AALAQKKEIAYLLAKKAEILAALKKKQEIA 3333-------------------------- >EXCISIONASE; SWP:P03699; PDB:1RH6A; MYLTLQEWNARQRRPRSLETVRRWVRESRIFPPPVKDGREYLFHESAVKVDLNRP --------1111------------1111--------!!!!---1111-------- >RESISTIN-LIKE BETA; SWP:Q99P86; PDB:1RH7A; CSFESLVDQRIKEALSRQEPKTISCTSVTSSGRLASCPAGMVVTGCACGYGCGSWDIRNG ---3333------3333--------------------2222----------------iii NTCHCQCSVMDWASARCCRMA i-------------------- >PICCOLO PROTEIN; SWP:Q9JKS6; PDB:1RH8A; ASHPITGEIQLQINYDLGNLIIHILQARNLVPRDNNGYSDPFVKVYLLPGRGQVMVVQNA ---------------%%%%----------------------------------------- SAEYKRRTKYVQKSLNPEWNQTVIYKSISMEQLMKKTLEVTVWDYDRFSSNDFLGEVLID 
33331111-----------------------3333------------------------3 LSSTSHLDNTPRWYPLKEQTES 3331111--------------- >ENDO-BETA-MANNANASE; SWP:Q8L5J1; PDB:1RH9A; NNFVYTDGTHFALNGKSLYINGFNAYWLMYIAYDPSTRIKVTNTFQQASKYKMNVARTWA ------!!!!--iiii--------1111-----1111-----------1111-------- FSHGGSRPLQSAPGVYNEQMFQGLDFVISEAKKYGIHLIMSLVNNWDAFGGKKQYVEWAV ---3333----2222----------------1111------------------------1 QRGQKLTSDDDFFTNPMVKGFYKNNVKVVLTRVNTITKVAYKDDPTILSWELINEPRCPS 111------3333--------------------------33333333----------111 DLSGKTFQNWVLEMAGYLKSIDSNHLLEIGLEGFYGNDMRQYNPNSYIFGTNFISNNQVQ 1----------------------------------33333333%%%%------3333-33 GIDFTTIHMYPNQWLPGLTQEAQDKWASQWIQVHIDDSKMLKKPLLIAEFGKSTKTPGYT 33-------3333-2222----------------------------------1111---- VAKRDNYFEKIYGTIFNCAKSGGPCGGGLFWQVLGQGMSSFDDGYQVVLQESPSTSRVIL ------------------1111---------------3333------3333--------- LQSLRLSKLS -----1111- >F420-DEPENDENT ALCOHOL DE; SWP:O93734; PDB:1RHCA; MKTQIGYFASLEQYRPMDALEQAIRAEKVGFDSVWVDDHFHPWYHDNAQSAQAWAWMGAA --------------------------------------------2222------------ LQATKKVFISTCITCPIMRYNPAIVAQTFATLRQMYPGRVGVAVGAGEAMNEVPVTGEWP -----------------------------------2222--------33333333----- SVPVRQDMTVEAVKVMRMLWESDKPVTFKGDYFTLDKAFLYTKPDDEVPLYFSGMGPKGA ------------------------------------------------------------ KLAGMYGDHLMTVAAAPSTLKNVTIPKFEEGAREAGKDPSKMEHAMLIWYSVDPDYDKAV --------------------------------1111-3333------------------- EALRFWAGCLVPSMFKYKVYDPKEVQLHANLVHCDTIKENYMCATDAEEMIKEIERFKEA --3333------1111------------11113333---------------------111 GINHFCLGNSSPDVNFGIDIFKEVIPAVRD 1------------------33333333--- >TYROSINE-PROTEIN KINASE R; SWP:Q06418; PDB:1RHFA; GAPVKLTVSQGQPVKLNCSVEGEEPDIQWVKDGAVVQNLDQLYIPVSEQHWIGFLSLKSV --------2222------------------iiii------------2222---------- ERSDAGRYWCQVEDGGETEISQPVWLTVEGVPFFTVEPKDLAVPPNAPFQLSCEAVGPPE 3333----------!!!!-------------------------2222------------- PVTIVWWRGTTKIGGPAPSPSVLNVTGVTQSTFSCEAHNLKGLASSRTATVHLQ 
--------------------------------------1111------------ >GRANULOCYTE COLONY-STIMUL; SWP:P09919; PDB:1RHGA; LPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIPWAPLLAGCLSQ ----------------------------------3333-----1111------------- LHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIWQQMEELGMMPAFASAFQRR --------------iiii3333----------------------1111------------ AGGVLVASHLQSFLEVSYRVLRHLA ------------------------- >FAB X5, LIGHT CHAIN; SWP:NA; PDB:1RHHB; QLLEQSGAEVKKPGSSVQVSCKASGGTFSMYGFNWVRQAPGHGLEWMGGIIPIFGTSNYA -----------2222------------!!!!-------2222-----------------3 QKFRGRVTFTADQATSTAYMELTNL 333---------1111--------- >Genome polyprotein [Fragm; SWP:Q82081; PDB:1RHI1; QTLASVSSGPKHTQSVPALTANETGATLPTRPSDNVETRTTYMHFNGSETDVESFLGRAA ---------------1111-3333------1111------------3333---------- CVHVTEIKNKNAAGLDNHRKEGLFNDWKINLSSLVQLRKKLELFTYVRFDSEYTILATAS -----------2222-3333---------------------------------------- QPEASSYSSNLTVQAMYVPPGAPNPKEWDDYTWQSASNPSVFFKVGETSRFSVPFVGIAS -----------------------------3333----------2222------------- AYNCFYDGYSHDDPDTPYGITVLNHMGSMAFRVVNEHDVHTTIVKIRVYHRAKHVEAWIP ------------------------------------------------------------ RAPRALPYVSIGRTNYPRDSKTIVKKRTNIKTY ----------------1111--------1111- >Genome polyprotein [Fragm; SWP:Q82081; PDB:1RHI2; GYSDRVQQITLGNSTITTQEARNAIVCYAEWPEYLSDNDASDVNKTSKPDISVCRFYTLD --------------------------%%%%-----3333----------1111------- SKTWKATSKGWCWKLPDALKDMGVFGQNMFYHSLGRTGYTIHVQCNATKFHSGCLLVVVI ----1111---------------------------------------------------- PEHQLASHEGGTVSVKYKYTHPGDRGIDLDTVEVAGGPTSDAIYNMDGTLLGNLLIFPHQ ------3333-----3333---3333-1111--2222---3333-----33331111--- FINMRTNNTATIVVPYINSVPIDSMTRHNNVSLMVVPIAPLNAPTGSSPTLPVTVTIAPM --3333-------------------------------------2222------------- CTEFTGIRSRSIVPQ --------------- >Genome polyprotein [Fragm; SWP:Q82081; PDB:1RHI3; GLPTTTLPGSGQFLTTDDRQSPSALPSYEPTPRIHIPGKVRNLLEIIQVGTLIPMNNTGT ------2222---1111-------2222-------------11111111----------- 
NDNVTNYLIPLHADRQNEQIFGTKLYIGDGVFKTTLLGEIAQYYTHWSGSLRISLMYTGP ---3333------------------11111111--------------------------1 ALSSAKIILAYTPPGTRGPEDKKEAMLGTHVVWDIGLQSTIVMTIPWTSGVQFRYTDPDT 111-----------------3333----------------------------------33 YTSAGYLSCWYLTSLILPPQTSGQVYLLSFISACPDFKLRLMKDTQTISQTDALTE 33---------------2222------------3333------------------- >SULFUR-SUBSTITUTED RHODAN; SWP:P00586; PDB:1RHS; VHQVLYRALVSTKWLAESVRAGKVGPGLRVLDASWYSPGTREARKEYLERHVPGASFFDI ------------------3333--1111--------2222-----------2222---33 EECRDKASPYEVMLPSEAGFADYVGSLGISNDTHVVVYDGDDLGSFYAPRVWWMFRVFGH 33--1111---------------------1111-------3333--3333-----1111- RTVSVLNGGFRNWLKEGHPVTSEPSRPEPAIFKATLNRSLLKTYEQVLENLESKRFQLVD -----2222----1111-------------------3333-------------------- SRAQGRYLGTQPEPDAVGLDSGHIRGSVNMPFMNFLTEDGFEKSPEELRAMFEAKKVDLT -----------------------2222---3333--1111------------1111-111 KPLIATRKGVTACHIALAAYLCGKPDVAIYDGSWFEWFHRAPPETWVSQGKG 1------------------1111------3333--------3333------- >CONSERVED HYPOTHETICAL PR; SWP:Q9X074; PDB:1RHXA; MALVLVKYGTDHPVEKLKIRSAKAEDKIVLIQNGVFWALEELETPAKVYAIKDDFLARGY ---------------3333-------------3333------------------------ SEEDSKVPLITYSEFIDLLEGEEKFIG 1111--------------%%%%----- >IMIDAZOLE GLYCEROL PHOSPH; SWP:P40919; PDB:1RHYA; SERIASVERTTSETHISCTIDLDHIPGVTEQKINVSTGIGFLDHMFTALAKHGGMSLQLQ ----------1111------------------------------------1111------ CKGDLTAEDCALALGEAFKKALGERKGIKRYGYAYAPLDESLSRAVIDISSRPYFMCHLP -----3333------------!!!!------------!!!!------------------- FTREKVGDLSTEMVSHLLQSFAFAAGVTLHIDSIRGENNHHIAESAFKALALAIRMAISR -----!!!!3333----------------------------------------------- >HEPATOMA-DERIVED GROWTH F; SWP:P51858; PDB:1RI0A; MSRSNRQKEYKCGDLVFAKMKGYPHWPARIDEMPEAAVKSTANKYQVFFFGTHETAFLGP ----------2222-----2222--------------------------3333-----33 KDLFPYEESKEKFGKPNKRKGFSEGLWEIENNPTVKASGY 33--3333----------1111------------------ >MRNA CAPPING ENZYME; SWP:Q8SR66; PDB:1RI5A; 
SKTINIRNANNFIKACLIRLYTKRGDSVLDLGCGKGGDLLKYERAGIGEYYGVDIAEVSI ----------------------2222------!!!!-3333--------------3333- NDARVRARNMKRRFKVFFRAQDSYGRHMDLGKEFDVISSQFSFHYAFSTSESLDIAQRNI -------------------------------------------1111------------- ARHLRPGGYFIMTVPSRDVILERYKQGRMSNDFYKIELEKMEDVPMESVREYRFTLLDSV 11112222------------------------------------1111-------2222- NNCIEYFVDFTRMVDGFKRLGLSLVERKGFIDFYEDEGRRNPELSKKMGLGCLTREESEV -----------------------------------------3333--------3333--- VGIYEVVVFRKL ------------ >PUTATIVE ISOMERASE YBHE; SWP:NA; PDB:1RI6A; SLKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFR ----------1111-------1111-------------------1111------------ VLAYRIAPDDGALTFAAESALPGSLTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGV ------------------------------1111-------3333-------iiii---- VDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPR ------2222-----1111------3333--------1111------------2222--- HMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPD ----1111-------------------1111-----------1111-----------111 GRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHI 1------------------1111--------------------1111------------- SVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAHE ------3333----------------------- >CAMELID ANTIBODY HEAVY CH; SWP:P00698; PDB:1RI8A; VQLVESGGGSVQAGGSLRLSCAVSGYKDRNYCMGWFRRAPGKEREGVAVIDSSGRTAYAD -----------2222-----------------------2222--------1111----33 SVKGRFTISRDVALDTAYLQMNSLKPEDTAMYYCAAGWSSLGSCGTNRNRYNYWGQGTQV 33--------3333----------3333-----------iiii---3333---------- TVSS ---- >FYN-BINDING PROTEIN; SWP:O15117; PDB:1RI9A; EKEEKDFRKKFKYDGEIRVLYSTKVTTSITSKKWGTRDLQVKPGESLEVIQTTDDTKVLC 3333---------------------1111-----1111---2222--------------- RNEEGKYGYVLRSYLAD -3333-----3333--- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ---- 
>RIESKE IRON-SULFUR PROTEI; SWP:P07588; PDB:1RIE; AMSKIEIKLSDIPEGKNMAFKWRGKPLFVRHRTKKEIDQEAAVEVSQLRDPQHDLERVKK -------3333-2222-----iiii--------------11113333-----1111---1 PEWVILIGVCTHLGCVPIANAGDFGGYYCPCHGSHYDASGRIRKGPAPLNLEVPSYEFTS 111---------------------------------1111-------------------1 DDMVIVG 111---- >DNA HELICASE UVSW; SWP:P20703; PDB:1RIFA; MDIKVHFHDFSHVRIDCEESTFHELRDFFSFEADGYRFNPRFRYGNWDGRIRLLDYNRLL --------------------------------2222------------------1111-- PFGLVGQIKKFCDNFGYKAWIDPQINEKEELSRKDFDEWLSKLEIYSGNKRIEPHWYQKD 33333333----1111------3333-------------1111----------------- AVFEGLVNRRRILNLPTSAGRSLIQALLARYYLENYEGKILIIVPTTALTTQMADDFVDY ---------------1111--------------------------3333----------- RLFSHAMIKKIGGGASKDDKYKNDAPVVVGTWQTVVKQPKEWFSQFGMMMNDECHLATGK ---3333-----------------------3333111133331111------1111---- SISSIISGLNNCMFKFGLSGSLRDGKANIMQYVGMFGEIFKP ----33331111----------1111-3333----------- >LIGHT CHAIN OF ANTIBODY 1; SWP:NA; PDB:1RIHH; QVQLQQSGNELAKPGASMKMSCRASGYSFTSYWIHWLKQRPDQGLEWIGYIDPATAYTES ---------------------------1111--------1111----------------- NQKFKDKAILTADRSSNTAFMYLNSL 3333---------1111--------- >2,3-bisphosphoglycerate-d; SWP:Q11140; PDB:1RIIA; ANTGSLVLLRHGESDWNALNLFTGWVDVGLTDKGQAEAVRSGELIAEHDLLPDVLYTSLL ----------------------!!!!-------------------1111----------3 RRAITTAHLALDSADRLWIPVRRSWRLNERHYGALQGLDKAETKARYGEEQFMAWRRSYD 333------------1111----3333----!!!!----------------------111 TPPPPIERGSQFSQDADPRYADIGGGPLTECLADVVARFLPYFTDVIVGDLRVGKTVLIV 1-----2222---11111111-iiii---------------------------------- AHGNSLRALVKHLDQMSDDEIVGLNIPTGIPLRYDLDSAMRPLVRGGTYLDPEAAAAGAA -----------1111-33331111------------1111---2222------------- AVA --- >E6APC1 PEPTIDE; SWP:NA; PDB:1RIKA; YKFACPECPKRFMRSDHLTLHILLHENKK --------------3333---3333---- >RIBONUCLEASE H; SWP:P29253; PDB:1RIL; RKRVALFTDGACLGNPGPGGWAALLRFHAHEKLLSGGEACTTNNRMELKAAIEGLKALKE ------------------------------------------------------------ 
PCEVDLYTDSHYLKKAFTEGWLEGWRKRGWRTAEGKPVKNRDLWEALLLAMAPHRVRFHF -------------------------1111--3333------------------------- VKGHTGHPENERVDREARRQAQSQAKT -!!!!---------------------- >E6APC2 PEPTIDE; SWP:NA; PDB:1RIMA; YKFACPECPKRFMRSDHLSKHITLHELLGEERR -------------3333------3333------ >RIBOSOMAL PROTEIN S17; SWP:P23828; PDB:1RIP; QRKVYVGRVVSDKMDKTITVLVETYKKHPLYGKRVKYSKKYKAHDEHNEAKVGDIVKIME ----------------------------------------------iiii---------- TRPLSATKRFRLVEIVEKAVR --------------------- >CARBONIC ANHYDRASE XIV; SWP:Q9WVT6; PDB:1RJ5A; HHWTYEGPHGQDHWPTSYPECGGDAQSPINIQTDSVIFDPDLPAVQPH ------111111111111-1111--------3333---1111------ >ECTODYSPLASIN-A ISOFORM E; SWP:Q92838; PDB:1RJ8A; PAVVHLQGQGSAIQVKNDLSGGVLNDWSRITMNPKVFKLHPRSGELEVLVDGTYFIYSQV -------------3333-%%%%-------------------------------------- YYINFTDFASYEVVVDEKPFLQCTRSIETGKTNYNTCYTAGVCLLKARQKIAVKMVHADI --------------iiii---------------------------2222----------- SINMSKHTTFFGAIRLGEAP ----1111------------ >SIGNAL RECOGNITION PROTEI; SWP:P83749; PDB:1RJ9A; NLEEVLEELEMALLAADVGLSATEEILQEVRASGRKDLKEAVKEKLVGMLEPDERRATLR -------------1111------------------------------------------- KLGFNPQKPKPVEPKGRVVLVVGVNGVGKTTTIAKLGRYYQNLGKKVMFCAGDTFRAAGG -----------------------2222-------------1111-----------2222- TQLSEWGKRLSIPVIQGPEGTDSAALAYDAVQAMKARGYDLLFVDTAGRLHTKHNLMEEL -----------------2222----------------------------1111------- KKVKRAIAKADPEEPKEVWLVLDAVTGQNGLEQAKKFHEAVGLTGVIVTKLDGTAKGGVL ----------1111--------1111-----------------------1111------- IPIVRTLKVPIKFVGVGEGPDDLQPFDPEAFVEALLE ------------------1111--------------- >TYROSINE-PROTEIN KINASE 6; SWP:Q13882; PDB:1RJAA; SEPWFFGCISRSEAVRRLQAEGNATGAFLIRVSEKPSADYVLSVRDTQAVRHYKIWRRAG ---------3333----------------------------------------------- GRLHLNEAVSFLSLPELVNYHRAQSLSHGLRLAAPCRKHE ---------------------------------------- >FL CYTOKINE RECEPTOR; SWP:P36888; PDB:1RJBA; YESQLQMVQVTGSSDNEYFYVDFREYEYDLKWEFPRENLEFGKVLGSGAFGKVMNATAYG 
-------------1111-----1111--3333--3333---------1111--------- ISKTGVSIQVAVKMLKEREALMSELKMMTQLGSHENIVNLLGACTLSGPIYLIFEYCCYG ---------------------------------1111-------------------1111 DLLNYLRSKREKFLTFEDLLCFAYQVAKGMEFLEFKSCVHRDLAARNVLVTHGKVVKICD ------1111---------------------------------3333------------- FGLARDIMSDSNYVVRGNARLPVKWMAPESLFEGIYTIKSDVWSYGILLWEIFSLGVNPY 3333-33331111--!!!!--3333-3333------3333-------------------2 PGIPVDANFYKLIQNGFKMDQPFYATEEIYIIMQSCWAFDSRKRPSFPNLTSFLGCQL 222--3333---1111-----1111---------1111-3333--------------- >CAMELID HEAVY CHAIN ANTIB; SWP:P00698; PDB:1RJCA; VQLQASGGGSVQAGQSLRLSCATSATSSSNCMGWFRQAPGKEREGVAVIDTGRGNTAYAD 2222-------2222----------------------2222-----------------33 SVQGRLTISLDNNTLYLQMNSLKPEDTAMYYCAADTSTWYRGYCGTNPNYFSYWGQGTQV 33--------------------3333---------1111-------1111---------- TVS --- >carboxy methyl transferas; SWP:Q04081; PDB:1RJDA; ERIIQQTDYDALSCKLAAISVGYLPSSGLQRLSVDLSKKYTEWHRSYLITLKKFSRRAFG 3333---------------------3333------------------------------- KVDKAMRSSFPVMNYGTYLRTVGIDAAILEFLVANEKVQVVNLGCGSDLRMLPLLQMFPH -----1111-----------------------------------!!!!---------111 LAYVDIDYNESVELKNSILRESEILRISLGLSKEDTAKSPFLIDQGRYKLAACDLNDITE 1----------------------------------------------------1111--- TTRLLDVCTKREIPTIVISECLLCYMHNNESQLLINTIMSKFSHGLWISYDPIGGSQPND ----1111-1111--------3333-------------3333------------------ RFGAIMQSNLKESRNLEMPTLMTYNSKEKYASRWSAAPNVIVNDMWEIFNAQIPESERKR -----------------1111----------1111------------------3333--- LRSLQFLDELEELKVMQTHYILMKAQWH 3333-----------1111--------- >POTASSIUM CHANNEL TOXIN K; SWP:NA; PDB:1RJIA; TPYPVNCKTDRDCVMCGLGISCKNGYCQGCT --------11113333--------------- >EXPRESSED PROTEIN; SWP:Q9FK81; PDB:1RJJA; MATSGFKHLVVVKFKEDTKVDEILKGLENLVSQIDTVKSFEWGEDKESHDMLRQGFTHAF ------------------3333-----------1111-----------3333-------- SMTFENKDGYVAFTSHPLHVEFSAAFTAVIDKIVLLDFPVAAVKSSVVATP -----------3333-----------1111--------------------- >Outer surface protein B [; SWP:P17739; PDB:1RJLB; 
QVQLQQPGSVLVRPGASVKLSCKASGFTFTSSWMHWAKQRPGQGLEWIGEIHPNSGNTHY ------------2222-----------3333--------2222--------3333----- NEKFKGKATLTVDTSSSTAYVDLSSLTSEDSAVYYCARMRYGDYYAMDNWGQGTSVTVSS 1111---------1111---------3333----------!!!!---------------- AKTTAPPVYPLAPVCGDTTGSSVTLGCLVKGYFPESVTLLWNSGSLSSGVHTFPAVLQSD ----------------------------------------%%%%-------------%%% LYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRG %---------1111------------1111----------- >SMALL INDUCIBLE CYTOKINE ; SWP:O14625; PDB:1RJTA; FPMFKRGRCLCIGPGVKAVKVADIEKASIMYPSNNCDKIEVIITLKENKGQRCLNPKSKQ -------------------------------%%%%------------------------- ARLIIKKVERKNF --33331111--- >PARVALBUMIN ALPHA; SWP:P20472; PDB:1RJVA; MSMTDLLNAEDIKKAVGAFSATDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGFIE ----------------1111---------------11113333---3333-3333----3 EDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDEFSTLVAES 333----3333-------3333----------------------1111-- >ALCOHOL DEHYDROGENASE; SWP:P42328; PDB:1RJWA; MKAAVVEQFKEPLKIKEVEKPTISYGEVLVRIKACGVCHTDLHAAHGDWPVKPKLPLIPG -------2222------------2222----------3333------------------- HEGVGIVEEVGPGVTHLKVGDRVGIPWLYSACGHCDYCLSGQETLCEHQKNAGYSVDGGY ----------2222---2222-------------3333---33331111-2222------ AEYCRAAADYVVKIPDNLSFEEAAPIFCAGVTTYKALKVTGAKPGEWVAIYGIGGLGHVA ------3333----11113333-3333----------3333-2222-------3333--- VQYAKAMGLNVVAVDIGDEKLELAKELGADLVVNPLKEDAAKFMKEKVGGVHAAVVTAVS ------------------------1111-----3333----------------------- KPAFQSAYNSIRRGGACVLVGLPPEEMPIPIFDTVLNGIKIIGSIVGTRKDLQEALQFAA -----------2222-------------------------------------------11 EGKVKTIIEVQPLEKINEVFDRMLKGQINGRVVLTLEDK 11---------3333-------1111------------- >D-AMINOACYLASE; SWP:Q9AGH8; PDB:1RK6A; PFDYILSGGTVIDGTNAPGRLADVGVRGDRIAAVGDLSASSARRRIDVAGKVVSPGFIDS --------------------------!!!!------1111-------2222--------- HTHDDNYLLKHRDMTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSFRFARF ---3333---3333--3333--------iiii--------------1111--------33 SDYLEALRAAPPAVNAACMVGHSTLRAAVMPDLRREATADEIQAMQALADDALASGAIGI 
33--------------------------------------------------1111---- STGAFYPPAAHASTEEIIEVCRPLITHGGVYATHMRDEGEHIVQALEETFRIGRELDVPV --3333--1111----------3333------------1111------------------ VISHHKVMGKLNFGRSKETLALIEAAMASQDVSLDAYPYVAGSTMLKQDRVLLAGRTLIT --------1111------------------------------------3333-------- WCKPYPELSGRDLEEIAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSD -33331111--------1111-3333-3333----------------------------- GLPHDERPHPRLWGTFPRVLGHYSRDLGLFPLETAVWKMTGLTAAKFGLAERGQVQPGYY -1111---3333------------------------------------2222---2222- ADLVVFDPATVADSATFEHPTERAAGIHSVYVNGAAVWEDQSFTGQHAGRVLNR ---------------3333------------iiii---%%%%------------ >Protein within the bgcn g; SWP:P82804; PDB:1RK8C; TYLQSSEGKFIPATKRPDGTWRKARRVKDGYVP ----1111-------1111--------2222-- >PROTEIN AD-004; SWP:Q9Y3D8; PDB:1RKBA; LMLLPNILLTGTPGVGKTTLGKELASKSGLKYINVGDLAREEQLYDGYDEEYDCPILDED -----------2222--------------------------------------------- RVVDELDNQMREGGVIVDYHGCDFFPERWFHIVFVLRTDTNVLYERLETRGYNEKKLTDN ---------1111--------11113333------------------------------- IQCEIFQVLYEEATASYKEEIVHQLPSNKPEELENNVDQILKWIEQWIKDHNS -----------------3333-------3333----------------1111- >RIBOKINASE; SWP:P05054; PDB:1RKD; AGSLVVLGSINADHILNLQSFPTPGETVTGNHYQVAFGGKGANQAVAAGRSGANIAFIAC ----------------------2222----------------------1111-------- TGDDSIGESVRQQLATDNIDITPVSVIKGESTGVALIFVNGEGENVIGIHAGANAALSPA ---3333-------1111--1111--2222---------1111-------!!!!------ LVEAQRERIANASALLMQLESPLESVMAAAKIAHQNKTIVALNPAPARELPDELLALVDI ---------------------3333--------1111-------------33331111-- ITPNETEAEKLTGIRVENDEDAAKAAQVLHEKGIRTVLITLGSRGVWASVNGEGQRVPGF -----------------------------1111-------!!!!---------------- RVQAVDTIAAGDTFNGALITALLEEKPLPEAIRFAHAAAAIAVTRKGAQPSVPWREEIDA ---------------------1111----------------1111--3333--------- FLDRQR -1111- >HYPOTHETICAL PROTEIN; SWP:Q8ZYK2; PDB:1RKIA; MKKHIIIKTIPKKEEIISRDLCDCIYYYDNSVICKPIGPSKVYVSTSLENLEKCLQLHYF ---------2222-----------33331111-----2222----------------333 
KKLVKNIEIFDEVHNSKPNCDKCLIVEIGGVYFVRRVNGVP 3-----------------------------------2222- >HYPOTHETICAL PROTEIN YIDA; SWP:P09997; PDB:1RKQA; SLAIKLIAIDMDGTLLLPDHTISPAVKNAIAAARARGVNVVLTTGRPYAGVHNYLKELHM ----------2222--1111-------------1111---------3333-----1111- EQPGDYCITYNGALVQKAADGSTVAQTALSYDDYRFLEKLSREVGSHFHALDRTTLYTAN -1111---%%%%------------------------------------------------ RDISYYTVHESFVATIPLVFCEAEKMDPNTQFLKVMMIDEPAILDQAIARIPQEVKEKYT ---1111----1111------1111-1111---------3333----11113333----- VLKSAPYFLEILDKRVNKGTGVKSLADVLGIKPEEIMAIGDQENDIAMIEYAGVGVAVDN ----1111----1111---------------3333------1111------------111 AIPSVKEVANFVTKSNLEDGVAFAIEKYVLN 1----1111-----3333------------- >AZURIN-I; SWP:P56547; PDB:1RKRA; AECSVDIAGNDGMQFDKKEITVSKSCKQFTVNLKHPGKLAKNVMGHNWVLTKQADMQGAV ---------1111---------3333-------------3333--------3333----- NDGMAAGLDNNYVKKDDARVIAHTKVIGGGETDSVTFDVSKLAAGEDYAYFCSFPGHFAL ------1111------3333-------2222------3333-2222-------2222--- MKGVLKLVD --------- >PROTEIN YFIR; SWP:O31560; PDB:1RKTA; SPKVTKEHKDKRQAEILEAAKTVFKRKGFELTTKDVVEESGFSRGGVYLYFSSTEEFRRI -%%%%-----------------------1111----------33333333--3333---- IETGLDEGLRKLDKSAEHQSVWASISSYLDELTEGLRDVADTLAPVQFEYLVTAWRNEER -------------------------------------3333------------------- RQYLEKRYDLFVERFSRLLQKGIDQGEFQPVQPLATIAKFFLNNDGIIQNALYFDEEKAD ----------------------1111----------------------------3333-- VSGLAESAKLYLKTVLQADEK --------------------- >HOMOSERINE KINASE; SWP:NA; PDB:1RKUA; DMEIACLDLEGVLVPEIWIAFAEKTGIDALKATTRDIPDYDVLMKQRLRILDEHGLKLGD --------2222----------33333333--3333---------------1111----- IQEVIATLKPLEGAVEFVDWLRERFQVVILSDTFYEFSQPLMRQLGFPTLLCHKLEIDDS ----1111--2222-------------------33333333-1111-----------111 DRVVGYQLRQKDPKRQSVIAFKSLYYRVIAAGDSYNDTTMLSEAHAGILFHAPENVIREF 1--------------------1111--------3333----------------------3 PQFPAVHTYEDLKREFLKASSRSLSL 333----------------------- >CDP-GLUCOSE-4,6-DEHYDRATA; SWP:Q57329; PDB:1RKXA; 
INNSFWQGKRVFVTGHTGFKGGWLSLWLQTMGATVKGYSLTAPTVPSLFETARVADGMQS -33332222-----1111-------------------------------1111------- EIGDIRDQNKLLESIREFQPEIVFHMAAQPLVRLSYSEPVETYSTNVMGTVYLLEAIRHV ---1111-----------------------3333-------------------------- GGVKAVVNITSDKCYDNKEWIWGYRENEAMGGYDPYSNSKGCAELVTSSYRNSFFNPANY ----------1111----------1111---------------------------3333- GQHGTAVATVRAGNVIGGGDWALDRIVPDILRAFEQSQPVIIRNPHAIRPWQHVLEPLSG ---------------------------------1111------1111-----3333---- YLLLAQKLYTDGAEYAEGWNFGPNDADATPVKNIVEQMVKYWGEGASWQLEAHYLKLDCS -----------3333--------3333---------------2222-------------- KAKMQLGWHPRWNLNTTLEYIVGWHKNWLSGTDMHEYSITEINNYMNTK --------------------------------3333------------- >ANTIVIRAL PROTEIN DAP-30; SWP:P24476; PDB:1RL0A; ATAYTLNLANPSASQYSSFLDQIRNNVRDTSLIYGGTDVAVIGAPSTTDKFLRLNFQGPR ----------------------------1111-%%%%--------------------111 GTVSLGLRRENLYVVAYLAMDNANVNRAYYFKNQITSAELTALFPEVVVANQKQLEYGED 1-------------------1111------1111---------11113333--------- YQAIEKNAKITTGDQSRKELGLGINLLITMIDGVNKKVRVVKDEARFLLIAIQMTAEAAR -----1111--!!!!3333-----------1111-------------------------- FRYIQNLVTKNFPNKFDSENKVIQFQVSWSKISTAIFGDCKNGVFNKDYDFGFGKVRQAK ------------------3333------------------iiii-------------333 DLQMGLLKYLGRPKS 3-------------- >SUPPRESSOR OF G2 ALLELE O; SWP:Q9Y2Z0; PDB:1RL1A; SKIKYDWYQTESQVVITLMIKNVQKNDVNVEFSEKELSALVKLPSGEDYNLKLELLHPII -----------------------3333--------------------------------3 PEQSTFKVLSTKIEIKLKKPEAVRWEKLEGQG 333----------------------------- >RIBOSOMAL PROTEIN L2; SWP:P04257; PDB:1RL2A; QYRIIDFKRDKDGIPGRVATIEYDPNRSANIALINYADGEKRYIIAPKNLKVGEISGPDA ----------2222---------3333--------1111-------2222------1111 DIKIGNALPLENIPVGTLVHNIELKPGRGGQLVRAAGTSAQVLGKEGKYVIVRLASGEVR --2222--3333-2222----------------------------!!!!----3333--- ILGKCRATVGEVG -3333-------- >FORMYLMETHIONINE DEFORMYL; SWP:Q8I372; PDB:1RL4A; KIVKYPDPILRRRSEEVTNFDDNLKRVVRKMFDIMYESKGIGLSAPQVNISKRIIVWNRI 
------3333---------------------------------3333------------- FINPSIVEQSLVKLKLIEGCLSFPGIEGKVERPSIVSISYYDINGYKHLKILKGIHSRIF -------------------1111------------------1111--------------- QHEFDHLNGTLFIDKMTQVDKKKVRPKLNELIRD -----1111-3333-------------------- >CYTOTOXIN 1; SWP:P01451; PDB:1RL5A; LKCNKLVPIAYKTCPEGKNLCYKMFMMSDLTIPVKRGCIDVCPKNSLLVKYVCCNTDRCN --------------2222----------1111---------------------------- >RIBOSOMAL PROTEIN L6; SWP:P02391; PDB:1RL6A; PIEIPAGVTVTVNGNTVTVKGPKGELTRTFHPDMTITVEGNVITVTRPSDEKHHRALHGT ----2222----!!!!----1111------3333----!!!!------------------ TRSLLANMVEGVSKGYEKALELVGVGYRASKQGKKLVLSVGYSHPVEIEPEEGLEIEVPS -----------------------2222----!!!!---------------2222------ QTKIIVKGADKQRVGELAANIRAVRPPEPYKGKGIRYEGELVRL -------------------------------------------- >RLF; SWP:Q61193; PDB:1RLF; GSSDCRIIRVQMELGEDGSVYKSILVTSQDKAPSVISRVLKKNNRDSAVASEFELVQLLP --------------1111--------1111-1111-----1111-------------333 GDRELTIPHSANVFYAMDGASHDFLLRQRR 3----3333---3333-------------- >50S RIBOSOMAL PROTEIN L7A; SWP:O29494; PDB:1RLGA; VPEDMQNEALSLLEKVRESGKVKKGTNETTKAVERGLAKLVYIAEDVDPPEIVAHLPLLC --------------3333--------------1111-------------1111------- EEKNVPYIYVKSKNDLGRAVGIEVPCASAAIINEGELRKELGSLVEKIKGLQK ---------------------------------!!!!3333------1111-- >CONSERVED HYPOTHETICAL PR; SWP:Q9HII6_THEAC; PDB:1RLHA; HMVIPAEANIIVGYSHFIKTVEDLNEIIRTHVPGSKYGIGFSEASGDRLIRYDGNDDDLV ----2222-------------------33332222---------!!!!------------ KACIENIRRISAGHTFVILIRNAYPINILNAVKMCQEVGSIFAATANPLQIYGERGNGVL -----------2222--------------------------------------------- GVIDGYSPVGVES ------------- >TRP REPRESSOR BINDING PRO; SWP:P96726_BACSU; PDB:1RLIA; KIAVINGGTRSGGNTDVLAEKAVQGFDAEHIYLDYDSIIERILQCHILIFATPIYWFGMS ----------------------2222---------------1111---------%%%%-- GTLKLFIDRWSQTLRDPRFPDFKQQMSVKQAYVIAVGGDNPKIKGLPLIQQFEHIFHFMG ---------------3333--------------------3333----------------- MSFKGYVLGEGNRPGDILRDHQALSAASRLLKRSDA ------------2222------------1111---- >NRDI 
PROTEIN; SWP:P50618; PDB:1RLJA; MVQIIFDSKTGNVQRFVNKTGFQQIRKVDEMDHVDTPFVLVTYTTNFGQVPASTQSFLEK ----------------1111------1111---------------%%%%----------- YAHLLLGVAASGNKVWGDNFAKSADTISRQYQVPILHKFELSGTSKDVELFTQEVERVVT 3333--------33331111-------------------!!!!----------------- KSSAKM --3333 >HYPOTHETICAL PROTEIN TA01; SWP:Q9HLW6; PDB:1RLKA; VKKMVIAVRKDLDMGKGKIAAQVAHAAVTCAIRSMKINRDVFNEWYDEGQRKIVVKVNDL ---------------------------------------------1111----------- DEIMEIKRMADSMGIVNEIVQDRGYTQVEPGTITCIGLGPDEEEKLDKITGKYKLL -------------------------3333--------------------3333--- >PHOSPHATASE; SWP:P75792; PDB:1RLMA; AVKVIVTDMDGTFLNDAKTYNQPRFMAQYQELKKRGIKFVVASGNQYYQLISFFPELKDE --------2222--1111--------------1111---------3333----1111--- ISFVAENGALVYEHGKQLFHGELTRHESRIVIGELLKDKQLNFVACGLQSAYVSENAPEA ----%%%%----iiii------------------------------3333---1111--- FVALMAKHYHRLKPVKDYQEIDDVLFKFSLNLPDEQIPLVIDKLHVALDGIMKPVTSGFG ----------------1111------------3333---------1111--------222 FIDLIIPGLHKANGISRLLKRWDLSPQNVVAIGDSGNDAEMLKMARYSFAMGNAAENIKQ 2----2222---------------3333------1111------------1111------ IARYATDDNNHEGALNVIQAVLDNTYPFN -------1111------------------ >RIBONUCLEOTIDE REDUCTASE ; SWP:P00452; PDB:1RLR; RDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADL --------3333--------2222-----------3333--------------------- ISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFK -3333-----------------------------------1111--3333----1111-- QMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETR --3333---1111--------------------------------------11113333- LQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVS ------------------3333-------------------------------------- QRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMW ---------------------------------------------------------111 HLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFF 1-33331111-----3333----------------------------111111113333- 
ADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPA ------------------------3333-------------------------------- IAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELDELAILAVRAL -------1111----------1111-------------1111--1111------------ DALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQY -3333-----3333---------------------1111-----1111------------ YLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTH -----------------33333333---3333----3333-------------------- GLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELL -----------------1111---------------------------33331111--11 WEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTL 11---3333---------------------33332222--------------1111---- YYQNTRDDIDDLSNFQL ----------1111--- >PHOSPHOLIPASE A2; SWP:P47712; PDB:1RLW; SSHKFTVVVLRATKVTKGAFGDMLDTPDPYVELFISTTPDSRKRTRHFNNDINPVWNETF ----------------------------------1111---------------------- EFILDPNQENVLEITLMDANYVMDETLGTATFTVSSMKVGEKKEVPFIFNQVTEMVLEMS ----1111------------------------3333-2222------------------- LEVASS ------ >TRANSCRIPTION INITIATION ; SWP:Q00403; PDB:1RLYA; ASTSRLDALPRVTCPNHPDAILVEDYRAGDMICPECGLVVGDRVIDVGSEWRTFSNDK -------------3333----------------------------------------- >DEOXYHYPUSINE SYNTHASE; SWP:P49366; PDB:1RLZA; APAGALAAVLKHSSTLPPESTQVRGYDFNRGVNYRALLEAFGTTGFQATNFGRAVQQVNA ----------------3333------1111--------3333------------------ MIEKKLEPLTSCTIFLGYTSNLISSGIRETIRYLVQHNMVDVLVTTAGGVEEDLIKCLAP ---1111------------3333-----------1111-------3333-----1111-- TYLGEFSLRGKELRENGINRIGNLLVPNENYCKFEDWLMPILDQMVMEQNTEGVKWTPSK ----3333------------!!!!------------------------------------ MIARLGKEINNPESVYYWAQKNHIPVFSPALTDGSLGDMIFFHSYKNPGLVLDIVEDLRL ----------1111---------------1111--------------------3333--- INTQAIFAKCTGMIILGGGVVKHHIANANLMRNGADYAVYINTAQEFDGSDSGARPDEAV ----1111-------------------3333--------------11113333-3333-1 SWGKIRVDAQPVKVYADASLVFPLLVAETFAQKMDAFMHEKNED 111--1111-------3333--------3333------1111-- 
>4-HYDROXYBENZOYL-COA REDU; SWP:O33819; PDB:1RM6A; GTVGVRTPLVDGVEKVTGKAKYTADIAAPDALVGRILRSPHAHARILAIDTSAAEALEGV -2222---1111---------1111--1111-------------------------2222 IAVCTGAETPVPFGVLPIAENEYPLARDKVRYRGDPVAAVAAIDEVTAEKALALIKVDYE ----3333-------1111--------------------------------1111----- VLPAYMTPKAAMKAGAIALHDDKPNNILREVHAEFGDVAAAFAEADLIREKTYTFAEVNH ------3333--2222---1111------------------1111--------------- VHMELNATLAEYDPVRDMLTLNTTTQVPYYVHLKVAACLQMDSARIRVIKPFLGGGFGAR -----------------------------------------1111----------iiii- TEALHFEIIAGLLARKAKGTVRLLQTREETFIAHRGRPWTEVKMKIGLKKDGKIAALALE ---3333-----------------------------------------1111-------- ATQAGGAYAGYGIITILYTGALMHGLYHIPAIKHDAWRVYTNTPPCGAMRGHGTVDTRAA --------!!!!-------1111------------------------------------- FEALLTEMGEELGIDSLKIRQINMLPQIPYVTMYAQRVMSYGVPECLEKVKAASGWEERK ---------1111-3333--1111-------1111-----------------------22 GKLPKGRGLGIALSHFVSGTSTPKHWTGEPHATVNLKLDFDGGITLLTGAADIGQGSNTM 22-2222-------------------------------3333-------------3333- ASQVAAEVLGVRLSRIRVISADSALTPKDNGSYSSRVTFMVGNASISAAEELKGVLVKAA -----------3333------1111-------%%%%3333-------------------- AKKLDAREEDIEVIDEMFMVSGSQDPGLSFQEVVKAAMVDSGTITVKGTYTCPTEFQGDK ------3333---%%%%--2222--------------2222-----------3333--11 KIRGSAIGATMGFCYAAQVVEASVDEITGKVTAHKVWVAVDVGKALNPLAVEGQTQGGVW 11-3333----------------------------------------------------- MGMGQALSEETVYDNGRMVHGNILDYRVPTIVESPDIEVIIVESMDPNGPFGAKEASEGM -------------iiii------------3333------------1111iiii----111 LAGFLPAIHEAVYEAVGVRATDFPLSPDRITELLDAKEAAA 1-----------------------------------3333- >4-hydroxybenzoyl-CoA redu; SWP:O33820; PDB:1RM6B; MNILTDFRTHRPATLADAVNALAAEATLPLGAGTDLLPNLRRGLGHPAALVDLTGIDGLA ---------------------------------------1111---------1111-111 TISTLADGSLRIGAGATLEAIAEHDAIRTTWPALAQAAESVAGPTHRAAATLGGNLCQDT 1---1111-----------------3333--------1111-3333-------------- RCTFYNQSEWWRSGNGYCLKYKGDKCHVIVKSDRCYATYHGDVAPALMVLDARAEIVGPA 
-1111-------1111-3333-------3333---------------1111------111 GKRTVPVAQLFRESGAEHLTLEKGELLAAIEVPPTGAWSAAYSKVRIRDAVDFPLAGVAA 1----3333----1111----2222---------!!!!---------------------- ALQRDGDRIAGLRVAITGSNSAPLMVPVDALLGGNWDDAAAETLAQLVRKTSNVLRTTIT ---------------------------3333----------------------------- GVKYRRRVLLAISRKVVDQLWEA ----------------------- >4-hydroxybenzoyl-CoA redu; SWP:O33818; PDB:1RM6C; MKNILRLTLNGRAREDLVPDNMLLLDYLRETVGLTGTKQGCDGGECGACTVLVDDRPRLA --------iiii------1111------------------------1111--iiii--11 CSTLAHQVAGKKVETVESLATQGTLSKLQAAFHEKLGTQCGFCTPGMIMASEALLRKNPS 11-33332222---3333--!!!!--------1111----1111---------------- PSRDEIKAALAGNLCRCTGYVKIIKSVETAAAARLCE -------1111-------------------------- >MATRIX METALLOPROTEINASE-; SWP:P51512; PDB:1RM8A; GQKWQHKHITYSIKNVTPKVGDPETRKAIRRAFDVWQNVTPLTFEEVPYSELENGKRDVD ----------------3333---------------------------1111--------- ITIIFASGFHGDSSPFDGEGGFLAHAYFPGPGIGGDTHFDSDEPWTLGNPNHDGNDLFLV ------------------------------!!!!-----1111----------------- AVHELGHALGLEHSNDPTAIMAPFYQYMETDNFKLPNDDLQGIQKIYGP ------1111-----1111------------------------------ >RAG1; SWP:P15919; PDB:1RMD; NCSKIHLSTKLLAVDFPAHFVKSISCQICEHILADPVETSCKHLFCRICILRCLKVMGSY 3333---1111----------1111-------------1111------------------ CPSCRYPCFPTDLESPVKSFLNILNSLMVKCPAQDCNEEVSLEKYNHHVSSHKESK --------3333--------------------2222----3333------------ >IGG2A-KAPPA R6.5 FAB (HEA; SWP:NA; PDB:1RMFH; QVQLQQSGPELVRPGVSVKISCKGSGYTFIDYAIHWVKESHAKSLEWIGVISAYSGDTNY ------------2222-----------1111--------1111----------------- NQKFKGKATMTVDKSSNTAYLELARLTSEDSAIYYCARGGWLLLSFDYWGQGTTLTVSSA ------------3333----------3333------------------------------ KTTAPSVTPLAPVCGDTTGSSVTLGVLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDL ---------------------------------------------2222----------- YTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKI ---------1111----------------------- >RHAMNOGALACTURONASE A; SWP:Q00001; PDB:1RMG; QLSGSVGPLTSASTKGATKTCNILSYGAVADNSTDVGPAITSAWAACKSGGLVYIPSGNY 
---------------------3333----------------------------------- ALNTWVTLTGGSATAIQLDGIIYRTGTASGNMIAVTDTTDFELFSSTSKGAVQGFGYVYH --------------------------------------------1111-----------1 AEGTYGARILRLTDVTHFSVHDIILVDAPAFHFTMDTCSDGEVYNMAIRGGNEGGLDGID 111--------------------------------------------------------- VWGSNIWVHDVEVTNKDECVTVKSPANNILVESIYCNWSGGCAMGSLGADTDVTDIVYRN ------------------------------------------------------------ VYTWSSNQMYMIKSNGGSGTVSNVLLENFIGHGNAYSLDIDGYWSSMTAVAGDGVQLNNI -------------------------------------------1111------------- TVKNWKGTEANGATRPPIRVVCSDTAPCTDLTLEDIAIWTESGSSELYLCRSAYGSGYCL ----------3333--------1111------------------------------!!!! KDSSSHTSYTTTSTVTAAPSGYSATTMAADLATAFGLTASIPIPTIPTSFYPGLTPYSAL ------------------2222----1111--------------------2222------ AG -- >DISINTEGRIN SCHISTATIN; SWP:P83658; PDB:1RMRA; NSVHPCCDPVICEPREGEHCISGPCCENCYFLNSGTICKRARGDGNQDYCTGITPDCPRN ---1111-------2222----1111iiii--2222-----------------------1 RYNV 111- >RIBGRASS MOSAIC VIRUS COA; SWP:P03580; PDB:1RMVA; SYNITNSNQYQYFAAVWAEPTPMLNQCVSALSQSYQTQAGRDTVRQQFANLLSTIVAPNQ -----3333---------------------------------------1111-------- RFPDTGFRVYVNSAVIKPLYEALMKSFDTRNRIIETEEESRPSASEVANATQRVDDATVA ---------------3333----3333-----------------------3333------ IRSQIQLLLNELSNGHGYMNRAEFEAILPWTTAPAT -------------------3333------------- >RIBONUCLEASE 4; SWP:P34096; PDB:1RNFA; MQDGMYQRFLRQHVHPEETGGSDRYCNLMMQRRKMTLYHCKRFNTFIHEDIWNIRSICST 3333----------3333------------1111---------------3333-3333-- TNIQCKNGKMNCHEGVVKVTDCRDTGSSRAPNCRYRAIASTRRVVIACEGNPQVPVHFDG ----1111---------------------------------------------------- >HYPOTHETICAL PROTEIN ORF9; SWP:Q54324; PDB:1RO2A; SSERIRYAKWMLEHGFNIIPIDPESKKPVLKEWQKYSHEMPSDEEKQRFLKMIEEGYNYA -3333------1111----------------3333------1111--------------- IPGGQKGMVIMDFESKEKLKAWIGESALEELCRKTLCTNTVHGGIHIYVLSNDIPPHKIN ----%%%%-------------------------------1111-----------------
PLFEENGKGIIDLQSYNSYVLGLGSCVNHLHCTTDKCPWKEQNYTTCYTLYNELKEISKV --------------2222---2222---1111-1111-2222------------------ DLKSLLRFLAEKGKRLGITLSKTAKEWLEG -------------1111------------- >DISINTEGRIN ECHISTATIN; SWP:P17347; PDB:1RO3A; ECESGPCCRNCKFLKEGTICKRARGDDMDDYCNGKTCDCPRNPHKGPAT ---------------------3333------------------------ >ALPHA-2,3/8-SIALYLTRANSFE; SWP:Q9LAK3; PDB:1RO7A; KKVIIAGNGPSLKEIDYSRLPNDFDVFRCNQFYFEDKYYLGKKCKAVFYNPSLFFEQYYT ---------3333--1111-----------1111---------------3333------- LKHLIQNQEYETELICSNYNQAHLENENFVKTFYDYFPDAHLGYDFFKQLKDFNAYFKFH ----1111------------1111-------3333-1111-----1111----------- EIYFNQRITSGVYCAVAIALGYKEIYLSGIDFYQNGSSYAFDTKQKNLLKLAPNFKNDNS --------3333-----1111---------%%%%-------------------1111--- HYIGHSKNTDIKALEFLEKTYKIKLYCLCPNSLLANFIELAPNLNSNFIIQEKNNYTKDI -3333-------------1111------1111-1111----------------------- LIPSSEAYGKFSKNI ----------3333- >CYSTATIN D; SWP:P28325; PDB:1ROAA; GGIHATDLNDKSVQRALDFAISEYNKVINKDEYYSRPLQVMAAYQQIVGGVNYYFNVKFG ----------------------------------------------2222---------- RTTCTKSQPNLDNCPFNDQPKLKEEEFCSFQINEVPWEDKISILNYKCRKV ----1111----------2222----------------------------- >ANTI-SILENCING PROTEIN 1; SWP:P32447; PDB:1ROCA; GASIVSLLGIKVLNNPAKFTDPYEFEITFECLESLKHDLEWKLTYVGSSRSLDHDQELDS -----------------1111--------------------------1111--------- ILVGPVPVGVNKFVFSADPPSAELIPASELVSVTVILLSCSYDGREFVRVGYYVNNEYDE --------------------3333---3333----------iiii--------------- EELRENPPAKVQVDHIVRNILAEKPRVTRFNIVWD -----------3333-----1111----------- >CHIMERIC PROTEIN OF INTER; SWP:P10145; PDB:1RODA; SAKELRCQCIKTYSKPFHPKFIKELRVIESGPHCANTEIIVKLSDGRELCLDPASPIVKK -----------------3333--------------------------------------- IIEKMLNSDKSN -----3333--- >FERREDOXIN; SWP:P0A3C9; PDB:1ROE; ATYKVTLVRPDGSETTIDVPEDEYILDVAEEQGLDLPFSCRAGACSTCAGKLLEGEVDQS -----------------------------3333--------------------------- DQSFLDDDQIEKGFVLTCVAYPRSDCKILTNQEEELY -------1111----3333------------------ >FERREDOXIN; SWP:P46797; PDB:1ROF; 
MKVRVDADACIGCGVCENLCPDVFQLGDDGKAKVLQPETDLPCAKDAADSCPTGAISVEE ------3333-----3333------------------------3333---1111------ >FKBP59-I; SWP:P27124; PDB:1ROT; GVDISPKQDEGVLKVIKREGTGTETPMIGDRVFVHYTGWLLDGTKFDSSLDRKDKFSFDL --------------------------2222---------1111----3333--------- GKGEVIKAWDIAVATMKVGELCRITCKPEYAYGSAGSPPKIPPNATLVFEVELFEFKG ------------1111-----------1111-----3333------------------ >MSP-DOMAIN PROTEIN LIKE F; SWP:O01829; PDB:1ROWA; LTADPPACTVPAAGVSSTHKLVNGGAEKIVFKIKSSNNNEYRIAPVFGFVDPSGSKDVVI ----------3333------------------------------------2222------ TRTAGAPKEDKLVVHFASAPADATDAQAAFVAVAPAGTVTIPMSATA -----------------------------1111-------------- >THIAZOLE BIOSYNTHETIC ENZ; SWP:Q38814; PDB:1RP0A; YDLNAFTFDPIKESIVSREMTRRYMTDMITYAETDVVVVGAGSAGLSAAYEISKNPNVQV -1111------3333---------------------------------------1111-- AIIEQSVSPGGGAWLGGQLFSAMIVRKPAHLFLDEIGVAYDEQDTYVVVKHAALFTSTIM ---------!!!!---%%%%----------------------1111----3333------ SKLLARPNVKLFNAVAAEDLIVKGNRVGGVVTNWALVAQNHHTQSCMDPNVMEAKIVVSS -----1111-------------%%%%-------3333--1111----------------- CGHDGPFGATGVKRLKSIGMIDHVPGMKALDMNTAEDAIVRLTREVVPGMIVTGMEVAEI ----1111-------1111------------3333-----------2222---3333--- DGAPRMGPTFGAMMISGQKAGQLALKALGLPNAIDGTL -------------------------1111---3333-- >PANCREATIC LIPASE RELATED; SWP:P06857; PDB:1RP1; KEVCYEQIGCFSDAEPWAGTAIRPLKVLPWSPERIGTRFLLYTNKNPNNFQTLLPS ----2222-----------3333-------3333--------3333---------- >RNA POLYMERASE SIGMA FACT; SWP:NA; PDB:1RP3A; HMKNPYSNQIEREELILKYLPLVKAIATNIKKHLPEDVDIRDLISYGVIGLIKAVDNLST ------------------------------11113333---------------------- ENPKRAEAYIKLRIKGAIYDYLRSLDFGSRQVREKERRIKEVVEKLKEKLGREPTDEEVA -------------------------1111------------------------------- KELGISTEELFKTLDKINFSYILSLEEVFRDFARDYSELIPSSTNVEEEVIKRELTEKVK ----------------------------------11113333------------------ EAVSKLPEREKLVIQLIFYEELPAKEVAKILETSVSRVSQLKAKALERLREMLSN --1111------------------------------------------------- >RNA POLYMERASE SIGMA FACT; 
SWP:NA; PDB:1RP3B; VNRIELSRLIGLLLETSGTNKIEDKVTLSKIAQELSKNDVEEKDLEKKVKELKEKIEKGE ------------1111-------------------------------------------- YEVSDEKVVKGLIEFFT ----------------- >Hypothetical 65.0 kDa pro; SWP:Q03103; PDB:1RP4A; GSFNELNAINENIRDDLSALLKSDFFKYFRLDLYKQCSFWDANDGLCLNRACSVDVVEDW 3333-------------------1111--------------------%%%%-------33 DTLPEYWQPEILGSFNNDTKEADDSDDECKFLDQLCQTSKKPVDIEDTINYCDVNDFNGK 33-11113333-----------1111----3333---%%%%------------------- NAVLIDLTANPERFTGYGGKQAGQIWSTIYQDNCFTIGETGESLAKDAFYRLVSGFHASI -----3333--------------------------2222--------------------- GTHLSKEYLNTKTGKWEPNLDLFARIGNFPDRVTNYFNYAVVAKALWKIQPYLPEFSFCD -------------------3333--1111-------------------3333----1111 LVNKEIKNKDNVISQLDTKIFNEDLVFANDLSLTLKDEFRSRFKNVTKIDCVQCDRCRLW 11113333---3333--------1111--------------------------------- GKIQTTGYATALKILFEINDADEFTKQHIVGKLTKYELIALLQTFGRLSESIESVNFEKY ---------------3333----------------------------------------- GKRLLER ---3333 >PROSTATIC ACID PHOSPHATAS; SWP:P20646; PDB:1RPA; KELKFVTLVFRHGDRGPIETFPNDPIKESSWPQGFGQLTKWGMGQHYELGSYIRRRYGRF --------------------1111--33333333-----------------------333 LNNSYKHDQVYIRSTDVDRTLMSAMTNLAALFPPEGNSIWNPRLLWQPIPVHTVSLSEDR 3---------------3333--------------!!!!--1111----------1111-- LLYLPFRDCPRFQELKSETLKSEEFLKRLQPYKSFIDTLPSLSGFEDQDLFEIWSRLYDP -----------------3333-3333--3333------1111------3333-------- LYCESVHNFTLPTWATEDAMTKLKELSELSLLSLYGIHKQKEKSRLQGGVLVNEILKNMK ----1111---1111------------------------3333----------------- LATQPQKARKLIMYSAHDTTVSGLQMALDVYNGLLPPYASCHIMELYQDNGGHFVEMYYR ---1111---------3333----------------2222-------------------- NETQNEPYPLTLPGCTHSCPLEKFAELLDPVIPQDWATECMG -----------2222----3333----3333---3333---- >RECEPTOR PROTEIN TYROSINE; SWP:P28827; PDB:1RPMA; AIRVADLLQHITQMKCAEGYGFKEEYESFFEGQSAPWDSAKKDENRMKNRYGNIIAYDHS --1111----------%%%%-----3333-------3333-1111-----1111--3333 RVRLQTIEGDTNSDYINGNYIDGYHRPNHYIATQGPMQETIYDFWRMVWHENTASIIMVT 
------------------------------------3333-------------------- NLVEVGRVKCCKYWPDDTEIYKDIKVTLIETELLAEYVIRTFAVEKRGVHEIREIRQFHF ---iiii--------------------------------------2222----------- TGWPDHGVPYHATGLLGFVRQVKSKSPPSAGPLVVHCSAGAGRTGCFIVIDIMLDMAERE --------------------------1111------------------------------ GVVDIYNCVRELRSRRVNMVQTEEQYVFIHDAILEACL -----------3333----------------------- >GDP-MANNOSE 4,6-DEHYDRATA; SWP:Q51366; PDB:1RPNA; RSALVTGITGQDGAYLAKLLLEKGYRVHGLVARRSSDTRWRLRELGIEGDIQYEDGDMAD ------1111----------1111----------------------1111------1111 ACSVQRAVIKAQPQEVYNLAAQSFVGASWNQPVTTGVVDGLGVTHLLEAIRQFSPETRFY -----------------------3333--------------------------1111--- QASTSEMFGLIQAERQDENTPFYPRSPYGVAKLYGHWITVNYRESFGLHASSGILFNHES -----3333-------1111---------------------------------------1 PLRGIEFVTRKVTDAVARIKLGKQQELRLGNVDAKRDWGFAGDYVEAMWLMLQQDKADDY 1113333-----------1111-----------------3333------1111------- VVATGVTTTVRDMCQIAFEHVGLDYRDFLKIDPAFFRPAEVDVLLGNPAKAQRVLGWKPR ------------------1111-3333----3333------------------------- TSLDELIRMMVEADLRRVSRE -----------------1111 >19 KDA PROTEIN; SWP:Q66104; PDB:1RPUA; NDTREQANGERWDGGSGGITSPFKLPDESPSWTEWRLYNDENPLGFKESWGFGKVVFKRY -------3333-------------------33333333-------------!!!!----- LRYDRTEASLHRVLGSWTGDSVNYAASRFLGANQVGCTYSIRFRGVSVTISGGSRTLQHL -------------------------3333-----------------------3333---- CEMAIRSKQELLQLTPVEV ---------------3333 >RIBULOSE-PHOSPHATE 3-EPIM; SWP:Q43843; PDB:1RPXA; SRVDKFSKSDIIVSPSILSANFSKLGEQVKAIEQAGCDWIHVDVMDGRFVPNITIGPLVV 3333-----------3333-3333-------------------------------3333- DSLRPITDLPLDVHLMIVEPDQRVPDFIKAGADIVSVHCEQSSTIHLHRTINQIKSLGAK --3333---------------------------------1111----------------- AGVVLNPGTPLTAIEYVLDAVDLVLIMSVNPGFGGQSFIESQVKKISDLRKICAERGLNP -----11113333---1111---------2222--------------------------- WIEVDGGVGPKNAYKVIEAGANALVAGSAVFGAPDYAEAIKGIKTSKRPE --------1111--------------3333----3333------------ >ADAPTOR PROTEIN APS; SWP:Q9JID9; PDB:1RPYA; 
ELSDYPWFHGTLSRVKAAQLVLAGGPRSHGLFVIRQSETRPGECVLTFNFQGKAKHLRLH -1111----------------22221111----------3333------iiii------- GQCHVQHLWFQSVFDLRHFHT ----------------3333- >PEPTIDE CHAIN RELEASE FAC; SWP:Q9X183; PDB:1RQ0A; MKEKKKEIEKLLARPDLTPEQMKNYGMEYAKIEEIENITNRIKETQEFIELLREEGENEL -----------------3333--------------------------3333--------- EIEKYEKELDQLYQELLFLLSPEASDKAIVEIRPGTGGEEAALFARDLFRMYTRYAERKG -------------------------------------3333------------------- WNLEVAEIHETDLGGIREVVFFVKGKNAYGILKYESGVHRVQRVPVTESGGRIHTSTATV ----------1111------------3333-1111---------1111------------ AVLPEIEEKDIEIRPEDLKIETFRASGYVNKTESAVRITHLPTGIVVSCQNERSQYQNKQ ------1111---3333------------------------------------------- TALRILRARLYQLQKEQKEREISQRSEKIRTYNFPQNRVTDHRINYTSYRLQEILDGDLD ---------------------------------1111---------------------33 EIISKLIEHDIENNLEEVL 33----------------- >CELL DIVISION PROTEIN FTS; SWP:O08378; PDB:1RQ2A; LAVIKVVGIGGGGVNAVNRMIEQGLKGVEFIAINTDAQALLMSDADVKLDVGRDSTGADP ---------------------------------------------------3333----- EVGRKAAEDAKDEIEELLRGADMVFVTAGEGGGTGTGGAPVVASIARKLGALTVGVVTRP -----------------2222------------3333----------------------- FSFEGKRRSNQAENGIAALRESCDTLIVIPNDRLLQMGDAAVSLMDAFRSADEVLLNGVQ 3333--------------3333-------33331111-1111------------------ GITDLITTPGLINVDFADVKGIMSGAGTALMGIGSARGEGRSLKAAEIAINSPLLEASME ----------------------2222-----------2222----------3333--333 GAQGVLMSIAGGSDLGLFEINEAASLVQDAAHPDANIIFGTVIDDSLGDEVRVTVIAAGF 3----------1111----------------1111--------3333------------- >30S RIBOSOMAL PROTEIN S17; SWP:O26894; PDB:1RQ6A; MGNIRTSFVKRIAKEMIETHPGKFTDDFDTNKKLVEEFSTVSTKHLRNKIAGYITRIISQ ----------------3333----------------------1111-------------- QK -- >CONSERVED HYPOTHETICAL PR; SWP:Q99TQ4; PDB:1RQ8A; MLTGKQKRYLRSLAHNIDPIFQIGKGGINENMIKQIDDTLENRELIKVHVLQNNFDDKKE ---3333------1111------------------------------------------- LAETLSEATRSELVQVIGSMIVIYRESKENKEIELP ----------------!!!!---------------- >TRANSCARBOXYLASE 5S SUBUN; SWP:Q70AC7; 
PDB:1RQBA; PREIEVSEPREVGITELVLRDAHQSLATRAEDVGACADIDAAGYWSVECWGGATYDSCIR --------------------------------1111--------------!!!!------ FLNEDPWERLRTFRKLPNSRLQLLRGQNLLGYRHYNDEVVDRFVDKSAENGDVFRVFDAN ----3333-------------------!!!!----3333--------1111--------- DPRNAHAAAVKKAGKHAQGTICYTISPVHTVEGYVKLAGQLLDGADSIALDAALLKPQPA 3333---------------------1111------------------------------- YDIIKAIKDTYGQKTQINLHCHSTTGVTEVSLKAIEAGVDVVDTAISSSLGPGHNPTESV -----------1111-------11113333----1111-----------!!!!------- AELEGTGYTTNLDYDRLHKIRDHFKAIRPKYKKFESKTLVDTSIFKSQIPGGLSNESQLR --2222------------------------3333-------3333--------------1 AQGAEDKDEVAEVPRVRKAAGFPPLVTPSSQIVGTQAVFNVGEYKRTGEFADILGYYGAS 111---------------------------------------------------1111-- PADRDPKVVKLAEEQSGKKPITQRPADLLPPEWEKQSKEAATLKGFNGTDEDVLTYALFP -----------------------3333---------------2222-------------- QVAPVFFEHRAEGPHSVALTDAQLKAEA --------3333------------1111 >GERANYLTRANSTRANSFERASE; SWP:P22939; PDB:1RQJA; MDFPQQLEACVKQANQALSRFIAPLPFQNTPVVETMQYGALLGGKRLRPFLVYATGHMFG ---------------------1111------------------------------3333- VSTNTLDAPAAAVECIHAYSLIHDDLPAMDDDDLRRGLPTCHVKFGEANAILAGDALQTL -3333--------------------3333-----iiii--3333---------------- AFSILSDADMPEVSDRDRISMISELASASGIAGMCGGQALDLDAEGKHVPLDALERIHRH ---------1111----------------1111----------2222------------- KTGALIRAAVRLGALSAGDKGRRALPVLDKYAESIGLAFQVQDDILDVVGDTATLGKRQG --------------------------------------------------3333---222 ADQQLGKSTYPALLGLEQARKKARDLIDDARQSLKQLAEQSLDTSALEALADYIIQRNK 2-------3333------------------------3333------------------- >5'-FLUORO-5'-DEOXYADENOSI; SWP:Q70GK9; PDB:1RQPA; RPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTVVDVCHSMTPWDVEEGARYIVDLPRFFP ---------------------------1111---------2222-------111133332 EGTVFATTTYPATGTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEGSYIYIAPNNGLL 222------1111-------------------------!!!!---------------111 TTVLEEHGYLEAYEVTSPKVIPEQPEPTFYSREMVAIPSAHLAAGFPLSEVGRPLEDHEI 1---------------3333-----1111------------1111-1111-----3333- 
VRFNRPAVEQDGEALVGVVSAIDHPFGNVWTNIHRTDLEKAGIGYGARLRLTLDGVLPFE ---------------------------------3333-1111-2222-----%%%%---- APLTPTFADAGEIGNIAIYLNSRGYLSIARNAASLAYPYHLKEGMSARVEA -----3333--2222-----1111------------1111-2222------ >THAUMATIN I; SWP:P02883; PDB:1RQWA; ATFEIVNRCSYTVWAAASKGDAALDAGGRQLNSGESWTINVEPGTKGGKIWARTDCYFDD -----------------------!!!!----2222------2222-------------11 SGSGICKTGDCGGLLRCKRFGRPPTTLAEFSLNQYGKDYIDISNIKGFNVPMDFSPTTRG 11--------iiii-------------------iiii------1111------------- CRGVRCAADIVGQCPAKLKAPGGGCNDACTVFQTSEYCCTTGKCGPTEYSRFFKRLCPDA --------3333--3333-1111---3333---3333-1111----3333------1111 FSYVLDKPTTVTCPGSSNYRVTFCPTA --1111-----------------1111 >MIDDLE OPERON REGULATOR; SWP:P23848; PDB:1RR7A; RFPALLAELNDLLRGELSRLGVDPAHSLEIVVAICKHLGGGQVYIPRGQALDSLIRDLRI -----------------1111-----3333-----------------------------1 WNDFNGRNVSELTTRYGVTFNTVYKAIRRMRRLK 111--------------------------3333- >RIBONUCLEASE; SWP:P00684; PDB:1RRAA; AESSADKFKRQHMDTEGPSKSSPTYCNQMMKRQGMTKGSCKPVNTFVHEPLEDVQAICSQ --1111---------------11113333-1111--------------------3333-- GQVTCKNGRNNCHKSSSTLRITDCRLKGSSKYPNCDYTTTDSQKHIIIACDGNPYVPVHF ----1111------------------3333------------------------------ DASV ---- >ATP-DEPENDENT PROTEASE LA; SWP:P08177; PDB:1RREA; RVGQVTGLAWTEVGGDLLTIETACVPGKGKLTYTGSLGEVQESIQAALTVVRARAEKLGI ----------1111-----------------------------------------1111- NPDFYEKRDIHVHVPEGATPKDGPAAGIACTALVSCLTGNPVRADVATGEITLRGQVLPI 1111-----------1111----1111------------------------1111----- GGLKEKLLAAHRGGIKTVLIPFENKRDLEEIPDNVIADLDIHPVKRIEEVLTLALQNEP ----------1111------11111111-----------------3333---------- >SEED LIPOXYGENASE-3; SWP:P09186; PDB:1RRHA; RGHKIKGTVVLMRKNVLDVNSVTSVGGIIGQGLDLVGSTLDTLTAFLGRSVSLQLISATK ------------3333-3333--------------------3333--------------- ADANGKGKLGKATFLEGIITSLPTLGAGQSAFKINFEWDDGSGIPGAFYIKNFMQTEFFL -1111----------------11112222------------------------------- VSLTLEDIPNHGSIHFVCNSWIYNAKLFKSDRIFFANQTYLPSETPAPLVKYREEELHNL 
-----------------------3333-------------3333-3333----------- RGDGTGERKEWERIYDYDVYNDLGDPDKGENHARPVLGGNDTFPYPRRGRTGRKPTRKDP --------1111------------33331111-----------------------3333- NSESRSNDVYLPRDEAFGHLKSSDFLTYGLKSVSQNVLPLLQSAFDLNFTPREFDSFDEV -----------1111-----1111-------------------1111--------3333- HGLYSGGIKLPTDIISKISPLPVLKEIFRTDGEQALKFPPPKVIQVSKSAWMTDEEFARE 3333---------11111111-3333--------------3333----3333-------- MLAGVNPNLIRCLKDFPPRSKLDSQVYGDHTSQITKEHLEPNLEGLTVDEAIQNKRLFLL ------------------------3333------3333-------------1111----- DHHDPIMPYLRRINATSTKAYATRTILFLKNDGTLRPLAIELSLPHPQGDQSGAFSQVFL --33331111--1111-------------1111---------------3333-------- PADEGVESSIWLLAKAYVVVNDSCYHQLVSHWLNTHAVVEPFIIATNRHLSVVHPIYKLL ----3333------------------------------------------1111-----3 HPHYRDTMNINGLARLSLVNDGGVIEQTFLWGRYSVEMSAVVYKDWVFTDQALPADLIKR 333----------------2222-----3333---------3333-3333-------111 GMAIEDPSCPHGIRLVIEDYPYTVDGLEIWDAIKTWVHEYVFLYYKSDDTLREDPELQAC 1----3333-------------------------------3333--3333---------- WKELVEVGHGDKKNEPWWPKMQTREELVEACAIIIWTASALHAAVNFGQYPYGGLILNRP --------1111--1111---------------------------1111-----3333-- TLSRRFMPEKGSAEYEELRKNPQKAYLKTITPKFQTLIDLSVIEILSRHASDEVYLGERD --------------------------3333--------------1111-1111------- NPNWTSDTRALEAFKRFGNKLAQIENKLSERNNDEKLRNRCGPVQMPYTLLLPSSKEGLT -----------------------------3333333333331111---1111-------- FRGIPNSISI ---------- >LACTALDEHYDE REDUCTASE; SWP:P11549; PDB:1RRMA; ANRILNETAWFGRGAVGALTDEVKRRGYQKALIVTDKTLVQCGVVAKVTDKDAAGLAWAI -----------2222--------1111------------1111--------1111----- YDGVVPNPTITVVKEGLGVFQNSGADYLIAIGGGSPQDTCKAIGIISNNPEFADVRSLEG ------------------------------------------------3333--1111-- LSPTNKPSVPILAIPTTAGTAAEVTINYVITDEEKRRKFVCVDPHDIPQVAFIDADDGPP --------------------3333--------1111-------1111------------- ALKAATGVDALTHAIEGYITRGAWALTDALHIKAIEIIAGALRGSVAGDKDAGEEALGQY -------------------1111------------------------------------- 
VAGGFSNVGLGLVHGAHPLGAFYNTPHGVANAILLPHVRYNADFTGEKYRDIARVGVKVE ---3333---3333--------------------3333------!!!!------------ GSLEEARNAAVEAVFALNRDVGIPPHLRDVGVRKEDIPALAQAALDDVCTGGNPREATLE ------------------1111---3333---3333----------3333---------- DIVELYHTAWEGG ------------- >RAT ONCOMODULIN; SWP:P02631; PDB:1RRO; SITDILSAEDIAAALQECQDPDTFEPQKFFQTSGLSKMSASQVKDIFRFIDNDQSGYLDG 3333-----------11112222-------33331111------------1111----!! DELKYFLQKFQSDARELTESETKSLMDAADNDGDGKIGADEFQEMVHS !!--3333--1111---------------------------------- >E3 SUMO-protein ligase Ra; SWP:P49792; PDB:1RRPB; HFEPVVPLPDKIEVKTGEEDEEEFFCNRAKLFRFDVESKEWKERGIGNVKILRHKTSGKI ------------------------------------------------------------ RLLMRREQVLKICANHYISPDMKLTPNAGSDRSFVWHALDYADELPKPEQLAIRFKTPEE ------------------3333----iiii----------1111---------------- AALFKCKFEEAQSI ---------1111- >MUTY; SWP:P83847; PDB:1RRQA; PAREFQRDLLDWFARERRDLPWRKDRDPYKVWVSEVMLQQTRVETVIPYFEQFIDRFPTL -------------------1111------------------3333--------------- EALADADEDEVLKAWEGLGYYSRVRNLHAAVKEVKTRYGGKVPDDPDEFSRLKGVGPYTV ------3333--1111---------------------%%%%---3333---22223333- GAVLSLAYGVPEPAVNGNVMRVLSRLFLVTDDIAKCSTRKRFEQIVREIMAYENPGAFNE ------------------------1111---11113333-----------1111------ ALIELGALVCTPRRPSCLLCPVQAYCQAFAEGVAEELPVKMVKQVPLAVAVLADDEGRVL ---------------3333--3333-------1111------------------------ IRKRDSTGLLANLWEFPSCETDGADGKEKLEQMVGLQVELTEPIVSFEHAFSHLVWQLTV ---------2222------------3333---------------------1111------ FPGRLVHGGPVEEPYRLAPEDELKAYAFPVSHQRVWREYKEWAS -----------!!!!---1111------3333--------3333 >GLYCOSYLTRANSFERASE GTFD; SWP:Q9AFC7; PDB:1RRVA; MRVLLSVCGTRGDVEIGVALADRLKALGVQTRMCAPPAAEERLAEVGVPHVPVGLPQHMM ------------------------1111-------3333----------------1111- LQEGMPPPPPEEEQRLAAMTVEMQFDAVPGAAEGCAAVVAVGDLAAATGVRSVAEKLGLP -2222-----------------------3333-----------3333------------- FFYSVPSPVYLASPHLPPAYDEPTTPGVTDIRVLWEERAARFADRYGPTLNRRRAEIGLP 
------------------------2222--------------------------1111-- PVEDVFGYGHGERPLLAADPVLAPLQPDVDAVQTGAWLLSDERPLPPELEAFLAAGSPPV ---33331111----------------------------------3333----------- HIGFGSSSGRGIADAAKVAVEAIRAQGRRVILSRGWTELVLPDDRDDCFAIDEVNFQALF ---!!!!-3333-----------1111-------1111----------------333311 RRVAAVIHHGSAGTEHVATRAGVPQLVIPRNTDQPYFAGRVAALGIGVAHDGPTPTFESL 11----------------1111--------!!!!-------------------------- SAALTTVLAPETRARAEAVAGMVLTDGAAAAADLVLAAVGR ------------------1111------------------- >GLYCOGEN SYNTHESIS PROTEI; SWP:P26649; PDB:1RRZA; MDHSLNSLNNFDFLARSFARMHAEGRPVDILAVTGNMDEEHRTWFCARYAWYCQQMMQAR ------------------3333------3333-33333333-3333-------------- ELELEH ------ >FMS1 PROTEIN; SWP:P50264; PDB:1RSGA; PAKKKVIIIGAGIAGLKAASTLHQNGIQDCLVLEARDRVGGRLQTVTGYQGRKYDIGASW ----------------------1111-------------!!!!-----%%%%-------- HHDTLTNPLFLEEAQLSLNDGRTRFVFDDDNFIYIDEERGRVDHDKELLLEIVDNESKFA --3333------------------------------------------------------ ELEFDCSFFQLVKYLLQRRQFLTNDQIRYLPQLCRYLELWHGLDWKLLSAKDTYFGHQGR -----------------3333--------------3333----1111-3333-------- NAFALNYDSVVQRIAQSFPQNWLKLSCEVKSITREPSKNVTVNCEDGTVYNADYVIITVP -------------1111-1111------------1111-----1111------------3 QSVLNLSVQPEKNLRGRIEFQPPLKPVIQDAFDKIHFGALGKVIFEFEECCWSNESSKIV 3333333------2222-------3333-------------------------------- TLANSTNEFVEIVRNAENLDELDSLSVTCWSQPLFFVNLSKSTGVASFLQAPLTNHIESI -----3333------------------1111------3333----------------111 REDKERLFSFFQPVLNKIKCLDSEDVIDGRANKPVLRNIIVSNWTRDPYSRGAYSACFPV 1-----------------1111--------------------33331111---------- DVVASNGQDSRIRFAGEHTIDGAGCAYGAWESGRREATRISDLLKLEHHH --------1111---1111--2222--------------------1111- >PRESYNAPTIC PROTEIN SAP97; SWP:Q62696; PDB:1RSOA; RKQDTQRALHLLEEYRSKLSQTEDRQLRSSIERVISIFQSNLFQALIDIQEFYEVTLLDN 3333-------------------3333-------------------3333---------- >Peripheral plasma membran; SWP:Q62915; PDB:1RSOB; GLLAAERAVSQVLDSLEEIHALTDSSEKDLDFLHSVFQDQHLHTLLDLYDKINTKS 
-3333---------------------------------3333-------------- >RIBOSOMAL PROTEIN S7; SWP:P17291; PDB:1RSS; LQPDLVYGDVLVTAFINKIMRDGKKNLAARIFYDACKIIQEKTGQEPLKVFKQAVENVKP --------------------%%%%-------------------------------1111- RMEVRSRRVGGANYQVPMEVSPRRQQSLALRWLVQAANQRPERRAAVRIAHELMDAAEGK --------iiii------------------------1111---------------1111- GGAVKKKEDVERMAEAHYRW -------------------- >SYNAPTOTAGMIN I; SWP:P21707; PDB:1RSY; GGGILDSMVEKLGKLQYSLDYDFQNNQLLVGIIQAAELPALDMGGTSDPYVKVFLLPDKK ----1111---------------------------------1111--------------- KKFETKVHRKTLNPVFNEQFTFKVPYSELGGKTLVMAVYDFDRFSKHDIIGEFKVPMNTV ------------------------33331111-----------------------1111- DFGHVTEEWRDLQSA --------------- >PURINE NUCLEOSIDE PHOSPHO; SWP:P00491; PDB:1RSZA; NGYTYEDYKNTAEWLLSHTKHRPQVAIICGSGLGGLTDKLTQAQIFDYSEIPNFPRSTVP ---3333-------1111-----------2222--3333-------33332222------ GHAGRLVFGFLNGRACVMMQGRFHMYEGYPLWKVTFPVRVFHLLGVDTLVVTNAAGGLNP ----------iiii---------3333--3333--------1111-------------33 KFEVGDIMLIRDHINLPGFSGQNPLRGPNDERFGDRFPAMSDAYDRTMRQRALSTWKQMG 332222----------3333--1111---3333------1111------------3333- EQRELQEGTYVMVAGPSFETVAECRVLQKLGADAVGMSTVPEVIVARHCGLRVFGFSLIT ----------------------------------------------3333---------- NKVIMDYESLEKANHEEVLAAGKQAAQKLEQFVSILMASIPL ----------------------------------3333---- >FIMBRIN; SWP:O59945; PDB:1RT8A; INEEERREFIKHINSVLAGDPDVGSRVPINTETFEFFDQCKDGLILSKLINDSVPDTIDE ----------------------1111--------3333-1111----------2222-11 RVLNKQRPLDNFKCIENNNVVINSAKAMGGISITNIGAGDILEGREHLILGLVWQIIRRG 11----------------------1111--------33331111-------------111 LLGKITLDQFLRLPPEKILLRWFNYHLKAANWPRTVSNFSKDVSDGENYTVLLNQLAPEL 1----333311113333----------------------3333-------------3333 CSRAPLQTTDVLQRAEQVLQNAEKLDCRKYLTPTAMVAGNPKLNLAFVAHLFNTHPGLEP --3333----------------1111---------------------------------- AEGEREARVFTLWLNSLDVTPSIHDFFNNLRDGLILLQAYDKITPNTVNWKKVNKAPASG --------------1111------1111-1111---------------1111----3333 
DEMMRFKAVENCNYAVDLGKNQGFSLVGIQGADITDGSRTLTLALVWQMMRMNITKTLHS -------------------------111133331111-------------------1111 TLSDSDMVAWANSMAAKGGKGSQIRSFRDPSISTGVFVLDVLHGIKSEYVDYNLVTDGST ----------------------------3333-------------3333-3333------ EELAIQNARLAISIARKLGAVIFILPEDIVAVRPRLVLHFIGSLMAV ---------------1111-----33331111-----------1111 >Tissue-type plasminogen a; SWP:P00750; PDB:1RTFB; IKGGLFADIASHPWQAAIFAKHRGERFLCGGILISSCWILSAAHCFQERFPPHHLTVILG -------33331111-------------------1111-------1111-3333------ RTYRVVPGEEEQKFEVEKYIVHKEFDDDTYDNDIALLQLKSDSSRCAQESSVVRTVCLPP --1111---------------1111------------------------1111------- ADLQLPDWTECELSGYGKHEALSPFYSERLKEAHVRLYPSSRCTSQHLLNRTVTDNMLCA -----2222----------1111---------------3333-3333%%%%--1111--- GDTRSNLHDACQGDSGGPLVCLNDGRMTLVGIISWGLGCGQKDVPGVYTKVTNYLDWIRD ----------2222--------iiii--------------2222-----3333------- NMRP ---- >BACTERIAL LEUCYL AMINOPEP; SWP:Q01693; PDB:1RTQA; MPPITQQATVTAWLPQVDASQITGTISSLESFTNRFYTTTSGAQASDWIASEWQALSASL -----3333---3333-3333-------3333---1111-----------------1111 PNASVKQVSHSGYNQKSVVMTITGSEAPDEWIVIGGHLDSTIGSHTNEQSVAPGADDDAS ---------2222-------------1111------------11111111---------- GIAAVTEVIRVLSENNFQPKRSIAFMAYAAEEVGLRGSQDLANQYKSEGKNVVSALQLDM ------------1111-------------1111------------1111----------- TNYKGSAQDVVFITDYTDSNFTQYLTQLMDEYLPSLTYGFDTCGYACSDHASWHNAGYPA --------------------------------1111-----------3333--1111--- AMPFESKFNDYNPRIHTTQDTLANSDPTGSHAKKFTQLGLAYAIEMGSATG ------1111------11113333-11113333------------------ >GERANYLTRANSTRANSFERASE; SWP:Q8NWD6; PDB:1RTRA; TNLPMNKLIDEVNNELSVAINKSVMDTQLEESMLYSLNAGGKRIRPVLLLLTLDSLNTEY ---3333----------------------------------------------1111-33 ELGMKSAIALEMIHTYSLIHDDLPAMDNDDYRRGKLTNHKVYGEWTAILAGDALLTKAFE 33--------------------1111-----iiii-3333-------------------- LISSDDRLTDEVKIKVLQRLSIASGHVGMVGGQMLDMQSEGQPIDLETLEMIHKTKTGAL ----1111----------------1111----------2222------------------ 
LTFAVMSAADIANVDDTTKEHLESYSYHLGMMFQIKDDLLDCYGDEAKSTYVSLLGKDGA -------------------------------------------------3333------- EDKLTYHRDAAVDELTQIDEQFNTKHLLEIVDLFYSR --------------11113333--------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9I4D4; PDB:1RTTA; IKVLGISGSLRSGSYNSAALQEAIGLVPPGMSIELADISGIPLYNEDVYALGFPPAVERF ----------1111-------3333--2222------1111---33331111-------- REQIRAADALLFATPEYNYSMAGVLKNAIDWASRPPEQPFSGKPAAILGASAGRFGTARA ----------------%%%%----------1111---3333-----------1111---- QYHLRQTLVFLDVHPLNKPEVMISSAQNAFDAQGRLLDDKARELIQQQLQALQL --------1111------------3333-------------------------- >RIBONUCLEASE U2; SWP:P00654; PDB:1RTU; CDIPQSTNCGGNVYSNDDINTAIQGALDDVANGDRPDNYPHQYYEASEDITLCCGSGPWS --------iiii-----------------1111-2222---------------------- EFPLVYNGPYYSSRDNYVSPGPDRVIYQTNTGEFCATVTHTGAASYDGFTQCS ------------1111-----------------------2222-1111----- >TRANSCRIPTIONAL ACTIVATOR; SWP:Q8U189; PDB:1RTWA; FSEELIKENENIWRRFLPHKFLIEAENTIKKENFEKWLVNDYYFVKNALRFALLAKAPDD --------3333-1111-3333-----------------------------------111 LLPFFAESIYYISKELEFEKKAQELGISLNGEIDWRAKSYVNYLLSVASLGSFLEGFTAL 1----------------------------------------------------------- YCEEKAYYEAWKWVRENLKERSPYQEFINHWSSQEFGEYVKRIEKILNSLAEKHGEFEKE --------------1111---1111----1111----------------3333------- RAREVFKEVSKFELIFWDIAY -----------------3333 >YVQK PROTEIN; SWP:O34899; PDB:1RTYA; KDSLRVESYGTIDELNSFIGLALAELSGQPGFEDLTAELLTIQHELFDCGGDLAIVTDYK -------------------------1111------------------------------- LTEESVSFLETRIDAYTAEAPELKKFILPGGSKCASLLHIARTITRRAERRVVALKSEEI ------------------------------------------------------------ HETVLRYLNRLSDYFFAGARVVNARSGIGDVEYERSA ------------------------------------- >DCOH-LIKE PROTEIN DCOHM; SWP:Q9CZL5; PDB:1RU0A; DAQWLTAEERDQLIPGLKAAGWSELSERDAIYKEFSFKNFNQAFGFMSRVALQAEKMNHH -----------------1111---1111-------------------------------- PEWFNVYNKVQITLTSHDCGGLTKRDVKLAQFIEKAAASL -----!!!!-------------3333---------1111- >ACETYL-COA SYNTHASE; SWP:P83789; PDB:1RU3A; 
INFDQIFEGAIEPGKEPKRLFKEVYEGAITATSYAEILLSRAIEKYGPDHPVGYPDTAYF -3333-----------------------------------------1111---------- LPVIRAFSGEEVRTLKDMVPILNRMRAQIKSELTFENARLAGEATWYAAEIIEALRYLKH -------------3333--------1111---------------------------1111 TPENPIVVPPWTGFIGDPVVRQYGIKMVDWTIPGEAIIIGRAKDSKAAKKIVDDLMGKGL 1111----------------------1111-----------------------3333--- MLFLCDEIIEQLLEENVKLGVDYIAYPLGNFTQVVHAANYALRAGLMFGGIAPGLRDAHR --------------------1111---------------------------2222----- DYQRRRVLAFVLYLGEHDMVKTAAAMGAIFTGFPVITDQPLPEDKQIKDWFISEPDYDKI ----------------------------1111---------1111-2222---------- VQTALEVRGIKITSIDIDLPINFGPAFEGESIRKGDMHVEFGGGKTPSFELVRMVGPDEI 3333-------------------1111-----------------------------1111 EDGKVEVIGPDIDSVEPGGRLPIGIVVDIYGRKMQEDFEPVLERRIHYFTNYGEGFWHTA 2222------3333--------------------333333333333-3333-2222---- QRDLTWVRISKEAFAKGARLKHLGQLLYAKFKQEFPSIVDRVQVTIYTDEQKVLELREIA ------------3333-------------1111--------------------------- RKKYAERDARLRELSDEAVDTYYSCLLCQSFAPTHVCIVSPERVGLCGAISWLDAKAAYE ----------33333333-------1111--1111----1111----------------- INPNGPNQPIPKEGLIDPVKGQWESFNEYIYKNSQRTIERMNLYTIMEYPMTSCGCFEAI -1111----------------------------%%%%----------------------- MAYLPELNGFMIVNREHSGMTPIGMTFSTLAGMVGGGTQTPGFMGIGKSYIGSRKFVKAD ---3333------1111---1111---------------2222---3333--11113333 GGLARVVWMPKDLKEQLRSIIEERAEEEGLGRDFIDKIADETVGTTVDEVLPFLEEKGHP -3333-------------------------11111111-3333--3333---------33 ALSMEPLL 33------ >PECTATE LYASE; SWP:P0C1A6; PDB:1RU4A; ADCSSDLTSGISTKRIYYVAPNGNSSNNGSSFNAPMSFSAAMAAVNPGELILLKPGTYTI --1111-iiii--------11111111---1111-----------2222----------- PYTQGKGNTITFNKSGKDGAPIYVAAANCGRAVFDFSFPDSQWVQASYGFYVTGDYWYFK --2222----------2222------%%%%--------1111-2222------------- GVEVTRAGYQGAYVIGSHNTFENTAFHHNRNTGLEINNGGSYNTVINSDAYRNYDPKKNG ------------------------------------iiii--------------3333-- SMADGFGPKQKQGPGNRFVGCRAWENSDDGFDLFDSPQKVVIENSWAFRNGINYWNDSAF 
--------!!!!--------------------------------------------1111 AGNGNGFKLGGNQAVGNHRITRSVAFGNVSKGFDQNNNAGGVTVINNTSYKNGINYGFGS ----------%%%%--------------------%%%%---------------------- NVQSGQKHYFRNNVSLSASVTVSNADAKSNSWDTGPAASASDFVSLDTSLATVSRDNDGT --2222-----------------------1111-----3333-----3333----1111- LPETSLFRLSANSKLINAGTKESNISYSGSAPDLGAFERN ----2222-1111-2222---2222--------------- >PUTATIVE N-TYPE ATP PYROP; SWP:Q8U2K6; PDB:1RU8A; GLADVAVLYSGGKDSNYALYWAIKNRFSVKFLVTVSENEESYYTINANLTDLQARALGIP ----------------------1111------------------1111------------ LVKGFTQGEKEKEVEDLKRVLSGLKIQGIVAGASKYQRKRIEKVAKELGLEVYTPAWGRD ----------3333------1111---------------------1111----------- AKEYRELLNLGFKIVVGVSAYGLDESWLGRILDESALEELITLNEKYKVHVAGEGGEFET 3333---------------11113333----------------------1111------- FVLDPLFKYKIVVDKAKKVWEPCTSSGKLIIEEAHLESKLEH --------------------3333------------------ >IMMUNOGLOBULIN 13G5, LIGH; SWP:NA; PDB:1RURH; EVQLEESGPELVRPGTSVKISCKASGYTFTNYWLGWVKQRPGHGFEWIGDIYPGGVYTTN ------------2222-----------1111--------2222--------1111----- NEKFRGKAILTADTSSSTAYMQLSSLTSEDSAVYFCARAGGYYTGGDYWGQGTSVTVSSA 3333---------1111---------3333------------------------------ KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ----------------------------------------iiii---------------- YTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVP ------------------------3333---------- >IMMUNOGLOBULIN 13G5, LIGH; SWP:NA; PDB:1RURL; DIVLTQAAFSNPVTLGASASISCRSSKSLLNSNGIIHMYWYLQKPGQSPQLLIYQMSKLA -------------2222--------------------------2222------------2 SGAPDRFSGSGSGTDFTLRISRVEAEDVGVYYCAQNLELPYTFGGGTKLEIKRADAAPTV 222--------------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDTKDSTYSM -----3333-------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRN ------3333------------1111----------- >LIM domain-binding protei; SWP:P70662; PDB:1RUTX; SWKRCAGCGGKIADRFLLYAMDSYWHSRCLKCSSCQAQLGDIGTSSYTKSGMILCRNDYI 
-------------------%%%%--3333--------3333-------iiii-------- RLFGNSGACSACGQSIPASELVMRAQGNVYHLKCFTCSTCRNRLVPGDRFHYINGSLFCE ----------------1111----iiii--1111----------2222----iiii--11 HDRPTALIGDVMVVGEPTLMGGEFGDEDERLITRLEN 11-3333-----------3333---1111-------- >MYOSIN-3 ISOFORM; SWP:P36006; PDB:1RUWA; KDPKFEAAYDFPGSGSSSELPLKKGDIVFISRDEPSGWSLAKLLDGSKEGWVPTAYMTPY ---------------1111---2222-------3333-----1111------3333---- KDTRNTVPV --------- >Hemagglutinin; SWP:Q9WFZ1; PDB:1RUZH; DTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCKLKGIAPLQLGKCNIAGW -----------------1111------------------------------!!!!----- LLGNPECDLLLTASSWSYIVETSNSENGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKT ---33333333---------------------------------1111------------ SSWPNHETTKGVTAACSYAGASSFYRNLLWLTKKGSSYPKLSKSYVNNKGKEVLVLWGVH ------------------------1111-------------------------------- HPPTGTDQQSLYQNADAYVSVGSSKYNRRFTPEIAARPKVRDQAGRMNYYWTLLEPGDTI ----------------------1111---------------------------------- TFEATGNLIAPWYAFALNRGSGSGIITSDAPVHDCNTKCQTPHGAINSSLPFQNIHPVTI ----------------------------------------1111---------------- GECPKYVRSTKLRMATGLRNIPAR ------------------------ >Hemagglutinin; SWP:Q9WFZ1; PDB:1RUZI; GLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAIDGITNKVNSVIEKMN 11112222-------------------3333----------------------------- TQFTAVGKEFNNLERRIENLNKKVDDGFLDIWTYNAELLVLLENERTLDFHDSNVRNLYE ------------------------------------------------------------ KVKSQLKNNAKEIGNGCFEFYHKCDDACMESVRNGTYDYP -1111--------%%%%-------3333------------ >Hemagglutinin; SWP:Q82500; PDB:1RV0H; DTLCIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLGGIAPLQLGKCNIAGW -----------------3333----------------------iiii----!!!!----- LLGNPECDLLLTVSSWSYIVETSNSDNGTCYPGDFIDYEELREQLSSVSSFEKFEIFPKT ---11113333---------------------------------1111------------ SSWPNHETTRGVTAACPYAGASSFYRNLLWLVKKGNSYPKLSKSYVNNKGKEVLVLWGVH --1111------3333--------1111-----%%%%----------------------- HPPTSTDQQSLYQNADAYVSVGSSKYDRRFTPEIAARPKVRGQAGRMNYYWTLLEPGDTI 
---------------------------------------%%%%-----------2222-- TFEATGNLVAPRYAFALNRGSGSGIITSDAPVHDCDTKCQTPHGAINSSLPFQNIHPVTI ------------------------------------------------------------ GECPKYVKSTKLRMATGLRNIPAR ------------------------ >Hemagglutinin [Precursor]; SWP:P26562; PDB:1RV0I; GLFGAIAGFIEGGWTGLIDGWYGYHHQNEQGSGYAADQKSTQNAIDGITNKVNSVIEKMN ----2222-----3333----------3333----------------------------- TQFTAVGKEFNNLERRIKNLNKKVDDGFLDVWTYNAELLVLLENERTLDFHDSNVKNLYE ------------------------------------------------------------ KARSQLRNNAKEIGNGCFEFYHKCDDACMESVRNGTYDYP -------------%%%%--------3333----------- >SERINE HYDROXYMETHYLTRANS; SWP:P07511; PDB:1RV3A; WSSHEQMLAQPLKDSDAEVYDIIKKESNRQRVGLELIASENFASRAVLEALGSCLNNKYS --3333----3333----------------------1111--------3333-3333--- LGYPGQRYYGGTEHIDELETLCQKRALQAYGLDPQCWGVNVQPYSGSPANFAVYTALVEP --2222---------------------1111-1111----------------------22 HGRIMGLDLPDGGHLTHGFMTDKKKISATSIFFESMAYKVNPDTGYIDYDRLEENARLFH 22------1111-1111---1111---3333----------------------------- PKLIIAGTSCYSRNLDYGRLRKIADENGAYLMADMAHISGLVVAGVVPSPFEHCHVVTTT ------------------------1111------3333----------1111-------- THKTLRGCRAGMIFYRRGVRSEILYNLESLINSAVFPGLQGGPHNHAIAGVAVALKQAMT -!!!!------------------------------------------------------- PEFKEYQRQVVANCRALSAALVELGYKIVTGGSDNHLILVDLRSKGTDGGRAEKVLEACS ---------------------1111--2222---------3333---3333--------- IACNKNTCPGDKSALRPSGLRLGTPALTSRGLLEKDFQKVAHFIHRGIELTVQIQDDTGP ------------1111--------3333----3333----------------1111---- RATLKEFKEKLAGDEKHQRAVRALRQEVESFAALFPLPGLPGF ------------------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9K0A8; PDB:1RV9A; KNFLTADWPAPANVKTLITTRNGGVSQGAYQSLNLGTHVGDNPEAVRRNREIVQQQVGLP ----------1111------------!!!!------------------------------ VAYLNQIHSTVVVNAAEALGGTPDADASVDDTGKVACAVMTADCLPVLFCDRAGTAVAAA -------------3333---------------------------------1111------ HAGWRGLAGGVLQNTIAAMKVPPVEMMAYLGPAISADAFEVGQDVFDAFCTPMPEAATAF 
----------------3333-3333---------3333-----------3333---1111 EGIGSGKFLADLYALARLILKREGVGGVYGGTHCTVLERDTFFSYRRDGATGRMASLIWL ---%%%%--------------------------33331111--3333------------- DG -- >FRUCTOSE-1,6-BISPHOSPHATE; SWP:Q9RHA2; PDB:1RVGA; MLVTGLEILKKAREEGYGVGAFNVNNMEFLQAVLEAAEEQRSPVILALSEGAMKYGGRAL ------------------------------------------------------------ TLMAVELAKEARVPVAVHLDHGSSYESVLRALRAGFTSVMIDKSHEDFETNVRETRRVVE ------------------------------------------1111-------------- AAHAVGVTVEAELGRLAGIEEKDALLTNPEEARIFMERTGADYLAVAIGTSHGAYKGKGR ------------------------------------------------------------ PFIDHARLERIARLVPAPLVLHGASAVPPELVERFRASGGEIGEAAGIHPEDIKKAISLG -----------------------------------1111---------3333-------- IAKINTDTDLRLAFTALIREALNKNPKEFDPRKYLGPAREAVKEVVKSRMELFGSVGRA ------------------------1111--------------------------2222- >ISOMERASE/LACTONIZING ENZ; SWP:Q8UAC1; PDB:1RVKA; IITDVEVRVFRTTTRRHSDSAGHAHPGPAHQVEQALTVRTEDGQEGHSFTAPEIVRPHVI ------------------1111-----------------1111-------3333-3333- EKFVKKVLIGEDHRDRERLWQDLAHWQRGSAAQLTDRTLAVVDCALWDLAGRSLGQPVYK ---333322221111-----------1111--------------------------3333 LIGGYRDKVLAYGSICGDELEGGLATPEDYGRFAETLVKRGYKGIKLHTWPPVSWAPDVK -------------------2222--3333------------------------------- DLKACAAVREAVGPDIRLIDAFHWYSRTDALALGRGLEKLGFDWIEEPDEQSLSSYKWLS ------------------------------------3333--------1111-------1 DNLDIPVVGPESAAGKHWHRAEWIKAGACDILRTGVNDVGGITPALKTHLAEAFGECEVH 111-------------------------------3333-------------1111----- GNTANLHVVAATKNCRWYERGLLHPFLEYDDGHDYLKSLSDPDRDGFVHVPDRPGLGEDI --------1111-----------11113333-1111------1111-------!!!!--- DFTFIDNNRV ---------- >RIBOFLAVIN SYNTHASE; SWP:P17621; PDB:1RVVA; MNIIQGNLVGTGLKIGIVVGRFNDFITSKLLSGAEDALLRHGVDTNDIDVAWVPGAFEIP ---------2222---------3333------------1111-3333-------3333-- FAAKKMAETKKYDAIITLGTVIRGATTHYDYVCNEAAKGIAQAANTTGVPVIFGIVTTEN ------------------------------------------------------------ IEQAIERAGTKAGNKGVDCAVSAIEMANLNRSFE 
----1111--------------------3333-- >HEMAGGLUTININ; SWP:Q82766; PDB:1RVXA; DTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLEDSHNGKLCRLKGIAPLQLGKCNIAGW -----------------3333----------------------iiii----!!!!----- LLGNPECDPLLPVRSWSYIVETPNSENGICYPGDFIDYEELREQLSSVSSFERFEIFPKE ---11113333----------1111-------------------1111------------ SSWPNHNTNGVTAACSHEGKSSFYRNLLWLTEKEGSYPKLKNSYVNKKGKEVLVLWGIHH --1111-----3333-%%%%---1111-----%%%%------------------------ PPNSKEQQNLYQNENAYVSVVTSNYNRRFTPEIAERPKVRDQAGRMNYYWTLLKPGDTII --------------------------------------%%%%-----------2222--- FEANGNLIAPMYAFALRRGFGSGIITSNASMHECNTKCQTPLGAINSSLPYQNIHPVTIG ---------------------------------------1111----------------- ECPKYVRSAKLRMVTGLRNIPAR ----------------------- >Hemagglutinin [Precursor]; SWP:P03452; PDB:1RVXB; GLFGAIAGFIEGGWTGMIDGWYGYHHQNEQGSGYAADQKSTQNAINGITNKVNSVIEKMN 11112222-----3333----------1111----------------------------- IQFTAVGKEFNKLEKRMENLNNKVDDGFLDIWTYNAELLVLLENERTLDFHDSNVKNLYE ----------1111---------------------------------------------- KVKSQLKNNAKEIGNGCFEFYHKCDNECMESVRNGTYDYP -3333-------------------3333---1111----- >CONSERVED HYPOTHETICAL PR; SWP:Q8Z4J1; PDB:1RW0A; MNALIVPQWPLPKGVAACSSTRIGGVSLSPYDSLNLGAHCGDNPEHVEENRKRLFAAGNL 1111-------------------------------------------------------- PSKPVWLEQVHGKNVLRLTGEPYASKRADASYSNTPGTVCAVMTADCLPVLFCNREGTEV ----------------------------------2222---------------1111--- AAAHAGWRGLCEGVLEETVTCFADKPENIIAWLGPAIGPAAFEVGPEVRDAFLAKDAQAD ------------------3333--3333---------1111----------------333 SAFLPHGEKFLADIYQLARQRLANTGVEHVYGGDRCTFSESETFFSYRRDKTTGRMASFI 3----!!!!-------------1111---------3333------3333----------- WLI --- >CONSERVED HYPOTHETICAL PR; SWP:Q9HXX5; PDB:1RW1A; TYVLYGIKACDTKKARTWLDEHKVAYDFHDYKAVGIDREHLRRWCAEHGWQTVLNRAGTT -------------------1111------1111---------------3333--333333 FRKLDEAQKADLDEAKAIELLAQPSIKRPVLELGGRTLVGFKPDAYAAALA 33--3333------------------------------------------- >ATP-DEPENDENT DNA HELICAS; SWP:P13010; PDB:1RW2A; 
MHHHHHHKLKTEQGGAHFSVSSLAEGSVTSVGSVNPAENFRVLVKQKKASFEEASNQLIN ----------------------------------11113333------------------ HIEQFLDTNETPYFMKSIDCIRAFREEAIKFSEEQRFNNFLKALQEKVEIKQLNHFWEIV ----3333--1111------------3333---3333----------------------- VQDGITLITKEEASGSSVTAEEAKKFLAPKDK -------------------------------- >AMYLOID BETA A4 PROTEIN; SWP:Q95241; PDB:1RW6A; AVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQ --33333333------------------------------------1111---------- HFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNM ---------------------------------------------3333----------- LKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPA ----------------------------------------------------3333---- VAEEIQDEVDEL --------3333 >YDR533CP; SWP:Q04432; PDB:1RW7A; APKKVLLALTSYNDVFYSDGAKTGVFVVEALHPFNTFRKEGFEVDFVSETGKFGWDEHSL ----------------1111-----------------1111------1111----3333- AKDFLNGQDETDFKNKDSDFNKTLAKIKTPKEVNADDYQIFFASAGHGTLFDYPKAKDLQ 1111----------1111----3333--3333-1111--------3333--3333----- DIASEIYANGGVVAAVCHGPAIFDGLTDKKTGRPLIEGKSITGFTDVGETILGVDSILKA ------1111-----!!!!3333----------1111---------------------11 KNLATVEDVAKKYGAKYLAPVGPWDDYSITDGRLVTGVNPASAHSTAVRSIDALK 11-------------------1111-------------3333------------- >CHONDROITIN AC LYASE; SWP:P84141; PDB:1RWHA; PGAAEFAALRNRWVDQITGRNVIQAGDPDFAKAITALNNKAADSLAKLDAAAGRTSVFTD ------------------1111-2222-----------------1111---------111 LSLAKDAEMVTTYTRLSQLATAWATPTAAVFGDAAVLAAIKAGLADANTLCYNDRKEEVG 11111-------------------2222-2222--------------------------- NWWSWEIGVPRALADAMVLLHAELSAAERTAYCAAIDHFVPDPWLQFPPKRGKITSVGAN 3333---------------3333------------------1111--3333--------- RVDLCQGIIIRSLAGEDPTKLNHAVAGLSQVWQYVTSGDGIFRDGSFIQHSTTPYTGSYG ---------------------------3333----------1111--------------- VVLLTGLSKLFSLLGGTAFEVSDPTRSIFFDAVEGSFAPVMINGAMADAVRGRSISREAN -------------2222-------------------3333-iiii-3333--3333---- TGYDLGASAIEAILLLARAMDPATAARWRGLCAGWIARDTYRPILNSASVPRTALVKQLE 
--------------3333-----------------1111---3333-------------- ATGVAPVAEATGHKLFPAMDRTMHRGPGWALSLALSSNRIAWYECGNGENNRGYHTGSGM ---------------3333------2222-------1111-----iiii1111-1111-- TYFYTSDLGQYDDAFWATANYNRLPGITVDTTPLPDKVEGQWGAAVPADEWSGATALGEV ------1111---1111--11112222-------2222--%%%%------------!!!! AAVGQHLVGPGRTGLTARKSWFVSGDVTVCLGADISTASGAKVETIVDHRNLHQGSNTLT ---------------------------------------------------!!!!----- TAAGTIAGTAGTVEVLGDGRWVHLEGFGGYAMLDDSPLHVLRETRSGSWSGVNINGSATV 1111----2222-----------2222--------------------3333-1111---- QQRNFATLYVNHGVGPVAGSYAYMVAPGASVDLTRKLLEGNKYSVIRNDATAQSVEFKTA ---------------------------------1111-%%%%------3333-----111 KTTAATFWKPGMAGDLGASGPACVVFSRHGNELSLAVSEPTQKAAGLTLTLPEGTWSSVL 1-----------!!!!------------!!!!------3333------------------ EGAGTLGTDADGRSTLTLDTTGLSGKTKLIKLKR --------1111-------2222----------- >Serine/threonine-protein ; SWP:O05871; PDB:1RWIB; QTVLPFTGIDFRLSPSGVAVDSAGNVYVTSEGMYGRVVKLATTVLPFNGLYQPQGLAVDG --------------------1111----------------------------------11 AGTVYVTDFNNRVVTLAAGSNNQTVLPFDGLNYPEGLAVDTQGAVYVADRGNNRVVKLAA 11-------------------------------------1111------1111-----22 GSKTQTVLPFTGLNDPDGVAVDNSGNVYVTDTDNNRVVKLEAESNNQVVLPFTDITAPWG 22-------------------1111-----1111-------------------------- IAVDEAGTVYVTEHNTNQVVKLLAGSTTSTVLPFTGLNTPLAVAVDSDRTVYVADRGNDR ---1111-----1111------2222-------------------1111-----3333-- VVKLTSLEHHHHHH ------1111---- >CYTOCHROME C FAMILY PROTE; SWP:Q74BP5; PDB:1RWJA; KGMTPPKTVNFKMKGVADAAFSHEFHLGMYKCNECHTKLFAYKAGAKRFTMADMDKGKSC !!!!-----------------33331111-1111--------2222---33331111-33 GACHNGKDAFSSASDCGKCHP 33--------11113333--- >FILAMENTOUS HEMAGGLUTININ; SWP:P12255; PDB:1RWRA; QGLVPQGQTQVLQGGNKVPVVNIADPNSGGVSHNKFQQFNVANPGVVFNNGLTDGVSRIG -----!!!!----!!!!---------1111------------------------------ GALTKNPNLTRQASAILAEVTDTSPSRLAGTLEVYGKGADLIIANPNGISVNGLSTLNAS -----3333-----------------------------------3333------------ 
NLTLTTGRPSVNGGRIGLDVQQGTVTIERGGVNATGLGYFDVVARLVKLQGAVSSKQGKP -----------!!!!------------3333--2222------------------2222- LADIAVVAGANRYDHATRRATPIAAGARGAAAGAYAIDGTAAGAMYGKHITLVSSDSGLG --------------1111------------2222-----3333------------2222- VRQLGSLSSPSAITVSSQGEIALGDATVQRGPLSLKGAGVVSAGKLASGGGAVNVAG --------------------------------------------------------- >HYPOTHETICAL PROTEIN PF10; SWP:Q8U1Z3; PDB:1RWSA; KMIKVKVIGRNIEKEIEWREGMKVRDILRAVGFNTESAIAKVNGKVVLEDDEVKDGDFVE ----------------------------1111---------iiii--1111--------- VIPVVSGG -------- >HYPOTHETICAL UPF0250 PROT; SWP:P30977; PDB:1RWUA; MKTKLNELLEFPTPFTYKVMGQALPELVDQVVEVVQRHAPGDYTPTVKPSSKGNYHSVSI -----------------------1111------------------------3333----- TINATHIEQVETLYEELGKIDIVRMVL -----3333------------------ >PARVALBUMIN ALPHA; SWP:P02625; PDB:1RWYA; SMTDLLSAEDIKKAIGAFTAADSFDHKKFFQMVGLKKKSADDVKKVFHILDKDKSGFIEE 1111----------1111-2222----------1111-------------1111------ DELGSILKGFSSDARDLSAKETKTLMAAGDKDGDGKIGVEEFSTLVAES ------33331111---------------1111---------------- >DNA POLYMERASE SLIDING CL; SWP:O29912; PDB:1RWZA; MIDVIMTGELLKTVTRAIVALVSEARIHFLEKGLHSRAVDPANVAMVIVDIPKDSFEVYN -----------------3333--------1111------1111--------1111----- IDEEKTIGVDMDRIFDISKSISTKDLVELIVEDESTLKVKFGSVEYKVALIDPSAIRKEP -----------------11111111---------------!!!!-------3333----- RIPELELPAKIVMDAGEFKKAIAAADKISDQVIFRSDKEGFRIEAKGDVDSIVFHMTETE ------------------------1111--------1111------1111------3333 LIEFNGGEARSMFSVDYLKEFCKVAGSGDLLTIHLGTNYPVRLVFELVGGRAKVEYILAP ---------------------11112222------------------%%%%--------- RIES ---- >Acyl-CoA dehydrogenase fa; SWP:Q9UKU7; PDB:1RX0A; TSCIDPSMGLNEEQKEFQKVAFDFAAREMAPNMAEWDQKELFPVDVMRKAAQLGFGGVYI ----1111--------------------3333------------------1111------ QTDVGGSGLSRLDTSVIFEALATGCTSTTAYISIHNMCAWMIDSFGNEEQRHKFCPPLCT 3333----------------3333------------------------------------ MEKFASYCLTEPGSGSDAASLLTSAKKQGDHYILNGSKAFISGAGESDIYVVMCRTGGPG 
----------1111--1111-------!!!!---------2222---------------3 PKGISCIVVEKGTPGLSFGKKEKKVGWNSQPTRAVIFEDCAVPVANRIGSEGQGFLIAVR 333---------2222----------1111------------1111---2222------- GLNGGRINIASCSLGAAHASVILTRDHLNVRKQFGEPLASNQYLQFTLADMATRLVAARL --------------------------------iiii3333-------------------- MVRNAAVALQEERKDAVALCSMAKLFATDECFAICNQALQMHGGYGYLKDYAVQQYVRDS --------11111111--------------------------3333-11113333----- RVHQILEGSNEVMRILISRSLLQE -1111---------------1111 >PROTEIN TYROSINE PHOSPHAT; SWP:NA; PDB:1RXDA; PVEVTYKNRFLITHNPTNATLNKFIEELKKYGVTTIVRVCEATYDTTLVEKEGIHVLDWP ----------------3333-----------------------------1111------- FGAPPSNQIVDDWLSLVKIKFREEPGCCIAVHCVAGLGRAPVLVALALIEGGKYEDAVQF -----------------------2222---------!!!!--------1111-------- IRQKRRGAFNSKQLLYLEKYRPKRLRF ----1111--------1111------- >AFIMBRIAL ADHESIN AFA-III; SWP:Q57254; PDB:1RXLA; EECQVRVGDLTVAKTRGQLTDAAPIGPVTVQALGCNARQVALKADTDNFEQGKFFLISDN --------------1111----------------1111------3333---------111 NRDKLYVNIRPMDNSAWTTDNGVFYKNDVGSWGGTIGIYVDGQQTNTPPGNYTLTLTGGY 1--------------------------------------2222----------------- WAKDNKQGFTPSGTTGTTKLTVT ----------------------- >YFIT; SWP:O31562; PDB:1RXQA; NLSYPIGEYKPRESISKEQKDKWIQVLEEVPAKLKQAVEVTDSQLDTPYRDGGWTVRQVV 3333------------------------------------3333-----2222------- HHLADSHNSYIRFKLSLTEETPAIRPYDEKAWSELKDSKTADPSGSLALLQELHGRWTAL ----------------------------------3333---------------------- LRTLTDQQFKRGFYHPDTKEIITLENALGLYVWHSHHHIAHITELSRRGWS 11113333------------------------------------------- >GLYCYLPEPTIDE N-TETRADECA; SWP:P30419; PDB:1RXTA; GFTWDALDLGDRGVLKELYTLLNENYVEDDDNMFRFDYSPEFLLWALRPPGWLPQWHCGV --------------------3333--------------33333333------1111---- RVVSSRKLVGFISAIPANIHIYDTEKKMVEINFLCVHKKLRSKRVAPVLIREITRRVHLE --------------------------------------------3333------------ GIFQAVYTAGVVLPKPVGTCRYWHRSLNPRKLIEVKFSHLSRNMTMQRTMKLYRLPETPK ---------------------------33333333------------------------- 
TAGLRPMETKDIPVVHQLLTRYLKQFHLTPVMSQEEVEHWFYPQENIIDTFVVENANGEV -------1111-----------------------1111---------------------- TDFLSFYTLPSTIMNHPTHKSLKAAYSFYNVHTQTPLLDLMSDALVLAKMKGFDVFNALD ------------------------------------1111--------1111-------- LMENKTFLEKLKFGIGDGNLQYYLYNWKCPSMGAEKVGIVLQ --3333-1111----------------------1111----- >FLAP STRUCTURE-SPECIFIC E; SWP:O29975; PDB:1RXWA; ADIGDLFEREEVELEYFSGKKIAVDAFNTLYQFISIIRQPDGTPLKDSQGRITSHLSGIL 3333--------33332222------------------1111----1111---------- YRVSNMVEVGIRPVFVFDGEPPEFKKAEIEERKKRRAEAEEMWIAALQAGDKDAKKYAQA ---------------------1111-------------------------1111----11 AGRVDEYIVDSAKTLLSYMGIPFVDAPSEGEAQAAYMAAKGDVEYTGSQDYDSLLFGSPR 11-----------------------------------1111---------3333------ LARNLAIDVKPEIIILESNLKRLGLTREQLIDIAILVGTDYNEGVKGVGVKKALNYIKTY --------------------------------------------2222------------ GDIFRALKALKVVEEIRNFFLNPPVTDDYRIEFREPDFEKAIEFLCEEHDFSRERVEKAL -----------------------------------------------------------3 EKLKA 333-- >URIDINE PHOSPHORYLASE; SWP:P12758; PDB:1RXYA; SDVFHLGLTKNDLQGATLAIVPGDPDRVEKIAALMDKPVKLASHREFTTWRAELDGKPVI --------3333iiii-------3333----3333--------!!!!------iiii--- VCSTGIGGPSTSIAVEELAQLGIRTFLRIGTTGAIQPHINVGDVLVTTASVRLDGASLHF ------------------1111-------------11112222-----------3333-- APLEFPAVADFECTTALVEAAKSIGATTHVGVTASSDTFYPGQERYDTYSGRVVRHFKGS -3333----------------1111--------------3333----3333--3333--- MEEWQAMGVMNYEMESATLLTMCASQGLRAGMVAGVIVNRTQQEIPNAETMKQTESHAVK ----1111------3333----1111------------1111------------------ IVVEAARRLL -----1111- >ACETYL-COENZYME A SYNTHET; SWP:Q01574; PDB:1RY2A; QDYQRLHKESIEDPAKFFGSKATQFLNWSKPFDKVFIPDPKTGRPSFQNNAWFLNGQLNA -3333--------3333-----------------------------------2222---- CYNCVDRHALKTPNKKAIIFEGDEPGQGYSITYKELLEEVCQVAQVLTYSMGVRKGDTVA -----1111--1111--------------------------------------------- VYMPMVPEAIITLLAISRIGAIHSVVFAGFSSNSLRDRINDGDSKVVITTDESNRGGKVI -----3333---------------------------------------------iiii-- 
ETKRIVDDALRETPGVRHVLVYRKTNNPSVAFHAPRDLDWATEKKKYKTYYPCTPVDSED ------------1111----------------------33333333----------1111 PLFLLYTSGSTGAPKGVQHSTAGYLLGALLTMRYTFDTHQEDVFFTAGDIGWITGHTYVV ------------------------------------------------3333-------- YGPLLYGCATLVFEGTPAYPNYSRYWDIIDEHKVTQFYVAPTALRLLKRAGDSYIENHSL ---------------1111-1111---------------3333--1111---1111---- KSLRCLGSVGEPIAAEVWEWYSEKIGKNEIPIVDTYWQTESGSHLVTPLAGGVTPMKPGS -------------------------------------3333------------------- ASFPFFGIDAVVLDPNTGEELNTSHAEGVLAVKAAWPSFARTIWKNHDRYLDTYLNPYPG ----2222---------------------------1111---2222-----------222 YYFTGDGAAKDKDGYIWILGRVDDVVNVSGHRLSTAEIEAAIIEDPIVAECAVVGFNDDL 2---------1111------------------------------3333------------ TGQAVAAFVVLKLQDIKKHLVFTVRKDIGPFAAPKLIILVDDLPKTRSGKIMRRILRKIL ----------------------------1111-------------3333--3333----- ANPGIVRHLIDSVKL ------1111----- >INTERNAL KINESIN; SWP:Q8I4Y0; PDB:1RY6A; MIKVVVRKRPLSELEKKKKDSDIITVKNNCTLYIDEPRYKVDMTKYIERHEFIVDKVFDD ----------------------------------------------------------11 TVDNFTVYENTIKPLIIDLYENGCVCSCFAYGQTGSGKTYTMLGSQPYGQSDTPGIFQYA 11------------------------------2222----------2222---------- AGDIFTFLNIYDKDNTKGIFISFYEIYCGKLYDLLQKKEVVVKDLKILRVLTKEELILKM --------------------------iiii----------3333---------------- IDGVLLRKIGVNSQNDESSRSHAILNIDLKDINKNTSLGKIAFIDLAGSERGADTVSQNK ---------1111--3333------------3333------------1111--1111--- QTQTDGANINRSLLALKECIRAMDSDKNHIPFRDSELTKVLRDIFVGKSKSIMIANISPT ----------------------------------------3333---------------3 ISCCEQTLNTLRYSSRVKN 333---------------- >Fibroblast growth factor ; SWP:P22607; PDB:1RY7B; APYWTRPERMDKKLLAVPAANTVRFRCPAAGNPTPSISWLKNGREFRGEHRIGGIKLRHQ -----3333--------------------------------------------------- QWSLVMESVVPSDRGNYTCVVENKFGSIRQTYTLDVLERSPHRPILQAGLPANQTAVLGS ---------1111---------1111---------------------------------- DVEFHCKVYSDAQPHIQWLKHVEVNGSKVGPDGTPYVTVLKTAGANTTDKELEVLSLHNV -----------------------------1111--------------------------- 
TFEDAGEYTCLAGNSIGFSHHSAWLVVLPAEEE --------------------------------- >SURFACE PRESENTATION OF A; SWP:P35530; PDB:1RY9A; MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNEQVMLWANFDA --------------------1111-----------------------%%%%--------- PSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILH ---------------1111-1111--------------------3333------------ EFYQRMEILNGVL ---------1111 >GDP-MANNOSE MANNOSYL HYDR; SWP:NA; PDB:1RYAA; MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPGGRVQKDETLE -----------------------------------------2222--------2222--- AAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFTTHYVVLGFRFRVSEEELLLP ---------------3333---------------------------------3333---- DEQHDDYRWLTSDALLASDNVHANSRAYFLAEKRTGVPGL -----------------11113333333333332222--- >CRS2; SWP:Q9M5P4; PDB:1RYBA; YTPWLIAGLGNPGNKYYGTRHNVGFEMVDRIAAEEGITMNTIQSKSLLGIGSIGEVPVLV ------------3333--3333----------1111------%%%%------!!!!---- VKPQSYMNYSGEAIGPLAAYYQVPLRHILLIYDDTSLPNGVLRLQKKGGHGRHNGLQNVI -----33333333----------3333----------2222--------%%%%------- EHLDGRREFPRLSIGIGSPPGKMDPRAFLLQKFSSEERVQIDTALEQGVDAVRTLVLKGE --iiii-------------!!!!----------3333----------------------- RFNLVQ ------ >GLYCINE OXIDASE; SWP:O31616; PDB:1RYIA; MKRHYEAVVIGGGIIGSAIAYYLAKENKNTALFESGTMGGRTTSAAAGMLGAHAECEERD -----------------------1111---------22223333------1111------ AFFDFAMHSQRLYKGLGEELYALSGVDIRQHNGGMFKLAFSEEDVLQLRQMDDLDSVSWY ----------3333------------------------------------3333------ SKEEVLEKEPYASGDIFGASFIQDDVHVEPYFVCKAYVKAAKMLGAEIFEHTPVLHVERD --------1111-----------------------------1111--------------- GEALFIKTPSGDVWANHVVVASGVWSGMFFKQLGLNNAFLPVKGECLSVWNDDIPLTKTL ---------------------!!!!-----1111-------------------------- YHDHCYIVPRKSGRLVVGATMKPGDWSETPDLGGLESVMKKAKTMLPAIQNMKVDRFWAG -%%%%----1111-----------------------------------1111-------- LRPGTKDGKPYIGRHPEDSRILFAAGHFRNGILLAPATGALISDLIMNKEVNQDWLHAFR ----1111------1111-----------3333-----------1111------------ IDRK ---- >UNKNOWN; SWP:O27775; PDB:1RYJA; 
MVIGMKFTVITDDGKKILESGAPRRIKDVLGELEIPIETVVVKKNGQIVIDEEEIFDGDI ----------1111----------3333-------3333----------------2222- IEVIRVIYGG ---------- ------------------------------------------------------------ --------- >HYPOTHETICAL PROTEIN YFBM; SWP:P76483; PDB:1RYLA; MIGYFAEIDSEKINQLLESMDNIHDTLSGLRRLDIDKRWDFLHFGLTGTSAFDPAKNDPL ------------------------1111------!!!!-----------1111------- SRAVLGEHSLEDDGFLGLTWNQELAATIDRLESLDRNELRKQFSIKRLNEMEIYPGVTFS -------------------3333-------3333-------------------------1 EELEGQLFASIMLDMEKLISAYRRMLRQGNHALTVIV 111----------------------1111-------- >SEROTRANSFERRIN; SWP:P02787; PDB:1RYOA; KTVRWCAVSEHEATKCQSFRDHMKSVIPSDGPSVACVKKASYLDCIRAIAANEADAVTLD ---------------------------1111-----------------1111-------- AGLVYDAYLAPNNLKPVVAEFYGSKEDPQTFYYAVAVVKKDSGFQMNQLRGKKSCHTGLG -----------------------3333-----------2222--11112222-----222 RSAGWNIPIGLLYCDLPEPRKPLEKAVANFFSGSCAPCADGTDFPQLCQLCPGCGCSTLN 21111------3333--------------------2222333333331111-----3333 QYFGYSGAFKCLKDGAGDVAFVKHSTIFENLANKADRDQYELLCLDNTRKPVDEYKDCHL -----------3333-------11111111--33331111---1111---1111------ AQVPSHTVVARSMGGKEDLIWELLNQAQEHFGKDKSKEFQLFSSPHGKDLLFKDSAHGFL -------------------------------------------1111-----1111---- KVPPRMDAKMYLGYEYVTAIRNLR --2222------------------ >20S PROTEASOME; SWP:P21243; PDB:1RYPA; AGYDRHITIFSPEGRLYQVEYAFKATNQTNINSLAVRGKDCTVVISQKKVPDKLLDPTTV 1111------1111--------33331111---------------------11113333- SYIFCISRTIGMVVNGPIPDARNAALRAKAEAAEFRYKYGYDMPCDVLAKRMANLSQIYT ------1111---------------------------------------------3333- QRAYMRPLGVILTFVSVDEELGPSIYKTDPAGYYVGYKATATGPKQQEITTNLENHFKKS -3333-----------------------1111---------------------------- KIDHINEESWEKVVEFAITHMIDALGTEFSKNDLEVGVATKDKFFTLSAENIEERLVAIA --------3333-----------------1111------2222----------------- EQD --- >Proteasome component Y7; SWP:P23639; PDB:1RYPB; MTDRYSFSLTTFSPSGKLGQIDYALTAVKQGVTSLGIKATNGVVIATEKKSSSPLAMSET ------------1111-----------1111---------------------11113333 
LSKVSLLTPDIGAVYSGMGPDYRVLVDKSRKVAHTSYKRIYGEYPPTKLLVSEVAKIMQE ------------------------------------1111-------------------- ATQSGGVRPFGVSLLIAGHDEFNGFSLYQVDPSGSYFPWKATAIGKGSVAAKTFLEKRWN ------------------------------3333-------------------------- DELELEDAIHIALLTLKESVEGEFNGDTIELAIIGDENPDLLGYTGIPTDKGPRFRKLTS ---3333-------3333------1111---------1111------------------- QEINDRLEAL ---1111--- >Proteasome component PRE5; SWP:P40302; PDB:1RYPF; FRNNYDGDTVTFSPTGRLFQVEYALEAIKQGSVTVGLRSNTHAVLVALKRNADELSSYQK 3333---1111-1111-------------------------------------------- KIIKCDEHMGLSLAGLAPDARVLSNYLRQQCNYSSLVFNRKLAVERAGHLLCDKAQKNTQ -----1111---------------------------------------------3333-- SYGGRPYGVGLLIIGYDKSGAHLLEFQPSGNVTELYGTAIGARSQGAKTYLERTLDTFIK ----------------1111------3333----------2222------------3333 IDGNPDELIKAGVEAISQSLRDESLTVDNLSIAIVGKDTPFTIYDGEAVAKYI -------------------------1111----------------1111---- >Proteasome component C1; SWP:P21242; PDB:1RYPG; GTGYDLSNSVFSPDGRNFQVEYAVKAVENGTTSIGIKCNDGVVFAVEKLITSKLLVPQKN --1111-----1111-3333-------------------------------33332222- VKIQVVDRHIGCVYSGLIPDGRHLVNRGREEAASFKKLYKTPIPIPAFADRLGQYVQAHT -------------------------------------------------------3333- LYNSVRPFGVSTIFGGVDKNGAHLYMLEPSGSYWGYKGAATGKGRQSAKAELEKLVDHHP -1111----------------------1111----------1111-------------33 EGLSAREAVKQAAKIIYLAHEDNKEKDFELEISWCSLSETNGLHKFVKGDLLQEAIDFAQ 33----------------3333-----------------iiii----------------- KEIN ---- >Proteasome component PRE3; SWP:P38624; PDB:1RYPH; ASIMAVTFKDGVILGADSRTTGAYIANRVTDKLTRVHDKIWCCRSSADTQAIADIVQYHL -------1111---------!!!!------------------------------------ ELYTSQYGTPSTETAASVFKELCYENKDNLTAGIIVAGYDDKNKGEVYTIPLGGSVHKLP -------------------------3333---------------------3333------ YAIAGGSTFIYGYCDKNFRENMSKEETVDFIKHSLSQAIKWDGSSGGVIRMVVLTAAGVE ----1111111------------------------------1111---------3333-- RLIFYPDEYEQL ----3333---- >Proteasome component PUP1; SWP:P25043; PDB:1RYPI; 
TTIVGVKFNNGVVIAADTRSTQGPIVADKNCAKLHRISPKIWCAGAGTAADTEAVTQLIG ---------------------!!!!------------1111------------------- SNIELHSLYTSREPRVVSALQMLKQHLFKYQGHIGAYLIVAGVDPTGSHLFSIHAHGSTD --------------3333---------1111------------1111------1111--- VGYYLSLGSGSLAAMAVLESHWKQDLTKEEAIKLASDAIQAGIWNDLGSGSNVDVCVMEI -------1111----------------------------------1111---------11 GKDAEYLRNYLTPNVREEKQKSYKFPRGTTAVLKESIVNICD 11-----------------------2222------------- >DNA-DIRECTED RNA POLYMERA; SWP:NA; PDB:1RYQA; EKACRHCHYITSEDRCPVCGSRDLSEEFVIIVDVENSEIAKKIGAKVPGKYAIRVR --------------------------------3333-------------------- >SWI/SNF-related, matrix-a; SWP:O14497; PDB:1RYUA; SSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVK ---------3333----3333--------------------------------------- EIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFA ----3333---------------------------------3333--------------- >PHENOL 2-HYDROXYLASE COMP; SWP:Q9LAG2; PDB:1RZ1A; DDRLFRNAGKFATGVTVITTELNGAVHGTANAFSVSLNPKLVLVSIGEKAKLEKIQQSKK ----------------------------------------------1111---------- YAVNILSQDQKVLSNFAGQLEKPVDVQFEELGGLPVIKDALAQISCQVVNEVQAGDHTLF ------11113333----------------iiii--2222-------------!!!!--- IGEVTDIKITEQDPLLFFSGKYHQLAQ -----------------iiii------ >CONSERVED HYPOTHETICAL PR; SWP:Q81L49; PDB:1RZ2A; IFMDYYENRKVMAEAQNIYEKSPMEEQSQDGEVRKQFKALQQINQEIVGWITMDDTQINY ---3333--------------------------1111-3333-1111-----2222---- PIVQAKDNDYYLFRNYKGEDMRAGSIFMDYRNDVKSQNRNTILYGHRMKDGSMFGSLKKM --------1111--1111--3333----33331111-----------1111!!!!----- LDEEFFMSHRKLYYDTLFEGYDLEVFSVYTTTTDFYYIETDFSSDTEYTSFLEKIQEKSL -----1111--------------------------------------------------- YKTDTTVTAGDQIVTLSTCDAGRLVVHAKLVKRQ -------1111----------------------- >HYPOTHETICAL PROTEIN RBST; SWP:P84134; PDB:1RZ3A; ELRDRIDFLCKTILAIKTAGRLVLGIDGLSRSGKTTLANQLSQTLREQGISVCVFHDDHI ------------1111-----------------------------1111----------- VERAKRYHTGNEEWFEYYYLQWDVEWLTHQLFRQLKASHQLTLPFYDHETDTHSKRTVYL 
-3333-------------------------3333-------------1111--------- SDSDIIEGVFLQRKEWRPFFDFVVYLDCPNIQKFINRYWKAEDYYLETEEPIKRADVVFD ------------33331111------------------------------3333------ >Eukaryotic translation in; SWP:Q9UBQ5; PDB:1RZ4A; AMFEQRANVGKLLKGIDRYNPENLATLERYVETQAKENAYDLEANLAVLKLYQFNPAFFQ 3333---------------3333-------------------------------3333-- TTVTAQILLKALTNLPHTDFTLCKCIDQAHQEERPIRQILYLGDLLETCHFQAFWQALDE -----------1111-----------3333------------------------------ NDLLEGITGFEDSVRKFICHVVGITYQHIDRWLLAELGDLSDSQLKVWSKYGWSADEQIF -3333-------------------------------------------1111-------- ICSQEESIKPKNIVEKIDFDSVSSIAS --3333--------------------- >FAB 48D LIGHT CHAIN; SWP:NA; PDB:1RZ7H; EVQLVQSGAEVKKPGATVKISCKASGYTFSDFYMYWVRQAPGKGLEWMGLIDPEDADTMY ------------2222------------1111-------2222----------------- AEKFRGRVTITADTSTDTGYLELSSL 3333---------------------- >FAB E51 LIGHT CHAIN; SWP:NA; PDB:1RZFH; VQLVQSGAEVNKPGSSVKVSCQASGATLNSHAFSWVRQAPGQGLEWMAGIIPIFGSSHYA -----------2222------------1111-------2222-----------------3 QKFRGRVTISADESTRTVYLHLRL 333---------1111-------- >FAB 412D LIGHT CHAIN; SWP:Q6GMX8; PDB:1RZGA; EVQLVQSGAEVKKPGSSVKVSCKASGGTFSNYAINWVRQAPGQGLEWMGGIIPIFNIAHY ------------2222-----------1111--------2222----------------- AQRFQGRVSITADESTSTAYMELSSL 3333---------1111--------- >FAB 47E LIGHT CHAIN; SWP:NA; PDB:1RZIB; QVQLLQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGLEWMGGIIPVFGSANY ------------------------------------------------------------ AQKFQGRVTITADEATSTTYMELSSL -1111-------3333---------- >NONSPECIFIC LIPID TRANSFE; SWP:P23096; PDB:1RZL; ITCGQVNSAVGPCLTYARGGAGPSAACCSGVRSLKAAASTTADRRTACNCLKNAARGIKG ---------3333--1111-----------------------------------1111-- LNAGNAASIPSKCGVSVPYTISASIDCSRVS ----------1111-----------3333-- >Agglutinin [Precursor]; SWP:P06750; PDB:1RZOB; ADVCMDPEPIVRIVGRNGLCVDVTGEEFFDGNPIQLWPCKSNTDWNQLWTLRKDSTIRSN --------------2222-----%%%%-2222-----------1111-----------ii GKCLTISKSSPRQQVVIYNCSTATVGATRWQIWDNRTIINPRSGLVLAATSGNSGTKLTV 
ii---------------------3333-----3333----3333--------2222---- QTNIYAVSQGWLPTNNTQPFVTTIVGLYGMCLQANSGKVWLEDCTSEKAEQQWALYADGS -----3333----------------2222-----!!!!---------3333----1111- IRPQQNRDNCLTTDANIKGTVVKILSCGPASSGQRWMFKNDGTILNLYNGLVLDVRRSDP --3333----------2222------3333--------1111------------222211 SLKQIIVHPFHGNLNQIWLPLF 11----------1111------ >Glucose-resistance amylas; SWP:P46828; PDB:1RZRG; NVTIYDVAREASVSATVSRVVNGNPNVKPSTRKKVLETIERLGYRPNAVARGLASKKTTT --3333--1111-------1111----3333----------------------------- VGVIIPDISNIFYAELARGIEDIATYKYNIILSNSDQNQDKELHLLNNLGKQVDGIIFSG ------3333-------------------------------------------------- NVTEEHVEELKKSPVPVVLAASIESTNQIPSVTIDYEQAAFDAVQSLIDSGHKNIAFVSG --3333---1111----------------------------------3333--------- TLEEPINHAKKVKGYKRALTESGLPVRDSYIVEGDYTYDSGIEAVEKLLEEDEKPTAIFV 1111----------------------3333------3333-------------------- GTDEALGVIHGAQDRGLNVPNDLEIIGFDNTRLSTVRPQLTSVVQPYDIGAVARLLTKYN ----3333----------------------3333---------------------3333- KETVDSSIVQLPHRIEFRQSTK ---------------------- >Phosphocarrier protein HP; SWP:O69250; PDB:1RZRT; AQKTFTVTADSGIHARPATTLVQAASKFDSDINLEFNGKTVNLKIMGVMSLGIQKGATIT --------1111----------1111--------------------1111---------- ISAEGSDEADALAALEDTMSKEGLGE ----11113333-------1111--- >REGULATORY PROTEIN CRO; SWP:P09964; PDB:1RZSA; MYKKDVIDHFGTQRAVAKALGISDAAVSQWKEVIPEKDAYRLEIVTAGALKYQENAYRQA -3333------3333-------3333--------3333-----1111------------- A - >GLYCOGEN SYNTHASE 1; SWP:P39670; PDB:1RZUA; MNVLSVSSEIYPLIKTGGLADVVGALPIALEAHGVRTRTLIPGYPAVKAAVTDPVKCFEF ----------------------------3333-----------3333------------- TDLLGEKADLLEVQHERLDLLILDAPAYYERSGGPYLGQTGKDYPDNWKRFAALSLAAAR --%%%%--------iiii------3333---------1111--1111------------- IGAGVLPGWRPDMVHAHDWQAAMTPVYMRYAETPEIPSLLTIHNIAFQGQFGANIFSKLA -----------------3333----------------------1111----11111111- LPAHAFGMEGIEYYNDVSFLKGGLQTATALSTVSPSYAEEILTAEFGMGLEGVIGSRAHV -3333-------%%%%----------------------1111----iiii------1111 
LHGIVNGIDADVWNPATDHLIHDNYSAANLKNRALNKKAVAEHFRIDDDGSPLFCVISRL --------3333-3333--------33333333--------------------------- TWQKGIDLMAEAVDEIVSLGGRLVVLGAGDVALEGALLAAASRHHGRVGVAIGYNEPLSH 3333------------1111---------3333----3333--2222------------- LMQAGCDAIIIPSRFEPCGLTQLYALRYGCIPVVARTGGLADTVIDANHAALASKAATGV -------------------3333--------------3333------------------- QFSPVTLDGLKQAIRRTVRYYHDPKLWTQMQKLGMKSDVSWEKSAGLYAALYSQLIS -----------------------3333------1111-------------------- >PROTEIN AF2095(GR4); SWP:O28185; PDB:1RZWA; MTLKQVIVVRDDLKLSRGKLAVQVAHAAIIGYLKSDSSLRRKWLDEGQKKVVLKVKSLEE -------------------------------------3333--3333------------- LLGIKHKAESLGLVTGLVQDAGLTEVPPGTITAVVIGPDEERKIDKVTGNLPLLKLEHHH -------3333--------3333----------------------1111----------- HHH --- >CG5884-PA; SWP:O97111; PDB:1RZXA; ETHRRVRLLKHGSDKPLGFYIRDGTSVRVTASGLEKQPGIFISRLVPGGLAESTGLLAVN -----------1111--------------3333------------2222--3333--222 DEVIEVNGIEVAGKTLDQVTDMMVANSSNLIITVKPAN 2----iiii-2222---------1111----------- >30S RIBOSOMAL PROTEIN S8; SWP:P02361; PDB:1S03G; DPIADMLTRIRNGQAANKAAVTMPSSKLKVAIANVLKEEGFIEDFKVEGDTKPELELTLK 3333---------1111------------------------------------------- YFQGKAVVESIQRVSRPGLRIYKRKDELPKVMAGLGIAVVSTSKGVMTDRAARQAGLGGE -iiii------------------1111--------------1111-------1111---- IICYVA ------ >HYPOTHETICAL PROTEIN PF04; SWP:Q8U3L1; PDB:1S04A; MEWEMGLQEEFLELIKLRKKKIEGRLYDEKRRQIKPGDVISFEGGKLKVRVKAIRVYNSF -------3333----1111----------------------------------------- REMLEKEGLENVLPGVKSIEEGIQVYRRFYDEEKEKKYGVVAIEIEPLEY --------1111-------------------------------------- >CYTOCHROME C-556; SWP:P00150; PDB:1S05A; QQDLVDKTQKLMKDNGRNMMVLGAIAKGEKPYDQAAVDAALKQFDETAKDLPKLFPDSVK ---3333--------------------------3333--------1111----------- GLKPFDSKYSSSPKIWAERAKFDTEIADFAKAVDGAKGKIKDVDTLKAAMQPIGKACGNC ------------3333--------------------------33331111-1111----- HENFRDKEG 3333----- >Adenosylmethionine-8-amin; SWP:P12995; PDB:1S0AA; 
MTTDDLAFDQRHILHPFTSMTSPLPVYPVVSAEGCELILSDGRRLVDGMSSWWAAIHGYN --------------------------------------1111------%%%%--1111-- HPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAM ---------------------------------------3333----------------- KMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQS -------1111----------------33331111--11113333--------------- RMGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKICDREGIL -------------------3333-----------1111----3333-------------- LIADEIATGFGRTGKLFACEHAEIAPDILCLGALTGGTMTLSATLTTREVAETISNGEAG ----------1111---3333-----------1111----------3333------3333 CFMHGPTFMGNPLACAAANASLAILESGDWQQQVADIEVQLREQLAPARDAEMVADVRVL -----1111--------------3333-----------------------1111------ GAIGVVETTHPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQ ------------------------------!!!!-----1111-------------1111 DETFFCQ -1111-- >ADENYLYL CYCLASE-ASSOCIAT; SWP:P54654; PDB:1S0PA; SVKEFQNLVDQHITPFVALSKKLAPEVGNQVEQLVKAIDAEKALINTASQSKKPSQETLL 3333-------------------------------------------------------- ELIKPLNNFAAEVGKIRDSNRSSKFFNNLSAISESIGFLSWVVVEPTPGPHVAEMRGSAE -------------------------------111133333333----------------- FYTNRILKEFKGVNQDQVDWVSNYVNFLKDLEKYIKQYHTTGLTWNPKGGDAKSAT ---------2222-------------------------1111---1111------- >TRANSLATION INITIATION FA; SWP:Q58657; PDB:1S0UA; SQAEVNIGMVGHVDHGKTSLTKALTGVWTDRGISIRLGYADCEIRKCPQCGTYTTKPRCP -----------1111--------------------------------------------- NCLAETEFLRRVSFVDSPGHETLMATMLSGASLMDGAILVIAANEPCPQPQTKEHLMALE ------------------------------------------------------------ ILGIDKIIIVQNKIDLVDEKQAEENYEQIKEFVKGTIAENAPIIPINIDVLLKAIQDFIP ------------3333----------------2222-1111------------------- TPKRDPDATPRMYVARSFDINKPGTEIKDLKGGVLGGAIIQGVFKVGDEIEIRPGIKVTE ----1111-----------------1111---------------2222------------ GNKTFWKPLTTKIVSLAAGNTILRKAHPGGLIGVGTTLDPYLTKSDALTGSVVGLPGTLP -----------------!!!!-----------------3333-%%%%2222---2222-- 
PIREKITIRANLLDRVVGTKEELKIEPLRTGEVLMLNIGTATTAGVITSARGDIADIKLK --------------------1111----2222-----!!!!---------!!!!------ LPICAEIGDRVAISRRVGSRWRLIGYGTIEG -----2222---------------------- >beta-subunit of trans-3-c; SWP:Q9EV85; PDB:1S0YA; PMISCDMRYGRTDEQKRALSAGLLRVISEATGEPRENIFFVIREGSGINFVEHGEHLPDY ---------------------------------3333--------3333--iiii----- VP -- >Beta-subunit of trans-3-c; SWP:Q9EV84; PDB:1S0YB; PFIECHIATGLSVARKQQLIRDVIDVTNKSIGSDPKIINVLLVEHAEANMSISGR ---------------------------------3333--------3333--iiii >HYPOTHETICAL PROTEIN TM14; SWP:Q9X1G8; PDB:1S12A; MIKVTVTNSFFEVTGHAPDKTLCASVSLLTQHVANFLKAEKKAKIKKESGYLKVKFEELE ------1111---------------------------1111------------------- NCEVKVLAAMVRSLKELEQKFPSQIRVEVIDNGS --------------------1111---------- >TOPOISOMERASE IV SUBUNIT ; SWP:P20083; PDB:1S14A; TDTTRPNHLGQEVIDNSVDEALAGHAKRVDVILHADQSLEVIDDGRGMPVDIHPEEGVPA ---------------------------------1111----------------3333--- VELILCISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDLQVVGTCGKRNTGTSVHFWP ----------------------iiii------iiii-----------1111--------- DETFFDSPRFSVSRLTHVLKAKAVLCPGVEITFKDEINNTEQRWCYQD 3333---------------------2222------------------- >TOPOISOMERASE IV SUBUNIT ; SWP:P20083; PDB:1S16A; TYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAGHAKRVDVILHADQ --3333----!!!!33333333------3333------------------------1111 SLEVIDDGRGMPVDIHPEEGVPAVELILCRLHAGGKFSNKNYQFSGGLHGVGISVVNALS ---------------------3333--------------------------3333----- KRVEVNVRRDGQVYNIAFENGEKVQDLQVVGTCGKRNTGTSVHFWPDETFFDSPRFSVSR --------%%%%------iiii-----------1111---------3333---------- LTHVLKAKAVLCPGVEITFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIGNFA -------------------------------------------2222------------- GDTEAVDWALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNILPR 1111--------1111--------iiii-1111-------------------------22 GVKLSAEDIWDRCAYVLSVKMQDPQFAGQTKERLSSRQCAAFVSGVVKDAFILWLNQNVQ 22--3333-1111--------------3333----3333--------------------- AAELLAEMAISSAQRRMRAA -------------------- 
>Rho-associated, coiled-co; SWP:Q59GZ4; PDB:1S1CX; GSMLTKDIEILRRENEELTEKMKKAEEEYKLEKEEEISNLKAAFEKNINTERTLKTQAVN -----------------11111111---------------3333---------------- KLAEIMNRK -----1111 >APYRASE; SWP:NA; PDB:1S1DA; NWYNDTYPLSPPQRTPAGIRYRIAVIADLDTESRAQEENTWFSYLKKGYLTLSDSGDKVA --------------1111----------!!!!--------------------3333---- VEWDKDHGVLESHLAEKGRGMELSDLIVFNGKLYSVDDRTGVVYQIEGSKAVPWVILSDG ---------------iiii---------%%%%--------------!!!!--------!! DGTVEKGFKAEWLAVKDERLYVGGLGKEWTTTTGDVVNENPEWVKVVGYKGSVDHENWVS !!-------------%%%%-----------1111----1111-----1111--------- NYNALRAAAGIQPPGYLIHESACWSDTLQRWFFLPRRASQERYSEKDDERKGANLLLSAS ------1111---------------1111--------------3333------------1 PDFGDIAVSHVGAVVPTHGFSSFKFIPNTDDQIIVALKSEEDSGRVASYIMAFTLDGRFL 111-----------1111----------%%%%---------%%%%--------1111--- LPETKIGSVKYEGIEFI ----------------- >KV CHANNEL INTERACTING PR; SWP:Q9NZI2; PDB:1S1EA; GLEQLEAQTNFTKRELQVLYRGFKNECPSGVVNEETFKQIYAQFFPHGDASTYAHYLFNA -----1111-----------------1111--------------1111------------ FDTTQTGSVKFEDFVTALSILLRGTVHEKLRWTFNLYDINKDGYINKEEMMDIVKAIYDM -1111--------------------------------1111------------------- MGKYTYPVLKEDTPRQHVDVFFQKMDKNKDGIVTLDEFLESCQEDDNIMRSLQLFQNVMV !!!!-3333----------------1111--------------------------3333- E - >PUTATIVE CYTOCHROME P450; SWP:Q9FCA6; PDB:1S1FA; QAVPPVRDWPAVDLPGSDFDPVLTELMREGPVTRISLPNGEGWAWLVTRHDDVRLVTNDP --------------!!!!-------1111-------------------------111111 RFGREAVMDRQVTRLAPHFIPARGAVGFLDPPDHTRLRRSVAAAFTARGVERVRERSRGM 11-1111--------------22221111-----------3333-33333333------- LDELVDAMLRAGPPADLTEAVLSPFPIAVICELMGVPATDRHSMHTWTQLILSSSHGAEV ---------------3333-----------------3333-------------1111--- SERAKNEMNAYFSDLIGLRSDSAGEDVTSLLGAAVGRDEITLSEAVGLAVLLQIGGEAVT 3333------------------------------1111---------------------- NNSGQMFHLLLSRPELAERLRSEPEIRPRAIDELLRWIPHRNAVGLSRIALEDVEIKGVR ----------------------3333-----------------------------iiii- 
IRAGDAVYVSYLAANRDPEVFPDPDRIDFERNPHVSFGFGPHYCPGGMLARLESELLVDA -2222-----3333--------1111-1111---1111-11111111------------- VLDRVPGLKLAVAPEDVPFKKGALIRGPEALPVTWHA ------------3333--------------------- >Potassium voltage-gated c; SWP:Q9UK17; PDB:1S1GA; DELIVLNVSGRRFQTWRTTLERYPDTLLGSTEKEFFFNEDTKEYFFDRDPEVFRCVLNFY -------iiii----3333------3333-3333--------------3333----3333 RTGKLHYPRYECISAYDDELAFYGILPEIIGDCCYEEYKDRKRENLE -------1111---------1111-3333------------------ >CTP SYNTHASE; SWP:P08398; PDB:1S1MA; MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEV -----------------------------1111--------------1111-3333---- FVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHIT --1111------------------1111-----------------1111----------- NAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYM -----------------------22221111-------------3333------------ AASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKD ------------------1111-------------------------------------- VDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIE --3333------------------------------------------------------ LPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEG 3333---------------------------------3333---------------3333 MITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWR ----------------------------------------3333----------3333-- DENGNVETMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAG 1111-----------------3333------------------3333------------- RSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQA ------------------------------------------------------ ------------------------------------------------------------ >ALDO-KETO REDUCTASE FAMIL; SWP:P42330; PDB:1S1PA; QCVKLNDGHFMPVLGFGTYAPPEVPRSKALEVTKLAIEAGFRHIDSAHLYNNEEQVGLAI ----1111------------11113333------------------3333---------- RSKIADGSVKREDIFYTSKLWSTFHRPELVRPALENSLKKAQLDYVDLYLIHSPMSLKPG ---1111--3333-------1111-3333------------------------------- 
EELSPTDENGKVIFDIVDLCTTWEAMEKCKDAGLAKSIGVSNFNRRQLEMILNKPGLKYK ------1111-------------------1111--------------------2222--- PVCNQVECHPYFNRSKLLDFCKSKDIVLVAYSALGSQRDKRWVDPNSPVLLEDPVLCALA --------1111---------1111------1111---3333-3333-1111-------- KKHKRTPALIALRYQLQRGVVVLAKSYNEQRIRQNVQVFEFQLTAEDMKAIDGLDRNLHY ---------------1111-----------------1111-----------1111----- FNSDSFASHPNYPYS --3333--1111--- >TUMOR SUSCEPTIBILITY GENE; SWP:Q99816; PDB:1S1QA; VSESQLKKVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELNLTGTIPVPYR --------1111----------------1111--------1111--------------ii GNTYNIPICLWLLDTYPYNPPICFVKPTSSTIKTGKHVDANGKIYLPYLHEWKHPQSDLL ii----------1111------------------11111111---3333----------- GLIQVIVVFGDEPPVFS ----------------- >ORF2; SWP:Q9K2L5; PDB:1S21A; PSRFVGQYTLTSIHQLSSEERENFLDAHDPMRVYDLNSETSVYRTTQREYVRNGYATGNP -1111------1111----------------1111-1111------33331111------ NSGAIIALHEELQESPYAQHIGARPDQADAYRPRTAHVSSLNTPSLNVMAGQGALSALHV -------1111---1111-----11111111-------1111--------1111------ TTEMRLGDFLDQGGKVYSDTSGGDSVEALIVTLPKGRKVPVNILD ----3333-1111--------------------2222-------- >RUBREDOXIN 2; SWP:P00272; PDB:1S24A; AYLKWICITCGHIYDEALGDEAEGFTPGTRFEDIPDDWCCPDCGATKEDYVLYEEK --------------1111-----------3333-3333-1111--3333------- >ORF1; SWP:NA; PDB:1S28A; GAKNSFDRLIDGLAKDYGPGFPEKKHEHEVYCFEFKEVSIRIYQDKFKWVYFLSDIGVID -----------------------------------1111--------------------- NLDSNACQSLLRLNEFNLRTPFFTVGLNEKKDGVVHTRIPLLNLDNVERRVFEALLNLSG --------------------------------------------3333------------ EVKKTFG ------- >LA PROTEIN; SWP:NA; PDB:1S29A; GSHPLSSENKQKLQKQVEFYFSDVNVQRDIFLKGKAENAEGFVSLETLLTFKRVNSVTTD ------------------------3333---------1111--3333------3333--3 VKEVVEAIRPSEKLVLSEDGLVRRRDPLP 333----3333-----1111--------- >PURINE TRANS DEOXYRIBOSYL; SWP:Q8RLY5; PDB:1S2DA; MKAVVPTGKIYLGSPFYSDAQRERAAKAKELLAKNPSIAHVFFPFDGFTDPDEKPEIGGI ----------------------------------1111----1111---3333--2222- RSMVWRDATYQNDLTGISNATCGVFLYDMDQLDDGSAFIGFMRAMHKPVILVPFTEHPEK 
------------------------------------------1111----------3333 EKKMNLMIAQGVTTIIDGNTEFEKLADYNFNECPSNPVRGYGIY ----3333--------3333--------3333------------ >PUTATIVE ATP-DEPENDENT RN; SWP:P39517; PDB:1S2MA; NTFEDFYLKRELLMGIFEAGFEKPSPIQEEAIPVAITGRDILARAKNGTGKTAAFVIPTL ----------------1111--------------3333-----------3333------- EKVKPKLNKIQALIMVPTRELALQTSQVVRTLGKHCGISCMVTTGGTNLRDDILRLNETV ---3333-----------------------1111-------------3333--------- HILVGTPGRVLDLASRKVADLSDCSLFIMDEADKMLSRDFKTIIEQILSFLPPTHQSLLF --------------------1111------3333--1111-------1111--------- SATFPLTVKEFMVKHLHKPYEINLMEELTLKGITQYYAFVEERQKLHCLNTLFSKLQINQ -----------------------------2222-------3333---------------- AIIFCNSTNRVELLAKKITDLGYSCYYSHARMKQQERNKVFHEFRQGKVRTLVCSDLLTR ----------------------------1111---------------------------- GIDIQAVNVVINFDFPKTAETYLHRIGRSGRFGHLGLAINLINWNDRFNLYKIEQELGTE ---1111-----------------------1111--------3333--------1111-- IAAIPATIDKSLYVAEN --------3333----- >SUCROSE-PHOSPHATASE; SWP:P74325; PDB:1S2OA; MRQLLLISDLDNTWVGDQQALEHLQEYLGDRRGNFYLAYATGRSYHSARELQKQVGLMEP ---------2222-----------------1111-------------------------- DYWLTAVGSEIYHPEGLDQHWADYLSEHWQRDILQAIADGFEALKPQSPLEQNPWKISYH -----iiii---1111---------2222-----------1111---3333--------- LDPQACPTVIDQLTEMLKETGIPVQVIFSSGKDVDLLPQRSNKGNATQYLQQHLAMEPSQ -1111-----------------------%%%%-----3333---------------3333 TLVCGDSGNDIGLFETSARGVIVRNAQPELLHWYDQWGDSRHYRAQSSHAGAILEAIAHF ------11113333--------1111------------3333-----!!!!--------- DFLS ---- >PHOSPHOENOLPYRUVATE PHOSP; SWP:P56839; PDB:1S2WA; KVKKTTQLKQMLNSKDLEFIMEAHNGLSARIVQEAGFKGIWGSGLSVSAQLWTQVVEVLE --------------------------------3333------------------------ FMSDASDVPILLDADTGYGNFNNARRLVRKLEDRGVAGACLEDKLFGRAQPLADIEEFAL --1111-------------------------1111------------------------- KIKACKDSQTDPDFCIVARVEAFIAGWGLDEALKRAEAYRNAGADAILMHSKKADPSDIE ----------1111------3333---------------1111----------------- AFMKAWNNQGPVVIVPTKYYKTPTDHFRDMGVSMVIWANHNLRASVSAIQQTTKQIYDDQ 
-----%%%%------3333---3333---------------------------------- SLVNVEDKIVSVKEIFRL -1111-----3333---- >CAG-Z; SWP:NA; PDB:1S2XA; VDELGFNEAERQKILDSNSSLRNANEVRDKFIQNYATSLKDSNDPQDFLRRVQELRINQK -----------------------------------3333-------------------11 NFISFDAYYNYLNNLVLASYNRCKQEKTFAESTIKNELTLGEFVAEISDNFNNFTCDEVA 11--3333---------------------------------------------------- RISDLVASYLPREYLPPFIDGNGVAFQILGIDDFGKKLNEIVQDIGTKYIILSKNK ------11111111-1111-------------------------------1111-- >SPECTRIN BETA CHAIN, ERYT; SWP:P11277; PDB:1S35A; EQAFLQDLDDFQAWLSITQKAVASEDMPESLPEAEQLLQQHAGIKDEIDGHQDSYQRVKE ------------------------------------------------------------ SGEKVIQGQTDPEYLLLGQRLEGLDTGWDALGRMWESRSHTLAQCLGFQEFQKDAKQAEA -----2222-3333---------------------------------------------- ILSNQEYTLAHLEPPDSLEAAEAGIRKFEDFLGSMENNRDKVLSPVDSGNKLVAEGNLYS --------------------------------------3333----------11111111 DKIKEKVQLIEDRHRKNNEKAQEASVLLRDN ------------------------------- >NADH-UBIQUINONE OXIDOREDU; SWP:O43678; PDB:1S3AA; GLREIRIHLCQRSPGSQGVRDFIEKRYVELKKANPDLPILIRECSDVQPKLWARYAFGQE ---------------------------------1111-----------------3333-- TNVPLNNFSADQVTRALENVLSGKA ----1111----------------- >ARSENATE REDUCTASE; SWP:P08692; PDB:1S3CA; NITIYHNPASGTSRNTLEMIRNSGTEPTIILYLENPPSRDELVKLIADMGISVRALLRKN ------1111----------1111------3333------------3333-3333----- VEPYEQLGLAEDKFTDDQLIDFMLQHPILINRPIVVTPLGTRLCRPSEVVLDILQDAQKG ----1111-----------------3333-------3333-----11113333------- AFTKEDGEKVVDEAGKRL ---1111----1111--- >AMINE OXIDASE [FLAVIN-CON; SWP:P27338; PDB:1S3EA; NKCDVVVVGGGISGMAAAKLLHDSGLNVVVLEARDRVGGRTYTLRNQKVKYVDLGGSYVG ---------------------1111------------!!!!------------------2 PTQNRILRLAKELGLETYKVNEVERLIHHVKGKSYPFRGPFPPVWNPITYLDHNNFWRTM 222-------1111---------------iiii--------------------------- DDMGREIPSDAPWKAPLAEEWDNMTMKELLDKLCWTESAKQLATLFVNLCVTAETHEVSA ---111111111111--------------------------------------1111--- LWFLWYVKQCGGTTRIISTTNGGQERKFVGGSGQVSERIMDLLGDRVKLERPVIYIDQTR 
-------1111-------2222-----2222-----------!!!!-------------- ENVLVETLNHEMYEAKYVISAIPPTLGMKIHFNPPLPMMRNQMITRVPLGSVIKCIVYYK ------1111------------3333---------------1111--------------- EPFWRKKDYCGTMIIDGEEAPVAYTLDDTKPEGNYAAIMGFILAHKARKLARLTKEERLK -3333-----------3333---------3333---------!!!!---3333------- KLCELYAKVLGSLEALEPVHYEEKNWCEEQYSGGCYTTYFPPGILTQYGRVLRQPVDRIY -------1111--1111-------3333------------2222---3333----!!!!- FAGTETATHWSGYMEGAVEAGERAAREILHAMGKIPEDEIWQSEPESVDVPAQPITTTFL --3333---2222----------------1111--1111------------------333 ERHLPSVPGLLRLIGLTTI 3--------------3333 >ADENYLATE KINASE; SWP:P84139; PDB:1S3GA; MNIVLMGLPGAGKGTQADRIVEKYGTPHISTGDMFRAAIQEGTELGVKAKSFMDQGALVP -------2222----------1111----3333-------------------3333---- DEVTIGIVRERLSKSDCDNGFLLDGFPRTVPQAEALDQLLADMGRKIEHVLNIQVEKEEL ----------------1111---------------------------------------- IARLTGRRICKVCGTSYHLLFNPPQVEGKCDKDGGELYQRADDNPDTVTNRLEVNMNQTA -------------------------2222----------1111----------------- PLLAFYDSKEVLVNINGQKDIKDVFKDLDVILQGNGQ ------1111--------------------1111--- >10-FORMYLTETRAHYDROFOLATE; SWP:P28037; PDB:1S3IA; MKIAVIGQSLFGQEVYCQLRKEGHEVVGVFTIPDKDGKADPDGLEAEKDGVPVFKFPRWR -------------------1111-----------%%%%---------------------- ARGQALPEVVAKYQALGAELNVLPFCSQFIPMEVINAPRHGSIIYHPSLLPRHRGASAIN iiii---------3333--------------3333--1111------------------- WTLIHGDKKGGFTIFWADDGLDTGDLLLQKECEVLPDDTVSTLYNRFLFPEGIKGMVQAV --1111----------------------------1111---------------------- RLIAEGTAPRCPQSEEGATYEGIQKKETAKINWDQPAEAIHNWIRGNDKVPGAWTEACGQ --------------2222------3333----------------------------%%%% KLTFFNSTLNTSGLSTQGEALPIPGAHRPGVVTKAGLILFGNDDRMLLVKNIQLEDGKMM ----------2222--------2222------1111-----------------1111--- PASQFFK 3333--- >YUSO PROTEIN; SWP:O32181; PDB:1S3JA; SADQLSDIQLSLQALFQKIQPELESEKQGVTPAQLFVLASLKKHGSLKVSEIAEREVKPS 3333---------------------1111------------------3333------333 AVTLADRLEQKNLIARTHNTKDRRVIDLSLTDEGDIKFEEVLAGRKAIARYLSFLTEEEL 
3-------1111------3333-----------------------------11113333- QAAHITAKLAQAAETD ---------------- >HU3S193 FAB FRAGMENT, LIG; SWP:NA; PDB:1S3KH; EVQLVESGGGVVQPGRSLRLSCSTSGFTFSDYYMYWVRQAPGKGLEWVAYMSNVGAITDY ------------2222-----------1111--------2222----------------- PDTVKGRFTISRDNSKNTLFLQMDSLRPEDTGVYFCARGTRDGSWFAYWGQGTPVTVSSA 3333--------3333----------3333---------1111----------------- STKGPSVFPLAPGTAALGCLVKDYFPQPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSV --------------------------------%%%%--2222-------1111------- VTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP ---1111------------1111--------- >HU3S193 FAB FRAGMENT, LIG; SWP:NA; PDB:1S3KL; DIQMTQSPSSLSASVGDRVTITCRSSQRIVHSNGNTYLEWYQQTPGKAPKLLIYKVSNRF -------------2222-------------1111---------2222------------2 SGVPSRFSGSGSGTDFTFTISSLQPEDIATYYCFQGSHVPFTFGQGTKLQITRTVAAPSV 2223333----------------1111--------------------------------- FIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSL -----3333-------------------------iiii---------------------- SSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFNR ------3333------------1111---------- >HYPOTHETICAL PROTEIN MJ09; SWP:Q58346; PDB:1S3LA; MKIGIMSDTHDHLPNIRKAIEIFNDENVETVIHCGDFVSLFVIKEFENLNANIIATYGNN --------%%%%--------------------------3333---1111--------111 DGERCKLKEWLKDINEENIIDDFISVEIDDLKFFITHGHHQSVLEMAIKSGLYDVVIYGH 1-------------3333---------%%%%----------------3333--------- THERVFEEVDDVLVINPGECCGYLTGIPTIGILDTEKKEYREIVL --------iiii------3333------------1111------- >FERRITIN; SWP:O29424; PDB:1S3QA; SISEKVEALNRQINAEIYSAYLYLSASYFDSIGLKGFSNWRVQWQEELHAKFDFVSERGG -----------------------------1111--------------------------- RVKLYAVEEPPSEWDSPLAAFEHVYEHEVNVTKRIHELVEAQEKDFATYNFLQWYVAEQV ------------------------------------------------------------ EEEASALDIVEKLRLIGEDKRALLFLDKELSLRQ ---------------!!!!----------1111- >INTERMEDILYSIN; SWP:Q9LCB8; PDB:1S3RA; NSEAAKKALNDYIWGLQYDKLNILTHQGEKLKNHSSREAFHRPGEYVVIEKKKQSISNAT -----------------------------------------2222--------------- 
SKLSVSSANDDRIFPGALLKADQSLLENLPTLIPVNRGKTTISVNLPGLKNGESNLTVEN -----333311112222--------------------------------!!!!------- PSNSTVRTAVNNLVEKWIQNYSKTHAVPARMQYESISAQSMSQLQAKFGADFSKVGAPLN -3333----------------1111-----------------------3333-1111--- VDFSSVHKGEKQVFIANFRQVYYTASVDSPNSPSALFGSGITPTDLINRGVNSKTPPVYV -33333333----------------------3333--2222-----1111-3333----- SNVSYGRAMYVKFETTSKSTKVQAAIDAVVKGAKLKAGTEYENILKNTKITAVVLGGNPG -------------------------3333--------------3333------------- EASKVITGNIDTLKDLIQKGSNFSAQSPAVPISYTTSFVKDNSIATIQNNTDYIETKVTS --------3333------------------------------------------------ YKDGALTLNHDGAFVARFYVYWEELGHDADGYETIRSRSWSGNGYNRGAHYSTTLRFKGN ---------------------------1111--------1111----1111------111 VRNIRVKVLGATGLAWEPWRLIYSKNDLPLVPQRNISTWGTTLHPQFEDKVVK 1---------------------------------------3333--------- >P47 PROTEIN; SWP:NA; PDB:1S3SG; FTGEGQKLGSTAPQVLNTSSPAQQAENEAKASSSILINEAEPTTNIQIRLADGGRLVQKF -------------------3333--------------1111------------------- NHSHRISDIRLFIVDARPAMAATSFVLMTTFPNKELADENQTLKEANLLNAVIVQRLT ----3333---------3333----------------33333333------------- >HEAT SHOCK 70 KDA PROTEIN; SWP:P08107; PDB:1S3XA; KAAAIGIDLGTTYSCIGVFQHGKVEIIANDQGNRTTPSYVAFTDTERLIGDAAKNQVALN -------------------iiii-----1111-----------------------33333 PQNTVFDAKRLIGRKFGDPVVQSDMKHWPFQVINDGDKPKVQVSYKGETKAFYPEEISSM 333---333322221111------1111------%%%%------iiii----3333---- VLTKMKEIAEAYLGYPVTNAVITVPAYFNDSQRQATKDAGVIAGLNVLRIINEPTAAAIA ------------------------1111-------------------------------- YGLDRTGKGERNVLIFDLGGGTFDVSILTIDDGIFEVKATAGDTHLGGEDFDNRLVNHFV -3333-------------------------iiii-------------------------- EEFKRKHKKDISQNKRAVRRLRTACERAKRTLSSSTQASLEIDSLFEGIDFYTSITRARF ----------1111----------------1111-----------iiii----------- EELCSDLFRSTLEPVEKALRDAKLDKAQIHDLVLVGGSTRIPKVQKLLQDFFNGRDLNKS ------------------------3333-------3333----------1111------- INPDEAVAYGAAVQAAILMG -1111--------------- >AMINOGLYCOSIDE 6'-N-ACETY; SWP:NA; PDB:1S3ZA; 
HMDIRQMNKTHLEHWRGLRKQLWPGHPDDAHLADGEEILQADHLASFIAMADGVAIGFAD -------3333------3333-----3333-------1111---------iiii------ ASIRHDYVNGCDSSPVVFLEGIFVLPSFRQRGVAKQLIAAVQRWGTNKGCREMASDTSPE -------2222-------------3333-----------------1111--------111 NTISQKVHQALGFEETERVIFYRKRC 1-------1111-------------- >RNA-DEPENDENT RNA POLYMER; SWP:P19711; PDB:1S48A; VIREHNKWILKKIRFQGNLNTKKLNPGKLSEQLDREGRKRNIYNHQIGTISSAGIRLEKL ------3333-----------------------1111-------3333-------3333- PIVRAQTDTKTFHEAIRDKIDKSENRQNPELHNKLLEIFHTIAQPTLKHTYGEVTWEQLE -------------------------------------------3333-------3333-2 AGVNRKGAAGFLEKKNIGEVLDSEKHLVEQLVRDLKAGRKIKYYETAIPKNEKRDVSDDW 2221111--1111----3333--------------------------------------- QAGDLVVEKRPRVIQYPEAKTRLAITKVYNWVKQQPVVIPGYEGKTPLFNIFDKVRKEWD ------------------------------1111----22221111------------11 SFNEPVAVSFDTKAWDTQVTSKDLQLIGEIQKYYYKKEWHKFIDTITDHTEVPVITADGE 11------------3333-----------------3333----------------1111- VYIRNGQRGSGQPDTSAGNSLNVLTYAFCESTGVPYKSFNRVARIHVCGDDGFLITEKGL --------1111----3333--------------3333---------!!!!--------- GLKFANKGQILHEAGKPQKITEGEKKVAYRFEDIEFCSHTPVPVRWSDNTSSHAGRDTAV -----------1111-----3333-----3333--%%%%------1111-------3333 ILSKATRLDSSGERGTTAYEKAVAFSFLLYSWNPLVRRICLLVLSQQPETDPSKHATYYY -----------------------------1111---------3333-------------- KGDPIGAYKDVIGRNLSELKRTGFEKLANLNLSLSTLGVWTKHTSKRIIQDCVAIGKEEG --------------3333----------------1111---------------1111--- NWLVKPDRLISSKTGHLYIPDKGFTLQGKHYEQLQL ---1111----------------------------- >PROTEIN HI0227; SWP:P44583; PDB:1S4CA; MIISSLTNPNFKVGLPKVIAEVCDYLNTLDLNALENGRHDINDQIYMNVMEPETAEPSSK ----1111-1111------------11113333----------------------1111- KAELHHEYLDVQVLIRGTENIEVGATYPNLSKYEDYNEADDYQLCADIDDKFTVTMKPKM ----------------------------3333-----1111---------------2222 FAVFYPYEPHKPCCVEKIKKLVVKVPVKLI ----2222-----------------3333- >UROPORPHYRIN-III C-METHYL; SWP:P21631; PDB:1S4DA; FAGLPALEKGSVWLVGAGPGDPGLLTLHAANALRQADVIVHDALVNEDCLKLARPGAVLE 
-----------------------------------------------3333--------- FAGKRGPSPKQRDISLRLVELARAGNRVLRLKGGDPFVFGRGGEEALTLVEHQVPFRIVP ---------3333--------1111---------1111---------------------- GITAGIGGLAYAGIPVTHREVNHAVTFLTGHDRINWQGIASGSPVIVMYMAMKHIGAITA --3333---1111----3333-------------3333------------3333------ NLIAGGRSPDEPVAFVCNAATPQQAVLETTLARAEADVAAAGLEPPAIVVVGEVVRLRAA --1111-1111---------1111-----3333------------------3333----- LDWIGALDGRKLAADP -3333----------- >GALACTOKINASE; SWP:Q9HHB6; PDB:1S4EA; TVKSPGRVNLIGEHTDYTYGYVPAIDLYTIITDKVQLYSEHFNEKLDLTKEGSWIDYVKG --------------1111--------------------3333----------3333---- VLWVLIQEGYKIGGLKKITGDLPLGAGLSSSASFEVGILEVLNQLYNLNIDPLKKALLAK -----1111--------------------------------------------------- KAENEFVGVPCGILDQFAVVFGKKDNVIFLDTQTLQYEYIPFPKDVSVLVFYTGVKRELA ------------------------------------------1111----------3333 SSEYAERKRIAEESLRILGKESSKEVTEKDLGKLPPLHRKFFSYIVRENARVLEVRDALK ---------------------3333-33331111------------------------11 EGDIEKVGKILTTAHWDLAENYRVSCEELDFFVKKAELGAYGARLTGAGFGGSAIALVDK 11--------------------------------------------------------33 DKAKTIGDAILREYLAKFSWKAKYFVVKPSDGVG 33-------------------------------- >PUTATIVE CYTOPLASMIC PROT; SWP:Q8ZPR1; PDB:1S4KA; ANALELQALRRIFDTIEECTIYITQDNNSATWQRWEAGDIPISPEIIARLKEKARRQRRI ------------------------------------------------------------ NAIVDKINNRIGNNTRYFPDLSSFQSIYTEGDFIEWKIYQSVAAELFAHDLERLC -----------------------33331111---------------1111----- >GLYCOLIPID 2-ALPHA-MANNOS; SWP:P27809; PDB:1S4NA; KTTMDYITPSFKAGKPKACYVTLVRNKELKGLLSSIKYVENKINKKFPYPWVFLNDEPFT -3333-3333--------------1111-------------------------------- EEFKEAVTKAVSSEVKFGILPKEHWSYPEWINQTKAAEIRADAATKYIYGGSESYRHMCR --------------------3333---1111-----------11112222---------- YQSGFFWRHELLEEYDWYWRVEPDIKLYCDINYDVFKWMQENEKVYGFTVSIHEYEVTIP ----11113333------------------------------------------333311 TLWQTSMDFIKKNPEYLDENNLMSFLSNDNGKTYNLCHFWSNFEIANLNLWRSPAYREYF 
11----------3333-----3333---iiii-------3333-----3333-------- DTLDHQGGFFYERWGDAPVHSIAAALFLPKDKIHYFSDIGYHHPPYDNCPLDKEVYNSNN ------3333------------------1111---1111-----------------1111 CECDQGNDFTFQGYSCGKEYYDAQGLVKPKNWKKFRE ---3333----1111-------------11111111- >Antiviral protein SKI8; SWP:Q02793; PDB:1S4UX; KVFIATANAGKAHDADIFSVSACNSFTVSCSGDGYLKVWDNKLLDNENPKDKSYSHFVHK ----------------------3333----1111-------------3333-------33 SGLHHVDVLQAIERDAFELCLVATTSFSGDLLFYRITRKKVIFEKLDLLDSDMKKHSFWA 33-----------------------1111--------------------3333------- LKWGASLSHRLVATDVKGTTYIWKFHPFADESNSLTLNWSPTLELQGTVESPMTPSQFAT --------------1111-----------33331111----------------------- SVDISERGLIATGFNNGTVQISELSTLRPLYNFENSIRSVKFSPQGSLLAIAHDSNSFGC ----1111----------------------------------3333--------iiii-- ITLYETEFGERIGSLSVFAHSSWVMSLSFNDSGETLCSAGWDGKLRFWDVKTKERITTLN -----------------------------3333----------------1111------- MHCDDIEIEEDILAVDEHGDSLAEPGVFDVKFLKKGWRSNESLCCVCLDRSIRWFR -1111--3333----1111---------------------------1111------ >CYSTEINE ENDOPEPTIDASE; SWP:O65039; PDB:1S4VA; TVPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD ------------------------3333-------------------------------- QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPEND --!!!!------------------3333----------3333------------------ ENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWT ------3333-----------------------------------------1111----- VKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNN -----3333-iiii--------3333%%%%-------------- >INTEGRIN BETA-3; SWP:P05106; PDB:1S4XA; KLLITIHDRKEFAKFEEERARAKWDTANNPLYKEATSTFTNITYRGT --------------3333----------3333-------1111---- >ACTIVIN RECEPTOR TYPE IIB; SWP:P27040; PDB:1S4YA; RECIYYNANWELERTNQSGLERCEGEQDKRLHCYASWRNSSGTIELVKKGCWLDDFNCYD --------3333------------------------------------------3333-- RQECVATEENPQVYFCCCEGNFCNERFTHLP -------------------2222-------- ------------------------------ >GLUTAMATE RECEPTOR 6; SWP:P42260; PDB:1S50A; 
SNRSLIVTTILEEPYVLFKKSDKPLYGNDRFEGYCIDLLRELSTILGFTYEIRLVEDGKY -------------------------!!!!-------------------------1111-- GAQDDVNGQWNGMVRELIDHKADLAVAPLAITYVREKVIDFSKPFMTLGISILYRKGTPI -------------------------------3333------------------------- DSADDLAKQTKIEYGAVEDGATMTFFKKSKISTYDKMWAFMSSRRQSVLVKSNEEGIQRV -33331111----------------------------------3333------------- LTSDYAFLMESTTIEFVTQRNCNLTQIGGLIDSKGYGVGTPMGSPYRDKITIAILQLQEE --------------------1111----------------2222-------------111 GKLHMMKEKWWRGNGCPE 1----------------- >Tumor necrosis factor lig; SWP:O35235; PDB:1S55A; AQPFAHLTINAASIPSGSHKVTLSSWYHDRGWAKISNMTLSNGKLRVNQDGFYYLYANIC -----------------------------!!!!-------iiii---------------- FRHHETSGSVPTDYLQLMVYVVKTSIKIPSSHNLMKGGSTKNWSGNSEFHFYSINVGGFF ---3333-----------------3333-------------------------------- KLRAGEEISIQVSNPSLLDPDQDATYFGAFKVQDID --2222-------3333---1111------------ >NUCLEOSIDE DIPHOSPHATE KI; SWP:O64903; PDB:1S57A; SMEDVEETYIMVKPDGIQRGLVGEIISRFEKKGFKLIGLKMFQCPKELAEEHYKDLSAKS -----------------------------3333-------------------1111--11 FFPNLIEYITSGPVVCMAWEGVGVVASARKLIGKTDPLQAEPGTIRGDLAVQTGRNIVHG 11---------------------------------3333------------3333----- SDSPENGKREIGLWFKEGELCKWDSALATWLRE ---------------2222-----1111----- >B19 PARVOVIRUS CAPSID; SWP:P07299; PDB:1S58A; NPVKSMWSEGATFSANSVTCTFSRQFLIPYDPEHHYKVFSPAASSCHNASGKEAKVCTIT ------------------------------------------------------------ PIMGYSTPWRYLDFNALNLFFSPLEFQHLIENYGSIAPDALTVTISEIAVKDVTDKTGGG ------------------------------------------------------------ VQVTDSATGRLCMLVDHEYKYPYVLGQGQDTLAPELPIWVYFPPQYAYLTVGDVNTQGIS ------------------------------------------------------------ GDSKKLASEESAFYVLEHSSFQLLGTGGTATMSYKFPPVPPENLEGCSQHFYEMYNPLYG ------------------------------------------------------------ SRLGVPDTLGGDPKFRSLTHEDHAIQPQNFMPGPLVNSVSTKTGLSTGTSQNTRISLRPG -----------------22223333----------------------------------- PVSQPYHHWDTDKYVTGINAISHGQTTYGNAEDKEYQQGVGRFPNEKEQLKQLQGLNMHT 
3333----------------------------------------3333------------ YFPNKGTQQYTDQIERPLMVGSVWNRRALHYESQLWSKIPNLDDSFKTQFAALGGWGLHQ --------------------------------------------------3333------ PPPQIFLKILPQSGPIGGIKSMGITTLVQYAVGIMTVTMTFKLGPRKATGRWNPQPGVYP -----------------1111--------------------------------------- PHAAGHLPYVLYDPTATDAKQHHRHGYEKPEELWTAKSRVHPL ------------------------------------------- >HYPOTHETICAL PROTEIN YESE; SWP:O31511; PDB:1S5AA; NEFEKACETLRKFAYLEKDKSWTELWDENAVFEFPYAPEGSPKRIEGKAAIYDYIKDYPK ----------------------11111111---11112222-------------1111-- QIHLSSFTAPTVYRSADSNTVIAEFQCDGHVIETGLPYRQSYISVIETRDGRIVRYRDYW -------------------------------1111-------------iiii-------- NPLVVKEAFGGSFLQ ------1111----- >CHOLERA ENTEROTOXIN, A CH; SWP:P01555; PDB:1S5DA; NDDKLYRADSRPPDEIKQSGGLMPRGQSQMNINLYDHARGTTGFVRHDDGYVSTSISLRS -----------------------1111--------------------iiii--------- AHLVGQTILSGHSTYYIYVIATAPNMFNVNDVLGAYSPHPDEQEVSALGGIPYSQIYGWY --------2222---------------3333-!!!!--3333---------3333----- RVHFGVLDEQLHRNRGYRDRYYSNLDIAPAADGYGLAGFPPEHRAWREEPWIHHAPPGCG --iiii-------1111----1111---33333333---11113333--3333--2222- TCDEKTQSLGVKFLDEYQSKVKRQIFSGYQSDIDTHNR ---------------------------1111--3333- >DNA POLYMERASE I; SWP:P26811; PDB:1S5JA; EWLEEAQENKIYFLLQVDYDGKKGKAVCKLFDKETQKIYALYDNTGHKPYFLVDLEPDKV --------------------1111-------------------------------3333- GKIPKIVRDPSFDHIETVSKIDPYTWNKFKLTKIVVRDPLAVRRLRNDVPKAYEAHIKYF -----1111----------------------------3333---1111------------ NNYMYDIGLIPGMPYVVKNGKLESVYLSLDEKDVEEIKKAFADSDEMTRQMAVDWLPIFE ---------2222----%%%%-------------------1111----------3333-- TEIPKIKRVAIDIEVYTPVKGRIPDSQKAEFPIISIALAGSDGLKKVLVLNRNDVNEGSV ------------------------3333-----------1111----------------- KLDGISVERFNTEYELLGRFFDILLEYPIVLTFNGDDFDLPYIYFRALKLGYFPEEIPID -iiii------------------1111------3333---------------1111---- VAGKDEAKYLAGLHIDLYKFFFNKAVRNYAFEGKYNEYNLDAVAKALLGTSKVDTLISFL --------1111---333333333333----------------------------3333- 
DVEKLIEYNFRDAEITLQLTTFNNDLTMKLIVLFSRISRLGIEELTRTEISTWVKNLYYW ---------------------%%%%---------------33331111------------ EHRKRNWLIPLKEEILAKSSNAVVIDPPAGIFFNITVLDFASLYPSIIRTWNLSYETVDI ----------3333--3333---------------------------------1111--- QQCKKPYEVKDETGEVLHIVCMDRPGITAVITGLLRDFRVKIYKKKAKNPNNSEEQKLLY ------------------------------------------------1111-------- DVVQRAMKVFINATYGVFGAETFPLYAPRVAESVTALGRYVITSTVKKAREEGLTVLYGD ----------1111-----3333---3333------------------------------ TDSLFLLNPPKNSLENIIKWVKTTFNLDLEVDKTYKFVAFSNYFGVYQDGKVDIKGMLVV ----------------------------------------------1111---------- KKVFNEVKELMISINSPNDVKEIKRKIVDVVKGSYEKLKIDAEKYLEALRSTFEQILRAF -----------3333---1111--------3333---------1111-1111--1111-- GVSWDEI ------- >Photosystem II reaction c; SWP:Q8DHJ2; PDB:1S5LZ; MTILFQLALAALVILSFVMVIGVPVAYASPQDWDRSKQLIFLGSGLWIALVLVVGVLN ---------------------3333-------3333------------------3333 >NAD-DEPENDENT DEACETYLASE; SWP:P75960; PDB:1S5PA; KPRVLVLTGAGISAESGIRTFRAADGLWEEHRVEDVATPEGFDRDPELVQAFYNARRRQL --------33333333------1111-%%%%3333----------------------333 QQPEIQPNAAHLALAKLQDALGDRFLLVTQNIDNLHERAGNTNVIHMHGELLKVRCSQSG 31111---------------!!!!------------3333-----11111111------- QVLDWTGDVTPEDKCPLRPHVVWFGEMPLGMDEIYMALSMADIFIAIGTSGHVYPAAGFV ----------------------2222----------------------------3333-- HEAKLHGAHTVELNLEPSQEFAEKYYGPASQVVPEFVEKLLKGLK ---1111--------------------3333-------------- >PROTEIN YBGC; SWP:P08999; PDB:1S5UA; TLFRWPVRVYYEDTDAGGVVYHASYVAFYERARTEMLRHHHFSQQALMAERVAFVVRKMT ---------3333-3333--3333------------3333-------1111--------- VEYYAPARLDDMLEIQTEITSMRGTSLVFTQRIVNAENTLLNEAEVLVVCVDPLKMKPRA -------2222-----------------------1111--------------1111---- LPKSIVAEF -3333---- >TOLA PROTEIN; SWP:P19934; PDB:1S62A; AEFGNTKNNGASGADINNYAGQIKSAIESKFYDASSYAGKTCTLRIKLAPDGMLLDIKPE --------------------------1111--3333------------1111-------- GGDPALCQAALAAAKLAKIPKPPSQAVYEVFKNAPLDFKPHH --3333-------1111------------3333--------- >Heme-regulated cyclic 
AMP; SWP:P76129; PDB:1S66L; NAADGIFFPALEQNMMGAVLINENDEVMFFNPAAEKLWGYKREEVIGNNIDMLIPRDLRP -1111-3333-----------1111-----3333------333322223333--333311 AHPEYIRHNREGGKARVEGMSRELQLEKKDGSKIWTRFALSKVSAEGKVYYLALVRDAS 11----------%%%%-----------1111-------------iiii----------- >RNA LIGASE 2; SWP:P32277; PDB:1S68A; MFKKYSSLENHYNSKFIEKLYSLGLTGGEWVAREKIHGTNFSLIIERDKVTCAKRTGPIL ---------1111-------1111---------------------1111----1111--1 PAEDFFGYEIILKNYADSIKAVQDIMETSAVVSYQVFGEFAGPGIQKNVDYCDKDFYVFD 111-%%%%3333-----------------------------2222--------------- IIVTTESGDVTYVDDYMMESFCNTFKFKMAPLLGRGKFEELIKLPNDLDSVVQDYNFTVD ----1111----------------------------33331111---------------- HAGLVDANKCVWNAEAKGEVFTAEGYVLKPCYPSWLRNGNRVAIKCKNSKFSE -----------------------------------1111--------3333-- >CYANOGLOBIN; SWP:P73925; PDB:1S69A; STLYEKLGGTTAVDLAVDKFYERVLQDDRIKHFFADVDMAKQRAHQKAFLTYAFGGTDKY -------------------------------1111----------------1111----- DGRYMREAHKELVENHGLNGEHFDAVAEDLLATLKEMGVPEDLIAEVAAVAGAPAHKRDV ----------------------------------1111-3333---------3333---- LNQ --- >PHOSPHOLIPASE A2 ISOFORM ; SWP:P60043; PDB:1S6BA; NTYQFKNMIQCTVPKRSWWDFADYGCYCGRGGSGTPIDDLDRCCQVHDNCYNSAREQGGC ----------------3333------------------------------------2222 RPKQKTYSYECKAGTLSCSGSNNSCAATVCDCDRLAAICFAGAPYNDNNYNIDLKARCQ 3333--------------3333-----------------1111--3333---3333--- >Phospholipase A2 isoform ; SWP:P60044; PDB:1S6BB; NRWQFKNMISCTVPSRSWWDFADYGCYCGRGGSGTPVDDLDRCCQVHDNCYNEAEKISGC ----------------3333------------------------------------2222 NPRFRTYSYECTAGTLTCTGRNNACAASVCDCDRLAAICFAGAPYNDNNYNIDLQARCN 3333-------iiii------------------------1111--1111---3333--- >ALBUMIN 8; SWP:P23110; PDB:1S6DA; PYGRGRTESGCYQQMEEAEMLNHCGMYLMKNLGERSQVSPRMREEDHKQLCCMK ---------3333-3333---33331111------------------------- >CALCIUM-DEPENDENT PROTEIN; SWP:P28583; PDB:1S6IA; AERLSEEEIGGLKELFKMIDTDNSGTITFDELKDGLKRVGSELMESEIKDLMDAADIDKS -----3333--3333------------3333---1111-----3333------------- 
GTIDYGEFIAATVHLNKLEREENLVSAFSYFDKDGSGYITLDEIQQACKDFGLDDIHIDD -------------------------------3333------------3333-----3333 MIKEIDQDNDGQIDYGEFAAMMRKRKGNGGIGRRTMRKTLNLRDALGLVDNGSNQVIEGY 1111-----------3333--------3333----------------------------- FK -- >ALKYLMERCURY LYASE; SWP:P77072; PDB:1S6LA; ADLLVPLLRELAKGRPVSRTTLAGILDWPAERVAAVLEQATSTEYDKDGNIIGYGLTLRE -----------------3333---------------1111-----!!!!----------- TSYVFEIDDRRLYAWCALDTLIFPALIGRTARVSSHCAATGAPVSLTVSPSEIQAVEPAG -----------------3333-3333----------------------1111-------- MAVSLVLPQEAADVRQSFCCHVHFFASVPTAEDWASKHQGLEGLAIVSVHEAFGLGQEFN ---------------------------------3333----------------------- RHLLQTMSSRTP ------------ >ENVELOPE GLYCOPROTEIN; SWP:Q913C7; PDB:1S6NA; ISEFQLKGTTYGVCSKAFKFLGTPADTGHGTVVLELQYTGTDGPCKVPISSVASLNDLTP ---3333------------------------------------------------3333- VGRLVTVNPFVSVATANAKVLIELEPPFGDSYIVVGRGEQQINHHWHKSGSSIGK ------------1111--------------------!!!!----------1111- >KVAP CHANNEL; SWP:P60980; PDB:1S6XA; ECGKFMWKCKNSNDCCKDLVCSSRWKWCVLASPF -----------11112222---1111-------- >6-PHOSPHO-BETA-GLUCOSIDAS; SWP:P84135; PDB:1S6YA; RLKIATIGGGSSYTPELVEGLIKRYHELPVGELWLVDIPEGKEKLEIVGALAKRVEKAGV -------1111-3333-------3333----------1111--------------1111- PIEIHLTLDRRRALDGADFVTTQFRVGGLEARAKDERIPLKYGVIGQETNGPGGLFKGLR --------3333-2222-------2222--------3333-------------------- TIPVILDIIRDEELCPDAWLINFTNPAGVTEAVLRYTKQEKVVGLCNVPIGRGVAKLLGV --------------1111----------------------------3333---------- DADRVHIDFAGLNHVFGLHVYLDGVEVTEKVIDLVAHPLGWEPDFLKGLKVLPCPYHRYY 3333-----------------iiii----------------3333----------3333- FQTDKLAEELEAAKTKGTRAEVVQQLEKELFELYKDPRGGAYYSDAACSLISSIYNDKRD -3333------------------------------------------------------- IQPVNTRNNGAIASISAESAVEVNCVITKDGPKPIAVGDLPVAVRGLVQQIKSFERVAAE -------iiii11111111--------1111---------3333---------------- AAVTGDYQTALVATINPLVPSDTIAKQILDELEAHKEYLPQFFKQAK ---------------1111------------33331111-------- >130 kDa myosin-binding su; SWP:P62140; 
PDB:1S70A; HMADGELNVDSLITRLLEVRGCRPGKIVQMTEAEVRGLCIKSREIFLSQPILLELEAPLK -------3333-------22222222------------------3333------------ ICGDIHGQYTDLLRLFEYGGFPPEANYLFLGDYVDRGKQSLETICLLLAYKIKYPENFFL -----------------------------------------------------1111--- LRGNHECASINRIYGFYDECKRRFNIKLWKTFTDCFNCLPIAAIVDEKIFCCHGGLSPDL --11113333-------------------------1111-----%%%%------------ QSMEQIRRIMRPTDVPDTGLLCDLLWSDPDKDVQGWGENDRGVSFTFGADVVSKFLNRHD -3333------------------------1111-----1111------------------ LDLICRAHQVVEDGYEFFAKRQLVTLFSAPNYCGEFDNAGGMMSVDETLMCSFQILKPSE ----------1111----------------2222-----------1111----------- KKAKYQYGG --------- >130 kDa myosin-binding su; SWP:Q90624; PDB:1S70B; MKMADAKQKRNEQLKRWIGSETDLEPPVVKRKKTKVKFDDGAVFLAACSSGDTEEVLRLL ----3333------3333--1111---------------------------3333----- ERGADINYANVDGLTALHQACIDDNVDMVKFLVENGANINQPDNEGWIPLHAAASCGYLD ---------1111-------1111--------1111-1111----------------333 IAEYLISQGAHVGAVNSEGDTPLDIAEEEAMEELLQNEVNRQGVDIEAARKEEERIMLRD 3---------1111-1111-3333---------------------3333----------- ARQWLNSGHINDVRHAKSGGTALHVAAAKGYTEVLKLLIQARYDVNIKDYDGWTPLHAAA --------------------3333--------------1111-1111-1111-------- HWGKEEACRILVENLCDMEAVNKVGQTAFDVADEDILGYLEELQKKQNLLH -----------1111-1111-1111-3333-------------1111---- >HEPATOCYTE NUCLEAR FACTOR; SWP:O08755; PDB:1S7EA; EINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRM ----3333--------------------------33331111----------3333---- WKWLQEPEFQRMSALRLAACKRKEQEHGKDRGNTPKKPRLVFTDVQRRTLHAIFKENKRP ------------------------------------------------3333-------- SKELQITISQQLGLELSTVSNFFMNAR 3333---1111----3333--3333-- >ACETYL TRANSFERASE; SWP:NA; PDB:1S7FA; MEIIPVSTTLELRAADESHVPALHQLVLKNKAWLQQSLDWPTSQEETRKHVQGNILLHQR ------1111-----3333----------33331111----------------------- GYAKMYLIFCQNEMAGVLSFNAIEPINKAAYIGYWLDESFQGQGIMSQSLQALMTHYARR ---------iiii-----------1111--------3333-------------------- GDIRRFVIKCRVDNQASNAVARRNHFTLEGCMKQAEYLNGDYHDVNMYARII 
----------1111-----------------------iiii----------- >YKOF; SWP:O34911; PDB:1S7HA; SRIAGFRFSLYPMTDDFISVIKSALKKTDTSKVWTKTDHISTVLRGSIDHVFDAAKAIYL ---------------3333----------1111--------------------------- HAANSEQHIVMNGTFSIGCPGDTQGDTYLSKGDKRVNEDAVRGLKAEAPCQFALYPMNEP ------------------2222-------1111-11111111----------------11 DYMGLIMEAVDIAKAQGTFVQGVHYASELDGDAHDVFSTLEAVFRMAEQQTNHITMTVNL 11--------------------2222-----3333------------------------- SANSPS ------ >HYPOTHETICAL PROTEIN PA13; SWP:Q9I3Z5_PSEAE; PDB:1S7IA; LYFQGNMKYLCLIYFDEAKLAAVPAEELAAIVDECMTYSDQLGKAGHYIASHALQSVQTA ---------------11113333--------------------------------3333- TTLRHQGGRLAMTDGPFAETKEQLGGFYLIEARDLNQALQIAAKIPPGRLGCVEVRPVKE ----------------------------------------3333---1111--------- WEGS ---- >PHENAZINE BIOSYNTHESIS PR; SWP:Q839P3; PDB:1S7JA; MSYPYYIVDAFAEEVFKGNPAAVYVLEKWLPEAVMQNIAIENNLSETAFTVKEGQSYALR ------------------------------------------------------------ WFTPEREIDLCGHATLATAFVLFNYYSVAEETLHFTSQSGPLAVTKKEEYYYLDFPYILP --3333----------------------------------------!!!!---------- ERIPILPEYEAALGTKIYEAYLGRDLFFVLKDEETVAKITPDFSALKALDLGVGVIVTAS ---------------------------------------------3333----------- GDSVDFVSRTFFPKLRINEDPVCGSAHANLIPYWGKRLNQTTLSAYQVSPRGGFLTCEVK -----------3333-------3333---------1111---------3333-------2 ENRVIIGGTAKLFAKGEAYL 222----------------- >ACETYL TRANSFERASE; SWP:Q8ZPC0; PDB:1S7KA; EIIPVSTTLELRAADESHVPALHQLVLKNTRKHVQGNILLHQRGYAKMYLIFCQNEMAGV -----1111-----3333----------------------------------iiii---- LSFNAIEPINKAAYIGYWLDESFQGQGIMSQSLQALMTHYARRGDIRRFVIKCRVDNQAS -------1111--------1111------------------------------1111--- NAVARRNHFTLEGCMKQAEYLNGDYHDVNMYARIIDAD ----1111------------iiii-------------- >HIA; SWP:Q48152; PDB:1S7MA; GAKTEINKDGLTITPANGAGANNANTISVTKDGISAGGQSVKNVVSGLKKFGDANFDPLT ------1111----1111--%%%%-----1111--iiii----------2222---1111 SSADNLTKQNDDAYKGLTNLDEKGTDKQTPVVADNTAATVGDLRGLGWVISADKTTGGST ----------3333---------1111-----------33331111--------2222-- 
EYHDQVRNANEVKFKSGNGINVSGKTVNGRREITFELA ------2222------2222------------------ >Hypothetical UPF0122 prot; SWP:P67255; PDB:1S7OA; EKTNRMNALFEFYAALLTDKQMNYIELYYADDYSLAEIADEFGVSRQAVYDNIKRTEKIL ------------3333-3333--------------------------------------- ETYEMKLHMYSDYVVRSEIFDDMIAHYPHDEYLQEKISILTSIDNR --------------------------1111------------1111 >GENE 0.3 PROTEIN; SWP:P03775; PDB:1S7ZA; TYNNVFDHAYELKENIRYDDIRDTDDLHDAIHAADNAVPHYYADIFSVASEGIDLEFEDS ----------------------1111-3333---1111--------------------33 GLPDTKDVIRILQARIYEQLTIDLWEDAEDLLNEYLEEVEE 33----3333------------------------------- >PHOSPHOLIPASE A2 HOMOLOG; SWP:P49121; PDB:1S8IA; SLLELGKMILQETGKNAITSYGSYGCNCGWGHRGQPKDATDRCCFVHKCCYKKLTDCNHK ---------------3333---------2222------------------1111---333 TDRYSYSWKNKAIICEEKNPCLKEMCECDKAVAICLRENLDTYNKKYKAYFKFKCKKPET 3-------%%%%--------------------------3333--11113333-------- C - >TOXIN BMKK4; SWP:Q95NJ8; PDB:1S8KA; TQCQSVRDCQQYCLTPDRCSYGTCYCKTT -------------------%%%%------ >PUTATIVE ANTITERMINATOR; SWP:O06143; PDB:1S8NA; AVPRRVLIAEDEALIRMDLAEMLREEGYEIVGEAGDGQEAVELAELHKPDLVIMDVKMPR -----------------------1111--------------------------------- RDGIDAASEIASKRIAPIVVLTAFSQRDLVERARDAGAMAYLVKPFSISDLIPAIELAVS ----------1111--------1111------3333----------3333---------- RFREITALEGEVATLSERLETRKLVERAKGLLQTKHGMTEPDAFKWIQRAAMDRRTTMKR ------------------------------------------------------------ VAEVVLETLG ---------- >S-SYNTAXIN; SWP:O46345; PDB:1S94A; GFMEEFFEQVEEIRAMIDKISDNVDAVKKKHSDILSMKEELEELMTDIKRTANKVRGKLK -----------------------3333----3333-3333-------------------- TIELNIEQEESADLRIRKTQYSTISRKFVEVMSDYNTTQIDYRDRCKAR -----------3333--------------------------2222---- >SERINE/THREONINE PROTEIN ; SWP:P53041; PDB:1S95A; YSGPKLEDGKVTISFMKELMQWYKDQKKLHRKCAYQILVQVKEVLSKLSTLVETTLKETE ------iiii------------1111------------------1111--------1111 KITVCGDTHGQFYDLLNIFELNGLPSETNPYIFNGDFVDRGSFSVEVILTLFGFKLLYPD -------------------------1111----------------------------111 
HFHLLRGNHETDNMNQIYGFEGEVKAKYTAQMYELFSEVFEWLPLAQCINGKVLIMHGGL 1-----11113333--------------3333-------1111-----%%%%-------- FSEDGVTLDDIRKIERNRQPPDSGPMCDLLWSDPQPQNGRSISKRGVSCQFGPDVTKAFL ----------1111----------------------------1111-------------- EENNLDYIIRSHEVKAEGYEVAHGGRCVTVFSAPNYCDQMGNKASYIHLQGSDLRPQFHQ 1111----------1111----iiii--------2222-----------3333------- FTAVPHPNVKPMAYANTLLQLGMM -----------11113333----- >GUANYLATE KINASE; SWP:P24234; PDB:1S96A; QGTLYIVSAPSGAGKSSLIQALLKTQPLYDTQVSVSHTTRQPRPGEVHGEHYFFVNHDEF ---------------------3333-1111------------------------------ KEISRDAFLEHAEVFGNYYGTSREAIEQVLATGVDVFLDIDWQGAQQIRQKPHARSIFIL --1111-------iiii-----------3333---------------------------- PPSKIELDRRLRGRGQDSEEVIAKRAQAVAESHYAEYDYLIVNDDFDTALTDLKTIIRAE -----------!!!!-----------------1111------------------------ RLRSRQKQRHDALISKLLAD -------------------- >PROTEIN YFHF; SWP:P36539; PDB:1S98A; SITLSDSAAARVNTFLANRGKGFGLRLGVRTSGCSGMAYVLEFVDEPTPEDIVFEDKGVK ----------------1111---------------------------1111----iiii- VVVDGKSMQFLDGTQLDFVKEGLNEGFKFTNPNVKDE ---1111-1111------------------1111--- >YKOF; SWP:O34911; PDB:1S99A; RIAGFRFSLYPTDDFISVIKSALAATDTSKVWTKTDHISTVLRGSIDHVFDAAKAIYLHA -----------1111------------1111----------------------------- ANSEQHIVNGTFSIGCPGDTQGDTYLDKRVNEDAVRGLKAEAPCQFALYPNEPDYGLIEA ---------------2222---------11111111---------------1111----- VDIAKAQGTFVQGVHYASELDGDAHDVFSTLEAVFRAEQQTNHITTVNLSANSPSRKNR -------------2222-----3333--------------------------1111--- >CHLOROCATECHOL 1,2-DIOXYG; SWP:O67987; PDB:1S9AA; ANTRVIELFDEFTDLIRDFIVRHEITTPEYETIMQYMISVGEAGEWPLWLDAFFETTVDS ----------------------------------------1111---------------- VSYGKGNWTSSAIQGPFFKEGAPLLTGKPATLPMRADEPGDRMRFTGSVRDTSGTPITGA --------------------------------------------------1111--1111 VIDVWHSTNDGNYSFFSPALPDQYLLRGRVVPAEDGSIEFHSIRPVPYEIPKAGPTGQLM -------3333-22221111------------1111--------------1111------ NSYLGRHSWRPAHIHIRITADGYRPLITQLYFEGDPYLDSDSCSAVKSELVLPVNKIDID 
-1111--------------2222--------2222-11111111--1111---------- GETWQLVDFNFILQHN ---------------- >PEROXISOMAL MULTIFUNCTION; SWP:P51659; PDB:1S9CA; AIGQKLPPFSYAYTELEAIMYALGVGASIKDPKDLKFIYEGSSDFSCLPTFGVIIGQKSM 1111------------------1111----3333----3333-----1111--1111--- MVLHGEQYLELYKPLPRAGKLKCEAVVADVLVVIIMDVYSYSEKELICHNQFSLFLSDKV ------------------------------------------------------------ KVAVAIPNRPPDAVLTDTTSLNQAALYRLSGDWNPLHIDPNFASLAGFDKPILHGLCTFG -------------------111133331111---1111---------------3333--- FSARRVLQQFADNDVSRFKAVKARFAKPVYPGQTLQTEMWKEGNRIHFQTKVQETGDIVI ----------%%%%1111-----------------------!!!!--------------- SNAYVDLA -------- >Dual specificity mitogen-; SWP:P36507; PDB:1S9IA; QKAKVGELKDDDFERISELGAGNGGVVTKVQHRPSGLIMARKLIHLEIKPAIRNQIIREL --------1111------------------------------------------------ QVLHECNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKEAKRIPEEILGKVSIAVLR 1111--------------------------11113333---------------------- GLAYLREKHQIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMVGTRSYMAPERLQGTH ----------------1111---3333------------------------3333----- YSVQSDIWSMGLSLVELAVGRYPIPPPDAKELEAIFGRPVVDRPAMAIFELLDYIVNEPP -3333----------------------33331111------------------------- PKLPNGVFTPDFQEFVNKCLIKNPAERADLKMLTNHTFIKRSEVEEVDFAGWLCKTLRLN ----------------------3333---------------------------------- QPG --- >Dual specificity mitogen-; SWP:Q02750; PDB:1S9JA; MELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARKLIHLEIKPAIRNQIIRELQVLHE ---3333--------------------1111------------------------3333- CNSPYIVGFYGAFYSDGEISICMEHMDGGSLDQVLKKAGRIPEQILGKVSIAVIKGLTYL --1111-------------------11113333--------3333--------------- REKHKIMHRDVKPSNILVNSRGEIKLCDFGVSGQLIDSMAVGTRSYMSPERLQGTHYSVQ -----------3333---3333--------------1111-----------------333 SDIWSMGLSLVEMAVGRYPIPPPDAKELELMFPPMAIFELLDYIVNEPPPKLPSGVFSLE 3----------------------3333--------------------------------- FQDFVNKCLIKNPAERADLKQLMVHAFIKRSDAEEVDFAGWLCSTIGLN -----------3333--33331111------------------------ >ARGININE DEIMINASE; SWP:P23793; 
PDB:1S9RA; SVFDSKFKGIHVYSEIGELESVLVHEPGREIDYITPARLDELLFSAILESHDARKEHKQF ---3333--------------------3333---3333-1111----------------- VAELKANDINVVELIDLVAETYDLASQEAKDKLIEEFLEDSEPVLSEEHKVVVRNFLKAK ----1111-----------------------------1111---------------1111 KTSRELVEIMMAGITKYDLGIEADHELIVDPMPNLYFTRDPFASVGNGVTIHYMRYKVRQ --------------3333-------------1111-3333---------------3333- RETLFSRFVFSNHPKLINTPWYYDPSLKLSIEGGDVFIYNNDTLVVGVSERTDLQTVTLL ------------1111-------1111----3333-------------1111-------- AKNIVANKECEFKRIVAINVPKWTNLMHLDTWLTMLDKDKFLYSPIANDVFKFWDYDLVN ------1111------------1111-1111------------3333------------- GGAEPQPVENGLPLEGLLQSIINKKPVLIPIAGEGASQMEIERETHFDGTNYLAIRPGVV -----------------------------2222--------------1111----2222- IGYSRNEKTNAALEAAGIKVLPFHGNQLSLGMGNARCMSMPLSRKDVKW --3333-------1111-------33331111-3333------------ >PUTATIVE COMPONENT OF ANA; SWP:NA; PDB:1S9UA; ATFLQRDEFAVTARVLGALFYYSPESHETAPLVQALLNDDWQAQWPLDAEALAPVAAFKT -3331111--------------1111--------------3333---3333--3333--- HSEESLPQAWQRLFIGPYALPSPPWGSVWLDRESVLFGDSTLALRQWRENGIQEPEDHFG ------------------------3333--1111-------------1111--------- SLLLLAAWLAENDRHHECEQLLAWHLFPWSSRFLDVFIDHAGHPFYQALGQLARLTLAQW ---------1111------------3333------------------------------- QAQLIIPVAVKPLFR 1111----------- >HLA class II histocompati; SWP:P01918; PDB:1S9VB; SPEDFVYQFKGMCYFTNGTERVRLVSRSIYNREEIVRFDSDVGEFRAVTLLGLPAAEYWN --------------------------------------3333------3333-------- SQKDILERKRAAVDRVCRHNYQLELRTTLQRRVEPTVTISPSRNLLVCSVTDFYPAQIKV --------------------------1111------------------------------ RWFRNDQEETAGVVSTPLIRNGDWTFQILVMLEMTPQRGDVYTCHVEHPSLQSPITVEWR -----------------------------------------------1111--------- A - >TUBULIN ALPHA CHAIN; SWP:O35414; PDB:1SA0E; ADMEVIELNKCTSGQSFEVILKPPSFDPSLEEIQKKLEAAEERRKYQEAELLKHLAEKRE ---------------------------------3333--1111----------------- HEREVIQKAIEENNNFIKMAKEKLAQKMESNKENREAHLAAMLERLQEKDKHAEEVRKNK -----------------------1111-------------------------1111--33 
ELKE 33-- >TYPE II RESTRICTION ENZYM; SWP:P11405; PDB:1SA3A; MRTELLSKLYDDFGIDQLPHTQHGVTSDRLGKLYEKYILDIFKDIESLKKYNTNAFPQEK -------------3333-1111------------------------------1111---- DISSKLLKALNLDLDNIIDVSSSDTDLGRTIAGGSPKTDATIRFTFHNQSSRLVPLNIKH -------1111-3333--------------2222-----------1111----------- SSKKKVSIAEYDVETICTGVGISDGELKELIRKHQNDQSAKLFTPVQKQRLTELLEPYRE ---------------------------------------1111-----------3333-- RFIRWCVTLRAEKSEGNILHPDLLIRFQVIDREYVDVTIKNIDDYVSDRIAEGSKARKPG ----------------1111---------iiii---------------------3333%% FGTGLNWTYASGSKAKKMQFKG %%-------------------- >SERUM AMYLOID P COMPONENT; SWP:P02743; PDB:1SACA; HTDLSGKVFVFPRESVTDHVNLITPLEKPLQNFTLCFRAYSDLSRAYSLFSYNTQGRDNE ---2222----------------------------------------------2222--- LLVYKERVGEYSLYIGRHKVTSKVIEKFPAPVHICVSWESSSGIAEFWINGTPLVKKGLR ------2222----iiii------------------------------iiii-------2 QGYFVEAQPKIVLGQEQDSYGGKFDRSQSFVGEIGDLYMWDSVLPPENILSAYQGTPLPA 222---------------------1111----------------3333---1111----- NILDWQALNYEIRGYVIIKPLVWV ---1111----------------- >SERRATIA PROTEASE; SWP:P23694; PDB:1SAT; TGYDAVDDLLHYHERGNGIQINGKDSFSNEQAGLFITRENQTWNGYKVFGQPVKLTFSFP --------1111---iiii-%%%%----------1111---1111--------------- DYKFSSTNVAGDTGLSKFSAEQQQQAKLSLQSWADVANITFTEVAAGQKANITFGNYSQD --1111-1111---------------------------------3333------------ RPGHYDYGTQAYAFLPNTIWQGQDLGGQTWYNVNQSNVKHPATEDYGRQTFTHEIGHALG 2222---------------iiii-2222---33333333-----------------1111 LSHPGDYNAGEGDPTYADVTYAEDTRQFSLMSYWSETNTGGDNGGHYAAAPLLDDIAAIQ --------------3333--11113333--------1111--iiii-------------- HLYGANLSTRTGDTVYGFNSNTGRDFLSTTSNSQKVIFAAWDAGGNDTFDFSGYTANQRI -----------------------1111---3333----------------1111------ NLNEKSFSDVGGLKGNVSIAAGVTIENAIGGSGNDVIVGNAANNVLKGGAGNDVLFGGGG --2222---iiii------2222------------------------------------- ADELWGGAGKDIFVFSAASDSAPGASDWIRDFQKGIDKIDLSFFDKEANSSSFIHFVDHF ----------------3333-2222-------2222------------------------ 
SGTAGEALLSYNASSNVTDLSVNIGGHAAPDFLVKIVGQVDVATDFIV --2222----------------------------------3333---- >sulfite reductase, desulf; SWP:O28055; PDB:1SAUA; PELEVKGKKLRLDEDGFLQDWEEWDEEVAEALAKDTRFSPQPIELTEEHWKIIRYLRDYF ----iiii----1111---3333-----------3333---------------------- IKYGVAPPVRMLVKHCKKEVRPDCNLQYIYKLFPQGPAKDACRIAGLPKPTGCV -------3333---------1111--------1111-------------2222- >HYPOTHETICAL PROTEIN FLJ3; SWP:Q0JV00; PDB:1SAWA; RPLSRFWEWGKNIVCVGRNYASEPVLFLKPSTAYAPEGSPILMPAYTRNLHHELELGVVM -33333333--------------------1111--2222----1111------------- GKRCRAVPEAAAMDYVGGYALCLDMTARDVQDECKKKGLPWTLAKSFTASCPVSAFVPKE -------33333333-------------------------3333--2222-------333 KIPDPHKLKLWLKVNGELRQEGETSSMIFSIPYIISYVSKIITLEEGDIILTGTPKGVGP 3--1111------iiii-----3333------------------2222------------ VKENDEIEAGIHGLVSMTFKVEKPEY -2222-----2222------------ >PROBABLE BUTYRATE KINASE ; SWP:Q9X278; PDB:1SAZA; FRILTINPGSTSTKLSIFEDERVKQNFSHSPDELGRFQKILDQLEFREKIARQFVEETGY --------1111------!!!!---------3333---3333------------------ SLSSFSAFVSRGGLLDPIPGGVYLVDGLIKTLKSGKNGEHASNLGAIIAHRFSSETGVPA 1111------------------------------1111-3333----------------- YVVDPVVVDEEDVARVSGHPNYQRKSIFHALNQKTVAKEVARNKRYEENLVVAHGGGISI ----1111--3333--------------3333------------3333------------ AAHRKGRVIDVNNALDGDGPFTPERSGTLPLTQLVDLCFSGKFTYEEKKRIVGNGGLVAY ---iiii-----3333----------------3333-------------------3333- LGTSDAREVVRRIKQGDEWAKRVYRAAYQIAKWIGKAAVLKGEVDFIVLTGGLAHEKEFL ------------1111---------------------1111------------------- VPWITKRVSFIAPVLVFPGSNEEKALALSALRVLRGEEKPKNYSEESRRWRERYDSYLDG -------3333---------------------1111------------------------ ILR --- >RHODOCETIN ALPHA SUBUNIT; SWP:P81397; PDB:1SB2A; DCPDGWSSTKSYCYRPFKEKKTWEEAERFCTEQEKEAHLVSMENRLEAVFVDMVMENNFE --2222--1111----------------------------------------------%% NKIYRSWIGLKIENKGQRSNLEWSDGSSISYENLYEPYMEKCFLMDHQSGLPKWHTADCE %%-------------1111---1111-------------------------------111 EKNVFMCKFQLP 1----------- >Rhodocetin subunit beta; 
SWP:P81398; PDB:1SB2B; FRCPTTWSASKLYCYKPFKEKKTWIEAERFCAKQAENGHLVSIGSAAEADFLDLVIVVNF ---2222--1111-----------------33332222---------------------- RYRAWTGLTERNLKWTNGASVSYENLYEPYIRKCFVVQPWEGKSKWYKADCEEKNAFLCK --------------1111---------------------iiii------1111------- FPKP ---- >COPPER CHAPERONE SCATX1; SWP:P73213; PDB:1SB6A; MTIQLTVPTIACEACAEAVTKAVQNEDAQATVQVDLTSKKVTITSALGEEQLRTAIASAG ------1111----3333--------1111----3333---------------------- HEVE ---- >WBPP; SWP:Q8KN66; PDB:1SB8A; MMSRYEELRKELPAQPKVWLITGVAGFIGSNLLETLLKLDQKVVGLDNFATGHQRNLDEV ----------3333--------1111----------1111-------------------- RSLVSEKQWSNFKFIQGDIRNLDDCNNACAGVDYVLHQAALGSVPRSINDPITSNATNID 111133331111-----1111-------2222---------------------------- GFLNMLIAARDAKVQSFTYAASSSTYGDHPGLPKVEDTIGKPLSPYAVTKYVNELYADVF ---------1111------------!!!!-----1111---------------------- SRCYGFSTIGLRYFNVFGRRQDPNGAYAAVIPKWTSSMIQGDDVYINGDGETSRDFCYIE -----------------2222---1111-------------------------------- NTVQANLLAATAGLDARNQVYNIAVGGRTSLNQLFFALRDGLAENGVSYHREPVYRDFRE --------11113333--------------------------1111------------22 GDVRHSLADISKAAKLLGYAPKYDVSAGVALAMPWYIMFLK 22-----------------------------------1111 >SOYBEAN AGGLUTININ; SWP:P05046; PDB:1SBF; AETVSFSWNKFVPKQPNMILQGDAIVTSSGKLQLNKVDENGTPKPSSLGRALYSTPIHIW -----------2222-----------1111-------1111------------------- DKETGSVASFAASFNFTFYAPDTKRLADGLAFFLAPIDTKPQTHAGYLGLFNENESGDQV ---------------------1111----------1111----!!!!----2222----- VAVEFDTFRNSWDPPNPHIGINVNSIRSIKTTSWDLANNKVAKVLITYDASTSLLVASLV ---------1111-----------------------2222--------3333-------- YPSQRTSNILSDVVDLKTSLPEWVRIGFSAATGLDIPGESHDVLSWSFASNLPH -1111---------3333------------------------------------ >SULFATE-BINDING PROTEIN; SWP:P02906; PDB:1SBP; KDIQLLNVSYDPTRELYEQYNKAFSAHWKQETGDNVVIDQSHGGSGKQATSVINGIEADT ---------------------------------------------------1111----- VTLALAYDVNAIAERGRIDKNWIKRLPDDSAPYTSTIVFLVRKGNPKQIHDWNDLIKPGV ----3333----1111----3333--%%%%-----------2222-----3333--2222 
SVITPNPKSSGGARWNYLAAWGYALHHNNNDQAKAEDFVKALFKNVEVLDSGARGSTNTF ---------------------------%%%%-----------1111-------------- VERGIGDVLIAWENEALLATNELGKDKFEIVTPSESILAEPTVSVVDKVVEKKDTKAVAE ------------------------------------------------3333-------- AYLKYLYSPEGQEIAAKNFYRPRDADVAKKYDDAFPKLKLFTIDEVFGGWAKAQKDHFAD ---------------1111------------3333------3333-------------22 GGTFDQISK 22------- >5,10-METHENYLTETRAHYDROFO; SWP:P75430; PDB:1SBQA; MDKNALRKQILQKRMALSTIEKSHLDQKINQKLVAFLTPKPCIKTIALYEPIKNEVTFVD -----------------3333---------------1111-----------2222---33 FFFEFLKINQIRAVYPKVISDTEIIFIDQETNTFEPNQIDCFLIPLVGFNKDNYRLGFGK 33-------------------------1111---1111-----------1111------- GYYDRYLMQLTRQQPKIGIAYSFQKGDFLADPWDVQLDLIINDE -----3333-----------1111------1111---------- >Putative uncharacterized ; SWP:Q52L64; PDB:1SBSH; EVNLEESGGGLVQPGGSMKLSCVASGFTFSNYWMNWVRQSPEKGLEWVADIRLKSNNYAT ------------2222-----------3333---------------------1111---- LYAESVKGRFTISRDDSKSSVYLQMNNLRAEDTGIYYCTRGAYYRYDYAMDYWGQGTSVT --3333---------1111---------1111-----------1111------------- VSSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL -------------------------------------------%%%%------------- QSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVP %%%%---------3333-----------3333---------- >SKI ONCOGENE; SWP:P12755; PDB:1SBXA; GSHFPSDRSTERCETVLEGETISCFVVGGEKRLCLPQILNSVLRDFSLQQINAVCDELHI ----------------iiii------iiii----------1111-----------1111- YCSRCTADQLEILKVGILPFSAPSCGLITKTDAERLCNALLYG ------------------1111--------------------- >ALCOHOL DEHYDROGENASE; SWP:P10807; PDB:1SBYA; MDLTNKNVIFVAALGGIGLDTSRELVKRNLKNFVILDRVENPTALAELKAINPKVNITFH --2222---------------------------------------------3333----- TYDVTVPVAESKKLLKKIFDQLKTVDILINGAGILDDHQIERTIAINFTGLVNTTTAILD --11113333-----------------------------------------------333 FWDKRKGGPGGIIANICSVTGFNAIHQVPVYSASKAAVVSFTNSLAKLAPITGVTAYSIN 3-3333------------1111--1111-------------------3333--------- PGITRTPLVHTFNSWLDVEPRVAELLLSHPTQTSEQCGQNFVKAIEANKNGAIWKLDLGT 
-----3333-----%%%%1111--------------------------2222----iiii LEAIEWTKHWDSHI -------------- >PROBABLE AROMATIC ACID DE; SWP:Q9X728; PDB:1SBZA; KLIVGTGATGAPLGVALLQALREPNVETHLVSKWAKTTIELETPYSARDVAALADFSHNP ---------3333------3333--------3333--------------3333-----11 ADQAATISSGSFRTDGIVIPCSKTLAGIRAGYADGLVGRAADVVLKEGRKLVLVPREPLS 11--11113333------------------------------------------------ TIHLENLALSRGVAVPPPAFYNHPETVDDIVHHVVARVLDQFGLEHPRWQGL -------------------1111--3333--------3333----------- >INTERLEUKIN-1 BETA CONVER; SWP:P29466; PDB:1SC3A; TSSGSEGNVKLCSLEEAQRIWKQKSAEIYPIMDKSSRTRLALIICNEEFDSIPRRTGAEV ---1111-------------------------3333------------------2222-- DITGMTMLLQNLGYSVDVKKNLTASDMTTELEAFAHRPEHKTSDSTFLVFMSHGIREGIC ---------1111-----------------------3333--------------3333-- GKKHSEQVPDILQLNAIFNMLNTKNCPSLKDKPKVIIIQAARGDSPGVVWFKD 11113333----3333---------3333------------------------ >Caspase-1 [Precursor]; SWP:P29466; PDB:1SC3B; AIKKAHIEKDFIAFCSSTPDNVSWRHPTMGSVFIGRLIEHMQEYACSCDVEEIFRKVRFS -----------------2222-------------------------------------11 FEQPDGRAQMPTTERVTLTRCFYLFPGH 11----------------------2222 >D-3-PHOSPHOGLYCERATE DEHY; SWP:P08328; PDB:1SC6A; EKDKIKFLLVEGVHQKALESLRAAGYTNIEFHKGALDDEQLKESIRDAHFIGLRSRTHLT ---------------------1111-----------3333-------------------- EDVINAAEKLVAIGAFAIGTNQVDLDAAAKRGIPVFNAPFSNTRSVAELVIGELLLLLRG -------------------1111-----1111-----1111------------------- VPEANAKAHRGVGNSFEARGKKLGIIGYGHIGTQLGILAESLGYVYFYDIENKLPLGNAT -------1111------2222----------------------------------!!!!- QVQHLSDLLNSDVVSLHVPENPSTKNGAKEISLKPGSLLINASRGTVVDIPALADALASK ---3333-------------3333---------2222----------------------- HLAGAAIDVDPFTSPLAEFDNVLLTPHIGGSTQEAQENIGLEVAGKLIKYSDNGSTLSAV ---------33331111-1111-----------3333------------------2222- NFPEVSLPLHGGRRLHIHENRPGVLTALNKIFAEQGVNIAAQYLQTSAQGYVVIDIEADE ----------------------------------------------------------33 DVAEKALQAKAIPGTIRARLLY 33---------2222------- >STEM CELL FACTOR; SWP:P21583; PDB:1SCFA; 
NVKDVTKLVANLPKDYITLKYVPGDVLPSHCWISEVVQLSDSLTDLLDKFSNISEGLSNY --------11111111-----------33333333---------3333------------ SIIDKLVNIVDDLVECVKSPEPRLFTPEEFFRIFNRSIDAFKDFVVASETSDCVVS --------------3333---------------------------3333------- >PEANUT PEROXIDASE, MAJOR ; SWP:P22195; PDB:1SCHA; LSSNFYATKCPNALSTIKSAVNSAVAKEARMGASLLRLHFHDCFVQGCDASVLLDDTSNF ---1111--1111--------------3333------------------3333------- TGEKTAGPNANSIRGFEVIDTIKSQVESLCPGVVSCADILAVAARDSVVALGGASWNVLL --1111------------------------------------------1111-------- GRRDSTTASLSSANSDLPAPFFNLSGLISAFSNKGFTTKELVTLSGAHTIGQAQCTAFRT --------3333------1111---------1111------------------3333--- RIYNESNIDPTYAKSLQANCPSVGGDTNLSPFDVTTPNKFDNAYYINLRNKKGLLHSDQQ ----------------1111----3333----3333---------------------333 LFNGVSTDSQVTAYSNNAATFNTDFGNAMIKMGNLSPLTGTSGQIRTNCRKTN 3---3333---------------------------------------1111-- >Subtilisin E [Precursor]; SWP:P04189; PDB:1SCJB; EKKYIVGFKQTMSAMSSAKKKDVISQKGGKVEKQFKYVNAAAATLDEKAVKELKKDPSVA -----------------------1111------------------------33331111- YVEEDHIAHEY ----------- >SCORPION TOXIN OSK1; SWP:P55896; PDB:1SCO; GVIINVKCKISRQCLEPCKKAGMRFGKCMNGKCHCTPK ---------3333------------------------- >HEMOGLOBIN; SWP:P14821; PDB:1SCTA; VDAAVAKVCGSEAIKANLRRSWGVLSADIEATGLMLMSNLFTLRPDTKTYFTRLGDVQKG -------1111------------1111----------------11113333111111113 KANSKLRGHAITLTYALNNFVDSLDDPSRLKCVVEKFAVNHINRKISGDAFGAIVEPMKE 333-----------------1111----------------3333--3333---------- TLKARMGNYYSDDVAGAWAALVGVVQAAL -----!!!!---------------3333- >Globin-2 B chain; SWP:P14822; PDB:1SCTB; KVAELANAVVSNADQKDLLRMSWGVLSVDMEGTGLMLMANLFKTSPSAKGKFARLGDVSA --------1111--------------------------------3333333333331111 GKDNSKLRGHSITLMYALQNFVDALDDVERLKCVVEKFAVNHINRQISADEFGEIVGPLR 1111-----------------1111-----------------1111-3333--------- QTLKARMGNYFDEDTVAAWASLVAVVQASL ------!!!!----------------1111 >SCYLLATOXIN; SWP:P16341; PDB:1SCY; AFCNLRMCQLSCRSLGLLGKCIGDKCECVKH ------------1111-----3333------ 
>DIHYDROLIPOAMIDE SUCCINYL; SWP:P07016; PDB:1SCZA; ARSEKRVPMTRLRKRVAERLLEAKNSTAMLTTFNEVNMKPIMDLRKQYGEAFEKRHGIRL ----------------------1111---------------------------------- GFMSFYVKAVVEALKRYPEVNASIDGDDVVYHNYFDVSMAVSTPRGLVTPVLRDVDTLGM -3333-----------3333----!!!!-------------------------3333--- ADIEKKIKELAVKGRDGKLTVEDLTGGNFTITNGGVFGSLMSTPIINPPQSAILGMHAIK -------------------3333----------3333----------------------- DRPMAVNGQVEILPMMYLALSYDHRLIDGRESVGFLVTIKELLEDPTRLLLDV -----iiii-------------------------------------3333--- >PENICILLINASE REPRESSOR; SWP:Q6UB84; PDB:1SD4A; QVEISAEWDVNIIWDKKSVSANEIVVEIQKYKEVSDKTIRTLITRLYKKEIIKRYKSENI -----3333--3333-----------1111----------------1111------iiii YFYSSNIKEDDIKKTAKTFLNKLYGGDKSLVLNFAKNEELNNKEIEELRDILNDISKK -------3333------------%%%%-------1111-------------------- >COAGULATION FACTOR V; SWP:Q28107; PDB:1SDDA; AKLRQFYVAAQSIRWNTSFKKIVYREYEAYFQKEKPQSRTSGLLGPTLYAEVGDIMKVHF --------------------------------------------------2222------ KNKAHKPLSIHAQGIKYSKFSEGASYSDHTLPMEKMDDAVAPGQEYTYEWIISEHSGPTH -----------------1111---------3333-------------------------- DDPPCLTHIYYSYVNLVEDFNSGLIGPLLICKKGTLTEDGTQKMFEKQHVLMFAVFDESK ------------------------------------1111----------------3333 SWNQTSSLMYTVNGYVNGTMPDITVCAHLIGMSSGPELFSIHFNGQVLEQNHHKISAITL ------------------------------------------------------------ VSATSTTGRWTIASLIPRHFQAGMQAYI ---------------------------- >HYPOTHETICAL PROTEIN YCFC; SWP:P25746; PDB:1SDIA; GAKNYYDITLALAGICQSARLVQQLAHQGHCDADALHVSLNSIIDMNPSSTLAVFGGSEA -------------------------------------------------3333----333 NLRVGLETLLGVLNASSRQGLNAELTRYTLSLMVLERKLSSAKGALDTLGNRINGLQRQL 3------------------3333------------------2222--------------- EHFDLQSETLMSAMAAIYVDVISPLGPRIQVTGSPAVLQSPQVQAKVRATLLAGIRAAVL ---1111--------------3333---------3333---------------------- WHQVGGGRLQLMFSRNRLTTQAKQILAHLTPEL -1111------------------------3333 >KINESIN HEAVY CHAIN-LIKE ; SWP:Q41460; PDB:1SDMA; 
KIRVYCRLRPLCEKEIIAKERNAIRSVDEFTVEHLWKDDKAKQHMYDRVFDGNATQDDVF ---------------1111-------------------------------1111------ EDTKYLVQSAVDGYNVCIFAYGQTGSGKTFTIYGADSNPGLTPRAMSELFRIMKKDSNKF ----------------------2222--------3333---------------------- SFSLKAYMVELYQDTLVDLLLPKQAKRLKLDIKKDSKGMVSVENVTVVSISTYEELKTII -----------------11113333---------1111-------------3333----- QRGSEQRHTTGTLMNEQSSRSHLIVSVIIESTNLQTQAIARGKLSFVDLAGSERVKKEAQ ---3333-----11111111---------------------------------------- SINKSLSALGDVISALSSGNQHIPYRNHKLTMLMSDSLGGNAKTLMFVNISPAESNLDET -----------------------1111-------------------------3333---- HNSLTYASRVRSIVNDPSKNVSSKEVARLKKLVSYWELEEIQDE -------------------------------1111--------- >BSTYI; SWP:Q84AF2; PDB:1SDOA; MRIVEVYSHLNGLEYIQVHLPHIWEEIQEIIVSIDAEACRTILYSPVALNEAFKEKLEAK ----------------------------------3333-------------------111 GWKESRTNYYVTADPKLIRETLSLEPEEQKKVIEAAGKEALKSYNQTDFVKDRVAIEVQF 1-------------------1111---------1111-------------%%%%------ GKYSFVAYDLFVKHMAFYVSDKIDVGVEILPMKELSKEMSSGISYYEGELYNVIRQGRGV -1111-----------------------------3333-2222---------3333---- PAVPLVLIGIAP ------------ >APOPTOSIS 1 INHIBITOR; SWP:Q24306; PDB:1SE0A; NDLNREETRLKTFTDWPLDWLDKRQLAQTGMYFTHAGDKVKCFFCGVEIGSWEQEDQPVP ---------1111----1111-------------------------------11113333 EHQRWSPNCPLLRRRTTNNVPINAEALDRILPPISYD -----1111-1111----------------------- >SINGLE-STRAND BINDING PRO; SWP:Q9RY51; PDB:1SE8A; RGNHVYLIGALARDPELRYTGNGAVFEATVAGEDRVRNLPWYHRVSILGKPAEWQAERNL -------------------1111-------------------------------3333-- KGGDAVVVEGTLEYRQWEKRSAVNVKALREQLGTQPELIQDAGGGVRSGANEVLVLGNVT 2222------------------------------------1111---------------- RDPEIRYTPAGDAVLSLSIAVNENYQDRQGQRQEKVHYIDATLWRDLAENKELRKGDPVI -------1111---------------1111-----------------------2222--- GRLVNEGWTRNSTRVEATRVEALAR ------------------------- >UBIQUITIN FAMILY; SWP:Q9MAB9; PDB:1SE9A; EAEVHNQLEIKFRLTDGSDIGPKAFPDATTVSALKETVISEWPREKENGPKTVKEVKLIS 
-------------1111--------1111-------------3333-----3333----% AGKVLENSKTVKDYRSPVSNLAGAVTTMHVIIQAPVTEKEK %%%--11113333---------------------------- >HYPOTHETICAL PROTEIN YHAI; SWP:O07517; PDB:1SEDA; DSMDHRIERLEYYIQLLVKTVDMDRYPFYALLIDKGLSKEEGEAVMRICDELSEELATQK 3333-------------11111111----------------------------------1 AQGFVTFDKLLALFAGQLNEKLDVHETIFALYEQGLYQELMEVFIDIMKHFD 111---3333--------1111------------------------------ >CONSERVED HYPOTHETICAL PR; SWP:Q82ZQ3; PDB:1SEFA; KELLTSRAVIKKDNYAIIPHDGLVQNAVPGFENVDISILGSPKLGATFVDYIATFHKNGQ ------------------3333-----2222---------3333-----------2222- QTTGFGGDGIQTLVYVIDGRLRVSDGQETHELEAGGYAYFTPEMKMYLANAQEADTEVFL ------iiii----------------------2222----1111---------------- YKKRYQPLAGHQPYKVVGSIHDQQPEEYEGMTDVLLWSLLPKEFDFDMNMHILSFEPGAS -------2222-------1111----2222------------3333---------2222- HAYIETHVQEHGAYLISGQGMYNLDNEWYPVEKGDYIFMSAYVPQAAYAVGREEPLMYVY -----------------------%%%%----2222----2222----------------- SKDANREPEL ---------- >AAH2: LQH-ALPHA-IT (FACE); SWP:P01484; PDB:1SEGA; VKDGYIVKNYNCTYFCFRNAYCNEECTKLKGESGYCQWASPYGNACYCYKLPDHVPIRVP --------------------------1111---------1111--------3333----- GKCH ---- >RIBOSOMAL PROTEIN S8; SWP:P12879; PDB:1SEIA; VMTDPIADMLTAIRNANMVRHEKLEVPASKIKREIAEILKREGFIRDYEYIEDNKQGILR ----------------1111-------------------1111---------%%%%---- IFLKYGPNERVITGLKRISKPGLRVYVKAHEVPRVLNGLGIAILSTSQGVLTDKEARQKG -------------------2222----3333----2222------1111----------- TGGEIIAYVI ---------- >SERPIN K; SWP:P14754; PDB:1SEK; GETDLQKILRESNDQFTAQMFSEVVKANPGQNVVLSAFSVLPPLGQLALASVGESHDELL ------------------------------------3333-------------------- RALALPNDNVTKDVFADLNRGVRAVKGVDLKMASKIYVAKGLELNDDFAAVSRDVFGSEV 1111-------------3333---2222-------------------------------- QNVDFVKSVEAAGAINKWVEDQTNNRIKNLVDPDALDETTRSVLVNAIYFKGSWKDKFNK ---3333---------------%%%%-----3333-1111------------------33 ERTMDRDFHVSKDKTIKVPTMIGKKDVRYADVPELDAKMIEMSYEGDQASMIIILPNQVD 
33------------------------------1111-------2222----------111 GITALEQKLKDPKALSRAEERLYNTEVEIYLPKFKIETTTDLKEVLSNMNIKKLFTPGAA 1-----------------1111------------------------1111-33332222- RLENLLKTKESLYVDAAIQKAFIEVNEEGAEAAAANAFKITTYSFHFVPKVEINKPFFFS -11111111--------------------------2222--------------------- LKYNRNSMFSGVCVQP --%%%%---------- >SEM-5; SWP:P29355; PDB:1SEMA; ETKFVQALFDFNPQESGELAFKRGDVITLINKDDPNWWEGQLNNRRGIFPSNYVCPYN --------------2222---------------1111------------1111----- >THIOREDOXIN-LIKE PROTEIN ; SWP:O95881; PDB:1SENA; LGKGFGDHIHWRTLEDGKKEAAASGLPLMVIIHKSWCGACKALKPKFAESTEISELSHNF -iiii1111------------------------1111-------------------1111 VMVNLEDEEEPKDEDFSPDGGYIPRILFLDPSGKVHPEIINENGNPSYKYFYVSAEQVVQ -----!!!!---3333-------------1111--1111-11111111------------ GMKEAQERLTGDAFR ---------%%%%-- >Ig heavy chain V region 5; SWP:P18525; PDB:1SEQH; EVKLVESGGGLVQPGGSLKLSCAASGFTFSTYTMSWARQTPEKKLEWVAYISKGGGSTYY ------------2222-----------3333--------1111----------------- PDTVKGRFTISRDNAKNTLYLQMSSL 3333--------3333---------- >Ig heavy chain V region 5; SWP:P18525; PDB:1SEQL; DIVLTQSPAIMSASLGSSVTLTCSASSSVSYMHWYQQKSGTSPVLLIYTTSNLASGVPSR -------------2222--------------------2222------------2222111 FSGSGSGTFYSLTISSVEASDAADYYCHQWSSYPWTFGGGTKLEIKRADAAPTVSIFPPS 1----------------3333--------------------------------------3 SEQLTSGGASVVCFLNNFYPKSINSKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL 333-------------------------iiii---------------------------- TKNEYERHNSYTCEATHKSPIVKSFNRS ----1111-------------------- >PROTEIN (SERYL-TRNA SYNTH; SWP:SYS_THETH; PDB:1SERA; MVDLKRLRQEPEVFHRAIREKGVALDLEALLALDREVQREKEARLEALLLQVPLPPWPGA ------3333---------------3333-----33333333-----3333-----1111 PVGGEEANREIKRVGGPPEFSFPPLDHVALMEKNGWWEPRISQVSGSRSYALKGDLALYE ---1111------------------------1111----3333----------------- LALLRFAMDFMARRGFLPMTLPSYAREKAFLGTGHFPAYRDQVWAIAETDLYLTGTAEVV -------------------------3333---------3333---2222----------3 LNALHSGEILPYEALPLRYAGYAPAFRSEAGSFGKDVRGLMRVHQFHKVEQYVLTEASLE 
333-2222--3333-----------------------!!!!------------------- ASDRAFQELLENAEEILRLLELPYRLVEVATGDMGPGKWRQVDIEVYLPSEGRYRETHSC -----------------------------1111-1111----------3333-------- SALLDWQARRANLRYRDPEGRVRYAYTLNNTALATPRILAMLLENHQLQDGRVRVPQALI --!!!!3333------1111---------------3333----11111111----33333 PYMGKEVLEPCG 333--------- >SERYL-tRNA SYNTHETASE; SWP:P34945; PDB:1SESA; MVDLKRLRQEPEVFHRAIREKGVALDLEALLALDREVQELKKRLQEVQTERNQVAKRVPK ------------------1111-------------------------------------- APPEEKEALIARGKALGEEAKRLEEALREKEARLEALLLQVPLPPWPGAPVGGEEANREI -------------------------------------1111----1111---3333---- KRVGGPPEFSFPPLDHVALMEKNGWWEPRISQVSGSRSYALKGDLALYELALLRFAMDFM --------------------1111----3333---------------------------- ARRGFLPMTLPSYAREKAFLGTGHFPAYRDQVWAIAETDLYLTGTAEVVLNALHSGEILP --------------3333-------1111-------------------3333-2222--3 YEALPLRYAGYAPAFRSEAGSFGKDVRGLMRVHQFHKVEQYVLTEASLEASDRAFQELLE 333--------------------------------------------------------- NAEEILRLLELPYRLVEVATGDMGPGKWRQVDIEVYLPSEGRYRETHSCSALLDWQARRA ------1111--------1111-1111---------3333-----------!!!!3333- NLRYRDPEGRVRYAYTLNNTALATPRILAMLLENHQLQDGRVRVPQALIPYMGKEVLEPC -----1111---------------3333--------1111----3333------------ G - >PROTOPORPHYRINOGEN OXIDAS; SWP:O24164; PDB:1SEZA; AKRVAVIGAGVSGLAAAYKLKIHGLNVTVFEAEGKAGGKLRSVSQDGLIWDEGANTTESE -------------------1111------------------------------------3 GDVTFLIDSLGLREKQQFPLSQNKRYIARNGTPVLLPSNPIDLIKSNFLSTGSKLQLLEP 333--------3333-------------iiii--------3333-----3333------- ILWSHESVSGFFQRHFGKEVVDYLIDPFVAGTCGGDPDSLSHHSFPELWNLEKRFGSVIL -----------------------------------3333-----3333---------333 GAIRSKLSKTSANKKRQRGSFSFLGGQTLTDAICKDLREDELRLNSRVLELSCSCTEDSA 3---------------------2222-----------1111------------------- IDSWSIISASPHKRQSEEESFDAVITAPLCDVKSKIAKRGNPFLLNFIPEVDYVPLSVVI ---------------------------1111-------------3333------------ TTFKRENVKYPLEGFGVLVPSKEQQHGLKTLGTLFSSFPDRAPNNVYLYTTFVGGSRNRE 
---3333---------------1111-----------3333-1111--------3333-- LAKASRTELKEIVTSDLKQLLGAEGEPTYVNHLYWSKAFPLYGHNYDSVLDAIDKEKNLP 1111--------------------------------------1111------------22 GLFYAGNHRGGLSVGKALSSGCNAADLVISYLESVS 22----------------------------1111-- >CHAPERONE PROTEIN HTPG; SWP:P10413; PDB:1SF8A; SFIDRVKALLGERVKDVRLTHRLTDTPAIVSTDADESTQAKLFAAAGQKVPEVKYIFELN ---------!!!!------------------------------1111------------1 PDHVLVKRAADTEDEAKFSEWVELLLDQALLAERGTLEDPNLFIRRNQLLVS 111-----1111--------------------------3333---------- >YFHH HYPOTHETICAL PROTEIN; SWP:NA; PDB:1SF9A; VDLGTENLYFQSNAMEKRYSQMTPHELNTEIALLSEKARKAEQHGIINELAVLERKITMA ------------------1111-------------------1111--------------3 KAYLLNPEDYSPGETYRVENTEDEFTISYLNGVFAWGYRTSSPQQEEALPISVLQEKE 333--1111-2222---2222---------!!!!----1111-------3333----- >ADA O6-METHYLGUANINE-DNA ; SWP:P06134; PDB:1SFE; LAVRYALADCELGRCLVAESERGICAILLGDDDATLISELQQMFPAADNAPADLMFQQHV ---------1111------3333--------------------1111--1111------- REVIASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKPKAVRAV --------------------------------11112222--------11113333---- ASACAANKLAIVIPCHRVVRGDGSLSGYRWGVSRKAQLLRREAEN ---1111------3333----------1111-------------- >4-AMINOBUTYRATE AMINOTRAN; SWP:P22256; PDB:1SFFA; NSNKELMQRRSQAIPRGVGQIHPIFADRAENCRVWDVEGREYLDFAGGIAVLNTGHLHPK --------------3333-----------!!!!--1111------%%%%--1111----- VVAAVEAQLKKLSHTCFQVLAYEPYLELCEIMNQKVPGDFAKKTLLVTTGSEAVENAVKI -------1111----3333----------------------------------------- ARAATKRSGTIAFSGAYHGRTHYTLALTGKVNPYSAGMGLMPGHVYRALYPCPLHGISED ------------2222------------------2222-------------3333----- DAIASIHRIFKNDAAPEDIAAIVIEPVQGEGGFYASSPAFMQRLRALCDEHGIMLIADEV --------------3333----------1111---------------------------- QSGAGRTGTLFAMEQMGVAPDLTTFAKSIAGGFPLAGVTGRAEVMDAVAPGGLGGTYAGN --iiii----3333-----------!!!!iiii---------------2222--1111-- PIACVAALEVLKVFEQENLLQKANDLGQKLKDGLLAIAEKHPEIGDVRGLGAMIAIELFE --------------1111------------------33333333-----!!!!------% 
DGDHNKPDAKLTAEIVARARDKGLILLSCGPYYNVLRILVPLTIEDAQIRQGLEIISQCF %%%--------------------------1111-------11113333------------ DEAKQ -1111 >CORE PROTEIN; SWP:P14335; PDB:1SFKA; LSLTGLKRAMLSLIDGRGPTRFVLALLAFFRFTAIAPTRAVLDRWRSVNKQTAMKHLLSF -------------------------------------3333--1111------------- KKELGTLTSAINR ---------3333 >3-DEHYDROQUINATE DEHYDRAT; SWP:Q6GII7; PDB:1SFLA; HVEVVATITPQLETLIQKINHRIDAIDVLELRIDQFENVTVDQVAEMITKLKDSFKLLVT ---------------------3333-------1111------------------------ YRTKLQGGYGQFTNDSYLNLISDLANINGIDMIDIEWQADIDIEKHQRIITHLQQYNKEV --3333--------------------3333-----------------------1111--- IISHHNFESTPPLDELQFIFFKMQKFNPEYVKLAVMPHNKNDVLNLLQAMSTFSDTMDCK ----------------------3333------------3333----------3333---- VVGISMSKLGLISRTAQGVFGGALTYGCIGEPQAPGQIDVTDLKAQVTLY ------33333333-3333--------------2222---------3333 >CONSERVED HYPOTHETICAL PR; SWP:Q9RV77; PDB:1SFNA; MKHLGQTRSALHGSHAVITPETFVRTALAEWPGSAIVLHIAPVVGLGARFVQFTAEMPAG 1111-------1111---3333-----1111---------1111-------------222 AQATESVYQRFAFVLSGEVDVAVGGETRTLREYDYVYLPAGEKHMLTAKTDARVSVFEKP 2-----------------------------2222----2222------------------ YQTVEGVQAPGVYWGNERENPGYPFEGDDHLIARKLLPDEPAFDFMVSTMSFAPGASLPY ---------------3333--------------------3333---------2222---- AEVHYMEHGLLMLEGEGLYKLEENYYPVTAGDIIWMGAHCPQWYGALGRNWSKYLLYKDM --------------------!!!!------------2222-------------------- NRHPL ----- >ASFP; SWP:P29392; PDB:1SFP; LPRNTNCGGILKEESGVIATYYGPKTNCVWTIQMPPEYHVRVSIQYLQLNCNKESLEIID ----------------------------------1111-----------3333------- GLPGSPVLGKICEGSLMDYRSSGSIMTVKYIREPEHPASFYEVLYFQDPQA -2222---------------------------3333--------------- >ANTIGEN 85-A; SWP:P17944; PDB:1SFRA; AFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALYLLDGLRAQDDFSGWDINTPAFE ---2222---------1111----------2222---------------3333------- WYDQSGLSVVMPVGGQSSFYSDWYQPACGKAGCQTYKWETFLTSELPGWLQANRHVKPTG -2222---------2222------------------------------------------ SAVVGLSMAASSALTLAIYHPQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYKASDM 
------3333----------3333----------1111-----------------3333- WGPKEDPAWQRNDPLLNVGKLIANNTRVWVYCGNGKPSDLGGNNLPAKFLEGFVRTSNIK --11113333--3333---------------------3333------------------- FQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLNAMKPDLQRALGATPN ------------------------------------------------ >HYPOTHETICAL PROTEIN; SWP:P84136; PDB:1SFSA; GIWGVDSAQVVTDQLFQCVRTELGYPKFWGRYLSEVPNVSEGLTRDEIVRIRNYGVKVLP ---------------------------------------------------1111----- IYNAFREAVGYANGQVAARNAVFHARRLGIPKNKLLFANIEDFFAVDAAWIAAWVETLYP -------------------------1111-----------1111-------------333 TGYRPGLYADPTKGDFAAAYCEAVSRNNQVAVQAVIWSAAPRPGTTKEQKAPRYQPAAPP 3--------------------------3333---------------3333---------- CSANVWVWQYGRDAEVCPVDTNLADRRLLDFLY ------------------------33331111- >34L PROTEIN; SWP:Q9DHS8; PDB:1SFUA; CTVNDAEIFSLVKKEVLSLNTNDYTTAISLSNRLKINKKKINQQLYKLQKEDTVKMVPSN ---------------11111111-------------------------1111-------- PPKWFKNYNC ------3333 >CONSERVED HYPOTHETICAL PR; SWP:O28271; PDB:1SFXA; HSNPLGELVKALEKLSFKPSDVRIYSLLLERGGRVSEIARELDLSARFVRDRLKVLLKRG -----------------3333--------------------------------------- FVRREIVEKGWVGYIYSAEKPEKVLKEFKSSILGEIERIEKFTDGS ---------------------------------------------- >NRH DEHYDROGENASE [QUINON; SWP:P16083; PDB:1SG0A; AGKKVLIVYAHQEPKSFNGSLKNVAVDELSRQGCTVTVSDLYAMNFEPRATDKDITGTLS ------------1111-----------------------3333-------1111------ NPEVFNYGVETHEAYKQRSLASDITDEQKKVREADLVIFQFPLYWFSVPAILKGWMDRVL --------------1111-------------------------%%%%------------- CQGFAFDIPGFYDSGLLQGKLALLSVTTGGTAEMYTKTGVNGDSRYFLWPLQHGTLHFCG ------3333-1111-2222----------3333-1111---3333----------1111 FKVLAPQISFAPEIASEEERKGMVAAWSQRLQTIWKEEPIPCTAHWHFGQ ----------3333------------------1111------3333---- >Tumor necrosis factor rec; SWP:P07174; PDB:1SG1X; ETCSTGLYTHSGECCKACNLGEGVAQPCGANQTVCEPCLDNVTFSDVVSATEPCKPCTEC ------------------2222----------------2222------------------ LGLQSMSAPCVEADDAVCRCAYGYYQDEETGHCEACSVCEVGSGLVFSCQDKQNTVCEEC 
!!!!------1111------1111---------------2222------!!!!------- PEGTYSDEANHVDPCLPCTVCEDTERQLRECTPWADAECE 2222-----------------1111------1111----- >3,2-TRANS-ENOYL-COA ISOME; SWP:P42126; PDB:1SG4A; SQRVLVEPDAGAGVAVMKFKNPPVNSLSLEFLTELVISLEKLENDKSFRGVILTSDRPGV 1111-----1111-------------------------------3333------------ FSAGLDLTEMCGRSPAHYAGYWKAVQELWLRLYQSNLVLVSAINGACPAGGCLVALTCDY -----3333--------------------------------------3333--3333--- RILADNPRYCIGLNETQLGIIAPFWLKDTLENTIGHRAAERALQLGLLFPPAEALQVGIV -----1111----3333-------------------------1111-------------- DQVVPEEQVQSTALSAIAQWMAIPDHARQLTKAMMRKATASRLVTQRDADVQNFVSFISK ----3333-----------3333-------------------1111-------------- DSIQKSLQM ----3333- >ORF, HYPOTHETICAL PROTEIN; SWP:NA; PDB:1SG5A; MSMNDTYQPINCDDYDNLELACQHHLMLTLELKDGEKLQAKASDLVSRKNVEYLVVEAAG -------------3333------------------------------------------- ETRELRLDKITSFSHPEIGTVVVSES -----1111-----2222-------- >PENTAFUNCTIONAL AROM POLY; SWP:P07547; PDB:1SG6A; PTKISILGRESIIADFGLWRNYVAKDLISDCSSTTYVLVTDTNIGSIYTPSFEEAFRKRA -----iiii-------3333---------------------------------------3 AEITPSPRLLIYNRPPGEVSKSRQTKADIEDWMLSQNPPCGRDTVVIALGGGVIGDLTGF 333-------------3333--------------------1111---------------- VASTYMRGVRYVQVPTTLLAMVDSSIGGKTAIDTPLGKNLIGAIWQPTKIYIDLEFLETL ----iiii--------------1111-------1111------------------3333- PVREFINGMAEVIKTAAISSEEEFTALEENAETILKAVRREVTPGEHRFEGTEEILKARI ---------------1111---------------------------------3333---- LASARHKAYVVSAGLRNLLNWGHSIGHAIEAILTPQILHGECVAIGMVKEAELARHLGIL ---------111111111111--------------------------------------- KGVAVSRIVKCLAAYGLPTSLKDARIRKLTAGKHCSVDQLMFNMALDKKIVLLSAIGTPY 3333--------1111---1111------2222---------3333--------2222-- ETRASVVANEDIRVVLA -----------3333-- >PUTATIVE CATION TRANSPORT; SWP:P39162; PDB:1SG7A; PYKTKSDLPESVKHVLPSHAQDIYKEAFNSAWDQYKDKEDRRDDASREETAHKVAWAAVK ----33333333------------------------------------------------ HEYAKGDDDKWHKKS --------------- >NERVE GROWTH FACTOR; SWP:P00757; PDB:1SGFA; 
NSQPWHVAVYRFNKYQCGGVLLDRNWVLTAAHCYNDKYQVWLGKEEPSDQHRLVSKAIPH --3333----------------1111---1111--------------------------- PDFNMSLLPQP ----------- >Kallikrein 1-related pept; SWP:P00756; PDB:1SGFG; IVGGFKCEKNSQPWHVAVYRYTQYLCGGVLLDPNWVLTAAHCYDDNYKVWLGKNNLFKDE -----------1111-----------------------1111------------1111-- PSAQHRFVSKAIPHPGFNMSLMFL -------------1111------- >EPHRIN TYPE-B RECEPTOR 2; SWP:P28693; PDB:1SGG; YTSFNTVDEWLDAIKMSQYKESFASAGFTTFDIVSQMTVEDILRVGVTLAGHQKKILNSI -----------1111----33333333----3333--3333-3333------------33 QVMRAQM 333333- >CITRATE LYASE, BETA SUBUN; SWP:Q9RUZ0; PDB:1SGJA; PPALLRSVLFAPGNRADLIAKLPRSAPDAVVIDLEDAVPGTAEAKAAARPVAHDAARDLI -----------1111---11111111--------3333---------------------- AAAPHLAVFVRVNALHSPYFEDDLSVLTPELSGVVVPKLEMGAEARQVAQMLQERSLPLP --1111-------1111--333333333333---------3333--------1111---- ILAGLETGAGVWNAREIMEVPEVAWAYFGAEDYTTDLGGKRTPGGLEVLYARSQVALAAR -------------------3333------------------3333--------------- LTGVAALDIVVTALNDPETFRADAEQGRALGYSGKLCIHPAQVALAHEYFG ---------------3333--------1111-------3333--------- >TRICHOMAGLIN; SWP:P84146; PDB:1SGLA; EFDYFILALQWAGTSCRSGGACCPYNGCCKADSPTQFTIHGLRPEYSGGERPSCCTGGSF ------------3333------2222---------------------------------- DPDEIMPFFGKLVEYWPTYRCALEQSCNNRKEILWGQQYEKHGTCASPVIKGEWNYFKKT 3333----------------------%%%%-----------3333--------------- LKLFMKYNVDKALEDAGIVASNSKMYDLKDIVVAVESAVGARPKLRCDEEGLVQKLSLCF -------------1111---------3333-----------------1111--------- DKDFKPRDCVQVGSCPRYVSLPEIPD 1111---------------------- >PUTATIVE HTH-TYPE TRANSCR; SWP:P42105; PDB:1SGMA; GDSREKILHTASRLSQLQGYHATGLNQIVKESGAPKGSLYHFFPNGKEELAIEAVTYTGK -------------------3333--------------3333------------------- IVEHLIQQSMDESSDPVEAIQLFIKKTASQFDNTESIKGIPVGLLASETALISEPLRTVC --------1111----------------11113333------------1111-------- MKVFKSWEAVFARKLMENGFAEEEANQLGTLINSMIEGGIMLSLTNKDKTPLLLIAEQIP ---------------1111-----------------------------3333-----333 VLVR 3--- >PROTEIN C14ORF129; SWP:Q9P0R6; 
PDB:1SGOA; METDCNPMELSSMSGFEEGSELNGFEGTDMKDMRLEAEAVVNDVLFAVNNMFVSKSLRCA -------------------------------------------3333------------- DDVAYINVETKERNRYCLELTEAGLKVVGYAFDQVDDHLQTPYHETVYSLLDTLSPAYRE ---------1111-------1111-----------1111-------1111---------- AFGNALLQRLEALKRDGQS ------------------- >POL POLYPROTEIN; SWP:P03367; PDB:1SGUA; PQITLWQRPLVTIKIGGQLREALLDTGADDTIFEEISLPGRWKPKMIGGIGGFVKVRQYD --------------iiii------1111--------------------2222-------- QIPIEICGHKVIGTVLVGPTPANVIGRNLMTQIGCTLNF -----iiii----------------3333-1111----- >TRNA PSEUDOURIDINE SYNTHA; SWP:O33335; PDB:1SGVA; ATGPGIVVIDKPAGMTSHDVVGRCRRIFATRRVGHAGTLDPMATGVLVIGIERATKILGL -----------2222----------1111----------1111-------!!!!--3333 LTAAPKSYAATIRLGQTTSTEDAEGQVLQSVPAKHLTIEAIDAAMERLRGEILEARPIRI 1111-------------11111111-------3333---------1111----------- DRFELLAARRRDQLIDIDVEIDCSSGTYIRALARDLGDALGVGGHVTALRRTRVGRFELD ----------!!!!---------------------------------------!!!!333 QARSLDDLAERPALSLSLDEACLLMFARRDLTAAEASAAANGRSLPAVGIDGVYAACDAD 3-------------------------------------1111---------------111 GRVIALLRDEGSRTRSVAVLRP 1--------!!!!--------- >PUTATIVE ABC TRANSPORTER; SWP:Q8U2E3; PDB:1SGWA; SKLEIRDLSVGYDKPVLERITMTIEKGNVVNFHGPNGIGKTTLLKTISTYLKPLKGEIIY ------------------------2222------2222-------1111----------i NGVPITKVKGKIFFLPEEIIVPRKISVEDYLKAVASLYGVKVNKNEIMDALESVEVLDLK iii33333333----------1111----------1111------------1111--111 KKLGELSQGTIRRVQLASTLLVNAEIYVLDDPVVAIDEDSKHKVLKSILEILKEKGIVII 13333----------3333-----------1111--1111-----------3333----- SSREELSYCDVNENLHKYST -----1111----3333--- >RNA POLYMERASE; SWP:Q70ET3; PDB:1SH0A; GTYCGAPILGPGSAPKLSTKTKFWRSSTAPLPPGTYEPAYLGGKDPRVKGGPSLQQVMRD --iiii-------------------------2222------1111------------333 QLKPFTEPRGKPPKPSVLEAAKKTIINVLEQTIDPPDKWSFAQACASLDKTTSSGHPHHM 33333---------------------------------------11111111-------- RKNDCWNGESFTGKLADQASKANLMFEEGKNMTPVYTGALKDELVKTDKIYGKIKKRLLW 3333-------------------------------------------------------- 
GSDLATMIRCARAFGGLMDELKTHCVTLPIRVGMNMNEDGPIIFERHSRYRYHYDADYSR ---------------------1111-----22223333--------3333---------3 WDSTQQRAVLAAALEIMVKFSSEPHLAQVVAEDLLSPSVVDVGDFTISINEGLPSGVPCT 333--------------1111-3333---------------------------------- SQWNSIAHWLLTLCALSEVTNLSPDIIQANSLFSFYGDDEIVSTDIKLDPEKLTAKLKEY -----------------------------------!!!!------------------111 GLKPTRPDKTEGPLVISEDLNGLTFLRRTVTRDPAGWFGKLEQSSILRQMYWTRGPNHED 1------------------2222-%%%%----1111-----------------------1 PSETMIPHSQRPIQLMSLLGEAALHGPAFYSKISKLVIAELKEGGMDFYVPRQEPMFRWM 111----3333----------1111----------------1111--------------- RFSDLSTWEGDRNLAPSFVNED ----1111--3333-------- >PLECTIN 1; SWP:Q9QXS1; PDB:1SH5A; FDERDRVQKKTFTKWVNKHLIKHWRAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE -3333-------------1111--3333-----1111-1111------------------ KGRMRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV ---3333------------1111------------------------------------- SGQSEDMTAKEKLLLWSQRMVEGYQGLRCDNFTTSWRDGRLFNAIIHRHKPMLIDMNKVY --------------------1111--------3333---------33333333-----11 RQTNLENLDQAFSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLYDAMP 11-----------------------3333------------------1111- >EXTRACELLULAR SUBTILISIN-; SWP:Q8GB52; PDB:1SH7A; QSNAIWGLDRIDQRNLPLDRNYNANFDGFGVTAYVIDTGVNNNHEEFGGRSVSGYDFVDN ---------1111--------------2222---------11111111------------ DADSSDCNGHGTHVAGTIGGSQYGVAKNVNIVGVRVLSCSGSGTTSGVISGVDWVAQNAS -------------------------------------1111------------------- GPSVANMSLGGGQSTALDSAVQGAIQSGVSFMLAAGNSNADACNTSPARVPSGVTVGSTT ----------------------------------------3333--1111---------1 SSDSRSSFSNWGSCVDLFAPGSQIKSAWYDGGYKTISGTSMATPHVAGVAALYLQENNGL 111--1111--1111------------1111------3333---------------1111 TPLQLTGLLNSRASENKVSDTRGTTNKLLYSLADSGCEPDC ---------1111-------iiii----------3333--- >HYPOTHETICAL PROTEIN PA50; SWP:Q9HUE3_PSEAE; PDB:1SH8A; HMPLPTELARHLTEEKIAFVQRSGLRAEVLEPGYVRLRMPGAGNENHIGSMYAGALFTLA --------------------1111------2222------2222---------------- 
ELPGGALFLTSFDSARFYPIVKEMTLRFRRPAKGDIRVEARLDAERIRQLETEAGERGKA ------------------------------------------------------------ EYSLELQLTDEQGEVVAESAALYQLRSHARPGS ---------1111----------------2222 >HYPOTHETICAL PROTEIN PF08; SWP:Q8U2E0; PDB:1SHEA; STRGDLIRILGEIEEKMNELKMDGFNPDIILFGREAYNFLSNLLKKEMEEEGPFTHVSNI --------------------1111--------------------------------%%%% KIEILEELGGDAVVIDSKVLGLVPGAAKRIKIIK ----1111----------22222222-------- >FYN TYROSINE KINASE SH3 D; SWP:P06241; PDB:1SHFA; VTLFVALYDYEARTEDDLSFHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVAPVD --------------------2222--------------------------1111----- ------------------------------------------------ >ANTIBODY RIG; SWP:A2KD53; PDB:1SHMA; VQLQESGGGLVQAGGSLRLSCAASGATGSTYDMGWFRQAPGKERESVAAINWGSAGTYYA -----------2222----------3333---------2222-----------------3 SSVRGRFTISRDNAKKTVYLQMNSLKPEDTAVYTCGAGRIRESWVTWWGQGTQVTVSS 333---------1111---------3333------------3333------------- >TRYPSIN INHIBITOR; SWP:P31713; PDB:1SHP; SICSEPKKVGRCKGYFPRFYFDSETGKCTPFIYGGCGGNGNNFETLHQCRAICRA 3333------------------1111--------2222----------------- >Hemoglobin subunit delta; SWP:P02042; PDB:1SHRB; VHLTPEEKTAVNALWGKVNVDAVGGEALGRLLVVYPWTQRFFESFGDLSSPDAVMGNPKV --------------11111111---------------33331111--------------- KAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPENFRLLGNVLVCVLARNFGK ----------------1111------------------3333---------------!!! 
EFTPQMQAAYQKVVAGVANALAHKYH !--------------------3333- >SMALL HEAT SHOCK PROTEIN; SWP:Q57733; PDB:1SHSA; TGIQISGKGFMPISIIEGDQHIKVIAWLPGVNKEDIILNAVGDTLEIRAKRSPLMITESE -----------------1111------22221111---------------------3333 RIIYSEIPEEEEIYRTIKLPATVKEENASAKFENGVLSVILPKAESSIKKGINIE -----------------------3333-----%%%%-------3333-------- >Anthrax toxin receptor 2 ; SWP:P58335; PDB:1SHUX; SCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSFIVFSSQATIILPLTG --------------33331111---------------1111------------------- DRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSSIIIALTDGKLDGLVPS ----------3333-----------------------!!!!-------------!!!!-- YAEKEAKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQS ---------1111---------------------1111-----%%%%---------1111 C - >EPHRIN-A5; SWP:O08543; PDB:1SHXA; VADRYAVYWNSSNPRFQRGDYHIDVCINDYLDVFCPHYEDSVPEDKTERYVLYMVNFDGY ---------11113333--------2222---------1111-1111------------- SACDHTSKGFKRWECNRPHSPNGPLKFSEKFQLFTPFSLGFEFRPGREYFYISSAIPDNG ----------------1111----------------1111-------------------- RRSCLKLKVFVRPTNSCM ------------------ >Hepatocyte growth factor ; SWP:P08581; PDB:1SHYB; KYQLPNFTAETPIQNVILHEHHIFLGATNYIYVLNEEDLQKVAEYKTGPVLEHPDCFPCQ ------------------%%%%------------------------------33331111 DCSSKANLSGGVWKDNINMALVVDTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVH ----3333---------------------------------------------------- CIFSPQIEEPSQCPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRRL --------3333-----------------iiii-------------2222---------- KETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETLDAQTFHTRII 1111------1111----3333-------------iiii--------------------- RFCSINSGLHSYMEMPLECILTKEVFNILQAAYVSKPGAQLARQIGASLNDDILFGVFAQ ----3333---------------------------------------------------- SKPDSAEPMDRSAMCAFPIKYVNDFFNKINVRCLQHFYGPNHEHCFNRDEYRTEFTTALQ -2222----------------------------1111----------------------- RVDLFMGQFSEVLLTSISTFIKGDLTIANLGTSEGRFMQVVVSRSGPSTPHVNFLLDSHP ----iiii-------------!!!!------1111------------------------- 
VSPEVIVEHTLNQNGYTLVITGKKITKIPLNGLGCRHFQSCSQCLSAPPFVQCGWCHDKC ---------2222------------------1111-----3333---3333--------- VRSEECLSGTWTQQICLPA -3333-------------- >RHO GUANINE NUCLEOTIDE EX; SWP:P10824; PDB:1SHZA; AREVKLLLLGAGESGKSTFLKQMRIIHGQDFDQRAREEFRPTIYSNVIKGMRVLVDAREK -------------------------------3333------------------------- LHIPWGDNKNQLHGDKLMAFDTRAPMAAQGMVETRVFLQYLPAIRALWEDSGIQNAYDRR ------3333------3333------------33331111------------------11 REFQLGESVKYFLDNLDKLGVPDYIPSQQDILLARRPTKGIHETHFTFKDLHFKMFDVGG 11-------------3333----------------------------%%%%--------- QRSERKKWFECFEGVTAIIFCVALSDYDQVLMEDRQTNRMHESMKLFDSICNNKWFTDTS 3333------------------3333-----------------------1111--1111- IILFLNKKDLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHF -------------------11111111----3333--------3333------------- TCATDTKNVQFVFDAVTDVIIKNNLK -3333--------------------- >EUKARYOTIC TRANSLATION IN; SWP:Q9UL18; PDB:1SI2A; MAQPVIEFMCEVLDIRNIDEQPKPLTDSQRVRFTKEIKGLKVEVTHCGQMKRKYRVCNVT ------------------------------------2222------!!!!---------- RRPASHQTFPLQVECTVAQYFKQKYNLQLKYPHLPCLQVGQEQKHTYLPLEVCNIVAGQR --3333------------------------1111-------1111---3333-------- >Hepatocyte growth factor ; SWP:P14210; PDB:1SI5H; VVNGIPTRTNIGWMVSLRYRNKHICGGSLIKESWVLTARQCFPSRDLKDYEAWLGIHDVH ----------1111----%%%%---------------3333----3333--------111 GRGDEKCKQVLNVSQLVYGPEGSDLVLMKLARPAVLDDFVSTIDLPNYGSTIPEKTSCSV 1-----------------------------------1111-------------------- YGWGYTGLINYDGLLRVAHLYIMGNEKCSQLNESEICAGAEKIGSGPCEGDYGGPLVCEQ -----------------------3333----1111------------2222--------- HKMRMVLGVIVPGRGCAIPNRPGIFVRVAYYAKWIHKIILTYKVPQS --------------------------3333----------------- >Salivary nitrophorin; SWP:O76745; PDB:1SI6X; PPAQLSVHTVSWNSGHERAPTNLEELLGLNSGETPDVIAVAVQGFGFQTDKPQQGPACVK -------------!!!!-----3333--1111--------------3333---------- NFQSLLTSKGYTKLKNTITETMGLTVYCLEKHLDQNTLKNETIIVTVDDQKKSGGIVTSF -----1111-------------------3333-3333----------1111--------- 
TIYNKRFSFTTSRMSDEDVTSTNTKYAYDTRLDYSKKDDPSDFLFWIGDLNVRVETNATH -%%%%---------1111-1111-----3333---------------------------- AKSLVDQNNIDGLMAFDQLKKAKEQKLFDGWTEPQVTFKPTYKFKPNTDEYDLSATPSWT ----1111-----1111-----1111-2222-------------2222------------ DRALYKSGTGKTIQPLSYNSLTNYKQTEHRPVLAKFRVTL --------------------3333---------------- >CATALASE; SWP:Q834P5; PDB:1SI8A; QHLTTSQGSPVGDNQNSLTAGEFGPVLIQDVHLLEKLAHFNRERVPERVVHAKGAGAHGI ----1111------------1111--1111---------1111----------------- FKVSQSMAQYTKADFLSEVGKETPLFARFSTVAGELGSSDTLRDPRGFALKFYTDEGNYD ------3333--3333-2222-------------1111---------------------- LVGNNTPIFFIRDAIKFPDFIHSQKRNPRTHLKSPEAVWDFWSHSPESLHQVTILMSDRG ------------3333----------------------------3333--------3333 IPLSFRHMHGFGSHTFKWVNAAGEVFFVKYHFKTNQGIKNLESQLAEEIAGKNPDFHIED ---1111------------1111----------1111---------------1111---- LHNAIENQEFPSWTLSVQIIPYADALTMKETLFDVTKTVSQKEYPLIEVGTMTLNRNPEN ----1111------------3333------1111-----3333----------------3 YFAEVEQVTFSPGNFVPGIEASPDKLLQGRLFAYGDAHRHRVGANSHQLPINQAKAPVNN 333-------1111-2222-----------------------1111--3333-------- YQKDGNMRFNNGNSEINYEPNSYTETPKEDPTAKISSFEVEGNVGNYSYNQDHFTQANAL ----------------------3333---3333--------------------------- YNLLPSEEKENLINNIAASLGQVKNQEIIARQIDLFTRVNPEYGARVAQAIKQQ 1111--------------3333--3333--------3333-------------- >UBIQUITIN; SWP:P62988; PDB:1SIFA; LQLFIKTLTGKTFTVEMEPSDTIENLKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN ------1111-------1111---------------1111----------33333333-- IQKESTLHLVL -2222------ >RNA POLYMERASE PRIMARY SI; SWP:P00579; PDB:1SIG; MEGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYNRVEAEEARLSDLITGFVDDLAPT ------------------------3333----------1111--1111------------ ATHVGSELSQEDLDIDPELAREKFAELRAQYVVTRDTIKAHATAQEEILKLSEVFKQFRL 1111----3333--------------------3333----3333---------3333--- VPKQFDYLVNSMRVMMDRVRTQERLIMKLCVEQCKMPKKNFITLFTGNETSDTWFNAAIA -------------------------------1111-3333----------3333---111 MNKPWSEKLHDVSEEVHRALQKLQQIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVE 
1-11113333------------------------------------------------11 ANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITR 11------------------------------3333--3333-----------------3 SIADQ 333-- >KUMAMOLISIN-AS; SWP:Q8GB88; PDB:1SIOA; AAPTAYTPLDVAQAYQFPEGLDGQGQCIAIIELGGGYDEASLAQYFASLGVPAPQVVSVS ----------------------2222---------------------------------- VDGASNQPTGDPSGPDGEVELDIEVAGALAPGAKFAVYFAPNTDAGFLDAITTAIHDPTL iiii------1111---------------1111--------------------------- KPSVVSISWGGPEDSWTSAAIAAMNRAFLDAAALGVTVLAAAGDSGSTDGEQDGLYHVDF -----------1111----------------1111------------iiii--------- PAASPYVLACGGTRLVASGGRIAQETVWNDGPDGGATGGGVSRIFPLPAWQEHANVPPSA ---1111----------iiii---------1111-------------1111--------- NPGASSGRGVPDLAGNADPATGYEVVIDGEATVIGGTSAVAPLFAALVARINQKLGKAVG 1111-------------3333-----iiii-----3333--------------------- YLNPTLYQLPADVFHDITEGNNDIANRAQIYQAGPGWDPCTGLGSPIGVRLLQALLP -333311111111---------------------------!!!!--------1111- >UNLIGANDED SIV PROTEASE; SWP:Q87706; PDB:1SIP; PQFSLWRRPVVTAHIEGQPVEVLLDTGADDSIVTGIELGPHYTPKIVGGIGGFINTKEYK --------------iiii------1111-------------------------------- NVEIEVLGKRIRGTIMTGDTPINIFGRNLLTALGMSLNF -----%%%%----------------3333-1111----- >GLUTARYL-COA DEHYDROGENAS; SWP:Q92947; PDB:1SIQA; EFDWQDPLVLEEQLTTDEILIRDTFRTYCQERLMPRILLANRNEVFHREIISEMGELGVL --1111--3333--------------------3333----------3333---------- GPTIKGYGCAGVSSVAYGLLARELERVDSGYRSAMSVQSSLVMHPIYAYGSEEQRQKYLP 1111-iiii--------------3333--------------------------------- QLAKGELLGCFGLTEPNSGSDPSSMETRAHYNSSNKSYTLNGTKTWITNSPMADLFVVWA --------------1111--3333--------1111------------3333-------- RCEDGCIRGFLLEKGMRGLSAPRIQGKFSLRASATGMIIMDGVEVPEENVLPGASSLGGP -1111-------2222-------------1111------------3333-1111------ FGCLNNARYGIAWGVLGASEFCLHTARQYALDRMQFGVPLARNQLIQKKLADMLTEITLG ----------------------------------iiii3333------------------ LHACLQLGRLKDQDKAAPEMVSLLKRNNCGKALDIARQARDMLGGNGISDEYHVIRHAMN ----------1111--3333----------------------------3333-------- 
LEAVNTYEGTHDIHALILGRAITGIQAFTA ------------------------------ >SCORPION INSECTOTOXIN I5A; SWP:P15222; PDB:1SIS; MCMPCFTTDPNMAKKCRDCCGGNGKCFGPQCLCNR --------1111--------------!!!!----- >DEOXYURIDINE 5'-TRIPHOSPH; SWP:O07199; PDB:1SIXA; HMSTTLAIVRLDPGLPLPSRAHDGDAGVDLYSAEDVELAPGRRALVRTGVAVAVPFGMVG -----------1111------2222-------------2222------------2222-- LVHPRSGLATRVGLSIVNSPGTIDAGYRGEIKVALINLDPAAPIVVHRGDRIAQLLVQRV ---------------1111----1111-------------------2222---------- ELVELVEVSSFDEAGLASTSRGDGG ---------3333------------ >NONSPECIFIC LIPID-TRANSFE; SWP:P83434; PDB:1SIYA; MTCGQVQGNLAQCIGFLQKGGVVPPSCCTGVKNILNSSRTTADRRAVCSCLKAAAGAVRG ---------------------------------------3333---------1111---- INPNNAEALPGKCGVNIPYKISTSTNCNSIN ----------3333----------------- >FERREDOXIN; SWP:P29603; PDB:1SJ1A; AWKVSVDQDTCIGDAICASLCPDVFEMNDEGKAQPKVEVIEDEELYNCAKEAMEACPVSA ---------------------------1111------------------------1111- ITIEEA ------ >SH3 domain-binding glutam; SWP:Q9H299; PDB:1SJ6A; MSGLRVYSTSVTGSREIKSQQSEVTRILDGKRIQYQLVDISQDNALRDEMRALAGNPKAT ---------------3333-------------------33333333-------------- PPQIVNGDQYCGDYELFVEAVEQNTLQEFLKLALE ---------------3333-11113333------- >TALIN 1; SWP:P26039; PDB:1SJ7A; LTSAQQALTGTINSSMQAVQAAQATLDDFETLPPLGQDAASKAWRKNKMDESKHEIHSQV ------------------------------------------------------------ DAITAGTASVVNLTAGDPAETDYTAVGCAVTTISSNLTEMSRGVKLLAALLEDEGGNGRP -----------1111--------------------------------------------- LLQAAKGLAGAVSELLRSAQPASAEPRQNLLQAAGNVGQASGELLQQ -------------------1111--3333-------------3333- >N-ACYLAMINO ACID RACEMASE; SWP:Q44244; PDB:1SJDA; MKLSGVELRRVQMPLVAPFRTSFGTQSVRELLLLRAVTPAGEGWGECVTMAGPLYSSEYN --------------------1111------------------------------------ DGAEHVLRHYLIPALLAAEDITAAKVTPLLAKFKGHRMAKGALEMAVLDAELRAHERSFA ---------------------3333----3333-------------------1111---- AELGSVRDSVPCGVSVGIMDTIPQLLDVVGGYLDEGYVRIKLKIEPGWDVEPVRAVRERF --------------------------------------------2222------------ 
GDDVLLQVDANTAYTLGDAPQLARLDPFGLLLIEQPLEEEDVLGHAELARRIQTPICLDE ---------%%%%-3333------3333---------1111------1111-------33 SIVSARAAADAIKLGAVQIVNIKPGRVGGYLEARRVHDVCAAHGIPVWCGGMIETGLGRA 33---------1111-------3333--------------1111---------------- ANVALASLPNFTLPGDTSASDRFYKTDITEPFVLSGGHLPVPTGPGLGVAPIPELLDEVT -------1111-------3333------------iiii------!!!!--------1111 TAKVWIG ------- >CALSEQUESTRIN, CARDIAC MU; SWP:P12637; PDB:1SJIA; GLNFPTYDGKDRVVSLTEKNFKQVLKKYDVLCLYYHESVSSDKVAQKQFQLKEIVLELVA ----------------3333-3333----------------------------------- QVLEHKDIGFVMVDAKKEAKLAKKLGFDEEGSLYVLKGDRTIEFDGEFAADVLVEFLLDL --1111----------------------2222---------------------------- IEDPVEIINSKLEVQAFERIEDQIKLIGFFKSEESEYYKAFEEAAEHFQPYIKFFATFDK ---------3333------------------33333333----3333-----------33 GVAKKLSLKMNEVDFYEPFMDEPIAIPDKPYTEEELVEFVKEHQRPTLRRLRPEDMFETW 33------2222----2222-----------1111----------------3333----- EDDLNGIHIVAFAERSDPDGYEFLEILKQVARDNTDNPDLSIVWIDPDDFPLLVAYWEKT -------------1111----------------3333--------1111------3333- FKIDLFKPQIGVVNVTDADSVWMEIPDDDDLPTAEELEDWIEDVLSGKIN -------------------------------------------------- >60 KDA CHAPERONIN 2; SWP:P06806; PDB:1SJPA; LEDPYEKIGAELVKEVAKKTTTATVLAQALVREGLRNVAAGANPLGLKRGIEKAVEKVTE -------------------------------------1111------------------- TLLKGAKEVETKEQIAATAAISAGDQSIGDLIAEAMDKVGNEGVITVEESNTFGLQLELT ----------3333-----------3333----------1111----------------- EGMRFDKGYISGYFVTDPERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIGAGKPLLII ----------3333--3333----------------------3333---3333------- AEDVEGEALSTLVVNKIRGTFKSVAVKAPGFGDRRKAMLQDMAILTGGQVISEEVGLTLE ------3333---------------------3333----------------3333--333 NADLSLLGKARKVVVTKDETTIVEGAGDTDAIAGRVAQIRQEIENSDSDYDREKLQERLA 3-3333------------------------------------------------------ KLAGGVAVIKAGAATEVELKERKHRIEDAVRNAKAAVEEGIVAGGGVTLLQAAPTLDELK --------------3333----------------3333-----%%%%3333---3333-- LEGDEATGANIVKVALEAPLKQIAFNSGLEPGVVAEKVRNLPAGHGLNAQTGVYEDLLAA 
-!!!!--------3333--------------------11112222----------3333- GVADPVKVTRSALQNAASIAGLFLTTE ------------------3333----- >R9; SWP:A2KD66; PDB:1SJVA; GGGLVQAGESLKLSCAASGGFMGWYRQAPGKQRELVATINSRGITNYADFVKGRFTISRD -----2222------------------2222--------1111----1111--------3 NAKKTVYLEMNSLEPEDTAVYYCYTHYFRSYWGQGTQVTVSS 333----------3333------------------------- >NOGALONIC ACID METHYL EST; SWP:NA; PDB:1SJWA; SRQTEIVRRMVSAFNTGRTDDVDEYIHPDYLNPATLEHGIHTGPKAFAQLVGWVRATFSE ------------------1111----1111-33331111-------------------11 EARLEEVRIEERGPWVKAYLVLYGRHVGRLVGMPPTDRRFSGEQVHLMRIVDGKIRDHRD 11---------!!!!--------------iiii-----------------iiii------ WPDFQGTLRQLGDPWPDDEGWR --------1111----3333-- >IMMUNOGLOBULIN VH DOMAIN; SWP:NA; PDB:1SJXA; QVQLQESGGGLVQAGGSLRLSCQASGNIFRINDMGWYRQAPGTQRELVAAITSGGSTKYA ------------2222----------3333---------2222--------1111----3 DSVKGRFTISKDNAKNTVYLQMNSLKPEDTAVYYCAAEDRHRIGTVGYWGQGTQVTVSS 333--------3333----------1111---------1111----------------- >MUTT/NUDIX FAMILY PROTEIN; SWP:Q9RVK2; PDB:1SJYA; EHDERTHVPVELRAAGVVLLNERGDILLVQEKGIEKAGLWHIPSGAVEDGENPQDAAVRE --------------------1111-----------------------2222--------- ACEETGLRVRPVKFLGAYLGRFPDGVLILRHVWLAEPEPGQTLAPAFTDEIAEASFVSRE ---------------------1111----------------------1111--------- DFAQLYAAGQIRMYQTKLFYADALREKGFPALPV -----1111------------------------- >PEPTIDOGLYCAN RECOGNITION; SWP:Q96LB9; PDB:1SK4A; PNIIKRSAWEARETHCPKMNLPAKYVIIIHTAGTSCTVSTDCQTVVRNIQSFHMDTRNFC ----3333---------------------------------------------------- DIGYHFLVGQDGGVYEGVGWHIQGSHTYGFNDIALGIAFIGYFVEKPPNAAALEAAQDLI ------------------1111----2222------------------------------ QAVVEGYLTPNYLLMGHSDVVNILSPGQALYNIISTWPHFKH --1111---------11111111-2222-------------- >HYPOTHETICAL PROTEIN PA-H; SWP:O69002; PDB:1SK7A; NLRSQRLNLLTNEPHQRLESLVKSKEPFASRDNFARFVAAQYLFQHDLEPLYRNEALARL -------------------------1111------------------3333--------- FPGLASRARDDAARADLADLGHPVPEGDQSVREADLSLAEALGWLFVSEGSKLGAAFLFK 
2222------------------------3333----3333-------------------- KAAALELDENFGARHLAEPEGGRAQGWKSFVAILDGIELNEEEERLAAKGASDAFNRFGD -3333--1111-1111-----------------1111----------------------- LLERTFA ------- >MAJOR PRION PROTEIN 2; SWP:Q01880; PDB:1SKHA; MVKSKIGSWILVLFVAMWSDVGLCKKRPKP -3333------------------------- >Protein skinhead-1; SWP:P34707; PDB:1SKNP; GRQSKDEQLASDNELPVSAFQISEMSLSELQQVLKNESLSEYQRQLIRKIRRRGKNKVAA -------------------------3333------------------------------- RTCRQRRTDRHDKM -------------- >HYPOTHETICAL 7.5 KDA PROT; SWP:P20215; PDB:1SKVA; SKEVLEKELFELDEDVRELLSLIHEIKIDRITGNDKQKLGKAYFQVQKIEAELYQLIKVS 3333-------------------------------------------------------- HH -- >ATP synthase subunit alph; SWP:P09219; PDB:1SKYB; SQIQVSDVGTVIQVGDGIARAHGLDNVMSGEAVEFANAVMGMALNLEENNVGIVILGPYT --------------iiii-----11112222---3333---------------------- GIKEGDEVRRTGRIMEVPVGETLIGRVVNPLGQPVDGLGPVETTETRPIESRAPGVMDRR --------------------3333----1111----------------------1111-- SVHEPLQTGIKAIDALVPIGRGQRELIIGDRQTGKTSVAIDTIINQKDQNMICIYVAIGQ ----------3333---------------------------------------------- KESTVATVVETLAKHGAPDYTIVVTASASQPAPLLFLAPYAGVAMGEYFMIMGKHVLVVI --------------------------33333333-------------------------- DDLSKQAAAYRQLSLLLRRPPGREAYPGDIFYLHSRLLERAAKLSDAKGGGSLTALPFVE ---3333-------1111----iiii11113333--3333----3333------------ TQAGDISAYIPTNVISITDGQIFLQSDLFFSGVRPAINAGLSVSRVGGAAQIKAMKKVAG -%%%%--------------------3333------------------------------- TLRLDLAAYRELEFAQFSDDKATQANVARGARTVEVLKQDLHQPIPVEKQVLIIYALTRG -------------------3333---------3333---2222------------1111- FLDDIPVEDVRRFEKEFYLWLDQNGQHLLEHIRTTKDLPNEDDLNQAIEAFKKTFVVSQ -33333333---------------1111----3333---3333-----3333------- >ATP synthase subunit beta; SWP:P07677; PDB:1SKYE; MTRGRVIQVMGPVVDVKFENGHLPAIYNALKIQHKARNENEVDIDLTLEVALHLGDDTVR ---------!!!!-----%%%%--2222---------1111-------------%%%%-- TIAMASTDGLIRGMEVIDTGAPISVPVGQVTLGRVFNVLGEPIDLEGDIPADARRDPIHR 
------2222-----------------3333-----3333---------3333------- PAPKFEELATEVEILETGIKVVDLLAPYIKGGKIGLFGGAGVGKTVLIQELIHNIAQEHG ---3333---------------------2222---------------------------- GISVFAGVGERTREGNDLYHEMKDSGVISKTAMVFGQMNEPPGARMRVALTGLTMAEYFR ----------------------3333----------1111------3333---------- DEQGQDGLLFIDNIFRFTQAGSEVSALLGRMPSAIGYQPTLATEMGQLQERITSTAKGSI ------------3333---------3333----%%%%1111---33331111-------- TSIQAIYVPADDYTDPAPATTFSHLDATTNLERKLAEMGIYPAVDPLVSTSRALAPEIVG ---------------------1111-------------------------11113333-- EEHYQVARKVQQTLERYKELQDIIAILGMDELSDEDKLVVHRARRIQFFLSQNFHVAEQF --------------------3333-----------------------1111--1111--- TGQPGSYVPVKETVRGFKEILEGKYDHLPEDRFRLVGRIEEVVEKAKAMG ------------------------1111-3333----3333--3333--- >ANTISTASIN; SWP:P15358; PDB:1SKZ; GCEEAGCPEGSACNIITDRCTCSGVRCRVHCPHGFQRSRYGCEFCKCRLEPMKATCDISE -------2222-------------------1111---1111---------------3333 CPEGMMCSRLTNKCDCKIDINCRKTCPNGLKRDKLGCEYCECRP -2222---------------------------1111-------- >MDC-SIGN1B TYPE I ISOFORM; SWP:Q9NNX6; PDB:1SL4A; CHPCPWEWTFFQGNCYFMSNSQRNWHDSITACKEVGAQLVVIKSAEEQNFLQLQSSRSNR ----2222--iiii------------------1111------------------------ FTWMGLSDLNQEGTWQWVDGSPLLPSFKQYWNRGEPNNVGEEDCAEFSGNGWNDDKCNLA -------3333-----1111---333311112222--2222------!!!!----1111- KFWICKKSAASC ------------ >AEQUORIN 1; SWP:P07164; PDB:1SL8A; NPKWIGRHKHMFNFLDVNHNGRISLDEMVYKASDIVINNLGATPEQAKRHKDAVEAFFGG ---------------1111--------------------------------------333 AGMKYGVETEWPEYIEGWKRLASEELKRYSKNQITLIRLWGDALFDIIDKDQNGAISLDE 3---------------------------1111----------------1111-------- WKAYTKSAGIIQSSEDCEETFRVCDIDESGQLDVDEMTRQHLGFWYTMDPACEKLYGGAV -----1111---------------1111-------------------------2222--- P - >BOVINE GALECTIN-1; SWP:P11116; PDB:1SLTA; CGLVASNLNLKPGECLRVRGEVAADAKSFLLNLGKDDNNLCLHFNPRFNAHGDVNTIVCN ----------2222--------1111---------1111----------iiii------- SKDAGAWGAEQRESAFPFQPGSVVEVCISFNQTDLTIKLPDGYEFKFPNRLNLEAINYLS 
--iiii------------------------3333----1111------3333-------- AGGDFKIKCVAFE ------------- >ECOTIN; SWP:P23827; PDB:1SLUA; IAPYPQAEKGMKRQVIQLTPQEDESTLKVELLIGQTLEVDCNLHRLGGKLENKTLEGWGY -------2222------------------------------------------------- DYYVFDKVSSPVSTMMHCPDKEKKFVTAYLGDAGMLRYNSKLPIVVYTPDNVDVKYRVWK -----------------------------!!!!-----3333------1111-------- AEEKIDNAVVR ----------- >TYROSINE-PROTEIN KINASE I; SWP:Q08881; PDB:1SM2A; VIDPSELTFVQEIGSGQFGLVHLGYWLNKDKVAIKTIREGAMSEEDFIEEAEVMMKLSHP -----------------------------------------------------3333-11 KLVQLYGVCLEQAPICLVFEFMEHGCLSDYLRTQRGLFAAETLLGMCLDVCEGMAYLEEA 11-------------------------------2222-3333------------------ CVIHRDLAARNCLVGENQVIKVSDFGPVKWASPEVFSFSRYSSKSDVWSFGVLMWEVFSE -------3333---------------1111-3333---------------------1111 GKIPYENRSNSEVVEDISTGFRLYKPRLASTHVYQIMNHCWKERPEDRPAFSRLLRQLAE ----------------1111-----3333------3333----3333--3333------- IAESG ----- >Ig heavy chain V-III regi; SWP:P01801; PDB:1SM3H; QVQLQESGGGLVQPGGSMKLSCVASGFTFSNYWMNWVRQSPEKGLEWVAEIRLKS ------------2222-----------3333--------1111------------ >CHLOROPLAST FERREDOXIN-NA; SWP:Q9M4D2; PDB:1SM4A; ISKKQDEGVVVNKFRPKEPYIGRCLLNTKITGDDAPGETWHMVFSTEGEIPYREGQSIGV -----2222-----3333-------------1111----------iiii---2222---- IADGVDANGKPHKLRLYSIASSALGDFGDSKTVSLCVKRLVYTNDKGEEVKGVCSNFLCD -----1111------------3333--------------------------------111 LKPGADVKITGPVGKEMLMPKDPNATVIMLGTGTGIAPFRSFLWKMFFEKHDDYKFNGLA 12222---------1111---1111-------------------------1111------ WLFLGVPTSSSLLYKEEFEKMKEKAPENFRLDFAVSREQTNEKGEKMYIQTRMAQYAEEL -------33332222-------------------1111--1111---33333333----- WTLLKKDNTFVYMCGLKGMEQGIDDIMSSLAAKEGIDWADYKKQLKKAEQWNVEVY --1111---------3333--------------------------1111------- >RECOMBINANT IB PRONAPIN; SWP:Q8GT96; PDB:1SM7A; QPQKCQREFQQEQHLRACQQWIRQQLAGSPFSENQWGPQQGPSLREQCCNELYQEDQVCV -3333----------33333333--------------1111---3333----11111111 CPTLKQAAKSVRVQGQHGPFQSTRIYQIAKNLPNVCNMKQIGTCPFIAI 
-3333-------------------------1111--------------- >MALTOGENIC AMYLASE; SWP:O69007; PDB:1SMAA; MRKEAIHHRSTDNFAYAYDSETLHLRLQTKKNDVDHVELLFGDPYEWHDGAWQFQTMPMR -3333-----!!!!---------------------------------%%%%--------- KTGSDGLFDYWLAEVKPPYRRLRYGFVLRAGGEKLVYTEKGFYHEAPSDDTAYYFCFPFL ----------------2222-----------------3333---------3333------ HRVDLFQAPDWVKDTVWYQIFPERFANGNPAISPKGARPWGSEDPTPTSFFGGDLQGIID 3333----3333------------------------------------------------ HLDYLADLGITGIYLTPIFRAPSNHKYDTADYFEIDPHFGDKETLKTLVKRCHEKGIRVM 3333--------------------------1111-3333--------------------- LDAVFNHCGYEFAPFQDVLKNGAASRYKDWFHIREFPLQTEPRPNYDTFAFVPHMPKLNT --------11113333----!!!!--1111----------------------------11 AHPEVKRYLLDVATYWIREFDIDGWRLDVANEIDHQFWREFRQAVKALKPDVYILGEIWH 11---------1111-------------3333----------------1111-------- DAMPWLRGDQFDAVMNYPLADAALRFFAKEDMSASEFADRLMHVLHSYPKQVNEAAFNLL -3333-------------------------------------3333---3333------- GSHDTPRLLTVCGGDVRKVKLLFLFQLTFTGSPCIYYGDEIGMTGGNDPECRKCMVWDPE -1111-3333-iiii----------1111-------3333-------------------- KQNKELYEHVKQLIALRKQYRALRRGDVAFLTADDEVNHLVYAKTDGNETVMIIINRSNE -------------------3333---------3333---------!!!!----------- AAEIPMPIDARGKWLVNLLTGERFAAEAETLCVSLPPYGFVLYAVESW --------3333------------------------------------ >17KD FETAL BRAIN PROTEIN; SWP:Q9H4G4; PDB:1SMBA; SASKQFHNEVLKAHNEYRQKHGVPPLKLKNLNREAQQYSEALASTRILKHSPESSRGQGE ------------------1111-------------------------------1111--- NLAWASYDQTGKEVADRWYSEIKNYNFQQPGFTSGTGHFTAMVWKNTKKMGVGKASASDG -------------------3333--3333---3333-------1111---------1111 SSFVVARYFPAGNVVNEGFFEENVLPP ---------------22223333---- >AMYLASE; SWP:P04745; PDB:1SMD; YSSNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPPNENVAIHNPFRPWWER -----2222-----2222-------------1111--------------------1111- YQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGNAVSAGTSSTCGSYFNP ---------3333------------1111-------------1111-------------1 GSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLSGLLDLALGKDYVRSKIAE 
111-3333--1111-3333--1111---------------iiii---3333--------- YMNHLIDIGVAGFRIDASKHMWPGDIKAILDKLHNLNSNWFPEGSKPFIYQEVIDLGGEP ----------------3333-3333---3333---------2222--------------- IKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKNWGEGWGFMPSDRALVFVDNHDN -33333333------------------%%%%3333----3333---1111------3333 QRGHGAGGASILTFWDARLYKMAVGFMLAHPYGFTRVMSSYRWPRYFENGKDVNDWVGPP ------!!!!--3333-----------------------------------1111----- NDNGVTKEVTINPDTTCGNDWVCEHRWRQIRNMVNFRNVVDGQPFTNWYDNGSNQVAFGR -iiii------1111--%%%%-3333-3333--------2222----------------- GNRGFIVFNNDDWTFSLTLQTGLPAGTYCDVISGDKINGNCTGIKIYVSDDGKAHFSISN ------------------------------------------------1111------11 SAEDPFIAIHAESKL 11-------1111-- >MALATE DEHYDROGENASE, GLY; SWP:P19446; PDB:1SMKA; GFKVAILGAAGGIGQPLAMLMKMNPLVSVLHLYDVVNAPGVTADISHMDTGAVVRGFLGQ -------1111------------1111-----------------3333----------33 QQLEAALTGMDLIIVPAGVPRKPGMTRDDLFKINAGIVKTLCEGIAKCCPRAIVNLISNP 33-3333-----------------------------------------1111-------3 VNSTVPIAAEVFKKAGTYDPKRLLGVTMLDVVRANTFVAEVLGLDPRDVDVPVVGGHAGV 333---------1111--1111----3333--------------3333---------!!! 
TILPLLSQVKPPSSFTQEEISYLTDRIQNGGTEVVEAKAGAGSATLSMAYAAVKFADACL !---1111-----------------------------%%%%------------------- RGLRGDAGVIECAFVSSQVTELPFFASKVRLGRNGIEEVYSLGPLNEYERIGLEKAKKEL ------------------------------------------------------------ AGSIEKGVSFIRS ------------- >TRIGGERING RECEPTOR EXPRE; SWP:Q9NP99; PDB:1SMOA; ATKLTEEKYELKEGQTLDVKCDYTLEKFASSQKAWQIIRDGEMPKTLACTERPSKNSHPV -----------2222--------33331111-------2222------------------ QVGRIILEDYHDHGLLRVRMVNLQVEDSGLYQCVIYQPPKEPHMLFDRIRLVV -!!!!------------------3333-------------------------- >Proteinase inhibitor [Pre; SWP:P18958; PDB:1SMPI; SSLRLPSAAELSGQWVLSGAEQHCDIRLNTDVLDGTTWKLAGDTACLQKLLPEAPVGWRP ------3333-------------------------------------------------- TPDGLTLTQADGSAVAFFSRNRDRYEHKLVDGSVRTLKKK 1111----1111--------!!!!----1111-------- >INHIBITOR CH-66; SWP:P00796; PDB:1SMRA; LISPVVLTNYLNSQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSRLY ---------%%%%-------------------1111------1111---- >SESBANIA MOSAIC VIRUS COA; SWP:Q9EB06; PDB:1SMVA; GAITVLHCELTAEIGVTDSIVVSSELVMPYTVGTWLRGVADNWSKYSWLSVRYTYIPSCP ---------------------------3333--------1111----------------1 SSTAGSIHMGFQYDMADTVPVSVNKLSNLRGYVSGQVWSGSAGLCFINNSRCSDTSTAIS 111----------3333----33333333------1111---3333--------1111-- TTLDVSELGKKWYPYKTSADYATAVGVDVNIATDLVPARLVIALLDGSSSTAVAAGRIYD ---1111--------------------33333333------------------------- TYTIQMIEPTASALNL ----------3333-- >RIBONUCLEASE E; SWP:P21513; PDB:1SMXA; ANIYKGKITRIEPSLEAAFVDYGAERHGFLPLKEIAREYFPANYSAHGRPNIKDVLREGQ -----------1111---------------3333-3333-----------3333--2222 EVIVQIDKEERGNKGAALTTFISLAGS ----------!!!!---------2222 >NEUROTOXIN BMK M4; SWP:P45698; PDB:1SN4A; VRDAYIAKPENCVYHCAGNEGCNKLCTDNGAESGYCQWGGRYGNACWCIKLPDDVPIRVP --------------------------1111---------------------1111----- GKCH ---- >NEUROTOXIN BMK M8; SWP:P54135; PDB:1SNB; GRDAYIADSENCTYFCGSNPYCNDVCTENGAKSGYCQWAGRYGNACYCIDLPASERIKEG -------1111---------------1111---------------------1111----- GRCG ---- >NUCLEOBINDIN 1; SWP:Q02818; 
PDB:1SNLA; LKEVWEELDGLDPNRFNPKTFFILHDINSDGVLDEQELEALFTKELEKVYDPKNEEDDMR --------------------------------------------3333------------ EMEEERLRMREHVMKNVDTNQDRLVTLEEFLASTQRKEF --------------------------------------- >3,4-DIHYDROXY-2-BUTANONE ; SWP:Q60364; PDB:1SNNA; NNVEKAIEALKKGEIILVYDSDEREGETDMVVASQFITPEHIRIMRKDAGGLICTALHPD ---------1111-------1111--------3333-3333------------------- ICNKLGIPFMVDILEFASQKFKVLRELYPNDIPYDEKSSFSITINHRKTFTGITDNDRAF --------3333--------33331111-----------------1111----------- TIKKLAELVKEGRFNDFGKEFRSPGSVTLLRAAEGLVKNRQGHTEMTVALAELANLVPIT --------11113333-1111-----------2222---------------1111----- TICEMMGDDGNAMSKNETKRYAEKHNLIYLSGEEIINYY ------1111-------------------------3333 >COPPER-CONTAINING NITRITE; SWP:P38501; PDB:1SNRA; ATAAEIAALPRQKVELVDPPFVHAHSQVAEGGPKVVEFTMVIEEKKIVIDDAGTEVHAMA -----1111----------------------------------------1111------- FNGTVPGPLMVVHQDDYLELTLINPETNTLMHNIDFHAATGALGGGGLTEINPGEKTILR iiii--------2222--------3333-------3333-%%%%3333---2222----- FKATKPGVFVYHCAPPGMVPWHVVSGMNGAIMVLPREGLHDGKGKALTYDKIYYVGEQDF --------------2222----------------1111--1111---------------- YVPRDENGKYKKYEAPGDAYEDTVKVMRTLTPTHVVFNGAVGALTGDKAMTAAVGEKVLI ----1111------3333--------3333------iiii----!!!!----2222---- VHSQANRDTRPHLIGGHGDYVWATGKFNTPPDVDQETWFIPGGAAGAAFYTFQQPGIYAY ------------2222-----11111111-----------2222---------------- VNHNLIEAFELGAAAHFKVTGEWNDDLMTSVLAPSG -----------------------3333--------- >SNIFFER CG10964-PA; SWP:Q9W3H4; PDB:1SNYA; HNSILITGCNRGLGLGLVKALLNLPQPPQHLFTTCRNREQAKELEDLAKNHSNIHILEID ------------------------------------3333----------1111-----3 LRNFDAYDKLVADIEGVTKDQGLNVLFNNAGIAPKSARITAVRSQELLDTLQTNTVVPIL 3331111----------!!!!----------------1111------------------- AKACLPLLKKAAKANESQPGVGRAAIINSSILGSIQGNTDGGYAYRTSKSALNAATKSLS -1111---------1111-3333------33333333----------------------- VDLYPQRICVSLHPGWVKTDGGSSAPLDVPTSTGQIVQTISKLGEKQNGGFVNYDGTPLA ---1111--------------1111--3333--------11113333-----1111---- W - >ALDOSE 
1-EPIMERASE; SWP:NA; PDB:1SNZA; HMASVTRAVFGELPSGGGTVEKFQLQSDLLRVDIISWGCTITALEVKDRQGRASDVVLGF -------------%%%%-------------------%%%%-------1111--------- AELEGYLQKQPYFGAVIGRVANRIAKGTFKVDGKEYHLAINKEPNSLHGGVRGFDKVLWT -3333------------------2222---iiii------------iiii--1111---- PRVLSNGVQFSRISPDGEEGYPGELKVWVTYTLDGGELIVNYRAQASQATPVNLTNHSYF --------------2222---------------!!!!----------------------- NLAGQASPNINDHEVTIEADTYLPVDETLIPTGEVAPVQGTAFDLRKPVELGKHLQDFHL 1111-----1111------------1111--------2222------------------- NGFDHNFCLKGSKEKHFCARVHHAASGRVLEVYTTQPGVQFYTGNFLDGTLKGKNGAVYP -----------------------1111----------------1111------iiii--2 KHSGFCLETQNWPDAVNQPRFPPVLLRPGEEYDHTTWFKFSVA 222--------2222--1111---------------------- >CGMP-INHIBITED 3',5'-CYCL; SWP:Q13370; PDB:1SO2A; LDLILVEEYDSLIEKMSNWNFPIFELVEKMGEKSGRILSQVMYTLFQDTGLLEIFKIPTQ --3333---------------3333----!!!!1111---------------1111---- QFMNYFRALENGYRDIPYHNRIHATDVLHAVWYLTTRPVPGLQQIHNGGRIAYISSKSCS ------------------------------------------------------------ NPDESYGCLSSNIPALELMALYVAAAMHDYDHPGRTNAFLVATNAPQAVLYNDRSVLENH --1111-3333-----------------------------1111------iiii------ HAASAWNLYLSRPEYNFLLHLDHVEFKRFRFLVIEAILATDLKKHFDFLAEFNAKANDVN -----------33331111---------------------3333---------------- SNGIEWSNENDRLLVCQVCIKLADINGPAKVRDLHLKWTEGIVNEFYEQGDEEANLGLPI ----11113333--------------11113333-------------------1111--- SPFMDRSSPQLAKLQESFITHIVGPLCNSYDAAGLLPGQWLESRRRIFCQLMHHLTENHK 22221111----------------------1111-----------------------333 IWK 3-- >SIALIDASE 2; SWP:Q9Y3R4; PDB:1SO7A; LPVLQKESVFQSGAHAYRIPALLYLPGQQSLLAFAEQRAELIVLRRGDYDAPTHQVQWQA -----------1111----------3333---------------------1111------ QEVVAQARLDGHRSMNPCPLYDAQTGTLFLFFIAIPGQVTEQQQLQTRANVTRLCQVTST ---3333-2222---------------------------3333--1111----------- DHGRTWSSPRDLTDAAIGPAYREWSTFAVGPGHCLQLNDRARSLVVPAYAYRKLHPIQRP iiii--------3333!!!!1111--------------1111------------------ IPSAFCFLSHDHGRTWARGHFVAQDTLECQVAEVETQRVVTLNARSHLRARVQAQSTNDG 
----------iiii-------------------------------------------iii LDFQESQLVKKLVEPPPQGCQGSVISFPSPRSPAQWLLYTHPTHSWQRADLGAYLNPRPP i-------3333------------------------------------------------ APEAWSEPVLLAKGSCAYSDLQSMGTGPDGSPLFGCLYEANDYEEIVFLMFTLKQAFPAE -1111---------------------1111----------%%%%-------3333-3333 Y - >3-HYDROXYACYL-COA DEHYDRO; SWP:Q99714; PDB:1SO8A; RSVKGLVAVITGGASGLGLATAERLVGQGASAVLLDLPNSGGEAQAKKLGNNCVFAPADV --2222-----3333---------------------2222------------------11 TSEKDVQTALALAKGKFGRVDVAVNCAGIFQRVLDVNLMGTFNVIRLVAGEMGQNEPDQG 11------------------------------------------------3333---111 GQRGVIINTASVAAFEGQVGQAAYSASKGGIVGMTLPIARDLAPIGIRVMTIAPGLFGTP 1------------------3333-----3333---------3333--------------- LLTDPAEYAHLVQAIIENPFLNGEVIRL -----------------1111------- >CYTOCHROME C OXIDASE ASSE; SWP:Q92RG6; PDB:1SO9A; VEQASDLILDEKIKVTFDANVAAGLPWEFVPVQRDIDVRIGETVQIMYRAKNLASTPTTG ---------------------3333----------------------------------- QATFNVTPMAAGAYFNKVQCFCFTETTLEPGEEMEMPVVFFVDPEIVKPVETQGIKTLTL -----------1111---------------------------3333--3333-------- SYTFYPREPSK ----------- >PROTEASE DEGS; SWP:P31137; PDB:1SOTA; TPASYNLAVRRAAPAVVNVYNRGLNTNSHNQLEIRTLGSGVIDQRGYIITNKHVINDADQ -----------3333-----------1111------------1111----33332222-- IIVALQDGRVFEALLVGSDSLTDLAVLKINATGGLPTIPINARRVPHIGDVVLAIGNPYN ----1111-----------1111-----------------1111--2222-------111 LGQTITQGIISATGRIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELGINTLSFDEGI 1--------------------3333--------1111------1111------------- GFAIPFQLATKIDKLIRDGRVIRGYIGIGGRGIVVNEVSPDGPAANAGIQVNDLIISVDN -----------------------------------------3333--------------- KPAISALETDQVAEIRPGSVIPVVVQLTLQVTIQEYPA ----3333---11112222------------------- >5,10-METHENYLTETRAHYDROFO; SWP:O67621; PDB:1SOUA; MLKSELRKKVLHKRINLSEEERRRLSEKVISNLKSLPEFKKSKKVALYCPIKGEVDLTPL ------------1111-------------------3333-----------------1111 FPEVLKEKELILPKVEGNEISLYRVHSPACLGVGAFGIMEPVEGERVNPEDVDFIAVPGV 
-----------------------------------------------1111--------- AFDLEGYRLGFGKGYYDRLLKRVKGLKVGVAYSFQVFERLPRDAWDIPVDVLVTEKNVRR --------------33333333---------3333------------------------- LRDGRSLEHHHHHH -------------- >L-LACTATE DEHYDROGENASE; SWP:Q27797; PDB:1SOVA; GTVSRRKKIAMIGSGMIGGTMGYLCVLRELADVVLFDVVTGMPEGKALDDSQATSIADTN ---------------------------------------------------3333----- VSVTSANQYEKIAGSDVVIITAGLTKVPSRNDLLPFNAKIIREVAQGVKKYCPLAFVIVV -------33332222-------------3333---------------------------- TNPLDCMVKCFHEASGLPKNMVCGMANVLDSARFRRFIADQLEISPRDIQATVIGTHGDH -----------------1111-----------------------3333---------111 MLPLARYVTVNFP 1--3333------ >SP1F3; SWP:P08047; PDB:1SP1; KKFACPECPKRFMRSDHLSKHIKTHQNKK ----3333-------3333-3333----- >SP1F2; SWP:P08047; PDB:1SP2; RPFMCTWSYCGKRFTRSDELQRHKRTHTGEK ---------------3333-3333------- >CYTOCHROME C, PUTATIVE; SWP:Q8E9W8; PDB:1SP3A; ANPHKDVLKGPFTTGSEVTTQCLTCHEEQATDMMKTSHWTWELEQKLPDRTVVRGKKNSI --3333---------------3333---------------------1111----3333-- NNFCVAISSNEPRCTSCHAGYGWKDNTFDFKDKTKVDCLICHDTTGTYVKDPAGAGEPMA ------222233331111------1111---1111---1111--------1111------ KLDLAKIAQNVGAPVRDNCGSCHFYGKHGDLDSSMAYPDKATDVHMDSDGNNFQCQNCHT -------1111---3333-------------1111---3333----1111---3333--- TEKHQISGNAMGVSPGGIDHIGCENCHDSAPHSNKKLNTHTATVACQTCHIPFFAKNEPT -%%%%----2222-------------------------------3333------------ KMQWDWSTAGDDKPETVDQYGKHTYQKKKGNFVWEKMVKPQYAWYNGTANAYMAGDKMDS -----1111--------1111-------------------------------2222--11 NVVTKLTYPMGDINDAKAKIYPFKVHTGKQIYDKKLNIFITPKTYGKGGYWSEFDWNLAA 11---------1111------------------------------2222----------- KLGMEANPTMLEKGIKYSGEYDFAATEMWWRINHMVSPKEQALNCNDCHNKGTRLDWQAL ----------1111-----------------------3333--1111-!!!!---3333- GYQGDPMKNKQGPKHK ----3333-------- >4-HYDROXYPHENYLPYRUVATE D; SWP:NA; PDB:1SP8A; RFNPRSDRFHTLAFHHVELWCADAASAAGRFSFGLGAPLAARSDLSTGNSAHASLLLRSG -------------------------------------------3333-----------!! 
SLSFLFTAPYAHGADAATAALPSFSAAAARRFAADHGLAVRAVALRVADAEDAFRASVAA !!------------3333--3333---------------------------------111 GARPAFGPVDLGRGFRLAEVELYGDVVLRYVSYPDGAAGEPFLPGFEGVASPGAADYGLS 1----------iiii-------!!!!-------1111-----2222----1111------ RFDHIVGNVPELAPAAAYFAGFTGFHEFAEFTTGLNSMVLANNSENVLLPLNEPVHGTKR -----------------------------------------1111--------------- RSQIQTFLDHHGGPGVQHMALASDDVLRTLREMQARSAMGGFEFMAPPTSDYYDGVRRRA -----------------------------------------------------------1 GDVLTEAQIKECQELGVLVDRDDQGVLLQIFTKPVGDRPTLFLEIIQRIGCMEKDEKGQE 111------------------2222----------------------------------- YQKGGCGGFGKGNFSQLFKSIEDYEKSL ----2222-1111--------------- >4-HYDROXYPHENYLPYRUVATE D; SWP:P93836; PDB:1SP9A; VRKNPKSDKFKVKRFHHIEFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTS -----------------------3333-----------------3333-----------! GDLRFLFTAPYSPSTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAESAFSISVANGAI !!!--------------1111--------------------------------------- PSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKFLPGFERVEPLDYGIRRLDHAVGNVPELG ------------------------------------------------------------ PALTYVAGFTGFHQFASGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEHNEGAGL -------------------------1111----------------3333---1111---- QHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRVGDVLSDDQIKECEELG --------3333--------------------------3333------3333-------- ILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVGCMMKDEEGKAYQSGGCGGFGKGNFSE ------------------------------------------------2222----3333 LF -- >Subtilisin BPN' [Precurso; SWP:P00782; PDB:1SPBP; EKKYIVGFKQTMSTMSAAKKKDVISEKGGKVQKQFKYVDAASATLNEKAVKELKKDPSVA ---------------3333-----1111-----------------3333------1111- YVEEDHVAHAY ----------- >HEMOGLOBIN; SWP:P56250; PDB:1SPGA; SLSATDKARVKALWDKIEGKSAELGAEALGRMLVSFPQTKIYFSEWGQDLGPQTPQVRNH -------------------3333-------------------3333----3333------ GAVIMAAVGKAVKSIDNLVGGLSQLSELHAFKLRVDPANFKILAHNIILVISMYFPGDFT -------------1111-1111-------------3333----------------1111- PEVHLSVDKFLACLALALSEKYR ------------------1111- >Hemoglobin 
subunit beta; SWP:P56251; PDB:1SPGB; VDWTDAERAAIKALWGKIDVGEIGPQALSRLLIVYPWTQRHFKGFGNISTNAAILGNAKV -------------3333-3333--------------3333-1111--------------- AEHGKTVMGGLDRAVQNMDNIKNVYKQLSIKHSEKIHVDPDNFRLLGEIITMCVGAKFGP ----------------11113333--------------3333------------------ SAFTPEIHEAWQKFLAVVVSALGRQYH --------------------------- >HISTIDINE-CONTAINING PHOS; SWP:P08877; PDB:1SPHA; AQKTFKVTADSGIHARPATVLVQTASKYDADVNLEYNGKTVNLKDIMGVMSLGIAKGAEI --------1111-----------3333--------iiii--1111-3333----2222-- TISASGADENDALNALEETMKSEGLGE -----1111-----------1111--- >FRUCTOSE 1,6-BISPHOSPHATA; SWP:P22418; PDB:1SPIA; AATQTKARTRSKYEIETLTGWLLKQPMAGVIDAELTIVLSSISLACKQIASLVQRAKLDV ----------------------------------3333---------------------- VSNEVFSSCLRSSGRTGIIASEEEDVPVAVEESYSGNYIVVFDPLDGSSNIDAAVSTGSI ----3333----3333------------------------------3333---------- FGIYSPNDECIVDSDHDDESQLSAEEQRCVVNVCQPGDNLLAAGYCMYSSSVIFVLTIGK ------------------------------------------------------------ GVYAFTLDPMYGEFVLTSEKIQIPKAGKIYSFNEGNYKMWPDKLKKYMDDLKEPGESQKP --------------------------------3333----3333---3333--------- YSSRYIGSLVGDFHRTLLYGGIYGYPRDAKSKNGKLRLLYECAPMSFIVEQAGGKGSDGH -------3333-------------------3333-------3333--------------- QRILDIQPTEIHQRVPLYIGSVEEVEKLEKYLA ---------------------1111-------- >KALLIKREIN 1; SWP:P06870; PDB:1SPJA; IVGGWECEQHSQPWQAALYHFSTFQCGGILVHRQWVLTAAHCISDNYQLWLGRHNLFDDE -------22221111----------------1111---1111------------1111-1 NTAQFVHVSESFPHPGFNMSLLENRQA 111----------3333---------- ------------------------------------------------------------ ------------ >MAJOR SEMINAL PLASMA GLYC; SWP:P35495; PDB:1SPPA; LDYHACGGRLTDDYGTIFTYKGPKTECVWTLQVDPKYKLLVSIPTLNLTCGKEYVEVLEG ---------------------------------3333----------------------- APGSKSLGKFCEGLSILNRGSSGMTVKYKRDSGHPASPYEIIFLRDSQG 2222--------------------------2222--------------- >Major seminal plasma glyc; SWP:P35496; PDB:1SPPB; ARINGPDECGRVIKDTSGSISNTDRQKNLCTWTILMKPDQKVRMAIPYLNLACGKEYVEV 
----3333----------------------------1111-----------2222----- FDGLLSGPSYGKLCAGAAIVFLSTANTMTIKYNRISGNSSSPFLIYFYGSSP ---1111--------------------------------------------- >PUTATIVE POLYPROTEIN/PHOS; SWP:P0A8D6; PDB:1SPVA; TRIHVVQGDITKLAVDVIVNAANPSLGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTG --------1111----------3333-------------------------------222 HAVITLAGDLPAKAVVHTVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAIST 2-----!!!!-----------------------------------1111-------2222 GVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYDEENAHLYERLLTQQ 1111-----------------1111-------------------------- >SHORT-CHAIN REDUCTASE FAM; SWP:Q18946; PDB:1SPXA; TRFAEKVAIITGSSNGIGRATAVLFAREGAKVTITGRHAERLEETRQQILAAGVSEQNVN --2222-------------------1111-------------------------3333-- SVVADVTTDAGQDEILSTTLGKFGKLDILVNNAGQSIESYDATLNLNLRSVIALTKKAVP ----1111---------------------------------------------------- HLSSTKGEIVNISSIASGLHATPDFPYYSIAKAAIDQYTRNTAIDLIQHGIRVNSISPGL ---------------------1111---------------------1111---------- VATGFYSTMATMKECVPAGVMGQPQDIAEVIAFLADRKTSSYIIGHQLVVDGGSSLI ---------------3333---3333---------33331111-------iiii--- >CHORISMATE SYNTHASE; SWP:Q9PM41; PDB:1SQ1A; NTFGTRLKFTSFGESHGVAVGCIIDGPAGVKFDEEFLQNELDKRKGDKAQVLSGVFEGYT ---------------------------------------1111------------iiii- TGHPIAIVVFSARESVARVAGGAVAALLREFDICVQSGVFGVGTFVSNLKEEEFDFEFAK --------------------------3333-----------!!!!--------------- KSEIFCLDPKLESDFKNEILNARNSKDSVGAAVFTKVSGLIGLGEVLYDKLDSKLAHALG -------3333------------------------------------------------- INAVKAVEIGEGINASKRGSCNNDALKDGKFLSNHSGGILGGISNGENLILKTYFKPTPG 2222--------3333-3333------------3333--iiii----------------- RHDPCVGVRGSVVASAVRLVLADCLLLNASANLNNLKNAYG -------------------------1111--------1111 >Novel antigen receptor [F; SWP:Q8AXI4; PDB:1SQ2N; RVDQTPRSVTKETGESLTINCVLRDASYALGSTCWYRKKSGEGNEESISKGGRYVETVNS -----------2222-----------------------------------!!!!-----1 GSKSFSLRINDLTVEDGGTYRCGLGVAGGYCDYALCSSRYAECGDGTAVTVN 111---------3333---------------3333----------------- 
>GLYOXYLATE-INDUCED PROTEI; SWP:Q9I4J5; PDB:1SQ4A; KSSYYAPHGGHPALLTDRAFTEAYAVIPKGVRDIVTSHLPFWDNRWVIARPLSGFAETFS ---------------------------11111111---2222------------------ QYIVELAPNGGSDKPEQDPNAEAVLFVVEGELSLTLQGQVHAQPGGYAFIPPGADYKVRN ------2222-------1111--------------%%%%---2222----2222------ TTGQHTRFHWIRKHYQKVDGVPLPEAFVTNEQDIQPLVPDTEGRWSTTRFVDSDRHDHVN -----------------2222--------3333------%%%%----------------- IVNFEPGGVIPFAETHVEHGLYVLEGKAVYRLNQDWVEVEAGDFWLRAFCPQACYSGGPG ---------------------------------------2222----------------- RFRYLLYKDVNRHRLTLN ------------------ >PANTOTHENATE KINASE; SWP:P15044; PDB:1SQ5A; MTPYLQFDRNQWAALRDMLSEDEIARLKGINEDLSLEEVAEIYLPLSRLLNFYISSNLRR ------------------------------1111-------------------------- QAVLEQFLGTNQRIPYIISIAGSVAVGKSTTARVLQALLSRWPEHRRVELITTDGFLHPN ----------------------2222---------------3333------3333----- QVLKERGLMKKKGFPESYDMHRLVKFVSDLKSGVPNVTAPVYSHLIYDVIPDGDKTVVPD -------1111--3333--------------------------1111------------- ILILEGLNVLQSGMDYPHDPHHVFVSDFVDFSIYVDAPEDLLQTWYINRFLKFREGAFTD -----1111--33333333----3333-----------------------------1111 PDSYFHNYAKLTKEEAIKTAMTLWKEINWLNLKQNILPTRERASLILTKSANHAVEEVRL -----1111--------------------------3333----------2222------- RK -- >DH434; SWP:P16117; PDB:1SQ8A; MLMGERIRARRIQLGLNQAELAQKVGVDQQAIEQLENGKAKRPRFLPELARALGVAVDWL -----------1111------------3333-----------1111-------------- LNGA ---- >ANTIVIRAL PROTEIN SKI8; SWP:Q02793; PDB:1SQ9A; KVFIATANAGKAHDADIFSVSACNSFTVSCSGDGYLKVWDNKLLDNENPKDKSYSHFVHK ----------------------3333----1111---------22223333-------11 SGLHHVDVLQAIERDAFELCLVATTSFSGDLLFYRITREDETKKVIFEKLDLLDSDMKKH 11-----------------------1111------------------------3333--- SFWALKWGASNSHRLVATDVKGTTYIWKFHPFADESNSLTLNWSPTLELQGTVESPMTPS ------------------1111-----------33331111------------------- QFATSVDISERGLIATGFNNGTVQISELSTLRPLYNFESQHNNSNSIRSVKFSPQGSLLA --------1111------------------------------------------------ IAHDSNSFGCITLYETEFGERIGSLSVPGEFAHSSWVMSLSFNDSGETLCSAGWDGKLRF 
----iiii----------------------------------3333-------------- WDVKTKERITTLNMHCDDIEIEEDILAVDEHGDSLAEPGVFDVKFLKKGWRSGMGADLNE --1111--------1111--3333----1111--------------2222--%%%%---- SLCCVCLDRSIRWFREAG -----1111--------- >HYPOTHETICAL PROTEIN PG13; SWP:Q99X56_STAAM; PDB:1SQEA; HMFMAENRLQLQKGSAEETIERFYNRQGIETIEGFQQMFVTKTLNTEDTDEVKILTIWES -----------222211113333--%%%%--2222------------------------- EDSFNNWLNSDVFKEAHDDGQQSPILSNKVFKYDIGYHYQK ----------3333---2222-------------------- >SUN PROTEIN; SWP:P36929; PDB:1SQGA; RNLRSMAAQAVEQVVEQGQSLSNILPPLQQKVSDKDKALLQELCFGVLRTLSQLDWLINK -------------------3333--3333--------------------3333------- LMARPMTGKQRTVHYLIMVGLYQLLYTRIPPHAALAETVEGAIAIKRPQLKGLINGVLRQ ------!!!!--------------------------------11111111---------- FQRQQEELLAEFNASDARYLHPSWLLKRLQKAYPEQWQSIVEANNQRPPMWLRINRTHHS ----------------1111----------------------1111--------3333-- RDSWLALLDEAGMKGFPHADYPDAVRLETPAPVHALPGFEDGWVTVQDASAQGCMTWLAP -----------------1111----------33332222--------3333--------- QNGEHILDLCAAPGGKTTHILEVAPEAQVVAVDIDEQRLSRVYDNLKRLGMKATVKQGDG 2222------------------------------------------------------33 RYPSQWCGEQQFDRILLDAPCSATGVIRRHPDIKWLRRDRDIPELAQLQSEILDAIWPHL 33--------------------111111113333---3333--------------3333- KTGGTLVYATCSVLPEENSLQIKAFLQRTADAELCETGTPEQPGKQNLPGAEEGDGFFYA 2222---------3333-----------1111------3333-------1111------- KLIK ---- >HYPOTHETICAL PROTEIN CG14; SWP:Q9VR51; PDB:1SQHA; GDILRPLSDSEVDELLDLYKVKFGIRNFHYLLLYNQRKWDRQLSEAQIPRNDLNHISLRK -----------------------1111------------------------11111111- QFYTHRRGNFRTWGTYVSLHRDIVQSVSFFSWQPDGAAELWECLEQTQLIEWTQGALLTN --------3333--------------------------------------1111------ VDLGFCNRVKELAVSRGVTAIQPRQCFGVLSHEDAFCAKVPDLPSEFEIRRLRAEDAAVH -1111--------1111--------------------------3333-----3333--33 DSWPNKGEGSLTYLQALVRFNKSLGICRSDTGELIAWIFQNDFSGLGLQVLPKAERRGLG 33------------------------------------------------1111------ GLLAAASREIARGEEITLTAWIVATNWRSEALLKRIGYQKDLVNEWIKLVPNS 
----------------------1111--------------------------- >4-HYDROXYPHENYLPYRUVIC AC; SWP:P32755; PDB:1SQIA; GPKPERGRFLHFHSVTFWVGNAKQAASFYCNKMGFEPLAYKGLETGSREVVSHVIKQGKI -----------------------------------------3333-----------!!!! VFVLCSALNPWNKEMGDHLVKHGDGVKDIAFEVEDCEHIVQKARERGAKIVREPWVEEDK ----------------------------------------------------------11 FGKVKFAVLQTYGDTTHTLVEKINYTGRFLPGFEAPTYKDTLLPKLPSCNLEIIDHIVGN 11---------!!!!--------------2222------3333----------------- QPDQEMESASEWYLKNLQFHRFWSVLRSIVVANYEESIKMPINEPASQIQEYVDYNGGAG -2222---------------------------3333------------------------ VQHIALRTEDIITTIRHLRERGMEFLAVPSSYYRLLRENLKTSKIQVKENMDVLEELKIL ------------------3333----------------3333------------------ VDYDEKGYLLQIFTKPMQDRPTLFLEVIQRHNHQGFGAGNFNS ------------------------------------3333--- >oligoxyloglucan reducing-; SWP:Q8J0D2; PDB:1SQJA; YEFKNVAIGGGGYITGIVAHPKTKDLLYARTDIGGAYRWDAGTSKWIPLNDFIEAQDMNI ----------------------2222--------------1111-----33333333111 MGTESIALDPNNPDRLYLAQGRYVGDEWAAFYVSEDRGQSFTIYESPFPMGANDMGRNNG 1-------1111----------1111---------iiii--------------2222--- ERLAVNPFNSNEVWMGTRTEGIWKSSDRAKTWTNVTSIPDAFTNGIGYTSVIFDPERNGT -----1111-----------------iiii------------2222-------1111--- IYASATAPQGMYVTHDGGVSWEPVAGQPSSWLNRTTGAFPDKKPASIAPQPMKVALTPNF ---------------iiii------------33333333-----------------1111 LYVTYADYPGPWGVTFGEVWRQNRTSGAWDDITPRVGNSSPAPYNNQTFPAGGFCGLSVD ----------------------------------2222---------------------1 ATNPNRLVVITLDRDPGPALDSIYLSTDAGATWKDVTQLSSPSNLEGNWGHPTNAARYKD 111------------------------iiii-----3333----iiii---3333--111 GTPVPWLDFNNGPQWGGYGAPHGTPGLTKFGWWMSAVLIDPFNPEHLMYGTGATIWATDT 1------%%%%---2222-----2222------------1111----------------1 LSRVEKDWAPSWYLQIDGIEENAILSLRSPKSGAALLSGIGDISGMKHDDLTKPQKMFGA 1111111------------------------------------------1111------- PQFSNLDSIDAAGNFPNVVVRAGSSGHEYDSACARGAYATDGGDAWTIFPTCPPGMNASH -----------1111-----------------1111----------------22221111 
YQGSTIAVDASGSQIVWSTKLDEQASGPWYSHDYGKTWSVPAGDLKAQTANVLSDKVQDG --------1111--------1111--------iiii------------------------ TFYATDGGKFFVSTDGGKSYAAKGAGLVTGTSLMPAVNPWVAGDVWVPVPEGGLFHSTDF -----iiii-----iiii-----2222----------1111-----------------ii GASFTRVGTANATLVSVGAPKAPSAVFIWGTDKPGSDIGLYRSDDNGSTWTRVNDQEHNY ii------------------------------------------iiii------1111ii SGPTMIEADPKVYGRVYLGTNGRGIVYADLTNEEKSTAKCANGQKGTHCY ii------1111---------------------------1111------- >DIHYDRONEOPTERIN ALDOLASE; SWP:Q9SF23; PDB:1SQLA; GDKLILKGLKFYGFHGAIAEERTLGQMFLVDIDAWVSLKKAGESDNLEDTISYVDIFSLA -----------------3333----------------3333----3333----------- KEIVEGSPRNLLETVAELIASKTLEKFHQINAVRVKLSKPNVALIKSTIDYLGVDIFRQR --------------------------3333-----------2222--------------- >PROGESTERONE RECEPTOR; SWP:P06401; PDB:1SQNA; QLIPPLINLLMSIEPDVIYAGHDNPDTSSSLLTSLNQLGERQLLSVVKWSKSLPGFRNLH ----------1111----------------------------------------3333-- IDDQITLIQYSWMSLMVFGLGWRSYKHVSGQMLYFAPDLILNEQRMKESSFYSLCLTMWQ -----------------------------------1111--3333--------------- IPQEFVKLQVSQEEFLCMKVLLLLNTIPLEGLRSQTQFEEMRSSYIRELIKAIGLRQGVV ---------------------------1111----------------------------- SSSQRFYQLTKLLDNLHDLVKQLHLYCLNTFIQSRALSVEFPEMMSEVIAAQLPKILAGM ----------------------------------1111---------------------- VKPLLFHK -------- >50S RIBOSOMAL PROTEIN L35; SWP:Q8TZV6; PDB:1SQRA; MRIKGVVLSYRRSKENQHNNVMIIKPLDVNSREEASKLIGRLVLWKSPSGKILKGKIVRV ------------------------------3333-1111--------%%%%--------- HGTKGAVRARFEKGLPGQALGDYVEIV ----------1111------------- >CONSERVED HYPOTHETICAL PR; SWP:Q97NR6; PDB:1SQSA; NKIFIYAGVRNHNSKTLEYTKRLSSIISSRNNVDISFRTPFNSELEISNSDSEELFKKGI ----------1111------------------------1111------------------ DRQSNADDGGVIKKELLESDIIIISSPVYLQNVSVDTKNFIERIGGWSHLFRLAGKFVVT --1111----------------------%%%%-----------3333---1111------ LDVAESNGSDNVSEYLRDIFSYGGQILHQVSITNSLKDIAEAQLEATYKIEDVLEGKIKY --------------------------------333311113333--------1111---- 
KTTDYQERAYQTLKLILENYDSEHFEKYWEKKRLFEANSLEEWYYVEN ---------------3333-1111-----1111--------------- >CHEX PROTEIN; SWP:Q9X1V3; PDB:1SQUA; MDARIVNALIGSVYETIRDVLGIEPKTGKPSTVSHIEIPHSLVTVIGITGGIEGSLIYSF ------------------------------------------------------------ SSETALKVVSAMMGGMEYNQLDELALSAIGELGNMTAGKLAMKLEHLGKHVDITPPTVVS ------------iiii---------------------------1111------------- GRDLKIKSFGVILKLPISVFSEEDFDLHLSVK -------------------------------- >SACCHAROMYCES CEREVISIAE ; SWP:Q9Y221; PDB:1SQWA; MRPLTEEETRVMFEKIAKYIGENLQLLVDRPDGTYCFRLHNDRVYYVSEKIMKLAANISG -------------------!!!!3333--1111------iiii----3333--3333--- DKLVSLGTCFGKFTKTHKFRLHVTALDYLAPYAKYKVWIKPGAEQSFLYGNHVLKSGLGR ---1111------1111----3333---3333--------------1111---3333--- ITENTSQYQGVVVYSMADIPLGFGVAAKSTQDCRKVDPMAIVVFHQADIGEYVRHE -----2222-----1111----------3333----1111-------3333----- ------------------------------------------------------------ -------------------------------------------------------- >APO-CCME; SWP:P33928; PDB:1SR3A; LRSNIDLFYTPGEILYGKRETQQMPEVGQRLRVGGMVMPGSVQRDPNSLKVTFTIYDAEG -----------3333----------1111--------2222------------------- SVDVSYEGILPDLFREGQGVVVQGELEKGNHILAKEVLAKHDENYTPPEVEKAM ----------3333---------------------------------------- >CYTOLETHAL DISTENDING TOX; SWP:O06522; PDB:1SR4A; LNLLSSSGPNRQVLPSEPSNFMTLMGQNGALLTVWALAKRNWLWAYPNIYSQDFGNIRNW ----1111-------3333------1111--------2222-----33331111------ KMEPGKHREYFRFVNQSLGTCVEAYGNGLIHDICSLDKLAQEFELLPTDSGAVVIKSVSQ ------2222--------------!!!!------11111111---------------111 GRCVTYNPVSTTFYSTVTLSVCDGATEPSRDQTWYLAPPVLEATAVN 1-------------------------1111----------------- >Cytolethal distending tox; SWP:O06523; PDB:1SR4B; NLSDFKVATWNLQGSSAVNESKWNINVRQLLSGEQGADILMVQEAGSLPSSAVRTSRVIQ 3333----------33333333----------1111------------1111-------- HGGTPIEEYTWNLGTRSRPNMVYIYYSRLDVGANRVNLAIVSRRQADEAFIVHSDSSVLQ --------------3333------------------------------------------ SRPAVGIRIGTDVFFTVHALATGGSDAVSLIRNIFTTFNSPPERRVYSWMVVGDFNRAPA 
--------!!!!---------%%%%---------------3333---------------- NLEVALRQEPAVSENTIIIAPTEPTHRSGNILDYAILHDAHLPRREQARERIGASLMLNQ ------------1111---------1111---------11112222----------3333 LRSQITSDHFPVSFVRDR ------------------ >Cytolethal distending tox; SWP:O06524; PDB:1SR4C; DPTTYPDVELSPPPRISLRSLLTAQPVKNDHYDSHNYLSTHWELIDYKGKEYEKLRDGGT ----1111------------------------1111-1111----------3333%%%%- LVQFKVVGAAKCFAFLGKGTTDCKDTDHTVFNLIPTNTGAFLIKDALLGFCITSHDFDDL -----2222------!!!!--33331111------1111--------------------- KLEPCGGSVSGRTFSLAYQWGILPPFGPSKILIP --------2222---------------------- >COBALAMIN BIOSYNTHESIS PR; SWP:O29535; PDB:1SR8A; MLIDPIELYRYPEKWIKDRDAEKKVRSGLYILTEDGYLRRGITTGTTASAAAVAAIASLK -----------1111--1111--3333-----------------------------3333 EKVEKVKVSTPAGVDVEVEVEAEKGFARVRKFSGDHEFDVTNGIIFEAEVCETSGIFFGR ---------3333---------iiii------!!!!--1111----------------22 GVGVKAGEKAVSRSAKLQILENFIKASREFNFSGGVRISVPDGEEVAKKTGNEKVGIKGG 22--iiii-------------------1111--------1111--------1111----- ISILGTTGFVEPWCKKLVETKLKIAMQYHRIAITTGRKAWLYARKKFPEYQPFVFGVHID -----------------------1111-------------------1111----!!!!33 EALKHPGEKIIVGFPGLLKIWAGSRDRIEERAREEGVRVVVI 33-----------3333------1111--------------- >2-ISOPROPYLMALATE SYNTHAS; SWP:P96420; PDB:1SR9A; TIVKPAGPPRVGQPSWNPQRASSPVNRYRPFAEEVEPIRLRNRTWPDRVIDRAPLWCAVD ---------22221111------1111--3333----------3333-----------11 LRDGNQALIDPSPARKRRFDLLVRGYKEIEVGFPSASQTDFDFVREIIEQGAIPDDVTIQ 11-3333-------------------------1111-----------3333--1111--- VLTQCRPELIERTFQACSGAPRAIVHFYNSTSILQRRVVFRANRAEVQAIATDGARKCVE --------------1111------------------------------------------ QAAKYPGTQWRFEYSPESYTGTELEYAKQVCDAVGEVIAPTPERPIIFNLPATVETTPNV -----------------3333-------------------3333---------------- YADSIEWSRNLANRESVILSLHPHNDRGTAVAAAELGFAAGADRIEGCLFGNGERTGNVC ------------3333--------1111---------1111-------%%%%-!!!!--- LVTLGLNLFSRGVDPQIDFSNIDEIRRTVEYCNQLPVHERHPYGGDLVYTAFSGSHQDAI -------1111--------------------------1111---1111------------ 
NKGLDAKLDADAADCDVDDLWQVPYLPIDPRDVGRTYEAVIKGGVAYIKTDHGLSLPRRL ----------1111-3333---2222--3333-----------3333------------- QIEFSQVIQKIEVSPKEWDAFAEEYLAPVRPLERIRQHVDAADDDGGTTSITATVKINGV -------3333--3333--------------------------2222---------iiii ETEISGSGNGPLAAFVHALADVGFDVAVLDYYEHASAGDDAQAAAYVEASVTISKTVWGV -----------------3333--------------------------------------- GIAPSITTASLRAVVSAVNRAA ---------------------- >SPARC; SWP:P09486; PDB:1SRA; PPCLDSELTEFPLRMRDWLKNVLVTLYERDEDNNLLTEKQKLRVKKIHENEKRLEAGDHP ---3333-----------------------%%%%---------------1111------3 VELLARDFEKNYNMYIFPVHWQFGQLDQHPIDGYLSHTELAPLRAPLIPMEHCTTRFFET 333-------3333---------------------33333333-1111-1111----333 CDLDNDKYIALDEWAGCFGIKQKDIDKDLVI 31111----3333--1111-3333-3333-- >COPPER,ZINC SUPEROXIDE DI; SWP:P07505; PDB:1SRDA; ATKKAVAVLKGTSNVEGVVTLTQEDDGPTTVNVRISGLAPGKHGFHLHEFGDTTNGCMST ------------------------------------------------------!!!!-- GPHFNPDKKTHGAPEDEVRHAGDLGNIVANTDGVAEATIVDNQIPLTGPNSVVGRALVVH ----1111---------------------1111--------------22222222----- ELEDDLGKGGHELSPTTGNAGGRLACGVVGLTPV ----%%%%----1111------------------ >ZINC FINGER PROTEIN ZFPM1; SWP:O35615; PDB:1SRKA; GSSGKRPFVCRICLSAFTTKANCARHLKVHTDTLS ------------------3333----3333----- >PNPASE; SWP:P05055; PDB:1SRO; AEIEVGRVYTGKVTRIVDFGAFVAIGGGKEGLVHISQIADKRVEKVTDYLQMGQEVPVKV ----------------1111------------------------1111------------ LEVDRQGRIRLSIKEA ---1111--------- >GTPASE-ACTIVATING PROTEIN; SWP:P47736; PDB:1SRQA; KVKLECNPTARIYRKHFLGKEHFNYYSLDTALGHLVFSLKYDVIGDQEHLRLLLRTKCRT ------11113333--2222-----------------------!!!!------------- YHDVIPISCLFPNVVQMAKLVCEDVNVDRFYPVLYPKASRLIVTFDEHVISNNFKFGVIY ---------------------3333---------1111------1111------------ QKLGQTSEEELFSTNEESPAFVEFLEFLGQKVKLQDFKGFRGGLDVTHGQTGTESVYCNF -3333-----------------------------------iiii---------------- RNKEIMFHVSTKLPYTEGDAQQLQRKRHIGNDIVAVVFQDENTPFVPDMIASNFLHAYVV --------3333---2222-----33331111-------------3333----------- 
VQAEGGPLYKVSVTARDDVPFFGPPLPDPAVFRKGPEFQEFLLTKLINAEYACYKAEKFA ---------------3333-------1111--------------------------1111 KLEERTRAALLETLYEELHIHSQSMMGLGG ------------------------------ >SPORULATION RESPONSE REGU; SWP:P06628; PDB:1SRRA; NEKILIVDDQSGIRILLNEVFNKEGYQTFQAANGLQALDIVTKERPDLVLLDMKIPGMDG ---------------------1111-------3333------------------------ IEILKRMKVIDENIRVIIMTAYGELDMIQESKELGALTHFAKPFDIDEIRDAVKKYLPL ----------1111--------------------------------------------- >GROEL (HSP60 CLASS); SWP:P61491; PDB:1SRVA; GYQFDKGYISPYFVTNPETMEAVLEDAFILIVEKKVSNVRELLPILEQVAQTGKPLLIIA ---------3333--3333------------------3333--------1111------- EDVEGEALATLVVNKLRGTLSVAAVKAPGFGDRRKEMLKDIAAVTGGTVISEELGFKLEN --------------------------------------------------3333--3333 ATLSMLGRAERVRITKDETTIVGGK -3333---------1111------- >Gamma-aminobutyric acid t; SWP:Q9Z0U4; PDB:1SRZA; EAEFVRICSKSYLTLENGKVFLTGGDLPALDGARVEFRCDPDFHLVGSSRSVCSQGQWST --------3333-----------------------------------------%%%%--- PKPHCQVN -------- >POLLEN ALLERGEN OLE E 6; SWP:O24172; PDB:1SS3A; DEAQFKECYDTCHKECSDKGNGFTFCEMKCDTDCSVKDVKEKLENYKPKN -3333----------3333---3333------------------------ >GLYOXALASE FAMILY PROTEIN; SWP:Q81F54; PDB:1SS4A; AAKNKLLRDNVSIVVESLDNAISFFEEIGLNLEGRANVEGEWAGRVTGLGSQCVEIAVTP ------------------------------------------3333------------11 DGHSRIELSRFLTPPTIADHRTAPVNALGYLRVFTVEDIDEVSRLTKHGAELVGEVVQYE 11-----------------------------------3333---3333------------ NSYRLCYIRGVEGILIGLAEELG ----------iiii--------- >NSFL1 COFACTOR P47; SWP:Q9UNZ2; PDB:1SS6A; GSEKRQHSSQDVHVVLKLWKSGFSLDNGELRSYQDPSNAQFLESIRRGEVPAELRRLAHG ------------------------1111------1111-------------3333----- GQVNLDMEDHRDEDFVKPKGAFKAFTGEGQKLGSTAPQVLST ---------!!!!----------------------------- >AP-1 LIKE TRANSCRIPTION F; SWP:P19880; PDB:1SSEA; NLDSNMFSNDFNFENQFDEQVSEFCSKMNQVCGTR ---------3333---------------------- >AP-1-like transcription f; SWP:P19880; PDB:1SSEB; NGSSLQNADKINNGNDNDNDNDVVPSKEGSLLRCSEIWDRITTHPKYSDIDVDGLCSELM 
---------------------------------3333----------------------- AKAKCSERGVVINAEDVQLALNKHMN -------------------------- ------------------------------------------------------------ ------------------------------------------------------------ -------------------------------------- >HEPATOCYTE GROWTH FACTOR ; SWP:P08581; PDB:1SSLA; GSAMGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICL ----3333---------3333-----------33331111-------- >SERINE ACETYLTRANSFERASE; SWP:P43886; PDB:1SSQA; MNLDVWQHIRQEAKELAENEPMLASFFHSTILKHQNLGGALSYLLANKLANPIMPAISLR -3333--------------3333-------1111-3333-----------33333333-- EIIEEAYQSNPSIIDCAACDIQAVRHRDPAVELWSTPLLYLKGFHAIQSYRITHYLWNQN ---------------------------3333-11113333---------------1111- RKSLALYLQNQISVAFDVDIHPAAKIGHGIMFDHATGIVVGETSVIENDVSILQGVTLGG --------------------1111----------2222--1111--------2222---- TGKESGDRHPKVREGVMIGAGAKILGNIEVGKYAKIGANSVVLNPVPEYATAAGVPARIV ------------------2222--------2222--2222------2222---------- S - >LYSOZYME; SWP:P00720; PDB:1SSWA; MNIFEMLRIDEGLRLKIYKDTEGAAAAGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -----------------------------------------------------iiii--- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM ---------------------------11113333------------------------- LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYKNL -----------3333----------------------3333--- >PULMONARY SURFACTANT-ASSO; SWP:P07988; PDB:1SSZA; CWLCRALIKRIQAMIPKGGRMLPQLVCRLVLRCS ---------3333--------------------- >MRNA decapping enzyme var; SWP:Q53G42; PDB:1ST0B; VRLPFSGFRLQKVLRESARDKIIFLHGKVNEASGDGDGEDAVVILEKTPFQVEQVAQLLT -----------------------------1111--------------------------- GSPELQLQFSNDIYSTYHLFPPRQLNDVKTTVVYPATEKHLQKYLRQDLRLIRETGDDYR ----------!!!!-------3333-----------3333-------------------- NITLPHLESQSLSIQWVYNILDKKAEADRIVFENPDPSDGFVLIPDLKWNQQQLDDLYLI ------1111---------------1111----------------1111----------- AICHRRGIRSLRDLTPEHLPLLRNILHQGQEAILQRYRMKGDHLRVYLHYLPSYYHLNVH ---------3333-3333---------------------1111----------------- 
FTALGFEAPGSGVERAHLLAEVIENLECDPRHYQQRTLTFALRADDPLLKLLQEAQQ --1111-22221111-------------1111----------1111-------1111 >ACYL-COA-BINDING PROTEIN; SWP:P31787; PDB:1ST7A; VSQLFEEKAKAVNELPTKPSTDELLELYALYKQATVGDNDKEKPGIFNMKDRYKWEAWEN -------------------1111------------------------------------- LKGKSQEDAEKEYIALVDQLIAKYSS -------------3333--3333--- >FRUCTAN 1-EXOHYDROLASE II; SWP:Q93X60; PDB:1ST8A; QIEQPYRTGYHFQPPSNWMNDPNGPMLYQGVYHFFYQYNPYAATFGDVIIWGHAVSYDLV ---1111--------------------iiii-------1111------------------ NWIHLDPAIYPTQEADSKSCWSGSATILPGNIPAMLYTGSDSKSRQVQDLAWPKNLSDPF ------------3333------------------------1111----------3333-- LREWVKHPKNPLITPPEGVKDDCFRDPSTAWLGPDGVWRIVVGGDRDNNGMAFLYQSTDF ------1111---------1111---------1111---------%%%%----------- VNWKRYDQPLSSADATGTWECPDFYPVPLNSTNGLDTSVYGGSVRHVMKAGFEGHDWYTI -----------------------------------1111-1111-------iiii----- GTYSPDRENFLPQNGLSLTGSTLDLRYDYGQFYASKSFFDDAKNRRVLWAWVPETDSQAD ----1111---1111--------------------------------------------- DIEKGWAGLQSFPRALWIDRNGKQLIQWPVEEIEELRQNQVNLQNKNLKPGSVLEIHGIA ------------------1111-----------1111-----------2222-------1 ASQADVTISFKLEGLKEAEVLDTTLVDPQALCNERGASSRGALGPFGLLAMASKDLKEQS 111----------3333-----1111---------1111-------------1111---- AIFFRVFQNQLGRYSVLMCSDLSRSTVRSNIDTTSYGAFVDIDPRSEEISLRNLIDHSII --------1111---------1111-----------------3333---------!!!!- ESFGAGGKTCITSRIYPKFVNNEEAHLFVFNNGTQNVKISEMSAWSMKNAKFVVDQS ---%%%%----------3333------------------------------------ >Cystatin-B; SWP:P04080; PDB:1STFI; MMSGAPSATQPATAETQHIADQVRSQLEEKYNKKFPVFKAVSFKSQVVAGTNYFIKVHVG ------------------------------------------------------------ DEDFVHLRVFQSLPHENKPLTLSNYQTNKAKHDELTYF -----------------------------1111----- >SATELLITE PANICUM MOSAIC ; SWP:Q86993; PDB:1STMA; AAATSLVYDTCYVTLTERATTSFQRQSFPTLKGMGDRAFQVVAFTIQGVSAAPLMYNARL -----------------------3333--------------------------------- YNPGDTDSVHATGVQLMGTVPRTVRLTPRVGQNNWFFGNTEEAETILAIDGLVSTKGANA 
-2222-----------------------2222----11111111----------2222-- PSNTVIVTGCFRLAPSELQSS --------------------- >CYTOCHROME C PEROXIDASE, ; SWP:P00431; PDB:1STQA; LVHVASVEKGRSYEDFQKVYNAIALKLREDDEYDNYIGYGPVLVRLAWHTSGTWDKHDNT -------22223333--------------3333%%%%----------------------- GGSYGGTYRFKKEFNDPSNAGLQNGFKFLEPIHKEFPWISSGDLFSLGGVTAVQEMQGPK --1111----3333-3333----------------3333--------------------- IPWRCGRVDTPEDTTPDNGRLPDADKDADYVRTFFQRLNMNDREVVALSGAHTLGKTHLK ----------1111---------------------1111------------------333 NSGYEGPWTANNNVFDNSFYLNLLNEDWKLEKNDANNEQWDSKSGYLQLPTDYSLIQDPK 3-------------------------------1111-----3333--------------- YLSIVKEYANDQDKFFKDFSKAFEKLLENGITFPKDAPSPFIFKTLEEQGL -------------------------1111----1111-------3333--- >Heat-inducible transcript; SWP:Q9WZV5; PDB:1STZA; KLNDRQRKVLYCIVREYIENKKPVSSQRVLEVSNIEFSSATIRNDMKKLEYLGYIYQPHT ------------------------------------------------------------ SAGRIPTDKGLRFYYEEMLKISKETSEADLAVETFKSMPLADPEKVLFLAGNLLARLTEG --------------------1111----3333----!!!!-------------------- YVLIERPNTRDLKILRVMLIPVSEDYLIFSILTEFGVSKVTPIKTQERLNWEEIERQLNF ----------------------1111------1111------------------------ LLRGRTVGEVLMGKIESLKGSGFLRLIESLIGETVERYLDAGLENLLKDETLTLEDIRNL -2222---------3333-----------------------33331111----------- LEEVKDQKFLESLVGEGITVRIGREIGRKKLEKFAVFSGKYFKGESPIGSVYLFTSKVTK -3333----3333--------!!!!--3333-----------!!!!-------------- YDRNHRVFEYILNRLSEYFTSTS ----------------------- >NifU-like protein; SWP:Q9A1G2; PDB:1SU0B; LNHLYMAVVADHSKRPHHHGQLDGVEAVQLNNPTCGDVISLTVKFDEDKIEDIAFAGNGC ------------------------------------------------------------ TISTASSSMMTDAVIGKSKEEALALADIFSEMVQGQENPAQKELGEAELLAGVAKFPQRI ---------------------------------------3333------3333--3333- KCSTLAWNALKEAIKR ---------------- >HYPOTHETICAL PROTEIN YFCE; SWP:P76495; PDB:1SU1A; MMKLMFASDIHGSLPATERVLELFAQSGAQWLVILGDVLNHGPRNALPEGYAPAKVVERL -----------------------------------------1111--------------- 
NEVAHKVIAVRGNCDSEVDQMLLHFPITAPWQQVLLEKQRLFLTHGHLFGPENLPALNQN --3333-----11113333------------------------------1111----222 DVLVYGHTHLPVAEQRGEIFHFNPGSVSIPKGGNPASYGMLDNDVLSVIALNDQSIIAQV 2--------------!!!!-----------iiii-------------------------- AINP ---- >CARBON MONOXIDE DEHYDROGE; SWP:Q9F8A8; PDB:1SU8A; QNLKSTDRAVQQMLDKAKREGIQTVWDRYEAMKPQCGFGETGLCCRHCLQGPCRINPFGD 1111-------------------------1111--------------------------- EPKVGICGATAEVIVARGLDRSIAAGAAGHSGHAKHLAHTLKKAVQGKAASYMIKDRTKL ----1111-----------------------------------1111-1111-------- HSIAKRLGIPTEGQKDEDIALEVAKAALADFHEKDTPVLWVTTVLPPSRVKVLSAHGLIP ----------2222--------------1111-----1111------------1111--- AGIDHEIAEIMHRTSMGCDADAQNLLLGGLRCSLADLAGCYMGTDLADILFGTPAPVVTE --------------2222------------------------------------------ SNLGVLKADAVNVAVHGHNPVLSDIIVSVSKEMENEARAAGATGINVVGICCTGNEVLMR --11111111--------3333---------------1111------------------- HGIPACTHSVSQEMAMITGALDAMILDYQCIQPSVATIAECTGTTVITTMEMSKITGATH -------!!!!---------------------33333333---------1111-2222-- VNFAEEAAVENAKQILRLAIDTFKRRKGKPVEIPNIKTKVVAGFSTEAIINALSKLNAND ---3333----------------1111-------------------------33331111 PLKPLIDNVVNGNIRGVCLFAGCNNVKVPQDQNFTTIARKLLKQNVLVVATGCGAGALMR ------------------------33332222---------------------------- HGFMDPANVDELCGDGLKAVLTAIGEANGLGGPLPPVLHMGSCVDNSRAVALVAALANRL ----3333-----------------1111------------3333--------------- GVDLDRLPVVASAAEAMHEKAVAIGTWAVTIGLPTHIGVLPPITGSLPVTQILTSSVKDI --1111-----------3333---------------------1111---------3333- TGGYFIVELDPETAADKLLAAINERRAGLGLPW --------------------------1111--- >CAFFEOYL-COA O-METHYLTRAN; SWP:Q40313; PDB:1SUIA; KSLLQSDALYQYILETSVFPREHEAMKELREVTAKHPWNIMTTSADEGQFLSMLLKLINA -----3333--------------3333--------1111----------------1111- KNTMEIGVYTGYSLLATALAIPEDGKILAMDINKENYELGLPVIKKAGVDHKIDFREGPA ---------------------1111--------3333-----------3333-------- LPVLDEMIKDEKNHGSYDFIFVDADKDNYLNYHKRLIDLVKVGGVIGYDNTLWNGSVVAP 
---------3333---------------------3333--3333-----1111-1111-- PDAPLRKYVRYYRDFVLELNKALAVDPRIEICMLPVGDGITICRRIK -------------------------1111------------------ >CONE ARRESTIN; SWP:Q9PTE7; PDB:1SUJA; SKVYKKTCPNAKLSIYLGKRDFVDHVEHVEPVDGVVLIDPEYLKDRKVFVTLTCAFRYGR --------------------------------------33332222-------------- DDLDLIGMSFRKDLYSLATQVYPPETKEPLTPLQEKLMKKLGAHAYPFCFKMGTNLPCSV ------------------------------------------------------------ TLQPGPDDTGKSCGVDFEVKAFCAENLEEKIHKRNSVQLVIRKVQFAPANLGVAPKTEIT ----1111-----------------3333--3333------------------------- RQFMLSDRPLHLEASLDKEIYYHGEPINVNVKINNTTGKIVKKIKIIVEQVTDVVLFSLD ----%%%%-------------2222----------------------------------- KYVKTVCAEETNDTVAANSTLSKTFSVTPMLANNREKRGLALDGKLKHEDTNLASTTVIR -----------------------------3333---2222----1111------------ PGMDKEVLGILVSYKVKVHLVVARGGILGDLTSSDVAVELPLTLMHPKPSDDIIIEEFAR ------------------------------------------------------------ QKL --- >Phosphate transport syste; SWP:Q9X256; PDB:1SUMB; NRLLNEKVEEFKKGVLKAGWFIEKMFRNSISSLVERNESLAREVIADEEVVDQMEVEIQE -3333------------------------------------------------------- KAMEVLGLFSPIGKPLLTVTAGIRVAELIENIADKCHDIAKNVLELMEEPPLKPLEDIPA ---------------------------------------------3333----------- MANQTSEMLKFALRMFADVNVEKSFEVCRMDSKVDDLYEKVREELLLYMMESPKYVKRAL -------------------3333----------------------------3333----- LLLEIAGNIEIIADYATNIVEVSVYMVQGEAYKCYHDELLLFKKS ----------------------------------!!!!------- >PAPS REDUCTASE; SWP:P17854; PDB:1SUR; SKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVS -------11113333------3333-1111-----------------------1111--- LHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLW -------2222----------------------1111--------------------111 EQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLP 1----------------------------------1111---1111-----iiii---11 IIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTH 11-------------------3333---------- >DNA GYRASE SUBUNIT A; SWP:O51396; PDB:1SUUA; 
ENIVVMLTKKGFLKRLSQNEYKLQGTGGKGLSSFDLNDGDEIVIALCVNTHDYLFMISNE -------1111-----1111----------------%%%%--------1111-----111 GKLYLINAYEIKDQNISELINLGDQEEILTIKNSKDLTDDAYLLLTTASGKIARFESTDF 1-----3333----3333----1111-----------1111-----1111-----3333- KAGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCGMKLK ---------------------2222-----1111-----3333----2222--------2 EGDLFVKVLSVKENPYLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSVVDA 222-------!!!!---------------3333----2222------------------- IAVSEDDEILLVSKRSKALRTVAGKVSEQGKDARGIQVLFLDNDSLVSVSKFI ---1111-----1111-----3333----1111-------------------- ---------------------------------- >ETS DNA-BINDING PROTEIN P; SWP:Q01842; PDB:1SV0A; LPPSLPSDPRLWSREDVLVFLRFCVREFDLPKLDFDLFQMNGKRLCLLTRADFGHRCPGA -3333--3333--------------1111----3333---33331111------------ GDVLHNVLQMLIIESHS ----------------- >LD15796p; SWP:Q7K119; PDB:1SV0C; PLGSDGLPLDPRDWTRADVWKWLINMAVSEGLEVTAELPQKFPMNGKALCLMSLDMYLCR --1111---3333---------------------33331111--333311113333---- VPVGGKMLYRDFRVRLARAMSR ---3333--------------- >2-KETO-4-PENTENOATE HYDRA; SWP:P77608; PDB:1SV6A; MTKHTLEQLAADLRRAAEQGEAIAPLRDLIGIDNAEAAYAIQHINVQHDVAQGRRVVGRK -------------------------3333-11113333-----------1111------- VGLTHPKVQQQLGVDQPDFGTLFADMCYGDNEIIPFSRVLQPRIEAEIALVLNRDLPATD ---------1111---------3333--------1111---------------------- ITFDELYNAIEWVLPALEVVGSRIRDWSIQFVDTVADNASCGVYVIGGPAQRPAGLDLKN ------------------------------------%%%%------------2222---- CAMKMTRNNEEVSSGRGSECLGHPLNAAVWLARKMASLGEPLRTGDIILTGALGPMVAVN ------%%%%-----3333----------------1111---2222-------------2 AGDRFEAHIEGIGSVAATFSS 222-----2222--------- >TICK-BORNE ENCEPHALITIS V; SWP:P14336; PDB:1SVB; SRCTHLENRDFVTGTQGTTRVTLVLELGGCVTITAEGKPSMDVWLDAIYQENPAKTREYC 3333----------2222-------2222-----2222---------------------- LHAKLSDTKVAARCPTMGPATLAEEHQGGTVCKRDQSDRGWGNHCGLFGKGSIVACVKAA -------------1111----3333--------------3333----------------- CEAKKKATGHVYDANKIVYTVKVEPHTGDYVAANETHSGRKTASFTISSEKTILTMGEYG 
-2222-------1111---------------1111-1111-----1111------!!!!- DVSLLCRVASGVDLAQTVILELDKTVEHLPTAWQVHRDWFNDLALPWKHEGAQNWNNAER ------3333---1111-----1111-------------1111---------------11 LVEFGAPHAVKMDVYNLGDQTGVLLKALAGVPVAHIEGTKYHLKSGHVTCEVGLEKLKMK 11-----!!!!----------------2222-----!!!!-------------1111--- GLTYTMCDKTKFTWKRAPTDSGHDTVVMEVTFSGTKPCRIPVRAVAHGSPDVNVAMLITP 1111---1111----------------------------------2222----------- NPTIENNGGGFIEMQLPPGDNIIYVGELSHQWFQK ------------------------!!!!------- >RIBULOSE BISPHOSPHATE CAR; SWP:O85040; PDB:1SVDA; TYWMPEYTPLDSDILACFKITPQPGVDREEAAAAVAAESSTGTWTTVWTDLLTDMDYYKG ---1111--1111---------2222--------------------3333---3333--- RAYRIEDVPGDDAAFYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRGLRLEDVRFPL -------2222----------1111---------------11111111----------33 AYVKTCGGPPHGIQVERDKMNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKD 331111--------------------------------------------1111------ DENINSQPFMRWRDRFLFVQDATETAEAQTGERKGHYLNVTAPTPEEMYKRAEFAKEIGA 1111--1111-------------------------------------------------- PIIMHDYITGGFTANTGLAKWCQDNGVLLHIHRAMHAVIDRNPNHGIHFRVLTKILRLSG -----3333-----------------------2222-----1111--------------- GDHLHTGTVVGKLEGDRASTLGWIDLLRESFIPEDRSRGIFFDQDWGSMPGVFAVASGGI --------11111111-------------------1111------!!!!----------- HVWHMPALVNIFGDDSVLQFGGGTLGHPWGNAAGAAANRVALEACVEARNQGRDIEKEGK 3333----------------1111--1111-----------------------3333--- EILTAAAQHSPELKIAMETWKEIKF ------------------------- >Ribulose bisphosphate car; SWP:P45686; PDB:1SVDM; EMQDYKQSLKYETFSYLPPMNAERIRAQIKYAIAQGWSPGIEHVEVKNSMNQYWYMWKLP -----------2222-----------------1111--------3333------------ FFGEQNVDNVLAEIEACRSAYPTHQVKLVAYDNYAQSLGLAFVVYRGN 2222-3333-----------1111--------1111------------ >FUSION GLYCOPROTEIN; SWP:P04849; PDB:1SVFA; TAAVALVKANENAAAILNLKNAIQKTNAAVADVVQATQSLGTAVQAVQDHINSVVSPAIT ------------------------------------------------------------ AANY 1111 >GTP-BINDING PROTEIN YSXC; SWP:P38424; PDB:1SVIA; 
MKVTKSEIVISAVKPEQYPEGGLPEIALAGRSNVGKSSFINSLINRQTLNFYIINDELHF -------------3333-------------2222-------------------%%%%--- VDVPGYGFAKVSKSEREAWGRMIETYITTREELKAVVQIVDLRHAPSNDDVQMYEFLKYY -----------------------------3333-------3333-------------111 GIPVIVIATKADKIPKGKWDKHAKVVRQTLNIDPEDELILFSSETKKGKDEAWGAIKKMI 1--------3333-3333--------------1111----------------------11 NR 11 >POTASSIUM-TRANSPORTING AT; SWP:P03960; PDB:1SVJA; NRQASEFIPAQGVDEKTLADAAQLASLADETPEGRSIVILAKQRFNLRERDVQSLHATFV ---------2222---------3333----3333----------------3333------ PFTAQSRMSGINIDNRMIRKGSVDAIRRHVEANGGHFPTDVDQKVDQVARQGATPLVVVE ---1111----------------------------------------------------- GSRVLGVIALKDIVKG ---------------- >LARGE T ANTIGEN; SWP:Q9DH70; PDB:1SVMA; KQVSWKLVTEYAMETKCDDVLLLLGMYLEFQYSFEMCLKCIKKEQPSHYKYHEKHYANAA ---------------------------1111-3333--------3333---3333----- IFADSKNQKTICQQAVDTVLAKKRVDSLQLTREQMLTNRFNDLLDRMDIMFGSTGSADIE -1111-------------------------------------------1111-------- EWMAGVAWLHCLLPKMDSVVYDFLKCMVYNIPKKRYWLFKGPIDSGKTTLAAALLELCGG --------1111------------------2222-------------------------- KALNVNLPLDRLNFELGVAIDQFLVVFEDVKGTGGESRDLPSGQGINNLDNLRDYLDGSV -------1111-------2222-----------------------------3333----- KVNLEKKHLNKRTQIFPPGIVTMNEYSVPKTLQARFVKQIDFRPKDYLKHCLERSEFLLE ----------------------------33331111--------3333----------11 KRIIQSGIALLLMLIWYRPVAEFAQSIQSRIVEWKERLDKEFSLSVYQKMKFNVAMGIGV 111111------------3333-3333-------------------------------11 LD 11 >SINDBIS VIRUS CAPSID PROT; SWP:P27285; PDB:1SVPA; ALKLEADRLFDVKNEDGDVIGHALAMEGKVMKPLHVKGTIDHPVLSKLKFTKSSAYDMEF -------------1111--------iiii---3333-------3333-----3333---- AQLPVNMRSEAFTYTSEHPEGFYNWHHGAVQYSGGRFTIPRGVGGRGDAGRPIMDNSGRV ---3333-----------------1111----%%%%--------2222------1111-- VAIVLGGADEGTRTALSVVTWNSKGKTIKTTPEGTEEWSA ---------------------1111------2222----- >Guanine nucleotide-bindin; SWP:P10824; PDB:1SVSA; REVKLLLLGAGESGKSTIVKQMKIIHEAGYSEEECKQYKAVVYSNTIQSIIAIIRAMGRL 
---------2222----------------------------------------------- KIDFGDAARADDARQLFVLAGAAEEGFMTAELAGVIKRLWKDSGVQACFNRSREYQLNDS ------------------33331111------------------------3333---111 AAYYLNDLDRIAQPNYIPTQQDVLRTRVPTTGIVETHFTFKDLHFKMFDVGGQRSERKKW 1------3333-----------1111-------------%%%%---------33331111 IHCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTDTSIILFLNKK 1111----------1111----3333---------------11111111----------- DLFEEKIKKSPLTICYPEYAGSNTYEEAAAYIQCQFEDLNKRKDTKEIYTHFTCATDTKN ----3333--3333-1111----------------------1111--------1111--- VQFVFDAVTDVIIKNN ---------------- >GROEL PROTEIN; SWP:P06139; PDB:1SVTA; AAKDVKFGNDARVKMLRGVNVLADAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAREIE --------------------------1111--------------------3333-1111- LEDKFENMGAQMVKEVASKANDAAGDGTTTATVLAQAIITEGLKAVAAGMNPMDLKRGID --3333--------------------3333---------------1111-3333------ KAVTAAVEELKALSVPCSDSKAIAQVGTISANSDETVGKLIAEAMDKVGKEGVITVEDGT ---------------------------------3333----------------------- GLQDELDVVEGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAVA -------------------------3333------------------3333------111 KAGKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVI 1----------------------------------------3333--------------- SEEIGMELEKATLEDLGQAKRVVINKDTTTIIDGVGEEAAIQGRVAQIRQQIEEATSDYD 1111--3333-3333---------------------3333-----------1111--333 REKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDALHATRAAVEEGVVAGGGVALIR 3-----------------------3333------------------------%%%%---- VASKLADLRGQNEDQNVGIKVALRAMEAPLRQIVLNCGEEPSVVANTVKGGDGNYGYNAA --1111----------------1111--------1111-------------!!!!----- TEEYGNMIDMGILDPTKVTRSALQYAASVAGLMITTECMVTDLP -----3333-----------------------1111-------- >GROEL PROTEIN; SWP:P05380; PDB:1SVTO; MNIRPLHDRVIVKRKEVETKSAGGIVLTGSAAAKSTRGEVLAVGNGRILENGEVKPLDVK -----------------------------------------------2222--------- VGDIVIFNDGYGVKSEKIDNEEVLIMSESDILAIVEA ---------1111----%%%%-----3333------- >THREONINE ALDOLASE; SWP:O15839; PDB:1SVVA; 
PYSFVNDYSVGHPKILDLARDNTQHAGYGQDSHCAKAARLIGELLERPDADVHFISGGTQ -----------3333---1111----iiii----------------1111---------- TNLIACSLALRPWEAVIATQLGHISTHETGAIEATGHKVVTAPCPDGKLRVADIESALHE ----------1111----11111111-%%%%3333--------1111--3333------- NRSEHVIPKLVYISNTTEVGTQYTKQELEDISASCKEHGLYLFLDGARLASALSSPVNDL ----------------1111----------------------------------1111-- TLADIARLTDFYIGATKAGGFGEALIILNDALKPNARHLIKQRGALAKGWLLGIQFEVLK ----------------------------33332222----1111---------------% DNLFFELGAHSNKAAILKAGLEACGIRLAWPSASNQLFPILENTIAELNNDFDYTVEPLK %%%------------------1111---------------------------------11 DGTCIRLCTSWATEEKECHRFVEVLKRL 11-------11113333----------- >ANKYRIN REPEAT PROTEIN OF; SWP:P0AEX9; PDB:1SVXA; SDLGRKLLEAARAGQDDEVRILMANGADVNAADNTGTTPLHLAAYSGHLEIVEVLLKHGA ----------------------1111-1111-1111-3333--------------1111- DVDASDVFGYTPLHLAAYWGHLEIVEVLLKNGADVNAMDSDGMTPLHLAAKWGYLEIVEV -----1111-3333--------------1111-1111-1111-3333--1111------- LLKHGADVNAQDKFGKTAFDISIDNGNEDLAEILQKL ------1111-1111-3333----------------- >SEVERIN; SWP:P10733; PDB:1SVY; EYKPRLLHISGDKNAKVAEVPLATSSLNSGDCFLLDAGLTIYQFNGSKSSPQEKNKAAEV ----------!!!!--------1111-------------------1111----------- ARAIDAERKGLPKVEVFETDSDIPAEFWKLLGGKGAIAAKH -------iiii------------3333-1111--------- >TRIOSEPHOSPHATE ISOMERASE; SWP:P00940; PDB:1SW0A; APRKFFVGGNWKMNGDKKSLGELIHTLNGAKLSADTEVVCGAPSIYLDFARQKLDAKIGV --------------------------------1111------3333--------3333-- AAQNCYKVPKGAFTGEISPAMIKDIGAAWVILGHSERRHVFGESDELIGQKVAHALAEGL ------------2222-3333-1111-----------------------------1111- GVIACIGEKLDEREAGITEKVVFEQTKAIADNVKDWSKVVLAYEPVWAIGTGLWATPQQA ------------1111------------3333---1111-----3333------------ QEVHEKLRGWLKSHVSDAVAQSTRIIYGGSVTGGNCKELASQHDVDGFLVGGASLKPEFV -------------------------------3333------1111-----1111------ DIINAKH -1111-- >OSMOPROTECTION PROTEIN (P; SWP:O29280; PDB:1SW5A; ERVVIGSKPFNEQYILANMIAILLEENGYKAEVKEGLGGTLVNYEALKRNDIQLYVEYTG 
------------------------1111-------------------------------- TAYNVILRKQPPELWDQQYIFDEVKKGLLEADGVVVAAKLGFRDDYALAVRADWAEENGV -------------------------------------------------------1111- EKISDLAEFADQLVFGSDPEFASRPDGLPQIKKVYGFEFKEVKQMEPTLMYEAIKNKQVD -333333331111----------1111-------------------1111---------- VIPAYTTDSRVDLFNLKILEDDKGALPPYDAIIIVNGNTAKDEKLISVLKLLEDRIDTDT ----1111---1111-----1111------------3333-------------------- MRALNYQYDVEKKDAREIAMSFLKEQGLVK ------------------------------ >REGULATORY PROTEIN SWI6; SWP:P09959; PDB:1SW6A; GPIITFTHDLTSDFLSSPLKIMKALPSPVVNDNEQKMKLEAFLQRLLFSFDSLLQEVNDA ------1111-3333----------------3333------------------------- FPNTQLNLNIPVDEHGNTPLHWLTSIANLELVKHLVKHGSNRLYGDNMGESCLVKAVKSV 3333--------1111-3333--1111--------1111------1111-33331111-- NNYDSGTFEALLDYLYPCLILEDSMNRTILHHIIITSGMTGCSAAAKYYLDILMGWIVKK --1111-3333------1111-1111------------2222----------------33 QNRPIQSGDSILENLDLKWIIANMLNAQDSNGDTCLNIAARLGNISIVDALLDYGADPFI 33------3333-----------1111-1111-------------------1111----- ANKSGLRPVDFGAG ------3333---- >PHOSPHONOACETALDEHYDE HYD; SWP:O31156; PDB:1SWVA; KIEAVIFAWAGTTVDYGCFAPLEVFMEIFHKRGVAITAEEARKPMGLLKIDHVRALTEMP --------2222--22221111------3333---------1111-------------33 RIASEWNRVFRQLPTEADIQEMYEEFEEILFAILPRYASPINGVKEVIASLRERGIKIGS 33------------------------------3333---------------1111----- TTGYTREMMDIVAKEAALQGYKPDFLVTPDDVPAGRPYPWMCYKNAMELGVYPMNHMIKV ----3333--------1111-------3333---------------------1111---- GDTVSDMKEGRNAGMWTVGVILGSSELGLTEEEVENMDSVELREKIEVVRNRFVENGAHF --3333----1111------22223333-------------------------1111--- TIETMQELESVMEHIEK ---3333---------- >TYPE II RESTRICTION ENZYM; SWP:P04390; PDB:1SX5A; SLRSDLINALYDENQKYDVCGIISAEGKIYPLGSDTAVLSTIFELFSRPIINKIAEKHGY -----------------------1111----------------------------1111- IVEEPKQQNHYPDFTLYKPSEPNKKIAIDIKTTYTNKENEKIKFTLGGYTSFIRNNTKNI -----------------1111---------------2222----------3333------ VYPFDQYIAHWIIGYVYTRVATRKSSLKTYNINELNEIPKPYKGVKVFLQDKWVIAGDLA 
--1111------------------------1111----------------3333------ GSGNTTNIGSIHAHYKDFVEGKGIFDSEDEFLDYWRNYERTSQLRNDKYNNISEYRNWIY --1111-------3333-----------------------3333---------------- RGRK ---- >LYSOZYME; SWP:P00720; PDB:1SX7A; MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNCNGVITK -------------------1111------------------------------iiii--- DEAEKLFNQDVAAAVRGILRNAKLKPVYDSLDAVRECALINMVFQMGETGVAGFTNSLRM ---------------------------33333333------------------------- LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYKNL 1111-------3333----------------------1111--- >GA REPEAT BINDING PROTEIN; SWP:Q00422; PDB:1SXDA; GSHMAALEGYRKEQERLGIPYDPIHWSTDQVLHWVVWVMKEFSMTDIDLTTLNISGRELC ---3333------1111----3333------------3333-------1111--333311 SLNQEDFFQRVPRGEILWSHLELLRKYVLAS 11----------------------------- >TRANSCRIPTIONAL REGULATOR; SWP:P11308; PDB:1SXEA; GSHMEEKHMPPPNMTTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNID ----3333----------------------3333-----------------11111111- GKELCKMTKDDFQRLTPSYNADILLSHLHYLRETPLP ---33333333-----!!!!----------------- >Replication factor C subu; SWP:P40339; PDB:1SXJB; LQLPWVEKYRPQVLSDIVGNKETIDRLQQIAKDGNMPHMIISGMPGIGKTTSVHCLAHEL ---3333-----3333--------------------------------3333-------- LGRSYADGVLELNASDDRGIDVVRNQIKHFAQKKLHLPPGKHKIVILDEADSMTAGAQQA !!!!--------1111-------------1111---------------1111-------- LRRTMELYSNSTRFAFACNQSNKIIEPLQSQCAILRYSKLSDEDVLKRLLQIIKLEDVKY -3333--------------3333-33331111---------------------------- TNDGLEAIIFTAEGDMRQAINNLQSTVAGHGLVNADNVFKIVDSPHPLIVKKMLLASNLE -----------iiii--------------------------------------------- DSIQILRTDLWKKGYSSIDIVTTSFRVTKNLAQVKESVRLEMIKEIGLTHMRILEGVGTY ---------3333--3333-----------3333-------------------------- LQLASMLAKIHKLNNK ---------------- >Replication factor C subu; SWP:P38629; PDB:1SXJC; NLPWVEKYRPETLDEVYGQNEVITTVRKFVDEGKLPHLLFYGPPGTGKTSTIVALAREIY -----------1111---3333-------1111--------------------------! 
GKNYSNMVLELNASDDRGIDVVRNQIKDFASTRQIFSKGFKLIILDEADAMTNAAQNALR !!!1111----3333-------------1111--------------1111---------- RVIERYTKNTRFCVLANYAHKLTPALLSQCTRFRFQPLPQEAIERRIANVLVHEKLKLSP -----------------3333-33331111------------------------------ NAEKALIELSNGDMRRVLNVLQSCKATLDNPDEDEISDDVIYECCGAPRPSDLKAVLKSI ---------iiii------1111------------------------------------- LEDDWGTAHYTLNKVRSAKGLALIDLIEGIVKILEDYELQNEETRVHLLTKLADIEYSIS ---3333--------------3333--------1111---3333---------------- KGGNDQIQGSAVIGAIKASFEN ----------------3333-- >Replication factor C subu; SWP:P40348; PDB:1SXJD; PWVEKYRPKNLDEVTAQDHAVTVLKKTLKSANLPHMLFYGPPGTGKTSTILALTKELYGP ---------3333-------------1111------------------------------ DLMKSRILELNASDERGISIVREKVKNFARLTVSKPSKHDLENYPCPPYKIIILDEADSM ----------3333--3333----------------3333---------------3333- TADAQSALRRTMETYSGVTRFCLICNYVTRIIDPLASQCSKFRFKALDASNAIDRLRFIS --------3333--3333--------3333-3333------------3333--------- EQENVKCDDGVLERILDISAGDLRRGITLLQSASKGAQYLGDGKNITSTQVEELAGVVPH ----------3333-3333-------------1111-3333------3333--------3 DILIEIVEKVKSGDFDEIKKYVNTFMKSGWSAASVVNQLHEYYITNDNFDTNFKNQISWL 333--------------------------------------3333----3333------- LFTTDSRLNNGTNEHIQLLNLLVKISQL -----3333------------------- >Replication factor C subu; SWP:P38251; PDB:1SXJE; WVDKYRPKSLNALSHNEELTNFLKSLSDQPRDLPHLLLYGPNGTGKKTRCMALLESIFGP 3333----1111---3333----1111-1111-------------33331111------- GVYRNVVSSPYHLEITPSNDRIVIQELLKEVAQMEQRYKCVIINEANSLTKDAQAALRRT -------------------3333--------------------------3333------- MEKYSKNIRLIMVCDSMSPIIAPIKSQCLLIRCPAPSDSEISTILSDVVTNERIQLETKD ---3333-------------3333------------3333----3333------------ ILKRIAQASNGNLRVSLLMLESMALNNELALKSSSPIIKPDWIIVIHKLTRKIVKERSVN -----------3333---1111----%%%%------------------------------ SLIECRAVLYDLLAHCIPANIILKELTFSLLDVETLNTTNKSSIIEYSSVFDERLSLGNK -----------------3333--------1111-------------------3333---3 AIFHLEGFIAKVMCCLD 333-------------- >NOXIUSTOXIN; SWP:P08815; PDB:1SXM; 
TIINVKCTSPKQCSKPCKELYGSSAGAKCMNGKCKCYNN --------1111--------------------------- >PEPTIDOGLYCAN RECOGNITION; SWP:Q9VYX7; PDB:1SXRA; CPTIKLKRQWGGKPSLGLHYQVRPIRYVVIHHTVTGECSGLLKCAEILQNMQAYHQNELD -----3333------------------------------3333----------------- FNDISYNFLIGNDGIVYEGTGWGLRGAHTYGYNAIGTGIAFIGNFVDKLPSDAALQAAKD ----------1111------2222----22221111------------------------ LLACGVQQGELSEDYALIAGSQVISTQSPGLTLYNEIQEWPHWLSNPHHHHHH ------------------3333-----------------2222---------- >INORGANIC PYROPHOSPHATASE; SWP:O06379; PDB:1SXVA; HMQFDVTIEIPKGQRNKYEVDHETGRVRLDRYLYTPMAYPTDYGFIEDTLGDDGDPLDAL ----------2222------------------------------------1111------ VLLPQPVFPGVLVAARPVGMFRMVDEHGGDDKVLCVPAGDPRWDHVQDIGDVPAFELDAI -------2222-------------3333-----------3333----3333--------- KHFFVHYKDLEPGKFVKAADWVDRAEAEAEVQRSVERFKA -----1111---------------------------3333 >OKT3 FAB LIGHT CHAIN; SWP:P09693; PDB:1SY6A; MQSIKGNHLVKVYDYQEDGSVLLTCDAEAKNITWFKDGKMIGFLTEDKKKWNLGSNAKDP ---2222-----------------------------------------------3333-- RGMYQCKGSQNKSKPLQVYYRMQTPYKVSISGTTVILTCPQYPGSEILWQHNDKNIGGDE --------------------------------------------------%%%%------ DDKNIGSDEDHLSLKEFSELEQSGYYVCYPRGSKPEDANFYLYLRARV -1111--!!!!------3333--------22223333----------- >T-cell surface glycoprote; SWP:P09693; PDB:1SY6H; QVQLQQSGAELARPGASVKMSCKASGYTFTRYTMHWVKQRPGQGLEWIGYINPSRGYTNY ------------2222-----------1111--------2222----------------- NQKFKDKATLTTDKSSSTAYMQLSSLTSEDSAVYYCARYYDDHYCLDYWGQGTTLTVSSA 3333--------3333----------3333---------3333----------------- KTTAPSVYPLAPVCGGTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDL ---------------------------------------%%%%--2222----------- YTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR ---------1111-----------3333----------- >CATALASE 1; SWP:Q9C168; PDB:1SY7A; ARLTTDYGVKQTTADDWLRIVSDDKIGPSLLEDPFARERIMRFDHERIPERVVHARGSGA ----1111-------------1111----3333---------1111-------------- FGKFKVYESASDLTMAPVLTDTSRETPVFVRFSTVLGSRGSADTVRDVRGFAVKFYTEEG 
---------3333--3333------------------1111---------------1111 NWDLVGNNIPVFFIQDAIKFPDVIHAGKPEPHNEVPQAQSAHNNFWDFQFNHTEATHMFT ---------------3333--------------------------------3333----- WAMSDRAIPRSLRMMQGFGVNTYTLINAQGKRHFVKFHWTPELGVHSLVWDEALKLAGQD ---3333---1111------------1111----------1111----------3333-1 PDFHRKDLWEAIENGAYPKWKFGIQAIAEEDEHKFDFDILDATKIWPEDLVPVRYIGEME 111--------1111------------3333------1111-----1111---------- LNRNPDEFFPQTEQIAFCTSHVVNGIGFSDDPLLQGRNFSYFDTQISRLGVNFQELPINR -----------------3333-2222-----------------------1111--3333- PVCPVMNFNRDGAMRHTISRGTVNYYPNRFDACPPASLKEGGYLEYAQKVAGIKARARSA ---------------------------1111-----3333------------------33 KFKEHFSQAQLFYNSMSPIEKQHMINAFGFELDHCEDPVVYGRMVQRLADIDLGLAQTIA 33----------1111--------------------3333-------------------- EMVGGEAPTTTNHPNHGRKTINLSQTEFPPATPTIKSRRVAIIIADGYDNVAYDAAYAAI -----------------------3333-------2222------2222------------ SANQAIPLVIGPRRSKVTAANGSTVQPHHHLEGFRSTMVDAIFIPGGAKAAETLSKNGRA ------------------1111-------1111-3333---------------------- LHWIREAFGHLKAIGATGEAVDLVAKAIALPQVTVSSEAEVHESYGVVTLKKVKPESFTD -----------------------------3333----------iiii------1111--- AVKIAKGAAGFLGEFFYAIAQHRNWDRELDGLHSMIAY ------------------------------3333---- >THIOREDOXIN; SWP:Q7KQL8; PDB:1SYRA; MVKIVTSQAEFDSIISQNELVIVDFFAEWCGPCKRIAPFYEECSKTYTKMVFIKVDVDEV --------------------------1111----------------3333-----3333- SEVTEKENITSMPTFKVYKNGSSVDTLLGANDSALKQLIEKYA ------------------iiii--------------------- > cytoplasmic tail-binding; SWP:O95400; PDB:1SYXB; DVMWEYKWENTGDAELYGPFTSAQMQTWVSEGYFPDGVYCRKLDPPGGQFYNSKRIDFDL ----------------------------1111-1111-------3333---3333-3333 YT -- >RIBONUCLEOSIDE-DIPHOSPHAT; SWP:O84835; PDB:1SYYA; QADILDGKQKRVNLNSKRLVNCNQVDVNQLVPIKYKWAWEHYLNGCANNWLPTEIPMGKD ---33331111-1111------------------3333-------1111-1111------ IELWKSDRLSEDERRVILLNLGFFSTAESLVGNNIVLAIFKHVTNPEARQYLLRQAFEEA --------------------------------------3333------------------ 
VHTHTFLYICESLGLDEKEIFNAYNERAAIKAKDDFQMEITGKVLDPNFRTDSVEGLQEF ------------------3333-----3333-----3333-33331111----------- VKNLVGYYIIMEGIFFYSGFVMILSFHRQNKMIGIGEQYQYILRDETIHLNFGIDLINGI --------------------------1111------------------------------ KEENPEIWTPELQQEIVELIKRAVDLEIEYAQDCLPRGILGLRASMFIDYVQHIADRRLE ---3333-------------------------------iiii3333-------------1 RIGLKPIYHTKNPFPWM 111----------3333 >GLUCOKINASE; SWP:P46880; PDB:1SZ2A; KYALVGDVGGTNARLALCDIASGEISQAKTYSGLDYPSLEAVIRVYLEEHKVEVKDGCIA --------!!!!-------------------3333------------------------- IACPITGDWVATNHTWAFSIAEKKNLGFSHLEIINDFTAVSAIPLKKEHLIQFGGAEPVE ------------------3333-1111------------------3333---------22 GKPIAVYGAGTGLGVAHLVHVDKRWVSLPGEGGHVDFAPNSEEEAIILEILRAEIGHVSA 22--------------------------------------------------------33 ERVLSGPGLVNLYRAIVKADNRLPENLKPKDITERALADSCTDCRRALSLFCVIGRFGGN 33---------------1111------3333---------3333---------------- LALNLGTFGGVFIAGGIVPRFLEFFKASGFRAAFEDKGRFKEYVHDIPVYLIVHDNPGLL ------3333------3333----------------!!!!-1111--------------- GSGAHLRQTLGHIL -------1111--- >RRNA N-GLYCOSIDASE A CHAI; SWP:P81446; PDB:1SZ6A; YERLSLRTVQQTTGAEYFSFITLLRDFVSSGSFSNNIPLLRQSTIPVSEGSRFVLVELTN --------1111---------------------iiii--------1111----------1 AGGDSITAAIDVTNLYVVAYQAGQQSYFLKDAPAGAETQDFAGTTRSSLPFNGSYPDLER 111-------------------------22222222------------------------ YAGHRDQIPLGIDQLIASVTALRFPGGSTRTQARSILILIQMISEAARFNPILWRARQYI ---1111----------------------------------------------------- NSGASFLPDVYMLELETSWGQQSTQVQHSTDGVFNNPIALALPPGNVVTLTNIRDVIASL -----------------------------iiii--------3333------33331111- AIMLFVCGE --------- >TRAFFICKING PROTEIN PARTI; SWP:O43617; PDB:1SZ7A; KMSSELFTLTYGALVTQLCKDYENDEDVNKQLDKMGFNIGVRLIEDFLARSNVGRCHDFR -----------------------------------------------1111--------- ETADVIAKVAFKMYLGITPSITNWSPAGDEFSLILENNPLVDFVELPDNHSSLIYSNLLC ------------------------3333---------1111-----3333---1111--- GVLRGALEMVQMAVEAKFVQDTLKGDGVTEIRMRFIRRI 
--------------------3333--------------- >PCF11 PROTEIN; SWP:P39081; PDB:1SZ9A; DHDTEVIVKDFNSILEELTFNSRPIITTLTKLAEENISCAQYFVDAIESRIEKCMPKQKL --------------1111-----------------3333----------------3333- YAFYALDSICKNVGSPYTIYFSRNLFNLYKRTYLLVDNTTRTKLINMFKLWLNPNDTGLP -------------------3333---------1111-----------------%%%%--- LFEGSALEKIEQFLIKASAAALE --------------1111----- >mannose binding lectin-as; SWP:O00187; PDB:1SZBA; PLGPKWPEPVFGRLASPGFPGEYANDQERRWTLTAPPGYRLRLYFTHFDLELSHLCEYDF ---------------2222----------------2222--------------------- VKLSSGAKVLATLCGQESTDTERAPGKDTFYSLGSSLDITFRSDYSNEKPFTGFEAFYAA ---------------3333-----!!!!-------------------------------- EDIDECQVAPGEAPTCDHHCHNHLGGFYCSCRAGYVLHRNKRTCSEQ --------2222----------2222-----2222--1111------ >HER-1 PROTEIN; SWP:P34704; PDB:1SZHA; STLTKELIKDAAEKCCTRNRQECCIEIMKFGTPIRCGYDRDPKLPGYVYKCLQNVLFAKE ----------------1111--------------%%%%--1111------------1111 PKKKINLDDSVCCSVFGNDQEDSGRRCENRCKNLMTSPSIDAATRLDSIKSCSLLDNVLY 1111-3333-----------3333------------1111----------1111------ KCFEKCRSLRKDGIKIEVLQFEEYCEA --------------3333-3333---- >MANNOSE-6-PHOSPHATE RECEP; SWP:Q9DBG5; PDB:1SZIA; NRLPLTEAELALIATPPEDSDMASLQQQRQEQNYFVRLGSLSERLRNHAYEHSLGKLQNA --------------------3333------------3333-------------------- RQKAQETLQQLTSVLGLMESVKQAKPEQVEARALSMFRDITQQLQSMCVALGASIQGLPS -------------------3333--3333-------------------------222233 HVREQAQQARSQVNDLQATFSGIHSFQDLSAGVLAQTRERIARAREALDNTVEYVAQNTP 33----------------------3333--------------------------1111-1 AMWLVGPFAPGITE 111----------- >F-SPONDIN; SWP:P35446; PDB:1SZLA; GSETCIYSNWSPWSACSSSTCEKGKRMRQRMLKAQLDLSVPCPDTQDFQPCMGPGCSDED ------------------------------------1111-------------------- G - >ALPHA-GALACTOSIDASE; SWP:Q92456; PDB:1SZNA; IVMPDGVTGKVPSLGWNSWNAYHCDIDESKFLSAAELIVSSGLLDAGYNYVNIDDCWSMK --11112222-----------!!!!-3333--------11113333-------------- DGRVDGHIAPNATRFPDGIDGLAKKVHALGLKLGIYSTAGTATCAGYPASLGYEDVDAAD ---iiii----3333-----------1111------------1111---2222------- 
FADWGVDYLKYDNCNVPSDWQDEYVACNPDFVKTGPNGTCTTALDPTLAPPGYDWSTSKS -1111-----------1111-------1111---2222--33331111-22223333--- AERFGAMRNALAKQSHEIVLSMCIWGQADVFSWGNSTGISWRMSDDISPNWGSVTRILNL ----------1111---------iiii-33333333------------------------ NSFKLNSVDFWGHNDADMLEVGNGNLTAAETRTHFALWAAMKSPLLIGTDLAQLSQNNIN ---3333-2222-------2222--------------------------3333------- LLKNKHLLAFNQDSVYGQPATPYKWGINPDWTFNVTYPAEFWAGPSSKGHLVLMVNTLDI ---------------------------------------------1111----------- TATKEAKWNEIPGLSAGHYEVRDVWSDKDLGCLSSYKAAVAAHDTAVILVGKKCQRW ------33332222--------------------------2222------------- >6-OXOCAMPHOR HYDROLASE; SWP:Q93TU6; PDB:1SZOA; LATPFQEYSQKYENIRLERDGGVLLVTVHTEGKSLVWTSTAHDELAYCFHDIACDRENKV ---333311111111----iiii------iiii---------------------1111-- VILTGTGPSFCNEIDFTSFNLGTPHDWDEIIFEGQRLLNNLLSIEVPVIAAVNGPVTNAP --------------3333----------------------1111---------------- EIPVMSDIVLAAESATFQDGPHFPSGIVPGDGAHVVWPHVLGSNRGRYFLLTGQELDART --1111-----1111---11111111-----3333------------------------- ALDYGAVNEVLSEQELLPRAWELARGIAEKPLLARRYARKVLTRQLRRVMEADLSLGLAH -----------3333-----------1111------------------------------ EALAAIDLG ----3333- >DNA REPAIR PROTEIN RAD51; SWP:P25454; PDB:1SZPA; MVPIEKLQVNGITMADVKKLRESGLHTAEAVAYAPRKDLLEIKGISEAKADKLLNEAARL --3333------3333----3333----------33333333---3333---33333333 VPMGFVTAADFHMRRSELICLTTGSKNLDTLLGGGVETGSITELFGEFRTGKSQLCHTLA -------3333--3333-------33333333-------------------3333----- VTCQIPLDIGGGEGKCLYIDTEGTFRPVRLVSIAQRFGLDPDDALNNVAYARAYNADHQL -----3333---------------------33331111-----1111------------- RLLDAAAQMMSESRFSLIVVDSVMALYRTDELSARQMHLAKFMRALQRLADQFGVAVVVT ---------------------1111----------------------------------- NQVVTGGNIMAHSSTTRLGFKKGKGCQRLCKVVDSPCLPEAECVFAIYEDGVGDP -----------------------------------------------1111---- >2-METHYLCITRATE DEHYDRATA; SWP:P77243; PDB:1SZQA; EFDREIVDIVDYVNYEISSKVAYDTAHYCLLDTLGCGLEALEYPACKKLLGPIVPGTVVP ---------------------------------------1111---1111---2222--- 
NGVRVPGTQFQLDPVQAAFNIGAIRWLDFNDTWLAAEWGHPSDNLGGILATADWLSRNAV ----2222---------------------------------------------------1 ASGKAPLTKQVLTAIKAHEIQGCIALENSFNRVGLDHVLLVKVASTAVVAELGLTREEIL 111-------3333---------------3333--------------------------- NAVSLAWVDGQSLRTYRHAPNTGTRKSWAAGDATSRAVRLALAKTGEGYPSALTAPVWGF -------------1111-------------------------1111-------------- YDVSFKGESFRFQRPYGSYVENVLFKISFPAEFHSQTAVEAATLYEQQAAGKTAADIEKV ----iiii-----------------------1111------------1111-1111---- TIRTHEACIRIIDKKGPLNNPADRDHCIQYVAIPLLFGRLTAADYEDNVAQDKRIDALRE --------------------------3333--3333----3333-3333----------- KINCFEDPAFTADYHDPEKRAIANAITLEFTDGTRFEEVVVEYPIGHARRRQDGIPKLVD ---------------1111----------3333----------1111------------- KFKINLARQFPTRQQQRILEVSLDRARLEQPVNEYLDLYVI -----------------------3333---33333333--- >TRNA PSEUDOURIDINE SYNTHA; SWP:Q57261; PDB:1SZWA; IEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADA -3333----------------3333----------------------------------- LAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEPDLSAFQLEGCQVLEYARHKRKLR --1111-3333--------------------------3333--2222------------2 LGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQRN 222--------------3333-----------------3333-2222------------- KRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVN --------------------1111-1111-2222---2222------3333-------11 DKELITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRALLYPQQLSWN 11--------------------------1111-------1111----------------- WWDDVTVEIRFWLPAGSFATSVVRELINTT -------------222233333333----- >THIOREDOXIN; SWP:P52230; PDB:1T00A; HMAGTLKHVTDDSFEQDVLKNDKPVLVDFWAAWCGPCRQIAPSLEAIAAEYGDKIEIVKL ---------3333----1111---------1111----------------3333------ NIDENPGTAAKYGVMSIPTLNVYQGGEVAKTIVGAKPKAAIVRDLEDFIAD 3333-----1111----------iiii-----------------3333--- >HYPOTHETICAL PROTEIN; SWP:Q81BA8; PDB:1T06A; MDFKTVMQELEALGKERTKKIYISNGAHEPVFGVATGAMKPIAKKIKLNQELAEELYATG ----------------------1111--------3333----1111-------------- 
NYDAMYFAGIIADPKAMSESDFDRWIDGAYFYMLSDYVVAVTLSESNIAQDVADKWIASG --------11113333---------1111-3333--------1111-------------- DELKMSAGWSCYCWLLGNRKDNAFSESKISDMLEMVKDTIHHSPERTKSAMNNFLNTVAI -------------------3333---------------3333-3333------------- SYVPLHEKAVEIAKEVGIVEVKRDNKKSSLLNASESIQKELDRGRLGFKRKYVRC -3333-----------------------------------11112222------- >HYPOTHETICAL UPF0269 PROT; SWP:Q9HU36; PDB:1T07A; HSRTVCRKYHEELPGLDRPPYPGAKGEDIYNNVSRKAWDEWQKHQTLINERRLNNAEDRK -----3333---------------------------------------1111-------- FLQQEDKFLSGEDY ------1111---- >2C-methyl-D-erythritol 2,; SWP:Q8EBR3; PDB:1T0AA; KIRIGHGFDVHKFGEPRPLILCGVEVPYETGLVAHSDGDVVLHAISDAILGAALGDIGKH --------------------iiii-------------------------------3333- FPDTDAAYKGADSRVLLRHCYALAKAKGFELGNLDVTIIAQAPKAPHIEDRQVLAADLNA -11111111-----------------------------------1111-------1111- DVADINVKATTTEKLGFTGRKEGIAVEAVVLLSRQ 3333-------%%%%3333---------------- >THUA-LIKE PROTEIN; SWP:Q5KY38; PDB:1T0BA; TPIRVVVWNEFRHEKKDEQVRAIYPEGMHTVIASYLAEAGFDAATAVLDEPEHGLTDEVL -----------3333--------1111---------1111---------2222------1 DRCDVLVWWGHIAHDEVKDEVVERVHRRVLEGMGLIVLHSGHFSKIFKKLMGTTCNLKWR 111---------3333------------1111---------------------------- EADEKERLWVVAPGHPIVEGIGPYIELEQEEMYGEFFDIPEPDETIFISWFEGGEVFRSG -----------11111111-------------------------------1111------ CTFTRGKGKIFYFRPGHETYPTYHHPDVLKVIANAVRWAAPVNRGEIVFGNVKPLEPIKA ----!!!!--------11111111------------------------------------ >INSULIN; SWP:P01308; PDB:1T0CA; EAEDLQVGQVELGGGPGAGSLQPLALEGSLQ -1111---------------3333--iiii- >Transposon Tn7 transposit; SWP:P05846; PDB:1T0FC; IKVVKPSDWDSLPDTDLRYIYSQRQPEKTMHERLKGKGVIVDMASLFKQ ----333311111111----11111111------1111---3333---- >CYTOCHROME B5 DOMAIN-CONT; SWP:NA; PDB:1T0GA; MGHHHHHHLEEFTAEQLSQYNGTDESKPIYVAIKGRVFDVTTGKSFYGSGGDYSMFAGKD ------------3333----------------iiii---3333-------1111------ ASRALGKMSKNEEDVSPSLEGLTEKEINTLNDWETKFEAKYPVVGRVVS ----------3333----------------------1111--------- >VOLTAGE-GATED CALCIUM 
CHA; SWP:Q8CC27; PDB:1T0HA; RREAERQAQAQLEKAKTKPVAFAVRTNVRYSAAQEDDVPVPGAISFEAKDFLHVKEKFNN ------------1111----------------1111----------2222--------11 DWWIGRLVKEGCEIGFIPSPVKLENRLQHEQRAK 11------2222------------------3333 >Voltage-dependent L-type ; SWP:Q8VGC3; PDB:1T0HB; RPFTPPYDVVPSRPVVLVGPSLKGYEVTDQKALFDFLKHRFEGRISITRVTADISLAKAI -------------------------3333---------1111----------3333---- IERSNTRSSLAEVQSEIERIFELARTLQLVVLDADTINHPAQLSKTSLAPIIVYVKISSP ---------------------1111--------1111---1111---------------- KVLQRLIKSRHLNVQVAADKLAQCPPQESFDVILDENQLEDACEHLADYLEAYWKATHPP ------1111-3333-----11113333-------------------------------- RT -- >YLR011WP; SWP:Q07923; PDB:1T0IA; MKVGIIMGSVRAKRVCPEIAAYVKRTIENSEELIDQKLKIQVVDLQQIALPLYEDDDELI --------------3333--------11111111---------3333------------3 PAQIKSVDEYADSKTRSWSRIVNALDIIVFVTPQYNWGYPAALKNAIDRLYHEWHGKPAL 333--1111-------------------------%%%%--------33333333------ VVSYGGHGGSKCNDQLQEVLHGLKMNVIGGVAVKIPVGTIPLPEDIVPQLSVHNEEILQL -----------------------------------2222---11111111---------- LASCI -3333 >ISOCITRATE DEHYDROGENASE ; SWP:O75874; PDB:1T0LA; MSKKISGGSVVEMQGDEMTRIIWELIKEKLIFPYVELDLHSYDLGIENRDATNDQVTKDA ----------------3333----------3333-------------------------- AEAIKKHNVGVKCATITPDEKRVEEFKLKQMWKSPNGTIRNILGGTVFREAIICKNIPRL -----------------------------------------------------1111--- VSGWVKPIIIGRHAYGDQYRATDFVVPGPGKVEITYTPSDGTQKVTYLVHNFEEGGGVAM 1111-----------!!!!------------------1111------------------- GMYNQDKSIEDFAHSSFQMALSKGWPLYLSTKNTILKKYDGRFKDIFQEIYDKQYKSQFE -------------------------------3333------------------------1 AQKIWYEHRLIDDMVAQAMKSEGGFIWACKNYDGDVQSDSVAQGYGSLGMMTSVLVCPDG 111-------------------------------------------1111------3333 KTVEAEAAHGTVTRHYRMYQKGQETSTNPIASIFAWTRGLAHRAKLDNNKELAFFANALE -----------3333--------------------------------------------- EVSIETIEAGFMTKDLAACIKGLPNVQRSDYLNTFEFMDKLGENLKIKLAQAKL ------1111--3333-----3333-3333------------------------ >Intercellular adhesion mo; SWP:P32942; PDB:1T0PB; 
QEFLLRVEPQNPVLSAGGSLFVNCSTDCPSSEKIALETSLSKELVASGMGWAAFNLSNVT -----------------------------------------------2222--------- GNSRILCSVYCNGSQITGSSNITVYG ----------iiii------------ >UPF0447 protein GK3416; SWP:Q5KUD5; PDB:1T0TV; QTLDGWYCLHDFRTIDWSAWKTLPNEEREAAISEFLALVDQWETTESEKQGSHAVYTIVG -------------------1111----------------------1111----------- QKADILFMILRPTLDELHEIETALNKTKLADYLLPAYSYVSVVELSNYLASGSEDPYQIP --------------------------3333------------------------333333 EVRRRLYPILPKTNYICFYPMDKRRQGNDNWYMLSMEQRRELMRAHGMTGRKYAGKVTQI 33-----------------------!!!!1111---------------33332222---- ITGSVGLDDFEWGVTLFSDDALQFKKLVYEMRFDEVSARFGEFGSFFVGTRLPMENVSSF ---2222---------------------------3333--------------3333---- FHV --- >TUBULIN FOLDING COFACTOR ; SWP:Q20728; PDB:1T0YA; MTEVYDLEITTNATDFPMEKKYPAGMSLNDLKKKLELVVGTTVDSMRIQLFDGDDQLKGE ----------------------1111---------------3333------3333----- LTDGAKSLKDLGVRDGYRIHAVDVTGGNED -----------------------1111--- >INSECT NEUROTOXIN; SWP:O77091; PDB:1T0ZA; KKNGYAVDSSGKVAECLFNNYCNNECTKVYYADKGYCCLLKCYCFGLADDKPVLDIWDST -------1111--------------------------%%%%------1111--------- KNYCDVQIIDLS ------------ >GLUCOSE-6-PHOSPHATE ISOME; SWP:P42861; PDB:1T10A; SLLNLPAWKRLQSLYEKYGNDSILSHFEKDHQRFQRYSIEIDLHSDDNFLFLDYSKSHIN 3333-------------11113333-3333-3333---------%%%%------------ DEIKDALVALAEERGVRAFAKAMFDGQRVNSTENRAVLHVALRNRSNRPIIVDGKDVMSD ----------------------1111----1111---3333--1111----%%%%----- VNNVLAQMKDFTERVRSGEWKGQTGKSIYNIVNIGIGGSDLGPVMVTEALKPFSKRDLHC --------------3333---1111----------!!!!----------3333-3333-- FFVSNVDGTHMAEVLKQVNLEETIFIIASKTFTTQETLTNAMSARNALMSYLKENGISTD -------------1111-3333------3333---3333-------------1111--22 GAVAKHFVALSTNTEKVREFGIDTVNMFAFWDWVGGRYSVWSAIGLSVMLSIGYDNFVEF 221111-----------3333-3333----1111111133331111-------------- LTGAHVMDNHFASTPTEQNLPMMLALVGIWYNNFFGSETQAVLPYDQYLWRLPAYLQQLD --------------3333---------------------------3333----------- MESNGKGVTKKSGAVAVQTGPIVFGEAGTNGQHAFYQLIHQGTKIIPCDFIGCVQTQNRV 
--------3333-----------------3333--------------------------- GDHHRTLMSNFFAQTEALMVGKNAEEVRQELVKSGMSGDAIENMIPHKTFTGSRPSNSIL -------------------------------1111-----11113333------------ VNALTPRALGAIIAMYEHKVLVQGAIWGINSYDQWGVELGKVLAKSILPQLKSGNIVSDH ----------------------------------1111--------3333-2222----- DGSTNGLINMFNTRAH ---------------- >NONSPECIFIC LIPID-TRANSFE; SWP:Q42952; PDB:1T12A; AITCGQVTSNLAPCLAYLRNTGPLGRCCGGVKALVNSARTTEDRQIACTCLKSAAGAISG --3333-----------------!!!!------3333----------------------- INLGKAAGLPSTCGVNIPYKISPSTDCSKVQ ---------3333-------------1111- >BREAST CANCER TYPE 1 SUSC; SWP:P38398; PDB:1T15A; RMSMVVSGLTPEEFMLVYKFARKHHITLTNLITEETTHVVMKTDAEFVCERTLKYFLGIA ---------3333-------------------1111-------1111-----------11 GGKWVVSYFWVTQSIKERKMLNEHDFEVRGDVVNGRNHQGPKRARESQDRKIFRGLEICC 11------------1111---3333---------------------11111111------ YGPFTNMPTDQLEWMVQLCGASVVKELSSFTLGTGVHPIVVVQPDAWTEDNGFHAIGQMC -------1111-----1111-----3333---1111------3333----1111--1111 EAPVVTREWVLDSVALYQCQELDTYLIPQIP ----------------------1111----- >CONSERVED HYPOTHETICAL PR; SWP:Q9A7I7; PDB:1T17A; MHRHVVTKVLPYTPDQLFELVGDVDAYPKFVPWITGMRTWNGRVDGAVSTVDAEAQVGFS ---------------------3333-----3333-------------------------- FLREKFATRVRRDKDARSIDVSLLYGPFKRLNNGWRFMPEGDATRVEFVIEFAFKSALLD ---------------------------------------!!!!----------------- AMLAANVDRAAGKLIACFEARAQQLHGA ---------------------------- >POTASSIUM CHANNEL KV1.1; SWP:Q16968; PDB:1T1DA; ERVVINVSGLRFETQLKTLNQFPDTLLGNPQKRNRYYDPLRNEYFFDRNRPSFDAILYFY ------iiii----3333------33333333-1111------------3333------- QSGGRLRRPVNVPLDVFSEEIKFYELGENAFERYREDEGF --------1111---------------------------- >KUMAMOLISIN; SWP:Q8RR56; PDB:1T1GA; AAPTAYTPLDVAQAYQFPEGLDGQGQCIAIIALGGGYDETSLAQYFASLGVSAPQVVSVS ------3333------------2222--------------------1111---------- VDGATNQPTGDPNGPDGEVELDIEVAGALAPGAKIAVYFAPNTDAGFLNAITTAVHDPTH iiii------1111---------------1111--------------------------- 
KPSIVSISWGGPEDSWAPASIAAMNRAFLDAAALGVTVLAAAGDSGSTDGEQDGLYHVDF -----------1111-3333-----------1111------------iiii--------- PAASPYVLACGGTRLVASAGRIERETVWNDGPDGGSTGGGVSRIFPLPSWQERANVPPSA ---1111----------1111---------3333-------------1111--------- NPGAGSGRGVPDVAGNADPATGYEVVIDGETTVIGGTSAVAPLFAALVARINQKLGKPVG 2222-------------1111-----iiii-----3333--------------------- YLNPTLYQLPPEVFHDITEGNNDIANRARIYQAGPGWDPCTGLGSPIGIRLLQALLP -333311113333---------------------------!!!!--------1111- >ARMADILLO REPEAT CONTAINI; SWP:Q8VZ40; PDB:1T1HA; GSPEFPEYFRCPISLELMKDPVIVSTGQTYERSSIQKWLDAGHKTCPKSQETLLHAGLTP ------------------------------3333-------------------------- NYVLKSLIALWCESNGIE 3333------3333---- >HYPOTHETICAL PROTEIN; SWP:Q9I3M0_PSEAE; PDB:1T1JA; HMRKIFLACPYSHADAEVVEQRFRACNEVAATIVRAGHVVFSQVSMSHPINLCLAELDRA -----------------------------------------3333---3333-3333--- AIGRLWAPVDAFYMDHLEELIVLDLPGWRDSAGIRREMEFFEAGGQRVSLWSEVEHEFR ------------------------2222-------------1111----3333--1111 >KURTOXIN; SWP:P58910; PDB:1T1TA; KIDGYPVDYWNCKRICWYNNKYCNDLCKGLKADSGYCWGWTLSCYCQGLPDNARIKRSGR ------------%%%%--3333-----1111------------------1111------- CRA --- >CHOLINE O-ACETYLTRANSFERA; SWP:P32738; PDB:1T1UA; ELDLPKLPVPPLQQTLATYLQCMQHLVPEEQFRKSQAIVKRFGAPGGLGETLQEKLLERQ 1111------------------1111-----------------2222------------- EKTANWVSEYWLNDMYLNNRLALPVNSSPAVIFARQHFQDTNDQLRFAACLISGVLSYKT ----1111-------3333--------------------3333----------------- LLDSHSLPTDWAKGQLSGQPLCMKQYYRLFSSYRLPGHTQDTLVAQKSSIMPEPEHVIVA -1111---2222----------3333---------------------------------- CCNQFFVLDVVINFRRLSEGDLFTQLRKIVKMASNEDERLPPIGLLTSDGRSEWAKARTV iiii-------%%%%--------------------------33333333----------- LLKDSTNRDSLDMIERCICLVCLDGPGTGELSDTHRALQLLHGGGCSLNGANRWYDKSLQ -------------1111------------------------iiii---1111-1111--- FVVGRDGTCGVVCEHSPFDGIVLVQCTEHLLKHMMTSNKKLVRADSVSELPAPRRLRWKC ---1111-------1111--------------1111------------------------ SPETQGHLASSAEKLQRIVKNLDFIVYKFDNYGKTFIKKQKYSPDGFIQVALQLAYYRLY 
------------------------------------------------------------ QRLVPTYESASIRRFQEGRVDNIRSATPEALAFVQAMTDHKAAMPASEKLQLLQTAMQAQ -----------3333-----------------3333--------3333------------ TEYTVMAITGMAIDNHLLALRELARDLCKEPPEMFMDETYLMSNRFVLSTSQVPTTMEMF ------1111---------------------3333------1111--------------- CCYGPVVPNGYGACYNPQPEAITFCISSFHSCKETSSVEFAEAVGASLVDMRDLCSS ------1111------------------3333--------------------3333- >SH3 domain-binding glutam; SWP:Q91VW3; PDB:1T1VA; MSGLRVYSTSVTGSREIKSQQSEVTRILDGKRIQYQLVDISQDNALRDEMRTLAGNPKAT ----------------------------1111------3333-------------1111- PPQIVNGNHYCGDYELFVEAVEQDTLQEFLKLA -----!!!!------------------------ >CHROMOSOMAL PROTEIN MC1; SWP:P12770; PDB:1T23A; SNTRNFVLRDEDGNEHGVFTGKQPRQAALKAANRGSGTKANPDIIRLRERGTKKVHVFKA ---------%%%%-----------1111-------------------------------- WKEIVDAPKNRPAWMPEKISKPFVKKERIEKLE -----------3333------------------ >GDP-MANNOSE 4,6 DEHYDRATA; SWP:O60547; PDB:1T2AA; RNVALITGITGQDGSYLAEFLLEKGYEVHGIVRRSSSFNTGRIEHLYNMKLHYGDLTDST -------1111----------1111--------------11111111-------1111-- CLVKIINEVKPTEIYNLGAQSHVKISFDLAEYTADVDGVGTLRLLDAVKTCGLINSVKFY ---------------------3333----------------------------------- QASTSELYGKVQEIPQKETTPFYPRSPYGAAKLYAYWIVVNFREAYNLFAVNGILFNHES -----3333-------1111---------------------------------------1 PRRGANFVTRKISRSVAKIYLGQLECFSLGNLDAKRDWGHAKDYVEAMWLMLQNDEPEDF 1111111-----------------------1111-----3333------1111------- VIATGEVHSVREFVEKSFLHIGKTIVWEGKNENEVGRCKETGKVHVTVDLKYYRPTEVDF ------------------1111-------!!!!---------------3333-------- LQGDCTKAKQKLNWKPRVAFDELVREMVHADVELMRTN -------------------------------------- >P450CIN; SWP:Q8VQF6; PDB:1T2BA; TSLFTTADHYHTPLGPDGTPHAFFEALRDEAETTPIGWSEAYGGHWVVAGYKEIQAVIQN -3333--1111---1111-----------3333-------iiii---------------- TKAFSNKGVTFPRYETGEFELMMAGQDDPVHKKYRQLVAKPFSPEATDLFTEQLRQSTND -----1111------!!!!---1111------------3333-33331111--------- LIDARIELGEGDAATWLANEIPARLTAILLGLPPEDGDTYRRWVWAITHVENPEEGAEIF 
----1111------------------------1111------------------------ AELVAHARTLIAERRTNPGNDIMSRVIMSKIDGESLSEDDLIGFFTILLLGGIDNTARFL -------------------------1111-iiii-------------------------- SSVFWRLAWDIELRRRLIAHPELIPNAVDELLRFYGPAMVGRLVTQEVTVGDITMKPGQT -------------------3333--------------------------!!!!--2222- AMLWFPIASRDRSAFDSPDNIVIERTPNRHLSLGHGIHRCLGAHLIRVEARVAITEFLKR ---33331111-----1111-1111-----1111-11111111----------------- IPEFSLDPNKECEWLMGQVAGMLHVPIIFPKGKRLSE ------1111--------------------------- >L-LACTATE DEHYDROGENASE; SWP:Q27743; PDB:1T2DA; APKAKIVLVGSGMIGGVMATLIVQKNLGDVVLFDIVKNMPHGKALDTSHTNVMAYSNCKV ---------------------------------------------------1111----- SGSNTYDDLAGADVVIVTAGFTKAPGKSDKEWNRDDLLPLNNKIMIEIGGHIKKNCPNAF ----33332222-----------22223333-3333-------------------1111- IIVVTNPVDVMVQLLHQHSGVPKNKIIGLGGVLDTSRLKYYISQKLNVCPRDVNAHIVGA ------3333-----------1111-----------------------1111-------- HGNKMVLLKRYITVGGIPLQEFINNKLISDAELEAIFDRTVNTALEIVNLHASPYVAPAA -1111--3333--iiii3333-1111---------------------------------- AIIEMAESYLKDLKKVLICSTLLEGQYGHSDIFGGTPVVLGANGVEQVIELQLNSEEKAK --------1111------------2222------------1111---------------- FDEAIAETKRMKALA --------------- >M12-VARIABLE HEAVY DOMAIN; SWP:NA; PDB:1T2JA; QVQLQESGGGLVQPGGSLRLSCAASGFTFSNSAMSWVRQAPGKGLEWVSSIS ------------2222-----------3333--------2222--------- >INTERFERON REGULATORY FAC; SWP:Q14653; PDB:1T2KA; TPKPRILPWLVSQLDLGQLEGVAWVNKSRTRFRIPWKHGLRQDAQQEDFGIFQAWAEATG -------------3333-2222----------------------3333--------1111 AYVPGRDKPDLPTWKRNFRSALNRKEGLRLAEDRSKDPHDPHKIYEFVNS ---------------------------------1111------------- ------------------------------------------------------------ - >AF-6 PROTEIN; SWP:P55196; PDB:1T2MA; MKEPEIITVTLKKQNGMGLSIVAAKGAGQDKLGIYVKSVVKGGAADVDGRLAAGDQLLSV --------------------------------------------3333-----------i DGRSLVGLSQERAAELMTRTSSVVTLEVAKQGA iii------------------------------ >FAB NNA7 HEAVY CHAIN; SWP:NA; PDB:1T2QH; 
EVQLLEESGPGLVQPSQSLSITCTVSGFSLTSYGVHWVRQSPGKGLEWLGVIWSGGSTDY -------------2222-----------1111--------------------1111---- NAAFISRLSISKDNSKSQVFFKMNSLQADDTAIYYCARNRGYSYAMDSWGQGTSVTVSSA 1111---------1111---------1111---------!!!!----------------- KTTPPSVYPLAPGSASMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLS -----------------------------------%%%%-------------iiii---- SSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC -----1111------------1111------------ >SORTASE; SWP:Q9S446; PDB:1T2WA; KPQIPKDKSKVAGYIEIPDADIKEPVYPGPATPEQLNRGVSFAEENESLDDQNISIAGHT ------1111------3333-----------3333--------11111111--------- FIDRPNYQFTNLKAAKKGSMVYFKVGNETRKYKMTSIRDVKPTDVGVLDEQKGKDKQLTL 3333--!!!!-----2222-----1111-------------------------------- ITADDYNEKTGVWEKRKIFVATEVK ------------------------- >putative transcriptional ; SWP:Q8ZQN9; PDB:1T33A; MNIPTTTTKGEQAKSQLIAAALAQFGEYGLHATTRDIAALAGQNIAAITYYFGSKEDLYL ---------------------------!!!!-------1111-3333------------- ACAQWIADFLGEKFRPHAEKAERLFSQPAPDRDAIRELILLACKNMIMLLTQEDTVNLSK ---------------------------------------------------3333----- FISREQLSPTSAYQLVHEQVIDPLHTHLTRLVAAYTGCDANDTRMILHTHALLGEVLAFR ---------3333-------------------------1111-------------3333- LGKETILLRTGWPQFDEEKAELIYQTVTCHIDLILHGLTQ ---------------------------------------- >HYPOTHETICAL PROTEIN YVDD; SWP:O06986; PDB:1T35A; KTICVFAGSNPGGNEAYKRKAAELGVYAEQGIGLVYGGSRVGLGTIADAIENGGTAIGVP ---------------------------1111---------------3333---------1 SGLFSGEVVHQNLTELIEVNGHERKAKSELADGFISPGGFGTYEELFEVLCWAQIGIHQK 11133331111----------------1111------------------1111------- PIGLYNVNGYFEPKVKYSIQEGFSNESHLKLIHSSSRPDELIEQQNY -----2222---------------3333--------3333------- >THIOL:DISULFIDE INTERCHAN; SWP:P45111; PDB:1T3BA; DAAIKRKLQSFNISNIVIKSSPISGIKTAVTDQGILYVSEDGKYLFEGKLYELTNNGPVD ------3333----------------------------3333-----------------1 VAGKILVDKLNSYKDEMIVYPAKNEKHVVTVFMDITCHYCHLLHQQLKEYNDLGITVRYL 111--------------------------------------------------------- 
AFPRAGMNNQTAKQMEAIWTAKDPVFALNEAEKGNLPKEVKTPNIVKKHYELGIQFGVRG --------3333------------------1111-------------------3333--- TPSIVTSTGELIGGYLKPADLLRALEETA -----1111---------3333------- >NEUROTOXIN TYPE E; SWP:Q00496; PDB:1T3CA; PKINSFNYNDPVNDRTILYIKPGGCQEFYKSFNIMKNIWIIPERNVIGTTPQDFHPPTSL ------1111-----------2222---------2222-------22223333-----11 KNGDSSYYDPNYLQSDEEKDRFLKIVTKIFNRINNNLSGGILLEELSKANPYLGNDNTPD 11------1111---------------------------------1111-----111111 NQFHIGDASAVEIKFSNGSQDILLPNVIIMGAEPDLFETNSSNISLRNNYMPSNHGFGSI 11---3333-----1111----------------3333-------2222--1111----- AIVTFSPEYSFRFNDNSMNEFIQDPALTLMHQLIHSLHGLYGAKGITTKYTITQKQNPLI -----1111-----1111---------------------------1111----------- TNIRGTNIEEFLTFGGTDLNIITSAQSNDIYTNLLADYKKIASKLSKVQVSNPLLNPYKD ----------------3333--3333------------------1111---1111----- VFEAKYGLDKDASGIYSVNINKFNDIFKKLYSFTEFDLATKFQVKCRQTYIGQYKYFKLS ----------1111---------------------------------------------- NLLNDSIYNISEGYNINNLKVNFRGQNANLNPRIITPITGRGLVKKIIRFC 1111-------!!!!!!!!22221111----------2222---------- >SERINE ACETYLTRANSFERASE; SWP:P05796; PDB:1T3DA; SCEELEIVWNNIKAEARTLADCEPLASFYHATLLKHENLGSALSYLANKLSSPIPAIAIR --------------------------------1111-----------1111--------- EVVEEAYAADPEIASAACDIQAVRTRDPAVDKYSTPLLYLKGFHALQAYRIGHWLWNQGR --------------------------1111-3333--------------------1111- RALAIFLQNQVSVTFQVDIHPAAKIGRGILDHATGIVVGETAVIENDVSILQSVTLGGTG -------------------1111---------2222--1111------------------ KSGGDRHPKIREGVIGAGAKILGNIEVGRGAKIGAGSVVLQPVPPHTTAAGVPARIVGKP ----------2222-------------2222--2222----------------------- DSDKPSDDQHFNG ------------- >HUZAF ANTIBODY LIGHT CHAI; SWP:Q6GMW1; PDB:1T3FA; DIQMTQSPSTLSASVGDRVTITCKASENVDTYVSWYQQKPGKAPKLLIYGASNRYTGVPS -------------2222---------------------2222------------222233 RFSGSGSGTDFTLTISSLQPDDFATYYCGQSYNYPFTFGQGTKVEVKRTVAAPSVFIFPP 33----------------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 
3333-------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -----1111--------3333------------- >HUZAF ANTIBODY LIGHT CHAI; SWP:NA; PDB:1T3FB; VQLVQSGAELKKPGSSVKVSCKASGYIFTSSWINWVKQAPGQGLEWIGRIDPSDGEVHYN -----------2222-----------1111--------2222-----------------3 QDFKDKATLTVDKSTNTAYMELSSLRSEDTAVYYCARGFLPWFADWGQGTLVTVSSASTK 333--------3333----------3333------------------------------- GPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS ------------------------------------%%%%--2222-------3333--- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -------3333------------1111------------ >X-linked interleukin-1 re; SWP:Q9NZN1; PDB:1T3GA; KDYDAYLSYTKVDTGEEERFALEILPDLEKHYGYKLFIPDRDLIPTGTYIEDVARCVDQS ---------------3333----3333----------3333----1111-------1111 KRLIIVTPNYVVRRGWSIFELETRLRNLVTGEIKVILIECSELRGINYQEVEALKHTIKL -------------11111111-3333----------------------------1111-- LTVIKWHGPKCNKLNSKFWKRLQYEPF -------3333---------------- >PROBABLE CYSTEINE DESULFU; SWP:Q55793; PDB:1T3IA; PSLAATVRQDFPILNQEINGHPLVYLDNAATSQKPRAVLEKLMHYYENDNANVGAHQLSV -3333-33333333---iiii-----3333----3333---------------------- RATDAYEAVRNKVAKFINARSPREIVYTRNATEAINLVAYSWGMNNLKAGDEIITTVMEH --------------------3333-----------------------2222----11113 HSNLVPWQMVAAKTGAVLKFVQLDEQESFDLEHFKTLLSEKTKLVTVVHISNTLGCVNPA 333--------------------1111------3333-1111------------------ EEIAQLAHQAGAKVLVDACQSAPHYPLDVQLIDCDWLVASGHKMCAPTGIGFLYGKEEIL -------1111----------------3333--------3333----------------- EAMPPFFGGGEMIAEVFFDHFTTGELPHKFEAGTPAIAEAIALGAAVDYLTDLGMENIHN ----------------1111---------------------------------------- YEVELTHYLWQGLGQIPQLRLYGPNPKHGDRAALASFNVAGLHASDVATMVDQDGIAIRS ---------------1111-----3333----------22223333-------------- GHHCTQPLHRLFDASGSARASLYFYNTKEEIDLFLQSLQATIRFFS -%%%%----1111---------1111-------------------- ------------------------------------------------------------ -- >DUAL-SPECIFICITY TYROSINE; SWP:Q8GY31; PDB:1T3KA; 
MAMARSISYITSTQLLPLHRRPNIAIIDVRDEERNYDGHIAGSLHYASGSFDDKISHLVQ ----------3333------3333-------1111------------------------- NVKDKDTLVFHSALSQVRGPTCARRLVNYLDEKKEDTGIKNIMILERGFNGWEASGKPVC -----------------3333--------------------------------------- RCAEVPCKGDCA ------------ >CARBON STORAGE REGULATOR; SWP:P33911; PDB:1T3OA; HMLVLSRKINEAIQIGADIEVKVIAVEGDQVKLGIDAPKHIDIHRKEIYLTIQEENNRAA --------------%%%%---------------------------------1111----- ALSSDVISALSSQKK ---------1111-- >QUINOLINE 2-OXIDOREDUCTAS; SWP:P72223; PDB:1T3QA; SQLMRISATINGKPRVFYVEPRMHLADALREVVGLTGTKIGCEQGVCGSCTILIDGAPMR ---------iiii------1111------------------------1111--iiii--1 SCLTLAVQAEGCSIETVEGLSQGEKLNALQDSFRRHHALQCGFCTAGMLATARSILAENP 111-33332222---3333--%%%%----------------1111--------------- APSRDEVREVMSGNLCRCTGYETIIDAITDPAVAEAARRGEV ----------1111------3333------------1111-- >Quinoline 2-oxidoreductas; SWP:P72224; PDB:1T3QB; MMKHEVVALKKKSIGTSVLRREDTRLLTGRGRYIADLVLSGMLHVASLRSPFAHARIVSI 3333--------2222---33331111-----3333--2222------------------ DVADAQALPGVELVWCGADVAELSQGIVATMQVEGFQTTIQPLLANGVTRFVGEIVAVVV -------2222--------3333---------2222------------------------ ASSRAIAEDAAQLIQVEYEELPAVTGIEAALEGEARANDTLAGNVVSRTSRARDELAPIF ----------1111-----------------------1111------------------- ASSAGVVRGQFSCGRVSACPMETRGAVAQYEWTTQQLILWTATQMPSFVRTMVAMFCAIP ------------------------------------------------------1111-3 EHLIEVRVPDVGGGFGQKAHLHPEELLVCLLSRALGRPVRWIEDRQENFLGATHAKQQRN 333------------1111--3333----------------------------------- EMGLAFDGDGRFLALENRSITDGGAYNNLPWTQLVESHVGNAVILGVYKVPAVSEESIAV ------1111------------------------------1111!!!!------------ ATNKCPIGAYRGVGFTAGQIARETLIDRAARQLGLSPFEIRRRNVVMPEDFPFTNRLGQT -----------!!!!--------------------3333-------3333----1111-- HREGTYLQTINLLEEMVNPEAFRQRQAEARARGKYLGLGVSVFNEVTGTGTRTLSFLGTP ----------------------------3333---------------------------- TTTHDSATVRIDPTGKVTVTTSLASSGQGHETTLAQIAADVLGVPASDVVIQAGSTKNTY 
-----------1111-------------3333------------3333------------ GFGAYASRGAVIGAGSIGRAASIVRERVKQLAGHLLEAASEDIVIEDGLVHVAGVPAKGM ----%%%%------------------------------3333---------2222----- PFAEVVGAAYFADATHPPGFDATLEATATYDPSDLVLANGGHAAIVEIDASTYATRVTDF 3333-------3333-2222---------------------------------------- FAVEDCGTMINPMIVEGQIRGGIAQAIGQTLLEEVIYDDFGQLVTTTLMDYLIPTTLDVP -------------------------------------1111-----3333----1111-- DIRIRHLETPSPLVPGGIKGMGESAMISAPAAVVAAVNDALAHLEVVIETVPITPERIFR ----------1111------11113333------------3333---------------- SIQERP -1111- >Quinoline 2-oxidoreductas; SWP:P72222; PDB:1T3QC; MKFPAFSYRAPASLQEVIQVLADDPDARIIAGGQSLLPLLAFRLVYPSCLVDLRNVSELF -----------------------1111------------1111---------11111111 EISQSAGILSVGAMVTHFRNKTDPTVAKCVPILPKVLAHVAHQAVRNRGTLGGSLAHADA -----------------------------3333---1111-3333-------------11 GAEMPFLMATLGATMYIASSAGVRSVSATDFMKGHYFTDLEAGEVLVRVEIPIPALHWEF 113333--1111------3333----3333----------2222---------------- DEYARRKGDYALVMAAAGLSMQGGRCVAARIALGAVEERAHQAIRANDFLVGKVIDESTA -------------------------------------------3333--2222------- ATAAELATEGLEPRSDIHGSRDLRLSLAKAITQRVILKAAQGAMY -------2222----1111-------------------------- >PHOSPHORIBOSYLFORMYLGLYCI; SWP:P74881; PDB:1T3TA; MELRGSPALSAFRINKLLARFQAANLQVHNIYAEYVHFADLNAPLNDSEQAQLTRLLQYG ---------------------1111-----------------------------1111-- PALSSHTPAGKLLLVTPRPGTISPWSSKATDIAHNCGLQQVDRLERGVAYYIEASTLTAE -----------------2222-3333-------11113333------------------- QWRQVAAELHDRMMETVFSSLTDAEKLFIHHQPAPVSSVDLLGEGRQALIDANLRLGLAL -----1111-1111-----3333-3333-----------3333-3333-----1111--- AEDEIDYLQEAFTKLGRNPNDIELYMFAQANSEHCRHKIFNADWIIDGKPQPKSLFKMIK -------------------------------------3333----iiii----------- NTFETTPDYVLSAYKDNAAVMEGSAVGRYFADHNTGRYDFHQEPAHILMKVETHNHPTAI --33332222--------------------------------------------3333-- SPWPGAATGSGGEIRDEGATGRGAKPKAGLVGFSVSNLRIPGFEQPWEEDFGKPERIVTA -------------------!!!!----------------2222-1111-----3333--- 
LDIMTEGPLGGAAFNNEFGRPALTGYFRTYEEKVNSHNGEELRGYHKPIMLAGGIGNIRA ---------------3333----------------1111-------------------11 DHVQKGEIVVGAKLIVLGGPAMNIGFASVQRDNPEMERRCQEVIDRCWQLGDANPILFIH 11------2222-------------1111---3333-------------!!!!------- DVGAGGLSNAMPELVSDGGRGGKFELRDILSDEPGMSPLEIWCNESQERYVLAVAADQLP --2222---------1111-----1111----1111------------------1111-- LFDELCKRERAPYAVIGDATEEQHLSLHDNHFDNQPIDLPLDVLLGKTPKMTRDVQTLKA ---------------------------------------3333----------------- KGDALNRADITIADAVKRVLHLPTVAEKTFLVTIGDRTVTGMVARDQMVGPWQVPVADCA ------1111-----------3333--33331111--2222--------1111------- VTTASLDSYYGEAMSIGERAPVALLDFAASARLAVGEALTNIAATQIGDIKRIKLSANWM ------------------3333-------------------1111---3333-------- AAAGHPGEDAGLYDAVKAVGEELCPQLGLTIPVGKDSMSMKTRWQEGNEQREMTSPLSLV -2222------------------------------------------------------- ISAFARVEDVRHTLTPQLSTEDNALLLIDLGKGHNALGATALAQVYRQLGDKPADVRDVA --------3333----------------1111-----------1111------------- QLKGFYDAMQALVAARKLLAWHDRSDGGLLVTLAEMAFAGHCGVQVDIAALGDDHLAALF ------------------------2222-------------------1111--------- NEELGGVIQVRAEDRDAVEALLAQYGLADCVHYLGQALAGDRFVITANDQTVFSESRTTL ----------3333--------11113333----------------!!!!---------- RVWWAETTWQMQRLRDNPQCADQEHEAKANDTDPGLNVKLSFDINEDIAAPYIATGARPK -------------------------33331111---------111111113333------ VAVLREQGVNSHVEMAAAFHRAGFDAIDVHMSDLLGGRIGLGNFHALVACGGFSYGDVLG -----2222----------1111------3333------3333-------------2222 AGEGWAKSILFNHRVRDEFETFFHRPQTLALGVNGCQMMSNLRELIPGSELWPRFVRNHS --------1111---------1111----------------33332222--------333 DRFEARFSLVEVTQSPSLLLQGMVGSQMPIAVSHGEGRVEVRDDAHLAALESKGLVALRY 3---------------3333--2222------------------------1111------ VDNFGKVTETYPANPNGSPNGITAVTTENGRVTIMMPHPERVFRTVANSWHPENWGEDSP -1111------------2222-----1111-------3333--3333----1111---33 WMRIFRNARKQLG 33----------- >CONSERVED HYPOTHETICAL PR; SWP:Q9HTW3; PDB:1T3UA; TLTVQILDKEYCINCPDDERANLESAARYLDGKREIRSSGKVIGADRVAVAALNITHDLL 
---------------1111-----------------1111-------------------- HRKERLDQESSSTRERVRELLDRVDRALAN -------------------------1111- >DNA PRIMASE; SWP:P02923; PDB:1T3WA; KRTTRILIGLLVQNPELATLVPPLENLDENKLPGLGLFRELVNTCLSQPGLTTGQLLEHY 3333-----------3333----1111----2222------------2222----3333- RGTNNAATLEKLSWDDIADKNIAEQTFTDSLNHFDSLLELRQEELIARERTHGLSNEERL -3333------------------------3333-------------3333---------- ELWTLNQELAK ----------- >COACTOSIN-LIKE PROTEIN; SWP:Q14019; PDB:1T3YA; ATKIDKEACRAAYNLVRDDGSAVIWVTFKYDGSTIVPGEQGAEYQHFIQQCTDDVRLFAF -----------------1111---------!!!!--------3333-33331111----- VRFTTGDAMSKRSKFALITWIGENVSGLQRAKTGTDKTLVKEVVQNFAKEFVISDRKELE -------1111----------1111--------------3333-----------3333-- EDFIKSELKKA ----------- >PROTEIN METHYLTRANSFERASE; SWP:P37186; PDB:1T43A; EYQHWLREAISQLQASESPRRDAEILLEHVTGRGRTFILAFGETQLTDEQCQQLDALLTR --3333--3333------------------------3333-------------------- RRDGEPIAHLTGVREFWSLPLFVSPATLIPRPDTECLVEQALARLPEQPCRILDLGTGTG ------------------------------33333333------3333------------ AIALALASERPDCEIIAVDRMPDAVSLAQRNAQHLAIKNIHILQSDWFSALAGQQFAMIV ---------3333----------------------------------------------- SNPPYIDEQDPHLQQGDVRFEPLTALVAADSGMADIVHIIEQSRNALVSGGFLLLEHGWQ ------11113333-3333--------------------------------------111 QGEAVRQAFILAGYHDVETCRDYGDNERVTLGRY 1--------------------------------- >Homo sapiens v-kit Hardy-; SWP:P10721; PDB:1T46A; GNNYVYIDPTQLPYDHKWEFPRNRLSFGKTLGAGAFGKVVEATAYGLIKSDAAMTVAVKM -------3333---3333--1111---------1111----------------------- LKPSAHLTEREALMSELKVLSYLGNHMNIVNLLGACTIGGPTLVITEYCCYGDLLNFLRR -1111--------------------1111-------------------1111-------- KRDSFLALDLEDLLSFSYQVAKGMAFLASKNCIHRDLAARNILLTHGRITKICDFGLARD 3333-----------------------1111------3333---2222------!!!!-3 IKNDSNYVVKGNARLPVKWMAPESIFNCVYTFESDVWSYGIFLWELFSLGSSPYPGMPVD 3331111--------3333-3333---------------------1111----------- SKFYKMIKEGFRMLSPEHAPAEMYDIMKTCWDADPLKRPTFKQIVQLIEKQISESTN 
---------------1111---------1111-3333---------------1111- >4-HYDROXYPHENYLPYRUVATE D; SWP:Q53586; PDB:1T47A; DPFPVKGMDAVVFAVGNAKQAAHYYSTAFGMQLVAYSGPENGSRETASYVLTNGSARFVL --------------------------1111-------3333-----------!!!!---- TSVIKPATPWGHFLADHVAEHGDGVVDLAIEVPDARAAHAYAIEHGARSVAEPYELKDEH ---------------------------------3333-----1111-----------111 GTVVLAAIATYGKTRHTLVDRTGYDGPYLPGYVAAAPIVEPPAHRTFQAIDHCVGNVELG 1---------!!!!-------------------------------------------222 RMNEWVGFYNKVMGFTNMKEFVGDDIATEYSALMSKVVADGTLKVKFPINEPALAKKKSQ 2--------------------------------------1111----------------- IDEYLEFYGGAGVQHIALNTGDIVETVRTMRAAGVQFLDTPDSYYDTLGEWVGDTRVPVD ------------------------------1111------3333--3333---------- TLRELKILADRDEDGYLLQIFTKPVQDRPTVFFEIIERHGSMGFGKGNFKALFEAIEREQ -----------1111--------------------------------------------- EK -- >PURS; SWP:P12049; PDB:1T4AA; YKVKVYVSLKESVLDPQGSAVQHALHSTYNEVQDVRIGKYELTIEKSDRDLDVLVKECEK ---------1111---------------1111---------------------------- LLANTVIEDYRYEVEE ---3333--------- >ASPARTATE-SEMIALDEHYDE DE; SWP:P00353; PDB:1T4BA; MQNVGFIGWRGMVGSVLMQRMVEERDFDAIRPVFFSTSQLGQAAPSFGGTTGTLQDAFDL -------1111-------------1111----------2222--3333-------1111- EALKALDIIVTCQGGDYTNEIYPKLRESGWQGYWIDAASSLRMKDDAIIILDPVNQDVIT -------------------------1111---------1111-1111---3333------ DGLNNGIRTFVGGNCTVSLMLMSLGGLFANDLVDWVSVATYQAASGGGARHMRELLTQMG ---------------------------1111------------3333------------- HLYGHVADELATPSSAILDIERKVTTLTRSGELPVDNFGVPLAGSLIPWIDKQLDNGQSR -------333311113333---------------3333---2222--------1111--- EEWKGQAETNKILNTSSVIPVDGLCVRVGALRCHSQAFTIKLKKDVSIPTVEELLAAHNP ----------------------------------------------------------11 WAKVVPNDREITMRELTPAAVTGTLTTPVGRLRKLNMGPEFLSAFTVGDQLLWGAAEPLR 11--------------33332222----------3333----------1111-------- RMLRQLA ---1111 >SERINE/THREONINE-PROTEIN ; SWP:Q9JIH7; PDB:1T4HA; AVGSNDGRFLKFDIEIGRGSFKTVYKGLDTETTVEVAWCELQDRKLTKSERQRFKEEAEL 
---1111-----------1111-------------------3333--------------1 KGLQHPNIVRFYDSWESTVKGKKCIVLVTELTSGTLKTYLKRFKVKIKVLRSWCRQILKG 111-1111---------------------------------------------------- LQFLHTRTPPIIHRDLKCDNIFITGPTGSVKIGDLGLATLKRASFAKAVIGTPEFAPEYE ----------------1111-------------333311111111--------------- EKYDESVDVYAFGCLEATSEYPYSECQNAAQIYRRVTSGVKPASFDKVAIPEVKEIIEGC ----------------------3333--------1111---3333--------------- IRQNKDERYSIKDLLNHAFFQ ---1111---------3333- >IMMUNOGLOBULIN IGG1, KAPP; SWP:NA; PDB:1T4KB; EVMLVESGPGLVAPSQSLSITCTVSGFSLSDYGVSWIRQPPGKGLEWLGVIWGDGSTYYA ---------------------------1111--------------------3333----3 SALKFRLTISKDSSKSQVFLNMHSLQ 333---------3333---------- >RIBONUCLEASE III; SWP:Q02555; PDB:1T4OA; TDKLDMNAKRQLYSLIGYASLRLHYVTVKKPTAVDPNSIVECRVGDGTVLGTGVGRNIKI ----1111---------3333----------3333--------1111------------- AGIRAAENALRDKKMLDFYAK --------------------- >C.Elegans p53 tumor suppr; SWP:Q20646; PDB:1T4WA; EKWMEIDVLKQKVAKSSDMAFAISSEHEKYLWTKMGCLVPIQVKWKLDKRHFNSNLSLRI ---------3333----------1111--------------------3333--------- RFVKYDKKENVEYAIRNPRSDVMKCRSHTEREQHFPFDSFFYIRNSEHEFSYSAEKGSTF -----333333331111-------33331111---1111--------------------- TLIMYPGAVQANFDIIFMCQEKCLDLDDRRKTMCLAVFLDDENGNEILHAYIKQVRIVAY ----2222-----------3333-3333------------1111---------------3 PRRDWKNFCEREDAKQ 333--------3333- >ADAPTIVE-RESPONSE SENSORY; SWP:Q06904; PDB:1T4YA; GSSLSPQALAQPLLLQLFVDTRPLSQHIVQRVKNILAAVEATVPISLQVINVADQPQLVE -----------------------------------1111-----------1111------ YYRLVVTPALVKIGPGSRQVLSGIDLTDQLANQLPQWLVQQEGIF -------------------------3333---------------- >ATTRACTIN; SWP:O96910; PDB:1T50A; DQNCDIGNITSQCQMQHKNCEDANGCDTIIEECKTSMVERCQNQEFESAAGSTTLGPQ --------3333---------------3333------3333----------------- >ETHR REPRESSOR; SWP:P96222; PDB:1T56A; GDDRELAILATAENLLEDRPLADISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVVN -------------3333--3333-------1111-----------3333----------- QADMALQTLAENPADTDRENMWRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTF 
-------------------------------------------1111------------- MQKWIAYTAAVIDAERDRGAAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLD ---------------1111----------------------------------1111--- TLVHIWVTSIYGE ------------- >CONSERVED PROTEIN MTH1675; SWP:O27711; PDB:1T57A; EKKICYFEEPGKENTERVLELVGERADQLGIRNFVVASVSGETALRLSEVEGNIVSVTHH ----------3333-----------------------33333333--------------2 AGFREKGQLELEDEARDALLERGVNVYAGSHALSGVGRGISNRFGGVTPVEIAETLRVSQ 222-2222----------3333--------1111-------------3333--3333--3 GFKVCVEIAIAADAGLIPVDEEVIAIGGTAWGADTALVLTPAHNSVFDLRIHEVIAPRP 333--------1111-----------------------------1111----------- >ACYL CARRIER PROTEIN PHOS; SWP:Q8XFP4; PDB:1T5BA; MSKVLVLKSSILAGYSQSGQLTDYFIEQWREKHVADEITVRDLAANPVPVLDGELVGAMR -----------!!!!-----------------1111---------------33333333- DAPLTPRQQDALALSDELIAELKAHDVIVIAAPMYNFNIPTQLKNYFDLIARAGITFRYT ---------------------1111---------iiii-3333--------2222----1 EKGPEGLVTGKRAVVLSSRGGIHKDTPTDLIAPYLKVFLGFIGITDVNFVFAEGIAYGPE 111-------------------2222-------------1111----------1111--- VAAKAQADAKAAIDSVVAA ------------------- >CENTROMERIC PROTEIN E; SWP:Q02224; PDB:1T5CA; EGAVAVCVRVRPLNSREESLGETAQVYWKTDNNVIYQVDGSKSFNFDRVFHGNETTKNVY -------------------!!!!---------------------------3333------ EEIAAPIIDSAIQGYNGTIFAYGQTASGKTYTMMGSEDHLGVIPRAIHDIFQKIKKFPDR -----------------------2222--------3333-----------------3333 EFLLRVSYMEIYNETITDLLCGTQKMKPLIIREDVNRNVYVADLTEEVVYTSEMALKWIT -----------iiii--------------------------------------------- KGEKSRHYGETKMNQRSSRSHTIFRMILESREKGSVKVSHLNLVDLAGSERAARLKEGCN --------------3333-----------------------------3333--------- INRSLFILGQVIKKLSDGQVGGFINYRDSKLTRILQNSLGGNAKTRIICTITPVSFDETL --------------1111------1111-3333-3333-------------1111----- TALQFASTAKYMKNTPYVNEVS -------3333----------- >4-chlorobenzoyl CoA ligas; SWP:Q8GN86; PDB:1T5HX; QTVNELRRAATRAPDHCALAVPARGLRLTHAELRARVEAVAARLHADGLRPQQRVAVVAP -3333-------1111-----1111-------------------1111-2222------- 
NSADVVIAILALHRLGAVPALLNPRLKSAELAELIKRGETAAVIAVQVADAIFQSGSGAR ----------------------1111---------1111--------------------- IIFLGDLVRDGEPYSYGPPIEDPQREPAQPAFIFYTSGLPKAAIIPQRAAESRVLFSTQV --3333--iiii-------------1111----------------3333----------- GLRHGRHNVVLGLPLYHVVGFFAVLVAALALDGTYVVVEEFRPVDALQLVQQEQVTSLFA ----3333-----1111-----------1111------------------1111------ TPTHLDALAAAAAHAGSSLKLDSLRHVTFAGATPDAVLETVHQHLPGEKVNIYGTTEANS ---------11111111---3333------------------------------3333-- LYRQPKTGTEAPGFFSEVRIVRIGGGVDEIVANGEEGELIVAASDSAFVGYLNQPQATAE ---------------------22221111--2222--------3333---2222------ KLQDGWYRTSDVAVWTPEGTVRILGRVDDIISGGENIHPSEIERVLGTAPGVTEVVVIGL --iiii---------1111------3333--iiii--3333-------2222-------- ADQRWGQSVTACVVPRLGETLSADALDTFCRSSELADFKRPKRYFILDQLPKNALNKVLR ---------------2222----------1111--3333-------------1111---- RQLVQQVS -------- >C_TERMINAL DOMAIN OF A PR; SWP:Q13838; PDB:1T5IA; GLQQYYVKLKDNEKNRKLFDLLDVLEFNQVVIFVKSVQRCIALAQLLVEQNFPAIAIHRG ---------1111--------------------------------------------222 MPQEERLSRYQQFKDFQRRILVATNLFGRGMDIERVNIAFNYDMPEDSDTYLHRVARAGR 2------------------------------3333-----------------------22 FGTKGLAITFVSDENDAKILNDVQDRFEVNISELPEQTR 22------------------------------------- >HYPOTHETICAL PROTEIN MJ11; SWP:Q58588; PDB:1T5JA; LVKMRDKILGSVFGAVIGDALGMPTENLTKEEIKKLYGFVDSYVEPKNYLAGKLNKGEWT ---------------------3333----------------------1111---2222-- DDTEQAICLIKSLTKEGIDIKKFANCLIAWKNKNPPDIGLTSLMAIDKLENNDYSGVDSS ---------11113333------------1111-------------3333---------- SCGAAMRIYPLGIVFHNNLKKLKEEVIKASKITHNNKTAIAGALAIAFFVSSALKDRKDF ---1111-3333--3333------------------------------------------ SLLDECYNYIKDIDEEFAKKLLEIKNFNNLDYIYDYFGTGVKTDEVVPSAIATYLLTDNF ---------11113333----3333----------------1111--------------- KEGMLKCINAGGDTDSLASMYGAMAGAYYGFKNIPKEWIDGLKNKEVIFELAERLYHLAT ------1111---3333------------3333-33333333-3333------------- E - >UVRABC SYSTEM PROTEIN B; SWP:P56981; PDB:1T5LA; 
VEGRFQLVAPYEPQGDQPQAIAKLVDGLRRGVKHQTLLGATGTGKTFTISNVIAQVNKPT -------------!!!!------------------------------------------- LVIAHNKTLAGQLYSELKEFFPHNAVEYFVSYYDYAQPEAYVPQTDTYIEKDAKINDEID --------------------1111------3333-------------------------- KLRHSATSALFERRDVIIVASVSCIYGLGSPEEYRELVVSLRVGMEIERNALLRRLVDIQ ----------------------------------1111---------3333-----1111 YDRNDIDFRRGTFRVRGDVVEIFPASRDEHCIRVEFFGDEIERIREVDALTGEVLGEREH ---1111-2222-----------1111--------------------------------- VAIFPASHFVTREEKMRLAIQNIEQELEERLAELRAQGKLLEAQRLEQRTRYDLEMMREM ------------------------------------------------------------ GFCSGIENYSRHLALRPPGSTPYTLLDYFPDDFLIIVDESHVTLPQLRGMYNGDRARKQV ----3333--------2222---3333-----------3333------------------ LVDHGFRLPSALDNRPLTFEEFEQKINQIIYVSATPGPYELEHSPGVVEQIIRPTGLLDP -------3333-----------1111--------------1111--------1111---- TIDVRPTKGQIDDLIGEIRERVERNERTLVTTLTKKMAEDLTDYLKEAGIKVAYLHSEIK ------2222-----------1111--------------------1111----------- TLERIEIIRDLRLGKYDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGR ------------------------------1111------1111-1111---------11 AARNANGHVIMYADTITKSMEIAIQETKRRRAIQEEYNRKHGIVPRTVKKEIRDV 11-1111------------------------------------------------ >Translation initiation fa; SWP:NA; PDB:1T5OA; SLRSIFWDDGLKLIDQTKLPEKLEVIECRNVEELADAIKKLAVRGAPALEAAGAYGIALA --------------3333----------------------------------------11 AREREFADVDELKEHLKKAADFLASTRPTAVNLFVGIERALNAALKGESVEEVKELALRE 11-----3333---------------1111------------3333-------------- AEKLAEEDVERNRKMGEYGAELLEDGDVVLTYCNAGRLATVDWGTALGVVRSAVEQGKEI -------------------11112222--------3333-----3333------------ RVIACETRPLNQGSRLTCWELMEDGIDVTLITDSMVGIVMQKGMVDKVIVGADRIVRDAV ---------------------1111------3333----1111----------------- FNKIGTYTVSVVAKHHNIPFYVAAPKATFDWERTAKDVVIEERPREELIFCGKRQIAPLN -------------1111-------3333-11113333--------------------111 VKVYNPAFDPTPLENVTALITEYGVIYPPYEVNVPKVLKF 1----------3333-----1111----3333-------- >LUKS-PV; SWP:O50603; PDB:1T5RA; 
NIENIGAEVVKRTEDTSSDKWGVTQNIQFDFVKDKKYNKDALILKMQGFINSKTTYYNYK ---------------------------------1111---------------------!! NTDHIKAMRWPFQYNIGLKTNDPNVDLINYLPKNKIDSVNVSQTLGYNIGGNFNSSFNYS !!-------------------1111----------------------------------- KTISYNQQNYISEVEHQNSKSVQWGIKANSFITKMSGHDPNLFVGYKPYSQNPRDYFVPD ------2222-------1111--------------1111-2222--1111-3333---33 NELPPLVHSGFNPSFIATVSHEKGSGDTSEFEITYGRNMDVTHATRRTNSYLEGSRIHNA 33-3333--------------2222----------------------------------- FVNRNYTVKYEVNWKTHEIKVKGHN ------------------------- >TYPE IV COLLAGEN; SWP:Q7SIB2; PDB:1T61A; GFLVTRHSQTTDDPQCPPGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTMPFLF ----------------2222-------------iiii----11111111----------- CNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEAPAMVMAVHSQT -1111--------------------3333---!!!!3333-------------------- IQIPQCPTGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRGTCNY ------2222-------------2222-----11111111------------1111---- YANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMR 1111--------3333---------------1111-------- >TYPE IV COLLAGEN; SWP:NA; PDB:1T61C; YLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYC ---------------2222-------------%%%%----11111111------------ NPGDVCYYASRNDKSYWLSTTAPLPMMPVAEEDIRPYISRCSVCEAPAVAIAVHSQDVSI 1111--------------------------33331111---------------------- PHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGARGTCHYYA ---2222----------------------11111111------------!!!!-----11 NKYSFWLTTIPEQSFQGTPSADTLKAGLIRTHISRCQVCMKN 11--------------------------1111---------- >CONSERVED HYPOTHETICAL PR; SWP:Q82ZD1; PDB:1T62A; MLKNVEVFWQNFLDKHELDMLMPDVWMFGDGSSEMGNRLGQLVVSGRKTATCSSLDIYKM -------------1111------------------------------------3333-11 EEEQLPKAGQYDIILDGQSQPLAIIRTTKVEIMPMNKVSESFAQAEGLDYWYEEHARFFK 11----2222-----1111----------------------------3333--------- EELAPYQLQFYPDMLLVCQSFEVVDLYTHHHHH ----------3333------------------- >HISTONE DEACETYLASE 8; SWP:Q9BY41; PDB:1T64A; LVPVYIYSPEYVSMCDSLAKIPKRASMVHSLIEAYALHKQMRIVKPKVASMEEMATFHTD 
-------------11112222--------------3333----------3333------- AYLQHLQKVSQEGDDDHPDSIEYGLGYDCPATEGIFDYAAAIGGATITAAQCLIDGMCKV ----------------11111111-1111--2222------------------------- AINWSGGWHHAKKDEASGFCYLNDAVLGILRLRRKFERILYVDLDLHHGDGVEDAFSFTS --1111-1111-----iiii-----------------------------------1111- KVMTVSLHKFSPGFFPGTGDVSDVGLGKGRYYSVNVPIQDGIQDEKYYQICESVLKEVYQ ----------2222-----1111--!!!!------------------------------- AFNPKAVVLQLGADTIAGDPMCSFNMTPVGIGKCLKYILQWQLATLILGGGGYNLANTAR ---------------2222-------------------3333------------------ CWTYLTGVILGKTLSSEIPDHEFFTAYGPDYVLEITPSCRPDRNEPHRIQQILNYIKGNL --------------------11111111-------------------------------1 KHVV 111- >IMMUNOGLOBULIN LIGHT CHAI; SWP:NA; PDB:1T66H; EVKLDETGGGLVQPGRPMKLSCVASGFTFSDYWMNWVRQSPEKGLEWVAQIRNKPYNYET ------------2222-----------3333---------------------3333---- YYSDSVKGRFTISRDDSKSSVYLQMNNLRAEDMGIYYCTSYGYHGAYWGQGTLVTVSAAK --3333---------1111---------1111---------1111--------------- TTAPSVYPLAPGTAALKSSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSALY --------------------------------------%%%%-------------%%%%- TLTSSVTVPSSSWPSQTVTCNVAHPASSTKVDKKIVPRNC --------3333------------1111------------ >RBSTP2229 GENE PRODUCT; SWP:P84137; PDB:1T6AA; ANTDLKLPAGKTTIEDVKQLLERYQALKKTGEQLGWAYEQAAFPYTVRIHESVLYLQGDG -------2222--------------------------------------%%%%------1 RLYKGAISVRTAGEETFIDIALPPGATHGDKGKANEFSKWLAKTLGGELHLFSGRTVFG 111--------!!!!-------1111------------------------1111----- >EXOPOLYPHOSPHATASE; SWP:O67040; PDB:1T6CA; PIMRVASIDIGSYSVRLTIAQIKDGKLSIILERGRITSLGTKVKETGRLQEDRIEETIQV ----------------------iiii---------------3333--------------- LKEYKKLIDEFKVERVKAVATEAIRRAKNAEEFLERVKREVGLVVEVITPEQEGRYAYLA ---------------------3333-1111------------------------------ VAYSLKPEGEVCVVDQGGGSTEYVFGKGYKVREVISLPIGIVNLTETFFKQDPPTEEEVK ----------------iiii------!!!!------------------------------ RFFEFLEKELSKVKKPVDTIVGLGGTITTLAALEYNVYPYDPQKVHGKVLTYGQIKKWFD --------3333-----------3333-------------33332222------------ 
TFKEIPSEERSKRFRQVEDRRAKVILAGIGIFLKTLEIFEKDCLIVSDWGLREGVLVSEV -11113333----333333331111----------------------------------- LKENHS ------ >Xylanase inhibitor [Precu; SWP:Q8H0K8; PDB:1T6EX; LPVLAPVTKDPATSLYTIPFHDGASLVLDVAGPLVWSTCDGGQPPAEIPCSSPTCLLANA --------------------iiii---------------2222-----1111----1111 YPAPGCPAPSCKPCTAYPYNPVSGACAAGSLSHTRFVANTTDGSKPVSKVNVGVLAACAP --2222------------------------------------------------------ SKLLASLPRGSTGVAGLANSGLALPAQVASAQKVANRFLLCLPTGGPGVAIFGGGPVPWP -1111--2222--------1111-----------------------------------33 QFTQSMPYTPLVTKGGSPAHYISARSIVVGDTRVPVPEGALATGGVMLSTRLPYVLLRPD 33-----------2222-------------------2222-2222--------------- VYRPLMDAFTKALAAQARAVEAVAPFGVCYDTKTLGNNLGGYAVPNVQLGLDGGSDWTMT ------------1111--------------3333---1111---------2222-----3 GKNSMVDVKQGTACVAFVEMKGVAPAVILGGAQMEDFVLDFDMEKKRLGFSRLPHFTGCG 333-----2222-----------------1111--------------------1111-22 GL 22 >Endo-1,4-beta-xylanase I ; SWP:P55329; PDB:1T6GC; AGINYVQNYNGNLGDFTYDESAGTFSMYWEDGVSSDFVVGLGWTTGSSNAITYSAEYSAS ---------3333------1111-----3333--------------------------11 GSSSYLAVYGWVNYPQAEYYIVEDYGDYNPCSSATSLGTVYSDGSTYQVCTDTRTNEPSI 11--------------------------1111---------iiii--------------- TGTSTFTQYFSVRESTRTSGTVTVANHFNFWAQHGFGNSDFNYQVMAVEAWSGAGSASVT ----------------------3333----1111-------------------------- IS -- >DNA POLYMERASE PROCESSIVI; SWP:P16790; PDB:1T6LA; PPTLALRLKPYKTAIQQLRSVIRALKENTTVTFLPTPSLILQTVRSHCVSKITFNSSCLY ---------------------11111111-------------------------3333-- ITDKSFQPKTINNSTPLLGNFMYLTSSKDLTKFYVQDISDLSAKISMCAPDFNMEFSSAC -------------------33331111--------------------------------- VHGQDIVRESENSAVHVDLDFGVVADLLKWIGPCPTGTVQILVHAGPPAIKFILTNGSEL ----------1111---------------------------------------1111--- EFTSNNRVSFHGVKNMRINVQLKNFYQTLLNCAVTKLPCTLRIVTEHDTLLYVASRNGLF ---1111-------------------------3333--------------------1111 AVENFLTEE --------- >PROBABLE ATP-DEPENDENT RN; SWP:Q13838; PDB:1T6NA; 
SGFRDFLLKPELLRAIVDCGFEHPSEVQHECIPQAILGMDVLCQAKSGMGKTAVFVLATL -1111-----------1111--------------1111-------2222-3333------ QQLEPVTGQVSVLVMCHTRELAFQISKEYERFSKYMPNVKVAVFFGGLSIKKDEEVLKKN -----2222---------------------1111--------------3333-------- CPHIVVGTPGRILALARNKSLNLKHIKHFILDECDKMLEQLDMRRDVQEIFRMTPHEKQV ---------------1111---------------3333------------1111------ MMFSATLSKEIRPVCRKFMQDPMEIFV -------3333----1111-------- >CONSERVED HYPOTHETICAL PR; SWP:Q8KF54; PDB:1T6SA; MQEQRQQLLRSLEALIFSSEEPVNLQTLSQITAHKFTPSELQEAVDELNRDYEATGRTFR ------------------------------------------------------------ IHAIAGGYRFLTEPEFADLVRQLLAPVIQRRLSRSMLEVLAVVAWHQPVTKGEIQQIRGA ---iiii-----3333-------------------------------------------- SPDYSIDRLLARGLIEVRGRADSPGRPLQYGTTEVFLDLFHL ---------1111---------2222---------------- >Putative uncharacterized ; SWP:O67859; PDB:1T6T1; PRNLSEWIKELKKASREAVILVEGKNDKKALSKFSIKNVIDLSGKRYADVVDLEGKWEKV -------------1111--------------1111------22223333---2222---- ILLFDLDTHGERINQKKELLSSQGFLVDENFRNFLKKWNIIHIEEI --------------------1111----3333---1111--3333- >SUPEROXIDE DISMUTASE [NI]; SWP:P80735; PDB:1T6UA; HCDLPCGVYDPAQARIEAESVKAVQEKMAGNDDPHFQTRATVIKEQRAELAKHHVSVLWS -------------------------3333------------------------------- DYFKPPHFEKYPELHQLVNDTLKAMSAAKGSKDPATGQKALDYIAQIDKIFWETKKA ---3333---1111-------------1111---------------------1111- >PHOSPHATASE; SWP:Q9RUV0; PDB:1T70A; MRVLFIGDVFGQPGRRVLQNHLPTIRPQFDFVIVNMENSAGGFGMHRDAARGALEAGAGC --------------------33333333-------1111iiii----------------- LTLGNHAWHHKDIYPMLSEDTYPIVRPLNYADPGTPGVGWRTFDVNGEKLTVVNLLGRVF -----1111----------------------1111----------------------222 MEAVDNPFRTMDALLERDDLGTVFVDFHAEATSEKEAMGWHLAGRVAAVIGTHTHVPTAD 2----------------------------------------2222--------------- TRILKGGTAYQTDAGFTGPHDSIIGSAIEGPLQRFLTERPHRYGVAEGRAELNGVALHFE -----------------------------------------------------------i GGKATAAERYRFIED iii------------ >PHOSPHATASE; SWP:P75429; PDB:1T71A; 
MMNSIKFIFLGDVYGKAGRNIIKNNLAQLKSKYQADLVIVNAENTTHGKGLSLKHYEFLK -----------------------------------------1111iiii----------1 EAGVNYITMGNHTWFQKLDLAVVINKKDLVRPLNLDTSFAFHNLGQGSLVFEFNKAKIRI 111------1111--3333--11111111------3333--------------------- TNLLGTSVPLPFKTTNPFKVLKELILKRDCDLHIVDFHAETTSEKNAFCMAFDGYVTTIF ----1111----------------3333-------------------------------- GTHTHVPSADLRITPKGSAYITDVGMCGPGFGSVIGANPEQSIRLFCAGSREHFEVSKCG -------------1111----------------iiii----------------------- AQLNGVFFEVDVNTKKVIKTEAIRIVEDDPRYLKQDYFNLI ----------------------------3333---3333-- >PHOSPHATE TRANSPORT SYSTE; SWP:O67053; PDB:1T72A; GGGGGGKLFKELEETKEQVIKAKLVQEAIDKATEALNKQNVELAEEVIKGDDTIDLLEVD --3333-----------------------------1111--------------------- IERRCIRIALYQPEAGDLRIGIYKIVSDLERGDEAENIAERAILLAEEPPLKPYVNINFS --------------3333--------3333-------------------------3333- EIVKEVNDSVISFIQQDTLLAKKVIEKDDTVDELYHQLERELTYVLEDPRNIKRAHLSFV -------------------------------------------33333333--------- ARHYERIADHAENVAEAAIYLSEGE ---------------------3333 >Lipopolysaccharide-respon; SWP:P50851; PDB:1T77A; GPVSLSTPAQLVAPSVVVKGTLSVTSSELYFEVDEEDPNFKKIDPKILAYTEGLHGKWLF ------------3333-----------------1111------33331111-------33 TEIRSIFSRRYLLQNTALEIFMANRVAVMFNFPDPATVKKVVNFLPRVGVGTSFGLPQTR 33--------iiii-------1111----------------1111--!!!!1111---33 RISLASPRQLFKASNMTQRWQHREISNFEYLMFLNTIAGRSYNDLNQYPVFPWVITNYES 33---------------------------------1111-3333---------------- EELDLTLPTNFRDLSKPIGALNPKRAAFFAERYESWEDDQVPKFHYGTHYSTASFVLAWL ------3333--11113333-3333----------------------------------1 LRIEPFTTYFLNLQGGKFDHADRTFSSISRAWRNSQRDTSDIKELIPEFYYLPEMFVNFN 111----------------1111--------------1111----3333----1111111 NYNLGVMDDGTVVSDVELPPWAKTSEEFVHINRLALESEFVSCQLHQWIDLIFGYKQQGP 1-----1111--------1111----------------3333-----------1111-33 EAVRALNVFYYLTYEGAVNLNSITDPVLREAVEAQIRSFGQTPSQLLIEPHPPR 33-------11112222-3333-------------------------------- >POL POLYPROTEIN; SWP:P35963; PDB:1T7IA; 
PQITLWKRPLVTIRIGGQLKEALLDTGADDTVLEEMNLPGRWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------1111-------- QIPIEICGHKAIGTVLVGPTPTNVIGRNLLTQIGCTLNF -----iiii----------------3333-1111----- >5-methyltetrahydropteroyl; SWP:Q9X112; PDB:1T7LA; FTKAYAFGFPKIGEKREFKKALEDFWKGKITEEQFEEEMNKLRMYMVENYRKNVDVIPSN ------------1111-------------------------------------------- ELSYYDFVLDTAVMVGAVPERFGEYRGLSTYFDMARGGKALEMTKFFNTNYHYLVPEIET ------------1111--3333--------------1111-------------------- EEFYLLENKPLEDYLFFKSKGIETAPWVIGPFTFLYLSKRNGEWIRRPNQMEKLLESLVS -----------------1111------------------iiii---3333--3333---- VYKEVFEKLVENGCKEILVNEPAFVCDLEKAHWDLILNVYRELSEFPLTVFTYYDSVSDY ---------1111---------1111--3333--------1111---------------- EACVSLPVKRLHFDFVSNEENLKNLEKHGFPEDKKLVAGVINGRQPWKVDLRKVASLVEK --1111------------------------1111-------------------------- LGASAISNSCPLFHLPVTLELENNLPGGLKEKLAFAKEKLEELKMLKDFLEGKTFDLPNV ------------------1111---22221111---------------1111-------- SFEDFAVDLQAVERVRNLPEDSFRREKEYTERDRIQRERLNLPLFPTTTIGSFPQTPEVR -1111--3333---11113333-------------------------------------- KMRSKYRKGEISKEEYEAFIKEQIKKAIELQEEIGLDVLVHGEFERTDMVEFFAEKLNGI ------------------------------------------1111----------2222 ATTQNGWVLSYGSRCYRPPIIYGTVTRPEPMTLKEITYAQSLTEKPVKGMLTGPVTIMSW ----------!!!!-----------------3333----1111--------------111 SYYREDIPEREIAYQIALAINEEVKDLEEAGIKIVQIDEPAFREKAPIKKSKWPEYFEWA 1------3333-------------------------------1111--3333-------- INAFNLAANARPETQIHAHMCYSDFNEIIEYIHQLEFDVISIEASRSKGEIISAFENFKG ----------1111----------1111-3333-----------1111---------222 WIKQIGVGVWDIHSPAVPSINEMREIVERVLRVLPKELIWINPDCGLKTRNWDEVIPSLR 2---------------------------1111--3333--------11113333------ NMVALAKEMREKFE -------------- >ANDROGEN RECEPTOR; SWP:O97775; PDB:1T7RA; CQPIFLNVLEAIEPGVVCAGHDNNQPDSFAALLSSLNELGERQLVHVVKWAKALPGFRNL ---------------------1111----------------------------2222--- HVDDQMAVIQYSWMGLMVFAMGWRSFTNVNSRMLYFAPDLVFNEYRMHKSRMYSQCVRMR 
----------------------------%%%%----1111-------------------- HLSQEFGWLQITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMNYIKELDRIIACKRKN ------1111------------------1111---------------------------- PTSCSRRFYQLTKLLDSVQPIARELHQFTFDLLIKSHMVSVDFPEMMAEIISVQVPKILS ----------------------------------3333-------------------111 GKVKPIYFHT 1--------- >BAG-1 COCHAPERONE; SWP:O44739; PDB:1T7SA; DKIIVGGKNALVDDAGFKLQYEKHNLSNLQKAYDLNLRDVADLERGFLEKPKQVEGKKLE -------------3333------------------------------------------- KKVKYFNEEAERHLETLDGNIITETTPENQAKRNREKRKTLVNGIQTLLNQNDALLRRLQ -------------------------3333-3333-------------------------- EYQS ---- >ZINC-ALPHA-2-GLYCOPROTEIN; SWP:P25311; PDB:1T7VA; DGRYSLTYIYTGLSKHVEDVPAFQALGSLNDLQFFRYNSKDRKSQPMGLWRQVEGMEDWK ----------------2222--------!!!!-----------------1111------- QDSQLQKAREDIFMETLKDIVEYYKDSTGSHVLQGRFGCEIENNRSSGAFWKYYYDGKDY -------------------------1111------------%%%%---------iiii-- IEFNKEIPAWVPFDPAAQITKQKWEAEPVYVQRAKAYLEEECPATLRKYLKYSKNILDRQ -------------3333-------------------------------33331111---- DPPSVVVTSHQAPGEKKKLKCLAYDFYPGKIDVHWTRAGEVQEPELRGDVLHNGNGTYQS ------------------------------------iiii------------1111---- WVVVAVPPQDTAPYSCHVQHSSLAQPLVVPWEA ------1111---------1111---------- >HYPOTHETICAL ACETYLTRANSF; SWP:Q8E989; PDB:1T82A; DELLNRLRQTWHSTIPVSEFQIAPLSFTDGELSVSAPLAPNINLHHTFAGSIYTITLTGW --------------3333---------%%%%-----------1111-------------- GVWLQQQLLNVDGDIVLADAHIRYLAPVTSAPEVKVRWPDTNLSPLQRGRKAKVKLEVQL ------------------------------------------3333-------------- FCDGKLCAQFDGLYVSVP -iiii------------- >LYSOZYME; SWP:P00720; PDB:1T8FA; MNIFEMLRIDEGLALAAYADAAGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITK -------------------1111--------------3333------------------- DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRM ---------------------------1111----------------3333--------- LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYK 1111---------------------------------1111- >YLMD PROTEIN SEQUENCE HOM; SWP:P84138; PDB:1T8HA; 
MPDIFQQEARGWLRCGAPPFAGAVAGLTTKHGGESKGPFASLNMGLHVGDDRTDVVNNRR ----------------3333---------------!!!!--------------------- RLAEWLAFPLERWVCCEQVHGADIQKVTKSDRGNGAQDFATAVPGVDGLYTDEAGVLLAL --------3333---------------1111-2222-3333------------------- CFADCVPIYFVAPSAGLVGLAHAGWRGTAGGIAGHMVWLWQTREHIAPSDIYVAIGPAIG ------------1111------------------------------3333---------3 PCCYTVDDRVVDSLRPTLPPESPLPWRETSPGQYALDLKEANRLQLLAAGVPNSHIYVSE 333----------3333-1111-------2222------------------1111----- RCTSCEEALFFSHRRDRGTTGRMLAFIGRREE -33331111--3333iiii------------- >ACYL CARRIER PROTEIN; SWP:P02901; PDB:1T8KA; STIEERVKKIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPDEEAE -----------------3333-11113333------------------------333311 KITTVQAAIDYINGHQA 11---------1111-- >AMP NUCLEOSIDASE; SWP:P15272; PDB:1T8SA; LTPAQALDKLDALYEQSVVALRNAIGNYITSGELPDENARKQGLFVYPSLTVTWDGSTTN ------------------------------------------1111-------------- PPKTRAFGRFTHAGSYTTTITRPTLFRSYLNEQLTLLYQDYGAHISVQPSQHEIPYPYVI ---------------------3333-----------------------------3333-- LDRSSAGLTRYFPTTFSPLSHFDARRVDFSLARLRHYTGTPVEHFQPFVLFTNYTRYVDE ------3333------------------------------3333---------3333--- FVRWGCSQILDPDSPYIALSCAGGNWITAETEAPEEAISDLAWKKHQPAWHLITADGQGI ----------1111------------------------3333-----------1111--- TLVNIGVGPSNAKTICDHLAVLRPDVWLIGHCGGLRESQAIGDYVLAHAYLRDDHVLDAV -----------------3333--------------11112222-----------1111-- LPPDIPIPSIAEVQRALYDATKLVSGRPGEEVKQRLRTGTVVTTDDRNWELRYSASALRF -1111------------------------3333--------------3333-3333---- NLSRAVAIDESATIAAQGYRFRVPYGTLLCVSDKPLHGEIKGAISEHLQIGIRAIDLLRA ---------------------------------1111----------------------- EGDRLHSRKLRTFNEPPFR !!!!---11111111---- >heparan sulfate D-glucosa; SWP:Q9Y663; PDB:1T8TA; PNSGTLALLLDEGSKQLPQAIIIGVKKGGTRALLEFLRVHPDVRAVGAEPHFFDRSYDKG ------------------------2222-----------1111------------1111- LAWYRDLMPRTLDGQITMEKTPSYFVTREAPARISAMSKDTKLIVVVRDPVTRAISDYTQ ----3333---2222-----1111--1111------------------------------ 
TLSKRPDIPTFESLTFKNGLIDTSWSAIQIGIYAKHLEHWLRHFPIRQMLFVSGERLISD ----1111-3333--------11113333----------1111-3333------3333-- PAGELGRVQDFLGLKRIITDKHFYFNKTKGFPCLKKAEGSSRPHCLGKTKGRTHPEIDRE ---------1111-----3333----3333----------------1111---------- VVRRLREFYRPFNLKFYQMTGHDFGWDG ---------------------------- -------------------------------------------------- >Probable methylmalonate-s; SWP:P42412; PDB:1T90A; EIRKLKNYINGEWVESKTDQYEDVVNPATKEVLCQVPISTKEDIDYAAQTAAEAFKTWSK --------%%%%---------------------------3333---------33333333 VAVPRRARILFNFQQLLSQHKEELAHLITIENGKNTKEALGEVGRGIENVEFAAGAPSLM --------------------------------------------------------3333 MGDSLASIATDVEAANYRYPIGVVGGIAPFNFPMMVPCWMFPMAIALGNTFILKPSERTP --------2222-------------------------------------------33333 LLTEKLVELFEKAGLPKGVFNVVYGAHDVVNGILEHPEIKAISFVGSKPVGEYVYKKGSE 333--------------------------------1111--------------------- NLKRVQSLTGAKNHTIVLNDANLEDTVTNIVGAAFGSAGERCMACAVVTVEEGIADEFMA -----------------1111--------------%%%%-1111-------1111----- KLQEKVADIKIGNGLDDGVFLGPVIREDNKKRTLSYIEKGLEEGARLVCDGRENVSDDGY -----1111---3333---------3333-----------1111---------------- FVGPTIFDNVTTEMTIWKDEIFAPVLSVIRVKNLKEAIEIANKSEFANGACLFTSNSNAI ----------1111--------------------------3333---------------- RYFRENIDAGMLGINLGVPAPMAFFPFSGWKSSFFGTLHANGKDSVDFYTRKKVVTARYP ---------------------1111----!!!!--------3333-1111---------- APDF ---- >GENERAL SECRETION PATHWAY; SWP:P15746; PDB:1T92A; GNKEKADRQKVVSDLVALEGALDMYKLDNSRYPTTEQGLQALVSAPSAEPHARNYPEGGY ---------------------------------33333333--------------2222- IRRLPQDPWGSDYQLLSPGQHGQVDIFSLGPDGVPESNDDIGNWTIGF ------1111--------2222-----------------1111----- >POLYMERASE (DNA DIRECTED); SWP:Q9UBT6; PDB:1T94A; KAQITSQQLRKAQLQVDRFAMELEQSRNLSNTIVHIDMDAFYAAVEMRDNPELKDKPIAV ------3333------------3333-----------------------3333------- GSMSMLSTSNYHARRFGVRAAMPGFIAKRLCPQLIIVPPNFDKYRAVSKEVKEILADYDP -3333-----3333----------------1111-------------------3333-11 
NFMAMSLDEAYLNITKHLEERQNWPEDKRRYFIKMVENDNPGKEVNKLSEHERSISPLLF 11-----------------3333-1111-------------1111-------3333---- NPQILQNSVVFGTSAQEVVKEIRFRIEQKTTLTASAGIAPNTMLAKVCSDKNKPNGQYQI -----------------------------------------------3333--------- LPNRQAVMDFIKDLPIRKVSGIGKVTEKMLKALGIITCTELYQQRALLSLLFSETSWHYF ----------22223333------------1111-----------3333----------- LHISLGLGSTHLGKSMSVERTFSEINKAEEQYSLCQELCSELAQDLQLKGRTVTIKLKNV --1111----------------------------------------------------11 NFEVKTRASVSSVVSTEIFAIAKELLKTEIDADFPHPLRLRLMGVRISS 11---------------3333----3333---3333------------- >HYPOTHETICAL PROTEIN AF04; SWP:O29759; PDB:1T95A; DKAVIARLRKGGEEFEVLVDPYLARDLKEGKEVNFEDLLAAEEVFKDAKKGERASVDELR ---------%%%%-------------3333---3333----------1111--------- KIFGTDDVFEIARKIILEGEVQITAEQRRELEAKRKQIINFISRNTIDPRTNAPHPPSRI -------------------------------------------------------3333- ERALEEAKVHIDIFKSVEAQVKDIVKALKPILPLKFEEEIAIKIPPEHTGRAISALYNFG -----------11113333-------------------------3333------------ GVTREEWQRDGSWICVRIPSGYGDLDLLGKVAKGEALTKVLRRIG -------1111----------3333------%%%%---------- >CHROMOSOME PARTITION PROT; SWP:P60293; PDB:1T98A; VPELVAWARKNDFSISLPVDRLSFLLAVATLNGERLDGESEGELVDAFRHVSDAFEQTSE --------1111-----3333---------3333------------------1111-333 TIGVRANNAINDVRQRLLNRFTIYRLTPLGIGITDYYIRQREFSTLRLSQLSIVAGELKR 3-----------1111---------------------------3333------------- AADAAEEGGDEFHWHRNVYAPLKYSVAEIFDSIDLTQRLDEQQQQVKDDIAQLLNKDWRA ---------------------------------------------------1111-3333 AISSCELLLSETSGTLRELQDTLEAAGDKLQANLLRIQDATTHDDLHFVDRLVFDLQSKL ---------------------------------------------3333----------- DRIISWGQQSIDLWIGYDRHV --------------------- >ACETOLACTATE SYNTHASE, MI; SWP:P07342; PDB:1T9BA; MDTSFVGLTGGQIFNEMMSRQNVDTVFGYPGGAILPVYDAIHNSDKFNFVLPKHEQGAGH --1111------------1111--------3333-------------------------- MAEGYARASGKPGVVLVTSGPGATNVVTPMADAFADGIPMVVFTGQVPTSAIGTDAFQEA ------------------!!!!-------------------------1111--------- 
DVVGISRSCTKWNVMVKSVEELPLRINEAFEIATSGRPGPVLVDLPKDVTAAILRNPIPF 3333-1111--------3333------------------------3333----------3 VMQSINKAADLINLAKKPVLYVGAGILNHADGPRLLKELSDRAQIPVTTTLQGLGSFDQE 333--------1111-------3333--1111--------1111-----1111----111 DPKSLDMLGMHGCATANLAVQNADLIIAVGARFDDRVTGNISKFAPEARRAAAEGRGGII 1--------------------------------1111--3333--------1111----- HFEVSPKNINKVVQTQIAVEGDATTNLGKMMSKIFPVKERSEWFAQINKWKKEYPYAYME ----3333----------------------1111-------------------------- ETPGSKIKPQTVIKKLSKVANDTGRHVIVTTGVGQHQMWAAQHWTWRNPHTFITSGGLGT -2222---------------1111-----------------------------------2 MGYGLPAAIGAQVAKPESLVIDIDGDASFNMTLTELSSAVQAGTPVKILILNNEEQGMVT 222-----------1111------------------------------------------ QWQSLFYEHRYSHTHQLNPDFIKLAEAMGLKGLRVKKQEELDAKLKEFVSTKGPVLLEVE ------%%%%---------------1111-------3333-------------------- VDKKVPVLPMVAGGSGLDEFINFDPEVERQQTELRHKRTGGKH -----------!!!!1111-------------------iiii- >PROTEIN 1D10; SWP:O61793; PDB:1T9FA; SDEDFVTCYSVLKFINANDGSRLHSHDVKYGSGSGQQSVTAVKNSDDINSHWQIFPALNA ------2222------------------------------------1111---------- KCNRGDAIKCGDKIRLKHLTTGTFLHSHHFTAPLSKQHQEVSAFGSEAESDTGDDWTVIC --2222--2222------1111------------1111-------3333----------- NGDEWLESEQFKLRHAVTGSYLSLSGQQFGRPIHGQREVVGTDSITGGSAWKVAEGI -----1111-----------------------2222--------------------- >PROBABLE GTPASE ENGC; SWP:O34530; PDB:1T9HA; HMPEGKIIKALSGFYYVLDESEDSDKVIQCRGRGIFRKNKITPLVGDYVVYQAENDKEGY ----------iiii-----------------------------2222------------- LMEIKERTNELIRPPICNVDQAVLVFSAVQPSFSTALLDRFLVLVEANDIQPIICITKMD ---------------------------------------------1111--------333 LIEDQDTEDTIQAYAEDYRNIGYDVYLTSSKDQDSLADIIPHFQDKTTVFAGQSGVGKSS 3------------------------------3333---3333------------------ LLNAISPTRHVELIHTSGGLVADTPGFSSLEFTDIEEEELGYTFPDIREKSSSCKFRGCL ---------------iiii------------111133333333------3333--2222- HLKEPKCAVKQAVEDGELKQYRYDHYVEFMTEIKDRKPRY ---------------------------------------- >DNA ENDONUCLEASE I-CREI; SWP:P05725; 
PDB:1T9IA; NTKYNKEFLLYLAGFVDGNGSIIAQIKPNQSYKFKHQLSLTFQVTQKTQRRWFLDKLVDE ----------------------------1111--------------3333---------- IGVGYVRDRGSVSDYILSEIKPLHNFLTQLQPFLKLKQKQANLVLKIIEQLPSAKESPDK --------!!!!-----------------3333-----------------3333------ FLEVCTWVDQIAALNDSKTRKTTSETVRAVLDS --------------------------------- >PROBABLE METHYLTHIORIBOSE; SWP:Q9X013; PDB:1T9KA; LKTKTEWSGNSLKLLDQRKLPFIEEYVECKTHEEVAHAIKEIVRGAPAIGVAAAFGYVLG ---------------1111-----------3333-----------3333----------- LRDYKTGSLTDWKQVKETLARTRPTAVNLFWALNREKVFFENADRENLFEILENEALKAY 1111----3333-------------3333------------1111--------------- EDIEVNKAIGKNGAQLIKDGSTILTHCNAGALATVDYGTALGVIRAAVESGKRIRVFADE ------------3333-------------1111--------------------------- TRPYLQGARLTAWELKDGIEVYVITDNAGWLKRGLIDAVVVGADRIALNGDTANKIGTYS -----3333------------------3333---------------3333---------- LAVLAKRNNIPFYVAAPVSTIDPTIRSGEEIPIEERRPEEVTHCGGNRIAPEGVKVLNPA -----1111-------3333-1111-1111------3333-------------------- FDVTENTLITAIITEKGVIRPPFEENIKKILE ----3333-----1111----3333------- >PROBABLE PYRIDOXAMINE 5'-; SWP:O69755; PDB:1T9MA; LTGTIEAPFPEFEAPPANPMEVLRNWLERARRYGVREPRALALATVDGQGRPSTRIVVIA --------1111------------------------1111------1111---------- ELGERGVVFATHADSQKGRELAQNPWASGVLYWRESSQQIILNGRAERLPDERADAQWLS -----------1111------------------1111--------------------111 RPYQTHPMSIASRQSETLADIHALRAEARRLAETDGPLPRPPGYCLFELCLESVEFWGNG 13333-------2222--------------3333------2222---------------- TERLHERLRYDRDEGGWKHRYLQP %%%%-------------------- >Endo-1,4-beta-xylanase C ; SWP:Q00177; PDB:1TA3B; ASLNDLFVAAGKSYFGTCSDQALLQNSQNEAIVASQFGVITPENSMKWDALEPSQGNFGW -------1111--------3333--------------------11113333--2222--- SGADYLVDYATQHNKKVRGHTLVWHSQLPSWVSSIGDANTLRSVMTNHINEVVGRYKGKI ----------1111--------------3333-----------------------2222- MHWDVVNEIFNEDGTFRNSVFYNLLGEDFVRIAFETARAADPDAKLYINDYNLDSASYAK ----------1111--------------------------1111----------1111-- TQAMASYVKKWLAEGVPIDGIGSQAHYSSSHWSSTEAAGALSSLANTGVSEVAITELDIA 
-----------1111-----------------3333----------------------22 GAASSDYLNLLNACLNEQKCVGITVWGVSDKDSWRASDSPLLFDGNYQPKDAYNAIVNAL 22--------------1111--------3333--3333-----1111----------111 S 1 >DNA LIGASE, NAD-DEPENDENT; SWP:Q837V6; PDB:1TA8A; PLTLTAATTRAQELRKQLNQYSHEYYVKDQPSVEDYVYDRLYKELVDIETEFPDLITPDS ---------------------------------3333--------------3333-1111 PTQRVGGKVLSGFEKAPHDIPMYSLNDGFSKEDIFAFDERVRKAIGKPVAYCCELKIDGL ------------------------------------------------------------ AISLRYENGVFVRGATRGDGTVGENITENLRTVRSVPMRLTEPISVEVRGECYMPKQSFV ------iiii------!!!!------3333--1111------------------------ ALNEEREENGQDIFANPRNAAAGSLRQLDTKIVAKRNLNTFLYTVADFGPMKAKTQFEAL ------1111-------------1111-----1111-----------1111--------- EELSAIGFRTNPERQLCQSIDEVWAYIEEYHEKRSTLPYEIDGIVIKVNEFALQDELGFT ---1111---1111------------------1111-----------------------1 VKAPRWAIAYKFP 111---------- >GLYCEROL DEHYDROGENASE; SWP:O13702; PDB:1TA9A; FEESKDRIFTSPQKYVQGRHAFTRSYMYVKKWATKSAVVLADQNVWNICANKIVDSLSQN -------------------33333333------------------------------111 GMTVTKLVFGGEASLVELDKLRKQCPDDTQVIIGVGGGKTMDSAKYIAHSMNLPSIICPT 1--------------------11111111------------------------------- TASSDAATSSLSVIYQFQKYSFYPLNPNLIFIDTDVIVRAPVRFLISGIGDALSTWVETE -----1111--------------------------3333--------------------- SVIRSNSTSFAGGVASIAGRYIARACKDTLEKYALSAILSNTRGVCTEAFENVVEANTLM --1111------------------------------------------------------ SGLGFENGGLAAAHAIHNGMTAIHGPVHRLMHGEKVAYGTLVQVVLEDWPLEDFNNLASF ----1111----------3333-!!!!---3333-------------------------- MAKCHLPITLEELGIPNVTDEELLMVGRATLRPDESIHNMSKKFNPSQIADAIKAVDSYS -1111---3333--1111-------------1111-3333----3333------------ QKWQEQTGWTERFRLPPSRHSPHLTDIHP -----------------1111-------- ------------------------------------ >TAT PROTEIN; SWP:P12506; PDB:1TAC; LDPVDPNIEPWNHPGSQPKTASNRAHAKKSAYHSQVAFITKGLGISYGRKKRRQRRRPSQ ----------------33333333-----------------iiii------33331111- GGQTHQDPIPKQPSSQPRGDPTGPKE --------3333--1111-------- >TRANSDUCIN-ALPHA; SWP:P04695; 
PDB:1TADA; ARTVKLLLLGAGESGKSTIVKQMKIIHQDGYSLEECLEFIAIIYGNTLQSILAIVRAMTT ----------2222--------------------------------------------11 LNIQYGDSARQDDARKLMHMADTIEEGTMPKEMSDIIQRLWKDSGIQACFDRASEYQLND 11----------------3333--2222-----------------------3333---11 SAGYYLSDLERLVTPGYVPTEQDVLRSRVKTTGIIETQFSFKDLNFRMFDVGGQRSERKK 11--3333-33332222------1111-------------iiii---------3333111 WIHCFEGVTCIIFIAALSAYDMVLVEDDEVNRMHESLHLFNSICNHRYFATTSIVLFLNK 11111----------1111----3333------------------3333----------- KDVFSEKIKKAHLSICFPDYNGPNTYEDAGNYIKVQFLELNMRRDVKEIYSHMTCATDTQ -----3333--3333-1111----------------33331111----------1111-- NVKFVFDAVTDIIIKE ---------------- >TFIID TBP ASSOCIATED FACT; SWP:Q27272; PDB:1TAFA; PKDAQVIMSILKELNVQEYEPRVVNQLLEFTFRYVTSILDDAKVYANHARKKTIDLDDVR -----------1111----3333-----------------------1111---------- LATEVTLD -------- >Transcription initiation ; SWP:P49847; PDB:1TAFB; MLYGSSISAESMKVIAESIGVGSLSDDAAKELAEDVSIKLKRIVQDAAKFMNHAKRQKLS -------3333-----1111-------------------------------1111----3 VRDIDMSLKV 333---3333 >HIV-1 MATRIX PROTEIN; SWP:P04591; PDB:1TAM; MGARASVLSGGELDRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQI -----------------------------3333---------------33333333---- LGQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAA ------1111-------------------------3333----------3333------- >Calcium/calmodulin-depend; SWP:Q01064; PDB:1TAZA; VGPTYSTAVLNCLKNLDLWCFDVFSLNQAADDHALRTIVFELLTRHNLISRFKIPTVFLM ------------1111-1111------1111------------------1111------- SFLDALETGYGKYKNPYHNQIHAADVTQTVHFLLRTGMVHCLSEIELLAIIFAAAIHDYE --------1111---------------------11111111------------------- HTGTTNSFHIQTKSECAIVYNDRSVLENHHISSVFRLMQDDEMNIFINLTKDEFVELRAL -------------------%%%%----------------11111111------------- VIEMVLATDMSCHFQQVKTMKTALQQRIDKPKALSLLLHAADISHPTKQWLVHSRWTKAL --------3333-------------------------------3333-3333-------- MEEFFRQGDKEAELGLPRTSTLVAQSQIGFIDFIVEPTFSVLTDVAEKSVQDPNPDVVSF ------------------------------------------------------------ 
RSTWVKRIQENKQKWKERAAS --------------------- >HYDROXYACID OXIDASE 3; SWP:Q07523; PDB:1TB3A; PLVCLADFKAHAQKQLSKTSWDFIEGEADDGITYSENIAAFKRIRLRPRYLRDMSKVDTR ---3333----3333------------!!!!3333-----3333---------------- TTIQGQEISAPICISPTAFHSIAWPDGEKSTARAAQEANICYVISSYASYSLEDIVAAAP --iiii------------3333-3333-------------------------------11 EGFRWFQLYMKSDWDFNKQMVQRAEALGFKALVITIDTPVLGNRRRDKRNQLNLEAKDLR 11---------------------------------------------1111--------- ALKEASFCWNDLSLLQSITRLPIILKGILTKEDAELAMKHNVQGIVVSNHGGRQLDEVSA -------3333--------------------------1111-------%%%%-------3 SIDALREVVAAVKGKIEVYMDGGVRTGTDVLKALALGARCIFLGRPILWGLACKGEDGVK 333--------iiii----------3333----1111------3333------------- EVLDILTAELHRCMTLSGCQSVAEISPDLIQF --------------------3333-3333--- >TRANSCRIPTION INITIATION ; SWP:P51123; PDB:1TBAA; EGSIGNGLDLTGILFGNIDSEGRLLQDDDGEGRGGTGFDAELRENIGSLSKLGLDSMLLE ---------1111-----3333-------------------3333--------3333--- VIDLKEA ------- >TAT PROTEIN; SWP:P12506; PDB:1TBC; LDPVDPNIEPWNHPGSQPKTACNRCHCKKCCYHCQVCFITKGLGISYGRKKRRQRRRPSQ ---------------------------------------3333-------3333-3333- GGQTHQDPIPKQPSSQPRGDPTGPKE --------3333-------------- >CGMP-SPECIFIC 3',5'-CYCLI; SWP:O76074; PDB:1TBFA; EEETRELQSLAAAVVPSAQTLKITDFSFSDFELSDLETALCTIRMFTDLNLVQNFQMKHE ---------1111---3333-1111----1111-------------------1111---- VLCRWILSVKKNYRKNVAYHNWRHAFNTAQCMFAALKAGKIQNKLTDLEILALLIAALSH ---------11111111---3333------------11113333---------------- DLDHPGVSNQFLINTNSELALMYNDESVLEHHHFDQCLMILNSPGNQILSGLSIEEYKTT 2222--------1111------%%%%----------------22221111---------- LKIIKQAILATDLALYIKRRGEFFELIRKNQFNLEDPHQKELFLAMLMTACDLSAITKPW --------------------------------3333----------------3333---- PIQQRIAELVATEFFDQGDRERKELNIEPTDLMNREKKNKIPSMQVGFIDAICLQLYEAL -----------------------------333333331111------------------- THVSEDCFPLLDGCRKNRQKWQALAE ---3333------------------- >PROTEIN KINASE C, GAMMA T; SWP:P63319; PDB:1TBN; QTDDPRNKHKFRLHSYSSPTFCDHCGSLLYGLVHQGMKCSCCEMNVHRRCVRSVPSLCGV 
-----------------------------3333----------------3333------- DHTERR ------ >Thrombin inhibitor rhodni; SWP:Q06684; PDB:1TBRR; EGGEPCACPHALHRVCGSDGETYSNPCTLNCAKFNGKPELVKVHDGPCEPDEDEDVCQEC --3333----------1111----------------1111--------------1111-2 DGDEYKPVCGSDDITYDNNCRLECASISSSPGVELKHEGPCRT 222------1111----3333--------2222---------- >Peroxisomal acyl-coenzyme; SWP:P41903; PDB:1TBUA; KILELVPLSPTSFVTKYLPTFGGTLVSQSLLASLHTVPLNFFPTSLHSYFIKGGDPRTKI -------------------------------------1111-------------1111-- TYHVQNLRNGRNFIHKQVSAYQHDKLIFTSMILFAV ---------1111--------%%%%----------- >HYPOTHETICAL 11.0 KDA PRO; SWP:P20222; PDB:1TBXA; STPFFYPEAIVLAYLYDNEGIATYDLYKKVNAEFPSTATFYDAKKFLIQEGFVKERQERG -----3333-----2222---3333--------------------------------%%% EKRLYLTEKGKLFAISLKTAIETYKQIKKRHHH %---------------------------1111- >HYPOXANTHINE PHOSPHORIBOS; SWP:Q27796; PDB:1TC1A; YEFAEKILFTEEEIRTRIKEVAKRIADDYKGKGLRPYVNPLVLISVLKGSFMFTADLCRA 1111------------------------1111---------------1111--------- LCDFNVPVRMEFICVSSYVRMLLDTRHSIEGHHVLIVEDIVDTALTLNYLYHMYFTRRPA -1111-----------------------2222---------------------------- SLKTVVLLDKREGRRVPFSADYVVANIPNAFVIGYGLDYDDTYRELRDIVVLRPE ---------1111--------------------iiii-%%%%1111--------- >Transposable element Tc3 ; SWP:P34257; PDB:1TC3C; PRGSALSDTERAQLDVMKLLNVSLHEMSRKISRSRHCIRVYLKDPVSYGTS ------3333------------------------------------2222- >Probable eukaryotic D-ami; SWP:NA; PDB:1TC5A; HMTIRVMLQAMDQGHLLVNNVDKYVRAGRGVMVYIAFLSDRDSAPITDEALRHAVGVLLH ------------------------------------------------------------ TKIFTHFSPEKMINQPQSLEECPEMDILIVPQASLGGKVKGRSVQFHQLVAKDVGAALYD -------1111------33331111--------1111--------1111----------- RFCHFVRVARGVDESRVDANGAPRSEGDAPKAEGWIKYNSRVISGTFGNRQGLRFESEGP -------1111-1111-1111---3333-----------------2222----------- FTHMFDI ------- >PHOSPHOLIPASE A2 ISOFORM ; SWP:Q6SLM2; PDB:1TC8A; NLYQLMNMIQCANTRTWPSYTNYGCYCGKGGSGTPVDDLDRCCYTHDHCYNDAKNIDGCN 3333------------3333----------------3333-------------------3 
PVTKTYSYTCTEPTITCNDSKDKCARFVCDCDRTAAICFAKAPYNTSNVMIRSTNSCQ 333------------------3333-------------------1111--2222---- >LIPASE; SWP:P41365; PDB:1TCA; LPSGSDPAFSQPKSVLDAGLTCQGASPSSVSKPILLVPGTGTTGPQSFDSNWIPLSTQLG ----------------1111-----1111---------22223333----------1111 YTPCWISPPPFMLNDTQVNTEYMVNAITALYAGSGNNKLPVLTWSQGGLVAQWGLTFFPS ---------%%%%------------------1111----------------------333 IRSKVDRLMAFAPDYKGTVLAGPLDALAVSAPSVWQQTTGSALTTALRNAGGLTQIVPTT 3------------1111-1111--1111--3333---2222------------------- NLYSATDEIVQPQVSNSPLDSSYLFNGKNVQAQAVCGPLFVIDHAGSLTSQFSYVVGRSA ---1111---------3333---2222---3333--1111--3333-------------- LRSTTGQARSADYGITDCNPLPANDLTPEQKVAAAALLAPAAAAIVAGPKQNCEPDLMPY --1111--3333-3333-----1111-------1111--------------------333 ARPFAVGKRTCSGIVTP 31111----3333---- >TRIOSEPHOSPHATE ISOMERASE; SWP:P52270; PDB:1TCDA; KPQPIAAANWKCNGSESLLVPLIETLNAATFDHDVQCVVAPTFLHIPMTKARLTNPKFQI --------------3333-----------------------3333---------1111-- AAQNAITRSGAFTGEVSLQILKDYGISWVVLGHSERRLYYGETNEIVAEKVAQACAAGFH -----------2222------1111-------3333------------------------ VIVCVGETNEEREAGRTAAVVLTQLAAVAQKLSKEAWSRVVIAYEPVWAIGTGKVATPQQ ----------------------------111133331111-----1111-------3333 AQEVHELLRRWVRSKLGTDIAAQLRILYGGSVTAKNARTLYQMRDINGFLVGGASLKPEF --------------------------------3333------1111-----1111-3333 VEIIEATK ----1111 >TROPONIN C; SWP:P02586; PDB:1TCF; DQQAEARSYLSEEMIAEFKAAFDMFDADGGGDISVKELGTVMRMLGQTPTKEELDAIIEE ------33333333-----------1111---------------------------3333 VDEDGSGTIDFEEFLVMMVRQMKEDAKGKSEEELAELFRIFDRNADGYIDAEELAEIFRA -1111------------------------------------1111--------------- SGEHVTDEEIESLMKDGDKNNDGRIDFDEFLKMMEG -----3333--------1111--------------- >PURINE-NUCLEOSIDE PHOSPHO; SWP:Q9BMI9; PDB:1TCVA; ESVTANIENVKKVAHHIQKLTSIVPEIGIICGSGLGKLADGVKDKITIPYTKIPNFPQTS ----------------1111-----------2222-1111--------33332222---- HSGNLIFGTLSGRKVVVMQGRFHMYEGYSNDTVALPIRVMKLLGVKILMVSNAAGGLNRS ---------iiii---------3333-------------------------------111 
LKLGDFVILKDHIYLPGLGLNNILVGPNQEAFGTRFPALSNAYDRDLRKLAVQVAEENGF 12222-----------1111-1111---3333-----------------------11113 GNLVHQGVYVMNGGPCYETPAECTMLLNMGCDVVGMSTIPEVVIARHCGIQVFAVSLVTN 333-----------------------1111---------------1111----------- ISVLDVESDGAQRAELMQSWFEKIIEKLPKD ----3333----------------1111--- >HEAD DECORATION PROTEIN; SWP:P36275; PDB:1TD4A; VRIFAGNDPAHTATGSSGISSPTPALTPLMLDEATGKLVVWDGQKAGSAVGILVLPLEGT --------------------------------------------2222------------ ETALTYYKSGTFATEAIHWPEVDEHKKANAFAGSALSHAALP ------------3333--------------2222-------- >ACETATE OPERON REPRESSOR; SWP:P16528; PDB:1TD5A; GHSRNLLAIVHPILRNLEESGETVNAVLDQSDHEAIIIDQVQCTHLRSAPIGGKLPHASG -----------------3333----------------------------2222------- AGKAFLAQLSEEQVTKLLHRKGLHAYTHATLVSPVHLKEDLAQTRKRGYSFDDEEHALGL ---------------3333-------1111--3333--------------------2222 RCLAACIFDEHREPFAAISISGPISRITDDRVTEFGAVIKAAKEVTLAYGG ----------------------3333-1111-------------------- >HYPOTHETICAL PROTEIN MG23; SWP:P75455; PDB:1TD6A; PNQFVNHLSALKKHFASYKELREAFNDYHKHNGDELTTFFLHQFDKVMELVKQKDFKTAQ ----------------------------------3333------------1111------ SRCEEELAAPYLPKPLVSFFQSLLQLVNHDLLEQQNAALASLPAAKIIELVLQDYPNKLN ----333311113333-------------------3333----------1111------- MIHYLLPKTKAFVKPHLLQRLQFVLTDSELLELKRFSFFQALNQIPGFQGEQVEYFNSKL ----11111111-333311113333-1111--------------3333---------111 KQKFTLTLGEFEIAQQPDAKAYFEQLITQIQQLFLKEPVNAEFANEIIDAFLVSYFPLHP 1--------------------------------3333----------------------- PVPLAQLAAKIYEYVSQIVLNEAVNLKDELIKLIVHTLYEQLDRPV -----------------------33333333------1111----- >PHOSPHATE ACETYLTRANSFERA; SWP:P39646; PDB:1TD9A; MADLFSTVQEKVAGKDVKIVFPEGLDERILEAVSKLAGNKVLNPIVIGNENEIQAKAKEL 1111-------2222-------1111---------------------------------- NLTLGGVKIYDPHTYEGMEDLVQAFVERRKGKATEEQARKALLDENYFGTMLVYKGLADG ----------1111--------------iiii---------------------------- LVSGAAHSTADTVRPALQIIKTKEGVKKTSGVFIMARGEEQYVFADCAINIAPDSQDLAE 
-------3333---1111------------------!!!!-------------------- IAIESANTAKMFDIEPRVAMLSFSTKGSAKSDETEKVADAVKIAKEKAPELTLDGEFQFD -------3333-------------------3333-------------3333--------3 AAFVPSVAEKKAPDSEIKGDANVFVFPSLEAGNIGYKIAQRLGNFEAVGPILQGLNMPVN 3333333----------------------------------------------------- DLSRGCNAEDVYNLALITAAQAL ----------------------- >NEI ENDONUCLEASE VIII-LIK; SWP:Q96FI4; PDB:1TDHA; PEGPELHLASQFVNEACRALVFGGCVEKSSVSRNPEVPFESSAYRISASARGKELRLILS -3333-----------1111--------3333------------------!!!!------ PLPGAQPQQEPLALVFRFGMSGSFQLVPREELPRHAHLRFYTAPPGPRLALCFVDIRRFG -2222----------------------1111-----------------------1111-- RWDLGGKWQPGRGPCVLQEYQQFRESVLRNLADKAFDRPICEALLDQRFFNGIGNYLRAE --------2222-----------------33331111-33331111-------3333--- ILYRLKIPPFEKARSVLEALQPELTLSQKIRTKLQNPDLLELCHSVPKEVVQLGGRGYGS --1111-11113333-3333------------1111-3333---------------2222 ESGEEDFAAFRAWLRCYGMPGMSSLQDRHGRTIWFQGDPGPLAP ----------3333-2222-------1111---------1111- >GLUTATHIONE S-TRANSFERASE; SWP:Q16772; PDB:1TDIA; KPKLHYFNGRGRMEPIRWLLAAAGVEFEEKFIGSAEDLGKLRNDGSLMFQQVPMVEIDGM ---------!!!!----------------------------1111-1111------iiii KLVQTRAILNYIASKYNLYGKDIKERALIDMYTEGMADLNEMILLLPLCRPEEKDAKIAL -------------1111---------------------------1111-3333------- IKEKTKSRYFPAFEKVLQSHGQDYLVGNKLSRADISLVELLYYVEELDSSLISNFPLLKA -------------------------%%%%------------------11111111----- LKTRISNLPTVKKFLQPGSPRKPPADAKALEEARKIFR -------------------------------------- >BIOSYNTHETIC THREONINE DE; SWP:P04968; PDB:1TDJ; QPLSGAPEGAEYLRAVLRAPVYEAAQVTPLQKMEKLSSRLDNVILVKREDRQPVHSFKLR --------3333-------3333------------------------1111--------- GAYAMMAGLTEEQKAHGVITASAGNHAQGVAFSSARLGVKALIVMPTATADIKVDAVRGF ----11113333-------------3333------------------------------- GGEVLLHGANFDEAKAKAIELSQQQGFTWVPPFDHPMVIAGQGTLALELLQQDAHLDRVF ------------------------------------------------1111-------- VPVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDSACLKAALDAGHPVDLPRVGLFAEGVAV 
------3333---------3333------1111--33331111------------1111- KRIGDETFRLCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAGMKKYIALHN ---------------------3333-----------------3333-------------- IRGERLAHILSGANVNFHGLRYVSERCELGEQREALLAVTIPEEKGSFLKFCQLLGGRSV ---------------3333-----------------------------3333-------- TEFNYRFADAKNACIFVGVRLSRGLEERKEILQMLNDGGYSVVDLSDDEMAKLHVRYMVG -----------------------------------------------3333--------- GRPSHPLQERLYSFEFPESPGALLRFLNTLGTYWNISLFHYRSHGTDYGRVLAAFEYDCH ---------------------------3333----------------------------- DETNNPAFRFFLAG -------------- >CARNOBACTERIOCIN B2 IMMUN; SWP:P38582; PDB:1TDPA; MDIKSQTLYLNLSEAYKDPEVKANEFLSKLVVQCAGKLTASNSENSYIEVISLLSRGISS ----3333---------1111--3333------------3333------------3333- YYLSHKRIIPSSMLTIYTQIQKDIKNGNIDTEKLRKYEIAKGLMSVPYIYF -!!!!---------------------------------------------- >TENASCIN-R; SWP:Q05546; PDB:1TDQA; IPVIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGLVGGEGGKTTFRLQPPLSQYS ----------------------------------------------------3333---- VQALRPGSRYEVSISAVRGTNESDASSTQFTTEIDAPKNLRVGSRTATSLDLEWDNSEAE -----------------!!!!--------------------------------------- AQEYKVVYSTLAGEQYHEVLVPKGIGPTTKTTLTDLVPGTEYGVGISAVMNSKQSIPATM ---------3333------------------------------------!!!!------- NARTELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGISSEVTVPRDRTSYT ----------------------------------------------------3333---- LTDLEPGAEYIISITAERGRQQSLESTVDAF -----------------!!!!---------- >Aggrecan core protein [Pr; SWP:P07897; PDB:1TDQB; EQCEEGWTKFQGHCYRHFPDRETWVDAERRCREQQSHLSSIVTPEEQEFVNKNAQDYQWI ---2222------------------------1111------------------------- GLNDRTIEGDFRWSDGHSLQFEKWRPNQPDNFFATGEDCVVMIWHERGEWNDVPCNYQLP ------2222--1111--------2222--------------1111-------1111--- FTCKKG ------ >FORMAMIDOPYRIMIDINE-DNA G; SWP:P42371; PDB:1TDZA; ELPEVETVRRELEKRIVGQKIISIEATYPRMVLTGFEQLKKELTGKTIQGISRRGKYLIF ---------------2222--------33331111-------2222-------!!!!--- EIGDDFRLISHLRMEGKYRLATLDAPREKHDHLTMKFADGQLIYADVRKFGTWELISTDQ 
---------------------1111--1111--------------1111-------1111 VLPYFLKKKIGPEPTYEDFDEKLFREKLRKSTKKIKPYLLEQTLVAGLGNIYVDEVLWLA --------------3333------------------------------3333-------- KIHPEKETNQLIESSIHLLHDSIIEILQKAIKLGGSSILGSTGKMQNELQVYGKTGEKCS --11111111--------------------1111---------3333---2222------ RCGAEIQKIKVAGRGTHFCPVCQQK ----------iiii----3333--- >PROTEASE DEGS; SWP:P31137; PDB:1TE0A; FDSTDETPASYNLAVRRAAPAVVNVYNRGLNTNSHNQLEIRTLGSGVIMDQRGYIITNKH -----------------3333------------------------------------333 VINDADQIIVALQDGRVFEALLVGSDSLTDLAVLIIKATGGLPTIPINARRVPHIGDVVL 32222------1111-----------1111-----------------1111--2222--- AIGNPYNLGQTITQGIISATGRIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGIN -------------------------1111-----------1111------1111------ TLSFDKSNDGETPEGIGFAIPFQLATKIMDKLIRDGRVIRGYIGIGGREIAPLHAQGGGI ------------------------------------------------------------ DQLQGIVVNEVSPDGPAANAGIQVNDLIISVDNKPAISALETMDQVAEIRPGSVIPVVVM --------------3333----1111----%%%%-----------11112222------- RDDKQLTLQVTIQEYPAT ------------------ >Endo-1,4-xylanase [Precur; SWP:Q9HFH0; PDB:1TE1B; QSITTSQTGTNNGYYYSFWTNGGGEVTYTNGDNGEYSVTWVDCGDFTSGKGWNPANAQTV ----------%%%%-----------------iiii------------------------- TYSGEFNPSGNAYLAVYGWTTDPLVEYYILESYGTYNPSSGLTSLGQVTSDGGTYDIYST ------------------------------------------------------------ QRVNQPSIEGTSTFNQYWSVRTEKRVGGTVTTANHFAAWKALGLEMGTYNYMIVSTEGYE ------1111----------------------------3333------------------ SSGSSTITVS ---------- >PUTATIVE PHOSPHATASE; SWP:P77247; PDB:1TE2A; RQILAAIFDDGLLIDSEPLWDRAELDVASLGVDISRRNELPDTLGLRIDVVDLWYARQPW ---------------------------1111-33333333--2222-------------- NGPSRQEVVERVIARAISLVEETRPLLPGVREAVALCKEQGLLVGLASASPLHLEKVLTF --------------------------2222-------1111------------------- DLRDSFDALASAEKLPYSKPHPQVYLDCAAKLGVDPLTCVALEDSVNGIASKAARRSIVV -3333------1111-------------------3333------3333---1111----- PAPEAQNDPRFVLANVKLSSLTELTAKDLLG -3333-----3333-----3333-3333--- >CONSERVED PROTEIN MTH187; SWP:O26289; 
PDB:1TE4A; MADENKWVRRDVSTALSRMGDEAFEPLLESLSNEDWRIRGAAAWIIGNFQDERAVEPLIK --------2222--------1111-----------3333-----3333--3333-33333 LLEDDSGFVRSGAARSLEQIGGERVRAAMEKLAETGTGFARKVAVNYLETH 333-------------3333-----------3333--3333---------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1TE5A; CELLGMSANVPTDIVFSFTGLMQRGGGTGPHRDGWGIAFYEGRGVRLFQDPLASVDSEVA ------------------------------------------------------------ RLVQRFPIKSETVIGHIRQANVGKVGLSNTHPFIRELGGRYWTFAHNGQLADFQPKPGFY -------------------------3333-------%%%%-------------------- RPVGETDSEAAFCDLLNRVRRAFPEPVPVEVLLPVLISACDEYRKKGVFNALISDGDWLF ---------------------------3333------------1111------------- TFCSSKLAYITRRAPFGPARLKDADLTVDFHAETTPDDVVTVIATEPLTDNENWTLQQSG ---------------------------------------------------------222 EWVLWWGGEVLAK 2----iiii---- >HYPOTHETICAL UPF0267 PROT; SWP:Q46828; PDB:1TE7A; MQPNDITFFQRFQDDILAGRKTITIRDESESHFKTGDVLRVGRFEDDGYFCTIEVTATST --------------------------3333------------3333-------------- VTLDTLTEKHAEQENMTLTELKKVIADIYPGQTQFYVIEFKCL --3333-----1111---------------------------- >PKS18; SWP:Q7D8I1; PDB:1TEDA; AQLPPAPPTTVAVIEGLATGTPRRVVNQSDAADRVAELGQRERIPRVYQKSRITTRRMAV ----------------------------------3333---------1111--------- DPLDAKFDVFRREPATIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTG 1111-----1111--------------------------1111--3333----------- FIAPGVDVAIVKELGLSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVVCIELC -----------1111-1111-------!!!!---------------1111--------33 SVNAVFADDINDVVIHSLFGDGCAALVIGASQVQEKLEPGKVVVRSSFSQLLDNTEDGIV 33-----------------------------1111--2222-------------1111-- LGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDNGLQISDIDLWAIHPGGPKIIEQSVR ---1111-----1111----------------1111-3333---------3333------ SLGISAELAAQSWDVLARFGNMLSVSLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEG ----3333--------------3333---------------------------------- MLFDIIRR -------- >DISINTEGRIN CHAIN A; SWP:P83658; PDB:1TEJA; SVNPCCDPVICKPRDGEHCISGPCCNNCKFLNSGTICQRARGDGNHDYCTGITTDCPRNR 
--1111-------2222--------iiii--2222-----------------------11 YN 11 >DISINTEGRIN CHAIN A; SWP:NA; PDB:1TEJB; NSVNPCCDPQTCKPIEGKHCISGPCCENCYFLRSGTICQRARGDGNNDYCTGITPDCPRN ---1111-------2222----1111iiii--2222-----------------------3 RYN 333 >TENASCIN; SWP:P24821; PDB:1TEN; RLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTEDENQYSIG -------------------------------------1111---------3333------ NLKPDTEYEVSLISRRGDMSSNPAKETFTT ---------------!!!!----------- >UMP-CMP KINASE; SWP:P30085; PDB:1TEVA; PLVVFVLGGPGAGKGTQCARIVEKYGYTHLSAGELLRDERKNPDSQYGELIEKYIKEGKI --------2222-----------------------------3333---------1111-- VPVEITISLLKREMDQTMAANAQKNKFLIDGFPRNQDNLQGWNKTMDGKADVSFVLFFDC --------------------3333---------------------2222----------- NNEICIERCLERGKSSGRSDDNRESLEKRIQTYLQSTKPIIDLYEEMGKVKKIDASKSVD ------------1111----------------------------1111------------ EVFDEVVQIFDKEG -------------- >STF0 SULFOTRANSFERASE; SWP:NA; PDB:1TEXA; DHPTAYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPNTSMSPQPREWFADEDQSIL ----------------------3333-------1111----------------------- RLLDPLIEGKPDLAPATIWRDYIQTVGRTPNGVWGGKLMWNQTPLLVQRAKDLPDRSGSG ------------------------11111111------3333-------1111------- LLSAIRDVVGSDPVLIHIHRPDVVSQAVSFWRAVQTRVWRRAEYHAGAIAHVITMLRAQE --------------------------------------------3333------------ EGWRAWFTEENVEPIDVDYPYLWRNLTEVVGTVLEALGQDPRLAEWVERYRDQRDGLPL -----------------3333----3333-----1111-3333--3333---------- >Peptostreptococcal albumi; SWP:Q51911; PDB:1TF0B; TIDQWLLKNAKEDAIAELKKAGITSDFYFNAINKAKTVEEVNALKNEILKAHA ------------------1111------------------------------- >Negative regulator of all; SWP:P77734; PDB:1TF1A; GRENLYFQGHMDVLSVAGPFMRRLMLLSGETVNVAIRNGNEAVLIGQLECKSMVRMCAPL 1111---------------------------------%%%%------------------- GSRLPLHASGAGKALLYPLAEEELMSIILQTGLQQFTPTTLVDMPTLLKDLEQARELGYT ----1111-------1111-----------------1111-------------------- VDKEEHVVGLNCIASAIYDDVGSVVAAISISGPSSRLTEDRFVSQGELVRDTARDISTAL ------2222--------1111----------3333-3333-----------------11 
GLKA 11-- >TRANSCRIPTION FACTOR IIIA; SWP:P03001; PDB:1TF3A; MKRYICSFADCGAAYNKNWKLQAHLSKHTGEKPFPCKEEGCEKGFTSLHHLTRHSLTHTG --------3333-----------3333-----------------------------3333 EKNFTCDSDGCDLRFTTKANMKKHFNRFHNIK -------------------------------- >T. FUSCA ENDO/EXO-CELLULA; SWP:P26221; PDB:1TF4A; EPAFNYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHV -----------------1111----1111-1111---11113333--------------- KFGFPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGD -------------------------------------------------1111------- GDADHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQ ---3333--3333----------3333-----------------1111------------ HAKQLYTFADTYRGVYSDCVPAGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEY --------------3333---1111--11113333----------------------333 DFLSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKETGKQKYIDDANRWLDYWTVGVNGQRV 3---------------------3333----------------------------iiii-- PYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDPVRKQRYHDFAVRQINYALGDNPRN --3333----------------------1111-------------------1111-1111 SSYVVGFGNNPPRNPHHRTAHGSWTDSIASPAENRHVLYGALVGGPGSPNDAYTDDRQDY ---2222---------3333------1111-------2222--------------1111- VANEVATDYNAGFSSALAMLVEEYGGTPLADFPPTEEPDGPEIFVEAQINTPGTTFTEIK -----3333--------------------------------------------------- AMIRNQSGWPARMLDKGTFRYWFTLDEGVDPADITVSSAYNQCATPEDVHHVSGDLYYVE -------------------------22223333--------------------------- IDCTGEKIFPGGQSEHRREVQFRIAGGPGWDPSNDWSFQGIGNELAPAPYIVLYDDGVPV --2222-------1111--------------11113333--------1111---iiii-- WGTAP ----- >PREPROTEIN TRANSLOCASE SE; SWP:P28366; PDB:1TF5A; HMLGILNKRTLNRYEKIANDIDAIRGDYENLSDDALKHKTIEFKERLEKGATTDDLLVEA -----------------------3333------------------------3333----- FAVVREASRRVTGMFPFKVQLMGGVALHDGNIAEMKTGEGKTLTSTLPVYLNALTGKGVH -----------------------------------22223333---------1111---- VVTVNEYLASRDAEQMGKIFEFLGLTVGLNLNSMSKDEKREAYAADITYSTNNELGFDYL -------------------3333-------1111-------1111--------------- 
RDNMVLYKEQMVQRPLHFAVIDEVDSILIDEARTPLIISGQAAKSTKLYVQANAFVRTLK 1111--1111------------------1111-------------3333------3333- AEKDYTYDIKTKAVQLTEEGMTKAEKAFGIDNLFDVKHVALNHHINQALKAHVAMQKDVD -------------------------------11111111----------------2222- YVVEDGQVVIVDSFTGRLMKGRRYSEGLHQAIEAKEGLEIQNESMTLATITFQNYFRMYE ---%%%%-----------------iiii-------------------------------- KLAGMTGTAKTEEEEFRNIYNMQVVTIPTNRPVVRDDRPDLIYRTMEGKFKAVAEDVAQR --------3333------------------------------------------------ YMTGQPVLVGTVAVETSELISKLLKNKGIPHQVLNAKNHEREAQIIEEAGQKGAVTIATN ------------3333--------1111-----------------1111-2222----11 MAGRGTDIKLGEGVKELGGLAVVGTERHESRRIDNQLRGRSGRQGDPGITQFYLSMEDEL 112222----22221111------------------------iiii--------111133 MRRFGAERTMAMLDRFGMDDSTPIQSKMVSRAVESSQKRVEGNNFDSRKQLLQYDDVLRQ 33-3333-----------3333---3333------------------------------- QREVIYKQRFEVIDSENLREIVENMIKSSLERAIAAYTPREELPEEWKLDGLVDLINTTY ----------------------------------1111----2222-------------- LDEGALEKSDIFGKEPDEMLELIMDRIITKYNEKEEQFGKEQMREFEKVIVLRAVDSKWM ------------------------------------------------------------ DHIDAMDQLRQGIHLRAYAQTNPLREYQMEGFAMFEHMIESIEDEVAKFVMKAEI ------------1111--------------------------------------- >TRANSFERRIN; SWP:P19134; PDB:1TFD; VRWCAVNDHEASKCANFRDSMKKVLPEDGPRIICVKKASYLDCIKAIAAHEADAVTLDAG --------------------1111------------------------------------ LVHEAGLTPNNLKPVVAEFYGSKENPKTFYYAVALVKKGSNFQLNELQGKKSCHTGLGRS --3333------------------------------2222--3333---------22223 AGWNIPIGLLYCDLPEPRKPLEKAVASFFSGSCVPCADQLCQLCPGCGCSSSQPYFGYSG 333------------------------------2222---1111-------------111 AFKCLKDGLGDVAFVKQETIFENLPSKDERDQYELLCLDNTRKPVDEYEQCHLARVPSHA 1----------------3333----3333--------------33331111--------- VVARSVDGKEDLIWELLNQAQEHFGKDKSGDFQLFSSPHGKNLLFKDSAYGFFK ---------3333--3333-----------------1111-----3333----- >ELONGATION FACTOR TS; SWP:P43895; PDB:1TFE; AREGIIGHYIHHNQRVGVLVELNCETDFVARNELFQNLAKDLAMHIAMMNPRYVSAEEIP 
----------1111----------------------------------------3333-- AEELEKERQIYIQAALNEGKPQQIAEKIAEGRLKKYLEEVVLLEQPFVKDDKVKVKELIQ ---------------1111---------------------------1111---3333--- QAIAKIGENIVVRRFCRFELGA ------------------2222 >UBIQUITIN THIOLESTERASE P; SWP:Q96DC9; PDB:1TFFA; NLISEKCDILSILRDHPENRIYRRKIEELSKRFTAIRKTKGDRNCFYRALGYSYLESLLG -------3333----33331111----------------------------------222 KSREIFKFKERVLQTPNDLLAAGFEEHKFRNFFNAFYSVVELVEKDGSVSSLLKVFNDQS 2------------------1111-3333-------------------------------- ASDHIVQFLRLLTSAFIRNRADFFRHFIDEEDIKDFCTHEVEPATECDHIQITALSQALS -------------------33333333----3333------------------------- IALQVEYVDHHVFPEAATPSVYLLYKTSHYNILYAADKH -------------------------%%%%---------- >TRANSTHYRETIN; SWP:P27731; PDB:1TFPA; CPLMVKVLDAVRGSPAANVAVKVFKKAADGTWQDFATGKTTEFGEIHELTTEEQFVEGVY --------------------------1111----------1111------1111------ RVEFDTSSYWKGLGLSPFHEYADVVFTANDSGHRHYTIAALLSPFSYSTTAVVS -----11113333--1111-----------------------1111-------- >RIBONUCLEASE H; SWP:P13319; PDB:1TFR; KEGICLIDFSQIALSTALVNFPDKEKINLSMVRHLILNSIKFNVKKAKTLGYTKIVLCID --------3333-----------------------------------1111--------- NAKSGYWRRDFAYYYKKTWDWEGYFESSHKVIDELKAYMPYIVMDIDKYEADDHIAVLVK -11113333--1111------------------------------22223333------- KFSLEGHKILIISSDGDFTQLHKYPNVKQWSPMHKKWVKIGSAEIDCMTKILKGDKKDNV --1111--------------3333------------------------------3333-- ASVKVRSDFWFTRVEGERTPSMKTSIVEAIANDREQAKVLLTESEYNRYKENLVLIDFDY -11111111----1111-----3333------3333--------------------1111 IPDNIASNIVNYYNSYKLPPRGKIYSYFVKAGLSKLTNSINEF -3333-------1111---1111---------3333--3333- >TOXIN FS2; SWP:P01414; PDB:1TFS; RICYSHKASLPRATKTCVENTCYKMFIRTHREYISERGCGCPTAMWPYQTECCKGDRCNK --------------------------3333------------------------------ >PHOSPHOPANTETHEINE ADENYL; SWP:Q50452; PDB:1TFUA; MTGAVCPGSFDPVTLGHVDIFERAAAQFDEVVVAILVNPAKTGMFDLDERIAMVKESTTH ---------------------------------------------------------111 
LPNLRVQVGHGLVVDFVRSCGMTAIVKGLRTGTDFEYELQMAQMNKHIAGVDTFFVATAP 1----------------1111-------------------------------------33 RYSFVSSSLAKEVAMLGGDVSELLPEPVNRRLRDRLN 33-----------1111--1111-3333--------- >Tissue factor pathway inh; SWP:P10646; PDB:1TFXC; KPDFCFLEEDPGICRGYITRYFYNNQTKQCERFKYGGCLGNMNNFETLEECKNICEDG -3333----------------------------------------------------- >4-HYDROXYPHENYLPYRUVATE D; SWP:P93836; PDB:1TFZA; NPKSDKFKVKRFHHIEFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDL -----------------------------------------3333-----------!!!! RFLFTAPYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAESAFSI --------333311113333----1111-------------------------------- SVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKEFLPGFERVEDASSFPLDYGIR -1111---------%%%%--------!!!!-----------------!!!!--------- RLDHAVGNVPELGPALTYVAGFTGFHQFASGLNSAVLASNDEMVLLPINEPVHGKSQIQT --------------------------------------1111------------------ YLEHNEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRVGDVLSD -----------------------------1111---------3333-------1111--- DQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVGCMQSGGCGGFGKGNFS -----------------------------------------------2222---3333-3 ELFK 333- >MYOSIN TAIL REGION-INTERA; SWP:P47068; PDB:1TG0A; EPEVPFKVVAQFPYKSDYEDDLNFEKDQEIIVTSVEDAEWYFGEYQDSNGDVIEGIFPKS -----------------1111---2222------------------1111-------333 FVAVQG 3----- >Putative ATP-dependent Cl; SWP:Q16740; PDB:1TG6A; PLIPIVVYDIYSRLLRERIVCVMGPIDDSVASLVIAQLLFLQSESNKKPIHMYINSPGGV ------------------------------------------------------------ VTAGLAIYDTMQYILNPICTWCVGQAASMGSLLLAAGTPGMRHSLPNSRIMIHQPSGGAR ---------------------------------11112222---1111------------ GQATDIAIQAEEIMKLKKQLYNIYAKHTKQSLQVIESAMERDRYMSPMEAQEFGILDKVL ------------------------------------------------------------ VHPP ---- >BETA-GALACTOSIDASE; SWP:Q700S9; PDB:1TG7A; LLQKYVTWDEHSIFVNGERLMIFSGEVHPYRLPVASLYIDIFEKVKALGFNCVSFYVDWA --------3333--iiii---------1111--3333--------1111--------333 
LLEGNPGHYSAEGIFDLQPFFDAAKEAGIYLLARPGPYINAEVSGGGFPGWLQRVDGILR 3---2222---!!!!------------------------%%%%-iiii---1111----- TSDEAYLKATDNYASNIAATIAKAQITNGGPIILYQPENEYSGACCGYNGFPDGSYMQYI ------------------------3333----------------iiii----3333---- EDHARDAGIVVPFISNDAWAAGHNAPGTGAGAVDIYGHDSYPLGFDCANPSTWPSGNLPT ----1111-------------2222---2222--------1111----1111-2222--- YFHTSHEQQSPSTPYSLVEFQGGAFDPWGGVGFAKCAALLNHEFERVFYKNDFSFGVAFL ---------1111-------------2222----------------------1111---- NLYMIFGGTNWGNLGHPGGYTSYDYGSAISESRNITREKYSELKLLGNFAKVSPGYLVAN ----------%%%%-1111----------1111---3333------------3333---- PGDLSTSTYTNTADLTVTPLLGSNSSASSFFVIRHSDYSSQASVEYKLTVPTSAGNLTIP -----------1111--------1111---------1111-----------1111----- QLGGSLTLSGRDSKIHVTDYDVAGTNILYSTAEVFTWKKFNNEKVLVLYGGPGEHHEFAV ---------------------iiii--------------!!!!-------2222------ SGASSSSVVEGSSSGISSKKVGKALVVAWDVSTARRIVQVGSLKVFLLDRNSAYNYWVPQ ------------2222-----------------------!!!!-----33331111---- VPTKGTAPGYSNQETTASSIIVKAGYLVRSAYLDGNDLHIQADFNATTPIEVVGAPSGAK ---------------1111--------------!!!!------------------1111- NLVINGKKTQTKVDKNGIWSASVAYTAPKVQLPSLKSLKWKSVDTLPEAKNTYDDSAWTS ---iiii------1111----------------3333------------1111-1111-- ADHAYTNNSAHSLQTPTSLFASDYGYHTGALLFRGHFTANGKEKTFFVQTKGGTAYGHSI -------1111--------3333-----------------------------2222---- WINETYVGSWAGTSINDNNNATYTLPTLQSGKNYVITVVIDNMGLDEDWTIGSEDMKNPR -!!!!-----------------------2222-----------------22223333--- GIIQYSLSGQEASAISWKLTGNLGGENYRDTVRGPLNEGGLYAERQGFHQPQPPTQKWDS ------22221111-------2222----3333-------------1111----1111-- SSPFTGLTKPGIRFYSTSFDLDLPSGYDIPLYFNFGNSTSTPAAYRVQLYVNGYQYGKYV -1111---------------------------------------------iiii-----1 NNIGPQTSFPVPEGILNYHGTNWLALSLWAQEDNGAKLDSFELINTTPVLTSLGEVKSVN 111----------------------------1111------------------------- QPKYQARKGAY ------2222- >TRANSFORMING GROWTH FACTO; SWP:P17125; PDB:1TGJ; ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHST 
-----------------------------3333---------------2222-------- VLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMVVKSCKCS -----11113333--------------------------------------- >THERMOSTABLE B DNA POLYME; SWP:P56689; PDB:1TGOA; MILDTDYITEDGKPVIRIFKKENGEFKIDYDRNFEPYIYALLKDDSAIEDVKKITAERHG ---------iiii--------%%%%------------------------3333----iii TTVRVVRAEKVKKKFLGRPIEVWKLYFTHPQDVPAIRDKIKEHPAVVDIYEYDIPFAKRY i-------------%%%%----------11113333--33331111--------3333-- LIDKGLIPMEGDEELKMLAFDIETLYHEGEEFAEGPILMISYADEEGARVITWKNIDLPY ---------------------------------------------------------111 VDVVSTEKEMIKRFLKVVKEKDPDVLITYNGDNFDFAYLKKRSEKLGVKFILGREGSEPK 1----------------------------3333----------1111-----3333---- IQRMGDRFAVEVKGRIHFDLYPVIRRTINLPTYTLEAVYEAIFGQPKEKVYAEEIAQAWE -----------2222--------------------------------------------- TGEGLERVARYSMEDAKVTYELGKEFFPMEAQLSRLVGQSLWDVSRSSTGNLVEWFLLRK -------------------------------------------1111------------- AYERNELAPNKPDERELARRRESYAGGYVKEPERGLWENIVYLDFRSLYPSIIITHNVSP ------------33333333--------------------------------------11 DTLNREGCEEYDVAPQVGHKFCKDFPGFIPSLLGDLLEERQKVKKKMKATIDPIEKKLLD 11--2222---------------------------------------------------- YRQRAIKILANSFYGYYGYAKARWYCKECAESVTAWGRQYIETTIREIEEKFGFKVLYAD -------------3333-1111-------------------------------------- TDGFFATIPGADAETVKKKAKEFLDYINAKLPGLLELEYEGFYKRGFFVTKKKYAVIDEE -------2222---------------3333-!!!!--------------2222------- DKITTRGLEIVRRDWSEIAKETQARVLEAILKHGDVEEAVRIVKEVTEKLSKYEVPPEKL -------------------------------------------------------1111- VIYEQITRDLKDYKATGPHVAVAKRLAARGIKIRPGTVISYIVLKGSGRIGDRAIPFDEF --------3333--------------1111------------------2222---3333- DPAKHKYDAEYYIENQVLPAVERILRAFGYRKEDLRYQKTRQVGLGAWLKPKT 3333---------------------1111-3333------------------- >INSULIN-LIKE GROWTH FACTO; SWP:P01343; PDB:1TGRA; GPETLCGAELVDALQFVCGDRGFYFNKPKAKGIVDECCFRSCDLRRLEMYCA 1111-------------!!!!------------------------3333--- >Pancreatic secretory tryp; SWP:P00998; PDB:1TGSI; 
TSPQREATCTSEVSGCPKIYNPVCGTDGITYSNECVLCSENKKRQTPVLIQKSGPC ------------------------1111---------------------------- >GAMMA-CARDIOTOXIN; SWP:P01468; PDB:1TGXA; LKCNQLIPPFWKTCPKGKNLCYKMTMRAAPMVPVKRGCIDVCPKSSLLIKYMCCNTDKCN --------------2222-------2222----------------1111----------- >SENTRIN-SPECIFIC PROTEASE; SWP:Q9HC62; PDB:1TH0A; DLLELTEDMEKEISNALGHGPQDEILSSAFKLRITRGDIQTLKNYHWLNDEVINFYMNLL --------------------1111----%%%%----------2222-------------- VERNKKQGYPALHVFSTFFYPKLKSGGYQAVKRWTKGVNLFEQEIILVPIHRKVHWSLVV ---------------1111-------3333111122221111---------!!!!----- IDLRKKCLKYLDSMGQKGHRICEILLQYLQDESKTKRNSDLNLLEWTHHSMKPHEIPQQL --1111-----1111--------------------------3333------1111----- NGSDCGMFTCKYADYISRDKPITFTQHQMPLFRKKMVWEILHQQLL ---------------1111-----3333------------------ >Adenomatous polyposis col; SWP:P25054; PDB:1TH1C; KQAAVNAAVQRVQVLPDADLLHFATESTSLALLDEPFIQKDVELRIMPPVQ -3333-----------------------------------3333------- >NIFU1; SWP:Q84LK7; PDB:1TH5A; MLELNEENVEKVLNEIRPYLAGTGGGGLQFLMIKGPIVKVRLTGPAAVVRTVRIAVSKKL ----3333-------------------------!!!!-------------3333------ REKIPSIQIVQLLS ---3333------- >SMALL NUCLEAR RIBOPROTEIN; SWP:NA; PDB:1TH7A; GAMNFLAETAHKVLAESLNNLVLVKLKGNKEVRGMLRSYDQHMNLVLSDSEEIQSDGSGK ---1111---------2222------%%%%---------1111----------3333--- KLGTIVIRGDNVILISPL -------3333------- >Anti-sigma F factor antag; SWP:O32726; PDB:1TH8B; SLAIDLEVKQDVLIVRLSGELDHHTAEELREQVTDVLENRAIRHIVLNLGQLTFMDSSGL --------!!!!---------3333-----------------------1111-------- GVILGRYKQIKNVGGQMVVCAVSPAVKRLFDMSGLFKIIRVEADEQFALQALGVA ----------1111--------3333----11113333-----------1111-- >CATHEPSIN B; SWP:P00787; PDB:1THEA; LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL -----3333-11113333---------3333--------------iiii----------- TCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGARPPCT ---3333-!!!!------------------------------------------------ GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDF 
------------------3333----------------------------------1111 LTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVANSWNADWGDNGFFKILRGENHC ----------------------------iiii---------1111-iiii-------222 GIESEIVAGIPRT 2------------ >Imidazole glycerol phosph; SWP:Q9X0C6; PDB:1THFD; MLAKRIIACLDVKDGRVVKGSNFENLRDSGDPVELGKFYSEIGIDELVFLDITASVEKRK ------------iiii------1111-1111----------------------------- TMLELVEKVAEQIDIPFTVGGGIHDFETASELILRGADKVSINTAAVENPSLITQIAQTF --------1111--------------------1111------------------------ GSQAVVVAIDAKRVDGEFMVFTYSGKKNTGILLRDWVVEVEKRGAGEILLTSIDRDGTKS 3333---------iiii-----%%%%-------------------------1111----- GYDTEMIRFVRPLTTLPIIASGGAGKMEHFLEAFLAGADAALAASVFHFREIDVRELKEY ---------1111------------3333----1111---------1111---------- LKKHGVNVRLEGL -1111----2222 >LIPASE; SWP:P22394; PDB:1THG; APTAVLNGNEVISGVLEGKVDTFKGIPFADPPLNDLRFKHPQPFTGSYQGLKANDFSPAC ----------------!!!!------------!!!!-----------2222--------- MQLDPGNSLTLLDKALGLAKVIPEEFRGPLYDMAKGTVSMNEDCLYLNVFRPAGTKPDAK ----------------3333--3333-----1111----------------22221111- LPVMVWIYGGAFVYGSSAAYPGNSYVKESINMGQPVVFVSINYRTGPFGFLGGDAITAEG ---------1111-!!!!---------------------------3333----------- NTNAGLHDQRKGLEWVSDNIANFGGDPDKVMIFGESAGAMSVAHQLIAYGGDNTYNGKKL --3333------------3333---1111------------------%%%%---iiii-- FHSAILQSGGPLPYHDSSSVGPDISYNRFAQYAGCDTSASANDTLECLRSKSSSVLHDAQ ------------------------------1111-------------1111--------- NSYDLKDLFGLLPQFLGFGPRPDGNIIPDAAYELFRSGRYAKVPYISGNQEDEGTAFAPV -------iiii-1111------------------1111-------------1111-3333 ALNATTTPHVKKWLQYIFYDASEASIDRVLSLYPQTLSVGSPFRTGILNALTPQFKRVAA 1111-------------1111--------------3333-----!!!!---1111----- ILSDMLFQSPRRVMLSATKDVNRWTYLSTHLHNLVPFLGTFHGNELIFQFNVNIGPANSY -----------------1111-------1111--------22223333-----!!!!--- LRYFISFANHHDPNVGTNLLQWDQYTDEGKEMLEIHMTDNVMRTDDYRIEGISNFETDVN -------------------------3333----------------------------111 LYG 1-- >THERMITASE; SWP:P04072; PDB:1THM; 
YTPNDPYFSSRQYGPQKIQAPQAWDIAEGSGAKIAIVDTGVQSNHPDLAGKVVGGWDFVD ----1111------------3333----2222---------1111--2222--------- NDSTPQNGNGHGTHCAGIAAAVTNNSTGIAGTAPKASILAVRVLDNSGSGTWTAVANGIT --------------------------------1111--------1111------------ YAADQGAKVISLSLGGTVGNSGLQQAVNYAWNKGSVVVAAAGNAGNTAPNYPAYYSNAIA --1111------------------------1111--------------------1111-- VASTDQNDNKSSFSTYGSWVDVAAPGSSIYSTYPTSTYASLSGTSMATPHVAGVAGLLAS ----1111--1111--1111----------------------3333------------11 QGRSASNIRAAIENTADKISGTGTYWAKGRVNAYKAVQY 11----------1111--2222-------------1111 >CRCA PROTEIN; SWP:P37001; PDB:1THQA; TTFRENIAQTWQQPEHYDLYIPAITWHARFAERPWGGGFGLSRWDEKGNWHGLYAMAFKD 3333----------------------2222--------------1111-----------1 SWNKWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGPV 111------------------3333------------3333---------------!!!! TFQMTYIPGTYNNGNVYFAWMRFQFLE -------3333---------------- >THIOESTERASE; SWP:P05521; PDB:1THTA; QCKTIAHVLRVNNGQELHVWETPPKENVPFKNNTILIASGFARRMDHFAGLAEYLSTNGF -----------------------------------------1111----------1111- HVFRYDSLHHVEFTMTTGKNSLCTVYHWLQTKGTQNIGLIAASLSARVAYEVISDLELSF -------------3333------------1111----------3333----1111----- LITAVGVVNLRDTLEKALGFDYLSLPIDELPNDLDFEGHKLGSEVFVRDCFEHHWDTLDS --------------------3333-1111---------------------1111--3333 TLDKVANTSVPLIAFTANNDDWVKQEEVYDMLAHIRTGHCKLYSLLGSSHDLGENLVVLR ----1111-----------11113333-------1111---------------------- NFYQSVTKAAIAMDGGSLEIDVDFIEPDFEQLTIATVNERRLKAEIENRTPEMA ------------1111---------------------------3333------- >THIOREDOXIN; SWP:P20857; PDB:1THX; SKGVITITDAEFESEVLKAEQPVLVYFWASWCGPCQLMSPLINLAANTYSDRLKVVKLEI -------3333----1111---------11113333------------1111-------1 DPNPTTVKKYKVEGVPALRLVKGEQILDSTEGVISKDKLLSFLDTHLN 111----1111----------!!!!----------------------- >THIOREDOXIN H; SWP:Q8S3L3; PDB:1TI3A; AEEGQVIACHTVDTWKEHFEKGKGSQKLIVVDFTASWCPPCKMIAPIFAELAKKFPNVTF ----------3333---------------------------------------------- 
LKVDVDELKAVAEEWNVEAMPTFIFLKDGKLVDKTVGADKDGLPTLVAKHATA ----3333--------------------------------------------- >PLANT DEFENSIN; SWP:Q6T418; PDB:1TI5A; RTCMIKKEGWGKCLIDTTCAHSCKNRGYIGGNCKGMTRTCYCLVNC ------1111------3333---3333------------------- >Pyrogallol hydroxytransfe; SWP:P80564; PDB:1TI6B; MEQYYMVIDVAKCQDCNNCFMGCMDEHELNEWPGYTASMQRGHRWMNIERRERGTYPRND --------3333------3333---------2222----2222----------------- INYRPTPCMHCENAPCVAKGNGAVYQREDGIVLIDPEKAKGKKELLDTCPYGVMYWNEEE -------------------iiii---1111------1111----11111111-----111 NVAQKCTMCAHLLDDESWAPKMPRCAHNCGSFVYEFLKTTPEAMAKKVEEEGLEVIKPEL 1-----%%%%-1111--3333-----------------------------------3333 GTKPRVYYKNLYRFEKNYVTAGILVQGDCFEGAKVVLKSGGKEVASAETNFFGEFKFDAL ----------3333----------iiii----------iiii-------1111------- DNGEYTVEIDADGKSYSDTVVIDDKSVDLGFIKL ----------iiii-------------------- >HEMAGGLUTININ; SWP:Q6GYW3; PDB:1TI8A; ICLGHHAVSNGTKVNTLTERGVEVVNATETVERTNVPRICSKGKRTVDLGQCGLLGTITG ---------------1111---------------------2222----!!!!3333---- PPQCDQFLEFSADLIIERREGSDVCYPGKFVNEEALRQILRESGGIDKETMGFTYSGIRT 3333-------------1111--------------------------------------- NGATSACRRSGSSFYAEMKWLLSNTDN ---3333-------------------- >Hemagglutinin; SWP:Q4ZJH4; PDB:1TI8B; GLFGAIAGFIENGWEGLIDGWYGFRHQNAQGEGTAADYKSTQSAIDQITGKLNRLIEKTN 1111---------1111------------------------------------------- QQFELIDNEFTEVEKQIGNVINWTRDSMTEVWSYNAELLVAMENQHTIDLTDSEMNKLYE ------------------------------------------------------------ RVKRQLRENAEEDGTGCFEIFHKCDDDCMASIRNNTYDHSRYREEA ------------------------3333----------3333---- >LIPASE; SWP:O59952; PDB:1TIB; EVSQDLFNQFNLFAQYSAAAYCGKNNDAPAGTNITCTGNACPEVEKADATFLYSFEDSGV ---------------------3333---2222----%%%%----1111------------ GDVTGFLALDNTNKLIVLSFRGSRSIENWIGNLNFDLKEINDICSGCRGHDGFTSSWRSV ------------------------33333333-----------2222------------- ADTLRQKVEDAVREHPDYRVVFTGHSLGGALATVAGADLRGNGYDIDVFSYGAPRVGNRA ------------------------------------------------------------ 
FAEFLTVQTGGTLYRITHTNDIVPRLPPREFGYSHSSPEYWIKSGTLVPVTRNDIVKIEG ---3333--------------3333--3333-------------2222--3333------ IDATGGNNQPNIPDIPAHLWYFGLIGTCL --------------3333----------- >ERYTHRINA TRYPSIN INHIBIT; SWP:P09943; PDB:1TIE; VLLDGNGEVVQNGGTYYLLPQVWAQGGGVQLAKTGEETCPLTVVQSPNELSDGKPIRIES ---1111---2222-------3333--------!!!!----------1111--------- RLRSAFIPDDDKVRIGFAYAPKCAPSPWWTVVEGLSVKLSEDESTQFDYPFKFEQVSDQL -------2222---------3333-----------------3333------------111 HSYKLLYCEGKHEKCASIGINRDQKGYRRLVVTEDYPLTVVLKKDE 1--------3333---------1111-------------------- >TRANSLATION INITIATION FA; SWP:P03000; PDB:1TIF; KDFIINEQIRAREVRLIDQNGDQLGIKSKQEALEIAARRNLDLVLVAPNAKPPVCRIMDY ----!!!!---------1111---------------1111-------------------- GKFRFEQQKKEKEARK ------------1111 >TRANSLATION INITIATION FA; SWP:P03000; PDB:1TIG; INVKEVRLSPTIEEHDFNTKLRNARKFLEKGDKVKATIRFKGRAITHKEIGQRVLDRLSE --------1111---------------1111------------1111------------1 ACADIAVVETAPKMDGRNMFLVLAPKND 111-----------!!!!---------- >HEAT LABILE ENTEROTOXIN T; SWP:P43528; PDB:1TIIA; NDYFRADSRTPDEVRRSGGLIPRGQDEAYERGTPININLYDHARGTTGNTRYNDGYVSTT ---------------------1111---------------------------iiii---- TTLRQAHLLGQNMLGGYNEYYIYVVAAAPNLFDVNGVLGRYSPYPSENEYAALGGIPLSQ -------------2222---------------3333-!!!!--3333---------3333 IIGWYRVSFGAIEGGMHRNRDYRRDLFRGLSAAPNEDGYRIAGFPDGFPAWEEVPWREFA -------iiii-------111133332222---33333333---22223333--3333-- PNSCLP ------ ------------------------------------ >TRIOSEPHOSPHATE ISOMERASE; SWP:P00940; PDB:1TIMA; APRKFFVGGNWKMNGKRKSLGELIHTLDGAKLSADTEVVCGAPSIYLDFARQKLDAKIGV ------------------------------------------------------------ AAQNCYKVPKGAFTGEISPAMIKDIGAAWVILGHSERRHVFGESDELIGQKVAHALAEGL -----------------3333----------------------------------1111- GVIACIGEKLDEREAGITEKVVFQETKAIADNVKDWSKVVLAYEPVWAIGTGKTATPQQA --------------------------------------------3333------------ QEVHEKLRGWLKTHVSDAVAVQSRIIYGGSVTGGNCKELASQHDVDGFLVGGASLKPEFV 
--------------------------------------3333------------------ DIINAKH ------- >Protease synthase and spo; SWP:P21340; PDB:1TIQA; SVKKKCSREDLQTLQQLSIETFNDTFKEQNSPENKAYLESAFNTEQLEKELSNSSQFFFI ------3333---------------3333-3333-------------------------- YFDHEIAGYVKVNIDDAQSEEGAESLEIERIYIKNSFQKHGLGKHLLNKAIEIALERNKK -iiii--------!!!!----------------3333-----------------1111-- NIWLGVWEKNENAIAFYKKGFVQTGAHSFYGDEEQTDLIAKTLILE ------1111------------------------------------ >THYMIDYLATE SYNTHASE; SWP:P00471; PDB:1TIS; MKQYQDLIKDIFENGYETDDRTGTGTIALFGSKLRWDLTKGFPAVTTKKLAWKACIAELI ----------3333----------------------3333-------------------- WFLSGSTNVNDLRLIQHDSLIQGKTVWDENYENQAKDLGYHSGELGPIYGKQWRDFGGVD -1111--3333-------------1111-------1111---------3333--2222-- QIIEVIDRIKKLPNDRRQIVSAWNPAELKYMALPPCHMFYQFNVRNGYLDLQWYQRSVDV -1111------------------11111111----------------------------- FLGLPFNIASYATLVHIVAKMCNLIPGDLIFSGGNTHIYMNHVEQCKEILRREPKELCEL --------------------------------------------3333------------ VISGLPYKFRYLSTKEQLKYVLKLRPKDFVLNNYVSHPPIKGKMAV --------11113333------------------------------ >HIV-1 TRANSACTIVATOR PROT; SWP:P12506; PDB:1TIV; MDPVDPNIEPWNHPGSQPKTACNRCHCKKCCYHCQVCFIKKGLGISYGRKKRRQRRRPSQ ------------------------------------------------------------ GGQTHQDPIPKQPSSQPRGDPTGPKE -%%%%--------------------- >BIFUNCTIONAL PUTA PROTEIN; SWP:P09546; PDB:1TIWA; PQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAG ---------1111--3333-----3333-------------------------3333333 MGVALMCLAEALLRIPDKATRDALIRDSGEPLIRKGVDMAMRLMGEQFVTGETIAEALAN 33333-----3333--------------3333------------3333----3333---- ARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGIYEGPGISIKLS -3333--------------------------------------iiii----------333 ALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEESDRLEISLDLLEKLCFEP 3---3333----------3333-------1111--------1111-------------33 ELAGWNGIGFVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLE 33-------------3333----------------------------------------- 
GYPVYTRKVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAGQNYYPGQYEFQ ------3333--------------3333------------------------1111---- CLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIA -22223333------3333--------------1111-------------11113333-- DTSLPLDELVADPVTAVEKLAQQEGQTGLPHPKIPLPRD 11113333----------------------1111-3333 >CALMODULIN-RELATED PROTEI; SWP:Q9SRP5; PDB:1TIZA; SSAKRVFEKFDKNKDGKLSLDEFREVALAFSPYFTQEDIVKFFEEIDVDGNGELNADEFT ------------------------------3333------------1111---------- SCIEKML ------- >SPRED1; SWP:Q66JG9; PDB:1TJ6A; DSYARVRAVVMTRDDSSGGWLQLGGGGLSSVTVSKTLQPGDSGGTEFLVHGERLRDKTVI ---------------------2222-------------!!!!-----------1111--- LECVLRRDLVYNKVTPTFHHWRIGDKKFGLTFQSPADARAFDRGIRRAIEDLSQG -----1111-------------!!!!----------------------1111--- >ARGININOSUCCINATE LYASE; SWP:P11447; PDB:1TJ7A; LWGGRFTQAADQRFKQFNDSLRFDYRLAEQDIVGSVAWSKALVTVGVLTAEEQAQLEEAL --1111-------------3333-----------------3333---------------- NVLLEDVRARPQQILESDAEDIHSWVEGKLIDKVGQLGKKLHTGRSRNDQVATDLKLWCK ---------33333333-----------------3333-1111--3333----------- DTVSELLTANRQLQSALVETAQNNQDAVMPGYTHLQRAQPVTFAHWCLAYVEMLARDESR -----------------------1111-----%%%%------------------------ LQDALKRLDVSPLGCGALAGTAYEIDREQLAGWLGFASATRNSLDSVSDRDHVLELLSAA -----------2222----------------1111------------------------- AIGMVHLSRFAEDLIFFNTGEAGFVELSDRVTSGSSLMPQKKNPDALELIRGKCGRVQGA ---------------------------3333---3333---------------------- LTGMMMTLKGLPLAYNKDMQEDKEGLFDALDTWLDCLHMAALVLDGIQVKRPRCQEAAQQ -------2222----3333---3333---------------------------------% GYANATELADYLVAKGVPFREAHHIVGEAVVEAIRQGKPLEDLPLSELQKFSQVIDEDVY %%%---------1111----------------------3333-----3333333311113 PILSLQSCLDKRAAKGGVSPQQVAQAIAFAQARLG 333-----3333-2222------------------ >PROLYL 4-HYDROXYLASE ALPH; SWP:P13674; PDB:1TJCA; MFLTAEDSFELGKVAYTEADYYHTELWMEQALRQLDEGEISTIDKVSVLDYLSYAVYQQG --------------------------------------------------------1111 DLDKALLLTKKLLELDPEHQRANGNLKYFEYIMAK ---------------1111---------------- 
>Envelope glycoprotein; SWP:Q75760; PDB:1TJGH; RITLKESGPPLVKPTQTLTLTCSFSGFSLSDFGVGVWIRQPPGKALEWLAIIYSDDDKRY ------------2222--------------2222------2222--------1111---- SPSLNTRLTITKDTSKNQVVLVMTRV 33331111-----1111--------- >FAB 2F5 LIGHT CHAIN; SWP:NA; PDB:1TJGL; ALQLTQSPSSLSASVGDRITITCRASQGVTSALAWYRQKPGSPPQLLIYDASSLESGVPS -------------2222---------------------2222------------222233 RFSGSGSGTEFTLTISTLRPEDFATYYCQQLHFYPHTFGGGTRVDVRRTVAAPSVFIFPP 33----------------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT -----------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGE -33331111--------1111--------1111 >DNAK SUPPRESSOR PROTEIN; SWP:P18274; PDB:1TJLA; RKTSSLSILAIAGVEPYQEKPGEEYMNEAQLAHFRRILEAWRNQLRDEVDRTVTHMQDEA ------3333-------------------------------------------------- ANFPDPVDRAAQEEEFSLELRNRDRERKLIKKIEKTLKKVEDEDFGYCESCGVEIGIRRL ----3333---------------------------------------------------- EARPTADLCIDCKTLAEIREKQMAG --1111------------------- >IRON-RICH DPSA-HOMOLOG PR; SWP:Q9HMP7; PDB:1TJOA; STQKNARATAGEVEGSDALRMDADRAEQCVDALNADLANVYVLYHQLKKHHWNVEGAEFR --------2222----1111-----------------------------------1111- DLHLFLGEAAETAEEVADELAERVQALGGVPHASPETLQAEASVDVEDEDVYDIRTSLAN ------------------------------------------------------------ DMAIYGDIIEATREHTELAENLGDHATAHMLREGLIELEDDAHHIEHYLEDDTLVTQGAL -------------------1111-----------------------1111-----3333- >SIMILAR TO SYNAPTOTAGMINI; SWP:P21707; PDB:1TJXA; SGGGGGILEKLGDICFSLRYVPTAGKLTVVILEAKNLKKMDVGGLSDPYVKIHLMQNGKR ---1111--------------1111---------------2222-----------iiii- LKKKKTTIKKNTLNPYYNESFSFEVPFEQIQKVQVVVTVLDYDKIGKNDAIGKVFVGYNS -------------------------3333---------------------------2222 TGAELRHWSDMLANPRRPIAQWHTLQVEEEVDAMLAV -------------2222---------3333------- >SUGAR TRANSPORT PROTEIN; SWP:Q8Z2X8; PDB:1TJYA; GSAERIAFIPKLVGVGFFTSGGNGAQEAGKALGIDVTYDGPTEPSVSGQVQLVNNFVNQG 
--------------------------------------------------------1111 YDAIIVSAVSPDGLCPALKRAMQRGVKILTWDSDTKPECRSYYINQGTPKQLGSMLVEMA -----------1111------1111----------1111--------------------- AHQVDKEKAKVAFFYSSPTVTDQNQWVKEAKAKISQEHPGWEIVTTQFGYNDATKSLQTA ----------------1111-----------------1111-------%%%%-------- EGIIKAYPDLDAIIAPDANALPAAAQAAENLKRNNLAIVGFSTPNVMRPYVQRGTVKEFG ------------------------------------------3333-------------- LWDVVQQGKISVYVANALLKNMPMNVGDSLDIPGIGKVTVSPNSEQGYHYEAKGNGIVLL -----------------1111---2222---------------1111------------- PERVIFNKDNIDKYDF ------33333333-- >COPROPORPHYRINOGEN III OX; SWP:P11353; PDB:1TK1A; RNLPIRQQMEALIRRKQAEITQGLESIDTVKFHAGGGTSMVIQDGTTFEKGGVNVSVVYG ------------------------1111-------------------------------- QLSPAAVSAMKADHKDGVKFFACGLSMVIHPVNPHAPTTHLNYRYFETWNQDGTPQTWWF --------------------------------3333------------------------ GGGADLTPSYLYEEDGQLFHQLHKDALDKHDTALYPRFKKGIGGIFFDDYDERDPQEILK -----------3333---------------33333333---------------------- MVEDCFDAFLPSYLTIVKRRKDMPYTKEEQQWQAIRRGR ---------3333------1111---------------- >CG4244-PB; SWP:Q9Y0H4; PDB:1TK7A; GSPEFHMDALGPLPDGWEKKIQSDNRVYFVNHKNRTTQWEDPRTQGQEVSLINEGPLPPG ---------------------1111-------------------33331111-------- WEIRYTAAGERFFVDHNTRRTTFEDPRP -----3333------------------- >PHOSPHOHEPTOSE ISOMERASE ; SWP:Q9PNE6; PDB:1TK9A; SLINLVEKEWQEHQKIVQASEILKGQIAKVGELLCECLKKGGKILICGNGGSAADAQHFA -------------------------------------1111--------3333------- AELSGRYKKERKALAGIALTTDTSALSAIGNDYGFEFVFSRQVEALGNEKDVLIGISTSG ---------------------------------3333----------1111-----1111 KSPNVLEALKKAKELNLCLGLSGKGGGNKLCDHNLVVPSDDTARIQEHILIIHTLCQIID ------------1111------iiii---------------------------------3 ESF 333 >THREONYL-TRNA SYNTHETASE; SWP:P00955; PDB:1TKEA; MPVITLPDGSQRHYDHAVSPMDVALDIGPGLAKACIAGRVNGELVDACDLIENDAQLSII -----1111------------------------------iiii--1111----------- TAKDEEGLEIIRHSCAHLLGHAIKQLWPHTKMAIGPVIDNGFYYDVDLDRTLTQEDVEAL 
1111----------------------1111------------------------------ EKRMHELAEKNYDVIKKKVSWHEARETFANRGESYKVSILDENIAHDDKPGLYFHEEYVD -------1111-----------------1111------------1111------!!!!-- MCRGPHVPNMRFCHHFKLMKTAGAYWRGDSNNKMLQRIYGTAWA --------3333------------22221111------------ >TITIN; SWP:Q8WZ42; PDB:1TKIA; KELYEKYMIAEDLGRGEFGIVHRCVETSSKKTYMAKFVKVKGTDQVLVKKEISILNIARH --1111---------1111-------1111-----------3333--------------1 RNILHLHESFESMEELVMIFEFISGLDIFERINTSAFELNEREIVSYVHQVCEALQFLHS 111---------------------------1111-------------------------- HNIGHFDIRPENIIYQTRRSSTIKIIEFGQARQLKPGDNFRLLFTAPEYYAPEVHQHDVV --------1111----1111--------------2222-------3333-3333------ STATDMWSLGTLVYVLLSGINPFLAETNQQIIENIMNAEYTFDEEAFKEISIEAMDFVDR --------------------1111------------------33331111-------111 LLVKERKSRMTASEALQHPWLKQKIERVSTKVIRTLKHRRYYHTLIKKDLNMVVSAARIS 1---3333--3333----1111-3333--------------------------3333-11 CGGAIRSQKGVSVAKVKVASI 11-----2222---------- >AMINOPEPTIDASE; SWP:P80561; PDB:1TKJA; APDIPLANVKAHLTQLSTIAANNGGNRAHGRPGYKASVDYVKAKLDAAGYTTTLQQFTSG ----3333--------------iiii-2222--------------1111---------ii GATGYNLIANWPGGDPNKVLMAGAHLDSVSSGAGINDNGSGSAAVLETALAVSRAGYQPD ii------------1111----------1111---------------------------- KHLRFAWWGAEELGLIGSKFYVNNLPSADRSKLAGYLNFDMIGSPNPGYFVYDDDPVIEK ---------3333--------1111----1111--------------------------- TFKNYFAGLNVPTEIETEGDGRSDHAPFKNVGVPVGGLFTGAGYTKSAAQAQKWGGTAGQ ------1111------1111--3333--1111------------------------2222 AFDRCYHSSCDSLSNINDTALDRNSDAAAHAIWTLSS ----2222---1111------------------1111 >SIMILAR TO CHLOROMUCONATE; SWP:O34508; PDB:1TKKA; MKIIRIETSRIAVPLTKPFKTALRTVYTAESVIVRITYDSGAVGWGEAPPTLVITGDSMD -------------------------------------1111---------1111------ SIESAIHHVLKPALLGKSLAGYEAILHDIQHLLTGNMSAKAAVEMALYDGWAQMCGLPLY ---------333322223333-------------------------------1111---- QMLGGYRDTLETDYTVSVNSPEEMAADAENYLKQGFQTLKIKVGKDDIATDIARIQEIRK -------------------------------3333------------------------- 
RVGSAVKLRLDANQGWRPKEAVTAIRKMEDAGLGIELVEQPVHKDDLAGLKKVTDATDTP -----------%%%%-------------1111----------1111-------------- IMADESVFTPRQAFEVLQTRSADLINIKLMKAGGISGAEKINAMAEACGVECMVGSMIET ---3333--------------------3333--------------1111----------- KLGITAAAHFAASKRNITRFDFDAPLMLKTDVFNGGITYSGSTISMPGKPGLGIIGAAL -------------1111-----3333-------------!!!!---------------- >3,4-DIHYDROXY-2-BUTANONE ; SWP:Q5A3V6; PDB:1TKSA; NIFTPIEEALEAYKNGEFLIVMDDEDRENEGDLIMAAELITQEKMAFLVRYSSGYVCVPL ------------1111---------1111------3333--------------------- SEERANQLELPPMLAGTAYTITCDFAEGTTTGISAHDRALTTRSLANPNSKPQDFIKPGH -----1111----------------2222-----------------11111111------ ILPLRAVPGLLKKRRGHTEAAVQLSTLAGLQPAGVICELVRDEDGLMMRLDDCIQFGKKH ------2222---------------1111------------------------------- GIKIININQLVEYISK ---------------- >TACHYLECTIN-2; SWP:Q27084; PDB:1TL2A; GGESMLRGVYQDKFYQGTYPQNKNDNWLARATLIGKGGWSNFKFLFLSPGGELYGVLNDK ---------iiii--------11113333---------1111-----1111-----iiii IYKGTPPTHDNDNWMGRAKKIGNGGWNQFQFLFFDPNGYLYAVSKDKLYKASPPQSDTDN --------11113333--------1111------1111-----iiii--------11113 WIARATEVGSGGWSGFKFLFFHPNGYLYAVHGQQFYKALPPVSNQDNWLARATKIGQGGW 333--------1111------1111---------------------3333--------11 DTFKFLFFSSVGTLFGVQGGKFYEDYPPSYAYDNWLARAKLIGNGGWDDFRFLFF 11------1111-----iiii--------11113333--------1111------ >RNA polymerase sigma fact; SWP:P00579; PDB:1TLHB; DVLAGLTAREAKVLRMRFGIDMNTDYTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSE 3333--3333-------------------------------------------------- VLRSFLDD -------- >HYPOTHETICAL UPF0130 PROT; SWP:Q9UX16; PDB:1TLJA; LVWEELREKALNKIYHDKEIGYLDPDILGFLLAFYRNRNDVYTQSSCSGRITIVDAEMPW -----------------1111--3333---3333-----------------------111 DRKNSTIIFKNHLRITEQDLEDVLSKNQVRRLWLIVQGPIIHIYAKNIETGWDILKIARE 1----------------------------------------------------------- AGFKHSGILATNQKGVLVELRTGIRMVHLLRESNTERVDKDKIKTLVNVCNEVLARGKQK --------------------------------3333--1111------------------ MNLLKDLLS -----3333 >HYPOTHETICAL PROTEIN YPJQ; SWP:P54173; 
PDB:1TLQA; YTNEVDITKDLNKRGVIEDIARIVQKLQEKYNPNLPLSVCENVEKVLNKREIIHAVLTGL -----------1111-3333--------3333---3333--------------------- ALDQLAEQKLLPEPLQHLVETDEPLYGIDEIIPLSIVNVYGSIGLTNFGYLDKEKIGIIK -----1111-------------111133331111-3333----------3333------- ELDESPDGIHTFLDDIVAALAAAAASRIAHTHQDLQ ---------3333----------------------- >PUTATIVE OXIDOREDUCTASE (; SWP:P75931; PDB:1TLTA; KLRIGVVGLGGIAQKAWLPVLAAASDWTLQGAWSPTRAKALPICESWRIPYADSLSSLAA ----------3333--3333------------------------1111-----3333--- SCDAVFVHSSTASHFDVVSTLLNAGVHVCVDKPLAENLRDAERLVELAARKKLTLMVGFN ---------3333--------1111-----------------------3333------33 RRFAPLYGELKTQLATAASLRMDKHRSNSVGPHDLYFTLLDDYLHVVDTALWLSGGKASL 33-------11111111------------------------------------------- DGGTLLTNDAGEMLFAEHHFSAGPLQITTCMHRRAGSQRETVQAVTDGALIDITDMREWR -------1111----------!!!!----------------------------%%%%--- EERGQGVVHKPIPGWQSTLEQRGFVGCARHFIECVQNQTVPQTAGEQAVLAQRIVDKIWR ------------11113333-----------------------!!!!------------- DAMS ---- >PROLINE RACEMASE; SWP:Q8YFD6; PDB:1TM0A; RSTKVIHIVGCHAEGEVGDVIVGGVAPPPGETVWEQSRFIANDETLRNFVLNKPRGGVFR ------------------------------------------------------------ HVNLLVPPKDPRAQGFIIEPADTPPSGSNSICVSTVLLDSGIIAQEPVTHVLEAPGGIIE -----------------------------------------------------3333--- VEAECRNGKAERISVRNVPSFADRLDAPLDVTGLGTIVDTAYGGDSFVIVDAAQIGEPGQ -----%%%%-----------------------------------------3333--3333 ARELAEIGVKITKAFRHPERDWRHISFCQITEPVTREGDVLTGVNTVPTGTGCSARAVLH -------3333--------------------------------------3333----111 AKGQKAGERFIGKSVTEFHCRLDKVLELGGKPAISPIISGRAWVTGTSQLLDPSDPFPHG 1--------------------------iiii--------------------1111-1111 Y - >HYPOTHETICAL PROTEIN MG35; SWP:P47596; PDB:1TM9A; MEQNNIKEQLISFFNQACSTHQERLDFICSTRESDTFSSVDVPLEPIKNIIEITKDENQQ -------------------------------------------3333------------- IEITKIAVNNIKTLSSVGATGQYMASFFSTNSEPAIIFCVIYFLYHFGFLKDNNKKQIIK ---------3333---------3333------------------1111----------33 KAYETIADNIADYLNEN 33-------1111---- >Genome 
polyprotein; SWP:P13899; PDB:1TME1; GSDNAEKGKVSNDDASVDFVAEPVKLPENQTRVAFFYDRAVPIGMLRPGQNIESTFVYQE ---3333------3333------------------------------------------- NDLRLNCLLLTPLPSFCPDSTSGPVKTKAPVQWRWVRSGGTTNFPLMTKQDYAFLCFSPF iiii---------------1111-1111-------------------------------- TYYKCDLEVTVSALGTDTVASVLRWAPTGAPADVTDQLIGYTPSLGETRNPHMWLVGAGN --------------------------2222------------3333----------2222 TQISFVVPYNSPLSVLPAAWFNGWSDFGNTKDFGVAPNADFGRLWIQGNTSASVRIRYKK -------------------------1111--22222222--------------------- MKVFCPRPTLFFPWPV ---------------- >Genome polyprotein; SWP:P13899; PDB:1TME2; DRVASDKAGNSATNTQSTVGRLCGYGEAHHGEHPASCADTATDKVLAAERYYTIDLASWT -------!!!!------------%%%%------1111-------3333-----------3 TTQEAFSHIRIPLPHVLAGEDGGVFGATLRRHYLCKTGWRVQVQCNASQFHAGSLLVFMA 3332222-------11113333------1111---------------1111--------- PEFYTGKGTKTGDMEPTDPFTMDTTWRAPQGAPTGYRYDSRTGFFAMNHQNQWQWTVYPH -----------------1111------------------------1111-33333333-- QILNLRTNTTVDLEVPYVNIAPTSSWTQHANWTLVVAVFSPLQYASGSSSDVQITASIQP ---1111-----------------3333-------------------------------- VNPVFNGLRHETVIA --------------- >Polyprotein [Fragment]; SWP:Q88496; PDB:1TME3; SPIAVTVREHKGCFYSTNPDTTVPIYGKTISTPNDYMCGEFSDLLELCKLPTFLGNPNSN -------1111---1111---------------3333-----33331111---------- NKRYPYFSATNSVPTTSLVDYQVALSCSCMCNSMLAAVARNFNQYRGSLNFLFVFTGAAM --------------------------3333-------3333----------------111 VKGKFLIAYTPPGAGKPTTRDQAMQATYAIWDLGLNSSFVFTAPFISPTHYRQTSYTSAA 1-----------------33333333---------------------------------- SVDGWVTVWQLTPLTYPSGTPVNSDILTLVSAGDDFTLRMPISPTKWVPQ --------------------------------3333-------------- >Polyprotein [Fragment]; SWP:Q88487; PDB:1TME4; SGNEGVIINNFYSNQYQNSIDLSAS ------------3333--------- >TRIMETHYLAMINE N-OXIDE RE; SWP:O87948; PDB:1TMO; NEDEWLTTGSHFGAFKMKRKNGVIAEVKPFDLDKYPTDMINGIRGMVYNPSRVRYPMVRL ---------1111------iiii------1111---3333-3333---1111-------- DFLLKGHKSNTHQRGDFRFVRVTWDKALTLFKHSLDEVQTQYGPSGLHAGQTGWRATGQL 
----!!!!-3333-----------------------------1111-------------- HSSTSHMQRAVGMHGNYVKKIGDYSTGAGQTILPYVLGSTEVYAQGTSWPLILEHSDTIV -------------------------3333--3333-----1111---------------- LWSNDPYKNLQVGWNAETHESFAYLAQLKEKVKQGKIRVISIDPVVTKTQAYLGCEQLYV ----3333----------3333-------------------------------------- NPQTDVTLMLAIAHEMISKKLYDDKFIQGYSLGFEEFVPYVMGTKDGVAKTPEWAAPICG 2222----------------------------3333--------------3333------ VEAHVIRDLAKTLVKGRTQFMMGWCIQRQQHGEQPYWMAAVLATMIGQIGLPGGGISYGH -3333------------------3333-2222---------------2222--------- HYSSIGVPSSGAAAPGAFPRNLDENQKPLFDSSDFKGASSTIPVARWIDAILEPGKTIDA -%%%%-----------------1111--------iiii----1111------2222---i NGSKVVYPDIKMMIFSGNNPWNHHQDRNRMKQAFHKLECVVTVDVNWTATCRFSDIVLPA iii-----------------------------3333-----------3333--------- CTTYERNDIDVYGAYANRGILAMQKMVEPLFDSLSDFEIFTRFAAVLGKEKEYTRNMGEM -1111------------------------!!!!----------------3333%%%%--- EWLETLYNECKAANAGKFEMPDFATFWKQGYVHFGDGEVWTRHADFRNDPEINPLGTPSG --------------------------------------------------------1111 LIEIFSRKIDQFGYDDCKGHPTWMEKTERSHGGPGSDKHPIWLQSCHPDKRLHSQMCESR ------3333-------------------iiii-3333-------------!!!!11113 EYRETYAVNGREPVYISPVDAKARGIKDGDIVRVFNDRGQLLAGAVVSDNFPKGIVRIHE 333----iiii----------1111-2222-----------------11112222----- GAWYGPVGKDGSTEGGAEVGALCSYGDPNTLTLDIGTSKLAQACSAYTCLVEFEKYQGKV -------1111-3333-2222-----1111--------------1111------------ PKVSSFDGPIEVEI -------------- >HYDROXYQUINOL 1,2-DIOXYGE; SWP:Q5PXQ6; PDB:1TMXA; STPVSAEQQAREQDLVERVLRSFDATADPRLKQVMQALTRHLHAFLREVRLTEAEWETGI --------------------1111------------------------------------ GFLTDAGHVTNERRQEFILLSDVLGASMQTIAMNNEAHGDATEATVFGPFFVEGSPRIES -------------------------------------!!!!-----------------22 GGDIAGGAAGEPCWVEGTVTDTDGNPVPDARIEVWEADDDGFYDVQYDDDRTAARAHLLS 22--iiii------------1111-------------1111-3333-------------- GPDGGYAFWAITPTPYPIPHDGPVGRMLAATGRSPMRASHLHFMVTAPGRRTLVTHIFVE 1111------------------------1111--------------2222--------22 
GDELLDRDSVFGVKDSLVKSFERQPAGAPTPGGREIDGPWSRVRFDIVLAPA 22-11111111--1111-------2222-2222------------------- >CHEY PROTEIN; SWP:Q56312; PDB:1TMY; GKRVLIVDDAAFMRMMLKDIITKAGYEVAGEATNGREAVEKYKELKPDIVTMDITMPEMN ---------------------1111------------------------------3333- GIDAIKEIMKIDPNAKIIVCSAMGQQAMVIEAIKAGAKDFIVKPFQPSRVVEALNKVS -----------1111------2222--------------------3333--------- >TMZIP; SWP:P04692; PDB:1TMZA; MDAIKKKMQMLKLDNYHLENEVARLKKLVGER 1111---------------------------- >TETRANECTIN; SWP:P05452; PDB:1TN3; ALQTVCLKGTKVHMKCFLAFTQTKTFHEASEDCISRGGTLSTPQTGSENDALYEYLRQSV ---------------------------------1111----------------------- GNEAEIWLGLNDMAAEGTWVDMTGARIAYKNWETEITAQPDGGKTENCAVLSGAANGKWF 1111-------3333-----1111-----------------!!!!---------%%%%-- DKRCRDQLPYICQFGIV --1111----------- >TUMOR NECROSIS FACTOR REC; SWP:P01374; PDB:1TNRA; KPAAHLIGDPSKQNSLLWRANTDRAFLQDGFSLSNNSLLVPTSGIYFVYSQVVFSGKAYS --------1111---------!!!!----------------------------------3 PKATSSPLYLAHEVQLFSSQYPFHVPLLSSQKMVYPGLQEPWLHSMYHGAAFQLTQGDQL 333--------------3333---------------2222--------------2222-- STHTDGIPHLVLSPSTVFFGAFAL -----3333---1111-------- >MU-TRANSPOSASE; SWP:P07636; PDB:1TNS; MELWVSPKELANLPGLPKTSAGVIYVAKKQGWQNRTRAGVKGGKAIEYNANSLPVEAKAA -----3333---------3333---3333-------------------3333--3333-- LLLRQGEIETSLGYFE -----------1111- >HYPOTHETICAL UPF0247 PROT; SWP:Q45601; PDB:1TO0A; NINIVTIGKLKEKYLKQGIEEYTKRLSAYAKIDIIELPDEKKIIKDKEGDRILSKISPDA -------------------------1111-----------------------11111111 HVIALAIEGKKTSEELADTIDKLATYGKSKVTFVIGGSLGLSDTVKRADEKLSFSKTFPH -----1111-----------3333------------3333-3333-------------33 QLRLILVEQIYRAFRINRGEPY 33-------------1111--- >Subtilisin BPN' [Precurso; SWP:P00782; PDB:1TO2E; AQSVPYGVSQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLKVAGGASMVPSETNPFQD ----3333--------3333--2222---------1111-----------1111-1111- NNSHGTHVAGTVAALNNSIGVLGVAPSASLYAVKVLGADGSGQYSWIINGIEWAIANNMD ------------------------1111--------1111--3333--------1111-- 
VINMSLGGPSGSAALKAAVDKAVASGVVVVAAAGNEGTSGSSSTVGYPGKYPSVIAVGAV ----------------------1111------------!!!!-----3333--------- DSSNQRASFSSVGPELDVMAPGVSIQSTLPGNKYGAYNGTSMASPHVAGAAALILSKHPN 1111--1111--1111----------------------3333---------------111 WTNTQVRSSLENTTTKLGDSFYYGKGLINVQAAAQHHHHHH 1-----------------3333!!!!--3333--------- >Chymotrypsin inhibitor 2; SWP:Q40059; PDB:1TO2I; MKTEWPELVGKSVEEAKKVILQDKPAAQIIVLPVGTIVTKEYRIDRVRLFVDRLDNIAQV ----3333---------------1111---------------1111-----1111----- PRVG ---- >PUTATIVE ALDOLASE YIHT; SWP:Q9L7R9; PDB:1TO3A; LNNYTIKDITRASGGFALAVDQREARLFAAAGAKTPVADSVLTDFKVNAAKILSPYASAV -----1111-1111-------------3333------3333-----------3333---- LLDQQFCYRQAVEQNAVAKSCAIVAADDFIPGNGIPVDNVVLDKKINAQAVKRDGAKALK --3333-----1111--3333----------iiii----------------1111----- LLVLWRSDEDAQQRLNVKEFNELCHSNGLLSIIEPVVRPPRCGDKFDREQAIIDAAKELG -----11113333----------3333--------------------------------- DSGADLYKVEPLYGKGARSDLLTASQRLNGHINPWVILSSGVDEKLFPRAVRVAEAGASG ---------------------------3333-------22223333-------------- FLAGRAVWSSVIGLPDTELLRDVSAPKLQRLGEIVDEGKR ---33333333----3333--------------------- >SUPEROXIDE DISMUTASE; SWP:NA; PDB:1TO4A; MKAVCVMTGTAGVKGVVKFTQETDNGPVHVHAEFSGLKAGKHGFHVHEFGDTTNGCTSAG -----------------------------------------------------!!!!--- AHFNPTKQEHGAPEDSIRHVGDLGNVVAGADGNAVYNATDKLISLNGSHSIIGRSMVIHE ---1111----1111---1111------1111--------------11112222------ NEDDLGRGGHELSKVTGNAGGRLACGVVGLAAE ---iiii--1111-------------------- >GLYCERATE KINASE; SWP:P57098; PDB:1TO6A; MKIVIAPDSFKESLTAQQVAEAIKRGFQQSIADVECLLCPVGDGGEGTVDAIRHSLDLEE ---------------------------------------------------1111----- KCLQVTGSFGQKEVMRYFQKEQLALFEVADLVGLGKIPLEKRNPLQIQTRGIGELIRHLI --------------------------------1111-3333-1111-------------1 SQEIKEIYIGVGGTASNDGGIGIAAGLGYQFYDEDGNALPACGQSLLNLASVSTENRYKI 111-----------------------------1111-----33331111----------- PEDVHIRILADVVSPLCGHQGATYTFGKQKGLDSTMFEVVDQAIQDFYEKVSPATLKLKG 
1111---------------------3333---3333---------------3333--222 AGAGGGIAGGLCAFAQASIVSGIDTCLDLIDFDKKVSDVDLVIVGEGRLDRQSLAGKAPI 2-------------------------------3333------------------------ GVAKRTPVGVPVVAICGSLVEDLPSLPFENIQAAFSILEKSEPLEDSLKNASLYLEHTAS --11112222---------1111----iiii-----------3333-------------- NIGHLLNMPKI ----------- >PERIPLASMIC BINDING PROTE; SWP:P96116; PDB:1TOAA; GKPLVVTTIGMIADAVKNIAQGDVHLKGLMGPGVDPHLYTATAGDVEWLGNADLILYNGL -------------------!!!!-------22223333----------1111------%% HLETKMGEVFSKLRGSRLVVAVSETIPVSQRLSLEEAEFDPHVWFDVKLWSYSVKAVYES %%1111----1111------1111--3333---%%%%----33333333----------- LCKLLPGKTREFTQRYQAYQQQLDKLDAYVRRKAQSLPAERRVLVTAHDAFGYFSRAYGF ----3333-------------------------11113333------------------- EVKGLQGVSTASEASAHDMQELAAFIAQRKLPAIFIESSIPHKNVEALRDAVQARGHVVQ ----------------------------------------------------1111---- IGGELFSDAMGDAGTSEGTYVGMVTHNIDTIVAALAR -----------22221111------------------ >THROMBIN; SWP:NA; PDB:1TOCR; SLNVLCNNPHTADCNNDAQVDRYFREGTTCLMSPACTSEGYASQHECQQACFVGGEDHSS --3333----------------------------------------1111---------- EMHSSCLGDPPTSCAEGTDITYYDSDSKTCKVLAASCPSGENTFESEVECQVACGAPIEG --3333------------------------------------------------------ >TONIN; SWP:P00759; PDB:1TON; IVGGYKCEKNSQPWQVAVINEYLCGGVLIDPSWVITAAHCYSNNYQVLLGRNNLFKDEPF -----------1111--------------1111---3333-----------------111 AQRRLVRQSFRHPDYIPLIPVHDHSNDLMLLHLSEPADITGGVKVIDLPTKEPKVGSTCL 1----------1111--------------------------------------2222--- ASGWGSTNPSEMVVSHDLQCVNIHLLSNEKCIETYKDNVTDVMLCAGEMEGGKDTCAGDS --------------------------33333333---3333------3333----2222- GGPLICDGVLQGITSGGATPCAKPKTPAIYAKLIKFTSWIKKVMKENP ----------------------2222-----3333--------1111- ---------------------------------------------------- >HYPOTHETICAL PROTEIN F53F; SWP:Q20728; PDB:1TOVA; ENESDKLNEEAAKNIMVGNRCEVTVGAQMARRGEVAYVGATKFKEGVWVGVKYDEPVGKN -----------11112222-----!!!!-------------------------------- DGSVAGVRYFDCDPKYGGFVRPVDVKVGDFPELSIDEI 
---iiii-----2222----3333-------------- >NEUROGENIC LOCUS NOTCH HO; SWP:P46531; PDB:1TOZA; QDVDECSLGANPCEHAGKCINTLGSFECQCLQGYTGPRCEIDVNECVSNPCQNDATCLDQ ---1111---3333-----------------------------1111------------- IGEFQCICMPGYEGVHCEVNTDECASSPCLHNGRCLDKINEFQCECPTGFTGHLCQ -------------1111------1111--!!!!----------------------- >PRESYNAPTIC DENSITY PROTE; SWP:P31016; PDB:1TP5A; FLGEEDIPREPRRIVIHRGSTGLGFNIVGGEDGEGIFISFILAGGPADLSGELRKGDQIL -1111-------------------------%%%%-------2222--3333--2222--- SVNGVDLRNASHEQAAIALKNAGQTVTIIAQYKPEEYSRFEANSRVDSSGRIVTD -iiii-1111------------------------------2222--1111----- >HYPOTHETICAL PROTEIN PA13; SWP:Q9I430; PDB:1TP6A; CAYRREIHHAHVAIRDWLAGDSRADALDALMARFAEDFSMVTPHGVVLDKTALGELFRSK ------------------------------11111111---1111--------------- GGTRPGLRIEIDGESLLASGVDGATLAYREIQSDAAGRSERLSTVVLHRDDEGRLYWRHL ---2222------------1111----------3333------------1111------- QETFCG ------ >PEROXIREDOXIN; SWP:Q8S3L0; PDB:1TP9A; MAPIAVGDVLPDGKLAYFDEQDQLQEVSVHSLVAGKKVILFGVPGAFTPTCSLKHVPGFI ----2222----------1111-----3333-2222--------2222------------ EKAGELKSKGVTEILCISVNDPFVMKAWAKSYPENKHVKFLADGSATYTHALGLELDLQE ------1111---------------------1111-------1111---1111-----11 KGLGTRSRRFALLVDDLKVKAANIEGGGEFTVSSAEDILKDL 11-------------------------------33333333- >T-PLASMINOGEN ACTIVATOR F; SWP:P00750; PDB:1TPG; SYQVICRDEKTQMIYQQHQSWLRPVLRSNRVEYCWCNSGRAQCHSVPVKSCSEPRCFNGG --------1111------------------------iiii----------------iiii TCQQALYFSDFVCQCPEGFAGKSCEIDTRAT --------------------1111------- >METHOXY MYCOLIC ACID SYNT; SWP:Q79FX6; PDB:1TPYA; NDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMTLEEAQIAKIDLALGKLGL ------------11113333-----1111--------1111-------------1111-- QPGMTLLDIGCGWGATMRRAIAQYDVNVVGLTLSKNQAAHVQKSFDEMDTPRDRRVLLAG 2222------!!!!---------------------------------------------3 WEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPPDGVLLLHTITGLTRQQMVDHGL 333------------3333-3333------------1111---------------1111- 
PLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKTGFTLTRRQSLQPHYARTLDLWAEA -----------------2222------------------------3333----------- LQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVGYIDVNQFTLAK --------------------------------------------- >SENESCENCE-ASSOCIATED FAM; SWP:Q39129; PDB:1TQ1A; AEESRVPSSVSVTVAHDLLLAGHRYLDVRTPEEFSQGHACGAINVPYMNRGASGMSKNTD -----------------------------33333333-2222--------3333------ FLEQVSSHFGQSDNIIVGCQSGGRSIKATTDLLHAGFTGVKDIVGGYSAWAKNGLPTKA ---3333---------------1111-----------------------3333------ >INTERFERON-INDUCIBLE GTPA; SWP:Q9QZ85; PDB:1TQ4A; NDLPSSFTGYFKKFNTGRKIISQEILNLIELRMRAGNIQLTNSAISDALKEIDSSVLNVA ----------1111-------------------------------------1111----- VTGETGSGKSSFINTLRGIGNEEEGAAKTGVMERHPYKHPNIPNVVFWDLPGIGSTNFPP ---2222------------1111------------------1111------3333----- DTYLEKMKFYEYDFFIIISATRFKKNDIDIAKAISMMKKEFYFVRTKVDSDITNEADGEP -------3333-----------------------1111-----------------22223 QTFDKEKVLQDIRLNCVNTFRENGIAEPPIFLLSNKNVCHYDFPVLMDKLISDLPIYKRH 3333333-------------1111---------1111--!!!!-----------3333-3 NFMVSLPNITDSVIEKKRQFLKQRIWLEGFAADLVNIIPSLTFLLDSDLETLKKSMKFYR 333--------------------------1111----1111------------------- TVFGVDETSLQRLARDWEIEVDQVEAMIKSPAVFKPTDEETIQERLSRYIQEFCLANGYL -------------1111-------11113333---------------------------- LPKNSFLKEIFYLKYYFLDMVTEDAKTLLKEICLRN -2222--3333------------------------- >PROTEIN YHHW; SWP:P46852; PDB:1TQ5A; IYLRKANERGHANHGWLDSWHTFSFANYYDPNFGFSALRVINDDVIEAGQGFGTHPHKDE ----1111----------------!!!!------!!!!--------2222---------- ILTYVLEGTVEHQDSGNKEQVPAGEFQISAGTGIRHSEYNPSSTERLHLYQIWIPEENGI ---------------------2222----!!!!--------------------------- TPRYEQRRFDAVQGKQLVLSPDARDGSLKVHQDELYRWALLKDEQSVHQIAAERRVWIQV -----------------------%%%%-------------2222---------------- VKGNVTINGVKASTSDGLAIWDEQAISIHADSDSEVLLFDLPPVHHH ------iiii--2222------------------------------- >Putative uncharacterized ; SWP:A0A5D9; PDB:1TQBB; QIQLVQSGPELKKPGETVKISCKASGYTFTNYGMNLVKQAPGKGFEWMGWINTFTGEPTY 
------------2222-----------1111--------2222----------------- ADDFKGRFVFSLDTSASTAYLQINNLKNEDTATYFFTRGTDYWGQGTTLTVSSAKTTAPS 3333--------3333----------3333------------------------------ VYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSS ----------------------------------iiii---------------------- VTVTSSTWPSQSITCNVAHPASSTKVDKKIEP -------------------------------- >Putative uncharacterized ; SWP:A0A5D9; PDB:1TQBC; DVVMSQTPLTLSVTIGQPASISCKSSQSLLDSDGKTYLNWLLQRPGQSPKRLIYLVSRLD -------------2222-------------1111---------2222------------2 SGVPDRFTGSGSGTDFTLKISRVEAEDLGIYFCWQGSHFPQTFGGGTKLEIKRADAAPTV 2223333-----------------3333-------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----3333-------------------------iiii--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC --------3333----------1111--------1111- >CHEMOTAXIS PROTEIN CHEA; SWP:Q56310; PDB:1TQGA; GSHMEYLGVFVDETKEYLQNLNDTLLELEKNPEDMELINEAFRALHTLKGMAGTMGFSSM ----------------------------------------------------1111---- AKLCHTLENILDKARNSEIKITSDLLDKIFAGVDMITRMVDKIVS -------------1111----------------------3333-- >CARBOXYLESTERASE PRECURSO; SWP:Q06174; PDB:1TQHA; PPKPFFFEAGERAVLLLHGFTGNSADVRMLGRFLESKGYTCHAPIYKGHGVPPEELVHTG ------------------2222------------1111-------2222--33331111- PDDWWQDVMNGYEFLKNKGYEKIAVAGLSLGGVFSLKLGYTVPIEGIVTMCAPMYIKSEE ---------------1111------------------1111------------------- TMYEGVLEYAREYKKREGKSEEQIEQEMEKFKQTPMKTLKALQELIADVRDHLDLIYAPT -----------------------------------1111------------1111----- FVVQARHDEMINPDSANIIYNEIESPVKQIKWYEQSGHVITLDQEKDQLHEDIYAFLESL --------------------------------------33331111----------1111 DW -- >RIBULOSE-PHOSPHATE 3-EPIM; SWP:P74061; PDB:1TQJA; KNIVVAPSILSADFSRLGEEIKAVDEAGADWIHVDVMDGRFVPNITIGPLIVDAIRPLTK -------3333-3333--------1111-------------------3333---1111-- KTLDVHLMIVEPEKYVEDFAKAGADIISVHVEHNASPHLHRTLCQIRELGKKAGAVLNPS -------------------------------1111-----------1111-------111 
TPLDFLEYVLPVCDLILIMSVNPQSFIPEVLPKIRALRQMCDERGLDPWIEVDGGLKPNN 1333311111111-------------3333-----------1111-----------3333 TWQVLEAGANAIVAGSAVFNAPNYAEAIAGVRNSKRP --------------3333------------1111--- >CYTOCHROME P450 3A4; SWP:P08684; PDB:1TQNA; HSHGLFKKLGIPGPTPLPFLGNILSYHKGFCMFDMECHKKYGKVWGFYDGQQPVLAITDP ------1111---------------1111-------------------!!!!-------- DMIKTVLVKECYSVFTNRRPFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKEMVP -----------------------!!!!---1111---------11111111--------- IIAQYGDVLVRNLRREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENT ----------------------------------------------33331111111133 KKLLRFDFLDPFFLSITVFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQKH 33----------------3333------------3333---------------------- RVDFLQLMIDSQNSSHKALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKL -------------------3333------------------------------------- QEEIDAVLPNKAPPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKG ------------------------------------3333-----------------222 VVVMIPSYALHRDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFALMN 2-----------3333--1111-3333----3333-1111-1111-----1111------ MKLALIRVLQNFSFKPCKETQIPLKLSLGGLLQPEKPVVLKVESRDGT ----------------1111---------------------------- >D-RIBULOSE-5-PHOSPHATE 3-; SWP:Q8I5L3; PDB:1TQXA; LKAIIAPSVLASNISKLAEETQRMESLGAEWIHLDVMDMHFVPNLSFGPPVINNLKKYTK -------3333-1111--------1111-------------------3333----1111- SIFFDVHLMVEYPEKYVPLLKTSNQLTFHFEALNEDTERCIQLAKEIRDNNLWCGISIKP -----------333333331111-----3333%%%%-----------1111-------11 KTDVQKLVPILDTNLINTVLVMTVEPGFGGQSFMHDMMGKVSFLRKKYKNLNIQVDGGLN 113333--3333---------------------1111----------1111--------- IETTEISASHGANIIVAGTSIFNAEDPKYVIDTMRVSVQKY -------1111------3333----------------3333 >BETA-KETOACYL SYNTHASE/AC; SWP:Q02059; PDB:1TQYA; RRVVITGVGVRAPGGNGTRQFWELLTSGRTATRRISFFDPSPYRSQVAAEADFDPVAEGF -----------2222---------1111-----------3333----------3333--- GPRELDRMDRASQFAVACAREAFAASGLDPDTLDPARVGVSLGSAVAAATSLEREYLLLS --------3333-----------3333-3333-1111--------!!!!----------- 
DSGRDWEVDAAWLSRHMFDYLVPSVMPAEVAWAVGAEGPVTMVSTGCTSGLDSVGNAVRA iiii----3333-1111---------------------------!!!!------------ IEEGSADVMFAGAADTPITPIVVACFDAIRATTARNDDPEHASRPFDGTRDGFVLAEGAA -------------------------------------3333--2222------------- MFVLEDYDSALARGARIHAEISGYATRCNAYHMTGLKADGREMAETIRVALDESRTDATD ------------------------------------3333----------------3333 IDYINAHGSGTRQNDRHETAAYKRALGEHARRTPVSSIKSMVGHSLGAIGSLEIAACVLA ----------3333-------------3333-------------!!!!------------ LEHGVVPPTANLRTSDPECDLDYVPLEARERKLRSVLTVGSGFGGFQSAMVLRDAETAGA ---------------3333----------------------2222------------111 A 1 >Actinorhodin polyketide p; SWP:Q02062; PDB:1TQYB; SVLITGVGVVAPNGLGLAPYWSAVLDGRHGLGPVTRFDVSRYPATLAGQIDDFHAPDHIP ----------1111-----------------------3333------------3333--3 GRLLPQTDPSTRLALTAADWALQDAKADPESLTDYDMGVVTANACGGFDFTHREFRKLWS 3331111---------------1111-3333-3333------------------------ EGPKSVSVYESFAWFYAVNTGQISIRHGMRGPSSALVAEQAGGLDALGHARRTIRRGTPL -3333-1111----1111-------------------!!!!------------1111--- VVSGGVDSALDPWGWVSQIASGRISTATDPDRAYLPFDERAAGYVPGEGGAILVLEDSAA ------------------3333------1111--2222---------------------- AEARGRHDAYGELAGCASTFDPAPGSGRPAGLERAIRLALNDAGTGPEDVDVVFADGAGV -3333-----------------2222-------------------3333----------- PELDAAEARAIGRVFGREGVPVTVPKTTTGRLYSGGGPLDVVTALMSLREGVIAPTAGVT 3333-----------2222------1111--!!!!------------------------- SVPREYGIDLVLGEPRSTAPRTALVLARGRWGFNSAAVLRRF --3333------------------------------------ >NECAP1; SWP:Q9CR95; PDB:1TQZA; MAAELEYESVLCVKPDVSVYRIPPRASNRGYRASDWKLDQPDWTGRLRITSKGKIAYIKL -------------------------------3333------------------------- EDKVSGELFAQAPVEQYPGIAVETVTDSSRYFVIRIQDGTGRSAFIGIGFTDRGDAFDFN ------------------------1111-------------------------------- VSLQDHFKWVKQE ------------- >STABLE PROTEIN 1; SWP:Q9AR79; PDB:1TR0A; TRTPKLVKHTLLTRFKDEITREQIDNYINDYTNLLDLIPSMKSFNWGTDLGMESAELNRG ---------------1111-------------3333-3333------------3333iii YTHAFESTFESKSGLQEYLDSAALAAFAEGFLPTLSQRLVIDYFLY 
i-----------------------------3333------------ >CONSERVED PROTEIN (MTH177; SWP:P0C0K9; PDB:1TR8A; DKDLRGVEEVVIKLKRKEIIIKNPKVNVEFGQKTYQVTGKARERSLEAEEIPEDDIELVN -------------1111------------------------------------------- QTGASREDATRALQETGGDLAEAIRL -------------1111--3333--- >BETA-HEXOSAMINIDASE; SWP:Q9KU37; PDB:1TR9A; SLGPLWLDVAGYELSAEDREILQHPTVGGVILFGRNYHDNQQLLALNKAIRQAAKRPILI -----------------------3333-----3333------------------------ GVDQEGGRVQRFREGFSRIPPAQYYARAENGVELAEQGGWLAAELIAHDVDLSFAPVLDG -----!!!!---2222----33331111-----------------1111----------- FACKAIGNRAFGEDVQTVLKHSSAFLRGKAVGATTGKHFPGHGAVIADSHLETPYDERET --3333----------------------1111---------------------------- IAQDAIFRAQIEAGVLDAPAHVVYPHYDAQPASGSSYWLKQVLREELGFKGIVFSDDLSE 3333-------------------3333---3333-------------------------3 GAAVGGPVERSHQALVAGCDILICNKREAAVEVLDNLPIEVPQAEALLKKQQFSYSELKR 333-----------------------------------------1111-----3333--- LERWQQASANQRLIEQFS -3333------------- >THIOREDOXIN REDUCTASE; SWP:P0A9P4; PDB:1TRB; GTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEVENWPGDPNDLT -------------------------------------22221111------2222----- GPLLMERMHEHATKFETEIIFDHINKVDLQNRPFRLNGDNGEYTCDALIIATGASARYLG ------------1111---------------------1111------------------- LPSEEAFKGRGVSACATSDGFFYRNQKVAVIGGGNTAVEEALYLSNIASEVHLIHRRDGF ----1111------3333----2222---------------------------------- RAEKILIKRLMDKVENGNIILHTNRTLEEVTGDQMGVTGVRLRDTQNSDNIESLDVAGLF --------------------------------3333------------------------ VAIGHSPNTAIFEGQLELENGYIKVQSGIHGNATQTSIPGVFAAGDVMDHIYRQAITSAG --------3333------iiii---------1111--2222----1111----------- TGCMAALDAERYLDGL -----------1111- >TRIOSEPHOSPHATE ISOMERASE; SWP:P0A858; PDB:1TREA; MRHPLVMGNWKLNGSRHMVHELVSNLRKELAGVAGCAVAIAPPEMYIDMAKREAEGSHIM -----------------------------1111---------3333-----1111----- LGAQNVNLNLSGAFTGETSAAMLKDIGAQYIIIGHSERRTYHKESDELIAKKFAVLKEQG -------------2222-----------------3333-1111-3333--------1111 
LTPVLCIGETEAENEAGKTEEVCARQIDAVLKTQGAAAFEGAVIAYEPVWAIGTGKSATP --------------------------------------2222-----3333--------- AQAQAVHKFIRDHIAKVDANIAEQVIIQYGGSVNASNAAELFAQPDIDGALVGGASLKAD -------------3333-3333-----------3333--33331111-----3333---- AFAVIVKAAEAAKQA --------------- >TRYPSIN; SWP:P07477; PDB:1TRNA; IVGGYNCEENSVPYQVSLNSGYHFCGGSLINEQWVVSAGHCYKSRIQVRLGEHNIEVLEG -----------1111----------------------1111------------------- NEQFINAAKIIRHPQYDRKTLNNDIMLIKLSSRAVINARVSTISLPTAPPATGTKCLISG ------------1111-1111---------------1111-------------------- WGNTASSGADPDELQCLDAPVLSQAKCEASYPGKITSNMFCVGFLEGGKDSCQGDSGGPV ------------------------------2222-1111------------2222----- VCNGQLQGVVSWGDGCAQKNKPGVYTKVYNYVKWIKNTIAANS -iiii---------------------3333------------- >RIBONUCLEASE P PROTEIN CO; SWP:O28362; PDB:1TS9A; LQGVELIARDWIGLVEVVESPNHSEVGIKGEVVDETQNTLKITEKGLKVVAKRGRTFRVW 33333333--2222-------3333----------1111---1111-----2222----- YKGKIRIKGDLINFRPEDRIKRGLLKRAKGVWI iiii---3333---3333-------1111---- >CONSERVED HYPOTHETICAL PR; SWP:Q8NX24; PDB:1TSJA; MDIPKITTFLFNNQAEEAVKLYTSLFEDSEIITAKYGDPGTVQHSIFTLNGQVFAIDPIS -------------------------------------2222-------iiii-------- LFVTVKDTIEERLFNGLKDEGAILPKTNPPYREFAWVQDKFGVSFQLALPE ------3333----------------------------1111--------- >TS KAPA; SWP:P56219; PDB:1TSK; VVIGQRCYRSPDCYSACKKLVGKATGKCTNGRCDC ----------------------------%%%%--- >THERMONUCLEASE; SWP:P00644; PDB:1TT2A; KLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPEFNEKYGPEASAFTKKMVENAKK ---------------------iiii---------------2222----------1111-- IEVEFDKGQRTDKYGRGLAYKYADGKMVNEALVRQGLAKVAYVYKGNNTHEQLLRKAEAQ -----------1111-------iiii---------------------1111--------- AKKEKLNIWS -1111-1111 >PUTATIVE CYTOPLASMIC PROT; SWP:Q8ZR41; PDB:1TT4A; DFHVSEPYTLGIELEMQVINPPGYDLSQDSSTLIDAVKPQLTAGEIKHDITESMLEMATG -----2222-----------------------33333333-------------------- VCRDIDQAAAQLSAMQHVILQAASEHHLGICGGGTHPFQKWRTLENFGYLIQQATVFGQH -----------------------1111--------------3333-!!!!---------- 
VHVGCANGDDAIYLLHGLSHFVPHFIALSAASPYMQGADTRFACARLNIFSAFPDNGPMP ----------------3333--------------iiii---------1111-1111---- WVSNWQEFAGLFRRLSYTTMIDSIKDLHWDIRPSPAFGTVEVRVMDTPLTLDHAINMAGL -------------1111-----3333---------------------------------- IQATAHWLLTERPFKPQEQDYLLYKFNRFQACRYGLEGVLTDAYTGDRRRLADDTLRLLD ----------------1111--------------3333---------------------- NVTPSARKLGADSAIDALRLQVKKGGNEAQYMREFIADGGSLIGLVQKHCEIWA -----------------------------------1111-3333---------- >YHFP; SWP:O07615; PDB:1TT7A; STLFQALQAEKNADDVSVHVKTISTEDLPKDGVLIKVAYSGINYKDGLAGKAGGNIVREY -----------%%%%--------3333---------------33333333---------- PLILGIDAAGTVVSSNDPRFAEGDEVIATSYELGVSRDGGLSEYASVPGDWLVPLPQNLS ----------------33332222-------2222------------1111----11113 LKEAVYGTAGFTAALSVHRLEQNGLSPEKGSVLVTGATGGVGGIAVSLNKRGYDVVASTG 333----------------3333--1111------33333333-----3333-------- NREAADYLKQLGASEVISREDVYDGTLKALSKQQWQGAVDPVGGKQLASLLSKIQYGGSV -----------------3333----------------------3333---11112222-- AVSGLTGGGEVPATVYPFILRGVSLLGIDSVYCPDVRAAVWERSSDLKPDQLLTIVDREV ---------------------------------------------------3333----- SLEETPGALKDILQNRIQGRVIVKL 11113333----------------- >CHORISMATE-PYRUVATE LYASE; SWP:P26602; PDB:1TT8A; SHPALTQLRALRYSKEIPALDPQLLDWLLLEDSMTKRFEQQGKTVSVTMIREGFVEQNEI -3333--1111---------------------------1111-------------3333- PEELPLLPKESRYWLREILLSADGEPWLAGRTVVPVSTLSGPELALQKLGKTPLGRYLFT --3333---------------iiii---------3333-!!!!-1111!!!!--3333-- SSTLTRDFIEIGRDAGLWGRRSRLRLSGKPLLLTELFLPASPLY -------------iiii--------iiii--------1111--- >DENDRITIC CELL-DERIVED UB; SWP:Q8WUN7; PDB:1TTNA; GYECQLRLRLSTGKDLKLVVRSTDTVFHMKRRLHAAEGVEPGSQRWFFSGRPLTDKMKFE ---------iiii----------------------------------iiii------111 ELKIPKDYVVQVIVSQPVQN 1------------------- >LIN-12 AND GLP-1 TRANSCRI; SWP:Q9TYY1; PDB:1TTUA; VQSLTSDRMIDFLSNKEKYECVISIFHAKVAQKSYGNEKRFFCPPPCIYLIGQGWKLKKD ---------------3333---------------!!!!-------------3333----- 
RVAQLYKTEQQATELVAYIGIGSDTSERQQLDFPNIYDYCAAKTLYISDSDKRKYFDLNA -----------------------------------------------1111--------- QFFYGCGMEIGGFVSQRIKVISKPSKKKQSMKNTDCKYLCIASGTKVALFNRLRSQTVST ---3333---------------------------3333--------------%%%%---- RYLHVEGNAFHASSTKWGAFTIHLFDDERGLQETDNFAVRDGFVYYGSVVKLVDSVTGIA -----------------------------------------------------3333--- LPRLRIRKVDKQQVILDASCSEEPVSQLHKCAFQMIDNELVYLCLSHDKIIQHQATAINE ------------------1111---2222------------------------------- HRHQINDGAAWTIISTDKAEYRFFEAMGQVANPISPCPVVGSLEVDGHGEASRVELHGRD ------1111-------------------------------------------------- FKPNLKVWFGATPVETTFRSEESLHCSIPPVSQVRNEQTHWMFTNRTTGDVEVPISLVRD --------!!!!-----------------3333--3333-1111--------------11 DGVVYSSGLTFSYKS 11---------3333 >SECRETION CHAPERONE; SWP:Q7ARG9; PDB:1TTWA; TYSSLLEEFATELGLEEIETNELGHGAVTIDKIWVVHLAPINEKELVAFMRAGILTGQSQ --------------------1111-----------------1111--------------- LYDILRKNLFSPLSGVIRCALDKDDHWLLWSQLNINDTSGTQLASVLTSLVDKAVTLS ----1111-------------------------1111--------------------- >ONCOMODULIN; SWP:P32930; PDB:1TTXA; MSITDVLSADDIAAALQECRDPDTFEPQKFFQTSGLSKMSANQVKDVFRFIDNDQSGYLD ----------------------------------3333-------------3333----3 EEELKFFLQKFESGARELTESETKSLMAAADNDGDGKIGAEEFQEMVHS 333---3333-----3333------------------------------ >RNA POLYMERASE SIGMA FACT; SWP:P77994; PDB:1TTYA; KEAMRMLMREELEKVLKTLSPREAMVLRMRYGLLDGKPKTLEEVGQYFNVTRERIRQIEV --------------3333-3333----------------33333333------------- KALRKLRHPSRSKYLKSLLSLMDENEG --------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8P6W3; PDB:1TTZA; ALTLYQRDDCHLCDQAVEALAQARAGAFFSVFIDDDAALESAYGLRVPVLRDPGRELDWP --------------------1111------------------1111-------------- FDAPRLRAWLDAAP ---------1111- >HYPOTHETICAL PROTEIN PA00; SWP:NA; PDB:1TU1A; HMTLYRLHEADLEIPDAWQDQSINIFKLPASGPAREASFVISRDASQGDAPFADYVARQL ------1111----3333------------!!!!------------!!!!---------- ENAEKQLPGFKLHKRWDINIHGHAAVLLDYQWQREGRDLMLRQVFIERRPAVLITTLTTT 
------2222---------iiii----------iiii----------------------3 PADLPHHEPAWKQAMQTLVPRPT 333----------3333------ >Apocytochrome f [Precurso; SWP:Q93SW9; PDB:1TU2B; YPFWAQQTYPETPREPTGRIVCANCHLAAKPTEVEVPQSVLPDTVFKAVVKIPYDTSVQQ 3333----1111--1111-3333-------------------------------3333-- VGADGSKVGLNVGAVLMLPEGFKIAPEDRIPEELKEEIGDVYFQPYGEDKDNIVIVGPLP -1111--------------------3333-----------------3333---------3 GEQYQEIVFPVLSPNPANDKNIHFGKYSVHVGGNRGRGQVYPTGEKSNNNLYSAAATGTI 333-----------33333333------------------1111---------------- SKIAKQEGEDGSVKYLVDIKTESGEVVSDTIPAGPELIVSEGQAVTAGDALTNNPNVGGF --------------------1111---------------------2222----------- GQLDAEIVLQDANR -------------- >Rab GTPase-binding effect; SWP:Q15276; PDB:1TU3F; AQRLQTELDVSEQVQRDFVKLSQTLQVQLERIRQADSLERIRAILN -------------------------------1111----------- >COPPER AMINE OXIDASE, LIV; SWP:Q29437; PDB:1TU5A; QLFADLSREELTTVMSFLTQQLGPDLVDAAQARPSDNCVFSVELQLPPKAAALAHLDRGS ---------------------------3333-1111------------------------ PPPAREALAIVFFGGQPQPNVTELVVGPLPQPSYMRDVTVERHGGPLPYYRRPVLLREYL --------------------------------------1111-----1111--------- DIDQMIFNRELPQAAGVLHHCCSYKQGGQKLLTMNSAPRGVQSGDRSTWFGIYYNITKGG ---------3333---------2222---------------------------------1 PYLHPVGLELLVDHKALDPADWTVQKVFFQGRYYENLAQLEEQFEAGQVNVVVIPDRFSV 111--------------3333-------iiii---------------------------- QGNRVASSLWTFSFGLGAFSGPRVFDVRFQGERLAYEISLQEAGAVYGGNTPAAMLTRYM !!!!------------------------iiii-------------------3333----- DSGFGMGYFATPLIRGVDCPYLATYMDWHFVVESQTPKTLHDAFCVFEQNKGLPLRRHHS ------1111---2222--1111------------------------------------- DFLSHYFGGVAQTVLVFRSVSTMLNDYVWDMVFYPNGAIEVKLHATGYISSAFLFGAARR ---------------------------------1111-------------------3333 YGNQVGEHTLGPVHTHSAHYKVDLDVGGLENWVWAEDMAFVPTAIPWSPEHQIQRLQVTR -----2222---------------2222----------------1111------------ KQLETEEQAAFPLGGASPRYLYLASKQSNKWGHPRGYRIQTVSFAGGPMPQNSPMERAFS ----3333---2222-------------1111-----------------3333-333333 
WGRYQLAITQRKETEPSSSSVFNQNDPWTPTVDFSDFINNETIAGKDLVAWVTAGFLHIP 33---------1111----1111--3333---3333------------------------ HAEDIPNTVTVGNGVGFFLRPYNFFDQEPSMD 3333---------------------------- >CATHEPSIN K; SWP:P43235; PDB:1TU6A; APDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEN -----3333--------------3333-----------------------------3333 DGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKA !!!!-------------------1111-----------3333-----------2222--- LKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWII ------------------3333---------11111111------------iiii----- KNSWGENWGNKGYILMARNKNNACGIANLASFPKM ----1111-iiii------%%%%-1111------- >GLUTATHIONE S-TRANSFERASE; SWP:P46427; PDB:1TU7A; MSYKLTYFSIRGLAEPIRLFLVDQDIKFIDDRIAKDDFSSIKSQFQFGQLPCLYDGDQQI -----------3333------------------333333333333---------!!!!-- VQSGAILRHLARKYNLNGENEMETTYIDMFCEGVRDLHVKYTRMIYMAYETEKDPYIKSI ------------------------------------------------------------ LPGELAKFEKLLATRGNGRNLILGDKISYADYALFEELDVHQILDPHCLDKFPLLKVFHQ --------------------1111---3333-------------11111111-------- RMKDRPKLKEYCEKRDAAKVPVNGNGKQ ----------------------1111-- >HYPOTHETICAL PROTEIN PA39; SWP:Q9HX49; PDB:1TU9A; NAADRVMQSYGRCCASTGFFDDFYRHFLASSPQIRAKFATTDMTAQKHLLRAGIMNLVMY ---------------2222----------------1111--3333--------------1 ARGMSDSKLRALGASHSRAALDIRPELYDLWLDALLMAVAEHDRDCDAETRDAWRDVMGR 111-------------1111---3333---------------1111-------------- GIAVIKSYYGS -------1111 >HYPOTHETICAL PROTEIN APE0; SWP:Q9YE16; PDB:1TUAA; AMKPRIYVKVKPERLGAVIGPRGEVKAEIMRRTGTVITVDTENSMVIVEPEAEGIPPVNL ----------1111-33332222-----------------1111-------11113333- MKAAEVVKAISLGFPPEKAFRLLEEDQILVVVDLKQVVGDSQNHLKRIKGRIIGEGGRAR --------------33333333-2222----------!!!!------------2222--- RTIEEMTDTYINVGEYEVAIIGDYERAMAAKQAIEMLAEGRMHSTVYRHLERIMREIKRR ------------------------------------1111-------------------- ERLKMWARE --------- >REPLICATION PROTEIN E1; SWP:P06789; PDB:1TUEA; 
SGSNMSQWIRFRCSKIDEGGDWRPIVQFLRYQQIEFITFLGALKSFLKGTPKKNCLVFCG -----------3333------3333----1111----------------2222------- PANTGKSYFGMSFIHFIQGAVISFVNSTSHFWLEPLTDTKVAMLDDATTTCWTYFDTYMR 1111-------------------------11111111----------------------- NALDGNPISIDPLIQLKCPPILLTTNIHPAKDNRWPYLESRITVFEFPNAFPFDKNGNPV ----------------------------------3333---------------1111--- YEINDKNWKCFFERTWSRLDL ---------------3333-- >HYPOTHETICAL PROTEIN EGC0; SWP:Q99IU3; PDB:1TUHA; NEAEQNAETVRRGYAAFNSGDMKTLTELFDENASWHTPGRSRIAGDHKGREAIFAQFGRY -------------------------11111111-------1111---------------- GGETGGTFKAVLLHVLKSDDGRVIGIHRNTAERGGKRLDVGCCIVFEFKNGRVIDGREHF ---%%%%----------1111-----------iiii------------iiii-------- YDLYAWDEFWR ----------- >ODORANT BINDING PROTEIN A; SWP:Q9U9J5; PDB:1TUJA; IDQDTVVAKYMEYLMPDIMPCADELHISEDIATNIQAAKNGADMSQLGCLKACVMKRIEM -3333-----3333---------------3333----------1111-------3333-- LKGTELYVEPVYKMIEVVHAGNADDIQLVKGIANECIENAKGETDECNIGNKYTDCYIEK -------3333--------------------------------------------3333- LFS --- >NONSPECIFIC LIPID-TRANSFE; SWP:P82900; PDB:1TUKA; ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSC ------3333--------------------3333-3333----3333----------111 GLAVPHC 1------ >TLP20; SWP:Q06691; PDB:1TUL; GTPDIIVNAQINSEDENVLDFIIEDEYYLKKRGVGAHIIKVASSPQLRLLYKNAYSTVSC --------------1111-----------------------------3333--------- GNYGVLCNLVQNGEYDLNAIMFNCAEIKLNKGQMLFQTKIWR -----------------------------2222--------- >PUTATIVE PHOSPHOMANNOMUTA; SWP:Q5SKJ3; PDB:1TUOA; MSAPIRFGTEGFRGVIAREFTFATLHRLAEAYGRHLLERGGGLVVVGHDTRFLADAFARA --------------2222------------------1111----------2222------ LSGHLAGMGLKVVLLKGPVPTPLLSFAVRHLKAAGGAMLTASHNPPQYLGVKFKDATGGP -----1111----------3333-----------------!!!!3333------1111-- IAQEEAKAIEALVPEEARALEGAYETLDLREAYFEALKAHLDLKALSGFSGVLYHDSMGG ---------1111------------------------111133331111-------iiii AGAGFLKGFLRHVGLEIPVRPIREEPHPLFHGVNPEPIPKNLGVTLAVLGPETPPSFAVA ----------1111------------1111-------3333------------------- 
TDGDADRVGVVLPGGVFFNPHQVLTTLALYRFRKGHRGRAVKNFAVTWLLDRLGERLGFG -1111-------------------------------------1111-------------- VTTTPVGFKWIKEEFLKGDCFIGGEESGGVGYPEHLPERDGILTSLLLLESVAATGKDLA --------------3333------1111---1111------------------------- EQFKEVEALTGLTHAYDRLDRPLAGLTPKGVDTLDGVKWLYEEAWVLFRASVRIYVEAQS ----------------------------------------2222---------------- PELVRALLEEARKLVEG -----------1111-- >PROTEIN YGIN; SWP:P40718; PDB:1TUVA; MLTVIAEIRTRPGQHHRQAVLDQFAKIVPTVLKEEGCHGYAPMVDCAAGVSFQSMAPDSI ------------%%%%-----------------2222------------1111------- VMIEQWESIAHLEAHLQTPHMKAYSEAVKGDVLEMNIRILQPG -------------------------1111-------------- >TETRACENOMYCIN POLYKETIDE; SWP:P39890; PDB:1TUWA; AYRALMVLRMDPADAEHVAAAFAEHDTTELPLEIGVRRRVLFRFHDLYMHLIEADDDIME ----------3333----------33333333---------------------------- RLYQARSHPLFQEVNERVGQYLTPYAQDWEELKDSKAEVFYSWTAP ----33331111------1111---1111-3333------------ >XYLANASE; SWP:P23360; PDB:1TUX; AAAQSVDQLIDARGKVYFGVATDQNRLTTGKNAAIIQADFGQVTPENSMKWDATEPSQGN ----------1111--------3333--!!!!--------------11113333--2222 FNFAGADYLVNWAQQNGKLIRGHTLVWHSQLPSWVVSITDKNTLTNVMKNHITTIMTRYI -------------1111--------------3333-----------------------22 GKIRAWDVVNEAFNEDGSLRQTVFNNVIGEDYIPIAFRTARAADPNAKLYINDYNLDSAS 22-----------1111----3333------------------3333----------111 KPKTSAIVKRVKKWRAAGVPIDGIGSQTHLSAGQGASIDAALPNLASAGTPEVAITELDI 1-------------1111------------22223333---------------------2 AGATSTDYVDVVNACLDVDSCIGITVWGVADPDSWRASTTPLLFDGNFNPKPAYNAIVQL 222--------------1111--------3333--3333-----1111------------ L - >DIACYLGLYCEROL KINASE ALP; SWP:O95217; PDB:1TUZA; MAKERGLISPSDFAQLQKYMEYSTKKVSDVLKLFEDGEMAKYVQGDAIGYEGFQQFLKIY --------3333-------------3333--1111---3333-------3333------- LEVDNVPRHLSLALFQSFETGHCLNETNVTKDVVCLNDVSCYFSLLEGGRPEDKLEWS --------------3333------------------------3333------------ -------------------------------- >Dihydroorotate dehydrogen; SWP:Q08210; PDB:1TV5A; FESYNPEFFLYDIFLKFCLKYIDGEICHDLFLLLGKYNILPYDTSNDSIYACTNIKHLDF 
-1111--3333------------------------------------1111---!!!!-- INPFGVAAGFDKNGVCIDSILKLGFSFIEIGTITPRGQTGNAKPRIFRDVESRSIINSCG ----------1111------3333-------------------------1111------- FNNMGCDKVTENLILFRKRQEEDKLLSKHIVGVSIGKNKDTVNIVDDLKYCINKIGRYAD -------------------11111111----------1111-3333--------3333-- YIAINVSSPNTPGLRDNQEAGKLKNIILSVKEEIDNLEFLWFNTTKKKPLVFVKLAPDLN ----------2222---------------------------------------------- QEQKKEIADVLLETNIDGMIISNTTTQINDIKSFENKKGGVSGAKLKDISTKFICEMYNY -----------1111----------------1111-------3333-------------- TNKQIPIIASGGIFSGLDALEKIEAGASVCQLYSCLVFNGMKSAVQIKRELNHLLYQRGY %%%%----------------------------3333---1111----------------- YNLKEAIGRKH -3333--1111 >MOLYBDENUM COFACTOR BIOSY; SWP:Q99S04; PDB:1TV8A; EQIKDKLGRPIRDLRLSVTDRCNFRCDYCMPKEVFGDDFVFLPKNELLTFDEMARIAKVY ----1111-----------------1111-3333-1111---3333-------------- AELGVKKIRITGGEPLMRRDLDVLIAKLNQIDGIEDIGLTTNGLLLKKHGQKLYDAGLRR 1111---------3333--3333-------2222--------1111-------1111--- INVSLDAIDDTLFQSINNRNIKATTILEQIDYATSIGLNVKVNVVIQKGINDDQIIPMLE ---------------------3333--------1111---------22221111------ YFKDKHIEIRFIEFMDVGNDNGWDFSKVVTKDEMLTMIEQHFEIDPVEPKYFGEVAKYYR --1111------------------------------1111----------2222------ HKDNGVQFGLITSVSQSFCSTCTRARLSSDGKFYGCLFATVDGFNVKAFIRSGVTDEELK -----------------3333------1111------------------1111------- EQFKALWQIRDDRYSDERTAQTVANRQ ----------------------3333- >METHANE MONOOXYGENASE COM; SWP:P22868; PDB:1TVCA; CRISFGEVGSFEAEVVGLNWVSSNTVQFLLQKRPDECGNRGVKFEPGQFMDLTIPGTDVS ------------------------------------------------------------ RSYSPANLPNPEGRLEFLIRVLPEGRFSDYLRNDARVGQVLSVKGPLGVFGLKERGMAPR ---------------------2222----------------------------------- YFVAGGTGLAPVVSMVRQMQEWTAPNETRIYFGVNTEPELFYIDELKSLERSMRNLTVKA ------------------------------------1111-------------------- CVWHPSGDWEGEQGSPIDALREDLESSDANPDIYLCGPPGMIDAACELVRSRGIPGEQVF ------------------------------------------------------------ FEKFLPSGAA ---------- >T CELL RECEPTOR; 
SWP:A0JD37; PDB:1TVDA; DKVTQSSPDQTVASGSEVVLLCTYDTVYSNPDLFWYRIRPDYSFQFVFYGDDSRSEGADF ------------2222----------------------1111------------------ TQ -- >PENICILLIN BINDING PROTEI; SWP:Q53613; PDB:1TVFA; GPHTSSYAQATNSDVTPVQAANQYGYAGLSAAYEPTSAVNVSQTGQLLYQYNIDTKWNPA ----1111-----------------22223333--------1111------1111---!! SMTKLMTMYLTLEAVNKGQLSLDDTVTMTNKEYIMSTLPELSNTKLYPGQVWTIADLLQI !!------------1111--1111-------------2222-----2222---------- TVSNSSNAAALILAKKVSKNTSDFVDLMNNKAKAIGMKNTHFVNPTGAENSRLRSFAPTK -1111---------------------------11111111--------3333!!!!-333 YKDQERTVTTARDYAILDLHVIKETPKILDFTKQLAPTTHAVTYYTFNFSLEGAKMSLPG 3-------------------------33331111----%%%%-----1111------222 TDGLKTGSSDTANYNHTITTKRGKFRINQVIMGAGDYKNLGGEKQRNMMGNALMERSFDQ 2--------------------!!!!----------------------------------- YKYVKILSKGEQRINGKKYYVENDLYDVLPSDFSKKDYKLVVEDGKVHADYPREFINKDY -------------iiii------------11113333-------------------1111 GPPTVEVHQ --------- >LOC51668 PROTEIN; SWP:Q9Y547; PDB:1TVGA; IDLCLSSEGSEVILATSSDEKHPPENIIDGNPETFWTTTGMFPQEFIICFHKHVRIERLV -11111111---------11113333----1111-------------------------- IQSYFVQTLKIEKSTSKEPVDFEQWIEKDLVHTEGQLQNEEIVAHGSATYLRFIIVSAFD --------------------------------2222------------------------ HFASVHSVSAEGTVVS ---------------- >HYPOTHETICAL UPF0054 PROT; SWP:Q9X1J7; PDB:1TVIA; MIRILGEGKGSKLLENLKEKLEEIVKKEIGDVHVNVILVSEDEIKELNQQFRGQDRPTDV ----------------------------------------3333---------------- LTFPLMEEDVYGEIYVCPLIVEENAREFNNTFEKELLEVVIHGILHLAGYDHEFEDKNSK ---------------------------------------------3333----------- EMFEKQKKYVEEVWGEWRSNPSEDSDPGKR ---------------3333----------- >COFILIN; SWP:P21566; PDB:1TVJA; MASGVTVNDEVIKVFNDMKVRKSSTPEEIKKRKKAVLFCLSDDKKQIIVEEAKQILVGDI -------3333-----------------------------1111-----3333--1111- GDTVEDPYTAFVKLLPLNDCRYALYDATYETKESKKEDLVFIFWAPESAPLKSKMIYASS ---------------1111--------------------------1111----------- KDAIKKKFTGIKHEWQVNGLDDIKDRSTLGEKLGGNVVVSLEGKPL 
------------------3333------------------iiii-- >TUBULIN ALPHA CHAIN; SWP:P02550; PDB:1TVKA; RECISIHVGQAGVQIGNACWELYCLEHGIQPDGHVPRAVFVDLEPTVIDEVRTGTYRQLF ----------33333333----3333---------------------------2222--- HPEQLITGKEDAANNYARGHYTIGKEIIDLVLDRIRKLADQCTGLQGFSVFHSFGGGTGS 3333-----------1111---3333-3333-------------------------3333 GFTSLLMERLSVDYGKKSKLEFSIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNE ------3333---1111---------------3333----------------------33 AIYDICRRNLDIERPTYTNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRGHF 33-------------3333------------------------3333------------- PLATYAPVISAEKAYHEQLSVAEITNACFEPANQMVKCDPRHGKYMACCLLYRGDVVPKD --------------------------------------------------------1111 VNAAIATIKTKRTIQFVDWCPTGFKVGINYEPPTVVPGGDLAKVQRAVCMLSNTTAIAEA --3333------------------------------------------------------ WARLDHKFDLMYAKRAFVHWYVGEGMEEGEFSEAREDMAALEKDYEEVGVDS --------------!!!!--3333--3333------3333------------ >Tubulin beta chain; SWP:P02554; PDB:1TVKB; REIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGSYHGDSDLQLERINVYYNEAAGNKYVP -----------------------3333-----------------1111------------ RAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVVR -------33333333-----------------------3333--------3333------ KESESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVVE -1111------------------3333--11113333----------------------- PYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCLR ---------3333----------3333------------3333-------------3333 FPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQMFDAKNMMA --------------------------------------------1111------------ ACDPRHGRYLTVAAVFRGRMSMKEVDEQMLNVQNKNSSYFVEWIPNNVKTAVCDIPPRGL --1111--------------3333------------------------------------ KMSATFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVSE --------------------------------1111-----------------------1 YQQYQD 111--- >PROTEIN YTNJ; SWP:O34974; PDB:1TVLA; RADFIQFGAMIHGVGGTTDGWRHPDVDPSASTNIEFYMKKAQTAEKGLFSFIFIADGLFI 
-----------!!!!---333311111111--------------1111------------ SEKSIPHFLNRFEPITILSALASVTKNIGLVGTFSTSFTEPFTISRQLMSLDHISGGRAG 11113333-----------3333------------------------------------- WNLVTSPQEGAARNHSKSNLPEHTERYEIAQEHLDVVRGLWNSWEHDAFIHNKKTGQFFD -------33331111-----------------------------1111----3333---1 QAKLHRLNHKGKYFQVEGPLNIGRSKQGEPVVFQAGSSETGRQFAAKNADAIFTHSNSLE 111---------------------1111-------------------------------- ETKAFYADVKSRAADEGRDPSSVRIFPGISPIVADTEEEAEKKYREFAELIPIENAVTEA -------------1111-3333------------------------1111----3333-- KARNLTLREVAQEMAFPRTLFIGTPERVASLIETWFNAEAADGFIVGSDIPGTLDAFVEK -----------------------------------1111----------2222------- VIPILQERGLYRQDYRGGTLRENLGLGIPQ -----1111--------------------- >PTS SYSTEM, GALACTITOL-SP; SWP:P0A437; PDB:1TVMA; MGSSHHHHHHHHENLYFQGSKRKIIVACGGAVATSTMAAEEIKELCQSHNIPVELIQCRV ----------------------------------------------1111---------- NEIETYMDGVHLICTTARVDRSFGDIPLVHGMPFVSGVGIEALQNKILTILQG -3333-------------------------3333------------------- >CELLULASE; SWP:O86099; PDB:1TVNA; AVEKLTVSGNQILAGGENTSFAGPSLFWSNTGWGAEKFYTAETVAKAKTEFNATLIRAAI -------!!!!--iiii------------2222-3333---------------------- GHGTSTGGSLNFDWEGNMSRLDTVVNAAIAEDMYVIIDFHSHEAHTDQATAVRFFEDVAT -----2222-------------------1111----------3333-------------- KYGQYDNVIYEIYNEPLQISWVNDIKPYAETVIDKIRAIDPDNLIVVGTPTWSQDVDVAS -3333---------------------------------------------%%%%-3333- QNPIDRANIAYTLHFYAGTHGQSYRNKAQTALDNGIALFATEWGTVNADGNGGVNINETD ---------------1111-3333-------1111-----------1111---------- AWMAFFKTNNISHANWALNDKNEGASLFTPGGSWNSLTSSGSKVKEIIQGWGG ----------------------3333--22221111-----------1111-- >TRANSACTIVATOR PROTEIN; SWP:NA; PDB:1TVS; LEDRRIPGTAEENLQKSSGGVPGQNTGGQEARPNYHCQLCFLRSLGIDYLDASLRKKNKQ --------3333-----------------------------------------3333--3 RLKAIQQGRQPQYLL 3331111----3333 >NEUTROPHIL ACTIVATING PEP; SWP:P02775; PDB:1TVXA; LRCLCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKIVQKK 
------------3333---------1111--------1111-----1111---------- LAGD ---- >PATHOGENESIS-RELATED CLAS; SWP:Q6T6J0; PDB:1TW0A; GVFVFRDETSSSVAPAKLYKALTKDSDTIAQKIDGPIQSIELVEGNGGVGTIKKITANEG --------------3333------33333333---------------2222--------- DKTSFVLQKVDAIDEANLGYDYSIVGGTGLPESLEKLSFETKVVAGSGGGSISKVTLKFH --------------1111--------11113333-------------------------- TKGDAPLSDAVRDDALAKGAGFFKAIEGYVLANPAEY -!!!!---------------------------3333- >CARMINOMYCIN 4-O-METHYLTR; SWP:Q06528; PDB:1TW3A; QIDALRTLIRLGSLHTPMVVRTAATLRLVDHILAGARTVKALAARTDTRPEALLRLIRHL -------------------------------1111------------------------- VAIGLLEEDAPGEFVPTEVGELLADDHPAAQRAWHDLTQAVARADISFTRLPDAIRTGRP 1111-----2222----3333--1111--------1111--------------------- TYESIYGKPFYEDLAGRPDLRASFDSLLACDQDVAFDAPAAAYDWTNVRHVLDVGGGKGG 3333------------------------1111-1111--3333-1111-------!!!!- FAAAIARRAPHVSATVLEMAGTVDTARSYLKDEGLSDRVDVVEGDFFEPLPRKADAIILS --------1111------------------------------------------------ FVLLNWPDHDAVRILTRCAEALEPGGRILIHERDDLHENSFNEQFTELDLRMLVFLGGAL -3333-------------11112222---------3333--3333--------------- RTREKWDGLAASAGLVVEEVRQLPSPTIPYDLSLLVLAPA -3333----------------------------------- >FATTY ACID-BINDING PROTEI; SWP:P80226; PDB:1TW4A; AFSGTWQVYAQENYEEFLKALALPEDLIKMARDIKPIVEIQQKGDDFVVTSKTPRQTVTN ------------------1111-3333---1111--------!!!!------3333---- SFTLGKEADITTMDGKKLKCTVHLANGKLVTKSEKFSHEQEVKGNEMVETITFGGVTLIR --2222-----1111---------iiii----1111----------------iiii---- RSKRV ----- >PROTEASE; SWP:Q9QM22; PDB:1TW7A; PQITLWQRPIVTIKIGGQLKEALLNTGADDTVLEEVNLPGRWKPKLIGGIGGFVKVRQYD --------------iiii------1111--------------------2222-------- QVPIEICGHKVIGTVLVGPTPANVIGRNLMTQIGCTLNF -----iiii------------------------------ >GLUTATHIONE S-TRANSFERASE; SWP:NA; PDB:1TW9A; MVHYKLTYFNGRGAGECARQVFALADQKYEDVRLTQETFVPLKATFPFGQVPVLEVDGQQ -----------!!!!-------1111--------333333331111---------iiii- LAQSQAICRYLAKTFGFAGATPFESALIDSLADAYTDYRAEMDKPKTDVLLPARTKFLGF 
------------------------------------------------------------ ITKFLKKNSSGFLVGDKISWVDLLVAEHVADMTNRVPEYIEGFPEVKAHMERIQQTPRIK -------3333-------3333-------------11112222------------3333- KWIETRPETPF ----------- >COPPER HOMEOSTASIS PROTEI; SWP:P46719; PDB:1TWDA; ALLEICCYSECALTAQQNGADRVELCAAPKEGGLTPSLGVLKSVRQRVTIPVHPIIRPRG ---------------1111--------3333----------------------------- GDFCYSDGEFAAILEDVRTVRELGFPGLVTGVLDVDGNVDPREKIAAAGPLAVTFHRAFD ---------------------------------1111--------3333-------3333 CANPLYTLNNLAELGIARVLTSGQKSDALQGLSKIELIAHRDAPIIAGAGVRAENLHHFL ---------------------%%%%-3333-3333----------------3333----- DAGVLEVHSSAGAWQASPRYRNYSRYIVDGAAVAEKGIIERHQAK --------------------------------------------- >DNA-DIRECTED RNA POLYMERA; SWP:P04050; PDB:1TWFA; VGQQYSSAPLRTVKEVQFGLFSPEEVRAISVAKIRFPETMDETQTRAKIGGLNDPRLGSI -----------------------------------------iiii--------------% DRNLKCQTCQEGMNECPGHFGHIDLAKPVFHVGFIAKIKKVCECVCMHCGKLLLDEHNEL %%%---------------------------3333--------------------1111-- MRQALAIKDSKKRFAAIWTLCKTKMVCETDVPSEDDPTQLVSRGGCGNTQPTIRKDGLKL ---3333-3333-------3333------------3333----------------!!!!- VGSWKKDRATGDADEPELRVLSTEEILNIFKHISVKDFTSLGFNEVFSRPEWMILTCLPV ---------------------3333---1111----------------3333-------- PPPPVRPSISFNESQRGEDDLTFKLADILKANISLETLEHNGAPHHAIEEAESLLQFHVA -3333-------------1111---------------------3333------------- TYMDNDIAGQPQALQKSGRPVKSIRARLKGKEGRIRGNLMGKRVDFSARTVISGDPNLEL ---------------3333--------------3333---------------------11 DQVGVPKSIAKTLTYPEVVTPYNIDRLTQLVRNGPNEHPGAKYVIRDSGDRIDLRYSKRA 11---3333----------3333----------------------3333---1111---- GDIQLQYGWKVERHIMDNDPVLFNRQPSLHKMSMMAHRVKVIPYSTFRLNLSVTSPYNAD -----2222------2222----------1111----------------1111-1111-- FDGDEMNLHVPQSEETRAELSQLCAVPLQIVSPQSNKPCMGIVQDTLCGIRKLTLRDTFI ------------------------3333--------------------------3333-- ELDQVLNMLYWVPDWDGVIPTPAIIKPKPLWSGKQILSVAIPNGIHLQRFDEGTTLLSPK 3333-------1111---------------------3333-----------------111 
DNGMLIIDGQIIFGVVEKKTVGSSNGGLIHVVTREKGPQVCAKLFGNIQKVVNFWLLHNG 1-----%%%%------3333---------------------------------------- FSTGIGDTIADGPTMREITETIAEAKKKVLDVTKEAQANLLTAKHGMTLRESFEDNVVRF ---3333---3333---------------------------------------------- LNEARDKAGRLAEVNLKDLNNVKQMVMAGSKGSFINIAQMSACVGQQSVEGKRIAFGFVD ----------------1111------------3333------------%%%%-----222 RTLPHFSKDDYSPESKGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTAETGYIQRRL 2-1111-----3333------3333----------------------------------- VKALEDIMVHYDNTTRNSLGNVIQFIYGEDGMDAAHIEKQSLDTIGGSDAAFEKRYRVDL ---1111--1111---1111------%%%%--1111-----3333-------------11 LNTDHTLDPSLLESGSEILGDLKLQVLLDEEYKQLVKDRKFLREVFVDGEANWPLPVNIR 11-----1111--3333----------------------------3333----------- RIIQNAQQTFHIDHTKPSDLTIKDIVLGVKDLQENLLVLRGKNEIIQNAQRDAVTLFCCL --------------------3333-------1111------------------------- LRSRLATRRVLQEYRLTKQAFDWVLSNIEAQFLRSVVHPGEMVGVLAAQSIGEPATQMTL ----------------3333------------1111-----------------3333--- KKVTSGVPRLKEILNVAKNMKTPSLTVYLEPGHAADQEQAKLIRSAIEHTTLKSVTIASE -----------------------------3333-----------------3333------ IYYDPDPRSTVIPEDEEIIQLHFSLQQSPWLLRLELDRAAMNDKDLTMGQVGERIKQTFK -----------3333---1111-------------------1111--------------3 NDLFVIWSEDNDEKLIIRCRVVAEEDHMLKKIENTMLENITLRGVENIERVVMMKYDRKV 333-----3333--------------------------------2222------------ PSPTGEYVKEPEWVLETDGVNLSEVMTVPGIDPTRIYTNSFIDIMEVLGIEAGRAALYKE -1111---------------33331111---3333------------------------- VYNVIASDGSYVNYRHMALLVDVMTTQGGLTSVTRHGFNRSNTGALMRCSFEETVEILFE -----1111-------------1111-----------------1111------------- AGASAELDDCRGVSENVILGQMAPIGTGAFDVMIDEESL ----------------1111----!!!!----------- >DNA-directed RNA polymera; SWP:P08518; PDB:1TWFB; FEDESAPITAEDSWAVISAFFREKGLVSQQLDSFNQFVDYTLQDIICEDSTLIEISFGKI --1111--3333--------------3333--------------3333------------ YVTKPMVNESDGVTHALYPQEARLRNLTYSSGLFVDVKKRTYEKVFIGRLPIMLRSKNCY -----------------3333-1111--------------------------2222--33 
LSEATESDLYKLKECPFDMGGYFIINGSEKVLIAQERSAGNIVQVFKKAAPSPISHVAEI 33--3333------1111------iiii--------------------3333-------- RSALEKGSRFISTLQVKLYGREGSSARTIKATLPYIKQDIPIVIIFRALGIIPDGEILEH --------------------------------2222----3333--1111---------- ICYDVNDWQMLEMLKPCVEDGFVIQDRETALDFIGRRGTALGIKKEKRIQYAKDILQKEF ---1111-------------3333---------------11113333------------- LPHITQLEGFESRKAFFLGYMINRLLLCALDRKDQDDRDHFGKKRLDLAGPLLAQLFKTL 3333-----------------------1111-----11111111---------------- FKKLTKDIFRYMQRTVELAINAKTITSGLKYALATGNWGAGVSQVLNRYTYSSTLSHLRR -------33331111-----3333-------------------------3333------- TNTPIAKPRQLHNTHWGLVCPAETPEGQACGLVKNLSLMSCISVGTDPMPIITFLSEWGM ------3333-3333-----------1111------1111-------------------- EPLEDYVPHQSPDATRVFVNGVWHGVHRNPARLMETLRTLRRKGDINPEVSMIRDIREKE -3333-11113333----iiii------------------------3333-----1111- LKIFTDAGRVYRPLFIVEDDESLGHKELKVRKGHIAKLMATEYQDEYTWSSLLNEGLVEY -----------------------------------------------3333--------- IDAEEEESILIAMQPEDLEPDVDPAKRIRVSHHATTFTHCEIHPSMILGVAASIIPFPDH -33331111----1111-------------------------3333----3333--1111 NQSPRNTYQSAMGKQAMGVFLTNYNVRMDTMANILYYPQKPLGTTRAMEYLKFRELPAGQ -3333-----3333--------3333------------------333311113333---- NAIVAIACYSGYNQEDSMIMNQSSIDRGLFRSLFFRSYMDQEKKYGMSITETFEKPQRTN -------------%%%%-----3333-2222--------------2222----------- TLRMKHGTYDKLDDDGLIAPGVRVSGEDVIIGKTTPISSKRDASTPLRSTENGIVDQVLV ------------1111-------------------------------------------- TTNQDGLKFVKVRVRTTKIPQIGDKFASRHGQKGTIGITYRREDMPFTAEGIVPDLIINP --3333--------------2222----------------3333---1111-------11 HAIPSRMTVAHLIECLLSKVAALSGNEGDASPFTDITVEGISKLLREHGYQSRGFEVMYN 11-33333333----------------------------------1111----------- GHTGKKLMAQIFFGPTYYQRLRHMVDDKIHARARGPGLRFGEMERDCMIAHGAASFLKER ------------------------1111-------------------------------- LMEASDAFRVHICGICGLMTVIAKLNHNQFECKGCDNKIDIYQIHIPYAAKLLFQELMAM ----------------------------------------------3333---------- NITPRLYTDRSRDF -------------- >DNA-directed RNA 
polymera; SWP:P16370; PDB:1TWFC; EEGPQVKIREASKDNVDFILSNVDLAMANSLRRVMIAEIPTLAIDSVEVETNTTVLADEF ------------------------------------------------------------ IAHRLGLIPLQSMDIEQLEYSRDCFCEDHCDKCSVVLTLQAFGESESTTNVYSKDLVIVS ----1111-----3333--3333------3333------------------3333----- NLMGRNIGHPIIQDKEGNGVLICKLRKGQELKLTCVAKKGIAKEHAKWGPAAAIEFEYDP -iiii--------------------2222-----------33333333------------ WNKLKHTDYWYEQDSAKEWPQSKNCEYEDPPNEGDPFDYKAQADTFYMNVESVGSIPVDQ -3333--------3333----1111------------1111---------------3333 VVVRGIDTLQKKVASILLALTQMDQD --------------------3333-- >DNA-directed RNA polymera; SWP:P20435; PDB:1TWFF; KAIPKDQRATTPYMTKYERARILGTRALQISMNAPVFVDLEGETDPLRIAMKELAEKKIP ------------------------------------------------------------ LVIRRYLPDGSFEDWSVEELIVDL ------1111-----3333----- >DNA-directed RNA polymera; SWP:P20436; PDB:1TWFH; SNTLFDDIFQVSEVDPGRYNKVCRIEAASTTQDQCKLTLDINVELFPVAAQDSLTVTIAS -----------------------------------------3333--------------- SLTRSWRPPQAGDRSLADDYDYVMYGTAYKFEEVSKDLIAVYYSFGGLLMRLEGNYRNLN ---------1111-------------------------------iiii------------ NLKQENAYLLIRR ------------- >DNA-directed RNA polymera; SWP:P27999; PDB:1TWFI; MTTFRFCRDCNNMLYPREDKENNRLLFECRTCSYVEEAGSPLVYRHELITNIGETAGVVQ ----------------------------------------------------1111--11 DIGSDPTLPRSDRECPKCHSRENVFFQSQQRRKDTSMVLFFVCLSCSHIFTSDQKNKRTQ 11--1111---------------------------------------------------- FS -- >DNA-directed RNA polymera; SWP:P22139; PDB:1TWFJ; MIVPVRCFSCGKVVGDKWESYLNLLQEDELDEGTALSRLGLKRYCCRRMILTHVDLIEKF -------------------------1111------1111---------------3333-- LRYNP ----- >DNA-directed RNA polymera; SWP:P38902; PDB:1TWFK; MNAPDRFELFLLGEGESKLKIDPDTKAPNAVVITFEKEDHTLGNLIRAELLNDRKVLFAA ----3333------------------------------3333------33331111---- YKVEHPFFARFKLRIQTTEGYDPKDALKNACNSIINKLGALKTNFETEWNLQTL ----3333-------------3333----------------------3333--- ---------------------------------------------- >DIAMINOPIMELATE DECARBOXY; SWP:Q58497; PDB:1TWIA; 
MLGNDTVEIKDGRFFIDGYDAIELAEKFGTPLYVMSEEQIKINYNRYIEAFKRWEEETGK 2222-----%%%%--iiii----------------------------------------- EFIVAYAYKANANLAITRLLAKLGCGADVVSGGELYIAKLSNVPSKKIVFNGNCKTKEEI ------3333----------1111--------------1111-1111------------- IMGIEANIRAFNVDSISELILINETAKELGETANVAFRINPNVNPKTHPKISTGLKKNKF -------------------------------------------3333------------- GLDVESGIAMKAIKMALEMEYVNVVGVHCHIGSQLTDISPFIEETRKVMDFVVELKEEGI ---1111--------1111------------------------------------1111- EIEDVNLGGGLGIPYYKDKQIPTQKDLADAIINTMLKYKDKVEMPNLILEPGRSLVATAG -----------------------------------1111------------11111111- YLLGKVHHIKETPVTKWVMIDAGMNDMMRPAMYEAYHHIINCKVKNEKEVVSIAGGLCES -----------1111--------33333333---------------------------11 SDVFGRDRELDKVEVGDVLAIFDVGAYGISMANNYNARGRPRMVLTSKKGVFLIRERETY 11-----------2222----------3333--2222---------3333--------33 ADLIAKDIVPPHLL 33-1111--1111- >INORGANIC PYROPHOSPHATASE; SWP:Q8U438; PDB:1TWLA; NPFHDLEPGPDVPEVVYAIIEIPKGSRNKYELDKKTGLLKLDRVLYSPFFYPVDYGIIPR 3333------------------2222---------------------------------- TWYEDDPFDIMVIMREPVYPLTIIEARPIGLFKMIDSGDKDYKVLAVPVEDPYFKDWKDI ------------------2222-------------iiii-----------3333----11 DDVPKAFLDEIAHFFKRYKELQGKEIIVEGWEGAEAAKREILRAIEMYKEKF 11-3333---------1111-------------------------------- >HYPOTHETICAL PROTEIN YYCE; SWP:P37479; PDB:1TWUA; KRFSSFQAAQIRIARPTGQLDEIIRFYEEGLCLKRIGEFSQHNGYDGVMFGLPHADYHLE --1111-----------------------------------iiii--------------- FTQYEGGSTAPVPHPDSLLVFYVPNAVELAAITSKLKHMGYQEVESENPYWSNGGVTIED -------------1111-------------------1111-------3333--------1 PDGWRIVFMNSKGISGK 111-------------- >ABC transporter, periplas; SWP:Q9KLD9; PDB:1TWYA; SEITISGSTSVARIDVLAEKYNQQHPETYVAVQGVGSTAGISLLKKGVADIATSRYLTES -------3333-------------1111---------------1111----------333 EAQNTLHTFTLAFDGLAIVVNQANPVTNLTREQLYGIYKGQITNWKQVGGNDQKIAVVTR 3-1111--------------3333-------------------3333------------- EASSGTRYSFESLGLTKTVKDREVSDVAPTALVVNSNSKTLVNHNTQAVGFISIGSVDKS 
1111--------------!!!!------------------33331111----3333-111 VKAIQFEKADPTSDNIAKHTYQLSRPFLILHYSDNADEQTKEFIAFLKSESAKKLIVEYG 1----%%%%-----------------------------------------------1111 YIP --- >DHPS, DIHYDROPTEROATE SYN; SWP:Q81VW8; PDB:1TX2A; KWDYDLRCGEYTLNLNEKTLIMGILNVTPDSFSDGGSYNEVDAAVRHAKEMRDEGAHIID -------!!!!----------------------------------------1111----- IGGESFAKVSVEEEIKRVVPMIQAVSKEVKLPISIDTYKAEVAKQAIEAGAHIINDIWGA ----------------------------------------------1111-----1111- KAEPKIAEVAAHYDVPIILMHNRDNMNYRNLMADMIADLYDSIKIAKDAGVRDENIILDP ----------------------------------------------1111-3333----- GIGFAKTPEQNLEAMRNLEQLNVLGYPVLLGTSRKSFIGHVLDLPVEERLEGTGATVCLG 2222------------3333-----------2222---------1111------------ IEKGCEFVRVHDVKEMSRMAKMMDAMIGK 1111------------------------- >P50-RHOGAP; SWP:Q07960; PDB:1TX4A; PLPNQQFGVSLQHLQEKNPEQEPIPIVLRETVAYLQAHALTTEGIFRRSANTQVVREVQQ -1111------------1111-----------------1111-2222------------- KYNMGLPVDFDQYNALHLPAVILKTFLRELPEPLLTFDLYPHVVGFLNIDESQRVPATLQ -1111---3333--------------1111--33331111----3333-3333------- VLQTLPEENYQVLRFLTAFLVQISAHSDQNKMTNTNLAVVFGPNLLWAKDAAITLKAINP -1111------------------------------------1111--------------- INTFTKFLLDHQGELF ---------------- ------------------------------------------------------------ ------------- >RHO GUANINE NUCLEOTIDE EX; SWP:Q9NZN5; PDB:1TXDA; PPNWQQLVSREVLLGLKPCEIKRQEVINELFYTERAHVRTLKVLDQVFYQRVSREGILSP ---3333-11111111-------------------------------------------- SELRKIFSNLEDILQLHIGLNEQMKAVRKRNETSVIDQIGEDLLTWFSGPGEEKLKHAAA ------------------------------3333-------------------------- TFCSNQPFALEMIKSRQKKDSRFQTFVQDAESNPLCRRLQLKDIIPTQMQRLTKYPLLLD --------------------------------3333---33331111------------- NIAKYTEWPTEREKVKKAADHCRQILNFVNQAVKEAENKQRLEDYQRRLDTSSLVEELRN -------------------------------------------------------3333- LDLTKRKMIHEGPLVWKVNRDKTIDLYTLLLEDILVLLQKQDDRLVLRCTFSPVIKLSTV -3333-------------3333---------------------------------3333- 
LVRQVATNKALFVISMSDNGAQIYELVAQTVSEKTVWQDLICRMAASVKEQS -----------------------------3333--------------3333- >GLYCEROL-3-PHOSPHATE DEHY; SWP:O29390; PDB:1TXGA; MIVSILGAGAMGSALSVPLVDNGNEVRIWGTEFDTEILKSISAGREHPRLGVKLNGVEIF ------------------------------1111------1111--1111---------- WPEQLEKCLENAEVVLLGVSTDGVLPVMSRILPYLKDQYIVLISKGLIDFDNSVLTVPEA 1111----2222-------1111-------3333---------------%%%%------- VWRLKHDLRERTVAITGPAIAREVAKRMPTTVVFSSPSESSANKMKEIFETEYFGVEVTT 1111---3333------------1111-----------------------1111------ DIIGTEITSALKNVYSIAIAWIRGYESRKNVEMSNAKGVIATRAINEMAELIEILGGDRE ---------------------------------------------------------333 TAFGLSGFGDLIATFRGGRNGMLGELLGKGLSIDEAMEELERRGVGVVEGYKTAEKAYRL 3--3333------3333---------1111----------1111---3333--------- SSKINADTKLLDSIYRVLYEGLKVEEVLFELATFK ----------------------3333----1111- >translationally controlle; SWP:P84152; PDB:1TXJA; MKVYKDVFTNDEVCSDSYNQEDPFGIADFREIAFEVKSNKRIKGNGMGADVEQVIDIVDS ---------------------22223333------------------------------- FQLTSTSLSKKEYSVYIKNYMQKILKYLEEKKPDRVDVFKTKAQPLIKHILTNFDDFEFY -------------------------------3333-----------------3333---- MGESLDMDAGLTYSYYKGEEVTPRFVYISDGLYEEKF -11111111-------!!!!-------3333------ >GLUCANS BIOSYNTHESIS PROT; SWP:P33136; PDB:1TXKA; FSIDDVAKQAQSLAGKGYETPKSNLPSKYADYQQIQFNHDKAYWNNLKTPFKLEFYHQGY -3333-------1111-----------33333333--11112222--------------- FDTPVKINEVTATAVKRIKYSPDYFTFGDVQHDKDTVKDLGFAGFKVLYPINSKDKNDEI ----------1111------3333--!!!!--11111111------------1111---- VSLGASYFRVIGAGQVYGLSARGLAIDTALPSGEEFPRFKEFWIERPKPTDKRLTIYALL -----------2222-----------2222-----------------1111--------- DSPRATGAYKFVVPGRDTVVDVQSKIYLRDKVGKLGVAPLTSFLFGPNQPSPANNYRPEL -1111----------------------------------------3333----------- HDSNGLSIHAGNGEWIWRPLNNPKHLAVSSFSENPQGFGLLQRGRDFSRFEDLDDRYDLR ---------1111--------------------------------3333------3333- PSAWVTPKGEWGKGSVELVEIPTNDETNDNIVAYWTPDQLPEPGKENFKYTITFSRDEDK -----------------------------------------2222-----------3333 
LHAPDNAWVQQTRRSTGDVKQSNLIRQPDGTIAFVVDFTGAEKKLPEDTPVTAQTSIGDN --1111-----------3333------------------------1111--------111 GEIVESTVRYNPVTKGWRLVRVKVKDAKKTTERAALVNADQTLSETWSYQLPANEVEHHH 1------------------------1111----------------------2222----- >METAL-BINDING PROTEIN YOD; SWP:P76344; PDB:1TXLA; HGKPLTEVEQKAANGVFDDANVQNRTLSDWDGVWQSVYPLLQSGKLDPVFQKKADADKTK -----------------3333------1111------------1111---------1111 TFAEIKDYYHKGYATDIEMIGIEDGIVEFHRNNETTSCKYDYDGYKILTYKSGKKGVRYL ----------------------iiii----!!!!---------------1111------- FECKDPESKAPKYIQFSDHIIAPRKSSHFHIFMGNDSQQSLLNEMENWPTYYPYQLSSEE ----1111--------------------------------3333--------3333---- VVEEMMSH -------- >MAUROTOXIN; SWP:P80719; PDB:1TXM; VSCTGSKDCYAPCRKQTGCPNAKCINKSCKCYGC ----3333-------------------------- >COPROPORPHYRINOGEN III OX; SWP:P11353; PDB:1TXNA; PAPQDPRNLPIRQQEALIRRKQAEITQGLESIDTVKFHADTWTRGNDGGGGTSVIQDGTT ---------3333----------------1111--------------------------- FEKGGVNVSVVYGQLSPAAVSAKADHKNLRLPDGVKFFACGLSVIHPVNPHAPTTHLNYR -------------------------1111-------------------1111-------- YFETWNQDGTPQTWWFGGGADLTPSYLYEEDGQLFHQLHKDALDKHDTALYPRFKKWCDE -----1111------------------3333---------------11113333--3333 YFYITHRKETRGIGGIFFDDYDERDPQEILKVEDCFDAFLPSYLTIVKRRKDPYTKEEQQ 3333-----------------------------------3333----------------- WQAIRRGRYVEFN ------------- >PUTATIVE BACTERIAL ENZYME; SWP:Q8VKT2; PDB:1TXOA; LVLRYAARSDRGLVRANNEDSVYAGARLLALADGGGHAAGEVASQLVIAALAHLDDDEPG -----------------------------------%%%%-------------3333---- GDLLAKLDAAVRAGNSAIAAQVEEPDLEGGTTLTAILFAGNRLGLVHIGDSRGYLLRDGE -----------------------3333-----------!!!!--------------iiii LTQITKDDTFVQTLVDEGRITPEEAHSHPQRSLIRALTGHEVEPTLTREARAGDRYLLCS --------------1111--33331111-1111-----------------2222-----3 DGLSDPVSDETILEALQIPEVAESAHRLIELALRGGGPDNVTVVVADLEH 333-----------1111--------------1111-------------- >RAB5 GDP/GTP EXCHANGE FAC; SWP:Q9UJ41; PDB:1TXUA; ETDRVSKEFIEFLKTFHKTGQEIYKQTKLFLEGHYKRDLSIEEQSECAQDFYHNVAERQT 
3333-------3333--------------------1111--------------------1 RGKVPPERVEKIDQIEKYITRLYKYVFCPETTDDEKKDLAIQKRIRALRWVTPQLCVPVN 111-33333333---------3333---1111---------------1111--------1 EDIPEVSDVVKAITDIIEDSKRVPRDKLACITKCSKHIFNAIKITKNEPASADDFLPTLI 1113333-------------------------------------------1111------ YIVLKGNPPRLQSNIQYITRFCNPSRLTGEDGYYFTNLCCAVAFIEKLDAQSLNLSQEDF ----------------------3333------------------11113333-------3 DRYSG 333-- >INTEGRIN ALPHA-IIB; SWP:P08514; PDB:1TXVA; LNLDPVQLTFYAGPNGSQFGFSLDFHKDSHGRVAIVVGAPRTLGPSQEETGGVFLCPWRA -------------22222222------3333-------1111-1111-----------11 EGGQCPSLLFDLRDETRNVGSQTLQTFKARQGLGASVVSWSDVIVACAPWQHWNVLEKTE 11----------------%%%%-----22222222---------------------!!!! EAEKTPVGSCFLAQPESGRRAEYSPCRGNTLSRIYVENDFSWDKRYCEAGFSSVVTQAGE -----------------------1111---3333---------11112222----3333- LVLGAPGGYYFLGLLAQAPVADIFSSYRPGILLWHVSSQSLSFDSSNPEYFDGYWGYSVA ----1111iiii-----------11112222----1111----------2222------- VGEFDGDLNTTEYVVGAPTWSWTLGAVEILDSYYQRLHRLRAEQMASYFGHSVAVTDVNG ------1111----------%%%%------1111---------2222------------- DGRHDLLVGAPLYMESRADRKLAEVGRVYLFLQPRGPHALGAPSLLLTGTQLYGRFGSAI ---------1111-------------------------------------22222222-- APLGDLDRDGYNDIAVAAPYGGPSGRGQVLVFLGQSEGLRSRPSQVLDSPFPTGSAFGFS ----1111-------------1111---------1111-------------22222222- LRGAVDIDDNGYPDLIVGAYGANQVAVYRAQP -----1111----------------------- >PRIMOSOMAL REPLICATION PR; SWP:P07013; PDB:1TXYA; TNRLVLSGTVCRAPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQPVIVSGHENQAI ------------------1111---------------iiii-------------111133 THSITVGSRITVQGFISCKVLHAEQIELI 33--2222--------------------- >PUTATIVE EXOTOXIN (SUPERA; SWP:Q99QN1; PDB:1TY0A; KSDSENIKDVKLQLNYAYEIIPVDYTNCNIDYLTTHDFYIDISSYKKKNFSVDSEVESYI -----------------------------------------3333--------------3 TTKFTKNQKVNIFGLPYIFTRYDVYYIYGGVTPSVNSNSENSKIVGNLLIDGVQQKTLIN 333-2222-----------------------------------------iiii------- PIKIDKPIFTIQEFDFKIRQYLMQTYKIYDPNSPYIKGQLEIAINGNKHESFNLYDATSS 
--------------------------1111---------------------------111 STRSDIFKKYKDNKTINMKDFSHFDIYLWTK 1--------1111---3333----------- >PHENAZINE BIOSYNTHESIS PR; SWP:Q51793; PDB:1TY9A; GTLDAPFPEYQTLPADPMSVLHNWLERARRVGIREPRALALATADSQGRPSTRIVVISEI ------3333------------------------1111------1111------------ SDAGVVFSTHAGSQKGRELLHNPWASGVLYWRETSQQIILNGQAVRLPNAKADDAWLKRP 1111-----1111------------------1111--------------------11113 YATHPMSSVSRQSEELQDVQAMRNAARQLAELQGPLPRPEGYCVFELRLESLEFWGNGQE 333-------2222--------------3333------2222----------------%% RLHERLRYDRSDTGWNVRRLQP %%--------1111-------- >YJBS; SWP:O31618; PDB:1TYGA; MLTIGGKSFQSRLLLGTGKYPSFDIQKEAVAVSESDILTFAVRRMNIFEASQPNFLEQLD ---iiii--------------3333-----3333-----------1111----3333--3 LSKYTLLPNTAGASTAEEAVRIARLAKASGLCDMIKVEVIGCSRSLLPDPVETLKASEQL 333--------------------------------------3333--------------- LEEGFIVLPYTSDDVVLARKLEELGVHAIMPGASPIGSGQGILNPLNLSFIIEQAKVPVI 1111-----------------1111--------------------------1111----- VDAGIGSPKDAAYAMELGADGVLLNTAVSGADDPVKMARAMKLAVEAGRLSYEAGRIPLK --------------1111------3333-----3333----------------------- QY -- >ThiS protein; SWP:O31617; PDB:1TYGB; MLQLNGKDVKWKKDTGTIQDLLASYQLENKIVIVERNKEIIGKERYHEVELCDRDVIEIV ----------------------1111---------iiii--1111--------------- HFVGG ----- >CELLULOSOMAL SCAFFOLDIN; SWP:Q9FDJ9; PDB:1TYJA; GSVLTAIDNDKVAVGDKVTLTINVDKITNFSGYQFNIKYNTTYLQPWDTIADEAYTDSTM ------------2222-----------------------3333------------1111- PDYGTLLQGRFNATDMSKHNLSQGVLNFGRLYMNLSAYRASGKPESTGAVAKVTFKVIKE --------------------1111--------------3333------------------ IPAEGIKLATFENGSSMNNAVDGTMLFDWDGNMYSSSAYKVVQPGLIYPK -1111-----------2222%%%%----------1111------------ >ISOCITRATE DEHYDROGENASE; SWP:Q9YE81; PDB:1TYOA; SPPCTTEELSPPPGGSLVEYSGGSLRVPDNPVVAFIRGDGVGPEVVESALKVVDAAVKKV ----3333---1111-----iiii--------------!!!!------------------ YGGSRRIVWWELLAGHLAREKCGELLPKATLEGIRLARVALKGPLETPVGTGYRSLNVAI iiii----------------------3333------------------------------ 
RQALDLYANIRPVRYYGQPAPHKYADRVDMVIFRENTEDVYAGIEWPHDSPEAARIRRFL ---------------------1111-------------1111----1111---------- AEEFGISIREDAGIGVKPISRFATRRLMERALEWALRNGNTVVTIMHKGNIMKYTEGAFM --------1111-----------------------------------3333--------- RWAYEVALEKFREHVVTEQEVQEKYGGVRPEGKILVNDRIADNMLQQIITRPWDYQVIVA ----------3333--3333----iiii-2222-----------------3333------ PNLNGDYISDAASALVGGIGMAAGMNMGDGIAVAEPVHGTAPKYAGKDLINPSAEILSAS -------------11111111------2222---------1111---------------- LLIGEFMGWREVKSIVEYAIRKAVQSKKVTQDLARHMPGVQPLRTSEYTETLIAYIDEAD ----1111---------------------33331111----------------------3 LNEVLAG 333---- >TAILSPIKE PROTEIN; SWP:P12528; PDB:1TYV; YSIEADKKFKYSVKLSDYPTLQDAASAAVDGLLIDRDYNFYGGETVDFGGKVLTIECKAK ----1111-----3333-----------------------2222---iiii--------- FIGDGNLIFTKLGKGSRIAGVFMESTTTPWVIKPWTDDNQWLTDAAAVVATLKQSKTDGY ------------2222-------------------1111---------1111-------- QPTVSDYVKFPGIETLLPPNAKGQNITSTLEIRECIGVEVHRASGLMAGFLFRGCHFCKM ---3333--2222----1111--------------------------------------- VDANNPSGGKDGIITFENLSGDWGKGNYVIGGRTSYGSVSSAQFLRNNGGFERDGGVIGF -------------------------------------------------1111------- TSYRAGESGVKTWQGTVGSTTSRNYNLQFRDSVVIYPVWDGFDLGADTRPGDYPITQYPL ------------------------------------------------2222-3333-22 HQLPLNHLIDNLLVRGALGVGFGMDGKGMYVSNITVEDCAGSGAYLLTHESVFTNIAIID 22---------------------------------------------------------- TNTKDFQANQIYISGACRVNGLRLIGIRSLTIDAPNSTVSGITGMVDPSRINVANLAEEG -1111----------------------------1111---------3333---------- LGNIRANSFGYDSAAIKLRIHKLSKTLDSGALYSHINGGAGSGSAYTQLTAISGSTPDAV -------------------3333---------------2222----------%%%%---- SLKVNHKDCRGAEIPFVPDIASDDFIKDSSCFLPYWENNSTSLKALVKKPNGELVRLTLA ----2222-------------1111-----------------------1111-------- TL -- >PUTATIVE SUGAR KINASE; SWP:Q8ZKR2; PDB:1TYYA; NKVWVIGDASVDLVPEKQNSYLKCPGGASANVGVCVARLGGECGFIGCLGDDDAGRFLRQ ------------------------------------------------------------ 
VFQDNGVDVTFLRLDADLTSAVLIVNSFTYLVHPGADTYVSPQDLPPFRQYEWFYFSSIG --1111--1111--1111----------------3333--3333----2222----3333 LTDRPAREACLEGARRREAGGYVLFDVNLRSKWGNTDEIPELIARSAALASICKVSADEL ----------------1111---------------------------------------- CQLSGASHWQDARYYLRDLGCDTTIISLGADGALLITAEGEFHFPAPRVDVVDTTGAGDA -------3333--3333----------!!!!----------------------2222--- FVGGLLFTLSRANCWDHALLAEAISNANACGAAVTAKGATALPFPDQLNTFLS --------3333-----3333-----------3333-------33333333-- >HYPOTHETICAL PROTEIN; SWP:Q81C15; PDB:1TZ0A; GYFIETKTFTVKEGTSNIVVERFTGEGIIEKFEGFIDLSVLVKKVRRGDEEVVVIRWESE -----------2222-----1111---11112222------------------------- EAWKNWETSEEHLAGHRAGRGKPKPDHIINVDHAVYYVKSSKAAYQ ----3333----2222--------1111------------------ >CHIMERA OF NEUROPEPTIDE Y; SWP:P01304; PDB:1TZ4A; YPSKPDNPGEDAPAEDLAQYAADLRHYINLITRQRY ---------------3333----------3333--- >CHIMERA OF PANCREATIC HOR; SWP:P01298; PDB:1TZ5A; APLEPVYPGDNATPEQMARYYSALRRYINMLTRPRY ------------3333-------------------- >4-ALPHA-GLUCANOTRANSFERAS; SWP:O66937; PDB:1TZ7A; HMRLAGILLHVTSLPSPYGIGDLGKEAYRFLDFLKECGFSLWQVLPLNPTSLEAGNSPYS ---------3333----------3333-----------------------3333--1111 SNSLFAGNYVLIDPEELLEEDLIKERDLKRFPLGEALYEVVYEYKKELLEKAFKNFRRFE -------1111-3333-1111--3333------------------------------333 LLEDFLKEHSYWLRDYALYMAIKEEEGKEWYEWDEELKRREKEALKRVLNKLKGRFYFHV 3----------------------3333-1111---------------------------- FVQFVFFKQWEKLRRYARERGISIVGDLPMYPSYSSADVWTNPELFKLDGDLKPLFVAGV -----------------1111--------------------3333---1111-------- PPDFFSKTGQLWGNPVYNWEEHEKEGFRWWIRRVLHNLKLFDFLRLDHFRGFEAYWEVPY -----------------------------------1111--------3333-------22 GEETAVNGRWVKAPGKTLFKKLLSYFPKNPFIAEDLGFITDEVRYLRETFKIPGSRVIEF 22-----------------------1111------------------1111------111 AFYDKESEHLPHNVEENNVYYTSTHDLPPIRGWFENLGEESRKRLFEYLGREIKEEKVNE 1--1111--1111----------1111------1111----------------1111--- ELIRLVLISRAKFAIIQMQDLLNLGNEARMNYPGRPFGNWRWRIKEDYTQKKEFIKKLLG 
----------------3333----3333-------------------------------1 IYGREV 111--- >MANNONATE DEHYDRATASE; SWP:Q82ZC9; PDB:1TZ9A; KWGFRWYGAAGDAIPLKHIRQIPGITGVVGTLLNKLPGDVWTVAEIQALKQSVEQEGLAL --------------11111111-------------2222--3333--------1111--- LGIESVAIHDAIKAGTDQRDHYIDNYRQTLRNLGKCGISLVCYSFKPIFGWAKTDLAYEN ---------3333--------------------1111----------------------1 EDGSLSLLFDQAVVENQPEDYQLIHSWEEERLQQFQELKAYAGVTEEDLVENLRYFLERV 111-------------3333-1111---------------2222-----------3333- IPVCEEENIKGIHPDDPPWEIFGLPRITKNLADLKRILSLVDSPANGITFCTGSLGADPT --------------------iiii-------------3333-3333-----------333 NDLPTIREIGHRINFVHFRNVKYLGEHRFEETAHPSVAGSLDAELQALVDVGYEGVIRPD 3-------3333----------------------3333----------3333-------- HGRAIWDEKAPGYGLYDRAGLTYIQGLYEATKAK ----%%%%-------3333--------------- >APAG PROTEIN; SWP:Q8EB92; PDB:1TZAA; ALDNSIRVEVKTEYIEQQSSPEDEKYLFSYTITIINLGEQAAKLETRHWIITDANGKTSE -1111-----------------------------------------------1111---- VQGAGVVGETPTIPPNTAYQYTSGTVLDTPFGIYGTYGVSESGEHFNAIIKPFRLATPGL -----iiii------------------------------1111----------------- LHLEHHHHHH ---------- >INOSITOL-TRISPHOSPHATE 3-; SWP:P17105; PDB:1TZDA; GLILKRSSEPEHYCVRLADVLRGCVPAFHGVVLQLQDLLDGFDGPCVLDCKGVRTYLEEE ------------------1111--------------1111----------------3333 LTKARERPKLRKDYKKLAVDPEAPTEEEHAPRYQWREGISSSTTLGFRIEGIKKADGSCS -1111-----------------------------------3333---------------- TDFKTTRSREQVTRVFEEFQGDAEVLKRYLNRLQQIRDTLEISDFFRRHEVIGSSLLFVH --1111--------------------------------------3333------------ DHCHRAGVWLIDFGKTTPLPNGQILDHRRPWEEGNREDGYLLGLDNLIGILANLAE 1111---------------2222--------------------------------- >GLUCOSE-1-PHOSPHATE CYTID; SWP:P26396; PDB:1TZFA; MASKAVILAGGLGVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFAN ---------------3333--iiii3333------1111--------2222--------- YFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFT 3333-------1111-------------------1111--------33331111------ YGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMIN 
----------------------------------------iiii---------------- GGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLW --------3333---11111111------------------------------------- EKGKAPWKTWE ----3333--- >FAB 4E10; SWP:NA; PDB:1TZGH; QVQLVQSGAEVKRPGSSVTVSCKASGGSFSTYALSWVRQAPGRGLEWMGGVIPLLTITNY ------------2222-----------------------2222----------------- APRFQGRITITADRSTSTAYLELNSL 3333---------------------- >FAB YADS1 LIGHT CHAIN; SWP:Q6GMX8; PDB:1TZHA; DIQMTQSPSSLSASVGDRVTITCRASQASYSSVAWYQQKPGKAPKLLIYAASYLYSGVPS -------------2222----------1111-------2222------------222233 RFSGSGSGTDFTLTISSLQPEDFATYYCQSSASPATFGQGTKVEIKRTVAAPSVFIFPPS 33----!!!!--------3333-------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTL 3333333---------------------%%%%---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNR --3333----------1111---------- >FAB YADS1 LIGHT CHAIN; SWP:NA; PDB:1TZHB; EVQLVESGGGLVQPGGSLRLSCAASGFDIYDDDIHWVRQAPGKGLEWVAYIAPSYGYTDY ------------2222-----------3333--------2222--------3333----- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRSSDASYSYSAMDYWGQGTLVTV 3333--------1111----------1111---------3333----------------- SSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQ ---------------11113333-------------------%%%%--2222-------3 SSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP 333----------3333------------1111--------- >FAB YADS2 LIGHT CHAIN; SWP:Q6GMX8; PDB:1TZIA; DIQMTQSPSSLSASVGDRVTITCRASQSYAYAVAWYQQKPGKAPKLLIYDASYLYSGVPS -------------2222-------------------------------------222211 RFSGSGSGTDFTLTISSLQPEDFATYYCQQAYSSPDTFGQGTKVEIKRTVAAPSVFIFPP 11----!!!!--------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 33331111---------------------%%%%--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -3333----------------------------- >Vascular endothelial grow; SWP:P15692; PDB:1TZIB; EVQLVESGGGLVQPGGSLRLSCAASGFAIYDYDIHWVRQAPGKGLEWVADIAPYAGATAY 
------------2222-----------3333--------2222--------3333----- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCSRSSYAYYAAMDYWGQGTLVTVSS 3333----------------------1111------------iiii-------------- ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSS -------------1111-----------------------------2222-------333 GLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKS 3-------------------------3333------------ >1-AMINOCYCLOPROPANE-1-CAR; SWP:Q00740; PDB:1TZJA; MNLQRFPRYPLTFGPTPIQPLARLSKHLGGKVHLYAKREDCNSGLAFGGNKTRKLEYLIP -3333--------------------1111--------3333---iiii3333-3333--- EALAQGCDTLVSIGGIQSNQTRQVAAVAAHLGMKCVLVQENWVNYSDAVYDRVGNIQMSR --1111--------1111----------1111--------------1111---------- ILGADVRLVPDRSWEDALESVRAAGGKPYAIPAGCSDHPLGGLGFVGFAEEVRAQEAELG ---------------------1111------2222--1111------------------- FKFDYVVVCSVTGSTQAGMVVGFAADGRADRVIGVDASAKPAQTREQITRIARQTAEKVG ---------------------------1111----------------------------- LERDIMRADVVLDERFAGPEYGLPNEGTLEAIRLCARTEGMLTDPVYEGKSMHGMIEMVR -----1111----------2222-----------------------3333--------11 NGEFPEGSRVLYAHLGGVPALNGYSFIFRDG 11--2222--------333311113333--- >PENICILLIN-INSENSITIVE MU; SWP:P14007; PDB:1TZPA; ATPWQKITQPVPGSAQSIGSFSNGCIVGADTLPIQSEHYQVMRTDQRRYFGHPDLVMFIQ -3333--------------1111-------------------3333-------------- RLSSQVSNLGMGTVLIGDMGMPAGGRFNGGHASHQTGLDVDIFLQLPKTRWTSAQLLRPQ ------1111----------1111------------------------------------ ALDLVSRDGKHVVSTLWKPEIFSLIKLAAQDKDVTRIFVNPAIKQQLCLDAGTDRDWLRK -----1111---1111-3333---------1111--------------------3333-- VRPWFQHRAHMHVRLRCPADSLECEDQPLPPSGDGCGAELQSWFEPLPPSCQALLDEHVI -----------------1111---------------3333-1111--------------- >CATHEPSIN E; SWP:P14091; PDB:1TZSA; KEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRFQPSQSS -1111----------------------------------1111-3333------333311 TYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGILG 11--------------------------------------------3333---------- LGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPGAGSELIFGGYDHSHFSGSLNWV 
--3333-%%%%-------1111--------------------------1111-------- PVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAAPVDG ----------------%%%%---1111-----1111------------------------ EYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDQFCSSGFQGLDIHPPAGPLWILGDV ----33331111------iiii----3333------------------------------ FIRQFYSVFDRGNNRVGLAPAV -3333-----1111-------- >N UTILIZATION SUBSTANCE P; SWP:Q9X286; PDB:1TZVA; MKTPRRRMRLAVFKALFQHEFRRDEDLEQILEEILDETYDKKAKEDARRYIRGIKENLSM ---------------------1111----------1111--------------------- IDDLISRYLEKWSLNRLSVVDRNVLRLATYELLFEKDIPIEVTIDEAIEIAKRYGTENSG -----1111---3333-3333--------------------------------------- KFVNGILDRIAKEHAPKEKFE ---------------3333-- >HISTONE H2A-IV; SWP:P02263; PDB:1TZYA; KAKSRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLTAEILELAGNAARD ------1111------------1111-----3333-----------------------11 NKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQGGVLPNIQAVLLPK 11----3333--------------1111-2222------3333--- >HISTONE H2A-IV; SWP:P02279; PDB:1TZYB; RKESYSIYVYKVLKQVHPDTGISSKAMGIMNSFVNDIFERIAGEASRLAHYNKRSTITSR ----------------3333-----------------------------1111------- EIQTAVRLLLPGELAKHAVSEGTKAVTKYTSS -------------------------------- >HYPOTHETICAL PROTEIN L184; SWP:Q89FH0; PDB:1TZZA; VRIVDVREITKPISSTKMTTSLVAVVTDVVREGKRVVGYGFNSNGRYGQGGLIRERFASR ------------------------------------------------------------ ILEADPKKLLNEAGDNLDPDKVWAAMMINEKPGGHGERSVAVGTIDMAVWDAVAKIAGKP 11113333--1111---------------------3333--------------------- LFRLLAERHGVKANPRVFVYAAGGYYGLSMLRGEMRGYLDRGYNVVKMKIGGAPIEEDRM ------1111-------------------------------------------------- RIEAVLEEIGKDAQLAVDANGRFNLETGIAYAKMLRDYPLFWYEEVGDPLDYALQAALAE --------!!!!------%%%%-------------------------1111------333 FYPGPMATGENLFSHQDARNLLRYGGMRPDRDWLQFDCALSYGLCEYQRTLEVLKTHGWS 3-------1111---------------1111-----3333--------------1111-1 PSRCIPHGGHQMSLNIAAGLGLGGNESYPDLFQPYGGFPDGVRVENGHITMPDLPGIGFE 111------------------------2222-------------%%%%---------333 GKSDLYKEMKALAE 3-3333-------- >CHAPERONE 
PROTEIN HSCA; SWP:P36541; PDB:1U00A; MDVIPLSLGLETMGGLVEKVIPRNTTIPVARAQDFTTFKDGQTAMSIHVMQGERELVQDC ---------------------2222-------------2222-------------3333- RSLARFALRGIPALPAGGAHIRVTFQVDADGLLSVTAMEKSTGVEASIQVKPSYGLTDSE --------------2222---------1111--------1111----------------- IASMIKDSMSYAEQDVKARMLAEQKVEAARVLESLHGALAADAALLSAAERQVIDDAAAH -----------------------------------------3333-3333---------- LSEVAQGDDVDAIEQAIKNVDKQTQDFAARRMDQSVRRALKGHSVDE ---1111---------------------------------------- >trehalose-6-phosphate pho; SWP:NA; PDB:1U02A; SLIFLDYDGTLVPIINPEESYADAGLLSLISDLKERFDTYIVTGRSPEEISRFLPLDINI ------2222-----3333----------------------------------------- CYHGACSKINGQIVYNNGSDRFLGVFDRIYEDTRSWVSDFPGLRIYRKNLAVLYHLGLGA --------iiii---%%%%----------------------------------------- DKPKLRSRIEEIARIFGVETYYGKIIELRVPGVNKGSAIRSVRGERPAIIAGDDATDEAA -----------------------------2222---------!!!!-------------- FEANDDALTIKVGEGETHAKFHVADYIERKILKFIELGVQKK ---1111-----------------3333-------------- >HYPOTHETICAL PROTEIN PF05; SWP:Q8U3D2; PDB:1U04A; SKAIVVINLVKINKKIIPDKIYVYRLYSIYRLAYENVGIVIDPENLIIATTKELEYEGEF ------------1111-----------3333----------------------------- IPEGEISFSELRNDYQSKLVLRLLKENGIGEYELSKLLRKFRKPKTFGDYKVIPSVESVI ------3333--------------1111---------3333-----!!!!---------- KHDEDFYLVIHIIHQIQSKTLWELVNKDPKELEEFLTHKENLLKDIASPLKTVYKPCFEE ------------------------%%%%----------------1111----------22 YTKKPKLDHNQEIVKYWYNYHIERYWNTPEAKLEFYRKFGQVDLKQPAILAKFASKNYKI 22----------------------------------------1111-------------- YLLPQLVVPTYNAEQLAKEILEYTKLPEERKELLENILAEVDSDIIDKSLSEIEVEKIAQ --3333-----3333------1111---------------------------------33 ELENKIRVRDDKGNSVPISQLLWTNYSRKYPVILPYEVPEKFRKIREIPFIILDSGLLAD 33-------------------1111-------------3333-----------3333--- IQNFATNEFRELVKSYYEKVITEDLNSDKGIIEVVEQVSSFKGKELGLAFIAARNKLSSE -----------------------1111--------------------------3333--- KFEEIKRRLFNLNVISQVVNEDTLKNKRDKYDRNRLDLFVRHNLLFQVLSKLGVKYYVLD 
--------3333----------------1111---------------------------- YRFNYDYIIGIDVAPKRSEGYIGGSAVFDSQGYIRKIVPIKIGEQRGESVDNEFFKEVDK ----------------------------1111---------------------3333--- FKEFNIKLDNKKILLLRDGRITNNEEEGLKYISEFDIEVVTDVIKNHPVRAFANKYFNLG -1111--2222-----------------------------------------------ii GAIYLIPHKLKQAKGTPIPIKLAKKRIIKNGKVEKQSITRQDVLDIFILTRLNYGSISAD ii--------------------------iiii---------------3333-iiii---- RLPAPVHYAHKFANAIRNEWKIKEEFLAEGFLYFV ---------------1111---3333-----1111 >SPECTRIN ALPHA CHAIN, BRA; SWP:P07751; PDB:1U06A; ELVLALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVNDRQGFVPAAYVKKL ------------1111---2222----------------!!!!----3333---- >TONB PROTEIN; SWP:P94739; PDB:1U07A; ASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA ---------------------------------1111----------------------- MRRWRYEPGKPGSGIVVNILFKINGTTEIQ 1111-------------------------- >HYPOTHETICAL AMINOTRANSFE; SWP:P77806; PDB:1U08A; PLIPQSKLPQLGTTIFTQMSALAQQHQAINLSQGFPDFDGPRYLQERLAHHVAQGANQYA ------------------------------------------------------------ PMTGVQALREAIAQKTERLYGYQPDADSDITVTAGATEALYAAITALVRNGDEVICFDPS 1111--------------------3333------3333----------2222-------- YDSYAPAIALSGGIVKRMALQPPHFRVDWQEFAALLSERTRLVILNTPHNPSATVWQQAD 1111----------------------------33331111----------------3333 FAALWQAIAGHEIFVISDEVYEHINFSQQGHASVLAHPQLRERAVAVSSFGKTYHMTGWK -------1111-------1111----3333--3333--3333------3333---1111- VGYCVAPAPISAEIRKVHQYLTFSVNTPAQLALADMLRAEPEHYLALPDFYRQKRDILVN -----------------------------------------1111--------------- ALNESRLEILPCEGTYFLLVDYSAVSTLDDVEFCQWLTQEHGVAAIPLSVFCADPFPHKL ---------------------1111---------------------3333---------- IRLCFAKKESTLLAAAERLRQL -----------------3333- >POLYPROTEIN; SWP:Q9QCE2; PDB:1U09A; GLIVDTRDVEERVHVMRKTKLAPTVAHGVFNPEFGPAALSNKDPRLNEGVVLDEVIFSKH -----------------------3333------------1111---2222------3333 KGDTKMSAEDKALFRRCAADYASRLHSVLGTANAPLSIYEAIKGVDGLDAMEPDTAPGLP --------------------------------------------2222-----------3 
WALQGKRRGALIDFENGTVGPEVEAALKLMEKREYKFACQTFLKDEIRPMEKVRAGKTRI 333---3333------------------------------------------1111---- VDVLPVEHILYTRMMIGRFCAQMHSNNGPQIGSAVGCNPDVDWQRFGTHFAQYRNVWDVD ----3333-------------------3333--22223333--------1111------- YSAFDANHCSDAMNIMFEEVFRTEFGFHPNAEWILKTLVNTEHAYENKRITVEGGMPSGC --3333---------------3333-------------------!!!!-----------2 SATSIINTILNNIYVLYALRRHYEGVELDTYTMISYGDDIVVASDYDLDFEALKPHFKSL 222-----------------------1111-----!!!!---------3333-----111 GQTITPADKSDKGFVLGHSITDVTFLKRHFHMDYGTGFYKPVMASKTLEAILSFARRGTI 1-----------------1111--%%%%---------------------------2222- QEKLISVAGLAVHSGPDEYRRLFEPFQGLFEIPSYRSLYLRWVNAVCGDAAALEHH ---------3333---------3333--------------------------1111 >DNA REPLICATION PROTEIN; SWP:P03132; PDB:1U0JA; GMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSLTKTAPDY -------------------------------------------------------3333- LVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFGPATTGKTNI --------3333-------1111------------1111-!!!!---------------- AEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKVRVSA ----3333------3333--1111---------------3333----------------- QIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLDHDFGKVTKQEVKDF -------------------!!!!--1111---1111---------1111----------- FRWAKDHVVEVEHEFYVKKGG --------------------- >GENE PRODUCT PA4716; SWP:Q9HV82; PDB:1U0KA; SRRYWQLDVFAERPLTGNGLAVFDDASALDDAAQAWTRELRQFESIFLLPGDDPRAFRAR -------------------------11113333-------------------1111---- IFTLEEELPFAGHPLLGAAALLHHLRGGDNEQHWTLHLASKSVALRSVRAGSGFYAEDQG -----------3333----------------------3333--------!!!!------- RAEFGATPDAGTCRWFAEAFSLSANDLSGHPPRVVSTGLPYLLLPVTAEALGRARQVNDL -----------------1111-3333---------------------3333--------- QEALDKLGAAFVYLLDVDGREGRTWDNLGLVEDVATGSAAGPVAAYLVEYGLAARGEPFV ----1111--------1111-----1111------3333--------1111--------- LHQGRFLERPSRLDVQVATDGSVRVGGHVQLLARAELLTSA ---1111----------3333-------------------- >PROBABLE GTPASE ENGC; SWP:Q9X242; PDB:1U0LA; 
LRRRGIVVSFHSNMVTVEDEETGERILCKLRGKFRLQNLKIYVGDRVEYTPDETGSGVIE ----------%%%%------------------3333-----2222--------------- NVLHRKNLLTKPHVANVDQVILVVTVKMPETSTYIIDKFLVLAEKNELETVMVINKMDLY -------------------------------------------1111--------3333- DEDDLRKVRELEEIYSGLYPIVKTSAKTGMGIEELKEYLKGKISTMAGLSGVGKSSLLNA --------------1111----------2222--3333----------2222-------- INPGLKLRTTTTAQLLKFDFGGYVVDTPGFANLEINDIEPEELKHYFKEFGDKQCFFSDC -----------------1111--------1111-----33331111-------------- NHVDEPECGVKEAVENGEIAESRYENYVKMFYELLGRR -------------------------------3333--- >PUTATIVE POLYKETIDE SYNTH; SWP:Q9FCA7; PDB:1U0MA; ATLCRPSVSVPEHVITMEETLELARRRHTDHPQLPLALRLIENTGVRTRHIVQPIEDTLE ---------------------------1111----------------------3333--- HPGFEDRNKVYEREAKSRVPAVIQRALDDAELLATDIDVIIYVSCTGFMMPSLTAWLINE ---------------------------1111-1111------------------------ MGFDSTTRQIPIAQLGCAAGGAAINRAHDFCTAYPEANALIVACEFCSLCYQPTDLGVGS --------------!!!!---------------1111--------------3333----- LLCNGLFGDGIAAAVVRGRGGTGVRLERNGSYLIPKTEDWIMYDVKATGFHFLLDKRVPA ------------------------------------1111-----1111-----1111-- TMEPLAPALKELAGEHGWDASDLDFYIVHAGGPRILDDLSTFLEVDPHAFRFSRATLTEY --3333-------------------------------1111-----3333---------- GNIASAVVLDALRRLFDEGGVEEGARGLLAGFGPGITAEMSLGCWQTA ---1111--------3333----------------------------- >von Willebrand factor [Pr; SWP:Q8CIZ8; PDB:1U0OC; FYCSKLLDLVFLLDGSSMLSEAEFEVLKAFVVGMMERLHISQKRIRVAVVEYHDGSRAYL -------------------3333---------3333----1111---------------- ELKARKRPSELRRITSQIKYTGSQVASTSEVLKYTLFQIFGKIDRPEASHITLLLTASQE --------------1111---------3333------1111---1111------------ PPRMARNLVRYVQGLKKKKVIVIPVGIGPHASLKQIRLIEKQAPENKAFLLSGVDELEQR 3333-----------1111-----------------------3333------33331111 RDEIVSYLCDLAPEAP ------1111------ >IMMUNOGLOBULIN HEAVY CHAI; SWP:NA; PDB:1U0QA; VQLQESGGGLVQAGGSLRLSCAASGRTFSTYAVGWFRQAPGKEREFVGYFGTRGGRTYYA -----------2222--------!!!!-----------2222-----------------3 DSVKGRFTIAIDNAKNTVYLQMNSL 333--------1111---------- 
>INORGANIC POLYPHOSPHATE/A; SWP:O33196; PDB:1U0RA; HRSVLLVVHTATETARRVEKVLGDNKIALRVLSAEAVEIEVVDADQHAADGCELVLVLGG --------------------------------3333--------1111------------ DGTFLRAAELARNASIPVLGVNLGRIGFLAEAEAEAIDAVLEHVVAQDYRVEDRLTLDVV --------------------------------3333------------------------ VRQGGRIVNRGWALNEVSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGSTAYAF --iiii---------------------------iiii-------------3333------ SAGGPVLWPDLEAILVVPNNAHALFGRPMVTSPEATIAIEIEADGHDALVFCDGRREMLI -------3333------------------------------1111------%%%%----- PAGSRLEVTRCVTSVKWARLDSAPFTDRLVRKFRLPVTGWR 2222-------------------3333-------------- >CHEMOTAXIS PROTEIN CHEY; SWP:Q56310; PDB:1U0SA; GFKTFYIKVILKEGTQLKSARIYLVFHKLEELKCEVVRTIPSVEEIEEEKFENEVELFVI -----------1111--------------1111------------1111----------- SPVDLEKLSEALSSIADIERVIIKEV ----------1111------------ >INORGANIC POLYPHOSPHATE/A; SWP:O33196; PDB:1U0TA; RSVLLVVHTGRDEATETARRVEKVLGDNKIALRVLSCELVLVLGGDGTFLRAAELARNAS --------3333-------------1111---------------3333------------ IPVLGVNLGRIGFLAEAEAEAIDAVLEHVVAQDYRVEDRLTLDVVVRQGGRIVNRGWALN -----------------3333--------------------------iiii--------- EVSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGSTAYAFSAGGPVLWPDLEAIL -----------------------------------3333-----1111----1111---- VVPNNAHALFGRPMVTSPEATIAIEIEADGHDALVFCDGRREMLIPAGSRLEVTRCVTSV ----------------1111----------------%%%%-----2222----------- KWARLDSAPFTDRLVRKFRLPVTGWRG --------3333---------2222-- >CHALCONE SYNTHASE 2; SWP:P30074; PDB:1U0VA; VSVSEIRKAQRAEGPATILAIGTANPANCVEQSTYPDFYFKITNSEHKTELKEKFQRMCD ------------------------------3333----------1111----------11 KSMIKRRYMYLTEEILKENPNVCEYMAPSLDARQAMLAMEVPRLGKEAAVKAIKEWGQPK 11----------------3333------------------------------------33 SKITHLIVCSTTTPDLPGADYQLTKLLGLRPYVKRVGVFQHGCFAGGTVLRLAKDLAENN 33---------------------------1111--------1111-----------1111 KGARVLVVCSEVTAVTFRGPSDTHLDSLVGQALFGDGAAALIVGSDPVPEIEKPIFEMVW -----------3333-----1111---3333----------------2222--------- 
TAQTIAPDSEGAIDGHLREAGLTFHLKGAVPDIVSKNITKALVEAFEPLGISDYNSIFWI --------2222-----1111-------------------------1111--1111---- AHPGGPAILDQVEQKLALKPEKMNATREVLSEYGNMSSACVLFILDEMRKKSTQNGLKTT ----3333----------3333-------------!!!!-------------1111---- GEGLEWGVLFGFGPGLTIETVVLRSVAI iiii------------------------ >PurE (N5-carboxyaminoimid; SWP:Q2QJL3; PDB:1U11A; SAPVVGIIMGSQSDWETMRHADALLTELEIPHETLIVSAHRTPDRLADYARTAAERGLNV ----------3333-----------1111------------------------1111--- IIAGAGGAAHLPGMCAAWTRLPVLGVPVESRALKGMDSLLSIVQMPGGVPVGTLAIGASG --------------3333--------------iiii------------------------ AKNAALLAASILALYNPALAARLETWRALQTASVPNSPI ------------------------------1111----- >HYPOTHETICAL UPF0244 PROT; SWP:P39432; PDB:1U14A; AHQVISATTNPAKIQAILQAFEEIFGEGSCHITPVAVESGVPEQPFGSEETRAGARNRVD -------------------------2222------------------------------- NARRLHPQADFWVAIEAGIDDDATFSWVVIDNGVQRGEARSATLPLPAVILDRVRQGEAL -----1111----------%%%%-----------------------3333---1111--- GPVSQYTGIDEIGRKEGAIGVFTAGKLTRSSVYYQAVILALSPFHNA ----------3333------1111----------------3333--- >NITROPHORIN 1; SWP:Q26239; PDB:1U17A; MKCTKNALAQTGFNKDKYFNGDVWYVTDYLDLEPDDVPKRYCAALAAGTASGKLKEALYC -------------3333--------------------------------iiii------- YDPKTQDTFYDVSELQEESPGKYTANFKKVEKNGNVKVDVTSGNYYTFTVMYADDSSALI ------------------2222--------1111-------------------1111--- HTCLHKGNKDLGDLYAVLNRNKDTNAGDKVKGAVTAASLKFSDFISTKDNKCEYDNVSLK -----%%%%-----------1111----------1111-3333----------------- SLLTK 3333- >RHODOPSIN; SWP:P02699; PDB:1U19A; MNGTEGPNFYVPFSNKTGVVRSPFEAPQYYLAEPWQFSMLAAYMFLLIMLGFPINFLTLY --------------1111---------1111-3333------------------------ VTVQHKKLRTPLNYILLNLAVADLFMVFGGFTTTLYTSLHGYFVFGPTGCNLEGFFATLG ----1111----------------------------------1111-------------- GEIALWSLVVLAIERYVVVCKPMSNFRFGENHAIMGVAFTWVMALACAAPPLVGWSRYIP ------------------------------------------------3333-------- EGMQCSCGIDYYTPHEETNNESFVIYMFVVHFIIPLIVIFFCYGQLVFTVKEAAAQQQES 
!!!!----------3333------------------------------------------ ATTQKAEKEVTRMVIIMVIAFLICWLPYAGVAFYIFTHQGSDFGPIFMTIPAFFAKTSAV -------------------------------------2222--3333------------- YNPVIYIMMNKQFRNCMVTTLCCGKNPLGDDEASTTVSKTETSQVAPA ------------------------------------------------ >MYO-INOSITOL-1-PHOSPHATE ; SWP:O28480; PDB:1U1IA; MKVWLVGAYGIVSTTAMVGARAIERGIAPKIGLVSELPHFEGIEKYAPFSFEFGGHEIRL ------1111-------------------2222111111113333--------------- LSNAYEAAKEHWELNRHFDREILEAVKSDLEGIVARKGTALNCGSGIKELGDIKTLEGEG --3333-------------------3333--------------------------3333- LSLAEMVSRIEEDIKSFADDETVVINVASTEPLPNYSEEYHGSLEGFERMIDEDRKEYAS --------1111------------------------3333----------11113333-- ASMLYAYAALKLGLPYANFTPSPGSAIPALKELAEKKGVPHAGNDGKTGETLVKTTLAPM ----------------------!!!!3333---3333----------------------- FAYRNMEVVGWMSYNILGDYDGKVLSARDNKESKVLSKDKVLEKMLGYSPYSITEIQYFP -1111-------------3333------3333--------3333---------------- SLVDNKTAFDFVHFKGFLGKLMKFYFIWDAIDAIVAAPLILDIARFLLFAKKKGVKGVVK -!!!!----------2222------------11113333--------------------- EMAFFFKSPMDTNVINTHEQFVVLKEWYSNLK -3333--------------------------- >5-methyltetrahydropteroyl; SWP:O50008; PDB:1U1JA; ASHIVGYPRGPKRELKFALESFWDGKSTAEDLQKVSADLRSSIWKQSAAGTKFIPSNTFA ---------------------------3333----------------------------- HYDQVLDTTALGAVPPRYGYTGGEIGLDVYFSARGNASVPAETKWFDTNYHYIVPELGPE --------------3333---------------------------------------111 VNFSYASHKAVNEYKEAKALGVDTVPVLVGPVSYLLLSKAAKGVDKSFELLSLLPKILPI 1---------------------------------1111------333333333333---- YKEVITELKAAGATWIQLDEPVLVDLEGQKLQAFTGAYAELESTLSGLNVLVETYFADIP --------1111----------------------------3333---------------- AEAYKTLTSLKGVTAFGFDLVRGTKTLDLVKAGFPEGKYLFAGVVDGRNIWANDFAASLS ------------------------------------------------------------ TLQALEGIVGKDKLVVSTSCSLLHTAVDLINETKLDDEIKSWAFAAQKVVEVNALAKALA ---------------------1111--3333----3333------------------111 GQKDEALFSANAAALASRRSSPRVTNEGVQKAAAALKGSDHRRATNVSARLDAQQKKLNL 
1--3333--------------------------------------3333----------- PILPTTTIGSFPQTVELREDYVKAIKEEIKKVVDLQEELDIDVLVHGEPERNDVEYFGEQ -----------------------------------------------1111--------- LSGFAFTANGWVQSYGSRCVKPPVIYGDVSRPKATVFWSAAQSTSRPKGLTGPVTILNWS 1111----------!!!!------------------3333-------------------- FVRNDQPRHETCYQIALAIKDEVEDLEKGGIGVIQIDEAALREGLPLRKSEHAFYLDWAV --------------------------1111---------1111----1111--------- HSFRITNCGVQDSTQIHTHCYSHFNDIIHSIIDDADVITIENSRSDEKLLSVFREGVKYG ------1111-------------11113333---------------3333---2222--- AGIGPGVYDIHSPRIPSSEEIADRVNKLAVLEQNILWVNPDCGLKTRKYTEVKPALKNVD ----------------3333-----------------------11111111--------- AAKLIRSQ ----3333 >HFQ PROTEIN; SWP:Q9HUM0; PDB:1U1TA; SLQDPYLNTLRKERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPS ----------1111------1111-------------------------3333------- RPVRLP ------ >(3R)-hydroxymyristoyl-[ac; SWP:Q9HXY7; PDB:1U1ZA; DINEIREYLPHRYPFLLVDRVVELDIEGKRIRAYKNVSINEPFFNGHFPEHPIPGVLIIE 3333---------------------1111-----------1111---2222--------- AAQAAGILGFKLDVKPTLYYFVGSDKLRFRQPVLPGDQLQLHAKFISVKRSIWKFDCHAT ---------------------------------2222-----------iiii-------- VDDKPVCSAEIICAERK iiii------------- >U8 SNORNA-BINDING PROTEIN; SWP:Q6TEC1; PDB:1U20A; DKPRPRNISREESLQLEGYKHACHALLHAPSQAKLFDRVPIRRVLLMMMRFDGRLGFPGG ---------------------------------2222------------1111------- FVDTRDISLEEGLKRELEEELGPALATVEVTEDDYRSSQVREHPQKCVTHFYIKELKLEE --3333---------------3333-----3333-------------------------- IERIEAEAVNAKDHGLEVMGLIRVPLYTLRDRVGGLPAFLCNNFIGNSKSQLLYALRSLK ------33331111--------------1111--33331111--!!!!------------ LLREDQIQEVLKASHR ---------------- >CYTOHESIN 2; SWP:P97695; PDB:1U29A; PDREGWLLKLGGGRVKTWKRRWFILTDNCLYYFEYTTDKEPRGIIPLENLSIREVDDPRK -------------------------%%%%-----1111--------2222---------- PNCFELYIPNNKGQLIKACKTEADGRVVEGNHMVYRISAPTQEEKDEWIKSIQAAVSVD -------2222----------1111---------------------------------- >DYSTROGLYCAN; SWP:Q62165; PDB:1U2CA; 
AVPTVVGIPDGTAVVGRSFRVSIPTDLIASSGEIIKVSAAGKEALPSWLHWDPHSHILEG -------------2222------3333-----------2222---1111----------- LPLDTDKGVHYISVSAARLGANGSHVPQTSSVFSIEVYPEDHNACAADEPVTVLTVILDA --1111-------------1111------------------------------------- DLTKMTPKQRIDLLNRMQSFSEVELHNMKLVPVVNNRLFDMSAFMAGPGNAKKVVENGAL 3333-------------------3333------%%%%--1111----------------- LSWKLGCSLNQNSVPDIRGVETPAREGAMSAQLGYPVVGWHIANKKPT ---------3333---3333---------------------------- >2-hydroxy-6-ketonona-2,4-; SWP:P77044; PDB:1U2EA; QPQTEAATSRFLNVEEAGKTLRIHFNDCGQGDETVVLLHGSGPGATGWANFSRNIDPLVE ---3333--------iiii----------------------22223333-1111------ AGYRVILLDCPGWGKSDSVVNSGSRSDLNARILKSVVDQLDIAKIHLLGNSMGGHSSVAF ---------2222----------3333----------1111------------------- TLKWPERVGKLVLMGGGTGGMSLFTPMPTEGIKRLNQLYRQPTIENLKLMMDIFVFDTSD ---3333------------------------------------------3333---3333 LTDALFEARLNNMLSRRDHLENFVKSLEANPKQFPDFGPRLAEIKAQTLIVWGRNDRFVP -------------------------------------1111-----------1111---- MDAGLRLLSGIAGSELHIFRDCGHWAQWEHADAFNQLVLNFLARP ----------2222----------3333------------1111- >AORTIC PREFERENTIALLY EXP; SWP:Q15772; PDB:1U2HA; KAPPTFKVSLMDQSVREGQDVIMSIRVQGEPKPVVSWLRNRQPVRPDQRRFAEEAEGGLC ---------------2222-------------------%%%%----1111-----iiii- RLRILAAERGDAGFYTCKAVNEYGARQCEARLEVRG -------3333---------1111------------ >PEROXIDASE/CATALASE HPI; SWP:P13029; PDB:1U2KA; QDPLPQPIYNPTEQDIIDLKFAIADSGLSVSELVSVAWASASTFRGGDKRGGANGARLAL ----------------------------------------11112222----22221111 MPQRDWDVNAAAVRALPVLEKIQKESGKASLADIIVLAGVVGVEKAASAAGLSIHVPFAP -33333333-1111---------------------------------1111--------- GRVDARQDQTDIEMFELLEPIADGFRNYRARLDVSTTESLLIDKAQQLTLTAPEMTALVG ----------33333333----3333-------------------1111----------- GMRVLGANFDGSKNGVFTDRVGVLSNDFFVNLLDMRYEWKATDESKELFEGRDRETGEVK --1111------2222---2222----------3333-----3333-------------- FTASRADLVFGSNSVLRAVAEVYASSDAHEKFVKDFVAAWVKVMNLDRFDLL ---33333333---------1111------------------1111------ >HISTONE-LIKE 
PROTEIN HLP-; SWP:P11457; PDB:1U2MA; GADKIAIVNGSLFQQVAQKTGVSNTLERARRSNEERGKLVTRIQTAVKSVANSQDIDLVV ---------------------3333----------------------------------- DANAVAYNSSDVKDITADVLKQVK 3333----1111------------ >low molecular weight prot; SWP:Q10507; PDB:1U2PA; PLHVTFVCTGNICRSPMAEKMFAQQLRHRGLGDAVRVTSAGTGNWHVGSCADERAAGVLR --------------------------11111111-----------2222----------1 AHGYPTDHRAAQVGTEHLAADLLVALDRNHARLLRQLGVEAARVRMLRSFDPRSGTHALD 111----------3333-----------------1111-3333--3333-1111------ VEDPYYGDHSDFEEVFAVIESALPGLHDWVDERLAR ---11113333---------------------3333 >CADMIUM EFFLUX SYSTEM ACC; SWP:P20047; PDB:1U2WA; GYDEEKVNRIQGDLQTVDISGVSQILKAIADENRAKITYALCQDEELCVCDIANILGVTI -------------1111------------------------------------------- ANASHHLRTLYKQGVVNFRLALYSLGDEHIRQIMMIALAHKKEVK ----------1111-------------------------1111-- >ADP-SPECIFIC PHOSPHOFRUCT; SWP:O59355; PDB:1U2XA; IPEHLSIYTAYNANIDAIVKLNQETIQNLINAFDPDEVKRRIEEYPREINEPIDFVARLV -1111------------------------33333333-------------3333------ HTLKLGKPAAVPLVNEKNEWFDKTFRYEEERLGGQAGIIANTLAGLKIRKVIAYTPFLPK -------------------------------------------3333-----------33 RLAELFKKGVLYPVVENGELQFKPIQEAYREGDPLKINRIFEFRKGLKFKLGDETIEIPN 3311112222-----iiii----3333--2222--------------------------- SGRFIVSARFESISRIETREDIKPFLGEIGKEVDGAIFSGYQGLRTKYSDGKDANYYLRR ---------3333-----33331111---1111------3333----1111--------- AKEDIIEFKEKDVKIHVEFASVQDRKLRKKIITNILPFVDSVGIDEAEIAQILSVLGYRE ----------------------------------3333---------------1111--- LADRIFTYNRLEDSILGGIILDELNFEILQVHTTYYLYITHRDNPLSEEELAKSLEFGTT --------------------------------1111----1111---------------- LAAARASLGDIRGPDDYKVGLKVPFNERSEYVKLRFEEAKSRLRREYKVVVIPTRLVQNP ------------3333-3333----1111------------------------------- VLTVGLGDTISAGAFLTYLEFLKRH --2222------------------- >Histone-lysine N-methyltr; SWP:Q04089; PDB:1U2ZA; SSTFVDWNGPCLRLQYPLFDIEYLRSHEIYSGTPIQSISLRTTAKLQSILFSNYMEEYKV -----1111--------------------------3333-----------1111------ 
DFKRSTAIYNPMSEIGKLIEYSCLVFLPSPYAEQLKETILPDLNASFDNSDTKGFVNAIN ------------------------------------------------------------ LYNKMIREIPRQRIIDHLETIDKIPRSFIHDFLHIVYTRSIHPQANKLKHYKAFSNYVYG -----11113333----1111---3333------------3333-1111----3333--- ELLPNFLSDVYQQCQLKKGDTFMDLGSGVGNCVVQAALECGCALSFGCEIMDDASDLTIL -----------1111-2222------!!!!------------------------------ QYEELKKRCKLYGMRLNNVEFSLKKSFVDNNRVAELIPQCDVILVNNFLFDEDLNKKVEK -----------------------------------3333-------1111---------1 ILQTAKVGCKIISLKSLRSLTYQINFYNVENIFNRLKVQRYDLKEDSVSWTHSGGEYYIS 111--2222---------1111-----11113333------------1111--------- TVMEDVDESLFSPARVKYT ------3333--------- >Core histone macro-H2A.1; SWP:O75367; PDB:1U35C; KTSRSAKAGVIFPVGRMLRYIKKGHPKYRIGVGAPVYMAAVLEYLTAEILELAVNAARDN ------------------------------3333-------------------------- KKGRVTPRHILLAVANDEELNQLLKGVTIASGGVLPNIHPELLAKK ----------------------------2222------3333---- >NUCLEAR FACTOR NF-KAPPA-B; SWP:P25799; PDB:1U36A; NLKIVRMDRTAGCVTGGEEIWLLCDKVQKDDIQIRFYEEEVWEGFGDFSPTDVHRQFAIC ------------3333-----------1111-----------------3333-iiii--- FKTPKYKDVNITKPASVFVQLRRKSDLETSEPKPFLYYPE ---------------------------------------- >amyloid beta A4 precursor; SWP:Q02410; PDB:1U39A; PPVTTVLIRRPDLRYQLGFSVQNGIICSLMRGGIAERGGVRVGHRIIEINGQSVVATPHE --------------------------------3333------------iiii-------- KIVHILSNAVGEIHMKTMPA -------------------- >CRYPTOCHROME 1 APOPROTEIN; SWP:Q43125; PDB:1U3DA; CSIVWFRRDLRVEDNPALAAAVRAGPVIALFVWAPEEEGHYHPGRVSRWWLKNSLAQLDS ---------------------1111--------3333!!!!------------------- SLRSLGTCLITKRSTDSVASLLDVVKSTGASQIFFNHLYDPLSLVRDHRAKDVLTAQGIA --1111---------------------------------3333----------------- VRSFNADLLYEPWEVTDELGRPFSMFAAFWERCLSMPYDPESPLLPPKKIISGDVSKCVA ----------3333--3333---------------------------------1111--- DPLVFEDDSEKGSNALLARAWSPGWSNGDKALTTFINGPLLEYSKNRRKADSATTSFLSP -----------11113333------------------3333-1111-------------- HLHFGEVSVRKVFHLVRIKQVAWANEGNEAGEESVNLFLKSIGLREYSRYISFNHPYSHE 
--------------------------------------------------------1111 RPLLGHLKFFPWAVDENYFKAWRQGRTGYPLVDAGMRELWATGWLHDRIRVVVSSFFVKV ----1111---------------------------------------------------- LQLPWRWGMKYFWDTLLDADLESDALGWQYITGTLPDSREFDRIDNPQFEGYKFDPNGEY ---------------1111-----------------------------------1111-- VRRWLPELSRLPTDWIHHPWNAPESVLQAAGIELGSNYPLPIVGLDEAKARLHEALSQMW ----3333---3333--3333-3333-1111-2222------------------------ QLEAA 3333- >DNA endonuclease I-HmuI; SWP:P34081; PDB:1U3EM; MEWKDIKGYEGHYQVSNTGEVYSIKSGKTLKHQIPKDGYHRIGLFKGGKGKTFQVHRLVA -----2222------1111----1111-------1111---------------3333--- IHFCEGYEEGLVVDHKDGNKDNNLSTNLRWVTQKINVENQMSRGTLNVSKAQQIAKIKNQ -------2222---11111111-3333-----------------------------1111 KPIIVISPDGIEKEYPSTKCACEELGLTRGKVTDVLKGHRIHHKGYTFRYKLNG ------1111--------------------------------iiii-------- >T-CELL RECEPTOR ALPHA-CHA; SWP:Q5R1B3; PDB:1U3HA; QVRQSPQSLTVWEGETAILNCSYENSAFDYFPWYQQFPGEGPALLISILSVSDKKEDGRF ------------------------1111--------2222--------3333-------- TIFFNKREKKLSLHIADSQPGDSATYFCAASANSGTYQRFGTGTKLQVVP -------------------------------------------------- >GLUTATHIONE S-TRANSFERASE; SWP:P09792; PDB:1U3IA; EHIKVIYFDGRGRAESIRMTLVAAGVDYEDERISFQDWPKIKPTIPGGRLPAVKVTDDHG ----------!!!!-------------------333333331111--------------- HVKWMLESLAIARYMAKKHHMMGETDEEYYSVEKLIGQAEDVEHEYHKTLMKPQEEKEKI ------1111------1111--------------------------1111---------- TKEILNGKVPVLFNMICESLKGSTGKLAVGDKVTLADLVLIAVIDHVTDLDKGFLTGKYP -------------------1111----------3333-------------1111222233 EIHKHRENLLASSPRLAKYLSNRPATPF 33-------------------------- >PRION-LIKE PROTEIN; SWP:P27177; PDB:1U3MA; GSVVGGLGGYAMGRVMSGMNYHFDRPDEYRWWSENSARYPNRVYYRDYSSPVPQDVFVAD ------------------------3333------------------------3333---- CFNITVTEYSIGPAAKKNTSEAVAAANQTEVEMENKVVTKVIREMCVQQYREYRLAS --------------------------------------------------------- >Huntingtin-associated pro; SWP:P97924; PDB:1U3OA; SGGCELTVVLQDFSAAHSSELSIQVGQTVELLERPSERPGWCLVRTTERSPPQEGLVPSS 
----------------------------------3333-------------------333 TLCISHS 3------ >ALCOHOL DEHYDROGENASE ALP; SWP:P07327; PDB:1U3TA; STAGKVIKCKAAVLWELKKPFSIEEVEVAPPKAHEVRIKMVAVGICGTDDHVVSGTMVTP -2222------------------------------------------------------- LPVILGHEAAGIVESVGEGVTTVKPGDKVIPLAIPQCGKCRICKNPESNYCLKNDVSNPQ ----------------2222---2222------------3333-------1111------ GTLQDGTSRFTCRRKPIHHFLGISTFSQYTVVDENAVAKIDAASPLEKVCLIGCGFSTGY --1111-----%%%%----iiii---------1111----111133333333-------- GSAVNVAKVTPGSTCAVFGLGGVGLSAIMGCKAAGAARIIAVDINKDKFAKAKELGATEC ---------2222-------------------------------3333----1111---- INPQDYKKPIQEVLKEMTDGGVDFSFEVIGRLDTMMASLLCCHEACGTSVIVGVPPDSQN -3333---3333-----iiii-----------------1111------------------ LSMNPMLLLTGRTWKGAILGGFKSKECVPKLVADFMAKKFSLDALITHVLPFEKINEGFD -------3333------%%%%--3333-------1111---3333-----3333------ LLHSGKSIRTILMF -------------- >ALCOHOL DEHYDROGENASE BET; SWP:P00325; PDB:1U3UA; STAGKVIKCKAAVLWEVKKPFSIEDVEVAPPKAYEVRIKMVAVGICRTDDHVVSGNLVTP -2222--------------------------2222----------3333----------- LPVILGHEAAGIVESVGEGVTTVKPGDKVIPLFTPQCGKCRVCKNPESNYCLKNDLGNPR ----------------2222---2222------------3333-1111--1111------ GTLQDGTRRFTCRGKPIHHFLGTSTFSQYTVVDENAVAKIDAASPLEKVCLIGCGFSTGY --1111-----iiii----iiii---------1111----111133333333-------- GSAVNVAKVTPGSTCAVFGLGGVGLSAVMGCKAAGAARIIAVDINKDKFAKAKELGATEC ---------2222-------------------------------1111----1111---- INPQDYKKPIQEVLKEMTDGGVDFSFEVIGRLDTMMASLLCCHEACGTSVIVGVPPASQN -3333---3333-----iiii-----------------1111-------------2222- LSINPMLLLTGRTWKGAVYGGFKSKEGIPKLVADFMAKKFSLDALITHVLPFEKINEGFD ---33333333-----------3333--------1111---3333-----3333------ LLHSGKSIRTVLTF -------------- >ALCOHOL DEHYDROGENASE GAM; SWP:P00326; PDB:1U3WA; STAGKVIKCKAAVLWELKKPFSIEEVEVAPPKAHEVRIKMVAAGICRSDEHVVSGNLVTP -2222----------2222----------------------------------------- LPVILGHEAAGIVESVGEGVTTVKPGDKVIPLFTPQCGKCRICKNPESNYCLKNDLGNPR ----------------1111---2222------------3333-1111--1111------ 
GTLQDGTRRFTCSGKPIHHFVGVSTFSQYTVVDENAVAKIDAASPLEKVCLIGCGFSTGY --1111-----iiii----iiii---------1111----11113333------------ GSAVKVAKVTPGSTCAVFGLGGVGLSVVMGCKAAGAARIIAVDINKDKFAKAKELGATEC ---------2222-------------------------------3333----1111---- INPQDYKKPIQEVLKEMTDGGVDFSFEVIGQLDTMMASLLCCHEACGTSVIVGVPPDSQN -3333------------iiii-----------------1111------------------ LSINPMLLLTGRTWKGAIFGGFKSKESVPKLVADFMAKKFSLDALITNVLPFEKINEGFD ---33331111------%%%%-3333--------1111---3333-----3333------ LLRSGKSIRTVLTF -1111--------- >ACTIVATED CDC42 KINASE 1; SWP:Q07912; PDB:1U46A; LTCLIGEKDLRLLEKLGGVVRRGEWDAPSGKTVSVAVKCPEAMDDFIREVNAMHSLDHRN -----3333-----------------1111----------------------3333-111 LIRLYGVVLTPPMKMVTELAPLGSLLDRLRKHQGHFLLGTLSRYAVQVAEGMGYLESKRF 1------------------1111--------!!!!------------------------- IHRDLAARNLLLATRDLVKIGDFGLMRALPQDDHYVMVPFAWCAPESLKTRTFSHASDTW -----3333----1111-----1111------------3333------------------ MFGVTLWEMFTYGQEPWIGLNGSQILHKIDKEGERLPRPEDCPQDIYNVMVQCWAHKPED --------1111----2222------------------2222--------3333--1111 RPTFVALRDFLLEAQ -----------3333 >CELL CYCLE ARREST PROTEIN; SWP:P26449; PDB:1U4CA; QIVQIEQAPKDYISDIKIIPSKSLLLITSWDGSLTVYKFDIQAKNVDLLQSLRYKHPLLC -------------------1111------------------------------------- CNFIDNTDLQIYVGTVQGEILKVDLIGSPSFQALTNNEANLGICRICKYGDDKLIAASWD ---------------------------------------------------------111 GLIEVIDPRNYGDGVIAVKNLNSNNTKVKNKIFTDTNSSRLIVGNNSQVQWFRLPLCNGT 1-----3333-------------------------------------------------- IEESGLKYQIRDVALLPKEQEGYACSSIDGRVAVEFFDDSSKRFAFRCHRNLAYPVNSIE --------------------------1111------------------------------ FSPRHKFLYTAGSDGIISCWNLQTRKKIKNFAKFNEDSVVKIACSDNILCLATSDDTFKT -----------1111----------------------------------------3333- NAATIELNASSIYIIFDYE ------------------- >ELASTASE; SWP:P14756; PDB:1U4GA; AEAGGPGGNQKIGKYTYGSDYGPLIVNDRCEMDDGNVITVDMNSSTDDSKTTPFRFACPT ---------------2222-------1111-----------iiii-1111---------- 
NTYKQVNGAYSPLNDAHFFGGVVFKLYRDWFGTSPLTHKLYMKVHYGRSVENAYWDGTAM -----iiii--------------------------------------------------- LFGDGATMFYPLVSLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRG ------------------------------------------------------------ KNDFLIGYDIKKGSGALRYMDQPSRDGRSIDNASQYYNGIDVHHSSGVYNRAFYLLANSP ----2222-------------3333------3333-22221111--------------22 GWDTRKAFEVFVDANRYYWTATSNYNSGACGVIRSAQNRNYSAADVTRAFSTVGVTCP 22---------------------3333---------1111----------1111---- >PHOSPHOLIPASE A2 ISOFORM ; SWP:Q6SLM1; PDB:1U4JA; NLKQFKNMIQCAGTRTWTSYIGYGCYCGYGGSGTPVDELDRCCYTHDHCYNKAANIPGCN ---------------3333-----------------3333-----------3333----1 PLIKTYSYTCTKPNITCNDTSDSCARFICDCDRTAAICFASAPYNINNIMISASTSCQ 111-----------------------------------------3333--11111111 >CARBOXYLESTERASE EST2; SWP:Q7SIG1; PDB:1U4NA; LDPVIQQVLDQLNRMPAPDYKHLSAQQFRSQQSLFPPVKKEPVAEVREFDMDLPGRTLKV ------------------3333------------------------------2222---- RMYRPEGVEPPYPALVYYHGGGWVVGDLETHDPVCRVLAKDGRAVVFSVDYRLAPEHKFP --------------------iiii--3333---------1111----------------- AAVEDAYDALQWIAERAADFHLDPARIAVGGDSAGGNLAAVTSILAKERGGPALAFQLLI ------------1111-1111-1111---------------------------------- YPSTGYDPAHPPASIEENAEGYLLTGGMSLWFLDQYLNSLEELTHPWFSPVLYPDLSGLP ------3333-3333-----------------------3333--1111-1111--2222- PAYIATAQYDPLRDVGKLYAEALNKAGVKVEIENFEDLIHGFAQFYSLSPGATKALVRIA ---------1111----------1111-----------22221111-------------- EKLRDALA -------- >SMALL INDUCIBLE CYTOKINE ; SWP:P13501; PDB:1U4RA; SSDTTPCCFAYIARPLPRAHIKEYFYTSGKCSNPAVVFVTNAQVCANPEKKWVREYINSL ----------------3333-------3333---------------11113333------ EM -- >SECRETED PROTEIN ASP-2; SWP:Q7Z1H1; PDB:1U53A; FGCPDNGMSEEARQKFLEMHNSLRSSVALGQAKDGAGGNAPKAAKMKTMAYDCEVEKTAM --------------------------1111---3333----------------------- NNAKQCVFKHSQPNQRKGLGENIFMSSDSGMDKAKAAEQASKAWFGELAEKGVGQNLKLT -3333------33332222--------1111----------------------------3 GGLFSRGVGHYTQMVWQETVKLGCYVEACSNMCYVVCQYGPAGNMMGKDIYEKGEPCSKC 
333----3333----1111-------------------------2222------2222-- ENCDKEKGLCSA ----1111---- >HEME-BASED METHYL-ACCEPTI; SWP:Q8RBX6; PDB:1U55A; MKGTIVGTWIKTLRDLYGNDVVDESLKSVGWEPDRVITPLEDIDDDEVRRIFAKVSEKTG --------------------------1111-------1111--3333------------- KNVNEIWREVGRQNIKTFSEWFPSYFAGRRLVNFLMMMDEVHLQLTKMIKGATPPRLIAK ---------------------3333------------------1111------------- PVAKDAIEMEYVSKRKMYDYFLGLIEGSSKFFKEEISVEEVERGEKDGFSRLKVRIKFKN ---------------------------------------------iiii----------- PVFEYKKN -------- >GAG POLYPROTEIN; SWP:Q70622; PDB:1U57A; LEEMMTACQGVGGPGHKARVLAEAMSQVTNSATIMMQRGNFRNQRKIV --3333----------------------------------3333---- >MHC-I HOMOLOG M144; SWP:Q1XE09; PDB:1U58A; ESGLRYAYTLVVDGTANTRRCFGTGHVDGEAFVGYSNNKTHGIGRWVNASHVEEENKEFV --------------------------iiii-----%%%%--------------------- RQCKELQAELDKMQNNSKVIGVKTVQLDVGCTSKIEKHYAYDGNETECQKKLTEYRKLVL ------------------2222------------------iiii-------------333 ASAVSPQLEVERRSSGREGGMRLRCFARDYYPADLEIRWWKDDGGGGALPQTSKQHHDPL 3----------------------------------------------------------- PSGNGLYQKHIDVYVDGGLEHVYSCRVKGIATGLELQIVRWK ---------------22221111-----3333---------- >TYROSINE-PROTEIN KINASE Z; SWP:P43403; PDB:1U59A; KKLFLKRDNLLIADIELGCGNFGSVRQGVYRKQIDVAIKVLKQGTEKADTEEMMREAQIM -----3333----------3333-----------------------------------33 HQLDNPYIVRLIGVCQAEALMLVMEMAGGGPLHKFLVGKREEIPVSNVAELLHQVSMGMK 33------------------------3333--------1111------------------ YLEEKNFVHRDLAARNVLLVNRHYAKISDFGLSKALGADDSYYTARSAGKWPLKWYAPEC ------------3333-------------1111--!!!!------------3333-3333 INFRKFSSRSDVWSYGVTMWEALSYGQKPYKKMKGPEVMAFIEQGKRMECPPECPPELYA ------3333-----------1111--------!!!!-------------2222------ LMSDCWIYKWEDRPDFLTVEQRMRACYYSLASKVEGHHHHHH ---1111-3333------------------1111----2222 >SRC KINASE-ASSOCIATED PHO; SWP:Q86WV1; PDB:1U5DA; GSVIKQGYLEKKSKDHSFFGSEWQKRWCVVSRGLFYYYANEKSKQPKGTFLIKGYSVRMA ------------------------------2222-----1111--------2222----- 
PHLRRDSKKESCFELTSQDRRTYEFTATSPAEARDWVDQISFLLKDLS -----1111--------------------------------------- >CITE; SWP:O06162; PDB:1U5HA; MNLRAAGPGWLFCPADAPEAFAAAAAAADVVILDLEDGVAEAQKPAARNALRDTPLDPER -3333--------11111111--------------33333333-------------1111 TVVRINAGGTADQARDLEALAGTAYTTVMLPKAESAAQVIELAPRDVIALVETARGAVCA ------2222--------3333------------33331111------------------ AEIAAADPTVGMMWGAEDLIATLGGSSSRRADGAYRDVARHVRSTILLAASAFGRLALDA -----3333--------------------1111-----------------1111------ VHLDILDVEGLQEEARDAAAVGFDVTVCIHPSQIPVVRKAYAA ---1111-----------1111-------3333-----1111- >HYPOTHETICAL PROTEIN; SWP:Q9RW50; PDB:1U5KA; RSRTANRSGIVIRRRVTPAGDIIVTLLTPQGKLKAIARGGVKGPLSSSLNLFHHVGVQVY ----------------1111-------1111-------3333--3333-2222------- QGPHDLASVKQAVLEGALPTLAEPERYAFAHLMAEFADALFQEGEFSEQAFDLFAASLRG -----------------3333--------------------2222--------------- VAHQPDPEWVALVMSYKLLGLAGVIPQTARCARCGAPDPEHPDPLGGQLLCSKCAALPPY ----------------3333----------------------3333----3333------ PPAVLDFLRHAVRRTVRASFEQPVPSADRPALWRALEKFVTVQVGGVHSWRQLVPSGVPV --------------33331111--3333----------------------1111------ LS -- >PRION PROTEIN; SWP:Q9I9C0; PDB:1U5LA; GSVVGGLGGYALGSAMSGMRMNFDRPEERQWWNENSNRYPNQVYYKEYNDRSVPEGRFVR ------------------------------------------------------------ DCVNITVTEYKIDPNENQNVTQVEVRVMKQVIQEMCMQQYQQYQLAS ------------1111-------------------------1111-- >ALPHA 1 TYPE II COLLAGEN ; SWP:P02458; PDB:1U5MA; YVEFQEAGSCVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVKDCLSPEIPFGE -----------iiii--3333----1111----iiii----------------------- CCPICPADLAAAA ------------- >SPECTRIN ALPHA CHAIN, BRA; SWP:P07751; PDB:1U5PA; ANKQQNFNTGIKDFDFWLSEVEALLASEDYGKDLASVNNLLKKHQLLEADISAHEDRLKD -----------------------1111--------------------------------- LNSQADSLMTSSAFDTSQVKDKRETINGRFQRIKSMAAARRAKLNESHRLHQFFRDMDDE -------1111------------------------------------------------- ESWIKEKKLLVSSEDYGRDLTGVQNLRKKHKRLEAELAAHEPAIQGVLDTGKKLSDDNTI ---------1111----------------------------------------------- 
GKEEIQQRLAQFVDHWKELKQLAAARGQRLE --------------------------3333- >SERINE/THREONINE PROTEIN ; SWP:Q9JLS3; PDB:1U5RA; DPDVAELFFKDDPEKLFSDLREIGHGSFGAVYFARDVRNSEVVAIKKMSYSGKQSNEKWQ -----------3333--------------------------------------------- DIIKEVRFLQKLRHPNTIQYRGCYLREHTAWLVMEYCLGSASDLLEVHKKPLQEVEIAAV -------1111--1111------------------------------------------- THGALQGLAYLHSHNMIHRDVKAGNILLSEPGLVKLGDFGSASIMAPANFVGTPYWMAPE -----------1111------3333-------------1111-----------1111333 VILAMDEGQYDGKVDVWSLGITCIELAERKPPLFNMNAMSALYHIAQNESPALQSGHWSE 3-------------------------------1111------------------------ YFRNFVDSCLQKIPQDRPTSEVLLKHRFVLRERPPTVIMDLIQRTKDAVRELDNLQYRKM -------1111-3333--333311113333---1111----------------1111--3 KKILFQEA 3331111- >APPEARS TO BE FUNCTIONALL; SWP:Q12483; PDB:1U5TA; VNKTILEKQSVELRDQLMVFQERLVEFAKKHNSELQASPEFRSKFMHMCSSIGIDPLSLF -----3333-----------------------------3333-----3333----3333- DRDKHLFTVNDFYYEVCLKVIEICRQTKDMNGGVISFQELEKVHFRKLNVGLDDLEKSID ---1111----------------------------------------------------- MLKSLECFEIFQIRGKKFLRSVPNELTSDQTKILEICSILGYSSISLLKANLGWEAVRSK -3333-------%%%%-------------------------------------------- SALDEMVANGLLWIDYQGGAEALYWDPSWITRQ ------1111---------------3333---- >Vacuolar protein-sorting-; SWP:Q06696; PDB:1U5TB; LDREKFLNKELFLDEIAREIYEFTLSEFKDLNSDTNYMIITLVDLYAMYNKSMRIGTGLI ----------------------------------------3333---------------- SPMEMREACERFEHLGLNELKLVKVNKRILCVTSEKFDVVKEKLVDLIGDNPGSDLLRLT ------------1111-------------------3333---------------3333-- QILSSNNSKSNWTLGILMEVLQNCVDEGDLLIDKQLSGIYYYKNSYWPS 3333-------------------3333---------------------- >Vacuolar protein-sorting-; SWP:P47142; PDB:1U5TC; SALPPVYSFPPLYTRQPNSLTRRQQISTWIDIISQYCKTKKIWYMSVDGTSKNLFNNEDI ------------------3333------------------------------1111---- QRSVSQVFIDEIWSQMTKEGKCLPIDQSGRRSSNTTTTRYFILWKSLDSWASLILQWFED ----3333--------3333-------------1111----------------------- SGKLNQVITLYELSEGDETVNWEFHRMPESLLYYCLKPLCDRNRATMLKDENDKVIAI 
--------------------------------3333---------------------- >ALLENE OXIDE SYNTHASE-LIP; SWP:O16025; PDB:1U5UA; WKNFGFEIFGEKYGQEELEKRIKDEHTPPPDSPVFGGLKLKLKKEKFKTLFTLGTTLKGF -----------------------1111--------------------------------- RRATHTVGTGGIGEITIVNDPKFPEHEFFTAGRTFPARLRHANLKYPDDAGADARSFSIK -----------------------------2222--------------1111--------- FADSDSDGPLDIVMNTGEANIFWNSPSLEDFVPVEEGDAAEEYVYKNPYYYYNLVEALRR ------------------------------1111--------11113333---------- APDTFAHLYYYSQVTMPFKAKDGKVRYCRYRALPGDVDIKEEDESGRLTEEEQRKIWIFS ---3333------------1111----------------33332222-3333--1111-- RHENEKRPDDYLRKEYVERLQKGPVNYRLQIQIHEASPDDTATIFHAGILWDKETHPWFD -1111--1111-------1111--------------1111-----1111--3333----- LAKVSIKTPLSPDVLEKTAFNIANQPASLGLLEAKSPEDYNSIGELRVAVYTWVQHLRKL --------------------3333-1111------1111--------------------- KIGSLV 2222-- >HYPOTHETICAL UPF0244 PROT; SWP:P39411; PDB:1U5WA; MHQVVCATTNPAKIQAILQAFHEIFGEGSCHIASVAVESGVPEQPFGSEETRAGARNRVA -------------------------2222------------------------------- NARRLLPEADFWVAIEAGIDGDSTFSWVVIENASQRGEARSATLPLPAVILEKVREGEAL -----3333----------------------3333------------------1111--- GPVMSRYEGAIGVFTAGKLTRASVYHQAVILALSPFHNAVYS ---1111-3333--iiii-3333---------3333-3333- >Tumor necrosis factor lig; SWP:Q9D777; PDB:1U5XA; KHSVLHLVPVNITSDVTEVMWQPVLRRGRGLEAQGDIVRVWDTGIYLLYSQVLFHDVTFT ---------------------------------!!!!----------------------- MGQVVSREGQGRRETLFRCIRSMPSDAYNSCYSAGVFHLHQGDIITVKIPRANAKLSLSP ---------------------------------------2222---------------11 HGTFLGFVKL 11-------- >PROBABLE GLUTAMINASE YBAS; SWP:P77454; PDB:1U60A; LDANKLQQAVDQAYTQFHSLNGGQNADYIPFLANVPGQLAAVAIVTCDGNVYSAGDSDYR ----------------1111--------3333---1111------1111------1111- FALESISKVCTLALALEDVGPQAVQDKIGADPTGLPFNSVIALELHGGKPLSPLVNAGAI --!!!!-----------------------------1111------iiii--1111----- ATTSLINAENVEQRWQRILHIQQQLAGEQVALSDEVNQSEQTTNFHNRAIAWLLYSAGYL --3333--------------------1111---------1111-----------1111-- 
YCDAMEACDVYTRQCSTLLNTIELATLGATLAAGGVNPLTHKRVLQADNVPYILAEMMME ------------------------------1111-----------3333----------- GLYGRSGDWAYRVGLPGKSGVGGGILAVVPGVMGIAAFSPPLDEDGNSVRGQKMVASVAK -!!!!--------------3333-----2222----------1111-------------- QLGYNVFKG ----1111- >HYPOTHETICAL PROTEIN; SWP:Q81J58; PDB:1U61A; IDAKQLNSLALAYGDAVYEQYIRYHLLQKGKVRPNQLHRLGTSFVSAKAQAKVVYHLLET -3333---------------------------3333---3333----------------- AFLTEEEEAVLRRGRNANSGTVPKNTDVQTYRHSTAFEALIGYHHLLNNRERLDEIVYKA ------------3333----------------------------1111------------ IAVLEE ------ >HYPOTHETICAL PROTEIN; SWP:Q9I0C1; PDB:1U69A; SKNTICLWYDSAALEAATFYAETFPDSAVLAVHRAPGDYPSGKEGDVLTVEFRVGIPCLG ------------------------------------------2222-------------- LNGGPAFRHSEAFSFQVATDDQAETDRLWNAIVDNGGEESACGWCRDKWGISWQITPRVL ---------3333---------------------------iiii--1111------3333 SEAIASPDRAAARRAFEATGRIDIATIEKAFK --1111-------------------------- >F105 LIGHT CHAIN; SWP:NA; PDB:1U6AH; VQLQESGPGLVKPSETLSLTCTVSGGSISSHYWSWIRQSPGKGLQWIGYIYYSGSTNYSP -----------2222------------2222-------2222--------1111------ SLKSRVTISVETAKNQFSLKLTSM ----------1111---------- >3-OXOACYL-[ACYL-CARRIER-P; SWP:P0A574; PDB:1U6EA; RSVGLLSVGAYRPERVVTNDEICQHIDSSDEWIYTRTGIKTRRFAADDESAASMATEACR -----------------3333------------------------1111----------- RALSNAGLSAADIDGVIVTTNTHFLQTPPAAPMVAASLGAKGILGFDLSAGAAGFGYALG ---1111-3333-----------------------11111111------!!!!------- AAADMIGGGAATMLVVGTEKLSPTIDMYDRGNCFIFADGAAAVVVGETPFQGIGPTVAGS --------------------3333-1111--1111------------------------- DGEQADAIRQDIDWITFAQNPSGPRPFVRLEGPAVFRWAAFKMGDVGRRAMDAAGVRPDQ 3333---------------1111---------------------------------1111 IDVFVPHQANSRINELLVKNLQLRPDAVVANDIEHTGNTSAASIPLAMAELLTTGAAKPG -----------------------1111----3333----1111--------------222 DLALLIGYGAGLSYAAQVVRMPK 2---------------------- >RNA-BINDING PROTEIN UBP1; SWP:Q967R0; PDB:1U6FA; MSQIPLVSQYDPYGQTAQLQQLQQQQQQHIPPTQMNPEPDVLRNLMVNYIPTTVDEVQLR 
-------------------------------------3333---------1111------ QLFERYGPIESVKIVCDRETRQSRGYGFVKFQSGSSAQQAIAGLNGFNILNKRLKVALAA ---1111------------------------------------2222------------- SGHQRPGIAGAVGDGNGYL ------------------- >Cullin-associated NEDD8-d; SWP:Q86VP6; PDB:1U6GC; ASYHISNLLEKMTSSDKDFRFMATNDLMTELQKDSIKLDDDSERKVVKMILKLLEDKNGE ---------1111----------------------------------------------- VQNLAVKCLGPLVSKVKEYQVETIVDTLCTNMLSDKEQLRDISSIGLKTVIGELPSALAA ----------------3333---------3333--------------------------- NVCKKITGRLTSAIAKQEDVSVQLEALDIMADMLSRQGGLLVNFHPSILTCLLPQLTSPR ----------------------------------------1111333311113333---3 LAVRKRTIIALGHLVMSCFVDLIEHLLSELSKNDSMSTTRTYIQCIAAISRQAGHRIGEY 333---------------3333-------------------------------------- LEKIIPLVVKFCNVDDDELREYCIQAFESFVRRCPKEVYPHVSTIINICLKYLTYDMSWK 3333----3333---1111----------------------------------------- VRRAAAKCLDAVVSTRHEMLPEFYKTVSPALISRFKEREENVKADVFHAYLSLLKQTRPV -----------1111-----3333-------3333------------------------- GETPLTMLQSQVPNIVKALHKQMKEKSVKTRQCCFNMLTELVNVLPGALTQHIPVLVPGI --------3333----------------------------------1111-33333333- IFSLNDKSSSSNLKIDALSCLYVILCNHSPQVFHPHVQALVPPVVACVGDPFYKITSEAL 1111-------------------1111-3333-1111---3333-3333----------- LVTQQLVKVIRPLDQPSSFDATPYIKDLFTCTIKRLKAADIDQEVKERAISCMGQIICNL ------3333----------1111-------3333------------------------- GDNLGSDLPNTLQIFLERLKNEITRLTTVKALTLIAGSPLKIDLRPVLGEGVPILASFLR 1111------------------3333------3333-------------------3333- KNQRALKLGTLSALDILIKNYSDSLTAAMIDAVLDELPPLISESDMHVSQMAISFLTTLA -------------------------1111----11111111----------------111 KVYPSSLSKISGSILNELIGLVRSPLLQGGALSAMLDFFQALVVTGTNNLGYMDLLRMLT 1----3333----------33331111----------33333333--------------- GPVYSQTHKQSYYSIAKCVAALTRACPKEGPAVVGQFIQDVKNSRSTDSIRLLALLSLGE 3333---3333-------------------3333------------3333---------- VGHHIDLSGQLELKSVILEAFSSPSEEVKSAASYALGSISVGNLPEYLPFVLQEITSQPK ---------3333-----------3333--------------33333333-------333 
RQYLLLHSLKEIISSASVVGLKPYVENIWALLLKHCECAEEGTRNVVAECLGKLTLIDPE 3----------------------3333--------------3333--------3333333 TLLPRLKGYLISGSSYARSSVVTAVKFTISDHPQPIDPLLKNCIGDFLKTLEDPDLNVRR 3--3333--------------------------3333------3333------------- VALVTFNSAAHNKPSLIRDLLDTVLPHLYNETKVRKELIREVEMGPFKHTVDDGLDIRKA -----------------1111-------------1111-----!!!!----3333----- AFECMYTLLDSCLDRLDIFEFLNHVEDGLKDHYDIKMLTFLMLVRLSTLCPSAVLQRLDR -----3333---11113333-----------3333---------------1111----11 LVEPLRATCTTKVKANSVKQEFEKQDELKRSAMRAVAALLTIPEAEKSPLMSEFQSQISS 11----------------3333--------------1111-----3333------3333- NPELAA --1111 >HYPOTHETICAL PROTEIN; SWP:NA; PDB:1U6LA; SLQIVPYLIFNGNCREAFSCYHQHLGGTLEALPFGDSPEPADWKDKIHARLVVGSFALAS --------------------------------3333------------------------ DNHPAYPYEGIKGCSISLNVDSKAEAERLFNALAEGGSVQPLGPTFWAASFGFTDRFGVA -------------------------------3333-------------------1111-- WVNCEQD ------- >ACETYLTRANSFERASE, GNAT F; SWP:NA; PDB:1U6MA; SLIRSATKEDGQAIARLVLVILKDMELPILEEVSEEQMIDLLAEATAYPTYRYGYQRILV ------1111------------1111-3333----------------1111--3333--- YEHAGEVAGIAVGYPAEDEKIIDEPLREVFKKHGLAEDVRLFIEEETLPNEWYLDTISVD --%%%%--------3333--1111------1111-------------2222--------3 ERFRGMGIGSKLLDALPEVAKASGKQALGLNVDFDNPGARKLYASKGFKDVTTMTISGHL 333-----------------1111--------1111-------1111--------iiii- YNHMQKEVE --------- >GAG POLYPROTEIN; SWP:P03332; PDB:1U6PA; ATVVSGQKQDRQGGERRRSQLDRDQCAYCKEKGHWAKDCPKKPRGPRGPRPQTSLL -----------------%%%%-------------1111------------------ >CREATINE KINASE, M CHAIN; SWP:P00563; PDB:1U6RA; PFGNTHNKYKLNYKSEEEYPDLSKHNNHMAKVLTPDLYKKLRDKETPSGFTLDDVIQTGV ---------33333333----1111-3333----------1111-1111----------- DNPGHPFIMTVGCVAGDEESYTVFKDLFDPIIQDRHGGFKPTDKHKTDLNHENLKGGDDL ----------------3333---3333--------iiii1111------3333------- DPHYVLSSRVRTGKSIKGYTLPPHCSRGERRAVEKLSVEALNSLTGEFKGKYYPLKSMTE 3333-----------2222-3333----------------1111!!!!-----3333--- QEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDNKSFLVWVNEEDHLRVISME 
------1111-------33331111-22222222----3333------------------ KGGNMKEVFRRFCVGLQKIEEIFKKAGHPFMWNEHLGYVLTCPSNLGTGLRGGVHVKLAH -----------------------1111--------------3333--------------3 LSKHPKFEEILTRLRLQKRGTGGVDTAAVGSVFDISNADRLGSSEVEQVQLVVDGVKLMV 3331111------------1111------------------------------------- EMEKKLEKGQSIDDMIPAQK -------------------- >SH3 domain-binding glutam; SWP:O75368; PDB:1U6TA; VIRVYIASSSGSTAIKKKQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPAT ------1111------------------------------------------1111---- GYPLPPQIFNESQYRGDYDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQALEHHHHH ---------!!!!-----------1111----------2222------------------ H - >EXOPOLYPHOSPHATASE; SWP:P29014; PDB:1U6ZA; EFAAVDLGSNSFHMVIARVVDGAMQIIGRLKQRVHLADGLGPDNMLSEEAMTRGLNCLSL -------3333--------iiii-------------11111111---------------- FAERLQGFSPASVCIVGTHTLRQALNATDFLKRAEKVIPYPIEIISGNEEARLIFMGVEH --1111--3333------3333-1111-----3333------------------------ TQPEKGRKLVIDIGGGSTELVIGENFEPILVESRRMGCVSFAQLYFPGGVINKENFQRAR -----------------------%%%%------------3333--2222----------- MAAAQKLETLTWQFRIQGWNVAMGASGTIKAAHEVLMEMGEKDGIITPERLEKLVKEVLR ------1111--------------------------1111-----------------111 HRNFASLSLPGLSEERKTVFVPGLAILCGVFDALAIRELRLSDGALREGVLYEMEGRFRH 1-3333--22223333----------------------------------------1111 QDVRSRTASSLANQYHIDSEQARRVLDTTMQMYEQWREQQPKLAHPQLEALLRWAAMLHE ---------------------------------------3333--------------111 VGLNINHSGLHRHSAYILQNSDLPGFNQEQQLMMATLVRYHRKAIKLDDLPRFTLFKKKQ 13333-2222------------2222-----------1111-----1111------3333 FLPLIQLLRLGVLLNNQRQATTTPPTLTLITDDSHWTLRFPHDWFSQNALVLLDLEKEQE -----------3333-!!!!-----------!!!!-----22221111------------ YWEGVAGWRLKIEEESTP ----2222---------- >FKBP-TYPE PEPTIDYL-PROLYL; SWP:Q9SCY2; PDB:1U79A; CEFSVSPSGLAFCDKVVGYGPEAVKGQLIKAHYVGKLENGKVFDSSYNRGKPLTFRIGVG -----1111--------------2222-----------------3333------------ EVIKGWDQGILGSDGIPPMLTGGKRTLRIPPELAYGDRGAGCKGGSCLIPPASVLLFDIE --3333------2222---2222------3333---------!!!!-------------- 
YIGKA ----- >cAMP-dependent protein ki; SWP:P00514; PDB:1U7EB; GRRRRGAISAEVYTEEDAASYVRKVIPKDYKTMAALAKAIEKNVLFSHLDDNERSDIFDA -----------------1111----------------1111-1111-------------- MFPVSFIAGETVIQQGDEGDNFYVIDQGEMDVYVNNEWATSVGEGGSFGELALIYGTPRA ------2222---2222----------------%%%%-----2222--3333-------- ATVKAKTNVKLWGIDRDSYRRILMGSTLRKRK -------------------------------- >PROBABLE AMMONIUM TRANSPO; SWP:P37905; PDB:1U7GA; AVADKADNAFICTALVLFTIPGIALFYGGLIRGKNVLSLTQVTVTFALVCILWVVYGYSL --------------------------1111-3333------------------------- ASGEGNNFFGNINWLLKNIELTAVGSIYQYIHVAFQGSFACITVGLIVGALAERIRFPAV -----1111----------1111----3333-------------------3333------ LIFVVVWLTLSYIPIAHVWGGGLLASHGALDFAGGTVVHINAAIAGLVGAYLIGKRVGFG ------------------------1111-----------------------------222 KEAFKPHNLPVFTGTAILYIGWFGFNAGSAGTANEIAALAFVNTVVATAAAILGWIFGEW 2---3333-------------------3333----------------------------- ALRGLPSLLGACSGAIAGLVGVTPACGYIGVGGALIIGVVAGLAGLWGVTLKRLLRVDDP ------------------------1111------------------------------33 CDVFGVHGVCGIVGCITGIFAASSLGGVGFAEGVTGHQLLVQLESIAITIVWSGVVAFIG 33----------------33331111----2222-------------------------- YKLADLTVGLRV ------------ >HYPOTHETICAL PROTEIN; SWP:Q9I3Y6; PDB:1U7IA; HSARVRPFLFQGVQAEAANFYLSLFDDAEILQIQRYGAEGPGPEGSVLKALFRLGDQSVH -------------3333----1111-----------------2222-------!!!!--- CIDSHVRHAFDFTPAFSFFVDCESNAQIERLAEALSDGGKALPLGDYGFSQRFAWLADRF ------------3333-------------------2222------------------111 GVSWQLNLAG 1--------- >GAG POLYPROTEIN; SWP:P03336; PDB:1U7KA; PLRGGNGQLQYWPFSSSDLYNWKNNNPSFSEDPGKLTALIESVLTTHQPTWDDCQQLLGT ---1111---------------1111-33333333------------------------- LLTGEEKQRVLLEARKAVRGNDGRPTQLPNEVDAAFPLERPDWDYTTQRGRNHLVLYRQL -------------------1111----3333------------1111------------- LLAGQNAGR -----3333 >VACUOLAR ATP SYNTHASE SUB; SWP:P31412; PDB:1U7LA; LYTANDFILISLPQNAQPVTAPGSKTDSWFNETLIGGRAFVSDFKIPEFKIGSLDTLIVE ------------1111-1111------------%%%%---------------3333---- 
SEELSKVDNQIGASIGKIIEILQGLNETSTNAYRTLPINNMPVPEYLENFQWQTRKFKLD ----------------------1111-----------%%%%-----------1111-111 KSIKDLITLISNESSQLDADVRATYANYNSAKTNLAAAERKKTGDLSVRSLHDIVKPEDF 1-------------------------------------3333--3333--1111-3333- VLNSEHLTTVLVAVPKSLKSDFEKSYETLSKNVVPASASVIAEDAEYVLFNVHLFKKNVQ --------------3333------1111-----2222------1111-------3333-- EFTTAAREKKFIPREFNYSEELIDQLKKEHDSAASLEQSLRVQLVRLAKTAYVDVFINWF ------1111-------------------------------------------------- HIKALRVYVESVLRYGLPPHFNIKIIAVPPKNLSKCKSELIDAFGFLGGNAFMYEPFVMY ----------------------------2222----------------1111-------- IINL ---- >FATTY ACID/PHOSPHOLIPID S; SWP:Q82ZE8; PDB:1U7NA; KIAVDAGGDNAPQAIVEGVLAKQDFPDIEFQLYGKEAEIKKYITDEKNITIIHTDEKIAS -------1111----------3333--------------1111--2222----------- DDEPVKAIRRKKTASVLAAQAVKNGEADAIFSAGNTGALLAAGLFIVGRIKNVERPGLST ----------1111-----------------------------------2222------- LPVGEPDKGFDLDLGANADNKPEHLVQYAVLGSFYAEKVRNVQNPRVGLLNNGTGSELTK ----1111------------3333------------------------------------ KAFELLAADETINFVGNVEARELLNGVADVVVTDGFTGNAVLKSIEGTANSLLKTAILSG --------1111------3333-----------3333----------------------3 ALLLKNALHGKDEDYSKHGGAVLFGLKAPVIKTHGATGPDAVRYTIRQIHTLETQVVPQL 333-----------3333---------------1111--------------1111----- VEYYE ----- >MAGNESIUM-DEPENDENT PHOSP; SWP:Q9D967; PDB:1U7PA; MTRLPKLAVFDLDYTLWPFWVDTHVDPPFHKSSDGTVRDRRGQNIQLYPEVPEVLGRLQS -----------2222----1111--------1111---1111-----1111-------11 LGVPVAAASRTSEIQGANQLLELFDLGKYFIQREIYPGSKVTHFERLHHKTGVPFSQMVF 11-----------------------3333------------------------3333--- FDDENRNIIDVGRLGVTCIHIRDGMSLQTLTQGLETFAKAQAGL ---3333----1111----------------------------- >Coenzyme A biosynthesis b; SWP:P24285; PDB:1U7UA; PVNDLKHLNIMITAGPTREPLDPVRYISDHSSGKMGFAIAAAAARRGANVTLVSGPVSLP --1111------------------------------------------------------ TPPFVKRVDVMTALEMEAAVNASVQQQNIFIGCAAVADYRAALTIKMVKNPDIVAGVAAL -2222-----------------1111------------------------------1111 
KDHRPYVVGFAAETNNVEEYARQKRIRKNLDLICANSDNNALHLFWQDGDKVLPLERKEL ------------------------------------------------------------ LGQLLLDEIVTRYDEKNR --------------1111 >Coenzyme A biosynthesis b; SWP:P24285; PDB:1U7ZA; VNDLKHLNIMITAGPTREPLDPVRYISDHSSGKMGFAIAAAAARRGANVTLVSGPVSLPT ---1111-----------------------------------1111-------------- PPFVKRVDVMTALEMEAAVNASVQQQNIFIGCAAVADYRAATVAPEKIDELTIKMVKNPD 2222-----------------3333----------------------------------- IVAGVAALKDHRPYVVGFAAETNNVEEYARQKRIRKNLDLICANDVSQPTQGFNSDNNAL ----1111---------------------------------------1111--------- HLFWQDGDKVLPLERKELLGQLLLDEIVTRYDEKNR ---1111----------------------------- >PHOSPHOSULFOLACTATE SYNTH; SWP:O06739; PDB:1U83A; DFSLELPVRTNKPRETGQSILIDNGYPLQFFKDAIAGASDYIDFVKFGWGTSLLTKDLEE -------------------------------------3333--------3333-1111-- KISTLKEHDITFFFGGTLFEKYVSQKKVNEFHRYCTYFGCEYIEISNGTLPTNKEKAAYI -----1111-------------1111---------------------------------- ADFSDEFLVLSEVGSKDQSSEEWLEYIVEDEAGAEKVITEQIVDDIISSDIDINRLIFEA --1111------------------------------------3333-----1111----- PNKTLQQGFIQKIGPNVNLANIPFHDAIALETLRLGLRSDTFF ----------------------1111-----------3333-- >HYPOTHETICAL PROTEIN; SWP:Q5KVS1; PDB:1U84A; GQQLNRLLLEWIGAWDPFGLGKDAYDVEAASVLQAVYETEDARTLAARIQSIYEFAFDEP ---------------1111-1111---------3333----------------------- IPFPHCLKLARRLLELKQAAS -3333------------1111 >TALIN 1; SWP:P26039; PDB:1U89A; GSHMQAATEDGQLLRGVGAAATAVTQALNELLQHVKAHATGAGPAGRYDQATDTILTVTE -------1111-----------------------1111---------------------- NIFSSMGDAGEMVRQARILAQATSDLVNAIKADAEGESDLENSRKLLSAAKILADATAKM -------------------------------3333---3333------------------ VEAAKGAAAHPDSEEQQQR ----1111----------- >ADA POLYPROTEIN; SWP:P06134; PDB:1U8BA; KDDQRWQSVLARDPNADGEFVFAVRTTGIFRPSCRARHALRENVSFYANASEALAAGFRP ------------3333-------3333---1111-----3333----------1111--- CKRCQPDKANPRQHRLDKITHACRLLEQETPVTLEALADQVASPFHLHRLFKATTGTPKA -----------------------1111--------------------------------- WQQAWRAR -------- 
>Glyceraldehyde-3-phosphat; SWP:P04406; PDB:1U8FO; KVKVGVNGFGRIGRLVTRAAFNSGKVDIVAINDPFIDLNYMVYMFQYDSTHGKFHGTVKA --------------------------------1111------------------------ ENGKLVINGNPITIFQERDPSKIKWGDAGAEYVVESTGVFTTMEKAGAHLQGGAKRVIIS iiii--iiii--------3333-3333--------------3333--3333--------- APSADAPMFVMGVNHEKYDNSLKIISNASCTTNCLAPLAKVIHDNFGIVEGLMTTVHAIT ---------22223333-1111-------------------------------------3 ATQKTVDGPSGKLWRDGRGALQNIIPASTGAAKAVGKVIPELNGKLTGMAFRVPTANVSV 333------11113333-3333-------33333333-3333------------------ VDLTCRLEKPAKYDDIKKVVKQASEGPLKGILGYTEHQVVSSDFNSDTHSSTFDAGAGIA -----------3333----------1111----------33332222------3333--- LNDHFVKLISWYDNEFGYSNRVVDLMAHMASKE -----------------------------1111 >glycine cleavage system t; SWP:NA; PDB:1U8SA; SLTQHLVITAVGTDRPGICNEVVRLVTQAGCNIIDSRIAMFGKEFTLLMLISGSPSNITR --------------2222------------------------------------------ VETTLPLLGQQHDLITMMKRTSPHDHQTHAYTVEVYVESDDKLGLTEKFTQFFAQRQIGM --------------------------------------------3333-----1111--- ASLSAQTISNQFHIAISARVDSGCNLMQLQEEFDALCTALDVQGSLNFIKN --------------------1111--------------------------- >Gamma-aminobutyrate metab; SWP:P55792; PDB:1U8VA; MLMTAEQYIESLRKLNTRVYMFGEKIENWVDHPMIRPSINCVRMTYELAQDPQYADLMTT ---------3333-------iiii---33331111---------------33333333-- KSNLIGKTINRFANLHQSTDDLRKKVKMQRLLGQKTASCFQRCVGMDAFNAVFSTTYEID ---------3333----------------------------------------------- QKYGTNYHKNFTEYLKYIQENDLIVDGAMTDPKGDRGLAPSAQKDPDLFLRIVEKREDGI ----------------------------------11111111--1111-------1111- VVRGAKAHQTGSINSHEHIIMPTIAMTEADKDYAVSFACPSDADGLFMIYGRQSCDTRKM --------2222--------------3333------------2222------22223333 EEGADIDLGNKQFGGQEALVVFDNVFIPNDRIFLCQEYDFAGMMVERFAGYHRQSYGGCK 22221111-------------------1111-----1111-------------3333--- VGVGDVVIGAAALAADYNGAQKASHVKDKLIEMTHLNETLYCCGIACSAEGYPTAAGNYQ -------------------1111------------------------1111--3333--- IDLLLANVCKQNITRFPYEIVRLAEDIAGGLMVTMPSEADFKSETVVGRDGETIGDFCNK 
------------------------------1111-------------------------1 FFAAAPTCTTEERMRVLRFLENICLGASAVGYRTESMHGAGSPQAQRIMIARQGNINAKK 111-1111-----------------1111------------3333--------------- ELAKAIAGIK ---------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P39207; PDB:1U8WA; MEQTFIMIKPDGVQRGLIGEVICRFEKKGFTLKGLKLISVERSFAEKHYEDLSSKSFFSG ------------1111--------------------------------------1111-- LVDYIVSGPVVAMIWEGKNVVLTGRKIIGATNPAASEPGTIRGDFAIDIGRNVIHGSDSV ----------------2222-----------1111------------1111--------- ESARKEIALWFPDGPVNWQSSVHPWVYET ----------1111-----1111------ >Maltose-6'-phosphate gluc; SWP:P54716; PDB:1U8XX; KKSFSIVIAGGGSTFTPGIVLLLDHLEEFPIRKLKLYDNDKERQDRIAGACDVFIREKAP ---------1111--------33331111----------3333---------------11 DIEFAATTDPEEAFTDVDFVAHIRVGKYARALDEQIPLKYGVVGQETCGPGGIAYGRSIG 11------3333-----------2222--------3333-----1111------------ GVLEILDYEKYSPDAWLNYSNPAAIVAEATRRLRPNSKILNICDPVGIEDRAQILGLSSR -----------1111------------------1111---------------1111--33 KEKVRYYGLNHFGWWTSIQDQEGNDLPKLKEHVSQYGYIPKTSWNDTFAKARDVQAADPD 33-----------------1111-------------------11113333-------111 TLPNTYLQYYLFPDDVKKSNPNHTRANEVEGREAFIFSQCDITREQSSENSEIKIDDHAS 1----3333--3333----33333333--------------------------------- YIVDLARAIAYNTGERLLIVENNGAIANFDPTAVEVPCIVGSNGPEPITVGTIPQFQKGL ---------------------iiii----1111-------1111---------------- EQQVSVEKLTVEAWAEKSFQKLWQALILSKTVPNARVARLILEDLVEANKDFWPELDQSP ----------------------------1111----------------3333-------- >RAS-RELATED PROTEIN RAL-A; SWP:Q9CXY0; PDB:1U8YA; SLALHKVIMVGSGGVGKSALTLQFMYDEFVEDYEPKSYRKKVVLDGEEVQIDILDTAGFR -----------2222--------------2222----------iiii------------- SGEGFLCVFSITEMESFAATADFREQILRVKEDENVPFLLVGNKSDLEDKRQVSVEEAKN ---------1111------------------------------11111111--------- RADQWNVNYVETSAKTRANVDKVFFDLMREIRAR ---------------------------------- >RAS-RELATED PROTEIN RAL-A; SWP:P63320; PDB:1U8ZA; SLALHKVIMVGSGGVGKSALTLQFMYDEFVEDYEPTKADSYRKKVVLDGEEVQIDILDTA 
-----------2222------------------1111---------iiii---------- GYAAIRDNYFRSGEGFLCVFSITEMESFAATADFREQILRVKEDENVPFLLVGNKSDLED -3333---------------1111-------------------1111-------333311 KRQVSVEEAKNRADQWNVNYVETSAKTRANVDKVFFDLMREIRAR 11-------------------------2222----------1111 >RECA PROTEIN; SWP:P03017; PDB:1U94A; KQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDIALGAGGLPMGRIVEIY ----------------2222--1111-------------------------2222----- GPESSGKTTLTLQVIAAAQREGKTCAFIDAEHALDPIYARKLGVDIDNLLCSQPDTGEQA -2222------------3333------------------1111-3333------------ LEICDALARSGAVDVIVVDSVAALTPKAEIEGEGLAARMMSQAMRKLAGNLKQSNTLLIF -------3333--------3333-----1111---------------------------- INQTTGGNALKFYASVRLDIRRIGAVKEGENVVGSETRVKVVKNKIAAPFKQAEFQILYG -------3333------------------------------------------------- EGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIGQGKANATAWLKDNPETAKEIEKKVRE -------------1111----------iiii----------------------------- LLLSNP ------ >CYTOCHROME C OXIDASE COPP; SWP:Q12287; PDB:1U96A; MTETDKKQEQENHAECEDKPKPCCVCKPEKEERDTCILFNGQDSEKCKEFIEKYKECMKG ----------------------3333---------------------------------- YGFEVPSAN --------- >APC35852; SWP:Q5KWF3; PDB:1U9CA; MSKRVLMVVTNHTTITDDHKTGLWLEEFAVPYLVFQEKGYDVKVASIQGGEVPLDPRSIN ---------------1111----3333--------1111---------------1111-- EKDPSWAEAEAALKHTARLSKDDAHGFDAIFLPGGHGTMFDFPDNETLQYVLQQFAEDGR --3333-----1111----3333-----------3333---1111----------1111- IIAAVHGPSGLVNATYKDGTPIVKGKTVTSFTDEEEREVGLDVHMPFLLESTLRLRGANF -----3333-1111-1111-1111----------------3333---------------- VRGGKWTDFSVRDGNLITGQNPQSSRSTAEKVVAALEERE ---2222-----!!!!----3333------------1111 >HYPOTHETICAL PROTEIN VC07; SWP:NA; PDB:1U9DA; APHLRFRAVEAHIVESLVPTLLNELSSLLSTARNAFTFELINTQYFAEGGVYPVEVLWFG -------------------------------1111-----------2222---------- REQQTQDQIAQVITDQIRQLLGADSHLAVVFIPLQRTAYYLDGQHF ----------------------------------1111--iiii-- >TRIGGERING RECEPTOR EXPRE; SWP:Q9JKE2; PDB:1U9KA; EEERYDLVEGQTLTVKCPFNIMKYANSQKAWQRLPDGKEPLTLVVTQRPFTRPSEVHMGK 
-------2222--------3333-----------2222----------1111-----!!! FTLKHDPSEAMLQVQMTDLQVTDSGLYRCVIYHPPNDPVVLFHPVRLVVT !----3333----------3333--------------------------- >TRANSCRIPTION ELONGATION ; SWP:P03003; PDB:1U9LA; AHAAIDTFTKYLDIDEDFATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAK ---------1111---------1111--3333----3333---2222------------- NALATIAQ -------- >PARC; SWP:NA; PDB:1U9PA; MPQFNLRWPGGGPQFNLRWPREVLDLVRKVAEENGRSVNSEIYQRVMESFKKEGRIGGGG --------%%%%-------3333--------1111---------------1111------ REVLDLVRKVAEENGRSVNSEIYQRVMESFKKEGRI ------------------------------------ >RIBOSE-PHOSPHATE PYROPHOS; SWP:Q58761; PDB:1U9YA; MIVVSGSQSQNLAFKVAKLLNTKLTRVEYKRFPDNEIYVRIVDEINDDEAVIINTQKNQN -----1111----------------------3333------------------------- DAIVETILLCDALRDEGVKKITLVAPYLAYARQDKKFNPGEAISIRALAKIYSNIVDKLI ------------------------------------------3333-------------- TINPHETHIKDFFTIPFIYGDAVPKLAEYVKDKLNDPIVLAPDKGALEFAKTASKILNAE -----33331111--------3333----3333---------3333-------------- YDYLEIAPKTLDAKDRDVFIVDDIISTGGTMATAVKLLKEQGAKKIIAACVHPVLIGDAL --------------------------------------1111-------------!!!!- NKLYSAGVEEVVGTDTYLSEVSKVSVAEVIVDLL -----------------------------3333- >CELL DIVISION PROTEIN KIN; SWP:P50613; PDB:1UA2A; EKLDFLGEGQFATVYKARDKNTNQIVAIKKINRTALREIKLLQELSHPNIIGLLDAFGHK ------------------------------------------------------------ SNISLVFDFMETDLEVIIKDNSLVLTPSHIKAYMLMTLQGLEYLHQHWILHRDLKPNNLL ------------3333-----------3333-------------1111------1111-- LDENGVLKLADFGLAKSFGSPNRAYHQVVTRWYRAPELLFGARMYGVGVDMWAVGCILAE ----------------------------------3333-------3333----------- LLLRVPFLPGDSDLDQLTRIFETLGTPTEEQWPDMCSLPDYVTFKSFPGIPLHHIFSAAG ---------------------------3333--11111111---------3333-3333- DDLLDLIQGLFLFNPCARITATQALKMKYFSNRPGPTPGCQLPRPN -------------3333---------3333---------------- >ADP-DEPENDENT GLUCOKINASE; SWP:Q9V2Z6; PDB:1UA4A; PTWEELYKNAIEKAIKSVPKVKGVLLGYNTNIDAIKYLDSKDLEERIIKAGKEEVIKYSE ----------------3333-------------------------------------111 
ELPDKINTVSQLLGSILWSIRRGKAAELFVESCPVRFYMKRWGWNELRMGGQAGIMANLL 1----------------------------------------------------------- GGVYGVPVIVHVPQLSRLQANLFLDGPIYVPTLENGEVKLIHPKEFSGDEENCIHYIYEF -----------------------------------------3333--------------- PRGFRVFEFEAPRENRFIGSADDYNTTLFIREEFRESFSEVIKNVQLAILSGLQALTKEN 2222-!!!!--------------3333---3333--3333-1111------3333-3333 YKEPFEIVKSNLEVLNEREIPVHLEFAFTPDEKVREEILNVLGMFYSVGLNEVELASIME ---------------1111---------------------3333---------------1 ILGEKKLAKELLAHDPVDPIAVTEAMLKLAKKTGVKRIHFHTYGYYLALTEYKGEHVRDA 111------1111----------------------------------------------- LLFAALAAAAKAMKGNITSLEEIREATSVPVNEKATQVEEKLRAEYGIKEGIGEVEGYQI -------------------------3333-------------------iiii--iiii-- AFIPTKIVAKPKSTVGIGDTISSSAFIGEFSFTL -------------2222-------------1111 >ALPHA-AMYLASE; SWP:P00691; PDB:1UA7A; PSIKSGTILHAWNWSFNTLKHNMKDIHDAGYTAIQTSPINQVKEGNQGDKSMSNWYWLYQ -3333-----2222------------------------------%%%%--33333333-- PTSYQIGNRYLGTEQEFKEMCAAAEEYGIKVIVDAVINHTTFDYAAISNEVKSIPNWTHG ------------------------------------------3333--3333-------- NTQIKNWSDRWDVTQNSLLGLYDWNTQNTQVQSYLKRFLERALNDGADGFRFDAAKHIEL -----------------iiii---1111--------------1111-------3333--1 PDDGSYGSQFWPNITNTSAEFQYGEILQDSASRDAAYANYMDVTASNYGHSIRSALKNRN 111------3333-------------------33333333-------------------- LGVSNISHYASDVSADKLVTWVESHDTYANDDEESTWMSDDDIRLGWAVIASRSGSTPLF -1111--------3333------------3333-3333----------1111-------- FSRPEGGGNGVRFPGKSQIGDRGSALFEDQAITAVNRFHNVMAGQPEELSNPQGNNQIFM ---2222iiii-------------3333-------------2222-----22221111-- NQRGSHGVVLANAGSSSVSINTATKLPDGRYDNKAGAGSFQVNDGKLTGTINARSVAVLY --------------------------------1111------%%%%-------------- PD -- >ATP-DEPENDENT DNA HELICAS; SWP:P09980; PDB:1UAAA; RLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHIAAVTFTNKAAR ---------------------------------------1111----------------- EMKERVGQTLGRKEARGLMISTFHTLGLDIIKREYAALGMKANFSLFDDTDQLALLKELT -----1111--------------------------1111--------------------- 
EGLIEDDKVLLQQLISTISNWKNDLKTPSQAAASAIGERDRIFAHCYGLYDAHLKACNVL 3333------------------------3333---------------------------- DFDDLILLPTLLLQANEEVRKRWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVVG 3333------------3333-------------3333----------------------- DDDQSIYSWRGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEKR 1111--3333--1111---------------------------------3333------- LFSELGYGAELKVLSANNEEHEAERVTGELIAHHFVNKTQYKDYAILYRGNHQSRVFEKF -----------------3333------------------3333-------3333------ LMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDSAFLRIVNTPKREIGPATLKKL -1111---------11113333--------------------------------3333-- GEWAMTRNKSMFTASFDMGLSQTLSGRGYEALTRFTHWLAEIQRLAEREPIAAVRDLIHG ---------3333---3333---------------------------------------- MDYESWLYETSPSPKAAEMRMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMME ----------------------------------------------------1111---- REEELDQVQLMTLHASKGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGITRAQ -----------3333--------------------3333----------------3333- KELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIW ------------iiii------3333---3333--- >Lysozyme C [Precursor]; SWP:P00703; PDB:1UACH; DVQLQESGPSLVKPSQTLSLTCSVTGDSITSDYWSWIRKFPGNRLEYMGYVSSFGSTFYN ------------2222-----------1111--------------------1111----1 PSLKSRISITRDTSKNQYYLDLNSVTTEDTATYYCANWDGDYWGQGTLVTVSAA 111---------1111---------3333-------1111-------------- >Lysozyme C [Precursor]; SWP:P00703; PDB:1UACL; DIVLTQSPATLSVTPGNSVSLSCRASQSIGNNLHWYQQKSHESPRLLIKYASQSISGIPS -------------2222-------------------------------------222211 RFSGSGSGTDFTLSINSVETEDFGMYFCQQSNSWPYTFGGGTKLEIK 11----------------1111------------------------- >Exocyst complex component; SWP:O54921; PDB:1UADC; SRQPPLVTGISPNEGIPWTKVTIRGENLGTGPTDLIGLTICGHNCLLTAEWMSASKIVCR ---------------2222-----------1111-----iiii-3333------------ VGQAKNDKGDIIVTTKSGGRGTSTVSFKLLKP -------------------------------- >UDP-N-ACETYLGLUCOSAMINE E; SWP:P28909; PDB:1UAE; MDKFRVQGPTKLQGEVTISGAKNAALPILFAALLAEEPVEIQNVPKLKDVDTSMKLLSQL 
-------------------------------3333----------------------111 GAKVERNGSVHIDARDVNVFCAPYDLVKTMRASIWALGPLVARFGQGQVSLPGGCTIGAR 1------------1111-----3333----3333-------------------------- PVDLHISGLEQLGATIKLEEGYVKASVDGRLKGAHIVMDKVSVGATVTIMCAATLAEGTT -3333----1111-----iiii------------------------------1111---- IIENAAREPEIVDTANFLITLGAKISGQGTDRIVIEGVERLGGGVYRVLPDRIETGTFLV ------------------1111----2222------------------------------ AAAISRGKIICRNAQPDTLDAVLAKLRDAGADIEVGEDWISLDMHGKRPKAVNVRTAPHP --1111--------3333-------------------------iiii------------- AFPTDMQAQFTLLNLVAEGTGFITETVFENRFMHVPELSRMGAHAEIESNTVICHGVEKL --1111-------1111--------------3333---1111-----!!!!--------- SGAQVMATDLRASASLVLAGCIAEGTTVVDRIYHIDRGYERIEDKLRALGANIERVKG ----------------------------------3333--------1111-------- >POLYGULURONATE LYASE; SWP:Q9RB42; PDB:1UAIA; EPCDYPAQQLDLTDWKVTLPIGSSGKPSEIEQPALDTFATAPWFQVNAKCTGVQFRAAVN ----3333--------------2222------3333----------1111-------111 GVTTSGSGYPRSELREMTDGGEEKASWSATSGTHTMVFREAFNHLPEVKPHLVGAQIHDG 1-2222------------%%%%-------------------------------------- DDDVTVFRLEGTSLYITKGDDTHHKLVTSDYKLNTVFEGKFVVSGGKIKVYYNGVLQTTI -----------------!!!!----------2222--------%%%%----iiii----- SHTSSGNYFKAGAYTQANCSNSSPCSSSNYGQVSLYKLQVTHS -----------------3333----1111-------------- >TRNA (GUANINE-N(1)-)-METH; SWP:P43912; PDB:1UALA; HMWIGVISLFPEMFKAITEFGVTGRAVKHNLLKVECWNPRDFTFDKHKTVDDRPYGGGPG ---------33333333---------1111-------3333---1111-----2222--- MLMMVQPLRDAIHTAKAAAGEGAKVIYLSPQGRKLDQGGVTELAQNQKLILVCGRYEGID ----------------------------1111----------1111--------!!!!-3 ERLIQTEIDEEWSIGDYVLTGGELPAMTLIDAVARFIPGVLSFADGLLDCPHYTRPEVLE 333---------------------------------2222--1111------------ii GLTVPPVLMSGHHEEIRKWRLKQSLQRTWLRRPELLEGLALTDEQRKLLKEAQAEHNSLE ii--3333-----------------------3333--------------------1111- HH -- >HYPOTHETICAL PROTEIN TT15; SWP:Q84BR2; PDB:1UANA; MLDLLVVAPHPDDGELGCGGTLARAKAEGLSTGILDLTRGEMGSKGTPEEREKEVAEASR 
---------2222------------1111----------1111----------------- ILGLDFRGNLGFPDGGLADVPEQRLKLAQALRRLRPRVVFAPLEADRHPDHTAASRLAVA ------------2222-------------------------------3333--------- AVHLAGLRKAPLEGEPFRVERLFFYPGNHPFAPSFLVKISAFIDQWEAAVLAYRSQFTVG ------1111----------------------------1111----------3333---- PKGVEARKAMRRYWGNYLGVDYAEPFVSPLPVLYVPWSRA ---------------1111---------------1111-- >PROCOLLAGEN C-PROTEINASE ; SWP:Q15113; PDB:1UAPA; SPDAPTCPKQCRRTGTLQSNFCASSLVVTATVKSMVREPGEGLAVTVSLIGAYKTGGLDL --------------------------------------!!!!------------------ PSPPTGASLKFYVPCKQCPPMKKGVSYLLMGQVEENRGPVLPPESFVVLHRPNQDQILTN -----------------------------------------1111--------------- LSKRKCPSQPV ----------- >RHODANESE; SWP:Q5SJI0; PDB:1UARA; GYAHPEVLVSTDWVQEHLEDPKVRVLEVDEDILLYDTGHIPGAQKIDWQRDFWDPVVRDF ---3333---------1111------------3333---2222---3333---------- ISEEEFAKLERLGISNDTTVVLYGDKNNWWAAYAFWFFKYNGHKDVRLNGGRQKWVEEGR ---------1111-1111------%%%%----------1111------------------ PLTTEVPSYPPGRYEVPYRDESIRAYRDDVLEHIIKVKEGKGALVDVRSPQEYRGELEGA -------------------1111--1111------------------------------- LRAGHIPGAKNIPWAKAVNPDGTFKSAEELRALYEPLGITKDKDIVVYRIAERSSHSWFV -----2222---3333--1111-----------3333--1111------3333------- LKYLLGYPHVKNYDGSWTEWGNLVGVPIAKGEE ------------3333----------------- >ALPHA-GALACTOSIDASE; SWP:Q9FXT4; PDB:1UASA; FENGLGRTPQMGWNSWNHFYCGINEQIIRETADALVNTGLAKLGYQYVNIDDCWAEYSRD ------------------!!!!-----------------3333----------------1 SQGNFVPNRQTFPSGIKALADYVHAKGLKLGIYSDAGSQTCSNKMPGSLDHEEQDVKTFA 111--------1111--------1111------------1111----2222--------1 SWGVDYLKYDNCNDAGRSVMERYTRMSNAMKTYGKNIFFSLCEWGKENPATWAGRMGNSW 111----------%%%%----------------1111-----iiii-33333333----- RTTGDIADNWGSMTSRADENDQWAAYAGPGGWNDPDMLEVGNGGMSEAEYRSHFSIWALA --------3333------33333333-2222-------2222------------------ KAPLLIGCDVRSMSQQTKNILSNSEVIAVNQDSLGVQGKKVQSDNGLEVWAGPLSNNRKA --------1111----------------1111-----------iiii-------%%%%-- 
VVLWNRQSYQATITAHWSNIGLAGSVAVTARDLWAHSSFAAQGQISASVAPHDCKMYVLT ---------------3333---1111-----------------------2222------- PN -- >MOUSE-MUSASHI-1; SWP:Q61474; PDB:1UAWA; CKMFIGGLSWQTTQEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLA -------------3333---3333--------------------------------3333 QSRHELDSKTIDPKVAF 1111------------- >RIBONUCLEASE HII; SWP:O59351; PDB:1UAXA; MKVAGVDEAGRGPVIGPLVIGVAVIDEKNIERLRDIGVKDSKQLTPGQREKLFSKLIDIL -------------------------3333-------333311113333------------ DDYYVLLVTPKEIDERHHSMNELEAEKFVVALNSLRIKPQKIYVDSADVDPKRFASLIKA ------------1111---------------1111----------!!!!----------- GLKYEATVIAEHKADAKYEIVSAASIIAKVTRDREIEKLKQKYGEFGSGYPSDPRTKEWL ------------3333---------------------------------1111------- EEYYKQYGDFPPIVRRTWETARKIEERFRKN ----------11111111------------- >TYPE II 3-HYDROXYACYL-COA; SWP:Q7SIA1; PDB:1UAYA; ERSALVTGGASGLGRAAALALKARGYRVVVLDLRREGEDLIYVEGDVTREEDVRRAVARA ---------------------1111--------------------1111----------3 QEEAPLFAVVSAAGVGLAEKILGKEGPHGLESFRRVLEVNLLGTFNVLRLAAWAMRENPP 333-------------------1111----------------------------1111-- DAEGQRGVIVNTASVAAFEGQIGQAAYAASKGGVVALTLPAARELAGWGIRVVTVAPGLF 1111----------------2222-------------------3333------------- DTPLLQGLPEKAKASLAAQVPFPPRLGRPEEYAALVLHILENPMLNGEVVRLDGALRMAP -3333-----------1111-------3333----------1111-------%%%%---- R - >PHOSPHOMETHYLPYRIMIDINE K; SWP:Q7SIA0; PDB:1UB0A; MRVALTIAGSDSGGGAGVQADLKVFFRFGVYGTSALTLVTAQNTLGVQRVHLLPPEVVYA ----------3333-----------1111-------------1111-------3333--- QIESVAQDFPLHAAKTGALGDAAIVEAVAEAVRRFGVRPLVVDPVMAKEAAAALKERLFP --------------------------------1111---------------------333 LADLVTPNRLEAEALLGRPIRTLKEAEEAAKALLALGPKAVLLKGGHLEAVDLLATRGGV 3--------------------------------1111------------------3333- LRFSAPRVHTRNTHGTGCTLSAAIAALLAKGRPLAEAVAEAKAYLTRALKTAPSLGHGHG ------------2222-----------1111-----------------1111-------- PLDHWA --1111 >ATTACHMENT REGION BINDING; SWP:O42403; PDB:1UB1A; 
APAVPEASASPKQRRSIIRDRGPMYDDPTLPEGWTRKLKQRKSGRSAGKYDVYLINPQGK ------------------------------------------------------------ AFRSKVELIAYFEKVGDTSLDPNDFDFTVTGRGSPSRREQRPPKKAKSPKSPGSGRGRGR ---3333-------------1111---1111----------------------------- PKGSG ----- >CATALASE-PEROXIDASE; SWP:Q55110; PDB:1UB2A; STAEWWPKALNLDILSQHDRKTNPMGPDFNYQEEVQKLDAALKQDLQALMTDSQDWWPAD -----1111---1111--3333---1111-----1111---------3333--1111-22 WGHYGGLMIRLTWHAAGTYRIADGRGGAGTGNQRFAPLNSWPDNTNLDKARRLLWPIKQK 22-------------1111-----------3333--33333333---------------- YGNKLSWADLIAYAGTIAYESMGLKTFGFAFGREDIWHPEKDIYWGPEKEWFPPSTNPNS !!!!---------------1111----------------3333----------------- RYTGDRELENPLAAVTMGLIYVNPEGVDGNPDPLKTAHDVRVTFARMAMNDEETVALTAG --------3333---2222---1111iiii--------------1111------------ GHTVGKCHGNGNAALLGPEPEGADVEDQGLGWINKTQSGIGRNAVTSGLEGAWTPHPTQW ------------1111--3333----iiii---------!!!!------------1111- DNGYFAVCSLNYDWELKKNPAGAWQWEPINPREEDLPVDVEDPSIRRNLVMTDADMAMKM ------------------1111---------3333------1111------3333----- DPEYRKISERFYQDPAYFADVFARAWFKLTHRDMGPKARYIGPDVPQEDLIWQDPIPAGN -----------------------------------3333--1111----1111------- RNYDVQAVKDRIAASGLSISELVSTAWDSARTYRNSDKRGGANGARIRLAPQKDWEGNEP -----------------------------3333-----------3333--33333333-- DRLPKVLAVLEGISAATGATVADVIVLAGNVGVEQKARAAGVEIVLPFAPGRGDATAEQT -------------------------------------1111--------------3333- DTESFAVLEPIHDAIATGSSRTMRQRLKNCCLIATQLLGLTAPEMTVLIGGLRVLGTNHG 33333333--------------------------------------------1111-222 GTKHVVFTDREGVLTNDFFVNLTDMNYLWKPAGKNLYEICDRKTNQVKWTATRVDLVFGS 2--------2222--------------------------------------33331111- NSILRAYSELYAQDDNKEKFVRDFVAAWTKVMNADRFDLD ------------1111----------------1111---- >ALDOLASE PROTEIN; SWP:NA; PDB:1UB3A; DLAAHIDHTLLKPTATLEEVAKAAEEALEYGFYGLCIPPSYVAWVRARYPHAPFRLVTVV 3333-------1111----------------------1111------------------- GFPLGYQEKEVKALEAALACARGADEVDMVLHLGRAKAGDLDYLEAEVRAVREAVPQAVL 
------------------------------------------------------1111-- KVILETGYFSPEEIARLAEAAIRGGADFLKTSTGFGPRGASLEDVALLVRVAQGRAQVKA ----3333-------------1111--------------------------iiii----- AGGIRDRETALRMLKAGASRLGTSSGVALVA ------------------------------- >MAZF PROTEIN; SWP:P33645; PDB:1UB4A; VSRYVPDMGDLIWVDFDPGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQE ------2222---------------------------------------1111------- RDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG -----1111----3333-------------------------- >ANTIBODY 19G2, ALPHA CHAI; SWP:NA; PDB:1UB5A; EVKLLESGGGLVKPGGSLKLSCTASGITFSRYIMSWVRQIPEKRLEWVASISSGGITYYP ------------2222------------1111-------3333--------1111----3 DSVAGRFTISRDNVRNILYLQMSSLRSEDTALYYCARGQGRPYWGQGTLVTVSSAKTTPP 333----------------------3333------------------------------- SVYPAAPGCGDTTGSSVTLGCLVKGYFPEPVTVTWNSGGSSVHTFPALLQSGLYTMSSSV -------------------------------------------------iiii------- TVPSSTWPSTVTCSVAHPASSTTVDKKLE --1111----------3333--------- >ANTIBODY 19G2, ALPHA CHAI; SWP:NA; PDB:1UB6B; AALTQSPVSNPVTLGTSASISCRSTKSLLHSNGITYLYWYLQKPGQSPQLLIYQMSNLAS ------------2222-------------1111-------------------------22 GVPNRFSSSGSGTDFTLRINTVEAEDVGVYYCAQNLELPPTFGAGTKLELKRADAAPTVS 22--------------------1111---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----3333-------------------------iiii----------------------- STLTLTKDEYERHNGYTCEATHKTSTSPIVKSF --------------------------------- >3-OXOACYL-[ACYL-CARRIER P; SWP:Q7SI99; PDB:1UB7A; SGILALGAYVPERVMTNADFEAYLDTSDEWIVTRTGIKERRVAAEDEYTSDLAFKAVEDL ---------------3333----------------------------------------- LRRHPGALEGVDAVIVATNTPDALFPDTAALVQARFGLKAFAYDLLAGPGWIYALAQAHA ---22222222---------------3333------------------------------ LVEAGLAQKVLAVGAEALSKIIDWNDRATAVLFGDGGGAAVVGKVREGYGFRSFVLGADG ----------------1111--3333-------------------2222---------33 TGAKELYHACVAPRLPDGTSMKNRLYMNGREVFKFAVRVMNTATLEAIEKAGLTPEDIRL 333333--------1111------------------------------1111-3333--- 
FVPHQANLRIIDAARERLGLPWERVAVNVDRYGNTSTASIPLALKEAVDAGRIREGDHVL --------------3333--3333---3333---!!!!---------------2222--- LVSFGAGLTWAAAVLTWGGA -------------------- >HYPOTHETICAL PROTEIN PH10; SWP:O58788; PDB:1UB9A; MEELKEIMKSHILGNPVRLGIMIFLLPRRKAPFSQIQKVLDLTPGNLDSHIRVLERNGLV -----------3333----------------3333--1111-------------1111-- KTYKVIADRPRTVVEITDFGMEEAKRFLSSLKAVIDGLDL ----------------------------------3333-- >RECA; SWP:Q59560; PDB:1UBCA; APDREKALELAMAQIDKNFGKGSVMRLGEEVRQPISVIPTGSISLDVALGIGGLPRGRVI ------------------------------------------------------------ EIYGPESSGKTTVALHAVANAQAAGGIAAFIDAEHALDPEYAKKLGVDTDSLLVSQPDTG ------------------------------------------3333-3333-------33 EQALEIADMLVRSGALDIIVIDSVAALVPRAEIEGLQARLMSQALRKMTGALNNSGTTAI 33-------1111---------1111----3333---------------3333------- FINQTGGKALKFYASVRLDVRRIETLKDGTDAVGNRTRVKVVKNKVSPPFKQAEFDILYG --------3333------------------------------------------------ QGISREGSLIDMGVEHGFIRKSGSWFTYEGEQLGQGKENARKFLLENTDVANEIEKKIKE ----3333---------------------------------------3333--------- KLG --- >Transcriptional repressor; SWP:P25490; PDB:1UBDC; TIACPHKGCTKMFRDNSAMRKHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQ ---------------------3333------------------------3333------- CTFEGCGKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHA --2222--------------------------2222------------3333-- >UBIQUITIN; SWP:P62988; PDB:1UBQ; MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN ------1111-------1111---------------3333----%%%%--11113333-- IQKESTLHLVLRLRGG -2222----------- >FARNESYL DIPHOSPHATE SYNT; SWP:P08836; PDB:1UBY; SPVVVEREREEFVGFFPQIVRDLTEDGIGHPEVGDAVARLKEVLQYNAPGGKCNRGLTVV 3333--------------------3333------------------------3333---- AAYRELSGPGQKDAESLRCALAVGWCIELFQAASLVADDIMDQSLTRRGQLCWYKKEGVG -------3333-----------------------------------%%%%-33332222- LDAINDSFLLESSVYRVLKKYCRQRPYYVHLLELFLQTAYQTELGQMLDLITAPVSKVDL ------------------------1111-------------------------------1 
SHFSEERYKAIVKYKTAFYSFYLPVAAAMYMVGIDSKEEHENAKAILLEMGEYFQIQDDY 1111111----------------------1111--1111--------------------- LDCFGDPALTGAVGTDIQDNKCSWLVVQCLQRVTPEQRQLLEDNYGRKEPEKVAKVKELY -----2222-------------------1111---3333--------------------- EAVGMRAAFQQYEESSYRRLQELIEKHSNRLPKEIFLGLAQKIYKRQK -----3333----------------------3333----1111----- >HYPOTHETICAL PROTEIN PH16; SWP:O59245; PDB:1UC2A; VVPLKRIDKIRWEIPKFDKRMRVPGRVYADEVLLEKMKNDRTLEQATNVAMLPGIYKYSI -----------------1111------------3333--------------2222----- VMPDGHQGYGFPIGGVAAFDVKEGVISPGGIGYDINCGVRLIRTNLTEKEVRPRIKQLVD -1111---------------------3333----------------33333333------ TLFKNVPSGVGSQGRIKLHWTQIDDVLVDGAKWAVDNGYGWERDLERLEEGGRMEGADPE --------2222------1111------------1111--33331111---------111 AVSQRAKQRGAPQLGSLGSGNHFLEVQVVDKIFDPEVAKAYGLFEGQVVVMVHTGSRGLG 1--------3333-----!!!!----------------1111-2222------------- HQVASDYLRIMERAIRKYRIPWPDRELVSVPFQSEEGQRYFSAMKAAANFAWANRQMITH --------------1111-----3333---1111-------------------------- WVRESFQEVFKQDPEGDLGMDIVYDVAHNIGKVEEHEVDGKRVKVIVHRKGATRAFPPGH -------------------------------------iiii---------------2222 EAVPRLYRDVGQPVLIPGSMGTASYILAGTEGAMKETFGSTCHGAGRVLSRKAATRQYRG ---3333-----------2222--------------%%%%-------------------- DRIRQELLNRGIYVRAASMRVVAEEAPGAYKNVDNVVKVVSEAGIAKLVARMRPIGVAKG -------------------------3333------------------------------- >GLOBIN; SWP:P02207; PDB:1UC3A; PIVDSGSVAPLSAAEKTKIRSAWAPVYSNYETSGVDILVKFFTSTPAAQEFFPKFKGMTS ------------------------3333----------------333311111111---3 ADQLKKSADVRWHAERIINAVNDAVASMDDTEKMSMKLRDLSGKHAKSFQVDPQYFKVLA 333----3333---------------11113333-----------------1111----- AVIADTVAAGDAGFEKLMSMICILLRSAY -------2222------------1111-- >CILIARY NEUROTROPHIC FACT; SWP:P26992; PDB:1UC6A; GPLGSVKPDPPENVVARPVPSNPRRLEVTWQTPSTWPDPESFPLKFFLRYRPLILDQWQH ------------------1111----------3333------------------------ VELSNGTAHTITDAYAGKEYIIQVAAKDNEIGTWSDWSVAAHATPWTEE --------------1111------------------------------- >LYSINE BIOSYNTHESIS 
ENZYM; SWP:Q84BR0; PDB:1UC8A; MLAILYDRIRPDERMLFERAEALGLPYKKVYVPALPMVLGERPKELEGVTVALERCVSQS ------------------------------3333---2222-3333-------------- RGLAAARYLTALGIPVVNRPEVIEACGDKWATSVALAKAGLPQPKTALATDREEALRLME ---------1111-----------------------1111-------------------- AFGYPVVLKPVIGGFQHQLFYIQEYVEKPGRDIRVFVVGERAIAAIYRAENCPLTEEVAR -------------------------------------!!!!------------------- LSVKAAEAVGGGVVAVDLFESERGLLVNEVNHTMEFKNSVHTTGVDIPGEILKYAWSLAS ------1111----------1111-----------3333----------------1111- >CHIMERIC HUMAN/MOUSE IGG ; SWP:NA; PDB:1UCBH; EVNLVESGGGLVQPGGSLKVSCVTSGFTFSDYYMYWVRQTPEKRLEWVAYISQGGDITDY ------------2222-----------3333--------1111--------1111----- PDTVKGRFTISRDNAKNSLYLQMSRLKSEDTAMYYCARGLDDGAWFAYWGQGTLVTVSVA 3333--------3333----------3333---------3333----------------- STKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPQPVTVSWNSGALTSGVHTFPAVLQSSG ------------1111-!!!!-------------------iiii---------------- LYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKRVEP ----------1111-----------3333---------- >Ig kappa chain C region; SWP:KAC_HUMAN; PDB:1UCBL; MTQIPVSLPVSLGDQASISCRSSQIIVHNNGNTYLEWYLQKPGQSPQLLIYKVSNRFSGV ----------2222-------------2222----------------------------- PDRFSGSGSGTDFTLKISRVEAEDLGVYYCFQGSHVPFTFGSGTKLEIKRTVAAPSVFIF 3333----!!!!--------1111------------------------------------ PPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSST -------------------------------iiii------------------------- LTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -------------------1111------------- >RIBONUCLEASE MC; SWP:P23540; PDB:1UCDA; FDSFWFVQQWPPAVCSFQKSGSCPGSGLRTFTIHGLWPQQSGTSLTNCPGSPFDITKISH -----------------------3333------------%%%%----------3333333 LQSQLNTLWPNVLRANNQQFWSHEWTKHGTCSESTFNQAAYFKLAVDMRNNYDIIGALRP 3--------------------------3333-----------------11113333--11 HAAGPNGRTKSRQAIKGFLKAKFGKFPGLRCRTDPQTKVSYLVQVVACFAQDGSTLIDCT 11-----------------------------------------------1111------- RDTCGANFIF ---------- >UBIQUITIN C-TERMINAL HYDR; SWP:P15374; PDB:1UCH; 
RWLPLEANPEVTNQFLKQLGLHPNWQFVDVYGMDPELLSMVPRPVCAVLLLFPITEKYEV ---------------------------------33333333------------------- FRTEEEEKIKSQGQDVTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLKKFLE ----------------3333------3333---------11111111--1111------1 ESVSMSPEERARYLENYDAIRVDLHFIALVHVDGHLYELDGRKPFPINHGETSDETLLED 111--3333-------3333-----------iiii----3333---------3333---- AIEVCKKFMERDPDELRFNAIALSAA -----------1111----------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P15531; PDB:1UCNA; ANCERTFIAIKPDGVQRGLVGEIIKRFEQKGFRLVGLKFMQASEDLLKEHYVDLKDRPWF --------------1111---------3333-------------------3333--1111 AGLVKYMHSGPVVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIGGSD ------------------2222-----------3333------------1111------- SVESAEKEIGLWFHPEELVDYTSCAQNWIYE -------------1111-----1111----- >Apoptosis-associated spec; SWP:Q9ULZ3; PDB:1UCPA; MGRARDAILDALENLTAEELKKFKLKLLSVPLREGYGRIPRGALLSMDALDLTDKLVSFY -----------3333---------3333----------------------------3333 LETYGAELTANVLRDMGLQEMAGQLQAATHQ ------------------------------- >PROTEIN DSVD; SWP:Q46582; PDB:1UCRA; MEEAKQKVVDFLNSKSGSKSKFYFNDFTDLFPDMKQREVKKILTALVNDEVLEYWSSGST ----------------------3333----11113333------------------!!!! 
TMYGLKGAGKQAAA ----2222------ >ANTIFREEZE PEPTIDE RD1; SWP:P35751; PDB:1UCSA; NKASVVANQLIPINTALTLIMMKAEVVTPMGIPAEEIPKLVGMQVNRAVPLGTTLMPDMV -----------2222--3333-----------33333333---------2222--33332 KNYE 222- >IMMUNOGLOBULIN ALPHA FC R; SWP:P24071; PDB:1UCTA; QEGDFPMPFISAKSSPVIPLDGSVKIQCQAIREAYLTQLMIIKNSTYREIGRRLKTDPEF 3333--------------2222--------1111-------------------------- VIDHMDANKAGRYQCQYRIGHYRFRYSDTLELVVTGLYGKPFLSADRGLVLMPGENISLT -----3333------------------------------------------2222----- CSSAHIPFDRFSLAKEGELSLPQHQSGEHPANFSLGPVDLNVSGIYRCYGWYNRSPYLWS --------------2222--------------------3333---------3333----- FPSNALELVVT ----------- >EPHRIN TYPE-A RECEPTOR 8; SWP:P29322; PDB:1UCVA; GSSGSSGLTVGDWLDSIRMGRYRDHFAAGGYSSLGMVLRMNAQDVRALGITLMGHQKKIL --------33333333--3333----1111-----3333-33333333---3333----- GSIQTMRAQLTSTQGSGPSSG ------3333----------- >Prothrombin [Precursor]; SWP:P00735; PDB:1UCYK; IVEGQDAEVGLSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTVDDLL -------22221111-------------------------1111--3333----1111-- VRIGKHSRTRYERKVEKISMLDKIYIHPRYNWKENLDRDIALLKLKRPIELSDYIHPVCL -----------1111-----------1111---------------------1111----- PDKQTAAKLLHAGFKGRVTGWGNRRETWTTSVAEVQPSVLQVVNLPLVERPVCKASTRIR -3333------------------------------------------------------- ITDNMFCAGYKPGEGKRGDACEGDSGGPFVMKSPYNNRWYQMGIVSWGEGCDRDGKYGFY -1111-----1111------2222----------------------------2222---- THVFRLKKWIQKVIDRLGS -3333-------------- >70 KDA HEAT-SHOCK-LIKE PR; SWP:Q504P4; PDB:1UD0A; RGSHLESYAFNKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFEHQQKE -------------1111-------3333-------------------------------- LEKVCNPIITKLYQSAGGPGG 33333333--3333------- >AMYLASE; SWP:Q93I48; PDB:1UD2A; DGLNGTMMQYYEWHLENDGQHWNRLHDDAAALSDAGITAIWIPPAYKGNSQADVGYGAYD -----------1111----------------------------------1111------1 LYDLGEFNQKGTVRTKYGTKAQLERAIGSLKSNDINVYGDVVMNHKMGADFTEAVQAVQV 111-----iiii--1111------------1111-------------------------- NPTNRWQDISGAYTIDAWTGFDFSGRNNAYSDFKWRWFHFNGVDWDQRYQENHIFRFANT 
1111-------------------3333--------1111-------1111------2222 NWNWRVDEENGNYDYLLGSNIDFSHPEVQDELKDWGSWFTDELDLDGYRLDAIKHIPFWY -------2222----------3333---------------------------11113333 TSDWVRHQRNEADQDLFVVGEYWKDDVGALEFYLDEMNWEMSLFDVPLNYNFYRASQQGG ----------------------------------1111--------------------33 SYDMRNILRGSLVEAHPMHAVTFVDNHDTQPGESLESWVADWFKPLAYATILTREGGYPN 333333-222233333333------11112222------3333----------------- VFYGDYYGIPNDNISAKKDMIDELLDARQNYAYGTQHDYFDHWDVVGWTREGSSSRPNSG -3333----1111---------------------------------------3333---- LATIMSNGPGGSKWMYVGRQNAGQTWTDLTGNNGASVTINGDGWGEFFTNGGSVSVYVNQ -----------------3333------1111--------1111----------------- >UBIQUITIN CORE MUTANT 1D7; SWP:Q6VZQ1; PDB:1UD7A; MQVFLKTLTGKTVTIEVEPSDTVENFKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN ------1111-------1111---------------3333----------11113333-- IQKESTIHLVLRLRGG ---------------- >DNA POLYMERASE SLIDING CL; SWP:Q975N2; PDB:1UD9A; AHIVYDDVRDLKAIIQALLKLVDEALFDIKPEGIQLVAIDKAHISLIKIELPKEMFKEYD ----------------3333---------1111------1111--------3333----- VPEEFKFGFNTQYMSKLLKAAKRKEEIIIDADSPEVVKLTLSGALNRVFNVNNIEVLPPE --------------------------------3333------------------------ VPLEFDIKATINASGLKNAIGEIAEVADTLLISGNEEKVVVKGEGENKVEVEFSKDTGSL ----------------------------------1111---------------1111--- ADIEFNKESSSAYDVEYLNDIISLTKLSDYVKVAFADQKPMQLEFNMEGGGKVTYLLAPK ------------------------3333------------------2222---------- LS -- >UDP-GALACTOSE-4-EPIMERASE; SWP:P09147; PDB:1UDC; MRVLVTGGSGYIGSHTCVQLLQNGHDVIILDNLCNSKRSVLPVIERLGGKHPTFVEGDIR ------1111----------1111------------3333-----------------333 NEALMTEILHDHAIDTVIHFAGLKAVGESVQKPLEYYDNNVNGTLRLISAMRAANVKNFI 3-----------------------3333-------------------------------- FSSSATVYGDQPKIPYVESFPTGTPQSPYGKSKLMVEQILTDLQKAQPDWSIALLRYFNP ----------------1111--------------------------1111---------- VGAHPSGDMGEDPQGIPNNLMPYIAQVAVGRRDSLAIFGNDYPTEDGTGVRDYIHVMDLA ---3333------------------------------------1111------------- 
DGHVVAMEKLANKPGVHIYNLGAGVGNSVLDVVNAFSKACGKPVNYHFAPRREGDLPAYW ---------2222--------------------------------------2222----- ADASKADRELNWRVTRTLDEMAQDTWHWQSRHPQGYPD -------------------------------1111--- >TRANSCRIPTIONAL REGULATOR; SWP:O58873; PDB:1UDDA; MRVMITDKLRRDSEQIWKKIFEHPFVVQLYSGTLPLEKFKFYVLQDFNYLVGLTRALAVI ------------------------------------------------------------ SSKAEYPLMAELIELARDEVTVEVENYVKLLKELDLTLEDAIKTEPTLVNSAYMDFMLAT 1111---------------------------1111-3333-------------------- AYKGNIIEGLTALLPCFWSYAEIAEYHKDKLRDNPIKIYREWGKVYLSNEYLNLVGRLRK ------------------------------------------3333-------------- IIDSSGHSGYDRLRRIFITGSKFELAFWEMAWRGG -1111------------------------------ >URACIL-DNA GLYCOSYLASE; SWP:P10186; PDB:1UDH; LDWTTFRRVFLIDDAWRPLMEPELANPLTAHLLAEYNRRCQTEEVLPPREDVFSWTRYCT ------------3333---3333--3333------------------3333-3333---3 PDEVRVVIIGQDPYHHPGQAHGLAFSVRANVPPPPSLRNVLAAVKNCYPEARMSGHGCLE 333------------2222---2222-1111----------------1111--------- KWARDGVLLLNTTLTVKRGAAASHSRIGWDRFVGGVIRRLAARRPGLVFMLWGTHAQNAI ----------------2222-1111----------------------------------- RPDPRVHCVLKFSHPSPLSKVPFGTCQHFLVANRYLETRSISPIDWSV --1111---------3333--3333-3333------1111-------- >NAWAPRIN; SWP:P60589; PDB:1UDKA; NEKSGSCPDMSMPIPPLGICKTLCNSDSGCPNVQKCCKNGCGFMTCTTPVP -------------------------3333---------------------- >INTERSECTIN 2; SWP:Q9NZM3; PDB:1UDLA; GSSGSSGQKGWFPASHVKLLGPSSERATPAFHPVCQVIAMYDYAANNEDELSFSKGQLIN ----------------------------------------------3333---2222--- VMNKDDPDWWQGEINGVTGLFPSNYVKMTTDSSGPSSG -------------%%%%--------------------- >COACTOSIN-LIKE PROTEIN; SWP:Q9CQI6; PDB:1UDMA; GSEGAATMATKIDKEACRAAYNLVRDDGSAVIWVTFRYDGATIVPGDQGADYQHFIQQCT -------------3333------------------------------------------- DDVRLFAFVRFTTGDAMSKRSKFALITWIGEDVSGLQRAKTGTDKTLVKEVVQNFAKEFV -----------------------------------------------3333--------- ISDRKELEEDFIRSELKKAGGANYDAQSE --3333----------------------- >RIBONUCLEASE PH; SWP:O67069; PDB:1UDSA; 
RSDGRKEDQLRPVSIQRDFLEYPEGSCLISFGKTKVICTASVIENVPNWLKGKGQGWITA -----1111---------------------!!!!------------1111---------- EYSMLPRATQQRTIRESVQGRIGGRTHEIQRMIGRAMRTAVELTKIGERTIWVDCDVIQA -------%%%%--------------------------11113333--------------- DGGTATAAITGAFVAVADAIIKLHKEGIIEETPIKDFVAAVSVGIVNDRILLDLNFEEDS -----------------------1111------------------%%%%---------11 AAQVDMNVVGTGSGRLSEVHTMGEEYSFTKDELIKMLDLAQKGINELIELQKKLYVIQDG 11--------1111-------------------------------------1111--iii KWERSELKEVSSTT i------------- >THE GTP-BINDING PROTEIN O; SWP:Q7X493; PDB:1UDXA; MFQDVLVITVAAGRGGDGAVSFRREKFVPKGGPDGGDGGRGGSVYLRARGSVDSLSRLSK ------------------------------------------------3333--1111-- RTYKAEDGEHGRGSQQHGRGGEDLVIEVPRGTRVFDADTGELLADLTEEGQTVLVARGGA ------------%%%%-------------------------------2222--------- GGRGNMHFVSPTRQAPRFAEAGEEGEKRRLRLELMLIADVGLVGYPNAGKSSLLAAMTRA ---3333--3333-------------------------------1111------------ HPKIAPYPFTTLSPNLGVVEVSEEERFTLADIPGIIEGASEGKGLGLEFLRHIARTRVLL ------1111---------------------------3333------------------- YVLDAADEPLKTLETLRKEVGAYDPALLRRPSLVALNKVDLLEEEAVKALADALAREGLA ---1111----------------3333-----------11113333-------------- VLPVSALTGAGLPALKEALHALVRSTPPPEMPKPVPQAGVEVVPVAEGVYEVRAPEVERY ----------------------1111-------------------2222----------- LARIKGDLMEAAGYLQEVFRRQGVEAALRAKGVRAGDLVRIGGLEFEYIPEV 1111---3333-3333-----------1111--2222---iiii-------- >SINGLE-STRAND BINDING PRO; SWP:P71711; PDB:1UE1A; GDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDWKDGEALFLRCNIWREAAENV -------------------3333------------------------------------- AESLTRGARVIVSGRLKQRSFETREGEKRTVIEVEVDEIGPSLRYATAKVNKA ----------------------------------------------------- >367AA LONG HYPOTHETICAL C; SWP:Q972I2; PDB:1UE8A; MYDWFKQMRKESPVYYDGKVWNLFKYEDCKMVLNDHKRFSSNLTGYNDKLEMLRSGKVFF --------------------------------------------3333--3333------ DIPTRYTMLTSDPPLHDELRNLTADAFNPSNLPVDFVREVTVKLLSELDEEFDVIESFAI -1111-3333------------1111-3333-3333------1111--------1111-- 
PLPILVISKMLGINPDVKKVKDWSDLVALRLGRADEIFSIGRKYLELISFSKKELDSRKG ---------------------------1111----------------------------- KEIVDLTGKIANSNLSELEKEGYFILLMIAGNETTTNLIGNAIEDFTLYNSWDYVREKGA ----3333--------------------------------------1111--3333---- LKAVEEALRFSPPVMRTIRVTKEKVKIRDQVIDEGELVRVWIASANRDEEVFKDPDSFIP -----------------------------------------------3333--1111-11 DRTPNPHLSFGSGIHLCLGAPLARLEARIALEEFAKKFRVKEIVKKEKIDNEVLNGYRKL 11-----1111-11111111------------------------------1111------ VVRVERT ------- >INTERSECTIN 2; SWP:Q9NZM3; PDB:1UE9A; GSSGSSGEIAQVTSAYVASGSEQLSLAPGQLILILKKNTSGWWQGELQARGKKRQKGWFP -----------------------------------------------------------1 ASHVKLLGPSSERASGPSSG 111----------------- >ELONGATION FACTOR P; SWP:Q76G20; PDB:1UEBA; MISVTDLRPGTKVKMDGGLWECVEYQHQKLGRGGAKVVAKFKNLETGATVERTFNSGEKL --1111--------iiii-----------!!!!---------------------1111-- EDIYVETRELQYLYPEGEEMVFMDLETYEQFAVPRSRVVGAEFFKEGMTALGDMYEGQPI ---------------!!!!--------------3333--1111-2222------iiii-- KVTPPTVVELKVVDTPPGVRGDTVSGGSKPATLETGAVVQVPLFVEPGEVIKVDTRTGEY --------------------------------1111-----11112222----------- VGRA ---- >P450 MONOOXYGENASE; SWP:Q8RN03; PDB:1UEDA; DIDQVAPLLREPANFQLRTNCDPHEDNFGLRAHGPLVRIVGESSTQLGRDFVWQAHGYEV -------------2222-!!!!----------------------1111------------ VRRILGDHEHFTTRPQFEAQFVGQISTYDPPEHTRLRKMLTPEFTVRRIRRMEPAIQSLI -----------------1111--3333------------3333----------------- DDRLDLLEAEGPSADLQGLFADPVGAHALCELLGIPRDDQREFVRRIRRNARGLKARAAD ----------22223333-----------------1111--------------------- SAAFNRYLDNLLARQRADPDDGLLGMIVRDHGDNVTDEELKGLCTALILGGVETVAGMIG ------------------------------!!!!-------------------------- FGVLALLDNPGQIELLFESPEKAERVVNELVRYLSPVQAPNPRLAIKDVVIDGQLIKAGD --------3333--1111--------------------------------iiii--2222 YVLCSILMANRDEALTPDPDVLDANRAAVSDVGFGHGIHYCVGAALARSMLRMAYQTLWR ------3333-3333--1111-1111-----1111-11111111---------------- RFPGLRLAVPIEEVKYRSAFVDCPDQVPVTW -1111----1111------------------ >UNDECAPRENYL PYROPHOSPHAT; 
SWP:Q47675; PDB:1UEHA; LPAHGCRHVAIIMDGNGRWAKKQGKIRAFGHKAGAKSVRRAVSFAANNGIEALTLYAFSM -1111---------------1111-3333----------------1111----------3 ELFVWALDSEVKSLHRHNVRLRIIGDTSRFNSRLQERIRKSEALTAGNTGLTLNIAANYG 333--3333-----------------11113333----------1111------------ GRWDIVQGVRQLAEKVQQGNLQPDQIDEEMLNQHVCMHELAPVDLVIRTGGEHRISNFLL ---------------------1111---------2222----------------%%%%-3 WQIAYAELYFTDVLWPDFDEQDFEGALNAFANRE 333----------3333------------1111- >4-(cytidine 5'-diphospho); SWP:P83700; PDB:1UEKA; MERLAPAKVNLGLSVRFRREDGYHELHTLFAPFSLADRLVVEPVSSGLHFQGPYGRENLA ------------------1111-----------------------------2222----- YRAASLYLEAAGQPGGVRILLEKRIPEGAGLGGGSSDAAQVLLALQALYPAEVDLFALAR --------1111------------------------------------------------ TLGADVPFFLLGRGAEARGVGERLKPLALPPVPAVVFFPGLRVPTPLVYRAVRPEDFGPD --!!!!3333-------!!!!----------------------3333-11113333---- LPVEAILEALARGEEPPYWNSLEGPAFRLFPELKEVRGRMRALGLRGVLMSGSGSAFFGL -----------------------------3333-------1111------!!!!------ AEGPDHARRAAEALRAWGRAWAGTLGGG ---------------------------- >UV EXCISION REPAIR PROTEI; SWP:P54727; PDB:1UELA; MQVTLKTLQQQTFKIDIDPEETVKALKEKIESEKGKDAFPVAGQKLIYAGKILNDDTALK ------1111-------33333333--------------3333----iiii--1111333 EYKIDEKNFVVVMVTKPKAVSTPAPATLEHHHHHH 3---------------------------------- >KIAA1568 PROTEIN; SWP:Q9HCK4; PDB:1UEMA; GSSGSSGKNYDLSDLPGPPSKPQVTDVTKNSVTLSWQPGTPGTLPASAYIIEAFSQSVSN ------------------------------------------------------1111-- SWQTVANHVKTTLYTVRGLRPNTIYLFMVRAINPQGLSDPSPMSDPVRTQDSGPSSG --------------------------------------------------------- >KIAA0343 PROTEIN; SWP:Q92823; PDB:1UENA; GSSGSSGHSGEDLPMVAPGNVRVNVVNSTLAEVHWDPVPLKSIRGHLQGYRIYYWKTQSS --------------------------------------3333------------------ SKRNRRHIEKKILTFQGSKTHGMLPGLEPFSHYTLNVRVVNGKGEGPASPDRVFNTPEGS ------------------------------------------------------------ GPSSG ----- >PENAEIDIN-3A; SWP:P81058; PDB:1UEOA; QVYKGGYARPIPRPPPFVRPLPGGPIGPYNGCPVSCRGISFSQARSCCSRLGRCCHVGKG 
--------------------------------3333------------------------ YSG --- >Membrane Associated Guany; SWP:Q86UL8; PDB:1UEPA; GSSGSSGYKELDVHLRRMESGFGFRILGGDEPGQPILIGAVIAMGSADRDGRLHPGDELV -----------------------------------------2222-3333---2222--- YVDGIPVAGKTHRYVIDLMHHAARNGQVNLTVRRKVLSGPSSG ------2222------------3333----------------- >MEMBRANE ASSOCIATED GUANY; SWP:Q86UL8; PDB:1UEQA; GSSGSSGLFTRDASQLKGTFLSTTLKKSNMGFGFTIIGGDEPDEFLQVKSVIPDGPAAQD ------------1111-----------------------------------2222--111 GKMETGDVIVYINEVCVLGHTHADVVKLFQSVPIGQSVNLVLCRGYPLPFDPEDPANSGP 1---------------11113333--------2222------------------------ SSG --- >SUPEROXIDE DISMUTASE; SWP:P19665; PDB:1UESA; MTHELISLPYAVDALAPVISKETVEFHHGKHLKTYVDNLNKLIIGTEFENADLNTIVQKS ----------1111--------------------------1111-1111----------- EGGIFNNAGQTLNHNLYFTQFRPGKGGAPKGKLGEAIDKQFGSFEKFKEEFNTAGTTLFG -----------------1111--------------------------------------- SGWVWLASDANGKLSIEKEPNAGNPVRKGLNPLLTFDVWEHAYYLTYQNRRADHLKDLWS --------1111-------!!!!--1111---------3333----!!!!------3333 IVDWDIVESRY -------1111 >MEMBRANE ASSOCIATED GUANY; SWP:Q86UL8; PDB:1UEWA; GSSGSSGSLQTSDVVIHRKENEGFGFVIISSLNRPESGSTITVPHKIGRIIDGSPADRCA ------------------1111----------------------------2222-1111- KLKVGDRILAVNGQSIINMPHADIVKLIKDAGLSVTLRIIPQEELNSPSGPSSG ----------iiii-----3333------------------------------- >KIAA0343 PROTEIN; SWP:Q92823; PDB:1UEYA; GSSGSSGPTPAPVYDVPNPPFDLELTDQLDKSVQLSWTPGDDNNSPITKFIIEYEDAMHK -------------------------------------------------------1111- PGLWHHQTEVSGTQTTAQLNLSPYVNYSFRVMAVNSIGKSLPSEASEQYLTKASEPDKNP ------------------------------------------------------------ TSGPSSG ------- >KIAA1526 PROTEIN; SWP:Q9P202; PDB:1UEZA; GSSGSSGEVRLVSLRRAKAHEGLGFSIRGGSEHGVGIYVSLVEPGSLAEKEGLRVGDQIL -----------------------------3333--------------3333--------- RVNDKSLARVTHAEAVKALKGSKKLVLSVYSAGRISGPSSG ----------3333--------------------------- >KIAA1526 PROTEIN; SWP:Q9P202; PDB:1UF1A; 
GSSGSSGDRRSTLHLLQGGDEKKVNLVLGDGRSLGLTIRGGAEYGLGIYITGVDPGSEAE ------------------------------------------------------------ GSGLKVGDQILEVNGRSFLNILHDEAVRLLKSSRHLILTVKDVGRLPHARTTVDETKWIA ------------%%%%-----------3333----------------------3333--- SSSGPSSG -------- >HYPOTHETICAL PROTEIN TT15; SWP:Q5SI91; PDB:1UF3A; RRTVRYILATSNPGDLEALEKFVKLAPDTGADAIALIGNLPKAAKSRDYAAFFRILSEAH --------------------------1111----------33333333--------3333 LPTAYVPGPQDAPIWEYLREAANVELVHPERNVHETFTFWRGPYLVAGVGGEIADEGEPE -------1111-3333----------------2222------------------------ EHEALRYPAWVAEYRLKALWELKDYPKIFLFHTPYHKGLNEQGSHEVAHLIKTHNPLLVL -------3333-----3333-------------------1111----------------- VAGKGQKHELGASWVVVPGDLSEGEYSLLDLRARKLETGNVR ---------!!!!-----------------1111-------- >N-CARBAMYL-D-AMINO ACID A; SWP:P60327; PDB:1UF5A; TRQMILAVGQQGPIARAETREQVVVRLLDMLTKAASRGANFIVFPELALTTFFPRWHFTD --------------1111----------------1111--------1111--1111---- EAELDSFYETEMPGPVVRPLFEKAAELGIGFNLGYAELVVEGGVKRRFNTSILVDKSGKI ----1111-----3333-----------------------iiii---------------- VGKYRKIHLPGHKEYEAYRPFQHLEKRYFEPGDLGFPVYDVDAAKMGMFIANDRRWPEAW ---------------3333----3333-------------%%%%------3333------ RVMGLRGAEIICGGYNTPTHNPPVPQHDHLTSFHHLLSMQAGSYQNGAWSAAAGKAGMEE ---1111-------------33331111---------------1111-----------ii NCMLLGHSCIVAPTGEIVALTTTLEDEVITAAVDLDRCRELREHIFNFKQHRQPQHYGLI ii---------1111--------------------------------3333-33333333 AEL --- >TT1252 PROTEIN; SWP:Q56416; PDB:1UF9A; KHPIIIGITGNIGSGKSTVAALLRSWGYPVLDLDALAARARENKEEELKRLFPEAVVGGR ----------2222---------1111-----------------3333---3333-iiii LDRRALARLVFSDPERLKALEAVVHPEVRRLLEELSRLEAPLVFLEIPLLFEKGWEGRLH --------1111--------------------3333-------------3333-1111-- GTLLVAAPLEERVRRVARSGLSREEVLARERAQPEEEKRKRATWVLENTGSLEDLERALK -----------------------3333-----------1111------------------ AVLAELTG 3333---- >TT1467 PROTEIN; SWP:Q5SH28; PDB:1UFAA; ARFALVLHAHLPYVRAHGWPFGEETLYEAAETYLPLIRVLERLRAEGVEAPFTLGITPIL 
-------------2222----3333----------------------------------- AEQLADARIKEGFWAYAKDRLERAQGDYQRYRGTALEASARHQVAFWELTLDHFQRLSGD --1111------------------------2222----------------------%%%% LVAAFRKAEEGGQVELITSNATHGYSPLLGYDEALWAQIKTGVSTYRRHFAKDPTGFWLP -------------------1111-3333-------------------------------- EAYRPKGPWKPPVEGPPEGVRPGVDELLRAGIRYTFVDAHLVQGGEPLSPVESQEATYHV ----------------------3333-----------3333------------3333--- HELESGLRVLARNPETTLQVWSADYGYPGEGLYREFHRKDPLSGLHHWRVTHRKADLAEK --3333-------3333--------33333333------------------33333333- APYDPEAAFAKTEEHARHFVGLLERLAGRHPEGVILSPYDAELFGHWWYEGVAWLEAVLR -----------------------------1111------3333----------------- LLAQNPKVRPVTAREAVQGPAVRTALPEGSWGRGGDHRVWLNEKTLDYWEKVYRAEGARE 3333-------3333----------------2222-3333-3333--------------- AARRGVLPEGVLRQARELLLLEASDWPFLETGQAEAYARERYEEHARAFFHLLKGASPEE ------------------------------------------------------------ LRALEERDNPFPEADPRLYLF --------------3333--- >TT1696 PROTEIN; SWP:P83963; PDB:1UFBA; MNRARDWLEQARHNLRHAQGSLGLGDYAWACFAAQQAAEAALKGLHLARGQVAWGHSILD iiii-----------------1111---------------------1111---------- LLADLPEDVDVPEDLVEAAKVLDKYYIPTRYPDAHPAGPAARHYTRLEAEEALDLAQKIL -11111111--3333-----3333--33331111----3333------------------ AFVEEKL ------- ------------------------------------------------------------ --------------------------------- >LAMIN A; SWP:P48678; PDB:1UFGA; GSSGSSGQSQGGGSVTKKRKLESSESRSSFSQHARTSGRVAVEEVDEEGKFVRLRNKSNE ---------------------------------------------3333----------- DQSMGNWQIRRQNGDDPLMTYRFPPKFTLKAGQVVTIWASGAGATHSPPTDLVWKAQNTW -----------------------------2222--------------------------- GCGSSLRTALINSTGEEVAMRKLVRSGPSSG ------------------------------- >MAJOR CENTROMERE AUTOANTI; SWP:P07199; PDB:1UFIA; HMPVPSFGEAMAYFAMVKRYLTSFPIDDRVQSHILHLEHDLVHVTRKN -------------------1111------------------------- ------------------------------------------------------------ ------------------------ >PUTATIVE NUCLEAR PROTEIN ; SWP:Q8BVK9; PDB:1UFNA; 
GSSGSSGNDAVDFSPTLPVTCGKAKGTLFQEKLKQGASKKCIQNEAGDWLTVKEFLNEGG ---------3333-------!!!!----33333333-------1111--------3333- RATSKDWKGVIRCNGETLRHLEQKGLLFSGPSSG 1111--------iiii------------------ >HYPOTHETICAL PROTEIN TT16; SWP:P83821; PDB:1UFOA; RVRTERLTLAGLSVLARIPEAPKALLLALHGLQGSKEHILALLPGYAERGFLLLAFDAPR --------iiii------------------2222--------22221111-------222 HGEREGPPPSSKSPRYVEEVYRVALGFKEEARRVAEEAERRFGLPLFLAGGSLGAFVAHL 2--------3333----------------------------------------------- LLAEGFRPRGVLAFIGSGFPKLPQGQVVEDPGVLALYQAPPATRGEAYGGVPLLHLHGSR -1111-----------------2222-------------33333333%%%%------111 DHIVPLAREKTLEALRPHYPEGRLARFVEEGAGHTLTPLARVGLAFLEHWLEAR 1---3333------33331111-------------------------------- >PYR MRNA-BINDING ATTENUAT; SWP:P83822; PDB:1UFRA; RFKAELNAPERRALYRIAHEIVEANKGTEGLALVGIHTRGIPLAHRIARFIAEFEGKEVP ------3333-----------------2222----------------------------- VGVLDITLPQVRETRIPFDLTGKAIVLVDDVLYTGRTARAALDALIDLGRPRRIYLAVLV -------------------2222----------------------1111----------- DRGHRELPIRADFVGKNVPTSRSEVVKVKVEEVDGEDRVELWER --------------------3333-------------------- >SYNAPTOJANIN 2; SWP:O15056; PDB:1UFWA; GSSGSSGSSFQGPLDATVVVNLQSPTLEEKNEFPEDLRTELMQTLGSYGTIVLVRINQGQ ------------3333-----------------3333----------------------- MLVTFADSHSALSVLDVDGMKVKGRAVKISGPSSG -------3333-----3333-iiii---------- >KIAA1526 PROTEIN; SWP:Q9P202; PDB:1UFXA; GSSGSSGTLVRVKKSAATLGIAIEGGANTRQPLPRIVTIQRGGSAHNCGQLKVGHVILEV ------------------------------------------------------------ NGLTLRGKEHREAARIIAEAFKTKDRDYIDFLVTEFNSGPSSG ----2222------------------------------3333- >CHORISMATE MUTASE; SWP:Q84FH6; PDB:1UFYA; MVRGIRGAITVEEDTPEAIHQATRELLLKMLEANGIQSYEELAAVIFTVTEDLTSAFPAE -------------------------------1111--3333--------1111---3333 AARQIGMHRVPLLSAREVPVPGSLPRVIRVLALWNTDTPQDRVRHVYLREAVRLRPDLES --11111111---------2222---------------1111-------3333-3333-- A - >HYPOTHETICAL PROTEIN BAB2; SWP:Q69ZS7; PDB:1UFZA; GSSGSSGEYGYEDLRESSNSLLNHQLSEIDQARLYSCLDHMREVLGDAVPDDILTEAILK 
------------------------------------------------------------ HKFDVQKALSVVLEQDGSGPSSG ---3333--------3333---- >SPLICING FACTOR 4; SWP:Q8CH02; PDB:1UG0A; GSSGSSGEEDYEQWLEIKVSPPEGAETRRVIEKLARFVAEGGPELEKVAMEDYKDNPAFT ---------11113333----3333-3333--------------------1111----33 FLHDKNSREFLYYRRKVAEIRKSGPSSG 33----3333------------------ >KIAA1010 PROTEIN; SWP:Q6XZF7; PDB:1UG1A; GSSGSSGASLLARYPPEKLFQAERNFNAAQDLDVSLLEGDLVGVIKKKDPMGSQNRWLID --------------1111-----------1111---1111-------------------- NGVTKGFVYSSFLKPYNPRRSHSDASSGPSSG --------3333-------------------- >2610100B20RIK GENE PRODUC; SWP:Q9DB00; PDB:1UG2A; GPSGSSGAGALPKASEATVCANNSKVSSTGEKVVLWTREADRVILTMCQEQGAQPHTFSV ------------------------------------------------------3333-- ISQQLGNKTPVEVSHRFRELMQLFHTACESGPSSG --3333--3333----------------------- >eukaryotic protein synthe; SWP:Q04637; PDB:1UG3A; SKAALSEEELEKKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRHGVESTLERSAI ----------------------------------33331111-----------1111--- AREHMGQLLHQLLCAGHLSTAQYYQGLYEILELAEDMEIDIPHVWLYLAELVTPILQEGG -----------------------------33333333---1111------------2222 VPMGELFREITKPLRPLGKAASLLLEILGLLCKSMGPKKVGTLWREAGLSWKEFLPEGQD --------1111--33333333---------------------------3333--22223 IGAFVAEQKVEYTLALPSEELNRQLEKLLKEGSSNQRVFDWIEANLSEQQIVSNTLVRAL 333------3333---------------------------------3333---------- MTAVCYSAIIFETPLRVDVAVLKARAKLLQKYLCDEQKELQALYALQALVVTLEQPPNLL -----1111--------------------1111--------------------------- RMFFDALYDEDVVKEDAFYSWES -------1111------------ >CYTOTOXIN 6; SWP:P80245; PDB:1UG4A; LKCNQLIPPFYKTCAAGKNLCYKMFMVAAPKVPVKRGCIDVCPKSSLLVKYVCCNTDRCN -------------------------2222----------------1111----------- >BETA-GLYCOSIDASE; SWP:Q8GEB3; PDB:1UG6A; NAEKFLWGVATSAYQIEGATQEDGRGPSIWDAFAQRPGAIRDGSTGEPACDHYRRYEEDI -----------3333------iiii--3333----22221111----!!!!--------- ALMQSLGVRAYRFSVAWPRILPEGRGRINPKGLAFYDRLVDRLLASGITPFLTLYHWDLP ---1111--------1111-1111-------------------1111------------3 
LALEERGGWRSRETAFAFAEYAEAVARALADRVPFFATLNEPWCSAFLGHWTGEHAPGLR 333---!!!!3333---------------------------------------------- NLEAALRAAHHLLLGHGLAVEALRAAGARRVGIVLNFAPAYGEDPEAVDVADRYHNRFFL -----------------------1111--------------------------------- DPILGKGYPESPFRDPPPVPILSRDLELVARPLDFLGVNYYAPVRVAPGTGTLPVRYLPP -1111----------------2222--1111----------------------------- EGPATAMGWEVYPEGLYHLLKRLGREVPWPLYVTENGAAYPDLWTGEAVVEDPERVAYLE ----1111---3333--------------------------------------------- AHVEAALRAREEGVDLRGYFVWSLMDNFEWAFGYTRRFGLYYVDFPSQRRIPKRSALWYR ---------3333---------------!!!!------------1111------------ ERIARA ------ >2610208M17RIK PROTEIN; SWP:NA; PDB:1UG7A; GSSGSSGMSEVTRSLLQRWGASLRRGADFDSWGQLVEAIDEYQILARHLQKEAQAQHNNS ---------3333----------------------------------------------- EFTEEQKKTIGKIATCLELRSAALQSTQSQEEFKLEDLKKLEPILKNILTYNKEFPFDVQ ----------------------------------3333-----3333------------- PISGPSSG -------- ------------------------------------------------------------ --------------------------- >URACIL-DNA GLYCOSYLASE IN; SWP:P14739; PDB:1UGIA; TNLSDIIEKETGKQLVIQESILMLPEEVEEVIGNKPESDILVHTAYDESTDENVMLLTSD ------------------------------------------------------------ APEYKPWALVIQDSNGENKIKML ------------1111------- >RIKEN CDNA 2310057J16 PRO; SWP:Q80VC9; PDB:1UGJA; GSSGSSGPRLYKEPSAKSNKFIIHNALSHCCLAGKVNEPQKNRILEEIEKSKANHFLILF -------------------3333----------3333----------------------- RDSSCQFRALYTLSGETEELSRLAGYGPRTVTPAMVEGIYKYNSDRKRFTQIPAKTMSMS -------------3333--------------3333----------------------111 VDAFTIQGHLWQSKKSGPSSG 1-------------------- >SYNAPTOTAGMIN IV; SWP:Q9H2B2; PDB:1UGKA; GSSGSSGLGTLFFSLEYNFERKAFVVNIKEARGLPAMDEQSMTSDPYIKMTILPEKKHKV ------------------1111-------------------------------------- KTRVLRKTLDPAFDETFTFYGIPYTQIQELALHFTILSFDRFSRDDIIGEVLIPLSGIEL -----------------------1111--------------------------------3 SEGKMLMNREIISGPSSG 333--------------- -------------------------------------------------- >Microtubule-associated pr; SWP:Q62625; 
PDB:1UGMA; KTFKQRRSFEQRVEDVRLIREQHPTKIPVIIERYKGEKQLPVLDKTKFLVPDHVNMSELI -3333-----------------1111-------1111-------------1111------ KIIRRRLQLNANQAFFLLVNGHSMVSVSTPISEVYESERDEDGFLYMVYASQE ---------1111-----iiii-----------------1111---------- >LEUKOCYTE IMMUNOGLOBULIN-; SWP:Q8NHL6; PDB:1UGNA; GHLPKPTLWAEPGSVITQGSPVTLRCQGGQETQEYRLYREKKTAPWITRIPQELVKKGQF ----------------2222--------1111-----------3333---33331111-- PIPSITWEHTGRYRCYYGSDTAGRSESSDPLELVVTGAYIKPTLSAQPSPVVNSGGNVTL -----3333----------1111-----------------------------2222---- QCDSQVAFDGFILCKEQCLNSSSRAIFSVGPVSPSRRWWYRCYAYDSNSPYEWSLPSDLL --------------------------------1111---------1111----------- ELLVLG ------ >BCL2-ASSOCIATED ATHANOGEN; SWP:NA; PDB:1UGOA; GSSGSSGMDMGNQHPSISRLQEIQREVKAIEPQVVGFSGLSDDKNYKRLERILTKQLFEI -----------------------------33331111-------3333-----------3 DSVDTEGKGDIQQARKRAAQETERLLKELEQNASGPSSG 333-%%%%------------------------------- >NITRILE HYDRATASE ALPHA S; SWP:Q7SID2; PDB:1UGPA; TENILRKSDEEIQKEITARVKALESMLIEQGILTTSMIDRMAEIYENEVGPHLGAKVVVK --1111---------------------1111----------------------------- AWTDPEFKKRLLADGTEACKELGIGGLQGEDMMWVENTDEVHHVVVCTLSYPWPVLGLPP -------------------1111--2222--------1111----------3333----3 NWFKEPQYRSRVVREPRQLLKEEFGFEVPPSKEIKVWDSSSEMRFVVLPQRPAGTDGWSE 333---33331111--------------1111----------------------222233 EELATLVTRESMIGVEPAKAV 33-11113333---------- >Cobalt-containing nitrile; SWP:Q7SID3; PDB:1UGPB; MNGVYDVGGTDGLGPINRPADEPVFRAEWEKVAFAMFPATFRAGFMGLDEFRFGIEQMNP --33332222----------------3333------------------------1111-- AEYLESPYYWHWIRTYIHHGVRTGKIDLEELERRTQYYRENPDAPLPEHEQKPELIEFVN ------3333------------------------------1111-------3333----- QAVYGGLPASREVDRPPKFKEGDVVRFSTASPKGHARRARYVRGKTGTVVKHHGAYIYPD -------------------2222---------------3333------------------ TAGNGLGECPEHLYTVRFTAQELWGPEGDPNSSVYYDCWEPYIELV 3333--------------3333--1111----------3333---- >OLYGOPHRENIN-1 LIKE PROTE; SWP:Q9UNA1; PDB:1UGVA; 
GSSGSSGTPFRKAKALYACKAEHDSELSFTAGTVFDNVHPSQEPGWLEGTLNGKTGLIPE ----------------------3333---2222-------------------------11 NYVEFLSGPSSG 11---------- >AGGLUTININ ALPHA CHAIN; SWP:P18670; PDB:1UGXA; GKAFDDGAFTGIREINLSYNKETAIGDFQVVYDLNGSPYVGQNHVSFITGFTPVKISLDF -------------------1111----------iiii----------------------- PSEYIMEVSGYTGNVSGYVVVRSLTFKTNKKTYGPYGVTSGTPFNLPIENGLIVGFKGSI --------------iiii------------------------------------------ GYWLDYFSMYLSL ------------- >HIZOPUSPEPSIN I; SWP:Q02016; PDB:1UH7A; AGVGTVPMTDYGNDIEYYGQVTIGTPGKKFNLDFDTGSSDLWIASTLCTNCGSRQTKYDP -2222----------------------------------------------1111---33 NQSSTYQADGRTWSISYGDGSSASGILAKDNVNLGGLLIKGQTIELAKREAASFASGPND 331111----------1111-------------iiii--------------3333----- GLLGLGFDTITTVRGVKTPMDNLISQGLISRPIFGVYLGKAKNGGGGEYIFGGYDSTKFK ------1111--2222-----------------------3333-----------1111-- GSLTTVPIDNSRGWWGITVDRATVGTSTVASSFDGILDTGTTLLILPNNIAASVARAYGA ---------1111----------!!!!----------1111--------------1111- SDNGDGTYTISCDTSRFKPLVFSINGASFQVSPDSLVFEEFQGQCIAGFGYGNWDFAIIG -------------1111------iiii----3333-----iiii---------------3 DTFLKNNYVVFNQGVPEVQIAPVAE 3331111------------------ >LECTIN-D2; SWP:P83790; PDB:1UHAA; APECGERASGKRCPNGKCCSQWGYCGTTDNYCGQGCQSQCDYWRCGRDFGGRLCEEDMCC ----1111-----%%%%--1111----1111-2222----1111-1111-----%%%%-- SKYGWCGYSDDHCEDGCQSQCD 1111----3333-2222----- >KIAA1010 PROTEIN; SWP:Q6XZF7; PDB:1UHCA; GSSGSSGSEAEGNQVYFAVYTFKARNPNELSVSANQKLKILEFKDVTGNTEWWLAEVNGK -------------------------3333---2222------------------------ KGYVPSNYIRKTESGPSSG ----1111----------- >ASPARTATE 1-DECARBOXYLASE; SWP:P56065; PDB:1UHEA; ITIDEDLAKLAKLREGMKVEIVDVNNGERFSTYVILGKKRGEICVNGAAARKVAIGDVVI --------1111-2222---------------------2222----3333---2222--- ILAYASMNEDEINAHKPSIVLVDEKNEILEKGLEHHH ----------------------1111----------- >INTERSECTIN 2; SWP:Q9NZM3; PDB:1UHFA; GSSGSSGGEEYIALYPYSSVEPGDLTFTEGEEILVTQKDGEWWTGSIGDRSGIFPSNYVK ---------------------------2222-----------------------1111-- PKDSGPSSG 
--------- >OVALBUMIN; SWP:P01012; PDB:1UHGA; GSIGAASMEFCFDVFKELKVHHANENIFYCPIAIMSALAMVYLGAKDSTRTQINKVVRFD ----------------3333-2222----------------1111--------------- KLPGFGDIEAQCGTSVNVHSSLRDILNQITKPNDVYSFSLASRLYAEERYPILPEYLQCV -2222--3333-----2222-------1111---------------3333---------- KELYRGGLEPINFQTAADQARELINSWVESQTNGIIRNVLQPSVDSQTAMVLVNAIVFKG ------------1111---------------iiii---------1111------------ LWEKAFKDEDTQAMPFRVTESKPVQMMYQIGLFRVASMASEKMKILELPFAGTMSMLVLL ------3333---------------------------3333------------------- PDEVSGLEQLESIINFEKLTEWTSSNVMEERKIKVYLPRMKMEEKYNLTSVLMAMGITDV ------------------------3333------------------------1111-333 FSSSANLSGISSAELKISQAVHAAHAEINEAGREVVGAEAGVDAASVSEEFRADHPFLFC 31111------------------------------------------------------- IKHIATNAVLFFGRCVSP ---1111----------- >AEQUORIN 2; SWP:P02592; PDB:1UHKA; NSKLTSDFDNPRWIGRHKHMFNFLDVNHNGKISLDEMVYKASDIVINNLGATPEQAKRHK ------1111--------------1111-------------------------------- DAVEAFFGGAGMKYGVETDWPAYIEGWKKLATDELEKYAKNEPTLIRIWGDALFDIVDKD -------1111-2222---------------------1111----------------111 QNGAITLDEWKAYTKAAGIIQSSEDCEETFRVCDIDESGQLDVDEMTRQHLGFWYTMDPA 1-------------------------3333-----1111------------------333 CEKLYGGAVP 3-1111---- >Oxysterols receptor LXR-a; SWP:Q13133; PDB:1UHLB; MSPEQLGMIEKLVAAQQQCNRRSFEARQQRFAHFTELAIVSVQEIVDFAKQLPGFLQLSR -----------------------------------------------------3333-33 EDQIALLKTSAIEVMLLETSRRYNPGSESITDFSYNREDFAKAGLQVEFINPIFEFSRAM 33---------------------------------------------------------- NELQLNDAEFALLIAISIFSADRPNVQDQLQVERLQHTYVEALHAYVSIHHPHDRLMFPR 3333-3333------33331111----3333-------------------1111---333 MLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDV 3--------------------1111-----3333----- >CALCINEURIN B-LIKE PROTEI; SWP:Q8LAS7; PDB:1UHNA; DPELLARDTVFSVSEIEALYELFKKISSAVIDDGLINKEEFQLALFKTNKKESLFADRVF -------------------------1111-------------1111--3333-------3 DLFDTKHNGILGFEEFARALSVFHPNAPIDDKIHFSFQLYDLKQQGFIERQEVKQMVVAT 
3331111------------333311113333---------1111---------------- LAESGMNLKDTVIEDIIDKTFEEADTKHDGKIDKEEWRSLVLRHPSLLKNMTLQYLKDIT -1111------------------------------------------1111------111 TTFPSFVFH 1-------- >HYPOTHETICAL PROTEIN KIAA; SWP:Q9UPQ7; PDB:1UHPA; GSSGSSGKSLTLVLHRDSGSLGFNIIGGRPSVDNHDGSSSEGIFVSKIVDSGPAAKEGGL ------------------------------------------------------------ QIHDRIIEVNGRDLSRATHDQAVEAFKTAKEPIVVQVLRRTSGPSSG --------iiii-1111------------------------------ >SWI/SNF related, matrix a; SWP:Q61466; PDB:1UHRA; GSSGSSGQPPQFKLDPRLARLLGIHTQTRPVIIQALWQYIKTHKLQDPHEREFVLCDKYL -------------------3333----1111---------1111--------------33 QQIFESQRMKFSEIPQRLHALLMPPEPSGPSSG 33-------111111113333------------ >EXPRESSED PROTEIN; SWP:NA; PDB:1UHTA; GSSGSSGMVTPSLRLVFVKGPREGDALDYKPGSTIRVGRIVRGNEIAIKDAGISTKHLRI -------------------1111------2222--------------------------- ESDSGNWVIQDLGSSNGTLLNSNALDPETSVNLGDGDVIKLGEYTSILVNFVSGPSSG -------------------------1111----------------------------- >PRODUCT OF RIKEN CDNA 311; SWP:NA; PDB:1UHUA; GSSGSSGTPLSLTLDHWSEIRSRAHNLSVEIKKGPWRTFCASEWPTFDVGWPPEGTFDLT -------------------------------------1111-1111-----1111----- VIFEVKAIVFQDGPGSHPDQQPYITVWQDLVQNSPPWIKSGPSSG ------------3333---3333-----------1111------- >BETA-XYLOSIDASE; SWP:P36906; PDB:1UHVA; MIKVRVPDFSDKKFSDRWRYCVGTGRLGLALQKEYIETLKYVKENIDFKYIRGHGLLCDD ---------------3333------3333------------1111---------111133 VGIYREDVVGDEVKPFYNFTYIDRIFDSFLEIGIRPFVEIGFMPKKLASGTQTVFYWEGN 33----------------3333---------------------3333-------1111-- VTPPKDYEKWSDLVKAVLHHFISRYGIEEVLKWPFEIWNEPNLKEFWKDADEKEYFKLYK ----------------------------3333-------1111---2222---------- VTAKAIKEVNENLKVGGPAICGGADYWIEDFLNFCYEENVPVDFVSRHAYTSKQGEYTPH ---------1111----------3333--------------------------------- LIYQEIMPSEYMLNEFKTVREIIKNSHFPNLPFHITEYNTSYSPQNPVHDTPFNAAYIAR ----------------------1111-1111--------------3333----------- ILSEGGDYVDSFSYWTFSDVFEERDVPRSQFHGGFGLVALNMIPKPTFYTFKFFNAMGEE 
---3333--------------------------------%%%%-3333----3333---- MLYRDEHMLVTRRDDGSVALIAWNEVMDKTENPDEDYEVEIPVRFRDVFIKRQLIDEEHG ------------1111---------------------------------------1111- NPWGTWIHMGRPRYPSKEQVNTLREVAKPEIMTSQPVANDGYLNLKFKLGKNAVVLYELT ------1111----------------------------%%%%------------------ ERIDESSTYIGLDDSKINGY ----33332222----2222 >PLECKSTRIN; SWP:Q9JHK5; PDB:1UHWA; GSSGSSGLGALYLSMKDPEKGIKELNLEKDKKVFNHCLTGSGVIDWLVSNKLVRNRQEGL --------------------------------------------------------3333 MISASLLSEGYLQPAGDLSKNAADGIAENPFLDSPDAFYYFPDSGPSSG ------------------------------------------------- >STAUFEN (RNA BINDING PROT; SWP:Q8CJ67; PDB:1UHZA; GSSGSSGPISRLAQIQQARKEKEPDYILLSERGMPRRREFVMQVKVGNEVATGTGPNKKI ----------------3333--------------------------------------33 AKKNAAEAMLLQLGYKASTSLQDSGPSSG 33--------------------------- >URACIL-DNA GLYCOSYLASE; SWP:Q7WYV4; PDB:1UI0A; TLELLQAQAQNCTACRLMEGRTRVVFGEGNPDAKLMIVGEGPGEEEDKTGRPFVGKAGQL --------1111---3333----------------------------------------- LNRILEAAGIPREEVYITNIVKCRPPQNRAPLPDEAKICTDKWLLKQIELIAPQIIVPLG -----1111-3333-----------%%%%------------------------------- AVAAEFFLGEKVSITKVRGKWYEWHGIKVFPMFHPAYLLRNPSRAPGSPKHLTWLDIQEV ------------33332222---iiii------3333-------2222------------ KRALDALPPKER ---1111----- >A-FACTOR RECEPTOR HOMOLOG; SWP:O66122; PDB:1UI5A; LRAEQTRATIIGAAADLFDRRGYESTTLSEIVAHAGVTKGALYFHFAAKEDLAHAILEIQ -3333-----------------1111---------------3333--3333--------- SRTSRRLAKDLDGRGYSSLEALRLTFGARLCVQGPVLRAGLRLATAGVPVRPLPHPFTEW ---------1111---3333---------------------------------------- REIATSRLLDAVRQSDVHQDIDVDSVAHTLVCSVVGTRVVREPRRLAEWYILIRGVPVTR -----------1111--1111--------------3333-----------------3333 RARYVTLAARLEQET --------------- ------------------------------------------------------------ - >BETA SUBUNIT OF BETA CONG; SWP:P25974; PDB:1UIJA; REDENNPFYFRSSNSFQTLFENQNGRIRLLQRFNKRSPQLENLRDYRIVQFQSKPNTILL -----1111-3333-------1111------1111-1111--1111-------------- 
PHHADADFLLFVLSGRAILTLVNNDDRDSYNLHPGDAQRIPAGTTYYLVNPHDHQNLKMI --------------------------------2222----2222---------------- WLAIPVNKPGRYDDFFLSSTQAQQSYLQGFSHNILETSFHSEFEEINRVLFGEEEEQRQQ -------2222---------------3333---------------------11111111- EGVIVELSKEQIRQLSRRAKSSSRKTISSEDEPFNLRSRNPIYSNNFGKFFEITPEKNPQ ---------3333---------3333------------------1111-----3333-33 LRDLDIFLSSVDINEGALLLPHFNSKAIVILVINEGDANIELVGIKLEVQRYRAELSEDD 33-----------2222---------------------------------------2222 VFVIPAAYPFVVNATSNLNFLAFGINAENNQRNFLAGEKDNVVRQIERQVQELAFPGSAQ ----2222------------------2222-----------------3333------333 DVERLLKKQRESYFVDA 3---------------- >ALPHA PRIME SUBUNIT OF BE; SWP:Q7XXT2; PDB:1UIKA; NPFHFNSKRFQTLFKNQYGHVRVLQRFNKRSQQLQNLRDYRILEFNSKPNTLLLPHHADA 1111-3333------1111------1111----1111----------------------- DYLIVILNGTAILTLVNNDDRDSYNLQSGDALRVPAGTTYYVVNPDNDENLRMITLAIPV --------------------------2222----2222---------------------- NKPGRFESFFLSSTQAQQSYLQGFSKNILEASYDTKFEEINKVLFGQESVIVEISKKQIR -2222--------1111---3333------------------------------------ ELSKHAKSSSRKTISSEDKPFNLRSRDPIYSNKLGKLFEITPEKNPQLRDLDVFLSVVDM 1111-----3333--------1111------1111-----3333---------------- NEGALFLPHFNSKAIVVLVINEGEANIELVGIPLEVRKYRAELSEQDIFVIPAGYPVVVN 2222---------------------------------------2222----2222----- ATSDLNFFAFGINAENNQRNFLAGSKDNVISQIPSQVQELAFPGSAKDIENLIKSQSESY ---------------------------3333----------------------------- FVDA ---- >DOUBLE-STRANDED RNA-BINDI; SWP:O70133; PDB:1UILA; GSSGSSGLESEEVDLNAGLHGNWTLENAKARLNQYFQKEKIQGEYKYTQVGPDHNRSFIA ---------------3333----3333------------------------3333----- EMTIYIKQLGRRIFAREHGSNKKLAAQSCALSLVRQLYHLGVIEAYSSGPSSG ----------------------------------------------------- >POLYAMINE AMINOPROPYLTRAN; SWP:P83816; PDB:1UIRA; MDYGMYFFEHVTPYETLVRRMERVIASGKTPFQDYFLFESKGFGKVLILDKDVQSTERDE -----------1111--------------1111----------------------1111- YIYHETLVHPAMLTHPEPKRVLIVGGGEGATLREVLKHPTVEKAVMVDIDGELVEVAKRH 
-------------------------3333--------1111------------------- MPEWHQGAFDDPRAVLVIDDARAYLERTEERYDVVIIDLTDPVGEDNPARLLYTVEFYRL 3333iiii--1111-----------------------------11113333--------- VKAHLNPGGVMGMQTGMILLRVHPVVHRTVREAFRYVRSYKNHIPGFFLNFGFLLASDAF -11112222--------------------1111----------3333------------- DPAAFSEGVIEARIRERNLALRHLTAPYLEAMFVLPKDLLEALEKETMVSTDQNPFYVTP 1111-2222---------------------1111----------------3333----11 EGEARQAPY 11------- >RED FLUORESCENT PROTEIN F; SWP:NA; PDB:1UISA; HMNSLIKENMRMMVVMEGSVNGYQFKCTGEGDGNPYMGTQTMRIKVVEGGPLPFAFDILA 1111---------------iiii----------1111-----------------333311 TSFSKTFIKHTKGIPDFFKQSFPEGFTWERVTRYEDGGVFTVMQDTSLEDGCLVYHAKVT 11-3333--------3333--------------1111-----------iiii-------- GVNFPSNGAVMQKKTKGWEPNTEMLYPADGGLRGYSQMALNVDGGGYLSCSFETTYRSKK ----1111-------------------iiii----------2222--------------- TVENFKMPGFHFVDHRLERLEESDKEMFVVQHEHAVAKFCDLP 1111-------------------%%%%---------------- >HUMAN DISCS LARGE 5 PROTE; SWP:Q8TDM6; PDB:1UITA; GSSGSSGGERRKDRPYVEEPRHVKVQKGSEPLGISIVSGEKGGIYVSKVTVGSIAHQAGL -------------------------------------------------2222------- EYGDQLLEFNGINLRSATEQQARLIIGQQCDTITILAQYNPHVHQLSSHSRSGPSSG ------------3333-3333--3333------------------------------ ------------------------------------------------------------ --- >ENOYL-COA HYDRATASE; SWP:P83702; PDB:1UIYA; VQVEKGHVAVVFLNDPERRNPLSPEALSLLQALDDLEADPGVRAVVLTGRGKAFSAGADL ---------------3333-------------------1111----------------33 AFLERVTELGAEENYRHSLSLRLFHRVYTYPKPTVAAVNGPAVAGGAGLALACDLVVDEE 33-------3333--------------------------------------------111 ARLGYTEVKIGFVAALVSVILVRAVGEKAAKDLLLTGRLVEAREAKALGLVNRIAPPGKA 1----3333----3333--3333--3333------------------------------- LEEAKALAEEVAKNAPTSLRLTKELLLALPGGLEDGFRLAALANAWVRETGDLAEGIRAF ---------------------------3333----------------------------- FEKRPPRF -------- >MACROPHAGE MIGRATION INHI; SWP:Q76BK2; PDB:1UIZA; MPVFTIRTNVCRDSVPDTLLSDLTKQLAKATGKPAEYIAIHIVPDQIMSFGDSTDPCAVC 
----------3333-------------------3333------------%%%%------- SLCSIGKIGGPQNKSYTKLLCDILTKQLNIPANRVYINYYDLNAANVGWNGSTFA ------------------------------3333--------3333--iiii--- >DEUBIQUITINATING ENZYME U; SWP:O88811; PDB:1UJ0A; MARRVRALYDFEAVEDNELTFKHGELITVLDDSDANWWQGENHRGTGLFPSNFVTTDL --------------1111---2222----------------1111----1111----- >URIDINE-CYTIDINE KINASE 2; SWP:Q9BZX2; PDB:1UJ2A; EPFLIGVSGGTASGKSSVCAKIVQLLGQNEVDYRQKQVVILSQDSFYRVLTSEQKAKALK ---------2222-------------1111-3333------3333------------111 GQFNFDHPDAFDNELILKTLKEITEGKTVQIPVYDFVSHSRKEETVTVYPADVVLFEGIL 1-----3333------------1111---------1111--------------------1 AFYSQEVRDLFQMKLFVDTDADTRLSRRVLRDISERGRDLEQILSQYITFVKPAFEEFCL 111----3333-----------------------------------------------33 PTKKYADVIIPRGADNLVAINLIVQHIQDILNG 331111-----!!!!------------------ >Tissue factor [Precursor]; SWP:P13726; PDB:1UJ3B; QVQLLESGAVLARPGTSVKISCKASGFNIKDYYMHWVKQRPGQGLEWIGGNDPANGHSMY ------------2222-----------1111--------2222----------------- DPKFQGRVTITADTSTSTVFMELSSLRSEDTAVYYCARDSGYAMDYWGQGTLVTVSSAST 3333---------1111---------3333------------------------------ KGPSVFPLAPCSRSTSESTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLY -------------------------------------%%%%--2222-------3333-- SLSSVVTVPSSSLGTKTYTCNVDHKPSNTKVDKRVES --------1111------------1111--------- >RIBOSE 5-PHOSPHATE ISOMER; SWP:Q72J47; PDB:1UJ6A; RPLESYKKEAAHAAIAYVQDGVVGLGTGSTARYAVLELARRLREGELKGVVGVPTSRATE 1111----------1111---------3333----------------------------- ELAKREGIPLVDLPPEGVDLAIDGADEIAPGLALIKGGGALLREKIVERVAKEFIVIADH ---1111------1111-----------2222--------------------------33 TKKVPVLGRGPVPVEIVPFGYRATLKAIADLGGEPELRDGDEFYFTDGGHLIADCRFGPI 33---------------2222-------1111------!!!!---1111----------- GDPLGLHRALLEIPGVVETGLFVGATRALVAGPFGVEELLP ------------3333---------------1111------ >HYPOTHETICAL PROTEIN YFHJ; SWP:P37096; PDB:1UJ8A; SHHHHHHGSGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASNEK -----------1111-----------11113333-3333-------1111--3333---- 
ILEAILLVWLDEA ------------- >PHOSPHOHISTIDINE PHOSPHAT; SWP:P76502; PDB:1UJCA; MQVFIMRHGDAALDAASDSVRPLTTNGCDESRLMANWLKGQKVEIERVLVSPFLRAEQTL ----------------3333------------------1111----------3333---- EEVGDCLNLPSSAEVLPELTPCGDVGLVSAYLQALTNEGVASVLVISHLPLVGYLVAELC --3333---------11111111------------3333--------------------2 PGETPPMFTTSAIASVTLDESGNGTFNWQMSPCNLK 222-----2222------1111--------3333-- >KIAA0559 PROTEIN; SWP:Q9Y6V0; PDB:1UJDA; GSSGSSGHYIFPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGG ------------------------------------------------------------ SAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEICVRLDLNMSGPSSG 3333----------------------------------------------------- >ADP-RIBOSYLATION FACTOR B; SWP:Q9UJY5; PDB:1UJKA; EPAMEPETLEARINRATNPLNKELDWASINGFCEQLNEDFEGPPLATRLLAHKIQSPQEW 33331111---------1111------------3333-1111------------------ EAIQALTVLETCMKSCGKRFHDEVGKFRFLNELIKVVSPKYLGSRTSEKVKNKILELLYS -------------------------3333------------1111--------------- WTVGLPEEVKIAEAYQMLKKQGIVK ----1111----------1111--- >Potassium voltage-gated c; SWP:Q12809; PDB:1UJLA; AIGNMEQPHMDSRIGWLHNLGDQIGKPYNSSGLGGPSIKDKY --------------3333------------------------ >DEHYDROQUINATE SYNTHASE; SWP:P83703; PDB:1UJNA; MQRLEVREPVPYPILVGEGVLKEVPPLAGPAALLFDRRVEGFAQEVAKALGVRHLLGLPG ------------------3333-------------3333--------------------- GEAAKSLEVYGKVLSWLAEKGLPRNATLLVVGGGTLTDLGGFVAATYLRGVAYLAFPTTT 3333-------------1111-1111-------------------2222----------- LAIVDASVGGKTGINLPEGKNLVGAFHFPQGVYAELRALKTLPLPTFKEGLVEAFKHGLI ----3333-------1111-----------------1111-------------------- AGDEALLKVEDLTPQSPRLEAFLARAVAVKVRVTEEDPLEKGKRRLLNLGHTLGHALEAQ --3333--11111111--------------------1111-3333--2222-------11 TRHALPHGMAVAYGLLYAALLGRALGGEDLLPPVRRLLLWLSPPPLPPLAFEDLLPYLSL 11--------------------1111-----------------------33333333--- HWVVPLAPGRLVVRPLPEGLLREAFAAWREELKGLGLL ------2222------3333------------------ >TRANSGELIN; SWP:P37804; PDB:1UJOA; GSSGSSGEELEERLVEWIVVQCGPDVGRPDRGRLGFQVWLKNGVILSKLVNSLYPEGSKP 
-----------------------------------3333----------3333------- VKVPENPPSMVFKQMEQVAQFLKAAEDYGVIKTDMFQTVDLYEGKDMAAVQRTLMALGSL -------------------------3333-------33331111---------------- AVTKNDGNYRGDPNWFMKSGPSSG ------------------------ >TRYPTOPHAN SYNTHASE ALPHA; SWP:P16608; PDB:1UJPA; MTTLEAFAKARSEGRAALIPYLTAGFPSREGFLQAVEEVLPYADLLEIGLPYSDGPVIQR ----------------------2222------------3333------------------ ASELALRKGMSVQGALELVREVRALTEKPLFLMTYLNPVLAWGPERFFGLFKQAGATGVI -----1111-------------1111-----------------3333------------- LPDLPPDEDPGLVRLAQEIGLETVFLLAPTSTDARIATVVRHATGFVYAVSVEVKDLVRR 11113333-------------------1111-------1111------------------ IKARTALPVAVGFGVSGKATAAQAAVADGVVVGSALVRALEEGRSLAPLLQEIRQGLQRL -1111------------------3333---------------------------1111-- PLP --- >PROBABLE METHYLISOCITRATE; SWP:Q56062; PDB:1UJQA; HSPGQAFRAALAKENPLQIVGAINANHALLAQRAGYQAIYLSGGGVAAGSLGLPDLGIST -----------------------------------------------------------3 LDDVLTDIRRITDVCPLPLLVDADIGFGSSAFNVARTVKSIAKAGAAALHIEDQVAIVSK 333--------------------------3333--------------------------- EEMVDRIRAAVDARTDPNFVIMARTDALAVEGLEAALDRAQAYVDAGADMLFPEAITELS ----------3333-3333------3333--------------1111----------333 MYRRFADVAQVPILANITEFGATPLFTTDELRSAHVAMALYPLSAFRAMNRAAEKVYTVL 3------------------------------1111------------------------- RQEGTQKNVIDIMQTRNELYESINYYQFEEKL -----11111111------------------- >HYPOTHETICAL PROTEIN AK01; SWP:Q9CZW6; PDB:1UJRA; PSSGSSGFLDKPTLLSPEELKAASRGNGEYAWYYEGRNGWWQYDERTSRELEDAFSKGKK --------------------------------------------3333------------ NTEMLIAGFLYVADLENMVQYRRNEHGRRRKIKRDIIDIPKKGVSGPSSG --------------1111---3333------------------------- >ACTIN-BINDING LIM PROTEIN; SWP:O94929; PDB:1UJSA; GSSGSSGNAVNWGMREYKIYPYELLLVTTRGRNRLPKDVDRTRLERHLSQEEFYQVFGMT ---------------------------------------11113333-3333-------3 ISEFDRLALWKRNELKKQARLFSGPSSG 3331111--------1111--------- >KIAA1568 PROTEIN; SWP:Q9HCK4; PDB:1UJTA; GSSGSSGRQVQKELGDVLVRLHNPVVLTPTTVQVTWTVDRQPQFIQGYRVMYRQTSGLQA 
--------1111-3333---------------------------------------3333 TSSWQNLDAKVPTERSAVLVNLKKGVTYEIKVRPYFNEFQGMDSESKTVRTTEESGPSSG ------------------------------------------------------------ >SCRIBBLE; SWP:Q14160; PDB:1UJUA; GSSGSSGPGLRELCIQKAPGERLGISIRGGARGHAGNPRDPTDEGIFISKVSPTGAAGRD ---------------------------------------------------1111-3333 GRLRVGLRLLEVNQQSLLGLTHGEAVQLLRSVGDTLTVLVCDGFESGPSSG -----------%%%%----------3333---------------------- >MEMBRANE ASSOCIATED GUANY; SWP:Q86UL8; PDB:1UJVA; GSSGSSGQAELMTLTIVKGAQGFGFTIADSPTGQRVKQILDIQGCPGLCEGDLIVEINQQ -----------------------------1111-------11112222--------%%%% NVQNLSHTEVVDILKDCPIGSETSLIIHRGSGPSSG -1111------------------------------- >RHO GUANINE NUCLEOTIDE EX; SWP:Q15052; PDB:1UJYA; GSSGSSGSHQLIVKARFNFKQTNEDELSVCKGDIIYVTRVEEGGWWEGTLNGRTGWFPSN ----------------------3333-----------------------2222----333 YVREIKSSERSGPSSG 3-------1111---- >POLY [ADP-RIBOSE] POLYMER; SWP:P09874; PDB:1UK0A; KSKLPKPVQDLIKMIFDVESMKKAMVEYEIDLQKMPLGKLSKRQIQAAYSILSEVQQAVS ----------------3333----------3333-------------------------- QGSSDSQILDLSNRFYTLIPHDFGMKKPPLLNNADSVQAKVEMLDNLLDIEVAYSLLRGG -------1111-----------------------3333---------------------- SDDSSKDPIDVNYEKLKTDIKVVDRDSEEAEIIRKYVKNTHATTHNAYDLEVIDIFKIER ------------3333-------------------------------------------2 EGECQRYKPFKQLHNRRLLWHGSRTTNFAGILSQGLRIAPPEAPVTGYMFGKGIYFADMV 222-33333333------------3333---------------3333------------- SKSANYCHTSQGDPIGLILLGEVALGNMYELKHASHISKLPKGKHSVKGLGKTTPDPSAN 3333------------------------------------2222---------------- ISLDGVDVPLGTGISSGVNDTSLLYNEYIVYDIAQVNLKYLLKLKFNFKT --%%%%-------------------------3333--------------- >BAG-FAMILY MOLECULAR CHAP; SWP:Q9JLV1; PDB:1UK5A; GSSGSSGAPAEPAAPKSGEAETPPKHPGVLKVEAILEKVQGLEQAVDSFEGKKTDKKYLM -------------------------3333------------------------------- IEEYLTKELLALDSVDPEGRADVRQARRDGVRKVQTILEKLEQKASGPSSG -------------------3333---------------------------- >2-hydroxy-6-oxo-7-methylo; SWP:P96965; PDB:1UK8A; 
NLEIGKSILAAGVLTNYHDVGEGQPVILIHGSGPGVSAYANWRLTIPALSKFYRVIAPDM 3333-----iiii-------------------22223333-11113333----------2 VGFGFTDRPENYNYSKDSWVDHIIGIMDALEIEKAHIVGNAFGGGLAIATALRYSERVDR 222-----2222---------------1111----------------------1111--- MVLMGAAGTRFDVTEGLNAVWGYTPSIENMRNLLDIFAYDRSLVTDELARLRYEASIQPG -------------------1111----------------3333----------3333222 FQESFSSMFPEPRQRWIDALASSDEDIKTLPNETLIIHGREDQVVPLSSSLRLGELIDRA 2---1111---3333-------33331111--------1111---3333----------- QLHVFGRCGHWTQIEQTDRFNRLVVEFFNEA ----------3333-------------3333 >ESTA; SWP:Q6ED33; PDB:1UKCA; SHNAQPVINLGYARYQGVRLEAGVDEFLGMRYASPPIGDLRFRAPQDPPANQTLQSATEY 1111---------------3333-------------!!!!-------------------- GPICIGLDEEESPGDISEDCLFINVFKPSTATSQSKLPVWLFIQGGGYAENSNANYNGTQ -----2222--2222------------11111111----------%%%%----------- VIQASDDVIVFVTFNYRVGALGFLASEKVRQNGDLNAGLLDQRKALRWVKQYIEQFGGDP -------------------1111----------------------------3333---11 DHIVIHGVSAGAGSVAYHLSAYGGKDEGLFIGAIVESSFWPTQRTVSEMEFQFERFVNDT 11------------------iiii--------------------3333---------111 GCSSARDSLECLREQDIATIQKGNTGSPFPGGSSSPLPDWYFLPVTDGSLVPDELYNAFD 11111------1111----3333-----2222---------------------------- AGNFIKVPVLVGDDTDEGSNFAYNASSSADVSRFFKNNYPNLTSQQLNEINQVYPRGKLL ---------------1111-------------------1111-----------------2 PRHAAYFGASSAAYGDATFTCPGNHVASSAARYLPNSVWNYRVNIIDESNIAGGIGVPHT 2221111--------------------------1111-------------1111---222 FELPAIFGAGSTGTLSSDSSYLTYNAAIIPVTMHYFISFVQTLNPNTYRYATAPEWNTWG 23333--2222----11111111-3333---------------3333--1111-----!! 
NGQRLRLQTNDTAMEAVPESSLQDCAFWKSLTVPMEV !!-----2222---------------------1111- >AVIRULENCE PROTEIN AVRPPH; SWP:Q52430; PDB:1UKFA; SLSDFSVASRDVNHNNICAGLSTEWLVMSSDGDAESRMDHLDYNGEGQSRGSERHQVYND 3333--------1111-----------1111----------1111--------------- ALRAALSNDDEAPFFTASTAVIEDAGFSLRREPKTVHASGGSAQLGQTVAHDVAQSGRKH -----1111-------------1111----------------------------2222-- LLSLRFANVQGHAIACSCEGSQFKLFDPNLGEFQSSRSAAPQLIKGLIDHYNSLNYDVAC ------------------!!!!----3333-----3333------------1111----- VNEFRVSV -------- >LECTIN; SWP:Q8GSD2; PDB:1UKGA; DSLSFGFPTFPSDQKNLIFQGDAQIKNNAVQLTKTDSNGNPVASTVGRILFSAQVHLWEK -------------------------%%%%------1111--------------------1 SSSRVANFQSQFSFSLKSPLSNGADGIAFFIAPPDTTIPSGSGGGLLGLFAPGTAQNTSA 111-----------------------------1111------!!!!----3333--1111 NQVIAVEFDTFYAQDSNTWDPNYPHIGIDVNSIRSVKTVKWDRRDGQSLNVLVTFNPSTR ------------33331111-----------------------2222------------- NLDVVATYSDGTRYEVSYEVDVRSVLPEWVRVGFSAASGEQYQTHTLESWSFTSTLLYTA -------1111---------3333------------------------------------ >MITOGEN-ACTIVATED PROTEIN; SWP:P45983; PDB:1UKHA; NFYSVEIGDSTFTVLKRYQNLKPIGSGAQGIVCAAYDAILERNVAIKKLSRPFQNQTHAK ------!!!!----------------3333------------------------3333-- RAYRELVLMKCVNHKNIIGLLNVFTPQKSLEEFQDVYIVMELMDANLCQVIQMELDHERM ----------------------------3333----------------3333---3333- SYLLYQMLCGIKHLHSAGIIHRDLKPSNIVVKSDCTLKILDFGLYYRAPEVILGMGYKEN --------------1111------1111---1111------------3333------111 VDIWSVGCIMGEMIKGGVLFPGTDHIDQWNKVIEQLGTPCPEFMKKLQPTVRTYVENRPK 1----------------------3333---------------------------1111-- YAGYSFEKLFPDVLFPNKLKASQARDLLSKMLVIDASKRISVDEALQHPYINVWYDPSEA ----3333--1111--------------------1111------------3333-3333- EAPPPKIEEWKELIYKEVMDL --------------------- >OSMOTICALLY INDUCIBLE PRO; SWP:P84124; PDB:1UKKA; PVRKAKAVWEGGLRQGKGVELQSQAFQGPYSYPSRFEEGEGTNPEELIAAAHAGFSALAA --------------------1111------3333-------------------------- SLEREGFPPKRVSTEARVHLEVVDGKPTLTRIELLTEAEVPGISSEKFLEIAEAAKEGCP 
--1111---------------------------------2222----------3333--- VSRALAGVKEVVLTARLV ----3333---------- >Sterol regulatory element; SWP:Q12772; PDB:1UKLC; RSSINDKIIELKDLVGTDAKHKSGVLRKAIDYIKYLQQVNHKLRQENVLKLANQKNKL ---------------2222---------------------------------3333-- >periplasmic divalent cati; SWP:O58720; PDB:1UKUA; MIIVYTTFPDWESAEKVVKTLLKERLIACANLREHRAFYWWEGKIEEDKEVGAILKTRED ----------------------------------------iiii-------------333 LWEELKERIKELHPYDVPAIIRIDVDDVNEDYLKWLIEETKK 3-----------------------------------1111-- >ACYL-COA DEHYDROGENASE; SWP:Q72JJ3; PDB:1UKWA; IDFSLTEEQRQLQALARRFAKEVILPVAQEYDEKEEVPWPVIEKLHEVGLLNAIIPEEYG -----------------------3333----------3333----1111--11113333- GMGLKMLDEVIVGEELAYACMGIYTIPMASDLGITPVLLAGTEEQKERFLRPLTEKPALA ----------------33333333-------------------------3333------- AFALSEPGNGSDAAALKTRAIRQGDHYVLNGTKMWISNGGEAEWVVVFATVNPELRHKGV -----1111--1111-------!!!!-------------------------33333333- VALVVERGTPGFKAIKIHGKMGQRASGTYELVFEDVKVPVENRLGEEGEGFKIAMQTLNK -----1111-------------3333------------3333------------------ TRIPVAAGSVGVARRALDEARKYAKEREAFGEPIANFQAIQFKLVDMLIGIETARMYTYY ----------------------------%%%%---------------------------- AAWLADQGLPHAHASAIAKAYASEIAFEAANQAIQIHGGYGYVREFPVEKLLRDVKLNQI -------------------------------------3333-33333333----3333-- YEGTNEIQRLIIARHILAA ------------------- >GCN2 EIF2ALPHA KINASE; SWP:Q9QZ05; PDB:1UKXA; GSSGSSGMESYSQRQDHELQALEAIYGSDFQDLRPDARGRVREPPEINLVLYPQGLAGEE ------------------------------------------------------------ VYVQVELRVKCPPTYPDVVPEIDLKNAKGLSNESVNLLKSHLEELAKKQCGEVMIFELAH ----------------------------------------------1111---3333--- HVQSFLSEHNKSGPSSG ----------------- >URIDYLATE KINASE; SWP:P15700; PDB:1UKZ; PAFSPDQVSVIFVLGGPGAGKGTQCEKLVKDYSFVHLSAGDLLRAEQGRAGSQYGELIKN ---3333--------2222-----------------------------2222-3333--- CIKEGQIVPQEITLALLRNAISDNVKANKHKFLIDGFPRKMDQAISFERDIVESKFILFF --------3333------------------------------------------------ 
DCPEDIMLERLLERGKTSGRSDDNIESIKKRFNTFKETSMPVIEYFETKSKVVRVRCDRS -------------------1111-----------------------1111---------- VEDVYKDVQDAIRDSL ---------------- >Flap endonuclease 1; SWP:P39748; PDB:1UL1X; GIQGLAKLIADVAPSAIRENDIKSYFGRKVAIDASMSIYQFLIAVTTSHLMGMFYRTIRM -2222--------3333---33332222-----------3333---3333---------- MENGIKPVYVFDGKPPQLKSGELAKRLVKVTKQHNDECKHLLSLMGIPYLDAPSEAEASC ------------------------------------------------------------ AALVKAGKVYAAATEDMDCLTFGSPVLMRHLTASEKKLPIQEFHLSRILQELGLNQEQFV ---------------11111111--------------------------3333---3333 DLCILLGSDYCESIRGIGPKRAVDLIQKHKSIEEIVRRLDPNKYPVPENWLHKEAHQLFL -------------------3333--------3333---------------3333------ EPEVLDPESVELKWSEPNEEELIKFMCGEKQFSEERIRSGVKRLSKSRQGSTQGRLDDFF -----3333-----------------------3333------------------3333-- KVTGSLSSAKRKE ------------- >NITROGEN REGULATORY PROTE; SWP:Q55247; PDB:1UL3A; MKKVEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFEFLQKLKIEIVVDEGQVDMVVDKL --------1111------------------------------------1111-------- VSAARTGEIGDGKIFISPVDSVVRIRTGEKDTEAI -------2222-------------1111------- >SQUAMOSA PROMOTER BINDING; SWP:Q9S7A9; PDB:1UL4A; LRLCQVDRCTADMKEAKLYHRRHKVCEVHAKASSVFLSGLNQRFCQQCSRFHDLQEFDEA ----------------33331111-3333----------------1111---3333---- KRSCRRRLAGHNERRRKSSGE --------------------- >SQUAMOSA PROMOTER BINDING; SWP:Q8S9G8; PDB:1UL5A; VARCQVPDCEADISELKGYHKRHRVCLRCATASFVVLDGENKRYCQQCGKFHLLPDFDEG -----3333----------1111-----------------------------3333---- KRSCRRKLERHNNRRKRKPVDKGGVA -------------------------- >MAP/MICROTUBULE AFFINITY-; SWP:Q03141; PDB:1UL7A; GSSGSSGRFTWSMKTTSSMDPSDMMREIRKVLGANNCDYEQRERFLLFCVHGDGHAENLV -------------------3333--------3333-------------------3333-- QWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIASKIANELKL -------------------------------3333------- >GALECTIN-2; SWP:Q9P4R8; PDB:1ULDA; MLYHLFVNNQVKLQNDFKPESVAAIRSSAFNSKGGTTVFNFLSAGENILLHISIRPGENV -----------------2222---------1111--------1111---------1111- 
IVFNSRLKNGAWGPEERIPYAEKFRPPNPSITVIDHGDRFQIRFDYGTSIYYNKRIKENA ------1111-------------------------------------------------- AAIAYNAENSLFSSPVTVDVHGLLPPLPPA ------------------------------ >BIPHENYL DIOXYGENASE LARG; SWP:Q53122; PDB:1ULIA; WADADIAELVDERTGRLDPRIYTDEALYEQELERIFGRSWLLMGHETQIPKAGDFMTNYM ------3333---------1111-----------3333------1111--2222-----! GEDPVMVVRQKNGEIRVFLNQCRHRGMRICRADGGNAKSFTCSYHGWAYDTGGNLVSVPF !!!------1111------------------------------------1111------3 EEQAFPGLRKEDWGPLQARVETYKGLIFANWDADAPDLDTYLGEAKFYMDHMLDRTEAGT 333-11113333-------------------1111--------------------1111- EAIPGIQKWVIPCNWKFAAEQFCSDMYHAGTTSHLSGILAGLPTEGIQYRATWGGHGSGF -------------------------3333---------1111------------------ YIGDPNLLLAIMGPKVTEYWTQGPAAEKASERLGSTERGQQLMAQHMTIFPTCSFLPGIN -----------------------------------3333--------------------- TIRAWHPRGPNEIEVWAFTVVDADAPEEMKEEYRQQTLRTFSAGGVFEQDDGENWVEIQQ --------1111---------1111----------------1111--------------- VLRGHKARSRPFNAEMGLGQTDSDNPDYPGTISYVYSEEAARGLYTQWVRMMTSPDWAAL ----3333------2222------1111----------------------1111-----3 DATR 333- >Biphenyl dioxygenase; SWP:Q53123; PDB:1ULIB; FRTKPAPVDPSLQHEIEQFYYWEAKLLNDRRFQEWFDLLAEDIHYFMPIRTTRIMRETAQ --------3333-----------------------33331111----------3333111 EYSGAREYAHFDDNAQMMRGRLRKITSDVSWSENPASRTRHVISNVMIVDGEKPGEYHVS 1-------------------------1111----------------------2222---- SVFIVYRNRLERQLDIFAGERKDILRRTGSEAGFELAKRTILIDQSTILSNNLSFFF -----------------------------3333------------------------ >LECTIN-C; SWP:NA; PDB:1ULKA; APVCGVRASGRVCPDGYCCSQWGYCGTTEEYCGKGCQSQCDYNRCGKEFGGKECHDELCC ----1111-----%%%%--1111----3333-2222--1111---1111-----%%%%-- SQYGWCGNSDGHCGEGCQSQCSYWRCGKDFGGRLCTEDMCCSQYGWCGLTDDHCEDGCQS 1111----3333-2222----1111-1111-----%%%%--1111----3333-2222-- QCDLPT ------ >PUTATIVE ACETYL-COA ACETY; SWP:Q5SJM1; PDB:1ULQA; EAWIVEAVRTPIGKHGGALASVRPDDLLAHALSVLVDRSGVPKEEVEDVYAGCANQAGED -------------2222-1111-------------------3333------------111 
NRNVARMALLLAGFPVEVAGCTVNRLCGSGLEAVAQAARAIWAGEGKVYIGSGVESMSRA 1-------------3333-------1111--------------------------3333- PYAVPKPERGFPTGNLVMYDTTLGWRFVNPKMQALYGTESMGETAENLAEMYGIRREEQD -------------------3333--------------------------1111------- RFALLSHQKAVRAWEEGRFQDEVVPVPVKRGKEEILVEQDEGPRRDTSLEKLAALRPVFR ----------------1111-----------------------111133331111----2 EGGTVTAGNSSPLNDGAAAVLLVSDDYAKAHGLRPLARVRAIAVAGVPPRIMGIGPVPAT 222--3333-------------------1111---------------33331111----- RKALERAGLSFSDLGLIELNEAFAAQALAVLREWSLSMEDQRLNPNGGAIALGHPLGASG ----1111-3333-----------------------1111---11113333----1111- ARILTTLVHEMRRRKVQFGLATMCIGVGQGIAVVVEGM -------------------------------------- >PUTATIVE ACYLPHOSPHATASE; SWP:Q72L64; PDB:1ULRA; PRLVALVKGRVQGVGYRAFAQKKALELGLSGYAENLPDGRVEVVAEGPKEALELFLHHLK ------------------------1111-------1111--------------------- QGPRLARVEAVEVQWGEEAGLKGFHVY --1111--------------------- >putative 3-oxoacyl-acyl c; SWP:Q5SK98; PDB:1ULSA; RLKDKAVLITGAAHGIGRATLELFAKEGARLVACDIEEGPLREAAEAVGAHPVVDVADPA -2222-----3333----------------------------------------1111-- SVERGFAEALAHLGRLDGVVHYAGITRDNFHWKPLEDWELVLRVNLTGSFLVAKAASEAR -----------------------------33333333----------------------- EKNPGSIVLTASRVYLGNLGQANYAASAGVVGLTRTLALELGRWGIRVNTLAPGFIETRT -----------3333--2222-------------------3333---------------3 AKVPEKVREKAIAATPLGRAGKPLEVAYAALFLLSDESSFITGQVLFVDGGRTIGA 3333333-------1111---3333---------3333----------iiii---- >LONG CHAIN FATTY ACID-COA; SWP:Q6L8F0; PDB:1ULTA; AFPSTMMDEELNLWDFLERAAALFGRKEVVSRLHTGEVHRTTYAEVYQRARRLMGGLRAL -----------3333--------1111-----1111---------------------111 GVGVGDRVATLGFNHFRHLEAYFAVPGMGAVLHTANPRLSPKEIAYILNHAEDKVLLFDP 1-2222-----------------------------1111---------1111------11 NLLPLVEAIRGELKTVQHFVVMDEKAPEGYLAYEEALGEEADPVRVPERAACGMAYTTGT 11------3333--------------2222-3333-----------1111---------- TGLPKGVVYSHRALVLHSLAASLVDGTALSEKDVVLPVVPMFHVNAWCLPYAATLVGAKQ ----------------------1111---1111------1111%%%%------------- 
VLPGPRLDPASLVELFDGEGVTFTAGVPTVWLALADYLESTGHRLKTLRRLVVGGSAAPR ----------------1111------------------------1111----------33 SLIARFERMGVEVRQGYGLTETSPVVVQNFVKSHLESLSEEEKLTLKAKTGLPIPLVRLR 33----1111---------------------1111------------------2222--- VADEEGRPVPKDGKALGEVQLKGPWITGGYYGNEEATRSALTPDGFFRTGDIAVWDEEGY --1111----------------1111---------------1111----------1111- VEIKDRLKDLIKSGGEWISSVDLENALMGHPKVKEAAVVAIPHPKWQERPLAVVVPRGEK -----------------------------1111--------------------------- PTPEELNEHLLKAGFAKWQLPDAYVFAEEIPRTSAGKFLKRALREQYKNYYGG ---------1111--3333-------------1111--------11111111- >ENOYL-ACYL CARRIER PROTEI; SWP:Q5SLI9; PDB:1ULUA; LTVDLSGKKALVGVTNQRSLGFAIAAKLKEAGAEVALSYQAERLRPEAEKLAEALGGALL ----2222--------------------------------1111--------1111---- FRADVTQDEELDALFAGVKEAFGGLDYLVHAIAFAPREAEGRYIDTRRQDWLLALEVSAY ---1111----------------------------3333--3333--------------- SLVAVARRAEPLLREGGGIVTLTYYASEKVVPKYNVAIAKAALEASVRYLAYELGPKGVR ---------11112222-------3333--2222--------------------1111-- VNAISAGPVFTKYDRVAQTAPLRRNITQEEVGNLGLFLLSPLASGITGEVVYVDAGYHIG -------------------1111---3333---------3333------------3333- >GLUCODEXTRANASE; SWP:Q9LBQ9; PDB:1ULVA; TAEPPGSPGAAATWTKGDKEGVGTSLNPASKVWYTLTEGTMSEVYYPHADTPNTRELQFA --------------------------3333------iiii-------1111--------- VSDGTSAQRESEQTTRTVELADPKALSYRQTTTDNAGRWRLTKTYVTDPRRSTVMLGVTF --------1111---------------------1111----------1111--------- EVLDGGDYQLFVLSDPSLAGTSGGDTGSVTDGALLASDLADAATPVATALVSSVGFGAVA -----------------%%%%--------%%%%-------1111---------------- NGYVGTSDGWTDLAADGRLDNASATAGPGNISQTGQIPLAAGGKTEFSLALGFGADTAEA --2222------------------------------------------------------ LATAKASLGTGYKKVSKSYTGEWKKYLNSLDAPATSLTGALRTQYDVSLMTVKSHEDKTF --------------------------1111---1111---------------1111---2 PGAFIASLTIPWGQAASAETHREGYHAVWARDMYQSVTALLAAGDEEAAARGVEWLFTYQ 222--------3333--------------------------------------------- QQPDGHFPQTSRVDGTIGQNGIQLDETAFPILLANQIGRTDAGFYRNELKPAADYLVAAG 
-1111------1111-------3333---------------------------------- PKTPQERWEETGGYSTSTLASQIAALAAAADIAGKNGDAGSAAVYRATADEWQRSTEKWM -----1111------------------------1111-------------------3333 FTTNGPVGDGKYYLRISATGNPNDGATRDWGNGAGVHPENAVLDGGFLEFVRLGVKAPAD ------!!!!----------1111------iiii---3333---------1111--1111 PYVADSLAETDASISQETPGGRMWHRYTYDGYGEKADGSPWDGTGIGRLWPLLSGERGEY -----------------1111-----2222----1111---------------------- ALANGQDALPYLETMHSAANAGYMIPEQVWDRDEPTSYGHELGRSTGSASPLSWAMAQYV -1111--3333----11111111-----------------2222---------------- RLAAGVKAGAPVETPQNVAARYAAGTPLSSPELSVTAPEALSTADSATAVVRGTTNAAKV ----------1111---------------------------------------------- YVSVNGTATEAPVTDGTFSLDVALTGAKNKVTVAAVAADGGTAVEDRTVLYYGSRIGALS ---iiii------iiii-------------------1111-------------------- DPAGDDNGPGTYRYPTNSAYVPGAFDLTGVDVYDAGDDYAFVATIAGEVTNPWGGQAISH ------!!!!------33332222----------!!!!---------------------- QRVNIYLGKGEGGATPGLPGTNINLEHAWDSVIVTDGRFDGAGVYAPDGTRTSAVSLLAV -------------------------------------%%%%----1111----------- PEARQIVTRVPKAALGGLDPATARMSVAMFGNAESGEGIGNVRPVYDGAYWEAGDPAWIK 1111------3333----3333-----------33332222---------3333-3333- EWRFGGGAGVFDGTIPSRDTDTDDPNALDVLVGEGQTQAAVLDWRAGSPVVVPMLGLQP --------------3333------------------3333--3333------------- >HYPOTHETICAL PROTEIN PH19; SWP:O59595; PDB:1ULYA; AKKVKVITDPEVIKVLEDTRRKILKLLRNKETISQLSEILGKTPQTIYHHIEKLKEAGLV -------------------------3333-------------------------1111-- EVKRTEKGNLVEKYYGRTADVFYINLYLGDEELRYIARSRLKTKIDIFKRLGYQFEENEL ------!!!!--------------------------------------1111-------- LNIDRSQKEFDATVRISKYIEEKEDALKDFSNEDIIHAIEWLSTAELARDEEYLELLKRL ----------------------33331111------------------------------ GSILK ----- >PYRUVATE CARBOXYLASE N-TE; SWP:O67483; PDB:1ULZA; MVNKVLVANRGEIAVRIIRACKELGIPTVAIYNEVESTARHVKLADEAYMIGTDPLDTYL ---------------------1111-------3333-----------------3333--- NKQRIINLALEVGADAIHPGYGFLAENAEFAKMCEEAGITFIGPHWKVIELMGDKARSKE 
---------------------!!!!---------1111---------------------- VMKKAGVPVVPGSDGVLKSLEEAKALAREIGYPVLLKATAGGGGRGIRICRNEEELVKNY --1111------------------------------------------------------ EQASREAEKAFGRGDLLLEKFIENPKHIEYQVLGDKHGNVIHLGERDCSIQRRNQKLVEI -------1111----------------------------------------%%%%----- APSLILTPEKREYYGNIVTKAAKEIGYYNAGTMEFIADQEGNLYFIEMNTRIQVEHPVSE ----------------------1111-----------1111-----------1111---- MVTGIDIVKWQIKIAAGEPLTIKQEDVKFNGYAIECRINAEDPKKNFAPSTRVIERYYVP -----3333-------------3333------------------%%%%------------ GGFGIRVEHAAARGFEVTPYYDSMIAKLITWAPTWDEAVERMRAALETYEITGVKTTIPL -2222------2222------------------------------1111----------- LINIMKEKDFKAGKFTTKYLEEHPEVFEYEE ---------------11111111-1111--- >CHORISMATE SYNTHASE; SWP:P56122; PDB:1UM0A; MNTLGRFLRLTTFGESHGDVIGGVLDGMPSGIKIDYALLENEMKRRQGGRNVFITPRKED ------------------------------------------------------------ DKVEITSGVFEDFSTGTPIGFLIHNQRARSKDYDNIKNLFRPSHADFTYFHKYGIRDFRG ---------%%%%---------------------------2222---------------% GGRSSARESAIRVAAGAFAKMLLREIGIVCESGIIEIGGIKAKNYDFNHALKSEIFALDE %%%---------------------------------iiii------3333--------33 EQEEAQKTAIQNAIKNHDSIGGVALIRARSIKTNQKLPIGLGQGLYAKLDAKIAEAMMGL 33-----------------------------2222------------------------2 NGVKAVEIGKGVESSLLKGSEYNDLMDQKGFLSNRSGGVLGGMSNGEEIIVRVHFKPTPS 222----!!!!3333--3333-----3333---3333--iiii----------------- IFQPQRTIDINGNECECLLKGRHDPCIAIRGSVVCESLLALVLADMVLLNLTSKIEYLKT --------1111-------------------------------------11113333--- IYNEN ----- >KIAA1849 PROTEIN; SWP:Q96JH8; PDB:1UM1A; GSSGSSGYVFTVELERGPSGLGMGLIDGMHTHLGAPGLYIQTLLPGSPAAADGRLSLGDR ----------------1111------33333333------------3333-----2222- ILEVNGSSLLGLGYLRAVDLIRHGGKKMRFLVAKSDVETAKKIHSGPSSG ---%%%%------------------------------------------- >ANTIBODY 21H3 L CHAIN; SWP:NA; PDB:1UM5H; VQLQQSGPVLVKPGGSVKMSCKASEYTLTSYLFQWVKQKSGQGLEWIGYIYPYNGGTRYN -----------2222-----------------------2222-----------------3 
EKFRGKATLTSDKSSNTAYLELSSLTSEDSAVYYCARSSMSDPGANWGPGTLVTVSSAST 333---------1111---------3333---------2222------------------ KGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLY ----------1111-----------------------%%%%--2222-------1111-- SLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP --------3333------------1111--------- >ANTIBODY 21H3 L CHAIN; SWP:NA; PDB:1UM5L; LDIQMTQSPSSLSASLGERVSLTCRASQEISGYLYWLQQKPDGTIKRLIYAGSTLDSGVP --------------2222-----------iiii------1111------------11113 KRFSGSRSGSDYSLTISSLESEDFADYYCLQYASYPRTFGGGTKVEIKRTVAAPSVFIFP 333----!!!!--------1111------------------------------------- PSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTL -3333-------------------------%%%%-------------------------- TLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGECG ------1111--------1111--------3333-- >SYNAPSE-ASSOCIATED PROTEI; SWP:Q92796; PDB:1UM7A; GSSGSSGRPGGDAREPRKIILHKGSTGLGFNIVGGEDGEGIFVSFILAGGPADLSGELRR -----------------------------------%%%%-------------3333---- GDRILSVNGVNLRNATHEQAAAALKRAGQSVTIVAQYRPEEYSRFESSGPSSG -----------------------1111----------3333------------ >ATP-dependent Clp proteas; SWP:O25926; PDB:1UM8A; LLSYIPAPKELKAVLDNYVIGQEQAKKVFSVAVYNHYKRLSFKEKLKKQDNQDSNVELEH ---------------------------------------------------3333----- LEEVELSKSNILLIGPTGSGKTLMAQTLAKHLDIPIAISDATSLTVENILTRLLQASDWN -3333----------2222----------1111------3333--3333-------%%%% VQKAQKGIVFIDEIDKIGEGVQQALLKIVEGSLVNQIDTSDILFICAGAFDGLAEIIKKR 3333--------3333-3333------3333-------1111-------3333------- TTQNVLGFTQEKMSKKEQEAILHLVQTHDLVTYGLIPELIGRLPVLSTLDSISLEAMVDI -------------------------3333------33331111----------------- LQKPKNALIKQYQQLFKMDEVDLIFEEEAIKEIAQLALERKTGARGLRAIIEDFCLDIMF -------3333-------------------------------3333-------------- DLPKLKGSEVRITKDCVLKQAEPLIIA -1111---------------------- >2-OXO ACID DEHYDROGENASE ; SWP:P84129; PDB:1UM9A; HRFETFTEEPIRLIGEEGEWLGDFPLDLEGEKLRRLYRDMLAARMLDERYTILIRTGKTS -----------------------------------------------------1111--- 
FIAPAAGHEAAQVAIAHAIRPGFDWVFPYYRDHGLALALGIPLKELLGQMLATKADPNKG -------------------2222-----1111----3333-3333-------3333-%%% RQMPEHPGSKALNFFTVASPIASHVPPAAGAAISMKLLRTGQVAVCTFGDGATSEGDWYA %--------1111--------1111-----------------------3333-------- GINFAAVQGAPAVFIAENNFPTIADKAHAFGIPGYLVDGMDVLASYYVVKEAVERARRGE ---------------------33333333--------1111--------------1111- GPSLVELRVYRYGPHRKKDPIPRFRRFLEARGLWNEEWEEDVREEIRAELERGLKEAEEA ---------------1111-------------------------------------1111 GPVPPEWMFEDVFAEKPWHLLRQEALLKEEL ---3333-------------------3333- >2-OXO ACID DEHYDROGENASE ; SWP:P84129; PDB:1UMDA; HRFETFTEEPIRLIGEEGEWLGDFPLDLEGEKLRRLYRDMLAARMLDERYTILIRTGKTS --------------1111------------------------------------------ FIAPAAGHEAAQVAIAHAIRPGFDWVFPYYRDHGLALALGIPLKELLGQMLATKADPNKG -------------------2222-----1111----3333-3333-------3333-%%% RQMPEHPGSKALNFFTVASPIASHVPPAAGAAISMKLLRTGQVAVCTFGDGATSEGDWYA %--------1111------2222-------------------------3333-------- GINFAAVQGAPAVFIAENNFYAISVDYRHQTHSPTIADKAHAFGIPGYLVDGMDVLASYY -------------------------3333------33333333-------1111------ VVKEAVERARRGEGPSLVELRVYRYGPHSSADDDSRYRPKEEVAFWRKKDPIPRFRRFLE --------1111----------------1111------3333--3333------------ ARGLWNEEWEEDVREEIRAELERGLKEAEEAGPVPPEWMFEDVFAEKPWHLLRQEALLKE ----------------------------------3333-------------------333 EL 3- >2-oxoisovalerate dehydrog; SWP:P84130; PDB:1UMDB; ALMTMVQALNRALDEEMAKDPRVVVLGEDVGKRGGVFLVTEGLLQKYGPDRVMDTPLSEA -------------------3333-------3333---11113333--1111------333 AIVGAALGMAAHGLRPVAEIQFADYIFPGFDQLVSQVAKLRYRSGGQFTAPLVVRMPSGG 3--------------------33333333-------1111-1111--------------- GVRGGHHHSQSPEAHFVHTAGLKVVAVSTPYDAKGLLKAAIRDEDPVVFLEPKRLYRSVK -----------33333333--------------------------------3333----- EEVPEEDYTLPIGKAALRREGKDLTLICYGTVMPEVLQAAAELAKAGVSAEVLDLRTLMP ----------2222---------------1111----------1111------------- WDYEAVMNSVAKTGRVVLVSDAPRHASFVSEVAATIAEDLLDMLLAPPIRVTGFDTPYPY 
---------------------------------------3333----------------1 AQDKLYLPTVTRILNAAKRALDY 111-------------------- >385AA LONG CONSERVED HYPO; SWP:Q975V5; PDB:1UMGA; KTTISVIKADIGSLAGHHIVHPDTMAAANKVLASAKEQGIILDYYITHVGDDLQLIMTHT --------------------3333------------------------!!!!-------- RGELDTKVHETAWNAFKEAAKVAKDLGLYAAGQDLLSDSFSGNVRGLGPGVAEMEIEERA -----------------------1111--2222--1111--------------------- SEPIAIFMADKTEPGAYNLPLYKMFADPFNTPGLVIDPTMHGGFKFEVLDVYQGEAVMLS --------------------------11113333--3333-------------------- APQEIYDLLALIGTPARYVIRRVYRNEDNLLAAVVSIERLNLIYVGKDDPVMIVRLQHGL --------------------------------------------------------iiii PALGEALEAFAFPHLVPGWMRGSHYGPLMPVSQRDAKATRFDGPPRLLGLGFNVKNGRLV -------1111-------2222---------3333----%%%%-----------iiii-- GPTDLFDDPAFDETRRLANIVADYMRRHGPFMPHRLEPTEMEYTTLPLILEKLKDRFKK ---11113333----------------!!!!-----1111------------1111--- >F-BOX ONLY PROTEIN 2; SWP:Q80UW2; PDB:1UMHA; GSHFYFLSKRRRNLLRNPCGEEDLEGWSDVEHGGDGWKVEELPGDNGVEFTQDDSVKKYF -------1111----------!!!!-------!!!!------------------------ ASSFEWCRKAQVIDLQAEGYWEELLDTTQPAIVVKDWYSGRTDAGSLYELTVRLLSENED -------------3333--------------------------------------1111- VLAEFATGQVAVPEDGSWMEISHTFIDYGPGVRFVRFEHGGQDSVYWKGWFGARVTNSSV ------------1111-------------------------------------------- WVEP ---- >NADH-CYTOCHROME B5 REDUCT; SWP:P00387; PDB:1UMKA; TPAITLESPDIKYPLRLIDREIISHDTRRFRFALPSPQHILGLPVGQHIYLSARIDGNLV -------1111------------------------1111----2222-------iiii-- VRPYTPISSDDDKGFVDLVIKVYFKDTHPKFPAGGKMSQYLESMQIGDTIEFRGPSGLLV --------3333---------------1111---------11112222------------ YQGKGKFAIRPDKKSNPIIRTVKSVGMIAGGTGITPMLQVIRAIMKDPDDHTVCHLLFAN --iiii-----1111-------------------------------1111---------- QTEKDILLRPELEELRNKHSARFKLWYTLDRAPEAWDYGQGFVNEEMIRDHLPPPEEEPL -3333-------------3333-------------------------------3333--- VLMCGPPPMIQYACLPNLDHVGHPTERCFVF -----3333--------------3333---- >PHOTOSYNTHETIC APPARATUS ; SWP:Q53228; PDB:1UMQA; 
LAKGESLPPPPENPMSADRVRWEHIQRIYEMCDRNVSETARRLNMHRRTLQRILAKRSPR --!!!!---------1111--------------------------3333----1111--- >CONVULXIN ALPHA; SWP:O93426; PDB:1UMRA; GLHCPSDWYYYDQHCYRIFNEEMNWEDAEWFCTKQAKGAHLVSIKSAKEADFVAWMVTQN ----2222-----------------------11112222--------------------- IEESFSHVSIGLRVQNKEKQCSTKWSDGSSVSYDNLLDLYITKCSLLKKETGFRKWFVAS -1111-------------------1111--------3333-------1111--------1 CIGKIPFVCKFPPQC 111------------ >Convulxin beta [Precursor; SWP:O93427; PDB:1UMRC; GFCCPSHWSSYDRYCYKVFKQEMTWADAEKFCTQQHTGSHLVSFHSTEEVDFVVKMTHQS ----2222--!!!!-----------------11112222--------------------- LKSTFFWIGANNIWNKCNWQWSDGTKPEYKEWHEEFECLISRTFDNQWLSAPCSDTYSFV -------------1111---1111---------------------------1111----- CKFEA ----- >UMUD'; SWP:P04153; PDB:1UMUA; DYVEQRIDLNQLLIQHPSATYFVKASGDSIDGGISDGDLLIVDSAITASHGDIVIAAVDG -------3333----1111----------1111------------------------iii EFTVKKLQLRPTVQLIPNSAYSPITISSEDTLDVFGVVIHVVK i----------------3333-----1111------------- >SERINE/THREONINE-PROTEIN ; SWP:P53350; PDB:1UMWA; HLSDMLQQLHSVNASKPSERGLVRQEEAEDPACIPIFWVSKWVDYSDKYGLGYQLCDNSV ---------------1111------11111111-----------3333------1111-- GVLFNDSTRLILYNDGDSLQYIERDGTESYLTVSSHPNSLMKKITLLKYFRNYMSEHLLK ---1111-----1111------1111-----3333-3333-------------------- AGANITPREGDELARLPYLRTWFRTRSAIILHLSNGSVQINFFQDHTKLILCPLMAAVTY -1111-------------------1111----1111----------------1111---- IDEKRDFRTYRLSLLEEYGCCKELASRLRYARTMVDKLLSSRS -1111-------------------------------------- >BETAINE--HOMOCYSTEINE S-M; SWP:O09171; PDB:1UMYA; KRGILERLNAGEVVIGDGGFVFALEKRGYVKAGPWTPEAAVEHPEAVRQLHREFLRAGSN ------------------------------------3333-------------------- VMQTFTFYASSGQKVNEAACDIARQVADEGDALVAGGVSQTPSYLSCKSETEVKKIFHQQ ----------------------------------------3333----3333-------- LEVFMKKNVDFLIAEYFEHVEEAVWAVEALKTSGKPIAATMCIGPEGDLHGVSPGECAVR -----------------------------3333----------1111------------- LVKAGAAIVGVNCHFDPSTSLQTIKLMKEGLEAARLKAYLMSQPLAYHTPDCGKQGFIDL 
------------------------------------------------1111---33331 PEFPFGLEPRVATRWDIQKYAREAYNLGVRYIGGCCGFEPYHIRAIAEELAPERGFLPPA 111----1111-------------1111------22223333---------3333--333 SEKHGSWGSGLDMHTKPWIRARARKEYWQNLRIASGRPYNPSMSKPDAWGVTKGAAELMQ 3---------------3333---3333---------1111-------------------- QKEATTEQQLRALFE --------------- >XYLOGLUCAN ENDOTRANSGLYCO; SWP:Q8GZD5; PDB:1UMZA; PVDVAFGRNYVPTWAFDHIKYFNGGNEIQLHLDKYTGTGFQSKGSYLFGHFSMQMKLVPG ----3333------1111---%%%%-------3333------------------------ DSAGTVTAFYLSSQNSEHDEIDFEFLGNRTGQPYILQTNVFTGGKGDREQRIYLWFDPTK -2222-----------------------2222---------iiii-----------1111 EFHYYSVLWNMYMIVFLVDDVPIRVFKNCKDLGVKFPFNQPMKIYSSLWNADDWATRGGL -----------------!!!!-------3333-----------------------%%%%- EKTDWSKAPFIASYRSFHIDGCEASVEAKFCATQGARWWDQKEFQDLDAFQYRRLSWVRQ ---3333-----------------------1111--11113333---------------- KYTIYNYCTDRSRYPSMPPECKRDRDI -----3333--------33331111-- >ANGIOGENIN; SWP:P03950; PDB:1UN3A; NSRYTHFLTQHYDAKPQGRDDRYCESIMRRRGLTSPCKDINDFIHGNKRSIKAICENKNG ----------------------------1111--------------3333-3333----- NPHRENLRISKSSFQVTTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQ ---------------------------------------------iiii------ >N-ACETYLGLUCOSAMINE-6-PHO; SWP:O34450; PDB:1UN7A; ESLLIKDIAIVTENEVIKNGYVGINDGKISTVSTERPKEPYSKEIQAPADSVLLPGMIDI ------------------------iiii-------------------------------- HIHGGYGADTMDASFSTLDIMSSRLPEEGTTSFLATTITQEHGNISQALVNAREWKAAEE ----iiii3333-3333------3333-------------3333-----------33331 SSLLGAELLGIHLEGPFVSPKRAGAQPKEWIRPSDVELFKKWQQEAGGLIKIVTLAPEED 111---------------1111!!!!1111-------------1111--------11112 QHFELIRHLKDESIIASMGHTDADSALLSDAAKAGASHMTHLYNAMSPFHHREPGVIGTA 222------1111---------------------------2222-----1111------- LAHDGFVTELIADGIHSHPLAAKLAFLAKGSSKLILITDSMRAKGLKDGVYEFGGQSVTV --1111-----------------------1111-------1111--------iiii---- RGRTALLSDGTLAGSILKMNEGARHMREFTNCSWTDIANITSENAAKQLGIFDRKGSVTV !!!!--1111-------3333-------------------------1111--------22 
GKDADLVIVSSDCEVILTICRGNIAFISKEAD 22-------1111------iiii--------- >DIHYDROXYACETONE KINASE; SWP:P45510; PDB:1UN8A; MSQFFFNQRTHLVSDVIDGAIIASPWNNLARLESDPAIRIVVRRDLNKNNVAVISGGGSG -------3333------------1111-------1111--------3333---------- HEPAHVGFIGKGMLTAAVCGDVFASPSVDAVLTAIQAVTGEAGCLLIVKNYTGDRLNFGL ----1111-2222-------2222---------------3333----------------- AAEKARRLGYNVEMLIVGDDISLPDNKHPRGIAGTILVHKIAGYFAERGYNLATVLREAQ -----1111-------------1111-----3333------------------------- YAASNTFSLGVALSSCHLPQETDAAPRHHPGHAELGMGIHGEPGASVIDTQNSAQVVNLM --1111----------------------2222-----1111------------------- VDKLLAALPETGRLAVMINNLGGVSVAEMAIITRELASSPLHSRIDWLIGPASLVTALDM --------------------------------------1111--------------!!!! KGFSLTAIVLEESIEKALLTEVETSNWPTPVPPREITCVVSSHASARVEFQPSANALVAG ----------!!!!------------------------------------------3333 IVELVTATLSDLETHLNALDAKVGDGDTGSTFAAAAREIASLLHRQQLPLNNLATLFALI ------------------3333--------------------1111--1111-------- GERLTVVMGGSSGVLMSIFFTAAGQKLEQGANVVEALNTGLAQMKFYGGADEGDRTMIDA -------------------------------------------------------3333- LQPALTSLLAQPKNLQAAFDAAQAGAERTCLSSKANAESLLGNMDPGAQRLAMVFKALAE ----------1111-------------------------1111--------------111 SE 1- >GA UNASSEMBLED COAT PROTE; SWP:P07234; PDB:1UNAA; ATLHSFVLVDNGGTGNVTVVPVSNANGVAEWLSNNSRSQAYRVTASYRASGADKRKYTIK ------------------------------------1111-------------------- LEVPKIVELPVSAWKAYASIDLTIPIFAATDDVTVISKSLTGLFKVGNPIAEAISSQSGF ---------------------------1111-------------2222------------ YA -- >VILLIN 1; SWP:P09327; PDB:1UNCA; LSIEDFTQAFGMTPAAFSALPRWKQQNLKKEKGLF --------------------3333-----1111-- >ADVILLIN; SWP:O75366; PDB:1UNDA; YLSEQDFVSVFGITRGQFAALPGWKQLQMKKEKGLF --3333------------------------------ >Iron-superoxide dismutase; SWP:Q9M7R2; PDB:1UNFX; KVNAKFELKPPPYPLNGLEPVMSQQTLEFHWGKHHRTYVENLKKQVTELDGKSLEEIIVT -------------1111-----------------------------1111---------- AYNKGDILPAFNNAAQVWNHDFFWECMKPGGGGKPSGELLELIERDFGSFEKFLDEFKAA 
-%%%%--1111------------1111--------------------------------- AATQFGSGWAWLAYKASKLDADEDNKLVVIKSPNAVNPLVWGGYYPLLTIDVWEHAYYLD ---------------3333-------------!!!!3333------------3333---- FQNRRPDYISVFMDKLVSWDAVSSRLEQAKALSA !!!!--------------------------1111 >COLICIN E7; SWP:Q03708; PDB:1UNKA; MELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDN -----1111-------------------------------------33333333---111 RDDSPEGIVKEIKEWRAANGKPGFKQG 1---------------1111------- >CYCLIN-DEPENDENT KINASE 5; SWP:Q00535; PDB:1UNLA; MQKYEKLEKIGEGTYGTVFKAKNRETHEIVALKRVRLDDDDEGVPSSALREICLLKELKH ----------------------------------------1111---------3333--1 KNIVRLHDVLHSDKKLTLVFEFCDQDLKKYFDSCNGDLDPEIVKSFLFQLLKGLGFCHSR 111------------------------------%%%%----------------------- NVLHRDLKPQNLLINRNGELKLANFGLARAFGIPVRCYSAEVVTLWYRPPDVLFGAKLYS -------3333---3333------1111---------------3333----1111----- TSIDMWSAGCIFAELANAGRPLFPGNDVDDQLKRIFRLLGTPTEEQWPSMTKLPDYKPYP ------------------------------------------33331111--1111---- MYPATTSLVNVVPKLNATGRDLLQNLLKCNPVQRISAEEALQHPYFSDFCPP -------11111111--------------3333--333311111111----- >Cyclin-dependent kinase 5; SWP:Q15078; PDB:1UNLD; QASTSELLRCLGEFLCRRCYRLKHLSPTDPVLWLRSVDRSLLLQGWQDQGFITPANVVFL ------------------1111---1111------------------------------- YMLCRDVISSEVGSDHELQAVLLTCLYLSYSYMGNEISYPLKPFLVESCKEAFWDRCLSV --------1111----------------------------3333----3333-------- INLMSSKMLQINADPHYFTQVFSDLKNESG --------3333-------------3333- >DNA polymerase IV; SWP:Q47155; PDB:1UNNC; HHVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQ --------------3333----------------1111-------------1111----- EHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL -----------------------iiii------------------------ >RAC-ALPHA SERINE/THREONIN; SWP:P31749; PDB:1UNQA; SMSDVAIVKEGWLHKRGEYIKTWRPRYFLLKNDGTFIGYKERPQDVDQREAPLNNFSVAQ 3333----------------------------------------3333---------222 CQLMKTERPRPNTFIIRCLQWTTVIERTFHVETPEEREEWTTAIQTVADGLKKQEEEE 
2--------------------------------------------------------- >POP2; SWP:P39008; PDB:1UOCA; PPIFLPPPNYLFVRDVWKSNLYSEFAVIRQLVSQYNHVSISTEFVGSKVDYHYQTMRANV ------3333------3333----------3333------------------------11 DFLNPIQLGLSLSDANGNKPDNGPSTWQFNFEFDPKKEIMSTESLELLRKSGINFEKHEN 11-----------1111----------------1111---3333---------3333--- LGIDVFEFSQLLMDSGLMMDDSVTWITYHAAYDLGFLINILMNDSMPNNKEDFEWWVHQY -------------------1111------------------------------------- MPNFYDLNLVYKIIQEFKNQYSLTTLADELGLPRFSIFTTTGGQSLLMLLSFCQLSKLSM ----------------------------------3333---------------------- HKFPNGTDFAKYQGVIYGIDGDQ --1111-33332222---2222- >26S PROTEASOME NON-ATPASE; SWP:O75832; PDB:1UOHA; CVSNLMVCNLAYSGKLEELKESILADKSLATRTDQDSRTALHWACSAGHTEIVEFLLQLG ---------------------------3333--1111-------------------3333 VPVNDKDDAGWSPLHIAASAGRDEIVKALLGKGAQVNAVNQNGCTPLHYAASKNRHEIAV ------1111-------------------1111-1111-1111-3333--1111------ MLLEGGANPDAKDHYEATAMHRAAAKGNLKMIHILLYYKASTNIQDTEGNTPLHLACDEE --1111-1111-1111-------------------1111------1111----------- RVEEAKLLVSQGASIYIENKEEKTPLQVAKGGLGLILKRMVEG --------1111------1111-3333--!!!!---------- >OLIGO-1,6-GLUCOSIDASE; SWP:P21332; PDB:1UOK; MEKQWWKESVVYQIYPRSFMDSNGDGIGDLRGIISKLDYLKELGIDVIWLSPVYESPNDD ---1111-------3333-----------------------------------------i NGYDISDYCKIMNEFGTMEDWDELLHEMHERNMKLMMDLVVNHTSDEHNWFIESRKSKDN iii--------3333-3333--------------------------------33331111 KYRDYYIWRPGKEGKEPNNWGAAFSGSAWQYDEMTDEYYLHLFSKKQPDLNWDNEKVRQD -1111---------------------------1111-------1111---1111------ VYEMMKFWLEKGIDGFRMDVINFISKEEGLPTVETEEEGYVSGHKHFMNGPNIHKYLHEM --------1111-------1111---2222------------3333-------------- NEEVLSHYDIMTVGEMPGVTTEEAKLYTGEERKELQMVFQFEHMDLDSGEGGKWDVKPCS ---3333--------2222---------1111--------333311111111-------- LLTLKENLTKWQKALEHTGWNSLYWNNHDQPRVVSRFGNDGMYRIESAKMLATVLHMMKG ------------1111----------1111-3333-------------------1111-- TPYIYQGEEIGMTNVRFESIDEYRDIETLNMYKEKVMERGEDIEKVMQSIYIKGRDNART 
----2222----------3333--------------------------------3333-- PMQWDDQNHAGFTTGEPWITVNPNYKEINVKQAIQNKDSIFYYYKKLIELRKNNEIVVYG --------iiii---------1111----------1111--------------3333--- SYDLILENNPSIFAYVRTYGVEKLLVIANFTAEECIFELPEDISYSEVELLIHNYDVENG -----1111---------!!!!-----------------3333----------------- PIENITLRPYEAMVFKLK -------2222------- >THYMIDINE PHOSPHORYLASE; SWP:P19971; PDB:1UOUA; PKQLPELIRMKRDGGRLSEADIRGFVAAVVNGSAQGAQIGAMLMAIRLRGMDLEETSVLT ----------1111---------------------------------------------- QALAQSGQQLEWPEAWRQQLVDKHSTGGVGDKVSLVLAPALAACGCKVPMISGRGLGHTG ------------33331111-------22223333------1111--------------- GTLDKLESIPGFNVIQSPEQMQVLLDQAGCCIVGQSEQLVPADGILYAARDVTATVDSLP ----33332222----3333---------------11113333-----3333------33 LITASILSKKLVEGLSALVVDVKFGAVFPNQEQARELAKTLVGVGASLGLRVAAALTAMD 33--------3333-------------------------------1111----------- KPLGRCVGHALEVEEALLCMDGAGPPDLRDLVTTLGGALLWLSGHAGTQAQGAARVAAAL ------------------1111-------------------------------------- DDGSALGRFERMLAAQGVDPGLARALCSGSPAERRQLLPRAREQEELLAPADGTVELVRA -------------1111------------------------------------------- LPLALVLHELGALRLGVGAELLVDVGQRLRRGTPWLRVHRDGPALSGPQSRALQEALVLS -----------------------2222--2222--------------------3333--- DRAPFAAPLPFAELVLPP ------------------ >BUBBLE PROTEIN; SWP:P83799; PDB:1UOYA; DTCGSGYNVDQRRTNSGCKAGNGDRHFCGCDRTGVVECKGGKWTEVQDCGSSSCKGTSNG ---22221111-2222--3333------1111------iiii--------------1111 GATC ---- >PUTATIVE CELLULASE; SWP:Q79G13; PDB:1UOZA; GHIEGRHANPLAGKPFYVDPASAAMVAARNANPPNAELTSVANTPQSYWLDQAFPPATVG --------1111----------------------------1111------33333333-- GTVARYTGAAQAAGAMPVLTLYGIPHRDCGSYASGGFATGTDYRGWIDAVASGLGSSPAT ----------1111-----------2222------------------------!!!!--- IIVEPDALAMADCLSPDQRQERFDLVRYAVDTLTRDPAAAVYVDAGHSRWLSAEAMAARL ---22221111------------------------1111--------------------- NDVGVGRARGFSLNVSNFYTTDEEIGYGEAISGLTNGSHYVIDTSRNGAGPAPDAPLNWC ---3333------2222-----------------%%%%--------1111----2222-- 
NPSGRALGAPPTTATAGAHADAYLWIKRPGESDGTCGRGEPQAGRFVSQYAIDLAHNAGQ ----------------1111--------------iiii---2222----------1111- >6-PHOSPHO-BETA-GLUCOSIDAS; SWP:Q9X108; PDB:1UP7A; HMRIAVIGGGSSYTPELVKGLLDISEDVRIDEVIFYDIDEEKQKIVVDFVKRLVKDRFKV -------1111------------1111--------------------------%%%%--- LISDTFEGAVVDAKYVIFQFRPGGLKGRENDEGIPLKYGLIGQETTGVGGFSAALRAFPI ----3333-1111-------2222--------3333----------3333---------- VEEYVDTVRKTSNATIVNFTNPSGHITEFVRNYLEYEKFIGLCNVPINFIREIAEMFSAR -----------------------------------------------------------3 LEDVFLKYYGLNHLSFIEKVFVKGEDVTEKVFENLKLKIPDEDFPTWFYDSVRLIVNPYL 333-------2222-------iiii-------3333--------3333----------33 RYYLMEKKMFKKISTHELRAREVMKIEKELFEKYRTAVEIPEELTKRGGSMYSTAAAHLI 33---------------3333-------------------3333---------------- RDLETDEGKIHIVNTRNNGSIENLPDDYVLEIPCYVRSGRVHTLSQGKGDHFALSFIHAV ----------------iiii11111111--------iiii---------3333------- KMYERLTIEAYLKRSKKLALKALLSHPLGPDVEDAKDLLEEILEANREYVKLG -------------------------1111-3333---------1111------ >VANADIUM-DEPENDENT BROMOP; SWP:O81959; PDB:1UP8A; GIPADNLQSRAKASFDTRVAAAELALNRGVVPSFANGEELLYRNPDPDNTDPSFIASFTK ------------------------------------3333-------------1111-22 GLPHDDNGAIIDPDDFLAFVRAINSGDEKEIADLTLGPARDPETGLPIWRSDLANSLELE 22--1111----------------------1111-------------------------- VRGWENSSAGLTFDLEGPDAQSIAMPPAPVLTSPELVAEIAELYLMALGREIEFSEFDSP -----1111---------1111-------1111---------------11111111--33 KNAEYIQFAIDQLNGLEWFNTPAKLGDPPAEIRRRRGEVTVGNLFRGILPGSEVGPYLSQ 33--------------3333---2222------------3333-------1111----11 YIIVGSKQIGSATVGNKTLVSPNAADEFDGEIAYGSITISQRVRIATPGRDFMTDLKVFL 11-----2222--!!!!---11113333-----!!!!---------2222---------- DVQDAADFRGFESYEPGARLIRTIRDLATWVHFDALYEAYLNACLILLANGVPFDPNLPF -------2222-----------3333--------!!!!----------------1111-- QQEDKLDNQDVFVNFGSAHVLSLVTEVATRALKAVRYQKFNIHRRLRPEATGGLISVNKI --3333----------------------------------------3333---------- AAQKGESIFPEVDLAVEELGDILEKAEISNRKQNIADGDPDPDPSFLLPMAFAEGSPFHP 
-1111---3333---------------------------------------1111----- SYGSGHAVVAGACVTILKAFFDSGIEIDQVFEVDKDEDKLVKSSFKGTLTVAGELNKLAD ---------------------1111----------------------------------- NIAIGRNMAGVHYFSDQFESLLLGEQVAIGILEEQSLTYGENFFFNLPKFDGTTIQI ------3333--3333--------------------------------1111----- >CYTOCHROME C3; SWP:Q9L915; PDB:1UP9A; APAVPDKPVEVKGSQKTVMFPHAPHEKVECVTCHHLVDGKESYAKCGSSGCHDDLTAKKG ---------------------3333---3333----iiii----1111-----------1 EKSLYYVVHARGELKHTSCLACHSKVVAEKPELKKDLTGCAKSKCHP 111--------------------------3333---------3333- >CARBOXYETHYLARGININE SYNT; SWP:Q9LCV9; PDB:1UPAA; PTAAHALLSRLRDHGVGKVFGVVGREAASILFDEVEGIDFVLTRHEFTAGVAADVLARIT -----------1111----------3333-----2222------3333------------ GRPQACWATLGPGTNLSTGIATSVLDRSPVIALAAQSESHDIFPNDTHQCLDSVAIVAPS ---------!!!!------------------------1111-22222222-3333----- KYAVELQRPHEITDLVDSAVNAATEPVGPSFISLPVDLLGSSEGIDTNPPANTPAKPVGV -------3333-----------------------3333---2222--------------- VADGWQKAADQAAALLAEAKHPVLVVGAAAIRSGAVPAIRALAERLNIPVITTYIAKGVL -2222-------------------------3333------------------1111---- PVGHELNYGAVTGYDGILNFPALQTFAPVDLVLTVGYDYAEDLRPSWQKGIEKKTVRISP 2222----------3333--3333-1111--------3333------------------- TVNPIPRVYRPDVDVVTDVLAFVEHFETATASFGAKQRHDIEPLRARIAEFLADPETYED ----3333-------------------1111-------------------1111------ GRVHQVIDSNTVEEAAEPGEGTIVSDIGFFRHYGVLFARADQPFGFLTSAGCSSFGYGIP -3333-----------2222-------3333----------------------------- AAIGAQARPDQPTFLIAGDGGFHSNSSDLETIARLNLPIVTVVVNNDTNGLIELYQNIGH -------1111------------------------------------------------- HRSHDPAVKFGGVDFVALAEANGVDATRATNREELLAALRKGAELGRPFLIEVPVNYDFQ ---3333------------1111------------------1111--------------3 PGGFGALS 3333333- >DTDP-4-DEHYDRORHAMNOSE 3,; SWP:O06330; PDB:1UPIA; MKARELDVPGAWEITPTIHVDSRGLFFEWLTDHGFRAFAGHSLDVRQVNCSVSSAGVLRG -------2222---------3333-----------------------------2222--- LHFAQLPPSQAKYVTCVSGSVFDVVVDIREGSPTFGRWDSVLLDDQDRRTIYVSEGLAHG 
----------------------------2222-2222----------------2222--- FLALQDNSTVMYLSAEYNPQREHTIATDPTLAVDWPLVDGAAPSLSDRDAAAPSFEDVRA -----------------3333------3333--------------3333---------11 SGLLPRWEQTQRFIGEMRG 11----------------- >MO25 PROTEIN; SWP:Q9Y376; PDB:1UPKA; KSPADIVKNLKESAVLEKSDKKAEKATEEVSKNLVAKEILYQTEAVAQLAQELYNSGLLS -------------3333------------------------------------1111--- TLVADLQLIDFEGKKDVAQIFNNILRRQIGTRTPTVEYICTQQNILFLLKGYESPEIALN ----3333---------------1111-!!!!---------------------------- CGILRECIRHEPLAKIILWSEQFYDFFRYVESTFDIASDAFATFKDLLTRHKLLSAEFLE ---------------------------3333----------------------------- QHYDRFFSEYEKLLHSENYVTKRQSLKLLGELLLDRHNFTITKYISKPENLKLNLLRDKS ----------3333--------------------3333---3333--3333--3333--- RNIQFEAFHVFKVFVANPNKTQPILDILLKNQAKLIEFLSKFQNDREDEQFNDEKTYLVK --------------------3333------------------3333-------------- QIRDLKRPAQQ -1111------ >PEPP1; SWP:Q8N658; PDB:1UPQA; LRRDPNLPVHIRGWLHKQDSSGLRLWKRRWFVLSGHCLFYYKDSREESVLGSVLLPSYNI ---3333-----------------------------------3333--------1111-- RPDGPGAPRGRRFTFTAEHPGMRTYVLAADTLEDLRGWLRALGRASR ------1111--------2222---------------------1111 >GLCNAC-ALPHA-1,4-GAL-RELE; SWP:Q934G8; PDB:1UPSA; AKDFPANPIEKAGYKLDFSDEFNGPTLDREKWTDYYLPHWCKDPESAKANYRFENGSLVE ----------2222-------------1111------1111-3333-------%%%%--- YITEDQKPWCPEHDGTVRSSAIMSFDKSWIHNFSGTTDNHERNEWRGYTTKYGYFEIRAK --1111---3333!!!!---------2222-1111------------------------- LSNTGGGGHQAWWMVGMQDDTNDWFNSKQTGEIDILETFFSKKDTWRIAAYGWNDPNFQT -----------------------1111-----------1111---------!!!!----- SWTISEDKVPSGDPTSEYHIYAMEWTPTALKFYYDNELFKVIYGSPDYEMGTILNIYTDA ------------1111-----------------iiii----------------------1 GSGAHNDVWPKEWAIDYMRVWKPVDGYKESLNNYLIRNRQTGKFLYIEENNDKVSYGDIT 111-------------------1111---------------------------------3 LKNEKNAKWSKEYRDGYTLLKNNETGEYLNIENQTGYIEHGKVPKTWWSAQWSEVPVDGY 3331111------iiii-------------1111---------1111---------iiii TRFVNRWKPNMSIHTESYEGVLQYGNVPNTYWTSQWQLIPVE 
-------1111---1111---------1111----------- >ADP-RIBOSYLATION FACTOR-L; SWP:P40616; PDB:1UPTA; HTRERILILGLDGAGKTTILYRLQVGEVVTTIPTIGFNVETVTYKNLKFQVWDLGGLTSI ----------2222-----------------------------%%%%---------3333 RPYWRCYYSNTDAVIYVVDSCDRDRIGISKSELVALEEEELRKAILVVFANKQDEQATSS 1111--------------11111111-----------3333----------------333 EANSLGLPALKDRKWQIFKTSATKGTGLDEAEWLVETLKSRQ 3-11111111--------------2222---------1111- >Golgin subfamily A member; SWP:Q13439; PDB:1UPTB; EPTEFEYLRKVLFEYGRETKTAKVITTVLKFPDDQTQKILEREDARLSWLRSSS -3333------------3333-----1111------------------------ >OXYSTEROLS RECEPTOR LXR-B; SWP:P55055; PDB:1UPVA; QLTAAQELMIQQLVAAQLQVTPWPLGADPQSRDARQQRFAHFTELAIISVQEIVDFAKQV -----------------------22221111------------------------33332 PGFLQLGREDQIALLKASTIEIMLLETARRYNHETECITFLKDFTYSKDDFHRAGLQVEF 222------------------------1111---------------3333-1111-3333 INPIFEFSRAMRRLGLDDAEYALLIAINIFSADRPNVQEPGRVEALQQPYVEALLSYTRI ---------3333-----------------1111-------------------------- KRPQDQLRFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIWDVHE -1111---------------------------3333---------------- >RICIN; SWP:P02879; PDB:1UQ5A; QYPIINFTTAGATVQSYTNFIRAVRGRLTTGADVRHEIPVLPNRVGLPINQRFILVELSN --------2222----------------------iiii-----22221111--------1 HAELSVTLALDVTNAYVVGYRAGNSAYFFHPDNQEDAEAITHLFTDVQNRYTFAFGGAYD 111------------------!!!!--------------11111111------------- RLEQLAGNLRENIELGNGPLEEAISALYYYSTGGTQLPTLARSFIICIQMISEAARFQYI --------3333----------------3333---------------------------- EGEMRTRIRYNRRSAPDPSVITLENSWGRLSTAIQESNQGAFASPIQLQRRNGSKFSVYD -------------------------------------iiii--------1111------3 VSILIPIIALMVYRCAPPPSSQF 3331111---------------- >3-DEHYDROQUINATE DEHYDRAT; SWP:P43877; PDB:1UQRA; MKKILLLNGPNLNMLGKREPHIYGSQTLSDIEQHLQQSAQAQGYELDYFQANGEESLINR --------2222------1111-----------------1111----------------- IHQAFQNTDFIIINPGAFTHTSVAIRDALLAVSIPFIEVHLSNVHAREPFRHHSYLSDVA ---2222-------!!!!------------------------3333-3333----3333- KGVICGLGAKGYDYALDFAISELQKI 
-------------------------- >ALPHA,ALPHA-TREHALOSE-PHO; SWP:P31677; PDB:1UQTA; SRLVVVSNRIAPPDSAGGLAVGILGALKAAGGLWFGWSGETGNEDQPLKKVKKGNITWAS ----------------------------------------------------!!!!---- FNLSEQDLDEYYNQFSNAVLWPAFHYRLDLVQFQRPAWDGYLRVNALLADKLLPLLQDDD ----------------------11113333---3333--------------3333-1111 IIWIHDYHLLPFAHELRKRGVNNRIGFFLHIPFPTPEIFNALPTYDTLLEQLCDYDLLGF -----3333-------1111--------------33333333---------1111----- QTENDRLAFLDCLSNLTRVTTRSAKSHTAWGKAFRTEVYPIGIEPKEIAKQAAGPLPPKL ----------------------------iiii------------------1111--3333 AQLKAELKNVQNIFSVERLDYSKGLPERFLAYEALLEKYPQHHGKIRYTQIAPTSRGDVQ ------1111---------3333---------------3333-------------1111- AYQDIRHQLENEAGRINGKYGQLGWTPLYYLNQHFDRKLLMKIFRYSDVGLVTPLRDGMN ---------------------1111----------------------------------3 LVAKEYVAAQDPANPGVLVLSQFAGAANELTSALIVNPYDRDEVAAALDRALTMSLAERI 333---11111111------1111-----1111---3333----------1111------ SRHAEMLDVIVKNDINHWQECFISDLKQIVPR -------------------------3333--- >STE50 PROTEIN; SWP:P25344; PDB:1UQVA; GSHMNNEDFSQWSVDDVITWCISTLEVEETDPLCQRLRENDIVGDLLPELCLQDCQDLCD -------3333--3333-------------------------33331111---------- GDLNKAIKFKILINKMRDSKLEWKD -3333-------------------- >PUTATIVE BINDING PROTEIN ; SWP:P75797; PDB:1UQWA; AAKDVVVAVGSNFTTLDPYDANDTLSQAVAKSFYQGLFGLDKEKLKNVLAESYTVSDDGI ----------------3333-----------------------------------1111- TYTVKLREGIKFQDGTDFNAAAVKANLDRASDPANHLKRHNLYKNIAKTEAIDPTTVKIT -----------1111----------------3333---33331111------1111---- LKQPFSAFINILAHPATAISPAALEKYGKEIGFYPVGTGPYELDTWNQTDFVKVKKFAGY ------33331111------------!!!!--------------------------1111 WQPGLPKLDSITWRPVADNNTRAALQTGEAQFAFPIPYEQATLLEKNKNIELASPSIQRY -2222-------------------3333--------3333------1111---------- ISNVTQKPFDNPKVREALNYAINRPALVKVAFAGYATPATGVVPPSIAYAQSYKPWPYDP --1111-3333------3333----------iiii--------3333------------- VKARELLKEAGYPNGFSTTLWSSHNHSTAQKVLQFTQQQLAQVGIKAQVTADAGQRAAEV ------------------------------------------------------------ 
EGKGQKESGVRFYTGWSASTGEADWALSPLFASQNWPPTLFNTAFYSNKQVDDFLAQALK ---1111--------------3333------1111------1111------------111 TNDPAEKTRLYKAAQDIIWQESPWIPLVVEKLVSAHSKNLTGFWIPDTGFSFEDADLQ 1-----------------------------------3333-----------1111--- >ENDOXYLANASE; SWP:O68541; PDB:1UR1A; GLKSAYKDNFLIGAALNATIASGADERLNTLIAKEFNSITPENCMKWGVLRDAQGQWNWK -----1111-------3333----------------------1111-----1111----- DADAFVAFGTKHNLHMVGHTLVWHSQIHDEVFKNADGSYISKAALQKKMEEHITTLAGRY ---------------------------3333--1111----------------------2 KGKLAAWDVVNEAVGDDLKMRDSHWYKIMGDDFIYNAFTLANEVDPKAHLMYNDYNIERT 222-----------1111--------------------------1111------------ GKREATVEMIERLQKRGMPIHGLGIQGHLGIDTPPIAEIEKSIIAFAKLGLRVHFTSLDV ----------------------------------3333--------1111---------- DVLPSVWEEVSTRFEYKPERDPYTKGLPQEMQDKLAKRYEDLFKLFIKHSDKIDRATFWG --------1111----3333--1111----------------------3333-------- VSDDASWLNGFPIPGRTNYPLLFDRKLQPKDAYFRLLDLKRLEHHH -33331111--------------1111------------1111--- >GALACTANASE; SWP:Q65CX5; PDB:1UR4A; GLYVEKVSGLRKDFIKGVDVSSIIALEESGVAFYNESGKKQDIFKTLKEAGVNYVRVRIW ------2222---------1111---1111----3333---3333--1111--------- NDPYDANGNGYGGGNNDLEKAIQIGKRATANGMKLLADFHYSDFWADPAKQKAPKAWANL ----1111---iiii-------------1111--------------1111---3333--- NFEDKKTALYQYTKQSLKAMKAAGIDIGMVQVGNETNGGLAGETDWAKMSQLFNAGSQAV --------------------1111---------------iiii----------------- RETDSNILVALHFTNPETSGRYAWIAETLHRHHVDYDVFASSYYPFWHGTLKNLTSVLTS ---1111-------3333-----------1111----------3333------------- VADTYGKKVMVAETSYTYTAEDGDGHGNTAPKNGQTLNNPVTVQGQANAVRDVIQAVSDV ------------------------------------------------------------ GEAGIGVFYWEPAWIPVGPAHRLEKNKALWETYGSGWATSYAAEYDPEDAGKWFGGSAVD 3333--------------3333----------------33333333--3333------11 NQALFDFKGRPLPSLHVFQYVDTGTPF 11---1111---------3333----- >PROTEIN KINASE C-LIKE 1; SWP:Q16512; PDB:1URFA; GIPATNLSRVAGLEKQLAIELKVKQGAENMIQTYSNGSTKDRKLLLTAQQMLQDSKTKID ---3333----------------------------------------------------- IIRMQLRRALQADQLENQAAP 
--------------------- >3-MERCAPTOPYRUVATE SULFUR; SWP:P31142; PDB:1URHA; TTWFVGADWLAEHIDDPEIQIIDARMASPGQEDRNVAQEYLNGHIPGAVFFDIEALSDHT ------------1111------------------3333------2222---3333--333 SPLPHMLPRPETFAVAMRELGVNQDKHLIVYDEGNLFSAPRAWWMLRTFGVEKVSILGGG 3---------------------1111--------------------1111---------- LAGWQRDDLLLEEGAVELPEGEFNAAFNPEAVVKVTDVLLASHENTAQIIDARPAARFNA ---------------------------1111-----------------------1111-- EVDELRRGHIPGALNVPWTELVREGELKTTDELDAIFFGRGVSYDKPIIVSGSGVTAAVV ---------2222---3333----------------------------------3333-- LLALATLDVPNVKLYDGAWSEW ---------------------- >MAJOR DNA-BINDING PROTEIN; SWP:P04296; PDB:1URJA; TTIKVPPGPLGYVYARACPSEGIELLALLSARSGDSDVAVAPLVVGLTVESGFEANVAVV ---------------------3333------------------2222--1111------- VGSRTTAVSLKLTPSHYSSSVYVFHGGRHLDPSTQAPNLTRLCERARRHFGFSDYTPRPG -------------------------3333----------------------------111 DLKHETTGEALCERLGLDPDRALLYLVVTEGFKEAVCINNTFLHLGGSDKVTIGGAEVHR 11111------------1111-------11113333-------3333-----iiii---- IPVYPLQLFMPDFSRVIAEPFNANHRSIGEKFTYPLPFFNRPLNRLLFEAVVGPAAVALR ----3333----------1111--11112222------------------------1111 SRNVDAVARAAAHLAFDENHEGAALPADITFTAFAGGFEQRLASVMAGDAALALESIVSM ----------------1111------------------------------------3333 AVFDEPPTDISAWPLFEGQDTAAARANAVGAYLARAAGLVGAMVFSTNSALHLTEVDDAG --------1111-3333-------------------------------3333-------- PAHSKPSFYRFFLVPGTHVAANPQVDREGHVVPGFEGRPTAPLVGGTQEFAGEHLAMLSG ---------------1111------1111---------------------3333----%% FSPALLAKMLFYLERCDGAVIVMDVFRYVADSNQTDVPCNLCTFDTRHACVHTTLMRLRA %%3333-----3333---------------------------11111111-------333 RHPKFASAARGAIGVFGTMNSMYSDCDVLGNYAETYRAATERVMAELETLQYVDQAVPTA 3-------------------------1111-----------------------1111--- MGRLETIITNREALHTVVNNVRQVVDREVEQLMRNLVEFKFRDGLGEANHAMSLTLDPYA --3333----------------------------------3333------------1111 CGPCPLLQLLGRRSNLAVYQDLALSQCHGVFAGQSVEGRNFRNQFQPVLRRRVMDMFNNG ----------------------------------1111-3333----------------- 
FLSAKTLTVALSAICAPSLTAGQTAPAESSFEGDVARVTLGFPKELRVKSRVLFSAYQKP ------------------------------------------------------------ DKRVDILLGPLGFLLKQFHAAIFPNGKPPGSNQPNPQWFWTALQRNQLPARLLSREDIET ----33331111----------------------3333--3333---------------- IAFIKKFSLDYGAINFINLAPNNVSELAMYYMANQILRYCDHSTYFINTLTAIIAGSRRP -----------1111----------------------1111------------------- PSVQAAAAWSAQGGAGLEAGARALMDAVDAHPGAWTSMFASCNLLRPVMAARPMVVLGLS -3333-1111---1111---------33331111----------33331111-------- ISKYVFQAGNWASLMGGKNACPLLIFDRTRKFVLACPRAGFVCAASSLCEQLRGIISEGG ----------------33331111--1111-------2222------3333--------- AAVASSVFVATVKSLGPRTQQLQIEDWLALLEDEYLSEEMMELTARALERGNGEWSTDAA ---------------3333--------------------------------------333 LEVAHEAEA 3-------- >M-TOMOSYN ISOFORM; SWP:Q9Z152; PDB:1URQA; GIEGVKGAASGVVGELARARLALDERGQKLSDLEERTAAMMSSADSFSKHAHEMMLKY -----------------------------------------------------3333- >CG18505 PROTEIN; SWP:Q9VF36; PDB:1URRA; VAKQIFALDFEIFGRVQGVFFRKHTSHEAKRLGVRGWCMNTRDGTVKGQLEAPMMNLMEM 1111-------------------------1111-------1111---------------- KHWLENNRIPNAKVSKAEFSQIQEIEDYTFTSFDIKH --------2222------------------------- >MALTOSE-BINDING PROTEIN; SWP:Q9RHZ6; PDB:1URSA; QTITVWSWQTGPELQDVKQIAAQWAKAHGDKVIVVDQSSNPKGFQFYATAARTGKGPDVV ------------------------------------1111--1111----1111------ FGMPHDNNGVFAEEGLMAPVPSGVLNTGLYAPNTIDAIKVNGTMYSVPVSVQVAAIYYNK -----------1111----------1111-3333-1111iiii----------------- KLVPQPPQTWAEFVKDANAHGFMYDQANLYFDYAIIGGYGGYVFKDNNGTLDPNNIGLDT ------------------------1111--------1111------iiii-1111----- PGAVQAYTLMRDMVSKYHWMTPSTNGSIAKAEFLAGKIGMYVSGPWDTADIEKAKIDFGV --------------------1111--------1111-------3333------------- TPWPTLPNGKHATPFLGVITAFVNKESKTQAADWSLVQALTSAQAQQMYFRDSQQIPALL -----1111--------------1111-------------------------------33 SVQRSSAVQSSPTFKAFVEQLRYAVPMPNIPQMQAVWQAMSILQNIIAGKVSPEQGAKDF 33-----------------3333------3333--------------------------- VQNIQKG ------- >AMPHIPHYSIN; SWP:Q9Y092; PDB:1URUA; 
QNLGKVDRTADEIFDDHLNNFNRQQASANRLQKEFNNYIRCVRAAQAASKTLDSVCEIYE ----1111-----------------------------------------------11113 PQWSGYDALQAQTGASESLWADFAHKLGDQVLIPLNTYTGQFPEKKKVEKRNRKLIDYDG 333-------------------------------------3333---------------- QRHSFQNLQANANKRKDDVKLTKGREQLEEARRTYEILNTELHDELPALYDSRILFLVTN --1111---11113333-3333-------------------------------------- LQTLFATEQVFHNETAKIYSELEAIVDKLATESQR ------------------------------3333- >ALDOSE REDUCTASE; SWP:P15121; PDB:1US0A; MASRILLNNGAKMPILGLGTWKSPPGQVTEAVKVAIDVGYRHIDCAHVYQNENEVGVAIQ -------------------22221111--------1111------3333----------- EKLREQVVKREELFIVSKLWCTYHEKGLVKGACQKTLSDLKLDYLDLYLIHWPTGFKPGK --------3333-------1111--1111------------------------------- EFFPLDESGNVVPSDTNILDTWAAMEELVDEGLVKAIGISNFNHLQVEMILNKPGLKYKP -----1111-------------------1111--------------------2222---- AVNQIECHPYLTQEKLIQYCQSKGIVVTAYSPLGSPDRPWAKPEDPSLLEDPRIKAIAAK -------1111---------1111------11111111---1111-3333--------11 HNKTTAQVLIRFPMQRNLVVIPKSVTPERIAENFKVFDFELSSQDMTTLLSYNRNWRVCA 11---------3333----------------1111--------------1111------- LLSCTSHKDYPFHE 3333--1111---- >PUTATIVE GLUR0 LIGAND BIN; SWP:P83817; PDB:1US5A; AQEFITIGSGSTTGVYFPVATGIAKLVNDANVGIRANARSTGGSVANINAINAGEFEMAL ----------1111-------------3333----------------------------- AQNDIAYYAYQGCCIPAFEGKPVKTIRALAALYPEVVHVVARKDAGIRTVADLKGKRVVV --------------3333----1111----------------------33332222---- GDVGSGTEQNARQILEAYGLTFDDLGQAIRVSASQGIQLMQDKRADALFYTVGLGASAIQ -2222----------1111-1111---------------1111---------2222---- QLALTTPIALVAVDLNRIQAIAKKYPFYVGFNIPGGTYKGVDVTTPTVAVQAMLIASERL ---------------------3333--------22222222---------------3333 SEETVYKFMKAVFGNLEAFKKIHPNLERFFGLEKAVKGLPIPLHPGAERFYKEAGVLK -----------------------3333---3333------------------------ >Hsp90 co-chaperone Cdc37; SWP:Q16543; PDB:1US7B; HKTFVEKYEKQIKHFGMLRRWDDSQKYLSDNVHLVCEETANYLVIWCIDLEVEEKCALME --------------1111------------3333-3333----------------3333- 
QVAHQTIVMQFILELAKSLKVDPRACFRQFFTKIKTADRQYMEGFNDELEAFKERVRGRA ---------------------3333-------3333------------------------ KLRIEKAMKEYEEEERKKRLGPGGLDPVEVYESLPEELQKCMLQDAISKMDPTDAKYHMQ ----------------11111111-3333-1111--------3333-------------- RCIDSGLWVPNSKA -------------- >PUTATIVE STYRENE MONOOXYG; SWP:P83818; PDB:1USCA; MRSYRAQGPLPGFYHYYPGVPAVVGVRVEERVNFCPAVWNTGLSADPPLFGVSISPKRFT -------------1111----------!!!!-----------------------1111-- HGLLLKARRFSASFHPFGQKDLVHWLGSHSGREVDKGQAPHFLGHTGVPILEGAYAAYEL ---------------3333-----1111-3333-3333-----1111---2222------ ELLEVHTFGDHDLFVGRVVAVWEEEGLLDEKGRPKPGLALLYYGKGLYGRPAEETFAP -------!!!!------------2222-1111--2222-----%%%%----------- ---------------------------------------- >LEUCINE-SPECIFIC BINDING ; SWP:P04816; PDB:1USGA; DDIKVAVVGAMSGPIAQWGDMEFNGARQAIKDINAKGGIKGDKLVGVEYDDACDPKQAVA ------------1111----------------------iiii--------%%%%------ VANKIVNDGIKYVIGHLCSSSTQPASDIYEDEGILMISPGATNPELTQRGYQHIMRTAGL -----------------3333-3333-----------------3333------------3 DSSQGPTAAKYILETVKPQRIAIIHDKQQYGEGLARSVQDGLKAANANVVFFDGITAGEK 333---------------------------------------1111---------2222- DFSALIARLKKENIDFVYYGGYYPEMGQMLRQARSVGLKTQFMGPEGVGNASLSNIAGDA ---------------------------------1111-------3333-1111-----11 AEGMLVTMPKRYDQDPANQGIVDALKADKKDPSGPYVWITYAAVQSLATALERTGSDEPL 11--------11113333-------1111----3333----------------------- ALVKDLKANGANTVIGPLNWDEKGDLKGFDFGVFQWHADGSSTKAK ------------1111----1111------------1111------ >RIBOSE 5-PHOSPHATE ISOMER; SWP:O53192; PDB:1USLA; GMRVYLGADHAGYELKQRIIEHLKQTGHEPIDCGALRYDADDDYPAFCIAAATRTVADPG --------3333-----------1111-----------1111---------------222 SLGIVLGGSGNGEQIAANKVPGARCALAWSVQTAALAREHNNAQLIGIGGRMHTVAEALA 2------------------2222-------------------------1111-------- IVDAFVTTPWSKAQRHQRRIDILAEYERTHEAPPVPG ------------------------------------- >HEPATOCYTE NUCLEAR FACTOR; SWP:P83819; PDB:1USMA; MDWEERKRLVKTFAFPNFREALDFANRVGALAERENHHPRLTVEWGRVTVEWWTHSAGGV 
-------------------------------------------2222-------1111-- TEKDREMARLTDALLQR ------------3333- >ORGANIC HYDROPEROXIDE RES; SWP:Q9RTA8; PDB:1USPA; NVYTAEATATGGRAGTTRSSDDRLNLDLSVPAEGGDGGPGTNPEQLFAAGYAACFQGALG ------------------1111-------------------------------------- VVSRRQKIDVPADSTITARVGLQKAGLAFALDVELEGHFPGLSREQAEGLHAAHEVCPYS ------------------------!!!!----------2222------------------ AATRNNVDVRLKVRE --2222--------- >HEMAGGLUTININ-NEURAMINIDA; SWP:P32884; PDB:1USRA; GAPIHDPDFIGGIGKELIVDNASDVTSFYPSAFQEHLNFIPAPTTGSGCTRIPSFDMSAT ------1111-------------1111-----------------1111------------ HYCYTHNVILSGCRDHSHSHQYLALGVLRTTATGRIFFSTLRSISLDDTQNRKSCSVSAT -----------1111---------------1111-------------------------1 PLGCDMLCSKVTETEEEDYNSAVPTLMAHGRLGFDGQYHEKDLDVTTLFEDWVANYPGVG 111--------------3333-----------1111-------3333-1111-------- GGSFIDGRVWFSVYGGLKPNSPSDTVQEGKYVIYKRYNDTCPDEQDYQIRMAKSSYKPGR ----iiii---------2222---1111-------------------------1111333 FGGKRIQQAILSIKVSTSLGEDPVLTVPPNTVTLMGAEGRILTVGTSHFLYQRGSSYFSP 3----------------2222----------------------!!!!------------- ALLYPMTVSNKTATLHSPYTFNAFTRPGSIPCQASARCPNSCVTGVYTDPYPLIFYRNHT --------!!!!--------1111--------1111-------------------1111- LRGVFGTMLDSEQARLNPASAVFDSTSRSRITRVSSSSTKAAYTTSTCFKVVKTNKTYCL ---------------------------------------------------1111----- SIAEISNTLFGEFRIVPLLVEILKNDGV ---------------------------- >HISTONE H1; SWP:P53551; PDB:1USSA; KASSPSSLTYKEMILKSMPQLNDGKGSSRIVLKKYVKDTFSSKLKTSSNFDYLFNSAIKK --------3333-----3333------3333----------------------------- CVENGELVQPKGPSGIIKLNKKKVKLST -3333---1111---------------- >HISTONE H1; SWP:P53551; PDB:1USTA; KEEASSKSYRELIIEGLTALKERKGSSRPALKKFIKENYPIVGSASNFDLYFNNAIKKGV --------------------------------------------2222------------ EAGDFEQPKGPAGAVKLAKKKSPEVKKEKEVS ------1111---------------------- >Hsp90 co-chaperone AHA1; SWP:Q12449; PDB:1USUB; VDKNCIGWAKEYFKQKIVGVEAKKYAKIKSVSSIEGDCEVNQRGKVISLFDLKITVLIEG ----------------2222---------------------------------------- 
HVDSALPFEGSINVPEVAFDSEASSYQFDISIFKETSELSEAKPLIRSELLPKLRQIFQQ -----------------11111111----------3333----------3333------- FGKDLLATHGND ------------ >ATP PHOSPHORIBOSYLTRANSFE; SWP:Q9X0D3; PDB:1USYA; MDFLDFEKVFSFYSKATKKGFSPFFVPALEKAEEPAGNFFLDRKGNLFSIREDFTKTVLN ----3333--------1111---------------------------------------- HRKRYSPDSQIKVWYADFVYRYSGSDLVAEYQLGLEKVPRNSLDDSLEVLEIIVESASEF ----------------------!!!!----------------3333-------------- FEGPVIVEIGHTGVYEDLLKEIPKDLHEKVLNLIDTKNLAEIEFLSHMKKIDLSRVEKII ----------3333----11113333-------1111----------------------- EDSIYRRSPEHLKTMDLPLSVREDLLSASSFLQEKFPTVSVEIDLTLARTIEEYCGLIFT -------33331111--------------------1111-------3333---------- IYDTSSSRLVAAGGEYTVNGEKGVGGSIFLEGKTC ----------------------------------- >DR HEMAGGLUTININ STRUCTUR; SWP:P24093; PDB:1UT1A; GSFTPSGTTGTTKLTVTEKCQVRVGDLTVAKTRGQLTDAAPIGPVTVQALGCDARQVALK -------------------------------3333-2222-----------1111----- ADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSWGGIIGIYVDGQ -1111-%%%%----1111------------------iiii-------------------- QTNTPPGNYTLTLTGGYWAK 1111---------------- >SPHENISCIN-2; SWP:P83430; PDB:1UT3A; SFGLCRLRRGFCARGRCRFPSIPIGRCSRFVQCCRRVW -1111--------------------------------- >NO APICAL MERISTEM PROTEI; SWP:Q9C932; PDB:1UT7A; MGIQLTQLSLPPGFRFYPTDEELMVQYLCRKAAGYDFSLQLIAEIDLYKFDPWVLPNKAL ----3333--2222----------------1111-----------1111-11113333-- FGEKEWYFFSPRDRPNRVAGSGYWKATGTDKIISTEGQRVGIKKALVFYIGKAPKGTKTN ------------------!!!!------------iiii---------------------- WIMHEYRLIEPSDDWVLCRIYKKQ ------------------------ >CELLULOSE 1,4-BETA-CELLOB; SWP:NA; PDB:1UT9A; ILPQPDVRVNQVGYLPEGKKVATVVCNSTQPVKWQLKNAAGVVVLEGYTEPKGLDKDSQD -------------------------------------1111--------------1111- YVHWLDFSDFATEGIGYYFELPTVNSPTNYSHPFDIRKDIYTQMKYDALAFFYHKRSGIP ------3333----------1111--------------1111----------1111---- IEMPYAGGEQWTRPAGHIGIEPNKGDTNVPTWPQDDEYAGIPQKNYTKDVTGGWYDAGDH ---33333333---------------------3333------------------------ 
GKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYRDGGMNIPEQNNGYPDILDEARWEIEFF ---------------------111111111111-----1111------------------ KKMQVTEKEDPSIAGMVHHKIHDFRWTALGMLPHEDPQPRYLRPVSTAATLNFAATLAQS 1111-33333333------------------3333------------------------- ARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGGPGGGPYNDDYVGDEFYWA ------------------------------------------------------------ ACELYVTTGKDEYKNYLMNSPHYLEMPAKMGANGEDNGLWGCFTWGTTQGLGTITLALVE -------------------1111---------3333-------11113333--------- NGLPATDIQKARNNIAKAADRWLENIEEQGYRLPIKQAEDERGGYPWGSNSFILNQMIVM ---3333---------------------1111---------------3333--------- GYAYDFTGDSKYLDGMFDGISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPA ---------------------1111-1111---2222-------------33333333-- PPPGIISGGPNSRFEDPTINAAVKKDTPPQKCFIDHTDSWSTNEITVNWNAPFAWVTAYL -----------------------11113333----1111------1111----------- DQYTD ----- >CELL DIVISION PROTEIN FTS; SWP:P29131; PDB:1UTAA; KDERRWMVQCGSFRGAEQAETVRAQLAFEGFDSKITTNNGWNRVVIGPVKGKENADSTLN ------------------------------------------------------------ RLKMAGHTNCIRLAAGG --3333----------- >CLATHRIN HEAVY CHAIN; SWP:P49951; PDB:1UTCA; ILPIRFQEHLQLQNLGINPANIGFSTLTMESDKFICIREKVGEQAQVVIIDMNDPSNPIR ----------3333---3333-1111--------------!!!!------3333------ RPISADSAIMNPASKVIALKAGKTLQIFNIEMKSKMKAHTMTDDVTFWKWISLNTVALVT --------------------!!!!-----1111------------------1111----1 DNAVYHWSMEGESQPVKMFDRHSSLAGCQIINYRTDAKQKWLLLTGISAQQNRVVGAMQL 111------------------3333----------1111----------1111------- YSVDRKVSQPIEGHAASFAQFKMEGNAEESTLFCFAVRGQAGGKLHIIEVGTPPTGNQPF --1111----------------2222------------1111-----------2222--- PKKAVDVFFPPEAQNDFPVAMQISEKHDVVFLITKYGYIHLYDLETGTCIYMNRISGETI ---------1111--------------------1111----------------------- FVTAPHEATAGIIGVNRKGQVLSVCVEEENIIPYITNVLQNPDLALRMAVRN -----3333------1111-------1111---------------------- >II PURPLE ACID PHOSPHATAS; SWP:P09889; PDB:1UTEA; PTPILRFVAVGDWGGVPNAPFHTAREMANAKAIATTVKTLGADFILSLGDNFYFTGVHDA 
----------------------------------------------------------11 KDKRFQETFEDVFSDPSLRNVPWHVLAGNHDHLGNVSAQIAYSKISKRWNFPSPYYRLRF 11------3333--3333---------3333----------11113333----------- KIPRSNVSVAIFMLDTVTLCGNSDDFVSQQPERPRNLALARTQLAWIKKQLAAAKEDYVL --------------3333---33333333------------------------------- VAGHYPVWSIAEHGPTHCLVKQLLPLLTTHKVTAYLCGHDHNLQYLQDENGLGFVLSGAG ----------3333-------------1111----------------1111--------- NFMDPSKKHLRKVPNGYLRFHFGAENSLGGFAYVEITPKEMSVTYIEASGKSLFKTKLPR ------1111---2222------1111---------3333------1111---------- RA -- >UTEROGLOBIN; SWP:P02779; PDB:1UTG; GICPRFAHVIENLLLGTPSSYETSLKEFEPDDTMKDAGMQMKKVLDSLPQTTRENIMKLT -----------------------3333-----------------3333------------ EKIVKSPLCM -----3333- >LYSR-TYPE REGULATORY PROT; SWP:Q7WT50; PDB:1UTHA; TRNSFDPFASTRTFNLAMTDIGEMYFMPPLMEALAQRAPHIQISTLRPNAGNLKEDMESG -----1111----------------------------1111-----1111------1111 AVDLALGLLPELQTGFFQRRLFRHRYVCMFRKDHPSAKSPMSLKQFSELEHVGVVALNTG --------11112222--------------1111------------------------33 HGEVDGLLERAGIKRRMRLVVPHFIAIGPILHSTDLIATVPQRFAVRCEVPFGLTTSPHP 33------1111-----------1111---1111------3333-----1111------- AKLPDIAINLFWHAKYNRDPGNMWLRQLFVELFSEAHHH ------------3333----------------------- >GRB2-RELATED ADAPTOR PROT; SWP:O89100; PDB:1UTIA; VRWARALYDFEALEEDELGFRSGEVVEVLDSSNPSWWTGRLHNKLGLFPANYVAPMM -------------1111---2222----------------%%%%----1111----- >CYLR2; SWP:Q8VL32; PDB:1UTXA; MIINNLKLIREKKKISQSELAALLEVSRQTINGIEKNKYNPSLQLALKIAYYLNTPLEDI ----------1111-------------------1111------------------3333- FQWQPE ------ >NON-STRUCTURAL PROTEIN 2; SWP:P23065; PDB:1UTYA; FTKNIFVLDVTAKTLCGAIAKLSSQPYCQIKIGRVVAFKPVKNPEPKGYVLNVPGPGAYR --------1111--------1111--------------------2222------------ IQDGQDIISLMLTPHGVEATTERWEEWKFEGVSVTPMATRVQYNGVMVDAEIKYCKGMGI --!!!!------------------------------------%%%%-------------- VQPYMRNDFDRNEMPDLPGVMRSNYDIRELRQK ---------1111---2222-----3333---- >HISTIDINOL-PHOSPHATE AMIN; SWP:Q9X0D0; PDB:1UU1A; 
LIAKRAYPYETEKRDKTYLALNENPFPFPEDLVDEVFRRLNSDALRIYYDSPDEELIEKI ----------------------------------------3333---------------- LSYLDTDFLSKNNVSVGNGADEIIYVMMLMFDRSVFFPPTYSCYRIFAKAVGAKFLEVPL ---------3333-----3333----1111----------3333---------------- TKDLRIPEVNVGEGDVVFIPNPNNPTGHVFEREEIERILKTGAFVALDEAYYEFHGESYV 1111--------------------------------------------1111-------- DFLKKYENLAVIRTFSKAFSLAAQRVGYVVASEKFIDAYNRVRLPFNVSYVSQMFAKVAL ----------------11113333------------------------------------ DHREIFEERTKFIVEERERMKSALREMGYRITDSRGNFVFVFMEKEEKERLLEHLRTKNV ------------------------------------------------------------ AVRSFREGVRITIGKREENDMILRELEVFK ----1111---------------------- >3-PHOSPHOINOSITIDE DEPEND; SWP:O15530; PDB:1UU3A; QPRKKRPEDFKFGKILGEGSFSTVVLARELATSREYAIKILEKRHIIKENKVPYVTRERD -----1111--------------------------------------------------- VMSRLDHPFFVKLYFTFQDDEKLYFGLSYAKNGELLKYIRKIGSFDETCTRFYTAEIVSA -1111-1111-------------------11113333----------------------- LEYLHGKGIIHRDLKPENILLNEDMHIQITDFGTAKVLFVGTAQYVSPELLTEKSACKSS --------------3333---1111------1111------3333-3333------3333 DLWALGCIIYQLVAGLPPFRAGNEYLIFQKIIKLEYDFPEKFFPKARDLVEKLLVLDATK ------------------------------1111----2222--------------1111 RLGCEEMEGYGPLKAHPFFESVTWENLHQQTPPKLTA 22221111-3333--3333------3333-------- >BOVINE PANCREATIC TRYPSIN; SWP:P00974; PDB:1UUBA; DFCLEPPYTGPCRAAIIRYFYNAKAGLCQTFVYGGCRAKSNNFKSAEDCMRTCGGA ------------------------------------%%%%---------------- >ZINC-TYPE ALCOHOL DEHYDRO; SWP:P75691; PDB:1UUFA; IKAVGAYSAKQPLEPMDITRREPGPNDVKIEIAYCGVCHSDLHQVRSEWAGTVYPCVPGH -------1111------------1111--------------------------------- EIVGRVVAVGDQVEKYAPGDLVGVGCIVDSCKHCEECEDGLENYCDHMTGTYNSPTPDEP ---------1111---2222----------------11113333---------------- GHTLGGYSQQIVVHERYVLRIRHPQEQLAAVAPLLCAGITTYSPLRHWQAGPGKKVGVVG -------------3333------333333333333----------1111-2222------ IGGLGHMGIKLAHAMGAHVVAFTTSEAKREAAKALGADEVVNSRNADEMAAHLKSFDFIL -3333-------------------3333----3333-----1111----1111------- 
NTVAAPHNLDDFTTLLKRDGTMTLVGAPEVFNLIMKRRAIAGSMIGGIPETQEMLDFCAE ------------11112222--------3333---------------------------- HGIVADIEMIRADQINEAYERMLRGDVKYRFVIDNRTLTD ----------1111-------1111--------------- >CD44 ANTIGEN; SWP:P16070; PDB:1UUHA; AQIDLNITCRFAGVFHVEKNGRYSISRTEAADLCKAFNSTLPTAQEKALSIGFETCRYGF ----------iiii----%%%%------------1111----------1111-------- IEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDG 2222--------1111iiii------------------1111------------------ PITITIVNRDGTRYVQKGEYRTNPEDIY -------1111-----------3333-- >PLATELET-ACTIVATING FACTO; SWP:Q5SW16; PDB:1UUJA; VLSQRQRDELNRAIADYLRSNGYEEAYSVFKKEAELDNEELDKKYAGLLEKKWTSVIRLQ -------------------------------1111------------------------- KKVELESKLNEAKE ---------1111- >TRYPAREDOXIN PEROXIDASE H; SWP:O96763; PDB:1UULA; GEAEDLHPAPDFNETALMPNGTFKKVALTSYKGKWLVLFFYPMDFTFVCPTEICQFSDRV ---2222----------1111-----33332222------------------------33 KEFSDIGCEVLACSMDSEYSHLAWTSIERKRGGLGQMNIPILADKTKCIMKSYGVLKEED 33-1111--------------------3333------------3333---1111--3333 GVAYRGLFIIDPKQNLRQITVNDLPVGRDVDEALRLVKAFQFVEKHGEVCPANWKPGDKT ----------1111--------1111--3333------------------22222222-- MKPDPEKSKEYFGA -----1111----- >DIHYDROOROTATE DEHYDROGEN; SWP:Q63707; PDB:1UUMA; YAEYLMPGLQRLLDPESAHRLAVRVTSLGLLPRATFQDSDMLEVKVLGHKFRNPVGIAAG --------3333--------------------------1111---iiii--------222 FDKNGEAVDGLYKLGFGFVEVGSVTPQPQEGNPRPRVFRLPEDQAVINRYGFNSHGLSVV 2----------1111------------------------3333----------------- EHRLRARQQKQAQLTADGLPLGINLGKNKTSEDAAADYAEGVRTLGPLADYLVVNVSSQG ---------------------------1111-------------3333------------ KTELRHLLSKVLQERDALKGTRKPAVLVKIAPDLTAQDKEDIASVARELGIDGLIVTNTT --------------1111------------------------------------------ VSRPVGLQGALRSETGGLSGKPLRDLSTQTIREMYALTQGRIPIIGVGGVSSGQDALEKI ---2222-1111-------3333--------------%%%%------------------- QAGASLVQLYTALIFLGPPVVVRVKRELEALLKERGFTTVTDAIGADHRR ---------3333-------------------1111--33332222---- >MANNOSYL-OLIGOSACCHARIDE ; SWP:Q6QT42; 
PDB:1UUQA; EHFVRVNGGHFELQGKPYVITGVNMWYAAYLGAPNEVGDRDRLAKELDNLKAIGVNNLRV ------!!!!--iiii--------11113333--3333------------1111------ LAVSEKSEINSAVKPAVTNGFGNYDETLLQGLDYLLVELAKRDMTVVLYFNNFWQWSGGM -------------------2222--3333----------1111---------------33 TQYMAWIEGEPVQDPNVTNEWEAFMAKSASFYRSEKAQQEYRKTLEKIITRVNSINGKAY 33-----------3333-----------3333---------------1111-------33 VDDATIMSWQLANEPRPGNSQTTAEEKQIYIDWVHAAAAYIKTLDAHHLVSSGSEGEMGS 333333-------------------------------------------------3333% VNDMQVFIDAHATPDIDYLTYHMWIRNWSWFDKTKPAETWPSAWEKAQNYMRAHIDVAKQ %%%-----11111111-------3333----11113333--------------------- LNKPLVLEEFGLDRDMGSYAMDSTTEYRDNYFRGVFELMLASLEQGEPSAGYNIWAWNGY -------------2222--1111-------------------1111-----------!!! GRTTRANYWWQEGDDFMGDPPQEEQGMYGVFDTDTSTIAIMKEFNARFQP !---1111--2222-----1111-------1111-----------3333- >STAT PROTEIN; SWP:O00910; PDB:1UURA; SSPQPILDTIYKLLSEQEQTLVQMIHEQSLLLNRLPPTLDENSLAPLKSLSQKQITLSGQ ----------------------------------------3333---------------- MNTEMSALDATKKGMILEPTDLAKLFALKQDLQIQFKQLSLLHNEIQSILNPQHSAPKPN ----------1111---------------------------------------------- VALVLKSQPFPVVISKGKQLGENQLVVLVLTGARSNFHINGPVKATMICDSHPPTTPLEM --------------2222------------------------------------------ DSQPIYPATLTAHFPLKFLAGTRKCSVNLKFGVNIRDLDNVTTTVESDASNPFVVITNEC ---------------------%%%%-----------1111-------------------- QWEGSAGVLLKKDAFDGQLEITWAQFINTLQRHFLIATKQDPVRPKRPLSSYDLKYIQTH -----------------------------------1111-3333---------------- FFGNRSIIHQQDFDKFWVWFGKSMQTLRYQRHISTLWQEGIIYGYMGRQEVNDALQNQDP -%%%%---1111-----------------2222--------------33331111---22 GTFIIRFSERNPGQFGIAYIGVEMPARIKHYLVQPNDTAAAKKTFPDFLSEHSQFVNLLQ 22--------2222-------------------3333------3333----1111----- WTKDTNGAPRFLKLHKDTALGSFAPKRTAPVPVGGEPLNS ---1111----------1111------------------- >MOLYBDOPTERIN BIOSYNTHESI; SWP:Q39054; PDB:1UUYA; GPEYKVAILTVSDTVSAGAGPDRSGPRAVSVVDSSSEKLGGAKVVATAVVPDEVERIKDI --------------1111------------------1111-------------------- 
LQKWSDVDEMDLILTLGGTGFTPRDVTPEATKKVIERETPGLLFVMMQESLKITPFAMLA ---------------------1111------------------------3333-3333-- RSAAGIRGSTLIINMPGNPNAVAECMEALLPALKHALKQIK -----------------3333-------3333--------- >INHIBITOR OF VERTEBRATE L; SWP:Q9HXB1; PDB:1UUZA; EEQPRLFELLGQPGYKATWHAMFKGESDVPKWVSDASGPSSPSTSLSLEGQPYVLANSCK ----3333------------1111--------1111-----------iiii--------2 PHDCGNNRLLVAFRGDKSAAYGLQVSLPDEPAEVMQTPSKYATYRWYGEPSRQVRELLMK 2221111------1111--------------3333-3333-------------------- QLESDPNWKL ----1111-- >PANCREATITIS-ASSOCIATED P; SWP:Q06141; PDB:1UV0A; ARIRCPKGSKAYGSHCYALFLSPKSWTDADLACQKRPSGNLVSVLSGAEGSFVSSLVKSI -----2222--!!!!-----------------------------------------1111 GNSYSYVWIGLHDPTQGTEGEGWEWSSSDVMNYFAWERNPSTISSPGHCASLSRSTAFLR ------------1111--------1111----------3333----------3333---- WKDYNCNVRLPYVCKFTD ----1111---------- >ARABINAN-ENDO 1,5-ALPHA-L; SWP:Q5MPF4; PDB:1UV4A; AFWGASNELLHDPTMIKEGSSWYALGTGLTEERGLRVLKSSDAKNWTVQKSIFTTPLSWW -----------------!!!!-------1111------------------------3333 SNYVPNYGQNQWAPDIQYYNGKYWLYYSVSSFGSNTSAIGLASSTSISSGGWKDEGLVIR ------------------iiii--------2222-----------3333----------- STSSNNYNAIDPELTFDKDGNPWLAFGSFWSGIKLTKLDKSTMKPTGSLYSIAARPNNGG -1111-----------1111--------!!!!----------------------1111-- ALEAPTLTYQNGYYYLMVSFDKCCDGVNSTYKIAYGRSKSITGPYLDKSGKSMLEGGGTI ---------iiii--------------------------1111---1111-3333----- LDSGNDQWKGPGGQDIVNGNILVRHAYDANDNGIPKLLINDLNWSSGWPSY ------------------------------%%%%---------1111---- ------------------------------------------------------------ -------------- >SERINE PROTEINASE INHIBIT; SWP:Q9NQ38; PDB:1UVGA; DSEMCKDYRVLPRIGYLCPKDLKPVCGDDGQTYNNPCMLCHENLIRQTNTHIRSTGKCEE 3333----------------------1111----3333--3333--------------33 SSTPGTTAASMPPSDE 33---3333------- >STARVATION-INDUCED DNA PR; SWP:Q8VP75; PDB:1UVHA; TIPGLSDKKASDVADLLQKQLSTYNDLHLTLKHVHWNVVGPNFIGVHEMIDPQVELVRGY -----------------------------------------3333--------------- 
ADEVAERIATLGKSPKGTPGAIIKDRTWDDYSVERDTVQAHLAALDLVYNGVIEDTRKSI -----------------3333--------------------------------------- EKLEDLDLVSQDLLIAHAGELEKFQWFVRAHLESAGG 3333--------------------------------- >P2 PROTEIN; SWP:P11124; PDB:1UVJA; PRRAPAFPLSDIKAQMLFANNIKAQQASKRSFKEGAIETYEGLLSVDPRFLSFKNELSRY -------33333333------------------------22221111------------- LTDHFPANVDEYGRVYGNGVRTNFFGMRHMNGFPMIPATWPLASNLKKRADADLADGPVS ---------1111--!!!!----3333--2222----------------1111------- ERDNLLFRAAVRLMFSDLEPVPLKIRKGSSTCIPYFSNDMGTKIEIAERALEKAEEAGNL -------------------------2222------------------------------- MLQGKFDDAYQLHQMGGAYYVVYRAQSTDAITLDPKTGKFVSKDRMVADFEYAVTGGEQG 1111--------------------------------------------------iiii-- SLFAASKDASRLKEQYGIDVPDGFFCERRRTAMGGPFALNAPIMAVAQPVRNKIYSKYAY --------3333-----------------------33333333----------------- TFHHTTRLNKEEKVKEWSLCVATDVSDHDTFWPGWLRDLICDELLNMGYAPWWVKLFETS --------------------------3333------------------------------ LKLPVYVGAPAPEQGHTLLGDPSNPDLEVGLSSGQGATDLMGTLLMSITYLVMQLDHTAP ----------2222------3333-------1111-----------------------33 HLNSRIKDMPSACRFLDSYWQGHEEIRQISKSDDAMLGWTKGRALVGGHRLFEMLKEGKV 331111------------1111--------!!!!-------3333--------------- NPSPYMKISYEHGGAFLGDILLYDSRREPGSAIFVGNINSMLNNQFSPEYGVQSGVRDRS ---------------iiii----33333333---------------------1111-333 KRKRPFPGLAWASMKDTYGACPIYSDVLEAIERCWWNAFGESYRAYREDMLKRDTLELSR 3-------3333--3333--1111------------------------------------ YVASMARQAGLAELTPIDLEVLADPNKLQYKWTEADVSANIHEVLMHGVSVEKTERFLRS -1111-----1111-----------1111---3333-3333------------------- VMPR ---- >HLA CLASS II HISTOCOMPATI; SWP:Q30066; PDB:1UVQA; DIVADHVASCGVNLYQFYGPSGQYTHEFDGDEQFYVDLERKETAWRWPEFSKFGGFDPQG ---------------------------iiii---------------33331111------ ALRNMAVAKHNLNIMIKRYNSTAATNEVPEVTVFSKSPVTLGQPNTLICLVDNIFPPVVN ----------------1111-------------------2222----------------- ITWLSNGQSVTEGVSETSFLSKSDHSFFKISYLTFLPSADEIYDCKVEHWGLDQPLLKHW 
----iiii--2222-------1111------------2222-------1111-------- EP -- >DNA LIGASE III; SWP:P49916; PDB:1UW0A; MAEQRFCVDYAKRGTAGCKKCKEKIVKGVCRIGKVVPNPFSESGGDMKEWYHIKCMFEKL ------------------------------------------------------------ ERARATTKKIEDLTELEGWEELEDNEKEQITQHIADLSSKAAGTPKKKAVVQAKLTT -----------------3333------------------------------------ >ARTIFICIAL NUCLEOTIDE BIN; SWP:NA; PDB:1UW1A; DDKKTNWLKRIYRVRPCVKCKVAPRNWKVKNKHLRIYNMCKTCFNNSIDIGDDTYHGHDD -----------1111--------------!!!!-------------------1111---- WLMYADS ------- >U1 SMALL NUCLEAR RIBONUCL; SWP:P09234; PDB:1UW2A; MPKFYCDYCDTYLTHDSPSVRKTHCSGRKHKENVKDYYQKWMEEQAQSLIDKTTAAFQQG -----3333-----------------3333--3333------------------1111-- K - >PRION PROTEIN; SWP:P23907; PDB:1UW3A; LGGYMLGSAMSRPLIHFGNDYEDCYYRENMHRYPNQVYYRPVDQYSNQNNFVHDCVNITV ----------------------------1111--------3333---3333--------- KQHTVTTTTKGENFTETDIKIMERVVEQMCITQYQRESQAYYQRGA -------1111----------------------------------- >REGULATOR OF NONSENSE TRA; SWP:Q9BZI7; PDB:1UW4A; LSKVVIRRLPPTLTKEQLQEHLQPMPEHDYFEFFSNDTSLYPHMYARAYINFKNQEDIIL ---------1111----------------------------------------3333--- FRDRFDGYVFLDNKGQEYPAIVEFAPFQKAA ----2222---1111---------------- >Regulator of nonsense tra; SWP:Q9HAU5; PDB:1UW4B; RPPLQEYVRKLLYKDLSKVTTEKVLRQMRKLPWQDQEVKDYVICCMINIWNVKYNSIHCV -3333-----------1111-------11111111-----------------1111---- ANLLAGLVLYQEDVGIHVVDGVLEDIRLGMEVNQPKFNQRRISSAKFLGELYNYRMVESA ---------------------------------3333--------------1111--333 VIFRTLYSFTSFGVNPDGSPSSLDPPEHLFRIRLVCTILDTCGQYFDRGSSKRKLDCFLV 3---------22221111--11111111-------------3333--------------- YFQRYVWWKKSLEVWTKDHPFPIDIDYMISDTLELLRPKIKLCNSLEESIRQVQDLEREF -----------33333333-----------------1111-------------------- LIKLGLVN -------- >ACETYLCHOLINE-BINDING PRO; SWP:P58154; PDB:1UW6A; EFDRADILYNIRQTSRPDVIPTQRDRPVAVSVSLKFINILEVNEITNEVDVVFWQQTTWS ---------------1111---%%%%---------------------------------- DRTLAWNSSHSPDQVSVPISSLWVPDLAAYNAISKPEVLTPQLARVVSDGEVLYMPSIRQ 
3333-------------3333-------1111--------------1111---------- RFSCDVSGVDTESGATCRIKIGSWTHHSREISVDPTTENSDDSEYFSQYSRFEILDVTQK -----2222-3333------------1111----------1111--1111---------- KNSVTYSCCPEAYEDVEVSLNFRKKGRS -----3333------------------- >FERULOYL ESTERASE A; SWP:O42807; PDB:1UWCA; ASTQGISEDLYNRLVEMATISQAAYADLCNIPSTIIKGEKIYNAQTDINGWILRDDTSKE -------------------------%%%%--1111--------------------1111- IITVFRGTGSDTNLQLDTNYTLTPFDTLPQCNDCEVHGGYYIGWISVQDQVESLVKQQAS ------------------------3333--2222-------------------------- QYPDYALTVTGHSLGASMAALTAAQLSATYDNVRLYTFGEPRSGNQAFASYMNDAFQVSS -1111--------------------1111---------------------------3333 PETTQYFRVTHSNDGIPNLPPAEQGYAHGGVEYWSVDPYSAQNTFVCTGDEVQCCEAQGG --------------3333--3333---------------3333----------3333--- QGVNDAHTTYFGMTSGACTWV ---3333--iiii2222---- >HYPOTHETICAL PROTEIN TM04; SWP:Q9WYV7; PDB:1UWDA; MSKKVTKEDVLNALKNVIDFELGLDVVSLGLVYDIQIDDQNNVKVLMTMTTPMCPLAGMI ------------------3333---------------1111------------------- LSDAEEAIKKIEGVNNVEVELTFDPPWTPERMSPELREKFGV -------1111----------------1111----------- >ANTIBODY 14D9; SWP:NA; PDB:1UWEH; LLAQSGPELVKPGASVKISCAASAYSITDFTIYWVKQSHGDSLEWIGGIDPHNGGGAYNQ ----------2222-----------1111--------%%%%-----------------33 KFRVKATLTVDTSSSTAYIHLNSLTSEDSAVYYCAIFYGNFF 33---------1111---------3333-------------- >ANTIBODY 14D9; SWP:NA; PDB:1UWEL; LDNVMTQSPSFMSTSVGDRVSVTCAASQNVGTNVAWYQQKPGQPPKALIYSTSYRYSGVP ---------------------------------------2222------------22221 DRFTGSGSGTDFTLTISNVNSEDLAEYFCQQYNIYPVTFGGGTKLEIKRTVAAPSVFIFP 111----------------3333------------------------------------- PSAEQLASGTASVVCLLNNFYPREAAVQWKVDNALQSGNSQESVTEQDSADSTYSLSSTL -3333-------------------------iiii-------------------------- TLSKADYEAHAVYACEVTHQGLSSPVTKSFNRG ------1111--------1111----------- >FIMH PROTEIN; SWP:P08191; PDB:1UWFA; FACKTANGTAIPIGGGSANVYVNLAPVVNVGQNLVVDLSTQIFCHNDYPETITDYVTLQR ----1111---2222-------------2222----3333-------3333--------- 
GSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTDKPWPVALYLTPVSSAGGVAI ----------------iiii-------------------------------1111----- KAGSLIAVLILRQTNNYNSDDFQFVWNIYANNDVVVPT 2222---------------------------------- >ANTIBODY 14D9; SWP:NA; PDB:1UWGH; QLLESGPELVEPGASVKVSCKASAYSITDFNIYWVKQSHGKNLEWIGGIDPHNGGPVYNQ ----------2222-----------3333-----------------------------33 KFNGKATLTVDKSSSTAFMHLNSLTSEDSAVYYCAIFYGNFFDYWGPGTTVTVSSASTKG 33--------3333----------1111---------!!!!------------------- PSVFPLAPTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTVPS ---------------------------%%%%--2222-------3333----------33 SSLGTQTYICNVNHKPSNTKVDKKVEPKS 33-----------3333------------ >ANTIBODY 14D9; SWP:NA; PDB:1UWGL; ELVMTQSPKFMSTSVGDRVSVTCKASQNVGTHVAWYQQKPGQSPKTLIYSASYRYSGVPD -------------2222---------------------2222------------222211 RFTGSGSGTDFTLTIRDVQSEDAAEYFCQQYNLFPVTFGGGTKLEIKRTVAAPSVFIFPP 11----------------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAAVAWKVDNALQSGNSQESVTEQDSADSTYSLSSTLT 33331111---------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGE -----------------1111--------2222 >UROCANATE HYDRATASE; SWP:P25080; PDB:1UWKA; NNKYRDVEIRAPRGNKLTAKSWLTEAPLRMLMNNLDPQVAENPKELVVYGGIGRAARNWE --------------------3333-----------1111--------------------- CYDKIVETLTRLEDDETLLVQSGKPVGVFKTHSNAPRVLIANSNLVPHWANWEHFNELDA ------------1111----iiii-------1111----------3333---------11 KGLAMYGQMTAGSWIYIGSQGIVQGTYETFVEAGRQHYGGSLKGKWVLTAGLGGMGGAQP 11---------------------------------------2222---------3333-- LAATLAGACSLNIESQQSRIDFRLETRYVDEQATDLDDALVRIAKYTAEGKAISIALHGN ---1111--------3333---------------------------1111---------1 AAEILPELVKRGVRPDMVTDQTSAHDPLNGYLPAGWTWEQYRDRAQTEPAAVVKAAKQSM 111-----1111----------3333------2222------------------------ AVHVQAMLDFQKQGVPTFDYGNNIRQMAKEEGVADAFDFPGFVPAYIRPLFCRGVGPFRW ----------1111--------------11111111----------33331111------ AALSGEAEDIYKTDAKVKELIPDDAHLHRWLDMARERISFQGLPARICWVGLGLRAKLGL 
-----3333--------------------------------------------------- AFNEMVRSGELSAPVVIGRDHLDSGSVSSPNAETEAMRDGSDAVSDWPLLNALLNTAGGA ----------------------1111--1111----11111111---------------- TWVSLHHGGGVGMGFSQHSGMVIVCDGTDEAAERIARVLTNDPGTGVMRHADAGYDIAID -----------2222---------------------1111-------------------- CAKEQGLDLPMITG --------1111-- >FERREDOXIN VI; SWP:P80306; PDB:1UWMA; AKIIFIEHNGTRHEVEAKPGLTVMEAARDNGVPGIDADCGGACACSTCHAYVDPAWVDKL ------1111-------2222------1111-----1111------------33331111 PKALPTETDMIDFAYEPNPATSRLTCQIKVTSLLDGLVVHLPEKQI --------3333-----------3333---1111------------ >BETA-GALACTOSIDASE; SWP:P22498; PDB:1UWSA; MYSFPNSFRFGWSQAGFQSEMGTPGSEDPNTDWYKWVHDPENMAAGLVSGDLPENGPGYW ----1111------3333----2222-------------------------3333----- GNYKTFHDNAQKMGLKIARLNVEWSRIFPNPLPRPFDESKQDVTEVEINENELKRLDEYA ----------1111--------3333----------1111--------------3333-- NKDALNHYREIFKDLKSRGLYFILNMYHWPLPLWLHDPIRVRRGDFTGPSGWLSTRTVYE ---------------1111------------1111--------------!!!!------- FARFSAYIAWKFDDLVDEYSTMNEPNVVGGLGYVGVKSGFPPGYLSFELSRRAMYNIIQA -----------3333--------3333--------------------------------- HARAYDGIKSVSKKPVGIIYANSSFQPLTDKDMEAVEMAENDNRWWFFDAIIRGEITKIV ----------------------------1111---------------------------- RDDLKGRLDWIGVNYYTRTVVKRTEKGYVSLGGYGHGCERNSVSLAGLPTSDFGWEFFPE 1111-------------------1111------!!!!------1111----------333 GLYDVLTKYWNRYHLYMYVTENGIADDADYQRPYYLVSHVYQVHRAINSGADVRGYLHWS 3------------------------3333-----------------1111---------- LADNYEWASGFSMRFGLLKVDYNTKRLYWRPSALVYREIATNGAITDEIEHLNSVPPVKP -----!!!!------------------------------1111--3333--------111 LRH 1-- >23S RRNA (URACIL-5-)-METH; SWP:P55135; PDB:1UWVA; QIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSP ----------1111-----iiii-------2222------------------------11 ERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLKHDVSEVIADVPWGYRRRARLSL 11----1111-------1111--------------------------------------- NYLPKTQQLQGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQARHLGHVELVQA 
---1111------------------3333--------------1111------------3 TSGTLILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEPWYDSNGLRLTFS 333------------------------------------------------iiii----3 PRDFIQVNAGVNQKVARALEWLDVQPEDRVLDLFCGGNFTLPLATQAASVVGVEGVPALV 333---------------------1111--------3333--1111-------------- EKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVQQIIKLEPI -------1111--------------33333333----------33333333---3333-- RIVYVSCNPATLARDSEALLKAGYTIARLALDFPHTGHLESVLFSRV -------------------1111---------2222----------- >ENDOGLUCANASE; SWP:P06564; PDB:1UWWA; VVHDPKGEAVLPSVFEDGTRQGWDWAGESGVKTALTIEEANGSNALSWEFGYPEVKPSDN -------------------iiii--1111----------%%%%---------------11 WATAPRLDFWKSDLVRGENDYVTFDFYLDPVRATEGANINLVFQPPTNGYWVQAPKTYTI 11-------------!!!!-------------------------1111------------ NFDELEEANQVNGLYHYEVKINVRDITNIQDDTLLRNIIFADVESDFAGRVFVDNVRFEG 11111111--iiii--------1111---1111--------------------------- A - >Class 1 outer membrane pr; SWP:Q51220; PDB:1UWXK; GIVMTQTPASQSASLGESVTITCLASQTIGTWLAWYQQKPGKSPQLLIYAATSLADGVPS -------------2222---------------------2222------------222233 RFSGSGSGTKFSFKISSLQAEDFVSYYCQQLSSTP 33----------------3333------------- >CARBOXYPEPTIDASE M; SWP:P14384; PDB:1UWYA; LDFNYHRQEGMEAFLKTVAQNYSSVTHLHSIGKSVKGRNLWVLVVGRFPKEHRIGIPEFK ---------------------3333----------------------------------- YVANMHGDETVGRELLLHLIDYLVTSDGKDPEITNLINSTRIHIMPSMNPDGFEAVKKPD ------------------------------------------------------------ CYYSIGRENYNQYDLNRNFPDAFEYNNVSRQPETVAVMKWLKTETFVLSANLHGGALVAS --------1111-1111------------------------------------------- YPFDNGVQATGALYSRSLTPDDDVFQYLAHTYASRNPNMKKGDECKNKMNFPNGVTNGYS ------33331111----------------------3333----------------3333 WYPLQGGMQDYNYIWAQCFEITLELSCCKYPREEKLPSFWNNNKASLIEYIKQVHLGVKG ------3333---------------------3333-----------------1111---- QVFDQNGNPLPNVIVEVQDRKHICPYRTNKYGEYYLLLLPGSYIINVTVPGHDPHITKVI ---1111---------1111--------1111---------------------------- IPEKSQNFSALKKDILLPFQGPSCPMIPLYRNLP 
---------------------------3333--- >CYTIDINE DEAMINASE; SWP:P19079; PDB:1UWZA; MNRQELITEALKARDMAYAPYSKFQVGAALLTKDGKVYRGCNIENAAYSMCNCAEATALF -----------3333----------------1111-----------3333---------- KAVSEGDTEFQMLAVAADTPGPVSPCGACRQVISELCTKDVIVVLTNLQGQIKEMTVEEL --1111-------------------------------1111--------------3333- LPGAFSSEDL -----3333- >BNI1 PROTEIN; SWP:P41832; PDB:1UX5A; KYPRPHKKLKQLHWEKLDCTDNSIWGTGKAEKFADDLYEKGVLADLEKAFAAREIKSLAS ---------------------------3333----------------------------- KRKEDLQKITFLSRDISQQFGINLHMYSSLSVADLVKKILNCDRDFLQTPSVVEFLSKSE -----------------------3333---------------3333-------3333-33 IIEVSVNLARNYAPYSTDWEGVRNLEDAKPPEKDPNDLQRADQIYLQLMVNLESYWGSRM 33---------3333-------------------3333-----------1111------- RALTVVTSYEREYNELLAKLRKVDKAVSALQESDNLRNVFNVILAVGNFMNDTSKQAQGF ---------------------------------------------------3333----- KLSTLQRLTFIKDTTNSMTFLNYVEKIVRLNYPSFNDFLSELEPVLDVVKVSIEQLVNDC 3333--------3333---------------3333---3333---------3333----- KDFSQSIVNVERSVEIGNLSDSSKFHPLDKVLIKTLPVLPEARKKGDLLEDEVKLTIMEF --------------------1111-11113333-3333---------------------- ESLMHTYGEDSGDKFAKISFFKKFADFINEYKKAQAQNLAAEEEERLYIKH ---3333--3333---------------------------------3333- >THROMBOSPONDIN-1; SWP:P07996; PDB:1UX6A; ALADNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGK ------3333-----1111---3333----1111---1111--1111-1111-1111--- GDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENV 3333--1111---3333--1111-1111-1111---3333--1111---3333-----11 DISETDFRRFQMIPLDPKGTSQNDPNWVVELVQTVNSDPGLAVGYDEFNAVDFSGTFFIN 11---------------------------------------------------------- TERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVKSTTGPGEH ------------------------------------------------------------ LRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADS ----------2222------1111----------------1111-------!!!!----- GPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP ----------------------------------- >YJBI PROTEIN; SWP:O31607; PDB:1UX8A; 
NAPYEAIGEELLSQLVDTFYERVASHPLLKPIFPSDLTETARKQKQFLTQYLGGPPLYTE -3333--------------------33331111---3333---------1111--3333- EHGHPMLRARHLPFPITNERADAWLSCMKDAMDHVGLEGEIREFLFGRLELTARHMVNQ ----------3333---------------------------------------3333-- >FIBER PROTEIN; SWP:Q64823; PDB:1UXAA; YDTRTLWTTPDTSPNCTIAQDKDSKLTLVLTKCGSQILANVSLIVVAGKYHIINNKTNPK --------------------------------!!!!-----------1111------111 IKSFTIKLLFNKNGVLLDNSNLGKAYWNFRSGNSNVSTAYEKAIGFMPNLVAYPKPSNSK 1---------1111--1111----------!!!!--------3333-------------- KYARDIVYGTIYLGGKPDQPAVIKTTFNQETGCEYSITFNFSWSKTYENVEFETTSFTFS -3333----------1111----------------------------------------- YIAQE ----- >FRUCTOSE REPRESSOR; SWP:P0ACP1; PDB:1UXC; MKLDEIARLAGVSRTTASYVINGKAKQYRVSDKTVEKVMAVVREHNYHPN -----------------------3333----1111--------------- >MALATE DEHYDROGENASE; SWP:P80040; PDB:1UXJA; MRKKISIIGAGFVGSTTAHWLAAKELGDIVLLDIVEGVPQGKALDLYEASPIEGFDVRVT ---------------------1111-------------------------1111------ GTNNYADTANSDVIVVTSGAPLIKVNADITRACISQAAPLSPNAVIIMVNNPLDAMTYLA ---3333-----------------------------33331111---------------- AEVSGFPKERVIGQAGVLDAARYRTFIAMEAGVSVKDVQAMLMGGHGDEMVPLPRFSTIS ------3333-----------------------3333--------!!!!---3333--ii GIPVSEFIAPDRLAQIVERTRKGGGEIVNLLKTGSAYYAPAAATAQMVEAVLKDKKRVMP ii3333--------------------------------------------1111------ VAAYLTGQYGLNDIYFGVPVILGAGGVEKILELPLNEEEMALLNASAKAVRATLDTLKS ------2222------------1111--------------------------------- >YDEN PROTEIN; SWP:P96671; PDB:1UXOA; TKQVYIIHGYRASSTNHWFPWLKKRLLADGVQADILNPNPLQPRLEDWLDTLSLYQHTLH ------------1111----------1111--------3333------------3333-1 ENTYLVAHSLGCPAILRFLEHLQLRAALGGIILVSGFAKSLPTLQLDEFTQGSFDHQKII 111---------------1111------------------1111-3333----------1 ESAKHRAVIASKDDQIVPFSFSKDLAQQIDAALYEVQHGGHFLEDEGFTSLPIVYDVLTS 111-------1111---3333---------------------1111----3333----33 YFSK 33-- >GLYCERALDEHYDE-3-PHOSPHAT; SWP:O57693; PDB:1UXTA; AGLLEGVIKEKGGVPVYPSYLAGEWGGSGQEIEVKSPIDLATIAKVISPSREEVERTLDV 
----------iiii---------------------------------------------- LFKRGRWSARDMPGTERLAVLRKAADIIERNLDVFAEVLVMNAGKPKSAAVGEVKAAVDR --------11113333-------------------------------------------- LRLAELDLKKIGGDYIPGDWTYDTLETEGLVRREPLGVVAAITPFNYPLFDAVNKITYSF -----1111-----------3333-----------------------------------1 IYGNAVVVKPSISDPLPAAMAVKALLDAGFPPDAIALLNLPGKEAEKIVADDRVAAVSFT 111-------3333-----------1111-1111-------1111--1111--------- GSTEVGERVVKVGGVKQYVMELGGGDPAIVLEDADLDLAADKIARGIYSYAGQRCDAIKL ------------------------------11113333----------%%%%-1111--- VLAERPVYGKLVEEVAKRLSSLRVGDPRDPTVDVGPLISPSAVDEMMAAIEDAVEKGGRV ----1111-----------------3333------------------------1111--- LAGGRRLGPTYVQPTFVEAPADRVKDMVLYKREVFAPVALAVEVKDLDQAIELANGRPYG -------------------33331111--------------------------3333--- LDAAVFGRDVVKIRRAVRLLEVGAIYINDMPRHGIGYYPFGGRKKSGVFREGIGYAVEAV --------------------------------!!!!------!!!!--------3333-- TAYKTIVFNYKGKGVWKYE ---------2222------ >Endo-1,4-beta-xylanase; SWP:O52780; PDB:1UXXX; KIESEEYNSLKSSTIQTIGTSDGGSGIGYIESGDYLVFNKINFGNGANSFKARVASGADT --1111-------------1111-------2222-------------------------- PTNIQLRLGSPTGTLIGTLTVASTGGWNNYEEKSCSITNTTGQHDLYLVFSGPVNIDYFI ---------1111------------1111------------------------------- FDSNG ----- >URIDINE DIPHOSPHO-N-ACETY; SWP:P08373; PDB:1UXY; HSLKPWNTFGIDHNAQHIVCAEDEQQLLNAWQYATAEGQPVLILGEGSNVLFLEDYRGTV --33333333-------------------------------------------------- IINRIKGIEIHDEPDAWYLHVGAGENWHRLVKYTLQEGMPGLENLALIPGCVGSSPIQNI ------------1111-----1111---------1111---3333-----3333-1111- GAYGVELQRVCAYVDSVELATGKQVRLTAKECRFGYRDSIFKHEYQDRFAIVAVGLRLPK -iiii3333---------1111-----3333-------11111111-------------- EWQPVLTYGDLTRLDPTTVTPQQVFNAVCHMRTTKLPDPKVNGNAGAFFKNPVVSAETAK -------!!!!---3333-3333--------------1111------------------- ALLSQFPTAPNYPQADGSVKLAAGWLIDQCQLKGMQIGGAAVHRQQALVLINEDNAKSED -----1111----1111----------11112222-!!!!--1111----------3333 VVQLAHHVRQKVGEKFNVWLEPEVRFIGASGEVSAVETIS 
---------------------------1111--3333--- >CELLULASE B; SWP:O07653; PDB:1UXZA; MVIATIQAEDHSQQSGTQQETTTDTGGGKNVGYIDAGDWLSYAGTPVNIPSSGSYLIEYR ------1111-------------2222-------2222---3333--------------- VASQNGGGSLTFEEAGGAPVHGTIAIPATGGWQTWTTIQHTVNLSAGSHQFGIKANAGGW -------------2222------------------------------------------- NLNWIRINKTH ----------- >ENDO-1,4-BETA-XYLANASE A; SWP:Q8GJ44; PDB:1UY4A; SPIRRDAFSIIEAEEYNSTNSSTLQVIGTPNNGRGIGYIENGNTVTYSNIDFGSGATGFS -----------3333-------------1111-------2222--------!!!!----- ATVATEVNTSIQIRSDSPTGTLLGTLYVSSTGSWNTYNTVSTNISKITGVHDIVLVFSGP ----------------1111------------1111------------------------ VNVDNFIFSRSS ------------ >EPSILON-TOXIN; SWP:Q57398; PDB:1UYJA; SYDNVDTLIEKGRYNTKYNYLKRMEKYYPNAMAYFDKVTINPQGNDFYINNPKVELDGEP ----3333--------------3333---33331111----------------------- SMNYLEDVYVGKALLTNDTQQEQKLKSQSFTCKNTDTVTATTTHTVGTSIQATAKFTVPF ------------------------------------------------------------ NETGVSLTTSYSFANTNTNTNSKEITHNVPSQDILVPANTTVEVIAYLKKVNVKGNVKLV ------------------------------------------------------------ GQVSGSEWGEIPSYLAFPRDGYKFSLSDTVNKSDLNEDGTININGKGNYSAVMGDELIVK ------------------------3333--3333-2222--------------------- VRNLNTNNVQEYVIPVDSNIVKYRSLSIKAPGI --------------------------------- >HEAT SHOCK PROTEIN HSP 90; SWP:P07900; PDB:1UYLA; VETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNSSDALDKIRYESLTDPSKLDSGKEL ---------------------------------------------3333--1111----- HINLIPNKQDRTLTIVDTGIGMTKADLINNLGTIAKSGTKAFMEALQAGADISMIGQFGV -------1111-----------------%%%%------------------3333-11113 GFYSAYLVAEKVTVITKHNDDEQYAWESSAGGSFTVRTDTGEPMGRGTKVILHLKEDQTE 3331111----------1111---------------------------------1111-- YLEERRIKEIVKKHSQFIGYPITLFVEK ---------------------------- >HEAT SHOCK PROTEIN HSP 90; SWP:P08238; PDB:1UYMA; EVETFAFQAEIAQLMSLIINTFYSNKEIFLRELISNASDALDKIRYESLTDPSKLDSGKE ------------------------3333-----------------1111---1111---- LKIDIIPNPQERTLTLVDTGIGMTKADLINNLGTIAKSGTKAFMEALQAGADISMIGQFG 
----------------------------------------------2222-33331111- VGFYSAYLVAEKVVVITKHNDDEQYAWESSAGGSFTVRADHGEPIGRGTKVILHLKEDQT 33331111----------1111--------iiii---------------------11111 EYLEERRVKEVVKKHSQFIGYPITLYLEKER 1113333--------1111------------ >ACETYL-COA CARBOXYLASE; SWP:Q00955; PDB:1UYRA; PIATPYPVKEWLQPKRYKAHLMGTTYVYDFPELFRQASSSQWKNFSADVKLTDDFFISNE -------3333--------------1111----------------1111--1111----- LIEDENGELTEVEREPGANAIGMVAFKITVKTPEYPRGRQFVVVANDITFKIGSFGPQED ---1111------------------------3333-----------1111%%%%------ EFFNKVTEYARKRGIPRIYLAANSGARIGMAEEIVPLFQVAWNDAANPDKGFQYLYLTSE ------------------------------1111---------11111111--------- GMETLKKFDKENSVLTERTVINGEERFVIKTIIGSEDGLGVECLRGSGLIAGATSRAYHD -----1111----------------------------------------------3333- IFTITLVTCRSVGIGAYLVRLGQRAIQVEGQPIILTGAPAINKMLGREVYTSNLQLGGTQ -----------!!!!----1111----2222---------------------1111-333 IMYNNGVSHLTAVDDLAGVEKIVEWMSYVPAKRNMPVPILETKDTWDRPVDFTPTNDETY 3-1111-------------------1111--2222------------------------- DVRWMIEGRETESGFEYGLFDKGSFFETLSGWAKGVVVGRARLGGIPLGVIGVETRTVEN 3333------1111------2222----11113333------iiii-------------- LIPADPANPNSAETLIQEPGQVWHPNSAFKTAQAINDFNNGEQLPMMILANWRGFSGNEV ----1111-------------------------------------------------333 LKYGSFIVDALVDYKQPIIIYIPPTGELRGGSWVVVDPTINADQMEMYADVNARAGVLEP 3---------1111--------2222---3333---33333333---------------- QGMVGIKFRREKLLDTMNRLELLPIYGQISLQFADLHDRSSRMVAKGVISKELEWTEARR ------------------------------------------------------1111-- FFFWRLRRRLNEEYLIKRLSHQVGEASRLEKIARIRSWYPASVDHEDDRQVATWIEENYK -----------------------------------333311111111------------- TLDDKLKGLKLESFAQDLAKKIRSDHDNAIDGLSEVIK -------------------------------------- >FAB ANTIBODY LIGHT CHAIN; SWP:NA; PDB:1UYWH; VQLQQSGPELVKPGTSVKISCKTSGYTFTEYTIHWVKEAGGKSLAWIGGIDPNSGGTNYS -----------2222-----------1111-----------------------------3 PNFKGKATLTVDKSSSTAYMDLRSL 333---------1111--------- >EMSY PROTEIN; SWP:Q7Z589; PDB:1UZ3A; 
GSMPVVWPTLLDLSRDECKRILRKLELEAYAGVISALRAQGDLTKEKKDLLGELSKVLSI -------3333--------------------------3333------------------- STERHRAEVRRAVNDERLTTIAHNMSGPNSSSEWSIEGRRLV ------------------------------------------ >402AA LONG HYPOTHETICAL M; SWP:O58335; PDB:1UZ5A; KVVPLEKALEVVQSFKISPGIEEVPIEKGLGRIAAEDIYSPIDVPPFDRATVDGYAVRAE ------------------------11112222-------------------------333 DTFMASEASPVRLKVIGSVHAGEEPKFKLGKGEAAYISTGAMLPGNADAVIQFEDVERVN 311113333----------2222------2222----2222--2222----3333---%% GEILIYKPAYPGLGVMKKGIDIEKGRLLVKKGERLGFKQTALLSAVGINKVKVFRKPKVA %%-------2222---2222--2222---2222----------1111------------- VISTGNEIVPPGNELKPGQIYDINGRALCDAINELGGEGIFMGVARDDKESLKALIEKAV ----1111-------2222----------------------------------------- NVGDVVVISGGADLTASVIEELGEVKVHGIAIQPGKPTIIGVIKGKPVFGLPGYPTSCLT ----------------------------------1111----iiii-------------- NFTLLVVPLLLRALGREGKIGKKVARLKHKVFSVRRQFLPVKLEGDLAVPILKGSGAVTS -----------1111-------------------------------------3333---- FIDADGFVEIPETVESLDEGEEVEVTLFKGW ------------------------------- >IGG FAB (IGG3,KAPPA) LIGH; SWP:Q5XKG4; PDB:1UZ8A; DIVMTQAAFSNPVTLGTSASISCRSSKSLLYSNGITYLYWYLQKPGQSPQLLIYQMSNLA -------------2222-------------1111---------2222------------2 SGVPDRFSSSGSGTDFTLRISRVEAEDVGVYYCAQNLEVPWTFGGGTKLEIKRADAAPTV 2223333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ----------------------------------iiii--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNE ------3333------------3333--------1111 >Ig heavy chain V region 4; SWP:P01806; PDB:1UZ8B; EVKLLESGGGLVQPGGSQKLSCAASGFDFSGYWMSWVRQAPGKGLEWIGEINPDSSTINY ------------2222-----------3333--------2222--------1111----- TPSLKDKFIISRDNAKNTLYLQMSKVRSEDTALYYCARETGTRFDYWGQGTTLTVSSATT 1111--------3333----------3333--------2222------------------ TAPSVYPLVPGGSSVTLGCLVKGYFPEPVTVKWNYGALSSGVRTVSSVLQSGFYSLSSLV --------------------------------%%%%--2222-------iiii------- TVPSSTWPSQTVICNVAHPASKTELIKRIEPR 
--1111------------1111---------- >HYPOTHETICAL PROTEIN FLJ2; SWP:O75400; PDB:1UZCA; QPAKKTYTWNTKEEAKQAFKELLKEKRVPSNASWEQAMKMIINDPRYSALAKLSEKKQAF ----------------------------1111--------3333---------------- NAYKVQTEK ---1111-- >Ribulose bisphosphate car; SWP:Q43832; PDB:1UZDC; MMVWTPVNNKMFETFSYLPPLTDEQIAAQVDYIVANGWIPCLEFATDHGFVYREHHNSPG ------------2222-----------------1111--------------------222 YYDGRYWTMWKLPMFGCRDPMQVLREIVACTKAFPDAYVRLVAFDNQKQVQIMGFLVQRP 2------------2222-3333-----------1111--------1111----------- KQPANKRSV --1111--- >ANGIOTENSIN CONVERTING EN; SWP:P22966; PDB:1UZEA; DEAEASKFVEEYDRTSQVVWNEYAEANWNYNTNITTETSKILLQKNMQIANHTLKYGTQA -----------------------------------------------------------1 RKFDVNQLQNTTIKRIIKKVQDLERAALPAQELEEYNKILLDMETTYSVATVCHPNGSCL 1113333---------------!!!!---------------------------------- QLEPDLTNVMATSRKYEDLLWAWEGWRDKAGRAILQFYPKYVELINQAARLNGYVDAGDS ------------------------------33331111-----------1111------- WRSMYETPSLEQDLERLFQELQPLYLNLHAYVRRALHRHYGAQHINLEGPIPAHLLGNMW --11111111------------------------------1111-------1111--111 AQTWSNIYDLVVPFPSAPSMDTTEAMLKQGWTPRRMFKEADDFFTSLGLLPVPPEFWNKS 1--11111111--3333--------------------------3333-----3333---- MLEKPTDGREVVCHASAWDFYNGKDFRIKQCTTVNLEDLVVAHHEMGHIQYFMQYKDLPV ------------------------------------------------------111133 ALREGANPGFHEAIGDVLALSVSTPKHLHSLNLLSSDEHDINFLMKMALDKIAFIPFSYL 33----3333---3333-----------1111-------------------3333----- VDQWRWRVFDGSITKENYNQEWWSLRLKYQGLCPPVPRTQGDFDPGAKFHIPSSVPYIRY -------1111--3333---------------------22223333----1111------ FVSFIIQFQFHEALCQAAGHTGPLHKCDIYQSKEAGQRLATAMKLGFSRPWPEAMQLITG ---------------1111---3333--2222----------3333-------------- QPNMSASAMLSYFKPLLDWLRTENELHGEKLGWPQ ------------------------1111------- >MAJOR ENVELOPE PROTEIN E; SWP:P27915; PDB:1UZGA; MRCVGVGNRDFVEGLSGATWVDVVLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRKLC 3333---------------------2222-----2222---------------------- IEGKITNITTDSRCPTQGEAILPEEQDQNYVCKHTYVDRGWGNGCGLFGKGSLVTCAKFQ 
---------------------3333----------------------------------- CLESIEGKIVQHENLKYTVIITVHTGDQHQVGNETQGVTAEITSQASTAEAILPEYGTLG --------------------------1111------------1111------2222---- LECSPRTGLDFNEMILLTMKDKAWMVHRQWFFDLPLPWTSGATTKTPTWNRKELLVTFKN ------------------!!!!----------------------------3333------ AHAKKQEVVVLGSQEGAMHTALTGATEIQTSGGTSIFAGHLKCRLKMDKLKLKGMSYAMC ------------------------------------------------------------ LNTFVLKKEVSETQHGTILIKVEYKGEDAPCKIPFSTEDGQGKAHNGRLITANPVVTKKE ---------------------------------------------------------111 EPVNIEAEPPFGESNIVIGIGDKALKINWYRK 1-------------------2222-------- >Ribulose bisphosphate car; SWP:P04716; PDB:1UZHC; MMVWTPVNNKMFETFSYLPPLTDEQIAAQVDYIVANGWIPCLEFAEHSNPEEFYWTMWKL ------------2222-----------------1111-----------3333-------- PMFGCRDPMQVLREIVACTKAFPDAYVRLVAFDNQKQVQIMGFLVQRPKTARDFQPANKR -2222-3333-----------1111----------------------1111----1111- SV -- >FIBRILLIN-1; SWP:P35555; PDB:1UZJA; TDVNECLDPTTCISGNCVNTPGSYICDCPPDFELNPTRVGCVDTRSGNCYLDIRPRGDNG -------11112222-------------2222--1111---------------------- DTACSNEIGVGVSKASCCCSLGKAWGTPCEMCPAVNTSEYKILCPGGEGFRPNPITVILE ------------3333-----------------1111------1111------------- DIDECQELPGLCQGGKCINTFGSFQCRCPTGYYLNEDTRVCD --3333-----2222----2222-----2222---------- >FIBRILLIN-1; SWP:P35555; PDB:1UZKA; TDVNECLDPTTCISGNCVNTPGSYICDCPPDFELNPTRVGCVDTRSGNCYLDICSNEIGV ---333333332222-------------2222--1111---------------------- GVSKASCCCSLGKAWGTPCECPAVNTSEYKILCPGGEGFRPNPITVILEDIDECQELPGL --3333--------------------------1111---------------3333----- CQGGKCINTFGSFQCRCPTGYYLNEDTRVCD 2222----2222-----2222---------- >3-OXOACYL-[ACYL-CARRIER P; SWP:P0A5Y5; PDB:1UZMA; AKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSGAPKGLFGVEVDVTDSDAV ---------------------------3333------------2222------------- DRAFTAVEEHQGPVEVLVSNAGLSARMTEEKFEKVINANLTGAFRVAQRASRSMQRNKFG ------------------------------------------------------------ RMIFIGSVSGNQANYAASKAGVIGMARSIARELSKANVTANVVAPGYIDTDMTRALDERI 
--------------------------------3333-------------3333---3333 QQGALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVDGGMGM 1111----------3333---------1111----------iiii-- >RIBONUCLEOTIDE REDUCTASE ; SWP:Q50549; PDB:1UZRA; RVSAINWNRLQDEKDAEVWDRLTGNFWLPEKVPVSNDIPSWGTLTAGEKQLTMRVFTGLT -----1111-------------1111------3333--3333------------------ MLDTIQGTVGAVSLIPDALTPHEEAVLTNIAFMESVHAKSYSQIFSTLCSTAEIDDAFRW -------------3333------------------------------------------- SEENRNLQRKAEIVLQSYRGDEPLKRKVASTLLESFLFYSGFYLPMYWSSRAKLTNTADM ------------------------------------------------1111-------- IRLIIRDEAVHGYYIGYKFQRGLALVDDVTRAELKDYTYELLFELYDNEVEYTQDLYDEV --------------------3333---------------------------------111 GLTEDVKKFLRYNANKALMNLGYEALFPRDETDVNPAILSAL 1-----------------1111-----3333---33333333 >PSEUDOMONAS AERUGINOSA LE; SWP:Q9HYN5; PDB:1UZVA; ATQGVFTLPANTRFGVTAFANSSGTQTVNVLVNNETAATFSGQSTNNAVIGTQVLNSGSS -------------------------------%%%%------------------------- GKVQVQVSVNGRPSDLVSAQVILTNELNFALVGSEDGTDNDYNDAVVVINWPLG --------iiii----------%%%%---------------------------- >UBIQUITIN; SWP:P25604; PDB:1UZXA; SVPEAVVNWLFKVIQPIYNDGRTTFHDSLALLDNFHSLRPRTRVFTHSDGTPQLLLSIYG -------------1111-----------------1111--------1111---------- TISTGSIPVIWVPSYPVKPPFISINLENFDNTLPIQEYIDSNGWIALPILHCWDPAANLI -------------------------2222---3333---1111---3333---------- VVQELSLLHEPPQDQ -----1111------ >LECTIN (ECL); SWP:Q6YD91; PDB:1V00A; VETISFSFSEFEPGNNDLTLQGAAIITQSGVLQLTKINQNGMPAWDSTGRTLYTKPVHIW -----------2222-----------1111-------1111------------------- DMTTGTVASFETRFSFSIEQPYTRPLPADGLVFFMGPTKSKPAQGYGYLGVFNNSKQDNS -1111---------------------------------------!!!!---------333 YQTLAVEFDTFSNPWDPPQVPHIGIDVNSIRSIKTQPFQLDNGQVANVVIKYDASSKILL 3-----------1111------------------------2222---------1111--- AVLVYPSSGAIYTIAEIVDVKQVLPEWVDVGLSGATGAQRDAAETHDVYSWSFHASLPET -----1111---------3333----------------2222------------------ >DHURRINASE; SWP:Q41290; PDB:1V02A; RLSPWEIPRRDWFPPSFLFGAATSAYQIEGAWNEDGKGPSTWDHFCHNFPEWIVDRSNGD 
--1111--1111-1111------3333------iiii-----------33331111---- VAADSYHMYAEDVRLLKEMGMDAYRFSISWPRILPKGTLAGGINEKRVEYYNKLIDLLLE !!!!------------1111--------1111-11113333-----------------11 NGIEPYITIFHWDTPQALVDAYGGFLDERIIKDYTDFAKVCFEKFGKTVKNWLTFNEPET 11------------3333----!!!!3333---------------1111----------- FCSVSYGTGVLAPGRCSPGVSCAVPTGNSLSEPYIVAHNLLRAHAETVDIYNKYHKGADG ----------------2222-------3333------------------------!!!!- RIGLALNVFGRVPYTNTFLDQQAQERSMDKCLGWFLEPVVRGDYPFSMRVSARDRVPYFK --------------------------------------------3333---!!!!----- EKEQEKLVGSYDMIGINYYTSTFSKHIDLSPNNSPVLNTDDAYASQETKGPDGNAIGPPT ------2222-------------------1111---3333---------1111------- GNAWINMYPKGLHDILMTMKNKYGNPPMYITENGMGDIDKGDLPKPVALEDHTRLDYIQR -------3333------------------------------------------------- HLSVLKQSIDLGADVRGYFAWSLLDNFEWSSGYTERFGIVYVDRENGCERTMKRSARWLQ --------1111---------------!!!!--------------%%%%----------- EFNG ---- >SERUM PARAOXONASE/ARYLEST; SWP:P27170; PDB:1V04A; LFDRQKSSFQTRFNVHREVTPVELPNCNLVKGIDNGSEDLEILPNGLAFISSGLKYDKSG -------------1111------------2222---------1111-------------- KILLMDLNEKEPAVSELEIIGNTLDISSFNPHGISTFIDDDNTVYLLVVNHPGSSSTVEV ------------------------3333----------1111---------!!!!----- FKFQEEEKSLLHLKTIRHKLLPSVNDIVAVGPEHFYATNDHYFIDPYLKSWEMHLGLAWS ----1111---------1111-------------------------------1111---- FVTYYSPNDVRVVAEGFDFANGINISPDGKYVYIAELLAHKIHVYEKHANWTLTPLRVLS -----3333----------------1111-------1111-------1111--------- FDTLVDNISVDPVTGDLWVGCHPNGMRIFFYDAENPPGSEVLRIQDILSEEPKVTVVYAE -----------------------3333----1111----------1111----------- NGTVLQGSTVAAVYKGKLLIGTVFHKALYCDL -------------iiii--------------- >FILAMIN C; SWP:Q14315; PDB:1V05A; SDASKVVTRGPGLSQAFVGQKNSFTVDCSKAGNMMMVGVHGPKTPCEEVYVKHMGNRVYN -3333----3333---2222-------1111-----------------------%%%%-- VTYTVKEKGDYILIVKWGDESVPGSPFKVKVP ----------------!!!!-2222------- >HBP1; SWP:Q8R316; PDB:1V06A; PSTIWHCFLKGTRLCFHKESNKEWQDVEDFARAASCDNEEEIQMGTHKGYGSDGLKLLSH 
--------2222---3333------3333--1111---1111------2222-------- EESVSFGESVLKLTFDPGTVEDGLLTVECKLDHPFYVKNKGWSSFYPSLTVVQHGIPCCE ----iiii----------3333-------1111--------------------------- IHIGDVCLPPGHPDAINF -----------1111--- >BETA-GLUCOSIDASE; SWP:P49235; PDB:1V08A; MLSPSEIPQRDWFPSDFTFGAATSAYQIEGAWNEDGKGESNWDHFCHNHPERILDGSNSD --1111--1111-1111------3333------iiii-----------33331111---- IGANSYHMYKTDVRLLKEMGMDAYRFSISWPRILPKGTKEGGINPDGIKYYRNLINLLLE !!!!------------1111--------3333-11113333-----------------11 NGIEPYVTIFHWDVPQALEEKYGGFLDKSHKSIVEDYTYFAKVCFDNFGDKVKNWLTFND 11--------------------!!!!3333-----------------3333--------- PQTFTSFSYGTGVFAPGRCSPGLDCAYPTGNSLVEPYTAGHNILLAHAEAVDLYNKHYKR -------------------2222-------1111-------------------------1 DDTRIGLAFDVMGRVPYGTSFLDKQAEERSWDINLGWFLEPVVRGDYPFSMRSLARERLP 111---------------------------------------------------!!!!-- FFKDEQKEKLAGSYNMLGLNYYTSRFSKNIDISPNYSPVLNTDDAYASQEVNGPDGKPIG ---------2222--------------------------3333---------1111---- PPMGNPWIYMYPEGLKDLLMIMKNKYGNPPIYITENGIGDVDTKETPLPMEAALNDYKRL ----------3333----------------------------3333-------------- DYIQRHIATLKESIDLGSNVQGYFAWSLLDNFEWFAGFTERYGIVYVDRNNNCTRYMKES --------------------------------!!!!------------------------ AKWLKEFNTAK ----------- >ENDOGLUCANASE H; SWP:P16218; PDB:1V0AA; SAVGEKLDDFEGVLNWGSYSGEGAKVSTKIVSGKTGNGEVSYTGTTDGYWGTVYSLPDGD --------------------------------------------1111------------ WSKWLKISFDIKSVANEIRFIAEKSINGVGDGEHWVYSITPDSSWKTIEIPFSSFRRRLD 1111--------------------1111----------------------3333------ YQPPGQDSGTLDLDNIDSIHFYANNKSGKFVVDNIKLIGALEHHH --1111-----1111------------------------------ >DNA FRAGMENTATION FACTOR ; SWP:O54788; PDB:1V0DA; VSDITRFLSVFNEPHAGVIQAARQQLSDEQAPLRQKLLADLLHHVSQNITAETREQDPSW --3333----------3333---------------------------3333-3333-333 FEGLESRFRNKSGYLRYSCESRIRGYLREVSAYTSMVDEAAQEEYLRVLGSMCQKLKSVQ 3---3333---------------------333311113333------------------- YNGSYFDRGAEASSRLCTPEGWFSCQGPFDLESCLSKHSINPYGNRESRILFSTWNLDHI 
--3333----1111---1111------1111--1111---1111------3333------ IEKKRTVVPTLAEAIQDGREVNWEYFYSLLFTAENLKLVHIACHKKTTHKLECDRSRIYR -------------1111--------------3333----1111----------1111--- PQTGS ----- >ENDO-ALPHA-SIALIDASE; SWP:Q04830; PDB:1V0EA; SAKGDGVTDDTAALTSALNDTPVGQKINGNGKTYKVTSLPDISRFINTRFVYERIPGQPL ---------------------1111---iiii--------3333----------2222-- YYASEEFVQGELFKITDTPYYNAWPQDKAFVYENVIYAPYMGSDRHGVSRLHVSWVKSGD ---2222------------------------iiii------------2222--------i DGQTWSTPEWLTDLHPDYPTVNYHCMSMGVCRNRLFAMIETRTLAKNALTNCALWDRPMS iii-----------1111------------%%%%-------------------------- RSLHLTGGITKAANQRYATIHVPDHGLFVGDFVNFSNSAVTGVSGDMTVATVIDKDNFTV ---------------------------2222--------2222----------------- LTPNQQTSDLNNAGKNWHMGTSFHKSPWRKTDLGLIPSVTEVHSFATIDNNGFAMGYHQG -----------2222------3333-----------------------1111-------- DVAPREVGLFYFPDAFNSPSNYVRRQIPSEYEPDASEPCIKYYDGVLYLITRGTRGDRLG -----------------1111------3333-----------iiii--------1111-- SSLHRSRDIGQTWESLRFPHNVHHTTLPFAKVGDDLIMFGSERAENEWEAGAPDDRYKAS -------iiii--------------------!!!!--------2222-2222-------- YPRTFYARLNVNNWNADDIEWVNITDQIYQGGIVNSGVGVGSVVVKDNYIYYMFGGEDHF ---------3333--------------------------------!!!!----------- NPWTYGDNSAKDPFKSDGHPSDLYCYKMKIGPDNRVSRDFRYGAVPNRAVPVFFDTNGVR -------33331111---------------------------------------1111-- TVPAPMEFTGDLGLGHVTIRASTSSNIRSEVLMEGEYGFIGKSIPTDNPAGQRIIFCGGE -----------------------%%%%--------------------3333--------- GTSSTTGAQITLYGANNTDSRRIVYNGDEHLFQSADVKPYNDNVTALGGPSNRFTTAYLG --3333------------2222--------------------------1111-------- SNPIVT ------ >UDP-GALACTOPYRANOSE MUTAS; SWP:O06934; PDB:1V0JA; MTARFDLFVVGSGFFGLTIAERVATQLDKRVLVLERRPHIGGNAYSEAEPQTGIEVHKYG ------------3333------------------------!!!!---------------- AHLFHTSNKRVWDYVRQFTDFTDYRHRVFAMHNGQAYQFPMGLGLVSQFFGKYFTPEQAR --------------3333-------------iiii------------------------- QLIAEQAAEIDTADEEKAISLIGRPLYEAFVKGYTAKQWQTDPKELPAANITRLPVRYTF 
-----1111-3333-3333----------------------3333-1111---------- DNRYFSDTYEGLPTDGYTAWLQNMAADHRIEVRLNTDWFDVRGQLRPGSPAAPVVYTGPL ------------1111------11111111------3333333333331111-------- DRYFDYAEGRLGWRTLDFEVEVLPIGDFQGTAVMNYNDLDVPYTRIHEFRHFHPERDYPT -----1111----------------------------1111------3333-3333---- DKTVIMREYSRFAEDDDEPYYPINTEADRALLATYRARAKSETASSKVLFGGRLGTYQYL ------------------------------------------------------------ DMHMAIASALNMYDNVLAPHLRDGVPLL ---------------------------- >ENDO-1,4-BETA-XYLANASE A; SWP:P26514; PDB:1V0LA; ESTLGAAAAQSGRYFGTAIASGRLSDSTYTSIAGREFNMVTAENEMKIDATEPQRGQFNF --------1111-------3333--------------------11113333--2222--- SSADRVYNWAVQNGKQVRGHTLAWHSQQPGWMQSLSGSALRQAMIDHINGVMAHYKGKIV ----------1111--------------3333--------------------1111---- QWDVVNEAFADGSSGARRDSNLQRSGNDWIEVAFRTARAADPSAKLCYNDYNVENWTWAK -------------------3333--1111-----------3333----------1111-- TQAMYNMVRDFKQRGVPIDCVGFQSHFNSGSPYNSNFRTTLQNFAALGVDVAITELDIQG ---------------------------1111-------------1111---------222 APASTYANVTNDCLAVSRCLGITVWGVRDSDSWRSEQTPLLFNNDGSKKAAYTAVLDALN 2--------------1111--------3333--3333-----1111-----------111 GG 1- >PHOSPHOLIPASE D; SWP:P84147; PDB:1V0WA; SATPHLDAVEQTLRQVSPGLEGDVWERTSGNKLDGSAADPSDWLLQTPGCWGDDKCADRV ----------------1111---------------3333----------2222------- GTKRLLAKMTENIGNATRTVDISTLAPFPNGAFQDAIVAGLKESAAKGNKLKVRILVGAA ------------1111----------------------------1111------------ PHMNVIPSKYRDELTAKLGKAAENITLNVASMTTSKTAFSWNHSKILVVDGQSALTGGIN ---------------------1111----------1111---------%%%%-------- SWKDDYLDTTHPVSDVDLALTGPAAGSAGRYLDTLWTWTCQNKSNIASVWFAASGNAGCM -3333----------------------------------1111-2222-----!!!!--- PTMHKDTNPKASPATGNVPVIAVGGLGVGIKDVDPKSTFRPDLPTASDTKCVVGLHDNTN -3333----------------------------1111-------------------3333 ADRDYDTVNPEESALRALVASAKGHIEISQQDLNATCPPLPRYDIRLYDALAAKMAAGVK -3333-------------1111--------------------------------1111-- VRIVVSDPANRGYSQIKSLSEISDTLRNRLANITGGQQAAKTAMCSNLQLATFRSSPNGK 
------3333------------------3333---------------------------- WADGHPYAQHHKLVSVDSSTFYIGSKNLYPSWLQDFGYIVESPEAAKQLDAKLLDPQWKY 1111-----------%%%%----------------------------------------- SQETATVDYARGICNA -1111----------- >NEURAMINIDASE; SWP:Q6XV27; PDB:1V0ZA; RTFLNLTKPLCEVNSWHILSKDNAIRIGEDAHILVTREPYLSCDPQGCRMFALSQGTTLR -------------------------------------------1111----------111 GRHANGTIHDRSPFRALISWEMGQAPSPYNTRVECIGWSSTSCHDGMSRMSICMSGPNNN 13333---------------------1111--------------------------1111 ASAVVWYGGRPITEIPSWAGNILRTQESECVCHKGVCPVVMTDGPANNRAATKIIYFKEG ------iiii----------------------iiii---------------------iii KIQKIEELAGNAQHIEECSCYGAGGVIKCICRDNWKGANRPVITIDPEMMTHTSKYLCSK i---------------------iiii---------------------------------- VLTDTSRPNDPTNGNCDAPITGGSPDPGVKGFAFLDGENSWLGRTISKDSRSGYEMLKVP -----------------------------------!!!!-------------------22 NAETDIQSGPISNQVIVNNQNWSGYSGAFIDYWANKECFNPCFYVELIRGRPKESSVLWT 22--1111----------------------1111-------------------1111--- SNSIVALCGSKKRLGSWSWHDGAEIIYFE -----------------------3333-- >LACCASE; SWP:Q6H9H7; PDB:1V10A; ATVALDLHILNANLDPDGTGARSAVTAEGTTIAPLITGNIDDRFQINVIDQLTDANMRRA --------------------------------------2222-----------1111--- TSIHWHGFFQAGTTEMDGPAFVNQCPIIPNESFVYDFVVPGQAGTYWYHSHLSTQYCDGL ----2222-22221111----------2222----------------------3333--- RGAFVVYDPNDPHLSLYDVDDASTVITIADWYHSLSTKAPPAPDTTLINGLGRNSANPSA -------1111-3333----1111-----------------------iiii-----1111 GQLAVVSVQSGKRYRFRIVSTSCFPNYAFSIDGHRMTVIEVDGVSHQPLTVDSLTIFAGQ --------2222------------------2222------iiii------------2222 RYSVVVEANQAVGNYWIRANPSNGRNGFTGGINSAIFRYQGAAVAEPTTSQNSGTALNEA ---------------------------2222-------2222---------------111 NLIPLINPGAPGNPVPGGADINLNLRIGRNATTADFTINGAPFIPPTVPVLLQILSGVTN 1-------------2222------------------------------3333-3333--3 PNDLLPGGAVISLPANQVIEISIPGGGNHPFHLHGHNFDVVRTPGSSVYNYVNPVRRDVV 333--2222--------------------------------------------------- SIGGGGDNVTFRFVTDNPGPWFLHCHIDWHLEAGLAVVFAEDIPNIPIANAISPAWDDLC 
--------------------------11111111-------3333-------3333---1 PKYNANN 111---- >Adenomatous polyposis col; SWP:P25054; PDB:1V18B; ADTLLHFATESTPDGLALLDEPFIQKDVELRIMPPV --------------------------3333------ >2-KETO-3-DEOXYGLUCONATE K; SWP:Q746L7; PDB:1V1AA; MLEVVTAGEPLVALVPQEPGHLRGKRLLEVYVGGAEVNVAVALARLGVKVGFVGRVGEDE --------------------3333------------------------------------ LGAMVEERLRAEGVDLTHFRRAPGFTGLYLREYLPLGQGRVFYYRKGSAGSALAPGAFDP ---------------1111-------------------------2222-----2222-33 DYLEGVRFLHLSGITPALSPEARAFSLWAMEEAKRRGVRVSLDVNYRQTLWSPEEARGFL 332222-------3333---------------3333----------3333---------- ERALPGVDLLFLSEEEAELLFGRVEEALRALSAPEVVLKRGAKGAWAFVDGRRVEGSAFA --3333----------------3333--------------1111---------------- VEAVDPVGAGDAFAAGYLAGAVWGLPVEERLRLANLLGASVAASRGDHEGAPYREDLEVL -----2222-----------1111-3333------------------1111--3333--- LK -- >OBSCURIN; SWP:Q5VST9; PDB:1V1CA; IFDIYVVTADYLPLGAEQDAITLREGQYVEVLDAAHPLRWLVRTKPTKSSPSRQGWVSPA -----------------------------------3333------------------111 YLDRRLKL 1------- >CALCINEURIN B-LIKE PROTEI; SWP:O81223; PDB:1V1GA; RPPGYEDPELLASVTPFTVEEVEALYELFKKLSSSIIDDGLIHKEEFQLALFRNRNRRNL ----------1111----------------1111--------3333------------11 FADRIFDVFDVKRNGVIEFGEFVRSLGVFHPSAPVHEKVKFAFKLYDLRQTGFIEREELK 11---------------------------1111---------------------3333-- EMVVALLHESELVLSEDMIEVMVDKAFVQADRKNDGKIDIDEWKDFVSLNPSLIKNMTLP -------1111---1111-------------------------------11111111-33 YLKDINRT 33------ >FIBRITIN, FIBER PROTEIN; SWP:P10104; PDB:1V1HA; VSIKKSSGLNFDNTAIAINAGKGLEFDTNTSESPDINPIKTKIGSGIDYNENGAMITKLG ---1111----!!!!-----2222-----1111----------2222--1111------2 AGLSFDNSGAITIGGSGYIPEAPRDGQAYVRKDGEWVLLSTFL 222--1111----------------------%%%%--3333-- >Exotoxin 1; SWP:Q9ZFS5; PDB:1V1PB; HDIRDLHRYYSSESFEYSNVSGKVENYNGSNVVRFNPKDQNHQLFLLGKDKEQYKEGLQG ------------------------------------2222-------1111--1111--- QNVFVVQELIDPNGRLSTVGGVTKKNNKTSETNTPLFVNKVNGEDLDASIDSFLIQKEEI 
----------1111---------------------------!!!!--------------- SLKELDFKIRQQLVNNYGLYKGTSKYGKIIINLKDENKVEIDLGDKLQFERMGDVLNSKD --------------------!!!!-----------------3333--1111-----3333 IRGISVTINQI ----------- >LONG-CHAIN-FATTY-ACID-COA; SWP:Q6L8F0; PDB:1V25A; AFPSTMMDEELNLWDFLERAAALFGRKEVVSRLHTGEVHRTTYAEVYQRARRLMGGLRAL -----------3333--------1111-----1111---------------------111 GVGVGDRVATLGFNHFRHLEAYFAVPGMGAVLHTANPRLSPKEIAYILNHAEDKVLLFDP 1-2222-------------------1111------1111-------------------11 NLLPLVEAIRGELKTVQHFVVMDEKAPEGYLAYEEALGEEADPVRVPERAACGMAYTTGT 11------3333--------------2222-3333-----------1111---------- TGLPKGVVYSHRALVLHSLAASLVDGTALSEKDVVLPVVPMFHVNAWCLPYAATLVGAKQ ----------------------1111---1111----------%%%%------------- VLPGPRLDPASLVELFDGEGVTFTAGVPTVWLALADYLESTGHRLKTLRRLVVGGSAAPR ----------------1111------3333----------------------------33 SLIARFERMGVEVRQGYGLTETSPVVVQNFVKSHLESLSEEEKLTLKAKTGLPIPLVRLR 33----1111--------3333---------1111----------1111----2222--- VADEEGRPVPKDGKALGEVQLKGPWITGGYYGNEEATRSALTPDGFFRTGDIAVWDEEGY --1111----------------1111------33331111-1111----------1111- VEIKDRLKDLIKSGGEWISSVDLENAAVVAIPHPKWQERPLAVVGFAKWQLPDAYLKRAL ------------iiii--3333------------------------3333---------3 REQYKNYYGGA 333--1111-- >Regulating synaptic membr; SWP:Q9UQ26; PDB:1V27A; GSSGSSGGQLSIKLWFDKVGHQLIVTILGAKDLPSREDGRPRNPYVKIYFLPDRSDKNKR ------------------------------------------------------------ RTKTVKKTLEPKWNQTFIYSPVHRREFRERMLEITLWDQARVREEESEFLGEILIELETA ----------------------33331111-------------------------3333- LLDDEPHWYKLQTHDSGPSSG --------------------- >NITRILE HYDRATASE A CHAIN; SWP:Q84FS5; PDB:1V29A; DPRFPHHHPRPQSFWEARAKALESLLIEKRLLSSDAIERVIKHYEHELGPMNGAKVVAKA 11113333------------------1111--3333------------------------ WTDPEFKQRLLEDPETVLRELGYFGLQGEHIRVVENTDTVHNVVVCTLCSCYPWPLLGLP ------------------------2222--------1111-----3333---3333---- PSWYKEPAYRSRVVKEPRKVLQEFGLDLPDSVEIRVWDSSSEVRFMVLPQRPEGTEGMTE 
3333-------1111------1111---1111---------------------------- EELAQIVTRDSMIGVAKVQPPKV --3333-3333------------ >NITRILE HYDRATASE A CHAIN; SWP:NA; PDB:1V29B; MNGIHDVGGMDGFGKIMYVKEEEDTYFKHDWERLTFGLVAGCMAQGLGMKAFDEFRIGIE --33332222---------1111-----3333-------------------3333---11 KMRPVDYLTSSYYGHWIATVAYNLLETGVLDEKELEDRTQAFMEKPDTKIQRWENPKLVK 11--------1111------------------------------1111------------ VVEKALLEGLSPVREVSSFPRFEVGERIKTRNIHPTGHTRFPRYVRDKYGVIEEVYGAHV ----------------------2222---------------1111--------------- FPDDAAHRKGENPQYLYRVRFDAEELWGVKQNDSVYIDLWEGYLEPVSH 33331111-------------3333--------------1111------ >GLUTATHIONE TRANSFERASE G; SWP:Q9BHB0; PDB:1V2AA; MDYYYSLISPPCQSAILLAKKLGITLNLKKTNVHDPVERDALTKLNPQHTIPTLVDNGHV -----11113333--------------------------------1111------iiii- VWESYAIVLYLVETYAKDDTLYPKDPKVRSVVNQRLFFDIGTLYKRIIDVIHLVMKKEQP ------------------3333--3333-------------------------------- SDEQMEKLKGALDLLEQFVTERAYAAADHLTVADICLLGTVTALNWLKHDLEPFPHIRAW ----------------1111------------------------------3333------ LERVRAEMPDYEEFSKQVADDTLAYVAS -------2222----------------- >23-kDa polypeptide of pho; SWP:NA; PDB:1V2BA; TDFQTYNGDGFKLQIPSKWNPNKEVEYPGQVLRFEDNFDATSNVIVAITPTDKKSITDFG -------2222----1111-------2222-----1111---------------3333-- SPEQFLSQVDYLLAVAIANVLETSTAEVGGKQYYYLSILTRTGGKHQLVTATVNDGKLYI --------3333---------------iiii----------------------%%%%--- CKAQAGDKRWFKGAKKFVENTATSFSLA -----3333-2222-------1111--- >GLUTAMINE AMINOTRANSFERAS; SWP:Q75WK2; PDB:1V2DA; MRLHPRTEAAIFPRMSGLAQRLGAVNLGQGFPSNPPPPFLLEAVRRALGRQDQYAPPAGL ------1111--------------------------3333----3333-------3333- PALREALAEEFAVEPESVVVTSGATEALYVLLQSLVGPGDEVVVLEPFFDVYLPDAFLAG -------------3333-------------------2222--------1111----1111 AKARLVRLDLTPEGFRLDLSALEKALTPRTRALLLNTPMNPTGLVFGERELEAIARLARA ----------1111--------11111111------------------------------ HDLFLISDEVYDELYYGERPRRLREFAPERTFTVGSAGKRLEATGYRVGWIVGPKEFMPR --------1111---------3333-1111-----3333---1111-------3333--- 
LAGMRQWTSFSAPTPLQAGVAEALKLARREGFYEALREGYRRRRDLLAGGLRAMGLRVYV ---3333--------------------------------------------1111----- PEGTYFLMAELPGWDAFRLVEEARVALIPASAFYLEDPPKDLFRFAFCKTEEELHLALER ----------2222----------------1111-------------------------- LGRVV ----- >TRNA (GM18) METHYLTRANSFE; SWP:Q9FAC4; PDB:1V2XA; MRERTEARRRRIEEVLRRRQPDLTVLLENVHKPHNLSAILRTCDAVGVLEAHAVNPTGGV ---------------1111------------3333--------------------iiii- PTFNETSGGSHKWVYLRVHPDLHEAFRFLKERGFTVYATALREDARDFREVDYTKPTAVL --------3333-----------------1111--------1111-1111---------- FGAEKWGVSEEALALADGAIKIPMLGMVQSLNVSVAAAVILFEAQRQRLKAGLYDRPRLD --1111---------------------------------------------1111----- PELYQKVLADW ----------- >3300001G02RIK PROTEIN; SWP:Q8VIK1; PDB:1V2YA; GSSGSSGMTVRVCKMDGEVMPVVVVQNATVLDLKKAIQRYVQLKQEREGGVQHISWSYVW -------------3333----------------------------1111----------- RTYHLTSAGEKLTEDRKKLRDYGIRNRDEVSFIKKLGQKSGPSSG -----------------3333------------------------ >CIRCADIAN CLOCK PROTEIN K; SWP:Q6L8K1; PDB:1V2ZA; STAFFFRRMSPADKRKLLDELRSIYRTIVLEYFNTDAKVNERIDEFVSKAFFADISVSQV ------------------------------2222-------------------------- LEIHVELMDTFSKQLKLEGRSEDILLDYRLTLIDVIAHLCEMYRRS -----------------!!!!------------------------- >HYPOTHETICAL UPF0131 PROT; SWP:O58558; PDB:1V30A; SVRIAVYGTLRKGKPLHWYLKGAKFLGEDWIEGYQLYFEYLPYAVKGKGKLKVEVYEVDK -------1111--11111111--------------------------------------- ETFERINEIEIGTGYRLVEVSTKFGKAFLWEWGSKPRGKRIKSGDFDEIRLEHHHHHH --------3333---------1111---------------3333-------------- >HYPOTHETICAL PROTEIN RAFL; SWP:Q9FMT4; PDB:1V31A; GSSGSSGVPEKFKLSTALMDVLGIEVETRPRIIAAIWHYVKARKLQNPNDPSFFNCDAAL --------------3333----------------------1111--3333---------- QKVFGEEKLKFTMVSQKISHHLSPPPPSGPSSG -----------3333-3333------------- >HYPOTHETICAL PROTEIN RAFL; SWP:Q9FT92; PDB:1V32A; GSSGSSGKRFEFVGWGSRQLIEFLHSLGKDTSEMISRYDVSDTIAKYISKEGLLDPSNKK ----------------3333---------------3333---------------3333-- KVVCDKRLVLLFGTRTIFRMKVYDLLEKHYKENQDSGPSSG 
-----------------3333-------------------- >DNA PRIMASE SMALL SUBUNIT; SWP:O57934; PDB:1V33A; MLLREVTREERKNFYTNEWKVKDIPDFIVKTLELREFGFDHSGEGPSDRKNQYTDIRDLE -------------------3333-33331111----------------------3333-- DYIRATAPYAVYSSVALYEKPQEMEGWLGTELVFDIDAKDLPLRRCEHEPGTVCPICLND -------------------3333-------------3333-------------------- AKEIVRDTVIILREELGFNDIHIIYSGRGYHIRVLDEWALKLDSKSRERILSFVSASEIE -----------------------------------3333--------------------- DVEEFRKLLLNKRGWFVLNHGYPRAFRLRFGYFILRIKLPHLINAGIRKSIAKSILKSKE 3333-------3333-----3333--------1111------1111-------------- EIYEEFVRKAILAAFPQGVGIESLAKLFALSTRFSKSYFDGRVTVDLKRILRLPSTLHSK ----------1111--------------------1111--3333-1111---2222---- VGLIAKYVGTNERDVMRFNPFKHAVPKFRKEEVKVEYKKFLESLGT ------------------1111---1111----------3333--- >PHOSPHOGLYCERATE MUTASE; SWP:Q53WB3; PDB:1V37A; MELWLVRHGETLWNREGRLLGWTDLPLTAEGEAQARRLKGALPSLPAFSSDLLRARRTAE ----------1111---------------------3333------------3333----1 LAGFSPRLYPELREIHFGALEGALWETLDPRYKEALLRFQGFHPPGGESLSAFQERVFRF 111-----3333----!!!!---3333----------------2222------------- LEGLKAPAVLFTHGGVVRAVLRALGEDGLVPPGSAVAVDWPRRVLVRLALD 1111-----------------1111-----2222----------------- >SAM-DOMAIN PROTEIN SAMSN-; SWP:P57725; PDB:1V38A; GSSGSSGRRENHQTIQEFLERIHLQEYTSTLLLNGYETLDDLKDIKESHLIELNIADPED -------------------11113333----------33331111------------333 RARLLSAAESLLSGPSSG 3----------------- >PROTEIN TYROSINE PHOSPHAT; SWP:O75365; PDB:1V3AA; MARMNRPAPVEVSYKHMRFLITHNPTNATLSTFIEDLKKYGATTVVRVCEVTYDKTPLEK -------------3333---------1111----------------------------11 DGITVVDWPFDDGAPPPGKVVEDWLSLVKAKFCEAPGSCVAVHCVAGLGRAPVLVALALI 11--------3333---------------------------------------------3 ESGMKYEDAIQFIRQKRRGAINSKQLTYLEKYRPKQRLRFKD 333-3333--------------33333333------------ >HEMAGGLUTININ-NEURAMINIDA; SWP:Q81080; PDB:1V3EA; ITHDVGIKPLNPDDFWRCTSGLPSLMKTPKIRLMPGPGLLAMPTTVDGCIRTPSLVINDL ---2222---3333------------------------------1111------------ 
IYAYTSNLITRGCQDIGKSYQVLQIGIITVNSDLVPDLNPRISHTFNINDNRKSCSLALL ------------------------------1111------------3333---------! NTDVYQLCSTPKVDERSDYASPGIEDIVLDIVNYDGSISTTRFKNNNISFDQPYAALYPS !!!-----------------------------1111-------3333------------- VGPGIYYKGKIIFLGYGGLEHPINENVICNTTGCPGKTQRDCNQASHSPWFSDRRMVNSI ------iiii--------------------2222---3333------3333--------- IVVDKGLNSIPKLKVWTISMRQNYWGSEGRLLLLGNKIYIYTRSTSWHSKLQLGIIDITD -----1111---------3333-----------%%%%----------------------1 YSDIRIKWTWHNVLSRPGNNECPWGHSCPDGCITGVYTDAYPLNPTGSIVSSVILDSQKS 111---------------33332222-----------------1111------------- RVNPVITYSTATERVNELAILNRTLSAGYTTTSCITHYNKGYCFHIVEINHKSLNTLQPM ---------1111--------1111--------------------------1111----- LFKTEIPKSCS ----------- >PLECKSTRIN 2; SWP:Q9WV52; PDB:1V3FA; GSSGSSGLHRIVDKMHDTSTGIRPSPNMEQGSTYKKTFLGSSLVDWLISSNFAASRLEAV ------3333----------------------------3333-----1111--------- TLASMLMEENFLRPVGVRSMGAIRSGDLAEQFLDDSTALYTFAESYKKKVSSKESGPSSG ------------------------------------------------------------ >FERRIPYOCHELIN BINDING PR; SWP:O59257; PDB:1V3WA; MAIYEINGKKPRIHPSAFVDENAVVIGDVVLEEKTSVWPSAVLRGDIEQIYVGKYSNVQD -----!!!!----1111--1111--------2222--2222-----------2222--22 NVSIHTSHGYPTEIGEYVTIGHNAMVHGAKVGNYVIIGISSVILDGAKIGDHVIIGAGAV 22----2222----------2222-------------------2222--------2222- VPPNKEIPDYSLVLGVPGKVVRQLTEEEIEWTKKNAEIYVELAEKHIKGRKRI -2222------------------------------------------------ >PEPTIDE DEFORMYLASE; SWP:P43522; PDB:1V3YA; MVYPIRLYGDPVLRRKARPVEDFSGIKRLAEDMLETMFEAKGVGLAAPQIGLSQRLFVAV ---------3333---------1111-----------1111----3333----------- ELRELVRRVYVVANPVITYREGLVEGTEGLSLPGLYSEEVPRAERIRVEYQDEEGRGRVL -1111--------------------------2222----------------1111----- ELEGYMARVFQHEIDHLDGILFFERLPKPKREAFLEANRAELVRFQKEA ---------------1111-3333------------------------- >SUGAR-BINDING TRANSPORT A; SWP:O57758; PDB:1V43A; VIKMVEVKLENLTKRFGNFTAVNKLNLTIKDGEFLVLLGPSGCGKTTTLRMIAGLEEPTE 
---------------!!!!----------2222------2222----------------- GRIYFGDRDVTYLPPKDRNISMVFQHMTVYENIAFPLKKFPKDEIDKRVRWAAELLQIEE ----!!!!-11113333-----------------------1111-------------111 LLNRYPAQLSGGQRQRVAVARAIVVEPDVLLMDEPLSNLDAKLRVAMRAEIKKLQQKLKV 1----1111-------------1111-------1111----------------------- TTIYVTHDQVEAMTMGDRIAVMNRGQLLQIGSPTEVYLRPNSVFVATFIGAPEMNILEVS ----------------------iiii---------------------------------- VGDGYLEGRGFRIELPQMDLLKDYVGKTVLFGIRPEHMTVEGVHMKRTARLIGKVDFVEA --------------------1111---------1111----------------------- LGTDTILHVKFGDELVKVKLPGHIPIEPGREVKVIMDLDMIHVFDKDTEKAIV ----------!!!!----------------------1111------------- >ATP SULFURYLASE; SWP:Q5SKH7; PDB:1V47A; TLPALEIGEDERLDLENLATGAFFPVKGFMTREEALSVAHEMRLPTGEVWTIPILLQFRE --------------------1111-------------------1111------------- KPRVGPGNTVALLHGGERVALLHVAEAYELDLEALARAVFGTDSETHPGVARLYGKGPYA ----2222-----------------------------------3333------1111--- LAGRVEVLKPRPRTPLEKTPEEVRAFFRQRGWRKVVAFQTRNAPHRAHEYLIRLGLELAD -------------3333-------------------------------------3333-- GVLVHPILGAKKPDDFPTEVIVEAYQALIRDFLPQERVAFFGLATPMRYAGPKEAVFHAL -----------1111------------------3333----------------------- VRKNFGATHFLVGRDHAGVGDFYDPYAAHRIFDRLPPLGIEIVKVGAVFHCPLCGGIASE --1111--------2222-----1111--3333------------------3333---11 RTCPEGHREKRTAISMTKVRALLREGKAPPSELVRPELLPILRRGV 11-33331111-----------1111---1111-3333---3333- >GLUTAMATE-AMMONIA-LIGASE ; SWP:P30870; PDB:1V4AA; KPLSSPLQQYWQTVVERLPEPLAEESLSAQAKSVLTFSDFVQDSVIAHPEWLTELESQPP ----------------------1111---------------------------------- QADEWQHYAAWLQEALCNVSDEAGLRELRLFRRRIVRIAWAQTLALVTEESILQQLSYLA 1111---------1111---3333------------------------------------ ETLIVAARDWLYDACCREWGTPCNAQGEAQPLLILGGKLGGGELNFSSDIDLIFAWPERE ------------------------------------3333-------------------- LDNAQFFTRGQRLIKVLDQPTQDGFVYRVDRLRPFGESGPLVLSFAALEDYYQEQGRDWE --------------------1111--------2222------------------------ 
RYAVKARIGDSEGVYANELRALRPFVFRRYIDFSVIQSLRNKGIAREVRRRGLTDNIKLG -----------------------------------------------------------2 AGGIREIEFIVQVFQLIRGGREPSLQSRSLLPTLSAIAELHLLSENDAEQLRVAYLFLRR 222--------------33333333------------1111------------------- LENLLQSINDEQTQTLPSDELNRARLAWADFADWPQLTGALTAHTNVRRVFNELIG --------------------------------3333-------------------- >NADH-AZOREDUCTASE, FMN-DE; SWP:P41407; PDB:1V4BA; SKVLVLKSSILAGYSQSNQLSDYFVEQWREKHSADEITVRDLAANPIPVLDGELVGALRA ----------!!!!-----------------1111--------------------1111- PLTPRQQEALALSDELIAELKAHDVIVIAAPMYNFNISTQLKNYFDLVARAGVTFRYTEN -------------------1111---------%%%%-------------2222----111 GPEGLVTGKKAIVITSRGGIHKDGPTDLVTPYLSTFLGFIGITDVKFVFAEGIAYGPEMA 1-------------------2222-------------1111----------1111----- AKAQSDAKAAIDSIVSA ----------------- >OCTOPRENYL-DIPHOSPHATE SY; SWP:Q9X1M1; PDB:1V4EA; NSYELEKVKERIEQILSQFFPEQIMKDLPLYGKMLRVRLSILSFKNRGVEIGEDAISSLA --------------------3333--------------------1111------------ ALELVHLASLLHDDVIDGARFRRGKETINFMYGDKAAVAAGDLVLVSAFHTVEEIGNNKL ---------------------iiii--3333--------------------3333----- RRAFLNVIGKMSEAELIEQLSRYKPITKEEYLRIVEGKSGALFGLALQLPALLEGELGED ------------------1111-----------------------------1111----- LYNLGVTIGTIYQMFDDIMDFAGMEKIGKDGFLDLKNGVASFPLVTAMEKFPEARQMFEN ---------------------------1111---1111---------3333--------- RDWSGLMSFMREKGILKECEETLKVLVKNVIIENSWLRDF --3333----1111-------------------3333--- >Mucrocetin beta chain; SWP:Q6TPG9; PDB:1V4LB; GFCCPLGWSSYDEHCYQVFQQKMNWEDAEKFCTQQHRGSHLVSFHSSEEVDFVVSKTSPI ---------------------------------------------3333-------3333 LKHDFVWMGLSNVWNECAKEWSDGTKLDYKAWSGQSDCITSKTTDNQWLSMDCSSKRYVV --------------------1111-----------------1111------1111----- CKFQA ----- >271aa long hypothetical 5; SWP:Q975C3; PDB:1V4NA; EKASIGIIGGSGLYDPQILTNVKEIKVYTPYGEPSDNIILGELEGRKVAFLPRHGRGHRI --------------1111----------1111----------%%%%-----1111----- PPHKINYRANIWALKSLGVKWVIAVSAVGSLRLDYKPGDFVVPNQFIDMTKGRTYTFFDG 
1111----------1111-------------1111------------------------- PTVAHVSMADPFCEHLRSIILDSAKDLGITTHDKGTYICIEGPRFSTRAESIVWKEVFKA ------------------------------------------------------------ DIIGMTLVPEVNLACEAEMCYSVIGMVTDYDVFADIPVTAEEVTKVMAENTAKVKKLLYE --------------1111------------------------------------------ VIRRLPEKPDERKCSCCQALKTALVL -1111----333311113333----- >ALANYL-TRNA SYNTHETASE; SWP:O58307; PDB:1V4PA; MYSIEVRTHSALHVVKGAVVKVLGSEAKWTYSTYVKGNKGVLIVKFDRKPSDEEIREIER -----------------------3333--------!!!!--------------------- LANEKVKENAPIKIYELPREEAEKMFGEDMYDLFPVPEDVRILKVVVIEDWNVNACNKEH ------------------------------------1111-------------------- TKTTGEIGPIKIRKVRFRKSKGLLEIHFELL --3333------------1111--------- >TRANSCRIPTIONAL REPRESSOR; SWP:NA; PDB:1V4RA; MPYKAPEGKGYADVATHFRTLIKSGELAPGDTLPSVADIRAQFGVAAKTVSRALAVLKSE -------------------3333----------------------1111--3333----- GLVSSRGALGTVVEKNPIVITGADRLKRMEKNGMRYAPGE ---------------------3333--3333--------- >GLUCOKINASE ISOFORM 2; SWP:P35557; PDB:1V4SA; TLVEQILAEFQLQEEDLKKVMRRMQKEMDRGLRLETHEEASVKMLPTYVRSTPEGSEVGD -------1111---------------------3333------------------------ FLSLDLGGTNFRVMLVKVGEGEEGQWSVKTKHQMYSIPEDAMTGTAEMLFDYISECISDF --------------------3333--------------1111------------------ LDKHQMKHKKLPLGFTFSFPVRHEDIDKGILLNWTKGFKASGAEGNNVVGLLRDAIKRRG ------------------------1111------iiii----2222-------------- DFEMDVVAMVNDTVATMISCYYEDHQCEVGMIVGTGCNACYMEEMQNVELVEGDEGRMCV -------------------33331111----------------33331111--------- NTEWGAFGDSGELDEFLLEYDRLVDESSANPGQQLYEKLIGGKYMGELVRLVLLRLVDEN --3333-1111-1111-------------22223333-------------------1111 LLFHGEASEQLRTRGAFETRFVSQVESDTGDRKQIYNILSTLGLRPSTTDCDIVRRACES -%%%%--3333-2222-----------------------1111----------------- VSTRAAHMCSAGLAGVINRMRESRSEDVMRITVGVDGSVYKLHPSFKERFHASVRRLTPS ------------------3333-----------------------------------222 CEITFIESEEGSGRGAALVSAVACKKAC 2--------------------1111--- >UDP-N-ACETYLGLUCOSAMINE 2; SWP:P83824; PDB:1V4VA; 
GKRVVLAFGTRPEATKAPVYLALRGIPGLKPLVLLTGQHREQLRQALSLFGIQEDRNLDV -------------------------2222------------------1111--------- QERQALPDLAARILPQAARALKEGADYVLVHGDTLTTFAVAWAAFLEGIPVGHVEAGLRS --------------------------------------------1111------------ GNLKEPFPEEANRRLTDVLTDLDFAPTPLAKANLLKEGKREEGILVTGQTGVDAVLLAAK -1111----------3333---------------1111-3333----------------- LGRLPEGLPEGPYVTVTHRRENWPLLSDLAQALKRVAEAFPHLTFVYPVHLNPVVREAVF ------------------33331111-------------1111--------3333----- PVLKGVRNFVLLDPLEYGSAALRASLLLVTDSGGLQEEGAALGVPVVVLRNVTERPEGLK --2222---------3333--------------------1111----------------- AGILKLAGTDPEGVYRVVKGLLENPEELSRRKAKNPYGDGKAGLVARGVAWRLGLGPRPE ----------------------------------1111--3333------1111------ DWLP ---- >HEMOGLOBIN ALPHA CHAIN; SWP:Q8AYM0; PDB:1V4XA; TTLSDKDKSTVKALWGKISKSADAIGADALGRMLAVYPQTKTYFSHWPDMSPGSGPVKAH -----------------3333---------------3333---3333---2222------ GKKVMGGVALAVSKIDDLTTGLGDLSELHAFKMRVDPSNFKILSHCILVVVAKMFPKEFT -------------3333------------------3333----------------3333- PDAHVSLDKFLASVALALAERYR ------------------1111- >Hemoglobin beta chain; SWP:Q8AYM1; PDB:1V4XB; VEWTQQERSIIAGIFANLNYEDIGPKALARCLIVYPWTQRYFGAYGDLSTPDAIKGNAKI ------------------3333------------33331111------------------ AAHGVKVLHGLDRAVKNMDNINEAYSELSVLHSDKLHVDPDNFRILGDCLTVVIAANLGD ------------333311113333--------------3333---------------!!! 
AFTVETQCAFQKFLAVVVFALGRKYH !------------------1111--- >3-ISOPROPYLMALATE DEHYDRO; SWP:P12010; PDB:1V53A; MKMKLAVLPGDGIGPEVMDAAIRVLKTVLDNDGHEAVFENALIGGAAIDEAGTPLPEETL ----------!!!!-----------------------------------------3333- DICRRSDAILLGAVGGPKWDHNPASLRPEKGLLGLRKEMGLFANLRPVKAYATLLNASPL --1111---------3333---33333333------1111----------11113333-- KRERVENVDLVIVRELTGGLYFGRPSERRGPGENEVVDTLAYTREEIERIIEKAFQLAQI 33331111-----------1111-------%%%%-------------------------- RRKKLASVDKANVLESSRMWREIAEETAKKYPDVELSHMLVDSTSMQLIANPGQFDVIVT ---------1111------------3333-3333-----3333-------3333------ ENMFGDILSDEASVITGSLGMLPSASLRSDRFGMYEPVHGSAPDIAGQGKANPLGTVLSA -----------1111--------------------------3333--------------- ALMLRYSFGLEKEAAAIEKAVDDVLQDGYCTGDLQVANGKVVSTIELTDRLIEKLN ---------------------------------------------------3333- >CYTOCHROME C OXIDASE POLY; SWP:P00396; PDB:1V54A; FINRWLFSTNHKDIGTLYLLFGAWAGMVGTALSLLIRAELGQPGTLLGDDQIYNVVVTAH 3333-----3333----------------------------------------------- AFVMIFFMVMPIMIGGFGNWLVPLMIGAPDMAFPRMNNMSFWLLPPSFLLLLASSMVEAG -----------------------1111-----3333----------------1111!!!! 
AGTGWTVYPPLAGNLAHAGASVDLTIFSLHLAGVSSILGAINFITTIINMKPPAMSQYQT ---1111--1111--------------------------------------11111111- PLFVWSVMITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQHLFWFFGHP ------------------------------------11111111-3333----------- EVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAHHMFTVGMDVDT ------------------1111-------------------1111-----1111------ RAYFTSATMIIAIPTGVKVFSWLATLHGGNIKWSPAMMWALGFIFLFTVGGLTGIVLANS -------------------------2222------------------------------3 SLDIVLHDTYYVVAHFHYVLSMGAVFAIMGGFVHWFPLFSGYTLNDTWAKIHFAIMFVGV 333--2222---------------------------3333-------------------- NMTFFPQHFLGLSGMPRRYSDYPDAYTMWNTISSMGSFISLTAVMLMVFIIWEAFASKRE ----------------------3333---------------------------------- VLTVDLTTTNLEWLNGCPPPYHTFEEPTYVNLK -----33333333-------------------- >CYTOCHROME C OXIDASE POLY; SWP:P00404; PDB:1V54B; AYPMQLGFQDATSPIMEELLHFHDHTLMIVFLISSLVLYIISLMLTTKLTHTSTMDAQEV -2222-------------------------------------1111------------33 ETIWTILPAIILILIALPSLRILYMMDEINNPSLTVKTMGHQWYWSYEYTDYEDLSFDSY 33----------------------1111-------------------------------- MIPTSELKPGELRLLEVDNRVVLPMEMTIRMLVSSEDVLHSWAVPSLGLKTDAIPGRLNQ --3333-22222222----------------------------3333------2222--- TTLMSSRPGLYYGQCSEICGSNHSFMPIVLELVPLKYFEKWSASML ---------------------1111-----------------1111 >Cytochrome c oxidase subu; SWP:P00415; PDB:1V54C; HQTHAYHMVNPSPWPLTGALSALLMTSGLTMWFHFNSMTLLMIGLTTNMLTMYQWWRDVI ------------------------------------------------------------ RESTFQGHHTPAVQKGLRYGMILFIISEVLFFTGFFWAFYHSSLAPTPELGGCWPPTGIH -----------------------------------------1111-3333-----2222- PLNPLEVPLLNTSVLLASGVSITWAHHSLMEGDRKHMLQALFITITLGVYFTLLQASEYY --1111----------------------1111---------------------------- EAPFTISDGVYGSTFFVATGFHGLHVIIGSTFLIVCFFRQLKFHFTSNHHFGFEAAAWYW ----1111-------------------------------1111--1111----------- HFVDVVWLFLYVSIYWWGS ------------------- >Cytochrome c oxidase subu; SWP:P00423; PDB:1V54D; 
SVVKSEDYALPSYVDRRDYPLPDVAHVKNLSASQKALKEKEKASWSSLSIDEKVELYRLK ---1111--------1111--------------------11113333------------- FKESFAEMNRSTNEWKTVVGAAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRM ---3333-----3333----------------------------1111------------ LDMKVAPIQGFSAKWDYDKNEWKK ----------3333---------- >Cytochrome c oxidase subu; SWP:P00426; PDB:1V54E; HETDEEFDARWVTYFNKPDIDAWELRKGMNTLVGYDLVPEPKIIDAALRACRRLNDFASA --3333----------1111----------1111-----3333--------1111-3333 VRILEVVKDKAGPHKEIYPYVIQELRPTLNELGISTPEELGLDKV --------3333-1111------------1111--3333------ >Cytochrome c oxidase subu; SWP:P00428; PDB:1V54F; ASGGGVPTDEEQATGLEREVMLAARKGQDPYNILAPKATSGTKEDPNLVPSITNKRIVGC -------3333------------1111-1111---------3333--------------- ICEEDNSTVIWFWLHKGEAQRCPSCGTHYKLVPHQLAH --2222-------------------------------- >Cytochrome c oxidase subu; SWP:P00429; PDB:1V54H; KIKNYQTAPFDSRFPNQNQTRNCWQNYLDFHRCEKAMTAKGGDVSVCEWYRRVYKSLCPI ----------3333---------------------------------------------- SWVSTWDDRRAEGTFPGKI ------------------- >Cytochrome c oxidase poly; SWP:P04038; PDB:1V54I; TALAKPQMRGLLARRLRFHIVGAFMVSLGFATFYKFAVAEKRKKAYADFYRNYDSMKDFE ------------------------------------------------------------ EMRKAGIFQSAK --1111------ >Cytochrome c oxidase poly; SWP:P07470; PDB:1V54J; FENRVAEKQKLFQEDNGLPVHLKGGATDNILYRVTMTLCLGGTLYSLYCLGWASFPHK ---3333---1111----1111-----------------------------3333--- >Cytochrome c oxidase poly; SWP:P13183; PDB:1V54K; APDFHDKYGNAVLASGATFCVAVWVYMATQIGIEWNPSPVGRVTPKEWR --3333--------------------------------2222------- >Cytochrome c oxidase subu; SWP:P00430; PDB:1V54L; HYEEGPGKNIPFSVENKWRLLAMMTLFFGSGFAAPFFIVRHQLLKK ----2222-------3333----------------------1111- >Cytochrome c oxidase poly; SWP:P10175; PDB:1V54M; ITAKPAKTPTSPKEQAIGLSVTFLSFLLPAGWVLYHLDNYKKS ------------------------------------3333--- >SPINOXIN; SWP:P84094; PDB:1V56A; IRCSGSRDCYSPCMKQTGCPNAKCINKSCKCYGC ----3333----------------------3333 >THIOL:DISULFIDE INTERCHAN; SWP:P77202; PDB:1V58A; 
ELPAPVKAIEKQGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISGYMYNEKGE ---------1111---------iiii------!!!!------3333---------1111- NLSNTLIEKEIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFADPFCPYCKQFWQQARP -------------1111-----1111------1111-----------3333--------- WVDSGKVQLRTLLVGVIKPESPATAAAILASKDPAKTWQQYEASGGKLKLNVPANVSTEQ -1111------------1111------1111----------1111--------------- MKVLSDNEKLMDDLGANVTPAIYYMSKENTLQQAVGLPDQKTLNIIMGN -----------1111-----------%%%%------------------- >DIHYDROLIPOAMIDE DEHYDROG; SWP:P09624; PDB:1V59A; TINKSHDVVIIGGGPAGYVAAIKAAQLGFNTACVEKRGKLGGTCLNVGCIPSKALLNNSH -------------3333-------------------------3333-------------- LFHQMHTEAQKRGIDVNGDIKINVANFQKAKDDAVKQLTGGIELLFKKNKVTYYKGNGSF --------3333------------------------------------------------ EDETKIRVTPVDGLEGTVKEDHILDVKNIIVATGSEVTPFPGIEIDEEKIVSSTGALSLK -1111--------2222----------------------2222--------3333----- EIPKRLTIIGGGIIGLEMGSVYSRLGSKVTVVEFQPQIGASMDGEVAKATQKFLKKQGLD ----------------------1111----------------------------1111-- FKLSTKVISAKRNDDKNVVEIVVEDTKTNKQENLEAEVLLVAVGRRPYIAGLGAEKIGLE ------------------------------------------------22223333---- VDKRGRLVIDDQFNSKFPHIKVVGDVTFGPMLAHKAEEEGIAAVEMLKTGHGHVNYNNIP -1111-----------1111---1111----------------3333-------1111-- SVMYSHPEVAWVGKTEEQLKEAGIDYKIGKFPFAANSRAKTNQDTEGFVKILIDSKTERI -------------------1111--------3333----1111----------------- LGAHIIGPNAGEMIAEAGLALEYGASAEDVARVCHAHPTLSEAFKEANMAAYDKAIHC --------3333-------------33331111-----3333---------------- >CHITOSANASE; SWP:Q9ALZ1; PDB:1V5DA; AKEMKPFPQQVNYAGVIKPNHVTQESLNASVRSYYDNWKKKYLKNDLSSLPGGYYVKGEI ------------2222------------------------------1111---------- TGDADGFKPLGTSEGQGYGMIITVLMAGYDSNAQKIYDGLFKTARTFKSSQNPNLMGWVV ---iiii------------------22221111------------------1111----- ADSKKAQGHFDSATDGDLDIAYSLLLAHKQWGSNGTVNYLKEAQDMITKGIKASNVTNNN --3333--------------------------------------------------1111 QLNLGDWDSKSSLDTRPSDWMMSHLRAFYEFTGDKTWLTVINNLYDVYTQFSNKYSPNTG 
----11111111---3333----------------------------------------- LISDFVVKNPPQPAPKDFLDESEYTNAYYYNASRVPLRIVMDYAMYGEKRSKVISDKVSS ----------------2222---1111-3333---------------3333--------- WIQNKTNGNPSKIVDGYQLNGSNIGSYPTAVFVSPFIAASITSSNNQKWVNSGWDWMKNK -----%%%%1111----1111-------3333-------11111111------------- RERYFSDSYNLLTMLFITGNWWKPVP -------------------------- >PYRUVATE OXIDASE; SWP:NA; PDB:1V5EA; DNKINIGLAVMKILESWGADTIYGIPSGTLSSLMDAMGEEENNVKFLQVKHEEVGAMAAV --------------1111--------1111-3333---%%%%------------------ MQSKFGGNLGVTVGSGGPGASHLINGLYDAAMDNIPVVAILGSRPQRELNMDAFQELNQN --1111--------------------------------------3333----------33 PMYDHIAVYNRRVAYAEQLPKLVDEAARMAIAKRGVAVLEVPGDFAKVEIDNDQWYSSAN 333333--------3333------------1111-------1111-----1111---333 SLRKYAPIAPAAQDIDAAVELLNNSKRPVIYAGIGTMGHGPAVQELARKIKAPVITTGKN 3--------------------------------1111-------------------1111 FETFEWDFEALTGSTYRVGWKPANETILEADTVLFAGSNFPFSEVEGTFRNVDNFIQIDI 33331111-----------------1111----------11111111-1111-------- DPAMLGKRHHADVAILGDAALAIDEILNKVDAVEESAWWTANLKNIANWREYINMLETKE 3333-------------------------------------------------------- EGDLQFYQVYNAINNHADEDAIYSIDVGNSTQTSIRHLHMTPKNMWRTSPLFATMGIAIP -----------------1111------3333--1111---3333---------------- GGLGAKNTYPDRQVWNIIGDGAFSMTYPDVVTNVRYNMPVINVVFSNTEYAFIKNKYEDT --------1111------3333---------------------------3333------- NKNLFGVDFTDVDYAKIAEAQGAKGFTVSRIEDMDRVMAEAVAANKAGHTVVIDCKITQD ------------------1111-------1111-----------1111------------ RPIPVETLKLDSKLYSEDEIKAYKERYEAANLVPFREYLEAEGLESKYIK ---1111---1111--------------1111-3333--1111------- >Serine proteinase inhibit; SWP:Q7M4T6; PDB:1V5IB; SAGKFIVIFKNDVSEDKIRETKDEVIAEGGTITNEYNMPGMKGFAGELTPQSLTKFQGLQ ---------1111------------------------2222-------3333----1111 GDLIDSIEEDHVAHAY ---------------- >KIAA1355 PROTEIN; SWP:Q9P2J2; PDB:1V5JA; GSSGSSGLSPPRGLVAVRTPRGVLLHWDPPELVPKRLDGYVLEGRQGSQGWEVLDPAVAG ------------------1111------------------------------------11 
TETELLVPGLIKDVLYEFRLVAFAGSFVSDPSNTANVSTSGLSGPSSG 11---------------------!!!!--------------------- >microtubule-associated pr; SWP:Q61166; PDB:1V5KA; GSSGSSGQRRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKL --------------------------3333-----------------------------3 EHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDSGPSSG 333----------------------1111-------1111--------------- >PDZ AND LIM DOMAIN 3; SWP:NA; PDB:1V5LA; GSSGSSGNVVLPGPAPWGFRLSGGIDFNQPLVITRITPGSKAAAANLCPGDVILAIDGFG -----------------------------------------3333----------%%%%- TESMTHADAQDRIKAASYQLCLKIDRAETRLWSPQVSSGPSSG ----3333----1111--------------------------- >SH2 AND PH DOMAIN-CONTAIN; SWP:Q9JID9; PDB:1V5MA; GSSGSSGNLAAKVELVDIQREGALRFMVADDAASGPGGTAQWQKCRLLLRRAVAGERFRL ------------------------------------------------------------ EFFVPPKASRPKVSIPLSAIIEVRTTMPLEMPEKDNTFVLKVENGAEYILETIDSLQKHS ----1111-------3333----------------------------------3333--- WVADIQGCVDSGPSSG ---------------- >PDI-LIKE HYPOTHETICAL PRO; SWP:O80763; PDB:1V5NA; GSSGSSGTEERLKEIEAKYDEIAKDWPKKVKHVLHEEHELELTRVQVYTCDKCEEEGTIW -------------3333-3333----------1111------------------------ SYHCDECDFDLHAKCALNEDTKESGPSSG -----------3333-------------- >1700011N24RIK PROTEIN; SWP:NA; PDB:1V5OA; GSSGSSGMLITVYCVRRDLTEVTFSLQVNPDFELSNFRVLCELESGVPAEEAQIVYMEQL ---------------%%%%----------------------------3333----iiii- LTDDHCSLGSYGLKDGDMVVLLQKDNVGLRTPGRTPSGPSSG --------3333-2222------------------------- >PLECKSTRIN HOMOLOGY DOMAI; SWP:NA; PDB:1V5PA; GSSGSSGMPYVDRQNRICGFLDIEDNENSGKFLRRYFILDTQANCLLWYMDNPQNLAVGA ----------------------------------------3333-------3333----- GAVGSLQLTYISKVSIATPKQKPKTPFCFVINALSQRYFLQANDQKDLKDWVEALNQASK ------3333-------1111--------------------------------------- SGPSSG ------ >GROWTH-ARREST-SPECIFIC PR; SWP:P11862; PDB:1V5RA; GSSGSSGNLLDDAVKRISEDPPCKCPTKFCVERLSQGRYRVGEKILFIRMLHNKHVMVRV ----------------------------------2222------------%%%%------ GGGWETFAGYLLKHDPCRMLQISRVDGKTSPSGPSSG 
-----3333------3333------------------ >MAP/MICROTUBULE AFFINITY-; SWP:Q03141; PDB:1V5SA; MKDHLIHNVHKEEHAHAHNKDYDIPTTENLYFQGSSGSSGDMMREIRKVLGANNCDYEQR -------------------------------------3333------------------- ERFLLFCVHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIASKIANELKL ------------3333----------3333--------------3333------------ SGPSSG ------ >8430435I17RIK PROTEIN; SWP:NA; PDB:1V5TA; GSSGSSGLPIIVKWGGQEYSVTTLSEDDTVLDLKQFLKTLTGVLPERQKLLGLKVKGKPA ------------------------33333333-----------3333------------- ENDVKLGALKLKPNTKIMMMGTRESGPSSG ----3333---------------------- >SET BINDING FACTOR 1; SWP:Q8BK68; PDB:1V5UA; GSSGSSGRSYEGILYKKGAFMKPWKARWFVLDKTKHQLRYYDHRMDTECKGVIDLAEVEA -----------------------------------------------------3333--- VAPGTPTIGAPKTVDEKAFFDVKTTRRVYNFCAQDVPSAQQWVDRIQSCLSSGPSSG --------------------------------------------------------- >AMINOMETHYLTRANSFERASE; SWP:O58888; PDB:1V5VA; QMVKRVHIFDWHKEHARKIEEFAGWEMPIWYSSIKEEHLAVRNAVGIFDVSHMGEIVFRG -------3333----------iiii------------------------1111------1 KDALKFLQYVTTNDISKPPAISGTYTLVLNERGAIKDETLVFNMGNNEYLMICDSDAFEK 111----------1111------------1111-----------%%%%-----1111--- LYAWFTYLKRTIEQFTKLDLEIELKTYDIAMFAVQGPKARDLAKDLFGIDINEMWWFQAR -----------3333---------1111-------1111----------3333-2222-- WVELDGIKMLLSRSGYTGENGFEVYIEDANPYHPDESKRGEPEKALHVWERILEEGKKYG ---iiii----------------------1111-3333---3333----------3333- IKPCGLGARDTLRLEAGYTLYGNETKELQLLSTDIDEVTPLQANLEFAIYWDKDFIGKDA -------------------2222---------------3333--3333------2222-- LLKQKERGVGRKLVHFKMIDKGIPREGYKVYANGEMIGEVTSGTLSPLLNVGIGIAFVKE ------------------------2222---iiii-----------------------33 EYAKPGIEIEVEIRGQRKKAVTVTPPFYDPKKYGLFRET 33----------iiii------------1111-1111-- >MEIOTIC RECOMBINATION PRO; SWP:Q14565; PDB:1V5WA; PGFLTAFEYSEKRKMVFHITTGSQEFDKLLGGGIESMAITEAFGEFRTGKTQLSHTLCVT ------3333--1111------3333---------------------------------- AQLPGAGGYPGGKIIFIDTENTFRPDRLRDIADRFNVDHDAVLDNVLYARAYTSEHQMEL 
-----%%%%--------------333311113333-------1111-------------1 LDYVAAKFHEEAGIFKLLIIDSIMALFRVDFSGRGELAERQQKLAQMLSRLQKISEEYNV 111---------------------3333-------3333----------------1111- AVFVTNQMGHILAHASTTRISLRKGRGELRIAKIYDSPEMPENEATFAITAGGIGDAKE ------------3333------------------------------------------- >PHOSPHORIBOSYLANTHRANILAT; SWP:P83825; PDB:1V5XA; MRVKICGITRLEDALLAEALGAFALGFVLAPGSRRRIAPEAARAIGEALGPFVVRVGVFR ---------------------------------------------1111----------- DQPPEEVLRLMEEARLQVAQLHGEEPPEWAEAVGRFYPVIKAFPLEGPARPEWADYPAQA --3333-------------------3333---1111--------------3333------ LLLDGKRPGSGEAYPRAWAKPLLATGRRVILAGGIAPENLEEVLALRPYALDLASGVEEA --------------3333--------------------33333333-------3333--2 PGVKSAEKLRALFARLASLR 222------------3333- >RIKEN CDNA 1810037G04; SWP:Q9D8S9; PDB:1V60A; GSSGSSGMATRSCVSRGSAGSAAAGPVEAAIRAKLEQALSPEVLELRNESGGHAVPAGSE ------------------------------------------------------------ THFRVAVVSSRFEGMSPLQRHRLVHEALSEELAGPVHALAIQAKTPAQWRENPQLDISPP ----------------------------3333---------------3333--------- CLG --- >Rac/Cdc42 guanine nucleot; SWP:Q8K4I3; PDB:1V61A; GSSGSSGQILSEPIQAWEGDDIKTLGNVIFMSQVVMQHGACEEKEERYFLLFSSVLIMLS ---------------------3333----------------------------------- ASPRMSGFMYQGKIPIAGMVVNRLDEIEGSDCMFEITGSTVERIVVHCNNNQDFQEWMEQ ---------------2222-------------------------------3333------ LNRLTKSGPSSG ------------ >KIAA1719 PROTEIN; SWP:Q9C0E4; PDB:1V62A; GSSGSSGDTVANASGPLMVEIVKTPGSALGISLTTTSLRNKSVITIDRIKPASVVDRSGA ----------------------------------------------------3333---- LHPGDHILSIDGTSMEHCSLLEATKLLASISEKVRLEILPVPQSQRPLRPSSGPSSG ---------iiii---------------------------3333------------- >NUCLEOLAR TRANSCRIPTION F; SWP:P25976; PDB:1V63A; GSSGSSGPKKPPMNGYQKFSQELLSNGELNHLPLKERMVEIGSRWQRISQSQKEHYKKLA --------------3333------------------------------------------ EEQQRQYKVHLDLWVKSLSPQDRAAYKEYISNKRKSGPSSG -----------------------------3333-------- >NUCLEOLAR TRANSCRIPTION F; SWP:P25976; PDB:1V64A; 
GSSGSSGQLKDKFDGRPTKPPPNSYSLYCAELMANMKDVPSTERMVLCSQQWKLLSQKEK ------------------------------------------------------------ DAYHKKCDQKKKDYEVELLRFLESLPEEEQQRVLGEEKMLNISGPSSG ---------------------3333-----------3333-------- >RIKEN CDNA 2610044O15; SWP:NA; PDB:1V65A; GSSGSSGVTYDDVHMNFTEEEWDLLDSSQKRLYEEVMLETYQNLTDIGYNWQDHHIEESG -----------------3333----3333------------------------------- PSSG ---- >PROTEIN INHIBITOR OF ACTI; SWP:O75925; PDB:1V66A; MADSAELKQMVMSLRVSELQVLLGYAGRNKHGRKHELLTKALHLLKAGCSPAVQMKIKEL ----------1111---------1111-----------------1111------------ YRRRF -1111 >L-LACTATE DEHYDROGENASE A; SWP:Q9W7K5; PDB:1V6AA; ASTKEKLITHVSKEEPAGPTNKVTVVGVGMVGMAAAISILLKDLTDELALVDVMEDKLKG -3333----------------------------------1111----------------- EAMDLQHGSLFLKTHKIVADKDYSVTANSKVVVVTAGARQQEGESRLNLVQRNVNIFKFI -------3333------------1111-------------2222---------------- IPNIIKYSPNCILLVVSNPVDILTYVAWKLSGLPRNRVIGSGTNLDSARFRHLMGEKLGI -------1111----------------------3333---!!!!---------------- HPSNCHGWVIGEHGDSSVPVWSGVNVAGVFLQGLNPDMGTDKDKEDWKSVHKMVVDSAYE 3333---------1111--3333--iiii3333-1111-3333--3333----------- VIKLKGYTSWAIGMSAADLCQSILKNLRKCHPVSTLVKGMHGVNEEVFLSVPCILGNSGL ----------------------1111----------2222---------------1111- TDVVHMTLKSDEEKQLVKSAETLWGVQKDLTL --------3333-------------3333--- >HARMONIN ISOFORM A1; SWP:Q9ES64; PDB:1V6BA; GSEGAATMFSPEQIAGKDVRLLRIKKEGSLDLALEGGVDSPVGKVVVSAVYEGGAAERHG ---------1111------------------------------------------3333- GVVKGDEIMAINGKIVTDYTLAEAEAALQKAWNQGGDWIDLVVAVCPPKEYDDELTFF -------------------------------3333----------------------- >CYTOSKELETON-ASSOCIATED P; SWP:Q9D1E6; PDB:1V6EA; GSSGSSGVMVFISSSLNSFRSEKRYSRSLTIAEFKCKLELVVGSPASCMELELYGADDKF -------------3333--------1111-------3333----1111------------ YSKLDQEDALLGSYPVDDGCRIHVIDHSGSGPSSG ----------------------------------- >GLIA MATURATION FACTOR, B; SWP:NA; PDB:1V6FA; GSSGSSGSESLVVCDVAEDLVEKLRKFRFRKETHNAAIIMKIDKDERLVVLDEELEGVSP 
----------------------------------------------------------11 DELKDELPERQPRFIVYSYKYQHDDGRVSYPLCFIFSSPVGCKPEQQMMYAGSKNKLVQT 113333----------------3333------------11113333-------------- AELTKVFEIRNTEDLTEEWLREKLGSGPSSG ----------3333-3333------------ >ACTIN BINDING LIM PROTEIN; SWP:Q6H8Q1; PDB:1V6GA; GSSGSSGLDYQRLYGTRCFSCDQFIEGEVVSALGKTYHPDCFVCAVCRLPFPPGDRVTFN --------3333-------------------%%%%--1111------------------- GKECMCQKCSLPVSVSGPSSG -----3333------------ >GALACTOSE-BINDING LECTIN; SWP:P02872; PDB:1V6IA; AETVSFNFNSFSEGNPAINFQGDVTVLSNGNIQLTNLNKVNSVGRVLYAMPVRIWSSATG -----------2222-----------1111-----1111--------------------- NVASFLTSFSFEMKDIKDYDPADGIIFFIAPEDTQIPAGSIGGGTLGVSDTKGAGHFVGV ------------------------------1111--2222-!!!!----3333------- EFDTYSNSEYNDPPTDHVGIDVNSVDSVKTVPWNSVSGAVVKVTVIYDSSTKTLSVAVTN ------3333-------------------------2222---------1111-------1 DNGDITTIAQVVDLKAKLPERVKFGFSASGSLGGRQIHLIRSWSFTSTLITT 111---------3333--------------1111------------------ >COBROTOXIN; SWP:P01430; PDB:1V6PA; LECHNQQSSQTPTTTGCSGGETNCYKKRWRDHRGYRTERGCGCPSVKNGIEINCCTTDRC ------!!!!-------1111-------------------------2222------2222 NN -- >PHOSPHOGLYCERATE KINASE; SWP:P09403; PDB:1V6SA; MRTLLDLDPKGKRVLVRVDYNVPVQDGKVQDETRILESLPTLRHLLAGGASLVLLSHLGR --1111--2222------------%%%%-------------------------------- PKGPDPKYSLAPVGEALRAHLPEARFAPFPPGSEEARREAEALRPGEVLLLENVRFEPGE ----3333------------1111-----1111------11112222---------1111 EKNDPELSARYARLGEAFVLDAFGSAHRAHASVVGVARLLPAYAGFLMEKEVRALSRLLK ----------1111-------3333----11113333------------------1111- DPERPYAVVLGGAKVSDKIGVIESLLPRIDRLLIGGAMAFTFLKALGGEVGRSLVEEDRL -------------3333-------1111---------------1111--!!!!--1111- DLAKDLLGRAEALGVRVYLPEDVVAAERIEAGVETRVFPARAIPVPYMGLDIGPKTREAF ----------1111---------------2222-----1111------------------ ARALEGARTVFWNGPMGVFEVPPFDEGTLAVGQAIAALEGAFTVVGGGDSVAAVNRLGLK -1111------------1111-------------1111--------------------33 ERFGHVSTGGGASLEFLEKGTLPGLEVLEG 33-------------------33331111- 
>HYPOTHETICAL UPF0271 PROT; SWP:O58714; PDB:1V6TA; MRVDLNSDLGESFGRYKLGLDEEVMKYITSANVACGWHAGDPLVMRKTVRLAKENDVQVG ------------!!!!----3333----------------------------1111---- AHPGYPDLMGFGRRYMKLTPEEARNYILYQVGALYAFAKAEGLELQHVKPHGALYNAMVK ------3333----------------------------1111------------------ EEDLARAVIEGILDFDKDLILVTLSNSRVADIAEEMGLKVAHEVFADRAYNPDGTLVPAV ---------------1111----2222------1111-------1111--1111------ IEDKEEIAERVISMVKDGGIRAINGEWVDLKVDTICVHGDNPKAVEITSYIRKVLEEEGV ---------------------1111------------------------------1111- KIVPMKEFI ---3333-- >endo-1,4-beta-D-xylanase ; SWP:P07986; PDB:1V6YA; STLGAAAAQSGRYFGTAIASGKLGDSAYTTIASREFNMVTAENEMKIDATEPQRGQFNFS -------1111-------3333--------------------11113333--2222---- AGDRVYNWAVQNGKQVRGHTLAWHSQQPGWMQSLSGSTLRQAMIDHINGVMGHYKGKIAQ ---------1111----------------------------------------2222--- WDVVNEAFSDDGSGGRRDSNLQRTGNDWIEVAFRTARAADPAAKLCYNDYNIENWTWAKT ------------------3333--1111-----------1111----------1111--- QGVYNMVRDFKQRGVPIDCVGFQSHLIVGQVPGDFRQNLQRFADLGVDVRITELDIRMRT --------------------------2222-1111------------------------- PSDATKLATQAADYKKVVQACMQVTRCQGVTVWGITDKYSWVPDVFPGEGAALVWDASYA -----------------------1111--------3333-3333-2222------1111- KKPAYAAVMEAFGSRS ---------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q72K73; PDB:1V6ZA; RPHRAFSPGLTGVLPLRETRHLVEVLRARVGDRFTVFDGEREALAEVVDLGPPLRYRVLE ------2222------------------2222---------------------------- ERRPEREVGVEVVLYVALLKGDKLAEVVRAATELGATRIQPLVTRHSVPKEGEGKLRRLR -------------------!!!!--------1111--------1111------------- AVALEAAKQSGRVVVPEVLPPIPLKAVPQVAQGLVAHVGATARVREVLDPEKPLALAVGP -------1111-----------3333----------1111--3333--1111-------3 EGGFAEEEVALLEARGFTPVSLGRRILRAETAALALLALCTAGEGR 333-------------------------------------3333-- >PROBABLE ANTIBIOTICS SYNT; SWP:Q5SM39; PDB:1V70A; MEIKDLKRLARYNPEKMAKIPVFQSERMLYDLYALLPGQAQKVHVHEGSDKVYYALEGEV ----3333----3333--------3333-------2222--------------------- VVRVGEEEALLAPGMAAFAPAGAPHGVRNESASPALLLVVTAPRP 
---!!!!----2222----2222---------------------- >HYPOTHETICAL PROTEIN C320; SWP:O59791; PDB:1V71A; VLPTYDDVASASERIKKFANKTPVLTSSTVNKEFVAEVFFKCENFQKMGAFKFRGALNAL ---3333-------3333-----------------------33332222----------- SQLNEAQRKAGVLTFSSGNHAQAIALSAKILGIPAKIIMPLDAPEAKVAATKGYGGQVIM ----------------------------1111-------1111--------1111----- YDRYKDDREKMAKEISEREGLTIIPPYDHPHVLAGQGTAAKELFEEVGPLDALFVCLGGG -3333-----------------------3333---------------------------- GLLSGSALAARHFAPNCEVYGVEPEAGNDGQQSFRKGSIVHIDTPKTIADGAQTQHLGNY -------------1111------3333---------------------1111-------- TFSIIKEKVDDILTVSDEELIDCLKFYAARMKIVVEPTGCLSFAAARAMKEKLKNKRIGI ---3333----------------------------3333---------3333-------- IISGGNVDIERYAHFLSQ --------------1111 >ALDOLASE; SWP:Q59IT3; PDB:1V72A; RPPALGFSSDNIAGASPEVAQALVKHSSGQAGPYGTDELTAQVKRKFCEIFERDVEVFLV --------1111--------------------iiii------------------------ PTGTAANALCLSAMTPPWGNIYCHPASHINNDECGAPEFFSNGAKLMTVDGPAAKLDIVR ----------1111-1111----11111111-%%%%--1111---------%%%%----- LRERTREKVGDVHTTQPACVSITQATEVGSIYTLDEIEAIGDVCKSSSLGLHMDGSRFAN -------2222--------------1111---------------------------3333 ALVSLGCSPAEMTWKAGVDALSFGATKNGVLAAEAIVLFNTSLATEMSYRRKRAGHLSSK -------3333-3333--------3333-----------3333-3333---1111----- MRFLSAQIDAYLTDDLWLRNARKANAAAQRLAQGLEGLGGVEVLGGTEANILFCRLDSAM ------------%%%%----------------------------------------3333 IDALLKAGFGFYHDRWGPNVVRFVTSFATTAEDVDHLLNQVRLAA ----------------2222-----1111-----------3333- >PSYCHROPHILIC PHOSPHATASE; SWP:Q9S427; PDB:1V73A; TEFDGPYVITPISGQSTAYWICDNRLKTTSIEKLQVNRPEHCGDLPETKLSSEIKQIMPD ---------------------%%%%------%%%%----------------1111----- TYLGIKKVVALSDVHGQYDVLLTLLKKQKIIDSDGNWAFGEGHMVMTGDIFDRGHQVNEV -------------------------------1111---!!!!------------------ LWFMYQLDQQARDAGGMVHLLMGNHEQMVLGGDLRYVHQRYDIATTLINRPYNKLYGADT -----------1111-------------1111-1111--------1111-3333--1111 EIGQWLRSKNTIIKINDVLYMHGGISSEWISRELTLDKANALYRANVDASKKSLKADDLL 
-----1111--------------------1111------------1111----------- NFLFFGNGPTWYRGYFSETFTEAELDTILQHFNVNHIVVGHTSQERVLGLFHNKVIAVDS ------------33331111------------------------------%%%%------ SIKVGKSGELLLLENNRLIRGLYDGTRETLQ -------------%%%%----1111------ >COLICIN D; SWP:P17998; PDB:1V74A; LNDPLDSGRFSRKQLDKKYKHAGDFGISDTKKNRETLTKFRDAIEEHLSDKDTVEKGTYR --1111!!!!-------33333333------------------------1111-----33 REKGSKVYFNPNTMNVVIIKSNGEFLSGWKINPDADNGRIYLETGEL 33-----------------1111--------1111------------ >Colicin-D immunity protei; SWP:P11899; PDB:1V74B; MNKMAMIDLAKLFLASKITAIEFSERICVERRRLYGVKDLSPNILNCGEELFMAAERFEP -1111-------1111-----------------2222---3333----------1111-- DADRANYEIDDNGLKVEVRSILEKFKL 11111111------------------- >RNASE P PROTEIN PH1771P; SWP:O59425; PDB:1V76A; GRVTRRNIIWHELIGLRVRIVGSTHPAFVGIEGYVIDETRNMLVIAGDRIWKVPKDVSIF ---33331111-2222--------3333----------1111------------1111-- EFEADDGTKIKIPGERLVGRPEMRLKKRWKKW ---1111-----3333---3333--3333--- >HYPOTHETICAL PROTEIN PH18; SWP:O59543; PDB:1V77A; VKFIEMDIRDKEAYELAKEWFDEVVVSIKFNEEVDKEKLREARKEYGKVAILLSNPKPSL ----------------1111---------------------------------------- VRDTVQKFKSYLIYVESNDLRVIRYSIEKGVDAIISPWVNRKDPGIDHVLAKLMVKKNVA -------1111---------------1111-----1111--------------------- LGFSLRPLLYSNPYERANLLRFMMKAWKLVEKYKVRRFLTSSAQEKWDVRYPRDLISLGV -----3333-----------------------------------1111-----------1 VIGMEIPQAKASISMYPEIILK 111------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q8NMG3; PDB:1V7BA; MRTSKKEMILRTAIDYIGEYSLETLSYDSLAEATGLSKSGLIYHFPSRHALLLGMHELLA --------------------3333------------------------------------ DDWDKELRDITRDPEDPLERLRAVVVTLAENVSRPELLLLIDAPSHPDFLNAWRTVNHQW ------------3333---------1111------------3333----------3333- IPDTDDLENDAHKRAVYLVQLAADGLFVHDYIHDDVLSKSKRQAMLETILELIPS ---2222--------------------3333-----------------3333--- >THREONINE SYNTHASE; SWP:P83823; PDB:1V7CA; MRPPLIERYRNLLPVSEKTPVISLLEGSTPLIPLKGPEEARKKGIRLYAKYEGLNPTGSF 
--------3333---1111-----------------33331111------11111111-- KDRGMTLAVSKAVEGGAQAVACASTGNTAASAAAYAARAGILAIVVLPAGYVALGKVAQS ------------1111-------------------------------2222--------- LVHGARIVQVEGNFDDALRLTQKLTEAFPVALVNSVNPHRLEGQKTLAFEVVDELGDAPH 1111-------------------1111------3333----------------------- YHALPVGNAGNITAHWMGYKAYHALGKAKRLPRMLGFQAAGAAPLVLGRPVERPETLATA --------------------------------------11113333-----------333 IRIGNPASWQGAVRAKEESGGVIEAVTDEEILFAYRYLAREEGIFCEPASAAAMAGVFKL 3---------------1111---------------------------3333--------- LREGRLEPESTVVLTLTGHGLKDPATAERVAELPPPVPARLEAVAAAAGLL 1111--------------333333331111---------------1111-- >PHRIXOTOXIN 1; SWP:P61230; PDB:1V7FA; YCQKWMWTCDSARKCCEGLVCRLWCKKII ---2222--------2222---------- >3-ISOPROPYLMALATE DEHYDRA; SWP:O59393; PDB:1V7LA; MITTGKVWKFGDDISTDEITPGRYNLTKDPKELAKIAFIEVRPDFARNVRPGDVVVAGKN --------------3333--1111----3333---------1111----2222------- FGIGSSRESAALALKALGIAGVIAESFGRIFYRNAINIGIPLLLGKTEGLKDGDLVTVNW ----------------------------------------------33332222------ ETGEVRKGDEILMFEPLEDFLLEIVREGGILEYIRRRGDLCI ------!!!!--------------1111-------------- >Anti-colorectal carcinoma; SWP:Q65ZQ1; PDB:1V7MH; EVKLEESGGGLVQPGGSMKLSCAASGFTFSDAWMDWVRQSPEKGLEWVAEIRSKVNNHAI ------------2222-----------1111---------------------3333---- HYAESVKGRFTVSRDDSKSSVYLQMNSLRAEDTGIYYCSGWSFLYWGQGTLVTVSAAKTT ---------------1111---------1111--------%%%%---------------- PPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTL ------------------------------------iiii-------------------- SSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD ------3333--------------------------- >Thrombopoietin [Precursor; SWP:P40225; PDB:1V7MV; CDLRVLSKLLRDSHVLHSRLSQCPEVHPLPTPVLLPAVDFSLGEWKTQMEETKAQDILGA ---3333----------------------------------3333---3333-------- VTLLLEGVMAARGQLGPTCLSSLLGQLSGQVRLLLGALQSLLGTQLPPQGRTTAHKDPNA ----------1111-----3333---------------------------------3333 IFLSFQHLLRGKVRFLMLVGGSTLC ------------1111--------- >EMS16 A CHAIN; 
SWP:Q7T2Q1; PDB:1V7PA; DFDCPSDWTAYDQHCYLAIGEPQNWYEAERFCTEQAKDGHLVSIQSREEGNFVAQLVSGF ----2222--!!!!-----------------1111---------------------3333 MHRSEIYVWIGLRDRREEQQCNPEWNDGSKIIYVNWKEGESKMCQGLTKWTNFHDWNNIN ------------------------1111--------2222--------1111-------1 CEDLYPFVCKFSAV 111----------- >EMS16 B chain; SWP:Q7T2Q0; PDB:1V7PB; CPLGWSSFDQHCYKVFEPVKNWTEAEEICMQQHKGSRLASIHSSEEEAFVSKLASKALKF -2222--%%%%-----------------11112222------------------------ TSMWIGLNNPWKDCKWEWSDNARFDYKAWKRRPYCTVMVVKPDRIFWFTRGCEKSVSFVC -----------------1111-------------------1111------1111------ KFLTDPA ------- >Integrin alpha-2 [Precurs; SWP:P17301; PDB:1V7PC; LIDVVVVCDESNSIYPWDAVKNFLEKFVQGLDIGPTKTQVGLIQYANNPRVVFNLNTYKT ----------3333-3333--------------1111----------------1111--- KEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSAASGGRRSATKVMVVVTDGESHDGSM ------1111--------------------11111111-1111-------------3333 LKAVIDQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALL --------1111------------1111----------------3333------333333 EKAGTLGEQIFSI 33----------- >HYPOTHETICAL PROTEIN PH19; SWP:O59580; PDB:1V7RA; MKIFFITSNPGKVREVANFLGTFGIEIVQLKHEYPEIQAEKLEDVVDFGISWLKGKVPEP --------------------1111----------------3333--------2222---- FMIEDSGLFIESLKGFPGVYSSYVYRTIGLEGILKLMEGAEDRRAYFKSVIGFYIDGKAY ----------1111--------------------1111----------------%%%%-- KFSGVTWGRISNEKRGTHGFGYDPIFIPEGSEKTFAEMTIEEKNALSHRGKALKAFFEWL -------------------!!!!----2222--3333----------------------- KVNLKY ------ >CHITOBIOSE PHOSPHORYLASE; SWP:Q76IQ9; PDB:1V7WA; MKYGYFDNDNREYVITRPDVPAPWTNYLGTEKFCTVISHNAGGYSFYNSPEYNRVTKFRP -------1111--------------------------1111-------3333-------- NATFDRPGHYVYLRDDDSGDYWSISWQPVAKSLDEAQYQIRHGLSYSKFQCDYNGIHARK -------------------------------1111-------2222------iiii---- TLFVPKGEDAEIWDVVIKNTSDQVRTISAFSFVEFSFSHIQSDNQNHQMSLYSAGTAYRP ----2222------------------------------3333---3333---------22 GLIEYDLYYNTDDFEGFYYLASTFDPDSYDGQRDRFLGLYRDEANPLAVEQGRCSNSAQT 
22----1111---1111--------------3333------3333--------------- CYNHCGSLHKQFTLQPGEEIRFAYILGIGKGNGERLREHYQDVANIDAAFAAIKAHWDER --------------2222----------22223333------------------------ CAKFQVKSPNQGLDTMINAWTLYQAETCVVWSRFASFIEVGGRTGLGYRDTAQDAISVPH 1111------------------------------------------------3333-333 ANPEMTRKRIVDLLRGQVKAGYGLHLFDPDWFDPIHGIKDTCSDDHLWLIPTICKYVMET 3----------------3333-------3333----3333-11113333----------- GETSFFDQMIPYADGGEASVYEHMKAALDFSAEYVGQTGICKGLRADWNDCLNLGGGESS -----------1111--------------------1111---------1111-!!!!--- MVSFLHFWALQEFIDLAKFLGKDQDVNTYTEMAANVREACETHLWDDEGGWYIRGLTKNG --------------------------------------------------------1111 DKIGTAQQQEGRVHLESNTLAVLSGLASQERGEQAMDAVDEHLFSPYGLHLNAPSFSTPN ----1111-------------3333-------------------1111-----------1 DDIGFVTRVYQGVKENGAIFSHPNPWAWVAETKLGRGDRAMKFYDALNPYNQNDIIEKRI 1113333--2222-------1111-------1111------------33331111----- AEPYSYVQFIMGRDHQDHGRANHPWLTGTSGWAYFAVTNYILGVQSGFTGLSVDPCIPSD -----------1111------------------------3333---1111-------111 WPGFEVTRQWRGATYHIQVENPDHVSKGVKSITLNGAPIQGRIPPQAQGSDNQVVVVLG 1--------iiii-------1111---------iiii---------2222--------- >TRYPTOPHAN SYNTHASE ALPHA; SWP:P00928; PDB:1V7YA; MERYESLFAQLKERKEGAFVPFVTLGDPGIEQSLKIIDTLIEAGADALELGIPFATLRAF -----------1111--------2222--------------------------------- AAGVTPAQCFEMLALIRQKHPTIPIGLLMYANLVFNKGIDEFYAQCEKVGVDSVLVADVP -----------------------------------------------------------3 VEESAPFRQAALRHNVAPIFICPPNADDDLLRQIASYGRGYTYLLSRAGALPLNHLVAKL 3333333----1111-------11113333---------------1111--3333----- KEYNAAPPLQGFGISAPDQVKAAIDAGAAGAISGSAIVKIIEQHINEPEKMLAALKVFVQ 1111-----------3333--------------3333----------------------- PMKAATRS ---1111- >CREATININE AMIDOHYDROLASE; SWP:Q52548; PDB:1V7ZA; KSVFVGELTWKEYEARVAAGDCVLMLPVGALEQHGHHMCMNVDVLLPTAVCKRVAERIGA ---3333---------------------------------3333---------------- LVMPGLQYGYKSQQKSGGGNHFPGTTSLDGATLTGTVQDIIRELARHGARRLVLMNGHYE 
------------1111-----------------------------------------333 NSMFIVEGIDLALRELRYAGIQDFKVVVLSYWDFVKDPAVIQQLYPEGFLGWDIEHGGVF 3---------------1111---------3333-----------1111--3333------ ETSLMLALYPDLVDLDRVVDHPPATFPPYDVFPVDPARTPAPGTLSSAKTASREKGELIL --------3333-3333-----------------3333-3333----1111--------- EVCVQGIADAIREEFPP ----------------- >Galactosylgalactosylxylos; SWP:Q9P2W7; PDB:1V84A; LPTIHVVTPTYSRPVQKAELTRMANTLLHVPNLHWLVVEDAPRRTPLTARLLRDTGLNYT ------------1111---------3333------------------------------- HLHVETPRNYKLRIPRGTMQRNLALRWLRETFPRNSSQPGVVYFADDDNTYSLELFEEMR ------3333----2222---------------------------1111--3333---11 STRRVSVWPVAFVGGLRYEAPRVNGAGKVVRWKTVFDPHRPFAIDMAGFAVNLRLILQRS 11----------%%%%-------1111---------1111----1111----------11 QAYFKLRGVKGGYQESSLLRELVTLNDLEPKAANCTKILVWHTRTEKPVLVNEGKKGFTD 11-------2222----3333--3333----%%%%--------------1111------1 PSVEI 111-- >SIMILAR TO RING FINGER PR; SWP:Q8R079; PDB:1V85A; GSSGSSGEHGLLVHKAVDKWTTEEVVLWLEQLGPWASLYRDRFLSERVNGRLLLTLTEEE ----------3333-1111------------------------1111-----------33 FSRAPYTIENSSHRRVILTELERVRSGPSSG 33-------3333------------------ >DNA segment, Chr 7, Wayne; SWP:Q9CYA9; PDB:1V86A; GSSGSSGDAGGGVGKELVDLKIIWNKTKHDVKVPLDSTGSELKQKIHSITGLPPAMQKVM ---------------------------------1111---------------3333---- YKGLVPEDKTLREIKVTSGAKIMVVGSTISGPSSG -----11113333---------------------- >DELTEX PROTEIN 2; SWP:Q8R3P2; PDB:1V87A; GSSGSSGEPEQVIRKYTEELKVAPEEDCIICMEKLAVASGYSDMTDSKALGPMVVGRLTK ---3333---------------------1111-3333--3333-------3333------ CSHAFHLLCLLAMYCNGNKDGSLQCPSCKTIYGEKTGTQPWGKMEVFRSGPSSG -----3333--------------------------------------------- >OXYSTEROL BINDING PROTEIN; SWP:Q9BZF1; PDB:1V88A; GSSGSSGIVMADWLKIRGTLKSWTKLWCVLKPGVLLIYKTQKNGQWVGTVLLNACEIIER ------------------------------------------------------------ PSKKDGFCFKLFHPLEQSIWAVKGPKGEAVGSITQPLPSSYLIIRATSESDGRCWMDALE ------------1111-------------------------------------------- LALKSGPSSG ---------- >HYPOTHETICAL PROTEIN KIAA; 
SWP:P42331; PDB:1V89A; GSSGSSGPIKMGWLKKQRSIVKNWQQRYFVLRAQQLYYYKDEEDTKPQGCMYLPGCTIKE ------------------------------------------------------------ IATNPEEAGKFVFEIIPASWDQNRMGQDSYVLMASSQAEMEEWVKFLRRVAGSGPSSG -----------------------------------3333----------3333----- >HYDROXYETHYLTHIAZOLE KINA; SWP:O58877; PDB:1V8AA; MKFIIEALKRVRERRPLVHNITNFVVMNTTANALLALGASPVMAHAEEELEEMIRLADAV 3333------------------3333-------------------3333----------- VINIGTLDSGWRRSMVKATEIANELGKPIVLDPVGAGATKFRTRVSLEILSRGVDVLKGN --------------------------------2222------------------------ FGEISALLGEEGGEEEAKKLTMNAAREFNTTVAVTGAVDYVSDGRRTFAVYNGHELLGRV -------------------------1111---------------------------1111 TGTGCMVAALTGAFVAVTEPLKATTSALVTFGIAAEKAYEEAKYPGSFHVKLYDWLYRIN -------------3333---------------------1111------------------ ENVIRTYAKVREVE -------------- >ADENOSYLHOMOCYSTEINASE; SWP:P50250; PDB:1V8BA; NKSKVKDISLAPFGKMQMEISENEMPGLMRIREEYGKDQPLKNAKITGCLHMTVECALLI ------1111---------3333------------3333-2222---------------- ETLQKLGAQIRWCSCNIYSTADYAAAAVSTLENVTVFAWKNETLEEYWWCVESALTWGDG --------------------3333---3333----------------------------- DDNGPDMIVDDGGDATLLVHKGVEYEKLYEEKNILPDPEKAKNEEERCFLTLLKNSILKN -------------------------------------1111------------------- PKKWTNIAKKIIGVSEETTTGVLRLKKMDKQNELLFTAINVNDAVTKQKYDNVYGCRHSL ------3333------------------1111-------33331111------------- PDGLMRATDFLISGKIVVICGYGDVGKGCASSMKGLGARVYITEIDPICAIQAVMEGFNV -----------2222--------------------------------------1111--- VTLDEIVDKGDFFITCTGNVDVIKLEHLLKMKNNAVVGNIGHFDDEIQVNELFNYKGIHI -33333333--------------333311112222-------------------2222-- ENVKPQVDRITLPNGNKIIVLARGRLLNLGCATGHPAFVMSFSFCNQTFAQLDLWQNKDT ---2222----1111-----%%%%-3333------3333---------------1111-- NKYENKVYLLPKHLDEKVALYHLKKLNASLTELDDNQCQFLGVNKSGPFKSNEYRY ----------3333--------3333-----------------1111---1111-- >MOAD RELATED PROTEIN; SWP:P83826; PDB:1V8CA; PKVNLYATFRDLTGKSQLELPGATVGEVLENLVRAYPALKEELFEGEGLAERVSVFLEGR 
------3333-------------------------3333----------1111---iiii DVRYLQGLSTPLSPGATLDLFPPVAGGGFERTFGAFPPWLLERYLEEWGGTREGEGVYRL 3333-!!!!---1111-----------------------------1111----2222--2 PGAVVRFREVEPLKVGSLSIPQLRVEVEGEEAERWFERIAFAASR 222-----------!!!!--------------------------- >HYPOTHETICAL PROTEIN (TT1; SWP:P68591; PDB:1V8DA; MEGIRRAAQRAAEEFLQAFPMAPGSLFVLGGSTSEVLGTRPSLEAAHAVLEGLLPPLLER 3333-----------------2222----------------------------------- GVHVAVQACEHLNRALVVERETARAFGKEEVAVFPHPKAGGAKATAAFLRFRDPVMVESL --------3333-----------------------3333--------------------% KAQAHGGMDIGGVLIGMHLRPVAVPLRLSVRKIGEAVLLAAKTRPKLVGGARAVYTREEM %%%-----------3333--------------!!!!-------------1111------- LKKLEEFLP --------- >PANTOATE-BETA-ALANINE LIG; SWP:P83701; PDB:1V8FA; MRTVSTVAELRAALPREGVGFVPTMGYLHRGHLALVERARRENPFVVVSVFVNPLQFGPG ----------1111--------------3333--------------------3333-111 EDYHRYPRDLERDRALLQEAGVDLLFAPGVEEMYPEGFATRVQVEGPLTALWEGAVRPGH 13333-----------------------3333--2222-------3333--3333-2222 FQGVATVVARLFLLVQPQRAYFGEKDYQQLLVVRRMVRDLGFPVEVVGVPTVREEDGLAL ----------------------3333---------------------------1111--- SSRNVYLSPETRKKAPVLYRALLAMREVAGQGGSVAEALRAGEEALRAVPEFRKDYLAIV 1111---3333-----------------1111----------------3333-------- HPETLLPLSDWVAGARGIVAGRFPEARLIDNLEVYP -----------2222-------1111---------- >ANTHRANILATE PHOSPHORIBOS; SWP:P83827; PDB:1V8GA; MDAVKKAILGEVLEEEEAYEVMRALMAGEVSPVRAAGLLVALSLRGERPHEIAAMARAMR ------------------------1111-------------------3333--------1 EAARPLRVHRRPLLDIVGTGGDGKGLMNLSTLAALVAAAGGVAVAKHGNRAASSRAGSAD 111----------------------------------1111------------------- LLEALGVDLEAPPERVGEAIEELGFGFLFARVFHPAMRHVAPVRAELGVRTVFNLLGPLT -------11113333-------------3333------------------3333-3333- NPAGADAYVLGVFSPEWLAPMAEALERLGARGLVVHGEGADELVLGENRVVEVGKGAYAL 1111---------3333------------------------------------------- TPEEVGLKRAPLEALKGGGPEENAALARRLLKGEEKGPLADAVALAAGAGFYAAGKTPSL 3333------3333---------------1111---3333-------------------- 
KEGVALAREVLASGEAYLLLERYVAFLRA ------------------------3333- >SULFUR OXIDATION PROTEIN ; SWP:Q5SME6; PDB:1V8HA; PFRTIARLNPAKPKAGEEFRLQVVAQHPNEPGTRRDAEGKLIPAKYINLVEVYFEGEKVA -------------2222------------------1111--------------%%%%--- EARPGPSTSANPLYAFKFKAEKAGTFTIKLKDTDGDTGEASVKLEL -------------------------------1111----------- >KINESIN-LIKE PROTEIN KIF2; SWP:Q922S8; PDB:1V8KA; PNWEFARMIKEFRVTMECSPLTVTDPIEEHRICVCVRKRPLNKQELAKKEIDVISVPSKC --------------11111111-------------------------------------- LLLVHEPKLKVDLTKYLENQAFCFDFAFDETASNEVVYRFTARPLVQTIFEGGKATCFAY ---------1111---------------1111---------1111--------------- GQTGSGKTHTMGGDLQNASKGIYAMASRDVFLLKNQPRYRNLNLEVYVTFFEIYNGKVFD -2222-----------3333---------------3333--------------%%%%--- LLNKKAKLRVLEDSRQQVQVVGLQEYLVTCADDVIKMINMGSACRTNSSRSHACFQILLR -------------------2222------3333-------3333---------------- TKGRLHGKFSLVDLAGNERMEGAEINKSLLALKECIRALGQFRESKLTQVLRDSFIGENS -------------------3333-------------3333----------3333------ RTCMIAMISPGISSCEYTLNTLRYADRVKELS ----------3333--------------1111 >HYPOTHETICAL PROTEIN PAE2; SWP:Q8ZUJ3; PDB:1V8OA; AVEYLVDASALYALAAHYDKWIKHREKLAILHLTIYEAGNALWKEARLGRVDWAAASRHL -------------111133331111-----3333-----------1111----3333--- KKVSSFKVLEDPPLDEVRVAVERGLTFYDASYAYVAESSGLVLVTQDRELLAKTKGAIDV --------------------1111-3333-------1111-------------2222--- ETLLVRLAAQ ---------- >TT0826; SWP:P84123; PDB:1V8QA; RLGVKRYEGQVVRAGNILVRQRGTRFKPGKNVGMGRDFTLFALVDGVVEFQDRGRLGRYV ------2222--2222------------2222--1111--------------!!!!---- HVRPLA ------ >ADP-RIBOSE PYROPHOSPHATAS; SWP:Q84CU3; PDB:1V8YA; RTYLYRGRILNLALEGRYEIVEHKPAVAVIALREGRMLFVRQMRPAVGLAPLEIPAGLIE ------1111----!!!!--------------iiii-------3333------------2 PGEDPLEAARRELAEQTGLSGDLTYLFSYFVSPGFTDEKTHVFLAENLKEVEIEVVWMRP 222----------------------------3333------------------------- EEALERHQRGEVEFSATGLVGVLYYHAFLR ------1111-------------------- >TRYPTOPHAN SYNTHASE BETA ; SWP:Q8U093; PDB:1V8ZA; 
MWFGEFGGQYVPETLIEPLKELEKAYKRFKDDEEFNRQLNYYLKTWAGRPTPLYYAKRLT --!!!!-----3333--------------------------------------------- EKIGGAKIYLKREDLVHGGAHKTNNAIGQALLAKFMGKTRLIAETGAGQHGVATAMAGAL -----------33332222--------------1111---------------------11 LGMKVDIYMGAEDVERQKMNVFRMKLLGANVIPVNSGSRTLKDAINEALRDWVATFEYTH 11---------------------------------!!!!--------------------- YLIGSVVGPHPYPTIVRDFQSVIGREAKAQILEAEGQLPDVIVACVGGGSNAMGIFYPFV -------------------------------------------------------3333- NDKKVKLVGVEAGGKGLESGKHSASLNAGQVGVFHGMLSYFLQDEEGQIKPTHSIAPGLD -3333-------!!!!-------3333------%%%%------1111--------3333- YPGVGPEHAYLKKIQRAEYVTVTDEEALKAFHELSRTEGIIPALESAHAVAYAMKLAKEM -------------------------------------------3333--------3333- SRDEIIIVNLSGRGDKDLDIVLKVSG 1111---------3333-----1111 ------------------------------------- ------------------------------------- >NSFL1 COFACTOR P47; SWP:O35987; PDB:1V92A; MAEERQDALREFVAVTGAEEDRARFFLESAGWDLQIALASFYEDGG 3333-------3333---3333-----1111--------------- >5,10-METHYLENETETRAHYDROF; SWP:Q9RA47; PDB:1V93A; MKIRDLLKARRGPLFSFEFFPPKDPEGEEALFRTLEELKAFRPAFVSITYGAMGSTRERS --------------------------------------1111---------iiii----- VAWAQRIQSLGLNPLAHLTVAGQSRKEVAEVLHRFVESGVENLLALRGDPPRGERVFRPH -----3333----------2222------------1111-----------2222------ PEGFRYAAELVALIRERYGDRVSVGGAAYPEGHPESESLEADLRHFKAKVEAGLDFAITQ -----3333--------!!!!-------11111111------------------------ LFFNNAHYFGFLERARRAGIGIPILPGIMPVTSYRQLRRFTEVCGASIPGPLLAKLERHQ ----------------------------------------------------------11 DDPKAVLEIGVEHAVRQVAELLEAGVEGVHFYTLNKSPATRMVLERLGLRPA 11------------------------------%%%%--------1111---- >NUCLEAR RECEPTOR COACTIVA; SWP:Q9HCD5; PDB:1V95A; GSSGSSGPVDCSVIVVNKQTKDYAESVGRKVRDLGMVVDLIFLNTEVSLSQALEDVSRGG ---------------------3333------1111------------3333-----3333 SPFAIVITQQHQIHRSCTVNIMFGTPQEHRNMPQADAMVLVARNYERYKNECREKEREEI ---------3333----------------------------------------------- ARQASGPSSG ---------- >HYPOTHETICAL PROTEIN PH05; 
SWP:O58236; PDB:1V96A; PLPPDITFDSLALIKMHSQNMKRILEVTLAKFTVNLSIVTVYRYLTARLKKNIEAEFEIL -----------------1111---------------3333-------------------- KDIYNIVPLLDDIAIKAAQIEANLIKKEITLDMEDIITATTAIYTNSLLVTDDPKRYEPI ------------------------1111------------------------33333333 RRFGLDTMPLDKFIKEVELMVEKELI 1111---------------------- >XANTHINE DEHYDROGENASE; SWP:P80457; PDB:1V97A; ADELVFFVNGKKVVEKNADPETTLLAYLRRKLGLRGTKLGCGEGGCGACTVMLSKYDRLQ -------iiii-------1111------------------------1111---------- DKIIHFSANACLAPICTLHHVAVTTVEGIGSTKTRLHPVQERIAKSHGSQCGFCTPGIVM --------3333-1111-------1111--3333---------1111----1111----- SMYTLLRNQPEPTVEEIEDAFQGNLCRCTGYRPILQGFRTFAKSPSLFNPEEFMPLDPTQ ------------------1111--------3333--3333--------3333----3333 EPIFPPELLRLKDVPPKQLRFEGERVTWIQASTLKELLDLKAQHPEAKLVVGNTEIGIEM ----3333--1111-----------------------------3333------------- KFKNQLFPMIICPAWIPELNAVEHGPEGISFGAACALSSVEKTLLEAVAKLPTQKTEVFR ------------33331111----1111-------3333------------3333----- GVLEQLRWFAGKQVKSVASLGGNIITASPISDLNPVFMASGTKLTIVSRGTRRTVPMDHT ---1111---3333-------------1111----------------2222------111 FFPSYRKTLLGPEEILLSIEIPYSREDEFFSAFKQASRREDDIAKVTCGMRVLFQPGSMQ 1--2222---1111----------2222--------------------------2222-- VKELALCYGGMADRTISALKTTQKQLSKFWNEKLLQDVCAGLAEELSLSPDAPGGMIEFR ----------------------1111----------------------1111-------- RTLTLSFFFKFYLTVLKKLGKDKLDPTYTSATLLFQKHPPANIQLFQEVPNGQSKEDTVG ---------------------------3333------------------11113333222 RPLPHLAAAMQASGEAVYCDDIPRYENELFLRLVTSTRAHAKIKSIDVSEAQKVPGFVCF 2---1111---------3333---1111-------------------3333--2222--- LSADDIPGSNETGLFNDETVFAKDTVTCVGHIIGAVVADTPEHAERAAHVVKVTYEDLPA -3333-------1111-----------2222----------------1111--------- IITIEDAIKNNSFYGSELKIEKGDLKKGFSEADNVVSGELYIGGQDHFYLETHCTIAIPK ----------------------------1111---------------------------- GEEGEMELFVSTQNAMKTQSFVAKMLGVPVNRILVRVKRMGGGFGGKETRSTLVSVAVAL -iiii-----------------------1111----------iiii-------------- 
AAYKTGHPVRCMLDRNEDMLITGGRHPFLARYKVGFMKTGTIVALEVDHYSNAGNSRDLS ------------------------------------1111----------------!!!! HSIMERALFHMDNCYKIPNIRGTGRLCKTNLSSNTAFRGFGGPQALFIAENWMSEVAVTC -------1111---------------------------iiii------------------ GLPAEEVRWKNMYKEGDLTHFNQRLEGFSVPRCWDECLKSSQYYARKSEVDKFNKENCWK --------1111-2222-1111-------------------------------------- KRGLCIIPTKFGISFTVPFLNQAGALIHVYTDGSVLVSHGGTEMGQGLHTKMVQVASKAL ----------------3333---------1111--------------------------- KIPISKIYISETSTNTVPNSSPTAASVSTDIYGQAVYEACQTILKRLEPFKKKNPDGSWE --3333------1111-------%%%%3333----------------------1111--- DWVMAAYQDRVSLSTTGFYRTPNLGYSFETNSGNAFHYFTYGVACSEVEIDCLTGDHKNL ------1111-----------------1111----------------------------- RTDIVMDVGSSLNPAIDIGQVEGAFVQGLGLFTLEELHYSPEGSLHTRGPSTYKIPAFGS ---------------------------------------1111-----3333----1111 IPTEFRVSLLRDCPNKKAIYASKAVGEPPLFLGASVFFAIKDAIRAARAQHTNNNTKELF --------------1111%%%%----3333------------------------1111-- RLDSPATPEKIRNACVDKFTTLCVTGAPGNCKPWSLRV ------------------3333---------------- >THIOREDOXIN; SWP:Q5SI93; PDB:1V98A; GAPLTLVDFFAPWCGPCRLVSPILEELARDHAGRLKVVKVNVDEHPGLAARYGVRSVPTL ----------11113333------------3333------1111-----1111------- VLFRRGAPVATWVGASPRRVLEERLRPYLEGR ---iiii------------------------- >PRECORRIN-8X METHYL MUTAS; SWP:Q53WB0; PDB:1V9CA; RAIEEESFRIVDQEAGPHGFSPLEWPVVRRMIHATADFEYKALTRFSQGAVEAGLKAIQA 3333-------1111-----11113333----------3333------------------ GARILVDARMIACGLNPERLRLFGNEVVELLAHPEVVARTRAEAAVAYAWEKGLLDGAIV -------333333331111-1111-----1111-------------------3333---- GVGNAPTFLLALVEAIRQGARPALVLGMPVGFVNVLEAKRALMEAPVPWIVTEGRKGGST ----3333-------1111----------------------------------------- LVVAALHALIRLAADGGV ------------------ >DIAPHANOUS PROTEIN HOMOLO; SWP:O08808; PDB:1V9DA; VKELKVLDSKTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLK ------------------------------------1111-------------------- MLSELKEEYDDLAESEQFGVVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACE 
----33331111---------1111----------------------------------- ELRKSENFSSLLSFLCKLRDTKSADQKMTLLHFLAELCENDHPEVLKFPDELAHVEKASR -1111--3333-------------3333------------------3333-11113333- VSAENLQKSLDQMKKQIADVERDVQNFPAATDEKDKFVEKMTSFVKDAQEQYNKLRMMHS -----------------------1111--------------------------------- NMETLYKELGDYFVFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRRETEEKMRRAKLAK ---------------3333----------------------------------------- EKAEKERL -------- >CARBONIC ANHYDRASE II; SWP:P00921; PDB:1V9EA; SHHWGYGKHNGPEHWHKDFPIANGERQSPVDIDTKAVVQDPALKPLALVYGEATSRRMVN ------11113333-11113333---------3333---3333------1111------- NGHSFNVEYDDSQDKAVLKDGPLTGTYRLVQFHFHWGSSDDQGSEHTVDRKKYAAELHLV -------------------!!!!---------------1111-----iiii--------- HWNTKYGDFGTAAQQPDGLAVVGVFLKVGDANPALQKVLDALDSIKTKGKSTDFPNFDPG --1111--3333--1111-------------3333-----3333--2222---------1 SLLPNVLDYWTYPGSLTTPPLLESVTWIVLKEPISVSSQQMLKFRTLNFNAEGEPELLML 111---------------------------------3333-3333-----2222------ ANWRPAQPLKNRQVRGFPK ------------------- >RIBOSOMAL LARGE SUBUNIT P; SWP:P33643; PDB:1V9FA; FEPQDIPLDIVYEDEDIIIINKPRDLVVHPGAGNPDGTVLNALLHYYPPIADVPRAGIVH ------------------------------2222-------------------%%%%--- RLDKDTTGLMVVAKTVPAQTRLVESLQRREITREYEAVAIGHMTAGGTVDEPISRHPTKR --1111-------------------1111-----------------------------11 THMAVHPMGKPAVTHYRIMEHFRVHTRLRLRLETGRTHQIRVHMAHITHPLVGDPVYGGR 11---1111--------------------------2222-----1111--2222------ PRPPKGASEAFISTLRKFDRQALHATMLRLYHPISGIEMEWHAPIPQDMVELIEVMRADF ---2222----------------------------------------------------- EEHKDEVDWL ---------- >BOLA-LIKE PROTEIN RIKEN C; SWP:NA; PDB:1V9JA; MKGSSHHHHHHSSGASLVPRGSEGAATMELSADYLREKLRQDLEAEHVEVEDTTLNRCAT -------------------------1111-----------1111---------------- SFRVLVVSAKFEGKPLLQRHRLVNECLAEELPHIHAFEQKTLTPEQWTRQRRE --------3333--3333---------3333-----------33333333--- >RIBOSOMAL LARGE SUBUNIT P; SWP:P23851; PDB:1V9KA; DVIYEDDHILVLNKPSGTAVHGGSGLSFGVIEGLRALRPEARFLELVHRLDRDTSGVLLV 
--------------2222--------------------------------1111------ AKKRSALRSLHEQLREKGQKDYLALVRGQWQSHVKSVQAPLLKNILQSGERIVRVSQEGK ------------------------------3333-----------1111------1111- PSETRFKVEERYAFATLVRCSPVTGRTHQIRVHTQYAGHPIAFDDRYGDREFDRQLTEAG -------------------------2222------------------------------- TGLNRLFLHAAALKFTHPGTGEVRIEAPDEGLKRCLQKRNAR ------------------------------------------ >GLUTAMATE DEHYDROGENASE; SWP:Q9Y8I4; PDB:1V9LA; TGFLEYVLNYVKKGVELGGFPEDFYKILSRPRRVLIVNIPVRLDGGGFEVFEGYRVQHCD 3333-------------------------------------------------------- VLGPYKGGVRFHPEVTLADDVALAILMTLKNSLAGLPYGGAKGAVRVDPKKLSQRELEEL ------------------------------------------------------------ SRGYARAIAPLIGDVVDIPAPDVGTNAQIMAWMVDEYSKIKGYNVPGVFTSKPPELWGNP -------3333----------1111-------------------1111----3333---- VREYATGFGVAVATREMAKKLWGGIEGKTVAIQGMGNVGRWTAYWLEKMGAKVIAVSDIN ----3333----------------2222------------------1111---------- GVAYRKEGLNVELIQKNKGLTGPALVELFTTKDNAEFVKNPDAIFKLDVDIFVPAAIENV ----3333-----3333---------3333-----------3333--------------- IRGDNAGLVKARLVVEGANGPTTPEAERILYERGVVVVPDILANAGGVIMSYLEWVENLQ ----3333------------------------------1111-------------3333- WYIWDEEETRKRLENIMVNNVERVYKRWQREKGWTMRDAAIVTALERIYNAMKIRGWI --------------------------1111----------------------3333-- >V-TYPE ATP SYNTHASE SUBUN; SWP:P74902; PDB:1V9MA; FAYLNARVRVRRGTLLKESFFQEALDLSFADFLRLLSETVYGGELAGQGLPDVDRAVLRT ----------3333--3333------------------3333------3333-------- QAKLVGDLPRLVTGEAREAVRLLLLRNDLHNLQALLRAKATGRPFEEVLLLPGTLREEVW ------3333----------------------------1111-3333--------3333- RQAYEAQDPAGMAQVLAVPGHPLARALRAVLRETQDLARVEALLAKRFFEDVAKPALRDY ----------------1111---------------------------------------- LALEVDAENLRTAFKLQGSGLAPDAFFLKGGRFVDRVRFARLMEGDYAVLDELSGTPFSG ---------------2222--3333----------------1111-3333--1111-111 LSGVRDLKALERGLRCVLLKEAKKGVQDPLGVGLVLAYVKEREWEAVRLRLLARRAYFGL 1--------------------------1111----------------------------- PRAQVEEEVVCP -----1111--- >MALATE DEHYDROGENASE; 
SWP:O59028; PDB:1V9NA; FEKGYVDENYIRVPKDRLFSFIVRVLTKLGVPEEDAKIVADNLVADLRGVESHGVQRLKR ------1111---3333---------1111---------------111133333333--- YVDGIISGGVNLHPKIRVIREGPSYALIDGDEGLGQVVGYRSKLAIKKAKDTGIGIVIAR ---------------------1111----%%%%--------------------------- NSNHYGIAGYYALAAEEGIGISTNSRPLVAPTGGIERILGTNPIALAAPTKDKPFLLDAT ------3333---3333-------------2222-------------------------- SVVPIGKLEWAINREGNITTKVEEVFNGGALLPLGGFGELLGGHKGYGLSLVDILSGILS ---3333---------------3333------------1111---------------111 GGTWSKYVKNTSEKGSNVCHFFVIDIEHFIPLEEFKEKISQIEEIKSSRKHPEFERIWIH 1--3333--3333-----------3333----------------------3333----22 GEKGFLTETRLKLGIPIYRKVLEELNEIAKRVGVEGL 22----------------------------------- >CYCLOPHILIN B; SWP:P20752; PDB:1V9TA; AKGDPHVLLTTSAGNIELELDKQKAPVSVQNFVDYVNSGFYNNTTFHRVIPGFMIQGGGF ----------1111------------------------1111-------2222------- TEQMQQKKPNPPIKNEADNGLRNTRGTIAMARTADKDSATSQFFINVADNAFLDHGQRDF 1111-------------------2222----------------------1111--1111- GYAVFGKVVKGMDVADKISQVPTHDVGPYQNVPSKPVVILSATVLP -----------------1111----!!!!----------------- >KIAA0561 PROTEIN; SWP:O60307; PDB:1V9VA; GSSGSSGPKATAQMEGRLQEFLTAYAPGARLALADGVLGFIHHQIVELARDCLAKSGENL ------------------------------------3333--------------3333-- VTSRYFLEMQEKLERLLQDAHERSDSEEVSFIVQLVRKLLIIISRPARSGPSSG ------------------------3333---------3333------------- >PUTATIVE 42-9-9 PROTEIN; SWP:NA; PDB:1V9WA; GSEGAATMATFEEVSVLGFEEFDKAVKEHESKTIFAYFSGSKDTEGKSWCPDCVEAEPVI --------------------------1111------------3333-----3333----- REGLKHVTEDCVFIYCQVGDKPYWKDPNNDFRQKLKITAVPTLLKYGTPQKLVESECCQS --3333-------------1111--11111111--------------------1111-33 SLVEMIFSED 33-------- >POLY (ADP-RIBOSE) POLYMER; SWP:NA; PDB:1V9XA; GSSGSSGHKPWRAEYAKSSRSSCKTCKSVINKENFRLGKLVQSTHFDGIMPMWNHASCIL ------------------------------------------------------111133 KKTKQIKSVDDVEGIESLRWEDQQKIRKYVESGAGSNTSTSTGTSTSSSGPSSG 33-----------1111--------1111------%%%%--------------- >HEME PAS SENSOR PROTEIN; SWP:Q7AE17; PDB:1V9YA; 
GIFFPALEQNMMGAVLINENDEVMFFNPAAEKLWGYKREEVIGNNIDMLIPRDLRPAHPE ------1111-------1111---------------333322223333--1111------ YIRHNRERELQLEKKDGSKIWTRFALSKVSAEGKVYYLALVRD -------------1111-------------iiii--------- >UROPORPHYRIN-III C-METHYL; SWP:Q5SKH6; PDB:1VA0A; GRVYLVGAGPGDPELLTLKAYRLLKEAPVVLYDRLVDERVLALAPGEKVYVGKEEKQEEI -----------3333-----------------111133331111---------------- HRLLLRHARAHPFVVRLKGGDPMVFGRGGEEVLFLLRHGVPVEVVPGVTSLLASGLPLTH -------1111---------1111-----------1111---------3333-------2 RGLAHGFAAVSGVLEGGGYPDLRPFARVPTLVVLMGVGRRVWIAKELLRLGRDPREPTLF 222----------2222----3333----------3333--------1111-1111---- VERASTPKERRVHARLEEVAEGKVEVRPPALWILGEVVRVF -----1111-----33331111-------------3333-- >TRANSCRIPTION FACTOR SP1; SWP:P08047; PDB:1VA1A; MDPGKKKQHICHIQGCGKVYGKTSHLRAHLRWHTGER ---------------------3333------------ >ARYLESTERASE; SWP:P22862; PDB:1VA4A; STFVAKDGTQIYFKDWGSGKPVLFSHGWLLDADMWEYQMEYLSSRGYRTIAFDRRGFGRS ----1111------------------2222-----------1111--------2222--- DQPWTGNDYDTFADDIAQLIEHLDLKEVTLVGFSMGGGDVARYIARHGSARVAGLVLLGA ------------------------------------------------1111-------- VTPLFGQKPDYPQGVPLDVFARFKTELLKDRAQFISDFNAPFYGINKGQVVSQGVQTQTL -------1111----3333------------------------1111------------- QIALLASLKATVDCVTAFAETDFRPDMAKIDVPTLVIHGDGDQIVPFETTGKVAAELIKG --3333-----------------3333------------------3333--------222 AELKVYKDAPHGFAVTHAQQLNEDLLAFLKR 2----2222--3333---------------- >GLUTAMATE--CYSTEINE LIGAS; SWP:P06980; PDB:1VA6A; MIPDVSQALAWLEKHPQALKGIQRGLERETLRVNADGTLATTGHPEALGSALTHKWITTD --------------1111---------------1111-------3333-3333------- FAEALLEFITPVDGDIEHMLTFMRDLHRYTARNMGDERMWPLSMPSYIAEGQDIELAQYG -1111------------------------1111!!!!-----------2222-------- TSNTGRFKTLYREGLKNRYGALMQTISGVHYNFSLPMAFWQAKSGDISGADAKEKISAGY --------------------3333-----------3333--------3333--------- FRVIRNYYRFGWVIPYLFGASPAISSSFLTSLPFEKTESGMYYLPYATSLRLSDLGYTNK ------------------------3333--------1111---1111-33331111--33 
SQSNLGITFNDLYEYVAGLKQAIKTPSEEYAKIGIEKDGKRLQINSNVLQIENELYAPIR 33-----------------3333----3333-----iiii----------3333------ PKRVTRSGESPSDALLRGGIEYIEVRSLDINPFSPIGVDEQQVRFLDLFMVWCALADAPE -----2222---------------------1111-------------------------- MSSSELACTRVNWNRVILEGRKPGLTLGIGCETAQFPLPQVGKDLFRDLKRVAQTLDSIN ------------------1111------!!!!---------------------------- GGEAYQKVCDELVACFDNPDLTFSARILRSMIDTTGKAFAEAYRNLLREEPLEILREEDF --------------33331111---------------------------------3333- VAEREASERRQQEMEAADTEPFAVWLE --------------------3333--- >MAGUK P55 SUBFAMILY MEMBE; SWP:Q9JLB2; PDB:1VA8A; GSSGSSGPITDERVYESIGHYGGETVKIVRIEKARDIPLGATVRNEMDSVIISRIVKGGA ------------------------------------------------------------ AEKSGLLHEGDEVLEINGIEIRGKDVNEVFDLLSDMHGTLTFVLIPSSGPSSG --------------------------------3333----------------- >Down syndrome cell adhesi; SWP:Q8TD84; PDB:1VA9A; GSSGSSGISTEEAAPDGPPMDVTLQPVTSQSIQVTWKAPKKELQNGVIRGYQIGYRENSP ----------------------------------------3333--------------22 GSNGQYSIVEMKATGDSEVYTLDNLKKFAQYGVVVQAFNRAGTGPSSSEINATTLESGPS 22------------------------------------3333------------------ SG -- >RHOPHILIN, RHO GTPASE BIN; SWP:Q8BWR8; PDB:1VAEA; GSSGSSGSASKRWSPPRGIHFTVEEGDLGFTLRGNTPVQVHFLDPHCSASLAGAKEGDYI -------------------------------------------11113333--------- VSIQGVDCKWLTVSEVMKLLKSFGGEEVEMKVVSLLDSTSSMHNKSGPSSG --iiii-------------3333---------------------------- >HYPOTHETICAL PROTEIN PH00; SWP:O57770; PDB:1VAJA; FKIKDEWGEFLVRLARRAIEEYLKTGKEIEPPKDTPPELWEKGVFVTLNRYNVPPQTALR -------------------------------11113333--------------1111--- GCIGFPTPIYPLVEATIKAAIYSAVDDPRFPPVKLEEDNLVVEVSVLTPPELIEGPPEER --------------------------3333---3333------------------11113 PRKIKVGRDGLIVEKGIYSGLLLPQVPVEWGWDEEEFLAETCWKAGLPPDCWLDEDTKVY 333-2222------!!!!----3333----------------1111-1111--1111--- KFTAEIFEEEYPRGPIKRKPL ----------2222------- >PHOSPHOLIPASE A2; SWP:P51972; PDB:1VAPA; NLFQFEKLIKKMTGKSGMLWYSAYGCYCGWGGQGRPKDATDRCCFVHDCCYGKVTGCNPK 
---------------3333------------------3333---------1111---333 MDIYTYSVDNGNIVCGGTNPCKKQICECDRAAAICFRDNLKTYDSKTYWKYPKKNCKEES 3-------iiii--------------------------3333-3333----3333----- EPC --- >ALGINATE LYASE PA1167; SWP:Q9I4H0; PDB:1VAVA; PDLSTWNLTIPQGRPAITISTSQLQRDYRSDYFQRTADGIRFWVPVNGSHTRNSEFPRSE -----------------------------1111--1111-----1111--1111------ LRETLSSGRPYNWRYARADNWLEATLRIEAVPSTRRMIIGQIHSDGSNSGQAAPLVKLLY ----1111-----------------------3333------------------------- QLRLDQGRVQALVRERPDDGGTRAYTLMDGIPLGQPFSYRIGVSRSGLLSVSVNGSALEQ --!!!!---------1111------------2222--------1111-----iiii---- QLDPQWAYQGLYFKAGLYLQDNRGPSSEGGRATFSELRVSHQ --3333------------------1111-------------- >URIC ACID OXIDASE; SWP:NA; PDB:1VAXA; TKVVLGQNQYGKAEVRLVKVTRNTARHEIQDLNVTSQLRGDFEAAHTAGDNAHVVATDTQ --------------------------------------------------1111------ KNTVYAFARDGFATTEEFLLRLGKHFTEGFDWVTGGRWAAQQFFWDRINDHDHAFSRNKS ------3333-------------------1111--------------%%%%--------- EVRTAVLEISGSEQAIVAGIEGLTVLKSTGSEFHGFPRDKYTTLQETTDRILATDVSARW ---------!!!!-------------------------1111------------------ RYNTVEVDFDAVYASVRGLLLKAFAETHSLALQQTMYEMGRAVIETHPEIDEIKMSLPNK ----------------------------------------------1111---------- HHFLVDLQPFGQDNPNEVFYAADRPYGLIEATIQREGSRADHPIWSN ------3333------------------------2222----3333- >NSFL1 COFACTOR P47; SWP:O35987; PDB:1VAZA; ERRRHSGQDVHVVLKLWKTGFSLDNGDLRSYQDPSNAQFLESIRRGEVPAELRRLAHGGQ -----------------------------1111----------------3333------- VNLDMEDHRDEDFVKP ---------------- >COBROTOXIN B; SWP:P80958; PDB:1VB0A; LECHNQQSSQTPTTKTCSGETNCYKKWWSDHRGTIIERGCGCPKVKPGVNLNCCTTDRCN ------!!!!-------------------1111------------2222------2222- N - >THREONINE SYNTHASE; SWP:P00934; PDB:1VB3A; MKLYNLKDHNEQVSFAQAVTQGLGKNQGLFFPHDLPEFSLTEIDEMLKLDFVTRSAKILS ----1111----------------%%%%-----------------1111----------- AFIGDEIPQEILEERVRAAFAFPAPVANVESDVGCLELFHGPTLAFKDFGGRFMAQMLTH ---33333333------------------------------------------------- 
IAGDKPVTILTATSGDTGAAVAHAFYGLPNVKVVILYPRGKISPLQEKLFCTLGGNIETV -!!!!-------------------2222---------2222------------!!!!--- AIDGDFDACQALVKQAFDDEELKVALGLNSANSINISRLLAQICYYFEAVAQLPQETRNQ --------------1111-3333--------3333--------------11113333--- LVVSVPSGNFGDLTAGLLAKSLGLPVKRFIAATNVNDTVPRFLHDGQWSPKATQATLSNA -------------------3333----------------------------------333 MDVSQPNNWPRVEELFRRKIWQLKELGYAAVDDETTQQTMRELKELGYTSEPHAAVAYRA 3----1111------------3333-------------------------3333------ LRDQLNPGEYGLFLGTAHPAKFKESVEAILGETLDLPKELAERADLPLLSHNLPADFAAL -----2222--------3333---------------3333--1111-------------- RKLMMNHQ ---1111- >TRANSLATION INITIATION FA; SWP:O58185; PDB:1VB5A; LPERVLEILREMKRERIKGASWLAKKGAEAFLTLAEELDESLLEDAIMELREEVVKVNPS --------------------------------------3333---------------111 MASLYNLARFIPVTNRRDILKSRALEFLRRMEEAKRELASIGAQLIDDGDVIITHSFSST 1--------------3333-----------------------33332222---------- VLEIIRTAKERKKRFKVILTESSPDYEGLHLARELEFSGIEFEVITDAQMGLFCREASIA --------1111-------------3333------1111------3333-1111------ IVGADMITKDGYVVNKAGTYLLALACHENAIPFYVAAETYKFHPTLKSGDVMLMERDLIR -------1111----2222------------------1111-----3333---------% GNVRIRNVLFDVTPWKYVRGIITELGIVIPPRDI %%%----------3333-----1111----1111 >PDZ AND LIM DOMAIN 2; SWP:NA; PDB:1VB7A; GSSGSSGLTVDVAGPAPWGFRISGGRDFHTPIIVTKVTERGKAEAADLRPGDIIVAINGQ -------------------------1111-------------3333----------iiii SAENMLHAEAQSKIRQSASPLRLQLDRSSGPSSG -----1111------------------------- >231aa long hypothetical p; SWP:Q972K9; PDB:1VBFA; ASEKEEILRKIKTQELAEAFNKVDRSLFLPENLKDYAYAHTHEALPILPGINTTALNLGI -------3333-------------3333-33331111--1111----2222--------- FLDELDLHKGQKVLEIGTGIGYYTALIAEIVDKVVSVEINEKYNYASKLLSYYNNIKLIL -1111--2222------!!!!-----3333-----------------1111--------- GDGTLGYEEEKPYDRVVVWATAPTLLCKPYEQLKEGGIILPIGVGRVQKLYKVIKKGNSP -3333-3333-----------------------1111------------------!!!!- SLENLGEVFGRIGGLYGFYDDYDDIEFRVNKLERQIKSIL ---------------------------------------- >PYRUVATE,ORTHOPHOSPHATE D; 
SWP:P11155; PDB:1VBGA; KKRVFHFGKGKSEGNKTMKELLGGKGANLAEMASIGLSVPPGFTVSTEACQQYQDAGCAL -------2222---1111------------------------------------------ PAGLWAEIVDGLQWVEEYMGATLGDPQRPLLLSVRSGAAVSMPGMMDTVLNLGLNDEVAA 2222--------------------3333-------------------------------- GLAAKSGERFAYDSFRRFLDMFGNVVMDIPRSLFEEKLEHMKESKGLKNDTDLTASDLKE -------3333------------------3333----------------1111------- LVGQYKEVYLSAKGEPFPSDPKKQLELAVLAVFNSWESPRAKKYRSINQITGLRGTAVNV -------------------3333----------3333----------------------- QCMVFGNMGNTSGTGVLFTRNPNTGEKKLYGEFLVNAQGEDVVAGIRTPEDLDAMKNLMP -------------------------------------33333333-----3333------ QAYDELVENCNILESHYKEMQDIEFTVQENRLWMLQCRTGKRTGKSAVKIAVDMVNEGLV ------------------------------------------------------------ EPRSAIKMVEPGHLDQLLHPQFENPSAYKDQVIATGLPASPGAAVGQVVFTAEDAEAWHS 33331111-----3333------33331111---------------------------11 QGKAAILVRAETSPEDVGGMHAAVGILTERGGMTSHAAVVARGWGKCCVSGCSGIRVNDA 11----------1111---------------1111------1111------------333 EKLVTIGGHVLREGEWLSLNGSTGEVILGKQPLSPPALSGDLGTFMAWVDDVRKLKVLAN 3----iiii--2222--------------------------------------------- ADTPDDALTARNNGAQGIGLCRTEHMFFASDERIKAVRQMIMAPTLELRQQALDRLLPYQ ----------1111-------33331111------------------------------- RSDFEGIFRAMDGLPVTIRLLDPPLHEFLPEGNIEDIVSELCAETGANQEDALARIEKLS --------1111-------------1111------------------------------- EVNPMLGFRGCRLGISYPELTEMQARAIFEAAIAMTNQGVQVFPEIMVPLVGTPQELGHQ --3333--!!!!-----------------------1111--------------------- VTLIRQVAEKVFANVGKTIGYKVGTMIEIPRAALVADEIAEQAEFFSFGTNDLTQMTFGY ----------------------------3333----3333-------------------- SRDDVGKFIPVYLAQGILQHDPFEVLDQRGVGELVKFATERGRKARPNLKVGICGEHGGE 33333333----1111----1111---------------------1111-----3333-- PSSVAFFAKAGLDYVSCSPFRVPIARLAAAQVLV -------1111------1111---------1111 >TYPE 2 MALATE/LACTATE DEH; SWP:Q746L8; PDB:1VBIA; MRWRADFLSAWAEALLRKAGADEPSAKAVAWALVEADLRGVGSHGLLRLPVYVRRLEAGL ----------------1111----------------111133333333------------ 
VNPSPTLPLEERGPVALLDGEHGFGPRVALKAVEAAQSLARRHGLGAVGVRRSTHFGMAG -----------!!!!------------------------------------------333 LYAEKLAREGFVAWVTTNAEPDVVPFGGREKALGTNPLAFAAPAPQGILVADLATSESAM 3-----1111--------------2222---------------1111-----------33 GKVFLAREKGERIPPSWGVDREGSPTDDPHRVYALRPLGGPKGYALALLVEVLSGVLTGA 33-----------3333--1111----3333------------------------1111- GVAHGIGRMYDEWDRPQDVGHFLLALDPGRFVGKEAFLERMGALWQALKATPPAPGHEEV -!!!!--1111---------------3333-----------------1111--2222--- FLPGELEARRRERALAEGMALPERVVAELKALGERYGVPW -2222----------------------------1111--- >PROSTAGLANDIN F SYNTHASE; SWP:NA; PDB:1VBJA; MLTQSLKLSNGVMMPVLGFGMWKLQDGNEAETATMWAIKSGYRHIDTAAIYKNEESAGRA -------1111------------------------------------3333--------- IASCGVPREELFVTTKLWNSDQGYESTLSAFEKSIKKLGLEYVDLYLIHWPGKDKFIDTW ------3333-------3333--------------------------------------- KAFEKLYADKKVRAIGVSNFHEHHIEELLKHCKVAPMVNQIELHPLLNQKALCEYCKSKN --------------------3333---1111------------1111---------1111 IAVTAWSPLGQGHLVEDARLKAIGGKYGKTAAQVMLRWEIQAGVITIPKSGNEARIKENG ------1111------------3333-------------------------------111 NIFDFELTAEDIQVIDGMNAGHRYGPDPEVFMNDF 1--------------1111-------3333-2222 >HYPOTHETICAL PROTEIN PH13; SWP:O59053; PDB:1VBKA; MNVVIVRYGEIGTKSRQTRSWFEKILMNNIREALVTEEVPYKEIFSRHGRIIVKTNSPKE ---------2222---------------------------------iiii---------- AANVLVRVFGIVSISPAMEVEASLEKINRTALLMFRKKAKEVGKERPKFRVTARRITKEF ---33332222------------------------------------------------- PLDSLEIQAKVGEYILNNENCEVDLKNYDIEIGIEIMQGKAYIYTEKIKGWGGLPIGTEG --------------1111------------------iiii--------------2222-- RMIGILHDELSALAIFLMMKRGVEVIPVYIGKDDKNLEKVRSLWNLLKRYSYGSKGFLVV ---------------------------------3333--------3333----------- AESFDRVLKLIRDFGVKGVIKGLRPNDLNSEVSEITEDFKMFPVPVYYPLIALPEEYIKS --3333-----------------3333-1111---------------3333--------- VKERLGL ------- >PECTATE LYASE 47; SWP:Q9AJM4; PDB:1VBLA; KELGHEVLKPYDGWAAYGEGTTGGAMASPQNVFVVTNRTELIQALGGNNHTNQYNSVPKI 
-3333-------3333!!!!-!!!!--1111-----------1111-3333--------- IYVKGTIDLNVDDNNQPVGPDFYKDPHFDFEAYLREYDPATWGKKEVEGPLEEARVRSQK -----------1111---3333--1111---------3333!!!!--------------- KQKDRIMVYVGSNTSIIGVGKDAKIKGGGFLIKNVDNVIIRNIEFEAPLDYFPEWDPTDG ------------------!!!!-------------------------------------- TLGEWNSEYDSISIEGSSHIWIDHNTFTDGDHPDRSLGTYFGRPFQQHDGALDIKNSSDF ----------------------------!!!!3333---%%%%----------------- ITISYNVFTNHDKVTLIGASDSRMADSGHLRVTLHHNYYKNVTQRLPRVRFGQVHIYNNY -------------------11111111--------------------------------- YEFSNLADYDFQYAWGVGVFSQIYAQNNYFSFDWDIDPSLIIKVWSKNEESMYETGTIVD ---1111----------2222---------------1111-------------------- LPNGRRYIDLVASYNESNTLQLKKEVTWKPMFYHVIHPTPSVPALVKAKAGAGNLH 1111----------1111-------------------3333---------2222-- >ARTOCARPIN; SWP:Q7M1T4; PDB:1VBOA; SQTITVGPWGGPGGNGWDDGSYTGIRQIELSYKEAIGSFSVIYDLNGDPFSGPKHTSKLP --------------------------------------------iiii------------ YKNVKIELKFPDEFLESVSGYTGPFSALATPTPVVRSLTFKTNKGRTFGPYGDEEGTYFN ------------------------3333-------------1111--------------- LPIENGLIVGFKGRTGDLLDAIGIHMSL ---------------------------- >ENDO-1,4-BETA-XYLANASE B; SWP:Q9WXS5; PDB:1VBUA; VSLRELAEKLNIYIGFAAINNFWSLSDAEKYMEVARREFNILTPENQMKWDTIHPERDRY -------1111---------3333-------------------------------1111- NFTPAEKHVEFAEENDMIVHGHTLVWHNQLPGWITGREWTKEELLNVLEDHIKTVVSHFK -3333-------1111--------------3333------------------------22 GRVKIWDVVNEAVSDSGTYRESVWYKTIGPEYIEKAFRWAKEADPDAILIYNDYSIEEIN 22-------------------3333------------------1111------------- AKSNFVYNMIKELKEKGVPVDGIGFQMHIDYRGLNYDSFRRNLERFAKLGLQIYITEMDV -----------------------------1111--------------------------- RIPLSGSEEYYLKKQAEVCAKIFDICLDNPAVKAIQFWGFTDKYSWVPGFFKGYGKALLF ----------------------------3333--------3333-3333-2222------ DENYNPKPCYYAIKEVLEKKIEER 1111-------------------- >HYPOTHETICAL PROTEIN B096; SWP:P0AB20; PDB:1VBVA; ASKFGIGQQVRHSLLGYLGVVVDIDPVAAPWYHVVMEDDNGLPVHTYLAEAQLSSELQDE 
----2222----------------------------------------3333-------- HPEQPSMDELAQTIRKQ 1111------------- >TRYPSIN INHIBITOR BGIT; SWP:Q7M1Q1; PDB:1VBWA; SRCQGKSSWPQLVGSTGAAAKAVIERENPRVRAVIIKVGSGATKDFRCDRVRVWVTERGI --------1111---------------1111-----2222------1111-----1111- VARPPTIG -------- >PUTATIVE ANTI-SIGMA FACTO; SWP:Q9X1F5; PDB:1VC1A; MNNLKLDIVEQDDKAIVRVQGDIDAYNSSELKEQLRNFISTTSKKKIVLDLSSVSYMDSA -----------------------1111----------1111---------1111------ GLGTLVVILKDAKINGKEFILSSLKESISRILKLTHLDKIFKITDTVEEA -----------------------------------3333------3333- >GLYCERALDEHYDE 3-PHOSPHAT; SWP:P84125; PDB:1VC2A; MKVGINGFGRIGRQVFRILHERGVEVALINDLTDNKTLAHLLKYDSTYGRFPGAVGYDEE --------3333-------1111----------------------------------111 NLYVDGKAIRATAIKDPREIPWKQAGVGVVVESTGVFTDGEKARAHLEAGAKKVIITAPA 1--------------3333-------------------3333--3333------------ KNEDITVVLGVNHEQYDPAKHHILSNASCTTNSLAPVMKVLEKAFGVEKALMTTVHSYTN -------22223333-3333-------------3333---------------------11 DQRLLDLPHKDLRRARAAALNIIPTTTGAAKATALVLPSLKGRFDGMALRVPTPTGSISD 11--------------1111-------3333-----3333-------------------- ITALLKREVTAEEVNAALKAAAEGPLKGILAYTEDEIVLRDIVMDPHSSIVDGKLTKAIG ---------3333------------2222--------33332222------3333---!! 
NLVKVFAWYDNEWGYANRVADLVELVLKKGV !!----------------------------- >Aspartate 1-decarboxylase; SWP:Q72L22; PDB:1VC3B; VTVDQDLLDAAGILPFEQVDIYDITNGARLTTYALPGERGSGVIGINGAAAHLVKPGDLV -------------2222--------------------2222--------3333-2222-- ILVAYGVFDEEEARNLKPTVVLVDERNRILEVRKG --------33331111-------1111-------- >INDOLE-3-GLYCEROL PHOSPHA; SWP:P84126; PDB:1VC4A; MRPDLSRVPGVLGEIARKRASEVAPYPLPEPPSVPSFKEALLRPGLSVIAEVKRQSPSEG ----1111----------3333-----------------1111------------3333- LIREVDPVEAALAYARGGARAVSVLTEPHRFGGSLLDLKRVREAVDLPLLRKDFVVDPFM --------------1111---------------3333-------------------3333 LEEARAFGASAALLIVALLGELTGAYLEEARRLGLEALVEVHTERELEIALEAGAEVLGI ----1111------3333!!!!--------1111-------------------------- NNRDLATLHINLETAPRLGRLARKRGFGGVLVAESGYSRKEELKALEGLFDAVLIGTSLM ----------1111------------------------3333-1111--------3333- RAPDLEAALRELVG -------------- >HUMAN VASCULAR CELL ADHES; SWP:P19320; PDB:1VCAA; FKIETTPESRYLAQIGDSVSLTCSTTGCESPFFSWRTQIDSPLNGKVTNEGTTSTLTMNP -------------2222-------------------1111---------!!!!------- VSFGNEHSYLCTATCESRKLEKGIQVEIYSFPKDPEIHLSGPLEAGKPITVKCSVADVYP -1111---------!!!!-------------------------2222------------1 FDRLEIDLLKGDHLMKSQEFLEDADRKSLETKSLEVTFTPVIEDIGKVLVCRAKLHIDEM 111-------------------------------------1111---------------- DSVPTVRQAVKELQVYISP ------------------- >DNA TOPOISOMERASE I; SWP:P08585; PDB:1VCC; MRALFYKDGKLFTDNNFLNPVSDDNPAYEVLQHVKIPTHLTDVVVYEQTWEEALTRLIFV ------iiii---1111----111133333333---1111--------3333-------- GSDSKGRRQYFYGKMHV --1111------1111- >NDX1; SWP:Q75UV1; PDB:1VCDA; MELGAGGVVFNAKREVLLLRDRMGFWVFPKGHPEPGESLEEAAVREVWEETGVRAEVLLP ----------1111------1111---------2222----------------------- LYPTRYVNPKGVEREVHWFLMRGEGAPRLEEGMTGAGWFSPEEARALLAFPEDLGLLEVA -------1111------------------2222----------------3333------- LERLPL ------ >ISOPENTENYL-DIPHOSPHATE D; SWP:Q746I8; PDB:1VCFA; KTTTGLEGFRLRYQALAGLALSEVDLTTPFLGKTLKAPFLIGATENGERINLALAEAAEA ----1111-----1111--3333------------------------------------- 
LGVGLGSGRILLERPEALRSFRVRKVAPKALLIANLGLAQLRRYGRDDLLRLVELEADAL --------3333-1111-----33331111-------3333---3333------------ AFHVNPLQEAVQRGDTDFRGLVERLAELLPLPFPVVKEVGHGLSREAALALRDLPLAAVD ---------1111-----------------------------------1111-------- VAGAGGTSWARVEEWVELCEIGIPTARAILEVREVLPHLPLVASGGVYTGTDGAKALALG -----------3333-----------------------------------------1111 ADLLAVARPLLRPALEGAERVAAWIGDYLEELRTALFAIGARNPKEARGRVERV ------3333-3333-3333----------------1111--33332222---- >PHOSPHORIBOSYLTRANSFERASE; SWP:Q5SHW7; PDB:1VCHA; ETYPITVGGVTRHVPLIEPLPGRRIPLVEFLGDPEFTRAAAEALRPLVPKEAEILFTTET ------iiii---------2222------2222----------3333-1111-------3 SPIPLTHVLAEALGLPYVVARRRRRPYMEDPIIQEVQTGEVLWLDRRFAEKLLNQRVVLV 333---------------------2222----------------3333-1111------- SDVVASGETMRAMEKMVLRAGGHVVARLAVFRQGTPGLAVDTVAELPVL ------------------------------------------------- >HEMOLYTIC LECTIN CEL-III; SWP:Q868M7; PDB:1VCLA; VLCTNPLDIGELRSFKSKQCVDIVGNQGSGNIATYDCDGLSDQQIIICGDGTIRNEARNY ---------------------------------------3333----1111---3333-- CFTPDGSGNANVMSSPCTLYPEIPSSQRWRQGRRKTFTDNGGIEQVATEIINLASGKCLD -----------------------3333-----------1111------------------ IEGSDGTGDIGVYDCQNLDDQYFYVRSRGPELFYGRLRNEKSDLCLDVEGSDGKGNVLMY -----------------1111--------------------------------------- SCEDNLDQWFRYYENGEIVNAKSGMCLDVEGSDGSGNVGIYRCDDLRDQMWSRPNAYCNG ----1111---------------------------------------------3333-!! 
DYCSFLNKESNKCLDVSGDQGTGDVGTWQCDGLPDQRFKWVFDDWEVPTATWNMVGCDQN !!---------------------------------------------------------- GKVSQQISNTISFSSTVTAGVAVEVSSTIEKGVIFAKATVSVKVTASLSKAWTNSQSGTT ----------------------------3333--%%%%-------------1111----- AITYTCDNYDSDEEFTRGCMWQLAIETTEVKSGDLLVWNPQIVKCTRSNTAPGCAPFTKC --------1111-----------------1111--------------------------- ANEDCTFCTDI -3333------ >CTP SYNTHETASE; SWP:Q5SIA8; PDB:1VCOA; RPRKYVFITGGVVSSLGKGILTSSLGALLRARGYRVTAIKIDPYVNVDAGTMRPYEHGEV -----------------------------1111---------------1111-------- FVTADGAETDLDIGHYERFLDMDLSRGNNLTTGQVYLSVIQKERRGEYLSQTVQVIPHIT --1111---3333-----------1111-------------------iiii--------- DEIKERIRKVAEEQKAEIVVVEVGGTVGDIESLPFLEAIRQFRFDEGEGNTLYLHLTLVP -----------1111--------------1111-------------2222---------- YLETSEEFKTKPTQHSVATLRGVGIQPDILVLRSARPVPEEVRRKVALFTNVRPGHVFSS --------------------1111----------------------------1111---- PTVEHLYEVPLLLEEQGLGRAVERALGLEAVIPNLSFWQEAVRVLKHPERTVKIAIAGKY ----3333---------------1111--------------------------------- VDAYLSLLEALRHAGIKNRARVEVKWVDAESLADLEEAFRDVSGILVPGGFGVRGIEGKV ---------------------------3333--3333---------------2222---- RAAQYARERKIPYLGICLGLQIAVIEFARNVAGLKGANSTEFDPHTPHPVIDLMPEQLEV ---------------------------------2222-33331111--------3333-- GGTMRLGDWPMRIKPGTLLHRLYGKEEVLERHRHRYEVNPLYVDGLERAGLVVSATTPGM -------------2222---------------------3333-----------------% RGRGAGLVEAIELKDHPFFLGLQSHPEFKSRPMRPSPPFVGFVEAALAYQE %%%2222-----------------3333--3333----------------- >SEMLIKI FOREST VIRUS CAPS; SWP:P03315; PDB:1VCPA; CIFEVKHEGKVTGYACLVGDKVMKPAHVKGVIDNADLAKLAFKKSSKYDLECAQIPVHMR ------%%%%--------------3333------3333-------1111------11111 SDASKYTHEKPEGHYNWHHGAVQYSGGRFTIPTGAGKPGDSGRPIFDNKGRVVAIVLGGA 111-------------1111----%%%%--------2222------1111---------- NEGSRTALSVVTWNKDMVTRVTPEGSEEW ----------------------2222--- >Vesicle transport through; SWP:O89116; PDB:1VCSA; GSSGSSGEGYEQDFAVLTAEITSKIARVPRLPPDEKKQMVANVEKQLEEARELLEQMDLE 
--------------------------1111-----------------------------3 VREIPPQSRGMYSNRMRSYKQEMGKLETDFKRSRIASGPSSG 333--------------------3333-33331111------ >HYPOTHETICAL PROTEIN PH02; SWP:O57975; PDB:1VCTA; YEPKSVKEIFIEMKDTVELMVDLAYASLLFGDKEIAEEVLELEERIDLLNYQLMMHSVLA ---------------------------1111--------------------------111 ARNVKEAEQVITILQIANAIEDISNAAGDLAKMVLEGVELHPVIKETILEGEEIIGKIQV 1----------------------------------------------------------- YPESVIVGKTLGELDLATNTGVWIIAVRRGKRWIFGPNENFKIRAGDVLIGRGTRTSIDH 1111-22223333--3333---------!!!!-----1111--2222------------- LKEIARGAIRVIG ------------- >PROBABLE DEOXYRIBOSE-PHOS; SWP:Q8ZXK7; PDB:1VCVA; MIHLVDYALLKPYLTVDEAVAGARKAEELGVAAYCVNPIYAPVVRPLLRKVKLCVVADFP 1111------1111----------------------3333-3333--------------- FGALPTASRIALVSRLAEVADEIDVVAPIGLVKSRRWAEVRRDLISVVGAAGGRVVKVIT -------------------------------1111------------------------- EEPYLRDEERYTLYDIIAEAGAHFIKSSTGFAEEAYAARQGNPVHSTPERAAAIARYIKE 3333-3333----------------------------1111------------------- KGYRLGVKMAGGIRTREQAKAIVDAIGWGEDPARVRLGTSTPEALL ------------------------------3333------3333-- >VOLVATOXIN A2; SWP:Q6USC4; PDB:1VCYA; NVFQPVDQLPEDLIPSSIQVLKFSGKYLKLEQNKAYFDWPEFKTAIDNYTGEDLSFDKYD ---------1111-----------1111--iiii-------------------------- QSTINQREQEVGSMVDKIAKFLRDAFSAVVDLSKLGAIILNTFTNLEEESSSGFLQFSTN -------------------------1111----------------3333----------- NVKKNSSWEYRVLFSVPFGAPSYFYSLVTTILITADIEEKTGWWGLTSSTKKNFAVQIDA -------------------1111---------------3333----1111---------- LELVVKKGFKAPN ------------- >PROTEIN KINASE C, IOTA TY; SWP:P41743; PDB:1VD2A; GPLGSQVRVKAYYRGDIMITHFEPSISFEGLCNEVRDMCSFDNEQLFTMKWIDEEGDPCT --------------------------3333------------------------------ VSSQLELEEAFRLYELNKDSELLIHVFPC --1111----------------------- >RNASE NGR3; SWP:Q9SSV1; PDB:1VD3A; AQDFDFFYFVQQWPASYCDTRRSCCYPTTGKPDEDFSIHGLWPNYENGKWPQNCDRESSL ------------3333----------1111--------------1111------------ DESEISDLISTMEKNWPSLACPSSDGVRFWSHEWLKHGTCSALGERAYFQAALDFRKKSN 
33331111----------------------------3333-------------------- LLENLKNAEITPRNGEHYTLESIKKAIEEGVGHSPYIECNVDTQGNHQIYQVYLCVDKTA -----1111--------------------------------1111-----------1111 TDFIDCPIFPHGRGCGSKIEFPPF ------------------------ >Transcription initiation ; SWP:P29083; PDB:1VD4A; RIETDERDSTNRASFKCPVCSSTFTDLEANQLFDPMTGTFRCTFCHTEVEEDESAMPKKD ------------------------3333-------------------------------- AR -- >GLYCEROPHOSPHORYL DIESTER; SWP:NA; PDB:1VD6A; RPLRLGHRGAPLKAKENTLESFRLALEAGLDGVELDVWPTRDGVFAVRHDPDTPLGPVFQ ---------1111------------1111----------1111---------11113333 VDYADLKAQEPDLPRLEEVLALKEAFPQAVFNVELKSFPGLGEEAARRLAALLRGREGVW ---------1111-3333-------1111--------2222-----------2222---- VSSFDPLALLALRKAAPGLPLGFLMAEDHSALLPCLGVEAVHPHHALVTEEAVAGWRKRG ---------------1111---------33331111-------3333---------1111 LFVVAWTVNEEGEARRLLALGLDGLIGDRPEVLLPLGG -----------------1111-------33333333-- >NADPH DEPENDENT THIOREDOX; SWP:Q39243; PDB:1VDC; LETHNTRLCIVGSGPAAHTAAIYAARAELKPLLFEGWMANDIAPGGQLTTTTDVENFPGF -------------3333---------------------%%%%-----1111--------1 PEGILGVELTDKFRKQSERFGTTIFTETVTKVDFSSKPFKLFTDSKAILADAVILAIGAV 111-3333---------1111--------------------------------------- AKRLSFVGSGEVLGGFWNRGISACAVCDGAAPIFRNKPLAVIGGGDSAMEEANFLTKYGS -----2222--------------333311111111------------------------- KVYIIHRRDAFRASKIMQQRALSNPKIDVIWNSSVVEAYGDGERDVLGGLKVKNVVTGDV --------------------1111------------------------------------ SDLKVSGLFFAIGHEPATKFLDGGVELDSDGYVVTKPGTTQTSVPGVFAAGDVQDKKYRQ -----------------3333------1111------------2222------------- AITAAGTGCMAALDAEHYLQEI ---------------------- >RECOMBINATION PROTEIN REC; SWP:Q9ZNA2; PDB:1VDDA; MKYPPSLVSLIRELSRLPGIGPKSAQRLAFHLFEQPREDIERLASALLEAKRDLHVCPIC -------------1111-------------3333-------------------------- FNITDAEKCDVCADPSRDQRTICVVEEPGDVIALERSGEYRGLYHVLHGVLSPMNGVGPD --------3333--------------3333---------------------3333----- KLHIKPLLPRVGQGMEVILATGTTVEGDATALYLQRLLEPLGAAISRIAYGVPVGGSLEY 
-------3333--------------------------3333---------------3333 TDEVTLGRALTGRQTVSKP ---------1111------ >MUCONOLACTONE ISOMERASE-L; SWP:Q72HY0; PDB:1VDHA; RHVPEPTHTLEGWHVLHDFRLLDFARWFSAPLEAREDAWEELKGLVREWRELEEAGQGSY -----------------------3333-------------------------1111---- GIYQVVGHKADLLFLNLRPGLDPLLEAEARLSRSAFARYLGRSYSFYSVVELGSQEKPLD ---------------------------------3333----------------------1 PESPYVKPRLTPRVPKSGYVCFYPMNKRRQGQDNWYMLPAKERASLMKAHGETGRKYQGE 111--3333--------------------!!!!3333----------------3333111 VMQVISGAQGLDDWEWGVDLFSEDPVQFKKIVYEMRFDEVSARYGEFGPFFVGKYLDEEA 1----------------------3333-----------3333------------------ LRAFLGL --1111- >TROPONIN I, FAST SKELETAL; SWP:P02644; PDB:1VDIA; KVNMDLRANLKQVKKEDTEKEKDLRDVGDWRKNIEEKSGMEGRKKMFEAGES ----------------3333-------3333------------1111----- >FUMARATE HYDRATASE CLASS ; SWP:P84127; PDB:1VDKA; RIERDTMGEVRVPADKYWGAQTQRSLENFRIGTDRFRMPLEIIRAYGMLKKAAARANLEL ----1111----1111---------------3333------------------------- GELPEEIAKAIIQAAEEVVQGKWDDHFPLVVFQTGSGTQTNMNVNEVIANRASEILGKPL ----------------------1111-------1111----------------1111-22 GSKYAHPNDHVNRGQSSNDTFPTAMYVAVALALHQRLYPAVEGLIRTFTAKAQAFDQIVK 22---------22223333---------------------------------1111---- VGRTHLMDAVPITLGQEIGSWAAQLKTTLAAVKEMEKGLYNLAIGGTAVGTGLNAHPRFG ---iiii-----------------------------------2222-----22221111- ELVAKYLAEETGLPFRVAENRFAALAAHDELVNVMGAIRTLAGALMKIGNDVRWLASGPY -------------------3333------------------------------------- AGIGEITIPANEPGSSIMPGKVNPTQVEALTMVVVRVYGNDHTVAFAGSQGNFQLNVYKP -----------------2222--3333-------------------1111-!!!!----- VMAYSTLESINLLADAVASFDAHLAQGIEPNLERIEEYLQKNPMLATALNKAIGYDKAAE -----------------------3333--------------3333---1111--1111-1 IVKKALKKTLKQAALELGYLTEEEFDRIVVPMRLAKPH 111-----3333--------3333-----3333----- >UBIQUITIN CARBOXYL-TERMIN; SWP:P57080; PDB:1VDLA; GSSGSSGMTVEQNVLQQSAAQKHQQTFLNQLREITGINDAQILQQALKDSNGNLELAVAF ------------3333-3333-----------------3333------------------ LTAKNAKTPPQEETSGPSSG 
-------------------- >PURINE PHOSPHORIBOSYLTRAN; SWP:O57827; PDB:1VDMA; MDKVYLTWWQVDRAIFALAEKLREYKPDVIIGVARGGLIPAVRLSHILGDIPLKVIDVKF --------------------3333------------------------------------ YKGERGEKPVITIPIHGDLKDKRVVIVDDVSDTGKTLEVVIEEVKKLGAKEIKIACLAMK --------------------------------------------1111-----------1 PWTSVVPDYYVFRTEKWIVFPWEEFPVIEKE 111----------------1111-------- >CYCLOPHILIN A; SWP:P14832; PDB:1VDNA; SQVYFDVEADGQPIGRVVFKLYNDIVPKTAENFRALCTGEKGFGYAGSPFHRVIPDFMLQ --------iiii--------------------------1111--2222------------ GGDFTAGNGTGGKSIYGGKFPDENFKKHHDRPGLLSMANAGPNTNGSQFFITTVPCPWLD -------------------------------------------------------3333- GKHVVFGEVVDGYDIVKKVESLGSPSGATKARIVVAKSGEL -----------3333----11111111-------------- >DIHYDROFOLATE REDUCTASE; SWP:P15093; PDB:1VDRA; ELVSVAALAENRVIGRDGELPWPSIPADKKQYRSRIADDPVVLGRTTFESMRDDLPGSAQ -----------------------------------1111-----33331111-------- IVMSRSERSFSVDTAHRAASVEEAVDIAASLDAETAYVIGGAAIYALFQPHLDRMVLSRV -----------------------------------------------3333--------- PGEYEGDTYYPEWDAAEWELDAETDHEGFTLQEWVRS -------------3333--------2222-------- >HYPOTHETICAL PROTEIN PH18; SWP:O59523; PDB:1VDWA; SVKTWRKIAIDIIRDFDHNIMPLFGNPKASETISIETKVVDKVAENIIISKFKDLGVNVV -------------------3333--3333-----------------------1111---- SEEIGRIDQGSDYTVVVDPLDGSYNFINGIPFFAVSVAIFHEKDPIYAFIYEPIVERLYE -1111--------------------1111-----------!!!!--------1111---- GIPGKGSYLNGEKIKVRELAEKPSISFYTKGKGTKIIDKVKRTRTLGAIALELAYLARGA --------iiii--------------------33331111---------------1111- LDAVVDIRNYLRPTDIAAGVVIAREAGAIVKDLDGKDVEITFSATEKVNIIAANNEELLE -----------3333--------1111----1111------------------------- TILRSIEK --1111-- >HYPOTHETICAL PROTEIN (RAF; SWP:Q9C5H4; PDB:1VDYA; GSSGSSGESYWRSRMIDAVTSDEDKVAPVYKLEEICDLLRSSHVSIVKEFSEFILKRLDN ----------------1111-------3333-----------------------3333-- KSPIVKQKALRLIKYAVGKSGSEFRREMQRNSVAVRNLFHYKGHPDPLKGDALNKAVRET -----------------------------------1111------------3333----- AHETISAIFSEENGSGPSSG 
-------------------- >A-TYPE ATPASE SUBUNIT A; SWP:NA; PDB:1VDZA; RPGEPVVGTGASLSVELGPGLLTSIYDGIQRPLEVIREKTGDFIARGVTAPALPRDKKWH ------------------------------------------------------------ FIPKAKVGDKVVGGDIIGEVPETSIIVHKIMVPPGIEGEIVEIAEEGDYTIEEVIAKVKT -----2222--2222-----------------2222-------------3333------3 PSGEIKELKMYQRWPVRVKRPYKEKLPPEVPLITGQRVIDTFFPQAKGGTAAIPGPFGSG 333-----------1111---------------------------2222------3333- KTVTQHQLAKWSDAQVVIYIGCGERGNEMTDVLEEFPKLKDPKTGKPLMERTVLIANTSN --------------------------3333-1111----------------------111 MPVAAREASIYTGITIAEYFRDMGYDVALMADSTSRWAEALPAYLASKLAEFYERAGRVV 13333---------------1111------------------3333-----3333----- TLGSDYRVGSVSVIGAVSPPGGDFSEPVVQNTLRVVKVFWALDADLARRRHFPAINWLTS -------------------------------1111-------3333-------------- YSLYVDAVKDWWHKNIDPEWKAMRDKAMALLQKESELQEIVRIVGPDALPERERAILLVA ---3333---------1111-------------------------1111----------- RMLREDYLQQDAFDEVDTYCPPEKQVTMMRVLLNFYDKTMEAINRGVPLEEIAKLPVREE ----------1111------3333------------------1111-3333--------- IGRMKFERDVSKIRSLIDKTNEQFEELFKKYGA --3333--3333------------1111----- >HYPOTHETICAL PROTEIN (ST2; SWP:Q96YV5; PDB:1VE0A; KIISKEFTVKTRSRFDSIDITEQVSEAIKGINNGIAHVIVKHTTCAIIINEAESGLKDFL -------------------------1111-----------------------3333---- NWAKKLVPPDGEFEHNIIDNNGHAHVISAIIGNSRVVPIIEGKLDLGTWQRIILLEFDGP -------1111-3333-----------------------iiii---1111---------- RTRTVLVKSGE ----------- >O-ACETYLSERINE SULFHYDRYL; SWP:Q5SLE6; PDB:1VE1A; MRVEGAIGKTPVVRLAKVVEPDMAEVWVKLEGLNPGGSIKDRPAWYMIKDAEERGILRPG -3333--------------1111------11111111----------------------- SGQVIVEPTSGNTGIGLAMIAASRGYRLILTMPAQMSEERKRVLKAFGAELVLTDPERRM --------------------------------1111--------1111------3333-- LAAREEALRLKEELGAFMPDQFKNPANVRAHYETTGPELYEALEGRIDAFVYGSGTGGTI -------------------1111-------------------%%%%-------------- TGVGRYLKERIPHVKVIAVEPARSNVLSGGKMGQHGFQGMGPGFIPENLDLSLLDGVIQV ----------1111------11113333--------2222-----11113333------- 
WEEDAFPLARRLAREEGLFLGMSSGGIVWAALQVARELGPGKRVACISPDGGWKYLSTPL 3333----------------3333---------------------------1111--333 YA 3- >UROPORPHYRIN-III C-METHYL; SWP:Q746N6; PDB:1VE2A; MRGKVYLVGAGFGGPEHLTLKALRVLEVAEVVLHDRLVHPGVLALAKGELVPVKTPQEAI -------------3333-----------------1111----1111---------3333- TARLIALAREGRVVARLKGGDPMVFGRGGEEALALRRAGIPFEVVPGVTSAVGALSALGL --------------------1111%%%%-------1111----------------1111- PLTHRGLARSFAVATGHDPALPLPRADTLVLLMGLKERLLERFPPETPLALLARVGWPGE ---2222----------1111----------------------1111------2222--- AVRLGRVEDLPGLGEGLPSPALLVVGKVVGLYGELLPKDHGL -----3333-3333---------------------------- >HYPOTHETICAL PROTEIN PH02; SWP:O57965; PDB:1VE3A; GFKEYYRVFPTYTDINSQEYRSRIETLEPLLKYKKRGKVLDLACGVGGFSFLLEDYGFEV ---------11111111----------3333------------!!!!-----3333---- VGVDISEDIRKAREYAKSRESNVEFIVGDARKLSFEDKTFDYVIFIDSIVHFEPLELNQV ----------------------------3333----------------1111-------- FKEVRRVLKPSGKFIYFTDLRELLPRLKEISKVIPDQEERTVVIEFSFRVRFNVWGKTGV --------1111--------3333-----------1111----------------3333- ELLAKLYFTKEAEEKVGNYSYLTVYNPK ---------------------------- >ATP PHOSPHORIBOSYLTRANSFE; SWP:P62381; PDB:1VE4A; MRRFALTVALPKGRMFREAYEVLKRAGLDLPEVLLHGKEGGVALLELRNKDVPIYVDLGI 1111--------1111-------1111----------2222------3333--------- AEIGVVGKDVLLDSGRDLFEPVDLGFGACRLSLIRRPGDTGPIRRVATKYPNFTARLLKE ------3333-------------------------1111-------------------11 RGWAADVVELSGNIELAAVTGLADAVVDVVQTGATLRAAGLVEVEVLAHSTARLVVNRQA 11----------3333-1111---------------1111-------------------- LKLKRAVLKPLIQRLRELSGS ---------------1111-- >THREONINE DEAMINASE; SWP:Q5SLL4; PDB:1VE5A; PSLQDLYAAFRRIAPYTHRTPLLTSRLLDGLLGKRLLLKAEHLQKTGSFKARGALSKALA ------------3333-----------------------33332222---------3333 LENPKGLLAVSSGNHAQGVAYAAQVLGVKALVVMPEDPYKKACARAYGAEVVDRGVTAKN ------------3333-------------------------------------------- REEVARALQEETGYALIHPFDDPLVIAGQGTAGLELLAQAGRMGVFPGAVLAPVGGGGLL ------------------------------------------------------------ 
AGLATAVKALSPTTLVLGVEPEAADDAKRSLEAGRILRLEAPPRTRADGVRTLSLGERTF ----------1111------1111----------------------1111--------33 PILRERVDGILTVSEEALLEAERLLFTRTKQVVEPTGALPLAAVLEHGARLPQTLALLLS 33-------------------------------3333--3333---3333---------- GGNRDFSP -------- >ACYLAMINO-ACID-RELEASING ; SWP:Q9YBQ2; PDB:1VE6A; EFSRIVRDVERLIAVEKYSLQGVVDGDKLLVVGFSEGSVNAYLYDGGETVKLNREPINSV ----------------------------------iiii---------------------- LDPHYGVGRVILVRDVSKGAEQHALFKVNTSRPGEEQRLEAVKPMRILSGVDTGEAVVFT ---2222---------iiii--------1111------3333------------------ GATEDRVALYALDGGGLRELARLPGFGFVSDIRGDLIAGLGFFGGGRVSLFTSNLSSGGL --1111------1111----------------!!!!-------iiii------------- RVFDSGEGSFSSASISPGMKVTAGLETAREARLVTVDPRDGSVEDLELPSKDFSSYRPTA ----1111-------1111----------------------------------3333--- ITWLGYLPDGRLAVVARREGRSAVFIDGERVEAPQGNHGRVVLWRGKLVTSHTSLSTPPR ------1111-------iiii----------------------iiii------1111--- IVSLPSGEPLLEGGLPEDLRRSIAGSRLVWVESFDGSRVPTYVLESGRAPTPGPTVVLVH --------------------------------1111------------------------ GGPFAEDSDSWDTFAASLAAAGFHVVMPNYRGSTGYGEEWRLKIIGDPCGGELEDVSAAA ------------------1111------------------3333---------------- RWARESGLASELYIMGYSYGGYMTLCALTMKPGLFKAGVAGASVVDWEEMYELSDAAFRN ---1111---------!!!!----------2222----------------1111------ FIEQLTGGSREIMRSRSPINHVDRIKEPLALIHPQNDSRTPLKPLLRLMGELLARGKTFE -------------1111---3333---------1111----3333-------1111---- AHIIPDAGHAINTMEDAVKILLPAVFFLATQRER ---------------------------------- >D-AMINO ACID OXIDASE; SWP:P00371; PDB:1VE9A; MRVVVIGAGVIGLSTALCIHERYHSVLQPLDVKVYADRFTPFTTTDVAAGLWQPYTSEPS --------3333----------1111-------------11111111------------- NPQEANWNQQTFNYLLSHIGSPNAANMGLTPVSGYNLFREAVPDPYWKDMVLGFRKLTPR -3333----------1111-11111111----------------1111---------333 ELDMFPDYRYGWFNTSLILEGRKYLQWLTERLTERGVKFFLRKVESFEEVARGGADVIIN 3---1111-----------------------3333------------------------- CTGVWAGVLQPDPLLQPGRGQIIKVDAPWLKNFIITHDLERGIYNSPYIIPGLQAVTLGG 
-!!!!-1111-3333-----------1111-------1111------------------- TFQVGNWNEINNIQDHNTIWEGCCRLEPTLKDAKIVGEYTGFRPVRPQVRLEREQLRFGS --2222-----3333--------------1111--------------------------- SNTEVIHNYGHGGYGLTIHWGCALEVAKLFGKVLEERNLL ----------!!!!3333---------------------- >ATP-DEPENDENT RNA HELICAS; SWP:P26196; PDB:1VECA; KGNEFEDYCLKRELLMGIFEMGWEKPSPIQEESIPIALSGRDILARAKNGTGKSGAYLIP ---1111-----------1111----------------------------11111111-- LLERLDLKKDNIQAMVIVPTRELALQVSQICIQVSKHMGGAKVMATTGGTNLRDDIMRLD -3333-----------------------------1111---------------------- DTVHVVIATPGRILDLIKKGVAKVDHVQMIVLDEADKLLSQDFVQIMEDIILTLPKNRQI ----------------3333---1111------------3333-------1111------ LLYSATFPLSVQKFMNSHLEKPYEIN -------------------------- >PROLINE-RICH PROTEIN FAMI; SWP:Q9M158; PDB:1VEEA; GSSGSSGSAKNAYTKLGTDDNAQLLDIRATADFRQVGSPNIKGLGKKAVSTVYNGEDKPG -------3333-------1111------3333--------3333---------3333-33 FLKKLSLKFKDPENTTLYILDKFDGNSELVAELVALNGFKSAYAIKDGAEGPRGWLNSSL 33--1111--1111------------------------------2222-----3333--- PWIEPKKTSGPSSG -------------- >ACETYLORNITHINE/ACETYL-LY; SWP:Q93R93; PDB:1VEFA; WRALLEAEKTLDSGVYNKHDLLIVRGQGARVWDAEGNEYIDCVGGYGVANLGHGNPEVVE --------------------------!!!!--1111------%%%%--1111-------- AVKRQAETLMAMPQTLPTPMRGEFYRTLTAILPPELNRVFPVNSGTEANEAALKFARAHT ------------1111-3333-------11113333------------------------ GRKKFVAAMRGFSGRTMGSLSVTWEPKYREPFLPLVEPVEFIPYNDVEALKRAVDEETAA -------2222-------3333--33331111----------2222--------1111-- VILEPVQGEGGVRPATPEFLRAAREITQEKGALLILDEIQTGMGRTGKRFAFEHFGIVPD -------1111------------------------------iiii----3333------- ILTLAKALGGGVPLGVAVMREEVARSMPKGGHGTTFGGNPLAMAAGVAAIRYLERTRLWE ----!!!!---------------33332222----2222--------------------- RAAELGPWFMEKLRAIPSPKIREVRGMGLMVGLELKEKAAPYIARLEKEHRVLALQAGPT ------------1111-1111------------------------------------111 VIRFLPPLVIEKEDLERVVEAVRAVLA 1-----11113333------------- >NEDD8 ULTIMATE BUSTER-1; SWP:P54729; PDB:1VEGA; 
GSSGSSGNPHMWWLQDADPENNSRQASPSQESINQLVYMGFDTVVAEAALRVFGGNVQLA ------------------------------------3333-------------------- AQTLAHHGGSLPPDLQFSGPSSG ----3333--------------- >NIFU-LIKE PROTEIN HIRIP5; SWP:Q9QZ23; PDB:1VEHA; GSSGSSGSEEDDEVVAMIKELLDTRIRPTVQEDGGDVIYRGFEDGIVRLKLQGSCTSCPS -----------3333----------11113333--------------------------- SIITLKSGIQNMLQFYIPEVEGVEQVSGPSSG -------------------------------- >RIKEN CDNA 4931431F19; SWP:Q9D4I8; PDB:1VEJA; GSSGSSGARACSQSSQTALPTSLFTEGRYQQELEELKALGFANRDANLQALVATDGDIHA --------------------11111111-----------------------1111----- AIEMLLGASGPSSG ---1111------- >UBIQUITIN-SPECIFIC PROTEA; SWP:Q8L6Y1; PDB:1VEKA; GSSGSSGGEELLPDGVPEEVMESAQPVANEEIVAQLVSMGFSQLHCQKAAINTSNAGVEE ----------------------------3333------------------1111------ AMNWLLSHMDDPDIDAPISGPSSG ----1111--3333---------- >BETA-AMYLASE; SWP:P36924; PDB:1VEMA; AVNGKGMNPDYKAYLMAPLKKIPEVTNWETFENDLRWAKQNGFYAITVDFWWGDMEKNGD 2222---1111---------3333--------------1111--------3333----22 QQFDFSYAQRFAQSVKNAGMKMIPIISTHQCGGNVGDDCNVPIPSWVWNQKSDDSLYFKS 22-------------1111--------------2222------3333------3333--1 ETGTVNKETLNPLASDVIRKEYGELYTAFAAAMKPYKDVIAKIYLSGGPAGELRYPSYTT 111-------3333------------------33331111-------2222-------33 SDGTGYPSRGKFQAYTEFAKSKFRLWVLNKYGSLNEVNKAWGTKLISELAILPPSDGEQF 33--------------------------------------------3333---------- LMNGYLSMYGKDYLEWYQGILENHTKLIGELAHNAFDTTFQVPIGAKIAGVHWQYNNPTI --3333--------------------------------------------------3333 PHGAEKPAGYNDYSHLLDAFKSAKLDVTFTCLEMTDKGSYPEYSMPKTLVQNIATLANEK ----3333---------------------------------------------------- GIVLNGENALSIGNEEEYKRVAEMAFNYNFAGFTLLRYQDVMYNNSLMGKFKDLLGVTPV -------------3333---------------------1111------------------ MQTIVVKNVPTTIGDTVYITGNRAELGSWDTKQYPIQLYYDSHSNDWRGNVVLPAERNIE -----------2222-------3333%%%%------------------------------ FKAFIKSKDGTVKSWQTIQQSWNPVPLKTTSHTSSW ------1111-------------------------- >NEW ANTIGEN RECEPTOR VARI; SWP:Q6X1E7; PDB:1VERA; 
AWVDQTPRTATKETGESLTINCVLRDASYGLESTGWYRTKLGSTNEQTISIGGRYVETVN ------------------------------------------------------------ KGSKSFSLRIRDLRVEDSGTYKCGAFRLSEKGAGTVLTVK -------------3333----------------------- >NEW ANTIGEN RECEPTOR VARI; SWP:Q6X1E6; PDB:1VESA; AWVDQTPRTATKETGESLTINCVLRDASFELKDTGWYRTKLGSTNEQSISIGGRYVETVN ------------2222-----------------------2222--------!!!!----- KGSKSFSLRISDLRVEDSGTYKCQAFYSLPLGDYNYSLLFRGEKGAGTALTVK 1111---------3333------------------------------------ >LATE ENDOSOMAL/LYSOSOMAL ; SWP:O88653; PDB:1VETA; ADDLKRFLYKKLPSVEGLHAIVVSDRDGVPVIKVANDSAPEHALRPGFLSTFALATDQGS ----------33332222------1111-------11113333--3333---------11 KLGLSKNKSIICYYNTYQVVQFNRLPLVVSFIASSSANTGLIVSLEKELAPLFEELIKVV 11-----------1111----------------11113333------------------- EV -- >Mitogen-activated protein; SWP:Q9JHS3; PDB:1VETB; MLRPKALTQVLSQANTGGVQSTLLLNNEGSLLAYSGYGDTDARVTAAIASNIWAAYDRNG ----------3333-iiii------1111-----------3333---------------- NQAFNEDSLKFILMDCMEGRVAITRVANLLLCMYAKETVGFGMLKAKAQALVQYLEEP --1111---------1111------!!!!------1111------------------- >F-SPONDIN; SWP:P35446; PDB:1VEXA; GSIPCLLSPWSEWSDCSVTCGKGMRTRQRMLKSLAELGDCNEDLEQAEKCMLPECP --------------------------------1111-------------------- >GLUTATHIONE S-TRANSFERASE; SWP:P26697; PDB:1VF1A; AKPVLYYFNGRGKMESIRWLLAAAGVEFEEVFLETREQYEKLLQSGILMFQQVPMVEIDG ----------!!!!-------1111----------------------1111------iii MKLVQTRAILNYIAGKYNLYGKDLKERALIDMYVGGTDDLMGFLLSFPFLSAEDKVKQCA i-------------1111-----------------------11113333----------- FVVEKATSRYFPAYEKVLKDHGQDFLVGNRLSWADIHLLEAILMVEEKKSDALSGFPLLQ --------------------------%%%%-3333---------333311112222---- AFKKRISSIPTIKKFLAPGSKRKPISDDKYVETVRRVLRMYYDVKP ---------------------------------------------- >PALS1-ASSOCIATED TIGHT JU; SWP:Q8NI35; PDB:1VF6A; VLQVLDRLKMKLQEKGDTSQNEKLSMFYETLKSPLFNQILTLQQSIKQLKGQLNHILE ------------1111-1111------------------------------------- >MAGUK p55 subfamily membe; SWP:Q9JLB2; PDB:1VF6C; 
QDPDVEDLFSSLKHIQHTLVDSQSQEDISLLLQLVQNRDFQNAFKIHNAVT -3333---------1111-----------------------------1111 >MULTIDRUG RESISTANCE PROT; SWP:P52477; PDB:1VF7A; TLNTELPGRTNAFRIAEVRPQVNGIILKRLFKEGSDVKAGQQLYQIDPATYEADYQSAQA -------------------------------------2222------------------- NLASTQEQAQRYKLLVADQAVSKQQYADANAAYLQSKAAVEQARINLRYTKVLSPISGRI ---------------1111----------------------------3333--------- GRSAVTEGALVTNGQANAMATVQQLDPIYVDVTQPSTALLRLRRELASGQLERAGDNAAK ----------------------------------3333---------------------- VSLKLEDGSQYPLEGRLEFSEVSVDEGTGSVTIRAVFPNPNNELLPGMFVHAQLQEG ----1111------------------------------------------------- >SECRETORY PROTEIN; SWP:O35744; PDB:1VF8A; YQLMCYYTSWAKDRPIEGSFKPGNIDPCLCTHLIYAFAGMQNNEITYTHEQDLRDYEALN -------1111---3333--3333-1111-----------%%%%---------------- GLKDKNTELKTLLAIGGWKFGPAPFSAMVSTPQNRQIFIQSVIRFLRQYNFDGLNLDWQY -33331111--------3333--------------------------------------2 PGSRGSPPKDKHLFSVLVKEMRKAFEEESVEKDIPRLLLTSTGAGIIDVIKSGYKIPELS 222---3333-------------------------------------------------- QSLDYIQVMTYDLHDPKDGYTGENSPLYKSPYDIGKSADLNVDSIISYWKDHGAASEKLI --------------1111-----------1111!!!!------------1111-3333-- VGFPAYGHTFILSDPSKTGIGAPTISTGPPGKYTDESGLLAYYEVCTFLNEGATEVWDAP -------------1111-2222-------------2222---------1111------11 QEVPYAYQGNEWVGYDNVRSFKLKAQWLKDNNLGGAVVWPLDMDDFSGSFCHQRHFPLTS 11-----!!!!-----------------1111-------1111-1111--------3333 TLKGDLNIHSASC ---1111--1111 >BETA-GLUCOSIDASE; SWP:O58104; PDB:1VFFA; MPLKFPEMFLFGTATSSHQIEGNNRWNDWWYYEQIGKLPYRSGKACNHWELYRDDIQLMT -----1111-------3333-----------------------!!!!------------- SLGYNAYRFSIEWSRLFPEENKFNEDAFMKYREIIDLLLTRGITPLVTLHHFTSPLWFMK -----------3333-----------------------1111------------------ KGGFLREENLKHWEKYIEKVAELLEKVKLVATFNEPMVYVMMGYLTAYWPPFIRSPFKAF -!!!!3333------------1111----------------------------------- KVAANLLKAHAIAYELLHGKFKVGIVKNIPIILPASDKERDRKAAEKADNLFNWHFLDAI -------------------------------------3333------------------- 
WSGKYRGVFKTYRIPQSDADFIGVNYYTASEVRHTWNPLKFFFEVKLADISERKTQMGWS ------------------------------------3333--------------1111-- VYPKGIYMALKKASRYGRPLYITENGIATLDDEWRVEFIIQHLQYVHKAIEDGLDVRGYF -------------1111------------------------------------------- YWSFMDNYEWKEGFGPRFGLVEVDYQTFERRPRKSAYVYGEIARSKEIKDELLKRYGLPE --------!!!!--------------------3333-------------3333-----22 LQL 22- >POLY A POLYMERASE; SWP:O66728; PDB:1VFGA; MVGQIAKEMGLRAYIVGGVVRDILLGKEVWDVDFVVEGNAIELAKELARRHGVNVHPFPE 3333--------------------------------------------1111-------- FGTAHLKIGKLKLEFATARRETVEPASLKEDLIRRDFTINAMAISVNLEDYGTLIDYFGG -------!!!!-------------------------1111------3333---------- LRDLKDKVIRVLHPVSFIEDPVRILRALRFAGRLNFKLSRSTEKLLKQAVNLGLLKEAPR ------------111133333333----------------------------3333---- GRLINEIKLALREDRFLEILELYRKYRVLEEIIEGFQWNEKVLQKLYALRKVVDWHALEF ---------3333--------------3333----------------------------3 SEERIDYGWLYLLILISNLDYERGKHFLEEMSAPSWVRETYKFMKFKLGSLKEELKKAKE 333--3333--------------3333---------------3333------3333---- NYEVYRLLKPLHTSVLLLLMLEEELKEKIKLYLEKLRKVKLP -3333--------3333------------------1111--- >VANADIUM-BINDING PROTEIN ; SWP:Q86BW2; PDB:1VFIA; ISEFAPVDCKGQCTTPCEPLTACKEKCAESCETSADKKTCRRNCKKADCEPQDKVCDACR --------3333-3333--3333-------------------3333-------------- MKCHKACRAANCASECPKHEHKSDTCRACMKTNCK ----------------------3333--------- >NITROGEN REGULATORY PROTE; SWP:P83820; PDB:1VFJA; MKLIVAIVRPEKLNEVLKALFQAEVRGLTLSRVQGHGGETERVETYRGTTVKMELHEKVR --------3333--------1111------------------------------------ LEIGVSEPFVKPTVEAILKAARTGEVGDGKIFVLPVEKVYRIRTGEEDEAAVTPVQ -----3333---------------2222-------------1111--3333----- >ADENOSINE DEAMINASE; SWP:P56658; PDB:1VFLA; TPAFDKPKVELHVHLDGAIKPETILYYGKRRGIALPADTPEELQNIIGMDKPLTLPDFLA -------------1111------------------------------------------3 KFDYYMPAIAGCRDAIKRIAYEFVEMKAKDGVVYVEVRYSPHLLANSKVEPIPWNQAEGD 333-33332222---------------------------3333---------%%%%---- LTPDEVVSLVNQGLQEGERDFGVKVRSILCCMRHQPSWSSEVVELCKKYREQTVVAIDLA 
-------------------------------11111111--------------------- GDETIEGSSLFPGHVQAYAEAVKSGVHRTVHAGEVGSANVVKEAVDTLKTERLGHGYHTL -1111-33333333-------------------------------------------111 EDTTLYNRLRQENMHFEICPWSSYLTGAWKPDTEHAVIRFKNDQVNYSLNTDDPLIFKST 1----------------------3333--1111-3333---------------------3 LDTDYQMTKKDMGFTEEEFKRLNINAAKSSFLPEDEKKELLDLLYKAYR 333-----------------------1111------------------- >NAD(P)H\:FMN OXIDOREDUCTA; SWP:P46072; PDB:1VFRA; THPIIHDLENRYTSKKYDPSKKVSQEDLAVLLEALRLSASSINSQPWKFIVIESDAAKQR -----------------1111-------------1111-2222----------------- MHDSFANMHQFNQPHIKACSHVILFANKLSYTRDDYDVVLSKAVADKRITEEQKEAAFAS --1111--33333333---------------------------1111--3333------- FKFVELNCDENGEHKAWTKPQAYLALGNALHTLARLNIDSTTMEGIDPELLSEIFADELK ----11111111-3333-----------------------------3333-----3333- GYECHVALAIGYHHPSEDYNASLPKSRKAFEDVITIL ------------------3333------3333----- >ALANINE RACEMASE; SWP:Q65YW7; PDB:1VFSA; ETPTRVYAEIDLDAVRANVRALRARAPRSALMAVVKSNAYGHGAVPCARAAQEAGAAWLG -------------------------1111---------iiii---------1111----- TATPEEALELRAAGIQGRIMCWLWTPGGPWREAIETDIDVSVSGMWALDEVRAAARAAGR ----------1111----------2222-----1111------------------1111- TARIQLADTGLGRNGCQPADWAELVGAAVAAQAEGTVQVTGVWSHFACADEPGHPSIRLQ ----------------1111-----------1111------------1111--3333--- LDAFRDMLAYAEKEGVDPEVRHIANSPATLTLPETHFDLVRTGLAVYGVSPSPELGTPAQ -----------1111----------------1111-------3333-----3333-3333 LGLRPAMTLRASLALVKTVPAGHGVSYGHHYVTESETHLALVPAGYADGIPRNASGRGPV -------------------------2222----------------1111-3333------ LVAGKIRRAAGRIAMDQFVVDLGEDLAEAGDEAVILGDAERGEPTAEDWAQAAHTIAYEI -iiii----------------!!!!--2222------3333--------------3333- VTRIGGRVPRVYLGGLEHHHHH 11113333-------3333--- >PROTEIN (Fusion protein c; SWP:P33173; PDB:1VFVA; GASVKVAVRVRPFNSREMSRDSKCIIQMSGSTTTIVNPKQPKETPKSFSFDYSYWSHTSP -----------------1111-------!!!!----3333------------------33 EDINYASQKQVYRDIGEEMLQHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQL 
33---------------------------------2222----------2222------- CEDLFSRINDTTNDNMSYSVEVSYMEIYCERVRDLLNPKNKGNLRVREHPLLGPYVEDLS -------1111-1111-----------%%%%--1111------------------2222- KLAVTSYNDIQDLMDSGNKPRTVTSSRSHAVFNIIFTQKRHDAETNITTEKVSKISLVDL -----------------1111--------------------------------------- AGSEANINKSLTTLGKVISALAEMFIPYRDSVLTWLLRENLGGNSRTAMVAALSPADINY --------------------------1111------3333---------------3333- DETLSTLRYADRAKQIRNTVSVNH -----------3333--------- >PHOSPHATIDYLINOSITOL-3-PH; SWP:P40343; PDB:1VFYA; DWIDSDACMICSKKFSLLNRKHHCRSCGGVFCQEHSSNSIPLPDLGIYEPVRVCDSCFED ---------------1111------------3333------3333--------------- YEFIVTD ------- >RAS-RELATED PROTEIN RAB-7; SWP:P37727; PDB:1VG0A; DNLPSDFDVIVIGTGLPESIIAAACSRSGQRVLHVDSRSYYGGNWASFSFSGLLSWLKEY -------------------------1111------------!!!!-----------3333 QMWQEQILENEEAIPLSSKDKTIQHVEVFCYASQRITYSQIIKEGRRFNIDLVSKLLYSR -3333--1111-------------------------3333---1111------------- GLLIDLLIKSNVSRYAEFKNITRILAFREGTVEQVPCSRADVFNSKQLTMVEKRMLMKFL -------3333-1111-----------iiii----------------------------- TFCVEYEEHPDEYRAYEGTTFSEYLKTQKLTPNLQYFVLHSIAMETTSCTVDGLKATKKF ----1111----3333--------1111-------------------------------- LQCLGRYGNTPFLFPLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAVID --2222--------22223333--------1111-------------------------1 QFGQRIISKHFIIEDSYLSENTCSRVQYRQISRAVLITDGSVLRTDADQQVSILTVPAEE 111----------3333-33331111---------------------------------- PGSFAVRVIELCSSTMTCMKGTYLVHLTCMSSKTAREDLERVVQKLFTPYTEIEKPRLLW ------------------2222-----------3333----------------------- ALYFNMRDSSDISRDCYNDLPSNVYVCSGPDSGLGNDNAVKQAETLFQQICPNEDFCPAP ------------3333----1111--------------------------2222------ P - >RHOMBOID FAMILY PROTEIN; SWP:Q8LB17; PDB:1VG5A; GSSGSSGSRQAPIANAAVLPQSQGRVAASEEQIQKLVAMGFDRTQVEVALAAADDDLTVA ----------------------------------------------------iiii3333 VEILMSQSGPSSG ------------- >RAS-RELATED PROTEIN RAB-7; SWP:P09527; PDB:1VG8A; 
VLLKVIILGDSGVGKTSLMNQYVNKKFSNQYKATIGADFLTKEVMVDDRLVTMQIWDTAG ---------2222--------------------------------%%%%----------- QERFQSLGVAFYRGADCCVLVFDVTAPNTFKTLDSWRDEFLIQASPRDPENFPFVVLGNK 3333----3333----------11113333-----------------3333--------3 IDLENRQVATKRAQAWCYSKNNIPYFETSAKEAINVEQAFQTIARNALKQETEVELYNEF 333--------------------------1111--------------------------- PEPI ---- >TR1.9 FAB; SWP:GC1_HUMAN; PDB:1VGEH; QVKLLEQSGAEVKKPGASVKVSCKASGYSFTSYGLHWVRQAPGQRLEWMGWISAGTGNTK -------------2222-----------1111--------2222---------------- YSQKFRGRVTFTRDTSATTAYMGLSSLRPEDTAVYYCARDPYGGGKSEFDYWGQGTLVTV -3333---------1111---------1111----------------------------- SSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQ ------------------------------------------%%%%-------------3 SSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC 333----------3333-----------3333------------- >CONSERVED HYPOTHETICAL PR; SWP:Q5SJC3; PDB:1VGGA; ELKLIPIEKPENLNVILGQAHFIKTVEDLHEALVTAVPGIRFGLAFSEASGKRLVRRSGT ---------1111-----------------------2222---------!!!!------- DEALVELAVKNLLNLACGHVFLIVLGEGFYPINVLHAVKACPEVVRIYAATANPLKVVVA ----------------2222-----22223333-------1111---------------- EEGEQRAILGVDGFTPLGVEDEAEVAWRKDLLRRLGYKL --------------------------------------- >HYPOTHETICAL PROTEIN PH00; SWP:O57823; PDB:1VGJA; RAFIAIDVNESVRDSLVRAQDYIGSKEAKIKFVERENLHITLKFLGEITEEQAEEIKNIL ---------------------------------3333----------------------- KKIAEKYKKHEVKVKGIGVFPNPNYIRVIWAGIENDEIIREAREIEDELAKLGFKKEGNF -----------------------------------3333--------3333--------- VAHITLGRVKFVKDKLGLTKLKELANEDFGSFVVDAIELKKSTLTPKGPIYETLARFELS --------------3333--3333--------------------1111------------ E - >BETA-2-MICROGLOBULIN; SWP:P01902; PDB:1VGKA; GPHSLRYFVTAVSRPGLGEPRFIAVGYVDDTQFVRFDSDADNPRFEPRAPWMEQEGPEYW -------------2222----------!!!!-----1111--------1111---3333- EEQTQRAKSDEQWFRVSLRTAQRYYNQSKGGSHTFQRMFGCDVGSDWRLLRGYQQFAYDG -------------------------------------------1111----------iii 
RDYIALNEDLKTWTAADTAALITRRKWEQAGDAEYYRAYLEGECVEWLRRYLELGNETLL i-----1111------3333----------3333--------------------1111-- RTDSPKAHVTYHPRSQVDVTLRCWALGFYPADITLTWQLNGEDLTQDMELVETRPAGDGT --------------------------------------iiii-3333------------- FQKWAAVVVPLGKEQNYTCHVHHKGLPEPLTLRW ---------22221111-----1111-------- >ORF_ID:tlr0482 circadian ; SWP:Q79V61; PDB:1VGLA; KTYVLKLYVAGNTPNSVRALKTLNNILEKEFKGVYALKVIDVLKNPQLAEEDKILATPCL ------------------------------2222------1111-----------33333 AKVLPPPVRRIIGDLSNREKVLIGLDLLYEEIGDQA 333--------------------------------- >378AA LONG HYPOTHETICAL C; SWP:Q96ZM7; PDB:1VGMA; VEVSRGLENVIIKTTGLTYIDGINGILRYRGYDINDLVNYASYEELIHLMLYGELPNRQQ ---2222---------------------iiii---------------------------- LNQIKGIINESFEVPEQVISTIFSMPRNCDAIGMMETAFGILASIYDPKWNRATNKELAV -------1111----------11111111---------------------1111------ QIIAKTATITANIYRAKEGLKPKIPEPSESYAESFLAATFGKKPTQEEIKAMDASLILYT ---------------1111-------------------------------------1111 DHEVPASTTAALVASSTLSDMYSCIVAALAALKGPLHGGAAEEAFKQFVEIGSVENADKW --------------1111---------------1111---------------3333---- FEEKIIKGKSRLMGFGHRVYKTYDPRAKIFKTLAKSFAEKNENVKKYYEIAERIEKLGVD -----------2222--------------------------------------------- TFGSKHIYPNTDFYSGIVFYALGFPIYMFTSLFALSRVLGWLAHIIEYVEEQHRLIRPRA -3333----1111-----------1111-------------------------------- LYIGPEKREFKPIELR -----------3333- >373AA LONG HYPOTHETICAL C; SWP:Q974S5; PDB:1VGPA; MEIKKGLEDVYVKETEITYIDGELGRLYYRGYSIYDLAEFSNFEEVSYLILYGKLPNREE ---2222---------------------iiii------------------------3333 LNWFQEKLREERYLPDFIIKFLREVRKDAQPMDILRTAVSLLGIEDSKNDERTDIKGIKL ---------------------------------------------------3333----- ISKFPTIVANYARLRKGLDIIEPDPKLSHSENFLYMLYGDRPNEIKSKAMDVTLILHIDH -----------------------3333--------------------------------- EMNASTFASLVVASTFSDLYSSIVAGISALKGPLHGGANYEALKMFKEIGSPEKVNDYIL ------------1111------------1111-------3333-----------3333-- NRLSNKQRIMGFGHRVYKTYDPRARILKQYAKLLAEKEGGEIYTLYQIAEKVEEIGIKYL 
--------2222-----------------------------------------------3 GPKGIYPNVDFFSSIVFYSLGFEPDFFPAVFASARVVGWVAHIMEYIKDNKIIRPKAYYK 333--------------1111-1111------------------1111------------ GEIGKKYIPIDSR --------3333- >FORMYL-COENZYME A TRANSFE; SWP:O06644; PDB:1VGRA; TKPLDGINVLDFTHVQAGPACTQMMGFLGANVIKIERRGSGDMTRGWLQDKPNVDSLYFT -1111--------------------1111-------------3333----2222--3333 MFNCNKRSIELDMKTPEGKELLEQMIKKADVMVENFGPGALDRMGFTWEYIQELNPRVIL -----------33333333------1111-----------3333----------1111-- ASVKGYAEGHANEHLKVYENVAQCSGGAAATTGFWDGPPTVSGAALGESNSGMHLMIGIL ------2222-1111------------3333--1111----------------------- AALEIRHKTGRGQKVAVAMQDAVLNLVRIKLRDQQRLERTGILAEYPQAQPNFAFDRDGN ------------------------------------------11113333-----1111- PLSFDNITSVPRGGNAGGGGQPGWMLKCKGWETDADSYVYFTIAANMWPQICDMIDKPEW --3333----------!!!!-------2222--1111------1111---------3333 KDDPAYNTFEGRVDKLMDIFSFIETKFADKDKFEVTEWAAQYGIPCGPVMSMKELAHDPS --3333-33331111-------33331111---------1111----------------- LQKVGTVVEVVDEIRGNHLTVGAPFKFSGFQPEITRAPLLGEHTDEVLKELGLDDAKIKE -----------3333-----------------------2222------3333-------- LHAKQVV ------- >UDP-N-ACETYLGLUCOSAMINE 2; SWP:P27828; PDB:1VGVA; KVLTVFGTRPEAIKAPLVHALAKDPFFEAKVCVTAQHRELDQVLKLFSIVPDYDLNIQPG -----------------------3333-----------------1111------------ QGLTEITCRILEGLKPILAEFKPDVVLVHGDTTTTLATSLAAFYQRIPVGHVEAGLRTGD ------------------------------------------1111-------------1 LYSPWPEEANRTLTGHLAYHFSPTETSRQNLLRENVADSRIFITGNTVIDALLWVRDQVS 111-----------1111-------------1111-1111-------------------- SDKLRSELAANYPFIDPDKKILVTGHRRESFGRGFEEICHALADIATTHQDIQIVYPVHL -----------11111111--------------------------3333----------- NPNVREPVNRILGHVKNVILIDPQEYLPFVWLNHAWLILTDSGGIQEEAPSLGKPVLVRD 3333-------1111---------------------------3333-3333--------- TTERPEAVTAGTVRLVGTDKQRIVEEVTRLLKDENEYQASRAHNPYGDGQACSRILEALK ---3333------------------------------------1111------------- NNRISL ------ >4-diphosphocytidyl-2C-met; SWP:Q5F829; PDB:1VGWA; 
KRKNIALIPAAPKQYVEIGSKTVLEHVLGIFERHEAVDLTVVVVSPEDTFADKVQTAFPQ -----------------!!!!---------1111----------1111---------111 VRVWKNGGQTRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEALARLIEQAGNAA 1---------------------3333--1111-----1111---3333-------1111- EGGILAVPVADTLKRAESGQISATVDRSGLWQAQTPQLFQAGLLHRALAAGITDEASAVE --------------------------2222-----------------------3333--1 KLGVRPLLIQGDARNLKLTQPQDAYIVRLLLD 111--------3333----------------- >SUCCINYL-DIAMINOPIMELATE ; SWP:Q9JYL2; PDB:1VGYA; TETQSLELAKELISRPSVTPDDRDCQKLAERLHKIGFAAEEHFGNTKNIWLRRGTKAPVV ---------------------%%%%-----3333--------!!!!-------------- CFAGHTDVVPTGPVEKWDSPPFEPAERDGRLYGRGAADKTSIACFVTACERFVAKHPNHQ ------------1111---1111---%%%%--2222------------------------ GSIALLITSDEEGDALDGTTKVVDVLKARDELIDYCIVGEPTAVDKLGDIKNGRRGSLSG -----------------3333-----1111---------------2222----------- NLTVKGKQGHIAYPHLAINPVHTFAPALLELTQEVWDEGNEYFPPTSFQISNINGGTGAT ---------11111111----------------------1111----------------- NVIPGELNVKFNFRFSTESTEAGLKQRVHAILDKHGVQYDLQWSCSGQPFLTQAGKLTDV ---------------3333-------------1111------------------------ ARAAIAETCGIEAELSTTGGTSDGRFIKAAQELIELGPSNATIHQINENVRLNDIPKLSA -------------------------3333----------1111-------3333------ VYEGILVRLL ---------- >HYPOTHETICAL UPF0247 PROT; SWP:Q9WVW7; PDB:1VH0A; LKITILAVGKLKEKYWKQAIAEYEKRLGPYTKIDIIEVPDEKAPENMSDKEIEQVKEKEG ---------------------------1111----------------------------- QRILAKIKPQSTVITLEIQGKMLSSEGLAQELNQRMTQGQSDFVFVIGGSNGLHKDVLQR ---11111111-----1111----------------------------1111-------- SNYALSFSKMTFPHQMMRVVLIEQVYRAFKIMRGEAY ------------3333--------------1111--- >3-DEOXY-MANNO-OCTULOSONAT; SWP:P04951; PDB:1VH1A; SFVVIIPARYASTPGKPLVDINGKPIVHVLERARESGAERIIVATDHEDVARAVEAAGGE ---------------1111-iiii---------3333----------------------- VCTRGTERLAEVVEKCAFSDDTVIVNVQGDEPIPATIIRQVADNLAQRQVGATLAVPIHN ------------------1111----------------------1111------------ AEEAFNPNAVKVVLDAEGYALYFSRATIPWDRDRFAEGLETVGDNFLRHLGIYGYRAGFI 
-----3333-----1111-------------3333------------------------- RRYVNWQPSPLEHIELEQLRVLWYGEKIHVAVAQEVPGTGVDTPEDLERVRAE --3333--3333---3333-----------------------3333------- >3-DEOXY-MANNO-OCTULOSONAT; SWP:P44490; PDB:1VH3A; SFTVIIPARPIQHVFEKALQSGASRVIIATDNENVADVAKSFGAEVCTSVNHNSGTERLA ---------------------------------------1111----------------- EVVEKLAIPDNEIIVNIQGDEPLIPPVIVRQVADNLAKFNVNASLAVKIHDAEELFNPNA ---1111-1111------------3333--------1111----------3333--1111 VKVLTDKDGYVLYFSRSVIPYDRDQFNLQDVQKVQLSDAYLRHIGIYAYRAGFIKQYVQW -----1111------------3333----1111----------------3333---1111 APTQLENLEKLEQLRVLYNGERIHVEL --3333----3333------------- >SUFD PROTEIN; SWP:P77689; PDB:1VH4A; NALQQWHHLFEAKRSPQAQQHLQQLLRTGLPTRKHENWKYTPLEGLINSQFVSIAGEISP -------------------------------1111--1111-3333-------------- QQRDALALTLDSVRLVFVDGRYVPALSDATEGSGYEVSINDDRQGLPDAIQAEVFLHLTE -----------------iiii-3333---2222---------1111-------------- SLAQSVTHIAVKRGQRPAKPLLLMHITQGVAGEEVNTAHYRHHLDLAEGAEATVIEHFVS -----------2222-------------------------------2222---------- LNDARHFTGARFTINVAANAHLQHIKLAFENPLSHHFAHNDLLLAEDATAFSHSFLLGGA ----------------2222----------1111-------------------------- VLRHNTSTQLNGENSTLRINSLAMPVKNEVCDTRTWLEHNKGFCNSRQLHKTIVSDKGRA ------------------------------------------------------2222-- VFNGLINVAQHAIKTDGQMTNNNLLMGKLAEVDTKPQLEIYADDVKCSHGATVGRIDDEQ --------2222--------------1111--------------------------3333 IFYLRSRGINQQDAQQMIIYAFAAELTEALRDEGLKQQVLARIGQRLPGG ----1111-----------------3333-----------------2222 >HYPOTHETICAL PROTEIN YDII; SWP:P77781; PDB:1VH5A; SLIWKRKITLEALNAMGEGNMVGFLDIRFEHIGDDTLEATMPVDSRTKQPFGLLHGGASV --------------1111-------------------------1111-1111-------- VLAESIGSVAGYLCTEGEQKVVGLEINANHVRSAREGRVRGVCKPLHLGSRHQVWQIEIF -----------1111!!!!----------------------------------------- DEKGRLCCSSRLTTAILE 1111-------------- >FLAGELLAR PROTEIN FLIS; SWP:P39739; PDB:1VH6A; ATPGELTLLYNGCLKFIRLAAQAIENDDERKNENLIKAQNIIQELNFTLNRNIELSASGA 
---3333----------------1111-----------------------3333------ YDYYRRLVQANIKNDTGLAEVEGYVTDFRDAWKQAI ------------------------------------ >2-C-methyl-D-erythritol 2; SWP:P44815; PDB:1VH8A; SLIRIGHGFDVHAFGEDRPLIIGGVEVPYHTGFIAHSDGDVALHALTDAILGAAALGDIG ---------------------iiii--------------------------1111--111 KLFPKNADSRGLLREAFRQVQEKGYKIGNVDITIIAQAPKRPHIDARAKIAEDLQCDIEQ 1---3333--------------------------------1111------------3333 VNVKATTTEKLGFTGRQEGIACEAVALLIRQ -------iiii3333---------------- >HYPOTHETICAL PROTEIN YBDB; SWP:P15050; PDB:1VH9A; SLIWKRHLTLDELNATSDNTMVAHLGIVYTRLGDDVLEAEMPVDTRTHQPFGLLHGGASA ----------------------1111-----------------3333------------- ALAETLGSMAGFMMTRDGQCVVGTELNATHHRPVSEGKVRGVCQPLHLGRQNQSWEIVVF -----------11112222-----------------------------1111-------- DEQGRRCCTCRLGTAVLG 1111-------------- >PUTATIVE KHG/KDPG ALDOLAS; SWP:P44480; PDB:1VHCA; SYTTQQIIEKLRELKIVPVIALDNADDILPLADTLAKNGLSVAEITFRSEAAADAIRLLR ---------3333----------3333--------1111---------1111-------- ANRPDFLIAAGTVLTAEQVVLAKSSGADFVVTPGLNPKIVKLCQDLNFPITPGVNNPAIE --1111-------------------------------------1111------------- IALEGISAVKFFPAEASGGVKIKALLGPYAQLQIPTGGIGLHNIRDYLAIPNIVACGGSW --------------1111----------1111-------3333------3333-----11 FVEKKLIQSNNWDEIGRLVREVIDIIKE 11-------------------------- >AMINOPEPTIDASE/GLUCANASE ; SWP:P94521; PDB:1VHEA; KLDETLTLKDLTDAKGIPGNEREVRQVKSYIEPFADEVTTDRLGSLIAKKTGAENGPKII --3333----------2222----------3333------1111--------1111---- AGHLDEVGFVTQITDKGFIRFQTVGGWWAQVLAQRVTIVTKKGEITGVIGSKPPHILSPE -------------1111----------1111--------1111---------3333-333 ARKKSVEIKDFIDIGASSREEALEWGVLPGDIVPHFEFTVNNEKFLLAKAWDNRIGCAIA 3-----3333-----------------2222----------3333--------------- IDVLRNLQNTDHPNIVYGVGTVQEEVGLRGAKTAAHTIQPDIAFGVDVGIAGDTPGISEK -----3333-------------3333---------------------------2222333 EAQSKGKGPQIIVYDASVSHKGLRDAVVATAEEAGIPYQFDAIAGGGTDSGAIHLTANGV 3-----------------------------------------1111-3333----!!!!- 
PALSITIATRYIHTHAALHRDDYENAVKLITEVIKKLDRKTVDEITYQEGGSHH ------------------3333-----------1111-------------1111 >SONIC HEDGEHOG; SWP:Q62226; PDB:1VHH; KLTPLAYKQFIPNVAEKTLGASGRYEGKITRNSERFKELTPNYNPDIIFKDEENTGADRL -----2222-----1111-----------11113333------1111---3333-3333- MTQRCKDKLNALAISVMNQWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRSK -------------------2222--------------2222-----------11113333 YGMLARLAVEAGFDWVYYESKAHIHCSVKAENSVAAK --------1111-------1111------3333---- >HYPOTHETICAL PROTEIN YQEU; SWP:P54461; PDB:1VHKA; QRYFIELTKQQIEEAPTFSITGEEVHHIVNVRNEGDQIICCSQDGFEAKCELQSVSKDKV ---------------------3333-------2222-----1111--------------- SCLVIEWTNENRELPIKVYIASGLPKGDKLEWIIQKGTELGAHAFIPFQAARSVVKRERW -------------------------------------------------1111------- TKIAKEAAEQSYRNEVPRVDVHSFQQLLQRQDFDKCVVAYESAFSAIVSSLPKGSSLLIV -------------------------------------------333311112222----- FGPEGGLTEAEVERLTEQDGVTCGLGPRILRTETAPLYALSAISYQTELLR --3333---------1111-----------1111----------------- >PROTEIN YEBR; SWP:P76270; PDB:1VHMA; NKTEFYADLNRDFNALMAGETSFLATLANTSALLYERLTDINWAGFYLLEDDTLVLGPFQ ----------------2222---------------------------------------- GKIACVRIPVGRGVCGTAVARNQVQRIEDVHVFDGHIACDAASNSEIVLPLVVKNQIIGV --------2222------1111------11111111----------------%%%%---- LDIDSTVFGRFTDEDEQGLRQLVAQLEKVLATTDYKKFF ----------------------------3333-3333-- >PUTATIVE FLAVIN OXIDOREDU; SWP:Q9WXV1; PDB:1VHNA; VKVGLAPAGYTDSAFRTLAFEWGADFAFSEVSAKGFLNSQKTEELLPQPHERNVAVQIFG -------------------------------3333----3333----1111--------- SEPNELSEAARILSEKYKWIDLNAGCPVRKVVKEGAGGALLKDLRHFRYIVRELRKSVSG -------------------------------1111---1111------------------ KFSVKTRLGWEKNEVEEIYRILVEEGVDEVFIHTRTVVQSFTGRAEWKALSVLEKRIPTF ----------------------1111---------3333------33333333------- VSGDIFTPEDAKRALEESGCDGLLVARGAIGRPWIFKQIKDFLRSGKYSEPSREEILRTF -------------------------3333------------------------------- ERHLELLIKTKGERKAVVERKFLAGYTKDLKGARRFREKVKIEEVQILKEFYNFIKEVE 
-------------------1111---2222----------------------------- >ENDOGLUCANASE; SWP:Q9X0D8; PDB:1VHOA; ETGKLLELSNLDGPSGYETNVVSYIKSVIEPFVDEAKTTRHGSLIGYKKGKGIGKLAFFA 3333---3333--2222-----------1111------1111------------------ HVDEIGFVVSKVEGQFARLEPVYASKVRIYTKNGIERGVIGLAPHLQDSESRKKVPTYDE ------------!!!!--------------1111-------------33331111----- IFVDLSLCERGVRVGDIAVIDQTAFETNGKVVGKALDNRASCGVLVKVLEFLKRYDHPWD ---3333-----2222----------iiii----3333-------------1111----- VYVVFSVQEETGCLGALTGAYEINPDAAIVDVTFASEPPFSDHIELGKGPVIGLGPVVDR ------1111----------------------------------2222------3333-- NLVQKIIEIAKKHNVSLQEEAVGGRTDFVQLVRNGVRTSLISIPLKYHTPVEVDPRDVEE -------------------------1111--1111------------------------- LARLLSLVAVELE ------------- >ENHANCING LYCOPENE BIOSYN; SWP:P0ABU5; PDB:1VHQA; KKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEATETRN ----------------------------1111---------------------------- VLIEAARITRGEIRPLAQADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQ ----3333iiii--3333-3333------------------11111111----------- AHQAGKPLGFIAPALPKIFDFPLRLTIGTDIDTAEVLEEGAEHVPCPVDDIVVDEDNKIV -1111-----33331111----------------------------1111----1111-- TTPAYLAQNIAEAASGIDKLVSRVLVLAE ----------------------------- >SIMILAR TO PHOSPHINOTHRIC; SWP:NA; PDB:1VHSA; SLTLRLAEHRDLEAVVAIYNSTIASRVTADTEPVTPEDREWFSGHTESRPLYVAEDENGN -------3333----------3333---------3333-------1111------1111- VAAWISFETFYGRPAYNKTAEVSIYIDEACRGKGVGSYLLQEALRIAPNLGIRSLAFIFG ------------3333----------1111---------------3333---------11 HNKPSLKLFEKHGFAEWGLFPGIAEDGKRYDLKILGRELSE 11------3333----------------------------- >DEPHOSPHO-COA KINASE; SWP:P36679; PDB:1VHTA; SLRYIVALTGGIGSGKSTVANAFADLGINVIDADIIARQVVEPGAPALHAIADHFGANMI ----------2222---------1111--------------2222----------1111- AADGTLQRRALRERIFANPEEKNWLNALLHPLIQQETQHQIQQATSPYVLWVVPLLVENS 1111--------------------------------------------------3333-- LYKKANRVLVVDVSPETQLKRTMQRDDVTREHVEQILAAQATREARLAVADDVIDNNGAP 
-1111-------------------------------1111-----3333----------- DAIASDVARLHAHYLQLASQFVSQEKPE --------------------1111---- >HYPOTHETICAL PROTEIN AF15; SWP:O28751; PDB:1VHUA; EVLFEAKVGDITLKLAQGDITQYPAKAIVNAANKRLEHGGGVAYAIAKACAGDAGLYTEI -------!!!!-------1111----------1111----------------3333---- SKKAREQFGRDYIDHGEVVVTPANLEERGIKYVFHTVGPICSGWSEELKEKLYKAFLGPL -------------2222-------3333-------------------------------- EKAEEGVESIAFPAVSAGIYGCDLEKVVETFLEAVKNFKGSAVKEVALVIYDRKSAEVAL -------------22221111--------------------------------------- KVFERSL ------- >DIPHTHINE SYNTHASE; SWP:O29866; PDB:1VHVA; SLLTFVGLGLWDVKDISVKGLEAVREADEVYVEYYTSKLLSSIEEEEFFGKRVVELERSD -----------1111--------------------------3333-1111------3333 LEENSFRLIERAKSKSVVLLVPGDPVATTHSAIKLEAERKGVKTRIIHGASISTAVCGLT -1111-----3333-------------3333------1111---------3333--1111 GLHNYRFGKSATVSWHRSQTPVNVIKANRSIDAHTLLFLDLHPEPTIGHAVENLIAEDAQ --3333----------------------1111---------------------------- KDLYAVGIARAGSGEEVVKCDRLENLKKIDFGKPLHVVVLAKTLHFEFECLREFADAPAE ---------2222--------33331111----------------------------333 LERLV 31111 >PURINE NUCLEOSIDE PHOSPHO; SWP:Q9KPM0; PDB:1VHWA; ATPHINAQMGDFADVVLMPGDPLRAKYIAENFLDNAVQVCDVRNMFGYTGTYKGRRISVM -1111--2222-----------------------------2222-------iiii----- GHGMGIPSCSIYVTELIKDYGVKKIIRVGSCGAVNEGIKVRDVVIGMGACTDSKVNRIRF ----------------------------------11112222-----------------% KDHDFAAIADYKMVKAAEEAAKARGIDVKVGNLFSAELFYTPDPSMFDVMDKYGIVGVEM %%%------------------1111-------------------------1111------ EAAGIYGVAAEYGAKALAICTVSDHIKTGEQTTSEERQNTFNEMIEIALDSVLIGDQ ------------------------3333-------------------------3333 >PUTATIVE HOLLIDAY JUNCTIO; SWP:O34634; PDB:1VHXA; SLRILGLDLGTKTLGVALSDEGWTAQGIETIKINEAEGDYGLSRLSELIKDYTIDKIVLG ---------1111---------------------1111----------1111-------- FPKNNGTVGPRGEASQTFAKVLETTYNVPVVLWDERLTTAAEKLIAADVSRQKRKKVIDK --------------------------------------------1111------------ AAVILQGYLDSLNE -------------- >HYPOTHETICAL PROTEIN HI03; SWP:P44627; 
PDB:1VHYA; IPRIYHPISLENQTQCYLSEDAANHVARVLRTEGEQLELFDGSNHIYPAKIIESNKKSVK ---------2222------------------2222------------------------- VEILGRELADKESHLKIHLGQVIREFTIQKSVELGVNVITPLWSERCGVKLDAERDKKIQ -------------------------------1111--------1111----3333----- QWQKIAIAACEQCGRNIVPEIRPLKLQDWCAENDGALKLNLHPRAHYSIKTLPTIPAGGV ----------------------------1111---------1111--3333----1111- RLLIGSEGGLSAQEIAQTEQQGFTEILLGKRVLRTETASLAAISALQICFGDLGEEG -----3333------------------------------------------1111-- >ADP COMPOUNDS HYDROLASE N; SWP:P45799; PDB:1VHZA; SLSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAVMIVPIVDDH -------------------1111--------1111---------------------%%%% LILIREYAVGTESYELGFSKGLIDPGESVYEAANRELKEEVGFGANDLTFLKKLSMAPSY --------1111-----------2222-----------------------------3333 FSSKMNIVVAQDLYPESLEGDEPEPLPQVRWPLAHMMDLLEDPDFNEARNVSALFLVREW -------------------------------3333-------1111-------------- LKGQGR -1111- >TRANSCRIPTIONAL REGULATOR; SWP:P94548; PDB:1VI0A; PKYQIIDAAVEVIAENGYHQSQVSKIAKQAGVADGTIYLYFKNKEDILISLFKEKGQFIE ----------------3333------------33333333-------------------- REEDIKEKATAKEKLALVISKHFSLLAGDHNLAIVTQLELRQSNLELRQKINEILKGYLN ---3333------------------------------1111------------------- ILDGILTEGIQSGEIKEGLDVRLARQIFGTIDETVTTWVNDQKYDLVALSNSVLELLVSG ---------------2222--------------------%%%%-3333------------ IHNK ---- >FATTY ACID/PHOSPHOLIPID S; SWP:P71018; PDB:1VI1A; SLRIAVDAGGDHAPKAVIDGVIKGIEAFDDLHITLVGDKTTIESHLTTTSDRITVLHADE ---------------------1111--1111------33331111----1111------- VIEPTDEPVRAVRRKKNSSVLAQEVAENRADACISAGNTGALTAGLFIVGRIKGIDRPAL -------3333-------------1111---------3333----------2222----- APTLPTVSGDGFLLLDVGANVDAKPEHLVQYAIGSVYSQQVRGVTSPRVGLLNVGTEDKK -----------------------3333-----------------------------1111 GNELTKQTFQILKETANINFIGNVEARDLLDDVADVVVTDGFTGNVTLKTLEGSALSIFK -3333-----33333333------3333-------------------------------- RDVTSTLVLKPKLKEKKEYSNYGGASLFGLKAPVIKAHGSSDSNAVFRAIRQAREVSQNV 
--------3333-----1111--------------------3333----------1111- AALIQEEVKEEKTDE --------3333--- >SHIKIMATE 5-DEHYDROGENASE; SWP:P28244; PDB:1VI2A; AKYELIGLAYPIRHSLSPEQNKALEKAGLPFTYAFEVDNDSFPGAIEGLKALKRGTGVSP -----------1111----------------------3333--------1111------- NKQLACEYVDELTPAAKLVGAINTIVNDDGYLRGYNTDGTGHIRAIKESGFDIKGKTVLL --3333---------------------iiii----3333-------1111--2222---- GAGGASTAIGAQGAIEGLKEIKLFNRRDEFFDKALAFAQRVNENTDCVVTVTDLADQQAF ---------------------------1111---------------------3333---- AEALASADILTNGTKVGKPLENESLVNDISLLHPGLLVTECVYNPHTKLLQQAQQAGCKT ---1111--------------------1111----------------------------- IDGYGLLWQGAEQFTLWTGKDFPLEYVKQVGFGA -3333----------------------------- >REGULATOR OF RIBONUCLEASE; SWP:Q9KPK1; PDB:1VI4A; RDITPDLCDKYESQVTLLNLPLQNFGQRSAFWGEIVTVRCYHDNSKVRDVLSQNGKGKVL ----------3333-----------------------------------1111-2222-- VVDGHGSCHKALGDQLAILAIKNDWEGVIIYGAVRDVVASEDLGIKALGTSPFKTEKRGA ---iiii-------------1111-----------3333--------------------- GQVNVTLTQNQIVEPGDYLYADWNGILSETALDVAE -------------2222----3333----------- >30S RIBOSOMAL PROTEIN S2P; SWP:O29132; PDB:1VI6A; EYEYLVPPDDYLAAGVHIGTQIKTGDMKKFIFKVRQDGLYVLDIRKLDERIRVAAKFLSR ------3333----1111-----33331111---1111-------------------111 YEPSKILLVAARQYAHKPVQMFSKVVGSDYIVGRFIPGTLTNPMLSEYREPEVVFVNDPA 13333------3333--------------------2222--1111------------333 IDKQAVSEATAVGIPVVALCDSNNSSADVDLVIPTNNKGRRALAIVYWLLAREIAKIRGQ 3--------1111-------1111-1111------------------------------- DFTYSIEDFEAEL ----3333----- >HYPOTHETICAL PROTEIN YIGZ; SWP:P27862; PDB:1VI7A; LMESWLIPAAPVTVVEEIKKSRFITMLAHTDGVEAAKAFVESVRAEHPDARHHCVAWVAG -----------------%%%%----------------------3333------------- APDDSQQLGFSDDGEPAGTAGKPMLAQLMGSGVGEITAVVVRYYGGILLGTGGLVKAYGG ---------------2222----------------------------------------- GVNQALRQLTTQRKTPLTEYTLQCEYHQLTGIEALLGQCDGKIINSDYQAFVLLRVALPA -----1111---------------1111--------1111------------------33 AKVAEFSAKLADFSRGSLQLLAIEEE 33-----------%%%%--------- 
>PYRIDOXAMINE KINASE; SWP:P77150; PDB:1VI9A; LKNILAIQSHVVYGHAGNSAAEFPRRLGANVWPLNTVQFSNHTQYGKWTGVPPSHLTEIV ----------------3333----1111-------------1111------3333----- QGIAAIDKLHTCDAVLSGYLGSAEQGEHILGIVRQVKAANPQAKYFCDPVGHPEKGCIVA ---11113333----------------------------1111----------------- PGVAEFHVRHGLPASDIIAPNLVELEILCEHAVNNVEEAVLAARELIAQGPQIVLVKHLA ----------3333--------------------------------1111---------1 RAGYSRDRFELLVTADEAWHISRPLVDFGRQPVGVGDVTSGLLLVKLLQGATLQEALEHV 111----------1111------------------------------------------- TAAVYEIVTTKAQEYELQVVAAQDRIAKPEHYFSATKLE -----------------33333333-------------- >SHIKIMATE KINASE; SWP:Q9PIB5; PDB:1VIAA; KNIVFIGFGSGKSTLARALAKDLDLVFLDSDFLIEQKFNQKVSEIFEQKRENFFREQEQK ------------------------------------------------------------ ADFFSSCEKACIATGGGFVNVSNLEKAGFCIYLKADFEYLKKRLDKDEISKRPLFYDEIK --------------1111----3333---------33331111-3333------------ AKKLYNERLSKYEQKANFILNIENKNIDELLSEIKKVIK --------------------------------------- >NEUROTOXIN B-IV; SWP:P01525; PDB:1VIB; ASATWGAAYACENNCRKKYDLCIRCQGKWAGKRGKCAAHCIIQKNNCKGKCKKE ------------------1111----------1111------33331111---- >3-DEOXY-MANNO-OCTULOSONAT; SWP:P44490; PDB:1VICA; SFTVIIPARFASSRLPGKPLADIKGKPMIQHVFEKALQSGASRVIIATDNENVADVAKSF --------------2222----iiii3333------1111-----------------111 GAEVCMTSVNHNSGTERLAEVVEKLAIPDNEIIVNIQGDEPLIPPVIVRQVADNLAKFNV 1--------------------------1111--------11113333--------1111- NMASLAVKIHDAEELFNPNAVKVLTDKDGYVLYFSRSVIPYDRDQFMNLQDVQKVQLSDA ----------3333--3333-----1111-------------1111----3333------ YLRHIGIYAYRAGFIKQYVQWAPTQLENLEKLEQLRVLYNGERIHVELAKEVPAVGVDTA -----------------3333----------3333--1111------------------- EDLEKVRAILAANGS --------------- >DIHYDROFOLATE REDUCTASE; SWP:P00383; PDB:1VIE; PSNATFGMGDRVRKKSGAAWQGQIVGWYCTNLTPEGYAVESEAHPGSVQIYPVAALERIN ------2222-------------------3333-------1111-------3333----- >HYPOTHETICAL PROTEIN AF17; SWP:O28478; PDB:1VIMA; SFLEVVSEHIKNLRNHIDLETVGEMIKLIDSARSIFVIGAGRSGYIAKAFAMRLMHLGYT 
----------------------------------------3333----------1111-- VYVVGETVTPRITDQDVLVGISGSGETTSVVNISKKAKDIGSKLVAVTGKRDSSLAKMAD --2222---------------3333------------------------1111--1111- VVMVVKGKMKQERDEILSQLAPLGTMFELTAMIFLDALVAEIMMQKHLTEKDLEARHAVL --------1111---------iiii-----------------------3333-1111-11 EEGG 11-- >RIBOSOMAL SMALL SUBUNIT P; SWP:P45124; PDB:1VIOA; SLRLDKFIAENVGLTRSQATKAIRQSAVKINGEIVKSGSVQISQEDEIYFEDELLTWIEE ----------------------1111---iiii---1111--1111---%%%%------- GQYFMLNKPQGCVCSNDDYPTIYQFFDYPLAGKLHSAGRLDVDTTGLVLLTDDGQWSHRI ----------------------1111--3333--------1111---------------- TSPKHHCEKTYLVTLADPVEENYSAACAEGILLRGEKEPTKPAKLEILDDYNVNLTISEG -3333--------------1111---------2222------------------------ RYHQVKRMFAALGNKVVGLHRWKIGDVVLDESLEEGEYRPLTQSEIEKLV ---------1111----------!!!!--11112222---------1111 >PHOSPHOLIPASE A2; SWP:P81458; PDB:1VIP; NLFQFAEMIVKMTGKNPLSSYSDYGCYCGWGGKGKPQDATDRCCFVHDCCYEKVKSCKPK ---------------3333--------------------------------------333 LSLYSYSFQNGGIVCGDNHSCKRAVCECDRVAATCFRDNLNTYDKKYHNYPPSQCTGTEQ 3-------%%%%--------------------------3333-3333---3333------ C - >ADP-RIBOSE PYROPHOSPHATAS; SWP:P37128; PDB:1VIUA; QQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVL ------------------------------------------------------------ IRQFRVATWVNGNESGQLIESCAGLLDNDEPEVCIRKEAIEETGYEVGEVRKLFELYSPG ----3333----1111---------iiii------------------------------- GVTELIHFFIAEYSDNQREDIEVLELPFSQALEIKTGEIRDGKTVLLLNYLQTSHLD -------------1111---------3333---1111-------------------- >PEPTIDASE T; SWP:P29745; PDB:1VIXA; SLDKLLERFLNYVSLDTQSKAGVRQVPSTEGQWKLLHLLKEQLEEMGLINVTLSEKGTLM ----------------------------3333-----------1111------1111--- ATLPANVPGDIPAIGFISHVDTSPDCSGKNVNPQIVENYRGGDIALGIGDEVLSPVMFPV -----------------------------------------------------3333--- LHQLLGQTLITTDGKTLLGADDKAGIAEIMTALAVLQQKKIPHGDIRVAFTPDEEVGKGA ---2222-----------------------------1111------------1111---- KHFDVDAFDARWAYTVDGGGVGELEFENFNAASVNIKIVGNNVHPGTAKGVMVNALSLAA 
---3333------------2222--------------------33332222--3333--- RIHAEVPADESPEMTEGYEGFYHLASMKGTVERADMHYIIRDFDRKQFEARKRKMMEIAK -3333-33333333-!!!!----------3333--------------------------- KVGKGLHPDCYIELVIEDSYYNMREKVVEHPHILDIAQQAMRDCDIEPELKPIRGGTDGA 1111--3333------------3333---3333--------1111--------------- QLSFMGLPCPNLFTGGYNYHGKHEFVTLEGMEKAVQVIVRIAELTAQRKE -1111---------------1111-------------------3333--- >PCRB PROTEIN HOMOLOG; SWP:O34790; PDB:1VIZA; SLYDVTEWKHVFKLDPNKDLPDEQLEILCESGTDAVIIGGTEDNVLRMMSKVRRFLVPCV ---3333-------1111---------1111----------------------------- LEVSAIEAIVPGFDLYFIPSVLNSKNADWIVGMHQKAMKEYGELMSMEEIVAEGYCIANP ----3333------------1111-3333---------------1111------------ DCKAAALTEADADLNMDDIVAYARVSELLQLPIFYLEYSGVLGDIEAVKKTKAVLETSTL -------------------------------------!!!!--3333----1111----- FYGGGIKDAETAKQYAEHADVIVVGNAVYEDFDRALKTVAAVKGE -------------------------3333---------------- >ALCOHOL DEHYDROGENASE, ZI; SWP:Q9WYR7; PDB:1VJ0A; MGLKAHAMVLEKFNQPLVYKEFEISDIPRGSILVEILSAGVCGSDVHMFRGEDPRVPLPI -----------2222------------2222----------3333--1111-1111---- ILGHEGAGRVVEVNGEKRDLNGELLKPGDLIVWNRGITCGECYWCKVSKEPYLCPNRKVY ------------------1111---2222------------3333----3333-----22 GINRGCSEYPHLRGCYSSHIVLDPETDVLKVSEKDDLDVLAMAMCSGATAYHAFDEYPES 22--------------------1111-----1111------------------------- FAGKTVVIQGAGPLGLFGVVIARSLGAENVIVIAGSPNRLKLAEEIGADLTLNRRETSVE 2222------------------1111-----------------1111-----1111---- ERRKAIMDITHGRGADFILEATGDSRALLEGSELLRRGGFYSVAGVAVPQDPVPFKVYEW ---------iiii----------3333----11112222----------------3333- LVLKNATFKGIWVSDTSHFVKTVSITSRNYQLLSKLITHRLPLKEANKALELMESREALK -3333------------------------33333333----3333--------------- VILYPE ------ >PUTATIVE NADPH-DEPENDENT ; SWP:NA; PDB:1VJ1A; HIIQRVVLNSRPGKNGNPVAENFRVEEFSLLALNEGQVQVRTLYLSVDPYRCKNEDTGTD ------------1111--1111-----------2222---------------------33 YLAPWQLAQVADGGGIGIVEESKHQKLAKGDFVTSFYWPWQTKAILDGNGLEKVDPQLVD 33---2222--------------11112222---------------1111----3333ii 
GHLSYFLGAIGPGLTSLIGVQEKGHISAGSNQTVVSGAAGACGSLAGQIGHLLGCSRVVG ii-3333-------------------2222------1111-3333-----1111------ ICGTQEKCLFLTSELGFDAAVNYKTGNVAEQLREACPGGVDVYFDNVGGDISNTVISQNE ---------------------1111----------1111-------------------22 NSHIILCPPPLPPAVEAIRKERNITRERFTVLNYKDKFEPGILQLSQWFKEGKLKVKETV 22---------------------------33333333----------------------- AKGLENGVAFQSTGGNVGKQIVCISEDSSL --33333333-------------------- >NOVEL MANGANESE-CONTAININ; SWP:Q9X1H0; PDB:1VJ2A; MILKRAYDVTPQKISTDKVRGVRKRVLIGLKDAPNFVMRLFTVEPGGLIDRHSHPWEHEI ----3333-------1111---------1111-----------2222------------- FVLKGKLTVLKEQGEETVEEGFYIFVEPNEIHGFRNDTDSEVEFLCLIPKEGGE ------------------2222----2222------------------3333-- >ADENOMATOUS POLYPOSIS COL; SWP:Q64512; PDB:1VJ6A; MKPGDTFEVELAKTDGSLGISVTGGVNTSVRHGGIYVKAIIPKGAAESDGRIHKGDRVLA -------------%%%%-------3333---------------3333------------- VNGVSLEGATHKQAVETLRNTGQVVHLLLEKGQVP iiii-2222-------------------------- >BIFUNCTIONAL RELA/SPOT; SWP:Q54089; PDB:1VJ7A; INLTGEEVVALAAKYMNETDAAFVKKALDYATAAHFYQVRKSGEPYIVHPIQVAGILADL -----------3333-------------------2222-3333-3333---------111 HLDAVTVACGFLHDVVEDTDITLDNIEFDFGKDVRDIVDGVTKLGHRKMLMAMSKDIRVI 1-----------------------------------------------1111-------- LVKLADRLHNMRTLKQERISRETMEIYAPLAHRLGISRIKWELEDLAFRYLNETEFYKIS --------------3333-------------1111------------------------- HMMNEKRREREALVDDIVTKIKSYTTEQGLFGDVYGRPKHIYSIYRKMRDKKKRFDQIFD -1111--------------------1111---------------------!!!!---111 LIAIRCVMETQSDVYAMVGYIHELWRPMPGRFKDYIAAPKANGYQSIHTTVYGPKGPIEI 1--------------------------2222--3333--1111----------------- QIRTKEMHQVAEYGVAANWIKELVEL -------------------------- >PHOSPHOGLYCERATE KINASE; SWP:Q7SIB7; PDB:1VJDA; SLSNKLTLDKLDVKGKRVVMRVDFNVPMKNNQITNNQRIKAAIPSIKFCLDNGAKSVVLM 1111--3333--2222-------------------3333--3333--------------- SHLGRPDGIPMPDKYSLEPVAVELKSLLGKDVLFLKDCVGPEVEKACADPAAGSVILLEN -----iiii-3333--3333------------------------------2222-----1 
LRFHVEEEGKGKDASGSKVKADPAKIEAFRASLSKLGDVYVNDAFGTAHRAHSSMVGVNL 1113333-----1111----------------1111-------3333----3333----- PKKAGGFLMKKELNYFAKALESPERPFLAILGGAKVADKIQLINNMLDKVNEMIIGGGMA -----------------------------------1111-----------------3333 FTFLKVLNNMEIGTSLFDEEGSKIVKDLMSKAEKNGVKITLPVDFVTADKFDENAKTGQA -----------!!!!--3333-----------1111---------------1111----- TVASGIPAGWMGLDCGPESSKKYSEAVARAKQIVWNGPVGVFEWEAFAQGTKALMDEVVK 3333--2222------------------------------3333---------------- ATSRGCITIIGGGDTATCCAKWNTEDKVSHVSTGGGASLELLEGKVLPGVDALSNV -1111-------3333---11111111-------------3333------------ >AUTOINDUCER-2 PRODUCTION ; SWP:Q9RRU8; PDB:1VJEA; ESFDLDHTKVKAPYVRLAGVKTTPKGDQISKYDLRFLQPNQGAIDPAAIHTLEHLLAGYM 1111-1111-------------1111-----------2222------------------3 RDHLEGVVDVSPMGRTGMYMAVIGEPDEQGVMKAFEAALKDTAGHDQPIPGVSELECGNY 333-------------------------------------------------3333--11 RDHDLAAARQHARDVLDQGLKVQETILL 11-------------------------- >DNA-BINDING PROTEIN, PUTA; SWP:Q9ABV9; PDB:1VJFA; KTRADLFAFFDAHGVDHKTLDHPPVFRVEEGLEIKAAPGGHTKNLFLKDAKGQLWLISAL ----------1111----------------------------------1111-------1 GETTIDLKKLHHVIGSGRLSFGPQELETLGVTPGSVTAFGLINDTEKRVRFVLDKALADS 111--33333333------------------2222-3333---1111--------3333- DPVNFHPLKNDATTAVSQAGLRRFLAALGVEPIVDFAAEVVG -------------------------1111------------- >PUTATIVE LIPASE FROM THE ; SWP:Q8YWS4; PDB:1VJGA; SKTQIRICFVGDSFVNGTGDPECLGWTGRVCVNANKKGYDVTYYNLGIRRDTSSDIAKRW -----------3333-2222-------------3333----------2222--------- LQEVSLRLHKEYNSLVVFSFGLNDTTLENGKPRVSIAETIKNTREILTQAKKLYPVLISP ---1111-1111--------3333---iiii----------------------------- APYIEQQDPGRRRRTIDLSQQLALVCQDLDVPYLDVFPLLEKPSVWLHEAKANDGVHPQA ----1111------------------1111-----3333--------------------- GGYTEFARIVENWDAWLNWF -------------3333--- >BET V I ALLERGEN FAMILY; SWP:P0C0B0; PDB:1VJHA; STLKGALSVKFDVKCPADKFFSAFVEDTNRPFEKNGKTEIEAVDLVKKTTIQSGSEIQKY --------------------------------1111-----------------3333--- 
FKTLKGSIAVTPIGVGDGSHVVWTFHFEKVHKDIDDPHSIIDESVKYFKKLDEAILNF ------------------------------1111------------------------ >12-OXOPHYTODIENOATE REDUC; SWP:Q8LAH7; PDB:1VJIA; SVPLLTPYKMGRFNLSHRVVLAPLTRQRSYGNVPQPHAAIYYSQRTTPGGFLITEATGVS -3333----!!!!---------------2222--3333----11112222---------1 DTAQGYQDTPGIWTKEHVEAWKPIVDAVHAKGGIFFCQIWHVGRVSNSGFQPNGKAPISC 111--------------------------------------!!!!-33332222------ SDKPLMPQIRSNGIDEALFTPPRRLGIEEIPGIVNDFRLAARNAMEAGFDGVEIHGANGY -----3333----------------33331111-----------------------iiii LIDQFMKDTVNDRTDEYGGSLQNRCKFPLEIVDAVAKEIGPDRVGIRLSPFADYMESGDT ------1111----1111---------------------3333-----1111-%%%%--- NPGALGLYMAESLNKYGILYCHVIEARMHTLMPMRKAFKGTFISAGGFTREDGNEAVSKG -------------1111------------------------------------------- RTDLVAYGRWFLANPDLPKRFQVDAPLNKYDRPTFYTSDPVVGYTDYPFLE ------------------------------3333------2222------- >PROTEIN-GLUTAMINE GLUTAMY; SWP:Q08188; PDB:1VJJA; AALGVQSINWQTAFNRQAHHTDKFSSQELILRRGQNFQVLMIMNKGLGSNERLEFIVSTG --------------------3333-----------------------1111--------- PYPSESAMTKAVFPLSNGSSGGWSAVLQASNGNTLTISISSPASAPIGRYTMALQIFSQG ---3333-----------------------!!!!-------1111------------iii GISSVKLGTFILLFNPWLNVDSVFMGNHAEREEYVQEDAGIIFVGSTNRIGMIGWNFGQF i-------------1111--1111---------------------1111--------111 EEDILSICLSILDRSLNFRRDAATDVASRNDPKYVGRVLSAMINSNDDNGVLAGNWSGTY 1--------3333----------------------------------------------2 TGGRDPRSWNGSVEILKNWKKSGLSPVRYGQCWVFAGTLNTALRSLGIPSRVITNFNSAH 222-3333-----------1111-------3333-------------------------- DTDRNLSVDVYYDPMGNPLDKGSDSVWNFHVWNEGWFVRSDLGPSYGGWQVLDATPQERS ------------1111----------------------11113333-------------i QGVFQCGPASVIGVREGDVQLNFDMPFIFAEVNADRITWLYDNTTGKQWKNSVNSHTIGR iii---------------------------------------1111-------------- YISTKAVGSNARMDVTDKYKYPEGSDQERQVFQKALGKLKPEPSIIGKLKVAGMLAVGKE -----2222-----3333---2222----------------------------------- VNLVLLLKNLSRDTKTVTVNMTAWTIIYNGTLVHEVWKDSATMSLDPEEEAEHPIKISYA 
--------------------------1111---------------2222--------333 QYERYLKSDNMIRITAVCKVPDESEVVVERDIILDNPTLTLEVLNEARVRKPVNVQMLFS 33333-3333---------2222------------------------2222--------- NPLDEPVRDCVLMVEGSGLLLGNLKIDVPTLGPKERSRVRFDILPSRSGTKQLLADFSCN ---------------2222------------2222-----------------------11 KFPAIKAMLSIDVAE 11------------- >MOLYBDOPTERIN CONVERTING ; SWP:Q8U3C7; PDB:1VJKA; SVKVKVKYFARFRQLAGVDEEEIELPEGARVRDLIEEIKKRHEKFKEEVFGEGYDEDADV -------------3333--------22223333--------3333---------1111-- NIAVNGRYVSWDEELKDGDVVGVFPPVS ---iiii--1111--2222--------- >HYPOTHETICAL PROTEIN TM01; SWP:Q9WY07; PDB:1VJLA; HRKAWVKTLALDRVSNTPVVILGIEGTNRVLPIWIGACEGHALALAEKEFPRPLTHDLLL -----------------------2222--------------------------------- SVLESLEARVDKVIIHSLKDNTFYATLVIRDLTAALIDIDSRPSDAIILAVKTGAPIFVS ------------------------------------------------------------ DNLVEKHSIELEVNERDLIN -------------------- >Zn-dependent hydrolase of; SWP:Q9WY50; PDB:1VJNA; HKITWFGHACFALEEGKTIVTDPFDPIPNVTADVVTESHQHNAHHLVKGNFRVIDRPGAY -------------------------------------------1111------------- TVNGVKIKGVETFHDGKNIVFVFEGEGIKVCHLGDLGHVLTPAQVEEIGEIDVLLVPVGG -iiii-------------------%%%%-------------------------------- TYTIGPKEAKEVADLLNAKVIIPHYKTKYLKFNLLPVDDFLKLFDSYERVGNILELFEKP -------------1111---------1111-----33333333----------------- KERKVVVEVQ ---------- >ALANINE--GLYOXYLATE AMINO; SWP:Q8YY48; PDB:1VJOA; ISINDNQRLQLEPLEVPSRLLLGPGPSNAHPSVLQANVSPVGHLDPAFLALDEIQSLLRY ----1111---------------------------------1111--------------- VWQTENPLTIAVSGTGTAAEATIANAVEPGDVVLIGVAGYFGNRLVDAGRYGADVRTISK --------------3333---------2222----------------3333--------- PWGEVFSLEELRTALETHRPAILALVHAETSTGARQPLEGVGELCREFGTLLLVDTVTSL -------------------------------------2222----1111---------22 GGVPIFLDAWGVDLAYSCSQKGLGCSPGASPFTSSRAIEKLQRRRTKVANWYLDNLLGKY 22---3333-----------1111----------------1111-----3333--3333- WGSERVYHHTAPINLYYALREALRLIAQEGLANCWQRHQKNVEYLWERLEDIGLSLHVEK 
-3333--------------------------------------------1111-----33 EYRLPTLTTVCIPDGVDGKAVARRLLNEHNIEVGGGLGELAGKVWRVGLGFNSRKESVDQ 33-1111-----2222--------------------!!!!---------11113333--- LIPALEQVLR ------1111 >myo-inositol-1-phosphate ; SWP:NA; PDB:1VJPA; HMVKVLILGQGYVASTFVAGLEKLRKGEIEPYGVPLARELPIGFEDIKIVGSYDVDRAKI -----------------------1111---2222-!!!!---3333---------3333- GKKLSEVVKQYWNDVDSLTSDPEIRKGVHLGSVRNLPIEAEGLEDSMTLKEAVDTLVKEW --3333-----1111-------------!!!!1111-----1111--------------- TELDPDVIVNTCTTEAFVPFGNKEDLLKAIENNDKERLTATQVYAYAAALYANKRGGAAF ---------------------3333----11113333----------------------- VNVIPTFIANDPAFVELAKENNLVVFGDDGATGATPFTADVLSHLAQRNRYVKDVAQFNI -------1111----------------------------------1111----------- GGNMDFLALTDDGKNKSKEFTKSSIVKDILGYDAPHYIKPTGYLEPLGDKKFIAIHIEYV --3333----------3333-----------------------3333------------- SFNGATDELMINGRINDSPALGGLLVDLVRLGKIALDRKEFGTVYPVNAFYMKNPGPAEE -iiii------------------------------1111----3333---------1111 KNIPRIIAYEKMRIWAGLKPKW ---------------------- >DESIGNED PROTEIN; SWP:NA; PDB:1VJQA; KTIFVIVPTNEEQVAFLEALAKQDELNFDWQNPPTEPGQPVVILIPSDVEWFLELKAKGI -------------------3333-2222-------2222---------3333---1111- PFTVYVEEGGS ----------- >4-NITROPHENYLPHOSPHATASE; SWP:NA; PDB:1VJRA; HVLDKIELFILDDGTFYLDDSLLPGSLEFLETLKEKNKRFVFFTNNSSLGAQDYVRKLRN 1111------------------2222-------1111------------3333------- GVDVPDDAVVTSGEITAEHLKRFGRCRIFLLGTPQLKKVFEAYGHVIDEENPDFVVLGFD ----1111--------------------------------1111---------------1 KTLTYERLKKACILLRKGKFYIATHPDINCPSKEGPVPDAGSIAAIEASTGRKPDLIAGK 111----------3333--------------1111------------------------- PNPLVVDVISEKFGVPKERAVGDRLYTDVKLGKNAGIVSILVLTGETTPEDLERAETKPD ---------------3333----3333--------------------------------- FVFKNLGELAKAVQ ---------3333- >ALPHA-AMYLASE; SWP:P06278; PDB:1VJS; LNGTLMQYFEWYMPNDGQHWKRLQNDSAYLAEHGITAVWIPPAYKGTSQADVGYGAYDLY ---------1111-----------------1111-------------1111------111 DLGEFHQKGTVRTKYGTKGELQSAIKSLHSRDINVYGDVVINHKGGADATEDVTAVEVDP 
1-----%%%%--3333------------1111--------------------------33 ADRNRVISGEHLIKAWTHFHFPGRGSTYSDFKWHWYHFDGTDWDESRKLNRIYKFQGKAY 33------------------3333---------3333-------1111-----------3 DYLMYADIDYDHPDVAAEIKRWGTWYANELQLDGFRLDAVKHIKFSFLRDWVNHVREKTG 333-----1111--------------------------3333------------------ KEMFTVAEYWQNDLGALENYLNKTNFNHSVFDVPLHYQFHAASTQGGGYDMRKLLNSTVV --------------------------------------------iiii--3333---333 SKHPLKAVTFVDNHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQVFYGDMYGTKGDS 3-1111----------2222------3333------------------3333-------- QREIPALKHKIEPILKARKQYAYGAQHDYFDHHDIVGWTREGDSSVANSGLAALITDGPG ------3333--------------------------------3333-------------- GAKRMYVGRQNAGETWHDITGNRSEPVVINSEGWGEFHVNGGSVSIYVQ -------3333------3333--------1111---------------- >ALPHA-GLUCOSIDASE; SWP:NA; PDB:1VJTA; HKISIIGAGSVRFALQLVGDIAQTEELSREDTHIYDVHERRLNASYILARKYVEELNSPV ------3333-------------3333-1111---------------------------- KIVKTSSLDEAIDGADFIINTAYPYDPRYHDSGSQRWDEVTKVGEKHGYYRGIDSQELNV ------3333-2222----------3333--3333---------11112222---22222 STYTYVLSSYPDKLALEIAEKKKAPKAYLQTANPVFEITQAVRRWTGANIVGFCHGVAGV 222-----3333-----------1111--------------------------------- YEVFEKLDLDPEEVDWQVAGVNHGIWLNRFRYRGEDAYPLLDEWIEKKLPEWEPKNPWDT ----1111-1111-------2222-------%%%%------------3333----1111- QSPAADYKFYGLPIGDTVRNGSWKYHYNLETKKKWFGKFGGIDNEVERPKFHEQLRRARE -------------!!!!----3333-----------1111-------------------- RLIKLAEEVQQNPGKLTEEHPEIFPKGKLSGEQHIPFINAIANNKRVRLFLNVENQGTLK --------------3333--1111------------------------------%%%%11 DFPDDVVELPVWVDCCGIHREKVEPDLTHRIKIFYLWPRILREWNLEAYISRDRKVLEEI 111111-------1111-------------------------------33333333---- LIRDPRTKSYEQIVQVLDEIFNLPFNEELRRYYKE ---1111---------------1111---3333-- >HYPOTHETICAL PROTEIN; SWP:P84155; PDB:1VJUA; HSLAVEAVKDFLLKLQDDICEALEAEDGQATFVEDKWTREGGGGGRTRVVDGAVIEKGGV --------------------------------------2222------------------ NFSHVYGKGIAGCNFEAGVSLVIHPKNPHVPTSHANVRLFVAEREGKEPVWWFGGGFDLT 
---------1111-------------1111-------------2222------------- PYYAVEEDCRDFHQVAQDLCKPFGADVYARFKGWCDEYFFIPYRNEARGIGGLFFDDLNE -------------------33331111--------------1111--------------- WPFEKCFEFVQAVGKGYDAYIPIVNRRKNTPYTEQQVEFQEFRRGRYAEFNLVIDRGTKF --------------------------1111------------------------------ GLQSGGRTESILISLPPRARWGYNWQPEPGTPEARLTEYFLTKRQWV ------3333-3333------2222--22223333---1111----- >UBIQUITIN CARBOXYL-TERMIN; SWP:P43593; PDB:1VJVA; QFAQLPVGFKNGNTCYLNATLQALYRVNDLRDILNYNPSQGVSNSGAQDEEIHKQIVIEK -------------3333----------3333-11113333--------3333-------- RCFENLQNKFKSVLPVVLLNTLRKCYPQFAERDFYKQQDAEELFTQLFHSSIVFGDKFSE -------------------------3333-------------------------333311 DFRIQFKTTIKDTANDNDITVKENESDSKLQCHISGTTNFRNGLLEGLNEKANSIYSVEK 11---------1111-------------------1111-------1111----------- KISRLPKFLTVQYVRFFWKRSTNKKSKILRKVVFPFQLDVADLTPEYAAEKVKVRDELRK -------------------1111------------------------------------- VEKEKNEKVTPREQYETQVALNESEKDQWLEEYKKHFPPNLEKGENPSCVYNLIGVITHQ ---------------------------------1111----2222--------------- GANSESGHYQAFIRDELDENKWYKFNDDKVSVVEKEKIESLAGGGESDSALILYKGFGL --1111-----------1111----!!!!----3333------------------2222 >FERREDOXIN(A); SWP:P46797; PDB:1VJW; MKVRVDADACIGCGVCENLCPDVFQLGDDGKAKVLQPETDLPCAKDAADSCPTGAISVE -----3333----------1111---1111----------3333--------------- >putative ferritin-like di; SWP:NA; PDB:1VJXA; HKVSDILTVAIRLEEEGERFYRELSEHFNGEIKKTFLELADQERIHAEIFRKSDQENWDE -----------------------1111-----------------------------3333 VDSYLAGYFYEVFPDTSEILRRKDLTLKEVLDIAISVEKDSIILYYELKDGLVNSDAQKT ---------------3333-----------------------------1111-3333--- VKKIIDQEKEHLRKLLEKREST ---------------------- >TGF-BETA RECEPTOR TYPE I; SWP:P36897; PDB:1VJYA; IARTIVLQESIGKGRFGEVWRGKWRGEEVAVKIFSSREERSWFREAEIYQTVMLRHENIL 3333-------------------iiii-------1111------------2222-1111- GFIAADNKDNGTWTQLWLVSDYHEHGSLFDYLNRYTVTVEGMIKLALSTASGLAHLHMEI ----------------------3333---------------------------------- 
VGTQGKPAIAHRDLKSKNILVKKNGTCCIADLGLAVRHDSATDTIDIRVGTKRYMAPEVL --------------1111---1111------1111----1111-------3333----11 DDSINMKHFESFKRADIYAMGLVFWEIARRCSIGGIHEDYQLPYYDLVPSDPSVEEMRKV 11--11113333---------------1111-iiii-----2222--------------- VCEQKLRPNIPNRWQSCEALRVMAKIMRECWYANGAARLTALRIKKTLSQLSQQEGIKM ----------3333-------------------3333---------------------- >ENDOGLUCANASE; SWP:Q9X274; PDB:1VJZA; IPRWRGFNLLEAFSIKSTGNFKEEDFLWAQWDFNFVRIPCHLLWSDRGNPFIIREDFFEK -------------1111----3333---1111----------------1111-3333--- IDRVIFWGEKYGIHICISLHRAPGYSVNKEVEEKTNLWKDETAQEAFIHHWSFIARRYKG ---------------------2222----------3333------------------222 ISSTHLSFNLINEPPFPDPQISVEDHNSLIKRTITEIRKIDPERLIIIDGLGYGNIPVDD 23333------------3333-------------------1111-------%%%%--111 LTIENTVQSCRGYIPFSVTHYKAEWVDSKDFPVPEWPNGWHFGEYWNREKLLEHYLTWIK 1-------------3333-2222----1111---------iiii---------------- LRQKGIEVFCGEGAYNKTPHDVVLKWLEDLLEIFKTLNIGFALWNFRGPFGILDSERKDV -1111------------------------------------------1111--------- EYEEWYGHKLDRKLELLRKY ----iiii--------3333 >HYPOTHETICAL PROTEIN; SWP:Q9FNG3; PDB:1VK0A; SASFDGPKFKTDGSYVQTKTIDVGSSTDISPYLSLIREDSILNGNRAVIFDVYWDVGFTK -------------------------------------------%%%%-----------11 TSGWSLSSVKLSTRNLCLFLRLPKPFHDNLKDLYRFFASKFVTFVGVQIEEDLDLLRENH 11----------3333-------------------1111--------------------- GLVIRNAINVGKLAAEARGTLVLEFLGTRELAHRVLWSDLGQLDSIEAKWEKAGPEEQLE -------------------3333----------------------33331111------- AAAIEGWLIVNVWDQLSDE ------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8U3S5; PDB:1VK1A; IPVKKVEYVFIELDKMPHEQLVQRELEDFIESVTGSGIFWKPMLLAKIPGTDEYLIVDGH -----------1111------------------1111----------2222--------- HRWAGLQKLGAKRAPSVILDYFDEGVKVYTWYPAFGDVNVIERLKAEGLEVIEDEKAEEA -------------------1111---------------------1111-----1111--- EGEIAFALIGEKSFAIPGGLEEQKVSKVLDEMDQAEIELVYYGLKEDAKADMEKGEIDYV ------------------3333-------------------------------------- FIRAPTKEEVMELVKRGEVFSPTTRHVLPFIPDKIDVKLEDLF 
-------------1111--------------------3333-- >URACIL-DNA GLYCOSYLASE TM; SWP:Q9WYY1; PDB:1VK2A; YTREELEIVSERVKKCTACPLHLNRTNVVVGEGNLDTRIVFVGEGPGEEEDKTGRPFVGR -3333-------1111----3333---------1111---------------------33 AGLLTELLRESGIRREDVYICNVVKCRPPNNRTPTPEEQAACGHFLLAQIEIINPDVIVA 33------1111-3333----------2222----------------------------- LGATALSFFVDGKKVSITKVRGNPIDWLGGKKVIPTFHPSYLLRNRSNELRRIVLEDIEK -----3333iiii--33332222----iiii------3333------------------- AKSFIKKE -1111--- >PHOSPHORIBOSYLFORMYLGLYCI; SWP:Q9X0X3; PDB:1VK3A; KLRYLNILKEKLGREPTFVELQAFSVWSEHCGYSHTKKYIRRLPKTGFEGNAGVVNLDDY -------------------------------------3333---------2222------ YSVAFKIESHNHPSAIEPYNGAATGVGGIIRDVLAGARPTAIFDSLHSRIIDGIIEGIAD ----------3333------------------------------------3333------ YGNSIGVPTVGGELRISSLYAHNPLVNVLAAGVVRNDLVDSKASRPGQVIVIFGGATGRD ----------------3333------------------------2222------------ GTKLSIQVGDPFAEKLIEAFLEVEEGLVEGAQDLGAGGVLSATSELVAKGNLGAIVHLDR -3333-----------------1111--------2222---------1111-----3333 VPLREPDEPWEILISESQERAVVTSPQKASRILEIARKHLLFGDVVAEVIEEPVYRVYRN -------3333-------------1111--------1111-----------------!!! 
DLVEVPVQLLANAPEEDIVEYTPGKIPEFKRVEFEEVNAREVFEQYDHVGTDTVVPPGFG !----1111----------------------------33333333----------3333- AAVRIKRDGGYSLVTHSRADLALQDTYWGTLIAVLESVRKTLSVGAEPLAITNCVNYGDP -----1111--------3333--------------------1111-------------33 DVDPVGLSATALKNACEFSGVPVASGNASLYNTYQGKPIPPTLVVGLGKVNPQKVAKPKP 33-------------------------------iiii-------------3333------ SKVFAVGWNDFELEREKELWRAIRKLSEEGAFILSSSQLLTRTHVETFREYGLKIEVKLP -----------1111-----------1111-----1111-------3333---------- EVRPAHQVLVFSERTPVVDVPVKEIGTLSR ------------------------------ >PFKB CARBOHYDRATE KINASE ; SWP:NA; PDB:1VK4A; HITFIGHVSKDVNVVDGKREIAYGGGVVGAITSSLLGVKTKVITKCTREDVSKFSFLRDN ------------------------3333------------------3333---------- GVEVVFLKSPRTTSIENRYTRESFLISAADPFTESDLAFIEGAVHINPLWYGEFPEDLIP --------------------------------33331111--------------3333-- VLRRKVFLSADAQGFVRVPENEKLVYRDWEKEKYLKYLDLFKVDSREAETLTGTNDLRES -3333-----3333-----iiii-------33331111---------------------- CRIIRSFGAKIILATHASGVIVFDGNFYEASFRSWSLEGRTGRGDTCTAAFLVGFVFKKS ----1111-------1111----------------33332222----------------- IEKATKFAAAVTSVKRHPGPLRREDLEAIS ---------------------33331111- >EXPRESSED PROTEIN; SWP:Q9LUJ3; PDB:1VK5A; GSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRWDQ 2222--------3333-------------------------------------------1 SRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFIDPIFP 111-------1111---------------------------------11111111----- E - >NADH PYROPHOSPHATASE; SWP:P32664; PDB:1VK6A; HDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQGEPVWLVQQQ -----1111-------%%%%--2222-----3333--2222-------%%%%-------- RRHDGSVRQVIDLDVGLFQLAGRGVQLAEFYRSHKYCGYCGHEYPSKTEWALCSHCRERY -----33333333-3333------------------------------------------ YPQIAPCIIVAIRRDDSILLAQHTRHRNGVHTVLAGFVEVGETLEQAVAREVEESGIKVK -------------!!!!-----3333------------2222------------------ NLRYVTSQPWPFPQSLTAFAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTVARRLIE -----------------------------3333-------1111-----2222------- DTVACRAEY ----3333- 
>HYPOTHETICAL PROTEIN TM04; SWP:Q9WYV6; PDB:1VK8A; PKVTVSIKVVPAVEDGRLHEVIDRAIEKISSWGKYEVGPSNTTVEGEFEEIDRVKELARY -------------3333-----------1111-----1111-----3333---------3 LEQFAKRFVLQLDIDYKAGGITIEEKVSKYR 333-------------2222-3333-3333- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1VK9A; HVEKNLLRSALKIFEKKDLSLLAYSGRSIFESKDSGLKPVVELFKRFDNLEGSLVIDKVG --1111--------1111-------------------------------2222------- KAAASFLLKKPDHIHAKVISKPALKLNEYGQSFSYDEKIPFVLGKDGKSCPFEKLVLEDD --------------------------1111-------------3333------------- PEEIIRIVLSKF ------------ >HYPOTHETICAL PROTEIN; SWP:NA; PDB:1VKBA; HMAHIFVYGTLKRGQPNHKVMLDHSHGLAAFRGRGCTVESFPLVIAGEHNIPWLLYLPGK --------1111--111133333333--------------------1111------2222 GHCVTGEIYEVDEQMLRFLDDFEDCPSMYQRTALQVQVLEWEDPGDSVQCFVYTTATYAP -------------------------3333-----------------------------33 EWLFLPYHESYDSEGPHGLRYNPRENR 33---------1111-------1111- >PUTATIVE ACETYL TRANSFERA; SWP:Q8U4Q2; PDB:1VKCA; EYTIVDGEEYIEEIKKLDREISYSFVRFPISYEEYEERHEELFESLLSQGEHKFFVALNE ------3333-----------3333--------------------3333---------11 RSELLGHVWICITLDTVDYVKIAYIYDIEVVKWARGLGIGSALLRKAEEWAKERGAKKIV 11----------------------------1111-----------------1111----- LRVEIDNPAVKWYEERGYKARALIMEKPI ---1111------1111------------ >CONSERVED HYPOTHETICAL PR; SWP:Q9X0V2; PDB:1VKDA; HKVFTEKIPNIPWEERPEGYTGPVWRYSKNPIIGRNPVPKGARVFNSAVVPYNGEFVGVF -------1111-----2222------1111-------1111----------iiii----- RIDHKNTRPFLHFGRSKDGINWEIEPEEIQWVDVNGEPFQPSYAYDPRVVKIEDTYYITF --------------------------------1111---------------!!!!----- CTDDHGPTIGVGTKDFKTFVRLPNAYVPFNRNGVLFPRKINGKYVLNRPSDNGHTPFGDI ---------------------------------------iiii----------------- FLSESPDIHWGNHRFVLGRSSYNWWENLKIGAGPYPIETSEGWLLIYHGVTLTCNGYVYS ----------------------3333------------1111----------1111---- FGAALLDLDDPSKVLYRSRYYLLTPEEEYETVGFVPNVVFPCAALCDADTGRVAIYYGAA ------3333----------------1111----------------------------%% DTHVALAFGYIDEIVDFVKRNS %%----------------1111 >carboxymuconolactone deca; SWP:Q9X1V5; 
PDB:1VKEA; KKFVEARRELNEKVSRGTLNTKRFFNLDSAVYRPGKLDVKTKELMGLVASTVLRCDDCIR ----------------------------1111---------------------------- YHLVRCVQEGASDEEIFEALDIALVVGGSIVIPHLRRAVGFLEELREMEKNGETISL ------1111-----------------3333-----------------1111----- >glycerol uptake operon an; SWP:Q9X1F0; PDB:1VKFA; FKGIIAALWDDSIGEIEPDVVFLLKSDILNLKFHLKILKDRGKTVFVDDFVNGLGEGEEA --------------------------3333--------1111--------2222------ ILFVKKAGADGIITIKPKNYVVAKKNGIPAVLRFFALDSKAVERGIEQIETLGVDVVEVL -----------------------1111--------------------------------- PGAVAPKVARKIPGRTVIAAGLVETEEEAREILKHVSAISTSSRILWK 3333-------2222----------------3333-------3333-- >PUTATIVE SERINE HYDROLASE; SWP:Q04066; PDB:1VKHA; HTVRAISPDITLFNKTLTFQEISQNTREAVIYIHGGAWNDPENTPNDFNQLANTIKSDTE --------11111111------1111---------iiii1111--------------111 STVCQYSIEYRLSPEITNPRNLYDAVSNITRLVKEKGLTNINVGHSVGATFIWQILAALK 1----------------------------------------------------------- DPQEKSEAQLQLGLLQIVKRVFLLDGIYSLKELLIEYPEYDCFTRLAFPDGIQYEEEPSR -33333333-----1111------------------------3333-1111-----3333 VPYVKKALSRFSIDHLVHSYSDELLTLRQTNCLISCLQDYQLSFKLYLDDLGLHNDVYKN --------1111------1111------------------------------1111---- GKVAKYIFDNIC -------1111- >HYPOTHETICAL PROTEIN ATU3; SWP:NA; PDB:1VKIA; SRKTATELFEFLDGLGISHTTKQHEPVFTVAESQSLRDLIPGGHTKNLFVKDKKDQYFVL ------------1111--------------------1111-----------1111----- TVEENAVVDLKSVHKTIGAASRVSFGRPEKMLEYLGVVPGSVTVFGAINDTARQVTFVLD --1111--3333-------------------------2222---3333-1111------3 SDLLENELVNGHPLSNDQTTTIASKDLIRFLEATGHAPLVLKVSE 333-------------------3333-----1111---------- >heparan sulfate (glucosam; SWP:O35310; PDB:1VKJA; STQQLPQTIIIGVRKGGTRALLEMLSLHPDVAAAENEVHFFDWEEHYSQGLGWYLTQMPF ------------2222-----------1111--------1111--3333-----1111-- SSPHQLTVEKTPAYFTSPKVPERIHSMNPTIRLLLILRDPSERVLSDYTQVLYNHLQKHK -1111-------33333333-------1111----------------------------- PYPPIEDLLMRDGRLNLDYKALNRSLYHAHMLNWLRFFPLGHIHIVDGDRLIRDPFPEIQ 
---3333---iiii----3333-------3333-----3333------------------ KVERFLKLSPQINASNFYFNKTKGFYCLRDSGKDRCLHESKGRAHPQVDPKLLDKLHEYF ------------3333----3333-------------1111-------3333-------- HEPNKKFFKLVGRTFDWH ------------------ >GLIA MATURATION FACTOR GA; SWP:Q9ERL7; PDB:1VKKA; VVCEVDPELKETLRKFRFRKETNNAAIIMKVDKDRQMVVLEDELQNISPEELKLELPERQ ---------------1111-------------1111----------------3333---- PRFVVYSYKYVHDDGRVSYPLCFIFSSPVGCKPEQQMMYAGSKNRLVQTAELTKVFEIRT -----------1111------------1111----------------------------3 TDDLTETWLKEKLAFFR 333---------3333- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1VKMA; HVIIESRIEKGKPVVGETTVFVHGLPRKEAIELFRRAKEISREKGFQLAVIGILKGKIVA --------1111-----3333--------------------------------iiii--- GSEEELEAREGADKVGTREIPIVVAEGKNAATTVSATIFLSRRIGIEVVVTGGTGGVHPG ---------------3333--------------------------------------222 RVDVSQDLTESSSRAVLVSSGIKSILDVEATFELETLEIPLVGFRTNEFPLFFSRKSGRR 2---------------------1111--------1111------------!!!!------ VPRIENVEEVLKIYESKEELEKTLVLNPVPEEYEIPHDEIERLLEKIELEVEGKEVTPFL -----------------------------1111--3333----1111----!!!!----- LKKLVETNGRTLKANLALLEENVKLAGEIAVKLKR ----------------------------------- >N-ACETYL-GAMMA-GLUTAMYL-P; SWP:Q9X2A2; PDB:1VKNA; HIRAGIIGATGYTGLELVRLLKNHPEAKITYLSSRTYAGKKLEEIFPSTLENSILSEFDP -------1111-----------------------1111--3333-3333----------- EKVSKNCDVLFTALPAGASYDLVRELKGVKIIDLGADFRFDDPGVYREWYGKELSGYENI --------------2222---3333----------------------------2222--- KRVYGLPELHREEIKNAQVVGNPGCYPTSVILALAPALKHNLVDPETILVDAKSGVSGAG -----3333---3333---------------------1111--------------3333- RKEKVDYLFSEVNESLRPYNVAKHRHVPEEQELGKISGKKVNVVFTPHLVPTRGILSTIY ---3333-3333-------22223333--------------------------------- VKTDKSLEEIHEAYLEFYKNEPFVHVLPGIYPSTKWCYGSNHVFIGQEERTNTLILSAID -----------------1111-----------11112222--------1111-------1 NLVKGASGQAVQNNIFGLDETKGLEFTPIYP 111---------------1111--------- >INOSITOL-3-PHOSPHATE SYNT; SWP:Q18664; PDB:1VKOA; KRLIVESPNVKLEDGVLESRFTYRKNHFEHRADGLHVTPKEHDYSFKTVLKPRKTGLLLV 
------1111--%%%%--------------1111-------------------------- GLGGNNGSTAVGSIFANQYAMTWRTKEGHSQANYFGSVTQTATVHLGYDSATQNQIFVPF 1111--------------------1111---------1111--------1111-----11 KDIVPILSPNDLIISGWDISDSNLYEAMGRAKVFEPELQEKLRPFMEPIVPLPSIYYPDF 11-----3333-----------------------3333---11113333-------3333 IASNQGDRANNVIPGDNKLEHLEHIRADIRKFKQEHELECVIVLWTANTERYTDVRQGLN -11111111--------------------------------------------------- ATADEIMESIRVNEDEVSPSNIFAVASILEGAHYINGSPQNTLVPGLIELAERHKVFVGG -----------------3333------1111------------3333----1111----- DDFKSGQTKFKSAFVDFLVSSGMKPESIVSYNHLGNNDGKNLSEARQFRSKEISKSSVVD -----3333-----------------------------------3333--------1111 DMVKSNQILFPDAKNPDYCVVIKYVPYVADSKRAMDEYICSIFMGGKQTFVVHNTCEDSL ---------1111-----------3333--------------iiii----------3333 LASPLIYDLAILTELASRVSYKVDDEYKPFHSVLSILSLLLKAPVVPPGTPISNAFMRQF ---------------1111--------------33331111-----2222----3333-- STLTKLVTALAGFPSDTDMQIEFFTQLPAAK -------------------3333-------- >AGMATINE IMINOHYDROLASE; SWP:Q8GWW7; PDB:1VKPA; RESPAEHGYYPAEWDSHAQTWIGWPERQDNWRHNALPAQRVFAGVAKAISKFEPVTVCAS --3333-----1111-----------1111-%%%%-------------3333-------3 PAQWENARKQLPEDIRVVESNDSWFRDSGPTFIVRKRNRNIAGIDWNFNAWGGANDGCYN 333--------1111--------3333---------------------%%%%-------- DWSHDLLVSRKILALERIPRFQHSILEGGSIHVDGEGTCLVTEECLLNKNRNPHSKEQIE -3333---------------------1111-----------3333--1111--------- EELKKYLGVQSFIWLPRGLYGDEDTNGHIDNCCFARPGVVLLSWTDDETDPQYERSVEAL ------------------22221111-3333----2222-------1111---------- SVLSNSIDARGRKIQVIKLYIPEPLYTEEESSGITQDGEAIPRLAGTRLAASYVNFYIAN -------1111-------------------1111---------2222----3333---22 GGIIAPQFGDPIRDKEAIRVLSDTFPHHSVVGIENAREIVLAGGNIHCITQQQPAEPT 22-------3333-----------1111------33333333--3333---------- >mannitol-specific PTS sys; SWP:P00550; PDB:1VKRA; SHVRKIIVACDAGMGSSAMGAGVLRKKIQDAGLSQISVTNSAINNLPPDVDLVITHRDLT ---------1111-------------------3333-----3333-1111-----3333- ERAMRQVPQAQHISLTNFLDSGLYTSLTERLVAAQRH 
------1111------1111----------------- >ACYL CARRIER PROTEIN; SWP:NA; PDB:1VKUA; HERKKLIAKFVEIASEKGKDLETVDEENTFKELGFDSIDVIDLVFFEDEFALRIEDEEIS ------------------------11113333----------------------333311 KIRKVKDLIDIVIKKLEEID 11-3333------------- >PUTATIVE NITROREDUCTASE; SWP:NA; PDB:1VKWA; HNIFEAIENRHSVRDFLERKPERVKDDIENLLVKFITKKLDWKINLSSFPSYIYAKAEKH --------------------3333------------------------------------ FDELVEYGFQGEQIVLFLTAQGFGTCWARSPHPDVPYIIVFGYPRTRNFTRKRRPITSFL ------------------1111--------------------------------3333-- ENDLEELPPEIVKIVETILAPSALNRQPWKIKYTGGELCISSERPVDLGIALSHAYLTAR --3333-3333-----1111-2222----------------------------------- EIFKREPVIQKRGEDTYCLILNP ----------------------- >S-adenosylmethionine:tRNA; SWP:Q9WZ44; PDB:1VKYA; SEFDYELPPELIAQEPVEPRDASRLMVLHRKTQRIEHRIFREIIEYLEPGDLLVLNVSKV 1111---3333-------1111-------1111-----3333-11112222--------- IPARLYARKASIEILLIERLEEGIWKCLVRPGQKVKKGTELVIDEDLSAVCLGRGEDGTR --------------------2222------3333-2222----1111-------1111-- ILKFQPQDDRLIFEKGTAGLHFTPELIEKLKKKGVQFAEVVLHVHEEFYQVPKETVRKLR ----------------3333---------------------------------------- ETRERGNRIVAVGTTTVRTLETIARLPEQEEYVGKTDLFIYPPFEFKLVDALVTNFHLPR ---------------------3333--------------------------------222 STLLMLVAAFAGKDFVMEAYREAVKRRYRFFSFGDAMLIL 2-----------------------------1111------ >PHOSPHORIBOSYLAMINE--GLYC; SWP:Q9X0X7; PDB:1VKZA; VRVHILGSGGREHAIGWAFAKQGYEVHFYPGNAGTKRDGTNHPYEGEKTLKAIPEEDIVI -------------------1111--------3333---------!!!!3333-------- PGSEEFLVERSNVFGPVKEVARLEGSKVYAKRFKKYGIRTARFEVAETPEELREKIKKFS --1111---3333---33333333---------1111----------------------- PPYVIKADGLARGKGVLILDSKEETIEKGSKLIIGELIKGVKGPVVIDEFLAGNELSAAV ----------%%%%-----------------------2222------------------- VNGRNFVILPFVRDYKRLDGDRGPNTGGGSWGPVEIPSDTIKKIEELFDKTLWGVEKEGY -!!!!-------------%%%%---------------------------------1111- AYRGFLYLGLLHDGDPYILEYNVRLGDPETEVIVTLNPEGFVNAVLEGYRGGKEPVEPRG -----------iiii--------------------------------------------- 
FAVDVVLAARGYPDAPEKGKEITLPEEGLIFFAGVAEKDGKLVTNGGRVLHCGTGETKEE --------2222-------------------------%%%%------------------- ARRKAYELAEKVHFEGKTYRRDIA --------1111-2222------- >DTDP-4-DEHYDRORHAMNOSE RE; SWP:Q97GQ1; PDB:1VL0A; HKILITGANGQLGREIQKQLKGKNVEVIPTDVQDLDITNVLAVNKFFNEKKPNVVINCAA ------1111---------2222-------3333-1111--------------------- HTAVDKCEEQYDLAYKINAIGPKNLAAAAYSVGAEIVQISTDYVFDGEAKEPITEFDEVN --33331111-------------------1111-------1111---------1111--- PQSAYGKTKLEGENFVKALNPKYYIVRTAWLYGDGNNFVKTINLGKTHDELKVVHDQVGT ------------------------------------3333--3333-------------- PTSTVDLARVVLKVIDEKNYGTFHCTCKGICSWYDFAVEIFRLTGIDVKVTPCTTEEFPR -----------------------------------------------------3333--- PAKRPKYSVLRNYLELTTGDITREWKESLKEYIDLLQ -------------1111------3333---------- >6-PHOSPHOGLUCONOLACTONASE; SWP:Q9X0N8; PDB:1VL1A; KTVIYLLEDGYVDFVVEKIRTKMEKLLEEKDKIFVVLAGGRTPLPVYEKLAEQKFPWNRI -------------------------------------------------1111--3333- HFFLSDERYVPLDSDQSNFRNINEVLFSRAKIPSGNVHYVDTSLPIEKACEKYEREIRSA ----------1111------------------3333------------------------ TDQFDLAILGMGPDGHVASIFDLETGNKDNLVTFTDPSGDPKVPRVTLTFRALNTSLYVL -----------1111------3333----------------------------------- FLIRGKEKINRLTEILKDTPLPAYFVRGKEKTVWFVGK --------------1111--3333-------------- >ARGININOSUCCINATE SYNTHAS; SWP:Q9X2A1; PDB:1VL2A; KEKVVLAYSGGLDTSVILKWLCEKGFDVIAYVANVGQKDDFVAIKEKALKTGASKVYVED ---------------------1111----------------------------------- LRREFVTDYIFTALLGNAMYEGRYLLGTAIARPLIAKRQVEIAEKEGAQYVAHGATGKGN -------------1111--------3333------------------------------3 DQVRFELTYAALNPNLKVISPWKDPEFLAKFKTDLINYAMEKGIPIKVSKKRPYSEDENL 333---------3333---1111------------------------------------- MHISHEAGKLEDPAHIPDEDVFTWTVSPKDAPDEETLLEIHFENGIPVKVVNLKDGTEKT ------!!!!-1111--3333-----3333------------iiii-------------- DPLELFEYLNEVGAKNGVGRLDMVENRFIGIKSRGVYETPGATILWIAHRDLEGITMDKE ---------------------------------------------------3333----- 
VMHLRDMLAPKFAELIYNGFWFSPEMEFLLAAFRKAQENVTGKVTVSIYKGNVMPVARYS ------------------------------------2222-------------------- PYSLYNPGGFDATDSKGFINIHALRLKVHQLVKKGYQR --1111-------------------------------- >PMBA-RELATED PROTEIN; SWP:Q9WZI6; PDB:1VL4A; MTFEEFKDRLFALAKKNGVEVQISFLETREFSLRLANGDLDQYTDAGKFNVEIKVLKDGK --------------1111-----------------iiii-----------------iiii TGTFRTQVLENPEKCFEEALSNLQVKKEYFFEGGKEYREMETYVGRFEKLSVKEKMDMAK --------------------3333---------------------3333----------- KAHESAAKDERVVMVPTVMYKDMVIKKIITNTLGLDVESQMDGGFLFAMAIARDANPRSG --------3333------------------1111-------------------------- SWYELARTPEDLNPEEIGKRAAEEAISLIGSKTIPSGKYPVLMRNTALLDLMEMFIPMIS -------3333--------------1111---------------3333--33333333-- AENVQKNLSPLKGKLGEQVGNPAVSIKDLPYHPKGLSSTPFDDEGVPTTEKFVLENGVLK ----------2222------3333-------1111------1111--------------- TFLHNLKTARKEGVEPTGNGFVGGIRPVNLMLMPGEKSFEELLKEMDRGVVITEVEGMHA --------------------2222--------------------------------3333 GANSISGEFSLFAKGYWVENGEIAHGVEDITISGNFLDLLRKIVLVGNDVKVSQHTIAPS ------------------iiii-------------------------------------- VLVEVLDVA --------- >UNKNOWN CONSERVED PROTEIN; SWP:Q9KAF6; PDB:1VL5A; GSDLAKLQIAALKGNEEVLDVATGGGHVANAFAPFVKKVVAFDLTEDILKVARAFIEGNG --3333-3333-----------!!!!-----3333---------------------1111 HQQVEYVQGDAEQPFTDERFHIVTCRIAAHHFPNPASFVSEAYRVLKKGGQLLLVDNSAP ----------------------------1111-3333---------2222---------- ENDAFDVFYNYVEKERDYSHHRAWKKSDWLKLEEAGFELEELHCFHKTFIFEDWCDRNVT ----------------1111------------1111------------------------ TEKKQELSDFIKSKPTEYYQKFKIVVEDGRVYSFRGESILKARKPT ----------11113333--------iiii---------------- >MALATE OXIDOREDUCTASE; SWP:NA; PDB:1VL6A; HVDALEVHRFLKGKIRTALPVEKVDRETLSLLYTPGVADVARACAEDPEKTYVYTSRWNT ----------------------------------3333----------3333---3333- VAVVSDGSAVLGLGNIGPYGALPVEGKAFLFKAFADIDAFPICLSESEEEKIISIVKSLE ---------!!!!----3333-------------------------------------33 PSFGGINLEDIGAPKCFRILQRLSEENIPVFHDDQQGTAVVVSAAFLNALKLTEKKIEEV 
33-----------3333--------------3333--------------------1111- KVVVNGIGAAGYNIVKFLLDLGVKNVVAVDRKGILNENDPETCLNEYHLEIARITNPERL -------3333------------------1111--33331111-------3333-1111- SGDLETALEGADFFIGVSRGNILKPEWIKKSRKPVIFALANPVPEIDPELAREAGAFIVA --3333-2222------------3333------------------------1111----- TGRSDHPNQVNNLLAFPGIKGAVEKRSKITKNLLSAVEAIARSCEPEPERIIPEAFDKVH ---------------3333---------------------------1111---1111--- LNVYTAVKGSA ----------- >HYPOTHETICAL PROTEIN ALR5; SWP:Q8YMA7; PDB:1VL7A; YAGFIQEFQSAIISTISEQGIPNGSYAPFVIDDAKNIYIYVSGLAVHTKNIEANPLVNVL 33331111--------1111-----------1111------1111--------------- FVDDEAKTNQIFARRRLSFDCTATLIERESQKWNQVVDQFQERFGQIIEVLRGLADFRIF ---3333--1111-------------2222--------------3333------------ QLTPKEGRFVIGFGA --------------- >GLUCONATE 5-DEHYDROGENASE; SWP:Q9WYS2; PDB:1VL8A; FDLRGRVALVTGGSRGLGFGIAQGLAEAGCSVVVASRNLEEASEAAQKLTEKYGVETMAF --2222-------------------1111------------------------------- RCDVSNYEEVKKLLEAVKEKFGKLDTVVNAAGINRRHPAEEFPLDEFRQVIEVNLFGTYY --1111-------------------------------3333------------------- VCREAFSLLRESDNPSIINIGSLTVEEVTMPNISAYAASKGGVASLTKALAKEWGRYGIR -------3333----------3333----------------------------3333--- VNVIAPGWYRTKMTEAVFSDPEKLDYMLKRIPLGRTGVPEDLKGVAVFLASEEAKYVTGQ ----------3333-3333-----------1111---3333---------1111------ IIFVDGGWTAN ------3333- >HYDROPEROXIDE RESISTANCE ; SWP:NA; PDB:1VLAA; HQARWIGNFHVRTDSNHDVLDTKEEVGGKDAAPRPLELVLTGLGCTGDVVSILRKKVIDQ ------------1111------3333------------------------------3333 KDFRIEIEYERTEEHPRIFTKVHLKYIFKFDGEPPKDKVEKAVQLSQEKYCSVSAILKCS ----------------------------------------------------3333---- SKVTYEIVYEN ----------- >ALDEHYDE OXIDOREDUCTASE; SWP:Q46509; PDB:1VLBA; MIQKVITVNGIEQNLFVDAEALLSDVLRQQLGLTGVKVGCEQGQCGACSVILDGKVVRAC -------iiii------1111-----------1111---------1111--iiii--333 VTKMKRVADGAQITTIEGVGQPENLHPLQKAWVLHGGAQCGFCSPGFIVSAKGLLDTNAD 3--11112222---3333----------------------1111---------1111--- 
PSREDVRDWFQKHRNACRCTGYKPLVDAVMDAAAVINGKKPETDLEFKMPADGRIWGSKY ----------1111--------------------------3333----------2222-- PRPTAVAKVTGTLDYGADLGLKMPAGTLHLAMVQAKVSHANIKGIDTSEALTMPGVHSVI -1111--1111--------1111-----------------------3333--2222---- THKDVKGKNRITGLITFPTNKGDGWDRPILCDEKVFQYGDCIALVCADSEANARAAAEKV 3333------------1111----------------2222----------------1111 KVDLEELPAYMSGPAAAAEDAIEIHPGTPNVYFEQPIVKGEDTGPIFASADVTVEGDFYV -----------3333--1111---2222-------------------------------- GRQPHMPIEPDVAFAYMGDDGKCYIHSKSIGVHLHLYMIAPGVGLEPDQLVLVANPMGGT -----------------1111------------------------1111----------- FGYKFSPTSEALVAVAAMATGRPVHLRYNYQQQQQYTGKRSPWEMNVKFAAKKDGTLLAM -1111----------------------------------------------1111----- ESDWLVDHGPYSEFGDLLTLRGAQFIGAGYNIPNIRGLGRTVATNHVWGSAFRGYGAPQS -----------2222----3333-2222---------------------------3333- MFASECLMDMLAEKLGMDPLELRYKNAYRPGDTNPTGQEPEVFSLPDMIDQLRPKYQAAL ----------------------------2222-1111----------------------- EKAQKESTATHKKGVGISIGVYGSGLDGPDASEAWAELNADGTITVHTAWEDHGQGADIG -------1111---------------------------1111------------------ CVGTAHEALRPMGVAPEKIKFTWPNTATTPNSGPSGGSRQQVMTGNAIRVACENLLKACE --------3333--3333------3333-------%%%%--------------------- KPGGGYYTYDELKAADKPTKITGNWTASGATHCDAVTGLGKPFVVYMYGVFMAEVTVDVA 2222--------1111----------1111------------------------------ TGQTTVDGMTLMADLGSLCNQLATDGQIYGGLAQGIGLALSEDFEDIKKHATLVGAGFPF ---------------------------------------------1111--3333----3 IKQIPDKLDIVYVNHPRPDGPFGASGVGELPLTSPHAAIINAIKSATGVRIYRLPAYPEK 333-------------1111iiii--1111---3333----------------------- VLEALKA ------- >3-ISOPROPYLMALATE DEHYDRO; SWP:Q9WZ26; PDB:1VLCA; HMKIAVLPGDGIGPEVVREALKVLEVVEKKTGKFEKVFGHIGGDAIDRFGEPLPEETKKI ---------!!!!----------------------------------------------- CLEADAIFLGSVGGPKWDDLPPEKRPEIGGLLALRKMLNLYANIRPIKVYRSLVHVSPLK 1111---------3333----1111------------------------11111111--3 EKVIGSGVDLVTVRELSYGVYYGQPRGLDEEKGFDTMIYDRKTVERIARTAFEIAKNRRK 
333!!!!-----------3333------1111---------------------------- KVTSVDKANVLYSSMLWRKVVNEVAREYPDVELTHIYVDNAAMQLILKPSQFDVILTTNM ------1111-------------33331111-----------3333-------------- FGDILSDESAALPGSLGLLPSASFGDKNLYEPAGGSAPDIAGKNIANPIAQILSLAMMLE ----------33333333------------------3333-------------------- HSFGMVEEARKIERAVELVIEEGYRTRDIAEDPEKAVSTSQMGDLICKKLEEIW -------------------3333--3333--1111------------------- >FERRITIN; SWP:Q9X0L2; PDB:1VLGA; MMVISEKVRKALNDQLNREIYSSYLYLSMATYFDAEGFKGFAHWMKKQAQEELTHAMKFY ------------------------------------------------------------ EYIYERGGRVELEAIEKPPSNWNGIKDAFEAALKHEEFVTQSIYNILELASEEKDHATVS ---1111-------------------------------------------1111------ FLKWFVDEQVEEEDQVREILDLLEKANGQMSVIFQLDRYLGQRE -------------------------iiii3333------1111- >PHOSPHOPANTETHEINE ADENYL; SWP:Q9WZK0; PDB:1VLHA; MKAVYPGSFDPITLGHVDIIKRALSIFDELVVLVTENPRKKCMFTLEERKKLIEEVLSDL ------------------------------------1111-------------------- DGVKVDVHHGLLVDYLKKHGIKVLVRGLRAVTDYEYELQMALANKKLYSDLETVFLIASE ----------3333--------------1111---------------1111-------33 KFSFISSSLVKEVALYGGDVTEWVPPEVARALNEKLK 33-----------1111--1111-------------- >Spore coat polysaccharide; SWP:P39625; PDB:1VLIA; AAFQIANKTVGKDAPVFIIAEAGINHDGKLDQAFALIDAAAEAGADAVKFQFQADRYQKD ----!!!!--2222-----------%%%%-----------------------3333---- PDVSIFSLVQSEPAEWILPLLDYCREKQVIFLSTVCDEGSADLLQSTSPSAFKIASYEIN ------------1111--------1111----------------1111------3333-- HLPLLKYVARLNRPIFSTAGAEISDVHEAWRTIRAEGNNQIAIHCVAKYPAPPEYSNLSV --------1111-----2222-----------3333---------------3333----- IPLAAAFPEAVIGFSDHSEHPTEAPCAAVRLGAKLIEKHFTIDKNLPGADHSFALNPDEL ------1111-----------3333---1111----------1111----1111------ KEVDGIRKTEAELKQGITKPVSEKLLGSSYKTTTAIEGEIRNFAYRGIFTTAPIQKGEAF ------------1111-----3333--------1111-3333------------2222-- SEDNIAVLRPGQKPQGLHPRFFELLTSGVRAVRDIPADTGIVWDDILLKD 1111-----!!!!----3333--------------------3333----- >NADH-DEPENDENT BUTANOL DE; SWP:NA; PDB:1VLJA; 
HMENFVFHNPTKIVFGRGTIPKIGEEIKNAGIRKVLFLYGGGSIKKNGVYDQVVDSLKKH ---------------22223333----1111----------3333------------111 GIEWVEVSGVKPNPVLSKVHEAVEVAKKEKVEAVLGVGGGSVVDSAKAVAAGALYEGDIW 1-------------------------------------3333---------1111--333 DAFIGKYQIEKALPIFDVLTISATGTEMNGNAVITNEKTKEKYGVSSKALYPKVSIIDPS 3-----------------------3333--------1111------3333---------1 VQFTLPKEQTVYGAVDAISHILEYYFDGSSPEISNEIAEGTIRTIMKMTERLIEKPDDYE 111--------------------1111---------------------------1111-- ARANLAWSATIALNGTMAVGRRGGEWACHRIEHSLSALYDIAHGAGLAIVFPAWMKYVYR --------------1111---------------------------------------333 KNPAQFERFAKKIFGFEGEGEELILKGIEAFKNWLKKVGAPVSLKDAGIPEEDIDKIVDN 3-----------------------------------------3333---3333------- VMLLVEKNLKPKGASLGRIMVLEREDVREILKLAAK --------3333------------------------ >VIRAL INTERLEUKIN-10; SWP:P03180; PDB:1VLK; CDNFPQMLRDLRDAFSRVKTFFQTKDEVDNLLLKESLLEDFKGYLGCQALSEMIQFYLEE -----3333--------3333--------------------------------------- VMPQAENQDPEAKDHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQIKNAFNKLQEKG --------3333--------------------11113333-------------------- IYKAMSEFDIFINYIEAYMTIK ---------------------- >SAM-DEPENDENT METHYLTRANS; SWP:Q9X119; PDB:1VLMA; HWHIFERFVNEYERWFLVHRFAYLSELQAVKCLLPEGRGVEIGVGTGRFAVPLKIKIGVE -3333-------3333--------------1111---------!!!!---1111------ PSERAEIARKRGVFVLKGTAENLPLKDESFDFALVTTICFVDDPERALKEAYRILKKGGY --------1111------3333--------------3333---------------2222- LIVGIVDRESFLGREYEKNKEKVFYKNARFFSTEELDLRKAGFEEFKVVQTLFKHPSELS ------------------3333--1111---3333---1111------------3333-- EIEPVKEGYGEGAFVVIRGTKK ---------------------- >AMINOMETHYLTRANSFERASE; SWP:P27248; PDB:1VLOA; QTPLYEQHTLCGARVDFHGWPLHYGSQIDEHHAVRTDAGFDVSHTIVDLRGSRTREFLRY -1111---1111----iiii------------------------------1111------ LLANDVAKLTKSGKALYSGLNASGGVIDDLIVYYFTEDFFRLVVNSATREKDLSWITQHA ----3333--2222------1111-----------1111-----3333-----------3 EPFGIEITVRDDLSIAVQGPNAQAKAATLFNDAQRQAVEGKPFFGVQAGDLFIATTGYTG 
333------1111-----1111----1111-----------------!!!!--------- EAGYEIALPNEKAADFWRALVEAGVKPCGLGARDTLRLEAGNLYGQEDETISPLAANGWT --------3333------------------------------2222-11113333--111 IAWEPADRDFIGREALEVQREHGTEKLVGLVTEKGVLRNELPVRFTDAQGNQHEGIITSG 1---1111-2222------------------------2222-----1111---------- TFSPTLGYSIALARVPEGIGETAIVQIRNREPVKVTKPVFVRNGKAVAGLC --------------------------%%%%-----------iiii-3333- >NICOTINATE PHOSPHORIBOSYL; SWP:P39683; PDB:1VLPA; HSEPVIKSLLDTDYKITHAAVFTNFPDVTVTYKYTNRSSQLTFNKEAINWLKEQFSYLGN -------1111--3333-------1111---------1111--------------3333- LRFTEEEIEYLKQEIPYLPSAYIKYISSSNYKLHPEEQISFTSEEIEGKPTHYKLKILVS --------------1111---------3333--3333--------2222----------- GSWKDTILYEIPLLSLISEAYFKFVDIDWDYENQLEQAEKKAETLFDNGIRFSEFGTRRR -33333333---------------------2222-----------1111------3333- RSLKAQDLIQGIKAVNGNPDRNKSLLLGTSNILFAKKYGVKPIGTVAHEWVGVASISEDY -------------1111----1111---------------------3333--3333--33 LHANKNADCWINTFGAKNAGLALTDTFGTDDFLKSFRPPYSDAYVGVRQDSGDPVEYTKK 33-3333-------3333------1111---3333---3333------------------ ISHHYHDVLKLPKFSKIICYSDSLNVEKAITYSHAAKENGLATFGIGTNFTNDFRKKSEP -----------2222-------------------------------3333-----3333- QVKSEPLNIVIKLLEVNGNHAIKISDNLGKNGDPATVKRVKEELGYT ---------------iiii-------1111----------------- >ACETYL XYLAN ESTERASE; SWP:Q9WXT2; PDB:1VLQA; AFFDLPLEELKKYRPERYEEKDFDEFWEETLAESEKFPLDPVFERESHLKTVEAYDVTFS -----33331111------1111------------------------------------- GYRGQRIKGWLLVPKLEEEKLPCVVQYIGYNGGRGFPHDWLFWPSGYICFVDTRGQGSGW -iiii-----------------------2222---3333--3333-------2222---- LKGDTPDYPEGPVDPQYPGFTRGILDPRTYYYRRVFTDAVRAVEAAASFPQVDQERIVIA --------------------2222-3333-------------------11111111---- GGSQGGGIALAVSALSKKAKALLCDVPFLCHFRRAVQLVDTHPYAEITNFLKTHRDKEEI ------------------------------------------3333-------1111--- VFRTLSYFDGVNFAARAKIPALFSVGLDNICPPSTVFAAYNYYAGPKEIRIYPYNNHEGG ---3333------------------------3333--------------------3333- GSFQAVEQVKFLKKLFE ----------------- >MRNA DECAPPING 
ENZYME; SWP:Q9DAR7; PDB:1VLRA; PVRLPFSGFRVQKVLRESARDKIIFLHGKVNEGEDAVVILEKTPFQVEHVAQLLTGSPEL ------------------1111------------------------------1111---- KLQFSNDIYSTYNLFPPRHLSDIKTTVVYPATEKHLQKYMRQDLRLIRETGDDYRTITLP ----------------1111---------------------------------------- YLESQSLSIQWVYNILDKDRIVFENPDPSDGFVLIPDLKWNQQQLDDLYLIAICHRRGIR --------3333------------------------1111-------------------- SLRDLTPEHLPLLRNILREGQEAILKRYQVTGDRLRVYLHYLPSYYHLHVHFTALGFEAP 3333-3333---------------------1111-------------------1111--- GSGVERAHLLAQVIENLECDPKHYQQRTLTFALRTDDPLLQLLQKAQQER --1111-------------1111----------1111------------- >ASPARTATE RECEPTOR; SWP:P02941; PDB:1VLS; MNQQGFVISNELRQQQSELTSTWDLMLQTRINLSRSAARMMMDASNQQSSAKTDLLQNAK 3333--------------------------------1111------2222---------- TTLAQAAAHYANFKNMTPLPAMAEASANVDEKYQRYQAALAELIQFLDNGNMDAYFAQPT ------------3333--3333------------------------------------33 QGMQNALGEALGNYARVSENLYRQTF 33------------------------ >GAMMA-GLUTAMYL PHOSPHATE ; SWP:P54885; PDB:1VLUA; HSSSQQIAKNARKAGNILKTISNEGRSDILYKIHDALKANAHAIEEANKIDLAVAKETGL ----------------3333---------------------------------------- ADSLLKRLDLFKGDKFEVLQGIKDVAELEDPVGKVKARELDDGLTLYQVTAPVGVLLVIF 3333--------------------1111------------2222---------------- ESRPEVIANITALSIKSGNAAILKGGKESVNTFREAKIVNDTIAQFQSETGVPVGSVQLI --3333--------1111-------3333-----------------------2222---- ETRVSDLLDQDEYIDLVVPRGSNALVRKIKDTTKIPVLGHADGICSIYLDEDADLIKAKR ---3333--3333-----------------------2222---------1111------- ISLDAKTNCNAETLLINPKFSKWWEVLENLTLEGGVTIHATKDLKTAYFDKLNELGKLTE ----------------1111--------------------3333--------1111--33 AIQCKTVSLDLAAKFVTSTESAIQHINTHSSRHTDAIVTENKANAEKFKGVDSSGVYWNA 33-----------------------1111------------------------------- STRFADVGLDGLVSYQYQIRGDGQVASDY ----------------------------- >ORNITHINE CARBAMOYLTRANSF; SWP:P96108; PDB:1VLVA; HMSVNLKGRSLLTLLDFSPEEIRYLLDISKQVKMENRSKLRTERFKGMTLAMIFEKRSTR -----2222---3333--------------------------1111-------------- 
TRLAFETAFAEEGGHPIFLSPNDIHLGAKESLEDTARVLGRMVDAIMFRGYKQETVEKLA ---------1111------3333-3333----------1111---------3333----- EYSGVPVYNGLTDEFHPTQALADLMTIEENFGRLKGVKVVFMGDTRNNVATSLMIACAKM -----------33333333--------------3333------1111----------111 GMNFVACGPEELKPRSDVFKRCQEIVKETDGSVSFTSNLEEALAGADVVYTDVWARMALL 1-------3333------------3333---------3333-2222---------33333 KPYQVNERVMEMTGKSETIFMHCLPAVKGQEVTYEVIEGKQSRVWDEAENRKHTIKAVMI 333-------33331111--------2222---3333-1111------------------ ATLL ---- >UNKNOWN PROTEIN FROM 2D-P; SWP:P39179; PDB:1VLYA; FTPFPPRQPTASARLPLTLTLDDWALATITGADSEKYQGQVTADVSQAEDQHLLAAHCDA ----------3333------1111------1111---------11111111-------11 KGKWSNLRLFRDGDGFAWIERRSVREPQLTELKKYAVFSKVTIAPDDERVLLGVAGFQAR 11---------!!!!-----3333--------1111------------------------ AALANLFSELPSKEKQVVKEGATTLLWFEHPAERFLIVTDEATANLTDKLRGEAELNNSQ --3333-----1111----!!!!----------------3333----1111--------- QWLALNIEAGFPVIDAANSGQFIPQATNLQALGGISFKKGCYTGQEVARAKFRGANKRAL --------------3333----3333-3333----------2222-3333-2222----- WLLAGSASRLPEAGEDLELKGENWRRTGTVLAAVKLEDGQVVVQVVNNDEPDSIFRVRDD -----------2222--------------------1111----------1111---2222 ANTLHIEPLPYSLE -------------- >UNKNOWN PROTEIN; SWP:NA; PDB:1VM0A; KKNRIQVSNTKKPLFFYVNLAKRYQQYNDVELSALGAIATVVTVTEILKNNGFAVEKKIT 1111----11113333--------------------3333--------1111-------- SIVDIKPVQKAKIEITLVKSEKFDELAAA -------------------1111------ >DIHYDRODIPICOLINATE REDUC; SWP:Q9X1K8; PDB:1VM6A; HMKYGIVGYSGRMGQEIQKVFSEKGHELVLKVDVNGVEELDSPDVVIDFSSPEALPKTVD -------1111----------1111-------1111--------------3333------ LCKKYRAGLVLGTTALKEEHLQMLRELSKEVPVVQAYNFSIGINVLKRFLSELVKVLEDW ----------------3333------3333-----------------------3333--- DVEIVETHHRFKKDAPSGTAILLESALGKSVPIHSLRVGGVPGDHVVVFGNIGETIEIKH --------1111------------1111----------------------1111------ RAISRTVFAIGALKAAEFLVGKDPGMYSFEEVIFG ---3333-----------2222-----3333---- >RIBOKINASE; SWP:Q9X055; PDB:1VM7A; MFLVISVVGSSNIDIVLKVDHFTKPGETQKAIEMNVFPGGKGANQAVTVAKIGEKGCRFV 
-----------------------2222--------------------------------- TCIGNDDYSDLLIENYEKLGITGYIRVSLPTGRAFIEVDKTGQNRIIIFPGANAELKKEL --------------------------------------1111--------3333--3333 IDWNTLSESDILLLQNEIPFETTLECAKRFNGIVIFDPAPAQGINEEIFQYLDYLTPNEK -33331111---------3333----------------------33331111-------- EIEALSKDFFGEFLTVEKAAEKFLELGVKNVIVKLGDKGVLLVNKNEKKHFPTFKVKAVD ----------------------3333---------1111----1111------------- TTAAGDVFNGAFAVALSEGKNPEEAVIFGTAAAAISVTRLGAQSSIPAREEVEAFLKNL 2222-----------1111----------------1111--3333--3333-------- >UDP-N-ACTEYLGLUCOSAMINE P; SWP:NA; PDB:1VM8A; NVNDLKQRLSQAGQEHLLQFWNELSEAQQELYELQANFEELNSFFRKAIGEFDRSSHQEK 3333-----------11113333-3333-------------------------3333--3 VDAREPVPRQVLGSATRDQEQLQAWESEGLSQISQNKVAVLLLAGGQGTRLGVSYPKGYD 333----3333---------------------1111------------1111-------- VGLPSHKTLFQIQAERILKLQQLAEKHHGNKCTIPWYITSGRTESTKEFFTKHKFFGLKK --1111------------------1111-----------3333---------------33 ENVVFFQQGLPASFDGKIILEEKNKVSAPDGNGGLYRALAAQNIVEDEQRGICSIHVYCV 33----------1111-------------------------------3333--------- DNILVKVADPRFIGFCIQKGADCGAKVVEKTNPTEPVGVVCRVDGVYQVVEYSEISLATA -1111--------------------------1111-------%%%%----1111------ QRRSSDGRLLFNAGNIANHFFTVPFLKDVVNVYEPQLQHHVAQKKIPYVDSQGYFIKPDK ---1111-------------------------3333------------------------ PNGIKEKFVFDIFQFAKKFVVYEVLREDEFSPLKNADSQGKDNPTTARHALSLHHCWVLN -------11113333---------3333-------------------------------- AGGHFIDENGSRLPAIPVPIQCEISPLISYAGEGLEGYVADKEFHAPLIIDENGVHEL ------1111--------------3333------3333-------------------- >TOLUENE-4-MONOOXYGENASE S; SWP:Q00458; PDB:1VM9A; SFEKICSLDDIWVGEMETFETSDGTEVLIVNSEEHGVKAYQAMCPHQEILLSEGSYEGGV ------3333-2222-----1111-------------------------3333---iiii ITCRAHLWTFNDGTGHGINPDDAALAEYPVEVKGDDIYVSTKGILPNKA --------------------------------!!!!----2222----- >CELL DIVISION PROTEIN FTS; SWP:Q9WZ40; PDB:1VMAA; MGLFDFLKKGLQKTKETFFGRVVKLLKGKKLDDETREELEELLIQADVGVETTEYILERL 
-------------------------2222------------------------------1 EEKDGDALESLKEIILEILNFDTKLNVPPEPPFVIMVVGVNGTGKTTSCGKLAKMFVDEG 111-------------1111-------------------2222-------------1111 KSVVLAAADTFRAAAIEQLKIWGERVGATVISHSEGADPAAVAFDAVAHALARNKDVVII --------1111---------------------2222----------------------- DTAGRLHTKKNLMEELRKVHRVVKKKIPDAPHETLLVIDATTGQNGLVQAKIFKEAVNVT ---------------------3333-1111--------3333------------------ GIILTKLDGTAKGGITLAIARELGIPIKFIGVGEKAEDLRPFDPEAFVEVLLSE -----3333-------------------------1111---------------- >30S RIBOSOMAL PROTEIN S6; SWP:Q9WZ72; PDB:1VMBA; KERIYESMFIIAPNVPEEERENLVERVKKIIEERVKGKIDKVERMGMRKFAYEIKKFNEG -----------1111--------------------------------------%%%%--- DYTVIYFRCDGQNLQELENFYRVTPEIIRWQTFRRFDLEKKERKAQR -----------------------3333-------------------- >METHYLGLYOXAL SYNTHASE; SWP:Q9X0R7; PDB:1VMDA; PRRYKIFMDKKKRIALIAHDRRKRDLLEWVSFNLGTLSKHELYATGTTGALLQEKLGLKV ------------------1111-----------3333----------------------- HRLKSGPLGGDQQIGAMIAEGKIDVLIFFWDPLEPQAHDVDVKALIRIATVYNIPVAITR ----1111---------1111--------------1111----------1111------- STADFLISSPLMNDVYEKIQIDYEEELERRIRKVVE -----11113333----------------------- >FLAVOPROTEIN; SWP:NA; PDB:1VMEA; HPKIWTERIFDDPEIYVLRIDDDRIRYFEAVWEIPEGISYNAYLVKLNGANVLIDGWKGN --------------------------2222---1111---------2222-------333 YAKEFIDALSKIVDPKEITHIIVNHTEPDHSGSLPATLKTIGHDVEIIASNFGKRLLEGF 3-------3333-3333---------3333------------------------------ YGIKDVTVVKDGEEREIGGKKFKFVTPWLHWPDTVTYLDGILFSCDVGGGYLLPEILDDS ---------2222---iiii-----------------iiii--!!!!------------- NESVVERYLPHVTKYIVTVIGHYKNYILEGAEKLSSLKIKALLPGHGLIWKKDPQRLLNH -------------------3333----------3333----------------------- YVSVAKGDPKKGKVTVIYDSYGFVENVKKAIDSLKEKGFTPVVYKFSDEERPAISEILKD ---------2222--------3333---------1111--------------3333---- IPDSEALIFGVSTYEAEIHPLRFTLLEIIDKANYEKPVLVFGVHGWAPSARTAGELLKET 1111---------!!!!---------------------------------------1111 KFRILSFTEIKGSNDERKIEEAISLLKKELE 
--------------3333------------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:1VMFA; KTFHLTTQSRDEMVDITSQIETWIRETGVTNGVAIVSSLHTTAGITVNENADPDVKRDMI ------------------------------------------------------------ MRLDEVYPWHHENDRHMEGNTAAHLKTSTVGHAQTLIISEGRLVLGTWQGVYFCEFDGPR ----------1111-33333333---------------%%%%---1111----------- TNRKFVVKLLTD ------------ ------------------------------------------------------------ -------------------- >Uncharacterized conserved; SWP:Q97KL0; PDB:1VMHA; VIEYSLKTSNDDQFIDITNLVKKAVDESGVSDGMAVVFCPHTTAGITINENADPDVTRDI -------------------------3333-----------1111---------------- LVNLDKVFPKVGDYKHVEGNSHAHIKASLMGSSQQIIIENGKLKLGTWQGIYFTEFDGPR ---------------33333333---------------%%%%------------------ DRKVFVKII --------- >PUTATIVE PHOSPHATE ACETYL; SWP:NA; PDB:1VMIA; RCRELALRAPARVVFPDALDQRVLKAAQYLHQQGLATPILVANPFELRQFALSHGVADGL --3333------------------------------------3333-------------- QVIDPHGNLAREEFAHRWLARAGEKTPPDALEKLTDPLFAAAVSAGKADVCIAGNLSSTA ---1111---------------1111--3333----------1111-------------- NVLRAGLRIIGLQPGCKTLSSIFLLPQYSGPALGFADCSVVPQPTAAQLADIALASAETW ------------2222-------------------------------------------- RAITGEEPRVALSFSSNGSARHPCVANVQQATEIVRERAPKLVVDGELQFDAAFVPEVAA --------------------------------------1111------3333-------- QKAPASPLQGKANVVFPSLEAGNIGYKIAQRLGGYRAVGPLIQGLAAPHDLSRGCSVQEI --1111-%%%%----------------------------------------1111----- IELALVAAVPR -----1111-- >HYPOTHETICAL PROTEIN TM07; SWP:Q9WZI2; PDB:1VMJA; MKSYRKELWFHTKRRREFINITPLLEECVRESGIKEGLLLCNAMHITASVFINDDEPGLH ------------------------------------------------------------ HDFEVWLEKLAPEKPYSQYKHNDTGEDNADAHLKRTIMGREVVIAITDRKMDLGPWEQVF --------------33333333-----3333---------------%%%%---1111--- YGEFDGMRPKRVLVKIIGE ------------------- >PURINE NUCLEOSIDE PHOSPHO; SWP:Q9X1T2; PDB:1VMKA; MMKKIEEARTFISERTNLSPDILIILGGPFIEKVEDPVIIDYKDIPHFPGKLVFGRISDK -----------1111--------------3333-------33332222--------iiii PVMIMAGRFHLYEGHDPATVAFPVYLAKYVGVKGVVVTNAAGAINPEFKPGEIILVRDII 
---------3333--3333-------------------------33332222-------- NFMFRNPLRGPNDEKIGPRFPDMSSVVDPEWARKIQERLSLKEGVYIGVLGPSYETPAEI -----1111---3333-------------------------------------------- RVFEKLGADLVGMSTVPEVIAAKHCGLKVVVFSCVTNMAAGISHEEVVRTTKMAQGKIEK ---1111-------------------------------2222------------------ ALTTAVEVF -----1111 >VITELLINE MEMBRANE OUTER ; SWP:P41366; PDB:1VMOA; RTREYTSVITVPNGGHWGKWGIRQFCHSGYANGFALKVEPSQFGRDDTALNGIRLRCLDG ------------------------------------------------------------ SVIESLVGKWGTWTSFLVCPTGYLVSFSLRSEKSQGGGDDTAANNIQFRCSDEAVLVGDG -------------------------------------------------1111------- LSWGRFGPWSKRCKICGLQTKVESPQGLRDDTALNNVRFFCCK -----------------------1111---------------- >NEUROTOXIN; SWP:P01492; PDB:1VNA; KEGYLVKKSDGCKYDCFWLGKNEHCNTECKAKNQGGSYGYCYAFACWCEGLPESTPTYPL ----------------------3333----1111-----------------1111----- PNKSC ----- >VANADIUM CHLOROPEROXIDASE; SWP:P49053; PDB:1VNS; VTPIPLPKIDEPEEYNTNYILFWNHVGLELNRVTHTVGGPLTGPPLSARALGMLHLAIHD -----------3333-----------------1111------------------------ AYFSICPPTDFTTFLSPDTENAAYRLPSPNGANDARQAVAGAALKMLSSLYMKPVEQPNP ---------------1111-3333--------------------------------3333 NPGANISDNAYAQLGLVLDRSVLEAPGGVDRESASFMFGEDVADVFFALLNDPRGASQEG ------------------------2222-1111------------------1111--222 YHPTPGRYKFDDEPTHPVVLIPVDPNNPNGPKMPFRQYHAPFYGKTTKRFATQSEHFLAD 2----2222---1111-------1111------------1111----------------- PPGLRSNADETAEYDDAVRVAIAMGGAQALNSTKRSPWQTAQGLYWAYDGSNLIGTPPRF 22221111------------------1111-------------1111---2222-3333- YNQIVRRIAVTYKKEEDLANSEVNNADFARLFALVDVACTDAGIFSWKEKWEFEFWRPLS ------------------------------------------------------------ GVRDDGRPDHGDPFWLTLGAPATNTNDIPFKPPFPAYPSGHATFGGAVFQMVRRYYNGRV ------1111---------------------------------------------2222- GTWKDDEPDNIAIDMMISEELNGVNRDLRQPYDPTAPIEDQPGIVRTRIVRHFDSAWELM ---1111----------3333-----------11113333-------------------- FENAISRIFLGVHWRFDAAAARDILIPTTTKDVYAVDNNGATVFQNVEDIRYTTRGTRED 
-------1111--1111--3333------2222---1111-----3333--------111 EEGLFPIGGVPLGIEIADEIFNNGLKPTPPEIQP 1-------------------1111----3333-- >Putative Xanthosine triph; SWP:Q9WY06; PDB:1VP2A; KLTVYLATTNPHKVEEIKMIAPEWMEILPSPEKIEVVEDGETFLENSVKKAVVYGKKLKH -----------------11111111----------------------------------- PVMADDSGLVIYSLGGFPGVMSARFMEEHSYKEKMRTILKMLEGKDRRAAFVCSATFFDP -----------1111--!!!!----1111----------1111----------------- VENTLISVEDRVEGRIANEIRGTGGFGYDPFFIPDGYDKTFGEIPHLKEKISHRSKAFRK -------------------------!!!!----2222--3333--3333----------- LFSVLEKIL --------- >AMINOTRANSFERASE, PUTATIV; SWP:NA; PDB:1VP4A; HVVNLEGKISKIGQNKSSIIREILKFAADKDAISFGGGVPDPETFPRKELAEIAKEIIEK -----11113333----------1111-1111--------1111---------------- EYHYTLQYSTTEGDPVLKQQILKLLERYGITGLDEDNLIFTVGSQQALDLIGKLFLDDES 3333-----3333--------------------1111-------------------1111 YCVLDDPAYLGAINAFRQYLANFVVVPLEDDGDLNVLERKLSEFDKNGKIKQVKFIYVVS ----------------1111--------1111------------11113333-------- NFHNPAGVTTSLEKRKALVEIAEKYDLFIVEDDPYGALRYEGETVDPIFKIGGPERVVLL ----------------------------------3333--------3333--3333---- NTFSKVLAPGLRIGVAGSKEFIRKIVQAKQSADLCSPAITHRLAARYLERYDLLEQLKPT -------3333-------------------------3333-----------3333----- IELYRRKRTVLNALEEYFSDIPGVKWVKSEGGLFIWLTLPEGFDTWEFEYAKRKKVFYVP --------------------2222---------------2222----------------- GRVFKVYDEPSPSRLSFCLPPDEKIVEGIKRLREVVLEYGKEKHLL ----1111------------3333---------------------- >2,5-DIKETO-D-GLUCONIC ACI; SWP:Q9X0A2; PDB:1VP5A; QVPKVTLNNGVEMPILGYGVFQIPPEKTEECVYEAIKVGYRLIDTAASYMNEEGVGRAIK ------1111---------22221111------------------3333----------- RAIDEGIVRREELFVTTKLWVSDVGYESTKKAFEKSLKKLQLEYIDLYLIHQPFGDVHCA --------3333-------1111--3333------------------------------- WKAMEEMYKDGLVRAIGVSNFYPDRLMDLMVHHEIVPAVNQIEIHPFYQRQEEIEFMRNY --------------------------------------------1111------------ NIQPEAWGPFAEGRKNIFQNGVLRSIAEKYGKTVAQVILRWLTQKGIVAIPKTVRRERMK -------1111--%%%%-------------------------1111-------------- 
ENISIFDFELTQEDMEKIATLDEGQSAFFSHRDPEVVKWICSLK ------------------1111-------1111------1111- >hypothetical protein, sim; SWP:Q98GN8; PDB:1VP6A; VRRGDFVRNWQLVAAVPLFQKLGPAVLVEIVRALRARTVPAGAVICRIGEPGDRMFFVVE ---------------3333--------------------2222---2222---------- GSVSVATPNPVELGPGAFFGEMALISGEPRSATVSAATTVSLLSLHSADFQMLCSSSPEI -------------2222---3333------------------------------------ AEIFRKTALERRG ------------- >EXODEOXYRIBONUCLEASE VII ; SWP:Q7W7Q2; PDB:1VP7A; ARPLPQDFETALAELESLVSAENGTLPLEQSLSAYRRGVELARVCQDRLAQAEQQVKVLE ------------------------------------------------------------ GDLLRPL ---3333 >HYPOTHETICAL PROTEIN AF01; SWP:O30133; PDB:1VP8A; HEKKIVYFNKPGRENTEETLRLAVERAKELGIKHLVVASSYGDTAKALEAEGLEVVVVTY -----------3333--------------------------33333333iiii------- HTGFVREGENTPPEVEEELRKRGAKIVRQSHILSGLERSISRKLGGVSRTEAIAEALRSL 2222-2222--3333----1111-------11113333---------------------- FGHGLKVCVEITIAADSGAIPIEEVVAVGGRSRGADTAVVIRPAHNNFFDAEIKEIICPR --------------1111----------------------------1111---------- NKR --- >2-C-methyl-D-erythritol 4; SWP:Q9X1B3; PDB:1VPAA; HMNVAILLAAGKGERMSENVPKQFLEIEGRMLFEYPLSTFLKSEAIDGVVIVTRREWFEV ------------3333----3333--%%%%3333--------3333-------3333-33 VEKRVFHEKVLGIVEGGDTRSQSVRSALEFLEKFSPSYVLVHDSARPFLRKKHVSEVLRR 33----3333--------------------3333---------------3333------- ARETGAATLALKNSDALVRVENDRIEYIPRKGVYRILTPQAFSYEILKKAHENGGEWADD -----------------------------2222---------------1111-------- TEPVQKLGVKIALVEGDPLCFKVTFKEDLELARIIAREWE ----1111--------3333----3333------3333-- >PUTATIVE MODULATOR OF DNA; SWP:Q8A1L2; PDB:1VPBA; MITDENKKLAQWAMDYALKNGCQAAKVLLYSSSNTSFELRDMDRLQQASEGGLSLSLYVD ----------------------------------------------------------ii GRYGSISTNRLNRKELETFIKNGIDSTRYLAKDEARVLADPSRYYKGGKPDLKLYDAKFA ii------------------------3333--1111---3333------------3333- SLNPDDKIEMAKAVAEEALGKDERIISVGSSYGDGEDFAYRLISNGFEGETKSTWYSLSA --3333-----------22223333----------------------------------- 
DITIRGEGEARPSAYWYESSLYMNDLIKKGIGQKALERVLRKLGQKKVQSGKYTMVVDPM ------!!!!-----------3333--------------1111--------------333 NSSRLLSPMISALNGSALQQKNSFLLNKLNEKIASDRLTLTDEPHLVKASGARYFDNEGI 33333------------1111-1111-2222---1111----------2222---1111- ATERRSIFDKGVLNTYFIDTYNAKKMGVDPTISGSSILVMETGDKNLDGLIAGVEKGILV --------iiii------------------------------------------------ TGFNGGNNNSSTGDFSYGIEGFLIENGKLTQPVSEMNVTGNLITLWNSLVATGNDPRLNS ------------------------iiii----------------------------1111 SWRIPSLVFEGVDFSGL ----------------- >TARTRONATE SEMIALDEHYDE R; SWP:Q8ZLV8; PDB:1VPDA; KVGFIGLGIGKPSKNLLKAGYSLVVSDRNPEAIADVIAAGAETASTAKAIAEQCDVIITL ----------------1111----------------1111-------------------- PNSPHVKEVALGENGIIEGAKPGTVLIDSSIAPLASREISDALKAKGVELDAPVSGGEPK --------------3333--2222-------3333------------------------- AIDGTLSVVGGDKAIFDKYYDLKAAGSVVHTGDIGAGNVTKLANQVIVALNIAASEALTL --------------------------------2222------------------------ ATKAGVNPDLVYQAIRGGLAGSTVLDAKAPVDRNFKPGFRIDLHIKDLANALDTSHGVGA -1111---------1111------------------------------------------ QLPLTAAVEQALRADGHGNDDHSALACYYEKLAKVEVTR ------------11111111------------------- >PHOSPHOGLYCERATE KINASE; SWP:P36204; PDB:1VPE; EKMTIRDVDLKGKRVIMRVDFNVPVKDGVVQDDTRIRAALPTIKYALEQGAKVILLSHLG ---3333--2222------------%%%%------------------------------- RPKGEPSPEFSLAPVAKRLSELLGKEVKFVPAVVGDEVKKAVEELKEGEVLLLENTRFHP ------3333-------------------------------11112222-----111133 GETKNDPELAKFWASLADIHVNDAFGTAHRAHASNVGIAQFIPSVAGFLMEKEIKFLSKV 33-----------1111------3333----3333--1111------------------- TYNPEKPYVVVLGGAKVSDKIGVITNLMEKADRILIGGAMMFTFLKALGKEVGSSRVEED ----------------33333333-3333----------------1111--------111 KIDLAKELVEKAKEKGVEIVLPVDAVIAQKIEPGVEKKVVRIDDGIPEGWMGLDIGPETI 1-----------1111------------------------3333---------------- ELFKQKLSDAKTVVWNGPMGVFEIDDFAEGTKQVALAIAALTEKGAITVVGGGDSAAAVN ------1111----------3333-----------------------------------1 KFGLEDKFSHVSTGGGASLEFLEGKELPGIASMRIKKA 
1111111-------------------1111-------- >HYPOTHETICAL PROTEIN SSO2; SWP:NA; PDB:1VPHA; HMKVYFDDIYVSTARQFELVDITDQVEQIVEKSGIKNGICLIFVAHSTAAIVANEHERGL ------------------------------------------------------------ MEDILTKIKEFTEPSRSWKHNLIDDNAHAHLGATFLGAERVFPVREGKLVRGTWQNIFLV ------------1111-3333----3333---------------iiii------------ ELDGPRSERHITVEILGE ------------------ >DNA POLYMERASE III, BETA ; SWP:NA; PDB:1VPKA; HMKVTVTTLELKDKITIASKALAKKSVKPILAGFLFEVKDGNFYICATDLETGVKATVNA -----------------1111------3333-------%%%%------------------ AEISGEARFVVPGDVIQKMVKVLPDEITELSLEGDALVISSGSTVFRITTMPADEFPEIT ------------------3333----------!!!!----!!!!--------1111---- PAESGITFEVDTSLLEEMVEKVIFAAAKDEFMRNLNGVFWELHKNLLRLVASDGFRLALA ---------------------3333---33331111------------------------ EEQIENEEEASFLLSLKSMKEVQNVLDNTTEPTITVRYDGRRVSLSTNDVETVMRVVDAE ----------------------------------------------1111---------- FPDYKRVIPETFKTKVVVSRKELRESLKRVMVIASKGSESVKFEIEENVMRLVSKSPDYG --3333--------------------------3333------------------------ EVVDEVEVQKEGEDLVIAFNPKFIEDVLKHIETEEIEMNFVDSTSPCQINPLDISGYLYI -------------------3333----1111----------1111--------------- VMPIRLA ------- >ABC TRANSPORTER, ATP-BIND; SWP:Q9WZ14; PDB:1VPLA; GAVVVKDLRKRIGKKEILKGISFEIEEGEIFGLIGPNGAGKTTTLRIISTLIKPSSGIVT -----------!!!!----------2222------2222-------1111---------- VFGKNVVEEPHEVRKLISYLPEEAGAYRNMQGIEYLRFVAGFYASSSSEIEEMVERATEI iiii3333-----1111------------------------------------------- AGLGEKIKDRVSTYSKGMVRKLLIARALMVNPRLAILDEPTSGLDVLNAREVRKILKQAS --!!!!---3333--------------1111-------1111-----------------1 QEGLTILVSSHNMLEVEFLCDRIALIHNGTIVETGTVEELKERYKAQNIEEVFEEVVK 111--------33331111-------iiii---------------------------- >ACYL-COA HYDROLASE; SWP:Q9KEQ1; PDB:1VPMA; IQSYPVERSRTIQTRLVLPPDTNHLGTIFGGKVLAYIDEIAALTAKHANSAVVTASIDSV ----3333---------3333-1111---------------------------------- DFKSSATVGDALELEGFVTHTGRTSEVYVRVHSNNLLTGERTLTTESFLTVAVDESGKPK ------1111-----------1111----------------------------1111--- 
PVPQVEPQTEEEKRLYETAPARKENRKKRAAL ---------------------------1111- >HYPOTHETICAL PROTEIN TM16; SWP:NA; PDB:1VPQA; HMVYVGTSGFSFEDWKGVVYPEHLKPSQFLKYYWAVLGFRIVELNFTYYTQPSWRSFVQM -----------3333-----11113333------------------------------33 LRKTPPDFYFTVKTPGSVTHVLWKEGKDPKEDMENFTRQIEPLIEEQRLKMTLAQFPFSF 33------------3333--3333--------------------------------3333 KFSRKNVEYLEKLRESYPYELAVEFRHYSWDREETYEFLRNHGITFVVVDEPKLPGLFPY -------------1111---------3333-3333----1111----------2222--- RPITTTDYAYFRFHGRNERWFEAEGEERYDYLYSEEELKTLFEDVVELSRRVKETYVFFN ----------------1111---!!!!--------------------3333--------- NCYKGQAAINALQFKKMLEE --iiii-------------- >LUCIFERASE; SWP:O77206; PDB:1VPRA; EKGFEAGDNKLGGALNAKHVEKYGDNFKNGHKPEFHEDGLHKPEVGGKKFESGFHYLLEC iiii-----------33333333------------3333------------3333---33 HELGGKNASGGYGGPLCEDPYGSEVQATEKLLKEADSDRTLCFNNFQDPCPQLTKEQVAC 33----1111---1111-11113333------3333-----%%%%--------3333--2 KGFDYGDKTLKLPCGPLPWPAGLPEPGYVPKTNPLHGRWITVSGGQAAFIKEAIKSGMLG 2221111----1111----2222---------1111--------3333------------ AAEANKIVADTDHHQTGGYLRINQFGDVCTVDASVAKFARAKRTWKSGHYFYEPLVSGGN -----------!!!!---------!!!!-----3333--------2222-----1111-- LLGVWVLPEEYRKIGFFWEESGRCFRIERRAFPVGPYTFRQATEVGGKISFVFYVKVSND --------1111---------------------!!!!-------iiii------------ PESDPIPLQSRDYTALAGRDNAPTNLGKPYPTLAKDLDYPKKRD ---------------iiii----------------1111----- >POLYOMAVIRUS VP1 PENTAMER; SWP:P49302; PDB:1VPSA; GGMEVLDLVTGPDSVTEIEAFLNPRMGQPPTPESLTEGGQYYGWSRGINLATSDTEDSPG ----------2222-----------------------3333-----------1111---1 NNTLPTWSMAKLQLPMLNEDLTCDTLQMWEAVSVKTEVVGSGSLLDVHGFNKPTDTVNTK 111------------------------------------3333----------------- GISTPVEGSQYHVFAVGGEPLDLQGLVTDARTKYKEEGVVTIKTITKKDMVNKDQVLNPI ----------------------------1111--------3333------3333---333 SKAKLDKDGMYPVEIWHPDPAKNENTRYFGNYTGGTTTPPVLQFTNTLTTVLLDENGVGP 3----------1111---11111111---------------------------1111--- LCKGEGLYLSCVDIMGWRVTRNYDVHHWRGLPRYFKITLRKRWVK 
---------------------%%%%-------------------- >VP39; SWP:P07617; PDB:1VPT; MDVVSLDKPFMYFEEIDNELDYEPESANEVAKKLPYQGQLKLLLGELFFLSKLQRHGILD -----------3333--------3333------2222----------------1111-22 GATVVYIGSAPGTHIRYLRDHFYNLGVIIKWMLIDGRHHDPILNGLRDVTLVTRFVDEEY 22---------3333-------1111-------------3333--1111----------- LRSIKKQLHPSKIILISDVASPSTADLLSNYALQNVMISILNPVASSLKWRCPFPDQWIK -----------------------------------------------------1111--- DFYIPHGNKMLQPFAPSYSAEMRLLSIYTGENMRLTRVTKSDAVNYEKKMYYLNKIVRNK ---------------1111------------------------------------3333- VVVNFDYPNQEYDYFHMYFMLRTVYCNKTFPTTKAKVLFLQQSIFRFLNIP -1111-----3333-----3333--------3333---------------- >VPU PROTEIN; SWP:P19554; PDB:1VPU; LQIDRLIDRITERAEDSGNESEGDQEELSALVERGHLAPWDVDDL ---------3333-----22223333------2222-3333---- >UPF0230 PROTEIN TM1468; SWP:Q9X1H9; PDB:1VPVA; KVKILVDSTADVPFSWEKYDIDSIPLYVVWEDGRSEPDEREPEEINFYKRIREAGSVPKT -------3333-33331111---------1111-------3333---------------- SQPSVEDFKKRYLKYKEEDYDVVLVLTLSSKLSGTYNSAVLASKEVDIPVYVVDTLLASG ---------------1111---------1111---------1111------------!!! 
AIPLPARVARELENGATIEEVLKKLDERKNKDFKAIFYVSNFDYLVKGGRVFVGNLLKIR !----------1111----------------------------------------2222- VCLHIENGELIPYRKVRGDKKAIEALIEKLREDTPEGSKLRVIGVHADNEAGVVELLNTL -----iiii--------------------3333-2222--------------------33 RKSYEVVDEIISPGKVITTHVGPGTVGFGIEVLERK 33---------------------------------- >PROTEIN (TRANSALDOLASE (E; SWP:Q9WYD1; PDB:1VPXA; HMKIFLDTANLEEIKKGVEWGIVDGVTTNPQRVKEICDLVKGPVSAEVVSLDYEGMVREA ------------------------------3333-------------------------- RELAQISEYVVIKIPMTPDGIKAVKTLSAEGIKTNVTLVFSPAQAILAAKAGATYVSPFV ------1111-----------------1111----------------------------- GRMDDLSNDGMRMLGEIVEIYNNYGFETEIIAASIRHPMHVVEAALMGVDIVTMPFAVLE ---1111-----------------------------3333--------------333333 KLFKHPMTDLGIERFMEDWKKYLENL 33------------------------ >HYPOTHETICAL PROTEIN EF03; SWP:NA; PDB:1VPYA; HIRLGLTSFSSTLYEYASHLPLVEDTAYYGIPPKERVAEWVKAVPENFRFVKVYSGISCQ -----------3333---------1111----3333----1111---------3333--- GEWQTYYASEEEITAFLESAPLIESKKLFAFLVQFSGTFGCTKENVAYLQKIRHWFKDLP -3333---3333---------3333----------1111--------------------- IAIELRNNSWYQPNFVKQLQFKENQFSLVIVDEPQIPTNPVPFYPYVTNPNLVLFRFHGR ------3333--1111-----1111----------1111---------1111-------- NAAGWKKRTLYHYNTQEIADLSEAVLKSQEAKEVGVIFNNNSGGDAAENALQQKVLNLS -----------------------------------------3333--------1111-- >CARBON STORAGE REGULATOR ; SWP:CSRA_PSEAE; PDB:1VPZA; HLILTRRVGETLVGDDVTVTVLGVKGNQVRIGVNAPKEVAVHREEIYQRIQKEK ------2222--------------!!!!-------1111---3333-------- >33 KDA CHAPERONIN; SWP:HSLO_THEMA; PDB:1VQ0A; HMIYYGTMFDHKVRFSIVRMRVVEEARNRHALSYLATVVLGRALIGAALVTPWLAEKERW --------%%%%---------------1111------------------3333-2222-- TLDIEGNGPIRRVVAQSTSEFTVRGYVANPKVELPLNEKGKFDVAGAIGQGVLRVVRDLG -------3333------1111-------1111----1111-------------------- LKTPFVSQVPLVSGEIAEDLAYYFAVSEQIPSAFSIGVLVDSDGVKIAGGFAVQIIDRTL ----------------------------------------1111------------1111 EQEKVEMIEKNIKNLPSISKLFQEAEPLDVLERIFGEKVGFVETAEIKYKCDCNREKAKN 
-----------1111-33331111------------------------------------ ALLVLDKKELEDMRKEGKGEVVCKWCNTRYVFSEEELEELLKFKVDD -1111------------------------------------------ >DEOXYCYTIDYLATE DEAMINASE; SWP:P16006; PDB:1VQ2A; MKASTVLQIAYLVSQESKCCSWKVGAVIEKNGRIISTGYNGSPAGGVNCCDYAAEQGWLL -------------1111------------iiii---------2222-------------- NKRFVLAKEHRSAHSEWSSKNEIHAELNAILFAAENGSSIEGATMYVTLSPCPDCAKAIA ------1111-----------------------------2222---------------11 QSGIKKLVYCETYDKNKPGWDDILRNAGIEVFNVPKKNLNKLNWENINEFCGE 11----------11112222----1111------3333----3333------- >Phosphoribosylformylglyci; SWP:NA; PDB:1VQ3A; HLPLFKFAIDVQYRSNVRDPRGETIERVLREEKGLPVKKLRLGKSIHLEVEAENKEKAYE -------------1111------------------------------------------- IVKKACEELLVNPVVEEYEVREL -----------3333-------- >50S ribosomal protein L39; SWP:P22452; PDB:1VQO2; GKKSKATKKRLAKLDNQNSRVPAWVMLKTDRRNHKRRHWRRNDTDE ---------------------3333-1111--1111---------- >50S ribosomal protein L44; SWP:P32411; PDB:1VQO3; MQMPRRFNTYCPHCNEHQEHEVEKVRSGRQTGMKWIDRQRERNSGIGNDGKFSKVPGGDK ---------------------------------3333-----------!!!!-------- PTKKTDLKYRCGECGKAHLREGWRAGRLEFQE -------------------------------- >23S RIBOSOMAL RNA; SWP:P20276; PDB:1VQOA; GRRIQGQRRGRGTSTFRAPSHRYKADLEHRKVEDGDVIAGTVVDIEHDPARSAPVAAVEF ---3333-----3333--1111-------------------------1111--------1 EDGDRRLILAPEGVGVGDELQVGVSAEIAPGNTLPLAEIPEGVPVCNVESSPGDGGKFAR 111-------------------1111--2222--3333-2222-------2222------ ASGVNAQLLTHDRNVAVVKLPSGEMKRLDPQCRATIGVVAGGGRTDKPFVKAGNKHHKMK 2222---------------1111-----1111----------1111----3333----11 ARGTKWPNVRGVAMNAVDHPFGGGGRQHPGKPKSISRNAPPGRKVGDIASKRTGRGG 11-------3333-33331111-------------11112222-------------- >50S ribosomal protein L3P; SWP:P20279; PDB:1VQOB; PQPSRPRKGSLGFGPRKRSTSETPRFNSWPSDDGQPGVQGFAGYKAGMTHVVLVNDEPNS --------------------------------------------------------1111 PREGMEETVPVTVIETPPMRAVALRAYEDTPYGQRPLTEVWTDEFHSELDRTLDVPEDHD -2222------------------------1111--------------1111--------3 
PDAAEEQIRDAHEAGDLGDLRLITHTVPDAVPSVPKKKPDVMETRVGGGSVSDRLDHALD 333-----------------------33333333-------------------------- IVEDGGEHAMNDIFRAGEYADVAGVTKGKGTQGPVKRWGVQKRKGKHARQGWRRRIGNLG --------3333--2222--------------3333--------3333------------ PWNPSRVRSTVPQQGQTGYHQRTELNKRLIDIGEGDEPTVDGGFVNYGEVDGPYTLVKGS -------1111------------------------111122222222------------- VPGPDKRLVRFRPAVRPNDQPRLDPEVRYVSNESNQG ---2222------------------------------ >50S ribosomal protein L4P; SWP:P12735; PDB:1VQOC; MQATIYDLDGNTDGEVDLPDVFETPVRSDLIGKAVRAAQANRKQDYGSDEYAGLRTPAES ------1111--------3333-----------------1111-----1111-------- FGSGRGQAHVPKLDGRARRVPQAVKGRSAHPPKTEKDRSLDLNDKERQLAVRSALAATAD ------------%%%%---1111---------3333-------------------33333 ADLVADRGHEFDRDEVPVVVSDDFEDLVKTQEVVSLLEALDVHADIDRADETKIKAGQGS 333------------------3333---3333-------------------------333 ARGRKYRRPASILFVTSDEPSTAARNLAGADVATASEVNTEDLAPGGAPGRLTVFTESAL 3-------------------3333--2222---3333-3333-2222------------- AEVAER -3333- >50S ribosomal protein L5P; SWP:P14124; PDB:1VQOD; FHEMREPRIEKVVVHMGIGHANAEDILGEITGQMPVRTKAKRTVGEFDIREGDPIGAKVT 3333----------------1111------------------------------------ LRDEMAEEFLQTALPLAELATSQFDDTGNFSFGLDVTVNLVRPGYRVAKRDKASRSIPTK --3333------3333---3333-1111---------------3333----------333 HRLNPADAVAFIESTYDVEV 3------------------- >50S ribosomal protein L6P; SWP:P14135; PDB:1VQOE; PRVELEIPEDVDAEQDHLDITVEGDNGSVTRRLWYPDIDVSVDGDTVVIESDEDNAKTMS -------3333----!!!!----1111-------2222----!!!!-------------- TIGTFQSHIENMFHGVTEGWEYGMEVFYSHFPMQVNVEGDEVVIENFLGEKAPRRTTIHG -------------------------------------!!!!----2222----------- DTDVEIDGEELTVSGPDIEAVGQTAADIEQLTRINDKDVRVFQDGVYITRKP ------!!!!---------------------------3333----------- >50S ribosomal protein L7A; SWP:P12743; PDB:1VQOF; PVYVDFDVPADLEDDALEALEVARDTGAVKKGTNETTKSIERGSAELVFVAEDVQPEEIV 1111---------------------------------------------------3333- MHIPELADEKGVPFIFVEQQDDLGHAAGLEVGSAAAAVTDAGEADADVEDIADKVEELR 
------------------------1111------------------------------- >50S ribosomal protein L10; SWP:P15825; PDB:1VQOG; IPEWKQEEVDAIVEMIESRNTLLERALDD ------------------3333------- >50S ribosomal protein L10; SWP:P60617; PDB:1VQOH; KPASMYRDIDKPAYTRREYITGIPGSKIAQHKMGRKQKDADDYPVQISLIVEETVQLRHG -3333----------1111---------------11113333------------------ SLEASRLSANRHLIKELGEEGDYKMTLRKFPHQVLRENKDGMRAAFGKIVGTAARVQAGE --------------------------------------------------------2222 QLFTAYCNVEDAEHVKEAFRRAYNKITPSCRIDSSPAGNA ----------1111------3333---------------- >50S ribosomal protein L11; SWP:P14122; PDB:1VQOI; GVPPTAELIKDEAGFETGSGEPQEDFVADLSVDQVKQIAEQKHPDLLSYDLTNAAKEVVG ----------3333----------------3333---11113333----3333------- TCTSLGVTIE ---------- >50S ribosomal protein L13; SWP:P29198; PDB:1VQOJ; AEFDADVIVDARDCIMGRVASQVAEQALDGETVAVVNAERAVITGREEQIVEKYEKRVDI ----------2222------------1111------3333-------------------- GNDNGYFYPKRPDGIFKRTIRGMLPHKKQRGREAFESVRVYLGNPYDEDGEVLDGTSLDR -3333---------------11111111------1111--------------2222--11 LSNIKFVTLGEISETLGANKTW 11-------------------- >50S ribosomal protein L14; SWP:P22450; PDB:1VQOK; MEALGADVTQGLEKGSLITCADNTGARELKVISVHGYSGTKNRLPKAGLGDKITVSVTKG ------------2222-----------------2222--2222----2222--------- TPEMRRQVLEAVVVRQRKPIRRPDGTRVKFEDNAAVIVDENEDPRGTELKGPIAREVAQR 3333-----------------1111-------------1111------------------ FGSVASAATMIV -3333------- >50S ribosomal protein L15; SWP:P12737; PDB:1VQOL; TSKKKRQRGSRTHGGGSHKNRRGAGHRGGRGDAGRDKHEFHNHEPLGKSGFKRPQKVQEE -3333------iiii-------3333-----2222----2222----------1111--- AATIDVREIDENVTLLAADDVAEFRVDVRDVVEEADDADYVKVLGAGQVRHELTLIADDF ----3333----3333----------3333------------------------------ SEGAREKVEGAGGSVELTDLGEERQ -----------------3333---- >50S ribosomal protein L15; SWP:P60618; PDB:1VQOM; ARSAYSYIRDAWENPGDGQLAELQWQRQQEWRNEGAVERIERPTRLDKARSQGYKAKQGV --3333-------11113333--------3333----------------1111---2222 IVARVSVRKGSARKRRHKAGRRSKRQGVTRITRRKDIQRVAEERASRTFPNLRVLNSYSV 
---------------------3333----------3333---------1111-------- GQDGRQKWHEVILIDPNHPAIQNDDDLSWICADDQADRVFRGLTGAGRRNRGLSGKGKGS --1111--------1111--1111--3333-1111-3333--------1111----2222 EKTRPSLRSNGGKA ------3333---- >50S ribosomal protein L18; SWP:P14123; PDB:1VQON; ATGPRYKVPMRRRREARTDYHQRLRLLKSGKPRLVARKSNKHVRAQLVTLGPNGDDTLAS --1111---3333-------------3333--------1111--------1111------ AHSSDLAEYGWEAPTGNMPSAYLTGLLAGLRAQEAGVEEAVLDIGLNSPTPGSKVFAIQE ----3333-----------------------------------!!!!--2222------- GAIDAGLDIPHNDDVLADWQRTRGAHIAEYDEQLEEPLYSGDFDAADLPEHFDELRETLL -----------1111---3333-33333333----------------------------- DGDIEL ------ >50S ribosomal protein L18; SWP:P12733; PDB:1VQOO; SKTNPRLSSLIADLKSAARSSGGAVWGDVAERLEKPRRTHAEVNLGRIERYAQEDETVVV -------------------------------33333333----3333------------- PGKVLGSGVLQKDVTVAAVDFSGTAETKIDQVGEAVSLEQAIENNPEGSHVRVIR --------------------------------------------1111------- >50S ribosomal protein L19; SWP:P14119; PDB:1VQOP; TDLSAQKRLAADVLDVGKNRVWFNPERQGDIADAITREDVRELVDEGAIQAKDKKGNSRG -----------1111-1111---1111---1111-------------------------- RARERQKKRAYGHQKGAGSRKGKAGARQNSKEDWESRIRAQRTKLRELRDEGTLSSSQYR --------1111---1111---3333---------------------------------- DLYDKAGGGEFDSVADLERYIDA -----1111-------------- >50S ribosomal protein L21; SWP:P12734; PDB:1VQOQ; PSSNGPLEGTRGKLKNKPRDRGTSPPQRAVEEFDDGEKVHLKIDPSVPNGRFHPRFDGQT ----1111---1111-1111-----3333----2222------1111-----3333---- GTVEGKQGDAYKVDIVDGGKEKTIIVTAAHLRRQE ------!!!!----------------3333----- >50S ribosomal protein L22; SWP:P10970; PDB:1VQOR; GISYSVEADPDTTAKAMLRERQMSFKHSKAIAREIKGKTAGEAVDYLEAVIEGDQPVPFK --------3333----------------------2222---------------------- QHNSGVGHKSKVDGWDAGRYPEKASKAFLDLLENAVGNADHQGFDGEAMTIKHVAAHKVG --2222--1111---------------------------1111--1111----------- EQQGRKPRAMGRASAWNSPQVDVELILEEP --------iiii------------------ >50S ribosomal protein L23; SWP:P12732; PDB:1VQOS; SWDVIKHPHVTEKAMNDMDFQNKLQFAVDDRASKGEVADAVEEQYDVTVEQVNTQNTMDG 
----------------------------1111---------------------------- EKKAVVRLSEDDDAQEVASRI --------33333333-1111 >50S ribosomal protein L24; SWP:P10972; PDB:1VQOT; SKQPDKQRKSQRRAPLHERHKQVRATLSADLREEYGQRNVRVNAGDTVEVLRGDFAGEEG --------------33333333------------------------------1111---- EVINVDLDKAVIHVEDVTLEKTDGEEVPRPLDTSNVRVTDLDLEDEKREARLESEDDSA ------1111---2222---1111-------1111------------------------ >50S ribosomal protein L24; SWP:P14116; PDB:1VQOU; RECDYCGTDIEPGTGTMFVHKDGATTHFCSSKCENNADLGREARNLEWTDTAR -------------------1111------------------333333333333 ------------------------------------------------------------ ----- >50S ribosomal protein L30; SWP:P14121; PDB:1VQOW; MHALVQLRGEVNMHTDIQDTLEMLNIHHVNHCTLVPETDAYRGMVAKVNDFVAFGEPSQE ---------22223333----1111--2222----------------1111--------- TLETVLATRAEPLEGDADVDDEWVAEHTDYDDISGLAFALLSEETTLREQGLSPTLRLHP ---------------------------------------1111--3333----------- PRGGHDGVKHPVKEGGQLGKHDTEGIDDLLEAMR 2222------3333----------------1111 >50S ribosomal protein L31; SWP:P18138; PDB:1VQOX; ERVVTIPLRDARAEPNHKRADKAMILIREHLAKHFSVDEDAVRLDPSINEAAWARGRANT ----------33331111-------------------3333-----------1111---- PSKIRVRAARFEEEGEAIVEAE -----------1111------- >50S ribosomal protein L32; SWP:P12736; PDB:1VQOY; TELQARGLTEKTPDLSDEDARLLTQRHRVGKPQFNRQDHHKKKRVSTSWRKPRGQLSKQR -------1111------------------------2222--1111--------1111333 RGIKGKGDTVEAGFRSPTAVRGKHPSGFEEVRVHNVDDLEGVDGDTEAVRIASKVGARKR 3---------3333--3333---3333-------333322221111-----3333----- ERIEEEAEDAGIRVLNPTYVEV -------1111----------- >50S ribosomal protein L37; SWP:P60619; PDB:1VQOZ; RSGRFGARYGRVSRRRVAEIESEMNEDHACPNCGEDRVDRQGTGIWQCSYCDYKFTGGSY -1111------------------------------------2222--------------- KPETPGGKTVRRS ---3333------ >PENICILLIN-BINDING PROTEI; SWP:O54286; PDB:1VQQA; DKEINNTIDAIEDKNFKQVYKDSSYISKSDNGEVEMTERPIKIYNSLGVKDINIQDRKIK --------------------------------3333------------------------ KVSKNKKRVDAQYKIKTNYGNIDRNVQFNFVKEDGMWKLDWDHSVIIPGMQKDQSIHIEN 
--1111----------1111---------------------1111-2222---------- LKSERGKILDRNNVELANTGTAYEIGIVPKNVSKKDYKAIAKELSISEDYIKQQMDQNWV ---------1111--------------3333-------------------------1111 QDDTFVPLKTVKKMDEYLSDFAKKFHLTTNETESRNYPLEKATSHLLGYVGPINSEELKQ 1111--------------------------------1111-------------3333--3 KEYKGYKDDAVIGKKGLEKLYDKKLQHEDGYRVTIVDDSNTIAHTLIEKKKKDGKDIQLT 333------------3333---1111---------------------------------- IDAKVQKSIYNNMKNDYGSGTAIHPQTGELLALVSTPSYDVYPFMYGMSNEEYNKLTEDK --3333------1111------------------------3333--------------11 KEPLLNKFQITTSPGSTQKILTAMIGLNNKTLDDKTSYKIDGKGWQKDKSWGGYNVTRYE 11---1111----!!!!---------------1111-----------3333--------- VVNGNIDLKQAIESSDNIFFARVALELGSKKFEKGMKKLGVGEDIPSDYPFYNAQISNKN -----------1111------------------------2222----------------- LDNEILLADSGYGQGEILINPVQILSIYSALENNGNINAPHLLKDTKNKVWKKNIISKEN ---------1111-------------------iiii------3333-------------- INLLTDGMQQVVNKTHKEDIYRSYANLIGKSGTAELKGRQIGWFISYDKDNPNMMMAINV ---------------1111--1111----------------------1111--------- KDVQDKGMASYNAKISGKVYDELYENGNKKYDIDE --1111!!!!-------------%%%%----1111 >HYPOTHETICAL PROTEIN CJ02; SWP:Q0PBQ7; PDB:1VQRA; HIGDNELLLKSVEVLPPLPDTVSKLRKYVSEANIETKVAEIISSDPLTAKLLQLANSPYY ---------1111-------------------------------------------3333 GFTREITTINQVITLLGVGNIINIVADSIRDNFKIDVSPYGLNTQNFLKTCNEEATFIAN -2222----------------------1111-----3333-------------------- WLNDEDKKLSHLLVPCALLRLGIVIFSNFLIQNHKDKDFLAFLNKNENLALAENEFLGVD ------------------------------1111-------------------------- HISFLGFLLHRWNFDDVLIESICFVRTPHAAREKVKKSAYALAITDHLFAPHDGSSPFNA ---------------------1111-3333-1111------------------------- KAAVALLKEAKTQGINFDLNNLLSKLPNKAKENLNKED ----------1111------------------1111-- >HYPOTHETICAL PROTEIN AGR_; SWP:NA; PDB:1VQSA; HFYEIRTYRLKNGAIPAYLKVVEDEGIEIQKSHLGELVGYFFSEIGPINEIVHIWAFSSL ----------2222---------------------------------------------- DDRAERRARLADPRWLSFLPKIRDLIEVAENKIKPARFSPL -----------------33331111----------1111-- >OROTIDINE 
5'-PHOSPHATE DE; SWP:Q9WYG7; PDB:1VQTA; HMTPVLSLDMEDPIRFIDENGSFEVVKVGHNLAIHGKKIFDELAKRNLKIILDLKFCDIP ----------------------------3333----------3333-------------- STVERSIKSWDHPAIIGFTVHSCAGYESVERALSATDKHVFVVVKLTSMEGSLEDYMDRI -----------1111-----3333---------------------1111--3333----- EKLNKLGCDFVLPGPWAKALREKIKGKILVPGIRDVVTLEEMKGIANFAVLGREIYLSEN ---1111------------3333--------------33332222------3333----- PREKIKRIKE ---------- >ANTHRANILATE PHOSPHORIBOS; SWP:Q8YXQ9; PDB:1VQUA; TSWYLLLQQLIDGESLSRSQAAELQGWLSEAVPPELSGAILTALNFKGVSADELTGAEVL -3333---------------------1111------------------------------ QSQSKTNSPFSIIDTCGTGSSTFNISTAVAFVAAAYGVPVAKHGNRSSLTGSADVLEALG 1111-----------------------------1111-------------------1111 VNLGASPEKVQAALQEVGITFLFAPGWHPALKAVATLRRTLRIRTVFNLLGPLVNPLRPT ---------------------------33331111---------3333--11111111-- GQVVGLFTPKLLTTVAQALDNLGKQKAIVLHGRERLDEAGLGDLTDLAVLSDGELQLTTI -------3333--------1111--------1111---------------iiii------ NPQEVGVTPAPIGALRGGDVQENAEILKAVLQGKGTQAQQDAVALNAALALQVAGAVPLL 3333------3333---------------1111------------------------222 DHAQGVSVAKEILQTGTAWAKLAQLVYFLGN 2--------------------------1111 >THIAMINE MONOPHOSPHATE KI; SWP:O67883; PDB:1VQVA; RLKELGLIDLIKKTLESKVIDDTAPVSKKLLLTTDVLNEGVHFLRSYIPEAVGWKAISVN 3333--3333---------------------------2222--11113333--------- VSDVIANGGLPKWALISLNLPEDLEVSYVERFYIGVKRACEFYKCEVVGGNISKSEKIGI ----1111------------11113333-------------------------------- SVFLVGETERFVGRDGARLGDSVFVSGTLGDSRAGLELLLEKEEYEPFELALIQRHLRPT -----------------2222--------------------------------------- ARIDYVKHIQKYANASDISDGLVADANHLAQRSGVKIEILSEKLPLSNELKYCEKYGKNP -11113333-----------------33331111-----1111---3333--------33 IEYALFGGEDYQLLFTHPKERWNPFLDTEIGRVEEGGVFVDGKKVEP 33-------------------------------------iiii---- >HYPOTHETICAL PROTEIN AGR_; SWP:Q8UK99; PDB:1VQYA; HIVEERIYRIRGGKQEYLKLVREEGIAIQAPILGNLIGYFVTDIGPLSQVIHWGYASLDD ----------2222----------------3333-------------------------- 
RAERRGKLAEDQRWQAFIPRLSVLIESSENRILLPTDFSPLR ----------------33333333------------------ >LIPOATE-PROTEIN LIGASE, P; SWP:Q97QP1; PDB:1VQZA; HKYIINHSNDTAFNIALEEYAFKHLLDEDQIFLLWINKPSIIVGRHQNTIEEINRDYVRE ------------------------1111---------------11113333--------- NGIEVVRRISGGGAVYHDLNNLNYTIISKEDENKAFDFKSFSTPVINTLAQLGVKAEFTG -----------------1111---------1111--3333---------1111------- RNDLEIDGKKFCGNAQAYINGRIHHGCLLFDVDLSVLANALKVSKDKFESKGVKSVRARV -----iiii---------2222----------11113333-------------------- TNIINELPKKITVEKFRDLLLEYKKEYPETEYVFSEEELAEINRIKDTKFGTWDWNYGKS -3333------------------3333---------------------11113333---- PEFNVRRGIKFTSGKVEVFANVTESKIQDIKIYGDFFGIEDVAAVEDVLRGVKYEREDVL ----------1111--------%%%%--------------3333--1111---------- KALKTIDITRYFAGISREEIAEAVVG ------3333-2222----------- >PROBABLE 2-PHOSPHOSULFOLA; SWP:Q97E82; PDB:1VR0A; HKIDLIISADDIKEEKVKNKTAVVIDLRATSVITTALNNGCKRVVPVLTVEEALKKVKEY -------1111-33332222-------------------------------------111 GKDAILGGERKGLKIEGFDFSNSPEYTEDVVKGKTLITTTNGTRAIKGSETARDILIGSV 1--------iiii-2222--------33332222------------------------33 LNGEAVAEKIVELNNDVVIVNAGTYGEFSIDDFICSGYIINCVDRKKLELTDAATTAQYV 33---------------------%%%%--------------------------------- YKTNEDIKGFVKYAKHYKRIELGLKKDFEYCCKKDIVKLVPQYTNGEIL ---33333333------------------1111----------iiii-- >ACIREDUCTONE DIOXYGENASE; SWP:Q99JT9; PDB:1VR3A; VQAWYDESTADPRKPHRAQPDRPVSLEQLRTLGVLYWKLDADKYENDPELEKIRKRNYSW ----------1111---------------1111------3333----------------- DIITICKDTLPNYEEKIKFFEEHLHLDEEIRYILEGSGYFDVRDKEDKWIRISEKGDITL -----3333----------------------------------1111------2222--- PAGIYHRFTLDEKNYVKARLFVGEPVWTPYNRPADHFDARVQYSFLEGTA 2222------1111-------------------3333------3333--- >HYPOTHETICAL PROTEIN APC2; SWP:Q81H14; PDB:1VR4A; MIVTTTSGIQGKEIIEYIDIVNGEAIMGANIVRDLFASVRDVVGGRAGSYESKLKEARDI --------2222---------------3333------------1111----1111----- AMDEMKELAKQKGANAIVGVDVDYEVVRDGMLMVAVSGTAVRI ---------1111--------------iiii------------ >oligopeptide ABC 
transpor; SWP:Q9X0V0; PDB:1VR5A; HERNKTLYWGGALWSPPSNWNPFTPWNAVAGTIGLVYEPLFLYDPLNDKFEPWLAEKGEW -1111---------------111111112222---------------------------- VSNNEYVLTLRKGLRWQDGVPLTADDVVFTFEIAKKYTGISYSPVWNWLGRIERVDERTL ---------------1111---3333----3333--1111-3333--------------- KFVFSDPRYQEWKQLINTPIVPKHIWENKTEEEVLQAANENPVGSGPYYVESWADDRCVF ---------------------33331111------------------------1111--- KKNGNWWGIRELGYDPKPERIVELRVLSNNVAVGLKGELDWSNFFLPGVPVLKKAYGIVT --1111---------------------------------------2222----------- WYENAPYLPANTAGIYINVNKYPLSIPEFRRAAYAINPEKIVTRAYENVTAANPAGILPL -----------------1111-3333------33333333------------1111---3 PGYKYYPKEVVDKYGFKYDPEAKKILDELGFKDVNKDGFREDPNGKPFKLTIECPYGWTD 333-----------------------1111-----------1111---------2222-- WVSIQSIAEDLVKVGINVEPKYPDYSKYADDLYGGKFDLILNNFTTGVSATIWSYFNGVF ------------------------------------------1111-------------- YPDAVESEYSYSGNFGKYANPEVETLLDELNRSNDDAKIKEVVAKLSEILLKDLPFIPLW 3333---------1111------------------------------------------- YNGAWFQASEAVWTNWPTEKNPYAVPIGWNGWWQLTGIKTLFGIEAK -----------------1111-------22221111---1111---- >PHOSPHO-2-DEHYDRO-3-DEOXY; SWP:Q9WYH8; PDB:1VR6A; HMIVVLKPGSTEEDIRKVVKLAESYNLKCHISKGQERTVIGIIDRYVVADKFESLDCVES ------2222-----------3333----------------------3333---1111-- VVRVLKPYKLVSREFHPEDTVIDLGDVKIGNGYFTIIAGPCSVEGREMLMETAHFLSELG --------11113333-------------2222-----------------------1111 VKVLRGGAYKPRTSPYSFQGLGEKGLEYLREAADKYGMYVVTEALGEDDLPKVAEYADII -------------1111----------------------------3333----------- QIGARNAQNFRLLSKAGSYNKPVLLKRGFMNTIEEFLLSAEYIANSGNTKIILCERGIRT --3333----------1111-------1111----------------------------- FEKATRNTLDISAVPIIRKESHLPILVDPSHSGGRRDLVIPLSRAAIAVGAHGIIVEVHP ---------3333---------------------1111---------------------- EPEKALSDGKQSLDFELFKELVQEMKKLADALGVKVN 3333---3333-------------------------- >S-ADENOSYLMETHIONINE DECA; SWP:Q9WZC3; PDB:1VR7A; KSLGRHLVAEFYECDREVLDNVQLIEQEKQAAYESGATIVTSTFHRFLPYGVSGVVVISE 
--------------------------------3333------------------------ SHLTIHTWPEYGYAAIDLFTCGEDVDPWKAFEHLKKALKAKRVHVVEHERGRYDEIGIP -------3333----------1111--------------------------3333---- >GTP BINDING REGULATOR; SWP:Q9X1V7; PDB:1VR8A; HPPEAYSLDTAIFVLETRDYRLSDVKEIDSYGDVEKGKVAVFETEYGPVFLYVYKGEEAK --1111-----3333-1111----------!!!!---------1111------------- KIWKKLNGRVSIRSVLDLPNGKFSTVSNGKKIVAWWRKNWLFIVEGKNGVEEFVKHVYRV -----1111-----------------iiii------!!!!-------------------- YEEK ---- >CBS DOMAIN PROTEIN/ACT DO; SWP:Q9WZZ4; PDB:1VR9A; KVKKWVTQDFPVEESATVRECLHRRQYQTNECIVKDREGHFRGVVNKEDLLDLDLDSSVF 3333--------1111--------1111-------1111------3333----1111-11 NKVSLPDFFVHEEDNITHALLLFLEHQEPYLPVVDEERLKGAVSLHDFLEALIEALA 11--1111--11113333--------------------------------------- >ARGININE BIOSYNTHESIS BIF; SWP:Q9K8V3; PDB:1VRAA; ETANVLKLETGSVTSAKGFSAVGIHTGVKRKRKDLGAIVCEVPASSAAVYTLNKVQAAPL -------111111112222-------------------------------------3333 KVTQESIAVEGKLQAIVNSGIANACTGKRGLDDAYTRAVGAETFHIPEHYVAVTSTGVIG ----------------------------------------------1111---------- EFLPDVITNGIRQLKPEATIEGAHAFNEAILTTDTVEKHTCYQTIVNGKTVTVGGVAKGS ----------1111-------------1111--------------iiii----------- GIHPNA ------ >Arginine biosynthesis bif; SWP:Q9K8V3; PDB:1VRAB; TLSFVTTDANIDHGHLQGALSAITNETFNRITVDGDTSTNDVVVASGLAENETLTPEHPD --------------------------1111----------------3333----1111-- WANFYKALQLACEDLAKQIARDGEGATKLIEVEVTGAANDQEAGVAKQIVGSDLVKTAIY ----------------------2222------------3333----------------11 GADANWGRIICAIGYSGCEVNQETIDIAIGPIVTLKQSEPTGFSEEEATAYLKEADPVKI 11----------1111----1111----!!!!---%%%%--------------------- SVNLHIGNGTGKAWGCDLTYDYVRINAGY ------------------3333------- >PUTATIVE ASPARAGINYL HYDR; SWP:P46327; PDB:1VRBA; VLESIISPVTSEFLEEYWPVKPLVARGEVERFTSIPGFEKVRTLENVLAIYNNPVVVRFL ---1111---3333---------------3333-2222---------1111--------- VSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVYAAKNGGGF -------------------3333-3333-------------1111--------------- 
KAHFDAYTNLIFQIQGEKTWKLAKNENVSNPQHYDLSYPDDLQSYWKGDPPKEDLPDAEI ---------------------------------------3333--------1111----- VNLTPGTLYLPRGLWHSTKSDQATLALNITFGQPAWLDLLAALRKKLISDNRFRELAVNH ---2222---2222--------------------3333-----------3333-----33 QSLHESSKSELNGYLESLIQTLSENAETLTPEQIFQSQDSDFDPYQSTQLVFRQLLT 33-----------------------1111-------1111-------------1111 >INOSINE-5'-MONOPHOSPHATE ; SWP:Q9X168; PDB:1VRDA; MKEALTFDDVLLVPQYSEVLPKDVKIDTRLTRQIRINIPLVSAAMDTVTEAALAKALARE -----3333----------3333---------------------1111------------ GGIGIIHKNLTPDEQARQVSIVKKTIMSVIEHPNAARDEKGRLLVGAAVGTSPETMERVE ---------------------11113333--1111--1111----------1111----- KLVKAGVDVIVIDTAHGHSRRVIETLEMIKADYPDLPVVAGNVATPEGTEALIKAGADAV --1111--------------------------1111----------------1111---- KVGVGPGSICTTRVVAGVGVPQLTAVMECSEVARKYDVPIIADGGIRYSGDIVKALAAGA ------11113333-------------------1111----------3333--------- ESVMVGSIFAGTEEAPGETILYQGRKYKAYRGMGIEGMVPYKGTVKDVVHQLVGGLRSGM -----3333--1111--------------------------------------------- GYIGARTIKELQEKAVFVKIT --------------------- >PROPIONYL-COA CARBOXYLASE; SWP:Q9WZH5; PDB:1VRGA; SLRDKIEELKKIEKEIEQGGGPEKVEKQHRAGKLTAWERLELLLDPGTFVEIDKFVEHRN -----------------!!!!-----------------------2222----1111---- TYFGLDKVKLPRDGVITGVGEINGRKVAVFSQDFTVGGSLGEHAKKIVKLLDLALKGIPV -iiii----2222--------iiii-------3333------------------------ IGINDSGGARIQEGVDALAGYGEIFLRNTLASGVVPQITVIAGPCAGGAVYSPALTDFIV ---------3333-----------------2222------------3333--1111---- VDQTARFITGPNVIKAVTGEEISQEDLGGAVHNQKSGNAHFLADNDEKASLVRTLLSYLP ----------------------3333----3333----------3333-------1111- SNNAEEPPVEDPDTSLETPEDILDILPDNPNKGYDVRDVIKRVVDHGEFFEVQPYFAKNI -1111-----------------------1111------3333-2222-----11113333 VIGFARIQGKTVGIVANQPSVLAGVLDIDSSDKAARFIRFLDAFNIPILTFVDTPGYLPG ------iiii-------1111%%%%----------------1111--------------3 VAQEHGGIIRHGAKLLYAYSEATVPKITVILRKAYGGAYIAGSKHLGADVLAWPSAEIAV 3331111-----------------------------------3333------1111---- 
GPEGAANIIFKREIEASSNPEETRRKLIEEYKQQFANPYIAASRGYVDVIDPRETRKYIR -------1111--3333------------------------1111-----3333-3333- ALEVCETKVEYRPKKKHGNIPL ----1111-------------- >HYPOTHETICAL PROTEIN TM15; SWP:Q9X1N9; PDB:1VRMA; QYYELRDFALGTSVRIVVSSQKINPRTIAEAILEDKRITYKFSFTDERSVVKKINDHPNE --------%%%%---------------------------1111--1111----1111--- WVEVDEETYSLIKAACAFAELTDGAFDPTVGRLLELWGFTGNYENLRVPSREEIEEALKH -------------------1111------------------3333----------3333- TGYKNVLFDDKNRVVKNGVKIDLGGIAKGYALDRARQIALSFDENATGFVEAGGDVRIIG -3333----------%%%%---1111----------------1111-----%%%%----- PKFGKYPWVIGVKDPRGDDVIDYIYLKSGAVATSGDYERYFVVDGVRYHHILDPSTGYPA 2222--------------------------------------iiii-------------- RGVWSVTIIAEDATTADALSTAGFVAGKDWRKVVLDFPNGAHLLIVLEGGAIERSETFKL -----------------------------------3333-------2222----333311 FERE 11-- >CREATINE KINASE, M CHAIN; SWP:P04414; PDB:1VRPA; LNYSAAEEFPDLSKHNNHMAKALTLDIYKKLRDKETPSGFTLDDIIQTGVDNPGHPFIMT ---3333----1111-3333----------1111-1111------3333-----3333-- VGCVAGDEECYEVFKDLFDPVIEDRHGGYKPTDKHKTDLNQENLKGGDDLDPNYVLSSRV ------3333-1111--------------1111------1111-------1111------ RTGRSIKGIALPPHCSRGERRLVEKLCIDGLATLTGEFQGKYYPLSSMSDAEQQQLIDDH -----2222-3333----------------1111!!!!-----3333---------1111 FLFDKPISPLLLASGMARDWPDGRGIWHNNDKTFLVWVNEEDHLRVISMQKGGNMKEVFR -------33331111-------------1111---------------------------- RFCVGLKKIEDIFVKAGRGFMWNEHLGYVLTCPSNLGTGLRGGVHVKIPHLCKHEKFSEV -------------1111--------------3333--------------33331111--- LKRTRLQKRGTGGVDTAAVGSIYDISNADRLGFSEVEQVQMVVDGVKLMVEMEKRLENGK ---------------------------------3333----------------------- SIDDLMPAQK -3333----- >HIV-1 REVERSE TRANSCRIPTA; SWP:P04585; PDB:1VRTA; PIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAI --------------------------------------1111------------------ KKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDED -----------------1111-----------3333--------------3333---111 FRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQY 
13333---------------------2222---------------------3333----! MDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDKWTVQP !!!-----------------------1111----3333-------iiii--1111----- IVLPEKDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLTEEAELEL --------------------------------3333---2222-1111------------ AENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHTN -----3333-------1111---------2222-------2222---------------- DVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEFVNTPPLV --------------------------------------1111------------------ KLWYQLEKEPIVGAETFYVDAGYVTNRGRQKVVTLTDTTNQKTELQAIYLALQDSGLEVN ------------------------1111----------3333------------------ IVTDSQYALGIIQAQPDQSESELVNQIIEQLIKKEKVYLAWVPAH -----------1111------------------------------ >PUTATIVE DNA LIGASE-LIKE ; SWP:P71571; PDB:1VS0A; FEFDNLAPLATHGTVAGLKASQWAFEGWDGYRLLVEADHGAVRLRSRSGRDVTAEYPQLR -1111---------22223333---------------iiii----1111--11113333- ALAEDLADHHVVLDGEAVVLDSSGVPSFSQQNRGRDTRVEFWAFDLLYLDGRALLGTRYQ ---1111-------------1111--3333---1111-----------iiii-1111333 DRRKLLETLANATSLTVPELLPGDGAQAFACSRKHGWEGVIAKRRDSRYQPGRRCASWVK 3-------------------------------1111-------1111--2222-1111-- DKHWNTQEVVIGGWRAVGSLLGIPGPGGLQFAGRVGTGLSERELANLKELAPLHTDESPF ------------------------2222---------------------3333------- DVPLPARDAKGITYVKPALVAEVRYSEWTPEGRLRQSSWRGLRPDKKPSEVVRE ----3333--------------------1111----------11113333---- >TRNA PSEUDOURIDINE SYNTHA; SWP:Q5SHU9; PDB:1VS3A; MRRLLLLCEYDGTLFAGLQRQGRGLRTVQGELERALPGIGALPKAVAAGRTDAGVHALAM -----------1111------%%%%--------------------------2222----- PFHVDVESAIPVEKVPEALNRLLPEDLKVVGAREVAPDFHARKDALWRAYRYRILVRPHP ----------3333---------1111--------11113333----------------- SPLLRHRALWVRRPLDLEAMEEALSLLLGRHNFLGFAKEETRPGERELLEARLQVAEGEA 3333------------------3333------3333---------------------111 GLEVRLYFRGKSFLRGQVRGMVGTLLEVGLGKRPPESLKAILKTADRRLAGPTAPAHGLY 1------------2222----------------3333--------3333-----3333-- FVEAAYPEE --------- >n/a; SWP:P26332; PDB:1VSGA; 
AAEKGFKQAFWQPLCQVSEELDDQPKGALFTLQAAASKIQKMRDAALRASIYAEINHGTN ------3333---------------------------------------------2222- RAKAAVIVANHYAMKADSGLEALKQTLSSQEVTATATASYLKGRIDEYLNLLLQTKESGT ---------------------------------------------------1111----- SGCMMDTSGTNTVTKAGGTIGGVPCKLQLSPIQPKRPAATYLGKAGYVGLTRQADAANNF -----3333---------------------------------11111111----3333-- HDNDAECRLASGHNTNGLGKSGQLSAAVTMAAGYVTVANSQTAVTVQALDALQEASGAAH ------3333--------------------%%%%-------------------------- QPWIDAWKAKKALTGAETAEFRNETAGIAGKTGVTKLVEEALLKKKDSEASEIQTELKKY ------------------1111----3333----------3333-----------3333- FSGHENEQWTAIEKLISEQPVAQNLVGDNQPTKLGELEGNAKLTTILAYYRMETAGKFEV ----3333-------------3333-2222--3333------------------------ LT -- >VSR ENDONUCLEASE; SWP:P09184; PDB:1VSRA; AIEKRLASLLTGQGLAFRVQDASLPGRPDFVVDEYRCVIFTHGCFWHHHHCYLFKVPATR ----------1111------1111--------1111--------1111--3333------ TEFWLEKIGKNVERDRRDISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEG ----------------------1111------3333--1111-------------1111- ASAQIDTQGIHLLA -----1111----- >Coat protein; SWP:P03579; PDB:1VTMP; PYTINSPSQFVYLSSAYADPVELINLCTNALGNQFQTQQARTTVQQQFADAWKPSPVMTV -----3333-----------1111-3333-------1111--3333-3333-----3333 RFPASDFYVYRYNSTLDPLITALLNSFDTRNRIIEVNNQPAPNTTEIVNATQRVDDATVA --------------3333---3333-------------------------1111------ IRASINNLANELVRGTGMFNQAGFETASGLVWTTTPAT -------------------3333--------------- >DELTA-ATRACOTOXIN-HV1; SWP:P13494; PDB:1VTX; CAKKRNWCGKTEDCCCPMKCVYAWYNEQGSCQSTISALWKKC --2222-------------------3333------3333--- >CCDB; SWP:P05703; PDB:1VUBA; MQFKVYTYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTT 2222-----------1111------------3333----------------------111 DMASVPVSVIGEEVADLSHRENDIKNAINLMFWGI 1----3333-------3333--------------- >ORF2 CONTAINS A REVERSE T; SWP:O00378; PDB:1VYBA; GSNSHITILTLNINGLNSAIKRHRLASWIKSQDPSVCCIQETHLTCRDTHRLKIKGWRKI --------------------------------------------33333333-2222--- 
YQANGKQKKAGVAILVSDKTDFKPTKIKRDKEGHYIMVKGSIQQEELTILNIYAPNTGAP ----------------1111---------1111--------iiii--------------- RFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYR ---------3333---------------33333333------------------------ TLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELR --1111-------1111----------33331111--------------------- ------------------------------------------------------------ ----- >CYTOCHROME C2; SWP:P00094; PDB:1VYDA; GDAAKGEKEFNKCKTCHSIIAPDGTEIVKGAKTGPNLYGVVGRTAGTYPEFKYKDSIVAL -3333---------------1111------------2222-------------------- GASGFAWTEEDIATYVKDPGAFLKEKLDDKKAKTEMAFKLAKGGEDVAAYLASVVK 1111------------------------1111--------------------3333 >14 KDA FATTY ACID BINDING; SWP:NA; PDB:1VYFA; SMSSFLGKWKLSESHNFDAVMSKLGVSWATRQIGNTVTPTVTFTMDGDKMTMLTESTFKN -3333----------3333-------------1111-------------------3333- LSCTFKFGEEFDEKTSDGRNVKSVVEKNSESKLTQTQVDPKNTTVIVREVDGDTMKTTVT -----2222-----1111--------------------3333--------!!!!------ VGDVTAIRNYKRLS !!!!---------- >RNA POLYMERASE ALPHA SUBU; SWP:P22363; PDB:1VYIA; WSATNEEDDLSVEAEIAHQIAESFSKKYKFPSRSSGIFLYNFEQLKMNLDDIVKEAKNVP ----------------------1111--------------3333--------------22 GVTRLAHDGSKIPLRCVLGWVALANSKKFQLLVEADKLSKIMQDDLNRYTS 22---1111-------------------------------------1111- >OXYGEN-EVOLVING ENHANCER ; SWP:P12301; PDB:1VYKA; EARPIVVGPPPPLTKDRFYLQPLPPTEAAQRAKVSASEILNVKQFIDRKAWPSLQNDLRL ------------------------------------------------------------ RASYLRYDLKTVISAKPKDEKKSLQELTSKLFSSIDNLDHAAKIKSPTEAEKYYGQTVSN ------------11111111---------------------------------------- INEVLAKLG --------- >ARGONAUTE2; SWP:Q9VUQ5; PDB:1VYNA; AMPMIEYLERFSLKAKINNTTNLDYSRRFLEPFLRGINVVYTPPQSFQSAPRVYRVNGLS -----------------1111---------3333---------3333------------- RAPASSETFEHDGKKVTIASYFHSRNYPLKFPQLHCLNVGSSIKSILLPIELCSIEE --3333----iiii--------------------------3333----3333----- >AVIDIN; SWP:P02701; PDB:1VYOA; KCSLTGKWTNDLGSNMTIGAVNSRGEFTGTYITAVTATSNEIKESPLHGTQNTINKRTQP 
---------1111--------1111---------------------------2222---- TFGFTVNWKFSESTTVFTGQCFIDRNGKEVLKTMWLLRSSVNDIGDDWKATRVGINIFTR -----------------------1111---------------33331111---------- L - >DEOXYURIDINE 5'-TRIPHOSPH; SWP:Q8II92; PDB:1VYQA; MHLKIVCLSDEVREMYKNHKTHHEDSGLDLFIVKDEVLKPKSTTFVKLGIKAIALQYKSN ------------------------------------------------------------ YYNIVNTSFLLFPRSSISKTPLRLANSIGLIDAGYRGEIIAALDNTSDQEYHIKKNDKLV ----------------1111---1111----3333------------------2222--- QLVSFTGEPLSFELVEELDETS ---3333--------------- >PENTAERYTHRITOL TETRANITR; SWP:P71278; PDB:1VYRA; AEKLFTPLKVGAVTAPNRVFMAPLTRLRSIEPGDIPTPLMGEYYRQRASAGLIISEATQI --1111---!!!!-------------------------------1111------------ SAQAKGYAGAPGLHSPEQIAAWKKITAGVHAEDGRIAVQLWHTGRISHSSIQPGGQAPVS 3333--2222-------------------1111---------!!!!-33332222----- ASALNANTRTSLRDENGNAIRVDTTTPRALELDEIPGIVNDFRQAVANAREAGFDLVELH -------------1111-------------1111-------------------------- SAHGYLLHQFLSPSSNQRTDQYGGSVENRARLVLEVVDAVCNEWSADRIGIRVSPIGTFQ -iiii------3333----1111---------------------1111----------ii NVDNGPNEEADALYLIEELAKRGIAYLHMSETDLAGGKPYSEAFRQKVRERFHGVIIGAG ii-----------------1111---------1111------------------------ AYTAEKAEDLIGKGLIDAVAFGRDYIANPDLVARLQKKAELNPQRPESFYGGGAEGYTDY ---------------------3333-------------------3333-----2222--- PS -- >CALCIUM CHANNEL BETA-3 SU; SWP:CCB3_RAT; PDB:1VYUA; SARREVESQAQQQLERAKHKPVAFAVRTNVSYCGVLDEECPVQGSGVNFEAKDFLHIKEK 3333-----------3333-------------33331111-2222----2222------- YSNDWWIGRLVKEGGDIAFIPSPQRLESIRLKQEQKVPPYDVVPSRPVVLVGPSLKGYEV -1111------2222----------------1111-----------------------33 TDQKALFDFLKHRFDGRISITRVTADLSLAKSIAEVQSEIERIFELAKSLQLVVLDADTI 33-----------2222--------3333--3333-----------1111------1111 NHPAQLAKTSLAPIIVFVKVSSPKVLQRLIRSRGKSQKHLTVQAYDKLVQCPPESFDVIL -33331111------------------------3333--3333----11113333----- DENQLDDACEHLAEYLEVYWRATHHP -------------------------- >CALCIUM CHANNEL BETA-4SUB; SWP:NA; PDB:1VYVA; 
AIRQEREQQAAIQLERAKSKPVAFAVKTNVSYCGALDEDVPVPSSAVSFDAKDFLHIKEK -------------3333--------------------------------2222------- YNNDWWIGRLVKEGCEIGFIPSPLRLENIRIQQEQKRKHIPPYDVVPSRPVVLVGPSLKG -1111------------------------------------------------------- YEVTDQKALFDFLKHRFDGRISITRVTADISLAKSSLAEVQSEIERIFELARSLQLVVLD -3333-----------2222--------1111---3333-----------1111------ ADTINHPAQLIKTSLAPIIVHVKVSSPKVLQRLIKSRGKSQSKHLNVQLVAADKLAQCPP 1111-33331111-------------------------1111------------------ EFDVILDENQLEDACEHLGEYLEAYWRATHT ------------------------------- >ORF K3; SWP:P90495; PDB:1VYXA; MEDEDVPVCWICNEELGNERFRACGCTGELENVHRSCLSTWLTISRNTACQICGVVYNTR 3333-----------------------3333----------------------------- >CHROMOSOME PARTITIONING P; SWP:Q9LCY0; PDB:1VZ0A; VVRLPLASIRPNPRQPRKRFAEESLKELADSIREKGLLQPLLVRPQGDGYELVAGERRYR ----3333-------------------------------------!!!!----------- AALMAGLQEVPAVVKDLTDREALELALVENLQREDLSPVEEARGYQALLEMGLTQEEVAR ------------------------------------------------1111-------- RVGKARSTVANALRLLQLPPEALEALERGEITAGHARALLMLEPEDRLWGLKEILEKGLS ----3333------11113333----------------11113333-------------3 VRQAEAL 333---- >ORNITHINE ACETYL-TRANSFER; SWP:Q53940; PDB:1VZ6A; TPRGFVVHTAPVGLADDGRDDFTVLASTAPATVSAVFTRSRFAGPSVVLCREAVADGQAR -2222----------------------------------1111----------1111--- GVVVLARNANVATGLEGEENAREVREAVARALGLPEGEMLIASTGVIGRQYPMESIREHL ----------------------------------1111---------------------1 KTLEWPAGEGGFDRAARAIMTTDTRPKEVRVSVGGATLVGIAKGVGMLEPDMATLLTFFA 111-------------1111------------iiii------------------------ TDARLDPAEQDRLFRRVMDRTFNAVSIDTDTSTSDTAVLFANGLAGEVDAGEFEEALHTA --------------------1111------------------1111-------------- ALALVKDIASDGEGAAKLIEVQVTGARDDAQAKRVGKTVVNSPLVKTAVHGCDPNWGRVA -----------2222--------------------------3333---1111-------- MAIGKCSDDTDIDQERVTIRFGEVEVYPPDDALRAAVAEHLRGDEVVIGIDLAIADGAFT -----1111---3333----!!!!-----3333------1111----------------- VYGCDLTEGYVRLNSE ------3333--1111 >DESULFOFERRODOXIN; SWP:Q46495; PDB:1VZIA; 
PERLQVYKCEVCGNIVEVLNGGIGELVCCNQDMKLMSENTVDAAKAKHVPVIEKIDGGYK -2222----------------------%%%%------------3333-------2222-- VKVGAVAHPMEEKHYIQWIELLADDKCYTQFLKPGQAPEAVFLIEAAKVVAREYCNIHGH ----------1111------------------2222------------------------ WKAEN ----- >OSTEOCALCIN; SWP:Q800Y1; PDB:1VZMA; KELTLAQTSLRVCTNMACDMADAQGIVAAYQAFYGPIPF ---3333--------3333-------------------- >RIBOSOMAL PROTEIN S6 KINA; SWP:O75582; PDB:1VZOA; QLLTVKHELRTANLTGHAEKVGIENFELLKVLGTGAYGKVFLVRKISGHDTGKLYAMKVL ---------------------1111----------------------1111--------- KKATIVQKAKTTEHTRTERQVLEHIRQSPFLVTLHYAFQTETKLHLILDYINGGELFTHL ----------1111-------------1111----------------------------- SQRERFTEHEVQIYVGEIVLALEHLHKLGIIYRDIKLENILLDSNGHVVLTDFGLSKEFV -------------------------1111------3333---1111-------------3 ADETERAYDFCGTIEYMAPDIVRGGDDKAVDWWSLGVLMYELLTGASPFTVDGEKNSQAE 333----1111--1111-----------------------------11112222------ ISRRILKSEPPYPQEMSALAKDLIQRLLMKDPKKRLGCGPRDADEIKEHLFFQKINWDDL ------------3333--------------333322221111------3333-------1 AAKKVPAPFKPVIRDELDV 111---------------- >ATP SYNTHASE COUPLING FAC; SWP:P02721; PDB:1VZSA; NKELDPVQKLFVDKIREYRTKRQTSGGPVDAGPEYQQDLDRELFKLKQMYGKADMNTFPN -------------11113333-------------------------------%%%%---- FTFEDPKFEVVEKPQS ---------------- >VARICELLA-ZOSTER VIRUS PR; SWP:P09286; PDB:1VZV; EALYVAGYLALYSKDEGELNITPEIVRSALPPTSKIPINIDHRKDCVVGEVIAIIEDIRG ------------3333-----3333--------------%%%%-------------1111 PFFLGIVRCPQLHAVLFEAAHSNFFGNRDSVLSPLERALYLVTNYLPSVSLSSKRLFTHV --------3333-------------3333-----3333---------------------- ALCVVGRRVGTVVNYDCTPESSIEPFRVLSMESKARLLSLVKDYAGLNKVWKVSEDKLAK -----------------3333-1111---3333---------------------1111-- VLLSTAVNNMLLRDRWDVVAKRRREAGIMGH --------1111-3333-------------- >PHOSPHORIBOSYL ISOMERASE ; SWP:P16250; PDB:1VZWA; SKLELLPAVDVRDGQAVRETSYGSPLEAALAWQRSGAEWLHLVDLDAAFGTGDNRALIAE -----------iiii-----------------1111------------------------ VAQAMDIKVELSGGIRDDDTLAAALATGCTRVNLGTAALETPEWVAKVIAEHGDKIAVGL 
-3333-----------------------------3333-------------!!!!----- DVRGTTLRGRGWTRDGGDLYETLDRLNKEGCARYVVTDIGPNLELLKNVCAATDRPVVAS --------------------------1111-------------------1111------- GGVSSLDDLRAIAGLVPAGVEGAIVGKALYAKAFTLEEALEATS ----3333-------3333------3333--------------- >33 KDA CHAPERONIN; SWP:P37565; PDB:1VZYA; MDYLVKALAYDGKVRAYAARTTDMVNEGQRRHGTWPTASAALGRTMTASLMLGAMLKGDD ---------iiii---------------------------------------1111!!!! KLTVKIEGGGPIGAIVADANAKGEVRAYVSNPQVHFDLNAAGKLDVRRAVGTNGTLSVVK ---------3333-------------------------1111--3333------------ DLGLREFFTGQVEIVSGELGDDFTYYLVSSEQVPSSVGVGVLVNPDNTILAAGGFIIQLM --2222-------------------------------------1111------------2 PGTDDETITKIEQRLSQVEPISKLIQKGLTPEEILEEVLGEKPEILETMPVRFHCPCSKE 222------------------------------------------------------333 RFETAILGLGKKEIQDMIEEDGQAEAVCHFCNEKYLFTKEELEGLRDQTT 3------------------------------------------------- >ACYL-COA OXIDASE; SWP:O65202; PDB:1W07A; EGIDHLADERNKAEFDVEDMKIVWAGSRHAFEVSDRIARLVASDPVFEKSNRARLSRKEL -----33333333--3333------------------------3333---1111------ FKSTLRKCAHAFKRIIELRLNEEEAGRLRHFIDQPAYVDLHWGMFVPAIKGQGTEEQQKK -----------------------------3333--------------------------- WLSLANKMQIIGCYAQTELGHGSNVQGLETTATLDPKTDEFVIHTPTQTASKWWPGGLGK ----1111---------1111--3333-------------------3333----2222-- VSTHAVVYARLITNGKDYGIHGFIVQLRSLEDHSPLPNITVGDIGTKMGNGAYNSMDNGF ------------%%%%-------------------2222-----------!!!!------ LMFDHVRIPRDQMLMRLSKVTREGEYVPSDVPKQLVYGTMVYVRQTIVADASNALSRAVC --------1111--------3333-------1111------------------------- IATRYSAVRRQFGAGIETQVIDYKTQQNRLFPLLASAYAFRFVGEWLKWLYTDVTERLAA ------------------3333-------------------------------------- SDFATLPEAHACTAGLKSLTTTATADGIEECRKLCGGHGYLWCSGLPELFAVYVPACTYE --1111----------------------------------3333---------------- GDNVVLQLQVARFLMKTVAQLGSGKVPVGTTAYMGRAAHLLQCRSGVQKAEDWLNPDVVL -----------------1111------!!!!1111-------------3333-------- EAFEARALRMAVTCAKNLSKFENQEQGFQELLADLVEAAIAHCQLIVVSKFIAKLEQDIG 
----------------1111---------------------------------1111--- GKGVKKQLNNLCYIYALYLLHKHLGDFLSTNCITPKQASLANDQLRSLYTQVRPNAVALV 2222-----------------------1111--------------------3333----- DAFNYTDHYLNSVLGRYDGNVYPKLFEEALKDPLNDSVVPDGYQEYLRPVLQQQL -----3333--33331111------------3333------3333----1111-- >3-ISOPROPYLMALATE DEHYDRO; SWP:LEU3_MYCTU; PDB:1W0DA; MSKLAIIAGDGIGPEVTAEAVKVLDAVVPGVQKTSYDLGARRFHATGEVLPDSVVAELRN ---------!!!!-------------------------------------3333------ HDAILLGAIGDPSVPSGVLERGLLLRLRFELDHHINLRPARLYPGVASPLSGNPGIDFVV ----------33332222------------------------2222-------------- VREGTEGPYTGNGGAIRVGTPNEVATEVSVNTAFGVRRVVADAFERARRRRKHLTLVHKT ------1111------2222--------------------------------------33 NVLTFAGGLWLRTVDEVGECYPDVEVAYQHVDAATIHMITDPGRFDVIVTDNLFGDIITD 33--------------33331111----------------1111---------------- LAAAVCGGIGLAASGNIDATRANPSMFEPVHGSAPDIAGQGIADPTAAIMSVALLLSHLG ---11113333------1111------------3333-------------------1111 EHDAAARVDRAVEAHLATRGSERLATSDVGERIAAAL ------------------!!!!-----------1111 >3'-5' EXONUCLEASE ERI1; SWP:Q8IV48; PDB:1W0HA; ADSYYDYICIIDFEATCEEGNPPEFVHEIIEFPVVLLNTHTLEIEDTFQQYVRPEINTQL -----------------22221111----------------------------------- SDFCISLTGITQDQVDRADTFPQVLKKVIDWKLKELGTKYKYSLLTDGSWDSKFLNIQCQ --------------1111-------------1111------------3333--------- LSRLKYPPFAKKWINIRKSYGNFYKVPRSQTKLTILEKLGDYDGRPHCGLDDSKNIARIA ------3333----------------3333-33333333-----2222------------ VRLQDGCELRINEK --1111-------- >TRIOSEPHOSPHATE ISOMERASE; SWP:Q8NKN9; PDB:1W0MA; MRLPILIINFKAYGEAAGKRAVELAKAAERAARELGVNIVVAPNHLELGLVSQSVDIPVY ------------3333-3333----------------------3333---1111------ AQGADVEAGGAHTAHVSLENIKEAGGSGVILNHSEAPLKLNDLARLVAKAKSLGLDVVVC -----------2222------1111-------1111--------------1111------ APDPRTSLAAAALGPHAVAVEPPELIGTGRAVSRYKPEAIVETVGLVSRHFPEVSVITGA ----------1111-------3333-----3333----------------1111------ GIESGDDVAAALRLGTRGVLLASAAVKAKDPYAKIVELAKPLSELR -----------1111-------3333------------3333---- 
>ENDO-1,4-BETA-XYLANASE D; SWP:P45796; PDB:1W0NA; ITKVEAENMKIGGTYAGKISAPFDGVALYANADYVSYSQYFANSTHNISVRGASSNAGTA ----3333---------------------1111--------------------------- KVDLVIGGVTVGSFNFTGKTPTVQTLSNITHATGDQEIKLALTSDDGTWDAYVDFIEFSL -----iiii--------------------------------------------------- >SIALIDASE; SWP:P37060; PDB:1W0PA; ALFDYNATGDTEFDSPAKQGWMQDNTNNGSGVLTNADGMPAWLVQGIGGRAQWTYSLSTN ---------3333-3333-------!!!!------------------------------- QHAQASSFGWRMTTEMKVLSGGMITNYYANGTQRVLPIISLDSSGNLVVEFEGQTGRTVL -----------------------------------------1111-----2222------ ATGTAATEYHKFELVFLPGSNPSASFYFDGKLIRDNIQPTASKQNMIVWGNGSSNTDGVA --3333---------------------iiii----------------------------- AYRDIKFEIQGDVIFRGPDRIPSIVASSVTPGVVTAFAEKRVGGGDPGALSNTNDIITRT -----------------------------2222-------2222----1111-------- SRDGGITWDTELNLTEQINVSDEFDFSDPRPIYDPSSNTVLVSYARWPTDAAQNGDRIKP --------------3333-----------------------------1111-2222--11 WMPNGIFYSVYDVASGNWQAPIDVTDQVKERSFQIAGWGGSELYRRNTSLNSQQDWQSNA 11----------1111-------3333-----------------------1111------ KIRIVDGAANQIQVADGSRKYVVTLSIDESGGLVANLNGVSAPIILQSEHAKVHSFHDYE ---------------------------1111-----2222--------3333-------- LQYSALNHTTTLFVDGQQITTWAGEVSQENNIQFGNADAQIDGRLHVQKIVLTQQGHNLV ----1111-----iiii--------------------1111------------%%%%--- EFDAFYLAQQTPEVEKDLEKLGWTKIKTGNTMSLYGNASVNPGPGHGITLTRQQNISGSQ -----3333-3333--3333--------------------------------1111---2 NGRLIYPAIVLDRFFLNVMSIYSDDGGSNWQTGSTLPIPFRWKSSSILETLEPSEADMVE 222--------------------------------------------------------- LQNGDLLLTARLDFNQIVNGVNYSPRQQFLSKDGGITWSLLEANNANVFSNISTGTVDAS 1111-------------iiii-----------iiii----222233332222-------- ITRFEQSDGSHFLLFTNPQGNPAGTNGRQNLGLWFSFDEGVTWKGPIQLVNGASAYSDIY -----3333------------2222------------iiii------------------- QLDSENAIVIVETDNSNMRILRMPITLLKQKLT -------------iiii------33331111-- >TELOMERIC REPEAT BINDING ; SWP:P54274; PDB:1W0TA; KRQAWLWEEDKNLRSGVRKYGEGNWSKILLHYKFNNRTSVMLKDRWRTMKKL 
--------------------2222---3333--------------------- >TELOMERIC REPEAT BINDING ; SWP:Q15554; PDB:1W0UA; KKQKWTVEESEWVKAGVQKYGEGNWAAISKNYPFVNRTAVMIKDRWRTMKRLGMN --------------------2222---1111------------------1111-- >SYNAPTOTAGMIN IV; SWP:P50232; PDB:1W15A; RGELLVSLCYQSTTNTLTVVVLKARHLPLSDPYVKVNLYHAKKRISKKKTHVKKCTPNAV ----------1111---------------------------------------------- FNELFVFDIPCESLEEISVEFLVLDSERGSRNEVIGRLVLGATAEGSGGGHWKEICDFPR -------------1111---------2222----------1111-------------222 RQIAKWHMLCDG 2----------- >LEVANSUCRASE; SWP:Q5I5I3; PDB:1W18A; GVPGFPLPSIHTQQAYDPQSDFTARWTRADALQIKAHSDATVAAGQNSLPAQLTMPNIPA -2222-----------3333------------------11112222---1111------- DFPVINPDVWVWDTWTLIDKHADQFSYNGWEVIFCLTADPNAGYGFDDRHVHARIGFFYR ------------------1111----iiii--------1111--33331111-------- RAGIPASRRPVNGGWTYGGHLFPDGASAQVYAGQTYTNQAEWSGSSRLMQIHGNTVSVFY ----3333-1111---------333333332222-------------------------- TDVAFNRDANANNITPPQAIITQTLGRIHADFNHVWFTGFTAHTPLLQPDGVLYQNGAQN -------1111-------------------1111---------------------33331 EFFNFRDPFTFEDPKHPGVNYMVFEGNTAGQRGVANCTEADLGFRPNDPNAETLQEVLDS 111---------1111--------------2222---3333---2222------------ GAYYQKANIGLAIATDSTLSKWKFLSPLISANCVNDQTERPQVYLHNGKYYIFTISHRTT 3333-----------1111-----------2222-----------%%%%-------1111 FAAGVDGPDGVYGFVGDGIRSDFQPMNYGSGLTMGNPTDLNTAAGTDFDPSPDQNPRAFQ -2222---------------------%%%%--------3333--------1111--1111 SYSHYVMPGGLVESFIDTVENRRGGTLAPTVRVRIAQNASAVDLRYGNGGLGGYGDIPAN ------2222--------%%%%-------------!!!!---1111-iiii--------- RADVNIAGFIQD ----3333---- ------------------------------------------------------------ >3-PHOSPHOINOSITIDE DEPEND; SWP:Q9UPJ7; PDB:1W1HA; SNIEQYIHDLDSNSFELDLQFSEDEKRLLLEKQAGGNPWHQFVENNLILKMGPVDKRKGL ---1111-----------------------------1111--iiii-----------!!! 
FARRRQLLLTEGPHLYYVDPVNKVLKGEIPWSQELRPEAKNFKTFFVHTPNRTYYLMDPS !------------------------------1111-------------1111-----111 GNAHKWCRKIQEVWRQRYQSHPDAAVQ 1-------------------1111--- >PHOSPHATIDYLINOSITOL 3-KI; SWP:P35169; PDB:1W1NA; NELDVPEQVDKLIQQATSIERLCQHYIGWCPFW -----3333-------------3333------- >CYTOKININ DEHYDROGENASE 1; SWP:Q9T0N8; PDB:1W1OA; ALALDGKLRTDSNATAAASTDFGNITSALPAAVLYPSSTGDLVALLSAANSTPGWPYTIA 3333----------------3333------------------------------------ FRGRGHSLMGQAFAPGGVVVNMASLGDAAAPPRINVSADGRYVDAGGEQVWIDVLRASLA ---------11112222------1111---------1111-----1111----------- RGVAPRSWTDYLYLTVGGTLSNAGISGQAFRHGPQISNVLEMDVITGHGEMVTCSKQLNA ------------------1111---1111----3333--------1111-----1111-- DLFDAVLGGLGQFGVITRARIAVEPAPARARWVRFVYTDFAAFSADQERLTAPRSFGPMS -----2222-----------------------------3333------------------ YVEGSVFVNQSLATDLANTGFFTDADVARIVALAGERNATTVYSIEATLNYAAVDQELAS -------3333-----3333---------------------------------------- VLGTLSYVEGFAFQRDVAYAAFLDRVHGEEVALNKLGLWRVPHPWLNMFVPRSRIADFDR -1111--2222----------------------1111-------------3333------ GVFKGILQGTDIVGPLIVYPLNKSMWDDGMSAATPSEDVFYAVSLLFSSNDLARLQEQNR ----1111-------------3333-3333------------------------------ RILRFCDLAGIQYKTYLARHTDRSDWVRHFGAAKWNRFVEMKNKYDPKRLLSPGQDIFN ---------------------------------------------1111--3333---- >STRUCTURAL MAINTENANCE OF; SWP:SMC1_YEAST; PDB:1W1WA; GRLVGLELSNFKSYRGVTKVGFGESNFTSIIGPNGSGKSNMMDAISFVLGVLKDLIYRGP ---------------------!!!!-------22223333-----------1111----- QSAYVKAFYQKGNKLVELMRIISRNGDTSYKIDGKTVSYKDYSIFLENENILIKAKNFLV ----------------------1111-----iiii--3333------------------- FQGDVEQIAAQSPVELSRMFTFDYVSDHLDAIYRELTGNASLTKYHATPPLKRFKDMEYL 22223333---3333----------------------------------------3333- SGGEKTVAALALLFAINSYQPSPFFVLDEVDAALDITNVQRIAAYIRRHRNPDLQFIVIS ---------------3333---------1111------------------1111------ LKNTMFEKSDALVGVYRQQQENSSKIITLDLSNY -33331111---------1111--------1111 >Sister chromatid cohesion; SWP:Q12158; PDB:1W1WE; 
KAIVQMAKILRKELSEEKEVIFTDVLKSQAKREASRGFFDILSLATEGCIGLSQTEAFGN 1111-------1111-----3333------------------------------------ IKIDAKPALFE -----3333-- >PHOSPHOSERINE AMINOTRANSF; SWP:Q9RME2; PDB:1W23A; VKQVFNFNAGPSALPKPALERAQKELLNFNDTQMSVMELSHRSQSYEEVHEQAQNLLREL --------------3333------1111%%%%--3333-1111---------------11 LQIPNDYQILFLQGGASLQFTMLPMNLLTKGTIGNYVLTGSWSEKALKEAKLLGETHIAA 11-1111-------3333----------2222---------------------------- STKANSYQSIPDFSEFQLNENDAYLHITSNNTIYGTQYQNFPEINHAPLIADMSSDILSR -3333------3333---1111-------------------------------------- PLKVNQFGMIYAGAQKNLGPSGVTVVIVKKDLLNTKVEQVPTMLQYATHIKSDSLYNTPP --1111----------------------1111----22223333-------%%%%----- TFSIYMLRNVLDWIKDLGGAEAIAKQNEEKAKIIYDTIDESNGFYVGHAEKGSRSLMNVT ----------------------------------------iiii-----3333------- FNLRNEELNQQFLAKAKEQGFVGLNGHRSVGGCRASIYNAVPIDACIALRELMIQFKENA --------------------------3333-------11113333--------------- >STALKED-CELL DIFFERENTIAT; SWP:Q9A5I5; PDB:1W25A; SARILVVDDIEANVRLLEAKLTAEYYEVSTAMDGPTALAMAARDLPDIILLDVMMPGMDG --------------------3333------------------------------------ FTVCRKLKDDPTTRHIPVVLITALDGRGDRIQGLESGASDFLTKPIDDVMLFARVRSLTR ------------1111-----------------1111----------------------- FKLVIDELRQREASGRRMGVIAGAAARLDGLGGRVLIVDDNERQAQRVAAELGVEHRPVI -------------------1111------------------------------------- ESDPEKAKISAGGPVDLVIVNAAAKNFDGLRFTAALRSEERTRQLPVLAMVDPDDRGRMV --3333---1111-------------------------3333---------1111----- KALEIGVNDILSRPIDPQELSARVKTQIQRKRYTDYLRNNLDHSLELAVTDQLTGLHNRR --1111--------------------------------------3333------------ YMTGQLDSLVKRATLGGDPVSALLIDIDFFKKINDTFGHDIGDEVLREFALRLASNVRAI -----------1111-------------3333-------------------------111 DLPCRYGGEEFVVIMPDTALADALRIAERIRMHVSGSPFTVAHGREMLNVTISIGVSATA 1-----1111--------3333-------------------------------------- GEGDTPEALLKRADEGVYQAKASGRNAVVGKAAH 1111----------------3333---------- >PHENYLALANINE AMMONIA-LYA; SWP:P24481; PDB:1W27A; 
EDPLYWGIAAEAMTGSHLDEVKKMVAEYRKPVVKLGGETLTISQVAAISARDGSGVTVEL -1111-----1111------------1111------------------------------ SEAARAGVKASSDWVMDSMNKGTDSYGVTTGFGATSHRRTKQGGALQKELIRFLNAGIFG 3333-----------3333----------------------------------------- NGSDNTLPHSATRAAMLVRINTLLQGYSGIRFEILEAITKFLNQNITPCLPLRGTITDLV --1111----------------1111----3333-----------------------333 PLSYIAGLLTGRPNSKAVGPTGVILSPEEAFKLAGVEGGFFELQPKEGLALVNGTAVGSG 3----------1111---1111---------------------2222------------- MASMVLFEANILAVLAEVMSAIFAEVMQGKPEFTDHLTHKLKHHPGQIEAAAIMEHILDG ---------------------------------------1111----------------- SAYVKAAQKLHEMDPLQKPKQDRYALRTSPQWLGPQIEVIRSSTKMIEREINSVNDNPLI 3333------------------3333---3333----------------1111------- DVSRNKAIHGGNFQGTPIGVSMDNTRLAIAAIGKLMFAQFSELVNDFYNNGLPSNLSGGR -1111-----33333333--------------------------3333iiii2222---- NPSLDYGFKGAEIAMASYCSELQFLANPVTNHVQSAEQHNQDVNSLGLISSRKTSEAVEI 1111-!!!!-------------------1111----%%%%-------------------- LKLMSTTFLVGLCQAIDLRHLEENLKSTVKNTVSSVAKRVLTMGVNGELHPSRFCEKDLL ------------------------------------------------------------ RVVDREYIFAYIDDPCSATYPLMQKLRQTLVEHALKNGDNERNLSTSIFQKIATFEDELK ------11113333--1111----------------!!!!--33333333---------- ALLPKEVESARAALESGNPAIPNRIEECRSYPLYKFVRKELGTEYLTGEKVTSPGEEFEK -------------1111-----3333-1111-------1111----1111--3333---- VFIAMSKGEIIDPLLECLESWNGAPLPIC -------1111----1111---------- >INOSITOL-TRISPHOSPHATE 3-; SWP:P23677; PDB:1W2FA; SWVQLAGHTGSFKAAGTSGLILKRCSEPERYCLARLADALRGCVPAFHGVVERDGESYLQ 3333---------------------------------1111-----------%%%%---- LQDLLDGFDGPCVLDCKGVRTYLEEELTKARERPKLRKDYKKLAVDPEAPTEEEHAQRAV --1111---------------------------------------1111----------- TKPRYQWREGISSSTTLGFRIEGIKKADGSCSTDFKTTRSREQVLRVFEEFVQGDEEVLR 3333--------3333---------1111-----1111-------------iiii----- RYLNRLQQIRDTLEVSEFFRRHEVIGSSLLFVHDHCHRAGVWLIDFGKTTPLPDGQILDH ----------------3333-------------1111----------------------- RRPWEEGNREDGYLLGLDNLIGILASLAER 
--------------------------1111 >ACYLPHOSPHATASE; SWP:P84142; PDB:1W2IA; AIVRAHLKIYGRVQGVGFRWSMQREARKLGVNGWVRNLPDGSVEAVLEGDEERVEALIGW -------------------------------------1111------------------1 AHQGPPLARVTRVEVKWEQPKGEKGFRIVG 111-1111---------------------- >CYTOCHROME OXIDASE SUBUNI; SWP:Q9F3S9; PDB:1W2LA; MPLAELGARLYREKACFSCHSIDGSRLVGPSFKGLYGSTRTFEDGTTAVADENYLRESIL -----------1111----------------2222------1111--------------- QPGAKVVQGYPNVMPASYASLSEREVAALIEFIKQQQ 2222--2222----3333------------------- >CONGLUTIN; SWP:Q647G9; PDB:1W2QA; GPMRRERGRQGDSSSCERQVDRVNLKPCEQHIMQRIMGEQEQYDSYDIRSTRSSDQQQRC -3333---------3333-------------------3333-----3333-3333----- CDELNEMENTQGCMCEALQQIMENQCDRLQDRQMVQQFKRELMSLPQQCNFRAPQRCDLD ----------1111-----------1111-----------------1111--------33 VSGGRCS 33-3333 >BETA FRUCTOSIDASE; SWP:O33833; PDB:1W2TA; LFKPNYHFFPITGWMNDPNGLIFWKGKYHMFYQYNPRKPEWGNICWGHAVSDDLVHWRHL -----------------------iiii--------------------------------- PVALYPDDETHGVFSGSAVEKDGKMFLVYTYYRDPTHNKGEKETQCVVMSENGLDFVKYD --------------------iiii-----------1111-------------------11 GNPVISKPPEEGTHAFRDPKVNRSNGEWRMVLGSGKDEKIGRVLLYTSDDLFHWKYEGAI 11---------------------%%%%--------%%%%--------------------- FEDETTKEIDCPDLVRIGEKDILIYSITSTNSVLFSMGELKEGKLNVEKRGLLDHGTDFY --1111----------------------------------%%%%---------------- AAQTFFGTDRVVVIGWLQSWLRTGLYPTKREGWNGVMSLPRELYVENNELKVKPVDELLA ------------------33331111-3333--------------%%%%-------3333 LRKRKVFETAKSGTFLLDVKENSYEIVCEFSGEIELRMGNESEEVVITKSRDELIVDTTR ---------------------------------------1111------!!!!----111 SGVSGGEVRKSTVEDEATNRIRAFLDSCSVEFFFNDSIAFSFRIHPENVYNILSVKSNQV 11111----------------------------%%%%----------------------- KLEVFELENIWL ------------ >5-METHYLTHIORIBOSE-1-PHOS; SWP:Q06489; PDB:1W2WA; SLEAIVFDRSEPENVSVKVLDQLLLPYTTKYVPIHTIDDGYSVIKSQVRGAPAIAIVGSL ----------1111------3333------------------------------------ SVLTEVQLIKHNPTSDVATLYSLVNWESTKTVLNKRLDFLLSSRPTAVNLSNSLVEIKNI 
-----------11113333--3333----------------------------------- LKSSSDLKAFDGSLYNYVCELIDEDLANNKGDNGAKYLIDVLQKDGFKDEFAVLTICNTG 1111-------------------------------------------------------1 SLATSGYGTALGVIRSLWKDSLAKTDK 111------------------------ >Methylthioribose-1-phosph; SWP:Q06489; PDB:1W2WB; CPRGHVFPLETRPYNQGSRLTAYELVYDKIPSTLITDSSIAYRIRTSPIPIKAAFVGADR -----------------------------------1111--------------------- IVRNGDTANKIGTLQLAVICKQFGIKFFVVAPKTTIDNVTETGDDIIVEERNPEEFKVVT -1111--------------------------3333------3333------3333----- GTVINPENGSLILNESGEPITGKVGIAPLEINVWNPAFDITPHELIDGIITEEGVFTKNS -------------1111----------3333----------3333-----1111----11 SGEFQLESLF 11---3333- >DEOXYURIDINE 5'-TRIPHOSPH; SWP:Q9PMK9; PDB:1W2YA; MTNIEILENMLKLQQKLNDETNGLNWENGYTKEGKLISWRRCIYMECAELIDSFTWKHWK ------------------------3333--1111----------------1111------ NISSLTNWENVRIEIVDIWHFILSLLLEEYNNKDFKAIATEVNAVSVFQDFCKEEEYPNE 1111---------------------------------------------1111-----11 GDIYGILNDIELIIHKCSGFGFNLGELLSTYFTLAIKCGLNLEILYKTYIGKNVLNIFRQ 11---------------------------------1111--------------------- NNGYKDGSYKKTWNGKEDNEVLAQILEQELDFDTIYKKLEECYKKA --3333------iiii3333----------------------1111 >PYRR BIFUNCTIONAL PROTEIN; SWP:P65941; PDB:1W30A; ESRELMSAANVGRTISRIAHQIIEKTALDDPVGPDAPRVVLLGIPTRGVTLANRLAGNIT --------------------------1111--1111---------3333----------- EYSGIHVGHGALDITLYRDPLASTSIPAGGIDDALVILVDDVLYSGRSVRSALDALRDVG -------------1111---------22222222--------------------3333-- RPRAVQLAVLVDRGHRELPLRADYVGKNVPTSRSESVHVRLREHDGRDGVVISR -------------------------------1111-----3333---------- >ENDO-1,4-BETA-XYLANASE A ; SWP:P14768; PDB:1W32A; GLASLADFPIGVAVAASGGNADIFTSSARQNIVRAEFNQITAENIMKMSYMYSGSNFSFT 3333--------------11111111----------------------1111!!!!---- NSDRLVSWAAQNGQTVHGHALVWHPSYQLPNWASDSNANFRQDFARHIDTVAAHFAGQVK ------------------------3333-33331111-----------------2222-- SWDVVNEALFDSADDPDGRGSANGYRQSVFYRQFGGPEYIDEAFRRARAADPTAELYYND 
----------33331111---iiii-------------------------1111------ FNTEENGAKTTALVNLVQRLLNNGVPIDGVGFQMHVMNDYPSIANIRQAMQKIVALSPTL -1111---------------1111------------1111-------------------- KIKITELDVRLNNPYDGNSSNNYTNRNDCAVSCAGLDRQKARYKEIVQAYLEVVPPGRRG ------------------------1111----3333------------------2222-- GITVWGIADPDSWLYTHQNLPDWPLLFNDNLQPKPAYQGVVEALSG -------33331111-%%%%-------1111--------------- >BBCRASP-1; SWP:O50957; PDB:1W33A; ETIASELKAIGKELEDQKKEENIQIAKIAKEKFDFLSTFKVGPYDLIDEDIQMKIKRTLY -3333----------------------------3333----------------------- SSLDYKKENIEKLKEILEILKKNSEHYNIIGRLIYHISWGIQFQIEQNLELIQNGVENLS ----------------------3333-------------------------33331111- QEESKSLLMQIKSNLEIKQRLKKTLNETLKVYNQNTQDNEKILAEHFNKYYKDFDTLKPA ----------------------------------------------------3333---- F - >Exodeoxyribonuclease V be; SWP:P08394; PDB:1W36B; MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAFPRPLTVEELL ---------1111----------2222---------------------------1111-- VVTFTEAATAELRGRIRSNIHELRIACLRETTDNPLYERLLEEIDDKAQAAQWLLLAERQ ----------------------------------------3333-3333----------1 MDEAAVFTIHGFCQRMLNLNAFESGMLFEQQLIEDESLLRYQACADFWRRHCYPLPREIA 111----------------3333------------------------------------- QVVFETWKGPQALLRDINRYLQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAV --1111------------------------------------------------------ GELDALIESSGIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKPRHPLFEAIDQL -------------1111--3333--------------------3333-------111111 LAEPLSIRDLVITRALAEIRETVAREKRRRGELGFDDMLSRLDSALRSESGEVLAAAIRT 11-------------------------------3333------11111111--------- RFPVAMIDEFQDTDPQQYRIFRRIWHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSE --------3333-3333--------------------1111---3333------------ VHAHYTLDTNWRSAPGMVNSVNKLFSQTDDAFMFREIPFIPVKSAGKNQALRFVFKGETQ -----------------------------11113333-------3333------------ PAMKMWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVRASDISV -------------3333-----------------3333---------------1111--- 
LVRSRQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLLQAVMTPERENTLRSAL ---------------1111----1111--1111--------------------------- ATSMMGLNALDIETLNNDEHAWDVVVEEFDGYRQIWRKRGVMPMLRALMSARNIAENLLA -3333----------------------------------------------------111 TAGGERRLTDILHISELLQEAGTQLESEHALVRWLSQHILEPDSNASSQQMRLESDKHLV 1-------------------------3333--------------3333------3333-- QIVTIHKSKGLEYPLVWLPFITNFRVQEQAFYHDRHSFEAVLDLNAAPESVDLAEAERLA ---3333------------1111------------------------------------- EDLRLLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDAAGLRTC -----------------------------------1111-------------1111---- IEALCDDDIAWQTAQTGDNQPWQVNDVSTAELNAKTLQRLPGDNWRVTSYSGLQQRGHGI 3333-----------------------------------------------------222 AQDLMPRLDVDAAGVASVVEEPTLTPHQFPRGASPGTFLHSLFEDLDFTQPVDPNWVREK 2-------1111------------1111----3333------------------------ LELGGFESQWEPVLTEWITAVLQAPLNETGVSLSQLSARNKQVEMEFYLPISEPLIASQL -------------------------------3333--1111----------------333 DTLIRQFDPLSAGCPPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQ 3---11111111-----------------------------------------1111--- AMAAAMQAHRYDLQYQLYTLALHRYLRHRIADYDYEHHFGGVIYLFLRGVDKEHPQQGIY ------1111-----------------------1111----------------------- TTRPNAGLIALMDEMFAG ------------3333-- >Exodeoxyribonuclease V ga; SWP:P07648; PDB:1W36C; MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTLSQKFGIAANI --------3333------------------------------------------------ DFPLPASFIWDMFVRVLPEIPKESAFNKQSMSWKLMTLLPQLLEREDFTLLRHYLTDDSD ----------------------------3333-33333333------3333--------- KRKLFQLSSKAADLFDQYLVYRPDWLAQWETGHLVEGLGEAQAWQAPLWKALVEYTHQLG 3333-----------------3333---1111--------3333------------1111 QPRWHRANLYQRFIETLESATTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLF ----------------------------------------3333---3333--------- TNPCRYYWGDIKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLL ------------33331111-----1111----------------------------333 ASWGKLGRDYIYLLSDLESSQELDAFVDVTPDNLLHNIQSDILELENRAVAGVNIEEFSR 
3-----------1111---------------------------------------3333- SDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEEDPTLTPRDIIVMVADIDSYSP -------1111--------------------------------3333------3333--- FIQAVFGSAPADRYLPYAISDRRARQSHPVLEAFISLLSLPDSRFVSEDVLALLDVPVLA --------------------------------------3333--------3333------ ARFDITEEGLRYLRQWVNESGIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESA 1111-3333----------------------1111-----------------3333---- QGEWQSVLPYDESSGLIAELVGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFL --------------3333-------------------------3333------------- PDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDELAQRLDQERISQRFLAGPVN --------------------------------3333-------1111---1111------ ICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQKPKRGDRSRRDDDRYLFLEAL --------------------2222----------3333---3333--------------- ISAQQKLYISYIGRSIQDNSERFPSVLVQELIDYIGQSHYLPGDEALNCDESEARVKAHL ------------------------3333-------------------------------- TCLHTRMPFDPQNYQPGERQSYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRF ------11111111--------3333-3333---------------------3333-333 WAHPVRAFFQMRLQVNFRTEDSEIPDTEPFILEGLSRYQINQQLLNALVEQDDAERLFRR 3-3333-----------------------------------------1111-3333---- FRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGVQITGWLPQVQ -1111----3333-----------------3333------------%%%%---------- PDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFLRKDGEWRFPPLAAEQALHYL ------------3333------------------------%%%%-------3333----- SQLIEGYREGMSAPLLVLPESGGAWLKTCYDAQNDAMLDDDSTLQKARTKFLQAYEGNMM -------1111------3333----------2222----3333----------------- VRGEGDDIWYQRLWRQLTPETMEAIVEQSQRFLLPLFRFNQS --333333331111---3333--------------------- >Exodeoxyribonuclease V al; SWP:P04993; PDB:1W36D; KLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGHVCLPLSRLENN -------3333----3333-------------------------1111----3333--11 EASHPLLATCVSEIGELQNWEECLLASQAVSRGDEPTPMILCGDRLYLNRMWCNERTVAR 11----------------3333-3333--------------------3333--------1 FFNEVNHAIEVDEALLAQTLDKLFPVSDEINWQKVAAAVALTRRISVISGGPGTGKTTTV 
111--------3333----3333----------------1111-------1111------ AKLLAALIQMADGERCRIRLAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTLH ------------------------------------3333-------------------- RLLHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQLASVEAGAVLGD ---2222----------3333-----------------------111111112222---- ICAYANAGFTAERARQLSRLTGTHVPAGTGTEAASLRDSLCLLQKSYRFGSDSGIGQLAA -1111---------------------------3333------------------------ AINRGDKTAVKTVFQQDFTDIEKRLLQSGEDYIAMLEEALAGYGRYLDLLQARAEPDLII ----------------------------------------1111---------------- QAFNEYQLLCALREGPFGVAGLNERIEQFMQQKRQPSRLPEHETTWAMTVHKSQGSEFDH -------------------------3333-------------------3333-------- AALILPSQRTPVVTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGLAALFSSR ---------3333-------1111------------33331111------3333----- >UDP-N-ACETYLGLUCOSAMINE--; SWP:O15294; PDB:1W3BA; GPMELAHREYQAGDFEAAERHCMQLWRQEPDNTGVLLLLSSIHFQCRRLDRSAHFSTLAI -------------------------------3333------------------------- KQNPLLAEAYSNLGNVYKERGQLQEAIEHYRHALRLKPDFIDGYINLAAALVAAGDMEGA --1111------------------------------11113333---------------- VQAYVSALQYNPDLYCVRSDLGNLLKALGRLEEAKACYLKAIETQPNFAVAWSNLGCVFN -----1111-11111111-------1111---------------11113333------33 AQGEIWLAIHHFEKAVTLDPNFLDAYINLGNVLKEARIFDRAVAAYLRALSLSPNHAVVH 33--------------------3333-----------3333---333333331111---- GNLACVYYEQGLIDLAIDTYRRAIELQPHFPDAYCNLANALKEKGSVAEAEDCYNTALRL -------11113333-------3333----3333-----3333----3333--------- CPTHADSLNNLANIKREQGNIEEAVRLYRKALEVFPEFAAAHSNLASVLQQQGKLQEALM -------------3333---3333----------1111-3333------11113333--- HYKEAIRISPTFADAYSNMGNTLKEMQD -----1111---3333-------1111- >HEMOLYTIC LECTIN FROM LAE; SWP:Q7Z8V1; PDB:1W3FA; DIYIPPEGLYFRLLGFASRQVIFARNSPSPDVGLSPVNDQATDQYFSLIYGTGEHAGLYA -----1111----------------------------11111111------!!!!----- IKSKATGKVLFSRRPAEPYVGQIDGDGRYPDNWFKIEPGKTYLSKYFRLVQPSTGTALVS ----------------------------3333-------!!!!--------1111----- RTHLQPYFWNHPQTEVFDDQYFTFLFEDMSIDKIEYDLKDGRILSSTPNVLATQTLENTS 
-----------3333---------------------3333-------------------- SQTQEMSFNLSQTLTQTSTFAYTAGFTIAVGTAFKAGVPIFAETEFKVDISVDNQWNWGE ----------------------------2222------------------------2222 ENTFSKTYTATFSVRAGPGETVKAVSTVDSGIINVPFTAYLSSKSTGFEVTTEGIWRGVS ------------------------------------------------------------ SWDLRHTLTSVTA ------------- >2-KETO-3-DEOXY GLUCONATE ; SWP:Q97U28; PDB:1W3IA; PEIITPIITPFTKDNRIDKEKLKIHAENLIRKGIDKLFVNGTTGLGPSLSPEEKLENLKA -----------1111--------------1111-------33333333------------ VYDVTNKIIFQVGGLNLDDAIRLAKLSKDFDIVGIASYAPYYYPRMSEKHLVKYFKTLCE -------------------------3333------------------------------- VSPHPVYLYNYPTATGKDIDAKVAKEIGCFTGVKDTIENIIHTLDYKRLNPNMLVYSGSD ----------3333-----------------------------------1111------- MLIATVASTGLDGNVAAGSNYLPEVTVTIKKLAMERKIDEALKLQFLHDEVIEASRIFGS -----------------3333--------------------------------1111--3 LSSNYVLTKYFQGYDLGYPRPPIFPLDDEEERQLIKKVEGIRAKLVELKILKE 333------------------------------------------1111---- >NIMA-RELATED PROTEIN; SWP:NA; PDB:1W3OA; SDFYDPRERDPSVSRRPQNRQSDEWIRELLLRGTIARVATLWQGEDGAAFPFITPLAYAY -1111----1111---%%%%-----------------------1111------------- RPEQGDLVYHTNVVGRLRANAGQGHPATLEVSEIGQFLPSNSPLELSVQYRSVMVFGTAR -3333------------------------------------1111--------------- VLAGEDARAALTTLSERVFPGLKVGETTRPISEDDLKRTSVYSLSIDRWSGKENWAEQAI ------------------22222222---------1111--------------------- QEEDWPALGPEWLG -1111---3333-- >ANNEXIN A8; SWP:P13928; PDB:1W3WA; SSSHFNPDPDAETLYKAMKGIGTNEQAIIDVLTKRSNTQRQQIAKSFKAQFGKDLTETLK ------------------------------1111-------------------------- SELSGKFERLIVALMYPPYRYEAKELHDAMKGLGTKEGVIIEILASRTKNQLREIMKAYE ------------1111-------------------------------------------- EDYGSSLEEDIQADTSGYLERILVCLLQGSRDDVSSFVDPALALQDAQDLYAAGEKIRGT ------------------------------------------------------------ DEMKFITILCTRSATHLLRVFEEYEKIANKSIEDSIKSETHGSLEEAMLTVVKCTQNLHS ------------------------------------------------------------ 
YFAERLYYAMKGAGTRDGTLIRNIVSRSEIDLNLIKCHFKKMYGKTLSSMIMEDTSGDYK ------------------------1111-----------------3333----------- NALLSLVGSDP ----------- >50S RIBOSOMAL PROTEIN L30; SWP:P29160; PDB:1W41A; MVDFAFELRKAQDTGKIVMGARKSIQYAKMGGAKLIIVARNARPDIKEDIEYYARLSGIP --------------------------------------1111------------------ VYEFEGTSVELGTLLGRPHTVSALAVVDPGASRILALGGK ------------1111------------!!!!3333---- >NTPASE P4; SWP:Q94M05; PDB:1W44A; MIHLYDAKSFAKLRAAQYAAFHTDAPGSWFDHTSGVLESVEDGTPVLAIGVESGDAIVFD ------------------------2222--------11112222------3333-----1 KNAQRIVAYKEKSVKAEDGSVSVVQVENGFMKQGHRGWLVDLTGELVGCSPVVAEFGGHR 111------------1111-------iiii------------!!!!---------iiii- YASGMVIVTGKGNSGKTPLVHALGEALGGKDKYATVRFGEPLSGYNTDFNVFVDDIARAM ---------------------------!!!!----------2222--3333--------- LQHRVIVIDSLKNVIISRGAFDLLSDIGAMAASRGCVVIASLNPTSNDDKIVELVKEASR ----------1111---------------------------------3333--------- SNSTSLVISTDVDGEWQVLTRTGEGLQRLTHTLQTSYGEHSVLTIHTSQASGKAIQTVIK -----------2222-------2222------------%%%%----------------33 NDEL 33-- >DIHYDROLIPOYLLYSINE-RESID; SWP:P11961; PDB:1W4EA; NRRVIAMPSVRKYAREKGVDIRLVQGTGKNGRVLKEDIDAWLAGG ------1111---------3333---------------------- >DIHYDROLIPOYLLYSINE-RESID; SWP:P11961; PDB:1W4GA; NRRVIAMPSVRKWAREKGVDIRLVQGTGKNGRVLKEDIDAFLAGG -------------------3333---------------------- >PYRUVATE DEHYDROGENASE E2; SWP:Q8ZUR6; PDB:1W4IA; GSREVAAMPAARRLAKELGIDLSKVKGTGPGGVITVEDVKRYAEETAKATAPAPAPKAVE --------------------3333----2222--3333---------------------- KA -- >THYMIDINE KINASE; SWP:P04183; PDB:1W4RA; RGQIQVILGPMFSGKSTELMRRVRRFQIAQYKCLVIKYAKDTRYSSSFCTHDRNTMEALP --------------------------1111-------1111-3333-------------- ACLLRDVAQEALGVAVIGIDEGQFFPDIVEFCEAMANAGKTVIVAALDGTFQRKPFGAIL --3333--3333--------33331111-------1111----------1111--!!!!- NLVPLAESVVKLTAVCMECFREAAYTKRLGTEKEVEVIGGADKYHSVCRLCYFK -3333----------------------------------3333----3333--- >POLYBROMO 1 PROTEIN; SWP:Q90941; PDB:1W4SA; 
MYHVGDYVYVEPAEANLQPHIVCIERLWEDSAGEKWLYGCWFYRPNETFHLATRKFLEKE --2222-------2222------------1111----------1111---1111--2222 VFKSDYYNKVPVSKILGKCVVMFVKEYFKLCPENFRDEDVYVCESRYSAKTKSFKKIKLW ----------3333--------1111-----22223333--------------------- TMPVSSVRFVPRDVPLPVVRVASVFA -------------------------- >ARYLAMINE N-ACETYLTRANSFE; SWP:NA; PDB:1W4TA; HMTPLTPEQTHAYLHHIGIDDPGPPSLANLDRLIDAHLRRVAFENLDVLLDRPIEIDADK -----------------------------------------------1111--------- VFAKVVEGSRGGYCFELNSLFARLLLALGYELELLVARVRWGLPDDAPLTQQSHLMLRLY ------------3333---------1111----------22223333------------- LAEGEFLVDVGFGSANPPRALPLPGDEADAGQVHCVRLVDPHAGLYESAVRGRSGWLPLY 1111--------1111---------1111-----------1111-------1111----- RFDLRPQLWIDYIPRNWYTSTHPHSVFRQGLKAAITEGDLRLTLADGLFGQRAGNGETLQ -------3333----------11111111-------!!!!----!!!!------------ RQLRDVEELLDILQTRFRLRLDPASEVPALARRLAGLI --------------1111---------------3333- >THIOREDOXIN, MITOCHONDRIA; SWP:NA; PDB:1W4VA; STTFNIQDGPDFQDRVVNSETPVVVDFHAQWCGPCKILGPRLEKMVAKQHGKVVMAKVDI ---------------1111---------11113333----------1111--------33 DDHTDLAIEYEVSAVPTVLAMKNGDVVDKFVGIKDEDQLEAFLKKLIG 33-------------------iiii----------------------- >PHENYLACETONE MONOOXYGENA; SWP:Q47PU3; PDB:1W4XA; RRQPPEEVDVLVVGAGFSGLYALYRLRELGRSVHVIETAGDVGGVWYWNRYPGARCDIES ---------------3333-------1111-------------1111---2222----33 IEYCYSFSEEVLQEWNWTERYASQPEILRYINFVADKFDLRSGITFHTTVTAAAFDEATN 33--------------------------------------1111------------1111 TWTVDTNHGDRIRARYLIMASGQLSVPQLPNFPGLKDFAGNLYHTGNWPHEPVDFSGQRV -----1111----------------------2222--------1111-------2222-- GVIGTGSSGIQVSPQIAKQAAELFVFQRTPHFAVPARNAPLDPEFLADLKKRYAEFREES ---------------3333----------------------------------------- RNTPGGTHRYQGPKSALEVSDEELVETLERYWQEGGPDILAAYRDILRDRDANERVAEFI --------------1111-3333---------------1111--1111------------ RNKIRNTVRDPEVAERLVPKGYPFGTKRLILEIDYYEMFNRDNVHLVDTLSAPIETITPR -------------3333-----------------3333--1111-------------333 
GVRTSEREYELDSLVLATGFDALTGALFKIDIRGVGNVALKEKWAAGPRTYLGLSTAGFP 3--------------------11113333----2222------1111---iiii-2222- NLFFIAGPGSPSALSNMLVSIEQHVEWVTDHIAYMFKNGLTRSEAVLEKEDEWVEHVNEI ------2222!!!!---------------------1111--------------------- ADETLYPMTASWYTGANVPGKPRVFMLYVGGFHRYRQICDEVAAKGYEGFVLT 111133333333-----2222---------------------11112222--- >Pancreatic lipase-related; SWP:Q95KP4; PDB:1W52X; KEVCYTPLGCFSDDKPWAGTLQRPLKSLPWSPEEVNTRFLLYTNKNPDSYQLITARDVAT -------------------3333-------3333--------3333-------3333333 IKSSNFQSSRKTHFVIHGFRDRGEDSWPSDMCKKILQVETTNCISVDWSSGAKAEYTQAV 3-----3333-----------------------3333----------3333---3333-- QNIRIVGAETAYLIQQLLTELSYNPENVHIIGHSLGAHTAGEAGRRLEGRVGRVTGLDPA -----------------------1111-------------------%%%%---------- EPCFQDASEEVRLDPSDAQFVDVIHTDASPMLPSLGFGMSQKVGHMDFFPNGGKQMPGCK 2222---3333--1111--------------------------------iiii--2222- RSFIDINGIWQGAQDYLACNHLKSFEYYSSSILNPDGFLAYPCDSYDKFQENGCFPCPAG ----3333-2222------1111----------1111----------------------- GCPKMGHYADQYKEKTSAVEQTFFLNTGESGDYTSWRYRVSITLAGSGKANGYLKVTLRG -----1111--1111--------------!!!!--------------------------- SNGNSKQYEIFKGSLQPDSSYTLDVDVNFIIGKIQEVKFVWNKTVLNLSKPQLGASRITV ------------------------------------------------------------ QSGADGTEYKFCGSGTVQDNVEQSLYPC ---------------------------- >PHOSPHOSERINE PHOSPHATASE; SWP:P40399; PDB:1W53A; MDFREVIEQRYHQLLSRYIAELTETSLYQAQKFSRKTIEHQIPPEEIISIHRKVLKELYP -------------------------------------1111-3333------------11 SLPEDVFHSLDFLIEVMIGYGMAY 11---------------------- >ISPD/ISPF BIFUNCTIONAL EN; SWP:Q9PM68; PDB:1W55A; SEMSLIMLAAGNSTRFNTKVKKQFLRLGNDPLWLYATKNLSSFYPFKKIVVTSSNITYMK -------------3333---------!!!!----------1111----------3333-1 KFTKNYEFIEGGDTRAESLKKALELIDSEFVMVSDVARVLVSKNLFDRLIENLDKADCIT 111-----------------3333----------1111----------3333-------- PALKVADTTLFDNEALQREKIKLIQTPQISKTKLLKKALDQNLEFTDDSTAIAAMGGKIW ----------iiii--3333----------------1111------------1111---- 
FVEGEENARKLTFKEDLKKLDLPTPSFEIFTGNGFDVHEFGENRPLLLAGVQIHPTMGLK ----3333----33331111---------------------------iiii--------- AHSDGDVLAHSLTDAILGAAGLGDIGELYPDTDMKFKNANSMELLKQAYDKVREIGFELI -!!!!------------1111--3333---1111-1111--------------------- NIDICVMAQSPKLKDFKQAMQSNIAHTLDLDEFRINVKATTTEKLGFIGRKEGMAVLSSV ------------1111--------------3333-------%%%%3333----------- NLKYFDWTR -----3333 >CELL DIVISION PROTEIN FTS; SWP:Q57816; PDB:1W5BA; LSPEDKELLEYLQQTKAKITVVGCGGAGNNTITRLKMEGIEGAKTVAINTDAQQLIRTKA -----------------------------------1111---------------1111-- DKKILIGKKLTRGLGAGGNPKIGEEAAKESAEEIKAAIQDSDMVFITCGLGGGTGTGSAP ------3333-----iiii----------------------------------3333--- VVAEISKKIGALTVAVVTLPFVMEGKVRMKNAMEGLERLKQHTDTLVVIPNEKLFEIVPN --------------------3333--------------3333-------3333------- MPLKLAFKVADEVLINAVKGLVELITKDGLINVDFADVKAVMNNGGLAMIGIGESDSEKR -3333------------------------------------------------------- AKEAVSMALNSPLLDVDIDGATGALIHVMGPEDLTLEEAREVVATVSSRLDPNATIIWGA ----------3333----------------1111------------11111111------ TIDENLENTVRVLLVITGVQSRIEFTDTGLKRKK ------------------3333---1111----- >PENICILLIN-BINDING PROTEI; SWP:P39844; PDB:1W5DA; DALSGQIDKILADHPALEGAMAGITVRSAETGAVLYEHSGDTRMRPASSLKLLTAAAALS -------------3333---------------------1111---!!!!----------- VLGENYSFTTEVRTDGTLKGKKLNGNLYLKGKGDPTLLPSDFDKMAEILKHSGVKVIKGN --1111-------------------------------3333--------1111------- LIGDDTWHDDMRLSPDMPWSDEYTYYGAPISALTASPNEDYDAGTVIVEVTPNQKEGEEP ----3333-----11111111--1111----------1111------------------- AVSVSPKTDYITIKNDAKTTAAGSEKDLTIEREHGTNTITIEGSVPVDANKTKEWISVWE --------------------------------2222------------------------ PAGYALDLFKQSLKKQGITVKGDIKTGEAPSSSDVLLSHRSMPLSKLFVPFMKLSNNGHA -------------1111------------3333---------3333-----1111-3333 EVLVKEMGKVKKGEGSWEKGLEVLNSTLPEFGVDSKSLVLRDGSGISHIDAVSSDQLSQL --------------------------3333---3333---------1111---------- LYDIQDQSWFSAYLNSLPVAGNPDRMVGGTLRNRMKGTPAQGKVRAKTGSLSTVSSLSGY 
------1111---1111------1111!!!!------1111---------2222------ AETKSGKKLVFSILLNGLIDEEDGKDIEDQIAVILANQ -------------------3333-----------1111 >CELL DIVISION PROTEIN FTS; SWP:O08398; PDB:1W5FA; LKIKVIGVGGAGNNAINRMIEIGIHGVEFVAVNTDLQVLEASNADVKIQIGENITRGLGA ----------------------------------33331111--------3333iiii-i GGRPEIGEQAALESEEKIREVLQDTHMVFITAGFGGGTGTGASPVIAKIAKEMGILTVAI iii----------------1111--------------3333---------1111------ VTTPFYFEGPERLKKAIEGLKKLRKHVDTLIKISNNKLMEELPRDVKIKDAFLKADETLH ----3333--------------3333--------------------3333---------- QGVKGISELITKRGYIRLTSRFARIESVMKDAGAAILGIGVGKGEHRAREAAKKAMESKL ------3333------3333-------------------------------------111 IEHPVENASSIVFNITAPSNIRMEEVHEAAMIIRQNSSEDADVKFGLIFDDEVPDDEIRV 1--3333----------1111-----------1111-1111------------1111--- IFIATRFPDEDKILFP --------3333---- >GENERAL CONTROL PROTEIN G; SWP:P03069; PDB:1W5JA; RMRQIEDRLEEILSKLYHICNELARIRRLLGER -----------------------------1111 >DELTA-AMINOLEVULINIC ACID; SWP:Q59643; PDB:1W5QA; ANRAYPYTRLRRNRRDDFSRRLVRENVLTVDDLILPVFVLDGVNQRESIPSMPGVERLSI --------3333----------------3333----------------1111-------- DQLLIEAEEWVALGIPALALFPVTPVEKKSLDAAEAYNPEGIAQRATRALRERFPELGII ----------1111----------3333-----33331111------------1111--- TDVCLCEFTTHGQCGILDDDGYVLNDVSIDVLVRQALSHAEAGAQVVAPSDMMDGRIGAI ----11111111-----1111---3333------------------------2222---- REALESAGHTNVRVMAYSAKYASAYYGPFRDANRATYQMDPANSDEALHEVAADLAEGAD ----11111111-----------11113333-3333---33333333-------1111-- MVMVKPGMPYLDIVRRVKDEFRAPTFVYQVSGEYAMHMGAIQNGWLAESVILESLTAFKR ------1111------------------------------1111---------------- AGADGILTYFAKQAAEQLRRG -------1111---------- >ARYLAMINE N-ACETYLTRANSFE; SWP:O86309; PDB:1W5RA; MDLGGYLTRIGLDGRPRPDLGTLHAIVAAHNRSIPFENLDPLLGIPVADLSAEALFAKLV ----------------------------------------1111---------------- DRRRGGYQYEHNGLLGYVLEELGFEVERLSGRVVWMRADDAPLPAQTHNVLSVAVPGADG ------3333-----------------------22221111------------------- 
RYLVDVGFGGQTLTSPIRLEAGPVQQTRHEPYRLTRHGDDHTLAAQVRGEWQPLYTFTTE --------1111------------------------!!!!------iiii---------- PRPRIDLEVGSWYVSTHPGSHFVTGLTVAVVTDDARYNLRGRNLAVHRSGATEHIRFDSA --3333----------1111-----------1111----!!!!----2222--------- AQVLDAIVNRFGIDLGDLAGRDVQARVAEVLDT -------------33332222------------ >ORC2; SWP:Q9YFU8; PDB:1W5TA; GLFKDRRVFDENYIPPELRVRRGEAEALARIYLNRLLSGAGLSDVNMIYGSIGRVGIGKT ---------1111----------------------1111-----------2222------ TLAKFTVKRVSEAAAKEGLTVKQAYVNAFNAPNLYTILSLIVRQTGYPIQVRGAPALDIL --------------1111--------3333------------3333----2222------ KALVDNLYVENHYLLVILDEFQSMLSSPRIAAEDLYTLLRVHEEIPSRDGVNRIGFLLVA ---------------------3333-33333333-----1111---1111---------- SDVRALSYMREKIPQVESQIGFKLHLPAYKSRELYTILEQRAELGLRDTVWEPRHLELIS ------------33333333--------------------------1111-3333----- DVYGEDKGGDGSARRAIVALKMACEMAEAMGRDSLSEDLVRKAVSENASIQTHELEALSI ----1111-------------------1111----------------------3333--- HELIILRLIAEATLGGMEWINAGLLRQRYEDASLTMYNVKPRGYTQYHIYLKHLTSLGLV ------------------------------------------------------1111-- DAKPSTTLFRLAPHLPADRLIEVVDNIIQAKMASG -----------11113333------------1111 >B-CELL MITOGEN; SWP:Q4DA80; PDB:1W61A; FKKSFTCIDMHTEGEAARIVTSGLPHIPGSNMAEKKAYLQENMDYLRRGIMLEPRGHDDM -----------iiii--------------------------------------------- FGAFLFDPIEEGADLGIVFMDTGGYLNMCGHNSIAAVTAAVETGIVSVPAKATNVPVVLD ---------2222-------1111----------------1111----2222-------- TPAGLVRGTAHLQSGTESEVSNASIINVPSFLYQQDVVVVLPKPYGEVRVDIAFGGNFFA 1111-------------------------------------------------------- IVPAEQLGIDISVQNLSRLQEAGELLRTEINRSVKVQHPQLPHINTVDCVEIYGPPTNPE --3333-----3333----------------------1111----------------333 ANYKNVVIFGNRQADRSPCGTGTSAKMATLYAKGQLRIGETFVYESILGSLFQGRVLGEE 3-----------------------------1111--2222-----1111----------- RIPGVKVPVTKDAEEGMLVVTAEITGKAFIMGFNTMLFDPTDPFKNGFTLKQY -2222-33331111------------------------1111-1111------ >ADAPTOR-RELATED PROTEIN C; SWP:P22892; PDB:1W63A; 
MPAPIRLRELIRTIRTARTQAEEREMIQKECAAIRSSFREEDNTYRCRNVAKLLYMHMLG ------------3333---------------------1111------------------- YPAHFGQLECLKLIASQKFTDKRIGYLGAMLLLDERQDVHLLMTNCIKNDLNHSTQFVQG -------3333--------3333------------------------3333--------- LALCTLGCMGSSEMCRDLAGEVEKLLKTSNSYLRKKAALCAVHVIRKVPELMEMFLPATK ------------------3333-3333-----------------111111111111---- NLLNEKNHGVLHTSVVLLTEMCERSPDMLAHFRKLVPQLVRILKNLIMSGYSPEHDVSGI ------------------------3333----------------------------iiii SDPFLQVRILRLLRILGRNDDDSSEAMNDILAQVATNTETSKNVGNAILYETVLTIMDIK ---3333----------------------------------------------------- SESGLRVLAINILGRFLLNNDKNIRYVALTSLLKTVQTDHNAVQRHRSTIVDCLKDLDVS -3333------------------------3333-3333-3333--3333---1111---- IKRRAMELSFALVNGNNIRGMMKELLYFLDSCEPEFKADCASGIFLAAEKYAPSKRWHID ---------1111---------------1111---------------------3333--- TIMRVLTTAGSYVRDDAVPNLIQLITNSVEMHAYTVQRLYKAILGDYSQQPLVQVAAWCI ---------1111----------------------------------------------- GEYGDLLVSGQCEEEEPIQVTEDEVLDILESVLISNMSTSVTRGYALTAIMKLSTRFTCT --------------------3333--------------3333--------3333------ VNRIKKVVSIYGSSIDVELQQRAVEYNALFKKYDHMRSALLERMPVMEKV ----------------------------------3333------------ >AP-1 complex subunit beta; SWP:P52303; PDB:1W63B; TDSKYFTTTKKGEIFELKAELNSDKKEKKKEAVKKVIASMTVGKDVSALFPDVVNCMQTD ------------------------3333-----------3333-------3333------ NLELKKLVYLYLMNYAKSQPDMAIMAVNTFVKDCEDPNPLIRALAVRTMGCIRVDKITEY ------------------------------3333-------------------3333--- LCEPLRKCLKDEDPYVRKTAAVCVAKLHDINAQMVEDQGFLDTLKDLISDSNPMVVANRV ---------------------------------3333---3333---------------- AALSEIAESHPSSNLLDLKAQSINKLLTALNECTEWAQIFILDCLGNYMPKDDREAQSIC -----------3333--------------------------------------------- ERVTPRLSHANSAVVLSAVKVLMKFMDYYATLLKKLAPPLVTLLSAEPEPQYVPLRNINL --3333--------------1111------------------------------------ IVQKRPEILKHEMKVFFVKYNDPIYVKLEKLDIMIRLASQANIAQVLAELKEYATEVDVD ----3333----------3333------------------------------3333---- 
FVRKAVRAIGRCAIKVEQSAERCVSTLLDLIQTKVNYVVQEAIVVIKDIFRKYPNKYESV ------------------------------1111-------------3333--------- IATLCENLDSDDEPEARAAMIWIVGEYAERSDNADELLESFLDGFHDESTQVQLQLLTAI 3333---------------------------------------------33333333--- VKLFLKKPTETQELVQQVLSLATQDSDNPDLRDRGYIYWRLLSTDPVAAKEVVLAEKPLI -------1111------------------------------------------------- SEETDLIEPTLLDELICYIGTLASVYHKPPNAFVEG --------3333-----22223333---3333---- >AP-1 complex subunit mu-1; SWP:P35585; PDB:1W63M; SASAVYVLDLKGKVLICRNYRGDVDMSEVEHFMPILMEKEEEGMLSPILAHGGVRFMWIK ---------------------------3333--------1111----------------- HNNLYLVATSKKNACVSLVFSFLYKVVQVFSEYFKELEEESIRDNFVIIYELLDELMDFG -----------------------------3333-------------3333--1111---- YPQTTDSKILQEFITQEGHKLETGVSWRSEGIKYRKNEVFLDVIEAVNLLVSANGNVLRS ------1111-------------------------------------------------- EIVGSIKMRVFLSGMPELRLGLNDKVELEDVKFHQCVRLSRFENDRTISFIPPDGEFELM ------------------------------------------------------------ SYRLNTHVKPLIWIESVIEKHSHSRIEYMVKAKSQFKRRSTANNVEIHIPVPNDADSPKF ------------------------------------3333-------------------- KTTVGSVKWVPENSEIVWSVKSFPGGKEYLMRAHFGLKPPISVKFEIPYFTTSGIQVRYL ----------2222------------------------------------3333------ KIIEKSGYQAIPWVRYITQNGDYQLRTQ ---------------------------- >AP-1 complex subunit sigm; SWP:P61967; PDB:1W63Q; MMRFMLLFSRQGKLRLQKWYLATSDKERKKMVRELMQVVLARKPKMCSFLEWRDLKVVYK ------------------------------------------3333-------------- RYASLYFCCAIEGQDNELITLELIHRYVELLDKYFGSVCELDIIFNFEKAYFILDEFLMG -----------------3333--3333--------------------------1111-%% GDVQDTSKKSVLKAIEQADLLQEEDESPR %%----3333------------------- >LIPOYLTRANSFERASE; SWP:NA; PDB:1W66A; AGSIRSKLSAIDVRQLGTVDYRTAWQLQRELADARVAGGADTLLLLEHPAVYTAGRRTET 1-----------------------------------------------------111133 HERPIDGTPVVDTDRGGKITWHGPGQLVGYPIIGLAEPLDVVNYVRRLEESLIQVCADLG 33-1111---------------2222------------------------------1111 LHAGRVDGRSGVWLPGRPARKVAAIGVRVSRATTLHGFALNCDCDLAAFTAIVPCGISDA 
-----2222--------------------%%%%------------3333----%%%%--- AVTSLSAELGRTVTVDEVRATVAAAVCAALDGVLP -------------3333-----------1111--- >RIBONUCLEOSIDE-DIPHOSPHAT; SWP:P11157; PDB:1W68A; VEDEPLLRENPRRFVVFPIEYHDIWQMYKKAEASFWTAEEVDLSKDIQHWEALKPDERHF 1111-----1111------------------1111-3333-------------------- ISHVLAFFAASDGIVNENLVERFSQEVQVTEARCFYGFQIAMENIHSEMYSLLIDTYIKD ----------------------3333---------------------------------- PKEREYLFNAIETMPCVKKKADWALRWIGDKEATYGERVVAFAAVEGIFFSGSFASIFWL --------1111--1111------------------------------------------ KKRGLMPGLTFSNELISRDEGLHCDFACLMFKHLVHKPAEQRVREIITNAVRIEQEFLTE -----3333--------------------------------------------------- ALPVKLIGMNCTLMKQYIEFVADRLMLELGFNKIFRVENPF --3333--------------------1111----------- >PHENYLETHYLAMINE OXIDASE; SWP:P46881; PDB:1W6GA; ASPFRLASAGEISEVQGILRTAGLLGPEKRIAYLGVLDPARGAGSEAEDRRFRVFIHDVS -1111--------------1111--1111----------------------------111 GARPQEVTVSVTNGTVISAVELDTAATGELPVLEEEFEVVEQLLATDERWLKALAARNLD 1---------1111--------3333------3333------------------1111-3 VSKVRVAPLSAGVFEYAEERGRRILRGLAFVQDFPEDSAWAHPVDGLVAYVDVVSKEVTR 333------------3333--------------11113333------------------- VIDTGVFPVPAEHGNYTDPELTGPLRTTQKPISITQPEGPSFTVTGGNHIEWEKWSLDVG -----------------3333--------------1111------------iiii----- FDVREGVVLHNIAFRDGDRLRPIINRASIAEMVVPYGDPSPIRSWQNYFDTGEYLVGQYA ---------------!!!!--------------------3333-----3333--3333-- NSLELGCDCLGDITYLSPVISDAFGNPREIRNGICMHEEDWGILAKHSDLWSGINYTRRN ---2222--------------1111------------------------1111------- RRMVISFFTTIGNDYGFYWYLYLDGTIEFEAKATGVVFTSAFPEGGSDNISQLAPGLGAP ---------------------1111-----------------2222-------2222--- FHQHIFSARLDMAIDGFTNRVEEEDVVRQTMGPGNERGNAFSRKRTVLTRESEAVREADA -------------------------------2222--------------3333-----33 RTGRTWIISNPESKNRLNEPVGYKLHAHNQPTLLADPGSSIARRAAFATKDLWVTRYADD 33-------1111-1111-----------------11113333---1111-------111 ERYPTGDFVNQHSGGAGLPSYIAQDRDIDGQDIVVWHTFGLTHFPRVEDWPIMPVDTVGF 
1-1111----------3333-------------------------3333----------- KLRPEGFFDRSPVLDVPAN ------------1111--- >LANOSTEROL SYNTHASE; SWP:P48449; PDB:1W6KA; CLRRRGGPYKTEPATDLGRWRLNCERGRQTWTYLQDAGREQTGLEAYALGLDTKNYFKDL ---------------3333---------------------------------1111---- PKAHTAFEGALNGMTFYVGLQAEDGHWTGDYGGPLFLLPGLLITCHVARIPLPAGYREEI -----------------11111111--------------------1111---2222---- VRYLRSVQLPDGGWGLHIEDKSTVFGTALNYVSLRILGVGPDDPDLVRARNILHKKGGAV --------1111----1111---3333------------1111----------1111333 AIPSWGKFWLAVLNVYSWEGLNTLFPEMWLFPDWAPAHPSTLWCHCRQVYLPMSYCYAVR 3-3333----------1111-----3333--1111--3333--3333------------- LSAAEDPLVQSLRQELYVEDFASIDWLAQRNNVAPDELYTPHSWLLRVVYALLNLYEHHH -----------1111----3333-33331111-3333-------------------1111 SAHLRQRAVQKLYEHIVADDRFTKSISIGPISKTINMLVRWYVDGPASTAFQEHVSRIPD ----------------------%%%%------------------1111----------11 YLWMGLDGMKMQGTNGSQIWDTAFAIQALLEAGGHHRPEFSSCLQKAHEFLRLSQVPDNP 11--1111------------------------33333333-----------1111----- PDYQKYYRQMRKGGFSFSTLDCGWIVSDCTAEALKAVLLLQEKCPHVTEHIPRERLCDAV -3333-----2222----3333---------------------1111----3333----- AVLLNMRNPDGGFATYETKRGGHLLELLNPSEVFGDIMIDYTYVECTSAVMQALKYFHKR --1111-1111------------3333--------------------------------- FPEHRAAEIRETLTQGLEFCRRQQRADGSWEGSWGVCFTYGTWFGLEAFACMGQTYRDGT 1111----------------11111111---------------------1111---iiii ACAEVSRACDFLLSRQMADGGWGEDFESCEERRYVQSAQSQIHNTCWAMMGLMAVRHPDI ------------11111111------3333----------------------1111---- EAQERGVRCLLEKQLPNGDWPQENIAGVFNKSCAISYTSYRNIFPIWALGRFSQLYPERA ----------11111111------------------1111---------------33333 LAGHP 333-- >GALECTIN-1; SWP:P09382; PDB:1W6NA; ASGLVASNLNLKPGELRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANTIVCN -----------2222--------------------1111----------iiii------- SKDDGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDGYEFKFPNRLNLEAINYMA --iiii------------2222--------3333----1111------1111-------- ADGDFKIKCVAFD ------------- >METHANOL DEHYDROGENASE SU; SWP:P16027; PDB:1W6SA; 
NDKLVELSKSDDNWVMPGKNYDSNNFSDLKQINKGNVKQLRPAWTFSTGLLNGHEGAPLV ---------1111--22223333---------11111111-------------------- VDGKMYIHTSFPNNTFALGLDDPGTILWQDKPKQNPAARAVACCDLVNRGLAYWPGDGKT iiii--------------1111--------------3333-------------------- PALILKTQLDGNVAALNAETGETVWKVENSDIKVGSTLTIAPYVVKDKVIIGSSGAELGV -------1111-------------------3333--------------------3333-- RGYLTAYDVKTGEQVWRAYATGPDKDLLLASDFNIKNPHYGQKGLGTGTWEGDAWKIGGG ----------------------3333-------33333333--3333---!!!!------ TNWGWYAYDPGTNLIYFGTGNPAPWNETMRPGDNKWTMTIFGRDADTGEAKFGYQKTPHD --------3333-------------3333------------------------------- EWDYAGVNVMMLSEQKDKDGKARKLLTHPDRNGIVYTLDRTDGALVSANKLDDTVNVFKS ----------------1111---------1111------------------1111----- VDLKTGQPVRDPEYGTRMDHLAKDICPSAMGYHNQGHDSYDPKRELFFMGINHICMDWEP ----------3333-------------3333----------------------------- FMLPYKAGQFFVGATLNMYPGPKGDRQNYEGLGQIKAYNAITGDYKWEKMERFAVWGGTM -----2222-----------1111------------------------------------ ATAGDLVFYGTLDGYLKARDSDTGDLLWKFKIPSGAIGYPMTYTHKGTQYVAIYYGVGGW --------------------------------------------iiii----------33 PGVGLVFDLADPTAGLGAVGAFKKLANYTQMGGGVVVFSLDGKGPYDDPNVGEWKS 33--1111--1111iiii3333-3333-----------2222-11113333----- >Methanol dehydrogenase su; SWP:P14775; PDB:1W6SB; YDGTKCKAAGNCWEPKPGFPEKIAGSKYDPKHDPKELNKQADSIKQMEERNKKRVENFKK -------2222----2222---2222------3333------------------------ TGKFEYDVAKISA ------3333--- >ENOLASE; SWP:Q8DPS0; PDB:1W6TA; HMSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGGEHEAVELRDGDKSRYGGL 1111----------1111---------3333---------------------1111iiii GTQKAVDNVNNIIAEAIIGYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAA ----------------22221111-------------1111------------------- ADYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPIAFQEFMILPVGAPTFKEALRYG -----------------------------!!!!-------------1111---------- AEIFHALKKILKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFLGFD -----------1111-----1111--------------------1111-2222------- CASSEFYDRKVYDYTKFEGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKA 
-3333-------3333--1111----------------------------1111------ LTERLGKKVQLVGDDFFVTNTDYLARGIQEGAANSILIKVNQIGTLTETFEAIEMAKEAG -------------3333---3333--------------1111--------------1111 YTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLSRTDRIAKYNQLLRIEDQLGEVAEYR --------------3333-----------------3333-------------!!!!---- GLKSFYNLK 33333333- >2,4-DIENOYL-COA REDUCTASE; SWP:Q16698; PDB:1W6UA; NTEALQSKFFSPLQKAMLPPNSFQGKVAFITGGGTGLGKGMTTLLSSLGAQCVIASRKMD ------------------22222222-----1111----------1111----------- VLKATAEQISSQTGNKVHAIQCDVRDPDMVQNTVSELIKVAGHPNIVINNAAGNFISPTE ----------------------1111-------------------------------333 RLSPNAWKTITDIVLNGTAFVTLEIGKQLIKAQKGAAFLSITTIYAETGSGFVVPSASAK 3-----------------------------------------1111-------------- AGVEAMSKSLAAEWGKYGMRFNVIQPGPIKTLDPTGTFEKEMIGRIPCGRLGTVEELANL -------------3333---------------!!!!---------1111----------- AAFLCSDYASWINGAVIKFDGGEEVLISGEFNDLRKVTKEQWDTIEEL -----3333-------------------1111-1111-------1111 >UBIQUITIN CARBOXYL-TERMIN; SWP:Q9Y4E8; PDB:1W6VA; MAEGGAADLDTQRSDIATLLKTSLRKGDTWYLVDSRWFKQWKKYVGFDSWDKYQMGDQNV ----------------1111----2222-------------------11111111-3333 YPGPIDNSGLLKDGDAQSLKEHLIDELDYILLPTEGWNKLVSWYTLMEGQEPIARKVVEQ ------3333--------------3333------------------2222---------- >NEUTROPHIL CYTOSOL FACTOR; SWP:Q15080; PDB:1W70A; LIKHMRAEALFDFTGNSKLELNFKAGDVIFLLSRINKDWLEGTVRGATGIFPLSFVKILK ----------------3333---2222--------1111----iiii----3333----- >PEPTIDYL-PROLYL CIS-TRANS; SWP:P65762; PDB:1W74A; LATATATLHTNRGDIKIALFGNHAPKTVANFVGLAQGTKDYSTQNASGGPSGPFYDGAVF ---------1111-------------------------------1111----1111---- HRVIQGFMIQGGDPTGTGRGGPGYKFADEFHPELQFDKPYLLAMANAGPGTNGSQFFITV ---2222-----1111--------------1111-------------------------- GKTPHLNRRHTIFGEVIDAESQRVVEAISKTATDGNDRPTDPVVIESITIS --3333---------------------1111--1111-------------- >2C-METHYL-D-ERYTHRITOL 4-; SWP:P69834; PDB:1W77A; MEKSVSVILLAGGSMPKQYIPLLGQPIALYSFFTFSRMPEVKEIVVVCDPFFRDIFEEYE 
---------------1111------1111--------3333-------33333333--11 ESIDVDLRFAIPGKERQDSVYSGLQEIDVNSELVCIHDSARPLVNTEDVEKVLKDGSAVG 11------------3333----3333---------------------------------- AAVLGVPAKATIKEVNSDSLVVTLWEMQTPQVIKPELLKKGFELVKSEGLEVTDVSIVEY ---------------1111------------------------------------1111- LKHPVYVSQGSYTNIKVTTPDDLLLAERILSE ----------1111----1111---------- >FOLC BIFUNCTIONAL PROTEIN; SWP:P08192; PDB:1W78A; TPQAASPLASWLSYLENLHSKTIDLGLERVSLVAARLGVLKPAPFVFTVAGTNGKGTTCR --11113333-------------------------------------------------- TLESILMAAGYKVGVYSSPHLVRYTERVRVQGQELPESAHTASFAEIESARGDISLTYFE ----------------------3333---iiii--3333-----------!!!!------ YGTLSALWLFKQAQLDVVILEVGLGGRLDATNIVDADVAVVTSIALDHTDWLGPDRESIG ----------1111-----------1111------------------3333--------- REAGIFRSEKPAIVGEPEMPSTIADVAQEKGALLQRRGVEWNYSVTDHDWAFSDAHGTLE --11112222-------------------------2222--------------1111--- NLPLPLVPQPNAATALAALRASGLEVSENAIRDGIASAILPGRFQIVSESPRVIFDVAHN ---------------------------------------2222----------------- PHAAEYLTGRMKALPKNGRVLAVIGMLHDKDIAGTLAWLKSVVDDWYCAPLEGPRGATAE ----------1111------------1111------------------------------ QLLEHLGNGKSFDSVAQAWDAAMADAKAEDTVLVCGSFHTVAHVMEVIDARRS -3333---------------------1111-------------------3333 >D-ALANYL-D-ALANINE CARBOX; SWP:P39045; PDB:1W79A; RLTELREDIDAILEDPALEGAVSGVVVVDTATGEELYSRDGGEQLLPASNMKLFTAAAAL --------------3333---------------------1111---!!!!---------- EVLGADHSFGTEVAAESAPGRRGEVQDLYLVGRGDPTLSAEDLDAMAAEVAASGVRTVRG ---1111------------1111---------------------------1111------ DLYADDTWFDSERLVDDWWPEDEPYAYSAQISALTVAHGERFDTGVTEVSVTPAAEGEPA -----3333-----11113333--3333----------1111------------2222-- DVDLGAAEGYAELDNRAVTGAAGSANTLVIDRPVGTNTIAVTGSLPADAAPVTALRTVDE ----1111------------2222--------2222---------1111----------3 PAALAGHLFEEALESNGVTVKGDVGLGGVPADWQDAEVLADHTSAELSEILVPFMKFSNN 333----------1111----------------------------3333-----1111-- GHAEMLVKSIGQETAGAGTWDAGLVGVEEALSGLGVDTAGLVLNDGSGLSRGNLVTADTV 
-------------------------------1111--1111------------------- VDLLGQAGSAPWAQTWSASLPVAGESDPFVGGTLANRMRGTAAEGVVEAKTGTMSGVSAL ---------1111---3333------1111!!!!---2222-2222-------2222--- SGYVPGPEGELAFSIVNNGHSGPAPLAVQDAIAVRLAEYAGHQAPEG -----1111-------------------------------------- >LYSYL OXIDASE; SWP:Q96X16; PDB:1W7CA; AECVSNENVEIEAPKTNIWTSLAKEEVQEVLDLLHSTYNITEVTKADFFSNYVLWIETLK ----------------1111---------------------3333-1111---------- PNKTEALTYLDEDGDLPPRNARTVVYFGEGEEGYFEELKVGPLPVSDETTIEPLSFYNTN ---------------------------------------------1111-----1111-- GKSKLPFEVGHLDRIKSAAKSSFLNKNLNTTIMRDVLEGLIGVPYEDMGCHSAAPQLHDP -----3333----------------------------------3333------------- ATGATVDYGTCNINTENDAENLVPTGFFFKFDMTGRDVSQWKMLEYIYNNKVYTSAEELY ------------------3333--------------3333-------%%%%--------- EAMQKDDFVTLPKIDVDNLDWTVIQRNDSAPVRHLDDRKSPRLVEPEGRRWAYDGDEEYF ----1111------11113333----------2222------------------------ SWMDWGFYTSWSRDTGISFYDITFKGERIVYELSLQELIAEYGSDDPFNQHTFYSDISYG -iiii------------------iiii--------------------3333---3333-- VGNRFSLVPGYDCPSTAGYFTTDTFEYDEFYNRTLSYCVFENQEDYSLLRHTGASYSAIT -------2222--1111--------iiii-----------------------%%%%---- QNPTLNVRFISTIGNDYNFLYKFFLDGTLEVSVRAAGYIQAGYWNPETSAPYGLKIHDVL -----------------------1111-----------------33333333-------- SGSFHDHVLNYKVDLDVGGTKNRASQYVMKDVDVEYPWAPGTVYNTKQIAREVFENEDFN -----------------------------------1111------------------111 GINWPENGQGILLIESAEETNSFGNPRAYNIMPGGGGVHRIVKNSRSGPETQNWARSNLF 1---2222------------1111--------------------1111---1111----- LTKHKDTELRSSTALNTNALYDPPVNFNAFLDDESLDGEDIVAWVNLGLHHLPNSNDLPN ----1111----1111--1111---3333------------------------1111--- TIFSTAHASFMLTPFNYFDSENSRDTTQQVFYTYDDETEESNWEFYGNDWSSCGVEVAEP --1111----------------1111------------------iiii------------ NFEDYTYGRGTRINKK 3333------------ >FASCICLIN-LIKE PROTEIN; SWP:Q3IXZ6; PDB:1W7DA; ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK ----3333------1111----11113333------------------------------ 
PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD -----3333-----------3333--------3333------2222--iiii-------- ASNGVIHVIDGVLMPGA ----------------- >MYOSIN VA; SWP:Q02440; PDB:1W7JA; AASELYTKYARVWIPDPEEVWKSAELLKDYKPGDKVLQLRLEEGKDLEYCLDPKTKELPP -3333-2222--------------------2222-------------------------- LRNPDILVGENDLTALSYLHEPAVLHNLKVRFIDSKLIYTYCGIVLVAINPYEQLPIYGE ---3333-----1111-------------------------!!!!-------------33 DIINAYSGQNMGDMDPHIFAVAEEAYKQMARDERNQSIIVSGESGAGKTVSAKYAMRYFA 33---22223333---3333---------1111---------2222-------------- TVSGSASEANVEEKVLASNPIMESIGNAKTTRNDNSSRFGKYIEIGFDKRYRIIGANMRT -----------------------------1111--------------1111--------- YLLEKSRVVFQAEEERNYHIFYQLCASAALPEFKTLRLGNANYFHYTKQGGSPVIDGIDD ---3333----2222--3333---1111-33331111--111111111111---2222-- AKEMVNTRQACTLLGISDSYQMGIFRILAGILHLGNVEFASRDSDSCAIPPKHDPLTIFC -----------1111------------------1111----------------------- DLMGVDYEEMAHWLCHRKLATATETYIKPISKLHAINARDALAKHIYANLFNWIVDHVNK --------------------3333-----------------------------------1 ALHSTVKQHSFIGVLDIYGFETFEINSFEQFCINYANEKLQQQFNMHVFKLEQEEYMKEQ 111-----------------------------------------------------1111 IPWTLIDFYDNQPCINLIEAKMGVLDLLDEECKMPKGSDDTWAQKLYNTHLNKCALFEKP -1111--------------2222----------1111------------22221111--1 RLSNKAFIIKHFADKVEYQCEGFLEKNKDTVYEEQIKVLKSSKKFKLLPELFQKTVGHQF 111----------------2222--------3333---1111--33333333-------- RNSLHLLMETLNATTPHYVRCIKPNDFKFPFTFDEKRAVQQLRACGVLETIRISAAGFPS ---------3333----------------------------------------------- RWTYQEFFSRYRVLMKQKDVLSDRKQTCKNVLEKLILDKDKYQFGKTKIFFRAGQVAYLE ----------3333-3333------------------3333---1111---2222----- KIRADKLRAACIRIQKTIRGWLMRKKYMRMRR -------------------------------- >Myosin light polypeptide ; SWP:P14649; PDB:1W7JB; EFKEAFELFDRVGDGKILYSQCGDVMRALGQNPTNAEVLKVLGNPKSDELKSRRVDFETF ------1111-------3333-----1111---------------3333------33331 LPMLQAVAKDYLEGFRVFDKEGNGKVMGAELRHVLTTLGEKMTEEEVETVLAGHEDSNGC 
111------3333-33331111----------------------------2222-1111- INYEAFLKHIL -------1111 >KYNURENINE--OXOGLUTARATE ; SWP:Q16773; PDB:1W7LA; QLQARRLDGIDYNPWVEFVKLASEHDVVNLGQGFPDFPPPDFAVEAFQHAVSGDFMLNQY ---3333-----3333----3333---------------3333-----1111---1111- TKTFGYPPLTKILASFFGELLGQEIDPLRNVLVTVGGYGALFTAFQALVDEGDEVIIIEP -11113333----------------3333--------------------2222------- FFDCYEPMTMMAGGRPVFVSLKPGPGELGSSSNWQLDPMELAGKFTSRTKALVLNTPNNP -----------------------------3333--------33331111----------- LGKVFSREELELVASLCQQHDVVCITDEVYQWMVYDGHQHISIASLPGMWERTLTIGSAG ---------------------------1111---iiii---33332222----------- TFSATGWKVGWVLGPDHIMKHLRTVHQNSVFHCPTQSQAAVAESFEREQLLFRQPSSYFV ---1111-------3333--------------------------------2222------ QFPQAMQRCRDHMIRSLQSVGLKPLIPQGSYFLITDISDFKRKMPDLPGAVDEPYDRRFV ----------------1111-----------------------------2222------- KWMIKNKGLVAIPVSIFYSVPHQKHFDHYIRFCFVKDEATLQAMDEKLRKWKVE ------------3333--33333333----------3333-------------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:NA; PDB:1W7WA; AELERTFIAIKPDGVQRGLISEIISRFERKGFKLVGIKVLIPTKQFAQQHYHDLKERPFF --------------1111--------3333--------------3333--3333--1111 NGLCDFLSSGPVIAMVWEGEGVITYGRKLIGATDPQKSAPGTIRGDLAVVVGRNIIHGSD -----1111---------2222-----------3333-2222-------3333------- GPETAKDEIKLWFKPEELVSFTSNSEKWIYG -------------3333-----1111----- >APICAL MEMBRANE ANTIGEN 1; SWP:O61130; PDB:1W81A; SIPTVERSTRMSNPWKAFMEKYDIERTHSSGVRVDLGEDAEVENAKYRIPAGRCPVFGKG ------------11111111---------------------------------------- IVIENSDVSFLRPVATGDQKLKDGGFAFPNANDHISPMTLANLKERYKDNVEMMKLNDIA --------1111----------------------------------11113333------ LCRTHAASFVQNSNYRHPAVYDEKEKTCHMLYLSAQENMGFCFKPDKDESFENLVYLSKN -----1111--------------------------------------3333------111 VRNDWDKKCPRKNLGNAKFGLWVDGNCEEIPYVKEVEAEDLRECNRIVFGASASDQFKSK 1--3333---------------------------------------------------ii GRGFNWANFDSVKKKCYIFNTKPTCLINDKNFIATTALSHPQEVDLEFPCSIYKDEIERE ii--------------------------1111---1111---------1111-------- 
IKGERIVLPRIFISNDKESIKCPCEPERISQSTCNFYVCNCVEKRAEIKENNQVVIKEEF ---------------3333---------1111-----------------%%%%---3333 RDYY ---- >DIHYDROLIPOYLLYSINE-RESID; SWP:P21873; PDB:1W85A; TFQFPFAEQLEKVAEQFPTFQILNEEGEVVNEEAMPELSDEQLKELMRRMVYTRILDQRS -------------1111------1111---3333-------------------------- ISLNRQGRLGFYAPTAGQEASQIASHFALEKEDFILPGYRDVPQIIWHGLPLYQAFLFSR ---1111-------22223333---11113333----1111---------3333------ GHFHGNQIPEGVNVLPPQIIIGAQYIQAAGVALGLKMRGKKAVAITYTGDGGTSQGDFYE -3333---2222-------2222------------1111---------3333-------- GINFAGAFKAPAIFVVQNNRFAISTPVEKQTVAKTLAQKAVAAGIPGIQVDGMDPLAVYA ---------------------!!!!3333------33333333-------1111------ AVKAARERAINGEGPTLIETLCFRYGPHTMSGDSKELENEWAKKDPLVRFRKFLEAKGLW --------1111---------------------------3333-----------1111-- SEEEENNVIEQAKEEIKEAIKKADETPKQKVTDLISIMFEELPFNLKEQYEIYKEKESK ----------------------1111---3333-1111--------------------- >Pyruvate dehydrogenase E1; SWP:P21874; PDB:1W85B; AQMTMVQAITDALRIELKNDPNVLIFGEDVGVNGGVFRATEGLQAEFGEDRVFDTPLAES -------------------1111-------33331111---3333--------------- GIGGLAIGLALQGFRPVPEIQFFGFVYEVMDSICGQMARIRYRTGGRYHMPITIRSPFGG ---------1111--------333333333333--3333----iiii------------- GVHTPELHSDSLEGLVAQQPGLKVVIPSTPYDAKGLLISAIRDNDPVIFLEHLKLYRSFR ----2222---33331111--------------------------------3333----- QEVPEGEYTIPIGKADIKREGKDITIIAYGAMVHESLKAAAELEKEGISAEVVDLRTVQP -------------------------------------------1111------------- LDIETIIGSVEKTGRAIVVQEAQRQAGIAANVVAEINERAILSLEAPVLRVAAPDTVYPF ----------------------1111-------------1111----------------3 AQAESVWLPNFKDVIETAKKVMNF 333------3333----------- >SLIT PROTEIN; SWP:P24014; PDB:1W8AA; DCPAMCHCEGTTVDCTGRGLKEIPRDIPLHTTELLLNDNELGRISSDGLFGRLPHLVKLE --------!!!!---------------1111----------------3333-1111---- LKRNQLTGIEPNAFEGASHIQELQLGENKIKEISNKMFLGLHQLKTLNLYDNQISCVMPG ---------22222222----------------------------------------222 SFEHLNSLTSLNLASNPFNCNCHLAWFAEWLRKKSLNGGAARCGAPSKVRDVQIKDLPHS 
21111---------------3333----------------------1111--3333-333 EFKCSSEGC 3-------- >HYPOTHETICAL UPF0001 PROT; SWP:P67080; PDB:1W8GA; DIAHNLAQVRDKISAAATRCGRSPEEITLLAVSKTKPASAIAEAIDAGQRQFGENYVQEG -----------------1111-3333------22223333----1111------------ VDKIRHFQELGVTGLEWHFIGPLQSNKSRLVAEHFDWCHTIDRLRIATRLNDQRPAELPP -------1111------------3333---------------------------3333-- LNVLIQINISDENSKSGIQLAELDELAAAVAELPRLRLRGLMAIPAPESEYVRQFEVARQ ----------1111----3333----------1111------------------------ MAVAFAGLKTRYPHIDTLSLGMSDDMEAAIAAGSTMVRIGTAIFGA ---------------------3333--------------3333--- >HYPOTHETICAL PROTEIN AF16; SWP:O28590; PDB:1W8IA; AALIDTGIFFGFYSLKDVHHDSVAIVVHAVEGKWGRLFVTNHILDETLTLLKYKKLPADK -------------1111--------------1111------------------------- FLEGFVESGVLNIIYTDDEVERKALEVFKARVYEKGFSYTDAISEVVAEELKLKLISYDS -----3333-------------------1111-2222----------------------- RFSLPTIGRDYWKSLDESERKRISAILREKGID ---------3333-------------------- >APICAL MEMBRANE ANTIGEN 1; SWP:O61130; PDB:1W8KA; PTVERSTRMSNPWKAFMEKYDIERTHSSGVRVDLGEDAEVENAKYRIPAGRCPVFGKGIV ----------11111111---3333----------------------------------- IENSDVSFLRPVATGGFAFPNANDHISPMTLANLKERYKDNVEMMKLNDIALCRTHAASF ------1111-------------------------1111--3333-----------1111 VSNYRHPAVYDEKEKTCHMLYLSAQENMYCSDAVFCFKPDKDESFENLVYLSKNVRNDWD -----------1111--------------------------3333------11111111- KKCPRKNLGNAKFGLWVDGNCEEIPYVKEVEAEDLRECNRIVFGASASDQPTFKSKGRGF ------------------------------------------------------iiii-- NWANFDSVKKKCYIFNTKPTCLINDKNFIATTALSHPQEVDLEFPCSIYKDEIEREIKKQ ------------------------1111---1111---------1111------------ ERIVLPRIFISNDKESIKCPCEPERISQSTCNFYVCNCVEKRAEIKENNQVVIKEEFRDY ------------3333---------1111-----------------%%%%---3333333 YE 3- >BACTERIAL SIALIDASE; SWP:Q02834; PDB:1W8OA; GEPLYTEQDLAVNGREGFPNYRIPALTVTPDGDLLASYDGRPTGIGAPGPNSILQRRSTD -----------2222-------------1111--------1111---------------i GGRTWGEQQVVSAGQTTAPIKGFSDPSYLVDRETGTIFNFHVYSQRQGFAGSRPGTDPAD 
iii--------------------------------------------3333-----1111 PNVLHANVATSTDGGLTWSHRTITADITPDPGWRSRFAASGEGIQLRYGPHAGRLIQQYT ------------iiii------3333---1111---------------1111-------- IINAAGAFQAVSVYSDDHGRTWRAGEAVGVGMDENKTVELSDGRVLLNSRDSARSGYRKV --1111----------iiii-------------------1111-------1111------ AVSTDGGHSYGPVTIDRDLPDPTNNASIIRAFPDAPAGSARAKVLLFSNAASQTSRSQGT ----iiii-------1111------------11112222--------------------- IRMSCDDGQTWPVSKVFQPGSMSYSTLTALPDGTYGLLYEPGTGIRYANFNLAWLGGICA -----iiii--------------------1111------------------3333----- PFTIPDVALEPGQQVTVPVAVTNQSGIAVPKPSLQLDASPDWQVQGSVEPLMPGRQAKGQ ---------2222-------------------------1111---------2222----- VTITVPAGTTPGRYRVGATLRTSAGNASTTFTVTVGLLDQARMSIADVDSEETAREDGRA -----2222------------1111-------------3333----------------33 SNVIDGNPSTFWHTEWSRADAPGYPHRISLDLGGTHTISGLQYTRRQNSANEQVADYEIY 33----1111-------1111--------------------------------------- TSLNGTTWDGPVASGRFTTSLAPQRAVFPARDARYIRLVALSEQTGHKYAAVAELEVEGQ ------------------------------------------------------------ R - >FRUCTOSE-BISPHOSPHATE ALD; SWP:P58315; PDB:1W8SA; NLTEKFLRIFARRGKSIILAYDHGIEHGPADFMDNPDSADPEYILRLARDAGFDGVVFQR ----------1111-------3333--33333333------------------------- GIAEKYYDGSVPLILKLNGKTTLYNGEPVSVANCSVEEAVSLGASAVGYTIYPGSGFEWK --------------------3333---------------1111--------2222----- MFEELARIKRDAVKFDLPLVVESFPRGGKVVNETAPEIVAYAARIALELGADAMKIKYTG --------------------------!!!!-1111------------------------- DPKTFSWAVKVAGKVPVLMSGGPKTKTEEDFLKQVEGVLEAGALGIAVGRNVWQRRDALK 3333-------!!!!-----------------------1111------1111-------- FARALAELVY ---------- ------------------------------------------------------------ ----------------------- >Penton protein P31; SWP:P27384; PDB:1W8XN; NPNQMTVTPVYNGCDSGEGPQSVRGYFDAVAGENVKYDLTYLADTQGFTGVQCIYIDNAE ----------2222---------------------------------------------- NDGAFEIDVEETGQRIKCPAGKQGYFPLLVPGRAKFVARHLGSGKKSVPLFFLN -1111------------------------------------------------- >Protein P16; SWP:P27392; PDB:1W8XP; 
MDKKKLLYWVGGGLVLILIWLWFRNRPAAQVASNWEGPPYMTYNQPQAGSVTLPVADSIT -----3333-----------------------3333------------------------ SQLNDYASSLNDYLASQAGV --3333-------------- >BETA-XYLOSIDASE; SWP:Q9ZFM2; PDB:1W91A; VNVPSNGREKFKKNWKFCVGTGRLGLALQKEYLDHLKLVQEKIGFRYIRGHGLLSDDVGI ------------3333------3333-------------------------11113333- YREVEIDGEMKPFYNFTYIDRIVDSYLALNIRPFIEFGFMPKALASGDQTVFYWKGNVTP --------------------------1111----------1111-------1111----- PKDYNKWRDLIVAVVSHFIERYGIEEVRTWLFEVWNEPNLVNFWKDANKQEYFKLYEVTA -----------------------3333---------1111---2222------------- RAVKSVDPHLQVGGPAICGGSDEWITDFLHFCAERRVPVDFVSRHAYTSKAPHKKTFEYY --33331111----------3333--------1111------------------------ YQELEPPEDMLEQFKTVRALIRQSPFPHLPLHITEYNTSYSPINPVHDTALNAAYIARIL --------------------1111-1111------------------------------- SEGGDYVDSFSYWTFSDVFEEMDVPKALFHGGFGLVALHSIPKPTFHAFTFFNALGDELL -3333---------------------------------------------3333------ YRDGEMIVTRRKDGSIAAVLWNLVMEKGEGLTKEVQLVIPVSFSAVFIKRQIVNEQYGNA ----------1111--------------------------------------------33 WRVWKQMGRPRFPSRQAVETLRQVAQPHVMTEQRRATDGVIHLSIVLSKNEVTLIEIEQV 33--1111----------------------------iiii-------------------- RDETSTYVGLDDGEITSYS --3333222233332222- >PROBABLE BRIX-DOMAIN RIBO; SWP:O26776; PDB:1W94A; HLLTTSRKPSQRTRSFSQRLSRIGWRYINRGKSLRDVLIEARGPVAVVSERHGNPARITF --------------------------------------------------iiii------ LDERGGERGYILFNPSFEKKPELADKAVRVSSCPPGSEGLCNLGLEVDESSSRDAWSIRT -1111----------------------------2222----------------------- DEEYAWVELDARGTPAGFKLLIRDFRVG ---------1111--------------- >ACETYL-COENZYME A CARBOXY; SWP:Q00955; PDB:1W96A; MEYEITNYSERHTELPGHFIGLNTVDKLEESPLRDFVKSHGGHTVISKILIANNGIAAVK -------333311113333----3333----------1111------------------- EIRSVRKWAYETFGDDRTVQFVAMATPEDLEANAEYIRMADQYIEVPGGTNNNNYANVDL ----------------------------------3333-----------33331111--- IVDIAERADVDAVWAGWGHASENPLLPEKLSQSKRKVIFIGPPGNAMRSLGDKISSTIVA -----------------!!!!-----------1111------------------------ 
QSAKVPCIPWSGTGVDTVHVDEKTGLVSVDDDIYQKGCCTSPEDGLQKAKRIGFPVMIKA 1111-----1111----------------3333-1111---------------------1 SEGGGGKGIRQVEREEDFIALYHQAANEIPGSPIFIMKLAGRARHLEVQLLADQYGTNIS 111----------3333-----------2222---------------------------- LFGRDCSVQRRHQKIIEEAPVTIAKAETFHEMEKAAVRLGKLVGYVSAGTVEYLYSHDDG ---------iiii-------------------------------------------1111 KFYFLELNPRLQVEHPTTEMVSGVNLPAAQLQIAMGIPMHRISDIRTLYGMNPHSASEID -----------1111-----------------1111-1111-----1111-1111----1 FEFKTQDATKKQRRPIPKGHCTACRITSEDPNDGFKPSGGTLHELNFRSSSNVWGYFSVG 111--------------------------------------------------------- NNGNIHSFSDSQFGHIFAFGENRQASRKHMVVALKELSIRGTVEYLIKLLETEDFEDNTI ------------------------------------1111-------------------- TTGWLDDLI 1111----- >General secretion pathway; SWP:P45782; PDB:1W97L; SEFLTVRLSSQKEADIPWLVWSAEQQEVIASGQVAGWEALHEIESYADQRSVVVLLAASD ----------1111------------------------333333332222------3333 LILTSVEQLENLPYLLEDAQDVEDVHFCVLSKGRETADVVGVDRLWLRACLDHLKACGFD ---------------------2222----------------------------1111--- VKRVLPDVLAIPRPEHGLAALQLGDEWLVRKSTTQGAVDAQWLSLLAASDWVQNEGEYLP -----3333-------------!!!!----------------3333---1111------- LQALTPLPELSLAETQEWRYEPSGLVQLLTQEALTSKFNLLTGSFKL --------------------------------1111-----!!!!-- >G1/S-specific cyclin-E1; SWP:P24864; PDB:1W98B; SPLPVLSWANREEVWKIMLNKEKTYLRDQHFLEQHPLLQPKMRAILLDWLMEVCEVYKLH --------------------1111---11111111---3333-----------------3 RETFYLAQDFFDRYMATQENVVKTLLQLIGISSLFIAAKLEEIYPPKLHQFAYVTDGACS 333-----------1111---1111---------------------3333-3333----- GDEILTMELMIMKALKWRLSPLTIVSWLNVYMQVAYLNDLHEVLLPQYPQQIFIQIAELL --------------%%%%---------------1111-----------3333-------- DLCVLDVDCLEFPYGILAASALYHFSSSELMQKVSGYQWCDIENCVKWMVPFAMVIRETG -----3333---3333------1111-----------3333------------------- SSKLKHFRGVADEDAHNIQTHRDSLDLLDK ------222233331111----33333333 >PESTICIDIAL CRYSTAL PROTE; SWP:P05519; PDB:1W99A; TPERVWNDFMTNTGNLIDQTVTAYVRTDANAKMTVVKDYLDQYTTKFNTWKREPNNQSYR 
-------1111-----------------------------------------1111---- TAVITQFNLTSAKLRETAVYFSNLVGYELLLLPIYAQVANFNLLLIRDGLINAQEWSLAR -------------------111122223333----------------------3333--- SAGDQLYNTMVQYTKEYIAHSITWYNKGLDVLRNKSNGQWITFNDYKREMTIQVLDILAL -----------------------------------------------------------3 FASYDPRRYPADKIDNTKLSKTEFTREIYTALVESPSSKSIAALEAALTRDVHLFTWLKR 333--------3333--------------------------------------------- VDFWTNTIYQDLRFLSANKIGFSYTNSSAMQESGIYGSSGFGSNLTHQIQLNSNVYKTSI --------1111---------------------------2222----------------- TDTSSPSNRVTKMDFYKIDGTLASYNSNITPTPEGLRTTFFGFSTNENTPNQPTVNDYTH ----------------1111-------------------------3333----1111--- ILSYIKTDVIDYNSNRVSFAWTHKIVDPNNQIYTDAITQVPAVKSNFLNATAKVIKGPGH ----------2222--------33331111-----------1111---1111-------- TGGDLVALTSNGTLSGRMEIQCKTSIFNDPTRSYGLRIRYAANSPIVLNVSYVLQGVSRG -----------------------------------------------------iiii--- TTISTESTFSRPNNIIPTDLKYEEFRYKDPFDAIVPMRLSSNQLITIAIQPLNMTSNNQV ----------2222------3333-------3333-------------------1111-- IIDRIEIIPITQSVLDET ------------------ >CRM1 PROTEIN; SWP:O14980; PDB:1W9CA; VIQLGRIYLDMLNVYKCLSENISAAIQANGEMVTKQPLIRSMRTVKRETLKLISGWVSRS ---1111----------------------3333--------------------------- NDPQMVAENFVPPLLDAVLIDYQRNVPAAREPEVLSTMAIIVNKLGGHITAEIPQIFDAV -------------------------3333-------------------3333-------- FECTLNMINKDFEEYPEHRTNFFLLLQAVNSHCFPAFLAIPPTQFKLVLDSIIWAFKHTM --------------------------------3333----3333---------------3 RNVADTGLQILFTLLQNVAQEEAAAQSFYQTYFCDILQHIFSVVTDTSHTAGLTMHASIL 333-----------------3333-------------------------2222------- AYMFNLVEEGKISTSLNPGNPVNNQIFLQEYVANLLKSAFPHLQDAQVKLFVTGLFSLNQ ---------------------------------------33333333------------- DIPAFKEHLRDFLVQIKEFAG ---------------3333-- >HYPOTHETICAL PROTEIN AF13; SWP:O28951; PDB:1W9HA; LTYRIGNGASVPISNTGELIKGLRNYGPYEVPSLKYNQIALIHNNQFSSLINQLKSQISS -----iiii--------------------------------------3333--------- 
KIDEVWHIHNINISEFIYDSPHFDSIKSQVDNAIDTGVDGIMLVLPEYNTPLYYKLKSYL ---------------------------------3333----------------------- INSIPSQFMRYDILTFYVDNLLVQFVSKLGGKPWILNVDPEKGSDIIIGTGATRIDNVNL ---------3333-------------1111--------3333------------------ FCFAMVFKKDGTMLWNEISPIVTSSEYLTYLKSTIKKVVYGFKKSNPDWDVEKLTLHVSG -------1111-----------3333-------------------1111----------- KRPKMKDGETKILKETVEELKKQEMVSRDVKYAILHLNETHPFWVMHPYEGTKVKLSSKR --------------------------------------------------------1111 YLLTLLQPYLVTPIKPLSVEIVSDNWTSEEYYHNVHEILDEIYYLSKMNWRGFRSRNLPV --------------------------3333----------------------------33 TVNYPKLVAGIIANVNRYGGYPINPEGNRSLQTNPWFL 33----------------------22223333--1111 >MYOSIN II HEAVY CHAIN; SWP:P08799; PDB:1W9IA; NPIHDRTSDYHKYLKVKQGDKRYIWYNPDPKERDSYECGEIVSETSDSFTFKTVDGQDRQ 33331111--------------------1111--------------------1111---- VKKDDANQRNPIKFDGVEDMSELSYLNEPAVFHNLRVRYNQDLIYTYSGLFLVAVNPFKR -1111-----3333----3333----------------1111-----!!!!--------- IPIYTQEMVDIFKGRRRNEVAPHIFAISDVAYRSMLDDRQNQSLLITGESGAGKTENTKK ---------1111--1111---3333----------------------2222-------- VIQYLASVAGRGVLEQQILQANPILEAFGNAKTTRNNNSSRFGKFIEIQFNSAGFISGAS --------------------------------1111--------------1111------ IQSYLLEKSRVVFQSETERNYHIFYQLLAGATAEEKKALHLAGPESFNYLNQSGCVDIKG --------3333--2222--3333------------------33331111-------222 VSDSEEFKITRQAMDIVGFSQEEQMSIFKIIAGILHLGNIKFEKGAGEGAVLKDKTALNA 2-----------------------------------1111-------------------- ASTVFGVNPSVLEKALMEPRILAGRDLVAQHLNVEKSSSSRDALVKALYGRLFLWLVKKI ----------------------!!!!---------------------------------- NNVLCQERKAYFIGVLDIYGFEIFKVNSFEQLCINYTNEKLQQFFNHHMFKLEQEEYLKE ------------------------------------------------------------ KINWTFIDFGLDSQATIDLIDGRQPPGILALLDEQSVFPNATDNTLITKLHSHFSKKNAK -------3333------------------------------------------2222111 YEEPRFSKTEFGVTHYAGQVMYEIQDWLEKNKDPLQQDLELCFKDSSDNVVTKLFNDPNI 1-------------1111-----2222--------3333--3333----3333---3333 
ASRAKKGANFITVAAQYKEQLASLMATLETTNPHFVRCIIPNNKQLPAKLEDKVVLDQLR -----!!!!------------------1111----------------------------- CNGVLEGIRITRKGFPNRIIYADQFRFGITKIFFRAGQLARIE --------1111-------3333-----------22223333- >CHITINASE; SWP:Q873X9; PDB:1W9PA; ASSGYRSVVYFVNWAIYGRNHNPQDLPVERLTHVLYAFANVRPETGEVYMTDSWADIEKH ----------------3333-3333-1111------------------------------ YPGDSWSDTGNNVYGCIKQLYLLKKQNRNLKVLLSIGGWTYSPNFAPAASTDAGRKNFAK 2222-----------------3333-1111--------1111---3333----------- TAVKLLQDLGFDGLDIDWEYPENDQQANDFVLLLKEVRTALDSYSAANAGGQHFLLTVAS ------------------------------------------------iiii-------- PAGPDKIKVLHLKDMDQQLDFWNLMAYDYAGSFSSLSGHQANVYNDTSNPLSTPFNTQTA ------3333-----1111-----------1111-----------11111111------- LDLYRAGGVPANKIVLGMPLYGRSFANTDGPGKPYNGVGQGSWENGVWDYKALPQAGATE ----1111-1111------------------------------2222-3333--2222-- HVLPDIMASYSYDATNKFLISYDNPQVANLKSGYIKSLGLGGAMWWDSSSDKTGSDSLIT ---1111---------------------------------------1111--!!!!---- TVVNALGGTGVFEQSQNELDYPVSQYDNLRNGMQT ---11113333---------1111----1111--- >CHOLINE BINDING PROTEIN A; SWP:Q97N74; PDB:1W9RA; GSHMPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELEL ------------------------------------------------------------ VKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKP -----------------------------------------------------3333-- >BH0236 PROTEIN; SWP:Q9KG76; PDB:1W9SA; DLKNPYERIQAEAYDAMSGIQTEGTDDDGGGDNIGWINDGDWVKYERVHFERDASSIEVR ---1111--1111-------------2222-------2222------------------- VASDTPGGRIEIRTGSPTGTLLGDVQVPNTGGWQQWQTVTGNVQIQPGTYDVYLVFKGSP ---------------1111----------------------------------------- EYDLMNVNWFVFRA -------------- >1-AMINOCYCLOPROPANE-1-CAR; SWP:Q08506; PDB:1W9YA; ENFPIISLDKVNGVERAATEIKDACENWGFFELVNHGIPREVDTVEKTKGHYKKCEQRFK ------3333--1111----------------------3333------------------ ELVASKALEGVQAEVTDDWESTFFLKHLPISNISEVPDLDEEYREVRDFAKRLEKLAEEL ------1111---1111--------------1111------------------------- 
LDLLCENLGLEKGYLKNAFYGSKGPNFGTKVSNYPPCPKPDLIKGLRAHTDAGGIILLFQ ----------2222------------------------3333------------------ DDKVSGLQLLKDGQWIDVPPRHSIVVNLGDQLEVITNGKYKSVHRVIAQKDGARSLASFY ----------iiii-------------------1111----------------------- NPGSDAVIYPAPALVQVYPKFVFDDYKLYAGLKFQAKEPRFEAKAE --1111----3333-------3333--------------------- >VP9; SWP:Q9YWN5; PDB:1W9ZA; ALPSNVKLSKGEVEKIAVTKKEMFDELAQCNLPTIELITREHTFNGDVIRFAAWLFLMNG -----------------------3333-%%%%---3333---%%%%---------3333- QKLMIANNVAVRMGMQYATNLAGNNVKITYVTSNNVVKLGHIAAGVLANPYSNKGSGLFI -------------------1111---------iiii------------------------ TYEHNLISNQIETGKVCVLFITSLSTTASSTNSFAYSACSVPIEDWDFNMIKLTAETSCA -----------2222----------1111------------3333-3333---------- SLTAMTNLVNSLVPGERTRPVGLYVDIPGVTVTTSASSGSLPLTTIPAVTPLIFSAYTKQ --------11113333------------------------------1111---------3 VEEVGVINTLYALSYLP 333-------------- >2-KETO-3-DEOXY-6-PHOSPHOG; SWP:Q9WXS1; PDB:1WA3A; KMEELFKKHKIVAVLRANSVEEAKEKALAVFEGGVHLIEITFTVPDADTVIKELSFLKEK ------------------------------1111-----------3333----3333111 GAIIGAGTVTSVEQCRKAVESGAEFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAM 1-----------------1111-------------------------------------1 KLGHTILKLFPGEVVGPQFVKAMKGPFPNVKFVPTGGVNLDNVCEWFKAGVLAVGVGSAL 111-------3333------------1111--------3333----3333------3333 VKGTPDEVREKAKAFVEKIRGC ------------------3333 >Importin alpha re-exporte; SWP:P33307; PDB:1WA5C; MSDLETVAKFLAESVIASTAKTSERNLRQLETQDGFGLTLLHVIASTNLPLSTRLAGALF ---------------3333-------------2222---------11113333------- FKNFIKRKWVDENGNHLLPANNVELIKKEIVPLMISLPNNLQVQIGEAISSIADSDFPDR --------------------------------------3333--------------2222 WPTLLSDLASRLSNDDMVTNKGVLTVAHSIFKRWRPLFRSDELFLEIKLVLDVFTAPFLN 3333----1111-------------------3333------------------------- LLKTVDEQITANEKASLNILFDVLLVLIKLYYDFNCQDIPEFFEDNIQVGMGIFHKYLSY -----------------------------------------------------3333--- SNPLLEHASVLIKVKSSIQELVQLYTTRYEDVFGPMINEFIQITWNLLTSISNQPKYDIL 
---------------------------------1111----------------------- VSKSLSFLTAVTRIPKYFEIFNNESAMNNITEQIILPNVTLREEDVELFEDDPIEYIRRD --------------1111----------------3333---11113333----------- LEGTRRRACTDFLKELKEKNEVLVTNIFLAHMKGFVDQYMSNWKFKDLYIYLFTALAING -------------------------------------33333333--------------- NITNAGVSSTNNLLNVVDFFTKEIAPDLTSNNIPHIILRVDAIKYIYTFRNQLTKAQLIE --1111----1111---------3333---------------------3333-------- LMPILATFLQTDEYVVYTYAAITIEKILTIRESNTSPAFIFHKEDISNSTEILLKNLIAL -------1111---------------------1111-----33333333----------- ILKHGSSPEKLAENEFLMRSIFRVLQTSEDSIQPLFPQLLAQFIEIVTIMAKNPSNPRFT -1111-3333-----------------!!!!3333--------------3333------- HYTFESIGAILNYTQRQNLPLLVDSMMPTFLTVFSEDIQEFIPYVFQIIAFVVEQSATIP ----------11113333---------------11113333------------------3 ESIKPLAQPLLAPNVWELKGNIPAVTRLLKSFIKTDSSIFPDLVPVLGIFQRLIASKAYE 333333333333333--3333-------------------------------1111---- VHGFDLLEHIMLLIDMNRLRPYIKQIAVLLLQRLQNSKTERYVKKLTVFFGLISNKLGSD --------------33333333--------3333-------------------------- FLIHFIDEVQDGLFQQIWGNFIITTLPTIGNLLDRKIALIGVLNMVINGQFFQSKYPTLI ---------2222--------33331111--------------------------1111- SSTMNSIIETASSQSIANLKNDYVEEISTFGSHFSKLVSISEKPFDPLPEIDVNNGVRLY --------------3333----------2222----3333-------1111--------- VAEALNKYNAISGNTFLNTILPQLTQENQVKLNQLLVG ---------------33333333--------------- >ESAT-6 LIKE PROTEIN ESXB; SWP:P0A567; PDB:1WA8A; AEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQAAVVRFQEA -----3333----------------------------1111------------------- ANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF --------------3333--------1111-3333---- >6 kDa early secretory ant; SWP:P0A564; PDB:1WA8B; MTEQQWNFAGIEAAASAIQGNVTSIHSLLDEGKQSLTKLAAAWGGSGSEAYQGVQQKWDA ------1111----------------------------3333------------------ TATELNNALQNLARTISEAGQAMASTEGNVTGMFA ---------------------333322223333-- >PERIOD CIRCADIAN PROTEIN; SWP:PER_DROME; PDB:1WA9A; EDSFCCVISMHDGIVLYTTPSITDVLGYPRDMWLGRSFIDFVHLKDRATFASQITTGIAK 
--------------------3333------3333--3333--1111-------------- STFCVMLRRYRGLKSGGFGVIGRPVSYEPFRLGLTFREAPEEARSNGTNMLLVICATPIK ------------1111-------------------------------------------- SSYKVPDEILSQKSPKFAIRHTATGIISHVDSAAVSALGYLPQDLIGRSIMDFYHHEDLS ---------------------1111-----3333------3333----3333--1111-- VMKETYETVMKKGQTAGASFCSKPYRFLIQNGCYVLLETEWTSFVNPWSRKLEFVVGHHR -3333---------2222------------------------------------------ VFQGPKQCNVFEAAPTCKLKISEEAQSRNTRIKEDIVKRLAETVSRPSDTVKQEVSRRCQ --------1111-------------------------1111---------3333------ ALASFMETLMDEVSRADL ------------------ >TITIN; SWP:Q8WZ42; PDB:1WAAA; ALIEVEKPLYGVEVFVGETAHFEIELSEPDVHGQWKLKGQPLAASPDCEIIEDGKKHILI --------------2222------------------iiii----1111----!!!!---- LHNCQLGMTGEVSFQAANTKSAANLKVKEL ----3333---------------------- >CYTOCHROME C3; SWP:P00133; PDB:1WAD; VDVPADGAKIDFIAGGEKNLTVVFNHSTHKDVKCDDCHHDPGDKQYAGCTTDGCHNILDK ------------------------333311111111-----1111--1111-------33 ADKSVNSWYKVVHDAKGGAKPTCISCHKDKAGDDKELKKKLTGCKGSACHP 33-1111-----------------------!!!!------------3333- >SERINE/THREONINE-PROTEIN ; SWP:Q96SB4; PDB:1WAKA; CKYHLVKIGDLFNGRYHVIRKLGWGHFSTVWLSWDIQGKKFVAMKVVKSAEHYTETALDE ------2222--------------1111-------1111--------------------- IRLLKSVRNSDPNDPNREMVVQLLDDFKISGVNGTHICMVFEVLGHHLLKWIIKSNYQGL ----------11113333------------1111------------3333----%%%%-- PLPCVKKIIQQVLQGLDYLHTKCRIIHTDIKPENILLSVNEQYIRRLAAEATAGNFLVNP ------------------------------3333------------------------11 LEPKNAEKLKVKIADLGNACWVHKHFTEDIQTRQYRSLEVLIGSGYNTPADIWSTACMAF 111111---------1111-1111--------1111------------------------ ELATGDYLFEPHSGEEYTRDEDHIALIIELLGKVPRKLIVAGKYSKEFFTKKGDLKHITK -------------1111-----------------33331111-3333--1111------- LKPWGLFEVLVEKYEWSQEEAAGFTDFLLPMLELIPEKRATAAECLRHPWLNS ---------------------------3333---3333--3333---3333-- >TRP RNA-BINDING ATTENUATI; SWP:P19466; PDB:1WAPA; DFVVIKAVEDGVNVIGLTRGTDTKFHHSEKLDKGEVIIAQFTEHTSAIKVRGEALIQTAY 
-------------------------------2222------1111------------111 GEMKSEKK 1------- >GROWTH/DIFFERENTIATION FA; SWP:P43026; PDB:1WAQA; ARCSRKALHVNFKDMGWDDWIIAPLEYEAFHCEGLCEFPLRSHLEPTNHAVIQTLMNSMD ----------1111--3333--------------------3333---------------3 PESTPPTCCVPTRLSPISILFIDSANNVVYKQYEDMVVESCGCR 333-------------------1111------------------ >CHITOTRIOSIDASE 1; SWP:Q13231; PDB:1WB0A; AKLVCYFTNWAQYRQGEARFLPKDLDPSLCTHLIYAFAGMTNHQLSTTEWNDETLYQEFN ---------3333-!!!!--3333-1111-----------%%%%----1111-------- GLKKMNPKLKTLLAIGGWNFGTQKFTDMVATANNRQTFVNSAIRFLRKYSFDGLDLDWEY -33331111-------3333---------------------------------------2 PGSQGSPAVDKERFTTLVQDLANAFQQEAQTSGKERLLLSAAVPAGQTYVDAGYEVDKIA 222---3333--------------------------------------------3333-1 QNLDFVNLMAYDFHGSWEKVTGHNSPLYKRQEESGAAASLNVDAAVQQWLQKGTPASKLI 111-----------1111-----------3333-3333----------------3333-- LGMPTYGRSFTLASSSDTRVGAPATGSGTPGPFTKEGGMLAYYEVCSWKGATKQRIQDQK -------------1111-2222-------------2222-3333---------------- VPYIFRDNQWVGFDDVESFKTKVSYLKQKGLGGAMVWALDLDDFAGFSCNQGRYPLIQTL -----!!!!-----------------1111-------1111-1111-------------- RQELS ----- >TRANSLATION ELONGATION FA; SWP:NA; PDB:1WB1A; HMDFKNINLGIFGHIDHGKTTLSKVLTEIAGFSAFKLENYRITLVDAPGHADLIRAVVSA -------------2222----3333-3333------------------------------ ADIIDLALIVVDAKEGPKTQTGEHMLILDHFNIPIIVVITKSDNAGTEEIKRTEMIMKSI --------------------------3333------------------------------ LQSTHNLKNSSIIPISAKTGFGVDELKNLIITTLNNAEIIRNTESYFKMPLDHAFPIKGA -------------------------3333------------------------------- GTVVTGTINKGIVKVGDELKVLPINMSTKVRSIQYFKESVMEAKAGDRVGMAIQGVDAKQ ----------------------------------%%%%---------------------- IYRGILTSKDTKLQTVDKIVAKIKISDIFKYNLTPKMKVHLNVGMLIVPAVAVPFKKVTF -------!!!!------------------------------------------------- GKTEENIILNEVISGNEYAFELEEKVLAEVGDRVLITRLDLPPTTLRIGHGLIEEFKPIK -------------------------------------3333----------------333 DLNIKKEVLREGKVKIDKGRTVIDGLAQSKVAAEKLIGEEISIEGKDIVGKIKGTFGTKG 
3-------------------------------3333------------------------ LLTAEFSGNVENRDKVILNRLRRWG ------------------------- >ENDO-1,4-BETA-XYLANASE Y; SWP:P51584; PDB:1WB4A; SFKYESAVQYRPAPDSYLNPCPQAGRIVKETYTGINGTKSLNVYLPYGYDPNKKYNIFYL -------------3333----------------1111--------22221111------- MHGGGENENTIFSNDVKLQNILDHAIMNGELEPLIVVTPTFNGGNCTAQNFYQEFRQNVI --22221111--------------------------------!!!!3333--------33 PFVESKYSTYAESTTPQGIAASRMHRGFGGFAMGGLTTWYVMVNCLDYVAYFMPLSGDYW 33-------------------1111-------------------3333------------ YGNSPQDKANSIAEAINRSGLSKREYFVFAATGSEDIAYANMNPQIEAMKALPHFDYTSD ----------------3333-1111-------11113333-----------3333----- FSKGNFYFLVAPGATHWWGYVRHYIYDALPYFFHELEHHHHHH ----------------3333-------3333------------ >SUPEROXIDE DISMUTASE [FE]; SWP:P80857; PDB:1WB7A; IQFKKYELPPLPYKIDALEPYISKDIIDVHYNGHHKGFVNGANSLLERLEKVVKGDLQTG -------------1111----------------------------------------222 QYDIQGIIRGLTFNINGHKLHALYWENMAPSGKGGGKPGGALADLINKQYGSFDRFKQVF 2-3333------------------1111-------------------------------- TETANSLPGTGWAVLYYDTESGNLQIMTFENHFQNHIAEIPIILILDEFEHAYYLQYKNK ---1111-----------------------------2222--------3333----!!!! 
RADYVNAWWNVVNWDAAEKKLQKYL ------3333-----------1111 >DNA MISMATCH REPAIR PROTE; SWP:P23909; PDB:1WB9A; SAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYTLFYDDAKRASQLLDISLTKRGAS ----3333--------------1111-----!!!!------------------------- AGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISD ----------3333--------1111----------3333-------------1111--3 EALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYA 333-1111---------1111--------------------------------------1 EDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLL 111-33332222------3333------------------3333-1111----------- QYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTP ----------1111------1111----------------1111----3333-------- MGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTAR -----------------------------1111--------3333--------------3 PRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRD 333-----------------1111---------3333--------------------111 GGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQ 1---2222------------------------------1111--------------3333 SHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEA 11113333---------------------------------------------3333--- LQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPL ---------------------1111------------------3333------------- NLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGFMV --1111-------2222--------------1111------------------------- EMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFEL ------------1111----------------------------------------3333 TQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQK -3333-2222----------------------------------3333-3333------- LRELESIS ---1111- >WINGED BEAN ALBUMIN 1; SWP:P15465; PDB:1WBA; DDPVYDAEGNKLVNRGKYTIVSFSDGAGIDVVATGNENPEDPLSIVKSTRNIMYATSISS -----1111---2222-----------------!!!!1111------------------- EDKTPPQPRNILENMRLKINFATDPHKGDVWSVVDFQPDGQQLKLAGRYPNQVKGAFTIQ -----------2222----------2222------------------------------- 
KGSNTPRTYKLLFCPVGSPCKNIGISTDPEGKKRLVVSYQSDPLVVKFHRH ----2222-------------------1111-------------------- >GLYCOLIPID TRANSFER PROTE; SWP:P68265; PDB:1WBEA; EHLLRPLPADKQIETGPFLEAVSHLPPFFDCLGSPVFTPIKADISGNITKIKAVYDTNPT -------1111-------------3333-1111---3333------------------33 KFRTLQNILEVEKEMYGAEWPKVGATLALMWLKRGLRFIQVFLQSICDGERDENHPNLIR 33-------------!!!!--------------------------1111--3333----- VNATKAYEMALKKYHGWIVQKIFQAALYAAPYKSDFLKALSKGQNVTEEECLEKVRLFLV ----------3333------------1111------------------------------ NYTATIDVIYEMYTRMNAELNYKV ------------------------ >AGGLUTININ; SWP:O24313; PDB:1WBFA; KTISFNFNQFHQNEEQLKLQRDARISSNSVLELTKVVNGVPTWNSTGRALYAKPVQVWDS ----------2222-----------1111-------iiii-------------------- TTGNVASFETRFSFSIRQPFPRPHPADGLVFFIAPPNTQTGEGGGYFGIYNPLSPYPFVA ------------------------------------------!!!!----1111------ VEFDTFRNTWDPQIPHIGIDVNSVISTKTVPFTLDNGGIANVVIKYDASTKILHVVLVFP -------1111-----------------------2222---------1111--------3 SLGTIYTIADIVDLKQVLPESVNVGFSAATGDPSGKQRNATETHDILSWSFSASLPG 333---------3333---------------1111-1111----------------- >KHG/KDPG ALDOLASE; SWP:P0A955; PDB:1WBHA; MKNWKTSAESILTTGPVVPVIVVKKLEHAVPMAKALVAGGVRVLNVTLRTECAVDAIRAI ------------------------3333--------1111---------1111------- AKEVPEAIVGAGTVLNPQQLAEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSE ---1111----------------------------------------------------- LMLGMDYGLKEFKFFPAEANGGVKALQAIAGPFSQVRFCPTGGISPANYRDYLALKSVLC ----1111---------------------1111-----------3333-3333-1111-- IGGSWLVPADALEAGDYDRITKLAREAVEGAKL ---3333---------------------3333- >AVIDIN-RELATED PROTEIN 2; SWP:P56732; PDB:1WBIA; ARKCSLTGEWDNDLGSIMTIGAVNDNGEFDGTYITAVADNPGNITLSPLLGIQHKRASQP -2222------1111--------1111------------3333----------------- TFGFTVHWNFSESTSVFVGQCFVDRSGKEVLKTKWLQRLAVDDISDDWIATRVGNNDFTR -----------------------1111---------------33331111---------- QHT --- >BETA-2MICROGLOBULIN; SWP:P01899; PDB:1WBXA; PHSMRYFETAVSRPGLEEPRYISVGYVDNKEFVRFDSDAENPRYEPRAPWMEQEGPEYWE 
------------2222----------iiii-----1111--------3333--------- RETQKAKGQEQWFRVSLRNLLGYYNQSAGGSHTLQQMSGCDLGSDWRLLRGYLQFAYEGR ------------------------------------------1111----------iiii DYIALNEDLKTWTAADMAAQITRRKWEQSGAAEHYKAYLEGECVEWLHRYLKNGNATLLR -----3333------3333-------------------------------------1111 TDSPKAHVTHHPRSKGEVTLRCWALGFYPADITLTWQLNGEELTQDMELVETRPAGDGTF -------------2222--------------------iiii-1111-------------- QKWASVVVPLGKEQNYTCRVYHEGLPEPLTLRWEP --------22221111-----1111---------- >ENDOGLUCANASE; SWP:P82186; PDB:1WC2A; NQKCSGNPRRYNGKSCASTTNYHDSHKGACGCGPASGDAQFGWNAGSFVAAASQMYFDSG ----------iiii-------------1111---------1111-------------111 NKGWCGQHCGQCIKLTTTGGYVPGQGGPVREGLSKTFMITNLCPNIYPNQDWCNQGSQYG 1----1111------------2222----------------------------------- GHNKYGYELHLDLENGRSQVTGMGWNNPETTWEVVNCDSEHNHDHRTPSNSMYGQCQCAH --1111--------1111--1111---------------33333333-3333-------- >ADENYLATE CYCLASE; SWP:O32393; PDB:1WC3A; SHMRPEPRLITILFSDIVGFTRMSNALQSQGVAELLNEYLGEMTRAVFENQGTVDKFVGD -------------------33331111-3333---------------1111------!!! 
AIMALYGAPEEMSPSEQVRRAIATARQMLVALEKLNQGWQERGLVGRNEVPPVRFRCGIH !----------------------------------------------------------- QGMAVVGLFGSQERSDFTAIGPSVNIAARLQEATAPNSIMVSAMVAQYVPDEEIIKREFL ----------3333------3333-----------------333311113333------- ELKGIDEPVMTCVINPNML -2222-------------- >THIOGLUCOSIDASE; SWP:Q95X01; PDB:1WCGA; YKFPKDFMFGTSTASYQIEGGWNEDGKGENIWDRLVHTSPEVIKDGTNGDIACDSYHKYK ---1111------3333------iiii-----------33331111----!!!!------ EDVAIIKDLNLKFYRFSISWARIAPSGVMNSLEPKGIAYYNNLINELIKNDIIPLVTMYH ------------------3333-1111--------------------1111--------- WDLPQYLQDLGGWVNPIMSDYFKEYARVLFTYFGDRVKWWITFNEPIAVCKGYSIKAYAP ---3333---!!!!---------------------------------------------- NLNLKTTGHYLAGHTQLIAHGKAYRLYEEMFKPTQNGKISISISGVFFMPKNAESDDDIE ---------------------------------------------------1111----- TAERANQFERGWFGHPVYKGDYPPIMKKWVDQKSKEEGLPWSKLPKFTKDEIKLLKGTAD ----------------------3333--------1111----------------2222-- FYALNHYSSRLVTFGSDPNPNFNPDASYVTSVDEAWLKPNETPYIIPVPEGLRKLLIWLK ------------------33333333------3333-----------3333--------- NEYGNPQLLITENGYGDDGQLDDFEKISYLKNYLNATLQAMYEDKCNVIGYTVWSLLDNF ------------------------------------------------------------ EWFYGYSIHFGLVKIDFNDPQRTRTKRESYTYFKNVVSTGKP !!!!-----------1111----------------------- >PROTEIN TYROSINE PHOSPHAT; SWP:Q12923; PDB:1WCHA; HSFLTNDELAVLPVVKVLPSGKYTGANLKSVIRVLRGLLDQGIPSKELENLQELKPLDQC --------------------------------------------------1111------ LIGQTKENRRKNRYKNILPYDATRVPLGDEGGYINASFIKIPVGKEEFVYIACQGPLPTT 333333331111-1111--3333----1111-----------!!!!----------1111 VGDFWQMIWEQKSTVIAMMTQEVEGEKIKCQRYWPNILGKTTMVSNRLRLALVRMQQLKG -----------------------!!!!---------2222----1111---------222 FVVRAMTLEDIQTREVRHISHLNFTAWPDHDTPSQPDDLLTFISYMRHIHRSGPIITHCS 2--------------------------2222----------------------------- AGIGRSGTLICIDVVLGLISQDLDFDISDLVRCMRLQRHGMVQTEDQYIFCYQVILYVLT ------------------1111-----------11112222------------------- RLQAEEEQ -------- >BCLA PROTEIN; SWP:Q83WA6; PDB:1WCKA; 
GLGLPAGLYAFNSGGISLDLGINDPVPFNTVGSQFGTAISQLDADTFVISETGFYKITVI --------------------2222------------------1111-------------- ANTATASVLGGLTIQVNGVPVPGTGSSLISLGAPIVIQAITQITTTPSLVEVIVTGLGLS ---------------iiii-2222-----2222--------------------------- LALGTSASIIIEKVAH ---------------- >TRANSCRIPTION ELONGATION ; SWP:P0AFF9; PDB:1WCNA; GDNKPADDLLNLEGVDRDLAFKLAARGVCTLEDLAEQGIDDLADIEGLTDEKAGALIMAA 3333-------2222-------1111---3333111133333333--------------- RNICWFGDEA ------3333 ------------------------------------------------------------ ------------------------------------------- >NON-CATALYTIC PROTEIN 1; SWP:Q9C171; PDB:1WCUA; VSATYSVVYETGKKLNSGFDNWGWDSKMSFKDNSLVLTADPDEYGAISLKNLNSNYYGKG ---------------2222-----------%%%%-----2222----------------- GCIYLQVKTETEGLVKVQGVRGYDETEAFNVGSFRSSSDFTEYKFEVDDEYQFDRIIVQD -------------------22223333--------------------3333--------1 GPASNIPIYMRYIIYSTGSCDDHILEHHH 111---------------3333-3333-- >Putative partitioning pro; SWP:Q72H90; PDB:1WCV1; KVRRIALANQKGGVGKTTTAINLAAYLARLGKRVLLVDLDPQGNATSGLGVRAERGVYHL ------------3333-----------1111--------33333333------------1 LQGEPLEGLVHPVDGFHLLPATPDLVGATVELAGAPTALREALRDEGYDLVLLDAPPSLS 111-3333----iiii-------3333----2222--3333---1111------------ PLTLNALAAAEGVVVPVQAEYYALEGVAGLLATLEEVRAGLNPRLRLLGILVTMYDGRTL -----------------------------------------1111----------3333- LAQQVEAQLRAHFGEKVFWTVIPRNVRLAEAPSFGKTIAQHAPTSPGAHAYRRLAEEVMA ------------!!!!-------------3333---3333-1111--------------- RVQE ---- >UROPORPHYRINOGEN III SYNT; SWP:NA; PDB:1WCWA; AVRVAYAGLRRKEAFKALAEKLGFTPLLFPVQATEKVPVPEYRDQVRALAQGVDLFLATT -------------------1111--------------3333-------1111-------- GVGVRDLLEAGKALGLDLEGPLAKAFRLARGAKAARALKEAGLPPHAVGDGTSKSLLPLL --------------------------------------1111---------33331111- PQGRGVAALQLYGKPLPLLENALAERGYRVLPLMPYRHLPDPEGILRLEEALLRGEVDAL ---------------------------------------------------1111----- AFVAAIQVEFLFEGAKDPKALREALNTRVKALAVGRVTADALREWGVKPFYVDETERLGS 
---3333-----------------------------------1111-------------- LLQGFKRALQKEVA -------------- >ARIADNE-1 PROTEIN HOMOLOG; SWP:Q9Y4X5; PDB:1WD2A; WIAANTKECPKCHVTIEKDGGCNHMVCRNQNCKAEFCWVCLGPWEPHGSAWYNCNRYNEF -----------------------------1111----------3333------------- >ALPHA-L-ARABINOFURANOSIDA; SWP:Q9C4B1; PDB:1WD3A; MGPCDIYEAGDTPCVAAHSTTRALYSSFSGALYQLQRGSDDTTTTISPLTAGGIADASAQ -------1111-------------1111---------------------2222------- DTFCANTTCLITIIYDQSGNGNHLTQAPPGGFDGPDTDGYDNLASAIGAPVTLNGQKAYG -----------------------------------2222-------------iiii---- VFMSPGTGYRNNEATGTATGDEAEGMYAVLDGTHYNDACCFDYGNAETSSTDTGAGHMEA ---2222-----------!!!!--------1111------------1111---2222--- IYLGNSTTWGYGAGDGPWIMVDMENNLFSGADEGYNSGDPSISYRFVTAAVKGGADKWAI -----------------------------------3333--------------------- RGANAASGSLSTYYSGARPDYSGYNPMSKEGAIILGIGGDNSNGAQGTFYEGVMTSGYPS ---1111-------------2222-------------1111------------------- DDTENSVQENIVAAKYVVGSLVSGPSFTSGEVVSLRVTTPGYTTRYIAHTDTTVNTQVVD ---------------------------2222-------2222-------!!!!------1 DDSSTTLKEEASWTVVTGLANSQCFSFESVDTPGSYIRHYNFELLLNANDGTKQFHEDAT 111--------------1111----------2222----%%%%----------------- FCPQAALNGEGTSLRSWSYPTRYFRHYENVLYAASNGGVQTFDSKTSFNNDVSFEIETAF -----1111------3333-------%%%%---------1111-22223333------11 AS 11 >HYPOTHETICAL PROTEIN TT14; SWP:Q5SIB2; PDB:1WD5A; RFRDRRHAGALLAEALAPLGLEAPVVLGLPRGGVVVADEVARRLGGELDVVLVRKVGAPG ---------------3333-----------3333-----------------------222 NPEFALGAVGEGGELVLPYALRYADQSYLEREAARQRDVLRKRAERYRRVRPKAARKGRD 2--------1111-----3333-------------------------1111----2222- VVLVDDGVATGASEAALSVVFQEGPRRVVVAVPVASPEAVERLKARAEVVALSVPQDFAA -------------------------------------------1111------------3 VGAYYLDFGEVTDEDVEAILLEWAG 333---------------3333--- >PROTEIN-ARGININE DEIMINAS; SWP:Q9UM07; PDB:1WD8A; AQGTLIRVTPEQPTHAVCVLGTLTQLDICSSSFSINASPGVVVDIAHSTWPLDPGVEVTL ------------------2222---------------1111-----------1111---- TMKAASGSTGDQKVQISYYGPKTPPVKALLYLTAVEISLCADITRTGKVKQRTWTWGPCG 
------------------------------------------------------------ QGAILLVNCDRDLDSEDLQDMSLMTLSTKTPKDFFTNHTLVLHVARSEMDKVRVFQATRK -------------33333333---------1111----------11111111-------- CSVVLGPKWPSHYLMVPGGKHNMDFYVEALAFPDTDFPGLITLTISLLDTSNLELPEAVV ---------------------------------1111----------------------- FQDSVVFRVAPWIMTPNTQPPQEVYACSLKSVTTLAMKAKCKLTICQDEMEIGYIQAPHK ---------------1111----------------------------------------- TLPVVFDSDFGYVTRGGLDSFGNLEVSPPVTVRGKEYPLGRILFGDSCYPSNDSRQMHQA --------------------1111------------1111----------1111---333 LQDFLSAQQVQAPVKLYSDWLSVGHVDEFLSFVPAPDRKGFRLLLASPRSCYKLFQEQQN 3----------------1111---3333--------2222-------------------- EGHGEALLFKQQKIKNILSNKTLREHNSFVERCIDWNRELLKRELGLAESDIIDIPQLFK --1111----------3333----------------------1111-3333--------- LKEFSKAEAFFPNMVNMLVLGKHLGIPKPFGPVINGRCCLEEKVCSLLEPLGLQCTFIND --%%%%------1111-----------------%%%%----------3333--------- CGTNVRRKPFSFKWWNMVP ------------3333--- >SCALLOP MYOSIN; SWP:P24733; PDB:1WDCA; RDERLSKIISMFQAHIRGYLIRKAYKKLQDQRIGLSVIQRNIRKWLVLRNWQWWKLYSKV -----------------------------------------------------------3 KPLL 333- >Myosin regulatory light c; SWP:P13543; PDB:1WDCB; LPQKQIQEMKEAFSMIDVDRDGFVSKEDIKAISEQLGRAPDDKELTAMLKEAPGPLNFTM -----------1111-1111----3333----3333----------3333------3333 FLSIFSDKLSGTDSEETIRNAFAMFDEQETKKLNIEYIKDLLENMGDNFNKDEMRMTFKE -------------------------1111----3333--------------------111 APVEGGKFDYVKFTAMIKGSGE 1--iiii--------------- >Myosin essential light ch; SWP:P07291; PDB:1WDCC; LSQDEIDDLKDVFELFDFWDGRDGAVDAFKLGDVCRCLGINPRNEDVFAVGGTHKMGEKS -----------------1111-----1111----3333----33333333---------- LPFEEFLPAYEGLMDCEQGTFADYMEAFKTFDREGQGFISGAELRHVLTALGERLSDEDV -1111-------1111------------1111-------3333----------------- DEIIKLTDLQEDLEGNVKYEDFVKKVMAGPYP -----------1111----------1111--- >Ribulose bisphosphate car; SWP:P18567; PDB:1WDDS; QVWPIEGIKKFETLSYLPPLTVEDLLKQIEYLLRSKWVPCLEFSKVGFVYRENHRSPGYY 
-----------2222----------------------------------------2222- DGRYWTMWKLPMFGCTDATQVLKELEEAKKAYPDAFVRIIGFDNVRQVQLISFIAYKPPG -----------2222-3333-----------1111--------1111----------222 C 2 >PROBABLE DIPHTHINE SYNTHA; SWP:Q9YDI2; PDB:1WDEA; EAVTLLLVGWGYAPGQTLEALDAVRRADVVYVESYTPGSSWLYKSVVEAAGEARVVEASR --------------------------------------3333------------------ RDLEERSREIVSRALDAVVAVVTAGDPVATTHSSLAAEALEAGVAVRYIPGVSGVQAARG -----3333------------------------------1111---------------33 ATLSFYRFGGTVTLPGPWRGVTPISVARRIYLNLCAGLHTTALLDVDERGVQLSPGQGVS 33-3333--------1111--------------1111---------3333---------- LLLEADREYAREAGAPALLARLPSVLVEAGAGGGHRVLYWSSLERLSTADVEGGVYSIVI -----------------3333---------iiii-----------1111----------- PARLSGVEEWLLAAASGQRRPLEYDRSVYETVEENCKKGVYEPV -----------------------------------1111----- >E2 GLYCOPROTEIN; SWP:P11224; PDB:1WDFA; QKMIASAFNNALGAIQDGFDATNSALGKIQSVVNANAEALNNLLNQLSNRFGAIDLSLDF ------------------------------------------------1111------33 EKLNVTLLDLTYEMNRIQDAIKKLNESYINL 33----------------------------- >E2 GLYCOPROTEIN; SWP:P11224; PDB:1WDGA; NQKMIASAFNNALGAIQDGFDATNSALGKIQSVVNANAEALNNLLNQLSLLNVTLLDLTY ------------------------------------------------------------ EMNRIQDAIKKLNESYINLKE -----------3333--3333 >HYPOTHETICAL PROTEIN TT09; SWP:NA; PDB:1WDIA; EGLEAYDYHLPPEQIAQEGVEPRDMARLMVVYREGPFRVAHKRVRDLPEFLRPGDVLVFN !!!!------3333-------1111-----------------33333333-2222----- ESKVIPARLLARKPTGGKVEILLVRERALLGPARKAPPGTRLLLLSPKDLAPVPGLQAEV ------------3333-------------------------------------------- VAVEEDLVAHLEEVGEVPAAPTAGLHFTPELLERLREMGVELRFLTLHVGPGTFRPMHAE ---------------------3333------------------------1111------- PYAIPEEVAEAVNRAKAEGRRVVAVGTTVVRALESAYREGVGVVAGEGETRLFIRPPYTF ---------------1111--------------1111----------------------- KVVDALFTNFHLPRSTLLMLVAAFLGRERTLEAYRLAVAEGYRFYSLGDAMLIL -----------2222----------------------1111---1111------ >HYPOTHETICAL PROTEIN TT18; SWP:Q5SI60; PDB:1WDJA; 
PLVLDLARPVSEEELRRLSELNPGYQWERSPEGRLWVSPTGGESGRRSLQLAYQLARWNE ---------------------2222----1111--------------------------- ERGLGVVFDSSTGFKFPDGSILSPDAAFVERGAWEALSEAEREGFPPLAPKAVFEVRSAS --------1111---1111--------------------------------------111 QDPEELRAKMGIYLRNGVLLGVLVDPYARAVEVFRPGKPPLRLEGVERVSLDPELPGFAL 1------------1111--------1111-----2222----------------2222-- SLPPLW -3333- >FATTY OXIDATION COMPLEX A; SWP:P28793; PDB:1WDKA; MIYEGKAITVTALESGIVELKFDLKGESVNKFNRLTLNELRQAVDAIKADASVKGVIVSS -------------iiii------2222----------------------1111------- GKDVFIVGADITEFVENFKLPDAELIAGNLEANKIFSDFEDLNVPTVAAINGIALGGGLE ---------3333-------3333--------------1111------------------ MCLAADFRVMADSAKIGLPEVKLGIYPGFGGTVRLPRLIGVDNAVEWIASGKENRAEDAL -1111-----1111----3333-------3333--------------------------- KVSAVDAVVTADKLGAAALDLIKRAISGELDYKAKRQPKLEKLKLNAIEQMMAFETAKGF ---------3333-----------3333-------3333--------------------- VAGQAGPNYPAPVEAIKTIQKAANFGRDKALEVEAAGFAKLAKTSASNCLIGLFLNDQEL -----1111------------1111----------------------------------- KKKAKVYDKIAKDVKQAAVLGAGIMGGGIAYQSASKGTPILMKDINEHGIEQGLAEAAKL ---------------------------------1111--------3333----------- LVGRVDKGRMTPAKMAEVLNGIRPTLSYGDFGNVDLVVEAVVENPKVKQAVLAEVENHVR ------------------3333-----1111----------------------3333--1 EDAILASNTSTISISLLAKALKRPENFVGMHFFNPVHMMPLVEVIRGEKSSDLAVATTVA 111--------------1111--1111-------1111--------1111---------- YAKKMGKNPIVVNDCPGFLVNRVLFPYFGGFAKLVSAGVDFVRIDKVMEKFGWPMGPAYL --------------2222----------------1111---------------------- MDVVGIDTGHHGRDVMAEGFPDRMKDDRRSAIDALYEAKRLGQKNGKGFYAYEKKLVDSS -----------------------------------1111---3333-------------- VLEVLKPIVYEQRDVTDEDIINWMMIPLCLETVRCLEDGIVETAAEADMGLVYGIGFPLF --3333---------------------------------------------------111 RGGALRYIDSIGVAEFVALADQYAELGALYHPTAKLREMAKNGQSFFG 1-------------------------3333------------------ >3-ketoacyl-CoA thiolase; SWP:P28790; PDB:1WDKC; SLNPRDVVIVDFGRTPMGRSKGGMHRNTRAEDMSAHLISKVLERNSKVDPGEVEDVIWGC 
--1111----------------1111------------------33333333-------- VNQTLEQGWNIARMASLMTQIPHTSAAQTVSRLCGSSMSALHTAAQAIMTGNGDVFVVGG ---!!!!-------3333---3333-------1111-----------1111--------- VEHMGHVSMMHGVDPNPHMSLYAAKASGMMGLTAEMLGKMHGISREQQDAFAVRSHQLAH --3333-1111----3333----3333-----------1111------------------ KATVEGKFKDEIIPMQGYDENGFLKIFDYDETIRPDTTLESLAALKPAFNPKGGTVTAGT -----1111---------1111-------33331111----3333-----------3333 SSQITDGASCMIVMSAQRAKDLGLEPLAVIRSMAVAGVDPAIMGYGPVPATQKALKRAGL --------------------------------------33331111---------1111- NMADIDFIELNEAFAAQALPVLKDLKVLDKMNEKVNLHGGAIALGHPFGCSGARISGTLL 3333---------3333-----11111111-----11113333----3333--------- NVMKQNGGTFGLSTMCIGLGQGIATVFERV ------------------------------ >GLUTAMINE BINDING PROTEIN; SWP:P10344; PDB:1WDNA; KLVVATDTAFVPFEFKQGDLYVGFDVDLWAAIAKELKLDYELKPMDFSGIIPALQTKNVD ---------------------------------------------3333----------- LALAGITITDERKKAIDFSDGYYKSGLLVMVKANNNDVKSVKDLDGKVVAVKSGTGSVDY --------3333-------------------1111----------------2222----- AKANIKTKDLRQFPNIDNAYMELGTNRADAVLHDTPNILYFIKTAGNGQFKAVGDSLEAQ --------------3333----1111-----------------1111------------- QYGIAFPKGSDELRDKVNGALKTLRENGTYNEIYKKWFGTEPK ------22223333----------1111--------------- >BETA-AMYLASE; SWP:P10538; PDB:1WDPA; SDSNMLLNYVPVYVMLPLGVVNVDNVFEDPDGLKEQLLQLRAAGVDGVMVDVWWGIIELK -----1111-------2222-1111---------------1111--------3333-333 GPKQYDWRAYRSLLQLVQECGLTLQAIMSFHQCGGNVGDIVNIPIPQWVLDIGESNHDIF 3----------------1111--------------2222------3333--33331111- YTNRSGTRNKEYLTVGVDNEPIFHGRTAIEIYSDYMKSFRENMSDFLESGLIIDIEVGLG --1111-------3333-----iiii--------------------1111---------2 PAGELRYPSYPQSQGWEFPGIGEFQCYDKYLKADFKAAVARAGHPEWELPDDAGKYNDVP 222-------3333-------------------------11111111-------111133 ESTGFFKSNGTYVTEKGKFFLTWYSNKLLNHGDQILDEANKAFLGCKVKLAIKVSGIHWW 33----22221111----------------------------2222-----------222 YKVENHAAELTAGYYNLNDRDGYRPIARMLSRHHAILNFTCLEMRDSEQPSDAKSGPQEL 
2-1111------------------------1111------11113333-1111------- VQQVLSGGWREDIRVAGENALPRYDATAYNQIILNARPQGVNNNGPPKLSMFGVTYLRLS --------1111------------------------1111-1111--------------3 DDLLQKSNFNIFKKFVLKMHADQDYCANPQKYNHAITPLKPSAPKIPIEVLLEATKPTLP 333----------------iiii----3333-------------------3333------ FPWLPETDMKVDG ------------- >ELONGATION FACTOR G HOMOL; SWP:NA; PDB:1WDTA; GAMIRTVALVGHAGSGKTTLTEALLYKTGAKERRGRVEEGTTTTDYTPEAKLHRTTVRTG -----------2222--------------------3333--------------------- VAPLLFRGHRVFLLDAPGYGDFVGEIRGALEAADAALVAVSAEAGVQVGTERAWTVAERL -----iiii---------3333-------1111-------3333--3333---------- GLPRMVVVTKLDKGGDYYALLEDLRSTLGPILPIDLPLYEGGKWVGLIDVFHGKAYRYEN ---------3333--------------------------iiii------1111-----%% GEEREAEVPPEERERVQRFRQEVLEAIVETDEGLLEKYLEGEEVTGEALEKAFHEAVRRG %%------3333--------------3333--------------------------1111 LLYPVALASGEREIGVLPLLELILEALPSPTERFGDGPPLAKVFKVQVDPFMGQVAYLRL ---------1111-3333----------3333---------------------------- YRGRLKPGDSLQSEAGQVRLPHLYVPMGKDLLEVEEAEAGFVLGVPKAEGLHRGMVLWQG -----2222---1111----------!!!!-------2222------11112222----- EKPESEEVPFARLPDPNVPVALHPKGRTDEARLGEALRKLLEEDPSLKLERQEETGELLL ---3333----------------------------------------------------- WGHGELHLATAKERLQDYGVEVEFSVPKVPYRETIKKVAEGQGKYKKQTGGHGQYGDVWL ---------------1111----------------------------------------- RLEPASEYGFEWRITGGVIPSKYQEAIEEGIKEAAKKGVLAGFPVMGFKAIVYNGSYHEV --------------%%%%-3333----------3333----------------------- DSSDLAFQIAASLAFKKVMAEAHPVLLEPIYRLKVLAPQERVGDVLSDLQARRGRILGME -----------------------------------------1111--3333--------- QEGALSVVHAEVPLAEVLEYYKALPGLTGGAGAYTLEFSHYAEVPPHLAQRIVQERAQEG ------------1111--11113333-iiii-----------------------3333-- >TRAS1 ORF2P; SWP:NA; PDB:1WDUA; PPYRVLQANLQRKKLATAELAIEAATRKAAIALIQEPYVKGFRGVRVFQSTAQGDGTVKA ------------------------1111------------------------1111---- AIAVFDHDLDVIQYPQLTTNNIVVVGIRTRAWEITLVSYYFEPDKPIESYLEQIKRVERK -----1111----3333-1111------3333---------------------------- 
MGPKRLIFGGDANAKSTWWGSKEDDARGDQLMGTLGELGLHILNEGDVPTFDTRYQSRVD ---------------3333----------------------------------------- VTFCTEDMLDLIDGWRVDEDLVSSDHNGMVFNIRLQK ----3333----------------------------- >HYPOTHETICAL PROTEIN APE2; SWP:Q9Y8U3; PDB:1WDVA; EKVEEWIKARGLTWRLLIQKPTRTVAEAAALLGVSESEIVKTLIVLDNAGGVYAVVIPGD -------1111-----------------------3333-------------------111 KRLNINSKELAGKPVRLARANEVVELTGYPVGGVPPVALPPNIVLVVDRILLSRKKVYGG 1--3333----------------------1111------1111----3333--------- GGRENALLEFSPRELVEATGAVVADVSE --1111---------------------- >2-5A-DEPENDENT RIBONUCLEA; SWP:Q05823; PDB:1WDYA; AAVEDNHLLIKAVQNEDVDLVQQLLEGGANVNFQEEEGGWTPLHNAVQMSREDIVELLLR ------------------------1111-1111-------------1111--------11 HGADPVLRKKNGATPFLLAAIAGSVKLLKLFLSKGADVNECDFYGFTAFMEAAVYGKVKA 11------1111------------------------1111-1111--------------- LKFLYKRGANVNLRRKTKEDQERLRKGGATALMDAAEKGHVEVLKILLDEMGADVNACDN ----1111-1111--------1111----------------------------1111-11 MGRNALIHALLSSDDSDVEAITHLLLDHGADVNVRGERGKTPLILAVEKKHLGLVQRLLE 11------1111-------------1111-----------------1111-------333 QEHIEINDTDSDGKTALLLAVELKLKKIAELLCKRGASTDCGDLV 3---1111-1111-------------------3333--------- >ALKYL HYDROPEROXIDE REDUC; SWP:O87200; PDB:1WE0A; SLIGTEVQPFRAQAFQSGKDFFEVTEADLKGKWSIVVFYPADFSFVCPTELEDVQKEYAE -2222--------------------3333-----------------------------33 LKKLGVEVYSVSTDTHFVHKAWHENSPAVGSIEYIMIGDPSQTISRQFDVLNEETGLADR 33--------------------------1111------1111---1111---1111---- GTFIIDPDGVIQAIEINADGIGRDASTLINKVKAAQYVRENPGEVC -----1111--------------3333---------33332222-- >HEME OXYGENASE 1; SWP:P72849; PDB:1WE1A; SVNLASQLREGTKKSHSMAENVGFVKCFLKGVVEKNSYRKLVGNLYFVYSAMEEEMAKFK ---------------------------------3333-----------------333311 DHPILSHIYFPELNRKQSLEQDLQFYYGSNWRQEVKISAAGQAYVDRVRQVAATAPELLV 11--3333-1111--------------1111-----------------------3333-- AHSYTRYLGDLSGGQILKKIAQNAMNLHDGGTAFYEFADIDDEKAFKNTYRQAMNDLPID -------------------------------3333------------------1111--- 
QATAERIVDEANDAFAMNMKMFNELEGNLIKAIGIMVFNSLT ------------------------------------------ >10 kDa chaperonin; SWP:P61492; PDB:1WE3O; KTVIKPLGDRVVVKRIEEEPKTKGGIVLPDTAKEKPQKGKVIAVGTGRVLENGQRVPLEV ---------------------1111----------------------------------- KEGDIVVFAKYGGTEIEIDGEEYVILSERDLLAVLQ --------------------------3333------ >SPLICING FACTOR, PUTATIVE; SWP:Q8RXF1; PDB:1WE6A; GSSGSSGKFDESALVPEDQFLAQHPGPATIRVSKPNENDGQFMEITVQSLSENVGSLKEK ---------------3333-3333------------------------33333333---- IAGEIQIPANKQKLSGKAGFLKDNMSLAHYNVGAGEILTLSLRERSGPSSG 3333---3333----3333--3333-3333--------------------- >SF3A1 PROTEIN; SWP:Q8K4Z5; PDB:1WE7A; GSSGSSGTEDSLMPEEEFLRRNKGPVSIKVQVPNMQDKTEWKLNGQGLVFTLPLTDQVSV -----------------3333--------------------------------------- IKVKIHEATGMPAGKQKLQYEGIFIKDSNSLAYYNMASGAVIHLALKERSGPSSG -------------------------33333333---------------------- >TUDOR AND KH DOMAIN CONTA; SWP:Q80VL1; PDB:1WE8A; GSSGSSGILTENTPVFEQLSVPQRSVGRIIGRGGETIRSICKASGAKITCDKESEGTLLL ---------------------1111--------3333----------------------- SRLIKISGTQKEVAAAKHLILEKVSEDEELRKRIAHSASGPSSG -------------------------------------------- >PHD FINGER FAMILY PROTEIN; SWP:O81488; PDB:1WE9A; GSSGSSGQCGACGESYAADEFWICCDLCEMWFHGKCVKITPARAEHIKQYKCPSCSNKSG ----------------------------------1111---3333------3333----1 PSSG 111- >PHD FINGER FAMILY PROTEIN; SWP:Q9C810; PDB:1WEEA; GSSGSSGMERGVDNWKVDCKCGTKDDDGERMLACDGCGVWHHTRCIGINNADALPSKFLC ------------------3333---------------------3333-3333-------3 FRCIELSGPSSG 333--------- >CONSERVED HYPOTHETICAL PR; SWP:Q5SLJ9; PDB:1WEHA; RLLAVFVSSRLSPEDPLYARWVRYGEVLAEEGFGLACGGYQGGEALARGVKAKGGLVVGV -----------1111-----------------------------------1111------ TAPAFFPERRGPNPFVDLELPAATLPQRIGRLLDLGAGYLALPGGVGTLAELVLAWNLLY -33333333---1111-------------------------------------------- LRRGVGRPLAVDPYWLGLLKAHGEIAPEDVGLLRVVADEEDLRRFLRSL -----------3333------!!!!33333333------------1111 >Cytochrome c; SWP:P00004; PDB:1WEJH; 
EVQLQQSGAELVKPGASVKLSCTASGFNIKDTYMHWVKQRPEKGLEWIGRIDPASGNTKY ------------2222-----------1111--------2222---------1111---- DPKFQDKATITADTSSNTAYLQLSSLTSEDTAVYYCAGYDYGNFDYWGQGTTLTVSSAET 3333--------1111----------3333------------------------------ TPPSVYPLAPGTAALKSSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYT ----------3333-----------------------%%%%-------------%%%%-- LTSSVTVPSSTWPSQTVTCNVAHPASSTKVDKKIVPRNCGGDC -------1111-----------3333----------2222--- >Pterin-mimicking anti-idi; SWP:Q920E6; PDB:1WEJL; DIQMTQSPASLSASVGETVTITCRASGNIHNYLAWYQQKQGKSPQLLVYNAKTLADGVPS -------------2222-----------iiii------2222------------222233 RFSGSGSGTQYSLKINSLQPEDFGSYYCQHFWSTPWTFGGGTKLEIKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT -----------------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -33331111--------3333--------1111- >HYPOTHETICAL PROTEIN TT14; SWP:Q5SHT6; PDB:1WEKA; PLIDQLHHEDSWRLFRILAEFVEGFETLSELQVPLVSVFGSARFGEGHPAYEAGYRLGRA ----------3333------------3333--------------2222------------ LAEAGFGVVTGGGPGVEAVNRGAYEAGGVSVGLNIELPHEQKPNPYQTHALSLRYFFVRK ----------------------3333---------------------------------- VLFVRYAVGFVFLPGGFGTLDELSEVLVLLQTEKVHRFPVFLLDRGYWEGLVRWLAFLRD ---1111----------------------1111----------3333------------- QKAVGPEDLQLFRLTDEPEEVVQALKA ----11113333----3333------- >RNA-BINDING PROTEIN 12; SWP:Q9NTZ6; PDB:1WELA; GSSGSSGKSPSGQKRSRSRSPHEAGFCVYLKGLPFEAENKHVIDFFKKLDIVEDSIYIAY ---------------------------------11113333----------3333----- GPNGKATGEGFVEFRNEADYKAALCRHKQYMGNRFIQVHPITKKGMLEKIDMIRKRLQSG 1111------------3333-3333----------------------------1111--- PSSG ---- >DEATH ASSOCIATED TRANSCRI; SWP:Q8C9B9; PDB:1WEMA; GSSGSSGECEVYDPNALYCICRQPHNNRFMICCDRCEEWFHGDCVGISEARGRLLERNGE ------------3333--1111-------------------------------------- DYICPNCTILSGPSSG ---33333333----- >INHIBITOR OF GROWTH FAMIL; SWP:Q8C0D7; PDB:1WENA; 
GSSGSSGDMPVDPNEPTYCLCHQVSYGEMIGCDNPDCSIEWFHFACVGLTTKPRGKWFCP ------------------3333-----------3333------3333------------- RCSQESGPSSG ----------- >CELLULOSE SYNTHASE, CATAL; SWP:Q9SWW6; PDB:1WEOA; GSSGSSGPKPLKNLDGQFCEICGDQIGLTVEGDLFVACNECGFPACRPCYEYERREGTQN ---------------------------------------------1111-3333------ CPQCKTRYKRLRGSPRVEGDEDEEDIDSGPSSG --------------------------------- >PHF8; SWP:NA; PDB:1WEPA; GSSGSSGMALVPVYCLCRQPYNVNHFMIECGLCQDWFHGSCVGIEEENAVDIDIYHCPDC -------------------------------------3333---33331111-------1 EAVFGPSIMKNWHSGPSSG 111---------------- >PHD FINGER PROTEIN 7; SWP:Q9DAG9; PDB:1WEQA; GSSGSSGELEPGAFSELYQRYRHCDAPICLYEQGRDSFEDEGRWRLILCATCGSHGTHRD ------------------------------3333------------------------33 CSSLRPNSKKWECNECLPASGPSSG 33-----------3333-------- >P120GAP; SWP:P20936; PDB:1WER; MPEEEYSEFKELILQKELHVVYALSHVCGQDRTLLASILLRIFLHEKLESLLLCTLNDRE -3333---------3333-------------------------1111------------- ISMEDEATTLFRATTLASTLMEQYMKATATQFVHHALKDSILKIMESKQSCELSPSKLEK 1111-3333----------------------------------1111------3333--- NEDVNTNLTHLLNILSELVEKIFMASEILPPTLRYIYGCLQKSVQHKWPTNTTMRTRVVS ------------------------3333-------------------1111-3333---- GFVFLRLICPAILNPRMFNIISDSPSPIAARTLILVAKSVQNLANLVEFGAKEPYMEGVN -------------3333-------------------------1111---11111111--- PFIKSNKHRMIMFLDELGNVPELPDTTEHSRTDLSRDLAALHEICVAHSDELRTLSNERG --------------3333-----------------------------------------3 AQQHVLKKLLAITELLQQKQNQYT 333--------------------- >INHIBITOR OF GROWTH FAMIL; SWP:Q8C0D7; PDB:1WEUA; GSSGSSGSPEYGMPSVTFGSVHPSDVLDMPVDPNEPTYCLCHQVSYGEMIGCDNPDCSIE --------------------------------------3333-----------3333--- WFHFACVGLTTKPRGKWFCPRCSQESGPSSG ------------------------------- >RIKEN CDNA 1110020M19; SWP:Q9D168; PDB:1WEVA; GSSGSSGADDFAMEMGLACVVCRQMTVASGNQLVECQECHNLYHQDCHKPQVTDKEVNDP ----------3333--------------------------------------------11 RLVWYCARCTRQMKRMAQKNQKSGPSSG 11---3333------%%%%--------- >DNA-BINDING FAMILY PROTEI; SWP:Q680Q4; 
PDB:1WEWA; GSSGSSGEDPFQPEIKVRCVCGNSLETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPL ------------------3333-------------------------------------- PESFYCEICRLTSGPSSG -----3333--------- >HYPOTHETICAL PROTEIN (RIK; SWP:Q921F4; PDB:1WEXA; GSSGSSGSHHKVSVSPVVHVRGLCESVVEADLVEALEKFGTICYVMMMPFKRQALVEFEN -----------------------------3333--3333--------------------- IDSAKECVTFAADVPVYIAGQQAFFNYSTSKRITRPGNSGPSSG -----------------%%%%----------------------- >CALCIPRESSIN 1; SWP:Q9JHG6; PDB:1WEYA; GSSGSSGLIACVANDDVFSESETRAKFESLFRTYDKDTTFQYFKSFKRVRINFSNPLSAA -------------3333-------------3333-------------------------- DARLRLHKTEFLGKEMKLYFAQTLHIGSSHLAPPNPDKSGPSSG --1111----iiii------------------------------ >HETEROGENEOUS NUCLEAR RIB; SWP:P55795; PDB:1WEZA; GSSGSSGSSFQSTTGHCVHMRGLPYRATENDIYNFFSPLNPMRVHIEIGPDGRVTGEADV ---------------------------3333----------------------------- EFATHEDAVAAMAKDKANMQHRYVELFLNSTAGTSGSGPSSG ----3333---------------------------------- >TAR DNA-BINDING PROTEIN-4; SWP:Q13148; PDB:1WF0A; GSSGSSGVFVGRCTGDMTEDELREFFSQYGDVMDVFIPKPFRAFAFVTFADDQIAQSLCG -----------------3333----------------------------------3333- EDLIIKGISVHISNAEPKHNSNSGPSSG ----iiii-------------------- >RNA-BINDING PROTEIN RALY; SWP:Q9UKM9; PDB:1WF1A; GSSGSSGMSLKLQASNVTNKNDPKSINSRVFIGNLNTALVKKSDVETIFSKYGRVAGCSV -------------------------------------------3333-3333-------- HKGYAFVQYSNERHARAAVLGENGRVLAGQTLDINMAGEPKPDRSGPSSG -------------------------------------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:P07910; PDB:1WF2A; GSSGSSGKTDPRSMNSRVFIGNLNTLVVKKSDVEAIFSKYGKIVGCSVHKGFAFVQYVNE -----------------------3333-------1111--------------------33 RNARAAVAGEDGRMIAGQVLDINLAAEPKVNRSGPSSG 33-------2222-iiii-------------------- >GTP-BINDING PROTEIN; SWP:Q72GH4; PDB:1WF3A; EKTYSGFVAIVGKPNVGKSTLLNNLLGVKVAPISPRPQTTRKRLRGILTEGRRQIVFVDT -------------------------------------------------!!!!------- PGLHKPMDALGEFMDQEVYEALADVNAVVWVVDLRHPPTPEDELVARALKPLVGKVPILL -------------------1111---------3333------------3333-------- 
VGNKLDAAKYPEEAMKAYHELLPEAEPRMLSALDERQVAELKADLLALMPEGPFFYPEDY ---3333--------------3333-----1111-----------1111-------1111 AKSDQTFGEWVAEILREEAMKRLWHEVPYAVATKVEEVAERENGVLYIKAILYVERPSQK -------------------1111!!!!1111------------------------3333- AIVIGEGGRKIKEIGQATRKQLEALLGKKVYLDLEVKVYPDWRKDPEALRELGYRS ----2222------------------------------------------------ >SIDEKICK 2 PROTEIN; SWP:Q58EX2; PDB:1WF5A; GSSGSSGRSAHLRVRQLPHAPEHPVATLSTVERRAINLTWTKPFDGNSPLIRYILEMSEN ----------------------------------------------------------22 NAPWTVLLASVDPKATSVTVKGLVPARSYQFRLCAVNDVGKGQFSKDTERVSLPESGPSS 22---------1111---------------------3333-------------------- G - >SIMILAR TO S.POMBE -RAD4+; SWP:Q92547; PDB:1WF6A; GSSGSSGSESICNSLNSKLEPTLENLENLDVSAFQAPEDLLDGCRIYLCGFSGRKLDKLR ---------------------3333----3333--------------------------- RLINSGGGVRFNQLNEDVTHVIVGDYDDELKQFWNKSAHRPHVVGAKWLLECFSKGYMLS ---1111-------3333---------3333----------------------------- EEPYIHSGPSSG 1111-------- >ENIGMA HOMOLOGUE PROTEIN; SWP:NA; PDB:1WF7A; GSSGSSGSVSLVGPAPWGFRLQGGKDFNMPLTISSLKDGGKASQAHVRIGDVVLSIDGIS ------------------------1111--------------1111---------iiii- AQGMTHLEAQNKIKACTGSLNMTLQRASAAAKSEPVSSGPSSG ------------3333--------------------------- >NEURABIN-I; SWP:Q9ULJ8; PDB:1WF8A; GSSGSSGLELFPVELEKDEDGLGISIIGMGVGADAGLEKLGIFVKTVTEGGAAQRDGRIQ -----------------------------------------------2222--3333--- VNDQIVEVDGISLVGVTQNFAATVLRNTKGNVRFVIGREKPSGPSSG -------iiii-----3333--------------------------- >NPL4 FAMILY PROTEIN; SWP:NA; PDB:1WF9A; GSSGSSGTMLRVRSRDGLERVSVDGPHITVSQLKTLIQDQLQIPIHNQTLSTNRNLLLAK -------------3333-------3333---------------1111------3333--- SPSDFLAFTDMADPNLRISSLNLAHGSMVYLAYEGERTIRGSGPSSG 33333333----1111-3333-------------------------- ------------------------------------- >HYPOTHETICAL PROTEIN 1500; SWP:Q8VDV8; PDB:1WFDA; GSSGSSGQDSDSTAAVAVLKRAVELDAESRYQQALVCYQEGIDMLLQVLKGTKESSKRCV ----------3333-----------11113333--------------------------- LRTKISGYMDRAENIKKYLDQEKEDGKSGPSSG 
-----------------------3333------ >RIKEN CDNA 2310008M20 PRO; SWP:Q8BFR6; PDB:1WFEA; GSSGSSGCSEVNVVKERPKTDEHKSYSCSFKGCTDVELVAVICPYCEKNFCLRHRHQSDH --------------------------------------------------3333--3333 DCEKLEVAKPRMAATQKLVRSGPSSG -3333--------------------- >RIKEN CDNA 2810002D23 PRO; SWP:NA; PDB:1WFFA; GSSGSSGIHHLPPVKAPLQTKKKIMKHCFLCGKKTGLATSFECRCGNNFCASHRYAEAHG ------------------------------------------3333---3333-3333-- CNYDYKSAGRRYLEEANPVSGPSSG ------------------------- >Regulating synaptic membr; SWP:Q9UQ26; PDB:1WFGA; GSSGSSGHSHSDKHPVTWQPSKDGDRLIGRILLNKRLKDGSVPRDSGAMLGLKVVGGKMT ------------------------------------3333------1111---------- ESGRLCAFITKVKKGSLADTVGHLRPGDEVLEWNGRLLQGATFEEVYNIILESKPEPQVE --------------------------------iiii-----3333--------------- LVVSRSGPSSG ----------- >ZINC FINGER (AN1-LIKE) FA; SWP:Q9SJM6; PDB:1WFHA; GSSGSSGQPSPPQRPNRCTVCRKRVGLTGFMCRCGTTFCGSHRYPEVHGCTFDFKSAGSG -------------------------------3333--------3333------------- PSSG ---- >NUCLEAR DISTRIBUTION GENE; SWP:O35685; PDB:1WFIA; GSSGSSGPNYRWTQTLAELDLAVPFRVSFRLKGKDVVVDIQRRHLRVGLKGQPPVVDGEL ------------------------------------------------2222-------- YNEVKVEESSWLIEDGKVVTVHLEKINKMEWWNRLVTSDPEINTKKINPENSKLSDLDSE ----3333---------------------------------------------------- TRSMVSGPSSG ----------- >PUTATIVE ELICITOR-RESPONS; SWP:NA; PDB:1WFJA; GSSGSSGPHGTLEVVLVSAKGLEDADFLNNMDPYVQLTCRTQDQKSNVAEGMGTTPEWNE ------------------------------------------------------------ TFIFTVSEGTTELKAKIFDKDVGTEDDAVGEATIPLEPVFVEGSIPPTAYNVVKDEEYKG ------------------3333-------------------------------------- EIWVALSFKPSGPSSG ---------------- >ZINC FINGER, FYVE DOMAIN ; SWP:NA; PDB:1WFKA; GSSGSSGMESRCYGCAVKFTLFKKEYGCKNCGRAFCNGCLSFSALVPRAGNTQQKVCKQC -------------------3333------------3333--------------------- HTILTRGSSDNASKWSPPQNYKSGPSSG ---------------------------- >ZINC FINGER PROTEIN 216; SWP:O88878; PDB:1WFLA; GSSGSSGPSSSQSEEKAPELPKPKKNRCFMCRKKVGLTGFDCRCGNLFCGLHRYSDKHNC 
-----------------------%%%%--------------1111----1111------- PYDYKAEASGPSSG --3333-------- >SYNAPTOTAGMIN XIII; SWP:Q7L8C5; PDB:1WFMA; GSSGSSGSWNQAPKLHYCLDYDCQKAELFVTRLEAVTSNHDGGCDCYVQGSVANRTGSVE ---------------------3333----------------------------3333--- AQTALKKRQLHTTWEEGLVLPLAEEELPTATLTLTLRTCDRFSRHSVAGELRLGLDGTSV ----------------------1111---------------------------------- PLGAAQWGELKTSGPSSG ------------------ >SIDEKICK 2; SWP:Q58EX2; PDB:1WFNA; GSSGSSGPQLVRTHEDVPGPVGHLSFSEILDTSLKVSWQEPGEKNGILTGYRISWEEYNR -----------------------------------------------------------3 TNTRVTHYLPNVTLEYRVTGLTALTTYTIEVAAMTSKGQGQVSASTISSGVPPSGPSSG 333-------------------------------------------------------- ------------------------------------------------------------ ------------------------------------------------------------ ---------- >ZINC FINGER (AN1-LIKE) FA; SWP:Q6NNI8; PDB:1WFPA; GSSGSSGTRGGDSAAAPLDPPKSTATRCLSCNKKVGVTGFKCRCGSTFCGTHRYPESHEC -----------------------------------------1111--------3333--- QFDFKGVASGPSSG -------------- >UNR PROTEIN; SWP:O75534; PDB:1WFQA; GSSGSSGGYPNGTSAALRETGVIEKLLTSYGFIQCSERQARLFFHCSQYNGNLQDLKVGD --------------------------------------------3333---3333----- DVEFEVSSDRRTGKPIAVKLVKISGPSSG ----------------------------- >1700129L13RIK PROTEIN; SWP:Q9D968; PDB:1WFTA; GSSGSSGPGAPSTVRISKNVDGIHLSWEPPTSPSGNILEYSAYLAIRTAQMQDNPSQLVF ------------------------------------------------------------ MRIYCGLKTSCTVTAGQLANAHIDYTSRPAIVFRISAKNEKGYGPATQIRWLQGNSKSGP --------------3333------------------------------------------ SSG --- >UNNAMED PROTEIN PRODUCT; SWP:NA; PDB:1WFUA; GSSGSSGMEPHKVVPLSKPHPPVVGKVTHHSIELYWDLEQKEKRQGPQEQWLRFSIEEED ----------------------------------------------3333---------- PKMHSYGVIYTGYATRHVVEGLEPRTLYKFRLKVTSPSGEYEYSPVVSVATTRESGPSSG ------------------------------------------------------------ >MEMBRANE ASSOCIATED GUANY; SWP:NA; PDB:1WFVA; GSSGSSGQDFDYFTVDMEKGAKGFGFSIRGGREYKMDLYVLRLAEDGPAIRNGRMRVGDQ 
-------------------1111-------3333--------------1111-------- IIEINGESTRDMTHARAIELIKSGGRRVRLLLKRGTGSGPSSG ---iiii-----3333---1111-------------------- >KALIRIN-9A; SWP:Q8BTT9; PDB:1WFWA; GSSGSSGSTMTVIKDYYALKENEICVSQGEVVQVLAVNQQNMCLVYQPASDHSPAAEGWV -------------------3333--------------1111--------3333------- PGSILAPFSGPSSG 3333---------- >PROBABLE RNA 2'-PHOSPHOTR; SWP:Q9YFP5; PDB:1WFXA; VRLSKTLAGILRHHPGRYGVRLTREGWARVSEVVEGLRKAGWSWVEEWHIVGVALHDPKG --------------3333----1111-----------11113333-----------1111 RYELRNGEIRARYGHSIPVNVEPLPGEPPPILYHGTTEEALPLIERGIRGRRLKVHLTSS ------------------------------------33333333---------------- LEDAVSTGRRHGNLVAVLLVDVECLRRRGLKVERSKTVYTVDWVPPECIAEVRRESL -------3333---------------------------------1111-----1111 >REGULATOR OF G-PROTEIN SI; SWP:P97492; PDB:1WFYA; GSSGSSGDQEVRLENRITFQLELVGLERVVRISAKPTKRLQEALQPILAKHGLSLDQVVL ----------------------3333--------------3333---------1111--- HRPGEKQPMDLENPVSSVASQTLVLDTPPDAKMSEARSSGPSSG ---------33333333--------------------------- >NITROGEN FIXATION CLUSTER; SWP:Q9D7P6; PDB:1WFZA; GSSGSSGENPRNVGSLDKTSKNVGTGLVGAPACGDVMKLQIQVDEKGKIVDARFKTFGCG ----------------1111-----------------------3333-----------33 SAIASSSLATEWVKGKTVEEALTIKNTDIAKELCLPPVKLHCSMLAEDAIKAALADYKLK 33------1111222233333333-----------33333333----------------- QESKSGPSSG ---------- >KIAA1579 PROTEIN; SWP:Q9HCJ3; PDB:1WG1A; GSSGSSGILVKNLPQDSNCQEVHDLLKDYDLKYCYVDRNKRTAFVTLLNGEQAQNAIQMF -----------------3333----3333--------1111-------3333-------- HQYSFRGKDLIVQLQPTDALLCSGPSSG ----%%%%-------------------- >ZINC FINGER (AN1-LIKE) FA; SWP:Q9SZ69; PDB:1WG2A; GSSGSSGPSRPVRPNNRCFSCNKKVGVMGFKCKCGSTFCGSHRYPEKHECSFDFKEVGSG ------------------------!!!!---3333--------3333------------- PSSG ---- >HYPOTHETICAL PROTEIN (RIK; SWP:Q9D0B0; PDB:1WG4A; GSSGSSGGPPTRRSDFRVLVSGLPPSGSWQDLKDHMREAGDVCYADVQKDGMGMVEYLRK -----------------------1111-------3333--------------------33 EDMEYALRKLDDTKFRSHEGETSYIRVYPERSSGPSSG 33--------------1111------------------ >HETEROGENEOUS 
NUCLEAR RIB; SWP:P55795; PDB:1WG5A; GSSGSSGNSPDTANDGFVRLRGLPFGCSKEEIVQFFSGLEIVPNGMTLPVDFQGRSTGEA -----------------------22223333----2222--2222-----1111------ FVQFASQEIAEKALKKHKERIGHRYIEIFKSSRAEVRTSGPSSG -----3333-3333-----------------3333--------- >HYPOTHETICAL PROTEIN (RIK; SWP:Q5SV54; PDB:1WG6A; GSSGSSGLKGEPDCYALSLESSEQLTLEIPLNDSGSAGLGVSLKGNKSRETGTDLGIFIK --------------------------------3333------------------------ SIIHGGAAFKDGRLRMNDQLIAVNGETLLGKSNHEAMETLRRSMSMEGNIRGMIQLVILR ------33333333--------iiii-3333--------------1111----------- RSGPSSG ------- >DEDICATOR OF CYTOKINESIS ; SWP:Q9BZ29; PDB:1WG7A; GSSGSSGAASLGSQKGGITKHGWLYKGNMNSAISVTMRSFKRRFFHLIQLGDGSYNLNFY ------------------------------------------------------------ KDEKISKEPKGSIFLDSCMGVVQNNKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILN -------------3333------------------------------------------- KILQLNFEAAMQEKRNGDSHEDDESGPSSG ------------------------------ >predicted S-adenosylmethi; SWP:Q5SJD8; PDB:1WG8A; MTHVPVLYQEALDLLAVRPGGVYVDATLGGAGHARGILERGGRVIGLDQDPEAVARAKGL ------------3333-2222------!!!!--------------------------111 HLPGLTVVQGNFRHLKRHLAALGVERVDGILADLGVSSFHLDDPSRGFSYQKEGPLDMRM 12222-----1111-----1111-------------3333--3333-------------- GLEGPTAKEVVNRLPLEALARLLRELGEEPQAYRIARAIVAAREKAPIETTTQLAEIVRK ------------------------------------------------------------ AVGFRRAGHPARKTFQALRIYVNDELNALKEFLEQAAEVLAPGGRLVVIAFHSLEDRVVK ----33331111----------------------------2222---------------- RFLRESGLKVLTKKPLVPSEKEAAQNPRARSAKLRAAEKEA -------------------------3333------------ >Homocysteine-responsive e; SWP:Q15011; PDB:1WGDA; GSSGSSGVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSGKL --------------------------1111-----------------1111--------- LLDHQCLRDLLPKQEKRHVLHLVCNVKSGPSSG -----3333------------------------ >HYPOTHETICAL PROTEIN 2610; SWP:Q8K0W9; PDB:1WGEA; GSSGSSGMAVFHDEVEIEDFQYDEDSETYFYPCPCGDNFAITKEDLENGEDVATCPSCSL ---------------3333---3333---------------3333--------------- IIKVIYDKDQFMCGETVSGPSSG 
------3333------------- >UPSTREAM BINDING FACTOR 1; SWP:P25976; PDB:1WGFA; GSSGSSGKPSQEGGKGGSEKPKRPVSAMFIFSEEKRRQLQEERPELSESELTRLLARMWN ------------------------------------------33333333---------- DLSEKKKAKYKAREAALKAQSERKSGPSSG --3333-----33333333----------- >UBIQUITIN CARBOXYL-TERMIN; SWP:Q9JMA1; PDB:1WGGA; GSSGSSGYSVTVKWGKEKFEGVELNTDEPPMVFKAQLFALTGVQPARQKVMVKGGTLKDD ----------------------------3333---3333--------------------- DWGNIKMKNGMTVLMMGSADALPEEPSAKTSGPSSG ------------------------------------ >UBIQUITIN-LIKE 3; SWP:NA; PDB:1WGHA; GSSGSSGMSSHVPADMINLRLILVSGKTKEFLFSPNDSASDIAKHVYDNWPMDWEEEQVS ---------------------------------1111----------------------- SPNILRLIYQGRFLHGNVTLGALKLPFGKTTVMHLVARETLPEPNSQGQRSGPSSG --------------11113333---------------------------------- >RIKEN CDNA 2900073H19 PRO; SWP:NA; PDB:1WGKA; GSSGSSGMAAPLCVKVEFGGGAELLFDGVKKHQVALPGQEEPWDIRNLLVWIKKNLLKER --------------------3333-------------------3333------------3 PELFIQGDSVRPGILVLINDADWELLGELDYQLQDQDSILFISTLHGGSGPSSG 333--------------%%%%3333----------------------------- >TOLL-INTERACTING PROTEIN; SWP:Q9H0E2; PDB:1WGLA; GSSGSSGCSEEDLKAIQDMFPNMDQEVIRSVLEAQRGNKDAAINSLLQMGEEPSGPSSG ------------------------3333------%%%%--------------------- >UBIQUITIN CONJUGATION FAC; SWP:Q14139; PDB:1WGMA; GSSGSSGLQQQEEETYADACDEFLDPIMSTLMCDPVVLPSSRVTVDRSTIARHLLSDQTD -------------------3333------------------------------------- PFNRSPLTMDQIRPNTELKEKIQRWLAERKQQSGPSSG -------3333----3333------------------- >UBIQUITIN ASSOCIATED PROT; SWP:Q9NZ09; PDB:1WGNA; GSSGSSGAYSELQMLSPSERQCVETVVNMGYSYECVLRAMKKKGENIEQILDYLFAHSGP -------3333----3333----------------------------------------- SSG --- >VPS10 DOMAIN-CONTAINING R; SWP:Q96PQ0; PDB:1WGOA; GSSGSSGCEGGVDMQQSQVQLQCPLTPPRGLQVSIQGEAVAVRPGEDVLFVVRQEQGDVL ------------------------------------------2222-------------- TTKYQVDLGDGFKAMYVNLTLTGEPIRHRYESPGIYRVSVRAENTAGHDEAVLFVQVSGP ------------------3333-------------------------------------- SSG --- >PROBABLE CYCLIC NUCLEOTID; 
SWP:O82226; PDB:1WGPA; GSSGSSGVRRVPLFENMDERLLDAICERLKPCLFTEKSYLVREGDPVNEMLFIIRGRLES -----------------3333----3333------------------------------- VTTDGGRSGFYNRSLLKEGDFCGDELLTWALDPKSGSNLPSSTRTVKALTEVEAFALIAD ----------------2222--3333---------------------------------- ELKFVASQFRRSGPSSG ------3333------- >FYVE, RHOGEF AND PH DOMAI; SWP:Q69ZL1; PDB:1WGQA; GSSGSSGSTMSGYLYRSKGSKKPWKHLWFVIKNKVLYTYAASEDVAALESQPLLGFTVTL -------------------------------%%%%------------------------- VKDENSESKVFQLLHKGMVFYVFKADDAHSTQRWIDAFQEGTVSGPSSG ------------------------------------------------- >GROWTH FACTOR RECEPTOR-BO; SWP:Q14451; PDB:1WGRA; GSSGSSGRPHVVKVYSEDGACRSVEVAAGATARHVCEMLVQRAHALSDETWGLVECHPHL ---------------3333----------------------------------------- ALERGLEDHESVVEVQAAWPVGGDSRFVFRKNFASGPSSG ------3333------------------------------ >MYST HISTONE ACETYLTRANSF; SWP:Q9D1P2; PDB:1WGSA; GSSGSSGEPEVTVEIGETYLCRRPDSTWHSAEVIQSRVNDQEGREEFYVHYVGFNRRLDE ---------------------------------------3333----------------- WVDKNRLALTKTVKDAVQKNSEKYLSELAEQPERKITRNQKRKHDEINHVQKTYAEMDPT --3333---3333----------------------------------------------- TAALEKESGPSSG ------------- >amyloid beta (A4) precurs; SWP:Q9DBR4; PDB:1WGUA; GSSGSSGPTPKTELVQKFRVQYLGMLPVDRPVGMDTLNSAIENLMTSSSKEDWPSVNMNV ------------------------------------------------3333-------- ADATVTVISEKNEEEVLVECRVRFLSFMGVGKDVHTFAFIMDTGNQRFECHVFWCEPNAA %%%%----------------3333-------------------%%%%------------- NVSEAVQAACSGPSSG ---------------- >KIAA1068 PROTEIN; SWP:Q8IVD9; PDB:1WGVA; GSSGSSGQKNPDSYNGAVRENYTWSQDYTDLEVRVPVPKHVVKGKQVSVALSSSSIRVAM ------------------------------------------1111-------------- LEENGERVLMEGKLTHKINTESSLWSLEPGKCVLVNLSKVGEYWWNAILEGEEPIDIDSG -------------------3333------------------------------------- PSSG ---- >'SIGNAL RECOGINITION PART; SWP:NA; PDB:1WGWA; GSSGSSGADLGRKITSALRSLSNATIINEEVLNAMLKEVCTALLEADVNIKLVKQLRENV --------------------3333------------------------------------ KSAIDLEEMASGLNKRKMIQHAVFKELVKVKVYSGPSSG 
------------------------3333----------- >KIAA1903 PROTEIN; SWP:Q6P0N0; PDB:1WGXA; GSSGSSGDKEWNEKELQKLHCAFASLPKHKPGFWSEVAAAVGSRSPEECQRKYMENPRGK -----------3333-----------------------1111--3333----3333---- GSQKHVTSGPSSG ------------- >RAP GUANINE NUCLEOTIDE EX; SWP:Q92565; PDB:1WGYA; GSSGSSGEEIFCHVYITEHSYVSVKAKVSSIAQEILKVVAEKIQYAEEDLALVAITFSGE ------------------------------3333-----------3333------3333- KHELQPNDLVISKSLEASGRIYVYRKDLADTLNPFAENSGPSSG ----1111------------------------------------ >CARBOXYPEPTIDASE 1; SWP:Q5SLM3; PDB:1WGZA; MTPEAAYQNLLEFQRETAYLASLGALAAWDQRTMIPKKGHEHRARQMAALARLLHQRMTD -----------------------------------1111--------------------- PRIGEWLEKVEGSPLVQDPLSDAAVNVREWRQAYERARAIPERLAVELAQAESEAESFWE ---------222211111111--------------------------------------- EARPRDDWRGFLPYLKRVYALTKEKAEVLFALPPAPGDPPYGELYDALLDGYEPGMRARE 3333---3333-------------------------------3333-3333-22223333 LLPLFAELKEGLKGLLDRILGSGKRPDTSILHRPYPVEAQRRFALELLSACGYDLEAGRL --------------------------3333-----3333--------------1111--- DPTAHPFEIAIGPGDVRITTRYYEDFFNAGIFGTLHEMGHALYEQGLPKEHWGTPRGDAV -----------2222-------1111-----------------11113333--3333--- SLGVHESQSRTWENLVGRSLGFWERFFPRAREVFASLGDVSLEDFHFAVNAVEPSLIRVE ---------------------------------3333---------1111------3333 ADEVTYNLHILVRLELELALFRGELSPEDLPEAWAEKYRDHLGVAPKDYKDGVMQDVHWA -3333--------------1111--3333-------------------1111-----333 GGLFGYFPTYTLGNLYAAQFFQKAEAELGPLEPRFARGEFQPFLDWTRARIHAEGSRFRP 3----3333-------------------------1111------------3333------ RVLVERVTGEAPSARPFLAYLEKKYAALYG --------------------------1111 >UBIQUITIN CARBOXYL-TERMIN; SWP:O94966; PDB:1WH0A; GSSGSSGVDEPESMVNLAFVKNDSYEKGPDSVVVHVYVKEICRDTSRVLFREQDFTLIFQ -----------------------------------------3333--------------- TRDGNFLRLHPGCGPHTTFRWQVKLRNLIEPEQCTFCFTASRIDICLRKRQSQRWGGLEA -----3333----1111------------3333--------------------------- PAARVGGASGPSSG -------------- >KIAA1095 PROTEIN; SWP:Q9UPQ7; PDB:1WH1A; 
GSSGSSGDIHQEMDREELELEEVDLYRMNSQDKLGLTVCYRTDDEDDIGIYISEIDPNSI ----------------------------3333---------------------------- AAKDGRIREGDRIIQINGIEVQNREEAVALLTSEENKNFSLLIARPELQLDEGWMDDDSG 3333-----------iiii---3333---------------------------------- PSSG ---- >HYPOTHETICAL PROTEIN AT5G; SWP:Q9FT92; PDB:1WH2A; GSSGSSGVRVLSYDKEKLNWLYKDPQGLVQGPFSLTQLKAWSDAEYFTKQFRVWMTGESM -----------------------3333-------------1111----------222211 ESAVLLTDVLRLSGPSSG 11---------------- >59 kDa 2'-5'-oligoadenyla; SWP:Q15646; PDB:1WH3A; GSSGSSGIQVFVKNPDGGSYAYAINPNSFILGLKQQIEDQQGLPKKQQQLEFQGQVLQDW ------------------------11113333-----------3333----%%%%----- LGLGIYGIQDSDTLILSKKKGSGPSSG -3333---------------------- >ZF-HD HOMEOBOX FAMILY PRO; SWP:Q9FKP8; PDB:1WH5A; GSSGSSGSSAEAGGGIRKRHRTKFTAEQKERMLALAERIGWRIQRQDDEVIQRFCQETGV -----------------------------------------------3333--------- PRQVLKVWLHNNKHSGPSSG -------------------- >HOMEOBOX PROTEIN CUX-2; SWP:O14529; PDB:1WH6A; GSSGSSGQYELYMYREVDTLELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPK --------3333-----------------------------------3333--------- PWSKLTQKGREPFIRMQLWLSDQLGQAVGQQPGASSGPSSG 3333-3333-------------------------------- >ZF-HD HOMEOBOX FAMILY PRO; SWP:Q9SB61; PDB:1WH7A; GSSGSSGSNPSSSGGTTKRFRTKFTAEQKEKMLAFAERLGWRIQKHDDVAVEQFCAETGV --------------------------------------%%%%-3333------------- RRQVLKIWMHNNKNSGPSSG --------1111-------- >HOMEOBOX PROTEIN CUX-2; SWP:O14529; PDB:1WH8A; GSSGSSGYSGSQAPGGIQEIVAMSPELDTYSITKRVKEVLTDNNLGQRLFGESILGLTQG ---------------3333----------------------------------------- SVSDLLSRPKPWHKLSLKGREPFVRMQLWLNDPHNVEKLRDMKKLSGPSSG ----------3333-3333-------------------------------- >40S RIBOSOMAL PROTEIN S3; SWP:P23396; PDB:1WH9A; GSSGSSGFKAELNEFLTRELAEDGYSGVEVRVTPTRTEIIILATRTQNVLGEKGRRIREL ------3333----------------------------------3333---iiii----- TAVVQKRFGFPEGSVELYAEKVATRGSGPSSG ----------2222------------------ >KIAA0147 PROTEIN; SWP:Q14160; PDB:1WHAA; GSSGSSGRHVACLARSERGLGFSIAGGKGSTPYRAGDAGIFVSRIAEGGAAHRAGTLQVG 
---------------3333-------1111---2222----------------------- DRVLSINGVDVTEARHDHAVSLLTAASPTIALLLEREAGSGPSSG -----%%%%-1111------------------------------- >UBA/UBX 33.3 KDA PROTEIN; SWP:Q922Y1; PDB:1WHCA; GSSGSSGAELTALESLIEMGFPRGRAEKALALTGNQGIEAAMDWLMEHEDDPDVDEPLSG ----------------1111-3333--------------------1111----------- PSSG ---- >TUBULIN SPECIFIC CHAPERON; SWP:Q9D1E6; PDB:1WHGA; GSSGSSGNEELRAQQEAEAAQRLSEEKAQASAISVGSRCEVRAPDHSLRRGTVMYVGLTD --------3333------------------------------------------------ FKPGYWVGVRYDEPLGKNDGSVNGKRYFECQAKYGAFVKPSAVTVGDSGPSSG --------------------------------------3333----------- >CLIPR-59; SWP:Q9DB67; PDB:1WHHA; GSSGSSGKSPSSPSLGSLQQREGAKAEVGDQVLVAGQKQGIVRFYGKTDFAPGYWYGIEL ---------------------------------%%%%----------------------- DQPTGKHDGSVFGVRYFTCAPRHGVFAPASRIQRIGSGPSSG ---------------------------3333----------- >RIBOSOMAL PROTEIN L14; SWP:P04450; PDB:1WHI; MIQQESRLKVADNSGAREVLVIKVLGGSGRRYANIGDVVVATVKDATPGGVVKKGQVVKA --2222--------------------2222---2222---------2222--2222---- VVVRTKRGVRRPDGSYIRFDENACVIIRDDKSPRGTRIFGPVARELRDKDFMKIISLAPE ----3333--1111-------------1111-------------3333------------ VI -- >RIKEN CDNA 1700024K14; SWP:Q8CI96; PDB:1WHJA; GSSGSSGLPNSDHTTSRAMLTSLGLKLGDRVVIAGQKVGTLRFCGTTEFASGQWAGIELD ------------------3333----------%%%%------------------------ EPEGKNNGSVGRVQYFKCAPKYGIFAPLSKISKLKDSGPSSG ---------!!!!-------------3333------------ >RIKEN CDNA 1700024K14; SWP:Q8CI96; PDB:1WHKA; GSSGSSGEGTVKLHEGSQVLLTSSNEMATVRYVGPTDFASGIWLGLELRSAKGKNDGAVG ----------------------------------------------------------!! 
DKRYFTCKPNYGVLVRPSRVTYRGISGPSSG !!-------------3333------------ >CYLINDROMATOSIS TUMOR SUP; SWP:Q9NQC7; PDB:1WHLA; GSSGSSGIDVGCPVKVQLRSGEEKFPGVVRFRGPLLAERTVSGIFFGVELLEEGRGQGFT -----------------------------------------------------------i DGVYQGKQLFQCDEDCGVFVALDKLELIESGPSSG iiiiiii-------------3333----------- >CYLINDROMATOSIS TUMOR SUP; SWP:Q9NQC7; PDB:1WHMA; GSSGSSGPPLEINSRVSLKVGETIESGTVIFCDVLPGKESLGYFVGVDMDNPIGNWDGRF ----------------------------------22223333------------------ DGVQLCSFACVESTILLHINDIIPESSGPSSG -----3333--------3333----------- >HYPOTHETICAL PROTEIN RIKE; SWP:Q9D7B1; PDB:1WHNA; GSSGSSGSGIKAIRFDRRAYPPQI ---------------3333-1111 >ALLERGEN PHL P 2; SWP:P43214; PDB:1WHO; VPKVTFTVEKGSNEKHLAVLVKYEGDTMAEVELREHGSDEWVAMTKGEGGVWTFDSEEPL ------------1111------2222--------2222---------iiii--------- QGPFNFRFLTEKGMKNVFDDVVPEKYTIGATYAP ---------1111-------------2222---- >RNA HELICASE A; SWP:NA; PDB:1WHQA; GSSGIKNFLYAWCGKRKMTPAYEIRAVGNKNRQKFMCEVRVEGFNYAGMGNSTNKKDAQS ----3333--------------------3333---------------------------- NAARDFVNYLVRINEVKSEEVPAVGIVPPPSGPSSG ----------------3333---------------- >HYPOTHETICAL KIAA1002 PRO; SWP:Q9Y2K5; PDB:1WHRA; GSSGSSGTDSTGIDLHEFLVNTLKKNPRDRMMLLKLEQEILEFINDNNNQFKKFPQMTSY --------------3333------------------------3333-------------- HRMLLHRVAAYFGMDHNVDQTGKAVIINKTSNTRIPEQRFSEHIKDEKNTEFQQRFILSG -----------------------------3333-----3333------------------ PSSG ---- >SERINE CARBOXYPEPTIDASE I; SWP:P08819; PDB:1WHSA; IARLPGQPAVDFDMYGYITVDEGAGRSLFYLLQEAPEDAQPAPLVLWLNGGPGCSSVAYG ---2222--------------3333----------3333--------------------- ASEELGAFRVKPRGAGLVLNEYRWNKVANVLFLDSPAGVGFSYTNTSSDIYTSGDNRTAH ----------2222-----1111----------------------1111----------- DSYAFLAKWFERFPHYKYRDFYIAGESYAGHYVPELSQLVHRSKNPVINLKGFMVGNGLI ------------1111-------------------------------------------- DDYHDYVGTFEFWWNHGIVSDDTYRRLKEACLHDSFIHPSPACDAATDVATAEQGNIDMY -------------1111-------------11113333-------------------111 SLYTPVCNI 1-------- >Serine 
carboxypeptidase 2; SWP:P08819; PDB:1WHSB; SYDPCTERYSTAYYNRRDVQMALHANVTGAMNYTWATCSDTINTHWHDAPRSMLPIYREL -----------3333----------1111------------------------------- IAAGLRIWVFSGDTDAVVPLTATRYSIGALGLPTTTSWYPWYDDQEVGGWSQVYKGLTLV -----------1111---3333-------------------------------2222--- SVRGAGHEVPLHRPRQALVLFQYFLQGKPMPGQ -2222--1111---------------------- >POLYNUCLEOTIDE PHOSPHORYL; SWP:Q8K1R3; PDB:1WHUA; GSSGSSGPQKIFTPSAEIVKYTKIIAMEKLYAVFTDYEHDKVSRDEAVNKIRLDTEEHLK --------------------------------3333-----3333-------------33 EKFPEVDQFEIIESFNIVAKEVFRSIILNEYKRCDGRDSGPSSG 3333333333-3333--------3333----------------- >POLY(A)-SPECIFIC RIBONUCL; SWP:Q8VDG3; PDB:1WHVA; GSSGSSGGPDLQPKRDHVLHVTFPKEWKTSDLYQLFSAFGNIQISWIDDTSAFVSLSQPE -----------------------3333-------3333-------------------333 QVQIAVNTSKYAESYRIQTYAEYVGKKQKGKQVKSGPSSG 3--------------------------------------- >HYPOTHETICAL PROTEIN RIKE; SWP:Q8R3C6; PDB:1WHWA; GSSGSSGSGRLFVRNLSYTSSEEDLEKLFSAYGPLSELHYPIDSLTKKPKGFAFVTFMFP ----------------11113333----3333---------------------------- EHAVKAYAEVDGQVFQGRMLHVLPSTIKKEASQSGPSSG --------------%%%%--------------------- >HYPOTHETICAL PROTEIN RIKE; SWP:Q8R3C6; PDB:1WHXA; GSSGSSGRSKTVILAKNLPAGTLAAEIQETFSRFGSLGRVLLPEGGITAIVEFLEPLEAR -3333-------------3333--------3333--------3333-------------- KAFRHLAYSKFHHVPLYLEWAPIGVFGAAPQKKDSQHEQPAEKAESGPSSG -----2222------------1111-----33333333--3333------- >HYPOTHETICAL PROTEIN RIKE; SWP:Q6PHZ5; PDB:1WHYA; GSSGSSGKIGYGKANPTTRLWVGGLGPNTSLAALAREFDRFGSIRTIDHVKGDSFAYIQY ------------------------------------------------------------ ESLDAAQAACAKMRGFPLGGPDRRLRVDFAKSGPSSG -------------------1111-------------- >HYPOTHETICAL PROTEIN; SWP:Q5SH17; PDB:1WHZA; WPPRPEEVARKLRRLGFVERAKGGHRLYTHPDGRIVVVPFHSGELPKGTFKRILRDAGLT ------------1111-----iiii----1111--------------------------- EEEFHNL ------- >MITOGEN ACTIVATED PROTEIN; SWP:Q9WVS7; PDB:1WI0A; GSSGSSGPFCAMENQVLVIRIKIPNSGAVDWTVHSGPQLLFRDVLDVIGQVLPEATTTAF 
----------------------------------3333--------3333-1111----- EYEDEDGDRITVRSDEEMKAMLSYYYSTVMEQQVNGQLIEPLQIFPRSGPSSG ---1111------3333------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------ >RIKEN CDNA 2700099C19; SWP:Q9CZG9; PDB:1WI2A; GSSGSSGNNELTQFLPRIVTLKKPPGAQLGFNIRGGKASQLGIFISKVIPDSDAHRAGLQ -----------------------2222--------------------------------- EGDQVLAVNDVDFQDIEHSKAVEILKTAREISMRVRFFSGPSSG -------------------------------------------- >DNA-BINDING PROTEIN SATB2; SWP:Q9UPW6; PDB:1WI3A; GSSGSSGPRSRTKISLEALGILQSFIHDVGLYPDQEAIHTLSAQLDLPKHTIIKFFQNQR ---------------------------------3333-----1111-------------- YHVKHSGPSSG ----------- >SYNTAXIN BINDING PROTEIN ; SWP:Q9WV89; PDB:1WI4A; GSSGSSGSPLDRDPAFRVITVTKETGLGLKILGGINRNEGPLVYIHEVIPGGDCYKDGRL ------------1111-------------------------------------------- KPGDQLVSINKESMIGVSFEEAKSIITRAKLRSESPWEIAFIRSGPSSG -----------------3333---3333--------------------- >RRP5 PROTEIN HOMOLOG; SWP:Q14690; PDB:1WI5A; GSSGSSGKNVNRVLSAEALKPGMLLTGTVSSLEDHGYLVDIGVDGTRAFLPLLKAQEYIR --------------3333--------------1111------------------------ QKNKGAKLKVGQYLNCIVEKVKGNGGVVSLSVGHSEVSTAIATEQQSWNLNNLSGPSSG ----------------------------------------------------------- >HYPOTHETICAL PROTEIN (RIK; SWP:Q9CW46; PDB:1WI6A; GSSGSSGILIRGLPGDVTNQEVHDLLSDYELKYCFVDKYKGTAFVTLLNGEQAEAAINTF -------------11113333--------------------------------------- HQSRLRERELSVQLQPTDALLCSGPSSG ----%%%%--------%%%%-------- >SH3-DOMAIN KINASE BINDING; SWP:Q8R551; PDB:1WI7A; GSSGSSGRRCQVAFSYLPQNDDELELKVGDIIEVVGEVEEGWWEGVLNGKTGMFPSNFIK --------------------------------------2222------------------ ELSGPSSG -------- >EUKARYOTIC TRANSLATION IN; SWP:P23588; PDB:1WI8A; GSSGSSGSRLPKSPPYTAFLGNLPYDVTEESIKEFFRGLNISAVRLPREPSNPERLKGFG ---------------------------3333--1111--------------3333----- YAEFEDLDSLLSALSLNEESLGNKRIRVDVADQAQDKDSGPSSG -----3333----1111--------------------------- 
>PROTEIN C20ORF116 HOMOLOG; SWP:Q80WW9; PDB:1WI9A; GSSGSSGFLTEFINYIKKSKVVLLEDLAFQMGLRTQDAINRIQDLLTEGTLTGVIDDRGK -------------------------------------------------------1111- FIYITPSGPSSG ------------ >hypothetical ubiquitin-li; SWP:Q3V209; PDB:1WIAA; GSSGSSGINVRLKFLNDTEELAVARPEDTVGTLKSKYFPGQESQMKLIYQGRLLQDPART -------------1111-------1111------------3333----iiii---1111- LSSLNITNNCVIHCHRSPPGAAVSGPSASSGPSSG ----------------------------------- - >HYPOTHETICAL PROTEIN RIKE; SWP:Q9CWP6; PDB:1WICA; GSSGSSGKKPLSVFKGPLLHISPAEELYFGSIESGEKKTLIVLTNVTKNIVAFKVRTTAP ----------------------------------------------------------11 EKYRVKPSNSSCDPGASIDIIVSPHGGLTVSAQDRFLIMAAEMEQSSGTGPAELSQFWKE 11-----------------------------------------------------1111- VPRNKVMEHRLRCHTVESSKPNSLMLSGPSSG -3333--------------------------- >DNA-BINDING PROTEIN RAV1; SWP:Q9ZWM9; PDB:1WIDA; RSAEALFEKAVTPSDVGKLNRLVIPKHHAEKHFPLPSSNVSVKGVLLNFEDVNGKVWRFR -----------3333-3333-----1111------------------------------- YSYWNSSQSYVLTKGWSRFVKEKNLRAGDVVSFSRSNGQDQQLYIGWKSRSGSDLDA --------------------1111-2222---------------------------- >RIM BINDING PROTEIN 2; SWP:O15034; PDB:1WIEA; GSSGSSGTSKQRYSGKVHLCVARYSYNPFDGPNENPEAELPLTAGKYLYVYGDMDEDGFY ----------------------------------3333----------------3333-- EGELLDGQRGLVPSNFVDFVQDNESRLASTSGPSSG ---1111-----1111-------------------- >RIKEN CDNA 4930408O21; SWP:Q9D9M4; PDB:1WIFA; GSSGSSGSKNEKEQLSKAKASVSSLNKVIQTKLTVGNLGLGLVVIQNGPYLQISHLINKG -----------------------------------1111--------------------- AAASDGILQPGDVLISVGHANVLGYTLREFLKLLQNITIGTVLQIKAYRGFLEIPQEWQD 3333---------------------3333-------------------------3333-- SGPSSG ------ >KIAA1808 PROTEIN; SWP:Q6H8Q1; PDB:1WIGA; GSSGSSGCDSCEKYITGRVLEAGEKHYHPSCALCVRCGQMFAEGEEMYLQGSSIWHPACR ----------------------------------------------------------33 QAARTEDSGPSSG 33----------- >MITOCHONDRIAL RIBOSOME RE; SWP:Q9D6S7; PDB:1WIHA; GSSGSSGLDHITVVTADGKVALNQIGQISMKSPQVILVNMASFPECTAAAIKAIRESGMN 
-------1111----------------------------1111----------1111--- LNPEVEGTLIRVPIPKVTSGPSSG ------------------------ >HYPOTHETICAL UPF0222 PROT; SWP:P60003; PDB:1WIIA; GSSGSSGRKPPPKKKMTGTLETQFTCPFCNHEKSCDVKMDRARNTGVISCTVCLEEFQTP ----------------------------------------1111---------------- ITYLSEPVDVYSDWIDACESGPSSG -33331111---------------- >ETHYLENE-INSENSITIVE3-LIK; SWP:O23116; PDB:1WIJA; SQFVLQDLQDATLGSLLSSLMQHCDPPQRKYPLEKGTPPPWWPTGNEEWWVKLGLPKSQS ---3333-3333----------------------------------11113333------ PPYRKPHDLKKMWKVGVLTAVINHMLPDIAKIKRHVRQSKCLQDKMTAKESAIWLAVLNQ ----3333----------------33333333---------------------------3 EESLIQQ 333---- >THIOREDOXIN-LIKE PROTEIN ; SWP:Q9CQM9; PDB:1WIKA; GSSGSSGLKVLTNKASVMLFMKGNKQEAKCGFSKQILEILNSTGVEYETFDILEDEEVRQ -------3333-----------------------------3333----------3333-- GLKTFSNWPTYPQLYVRGDLVGGLDIVKELKDNGELLPILKGESGPSSG ----------------------3333---------3333---------- >KIAA1045 PROTEIN; SWP:Q9UPV7; PDB:1WILA; GSSGSSGPREPVVNDEMCDVCEVWTAESLFPCRVCTRVFHDGCLRRMGYIQGDSAAEVTE ---------------------------------------3333----------------- MAHTETGWSCHYCDNINLLLTEESGPSSG ---------3333---------------- >KIAA0161 PROTEIN; SWP:P50876; PDB:1WIMA; GSSGSSGCKLCLGEYPVEQMTTIAQCQCIFCTLCLKQYVELLIKEGLETAISCPDAACPK ---------------3333----------------------3333------------111 QGHLQENEIECMVAAEIMQRYKKLQFERSGPSSG 1------------3333-------3333------ >FLOTILLIN 2; SWP:NA; PDB:1WINA; GSSGSSGQRISLEIMTLQPRCEDVETAEGVALTVTGVAQVKIMTEKELLAVACEQFLGKN -------------------------3333-------------------3333-------3 VQDIKNVVLQTLEGHLRSILGTLTVEQIYQDRDQFAKLVREVAAPDVGRMGIEILSFTIK 333----------------------------3333----------3333----------- DVYDKVDYLSSLGKTQTSGPSSG ---11113333------------ >PROTEIN ARGININE N-METHYL; SWP:Q922H1; PDB:1WIRA; GSSGSSGEPAHGRQHTPCLFCDRLFASAEETFSHCKLEHQFNIDSMVHKHGLEFYGYIKL -----------------3333-----3333----------------------3333---- INFIRLKNPTVEYMNSIYNPVPWEKDEYLKPVLEDDLLLQFDVEDLYEPVSTPFSSGPSS ---------3333-----------3333-------3333--3333--------------- G 
- ------------------------------------------------------------ ------------------------------------------------------------ ---- >TWITCHIN 18TH IGSF MODULE; SWP:Q23551; PDB:1WIT; LKPKILTASRKIKIKAGFTHNLEVDFIGAPDPTATWTVGDSGAALAPELLVDAKSSTTSI ---------------------------------------------1111----1111--- FFPSAKRADSGNYKLKVKNELGEDEAIFEVIVQ -----3333------------------------ >UBIQUITIN-SPECIFIC PROTEA; SWP:Q8L6Y1; PDB:1WIVA; GSSGSSGLLSHMDDPDIDAPISHQTSDIDQSSVDTLLSFGFAEDVARKALKASGGDIEKA -------------------------------------------------------3333- TDWVFNNSGPSSG -3333-------- >GLUCOSE-6-PHOSPHATE ISOME; SWP:Q72J00; PDB:1WIWA; RDLDREETYLVDRTGLALELRDLVGTGPVPGEAYPGPHAALGYGEGQFAALLSGLPDWGE -11113333--1111-------2222----------------!!!!----3333------ EGTLFLLEGGYDLGEAAGAAETGRARVVRVGFRPGVEVHIPPSPLAPYRYLRFLLLATGR -----------2222------!!!!-------1111------1111-------------- EEVLRSVDEALLEERRRLGPEVPVEENPAKFLAYTLLERLPLFYSPLFRPLEGAVQTLFA -------------3333-11113333---------2222-----3333------------ RVAKSLSLTPPPSALEFFLVGLEARHEQGDPLAAVLLGPGEEAALAKEILESRVDALAEV -------------------1111-3333---------------------1111------- PATGANRLAQVALWYRAWTAYYLALLYGVDPGDHGLLE -----3333----------------------------- >HOOK HOMOLOG 1; SWP:Q8BIL5; PDB:1WIXA; GSSGSSGLPLCDSLIIWLQTFKTASPCQDVKQLTNGVTMAQVLHQIDVAWFSESWLSRIK -------3333-----------------33333333-------33331111----1111- DDVGDNWRIKASNLKKVLHGITSYYHEFLGQQISEELIPDLNQITECADPVELGRLLQLI -3333-3333----------3333---------3333--3333-----3333-------- LGCAVNCEKKQEHIKNIMTLEESVQHVVMTAIQELMSKSGPSSG -1111-1111------3333------------3333-------- >DNA-BINDING PROTEIN SATB2; SWP:Q9UPW6; PDB:1WIZA; GSSGSSGKPEPTNSSVEVSPDIYQQVRDELKRASVSQAVFARVAFNRTQGLLSEILRKEE ------------------1111-------------------------------------- DPRTASQSLLVNLRAMQNFLNLPEVERDRIYQDERSGPSSG ------------------33333333--------------- >SQUAMOSA PROMOTER-BINDING; SWP:Q9S7P5; PDB:1WJ0A; AICCQVDNCGADLSKVKDYHRRHKVCEIHSKATTALVGGIMQRFCQQCSRFHVLEEFD ------------------1111-----3333--------------3333--------- >NUMB 
PROTEIN; SWP:Q9QZS3; PDB:1WJ1A; GSSGSSGASRPHQWQTDEEGVRTGKCSFPVKYLGHVEVDESRGMHICEDAVKRLKATGKK -------------3333-3333--------------------11113333---------- AVKAVLWVSADGLRVVDEKTKDLIVDQTIEKVSFCAPDRNFDRAFSYICRDGTTRRWICH --------3333------------------------------------------------ CFMAVKDTGERLSHAVGCAFAACLERKQKRSGPSSG ---------3333----------------------- >PROBABLE WRKY TRANSCRIPTI; SWP:Q9XI90; PDB:1WJ2A; VQTTSEVDLLDDGYRWRKYGQKVVKGNPYPRSYYKCTTPGCGVRKHVERAATDPKAVVTT -----------------------1111----------2222------------------- YEGKHNHDLPA ----------- ------------------------------------------------------------ --------------------------------------------------------- >KIAA0794 PROTEIN; SWP:O94888; PDB:1WJ4A; GSSGSSGTATNHQGLPAVDSEILEMPPEKADGVVEGIDVNGPKAQLMLRYPDGKREQITL -------------------------------------------------3333------- PEQAKLLALVKHVQSKGYPNERFELLTNFPRRKLSHLDYDITLQEAGLCPQETVFVQESG 11113333----------3333---------------------3333------------- PSSG ---- >HYPOTHETICAL PROTEIN (RIK; SWP:Q8K2X3; PDB:1WJ5A; GSSGSSGNKDNLDLAGLTSLLSEKIKEFLQEKKMQSFYQQELETVESLQSLASRPVTHST ------------3333---------------------333311113333-1111------ GSDQVELKDSGTSGVAQRVFKNALQLLQEKGLVFQRDSGSDKLYYVTTKDKDLQSGPSSG ------------3333-------------------------------------------- >HYPOTHETICAL PROTEIN (RSG; SWP:Q80X50; PDB:1WJ7A; GSSGSSGNQNQTQHKQRPQATAEQIRLAQMISDHNDADFEEKVKQLIDITGKNQDECVIA ------------------------------3333-------------------------- LHDCNGDVNRAINVLLEGNPDTHSWEMVGKKKGVSGQKSGPSSG -------------------------------------------- >CRISPR-ASSOCIATED PROTEIN; SWP:Q53WG9; PDB:1WJ9A; MWLTKLVLNPASRAARRDLANPYEMHRTLSKAVSRALEEGRERLLWRLEPARGLEPPVVL --------3333------------------------1111-----------!!!!----- VQTLTEPDWSVLDEGYAQVFPPKPFHPALKPGQRLRFRLRANPAKRLKTPAEKVAWLERR -------3333-2222-------------2222--------------------------- LEEGGFRLLEGERGPWVQILQDTFLEQVQAVLFEGRLEVVDPERALATLRRGVGPGKALG -------------------------------------------------------1111- LGLLSVAP -------- >HIV-1 INTEGRASE; SWP:P12497; PDB:1WJBA; 
FLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVASCDKCQLKGEAMHGQVD ------------3333------------------------3333----------- >PROBABLE ATP BINDING PROT; SWP:Q5SJV7; PDB:1WJGA; FKTILLAYDGSEHARRAAEVAKAEAEAHGARLIVVHAYEPVPDYLGEPFFEEALRRRLER -----------------------------------------3333--------------- AEGVLEEARALTGVPKEDALLLEGVPAEAILQAARAEKADLIVMGTRGLGALGSLFLGSQ --------------3333-----------------------------------3333--- SQRVVAEAPCPVLLV --------------- >TUDOR DOMAIN CONTAINING P; SWP:Q9H7E2; PDB:1WJIA; GSSGSSGVDEKALKHITEMGFSKEASRQALMDNGNNLEAALNVLLTSNKQKPVMGPPSGP --------3333---3333-------------------------3333------------ SSG --- >HYPOTHETICAL PROTEIN F20O; SWP:O49453; PDB:1WJJA; GSSGSSGSTVKRKPVFVKVEQLKPGTTGHTLTVKVIEANIVVPVTRKTRPASSLSRPSQP -----------------3333-2222---------------------------------- SRIVECLIGDETGCILFTARNDQVDLMKPGATVILRNSRIDMFKGTMRLGVDKWGRIEAT ----------------------3333-------------------------1111----- GAASFTVKEDNNLSLVEYESGPSSG ------------------------- >C330018D20RIK PROTEIN; SWP:Q9CWB7; PDB:1WJKA; GSSGSSGNLSASNRALPVLTLFTKAPCPLCDEAKEVLQPYKDRFILQEVDITLPENSTWY ----------------------------3333----1111---------33331111--- ERYKFDIPVFHLNGQFLMMHRVNTSKLEKQLRKLSGPSSG -----------%%%%-------3333-------------- >CYPHER PROTEIN; SWP:NA; PDB:1WJLA; GSSGSSGMSYSVTLTGPGPWGFRLQGGKDFNMPLTISRITPGSKAAQSQLSQGDLVVAID ---------------------------1111--------22221111---2222----ii GVNTDTMTHLEAQNKIKSASYNLSLTLQKS ii-1111--------3333----------- >BETA-SPECTRIN III; SWP:O15020; PDB:1WJMA; GSSGSSGEQMEGMLCRKQEMEAFGKKAANRSWQNVYCVLRRGSLGFYKDAKAASAGVPYH ---------------------------------------%%%%---------1111---- GEVPVSLARAQGSVAFDYRKRKHVFKLGLQDGKEYLFQAKDEAEMSSWLRVVNAAIASGP ------------------------------------------------------------ SSG --- >TUBULIN-FOLDING PROTEIN T; SWP:Q8CIV8; PDB:1WJNA; GSSGSSGQLLTLKIKCSNQPERQILEKQLPDSMTVQKVKGLLSRLLKVPVSELLLSYESS ---------------3333----------3333--------3333---3333------33 KMPGREIELENDLQPLQFYSVENGDCLLVRWSGPSSG 33------------3333------------------- >T-PLASTIN; 
SWP:P13797; PDB:1WJOA; GSSGSSGNDDIIVNWVNRTLSEAGKSTSIQSFKDKTISSSLAVVDLIDAIQPGCINYDLV ------------------------------1111-3333----------------3333- KSGNLTEDDKHNNAKYAVSMARRIGARVYALPEDLVEVKPKMVMTVFACLMGRGMKRVSG -----3333-------------------------1111----------3333-------- PSSG ---- >ZINC FINGER PROTEIN 295; SWP:Q9ULJ3; PDB:1WJPA; GSSGSSGASPVENKEVYQCRLCNAKLSSLLEQGSHERLCRNAAVCPYCSLRFFSPELKQE ------------------------------------------------------------ HESKCEYKKLTCLECMRTFKSSFSIWRHQVEVHNQNNMAPTSGPSSG 3333-3333---1111-----3333---------------------- >KIAA1798 PROTEIN; SWP:Q96JM7; PDB:1WJQA; GSSGSSGVKPPHGFQKKMKLEVVDKRNPMFIRVATVADTDDHRVKVHFDGWNNCYDYWID -----------------------3333------------------------3333----1 ADSPDIHPVGWCSKTGHPLQPPLSPLELMEASEHGGCSTPGSGPSSG 111----------------------1111------------------ >KIAA1617 PROTEIN; SWP:Q5VUG0; PDB:1WJRA; GSSGSSGPIDLITVGSLIELQDSQNPFQYWIVSVIENVGGRLRLRYVGLEDTESYDQWLF ------3333---------------------------iiii-------3333-------- YLDYRLRPVGWCQENKYRMDPPSEIYPLKMASEWKCTLEKSLIDAAKFPLPMEVFKDHAD ---------3333--------333333333333---------3333----3333------ LSGPSSG ------- >KIAA1798 PROTEIN; SWP:Q96JM7; PDB:1WJSA; GSSGSSGPYNKNGFKVGMKLEGVDPEHQSVYCVLTVAEVCGYRIKLHFDGYSDCYDFWVN -----------------------1111------------------------3333----- ADALDIHPVGWCEKTGHKLHPPKGYKEEEFNWQTYLKTCKAQAAPKSLFENQNITVIPSG ----------3333-----------3333-------1111----3333------------ FSGPSSG ------- >TRANSCRIPTION ELONGATION ; SWP:NA; PDB:1WJTA; GSSGSSGMGLEEELLRIAKKLEKMVSRKKTEGALDLLKKLNSCQMSIQLLQTTRIGVAVN ---------------------------------------------3333----3333--- GVRKHCSDKEVVSLAKVLIKNWKRLLDSPRTTKGERESGPSSG --------3333--------3333------------------- >NEDD8 ULTIMATE BUSTER-1; SWP:Q9Y5A7; PDB:1WJUA; GSSGSSGDNYRTTGIATIEVFLPPRLKKDRKNLLETRLHITGRELRSKIAETFGLQENYI ----------------------------------------3333---------------- KIVINKKQLQLGKTLEEQGVAHNVKAMVLELKQSSGPSSG ---%%%%--11113333----------------------- >CELL GROWTH REGULATING NU; SWP:NA; PDB:1WJVA; 
GSSGSSGMVFFTCNACGESVKKIQVEKHVSNCRNCECLSCIDCGKDFWGDDYKSHVKCIS --------------------3333----3333---------------------------- EGQKYGGKGYEAKSGPSSG ------------------- >PHOSPHOACETYLGLUCOSAMINE ; SWP:Q9CYR6; PDB:1WJWA; GSSGSSGAIYVDLPNRQLKVKVADRRVISTTDAERQAVTPPGLQEAINDLVKKYTLARAF ------------------------------------------3333-------------- VRPSGTEDIVRVYAEANSQESADRLAYEVSLLVFQLAGGIGERPQPSGPSSG ----------------------------------1111-------------- >SSRA-BINDING PROTEIN; SWP:Q8RR57; PDB:1WJXA; VLENRRARHDYEILETYEAGIALKGTEVKSLRAGKVDFTGSFARFEDGELYLENLYIAPV -----------------------!!!!---1111---2222----iiii----------- DPRRKRKLLLHKHELRRLLGKVEQKGLTLVPLKIYFNERGYAKVLLGLARGK 1111-------------2222--2222---------1111------------ >1700030A21RIK PROTEIN; SWP:Q91ZF0; PDB:1WJZA; GSSGSSGMALEQTLKKDWYSILGADPSANMSDLKQKYQKLILLYHPDKQSADVPAGTMEE -------------------1111------3333-----------3333------------ CMQKFIEIDQAWKILGNEETKKKYDLQRSGPSSG ---------------------------------- >KIAA0970 PROTEIN; SWP:Q9Y2H6; PDB:1WK0A; GSSGSSGDEETKAFEALLSNIVKPVASDIQARTVVLTWSPPSSLINGETDESSVPELYGY ------3333-----3333-------------------------iiii------------ EVLISSTGKDGKYKSVYVGEETNITLNDLKPAMDYHAKVQAEYNSIKGTPSEAEIFTTLS ------------------------------------------%%%%-------------- CEPDIPNPPRISGPSSG ----------------- >HYPOTHETICAL PROTEIN YK10; SWP:Q19853; PDB:1WK1A; GSSGSSGVKFLTVNDDILSMPQARNFCASAGGYLADDLGDDKNNFYSSIAANTQFWIGLF ------------------1111-----------------3333----------------- KNSDGQFYWDRGQGINPDLLNQPITYWANGEPSNDPTRQCVYFDGRSGDKSKVWTTDTCA -3333----------------------2222---3333-----3333-1111-------- TPRPFICQKHRYDSDHKPNTIGDASGPSSG ------------------------------ >HYPOTHETICAL PROTEIN; SWP:NA; PDB:1WK2A; ERPKLGLIVREPYASLIVDGRKVWEIRRRKTRHRGPLGIVSGGRLIGQADLVGVPLYAWV ----------------1111--------------------%%%%---------------- LENAFRYEKPLHVPFVDLSEVR -----------------1111- >VALYL-TRNA SYNTHETASE; SWP:P96142; PDB:1WKAA; GKLYTLRYEVEGGGFIEIATVRPETVFADQAIAVHPEDERYRHLLGKRARIPLTEVWIPI 
---------2222--------33331111-----1111--3333------2222------ LADPAVEKDFGTGALKVTPAHDPLDYEIGERHGLKPVSVINLEGRMEGERVPEALRGLDR --11111111-------3333-------------------1111---33333333----- FEARRKAVELFREAGHLVKEEDY -----------1111-------- >LEUCYL-TRNA SYNTHETASE; SWP:O58698; PDB:1WKBA; LNFKAIEEKWQKRWLEAKIFEPNIRDKPKEKKFYITVAFPYLSGHLHVGHARTYTIPDVI --------------1111----3333-3333----------------------------- ARFKRMQGYNVLFPMAWHITGSPIVGIAERIKNRDPKTIWIYRDVYKVPEEILWTFEDPI ----1111----------------------------------------3333-------- NIVKYFMKAAKETFIRAGFSVDWSREFYTTSLFPPFSKFIEWQFWKLKEKGYIVKGAHRV --------------1111---3333----1111--------------1111--------- RWDPVVGTPLGDHDLMEGEDVPILDYIIIKFELRENGEVIYLPAATLRPETVYGVTNMWV -----------------3333-----------------------------1111------ NPNATYVKAKVRRKDKEETWIVSKEAAYKLSFQDREIEVIEEFKGEKLIGKYVRNPVSGD 1111-------------------------1111------------1111----------- EVIILPAEFVDPDNATGVVMSVPAHAPFDHVALEDLKRETEILEKYDIDPRIVFPAVEEV ----------1111-------33333333-3333-------------------------1 NKLGIKSQKDKEKLEQATKTIYKAEYHKGIFKVPPYEGKPVQEVKEAIAKEMLEKGIAEI 111--------------------3333------------3333----------------- MYEFAEKNVISRFGNRAVIKIIHDQWFIDYGNPEWKEKARKALERMKILPETRRAQFEAI ----------1111--------------1111----------1111---3333------- IDWLDKKACARKIGLGTPLPWDPEWVIESLSDSTIYMAYYTISRHINKLRQEGKLDPEKL ------------------1111-----1111------3333--------1111--3333- TPEFFDYIFLEEFSEDKEKELEKKTGIPAEIIHEMKEEFEYWYPLDWRCSGKDLIPNHLT --------------------------------------------------3333------ FFIFNHVAIFREEHWPKGIAVNGFGTLEGQKMSKSKGNVLNFIDAIEENGADVVRLYIMS ----------3333------------iiii--3333---------------------333 LAEHDSDFDWRRKEVGKLRKQIERFYELISQFAEYEVKGNVELKDIDRWMLHRLNKAIKE 3-!!!!-------------------------3333------------------------- TTNALEEFRTRTAVQWAFYSIMNDLRWYLRRTEGRDDEAKRYVLRTLADVWVRLMAPFTP ----1111-----------------------2222-------------------3333-- HICEELWEKLG ----------- >HB8 TT1367 PROTEIN; SWP:Q5SHW9; PDB:1WKCA; TKAELRRRARAAWRRLDLKALSRAVGAALLPWLRERGFRHILLYHPLPHELNLLPLEAYP 
-----------1111-------------------------------------3333---- ARYYLPKVAGKGLTVHPFGPLAEPTTPPEDPRVLDLVVVPGLAFDREGYRLGHGQGFYDR --------!!!!-----------------3333-----------1111------------ FLKEVRAATVGVVPQALLFPALPRDPWDVPVDHLATEAGVEAVKRP 3333---------3333-------1111-------1111------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q5SLV5; PDB:1WKJA; MERTFVMIKPDGVRRGLVGEILARFERKGFRIAALKLMQISQELAERHYAEHREKPFFPG ------------1111--------------------------------3333--1111-- LVRFITSGPVVAMVLEGPGVVAEVRKMMGATHPKDALPGTIRGDFATTIDENVIHGSATL ---1111------------------------3333-2222-------3333--------- EDAQREIALFFRPEELL -----------1111-- >TERMINAL FLOWER 1 PROTEIN; SWP:P93003; PDB:1WKOA; RVIEPLIMGRVVGDVLDFFTPTTKMNVSYNKKQVSNGHELFPSSVSSKPRVEIHGGDLRS 1111--1111------------------iiii--2222--3333------------1111 FFTLVMIDPDVPGPSDPFLKEHLHWIVTNIPGTTDATFGKEVVSYELPRPSIGIHRFVFV ------------33331111----------22223333---------------------- LFRQKQRRVIFPNIPSRDHFNTRKFAVEYDLGLPVAAVFFNAQRE --------------------------------------------- >FLOWERING LOCUS T PROTEIN; SWP:Q9SXZ2; PDB:1WKPA; RDPLIVSRVVGDVLDPFNRSITLKVTYGQREVTNGLNLRPSQVQNKPRVEIGGEDLRNFY -1111---------------------!!!!--2222--3333------------1111-- TLVMVDPDVPSPSNPHLREYLHWLVTDIPATTGTTFGNEIVSYENPSPTAGIHRVVFILF ----------11111111----------22223333------------------------ RQLGRQTVYAPGWRQNFNTREFAEIYNLGLPVAAVFYNSQRES ------------------------------------------- >GUANINE DEAMINASE; SWP:O34598; PDB:1WKQA; MNHETFLKRAVTLACEGVNAGIGGPFGAVIVKDGAIIAEGQNNVTTSNDPTAHAEVTAIR -3333------------1111----------iiii-------------1111-------- KACKVLGAYQLDDCILYTSCEPCPMCLGAIYWARPKAVFYAAEHTDAAEAGFDDSFIYKE ----------2222----------------------------3333-1111-------33 IDKPAEERTIPFYQVTLTEHLSPFQAWRNFANKKEY 33-3333---------1111---------1111--- >POLYPOROPEPSIN; SWP:P17576; PDB:1WKRA; AAGSVPATNQLVDYVVNVGVGSPATTYSLLVDTGSSNTWLGADKSYVKTSTSSATSDKVS ------------------------------------------------1111-------- VTYGSGSFSGTEYTDTVTLGSLTIPKQSIGVASRDSGFDGVDGILGVGPVDLTVGTLSPH 
--1111------------!!!!---------------2222---------3333-----1 TSTSIPTVTDNLFSQGTIPTNLLAVSFEPTTSESSTNGELTFGATDSSKYTGSITYTPIT 111---------1111-----------------------------1111----------- STSPASAYWGINQSIRYGSSTSILSSTAGIVDTGTTLTLIASDAFAKYKKATGAVADNNT -------------------------------1111------------------------- GLLRLTTAQYANLQSLFFTIGGQTFELTANAQIWPRNLNTAIGGSASSVYLIVGDLGSDS -------------------iiii----3333---1111-1111----------------- GEGLDFINGLTFLERFYSVYDTTNKRLGLATTSFTTATSN --------33331111-----1111------1111----- >YEAST KILLER TOXIN; SWP:P10410; PDB:1WKT; GDGYLIMCKNCDPNTGSCDWKQNWNTCVGIGANVHWMVTGGSTDGKQGCATIWEGSGCVG -----------3333--------------------------------------------- RSTTMCCPANTCCNINTGFYIRSYRRVE ---------------------------- >ALPHA-ACTININ 3; SWP:Q08043; PDB:1WKUA; AWEKQQRKTFTAWCNSHLRKAGTQIENIEEDFRNGLKLMLLLEVISGERLPRPDKGKMRF ----------------3333------1111-1111----------------------333 HKIANVNKALDFIASKGVKLVSIGAEEIVDGNLKMTLGMIWTIILRFAIQDISVEETSAK 3------------1111-------------------------------1111-%%%%--- EGLLLWCQRKTAPYRNVNVQNFHTSWKDGLALCALIHRHRPDLIDYAKLRKDDPIGNLNT ----------3333--------3333-------------3333-3333-1111------- AFEVAEKYLDIPKMLDAEDIVNTPKPDEKAIMTYVSCFYHAFAGA --------------------------------------------- >CYSTEINE SYNTHASE; SWP:Q9YBL2; PDB:1WKVA; ALADISGYLDVLDSVRGFSYLENAREVLRSGEARCLGNPRSEPEYVKALYVIGASRIPVG ---3333--3333----3333----------------3333--------1111------- DGCSHTLEELGVFDISVPGEMVFPSPLDFFERGKPTPLVRSRLQLPNGVRVWLKLEWYNP -----3333-1111---1111------------------------%%%%----------- FSLSVKDRPAVEIISRLSRRVEKGSLVADATSSNFGVALSAVARLYGYRARVYLPGAAEE ---------------------2222-----------------------------111133 FGKLLPRLLGAQVIVDPEAPSTVHLLPRVMKDSKNEGFVHVNQFYNDANFEAHMRGTARE 33----1111-----3333-3333-----------------3333--------------- IFVQSRRGGLALRGVAGSLGTSGHMSAAAFYLQSVDPSIRAVLVQPAQGDSIPGIRRVET -----1111-----------------------3333---------------2222-3333 GMLWINMLDISYTLAEVTLEEAMEAVVEVARSDGLVIGPSGGAAVKALAKKAAEGDLEPG 
--3333---------------------------------------------1111----- DYVVVVPDTGFKYLSLVQNALE --------3333------1111 >ENDO-BETA-1,4-MANNANASE; SWP:Q4W8M3; PDB:1WKYA; GRPANSGFYVSGTTLYDANGNPFVMRGINHGHAWYKDQATTAIEGIANTGANTVRIVLSD ----------!!!!--1111----------33331111--------1111---------- GGQWTKDDIQTVRNLISLAEDNNLVAVLEVHDATGYDSIASLNRAVDYWIEMRSALIGKE -------------------1111-------1111-----------------33332222- DTVIINIANEWFGSWDGAAWADGYKQAIPRLRNAGLNNTLMIDAAGWGQFPQSIHDYGRE --------------------------------------------%%%%------------ VFNADPQRNTMFSIHMYEYAGGNASQVRTNIDRVLNQDLALVIGEFGHRHTNGDVDESTI ----1111-------------------------1111------------1111------- MSYSEQRGVGWLAWSWKGNGPEWEYLDLSNDWAGNNLTAWGNTIVNGPYGLRETSKLSTV -------------------33331111---3333------------2222-------333 FTPTTLYDFEESTQGWTGSSLSRGPWTVTEWSSKGNHSLKADIQMSSNSQHYLHVIQNRS 3-----------iiii-----------------------------2222----------- LQQNSRIQATVKHAGMTARLYVKTGHGYTWYSGSFVPINGSSGTTLSLDLSNVQNLSQVR ------------------------1111---------------------1111-1111-- EIGVQFQSESNSSGQTSIYIDNVIVE -------------------------- >ACETYL-COENZYME A ACETYLT; SWP:Q9BWD1; PDB:1WL4A; GSDPVVIVSAARTIIGSFNGALAAVPVQDLGSTVIKEVLKRATVAPEDVSEVIFGHVLAA ----------------2222-11113333---------------3333----------22 GCGQNPVRQASVGAGIPYSVPAWSCQMIGSGLKAVCLAVQSIGIGDSSIVVAGGMENMSK 22---------1111-3333------------------------------------3333 APHLAYLRTGVKIGEMPLTDSILCDGLTDAFHNCHMGITAENVAKKWQVSREDQDKVAVL ----------------------------------3333------1111------------ SQNRTENAQKAGHFDKEIVPVLVSTRKGLIEVKTDEFPRHGSNIEAMSKLKPYFLTDGTG -----------1111---------1111----------222233331111---------- TVTPANASGINDGAAAVVLMKKSEADKRGLTPLARIVSWSQVGVEPSIMGIGPIPAIKQA --1111--------------------------------------33331111-------- VTKAGWSLEDVDIFEINEAFAAVSAAIVKELGLNPEKVNIEGGAIALGHPLGASGCRILV -3333-3333-----------------------1111-11113333---33333333--- TLLHTLERMGRSRGVAALCIGGGMGIAMCVQRE --------------------------------- >ARABINANASE-TS; SWP:Q93HT9; PDB:1WL7A; 
VHFHPFGNVNFYEMDWSLKGDLWAHDPVIAKEGSRWYVFHTGSGIQIKTSEDGVHWENMG 1111-----3333------------------!!!!------2222--------------- RVFPSLPDWCKQYVPEKDEDHLWAPDICFYNGIYYLYYSVSTFGKNTSVIGLATNRTLDP ------3333---3333------------iiii--------2222-------------11 RDPDYEWKDMGPVIHSTASDNYNAIDPNVVFDQEGQPWLSFGSFWSGIQLIQLDTETMKP 11--------------1111-----------1111--------!!!!------------- AAQAELLTIASRGEEPNAIEAPFIVCRNGYYYLFVSFDFCCRGIESTYKIAVGRSKDITG 1111----------------------iiii-----------!!!!-----------1111 PYVDKNGVSMMQGGGTILDAGNDRWIGPGHCAVYFSGVSAILVNHAYDALKNGEPTLQIR ---1111-1111-----------------------!!!!-----------%%%%------ PLYWDDEGWPYL ----1111---- >GMP SYNTHASE [GLUTAMINE-H; SWP:O59071; PDB:1WL8A; MMIVIMDNGGQYVHRIWRTLRYLGVETKIIPNTTPLEEIKAMNPKGIIFSGGPSLENTGN ---------1111-----------------1111-----1111----------1111!!! CEKVLEHYDEFNVPILGICLGHQLIAKFFGGKVGRGEKAEYSLVEIEIIDEEIFKGLPKR !-----3333-------------------------------------------2222--- LKVWESHMDEVKELPPKFKILARSETPIEAMKHEELPIYGVQFHPEVAHTEKGEEILRNF --------------2222-------------------------1111------------- AKLCGE -1111- >SERYL-TRNA SYNTHETASE; SWP:Q9N0F3; PDB:1WLEA; RNLLYEHAREGYSALPLLDMESLCAYPEDAARALDLRKGELRSKDLPGIISTWQELRQLR -------1111-----------------------1111---1111--------------- EQIRSLEEEKEAVTEAVRALVVNQDNSQVQQDPQYQSLRARGREIRKQLTLLYPKEAQLE ---------------------------3333----------------------------- EQFYLRALRLPNQTHPDVPVGDESQARVLHVVGDKPAFSFQPRGHLEIAEKLDIIRQKRL -----1111-----1111---3333----------------------------------1 SHVSGHRSYYLRGAGALLQHGLVNFTLNKLIHRGFTPMTVPDLLRGVVFEGCGMTPNAKP 111---------------------------1111----------3333-1111-1111-- SQIYNIDPSRFEDLNLAGTAEVGLAGYFMDHSVAFRDLPIRMVCSSTCYRAETDTGPWGL ------1111-----------------------3333----------------------- YRVHHFTKVEMFGVTGPGLEQSSELLEEFLSLQMEILTELGLHFRVLDMPTQELGLPAYR -------------------------------------1111--------1111------- KFDIEAWMPGRGRFGEVTSASNCTDFQSRRLHIMFQTEAGELQFAHTVNATGCAVPRLLI --------3333----------!!!!----------3333-------------------- 
ALLESYQQKDGSVLVPPALQPYLGTDRITTPTHVPLQYIGPNQPQ -------1111----3333-------------------------- >PEROXISOME BIOGENESIS FAC; SWP:Q5BL07; PDB:1WLFA; GGAVVTVAFTNARDCFLHLPRRLVAQLHLLQNQAIEVASDHQPTYLSWVEGRHFNENVAE ------------------------1111-2222--------------------------- INRQVGQKLGLSSGDQVFLRPCSHVVSCQQVEVEPLSADDWEILELHAISLEQHLLDQIR ------1111-2222----------------------------1111------------- IVFPKAVVPIWVDQQTYIFIQIVTLMPAAPYGRLETNTKLLIQP --2222------1111------------------1111------ >FLAGELLAR HOOK PROTEIN FL; SWP:P16322; PDB:1WLGA; GLDVAISQNGFFRLVDSNGSVFYSRNGQFKLDENRNLVNMQGMQLTGYPATGTPPTIQQG ---------------1111------------1111---1111------------------ ANPAPITIPNTLMAAKSTTTASMQINLNSTDPVPSKTPFSVSDADSYNKKGTVTVYDSQG ---------------------------1111--------1111-------------1111 NAHDMNVYFVKTKDNEWAVYTHDSSDPAATAPTTASTTLKFNENGILESGGTVNITTGTI ------------2222------1111---------------1111--------------% NGATAATFSLSFLNSMQQNTGANNIVATNQNGYKPGDLVSYQINNDGTVVGNYSNEQEQV %%%--------2222----------------------------1111-----1111---- LGQIVLANFANNEGLASQGDNVWAATQASGVALLGTAGSGNFGKLTNGALEAS ----------3333-----------3333------2222-------------- >INTERFERON STIMULATED GEN; SWP:Q96AZ6; PDB:1WLJA; EVVAMDCEMVGLGPHRESGLARCSLVNVHGAVLYDKFIRPEGEITDYRTRVSGVTPQHMV ------------1111----------1111-----------------3333---333322 GATPFAVARLEILQLLKGKLVVGHDLKHDFQALKEDMSGYTIYDTSTDRLLWREAKLVSL 22-3333--------2222------------------------3333-----1111---- RVLSERLLHKSIQNSLLGHSSVEDARATMELYQISQRIRARRGLPRLA --------------1111---------------------1111----- >PROTEIN CGI-38; SWP:Q9CRB6; PDB:1WLMA; GSSGSSGMAASTDIAGLEESFRKFAIHGDPKASGQEMNGKNWAKLCKDCKVADGKAVTGT ----------------------------3333--------------1111-------333 DVDIVFSKVKAKSARVINYEEFKKALEELATKRFKGKSKEEAFDAICQLIAGKEPANIGV 3---1111-----------------------------3333------------------- TKAKTGGAVDRLTDTSKYTGSHKERSGPSSG ------------------------------- >AFADIN; SWP:Q9QZQ1; PDB:1WLNA; GSSGSSGPEKLPYLVELSPDGSDSRDKPKLYRLQLSVTEVGTEKFDDNSIQLFGPGIQPH 
------1111-------1111--------------------------------------- HCDLTNMDGVVTVTPRSMDAETYVDGQRISETTMLQSGMRLQFGTSHVFKFVDPSGPSSG ----------------3333---%%%%-------------------------3333---- >SUFE PROTEIN; SWP:Q72KV6; PDB:1WLOA; MVPPKLKQALELFKSLPKELRSQVLLEYAAKVPPPPPGVELERVHECQTPFFVHADVEGG --3333-------------------------------------3333------------- KVRLYFHVPDEAPTVKAFAGLLREGLEGESPEAVLEVPPGFYRGYGLEEFFTPLRLRGLE -------11113333----------2222--3333-----------1111---3333--- AALLRLQAQVRKALTS -----------3333- >GEMININ; SWP:O88513; PDB:1WLQA; KENPSSQYWKEVAEQRRKALYEALKENEKLHKEIEQKDSEIARLRKENKDLAEVAEHVQY --3333------------------------------------------------------ AEVIERLSN ---3333-- >DNA replication factor Cd; SWP:Q8R4E9; PDB:1WLQC; KAPAYQRFHALAQPGLPGLVLPYKYQVLVEFHSDTIVSLHNRSETVTFAKVKQGVQERKR --3333-3333----------3333--------------1111----------------- FEERNVGQIKTVYPSYRFRQECNVPTFKDSIKRSDYQLTIEPLLGQEGATQLTATCLLQR -3333--------------------------3333------------------------- RQVFRQNLVERVKEQHKVFLASLNPPAVPDDQLTRWHPRFNVDEVPDIEPAELPQPPV ----------------------------3333----11111111-------------- >L-ASPARAGINASE; SWP:O57797; PDB:1WLSA; RILILGGGTIASVKGERGYESALSVSKILKLAGISSEAKIEARDLNVDSTLIQPSDWERL -------3333------------3333-----1111-----------3333-3333---- AKEIEKEVWEYDGIVITHGTDTAYSASLSFLRNPPIPIVLTGSLPITEKNSDAPFNLRTA ----1111----------3333----------------------1111------------ LEFVKLGIRGIYIAFNGKVLGVRASKIRSGFDAFESINYPNVAEIKDDKLRILHIPDFYG --------------iiii-1111----------------------%%%%----------- DEFFSDIKYEPKVLVIKLIPGLSGDIVREALRLGYKGIILEGYGVGGIPYRGTDLFEVVS ------------------2222-------------------------------------- SISKRIPVVLTTQAIYDGVDLQRYKVGRIALEAGVIPAGDTKEATITKLWILGHTKNIEE 3333--------------------------1111---------------3333------- VKQLGKNITGELTRVS ---------------- >176aa long hypothetical d; SWP:Q96Z62; PDB:1WLTA; MPFEFENLGMGIILIKPKVFPDKRGFFLEVFKSEDFTKMRIPNVIQTNMSFSRKGVVRGL ---------------------3333-----------1111------------2222---- 
HYQRTPKEQGKIIFVPKGRILDVAVDVRKSSPTFGKYVKAELNEENHYMLWIPPGFAHGF ---------------------------1111-2222----------------2222---- QALEDSIVIYFITHNEYSPPHERCISYSYIDWPIKEVIISDKDLQCPSLEKAEVFD -----------------3333------------------3333----3333----- >PHENYLACETIC ACID DEGRADA; SWP:Q5SJP3; PDB:1WLUA; MRDPFMEALGLKVLHLAPGEAVVAGEVRADHLNLHGTAHGGFLYALADSAFALASNTRGP ------1111------2222-------1111-1111-------------------1111- AVALSCRMDYFRPLGAGARVEARAVEVNLSRRTATYRVEVVSEGKLVALFTGTVFRL --------------2222-----------1111--------iiii------------ >ALPHA-ACTININ 4; SWP:O43707; PDB:1WLXA; GSTEKQLEAIDQLHLEYAKRAAPFNNWMESAMEDLQDMFIVHTIEEIEGLISAHDQFKST -3333--1111-------------------------------3333-------------- LPDADREREAILAIHKEAQRIAESNHIKLSGSNPYTTVTPQIINSKWEKVQQLVPKRDHA -------------3333-3333-------------------------------------- LLEEQSKQQ ---1111-- >2-HALOACRYLATE REDUCTASE; SWP:Q59I44; PDB:1WLYA; VMAAVIHKKGGPDNFVWEEVKVGSPGPGQVRLRNTAIGVNFLDTYHRAPPIVVGFEAAAV ----------1111-----------2222----------3333-3333------------ VEEVGPGVTDFTVGERVCTCLPPLGAYSQERLYPAEKLIKVPKDLDLDDVHLAGLMLKGM ----2222---2222------------------3333----1111--------------- TAQYLLHQTHKVKPGDYVLIHAAAGGMGHIMVPWARHLGATVIGTVSTEEKAETARKLGC ------------2222-----11113333------------------------------- HHTINYSTQDFAEVVREITGGKGVDVVYDSIGKDTLQKSLDCLRPRGMCAAYGHASGVAD ----3333----------iiii---------3333----11112222------1111--- PIRVVEDLGVRGSLFITRPALWHYMSNRSEIDEGSKCLFDAVKAGVLHSSVAKTFPLREA --------3333-------3333------------------1111----------3333- AAAHKYMGGRQTIGSIVLLPQA -----3333------------- >CAP-binding protein compl; SWP:Q5THR3; PDB:1WLZA; HMATADRDILARLHKAVTSHYHAITQEFENFDTMKTNTISREEFRAICNRRVQILTDEQF -1111--------------------------1111------------------------- DRLWNEMPVNAKGRLKYPDFLSRFS ---1111--1111------------ >PROLINE IMINOPEPTIDASE; SWP:O32449; PDB:1WM1A; LRGLYPPLAAYDSGWLDTGDGHRIYWELSGNPNGKPAVFIHGGPGGGISPHHRQLFDPER ------------------------------1111--------------3333----1111 YKVLLFDQRGCGRSRPHASLDNNTTWHLVADIERLREMAGVEQWLVFGGSWGSTLALAYA 
-------2222----2222----------------------------------------- QTHPERVSEMVLRGIFTLRKQRLHWYYQDGASRFFPEKWERVLSILSDDERKDVIAAYRQ --3333------------3333-------3333-3333--1111--3333---------- RLTSADPQVQLEAAKLWSVWEGETVTLLPSRESASFGEDDFALAFARIENHYFTHLGFLE 1111----------------1111-----3333---------------------%%%%-- SDDQLLRNVPLIRHIPAVIVHGRYDMACQVQNAWDLAKAWPEAELHIVEGAGHSYDEPGI --3333-33331111------------------------3333----------1111333 LHQLMIATDRFAG 3--------1111 >UBIQUITIN-LIKE PROTEIN SM; SWP:P61956; PDB:1WM3A; HINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLSMRQIRFRFDGQPINETDTPAQL -------1111-------1111---------------3333----iiii--11113333- EMEDEDTIDVFQ --2222------ >NEUROTOXIN BMP01; SWP:Q9U8D2; PDB:1WM7A; ATCEDCPEHCATQNARAKCDNDKCVCEPK --1111-1111------------------ >CARBONYL REDUCTASE [NADPH; SWP:P16152; PDB:1WMAA; SGIHVALVTGGNKGIGLAIVRDLCRLFSGDVVLTARDVTRGQAAVQQLQAEGLSPRFHQL ------------------------------------------------1111-------- DIDDLQSIRALRDFLRKEYGGLDVLVNNAGIAFKVADPTPFHIQAEVTMKTNFFGTRDVC 1111-----------------------------2222----------------------- TELLPLIKPQGRVVNVSSIMSVRALKSCSPELQQKFRSETITEEELVGLMNKFVEDTKKG --3333-2222------------3333-----------------------------1111 VHQKEGWPSSAYGVTKIGVTVLSRIHARKLSEQRKGDKILLNACCPGWVRTDMAGPKATK -3333----------------------------2222-------------33331111-- SPEEGAETPVYLALLPPDAEGPHGQFVSEKRVEQW 3333-----------1111--------%%%%---- >D(-)-3-HYDROXYBUTYRATE DE; SWP:Q5KST5; PDB:1WMBA; MLKGKVAVVTGSTSGIGLGIATALAAQGADIVLNGFGDAAEIEKVRAGLAAQHGVKVLYD -2222-------------------1111-------------------------------- GADLSKGEAVRGLVDNAVRQMGRIDILVNNAGIQHTALIEDFPTEKWDAILALNLSAVFH --1111-------------------------------3333------------------- GTAAALPHMKKQGFGRIINIASAHGLVASANKSAYVAAKHGVVGFTKVTALETAGQGITA ---------------------1111---2222--------------------2222---- NAICPGWVRTPLVEKQISALAEKNGVDQETAARELLSEKQPSLQFVTPEQLGGTAVFLAS ---------------------------------------3333----------------3 DAAAQITGTTVSVDGGWTAR 333----------iiii--- >PROTEASE; SWP:Q93UV9; PDB:1WMDA; 
NDVARGIVKADVAQSSYGLYGQGQIVAVADTGLDTGRNDSSMHEAFRGKITALYALGRTN --------------------2222--------!!!!--33333333-----------222 NANDTNGHGTHVAGSVLGNGSTNKGMAPQANLVFQSIMDSGGGLGGLPSNLQTLFSQAYS 2-------------------------1111--------1111-1111-----------11 AGARIHTNSWGAAVNGAYTTDSRNVDDYVRKNDMTILFAAGNEGPNGGTISAPGTAKNAI 11-----------iiii-----------------------------------1111---- TVGATENLRPSFGSYADNINHVAQFSSRGPTKDGRIKPDVMAPGTFILSARSSLAPDSSF --------33331111-1111-1111----1111-----------------11113333- WANHDSKYAYMGGTSMATPIVAGNVAQLREHFVKNRGITPKPSLLKAALIAGAADIGLGY ----1111----3333-------------------------------------------- PNGNQGWGRVTLDKSLNVAYVNESSSLSTSQKATYSFTATAGKPLKISLVWSDAPASTTA -3333-----33331111---------2222--------3333-------------1111 SVTLVNDLDLVITAPNGTQYVGNDFTSPYNDNWDGRNNVENVFINAPQSGTYTIEVQAYN -------------1111---2222------------------------------------ VPVGPQTFSLAIVN -------------- >NETRIN RECEPTOR UNC5H2; SWP:Q8K1S3; PDB:1WMGA; YAFKIPLSIRQKICSSLDAPNSRGNDWRLLAQKLSDRYLNYFATKASPTGVILDLWEARQ ---------------33331111--------11111111--1111----------3333- QDDGDLNSLASALEEGKSELVAATDG ---3333------------------- >Partitioning defective 6 ; SWP:Q9NPB6; PDB:1WMHB; SIVEVKSKFDAEFRRFALPRASVSGFQEFSRLLRAVHQIPGLDVLLGYTDAHGDLLPLTN --------!!!!------1111----------------2222-------1111------- DDSLHRALASGPPPLRLLVQKR ---------------------- >HYPOTHETICAL PROTEIN PHS0; SWP:O73966; PDB:1WMIA; MTYRVKIHKQVVKALQSLPKAHYRRFLEFRDILEYEPVPREKFDVIKLEGTGDLDLYRAR -------------1111---------------------3333------------------ LGDYRVIYSVNWKDKVIKILKLKPRGRA -----------1111------------- >Putative uncharacterized ; SWP:O73967; PDB:1WMIB; GDVLKELERLKVEIQRLEAMLMPEERDEDITEEEIAELLELARDEDPENWIDAEELPEPE --------------------------3333--3333----1111-3333--3333----- D - >THIOREDOXIN H-TYPE; SWP:Q42443; PDB:1WMJA; MAAEEGVVIACHNKDEFDAQMTKAKEAGKVVIIDFTASWCGPCRFIAPVFAEYAKKFPGA -------------3333--1111--------------------1111---------1111 VFLKVDVDELKEVAEKYNVEAMPTFLFIKDGAEADKVVGARKDDLQNTIVKHVGATAASA 
---------3333--------------2222-------------3333------------ SA -- >RAS-RELATED PROTEIN RAB-9; SWP:P51151; PDB:1WMSA; AGKSSLFKVILLGDGGVGKSSLMNRYVTNKFDTTIGVEFLNKDLEVDGHFVTMQIWDTAG -------------2222----------------------------iiii----------- QERFRSLRTPFYRGSDCCLLTFSVDDSQSFQNLSNWKKEFIYYADVKEPESFPFVILGNK 33331111-3333---------1111---------------1111--3333--------3 IDISERQVSTEEAQAWCRDNGDYPYFETSAKDATNVAAAFEEAVRRVLAT 333-----------------------------2222-------------- >ISTX; SWP:P0C194; PDB:1WMTA; VHTNIPCRGTSDCYEPCEKKYNCARAKCMNRHCNCYNNCPW -----------1111-------------%%%%--------- >HEMOGLOBIN D ALPHA CHAIN; SWP:P83134; PDB:1WMUA; MLTEDDKQLIQHVWEKVLEHQEDFGAEALERMFIVYPSTKTYFPHFDLHHDSEQIRHHGK ---------------3333----------------3333---1111--2222-------- KVVGALGDAVKHIDNLSATLSELSNLHAYNLRVDPVNFKLLSHCFQVVLGAHLGREYTPQ -----------1111------------------3333----------------1111--- VQVAYDKFLAAVSAVLAEKYR ----------------1111- >Hemoglobin A/D subunit be; SWP:P83133; PDB:1WMUB; VHWTSEEKQYITSLWAKVNVGEVGGEALARLLIVYPWTQRFFASFGNLSSANAILHNAKV --------------11113333---------------33331111--------------- LAHGQKVLTSFGEAVKNLDNIKKTFAQLSELHCEKLHVDPENFKLLGNILIIVLATHFPK -------------1111---3333--------------3333------------------ EFTPASQAAWTKLVNAVAHALALGYH -------------------------- >WW DOMAIN CONTAINING OXID; SWP:Q9NZC7; PDB:1WMVA; GSAKRKRVAGDLPYGWEQETDENGQVFFVDHINKRTTYLDPRLAFTVDDNPTKP --------------------1111------------------------------ >GERANYLGERANYL DIPHOSPHAT; SWP:Q5SMD0; PDB:1WMWA; MVPAPEAIRQALQERLLARLDHPDPLYRDLLQDYPRRGGKMLRGLLTVYSALAHGAPLEA -----------------1111-----------3333---------------1111----- GLEAATALELFQNWVLVHDDIEDGSEERRGRPALHRLHPMPLALNAGDAMHAEMWGLLAE ---------------------------iiii-1111--3333------------------ GLARGLFPPEVLLEFHEVVRRTAYGQHLDLLWTLGGTFDLRPEDYFRMVAHKAAYYTAVA -------3333------------------------------------------------- PLRLGALLAGKTPPAAYEEGGLRLGTAFQIVDDVLNLEGGEAYGKERAGDLYEGKRTLIL ------1111---3333-----------------3333-------2222-1111------ 
LRFLEEAPPEERARALALLALPREAKPEAEVGWLLERLLASRALAWAKAEAKRLQAEGLA ---------------------3333----------------------------------- LLEAAFQDLPGKEALDHLRGLLAALVER --3333---------------------- >COG3291: FOG: PKD REPEAT; SWP:P71140; PDB:1WMXA; LLDVQIFKDSPVVGWSGSGMGELETIGDTLPVDTTVTYNGLPTLRLNVQTTVQSGWWISL -------------------------%%%%--------iiii------------------- LTLRGWNTHDLSQYVENGYLEFDIKGKEGGEDFVIGFRDKVYERVYGLEIDVTTVISNYV ---iiii---3333--------------------------1111----------3333-- TVTTDWQHVKIPLRDLMKINNGFDPSSVTCLVFSKRYADPFTVWFSDIKITSE -----------3333--------1111-------------------------- >lectin CEL-I, N-acetyl-D-; SWP:Q7M462; PDB:1WMZA; NQCPTDWEAEGDHCYRFFNTLTTWENAHHECVSYSCSTLNVRSDLVSVHSAAEQAYVFNY ---2222--!!!!------------------1111-1111-------------------1 WRGIDSQAGQLWIGLYDKYNEGDFIWTDGSKVGYTKWAGGQPDNWNNAEDYGQFRHTEGG 111-------------3333-----1111--------2222---%%%%--------%%%% AWNDNSAAAQAKYMCKLTFE -----1111----------- >HISTIDINE-CONTAINING PHOS; SWP:Q9SLX1; PDB:1WN0A; QLNALLSSMFASGLVDEQFQQLQMLQEDGGTPGFVAEVVTLFCDDADRIISELAALLDQP 3333-----1111--3333-----------2222---------------------1111- IVDFDKVDAYVHQLKGSSASVGAQKVKFTCMQFRQLCQDKNRDGCIMALAVVRNEFYDLR ------------------------------------------------------------ NKFQTMLQLEQ ----------- >DIPEPTIDASE; SWP:O58691; PDB:1WN1A; MRLEKFIHLLGERGFDGALISPGTNLYYLTGLRLHEVGERLAILAVSAEGDYRFLAPSLY ----------1111----------------------!!!!------1111------3333 ENVVNNFPATFWHDGENPYAKLREILEELGISKGRILIEDTMRADWLIGIMKLGKFTFQP 3333----------------------------------1111-------1111------3 LSSLIKELRMIKDKEEVKMMEHASRIADKVFEEILTWDLIGMKERELALKIELLIRELSD 333----3333----------------------1111-2222------------------ GIAFEPIVASGENAANPHHEPGERKIRKGDIIILDYGARWKGYCSDITRTIGLGELDERL ---------!!!!--1111--------------------iiii----------------- VKIYEVVKDAQESAFKAVREGIKAKDVDSRAREVISKAGYGEYFIHRTGHGLGLDVHEEP ------------------22223333--------3333-1111----------------- YIGPDGEVILKNGMTFTIEPGIYVPGLGGVRIEDDIVVDEGKGRRLTKAERELIIL --1111----2222------------------------%%%%-------------- 
>PEPTIDYL-TRNA HYDROLASE; SWP:O74017; PDB:1WN2A; MFKYKQVIVARADLKLSKGKLAAQVAHGAVTAAFEAYKKKREWFEAWFREGQKKVVVKVE ------------------------------------------------------------ SEEELFKLKAEAEKLGLPNALIRDAGLTEIPPGTVTVLAVGPAPEEIVDKVTGNLKLL 3333--------1111-------1111---2222---------333333331111--- >BLASTICIDIN-S DEAMINASE; SWP:P78986; PDB:1WN5A; PLSQEESTLIERATATINSIPISEDYSVASAALSSDGRIFTGVNVYHFTGGPCAELVVLG ----------------1111-------------1111---------1111---------- TAAAAAAGNLTCIVAIGNENRGILSPCGRCRQVLLDLHPGIKAIVKDSDGQPTAVGIREL --1111-----------%%%%----------------1111-----1111-----3333- LPSGY ----- >FAMILY B DNA POLYMERASE; SWP:Q9HH84; PDB:1WN7A; MILDTDYITEDGKPVIRIFKKENGEFKIEYDRTFEPYFYALLKDDSAIEEVKKITAERHG ---------iiii--------------------------------3333--------%%% TVVTVKRVEKVQKKFLGRPVEVWKLYFTHPQDVPAIRDKIREHPAVIDIYEYDIPFAKRY %---------------------------11111111------1111-------------- LIDKGLVPMEGDEELKMLAFDIETLYEEGEEFAEGPILMISYADEEGARVITWKNVDLPY --------------------------3333------------------------------ VDVVSTEREMIKRFLRVVKEKDPDVLITYNGDNFDFAYLKKRCEKLGINFALGRDGSEPK -------------------------------------------1111-----1111---- IQRMGDRFAVEVKGRIHFDLYPVIRRTINLPTYTLEAVYEAVFGQPKEKVYAEEITTAWE ---!!!!----2222-----------------------------------3333------ TGENLERVARYSMEDAKVTYELGKEFLPMEAQLSRLIGQSLWDVSRSSTGNLVEWFLLRK ---3333--------------------------3333--33331111------------- AYERNELAPNKPDEKELARRRQSYEGGYVKEPERGLWENIVYLDFRSLYPSIIITHNVSP -----------------------------------------------------1111-11 DTLNREGCKEYDVAPQVGHRFCKDFPGFIPSLLGDLLEERQKIKKKMKATIDPIERKLLD 11------------------------------------------1111---3333----- YRQRAIKILANSYYGYYGYARARWYCKECAESVTAWGREYITMTIKEIEEKYGFKVIYSD --------1111-3333-1111--------------------------1111-------- TDGFFATIPGADAETVKKKAMEFLKYINAKLPGALELEYEGFYERGFFVTKKKYAVIDEE ---------------------------------------------------------111 GKITTRGLEIVRRDWSEIAKETQARVLEALLKDGDVEKAVRIVKEVTEKLSKYEVPPEKL 1----------------1111-------------11113333------3333----1111 
VIHEQIPHVAVAKRLAATVISYIVLRAIPFDEFDPTKHKYDAEYYIENQVLPAVERILRA ------1111---------------------------------1111---3333------ FGYRKEDLR --------- >THE HYPOTHETICAL PROTEIN ; SWP:Q72IG8; PDB:1WNAA; VRVGRAAPRVSLEALKAALGGLKLSEAKVYLITDWQDKRDQARYALLLHTGKKDLLVPDA ------------------!!!!-1111----------1111------------------- FGPAFPGGEEALSELVGLLLAQGARRFYEAVVSPGETALLDLPPEELLKRVAIANPTDPG -3333--------------1111---------11113333-------------------1 IYL 111 >PUTATIVE BETAINE ALDEHYDE; SWP:P77674; PDB:1WNBA; MQHKLLINGELVSGEGEKQPVYNPATGDVLLEIAEASAEQVDAAVRAADAAFAEWGQTTP ------iiii---------------------------------------33331111--- KVRAECLLKLADVIEENGQVFAELESRNCGKPLHSAFNDEIPAIVDVFRFFAGAARCLNG -------------------------------3333------------------1111--- LAAGEYLEGHTSMIRRDPLGVVASIAPWNYPLMMAAWKLAPALAAGNCVVLKPSEITPLT ------2222--------------------------------1111-------3333--- ALKLAELAKDIFPAGVVNILFGRGKTVGDPLTGHPKVRMVSLTGSIATGEHIISHTASSI -------1111-2222------3333-------1111------------------3333- KRTHMELGGKAPVIVFDDADIEAVVEGVRTFGYYNAGQDCTAACRIYAQKGIYDTLVEKL ---------------1111---------------iiii1111------3333-------- GAAVATLKSGAPDDESTELGPLSSLAHLERVGKAVEEAKATGHIKVITGGEKRKGNGYYY ---1111---1111------------------------3333------------------ APTLLAGALQDDAIVQKEVFGPVVSVTPFDNEEQVVNWANDSQYGLASSVWTKDVGRAHR --------1111------------------------------------------------ VSARLQYGCTWVNTHFMLVSEMPHGGQKLSGYGKDMSLYGLEDYTVVRHVMVKH ------------------3333----!!!!----------3333---------- >E2 GLYCOPROTEIN; SWP:P59594; PDB:1WNCA; QKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQINASVVNIQEEIDR 11113333-----3333------------------------------------------- LNEVAKNLN -----3333 >LATEXIN; SWP:P70202; PDB:1WNHA; TMEIPPTHYAASRAASVAENCINYQQGTPHKLFLVQTVQQASKEDIPGRGHKYHLKFSVE ----1111-------------------------------------2222----------- EIIQKQVTVNCTAEVLYPQMGQGSAPEVNFTFEGEIGKNPDEEDNTFYQSLMSLKRPLEA ------------------------------------------------------------ QDIPDNFGNVSPQMKPVQHLAWVACGYVMWQNSTEDTWYKMLKIQTVKQVQRNDDFIELD 
----1111--3333---------------11111111----------------------- YTILLHDIASQEIIPWQMQVLWHPQYGTKVKHNSRLPK -------------------------------------- >TRIMERELYSIN II; SWP:P20165; PDB:1WNIA; FPQRYIELAIVVDHGMYKKYNQNSDKIKVRVHQMVNHINEMYRPLNIAISLNRLQIWSKK -----------------1111--------------------3333--------------- DLITVKSASNVTLESFGNWRETVLLKQQNNDCAHLLTATNLNDNTIGLAYKKGMCNPKLS -------3333-----------3333---------------%%%%----2222------- VGLVQDYSPNVFMVAVTMTHELGHNLGMEHDDKDKCKCEACIMSDVISDKPSKLFSDCSK ---------3333------------------3333-----1111---------------- NDYQTFLTKYNPQCILNA -----------3333--- >ISOLEUCYL-TRNA SYNTHETASE; SWP:P56690; PDB:1WNYA; DPSVYVRFPLKEPKKLGLEKASLLIWTTTPWTLPGNVAAAVHPEYTYAAFQVGDEALILE -----------3333-------------33331111-----1111------!!!!----- EGLGRKLLGEGTPVLKTFPGKALEGLPYTPPYPQALEKGYFVVLADYVSQEDGTGIVHQA --------1111------33332222------------------1111-----------1 PAFGAEDLETARVYGLPLLKTVDEEGKLLVEPFKGLYFREANRAILRDLRGRGLLFKEES 111-------------------1111-----------------------1111------- >METHYLGLYOXAL SYNTHASE; SWP:Q5SHD6; PDB:1WO8A; MKALALIAHDAKKDEMVAFCLRHKDVLARYPLLATGTTGARIQEATGLAVERVLSGPLGG --------3333-----------3333---------------------------3333-- DLQIGARVAEGKVLAVVFLQDPLTAKPHEPDVQALMRVCNVHGVPLATNLVAAEALIAWI -------1111---------1111-1111------------------------------- RKGTPQ 1111-- ----------------------------------- >PRIMOSOMAL REPLICATION PR; SWP:P07013; PDB:1WOCA; TNRLVLSGTVCRAPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQPVIVSGHENQAI ------------------1111---------------%%%%-----------------11 THSITVGSRITVQGFISCHKAKNGLSKVLHAEQIELI 11--2222------------3333------------- >AGMATINASE; SWP:Q9RZ04; PDB:1WOHA; GPAHLPYGGIPTFARAPLVQPDGDWQADVAALGVPFDIALGFRPGARFAPRALREASLRS ----1111---2222----1111-------------1111----3333--------1111 VPPFTGLDGKTRLQGVTFADAGDVILPSLEPQLAHDRITEAARQVRGRCRVPVFLGGDHS -----1111---2222-------------------------------------------- VSYPLLRAFADVPDLHVVQLDAHLDFTDTRNDTKWSNSSPFRRACEALPNLVHITTVGLR --------1111-----------------%%%%--1111--------1111--------- 
GLRFDPEAVAAARARGHTIIPMDDVTADLAGVLAQLPRGQNVYFSVDVDGFDPAVIPGTS ------------1111----3333--------1111----------1111---------- SPEPDGLTYAQGMKILAAAAANNTVVGLDLVELAPNLDPTGRSELLMARLVMETLCEVFD ---------------------------------33331111----------------111 HVL 1-- >2',3'-CYCLIC-NUCLEOTIDE 3; SWP:P09543; PDB:1WOJA; LPLYFGWFLTKKSSETLRKAGQVFLEELGNHKAFKKELRQFVPEKMDLVTYFGKRPPGVL ------------------------------------1111------3333---------- HCTTKFCDYGKAPGAEEYAQQDVLKKSYSKAFTLTISALFVTPKTTGARVELSEQQLQLW -------iiii2222-----------2222----------------------33331111 PSDVDKLSPTDNLPRGSRAHITLGCAADVEAVQTGLDLLEILRQEKGGSRGEEVGELSRG -------1111--2222--------1111---------------1111--------2222 KLYSLGNGRWMLTLAKNMEVRAIFTGYYG -----iiii-------------------- >122AA LONG CONSERVED HYPO; SWP:Q974G3; PDB:1WOLA; MKRVEDWIKQAERDLEEARYAKSGGYYELACFLSQQCAEKAVKGLLQFQGIEKRGHSISH --3333---------------1111-------------------------------3333 LLTNPPADILQCATFLDKQYTPSRYPDVYYEGAPYEYYTERDADECINCAIRILNWVKGQ ---------------1111-------------3333---------------------111 IK 1- >SIGMA FACTOR SIGB REGULAT; SWP:O07015; PDB:1WOMA; GHMTSILSRNHVKVKGSGKASIMFAPGFGCDQSVWNAVAPAFEEDHRVILFDYVGSGHSD --------------------------2222--1111------------------------ LRAYDLNRYQTLDGYAQDVLDVCEALDLKETVFVGHSVGALIGMLASIRRPELFSHLVMV ----3333-----------------------------------------3333------- GPSPCYLNDPPEYYGGFEEEQLLGLLEMMEKNYIGWATVFAATVLNQPDRPEIKEELESR -----------------3333-------------------------1111---------- FCSTDPVIARQFAKAAFFSDHREDLSKVTVPSLILQCADDIIAPATVGKYMHQHLPYSSL 1111----------------33331111---------------3333------------- KQMEARGHCPHMSHPDETIQLIGDYLKAHV --------3333-------------1111- >INORGANIC POLYPHOSPHATE/A; SWP:Q7WT42; PDB:1WOQA; NAPLIGIDIGGTGIKGGIVDLKKGKLLGERFRVPTPQPATPESVAEAVALVVAELSARPE ---------1111--------------------------------------------111 APAAGSPVGVTFPGIIQHGVVHSAANVDKSWLNTDIDALLTARLGRPVEVINDADAAGLA 1-1111----------iiii-------3333----------------------------- EARYGAGAGVKGTVLVITLGTGIGSAFIFDGKLVPNAELGHLEIDGHDAETKASAVARER 
----1111--------------------iiii-----------iiii3333-------11 DGLSWDEYSVLLQRYFSHVEFLFSPELFIVGGGISKRADEYLPNLRLRTPIVPAVLRNEA 11-----------------------------1111-33333333------------1111 GIVGAAIEIALQH ------------- >AMINOMETHYLTRANSFERASE; SWP:Q9WY54; PDB:1WOSA; MKRTPLFEKHVELGAKMVDFAGWEMPLYYTSIFEEVMAVRKSVGMFDVSHMGEFLVKGPE ---1111------------iiii------------------------1111------111 AVSFIDFLITNDFSSLPDGKAIYSVMCNENGGIIDDLVVYKVSPDEALMVVNAANIEKDF 1-----------11112222-------1111-----------1111-----3333----- NWIKSHSKNFDVEVSNISDTTALIAFQGPKAQETLQELVEDGLEEIAYYSFRKSIVAGVE ------1111------3333-------111133331111--3333-2222-----iiii- TLVSRTGYTGEDGFELMLEAKNAPKVWDALMNLLRKIDGRPAGLGARDVCRLEATYLLYG ------------------3333------------1111-------------------222 QDMDENTNPFEVGLSWVVKLNKDFVGKEALLKAKEKVERKLVALELSGKRIARKGYEVLK 2--11113333--3333-1111-2222-----3333----------------2222---i NGERVGEITSGNFSPTLGKSIALALVSKSVKIGDQLGVVFPGGKLVEALVVKKPFYRGSV iii-----------1111--------33332222-----2222----------------- R - >PUTATIVE MINIMAL NUCLEOTI; SWP:NA; PDB:1WOTA; HMDLETLRARREAVLSLCARHGAVRVRVFGSVARGEAREDSDLDLLVAFEEGRTLLDHAR --3333-----------------------3333----------------22223333--- LKLALEGLLGVRVDIVSERGLAPRLREQVLREAIPL ---------------------3333----------- >THIOREDOXIN -RELATED PROT; SWP:Q9BRA2; PDB:1WOUA; YEEVSVSGFEEFHRAVEQHNGKTIFAYFTGSKDAGGKSWCPDCVQAEPVVREGLKHISEG ------------------1111----------1111---3333----------1111222 CVFIYCQVGEKPYWKDPNNDFRKNLKVTAVPTLLKYGTPQKLVESECLQANLVEMLFSE 2--------------1111---------------2222----!!!!------------- >HEME OXYGENASE 2; SWP:P74133; PDB:1WOVA; TNLAQKLRYGTQQSHTLAENTAYMKCFLKGIVEREPFRQLLANLYYLYSALEAALRQHRD --------------------------1111---------------------------111 NEIISAIYFPELNRTDKLAEDLTYYYGPNWQQIIQPTPCAKIYVDRLKTIAASEPELLIA 1-------3333--------------1111------3333-------------------- HCYTRYLGDLSGGQSLKNIIRSALQLPEGEGTAMYEFDSLPTPGDRRQFKEIYRDVLNSL ---------------------1111-2222-3333-1111----------------3333 PLDEATINRIVEEANYAFSLNREVMHDLEDLIKAAIGEHTFDLLTRQDRPGSTEGHPITL 
--------------------------------------------------1111------ MVGE ---- >177aa long conserved hypo; SWP:Q970Z7; PDB:1WOZA; GKDSPLVNFLGDLDELNSFIGFAISKIPWEDKKDLERVQVELFEIGEDLSTQSSKKKIDE -----------------------1111-----------------------%%%%------ KYVKWLEERTVEYRKESGPVKLFVIPGGSEEASVLHVTRSVARRVERNAVKYTKELPEIN --------------------------------------------------3333-1111- RIIVYLNRLSSLLFAALVANKRRNVSEKIYDIGKFW --------------------1111------------ >TOPOISOMERASE IV; SWP:NA; PDB:1WP5A; VASEDVIVTVTKDGYVKRTSLRSYAASNGQDFAKDTDRLLALENTKDVLLLFTNKGNYLY ----------1111------------%%%%---1111------1111-----1111---- CPVHELPDIRWKDLGQHIANIIPIDRDEEIIKAIPINDFELNGYFLFVTRNGVKKTELKH -3333----1111---3333----1111---------1111-------1111----3333 YKAQRYSKPLTGINLKNDDQVVDVHLTDGNELFLVTHNGYALWFDESEVSIVGVRAAGVK -----------------------------------1111-----3333------------ GNLKEGDYIVSGQLITSKDESIVVATQRGAVKKKLTEFEKATRAKRGVVILRELKANPHR ---2222---------1111-----1111----1111----------------------- ISGFVVAQDSDTIYLQTEKSFIETIKVGDIRFSDRYSNGSFVLDEEENGRVISVWKVEAE -------1111-----1111-----3333--------------3333------------- DKTEKLAAALEHHHH --------------- >FUSION; SWP:Q9IH63; PDB:1WP8A; NINKLKSSIESTNEAVVKLQETAEKTVYVLTALQDISSQISSMNQSLQQSKDYIKEAQKI ------------------------------------------------------------ LDTV 3333 >ATP-DEPENDENT RNA HELICAS; SWP:Q8TZH8; PDB:1WP9A; MVLRRDLIQPRIYQEVIYAKCKETNCLIVLPTGLGKTLIAMMIAEYRLTKYGGKVLMLAP ---3333-----------------------2222-------------------------- TKPLVLQHAESFRRLFNLPPEKIVALTGEKSPEERSKAWARAKVIVATPQTIENDLLAGR ------------------1111--------3333---------------3333------- ISLEDVSLIVFDEAHRAVGNYAYVFIAREYKRQAKNPLVIGLTASPGSTPEKIMEVINNL ------------------------------------------------------------ GIEHIEYRSENSPDVRPYVKGIRFEWVRVDLPEIYKEVRKLLREMLRDALKPLAETGLLE --------1111--3333-----------------------------------1111--- SSSPDIPKKEVLRAGQIINEEMAKGNHDLRGLLLYHAMALKLHHAIELLETQGLSALRAY --11113333-----------1111----------------------------------- 
IKKLYEEAKAGSTKASKEIFSDKRMKKAISLLVQAKEIGLDHPKMDKLKEIIREQLQRKQ -----------------3333--------------1111-------------------11 NSKIIVFTNYRETAKKIVNELVKDGIKAKRFVGQASKQREQKLILDEFARGEFNVLVATS 11--------------------------------------------------------33 VGEEGLDVPEVDLVVFYEPVPSAIRSIQRRGRTGRHMPGRVIILMAKGTRDEAYYWSSR 33--1111---------------------3333-------------------------- >HYPOTHETICAL PROTEIN YFBU; SWP:NA; PDB:1WPBA; QESTMEMTNAQRLILSNQYKMMTMLDPANAERYRRLQTIIERGYGLQMRELDREFGELKE --3333-------------------3333----------1111------3333------- ETCRTIIDIMEMYHALHVSWSNLQDQQSIDERRVTFLGFDAATEARYLGYVRFMVNVEGR -----------------------------3333------3333----------------- YTHFDAGTHGFNAQTPMWEKYQRMLNVWHACPRQYHLSANEINQIINA 1111----%%%%--------------3333--------------1111 >MTX-HSTX1; SWP:P80719; PDB:1WPDA; VSCTGSKDCYAPCRKQTGCPYGKCMNRKCKCNRC ------------3333------------------ >Sarcoplasmic/endoplasmic ; SWP:P04191; PDB:1WPGA; MEAAHSKSTEECLAYFGVSETTGLTPDQVKRHLEKYGHNELPAEEGKSLWELVIEQFEDL --3333------------1111-------------------------1111--------- LVRILLLAACISFVLAWFEEGEETITAFVEPFVILLILIANAIVGVWQERNAENAIEALK -------------3333----------------------------3333----------1 EYEPEMGKVYRADRKSVQRIKARDIVPGDIVEVAVGDKVPADIRILSIKSTTLRVDQSIL 111-----------------3333-2222----2222-------------------3333 TGESVSVIKHTEPVPDPRAVNQDKKNMLFSGTNIAAGKALGIVATTGVSTEIGKIRDQMA ---------------11113333-----2222-------------!!!!----------- ATEQDKTPLQQKLDEFGEQLSKVISLICVAVWLINIGHFNDPVHGGSWIRGAIYYFKIAV ----------------------------------------------3333---1111--- ALAVAAIPEGLPAVITTCLALGTRRMAKKNAIVRSLPSVETLGCTSVICSDKTGTLTTNQ -------1111-----------------------11113333---------2222----- MSVCKMFIIDKVDGDFCSLNEFSITGSTYAPEGEVLKNDKPIRSGQFDGLVELATICALC ------------!!!!--------------------%%%%--33333333---------- NDSSLDFNETKGVYEKVGEATETALTTLVEKMNVFNTEVRNLSKVERANACNSVIRQLMK --------1111------3333----------1111--33333333----3333------ KEFTLEFSRDRKSMSVYCSPAKSSRAAVGNKMFVKGAPEGVIDRCNYVRVGTTRVPMTGP 
-------3333----------33333333------------1111--------------- VKEKILSVIKEWGTGRDTLRCLALATRDTPPKREEMVLDDSSRFMEYETDLTFVGVVGML ----------1111-----------------1111-111111113333------------ DPPRKEVMGSIQLCRDAGIRVIMITGDNKGTAIAICRRIGIFGENEEVADRAYTGREFDD ---1111-------1111-----------------------------1111--------- LPLAEQREACRRACCFARVEPSHKSKIVEYLQSYDEITAMTGDGVNDAPALKKAEIGIAM -------------------3333--------1111--------1111------------1 GSGTAVAKTASEMVLADDNFSTIVAAVEEGRAIYNNMKQFIRYLISSNVGEVVCIFLTAA 111----1111---11113333-------------------------------------- LGLPEALIPVQLLWVNLVTDGLPATALGFNPPDLDIMDRPPRSPKEPLISGWLFFRYMAI -------3333-------------3333------1111----1111----3333------ GGYVGAATVGAAAWWFMYAEDGPGVTYHQLTHFMQCTEDHPHFEGLDCEIFEAPEPMTMA -----------------------------1111-----3333------------------ LSVLVTIEMCNALNSLSENQSLMRMPPWVNIWLLGSICLSMSLHFLILYVDPLPMIFKLK ------------1111---------1111---------------1111------1111-- ALDLTQWLMVLKISLPVIGLDEILKFIARNYLEG --3333---------------------------- >HYPOTHETICAL UPF0207 PROT; SWP:YFBR_ECOLI; PDB:1WPHA; KQSHFFAHLSRLKLINRWPLRNVRTENVSEHSLQVAVAHALAAIKNRKFGGNVNAERIAL ----------3333---------------------------------------------- LAYHDASEVLTGDLPTPQEYKAIEKIAQQKLVDVPEELRDIFAPLIDEHAYSDEEKSLVK --11113333-------3333-------------3333---3333--------------- QADALCAYLKCLEELAAGNNEFLLAKTRLEATLEARRSQEDYFEIFVPSFH --------------11113333-----------1111---------3333- >Hypothetical 15.6 kDa pro; SWP:P36141; PDB:1WPIA; MSFWKTLQRQPRTISLFTNDIASNIKSQKCLQLLKGDVSHRFDVEIANRFPTWDQLQYMR -----------------------3333-------------------------------33 TSCPQGPVSLQRQIPKLDSVLKYKHTDPTFGMDLQKCVQRGLWNPKEALWVDWENKLVGN 33111133333333------------------33333333-------------------- EPADIDKYIIQRK 33333333----- >MANGANESE-DEPENDENT INORG; SWP:P37487; PDB:1WPNA; EKILIFGHQNPDTDTICSAIAYADLKNKLGFNAEPVRLGQVNGETQYALDYFKQESPRLV --------------------------1111------------------------------ ETAANEVNGVILVDHNERQQSIKDIEEVQVLEVIDHHRIANFETAEPLYYRAEPVGCTAT 
--3333----------1111---1111-----------------------------3333 ILNKMYKENNVKIEKEIAGLMLSAIISDSLLFKSPTCTDQDVAAAKELAEIAGVDAEEYG ------1111-------------------%%%%1111-----------------3333-- LNMLKAG ------- >HUMAN CYTOMEGALOVIRUS PRO; SWP:P16753; PDB:1WPOA; QAVAPVYVGGFLARYDQSLLPRDVVEHWLHAVALPLNINHDDTAVVGHVAAQSVRDGLFC --------------------3333-------------iiii------------1111--- LGCVTSPRFLEIVRRASEKSELVSRGPVSPLQPDKVVEFLSGSYAGLSLSSTPFKHVALC -----3333------1111-3333------------------------------------ SVGRRRGTLAVYGRDPEWVQRFPDLTAADRDGLRAQWQRGDPFRSDSYGLLGNSVDAYIR --------------3333---3333--------3333----------------------- ERLPKLRYDKQLVGVTERESYVKA -------------3333------- >HUT OPERON POSITIVE REGUL; SWP:P10943; PDB:1WPUA; TLHKERRIGRLSVLLLLNEAEESTQVEELERDGWKVCLGKVGSMDAHKVIAAIETASKKS --1111--------1111--------------------------3333------------ GVIQSEGYRESHALYHATMEALHGVTRGEMLLGSLLRTVGLRFAVLRGNPYESEAEGDWI --------------------3333-------3333------------------------- AVSLYGTIGAPIKGLEHETFGVGINHI --------------------------- >3-ISOPROPYLMALATE DEHYDRO; SWP:P50455; PDB:1WPWA; GFTVALIQGDGIGPEIVSKSKRILAKINELYSLPIEYIEVEAGDRALARYGEALPKDSLK ---------!!!!---------------1111---------------------------- IIDKADIILKGPVGESAADVVVKLRQIYDMYANIRPAKSIPGIDTKYGNVDILIVRENTE -1111--------1111----1111----------------------------------! 
DLYKGFEHIVSDGVAVGMKIITRFASERIAKVGLNFALRRRKKVTCVHKANVMRITDGLF !!!-------2222----------------------1111--------3333-------- AEACRSVLKGKVEYSEMYVDAAAANLVRNPQMFDVIVTENVYGDILSDEASQIAGSLGIA ---33332222------------3333-1111-------3333------------1111- PSANIGDKKALFEPVHGAAFDIAGKNIGNPTAFLLSVSMMYERMYELSNDDRYIKASRAL -----1111---------3333-------------------------------------- ENAIYLVYKERKALTPDVGGNATTDDLINEIYNKLG --------------3333------------------ >Carboxypeptidase Y inhibi; SWP:P14306; PDB:1WPXB; MNQAIDFAQASIDSYKKHGILEDVIHDTSFQPSGILAVEYSSSAPVAMGNTLPTEKARSK 1111-3333-----------------1111----------1111--------3333---- PQFQFTFNKQMQNAYVPQDDDLFTLVMTDPDAPSKTDHKWSEFCHLVECDLKLLNTEFFA -----------------3333--------------------------------------- SEFNTKGSNTLIEYMGPAPPKGSGPHRYVFLLYKQPKGVDSSKFSKIKDRPNWGYGTPAT -------------------2222------------22223333--------%%%%----- GVGKWAKENNLQLVASNFFYAETK ------------------------ >TYROSYL-TRNA SYNTHETASE; SWP:P00951; PDB:1WQ3A; MASSNLIKQLQERGLVAQVTDEEALAERLAQGPIALVCGFDPTADSLHLGHLVPLLCLKR ----------1111------------3333-----------------3333--------- FQQAGHKPVALVGGATGLIGDPSFKAAERKLNTEETVQEWVDKIRKQVAPFLDFDCGENS --------------1111-----------------------------3333-----1111 AIAANNYDWFGNMNVLTFLRDIGKHFSVNQMINKEAVKQRLNREDQGISFTEFSYNLLQG -----3333-------------1111----------3333--------3333-------- YDFACLNKQYGVVLCIGGSDQWGNITSGIDLTRRLHQNQVFGLTVPLITKADGTKFGKTE -----------------11113333------------------------1111-222211 GGAVWLDPKKTSPYKFYQFWINTADADVYRFLKFFTFMSIEEINALEEEDKNSGKAPRAQ 11----1111-------------3333--------------------------------- YVLAEQVTRLVHGEEGLQAAKR ------------------1111 --------------------------------------------------------- >VASCULAR ENDOTHELIAL GROW; SWP:P67863; PDB:1WQ8A; VRPFLEVHERSACQARETLVPILQEYPDEISDIFRPSCVAVLRCSGCCTDESLKCTPVGK --------------------3333-1111--------------------1111------- HTVDIQIMRVNPRTQSSKMEVMKFTEHTACECRPRRKQG --------------------------------------- >VASCULAR ENDOTHELIAL GROW; SWP:P67861; PDB:1WQ9A; 
EVRPFLDVYQRSACQTRETLVSILQEHPDEISDIFRPSCVAVLRCSGCCTDESMKCTPVG ---------------------3333-1111--------------------1111------ KHTADIQIMRMNPRTHSSKMEVMKFMEHTACECRPA ------------------------------------ >PHOSPHO-SUGAR MUTASE; SWP:O58651; PDB:1WQAA; MGKLFGTFGVRGIANEKITPEFAMKIGMAFGTLLKREGRKKPLVVVGRDTRVSGEMLKEA ------------2222----------------------------------1111------ LISGLLSVGCDVIDVGIAPTPAVQWATKHFNADGGAVITASHNPPEYNGIKLLEPNGMGL ------------------3333-----------------!!!!3333------1111--- KKEREAIVEELFFKEDFDRAKWYEIGEVRREDIIKPYIEAIKSKVDVEAIKKRKPFVVVD --------------------1111-----------------1111--------------- TSNGAGSLTLPYLLRELGCKVITVNAQPDGYFPARNPEPNEENLKEFMEIVKALGADFGV %%%%----------3333----------1111-------3333----------------- AQDGDADRAVFIDENGRFIQGDKTFALVADAVLKEKGGGLLVTTVATSNLLDDIAKKHGA --1111------1111---3333--------------------11113333----1111- KVMRTKVGDLIVARALYENNGTIGGEENGGVIFPEHVLGRDGAMTVAKVVEIFAKSGKKF -------2222--------------1111---1111----------------------33 SELIDELPKYYQIKTKRHVEGDRHAIVNKVAEMARERGYTVDTTDGAKIIFEDGWVLVRA 33-1111---------------------------1111------------3333------ SGTEPIIRIFSEAKSKEKAQEYLNLGIELLEKALS ------------------------------1111- -------------------------------- >RIBOSOME RECYCLING FACTOR; SWP:Q10794; PDB:1WQGA; IDEALFDAEEKMEKAVAVARDDLSTIRTGRANPGMFSRITIDYYGAATPITQLASINVPE ----------------------1111-----33331111---iiii--3333-------- ARLVVIKPYEANQLRAIETAIRNSDLGVNPTNDGALIRVAVPQLTEERRRELVKQAKHKG ---------1111----------------------------------------------- EEAKVSVRNIRRKAMEELHRIRKEGEAGEDEVGRAEKDLDKTTHQYVTQIDELVKHKEGE -----------------------------------------------------------1 LLEV 111- >Insulin-like growth facto; SWP:P22692; PDB:1WQJB; AIHCPPCSEEKLARCRPPVGCEELVREPGCGCCATCALGLGMPCGVYTPRCGSGLRCYPP -----------1111------------!!!!-------2222--1111---2222----2 RGVEKPLHTLMHGQGVCMEL 222----------------- >Insulin-like growth facto; SWP:P05019; PDB:1WQJI; PETLCGAELVDALQFVCGDRGFYFNKPTGYGSSSRRAPQTGIVDECCFRSCDLRRLEMYC 
----------------!!!!------------1111--------------------1111 AP -- >TOXIN APETX1; SWP:P61541; PDB:1WQKA; GTTCYCGKTIGIYWFGTKTCPSNRGYTGSCGYFLGICCYPVD -------------------------------2222------- >ETHYLBENZENE DIOXYGENASE ; SWP:Q51743; PDB:1WQLA; NWSDEEIKALVDEEKGLLDPRIFSDQDLYEIELERVFARSWLLLGHEGHIPKAGDYLTTY --------------------1111-----------1111------3333--2222----- MGEDPVIVVRQKDRSIKVFLNQCRHRGMRIERSDFGNAKSFTCTYHGWAYDTAGNLVNVP ----------1111------------------------------------1111------ YEKEAFCDCGFDKADWGPLQARVDTYKGLIFANWDTEAPDLKTYLSDATPYMDVMLDRTE 3333-------3333-------------------1111-3333-!!!!3333------11 AVTQVITGMQKTVIPCNWKFAAEQFCSDMYHAGTMAHLSGVLSSLPPEMDLSQVKLPSSG 11--------------3333--------3333-------------11113333------- NQFRAKWGGHGTGWFNDDFALLQAIMGPKVVDYWTKGPAAERAKERLGKVLPADRMVAQH ----------------------------------------------3333-1111----- MTIFPTCSFLPGINTVRTWHPRGPNEIEVWSFIVVDADAPEDIKEEYRRKNIFTFNQGGT ----------------------1111---------11113333------------2222- YEQDDGENWVEVQRGLRGYKARSRPLCAQMGAGVPNKNNPEFPGKTSYVYSEEAARGFYH ------------------3333------2222------3333------------------ HWSRMMSEPSWDTLKS ---------3333--- >ETHYLBENZENE DIOXYGENASE ; SWP:NA; PDB:1WQLB; DLTKPIEWPEMPVSLELQNAVEQFYYREAQLLDYQNYEAWLALLTQDIQYWMPIRTTHTS 1111---------------------------1111-----11111111----------33 RNKAMEYVPPGGNAHFDETYESMRARIRARVSGLNWTEDPPSRSRHIVSNVIVRETESAG 331111--2222----------------------3333-------------------222 TLEVSSAFLCYRNRLERMTDIYVGERRDILLRVSDGLGFKIAKRTILLDQSTITANNLSQ 2---------------------------------!!!!---------------------- FF -- >3C-LIKE PROTEASE; SWP:Q9DU47; PDB:1WQSA; APPTLWSRVVRFGSGWGFWVSPTVFITTTHVIPTGVREFFGEPIESIAIHRAGEFTQFRF -33331111-----------1111---1111-------%%%%1111-----!!!!----- SRKVRPDLTGMVLEEGCPEGVVCSILIKRDSGELLPLAVRMGAIASMKIQGRLVHGQSGM ----3333----------------------------------------2222-------- LLTGANAKGMDLGTLPGDCGAPYVYKRNNDWVVCGVHAAATKSGNTVVCAVQA --------------2222----------------------1111--------- >PROTO-ONCOGENE TYROSINE-P; SWP:P07332; PDB:1WQUA; 
GSSGSSGEVQKPLHEQLWYHGAIPRAEVAELLVHSGDFLVRESQGKQEYVLSVLWDGLPR ------3333-33331111--------1111--2222-----------------iiii-- HFIIQSLDNLYRLEGEGFPSIPLLIDHLLSTQQPLTKKSGVVLHRAVPSGPSSG -----------------------------------3333--------------- >HYPOTHETICAL PROTEIN PH17; SWP:O59452; PDB:1WR2A; MKEEAVRVIEEVLKQGRTAMVEYEAKQVLKAYGLPVPEEKLAKTLDEALEYAKEIGYPVV ------------1111-------------1111--------------------------- LKLMSPQILHKSDAKVVMLNIKNEEELKKKWEEIHENAKKYRPDAEILGVLVAPMLKPGR ----1111-3333----------------------------1111--------------- EVIIGVTEDPQFGHAIMFGLGGIFVEILKDVTFRLVPITEKDARKMIQEIKAYPILAGAE -----------------------3333----------------------11113333--- EPADIDAIVDMLLKVSKLVDDLKDYIKEMDLNPVFVYNKGEGAVIVDSRIILKPK ---------------------1111------------2222-------------- >UBIQUITIN-PROTEIN LIGASE ; SWP:NA; PDB:1WR3A; GSPPLPPGWEEKVDNLGRTYYVNHNNRSTQWHRPSL -------------3333------------------- >UBIQUITIN-PROTEIN LIGASE ; SWP:Q8CFI0; PDB:1WR4A; GSPGLPSGWEERKDAKGRTYYVNHNNRTTTWTRPIM -----------------------1111--------- >ADP-RIBOSYLATION FACTOR B; SWP:Q9NZ52; PDB:1WR6A; VTKRLHTLEEVNNNVRLLSELLHYSQEDSSDGDRELKELFDQCENKRRTLFKLASETEDN ------------------------3333---3333------------------1111--- DNSLGDILQASDNLSRVINSYKTIIEGQ ---------------------------- >NEDD4-2; SWP:Q8CFI0; PDB:1WR7A; GSPGIQSFLPPGWEMRIAPNGRPFFIDHNTKTTTWEDPRLK 3333-------------3333---------------1111- >PHOSPHOGLYCOLATE PHOSPHAT; SWP:O50129; PDB:1WR8A; KIIIIDGTIYPNRMIHEKALEAIRRAESLGIPMVGTV ----2222-1111------------------------ >DJVLGB; SWP:O97032; PDB:1WRBA; KYDSIPVSVTGPDYSATNVIENFDELKLDPTIRNNILLASYQRPTPIQKNAIPAILEHRD ----------------------3333---3333---------------------1111-- IMACAQTGSGKTAAFLIPIINHLVCQDLKTAYPKCLILAPTRELAIQILSESQKFSLNTP -----------------------------------------------------1111--- LRSCVVYGGADTHSQIREVQMGCHLLVATPGRLVDFIEKNKISLEFCKYIVLDEADRMLD ------------------------------------1111-------------------- MGFEPQIRKIIEESNMPSGINRQTLMFSATFPKEIQKLAADFLYNYIFMTVG -----------------!!!!------------------------------- >TARGET OF 
MYB PROTEIN 1; SWP:O60784; PDB:1WRDA; GPLGSEQIGKLRSELEMVSGNVRVMSEMLTELVPTQAEPADLELLQELNRTCRAMQQRVL 3333----------------------------1111------------------------ ELIPQIANEQLTEELLIVNDNLNNVFLRHERFERFRTG -3333--------------------------------- >FERREDOXIN II; SWP:P00237; PDB:1WRIA; AYKVTLKTPDGDITFDVEPGERLIDIGSEKADLPLSCQAGACSTCLGKIVSGTVDQSEGS -------1111------222233333333-------------1111---------3333- FLDDEQIEQGYVLTCIAIPESDVVIETHKEDEL ------------3333------------1111- >Methylated-DNA--protein-c; SWP:Q973C7; PDB:1WRJA; MIVYGLYKSPFGPITVAKNEKGFVMLDFCDCAERSSLDNDYFTDFFYKLDLYFEGKKVDL --------1111------3333----------1111-3333----------1111----- TEPVDFKPFNEFRIRVFKEVMRIKWGEVRTYKQVADAVKTSPRAVGTALSKNNVLLIIPC -----3333----------11112222---------------------1111------33 HRVIGEKSLGGYSRGVELKRKLLELEGIDV 33---------1111--------1111--- >DUAL SPECIFICITY PHOSPHAT; SWP:NA; PDB:1WRMA; MGNGMNKILPGLYIGNFKDARDAEQLSKNKVTHILSVHDSARPMLEGVKYLCIPAADSPS --------2222---3333------------------1111----------------111 QNLTRHFKESIKFIHECRLRGESCLVHCLAGVSRSVTLVIAYIMTVTDFGWEDALHTVRA 1-3333-----------1111----------------------1111------------- GRSCANPNVGFQRQLQEFEKHEVHQYRQWLKEEY -1111----------------------------- >43 KDA TAIL PROTEIN; SWP:P08558; PDB:1WRUA; NTVTLRADGRLFTGWTSVSVTRSIESVAGYFELGVNVPPGTDLSGLAPGKKFTLEIGGQI ------iiii------------3333-----------2222-33332222---------- VCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGGQWKNRTLEQIARDLCAPYG -------------3333---------------------------------------1111 VTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSRAASTAT ---------3333----------------------------------------------- DELVLGENLLTLDFEEDFRDRFSEYTVKSRKGTATDSDVTRYRPMIIIADSKITAKDAQA ---2222----------1111--------------3333--------------3333--- RALREQRRRLAKSITFEAEIDGWTRKDGQLWMPNLLVTIDASKYAIKTTELLVSKVTLIL ------------------------1111------------3333---------------- NDQDGLKTRVSLAPREGFLVPVESD ---------------1111------ >BRANCHED-CHAIN AMINO ACID; SWP:Q5SM19; PDB:1WRVA; 
QIKAGLIWMNGAFVPQEEAKTSVLSHALHYGTSVFEGIRAYETAKGPAIFRLKEHVKRFY --------iiii--3333---1111-----------------1111-------------- NSAKVLRMEIPFAPEELEEAIKEVVRRNGYRSCYIRPLAWMGAKALGVNPLPNNPAEVMV ---1111-----3333---------1111--------------------3333------- AAWEWVRKGARLITSSWARFPANVMPGKAKVGGNYVNSALAKMEAVAAGADEALLLDEEG -----3333-----------1111-1111-3333-----------1111-------1111 YVAEGSGENLFFVRDGVIYALEHSVNLEGITRDSVIRIAKDLGYEVQVVRATRDQLYMAD -------------iiii-----!!!!-------------1111--------11111111- EVFMTGTAAEVTPVSMIDWRPIGKGTAGPVALRLREVYLEAVTGRRPEYEGWLTYVN ----------------%%%%-!!!!---------------1111-33331111---- >PEPTIDE DEFORMYLASE 1; SWP:Q819U0; PDB:1WS0A; MAVLEIIKHPNEVLETPCERVINFDKKLVKLLKDMHETMLIADGVGLAAPQVGVSLQVAV ----------3333-------------------------1111----3333--------- VDVDDDTGKIELINPSILEKRGEQVGPEGCLSFPGLYGEVERADYIKVRAQNRRGKVFLL ---3333----------------------1111------------------1111----- EAEGFLARAIQHEIDHLHGVLFTSKVTRYYE ---------------1111---1111----- >METHYLTRANSFERASE; SWP:Q5SJT0; PDB:1WS6A; VVRILGGKARGVALKVPASARPSPVRLRKALFDYLRLRYPRRGRFLDPFAGSGAVGLEAA ------1111-------------3333-----------1111-----------------1 SEGWEAVLVEKDPEAVRLLKENVRRTGLGARVVALPVEVFLPEAKAQGERFTVAFAPPYA 111--------------------------------3333--------------------- DLAALFGELLASGLVEAGGLYVLQHPKDLYLPLGERRVYGENALTLVEV ---------------2222------1111---------!!!!------- >MAVICYANIN; SWP:P80728; PDB:1WS8A; MATVHKVGDSTGWTTLVPYDYAKWASSNKFHVGDSLLFNYNNKFHNVLQVDQEQFKSCNS -----2222---------------1111--2222-------------------------- SSPAASYTSGADSIPLKRPGTFYFLCGIPGHCQLGQKVEIKVDP ---------------------------22221111--------- >ASPARAGINE AMIDOHYDROLASE; SWP:P50286; PDB:1WSAA; KPQVTILATGGTIAGYSAGAVTVDKLLAAVPAINDLATIKGEQISSIGSQEMTGKVWLKL -----------------------------3333--------------1111--------- AKRVNELLAQKETEAVIITHGTDTMEETAFFLNLTVKSQKPVVLVGAMRPGSSMSADGPM ---------1111--------1111------------------------1111------- NLYNAVNVAINKASTNKGVVIVMNDEIHAAREATKLNTTAVNAFASPNTGKIGTVYYGKV 
-------11111111-------%%%%--3333-------------3333------iiii- EYFTQSVRPHTLASEFDISKIEELPRVDILYAHPDDTDVLVNAALQAGAKGIIHAGMGNG ---------!!!!------------------------3333---1111------------ NPFPLTQNALEKAAKSGVVVARSSRVGSGSTTQEAEVDDKKLGFVATESLNPQKARVLLM --------------------------------------3333--------3333------ LALTKTSDREAIQKIFSTY --------------1111- >HYPOTHETICAL PROTEIN ST02; SWP:Q976G0; PDB:1WSCA; QEQLVAVNELNENLGKVLIKIARDSIANKLGILKINLEDYLSSLNDPILNKKGLAFVTLE -----1111-------------------------------3333-3333----------- TYYGNSTSLRGCIGYVEAVAPLKEIVSKAAIAAAFSDPRFPPLSKGEFDNIIIEVTVLTK ------------------------------------1111---33331111--------- PQEIDVENRWELPKKIKVGEDGLIVEYGILYSGLLLPQVPEYCWDEETFLAETCIKAGLE -------11113333-2222-------3333----3333---------------1111-- PDCWLNNKVKIKKFQGIIFREEKPKSEKILIIKPSEVKCKKEEI -33331111-------------2222------1111--3333-- >RIBONUCLEASE HI; SWP:P00647; PDB:1WSIA; LKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMALMAAIVALEALKE ------------------------------------------------------1111-- HCEVILSTDSQYVRQGITQWIHNWKARGWKTADKKPVKNVDLWQRLDAALGQHQIKWEWV --------------------------------------3333-------1111------- KGAGHPENERCNELARAAAMNPTLEDTGYQVE -----------------1111----1111--- >AXIN 1 PROTEIN; SWP:Q6IS36; PDB:1WSPA; CDSIVVAYYFCGEPIPYRTLVRGRAVTLGQFKELLTKKGSYRYYFKKVSDEFDCGVVFEE ---------iiii-----------------1111--------------1111-------- VREDEAILPVFEEKIIGKVEKVD --1111----iiii--------- >AMINOMETHYLTRANSFERASE; SWP:P48728; PDB:1WSRA; VLRRTPLYDFHLAHGGKMVAFAGWSLPVQYRDSHTDSHLHTRQHCSLFDVSHMLQTKILG ----1111---1111-----iiii--------------3333-------1111------1 SDRVKLMESLVVGDIAELRPNQGTLSLFTNEAGGILDDLIVTNTSEGHLYVVSNAGCWEK 111----1111---11112222-------1111--------------------1111--- DLALMQDKVRELQNQGRDVGLEVLDNALLALQGPTAAQVLQAGVADDLRKLPFMTSAVME ------------1111----------------1111----1111--3333-2222----- VFGVSGCRVTRCGYTGEDGVEISVPVAGAVHLATAILKNPEVKLAGLAARDSLRLEAGLC iiii--------------------3333----------3333------------------ 
LYGNDIDEHTTPVEGSLSWTLGKRRRAAMDFPGAKVIVPQLKGRVQRRRVGLMCEGAPMR 2222--1111------3333----------2222-----1111----------------2 AHSPILNMEGTKIGTVTSGCPSPSLKKNVAMGYVPCEYSRPGTMLLVEVRRKQQMAVVSK 222---1111------------1111--------1111-2222-----%%%%-------- MPFVPTNYYTL ----------- >MULTIPLE SUBSTRATE AMINOT; SWP:Q9V2W5; PDB:1WSTA; INFDSFFSEKAMLMKASEVRELLKLVETSDVISLAGGLPAPETFPVETIKKIAVEVLEEH -3333--3333-------------1111-----------3333----------------- ADKALQYGTTKGFTPLRLALARWMEKRYDIPMSKVEIMTVAGSQQALDLIGRVFLNPGDP 3333----3333-------------------------------------------2222- IVVEAPTYLAAIQAFKYYDPEFISIPLDDKGMRVDLLEEKLEELRKQGKRVKIVYTVSTF -------3333---3333---------1111-------------1111------------ QNPAGVTMSVDRRKKLLELANEYDFLIVEDGPYSELRYSGEPTPPIKHFDDYGRVIYLGT ------------------------------1111----------3333------------ FSKILAPGFRIGWVAAHPHLIRKMEIAKQSIDLCTNTFGQAIAWKYVENGYLDEHIPKII -----3333-------3333-------1111---------------11113333------ EFYKPRRDAMLEALEEYMPEGVEWTKPEGGMFVRVTLPEGIDTKLMMERAVAKGVAYVPG ------------------2222---------------2222------------------- EAFFVHRDKKNTMRLNFTYVPEETIREGVRRLAETIKEEMKRV ---1111-------------------------------3333- >MYELOID CELL LEUKEMIA SEQ; SWP:P97287; PDB:1WSXA; GPLGSEDDLYRQSLEIISRYLREQATGSKDSKPLGEAGAAGRRALETLRRVGDGVQRNHE -------3333------------3333--------------------------------- TAFQGMLRKLDIKNEGDVKSFSRVMVHVFKDGVTNWGRIVTLISFGAFVAKHLKSVNQES ------3333---1111--------3333-----3333---------------1111333 FIEPLAETITDVLVRTKRDWLVKQRGWDGFVEFFHVQDLEGG 3------------3333-------------3333-------- >ALDEHYDE DEHYDROGENASE, M; SWP:Q62760; PDB:1WT4A; GPLGSDLKDAEAVQKFFLEEIQLGEELLAQGDYEKGVDHLTNAIAVCGQPQQLLQVLQQT -22223333------------------1111------------1111------------- LPPPVFQLLTKL -3333--3333- >ANTI EGFR ANTIBODY FV REG; SWP:NA; PDB:1WT5A; QVQLVQSGAEVKKPGASVKVSCKASGYTFTSYWMHWVRQAPGQGLEWMGNIYPGSGGTNY ------------2222-----------1111---------------------1111---- AEKFKNRVTMTRDTSISTAYMELSRLRSDDTAVYYCARSGGPYFFDYWGQGTLVT 3333----------------------3333------------------------- 
>ANTI EGFR ANTIBODY FV REG; SWP:NA; PDB:1WT5C; DIVMTQSPLSLPVTPGEPASISCRSSQNIVHNNGITYLEWYLQKPGQSPQLLIYKVSDRF ------------------------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDVGVYYCFQGSHIPPTFGQGTKVEI 2223333----------------1111------------------------ ------------------------------------------------------------ ----- >BUTX-MTX; SWP:P59936; PDB:1WT7A; WCSTCLDLACTGSKDCYAPCRKQTGCPNAKCINKSCKCYGC -----------3333----------------%%%%------ >AGKISACUTACIN A CHAIN; SWP:Q9DEF9; PDB:1WT9A; DCSSGWSSYEGHCYKVFKQSKTWTDAESFCTKQVNGGHLVSIESSGEADFVGQLIAQKIK --2222--iiii-----------------11112222-----------------3333-- SAKIHVWIGLRAQNKEKQCSIEWSDGSSISYENWIEEESKKCLGVHIETGFHKWENFYCE ----------------------1111--------3333-------3333--------111 QQDPFVCEA 1-------- >Anticoagulant protein-B [; SWP:Q9DEF8; PDB:1WT9B; DCPSDWSSYEGHCYKPFNEPKNWADAENFCTQQHTGSHLVSFQSTEEADFVVKLAFQTFD --1111--iiii-----------------11112222----------------------- YGIFWMGLSKIWNQCNWQWSNAAMLKYTDWAEESYCVYFKSTNNKWRSITCRMIANFVCE -----------1111---1111---------------------------3333------- FQA --- >5'-METHYLTHIOADENOSINE PH; SWP:Q9YAQ8; PDB:1WTAA; EITRPPGVRAHVGVIGGSGLYDPGIVENPVEVKVSTPYGNPSDFIVVGDVAGVKVAFLPR ----------------3333-1111----------1111----------iiii-----11 HGRGHRIPPHAINYRANIWALKALGVKWVISVSAVGSLREDYRPGDFVVPDQFIDMTKNR 11-----3333----------1111-------------11112222-------------- RHYTFYDGPVTVHVSMADPFCEDLRQRLIDSGRRLGYTVHERGTYVCIEGPRFSTRAESR ------------------------------------------------------------ VWKDVFKADIIGMTLVPEINLACEAQLCYATLAMVTDYDVWADRPVTAEEVERVMISNVE ----------------------1111-----------!!!!------------------- RARRMLYDVIPKLAGEPELERCSCCRALDTAAI --------3333-----111111113333---- >HETEROGENEOUS NUCLEAR RIB; SWP:Q14103; PDB:1WTBA; VKKIFVGGLSPDTPEEKIREYFGGFGEVESIELPMDNKTNKRRGFCFITFKEEEPVKKIM ---------------------3333---------------------------------33 EKKYHNVGLSKCEIKVAMS 33----------------- >ECOO109IR; SWP:Q9RPJ3; PDB:1WTEA; MNKQEVILKVQECAAWWILERQSKLTKLMSETMSINPFMTPFIFDYHSLNDFDELVEAII 
------------------------1111-1111--3333-----1111------------ AKHLMTGHDTGFGKLIDEKILPRVFGAYKLDKSYRAANEPFIHPCFDEIDHVIQRDDGRI ---------------------------------------33331111-------1111-- ELLSLKAGKWTIQLTMAVQLNKAFHEIINNYPGVADNIVVGVFYGNSHGLTDKYRILRGI -------1111-------------------1111-----------3333-3333-1111- NTGANHNVIDIRDKVHVYAGKEFWSWLNNGEAETQHWVLEGIERAVKEADIKEKNKDLIE ----------3333-------------iiii-3333--------------3333------ KFKEHVAKKYNEQVLNADGTAQWHKLLEMINE ---------------1111------------- >TAIL-ASSOCIATED LYSOZYME; SWP:P16009; PDB:1WTHA; NNLNWFVGVVEDRMDPLKLGRVRVRVVGLHPPQRAQGDVMGIPTEKLPWMSVIQPITSAA --------------1111------------------------3333--------1111-- MSGIGGSVTGPVEGTRVYGHFLDKWKTNGIVLGTYGGIVREKPNRLEGFSDPTGQYPRRL iiii-------2222-------1111-----------------1111---3333------ GNDTNVLNQGGEVGYDSSSNVIQDSNLDTAINPDDRPLSEIPTDDNPNMSMAEMLRRDEG ----3333--------------1111----------3333---------3333------- LRLKVYWDTEGYPTIGIGHLIMKQPVRDMAQINKVLSKQVGREITGNPGSITMEEATTLF -------1111----------------3333----------------------------- ERDLADMQRDIKSHSKVGPVWQAVNRSRQMALENMAFQMGVGGVAKFNTMLTAMLAGDWE -----------------------------------------------------1111333 KAYKAGRDSLWYQQTKGRASRVTMIILTGNLESYGVEVKTPARSLLAMAATVAKSSDPAD 3---3333-3333-----------------3333----------22223333----1111 PPIPNDSRILFKEPVSSYKGEYPYVHTMETESGHIQEFDDTPGQERYRLVHPTGTYEEVS -----------------------------1111-------2222------3333-----1 PSGRRTRKTVDNLYDITNADGNFLVAGDKKTNVGGSEIYYNMDNRLHQIDGSNTIFVRGD 111--------------------------------------------------------- ETKTVEGNGTILVKGNVTIIVEGNADITVKGDATTLVEGNQTNTVNGNLSWKVAGTVDWD ------------------------------------------------------------ VGGDWTEKMASMSSISSGQYTIDGSRIDIGS ------------------------------- >Baseplate structural prot; SWP:P17172; PDB:1WTHD; LQRPGYPNLSVKLFDSYDAWSNNRFVELAATITTLTMRDSLYGRNEGMLQFYDSKNIHTK ---------------------------------------1111------------3333- MDGNEIIQISVANANDINNVKTRIYGCKHFSVSIIAIELGTIHSIENLKFGRPFFPDAGE -----------------------------------------3333--------------- 
SIKEMLGVIYQDRTLLTPAINAINAYVPDIPWTSTFENYLSYVREVALAVGSDKFVFVWQ ---------11113333------------------3333--------------------- DIMGVNMMDYDMMINQEPYPMIVGEPSQELKYPLAYDFVWLTKSNPHKRDPMKNATIYAH 1111----------------------------------------3333-3333------- SFLDSSIPMITTGKGENSIVVSRSGAYSEMTYRNGYEEAIRLQTMAQYDGYAKCSTIGNF ------------------------1111------3333-----3333------------- NLTPGVKIIFNDSKNQFKTEFYVDEVIHELSNNNSVTHLYMFTNATKLETIDPVKVKNEF ------------------------------------------------------------ K - >UREIDOGLYCOLATE DEHYDROGE; SWP:Q4U331; PDB:1WTJA; TQTVSYPQLIDLLRRIFVVHGTSPEVADVLAENCASAQRDGSHSHGIFRIPGYLSSLASG -----------------1111----------------111133333333-------1111 WVDGKAVPVVEDVGAAFVRVDACNGFAQPALAAARSLLIDKARSAGVAILAIRGSHHFAA --1111---------------%%%%----------------------------------- LWPDVEPFAEQGLVALSMVNSMTCVVPHGARQPLFGTNPIAFGAPRAGGEPIVFDLATSA 3333----1111--------------2222---------------1111----------- IAHGDVQIAAREGRLLPAGMGVDRDGLPTQEPRAILDGGALLPFGGHKGSALSMMVELLA -3333-----------------1111----3333----------!!!!------------ AGLTGGNFSFEFDWSKHPGAQTPWTGQLLIVIDPDKGAGQHFAQRSEELVRQLHGVGQER -------1111--1111---------------1111----3333---------1111--- LPGDRRYLERARSMAHGIVIAQADLERLQELA 2222---------------------------- >BENCE-JONES PROTEIN MCG (; SWP:P80362; PDB:1WTLA; DIQMTQSPSSLSASVGDRVTITCRASQDITNYVNWFQQRPGQAPKVLIYGASILETGVPS --------------------------------------2222------------222211 RFSGSGSGTDFTFTISSLQPEDIATYYCQQYDTLPLTFGGGTKVDIKR 11----------------1111-------------------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SM95; PDB:1WTYA; ASLARAVERLKAALERPKDEFIRDSAIQRFEFTFELAWKTLKTFLELQGLEARSPRAAIR ------------1111-----------------------------1111----------- GAFQVGLLPEDPFWLELELRNLTNHTYDEALAERIYAELPKALERFQELLRRLEE ------------------------------------------------------- >MOLYBDOPTERIN BIOSYNTHESI; SWP:O59354; PDB:1WU2A; KLVPYREALKLLLDDINEIEDTEKVPLREAVGRVLAEDIVTEFDIPPFDRAAVDGYAIRA ------------3333---------33332222-------------------------33 
EDTFQAREYNPIELTVIEEVPAGNVAKEEVTTGKAIKVLTGTRIPKGANAVIQEVKREGD 33----1111----------2222------2222----2222---------------!!! KIYVLRPVAPGQNIAFTGEDVKKGEVVLRKGTILRPQDVALKALGIKKVPVKVKPKVGII !-------2222---2222--2222---2222--3333---1111--------------- ITGSELIEEPSEEGFKEGKIVETNSILQGLVEKFFGEPILYGVLPDDESIIKETLEKAKN --3333--------1111----3333--------------------3333---------- ECDIVLITDYAHKFVNLLFHGTTIKPGRPFGYGEKVFISGYPVSVFAQFNLFVKHALAKV ----------3333------------1111--%%%%------------------------ GAQNYEVKVKAILQDDIPSQLGRYEFIKIYYENGIARVIKKKGSGILSSLLASNAYLEIP -------------------2222--------%%%%------------3333--------1 EDSEGYRRGEEVWITLY 111-------------- >Interferon beta [Precurso; SWP:P01575; PDB:1WU3I; INYKQLQLQERTNIRKCQELLEQLNGKINLTYRADFKIPMEMTEKMQKSYTAFAIQEMLQ --------------------------------------3333------------------ NVFLVFRNNFSSTGWNETIVVRLLDELHQQTVFLKTVLEEKQEERLTWEMSSTALHLKSY ---1111--3333------------------------3333------------------- YWRVQRYLKLMKYNSYAWMVVRAEIFRNFLIIRRLTRNFQN ----------%%%%--------------------3333--- >XYLANASE Y; SWP:Q9KB30; PDB:1WU4A; EGAFYTREYRNLFKEFGYSEAEIQERVKDTWEQLFGDNPKIYYEVGDDLGYLLDTGNLDV -3333--------1111------------------------------------------- RTEGMSYGMMMAVQMDRKDIFDRIWNWTMKNMYMTEGVHAGYFAWSCQPDGTKNSWGPAP ------------1111--------------------1111-------1111--------- DGEEYFALALFFASHRWGDGDEQPFNYSEQARKLLHTCVHNGEGGPGHPMWNRDNKLIKF ---------------------------------------2222----------------- IPEVEFSDPSYHLPHFYELFSLWANEEDRVFWKEAAEASREYLKIACHPETGLAPEYAYY 1111---3333-3333--------3333------------------------------11 DGTPNDEKGYGHFFSDSYRVAANIGLDAEWFGGSEWSAEEINKIQAFFADKEPEDYRRYK 11-----------3333------------------------------11113333----1 IDGEPFEEKSLHPVGLIATNAMGSLASVDGPYAKANVDLFWNTPVRTGNRRYYDNCLYLF 111-----------------------1111---------1111----------------- AMLALSGNFKIWFP ---1111------- >HISTIDYL-TRNA SYNTHETASE; SWP:Q9HLX5; PDB:1WU7A; RLQIEKIRGFRDFYPEDDVEKFIFKTAEEAAEAFGFRRIDFPSLEYLDLYRIKSGEELLQ ------2222---3333--------------1111---------------11113333-- 
QTYSFVDKGGREVTLIPEATPSTVRVTSRKDLQRPLRWYSFPKVWRYEEPQAGRYREHYQ ------------------------------------------------------------ FNADIFGSDSPEADAEVIALASSILDRLGLQDIYEIRINSRKIEEIIGGTSSDPFSVFSI -------------------------1111----------3333---------3333---- IDRYHKISREEFVDQLRSAGIGEDGVSIADLCSGTRGIDEARITGKSSEEIARAAVEDLL 1111-----------1111-----------3333-------1111--3333--------- ASYGVKNVRYDFSIVRGLSYYTGIVFEAYDRSGQFRAILGGGRYDNLASLSGESVPAVGF 1111------11112222-----------1111----------1111------------- GGDAVISLLLKRENVQIPREKKSVYICRVGKINSSINEYSRKLRERGNVTVEIERGLSAQ --------------------------------3333-------1111------------- LKYASAIGADFAVIFGERDLERGVVTIRNYTGSQENVGLDSVVEHLISQAT ---------------3333------------------3333---------- >HYPOTHETICAL PROTEIN PH04; SWP:O58212; PDB:1WU8A; ITLTTDFGLKGPYVGEKVALRINPNAKIVDVTHSVTRHSILEGSFVEQVVKYSPKGTVHV -----------3333--------------------2222----------11112222--- GVIDPGVGTERRAIVIEGDQYLVVPDNGLATLPLKHIKVKSVYEIIPDKIRKFTGWEISS ---1111--------------------11113333--------------3333------- TFHGRDIFGPAGALIEKGIHPEEFGREIPVDSIVKLNVEPRKEGDVWILKVIYIDDFGNV -3333---------1111-3333-----3333----------------------1111-- ILNLENYEKPRTVELLDFNLRLPYLETYGLVEKGELALPGSHDYLEIAVNGSAAERLNVK --------------3333--------3333-2222-------------------1111-2 VGDELRVRLL 222------- >Microtubule-associated pr; SWP:Q15691; PDB:1WU9A; DEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYAT --------------------------------------1111----------------- >CONSERVED HYPOTHETICAL PR; SWP:P83815; PDB:1WUBA; MKWNLDPSHTSIDFKVRHMGIASVRGSLKVLSGSVETDEAGRPIQVEAVIDAASIATGEP -----1111----------------------------1111---------1111------ QRDGHLRSADFLHAEQYPEIRFVSTQIEPLGGNRYRIQGNLTIRDITKPVTLEAEVSAPI -------1111-3333--------------!!!!--------!!!!-------------- KDPWGMQRVAASASGQINRKDWNLTWNQVLELGALLVGEEVKFNLEVEAVAPAPVA -------------------1111--------------------------------- >BOUGANIN; SWP:Q8W4U4; PDB:1WUCA; YNTVSFNLGEAYEYPTFIQDLRNELAKGTPVCQLPVTLQTIADDKRFVLVDITTTSKKTV 
------11111111----------------%%%%-------1111--------1111--- KVAIDVTDVYVVGYQDKWDGKDRAVFLDKVPTVATSKLFPGVTNRVTLTFDGSYQKLVNA -----------------iiii---------3333----2222------------------ AKVDRKDLELGVYKLEFSIEAIHGKTINGQEIAKFFLIVIQMVSEAARFKYIETEVVDRG ---3333--------------2222----------------------------------- LYGSFKPNFKVLNLENNWGDISDAIHKSSPQCTTINPALQLISPSNDPWVVNKVSQISPD -------------------------1111-------------1111------33333333 MGILKFKS -------- >ATP-DEPENDENT DNA HELICAS; SWP:P15043; PDB:1WUDA; NYDRKLFAKLRKLRKSIADESNVPPYVVFNDATLIEAEQPITASELSVNGVGRKLERFGK -----------------------3333--------------3333--2222-3333--33 PFALIRAHVDGD 33---------- >mandelate racemase/mucona; SWP:NA; PDB:1WUEA; GSHMNIQSIETYQVRLPLKTPFVTSYGRLEEKAFDLFVITDEQGNQGFGELVAFEQPDYV -----------------------1111-------------1111---------------- QETLVTERFIIQQHLIPLLLTEAIEQPQEVSTIFEEVKGHWMGKAALETAIWDLYAKRQQ -----------------1111------------3333-------------------1111 KSLTEFFGPTRRKIPVGISLGIQEDLPQLLKQVQLAVEKGYQRVKLKIRPGYDVEPVALI ------------------------------------1111--------1111-------- RQHFPNLPLMVDANSAYTLADLPQLQRLDHYQLAMIEQPFAADDFLDHAQLQRELKTRIC ---1111-----%%%%-3333----3333-----------1111-------1111----- LDENIRSLKDCQVALALGSCRSINLKIPRVGGIHEALKIAAFCQENDLLVWLGGMFESGV -3333--------------------3333------------------------------- GRALNLQFASQPTFSFPGDISATERYFYEDIITEPFILEQGTMTVPQGLGIGVTLSQTNL ----------3333-------3333-------------iiii------!!!!-------- LKYSQYQKIM ---------- >HYPOTHETICAL PROTEIN LIN2; SWP:NA; PDB:1WUFA; HMYFQKARLIHAELPLLAPFKTSYGELKSKDFYIIELINEEGIHGYGELEAFPLPDYTEE ------------------------------------------------------------ TLSSAILIIKEQLLPLLAQRKIRKPEEIQELFSWIQGNEMAKAAVELAVWDAFAKMEKRS 3333--------3333-------1111----3333--3333-------------1111-- LAKMIGATKESIKVGVSIGLQQNVETLLQLVNQYVDQGYERVKLKIAPNKDIQFVEAVRK ----------------------------------1111------------3333------ SFPKLSLMADANSAYNREDFLLLKELDQYDLEMIEQPFGTKDFVDHAWLQKQLKTRICLD -1111----------3333------1111-------------------3333-------1 
ENIRSVKDVEQAHSIGSCRAINLKLARVGGMSSALKIAEYCALNEILVWCGGMLEAGVGR 111--------------------3333--------------------------------- AHNIALAARNEFVFPGDISASNRFFAEDIVTPAFELNQGRLKVPTNEGIGVTLDLKVLKK --------3333-------3333-----------------------!!!!-----3333- YTKSTEEILLN ----------- >Periplasmic [NiFe] hydrog; SWP:P21852; PDB:1WUIL; SSYSGPIVVDPVTRIEGHLRIEVEVENGKVKNAYSSSTLFRGLEIILKGRDPRDAQHFTQ -------------------------iiii------------3333-22223333------ RTCGVTYTHALASTRCVDNAVGVHIPKNATYIRNLVLGAQYLHDHIVHFYHLHALDFVDV -------------------------3333-----------------------3333--33 TAALKADPAKAAKVASSISPRKTTAADLKAVQDKLKTFVETGQLGPFTNAYFLGGHPAYY 331111--------3333--------------------3333-!!!!--1111--1111- LDPETNLIATAHYLEALRLQVKAARAMAVFGAKNPHTQFTVVGGVTCYDALTPQRIAEFE -------------------------3333-----------2222--3333---------- ALWKETKAFVDEVYIPDLLVVAAAYKDWTQYGGTDNFITFGEFPKDEYDLNSRFFKPGVV ---------------------------------------------1111----------- FKRDFKNIKPFDKMQIEEHVRHSWYEGAEARHPWKGQTQPKYTDLHGDDRYSWMKAPRYM %%%%-------1111----1111--------1111--------2222-----------ii GEPMETGPLAQVLIAYSQGHPKVKAVTDAVLAKLGVGPEALFSTLGRTAARGIETAVIAE ii-------------1111----------------------------------------- YVGVMLQEYKDNIAKGDNVICAPWEMPKQAEGVGFVNAPRGGLSHWIRIEDGKIGNFQLV ------------1111---------------------1111--------iiii------- VPSTWTLGPRCDKNKLSPVEASLIGTPVADAKRPVEILRTVHSFDPIACGVH 3333------1111------3333-----33333333---3333--3333-- >Periplasmic [NiFe] hydrog; SWP:P21853; PDB:1WUIS; LMGPRRPSVVYLHNAECTGCSESVLRAFEPYIDTLILDTLSLDYHETIMAAAGDAAEAAL -----------------------1111-----------------3333------------ EQAVNSPHGFIAVVEGGIPTAANGIYGKVANHTMLDICSRILPKAQAVIAYGTCATFGGV -----1111-----------%%%%----%%%%--------3333--------------33 QAAKPNPTGAKGVNDALKHLGVKAINIAGCPPNPYNLVGTIVYYLKNKAAPELDSLNRPT 33--------------3333---------------------------------1111-33 MFFGQTVHEQCPRLPHFDAGEFAPSFESEEARKGWCLYELGCKGPVTMNNCPKIKFNQTN 33---3333-1111--1111----1111--1111--3333--3333---3333--%%%%- WPVDAGHPCIGCSEPDFWDAMTPFYQN 3333------1111-3333---1111- 
>GTP CYCLOHYDROLASE I; SWP:Q5SH52; PDB:1WURA; EVDLERLQALAAEWLQVIGEDPGREGLLKTPERVAKAWAFLTRGYRQRLEEVVGGAVFPA --------------------11111111--------------3333------%%%%---- EGSEMVVVKGVEFYSMCEHHLLPFFGKVHIGYIPDGKILGLSKFARIVDMFARRLQVQER --------------------------------------------------------3333 LAVQIAEAIQEVLEPQGVGVVVEGVHLCMMMRGVEKQHSRTVTSAMLGVFRENQKTREEF -------------------------3333--!!!!------------3333--------- LSHLR 3333- >GALACTOKINASE; SWP:P51570; PDB:1WUUA; HHAALRQPQVAELLAEARRAFREEFGAEPELAVSAPGRVNLIGEHTDYNQGLVLPALELT -1111------------------------------------------------------- VLVGSPRKDGLVSLLTTSEGADEPQRLQFPLPTAQRSLEPGTPRWANYVKGVIQYYPAAP ------3333-------3333---------------------3333-------------- LPGFSAVVVSSVPLGGGLSSSASLEVATYTFLQQLCPDSGTIAARAQVCQQAEHSFAGPC -------------------------------3333------------------------- GIDQFISLGQKGHALLIDCRSLETSLVPLSDPKLAVLITNSNVRHASSEYPVRRRQCEEV ------------------------------1111-------------------------- ARALGKESLREVQLEELEAARDLVSKEGFRRARHVVGEIRRTAQAAAALRRGDYRAFGRL --------1111-3333--1111--3333------------------------------- VESHRSLRDDYEVSCPELDQLVEAALAVPGVYGSRTGGGFGGCTVTLLEASAAPHARHIQ ------------------------------------------------33333333---1 EHYGGTATFYLSQAADGAKVLCL 111-------------------- >BETA-HORDOTHIONIN; SWP:P21742; PDB:1WUWA; KSCCRSTLGRNCYNLCRVRGAQKLCANACRCKLTSGLKCPSSFPK ---------------------------------------3333-- >PCDHA4 PROTEIN; SWP:O88689; PDB:1WUZA; GNSQIHYSIPEEAKHGTFVGRIAQDLGLELTELVPRLFRVASKDRGDLLEVNLQNGILFV -------------2222-----------11113333------------------------ NSRIDREELCGRSAECSIHLEVIVDRPLQVFHVEVEVRDINDN ----3333-------------------------------3333 >THIAZOLE BIOSYNTHESIS PRO; SWP:Q9I6B4; PDB:1WV2A; TPFVIAGRTYGSRLLVGTGKYKDLDETRRAIEASGAEIVTVAVRRTNIPPDRYTILPNTA -----------------------------------------1111--------------- GCYDAVEAVRTCRLARELLDGHNLVKLEVLADQKTLFPNVVETLKAAEQLVKDGFDVMVY ------------------------------------------------------------ TSDDPIIARQLAEIGCIAVMPLAGLIGSGLGICNPYNLRIILEEAKVPVLVDAGVGTASD 
------------------------2222-------------------------------- AAIAMELGCEAVLMNTAIAHAKDPVMMAEAMKHAIVAGRLAYLAGRMPRK ----3333------3333----3333------------------------ >similar to DNA segregatio; SWP:Q8NYF3; PDB:1WV3A; HKLIIKYNKQLKLNLRDGKTYTISEDERADITLKSLGEVIHLEQNNQGTWQANHTSINKV ------%%%%-----2222------1111---------------1111---%%%%----- LVRKGDLDDITLQLYTEADYASFAYPSIQDTTIGPNAYDDVIQSLNAIIIKDFQSIQESQ ----1111-------3333----------------1111--3333------3333----- YVRIVHDKNTDVYINYELQEQLTNKAYIGDHIYVEGIWLEVQADGLNVLSQNTVASSLIR ------1111---%%%%---------2222---iiii----1111--------------- L - >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SJJ5; PDB:1WV8A; RTLKVQALWDGEAGVWVAESDDVPGLATEAATLEELLAKLAVVPELLEENGVALELPVEL ----------1111---------------------------------------------- RLEATRPLVF ---------- >RHODANESE HOMOLOG TT1651; SWP:Q5SKN0; PDB:1WV9A; RKVRPEELPALLEEGVLVVDVRPARSTPLPFAAEWVPLEKIQKGEHGLPRRPLLLVCEKG ---33333333-------------------------3333-------------------- LLSQVAALYLEAEGYEASLEGGLQAL ----------3333----2222---- >4-cresol dehydrogenase [h; SWP:P09787; PDB:1WVEC; SQWGSGKNLYDKVCGHCHKPEVGVGPVLEGRGLPEAYIKDIVRNGFRAMPAFPASYVDDE ------------------3333-----------------------!!!!---3333---- SLTQVAEYLSSLPAP --------------- >4-cresol dehydrogenase [h; SWP:P09788; PDB:1WVFA; AVLPKGVTQGEFNKAVQKFRALLGDDNVLVESDQLVPYNKIMMPVENAAHAPSAAVTATT ---2222----------------1111---33333333-------3333----------- VEQVQGVVKICNEHKIPIWTISTGRNFGYGSAAPVQRGQVILDLKKMNKIIKIDPEMCYA -------------------------2222!!!!--2222----1111------------- LVEPGVTFGQMYDYIQENNLPVMLSFSAPSAIAGPVGNTMDRGVGYTPYGEHFMMQCGME --1111-----------------------1111-----1111----11113333------ VVLANGDVYRTGMGGVPGSNTWQIFKWGYGPTLDGMFTQANYGICTKMGFWLMPKPPVFK --1111----!!!!-2222-1111--------3333------------------------ PFEVIFEDEADIVEIVDALRPLRMSNTIPNSVVIASTLWEAGSAHLTRAQYTTEPGHTPD -------3333------------------------------1111-3333---------- SVIKQMQKDTGMGAWNLYAALYGTQEQVDVNWKIVTDVFKKLGKGRIVTQEEAGDTQPFK ------------------------------------------------3333!!!!---- 
YRAQLMSGVPNLQEFGLYNWRGGGGSMWFAPVSEARGSECKKQAAMAKRVLHKYGLDYVA ----1111---1111------------------------------------1111----- EFIVAPRDMHHVIDVLYDRTNPEETKRADACFNELLDEFEKEGYAVYRVNTRFQDRVAQS ----1111---------1111------------------1111------3333---3333 YGPVKRKLEHAIKRAVDPNNILAPGRSGIDLNNDF ----------------1111--2222---1111-- >CDP-GLUCOSE 4,6-DEHYDRATA; SWP:P26397; PDB:1WVGA; SIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLME --33332222-----1111----------------------------------3333--- SHIGDIRDFEKLRSSIAEFKPEIVFHMAAQPLVRLSYEQPIKTYSTNVMGTVHLLETVKQ ----1111------------------------3333------------------------ VGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPAN -----------1111----------1111-----3333------------------3333 YEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLS ----------------------2222------------------1111-----3333--- GYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDPHEAHYLK ------------3333--------3333-------------------------------- LDCSKANMQLGWHPRWGLTETLSRIVKWHKAWIRGEDMLICSKREISDYMSA ---------------------------------------------------- >TENSIN; SWP:Q04205; PDB:1WVHA; AACNVFSESLTGPQAISKAVAETLVADPTPTATIVHFKVSAQGITLTDNQRKLFFRRHYP --------!!------------1111-------------1111----------------3 LNVCL 33--- >putative phosphatases inv; SWP:Q8DTD6; PDB:1WVIA; TYKGYLIDLDGTIYKGKDRIPAGEDFVKRLQERQLPYILVTNNTTRTPEMVQEMLATSFN --------2222--!!!!-3333------------------------------------- IKTPLETIYTATLATIDYMNDMKRGKTAYVIGETGLKKAVAEAGYREDSENPAYVVVGLD ---1111----------------------------------------------------- TNLTYEKLTLATLAIQKGAVFIGTNPDLNIPTERGLLPGAGAILFLLEKATRVKPIIIKP -------------------------------1111------------------------- AVIMNKALDRLGVKRHEAIMVGDNYLTDITAGIKNDIATLLVTTGFTKPEEVPALPIQPD -------------1111------3333-----1111-----------33331111----- FVLSSLAEWDF ----3333--- >AT2G23090/F21P24.15; SWP:NA; PDB:1WVKA; GHHHHHHLEGGGNAQKSAMARAKNLEKAKAAGKGSQLEANKKAMSIQCKVCMQTFICTTS -------------1111---1111-----------------------3333--------- 
EVKCREHAEAKHPKADVVACFPHLKK -------1111----1111--1111- >POLY(RC)-BINDING PROTEIN ; SWP:Q15365; PDB:1WVNA; PLGSQTTHELTIPNNLIGCIIGRQGANINEIRQMSGAQIKIANPVEGSSGRQVTITGSAA -1111-------3333-33332222-------------------2222------------ SISLAQYLINARLS ----------1111 >SIALIC ACID SYNTHASE; SWP:Q9NR45; PDB:1WVOA; GSSGSSGSVVAKVKIPEGTILTMDMLTVKVGEPKGYPPEDIFNLVGKKVLVTVEEDDTIM ---------------2222--3333--------------3333----------2222--1 EELVDNHGKKIKSSGPSSG 111---------------- >HYPOTHETICAL PROTEIN PAE2; SWP:Q8ZVF7; PDB:1WVQA; SIKFELIDVPIPQGTNVIIGQAHFIKTVEDLYEALVTSVPGVKFGIAFCEASGKRLVRHE -----------2222-----------------------1111---------!!!!----- ANDEELRNLAIDLCKKIAAGVFVIYIRNAWPINVLNAIKNVPEVVRIFAATANPLKVIVA -----------------------------3333-------3333---------------- EVEPERRGVVGVVDGHSPLGVETEKDREERKKFLREVVKYKL --2222------------------------------------ >TRIFLIN; SWP:Q8JI39; PDB:1WVRA; NVDFDSESPRKPEIQNEIIDLHNSLRRSVNPTASNMLKMEWYPEAAANAERWAYRCIESH -3333--1111-------------1111-----------------------3333----- SSRDSRVIGGIKCGENIYMATYPAKWTDIIHAWHGEYKDFKYGVGAVPSDAVIGHYTQIV -3333--iiii------------------------------------1111-33333333 WYKSYRAGCAAAYCPSSKYSYFYVCQYCPAGNIIGKTATPYKSGPPCGDCPSDCDNGLCT 1111---------1111------------------3333-----2222-1111-iiii-- NPCTRENEFTNCDSLVQKSSCQDNYMKSKCPASCFCQNKII ------------------------------3333------- >HYPOTHETICAL PROTEIN ST21; SWP:Q96YJ2; PDB:1WVTA; KDSEIVKALGDLDELNSVLGVVSSLYPELSEVIQKLQNDIFSISSEIAGFDNFSDEKVKG -------------------------1111----------------1111----------- IEELITNYSKELEPLRNFVLPGGHIASSFLHLARAVCRRAERSVVTLLKESKAKEVHAKY --------1111-----------------------------------------3333--- LNRLSSLLFVLALVVNKRTNNPNVIWR --------------------------- >CHITINASE C; SWP:O50152; PDB:1WVVA; GGNNGFVVSEAQFNQMFPNRNAFYTYKGLTDALSAYPAFAKTGSDEVKKREAAAFLANVS --------------------3333-------33331111--------------------- HQTGGLFYIKEVNEANYPHYCDTTQSYGCPAGQAAYYGRGPIQLSWNFNYKAAGDALGIN ------------33333333-3333---1111-------1111----------------3 
LLANPYLVEQDPAVAWKTGLWYWNSQNGPGTMTPHNAIVNNAGFGETIRSINGALECNGG 333-3333---3333-----------!!!!-------1111-3333----------iiii NPAQVQSRINKFTQFTQILGTTTGPNLSC ----------------------------- >TRNASE Z; SWP:Q9WZW8; PDB:1WW1A; MNIIGFSKALFSTWIYYSPERILFDAGEGVSTTLGSKVYAFKYVFLTHGHVDHIAGLWGV -----------------1111--------3333!!!!1111--------11111111--- VNIRNNGMGDREKPLDVFYPEGNRAVEEYTEFIKRANPDLRFSFNVHPLKEGERVFLRNA -------------------2222-------------1111---------2222------i GGFKRYVQPFRTKSEVSFGYHIFEVRRKFVTEEYHKKVLTISGDSLALDPEEIRGTELLI iii---------------------------------------------33332222---- HECTFLDARDNHAAIDEVMESVKAAGVKKVILYHISTRYIRQLKSVIKKYREEMPDVEIL -----------------------------------33333333------1111------- YMDPRKVFEM --1111---- >GALECTIN; SWP:NA; PDB:1WW7A; TTSAVNIYNISAGASVDLAAPVTTGDIVTFFSSALNLSAGAGSPNNTALNLLSENGAYLL ----------2222--------2222----------3333------------1111---- HIAFRLQENVIVFNSRQPNAPWLVEQRVSNVANQFIGSGGKAMVTVFDHGDKYQVVINEK ----------------2222---------3333-3333------------------!!!! TVIQYTKQISGTTSSLSYNSTEGTSIFSTVVEAVTYTGLA ---------------------------------------- >MALATE OXIDOREDUCTASE; SWP:O59029; PDB:1WW8A; IREKALEFHKNNFPGNGKIEVIPKVSLESREELTLAYTPGVAEPCKEIARDPGKVYEYTS 3333--1111------------------3333------3333----------3333---3 KGNLVAVVSDGSRILGLGNIGPLAGLPVEGKALLFKRFGGVDAFPIIKEQEPNKFIDIVK 333-------------------1111---------------------------------- AIAPTFGGINLEDIASPKCFYILERLREELDIPVFHDDQQGTAAVVLAGLLNALKVVGKK --3333----------3333---------------3333---------------1111-1 ISEITLALFGAGAAGFATLRILTEAGVKPENVRVVELVNGKPRILTSDLDLEKLFPYRGW 111-------------------1111-3333------iiii----33333333-2222-- LLKKTNGENIEGGPQEALKDADVLISFTRPGPGVIKPQWIEKNEDAIVFPLANPVPEILP 3333-1111--------2222---------------3333-------------------- EEAKKAGARIVATGRSDYPNQINNLLGFPGIFRGALDVRARTITDSIIAAAKAIASIVEE ---1111-------1111----3333-3333-----------------------3333-- PSEENIIPSPLNPIVYAREARAVAEEAKEGVARTKVKGEWVEEHTIRLIEFYENVIAPIN --------1111------------------------3333---------------3333- KKRREYS ---1111 
>BDNF/NT-3 growth factors ; SWP:Q16620; PDB:1WWBX; VHFAPTITFLESPTSDHHWCIPFTVKGNPKPALQWFYNGAILNESKYICTKIHVTNHTEY ------------------------------------iiii----1111------------ HGCLQLDNPTHMNNGDYTLIAKNEYGKDEKQISAHFMGWPGID ---------3333---------3333----------------- >NT-3 GROWTH FACTOR RECEPT; SWP:Q16288; PDB:1WWCA; TVYYPPRVVSLEEPELRLEHCIEFVVRGNPPPTLHWLHNGQPLRESKIIHVEYYQEGEIS -------------------------------------iiii------------------- EGCLLFNKPTHYNNGNYTLIAKNPLGTANQTINGHFLKEPFPVDE ---------3333---------1111------------------- >NUCLEOPORIN 35; SWP:Q8R4R6; PDB:1WWHA; HLDDTWVTVFGFPQASASYILLQFAQYGNILKHVMSNTGNWMHIRYQSKLQARKALSKDG 3333--------3333------3333-----------------------------1111- RIFGESIMIGVKPCIDKNVME ---------------3333-- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SI95; PDB:1WWIA; LKVAEFERLFRQAAGLDVDKNDLKRVSDFLRNKLYDLLAVAERNAKYNGRDLIFEPDLPI ------------------3333-----------------------1111----3333--- AKGLQETLQEFRRDTALELKPVLDALAALPPLDLEVAEDVRNLLPELAGALVVAYARVLK ------------------------------------------------------------ ELDPALKNPQTEHHERAERVFNLLL --1111---3333------------ >CIRCADIAN CLOCK PROTEIN K; SWP:P74645; PDB:1WWJA; MSPFKKTYVLKLYVAGNTPNSVRALKMLKNILEQEFQGVYALKVIDVLKNPQLAEEDKIL ---------------------------------------------1111----1111--- ATPTLAKILPPPVRKIIGDLSDREKVLIGLDLLYDEIRE 33333333------------------------------- >PHOSPHOGLYCERATE DEHYDROG; SWP:O50095; PDB:1WWKA; MKVLVAAPLHEKAIQVLKDAGLEVIYEEYPDEDRLVELVKDVEAIIVRSKPKVTRRVIES -----------------1111-------------1111---------------------- APKLKVIARAGVGLDNIDVEAAKEKGIEVVNAPAASSRSVAELAVGLMFSVARKIAFADR 1111---------1111-----1111-----3333---------------1111------ KMREGVWAKKEAMGIELEGKTIGIIGFGRIGYQVAKIANALGMNILLYDPYPNEERAKEV -1111--3333-----2222-------------------------------------111 NGKFVDLETLLKESDVVTIHVPLVESTYHLINEERLKLMKKTAILINTSRGPVVDTNALV 1----------------------3333--------33331111-------3333------ KALKEGWIAGAGLDVFEEEPLPKDHPLTKFDNVVLTPHIGASTVEAQERAGVEVAEKVVK ------------------------3333----------1111------------------ ILKG 
---- >MONOCYTE DIFFERENTIATION ; SWP:P10810; PDB:1WWLA; PCELDEESCSCNFSDPKPDWSSAFNCLGAADVELYGGGRSLEYLLKRVDTEADLGQFTDI ----!!!!----------3333-------------iiii-1111----1111-3333--- IKSLSLKRLTVRAARIPSRILFGALRVLGISGLQELTLENLEVTGTAPPPLLEATGPDLN 1111------------3333------3333------------------------------ ILNLRNVSWATRDAWLAELQQWLKPGLKVLSIAQAHSLNFSCEQVRVFPALSTLDLSDNP -----------------3333-------------------3333---1111-------11 ELGERGLISALCPLKFPTLQVLALRNAGMETPSGVCSALAAARVQLQGLDLSHNSLRDAA 11-3333----22221111----------------------------------------- GAPSCDWPSQLNSLNLSFTGLKQVPKGLPAKLSVLDLSYNRLDRNPSPDELPQVGNLSLK -------1111------------------------------------3333-------22 GNPFLDSE 223333-- >HYPOTHETICAL PROTEIN TT20; SWP:Q5SLX4; PDB:1WWMA; EVPGLLEEIKALPLRLDEERFRFWLQQDYPFVEALYRYQVGLLLEAPQAHRAPLVQALAT ---3333----------------------------------3333-3333---------- VEELDWLLLQGASPSAPVHPVRAGYIALLEEGRLPYAYRVVFFYFLNGLFLEAWAHHVPE -------1111------------------------------------------------- EGPWAELSQHWFAPEFQAVLYDLEVLARGLWEDLDPEVVRTYLRRILEAEKATWSLLL ------------1111---------3333-------------------------1111 >EXCITATORY INSECT SELECTI; SWP:O61668; PDB:1WWNA; KKNGYAVDSSGKVSECLLNNYCNNICTKVYYATSGYCCLLSCYCFGLDDDKAVLKIKDAT --------------------------------------------------------3333 KSYCDVQII --------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SKK7; PDB:1WWPA; AEKALATLKELAFLEDPSPVERDAAIQRFEYTFEAFWKALQAYLREKEGLEGASPKGVIR 3333------1111---------------------------------------------- LAREVGLLRDEEARLALGVDDRSLTVHTYNEPLARAIFRRLPDYARLEQVLGRLRR -----------------------1111----------------------------- >TRNA ADENOSINE DEAMINASE ; SWP:O67050; PDB:1WWRA; MGKEYFLKVALREAKRAFEKGEVPVGAIIVKEGEIISKAHNSVEELKDPTAHAEMLAIKE -----------------1111---------iiii--------3333-1111--------- ACRRLNTKYLEGCELYVTLEPCIMCSYALVLSRIEKVIFSALDKKHGGVVSVFNILDEPT ---------2222-----------------------------1111-------3333333 LNHRVKWEYYPLEEASELLSEFFKKLRNNII 3-------------------------1111- >THREONYL-TRNA SYNTHETASE,; SWP:P26639; PDB:1WWTA; 
GSSGSSGDSKPIKVTLPDGKQVDAESWKTTPYQIACGISQGLADNTVIAKVNNVVWDLDR -----------------------------3333-------3333------%%%%--1111 PLEEDCTLELLKFEDEEAQAVYSGPSSG ---------------------------- >HYPOTHETICAL PROTEIN FLJ2; SWP:Q9H6S3; PDB:1WWUA; GSSGSSGFRVERSQPASQPLTYESGPDEVRAWLEAKAFSPRIVENLGILTGPQLFSLNKE --------------------11113333-------------------------------- ELKKVCGEEGVRVYSQLTMQKAFLEKQQSGSELSGPSSG -------3333---------------------------- >CONNECTOR ENHANCER OF KIN; SWP:Q969H4; PDB:1WWVA; GSSGSSGMEPVETWTPGKVATWLRGLDDSLQDYPFEDWQLPGKNLLQLCPQSLEALAVRS ---------3333-------------3333---3333---3333----11113333---- LGHQELILGGVEQLQALSSRLQTENSGPSSG ---------------------3333------ >Beta-nerve growth factor ; SWP:P01138; PDB:1WWWV; SSHPIFHRGEFSVCDSVSVWVGDKTTATDIKGKEVMVLGEVNINNSVFKQYFFETKCRDG --3333----------------------1111---------------------------- CRGIDSKHWNSYCTTTHTFVKALTMDGKQAAWRFIRIDTACVCVLSRK ----3333---------------------------------------- >E74-LIKE FACTOR 5 ESE-2B; SWP:Q58DT0; PDB:1WWXA; GSSGSSGSSHLWEFVRDLLLSPEENCGILEWEDREQGIFRVVKSEALAKMWGQRKKNDRM ---------3333----1111------------3333---------------11111111 TYEKLSRALRYYYKTGILERVDRRLVYKFGKNAHGWQEDKLSGPSSG 3333-33333333---------------------------------- >THIOREDOXIN-LIKE PROTEIN ; SWP:O43396; PDB:1WWYA; GSSGSSGGYMDLMPFINKAGCECLNESDEHGFDNCLRKDTTFLESDCDEQLLITVAFNQP -----------1111-3333----------1111-------------------------- VKLYSMKFQGPDNGQGPKYVKIFINLPRSMDFEEAERSEPTQALELTEDDIKEDGIVPLR -------------------------------3333-----------1111---------3 YVKFQNVNSVTIFVQSNQGEEETTRISYFTFIGTPVQATNMNDFKSGPSSG 333------------------------------------------------ >HYPOTHETICAL PROTEIN PH19; SWP:O59596; PDB:1WWZA; MDEIKIEKLKKLDKKALNELIDVYMSGYEGLEEYGGEGRDYARNYIKWCWKKASDGFFVA ---------------------------2222---------------------3333---- KVGDKIVGFIVCDKDWFSKYEGRIVGAIHEFVVDKKFQGKGIGRKLLITCLDFLGKYNDT -!!!!-------------1111-----------3333----------------------- IELWVGEKNYGAMNLYEKFGFKKVGKSGIWVRMIKRQ -----1111-------1111------!!!!------- >TRANSALDOLASE; 
SWP:Q5SJE8; PDB:1WX0A; MELYLDTASLEEIREIAAWGVLSGVTTNPTLVAKAFAAKGEALTEEAFAAHLRAICETVG ---------------3333----------------------------------------- GPVSAEVTALEAEAMVAEGRRLAAIHPNIVVKLPTTEEGLKACKRLSAEGIKVNMTLIFS ---------------------33331111------------------------------- ANQALLAARAGASYVSPFLGRVDDISWDGGELLREIVEMIQVQDLPVKVIAASIRHPRHV -------1111---------3333---3333---------1111-----------3333- TEAALLGADIATMPHAVFKQLLKHPLTDIGL -------------33331111------3333 >nicotinate-nucleotide--di; SWP:Q7SIC7; PDB:1WX1A; MDPEVFAQARLRMDQLTKPPRALGYLEEVALRLAALQGRVKPELGRGAVVVAAADHGVVA ------------1111--2222------------------------------------11 EGVSAYPQEVTRQMVLNFLRGGAAINQFALAADCAVYVLDVGVVGELPDHPGLLKRKVRP 11----3333-------1111----------------------------1111------- GTANLAQGPAMTPEEAERALLAGREAARRAIAEGATLLAAGDMGIGNTTAAAALTAALLG ---1111-----------------------1111---------2222------------- LPPEAVVGGEEGLRRKRQAVARALARLHPGMGPLEVAAEVGGLELVAIAGIYLEGYEAGL -3333------------------1111----------------------------1111- PLVLDGFPVTAGALLAWKMAPGLRDHLFAGHLSREPGHRHQLEALGLRPLLDLDLALGEG ---------------------3333---------3333---------------------- TGAVLAMPLLRAAARILHMATFQEAGVSRG --------------1111--3333------ >CYTOPLASMIC PROTEIN NCK2; SWP:O43639; PDB:1WX6A; GSSGSSGLSNGQGSRVLHVVQTLYPFSSVTEEELNFEKGETMEVIEKPENDPEWWKCKNA --------------------------------------------------1111----11 RGQVGLVPKNYVVVLSDGPALHPAHSGPSSG 11-----3333-------------------- >UBIQUILIN 3; SWP:Q9H347; PDB:1WX7A; GSSGSSGSPAPVQDPHLIKVTVKTPKDKEDFSVTDTCTIQQLKEEISQRFKAHPDQLVLI ---------------------------------3333---------------1111---- FAGKILKDPDSLAQCGVRDGLTVHLVIKRQHRAMGNECPASGPSSG %%%%--11113333-------------------------------- >RIKEN CDNA 4931431F19; SWP:Q9D4I8; PDB:1WX8A; GSSGSSGVSGREPSSRIIRVSVKTPQDCHEFFLAENSNVRRFKKQISKYLHCNADRLVLI ---------------------------------------3333---------1111---- FTGKILRDQDILSQRGILDGSTVHVVVRSHSGPSSG iiii-------3333--------------------- >HLA-B ASSOCIATED TRANSCRI; SWP:Q5STX1; PDB:1WX9A; 
GSSGSSGLEVLVKTLDSQTRTFIVGAQMNVKEFKEHIAASVSIPSEKQRLIYQGRVLQDD ------------------------3333---------------3333----%%%%--333 KKLQEYNVGGKVIHLVERAPSGPSSG 33333--2222--------------- >AFADIN; SWP:Q9QZQ1; PDB:1WXAA; GSSGSSGSGGTLRIYADSLKPNIPYKTILLSTTDTADFAVAESLEKYGLEKENPKDYCIA ----------------------------------------------------3333---- RVMLPPGAQHSDERGAKEIILDDDECPLQIFREWPSDKGILVFQLKRRPPSGPSSG ----------------------------------3333------------------ >Epidermal growth factor r; SWP:Q9H6S3; PDB:1WXBA; GSSGSSGKYVKILYDFTARNANELSVLKDEVLEVLEDGRQWWKLRSRSGQAGYVPCNILG -------------------3333----------------------3333-----3333-- EASGPSSG -------- >TYROSINASE; SWP:Q83WS2; PDB:1WXCA; TVRKNQATLTADEKRRFVAAVLELKRSGRYDEFVRTHNEFIMSDTDSGERTGHRSPSFLP ----3333------------------------------------1111------1111-- WHRRFLLDFEQALQSVDSSVTLPYWDWSADRTVRASLWAPDFLGGTGRSTDGRVMDGPFA ----------------1111-----3333--11111111-----------------1111 ASTGNWPINVRVDSRTYLRRSLGGSVAELPTRAEVESVLAISAYDLPPYNSASEGFRNHL 3333----------------2222-------------3333--------1111------- EGWRGVNLHNRVHVWVGGQMATGVSPNDPVFWLHHAYVDKLWAEWQRRHPDSAYVPTGGT ----------------!!!!---1111---------------------1111-------2 PDVVDLNETMKPWNTVRPADLLDHTAYYTFDAL 222-1111--------3333--3333------- >MelC; SWP:Q83WS1; PDB:1WXCB; AAPESFDEVYKGRRIQGRPAGYEVFVDGVQLHVMRNADGSWISVVSHYDPVPTPRAAARA ---------iiii------------iiii------1111---3333-------------- AVDELQGAPLLP ----iiii---- >NH(3)-DEPENDENT NAD(+) SY; SWP:P18843; PDB:1WXIA; TLQQQIIKALGAKPQINAEEEIRRSVDFLKSYLQTYPFIKSLVLGISGGQDSTLAGKLCQ -----------------------------------1111--------------------- MAINELRLETGNESLQFIAVRLPYGVQADEQDCQDAIAFIQPDRVLTVNIKGAVLASEQA -----------1111--------------------------------------------- LREAGIELSDFVRGNEKARERMKAQYSIAGMTSGVVVGTDHAAEAITGFFTKYGDGGTDI -1111-----------------------------------33331111--2222------ NPLYRLNKRQGKQLLAALACPEHLYKKADEVALGVTYDNIDDYLEGKNVPQQVARTIENW 1111--3333-----1111-3333----3333----------1111-------------- YLKTEHKRRPPITVFDDFWKK ---3333-----11111111- 
>SINGLE-STRAND RECOGNITION; SWP:Q05344; PDB:1WXLA; SHMPKRATTAFMLWLNDTRESIKRENPGIKVTEIAKKGGEMWKELKDKSKWEDAAAKDKQ --------3333----------------------------33331111------------ RYHDEMRNYKPEA ------------- >A-Raf proto-oncogene seri; SWP:P10398; PDB:1WXMA; GSSGSSGGTVKVYLPNKQRTVVTVRDGMSVYDSLDKALKVRGLNQDCCVVYRLIKGRKTV -------------------------------------3333------------%%%%--- TAWDTAIAPLDGEELIVEVLSGPSSG -------------------------- >TOXIN APETX2; SWP:P61542; PDB:1WXNA; GTACSCGNSKGIYWFYRPSCPTDRGYTGSCRYFLGTCCTPAD --------------------1111-------1111------- >THO COMPLEX SUBUNIT 1; SWP:Q96FV9; PDB:1WXPA; GSSGSSGPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIRQIECDSEDMKMRA ----------------3333-------!!!!---1111---------------------- KQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNSSGPSSG -----------1111----------------------------------- >GTP-BINDING PROTEIN; SWP:O58261; PDB:1WXQA; EIGVVGKPNVGKSTFFSAATLANVGVTYAITDHPCKELGCSPNPQNYEYRNGLALIPVKV -------------------------------------------------%%%%------- DVAFLDDLRASALIHVVDATGKTDPEGQPTDYHDPVEDIEFLEREIDYWIYGILSKGWDK -----------------3333--1111------3333-----------------2222-- FAKRIKLQKIKLESAIAEHLSGIGVNENDVWEAHKLNLPEDPTKWSQDDLLAFASEIRRV 1111---------------3333----------1111---3333---------------- NKPVIAANKADAASDEQIKRLVREEEKRGYIVIPTSAAAELTLRKAAKAGFIEYIPALVI --------3333-3333-----------------------------------------33 KEKVLDRFGSTGVQEVINRVVFDLLKLIPVYPVHDEQFGNVLPHVFLKKGSTPRDLAFKV 33---------------------------------------------2222--------- HTDLGKGFLYAINARTKRRVGEDYELQFNDIVKIVSV --------------------1111------------- >HAEMOGLOBIN PROTEASE; SWP:O88093; PDB:1WXRA; GTVNNELGYQLFRDFAENKGMFRPGATNIAIYNKQGEFVGTLDKAAMPDFSAVDSEIGVA -------3333---1111!!!!2222------1111-------------3333------- TLINPQYIASVKHNGGYTNVSFGDGENRYNIVDRNNAPSLDFHAPRLDKLVTEVAPTAVT ----------1111-------!!!!----------------------------------- AQGAVAGAYLDKERYPVFYRLGSGTQYIKDSNGQLTKMGGAYSWLTGGTVGSLSSYQNGE ----2222---------------------2222-----------------------iiii 
MISTSSGLVFDYKLNGAMPIYGEAGDSGSPLFAFDTVQNKWVLVGVLTAGNGAGGRGNNW ----3333--------------2222---------1111------------2222----- AVIPLDFIGQKFNEDNDAPVTFRTSEGGALEWSFNSSTGAGALTQGTTTYAMHGQQGNDL ------------1111------3333------------------!!!!-------!!!!1 NAGKNLIFQGQNGQINLKDSVSQGAGSLTFRDNYTVTTSNGSTWTGAGIVVDNGVSVNWQ 111-------------------!!!!-------------------------2222----- VNGVKGDNLHKIGEGTLTVQGTGINEGGLKVGDGKVVLNQQADNKGQVQAFSSVNIASGR ---2222-----------------------------------1111-------------- PTVVLTDERQVNPDTVSWGYRGGTLDVNGNSLTFHQLKAADYGAVLANNVDKRATITLDY -----------3333---2222----iiii----------3333---------------- ALRADKVALNGWSESGKGTAGNLYKYNNPYTNTTDYFILKQSTYGYFPTDQSSNATWEFV --1111------3333--2222-------------------------------1111--- GHSQGDAQKLVADRFNTAGYLFHGQLKGNLNVDNRLPEGVTGALVMDGAADISGTFTQEN --------------3333------------------2222-------------------- GRLTLQGHPVIHAYNTQSVADKLAASGDHSVLTQPTSFSQEDWENRSFTFDRLSLKNTDF ----------------------3333----------1111-------------------- GLGRNATLNTTIQADNSSVTLGDSRVFIDKNDGQGTAFTLEEGTSVATKDADKSVFNGTV ----------------------------1111----------------3333-------- NLDNQSVLNINDIFNGGIQANNSTVNISSDSAVLGNSTLTSTALNLNKGANALASQSFVS ----------------------------------------------2222---------- DGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPYSMLSGNINVQDK ---------------1111------------------1111------------------- GTVTLGGEGELSPDLTLQNQMLYSLFNGYRNIWSGSLNAPDATVSMTDTQWSMNGNSTAG -------------------------iiii------------------------------- NMKLNRTIVGFNGFTTLTTDNLDAVQSAFVMRTKADKLVINKSATGHDNSIWVNFLKKPS ------------------------------------------------------------ NKDTLDIPLVSAPEATADNLFRASTRVVGFSDVTPILSVRKEDGKKEWVLDGYQVARAAA ------------11111111---------------------------------------- TFMHISYNNFITEVN 3333----------- >UBIQUITIN-FOLD MODIFIER 1; SWP:NA; PDB:1WXSA; SMSKVSFKITLTSDPRLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAIITNDGIGINP ------------------------------------3333-----------3333---11 AQTAGNVFLKHGSELRIIPRDRVG 11---------------------- >HYPOTHETICAL PROTEIN FLJ2; SWP:Q8TE67; 
PDB:1WXTA; GSSGSSGLKMQVLYEFEARNPRELTVVQGEKLEVLDHSKRWWLVKNEAGRSGYIPSNILE -------------------1111----------------------3333-----3333-- PLSGPSSG -------- >PEROXISOMAL BIOGENESIS FA; SWP:Q9D0K1; PDB:1WXUA; GSSGSSGTNWASGEDDHVVARAEYDFVAVSDEEISFRAGDMLNLALKEQQPKVRGWLLAS --------3333-----------------3333------------3333----------- LDGQTTGLIPANYVKILGKRRGRKTIESGPSSG ----------3333--------3333------- >BAG-FAMILY MOLECULAR CHAP; SWP:Q99933; PDB:1WXVA; GSSGSSGLTVTVTHSNEKHDLHVTSQQGSSEPVVQDLAQVVEEVIGVPQSFQKLIFKGKS --------------------------------3333-----------1111----iiii- LKEMETPLSALGIQDGCRVMLIGKKNSGPSSG --------3333-------------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SIT4; PDB:1WXXA; RIQVNAKGAARLLSRHLWVFRRDVVSGPETPGLYPVYWGRRFLALALYNPHTDLAVRAYR -------------------3333--------------!!!!-------1111-------- FAPAEDPVAALLENLAQALARREAVLRQDPEGGYRLVHAEGDLLPGLVVDYYAGHAVVQA ----------------------------1111------3333-2222----iiii----- TAHAWEGLLPQVAEALRPHVQSVLAKNDARTRELEGLPLYVRPLLGEVPERVQVQEGRVR -33331111------3333-----------3333---------------------!!!!- YLVDLRAGQKTGAYLDQRENRLYERFRGERALDVFSYAGGFALHLALGFREVVAVDSSAE -----1111----3333------------------!!!!3333----------------- ALRRAEENARLNGLGNVRVLEANAFDLLRRLEKEGERFDLVVLDPPAFAKGKKDVERAYR ---------11111111--------------1111---------------1111------ AYKEVNLRAIKLLKEGGILATASCSHHTEPLFYAVAEAAQDAHRLLRVVEKRGQPFDHPV ---------11112222----------------------1111-----------1111-- LLNHPETHYLKFAVFQVL 11113333---------- >GERANYLGERANYL PYROPHOSPH; SWP:O58799; PDB:1WY0A; EKYEELFARIKEKAKLIDEKIFELIPEKDPRVLYEAARHYPLAGGKRVRPFVVLTSTEAV 3333-------------------------3333--33331111----------------- GGDPLRAIYPAVAIELIHNYSLVHDDIMDMDETRRGKPTVHRIWGVNMAILAGDLLFSKA --3333---------------------------!!!!--3333----------------- FEAVARAEIPPEKKARVLEVIVKASNELCEGQARDLEFEKKSTVTIEEYMEMISGKTGAL ------------------------------------------------------------ FEASAKVGGIIGTDNEEYIKALSSWGRNVGIAFQIWDDVLDLIADEKKLGKPVGSDIRKG --------------3333---------------------------3333-----3333-- 
KKTLIVAHFFENADEKDKQRFLKIFGKDIKSDVMEAIDLLKKYGSIDYAAEIAKDMIKKA ---------------------1111----------------------------------- NEALRILPKSKARMDLELLAKFIVERE ---3333--------------1111-- >HYPOTHETICAL PROTEIN PH06; SWP:O58404; PDB:1WY1A; WKDSPIIEANGTLDELTSFIGEAKHYVDEEMKGILEEIQNDIYKIMGEIGSKGKIEGISE 1111-----------------3333-----------------------1111-------- ERIKWLEGLISRYEEMVNLKSFVLPGGTLESAKLDVCRTIARRAERKVATVLREFGIGKE -------------1111----------3333----------------------------- ALVYLNRLSDLLFLLARVIEIE ---------------------- >XAA-PRO DIPEPTIDASE; SWP:O58885; PDB:1WY2A; MDIMNEKVKKIIEFMDKNSIDAVLIAKNPNVYYISGASPLAGGYILITGESATLYVPELE ---------------1111-------------------------------------1111 YEMAKEESNIPVEKFKKMDEFYKALEGIKSLGIESSLPYGFIEELKKKANIKEFKKVDDV ----------------3333--1111-------1111----------------------- IRDMRIIKSEKEIKIIEKACEIADKAVMAAIEEITEGKKEREVAAKVEYLMKMNGAEKPA ---3333---------------------------22223333---------1111----- FDTIIASGYRSALPHGVASDKRIERGDLVVIDLGALYQHYNSDITRTIVVGSPNEKQKEI ------!!!!-------------2222---------iiii-------------------- YEIVLEAQKKAVESAKPGITAKELDSIARNIIAEYGYGEYFNHSLGHGVGLEVHEWPRVS -----------1111-----------------11113333-------------------1 QYDETVLREGMVITIEPGIYIPKIGGVRIEDTILITKNGSKRLTKTERELI 111----2222---------2222--------------------------- >HYPOTHETICAL UPF0072 PROT; SWP:O67728; PDB:1WY5A; MNPESRVIRKVLALQNDEKIFSGERRVLIAFSGGVDSVVLTDVLLKLKNYFSLKEVALAH ------------------------------------------------1111-------- FNHMLRESAERDEEFCKEFAKERNMKIFVGKEDVRAFAKENRMSLEEAGRFLRYKFLKEI -----3333--------------------------------------------------- LESEGFDCIATAHHLNDLLETSLLFFTRGTGLDGLIGFLPKEEVIRRPLYYVKRSEIEEY -1111-----------------------------------------1111--3333---- AKFKGLRWVEDETNYEVSIPRNRIRHRVIPELKRINENLEDTFLKMVKVLRAEREFLEEE ----------3333---------------------------------------------- AQKLYKEVKKGNCLDVKKLKEKPLALQRRVIRKFIGEKDYEKVELVRSLLEKGGEVNLGK -----------------3333--------------------------------------- GKVLKRKERWL ----------- >HYPOTHETICAL PROTEIN ST16; 
SWP:Q970G9; PDB:1WY6A; EIIRKLMDAKKFLLDGYIDEGVKIVLEITKSSTKSEYNWFICNLLESIDCRYMFQVLDKI ------------1111-----------3333-----------------3333-------3 GSYFDLDKCQNLKSVVECGVINNTLNEHVNKALDILVIQGKRDKLEEIGREILNEVSASI 333-3333--3333----------------------1111-------------------- LVAIANALRRVGDERDATTLLIEACKKGEKEACNAVNTL -------3333------------------------1111 >HYPOTHETICAL PROTEIN PH19; SWP:O59611; PDB:1WY7A; RKKELAIALSKLKGFKNPKVWLEQYRTPGNAASELLWLAYSLGDIEGKVVADLGAGTGVL --------1111------3333-----------------1111-2222------!!!!-- SYGALLLGAKEVICVEVDKEAVDVLIENLGEFKGKFKVFIGDVSEFNSRVDIVINPPFGS -----------------3333-------1111---------3333--------------- QRKHADRPFLLKAFEISDVVYSIHLAKPEVRRFIEKFSWEHGFVVTHRLTTKIEIPHRKK -22223333--------------------------------------------------- LERITVDIYRFSKVI --------------- >NP95-LIKE RING FINGER PRO; SWP:NA; PDB:1WY8A; GSSGSSGMWIQVRTIDGSKTCTIEDVSRKATIEELRERVWALFDVRPECQRLFYRGKQLE -------------3333---------11113333-----------1111----%%%%--- NGYTLFDYDVGLNDIIQLLVRPDSGPSSG ---3333---------------------- >ALLOGRAFT INFLAMMATORY FA; SWP:O70200; PDB:1WY9A; KAQQEERLEGINKQFLDDPKYSNDEDLPSKLEAFKVKYMEFDLNGNGDIDIMSLKRMLEK -----------------1111--1111----------1111--1111-----------11 LGVPKTHLELKRLIREVSSGSEETFSYSDFLRMMLGKRSAILRMILMYEEK 11------------1111-------------------------3333---- >HYPOTHETICAL ASPARTYL-TRN; SWP:Q976I3; PDB:1WYDA; MYRSHFIADVTPEYDGKEVIWAGWVHLLRDLGGKKFIILRDKTGLGQVVVDKNSSAFGIS -----1111-1111----------------iiii------3333------1111-33331 QELTQESVIQVRGIVKADKRAPRGIELHAEEITLLSKAKAPLPLDVSGKVKADIDTRLRE 1112222-------------2222------------------------------------ RVLDLRRQEMQAVIKIQSLALKAFRETLYKEGFIEIFTPKIIASATEGGAQLFPVIYFGK ----------------------------1111------------------------iiii EAFLAQSPQLYKELMAGVVERVFEVAPAWRAEESDTPFHLAEFISMDVEMAFADYNDVMQ --------------1111------------------------------------------ LLEKILHNIVKTIKEEGKEELKILNYEPPEVKIPIKRLKYTEAIEILRSKGYNIKFGDDI ------------------------------------------------------------ 
GTPELRILNEELKEDLYFIVDWPSDARPFYTKSKSEPELSESFDLIYKFLEIVSGSTRNH ----------------------1111-1111----------------------------- KREVLEEALKKKGLKPESFEFFLKWFDYGMPPHAGFGMGLARLMVMLTGIQSVKEIVPFP ---------1111-33333333------------------3333-------1111----- RDKKRLTP -1111--- >XANTHINE DEHYDROGENASE/OX; SWP:P22985; PDB:1WYGA; ADELVFFVNGKKVVEKNADPETTLLVYLRRKLGLCGTKLGCGEGGCGACTVMISKYDRLQ ------------------1111------------------------1111---------- NKIVHFSVNACLAPICSLHHVAVTTVEGIGNTQKLHPVQERIARSHGSQCGFCTPGIVMS --------3333-1111-------3333--3333----------------1111------ MYTLLRNQPEPTVEEIENAFQGNLCRCTGYRPILQGFRTFAKPSLFNPEDFKPLDPTQEP ---3333----------1111--------------3333-------3333----3333-- IFPPELLRLKDTPQKKLRFEGERVTWIQASTMEELLDLKAQHPDAKLVVGNTEIGIEMKF --3333-----------------------------------3333--------------- KNMLFPLIVCPAWIPELNSVVHGPEGISFGASCPLSLVESVLAEEIAKLPEQKTEVFRGV ----------33331111----3333-----------------1111--1111-3333-- MEQLRWFAGKQVKSVASIGGNIITASPISDLNPVFMASGAKLTLVSRGTRRTVRMDHTFF --------3333---------33331111-3333---------------------3333- PGYRKTLLRPEEILLSIEIPYSKEGEFFSAFKQASRREDDIAKVTSGMRVLFKPGTIEVQ --------1111----------2222--------------------------2222---- ELSLCFGGMADRTISALKTTPKQLSKSWNEELLQSVCAGLAEELQLAPDAPGGMVEFRRT ------------------3333------------------------1111---------- LTLSFFFKFYLTVLQKLGRADLEDMAGKLDPTFASATLLFQKDPPANVQLFQEVPKDQSE ---------------------3333---------1111--------------------11 EDMVGRPLPHLAANMQASGEAVYCDDIPRYENELSLRLVTSTRAHAKITSIDTSEAKKVP 112222---1111----------3333--------------------------3333-22 GFVCFLTAEDVPNSNATGLFNDETVFAKDEVTCVGHIIGAVVADTPEHAQRAARGVKITY 22----3333----------------------2222----------------1111---- EDLPAIITIQDAINNNSFYGSEIKIEKGDLKKGFSEADNVVSGELYIGGQEHFYLETNCT ------------1111-------------------------------------------- IAVPKGEAGEMELFVSTQNTMKTQSFVAKMLGVPDNRIVVRVKRMGGGFGGKETRSTVVS ------%%%%-----------------------3333----------iiii--3333--- TALALAAHKTGRPVRCMLDRDEDMLITGGRHPFLAKYKVGFMKTGTVVALEVAHFSNGGN 
------------------3333-------------------3333--------------- TEDLSRSIMERALFHMDNAYKIPNIRGTGRICKTNLPSNTAFRGFGGPQGMLIAEYWMSE -!!!!---------1111------------------------------------------ VAITCGLPAEEVRRKNMYKEGDLTHFNQKLEGFTLPRCWDECIASSQYLARKREVEKFNR -----------------------1111---------------------3333-------- ENRWKKRGLCIIPTKFGISFTLPFLNQGGALVHVYTDGSVLLTHGGTEMGQGLHTKMVQV ---------------------3333---------1111---------------------- ASRALKIPTSKIHISETSTNTVPNTSPTAASASADLNGQGVYEACQTILKRLEPFKKKKP -------3333------3333-------iiii3333----------------------11 TGPWEAWVMDAYTSAVSLSATGFYKTPNLGYSFETNSGNPFHYFSYGVACSEVEIDCLTG 11---------1111----------------------------------------1111- DHKNLRTDIVMDVGSSLNPAIDIGQVEGAFVQGLGLFTMEELHYSPEGSLHTRGPSTYKI --------------------------------------------1111-----3333--- PAFGSIPIEFRVSLLRDCPNKRAIYASKAVGEPPLFLASSIFFAIKDAIRAARAQHGDNA -1111--------------1111%%%%----3333-----------------1111---- KQLFQLDSPATPEKIRNACVDQFTTLCVTSKSWSVRI ----------------------3333----------- >SKELETAL MUSCLE LIM-PROTE; SWP:Q13643; PDB:1WYHA; GSSGSSGCSACGETVMPGSRKLEYGGQTWHEHCFLCSGCEQPLGSRSFVPDKGAHYCVPC -----------------------------1111--------------------------- YENKFASGPSSG --1111------ >PROTOCADHERIN BETA 14; SWP:Q6PB90; PDB:1WYJA; GSSGSSGAGSATITYSVLEETDRGSLVGNLAKDLGLSLRELITRGAQILSKGNKQLLQLE ---------------------2222----------------1111--------------- QKSGNLLLKEKLDREELCGSTNPCILHFQVLLKSPVQFIQGEIQLQDVNDHAPEFMEDES -------------3333------------------------------------------- GPSSG ----- >TRANSGELIN-2; SWP:P37802; PDB:1WYMA; GSSGSSGQKIEKQYDADLEQILIQWITTQCRKDVGRPQPGRENFQNWLKDGTVLCELINA ------------------------3333-------------------3333-3333-333 LYPEGQAPVKKIQASTMAFKQMEQISQFLQAAERYGINTTDIFQTVDLWEGKNMACVQRT 3---------------3333------------3333-3333------1111-3333---- LMNLGGLAVARDDGLFSGDPNWFPKKSKESGPSSG -------1111-------3333------------- >CALPONIN-2; SWP:Q99439; PDB:1WYNA; GSSGSSGNRLLSKYDPQKEAELRTWIEGLTGLSIGPDFQKGLKDGTILCTLMNKLQPGSV 
-------------------------3333----------3333--3333--3333----- PKINRSMQNWHQLENLSNFIKAMVSYGMNPVDLFEANDLFESGNMTQVQVSLLALAGKAK -----------------------3333-1111--3333-------3333--------333 TKGLQSGVDIGVKYSEKQERSGPSSG 3------------------------- >Microtubule-associated pr; SWP:Q9UPY8; PDB:1WYOA; GSSGSSGMAVNVYSTSVTSENLSRHDMLAWVNDSLHLNYTKIEQLCSGAAYCQFMDMLFP -------------------------------------------3333-----------22 GCVHLRKVKFQAKLEHEYIHNFKVLQAAFKKMGVDKIIPVEKLVKGKFQDNFEFIQWFKK 22-3333-1111-3333-----3333------------33333333--3333-------- FFDANYDGKDYNPLLARQGQDVAPPPNPGDQIFSGPSSG ------------3333---3333---------------- >CALPONIN 1; SWP:P51911; PDB:1WYPA; GSSGSSGNKLAQKYDHQREQELREWIEGVTGRRIGNNFMDGLKDGIILCEFINKLQPGSV --------------------------------------3333------------------ KKINESTQNWHQLENIGNFIKAITKYGVKPHDIFEANDLFENTNHTQVQSTLLALASMAK --------3333----------------1111--3333------3333-----------3 TKGNKVNVGVSGPSSG 333------------- >SPECTRIN BETA CHAIN, BRAI; SWP:O15020; PDB:1WYQA; GSSGSSGAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAFNAIVHKHRPDLLDFESLKK ---------3333-----33333333-----------3333------------3333--- CNAHYNLQNAFNLAEKELGLTKLLDPEDVNVDQPDEKSIITYVATYYHYFSKMKALAVEG ----------------------------------3333-----------3333------- KSGPSSG ------- >RHO GUANINE NUCLEOTIDE EX; SWP:Q15052; PDB:1WYRA; GSSGSSGEEQIVTWLISLGVLESPKKTICDPEEFLKSSLKNGVVLCKLINRLMPGSVEKF ------------------------------------------------------------ CLDPQTEADCINNINDFLKGCATLQVEIFDPDDLYSGVNFSKVLSTLLAVNKATESGPSS -----3333-----------3333---------------3333----------------- G - >RIKEN CDNA 2310008M20 PRO; SWP:NA; PDB:1WYSA; GSSGSSGMAELDIGQHCQVQHCRQRDFLPFVCDGCSGIFCLEHRSKDSHGCSEVNVVKER ---------------------------------------3333-3333------------ PKTDEHKSYSGPSSG --------------- >GLYCINE DEHYDROGENASE SUB; SWP:Q5SKW8; PDB:1WYUA; MDYTPHTEEEIREMLRRVGAASLEDLFAHLPKEILSPPIDLPEPLPEWKVLEELRRLAAQ ---------------------3333-11113333-----------------------333 NLPAHKAFLGGGVRSHHVPPVVQALAARGEFLTAYTPYQPEVSQGVLQATFEYQTMIAEL 
3--2222-----------3333------3333------3333------------------ AGLEIANASMYDGATALAEGVLLALRETGRMGVLVSQGVHPEYRAVLRAYLEAVGAKLLT ---------------------------------------------------1111----- LPLEGGRTPLPEVGEEVGAVVVQNPNFLGALEDLGPFAEAAHGAGALFVAVADPLSLGVL ---iiii------1111--------1111------------1111-------3333---- KPPGAYGADIAVGDGQSLGLPMGFGGPHFGFLATKKAFVRQLPGRLVSETVDVEGRRGFI -3333---------1111---%%%%---------33331111---------1111----- LTLQAREQYIRRAKAKSNITTNAQLTALMGAMYLAALGPEGLREVALKSVEMAHKLHALL ---111133331111--------------------------------------------- LEVPGVRPFTPKPFFNEFALALPKDPEAVRRALAERGFHGATPVPREYGENLALFAATEL --2222---------------------------1111-------3333---------111 HEEEDLLALREALKEVL 13333------------ >Glycine dehydrogenase sub; SWP:Q5SKW7; PDB:1WYUB; SFPLIFERSRKGRRGLKLVKAVPKAEDLIPKEHLREVPPRLPEVDELTLVRHYTGLSRRQ ---3333--2222----------3333--3333-----------------------1111 VGVDTTFYPLGSCTMKYNPKLHEEAARLFADLHPYQDPRTAQGALRLMWELGEYLKALTG -3333----2222-----3333----------11113333-------------------- MDAITLEPAAGAHGELTGILIIRAYHEDRGEGRTRRVVLVPDSAHGSNPATASMAGYQVR --------------------------1111----------1111--------1111---- EIPSGPEGEVDLEALKRELGPHVAALMLTNPNTLGLFERRILEISRLCKEAGVQLYYDGA ----1111-----------1111--------1111--1111-------1111------11 NLNAIMGWARPGDMGFDVVHLNLHKTFTVPHGGGGPGSGPVGVKAHLAPYLPVPLVERGE 11--2222-3333--------1111-----%%%%---------33331111--------- EGFYLDFDRPKSIGRVRSFYGNFLALVRAWAYIRTLGLEGLKKAAALAVLNARYLKELLK --------1111---------3333---------------------------------11 EKGYRVPYDGPSMHEFVAQPPEGFRALDLAKGLLELGFHPPTVYFPLIVKEALMVEPTET 11------------------2222---------1111--------1111--------111 EAKETLEAFAEAMGALLKKPKEWLENAPYSTPVRRLDELRANKHPKLTYFDEG 1--------------1111----1111--------------------1111-- >G/T MISMATCH-SPECIFIC THY; SWP:Q13569; PDB:1WYWA; AELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHM -3333----------------------------------------------------111 DDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYE 
11111-----------------3333---------------------------------- IFSKEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDLR ------------------------------------3333----3333------------ DQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEE -1111-------------------------3333-- >CRK-ASSOCIATED SUBSTRATE; SWP:P56945; PDB:1WYXA; LNVLAKALYDNVAESPDELSFRKGDIMTVLEQDTQGLDGWWLCSLHGRQGIVPGNRLKIL --------------3333---2222-------22222222----iiii----1111---2 VGMYDKKP 222----- >putative S-adenosylmethio; SWP:Q8A031; PDB:1WYZA; ETALYLLPVTLGDTPLEQVLPSYNTEIIRGIRHFIVEDVRSARRFLKKVDREIDIDSLTF --------------1111--3333---3333------------------3333------- YPLNKHTSPEDISGYLKPLAGGASGVISEDPGADVVAIAQRQKLKVIPLVGPSSIILSVA -------3333----3333----------3333------1111---------3333---- SGFNGQSFAFHGYLPIEPGERAKKLKTLEQRVYAESQTQLFIETPYRNHKIEDILQNCRP -------------------------------------------3333-----------11 QTKLCIAANITCEGEFIQTRTVKDWKGHIPKIPCIFLLYK 11------2222--------3333---------------- >Putative uncharacterized ; SWP:A0A5D7; PDB:1WZ1L; DVVMTQTPLSLPVSLGNQASISCRSSQSLVHSNGNTYLHWYLQKPGQSPKLLIYKVSNRF -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDLGVYFCSQSTHVPFTFGSGTKLEIKR 2221111----------------3333-------------------------- >AUTOPHAGY 12B; SWP:Q9LVK3; PDB:1WZ3A; QKIVVHLRATGGAPILKQSKFKVSGSDKFANVIDFLRRQLHSDSLFVYVNSAFSPNPDES ---------iiii----------3333----------------------------1111- VIDLYNNFGFDGKLVVNYACSMAW ---------iiii----------- >POTASSIUM CHANNEL BLOCKIN; SWP:Q10726; PDB:1WZ5A; LVKCRGTSDCGRPCQQQTGPNSKCINRMCKCYG ------------3333--------3333----- >HMG-BOX TRANSCRIPTION FAC; SWP:Q8CDV1; PDB:1WZ6A; GSSGSSGARRPMNAFLLFCKRHRSLVRQEHPRLDNRGATKILADWWAVLDPKEKQKYTDM ---------------------------------3333----------------------- AKEYKDAFMKANPGYRSGPSSG ---------------------- >ENOYL-COA HYDRATASE; SWP:Q5SLS5; PDB:1WZ8A; LASLEARYPGLAFAWPRPGVLEITFRGEKLNAMPPALHRGLARVWRDLEAVEGVRAVLLR -------2222-----2222-------2222-----------33333333---------- 
GEGGVFSAGGSFGLIEEMRASHEALLRVFWEARDLVLGPLNFPRPVVAAVEKVAVGAGLA -%%%%---------------------------------1111------------------ LALAADIAVVGKGTRLLDGHLRLGVAAGDHAVLLWPLLVGMAKAKYHLLLNEPLTGEEAE -3333-----1111----3333-------33333333----------------------- RLGLVALAVEDEKVYEKALEVAERLAQGPKEALHHTKHALNHWYRSFLPHFELSLALEFL ---------3333-----------1111-------------------------------- GFSGKELEEGLKALKEKRPPEFP -------------1111------ >MASPIN PRECURSOR; SWP:P36952; PDB:1WZ9A; MDALQLANSAFAVDLFKQLEKEPLGNVLFSPICLSTSLSLAQVGAKGDTANEIGQVLHFE ---------------------1111----3333--------1111-------------11 NVKDVPFGFQTVTSDVNKLSSFYSLKLIKRLYVDKSLNLSTEFISSTKRPYAKELETVDF 11---------------3333------------3333-------------2222------ KDKLEETKGQINNSIKDLTDGHFENILADNSVNDQTKILVVNAAYFVGKWMKKFPESETK ----------------1111----1111----1111------------------3333-- EPFRLNKTDTKPVQMMNMEATFMGNIDSINKIIELPFQNKHLSMFILLPKDVEDESTGLE --------------------------1111------2222-------------------- KIEKQLNSESLSQWTNPSTMANAKVKLSIPKFKVEKMIDPKACLENLGLKHIFSEDTSDF ---------------3333-------------------------1111-3333------3 SGMSETKGVALSNVIHKVLEITEDGQHKDELNADHPFIYIIRHNKTRNIIFFGKFSP 333----------------------------------------1111---------- >ALPHA-AMYLASE A; SWP:Q8GPL8; PDB:1WZAA; FEKHGTYYEIFVRSFYDSDGDGIGDLKGIIEKLDYLNDGDPETIADLGVNGIWLMPIFKS ----------3333-----------------3333----1111----------------- PSYHGYDVTDYYKINPDYGTLEDFHKLVEAAHQRGIKVIIDLPINHTSERHPWFLKASRD ---------1111-3333-------------1111------------1111--------1 KNSEYRDYYVWAGPDTDTKETKLDGGRVWHYSPTGMYYGYFWSGMPDLNYNNPEVQEKVI 111-1111------------------------------3333------------------ GIAKYWLKQGVDGFRLDGAMHIFPPAQYDKNFTWWEKFRQEIEEVKPVYLVGEVWDISET -------------------33331111-----------------------------3333 VAPYFKYGFDSTFNFKLAEAVIATAKAGFPFGFNKKAKHIYGVYDREVGFGNYIDAPFLT ------------------------------------------------2222-------- NHDQNRILDQLGQDRNKARVAASIYLTLPGNPFIYYGEEIGMRGQGPHEVIREPFQWYNG 1111----1111----------------------2222--------3333---------- 
SGEGETYWEPAMYNDGFTSVEQEEKNLDSLLNHYRRLIHFRNENPVFYTGKIEIINGGLN -2222-----1111-------11111111--------------3333------------- VVAFRRYNDKRDLYVYHNLVNRPVKIKVASGNWTLLFNSGDKEITPVEDNNKLMYTIPAY ------------------------------------------------%%%%-----222 TTIVLEKE 2------- >MANNOSYL-3-PHOSPHOGLYCERA; SWP:O58690; PDB:1WZCA; MIRLIFLDIDKTLIPGYEPDPAKPIIEELKDMGFEIIFNSSKTRAEQEYYRKELEVETPF --------2222------3333-------------------------------------- ISENGSAIFIPKGYFPYIVIELGIRVEKIREELKKLENIYGLKYYGNSTKEEIEKFTGMP ------------------------3333-------1111----3333------------3 PELVPLAMEREYSETIFEWSRDGWEEVLVEGGFKVTMGSRFYTVHGNSDKGKAAKILLDF 333-3333--------------------1111---------------------------- YKRLGQIESYAVGDSYNDFPMFEVVDKVFIVGSLKHKKAQNVSSIIDVLEVIKH -1111---------33333333-------------------------------- >HEME OXYGENASE; SWP:Q54AI1; PDB:1WZDA; GLAVELKQSTAQAHEKAEHSTFMSDLLKGRLGVAEFTRLQEQAWLFYTALEQAVDAVRAS -------------------------1111----------------------------111 GFAESLLDPALNRAEVLARDLDKLNGSSEWRSRITASPAVIDYVNRLEEIRDNVDGPALV 1-3333-3333----------------3333----------------------------- AHHYVRYLGDLSGGQVIARMMQRHYGVDPEALGFYHFEGIAKLKVYKDEYREKLNNLELS ---------------------------33333333-1111-------------1111--- DEQREHLLKEATDAFVFNHQVFADLGKGL -----------------------1111-- >ALPHA-AMYLASE II; SWP:Q08751; PDB:1WZLA; MLLEAIFHEAKGSYAYPISETQLRVRLRAKKGDVVRCEVLYADRYASPEEELAHALAGKA -3333-----!!!!---------------2222---------1111-------------- GSDERFDYFEALLECSTKRVKYVFLLTGPQGEAVYFGETGFSAERSKAGVFQYAYIHRSE --1111--------1111------------------3333---3333---------3333 VFTTPEWAKEAVIYQIFPERFANGDPSNDPPGTEQWAKDARPRHDSFYGGDLKGVIDRLP ----3333--------3333----3333-2222---1111--1111-------------- YLEELGVTALYFTPIFASPSHHKYDTADYLAIDPQFGDLPTFRRLVDEAHRRGIKIILDA ---------------------------1111-3333-------------1111------- VFNHAGDQFFAFRDVLQKGEQSRYKDWFFIEDFPVSKTSRTNYETFAVQVPAMPKLRTEN -----1111--------!!!!1111------------------------1111---3333 PEVKEYLFDVARFWMEQGIDGWRLDVANEVDHAFWREFRRLVKSLNPDALIVGEIWHDAS 
--------------1111-------1111----------------1111---------33 GWLMGDQFDSVMNYLFRESVIRFFATGEIHAERFDAELTRARMLYPEQAAQGLWNLLDSH 33--------------------------------------1111----3333------11 DTERFLTSCGGNEAKFRLAVLFQMTYLGTPLIYYGDEIGMAGATDPDCLRPMIWEEKEQN 11----1111------------1111------2222------------------3333-- RGLFEFYKELIRLRHRLASLTRGNVRSWHADKQANLYAFVRTVQDQHVGVVLNNRGEKQT ----------------3333-----------1111-------!!!!-------------- VLLQVPESGGKTWLDCLTGEEVHGKQGQLKLTLRPYQGMILWNGR -----3333---------------%%%%----------------- >SAM-DEPENDENT METHYLTRANS; SWP:O59000; PDB:1WZNA; MYELYTLLAEYYDTIYRRRIERVKAEIDFVEEIFKEDAKREVRRVLDLACGTGIPTLELA -33331111---33333333-----------------------------!!!!------1 ERGYEVVGLDLHEEMLRVARRKAKERNLKIEFLQGDVLEIAFKNEFDAVTMFFSTIMYFD 111--------3333--------1111--------1111----------------1111- EEDLRKLFSKVAEALKPGGVFITDFPCGPVVWNEQKGEEKLVIMDWREVEPAVQKLRFKR -----------11112222----------------!!!!-----------1111------ LVQILRPNGEVKAFLVDDELNIYTPREVRLLAEKYFEKVKIYGNLKRELSPNDMRYWIVG -----1111----------------------1111-------%%%%---1111------- IAKS ---- >HPCE; SWP:Q5SJQ0; PDB:1WZOA; MKLARFLAKGRVHQGVYREGLLLDEAGEAHRPEDVTWLLPFTPGKILGVALNYAGLSRPE -------iiii------%%%%--1111---3333-------------------------- EPALFWKPNTSLLPHKGVVLYPKGARFVHYEVELAVVVGRPMKRVRAKDALDYVLGYTIA --------1111-2222----2222--------------------3333-1111------ NDLVARDYVRPPIRAKGRDTFLPLGPFLVVEEVEDPQDLWLRAYVNGELRQEGHTSRMLY ----3333---3333--2222-------------1111------iiii-----3333--- SVAELLEFISEFMTLEPYDVLLTGTPKGISQVRPGDVMRLEIEGLGALENPIEEEP ---------------2222-------------2222-----2222----------- >QUINOLINATE SYNTHETASE A; SWP:O57767; PDB:1WZUA; MDLVEEILRLKEERNAIILAHNYQLPEVQDIADFIGDSLELARRATRVDADVIVFAGVDF --------------------1111----1111-------------------------333 MAETAKILNPDKVVLIPSVEHILEAKRKYPNAPVVLYVNSTAEAKAYADVTVTSANAVEV 3-------1111------3333------1111--------33331111----3333---- VKKLDSDVVIFGPDKNLAHYVAKMTGKKIIPVFTLDDVERAKKLHPNAKLMIHPECIPEV 1111----------------------------------------1111----11113333 
QEKADIIASTGGMIKRACEWDEWVVFTEREMVYRLRKLYPQKKFYPAREDAFCAITLKNI 1111-----------3333--------3333-------3333-----1111--------- YESLKDMKYKVEVPEEIARKARKAIERMLEM ------------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:O14933; PDB:1WZVA; ASMRVVKELEDLQKKPPPYLRNLSSDDANVLVWHALLLPDQPPYHLKAFNLRISFPPEYP ----------------1111--------1111-----------------------1111- FKPPMIKFTTKIYHPNVDENGQICLPIISSENWKPCTKTCQVLEALNVLVNRPNIREPLR -------------11111111---3333-----1111----------------3333--- MDLADLLTQNPELFRKNAEEFTLRFGVDRP ------------------------------ >PROBABLE ENDOGLUCANASE; SWP:P37696; PDB:1WZZA; APDAVAQQWAIFRAKYLRPSGRVVDTGNGGESHSEGQGYGLFAASAGDLASFQSWWARTN 1111-------------3333----3333------------------------------- LQHTNDKLFSWRFLKGHQPPVPDKNNATDGDLLIALALGRAGKRFQRPDYIQDAAIYGDV -------------2222-------------------------1111-------------- LNLTKAGPYVVLPGAVGFTKKDSVILNLSYYVPSLLQAFDLTADPRWRQVEDGIRLVSAG -----!!!!----------1111---1111------------------------------ RFGQWRLPPDWLAVNRATGALSIASGWPPRFSYDAIRVPLYFYWAHLAPNVLADFTRFWN --1111---------------------------3333-----1111-------------- NFGANALPGWVDLTTGARSPYNAPPGYLAVAECTGLDSAGELPTLDHAPDYYSAALTLLV --1111-------------------------------------3333------------- YIARAEETI -----1111 >SH3-CONTAINING GRB2-LIKE ; SWP:Q99962; PDB:1X03A; GTKLDDDFKEERKVDVTSRAVEITKTIEYLQPNPASRAKLSIPGYPQAEALLAEALKFGR ----3333------------------------11113333-------------------3 ELGDDCNFGPALGEVGEARELSEVKDSLDIEVKQNFIDPLQNLHDKDLREIQHHLKKLEG 333----3333------------------------------------------------- RRLDFDYKKKRQGKIPDEELRQALEKFDESKEIAESSFNLLEDIEQVSQLSALVQAQLEY -----------3333--------------------------------------------- HKQAVQILQQVTVRLEERIRQA ---------------------- >SH3-CONTAINING GRB2-LIKE ; SWP:Q99962; PDB:1X04A; GTKLDDDFKEERKVDVTSRAVEITKTIEYLAHLSSLLQAEALLAEALKFGRELGDDCNFG ----3333---------------------------------------3333--------- PALGEVGEARELSEVKDSLDIEVKQNFIDPLQNLHDKDLREIQHHLKKLEGRRLDFDYKK ------------------------------------------------------------ 
KRQGKIPDEELRQALEKFDESKEIAESSFNLLEDIEQVSQLSALVQAQLEYHKQAVQILQ -3333-3333-------------------------------------------------- QVTVRLEERIRQA ------------- >PLECKSTRIN; SWP:P08567; PDB:1X05A; GSSGSSGVILKEEFRGVIIKQGCLLKQGHRRKNWKVRKFILREDPAYLHYYDPAGAEDPL -----------1111------------3333--------------------3333----- GAIHLRGCVVTSVESNSNGRKSEEENLFEIITADEVHYFLQAATPKERTEWIKAIQMASR -------------------------------1111----------------------333 TGKSGPSSG 3-------- >ISOPULLULANASE; SWP:O00105; PDB:1X0CA; REFMAVTANNSQLLTWWHNTGEINTQTPVADGNVRQSGLYSVKVQTTPASSSLYYDSFVY ---------1111----------------1111--------------------------- LAIPGNGMSDQLQYTQGYNQTQAWTSFLYSHDATVKISRNGSSANSNVVIRPTSLNFPVR --2222-11111111------------------------1111--------3333----- YDNQSVYITVPYSPTGYRFSVEFDDDLISLAPSGARQPENALLIFASPFENSSTKPQPGS -%%%%-------1111------1111------------------------1111--2222 PNSIAPAPGRVLGLNTTSASTVVFNPGVYYFTGHDHMVLSSSVTWVYFAPGAYVKGAVEF ------------1111-----------------------1111-----2222-------- LSTASEVKASGHGVLSGEQYVWYADPDEGYQKASGANNNGLRMWRGTLGNSSQTFVLNGV ----------------11112222---%%%%--%%%%----------------------- TVSAPPFNSMDWSGNSLDLITCRVDDYKQVGAFYGQTDGLEMYPGTILQDVFYHTDDDGL ----------------1111----------------------2222-------------- KMYYSNVTARNIVMWKESVAPVVEFGWTPRNTENVLFDNVDVIHQAYANAGNNPGIFGAV ------------------------------------------------3333-------- NNYLYAPDGLSSNHSTGNSNMTVRNITWSNFRAEGSSSALFRINPIQNLDNISIKNVSIE -33331111--------1111--------------------------------------- SFEPLSINTTESWMPVWYDLNNGKQITVTDFSIEGFTVGNTTITASNAASVGRIDGVDPA ---3333------------------------------!!!!--3333----------333 YAGSVHYID 31111---- >ISCA; SWP:Q8DLM0; PDB:1X0GA; MVELTPAAIQELERLQILRIQVQPSECGDWRYDLALVAEPKPTDLLTQSQGWTIAIAAEA ---------1111---------------------------1111----%%%%----3333 AELLRGLRVDYIEDLMGGAFRFHNPNASQTCGCGMAFRVSRS 1111---------1111------1111---1111-------- >RAS GTPASE-ACTIVATING-LIK; SWP:P46940; PDB:1X0HA; GSSGSSGISLKYTAARLHEKGVLLEIEDLQVNQFKNVIFEISPTEEVGDFEVKAKFMGVQ 
-----------------------------3333--------------------------- METFMLHYQDLLQLQYEGVAVMKLFDRAKVNVNLLIFLLNKKFYGKSGPSSG -----------------------%%%%------------------------- >BROMODOMAIN-CONTAINING PR; SWP:P25440; PDB:1X0JA; GRVTNQLQYLHKVVKALWKHQFAWPFRQPVDAVKLGLPDYHKIIKQPDGTIKRRLENNYY ----------------1111--3333------11111111-------------------- WAASECQDFNTFTNCYIYNKPTDDIVLAQTLEKIFLQKVASPQEEQE -3333--------------11113333-------------------- >HOMOISOCITRATE DEHYDROGEN; SWP:Q8RQU4; PDB:1X0LA; AYRICLIEGDGIGHEVIPAARRVLEATGLPLEFVEAEAGWETFERRGTSVPEETVEKILS ----------3333----------3333----------------------3333---111 CHATLFGAATSPTRKVPGFFGAIRYLRRRLDLYANVRPAKSRPVPGSRPGVDLVIVRENT 1--------------2222--------1111------------2222------------3 EGLYVEQERRYLDVAIADAVISKKASERIGRAALRIAEGRPRKTLHIAHKANVLPLTQGL 333-------!!!!-------------------------3333------3333------- FLDTVKEVAKDFPLVNVQDIIVDNCAMQLVMRPERFDVIVTTNLLGDILSDLAAGLVGGL -----------1111----------------3333------------------------1 GLAPSGNIGDTTAVFEPVHGSAPDIAGKGIANPTAAILSAAMMLDYLGEKEAAKRVEKAV 111-----1111---------3333----------------------------------- DLVLERGPRTPDLGGDATTEAFTEAVVEALKSL ---------3333----------------1111 >AMINOTRANSFERASE II HOMOL; SWP:O57946; PDB:1X0MA; MLGDVERFFSKKALEMRASEVRELLKLVETSDIISLAGGLPNPKTFPKEIIRDILVEIME ---3333--3333-----3333-------------------3333--------------- KYADKALQYGTTKGFTPLRETLMKWLGKRYGISQDNDIMITSGSQQALDLIGRVFLNPGD --3333----3333------------------3333--------------------2222 IVVVEAPTYLAALQAFNFYEPQYIQIPLDDEGMKVEILEEKLKELKSQGKKVKVVYTVPT --------3333---3333--------------------------1111----------- FQNPAGVTMNEDRRKYLLELASEYDFIVVEDDPYGELRYSGNPEKKIKALDNEGRVIYLG ---------------------1111--------3333--------3333----------- TFSKILAPGFRIGWMVGDPGIIRKMEIAKQSTDLCTNVFGQVVAWRYVDGGYLEKHIPEI ------3333-------3333------------------------------3333----- RKFYKPRRDAMLEALEEFMPEGVKWTKPEGGMFIWVTLPDGIDSKKMLERAIKKGVAYVP ------------------------------------------3333-------------3 GEAFYAHRDVKNTMRLNFTYVDEDKIMEGIKRLAETIKEELKA 
333-1111-------------3333-------------1111- >HYPOTHETICAL PROTEIN TLL0; SWP:Q8DMN3; PDB:1X0PA; GLHRLIYLSCATDGLSYPDLRDIMAKSEVNNLRDGITGMLCYGNGMFLQTLEGDRQKVSE -----------2222---------------------------%%%%-------------- TYARILKDPRHHSAEIVEFKAIEERTFINWSMRLVQLGEMDSDTIRRLRLKYSPAATFQP -------1111------------------------3333----------1111-----33 RSMTAEQCFRFLKELYDMSQGS 33-------------------- >PROBABLE PEROXIREDOXIN; SWP:Q9Y9L0; PDB:1X0RA; PGSIPLIGERFPEEVTTDHGVIKLPDHYVSQGKWFVLFSHPADFTPVCTTEFVSFARRYE -----2222------------------3333----------------------------- DFQRLGVDLIGLSVDSVFSHIKWKEWIERHIGVRIPFPIIADPQGTVARRLGLLHAESAT ------------------------------------------------1111-------- HTVRGVFIVDARGVIRTLYYPELGRLVDEILRIVKALKLGDSLKRAVPADWPNNEIIGEG ---------1111----------------------------------2222-----!!!! LIVPPPTTEDQARARESGQYRSLDWWFCWDTPASRDDVEEARRYLRRAAEKPAKLLYEEA -----------------------1111---------------------------3333-- >RIBONUCLEASE P PROTEIN CO; SWP:O59248; PDB:1X0TA; IVKRRDWEKKEKKKIAIERIDTLFTLAERVARYSPDLAKRYVELALEIQKKAKVKIPRKW --------------------------------------------------------3333 KRRYCKRCHTFLIPGVNARVRLRTKRMPHVVITCLECGYIMRYPYL ---------------------------------------------- >hypothetical methylmalony; SWP:Q974R9; PDB:1X0UA; KPPVEKLIEELRQLKEKAYKGGGDERIQFQHSKGKLTARERLALLFDDGKFNEIMTFATT -------------------!!!!-------1111--------------------1111-- RATEFGLDKQRFYGDGVVTGWGKVDGRTVFAYAQDFTVLGGSLGETHANKIVRAYELALK ---iiii----2222--------iiii-------1111%%%%------------------ VGAPVVGINDSGGARIQEGALSLEGYGAVFKMNVMASGVIPQITIMAGPAAGGAVYSPAL --------------3333-----------------2222-----------------3333 TDFIIMIKGDAYYMFVTGPEITKVVLGEEVSFQDLGGAVVHATKSGVVHFMVDSEQEAIN --------1111------------------3333-------------------------- LTKRLLSYLPSNNMEEPPYIDTGDPADRDATGVEQIVPNDAAKPYNMREIIYKIVDNGEF -----1111----------------------3333----------3333------%%%%- LEVHKHWAQNIIVGFARIAGNVVGIVANNPEEFGGSIDIDAADKAARFIRFCDAFNIPLI ---11113333------iiii-------3333iiii----------------1111---- 
SLVDTPGYVPGTDQEYKGIIRHGAKMLYAFAEATVPKITVIVRKSYGGAHIAMSIKSLGA --------------------------------------------------11113333-- DLVYAWPTAEIAVTGPEGAVRILYRKEIQQASNPDDVLKQRIAEYRKLFANPYWAAEKGL -----1111--------------------------------------------------- VDDVIEPKDTRRVIVAGLEMLKTKREYRYPKKHGNIPL -----3333-----------1111-------------- >Glycerol-3-phosphate dehy; SWP:P21695; PDB:1X0VA; ASKKVCIVGSGNWGSAIAKIVGGNAAQLAQFDPRVTWVFEEDIGGKKLTEIINTQHENVK ---------------------------1111-----------%%%%-----------333 YLPGHKLPPNVVAVPDVVQAAEDADILIFVVPHQFIGKICDQLKGHLKANATGISLIKGV 3------1111----3333-1111-------3333-------2222-1111--------- DEGPNGLKLISEVIGERLGIPSVLGANIASEVADEKFCETTIGCKDPAQGQLLKELQTPN ---------------------------33331111------------------------- FRITVVQEVDTVEICGALKNVVAVGAGFCDGLGFGDNTKAAVIRLGLEIAFAKLFCSGPV -----------------------------1111--------------------------- SSATFLESCGVADLITTCYGGRNRKVAEAFARTGKSIEQLEKELLNGQKLQGPETARELY 3333--3333-----------3333-------------------iiii------------ SILQHKGLVDKFPLFAVYKVCYEGQPVGEFIHCLQNHPEH ---111111113333----------1111----------- >PYRROLIDONE-CARBOXYLATE P; SWP:O73944; PDB:1X12A; MKVLVTGFEPFGGEKINPTERIAKDLDGIKIGDAQVFGRVLPVVFGKAKEVLEKTLEEIK ----------iiii-----------2222-!!!!-------------------------- PDIAIHVGLAPGRSAISIERIAVNAIDARIPDNEGKKIEDEPIVPGAPTAYFSTLPIKKI ---------2222------------------1111--------2222------------- MKKLHERGIPAYISNSAGLYLSNYVMYLSLHHSATKGYPKMSGFIHVPYIPEQIIDKIGK ----1111-----------------------------------------33333333--- GQVPPSMSYEMDLEAVKVAIEVALEELL ---------------------------- >NAD(P) TRANSHYDROGENASE S; SWP:P07001; PDB:1X13A; GRIGIPRERLTNETRVAATPKTVEQLLKLGFTVAVESGAGQLASFDDKAFVQAGAEIVEG ---------2222-------------1111-----22223333--3333-1111----!! 
NSVWQSEIILKVNAPLDDEIALLNPGTTLVSFIWPAQNPELMQKLAERNVTVMAMDSVPR !!-------------111111112222------3333----------------1111--- ISRAQSLDALSSMANIAGYRAIVEAAHEFGRFFTGQITAAGKVPPAKVMVIGAGVAGLAA 33331111-----------------------------1111------------------- IGAANSLGAIVRAFDTRPEVKEQVQSMGAEFLELGDGYAKVMSDAFIKAEMELFAAQAKE ----1111--------3333----1111-----------------------------111 VDIIVTTALIPGKPAPKLITREMVDSMKAGSVIVDLAAQNGGNCEYTVPGEIFTTENGVK 1--------2222----------11112222-----3333---11112222---1111-- VIGYTDLPGRLPTQSSQLYGTNLVNLLKLLCKEKDGNITVDFDDVVIRGVTVIRAGEITW -----3333-------------------------------3333---------iiii--- PAPPIQVS -------- >CRTF-RELATED PROTEIN; SWP:NA; PDB:1X19A; MSNNLNYYHRANELVFKGLIEFSCMKAAIELDLFSHMAEGPKDLATLAADTGSVPPRLEM ------------------------------------3333-------------------- LLETLRQMRVINLEDGKWSLTEFADYMFSPTPKEPNLHQTPVAKAMAFLADDFYMGLSQA -------------%%%%----------------1111---------------3333---1 VRGQKNFKGQVPYPPVTREDNLYFEEIHRSNAKFAIQLLLEEAKLDGVKKMIDVGGGIGD 111------------------------1111-------------2222-------!!!!- ISAAMLKHFPELDSTILNLPGAIDLVNENAAEKGVADRMRGIAVDIYKESYPEADAVLFC ----33331111------1111--------11111111---------------------- RILYSANEQLSTIMCKKAFDAMRSGGRLLILDMVIDDPENPNFDYLSHYILGAGMPFSVL --1111------------33332222----------1111-3333------1111----- GFKEQARYKEILESLGYKDVTMVRKYDHLLVQAVKP ------------------------iiii-------- >2-DEOXY-D-GLUCONATE 3-DEH; SWP:Q53W82; PDB:1X1EA; MERKALVTGGSRGIGRAIAEALVARGYRVAIASRNPEEAAQSLGAVPLPTDLEKDDPKGL --------------------------------------------------1111-3333- VKRALEALGGLHVLVHAAAVNVRKPALELSYEEWRRVLYLHLDVAFLLAQAAAPHMAEAG ------------------------1111-------------------------------- WGRVLFIGSVTTFTAGGPVPIPAYTTAKTALLGLTRALAKEWARLGIRVNLLCPGYVETE --------1111---!!!!----------------------3333-------------33 FTLPLRQNPELYEPITARIPMGRWARPEEIARVAAVLCGDEAEYLTGQAVAVDGGFLAY 33-----3333-------3333---3333---------3333----------iiii--- >SIGNAL-TRANSDUCING ADAPTO; SWP:Q9ULZ2; PDB:1X1FA; 
GSSGSSGQERLKITALPLYFEGFLLIKRSGYREYEHYWTELRGTTLFFYTDKKSIIYVDK --------------------------------------------------3333------ LDIVDLTCLTEQNSTEKNCAKFTLVLPKEEVQLKTENTESGEEWRGFILTVTELSVPQNV ------------------------------------------------------------ SLLPGQVIKLHEVLEREKKRRIESGPSSG --1111-----------3333-------- >PLECKSTRIN 2; SWP:Q9NYT0; PDB:1X1GA; GSSGSSGSLSTVELSGTVVKQGYLAKQGHKRKNWKVRRFVLRKDPAFLHYYDPSKEENRP -----------1111------------------------------------3333----- VGGFSLRGSLVSALEDNGVPTGVKGNVQGNLFKVITKDDTHYYIQASSKAERAEWIEAIK -----------------------------------1111--------------------- KLTSGPSSG --------- >XANTHAN LYASE; SWP:Q9AQS0; PDB:1X1IA; SDEFDALRIKWATLLTGGPALDPADSDIAARTDKLAQDANDYWEDMDLSSSRTYIWYALR -----------------11111111-----------------1111--1111---3333- GNGTSDNVNAVYERLRTMALAATTVGSSLYGNADLKEDILDALDWLYVNSYNSTRSRSAY 1111---------------11112222-2222-------------------1111----- NWWHWQLGIPMSLNDIAVLLYDDISAARMATYMDTIDYFTPSIGLTGAARAWQAIVVGVR 3333---------------3333------------------------------------- AVIVKDAVKLAAARNGLSGTGIFPYATGGDGFYADGSFVQHTTFAYTGGYGSSVLETTAN -1111-------------2222----------1111------------------------ LMYLLSGSTWSVSDPNQSNVWQWIYEAYRPLLYKGAMMDMVRGREISRSYAQDHAVGHGI ----2222-------------------3333-iiii-3333------11113333----- VASIVRLAQFAPAPHAAAFKQIAKRVIQEDTFSSFYGDVSTDTIRLAKAIVDDPSIAPAA ---------------------------------3333------------1111------- APNLYKQYAAMDRAVLQRPGFALGLALYSTRISSYESINSENGRGWYTGAGATYLYNQDL -------3333------2222-------1111-----%%%%-----------------11 AQYSEDYWPTVDAYRIPGTTVASGTPIASGTGTSSWTGGVSLAGQYGASGMDLSYGAYNL 11-%%%%----11112222--2222----------------%%%%--------------- SARKSWFMFDDEIVALGSGISSTAGIPIETVVDNRKLNGAGDNAWTANGAALSTGLGVAQ -------------------------------------1111-----iiii---------- TLTGVNWVHLAGNTADGSDIGYYFPGGATLQTKREARTGTWKQINNRPATPSTAVTRNYE -----------------------2222------------3333---3333---------- TMWIDHGTNPSGASYGYVLLPNKTSAQVGAYAADPAIEIVVNTSGVQSVKEKTLGLVGAN ------------------------------3333--------3333-----1111----- 
FWTDTTQTADLITSNKKASVMTREIADERLEASVSDPTQANNGTIAIELARSAEGYSADP --------!!!!------------2222-------3333--------------------- GITVTQLAPTIKFTVNVNGAKGKSFHASFQLG ----------------2222------------ >UBIQUITIN-LIKE PROTEIN SB; SWP:Q91W67; PDB:1X1MA; GSSGSSGMSLSDWHLAVKLADQPLAPKSILQLPETELGEYSLGGYSISFLKQLIAGKLQE ------------------3333-----------------------3333---3333-333 SVPDPELIDLIYCGRKLKDDQTLDFYGIQPGSTVHVLRKSWSGPSSG 3--3333----iiii------3333---------------------- >4-ALPHA-GLUCANOTRANSFERAS; SWP:Q06801; PDB:1X1NA; VPAVGEDFPIDYADWLPKRDPNDRRRAGILLHPTSFPGPYGIGDLGPQAFKFLDWLHLAG --2222--1111-------3333---------1111---------3333----------- CSLWQVLPLVPPGKRGNEDGSPYSGQDANCGNTLLISLEELVDDGLLKMEELPEPLPTDR -----------------2222----------3333------3333--3333--------- VNYSTISEIKDPLITKAAKRLLSSEGELKDQLENFRRDPNISSWLEDAAYFAAIDNSVNT -3333-3333-----------------------------3333----------------- ISWYDWPEPLKNRHLAALEEVYQSEKDFIDIFIAQQFLFQRQWKKVRDYARSKGISIMGD -3333----1111-------------------------------------1111------ MPIYVGYHSADVWANKKQFLLNRKGFPLIVSGVPPDAFSETGQLWGSPLYDWKAMEKDGF --------3333--1111---1111------------------------------1111- SWWVRRIQRATDLFDEFRIDHFRGFAGFWAVPSEEKIAILGRWKVGPGKPLFDAILQAVG -------------------------------3333------------------------- KINIIAEDLGVITEDVVQLRKSIEAPGMAVLQFAFGSDAENPHLPHNHEQNQVVYTGTHD ------------3333----1111-----3333----1111--3333----------111 NDTIRGWWDTLPQEEKSNVLKYLSNIEEEEISRGLIEGAVSSVARIAIIPMQDVLGLGSD 1------1111---------------3333--------1111-------3333----333 SRMNIPATQFGNWSWRIPSSTSFDNLDAEAKKLRDILATYGRL 3---1111---------33333333------------------ >NICOTINATE-NUCLEOTIDE PYR; SWP:Q5SJM3; PDB:1X1OA; WQGGLEEALRAWLREDLGQGDLTSLLVVPEDLEGEAVILAKEGGVLAGLWVAERVFALAD ----------------!!!!--------1111---------------------------1 PRTAFTPLVAEGARVAEGTEVARVRGPLRGILAGERLALNLLQRLSGIATLTRAYVEALA 111------2222--2222-------------------------------------1111 GTKAQILDTRKTTPGLRALEKYAVRVGGGRNHRYGLFDGILLKENHVRAAGGVGEAVRRA ------------2222-------------------------------------------- 
KARAPHYLKVEVEVRSLEELEEALEAGADLILLDNFPLEALREAVRRVGGRVPLEASGNM ----1111---------------1111--------------------iiii--------- TLERAKAAAEAGVDYVSVGALTHSAKALDLSLLVVRP --------3333-----3333---------------- >RIBONUCLEASE HII; SWP:O74035; PDB:1X1PA; MKIAGIDEAGRGPVIGPMVIAAVVVDENSLPKLEELKVRDSKKLTPKRREKLFNEILGVL -------------------------3333-------333333333333------------ DDYVILELPPDVIGSREGTLNEFEVENFAKALNSLKVKPDVIYADAADVDEERFARELGE --------------------1111-------1111---------------3333---111 RLNFEAEVVAKHKADDIFPVVSAASILAKVTRDRAVEKLKEEYGEIGSGYPSDPRTRAFL 1-----------3333----------------------3333-------33333333--- ENYYREHGEFPPIVRKGAGAIIGLAVGGVVIA ---1111---1111------------------ >TRYPTOPHAN SYNTHASE BETA ; SWP:P16609; PDB:1X1QA; LTLPDFPLPDARGRFGPYGGRYVPETLIPALEELEAAYREAKKDPAFLEELDHYLRQFAG ---------1111-!!!!-----3333--------------------------------- RPTPLYHAKRLSEYWGGAQVFLKREDLLHTGAHKINNTLGQALLARRGKRRVIAETGAGQ -----------------------3333--------------------------------- HGVSVATVAALFGLECVVYGEEDVRRQALNVFRKLLGAEVRPVAAGSRTLKDATNEAIRD ----------------------3333-------3333-------!!!!3333-------- WITNVRTTFYILGSVVGPHPYPVRDFQSVIGEEVKRQSLELFGRLPDALIAAVGGGSNAI ----1111----------------3333-------------------------------- GLFAPFAYLPEGRPKLIGVEAAGEGLSTGRHAASIGAGKRGVLHGSYYLLYDYPGVGPEH --1111----------------2222----------------%%%%-------------- SYYADAGVAEYASVTDEEALEGFKLLARLEGIIPALESAHAIAYAAKVVPEDKDQVVVIN -----------------------------------3333--------33331111----- LSGRGDKDVTEVRLLGG ----33333333----- >RAS-RELATED PROTEIN M-RAS; SWP:O08989; PDB:1X1RA; NLPTYKLVVVGDGGVGKSALTIQFFQKIFVPDYDPTIEDSYLKHTEIDNQWAILDVLDTA -----------2222------------------1111---------%%%%---------- GQEEFSAMREQYMRTGDGFLIVYSVTDKASFEHVDRFHQLILRVKDRESFPMILVANKVD -1111------------------1111---------------1111-----------333 LMHLRKVTRDQGKEMATKYNIPYIETSAKDPPLNVDKTFHDLVRVIRQQ 31111-------------------------------------------- >D(-)-3-HYDROXYBUTYRATE DE; SWP:Q5KST5; PDB:1X1TA; MLKGKVAVVTGSTSGIGLGIATALAAQGADIVLNGFGDAAEIEKVRAGLAAQHGVKVLYD 
-2222-------------------1111-------------------------------- GADLSKGEAVRGLVDNAVRQMGRIDILVNNAGIQHTALIEDFPTEKWDAILALNLSAVFH -------------------------------------3333------------------- GTAAALPHMKKQGFGRIINIASAHGLVASANKSAYVAAKHGVVGFTKVTALETAGQGITA ---------------------1111---------------------------2222---- NAICPGWVRTLLSEKQPSLQFVTPEQLGGTAVFLASDAAAQITGTTVSVDGGWTAR ----------3333-3333----------------3333----------iiii--- >LECTIN; SWP:Q8L5H4; PDB:1X1VA; AIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDAVDVTFTYYGKTETRHFGGSGGTPH ------------------------------------------iiii-------------- EIVLQEGEYLVGMKGEFGNYHGVVVVGKLGFSTNKKSYGPFGNTGGTPFSLPIAAGKISG -------------------iiii------------------------------------- FFGRGGDFIDAIGVYLEP ------------------ >OROTIDINE 5'-PHOSPHATE DE; SWP:O26232; PDB:1X1ZA; VMNRLILAMDLMNRDDALRVTGEVREYIDTVKIGYPLVLSEGMDIIAEFRKRFGCRIIAD 2222-------------------3333------3333----------------------- FKVADIPETNEKICRATFKAGADAIIVHGFPGADSVRACLNVAEEMGREVFLLTEMSHPG ----------------------------3333-------------------------333 AEMFIQGAADEIARMGVDLGVKNYVGPSTRPERLSRLREIIGQDSFLISPGVGAQGGDPG 3--3333---------1111------3333-----------3333-------1111-333 ETLRFADAIIVGRSIYLADNPAAAAAGIIESIKDL 3----------3333-------------------- >MORICIN; SWP:Q7YZB4; PDB:1X22A; GKIPVKAIKKAGAAIGKGLRAINIASTAHDVYSFFKPKHKKK ---1111----------------------------------- >HYPOTHETICAL UPF0076 PROT; SWP:Q973T6; PDB:1X25A; HMETVFTEKAPKPVGPYSQAIKVGNTLYVSGQIPIDPRTNEIVKGDIKVQTRQVLDNIKE ------1111------------!!!!---------------------------------- IVKAAGFSLSDVAMAFVFLKDMNMFNDFNSVYAEYFKDKPPARVTVEVSRLPKDALIEIA --1111-3333---------3333-------3333---------------2222------ VICSK ----- >LIPOATE-PROTEIN LIGASE A; SWP:P32099; PDB:1X2GA; STLRLLISDSYDPWFNLAVEECIFRQPATQRVLFLWRNADTVVIGRAQNPWKECNTRREE -----------3333-----------------------------11113333--333311 DNVRLARRSSGGGAVFHDLGNTCFTFAGKPEYDKTISTSIVLNALNALGVSAEASGRNDL 11---------------1111-----------3333---------1111----------- 
VVKTVEGDRKVSGSAYRETKDRGFHHGTLLLNADLSRLANYLNPDKKKLAAKGRVTNLTE ---1111-----------1111-------------3333-----------------3333 LLPGITHEQVCEAITEAFFAHYGERVEAEIISPNKTPDLPNFAETFARQSSWEWNFGQAP -1111--------------------------3333---2222--------3333------ AFSHLLDERFTWGGVELHFDVEKGHITRAQVFTDSLNPAPLEALAGRLQGCLYRADLQQE ---------1111--------iiii----------------------2222--------- CEALLVDFPEQEKELRELSAWAGAVR ---33331111----------1111- >HEF HELICASE/NUCLEASE; SWP:Q8TZH8; PDB:1X2IA; ALTLAERQRLIVEGLPHVSATLARRLLKHFGSVERVFTASVAELMKVEGIGEKIAKEIRR --------------2222---------------------3333---2222---------- VITAPYIE -------- >KELCH-LIKE ECH-ASSOCIATED; SWP:Q9Z2X8; PDB:1X2JA; VGRLIYTAGGYFRQSLSYLEAYNPSNGSWLRLADLQVPRSGLAGCVVGGLLYAVGGRNNS ----------------------------------------------iiii---------1 PDGNTDSSALDCYNPMTNQWSPCASMSVPRNRIGVGVIDGHIYAVGGSHGCIHHSSVERY 111----------3333--------------------iiii-------!!!!-------- EPERDEWHLVAPMLTRRIGVGVAVLNRLLYAVGGFDGTNRLNSAECYYPERNEWRMITPM -1111-------------------%%%%-------------------3333--------- NTIRSGAGVCVLHNCIYAAGGYDGQDQLNSVERYDVETETWTFVAPMRHHRSALGITVHQ -----------!!!!-------------------------------------------%% GKIYVLGGYDGHTFLDSVECYDPDSDTWSEVTRMTSGRSGVGVAVTMEPC %%--------------------1111------------------------ >HOMEOBOX PROTEIN CUX-2; SWP:O14529; PDB:1X2LA; GSSGSSGAGPGAEEEQLDTAEIAFQVKEQLLKHNIGQRVFGHYVLGLSQGSVSEILARPK ------------------------------------------------------------ PWRKLTVKGKEPFIKMKQFLSDEQNVLALRTIQVRSGPSSG 3333-3333-------------------------------- >LAG1 LONGEVITY ASSURANCE ; SWP:Q8C172; PDB:1X2MA; GSSGSSGTAQPNAILEKVFTAITKHPDEKRLEGLSKQLDWDVRSIQRWFRQRRNQEKPSG -----------------------------------3333-33333333----3333---- PSSG ---- >HOMEOBOX PROTEIN PKNOX1; SWP:P55347; PDB:1X2NA; GSSGSSGKNKRGVLPKHATNVMRSWLFQHIGHPYPTEDEKKQIAAQTNLTLLQVNNWFIN ----------------------------3333---3333-----1111-3333------- ARRRILQSGPSSG ------------- >PROTEIN ARGININE N-METHYL; SWP:P55345; PDB:1X2PA; GSSGSSGEEFVAIADYAATDETQLSFLRGEKILILRQTTADWWWGERAGCCGYIPANHVG 
----------------------------------------------2222---------- KHSGPSSG -------- >SIGNAL TRANSDUCING ADAPTE; SWP:O75886; PDB:1X2QA; GSSGSSGSEIQLNNKVARKVRALYDFEAVEDNELTFKHGEIIIVLDDSDANWWKGENHRG --------------------------------------------------------3333 IGLFPSNFVTTNLNIETEAAAVSGPSSG ----1111-------------------- >SARCOSINE OXIDASE ALPHA S; SWP:Q50LF0; PDB:1X31A; SKPQRLSAAQTAGARINRDEALTLTVDGQQLSAFRGDTVASAMLANGLRSCGNSMYLDRP ------33332222--1111-----%%%%------------------------------- RGIFSAGVEEPNALITVGARHQADINESMLPATTVSVTDGLNATLLSGLGVLDPSEDPAY ------1111--------------------1111---2222------------------- YDHVHVHTDVLVVGAGPAGLAAAREASRSGARVMLLDERPEAGGTLREASGEQIDGIDAA --------------------------1111------------!!!!-------iiii--- QWIDAVTEELAAAEETTHLQRTTVFGSYDANYILAAQRRTVHLDGPSGQGVSRERIWHIR ----------------------------%%%%-------1111----------------- AKQVVLATAAHERPIVFENNDRPGIMLAGSVRSYLNRFGVRAGSKIAVATTNDSVYPLVS ----------------------------------------------------1111---- ELAASGGVVAVIDARQNISAAAAQAVTDGVTVLTGSVVANTEADASGELSAVLVATLDEQ --1111------------3333---------------------1111----------111 RNLGEAQRFEADVLAVSGGFNPVVHLHSQRQGKLNWDTSIHAFVPADAVANQHLAGALTG 1----------------------33331111------------------------3333- LLDTASALSTGAATGAAAASAAGFEKIAEVPQALAVPAGETRPVWLVPSLSGDDAVHYKF -------------------1111-------------------------1111-3333--- HFVDLQRDQTVADVLRATGAGMQSVEHIKRYTSISTANDQGKTSGVAAIGVIAAVLGIEN ----1111---------1111-----------222211111111---------------- PAQIGTTTFRAPYTPVSFAALAGRTRGELLDPARLTAMHPWHLAHGAKFEDVGQWKRPWY 3333-----------------!!!!!!!!------1111------------!!!!----- YPQDGESMDEAVYRECKAVRDSVGMLDASTLGKIEIRGKDAAEFLNRMYTNGYTKLKVGM --%%%%---------------------1111------1111-----------11112222 GRYGVMCKADGMIFDDGVTLRLAEDRFLMHTTTGGAADVLDWLEEWLQTEWPELDVTCTS -------1111-----------1111------------------------1111------ VTEQLATVAVVGPRSRDVIAKLASSLDVSNDAFKFMAFQDVTLDSGIEARISRISFSGEL 3333-------1111-------1111--3333-2222-----3333-------------- 
AFEIAIPAWHGLQVWEDVYAAGQEFNITPYGTETMHVLRAEKGFIIVGQDTDGTVTPQDA ------1111-----------3333--------------1111--2222------3333- GMEWVVSKLKDFVGKRSFSREDNVREDRKHLVSVLPVDSSLRLAEGAALVAADAVASEGV -3333------222233333333--------------1111---------1111--iiii TPMEGWVTHAYNSPALGRTFGLALIKNGRNRIGEVLKTPVDGQLVDVQVSDLVLFDPEGS -------------1111---------33332222-----iiii------------1111- RRD --- >Subunit beta of sarcosine; SWP:Q50LF2; PDB:1X31B; ADLLPEHPEFLWNNPEPKKSYDVVIVGGGGHGLATAYYLAKNHGITNVAVLEKGWLAGGN ------------------------------------------------------222233 MARNTTIIRSNYLWDESAGIYEKSLKLWEELPEELEYDFLFSQRGVLNLAHTLGDVRESI 33---------------------------------------------------------- RRVEANKFNGVDAEWLTPEQVKEVCPIINTGDNIRYPVMGATYQPRAGIAKHDHVAWAFA ------1111--------------1111---------------1111------------- RKANEMGVDIIQNCEVTGFLKDGEKVTGVKTTRGTILAGKVALAGAGHSSVLAELAGFEL --------------------------------------------!!!!------------ PIQSHPLQALVSELFEPVHPTVVMSNHIHVYVSQAHKGELVMGAGIDSYNGYGQRGAFHV -------------------------1111-----3333------------------3333 IEEQMAAAVELFPIFARAHVLRTWGGIVDTTMDASPIISKTPIQNLYVNCGWGTGGFKGT -----------3333---------------1111--------2222-----!!!!3333- PGAGYTLAHTIAHDEPHKLNAPFALERFETGHLIDEHGAAAV ----------------3333---3333---------3333-- >Subunit gamma of sarcosin; SWP:Q50LE9; PDB:1X31C; QLRRSPAAHLAAAMEAAEVAGERAVTLREVAFTTQLGLRAVPGSTGHAALAAATGVGLPA ----1111------1111----------------------2222-----1111------- AVGEVAGDVSGTAVLWLGPDEFLLAAEENPALLDTLQGALGQEPGQVLDLSANRSVLQLE 2222---3333-----------------3333-------!!!!------1111------- GPAAALVLRKSCPADLHPREFGVNRAITTSLANIPVLLWRTGEQSWRILPRASFTEHTVH 11113333--------3333----------%%%%----------------3333------ WLIDAMSEFSAAEVA -----3333------ >Subunit delta of sarcosin; SWP:Q50LF1; PDB:1X31D; MMLIECPNCGPRNENEFKYGGEAHVAYPEDPNALSDKEWSRYLFYRGNKKGIFAERWVHS ------------1111------------------------------------------11 GGCRKWFNALRDTVSYEFKAVYRAGEARPQL 11--------------------2222----- >chloroplast signal recogn; SWP:O22265; PDB:1X32A; 
GSGEVNKIIGSRTAGEGAMEYLIEWKDGHSPSWVPSSYIAADVVSEY ------------------------%%%%------------------- >COAT PROTEIN; SWP:Q9EB06; PDB:1X36A; RGGITVLTHSELSAEIGVTDSIVVSSELVMPYTVGTWLRGVAANWSKYSWLSVRYTYIPS -----------------------------3333--------1111--------------- CPSSTAGSIHMGFQYDMADTVPVSVNQLSNLRGYVSGQVWSGSAGLCFINGTRCSDTSTA -1111----------3333----3333---2222---11113333-----------1111 ISTTLDVSKLGKKWYPYKTSADYATAVGVDVNIATPLVPARLVIALLDGSSSTAVAAGRI -----3333----------------3333----1111----------------------- YCTYTIQMIEPTASALNN ------------3333-- >ATP-DEPENDENT PROTEASE LA; SWP:P37945; PDB:1X37A; AGYTEIEKLEIVKDHLLPKQIKEHGLKKSNLQLRDQAILDIIRYYTREAGVRSLERQLAA ------------------33333333---------------------------------- ICRKAAKAIVAEERKRITVTEKNLQDFIGKRIFRY ----------------------3333--------- >BETA-D-GLUCAN EXOHYDROLAS; SWP:Q9XEI3; PDB:1X38A; DYVLYKDATKPVEDRVADLLGRMTLAEKIGQMTQIERLVATPDVLRDNFIGSLLSGGGSV --33331111---------1111------------3333------1111------2222- PRKGATAKEWQDMVDGFQKACMSTRLGIPMIYGIDAVHGQNNVYGATIFPHNVGLGATRD -2222------------------1111------------1111------------3333- PYLVKRIGEATALEVRATGIQYAFAPCIAVCRDPRWGRCYESYSEDRRIVQSMTELIPGL --------------3333--------------3333--1111---3333----------- QGDVPKDFTSGMPFVAGKNKVAACAKHFVGDGGTVDGINENNTIINREGLMNIHMPAYKN ----11112222----1111---------111122222222-------------3333-- AMDKGVSTVMISYSSWNGVKMHANQDLVTGYLKDTLKFKGFVISDWEGIDRITTPAGSDY ---------------iiii3333---------------------22221111--2222-- SYSVKASILAGLDMIMVPNKYQQFISILTGHVNGGVIPMSRIDDAVTRILRVKFTMGLFE -------------------------------------3333---------------3333 NPYADPAMAEQLGKQEHRDLAREAARKSLVLLKNGKTSTDAPLLPLPKKAPKILVAGSHA ----33333333------------------------1111----------------1111 DNLGYQCGGWTIEWQGDTGRTTVGTTILEAVKAAVDPSTVVVFAENPDAEFVKSGGFSYA ------------1111-------------------1111--------------------- IVAVGEHPYTETKGDNLNLTIPEPGLSTVQAVCGGVRCATVLISGRPVVVQPLLAASDAL ----------3333-----------------3333------------------------- VAAWLPGSEGQGVTDALFGDFGFTGRLPRTWFKSVDQLPMNVGDAHYDPLFRLGYGLTTN 
---------------1111--------------3333---2222-------2222----- AT -- >SYNAPSE ASSOCIATED PROTEI; SWP:Q96A49; PDB:1X3AA; GSSGSSGTNDEETIQQQILALSADKRNFLRDPPAGVQFNFDFDQMYPVALVMLQEDELLS ----------3333-----3333-3333-------------33333333---33333333 KMRFALVPKLVKEEVFWRNYFYRVSLIKQSAQLTSGPSSG ---1111--------------------------------- >Transforming growth facto; SWP:Q15582; PDB:1X3BA; GSSGSSGMGTVMDVLKGDNRFSMLVAAIQSAGLTETLNREGVYTVFAPTNEAFRALPPRE ---------3333----33333333-------3333--------------3333------ RSRLLGDAKELANILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVVSVNKEPVA -------3333------------------------------------iiii--%%%%--- EPDIMATNGVVHVITNVLQPSGPSSG -------------------------- >ZINC FINGER PROTEIN 292; SWP:O60281; PDB:1X3CA; GSSGSSGRKKPVSQSLEFPTRYSPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAF ----------------------------------------3333--3333---------- SAEVEEESGPSSG ------------- >Fibronectin type-III doma; SWP:Q9Y2H6; PDB:1X3DA; GSSGSSGAEIFTTLSCEPDIPNPPRIANRTKNSLTLQWKAPSDNGSKIQNFVLEWDEGKG -------------------------------------------------------%%%%- NGEFCQCYMGSQKQFKITKLSPAMGCKFRLSARNDYGTSGFSEEVLYYTSGCSGPSSG ---------------------------------------------------------- >SINGLE-STRAND BINDING PRO; SWP:Q9AFI5; PDB:1X3EA; GDTTITVVGNLTADPELRFTPSGAAVANFTVASTPRMEWKDGEALFLRCNIWREAAENVA -------------------3333------------------------------------- ESLTRGSRVIVTGRLKQRSFETREKRTVVEVEVDEIGPSLRYATAKVNKA ---2222------------------------------------------- >LEUPAXIN; SWP:O60711; PDB:1X3HA; GSSGSSGKDFLAMFSPKCGGCNRPVLENYLSAMDTVWHPECFVCGDCFTSFSTGSFFELD -------------------------------%%%%-----------------------%% GRPFCELHYHHRRGSGPSSG %%------------------ >HEMOGLOBIN COMPONENT V; SWP:Q7M422; PDB:1X3KA; AFVGLSDSEEKLVRDAWAPIHGDLQGTANTVFYNYLKKYPSNQDKFETLKGHPLDEVKDT ------------------3333----------------333333331111--33331111 ANFKLIAGRIFTIFDNCVKNVGNDKGFQKVIADMSGPHVARPITHGSYNDLRGVIYDSMH -----------------1111-------------3333------------------3333 LDSTHGAAWNKMMDNFFYVFYECLDGRCSQFS ----------------------1111-3333- 
>HYPOTHETICAL PROTEIN PH04; SWP:O58231; PDB:1X3LA; MIAMDIREIGLRLVGEAIKAADPYRAVLNAVKVSDDKIIVQGKEFEIKGKVYVIALGKAA --------------------------------------------------------1111 CEMARAIEDILDVEDGVAVTKYGYGKELKRIKVIEAGHPIPDEKSILGAKEALSILNRAR ------1111----------2222-----------------------------------1 ENDIVFILISGGGSALFELPEEGISLEDLKLTTDLLLKSGAKIHEINTVRKHISKVKGGK 111------2222-------2222------------1111-3333-----------iiii LAKMIKGTGIVLIISDVVGDNLEAIASGPTVKDPTTFEDAKRILELYDIWEKVPESVRLH -1111-----------22223333%%%%----------------1111-----3333--- IERGLRGEVEETLKEDLPNVHNFLIASNSISCEAIAREAQRLGFKAYIMTTTLEGEAKDA ---1111---------1111-------------------1111----------------- GLFIGSIVQEIAERGRPFEPPVVLVFGGETTVTIEGKGGKGGPNQEIALSATRKISDLEA ---------------------------------------------------3333----- LIVAFDTDGTDGPTDAAGGIVDGTTYKKLREKGIDVEKVLKEHNSYEALKKVGGLLFTGP -----1111--------------------1111--------------------------- TGTNVNSIVIAIVTSK ---------------- >PROPIONATE KINASE; SWP:O06961; PDB:1X3MA; FPVVLVINCGSSSIKFSVLDVATCDVLMAGIADGMNTENAFLSINGDKPINLAHSNYEDA ---------------------------------2222------iiii------------- LKAIAFELEKRDLTDSVALIGHRIAHGGELFTQSVIITDEIIDNIRRVSPLAPLHNYANL -------1111-3333----------!!!!-----------------3333--------- SGIDAARHLFPAVRQVAVFDTSFHQTLAPEAYLYGLPWEYFSSLGVRRYGFHGTSHRYVS ---------1111-------3333---3333-----3333-------------------- RRAYELLDLDEKDSGLIVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFG ----1111-3333-----------------iiii-------------------------- AMAWIAKETGQTLSDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERARLAIKTFVHR ------------------------------------------------------------ IARHIAGHAASLHRLDGIIFTGGIGENSVLIRQLVIEHLGVLGLTLDVEMNKQPNSHGER --------1111---------3333-------------3333-----3333--------- IISANPSQVICAVIPTNEEKMIALDAIHLGNVKA ---3333---------------------1111-- >ACYL CARRIER PROTEIN; SWP:Q5SL79; PDB:1X3OA; MTEQEIFEKVKAVIADKLQVEPEKVTLEARFIEDLGADSLDTVELIMGLEDEFGLEISDE ---------------1111-3333-1111----------------------------333 EAEKIRTVKDAVEYIKAKLG 31111--------------- 
>CPSRP43; SWP:O22265; PDB:1X3PA; AVAESVIGKRVGDDGKTIEYLVKWTDMSDATWEPQDNVDSTLVLLYQQQQPMNE ------------------------------------------33333333---- >CPSRP43; SWP:O22265; PDB:1X3QA; GSQVFEYAEVDEIVEKRGKGKDVEYLVRWKDGGDCEWVKGVHVAEDVAKDYEDGLEY -------------------------------------------111133333333-- >RAS-RELATED PROTEIN RAB-1; SWP:Q9NP72; PDB:1X3SA; DEDVLTTLKILIIGESGVGKSSLLLRFTDDTFDPELAATIGVDFKVKTISVDGNKAKLAI 1111----------2222--------------1111--------------iiii------ WDTAGQERFRTLTPSYYRGAQGVILVYDVTRRDTFVKLDNWLNELETYCTRNDIVNLVGN -----3333--3333-2222-------1111-------------1111------------ KIDKENREVDRNEGLKFARKHSLFIEASAKTCDGVQCAFEELVEKIIQTPGLWES 3333-----3333-----1111--------------------------3333--- >TRANSCRIPTIONAL REGULATOR; SWP:P10958; PDB:1X3UA; GSHMDDANDIRARLQTLSERERQVLSAVVAGLPNKSIAYDLDISPRTVEVHRANVMAKMK -----------------3333--------------------------------------- AKSLPHLVRMALAGGFGPS --3333------------- >CYTOCHROME B5; SWP:Q17091; PDB:1X3XA; CGDKKYTKEEVAKHNTQNDLWIIYDGEVHDMTSFYKEHPGGKVILNKAGQDATSVLKTLA ------33333333-1111----iiii---33331111--------2222---------- PHVKAADVVMKKLKQTCIGKVK ---------------------- >PEPTIDE:N-GLYCANASE; SWP:Q02890; PDB:1X3ZA; NNIDFDSIAKMLLIKYKDFILSKFKKAAPVENIRFQNLVHTNQFAQGVLGQSQHLCTVYD ----3333-------------1111----------------------------3333--- NPSWHSIVLETLDLDLIYKNVDKEFAKDGHAEGENIYTDYLVKELLRYFKQDFFKWCNKP ------------------------3333-------------------------------- DCNHCGQNTSENMTPLGSQGPNGEESKFNCGTVEIYKCNRCGNITRFPRYNDPIKLLETR -3333-----------------1111---------------------------------- KGRCGEWCNLFTLILKSFGLDVRYVWNREDHVWCEYFSNFLNRWVHVDSCEQSFDQPYIY ---------------1111-------------------1111-------------3333- SINWNKKMSYCIAFGKDGVVDVSKRYILQNELPRDQIKEEDLKFLCQFITKRLRYSLNDD -1111---------1111---3333--------------------------1111--111 EIYQLACRDEQEQIELIRGK 1--------------3333- >UV excision repair protei; SWP:P32628; PDB:1X3ZB; GSIGLTVEDLLSLRQVVSGNPEALAPLLENISARYPQLREHIMANPEVFVSMLLEAV -----3333-----3333-3333------3333--3333--1111-------3333- >ARAP2; SWP:NA; 
PDB:1X40A; GSSGSSGMSSVSEVNVDIKDFLMSINLEQYLLHFHESGFTTVKDCAAINDSLLQKIGISP ----------------3333--11113333----------3333----3333-------- TGHRRRILKQLQIILSKMQDIPIYASGPSSG 3333---------3333-------------- >TRANSCRIPTIONAL ADAPTOR 2; SWP:O75478; PDB:1X41A; GSSGSSGDPSWTAQEEMALLEAVMDCGFGNWQDVANQMCTKTKEECEKHYMKYFSGPSSG --------------------------2222-3333--33333333--------------- >HYPOTHETICAL PROTEIN PH04; SWP:O58216; PDB:1X42A; IRAVFFDFVGTLLSVEGEAKTHLKIEEVLGDYPLNPKTLLDEYEKLTREAFSNYAGKPYR -------2222-----------------!!!!--3333---------------------- PIRDIEEEVRKLAEKYGFKYPENFWEIHLRHQRYGELYPEVVEVLKSLKGKYHVGITDSD --------------------1111-------------1111------------------3 TEYLAHLDALGIKDLFDSITTSEEAGFFKPHPRIFELALKKAGVKGEEAVYVGDNPVKDC 333--------1111-----3333--------------------3333------------ GGSKNLGTSILLDRKGEKREFWDKCDFIVSDLREVIKIVDELN ---1111-----1111-33331111------------------ >SH3 DOMAIN GRB2-LIKE PROT; SWP:Q9JK48; PDB:1X43A; GSSGSSGLNDLKESSNNRKARVLYDYDAANSTELSLLADEVITVFSVVGMDSDWLMGERG -----------------------------3333-----------------1111------ NQKGKVPITYLELLNSGPSSG ------3333----------- >MYOSIN-BINDING PROTEIN C,; SWP:Q00872; PDB:1X44A; GSSGSSGIMVTKQLEDTTAYCGERVELECEVSEDDANVKWFKNGEEIIPGPKSRYRIRVE -------------------2222------------------iiii--------------- GKKHILIIEGATKADAAEYSVMTTGGQSSAKLSVDLKSGPSSG -----------3333---------------------------- >amyloid beta (A4) precurs; SWP:Q02410; PDB:1X45A; GSSGSSGDVFIEKQKGEILGVVIVESGWGSILPTVIIANMMHGGPAEKSGKLNIGDQIMS --------------------------------------------3333------------ INGTSLVGLPLSTCQSIIKGLKNQSRVKLNIVSGPSSG iiii-----3333--1111-1111-------------- >HEMOGLOBIN COMPONENT VII; SWP:Q7M421; PDB:1X46A; DPTWVDMEAGDIALVKSSWAQIHDKEVDILYNFFKSYPASQAKFSAFAGKDLESLKDTAP ---------------------1111-----------3333------222233332222-- FALHATRIVSVINEAIALMGVAENRPALKNVLKQQGINHKGRGVTAAHFEEFETALEAFL ---------------1111-3333-------------3333------------------- ESHASGYNAGTKKAWDSAFNNMYSVVFPEL ---2222------------------3333- >DGCR8 PROTEIN; SWP:Q9NRW2; PDB:1X47A; 
GSSGSSGEFVINPNGKSEVCILHEYMQRVLKVRPVYNFFECENPSEPFGASVTIDGVTYG ----------------3333---------------------------------------- SGTASSKKLAKNKAARATLEILIPDFVKQTSESGPSSG -------------------------------------- >Interferon-induced, doubl; SWP:Q03963; PDB:1X48A; GSSGSSGYIGLVNSFAQKKKLSVNYEQCEPNSELPQRFICKCKIGQTMYGTGSGVTKQEA ----------------1111-----------------------%%%%------------- KQLAAKEAYQKLLKSPPKTAGTSGPSSG -----------------------3333- >Interferon-induced, doubl; SWP:Q03963; PDB:1X49A; GSSGSSGMASDTPGFYMDKLNKYRQMHGVAITYKELSTSGPPHDRRFTFQVLIDEKEFPE --------------3333----------------------------------%%%%---- AKGRSKQEARNAAAKLAVDILDNENKVDCHTSGPSSG -------------------3333-------------- >splicing factor, arginine; SWP:NA; PDB:1X4AA; GSSGSSGMSGGGVIRGPAGNNDCRIYVGNLPPDIRTKDIEDVFYKYGAIRDIDLKNRRGG -------------------------------------------1111------------- PPFAFVEFEDPRDAEDAVYGRDGYDYDGYRLRVEFPRSGRGTGSGPSSG ---------3333------------iiii-------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:P22626; PDB:1X4BA; GSSGSSGMEKTLETVPLERKKREKEQFRKLFIGGLSFETTEESLRNYYEQWGKLTDCVVM ------------------------3333-------------------------------- RDPASKRSRGFGFVTFSSMAEVDAAMAARPHSIDGRVVEPKRAVAREESGSGPSSG --------------------------------iiii-------------------- >SPLICING FACTOR, ARGININE; SWP:Q6PDM2; PDB:1X4CA; GSSGSSGGPPSRRSENRVVVSGLPPSGSWQDLKDHMREAGDVCYADVYRDGTGVVEFVRK -----------------------11113333---3333---------------------- EDMTYAVRKLDNTKFRSHEGETAYIRVKVDGPRSPSYGRSRSSGPSSG ----------------1111---------------------------- >MATRIN 3; SWP:Q8K310; PDB:1X4DA; GSSGSSGQKGRVETRRVVHIMDFQRGKNLRYQLLQLVEPFGVISNHLILNKINEAFIEMA ---------------------------3333----------------------------- TTEDAQAAVDYYTTTPALVFGKPVRVHLSQKYKRIKSGPSSG 3333----1111------iiii-------------------- >RNA binding motif, single; SWP:Q15434; PDB:1X4EA; GSSGSSGLYIRGLQPGTTDQDLVKLCQPYGKIVSTKAILDKTTNKCKGYGFVDFDSPSAA -------------11113333----1111--------------------------3333- QKAVTALKASGVQAQMAKQSGPSSG -------3333-------------- >MATRIN 3; SWP:Q8K310; 
PDB:1X4FA; GSSGSSGKKPEGKPDQKFDQKQELGRVIHLSNLPHSGYSDSAVLKLAEPYGKIKNYILMR --------------------------------------3333------------------ MKSQAFIEMETREDAMAMVDHCLKKALWFQGRCVKVDLSEKYKKLVSGPSSG ----------3333-----1111----------------------------- >NUCLEOLYSIN TIAR; SWP:Q01085; PDB:1X4GA; GSSGSSGNTKQLRFEDVVNQSSPKNCTVYCGGIASGLTDQLMRQTFSPFGQIMEIRVFPE ------------3333-----3333------------3333-----3333---------- KGYSFVRFSTHESAAHAIVSVNGTTIEGHVVKCYWGKESPDMTSGPSSG -------------------------iiii-------------------- >RNA-BINDING PROTEIN 28; SWP:Q8CGC6; PDB:1X4HA; GSSGSSGLPSDVTEGKTVFIRNLSFDSEEEALGEVLQQFGDLKYVRVVLHPDTEHSKGCA -----------------------33333333-----3333-------------------- FAQFMTQEAAQKCLAAASLEAEGGGLKLDGRQLKVDLAVTRDEAASGPSSG -----3333----33333333------%%%%-------------------- >INHIBITOR OF GROWTH PROTE; SWP:Q9NXR8; PDB:1X4IA; GSSGSSGYCICNQVSYGEMVGCDNQDCPIEWFHYGCVGLTEAPKGKWYCPQCTAAMKRRG --------1111-----------3333-----3333------------3333-------- SRHKSGPSSG ---------- ------------------------------------------------------------ --------------- >SKELETAL MUSCLE LIM-PROTE; SWP:Q14192; PDB:1X4KA; GSSGSSGCQECKKTIMPGTRKMEYKGSSWHETCFICHRCQQPIGTKSFIPKDNQNFCVPC -------------------------------1111---------------------3333 YEKQHASGPSSG 3333-------- >SKELETAL MUSCLE LIM-PROTE; SWP:Q14192; PDB:1X4LA; GSSGSSGCAGCTNPISGLGGTKYISFEERQWHNDCFNCKKCSLSLVGRGFLTERDDILCP --------------------------------------------2222----------33 DCGKDISGPSSG 33---------- >FAR UPSTREAM ELEMENT BIND; SWP:Q91WJ8; PDB:1X4MA; GSSGSSGHGDGPGNAVQEIMIPASKAGLVIGKGGETIKQLQERAGVKMVMIQDGPQNTGA ---------------------3333----------------------------------- DKPLRITGDPYKVQQAKEMVLELIRDQGSGPSSG --------1111--3333---1111--------- >FAR UPSTREAM ELEMENT BIND; SWP:Q91WJ8; PDB:1X4NA; GSSGSSGHQQQRSVMTEEYKVPDGMVGFIIGRGGEQISRIQQESGCKIQIAPDSGGLPER ---------------------11111111------------------------------- SCMLTGTPESVQSAKRLLDQIVEKGRSGPSSG ------3333---------------------- >SPLICING FACTOR 4; SWP:Q8CH02; PDB:1X4OA; 
GSSGSSGKVSPPEDEEAKNLAEKLARFIADGGPEVETIALQNNRENQAFSFLYDPNSQGY -------------3333------------------------3333----33333333--- RYYRQKLDEFRKSGPSSG ------------------ >Putative splicing factor,; SWP:Q8IX01; PDB:1X4PA; GSSGSSGVGTIDQLVKRVIEGSLSPKERTLLKEDPAYWFLSDENSLEYKYYKLKLAEMQR -----------------------------333333333333----3333----------- SGPSSG ------ >U4/U6 SMALL NUCLEAR RIBON; SWP:O43395; PDB:1X4QA; GSSGSSGMALSKRELDELKPWIEKTVKRVLGFSEPTVVTAALNCVGKGMDKKKAADHLKP ----------3333---3333---3333-----3333-------1111-3333-----11 FLDDSTLRFVDKLFEAVEEGRSSRHSSGPSSG 11---3333---1111---------------- >PARP14 PROTEIN; SWP:Q2EMV9; PDB:1X4RA; GSSGSSGKSIRLAKEKESQADYISTYVEWQYIDKNITQCFDKMTNMKLEVAWKAKKKDTV -------------3333-----3333--------------3333----3333-------- VQIHNQDFTVDLSTNTATAPQGQTFTVQRLVKASGPSSG --%%%%-----1111------------------------ >ZINC FINGER HIT DOMAIN CO; SWP:Q9UHR6; PDB:1X4SA; GSSGSSGMEPAGPCGFCPAGEVQPARYTCPRCNAPYCSLRCYRTHGTCAENFYSGPSSG ---------------------------------------3333---3333--------- >HYPOTHETICAL PROTEIN LOC5; SWP:Q69ZQ2; PDB:1X4TA; GSSGSSGKVKERRPFLASECTELPKAEKWRRQIIGEISKKVAQIQNAGLGEFRIRDLNDE ---------------3333----------------------------------------- INKLLREKGHWEVRIKELGGPDYGKVSGPSSG ---------------1111----3333----- >ZINC FINGER, FYVE DOMAIN ; SWP:Q5T4F4; PDB:1X4UA; GSSGSSGRYPTNNFGNCTGCSATFSVLKKRRSCSNCGNSFCSRCCSFKVPKSSMGATAPE ------------------------3333-------------------------------- AQRETVFVCASCNQTLSKSGPSSG --------3333------------ >HYPOTHETICAL PROTEIN LOC1; SWP:Q8WV99; PDB:1X4VA; GSSGSSGRKIFTNKCERAGCRQREMMKLTCERCSRNFCIKHRHPLDHDCSGEGHPTSSGP -------------------------------------3333------------------- SSG --- >HYPOTHETICAL PROTEIN FLJ1; SWP:Q9H8U3; PDB:1X4WA; GSSGSSGSRSKQKSRRRCFQCQTKLELVQQELGSCRCGYVFCMLHRLPEQHDCTFDHMGR -------------3333--------3333------------1111-3333---------- GSGPSSG ------- >Fibronectin type-III doma; SWP:Q9Y2H6; PDB:1X4XA; GSSGSSGPDQCKPPQVTCRSATCAQVNWEVPLSNGTDVTEYRLEWGGVEGSMQICYCGPG 
-----------------------------------------------3333--------- LSYEIKGLSPATTYYCRVQALSVVGAGPFSEVVACVTPPSSGPSSG ---------------------------------------------- >biregional cell adhesion ; SWP:Q6AZB0; PDB:1X4YA; GSSGSSGPVAGPYITFTDAVNETTIMLKWMYIPASNNNTPIHGFYIYYRPTDSDNDSDYK -------------------------------------------------3333-3333-- KDMVEGDRYWHSISHLQPETSYDIKMQCFNEGGESEFSNVMICETKARSGPSSG ----1111---------------------3333--------------------- >biregional cell adhesion ; SWP:NA; PDB:1X4ZA; GSSGSSGSQPDHGRLSPPEAPDRPTISTASETSVYVTWIPRGNGGFPIQSFRVEYKKLKK -----------------------------1111--------------------------- VGDWILATSAIPPSRLSVEITGLEKGISYKFRVRALNMLGESEPSAPSRPYVVSGSGPSS -----------1111---------------------1111-------------------- G - >GALECTIN-4; SWP:P56470; PDB:1X50A; GSSGSSGHQQLNSLPTMEGPPTFNPPVPYFGRLQGGLTARRTIIIKGYVPPTGKSFAINF -------------------------------------------------1111------- KVGSSGDIALHINPRMGNGTVVRNSLLNGSWGSEEKKITHNPFGPGQFFDLSIRCGLDRF --------------------------2222------------------------------ KVYANGQHLFDFAHRLSAFQRVDTLEIQGDVTLSYVQISGPSSG ---%%%%--------1111------------------------- >A/G-SPECIFIC ADENINE DNA ; SWP:Q9UIF7-3; PDB:1X51A; GSSGSSGPRKASRKPPREESSATCVLEQPGALGAQILLVQRPNSGLLAGLWEFPSVTWEP -----------------------------3333--------------------------- SEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIKLTYQVYGLALEGQTPVTTVPP ------------3333-----3333----------------------------------- GARWLTQEEFHTAAVSTAMKKVFRVYQGQSGPSSG -----3333-------3333--------------- >PELOTA HOMOLOG; SWP:Q9BRX2; PDB:1X52A; GSSGSSGTVASRLSDTKAAGEVKALDDFYKMLQHEPDRAFYGLKQVEKANEAMAIDTLLI --------------3333----------------3333---------------------- SDELFRHQDVATRSRYVRLVDSVKENAGTVRIFSSLHVSGEQLSQLTGVAAILRFPVPSG 3333-------------------1111------3333--------%%%%----------- PSSG ---- >Activator of 90 kDa heat ; SWP:O95433; PDB:1X53A; GSSGSSGIPTCKITLKETFLTSPEELYRVFTTQELVQAFTHAPATLEADRGGKFHMVDGN ---------------------3333--1111--3333-------------------%%%% VSGEFTDLVPEKHIVMKWRFKSWPEGHFATITLTFIDKNGETELCMEGRGIPAPEEERTR 
-------------------1111----------------------------3333----- QGWQRYYFEGIKQTFGYGASGPSSG ------------------------- >ASPARAGINYL-TRNA SYNTHETA; SWP:O57980; PDB:1X54A; MIEKVYCQEVKPELDGKKVRLAGWVYTNMRVGKKIFLWIRDSTGIVQAVVAKNVVGEETF -----3333-3333----------------!!!!------1111------1111------ EKAKKLGRESSVIVEGIVKADERAPGGAEVHVEKLEVIQAVSEFPIPENPEQASPELLLD -3333-2222----------3333iiii--------------------1111-3333--- YRHLHIRTPKASAIMKVKETLIMAAREWLLKDGWHEVFPPILVTGAVEGGATLFKLKYFD 33331111----------------------------------------3333-----!!! KYAYLSQSAQLYLEAAIFGLEKVWSLTPSFRAEKSRTRRHLTEFWHLELEAAWMDLWDIM !----------------------------------------------------------- KVEEELVSYMVQRTLELRKKEIEMFRDDLTTLKNTEPPFPRISYDEAIDILQSKGVNVEW ---------------------3333---3333-------------------1111---22 GDDLGADEERVLTEEFDRPFFVYGYPKHIKAFYMKEDPNDPRKVLASDMLAPEGYGEIIG 22----------1111---------1111-1111--3333-----------iiii----- GSQREDDYDKLLNRILEEGMDPKDYEWYLDLRRYGSVPHSGFGLGVERLVAWVLKLDHIR ---------------1111-3333-33331111------------------------333 WAALFPRTPARLYP 3------1111--- >ENDOTHELIAL DIFFERENTIATI; SWP:O60869; PDB:1X57A; GSSGSSGDRVTLEVGKVIQQGRQSKGLTQKDLATKINEKPQVIADYESGRAIPNNQVLGK ---------------------------------------------3333----3333--- IERAIGLKLRGKDIGKPIEKGPRAKSGPSSG ----------1111----------------- >HYPOTHETICAL PROTEIN 4930; SWP:Q8C0V1; PDB:1X58A; GSSGSSGRKDFTKEEVNYLFHGVKTMGNHWNSILWSFPFQKGRRAVDLAHKYHRLISGPS -------------------------------------------3333-------1111-- SG -- >HISTIDYL-TRNA SYNTHETASE; SWP:P12081; PDB:1X59A; GSSGSSGMAERAALEELVKLQGERVRGLKQQKASAELIEEEVAKLLKLKAQLGPDESKQK ---------------------------------3333----------------------- FVLKTPKSGPSSG ------------- ------------------------------------------------------------ ----------------------------------------------- >SIGNAL TRANSDUCING ADAPTO; SWP:NA; PDB:1X5BA; GSSGSSGMPLFTANPFEQDVEKATNEYNTTEDWSLIMDICDKVGSTPNGAKDCLKAIMKR ---------------3333-33333333---3333------3333--3333-------33 
VNHKVPHVALQALTLLGACVANCGKIFHLEVCSRDFATEVRAVIKNKAHPKVCEKLKSLM 33---------------3333--3333-3333----------------3333-------- VEWSEEFQKDPQFSLISATIKSMKEEGITFPPAGSQTSGPSSG ---------3333-3333----1111----------------- >PROTEIN DISULFIDE-ISOMERA; SWP:P07237; PDB:1X5CA; GSSGSSGPVKVLVGKNFEDVAFDEKKNVFVEFYAPWCGHCKQLAPIWDKLGETYKDHENI ------------------------------------3333---------3333------- VIAKMDSTANEVEAVKVHSFPTLKFFPASADRTVIDYNGERTLDGFKKFLESGGQSGPSS -----1111--3333----------------------------------3333------- G - >PROTEIN DISULFIDE-ISOMERA; SWP:Q15084; PDB:1X5DA; GSSGSSGDVIELTDDSFDKNVLDSEDVWMVEFYAPWCGHCKNLEPEWAAAASEVKEQTKG --------------3333--1111-------------3333------------------- KVKLAAVDATVNQVLASRYGIRGFPTIKIFQKGESPVDYDGGRTRSDIVSRALDLFSDNA -------3333-33331111-----------------------3333-------3333-- PPPELLESGPSSG ---------%%%% >THIOREDOXIN DOMAIN CONTAI; SWP:Q9H3N1; PDB:1X5EA; GSSGSSGNVRVITDENWRELLEGDWMIEFYAPWCPACQNLQPEWESFAEWGEDLEVNIAK --------------3333------------11113333------------3333------ VDVTEQPGLSGRFIINALPTIYHCKDGEFRRYQGPRTKKDFINFISDKEWKSIEPVSSWF -------3333-------------------------------------3333-------- SGPSSG ------ >NEOGENIN; SWP:Q92859; PDB:1X5FA; GSSGSSGEHAPATTGPLPSAPRDVVASLVSTRFIKLTWRTPASDPHGDNLTYSVFYTKEG -----------------------------1111-------3333---------------- IARERVENTSHPGEMQVTIQNLMPATVYIFRVMAQNKHGSGESSAPLRVETQPESGPSSG ----------2222---------------------------------------------- >NEOGENIN; SWP:Q92859; PDB:1X5GA; GSSGSSGRVETQPEVQLPGPAPNLRAYAASPTSITVTWETPVSGNGEIQNYKLYYMEKGT ------------------------------------------------------------ DKEQDVDVSSHSYTINGLKKYTEYSFRVVAYNKHGPGVSTPDVAVRTLSDSGPSSG ------------------------------------------------3333---- >NEOGENIN; SWP:Q92859; PDB:1X5HA; GSSGSSGDVAVRTLSDVPSAAPQNLSLEVRNSKSIMIHWQPPAPATQNGQITGYKIRYRK ------------------------------------------3333-------------1 ASRKSDVTETLVSGTQLSQLIEGLDRGTEYNFRVAALTINGTGPATDWLSAETFESDLDE 111---------1111---------------------3333------------------- TRVPEVSGPSSG ------------ 
>NEOGENIN; SWP:Q92859; PDB:1X5IA; GSSGSSGPATDWLSAETFESDLDETRVPEVPSSLHVRPLVTSIVVSWTPPENQNIVVRGY ------------------3333-------------------------------------- AIGYGIGSPHAQTIKVDYKQRYYTIENLDPSSHYVITLKAFNNVGEGIPLYESAVTRPHT -------3333-----3333---------------------3333-------------%% SGPSSG %%---- >NEOGENIN; SWP:Q92859; PDB:1X5JA; GSSGSSGPMMPPVGVQASILSHDTIRITWADNSLPKHQKITDSRYYTVRWKTNIPANTKY ------------------------------3333-------------------------- KNANATTLSYLVTGLKPNTLYEFSVMVTKGRRSSTWSMTAHGTTFELSGPSSG ----------------------------------------------------- >NEOGENIN; SWP:Q92859; PDB:1X5KA; GSSGSSGTAHGTTFELVPTSPPKDVTVVSKEGKPKTIIVNWQPPSEANGKITGYIIYYST --------------------------------1111-------3333------------- DVNAEIHDWVIEPVVGNRLTHQIQELTLDTPYYFKIQARNSKGMGPMSEAVQFRTPKASG ----3333------%%%%------------------------------------------ PSSG ---- >EPHRIN TYPE-A RECEPTOR 8; SWP:P29322; PDB:1X5LA; GSSGSSGQAAPSQVVVIRQERAGQTSVSLLWQEPEQPNGIILEYEIKYYEKDKEMQSYST ----------------------1111-----------------------3333------- LKAVTTRATVSGLKPGTRYVFQVRARTSAGCGRFSQAMEVETGKPSGPSSG --------------------------3333-------------3333---- >CALCYCLIN-BINDING PROTEIN; SWP:Q9HB71; PDB:1X5MA; GSSGSSGVVAPITTGYTVKISNYGWDQSDKFVKIYITLTGVHQVPTENVQVHFTERSFDL ---------------------------------------1111-3333------------ LVKNLNGKSYSMIVNNLLKPISVEGSSKKVKTDTVLILCRKKVENTRWDYLTQVEKECKE ----------------------3333---------------------------------- KSGPSSG ------- >HARMONIN; SWP:Q9Y6N9; PDB:1X5NA; GSSGSSGSPGNRENKEKKVFISLVGSRGLGCSISSGPIQKPGIFISHVKPGSLSAEVGLE ------------------------------------3333--------------1111-- IGDQIVEVNGVDFSNLDHKEAVNVLKSSRSLTISIVAAAGRELFMTDRSGPSSG -------%%%%----------------------------3333----------- >RNA binding motif, single; SWP:P29558; PDB:1X5OA; GSSGSSGLKASGVQAQMAKQQEQDPTNLYISNLPLSMDEQELENMLKPFGQVISTRILRD -----------------------3333------1111----------------------3 SSGTSRGVGFARMESTEKCEAVIGHFNGKFIKTPPGVSAPTEPLLCKFSGPSSG 333-----------3333-------2222------------------------- >NEGATIVE 
ELONGATION FACTO; SWP:P18615; PDB:1X5PA; GSSGSSGERRAPRKGNTLYVYGEDMTPTLLRGAFSPFGNIIDLSMDPPRNCAFVTYEKME -------------------------3333----3333----------------------- SADQAVAELNGTQVESVQLKVNIARKQPMLDSGPSSG --------2222-%%%%-------------------- >LAP4 PROTEIN; SWP:Q14160; PDB:1X5QA; GSSGSSGEPARIEEEELTLTILRQTGGLGISIAGGKGSTPYKGDDEGIFISRVSEEGPAA ---------------------------------------------------------333 RAGVRVGDKLLEVNGVALQGAEHHEAVEALRGAGTAVQMRVWRESGPSSG 3---2222----iiii---------------------------------- >GLUTAMATE RECEPTOR INTERA; SWP:Q9C0E4; PDB:1X5RA; GSSGSSGGGQIVHTETTEVVLCGDPLSGFGLQLQGGIFATETLSSPPLVCFIEPDSPAER ----------------------------------------------------22223333 CGLLQVGDRVLSINGIATEDGTMEEANQLLRDAALAHKVVLEVEFDSGPSSG ------------iiii3333-----------3333----------------- >COLD-INDUCIBLE RNA-BINDIN; SWP:Q14011; PDB:1X5SA; GSSGSSGMASDEGKLFVGGLSFDTNEQSLEQVFSKYGQISEVVVVKDRETQRSRGFGFVT --------------------3333------------------------------------ FENIDDAKDAMMAMNGKSVDGRQIRVDQAGKSSDNRSGPSSG ---3333-----------%%%%-------------------- >SPLICING FACTOR 3B SUBUNI; SWP:Q15427; PDB:1X5TA; GSSGSSGIFIGNLDPEIDEKLLYDTFSAFGVILQTPKIMRDPDTGNSKGYAFINFASFDA -------------11113333----3333---------------------------3333 SDAAIEAMNGQYLCNRPITVSYAFKKDSKGSGPSSG ------------------------------------ >Splicing factor 3B subuni; SWP:Q15427; PDB:1X5UA; GSSGSSGPISERNQDATVYVGGLDEKVSEPLLWELFLQAGPVVNTHMPKDRVTGQHQGYG ------------3333-------33331111----3333--------------------- FVEFLSEEDADYAIKIMDMIKLYGKPIRVNKASAHNKNLSGPSSG -----3333---------------------3333----------- >PCFK1; SWP:P83591; PDB:1X5VA; ACGILHDNCVYVPAQNPCCRGLQCRYGKCLVQV -----------3333------------------ >ZINC FINGER PROTEIN 64, I; SWP:Q9NPA5; PDB:1X5WA; GSSGSSGHPEKCSECSYSCSSKAALRIHERIHCTDRPFKCNYCSFDTKQPSNLSKHMKKF ---------------------3333--3333----------------------------- HGDMSGPSSG ---------- >Fibronectin type-III doma; SWP:Q9Y2H6; PDB:1X5XA; GSSGSSGPSMPASPVLTKAGITWLSLQWSKPSGTPSDEGISYILEMEEETSGYGFKPKYD 
------------------------------------------------------------ GEDLAYTVKNLRRSTKYKFKVIAYNSEGKSNPSEVVEFTTCPDSGPSSG ------------------------3333--------------------- >MYOSIN BINDING PROTEIN C,; SWP:NA; PDB:1X5YA; GSSGSSGPTSAPQHLTVEDVTDTTTTLKWRPPDRIGAGGIDGYLVEYCLEGSEEWVPANK ------------------------------------------------2222-------- EPVERCGFTVKDLPTGARILFRVVGVNIAGRSEPATLLQPVTIRESGPSSG --------------------------3333--------------------- >RECEPTOR-TYPE TYROSINE-PR; SWP:P23468; PDB:1X5ZA; GSSGSSGDIQVITQTGVPGQPLNFKAEPESETSILLSWTPPRSDTIANYELVYKDGEHGE ------------------------------------------------------------ EQRITIEPGTSYRLQGLKPNSLYYFRLAARSPQGLGASTAEISARTMQSSGPSSG ------------------------------1111--------------------- >Sporulation-specific N-ac; SWP:Q06320; PDB:1X60A; LKKTSSSGLYKVQIGAFKVKANADSLASNAEAKGFDSIVLLKDGLYKVQIGAFSSKDNAD ------------------------------3333--------------------3333-- TLAARAKNAGFDAIVILES ------------------- >THYROID RECEPTOR INTERACT; SWP:Q15654; PDB:1X61A; GSSGSSGCGGCGEDVVGDGAGVVALDRVFHVGCFVCSTCRAQLRGQHFYAVERRAYCEGC -----------------------%%%%--3333-----------------%%%%------ YVATLESGPSSG ---3333----- >C-TERMINAL LIM DOMAIN PRO; SWP:O00151; PDB:1X62A; GSSGSSGSIGNAQKLPMCDKCGTGIVGVFVKLRDRHRHPECYVCTDCGTNLKQKGHFFVE -------------------------------------1111-----------------%% DQIYCEKHARERVSGPSSG %%--3333----------- >SKELETAL MUSCLE LIM-PROTE; SWP:Q13642; PDB:1X63A; GSSGSSGKCTTREDSPKCKGCFKAIVAGDQNVEYKGTVWHKDCFTCSNCKQVIGTGSFFP ------------------------------------------------------------ KGEDFYCVTCHETKFASGPSSG --------3333---------- ------------------------------------------------------------ ----------------------------- >UNR PROTEIN; SWP:O75534; PDB:1X65A; GSSGSSGREMGVIAAMRDGFGFIKCVDRDVRMFFHFSEILDGNQLHIADEVEFTVVPDML ------------------------------------1111------------------11 SAQRNHAIRIKKLPKGTVSFHSHSGPSSG 11--------------------------- >Friend leukemia integrati; SWP:Q01543; PDB:1X66A; GSSGSSGPPNMTTNERRVIVPADPTLWTQEHVRQWLEWAIKEYSLMEIDTSFFQNMDGKE 
----------------------3333----------------------3333----3333 LCKMNKEDFLRATTLYNTEVLLSHLSYLRESSSGPSSG ----3333-----3333---------3333-------- >DREBRIN-LIKE PROTEIN; SWP:Q9UJU6; PDB:1X67A; GSSGSSGMAANLSRNGPALQEAYVRVVTEKSPTDWALFTYEGNSNDIRVAGTGEGGLEEM ------------------------1111----------------------------3333 VEELNSGKVMYAFCRVKDPNSGLPKFVLINWTGEGVNDVRKGACASHVSTMASFLKGAHV 3333----------------------------33333333-------------------- TINARAEEDVEPECIMEKVASGPSSG -----3333-3333---3333----- >FHL5 PROTEIN; SWP:Q5TD97; PDB:1X68A; GSSGSSGCVACSKPISGLTGAKFICFQDSQWHSECFNCGKCSVSLVGKGFLTQNKEIFCQ 3333---1111--------------%%%%--1111-----------------iiii--33 KCGSGMDTDISGPSSG 33-------------- >LIM DOMAIN KINASE 2; SWP:P53671; PDB:1X6AA; GSSGSSGKDYWGKFGEFCHGCSLLMTGPFMVAGEFKYHPECFACMSCKVIIEDGDAYALV -----------------3333----------%%%%--1111------------------- QHATLYCGKCHNEVVSGPSSG ------33333333------- >RHO GUANINE EXCHANGE FACT; SWP:Q5VV41; PDB:1X6BA; GSSGSSGWQGLSSKGDLPQVEITKAFFAKQADEVTLQQADVVLVLQQEDGWLYGERLRDG ------------------------------------------------------------ ETGWFPEDFARFISGPSSG -----3333---------- >Tyrosine-protein phosphat; SWP:P29350; PDB:1X6CA; GSSGSSGWYHGHMSGGQAETLLQAKGEPWTFLVRESLSQPGDFVLSVLSDQPKAGPGSPL -----------------------------------3333---------------2222-- RVTHIKVMCEGGRYTVGGLETFDSLTDLVEHFKKTGIEEASGAFVYLRQPYYSGPSSG ---------%%%%--------------3333--------------------------- >INTERLEUKIN-16; SWP:Q14005; PDB:1X6DA; GSSGSSGATLKQLDGIHVTILHKEEGAGLGFSLAGGADLENKVITVHRVFPNGLASQEGT -----------------------------------1111----------------3333- IQKGNEVLSINGKSLKGTTHHDALAILRQAREPRQAVIVTRKLTPEAMPDLNSSGPSSG -2222----iiii-------3333----------------------------------- >ZINC FINGER PROTEIN 24; SWP:P17028; PDB:1X6EA; GSSGSSGIHSGEKPYGCVECGKAFSRSSILVQHQRVHTGEKPYKCLECGKAFSQNSGLIN ------3333---------------3333----3333----------------3333--- HQRIHTSGPSSG ------------ >ZINC FINGER PROTEIN 462; SWP:Q96JM2; PDB:1X6FA; GSSGSSGLKRDFIILGNGPRLQNSTYQCKHCDSKLQSTAELTSHLNIHNEEFQKRAKRQE 
------------------------------------3333-------------------- RRKQLLSKQKYADGAFADFKQESGPSSG ---------------------------- ------------------------------------------------------------ --------------------- >TRANSCRIPTIONAL REPRESSOR; SWP:P49711; PDB:1X6HA; GSSGSSGRTHTGEKPYACSHCDKTFRQKQLLDMHFKRYHDPNFVPAAFVCSKCGKTFTRR --------------------------3333------------------------------ NTMARHADNCAGPDGVEGENSGPSSG ------1111---------------- >HYPOTHETICAL PROTEIN YGFY; SWP:P64559; PDB:1X6IA; HMDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECDDPDLFNWLM --1111-----3333----------------3333-----------1111-------111 NHGKPADAELEMMVRLIQTRNRERGPVAI 1---------------------------- >Glutathione-dependent for; SWP:Q51669; PDB:1X6MA; GHMVDTSGVKIHPAVDNGIKPAQPGFAGGTLHCKCSTNPVRVAVRAQTAHNHVCGCTKCW -----2222--1111-------2222---------------------------------- KPEGAIFSQVAVVGRDALEVLEGAEKLEIVNAEAPIQRHRCRDCGVHMYGRIENRDHPFY -------------3333-----3333----1111-------------------1111-22 GLDFVHTELSDEDGWSAPEFAAFVSSIIESGVDPSRMEAIRARLRELGLEPYDALSPPLM 22---3333----------------3333---3333--------1111------------ DAIATHIAKRSGALAA ---------------- >EUKARYOTIC INITIATION FAC; SWP:Q9N9V6; PDB:1X6OA; KTYPLAAGALKKGGYVCINGRPCKVIDLSVSKTHAKVSIVATDIFTGNRLEDQAPSTHNV -----3333-2222---iiii---------------------------------1111-- EVPFVKTYTYSVLDIQANEDPSLPAHLSLMDDEGESREDLDMPPDPALATQIKEQFDSGK -------------------1111--------------------------------1111- DVLVVVVSAMGTEQVLQTKNAAE --------iiii----------- >Bifunctional 3'-phosphoad; SWP:O43252; PDB:1X6VB; HVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTVSMALEEYLVCHGIPCYTLDGDNIRQ -------1111---------------2222-----------------------1111--- GLNKNLGFSPEDREENVRRIAEVAKLFADAGLVCITSFISPYTQDRNNARQIHEGASLPF 1111-----------------------3333---------------------3333---- FEVFVDAPLHVCEQRDVKGLYKKARAGFTGIDSEYEKPEAPELVLKTDSCDVNDCVQQVV ---------------------------2222--------------1111----------- ELLQERDIVPVDASYEVKELYVPENKLHLAKTDAETLPALKINKVDMQWVQVLAEGWATP ---1111---------------1111------3333-------------------1111- 
LNGFMREREYLQCLHFDCLLDGGVINLSVPIVLTATHEDKERLDGCTAFALMYEGRRVAI -------------------2222-------------------2222------iiii---- LRNPEFFEHRKEERCARQWGTTCKNHPYIKMVMEQGDWLIGGDLQVLDRVYWNDGLDQYR ----------------------1111-----1111--------------------3333- LTPTELKQKFKDMNADAVSAFQLRNPVHNGHALLMQDTHKQLLERGYRRPVLLLHPLGGW ----------1111---------------------------------------------- TKDDDVPLMWRMKQHAAVLEEGVLNPETTVVAIFPSPMMYAGPTEVQWHCRARMVAGANF -1111-------------1111--3333------------!!!!---------1111--- YIVGRDPAGMPHPETGKDLYEPSHGAKVLTMAPGLITLEIVPFRVAAYNKKKKRMDYYDS -----2222-----------1111----1111-----------------1111------2 EHHEDFEFISGTRMRKLAREGQKPPEGFMAPKAWTVLTEYYKSLEK 222---------------------2222------------------ >FIMBRIAL PROTEIN; SWP:P02973; PDB:1X6ZA; GTEFARSEGASALASVNPLKTTVEEALSRGWSVKSGTGTEDATKKEVPLGVAADANKLGT --------------------------1111---------------------1111----- IALKPDPADGTADITLTFTMGGAGPKNKGKIITLTRTAADGLWKCTSDQDEQFIPKGCSR -------------------11113333----------------------3333-2222-- >Rab GTPase-binding effect; SWP:Q15276; PDB:1X79B; ETRDQVKKLQLMLRQANDQLEKTMKDKQELEDFIKQSSEDSSHQISALVLRAQASEILLE 3333-------------------------------------------------------- ELQQGLSQAKRDVQEQMAVLMQSREQVSEE ------------------------3333-- >Coagulation factor IX [Fr; SWP:P16293; PDB:1X7AC; IVGGENAKPGQFPWQVLLNGKIDAFCGGSIINEKWVVTAAHCIEPGVKITVVAGEYNTEE 2222---2222--------------------1111------------------------- TEPTEQRRNVIRAIPHHSYNATVNKYSHDIALLELDEPLTLNSYVTPICIADKEYTNIFL -1111------------------------------------1111------3333----- KFGSGYVSGWGRVFNRGRSATILQYLKVPLVDRATCLRSTKFTIYSNMFCAGFHEGGKDS ------------------------------------1111-------------------- CQGDSGGPHVTEVEGTSFLTGIISWGEECAVKGKYGIYTKVSRYVNWIKEKTKLT ---2222------------------------------------------1111-- >ORNITHINE CYCLODEAMINASE; SWP:Q88H32; PDB:1X7DA; TYFIDVPTSDLVHDIGVAPFIGELAAALRDDFKRWQAFDKSARVASHSEVGVIELPVADK ----3333-------------------------3333----------1111--------- SRYAFKYVNGHPANTARNLHTVAFGVLADVDSGYPVLLSELTIATALRTAATSLAAQALA 
----------11111111------------------------------------------ RPNARKALIGNGAQSEFQALAFHKHLGIEEIVAYDTDPLATAKLIANLKEYSGLTIRRAS 1111-------3333--------------------------------3333--------- SVAEAVKGVDIITTVTADKAYATIITPDLEPGHLNAVGGDCPGKTELHADVLRNARVFVE -----2222-------------------------------2222---33331111----- YEPQTRIEGEIQQLPADFPVVDLWRVLRGETEGRQSDSQVTVFDSVGFALEDYTVLRYVL 3333------11111111---3333----------1111--------------------- QQAEKRGGTKIDLVPWVEDDPKDLFSHTRGRA --3333-------------1111-1111---- >OUTER SURFACE PROTEIN; SWP:Q81HJ5; PDB:1X7FA; MERKLGISLYPEHSTKEKDMAYISAAARHGFSRIFTCLLSVAEFKEIINHAKDNNMEVIL ---------3333--------------------------------------1111----- DVAPAVFYSDLSFFAELGADGIRLDVGFDGLTEAKMTNNPYGLKIELNVSNDIAYLENIL --3333---------------------------3333-----------------333333 SHQANKSALIGCHNFYPQKFTGLPYDYFIRCSERFKKHGIRSAAFITSHVANIGPWDIND 33--3333---------2222--------------1111--------------------- GLCTLEEHRNLPIEVQAKHLWATGLIDDVIIGNAYASEEELEKLGNLNRYMLQLKVHFVD ----3333-----------------------------------3333-----------11 EATEVEKRATLQELHVRRGDITEYMVRSTEVRKKYKDYDFPVRESVLQERGQVVIGNNSF 11-------------------1111--3333---1111----------2222----1111 GKYKGELQIILKEMPIDERKNIVGTIAEEELFLLDYVGAWTQFTCVE 1111------------1111------3333---11112222------ >PUTATIVE KETOACYL REDUCTA; SWP:P16544; PDB:1X7GA; SEVALVTGATSGIGLEIARRLGKEGLRVFVCARGEEGLRTTLKELREAGVEADGRTCDVR ---------------------------------------------------------111 SVPEIEALVAAVVERYGPVDVLVNNAGRPGGGATAELADELWLDVVETNLTGVFRVTKQV 1-------------------------------3333------------------------ LKAGGMLERGTGRIVNIASTGGKQGVVHAAPYSASKHGVVGFTKALGLELARTGITVNAV ------1111--------1111---2222--------------------1111------- CPGFVETPMAASVREHYSDIWEVSTEEAFDRITARVPIGRYVQPSEVAEMVAYLIGPGAA ------3333-----1111----------------1111---3333---------3333- AVTAQALNVCGGLGNY ---------iiii--- >HYPOTHETICAL PROTEIN DR18; SWP:NA; PDB:1X7LA; MGHTMPAHTPPAQTAPAAQKAGAQALPVTVQGATVAAVPPSIRDTAAYMTLTNKSDQPIK ------------------------------------------------------------ 
LVGAATPLATSPMLMTTTHSGGMAGMKMVPWLTIPARGTLTLQRDGDHVMLMGLKRPLKV -----3333--------------------------------------------------- GETVNITLKATDGRTLNVAATVKKNIEGR ----------------------------- >RRNA METHYLTRANSFERASE; SWP:Q9F5K6; PDB:1X7OA; RNARFQQWQALLGNRNKRTRAGEFLVGVRPISLAVEHGWPVRTLLYDGQRELSKWARELL ---------11113333-----------------1111---------------------- RTVRTEQIAAPDLLELGEKNEAPPEVVAVVEPADDLDRIPVREDFLGVLFDRPTSPGNIG ---------3333----2222-------------3333----------------3333-- SIIRSADALGAHGLIVAGHAADVYDPKSVRSSTGSLFSLPAVRVPSPGEVDWVEARRAAG ------1111-----------11113333--iiii1111------3333----------- TPIVLVGTDEHGDCDVFDFDFTQPTLLLIGNETAGLSNAWRTLCDYTVSIPAGSASSLNA --------------1111-3333------------------------------------- ANAATAILYEAVRQRISGRTA --------------------- >BETA-2-MICROGLOBULIN; SWP:Q9TQP6; PDB:1X7QA; GSHSMRYFYTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW -------------3333----------!!!!-----1111--------3333---3333- DQETRNVKAQSQTDRVDLGTLRGYYNQSEDGSHTIQIMYGCDVGPDGRFLRGYRQDAYDG -------------------------------------------1111----------iii KDYIALNEDLRSWTAADMAAQITKRKWEAAHAAEQQRAYLEGRCVEWLRRYLENGKETLQ i-----1111------3333-------------------------------------111 RTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 1-------------------------------------iiii------------------ FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWE ---------22223333-----1111--------- >PA3566 PROTEIN; SWP:Q9HY51; PDB:1X7VA; HSTPLTLIATITAAPGHAEALERELRALVAPSRAEAGCLQYDLHQDRHDSHLFYIEQWRD -------------2222------------3333-2222----------1111-------- DAALERHQNTEHFLRFSRGNEALLQNVKIDQLYRLA -------------------3333------------- >GLUCOSE-6-PHOSPHATE ISOME; SWP:P83194; PDB:1X82A; YKEPFGVKVDFETGIIEGAKKSVRRLSDEGYFVDERAWKELVEKEDPVVYEVYAVEQEEK ---------------2222-----3333-------------------------------2 EGDLNFATTVLYPGKVGKEFFFTKGHFHAKLDRAEVYVALKGKGGLLQTPEGDAKWISEP 222------------!!!!----------1111---------------1111------22 GTVVYVPPYWAHRTVNIGDEPFIFLAIYPADAGHDYGTIAEKGFSKIVIEENGEVKVVDN 
22----2222------------------1111------------------%%%%-----1 PRWKK 111-- >UROCANASE PROTEIN; SWP:NA; PDB:1X87A; VRPFAGTERRAKGWIQEAALRLNNNLHPDVKAARNWECYEAIVDTLLRLENDETLLIQSG ---------------------3333-3333---------------------------iii KPVAVFRTHPDAPRVLIANSNLVPAWATWDHGSWIYIGSQGIVQGTYETFAEVARQHFGG i-------1111-----------3333------------------------------iii TLAGTITLTAGLGGGGAQPLAVTNGGVCLAIEVDPARIQRRIDTNYLDTTDSLDAALEAK i2222---------3333-----------------------1111--------------- QAKEEKKALSIGLVGNAAEVLPRLVETGFVPDVLTDQTSAHDPLNGYIPAGLTLDEAAEL ---------------3333-----1111----------3333------2222-------- RARDPKQYIARAKQSIAAHVRALAQKQGAVTFDYGNNIRQVAKDEGVDDAFSFPGFVPAY ----3333----------------1111--------------11111111---------- IRPLFCEGKGPFRWVALSGDPEDIYKTDEVILREFSDNERLCHWIRAQKRIKFQGLPARI ----1111-------3333---------------3333---------------------- CWLGYGERAKFGKIINDVAKGELKAPIVIGRDHLDAIADWPILNALLNAVGGASWVSVHH -----------------1111--------------3333--------------------- GGGVGGYSIHAGVIVADGTKEAEKRLERVLTTDPGLGVIRHADAGYELAIRTAKEKGIDP ------------------------------------------------------------ LK -- >KINESIN-LIKE PROTEIN KIF1; SWP:P52732; PDB:1X88A; NIQVVVRCRPFNLAERKASAHSIVECDPVRKEVSVRTGGLADKSSRKTYTFDMVFGASTK ---------------------------1111--------!!!!------------11113 QIDVYRSVVCPILDEVIMGYNCTIFAYGQTGTGKTFTMEGERSPNEEYTWEEDPLAGIIP 333-------------------------2222-----------%%%%-33331111---- RTLHQIFEKLTDNGTEFSVKVSLLEIYNEELFDLLNPSSDVSERLQMFDDPRNKRGVIIK -------1111---------------%%%%-----11111111------1111-----22 GLEEITVHNKDEVYQILEKGAAKRTTAATLMNAYSSRSHSVFSVTIHMKETTIDGEELVK 22------1111--------------------3333---------------1111----- IGKLNLVDLAGSENNINQSLLTLGRVITALVERTPHVPYRESKLTRILQDSLGGRTRTSI -------------------------------------1111-------1111-------- IATISPASLNLEETLSTLEYAHRAKNILNKPE ------3333---------------------- >NEUTROPHIL GELATINASE-ASS; SWP:P80188; PDB:1X89A; TSDLIPAPPLSKVPLQQNFQDNQFQGKWYVVGLAGNAILREDKDPQKMYATIYELKEDKS --------3333-------3333--------------------------------1111- 
YNVTSVLFRKKKCDYWIRTFVPGSQPGEFTLGNIKSYPGLTSYLVRVVSTNYNQHAMVFF --------%%%%------------2222----33332222-------------------- KKVSQNREYFKITLYGRTKELTSELKENFIRFSKSLGLPENHIVFPVPIDQCID ---%%%%--------------------------1111-1111------------ >WEE1-LIKE PROTEIN KINASE; SWP:P30291; PDB:1X8BA; MKSRYTTEFHELEKIGSGEFGSVFKCVKRLDGCIYAIKRSKKPLAGSVDEQNALREVYAH --3333-------------------------------------2222------------1 AVLGQHSHVVRYFSAWAEDDHMLIQNEYCNGGSLADAISENYRIMSYFKEAELKDLLLQV 111--1111--------------------------------------------------- GRGLRYIHSMSLVHMDIKPSNIFISKVMFKIGDLGHVTRISSPQVEEGDSRFLANEVLQE -----------------1111------------1111-1111------3333-------- NYTHLPKADIFALALTVVCAAGAEPLPRNGDQWHEIRQGRLPRIPQVLSQEFTELLKVMI ---3333-----------1111-------------1111--------------------- HPDPERRPSAMALVKHSVL --3333---------3333 >HYPOTHETICAL PROTEIN YIIL; SWP:P32156; PDB:1X8DA; MIRKAFVMQVNPDAHEEYQRRHNPIWPELEAVLKSHGAHNYAIYLDKARNLLFAMVEIES ----------1111-----1111-----------------------1111---------- EERWNAVASTDVCQRWWKYMTDVMPANPDNSPVSSELQEVFYLP ----3333------------------1111-------------- >BETA-LACTAMASE; SWP:P26918; PDB:1X8HA; AGMSLTQVSGPVYVVEDNYYVQENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSRKP --------!!!!-----------------1111--------------------------- VLEVINTNYHTDRAGGNAYWKSIGAKVVSTRQTRDLMKSDWAEIVAFTRKGLPEYPDLPL ---------1111--------------------------------------3333----- VLPNVVHDGDFTLQEGKVRAFYAGPAHTPDGIFVYFPDEQVLYGGCILKEKLGNLSFADV -------------iiii--------------------------!!!!-------1111-3 KAYPQTLERLKAMKLPIKTVIGGHDSPLHGPELIDHYEALIKAAPQSS 333-------3333-----------------------------2222- >4-deoxy-L-threo-5-hexosul; SWP:Q46938; PDB:1X8MA; SLDVRQSIHSAHAKTLDTQGLRNEFLVEKVFVADEYTMVYSHIDRIIVGGIMPITKTVSV --------33331111--------------------------%%%%-------------- GGEVGKQLGVSYFLERRELGVINIGGAGTITVDGQCYEIGHRDALYVGKGAKEVVFASID -33331111------------------------------2222----------------3 TGTPAKFYYNCAPAHTTYPTKKVTPDEVSPVTLGDNLTSNRRTINKYFVPDVLETCQLSM 333---------------------------------------------1111-------- 
GLTELAPGNLWNTRMEVYFYFNMDDDACVFHMMGQPQETRHIVMHNEQAVISPSWSIHSG -----2222--------------1111-------1111--------------1111---- VGTKAYTFIWGMVGENQVF ------------------- >NITROPHORIN 4; SWP:Q94734; PDB:1X8QA; ACTKNAIAQTGFNKDKYFNGDVWYVTDYLDLEPDDVPKRYCAALAAGTASGKLKEALYHY --------22223333---------------1111-------------iiii-------- DPKTQDTFYDVSELQVESLGKYTANFKKVDKNGNVKVAVTAGNYYTFTVMYADDSSALIH -----------------2222--------1111------2222---------1111---- TCLHKGNKDLGDLYAVLNRNKDAAAGDKVKSAVSAATLEFSKFISTKENNCAYDNDSLKS ----!!!!-----------1111----------1111-3333---1111----------- LLTK 1111 >CYTOCHROME P450 51; SWP:P77901; PDB:1X8VA; SAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDELGDVGTFQLAGKQVVLLSGSHANEF -------------------------------------------!!!!------------- FFRAGDDDLDQAKAYPFMTPIFGRRKEMLHNAALRGEQMKGHAATIEDQVRRMIADWGEA ----3333--11113333-----3333---33333333---------------1111--- GEIDLLDFFAELTIYTSSACLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESF ---3333----------------3333------------------------1111----- RRRDEARNGLVALVADIMNGRIANPRDMLDVLIAVKATPRFSADEITGMFISMMFAGHHT ------------------------------------------------------1111-- SSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHALRQIPQLENVLKETLRLHPP -----------------------------1111-33331111------------------ LIILMRVAKGEFEVQGHRIHEGDLVAASPAISNRIPEDFPDPHDFVPARYEQPRQEDLLN -------------iiii--2222----3333---------1111------3333--3333 RWTWIPFGAGRHRCVGAAFAIMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQP ----1111-11111111------------3333--------1111--------------- AAVRYRRRT --------- >LAMIN A/C; SWP:P02545; PDB:1X8YA; LAAKEAKLRDLEDSLARERDTSRRLLAEKEREMAEMRARMQQQLDEYQELLDIKLALDME --1111------------------------------------------------------ IHAYRKLLEGEEER -------------- >invertase/pectin methyles; SWP:Q9LNF2; PDB:1X91A; SSEMSTICDKTLNPSFCLKFLNTKFASANLQALAKTTLDSTQARATQTLKKLQSIIDGGV -3333--1111--------------------------------------------3333- DPRSKLAYRSCVDEYESAIGNLEEAFEHLASGDGMGMNMKVSAALDGADTCLDDVKRLRS ----------------------------1111----------------------1111-- 
VDSSVVNNSKTIKNLCGIALVISNMLPRN ----------------------1111--- >PHOSPHOHEPTOSE ISOMERASE; SWP:Q9HVZ0; PDB:1X92A; DMQHRIRQLFQASIETKQQALEVLPPYIEQASLVMVNALLNEGKILSCGNGGSAGDAQHF -------------------------------------------------!!!!------- SSELLNRFERERPSLPAVALTTDSSTITSIANDYSYNEVFSKQIRALGQPGDVLLAISTS ----------------------------------3333----------2222-----111 GNSANVIQAIQAAHDREMLVVALTGRDGGGMASLLLPEDVEIRVPSKITARIQEVHLLAI 1-----------------------!!!!---33331111--------------------- HCLCDLIDRQLFGS -------------- >HYPOTHETICAL PROTEIN HP02; SWP:O25010; PDB:1X93A; TRAVSLYFSDEQYQKLEKMANEEEESVGSYIKRYILKALRKIE --------------------1111------------------- >PUTATIVE PHOSPHOHEPTOSE I; SWP:Q9KPY2; PDB:1X94A; MYQDLIRSELTEAADVLQKFLSDDHNIAQIEAAAKLIADSFKQGGKVLSCGNGGSHCDAM -3333------------------------------------------------------- HFAEELTGRYRENRPGYPGIAIDYVFSRYVEAVGAKGDVLFGLSTSGNSGNILKAIEAAK ----------------------------------2222-----1111------------- AKGMKTIALTGKDGGKMAGLADVEIRVPHFGYADRIQEVHIKIIHIIIQLIEKEMA ----------!!!!3333-------------------------------------- >LECTIN; SWP:NA; PDB:1X99A; GHSYSITLRVYQTNRDRGYFSIVEKTVWHFANGGTWSEANGAHTLTGGSGTSGLRFSTKG -------------3333------------%%%%-----iiii--------------1111 ERITVAVGVHNYKRWCDVVTGLKPDETALVINPQYYNNGGRDYVREKQLAEYSVTSAIGT ---------iiii---------11113333-----%%%%-----3333-------1111- KVEVVYTVAEGNNLEANVIFS --------------------- >HYPOTHETICAL MEMBRANE PRO; SWP:Q9HL76; PDB:1X9BA; RNLSDRAKFESMINSPSKSVFVRNLNELEALAVRLGKSYRIQLDQAKEKWKVK ------------------------------------3333------------- >Endoplasmic reticulum man; SWP:Q9UKM7; PDB:1X9DA; HLNYRQKGVIDVFLHAWKGYRKFAWGHDELKPVSRSFSEWFGLGLTLIDALDTMWILGLR -----------------------2222----1111------------------------- KEFEEARKWVSKKLHFEKDVDVNLFESTIRILGGLLSAYHLSGDSLFLRKAEDFGNRLMP -----------------------------------------------------------1 AFRTPSKIPYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSRLTGDKKFQEAVEKV 1111111---------------1111---3333--------------------------- 
TQHIHGLSGKKDGLVPMFINTHSGLFTHLGVFTLGARADSYYEYLLKQWIQGGKQETQLL --3333----iiii-------------2222---2222---------------------- EDYVEAIEGVRTHLLRHSEPSKLTFVGELAHGRFSAKMDHLVCFLPGTLALGVYHGLPAS -----------------------------iiii-----3333----------1111---- HMELAQELMETCYQMNRQMETGLSPEIVHFNLYPQPGRRDVEVKPADRHNLLRPETVESL ------------------1111------------2222-----3333------------- FYLYRVTGDRKYQDWGWEILQSFSRFTRVPSGGYSSINNVQDPQKPEPRDKMESFFLGET ----------------------------1111------1111----------1111---- LKYLFLLFSDDNLLSLDAYVFNTEAHPLPIWT --------------3333-------------- >GLOBIN IV, EXTRACELLULAR; SWP:P13579; PDB:1X9FA; DCCSYEDRREIRHIWDDVWSSSFTDRRVAIVRAVFDDLFKHYPTSKALFERVKIDEPESG -------------3333------------------------333311111111--1111- EFKSHLVRVANGLKLLINLLDDTLVLQSHLGHLADQHIQRKGVTKEYFRGIGEAFARVLP ------------------1111-----------------2222--------------333 QVLSCFNVDAWNRCFHRLVARIAKDLP 3---------------------1111- >Extracellular globin-2; SWP:P02218; PDB:1X9FB; KKQCGVLEGLKVKSEWGRAYGSGHDREAFSQAIWRATFAQVPESRSLFKRVHGDDTSHPA ----------------------------------------3333-------3333----- FIAHADRVLGGLDIAISTLDQPATLKEELDHLQVQHEGRKIPDNYFDAFKTAILHVVAAQ ---------------1111----------------------3333--------------- LGRCYDREAWDACIDHIEDGIKGHH ------------------------- >Extracellular globin-3 [P; SWP:P11069; PDB:1X9FC; HEHCCSEEDHRIVQKQWDILWRDTESSKIKIGFGRLLLTKLAKDIPEVNDLFKRVDIEHA ----------------3333----3333-------------------333333333333- EGPKFSAHALRILNGLDLAINLLDDPPALDAALDHLAHQHEVREGVQKAHFKKFGEILAT ---------------------1111--------------1111---3333---------- GLPQVLDDYDALAWKSCLKGILTKISSRL 3333---------------------1111 >Hemoglobin chain d1 [Prec; SWP:O61233; PDB:1X9FD; ECLVTESLKVKLQWASAFGHAHERVAFGLELWRDIIDDHPEIKAPFSRVRGDNIYSPEFG --------------------3333--------------33333333---1111------- AHSQRVLSGLDITISMLDTPDMLAAQLAHLKVQHVERNLKPEFFDIFLKHLLHVLGDRLG -------------1111--------------1111----3333----------------1 THFDFGAWHDCVDQIIDGIK 111----------------- >PUTATIVE MAR1; SWP:Q4QGT7; PDB:1X9GA; 
MSRLMPHYSKGKTAFLCVDLQEAFSKRIENFANCVFVANRLARLHEVVPENTKYIVTEHY ------1111----------3333---1111----------------------------3 PKGLGRIVPEITLPKTAHLIEKTRFSCVVPQVEELLEDVDNAVVFGIEGHACILQTVADL 333----3333--1111------------------------------1111--------- LDMNKRVFLPKDGLGSQKKTDFKAAIKLMSSWGPNCEITTSESILLQMTKDAMDPNFKRI 1111-------------3333-----------------------------1111-3333- SKLLKEEPPIPL 3333-------- >GLUCOSE-6-PHOSPHATE ISOME; SWP:Q8ZWV0; PDB:1X9IA; SQLLQDYLNWENYILRRVDFPTSYVVEGEVVRIEAMPRLYISGMGGSGVVADLIRDFSLT --------3333-------------iiii--------------!!!!-----------11 WNWEVEVIAVKDYFLKARDGLLIAVSYSGNTIETLYTVEYAKRRRIPAVAITTGGRLAQM 11-----------------------3333-------------------------3333-- GVPTVIVPKASAPRAALPQLLTAALHVVAKVYGIDVKIPEGLEPPNEALIHKLVEEFQKR -----------3333-----------------------------------------1111 PTIIAAESMRGVAYRVKNEFNENAKIEPSVEILPEAHHNWIEGSERAVVALTSPHIPKEH -----1111------------------------3333-3333----------1111---- QERVKATVEIVGGSIYAVEMHPKGVLSFLRDVGIASVKLAEIRGVNPLATPRIDALKRRL ------------------------------------------------------------ >DNA POLYMERASE; SWP:P00581; PDB:1X9MA; MIVSDIEANALLESVTKFHCGVIYDYSTAEYVSYRPSDFGAYLDALEAEVARGGLIVFHN ----------3333-----------1111-----1111---------------------3 GHKYDVPALTKLAKLQLNREFHLPRENCIDTLVLSRLIHSNLKDTDMGLLRSGKLPGALE 333--------------------3333-----------1111---iiii-3333------ AWGYRLGEMKGEYKDDFKRMLEEQGEEYVDGMEWWNFNEEMMDYNVQDVVVTKALLEKLL ---------------------1111---2222---------------------------- SDKHYFPPEIDFTDVGYTTFWSESLEAVDIEHRAAWLLAKQERNGFPFDTKAIEELYVEL --111111111111---------------------------------------------- AARRSELLRKLTETFGSWYQPKGGTEMFCHPRTGKPLPKYPRIKTPKVGGIFKCELDTRE ------------------------------------3333-------------------- YVAGAPYTPVEHVVFNPSSRDHIQKKLQEAGWVPTKYTDKGAPVVDDEVLEGVRVDDPEK -2222----------1111--------1111------1111--------1111------- QAAIDLIKEYLMIQKRIGQSAEGDKAWLRYVAEDGKIHGSVNPNGAVTGRATHAFPNLAQ ----------------------11111111-1111----------1111----------- 
IPGVRSPYGEQCRAAFGAEHHLDGITGKPWVQAGIDASGLELRCLAHFMARFDNGEYAHE --1111-3333-------------------------------------3333iiii---- ILNGDIHTKNQIAAELPTRDNAKTFIYGFLYGAGDEKIGQIVGAGKERGKELKKKFLENT -----------------3333----3333----3333----------------------- PAIAALRESIQQTLVEVKWKRRWIKGLDGRKVHVRSPHAALNTLLQSAGALICKLWIIKT -------------------------1111------3333--------------------- EEMLVEKGLKHGWDGDFAYMAWVHDEIQVGCRTEEIAQVVIETAQEAMRWVGDHWNFRCL -----------1111-------------------------------------1111---- LDTEGKMGPNWAICH ---------3333-- >DNA LIGASE I; SWP:P18858; PDB:1X9NA; DPSGYNPAKNNYHPVEDACWKPGQKVPYLAVARTFEKIEEVSARLRVETLSNLLRSVVAL 3333--------3333----2222--3333------------3333-------------- SPPDLLPVLYLSLNHLGPPQQGLELGVGDGVLLKAVAQATGRQLESVRAEAAEKGDVGLV 3333-------------1111----------------------------------3333- AELPPPPLTASGVFSKFRDIARLTGSASTAKKIDIIKGLFVACRHSEARFIARSLSGRLR -------------------3333-2222-------------------------------- LGLAEQSVLAALSQAVSLTPPGQEFPPAVDAGKGKTAEARKTWLEEQGILKQTFCEVPDL -----------------------------1111-------------------------33 DRIIPVLLEHGLERLPEHCKLSPGIPLKPLAHPTRGISEVLKRFEEAAFTCEYKYDGQRA 33---------11111111----------------------------------------- QIHALEGGEVKIFSRNQEDNTGKYPDIISRIPKIKLPSVTSFILDTEAVAWDREKKQIQP ----1111-----1111-------------1111-3333--------------------3 FQVLTTRKRKEVDASEIQVQVCLYAFDLIYLNGESLVREPLSRRRQLLRENFVETEGEFV 333---------3333--------------iiii-1111---------------2222-- FATSLDTKDIEQIAEFLEQSVKDSCEGLVKTLDVDATYEIAKRSHNWLKLKKDYLDGVGD -------------------1111---------------3333------------------ TLDLVVIGAYLGRGKRAGRYGGFLLASYDEDSEELQAICKLGTGFSDEELEEHHQSLKAL -------------3333------------1111--------------------------- VLPSPRPYVRIDGAVIPDHWLDPSAVWEVKCADLSLSPIYPAARGLVDSDKGISLRFPRF -----1111---------------------------------2222-------------- IRVREDKQPEQATTSAQVACLYRKQS -------3333--------------- >4m5.3 anti-fluorescein si; SWP:NA; PDB:1X9QA; SDVVMTQTPLSLPVSLGDQASISCRSSQSLVHSNGNTYLRWYLQKPGQSPKVLIYKVSNR 
--------------2222-------------1111---------2222------------ VSGVPDRFSGSGSGTDFTLKINRVEAEDLGVYFCSQSTHVPWTFGGGTKLEKDGGVKLDE ----3333----------------3333-----------------------%%%%----- TGGGLVQPGGAMKLSCVTSGFTFGHYWMNWVRQSPEKGLEWVAQFRNKPYNYETYYSDSV ------2222-----------3333---------------------3333------3333 KGRFTISRDDSKSSVYLQMNNLRVEDTGIYYCTGASYGMEYLGQGTSVTVS ---------1111---------3333---------iiii------------ >UMECYANIN; SWP:P42849; PDB:1X9UA; MEDYDVGGDMEWKRPSDPKFYITWATGKTFRVGDELEFDFAAGMHDVAVVTKDAFDNCKK ----2222--------1111----2222--2222------2222---------------- ENPISHMTTPPVKIMLNTTGPQYYICTVGDHCRVGQKLSINVVGA ---------------------------!!!!1111---------- >DNA MISMATCH REPAIR PROTE; SWP:P23367; PDB:1X9ZA; QSFGRVLTIVHSDCALLERDGNISLLSLPVAERWLRQAQLTPGEAPVCAQPLLIPLRLKV ------------------iiii------------------2222---------------- SAEEKSALEKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQSVF ---------------1111---------------3333---3333--------1111--- EPGNIAQWIARNLSEHAQWSAQAITLLADVERLCPQLVKTPPGGLLQSVDLHPAIKALKD ----------------------------------3333---1111--------------- >PUTATIVE NADPH DEPENDENT ; SWP:Q5L022; PDB:1XA0A; SAFQAFVVNKTETEFTAGVQTISDDLPEGDVLVRVHYSSVNYKDGLASIPDGKIVKTPFV ----------------------------------------333333331111-------- PGIDLAGVVVSSQHPRFREGDEVIATGYEIGVTHFGGYSEYARLHGEWLVPLPKGLTLKE -----------------2222-----!!!!--------------3333--------3333 AAIGTAGFTAALSIHRLEEHGLTPERGPVLVTGATGGVGSLAVSLAKRGYTVEASTGKAA -----------------1111-1111------1111---------1111----------- EHDYLRVLGAKEVLARELDKQRWAAAVDPVGGRTLATVLSRRYGGAVAVSGLTGGAEVPT -------------------------------11113333--2222--------------- TVHPFILRGVSLLGIDSVYCPDLRLRIWERLAGDLKPDLERIAQEISLAELPQALKRILR -3333--------------------------------3333-----3333---------- GELRGRTVVRLA ------------ >BETA2-CHIMAERIN; SWP:P52757; PDB:1XA6A; PPIWKSYLYQLQQEAPRPKRIICPREVENRPKYYGREFHGIISREQADELLGGVEGAYIL ----------3333----------------------------3333-------------- RESQRQPGCYTLALRFGNQTLNYRLFHDGKHFVGEKRFESIHDLVTDGLITLYIETKAAE 
----------------------------------------------------------33 YISKMTTNPIYEHIGYATLLRYEKTHNFKVHTFRGPHWCEYCANFMWGLIAQGVRCSDCG 33-3333--3333----------------------------------------------- LNVHKQCSKHVPNDCQPDLKRIKKVYCCDLTTLVKAHNTQRPMVVDICIREIEARGLKSE -----3333------1111-----2222-------------------------------- GLYRVSGFTEHIEDVKMAFDRDGEKADISANVYPDINIITGALKLYFRDLPIPVITYDTY 2222---3333----3333---------------------------------11111111 SKFIDAAKISNADERLEAVHEVLMLLPPAHYETLRYLMIHLKKVTMNEKDNFMNAENLGI -3333-----3333-------1111---------------1111--1111---------- VFGPTLMRPPEDSTLTTLHDMRYQKLIVQILIENEDVLF ---------------3333-------------------- >FLUORESCENT PROTEIN FP538; SWP:Q9U6Y4; PDB:1XA9A; SKHGLKEEMTMKYHMEGCVNGHKFVITGEGIGYPFKGKQTINLCVIEGGPLPFSEDILSA ------------------iiii----------3333-----------------3333-33 GFDRIFTEYPQDIVDYFKNSCPAGYTWGRSFLFEDGAVCICNVDITVSVKENCIYHKSIF 333333---3333-3333--------------1111------------------------ NGMNFPADGPVMKKMTTNWEASCEKIMPVPKQGILKGDVSMYLLLKDGGRYRCQFDTVYK ----------------------------2222------------1111------------ AKSVPSKMPEWHFIQHKLLREDRSDAKNQKWQLTEHAIAFPSAL -------------------------------------------- >3-ISOPROPYLMALATE DEHYDRO; SWP:Q5SIY4; PDB:1XAA; MKVAVLPGDGIGPEVTEAALKVLRALDEAEGLGLAYEVFPFGGAAIDAFGEPFPEPTRKG --------!!!!------------------------------------------------ VEEAEAVLLGSVGGPKWDGLPRKIRPETGLLSLRKSQDLFANLRPAKVFPGLERLSPLKE -------------3333---3333------------------------22221111--33 EIARGVDVLIVRELTGGIYFGEPRGMSEAEAWNTERYSKPEVERVARVAFEAARKRRKHV 332222----------3333------3333---------------------3333----- VSVDKANVLEVGEFWRKTVEEVGRGYPDVALEHQYVDAMAMHLVRSPARFDVVVTGNIFG ----1111-----------------1111------------------------------- DILSDLASVLPGSLGLLPSASLGRGTPVFEPVHGSAPDIAGKGIANPTAAILSAAMMLEH --------33333333-------------------3333--------------------- AFGLVELARKVEDAVAKALLETPPPDLGGSAGTEAFTATVLRHLA -----------------------3333------------------ >3-ISOPROPYLMALATE DEHYDRO; SWP:Q5SIY4; PDB:1XAD; MKVAVLPGDGIGPEVTEAALKVLRALDEAEGLGLAYEVFPFGGAAIDAFGEPFPEPTRKG 
--------!!!!------------------------------------------------ VEEAEAVLLGSVGGPKWDQNPRELRPEKGLLSIRKQLDLFANLRPVKVFESLSDASPLKK -------------3333-------------------------------33331111---3 EYIDNVDFVIVRELTGGIYFGEPRGMSEAEAWNTERYSKPEVERVARVAFEAARKRRKHV 333-------------3333------3333------------------------------ VSVDKANVLEVGEFWRKTVEEVGRGYPDVALEHQYVDAMAMHLVRSPARFDVVVTGNIFG ----1111-3333--------------------------------1111----------- DILSDLASVLPGSLGLLPSASLGRGTPVFEPVHGSAPDIAGKGIANPTAAILSAAMMLEH ------1111-------------------------3333--------------------- AFGLVELARKVEDAVAKALLETPPPDLGGSAGTEAFTATVLRHLA ----3333---------------3333-------------1111- >3-DEHYDROQUINATE SYNTHASE; SWP:Q6GGU4; PDB:1XAGA; MKLQTTYPSNNYPIYVEHGAIKYIGTYLNQFDQSFLLIDEYVNQYFANKFDDILSYENVH ----------------2222----3333-------------------11111111----- KVIIPAGEKTKTFEQYQETLEYILSHHVTRNTAIIAVGGGATGDFAGFVAATLLRGVHFI ------3333-------------1111-------------------------%%%%---- QVPTTILAHDSSVGGKVGINSKQGKNLIGAFYRPTAVIYDLDFLKTLPFKQILSGYAEVY ----3333-1111-------1111---------------33331111------------- KHALLNGESATQDIEQHFKDREILQSLNGMDKYIAKGIETKLDIVVADEKEQGVRKFLNL ------3333---1111--33333333--------------------1111-------22 GHTFGHAVEYYHKIPHGHAVMVGIIYQFIVANALFDSKHDISHYIQYLIQLGYPLDMITD 22-------------------------------------3333-----1111---3333- LDFETLYQYMLSDKKNDKQGVQMVLMRQFGDIVVQHVDQLTLQHACEQLKTYF -3333-----------3333-------2222------3333-------3333- >3-DEHYDROQUINATE SYNTHASE; SWP:Q2YY89; PDB:1XAHA; MKLQTTYPSNNYPIYVEHGAIKYIGTYLNQFDQSFLLIDEYVNQYFANKFDNVHKVIIPA ----------------------33333333---------3333--3333----------! 
GEKTKTFEQYQETLEYILSHHVTRNTAIIAVGGGATGDFAGFVAATLLRGVHFIQVPTTI !!!-------------------------------------------iiii--------33 LAHDSSVGGKVGINSKQGKNLIGAFYRPTAVIYDLDFLKTLPFKQILSGYAEVYKHALLN 33-1111-------3333-----------------3333--3333--------------- GESATQDIEQHFKDREILQSLNGMDKYIAKGIETKLDIVVADEKEQGVRKFLNLGHTFGH -3333---------------2222-----------------1111-------2222---- AVEYYHKIPHGHAVMVGIIYQFIVANALFDSKHDISHYIQYLIQLGYPLDGVQMVLMRQF ---------------------------------3333---------------------22 GDIVVQHVDQLTLQHACEQLKTY 22-----------------3333 >SARS ORF7A ACCESSORY PROT; SWP:P59635; PDB:1XAKA; ELYHYQECVRGTTVILKEPCPSGTYEGNSPFHPLADNKFALTCTSTHFAFACADGTHTYQ --------2222----------------------%%%%-------------1111----- LRARSV ------ >MITOCHONDRIAL PROTEIN IMP; SWP:P25491; PDB:1XAOA; SFKRDGDDLVYEAEIDLLTAIAGGEFALEHVSGDWLKVGIVPGEVIAPGMRKVIEGKGMP -----------------------------3333-------------2222---------- YGNLIIKFTIKFPENHFTSEENLKKLEEILPPRIVPAIPKKATVDECVLADFDPA -----------------------3333-----------2222------------- >RETINOIC ACID RECEPTOR BE; SWP:P10826; PDB:1XAPA; AELDDLTEKIRKAHQETFPSLCQLGKYTTNSSADHRVRLDLGLWDKFSELATKCIIKIVE --------------1111-3333------------------------------------- FAKRLPGFTGLTIADQITLLKAACLDILILRICTRYTPEQDTMTFSDGLTLNRTQMHNAG ----2222-----------------------------1111---1111------------ FGPLTDLVFTFANQLLPLEMDDTETGLLSAICLICGDRQDLEEPTKVDKLQEPLLEALKI !!!!----------3333----------------1111---------------------- YIRKRRPHMFPKILMKITDLRSISAKGAERVITLKMEIPGSMPPLIQEMLEN ---------------------------------------------------- >XENOBIOTIC ACETYLTRANSFER; SWP:P26841; PDB:1XAT; NYFESPFRGKLLSEQVSNPNIRVGRYSYYSGYYHGHSFDDCARYLMPDRDDVDKLVIGSF ----1111--3333---1111--2222---1111--3333-------------------- CSIGSGAAFIMAGNQGHRAEWASTFPFHFMHEEPAFAGAVNGYQPAGDTLIGHEVWIGTE -----------!!!!--1111----3333---3333---------------------222 AMFMPGVRVGHGAIIGSRALVTGDVEPYAIVGGNPARTIRKRFSDGDIQNLLEMAWWDWP 2--2222------------------2222-------------------------1111-3 LADIEAAMPLLCTGDIPALYQHWKQRQA 
333---3333------------------ >B- AND T-LYMPHOCYTE ATTEN; SWP:Q7TSA3; PDB:1XAUA; CEVQLNIKRNSKHSAWTGELFKIECPVKYCVHRPNVTWCKHNGTIWVPLEVGPQLYTSWE -------2222----2222----------------------------------------- ENRSVPVFVLHFKPIHLSDNGSYSCSTNFNSQVINSHSVTIHVR ---------------3333---------!!!!------------ >OCCLUDIN; SWP:Q16625; PDB:1XAWA; WIREYPPITSDQQRQLYKRNFDTGLQEYKSLQSVLDEINKELSRLDKELDDYREESEEYM 3333------------------------------------------------1111---- AAADEYNRLKQVKGSADYKSKKNHCKQLKSKLSHIKKMVGDYDRQKT -------------------------------------------1111 >HYPOTHETICAL UPF0054 PROT; SWP:P71335; PDB:1XAXA; SVLVDLQIATENIEGLPTEEQIVQWATGAVQPEGNEVEMTVRIVDEAESHELNLTYRGKD -----------------3333--------------------------------------- RPTNVLSFPFECPDEVELPLLGDLVICRQVVEREASEQEKPLMAHWAHMVVHGSLHLLGY ------------------------------------------------------------ DHIEDDEAEEMESLETQIMQGLGF -------------------1111- >BACULOVIRAL IAP REPEAT-CO; SWP:Q96P09; PDB:1XB0A; TNLPRNPSMTGYEARLITFGTWMYSVNKEQLARAGFYAIGQEDKVQCFHCGGGLANWKPK -----3333-------1111---------------------------------------- EDPWEQHAKWYPGCKYLLEEKGHEYINNIHLT ----------1111-------------1111- >Elongation factor Ts, mit; SWP:P43896; PDB:1XB2B; SASSKELLMKLRRKTGYSFINCKKALETCGGDLKQAESWLHKQAQKEGWSKAARLHGRKT ----------------------------iiii---------------------------- KEGLIGLLQEGDTTVLVEVNCETDFVSRNLKFQQLVQQVALGTLLHCQNLKDQLSTYSKG ---------!!!!---------3333---------------------------------- FLNSSELSELPAGPEREGSLKDQLALAIGKLGENMILKRAAWVKVPAGFYVGSYVHGAMH ---------------------------------------------2222----------- SPSLHNLVLGKYGALVICETSELKANLADLGRRLGQHVVGMAPLSVGSLDDEPGGEAETK 1111------------------3333---------------------3333---1111-3 MLSQPYLLDPSITLGQYVQPHGVSVVDFVRFECGEG 333--1111---3333-3333----------2222- >AZURIN; SWP:P00282; PDB:1XB3A; AECSVDIQGNDQMQFNTNAITVDKSCKQFTVNLSHPGNLPKNVMGHNWVLSTAADMQGVV ---------1111---------1111-------------1111--------3333----- TCGMASGLDKDYLCPDDSRVIAHTKLIGSGEKDSVTFDVSKLKGGEQYMFFCTFPGHSAL 
-3333-1111------3333-------2222-----------1111-------2222--- MKGTLTLK -------- >STEROID HORMONE RECEPTOR ; SWP:P11474; PDB:1XB7A; LEVLFQGPVNALVSHLLVVEPEKLYAMAVATLCDLFDREIVVTISWAKSIPGFSSLSLSD --1111----------------------3333-----------------2222------- QMSVLQSVWMEVLVLGVAQRSLPLQDELAFAEDLVLDEEGARAAGLGELGAALLQLVRRL ------------------1111--------1111-------1111!!!!----------- QALRLEREEYVLLKALALANSDSVHIEDAEAVEQLREALHEALLEYEAGRRRAGRLLLTL 3333------------------1111-------------------3333----------- PLLRQTAGKVLAHFYGVKLEGKVPMHKLFLEMLEA ----------------------------------- >TYROSINE-PROTEIN KINASE S; SWP:P43405; PDB:1XBBA; VYLDRKLLTLEDKELGSGNFGTVKKGYYQMKKVVKTVAVKILKPALKDELLAEANVMQQL ---3333----------3333-----------------------------------1111 DNPYIVRMIGICEAESWMLVMEMAELGPLNKYLQQNRHVKDKNIIELVHQVSMGMKYLEE -1111------------------1111--------1111--------------------- SNFVHRDLAARNVLLVTQHYAKISDFGLSKALRADENYYKAKWPVKWYAPECINYYKFSS --------3333-------------1111---1111-------3333-3333-------- KSDVWSFGVLMWEAFSYGQKPYRGMKGSEVTAMLEKGERMGCPAGCPREMYDLMNLCWTY -------------1111----2222-----------------2222-------------- DVENRPGFAAVELRLRNYYYDVVNEGHH 3333------------------------ >50S RIBOSOMAL PROTEIN L7A; SWP:P54066; PDB:1XBIA; MAVYVKFKVPEEIQKELLDAVAKAQKIKKGANEVTKAVERGIAKLVIIAEDVKPEEVVAH -1111------------------------------------------------3333111 LPYLCEEKGIPYAYVASKQDLGKAAGLEVAASSVAIINEGDAEELKVLIEKVNVLKQ 1----1111--------------------------------------------1111 >DNAJ; SWP:P08622; PDB:1XBL; AKQDYYEILGVSKTAEEREIRKAYKRLAMKYHPDRNQGDKEAEAKFKEIKEAYEVLTDSQ ---------------3333---------------------------------------33 KRAAYDQYGHAAFEQ 33------------- >T PROTEIN; SWP:P24781; PDB:1XBRA; ELKVSLEERDLWTRFKELTNEMIVTKNGRRMFPVLKVSMSGLDPNAMYTVLLDFVAADNH ---------------1111-----1111--------------1111-------------- RWKYVNGEWVPGGKPEPQAPSCVYIHPDSPNFGAHWMKDPVSFSKVKLTNKMNGGGQIML ----%%%%-----------------1111-------------1111-------------- NSLHKYEPRIHIVRVGGTQRMITSHSFPETQFIAVTAYQNEEITALKIKHNPFAKAFLDA 
--------------------------3333---------3333-------3333------ KERN ---- >DIM1-LIKE PROTEIN; SWP:Q9NX01; PDB:1XBSA; MSFLLPKLTSKKEVDQAIKSTAEKVLVLRFGRDEDPVCLQLDDILSKTSSDLSKMAAIYL -------------------------------3333-------------3333-------- VDVDQTAVYTQYFDISYIPSTVFFFNGQHMKVDYGSPDHTKFVGSFKTKQDFIDLIEVIY -3333-----1111----------iiii-------------------------------- RGAMRGKLIVQSPIDPK --1111----------- >HYPOTHETICAL PROTEIN ISDG; SWP:Q8NX62_STAAW; PDB:1XBWA; TMKFMAENRLTLTKGTAKDIIERFYTRHGIETLGFDGMFVTQTLEQEDFDEVKILTVWKS ------------2222-----1111-iiii-----------------------------3 KQAFTDWLKSDVFKAAHKHVPIINNKVITYDIGYSYMK 333----------------------------------- >PARDAXIN P-4; SWP:P81861; PDB:1XC0A; GFFALIPKIISSPLFKTLLSAVGSALSSSGGQE ----3333------------------------- >PERIPLASMIC IRON-BINDING ; SWP:P17259; PDB:1XC1A; DITVYNGQHKEAAQAVADAFTRATGIKVKLNSAKGDQLAGQIKEEGSRSPADVFYSEQIP ---------------------------------3333--------------------333 ALATLSAANLLEPLPASTINETRGKGVPVAAKKDWVALSGRSRVVVYDTRKLSEKDLEKS 3----1111---------3333-2222--1111-------------------3333---3 VLNYATPKWKNRIGYVPTSGAFLEQIVAIVKLKGEAAALKWLKGLKEYGKPYAKNSVALQ 333--3333------1111----------------------------------------- AVENGEIDAALINNYYWHAFAREKGVQNVHTRLNFVRHRDPGALVTYSGAAVLKSSQNKD ------------3333--------3333--------%%%%1111--------1111---- EAKKFVAFLAGKEGQRALTAVRAEYPLNPHVVSTFNLEPIAKLEAPQVSATTVSEKEHAT ---------------------------1111-------3333------------------ RLLEQAGMK --------- >PUTATIVE FRUCTOKINASE; SWP:O05510; PDB:1XC3A; AMLGGIEAGGTKFVCAVGREDGTIIDRIEFPTKMPDETIEKVIQYFSQFSLQAIGIGSFG ------------------1111-----------------------3333----------- PVDNDKTSQTYGTITATPKAGWRHYPFLQTVKNEMKIPVGFSTDVNAAALGEFLFGEAKG ----1111----------2222---------------------------------1111- LDSCLYITIGTGIGAGAIVEGRLLQGLSHPEMGHIYIRRHPDDVYQGKCPYHGDCFEGLA ------------------iiii--------3333-----1111----------------- SGPAIEARWGKKAADLSDIAQVWELEGYYIAQALAQYILILAPKKIILGGGVMQQKQVFS -------------1111--------------------------------3333-3333-- 
YIYQYVPKIMNSYLDFSELSDDISDYIVPPRLGSNAGIIGTLVLAHQALQAEAAS ---------%%%%--33331111------1111------------------1111 >NUCLEAR RECEPTOR COREPRES; SWP:Q9Y618; PDB:1XC5A; NGLMADPMKVYKDRQVMNMWSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYYYL ---------1111-------3333----------------1111-----3333------1 TKKNENYK 111----- >1-CYS PEROXIREDOXIN; SWP:Q86SB3; PDB:1XCCA; GYHLGATFPNFTAKASGIDGDFELYKYIENSWAILFSHPNDFTPVCTTELAELGKMHEDF --2222--------2222----3333---------------------------------3 LKLNCKLIGFSCNSKESHDKWIEDIKYYGKLNKWEIPIVCDESRELANKLKIMDEQEKDI 333-------------------------------------1111---3333-------11 TGLPLTCRCLFFISPEKKIKATVLYPATTGRNAHEILRVLKSLQLTYTTPVATPVNWNEG 11-----------1111--------3333--3333--------1111------2222--- DKCCVIPTLQDDEISKHFKNEITKVEMPSKKKYLRFVNL -----11113333-------------1111--------- >TRYPTOPHAN SYNTHASE ALPHA; SWP:P00928; PDB:1XCFA; MERYESLFAQLKERKEGAFVPFVTLGDLGIEQSLKIIDTLIEAGADALELGIPFVTPAQC -----------1111--------------------------------------------- FEMLAIIREKHPTIPIGLLMYANLVFNKGIDEFYARCEKVGVDSVLVADVPVEESAPFRQ --------------------3333----------------------11113333------ AALRHNVAPIFICPPNADDDLLRQIASYGRGFTYLLSRAAALPLNHLVAKLKEYNAAPPL --1111------------------------------------3333-----1111----- QGFGISAPDQVKAAIDAGAAGAISGSAIVKIIEQHINEPEKMLAALKVFVQPMKAATR ------3333----1111------------------------------------1111 >RHO GUANINE NUCLEOTIDE EX; SWP:O15085; PDB:1XCGA; QNWQHTVGKDVVAGLTQREIDRQEVINELFVTEASHLRTLRVLDLIFYQRMKKENLMPRE -3333-----3333-------------------------------------1111----- ELARLFPNLPELIEIHNSWCEAMKKLREEGPIIKEISDLMLARFDGPAREELQQVAAQFC -------------------------3333------------------------------1 SYQSIALELIKTKQRKESRFQLFMQEAESHPQCRRLQLRDLIISEMQRLTKYPLLLESII 111----------------------33333333---33331111---------------1 KHTEGGTSEHEKLCRARDQCREILKYVNEAVKQTENRHRLEGYQKRLDATALERASNPLA 111--------------------------------------------------------3 AEFKSLDLTTRKMIHEGPLTWRISKDKTLDLHVLLLEDLLVLLQKQDEKLLLKCTFSPVL 333---3333-------------------------1111--------------------- 
KLNAVLIRSVATDKRAFFIICTSKLGPPQIYELVALTSSDKNTWMELLEEAVRNA 3333--------1111--------------------------------------- >GUANIDINOACETATE N-METHYL; SWP:P10868; PDB:1XCLA; PLFAPGEDCGPAWRAAPAAYDTSDTHLQILGKPVMERWETPYMHSLAAAAASRGGRVLEV ---2222--1111-------1111----iiii------------------1111------ GFGMAIAASRVQQAPIKEHWIIECNDGVFQRLQNWALKQPHKVVPLKGLWEEVAPTLPDG ----------1111--------------------3333----------33333333---- HFDGILYDTYPLSEETWHTHQFNFIKTHAFRLLKPGGILTYCNLTSWGELMKSKYTDITA ------------3333-----------------2222-----3333-------------- MFEETQVPALLEAGFQRENICTEVMALVPPADCRYYAFPQMITPLVTKH ---------------3333----------1111---------------- >HYPOTHETICAL PROTEIN PTD0; SWP:Q9H0W9; PDB:1XCRA; CAEFSFHVPSLEELAGVMQKGLKDNFADVQVSVVDCPDLTKEPFTFPVKGICGKTRIAEV -------------------------------------3333------------------- GGVPYLLPLVNQKKVYDLNKIAKEIKLPGAFILGAGAGPFQTLGFNSEFMPVIQTESEHK -3333-----1111--------11112222--------3333--------------1111 PPVNGSYFAHVNPADGGCLLEKYSEKCHDFQCALLANLFASEGQPGKVIEVKAKRRTGPL ---------------------3333----------------------------------- NFVTCMRETLEKHYGNKPIGMGGTFIIQKGKVKSHIMPAEFSSCPLNSDEEVNKWLHFYE -------------!!!!------------------------------------------- MKAPLVCLPVFVSRDPGFDLRLEHTHFFSRHGEGGHYHYDTTPDIVEYLGYFLPAEFLYR -----------------------------------------3333--------------- IDQPKETHSIGRD ------------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P15374; PDB:1XD3A; EGQRWLPLEANPEVTNQFLKQLGLHPNWQFVDVYGMDPELLSMVPRPVCAVLLLFPITEK ------------------------------------33331111---------------- YEVFRTEEEEKIKSQGQDVTSSVYFMKQTISNACGTIGLIHAIANNKDKMHFESGSTLKK -------------------3333------2222---------11111111---------- FLEESVSMSPEERARYLENYDAIRVTHETSAHEGQTEAPSIDEKVDLHFIALVHVDGHLY ----11113333-----------------1111------1111-----------iiii-- ELDGRKPFPINHGETSDETLLEDAIEVCKKFMERDPDELRFNAIALSAA --1111---------3333---------------1111----------- >ANTIFUNGAL PROTEIN GAFP-1; SWP:Q9AXZ2; PDB:1XD5A; SDRLNSGHQLDTGGSLAEGGYLFIIQNDCNLVLYDNNRAVWASGTNGKASGCVLKMQNDG 
----2222--2222---!!!!----1111-----iiii------2222--------1111 NLVIYSGSRAIWASNTNRQNGNYYLILQRDRNVVIYDNSNNAIWATHTNVGN -----!!!!------------------1111--------------------- >GASTRODIANIN-4; SWP:Q1M0Y9; PDB:1XD6A; SDRLNAGKSLGAGGSLAEGPYLFIMQNDCNLVLYDNNRAVWASGTNGKASNCILKMQRDG ----2222--2222---!!!!----1111-----!!!!------2222--------1111 NLVIYSGSRAMWASNTNRQDGNYYLILQRDRNVVIYDNSNNAIWASGTNV -----!!!!------------------1111-----1111---------- >YWNA; SWP:P71036; PDB:1XD7A; SRLAVAIHILSLISMDEKTSSEIIADSVNTNPVVVRRMISLLKKADILTSRAGVPGASLK -----------3333---------------3333-------------------------- KDPADISLLEVYRAVQKNPKCPVGKKIQNALDETFESVQRAMENELASKSLKDVMN -1111-------1111----------------------------1111-3333--- >PR10.2A; SWP:Q9LLQ3; PDB:1XDFA; GVFTFEDESTSTIAPARLYKALVKDADAIIPKAVEAIQSIETVEGNGGPGTIKKLTLIEG -------------------------3333----3333----------2222-------!! GETKYVLHKIEAVDEANLRYNYSIVGGVGLPDTIEKISFETKLVEGANGGSIGKVTIKIE !!------------1111--------11111111-----------1111----------- TKGDAQPNEEEGKAAKARGDAFFKAIENYLSAHPEYN ------------------3333----------3333- >RV3303C-LPDA; SWP:O53355; PDB:1XDIA; VTRIVILGGGPAGYEAALVAATSHPETTQVTVIDCDGIGGAAVLDDCVPSKTFIASTGLR ---------3333----------1111------------------------------333 TELRRAPHLGFHKISLPQIHARVKTLAAAQSADITAQLLSMGVQVIAGRGELIDSTPGLA 3----3333-----3333--------------------1111------------------ RHRIKATAADGSTSEHEADVVLVATGASPRILPSAQPDGERILTWRQLYDLDALPDHLIV -------1111--------------------1111--------3333------------- VGSGVTGAEFVDAYTELGVPVTVVASQDHVLPYEDADAALVLEESFAERGVRLFKNARAA ---1111-------1111------------------------------------------ SVTRTGAGVLVTMTDGRTVEGSHALMTIGSVPNTSGLGLERVGIQLGRGNYLTVDRVSRT ------------1111---------------------3333-----2222---------- LATGIYAAGDCTGLLPLASVAAMQGRIAMYHALGEGVSPIRLRTVAATVFTRPEIAAVGV ---------1111---------------------------1111---------------- PQSVIDAGSVAARTIMLPLRTNARAKMSEMRHGFVKIFCRRSTGVVIGGVVVAPIASELI ----1111----------1111--------------------------------3333-- LPIAVAVQNRITVNELAQTLAVYPSLSGSITEAARRLMA 
---------------1111-----3333------3333- >RNA EDITING LIGASE MP52; SWP:Q38FA5; PDB:1XDNA; QSDFSPYIEIDLPSESRIQSLHKSGLAAQEWVACEKVHGTNFGIYLINQGDHEVVRFAKR 1111-----------------33331111-------------------!!!!------11 SGIDPNENFFGYHILIDEFTAQIRILNDLLKQKYGLSRVGRLVLNGELFGAKYKHPLVPK 11-1111-%%%%1111--------------------------------------1111-- SEKWCTLPNGKKFPIAGVQIQREPFPQYSPELHFFAFDIKYSVSGAEEDFVLLGYDEFVE ------1111---3333----------------------------3333----------- FSSKVPNLLYARALVRGTLDECLAFDVENFTPLPALLGLGNYPLEGNLAEGVVIRHVRRG ----2222-------------11113333-----11111111-2222--------1111- DPAVEKHNVSTIIKLRCSSFEL -3333-----------3333-- >POLYPHOSPHATE KINASE; SWP:P28688; PDB:1XDOA; GQEKLYIEKELSWLSFNERVLQEAADKSNPLIERMRFLGIYSNNLDEFYKVRFAELKRRI --------------------3333-1111------------------------------- IISEEQGSNSHSRHLLGKIQSRVLKADQEFDGLYNELLLEMARNQIFLINERQLSVNQQN ----------1111---------------------------1111----3333------- WLRHYFKQYLRQHITPILINPDTDLVQFLKDDYTYLAVEIIRGDTIRYALLEIPSDKVPR ---------3333------1111------2222--------!!!!--------3333--- FVNLPPEAPRRRKPMILLDNILRYCLDDIFKGFFDYDALNAYSMKMTRDAEYDLVHEMEA --------3333----3333-33333333------------------------------- SLMELMSSSLKQRLTAEPVRFVYQRDMPNALVEVLREKLTISRYDSIVPGGRYHNFKDFI -3333------------------11113333-----1111--------------3333-- NFPNVGKANLVNKPLPRLRHIWFDKAQFRNGFDAIRERDVLLYYPYHTFEHVLELLRQAS -------1111--------3333-1111-3333------------------------333 FDPSVLAIKINIYRVAKDSRIIDSMIHAAHNGKKVTVVVELQARFDEEANIHWAKRLTEA 33333-------------3333------1111------------------------3333 GVHVIFSAPGLKIHAKLFLISRKENGEVVRYAHIGTGNFNEKTARLYTDYSLLTADARIT -----------------------------------------3333----------3333- NEVRRVFNFIENPYRPVTFDYLMVSPQNSRRLLYEMVDREIANAQQGLPSGITLKLNNLV -----------3333--------------------------------------------- DKGLVDRLYAASSSGVPVNLLVRGMCSLIPNLEGISDNIRAISIVDRYLEHDRVYIFENG -----------1111----------------2222------------------------- GDKKVYLSSADWMTRNIDYRIEVATPLLDPRLKQRVLDIIDILFSDTVKARYIDKELSNR 
------------3333--------------------------1111-------------- YVPRGNRRKVRAQLAIYDYIKSLEQPE --------------------3333--- >Heparin-binding EGF-like ; SWP:Q99075; PDB:1XDTR; PCLRKYKDFCIHGECKYVKELRAPSCICHPGYHGERCHGLS -----2222---------1111------2222-1111---- >NAD+-dependent (R)-2-Hydr; SWP:NA; PDB:1XDWA; MKVLCYGVRDVELPIFEACNKEFGYDIKCVPDYLNTKETAEMAAGFDAVILRGNCFANKQ --------3333-----1111---------------------2222-------------- NLDIYKKLGVKYILTRTAGTDHIDKEYAKELGFPMAFVPRYSPNAIAELAVTQAMMLLRH -------------------1111-----1111---------3333--------------- TAYTTSRTAKKNFKVDAFMFSKEVRNCTVGVVGLGRIGRVAAQIFHGMGATVIGEDVFEI ------3333-----3333---3333-------------------1111----------- KGIEDYCTQVSLDEVLEKSDIITIHAPYIKENGAVVTRDFLKKMKDGAILVNCARGQLVD --3333----------------------3333--------11112222------1111-- TEAVIEAVESGKLGGYGCDVLDGEASVFGKDLEGQKLENPLFEKLVDLYPRVLITPHLGS --------------------2222--2222-2222----------1111--------111 YTDEAVKNMVEVSYQNLKDLAETGDCPNKIK 1------------------------1111-- >TCTEX1 LIGHT CHAIN PROTEI; SWP:O64980; PDB:1XDXA; MEGVDPAVEEAAFVADDVSNIIKESIDAVLQNQQYSEAKVSQWTSSCLEHCIKRLTALNK -----------------------------------3333--------------------- PFKYVVTCIIMQKNGAGLHTAASCWWDSTTDGSRTVRWENKSMYCICTVFGLAI --------------------------3333------------------------ >BACTERIAL SULFITE OXIDASE; SWP:P76342; PDB:1XDYA; KALEFSKPAAWQNNLPLTPADKVSGYNNFYEFGLDKADPAANAGSLKTDPWTLKISGEVA -------3333-------3333-------1111-11113333------------------ KPLTLDHDDLTRRFPLEERIYRMRCVEAWSMVVPWIGFPLHKLLALAEPTSNAKYVAFET -----3333---------------1111---------------------1111------- IYAPEQMPGQQDRFIGGGLKYPYVEGLRLDEAMHPLTLMTVGVYGKALPPQNGAPVRLIV --11113333---1111----------------3333-----iiii--3333-------1 PWKYGFKGIKSIVSIKLTRERPPTTWNLAAPDEYGFYANVNPYVDHPRWSQATERFIGSG 1113333----------------------1111-------1111-1111----------- RQPTLLFNGYADQVASLYRGLDL ----2222-111133332222-- >METHYLTRANSFERASE GIDB; SWP:P25813; PDB:1XDZA; NMNIEEFTSGLAEKGISLSPRQLEQFELYYDMLVEWNEKINLTSITEKKEVYLKHFYDSI 
-----------1111--------------------------------------------3 TAAFYVDFNQVNTICDVGAGAGFPSLPIKICFPHLHVTIVDSLNKRITFLEKLSEALQLE 333---3333---------------------3333------------------------- NTTFCHDRAETFGQRKDVRESYDIVTARAVARLSVLSELCLPLVKKNGLFVALKAAAEEE -------33331111------------------------3333-2222------------ LNAGKKAITTLGGELENIHSFKLPIEESDRNIMVIRKIKNTPKKYPRKPGTPNKSPIE -----------------------------------------3333--2222------- >NUCLEOPHOSMIN; SWP:P07222; PDB:1XE0A; RGSQNFLFGCELKADKKEYSFKVEDDENEHQLSLRTVSLGASAKDELHVVEAEGINYEGK ------------1111-----------------------1111------------1111- TIKIALASLKPSVQPTVSLGGFEITPPVILRLKSGSGPVYVSGQHLVA ---------1111----------------------------------- >HYPOTHETICAL PROTEIN PF09; SWP:Q8U2D2; PDB:1XE1A; IEILSKKPAGKVVVEEVVNIGKDVIIGTVESGIGVGFKVKGPSGIGGIVRIERNREKVEF ---------------------------------2222---1111--------%%%%---- AIAGDRIGISIEGKIGKVKKGDVLEIYQT -2222-------------2222------- >Hypothetical 22.5 kDa pro; SWP:Q03629; PDB:1XE7A; ANAAIEPASFVKVPMPEPPSSLQQLINDWQLIKHREGGYFKETDRSPYTMEVEKPVMVTR -------1111----------------------1111----------------------- NQSTLIYYLLTPDSPIGKFHKNINRIIHILQRGKGQYVLVYPDGQVKSFKVGFDYKNGEV ----------1111--------------------------1111---------3333--- SQWVVPGGVFKASFLLPNEEFDNGFLISEVVVPGFDFEDHTFLKGEDELKHLVGPEKAAE -----2222-----------%%%%-----------3333--------------------- LAFLAH 3333-- >OXIDOREDUCTASE, GFO/IDH/M; SWP:Q9KKQ4; PDB:1XEAA; SLKIAIGLGDIAQKAYLPVLAQWPDIELVLCTRNPKVLGTLATRYRVSATCTDYRDVLQY ---------------33331111-------------------------------3333-- GVDAVIHAATDVHSTLAAFFLHLGIPTFVDKPLAASAQECENLYELAEKHHQPLYVGFNR --------1111---------------------------------------------111 RHIPLYNQHLSELAQQECGALRSLRWEKHRHALPGDIRTFVFDDFIHPLDSVNLSRQCNL 1---------3333---!!!!--------------------------------------- DDLHLTYHSEGLLARLDVQWQTGDTLLHASNRQFGITTEHVTASYDNVAYLFDSFTQGKW ---------------------!!!!-------------------2222------------ RDNQESRVALKDWTPLASKGFDAVQDWLQVAAAGKLPTHIIERNLASHQLAEAICQQITQ 
%%%%--------------------------1111--3333-------------------- QVTK ---- >HYPOTHETICAL PROTEIN PA01; SWP:Q9I717; PDB:1XEBA; SLDWTCKHHADLTLKELYALLQLRTEVFVVEQKCPYQEVDGLDLVGDTHHLAWRDGQLLA -------3333------------------1111--------------------iiii--- YLRLLDPVRHEGQVVIGRVVSSSAARGQGLGHQLERALQAAERLWLDTPVYLSAQAHLQA -----3333%%%%--------3333---3333------------2222------3333-- YYGRYGFVAVTEVYLEDDIPHIGRRA -3333----------%%%%------- >POLYMERIC-IMMUNOGLOBULIN ; SWP:P01833; PDB:1XEDA; SPIFGPEEVNSVEGNSVSITCYYPPTSVNRHTRKYWCRQCITLISSEGYVSSKYAGRANL -----------2222------------------------------------1111----- TNFPENGTFVVNIAQLSQDDSGRYKCGLGINSRGLSFDVSLEVLEHHHHHH ---3333---------3333---------1111------------------ >CHEMOTAXIS-INHIBITING PRO; SWP:Q7WUJ0; PDB:1XEEA; NSGLPTTLGKLDERLRNYLKKGTKNSAQFEKMVILTENKGYYTVYLNTPLAEDRKNVELL ----------------3333----1111----------------3333--3333------ GKMYKTYFFKKGESKSSYVINGPGKTNEYAY ------------------------------- >PEPTIDE DEFORMYLASE; SWP:P27251; PDB:1XEOA; SVLQVLHIPDERLRKVAKPVEEVNAEIQRIVDDMFETMYAEEGIGLAATQVDIHQRIIVI -----------1111-----------------------1111----3333---------- DVSENRDERLVLINPELLEKSGETGIEEGCLSIPEQRALVPRAEKVKIRALDRDGKPFEL --1111-----------------------1111------------------1111----- EADGLLAICIQHEMDHLVGKLFMDYLSPLKQQRIRQKVEKLDRLK ---------------1111-3333--------------------- >NONSTRUCTURAL PROTEIN NS1; SWP:P03502; PDB:1XEQA; GATNATINFEAGILECYERFSWQRALDYPGQDRLHRLKRKLESRIKTHNKSEPENKRMSL 3333--------------------------------------------11111111---- EERKAIGVKMMKVLLFMDPSAGIEGFEPY ------------1111------------- >FERREDOXIN; SWP:P55907; PDB:1XER; GIDPNYRTNRQVVGEHSGHKVYGPVEPPVLGIHGTIVGVDFDLCIADGSCINACPVNVFQ --1111---------iiii----------------------------------------- WYDTPGHPASEKKADPVNEQACIFCMACVNVCPVAAIDVKPP ---2222----------1111----3333--1111------- >DIHYDROPINOSYLVIN SYNTHAS; SWP:Q02323; PDB:1XESA; FEGFRKLQRADGFASILAIGTANPPNAVDQSTYPDFYFRITGNEHNTELKDKFKRICERS ----------------------------3333----------1111----------1111 
AIKQRYMYLTEEILKKNPDVCAFVEVPSLDARQAMLAMEVPRLAKEAAEKAIQEWGQSKS ----------------3333-------------------------------------333 GITHLIFCSTTTPDLPGADFEVAKLLGLHPSVKRVGVFQHGCFAGGTVLRMAKDLAENNR 3---------------------------1111--------1111--------------22 GARVLVICSETTAVTFRGPSETHLDSLVGQALFGDGASALIVGADPIPQVEKACFEIVWT 22---------1111----1111-----------------------2222---------- AQTVVPNSEGAIGGKVREVGLTFQLKGAVPDLISANIENCMVEAFSQFKISDWNKLFWVV -------1111-----1111-----1111---------------3333---1111----- HPGGRAILDRVEAKLNLDPTKLIPTRHVMSEYGNMSSACVHFILDQTRKASLQNGCSTTG -----------------1111--------------3333------------1111----i EGLEMGVLFGFGPGLTIETVVLKSVPI iii------------------------ >INTERNALIN C; SWP:Q8Y6A8; PDB:1XEUA; ESIQRPTPINQVFPDPGLANAVKQNLGKQSVTDLVSQKELSGVQNFNGDNSNIQSLAGMQ -----------------------------1111--33331111--------------333 FFTNLKELHLSHNQISDLSPLKDLTKLEELSVNRNRLKNLNGIPSACLSRLFLDNNELRD 31111------------3333------------------2222----------------- TDSLIHLKNLEILSIRNNKLKSIVMLGFLSKLEVLDLHGNEITNTGGLTRLKKVNWIDLT 1111--1111------------3333--1111------------2222------------ GQKCVNEPVKYQPELYITNTVKDPDGRWISPYYISNGGSYVDGCVLWELPVYTDEVSYKF ----------------------1111--------%%%%---------------------- SEYINVGETEAIFDGTVTQPIKN -----!!!!-------------- >SMC protein; SWP:Q877I1; PDB:1XEWX; MPYIEKLELKGFKSYGNKKVVIPFSKGFTAIVGANGSGKSNIGDAILFVLGGLSAKAKYA -----------!!!!------------------22223333------1111--------- EVAIYFNNEDRGFPIDEDEVVIRRRVYPDGRSSYWLNGRRATRSEILDILTAAMISPDGY -------1111---------------1111-----iiii-----------1111-1111- NIVLQGDITKFIKMSPLERRLLIDDIS ---2222-------------------- >SMC protein; SWP:Q877I1; PDB:1XEWY; VFMRTFEAISRNFSEIFAKLSPGGSARLILENPEDPFSGGLEIEAKPAGKDVKRIEAMSG --------------------2222-------33331111-------2222---3333--- GEKALTALAFVFAIQKFKPAPFYLFDEIDAHLDDANVKRVADLIKESSKESQFIVITLRD --------------------------1111--3333----------------------33 VMMANADKIIGVSMGVSKVVSLSLEKAMKILEEIRKKQGW 331111---------------------------------- >HEMOLYSIN; SWP:A2P8X3; PDB:1XEZA; 
AIKYYNAADWQALPSLAELRDLVINQQKRVLVDFSQISDAEGQAEMQAQFRKAYGVGFAN ---------------------------------1111----------------------- QFIVITEHKGELLFTPFDRTEETNTLPHVAFYISVNRAISDEECTFNNSWLWKNEKGSRP -------iiii----------3333--------------3333--------1111----- FCKDANISLIYRVNLERSLQYGIVGSATPDAKIVRISLDDDSTGAGIHLNDQLGYRQFGA --------------------------------------1111------------------ SYTTLDAYFREWSTDAIAQDYRFVFNASNNKAQILKTFPVDNINEKFERKEVSGFELGVT ------------------------------------------------------------ GGVEVSGDGPKAKLEARASYTQSRWLTYNTQDYRIERNAKNAQAVSFTWNRQQYATAESL -3333--!!!!------------------!!!!-------1111-----------3333- LNRSTDALWVNTYPVDVNRISPLSYASFVPKMDVIYKASATETGSTDFIIDSSVNIRPIY ------1111----------1111--------------1111------------------ NGAYKHYYVVGAHQSYHGFEDTPRRRITKSASFTVDWDHPVFTGGRPVNLQLASFNNRCI -----------------------------------11111111--------1111----- QVDAQGRLTANMCDSQQSAQSFIYDQLGRYVSASNTKLCLDGAALDALQPCNQNLTQRWE --1111-------1111-------1111---1111-----3333---------------- WRKGTDELTNVYSGESLGHDKQTGELGLYASSNDAVSLRTITAYTDVFNAQESSPILGYT -2222---------------------------1111------------------------ QGKMNQQRVGQDNRLYVRAGAAIDALGSASDLLVGGNGGSLSSVDLSGVKSITATSGDFQ ---------1111---------------1111-------------2222---------11 YGGQQLVALTFTYQDGRQQTVGSKAYVTNAHEDRFDLPDAAKITQLKIWADDWLVKGVQF 11----------1111---------------------2222------------------- DLN --- >C5A PEPTIDASE; SWP:P15926; PDB:1XF1A; NDPSQVKTLQEKAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAKKEHGI -11113333------2222--------1111---------------3333---------- TYGEWVNDKVAYYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAPEAQLLL -------------------------%%%%------------------------------- RVEIVNGLADYARNYAQAIRDAINLGAKVINSFGNAALAYANLPDETKKAFDYAKSKGVS -------------------------------------2222-------------1111-- IVTSAGNDSSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASYSPDKQLTETVRVKTADQ ---------2222-----1111----------------------------------1111 QDKEPVLSTNRFEPNKAYDYAYANRGTKEDDFKDVKGKIALIERGDIDFKDKIAKAKKAG ----------------------!!!!-----1111------------------------- 
AVGVLIYDNQDKGFPIELPNVDQPAAFISRKDGLLLKDNPQKTITFNATPKVLPTASGTK ------------------------------------------------------3333-- LSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSSAPLVAGIGLLQKQYETQ -1111----1111------------------------------------------3333- YPDTPSERLDLAKKVLSSATALYDEDEKAYFSPRQQGAGAVDAKKASAATYVTDKDNTSS ------------------------1111---3333!!!!--------------------- KVHLNNVSDKFEVTVNVHNKSDKPQELYYQATVQTDKVDGKHFALAPKVLYETSWQKITI --------------------------------------!!!!------------------ PANSSKQVTVPIDASRFSKDLLAQKNGYFLEGFVRFKQDPTKEELSIPYIGFRGDFGNLS -------------3333---3333--------------1111------------3333-- ALEKPIYDSKDGSSYYHEANSDAKDQLDGDGLQFYALKNNFTALTTESNPWTIIKAVKEG ----33333333-1111--2222-----------1111---------------------- VENIEDIESSEITETIFAGTFAKQDDDSHYYIHRHANGKPYAAISPNGDGNRDYVQFQGT ----------------2222--------------1111---------------------- FLRNAKNLVAEVLDKEGNVVWTSEVTEQVVKNYNNDLASTLGSTRFEKTRWDGKDKDGKV -------------3333------------------1111------3333----------- VANGTYTYRVRYTPISSGAKEQHTDFDVIVDNTTPEVATSATFSTEDRRLTLASKPKTSQ ---------------2222----------------------------------------- PVYRERIAYTYDEDLPTTEYISPNEDGTFTLPEEAETEGATVPLKSDFTYVVEDAGNITY -----------------------1111--------------------------------- TPVTKLLEGH -3333----- ----------------------------- >ALDOLASE C; SWP:P09972; PDB:1XFBA; HSYPALSAEQKKELSDIALRIVAPGKGILAADESVGSMAKRLSQIGVENTEENRRLYRQV ----------------------2222----------------1111-------------- LFSADDRVKKCIGGVIFFHETLYQKDDNGVPFVRTIQDKGIVVGIKVDKGVVPLAGTDGE 1111-1111----------3333--1111------------------------2222--- TTTQGLDGLSERCAQYKKDGADFAKWRCVLKISERTPSALAILENANVLARYASICQQNG -----2222-------1111-----------------3333---------------1111 IVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHVYLEGTLLKPNMVTPGHACPI -----------------------------------1111-1111----------1111-- KYTPEEIAMATVTALRRTVPPAVPGVTFLSGGQSEEEASFNLNAINRCPLPRPWALTFSY -------------------3333------!!!!3333-------1111------------ GRALQASALNAWRGQRDNAGAATEEFIKRAEVNGLAAQGKYE 
3333-------iiii3333----------------------- >ALANINE RACEMASE; SWP:Q50705; PDB:1XFCA; LAEAMVDLGAIEHNVRVLREHAGHAQLMAVVKADGYGHGATRVAQTALGAGAAELGVATV ---------------------!!!!---------iiii---------1111--------- DEALALRADGITAPVLAWLHPPGIDFGPALLADVQVAVSSLRQLDELLHAVRRTGRTATV ------1111----------2222-----1111--------------------------- TVKVDTGLNRNGVGPAQFPAMLTALRQAMAEDAVRLRGLMSHMPDDSINDVQAQRFTAFL -------------3333-----------1111-------------3333----------- AQAREQGVRFEVAHLSNSSATMARPDLTFDLVRPGIAVYGLSPVPALGDMGLVPAMTVKC ---1111----------------3333-------3333-----3333-iiii-------- AVALVKSIRAGEGVSYGHTWIAPRDTNLALLPIGYADGVFRSLGGRLEVLINGRRCPGVG --------2222--2222----------------1111-3333-------iiii------ RICMDQFMVDLGPGPLDVAEGDEAILFGPGIRGEPTAQDWADLVGTIHYEVVTSPRGRIT ------------------2222-------1111--3333-----------1111-!!!!- RTYREA ------ >DIPEPTIDYL AMINOPEPTIDASE; SWP:P42658; PDB:1XFDA; QKKKVTVEDLFSEDFKIHDPEAKWISDTEFIYREQKGTVRLWNVETNTSTVLIEGKKIES -----3333--3333---------------------------3333-------------- LRAIRYEISPDREYALFSYNVEPIYQHSYTGYYVLSKIPHGDPQSLDPPEVSNAKLQYAG --------1111-----------------------------------2222--------- WGPKGQQLIFIFENNIYYCAHVGKQAIRVVSTGKEGVIYNGLSDWLYEEEILKTHIAHWW -----------%%%%-----3333---------2222----------------------- SPDGTRLAYAAINDSRVPIMELPTYTGSIYPTVKPYHYPKAGSENPSISLHVIGLNGPTH 1111---------1111----------------------2222----------------- DLEMMPPDDPRMREYYITMVKWATSTKVAVTWLNRAQNVSILTLCDATTGVCTKKHEDES ---------------------------------3333----------------------- EAWLHRQNEEPVFSKDGRKFFFIRAIPQGGRGKFYHITVSSSQPNSSNDNIQSITSGDWD -------------1111------------------------------------------- VTKILAYDEKGNKIYFLSTEDLPRRRQLYSANTVGNFNRQCLSCDLVENCTYFSASFSHS ---------------------1111----------------------------------- MDFFLLKCEGPGVPMVTVHNTTDKKKMFDLETNEHVKKAINDRQMPKVEYRDIEIDDYNL ---------------------------------------1111----------------- PMQILKPATFTDTTHYPLLLVVDGTPGSQSVAEKFEVSWETVMVSSHGAVVVKCDGRGSG ------------------------2222-------------------------------- 
FQGTKLLHEVRRRLGLLEEKDQMEAVRTMLKEQYIDRTRVAVFGKDYGGYLSTYILPAKG ---------2222--3333-------3333-----3333-----!!!!------------ ENQGQTFTCGSALSPITDFKLYASAFSERYLGLHGLDNRAYEMTKVAHRVSALEEQQFLI -----------------3333--------------------1111-3333---------- IHPTADEKIHFQHTAELITQLIRGKANYSLQIYPDESHYFTSSSLKQHLYRSIINFFVEC --------------------------------2222-------------------3333- FRI --- >LOW-DENSITY LIPOPROTEIN R; SWP:P01130; PDB:1XFEA; GNVTLCEGPNKFKCHSGECITLDKVCNMARDCRDWSDEPIKECGTNECLDNNGGCSHVCN -------------1111--------------3333---3333-----3333--------- DLKIGYECLCPDGFQLVAQRRCE -1111-----2222--------- >Glucosamine--fructose-6-p; SWP:P17169; PDB:1XFFA; CGIVGAIAQRDVAEILLEGLRRLEYRGYDSAGLAVVDAEGHMTRLRRLGKVQMLAQAAEE ----------------------3333----------1111---------3333----111 HPLHGGTGIAHTRWATHGEPSEVNAHPHVSEHIVVVHNGIIENHEPLREELKARGYTFVS 1-------------------3333-----!!!!--------------------------- ETDTEVIAHLVNWELKQGGTLREAVLRAIPQLRGAYGTVIMDSRHPDTLLAARSGSPLVI --3333-------3333----------3333----------3333--------------- GLGMGENFIASDQLALLPVTRRFIFLEEGDIAEITRRSVNIFDKTGAEVKRQDIESNL -----------33333333-------2222----1111----1111------------ >UNKNOWN PROTEIN; SWP:Q949P3; PDB:1XFIA; EMVPFPQLPMPIENNYRACTIPYRFPSDDPKKATPNEISWINVFANSIPSFKKRAESDIT ----1111---2222---------11111111-------------------------333 VPDAPARAEKFAERYAGILEDLKKDPESHGGPPDGILLCRLREQVLRELGFRDIFKKVKD 3-----------------------1111------------------------1111---- EENAKAISLFPQVVSLSDAIEDDGKRLENLVRGIFAGNIFMSFLASCQNLVPRPWVIDDL -----------------------------------------33331111----------- ENFQAKWINKSWKKAVIFVDNSGADIILGILPFARELLRRGAQVVLAANELPSINDITCT -----1111------------!!!!------------1111-----------!!!!---- ELTEILSQLKNGQLLGVDTSKLLIANSGNDLPVIDLSRVSQELAYLSSDADLVIVEGMGR -------------iiii-1111------------1111------1111------------ GIETNLYAQFKCDSLKIGMVKHLEVAEFLGGRLYDCVFKFNEV ----1111-----------------------2222-------- >CONSERVED HYPOTHETICAL PR; SWP:Q9AAV3; PDB:1XFJA; ALPTVQSPLLSSLPGVKHAFFTRQGGVSKGIYDSLNVGRGSQDEPADVEENRARIARWFG 
------3333--2222------------!!!!-----1111---------------1111 GGPEDLNVCYQIHSTIAIVADGSWGDARPEGDAVVSKTPGVICGAAADCAPVLLVDPEAR -3333------------------!!!!----------2222------------------- IVAAAHAGWRGALDGVVQSAVDRVELGASPANITGVVGPCIGPKSYEVGLEFLHRFEADC -----------------------1111-3333---------1111--------------2 PGSGRFFKPGASEDKRFFDLPAFVLDRLATAGVERREWVGRDTRAEEEWFFSNRRAFLNN 2221111----1111-------------1111---------3333-----------1111 DGDYGRLLSAITLE -------------- >FORMIMIDOYLGLUTAMASE; SWP:Q9KSQ2; PDB:1XFKA; TWQGRHDPEDGQAGRRVHHIACPIQVGELANQEPGVALIGFECDAGVERNKGRTGAKHAP ------33331111-3333-----33331111---------------1111---3333-- SLIKQALANLAWHHPIPIYDLGNIRCEGDELEQAQQECAQVIQQALPHARAIVLGGGHEI -----3333-----------------!!!!--------------3333------------ AWATFQGLAQHFLATGVKQPRIGIINFDAHFDLRTFESELAPVRPSSGTPFNQIHHFCQQ ------------1111-----------------------------1111----------- QGWDFHYACLGVSRASNTPALFERADKLGVWYVEDKAFSPLSLKDHLTQLQHFIDDCDYL ------------1111-3333----1111----3333-3333-----------1111--- YLTIDLDVFPAASAPGVSAPAARGVSLEALAPYFDRILHYKNKLMIADIAEYNPSFDIDQ ----1111-3333------------3333-----------------------3333-%%% HTARLAARLCWDIANAMAEQVQSI %----------------------- >THIOREDOXIN H1; SWP:P29448; PDB:1XFLA; MASEEGQVIACHTVETWNEQLQKANESKTLVVVDFTASWCGPCRFIAPFFADLAKKLPNV ------------------------1111--------1111----------------1111 LFLKVDTDELKSVASDWAIQAMPTFMFLKEGKILDKVVGAKKDELQSTIAKHLA -----3333-------------------iiii----------3333-------- >HEAVY CHAIN ANTIBODY; SWP:P00698; PDB:1XFPA; VQLQASGGGSVQAGGSLRLSCAASGYTIGPYCMGWFRQAPGKEREGVAAINSGGGSTYYA -----------2222-----------------------2222-----------------3 DSVKGRFTISQDNAKNTVYLLMNSLEPEDTAIYYCAADSTIYASYYECGHGLSTGGYGYD 333--------3333----------3333--------------------3333%%%%--- SWGQGTQVTVS ----------- >CONSERVED HYPOTHETICAL PR; SWP:Q82XK1; PDB:1XFSA; IDAELDLLKRELAVPVNLVWRGLTEPELLKKWFVPKPWSISDCRVDLRPGGEFYTVQDPE -3333-------------------3333-------------------2222------111 GNKFPNSGCFLEVTDEKRLIWTSALVKNYRPAVPVTAVIELQPTSSGTRYTACAHNTPGQ 
1------------------------2222--------------1111------------- RKLHEEGFHEGWGTTITQLEELLKQEKAY -------------------------3333 >PHYCOERYTHRIN ALPHA-3 CHA; SWP:Q00433; PDB:1XG0A; AMDSAKAPQITIFDHRGCSRAPKESTGGKAGGQDDEMMVKVASTKVTVSESDAAKKLQEF --------------2222-----------------------------------------1 ITFEKGIDGPFTSKN 111------------ >Phycoerythrin alpha-2 cha; SWP:P30943; PDB:1XG0B; AMDKSAKAPVITIFDHRGCSRAPKEYTGAKAGGKDDEMMVKAQSVKIEVSTGTAEGVLAT ---------------2222---------------1111---------------------- SLAKMTK ------- >B-phycoerythrin beta chai; SWP:P27198; PDB:1XG0C; DAFSRVVTADSKAAYVGGADLQALKKFISEGNKRLDSVNSIVSNASCIVSDAVSGMICEN -----------------------1111-------------3333---------------- PSLISPSGCYTNRRMAACLRDGEIILRYVSYALLSGDASVLEDRCLNGLKETYSSLGVPA ----1111-------------------------------------2222----------- NSNARAVSIMKACAVAFVNNTASQKKLSTPQGDCSGLASEVGGYFDKVTAAIS ----------------------------------------------------- >PECTINESTERASE 1; SWP:P14280; PDB:1XG2A; IIANAVVAQDGTGDYQTLAEAVAAAPDKSKTRYVIYVKRGTYKENVEVASNKMNLMIVGD ------------------------------------------------3333-------- GMYATTITGSLNVVDGSTTFRSATLAAVGQGFILQDICIQNTAGPAKDQAVALRVGADMS -----------3333--3333-------2222-----------1111------------- VINRCRIDAYQDTLYAHSQRQFYRDSYVTGTVDFIFGNAAVVFQKCQLVARKPGKYQQNM -----------------------------------------------------2222--- VTAQGRTDPNQATGTSIQFCNIIASSDLEPVLKEFPTYLGRPWKEYSRTVVMESYLGGLI -------1111----------------33333333---------------------3333 NPAGWAEWDGDFALKTLYYGEFMNNGPGAGTSKRVKWPGYHVITDPAKAMPFTVAKLIQG 3333----!!!!1111---------1111-1111--1111----33333333------33 GSWLRSTGVAYVDGLYD 33-1111---------- >Pectinesterase inhibitor; SWP:P83326; PDB:1XG2B; FENHLISEICPKTRNPSLCLQALESDPRSASKDLKGLGQFSIDIAQASAKQTSKIIASLT --3333--3333---------------3333----------------------------1 NQATDPKLKGRYETCSENYADAIDSLGQAKQFLTSGDYNSLNIYASAAFDGAGTCEDSFE 111----------------------------------------------------1111- GPPNIPTQLHQADLKLEDLCDIVLVISNLLP ------------------------------- >PROBABLE METHYLISOCITRATE; 
SWP:P77541; PDB:1XG4A; SLHSPGKAFRAALTKENPLQIVGTINANHALLAQRAGYQAIYLSGGGVAAGSLGLPDLGI -----------1111------------------1111----------------------- STLDDVLTDIRRITDVCSLPLLVDADIGFGSSAFNVARTVKSMIKAGAAGLHIEDQVGAK -3333-------------------!!!!-------------------------------- RSGHRPNKAIVSKEEMVDRIRAAVDAKTDPDFVIMARTDALAVEGLDAAIERAQAYVEAG -1111------------------1111-1111-------3333----------------- AEMLFPEAITELAMYRQFADAVQVPILANITEFGATPLFTTDELRSAHVAMALYPLSAFR ----------3333------------------------------1111------------ AMNRAAEHVYNVLRQEGTQKSVIDTMQTRNELYESINYYQYEEKLDN ------------------11111111--------------------- >ARPG836; SWP:Q6UWP2; PDB:1XG5A; ARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYP -22221111--------------------------------------------------- GTLIPYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVN --------1111-------------------------------1111------------- VLALSICTREAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLR ---------------1111----------1111-----3333------------------ QELREAQTHIRATCISPGVVETQFAFKLHDKDPEKAAATYECLKPEDVAEAVIYVLSTPA ----------------------3333--1111-----------3333-----------11 HIQIGDIQMRPTGS 11--------2222 >HYPOTHETICAL PROTEIN; SWP:NA; PDB:1XG7A; GSRELVEIIKGIGIEGAKEVEEKVDRQFYALQYLFRHQDPEMFIKLVIANSLVSYQLTGR --------------------------------------------------1111------ GEDWWWEFARYFSGREVDSIWKAYGEFLPKSKNNRRLIEAKLNRIRKVEGFLSTLTLKDL -----------2222-----------3333---------------------3333----- EGYYKNMKMLWKALIKIMGSREDSKTIVFTVKMFGYASRIAFSRFIPYPMEIPIPEDLRI ---------------1111-1111---------------1111-----3333-------- KSVTSKLTQEKPTKFWMKIGQESGVPPLHIDSLIWPLLGNADLTPLDIELRNKLMKLTEL -----------------------------33333333-----3333------------11 LGL 11- >HYPOTHETICAL PROTEIN SA07; SWP:NA; PDB:1XG8A; AVVVYGADVICASCVNAPTSKDIYDWLQPLLKRKYPNIFYTYIDITKDLTDHDLQFIERI ----------3333--------------------1111-----------3333------1 EQDELFYPLITMNDEYVADGYIQTKQITRFIDQKLVNE 111--------%%%%-------3333------------ >NITROGEN METABOLITE REPRE; SWP:O59919; PDB:1XGKA; 
QQKKTIAVVGATGRQGASLIRVAAAVGHHVRAQVHSLKGLIAEELQAIPNVTLFQGPLLN ---------1111----------1111--------------------1111--------- NVPLMDTLFEGAHLAFINTTSQAGDEIAIGKDLADAAKRAGTIQHYIYSSMPDHSLYGPW --------2222--------1111-------------3333-----------1111---- PAVPMWAPKFTVENYVRQLGLPSTFVYAGIYNNNFTSLPYPLFQMELMPDGTFEWHAPFD --11113333--------------------1111-------------1111--------1 PDIPLPWLDAEHDVGPALLQIFKDGPQKWNGHRIALTFETLSPVQVCAAFSRALNRRVTY 111-----3333-------------3333------------------------------- VQVPKVEIKVNIPVGYREQLEAIEVVFGEHKAPYFPLPEFSPGGVISQRVTDEARKLWSG ------------3333--------------------3333-2222---1111-------- WRDMEEYAREVFPIEEEANGLDWML ----------------1111-1111 >METHIONINE AMINOPEPTIDASE; SWP:P56218; PDB:1XGSA; MDTEKLMKAGEIAKKVREKAIKLARPGMLLLELAESIEKMIMELGGKPAFPVNLSINEIA ------------------------2222-------------1111----------!!!!- AHYTPYKGDTTVLKEGDYLKIDVGVHIDGFIADTAVTVRVGMEEDELMEAAKEALNAAIS -----2222----2222---------iiii--------2222------------------ VARAGVEIKELGKAIENEIRKRGFKPIVNLSGHKIERYKLHAGISIPNIYRPHDNYVLKE --22223333---------1111------------2222-----------1111----22 GDVFAIEPFATIGAGQVIEVPPTLIYMYVRDVPVRVAQARFLLAKIKREYGTLPFAYRWL 22-----------------------------------------------!!!!--3333- QNDMPEGQLKLALKTLEKAGAIYGYPVLKEIRNGIVAQFEHTIIVEKDSVIVTTE -----------------------------1111---------------------- >EPSIN 4; SWP:Q14677; PDB:1XGWA; VVMNYSEIESKVREATNDDPWGPSGQLMGEIAKATFMYEQFPELMNMLWSRMLKDNKKNW -----------------------3333---------3333---------------1111- RRVYKSLLLLAYLIRNGSERVVTSAREHIYDLRSLENYHFVDEHGKDQGINIRQKVKELV -----------------------------------------1111--------------- EFAQDDDRLREERKKA ------------3333 >K42-41L FAB LIGHT CHAIN; SWP:NA; PDB:1XGYH; QVQLQQSGPELVRPGASVKISCKASGYTFTDYYINWVKQRPGQGLEWIGWIFPRNGNTKY ---------------------------3333--------1111----------------- NEKFKGKATLTVDKSSSTAFMQLSSLTSEDSAVYFCATTVSYVMDYWGQGTTVTVSSAKT ------------1111----------1111------------------------------ TPPSVYPLAPGSATNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLS 
-----------------------------------%%%%-------------iiii---- SSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD -----3333------------1111----------- >Igk-V28 protein [Fragment; SWP:Q5XKG4; PDB:1XGYL; DIVMTQAAFSNPVTLGTSASISCRSSKSLLHSNGITYLYWYLQRPGQSPQLLIYRMSNLA ------------------------------1111-------------------------- SGVPDRFSGSGSGTDFALRISRVEAEDVGVYYCGQMLEHPLTFGTGTKLELKRADAAPTV ---1111----------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----3333-------------------------%%%%--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR ----------1111--------3333---------- >BETA-2-MICROGLOBULIN; SWP:P30474; PDB:1XH3A; GSHSMRYFYTAMSRPGRGEPRFIAVGYVDDTQFVRFDSDAASPRTEPRAPWIEQEGPEYW -------------2222----------!!!!-----1111--------1111---3333- DRNTQIFKTNTQTYRESLRNLRGYYNQSEAGSHIIQRMYGCDLGPDGRLLRGHDQSAYDG ----------------------1111-1111------------1111----------iii KDYIALNEDLSSWTAADTAAQITQRKWEAARVAEQLRAYLEGLCVEWLRRYLENGKETLQ i-----3333-----------------------------------------------111 RADPPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 1-------------------------------------iiii-3333------------- FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP ---------22221111-----1111---------- >POLYPEPTIDE N-ACETYLGALAC; SWP:O08912; PDB:1XHBA; NRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLV --------3333-------------------------------------1111------- DDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCEC -----3333--------------------------------------------------- TAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQRE 2222----------1111---------------------------1111-------3333 MDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTL -1111-1111-------------------1111--11111111---3333---------- EIVTCSHVGHVFGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKL --1111------3333-----------!!!!-------2222------------------ QCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQ 
---3333-----1111----------------------%%%%2222----------!!!! VFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCLDK ----1111------------2222----------!!!!---------------------- ATEEDSQVPSIRDCTGSRSQQWLLRNV ----1111--------1111------- >NADH OXIDASE /NITRITE RED; SWP:Q8U1K9; PDB:1XHCA; SKVVIVGNGPGGFELAKQLSQTYEVTVIDKEPVPYYSKPMLSHYIAGFIPRNRLFPYSLD ------------------1111--------------3333---------3333----333 WYRKRGIEIRLAEEAKLIDRGRKVVITEKGEVPYDTLVLATGARAREPQIKGKEYLLTLR 3-1111--------------------1111-------------------2222------- TIFDADRIKESIENSGEAIIIGGGFIGLELAGNLAEAGYHVKLIHRGAMFLGLDEELSNM ----------------------------------1111-----------%%%%------- IKDMLEETGVKFFLNSELLEANEEGVLTNSGFIEGKVKICAIGIVPNVDLARRSGIHTGR ---------------------1111--1111--------------------1111----- GILIDDNFRTSAKDVYAIGDCAEYSGIIAGTAKAAMEQARVLADILKGEPRRYNFKFRST ----1111---2222---1111-iiii-----------------1111------------ VFKFGKLQIAIIGNTKGEGKWIEDNTKVFYIGAVVFNDIRKATKLE ---!!!!---------------2222-------------------- >PUTATIVE ACETYLTRANSFERAS; SWP:NA; PDB:1XHDA; AIYPYKEKKPKIASSAFIADVTITGDVYVGEESSIWFNTVIRGDVSPTIIGDRVNVQDQC ----!!!!----1111-------------2222--------------------------- TLHQSPQYPLILEDDVTVGHQVILHSCHIKKDALIGGSIILDGAEIGEGAFIGAGSLVSQ ----3333----2222-------------2222-------2222--2222--2222--22 GKKIPPNTLAFGRPAKVIRELTAEDRKDERIRTQYVEKGQYYKSLQ 22----------------------------------------1111 >AEROBIC RESPIRATION CONTR; SWP:P03026; PDB:1XHFA; QTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEHQILSEYDINLVIDINLPGKNGL ----------------------1111-------3333----------------------- LLARELREQANVALFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLSRT -----1111------------------------------------------------ ------------------------------------------------------------ ---------------------------- >PUTATIVE PROTEASE LA HOMO; SWP:Q58812; PDB:1XHKA; EPKVGVIYGLAVLGAGGIGDVTKIIVQILESKNPGTHLLNISGDIAKHSITLASALSKKL ------------------------------------------------------------ VAEKKLPLPKKDIDLNNKEIYIQFSQSYSKIDGDSATAAVCLAIISALLDIPLKQDFAIT 
1111------------------------1111---------------------------- GSLDLSGNVLAIGGVNEKIEAAKRYGFKRVIIPEANIDVIETEGIEIIPVKTLDEIVPLV ---1111-------------------------3333---------------33333333- FDLD ---- >Short-chain dehydrogenase; SWP:Q19774; PDB:1XHLA; RFSGKSVIITGSSNGIGRSAAVIFAKEGAQVTITGRNEDRLEETKQQILKAGVPAEKINA -2222------------------------------------------------1111--- VVADVTEASGQDDIINTTLAKFGKIDILVNNAGANLADGTANTDQPVELYQKTFKLNFQA ---1111----------------------------------1111--------------- VIEMTQKTKEHLIKTKGEIVNVSSIVAGPQAHSGYPYYACAKAALDQYTRCTAIDLIQHG -----------------------3333----3333--------------------3333- VRVNSVSPGAVATGFMGAMGLPETASDKLYSFIGSRKECIPVGHCGKPEEIANIIVFLAD -------------3333------------------11113333---3333---------3 RNLSSYIIGQSIVADGGSTLVMGMQTHDLMSVLS 3331111-------iiii---3333--3333--- >CELLULAR REPRESSOR OF E1A; SWP:O75629; PDB:1XHNA; GSLPPREDAARVARFVTHVSDWGALATISTLEAVRGRPFADVLSLSDGPPGAGSGVPYFY ----3333----------------------3333-------------------------- LSPLQLSVSNLQENPYATLTTLAQTNFCKKHGFDPQSPLCVHILSGTVTKVNETEDIAKH -3333---------------3333---------1111--------------3333----- SLFIRHPEKTWPSSHNWFFAKLNITNIWVLDYFGGPKIVTPEEYYNVT --3333-----3333----------------------------1111- >CHORISMATE MUTASE; SWP:A3DDB7; PDB:1XHOA; VWAIRGATTVSDNTADEIVAETQKLLKEAEKNGLEEDDIISIIFTVTKDLDAAFPAIAAR -----------------------------1111-3333--------3333---------- NGWTSTALCNEIDVPGSLEKCIRVHVNTDKDKKDIKHVYLNGAKVL -------------2222-------------3333-----!!!!--- >HYPOTHETICAL UPF0131 PROT; SWP:P39323; PDB:1XHSA; MRIFVYGSLRHKQGNSHWMTNAQLLGDFSIDNYQLYSLGHYPGAVPGNGTVHGEVYRIDN -------3333----1111---------------------------------------33 ATLAELDALRTRGGEYARQLIQTPYGSAWMYVYQRPVDGLKLIESGDWLDRDK 33--------------------1111----------2222------------- >DNA POLYMERASE; SWP:P03680; PDB:1XI1A; PRKMYSCAFETTTKVEDCRVWAYGYMNIEDHSEYKIGNSLDEFMAWVLKVQADLYFHNLK -------------1111---------3333---------------------------333 FAGAFIINWLERNGFKWSADGLPNTYNTIISRMGQWYMIDICLGYKGKRKIHTVIYDSLK 
3-------3333------------------1111-----------%%%%-------3333 KLPFPVKKIAKDFKLTVLKGDIDYHKERPVGYKITPEEYAYIKNDIQIIAEALLIQFKQG ----------1111--------------2222--3333------------------1111 LDRMTAGSDSLKGFKDIITTKKFKKVFPTLSLGLDKEVRYAYRGGFTWLNDRFKEKEIGE ---------------------3333------------3333--------3333------- GMVFDVNSLYPAQMYSRLLPYGEPIVFEGKYVWDEDYPLHIQHIRCEFELKEGYIPTIQI ---------------------------------3333-------------2222------ KRSRFYKGNEYLKSSGGEIADLWLSNVDLELMKEHYDLYNVEYISGLKFKATTGLFKDFI ------------------------------------------------------------ DKWTYIKTTSEGAIKQLAKLMLNSLYGKFASNPDVTGKVPYLKENGALGFRLGEEETKDP ------------------------3333--------------1111-------------- VYTPMGVFITAWARYTTITAAQACYDRIIYCDTDSIHLTGTEIPDVIKDIVDPKKLGYWA -3333------------------3333----------------3333-------2222-- HESTFKRAKYLRQKTYIQDIYMKEVDGKLVEGSPDDYTDIKFSVKCAGMTDKIKKEVTFE -----------2222----------------------------------3333----111 NFKVGFSRKMKPKPVQVPGGVVLVDDTFTIK 1-2222------------------------- >THIAMINE PHOSPHATE PYROPH; SWP:Q8U192; PDB:1XI3A; NLRNKLKLYVITDRRLKPEVESVREALEGGATAIQMRIKNAPTREMYEIGKTLRQLTREY 3333--------3333----------1111---------------------------111 DALFFVDDRVDVALAVDADGVQLGPEDMPIEVAKEIAPNLIIGASVYSLEEALEAEKKGA 1----------------------1111---------1111-------------------- DYLGAGSVFPTDARVIGLEGLRKIVESVKIPVVAIGGINKDNAREVLKTGVDGIAVISAV --------------------------------------3333----1111------3333 MGAEDVRKATEELRKIVEEVLG ---------------------- >EXTRAGENIC SUPPRESSOR; SWP:Q8TZH9; PDB:1XI6A; KLKFWREVAIDIISDFETTIMPFFGNPDGGKLVKLVDKLAEDLILSRITELGVNVVSEEV 3333---------------3333------------------------3333--------- GVIDNESEYTVIVDPLDGSYNFIAGIPFFALSLAVFKKDKPIYAIIYEPMTERFFEGIPG ------------------------------------------------1111-------- EGAFLNGKRIKVRKSISFYSRGKGHEIVKHVKRTRTLGAIALELAYLAMGALDGVVDVRK ----iiii------------------1111----------------1111---------- YVRPTDIAAGTIIAKEAGALIKDSAGKDIDISFNATDRLDVIAVNSEELLKTIL --3333--------1111------------------------------------ 
----------------------------------------------- >MOLYBDENUM COFACTOR BIOSY; SWP:Q8U034; PDB:1XI8A; RLTPYEEALSIVLNDLKEIEEVEYVPLKDALGRVLAEDIVASYDLSFAGEDVKKGDIALK ------------1111---------33332222-------------2222--2222---2 KGTILRPQDLALLKALGIRKVPVKVKPKVGIIITVDTNSIMLSALVERYFGEPILYGVVP 222----------1111------------------------------------------- DNEDLIRSALEKAKRECDLVLITGFVNLLFHGTTIRPGRPIGYGERVFVMSGYPVAVFTQ -------------------------------------1111--iiii------------- FHLFVKHALAKLVGAKDYEVKVRAVLEDDVPSQLGRYEFVRVMYRDGKAKVIKKGSGIIS --------------------------------------------iiii------------ SLVQSNAYLVVPEDVEGYRRGEEVWVTLY -1111-------------2222------- >PUTATIVE TRANSAMINASE; SWP:Q9P9M8; PDB:1XI9A; SIRASKRALSVEYPARELEKKGIKVIRLNIGDPVKFDFQPPEHMKEAYCKAIKEGHNYYG ----3333-----------------------3333-----3333---------------- DSEGLPELRKAIVEREKRKNGVDITPDDVRVTAAVTEALQLIFGALLDPGDEILVPGPSY 1111--------------------1111-------------------2222--------3 PPYTGLVKFYGGKPVEYRTIEEEDWQPDIDDIRKKITDRTKAIAVINPNNPTGALYDKKT 333----------------3333-------------1111-------------------- LEEILNIAGEYEIPVISDEIYDLMTYEGEHISPGSLTKDVPVIVMNGLSKVYFATGWRLG ------------------1111---------3333--------------11111111--- YMYFVDPENKLSEVREAIDRLARIRLCPNTPAQFAAIAGLTGPMDYLKEYMKKLKERRDY -----1111-------------1111---3333----------3333------------- IYKRLNEIPGISTTKPQGAFYIFPKIEVGPWKNDKEFVLDVLHNAHVLFVHGSGFGEYGA -------2222---------------------------------------3333-3333- GHFRAVFLPPIEILEEAMDRFEKFMKER ---------3333--------------- >D-XYLOSE ISOMERASE; SWP:P12851; PDB:1XIMA; VQATREDKFSFGLWTVGWQARDAFGDATRTALDPVEAVHKLAEIGAYGITFHDDDLVPFG ---1111----1111------1111----------------1111------1111--222 SDAQTRDGIIAGFKKALDETGLIVPMVTTNLFTHPVFKDGGFTSNDRSVRRYAIRKVLRQ 2--------------------------------3333---1111---------------- MDLGAELGAKTLVLWGGREGAEYDSAKDVSAALDRYREALNLLAQYSEDRGYGLRFAIEP ---------------1111---1111---------------------------------- KPNEPRGDILLPTAGHAIAFVQELERPELFGINPETGHEQMSNLNFTQGIAQALWHKKLF 
--------------------1111-3333----------1111----------------- HIDLNGQHGPKFDQDLVFGHGDLLNAFSLVDLLENGPDGAPAYDGPRHFDYKPSRTEDYD ----------------2222---------------------------------1111--- GVWESAKANIRMYLLLKERAKAFRADPEVQEALAASKVAELKTPTLNPGEGYAELLADRS ---------------------------------------1111---2222-------333 AFEDYDADAVGAKGFGFVKLNQLAIEHLLGAR 31111----1111------------------- >NUCLEOPORIN NUP159; SWP:P40477; PDB:1XIPA; ASSLKDEVPTETSEDFGFKFLGQKQILPSFNEKLPFASLQNLDISNSKSLFVAASGSKAV -----------------------------iiii------------1111-----iiii-- VGELQLLRDHITSDSTPLTFKWEKEIPDVIFVCFHGDQVLVSTRNALYSLDLEELSEFRT ------------------------------------------1111----1111------ VTSFEKPVFQLKNVNNTLVILNSVNDLSALDLRTKSTKQLAQNVTSFDVTNSQLAVLLKD ---------------------1111--------------------------------111 RSFQSFAWRNGEEKQFEFSLPSELEELPVEEYSPLSVTILSPQDFLAVFGNVISETDDEV 1-------iiii--------3333---3333----------------------------- SYDQKYIIKHIDGSASFQETFDITPPFGQIVRFPYYKVTLSGLIEPDANVNVLASSCSSE --------------------------------------------1111------1111-- VSIWDSKQVIEPSQDSERAVLPISEETDKDTNPIGVAVDVVTSGLPLVYILNNEGSLQIV -------------1111------------------------------------------- GLFH ---- >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q8ID43; PDB:1XIQA; MEKSFIMIKPDGVQRGLVGTIIKRFEKKGYKLIAIKMLNPTEEILKEHYKELSDQPFFKN ------------1111---------3333-------------------1111-----333 LVAYISKGPVVAMVWEGVDMVKQGRKLIGETNPLTSNTGTIRGDFCLEVSKNVIHGSDSV 3--3333------------------------3333------------3333--------- ASANKEINIWFKAEELTQWKHHMKEWICS -----------3333-----1111----- >RXR-LIKE PROTEIN; SWP:Q8T5C6; PDB:1XIUA; NDMPVEQILEAELAVDPKIDTYIDAQKDPVTNICQAADKQLFTLVEWAKRIPHFTELPLE ------------1111----------------------------------2222------ DQVILLRAGWNELLIAGFSHRSIMAKDGILLATGLHVHRSSAHQAGVGTIFDRVLTELVA ---------------------1111-----1111--------1111-------------- KMRDMKMDKTELGCLRAVVLFNPDAKGLTAVQEVEQLREKVYASLEEYTKSRYPEEPGRF ---------------------1111---------------------------1111---- AKLLLRLPALRSIGLKCLEHLFFFKLIGDQPIDTFLMEMLE 
-------------------------------------1111 >T-cell surface glycoprote; SWP:P04234; PDB:1XIWB; MKIPIEELEDRVFVNCNTSITWVEGTVGTLLSDITRLDLGKRILDPRGIYRCNESTVQVH -------------------------------1111-----3333---------------- YRMCQS ------ >T-CELL SURFACE GLYCOPROTE; SWP:NA; PDB:1XIWD; EVQLQQSGPELVKPGASMKISCKASGYSFTGYTMNWVKQSHGKNLEWMGLINPYKGVSTY ------------2222-----------1111--------%%%%----------------- NQKFKDKATLTVDKSSSTAYMELLSLTSEDSAVYYCARSGYYGDSDWYFDVWGQGTTLTV 3333---------1111---------3333-----------1111--------------- FS -- >PEROXIREDOXIN; SWP:NA; PDB:1XIYA; MKENDLIPNVKVMIDVRNMNNNDFTSIDTHELFNNKKILLISLPGAFTPTSTKMIPGYEE -2222---------3333---------3333-------------2222------------ EYDYFIKENNFDDIYCITNNDIYVLKSWFKSMDIKKIKYISDGNSSFTDSMNMLVDKSNF -----------------------------------------1111---1111----3333 FMGMRPWRFVAIVENNILVKMFQEKDKQHNIQTDPYDISTVNNVKEFLKNN -------------iiii-------------------1111----------- >putative phosphotransfera; SWP:Q8ZL18_SALTY; PDB:1XIZA; AMDIHFRRHYVRHLPKEVSQNDIIKALASPLINDGMVVSDFADHVITREQNFPTGLPVEP 11----1111---------------------------1111------------------- VGVAIPHTDHKYVRQNAISVGILAEPVNFEDMGGEPDPVPVRVVFMLALGESNKQLNVLG --------3333------------------1111----------------1111------ WIMDVIQDEDFMQQLLVMNDDEIYQSIYTRISE --------------------------------- >SENSOR PROTEIN FIXL; SWP:P23222; PDB:1XJ4A; DAMIVIDGHGIIQLFSTAAERLFGWSELEAIGQNVNILMPEPDRSRHDSYISRYRTTSDP ------1111---------------33332222--1111--3333--------------- HIIGIGRIVTGKRRDGTTFPMHLSIGEMQSGGEPYFTGFVRDLTEH -2222-------1111-------------iiii------------- >SPERMIDINE SYNTHASE 1; SWP:Q9ZUB3; PDB:1XJ5A; STVIPGWFSESPWPGEAHSLKVEKVLFQGKSDYQDVIVFQSATYGKVLVLDGVIQLTERD ------------2222---------------------------------iiii---3333 ECAYQEITHLPLCSIPNPKKVLVIGGGDGGVLREVARHASIEQIDCEIDKVVDVSKQFFP ----------1111-----------------------3333------------------3 DVAIGYEDPRVNLVIGDGVAFLKNAAEGSYDAVIVDSSDPIGPAKELFEKPFFQSVARAL 33333331111----------1111------------------3333------------- 
RPGGVVCTQAESLWLHDIIEDIVSNCREIFKGSVNYAWTSVPTYPSGVIGFLCSTEGPDV 2222-------3333-------------------------1111---------------- DFKHPLNPIDESSSKSNGPLKFYNAEIHSAAFCLPSFAKKVIE 3333------33331111-----------1111-3333-1111 >MOBB PROTEIN HOMOLOG; SWP:NA; PDB:1XJCA; NVWQVVGYKHSGKTTLEKWVAAAVREGWRVGTVKHHGAVATAVEGDGLLQLHLRRPLWRL -------22223333-----------------------------iiii----------33 DDVLALYAPLRLDLVLVEGYKQERHPKVVLVRSEEDWASLQHLANIRAVIAWEPLEGPLA 33----3333---------1111----------------1111----------------- HPVFSLADDDEYIPWLNEVRTR ----1111---3333------- >PROTEIN KINASE C, THETA T; SWP:Q04759; PDB:1XJDA; IEDFILHKMLGKGSFGKVFLAEFKKTNQFFAIKALKKDVVLMDDDVECTMVEKRVLSLAW 1111------------------3333--------------1111-------------333 EHPFLTHMFCTFQTKENLFFVMEYLNGGDLMYHIQSCHKFDLSRATFYAAEIILGLQFLH 31111------------------------------------------------------- SKGIVYRDLKLDNILLDKDGHIKIADFGMCKENMLGDAKTNFCGTPDYIAPEILLGQKYN ---------3333---1111------1111----!!!!------3333-3333------3 HSVDWWSFGVLLYEMLIGQSPFHGQDEEELFHSIRMDNPFYPRWLEKEAKDLLVKLFVRE 333--------------------------------------1111--------------1 PEKRLGVRGDIRQHPLFREINWEELERKEIDPQNMFRNFF 111------33333333----------------------- >NIFU-LIKE PROTEIN; SWP:O32163; PDB:1XJSA; MSFNANLDTLYRQVIMDHYKNPRNKGVLNDSIVVDMNNPTCGDRIRLTMKLDGDIVEDAK ------------------------------------------------------------ FEGEGCSISMASASMMTQAIKGKDIETALSMSKIFSDMMQGKEYDDSIDLGDIEALQGVS -----3333-----------------------------------1111-3333------- KFPARIKCATLSWKALEKGVAKEEGGN -11113333------------------ >LYSOZYME; SWP:Q37875; PDB:1XJTA; GGAICAIAVITIVGNGNVRTNQAGLELIGNAEGCRRDPYCPAGVWTDGIGNTHGVTPGVR -----3333----------------------------------------------2222- KTDQQIAADWEKNILIAERCINQHFRGKDPDNAFSATSAAFNGCNSLRTYYSKARGRVET -------------------------3333-------------3333-----3333----- SIHKWAQKGEWVNCNHLPDFVNSNGVPLRGLKIRREKERQLCLTGLVNEH -----11113333--3333---%%%%----------------2222---- >LYSOZYME; SWP:Q37875; PDB:1XJUA; RTNQAGLELIGNAEGCRRDPYMCPAGVWTDGIGNGVTPGVRKTDQQIAADWEKNILIAER 
------------------3333-3333-2222---------------------------- CINQHFRGKDMPDNAFSAMTSAAFNMGCNSLRTYYSKARGMRVETSIHKWAQKGEWVNMC ------3333-----------------3333-----1111----------1111------ NHLPDFVNSNGVPLRGLKIRREKERQLCLTGLVNEH -1111---iiii----------------2222---- >PROTECTION OF TELOMERES 1; SWP:Q9NUX5; PDB:1XJVA; ATNYIYTPLNQLKGGTIVNVYGVVKFFKPPYLSKGTDYCSVVTIVDQTNVKLTCLLFSGN -------1111----------------------------------1111----------3 YEALPIIYKNGDIVRFHRLKIQVYKKETQGITSSGFASLTFEGTLGAPIIPRTSSKYFNF 333-----2222-----------iiii-----2222-------2222------------- TTEDHKMVEALRVWASTHMSTLLKLCDVQPMQYFDLTCQLLGKAEVDGASFLLKVWDGTR 3333-------------------3333--------------------------------- TPFPSWRVLIQDLVLEGDLSHIHRLQNLTIDILVYDNHVHVARSLKVGSFLRIYSLHTKL ------------------------!!!!------!!!!---11112222----------- QSMNSENQTMLSLEFHLHGGTSYGRGIRVLPESNSDVDQLKKDLESANLTA ---1111-------------2222------1111----------------- >CALGRANULIN A; SWP:P05109; PDB:1XK4A; MLTELEKALNSIIDVYHKYSLIKGNFHAVYRDDLKKLLETESPQYIRKKGADVWFKELDI ------------------1111--1111--------------33333333--------11 NTDGAVNFQEFLILVIKMGVAAHKKSH 11----3333----------------- >Protein S100-A9; SWP:P06702; PDB:1XK4C; KMSQLERNIETIINTFHQYSVKLGHPDTLNQGEFKELVRKDLQNFLKKENKNEKVIEHIM ------------------1111--1111------------------3333---------- EDLDTNADKQLSFEEFIMLMARLTWASHE ---1111----3333-------------- >SNURPORTIN-1; SWP:O95149; PDB:1XK5A; HYANQLMLSEWLIDVPSDLGQEWIVVVCPVGKRALIVASRGSTSAYTKSGYCVNRFSSLL ---------------1111---------------------------1111---------- PGGNRRNSTAKDYTILDCIYNEVNQTYYVLDVMCWRGHPFYDCQTDFRFYWMHSKLPEEE --------------------3333----------iiii-11113333-------3333-- GLGEKTKLNPFKFVGLKNFPCTPESLCDVLSMDFPFEVDGLLFYHKQTHYSPGSTPLVGW 1111-1111-------------------1111------------3333------1111-- LRPYMVSDVLGVAVPAGPLTTKPD -1111--1111-----3333---- >CROTONOBETAINYL-COA:CARNI; SWP:P31572; PDB:1XK7A; HHLPPKFGPLAGLRVVFSGIEIAGPFAGQFAEWGAEVIWIENVAWADTIRVQPNYPQLSR -------1111------------------3333-------------3333---3333--- 
RNLHALSLNIFKDEGREAFLKLETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLS --------1111-------------------------1111---------1111------ GFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPPAFPYTADYFSGLTATTAALAALH ------3333---------------3333--1111-----3333---------------- KVRETGKGESIDIAYEVLRGQYFDYFNGGECPRSKGKDPYYAGCGLYKCADGYIVELVGI -----------------------3333------iiii---2222---------------- TQIEECFKDIGLAHLLGTPEIPEGTQLIHRIECPYGPLVEEKLDAWLATHTIAEVKERFA -----------3333--33332222---33331111------------------------ ELNIACAKVLTVPELESNPQYVARESITQWQTDGRTCKGPNIPKFKNNPGQIWRGPSHGD ----------33331111------------------------------------------ TAAILKNIGYSENDIQELVSKGLAKVED -----1111------------------- >DIVALENT CATION TOLERANT ; SWP:O60888; PDB:1XK8A; YVPGSVSAAFVTCPNEKVAKEIARAVVEKRLAACVNLIPQITSIYEWKGKIEEDSEVLMM -2222-----------------------------------------%%%%---------- IKTQSSLVPALTDFVRSVHPYEVAEVIALPVEQGNFPYLQWVRQVTE ---3333-----------------------------------1111- >RAN-BINDING PROTEIN 2; SWP:P49792; PDB:1XKEA; GSGEEDEKVLYSQRVKLFRFDAEVSQWKERGLGNLKILKNEVNGKLRMLMRREQVLKVCA ---------------------1111--------------3333---------1111---- NHWITTTMNLKPLSGSDRAWMWLASDFSDGDAKLEQLAAKFKTPELAEEFKQKFEECQRL ----1111----2222--------------------------3333-------------3 LLDIPLQTPK 333------- >HYPOTHETICAL PROTEIN RV26; SWP:Q7TY72; PDB:1XKFA; TTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDRLHGMLTDRDIVIKGLAA -3333---------1111---------1111--------%%%%-------------3333 GLDPNTATAGELAIYYVDANASIQEMLNVMEEHQVRRVPVISEHRLVGIVTEADIARHLP --1111-3333------1111--------------------%%%%----------1111- >MAJOR MITE FECAL ALLERGEN; SWP:P08176; PDB:1XKGA; SIKTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGGAINHLSDLSLDEFKNRFLM -------------------------------------------1111------------- SAEAFEHLKTQFDNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYL -------3333---------------3333------------------------------ AYRDQSLDLAEQELVDCASQHGCHGDTIPRGIEYIQHNGVVQESYYRYVAREQSCRRPNA --------------------1111--3333-----------3333--------------- 
QRFGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTIIQRDNGYQPN ------------------------------------------------------------ YHAVNIVGYSNAQGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPYVVILGQTG -----------iiii---------1111-iiii-------2222---------!!!!- >VON EBNER'S GLAND PROTEIN; SWP:P31025; PDB:1XKIA; DVSGTWYLKAMTVNLESVTPMTLTTLEGGNLEAKVTMSGRCQEVKAVLEKTDEPGKYTAD --------------1111-------2222-----------------------2222---i GGKHVAYIIRSHVKDHYIFYSEGEGKPVRGVKLVGRDPKNNLEALEDFEKAAGARGLSTE iii---------2222------------------------------------11111111 SILIPRQS -------- >EPIDERMAL GROWTH FACTOR R; SWP:Q9H2C9; PDB:1XKKA; ALLRILKETEFKKIKVLGSGAFGTVYKGLWIPVKIPVAIKELREKANKEILDEAYVMASV ------1111---------------------------------------------1111- DNPHVCRLLGICLTSTVQLITQLMPFGCLLDYVREHKDNIGSQYLLNWCVQIAKGMNYLE -1111------------------1111-------------3333---------------1 DRRLVHRDLAARNVLVKTPQHVKITDFGLAKLLGAEEKVPIKWMALESILHRIYTHQSDV 111------3333----3333---------1111-----1111-3333------3333-- WSYGVTVWELMTFGSKPYDGIPASEISSILEKGERLPQPPICTIDVYMIMVKCWMIDADS ---------1111--------3333----1111-----1111--------3333--3333 RPKFRELIIEFSKMARDPQRYLVIQGDERMSNFYRALMDEVVDADEYLI ----------------3333---2222---------------3333--- >SALICYLIC ACID-BINDING PR; SWP:Q6RYA0; PDB:1XKLA; EGKHFVLVHGACHGGWSWYKLKPLLEAAGHKVTALDLAASGTDLRKIEELRTLYDYTLPL ---------22223333-------------------2222-----3333----------- ELESLSADEKVILVGHSLGGNLGLAEKYPQKIYAAVFLAAFPDSVHNSSFVLEQYNERTP ---------------------------1111---------------1111----3333-1 AENWLDTQFLPYGSPEEPLTSFFGPKFLAHKLYQLCSPEDLALASSLVRPSSLFEDLSKA 111!!!!------1111---------------1111------------------3333-- KYFTDERFGSVKRVYIVCTEDKGIPEEFQRWQIDNIGVTEAIEIKGADHALCEPQKLCAS -----------------1111----------------------2222------------- LLEIAHKYN --------- >PUTATIVE PEPTIDYL-ARGININ; SWP:Q8KCB6; PDB:1XKNA; SEPTYFPPEWAPHASTWLSWPHKLESWPGKFEPVPAVFAELAYQLSRSETVNINVLDDAE -------1111-----------333322223333-------------------------- AQARELLKERDPEGKYAERIVFHRIPTNDAWCRDHGPNYVIRTQDGRRDKVINWEYNAWG 
----------1111-3333-----------3333---------iiii---------%%%% GKYEPYDDDNAVPERVAKAQGLPVSTGVLEGGAIDVNGAGLLLTTTACLLNPNRNPSLGK -----3333--------------------1111-----------3333--11111111-- AEIEAQLRRYLGIEKVLWLGDGIAGDDTDGHVDDARFVNENTVVIAVEEDPEDENYKPLR ------------------------------1111---------------1111------- ENYELLKTTGLDGKPLNIVKLPPEPVYYDGERLPASYANFYIANTVVLVPTYRCPRDQQA ---------1111--------------%%%%----3333---1111-------1111--- IDILQQCFPKREVVGIDCSDLIWGLGAIHCVTHEEPALEHHHHH -------1111--------3333---3333---------3333- >CHAPERONE PROTEIN SYCN; SWP:P68640; PDB:1XKPA; QFRGESVQIVSGTLQSIADMAEEVTELSLDRLSDSQARVSDVEEQVNQYLSVPELEQQNV -iiii---------------3333------------------------------------ SELLSLLSNSPNISLSQLAYLEGSEEPSEQFMLCGLRDALGRPELAHLSHLVEQALVSMA -------------3333--3333--3333------------3333--------------- EEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQRFPNGDI -------------------1111-------------------------------1111-- DSVILFLQALSADLQSQQSGSGRELGIVISDLQLEFG -------------1111--3333-------------- >Chaperone protein sycN; SWP:P61380; PDB:1XKPB; SWIEPIISHFCQDLGVPTSSPLSPLIQLEMAQSGTLQLEQHGATLTLWLARSLAWHRCED 1111-------1111--------------------------------------1111-33 AMVALTLTAAQSGALPLRAGWLGESQLVLFVSLDERSLTLPLLHQAFEQLLRLQQEVLA 33--1111-------------------------3333---------------------- >Chaperone protein yscB; SWP:Q56973; PDB:1XKPC; QNLLNLAASLGRPFVADQGVYRLTIDHLVMLAPHGSELVLRTPIDAPMLREGNNVNVTLL 3333---1111----------------------!!!!--------1111-!!!!------ RSLMQQALAWAKRYPQTLVLDDCGQLVLEARLRLQELDTHGLQEVINQLALLEHLIPQLT --------------------1111--------3333------------------3333-- P - >SHORT-CHAIN REDUCTASE FAM; SWP:Q9N5G4; PDB:1XKQA; PRFSNKTVIITGSSNGIGRTTAILFAQEGANVTITGRSSERLEETRQIILKSGVSEKQVN -------------------------1111-------------------1111--1111-- SVVADVTTEDGQDQIINSTLKQFGKIDVLVNNAGAAIPDAFGTTGTDQGIDIYHKTLKLN ----1111------------------------------1111--33333333-------- LQAVIEMTKKVKPHLVASKGEIVNVSSIVAGPQAQPDFLYYAIAKAALDQYTRSTAIDLA 
---------------------------------------------------------333 KFGIRVNSVSPGMVETGFTNAMGMPDQASQKFYNFMASHKECIPIGAAGKPEHIANIILF 3---------------3333------------------33333333---3333------- LADRNLSFYILGQSIVADGGTSLVMGTQAHDV -----------------iiii---3333---- >CHEMOTAXIS PROTEIN CHEC; SWP:NA; PDB:1XKRA; HMKISERQKDLLKEIGNIGAGNAATAISYMINKKVEISVPNVEIVPISKVIFIAKDPEEI ---------------------------------------------11111111--1111- VVGVKMPVTGDIEGSVLLIMGTTVVKKILEILTGRAPDNLLNLDEFSASALREIGNIMCG ------------------------------------------------------------ TYVSALADFLGFKIDTLPPQLVIDMISAIFAEASIEELEDNSEDQIVFVETLLKVEEEEE ------------------------------------------------------1111-- PLTSYMMMIPKPGYLVKIFERMGIQ ----------2222-----1111-- >NUCLEAR PORE COMPLEX PROT; SWP:Q8WUM0; PDB:1XKSA; ESVNYDVKTFGSSLPVKVMEALTLAEVDDQLTINIDEGGWACLVCKEKLIIWKIALSPIT --------------3333-------3333------3333-----!!!!----------33 KLSVCKELQLPPSDFHWSADLVALSYSSTQAVAVMVATREGSIRYWPSLAGEDTYTEAFV 33---------------3333-------3333-----1111------3333--------- DKTYSFLTAVQGGSFILSSSGSQLIRLIPESSGKIHQHILPQGQGMSDLTLSSVLWDRER ------------------1111-------3333------------------------111 SSFYSLTSSNISKWELDDSSEKHAYSWDINRALKENITDAIWGSESNYEAIKEGVNIRYL 1--------------------------3333----------1111-33331111------ DLKQNCDGLVILAAAWHSADNPCLIYYSLITIEDNGCQMSDAVTVEVTQYNPPFQSEDLI ----3333--------1111-----------------------------------3333- LCQLTVPNFSNQTAYLYNESAVYVCSTGTGKFSLPQEKIVFNAQGDSVLGAGACGGVPII -------1111---------------!!!!------------iiii-------%%%%--- FSRNSGLVSITSRE -------------- >FATTY ACID SYNTHASE; SWP:P49327; PDB:1XKTA; VNLRSLLVNPEGPTLMRLNSVQSSERPLFLVHPIEGSTTVFHSLASRLSIPTYGLQCTRA 3333----1111--------------------1111-3333-3333-----------333 APLDSIHSLAAYYIDCIRQVQPEGPYRVAGYSYGACVAFEMCSQLQAQHNSLFLFDGSPT 3---------------3333-----------3333------------------------- YPGCEAEAETEAICFFVQQFTDMEHNRVLEALLPLKGLEERVAAAVDLIIKSHQGLDRQE ----------------3333-----------1111-----------------1111---- LSFAARSFYYKLRAAEQYTPKAKYYGNVMLLRAKTGLGADYNLSQVCDGKVSVHVIEGDH 
---------------------------------------%%%%----------------- RTLLESGLESIISIIHSSLA -1111--------------- >DECORIN; SWP:P21793; PDB:1XKUA; GPVCPFRCQCHLRVVQCSDLGLEKVPKDLPPDTALLDLQNNKITEIKDGDFKNLKNLHTL ----2222--%%%%---------------1111-------------1111---1111--- ILINNKISKISPGAFAPLVKLERLYLSKNQLKELPEKMPKTLQELRVHENEITKVRKSVF ----------22223333--------------------1111-------------33332 NGLNQMIVVELGTNPLKSSGIENGAFQGMKKLSYIRIADTNITTIPQGLPPSLTELHLDG 222-------------3333-22223333--------------------3333------- NKITKVDAASLKGLNNLAKLGLSFNSISAVDNGSLANTPHLRELHLNNNKLVKVPGGLAD ------33332222----------------22223333------------------3333 HKYIQVVYLHNNNISAIGSNDFCPPGYNTKKASYSGVSLFSNPVQYWEIQPSTFRCVYVR ---------------------------1111-------------1111-33331111-33 AAVQL 33--- >DIHYDRODIPICOLINATE SYNTH; SWP:Q81WN7; PDB:1XKYA; MIDFGTIATAMVTPFDINGNIDFAKTTKLVNYLIDNGTTAIVVGGTTGESPTLTSEEKVA ---------------1111-------------1111--------33333333-------- LYRHVVSVVDKRVPVIAGTGSNNTHASIDLTKKATEVGVDAVMLVAPYYNKPSQEGMYQH --------%%%%----------------------1111---------------------- FKAIAESTPLPVMLYNVPGRSIVQISVDTVVRLSEIENIVAIKDAGGDVLTMTEIIEKTA ----1111--------3333---------------1111--------------------1 DDFAVYSGDDGLTLPAMAVGAKGIVSVASHVIGNEMQEMIAAFQAGEFKKAQKLHQLLVR 111-----3333----1111------3333------------------------------ VTDSLFMAPSPTPVKTALQMVGLDVGSVRLPLLPLTEEERVTLQSVMQSIPR --1111------------1111------------------------1111-- >REGULATORY PROTEIN BLAR1; SWP:P18357; PDB:1XKZA; NYKKPLHNDYQILDKSKIFGSNSGSFVMYSMKKDKYYIYNEKESRKRYSPNSTYKIYLAM --------------3333!!!!--------1111-----3333-------3333------ FGLDRHIINDENSRMSWNHKHYPFDAWNKEQDLNTAMQNSVNWYFERISDQIPKNYTATQ --1111--3333-----------3333---------------------1111-------- LKQLNYGNKNLGSYKSYWMEDSLKISNLEQVIVFKNMMEQNHFSKKAKNQLSSSLLIKKN -----!!!!-!!!!-1111---------------------------------1111---- EKYELYGKTGTGIVNGKYNNGWFVGYVITNHDKYYFATHLSDGKPSGKNAELISEKILKE -------------iiii-----------1111---------------------------- MGVL ---- >SECRETION CONTROL PROTEIN; SWP:P16161; 
PDB:1XL3C; AYDLSEFMGDIVALVDRWAGIHDIEHLANAFSLPTPEIVRFYQDLRMFRLFPLGVFSDEE --3333-------------3333----3333---3333--------3333-3333----- QRQNLLQMCQNAIDMAIESEEEELSELD ---------------------1111--- >PEROXISOMAL CARNITINE O-O; SWP:Q9DC50; PDB:1XL7A; ERTFQYQDSLPSLPVPALEESLKKYLESVKPFANEDEYKKTEEIVQKFQEGAGKRLHQKL -11111111-------------------3333---------------------------- LERARGKRNWLEEWWLNVAYLDVRIPSQLNVNFVGPCPHFEHYWPAREGTQLERGSLWHN --3333-------------3333--------------1111-----2222-3333----- LNYWQLLRREKLPVHKSGNTPLDNQFRLFSTCKVPGITRDSINYFKTESEGHCPTHIAVL ----------------!!!!--------------------------3333---------- CRGRAFVFDVLHEGCLITPPELLRQLTYIHKKCSNEPVGPSIAALTSEERTRWAKAREYL -----------iiii-------------------------33331111------------ ISLDPENLTLLEKIQTSLFVYSIEDSSPHATPEEYSQVFELLGGDPSVRWGDKSYNLISF ------------------------------33333333-----------1111------1 ANGIFGCCCDHAPYDAVVNIAHYVDERVLETEGRWKGSEKVRDIPLPEELVFTVDEKILN 111-------3333----------------%%%%-------------------------- DVSQAKAQHLKAASDLQIAASTFTLHPDTFIQLALQLAYYRLHGRPGCCYETATRYFYHG ------------1111----------------------------------------2222 RTETVRSCTVEAVRWCQSQDPSASLLERQQKLEAFAKHNKKDCSHGKGFDRHLLGLLLIA -------------------1111-------------------1111-------------- KEEGLPVPELFEDPLFSRSGGGGNFVLSTSLVGYLRVQGVVVPVHNGYGFFYHIRDDRFV 1111---3333-----1111-----------------------1111------------- VACSSWRSCPETDAEKLVQIFHAFHDIQLNTAHL -----3333--------------------3333- >SHE2P; SWP:P36068; PDB:1XLYA; DIKVTPGTSELVEQILALLSRYLSSYIHVLNKFISHLRRVATLRFERTTLIKFVKKLRFY ----1111---------------------------33331111----------------- NDSVLSYNASEFINEGKNELDPEADSFDKVILPIASMFVKSVETFDLLNYYLTQSLQKEI ---11113333------------------------------------------------- LSKTLNEDLTLTAESILAIDDTYNHFVKFSQWMIESLRIGSNLLDLEVVQFAIKSADEDN -----3333-------------------------1111--1111---------------1 IFLQEILPVNSEEEFQTLSAAWHSILDGKLSALDEEFDVVATKW 111------------------------------------3333- >PEPTIDE METHIONINE SULFOX; SWP:P54155; PDB:1XM0A; 
MAYNKEEKIKSLNRMQYEVTQNNGTEPPFQNEYWDHKEEGLYVDIVSGKPLFTSKDKFDS ---3333-----33333333----------1111------------------3333---- QCGWPSFTKPIEEEVEEKLDTSHGMIRTEVRSRTADSHLGHVFNDGPGPNGLRYCINSAA -------------------------------3333---------------------1111 LRFVPKHKLKEEGYESYLHLFNKLEHH ----11113333-33333333------ >THIAZOLE BIOSYNTHESIS PRO; SWP:O31618; PDB:1XM3A; SLTIGGKSFQSRLLLGTGKYPSFDIQKEAVAVSESDILTFAVRRNIFLEQLDLSKYTLLP ---iiii-----------------------3333------3333-------3333----- NTAGASTAEEAVRIARLAKASGLCDIKVEVIGCSRSLLPDPVETLKASEQLLEEGFIVLP ------------------1111-----------------------------1111----- YTSDDVVLARKLEELGVHAIPGASPIGSGQGILNPLNLSFIIEQAKVPVIVDAGIGSPKD ------------3333--------2222----------------------------3333 AAYAELGADGVLLNTAVSGADDPVKARAKLAVEAGRLSYEAGRIPLKQYGTASSPGE -------------3333----3333-------------1111--------3333--- >HYPOTHETICAL UPF0054 PROT; SWP:P77385; PDB:1XM5A; MSQVILDLQLACEDNSGLPEESQFQTWLNAVIPQFQEESEVTIRVVDTAESHSLNLTYRG -------------------3333--------1111------------------------- KDKPTNVLSFPFEVPPGMEMSLLGDLVICRQVVEKEAQEQGKPLEAHWAHMVVHGSLHLL ---------------------------------------------------------111 GYDHIEDDEAEEMEALETEIMLALGYEDPYIA 1--------------------1111------- >HYPOTHETICAL PROTEIN AQ_1; SWP:O67582_AQUAE; PDB:1XM7A; AMMYFISDTHFYHENIINLNPEVRFKGFEIVILTNLLKVLKPEDTLYHLGDFTWHFNDKN ---------22223333-------2222------------1111-------------111 EYLRIWKALPGRKILVMGNHDKDKESLKEYFDEIYDFYKIIEHKGKRILLSHYPAKDPIT 1----------------1111-33331111------------------------------ ERYPDRQEMVREIYFKENCDLLIHGHVHWNREGCACKDYRIECINANVEWNDYKPISERE ----------------------------------------------3333%%%%------ IDKLI ----- >GLYOXALASE II; SWP:Q9SID3; PDB:1XM8A; MQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPIIDSLKRSGRNLTYILNTHHHYDHT -------------------------------------------------------11111 GGNLELKDRYGAKVIGSAMDKDRIPGIDMALKDGDKWMFAGHEVHVMDTPGHTKGHISLY 111-------------33331111-------2222---iiii------------------ FPGSRAIFTGDTMFSLSCGKLFEGTPKQMLASLQKITSLPDDTSIYCGHEYTLSNSKFAL 
3333----!!!!-2222------------------11111111----------------- SLEPNNEVLQSYAAHVAELRSKKLPTIPTTVKMEKACNPFLRSSNTDIRRALRIPEAADE --1111-------------1111--------------33331111---------1111-- AEALGIIRKAKDDF ----------1111 >PLAKOPHILIN 1; SWP:Q13835; PDB:1XM9A; GLTIPKAVQYLSSQDEKYQAIGAYYIQHTCFQDESAKQQVYQLGGICKLVDLLRSPNQNV ---3333-3333--3333----------------3333-------------3333----- QQAAAGALRNLVFRSTTNKLETRRQNGIREAVSLLRRTGNAEIQKQLTGLLWNLSSTDEL --------------------------3333---3333----------------------3 KEELIADALPVLADRVIIPFSGWCVVDPEVFFNATGCLRNLSSADAGRQTMRNYSGLIDS 333-------------1111-------------------1111-------1111------ LMAYVQNCVAASRCDDKSVENCMCVLHNLSYRLDAEVPTRYRQLEYNALPEEETNPKGSG ---------------1111--------11113333----------------------333 WLYHSDAIRTYLNLMGKSKKDATLEACAGALQNLTASKGLMSSGMSQLIGLKEKGLPQIA 3--------------------------------------3333---------------33 RLLQSGNSDVVRSGASLLSNMSRHPLLHRVMGNQVFPEVTRLLTSHTGNTSNSEDILSSA 33---------------------3333-3333---------------------------- CYTVRNLMASQPQLAKQYFSSSMLNNIINLCRSSASPKAAEAARLLLSDMWSSKELQGVL ------33333333-----3333----3333-1111----------1111--1111---- >PREDICTED TRANSCRIPTIONAL; SWP:Q4CEJ8; PDB:1XMAA; SDVIRGYVDTIILSLLIEGDSYGYEISKNIRIKTDELYVIKETTLYSAFARLEKNGYIKS -3333----------3333--------------iiii----------------------- YYGEETKRRTYYRITPEGIKYYKQKCEEWELTKKVINKFVK ----------------------------------------- >IAA-AMINO ACID HYDROLASE ; SWP:P54970; PDB:1XMBA; KLLEFAKSPEVFDWMVKIRRKIHENPELGYEELETSKLIRSELELIGIKYRYPVAITGVI ---------------------------2222----------------------------- GYIGTGEPPFVALRADMDALPIQEGVEWEHKSKIAGKMHACGHDGHVTMLLGAAKILHEH ---------------------------1111--2222----------------------1 RHHLQGTVVLIFQPAEEGLSGAKKMREEGALKNVEAIFGIHLSARIPFGKAASRAGSFLA 111------------1111---------1111--------------2222---------- GAGVFEAVITGKTIDPVVAASSIVLSLQQLVSRETDPLDSKVVTVSKVNPDSITIGGTLR --------------------------3333-----3333--------------------- AFTGFTQLQQRVKEVITKQAAVHRCNASVNLTPNGREPMPPTVNNKDLYKQFKKVVRDLL 
--------------------------------%%%%------------------------ GQEAFVEAAPVMGSEDFSYFAETIPGHFSLLGMQDETNGYASSHSPLYRINEDVLPYGAA 1111----------3333-1111-----------1111---2222-----3333------ IHASMAVQYLKEKAS --------------- >DOUBLE-STRANDED RNA-SPECI; SWP:P55265; PDB:1XMKA; GSHMASLDMAEIKEKICDYLFNVSDSSALNLAKNIGLTKARDINAVLIDMERQGDVYRQG --1111-----------------------------1111-----------1111------ TTPPIWHLTDKKRERMQIK ---------3333------ >PHOSPHORIBOSYLAMINOIMIDAZ; SWP:Q63GT9; PDB:1XMPA; KSLVGVIMGSTSDWETMKYACDILDELNIPYEKKVVSAHRTPDYMFEYAETARERGLKVI ---------3333-----------------------1111------------1111---- IAGAGGAAHLPGMVAAKTNLPVIGVPVQSKALNGLDSLLSIVQMPGGVPVATVAIGKAGS --------------1111-------------iiii------------------------- TNAGLLAAQILGSFHDDIHDALELRREAIEKDVRE ----------3333--------------------- >THYMIDINE KINASE; SWP:Q9PPP5; PDB:1XMRA; IGWIEFITGPMFAGKTAELIRRLHRLEYADVKYLVFKPKISVEVESAPEILNYIMSNSFN ------------------------3333-----------------3333----------1 DETKVIGIDEVQFFDDRICEVANILAENGFVVIISGLDKNFKGEPFGPIAKLFTYADKIT 111------3333--------------------------3333--!!!!3333------- KLTAICNECGAEATHSLRKIDGKHADYNDDIVKIGCQEFYSAVCRHHHKVPNRPYLNSNS -------------------%%%%--1111------3333----3333---------1111 EEFIKFFKNK ----3333-- >PUTATIVE ACETYLTRANSFERAS; SWP:Q9CAQ2; PDB:1XMTA; PPKIVWNEGKRRFETEDHEAFIEYKMRNNGKVMDLVHTYVPSFKRGLGLASHLCVAAFEH -------1111---1111---------iiii---------1111---------------- ASSHSISIIPSCSYVSDTFLPRNPSWKPLIHSEVF -1111--------------11111111---1111- >Chimeric CD3 mouse Epsilo; SWP:P22646; PDB:1XMWA; DDAENIEYKVSISGTSVELTCPLDSDENLKWEKNGQELPQKHDKHLVLQDFSEVEDSGYY -------------------------1111---iiii------------------------ VCYTPASNKNTYLYLKARVGSADDAKKDAAKKDDAKKDDAKKDGSQTNKAKRALEVLEAE ----------------------------------------------------------%% DKVILKCNSSITLLQGTAGQEVSDNKTLNLGKRIEDPRGMYQCGENAKSFTLQVYYRM %%------------------------------1111---------------------- >HYPOTHETICAL PROTEIN VC18; SWP:NA; PDB:1XMXA; AMIHVGIIDQDPVRLVTPLLDHRTVSRHIIFIGDHTQTVIYQRLSDVLNKRNISTDFFEI 
11---------11113333-1111---------3333----------3333--------- PAGSNTSAIKSAIRELAETLKARGEEVKFNASCGLRHRLLSAYEVFRSYHWPIFVVEPNS ----3333------------1111----------3333--------1111---------- DCLCWLYPEGNNDTQVQDRITIADYLTIFGARGEFNSPQLDQQLYQLGERWASNALELGP -------3333---------------1111------3333--------------3333-- GLATLNYLATTCRKEQKLDVELSDKQQGYRELNLLLSDLVEAKIASYENGILTFINEEAR ----------------------3333---------------------iiii--------- RFANGEWLETLVHSTVKQIQDDMPTIQDRSLNVQVYRQLGEREVRNELDVATVVNNKLHI ------------------33333333------------!!!!-----------%%%%--- IECKTKGMRDGDDTLYKLESLRDLLGGLQARAMLVSFRPLRHNDITRAEDLGLALIGPDE --------------------------1111--------------------------3333 LKDLKTHLTQWFKAAGGN ------------------ >BH1534 UNKNOWN CONSERVED ; SWP:Q9KCN5; PDB:1XN5A; MTRLPDIKKEVRFNAPIEKVWEAVSTSEGLAFWFMENDLKAETGHHFHLQSPFGPSPCQV ---------------3333-1111--3333-----------2222-----3333------ TDVERPIKLSFTWDTDGWSVTFHLKEEENGTIFTIVHSGWKQGDTKVEKAGAESAVVHER --------------------------3333------------------------------ MDRGWHDLVNERLRQIVE ------------------ >HYPOTHETICAL PROTEIN BC47; SWP:Q816V6; PDB:1XN6A; MEQQNTLNDIKQTIVFNASIQKVWSVVSTAEGIASWFMPNDFVLEVGHEFHVQSPFGPSP ------------------3333-3333---3333----------2222------------ CKVLEIDEPNHLSFSWDTDGWVVSFDLKDLGDNKTEFTLIHGGWKHPDEILPKANAKSSI ------------------------------%%%%----------------------3333 IRDRMSGGWVAIVNEKLKKVVEG ----------------------- >HYPOTHETICAL PROTEIN YHGG; SWP:P46845; PDB:1XN7A; MASLIQVRDLLALRGRMEAAQISQTLNTPQPMINAMLQQLESMGKAVRIQEEPDGCLSGS --3333-----------3333--1111--------------------------------- CKSCPEGKACLREWWALR ------------------ >30S RIBOSOMAL PROTEIN S24; SWP:Q8PZ95; PDB:1XN9A; MDIKIIKDKKNPLLNRRELDFIVKYEGSTPSRNDVRNKLAAMLNAPLELLVIQRIKTEYG ----------------------------------------1111-3333----------- MQESKGYAKLYEDADRMKQVEQEYVLKRNAVPGSETEGEEA ----------------------------------------- >DNA-REPAIR PROTEIN XRCC1; SWP:P18887; PDB:1XNAA; MPEIRLRHVVSCSSQDSTHCAENLLKADTYRKWRAAKAGEKTISVVLQLEKEEQIHSVDI 
------------------------------------------------------------ GNDGSAFVEVLVGSSAGGAGEQDYEVLLVTSSFMSPSESRSGSNPNRVRMFGPDKLVRAA ------------------------------------3333-----------3333----1 AEKRWDRVKIVCSQPYSKDSPFGLSFVRFHS 111---------------------------- >XYLANASE; SWP:P48793; PDB:1XND; QTIGPGTGYSNGYYYSYWNDGHAGVTYTNGGGGSFTVNWSNSGNFVAGKGWQPGTKNKVI ------------------------------!!!!-------------------------- NFSGSYNPNGNSYLSIYGWSRNPLIEYYIVENFGTYNPSTGATKLGEVTSDGSVYDIYRT ------------------------------------1111---------iiii------- QRVNQPSIIGTATFYQYWSVRRNHRSSGSVNTANHFNAWASHGLTLGTMDYQIVAVEGYF ------1111--------------------3333-----1111----------------- SSGSASITVS ---------- >HYPOTHETICAL PROTEIN PF04; SWP:Q8U3J6; PDB:1XNEA; MKVYRLYLKDEYLEMVKSGKKRIEVRVAYPQLKDIKRGDKIIFNDLIPAEVVEVKKYETF --------3333-------------------33332222-------------------33 RQVLREEPIDKIFPDKPSFEKALKRFHNMYPKWKEYRYGVLAIKFRVLGRDKE 33------33333333------------------------------------- >LIPOPROTEIN NLPI; SWP:P39833; PDB:1XNFA; KSEVLAVPLQPTLQQEVILAREQILASRALTDDERAQLLYERGVLYDSLGLRALARNDFS 1111--------------------------3333------------1111---------- QALAIRPDPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRDK --------3333-------1111---------------11113333-------1111--- LAQDDLLAFYQDDPNDPFRSLWLYLAEQKLDEKQAKEVLKQHFEKSDKEQWGWNIVEFYL ------------1111---------------------------------3333----111 GNISEQTLERLKADATDNTSLAEHLSETNFYLGKYYLSLGDLDSATALFKLAVANNVHNF 1--3333-----------------------------1111------------33331111 VEHRYALLELSLLGQD 3333------------ >NH(3)-DEPENDENT NAD(+) SY; SWP:O25096; PDB:1XNGA; KDYQKLIVYLCDFLEKEVQKRGFKKVVYGLSGGLDSAVVGVLCQKVFKENAHALLMPSSV ------------------1111------------------------!!!!-------111 SMPENKTDALNLCEKFSIPYTEYSIAPYDAIFSSHFKDASLTRKGNFCARLRMAFLYDYS 13333--------------------3333------1111--------------------- LKSDSLVIGTSNKSERMLGYGTLFGDLACAINPIGELFKTEVYELARRLNIPKKILNKPP ---------------------2222------1111-----------1111-3333----- SADLFVGQSDEKDLGYPYSVIDPLLKDIEALFQTKPIDTETLAQLGYDEILVKNITSRIQ 
----22223333----3333---------------------3333--------------- KNAFKLELPAIAKRF ---1111-------- >ENDOXYLANASE 11A; SWP:Q8J1V6; PDB:1XNKA; TLTSSATGTHNGYYYSFWTDGQGNIRFNLESGGQYSVTWSGNGNWVGGKGWNPGTDNRVI ---------iiii-----------------!!!!-------------------------- NYTADYRPNGNSYLAVYGWTRNPLIEYYVVESFGTYDPSTGATRMGSVTTDGGTYNIYRT ------------------------------------1111---------iiii------- QRVNAPSIEGTKTFYQYWSVRTSKRTGGTVTMANHFNAWRQAGLQLGSHDYQIVATEGYY ------1111--------------------3333-------------------------- SSGSATVNVG ---------- >TRANSCRIPTIONAL REGULATOR; SWP:Q8U030; PDB:1XNPA; GEELNRLLDVLGNETRRRILFLLTKRPYFVSELSRELGVGQKAVLEHLRILEEAGLIESR ---------------------3333---------1111-------------3333----- VEKIPRGRPRKYYIKKGLRLEILLTPTLFGSEYEAKGVRKSPEYEQAKELIKSQEPINVK --------------2222----------------------3333-----1111------- RELAEFLHELNERIREIIEEKRELEEARILIETYIENTRRLAEENRQIIEEIFRDIEKIL ---------------------------------------3333------------1111- PPGYARSLK 3333----- >CONSTITUTIVE ANDROSTANE R; SWP:O35627; PDB:1XNXA; LQLNQQQKELVQILLGAHTRHVGPLFDQFVQFKPPAYLFMHHRPFQPRGPVLPLLTHFAD ---------------------1111--3333---3333----1111-------------- INTFMVQQIIKFTKDLPLFRSLTMEDQISLLKGAAVEILHISLNTTFCLQTENFFCGPLC ----------------3333-----------------------1111--------!!!!- YKMEDAVHAGFQYEFLESILHFHKNLKGLHLQEPEYVLMAATALFSPDRPGVTQREEIDQ -3333-1111----------------1111------------------2222-3333--- LQEEMALILNNHIMEQQSRLQSRFLYAKLMGLLADLRSINNAYSYELQRLEE ---------------3333--------------------------------- >RECOMBINASE CRE; SWP:P06956; PDB:1XO0A; SDEVRKNLMDMFRDRQAFSEHTWKMLLSVCRSWAAWCKLNNRKWFPAEPEDVRDYLLYLQ -------------1111-3333---------------1111------------------- ARGLAVKTIQQHLGQLNMLHRRSGLPRPSDSNAVSLVMRRIRKENVDAGERAKQALAFER --------------------------3333----------------------------33 TDFDQVRSLMENSDRCQDIRNLAFLGIAYNTLLKIAEIARIRVKDISRTDGGRMLIHIGR 33---------------------------------------1111----iiii------- TKTLVSTAGVEKALSLGVTKLVERWISVSGVADDPNNYLFCRVRKNGVAAPSATSQLSTR 
--------------3333-----------11113333------1111------------- ALEGIFEATHRLIYGAKDDSGQRYLAWSGHSARVGAARDMARAGVSIPEIMQAGGWTNVN ---------------------------1111---------1111-3333----------- IVMNYIRNLDSETGAMVRLLED 3333-1111------------- >5'-EXONUCLEASE; SWP:P06229; PDB:1XO1A; RRNLMIVDGTNLGFRFPFASSYVSTIQSLAKSYSARTTIVLGDKGKSVFRLEHLPEYAFF ------------1111-------------------------------------1111-33 EYLKDAFELCKTTFPTFTIRGVEADDMAAYIVKLIGHLYDHVWLISTDGDWDTLLTDKVS 33------3333------22223333--------3333-------------11111111- RFSFTTRREYHLRDMYEHHNVDDVEQFISLKAIMGDLGDNIRGVEGIGAKRGYNIIREFG ----------3333-1111----------------3333----2222------------- NVLDIIDQLPLPGKQKYIQNLNASEELLFRNLILVDLPTYCVDAIAAVGQDVLDKFTKDI -----1111-----3333------------------3333-------------------- LEIAE ----- >CALCIUM AND INTEGRIN-BIND; SWP:Q99828; PDB:1XO5A; LLAEYQDLTFLTKQEILLAHRRFCELLPQEQRSVESSLRAQVPFEQILSLPELKANPFKE -3333------------------11113333-33331111--3333------1111---- RICRVFSTSPAKDSLSFEDFLDLLSVFSDTATPDIKSHYAFRIFDFDDDGTLNREDLSRL --------1111---------------3333-------------1111------------ VNCLTRLSASEMKQLIDNILEESDIDRDGTINLSEFQHVISRSPDFASSFKIVL -----------------------1111----------------3333------- >PROPIONYL-COA CARBOXYLASE; SWP:NA; PDB:1XO6A; DIHTTAGKLADLRRRIEEATHAGSARAVEKQHAKGKLTARERIDLLLDEGSFVELDEFAR -----------------------------------------------2222--------- HRSTNFGLDANRPYGDGVVTGYGTVDGRPVAVFSQDFTVFGGALGEVYGQKIVKVMDFAL ----%%%%----2222--------iiii-------1111--------------------- KTGCPVVGINDSGGARIQEGVASLGAYGEIFRRNTHASGVIPQISLVVGPCAGGAVYSPA ---------------3333-----------------2222------------------33 ITDFTVMVDQTSHMFITGPDVIKTVTGEDVGFEELGGARTHNSTSGVAHHMAGDEKDAVE 33----------------------------3333---3333------------------- YVKQLLSYLPSNNLSEPPAFPEEADLAVTDEDAELDTIVPDSANQPYDMHSVIEHVLDDA -----1111--1111-------------3333---------1111-----------2222 EFFETQPLFAPNILTGFGRVEGRPVGIVANQPMQFAGCLDITASEKAARFVRTCDAFNVP -----33333333------iiii-------3333%%%%----------------1111-- 
VLTFVDVPGFLPGVDQEHDGIIRRGAKLIFAYAEATVPLITVITRKAFGGAYDVMGSKHL ------------3333----------------------------------------3333 GADLNLAWPTAQIAVMGAQGAVNILHRRTIADAGDDAEATRARLIQEYEDALLNPYTAAE -------1111-------------------------1111--------------333311 RGYVDAVIMPSDTRRHIVRGLRQLRTKRESLPPKKHGNIPL 11------3333-----------1111-------------- >CYCLOPHILIN; SWP:Q4DPB9; PDB:1XO7A; MPVVTDKVYFDITIGDEPVGRVVIGLFGNDVPKTVENFKQLASGENGFGYKGSIFHRVIR -------------iiii--------------------------------2222-----22 NFMIQGGDFTNFDGTGGKSIYGTRFDDENLKIKHFVGAVSMANAGPNSNGSQFFVTTAPT 22----------------1111------------2222---------------------1 PWLDGRHVVFGKVVEGMDVVKKVENTKTGLNDKPKKAVKINDCGVL 111------------3333---1111--2222-------------- >AT1G01470; SWP:O03983; PDB:1XO8A; MASLLDKAKDFVADKLTAIPKPEGSVTDVDLKDVNRDSVEYLAKVSVTNPYSHSIPICEI 3333-------------------------------------------------------- SFTFHSAGREIGKGKIPDPGSLKAKDMTALDIPVVVPYSILFNLARDVGVDWDIDYELQI ------------------------------------3333-------------------- GLTIDLPVVGEFTIPISSKGEIKLPTFKDFF ------------------------------- >HYPOTHETICAL PROTEIN AT3G; SWP:NA; PDB:1XO9A; MSRNPEVLWAQRSDKVYLTVALPDAKDISVKCEPQGLFSFSALGAQGERFEFSLELYGKI -----------3333----------------------------1111------------- MTEYRKNVGLRNIIFSIQKEERSWWTRLLKSEEKPAPYIKVDWNKWCDEDEEVNSETASD -----------------------------------1111--3333--------------- DESAFVNQDSESSDDDGLLYLPDLEKARNK -3333-3333-------------3333--- >OLIGOPEPTIDE-BINDING PROT; SWP:P42061; PDB:1XOCA; KPQQGGDLVVGSIGEPTLFNSLYSTDDASTDIENMLYSFLTKTDEKLNVKLSLAESIKEL -------------------3333--------------------1111------------% DGGLAYDVKIKKGVKFHDGKELTADDVVFTYSVPLSKDYKGERGSTYEMLKSVEKKGDYE %%%------------1111---3333-----11111111---33331111---------- VLFKLKYKDGNFYNNALDSTAILPKHILGNVPIADLEENEFNRKKPIGSGPFKFKEWKQG -----------------------33331111333311111111--------------222 QYIKLEANDDYFEGRPYLDTVTYKVIPDANAAEAQLQAGDINFFNVPATDYKTAEKFNNL 2------1111-----------------------------------3333--3333---- KIVTDLALSYVYIGWNEKNELFKDKKVRQALTTALDRESIVSQVLDGDGEVAYIPESPLS 
---------------33331111--------1111---------iiii--------3333 WNYPKDIDVPKFEYNEKKAKQMLAEAGWKDTNGDGILDKDGKKFSFTLKTNQGNKVREDI -----------------------1111-----------%%%%--------2222------ AVVVQEQLKKIGIEVKTQIVEWSALVEQMNPPNWDFDAMVMGWSLSTFPDQYDIFHSSQI --------------------------------------------------3333-3333- KKGLNYVWYKNAEADKLMKDAKSISDRKQYSKEYEQIYQKIAEDQPYTFLYYPNNHMAMP ----1111----------3333-------------------------------------1 ENLEGYKYHPKRDLYNIEKWWLAK 111--------11113333----- >SPRED1; SWP:Q66JG9; PDB:1XODA; SYARVRAVVMTRDDSSGGWLQLGGGGLSSVTVSKTTEFLVHGERLRDKTVVLECVLRRDL --------------------2222--------------------------------1111 VYNKVTPTFHHWRIGDKKFGLTFQSPADARAFDRGIRRAIEDLSQG ---------------------------------------------- >ESPA; SWP:Q47184; PDB:1XOUA; DVIDLFNKLGVFQAAILFAYYQAQSDLNLTTTVNNSQLEIQQSNTLNLLTSARSDQSLQY ----3333--------------3333---------------------------------- RTISGISL -------- >Orf3; SWP:O52124; PDB:1XOUB; GIVSQTRNKELLDKKIRSEIEAIKKIIAEFDVVKESVNELSEKAKTDPQAAEKLNKLIEG -----------------------------------------3333--------------- YTYGEERKLYDSALSKIEKLIETL ------------------------ >PLY PROTEIN; SWP:NA; PDB:1XOVA; AMSNYSMSRGHSDKCVGAEDILSEIKEAEKVLNAASDELKREGHNVKTFIDRTSTTQSAN -----------1111------------------------1111----------------- LNKIVNWHNANPADVHISVHLNAGKGTGVEVWYYAGDEKGRKLAVEISAKMAKALGLPNR -------1111----------------------2222----------------------- GAKATKDLRFLNSTKGTAVLLEVCFVDRKEDANAIHKSGMYDKLGIAIAEGLTGKTVAAK -------3333-------------------------2222-------------------3 NPNRHSGAVVDSVPMLSKMDFKSSPIKMYKAGSSLLVYEHNKYWYKAYINDKLCYIYKSF 333----------------1111------2222-------1111----%%%%----3333 CISNGKKDAKGRIKVRIKSAKDLRIPVWNNTKLNSGKIKWYSPGTKLSWYDNKKGYLELW -------1111-------3333-------3333--------2222--------------- YEKDGWYYTANYFLK ---------1111-- >HYPOTHETICAL PROTEIN AT3G; SWP:Q9SQZ9; PDB:1XOYA; SSAESASQIPKGQVDLLDFIDWSGVECLNQSSSHSLPNALKQGYREDEGLNLESDADEQL ---------------3333-----------11113333---------------------- 
LIYIPFNQVIKLHSFAIKGPEEEGPKTVKFFSNKEHMCFSNVNDFPPSDTAELTEENLKG -------------------3333--------------3333------------3333--- KPVVLKYVKFQNVRSLTIFIEANQSGSEVTKVQKIALYGST -----3333-------------1111--------------- >L-ALANYL-D-GLUTAMATE PEPT; SWP:Q37979; PDB:1XP2A; AMALTEAWLIEKANRKLNAGGMYKITSDKTRNVIKKMAKEGIYLCVAQGYRSTAEQNALY ---------------1111------------------1111------------------- AQGRTKPGAIVTNAKGGQSNHNYGVAVDLCLYTNDGKDVIWESTTSRWKKVVAAMKAEGF 2222----------2222--------------1111-------------------1111- KWGGDWKSFKDYPHFELCDAVSGEKIPAA -1111-----3333----3333------- >ENDONUCLEASE IV; SWP:Q81LV1; PDB:1XP3A; LKIGSHVSMSGKKMLLAASEEAVSYGATTFMIYTGAPQNTRRKPIEELNIEAGRKHMEQN ----------------------1111---------1111----3333----------111 GIEEIIIHAPYIINVGNTTKPETFQLGVDFLRMEIERTSALGVAKQIVLHPGAHVGAGAD 1------------------------------------1111------------iiii--- AGIQQIIKGLNEVLTPDQTVNIALETMAGKGTECGRSFEEIAKIIDGVKYNEKLSVCFDT --------------1111--------------------------3333-3333------- CHTHDAGYDIVNNFDGVLNEFDKIVGIDRLQVLHINDSKNVRGAGKDRHENIGFGHIGYK ---1111------------------3333-----------2222------2222------ ALHHIVHHPQLTHVPKILETPYVGEDKKDKKPPYKLEIEMLKNGTFDEGLLEKIKAQ -------1111-----------------------------------1111--3333- >D-ALANYL-D-ALANINE CARBOX; SWP:Q8DQ99; PDB:1XP4A; FTIAAKHAIAVEANTGKILYEKDATQPVEIASITKLITVYLVYEALENGSITLSTPVDIS ------------1111-----------------------------------1111----- DYPYQLTTNSEASNIPEARNYTVEELLEATLVSSANSAAIALAEKIAGSEKDFVDRAKLL ----1111---------------------------------------------------- EWGIQDATVVNTTGLNNETLGDNIYPGSKKDEENKLSAYDVAIVARNLIKKYPQVLEITK ---------------3333!!!!-22221111-------------------3333--333 KPSSTFAGTITSTNYLEGPAYRGGFDGLKTGTTDKAGESFVGTTVEKGRVITVVLNADHQ 3-------------------------------3333-----------------------1 DNNPYARFTATSSLDYISSTFTLRKIVQQGDAYQDSKAPVQDGKEDTVIAVAPEDIYLIE 111-3333-------3333--------2222----------------------------- RVGNQSSQSVQFTPDSKAIPAPLEAGTVVGHLTYEDKDLIGQGYITTERPSFEVADKKIE ---------------------------------------!!!!----------------- >RECA 
PROTEIN; SWP:P42443; PDB:1XP8A; AKERSKAIETAMSQIEKAFGKGSIMKLGAESKLDVQVVSTGSLSLDLALGVGGIPRGRIT -------------------2222--1111------------------------------- EIYGPESGGKTTLALAIVAQAQKAGGTCAFIDAEHALDPVYARALGVNTDELLVSQPDNG ----2222-------------1111-----------------1111-3333--------- EQALEIMELLVRSGAIDVVVVDSVAALTPRAEIPGLQARLMSQALRKLTAILSKTGTAAI ----------3333--------3333---1111-3333---------------------- FINQVGGRALKFYASVRLDVRKIGQPTVANTVKIKTVKNKVAAPFKEVELALVYGKGFDQ ---------1111----------------------------------------------- LSDLVGLAADMDIIKKAGSFYSYGDERIGQGKEKTIAYIAERPEMEQEIRDRVMAAIR --------1111----!!!!--%%%%-------------------------------- >XPA; SWP:P23025; PDB:1XPA; MEFDYVICEECGKEFMDSYLMNHFDLPTCDNCRDADDKHKLITKTEAKQEYLLKDCDLEK ----------------------------3333-3333-----3333-------------- REPPLKFIVKKNPHHSQWGDMKLYLKLQIVKRSLEVWGSQEALEEAKEVRQEN ----------------3333--------------------------------- >ESTROGEN RECEPTOR; SWP:P03372; PDB:1XPCA; ALSLTADQMVSALLDAEPPILYSEYDPTRPFSEASMMGLLTNLADRELVHMINWAKRVPG -------------1111--------1111-1111-----------------------222 FVDLTLHDQVHLLECAWLEILMIGLVWRSMEHPGKLLFAPNLLLDRNQGKCVEGMVEIFD 2---------------------------1111------1111--3333---2222----- MLLATSSRFRMMNLQGEEFVCLKSIILLNSGVYTFLSSTLKSLEEKDHIHRVLDKITDTL ----------------------------1111---------------------------- IHLMAKAGLTLQQQHQRLAQLLLILSHIRHMSNKGMEHLYSMKCKNVVPLYDLLLEMLDA ----1111---------------------------------------------------- HRLHA -1111 >CD209 ANTIGEN-LIKE PROTEI; SWP:Q9H8F0; PDB:1XPHA; AFERLCRHCPKDWTFFQGNCYFMSNSQRNWHDSVTACQEVRAQLVVIKTAEEQNFLQLQT 3333-----2222--iiii------------------1111------------------- SRSNRFSWMGLSDLNQEGTWQWVDGSPLSPSFQRYWNSGEPNNSGNEDCAEFSGSGWNDN --------------3333---1111---111111112222---%%%%-----!!!!---- RCDVDNYWICKKPAACFRD 1111--------------- >HYPOTHETICAL PROTEIN; SWP:Q9KVB4; PDB:1XPJA; MKKLIVDLDGTLTQANTSDYRNVLPRLDVIEQLREYHQLGFEIVISTARNMRTYEGNVGK -------2222-------3333--------------1111-------22221111--333 
INIHTLPIITEWLDKHQVPYDEILVGKPWCGHDGFYIDDRAVRPSEFASMNLEEIHQLFE 3------------1111-------------1111---1111------------------3 KEKS 333- >3-HYDROXY-3-METHYLGLUTARY; SWP:Q79ZY6; PDB:1XPMA; AIGIDKINFYVPKYYVDMAKLAEARQVDPNKFLIGIGQTEMAVSPVNQDIVSMGANAAKD ----------------------1111-3333-------------1111---------111 IITDEDKKKIGMVIVATESAVDAAKAAAVQIHNLLGIQPFARCFEMKEAYAATPAIQLAK 1-3333------------------------------------------------------ DYLATRPNEKVLVIATDTARYGLNSGGEPTQGAGAVAMVIAHNPSILALNEDAVAYTEDV 1111-1111------------2222-3333------------------------------ YDFWRPTGHKYPLVDGALSKDAYIRSFQQSWNEYAKRQGKSLADFASLCFHVPFTKMGKK -----2222------3333---------------------3333---------3333--- ALESIIDNADETTQERLRSGYEDAVDYNRYVGNIYTGSLYLSLISLLENRDLQAGETIGL ----3333-----------------3333----!!!!---------------2222---- FSYGSGSVGEFYSATLVEGYKDHLDQAAHKALLNNRTEVSVDAYETFFKRFDDVEFDEEQ ----------------2222------------------------------1111--3333 DAVHEDRHIFYLSNIENNVREYHRPELE -33331111------%%%%-----1111 >HYPOTHETICAL PROTEIN PA13; SWP:Q9I420; PDB:1XPNA; GSSHHHHHHSSGLVPRGSHMASNPNDLPDFPEHEYAATQQVGGGVINGDLYLTSASGAIQ --------------------------------3333-----------------1111--- KGTNTKVALEPATSYMKAYYAKFGNLDAAKRDPDVQPPVLDPRRATYVREATTDQNGRFD --------------------------3333----------3333---------------- FDHIPNGTYYISSELTWSAQSDGKTITEGGTVTKLVTVSGSQPQKVLLTR -------------------------------------------------- >DNA-DIRECTED RNA POLYMERA; SWP:Q9HIC5; PDB:1XPPA; AESSLRVISKEKNSITVEMINYDNTLLRTLVEEILKDDQVDEARYYIKHPVIDNPQIYVR ------------------------------------1111--------1111-------- VKSGKPQSAIKRAVRKLSKLYEDLGTQFQKEFQRYESDH ----3333------------------------------- >PUTATIVE TROPINONE REDUCA; SWP:Q9ASX2; PDB:1XQ1A; SQRWSLKAKTVLVTGGTKGIGHAIVEEFAGFGAVIHTCARNEYELNECLSKWQKKGFQVT 3333-2222------------------3333----------------------------- GSVCDASLRPEREKLMQTVSSMFGGKLDILINNLGALDYTAEDFSFHISTNLESAYHLSQ ----1111--------------iiii---------------------------------- LAHPLLKASGCGNIIFMSGSIYSATKGALNQLARNLACEWASDGIRANAVAPAVIAGEPE 
-----3333------------------------------3333----------------- EVSSLVAFLCMPAASYITGQTICVDGGLTVNGFSYQPQ ----------3333---------------iiii----- >PROTEIN APAG; SWP:Q7VU61; PDB:1XQ4A; PVKPYDLTVSVTPRYVPEQSDPSQQQYVFAYTVRITNTGSHPAQVISRHWIITDGEERVQ ---------------3333--1111----------------------------1111--- EVRGLGVVGQQPLLAPGETFEYTSGCPLPTPIGTRGTYHCVGENGIPFEVPIAEFLLAPR ------%%%%-------------------------------1111--------------- T - >HEMOGLOBIN ALPHA-1 CHAIN; SWP:NA; PDB:1XQ5A; SLSSKDKDTVKALWGKIADKAEEIGSDALSRMLAVYPQTKTYFSHWKDLSPGSAPVNKHG --------------1111-----------------33331111----------------- KTIMGGIVDAVASIDDLNAGLLALSELHAFTLRVDPANFKILSHCILVLLAVKFPKDFTP ----------1111--3333--------------3333---------------3333--- EVHISYDKFFSALARALAEKYR ---------------1111--- >HEMOGLOBIN ALPHA-1 CHAIN; SWP:NA; PDB:1XQ5B; VVWTDFERATIADIFSKLDYEAVGGATLARCLIVYPWTQRYFGNFGNLYNAAAIMGNPMI --------------1111-------------------3333--------3333------- AKHGTTILHGLDRAVKNMDNIKATYAELSVLHSEKLHVDPDNFKLLSDCLTIVVAAQLGK ----------------11113333--------------3333---------------!!! 
AFSGEVQAAFQKFLSVVVSALGKQYH !------------------------- >UNKNOWN PROTEIN; SWP:Q94EG6; PDB:1XQ6A; SANLPTVLVTGASGRTGQIVYKKLKEGSDKFVAKGLVRSAQGKEKIGGEADVFIGDITDA ----------1111------------1111------------------1111---11113 DSINPAFQGIDALVILTSAVPKMKPGFDPTKGGRPEFIFEDGQYPEQVDWIGQKNQIDAA 3333333--------------------1111--------2222----------------- KVAGVKHIVVVGSMGGTNPDHPLNKLGNGNILVWKRKAEQYLADSGTPYTIIRAGGLLDK --------------1111--33332222-------------------------------- EGGVRELLVGKDDELLQTDTKTVPRADVAEVCIQALLFEEAKNKAFDLGSKPEGTSTPTK ----------%%%%1111---------------33333333----------2222----- DFKALFSQVTSRF ----1111----- >ALPHA-SYNUCLEIN; SWP:P37840; PDB:1XQ8A; MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKTKEGVVHGVATVAEKTK -3333---------------------------1111---33333333------------- EQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGKNEEGAPQEGILEDMPVDP ---------------------3333---------------------------------33 DNEAYEMPSEEGYQDYEPEA 33------------------ >PHOSPHOGLYCERATE MUTASE; SWP:Q8IIG6; PDB:1XQ9A; MTTYTLVLLRHGESTWNKENKFTGWTDVPLSEKGEEEAIAAGKYLKEKNFKFDVVYTSVL -------------3333-----!!!!-------------------1111----------- KRAICTAWNVLKTADLLHVPVVKTWRLNERHYGSLQGLNKSETAKKYGEEQVKIWRRSYD ---------------1111----3333----!!!!------------1111--------- IPPPKLDKEDNRWPGHNVVYKNVPKDALPFTECLKDTVERVLPFWFDHIAPDILANKKVM ------1111--33333333---1111-------------------------1111---- VAAHGNSLRGLVKHLDNLSEADVLELNIPTGVPLVYELDENLKPIKHYYLLDSEELKKKM -------------1111---3333--------------1111------------3333-- D - >GLYOXALASE/BLEOMYCIN RESI; SWP:Q81AI8; PDB:1XQAA; AGIKHLNLTVADVVAAREFLEKYFGLTCSGTRGNAFAVRDNDGFILTLKGKEVQYPKTFH -------------------------------!!!!----1111------------1111- VGFPQESEEQVDKINQRLKEDGFLVEPPKHAAYTFYVEAPGGFTIEVC ------------------1111----------------2222------ >HYPOTHETICAL UPF0066 PROT; SWP:P44740; PDB:1XQBA; NDLTLSPIAIIHTPYKEKFSVPRQPNLVEDGVGIVELLPPYNSPEAVRGLEQFSHLWLIF ----------------3333---3333---------------33332222---------- QVGVFASRATHRPNPLGSKVELRQVECINGNIFLHLGAVDLVDGTPIFDIKPYIAYADSE 
--3333---------------------%%%%----------2222----------11111 PNAQSSVKMTVEFTEQAKSAVKKREEKRPHLSRFIRQVLEDRIYGMSLYEFNVKWAGTVN 111------------------------2222-----1111-------!!!!--------- CVE --- >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q8ZWY4; PDB:1XQIA; PVEKTLLILKPDAVARGLVDEIISRFKKAGLKIVALKVKASPEEIERFYPSSEEWLQSAG --------------------------1111------------------------------ QKLLKAYQELGIDPRAKIGTDDPVEVGRIIKRNLVKYTSGPNVVVLKGNRAVEIVRKLVG ------------3333---------------------------------3333------- PTSPHSAPPGTIRGDYSIDSPDLAAEEGRVVFNLVHASDSPSEAEREIRFWFREEEVLE --3333------------------1111------------------------3333--- >8-OXOGUANINE DNA GLYCOSYL; SWP:Q8ZVK6; PDB:1XQOA; AAESQLKRVIETLRRLGIEEVLKLERRDPQYRAVCNVVKRHGETVGSRLAMLNALISYRL ------------3333------3333---------------------------1111--i TGKGEEHWEYFGKYFSQLEVIDLCRDFLKYIETSPFLKIGVEARKKRALKACDYVPNLED iii-----------1111---------------1111-------------1111--1111 LGLTLRQLSHIVGARREQKTLVFTIKILNYAYMCSRGVNRVLPFDIPIPVDYRVARLTWC --------------1111------------------------1111------------11 AGLIDFPPEEALRRYEAVQKIWDAVARETGIPPLHLDTLLWLAGRAVLYGENLHGVPKEV 11-----------------------------3333---------------------3333 IALFQWRGGCRPP 3333--1111--- >HSPBP1 PROTEIN; SWP:Q9NZL4; PDB:1XQRA; RGQRGEVEQKSCLRVLSQPPPTAGEAEQAADQQEREGALELLADLCENDNAADFCQLSGH --------------1111-----3333--------------------------------- LLVGRYLEAGAAGLRWRAAQLIGTCSQNVAAIQEQVLGLGALRKLLRLLDRDACDTVRVK ----3333--3333---------------------------------------------- ALFAISCLVREQEAGLLQFLRLDGFSVLRAQQQVQKLKVKSAFLLQNLLVGHPEHKGTLC ----3333----------------3333-----------------------3333----- SGVQQLVALVRTEHSPFHEHVLGALCSLVTDFPQGVRECREPELGLEELLRHRCQLLQQH ------3333----3333-------3333-----------3333------------1111 EEYQEELEFCEKLLQTCFS 1111--------------- >HIT FAMILY HYDROLASE; SWP:Q4CCR3; PDB:1XQUA; LENCVFCKIIKRELPSTIYYEDERVIAIKDINPAAPVHVLIIPKEHIANVKEINESNAQI 1111----1111---------1111-----------------------3333-3333--- LIDIHKAANKVAEDLGIAEKGYRLITNCGVAAGQTVFHLHYHLLGGVDGPKI 
------------1111-3333-------3333-------------------- >Fibroblast growth factor ; SWP:Q8WU20; PDB:1XR0B; MGSDTVPDNHRNKFKVINVDDDGNELGSGIMELTDTELILYTRKRDSVKWHYLCLRRYGY -----------------------------------------%%%%--------------- DSNLFSFESGRRCQTGQGIFAFKCARAEELFNMLQEIMQNNSINVVEEPVVERNNHQTEL ------------------------------------------------------------ EVPRTPRTP --------- >putative citrate lyase al; SWP:Q8ZRY1; PDB:1XR4A; AKETVTLNQQYVVPEGLQPYQGVTANSPWLASETEKRRRKICDSLEEAIRRSGLKNGTIS -3333----------------1111-3333---33333333------------------- FHHAFRGGDKVVNVAKLAEGFRDLTLASSSLIDAHWPLIEHIKNGVVRQIYTSGLRGKLG --1111-------------------------1111------------------------- EEISAGLENPVQIHSHGGRVKLIQSGELNIDVAFLGVPCCDEFGNANGFSGKSRCGSLGY ----------------------1111--------------1111---------------- AQVDAQYAKCVVLLTEEWVEFPNYPASIAQDQVDLIVQVDEVGDPEKITAGAIRLSSNPR ----------------------------1111---------------------------- ELLIARQAANVIEHSGYFCDGFSLQTGTGGASLAVTRFLEDKRRHNITASFGLGGITGTV ------------------2222-------------------------------------- DLHEKGLIKALLDTQSFDGDAARSLAQNPHHIEISTNQYANPASKGAACERLNVVLSALE --1111---------------------1111---3333--1111---------------- IDVNFNVNVTGSNGVLRGASGGHSDTAAGADLTIITAPLVRGRIPCVVEKVLTTVTPGAS -1111-----1111-----!!!!-----------------!!!!------------3333 VDVLVTDHGIAVNPARQDLLDNLRAAGVALTIEQLQQRAEQLTGKPQPIEFTDRVVAVVR -----1111---3333-------1111--------------------------------- YRDGSVIDVIRQVK 1111---------- >GENOME POLYPROTEIN; SWP:Q89649; PDB:1XR5A; GQVIARHKVREFNINPVNTPTKSKLHPSVFYDVFPGDKEPAVLSDNDPRLEVKLTESLFS -------3333----------------1111------------3333-----------33 KYKGNVNTEPTENMLVAVDHYAGQLLSLDIPTSELTLKEALYGVDGLEPIDITTSAGFPY 33---------------------------------3333--------------------- VSLGIKKRDILNKETQDTEKMKFYLDKYGIDLPLVTYIKDELRSVDKVRLGKSRLIEASS 1111-3333--3333----------------------------3333------------- LNDSVNMRMKLGNLYKAFHQNPGVLTGSAVGCDPDVFWSVIPCLMDGHLMAFDYSNFDAS --------------------------------3333---3333-------------3333 
LSPVWFVCLEKVLTKLGFAGSSLIQSICNTHHIFRDEIYVVEGGMPSGCSGTSIFNSMIN -3333-----------------3333---------------------------------- NIIIRTLILDAYKGIDLDKLKILAYGDDLIVSYPYELDPQVLATLGKNYGLTITPPDKSE ---------------3333-----!!!!-----------------1111----------- TFTKMTWENLTFLKRYFKPDQQFPFLVHPVMPMKDIHESIRWTKDPKNTQDHVRSLCMLA -----3333--iiii-------1111----------------------------3333-- WHSGEKEYNEFIQKIRTTDIGKCLILPEYSVLRRRWLDLF ------------------3333-----3333--------- >GENOME POLYPROTEIN; SWP:Q82113; PDB:1XR6A; GQIKISKHANECGLPTIHTPSKTKLQPSVFYDVFPGSKEPAVLTDNDPRLKVNFKEALFS -------3333----------------1111---------------3333--------33 KYKGNTECSLNQHMEIAIAHYSAQLITLDIDSKPIALEDSVFGIEGLEALDLNTSAGFPY 33-----------------------1111------------------------------- VTMGIKKRDLINNKTKDISRLKEALDKYGVDLPMITFLKDELRKKEKISAGKTRVIEASS -----3333----------------------------------3333------------3 INDTILFRTTFGNLFSKFHLNPGVVTGSAVGCDPETFWSKIPVMLDGDCIMAFDYTNYDG 333-------------------------2222333333333333-------------333 SIHPVWFQALKKVLENLSFQSNLIDRLCYSKHLFKSTYYEVAGGVPSGCSGTSIFNTMIN 3-------------1111--3333------------------------------------ NIIIRTLVLDAYKNIDLDKLKIIAYGDDVIFSYKYTLDMEAIANEGKKYGLTITPADKST ---------------3333-----!!!!------------------1111-----%%%%- EFKKLDYNNVTFLKRGFKQDEKHTFLIHPTFPVEEIYESIRWTKKPSQMQEHVLSLCHLM -----1111--iiii-------1111-----3333---1111--1111-----------3 WHNGRKVYEDFSSKIRSVSAGRALYIPPYDLLKHEWYEKF 333----------11113333--------------1111- >GENOME POLYPROTEIN; SWP:Q82122; PDB:1XR7A; GQIQISKHVKDVGLPSIHTPTKTKLQPSVFYDIFPGSKEPAVLTEKDPRLKVDFDSALFS -------3333----------------1111------------1111-----------33 KYKGNTECSLNEHIQVAVAHYSAQLATLDIDPQPIAMEDSVFGMDGLEALDLNTSAGYPY 33-----------------------1111-------3333---2222------------- VTLGIKKKDLINNKTKDISKLKLALDKYGVDLPMITFLKDELRKKDKIAAGKTRVIEASS -----1111----------------------------------33331111--------- INDTILFRTVYGNLFSKFHLNPGVVTGCAVGCDPETFWSKIPLMLDGDCIMAFDYTNYDG ----------------------------2222333333333333------------3333 
SIHPIWFKALGMVLDNLSFNPTLINRLCNSKHIFKSTYYEVEGGVPSGCSGTSIFNSMIN --3333--------1111---3333-----------------------2222-------- NIIIRTLVLDAYKHIDLDKLKIIAYGDDVIFSYKYKLDMEAIAKEGQKYGLTITPADKSS ---------------3333-----!!!!-----------------3333------%%%%- EFKELDYGNVTFLKRGFRQDDKYKFLIHPTFPVEEIYESIRWTKKPSQMQEHVLSLCHLM -----3333--iiii-------3333-----3333---------33333333-------3 WHNGPEIYKDFETKIRSVSAGRALYIPPYELLRHEWYEKF 333----------11113333--------------1111- >SUPEROXIDE DISMUTASE; SWP:Q81JK8; PDB:1XREA; SFQLPKLSYDYDELEPYIDSNTLSIHHGKHHATYVNNLNAALENYSELHNKSLEELLCNL ---------1111----------------------------11111111---------33 ETLPKEIVTAVRNNGGGHYCHSLFWEVMSPRGGGEPNGDVAKVIDYYFNTFDNLKDQLSK 33-3333-----------------1111---------3333------------------- AAISRFGSGYGWLVLDGEELSVMSTPNQDTPLQEGKIPLLVIDVWEHAYYLKYQNRRPEF ---------------!!!!------!!!!3333-----------3333----!!!!---- VTNWWHTVNWDRVNEKYLQAI --3333--------------- >Putative translation init; SWP:Q4CI45; PDB:1XRGA; YIEVVKTNKAPEAIGPYSQAIVTGSFVYTSGQIPINPQTGEVVDGGIEEQAKQVLENLKN ----------------------!!!!---------------------------------- VLEAAGSSLNKVVKTTVFIKDDSFAKVNEVYAKYFSEPYPARSCVEVSKLPKGVLIEIEA --1111-1111--------------------1111--------------2222------- VAIK ---- >UREIDOGLYCOLATE DEHYDROGE; SWP:P77555; PDB:1XRHA; SSKISRETLHQLIENKLCQAGLKREHAATVAEVLVYADARGIHSHGAVRVEYYAERISKG -------------------------------------111133333333----------- GTNREPEFRLEETGPCSAILHADNAAGQVAAKGEHAIKTAQQNGVAVVGISRGHSGAISY ---------------------%%%%3333------------------------------- FVQQAARAGFIGISCQSDPVVPFGGAEIYYGTNPLAFAAPGEGDEILTFDATTVQAWGKV ---------------------2222----------------!!!!----------3333- LDARSRNSIPDTWAVDKNGVPTTDPFAVHALLPAAGPKGYGLIDVLSGVLLGLPFGRQVS ---------------1111----1111---------------3333-3333---!!!!-- SYDDLHAGRNLGQLHIVINPNFFSSSELFRQHLSQTRELNAITPAPGFNQVYYPGQDQDI ---1111-----------3333---------------3333-----------2222---- KQRKAAVEGIEIVDDIYQYLISDALYNTSYE ------------3333--------------- >AT1G05000; SWP:Q9ZVN4; PDB:1XRIA; 
HLIPPLNFSVDNGIFRSGFPDSANFSFLQTLGLRSIIYLCPEPYPESNLQFLKSNGIRLF ----------1111------3333-3333------------------------------- QFGIEGNKEPFVNIPDHKIRALKVLLDEKNHPVLIHCKRGKHRTGCLVGCLRKLQKWCLT ----------------3333------3333------------3333-------------- SIFDEYQRFAAAKARVSDQRFEIFDVSS ---------!!!!--------------- >BLEOMYCIN RESISTANCE PROT; SWP:P17493; PDB:1XRKA; AKLTSAVPVLTARDVAEAVEFWTDRLGFSRVFVEDDFAGVVRDDVTLFISAVQDQVVPDN ---------------------------------1111----!!!!--------------- TQAWVWVRGLDELYAEWSEVVSTNFRDASGPAMTEIVEQPWGREFALRDPAGNCVHFVAE ---------------3333----3333-----------1111------1111-------- >D-LYSINE 5,6-AMINOMUTASE ; SWP:Q9ZFE6; PDB:1XRSA; MESKLNLDFNLVEKARAKAKAIAIDTQEFIEKHTTVTVERAVCRLLGIDGVDTDEVPLPN --1111----------------------3333--3333-----1111----1111-3333 IVVDHIKENNGLNLGAAMYIANAVLNTGKTPQEIAQAISAGELDLTKLPMKDLFEVKTKA ------11113333-----------------------------3333-----3333---- LSMAKETVEKIKNNRSIRESRFEEYGDKSGPLLYVIVATGNIYEDITQAVAAAKQGADVI ------------------------------------------------------------ AVIRTTGQSLLDYVPYGATTEGFGGTYATQENFRLMREALDKVGAEVGKYIRLCNYCSGL ----2222-------------2222---------------------------------11 CMPEIAAMGAIERLDVMLNDALYGILFRDINMQRTMIDQNFSRIINGFAGVIINTGEDNY 11-----------------1111--------------------------------3333- LTTADAFEEAHTVLASQFINEQFALLAGLPEEQMGLGHAFEMDPELKNGFLYELSQAQMA ----3333----------------1111-1111---------1111-3333--------- REIFPKAPLKYMPPTKFMTGNIFKGHIQDALFNMVTIMTNQRIHLLGMLTEALHTPFMSD ---1111-------1111-----------------------------1111--------- RALSIENAQYIFNNMESISEEIQFKEDGLIQKRAGFVLEKANELLEEIEQLGLFDTLEKG ----------------3333---------------------------------------- IFGGVKRPKDGGKGLNGVVSKDENYYNPFVELMLNK -%%%%--------1111----1111-3333------ >D-lysine 5,6-aminomutase ; SWP:Q9ZFE5; PDB:1XRSB; KVQLSFTLPLKNNERSAEAAKQIALKMGLEEPSVVMQQSLDEEFTFFVVYGNEILSMEET ------------3333-------3333-------------1111---------------- DEYIKENIGRKIVVVGASTGTDAHTVGIDAIMNMKGYAGHYGLERYEMIDAYNLGSQVAN ------------------!!!!--3333----3333iiii-3333-------------33 
EDFIKKAVELEADVLLVSQTVTQKNVHIQNMTHLIELLEAEGLRDRFVLLCGGPRINNEI 33------------------------------------1111-3333------------- AKELGYDAGFGPGRFADDVATFAVKTLNDRMN ----------11113333-------------- >DIHYDROOROTASE; SWP:O66990; PDB:1XRTA; MLKLIVKNGYVIDPSQNLEGEFDILVENGKIKKIDKNILVPEAEIIDAKGLIVCPGFIDI ------------3333----------%%%%-----------------2222--------- HVHLRDPGQTYKEDIESGSRCAVAGGFTTIVCMPNTNPPIDNTTVVNYILQKSKSVGLCR --------3333------------------------------------------------ VLPTGTITKGRKGKEIADFYSLKEAGCVAFTDDGSPVMDSSVMRKALELASQLGVPIMDH -------2222------------------------------------------------- CEDDKLAYAEEIQIARDGILAQRTGGHVHIQHVSTKLSLEIIEFFKEKGVKITCEVNPNH -3333----------------------------------------1111-------3333 LLEDRLALIEGVKRGIIDCFATDHAPHQTGIIGLQTALPSALELYRKGIISLKKLIEMFT --------------------------------1111--------1111------------ INPARIIGVDLGTLKLGSPADITIFDPNKEWILNEETNLSKSRNTPLWGKVLKGKVIYTI --------------2222---------------3333-------1111------------ KDGKMVYKD iiii----- >4-deoxy-L-threo-5-hexosul; SWP:Q46938; PDB:1XRUA; AMDVRQSIHSAHAKTLDTQGLRNEFLVEKVFVADEYTVYSHIDRIIVGGIPITKTVSVGG --------33331111---------------2222------%%%%--------------- EVGKQLGVSYFLERRELGVINIGGAGTITVDGQCYEIGHRDALYVGKGAKEVVFASIDTG ---1111--1111----------------iiii----2222----------------333 TPAKFYYNCAPAHTTYPTKKVTPDEVSPVTLGDNLTSNRRTINKYFVPDVLETCQLSGLT 3---------------------3333------3333----------3333---------- ELAPGNLWNTPCHTHERREVYFYFNDDDACVFHGQPQETRHIVHNEQAVISPSWSIHSGV --2222--------1111-------1111-----1111-------------1111----- GTKAYTFIWGVGENQVFDDDHVAVKEIC ---------------1111---3333-- >SEQA PROTEIN; SWP:P36658; PDB:1XRXA; KTIEVDDELYSYIASHTKHIGESASDILRRLKF -------------1111-2222----------- >INHIBITOR OF VERTEBRATE L; SWP:P45502; PDB:1XS0A; DLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDC --------------------2222--3333-------------------------2222- GSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENH -------------------------------------------------------3333- PDGFNFRS -------- 
>DEOXYCYTIDINE TRIPHOSPHAT; SWP:P28248; PDB:1XS1A; MRLCDRDIEAWLDEGRLSINPRPPVERINGATVDVRLGNKFRTFRGHTAAFIDLSGPKDE -----------------------3333-----------------1111----1111---- VSAALDRVMSDEIVLDEGEAFYLHPGELALAVTLESVTLPADLVGWLDGRSSLARLGLMV ---------------2222----2222------------1111------33331111--- HVTAHRIDPGWSGCIVLEFYNSGKLPLALRPGMLIGALSFEPLSGPAVRPYNRREDAKYR -------2222--------------------------------------33331111--- NQQGAVASRIDKD --------1111- >HYPOTHETICAL PROTEIN XC97; SWP:NA; PDB:1XS3A; MRKRPLDAETIRKLIESGLPEARVDVQGEDGVHFEATVVSPAFVGKAPLARHRMVYATLG ------3333-----------------------------3333-------------1111 ELMGGAIHALQLKTLTPDEA ---------------3333- >MEMBRANE LIPOPROTEIN TPN3; SWP:O07950; PDB:1XS5A; KDETVGVGVLSEPHARLLEIAKEEVKKQHIELRIVEFTNYVALNEAVMRGDILMNFFQHV -----------------------3333--------------------------------- PHMQQFNQEHNGDLVSVGNVHVEPLALYSRTYRHVSDFPAGAVIAIPNDSSNEARALRLL ---------------------------------1111-2222------3333-------- EAAGFIRMRAGSGLFATVEDVQQNVRNVVLQEVESALLPRVFDQVDGAVINGNYAIMAGL 1111--------11113333---1111------33333333--------------1111- SARRDGLAVEPDASAYANVLVVKRGNEADARVQAVLRALCGGRVRTYLKERYKGGEVAPA 3333-------3333----------1111----------------------1111----- >UPF0269 PROTEIN YGGX; SWP:P67617; PDB:1XS8A; MSRTIFCTYLQRDAEGQDFQLYPGELGKRIYNEISKDAWAQWQHKQTMLINEKKLNMMNA ------3333--------------------1111-------------------------- EHRKLLEQEMVSFLFEGKDVHIEGYTPEDKK ------------------------------- >BIS(5'-NUCLEOSYL)-TETRAPH; SWP:P50583; PDB:1XSAA; GPLGSMALRACGLIIFRRCLIPKVDNNAIEFLLLQASDGIHHWTPPKGHVEPGEDDLETA 1111-------------------------------------------------------- LRATQEEAGIEAGQLTIIEGFKRELNYVARNKPKTVIYWLAEVKDYDVEIRLSHEHQAYR ----------3333------------------------------1111------------ WLGLEEACQLAQFKEMKAALQEGHQFLCSIEAL -------------------------3333---- >11BETA-HYDROXYSTEROID DEH; SWP:Q6QLL4; PDB:1XSEA; NEKFRPEMLQGKKVIVTGASKGIGREIAYHLAKMGAHVVVTARSKEALQKVVARCLELGA ----33332222------------------------------------------------ 
ASAHYIAGSMEDMTFAEEFVAEAGNLMGGLDMLILNHVLYNRLTFFHGEIDNVRKSMEVN ------------------------------------------------3333-------- FHSFVVLSVAAMPMLMQSQGSIAVVSSVAGKITYPLIAPYSASKFALDGFFSTLRSEFLV --------------------------1111------------------------------ NKVNVSITLCILGLIDTETAIKATSGIYLGPASPKEECALEIIKGTALRQDEMYYVGSRW -----------------------------------------------------------3 VPYLLGNPGRKIMEFLSAAEYNWDNVLSNEKLYG 333---3333-----3333--------------- >PROBABLE RESUSCITATION-PR; SWP:O05594; PDB:1XSFA; NVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQG ------------------------1111-------------1111--------------- TWEANGGLRYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGAR -------3333-1111------------------1111---------- ------------------------ >UREIDOGLYCOLATE HYDROLASE; SWP:P77731; PDB:1XSQA; KLQVLPLSQEAFSAYGDVIETQQRDFFHIVERYHDLALVEILEQDCTLISINRAQPANLP -------33331111-----2222------------------------------------ LTIHELERHPLGTQAFIPKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHP --------1111----------------------1111--------------2222---- LFAWQRVTDFLTIDRGDNCDVESIPEQELCFA -------------------------------- >HYPOTHETICAL UPF0122 PROT; SWP:P67248; PDB:1XSVA; DLVKTLRNYLFDFYQSLLTNKQRNYLELFYLEDYSLSEIADTFNVSRQAVYDNIRRTGDL ---------33333333------------------------------------------- VEDYEKKLELYQKFEQRREIYDEKQHLSNPEQIQRYIQQLEDLE -------------------------1111--------------- >GUANINE NUCLEOTIDE EXCHAN; SWP:NA; PDB:1XSZA; ASHPEIEKAQREIIEAFNAKPKNGINKIKEICEQYKISPNEEIAEFFHQQRKNLDLEAVG --33331111---------------------------------------1111------- DYLSSPEAENQQVLKAFTSQMNFNGQSFVEGLRTFLKTFKLPGEAQKIDRLVQSFSGAYF -1111------------------------------3333--------------------- QQNPDVVSNADAAYLLAFQTIMLNTDLHNPSIPEKNKMTVDGLKRNLRGGNNGGDFDAKF --1111----------------------33333333--------------iiii------ LEELYSEIKAKPFELNFVKTSPGYELTSTTLNKDSTFKKLDSFLHSTDVNINTVFPGIGD -----------------------------3333-------3333-----1111-3333-- NVKTTVDQPKSWLSFFTGYKGTITLTDNKTSAQATIQVYTPNIFSKWLFGEQPRVIIQPG 
-----------------------------------------3333--------------- QTKESIDLAAKAAADFSSPVKNFKATYDYEVGDLIKAYDNQKKLITIERNLALKA -2222--------------------11113333---------------------- >variable region-containin; SWP:Q8I9N0; PDB:1XT5A; GQSIMTVRTTHTEVEVHAGGTVELPCSYQLANDTQPPVISWLKGASPDRSTKVFKGNYNW ----------------2222-------------------------3333----------- QGEGLGFVESDSYKESFGDFLGRASVANLAAPTLRLTHVHPQDGGRYWCQVAQWSIRTEF -------3333-----!!!!-------1111--------1111--------------333 GLDAKSVVLKVTGHT 3---------2222- >putative amino-acid trans; SWP:Q9PNV7; PDB:1XT8A; LNSLDKIKQNGVVRIGVFGDKPPFGYVDEKGNNQGYDIALAKRIAKELFGDENKVQFVLV ---------------------------1111-------------------1111------ EAANRVEFLKSNKVDIILANFTQTPQRAEQVDFCSPYMKVALGVAVPKDSNITSVEDLKD 3333-------------------3333-------------------1111---3333--- KTLLLNKGTTADAYFTQNYPNIKTLKYDQNTETFAALMDKRGDALSHDNTLLFAWVKDHP -----2222---------1111------3333----1111-------------------- DFKMGIKELGNKDVIAPAVKKGDKELKEFIDNLIIKLGQEQFFHKAYDETLKAHFGDDVK -------------------2222--------------1111---------3333-33333 ADDVVIEG 333----- >NATRIN 1; SWP:Q7T1K6; PDB:1XTAA; VDFNSESTRRKKKQKEIVDLHNSLRRRVSPTASNMLKMEWYPEAASNAERWANTCSLNHS -3333----------------------------------------------1111----- PDNLRVLEGIQCGESIYMSSNARTWTEIIHLWHDEYKNFVYGVGASPPGSVTGHYTQIVW 1111--iiii-------------------------11112222---2222-3333----3 YQTYRAGCAVSYCPSSAWSYFYVCQYCPSGNFQGKTATPYKLGPPCGDCPSACDNGLCTN 333---------1111---------------2222--------2222-1111-iiii--- PCTIYNKLTNCDSLLKQSSCQDDWIKSNCPASCFCRNKII -----------------------------3333------- >Cholera enterotoxin subun; SWP:P01556; PDB:1XTCD; TPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAIITFKNGAIFQVEVPSSQHIDS ---3333-3333-------------------2222------3333-------------33 QKKAIERMKDTLRIAYLTEAKVEKLCTWNNKTPHAIAAISMAN 33----------------------------------------- >EUKARYOTIC INITIATION FAC; SWP:Q9N9V6; PDB:1XTDA; NASKTYPAAGALKKGGYVCINGRPCKVIDLSVSKTGKHGHAKVSIVATDIFTGNRLEDQA -------1111--------%%%%------------------------------------- 
PSTHNVEVPFVKTFTYSVLDIQPNEDPSLPSHLSLDDEGESREDLDPPDAALATQIKEQF 1111---------------------1111------1111--------------------3 DSGKEVLVVVVSAGTEQVLQTKNAA 333---------------------- >SERINE/THREONINE-PROTEIN ; SWP:Q9ERE3; PDB:1XTEA; KESCPSVSIPSSDEHREKKKRFTVYKVLVSVGRSEWFVFRRYAEFDKLYNSLKKQFPAMA ------------------------------!!!!------3333-----------3333- LKIPAKRIFGDNFDPDFIKQRRAGLNEFIQNLVRYPELYNHPDVRAFLQMDSPRHQ ----------1111--------------------3333-------------1111- >Hypothetical superoxide d; SWP:O31851; PDB:1XTMA; AFGHHVQLVNREGKAVGFIEIKESDDEGLDIHISANSLRPGASLGFHIHEKGSCVRPDFE ---------1111-------------------------2222------------------ SAGGHFNPLNKEHGFNNPMGHHAGDLPNLEVGADGKVDVIMNAPDTSLKKGSKLNILDED ------1111---1111----1111-----------------------2222-----333 GSAFIIHEQADDYLTNPSGNSGARIVCGALLG 3------------------------------- >COENZYME PQQ SYNTHESIS PR; SWP:Q88QV5; PDB:1XTOA; YIQVLGSAAGGGFPQWNCNCVNCKGYRDGTLKATARTQSSIALSDDGVHWILCNASPDIR --------!!!!--1111-3333-3333---------------------------1111- AQLQAFAPQPARALRDTGINAIVLLDSQIDHTTGLLSLREGCPHQVWCTDVHQDLTTGFP ---------------------------1111--33331111------------------3 LFNLSHWNGGLQWNRIELEGSFVIDACPNLKFTPFPLRSAAPPYSPHRFDPHPGDNLGLV 3331111----------------3333--------------1111-1111---------- EDTRTGGKLFYAPGLGQVDEKLLAHGADCLLVDGTLWEDDEQRRGVGTRTGREGHLAQNG -----------------------------------------3333--------------- PGGLEVLDGFPRQRKVLIHINNTNPILDENSPERAEVLRRGVEVAFDGSIELL ----3333------------3333----------------------------- >LMAJ004091AAA; SWP:Q4Q7M2; PDB:1XTPA; GPGSPRNLPISGRDTNGKTYRSTDEWKAELTGDLYDPEKGWYGKALEYWRTVPATVSGVL -------------1111----3333-------1111------------1111-----111 GGDHVHDVDIEGSRNFIASLPGHGTSRALDCGAGIGRITKNLLTKLYATTDLLEPVKHLE 1-1111----------1111------------!!!!-----3333--------------- EAKRELAGPVGKFILASETATLPPNTYDLIVIQWTAIYLTDADFVKFFKHCQQALTPNGY ---1111----------------------------1111------------11111111- IFFKENCSDRFLVDKEDSSLTRSDIHYKRLFNESGVRVVKEAFQEEWPTDLFPLKYALK -----------------------------------------------1111-------- 
>GTP-BINDING PROTEIN RHEB; SWP:Q15382; PDB:1XTQA; QSKSRKIAILGYRSVGKSSLTIQFVEGQFVDSYDPTIENTFTKLITVNGQEYHLQLVDTA -----------2222-------------------------------iiii---------- GQDEYSIFPQTYSIDINGYILVYSVTSIKSFEVIKVIHGKLLDMVGKVQIPIMLVGNKKD --1111--3333-----------1111------------------------------333 LHMERVISYEEGKALAESWNAAFLESSAKENQTAVDVFRRIILEAEKLE 31111---------------------1111------------------- >PROBABLE URACIL PHOSPHORI; SWP:Q980Q4; PDB:1XTTA; PLYVIDKPITLHILTQLRDKYTDQINFRKNLVRLGRILGYEISNTLDYEIVEVETPLGVK ------------------1111--------------------1111--------1111-- TKGVDITDLNNIVIINILRAAVPLVEGLLKAFPKARQGVIGASRVEVDGKEVPKDMDVYI -----3333---------1111------3333---------------------------- YYKKIPDIRAKVDNVIIADPMIATASTMLKVLEEVVKANPKRIYIVSIISSEYGVNKILS --------2222------------------------------------------------ KYPFIYLFTVAIDPELNNKGYILPGLGDAGDRAFG -1111-----------1111--------------- >PEPTIDYL-TRNA HYDROLASE; SWP:Q980V1; PDB:1XTYA; MIKMVIVVRSDIKMGKGKIAAQVAHAAVTLVVSIINSNNLRWKEWLNEWLHQGQPKIIVK -------------------------------------------------1111------- VNSLDEIISRAKKAETMNLPFSIIEDAGKTQLEPGTITCLGIGPAPENLVDSITGDLKLL --3333--------1111--------------2222-----------------1111--- >RIBOSE-5-PHOSPHATE ISOMER; SWP:Q12189; PDB:1XTZA; EDAKRAAAYRAVDENLKFDDHKIIGIGSGSTVVYVAERIGQYLHDPKFYEVASKFICIPT ----------------3333----------3333-----3333-3333-3333------- GFQSRNLILDNKLQLGSIEQYPRIDIAFDGADEVDENLQLIKGGGACLFQEKLVSTSAKT --------1111----3333--------------1111----1111-------1111--- FIVVADSRKKSPKHLGKNWRQGVPIEIVPSSYVRVKNDLLEQLHAEKVDIRQGGSAKAGP -----3333----2222----------3333----------------------3333--- VVTDNNNFIIDADFGEISDPRKLHREIKLLVGVVETGLFIDNASKAYFGNSDGSVEVTEK --1111-------------------------------------------1111------% HHHHHH %%%--- >PRION PROTEIN; SWP:Q5S1W7; PDB:1XU0A; IGGYMLGNAVGRMSYQFNNPMESRYYNDYYNQMPNRVYRPMYRGEEYVSEDRFVRDCYNM ----------------------------3333---------------------------- SVTEYIIKPAEGKNNSELNQLDTTVKSQIIREMCITEYRRGS ------3333-------------------------------- >Tumor necrosis factor 
rec; SWP:O14836; PDB:1XU1R; SLSCRKEQGKFYDHLLRDCISCASICGQHPKQCAYFCE ----3333------------33332222-3333----- >Tumor necrosis factor rec; SWP:Q02223; PDB:1XU2R; CSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNA ---------------3333-------33333333-- >VARIANT SURFACE GLYCOPROT; SWP:P26332; PDB:1XU6A; GSHMLEVLTQKHKPAESQQQAAETEGSCNKKDQNECKSPCKWHNDAENKKCTLDKEEAKK -----------------------333333333333-3333-------------------- VADETAKDGKTGNTNTTGSS -----3333----------- >CORTICOSTEROID 11-BETA-DE; SWP:P28845; PDB:1XU9A; QPLNEEFRPEMLQGKKVIVTGASKGIGREMAYHLAKMGAHVVVTARSKETLQKVVSHCLE -----------2222-------------------------------------------11 LGAASAHYIAGTMEDMTFAEQFVAQAGKLMGGLDMLILNHITNTSLNLFHDDIHHVRKSM 11---------------------------------------------------------- EVNFLSYVVLTVAALPMLKQSNGSIVVVSSLAGKVAYPMVAAYSASKFALDGFFSSIRKE -----------------------------1111--------------------------- YSVSRVNVSITLCVLGLIDTETAMKAVSGIVHMQAAPKEECALEIIKGGALRQEEVYYDS ---------------------------!!!!1111------------------------- SLWTTLLIRNPSRKILEFLYSTSYNMDRF 3333-33333333------1111--3333 >PHENAZINE BIOSYNTHESIS PR; SWP:Q51792; PDB:1XUBA; MHNYVIIDAFASVPLEGNPVAVFFDADDLPPAQMQRIAREMNLSESTFVLKPRNGGDALI ------------2222---------1111------------------------------- RIFTPVNELPFAGAPLLGTAIALGAHTDNHRLYLETQMGTIAFELERQNGSVIAASMDQP ---------------------3333----------1111--------iiii--------- IPTWTALGRDAELLKALGISDSTFPIEIYHNGPRHVFVGLPSIDALSALHPDHRALSNFH ---------------------------------------------3333---3333---- DMAINCFAGAGRRWRSRMFSPAYGVVEDAATGSAAGPLAIHLARHGQIEFGQPVEILQGV ---------!!!!-------1111------1111--------------2222------33 EIGRPSLMFAKAEGRAEQLTRVEVSGNGVTFGRGTIVL 33------------1111-------------------- >SUPEROXIDE DISMUTASE; SWP:Q81LW0; PDB:1XUQA; KHELPNLPYAYDALEPHFDKETMNIHHTKHHNTYITNLNAALEGHAELADKSVEELVANL ---------1111--------------------------1111---1111--------33 NEVPEAIRTAVRNNGGGHANHTFFWTILSPNGGGQPVGELATAIEAKFGSFDAFKEEFAK 33-3333-----------------11111111-----3333------------------- 
AGATRFGSGWAWLVVNNGELEVTSTPNQDSPLTEGKTPVIGLDVWEHAYYLNYQNRRPDY ---------------iiii------!!!!3333-----------3333----!!!!---- IGAFWNVVDWNAAEKRYQEA -3333--------------- >polysialic acid capsule b; SWP:Q57265; PDB:1XUUA; QNNNEFKIGNRSVGYNHEPLIICEIGINHEGSLKTAFEMVDAAYNAGAEVVKHQTHIVED -------!!!!--1111-------!!!!iiii-----------1111---------3333 EMSDEAKQVIPGNADVSIYEIMERCALNEEDEIKLKEYVESKGMIFISTPFSRAAALRLQ --3333----1111-------------------------1111----------------- RMDIPAYKIGSGECNNYPLIKLVASFGKPIILSTGMNSIESIKKSVEIIREAGVPYALLH ---------3333------------------------3333------------------- CTNIYPTPYEDVRLGGMNDLSEAFPDAIIGLSDHTLDNYACLGAVALGGSILERHFTDRM -------3333------------1111-----------------1111----------11 DRPGPDIVCSMNPDTFKELKQGAHALKLARGGKKDTIIAGEKPTKDFAFASVVADKDIKK 11-1111-------------------------1111-3333------------------- GELLSGDNLWVKRPGNGDFSVNEYETLFGKVAACNIRKGAQIKKTDIE -------------------3333-1111--------2222--1111-- >HYPOTHETICAL PROTEIN MM05; SWP:Q8PZJ2; PDB:1XUVA; NPTRITAEPGKQEIIITREFDAPRELVFKAFTDPDLYTQWIGPRGFTTALKIFEPKNGGS -------2222---------------------33331111--2222---------2222- WQYIQKDPEGNEYAFHGVNHDVTEPERIISTFEFEGLPEKGHVILDTARFEALPGDRTKL ---------------------------------1111----------------------- TSHSVFQTIEDRDGLQSGEEGINDSYERLDELLEKKKLEH -----------------------------------3333- >SULFOTRANSFERASE; SWP:O43704; PDB:1XV1A; PKDILRKDLKLVHGYPMTCAFASNWEKIEQFHSRPDDIVIATYPKSGTTWVSEIIDMILN 1111-------iiii--3333------1111--1111-----2222-------------% DGDIEKCKRGFITEKVPMLEMTLPGLRTSGIEQLEKNPSPRIVKTHLPTDLLPKSFWENN %%%3333---3333---1111------------1111----------1111-----1111 CKMIYLARNAKDVSVSYYHFDLMNNLQPFPGTWEEYLEKFLTGKVAYGSWFTHVKNWWKK -----------------------1111------------1111-2222-----------3 KEEHPILFLYYEDMKENPKEEIKKIIRFLEKNLNDEILDRIIHHTSFEVMKDNPLVNYTH 333-----------------------1111----------------3333--1111-111 LPTTVMDHSKSPFMRKGTAGDWKNYFTVAQNEKFDAIYETEMSKTALQFRTEI 13333-3333-------------------------------1111-------- >hypothetical protein, sim; SWP:Q99R36; PDB:1XV2A; 
NVLYQHGTLGTLAGLLEGTATINELLEHGNLGIATLTGSDGEVIFLDGKAYHANEHKEFI -------------------------1111----------------iiii----1111--- ELKGDEKVPYASITNFKASKTFPLQQLSQDDVFAQIKNELSENLFSAVKIYGTFKHHVRP ------------------------------------------------------------ AQQPPYTRLIDSARRQPEEKRQDIRGAIVGFFTPELFHGVGSAGFHIHFADDERAYGGHV -------3333----------------------3333-------------1111------ LDFEVDDVVVEIQNFETFQQHFPVNNETFVKAKIDYKDVAEEIREAE ----------------------1111---------2222-------- >PENAEIDIN-4D; SWP:Q962A7; PDB:1XV3A; HSSGYTRPLRKPSRPIFIRPIGCDVCYGIPSSTARLCCFRYGDCCHL ---------------------1111---------------------- >DNA ALPHA-GLUCOSYLTRANSFE; SWP:P04519; PDB:1XV5A; SMRICIFMARGLEGGVTKFSLEQRDWFIKNGHEVTLVYAKDKSFTRTSSHDHKSFSIPVI ---------------------------1111--------------1111--1111----3 LAKEYDKALKLVNDCDILIINSVPATSVQEATINNYKKLLDNIKPSIRVVVYQHDHSVLS 333-------1111----------11113333-------11113333---------3333 LRRNLGLEETVRRADVIFSHSDNGDFNKVLMKEWYPETVSLFDDIEEAPTVYNFQPPMDI --------------------11113333-3333--------------------------- VKVRSTYWKDVSEINMNINRWIGRTTTWKGFYQMFDFHEKFLKPAGKSTVMEGLERSPAF ---------3333----------------------------3333-----------3333 IAIKEKGIPYEYYGNREIDKMNLAPNQPAQILDYINSEMLERMSKSGFGYQLSKLNQKYL ---1111------33331111---------------------1111---------3333- QRSLEYTHLELGACGTIPVFWKSTGENLKFRVDNTPLTSHDSGIIWFDENDMESTFERIK -----------1111--------------------3333--------1111--------- ELSSDRALYDREREKAYEFLYQHQDSSFCFKEQFDIITK ------------------------3333----------- >GLYCINE N-METHYLTRANSFERA; SWP:P13255; PDB:1XVAA; VDSVYRTRSLGVAAEGIPDQYADGEAARVWQLYIGDTRSRTAEYKAWLLGLLRQHGCHRV --------2222-3333---1111-----------------3333-------1111---- LDVACGTGVDSIMLVEEGFSVTSVDASDKMLKYALKERWNRRKEPAFDKWVIEEANWLTL ----!!!!------1111--------3333--------1111--3333-------33331 DKDVPAGDGFDAVICLGNSFAHLPDSKGDQSEHRLALKNIASMVRPGGLLVIDHRNYDYI 111--!!!!--------3333---1111----------------2222------------ LSTGCAPPGKNIYYKSDLTKDITTSVLTVNNKAHMVTLDYTVQVPGAGRDGAPGFSKFRL ------------------------------------------------------------ 
SYYPHCLASFTELVQEAFGGRCQHSVLGDFKPYRPGQAYVPCYFIHVLKKTG ---------------1111--------------2222--------------- >hypothetical protein, sim; SWP:Q8NWQ6; PDB:1XVHA; TGNLQTAINDKSGTLASQNFLDADEQKRNAYNQAVSAAETILNTAKTAVEQALNNVNNAK -----------------3333--------------------------------------- HALNGTQNLNNAKQAAITAINGASDLNQKQKDALKAQANGAQRVSNAQDVQHNATELNT ----------------------1111-----------1111-3333------------- >PUTATIVE MANNOSYL-3-PHOSP; SWP:P76329; PDB:1XVIA; IQQPLLVFSDLDGTLLDSHSYDWQPAAPWLTRLREANVPVILCSSKTSAEMLYLQKTLGL ----------2222---------3333------1111------------------11112 QGLPLIAENGAVIQLAEQWQEIDGFPRIISGISHGEISLVLNTLREKEHFKFTTFDDVDD 222---%%%%-----1111----------------------------------3333-33 ATIAEWTGLSRSQAALTQLHEASVTLIWRDSDERMAQFTARLNELGLQFMQGARFWHVLD 33-------------3333----------------------------------------1 ASAGKDQAANWIIATYQQLSGKRPTTLGLGDGPNDAPLLEVMDYAVIVKGLN 111----------------------------3333-3333------------ >MN TRANSPORTER; SWP:Q79EF9; PDB:1XVLA; GETEEKKKVLTTFTVLADMVQNVAGDKLVVESITRIGAEIHGYEPTPSDIVKAQDADLIL -------------------1111!!!!------------------3333--3333----- YNGMNLERWFEQFLGNVKDVPSVVLTEGIEPIPIADGPYTDKPNPHAWMSPRNALVYVEN --%%%%1111-3333--------1111----------------------3333------- IRQAFVELDPDNAKYYNANAAVYSEQLKAIDRQLGADLEQVPANQRFLVSCEGAFSYLAR --------3333-------------------------3333------------------- DYGMEEIYMWPINAEQQFTPKQVQTVIEEVKTNNVPTIFCESTVSDKGQKQVAQATGARF ------------------3333-----------------------3333-3333------ GGNLYVDSLSTEEGPVPTFLDLLEYDARVITNGLLAGTN --------------------------------------- >Orphan nuclear receptor N; SWP:Q14994; PDB:1XVPB; PVQLSKEQEELIRTLLGAHTRHMGTMFEQFVQFRPPAHLFIHHQPLPTLAPVLPLVTHFA ----------------------------3333---1111-------1111-3333----- DINTFMVLQVIKFTKDLPVFRSLPIEDQISLLKGAAVEICHIVLNTTFCLQTQNFLCGPL -----------------3333--3333---------3333----1111------------ RYTIEDGARVGFQVEFLELLFHFHGTLRKLQLQEPEYVLLAAMALFSPDRPGVTQRDEID --------------------------------3333----------3333------3333 
QLQEEMALTLQSYIKGQQRRPRDRFLYAKLLGLLAELRSINEAYGYQIQHIQGLSAMMPL ------------3333----------------------------------22223333-- LQEICS ------ >THIOL PEROXIDASE; SWP:P66952; PDB:1XVQA; AQITLRGNAINTVGELPAVGSPAPAFTLTGGDLGVISSDQFRGKSVLLNIFPSVDTPVCA -----------------2222--------1111---33332222---------------3 TSVRTFDERAAASGATVLVSKDLPFAQKRFCNVMPASAFRDSFGEDYGVTIADGPMAGLL 333-------------------33331111--------------1111-----1111--- ARAIVVIGADGNVAYTELVPEIAQEPNYEAALAALGATS -------1111---------------------------- >PROTEIN APAG; SWP:Q9KUS3; PDB:1XVSA; DVSLPCIKIQVQTRYIEEQSNPEYQRFVFAYLITIKNLSSQTVQLSRRWLITDADGKQTV ---------------3333-3333----------------------------1111---- VEGDGVVGEQPRIKANDEYTYSSGTALDTPVGVQGQYLIDEQGESFTVEIEPFRLAVPHV -----iiii----2222----------------------1111----------------- >HYPOTHETICAL PROTEIN RV22; SWP:YM38_MYCTU; PDB:1XVWA; MLNVGATAPDFTLRDQNQQLVTLRGYRGAKNVLLVFFPLAFTGIQGELDQLRDHLPEFEN --2222--------1111---33332222------------------------3333--1 DDSAALAISVGPPPTHKIWATQSGFTFPLLSDFWPHGAVSAYGVNEQAGIANRGTFVVDR 111-------------------------------2222--------------------11 SGIIRFAEMKQPGEVRDQRLWTDALAALTA 11--------------3333----3333-- >YFUA; SWP:Q56925; PDB:1XVXA; SNDSGIVVYNAQHENLVKSWVDGFTKDTGIKVTLRNGGDSELGNQLVQEGSASPADVFLT ------------------------------------------------!!!!-------- ENSPAMVLVDNAKLFAPLDAVTQAQVAQEYRPEHGRWTGIAARSTVFVYNPEKISEAELP ---------1111-------------3333-------------------1111-3333-- KSIMDLAKPEWKGRWAASPSGADFQAIVSAMLELKGEKATLEWLKAMKTNFTAYKGNSTV -3333--3333------1111--------------------------------------- MKAVNAGQIDGGVIYHYYRFVDQAKTGENSGKTQLHYFKHQDPGAFVSISGGGVLASSKH ---1111--------------1111-1111--------%%%%1111--------1111-- PKEAQEFVKWITGKSGQDILRTNNAFEYAVGVDAASNPKLVPLKDLDAPKVEPSKLNSKK ------------------------------2222--3333-3333------1111----- VVELMTEAGLL -----1111-- >SFUA; SWP:P21408; PDB:1XVYA; GIVIYNAQHENLVKSWVDGFTKDTGIKVTLRNGGDSELGNQLVQEGSASPADVFLTENSP --------------------------------------------!!!!------------ 
AMVLVDNAKLFAPLDAATLAQVEPQYRPSHGRWIGIAARSTVFVYNPAKLSDAQLPKSLL -----1111-----333333333333-3333--------------3333-3333---333 DLAKPEWKGRWAASPSGADFQAIVSALLELKGEKATLAWLKAMKTNFTAYKGNSTVMKAV 3--3333------1111------------------------------------------- NAGQVDSGVIYHYYPFVDGAKTGENSNNIKLYYFKHQDPGAFVSISGGGVLASSKHQQQA ----------3333--------1111--------%%%%1111--------1111------ QAFIKWITGKQGQEILRTNNAFEYAVGVGAASNPKLVPLKDLDAPKVDAAQLNSKKVVEL ---------3333-3333--------2222--3333-3333------3333--------- MTEAGLL -1111-- >SULFIREDOXIN; SWP:Q9BYN0; PDB:1XW3A; GPHMSIHSGRIAAVHNVPLSVLIRPLPSVLDPAKVQSLVDTIREDPDSVPPIDVLWIKGA -----------------3333-----------------------3333----------11 QGGDYFYSFGGCHRYAAYQQLQRETIPAKLVQSTLSDLRVYLGASTPDLQ 11----------------1111-------------------!!!!----- >GLUTATHIONE S-TRANSFERASE; SWP:P09488; PDB:1XW6A; PMILGYWDIRGLAHAIRLLLEYTDSSYEEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNLP ---------!!!!-------1111-----------------3333--1111--------- YLIDGAHKITQSNAILCYIARKHNLCGETEEEKIRVDILENQTMDNHMQLGMICYNPEFE ---!!!!---3333--------------------------------------1111-333 KLKPKYLEELPEKLKLYSEFLGKRPWFAGNKITFVDFLVYDVLDLHRIFEPKCLDAFPNL 3-------------------!!!!-1111--------------------11111111--- KDFISRFEGLEKISAYMKSSRFLPRPVFSKMAVWGNK ----------3333----3333------1111----- >UPF0271 PROTEIN YBGL; SWP:P75746; PDB:1XW8A; KIDLNADLGEGCASDAELLTLVSSANIACGFHAGDAQIQACVREAIKNGVAIGAHPSFPS ---------------3333---------------3333---------------------- AQLPPETVYAQTLYQIGALATIARAQGGVRHVKPHGLYNQAAKEAQLADAIARAVYACDP ---3333----------------1111------------3333---------------11 ALILVGLAGSELIRAGKQYGLTTREEVFADRGYQADGSLVPRSQSGEEQALAQTLEVQHG 11----2222------1111-------------3333----------------------- RVKSITGEWATVAAQTVCLHGDGHALAFARRLRSAFIVVAALEH ---1111-----------------3333---------------- >THIOREDOXIN; SWP:Q9V429; PDB:1XWAA; AMVYQVKDKADLDGQLTKASGKLVVLDFFATWCGPCKMISPKLVELSTQFADNVVVLKVD ------------------!!!!-------1111--------------------------3 VDECEDIAMEYNISSMPTFVFLKNGVKVEEFAGANAKRLEDVIKANI 
333-----1111----------iiii--------------------- >Follicle-stimulating horm; SWP:P23945; PDB:1XWDC; CHHRICHCSNRVFLCQESKVTEIPSDLPRNAIELRFVLTKLRVIQKGAFSGFGDLEKIEI ------------------------------------------------------------ SQNDVLEVIEADVFSNLPKLHEIRIEKANNLLYINPEAFQNLPNLQYLLISNTGIKHLPD ----------------1111-------1111----------1111--------------- VHKIHSLQKVLLDIQDNINIHTIERNSFVGLSFESVILWLNKNGIQEIHNCAFNGTQLDE 1111------------1111-------2222---------------------2222---- LNLSDNNNLEELPNDVFHGASGPVILDISRTRIHSLPSYGLENLKKLRARSTYNLKKLPT ----------------------------------------1111---------------- LE -- >COMPLEMENT C5; SWP:P01031; PDB:1XWEA; GSHMADCGQMQEELDLTISAETRKQTACKPEIAYAYKVSITSITVENVFVKYKATLLDIY ------------------3333-1111-3333---------------------------- KTGEAVAEKDSEITFIKKVTCTNAELVKGRQYLIMGKEALQIKYNASFRYIYPLDSLTWI -----------------1111---------------------------------3333-- EYWPRDTTCSSCQAFLANLDEFAEDIFLNGC -------------1111-------------- >AUTOIMMUNE REGULATOR; SWP:O43918; PDB:1XWHA; GAMAQKNEDECAVCRDGGELICCDGCPRAFHLACLSPPLREIPSGTWRCSSCLQATVQEV ------------------------------3333-------------------------- QPRAEE ------ >SKD1 PROTEIN; SWP:O75351; PDB:1XWIA; AIVIERPNVKWSDVAGLEGAKEALKEAVILPIKFPHLFTGKRTPWRGILLFGPPGTGKSY ---------1111-----------------3333----!!!!------------------ LAKAVATEANNSTFFSISSSDLVSKWLGESEKLVKNLFQLARENKPSIIFIDEIDSLCGS -----------------------------3333--------------------------- RSENESEAARRIKTEFLVQMQGVGVDNDGILVLGATNIPWVLDSAIRRRFEKRIYIPLPE --------------------------2222-------3333------------------- PHARAAMFKLHLGTTQNSLTEADFRELGRKTDGYSGADISIIVRDALMQPVRKVQSATHF ---------3333------3333-------2222-------------------------- KKVRGPSRADPNHLVDDLLTPCSPGDPGAIEMTWMDVPGDKLLEPVVSMSDMLRSLSNTK ---------3333------------2222---3333-1111--------------1111- PTVNEHDLLKLKKFTEDFGQEG ---------------------- >PHOSPHATE UPTAKE REGULATO; SWP:NA; PDB:1XWMA; TFADDLASLHNKLIEMGRLTEVALQQAIEAFQTQNANLAMAVIDGDGSIDALEEEVNDFA -3333------------------------------------------------------- 
LWLIAAQQPVATDLRRIVAAIKIASDIERIADFAVNIAKACIRIGGQPFVMDIGPLVLMY -----------------------------------------1111--------------- RLATDMVSTAIAAYDREDASLAAQIADMDHRVDEQYGEMMASLLAVAKTDAATLAQMNVL ---------------------3333----------------------------------- ALVARYIERTADHATNIAEHLVYLVKGKHYDF -------------------------------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:Q9Y3C6; PDB:1XWNA; MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDF -----------------3333--------------------------------------- MIQGGDPTGTGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVTLAPTQ ------%%%%-----3333----------------------------------------1 WLDGKHTIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG 111-----------3333---------------------------- >DER F II; SWP:Q00855; PDB:1XWVA; DQVDVKDCANNEIKKVMVDGCHGSDPCIIHRGKPFTLEALFDANQNTKTAKIEIKASLDG -----------------2222!!!!----2222------------------------iii LEIDVPGIDTNACHFMKCPLVKGQQYDAKYTWNVPKIAPKSENVVVTVKLVGDNGVLACA i-------------------2222----------1111---------------------- IATHAKIRD --------- >Low molecular weight phos; SWP:P24666; PDB:1XWWA; AEQATKSVLFVCLGNICRSPIAEAVFRKLVTDQNISENWVIDSGAVSDWNVGRSPDPRAV ------------------------------11113333---------1111--------- SCLRNHGIHTAHKARQITKEDFATFDYILCMDESNLRDLNRKSNQVKTCKAKIELLGSYD ---1111----------3333--------------------3333---------3333-1 PQKQLIIEDPYYGNDSDFETVYQQCVRCCRAFLEKAH 111------11113333-------------------- >DEOXYRIBONUCLEASE TATD; SWP:P27859; PDB:1XWYA; MFDIGVNLTSSQFAKDRDDVVACAFDAGVNGLLITGTNLRESQQAQKLARQYSSCWSTAG ------11111111----------1111-------------------------------- VHPHDSSQWQAATEEAIIELAAQPEVVAIGECGLDFNRNFSTPEEQERAFVAQLRIAADL -333311113333---------3333---------------------------------- NMPVFMHCRDAHERFMTLLEPWLDKLPGAVLHCFTGTREEMQACVAHGIYIGITGWVCDE ------------------11111111------------------1111-----3333--- RRGLELRELLPLIPAEKLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGED ---3333-3333-1111-----------1111---------3333---------1111-- AAWLAATTDANVKTLFGIAF -------------------- >SPHINGOMYELINASE I; SWP:Q8I914; PDB:1XX1A; 
ADNRRPIWNLAHMVNAVAQIPDFLDLGANALEADVTFKGSVPTYTYHGTPCDFGRDCIRW ---------------3333----3333----------!!!!----------2222----- EYFNVFLKTLREYTTPGNAKYRDGFILFVLDLKTGSLSNDQVRPAGENVAKELLQNYWNN --------------2222---1111--------11113333----------------%%% GNNGGRAYVVLSLPDIGHYEFVRGFKEVLKKEGHEDLLEKVGYDFSGPYLPSLPTLDATH %-------------1111---------------33331111------------------- EAYKKAGVDGHIWLSDGLTNFSPLGDMARLKEAIKSRDSANGFINKIYYWSVDKVSTTKA ---1111-------------------------------1111------------------ ALDVGVDGIMTNYPNVLIGVLKESGYNDKYRLATYDDNPWETFKN -------------------------1111----11111111---- >3,2-TRANS-ENOYL-COA ISOME; SWP:P23965; PDB:1XX4A; FSNKRVLVEKAGIAVMKFKNPPVNSLSLEFLTEFVISLEKLENDKSIRGVILTSERPGIF -------------------------------------------1111------------- SAGLDLMEMYGRNPAHYAEYWKAVQELWLRLYLSNLTLISAINGASPAGGCLMALTCDYR ----3333----3333--------------1111-------------3333--1111--- IMADNSKYTIGLNESLLGIVAPFWLKDNYVNTIGHRAAERALQLGTLFPPAEALKVGLVD ----1111----3333-------------------------------------------- EVVPEDQVHSKARSVMAKWFTIPDHSRQLTKSMMRKATADNLIKQREADIQNFTSFISRD ---3333----------------------------------3333--------------- SIQKSLHVYLEKLK -------------- >THYMIDINE KINASE; SWP:Q97F65; PDB:1XX6A; YRPKDHGWVEVIVGPYSGKSEELIRRIRRAKIAKQKIQVFKPEEDVVSHGEKEQAVAIKN --2222-----------------------------------------------------3 SREILKYFEEDTEVIAIDEVQFFDDEIVEIVNKIAESGRRVICAGLDDFRGKPFGPIPEL 333-----3333------3333-------------------------1111--!!!!--- AIAEFVDKIQAICVVCGNPATRTQRLINGKPAFYDDPVESYEARCRKCHVVPQ --------------------------iiii--------------3333----- >OXETANOCIN-LIKE PROTEIN; SWP:Q8U3R1; PDB:1XX7A; SIDLILLAGKLKRIPRMGWLIKGVPNPESVADHSYRVAFITLLLAEELKKKGVEIDVEKA ---------3333------3333-------------------------1111-------- LKIAIIHDLGEAIITDLPLSAQKYLNKEEAEAKALKDVLPEYTELFEEYSKALTLEGQLV -----11113333----3333-----------------1111------3333-------- KIADKLDMIIQAYEYELSGAKNLSEFWNALEDLEKLEISRYLREIIEEVRRL ---------------1111---3333-----3333----------------- >SAC7D; SWP:P13123; PDB:1XX8A; 
MVKVKFKYKGEEKEVDTSKIKKVARVGKMVSFTYDDNGKTGRGAVSEKDAPKELLDMLAR -------%%%%----3333--------------------------3333-3333---111 AEREKK 1----- >COAGULATION FACTOR XI; SWP:P03951; PDB:1XX9A; IVGGTASVRGEWPWQVTLHTTSPTQR -------22221111----------- ------------------------------------------------------------ ----------- >Ecotin [Precursor]; SWP:P23827; PDB:1XXDC; PLEKIAPYPQAEKGMKRQVIQLTPQEDESTLKVELLIGQTLEVDCNLHRLGGKLENKTLE -3333---------------------3333----------------------------22 GWGYDYYVFDKVSSNDFTRVVCPDGKKEKKFVTAYLGDAGMLRYNSKLPIVVYTPDNVDV 22---------------------------------!!!!-----3333------------ KYRVWKAEEKIDNAVVR ----------------- >ENTEROTOXIN; SWP:Q5D1K7; PDB:1XXGA; QPDPKLDELNKVSDYKSNKGTMGNVMNLYMSPPVEGRGVINSRQFLSHDLIFPIEYKSYN ----3333-------------------------------------1111------!!!!- EVKTELENTELANNYKGKKVDIFGVPYFYTCIIPKSEPFGGCCMYGGLTFNSSENRDKLI --------------2222---------2222----------------------------- TVQVTIDNRQSLGFTITTNKNMVTIQELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEK -----%%%%----------------------------------1111-----------11 NNTSFWFDLFPKKELVPFVPYKFLNIYGDNKVVDSKSIKMEVFLNTH 11---------1111---1111-3333------3333---------- >YCGJ PROTEIN; SWP:NA; PDB:1XXLA; IKTAECRAEHRVLDIGAGAGHTALAFSPYVQECIGVDATKEVEVASSFAQEKGVENVRFQ 3333--1111------!!!!-----3333------------------------------- QGTAESLPFPDDSFDIITCRYAAHHFSDVRKAVREVARVLKQDGRFLLVDHYAPEDPVLD --3333---------------3333---------------2222-----------3333- EFVNHLNRLRDPSHVRESSLSEWQAFSANQLAYQDIQKWNLPIQYDSWIKRGGTPADREK ----------3333-----------------------------------1111------- QIITHLNHASDEARDTFCITLNQNGQPISFCLKAILIQGIKREG -----1111------------1111------------------- >MANNOSE-BINDING LECTIN; SWP:Q8LGR3; PDB:1XXQA; TQTTGTSQTIEVGLWGGPGGNAWDDGSYTGIREINLSHGDAIGAFSVIYDLNGQPFTGPT --------------------------------------------------iiii------ HPGNEPSFKTVKITLDFPNEFLVSVSGYTGVLARLATGKDVIRSLTFKTNKKTYGPYGKE ----3333-----------------------1111------------------------- EGTPFSLPIENGLIVGFKGRSGFVVDAIGFHLSL ---------------------------------- >PHOSPHOLIPASE 
A2 HOMOLOG ; SWP:Q9I834; PDB:1XXSA; SLFELGKMILQETGKNPAKSYGVYGCNCGVGGRGKPKDATDRCCYVHKCCYKKLTGCDPK ---------------3333--------------------------------------333 KDRYSYSWKDKTIVCGENNSCLKELCECDKAVAICLRENLDTYNKKYRYNYLKPACKKAD 3-------%%%%--------------------------3333-3333--1111------- PC -- >DIHYDRODIPICOLINATE SYNTH; SWP:P63945; PDB:1XXXA; GFDVAARLGTLLTAMVTPFSGDGSLDTATAARLANHLVDQGCDGLVVSGTTGESPTTTDG --3333-------------1111--------------1111-------33333333---- EKIELLRAVLEAVGDRARVIAGAGTYDTAHSIRLAKACAAEGAHGLLVVTPYYSKPPQRG ------------1111----------------------1111------------------ LQAHFTAVADATELPMLLYDIPGRSAVPIEPDTIRALASHPNIVGVKDAKADLHSGAQIM -------1111---------3333---------------1111----------------- ADTGLAYYSGDDALNLPWLAMGATGFISVIAHLAAGQLRELLSAFGSGDIATARKINIAV ----------3333----1111------3333---------------------------- APLCNAMSRLGGVTLSKAGLRLQGIDVGDPRLPQVAATPEQIDALAADMRAASVLR -------------------------------------------------------- >UNKNOWN PROTEIN; SWP:NA; PDB:1XY7A; HLVFTEFKQLLVEAQKVGDAVTFYKSAFGAIESHVLSSELNLAGSSFVVCDVSSLPGFST ------------2222-------------------------iiii-----33332222-- AKSEGSGVTFLLGTKDAEAAVAKAVDAGAVKVEVTEAEVELGFKGKVTDPFGVTWIFAE -3333-------------------1111--------------------1111------- >putative N-acetyl-gamma-g; SWP:Q93Z70; PDB:1XYGA; KDIRIGLLGASGYTGAEIVRLLANHPHFQVTLMTADRKAGQSMESVFPHLRAQKLPTLVS ---------------------1111----------1111--3333-3333---------3 VKDADFSTVDAVFCCLPHGTTQEIIKELPTALKIVDLSADFRLRNIAEYEEWYGQPHKAV 333-1111----------------11111111--------------------------33 ELQKEVVYGLTEILREDIKKARLVANPGCYPTTIQLPLVPLLKANLIKHENIIIDAKSGV 331111----------3333--------3333---------1111--------------- SGAGRGAKEANLYSEIAEGISSYGVTRHRHVPEIEQGLSDVAQSKVTVSFTPHLMPMIRG 1111---3333----------------1111----------------------------- MQSTIYVEMAPGVRTEDLHQQLKTSYEDEEFVKVLDEGVVPRTHNVRGSNYCHMSVFPDR ---------22223333--------1111------2222--33332222----------- IPGRAIIISVIDNLVKGASGQALQNLNIMLGYPETTGLLHQPLFP 2222-------1111------------1111-1111--------- >CYCLOPHILIN-LIKE PROTEIN ; 
SWP:Q9H2H8; PDB:1XYHA; MSVTLHTDVGDIKIEVFCERTPKTCENFLALCASNYYNGCIFHRNIKGFMVQTGDPTGTG ------1111-------3333-------------3333-------2222-----1111-- RGGNSIWGKKFEDEYSEYLKHNVRGVVSMANNGPNTNGSQFFITYGKQPHLDMKYTVFGK ----1111-------3333----------------------------1111--------- VIDGLETLDELEKLPVNEKTYRPLNDVHIKDITIHANPFA ----------1111-------------------------- >DNA-BINDING PROTEINS 7A/7; SWP:P13123; PDB:1XYIA; MVKVKFKYKGEEKEVDTSKIKKVWRAGKAVSFTYDDNGKTGRGAVSEKDAPKELLDMLAR -------iiii----3333------!!!!------iiii------3333----------- AEREKK 3333-- >PRION PROTEIN; SWP:O18754; PDB:1XYJA; VVGGLGGYMLGSAMSRPLIHFGNDYEDRYYRENMYRYPNQVYYRPVDQYSNQNNFVHDCV --------------------------33331111---------------------1111- NITVRQHTVTTTTKGENFTETDMKIMERVVEQMCVTQYQKESEAYYQRRAS --------------------------------------------------- >ENDO-1,4-BETA-XYLANASE I; SWP:P36218; PDB:1XYN; ASINYDQNYQTGGQVSYSPSNTGFSVNWNTQDDFVVGVGWTTGSSAPINFGGSFSVNSGT -------------------1111------------------------------------- GLLSVYGWSTNPLVEYYIMEDNHNYPAQGTVKGTVTSDGATYTIWENTRVNEPSIQGTAT ------------------------------------iiii-------------1111--- FNQYISVRNSPRTSGTVTVQNHFNAWASLGLHLGQMNYQVVAVEGWGGSGSASQSVSN --------------------------1111---------------------------- >MAJOR PRION PROTEIN; SWP:P49927; PDB:1XYQA; VVGGLGGYMLGSAMSRPLIHFGSDYEDRYYRENMYRYPNQVYYRPVDQYSNQNSFVHDCV ----------------------1111-3333-3333------------------------ NITVKQHTVTTTTKGENFTETDVKMIERVVEQMCITQYQKEYEAYAQRGAS ----------3333------------------------------------- >MAJOR PRION PROTEIN; SWP:P67986; PDB:1XYWA; VVGGLGGYMLGSAMSRPLIHFGNDYEDRYYRENMYRYPNQVYYRPVDQYNNQNTFVHDCV ---------------------------------3333-------3333--3333------ NITVKQHTVTTTTKGENFTETDIKMMERVVEQMCITQYQRESEAYYQRGAS ----------1111------------------------------------- >MAJOR PRION PROTEIN; SWP:P04925; PDB:1XYXA; VVGGLGGYMLGSAMSRPMIHFGNDWEDRYYRENMYRYPNQVYYRPVDQYSNQNNFVHDCV --------------------------------------------3333------------ NITIKQHTVTTTTKGENFTETDVKMMERVVEQMCVTQYQKESQAYYDGRRSS 
----------3333-------------------------------------- >1,4-BETA-D-XYLAN-XYLANOHY; SWP:P10478; PDB:1XYZA; NALRDYAEARGIKIGTCVNYPFYNNSDPTYNSILQREFSMVVCENEMKFDALQPRQNVFD -3333--1111-------3333----------------------11113333--2222-- FSKGDQLLAFAERNGMQMRGHTLIWHNQNPSWLTNGNWNRDSLLAVMKNHITTVMTHYKG -----------1111--------------3333------------------------222 KIVEWDVANECMDDSGNGLRSSIWRNVIGQDYLDYAFRYAREADPDALLFYNDYNIEDLG 2-----------3333------------1111-----------1111------------- PKSNAVFNMIKSMKERGVPIDGVGFQCHFINGMSPEYLASIDQNIKRYAEIGVIVSFTEI -------------1111-------------------------------1111-------- DIRIPQSENPATAFQVQANNYKELMKICLANPNCNTFVMWGFTDKYTWIPGTFPGYGNPL ----1111----------------------1111--------3333-3333-2222---- IYDSNYNPKPAYNAIKEALM --1111-------------- >PYRR BIFUNCTIONAL PROTEIN; SWP:P41007; PDB:1XZNA; MQKAVVMDEQAIRRALTRIAHEIIERNKGIDGCVLVGIKTRGIYLARRLAERIEQIEGAS -----------------------------2222--------------------------- VPVGELDITLYRVPFPVTERNVILVDDVLFTGRTVRAAMDAVMDLGRPARIQLAVLVDRG ----------------2222---------------------1111--------------- HRELPIRADFVGKNVPTSRSELIVVELSEVDGIDQVSIHEK -----------------1111-----3333----------- >HYPOTHETICAL PROTEIN YPMQ; SWP:P54178; PDB:1XZOA; QQIKDPLNYEVEPFTFQNQDGKNVSLESLKGEVWLADFIFTNCETICPPMTAHMTDLQKK -----------------1111---33332222---------------------------- LKAENIDVRIISFSVDPENDKPKQLKKFAANYPLSFDNWDFLTGYSQSEIEEFALKSFKA -1111----------3333---------1111---------------------------- IVKKPEGEDQVIHQSSFYLVGPDGKVLKDYNGVENTPYDDIISDVKSASTLK --------------------1111----------------------3333-- >PROBABLE TRNA MODIFICATIO; SWP:Q9WYA4; PDB:1XZPA; LMDTIVAVATPPGKGAIAILRLSGPDSWKIVQKHLRTRSKIVPRKAIHGWIHENGEDVDE -----------------------1111----1111------2222--------------- VVVVFYKSPKSYTGEDMVEVMCHGGPLVVKKLLDLFLKSGARMAEPGEFTKRAFLNGKMD ----------3333------------------------------2222-----1111--- LTSAEAVRDLIEAKSETSLKLSLRNLKGGLRDFVDSLRRELIEVLAEIRVELDYPDEIET --------------------------------------------------3333------ 
NTGEVVTRLERIKEKLTEELKKADAGILLNRGLRMVIVGKPNVGKSTLLNRLLNEDRAIV ---------------------------------------3333----------1111--- TDIPGTTRDVISEEIVIRGILFRIVDTAGVRSETNDLVERLGIERTLQEIEKADIVLFVL -3333-----------iiii---------------------------------------- DASSPLDEEDRKILERIKNKRYLVVINKVDVVEKINEEEIKNKLGTDRHMVKISALKGEG 1111------------1111--------------------------1111---3333--- LEKLEESIYRETQEIFERGSDSLITNLRQKQLLENVKGHLEDAIKSLKEGMPVDMASIDL ------------------1111------------------------1111---------- ERALNLLDEVTGRSFREDLLDTIFSNFCVGK ------3333------------3333----- >PURPLE ACID PHOSPHATASE; SWP:Q9SE00; PDB:1XZWA; LPNAEDVDMPWDSDVFAVPSGYNAPQQVHITQGDYEGRGVIISWTTPYDKAGANKVFYWS --3333---11111111------------------------------------------- ENSKSQKRAMGTVVTYKYYNYTSAFIHHCTIKDLEYDTKYYYRLGFGDAKRQFWFVTPPK -----------------!!!!------------------------!!!!----------- PGPDVPYVFGLIGDIGQTHDSNTTLTHYEQNSAKGQAVLFMGDLSYSNRWPNHDNNRWDT -1111-------------------------3333-----------333322223333--- WGRFSERSVAYQPWIWTAGNHEIDYAPDIGEYQPFVPFTNRYPTPHEASGSGDPLWYAIK ----3333----------1111----1111--------------3333----3333---- RASAHIIVLSSYSGFVKYSPQYKWFTSELEKVNRSETPWLIVLVHAPLYNSYEAHYMEGE !!!!-----1111--2222--------3333-1111----------------22221111 AMRAIFEPYFVYYKVDIVFSGHVHSYERSERVSNVAYNIVNAKCTPVSDESAPVYITIGD ----------1111------------------------1111------1111-------- GGNSEGLASEMTQPQPSYSAFREASFGHGIFDIKNRTHAHFSWHRNQDGASVEADSLWLL --3333---------1111---------------1111------33331111-------- NRYW ---- >INOSITOL 1,4,5-TRISPHOSPH; SWP:P11881; PDB:1XZZA; SFLHIGDICSLYAEGSTNGFISTLGLVDDRCVVQPEAGDLNNPPKKFRDCLFKLCPMNRY ---2222--------------------------1111-1111---3333----------- SAQKQFWKASTTDAVLLNKLHHAADLEKKQNETENRKLLGTVIQYGNVIQLLHLKSNKYL -----------------------------------1111----2222------1111--- TVNKRLPALLEKNAMRVTLDEAGNEGSWFYIQPFYKLRSIGDSVVIGDKVVLNPVNAGQP --1111--------------------------------2222--2222------------ LHASSHQLVDNPGCNEVNSVNCNTSWKIVLFLEHHH -------3333---------------------3333 >FYVE-RING FINGER PROTEIN ; SWP:Q8WZ73; 
PDB:1Y02A; PSCKSCGAHFANTARKQTCLDCKKNFCMTCSSQPRLCLLCQRFRATAFQREELMKMKVKD -----------3333-----------1111-------------1111-----1111---- LRDYLSLHDISTEMCREKEELVLLVLGQQPV -----1111--1111-3333-----1111-- >ANTIFREEZE PEPTIDE SS-3; SWP:P04367; PDB:1Y03A; GSMNAPARAAAKTAADALAAAKKTAADAAAAAAAA ---3333---------------------------- >DESULFOFERRODOXIN (RBO); SWP:O83795; PDB:1Y07A; RELSFFLQFFLGMDAPAGSSVACGSEVLRAVPVGTVDAAKEKHIPVVEVHGHEVKVKVGS ---------------2222---!!!!------------3333-------!!!!------- VAHPMTPEHYIAWVCLKTRKGIQLKELPVDGAPEVTFALTADDQVLEAYEFCNLHGVWSG -----1111--------1111------1111--------1111----------------- K - >HYPOTHETICAL PROTEIN SPY0; SWP:Q9F1R7; PDB:1Y08A; TSVWTKGVTPPANFTQGEDVFHAPYVANQGWYDITKTFNGKDDLLSGAATAGNMLHWWFD ----2222--------1111---------------------1111--------------- QNKDQIKRYLEEHPEKQKINFNGEQMFDVKEAIDTKNHQLDSKLFEYFKEKAFPYLKHLG ------------3333----iiii--------1111----------------2222---- VFPDHVIDMFINGYRLSLTNHGPTPVKEGSKDPRGGIFDAVFTRGDQSKLLTSRHDFKEK ----------------1111-----------11111111------3333----------- NLKEISDLIKKELTEGKALGLSHTYRINHVINLWGADFDSNGNLKAIYVTDSDSNASIGM --------------------------------------1111--------33333333-- KKYFVGVNSAGKVAISAKEIKEDNIGAQVLGLFTLSTGQDSWNQTN -------1111---------1111-------------1111-1111 >XANTHINE PHOSPHORIBOSYLTR; SWP:P42085; PDB:1Y0BA; AELKRKIEEEGVVLSDQVLKVDSFLNHQIDPLLQRIGDEFASRFAKDGITKIVTIESSGI -------------%%%%---1111-----3333--------1111-----------3333 APAVTGLKLGVPVVFARKHKSLTLTDNLLTASVYSFTKQTESQIAVSGTHLSDQDHVLII 3333--3333----------1111----------------------3333-1111----- DDFLANGQAAHGLVSIVKQAGASIAGIGIVIEKSFQPGRDELVKLGYRVESLARIQSLEE -----------------1111-----------1111----------------------%% GKVSFVQE %%------ >Putative N-acetylmannosam; SWP:P65517; PDB:1Y0EA; ALPHGLIVSCQALPDEPLHSSFISKALAAYEGGAVGIRANTKEDILAIKETVDLPVIGIV ------------2222---3333------1111--------------------------- KRDYDHSDVFITATSKEVDELIESQCEVIALDATLQQRPKETLDELVSYIRTHAPNVEIA -----------------------------------------------------1111--- 
DIATVEEAKNAARLGFDYIGTTLHGYTSYTQGQLLYQNDFQFLKDVLQSVDAKVIAEGNV --------------------1111--1111---3333%%%%------------------- ITPDYKRVDLGVHCSVVGGAITRPKEITKRFVQVE -----------------1111-------------- >PROTEIN YCEI; SWP:P37904; PDB:1Y0GA; ADYKIDKEGQHAFVNFRIQHLGYSWLYGTFKDFDGTFTFDEKNPAADKVNVTINTTSVDT ------1111---------%%%%----------------33331111------------- NHAERDKHLRSADFLNTAKYPQATFTSTSVKKDGDELDITGDLTLNGVTKPVTLEAKLIG ----------3333-3333-------------!!!!--------iiii------------ QGDDPWGGKRAGFEAEGKIKLKDFNIKTDLGPASQEVDLIISVEGVQQK -------------------3333-------1111--------------- >HYPOTHETICAL PROTEIN RV07; SWP:NA; PDB:1Y0HA; GMTSPVAVIARFMPRPDARSALRALLDAMITPTRAEDGCRSYDLYESADGGELVLFERYR 1111----------3333------------3333-1111-------1111---------- SRIALDEHRGSPHYLNYRAQVGELLTRPVAVTVLAPLDEAS -------------------3333------------------ >HYPOTHETICAL PROTEIN PA45; SWP:Q9HVP2; PDB:1Y0KA; NEADYLRLLTRQAEQANDFLSNARKWDRERWVCQRFLEALNVPYRQEDFAAPGEQPPDVL -------------------1111--------------1111---3333------------ FKGAGFEVFFVLDERPQRIAAAELQARLAPTLRKKAHNYSERGIDHGELDLLAFVNLKRA iiii----------------------------------------1111--------1111 VPDFNTPFPPPTEYLRQGWRSLSVGPTFARVLFAHSGAPEFLRANLGRSILFDAGVGL --1111--------3333------1111------111133331111------2222-- >CATALYTIC ANTIBODY FAB 34; SWP:NA; PDB:1Y0LH; EVKLLESGGGLAQPGGSLKLSCAASGFDFRRYWMTWVRQAPGKGLEWIGEINPDSRTINY ------------2222-----------3333--------2222----------------- MPSLKDKFIISRDNAKNSLYLQLSRL -------------1111--------- >CATALYTIC ANTIBODY FAB 34; SWP:NA; PDB:1Y0LL; ELVVTQESALTTSPGETVTLTCRSSSAV ------------2222------------ >1-phosphatidylinositol-4,; SWP:P10686; PDB:1Y0MA; TFKSAVKALFDYKAQREDELTFTKSAIIQNVEKQDGGWWRGDYGGKKQLWFPSNYVEEMI ---------------1111---2222----------------iiii-----1111----- N - >HYPOTHETICAL UPF0270 PROT; SWP:Q9HYE3; PDB:1Y0NA; HMLIPHDLLEADTLNNLLEDFVTRETPLDVRVERARHALRRGEAVILFDPESQQCQLMLR ----1111------------1111--------------1111----------------33 SEVPAELLRD 33-3333--- >FUMARATE REDUCTASE FLAVOP; SWP:Q02469; PDB:1Y0PA; 
ADNLAEFHVQNQECDSCHTPDGELSNDSLTYENTQCVSCHGTLAEVAETTKHEHYNAHAS -------3333-1111--1111---1111-----------------1111-11111111- HFPGEVACTSCHSAHEKSMVYCDSCHSFDFNMPYAKKWLRDEPTIAELAKDKSERQAALA ------1111---------3333--------------------3333------------- SAPHDTVDVVVVGSGGAGFSAAISATDSGAKVILIEKEPVIGGNAKLAAGGMNAAWTDQQ -------------------------1111--------------3333------------- KAKKITDSPELMFEDTMKGGQNINDPALVKVLSSHSKDSVDWMTAMGADLTDVGMMGGAS 1111---------------------------------------1111--------2222- VNRAHRPTGGAGVGAHVVQVLYDNAVKRNIDLRMNTRGIEVLKDDKGTVKGILVKGMYKG ------2222---------------1111--------------1111------------- YYWVKADAVILATGGFAKNNERVAKLDPSLKGFISTNQPGAVGDGLDVAENAGGALKDMQ ---------------3333-------3333-------1111-3333---1111----111 YIQAHPTLSVKGGVMVTEAVRGNGAILVNREGKRFVNEITTRDKASAAILAQTGKSAYLI 1-------------------1111----1111-------------------2222----- FDDSVRKSLSKIDKYIGLGVAPTADSLVKLGKMEGIDGKALTETVARYNSLVSSGKDTDF ------------------------------------------------------------ ERPNLPRALNEGNYYAIEVTPGVHHTMGGVMIDTKAEVMNAKKQVIPGLYGAGEVTGGVH --------------------------------1111---1111--2222---3333---! 
GANRLGGNAISDIITFGRLAGEEAAKYS !!!-2222-------------------- >ARSENICAL RESISTANCE OPER; SWP:O30069_ARCFU; PDB:1Y0UA; HLEEWIKADSLEKADEYHKRYNYAVTNPVRRKILRLDKGRSEEEIQTLSLSKKQLDYHLK -----------------------------------1111-3333-1111----------- VLEAGFCIERVGERWVVTDAGKIV --1111----!!!!----1111-- >FRV OPERON PROTEIN FRVX; SWP:O59196; PDB:1Y0YA; MVDYELLKKVVEAPGVSGYEFLGIRDVVIEEIKDYVDEVKVDKLGNVIAHKKGEGPKVMI ---------------22221111--------1111------1111--------------- AAHMDQIGLMVTHIEKNGFLRVAPIGGVDPKTLIAQRFKVWIDKGKFIYGVGASAPDWDQ --------------1111----------3333----------2222----------1111 IFIDIGAESKEEAEDMGVKIGTVITWDGRLERLGKHRFVSIAFDDRIAVYTILEVAKQLK ------------------2222-------------------------------------- DAKADVYFVATVQEEVGLRGARTSAFGIEPDYGFAIDVTIAADIPGTPEHKQVTHLGKGT ------------3333---------------------------22221111---2222-- AIKIMDRSVICHPTIVRWLEELAKKHEIPYQLEILLGGGTDAGAIHLTKAGVPTGALSVP -----1111------------------------------3333----!!!!--------- ARYIHSNTEVVDERDVDATVELMTKALENIHELKI ----------------------------3333--- >PROTEIN PRODUCT OF AT3G21; SWP:Q9LIG0; PDB:1Y0ZA; LLLVETPIPQQKHYESKPFPAVISPPPALSLPLFTQTIKTQKHYLDSLLHESGAVLFRGF -------3333--iiii------------3333--------------------------- PVNSADDFNDVVEAFGFDELPYVGGAAPRTSVVGRVFTANESPPDQKIPFHHEMAQVREF ------------3333----------------!!!!------3333-------1111--- PSKLFFYCEIEPKCGGETPIVLSHVVYERMKDKHPEFVQRLEEHGLLYVRVLGEDDDPSS --------------------------------------------------------1111 PIGRGWKSTFLTHDKNLAEQRAVDLGMKLEWTEDGGAKTVMGPIPAIKYDESRNRKVWFN ------------------------------------------------------------ SMVAAYTGWEDKRNDPRKAVTFGDGKPLPADIVHDCLRILEEECVAVPWQRGDVLLIDNW ----------11113333---1111------------------------2222----333 AVLHSRRPFDPPRRVLASLCK 3-------------------- >HYPOTHETICAL PROTEIN PA00; SWP:Q9I747; PDB:1Y12A; AVDFIKIGDVKGESKDKTHAEEIDVLAWSWGSQSGSHGGGGAGKVNVQDLSFTKYIDKST ------!!!!-----1111-------------------------------------3333 PNLACSSGKHYPQAKLTIRKAGGENQVEYLIITLKEVLVSSVSTGGSGGEDRLTENVTLN 
----3333--------------1111---------------------------------- FAQVQVDYQPQKADGAKDGGPVKYGWNIRQNVQA -----------1111------------1111--- >6-PYRUVOYL TETRAHYDROPTER; SWP:Q6LEZ4; PDB:1Y13A; NSSAEVSVESPSFSFNCAHFIAYNGFRETLHGHNYNVSLKVRGYVRDDGYVIDFSILKEK ---------3333---------2222-------------------1111---3333---- VKKVCNKLDHHFILPIYSDVLKFENVKNNIKIICEDNSEYSFPERDCIKLPIKHSSTEEI --------------1111-------!!!!----1111-----3333-------------- GQYILNQLIEEDVSLLKSRHIHYIEISVSESPTQKAIVHKYI ----------------1111----------1111-------- >DNA-directed RNA polymera; SWP:P20433; PDB:1Y14A; ELIALNLSEARLVIKEALVERRRAFKRSQTREKELESIDVLLEQTTGGNNKDLKNTMQYL ------------------------------3333-------------------------- TNFSRFRDQETVGAVIQLLKSTGLHPFEVAQLGSLACDTADEAKTLIPSLNNKISDDELE ----------------------------------------------1111---------- RILKELSNLETLY ------1111--- >DNA-directed RNA polymera; SWP:P34087; PDB:1Y14B; MFFIKDLSLNITLHPSFFGPRMKQYLKTKLLEEVEGSCTGKFGYILCVLDYDNIDIQFNV ---------------------------------2222------------3333------- KYRAVVFKPFKGEVVDGTVVSCSQHGFEVQVGPMKVFVTKHLMPQDLTFNASYQSSEDVI ---------2222---------1111----!!!!----1111------------------ TIKSRIRVKIEGCISQVSSIHAIGSIKEDYLGAI 2222-----------------------2222--- >ANTICOAGULANT PROTEIN A; SWP:Q9DEF9; PDB:1Y17A; DCSSSWSSYEGHCYKAFKQSKTWADAESFCTKQVNGGHLVSIESSGEADFVAHLIAQKIK --2222--iiii----------------3333-2222----------------------- SAKIHVWIGLRAQNKEKQCSIEWSDGSSISYENWIEEESKKCLGVHKATGFRKWENFYCE ----------------------1111---------1111------3333--------111 QRDPFVCEA 1-------- >ELASTASE INHIBITOR; SWP:P16895; PDB:1Y1BA; KPDCPLICTMQYDPVCGSDGITYGNACMLLGASCRSDTPIELVHKGRC -------------------------------3333------------- ------------------------------------------------ >ARSENATE REDUCTASE (ARSC); SWP:O28910; PDB:1Y1LA; KVLFVCIHNTARSVMAEALFNAMAKSWKAESAGVEKAERVDETVKRLLAERGLKAKEKPR ------------------3333--------------------------1111-------- TVDEVNLDDFDLIVTVCEESSCVVLPTDKPVTRWHIENPAGKDEGTYRRVLAEIEERVKK 3333-3333---------------------------------!!!!-------------- 
LVGE ---- >METHIONINE AMINOPEPTIDASE; SWP:O33343; PDB:1Y1NA; RTALSPGVLSPTRPVPNWIARPEYVGKPAAQEGSEPWVQTPEVIEKMRVAGRIAAGALAE ---------------1111--1111----------------------------------- AGKAVAPGVTTDELDRIAHEYLVDNGAYPSTLGYKGFPKSCCTSLNEVICHGIPDSTVIT 3333-----3333---------1111--3333-iiii-------!!!!-----------2 DGDIVNIDVTAYIGGVHGDTNATFPAGDVADEHRLLVDRTREATMRAINTVKPGRALSVI 222---------iiii-------------------------------111122223333- GRVIESYANRFGYNVVRDFTGHGIGTTFHNGLVVLHYDQPAVETIMQPGMTFTIEPMINL --------1111--------------------------3333----2222---------- GALDYEIWDDGWTVVTKDRKWTAQFEHTLLVTDTGVEILTCL -------3333----1111----------------------- >PENICILLIN-BINDING PROTEI; SWP:Q5KXY4; PDB:1Y1OA; TLEDDLNATNEYYRERGIAVIHKKPTPVQIVRVDYPKRSAAVITEAYFRQASTTDYNGVY -------------1111------------------------------------------i RGKYIDFEAKETKNKTAFPLKNFHAHQIRHEQVVAHGGICFAILRFSLLNETYLLDASHL iii--------------------3333------1111---------1111-----3333- IAWWNKQEAGGRKSIPKQEIERHGHSIPLGYQPRLDYISVVDNVYF ----3333-------3333--------------------------- >ALDEHYDE REDUCTASE II; SWP:Q9UUN9; PDB:1Y1PA; AKIDNAVLPEGSLVLVTGANGFVASHVVEQLLEHGYKVRGTARSASKLANLQKRWDAKYP --------2222-----1111----------------------3333-----------22 GRFETAVVEDMLKQGAYDEVIKGAAGVAHIASVVSFSNKYDEVVTPAIGGTLNALRAAAA 22-------1111-11111111----------------3333------------------ TPSVKRFVLTSSTVSALIPKPNVEGIYLDEKSWNLESIDKAKTLPESDPQKSLWVYAASK 3333-------1111----2222-----1111-3333-------3333------------ TEAELAAWKFMDENKPHFTLNAVLPNYTIGTIFDPETQSGSTSGWMMSLFNGEVSPALAL ---------------------------------3333---3333----------3333-- MPPQYYVSAVDIGLLHLGCLVLPQIERRRVYGTAGTFDWNTVLATFRKLYPSKTFPADFP ---------------------1111------------------------1111------- DQGQDLSKFDTAPSLEILKSLGRPGWRSIEESIKDLVGSETA ------------------1111-------------------- >Leishmania Major Homolog ; SWP:Q9N9M3; PDB:1Y1XA; PTSTGVYAPSARHMNDNQELMEWFRAVDTDGSGAISVPELNAALSSAGVPFSLATTEKLL ----1111------11113333-----1111--------------2222----------3 
HMYDKNHSGEITFDEFKDLHHFILSMREGFRKRDSSGDGRLDSNEVRAALLSSGYQVSEQ 3331111----3333------------------1111------------3333------- TFQALMRKFDRQRRGSLGFDDYVELSIFVCRVRNVFAFYDRERTGQVTFTFDTFIGGSVS ---------1111--------------------------1111---------------11 IL 11 >HISTIDINE TRIAD PROTEIN; SWP:O07513; PDB:1Y23A; ENCIFCKIIAGDIPSAKVYEDEHVLAFLDISQVTKGHTLVIPKTHIENVYEFTDELAKQY ----------------------------3333-2222----------3333--------3 FHAVPKIARAIRDEFEPIGLNTLNNNGEKAGQSVFHYHMHIIPRYGKGDGFGAVWKTHAD 333-----------------------1111---------------2222--------111 DYKPEDLQNISSSIAKRLA 1-3333--------3333- >HYPOTHETICAL PROTEIN S086; SWP:Q83LS2; PDB:1Y2IA; QFSTTPTLEGLTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAYEKELRKAREIA -------2222-----------------------1111----33333333---------- FEELGSQARALGADAVVGIDIDYETVGQNGSLVSVSGTAVKTRRNI --------1111--------------1111---------------- >CAMP-SPECIFIC 3',5'-CYCLI; SWP:Q08499; PDB:1Y2KA; TEQEDVLAKELEDVNKWGLHVFRIAELSGNRPLTVIMHTIFQERDLLKTFKIPVDTLITY ----------1111-----------1111------------------------------- LMTLEDHYHADVAYHNNIHAADVVQSTHVLLSTPALEAVFTDLEILAAIFASAIHDVDHP ----11111111----------------33333333-------------------2222- GVSNQFLINTNSELALMYNDSSVLENHHLAVGFKLLQEENCDIFQNLTKKQRQSLRKMVI -------1111------%%%%--------------------1111--------------- DIVLATDMSKHMNLLADLKTMVETKKVTSSGVLLLDNYSDRIQVLQNMVHCADLSNPTKP ------3333-----------------1111-----3333--------------3333-3 LQLYRQWTDRIMEEFFRQGDRERERGMEISPMCDKHNASVEKSQVGFIDYIVHPLWETWA 333-------------------1111---22221111----------------------- DLVHPDAQDILDTLEDNREWYQSTIP ---------------------1111- >PHENYLALANINE AMMONIA-LYA; SWP:P11544; PDB:1Y2MA; ASTNLAVAGSHLPTTQVTQVDIVEKLAAPTDSTLELDGYSLNLGDVVSAARKGRPVRVKD ---3333------1111------------------------------------------- SDEIRSKIDKSVEFLRTEDAISLQKALLEHQLCGVLPSSFDSFRLGRGLENSLPLEVVRG ----------------------------1111------3333-2222-1111-3333--- ATIRVNSLTRGHSAVRLVVLEALTNFLNHGITPIVPLRGTISASGDLSPLSYIAAAISGH ---------------3333------------------------------------11111 
PDSKVHVVHEGKEKILYAREAALFNLEPVVLGPKEGLGLVNGTAVSASATLALHDAHLSL 111-----%%%%----3333-1111------2222------------------------- LSQSLTATVEAVGHAGSFHPFLHDVTRPHPTQIEVAGNIRKLLEGSRFAVHHEEEVDEGI -------3333-------3333--------------------2222----1111------ LRQDRYPLRTSPQWLGPLVSDLIHAHAVLTIEAGQSTTDNPLIDVENKTSHHGGNFQAAA ----3333---3333-----------------------------1111-----1111--- VANTEKTRLGLAQIGKLNFTQLTELNAGNRGLPSCLAAEDPSLSYHCKGLDIAAAAYTSE ----------------------------iiii2222---3333----------------- LGHLANPVTTHVQPAEANQAVNSLALISARRTTESNDVLSLLLATHLYCVLQAIDLRAIE -------1111------------------------------------------------- FEFKKQFGPAIVSLIDQHFGSATGSNLRDELVEKVNKTLAKRLEQTNSYDLVPRWHDAFS ------------------3333-----------------------1111----------- FAAGTVVEVLSSTSLSLAAVNAWKVAAAESAISLTRQVRETFWSAASTSSPALSYLSPRT -------1111----------------------------------33333333---3333 QILYAFVREELGVKARRGDVFLGKQEVTIGSNVSKIYEAIKSGRINNVLLKL --------1111-----3333------------------1111--------- >BAI1-ASSOCIATED PROTEIN 2; SWP:Q9UQB8; PDB:1Y2OA; SLSRSEEHRLTENVYKTIEQFNPSLRNFIAGKNYEKALAGVTYAAKGYFDALVKGELASE -3333----------------------------------------------------111 SQGSKELGDVLFQAEVHRQIQNQLEELKSFHNELLTQLEQKVELDSRYLSAALKKYQTEQ 1--3333----------------------------------------------------- RSKGDALDKCQAELKKLRKKSQGSKNPQKYSDKELQYIDAISNKQGELENYVSDGYKTAL -------------------------3333------------------------------- TEERRRFCFLVEKQCAVAKNSAAYHSKGKELLAQKLPLWQQACADPSKIPERAVQLQQVA --------------------------------------------1111------------ >LECTIN; SWP:Q00022; PDB:1Y2TA; TYTISIRVYQTTPKGFFRPVERTNWKYANGGTWDEVRGEYVLTMGGSGTSGSLRFVSSDT --------------------------%%%%-----iiii--------------------- DESFVATFGVHNYKRWCDIVTNLTNEQTALVINQEYYGVPIRDQARENQLTSYNVANAKG ----------%%%%---------11113333-------------------------1111 RRFAIEYTVTEGDNLKANLIIG ---------------------- >FLUOROACETATE DEHALOGENAS; SWP:Q39CA8; PDB:1Y37A; MFEGFERRLVDVGDVTINCVVGGSGPALLLLHGFPQNLHMWARVAPLLANEYTVVCADLR -2222------!!!!-----------------------1111-33331111-------22 
GYGGSSKPVGAPDHANYSFRAMASDQRELMRTLGFERFHLVGHDRGGRTGHRMALDHPDS 22--------1111------------------------------------------3333 VLSLAVLDIIPTYVMFEEVDRFVARAYWHWYFLQQPAPYPEKVIGADPDTFYEGCLFGWG ---------------11113333-------3333-------------------------3 ATGADGFDPEQLEEYRKQWRDPAAIHGSCCDYRAGGTIDFELDHGDLGRQVQCPALVFSG 333------------------------------------------1111----------1 SAGLMHSLFEMQVVWAPRLANMRFASLPGGHFFVDRFPDDTARILREFLSDARS 1113333--3333-3333-------------3333------------------- >COPPER-TRANSPORTING ATPAS; SWP:Q04656; PDB:1Y3JA; NSSKCYIQVTGMTCASCVANIERNLRREEGIYSILVALMAGKAEVRYNPAVIQPPMIAEF ------------------------------------------------------------ IRELGFGATVIENIEGR -1111------------ >ALGQ1; SWP:Q9KWT6; PDB:1Y3NA; REATWVTEKPLTLKIHMHFRDKWVWDENWPVAREVARLTNVKLVGVANRAATNSQEQFNL -1111--------------------1111------------------------------- MMASGQLPDIVGGDNLKDKFIRYGMEGAFIPLNKLIDQNAPNLKAFFKTHPEVQRAITAP 1111-------------------1111-------------------------------11 DGNIYYLPYVPDGLVSRGYFIRQDWLDKLHLKTPQTVDELYTVLKAFKEKDPNGNGKADE 11------------------------------------------------1111------ IPFINRDPEEVFRLVNFWGARSTGSNTWMDFYVENGKIKHPFAEVAFKDGIKHVAQWYKE ------3333-----------------------%%%%--3333----------------- GLIDPEIFTRKARSREQTFGNNIGGMTHDWFASTALFNDALSKNIPGFKLVPMAPPINSK ---1111---1111----1111---------3333---------2222---------111 GQRWEEDARQIPRPDGWAITATNKNPVETIKLFDFYFGPKGRELSNFGVPGLTYDIKNGK 1------------------1111---------3333------------2222----iiii PVYKDTVLKAAQPVNNQMYDIGAQIPIGFWQDYEYERQWTNDVALQGIDMYIKNKYVLPQ ----3333----------1111-------------------------------------- FTGVNLTVEEREIYDKYWPDVKTYMFEMGQSWVMGTKDPEKTWNDYQQQLKNRGFYQVMI ------3333---------------------------3333------------------- VMQKAYDRQY ---------- >HYPOTHETICAL PROTEIN YXAG; SWP:P42106; PDB:1Y3TA; CTHSLPKEKMPYLLRSGEGERYLFGRQVATVMANGRSTGDLFEIVLLSGGKGDAFPLHVH --------------2222-----!!!!------3333------------2222------- KDTHEGILVLDGKLELTLDGERYLLISGDYANIPAGTPHSYRMQSHRTRLVSYTMKGNVA 
-----------------iiii----2222----2222----------------------- HLYSVIGNPYDHAEHPPYASEEVSNERFAEAAAVATIVFLDEAKPACSAKLAELTELPDG 3333-----------------------3333----------------------------- AVPYVLESGEGDRLLTGDQLHRIVAAQKNTDGQFIVVSSEGPKGDRIVDHYHEYHTETFY ------2222-----!!!!------3333iiii--------------------------- CLEGQMTMWTDGQEIQLNPGDFLHVPANTVHSYRLDSHYTKMVGVLVPGLFEPFFRTLGD -----------------2222----2222---------------------3333------ PYEGHIFPCKPQALRFDRILQNIEALDLKV --------------3333------------ >Tyrosyl-tRNA synthetase, ; SWP:P12063; PDB:1Y42X; PKYTAKINEAEENWQARAEAIKKGKKQNTWDLFEERGYVKDTAGTKEHIAELMRTRRIGA ------------------------------------------------------------ YVGIDPTAPSLHVGHLLPLMPLFWMYLEGYKAFTLIGGSTAKIGDPTGDATMNMTKIHYQ ----------------3333------------------3333------------------ LKKLWENVDTQMRARGYEADWARKRGIVNNNHWWNKQPMLEVLRRVGHALRIGPMLSRDT ------------1111---1111----------------------1111--3333--333 VKNKMTQGDGVSFAEFTYPIMQGWDWFELFYQQGVQMQIGGSDQYGNIISGLEVVKAARE 3---------------------------------------1111--------------11 SEPDPQERKYVTPKTALDECVGFTVPLLTDSSGAKFGKSAGNAIWLDPYQTSVFDFYGYF 11-3333-------1111-----------1111-2222--------3333---------1 VRRSDQEVENLLKLFTFMPISEITKTMEEHIKDPSKRVAQHTLAREVVTLVHGKQEASAA 1113333-------------------------3333------------------------ EDQHRMMYTG ---------- -------------------------------- >Aspergillopepsin-2 [Precu; SWP:P24665; PDB:1Y43B; EEYCASAWVGIDGDTCETAILQTGVDFCYEDGQTSYDAWYEWYPDYAYDFSDITISEGDS -------------------------------------------------------2222- IKVTVEATSKSSGSATVENLTTGQSVTHTFSGNVEGDLCETNAEWIVEDFESGDSLVAFA -------------------1111----------------------------!!!!----- DFGSVTFTNAEATSGGSTVGPSDATVMDIEQDGSVLTETSVSGDSVTVTYV -------------iiii---1111------%%%%----------------- >RIBONUCLEASE Z; SWP:P54548; PDB:1Y44A; ELLFLGTGAGIPAKARNVTSVALKLLEERRSVWLFDCGEATQHQLHTTIKPRKIEKIFIT ------------3333---------------------2222--------3333------- HHGDHVYGLPGLLGSRSFQGGEDELTVYGPKGIKAFIETSLAVTKTHLTYPLAIQEIEEG 
-11111111-------1111---------------------1111--------------- IVFEDDQFIVTAVSVIHGVEAFGYRVQEKDVPGSLLEPPKKGRSVVFSGDTRVSDKLKEL ----1111---------------------------------------------------- ARDCDVVHEATFAKEDRKLAYDYYHSTTEQAAVTAKEARAKQLILTHISARYQGDASLEL 2222--------1111-------------------1111---------1111--3333-- QKEAVDVFPNSVAAYDFLEVNVPRG ---3333-------2222------- >DUEFERRI (DF2); SWP:NA; PDB:1Y47A; DYLRELYKLEQQAMKLYREASERVGDPVLAKILEDEEKHIEWLETI 3333---------------------3333----------------- >Maltose binding protein f; SWP:P02928; PDB:1Y4CA; EGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWA --------1111--------------------------3333-----1111--------3 HDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLP 333----1111-------33333333-----1111iiii--------------------- NPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGV ----3333--------1111---------3333-----1111------iiii-------- DNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYG ---------------------1111--------1111-------3333------------ VTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVA ------iiii-------------1111--------------------------------- LKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKD 3333---1111-----------------------------------------3333---- AQTNSSSLGIEGRSSEELLKIALQEAQKTLQQAQELAKKGGGEEQLKRALKRADRNLWAA ----------------------------------1111---------------------- QELAKKGGGGEELLKQALQQAQQLLRQAQELAKKGGGEELLKQALQQAQQLLQQAQELAK ---3333----------------------------------------------------- >CYSTEINE PROTEASE; SWP:P0C1S6; PDB:1Y4HA; QVQYENTLKNFKIREQQFDNSWCAGFSMAALLNATKNTDTYNAHDIMRTLYPEVSEQDLP ---------------------------------1111-------------1111333311 NCATFPNQMIEYGKSQGRDIHYQEGVPSYNQVDQLTKDNVGIMILAQSVSQNPNDPHLGH 11-----------1111------------------1111------------1111----- ALAVVGNAKINDQEKLIYWNPWDTELSIQDADSSLLHLSFNRDYNWYGSMIGY ---------iiii------3333------1111-----%%%%----------- >METHIONINE GAMMA-LYASE; SWP:Q84AR1; PDB:1Y4IA; 
SDCRTYGFNTQIVHAGQQPDPSTGALSTPIFQTSTFVFDSAEQGAARFALEESGYIYTRL -1111--------2222----------------------------------------111 GNPTTDALEKKLAVLERGEAGLATASGISAITTTLLTLCQQGDHIVSASAIYGCTHAFLS 1--------------------------------------2222----------------- HSMPKFGINVRFVDAGKPEEIRAAMRPETKVVYIETPANPTLSLVDIETVAGIAHQQGAL -------------3333----11111111-------------------------1111-- LVVDNTFMSPYCQQPLQLGADIVVHSVTYINGHGDVIGGIIVGKQEFIDQARFVGLKDIT ---------11113333------------------------------------------- GGCMSPFNAWLTLRGVKTLGIRMERHCENALKIARFLEGHPSITRVYYPGLSSHPQYELG ---------------------------------------1111----1111--------- QRQMSLPGGIISFEIAGGLEAGRRMINSVELCLLAVSLGDTETLIQHPASMTHSPVAPEE --------------2222-------1111-----------------3333------3333 RLKAGITDGLIRLSVGLEDPEDIINDLEHAIRKATF ------1111--------3333-------------- >SULFATASE MODIFYING FACTO; SWP:Q8NBJ7; PDB:1Y4JA; TSMVQLQGGRFLMGTNSPDSRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA ----------------1111iiii---------------------------------333 EMFGWSFVFEDFVSDELRNKAQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVS 3-------3333-3333---------!!!!------3333--2222-1111--------- WNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQPNRTNLWQGKFPKGDKAED --------1111-----------3333------1111--------------------111 GFHGVSPVNAFPAQNNYGLYDLLGNVWEWTASPYQAAEQDMRVLRGASWIDTADGSANHR 1-----1111----1111-----------------3333--------1111--------- ARVTTRMGNTPDSASDNLGFRCAADA -1111----1111------------- >PHOSPHOLIPASE A2 HOMOLOG ; SWP:P24605; PDB:1Y4LA; SLFELGKMILQETGKNPAKSYGAYGCNCGVLGRGKPKDATDRCCYVHKCCYKKLTGCNPK ---------------3333------------------3333----------------333 KDRYSYSWKDKTIVCGENNSCLKELCECDKAVAICLRENLNTYNKKYRYYLKPLCKKADA 3-------%%%%--------------------------3333-3333---3333------ C - >HERV-FRD_6p24.1 provirus ; SWP:P60508; PDB:1Y4MA; NIDTMAKALTTMQEQIDSLAAVVLQNRRGLDMLTAAQGGICLALDEKCCFWVN ---------------------------------3333----1111-------- >putative iron-uptake ABC ; SWP:Q9PIV4; PDB:1Y4TA; SELNIYSARHYNADFEIIKKFEEKTGIKVNHTQAKASELIKRLSLEGSNSPADIFITADI 
----------3333-------------------------------!!!!----------- SNLTEAKNLGLLSPVSSKYLEEFIPAHLRDKDKEWFAITKRARIIAYNKNTNIDISKMKN ------1111--------------1111-1111--------------1111---1111-3 YEDLAKAEFKGEIVMRSATAPYSKTLLASIIANDGNKEAKAWAKGVLENLATNPKGGDRD 333--3333-------1111--------------------------1111------3333 QARQVFAGEAKFAVMNTYYIGLLKNSKNPKDVEVGNSLGIIFPNQDNRGTHINISGIAMT ---------------3333---1111--------1111---------------------1 KSSKNQDAAKKFMEFMLSPEIQKILTDSNYEFPIRNDVELSQTVKDFGTFKEDQIPVSKI 111-------------------------------------33333333-------3333- AENIKEAVKIYDEVGFR ----------------- >EXO-INULINASE; SWP:Q96TU3; PDB:1Y4WA; FNYDQPYRGQYHFSPQKNWMNDPNGLLYHNGTYHLFFQYNPGGIEWGNISWGHAISEDLT ----2222--------------------iiii---------------------------- HWEEKPVALLARGFGSDVTEMYFSGSAVADVNNTSGFGKDGKTPLVAMYTSYYPVAQTLP ----------2222---------------1111-------------------------33 SGQTVQEDQQSQSIAYSLDDGLTWTTYDAANPVIPNPPSPYEAEYQNFRDPFVFWHDESQ 33---2222---------iiii-------------------1111-----------1111 KWVVVTSIAELHKLAIYTSDNLKDWKLVSEFGPYNAQGGVWECPGLVKLPLDSGNSTKWV -------3333------------------------------------------------- ITSGLNPGGPPGTVGSGTQYFVGEFDGTTFTPDADTVYPGNSTANWMDWGPDFYAAAGYN ---------2222-------------------1111------------------------ GLSLNDHVHIGWMNNWQYGANIPTYPWRSAMAIPRHMALKTIGSKATLVQQPQEAWSSIS --1111--------33331111-------------------%%%%---------3333-- NKRPIYSRTFKTLSEGSTNTTTTGETFKVDLSFSAKSKASTFAIALRASANFTEQTLVGY ---------------------------------1111-----------1111-------- DFAKQQIFLDRTHSGDVSFDETFASVYHGPLTPDSTGVVKLSIFVDRSSVEVFGGQGETT -1111--------------1111----------1111----------------!!!!--- LTAQIFPSSDAVHARLASTGGTTEDVRADIYKIASTW -------1111-------------------------- >PHOSPHOCARRIER PROTEIN HP; SWP:P42013; PDB:1Y51A; AEKTFKVVSDSGIHARPATILVQTASKWNSEIQLEYNGKTVNLKSIMGVMSLGIPKGATI --------1111------------3333-------iiii--1111---------2222-- KITAEGADAAEAMAALTDTLAKEGLAE -----1111------------------ >Avidin-related protein 4/; SWP:P56734; PDB:1Y55X; 
KCSLTGKWTNNLGSIMTIRAVNSRGEFTGTYLTAVADNPGNITLSPLLGIQHKRASQPTF ---------1111--------1111------------3333------------------- GFTVHWNFSESTTVFTGQCFIDRNGKEVLKTMWLLRSSVNDISYDWKATRVGYNNFTRLS ---------------------1111---------------33331111------------ >HYPOTHETICAL PROTEIN PH13; SWP:O59088; PDB:1Y56A; MRPLDLTEKRGKKVTIYFEGKELEAYEGEKLPVALLANEIYWLTTSNEGRKRGAFTFGPV -1111--------------------22223333--1111------1111----------- PMTVNGVKGLEARRIKVKDGMKIERQGYYDFHEEEIERVVVDVAIIGGGPAGIGAALELQ ----------3333---2222--------------------------------------- QYLTVALIEERGWLGGDMWLKGIKQEGFNKDSRKVVEELVGKLNENTKIYLETSALGVFD ---------------3333--------------------11113333------------- KGEYFLVPVVRGDKLIEILAKRVVLATGAIDSTMLFENNDMPGVFRRDFALEVMNVWEVA -------------------------------------1111------------------- PGRKVAVTGSKADEVIQELERWGIDYVHIPNVKRVEGNEKVERVIDMNNHEYKVDALIFA ---------------------------------------------1111----------- DGRRPDINPITQAGGKLRFRRGYYSPVLDEYHRIKDGIYVAGSAVSIKPHYANYLEGKLV -------------------iiii-----1111--2222---3333---3333-------- GAYILKEFGYDAQPCIYEEKLREYEPESLSIPRIPLDKFNLEDVQICGCDVSLKKVDEVI -----1111---3333------------------3333-3333----------------- RKGITDLQIIKRLTHLAMGFCQGRYCLFNGAVVVSQRTGKKLSEIDLPVARSPIKNVKMG --------------222211111111--------------1111--------------33 ILAR 33-- >Dye-linked L-proline dehy; SWP:Q5R1N3; PDB:1Y56B; LPEKSEIVVIGGGIVGVTIAHELAKRGEEVTVIEKRFIGSGSTFRCGTGIRQQFNDEANV -----------------------1111---------22223333---------------- RVMKRSVELWKKYSEEYGFSFKQTGYLFLLYDDEEVKTFKRNIEIQNKFGVPTKLITPEE --------------1111----------------------------1111---------- AKEIVPLLDISEVIAASWNPTDGKADPFEATTAFAVKAKEYGAKLLEYTEVKGFLIENNE ----1111---------------------------------------------------- IKGVKTNKGIIKTGIVVNATNAWANLINAMAGIKTKIPIEPYKHQAVITQPIKRGTINPM --------------------1111----1111--------------------2222---- VISFKYGHAYLTQTFHGGIIGGIGYEIGPTYDLTPTYEFLREVSYYFTKIIPALKNLLIL -----%%%%----3333---------------------------------3333------ 
RTWAGYYAKTPDSNPAIGRIEELNDYYIAAGFSGHGFMMAPAVGEMVAELITKGKTKLPV ---------1111------1111--------iiii3333--------------------- EWYDPYRFERGELR ---11113333--- >MOLYBDENUM COFACTOR BIOSY; SWP:Q816R0; PDB:1Y5EA; KEVRCKIVTISDTRTEETDKSGQLLHELLKEAGHKVTSYEIVKDDKESIQQAVLAGYHKE --------------3333-----------1111--------------------------- DVDVVLTNGGTGITKRDVTIEAVSALLDKEIVGFGELFRISYLEDIGSSALSRAIGGTIG -------------1111------1111------------------!!!!---------!! RKVVFSPGSSGAVRLANKLILPELGHITFELHR !!-----------------3333------1111 >HYPOTHETICAL PROTEIN RV26; SWP:O06186; PDB:1Y5HA; TTARDINAGVTCVGEHETLTAAAQYREHDIGALPICGDDDRLHGLTDRDIVIKGLAAGLD -3333--------1111--------1111--------%%%%------------3333--1 PNTATAGELARDSIYYVDANASIQELNVEEHQVRRVPVISEHRLVGIVTEADIARHLP 111-33333333-----11113333---1111-------iiii-----3333------ >CORTICOSTEROID 11-BETA-DE; SWP:P50172; PDB:1Y5MA; EEFRPEMLQGKKVIVTGASKGIGREMAYHLSKMGAHVVLTARSEEGLQKVVSRCLELGAA ---33332222-------------------------------------------1111-- SAHYIAGTMEDMTFAEQFIVKAGKLMGGLDMLILNHITQTSLSLFHDDIHSVRRVMEVNF -----------------------------------------------3333--------- LSYVVMSTAALPMLKQSNGSIAVISSLAGKMTQPMIAPYSASKFALDGFFSTIRTELYIT -------------------------1111----------------------------111 KVNVSITLCVLGLIDTETAMKEISGIINAQASPKEECALEIIKGTALRKSEVYYDKSPLT 1--------------3333---2222------------------1111--------1111 PILLGNPGRKIMEFFSLRYYNKDMF 1111------------11111111- >RNA polymerase II transcr; SWP:P32776; PDB:1Y5OA; PSHSGAAIFEKVSGIIAINEDVSPAELTWRSTDGDKVHTVVLSTIDKLQATPASSEKMML 3333----%%%%------------------3333------3333-------1111----- RLIGKVDESKKRKDNEGNEVVPKPQRHMFSFNNRTVMDNIKMTLQQIISRYKDAD -------1111---------------------------------------3333- >FORMALDEHYDE-ACTIVATING E; SWP:Q9FA38; PDB:1Y60A; AKITKVQVGEALVGDGNEVAHIDLIIGPRGSPAETAFCNGLVNNKHGFTSLLAVIAPNLP ---------------1111--------2222--------1111-2222-------2222- CKPNTLMFNKVTINDARQAVQMFGPAQHGVAMAVQDAVAEGIIPADEADDLYVLVGVFIH -------------------------------------------1111------------1 
WEAADDAKIQKYNYEATKLSIQRAVNGEPKASVVTEQRKSATHPFAAN 111--------------------1111---------3333--1111-- >CONKUNITZIN-S1; SWP:P0C1X2; PDB:1Y62A; RPSLCDLPADSGSGTKAEKRIYYNSARKQCLRFDYTGQGGNENNFRRTYDCQRTCL ---1111-----------------1111---------------------------- >LMAJ004144AAA PROTEIN; SWP:Q4Q7A6; PDB:1Y63A; EQPKGINILITGTPGTGKTSMAEMIAAELDGFQHLEVGKLVKENHFYTETHIIEEKDEDR ------------2222-------------------3333--------------------- LLDFMEPIMVSRGNHVVDYHSSELFPERWFHMVVVLHTSTEVLFERLTKRQYSEAKRAEN ---------------------33333333------------------------------- MEAEIQCICEEEARDAYEDDIVLVRENDTLEQMAATVEEIRERVEVLK -----------------3333--------------------------- >ENGRAILED HOMEODOMAIN; SWP:NA; PDB:1Y66A; QWSEEVERKLKEFVRRHQEITQETLHEYAQKLGLNQQAIEQFFREFEQRK -3333-----------------------------------------1111 >MANGANESE SUPEROXIDE DISM; SWP:Q9RUV2; PDB:1Y67A; AAYTLPQLPYAYDALEPHIDARTMEIHHTKHHQTYVDNANKALEGTEFADLPVEQLIQQL ----------1111---------------------------------1111333311113 DRVPADKKGALRNNAGGHANHSMFWQIMGQGQGANQPSGELLDAINSAFGSFDAFKQKFE 3333333-----------------1111-------------------------------- DAAKTRFGSGWAWLVVKDGKLDVVSTANQDNPLMGEAIAGVSGTPILGVDVWEHAYYLNY ----------------iiii------!!!!---------------------3333----! 
QNRRPDYLAAFWNVVNWDEVSKRYAAAKLV !!!------3333------------1111- >23S RIBOSOMAL RNA; SWP:P16174; PDB:1Y698; MISDIRKDAEVRMDKCVEAFKTQISKIRTGGGGTEERRKDLTKIVRGEAEQARVAVRNVR 1111--------------33333333-----------3333------------------- RDANDKVKALLKDKEISEDDDRRSQDDVQKLTDAAIKKIEAALADKEAELMQF ------1111--------------------------------------1111- >PHOSPHORELAY PROTEIN LUXU; SWP:Q9ZBB6; PDB:1Y6DA; MNTDVLNQQKIEELSAEIGSDNVPVLLDIFLGEMDSYIGTLTELQGSEQLLYLKEISHAL --------------!!!!-----------------------------3333--------- KSSAASFGADRLCERAIAIDKKAKANQLQEQGMETSEMLALLHITRDAYRSWTN ----------3333--------------3333----3333-------------- >PEPTIDE DEFORMYLASE; SWP:Q93LE9; PDB:1Y6HA; SVRKILRMGDPILRKISEPVTEDEIQTKEFKKLIRDMFDTMRHAEGVGLAAPQIGILKQI ---------3333-------1111-----------------1111----3333------- VVVGSEDNERYPGTPDVPERIILNPVITPLTKDTSGFWEGCLSVPGMRGYVERPNQIRMQ -------1111-----------------------------1111---------------- WMDEKGNQFDETIDGYKAIVYQHECDHLQGILYVDRLKDTKLFGFNETLDSSHNVLD --1111--------------------1111-3333---3333--------------- >MG-CHELATASE COFACTOR GUN; SWP:P72583; PDB:1Y6IA; MSDNLTELSQQLHDASEKKQLTAIAALAEMGEGGQGILLDYLAKNVPLEKPVLAVGNVYQ -----------------------------------------1111------3333----- TLRNLEQETITTQLQRNYPTGIFPLQSAQGIDYLPLQEALGSQDFETADEITRDKLCELA ---------------------------------------1111----------------- GPGASQRQWLYFTEVEKFPALDLHTINALWWLHSNGNFGFSVQRRLWLASGKEFTKLWPK ---2222---33331111---------------%%%%---------------3333---- IGWKSGNVWTRWPKGFTWDLSAPQGHLPLLNQLRGVRVAESLYRHPVWSQYGW ---2222---------------2222-----1111------------------ >L-LACTATE DEHYDROGENASE; SWP:Q4CDK5; PDB:1Y6JA; RSKVAIIGAGFVGASAAFTMALRQTANELVLIDVFAIGEAMDINHGLPFMGQMSLYDYSD ---------3333-----------------------3333--3333-------------- VKDCDVIVVTAGATRLDLAKKNVMIAKEVTQNIMKYYNHGVILVVSNPVDIITYMIQKWS 2222---------3333---------------3333-----------3333------333 GLPVGKVIGSGTVLDSIRFRYLLSEKLGVDVKNVHGYIIGEHGDSQLPLWSCTHIAGKNI 3-1111-----------------------3333---------1111--3333--iiii-- 
NEYDKKKIAEDVKTAGATIIKNKGATYYGIAVSINTIVETLLKNQNTIRTVGTVINGMYG ------3333-----------------------------------------------iii IEDVAISLPSIVNSEGVQEVLQFNLTPEEEEALRFSAEQVKKVLNEVKN i-----------------------------------------3333--- >Interleukin-10 receptor a; SWP:Q13651; PDB:1Y6KR; GTELPSPPSVWFEAEFFHHILHWTPIPQQSESTCYEVALLRYGIESWNSISQCSQTLSYD -----------------------------1111--------------------------- LTAVTLDLYHSNGYRARVRAVDGSRHSQWTVTNTRFSVDEVTLTVGSVNLEIHNGFILGK -1111-1111-----------!!!!-----------3333-------------------- IQLPRPKMAPAQDTYESIFSHFREYEIAIRKVPGQFTFTHKKVKHEQFSLLTSGEVGEFC ---------33333333------------------------------------------- VQVKPSVASRSNKGMWSKEECISLT ------1111--------------- >UBIQUITIN-CONJUGATING ENZ; SWP:Q96LR5; PDB:1Y6LA; STSAKRIQKELAEITLDPPPNCSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITFSPD 3333--------------2222-------1111-------2222-2222--------111 YPFKPPKVTFRTRIYHCNINSQGVICLDILKDNWSPALTISKVLLSICSLLTDCNPADPL 1--------------11111111---333311113333----------------1111-- VGSIATQYMTNRAEHDRMARQWTKRYAT ---------------------------- >EXCISIONASE FROM TRANSPOS; SWP:Q79DA1; PDB:1Y6UA; IPIWERYTLTIEEASKYFRIGENKLRRLAEENKNANWLIMNGNRIQIKRKQFEKIIDTL -1111---------------3333----3333-------------------33333333 >ALKALINE PHOSPHATASE; SWP:P00634; PDB:1Y6VA; TPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEITA --------------1111-1111----------1111------------2222------- ARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGAL ------1111---1111-----------------------3333-----------2222- GVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPG --1111----------1111---------11113333-------1111---------333 NALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSDA 31111--------------------------------1111--------1111------- ASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLA --3333---3333----------------------1111---------33331111---- QMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEG ------------3333-------------------------------------------- 
NTLVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIA -----------------1111----------1111------------------------- AYGPHAANVVGLTDQTDLFYTMKAALGLK --2222-------3333------------ >PHOSPHORIBOSYL-ATP PYROPH; SWP:P0A5B1; PDB:1Y6XA; VKTFEDLFAELGDRARTRPADSTTVAALDGGVHALGKKLLEEAGEVWLAAEHESNDALAE --3333------------1111-------------------------------------- EISQLLYWTQVLISRGLSLDDVYRKL ------------1111-33333333- >HEAT SHOCK PROTEIN, PUTAT; SWP:Q8IL32; PDB:1Y6ZA; PIWKQDEKSLTENDYYSFYKNTFKAYDDPLAYVHFNVEGQISFNSILYIPGSLPWELSKN 1111-------------------------------------------------3333--- MFRGIRLYVKRVFINDKFSESIPRWLTFLRGIVDSENSKMLSIINKRIVLKSISMMKGLK --------iiii----3333--3333---------------------------------- ETGGDKWTKFLNTFGKYLKIGVVEDKENQEEIASLVEFYSINSGDKKTDLDSYIENMKED ------------------------3333----1111---1111----------1111111 QKCIYYISGENKKTAQNSPSLEKLKALNYDVLFSLEPIDEFCLSSLTVNKYKGYEVLDVN 1-----------------------1111---------------3333---iiii---111 KA 1- >KINASE-ASSOCIATED PROTEIN; SWP:NA; PDB:1Y71A; TFEIGEIVTGIYKTGKYIGEVTNSRPGSYVVKVLAVLKHPVQERRALAFREQTNIPEQVK --2222-----iiii---------2222-------------------2222--------- KYEGEIPDYTESLKLALETQNSFSEDDSPFAERSLETLQQLKKDYKL ----------------------2222--------------------- >LIN 7 HOMOLOG B; SWP:O88951; PDB:1Y74A; LGLERDVSRAVELLERLQRSGELPPQKLQALQRVLQSRFCSAIREVYEQLYDTLDIT -3333---------------------------------------------------- -------------------------------------------------- >PHOSPHOLIPASE A2 ISOFORM ; SWP:Q5G291; PDB:1Y75A; NTYQFRNMIQCTVPSRSWWDFADYGCYCGCGSGTPVDDLDRCCQVHCNCYRQAGEISGCR ------------------1111---------------------------------22223 PKFKTYTYECSGGTLTCKGDNNACAASSCDCDRLAAICFAGAPYNDNNYNIDLKARCN 333-----------------------------------1111--3333---3333--- >Phospholipase A2 isoform ; SWP:Q5G290; PDB:1Y75B; NIKQFNNMIQCTVPARSWWDFADYGCYCGSGSGSPVDDLDRCCQVHDNCYNAGGGVTGCA ----------------3333--------------------------------3333---- PKSKTYTYECSQGTLTCSGENSACAATVCDCDRLAAICFAGAPYNDNNYNIDLKSRCQ 
---------------------3333-------------------3333---3333--- >PROTEIN ASSOCIATED TO TIG; SWP:O55164; PDB:1Y76A; NPAAEKMQVLQVLDRLRGKLQEKGDTTQNEKLSAFYETLKSPLFNQILTLQQSIKQLKGQ --------------------------3333------------------------------ LS -- ------------------------------------------------------------ >Peptidyl-dipeptidase dcp; SWP:P24171; PDB:1Y791; TTMNPFLVQSTLPYLAPHFDQIANHHYRPAFDEGMQQKRAEIAAIALNPQMPDFNNTILA ---3333-----%%%%-1111-3333---------------------------1111--- LEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSAELAELANDIYLNGELFARVDAVWQ 1111-------------------------------------------3333--------- RRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQFNQRLLAANK 1111---------------------1111------------------------------- SGGLVVNDIAQLAGMSEQEIALAAEAAREKGLDNKWLIPLLNTTQQPALAEMRDRATREK -------3333----------------1111--------------3333----3333--- LFIAGWTRAEKNDANDTRAIIQRLVEIRAQQATLLGFPHYAAWKIADQMAKTPEAALNFM ------1111--1111----------------1111--------1111------------ REIVPAARQRASDELASIQAVIDKQQGGFSAQPWDWAFYAEQVRREKFDLDEAQLKPYFE ----------------------1111-----1111---------------33333333-- LNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEIFDHNGVGLALFYGDFFARDSK ------------------------------1111------1111------------1111 SGGAWMGNFVEQSTLNKTHPVIYNVCNYQKPAAGEPALLLWDDVITLFHEFGHTLHGLFA -------------------------------2222----3333----------------- RQRYATLSGTNTPRDFVEFPSQINEHWATHPQVFARYARHYQSGAAMPDELQQKMRNASL ---3333!!!!-1111---------------------------------------1111- FNKGYEMSELLSAALLDMRWHCLEENEAMQDVDDFELRALVAENMDLPAIPPRYRSSYFA ------------------1111-3333-------------1111--1111----3333-- HIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENGLRFREAILSRGNSEDLERLYR ----------------------------1111-------------1111----------- QWRGKAPKIMPMLQHRGLNI -------------1111--- >BETA-XYLOSIDASE, FAMILY 4; SWP:NA; PDB:1Y7BA; SLIKNPILRGFNPDPSICRADTDYYIATSTFEWFPGVQIHHSKDLVNWHLVAHPLNRTSL -------------------!!!!------!!!!-----------------------3333 LDMKGNPNSGGIWAPDLSYHDGKFWLIYTDVKVTDGMWKDCHNYLTTCESVDGVWSDPIT 
--22222222---------iiii--------------------------1111------- LNGSGFDASLFHDNDGKKYLVNMYWDQRTYNHNFYGIVLQEYSDKEKKLIGKAKIIYKGT ------------1111-----------1111------------1111------------3 DIKYTEGPHIYHIGDYYYLFTAEGGTTYEHSETVARSKNIDGPYEIDPEYPLLTSWHDPR 333---------!!!!----------1111--------1111----1111----1111-- NSLQKCGHASLVHTHTDEWYLAHLVGRPLPVGNQPVLEQRGYCPLGRETSIQRIEWVDNW -------------1111-----------------3333----1111----------%%%% PRVVGGKQGSVNVEAPKIPEVKWEKTYDEKDNFDSDKLNINFQSLRIPLTENIASLKAKK --2222--------------------------------1111-------1111-----22 GNLRLYGKESLTSTFTQAFIARRWQSFKFDASTSVSFSPDTFQQAAGLTCYYNTENWSTI 22-------1111---------------------------1111--------1111---- QVTWNEDKGRVIDIVCCDNFHFDMPLKSNVIPIPKDVEYIHLKVEVRVETYQYSYSFDGI -----------------iiii--3333------1111---------!!!!---------- NWSKVPAIFESRKLSDDYVQGGGFFTGAFVGINCIDITGNNKPADFDYFCYKEE ---------3333-3333-----------------3333--------------- >PROBABLE M18-FAMILY AMINO; SWP:Q45055; PDB:1Y7EA; QNPWIYLNEEEKNQILNFSESYKKFISKFKTEREVTAYALDKAKKLGFINAEEKKNLPGD -3333----3333----------------------------------------------- KIFYTCREKSVAFAIIGKNPIEDGNFIVSHTDSPRLDAKPSPISEENELTFIKTNYYGGI -------------------3333----------------------%%%%----------- KKYQWLSTPLSIRGVVFLKNGEKVEINIGDNENDPVFVIPDILNLKILIGSLPIETKEKN -1111-------------------------2222-------------------------3 KVKLATLQLIKEKYKIEEEDFVSSEIEIVPAGTAKDVGFDKALIGAYGQDDKICVFTSLE 333---------------3333---------------1111-------2222-------- SIFDLEETPNKTAICFLVDKEEIDSRYLEYFVSDIFKIKKSEYNNLHVQKALWNSKSISA ------------------3333-------------333311113333------------- DVCAAINPEQNAPQLGYGIPIKYTDAELVSYIRQLLNKNNIAWQVATLGKGGTVAKFLAG -------------2222-------3333----------------------3333--3333 YGIRTIDGPAVISHSPEITSKFDLYNAYLAYKAFYRE -------------------------------3333-- ---------------------------------------- >O-ACETYLSERINE SULFHYDRYL; SWP:P45040; PDB:1Y7LA; AIYADNSYSIGNTPLVRLKHFGHNGNVVVKIEGRNPSYSVCRIGANMVWQAEKDGTLTKG ----3333--------------%%%%----11112222---------------------- 
KEIVDATSGNTGIALAYVAAARGYKITLTMPETMSLERKRLLCGLGVNLVLTEGAKGMKG ------------------------------1111--------1111------3333---- AIAKAEEIVASDPSRYVMLKQFENPANPQIHRETTGPEIWKDTDGKVDVVVAGVGTGGSI -----------1111----1111-----------------1111---------------- TGISRAIKLDFGKQITSVAVEPVESPVISQTLAGEEVKPGPHKIQGIGAGFIPKNLDLSI ---------------------3333-----1111------------------11113333 IDRVETVDSDTALATARRLMAEEGILAGISSGAAVAAADRLAKLPEFADKLIVVILPSAS -------------------------------------------3333------------- ERYLSTALF ----3333- >HYPOTHETICAL PROTEIN BSU1; SWP:O34816; PDB:1Y7MA; LTYQVKQGDTLNSIAADFRISTAALLQANPSLQAGLTAGQSIVIPGLPDPYTIPYHIAVS -----2222--------------------1111---2222---1111-1111-------- IGAKTLTLSLNNRVKTYPIAVGKILTQTPTGEFYIINRQRNPGGPFGAYWLSLSAAHYGI 1111-----%%%%---------1111----------------!!!!--------2222-- HGTNNPASIGKAVSKGCIRHNKDVIELASIVPNGTRVTINR ----3333-----------------------2222------ >ATP-DEPENDENT CLP PROTEAS; SWP:P63788; PDB:1Y7OA; IPVVIESYDIYSRLLKDRIILTGPVEDNANSVIAQLLFLDAQDSTKDIYLYVNTPGGSVS ----------------------------------------------------------33 AGLAIVDTNFIKADVQTIVGAASGTVIASSGAKGKRFLPNAEYIHQPAPEHLLKTRNTLE 33------------------------1111-2222--1111------------------- KILAENSGQSEKVHADAERDNWSAQETLEYGFIDEIANN ---------------3333-------------------- >HYPOTHETICAL PROTEIN AF14; SWP:O28869; PDB:1Y7PA; LRGLRIIAENKIGVLRDLTTIIANITFAQTFLIKHGEHEGKALIYFEIEGGDFEKILERV ----------2222-----------------------2222------------------3 KTFDYIIEIEEEESFERVFGKRVIILGGGALVSQVAIGAISEADRHNLRGERISVDTMPV 333----------3333----------------------------3333----------- VGEEEIAEAVKAVSRLHRAEVLVLAGGIMGGKITEEVKKLRKSGIRVISLSMFGSVPDVA ----------3333-2222-------------------3333------------3333-- DVVISDPVMAGTLAVMHISEKAKFDLDRVKGR ------------------3333--1111---- >ZINC FINGER PROTEIN 174; SWP:Q15697; PDB:1Y7QA; GSKNCPDPELCRQSFRRFCYQEVSGPQEALSQLRQLCRQWLQPELHTKEQILELLVMEQF -------------3333--------3333------------------------------- LTILPEEIQARVRHRCLMSSKEIVTLVEDFHRASKKPK 3333--------------3333---------------- 
>HYPOTHETICAL PROTEIN SA21; SWP:Q99RQ6; PDB:1Y7RA; VKVTYDIPTCEDYCALRINAGSPKTREAAEKGLPNALFTVTLYDKDRLIGGRVIGDGGTV -----------------1111----------3333--------!!!!------------- FQIVDIAVLKSYQGQAYGSLIEHIKYIKNVSVESVYVSLIADYPADKLYVKFGFPTEPDS --------3333---3333------3333--2222--------3333--1111------- GGYIKY ------ >MALATE DEHYDROGENASE; SWP:P10584; PDB:1Y7TA; MKAPVRVAVTGAAGQIGYSLLFRIAAGEMLGKDQPVILQLLEIPQAMKALEGVVMELEDC ----------11113333------------1111--------3333-------------- AFPLLAGLEATDDPKVAFKDADYALLVGAAPRKAGMERRDLLQVNGKIFTEQGRALAEVA -1111-------3333-2222-----------2222------------------------ KKDVKVLVVGNPANTNALIAYKNAPGLNPRNFTAMTRLDHNRAKAQLAKKTGTGVDRIRR 1111-----------------------3333----------------------3333--- MTVWGNHSSTMFPDLFHAEVDGRPALELVDMEWYEKVFIPTVAQRGAAIIQARGASSAAS -------1111---1111-----1111--------------------------------- AANAAIEHIRDWALGTPEGDWVSMAVPSQGEYGIPEGIVYSFPVTAKDGAYRVVEGLEIN ----------------2222----------iiii------------iiii---------3 EFARKRMEITAQELLDEMEQVKALGLI 333------------------1111-- >ACYL-COA HYDROLASE; SWP:Q81EE4; PDB:1Y7UA; KGKTANESRVFKTSRVFPTDLNDHNTLFGGKILSEDVASISASRHSRKECVTASDWVDFL ---3333---------3333-1111----------------------------------- HPVRSSDCVSYESFVIWTGRTSEVFVKVVSEYLISGEKRIAATSFVTFVALSKENNPVPV ---1111----------------------------------------------------- PRVIPDTEEEKESHRIAVLRAEQRHIRKAESKKVATLLTF ---------------------------------------- >Halotolerant alpha-type c; SWP:NA; PDB:1Y7WA; NPNDGYDYMQHGFDWPGLQEGGTTKYPACSGSNQSPIDINTNQLMEPSSRSGTSAVSLNG -------11111111----%%%%--3333----------1111--3333--------!!! 
LNVDGAQADGITLTNAKVDLEQGMKVTFDQPAANLPTIEIGGTTKSFVPIQFHFHHFLSE !--3333-------------2222---------------iiii----------------- HTINGIHYPLELHIVMQEQDPADVATAQLAVIGIMYKYSENGDAFLNSLQTQIEGKIGDG --iiii----------------3333---------------------------------- TASYGDTGVSIDNINVKTQLLPSSLKYAGYDGSLTTPGCDERVKWHVFTTPREVTREQMK --2222--------3333------------------------------------3333-- LFVDVTMGAHAGADVVNNRMIQDLGDREVYKYNY ---------1111----------!!!!------- >MAJOR VAULT PROTEIN; SWP:Q14764; PDB:1Y7XA; GSHMQVVLPNTALHLKALLDFEDKDGDKVVAGDEWLFEGPGTYIPRKEVEVVEIIQATII ----------------------3333-------------1111--3333----------- RQNQALRLRARKECWDRDGKERVTGEEWLVTTVGAYLPAVFEEVLDLVDAVIL ----------------------3333--------------------------- >C.AHDI; SWP:Q7X0F0; PDB:1Y7YA; HDHYADLVKFGQRLRELRTAKGLSQETLAFLSGLDRSYVGGVERGQRNVSLVNILKLATA ------------------1111-------------------1111--------------- LDIEPRELF ---3333-- >PREDICTED COBALAMIN BINDI; SWP:Q2RJ67; PDB:1Y80A; MPSVGKIVLGTVKGDLHDIGKNLVAMMLESGGFTVYNLGVDIEPGKFVEAVKKYQPDIVG -----------2222---------------------------3333-------------- MSALLTTTMMNMKSTIDALIAAGLRDRVKVIVGGAPLSQDFADEIGADGYAPDAASATEL ----1111---------------1111------1111----------------------- CRQLL ----- >CONSERVED HYPOTHETICAL PR; SWP:Q8U2V3; PDB:1Y81A; FRKIALVGASKNPAKYGNIILKDLLSKGFEVLPVNPNYDEIEGLKCYRSVRELPKDVDVI -----------11113333-----1111------3333--iiii----3333-1111--- VFVVPPKVGLQVAKEAVEAGFKKLWFQPGAESEEIRRFLEKAGVEYSFGRCIVET ----------------1111------2222------------------------- >HYPOTHETICAL PROTEIN; SWP:Q8U3V0; PDB:1Y82A; PLPPDITFDSLALIKHSQSKKILEITLAKFTVNLSIVTVYRYLTVRAYLKKNIELELDVL ----------------------------------------------1111---------- KDIYNIVPLNEEIAIKAAQIEADLRKGPDIEDVLTAATAIYTKSLLITDDSKRYEPRRFG -------------------------------------------------3333---1111 LDTPLDKFVKEVELVEKEL ---------------3333 >HYPOTHETICAL PROTEIN AF15; SWP:O28724; PDB:1Y88A; NLYFQGHMVARLLEEHGFETKTNVIVQGNCVEQEIDVVAERDGERYMIECKFHNIPVYTG -------------1111-----------------------iiii---------------- 
LKEAMYTYARFLDVEKHGFTQPWIFTNTKFSEEAKKYAGCVGIKLTGWSYPEKEGIEVLL -------------3333---------------------------------2222------ ESKGLYPITILRIDKEVLDELVRAGLVFCRDVVSAGEEKLREIGLSAKKAREVIAEAKKV 1111--3333---3333----------3333----------------------------- IGGS ---- >DEVB PROTEIN; SWP:Q9KL51; PDB:1Y89A; INHKIFPTADAVVKSLADDLAYSQQGQPVHISLSGGSTPKLFKLLASQPYANDIQWKNLH -------------------3333------------3333---------3333--3333-- FWWGDERCVAPDDAESNYGEANALLFSKINPAQNIHRILGENEPQAEAERFAQAAHVIPT ---------1111-----------1111--3333----1111------------------ ENGTPVFDWILLGVGADGHTASLFPGQTDYADANLSVVASHPESGQLRVSKTAKVLQAAK iiii----------1111-----2222-1111-----------------------1111- RISYLVLGAGKAEIVEQIHTTPAEQLPYPAAKIHSTSGVTEWYLDSDAAAKIA -------3333----------3333--3333-------------33331111- >HYPOTHETICAL PROTEIN AF14; SWP:NA; PDB:1Y8AA; HMFFTDWEGPWILTDFALELCMAVFNNARFFSNLSEYDDYLAYEVRREGYEAGYTLKLLT ----------------------------------------------22222222------ PFLAAAGVKNRDVERIAELSAKFVPDAEKAMATLQERWTPVVISTSYTQYLRRTASMIGV ---1111-3333-------------------------------------------1111- RGELHGTEVDFDSIAVPEGLREELLSIIDVIASLSGEELFRKLDELFSRSEVRKIVESVK ---------1111-----------------1111---------------3333--1111- AVGAGEKAKIMRGYCESKGIDFPVVVGDSISDYKMFEAARGLGGVAIAFNGNEYALKHAD ---------------1111---------3333-------1111---------3333---- VVIISPTAMSEAKVIELFMERKERAFEVLSAVSIPETEIYIMENSDFGEVLEKSKRMRVR --------------------!!!!11111111-2222---3333---------------- LRGLAGELGGS --3333----- >S-ADENOSYLMETHIONINE-DEPE; SWP:Q97GJ5; PDB:1Y8CA; NCYNKFAHIYDKLIRADVDYKKWSDFIIEKCVENNLVFDDYLDLACGTGNLTENLCPKFK -------------------------------1111----------!!!!-33333333-- NTWAVDLSQELSEAENKFRSQGLKPRLACQDISNLNINRKFDLITCCLDSTNYIIDSDDL ------------------1111--------3333------------%%%%---------- KKYFKAVSNHLKEGGVFIFDINSYYKLSQVLGNNDFNYDDDEVFYYWENQFEDDLVSYIS -----------2222--------------------------------------------- FFVRDGEFYKRFDEEHEERAYKEEDIEKYLKHGQLNILDKVDCYSNKKVEKFTERITYLV -------------------------------------------------1111------- KLGG ---- 
>UNC-13 HOMOLOG A; SWP:Q62768; PDB:1Y8FA; QHNFEVWTATTPTYCYECEGLLWGIARQGMRCTECGVKCHEKCQDLLNADC -----------------------------------------3333------ >MAP/MICROTUBULE AFFINITY-; SWP:O08679; PDB:1Y8GA; HIGNYRLLKTIGKGNFAKVKLARHILTGKEVAVKIIDKTQLNSSSLQKLFREVRIKVLNH -1111-------------------1111--------3333-------------------1 PNIVKLFEVIETEKTLYLVEYASGGEVFDYLVAHGRKEKEARAKFRQIVSAVQYCHQKFI 111--------1111------1111------------------------------1111- VHRDLKAENLLLDADNIKIALDAFCGAPPYAAPELFQGKKYDGPEVDVWSLGVILYTLVS -----3333------------1111-3333-3333------------------------- GSLPFDGQNLKELRERVLRGKYRIPFYSTDCENLLKKFLILNPSKRGTLEQIKDRWNVGH -----------------------------------------1111--3333-----2222 EDDELKPYVEPLPDYKDPRRTELVSGYTREEIQDSLVGQRYNEVATYLLLGY -----------------3333--------------1111---------1111 >FIS1; SWP:P40515; PDB:1Y8MA; MTKVDFWPTLKDAYEPLYPQQLEILRQQVVSEGGPTATIQSRFNYAWGLIKSTDVNDERL --------3333-----------------111111113333------3333--3333--- GVKILTDIYKEAESRRRECLYYLTIGCYKLGEYSMAKRYVDTLFEHERNNKQVGALKSMV ---------------------------1111----------------------------- EDKIQKETLKGVVVAGGVHHHHHH ------------------------ >Dihydrolipoyllysine-resid; SWP:Q15120; PDB:1Y8OA; PKQIERYSRFSPSPLSIKQFLDFGRDNACEKTSYMFLRKELPVRLANTMREVNLLPDNLL ------1111-----------------------------------------11113333- NRPSVGLVQSWYMQSFLELLEYENKSPEDPQVLDNFLQVLIKVRNRHNDVVPTMAQGVIE -------------------1111---------------------1111------------ YKEKFGFDPFISTNIQYFLDRFYTNRISFRMLINQHTLLFGGDTNPVHPKHIGSIDPTCN --------3333-----------------------------------1111!!!!----- VADVVKDAYETAKMLCEQYYLVAPELEVEEFNAKAPDKPIQVVYVPSHLFHMLFELFKNS -------------------------------3333------------------------- MRATVELYEDRKEGYPAVKTLVTLGKEDLSIKISDLGGGVPLRKIDRLFNYMYSPLFGYG ----------------------------------------3333--------------33 LPISRLYARYFQGDLKLYSMEGVGTDAVIYLKALSSESFERLPVFNKSAWRHYKTTPEAD 33-------------------------------3333------------3333------- DWSNPSSEPRDASK ----------3333 >Dihydrolipoyllysine-resid; SWP:P10515; PDB:1Y8OB; 
SYPPHMQVLLPALSPTMTMGTVQRWEKKVGEKLSEGDLLAEIETDKATIGFEVQEEGYLA -------------1111----------------2222----------------------- KILVPEGTRDVPLGTPLCIIVEKEADISAFADYTDLK ----2222---2222-------33333333------- >UBIQUITIN-LIKE 1 ACTIVATI; SWP:Q9UBE0; PDB:1Y8QA; GISEEEAAQYDRQIRLWGLEAQKRLRASRVLLVGLKGLGAEIAKNLILAGVKGLTMLDHE ---------3333----------------------------------------------- QVTPEDPGAQFLIRTGSVGRNRAEASLERAQNLNPMVDVKVDTEDIEKKPESFFTQFDAV -----11111111-------3333---------3333-------1111-33331111--- CLTCCSRDVIVKVDQICHKNSIKFFTGDVFGYHGYTFANLGEHEFVEEKTETTMVKKKVV -----------------1111--------!!!!--------------------------- FCPVKEALEVDWSSEKAKAALKRTTSDYFLLQVLLKFRTDKGRDPSSDTYEEDSELLLQI --------------------1111---------------------3333----------- RNDVLDSLGISPDLLPEDFVRYCFSEMAPVCAVVGGILAQEIVKALSQRDPPHNNFFFFD -----1111--------------------------------------------------- GMKGNGIVECLGP ------------- >HYPOTHETICAL PROTEIN RV09; SWP:O53896; PDB:1Y8TA; GSVEQVAAKVVPSVVLETDLEEGSGIILSAEGLILTNNHVIAAAAPKTTVTFSDGRTAPF ---------3333---------------3333----33333333---------------- TVVGADPTSDIAVVRVQGVSGLTPISLGSSSDLRVGQPVLAIGSPLGLEGTVTTGIVSAL -----3333-------------------3333-2222-------iiii------------ NRPVSTNTVLDAIQTDAAINPGNSGGALVNNAQLVGVNSAIATLQSGSIGLGFAIPVDQA -------------------2222------------------------------------- KRIADELISTGKASHASLGVQVTNDKDTLGAKIVEVVAGGAAANAGVPKGVVVTKVDDRP ------1111--------------------------22223333---2222----!!!!- INSADALVAAVRSKAPGATVALTFQDPSGGSRTVQVTLGKA --------------2222----------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P61081; PDB:1Y8XA; GSASAAQLRIQKDINELNLPKTCDISFSDPDDLLNFKLVICPDEGFYKSGKFVFSFKVGQ ------------3333---1111-----1111------------1111----------11 GYPHDPPKVKCETVYHPNIDLEGNVCLNILREDWKPVLTINSIIYGLQYLFLEPNPEDPL 11-------------11111111---111111111111--------3333---------- NKEAAEVLQNNRRLFEQNVQRSRGGYIGSTYFERCLK --------------------------!!!!------- >MACROPHAGE METALLOELASTAS; SWP:P39900; PDB:1Y93A; GPVWRKHYITYRINNYTPDMNREDVDYAIRKAFQVWSNVTPLKFSKINTGMADILVVFAR 
----------------3333---------------------------------------- GAHGDDHAFDGKGGILAHAFGPGSGIGGDAHFDEDEFWTTHSGGTNLFLTAVHEIGHSLG -----------------------!!!!-----1111--------------------1111 LGHSSDPKAVMFPTYKYVDINTFRLSADDIRGIQSLYG -----1111---------1111---------------- >SEMINAL RIBONUCLEASE; SWP:P00669; PDB:1Y94A; KESAAAKFERQHMDSSTSAASSSNYCNLMMCCRKMTQGKCKPVNTFVHESLADVKAVCSQ -----------------11111111-----1111--------------------3333-- KKVTCKDGQTNCYQSKSTMRITDCRETGSSKYPNCAYKTTQVEKHIIVACGGKPSVPVHF ----1111---------------------------------------------------- DASV ---- >GEM-ASSOCIATED PROTEIN 6; SWP:Q8WXD5; PDB:1Y96A; MSEWMKKGPLEWQDYIYKEVRVTASEKNEYKGWVLTTDPVSANIVLVNFLEDGSMSVTGI -3333-------3333--------%%%%---------------------1111------- MGHAVQTVETMNEGDHRVREKLMHLF 3333---------------------- >Gem-associated protein 7; SWP:Q9H840; PDB:1Y96B; AQESLESQEQRARAALRERYLRSLLAMVGHQVSFTLHEGVRVAAHFGATDLDVANFYVSQ ------------------------1111--------iiii---------1111------- LQTPIGVQAEALLRCSDIISYTFKP --1111-------3333-------- >THREE PRIME REPAIR EXONUC; SWP:Q9BQ50; PDB:1Y97A; GSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHGALVLPRVLDKLTLC -------------------3333------------3333--------------------- CPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPICLVAHNGFDYDF ---------------------1111---------------1111---------3333--- PLLCAELRRLGARLPRDTVCLDTLPALRGLDRAHGYSLGSLFHRYFRAEPSSAEGDVHTL -------1111------------------------------------------------- LLIFLHRAAELLAWADEQARGWAHIEPYLP --------------------3333------ >NADP-DEPENDENT ALCOHOL DE; SWP:P35630; PDB:1Y9AA; MKGLAMLGIGRIGWIEKKIPECGPLDALVRPLALAPCTSDTHTVWAGAIGDRHDMILGHE -------2222-----------1111---------------------------------- AVGQIVKVGSLVKRLKVGDKVIVPAITPDWGEEESQRGYPMHSGGMLGGWKFSNFKDGVF --------1111---2222---------1111--1111-1111-2222--2222------ SEVFHVNEADANLALLPRDIKPEDAVMLSDMVTTGFHGAELANIKLGDTVCVIGIGPVGL -------3333-----11113333--------------------2222------------ MSVAGANHLGAGRIFAVGSRKHCCDIALEYGATDIINYKNGDIVEQILKATDGKGVDKVV 
--------------------3333---1111-----3333--------1111-------- IAGGVHTFAQAVKMIKPGSDIGNVNYLGEGDNIDIPRSEWGVGMGHKHIHGGLTPGGRVR -----3333--1111--------------------3333-iiii---------------- MEKLASLISTGKLDTSKLITHRFEGLEKVEDALMLMKNKPADLIKPVVRIHYDDEDTLH -------1111---3333---------------------3333--------1111---- >CONSERVED HYPOTHETICAL PR; SWP:Q9K2J6; PDB:1Y9BA; TTLPRITARVDVDTQDLLAKAAALAGSSINSFVLNAAIEKAKQVIEREQALKLSQADAVL ------------------------------------------------------------ LEALDNPAVVNAKLKLASE -3333-------------- >LOW TEMPERATURE REQUIREME; SWP:Q9ZIM5; PDB:1Y9IA; KQSALESKARSWLIERGVEIDDIAELVLFLQQKYHPGLELDICRQNVEHVLRKREVQNAV -------------1111-3333--------33332222---------------------- LTGIQLDVAEKGELVQPLQNIISADEGLYGVDEILALSIVNVYGSIGFTNYGYIDKVKPG --------1111-------------1111--------------1111-----------!! ILAKLNEHDGIAVHTFLDDIVGAIAAAAASRLAHSYHD !!1111---------------------------1111- >SEC1 FAMILY DOMAIN CONTAI; SWP:Q62991; PDB:1Y9JA; ASIRERQTVALKRMLNFNVPHVKNSPGEPVWKVLIYDRFGQDIISPLLSVKELRDMGITL ------------3333--------2222--------3333-------------------- HLLLHSDRDPIRDVPAVYFVMPTEENIDRLCQDLRNQLYESYYLNFISAISRSKLEDIAN ----------1111--------3333---------------------------------- AALAANAVTQVAKVFDQYLN -3333---------3333-- >IAA ACETYLTRANSFERASE; SWP:Q81FK8; PDB:1Y9KA; SVVIERIPKEAIPKSLLLLADPSERQIATYVQRGLTYVAKQGGSVIGVYVLLETRPKTEI -------1111---3333------------1111------iiii----------2222-- NIAVAEHLQGKGIGKKLLRHAVETAKGYGSKLEVGTGNSSVSQLALYQKCGFRIFSIDFD ----3333-----------------1111-------1111-------1111------222 YFSKHYEEEIIENGIVCRDIRLAELN 2----------iiii----------- >LIPOPROTEIN MXIM; SWP:Q06083; PDB:1Y9LA; EKEWHIVPVSKDYFSIPNDLLWSFNTTNKSINVYSKCISGKAVYSFNAGKFMGNFNVKEV ---------3333---1111--------------------------iiii---------2 DGCFMDAQKIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI 222----------------------------------iiii--------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9KQN0; PDB:1Y9QA; TDVFKSQIANQLKNLRKSRGLSLDATAQLTGVSKALGQIERGESSPTIATLWKIASGLEA 
----------------1111------------------1111------------------ SFSAFFANDPQLLSSERSFPDDLNKIHTLFPYAADTGLEIFEITLLDHHQQSSPHALGVI 3333-33333333-----1111-----------1111--------%%%%------2222- EYIHVLEGIKVFFDEQWHELQQGEHIRFFSDQPHGYAAVTEKAVFQNIVAYPR ------------%%%%----2222----------------------------- >PUTATIVE IRON BINDING PRO; SWP:Q7VXW9; PDB:1Y9UA; DEVSLYTTREPKLIQPLLDAFAKDSGIKVNTVFVKDGLLERVRAEGDKSPADVLMTVDIG ---------3333--------------------------------1111----------- NLIDLVNGGVTQKIQSQTLDSVVPANLRGAEGSWYALSLRDRVLYVEKDLKLDSFRYGDL -----1111--------------1111--%%%%-------------1111-----3333- ADPKWKGKVCIRSGQHPYNTALVAAMIAHDGAEATEKWLRGVKANLARKAAGGDRDVARD -3333-------11113333----------------------1111------3333---- ILGGICDIGLANAYYVGHMKNAEPGTDARKWGDAIKVVRPTFAGGTHVNISGAAVAAHAP -----------3333---11112222-----1111--------------------1111- NKANAVKLLEYLVSEPAQTLYAQANYEYPVRAGVKLDAVVASFGPLKVDTLPVAEIAKYR ----------------------1111----------3333-------------------- KQASELVDKVGFDN ----------1111 >ACETYLTRANSFERASE; SWP:Q81CG1; PDB:1Y9WA; MYMKHIENGTRIEGEYIKNKVIQYNMSILTDEVKQPMEEVSLVVKNEEGKIFGGVTGTMY -------------------------11113333------------3333----------% FYHLHIDFLWVDESVRHDGYGSQLLHEIEGIAKEKGCRLILLDSFSFQAPEFYKKHGYRE %%%--------3333-----------------1111--------1111-----1111--- YGVVEDHPKGHSQHFFEKRL -------2222--------- >ALKALINE SERINE PROTEASE; SWP:Q65Z69; PDB:1Y9ZA; AETTPWGQTFVGATVLSDSQAGNRTICIIDSGYDRSHNDLNANNVTGTNNSGTGNWYQPG ----3333----1111-1111------------1111--1111------1111-1111-- NNNAHGTHVAGTIAAIANNEGVVGVMPNQNANIHIVKVFNEAGWGYSSSLVAAIDTCVNS ---------------------------------------3333----------------- GGANVVTMSLGGSGSTTTERNALNTHYNNGVLLIAAAGNAGDSSYSYPASYDSVMSVAAV --------------------------------------------------1111------ DSNLDHAAFSQYTDQVEISGPGEAILSTVTVGEGRLADITIGGQSYFSNGVVPHNRLTPS 1111--1111-------------------2222-------iiii-3333----------! 
GTSYAPAPINASATGALAECTVNGTSFSCGNMANKICLVERVGNQGSSYPEINSTKACKT !!!----------------------------2222--------------3333------- AGAKGIIVYSNSALPGLQNPFLVDANSDITVPSVSVDRATGLALKAKLGQSTTVSNQGNQ ----------3333---------1111---------------33332222---------- DYEYYNGTSMATPHVSGVATLVWSYHPECSASQVRAALNATADDLSVAGRDNQTGYGMIN ------33333333-----------3333------------------------!!!!--- AVAAKAYLDESCTGP ------33331111- >SMG-7 TRANSCRIPT VARIANT ; SWP:NA; PDB:1YA0A; MSLQSAQYLRQAEVLKADMTDSKLGPAEVWTSRQALQDLYQKMLVTDLEYALDKKVEQDL 3333-----------3333---------3333---------------------------- WNHAFKNQITTLQGQAKNRANPNRSEVQANLSLFLEAASGFYTQLLQELCTQSSSCSYIC ------------------------------------------------------------ QHCLVHLGDIARYRNQTSQAESYYRHAAQLVPSNGQPYNQLAILASSKGDHLTTIFYYCR ------------------------------3333-----------1111----------- SIAVKFPFPAASTNLQKALSKALESRDEVKTKWGVSDFIKAFIKFHGHVYLSKSLEKLSP -----------------------------------------------------3333--- LREKLEEQFKELLFQKAFNSQQLVHVTVINLFQLHHLRDFSNETEQHTYSQDEQLCWTQL --------------------------------------3333------------------ LALFMSFLGILCKCPLQNSQEESYNAYPLPAVKVSMDWLRLRPRVFQEAVVDERQYIWPW -----------------------------------------3333--33331111----- LISLLNSFHPHEEDLSISATPLPEEFELQGFLALRPSFRNLDFSKGHKEGQQRRIRQQRL -----1111-------------3333-222211111111---------1111-------- ISIGKWIADNQPRLIQCENEVGKLLFITEIPELILEDP ----------3333-----%%%%--------------- >Telethonin; SWP:O15273; PDB:1YA5T; MATSELSSEVSEENSERREAFWAEWKDLTLSTRPEEGSSLHEEDTQRHETYHQQGQSQVL --------------1111---------------3333----------------------- VQRSPWLMMRMGILGRGLQEYQLPYQRVL ------------2222------------- >APOLIPOPROTEIN E; SWP:P08226; PDB:1YA9A; PEVTDQLEWQSNQPWEQALNRFWDYLRWVQTLSDQVQEELQSSQVTQELTALMEDTMTEV ------1111------------------3333---------------------------- KAYKKELEEQLGPVAEETRARLGKEVQAAQARLGADMEDLRNRLGQYRNEVHTMLGQSTE -------1111---3333---------------------------------1111----- EIRARLSTHLRKMRKRLMRDAEDLQKRLAVYKAGAGVSAIRERLGPLV ------------------------------------------------ >ASPARTATE 
AMINOTRANSFERAS; SWP:P23542; PDB:1YAAA; SATLFNNIELLPPDALFGIKQRYGQDQRATKVDLGIGAYRDDNGKPWVLPSVKAAEKLIH --1111----------------------------------1111---------------- NDSSYNHEYLGITGLPSLTSNAAKIIFGTQSDALQEDRVISVQSLSGTGALHISAKFFSK -1111-----11113333---------1111--1111----------------------- FFPDKLVYLSKPTWANHMAIFENQGLKTATYPYWANETKSLDLNGFLNAIQKAPEGSIFV -------------1111----1111----------1111--------------2222--- LHSCAHNPTGLDPTSEQWVQIVDAIASKNHIALFDTAYQGFATGDLDKDAYAVRLVE --------------------------------------------3333---3333-- >YCAC GENE PRODUCT; SWP:P21367; PDB:1YACA; TKPYVRLDKNDAAVLLVDHQAGLLSLVRDIEPDKFKNNVLALGDLAKYFNLPTILTTSAE -------1111------------3333--------------------------------- TGPNGPLVPELKAQFPDAPYIARPGNINAWDNEDFVKAVKATGKKQLIIAGVVTEVCVAF -3333---------1111---------3333--------3333---------1111---- PALSAIEEGFDVFVVTDASGTFNEITRHSAWDRMSQAGAQLMTWFGVACELHRDWRNDIA --------------1111----------------1111---------------3333--- GLATLFSNHIPDYRNLMTSYDTLT ------------------------ >REGULATORY PROTEIN TENI; SWP:P25053; PDB:1YADA; MELHAITDDSKPVEELARIIITIQNEVDFIHIRERSKSAADILKLLDLIFEGGIDKRKLV ----------------------1111-------1111------------1111-3333-- MNGRVDIALFSTIHRVQLPSGSFSPKQIRARFPHLHIGRSVHSLEEAVQAEKEDADYVLF ------------------2222---------1111------------------------- GHVFRGVSLLSDIKQRISIPVIAIGGMTPDRLRDVKQAGADGIAVMSGIFSSAEPLEAAR ------------1111-----------3333--------------3333----------- RYSRKLKEMR ---------- >ACTIN; SWP:P02579; PDB:1YAGA; EVAALVIDNGSGMCKAGFAGDDAPRAVFPSIVGRPRHQGIMVGMGQKDSYVGDEAQSKRG -----------------2222-------------------2222-------------333 ILTLRYPIEHGIVTNWDDMEKIWHHTFYNELRVAPEEHPVLLTEAPMNPKSNREKMTQIM 3-------iiii---------------------3333----------------------- FETFNVPAFYVSIQAVLSLYSSGRTTGIVLDSGDGVTHVVPIYAGFSLPHAILRIDLAGR -------------------1111-------------------iiii-3333--------- DLTDYLMKILSERGYSFSTTAEREIVRDIKEKLCYVALDFEQEMQTAAQSSSIEKSYELP ---------1111-------------------------------1111--1111----11 DGQVITIGNERFRAPEALFHPSVLGLESAGIDQTTYNSIMKCDVDVRKELYGNIVMSGGT 
11-----3333----33333333---------------11113333-----------111 TMFPGIAERMQKEITALAPSSMKVKIIAPPERKYSVWIGGSILASLTTFQQMWISKQEYD 1-2222------------1111------1111---------33331111----------- ESGPSIVHHKCF --3333-3333- >TRANSCRIPTIONAL ACTIVATOR; SWP:P25052; PDB:1YAKA; KFSEECRSAAAEWWEGSFVHPFVQGIGDGTLPIDRFKYYVLQDSYYLTHFAKVQSFGAAY ---------3333-3333--3333------------------------------------ AKDLYTTGRMASHAQGTYEAEMALHREFAELLEISEEERKAFKPSPTAYSYTSHMYRSVL ---------------------------------------------------------333 SGNFAEILAALLPCYWLYYEVGEKLLHCDPGHPIYQKWIGTYGGDWFRQQVEEQINRFDE 3---------------------1111--------------1111---------------- LAENSTEEVRAKMKENFVISSYYEYQFWGMAYRKEGWSD -1111---------------------------------- >CHYMOPAPAIN; SWP:P14080; PDB:1YAL; YPQSIDWRAKGAVTPVKNQGACGSWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSY -----3333----------------------------------------------1111! GCKGGYQTTSLQYVANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNETSFL !!!-----------------3333----------3333---------------------- GALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWG ----------------------------------------------%%%%---------1 PNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFKGFA 111-iiii-----------2222------------- >PROTEASOME ALPHA SUBUNIT; SWP:P25156; PDB:1YARA; TVFSPDGRLFQVEYAREAVKKGSTALGMKFANGVLLISDKKVRSRLIEQNSIEKIQLIDD ---1111----------------------2222----------1111-3333------11 YVAAVTSGLVADARVLVDFARISAQQEKVTYGSLVNIENLVKRVADQMQQYTQYGGVRPY 11---------------------------------------------------------- GVSLIFAGIDQIGPRLFDCDPAGTINEYKATAIGSGKDAVVSFLEREYKENLPEKEAVTL ---------3333------3333----------1111----------------------- GIKALKSSLEEGEELKAPEIASITVGNKYRIYDQEEVKKFL -----1111--------------2222-----3333-1111 >Proteasome activator prot; SWP:Q9U8G2; PDB:1YARO; KRAALIQNLRDSYTETSSFAVIEEWAAGTLQEIEGIAKAAAEAHGVIRNSTYGRAQAEKS ---------11113333-----------------------------1111--3333---- PEQLLGVLQRYQDLCHNVYCQAETIRTVIAIRIPEHKEEDNLGVAVQHAVLKIIDELEIK ----------------------------1111---------------------------- 
TLGSGEKSGSGGAPTPIGMYALREYLSARSTVEDKLLGGGSQSPSLLLELRQIDADFMLK --------1111------------------------------------------------ VELATTHLSTMVRAVINAYLLNWKKLIQPRTGSDHMVS -------------------------------------- >FK506 BINDING PROTEIN; SWP:P20081; PDB:1YAT; GKDRISPGDGATFPKTGDLVTIHYTGTLENGQKFDSSVDRGSPFQCNIGVGQVIKGWDVG i-------------2222---------1111------1111------------3333--3 IPKLSVGEKARLTIPGPYAYGPRGFPGLIPPNSTLVFDVELLKVN 333-2222------3333-!!!!-2222----------------- >HYPOTHETICAL PROTEIN BSU1; SWP:O31698; PDB:1YAVA; LLEATVGQFMIEADKVAHVQVGNNLEHALLVLTKTGYTAIPVLDPSYRLHGLIGTNMIMN ----3333---3333----2222--------------------1111------------- SIFGLERIEFEKLDQITVEEVMLTDIPRLHINDPIMKGFGMVINNGFVCVENDEQVFEGI ---1111-3333----3333---------11113333--------------1111----- FTRRVVLKELNKHI -------------- >VIRULENCE SENSOR PROTEIN ; SWP:P14147; PDB:1YAXA; MDKTTFRLLRGESNLFYTLAKWENNKISVELPENLDMQSPTMTLIYDETGKLLWTQRNIP ----------------1111--%%%%--------------------3333--------33 WLIKSIQPEWLKTNGFHEIETNVDATSTLLSEDHSAQEKLKEVREDDDDAEMTHSVAVNI 3333333333---------------3333-11113333--------1111---------- YPATARMPQLTIVVVDTIPIELKRSYMHHHHHH ------------------3333----------- >prophage LambdaBa02, N-ac; SWP:Q81WA9; PDB:1YB0A; MEIRKKLVVPSKYGTKCPYTMKPKYITVHNTYNDAPAENEVNYMITNNNEVSFHVAVDDK --------3333------------------------------------------------ QAIQGIPWERNAWACGDGNGPGNRESISVEICYSKSGGDRYYKAENNAVDVVRQLMSMYN ------------------------------------------------------------ IPIENVRTHQSWSGKYCPHRMLAEGRWGAFIQKVKSG -3333--3333-------------------------- >17-BETA-HYDROXYSTEROID DE; SWP:Q8NBQ5; PDB:1YB1A; RRKSVTGEIVLITGAGHGIGRLTAYEFAKLKSKLVLWDINKHGLEETAAKCKGLGAKVHT ----2222-----1111------------------------------------------- FVVDCSNREDIYSSAKKVKAEIGDVSILVNNAGVVYTSDLFATQDPQIEKTFEVNVLAHF ---------------------------------------3333----------------- WTTKAFLPAMTKNNHGHIVTVASAAHVSVPFLLAYCSSKFAAVGFHKTLTDELAALQITG ---------------------------3333----------------------1111111 VKTTCLCPNFVNTGFIKNPSTSLGPTLEPEEVVNRLMHGILTEQKMIFIPSSIAFLTTLE 
1------3333---3333------------------------------------------ RIL --- >HYPOTHETICAL PROTEIN TA08; SWP:Q9HJW1; PDB:1YB2A; PVILVSEDEYGKFDESTNSILVGKHHLGSRVIEPGDELIVSGKSFIVSDFSPYFGRVICG --------------------------------2222---iiii---------3333---- LRPGDILEVGVGSGNSSYILYALNGKGTLTVVERDEDNLKKADNLSEFYDIGNVRTSRSD ----------!!!!--------iiii-----------------3333---1111-----3 IADFISDQYDAVIADIPDPWNHVQKIASKPGSVATFYLPNFDQSEKTVLSLSASGHHLET 333--------------3333-------2222------------------3333------ VELKRRILVREGATRPASDDLTHTAFITFAIKKSGVYRI ---------2222--3333-------------------- >HYPOTHETICAL PROTEIN; SWP:Q8U4C0; PDB:1YB3A; LKEVHELLNRIWGDIFELREELKEELKGFTVEEVSEVFNAYLYIDGKWEEKYPHPAFAVK -------------------------2222--------------iiii------------- PGGEVGATPQGFYFVFAFPKEELSKEFIEDVIRAFEKLFIYGAENFLEDFYNFEHPISGD -------3333-------3333---------------------3333----3333----- EVWDRIVNSDEEINFEVDLGFDKEEVKREIKRFIELARRYNLL -------------------------------------1111-- >TARTRONIC SEMIALDEHYDE RE; SWP:Q8ZR83; PDB:1YB4A; KLGFIGLGIGSPAINLARAGHQLHVTTIGPVADELLSLGAVNVETARQVTEFADIIFIVP ----------------1111-----------3333----------3333----------- DTPQVEDVLFGEHGCAKTSLQGKTIVDSSISPIETKRFAQRVNEGADYLDAPVSGGEIGA -------------------2222-------3333-------------------------- REGTLSIVGGEQKVFDRVKPLFDILGKNITLVGGNGDGQTCKVANQIIVALNIEAVSEAL ------------------------------------------------------------ VFASKAGADPVRVRQALGGFASSRILEVHGERINRTFEPGFKIALHQKDLNLALQSAKAL ---1111-----------11113333---------------------------------- ALNLPNTATCQELFNTCAANGGSQLDHSAVQALELANHKL ---------------------1111----3333------- >QUINONE OXIDOREDUCTASE; SWP:Q08257; PDB:1YB5A; KLMRAVRVFEFGGPEVLKLRSDIAVPIPKDHQVLIKVHACGVNPVETYIRSGTYSRKPLL ------------3333------------!!!!----------3333--3333-------- PYTPGSDVAGVIEAVGDNASAFKKGDRVFTSSTISGGYAEYALAADHTVYKLPEKLDFKQ ---------------1111---2222------------------1111----11113333 GAAIGIPYFTAYRALIHSACVKAGESVLVHGASGGVGLAACQIARAYGLKILGTAGTEEG ---------------------2222--------3333----------------------- 
QKIVLQNGAHEVFNHREVNYIDKIKKYVGEKGIDIIIEMLANVNLSKDLSLLSHGGRVIV ----1111-----1111-3333------3333-------3333---------2222---- VGSRGTIEINPRDTMAKESSIIGVTLFSSTKEEFQQYAAALQAGMEIGWLKPVIGSQYPL --------------1111------1111------------------------------33 EKVAEAHENIIHGSGATGKMILLL 33---------------------- >URIDYLATE KINASE; SWP:P65932; PDB:1YBDA; QIKYKRVLLKLSGESLGSDPFGINHDTIVQTVGEIAEVVKGVQVGIVVGGGNIFRGVSAQ -----------3333----------------------------------33333333333 AGSDRATADYGATVNALALKDAFETLGIKARVQSALSQQIAETYARPKAIQYLEEGKVVI 3----------------------1111--------------------------------- FAAGTGNPFFTTDTAAALRGAENCDVLKATNVDGVYTADPKKDPSATRYETITFDEALLK --!!!!--------------------------------33331111-------------- NLKVDATAFALCRERKLNIVVFGIAKEGSLKRVITGEDEGTLVHC ------------1111------3333------------------- >NICOTINATE PHOSPHORIBOSYL; SWP:Q8UIS9; PDB:1YBEA; MTKTDIATRWKLDPIVRSLIDTDFYKLLMLQMIWKLYPEVDATFSLINRTKTVRLAEEID 3333-------------1111-3333----------1111-------------------- EMELREQLDHARTLRLSKKENIWLAGNTFYGRSQIFEPEFLSWLSSYQLPEYELFKRDGQ ----------1111-----------------------------1111---------%%%% YELNFHGRWMDTTLWEIPALSIINELRSRSAMRSLGYFTLDVLYARAKAKMWEKVERLRE -------33333333----------------3333------------------------- LPGLRISDFGTRRRHSFLWQRWCVEALKEGIGPAFTGTSNVLLAMDSDLEAVGTNAHELP 3333-----3333------------------3333------------------------- MVVAALAQTNEELAAAPYQVLKDWNRLYGGNLLIVLPDAFGTAAFLRNAPEWVADWTGFR ---3333-------3333----------!!!!-------------11113333------- PDSAPPIEGGEKIIEWWRKMGRDPRTKMLIFSDGLDVDAIVDTYRHFEGRVRMSFGWGTN -----------------1111-3333--------------------2222-------333 LTNDFAGCAPLKPISIVCKVSDANGRPAVKLSDNPQKATGDPAEVERYLKFFGEED 3---------------------iiii-------3333------------------- >AMP NUCLEOSIDASE; SWP:Q7MVU1; PDB:1YBFA; TKQEIVENWLPRYTQRQLIDFEPYILLTNFSHYLHVFAEHYGVPIVGEHTSPNASAEGVT ----------------1111---------3333-----1111------------------ LINFGGSANAATIDLLWAIHPKAVIFLGKCGGLKLENALGDYLLPIAAIRGEGTSNDYLP -------------------------------3333-----------------3333---3 
EEVPSLPSFSVLRAISSAIQNKGKDYWTGTVYTTNRRVWEYDEKFKDYLRSTHASGVDET 333----------------1111---------------1111-------1111-----33 ATLTVGFANKIPGALLLISDRPFPEGVKTEESNFAEEHLLGIDALEIIRENK 33----1111-----------------------3333--------------- >ACETOLACTATE SYNTHASE, CH; SWP:P17597; PDB:1YBHA; TFISRFAPDQPRKGADILVEALERQGVETVFAYPGGASMEIHQALTRSSSIRNVLPRHEQ ----------------------1111--------1111------1111---------333 GGVFAAEGYARSSGKPGICIATSGPGATNLVSGLADALLDSVPLVAITGQVPRRMIGTDA 3---------------------!!!!-------------------------3333----- FQETPIVEVTRSITKHNYLVMDVEDIPRIIEEAFFLATSGRPGPVLVDVPKDIQQQLAIP ---------1111--------3333------------------------3333------- NWEQAMRLPGYMSRMPKPPEDSHLEQIVRLISESKKPVLYVGGGCLNSSDELGRFVELTG -----------1111---------------1111--------1111-------------- IPVASTLMGLGSYPDDELSLHMLGMHGTVYANYAVEHSDLLLAFGVRFDDRVTGKLEAFA -----3333------1111----1111---------------------3333--3333-- SRAKIVHIDIDSAEIGKNKTPHVSVCGDVKLALQGMNKVLENRAEELKLDFGVWRNELNV ----------3333---------------------------------------------- QKQKFPLSFKTFGEAIPPQYAIKVLDELTDGKAIISTGVGQHQMWAAQFYNYKKPRQWLS -----------!!!!-------------%%%%---------------------------- SGGLGAMGFGLPAAIGASVANPDAIVVDIDGDGSFIMNVQELATIRVENLPVKVLLLNNQ --------------------1111------3333-------------------------- HLGMVMQWEDRFYKANRAHTFLGDPAQEDEIFPNMLLFAAACGIPAARVTKKADLREAIQ ------------%%%%-------3333------3333--1111-------3333------ TMLDTPGPYLLDVICPHQEHVLPMIPSGGTFNDVITEGDGR ---------------1111------22221111-------- >NON-TOXIN HAEMAGGLUTININ ; SWP:Q45871; PDB:1YBIA; SLNDKIVTISCKADTNLFFYQVAGNVSLFQQTRNYLERWRLIYDSNKAAYKIKSMDIHNT -2222-----1111-------iiii-------------------3333------------ NLVLTWNAPTHNISTQQDSNADNQYWLLLKDIGNNSFIIASYKNPNLVLYADTVARNLKL --------------------1111------2222------3333---------------- STLNNSNYIKFIIEDYIISDLNNFTCKISPILDLNKVVQQVDVTNLNVNLYTWDYGRNQK -----3333-----------2222-----3333--------1111----------3333- WTIRYNEEKAAYQFFNTILSNGVLTWIFSNGNTVRVSSSNDQNNDAQYWLINPVSDTDET ------1111-----3333-------3333------------------------------ 
YTITNLRDTTKALDLYGGQTANGTAIQVFNYHGDDNQKWNIRNP ----3333------2222--2222---------1111------- ---------------------------------------------------- >HYDROLASE, ALPHA/BETA HYD; SWP:Q8VJU6; PDB:1YBTA; AERLATIFTDIVGSTQHAAALGDDRWRDLLDNHDTIVCHEIQRFGGREVNTAGDGFVATF -----------------------------------------1111--------------- TSPSAAIACADDIVDAVAALGIEVRIGIHAGEVEVRDASHGTDVAGVAVHIGARVCALAG ----------------3333-------------------------3333------11112 PSEVLVSSTVRDIVAGSRHRFAERGEQELKGVPGRWRLCVLRDD 222--------1111-------------2222------------ >HEPATOCYTE GROWTH FACTOR ; SWP:Q04756; PDB:1YBWA; ACGRRHKKIIGGSSSLPGSHPWLAAIYIGDSFCAGSLVHTCWVVSAAHCFSHSPPRDSVS ---2222--------22221111---------------1111---33331111------- VVLGQHFFNRTTDVTQTFGIEKYIPYTLYSVFNPSDHDLVLIRLKKKGDRCATRSQFVQP -------------------------11111111-------------%%%%----1111-- ICLPEPGSTFPAGHKCQIAGWGHLDENVSGYSSSLREALVPLVADHKCSSPEVYGADISP ----2222--2222----------3333---------------------1111!!!!-11 NMLCAGYFDCKSDACQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVD 11---------------2222-----iiii-----------iiii--------3333--- WINDRI ------ >CONSERVED HYPOTHETICAL PR; SWP:Q4CBW4; PDB:1YBXA; INNLVKQAQKQRDERVQEELKEKTVEASAGGGAVTVVATGRKDIKEITIKPEVVDPDDVE -----------------------------iiii-----1111-------3333-1111-- LQDLILAAVNEALRKADEVTAEISKIT -----------------------1111 >TRANSLATION ELONGATION FA; SWP:A3DDQ3; PDB:1YBYA; ISAGDFKNGVTFELDGQIFQVIEFQHVKPGAAFVRTKLKNIVTGATIEKTFNPTDKPKAH -3333-2222---iiii----------------------------------3333----- IERKDQYLYNDGDLYYFDTETFEQLPLGKDKIGDALKFVKENEIVKVLSHKGNVFGIEPP ----------!!!!-------------3333!!!!11112222------iiii------- NFVELEVTDTTATGATKPAIVETGASIKVPLFVNKGDIIRIDTRTGEYERV --------------------1111-----11112222-------------- >CHORISMATE MUTASE; SWP:NA; PDB:1YBZA; GSTTLKLLRKEIDKIDNQIISLLKKRLEIAQAIGKIKKELNLPIEDRKREEEVLRRAGEF --------------------------------------------------------!!!! 
REIFEKILEVSKDVQR ------------1111 >Kunitz-type protease inhi; SWP:O43278; PDB:1YC0I; QHQHQMHQTEDYCLASNKVGRCRGSFPRWYYDPTEQICKSFVYGGCLGNKNNYLREEECI --3333--------------------------1111------------------------ LACRGV ------ >NAD-DEPENDENT DEACETYLASE; SWP:Q9WYW0; PDB:1YC5A; MKMKEFLDLLNESRLTVTLTGAGISTPSGIPDFQNVFDIDFFYSHPEEFYRFAKEGIFPM ------------------------3333------1111---------------------1 LQAKPNLAHVLLAKLEEKGLIEAVITQNIDRLHQRAGSKKVIELHGNVEEYYCVRCEKKY 111------------1111--------------1111-----1111-------------- TVEDVIKKLESSDVPLCDDCNSLIRPNIVFFGENLPQDALREAIGLSSRASLMIVLGSSL -------3333------------------2222--------------------------- VVYPAAELPLITVRSGGKLVIVNLGETPFDDIATLKYNMDVVEFARRVMEEGGI ---3333-----1111----------1111------------------------ >Coat protein; SWP:P03602; PDB:1YC61; AAGQGKAIKAIAGYSISKWEASSDAITAKATNAMSITLPHELSSEKNKELKVGRVLLWLG ----------2222------------2222--------3333-3333------------- LLPSVAGRIKACVAEKQAQAEAAFQVALAVADSSKEVVAAMYTDAFRGATLGDLLNLQIY -3333-------------3333---------1111----------------3333----- LYASEAVPAKAVVVHLEVEHVRPTFDDFFTPVYR ---------------------------------- >anti-VSG immunoglobulin h; SWP:NA; PDB:1YC7A; VQLVESGGGSVQAGGSLRLSCAVSGSTYSPCTTGWYRQAPGKEREWVSSISSPGTIYYQD -----------2222-----------------------2222---------2222---33 SVKGRFTISRDNAKNTVYLQMNSLQREDTGMYYCQIQCGVRSIREYWGQGTQVTVS 33---------1111---------3333---------------------------- >CYTOCHROME C; SWP:P00044; PDB:1YCC; GSAKKGATLFKTRCLQCHTVEKGGPHKVGPNLHGIFGRHSGQAEGYSYTDANIKKNVLWD -------------3333---2222-------2222-----------------3333---3 ENNMSEYLTNPKKYIPGTKMAFGGLKKEKDRNDLITYLKKACE 333-------33332222------------------------- >Hypothetical 27.3 kDa pro; SWP:P38777; PDB:1YCDA; VQIPKLLFLHGFLQNGKVFSEKSSGIRKLLKKANVQCDYIDAPVLLEKKDLPFEMDDEKW ------------------------------1111------------3333---------- QATLDADVNRAWFYHSEISHELDISEGLKSVVDHIKANGPYDGIVGLSQGAALSSIITNK ---1111----------3333--------------------------------------3 ISELVPDHPQFKVSVVISGYSFTEPDPEHPGELRITEKFRDSFAVKPDMKTKMIFIYGAS 
333-2222-----------------1111------3333-1111-------------111 DQAVPSVRSKYLYDIYLKAQNGNKEKVLAYEHPGGHMVPNKKDIIRPIVEQITSSLQ 1----------------1111-3333--------------3333---------1111 >NITRIC OXIDE REDUCTASE; SWP:Q9FDN7; PDB:1YCHA; SQPVAITDGIYWVGAVDWNIRYFHGPAFSTHRGTTYNAYLIVDDKTALVDTVYEPFKEEL ------2222-------------!!!!-------------------------3333---- IAKLKQIKDPVKLDYLVVNHTESDHAGAFPAIMELCPDAHVLCTQRAFDSLKAHYSHIDF ---1111--------------1111----------1111--------------------- NYTIVKTGTSVSLGKRSLTFIEAPMLHWPDSMFTYVPEEALLLPNDAFGQHIATSVRFDD ----------------------2222---------3333------2222--------111 QVDAGLIMDEAAKYYANILMPFSNLITKKLDEIQKINLAIKTIAPSHGIIWRKDPGRIIE 1-----------------3333-------------------------------3333--- AYARWAEGQGKAKAVIAYDTMWLSTEKMAHALMDGLVAGGCEVKLFKLSVSDRNDVIKEI ----------------------------------------------1111-------333 LDARAVLVGSPTINNDILPVVSPLLDDLVGLRPKNKVGLAFGAYGWGGGAQKILEERLKA 3-----------%%%%-3333-------3333--------------------------11 AKIELIAEPGPTVQWVPRGEDLQRCYELGRKIAARIAD 11------------------------------------ >PEPTIDOGLYCAN RECOGNITION; SWP:O75594; PDB:1YCKA; CSPIVPRNEWKALASECAQHLSLPLRYVVVSHTAGSSCNTPASCQQQARNVQHYHMKTLG -----3333--------------------------------------------------- WCDVGYNFLIGEDGLVYEGRGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQ -----------------------------3333------------------3333----- GLLACGVAQGALRSNYVLKGHRDVQRTLSPGNQLYHLIQNWPHYRSP ------------1111---3333-----------------2222--- >putative Ca2+-dependent m; SWP:Q9SYT0; PDB:1YCNA; SATLKVSDSVPAPSDDAEQLRTAFEEDLIISILAHRSAEQRKVIRQAYHETYGEDLLKTL -----------3333----------3333--1111-------------------1111-- DKELSNDFERAILLWTLEPGERDALLANEATKRWTSSNQVLMEVACTRTSTQLLHARQAY ----------------------------------3333---------------------- HARYKKSLEEDVAHHTTGDFRKLLVSLVTSYRYEGDEVNMTLAKQEAKLVHEKIKDKHYN ----------------------------------------------------3333-111 DEDVIRILSTRSKAQINATFNRYQDDHGEEILKSLEEGDDDDKFLALLRSTIQCLTRPEL 1----------------------------3333-11111111------------------ 
YFVDVLRSAINKTGTDEGALTRIVTTRAEIDLKVIGEEYQRRNSIPLEKAITKDTRGDYE ---------------2222-------1111------------------------------ KMLVALLGEDDA ----11111111 >BRANCHED-CHAIN PHOSPHOTRA; SWP:Q834I9; PDB:1YCOA; MITVSIAGGSQPEILQLVKKALKEAEQPLQFIVFDTNENLDTENLWKYVHCSDEAAVAQE ----------------------------------------------------3333---- AVSLVATGQAQILLKGIIQTHTLLKEMLKSEHQLKNKPILSHVAMVELPAGKTFLLTDCA ----------------------------3333---------------------------- MNIAPTQATLIEIVENAKEVAQKLGLHHPKIALLSAAENFNPKMPSSVLAKEVTAHFNDQ ---------------------1111---------------1111----------1111-- QEATVFGPLSLDLATSEEAVAHKRYSGPIMGDADILVVPTIDVGNCLYKSLTLFGHAKVG ------------------------------------------------------------ GTIVGTKVPVVLTSRSDSTESKFHSLRFAMRQVHHH -------------3333------------------- >MDM2; SWP:P56273; PDB:1YCQA; EKLVQPTPLLLSLLKSAGAQKETFTMKEVIYHLGQYIMAKQLYDEKQQHIVHCSNDPLGE --------------1111-----------------------------------------3 LFGVQEFSVKEPRRLYAMISRNLVSANV 333----1111----------------- >Apoptosis-stimulating of ; SWP:Q13625; PDB:1YCSB; PLALLLDSSLEGEFDLVQRIIYEVDDPSLPNDEGITALHNAVCAGHTEIVKFLVQFGVNV 3333---------------3333-------1111-3333--------------------- NAADSDGWTPLHCAASCNNVQVCKFLVESGAAVFAMTYSDMQTAADKCEEMEEGYTQCSQ ---1111-------1111--------1111-1111-------3333-------------- FLYGVQEKMGIMNKGVIYALWDYEPQNDDELPMKEGDCMTIIHREDEDEIEWWWARLNDK -----------%%%%-----------1111------------------------------ EGYVPRNLLGLYP ----3333----- >CONSERVED HYPOTHETICAL PR; SWP:Q8TZN2; PDB:1YCYA; SLLEKVLKEWKGHKVAVSVGFTGTLEDFDEEVILLKDVVDVIGNRGKQLIGLEDINWILL 3333---1111----------------------------1111-------3333------ >UVRABC SYSTEM PROTEIN C; SWP:Q9WYA3; PDB:1YD0A; MKEKIRKKILLAPEEPGVYIFKNKGVPIYIGKAKRLSNRLRSYLNPQTEKVFRIGEEADE ----------------------iiii---------------3333--------------- LETIVVMNEREAFILEANLIKKYRPKYNV ------------------------1111- >UVRC; SWP:Q5KWH6; PDB:1YD6A; MNERLKEKLAVLPEQPGCYLMKDKHGTVIYVGKAKSLKERVRSYFTGTHDGKTQRLVEEI -------3333------------------------------3333----3333---1111 
ADFEYIVTSSNAEALILEMNLIKKHDPKYNVMLKD --------------------------33331111- >2-keto acid:ferredoxin ox; SWP:Q8U046; PDB:1YD7A; RFPFPVGEPDFIQGDEAIARAAILAGCRFYAGYPITPASEIFEAALYPLVDGVVIQEDEI ----------------------1111-------------3333----1111--------- ASIAAAIGASWAGAKATATSGPGFSLQENITETPVVIVDVQDHSLIVLSPSTVQEAFDFT -------------------!!!!---3333------------------------------ IRAFNLSEKYRTPVILLTDAEVGHRERVYIPNPDEIEIINRK -------------------------------3333------- >CORE HISTONE MACRO-H2A.1; SWP:Q02874; PDB:1YD9A; GFTVLSTKSLFLGQKLQVVQADIASIDSDAVVHPTNTDFYIGGEVGSTLEKKGGKEFVEA ---------1111--------3333----------1111--------------------- VLELRKKNGPLEVAGAAVSAGHGLPAKFVIHCNSPVWGSDKCEELLEKTVKNCLALADDR -----------2222-----2222---------------------------------111 KLKSIAFPSIGSGRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIGIYVQ 1-----------1111----------------11111111-------------------- EMAKLDAN -1111--- >RETINAL DEHYDROGENASE/RED; SWP:Q9BPX1; PDB:1YDEA; GTRYAGKVVVVTGGGRGIGAGIVRAFVNSGARVVICDKDESGGRALEQELPGAVFILCDV ---2222-------------------1111--------3333-------1111-----11 TQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTK 11--------------------------------3333---------------------- LALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALALDESPYGVRVNCIS -----------------3333---------------------------3333-------- PGNIWTPLWEELAALMPDPRASIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIEL ------------1111-------------3333---3333-----------1111----- LVTGGAELGY --iiii---- >HYDROLASE, HALOACID DEHAL; SWP:Q97Q24; PDB:1YDFA; SLYKGYLIDLDGTIYKGKDRIPAGETFVHELQKRDIPYLFVTNNTTRTPESVKEMLAQNF ---------2222--!!!!----------------------------------------- NIDTPLSTVYTATLATIDYMNDLGLEKTVYVVGEAGLKEAIKAAGYVEDKEKPAYVVVGL ----3333---------------------------------------------------- DWQVDYEKFATATLAIQKGAHFIGTNPDLNIPTERGLLPGAGSLITLLEVATRVKPVYIG -----------------------------------------------1111--------- KPNAIIMDKAVEHLGLEREELIMVGDNYLTDIRAGIDNGIPTLLVTTGFTKAEEVAGLPI ----------------3333---------------1111-----------3333------ 
APTHVVSSLAEWDFD -------1111---- >TRP REPRESSOR BINDING PRO; SWP:Q9RYU4; PDB:1YDGA; APVKLAIVFYSSTGTGYAMAQEAAEAGRAAGAEVRLLKVRETAPQDVIDGQDAWKANIEA ---------------------------1111------------333311113333---11 MKDVPEATPADLEWAEAIVFSSPTRFGGATSQMRAFIDTLGGLWSSGKLANKTFSAMTSA 11-----33333333---------iiii-------------------------------- QNVNGGQETTLQTLYMTAMHWGAVLTPPGYTDEVIFKSGGNPYGASVTANGQPLLENDRA -1111-----------3333---------------1111-3333---------------- SIRHQVRRQVELTAKLLEGGS -----------------1111 >AT5G11950; SWP:Q84MC2; PDB:1YDHA; QRSRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLGLISRRVYEGG ---------------------------------1111-------------------1111 LHVLGIIPKALPIEISGETVGDVRVVADHERKAAAQEAEAFIALPGGYGTEELLEITWSQ -------1111-----------------------1111--------3333---------- LGIHKKTVGLLNVDGYYNNLLALFDTGVEEGFIKPGARNIVVSAPTAKELEKEEYT ------------iiii-----------------33333333----3333------- >VINCULIN ISOFORM VCL; SWP:P18206-2; PDB:1YDIA; HMPVFHTRTIESILEPVAQQISHLVIMHEAIPDLTAPVAAVQAAVSNLVRVGKETVQTTE ------------------------------------------------------------ DQILKRDMPPAFIKVENACTKLVQAAQMLQSDPYSVPARDYLIDGSRGILSGTSDLLLTF 3333---------------------------1111------------------------- DEAEVRKIIRVCKGILEYLTVAEVVETMEDLVTYTKNLGPGMTKMAKMIDERQQELTHQE --------------------3333----------------------------1111---- HRVMLVNSMNTVKELLPVLISAMKIFVTTKNSKNQGIEEALKNRNFTVEKMSAEINEIIR ------------------------------------------------------------ VLQLTSWDEDAW 1111-1111--- >GENERAL TRANSCRIPTION FAC; SWP:Q6ZYL4; PDB:1YDLA; SHGTRKGLIECDPAKQFLLYLDESNALGKKFIIQDIDDTHVFVIAELVNVLQERVGELDQ ------------------------1111-------------------------3333--- NAFSLTQK 1111---- >HYPOTHETICAL PROTEIN YQGN; SWP:P54491; PDB:1YDMA; QLRKKTLEALSALSNEDILQKTERYKYLFSLPEWQNAGTIAVTISRGLEIPTRPVIEQAW 3333-----3333-----------------3333-----------!!!!--3333----- EEGKQVCIPKCTKKQFRTYQTDDQLETVYAGLLEPVKTKEVNPSQIDLIVPGVCFDVNGF ---------------------------1111----------3333----------1111- RVGFGGGYYDRYLSEYEGKTVSLLLECQLFAHVPRLPHDIPVHKLITEDRIISCF 
------33333333----------1111--------------------------- >HYDROXYMETHYLGLUTARYL-COA; SWP:Q8YEF2; PDB:1YDNA; AEHVEIVEAARDGLQNEKRFVPTADKIALINRLSDCGYARIEATSFVSPKWVPQLADSRE ----------3333---------------------------------33333333-3333 VAGIRRADGVRYSVLVPNKGYEAAAAAHADEIAVFISASEGFSKANINCTIAESIERLSP --------------------------------------3333------------------ VIGAAINDGLAIRGYVSCVVECPYDGPVTPQAVASVTEQLFSLGCHEVSLGDTIGRGTPD ----------------------------------------3333-------------333 TVAALDAVLAIAPAHSLAGHYHDTGGRALDNIRVSLEKGLRVFDASVGGLGGCPFAPGAK 3------3333-3333------1111-------------------1111---1111---- GNVDTVAVVELHEGFETGLDLDRLRSAGLFTQALRQD ------------------------------------- >HMG-COA LYASE; SWP:O34873; PDB:1YDOA; PYPKKVTIKEVGPRDGLQNEPVWIATEDKITWINQLSRTGLSYIEITSFVHPKWIPALRD ---------------1111-------------------------------33333333-- AIDVAKGIDREKGVTYAALVPNQRGLENALEGGINEACVFSASETHNRKNINKSTSESLH ---1111---2222---------------------------------------------- ILKQVNNDAQKANLTTRAYLSTVFGCPYEKDVPIEQVIRLSEALFEFGISELSLGDTIGA ---------1111-------------------3333--------3333------------ ANPAQVETVLEALLARFPANQIALHFHDTRGTALANVTALQGITVFDGSAGGLGGCPYAP -3333------------3333------1111-3333-3333-------2222---3333- GSSGNAATEDIVYLEQDIKTNVKLEKLLSAAKWIEEKGKPLPSRNLQVFKS ---------------------------------3333-------------- >MHC CLASS I ANTIGEN; SWP:P17693; PDB:1YDPA; SHSMRYFSAAVSRPGRGEPRFIAMGYVDDTQFVRFDSDSASPRMEPRAPWVEQEGPEYWE ------------2222----------!!!!-----1111--------1111--------- EETRNTKAHAQTDRMNLQTLRGYYNQSEASSHTLQWMIGCDLGSDGRLIRGYERYAYDGK --------------------------1111------------1111----------iiii DYLALNEDLRSWTAADTAAQISKRKCEAANVAEQRRAYLEGTCVEWLHRYLENGKEMLQR -----1111------3333----------------------------------------- ADPPKTHVTHHPVFDYEATLRCWALGFYPAEIILTWQRDGEDQTQDVELVETRPAGDGTF -------------------------------------------1111------------- QKWAAVVVPSGEEQRYTCHVQHEGLPEPLMLRWKQ --------22221111-----3333---------- >AT5G01610; SWP:Q9M015; PDB:1YDUA; 
SDQIFNKVGSYWLGQKANKQFDSVGNDLNSVSTSIEGGTKWLVNKIKGKMQKPLPELLKE ------------------------------------------------------------ YDLPIGIFPGDATNYEFDEETKKLTVLIPSICEVGYKDSSVLKFTTTVTGHLEKGKLTDV ------------------3333-------------------------------------- EGIKTKVMIWVKVTSISTDASKVYFTAGMKKSRSRDAYGVQRNGLRVDKF -------------------------------------------------- >AX110P-LIKE PROTEIN; SWP:Q9SZ83; PDB:1YDWA; QIRIGVGCADIARKVSRAIHLAPNATISGVASRSLEKAKAFATANNYPESTKIHGSYESL --------3333------------------------------1111-1111----33331 LEDPEIDALYVPLPTSLHVEWAIKAAEKGKHILLEKPVANVTEFDKIVDACEANGVQIDG 111----------1111--------1111----------------------1111----- TWVHNPRTALLKEFLSDSERFGQLKTVQSCFSFAGDEDFLKNDIRVKPGLDGLGALGDAG -11113333---1111--------------------------11111111---------- WYAIRATLLANNFELPKTVTAFPGAVLNEAGVILSCGASLSWEDGRTATIYCSFLANLTE --------1111---------------1111----------------------------- ITAIGTKGTLRVHDFIIPYKETEASFTTSTKAWFNDLVTAWVSPPSEHTVKTELPQEACV -------------------1111-----------1111----------------3333-- REFARLVYWPSISRKTQLVVDAVKESVDKNYQQISLS ----------------------------%%%%----- >type I restriction enzyme; SWP:Q49434; PDB:1YDXA; TPKLKLNNNINWTKRTIDSLFDLKKGELEKELITPEGKYEYFNGGVKNSGRTDKFNTFKN ---------------3333---------3333-1111----------------------- TISVIVGGSCGYVRLADKNFFCGQSNCTLNLLDPLELDLKFAYYALKSQQERIEALAFGT --------2222----------1111------3333-----------------3333--- TIQNIRISDLKELEIPFTSNKNEQHAIANTLSVFDERLENLASLIEINRKLRDEYAHKLF -----3333-------------------------------------------------11 SLDEAFLSHWKLEALQSQHEITLGEIFNFKSGKYLKSEERLEEGKFPYYGAGIDNTGFVA 11----------3333-----3333----------1111--------------------- EPNTEKDTISIISNGYSLGNIRYHEIPWFNGTGSIALEPNNEIYVPFFYCALKYLQKDIK --------------1111---------------------11113333------------- ERKSDDSPFLSLKLAGEIKVPYVKSFQLQRKAGKIVFLLDQKLDQYKKELSSLTVIRDTL ------------------------------------------------------------ LKKLFPDT -3333--- >GLYCEROPHOSPHORYL DIESTER; SWP:P09394; PDB:1YDYA; NEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDNLVVLHDHYLDRVTDV 
----------3333-2222---------------------------------------33 ADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTF 33-1111-1111--3333-----1111--------iiii----1111-2222-------- EEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQ ------------------------------1111-----------1111--1111----- CFDADELKRIKNELEPKMGMELNLVQLIAYTDWNETQQKQPDGSWVNYNYDWMFKPGAMK -------------3333------------1111------1111------3333-2222-- QVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTPDVN -3333--------111111112222----------1111--------1111-1111---- QLYDALYNKAGVNGLFTDFPDKAVKFLN -------3333-------3333------ >HYPOTHETICAL UPF0334 KINA; SWP:O67322; PDB:1YE8A; KIIITGEPGVGKTTLVKKIVERLGKRAIGFWTEEVRRTGFRIITTEGKKKIFSSKFFTSK ------2222------------!!!!-----------------1111------------- KLVGSYGVNVQYFEELAIPILERAYREAKKDRRKVIIIDEIGKELFSKKFRDLVRQIHDP --!!!!------------------------3333---------3333-----------11 NVNVVATIPIRDVHPLVKEIRRLPGAVLIELTPENRDVILEDILSLLER 11--------------------2222-----1111-------------- >CYTOCHROME C; SWP:P00045; PDB:1YEA; GSAKKGATLFKTRCQQCHTIEEGGPNKVGPNLHGIFGRHSGQVKGYSYTDANINKNVKWD --------------1111--2222-------2222-------2222------3333---- EDSMSEYLTNPKKYIPGTKMAFAGLKKEKDRNDLITYMTKAAK ----------33332222------------------------- >CYTOCHROME C; SWP:P00045; PDB:1YEB; GSAKKGATLFKTRCQQCHTIEEGGPNKVGPNLHGIFGRHSGQVKGYSYTDANINKNVKWD -------------3333---2222-------2222-------2222-------------3 EDSMSEYLTNPKKYIPGTKMAFGGLKKEKDRNDLITYLKKACE 333-------33332222------------------------- >IGG1 FAB FRAGMENT (D.2.4); SWP:NA; PDB:1YEDH; AVKLQQSGPELVRPGTSVKLSCKTSGYIFTSYWIHWLKQSSGQGLEWIARIYPGTGGTYY ----------------------------1111---------------------------- NEKFKGKATLTADKSSSTAYMQLSSL 3333---------1111--------- >Ig gamma-2A chain C regio; SWP:GCAM_MOUSE; PDB:1YEEH; EVKLQESGAELVRPGASVKLSCKTSGYIFTSYWIHWVKQRAAAGLEWIARIYPGTGSSYY ------------2222-----------1111----------------------------- NVKFKGKATLTADKSSSTAYMQLSSLKSDDSAVYFCVRWGFIPVREDYVLDYWGQGTLVT 
3333---------1111---------1111------------1111-------------- VSSAKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVL --------------------------------------------iiii------------ QSDLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEP ----------------------------3333---------- >IG ANTIBODY D2.3 (LIGHT C; SWP:NA; PDB:1YEJH; EMQLQQSGAELLRPGTSVKLSCKTSGYIFTSYWIHWVKQRSGQGLEWIARIYPGTGSTYY ------------2222-----------1111----------------------------- NEKFKGKATLTADKSSSTAYMQLSTLKSEDSAVYFCTRWGFIPVREDYVMDYWGQGTLVT 3333---------1111---------3333-----------3333--------------- VSSAKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVL --------------------------------------------iiii------------ QSDLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEP -----------------------------1111--------- >Putative uncharacterized ; SWP:A0A5D9; PDB:1YEJL; DIVMTQSPLTLSVTIGQPASISCKSSQSLLYSNGKTYLNWLLQRPGQSPKRLIHLVSKLD -------------2222-------------1111---------2222------------2 SGVPDRITGSGSGTDFTLKISRVEAADLGVYYCVQGTHFPYTFGGGTKLEILRADAAPTV 222--------------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----33331111---------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ------3333------------1111------------- >AT1G16640; SWP:Q9FX77; PDB:1YELA; MADTGEVQFMKPFISEKSSKSLEIPLGFNEYFPAPFPITVDLLDYSGRSWTVRMKKRGEK -------------3333-------33333333-----------1111---------!!!! 
VFLTVGWENFVKDNNLEDGKYLQFIYDRDRTFYVIIYGHNMC -----------1111--------------------------- >HYPOTHETICAL PROTEIN; SWP:Q8U2H2; PDB:1YEMA; SEVEIKFKIKLEDFLHTLNTFNPEFVRYEEQEDVYFEVPRPKLLRIRGVHNLKKYYLTFK ------------------1111--------------------------3333-------- EILDENNEEFYEVEFEIGDFEKAVEVFKRLGFKIQATIKKKRWVYKLNGVTLEVNRVEGI ---------------------------1111---------------%%%%---------- GDFVDIEVISDSPEEAKEKIWEVAKMLGLKEEDVEPRLYLELI ------------------------1111-3333----3333-- >MM1357; SWP:Q8PX65; PDB:1YEZA; MFREESRSVPVEEGEVYDVTIQDIARQGDGIARIEGFVIFVPGTKVGDEVRIKVERVLPK ---------------------------------iiii-------2222---------111 FAFASVVE 1------- >Type I restriction-modifi; SWP:Q57594; PDB:1YF2A; MFYKEENFKKTEIGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDIT -----------------------3333-----------11111111--------3333-- NSNKYLTNTKIKITEEGLNNSNAWIVPKNSVLFAMYGSIGETAINKIEVATNQAILGIIP -------------------------------------2222----------3333----- KDNILESEFLYYILAKNKNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKILTK 2222-----------------------------3333----------------------- IDEGIEIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKSEIGEIPEDWEVFEIKDIFEVK --------------------------------------1111--1111---3333----- TGTTPSTKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLNLIPKGSI -----11111111---------------iiii---------------1111----2222- IISTRAPVGYVAVLTVESTFNQGCKGLFQKNNDSVNTEFYAYYLKFKKNLLENLSGGSTF --------------------1111------------------------------------ KELSKSMLENFKIPLPPLEEQKQIAKILSSVDKSIELKKQKKEKLQRMKKKIMELLLTGK ----------------3333---------------------------------------- VRVKT ----- >DNA ADENINE METHYLASE; SWP:P04392; PDB:1YF3A; MLGAIAYTGNKQSLLPELKSHFPKYNRFVDLFCGGLSVSLNVNGPVLANDIQEPIIEMYK ------22223333---3333-----------!!!!-3333------------------- RLINVSWDDVLKVIKQYKLSKTSKEEFLKLREDYNKTRDPLLLYVLHFHGFSNMIRINYK 3333---------------1111---------------3333-------2222----111 GNFTTPFGKRTINKNSEKRFNHFKQNCDKIIFSSLHFKDVKILDGDFVYVDPPYLITVAD 1-------------------------1111-----3333--------------1111-11 
YNKFWSEDEEKDLLNLLDSLNDRGIKFGLSNVLEHHGKENTLLKEWSKKYNVKHLNKKYV 11--------------------------------------------1111---------- FNIYHSKEKNGTDEVYIFN 3333--------------- >Himalayan mistletoe ribos; SWP:Q6ITZ3; PDB:1YF8A; YERLDLDVTSQTTGEEYFRFITLLRDYVSSGSFSNEIPLLRQSGGGVEAARFVLVELTNE ---------------------------------iiii---------------------11 GGDSITAAIDVTNLYVVAYQAGSQSYFLSGPGTHLFTGTTRSSLPFNGSYPDLEQYAGHR 11--------------------------------------------------------11 KQIPLGIDQLIQSVTALRFPGNTRTQARSILILIQMISEAARFNPILWRARQYINSGASF 11--------------------1111---------------------------------- LPDVYMLELETSWGQQSTQVQQSTEGVFNNPIRLAIPGNFVTLTNVRDVIASLAIMLFVC -------------------1111---------------------3333------------ >Beta-galactoside-specific; SWP:Q6ITZ3; PDB:1YF8B; CSASEPTVRIVGRNGMNVDVRDDDFHDGNQIQLWPSKSNNDPNQLWTIKRDGTIRSNGSC ------------iiii---2222--2222-----------3333-----------iiii- LTTYGYTAGVYVMIFDCNTAVREATIWQIWGNGTIINPRSNLALAASSGIKGTTLTVQTL ------2222-----3333-3333-------------1111--------2222------- DYTLGQGWLAGNDTAPREVTIYGFNDLCMESNGGSVWVETCVSQQNDRWALYGDGSIRPE --3333-----------------%%%%----!!!!----------------1111---33 QNQDQCLTSGRDSVAGINIVSCSGGSSGQRWVFTNEGAILNLKNGLAMDVANPGLGQIII 33-------------------3333----------------------------------- YPATGKPNQMWLPVP -----3333------ >UBIQUITIN CARRIER PROTEIN; SWP:Q4Q5L3; PDB:1YF9A; SNRRREMDYMRLCNSTRKVYPSDTVAEFWVEFKGPEGTPYEDGTWMLHVQLPSDYPFKSP -----------------------1111-------2222-1111--------1111----- SIGFCNRILHPNVDERSGSVCLDVINQTWTPMYQLENIFDVFLPQLLRYPNPSDPLNVQA ---------1111-1111-----------11113333----------------------- AHLLHADRVGFDALLREHVSTHATPQKALESIPEAYRP ----------------------------11113333-- >TRANSITION STATE REGULATO; SWP:P08874; PDB:1YFBA; FMKSTGIVRKVDELGRVVIPIELRRTLGIAEKDALEIYVDDEKIILKKYKPN -----------1111----3333------2222------%%%%--------- >PHOSPHOGLYCERATE MUTASE 1; SWP:P18669; PDB:1YFKA; AYKLVLIRHGESAWNLENRFSGWYDADLSPAGHEEAKRGGQALRDAGYEFDICFTSVQKR -----------1111-----!!!!------------------------------------ 
AIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAETAAKHGEAQVKIWRRSYDVP -------------1111----3333----!!!!--------------------------- PPPMEPDHPFYSNISKDRRYADLTEDQLPSCESLKDTIARALPFWNEEIVPQIKEGKRVL ----1111-333311111111--3333-------------------------1111---- IAAHGNSLRGIVKHLEGLSEEAIMELNLPTGIPIVYELDKNLKPIKPMQFLGDEETVRKA ------------------33331111------------1111------------------ MEA --- >FUMARASE; SWP:P08417; PDB:1YFM; SFRTETDAFGEIHVPADKYWGAQTQRSFQNFKIGGARERMPLPLVHAFGVLKKSAAIVNE --------------1111--------1111-22221111-33333333------------ SLGGLDPKISKAIQQAADEVASGKLDDHFPLVVFQTGSGTQSNMNANEVISNRAIEIVHP -------------------3333-1111-------1111--------------------- NNHCNQSQSSNDTFPTVMHIAASLQIQNELIPELTNLKNALEAKSKEFDHIVKIGRTHLQ --1111------1111-----------------------------1111-------iiii DATPLTLGQEFSGYVQQVENGIQRVAHSLKTLSFLAQGGTAVGTGLNTKPGFDVKIAEQI --------------------------------------------22222222-------- SKETGLKFQTAPNRFEALAAHDAIVECSGALNTLACSLFKIAQDIRYLGSGPRCGYHELM ------------------------------------------------------------ LPENEPGSSIMPGKVNPTQNEALTQVCVQVMGNNAAITFAGSQGQFELNVFKPVMIANLL -------3333----------------------------1111-!!!!------------ NSIRLITDAAYSFRVHCVEGIKANEPRIHELLTKSLMLVTALNPKIGYDAASKVAKNAHK ----------------1111----3333-------11113333----------------- KGITLKESALELGVLTEKEFDEWVVPEHML ---------------1111-----1111-- >RECEPTOR PROTEIN TYROSINE; SWP:P18052; PDB:1YFOA; KYPPLPVDKLEEEINRRMADDNKLFREEFNALPACPIQATCEAASKEENKEKNRYVNILP -----3333-------------------------------3333-33331111-1111-- YDHSRVHLTPVEGVPDSDYINASFINGYQEKNKFIAAQGPKEETVNDFWRMIWEQNTATI 3333------2222--------------------------3333---------------- VMVTNLKERKECKCAQYWPDQGCWTYGNVRVSVEDVTVLVDYTVRKFCIQQQRLITQFHF -------iiii--------------!!!!------------------------------- TSWPDFGVPFTPIGMLKFLKKVKACNPQYAGAIVVHCSAGVGRTGTFVVIDAMLDMMHSE ------------------------------------------------------------ RKVDVYGFVSRIRAQRCQMVQTDMQYVFIYQALLEHYLY -----------1111------------------------ >CELL CYCLE ARREST PROTEIN; SWP:P26449; PDB:1YFQA; 
MQIVQIEQAPKDYISDIKIIPSKSLLLITSWDGSLTVYKFDIQAKNVDLLQSLRYKHPLL --------------------1111-----1111--------------------------- CCNFIDNTDLQIYVGTVQGEILKVDLIGSPSFQALTNNEANLGICRICKYGDDKLIAASW ---------------1111---------------------------------------11 DGLIEVIDPRNYGDGVIAVKNLNSNNTKVKNKIFTMDTNSSRLIVGMNNSQVQWFRLPLC 11-----3333!!!!-----------------------1111-----%%%%--------1 EDDNGTIEESGLKYQIRDVALLPKEQEGYACSSIDGRVAVEFFDDQGDDYNSSKRFAFRC 111-------------------!!!!------1111-------1111----1111----- HRLNLKDTNLAYPVNSIEFSPRHKFLYTAGSDGIISCWNLQTRKKIKNFAKFNEDSVVKI ---3333----------------------1111--------------------------- ACSDNILCLATSDDTFKTNAAIDQTIELNASSIYIIFDYENP --1111------3333------1111---------------- >ALANYL-TRNA SYNTHETASE; SWP:O67323; PDB:1YFSA; SLSAHEIRELFLSFFEKKGHTRVKSAPLVPENDPTLLFVNAGMVPFKNVFLGLEKRPYKR --------------------------------3333----3333--3333---------- ATSCQKCLRVSGKHNDLEQVGYTSRHHTFFEMLGNFSFGDYFKKEAIEYAWEFVTEVLKL ----------!!!!-3333----------------------------------------- PKEKLYVSVYKDDEEAYRIWNEHIGIPSERIWRLGEEDNFWQMGDVGPCGPSSEIYVDRG 3333-----1111--------3333-3333----3333---------------------3 EEYEGDERYLEIWNLVFMQYNRDENGVLTPLPHPNIDTGMGLERIASVLQGKNSNFEIDI 333!!!!---------------1111---------------------------1111111 IFPLIQFGEEVSGKKYGEKFETDVALRVIADHLRAITFAISDGVIPSNEGRGYVIRRILR 1------------------------------------------------3333------- RAMRFGYKLGIENPFLYKGVDLVVDIMKEPYPELELSREFVKGIVKGEEKRFIKTLKAGM ------1111----3333--------------3333------------------------ EYIQEVIQKALEEGRKTLSGKEVFTAYDTYGFPVDLIDEIAREKGLGIDLEGFQCELEEQ ---------------------------1111-3333-----1111--------------- RERARKHPVYSHLKELGKTSAFVGAAAL -1111---------33331111-3333- >3-HYDROXYANTHRANILATE-3,4; SWP:Q1LCS4; PDB:1YFUA; MLTYGAPFNFPRWIDEHAHLLKPPVGNRQVWQDSDFIVTVVGGPNHRTDYHDDPLEEFFY -1111-----------3333---------------------------------------- QLRGNAYLNLWVDGRRERADLKEGDIFLLPPHVRHSPQRPEAGSACLVIERQRPAGMLDG -----------%%%%------2222----------------------------2222--- 
FEWYCDACGHLVHRVEVQLKSIVTDLPPLFESFYASEDKRRCPHCGQVHPGRAA --------------------3333------------3333-------------- >HYPOXANTHINE-GUANINE PHOS; SWP:Q8R7L0; PDB:1YFZA; SPMEDIEEILITEEQLKAKVKELGEMITRDYEGKDLVLIGVLKGAIMFMSGLSRAIDLPL --1111------------------------2222--------3333------1111---- SIDFLAVSSYGSSTKSSGIVKIIKDHDIDIEGKDVLIVEDIIDSGLTLAYLRETLLGRKP -----------------------------2222--------------------------- RSLKICTILDKPERREADVKVDYCGFKIPDKFVVGYGLDYAEKYRNLPFIGVLKPELY ----------3333-------------------------iiii1111------3333- >COP ASSOCIATED PROTEIN; SWP:Q48271; PDB:1YG0A; MKATFQVPSITCNHCVDKIEKFVGEIEGVSFIDVSVEKKSVVVEFDAPATQDLIKEALLD ---------------------3333----------1111------11111111-----33 AGQEVV 33---- >GENE ACTIVATOR APHA; SWP:Q9X399; PDB:1YG2A; SLPHVILTVLSTRDATGYDITKEFSASIGYFWKASHQQVYRELNKMGEQGLVTCVLEVYS ---------------3333---11113333----------------1111---------- ITQAGRSALGEWFDQPTAHPTVRDEFSAKLMACSVQSAEPYRLQLAELVEESRKLVAHYQ -------------------------------3333------------------------- EIEAAYYANPAVLDKQQRLERLTLRRNLLVRQAWIQWADEVLAELNAMA --------3333---------------------------------3333 >ASPARTIC PROTEASE BLA G 2; SWP:P54958; PDB:1YG9A; KLVHVFINTQYAGITKIGNQNFLTVFDSTSCNVVVASQECVGGACVCP --------------------------1111------1111-!!!!--- >Hypothetical 37.9 kDa pro; SWP:P53757; PDB:1YGAA; DNKYGVITIGDEKKFQATIAPLGATLVDLKVNGQSVVQGYSNVQDYLTDGNMMGATVGRY -1111-----1111----------------iiii-------33331111----------- ANRIAKGVFSLDDGPHKLTVNNCGNTNHSSISSLNLKQYKASPVENPSKGVYVVEFKLLD ----------1111-------iiii-%%%%--3333-----------2222--------- DHTQPNPNEFPGDLEVTVKYTLNVAEMTLDMEYQAQLVRGDATPINMTNHSYFNLNKVKS -----------------------1111----------------------------33331 EKSIRGTEVKVCSNKSLEVTEGALLPTGKIIERNIATFDSTKPTVLHEDTPVFDCTFIID 1112222-----------------------------1111------1111---------1 ANKDLKTTDSVSVNKLVPVFKAYHPESHIKFEVSTTEPTVHLYTGDNLCGKFVPRSGFAV 111------1111-------------------------------1111----2222---- QQGRYVDAINRDEWRGCVLLKRGEVYTSKTQYKFDI ------3333-11111111-2222------------ >LIPOXYGENASE-1; 
SWP:P08170; PDB:1YGE; MFSAGHKIKGTVVLMPKNELEVNPDGSAVDNLNAFLGRSVSLQLISATKADAHGKGKVGK ---------------1111----------------!!!!-----------1111------ DTFLEGINTSLPTLGAGESAFNIHFEWDGSMGIPGAFYIKNYMQVEFFLKSLTLEAISNQ --------------%%%%------------------------------------------ GTIRFVCNSWVYNTKLYKSVRIFFANHTYVPSETPAPLVSYREEELKSLRGNGTGERKEY ------------3333-------------3333-3333-------------------111 DRIYDYDVYNDLGNPDKSEKLARPVLGGSSTFPYPRRGRTGRGPTVTDPNTEKQGEVFYV 1------------33333333-------3333------------3333------------ PRDENLGHLKSKDALEIGTKSLSQIVQPAFESAFDLKSTPIEFHSFQDVHDLYEGGIKLP 1111-----33333333----------------1111-------3333-3333------3 RDVISTIIPLPVIKELYRTDGQHILKFPQPHVVQVSQSAWMTDEEFAREMIAGVNPCVIR 333---1111-3333--------------3333--1111--------------------- GLEEFPPKSNLDPAIYGDQSSKITADSLDLDGYTMDEALGSRRLFMLDYHDIFMPYVRQI -----------3333--------3333--iiii----------------33331111--1 NQLNSAKTYATRTILFLREDGTLKPVAIELSLPHSAGDLSAAVSQVVLPAKEGVESTIWL 111--------------1111-------------2222--------------3333---- LAKAYVIVNDSCYHQLMSHWLNTHAAMEPFVIATHRHLSVLHPIYKLLTPHYRNNMNINA --------------------------------------1111-----3333--------- LARQSLINANGIIETTFLPSKYSVEMSSAVYKNWVFTDQALPADLIKRGVAIKDPSTPHG -------2222-----3333--3333----11113333---------------1111--- VRLLIEDYPYAADGLEIWAAIKTWVQEYVPLYYARDDDVKNDSELQHWWKEAVEKGHGDL ----------------------------------3333------------------3333 KDKPWWPKLQTLEDLVEVCLIIIWIASALHAAVNFGQYPYGGLIMNRPTASRRLLPEKGT --1111---------------------------1111-----3333----------1111 PEYEEMINNHEKAYLRTITSKLPTLISLSVIEILSTHASDEVYLGQRDNPHWTSDSKALQ --------------------------------1111-1111-2222--1111-------- AFQKFGNKLKEIEEKLVRRNNDPSLQGNRLGPVQLPYTLLYPSSEEGLTFRGIPNSISI ---------------------3333-----1111---1111------------------ >TRANSCRIPTIONAL ACTIVATOR; SWP:Q03330; PDB:1YGHA; KIEFRVVNNDNTKENMMVLTGLKNIFQKQLPKMPKEYIARLVYDRSHLSMAVIRKPLTVV -----------------------------1111----------3333------------- GGITYRPFDKREFAEIVFCAISSTEQVRGYGAHLMNHLKDYVRNTSNIKYFLTYADNYAI 
-------3333----------1111-2222-------------------------1111- GYFKKQGFTKEITLDKSIWMGYIKDYEGGTLMQCSMLPRIRYLD ---1111-------33332222--1111---------------- >HYPOTHETICAL PROTEIN BSU3; SWP:O05247; PDB:1YGMA; CTFFEKHHRKWDILLEKSTGVMEAMKVTSEEKEQLSTAIDRMNEGLDAFIQLYNESEIDE ------1111-------------------1111-1111---------------------- PLIQLDDDTAELMKQARDMYGQEKLNEKLNTIIKQILSISVSEEGEKELVPR -----3333---------------3333------------------------ >CD45 PROTEIN TYROSINE PHO; SWP:P08575; PDB:1YGRA; EKQLNVEPIHADILLETYKRKIADEGRPFLAEFQSIPRVFSKFPIKEARKPFNQNKNRYV ----------------------%%%%------1111----------3333--3333--33 DILPYDYNRVELSEINGDAGSNYINASYIDGFKEPRKYIAAQGPRDETVDDFWRIWEQKA 33------------------------------------------11113333---1111- TVIVVTRCEEGNRNKCAEYWPSEEGTRAFGDVVVKINQHKRCPDYIIQKLNIVNKKEKAT ---------%%%%---------------!!!!---------------------------- GREVTHIQFTSWPDHGVPEDPHLLLKLRRRVNAFSNFFSGPIVVHSSAGVGRTGTYIGID ------------2222---3333-------3333-1111--------------------- ALEGLEAENKVDVYGYVVKLRRQRCLVQVEAQYILIHQALVEYNQFGETEVNLSELHPYL -------------------1111----------------------------3333----- HNKKRDPPSEPSPLEAEFQRLPSYRSWRTQHIGNQEENKSKNRNSNVIPYDYNRVPLKSK ------1111-3333--3333---------33333333-----------1111------- YINASFISYWKPEVIAAQGPLKETIGDFWQIFQRKVKVIVLTELKHGDQEICAQYWGEGK --------------------1111-------1111------------------------- QTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQYTNWSVEQLPAEPKELISI --!!!!----------------------------------------------3333---- QVVKQKLPQKNHKSTPLLIHCRDGSQQTGIFCALLNLLESAETEEVVDIFQVVKALRKAR -3333--------------------------------------------------3333- LGVSTFEQYQFLYDVIASTYP --------------------- >SMAD4; SWP:Q13485; PDB:1YGS; APEYWCSIAYFEMDVQVGETFKVPSSCPIVTVDGYVDPSGGDRFCLGQLSNVHRTEAIER -----------!!!!--------1111------------------1111-1111------ ARLHIGKGVQLECKGEGDVWVRCLSDHAVFVQSYYLDREAGRAPGDAVHKIYPSAYIKVF 3333!!!!-----------------------------1111-2222-----2222----- DLRQCHRQMQQQAATAQAVDDLRRLCILRMSFVKGWGPDYPRQSIKETPCWIEIHLHRAL 
------------------3333-1111----------------1111------------- QLLDEVLHTM -----1111- >CYTOPLASMIC DYNEIN LIGHT ; SWP:Q94524; PDB:1YGTA; SQFIVDDVSKTIKEAIETTIGGNAYQHDKVNNWTGQVVENCLTVLTKEQKPYKYIVTAMI -------------------2222--3333------------------------------- MQKNGAGLHTASSCYWNNDTDGSCTVRWENKTMYCIVSVFGLAV ----------------3333---------1111----------- >D-3-PHOSPHOGLYCERATE DEHY; SWP:P0A544; PDB:1YGYA; SLPVVLIADKLAPSTVAALGDQVEVRWVDGPDRDKLLAAVPEADALLVRSATTVDAEVLA -----------33333333---------33333333---1111----------------- AAPKLKIVARAGVGLDNVDVDAATARGVLVVNAPTSNIHSAAEHALALLLAASRQIPAAD -1111---------1111-----1111-----1111------------------------ ASLREHTWKRSSFSGTEIFGKTVGVVGLGRIGQLVAQRIAAFGAYVVAYDPYVSPARAAQ --1111--3333-----2222----------------------------1111------- LGIELLSLDDLLARADFISVHLPKTPETAGLIDKEALAKTKPGVIIVNAARGGLVDEAAL ------------------------3333----333311112222---------------- ADAITGGHVRAAGLDVFATEPCTDSPLFELAQVVVTPHLGASTAEAQDRAGTDVAESVRL ------------------------3333-1111--------------------------- ALAGEFVPDAVNVGGGVVNEEVAPWLDLVRKLGVLAGVLSDELPVSLSVQVRGELAAEEV ------1111--------33331111----------------------------1111-3 EVLRLSALRGLFSAVIEDAVTFVNAPALAAERGVTAEICKASESPNHRSVVDVRAVGADG 333--------3333-3333------------------------------------1111 SVVTVSGTLYGPQLSQKIVQINGRHFDLRAQGINLIIHYVDRPGALGKIGTLLGTAGVNI ----------1111------iiii-----------------2222--------------- QAAQLSEDAEGPGATILLRLDQDVPDDVRTAIAAAVDAYKLEVVDLS ----------------------------------------------- >HSPC150 protein similar t; SWP:NA; PDB:1YH2A; SMQRASRLKRELHMLATEPPPGITCWQDKDQMDDLRAQILGGANTPYEKGVFKLEVIIPE -------------------2222-------1111----------1111----------11 RYPFEPPQIRFLTPIYHPNIDSAGRICLDVLKLPPKGAWRPSLNIATVLTSIQLLMSEPN 11--------------11111111---1111--------1111----------------3 PDDPLMADISSEFKYNKPAFLKNARQWTEKHARQK 333-------------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P62256; PDB:1YH6A; PSPGKRRMDTDVVKLIESKHEVTILGGLNEFVVKFYGPQGTPYEGGVWKVRVDLPDKYPF 
--------------------------1111-------2222-2222--------1111-- KSPSIGFMNKIFHPNIDEASGTVCLDVINQTWTALYDLTNIFESFLPQLLAYPNPIDPLN ------------1111-1111-----------11113333-------------------- GDAAAMYLHRPEEYKQKIKEYIQKYATEEALK --------------------------3333-- >UPF0269 PROTEIN YGGX; SWP:P52065; PDB:1YHDA; MGSRTIFCTFLQREAEGQDFQLYPGELGKRIYNEISKEAWAQWQHKQTMLINEKKLNMMN -------------------------3333------3333-----------------3333 AEHRKLLEQEMVNFLFEGKEVHIEGYTPEDKK 3333---------------------------- >HYPOTHETICAL PROTEIN SPY1; SWP:NA; PDB:1YHFA; ASYINNIEHAKVLDLTQEVIEQDQLSRTLVQRQDLGITVFSLDKGQEIGRHSSPGDAVTI -------------3333---2222-------1111-------2222-------------- LSGLAEITIDQETYRVAEGQTIVPAGIPHALYAVEAFQLLVVVKPEA ----------------2222---2222-------------------- >FARNESYL PYROPHOSPHATE SY; SWP:Q8WS26; PDB:1YHLA; MASMERFLSVYDEVQAFLLDQLQSKYEIDPNRARYLRIMMDTTCLGGKYFRGMTVVNVAE ------------------------------------------------3333-------- GFLAVTQHDEATKERILHDACVGGWMIEFLQAHYLVEDDIMDGSVMRRGKPCWYRFPGVT --1111----------------------------------------iiii-33331111- TQCAINDGIILKSWTQIMAWHYFADRPFLKDLLCLFQKVDYATAVGQMYDVTSMCDSNKL 3333------------------1111-----------------------1111--3333- DPEVAQPMTTDFAEFTPAIYKRIVKYKTTFYTYLLPLVMGLFVSEAAASVEMNLVERVAH 3333-------1111--------------------------11113333-3333------ LIGEYFQVQDDVMDCFTPPEQLGKVGTDIEDAKCSWLAVTFLGKANAAQVAEFKANYGDK -----------------3333-----3333----------1111---------------- DPAKVAVVKRLYSEANLQADFAAYEAEVVREVESLIEQLKVKSPTFAESVAVVWEKTHKR ------------------------------------------------------------ KK -- >Rab-interacting lysosomal; SWP:Q96NA2; PDB:1YHNB; SREEFEQILQERNELKAKVFLLKEELAYFQRELLTDHRVPSLLLEAMKVAVRKQRKKIKA -1111-----------------------------1111---------------------- KMLGT ----- >CALCIUM-DEPENDENT CELL AD; SWP:P54657; PDB:1YHPA; SVDANKVKFFFGKNCTGESFEYNKGETVRFNNGDKWNDKFMSCLVGSNVRCNIWEHNEID --1111------------------------3333-------------------------- TPTPGKFQELAQGSTNNDLTSINGLSKFQVLPGAFQWAVDVKIVNKVNSTAGSYEMTITP 
------------------3333-------------------------------------- YQVDKVACKDGDDFVQLPIPKLTPPDSEIVSHLTVRQTHTPYDYVVNGSVYFKYSPTTGQ -----------------------1111--------------------------------- VTVIKKDETFPKNMTVTQDDNTSFIFNLNSEK -----3333----------2222--------- >DSPB; SWP:Q840G9; PDB:1YHTA; TKQTGLMLDIARHFYSPEVIKSFIDTISLSGGNFLHLHFSDHENYAIESHLLNQRAENAV ---------------------------1111---------3333-----1111-3333-- QGKDGIYINPYTGKPFLSYRQLDDIKAYAKAKGIELIPELDSPNHMTAIFKLVQKDRGVK -1111------------------------1111--------------------------- YLQGLKSRQVDDEIDITNADSITFMQSLMSEVIDIFGDTSQHFHIGGDEFGYSVESNHEF --1111--------1111-----------------!!!!-----------1111-3333- ITYANKLSYFLEKKGLKTRMWNDGLIKNTFEQINPNIEITYWSYDGDTQDKNEAAERRDM -----------1111------1111333311111111-------%%%%------------ RVSLPELLAKGFTVLNYNSYYLYIVPKASPTFSQDAAFAAKDVIKNWDLGVWDGRNTKNR -------1111------3333-------1111---------------1111-!!!!1111 VQNTHEIAGAALSIWGEDAKALKDETIQKNTKSLLEAVIHKTNG --3333---------1111------------------------- >HEMOGLOBIN A1 CHAIN; SWP:P80592; PDB:1YHUA; ACAMLERAKVKDEWAKAYGIGAARSKFGDALWRNVFNYAPNARDIFESVNSKDMASPEFK --------------------3333--------------3333---33333333------- AHIARVLGGLDRVISMLDNQATLDADLAHLKSQHDPRTIDPVNFVVFRKALIATVAGTFG ------------3333-----------------3333--3333----------------3 VCFDVPAWQGCYNIIAKGITGSDAA 333---------------------- >Giant hemoglobins B chain; SWP:P80592; PDB:1YHUB; DYVCGPLQRLKVKRQWAEAYGSGNSREEFGHFIWSHVFQHSPAARDMFKRVRGDNIHTPA ----------------------------------------33333333---3333----- FRAHATRVLGGLDMCIALLDDEPVLNTQLAHLAKQHETRGVEAAHYDTVNHAVMMGVENV -----------------1111--------------------3333--------------- IGSEVFDQDAWKPCLNVITNGIQG ------1111-------------- >HEMOGLOBIN A1 CHAIN; SWP:NA; PDB:1YHUC; AANCADAAAAIVQAQWEDVWSAAAAAASRVSAGEEVFAALFKMVPAAKNLFTRVNVADIN ----------------1111----3333------------33331111--33333333-- SPEFQGHVVRVMGGLDILINALDDIPTLESMLDHLAGQHAVRDGVTGAGFQLMATVLMES -----------------------3333-----------3333---1111----------3 LPQVVEGFNPDAWASCLAGIAAAISSAL 
333------------------------- >HEMOGLOBIN A1 CHAIN; SWP:NA; PDB:1YHUD; AASCTTEDRREMQLMWGNVWSAQFTGRRIAIAQAVFKDLFANVPDAVGLFGAVKGDEVNS --------------------------------------------3333-33333333--- NEFKAHCIRVVNGLDSSIGLLSDPATLNEQLSHLATQHKARSGVTKGGFSAIAQSFLRVM 3333---------------------------------1111---3333----------33 PQVASCFNPDAWSRCFNRITTGMTEPLPA 33--------------------------- >SERINE/THREONINE-PROTEIN ; SWP:Q13153; PDB:1YHVA; SDEEILEKLRSIVSVGDPKKKYTRFEKIGQGASGTVYTAMDVATGQEVAIRQMNLQQQPK -----------------------------------------------------3333--- KELIINEILVMRENKNPNIVNYLDSYLVGDELWVVMEYLAGGSLTDVVTETCMDEGQIAA ---------------1111--------!!!!-----------3333-------------- VCRECLQALEFLHSNQVIHRDIKSDNILLGMDGSVKLTDFGFCAQITPEQSKRSEMVGTP ----------------------3333---1111-------------------------33 YWMAPEVVTRKAYGPKVDIWSLGIMAIEMIEGEPPYLNENPLRALYLIATNGTPELQNPE 33-3333--------------------------2222--------------------333 KLSAIFRDFLNRCLDMDVEKRGSAKELLQHQFLKIAKPLSSLTPLIAAAKEAT 3---------------3333--3333---3333-------------------- >Tryptophanyl-tRNA synthet; SWP:Q9RVD6; PDB:1YI8B; ARPRVLTGDRPTGALHLGHLAGSLQNRVRLQDEAELFVLLADVQALTDHFDRPEQVRENV ---------------33333333------------------3333---1111-------- LAVALDYLAAGLDPQKTTCVVQSAVPELAELTVYFLNLVTVSHLRQNPTVKAEIAQKGYG -------1111-1111----3333---------3333-----------------1111-- ERVPAGFFVYPVSQAADIAAFGATLVPVGDDQLPMLEQTREIVRRFNALYAPVLAEPQAQ ------------------1111------3333---------------------------- LSRVPRLPGLDGQAKMSKSLGNAIALGDSADEVARKVMGMYTDPGHLRASDPGRVEGNPV --------3333----3333----11113333----1111--------------222233 FTFLDAFDPDPARVQALKDQYRAGGLGDVKVKKHLIDVLNGVLAPIRTRRAEYERDPDAV 33-------3333--33331111------------------------------------- LRFVTEGTARGREVAAQTLGQVRRAMRLFGH ---------------------------2222 >PEPTIDYL-GLYCINE ALPHA-AM; SWP:P14925; PDB:1YI9A; CLGTIGPVTPLDASDFALDIRMPGVTPKESDTYFCMSMRLPVDEEAFVIDFKPRASMDTV ------------------------------------------------------------ HHMLLFGCNMPSSTGSYWFCDEGTCTDKANILYAWARNAPPTRLPKGVGFRVGGETGSKY 
------------------1111-------------2222-----2222------------ FVLQVHYGDISAFRDNHKDCSGVSVHLTRVPQPLIAGMYLMMSVDTVIPPGEKVVNADIS ------------------------------------------------------------ CQYKMYPMHVFAYRVHTHHLGKVVSGYRVRNGQWTLIGRQNPQLPQAFYPVEHPVDVTFG -----------------------------iiii-------1111-------------222 DILAARCVFTGEEICNLYIMYYMEAKYALSFMTCTKNVAPDMFRTIPAEANIPIP 2----------------------3333-----------333311113333----- >BETA-1,4-XYLOSIDASE; SWP:P94489; PDB:1YIFA; KITNPVLKGFNPDPSICRAGEDYYIAVSTFEWFPGVQIHHSKDLVNWHLVAHPLQRVSQL ----------------------------!!!!-----------------------3333- DMKGNPNSGGVWAPCLSYSDGKFWLIYTDVKVVDGAWKDCHNYLVTCETINGDWSEPIKL -22222222---------%%%%-------------------------------------- NSSGFDASLFHDTDGKKYLLNMLWDHRIDRHSFGGIVIQEYSDKEQKLIGKPKVIFEGTD -----------1111-----------1111------------1111------------33 RKLTEAPHLYHIGNYYYLLTAEGGTRYEHAATIARSANIEGPYEVHPDNPILTSWHDPGN 33---------!!!!----------1111--------3333------------1111--- PLQKCGHASIVQTHTDEWYLAHLTGRPIHPDDDSIFQQRGYCPLGRETAIQKLYWKDEWP ------------1111-----------------3333----1111----------%%%%- YVVGGKEGSLEVDAPSIPETIFEATYPEVDEFEDSTLNINFQTLRIPFTNELGSLTQAPN -2222--------------------------------1111-------3333-------- HLRLFGHESLTSTFTQAFVARRWQSLHFEAETAVEFYPENFQQAAGLVNYYNTENWTALQ --------1111---------------------------1111--------1111----- VTHDEELGRILELTICDNFSFSQPLNNKIVIPREVKYVYLRVNIEKDKYYYFYSFNKEDW ----------------%%%%-----------1111---------!!!!------------ HKIDIALESKKLSDDYIRGGGFFTGAFVGMQCQDTGGNHIPADFRYFRYKEK ------------1111------------------------------------ >ANNEXIN A5; SWP:P17153; PDB:1YIIA; AKYTRGTVTAFSPFDARADAEALRKAMKGMGTDEETILKILTSRNNAQRQEIASAFKTLF ------------------------------------------------------------ GRDLVDDLKSELTGKFETLMVSLMRPARIFDAHALKHAIKGAGTNEKVLTEILASRTPAE -------------------------3333------------------------------- VQNIKQVYMQEYEANLEDKITGETSGHFQRLLVVLLQANRDPDGRVDEALVEKDAQVLFR ----------------------------------3333---------------------- 
AGELKWGTDEETFITILGTRSVSHLRRVFDKYMTISGFQIEETIDRETSGDLEKLLLAVV -1111---------------------------------3333------------------ KCIRSVPAYFAETLYYSMKGAGTDDDTLIRVMVSRSEIDLLDIRHEFRKNFAKSLYQMIQ ----------------------------------1111---------------------- KDTSGDYRKALLLLCG ---------------- >RESPONSE REGULATORY PROTE; SWP:O30989; PDB:1YIOA; AKPTVFVVDDDMSVREGLRNLLRSAGFEVETFDCASTFLEHRRPEQHGCLVLDMRMPGMS ----------3333----------------------------1111-------------- GIELQEQLTAISDGIPIVFITAHGDIPMTVRAMKAGAIEFLPKPFEEQALLDAIEQGLQL --------1111----------3333---------------------------------- NAERRQARETQDQLEQLFSSLTGREQQVLQLTIRGLMNKQIAGELGIAEVTVKVHRHNIM -----------------1111---------1111-------------------------- QKLNVRSLANLVHLVEKY ------------------ >QUINOHEMOPROTEIN ALCOHOL ; SWP:Q4W6G0; PDB:1YIQA; ADIPANVDGARIIAADKEPGNWMSTGRTYDEQRYSPLKQISDQNVGQLGLAWSYKLDLDR -------------33331111--11111111---------11111111------------ GVEATPIVVDGVMYTTGPFSVVYALDARDGRLIWKYDPQSDRHRAGEACCDAVNRGVAVW -----------------%%%%-------------------3333---------------- KGKVYVGVLDGRLEAIDAKTGQRAWSVDTRADHKRSYTITGAPRVVNGKVVIGNGGAEFG -------1111--------------------1111----------%%%%------1111- VRGYVTAYDAETGKEAWRFYTVPGDPKLPPEGKGMEIAAKTWFGDAYVEQGGGGTAWDSF ------------------------3333----------1111---3333----------- AYDPELNLLYIGVGNGSLWDPKWRSQAKGDNLFLSSIVAVNADTGEYVWHYQTTPGDAWD ---1111------------3333-%%%%-------------------------------- YTATQHMILAELPIDGKPRKVLMQAPKNGFFYVIDRATGELLSAKGIVPQSWTKGMDMKT -------------iiii--------3333------------------------------- GRPILDEENAAYWKNGKRNLVTPAFWGAHDWQPMSYNPDTGLVYIPAHIMSAYYEHIPEA ------11111111---------1111---------1111-------------------- PKRNPFKSMYQLGLRTGMMPEGAEGLLEMAKSWSGKLIAWDPVKQQAAWEVPYVTIFNGG -----------------------------1111--------------------------- TLSTAGNLVFEGSADGRVIAYAADTGEKLWEQPAASGVMAAPVTYSVDGEQYVTFMAGWG ----------------------------------------------iiii---------- GAFSTFAGALSLRAGVQPYAQVLTYKLGGTAKLQEPAPRPDTPKPPALSNDTASIEAGAK 
-------33333333--------------------------------------------- LYDGYCSQCHGIHAVSGGVLPDLRKLTPEKHQMFLGILFGGRVPDGMPSFADAFTPEQVD -----------%%%%------1111----------------3333--------------- QIHQYLIKRAHDLHQEGDTWKQFS -----------------3333--- >NICOTINATE PHOSPHORIBOSYL; SWP:Q9HW26; PDB:1YIRA; LAESAFSERIVQNLLDTDFYKLTMMQAVLHNYPNAEVEWEFRCRNQEDLRLYLPAIREQL ------------1111-3333----------1111-------1111--1111-------- EYLAGLAISDEQLAFLERIPFLAPDFIRFLGLFRFNPRYVQTGIENDEFFLRLKGPWLHV --1111------------1111-------------3333-----%%%%-------33333 ILFEVPLLAMISEVRNRARYPAATVEQARERLQEKFDWLRREASAEELAGFKMADFGTRR 333----------------11113333----------------3333------------- RFSYRVHEAVVSGLKEDFPGCFVGTSNVHLARKLDLKPLGTMAHEWLMAHQQLGPRLIDS -------------------------------1111-------3333---------3333- QSAALDCWVREYRGLLGIALTDCITTDAFLRDFDLYFAKLFDGLRHDSGDPLLWAEKTIA -----------iiii--------------1111--------------------------- HYLKLGIDPLTKTLVFSDGLDLPRALKIYRALQGRINVSFGIGTHFTCDLPGVEPMNIVV --1111-1111--------------------2222-------1111---2222------- KMSACNGHPVAKISDTPPDFIHYLKHVFQV ----iiii---------3333--------- >ADENYLOSUCCINATE LYASE; SWP:Q21774; PDB:1YISA; ASEDKFESVLSTRYCKNSPLVSILSETNKATLWRQLWIWLAEAEKELGLKQVTQDAIDEK -------3333---11113333----------------------11113333-------- SNRDVFDWPFIRSEERKLKHDVAHNHAFGKLCPTAAGIIHLGATSCFVQDNADLIAYRDS -1111--------------------------3333111122223333------------- IDHILKRFATVIDRLAAFSLKNKEVVTVGRTHYQTASLVTVGKRGVLWAQELLAFQSLSE -------------------1111-------%%%%-------------------------- FRDKRFRGIKGATGTQDSFLTLFAGDESKVEALDELVTKKANFSNRFLITGQTYSRQQDS --------------------1111------------------------------------ QLVFSLSLLGAAAKKVCTDIRVLQAFGELLEPKKNPKSERCCALSRKLINAPQEALTILA ----------------------------------------------------------11 DQGLERTLDDSAGRRLIPDVLLTAEALLTTLQNIFEGLSVQTDNVKKIVEDEIAFLGLEK 11!!!!----3333---------------------------------------------- ALQTADPFFDSVRDRVVGLVNNPINFTGRCVSQTESFIAKELKPTIDKYLD --------1111---------3333-!!!!---------------3333-- >ITCHY E3 UBIQUITIN PROTEI; 
SWP:Q8C863; PDB:1YIUA; GAMGPLPPGWEKRTDSNGRVYFVNHNTRITQWEDPRS --------------1111------------------- >MYELIN P2 PROTEIN; SWP:P02690; PDB:1YIVA; SNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDVITIRTESGFKNT 3333----------------1111--------------------!!!!------1111-- EISFKLGQEFDETTADNRKAKSTVTLAAGALNQVQKWNGNETTIKRKLVDGKMVVECKMA ----2222-----1111-------------------!!!!------------------!! SVVCTRIYEKV !!--------- >DEOXYRIBONUCLEASE YCFH; SWP:P0AFQ7; PDB:1YIXA; MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPSYLHMRDLVGERDNV --------1111------------------------------------------------ VFSCGVHPLNQNDPYDVEDLRRLAAEEGVVALGETGLDYYYTPETKVRQQESFIHHIQIG ------1111-----3333----------------------------------------- RELNKPVIVHTRDARADTLAILREEKVTDCGGVLHCFTEDRETAGKLLDLGFYISFSGIV -------------------------3333------------------1111-----3333 TFRNAEQLRDAARYVPLDRLLVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVE -1111----------1111--------------------3333----------------- ELAQVTTDNFARLFHIDASRLQSIR -----------1111-3333----- >KYNURENINE AMINOTRANSFERA; SWP:Q95VY4; PDB:1YIZA; KFDLPKRYQGSTKSVWVEYIQLAAQYKPLNLGQGFPDYHAPKYALNALAAAANSPDPLAN ----------1111--------------------------3333---------------- QYTRGFGHPRLVQALSKLYSQLVDRTINPMTEVLVTVGAYEALYATIQGHVDEGDEVIII -------3333----------------------------------------2222----- EPFFDCYEPMVKAAGGIPRFIPLKPNKTGGTISSADWVLDNNELEALFNEKTKMIIINTP ---1111-------------------------3333--------33331111-------- HNPLGKVMDRAELEVVANLCKKWNVLCVSDEVYEHMVFEPFEHIRICTLPGMWERTITIG ------------------------------1111----------33332222-------- SAGTFSLTGWKIGWAYGPEALLKNLQMVHQNCVYTCATPIQEAIAVGFETELKRLKSPEC ------1111-------3333------3333----------------------1111--3 YFNSISGELMAKRDYMASFLAEVGMNPTVPQGGYFMVADWSSLDSKVDLTQETDARKDYR 333--------------------------------------------------------- FTKWMTKSVGLQGIPPSAFYSEPNKHLGEDFVRYCFFKKDENLQKAAEILRKWKGSS ----------------11113333-1111---------------------------- >UBIQUITIN; SWP:P68198; PDB:1YJ1A; LQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEIPPDQQRLIFAGKQLEDGRTLSDYNI 
------1111-------1111--------------1111----iiii--11113333--- QKESTLHLVL 2222------ >5' polynucleotide kinase-; SWP:Q9JLV6; PDB:1YJ5A; LGWESLKKLLVFTASGVKPQGKVAAFDLDGTLITTRSGKVFPTSPSDWRILYPEIPKKLQ -------------2222----------2222---1111-----1111----1111----- ELAAEGYKLVIFTNQGIGRGKLPAEVFKGKVEAVLEKLGVPFQVLVATHAGLNRKPVSGW --3333----------1111------------------------------3333------ DHLQEQANEGIPISVEDSVFVGDAAGRLANWAPGRKKKDFSCADRLFALNVGLPFATPEE -------------3333------------------------------------------- FFLKWPAARFELPAFDPRTISSAGPLYLPESSSLLSPNPEVVVAVGFPGAGKSTFIQEHL ---------------3333---------3333--------------2222---------- VSAGYVHVNRDTLGSWQRCVSSCQAALRQGKRVVIDNTNPDVPSRARYIQCAKDAGVPCR 1111----3333--------------1111------------------------------ CFNFCATIEQARHNNRFRETDPSHAPVSDVFSYRKQFEPPTLAEGFLEILEIPFRLQEHL --------------------1111----------------3333------------1111 DPALQRLYRQFSEG -------------- >ESCJ; SWP:Q8VQD3; PDB:1YJ7A; MKEQLYTGLTEKEANQMQALLLSNDVNVSKEMDKSGNMTLSVAAADFVRAITILNNNGFP ---------------------1111-------1111------3333--------1111-- KKKFADIEVIFPSPSQENAKINYLKEQDIERLLSKIPGVIDCSVSLNVSSAAVLVISSPE -----3333--------------------------2222------------------111 VNLAPSVIQIKNLVKNSVDDLKLENISVVIKSSS 1-3333--------1111---3333--------- >GLYCEROL-3-PHOSPHATE DEHY; SWP:Q8I5P5; PDB:1YJ8A; YRNLFDKLKDGPLKISILGSGNWASAISKVVGTNAKNNYLFENEVRMWIRDEFERMVDII ------3333---------------------------3333------------------- NNKHENTKYLKGVPLPHNIVAHSDLASVINDADLLIFIVPCQYLESVLASIKEIKIASHA ---------2222--1111----3333-2222-------3333-------------1111 KAISLTKGFIVKKNQMKLCSNYISDFLNIPCSALSGANIAMDVAMENFSEATIGGNDKDS -----------%%%%-----------------------33331111----------3333 LVIWQRVFDLPYFKINCVNETIEVEICGALKNIITLACGFCDGLNLPTNSKSAIIRNGIN ---------1111----------------------------1111--------------- EMILFGKVFFQKFNENILLESCGFADIITSFLAGRNAKCSAEFIKSTPKKTWEELENEIL -------------3333--3333----------------------!!!!----------i KGQKLQGTVTLKYVYHMIKEKNMTNEFPLFTVLHKISFENEDPSSLLKTFMNNKINQ 
iii---------------11113333--------------------3333------- >T-cell-specific surface g; SWP:P10747; PDB:1YJDC; NKILVKQSPMLVAYDNAVNLSCKYSYNLFSREFRASLHKGLDSAVEVCVVYGNYSQQLQV -------------%%%%----------------------1111----------------- YSKTGFNCDGKLGNESVTFYLQNLYVNQTDIYFCKIEVMYPPPYLDNEKSNGTIIHVK ------------------------1111------------------------------ >T-cell-specific surface g; SWP:P10747; PDB:1YJDH; VQLQQSGPELVKPGTSVRISCEASGYTFTSYYIHWVKQRPGQGLEWIGCIYPGNVNTNYN --------------------------1111--------2222-----------------1 EKFKDKATLIVDTSSNTAYMQLSRMTSEDSAVYFCTRSHYGLDWNFDVWGAGTTVTVSSA 111----------------------1111------------------------------- KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL -----------------2222------------------%%%%-------------iiii YTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIV ---------1111-----------3333--------- >ORPHAN NUCLEAR RECEPTOR N; SWP:P22829; PDB:1YJEA; NLLTSLIRAHLDSGPNTAKLDYSKFQELVLPRFGKEDAGDVQQFYDLLSGSLDVIRKWAE --3333---3333--3333--1111---------------------------------11 KIPGFIELSPGDQDLLLESAFLELFILRLAYRSKPGEGKLIFCSGLVLHRLQCARGFGDW 11--3333--------------------------1111---1111---3333-----333 IDNILAFSRSLHSLGVDVPAFACLSALVLITDRHGLQDPRRVEELQNRIASCLKEHMAAV 3-----------------------3333---------3333------------------- SCLSRLLGKLPELRTLCTQGLQRIFCLKLEDLVPPPPIVDKIFMDT ---------------------------------------------- >SURFACE PROTEIN VSPA; SWP:O34000; PDB:1YJGA; TKNITDAVAFAKSVKDVHTLVKSIDELAKAIGKKIGANGLETDADKNAKLISGAYSVISA ---------------------------1111----1111--------------------- VDTKLASLEKKVGISDDLKGKITTVKNASTSFLTKAKSKTADLGKDDVKDADAKTAIDIA -------1111----------------------------------------------111 DTGAKDKGAEELIKLNTAIDALLTSAEAAVTAAINAL 1------------------------------------ >POLYNUCLEOTIDE 5'-HYDROXY; SWP:Q9JLV6; PDB:1YJMA; LGSRGRLWLQSPTGGPPPIFLPSDGQALVLGRGPLTQVTDRKCSRNQVELIADPESRTVA -----------2222-----------------3333---3333----------1111--- VKQLGVNPSTVGVHELKPGLSGSLSLGDVLYLVNGLYPLTLRWEELS ----------!!!!--2222----2222----%%%%----------- 
>COPPER-TRANSPORTING ATPAS; SWP:Q04656; PDB:1YJRA; MGDGVLELVVRGMTCASCVHKIESSLTKHRGILYCSVALATNKAHIKYDPEIIGPRDIIH --------------------------------------1111------------------ TIESLGFEPSLVKIE --------------- >HYPOTHETICAL PROTEIN RV13; SWP:P64819; PDB:1YK3A; ADDALVRLARERFDLPDQVRRLARPPVPSLEPPYGLRVAQLTDAEMLAEWMNRPHLAAAW -------1111----3333--------------------1111----------3333--- EYDWPASRWRQHLNAQLEGTYSLPLIGSWHGTDGGYLELYWAAKDLISHYYDADPYDLGL ----3333-------1111---------%%%%--------3333--1111---1111--- HAAIADLSKVNRGFGPLLLPRIVASVFANEPRCRRIMFDPDHRNTATRRLCEWAGCKFLG -------3333--3333------------3333-------1111-------1111----- EHDTTNRRMALYALEAPT ------------------ >RUBREDOXIN; SWP:Q9V099; PDB:1YK4A; AKLSCKICGYIYDEDEGDPDNGISPGTKFEDLPDDWVCPLCGAPKSEFERIE ------------3333--1111-22223333-1111-------3333----- >ADENYLATE CYCLASE; SWP:O30820; PDB:1YK9A; DKYDEASVLFADIVGFTERASSTAPADLVRFLDRLYSAFDELVDQHGLEKIEVSGDSYMV ------------------------------3333----3333------------------ VSGVPRPRPDHTQALADFALDMTNVAAQLKDPRGNPVPLRVGLATGPVVAGVVGSRRFRY ---------3333---------3333---------------------------------- CVWGDAVNVASRMESTDSVGQIQVPDEVYERLKDDFVLRERGHINVKGKGVMRTWYLIGR ---3333---3333----------3333---%%%%------------------------- KVAA ---- >MONOTHIOL GLUTAREDOXIN YD; SWP:P37010; PDB:1YKAA; MSTTIEKIQRQIAENPILLYMKGSPKLPSCGFSAQAVQALAACGERFAYVDILQNPDIRA -----------------------3333---3333---------------------3333- ELPKYANWPTFPQLWVDGELVGGCDIVIEMYQRGELQQLIKETAAKYKSEEPDAE --3333---------%%%%------------------------------------ >ADENYLATE CYCLASE; SWP:P94182; PDB:1YKDA; VTEVEQKLQIVHQTLSMLDSHGFENILQEMLQSITLKTGELLGADRTTIFLLDEEKQELW -----------------2222--------------------------------1111--- SIVAAGEGDRSLEIRIPADKGIAGEVATFKQVVNIPFDFYHDPRSIFAQKQEKITGYRTY ------%%%%------1111-----------------33333333--------------- TMLALPLLSEQGRLVAVVQLLNKLKPYSPPDALLAERIDNQGFTSADEQLFQEFAPSIRL --------3333----------------11113333--1111------------------ ILESSRSFYIATQKQRAAAAMMKAVKSLSQSSLDLEDTLKRVMDEAKELMNADRSTLWLI 
------------------------------------------------------------ DRDRHELWTKITQDNGSTKELRVPIGKGFAGIVAASGQKLNIPFDLYDHPDSATAKQIDQ -1111------------------2222-----------------33331111-------- QNGYRTCSLLCMPVFNGDQELIGVTQLVNKKKTGEFPPYNPETWPIAPECFQASFDRNDE ---------------------------------------3333-----1111-------- EFMEAFNIQAGVALQNAQLFATV ----------------------- >RNA polymerase II mediato; SWP:P47822; PDB:1YKEB; DRLTQLQICLDQMTEQFCATLNYIDKNHGFERLTVVPPEEFSNTIDELSTDIILKTRQIN -------------------3333----3333----------------------------- KLIDSLPGVDVSAEEQLRKIDMLQKKLVEVEDEKIEAIKKKEKLLRHVDSLIEDFVDGI ------------3333------------------------------------------- >NADP-DEPENDENT ALCOHOL DE; SWP:P14941; PDB:1YKFA; MKGFAMLSIGKVGWIEKEKPAPGPFDAIVRPLAVAPCTSDIHTVFEGAIGERHNMILGHE -------2222-----------1111---------------------------------- AVGEVVEVGSEVKDFKPGDRVVVPAITPDWRTSEVQRGYHQHSGGMLAGWKFSNVKDGVF --------1111---2222------------3333------2222--------------- GEFFHVNDADMNLAHLPKEIPLEAAVMIPDMMTTGFHGAELADIELGATVAVLGIGPVGL -------3333-----111133331111----------------2222------------ MAVAGAKLRGAGRIIAVGSRPVCVDAAKYYGATDIVNYKDGPIESQIMNLTEGKGVDAAI ------1111----------------------------------------iiii------ IAGGNADIMATAVKIVKPGGTIANVNYFGEGEVLPVPRLEWGCGMAHKTIKGGLCPGGRL ----3333--------2222----------------3333%%%%---------------- RMERLIDLVFYKRVDPSKLVTHVFRGFDNIEKAFMLMKDKPKDLIKPVVILA ---------------3333------3333-----3333--1111-------- >Sulfite reductase [NADPH]; SWP:P38038; PDB:1YKGA; ITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGE ---------------------------------3333-33331111----------%%%% PPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDR -3333--------1111--------------3333-2222-------------------- VDADVEYQAAASEWRARVVDALKSRA ---3333--------------1111- >RNA POLYMERASE II MEDIATO; SWP:Q08278; PDB:1YKHA; NYQYKIQELRKLLKSLLLNYLELIGVLSINPDMYERKVENIRTILVNIHHLLNEYRPHQS 3333-----------------1111-----1111-------------------------- RESLIMLLEEQLEYKRGEIREIEQVCKQVHDKLTS 
----------------------------------- >RNA polymerase II mediato; SWP:P47822; PDB:1YKHB; TDRMTQLQICLDQMTEQFCATLNYIDKNHGFEVVPPEEFSNTIDELSTDIILKTRQINKL ----------------------------------3333---------------------- IDSLPGVDVSAEEQLRKIDMLQKKLVEVEDEKIEAIKKKEKLMRHVDSMIEDFV ---2222----------------------------------------------- >OXYGEN-INSENSITIVE NAD(P); SWP:P38489; PDB:1YKIA; DIISVALKRHSTKAFDASKKLTPEQAEQIKTLLQYSPSSTNSQPWHFIVASTEEGKARVA ---------------1111------------------2222------------------3 KSAAGNYVFNERKMLDASHVVVFCAKTAMDDVWLKLVVDQEDADGRFATPEAKAANDKGR 333!!!!----------------------------------1111--------------- KFFADMHRKDLHDDAEWMAKQVYLNVGNFLLGVAALGLDAVPIEGFDAAILDAEFGLKEK ---------------------------------1111--------------------111 GYTSLVVVPVGHHSVEDFNATLPKSRLPQNITLTEV 1------------11113333------1111----- >Genome polyprotein [conta; SWP:P03314; PDB:1YKSA; SHMLKKGMTTVLDFHPGAGKTRRFLPQILAECARRRLRTLVLAPTRVVLSEMKEAFHGLD -1111---------------------------1111-----------------1111--- VKFHTQAFSAHGSGREVIDAMCHATLTYRMLEPTRVVNWEVIIMDEAHFLDPASIAARGW ----------------------------1111---------------------------- AAHRARANESATILMTATPPGTSDEFPHSNGEIEDVQTDIPSEPWNTGHDWILADKRPTA ----1111----------2222----------------------------1111------ WFLPSIRAANVMAASLRKAGKSVVVLNRKTFEKKPDFILATDIAEMGANLCVERVLDCRT ----------------1111----------------------11111111---------- AFKPVLVDEGRKVAIKGPLRISASSAAQRRGRIGRNPNRDGDSYYYSEPTSENNAHHVCW -------iiii------------------3333--1111--------------1111--- LEASMLLDNMEVRGGMVAPLYGVEGTKTPVSPGEMRLRDDQRKVFRELVRNCDLPVWLSW -----3333--2222-----!!!!------2222-------------------------- QVAKAGLKTNDRKWCFEGPEEHEILNDSGETVKCRAPGGAKKPLRPRWCDERVSSDQSAL -------11113333---1111---1111------2222----------3333------- SEFIKFAEGRR ----------- >HYPOTHETICAL PROTEIN PXO2; SWP:Q9RMX2; PDB:1YKUA; KCLLCRYLKERQEKFISDWKKKVIIRERDPYKEEIIKNGEHLLSAFIMYLKEEISLQEIE ------------------1111---1111-------------------1111--3333-- ITSKKIARERIDAKVNIAEFIHNTNVAKIEIMNILTLLNPDLQQYQALVKKINQFFDHLI 
------------------------------------------------------------ YYTVHSYYEQKA ------------ >RUBISCO-LIKE PROTEIN; SWP:Q8KBL4; PDB:1YKWA; EDVKGFFASRESLDMEQYLVLDYYLESVGDIETALAHFCSEQSTFRLVHAAKVIDYEVIE -3333---3333-3333----------------------1111----------------- ELEQLSYPVKHSETGKIHACRVTIAHPHCNFGPKIPNLLTAVCGEGTYFTPGVPVVKLMD --------------------------3333-------------3333--2222------- IHFPDTYLADFEGPKFGIEGLRDILNAHGRPIFFGVVKPNIGLSPGEFAEIAYQSWLGGL ---33331111--------------------------------3333--------1111- DIAKDDEMLADVTWSSIEERAAHLGKARRKAEAETGEPKIYLANITDEVDSLMEKHDVAV -----1111--11113333----------------------------1111--------1 RNGANALLINALPVGLSAVRMLSNYTQVPLIGHFPFIASFSRMEKYGIHSKVMTKLQRLA 111------3333----------------------3333--------------------- GLDAVIMPGFGDRVMTPEEEVLENVIECTKPMGRIKPCLPVPGGSDSALTLQTVYEKVGN ----------1111-----------------!!!!-----------1111---------- VDFGFVPGRGVFGHPMGPKAGAKSIRQAWEAIEQGISIETWAETHPELQAMVDQ -------------1111--------------1111-----3333---------- >ARGININE N-SUCCINYLTRANSF; SWP:P80357; PDB:1YLEA; HLVRPAQAADLPQVQRLAADSPVGVTSLPDDAERLRDKILASEASFAAEVSYNGEESYFF ------3333-----------3333----------------------------------- VLEDSASGELVGCSAIVASAGFSEPFYSFRNETFVHASRSLSIHNKIHVLSLCHDLTGNS ------------------2222----------------1111-----------1111--- LLTSFYVQRDLVQSVYAELNSRGRLLFASHPERFADAVVVEIVGYSDEQGESPFWNAVGR -------1111------------------3333-------------------------33 NFFDLNYIEAEKLSGLKHYPIYVPLLPDAAQESGQVHPRAQITFDILREGFETDNYIDIF 33-------------------3333-----------1111-------------------- DGGPTLHARTSGIRSIAQSRVVPVKIGEKSGRPYLVTNGQLQDFRAVVLDLDWAPGKPVA --------33331111------------------------1111---------2222--- LSVEAAEALGVGEGASVRLVAVGS -----------2222--------- >RRF2 FAMILY PROTEIN; SWP:NA; PDB:1YLFA; KISSRFSIAVHILSILKNNPSSLCTSDYAESVNTNPVVIRKISYLKQAGFVYVNGGAGLL -------------------3333-3333-1111--------------------------- KDLHEITLLDVYHAVNVIGANIQAVLEIILIQAQSAEEVLRNITGQLFETLQE -3333------------------------------------------------ >PHOSPHOENOLPYRUVATE CARBO; SWP:Q6W6X5; 
PDB:1YLHA; DLNKLVKELNDLGLTDVKEIVYNPSYEQLFEEETKPGLEGFDKGTLTTLGAVAVDTGIFT ---------1111---------------------2222!!!!----1111---------- GRSPKDKYIVCDETTKDTVWWNSEAAKNDNKPMTQETWKSLRELVAKQLSGKRLFVVEGY --3333-----3333-------------------------------1111---------- CGASEKHRIGVRMVTEVAWQAHFVKNMFIRPTDEELKNFKADFTVLNGAKCTNPNWKEQG ---1111---------3333-----------33331111--------3333-11111111 LNSENFVAFNITEGIQLIGGTWYGGEMKKGMFSMMNYFLPLKGVASMHCSANVGKDGDVA ----------1111--------3333-----------3333------------1111--- IFFGLSGTGKTTLSTDPKRQLIGDDEHGWDESGVFNFEGGCYAKTINLSQENEPDIYGAI ----22223333---1111----------1111-----------22223333----1111 RRDALLENVVVRADGSVDFDDGSKTENTRVSYPIYHIDNIVRPVSKAGHATKVIFLTADA 2222-------1111--11113333-------1111----------------------11 FGVLPPVSKLTPEQTEYYFLSGFTAPTPTFSACFGAAFLSLHPIQYADVLVERMKASGAE 11--------------------------------3333---3333--------------- AYLVNTGWNGTGKRISIKDTRGIIDAILDGSIEKAEMGELPIFNLAIPKALPGVDPAILD --------3333--------------11111111----------------22221111-3 PRDTYADKAQWQVKAEDLANRFVKNFVKYTANPEAAKLVGAGPK 333----------------------3333--------3333--- >PUTATIVE ACYL-COA THIOEST; SWP:P44886; PDB:1YLIA; RQSKGVLLLRTLAMPSDTNANGDIFGGWIMSQMDMGGAILAKEIAHGRVVTVAVESMNFI -------------1111-1111--3333-------------------------------- KPISVGDVVCCYGQCLKVGRSSIKIKVEVWVKKVASEPIGERYCVTDAVFTFVAVDNNGR ---2222-----------1111---------------2222--------------1111- SRTIPRENNQELEKALALISEQ ---------------------- >BETA-LACTAMASE CTX-M-9A; SWP:Q9L5C8; PDB:1YLJA; QTSAVQQKLAALEKSSGGRLGVALIDTADNTQVLYRGDERFPMCSTSKVMAAAAVLKQSE -----------------------------------1111------------------333 TQKQLLNQPVEIKPADLVNYNPIAEKHVNGTMTLAELSAAALQYSDNTAMNKLIAQLGGP 31111-------1111------33332222------------------------1111-- GGVTAFARAIGDETFRLDRTEPTLNTAIPGDPRDTTTPRAMAQTLRQLTLGHALGETQRA -------1111----------------2222----------------------------- QLVTWLKGNTTGAASIRAGLPTSWTAGDKTGSGDYGTTNDIAVIWPQGRAPLVLVTYFTQ -----1111--11113333-1111---------%%%%----------------------- PQQNAESRRDVLASAARIIAEGL 
-1111--3333--------2222 >HYPOTHETICAL PROTEIN RV12; SWP:P64797; PDB:1YLKA; GTVTDDYLANNVDYASGFKGPLPMPPSKHIAIVACMDARLDVYRMLGIKEGEAHVIRNAG -------------3333-------------------1111--------2222-------- CVVTDDVIRSLAISQRLLGTREIILLHHTDCGMLTFTDDDFKRAIQDETGIRPTWSPESY ------------------------------3333-------------------------- PDAVEDVRQSLRRIEVNPFVTKHTSLRGFVFDVATGKLNEVTP ----------------1111----------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9HU79; PDB:1YLLA; SELRILRAVDYPRPGSTEEIARDGGDGLDGFGWRLSIADVGESGGFSGFAGYQRIISVLE ------3333--------------------------------------2222-------- GGGRLRVDGAESAPLRARQAFAFSGDSEVHCTLLDGAIRDFNLIYAPRRHRARLQWLRVE ------iiii-----2222----1111------------------1111----------- GELDWHGTASTLLLFAQQDGVAISLQGQPRGQLAAHDCLCAEGLQGLQHWRLTAHEPAWV ------------------------iiii-----2222----------------------- CAVELDSL -------- >HYPOTHETICAL PROTEIN BSU3; SWP:O32126; PDB:1YLMA; YFVDRSKIEKTLGFFEHQLALFDSQTDWQSEIGELALQRIGHLLIECILDTGNDIDGFIR -------------------------------------------------------1111- DPGSYDDIDILVDEKVVTEKEGDELKKLIAYRKTLVQQYLLADSGELYRLIKAHQTALQD ---3333------------------------------3333------------------- FPKRIRSYLETELGPVSAF ------------------- >HYPOTHETICAL PROTEIN VCA0; SWP:Q9KNC3; PDB:1YLNA; TVSTINSTDALAMVEHSELTLSITTPVGTKFVCRTPFIGTHTDKFLLVEMPKISADDLQY ----------1111----------1111-------------------------------- FFQEGFWMNIRAISPRGEGALIHFRSQLMHILQEPVPMAFLSIPNTMQVSQLRKEPRFEL --2222---------!!!!----------------------------------------- NLAGKVLFDEHRGDCELRDLSRSGCRFITPPLGKTYQVGDLVALEIFSDLRGTKTFPPLT -------iiii---------1111-----1111---2222--------1111-------- GKICNLQRSLHHARYGLEFNEEGRNNAKNLLAQLKFNGTKLTLNA --------1111------------------1111----------- >HYPOTHETICAL PROTEIN SF24; SWP:Q83K87; PDB:1YLOA; ADLSLLKALSEADAIASSEQEVRQILLEEAARLQKEVRFDGLGSVLIRLNESTGPKVICA --------------22223333--------1111-----1111----------------- HDEVGFVRSISREGAIDVLPVGNVRAARQLQPVRITTREECKIPGLLDGDRQGNDVSARV ----------1111----------------------1111-------------------- 
DIGARTYDEVQAGIRPGDRVTFDTTFQVLPHQRVGKAFDDRLSCYLLVTLLRELHDAELP -----3333-----2222-----------%%%%--------------------1111--- AEVWLVASSSEEVGLRGGQTATRAVSPDVAIVLDTACWAKNFDYGAANHRQIGNGPLVLS ----------1111-------------------------1111-3333--2222------ DKSLIAPPKLTAWIETVAAEIGVPLQADFSNGGTDGGAVHLTGTGVPTLVGPATRHGHCA 1111-------------------------------------!!!!--------------- ASIADCRDILQEQLLSALIQRLTRETVVQLTDFR ------------------1111-----1111--- >putative nucleotidyltrans; SWP:NA; PDB:1YLQA; HMKEIKEITKKDVQDAEIYLYGSVVEGDYSIGLSDIDVAIVSDVFEDRNRKLEFFGKITK ------------1111-----3333----------------3333--------------- KFFDSPFEFHILTKKEWKMSKRFIRKYRRLD --------------------1111------- >HYPOTHETICAL PROTEIN APC3; SWP:Q5KZY7; PDB:1YLXA; EFAPRSVVIEEFIDTLEPEAYGLDQVGIFEEHGEGNRYYVGYTINKDDEITIHPFVKNER ---3333-----1111--1111-----------!!!!--------iiii--------111 GELALEKQEWTVRKDGREKKGFHSLQEAEEVIHS 1------------iiii------3333------- >FIBRINOTIC ENZYME COMPONE; SWP:Q9BLI8; PDB:1YM0A; IVGGIEARPYEFPWQVSVRRKSSDSHFCGGSIINDRWVVCAAHCMQEA -----------1111-------------------------3333---- >CARBONIC ANHYDRASE (CARBO; SWP:O53573; PDB:1YM3A; TNPVAAWKALKEGNERFVAGRPQHPSQSQKPTAVIFGCADSRVAAEIIFDQGLGDMFVVR ------------------------%%%%---------1111--3333----2222----- TAGHVIDSAVLGSIEYAVTVLNVPLIVVLGHDSCGAVNAALAAINDGTLPGGYVRDVVER -%%%%---------------------------------------------!!!!--3333 VAPSVLLGRRDGLSRVDEFEQRHVHETVAILMARSSAISERIAGGSLAIVGVTYQLDDGR --------1111-----------------------3333---------------1111-- AVLRDHIGNIGEE ------------- >Hypothetical 32.6 kDa pro; SWP:P38765; PDB:1YM5A; TLMVPFKQVDVFTEKPFMGNPVAVINFLEIDENEVSQEELQAIANWTNLSETTFLFKPSD --------------2222--------11113333-------------------------3 KKYDYKLRIFTPRSELPFAGHPTIGSCKAFLEFTKNTTATSLVQECKIGAVPITINEGLI 333----------------------------1111----------1111------iiii- SFKAPMADYESISSEMIADYEKAIGLKFIKPPALLHTGPEWIVALVEDAETCFNANPNFA ------------------------------------------------------------ MLAHQTKQNDHVGIILAGPKKEAAIKNSYEMRAFAPVINVYEDPVCGSGSVALARYLQEV 
-----------------------------------1111------3333----------- YKFEKTTDITISEGGRLKRNGLMLASIKKEADNSTSYYIAGHATTVIDGKIKVH -------------3333------------1111--------------------- >Protein L [Precursor]; SWP:Q51918; PDB:1YMHE; KEEVTIKVNLIFADGKIQTAEFKGTFEEATAEAYRYADLLAKVNGEYTWDLEDGGNHMNI ----------------------------------------------------iiii---- KFAGK ----- >CASEIN KINASE II, ALPHA C; SWP:P68400; PDB:1YMIA; SGPVPSRARVYTDVNTHRPREYWDYESHVVEWGNQDDYQLVRKLGRGKYSEVFEAINITN ----------111111113333-3333------3333----------------------- NEKVAVKILKPVKKKKIKREIKILENLRGGPNIITLADIVKDPVSRTPALVFEHVNNTDF ------------3333----------2222----------------------------33 KQLYQTLTDYDIRFYMYEILKALDYCHSMGIMHRDVKPHNVLIDHEHRKLRLIDWGLAEF 333333------------------------------3333----1111------1111-- YHPGQEYNVRVASRYFKGPELLVDYQMYDYSLDMWSLGCMLASMIFRKEPFFHGHDNYDQ -2222-------3333-3333----------------------------------3333- LVRIAKVLGTEDLYDYIDKYNIELDPRFNDILGRHSRKRWERFVHSENQHLVSPEALDFL ---------------------------------------3333-33331111-------- DKLLRYDHQSRLTAREAMEHPYFYTVVKDQARMG ------3333---------3333-3333------ >MBP PEPTIDE; SWP:NA; PDB:1YMMD; QALSIQEGENATMNCSYKTSINNLQWYRQNSGRGLVHLILIRSNEREKHSGRLRVTLDTS ---------------------------------------------------------333 KKSSSLLITASRAADTASYFCATDTTSGTYKYIFGT 3-----------1111-------------------- >SUGAR-PHOSPHATE PHOSPHATA; SWP:Q8A090; PDB:1YMQA; TKALFFDIDGTLVSFETHRIPSSTIEALEAAHAKGLKIFIATGRPKAIINNLSELQDRNL -------2222---------3333-------1111---------3333---33331111- IDGYITMNGAYCFVGEEVIYKSAIPQEEVKAMAAFCEKKGVPCIFVEEHNISVCQPNEMV -----%%%%----!!!!-------3333----------------------------3333 KKIFYDFLHVNVIPTVSFEEASNKEVIQMTPFITEEEEKEVLPSIPTCEIGRWYPAFADV ----------------3333--------------------33331111------------ TAKGDTKQKGIDEIIRHFGIKLEETMSFGDGGNDISMLRHAAIGVAMGQAKEDVKAAADY -2222----------1111-1111------1111------------1111----1111-- VTAPIDEDGISKAMKHFGII ---1111------------- >STEROIDOGENIC FACTOR 1; SWP:P33242; PDB:1YMTA; 
GSSGGPNVPELILQLLQLEPEEDQVRARIVGCLQQPAPFSLLCRMADQTFISIVDWARRC ---------------1111-3333------------------------------------ MVFKELEVADQMTLLQNSWSELLVLDHIYRQVQYGKEDSILLVTGQEVELSTVAVQAGSL -3333------------------------------------1111---3333-------- LHSLVLRAQELVLQLHALQLDRQEFVCLKFLILFSLDVKFLNNHSLVKDAQEKANAALLD -------------------------------1111-3333-------------------- YTLSHYPHSGDKFQQLLLSLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQA -----1111----------------------------1111--2222-------- >CC45; SWP:NA; PDB:1YMZA; MPLPPGWERRTDVEGKVYYFNVRTLTTTWERPTIILE ---2222--------------1111------------ >TRUNCATED CELL SURFACE PR; SWP:Q99QS1; PDB:1YN3A; GSTVPYTITVNGTSQNILSNLTFNKNQNISYKDLEGKVKSVLESNRGITDVDLRLSKQAK ---------1111----------------------------------------------- YTVNFKNGTKKVIDLKSGIYTANLINSSDIKSININID ----1111-----1111--------1111--------- >EAPH1; SWP:Q99S64; PDB:1YN4A; GKHTVPYTISVDGITALHRTYFVFPENKKVLYQEIDSKVKNELASQRGVTTEKINNAQTA ----------%%%%----------------3333-------------------1111--- TYTLTLNDGNKKVVNLKKNDDAKNSIDPSTIKQIQIVVK -----1111-----33331111----3333--------- >EAPH2; SWP:Q99VA9; PDB:1YN5A; AKEMQNVPYTIAVDGIMAFNQSYLNLPKDSQLSYLDLGNKVKALLYDERGVTPEKIRNAK ------------iiii-----------------------------------33331111- SAVYTITWKDGSKKEVDLKKDSYTANLFDSNSIKQIDINVKTK -------1111-----1111--------3333----------- >NBP2P; SWP:Q12163; PDB:1YN8A; GQRAVALYDFEPENDNELRLAEGDIVFISYKHGQGWLVAENESGSKTGLVPEEFVSYIQ -------------1111---2222--------2222----1111------1111----- >POLYNUCLEOTIDE 5'-PHOSPHA; SWP:P24656; PDB:1YN9A; MFPARWHNYLQCGQVIKDSNLICFKTPLRPELFAYVTSEEDVWTAEQIVKQNPSIGAIID --2222----------------------33331111-3333----------1111----- LTNTSKYYDGVHFLRAGLLYKKIQVPGQTLPPESIVQEFIDTVKEFTEKCPGMLVGVHCT --------3333-1111--------------------------------2222------- HGINRTGYMVCRYLMHTLGIAPQEAIDRFEKARGHKIERQNYVQDLLI ------------------------------------------------ >ENDO-1,4-BETA-XYLANASE; SWP:O43097; PDB:1YNA; TTPNSEGWHDGYYYSWWSDGGAQATYTNLEGGTYEISWGDGGNLVGGKGWNPGLNARAIH 
--------iiii-----------------!!!!--------------------------- FEGVYQPNGNSYLAVYGWTRNPLVEYYIVENFGTYDPSSGATDLGTVECDGSIYRLGKTT -----------------------------------1111---------iiii-------- RVNAPSIDGTQTFDQYWSVRQDKRTSGTVQTGCHFDAWARAGLNVNGDHYYQIVATEGYF -----1111--------------------3333-----1111------------------ SSGYARITVADVG ------------- >HYPOTHETICAL PROTEIN AF14; SWP:O28840; PDB:1YNBA; MDDVVKFIHEVGSLKLTPRSGWLKLGIRLPESVAEHNFRAAIIAFILALKSGESVEKACK ------------1111--33331111----------------------1111-------- AATAALFHDLHEARTMDLHKIARRYVSCDEEGAREEQLSWMESKPDFSDVEVYVSDADKL ----11113333------3333---------------3333-----1111---------- ELAFQGVEYSQQVSYAIRFAENVELKTDAAKEIYRVLMERKNPVWWR ------------33333333--------------------------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:P62937; PDB:1YNDA; VNPTVFFDIAVDGEPLGRVSFELFADKVPKTAENFRALSTGEKGFGYKGSCFHRIIPGFM ----------iiii--------------------------1111--2222-----2222- CQGGDFTRHNGTGGKSIYGEKFEDENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEW ---------------1111-------------2222---------------------333 LDGKHVVFGKVKEGMNIVEAMERFGSRNGKTSKKITIADCGQLE 3------------3333----11113333--------------- >SUCCINYLARGININE DIHYDROL; SWP:P76216; PDB:1YNFA; NAWEVNFDGLVGLTHHYAHRFQVSNPRLAAKQGLLKKALADAGFPQAVIPPHERPFIPVL -----------1111----------------------------------------3333- RQLGFSGSDEQVLEKVARQAPHWLSSVSSASPWVANAATIAPSADTLDGKVHLTVANLNN 1111--------------------1111-------------33331111--------333 KFHRSLEAPVTESLLKAIFNDEEKFSVHSALPQVALLGDEGAANHNRLGGHYGEPGQLFV 31111---------------3333---------3333----1111-----3333------ YGREEGNDTRPSRYPARQTREASEAVARLNQVNPQQVIFAQQNPDVIDQGVFHNDVIAVS ---2222-------------------------1111----------1111--1111---- NRQVLFCHQQAFARQSQLLANLRARVNGFAIEVPATQVSVSDTVSTYLFNSQLLSRDDGS !!!!---1111--------------2222----3333---------1111-----1111- LVLPQECREHAGVWGYLNELLAADNPISELKVFDLRESANGGGPACLRLRVVLTEEERRA ---3333---------------------------3333----3333-----------333 VNPAVNDTLFNALNDWVDRYYRDRLTAADLADPQLLREGREALDVLSQLLNLGSVYPFQR 
33333--------------------3333------------------1111----3333- >IG GAMMA LIGHT CHAIN; SWP:NA; PDB:1YNLH; RVQLLESGAELMKPGASVQISCKATGYTFSFYWIEWVKERPGHGLEWIGEILPGSGRTNY ------------2222-----------1111--------2222--------2222----- REKFKGKATFTADTSSNTAYMQLSSLTSEDSAVYYCTRGYSSMDYWGQGTSVTVSAAKTT 3333----------------------3333------------------------------ PPSVYPLAPGCGDTTGSSVTLGCLVKGYFPESVTVTWNSGSLSSSVHTFPALLQSGLYTM ---------%%%%----------------------------------------iiii--- SSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLEPSGPI ------3333-----------3333-------------- >Kappa light chain C_regio; SWP:Q65ZC0; PDB:1YNLL; ELVMTQSPLSLPVSLGDQASISCRPSQSLVHSNGNTYLHWYLQKPGQSPKLLIYRVSNRF -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTAFTLKISRVEAEDLGVYFCSQGTHVPYTFGGGTKLELKRADAAPTV 2223333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ----------------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ------33331111--------1111--------3333- >OXIDOREDUCTASE; SWP:NA; PDB:1YNPA; MKKRQLGTSDLHVSELGFGCMSLGTDETKARRIMDEVLELGINYLDTADLYNQGLNEQFV -----!!!!----------------------------------------%%%%------- GKALKGRRQDIILATKVSKAYIKEAVKDSLRRLQTDYIDLYQLHGGTIDDPIDETIEAFE -3333-3333------------------------------------1111---------- ELKQEGVIRYYGISSIRPNVIKEYLKRSNIVSIMMQYSILDRRPEEWFPLIQEHGVSVVV -------------------------------------11113333--------------- RGPVARGLLSRRPLPEGEGYLNYRYDELKLLRESLPTDRPLHELALQYCLAHDVVATVAA -1111---------2222-!!!!----------------------------3333----- GASSIDQVKANVQAVEATPLTAEERQHIQKLAKAAVYEQHRE -------------1111------------------------- >CYTOCHROME C-552; SWP:P15452; PDB:1YNRA; NEQLAKQKGCMACHDLKAKKVGPAYADVAKKYAGRKDAVDYLAGKIKKGGSGVWGSVPMP --------1111-------------------2222----------------1111----- PQNVTDAEAKQLAQWILSIK ---------------1111- >REPLICATION FACTOR-A PROT; SWP:P22336; PDB:1YNXA; TRPIFAIEQLSPYQNVWTIKARVSYKGEIKTWHNQRGDGKLFNVNFLDTSGEIRATAFND 
-----3333-3333---------------------------------3333--------- FATKFNEILQEGKVYYVSKAKLQPAKPQFTNLTHPYELNLDRDTVIEECFDESN -----3333----------------3333------------------------- >D-HYDANTOINASE; SWP:Q5DLU2; PDB:1YNYA; KKWIRGGTVVTAADTYQADVLIEGERVVAIGHQGAEEIDATGCYVIPGGIDPHTHLDMPF ----------------------!!!!-------------2222----------------! GGTVTADDFFTGTRAAAFGGTTSIVDFCLTKKGESLKSAIATWHEKARGKAVIDYGFHLM !!!-------------------------------3333--------2222---------- IAEANDQVLEELESVISSEGITSLKVFMAYKNVFQADDETLFKTLVKAKELGALVQVHAE ------------------------------------------------------------ NGDVLDYLTKKALAEGNTDPIYHAYTRPPEAEGEATGRAIALTALAGSQLYVVHVSCASA ------------1111------3333-3333----------------------------- VQRIAEAREKGWNVYGETCPQYLALDVSIMDQPDFEGAKYVWSPPLREKWNQEVLWSALK -------1111-------3333----1111----3333---------3333--------- NGILQTVGSDHCPFNFRGQKELGRGDFTKIPNGGPLIEDRLTILYSEGVRQGRISLNQFV --------------------1111-3333------3333---------1111-------- DISSTKAAKLFGMFPRKGTIAVGSDADIVIFDPHVKRTLSVETHHMNVDYNPFEGMEVYG --------------------2222---------------3333-------1111------ EVVSVLSRGSFVVRDKQFVGQAGSGQYIKRTTFEQ ------iiii---%%%%---2222----------- >DYNEIN LIGHT CHAIN 1; SWP:Q8I5R9; PDB:1YO3A; SVVKNVDMTEEMQIDAIDCANQALQKYNVEKDIAAHIKKEFDRKYDPTWHCVVGRNFGSY ----1111---------------------------------------------------- VTHETKNFIYFYIGQVAILLFKSG ------------!!!!-------- >SAM pointed domain-contai; SWP:O95238; PDB:1YO5C; QPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIRKNRPAMNYDKLS --------------33331111-----1111-------------------1111------ RSIRQYYKKGIIRKPDISQRLVYQFVHP ------1111--------2222------ >PUTATIVE CARBONYL REDUCTA; SWP:P90780; PDB:1YO6A; MSPGSVVVTGANRGIGLGLVQQLVKDKNIRHIIATARDVEKATELKSIKDSRVHVLPLTV -------------------------1111--------3333-3333---1111-----11 TCDKSLDTFVSKVGEIVGSDGLSLLINNAGVLLSYGTNTEPNRAVIAEQLDVNTTSVVLL 11---------------3333--------------1111--------------------- TQKLLPLLKNAASKESGDQLSVSRAAVITISSGLGSITDNTSGSAQFPVLAYRMSKAAIN 
--------------------1111-------11113333--------------------- MFGRTLAVDLKDDNVLVVNFCPGWVEQSTAELISSFNKLDNSHNGRFFMRNLKPYEF ---------1111-----------3333-------11111111-----1111----- >REGULATORY PROTEIN ROP; SWP:P03051; PDB:1YO7A; MTKQEKTALNMARFIRSQTLTLLEKLNELDADEQADICESLHDHADELYRSCLASFKKNG ------------------------3333-------------------------------- QIDEQADICESLHDHADELYRSCLARFGGSKQEKTALNMARFIRSQTLTLLEKLNELAKG -----------------------------------------------------3333--- >THROMBOSPONDIN-2; SWP:P35442; PDB:1YO8A; DGCLSNPCFPGAQCSSFPDGSWSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRC 3333------------1111-------2222----------------------------- VNTQPGFHCLPCPPRYRGNQPVGVGLEAAKTEKQVCEPENPCKDKTHNCHKHAECIYLGH -------------------------1111----------3333------1111-----11 FSDPYKCECQTGYAGDGLICGEDSDLDGWPNLNLVCATNATYHCIKDNCPHLPNSGQEDF 11-------2222---------1111----------------------1111-1111-11 DKDGIGDACDDDDDNDGVTDEKDNCQLLFNPRQADYDKDEVGDRCDNCPYVHNPAQIDTD 11----1111--------3333-------3333--------1111--1111-1111-111 NNGEGDACSVDIDGDDVFNERDNCPYVYNTDQRDTDGDGVGDHCDNCPLVHNPDQTDVDN 1---3333--1111---2222--1111-3333-1111---3333--1111-1111-1111 DLVGDQCDNNEDIDDDGHQNNQDNCPYISNANQADHDRDGQGDACDPDDDNDGVPDDRDN ----1111---1111---1111--1111-1111-1111---3333--1111---3333-- CRLVFNPDQEDLDGDGRGDICKDDFDNDNIPDIDDVCPENNAISETDFRNFQVPLDPKGT 1111-1111-1111---3333--1111---3333-----1111----------------- TQIDPNWVIRHQGKELVQTANSDPGIAVGFDEFGSVDFSGTFYVNTDRDDDYAGFVFGYQ ---------%%%%----------------------------------------------- SSSRFYVVWKQVTQTYWEDQPTRAYGYSGVSLKVVNSTTGTGEHLRNALWHTGNTPGQVR 1111--------------------------------------3333-3333---2222-- TLWHDPRNIGWKDYTAYRWHLTHRPKTGYIRVLVHEGKQVADSGPIYDQTYAGGRLGLFV ----3333----------------1111-------!!!!--------------------- FSQEVYFSDLKYECRD ---------------- >PUTATIVE FLAVOPROTEIN; SWP:Q5SL73; PDB:1YOAA; MNLEAKKKVLRSFTYGLYVLTAKDGDEVAAGTVNWVTQASFQPPLVAVGLKRDSHLHALV ---------1111----------!!!!--------------------------------- ERTGKLALMTLAHDQKAIAQDFFKPTVREGDRLNGHPFEPSPTFGLPLLTELPYWLEAEV 
-----------1111----1111-----!!!!iiii------------3333-------- RHLYPGGDHSLVVAEVVEAGVRREEKPLVMWDTGWFYGG ----------------------------3333------- >FLAVODOXIN 2; SWP:P00324; PDB:1YOBA; AKIGLFFGSNTGKTRKVAKSIKKRFDDETMSDALNVNRVSAEDFAQYQFLILGTPTLGEG ---------------------1111---------3333-33333333----------%%% ELPGLSSDAENESWEEFLPKIEGLDFSGKTVALFGLGDQVGYPENYLDALGELYSFFKDR %--3333-----333333331111-2222--------33331111-3333-------111 GAKIVGSWSTDGYEFESSEAVVDGKFVGLALDLDNQSGKTDERVAAWLAQIAPEFGLSL 1--------2222----1111%%%%----------3333-----------3333----- >HYPOTHETICAL PROTEIN PA18; SWP:Q9I2R0; PDB:1YOCA; MSQMMQMYQQVGPAQFSAMIGQFAPYFASIAPQFVELRPGYAEVTFPKRREVLNHIGTVH -----------------------33331111------2222-------3333-1111--- AIALCNAAELAAGTMTDASIPAGHRWIPRGMTVEYLAKATGDVRAVADGSQIDWQATGNL --------------------1111------------------------11111111---- VVPVVAYVDDKPVFRAEITMYVSQA -------iiii-------------- ----------------------------- >HYPOTHETICAL PROTEIN YBEK; SWP:P41409; PDB:1YOEA; GSALPILLDCDPGHDDAIAIVLALASPELDVKAITSSAGNQTPEKTLRNVLRMLTLLNRT -------------------------3333-----------------------------11 DIPVAGGAVKPLMRELIIAESGLDGPALPEPTFAPQNCTAVELMAKTLRESAEPVTIVST 11---------------------------------------------------------- GPQTNVALLLNSHPELHSKIARIVIMGGAMGLGNWTPAAEFNIYVDPEAAEIVFQSGIPV ------------33331111---------------1111-------------1111---- VMAGLDVTHKAQIHVEDTERFRAIGNPVSTIVAELLDFFLEKWGFVGAPLHDPCTIAWLL ---33331111--3333----3333-------------------------3333------ KPELFTSVERWVGVETQGKYTQGMTVVDYYYLTGNKPNATVMVDVDRQGFVDLLADRLKF 3333----------------2222---1111--------------------------333 YA 3- >PROTO-ONCOGENE TYROSINE-P; SWP:P12931; PDB:1YOJA; EIPRESLRLEVKLGQGGEVWMGTWNGTTRVAIKTLMKKLRHEKLVQLYAVVSEEPIYIVT --3333----------------------------------1111---------------- EYMNKGSLLDFLKGETGKYLRLPQLVDMSAQIASGMAYVERMNYVHRDLRAANILVGENL --1111-------3333----------------------1111------3333----%%% VCKVAPIKWTAPEAALYGRFTIKSDVWSFGILLTELTTKGRVPYPGMVNREVLDQVERGY 
%----3333-3333------3333-----------1111----2222--------1111- RMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFTSTEPQYQPGENL ----22223333----------3333--3333------1111--1111------ >ASPARTATE AMINOTRANSFERAS; SWP:P00509; PDB:1YOO; MFENITTAPADPILGLADLLRADERPGKIDLGMGVYNDETGKTPVLTSVKKAEQYLLENE -1111--------------1111--------------1111------------------- TTKNYLGIDGIPEFGRCTQELLFGKGSALINDKRARTAQTPGGTGALRVAADFLAKNTSV ------1111-------------------1111--------------------------- RRVWVSNPGWPTHKSVFNSAGLEVREYAYYDAENHTLDFDALINSLNEAQAGDVVLFHGC ---------3333----1111------------------------33332222------- CHNPTGIDPTLEQWQTLAQLSVEKGWLPLFDFAYQGFARGLEEDAEGLRAFAAMHKELIV ---------------------------------2222---3333---------------- ASSYSKNFGLYNERVGTCTLVAADSETVDRAFSQMKAAIRVNYSSPPAHGASVVATILGN ----------1111------------------------1111--------------1111 DALRAIWEQELTDMRQRIQRMRQLFVNTLQEKGANRDFSFTIKQNGMFFFGGLTKEQVLR -----------------------------1111-----3333------------------ LREEFGVYAVASGRLNVAGMTPDNLAPLCEAIVAVL ---------1111--3333-3333------------ >KTI11P; SWP:NA; PDB:1YOPA; MVSTYDEIEIEDMTFEPENQMFTYPCPCGDRFQIYLDDMFEGEKVAVCPSCSLMIDVVFD --------3333-------------3333-----3333---------------------- KEDLAEYYEEAGIHPPEPIAAAA -33333333-------------- >AMYLOID PROTEIN-BINDING P; SWP:Q13564; PDB:1YOVA; KLLKEQKYDRQLRLWGDHGQEALESAHVCLINATATGTEILKNLVLPGIGSFTIIDGNQV ----------1111------------------------------3333------------ SGEDAGNNFFLQRSSIGKNRAEAAMEFLQELNSDVSGSFVEESPENLLDNDPSFFCRFTV 3333-------3333----------------1111---------------33331111-- VVATQLPESTSLRLADVLWNSQIPLLICRTYGLVGYMRIIIKEHPVIESHPDNALEDLRL ------------------------------!!!!------------------------11 DKPFPELREHFQSYDLDHMEKKDHSHTPWIVIIAKYLAQWYSETNGRIPKTYKEKEDFRD 11------------3333--3333------------------------------------ LIRQGILKNENGAPEDEENFEEAIKNVNTALNTTQIPSSIEDIFNDDRCINITKQTPSFW -1111---1111----3333------1111---------------3333----------- ILARALKEFVAKEGQGNLPVRGTIPDMIADSGKYIKLQNVYREKAKKDAAAVGNHVAKLL 
-----------3333--------------------------------------------- QSIGQAPESISEKELKLLCSNSAFLRVVRCRSLAEEYGLDTINKDEIISSMDNPDNEIVL --------------------1111----------------------------1111---- YLMLRAVDRFHKQQGRYPGVSNYQVEEDIGKLKSCLTGFLQEYGLSVMVKDDYVHEFCRY --------------------11113333------------1111-----3333------% GAAEPHTIAAFLGGAAAQEVIKIITKQFVIFNNTYIYSGMSQTSATFQL %%%-------------------1111----------------------- >HYPOTHETICAL PROTEIN PA36; SWP:NA; PDB:1YOXA; ELDYRILGESQTVEIELDPGETVIAEAGANYTGDIRFTARTHFTNEGQGKQHVAFAAPYP -------------------------2222------------------------------- GSVVAVDLDDVGGRLFCQKDSFLCAAYGTRVGIAEGFILQKLEGDGLVFVHAGGTLIRRQ ------3333%%%%---3333--------------------------------------- LNGETLRVDTGCLVAFTDGIDYDVQLAGLLLTTLKGSGTVWLQSLPFSRLAGRIYDATFR --------3333---------------------------------3333----------- AREEVR ------ >HYPOTHETICAL PROTEIN AF09; SWP:O29321; PDB:1YOZA; GHMLYINSFLDRMGEIIRGEKSVEEADKLLDQKNIFEMFRSDCEEILNLYKSGKAEKEEV ---------------------3333----------------------------------- QRNFYLLKTYVVSQLSIHFERLKEFAESKGEKKLDPEVINEIALYIDRVEKEV ----------------------------------3333----------3333- >FII; SWP:NA; PDB:1YP1A; ASPQVSVTLQLVVDSSMFAKYNGDAKKIVTVLDTRVNIMKSIFKPLLLLITLSGIEMWTS ------------------1111--------------------3333-------------- KDLITVKPAGDLTLSLFADWRQTLLLSRILNDNAQLQTAVDFRGAVVGLAFVGTMCNAKY -----------------------3333--------------------------2222--- SAGIIQDFSAIPLLMAVVMAHELGHNLGMLHDDGYSCDCDVCIMAPSLSSDPTKVFSNCS --------------------------------3333-----1111--------------- LILYEDFLSNEEPDCIDNA ------------3333--- >Glucose-1-phosphate adeny; SWP:P23509; PDB:1YP2A; QTCLDPDASRSVLGIILRLYPLTKKRAKPAVPLGANYRLIDIPVSNCLNSNISKIYVLTQ ------3333----------------3333-------3333------1111--------- FNSASLNRHLSRAYAEGFVEVLAAQQSPENPDWFQGTADAVRQYLWLFEEHTVLEYLILA --------------------------3333-------------3333------------- GDHLYRMDYEKFIQAHRETDADITVAALPMDEKRATAFGLMKIDEEGRIIEFAEKPQGEQ ------------------------------33331111-----1111------------- 
LQAMKVDTTILGLDDKRAKEMPFIASMGIYVISKDVMLNLLRDKFPGANDFGSEVIPGAT -1111-3333-----3333-------------------------1111--1111-----1 SLGMRVQAYLYDGYWEDIGTIEAFYNANLGITKKPVPDFSFYDRSAPIYTQPRYLPPSKM 111--------------------------1111---------1111-------------- LDADVTDSVIGEGCVIKNCKIHHSVVGLRSCISEGAIIEDSLLMGADYYETDADRKLLAA --------------------------------2222----------------------11 KGSVPIGIGKNCHIKRAIIDKNARIGDNVKIINKDNVQEAARETDGYFIKSGIVTVIKDA 11------2222-------2222--2222---3333-----3333----iiii---2222 LIPSGIII --2222-- >TRICYCLON A; SWP:NA; PDB:1YP8A; CGESCFLGTCYTKGCSCGEWKLCYGTNGGTIFD ----3333---2222--1111-----iiii--- >Subtilisin-chymotrypsin i; SWP:P01053; PDB:1YPCI; MKTEWPELVGKSVAAAKKVILQDKPEAQIIVLPVGTIVTMEYRIDRVRLFVDKLDNIAQV ----3333----------3333-1111-----2222------1111-----1111----- PRVG ---- >GMP REDUCTASE; SWP:Q81JJ9; PDB:1YPFA; NVFDYEDIQLIPAKCIVNSRSECDTTVTLGKHKFKLPVVPANMQTIIDERIATYLAENNY ---3333-----------3333------!!!!----------1111-------------- FYIMHRFQPEKRISFIRDMQSRGLIASISVGVKEDEYEFVQQLAAEHLTPEYITIDIAHG -------3333--------1111---------3333------------------------ HSNAVINMIQHIKKHLPESFVIAGNVGTPEAVRELENAGADATKVGIGPGKVCITKIKTG ---------------1111----------------------------------------- FGTGGWQLAALRWCAKAASKPIIADGGIRTNGDVAKSIRFGATMVMIGSLFAGHEESPGE --2222--------1111-----------3333--------------3333--3333--- TINVEGKKMFVEHKGSLEDTLIEMEQDLQSSISYAGGTKLDSIRTVDYVVVKNSI --------------------------------1111--33331111--------- >Chymotrypsinogen A; SWP:P00766; PDB:1YPHC; IVNGEEAVPGSWPWQVSLQDKTGFHFCGGSLINENWVVTAAHCGVTTSDVVVAGEFDQGS -------22221111----1111---------1111---3333--1111-------1111 SSEKIQKLKIAKVFKNSKYNSLTINNDITLLKLSTAASFSQTVSAVCLPSASDDFAAGTT ---------------1111--------------------1111------1111--2222- CVTTGWGLTRY ----------- >Chymotrypsinogen A; SWP:P00766; PDB:1YPHE; ANTPDRLQQASLPLLSNTNCKKYWGTKIKDAMICAGASGVSSCMGDSGGPLVCKKNGAWT ---------------33333333!!!!-1111----------2222--------iiii-- LVGIVSWGSSTCSTSTPGVYARVTALVNWVQQTLAAN --------11111111-----3333------------ >oxidised low 
density lipo; SWP:P78380; PDB:1YPQA; CSAPCPQDWIWHGENCYLFSSGSFNWEKSQEKCLSLDAKLLKINSTADLDFIQQAISYSS -----2222--!!!!------------------1111------------------1111- FPFWMGLSRRNPSYPWLWEDGSPLMPHLFRVRGAVSQTYPSGTCAYIQRGAVYAENCILA ----------1111---1111-----------------1111-----iiii----1111- AFSICQKKANL ----------- >PROFILIN; SWP:P07274; PDB:1YPRA; SWQAYTDNLIGTGKVDKAVIYSRAGDAVWATSGGLSLQPNEIGEIVQGFDNPAGLQSNGL 3333-----3333--------1111----------------------------------- HIQGQKFMLLRADDRSIYGRHDAEGVVCVRTKQTVIIAHYPPTVQAGEATKIVEQLADYL -iiii---------------!!!!----------------11113333------------ IGVQY 1111- >THYMIDYLATE SYNTHASE; SWP:P04818; PDB:1YPVA; VPPHGELQYLGQIQHILRGVRKDDRTGTGTLSVFGMQARYSLRDEFPLLTTKRVFWKGVL ---3333-----------------------------------------------3333-- EELLWFIKGSTNAKELSSKGVKDLGPVYGFQWRHFGAEYRDMESDYSGQGVDQLQRVIDT -----------33333333--------------2222---1111-2222----------- IKTNPDDRRIIMAWNPRDLPLMALPPHALQFYVVNSELSCQLYQRSGDMGLGVPFNIASY ---1111-------3333---------------%%%%----------------------- ALLTYMIAHITGLKPGDFIHTLGDAHIYLNHIEPLKIQLQREPRPFPKLRILRKVEKIDD --------1111---------------1111------1111---------------1111 FKAEDFQIEGYNPHPTIKME -3333--------------- >putative vitamin-B12 inde; SWP:Q8Y8Q1; PDB:1YPXA; NQVAPFYADHVGSILRTKGIKDAREKFQSGEITALELRKIENTEIKYIVEKQKEVGLKSI --------------------------------1111------------------------ TDGEFRRWHFDFLENLDGVEGYSVKITGPIDFTTHPFIEDFIFLKEAVGDNHVAKQTIPS -%%%%------33332222---------------3333---------------------- PALHYRGDIEYQPYLDDAEKFANDLATAYQKAIQAFYDAGCRYLQLDDTSWSYLCSDEGF -----------3333---------------------------------3333-------- DPETLQETYKNLINEAIKHKPADVITHICRGGYGPVAETLFGKLNIDGFFLEYDNERFAP ---3333-------1111------------------------------------------ LKYVTRPDLKIVLGLITSKTGEEDEAAIKARIEEASEIVPLSQLRLSPQCGFATEEEQWD -----1111------------------------------3333----------------- KLRYVVRLANDIWGE --------------- >VIRION MEMBRANE PROTEIN; SWP:P07612; PDB:1YPYA; AASIQTTVNTLSERISSKLEQEANASAQTKCDIEIGNFYIRQNHGCNLTVKNMCSADADA 
-------------------------1111------------------------------- QLDAVLSAATETYSGLTPEQKAYVPAMFTAALNIQTSVNTVVRDFENYVKQTCNSSAVVD ------------11113333----------------1111--------------1111-- NKLKIQNVIIDECYGAPGSPTNLEFINTGSSKGNCAIKALMQLTTKATTQIAPKQVAGTG ---------------1111----------3333----------------------iiii- VQ -- >H2-T22 PROTEIN; SWP:NA; PDB:1YPZE; GDQVEQSPSALSLHEGTDSALRCNFTTTMRSVQWFRQNSRGSLISLFYLASGTKENGRLK ------------------------------------------------------------ SAFDSKERRYSTLHIRDAQLEDSGTYFCAADTWHISEGYELGTDKLVFGQGTQVTVEPKS ------------------1111-------------------------------------- QPPAKPSVFIMKNGTNVACLVKDFYPKEVTISLRSSKKIVEFDPAIVISPSGKYSAVKLG ------------------------------------------------------------ QYGDSNSVTCSVQHNSETVHSTDFEAA ----3333-----iiii--3333---- >H2-T22 PROTEIN; SWP:NA; PDB:1YPZF; HGKLEQPEISISRPRDETAQISCKVFIESFRSVTIHWYRQKPNQGLEFLLYVLATPTHIF --------------------------3333------------------------------ LDKEYKKMEASKNPSASTSILTIYSLEEEDEAIYYCSYGEGSSGFHKVFAEGTKLIVIPS ------------------------------------------------------------ DKRLDADISPKPTIFLPSVAETNLHKTGTYLCLLEAFFPDVIRVYWKEKDGNTILDSQEG -----------------------------------------------3333--------- DTLKTNDTYMKFSWLTVPERAMGKEHRCIVKHENNKGGADQAIFFPSIKK -------------------------------------------------- >GLUTATHIONE S-TRANSFERASE; SWP:Q93698; PDB:1YQ1A; PSYKLTYFFFRGLGEPIRLLFHLAGVQFEEVRNPDQTWLDIKDSTPKQLPVLNIDGFELP ----------!!!!-------3333-------1111--3333------------------ QSGAILRYLARKFGFAGKTPEEEAWVDAVHDLFKDFLAEFKKFAAERRSGEVEKFRSEFF ----------------------------------------------------3333---3 LPARNTYFNILNGLLEKSNSGFLIGSDITFADLVVVDNLLTLKNYGLFDESEFTKLAALR 333---------------------------------------------3333-------- EKVNSYPGIKEYIAKRPV --1111-----3333--- >BETA-GALACTOSIDASE; SWP:Q8KRF6; PDB:1YQ2A; ADVSYLTDQGPGSGRRVPARSWLHSDAPALSLNGDWRFRLLPAAPGTAGAGSVLPSGETV ---3333------------------------------------2222-----------11 EGVAAESYDDAAWDTLPVPSHWVMGQDGKYGRPIYTNVQYPFPIDPPHVPDANPTGDFRR 
11--1111-1111-------1111-iiii------------------------------- RFDVPAQWFESTTAALTLRFDGVESRYKVWVNGQEIGVGSGSRLAQEFDVSDALRAGSNL ----3333-1111-----------------iiii------1111-----1111------- LVVRVHQWSAASYLEDQDQWWLPGIFRDVTLQARPAGGITDAWLRTGWSARSGAGTGTID --------3333----------------------2222---------------------- PEITADATAFPVTLSVPELGVNVTWKSAEEVAPLALENVEPWSAEVPRLYEASVSSAAES -----1111------3333-------3333------------3333---------1111- ISVRLGFRTVRIVGDQFLVNGRRVVFHGVNRHETHPDRGRVFDEAGAREDLALMKRFNVN ------------!!!!--iiii---------------!!!!-------------1111-- AIRTSHYPPHPRLLDLADEMGFWVILECDLETHGFEAGGWVENPSDVPAWRDALVDRMER ---------3333------------------333311112222---3333---------- TVERDKNHPSIVMWSLGNESGTGSNLAAMAAWAHARDSSRPVHYEGDYTGAYTDVYSRMY ----1111-----------------------------------1111------------- SSIPETDSIGRNDSHALLLGCDSAESARQRTKPFILCEYVHAMGNGPGAMDQYEALVDKY -----------------2222-------1111---------------------------3 PRLHGGFVWEWRDHGIRTRTAEGMEFFAYGGDFGEVVHDSNFVMDGMVLSDSTPTPGLYE 333----------------1111-----2222------!!!!------1111-------- FKQIVSPIRLGLSLPAGGKPTLAVANLRHTADASDVVLRWRVEHDGAVAASGEVAAEGSD --------------2222--------------1111-------iiii------------- GPLRAGESATIALPAMPAAPLGETWLTVEAVLRDATGWAPAGHPLGAVQLDLSAPAVPTR ---2222----------------------------11112222----------------- SPRPATPLDGALPVSLGPATFDAGTLVSLAGQPVSGPRLELWRAPTDNDRGAGFGAYGPG ---------------!!!!--iiii---iiii-------------3333-------1111 DPWLNSGRGVPAPSSEAVWKQAGLDRLTRRVEDVAALPDGIRVRTRYAAADSTHSVAVEE 1111%%%%--------------1111----------------------2222-------- NWQLDGGELCLRIDITPSAGWNLVWPRIGVRWDLPTDVDGAAWFGAGPRESYPDSMHATM ----iiii--------------------------1111-------------11111111- VARHAASLEELNVPYARPQETGHRSDVRWLELDRAGAPWLRIDAEPDAAGRRPGFSLARH ------3333-----------------------iiii---------1111---------- TAQEIAAAGHPHELPTPSHSYLYVDAAQHGLGSRACGPDVWPDFALRPEARTLKLRISPA ----3333-3333-------------------3333----3333---------------- >Succinate dehydrogenase I; SWP:Q9YHT2; PDB:1YQ3B; 
AATSRIKKFSIYRWDPDKPGDKPRMQTYEVDLNKCGPMVLDALIKIKNELDSTLTFRRSC --------------1111------------1111-------------------------- REGICGSCAMNIAGGNTLACTKKIDPDLSKTTKIYPLPHMYVVKDLVPDLSNFYAQYKSI -----1111--iiii--3333-----1111-------------!!!!----------111 EPYLKKKDESKQGKEQYLQSIEDRQKLDGLYECILCACCSTSCPSYWWNGDKYLGPAVLM 1---------2222-----3333-3333-1111----3333--3333------------- QAYRWMIDSRDDYTEERLAQLQDPFSLYRCHTIMNCTRTCPKGLNPGKAIAEIKKMMATY ----1111----------1111----1111-----33331111----------------- KE -- >Putative uncharacterized ; SWP:Q5ZIS0; PDB:1YQ3C; ATTAKEEMARFWEKNTKSSRPLSPHISIYKWSLPMAMSITHRGTGVALSLGVSLFSLAAL -------------3333-------1111-------------------------------- LLPEQFPHYVAVVKSLSLSPALIYSAKFALVFPLSYHTWNGIRHLVWDMGKGFKLSQVEQ ----3333---------------------------------------------------- SGVVVLILTLLSSAGIAAIS ---------------1111- >MINOR CAPSID PROTEIN; SWP:P22536; PDB:1YQ5A; GGVTDALSLYSTSTGGPASIAANALTDFDLSGALTVNSVGTGLTKSAAGIQLAAGKSGLY ----------1111---------------1111------------1111---2222---- QITTVKNNTVTTGNYLLRVKYGSSDFVVACPASSLTAGGTISLLIYCNVLGVVSLDVLKF ------1111----------!!!!--------3333---------------3333----- SLCNDGAALSNYIINITAAKIN ---------------------- >PERIPLASMIC [NIFE] HYDROG; SWP:P12943; PDB:1YQ9A; KKRPSVVYLHNAECTGCSESVLRTVDPYVDELILDVISMDYHETLMAGAGHAVEEALHEA --------------------1111-----------------3333---!!!!-------- IKGDFVCVIEGGIPMGDGGYWGKVGGRNMYDICAEVAPKAKAVIAIGTCATYGGVQAAKP ---------------iiii----iiii3333----3333--------------1111--- NPTGTVGVNEALGKLGVKAINIAGCPPNPMNFVGTVVHLLTKGMPELDKQGRPVMFFGET ------3333-3333------------3333----------------1111-3333---3 VHDNCPRLKHFEAGEFATSFGSPEAKKGYCLYELGCKGPDTYNNCPKQLFNQVNWPVQAG 333-1111----------1111--1111--1111--3333---3333--%%%%-3333-- HPCIACSEPNFWDLYSPFYSA ----1111-3333---1111- >Periplasmic [NiFe] hydrog; SWP:P12944; PDB:1YQ9H; NKIVVDPITRIEGHLRIEVEVEGGKIKNAWSMSTLFRGLEMILKGRDPRDAQHFTQRACG ---------------------%%%%-----------------22223333---------- VCTYVHALASVRAVDNCVGVKIPENATLMRNLTMGAQYMHDHLVHFYHLHALDWVNVANA 
-------------------------------------------------1111--33331 LNADPAKAARLANDLSPRKTTTESLKAVQAKVKALVESGQLGIFTNAYFLGGHPAYVLPA 111-------------------------------------!!!!--1111--1111---- EVDLIATAHYLEALRVQVKAARAMAIFGAKNPHTQFTVVGGCTNYDSLRPERIAEFRKLY -------------------------------------2222--3333------------- KEVREFIEQVYITDLLAVAGFYKNWAGIGKTSNFLTCGEFPTDEYDLNSRYTPQGVIWGN -----------------3333--1111---------------11111111-------%%% DLSKVDDFNPDLIEEHVKYSWYEGADAHHPYKGVTKPKWTEFHGEDRYSWMKAPRYKGEA %-------3333----1111--------3333--------2222-----------iiii- FEVGPLASVLVAYAKKHEPTVKAVDLVLKTLGVGPEALFSTLGRTAARGIQCLTAAQEVE ------------1111-------------------3333--------------------- VWLDKLEANVKAGKDDLYTDWQYPTESQGVGFVNAPRGMLSHWIVQRGGKIENFQLVVPS ----------------------------------1111--------%%%%-------333 TWNLGPRCAEGKLSAVEQALIGTPIADPKRPVEILRTVHSYDPCIACGVH 3------1111------3333-----3333--------1111-------- >HISTONE H1; SWP:P53551; PDB:1YQAA; KASSPSSLTYKEMILKSMPQLNDGKGSSRIVLKKYVKDTYPIVGSASNFDYLFNSAIKKC -----------------3333%%%%---------------3333-3333----------- VENGELVQPKGPSGIIKLNKKKVKLST -------3333---------------- >UBIQUILIN 3; SWP:Q9H347; PDB:1YQBA; SGLVPRGSPHLIKVTVKTPKDKEDFSVTDTCTIQQLKEEISQRFKAHPDQLVLIFAGKIL !!!!---1111----------------1111---------------3333----iiii-- KDPDSLAQCGVRDGLTVHLVIKRQHRAM 33333333---2222------------- >SINAPYL ALCOHOL DEHYDROGE; SWP:Q94G59; PDB:1YQDA; SPEEEHPVKAFGWAARDQSGHLSPFNFSRRATGEEDVRFKVLYCGVCHSDLHSIKNDWGF 3333----------------------------1111----------3333-----1111- SMYPLVPGHEIVGEVTEVGSKVKKVNVGDKVGVGCLVGACHSCESCANDLENYCPKMILT ------------------1111---2222----------------111133331111--- YASIYHDGTITYGGYSNHMVANERYIIRFPDNMPLDGGAPLLCAGITVYSPLKYFGLDEP ----1111-------------3333----------11113333---------1111---- GKHIGIVGLGGLGHVAVKFAKAFGSKVTVISTSPSKKEEALKNFGADSFLVSRDQEQMQA --------------------------------3333--------------3333---333 AAGTLDGIIDTVSAVHPLLPLFGLLKSHGKLILVGAPEKPLELPAFSLIAGRKIVAGSGI 3--------------------33332222--------------3333------------- 
GGMKETQEMIDFAAKHNITADIEVISTDYLNTAMERLAKNDVRYRFVIDVGNTLAATKP -------------------------1111-------1111-------------3333-- >HYPOTHETICAL UPF0204 PROT; SWP:O29630; PDB:1YQEA; HMKLVVCSESDTAGQNIKDNLLTFADFEEKDVGEFKLYLSDEFYIAETKERLIYADHIDE -------1111---------1111-------!!!!---------------33332222-- KLAKYIDFEEILFASRHSSKDGRKIFTVHVSGNVGTADFGGKPYSLAKPSPQTMKNYVLA -3333-------------1111-------------------------------------- LRERLDRKPEFEFTMEVTHHGPSEISKPSAFYEIGSTEEEWKDREAAEVVAEAMLDAIRA ---11111111-------------------------3333-------------------- EKMDWNVAVGVGGTHYAPRQTEIMLTTTFTFGHNFAKYTFEHLTAEFLVKAVKLSEAEYI -------------1111------------------11111111----------------- IIDEKSVNSAVKKIVNEAAEVAGVEVLKSKKVKKDFRLV --1111-3333---------------------------- >HYPOTHETICAL PROTEIN LMAJ; SWP:Q4Q7K6; PDB:1YQFA; ASDAALADATRRELEEEMGRSDKPEQPTPPAGWQVVRKPGTCTFDLTKSFEGEDLVVRYS 1111------------1111-----------------2222--------iiii------- TNQDSDKANSHNIFVYITQKNGQTMQADLSIEEGELVLNNIRFYDEAALAKDTGAEAEAK ---3333-----------1111---------%%%%----------3333----------- RNELYTGPLVHELDYDLLNCVMTYLEKRGVDEKLGEFVVLYSFWAEQQDYEAWLTTMNKF 1111----3333-------------1111-3333------------------------11 AS 11 >PYRROLINE-5-CARBOXYLATE R; SWP:Q9K1N1; PDB:1YQGA; NVYFLGGGNAAAVAGGLVKQGGYRIYIANRGAEKRERLEKELGVETSATLPELHSDDVLI -----------------------------------------------------1111--- LAVKPQDEAACKNIRTNGALVLSVAAGLSVGTLSRYLGGTRRIVRVPNTPGKIGLGVSGY ---33333333----iiii---------------1111---------------------- AEAEVSETDRRIADRIKSVGLTVWLDDEEKHGITGISGSGPAYVFYLLDALQNAAIRQGF -3333-----------1111------3333-------------------------1111- DAEARALSLATFKGAVALAEQTGEDFEKLQKNVTSKGGTTHEAVEAFRRHRVAEAISEGV ------------------------------111122223333------------------ CACVRRSQEERQYQ ---------3333- >IG HYPOTHETICAL 16092; SWP:Q81IG4_BACCR; PDB:1YQHA; AQQVTSFSVVPQAKTKDVYSVVDKAIEVVQQSGVRYEVGAETTLEGELDVLLDVVKRAQQ ----------------3333--------1111---------------------------- ACVDAGAEEVITSIKIHYRPSTGVTIDEKVWKYRDEYA --1111------------3333--3333-33333333- >XANTHOSINE 
PHOSPHORYLASE; SWP:P45563; PDB:1YQQA; QFSHNPLFCIDIIKTYKPDFTPRVAFILGSGLGALADQIENAVAISYEKLPGFPVSTVHG -----------3333----------------33331111------33332222----222 HAGELVLGHLQGVPVVCMKGRGHFYEGRGMTIMTDAIRTFKLLGCELLFCTNAAGSLRPE 2--------%%%%---------3333--1111-------------------------333 VGAGSLVALKDHINTMPGTPMVGLNDDRFGERFFSLANAYDAEYRALLQKVAKEEGFPLT 32222-------------1111---3333------------------------------- EGVFVSYPGPNFETAAEIRMMQIIGGDVVGMSVVPEVISARHCDLKVVAVSAITNMAEGL ----------------------------------------1111------------2222 SDVKLSHAQTLAAAELSKQNFINLICGFLRKIA --------------1111-----------1111 >D-ALANYL-D-ALANINE CARBOX; SWP:P15555; PDB:1YQSA; LPAPDDTGLQAVLHTALSQGAPGAMVRVDDNGTIHQLSEGVADRATGRAITTTDRFRVGS ----------------1111---------iiii---------3333----1111------ VTKSFSAVVLLQLVDEGKLDLDASVNTYLPGLLPDDRITVRQVMSHRSGLYDYTNDMFAQ -------------1111--11113333-2222--1111---------------3333--- TVPGFESVRNKVFSYQDLITLSLKHGVTNAPGAAYSYSNTNFVVAGMLIEKLTGHSVATE -------1111------------------2222----3333------------------- YQNRIFTPLNLTDTFYVHPDTVIPGTHANGYLTPDEAGGALVDSTEQTVSWAQSAGAVIS ------11111111---------------------2222---------33331111---- STQDLDTFFSALMSGQLMSAAQLAQMQQWTTVNSTQGYGLGLRRRDLSCGISVYGHTGTV ------------------------1111------------------1111---------2 QGYYTYAFASKDGKRSVTALANTSNNVNVLNTMARTLESAFCGKP 222------1111-------------------------------- >RNASE L INHIBITOR; SWP:Q8U306; PDB:1YQTA; EEDCVHRYGVNAFVLYRLPVVKEGVVGIVGPNGTGKSTAVKILAGQLIPNLCGDNDSWDG --------2222-------------------------------------%%%%---3333 VIRAFRGNELQNYFEKLKNGEIRPVVKPQYVDLIPKAVKGKVIELLKKADETGKLEEVVK --1111-----------------------3333-------3333---------3333--- ALELENVLEREIQHLSGGELQRVAIAAALLRNATFYFFDEPSSYLDIRQRLNAARAIRRL ---1111---3333-------------1111--------3333----------------- SEEGKSVLVVEHDLAVLDYLSDIIHVVYGEPGVYGIFSQPKGTRNGINEFLRGYLKDENV 1111-------------------------2222----------------------1111- RFRPYEIKFTKTGERVEIERETLVTYPRLVKDYGSFRLEVEPGEIKKGEVIGIVGPNGIG ------------1111----------------!!!!------------------------ 
KTTFVKLAGVEEPTEGKIEWDLTVAYKPQYIKADYEGTVYELLSKIDASKLNSNFYKTEL ------------------------------------------11113333---------- LKPLGIIDLYDREVNELSGGELQRVAIAATLLRDADIYLLDEPSAYLDVEQRLAVSRAIR -11113333---3333-------------1111--------1111--------------- HLEKNEKTALVVEHDVLIDYVSDRLVFEGEPGKYGRALPPGREGNRFLASIGITFRRDPD --1111-----------------------2222--------------------------- TGRPRANKEGSVKDREQKEKGEYYYIA -------2222---------------- >Lysozyme C [Precursor]; SWP:P00698; PDB:1YQVH; EVQLQQSGAELMKPGASVKISCKASGYTFSDYWIEWVKQRPGHGLEWIGEILPGSGSTNY ------------2222-----------1111----------------------------- HERFKGKATFTADTSSSTAYMQLNSL 3333---------------------- >PERIPLASMIC [NIFE] HYDROG; SWP:P18187; PDB:1YQWA; AKHRPSVVWLHNAECTGCTEAAIRTIKPYIDALILDTISLDYQETIMAAAGEAAEAALHQ ------------------------------------------3333-------------- ALEGKDGYYLVVEGGLPTIDGGQWGMVAGHPMIETTKKAAAKAKGIICIGTCSAYGGVQK ---1111-----------%%%%----iiii---------1111-------------3333 AKPNPSQAKGVSEALGVKTINIPGCPPNPINFVGAVVHVLTKGIPDLDENGRPKLFYGEL -----------------------------------------------1111-3333---3 VHDNCPRLPHFEASEFAPSFDSEEAKKGFCLYELGCKGPVTYNNCPKVLFNQVNWPVQAG 333-1111--1111----1111--1111--3333--3333---3333--%%%%-3333-- HPCLGCSEPDFWDTMTPFYEQG ----1111-3333---1111-- >Periplasmic [NiFe] hydrog; SWP:P18188; PDB:1YQWQ; PTPQSTFTGPIVVDPITRIEGHLRIMVEVENGKVKDAWSSSQLFRGLEIILKGRDPRDAQ -----------------------------iiii------------3333-22223333-- HFTQRACGVCTYVHALASSRCVDDAVKVSIPANARMMRNLVMASQYLHDHLVHFYHLHAL ---------------------------------------------------------333 DWVDVTAALKADPNKAAKLAASIAPARPGNSAKALKAVQDKLKAFVESGQLGIFTNAYFL 3--33331111--------1111---3333--------------------!!!!--1111 GGHKAYYLPPEVNLIATAHYLEALHMQVKAASAMAILGGKNPHTQFTVVGGCSNYQGLTK --1111-----------------------------------------2222--3333--- DPLANYLALSKEVCQFVNECYIPDLLAVAGFYKDWGGIGGTSNYLAFGEFATDDSSPSKH ---------------------------------1111----------------------- LATSQFPSGVITGRDLGKVDNVDLGAIYEDVKYSWYAPGGDGKHPYDGVTDPKYTKLDDK 
-----------%%%%-------1111----1111---------3333--------2222- DHYSWMKAPRYKGKAMEVGPLARTFIAYAKGQPDFKKVVDMVLGKLSVPATALHSTLGRT ----------iiii-------------1111-----------------3333-------- AARGIETAIVCANMEKWIKEMADSGAKDNTLCAKWEMPEESKGVGLADAPRGALSHWIRI ------------------------------------------------1111-------- KGKKIDNFQLVVPATWNLGPRGAQGDKSPVEEALIGTPIADPKRPVEILRTVHAFDPCIA %%%%-------3333------1111------3333-----3333-------3333--333 CGVH 3--- >LETHAL FACTOR; SWP:P15917; PDB:1YQYA; LSRYEKWEKIKQHYQHWSDSLSEEGRGLLKKLQIPIEPKKDDIIHSLSQEEKELLKRIQI -----------------11113333------------------1111-----------11 DSSDFLSTEEKEFLKKLQIDIRDSLSEEEKELLNRIQVDSSNPLSEKEKEFLKKLKLDIQ 11------------------------------------------3333------3333-- PYDINQRLQDTGGLIDSPSINLDVRKQYKRDIQNIDALLHQSIGSTLYNKIYLYENMNIN -------------1111-----------------------------1111-------333 NLTATLGADLVDSTDNTKINRGIFNEFKKNFKYSISSNYMIVDINERPALDNERLKWRIQ 3---3333------1111--------3333------------------------------ LSPDTRAGYLENGKLILQRNIGLEIKDVQIIKQSEKEYIRIDAKVVPKSKIDTKIQEAQL -1111---------------------------iiii----------3333---------- NINQEWNKALGLPKYTKLITFNVHNRYASNIVESAYLILNEWKNNIQSDLIKKVTNYLVD -------1111-1111--------1111------------------3333--------11 GNGRFVFTDITLPNIAEQYTHQDEIYEQVHSKGLYVPESRSILLHGPSKGVELRNDSEGF 11--------333333331111-3333--------3333---------------3333-- IHEFGHAVDDYAGYLLDKNQSDLVTNSKKFIDIFKEEGSNLTSYGRTNEAEFFAEAFRLM ----------------1111--1111----------1111-3333--------------- HSTDHAERLKVQKNAPKTFQFINDQIKFIINSLV ---3333--------------------------- >COENZYME A DISULFIDE REDU; SWP:O52582; PDB:1YQZA; PKIVVVGAVAGGATCASQIRRLDKESDIIIFEKDRDMSFANCALPYVIGEVVEDRRYALA --------3333-------1111----------------3333------------1111- YTPEKFYDRKQITVKTYHEVIAINDERQTVSVLNRKTNEQFEESYDKLILSPGASANSLG ------------------------1111-------------------------------- FESDITFTLRNLEDTDAIDQFIKANQVDKVLVVGAGYVSLEVLENLYERGLHPTLIHRSD --1111------------------------------------------------------ KINKLMDADMNQPILDELDKREIPYRLNEEINAINGNEITFKSGKVEHYDMIIEGVGTHP 
--11113333--------1111------------!!!!--3333---------------- NSKFIESSNIKLDRKGFIPVNDKFETNVPNIYAIGDIATSHYRHVDLPASVPLAWGAHRA -1111-------1111----1111---2222---3333---------------------- ASIVAEQIAGNDTIEFKGFLGNNIVKFFDYTFASVGVKPNELKQFDYKMVEVTQGAHANY --------------------------!!!!-------11111111------------111 YPGNSPLHLRVYYDTSNRQILRAAAVGKEGADKRIDVLSMAMMNQLTVDELTEFEVAYAP 1----------------------------------------1111-33331111----11 PYSHPKDLINMIGYKAK 11----------1111- >PHOSPHINOTHRICIN ACETYLTR; SWP:Q8UGX8; PDB:1YR0A; SVELRDATVDDLSGIEIYNDAVVNTTAIWNEVVVDLENRKDWFAARTSRGFPVIVAILDG -------33333333------------------------------------------iii KVAGYASYGDWRAFDGYRHTREHSVYVHKDARGHGIGKRLQALIDHAGGNDVHVLIAAIE i------------3333----------1111---3333--------1111---------1 AENTASIRLHESLGFRVVGRFSEVGTKFGRWLDLTCELKL 111-------1111------------iiii---------- >CELL-DIVISION INITIATION ; SWP:Q5L0X5; PDB:1YR1A; GSEWRRIAYVYDRQTFFPLLENGRLLKQEGTKTAPSDAPVLVGWKDGDAIAEMTGQLAEL -----------%%%%----1111--1111-----3333---------------------- PAAVLGAMSEIHYKPTREYEDRVIVYMNDGYEVSATIRQFADKLSHYPAIAAALDRNVK 3333-----------1111-------1111-----3333---------3333------- >PROLYL OLIGOPEPTIDASE; SWP:Q9ZNM8; PDB:1YR2A; PPYPASPQVPLVEDHFGEKVSDPWRWLEADVRTDAKVAAWVQAQSAYTAAYLKQLPERAA --------------iiii---111111113333-----------------33331111-- LEKRMKALIDYERFGLPQRRGASVFYSWNSGLMNQSQLLVRPADAPVGTKGRVLLDPNTW -----1111----------!!!!------------------11112222------3333- ATALDAWAASDDGRLLAYSVQDGGSDWRTVKFVGVADGKPLADELKWVKFSGLAWLGNDA ---------1111--------!!!!---------1111---------------------- LLYSRFAEPLNYNQTVWLHRLGTPQSADQPVFATPELPKRGHGASVSSDGRWVVITSSEG -------------------22223333------1111---------1111---------- TDPVNTVHVARVTNGKIGPVTALIPDLKAQWDFVDGVGDQLWFVSGDGAPLKKIVRVDLS ------------iiii--------------------!!!!-----2222----------- GSTPRFDTVVPESKDNLESVGIAGNRLFASYIHDAKSQVLAFDLDGKPAGAVSLPGIGSA --------------------------------%%%%------1111-------------- SGLSGRPGDRHAYLSFSSFTQPATVLALDPATAKTTPWEPVHLTFDPADFRVEQVFYPSK 
-----2222--------------------1111------------3333---------11 DGTKVPMFIVRRKDAKGPLPTLLYGYGGFNVALTPWFSAGFMTWIDSGGAFALANLRGGG 11---------1111------------%%%%------3333------------------- EYGDAWHDAGRRDKKQNVFDDFIAAGEWLIANGVTPRHGLAIEGGSNGGLLIGAVTNQRP ------1111!!!!---------------1111--1111-------------------33 DLFAAASPAVGVMDMLRFDQFTAGRYWVDDYGYPEKEADWRVLRRYSPYHNVRSGVDYPA 33-----------1111---------------1111----------3333---------- ILVTTADTDDRVVPGHSFKYTAALQTAAIGPKPHLIRIEPIDKQIEETADVQAFLAHFTG ------------------------------------------------------------ LTPRPWSSVDKLAAALEHHH --------------3333-- >CYTOCHROME P450-CAM; SWP:P00183; PDB:1YRCA; NLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQ -----11113333----11111111------------1111-------iiii-------- LIREAYEDYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLEN ------------------------------1111--3333-----3333-33333333-- RIQELACSLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDG ------------3333---3333-----------------3333---------------- SMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGG --------------------------------------iiii------------------ LDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHG --3333---------------------3333--------------------------iii VQLKKGDQILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIV i--2222----33331111-----1111-1111-----1111-1111------------- TLKEWLTRIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPATTKAV --------------2222-------------------3333---- >HYPOTHETICAL PROTEIN PA32; SWP:Q9HYX1; PDB:1YREA; LPITLQRGALRLEPLVEADIPELVSLAEANREALQYMDGPTRPDWYRQSLAEQREGRALP ------!!!!-----3333-----------3333----1111------------------ LAVRLGVQLVGTTRFAEFLPALPACEIGWTWLDQAQHGSGLNRMIKYLMLKHAFDNLRMV ----!!!!----------3333----------3333------------------------ RVQLSTAASNLRAQGAIDKLGAQREGVLRNHRRLAGGRLDDTFVYSITDHEWPQVKAALE ------1111-------3333------------1111----------3333--------- ASF --- >PROTEIN KINASE C, DELTA T; SWP:Q05655; PDB:1YRKA; HAPFLRIAFNSYELGSLQAEDEANQPFCAVKMKEALSTERGKTLVQKKPTMYPEWKSTFD 
-----------------------------------------------------2222--- AHIYEGRVIQIVLMRAAEEPVSEVTVGVSVLAERCKKNNGKAEFWLDLQPQAKVLMSVQY ---2222--------2222-------3333-------iiii------------------- FL -- >KETOL-ACID REDUCTOISOMERA; SWP:P05793; PDB:1YRLA; NYFNTLNLRQQLAQLGKCRFMGRDEFADGASYLQGKKVVIVGCGAQGLNQGLNMRDSGLD -------------1111----333311113333---------------------1111-- ISYALRKEAIAEKRASWRKATENGFKVGTYEELIPQADLVINLTPDKQHSDVVRTVQPLM -----33331111-------1111----3333------------3333--------1111 KDGAALGYSHGFNIVEVGEQIRKDITVVMVAPKCPGTEVREEYKRGFGVPTLIAVHPEND ---------------------1111-----------------1111---------33331 PKGEGMAIAKAWAAATGGHRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSLLCFDKLVE 111-------------3333---------------------------------------- EGTDPAYAEKLIQFGWETITEALKQGGITLMMDRLSNPAKLRAYALSEQLKEIMAPLFQK -------------------------------3333-3333-------------------- HMDDIISGEFSSGMMADWANDDKKLLTWREETGKTAFETAPQYEGKIGEQEYFDKGVLMI -------------------%%%%-----------3333---------3333--------- AMVKAGVELAFETMVDSGIIEESAYYESLHELPLIANTIARKRLYEMNVVISDTAEYGNY --------------1111----------1111---------------------------- LFSYACVPLLKPFMAELQPGDLGKAIPEGAVDNGQLRDVNEAIRSHAIEQVGKKLRGYMT -1111------3333--2222---------------------11113333---------- DMKRIAV ------- >ALPHA-LACTALBUMIN; SWP:P29752; PDB:1YROA; TELTKCKVSHAIKDIDGYQGISLLEWACVLFHTSGYDTQAVVNDNGSTEYGLFQISDRFW -----------3333-2222----------------1111----------1111------ CKSSEFPESENICGISCDKLLDDELDDDIACAKKILAIKGIDYWKAYKPMCSEKLEQWRC --3333----1111---1111------------------33333333-------3333-- EKP --- >Beta-1,4-galactosyltransf; SWP:P08037; PDB:1YROB; LTACPEESPLLVGPMLIEFNIPVDLKLVEQQNPKVKLGGRYTPMDCISPHKVAIIIPFRN -------1111--------------------1111------------------------- RQEHLKYWLYYLHPILQRQQLDYGIYVINQAGESMFNKAKLLNVGFKEALKDYDYNCFVF ------------------------------------------------3333-------- SDVDLIPMNDHNTYRCFSQPRHISVAMDKFGFSLPYVQYFGGVSALSKQQFLSINGFPNN -1111---------------------3333-----1111--------------------- 
YWGWGGEDDDIYNRLAFRGMSVSRPNAVIGKTRMIRHSRDKKNEPNPQRFDRIAHTKETM ---------------1111------3333----------2222--1111-----333333 LSDGLNSLTYMVLEVQRYPLYTKITVDIGTPS 33-3333----------1111----------- >DEATH-ASSOCIATED PROTEIN ; SWP:O43293; PDB:1YRPA; STFRQEDVEDHYEMGEELGSGQFAIVRKCRQKGTGKEYAAKFIKKRRLSSSRRGVSREEI ------3333-------------------------------------------------- EREVNILREIRHPNIITLHDIFENKTDVVLILELVSGGELFDFLAEKESLTEDEATQFLK -------------------------------------------1111------------- QILDGVHYLHSKRIAHFDLKPENIMLLDKNVPNPRIKLIDFGIAHKIEAGNEFKNIFGTP -------------------3333-----------------------------------33 EFVAPEIVNYEPLGLEADMWSIGVITYILLSGASPFLGETKQETLTNISAVNYDFDEEYF 33-3333------3333----------------1111------------------33331 SNTSELAKDFIRRLLVKDPKRRMIAQSLEHSWIKA 111-------1111---3333--3333--3333-- >N-ACETYLGLUCOSAMINE-6-PHO; SWP:P0AF18; PDB:1YRRA; MYALTQGRIFTGHEFLDDHAVVIADGLIKSVCPVAELPPEIEQRSLNGAILSPGFIDVQL -----------------------iiii-----3333-2222----iiii----------- NGCGGVQFNDTAEAVSVETLEIMQKANEKSGCTNYLPTLITTSDELMKQGVRVMREYLAK --iiii----3333---------------------------------------------- HPNQALGLHLEGPWLNLVKKTHNPNFVRKPDAALVDFLCENADVITKVTLAPEMVPAEVI ----------------------------------------3333------1111-3333- SKLANAGIVVSAGHSNATLKEAKAGFRAGITFATHLYNAMPYITGREPGLAGAILDEADI ---1111------------------1111-----2222-----1111---------1111 YCGIIADGLHVDYANIRNAKRLKGDKLCLVTDATAPAGANIEQFIFAGKTIYYRNGLCVD ----------------------!!!!-------3333--------iiii----------1 ENGTLSGSSLTMIEGVRNLVEHCGIALDEVLRMATLYPARAIGVEKRLGTLAAGKVANLT 111------------------------------------------------2222----- AFTPDFKITKTIVNGNEVVTQ --1111------iiii----- >BIFUNCTIONAL HEMOLYSIN-AD; SWP:P15318; PDB:1YRTA; AGYANAADRESGIPAAVLDGIKAVAKEKNATLMFRLVNPHSTSLIAEGVATKGLGVHAKS -------------------------1111--------1111-----------3333---- SDWGLQAGYIPVNPNLSKLFGRAPEVIARADNDVNSSLAHGHTAVDLTLSKERLDYLRQA ---1111-----1111------3333---------------------------------- GLVTGMADGVVASNHAGYEQFEFRVKETSDGRYAVQYRRKGGDDFEAVKVIGNAAGIPLT 
---------------------------1111-------2222----------1111---- ADIDMFAIMPHLSNFRDSARSSVTSGDSVTDYLARTRRALDRERIDLLWKIARAGARSAV ------------------------------------------------------------ GTEARRQFRYDGDMNIGVITDFELEVRNALNRRAHAVGAQDVVQHGTEQNNPFPEADEKI -3333-----!!!!--------------------------------1111---------- FVVSATGESQMLTRGQLKEYIGQQRGEGYVFYENRAYGVAGKSLFDDGLGA ---1111----------------3333------11111111---------- >UBIQUITIN-CONJUGATING LIG; SWP:NA; PDB:1YRVA; SMGRAYLLLHRDFCDLKENNYKGITAKPVSEDMMEWEVEIEGLQNSVWQGLVFQLTIHFT 2-------------------2222-----3333------------1111----------1 SEYNYAPPVVKFITIPFHPNVDPHTGQPCIDFLDNPEKWNTNYTLSSILLALQVMLSNPV 111--------------1111--------1111-3333-1111-----------1111-- LENPVNLEAARILVKDESLYRTILRLFN ------------------------1111 >HYPOTHETICAL PROTEIN RSPH; SWP:Q53119; PDB:1YRXA; AGHMVSCCYRSLAAPDLTLRDLLDIVETSQAHNARAQLTGALFYSQGVFFQWLEGRPAAV -------------1111---------------------------iiii------------ AEVMTHIQRDRRHSNVEILAEEPIAKRRFAGWHMQLSCSEADMRSLGLAESRQIVTVG ---------1111-------------------------3333-1111----------- >XYLAN BETA-1,4-XYLOSIDASE; SWP:Q9K6P5; PDB:1YRZA; RIQNPILPGFHPDPSIVRVGDDYYIATSTFEWFPGVRIHHSRDLKHWRFVSSPLTRTSQL ------------------!!!!------!!!!-----------------------3333- DMKGNMNSGGIWAPCLSYHDGTFYLIYTDVKQWHGAFKDAHNYLVTAQNIEGPWSDPIYL -22222222---------iiii--------------------------1111-------- NSSGFDPSLFHDDDGRKWLVNMIWDYRKGNHPFAGIILQEYSEAEQKLVGPVKNIYKGTD ------------------------------------------1111------------33 IQLTEGPHLYKKDGYYYLLVAEGGTEYEHAATLARSQSIDGPYETDPSYPLVTSTGQPEL 33---------iiii----------1111--------1111----1111-------3333 ALQKAGHGSLVETQNGEWYLAHLCGRPLKGKYCTLGRETAIQKVNWTEDGWLRIEDGGNH ------------1111----------------1111----------1111---1111--- PLREVTAPDLPEHPFEKEPELDDFDAPQLHHQWNTLRIPADPSWCSLEERPGHLRLRGME -----------------------------3333-------3333-----2222------- SLTSVHSQSLVARRQQSFHCEVETKLEYQPESFQHMAGLVIYYDTEDHVYLHVTWHEEKG 1111---------------------------1111--------1111------------- 
KCLQIIQTKGGNYDELLASPIPLAEEKAVYLKGRIHRETMHLYFKQEGEAEWQPVGPTID --------iiii-----------1111--------!!!!--------------------- VTHMSDDSAKQVRFTGTFVGMATQDLSGTKKPADFDYFRYKE 3333-------------------------------------- >Lipase [Precursor]; SWP:P22088; PDB:1YS1X; ADNYAATRYPIILVHGLTGTDKYAGVLEYWYGIQEDLQQRGATVYVANLSGFQSDDGPNG --1111---------2222--2222----2222----1111---------------1111 RGEQLLAYVKTVLAATGATKVNLVGHSQGGLTSRYVAAVAPDLVASVTTIGTPHRGSEFA ---------------------------------------3333---------1111---- DFVQGVLAYDPTGLSSTVIAAFVNVFGILTSSSNNTNQDALAALKTLTTAQAATYNQNYP ---------1111-----------------1111-------------------------- SAGLGAPGSCQTGAPTETVGGNTHLLYSWAGTAIQPTISVGGVTGATDTSTIPLVDPANA 1111-2222---------iiii-----------------iiii----3333----3333- LDPSTLALFGTGTVMVNRGSGQNDGVVSKCSALYGQVLSTSYKWNHLDEINQLLGVRGAN -3333---------1111---------3333-------------3333----iiii-111 AEDPVAVIRTHANRLKLAGV 1--------------1111- >ASPARTATE-SEMIALDEHYDE DE; SWP:Q57658; PDB:1YS4A; KIKVGVLGATGSVGQRFVQLLADHPFELTALAASERSAGKKYKDACYWFQDRDIPENIKD -------1111----------------------3333---3333----------3333-- VVIPTDPKHEEFEDVDIVFSALPSDLAKKFEPEFAKEGKLIFSNASAYREEDVPLVIPEV -----11111111---------------------1111-------11111111---3333 NADHLELIEIQREKRGWDGAIITNPNCSTICAVITLKPIDKFGLEAVFIATQAVSGAGYN -----------------------------------3333--------------3333--- GVPSAILDNLIPFIKNEEEKQTESLKLLGTLKDGKVELANFKISASCNRVAVIDGHTESI -------------2222--------------%%%%------------------------- FVKTKEGAEPEEIKEVDKFDPLKDLNLPTYAKPIVIREEIDRPQPRLDRNEGNGSIVVGR -------------------1111---1111-------------33331111--------- IRKDPIFDVKYTALEHNTIRGAAGASVLNAEYFVKKYI ---------------1111------------------- >LIPOPROTEIN; SWP:NA; PDB:1YS5A; MQSHSALTAFQTEQIQDSEHSGKMVAKRQFRIGDIAGEHTSFDKLPEGGRATYRGTAFGS ----------------3333------------------3333----------------11 DDAGGKLTYTIDFAAKQGNGKIEHLKSPELNVDLAAADIKPDGKRHAVISGSVLYNQAEK 11------------------------3333------------!!!!-------------- GSYSLGIFGGKAQEVAGSAEVKTVNGIRHIGLAAKQ ------------------------------------ 
>TRANSCRIPTIONAL REGULATOR; SWP:Q10531; PDB:1YS7A; SPRVLVVDDDSDVLASLERGLRLSGFEVATAVDGAEALRSATENRPDAIVLDINMPVLDG ---------------------1111----------------------------------- VSVVTALRAMDNDVPVCVLSARSSVDDRVAGLEAGADDYLVKPFVLAELVARVKALLRRR -------1111--------------------1111---------3333------------ GSTATSSSETITVGPLEVDIPGRRARVNGVDVDLTKREFDLLAVLAEHKTAVLSRAQLLE ------------!!!!---1111---iiii-----------------2222--------- LVWGYDFADTNVVDVFIGYLRRKLEAGPRLLHTVRGVGFVLRMQ ---------------------------------2222------- >PROTEIN SPY1043; SWP:Q99ZW4; PDB:1YS9A; PYKGYLIDLDGTIYQGKNRIPAGERFIKRLQERGIPYLLVTNNTTRTPEMVQSMLANQFH --------2222--!!!!-3333------------------------------------- VETSIETIYTATMATVDYMNDMNRGKTAYVIGETGLKSAIAAAGYVEELENPAYVVVGLD ---3333---------------------------------1111---------------- SQVTYEMLAIATLAIQKGALFIGTNPDLNIPTERGLMPGAGALNALLEAATRVKPVFIGK ---3333----------------------------------------------------- PNAIIMNKSLEVLGIQRSEAVMVGDNYLTDIMAGIQNDIATILVTTGFTRPEEVPTLPIQ ---------------3333------3333-----1111-----------33331111--- PDHVLSSLDEWRL ------3333--- >DNA-BINDING PROTEIN SATB1; SWP:Q01826; PDB:1YSEA; NTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLR ---------------------3333----------3333-------3333---------- AMQNFLQLPEAERDRIYQDERERSLNAA ---------------------------- >APOPTOSIS REGULATOR BCL-X; SWP:Q07817; PDB:1YSGA; MSMAMSQSNRELVVDFLSYKLSQKGYSWSQFSDVEENRTEAPEGTESEAVKQALREAGDE --------1111--------3333---1111----------------------------- FELRYRRAFSDLTSQLHITPGTAYQSFEQVVNELFRDGVNWGRIVAFFSFGGALCVESVD -----3333---1111---------------3333---------------------3333 KEMQVLVSRIAAWMATYLNDHLEPWIQENGGWDTFVELYGNNAAAESRKGQERLEHHHHH ---------------------3333-33333333---------%%%%------------- H - >PROTEIN YXEP; SWP:P54955; PDB:1YSJA; ADKAFHTRLINRRDLHEHPELSFQEVETTKKIRRWLEEEQIEILDVPQLKTGVIAEIKGR -3333---------------2222------------1111-----3333----------- EDGPVIAIRADIDALPIQEQTNLPFASKVDGTHACGHDFHTASIIGTALLNQRRAELKGT ----------------------1111--2222----------------33333333---- 
VRFIFQPAEEIAAGARKVLEAGVLNGVSAIFGHNKPDLPVGTIGVKEGPLASVDRFEIVI -------3333----------1111---------11112222------------------ KGKNSIDPIAAAGQIISGLQNAVVSITRVQAGTSWNVIPDQAEEGTVRTFQKEARQAVPE ---------------------------------------------------------333 HRRVAEGIAAGYGAQAEFKWFPYLPSVQNDGTFLNAASEAAARLGYQTVHAEQSPGGEDF 3-------3333-----------------3333--------1111------------333 ALYQEKIPGFFVWGTNGTEEWHHPAFTLDEEALTVASQYFAELAVIVLETI 33333--------------2222---------------------------- >HMG-COA SYNTHASE; SWP:Q9FD71; PDB:1YSLA; MTIGIDKISFFVPPYYIDMTALAEARNVDPGKFHIGIGQDQMAVNPISQDIVTFAANAAE -----------------------1111------------------1111----------- AILTKEDKEAIDMVIVGTESSIDESKAAAVVLHRLMGIQPFARSFEIKEAYGATAGLQLA ------------------------------------------------------------ KNHVALHPDKKVLVVAADIAKYGLNSGGEPTQGAGAVAMLVASEPRILALKEDNVMLTQD ------1111------------2222-3333----------------------------- IYDFWRPTGHPYPMVDGPLSNETYIQSFAQVWDEHKKRTGLDFADYDALAFHIPYTKMGK ------2222-------------------------------3333---------1111-- KALLAKISDQTEAEQERILARYEESIIYSRRVGNLYTGSLYLGLISLLENATTLTAGNQI ------1111----------------3333-----1111-----------11112222-- GLFSYGSGAVAEFFTGELVAGYQNHLQKETHLALLDNRTELSIAEYEAMFAETLDTDIDQ ------------------2222------------1111----------------1111-- TLEDELKYSISAINNTVRSYRN -----2222----iiii----- >CALCYCLIN-BINDING PROTEIN; SWP:Q9CXW3; PDB:1YSMA; MASVLEELQKDLEEVKVLLEKSTRKRLRDTLTSEKSKIETELKNKMQQKSQKKPE ------------------------3333--------------------%%%%--- >TRANSCRIPTIONAL REGULATOR; SWP:P76268; PDB:1YSPA; MDLIRSADIQMRELSRLTKETIHLGALDEDSIVYIHKIDSMIGRRNPLYSTAIGKVLLAW ---------------------------------------------------------111 RDRDEVKQILEGVEYKRSTERTITSTEALLPVLDQVREQGYGEDNEEQEEGLRCIAVPVF 13333----1111-----1111--3333--------------------2222-------- DRFGVVIAGLSISFPTLRFSEERLQEYVAMLHTAARKISAQMGY 1111-----------------------------------1111- >HTH-TYPE TRANSCRIPTIONAL ; SWP:P37671; PDB:1YSQA; SLNIIHIAAPHLEALNIATGETINFSSREDDHAILIYKLEPTTGMLRTRAYIGQHMPLYC 
----------------------------!!!!------------------2222--3333 SAMGKIYMAFGHPDYVKSYWESHQHEIQPLTRNTITELPAMFDELAHIRESGAAMDREEN -----------3333--------3333---1111-------------------------- ELGVSCIAVPVFDIHGRVPYAVSISLSTSRLKQVGEKNLLKPLRETAQAISNELGFTAIT 2222--------1111-------------------3333------------------111 G 1 >SENSOR-TYPE HISTIDINE KIN; SWP:Q10560; PDB:1YSRA; DDHVPVDITDLLDRAAHDAARIYPDLDVSLVPSPTCIIVGLPAGLRLAVDNAIANAVKHG ----------------------2222---------------------------------- GATLVQLSAVSSRAGVEIAIDDNGSGVPEGERQVVFERFSLGLALVAQQAQLHGGTASLE -----------3333------------1111-------------------1111------ NSPLGGARLVLRLPGPS -1111------------ >REPLICASE POLYPROTEIN 1AB; SWP:P59641; PDB:1YSYA; GHSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSV -----------------------------3333------3333----------------- LLSMQGAVDINRLCEEMLDNRATLQ -------------33331111---- >RIBONUCLEASE D; SWP:P09155; PDB:1YT3A; MNYQMITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEHLALIDPLGI ------------------1111---------------------------------3333- TDWSPLKAILRDPSITKFLHAGSEDLEVFLNVFGELPQPLIDTQILAAFCGRPMSWGFAS -----------1111--------------------------------1111-1111---- MVEEYSGVTLDKSESRTDWLARPLTERQCEYAAADVWYLLPITAKLMVETEASGWLPAAL -----------1111--1111-----------------------------1111------ DECRLMQMRRQEVVAPEDAWRDITNAWQLRTRQLACLQLLADWRLRKARERDLAVNFVVR --------3333--333333331111---------------------------3333--- EEHLWSVARYMPGSLGELDSLGLSGSEIRFHGKTLLALVEKAQTLPEDALPQPMLNLMDM -----------------------------------------11111111------33332 PGYRKAFKAIKSLITDVSETHKISAELLASRRQINQLLNWHWKLKPQNNLPELISGWRGE 222--------------------3333-----------------------3333------ LMAEALHNLLQEYPQ ---------1111-- >INORGANIC POLYPHOSPHATE/A; SWP:Q9X255; PDB:1YT5A; MKIAILYREEREKEGEFLKEKISKEHEVIEFGEANAPGRVTADLIVVVGGDGTVLKAAKK --------------------------------3333------------------------ AADGTPMVGFKAGRLGFLTSYTLDEIDRFLEDLRNWNFREETRWFIQIESELGNHLALND ---------------------1111-------1111-------------1111------- 
VTLERDLSGKMVEIEVEVEHHSSMWFFADGVVISTPTGSTAYSLSIGGPIIFPECEVLEI -----1111--------!!!!--------------1111--3333------1111----- SPIAPQFFLTRSVVIPSNFKVVVESQRDINMLVDGVLTGKTKRIEVKKSRRYVRILRPPE -----%%%%------1111-------------iiii---------------------333 YDYVTVIRDKLGYGRR 33333----------- >THIOSULFATE SULFURTRANSFE; SWP:Q9I0N4; PDB:1YT8A; IAVRTFHDIRAALLARRELALLDVREEDPFAQAHPLFAANLPLSRLELEIHARVPRRDTP -----------------------------3333-1111---3333---3333---1111- ITVYDDGEGLAPVAAQRLHDLGYSDVALLDGGLSGWRNAGGELFRDVNVPSKAFGELVEA ------------------1111------2222----1111-------------------- ERHTPSLAAEEVQALLDARAEAVILDARRFDEYQTSIPGGISVPGAELVLRVAELAPDPR ---------------1111-----------------2222----1111--3333---333 TRVIVNCAGRTRSIIGTQSLLNAGIPNPVAALRNGTIGWTLAGQQLEHGQTRRFGAISQD 3------------------------------2222----1111----------------- TRKAAAQRARAVADRAGVERLDLAGLAQWQDEHDRTTYLLDVRTPEEYEAGHLPGSRSTP -------------1111--------------1111-----------------2222---- GGQLVQETDHVASVRGARLVLVDDDGVRANSASWLAQGWQVAVLDGLSEADFSERGAWSA ------3333---2222---------3333-----------------3333--------- PLPRQPRADTIDPTTLADWLGEPGTRVLDFTASANYAKRHIPGAAWVLRSQLKQALERLG ----------------------------------------2222---1111--------- TAERYVLTCGSSLLARFAVAEVQALSGKPVFLLDGGTSAWVAAGLPTEDGESLLASPRID ------------3333----------------2222----1111---------------- RYRRPYEGTDNPREAQGYLDWEFGLVEQLGRDGTHGFFVIE ---1111----3333-------------------------- >OLIGORIBONUCLEASE; SWP:P39287; PDB:1YTAA; SANENNLIWIDLETGLDPERDRIIEIATLVTDANLNILAEGPTIAVHQSDEQLALDDWNV --1111----------3333-----------1111-------------3333-------- RTHTASGLVERVKASTGDREAELATLEFLKQWVPAGKSPICGNSIGQDRRFLFKYPELEA ---1111--------------------3333--2222-------3333--------3333 YFHYRYLDVSTLKELARRWKPEILDGFTKQGTHQADDIRESVAELAYYREHFIKL -------3333--------33331111---------------------------- >PROTEIN (TATA BINDING PRO; SWP:P13393; PDB:1YTBA; SGIVPTLQNIVATVTLGCRLDLKTVALHARNAEYNPKRFAAVIMRIREPKTTALIFASGK ----------------------------------3333-----------------3333- 
MVVTGAKSEDDSKLASRKYARIIQKIGFAAKFTDFKIQNIVGSCDVKFPIRLEGLAFSHG -----------------------3333--------------------------------1 TFSSYEPELFPGLIYRMVKPKIVLLIFVSGKIVLTGAKQREEIYQAFEAIYPVLSEFRKM 111--3333-----------------3333--------3333------------1111-- >YEAST ISO-2 CYTOCHROME C; SWP:P00045; PDB:1YTC; GSAKKGATLFKTRCQQCHTIEEGGPNKVGPNLHGIFGRHSGQVKGYSYTDAIINKNVKWD -------------3333---2222-------2222-------2222-------------- EDSMSEYLTNPKKYIPGTKMAFAGLKKEKDRNDLITYMTKAAK ---------3333-2222------------------------- >Acetyl-CoA decarbonylase/; SWP:O30273; PDB:1YTLA; KMATLLEKGKPVANMIKKAKRPLLIVGPDMTDEMFERVKKFVEKDITVVATGSAITRFID ------------------------------3333------1111------!!!!----11 AGLGEKVNYAVLHELTQFLLDPDWKGFDGQGNYDLVLMLGSIYYHGSQMLAAIKNFAPHI 111111--------------1111-1111---------------------------3333 RALAIDRYYHPNADMSFGNLWKKEEDYLKLLDEILAEL ---------1111------3333-----------1111 >PHOSPHOENOLPYRUVATE CARBO; SWP:O09460; PDB:1YTMA; SLSESLAKYGITGATNIVHNPSHEELFAAETQASLEGFEKGTVTEMGAVNVMTGVYTGRS ------1111---------------------1111!!!!----3333-----!!!!---3 PKDKFIVKNEASKEIWWTSDEFKNDNKPVTEEAWAQLKALAGKELSNKPLYVVDLFCGAN 333-----3333------3333-------3333--------------------------3 ENTRLKIRFVMEVAWQAHFVTNMFIRPTEEELKGFEPDFVVLNASKAKVENFKELGLNSE 333------------------------33332222--------3333---3333------ TAVVFNLAEKMQIILNTWYGGEMKKGMFSMMNFYLPLQGIAAMHCSANTDLEGKNTAIFF ------------------3333-----------3333------------1111------- GLSGTGKTTLSTDPKRLLIGDDEHGWDDDGVFNFEGGCYAKVINLSKENEPDIWGAIKRN -22223333---1111----------3333-----------22223333----3333222 ALLENVTVDANGKVDFADKSVTENTRVSYPIFHIKNIVKPVSKAPAAKRVIFLSADAFGV 2-------1111--11113333-------1111----------------------1111- LPPVSILSKEQTKYYFLSGFTAKLAGTERGITEPTPTFSSCFGAAFLTLPPTKYAEVLVK -----------------------%%%%---------------3333---3333------- RMEASGAKAYLVNTGWNGTGKRISIKDTRGIIDAILDGSIDTANTATIPYFNFTVPTELK ----------------1111--------------11113333------------------ GVDTKILDPRNTYADASEWEVKAKDLAERFQKNFKKF --3333-3333---------------------3333- >BETA CRYSTALLIN B2; 
SWP:P43320; PDB:1YTQA; LNPKIIIFEQENFQGHSHELNGPCPNLKETGVEKAGSVLVQAGPWVGYEQANCKGEQFVF ----------%%%%-----------3333---------------------%%%%------ EKGEYPRWDSWTSSRRTDSLSSLRPIKVDSQEHKIILYENPNFTGKKMEIIDDDVPSFHA ------1111------------------------------%%%%------------3333 HGYQEKVSSVRVQSGTWVGYQYPGYRGLQYLLEKGDYKDSSDFGAPHPQVQSVRRIRDMQ --------------------------------------3333------------------ W - >Troponin T, fast skeletal; SWP:P12620; PDB:1YTZT; SYSSYLAKADQKRGKKQTARETKKKVLAERRKPLNIDHLNEDKLRDKAKELWDWLYQLQT -1111---------------------3333----------3333---------------- EKYDFAEQIKRKKYEIVTLRNRIDQAQKHS ------------------------------ >MAJOR TROPISM DETERMINANT; SWP:Q775D6; PDB:1YU0A; VQFRGGTTAQHATFTGAAREITVDTDKNTVVVHDGATAGGFPLARHDLVKTAFIKADKSA ------33331111--2222----------------2222----33332222----1111 VAFTRTGNATASIKAGTIVEVNGKLVQFTADTAITMPALTAGTDYAIYVCDDGTVRADSN -------------2222---iiii---------------2222------1111------- FSAPTGYTSTTARKVGGFHYAPGSNAAAQAGGNTTAQINEYSLWDIKFRPAALDPRGMTL ---22223333---------------------------1111--1111------2222-- VAGAFWADIYLLGVNHLTDGTSKYNVTIADGSASPKKSTKFGGDGSAAYSDGAWYNFAEV iiii----------3333-----------1111----1111-----------3333---- MTHHGKRLPNYNEFQALAFGTTEATSSGGTDVPTTGVNGTGATSAWNIFTSKWGVVQASG -1111----3333--------------------2222-2222--3333------------ CLWTWGNEFGGVNGASEYTANTGGRGSVYAQPAAALFGGAWNGTSLSGSRAALWYSGPSF ---------------------------------------11113333---------1111 SFAFFGARGVCDHLIL ---------------- >Villin-1; SWP:P02640; PDB:1YU5X; PTKLETFPLDVLVNTAAEDLPRGVDPSRKENHLSDEDFKAVFGMTRSAFANLPLWKQQNL ---------------3333-11111111-1111---------------1111-------- KKEKGLF -1111-- >RRNA METHYLTRANSFERASE; SWP:P21236; PDB:1YUB; MNKNIKYSQNFLTSEKVLNQIIKQLNLKETDTVYEIGTGKGHLTTKLAKISKQVTSIELD --------------------1111-------------------3333------------- SHLFNLSSEKLKLNTRVTLIHQDILQFQFPNKQRYKIVGNIPYHLSTQIIKKVVFESRAS ------------------------------------------------------------ DIYLIVEEGFYKRTLDIHRTLGLLLHTQVSIQQLLKLPAECFHPKPKVNSVLIKLTRHTT 
--------3333---------33331111---------1111------------------ DVPDKYWKLYTYFVSKWVNREYRQLFTKNQFHQAMKHAKVNNLSTITYEQVLSIFNSYLL --33333333--------------------------------1111-------------- FNGRK ----- >ORPHAN NUCLEAR RECEPTOR N; SWP:O00482; PDB:1YUCA; ASIPHLILELLKCEPDEPQVQAKIMAYLQQEQANRSKHEKLSTFGLMCKMADQTLFSIVE ---3333---1111-----------------33331111--------------------- WARSSIFFRELKVDDQMKLLQNCWSELLILDHIYRQVVHGKEGSIFLVTGQQVDYSIIAS ------3333-3333-------------------------2222--1111---3333--- QAGATLNNLMSHAQELVAKLRSLQFDQREFVCLKFLVLFSLDVKNLENFQLVEGVQEQVN ---------------------------------------1111----3333--------- AALLDYTMCNYPQQTEKFGQLLLRLPEIRAISMQAEEYLYYKHLNGDVPYNNLLIEMLHA ----------3333---------------------------------------------- >HYPOTHETICAL PROTEIN SO07; SWP:Q8EIN8; PDB:1YUDA; QNADDFIKFLELEQHVEGGFYRSSYRSETAFDPSRQLWSSIYFLLRTGEVSHFHRLTADE -------1111---1111---------------------------1111----------- WYFHAGQSLTIYISPEGELTTAQLGLDLAAGERPQFLVPKGCIFGSANQDGFSLVGCVSP -------------1111------------------------------------------- GFTFDDFELFSQEALLAYPQHKAVVQKLSRPE ---1111-----3333--1111-1111----- >HEAD VERTEX PROTEIN GP24; SWP:P19896; PDB:1YUEA; AKINELLRESTTTNSNSIGRPNLVALTRATTKLIYSDIVATQRTNQPVAAFYGIKYLNPD --------3333-----------------3333-3333---------------------- NEQITELTEESKLTLNKGDLFKYNNIVYKVLEDTPFATIEESDLELALQIAIVLLKVRLF -------111111112222---iiii---------------------------------- SDEIADARFQINKWQTAVKSRKLKTGITVELAQDLEANGFDAPNFLEDLLATEADEINKD --------------------------------------3333------------------ ILQSLITVSKRYKVTGITDSGFIDLSYASAPEAGRSLYRVCEVSHIQKESTYTATFCVAS ----------------------------------3333--------1111---------- ARAAAILAASGWLKHKPEDDKYLSQNAYGFLANGLPLYCDTNSPLDYVIVGVVENIGEKE ------------------------------3333---------------------!!!!- IVGSIFYAPYTEGLDLDDPEHVGAFKVVVDPESLQPSIGLLVRYALSANPYTVAKDEKEA ------------------------------------------------1111---1111- RIIDGGDDKAGRSDLSVLLGVKLPK ---1111------------------ >FAB FRAGMENT; SWP:NA; PDB:1YUHH; 
QVQFQQSGAELVKPGASVKLSCKASGYTFTSYLMHWIKQRPGRGLEWIGRIDPNNVVTKF ------------2222-------------------------------------------- NEKFKSKATLTVDKPSSTAYMELSSLTSEDSAVYYCARYAYCRPMDYWGQGTTVTVSSAA 3333---------1111------------------------------------------- TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGALSSGVHTFPAVLQSDLY ------------------------------------------------------------ TLSSSVTVPASTWPSGTVTCNVAHPASSTAVDKKIVPR -------------------------------------- >GAGA-FACTOR; SWP:Q08605; PDB:1YUIA; PKAKRAKHPPGTEKPRSRSQSEQPATCPICYAVIRQSRNLRRHLELRHFAKPGV --------------11113333-------------3333--------------- >INTEGRIN BETA-2 A CHAIN; SWP:P05107; PDB:1YUKA; QECTKFKVSSCRECIESGPGCTWCQKLNFTGPGDPDSIRCDTRPQLLMRGCAADDIMDPT -----------------1111----2222-----------------1111-1111----- SLAETQEQKQLSPQKVTLYLRPGQAAAFNVTFRR --------------------2222---------- >Integrin beta-2 [Precurso; SWP:P05107; PDB:1YUKB; SRVFLDHNALPDTLKVTYDSFCSNGVTHRNQPRGDCDGVQINVPITFQVKVTATECIQEQ ----------1111--------%%%%-------------2222----------------- SFVIRALGFTDIVTVQVLPQCECRCRDQSRDRSLCHGKGFLECGICRCDTGYIGKNCEHH -----2222------------------1111-2222-----iiii---2222-------- >'Probable nicotinate-nucl; SWP:Q9HX21; PDB:1YUMA; GKRIGLFGGTFDPVHIGHMRSAVEMAEQFALDELRLLPNARPPHRETPQVSAAQRLAMVE -----------------------------------------1111--------------- RAVAGVERLTVDPRELQRDKPSYTIDTLESVRAELAADDQLFMLIGWDAFCGLPTWHRWE --2222------3333---------------11111111------3333--1111--111 ALLDHCHIVVLQRPDADSEPPESLRDLLAARSVADPQALKGPGGQITFVWQTPLAVSATQ 1-------------------3333----------1111---------------------- IRALLGAGRSVRFLVPDAVLNYIEAHHLYRAP ----1111--2222------------------ >BETA-LACTOGLOBULIN; SWP:P02755; PDB:1YUPA; IIVTQTMKDLDVQKVAGTWYSLAMAASDISLLDAQSAPLRVYVEELKPTPGGDLEILLQK ----------3333-------------3333--1111-----------3333-------- WENGKCAQKKIIAEKTEIPAVFKIDALNENKVLVLDTDYKKYLLFCMENSAEPEQSLACQ -----------------1111----%%%%-------------------3333-1111--- CLVRTPEVDDEAMEKFDKALKALPMHIRLSFNPTQLEEQCRV ------------------------------------------ >S100 
CALCIUM-BINDING PROT; SWP:Q99584; PDB:1YURA; MAAEPLTELEESIETVVTTFFTFARQEGRKDSLSVNEFKELVTQQLPHLLKDVGSLDEKM ------------------------------------------------------------ KSLDVNQDSELKFNEYWRLIGELAKEIRKKKDLKIRKK -----3333----------------------3333--- >RNA-DEPENDENT RNA POLYMER; SWP:P26660; PDB:1YUYA; SMSYSWTGALITPCSPEEEKLPINPLSNSLLRYHNKVYCTTSKSASLRAKKVTFDRMQVL -----------------------3333-----3333----3333---------------- DAYYDSVLKDIKLAASKVSARLLTLEEACQLTPPHSARSKYGFGAKEVRSLSGRAVNHIK ----------------------------11111111------------------------ SVWKDLLEDSQTPIPTTIMAKNEVFCVDKKAARLIVYPDLGVRVCEKMALYDVTQKLPQA --------------------------------------3333--------3333------ VMGASYGFQYSPAQRVEFLLKAWAEKKDPMGFSYDTRCFDSTVTERDIRTEESIYQACSL -!!!!3333----------------------------3333-------------3333-- PEEARTAIHSLTERLYVGGPMFNSKGQSCGYRRCRASGVLTTSMGNTITCYVKALAACKA --------------3333----1111------------1111------------------ AGIVAPTMLVCGDDLVVISESQGTEEDERNLRAFTEAMTRYSAPPGDPPRPEYDLELITS ----------!!!!------------------------1111-----------3333--% CSSNVSVALGPQGRRRYYLTRDPTTPIARAAWETVRHSPVNSWLGNIIQYAPTIWVRMVL %%%------1111------------------------------------11113333--- MTHFFSILMAQDTLDQNLNFEMYGSVYSVSPLDLPAIIERLHGLDAFSLHTYTPHELTRV ------------1111-----iiii----3333---------3333-------------- ASALRKLGAPPLRAWKSRARAVRASLISRGGRAAVCGRYLFNWAVKTKLKLTPLPEARLL ----------3333--------------------------3333---------3333--- DLSSWFTVGAGGGDIYHS ---3333----------- >NIGERYTHRIN; SWP:P30820; PDB:1YUZA; MKVRAQVPTVKNATNFNMVADSKTAVGSTLENLKAAIAGETGAHAKYTAFAKAAREQGYE --------3333------------------------------------------1111-- QIARLFEATAAAELIHIGLEYALVAEMEPGYEKPTVPSAYSCDLNLISGANGEIYETSDM ---------------------------1111----------------------------- YPAFIRKAQEEGNSKAVHVFTRAKLAESVHAERYLAAYNDIDAPDDDKFHLCPICGYIHK ---------------------------------------1111----------------- GEDFEKCPICFRPKDTFTAY ------------3333---- >P-30 PROTEIN; SWP:P22069; PDB:1YV4A; DWLTFQKKHITNTRDVDCDNILSTNLFHCKDKNTFIYSRPEPVKAICKGIIASKNVLTTS 
----------------3333---3333------------3333-1111------------ EFYLSDCNVTSRPCKYKLKKSTNKFCVTCENQAPVHFVGVGSC -----------2222--------------%%%%---------- >HYDROLASE, HALOACID DEHAL; SWP:Q836C7; PDB:1YV9A; SLDYQGYLIDLDGTIYLGKEPIPAGKRFVERLQEKDLPFLFVTNNTTKSPETVAQRLANE ----------2222---------------------------------------------- FDIHVPASLVYTATLATIDYMKEANRGKKVFVIGEAGLIDLILEAGFEWDETNPDYVVVG -----3333-------------------------1111----1111-------------- LDTELSYEKVVLATLAIQKGALFIGTNPDKNIPTERGLLPGAGSVVTFVETATQTKPVYI -11113333-------1111-------------1111----------------------- GKPKAIIMERAIAHLGVEKEQVIMVGDNYETDIQSGIQNGIDSLLVTSGFTPKSAVPTLP ------------------------------------------------------------ TPPTYVVDSLDEWTFEG --------3333----- >FALCIPAIN 2; SWP:Q9N6S8; PDB:1YVBA; QMNYEEVIKKYREENFMDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKN ----------------------------------------3333---------------- KLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELG -----------------!!!!--3333--------- >MRR5; SWP:Q6M142; PDB:1YVCA; MAFGKPAMKNVPVEAGKEYEVTIEDMGKGGDGIARIDGFVVFVPNAEKGSVINVKVTAVK -----------------------------------iiii-------2222---------- EKFAFAERVL ---------- >RAS-RELATED PROTEIN RAB-2; SWP:P35285; PDB:1YVDA; SLRELKVCLLGDTGVGKSSIVWRFVEDSFDPNINPTIGASFMTKTVQYQNELHKFLIWDT -----------2222--------------1111--------------!!!!--------- AGQERFRALAPMYYRGSAAAIIVYDITKEETFSTLKNWVRELRQHGPPSIVVAIAGNKCD --333311113333----------11113333-------------------------333 LTDVREVMERDAKDYADSIHAIFVETSAKNAININELFIEISRRIP 31111-----------1111------------3333---------- >HCV NS5B POLYMERASE; SWP:NA; PDB:1YVFA; MSMSYTWTGALITPCAAEESKLPINALSNSLLRHHNLVYSTTSRSASLRQKKVTFDRLQV --------------------------1111---1111----3333--------------- LDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSKFGYGAKDVRSLSSRAVNHI ---------------------------------1111-----------1111-------- RSVWKDLLEDTDTPIQTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVV ----------------------------3333------------3333---------111 STLPQAVMGSSYGFQYSPKQRVEFLVNTWKAKKCPMGFSYDTRCFDSTVTENDIRVEESI 
1------!!!!3333-------------1111-----------3333------------- YQCCDLAPEARQAIRSLTERLYVGGPMTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKA --------------------3333----1111------------1111------------ AAACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPPGDPPQPEYD ----------------!!!!------------------------1111-----------1 LELITSCSSNVSVAHDASGKRVYYLTRDPTVPLARAAWETARHTPVNSWLGNIIMYAPTL 111--%%%%---------------------------1111---------------1111- WARMILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSP ------------------1111-----iiii----3333---------3333-------- GEINRVASCLRKLGVPPLRVWRHRARSVRAKLLSQGGRAAICGKYLFNWAVRTKLKLTPI -------------------------------------3333-----3333---------- PAASRLLLSGWFVAGYSGGDIYHS ---------------2222----- >TETANUS TOXIN, LIGHT CHAI; SWP:P04958; PDB:1YVGA; PITINNFRYSDPVNNDTIIMMEPPYCKGLDIYYKAFKITDRIWIVPERYEFGTKPEDFNP -------1111-----------1111------------2222-------22223333--- PSSASEYYDPNYLRTDSDKDRFLQTMVKLFNRIKNNVAGEALLDKIINAIPYLGNSYSLL --------1111-----------------------3333-------------------11 DKFDTNSNSVSFNLLEQDPSGATTKSAMLTNLIIFGPGPVLNKNEVRGIVLRVDNKNYFP 11----1111-------3333------------------1111---------------11 CRDGFGSIMQMAFCPEYVPTFDNSKYFQDPALLLMHELIHVLHGLYGMQVSSHEIPISAE 11-----------3333-----------3333-------------------------333 ELFTFGGQDANLISIDIKNDLYEKTLNDYKAIANKLSQVTSCNDPNIDIDSYKQIYQQKY 33333-3333-------------------------------------3333--------- QFDKDSNGQYIVNEDKFQILYNSIMYGFTEIELGKKFNIKTRLSYFSMNHDPVKIPNLLD ----1111--------------------------1111-----1111---------1111 DTIYNDTEGFNIESKDLKSEYKGQNMRVNTNAFRNVD -------!!!!3333---%%%%------1111----- >CBL E3 UBIQUITIN PROTEIN ; SWP:P22681; PDB:1YVHA; PGTVDKKMVEKCWKLMDKVVRLCQNPKLALKNSPPYILDLLPDTYQHLRTILSRYEGKME ------------------------3333-------3333--------------------- TLGENEYFRVFMENLMKKTKQTISLFKEGKERMYEENSQPRRNLTKLSLIFSHMLAELKG ----------------------------------2222---------------------- IFPSGLFQGDTFRITKADAAEFWRKAFGEKTIVPWKSFRQALHEVHPISSGLEAMALKST -2222--3333---------------!!!!------------------------------ 
IDLTCNDYISVFEFDIFTRLFQPWSSLLRNWNSLAVTHPGYMAFLTYDEVKARLQKFIHK -1111-----------------3333-----------3333------------3333--2 PGSYIFRLSCTRLGQWAIGYVTADGNILQTIPHNKPLFQALIDGFREGFYLFPDGRNQNP 222------------------1111---------------------------iiii---- DLTG ---- >HISTIDINE-CONTAINING PHOS; SWP:Q6VAK4; PDB:1YVIA; SAAAALRDQLTALLSSMFSQGLVDEQFQQLQMLQDTPGFVSEVVTLFCDDADRIINEIAT -----------------1111--------------1111--------------------1 LLEQPVVNFDKVDAYVHQLKGSSASVGAQKVKFTCMQFRQFCQDKSRDGCLMALAVVRND 111--------------------------------------------------------- FYDLRNKFQTMLQLEQQIQA -------------------- >TYROSINE-PROTEIN KINASE J; SWP:P52333; PDB:1YVJA; PTIFEERHLKYISQLGKGNFGSVELCRYDPLGDNTGALVAVKQLQHSGPDQQRDFQREIQ ----3333--------------------1111---------------------------- ILKALHSDFIVKYRGVSYGPGRPELRLVMEYLPSGCLRDFLQRHRARLDASRLLLYSSQI -3333-1111---------------------3333-------------3333-------- CKGMEYLGSRRCVHRDLAARNILVESEAHVKIADFGLAKLLPLDKDVVREPGQSPIFWYA ------3333-------3333-------------1111---1111---------1111-3 PESLSDNIFSRQSDVWSFGVVLYELFTYCDKSCSPSAEFLRMMGCERDVPALCRLLELLE 333---------------------1111------------------------------11 EGQRLPAPPACPAEVHELMKLCWAPSPQDRPSFSALGPQLDMLWSGSR 11-----22223333-----1111-1111--3333--------1111- >HYPOTHETICAL PROTEIN BSU3; SWP:O32248; PDB:1YVKA; KLRIELGEETNDELYDLLLLADPSKDIVDEYLERGECYTAWAGDELAGVYVLLKTRPQTV ----------3333---3333----------------------------------2222- EIVNIAVKESLQKKGFGKQLVLDAIEKAKKLGADTIEIGTGNSSIHQLSLYQKCGFRIQA -------1111-----------------------------3333-------1111----- IDHDFFLRHYDEDIFENGIQCRDVRLYLDLL -2222----------%%%%------------ >60-KDA SS-A/RO RIBONUCLEO; SWP:P42700; PDB:1YVRA; MDQTQPLNEKQVPNSEGCYVWQVSDMNRLRRFLCFGSEGGTYYIEEKKLGQENAEALLRL -1111--1111--1111-------------------22221111---------------- IEDGKGCEVVQEIKTFSQEGRAAKQEPTLFALAVCSQCSDIKTKQAAFRAVPEVCRIPTH 1111----------------------------------------------------3333 LFTFIQFKKDLWGRALRKAVSDWYNTKDALNLAMAVTKYKQRNGWSHKDLLRLSHIKPAN 
-----------------------1111-------1111---------------------- EGLTMVAKYVSKGWKEVQEAYKEKELSPETEKVLKYLEATERVKRTKDELEIIHLIDEYR --------------------2222-----------------3333--------------- LVREHLLTIHLKSKEIWKSLLQDMPLTALLRNLGKMTADSVLAPASSEVSSVCERLTNEK -1111-3333--------------------------1111--2222-----------333 LLKKARIHPFHILVALETYKKGHLRWIPDTSIVEALDNAFYKSFKLVEPTGKRFLLAIDV 3-1111-3333-----------------------------1111---------------- SASMNQRVLGSILNASVVAAAMCMLVARTEKDSHMVAFSDEMLPCPITVNMLLHEVVEKM 3333---%%%%------------------------------------11113333----- SDITMGSTDCALPMLWAQKTNTAADIFIVFTDCETNVEDVHPATALKQYREKMGIPAKLI --------1111-----1111-----------------------------1111------ VCAMTSNGFSIADPDDRGMLDICGFDSGALDVIRNFTLDL ------------1111------------------------ >HYPOTHETICAL PROTEIN AQ_1; SWP:O67434; PDB:1YVUA; EALLNLYRIEYRPKDTTFTVFKPTHEIQKEKLNKVRWRVFLQTGLPTFRREDEFWCAGKV ----------------------------1111---------------------------- EKDTLYLTLSNGEIVELKRVGEEEFRGFQNERECQELFRDFLTKTKVKDKFISDFYKKFR ------------------------------------------1111------------33 DKITVQGKNRKIALIPEVNEKVLKSEEGYFLLHLDLKFRIQPFETLQTLLERNDFNPKRI 33----------------------3333----------------------------2222 RVKPIGIDFVGRVQDVFKAKEKGEEFFRLCERSTHKSSKKAWEELLKNRELREKAFLVVL -----------------3333-3333--------3333---------------------- EKGYTYPATILKPVLTYERNEVADIVREPGKRLNLIRYILRRYVKALRDYGWYISPEEER ------3333-------3333--11113333----------------1111--------- AKGKLNFKDTVLDAKGKNTKVITNLRKFLELCRPFVKKDVLSVEIISVSVWRKEEFLKEL ------------1111-------------------------------------------- INFLKNKGIKLKIKGKSLILAQTREEAKEKLIPVINKIKDVDLVIVFLEFLLYDFVKREL ---------------------------------1111------------3333------- LKKIPSQVILNRTLKNENLKFVLLNVAEQVLAKTGNIPYKLKEIEGKVDAFVGIDISRIT ---------3333------------------1111------------------------- RDGKTVNAVAFTKIFNSKGELVRYYLTSYPAFGEKLTEKAIGDVFSLLEKLGFKKGSKIV ---------------1111-----------------------------1111-2222--- VHRDGRLYRDEVAAFKKYGELYGYSLELLEIIKRNNPRFFSNEKFIKGYFYKLSEDSVIL 
-------3333--------1111------------------------------------- ATYNQVYEGTHQPIKVRKVYGELPVEVLCSQILSLTLNYSSFQPIKLPATVHYSDKITKL ------2222-----------------------------1111----3333-3333---- LRGIEPIKKEGDIYWL ---------------- >AMINE OXIDASE, FLAVIN-CON; SWP:Q888A4; PDB:1YVVA; TVPIAIIGTGIAGLSAAQALTAAGHQVHLFDKSRGSGGRSSKRSDAGALDGAQYFTARDR --------------------1111------------------------------------ RFATAVKQWQAQGHVAEWTPLLYNFHAGRLSPSPDEQVRWVGKPGSAITRARGDPVSFSC ---------------------------------3333----11113333----------- RITEVFRGEEHWNLLDAEGQNHGPFSHVIIATPAPQASTLLAAAPKLASVVAGVKDPTWA --------------------------------333311113333------1111------ VALAFETPLQTPQGCFVQDSPLDWLARNRSKPERDDTLDTWILHATSQWSRQNLDASREQ --------------------------33332222------------------1111---- VIEHLHGAFAELIDCTPAPVFSLAHRWLYARPAGAHEWGALSDADLGIYVCGDWCLSGRV ---------1111-----------------------------3333-----1111----- EGAWLSGQEAARRLLEHLQL -------------------- >PHOSPHORIBOSYL-ATP PYROPH; SWP:Q81G00; PDB:1YVWA; AFKLLYKTIEERKGSPLPESYTNYLFSKGEDKILKKIGEECAEVIIACKNNDKEEVVKEV ----------------3333---------------------------1111--------- DVFYHCFVLLAEKNIALEDVREVKERNGKL ---------------3333----------- >SUCCINYLGLUTAMATE DESUCCI; SWP:Q7NU26; PDB:1YW4A; THSPSFLQHALSSSDTRAEWPLPGGLAARWLAPGCVELNGDARGADSVLLSCGVHGNETA ---------------------2222------2222---1111------------1111-- PIEVVDGLTDIAAGQLALNCRLLVFANLDAIRQGVRYGNYDNRLFNGAHARHPELPESVR --------------------------------------------iiii3333-------- AAELETLAAEFFAGARARKLHYDLHTAIRGSVFEKFAIYPFLHRTHKREQLAWLQRCGIE -----------1111--------------------------------------------- AVLLHTQPANTFSYFTSQYCEADAFTLELGKARPFGQNDLSRFSGIDGALRGLLSNPQAN ----------3333---1111------------2222-3333-------------1111- VPDLDEDKLPLFRAKYDLVKHSFKLNLADSVENFTLLPDGLIAATGGEERILFPNPAVKP ----1111-------------------11112222-------------------3333-- GLRAGIVVEPARLPS --------------- >PEPTIDYL PROLYL CIS/TRANS; SWP:Q59KZ2; PDB:1YW5A; MASTSTGLPPNWTIRVSRSHNKEYFLNQSTNESSWDPPYGTDKEVLNAYIAKFKNNGYKP 
--------2222-------------------------2222--------------%%%%- LVNEDGQVRVSHLLIKNNQSRKPKSWKSPDGISRTRDESIQILKKHLERILSGEVKLSEL --1111------------------1111-------------------------------- ANTESDCSSHDRGGDLGFFSKGQMQPPFEEAAFNLHVGEVSNIIETNSGVHILQRTG ------1111-iiii----2222--------11112222------1111-------- >SUCCINYLGLUTAMATE DESUCCI; SWP:P76215; PDB:1YW6A; PDFLALTLTGKKPVITEREINGVRWRWLGDGVLELTPLTPPQGALVISAGIHGNETAPVE ----------------------------2222-----------------------3333- LDALLGAISHGEIPLRWRLLVILGNPPALKQGKRSDNRFGGRWQLFAESGETCRARELEQ ------------------------3333-------------------------------- CLEDFYDQGKESVRWHLDLHTAIRGSLHPQFGVLPQRDIPWDEKFLTWLGAAGLEALVFH ------------------------------------------------1111-------- QEPGGTFTHFSARHFGALACTLELGKALLRQFAVTASAIAALLSGESVGIVRTPPLRYRV ----------------------------!!!!---------1111--------------- VSQITRHSPSFEHASDTLNFPFEKGTLLAQDGEERFTVTHDVEYVLFPNPLVALGLRAGL -------1111-----------2222---------------------------------- LEKIS ----- >PHOSPHOTYROSINE PROTEIN P; SWP:P96830; PDB:1YWFA; RELPGAWNFRDVADTATALRPGRLFRSSELSRLDDAGRATLRRLGITDVADLRSSREVAR --2222----3333-1111----------------------------------------- RGPGRVPDGIDVHLLPFPDLADSINDAATRYMTDEYRQFPTRNGAQRALHRVVTLLAAGR --------------------------------------1111-------------1111- PVLTHCFAGKDRTGFVVALVLEAVGLDRDVIVADYLRSNDSVPQLRARISEMIQQRFDTE ---------------------1111-----------3333------------1111---- LAPEVVTFTKARLSDGVLGVRAEYLAAARQTIDETYGSLGGYLRDAGISQATVNRMRGVL -3333--------3333---3333-------------------1111------------- L - >UROKINASE PLASMINOGEN ACT; SWP:Q9UMV0; PDB:1YWHA; LRCMQCKTNGDCRVEECALGQDLCRTTIVRLWEEGEELELVEKSCTHSEKTNRTLSYRTG ------3333-------2222-----------!!!!----------3333---------- LKITSLTEVVCGLDLCNQGNSGRSRYLECISCGSSDMSCERGRHQSLQCRSPEEQCLDVV ---------------------------------3333-------------1111------ THWIQRPKDDRHLRGCGYLPGCPGSNGFHNNDTFHFLKCCNTTKCNEGPILELENLPQNG -----------------------------------------2222------3333----- RQCYSCKGQSTHGCSSEETFLIDCRGPMNQCLVATGTHEPKNQSYMVRGCATASMCQHAH 
--------1111------------!!!!-----------------------3333--333 LGDAFSMNHIDVSCCTKSGCNHPDLDVQ 33333----------------------- >4-deoxy-L-threo-5-hexosul; SWP:Q838L9; PDB:1YWKA; LQNMETRYTHSPADIRHYSTEQLRDEFLVEKVFIPGAISLTYTHNDRMIFGGVTPTTEEL ----------33331111---------------2222-------%%%%------------ EIILDKELGVDYFLERRELGVINIGGPGFIEIDGAKETMKKQDGYYIGKETKHVRFSSEN ----3333-------------------------------2222----2222--------3 PDNPAKFYISCVPAHHKYPNVKISIDEITPMETGDPLTLNQRKIYQYIHPNVCESCQLQM 333-------------------------------3333----------3333-------- GYTILEPGSAWNTRMEAYVYFDMEEDTRIFHMMGKPDETKHLVMSNEQAAISPSWSIHSG -----2222--------------1111-------1111--------------1111---- VGTSNYSFIWAMCGE --------------- >HYPOTHETICAL UPF0213 PROT; SWP:Q830S9; PDB:1YWLA; MENKKSHYFYVLLCQDGSFYGGYTTEPERRLTEHNSGTGAKYTRLAKRRPVIMIHTEKFE -------------1111---------------3333------------------------ TRSEATKAEAAFKKLTRKQKEQYLKTFHLEHHHHHH ----------1111---------3333--------- >C PROTEIN ALPHA-ANTIGEN; SWP:Q02192; PDB:1YWMA; STIPGSAATLNTSITKNIQNGNAYIDLYDVKLGKIDPLQLIVLEQGFTAKYVFRQGTKYY --2222----3333----iiii-------1111--3333----2222-------!!!!-- GDVSQLQSTGRASLTYNIFGEDGLPHVKTDGQIDIVSVALTIYDSTTLRDKIEEVRTNAN -3333--------------1111----1111----------------------------- DPKWTEESRTEVLTGLDTIKTDIDNNPKTQTDIDSKIVEVNELEKLLVLKLAAALEHHHH 3333---------------------------------------1111------------- >VASCULAR ENDOTHELIAL GROW; SWP:P35968; PDB:1YWNA; LPYDASKWEFPRDRLKLGKPLGRGAFGQVIEADAFGIDKTATCRTVAVKMLTHSEHRALM ---3333---1111---------------------1111--------------------- SELKILIHIGHHLNVVNLLGACTKPGGPLMVIVEFCKFGNLSTYLRSKRNEFVPFLTLEH -----------1111--------------------1111------1111----------- LICYSFQVAKGMEFLASRKCIHRDLAARNILLSEKNVVKICDIKDPDVRKGDARLPLKWM ---------------1111------3333----%%%%-----------2222--3333-- APETIFDRVYTIQSDVWSFGVLLWEIFSLGASPYPGVKIDEEFCRRLKEGTRMRAPDYTT 3333-----------------------------2222------------------1111- PEMYQTMLDCWHGEPSQRPTFSELVEHLGNLLQANAQQD --------1111-3333---------------3333--- >NITROREDUCTASE FAMILY PRO; 
SWP:Q81EW9; PDB:1YWQA; SATTTNLKEAIVNRRSIRKVTKNDAITKERIEEVLKTALHAPTSFNMQSGRMVVLMDGEH ----------------------3333----------------2222-------------- EKFWDIVKETLRARVPAENFEATVERLKGFHAGVGTVLFFEDQATVEKMQENAPLYKDQF ---------------3333----------3333--------------3333-3333---- PFWSHQGNAMLQHTVWMLLSAEGIGASLQHYNPIVDAEVKETWNIPAEWSLVGQMPFGEP -------------------1111---------1111---------1111----------- NEQPAERTFLPTEDVVKFY ----------3333----- >14-3-3 PROTEIN SIGMA; SWP:P31947; PDB:1YWTA; MERASLIQKAKLAEQAERYEDMAAFMKGAVEKGEELSCEERNLLSVAYKNVVGGQRAAWR ------------------------------------------------------------ VLSSIEQKSNPEVREYREKVETELQGVCDTVLGLLDSHLIKEAGDAESRVFYLKMKGDYY ----------3333------------------------2222------------------ RYLAEVATGDDKKRIIDSARSAYQEAMDISKKEMPPTNPIRLGLALNFSVFHYEIANSPE ---1111!!!!-----------------------1111---------------------- EAISLAKTTFDEAMADLHTLSEDSYKDSTLIMQLLRDNLTLWT ----------3333-3333-3333------------------- >HYPOTHETICAL PROTEIN PA46; SWP:Q9HVI1; PDB:1YWUA; MSDQHDERRRFHRIAFDADSEILQGERRWEVLLHDVSLHGILVGQPQDWNGDPQRPFEAR ------------------------------------1111-----------1111----- LYLGLDVLIRMEISLAWARDGLLGFECQHIDLDSISHLRRLVELNLGDEELLERELALLV ---1111-----------iiii-------------------------3333---3333-- SAHDD ----- >HYPOTHETICAL PROTEIN PA47; SWP:Q9HV61; PDB:1YWWA; MNSDVIKGKWKQLTGKIKERWGDLTDDDLQAADGHAEYLVGKLQERYGWSKERAEQEVRD ----33331111--------33333333------3333---------------------- FSDRL 3333- >30S RIBOSOMAL PROTEIN S24; SWP:P61193; PDB:1YWXA; MDISIISDRNNPLLQRREIKFTVSFDAATPSIKDVKMKLVAVLNANKQVLVVDTLDQIFG -------------------------------3333------------------------- KLEAEGYAKIYNDEKAMATIETKSVLEKNKIEEEAEAEVAEE ------------------------3333-------------- >HYPOTHETICAL PROTEIN PA20; SWP:Q9I293; PDB:1YWYA; MSIEIDSEQGVCSVEIEGSRHRAPVDSLRIGTDAEARLSVLYIDGKRLHISEEDAQRLVV ------1111-------------1111-----------------------3333------ AGAEDQRRHLMADD -------------- >HYPOTHETICAL PROTEIN YSNE; SWP:P94562; PDB:1YX0A; MHIKIDDLTGRQVVSLVNEHLHSMTLMSPPESIHALGLEKLRGPEITFWSAWEGDELAGC 
---------3333----------------------------------------------- GALKELDTRHGEIKSMRTSASHLRKGVAKQVLQHIIEEAEKRGYERLSLETGSMASFEPA -----------------------------------------------------3333--- RKLYESFGFQYCEPFADYGEDPNSVFMTKKL ---1111------------------------ >HYPOTHETICAL PROTEIN PA22; SWP:Q9I1L3; PDB:1YX1A; LHPVSISLSSYGADLVRSRGQASFLPLLAAGAQRVELREELFAGPPDTEALTAAIQLQGL -------3333--------------3333--------1111--------------1111- ECVFSSPLELWREDGQLNPELEPTLRRAEACGAGWLKVSLGLLPEQPDLAALGRRLARHG -----------1111--1111-------1111-----------------------1111- LQLLVENDQTPQGGRIEVLERFFRLAERQQLDLATFDIGNWRWQEQAADEAALRLGRYVG ---------3333-----------------------33333333----------3333-- YVHCKAVIRNRDGKLVAVPPSAADLQYWQRLLQHFPEGVARAIEYPLQGDDLLSLSRRHI ---------1111------------------11112222--------------------- AALARLGQ --1111-- >AMINOMETHYLTRANSFERASE; SWP:P54378; PDB:1YX2A; LKRTPLFDLYKEYGGKTIDFGGWELPVQFSSIKKEHEAVRTAAGLFDVSHGEVEVSGNDS ---1111---1111----------------3333----------------------1111 LSFLQRLTNDVSALTPGRAQYTACYPDGGTVDDLLIYQKGENRYLLVINASNIDKDLAWK ---------3333-2222------1111--------------------3333-------1 EHAAGDVQIDNQSDQIALLAVQGPKAEAILKNLTDADVSALKPFAFIDEADISGRKALIS 111--------3333-------1111--3333----3333-2222------iiii----- RTGYTGEDGYEIYCRSDDAHIWKKIIDAGDAYGLIPCGLGARDTLRFEANIPLYGQELTR --------------3333----------3333--------------------2222--11 DITPIEAGIGFAVKHKKESDFFGKSVLSEQKENGAKRKLVGLEIEKGIPRHGYEVFQNGK 113333--3333-1111---2222-------------------------2222---iiii SVGKVTTGTQSPTLGKNVGLALIDSETSEIGTVVDVEIRKKLVKAKVVKTPF -----------1111--------3333-2222-----%%%%----------- >HYPOTHETICAL PROTEIN DSRC; SWP:O87899; PDB:1YX3A; MADTIEVDGKQFAVDEEGYLSNLNDWVPGVADVMAKQDNLELTEEHWDIINFLREYYEEY ------iiii----1111--------3333-----1111--------------------- QIAPAVRVLTKAVGKKLGKEKGNSKYLYSLFPYGPAKQACRFAGLPKPTGCV ----3333----------------------1111------------------ >26S PROTEASOME NON-ATPASE; SWP:P55036; PDB:1YX4A; HMLGLGASDFEFGVDPSADPELALALRVSMEEQRQRQEEEARRAAAASAAEAGIATTGTE 
--------------3333------------------------------------------ DSDDALLKMTISQQEFGRTGLPDLSSMTEEEQIAYAMQMSLQGAEFGQAESA ---------------------------3333--------------------- >CALSENSIN; SWP:Q25088; PDB:1YX7A; MACKVKAELEAAFKKLDANGDGYVTALELQTFMVTLDAYKALSKDKVKEASAKLIKMADK ------------------------3333------------------3333---------- NSDGKISKEEFLNANAELLCQLK ------3333-----3333---- >serine (or cysteine) prot; SWP:Q91WP6; PDB:1YXAA; LASINTDFAFSLYKELVLKNPDTNIVFSPLSISAALALVSLGAKGNTLEEILEGLKFNLT ---------------------------3333--------1111-3333---------333 ETSEADIHQGFGHLLQRLNQPKDQVQISTGSALFIEKRQQILTEFQEKAKTLYQAEAFTA 3--------------------%%%%----------3333--------------------- DFQQPRQAKKLINDYVRKQTQGMIKELVSDLDKRTLMVLVNYIYFKAKWKVPFDPLDTFK 3333---------------iiii--------1111------------------3333--- SEFYCGKRRPVIVPMMSMEDLTTPYFRDEELSCTVVELKYTGNASALFILPDQGRMQQVE ----------------------------1111-------------------2222----1 ASLQPETLRKWKNSLKPRMIDELHLPKFSISTDYSLEDVLSKLGIREVFSTQADLSAITG 111-------------------------------------1111-33331111------- TKDLRVSQVVHKAVLDVAETGTEAAAATGVKFVPMSAKLYPLTVYFNRPFLIMIFDTETE -----------------1111--------------------------------------- IAPFIAKIANPK ------------ >PHOSPHORIBOSYL-ATP PYROPH; SWP:Q9EWK0; PDB:1YXBA; KTFEELFTELQHKAANTSRTAELVDKGVHAIGKKVVEEAAEVWAAEYEGKDAAAEEISQL -----------1111------3333----------------------------------- LYHVQVVARGISLDDVYAHLL ------1111------3333- >PHOSPHOLIPASE A2; SWP:Q6T179; PDB:1YXHA; NIYQFKNMIQCTVPSRSWWDFADYGCYCGRGGSGTPVDDLDRCCQVHDNCYNQAQEITGC ------------------1111----------------------------------2222 RPKWKTYTYECSQGTLTCKGRNNACAATVCDCDRLAAICFAGAPYNDNNYNIDLKARCQ 3333-------iiii---1111-----------------1111--3333---3333--- >PHOSPHOLIPASE A2 ISOFORM ; SWP:P60045; PDB:1YXLA; NLYQFKNMIQCTVPSRSWQDFADYGCYCGKGGSGTPVDDLDRCCQVHDNCYNEAENISGC ----------------3333------------------------------------2222 RPYFKTYSYECTQGTLTCKGDNNACAASVCDCDRLAAICFAGAPYNDANYNIDLKARCN 3333-------iiii---1111-----------------1111--3333---3333--- >PEROXISOMAL TRANS 
2-ENOYL; SWP:Q9BY49; PDB:1YXMA; GRSYLAPGLLQGQVAIVTGGATGIGKAIVKELLELGSNVVIASRKLERLKSAADELQANL -----22222222----------------------------------------------- PPTKQARVIPIQCNIRNEEEVNNLVKSTLDTFGKINFLVNNGGGQFLSPAEHISSKGWHA 1111---------1111-------------------------------3333-------- VLETNLTGTFYMCKAVYSSWMKEHGGSIVNIIVPTKAGFPLAVHSGAARAGVYNLTKSLA --------------------------------------2222------------------ LEWACSGIRINCVAPGVIYSQTAVENYGSWGQSFFEGSFQKIPAKRIGVPEEVSSVVCFL --1111-------------333311113333-11113333--------3333-------- LSPAASFITGQSVDVDGGRSLYTHSYEVPDHDNWPKGAGDLSVVKKMKETFKEKAKL -3333------------3333-3333------------------------------- >4-HYDROXYTHREONINE-4-PHOS; SWP:Q9I5U4; PDB:1YXOA; SLRFALTPGEPAGIGPDLCLLLARSAQPHPLIAIASRTLLQERAGQLGLAIDLKDVSPAA ---------1111-------------------------------------------1111 WPERPAKAGQLYVWDTPLAAPVRPGQLDRANAAYVLETLTRAGQGCLDGHFAGITAPVHK ------2222------------2222-3333--------------1111----------- GVINEAGIPFSGHTEFLADLTHTAQVVLATRGLRVALATTHLPLREVADAISDERLTRVA -----------------------------2222---------33333333---------- RILHADLRDKFGIAHPRILVCGLNPHAGEGGHLGREEIEVIEPCLERLRGEGLDLIGPLP -----------------------2222-iiii--3333----------1111-------1 ADTLFTPKHLEHCDAVLAYHDQGLPVLKYKGFGAAVNVTLGLPIIRTSVDHGTALDLAGS 111--33331111------3333-------2222-------------------1111--- GRIDSGSLQVALETAYQAASRC -----------------3333- >VACUOLAR PROTEIN SORTING ; SWP:Q9UN37; PDB:1YXRA; MTTSTLQKAIDLVTKATEEDKAKNYEEALRLYQHAVEYFLHAIKYEAHSDKAKESIRAKC ------------------3333-3333---------------------3333-------- VQYLDRAEKLKDYLRSK ----------------- >Putative N-acetylmannosam; SWP:P65522; PDB:1YXYA; KPTKEKLMEQLKGGIIVSCQALPGEPLYSETGGIMPLMAKAAQEAGAVGIRANSVRDIKE ----------2222-------2222---1111----------3333-------------- IQAITDLPIIGIIKKDYPPQEPFITATMTEVDQLAALNIAVIAMDCTKRDRHDGLDIASF 3333------------------------------1111------------1111------ IRQVKEKYPNQLLMADISTFDEGLVAHQAGIDFVGTTLSGYTPYSRQEAGPDVALIEALC -------1111---------------1111-----1111--1111--------------- 
KAGIAVIAEGKIHSPEEAKKINDLGVAGIVVGGAITRPKEIAERFIEALK ---------------------1111------3333-----------1111 >S-adenosylmethionine:tRNA; SWP:O32054; PDB:1YY3A; DLFDFELPERLIAQVPLEQRDASRLMVLDKHTGELTDSSFKHIISFFNEGDCLVLNNTRV -------3333------------------------------------------------- LPARLFGTKEDTGAKVELLLLKQETGDKWETLAKPAKRVKKGTVVTFGDGRLKAICTEEL ----------------------------------3333--------!!!!---------1 EHGGRKMEFQYDGIFYEVLESLGEMPLPPYIKEQLDDKEAAAPTAGLHFTEEILQQLKDK 111-----------3333--3333-------3333------------------------- GVQIEFITLHVGLGTFRMHAEFYQMSEETAAALNKVRENGGRIISVGTTSTRTLETIAGE -----------3333-----------------------------------------3333 HDGQFKASSGWTSIFIYPGYEFKAIDGMITNFHLPKSSLIMLVSALAGRENILRAYNHAV --------------------------------------33333333-------------- EEEYRFFSFGDAMLI --------------- >ESTROGEN RECEPTOR BETA; SWP:Q9UEV6; PDB:1YY4A; LSPEQLVLTLLEAEPPHVLISRPSAPFTEASMMMSLTKLADKELVHMISWAKKIPGFVEL ----------------------------3333---------------------2222--- SLFDQVRLLESCWMEVLMMGLMWRSIDHPGKLIFAPDLVLDRDEGKCVEGILEIFDMLLA 3333--------------------1111------------3333-----3333------- TTSRFRELKLQHKEYLCVKAMILLNSSMDSSRKLAHLLNAVTDALVWVIAKSGISSQQQS ----------------------1111----------------------------3333-- MRLANLLMLLSHVRHASNKGMEHLLNMKCKNVVPVYDLLLEMLNA ---------------------------1111-------------- >STRINGENT STARVATION PROT; SWP:Q8ZB63; PDB:1YY7A; AANKRSVMTLFSGPTDIFSHQVRIVLAEKGVSVEIEQVEADNLPQDLIDLNPYRTVPTLV 3333----------------------------------3333--------1111------ DRELTLYESRIIMEYLDERFPHPPLMPVYPVARGSSRLMMHRIEHDWYSLLYKIEQGNAQ !!!!-------------------------------------------------------- EAEAARKQLREELLSIAPVFNETPFFMSEEFSLVDCYLAPLLWRLPVLGIEFTGAGSKEL ---------------3333-------------3333-------3333------2222--- KGYMTRVFERDAFLASLTEAEREMHL -------------33333333-1111 >CETUXIMAB FAB LIGHT CHAIN; SWP:NA; PDB:1YY8A; DILLTQSPVILSVSPGERVSFSCRASQSIGTNIHWYQQRTNGSPRLLIKYASESISGIPS -------------2222---------------------2222------------222233 
RFSGSGSGTDFTLSINSVESEDIADYYCQQNNNWPTTFGAGTKLELKRTVAAPSVFIFPP 33----------------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------%%%%--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGA -----1111--------1111------------ >CETUXIMAB FAB LIGHT CHAIN; SWP:NA; PDB:1YY8B; QVQLKQSGPGLVQPSQSLSITCTVSGFSLTNYGVHWVRQSPGKGLEWLGVIWSGGNTDYN ------------2222-----------1111--------------------1111----3 TPFTSRLSINKDNSKSQVFFKMNSLQSNDTAIYYCARALTYYDYEFAYWGQGTLVTVSAA 333---------1111---------3333----------1111----------------- STKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG -----------------iiii------------------%%%%--2222-------1111 LYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKRVEPKS ----------1111------------1111----------- >TRIOSEPHOSPHATE ISOMERASE; SWP:Q5SJR1; PDB:1YYAA; MRRVLVAGNWKMHKTPSEARVWFAELKRLLPPLQSEAAVLPAFPILPVAKEVLAETQVGY -----------------------------------------1111-------1111---- GAQDVSAHKEGAYTGEVSARMLSDLGCRYAIVGHSERRRYHGETDALVAEKAKRLLEEGI ------------2222------1111-----------------------------1111- TPILCVGEPLEVREKGEAVPYTLRQLRGSLEGVEPPGPEALVIAYEPVWAIGTGKNATPE ------------1111-------------2222---3333------1111---------- DAEAMHQAIRKALSERYGEAFASRVRILYGGSVNPKNFADLLSMPNVDGGLVGGASLELE ------------------3333-----------3333------1111-----1111---- SFLALLRIAG ---------- >PUTATIVE LATE EMBRYOGENES; SWP:NA; PDB:1YYCA; GHHHHHHLEASADEKVVEEKASVISSLLDKAKGFFAEKLANIPTPEATVDDVDFKGVTRD ------------------------------------------------------------ GVDYHAKVSVKNPYSQSIPICQISYILKSATRTIASGTIPDPGSLVGSGTTVLDVPVKVA ------------------------------------------------------------ YSIAVSLMKDMCTDWDIDYQLDIGLTFDIPVVGDITIPVSTQGEIKLPSLRDFF ----------------------------3333---------------------- >PEROXIDASE MANGANESE-DEPE; SWP:Q02567; PDB:1YYDA; AVCPDGTRVSHAACCAFIPLAQDLQETIFQNECGEDAHEVIRLTFHDAIAISRSQGPKAG --1111----33333333----------%%%%--------------1111-33333333- 
GGADGSMLLFPTVEPNFSANNGIDDSVNNLIPFMQKHNTISAADLVQFAGAVALSNCPGA ----3333-111111111111----------3333---------------------2222 PRLEFLAGRPNKTIAAVDGLIPEPQDSVTKILQRFEDAGGFTPFEVVSLLASHSVARADK ----------------------1111-------------------------3333----- VDQTIDAAPFDSTPFTFDTQVFLEVLLKGVGFPGSANNTGEVASPLPLGSGSDTGEMRLQ -1111-------1111--33333333-----------2222--------!!!!------- SDFALAHDPRTACIWQGFVNEQAFMAASFRAAMSKLAVLGHNRNSLIDCSDVVPVPKPAT -----------------2222-------------1111---1111---1111-------- GQPAMFPASTGPQDLELSCPSERFPTLTTQPGASQSLIAHCPDGSMSCPGVQFNGPA ------22223333----1111------------------1111------------- >ATP-dependent protease hs; SWP:P39070; PDB:1YYFD; SSFHATTIFAVQHKGRSAMSGDGQVTFGQAVVMKHTARKVRKLFNGKVLAGFAGSVADAF ------------%%%%-------------------------------------------- TLFEKFEAKLEEYNGNLKRAAVELAKEWRSDKVLRKLEAMLIVMNQDTLLLVSGTGEVIE ------------iiii-----------------3333---------------1111---- PDDGILAIGSGGNYALAAGRALKKHAGESMSASEIARAALETAGEICVYTNDQIILEELE --------1111------------------------------33331111---------- >Envelope glycoprotein gp1; SWP:P35961; PDB:1YYMG; EVKLENVTENFNMWKNNMVEQMHEDIISLWDQSLKPCVKLTPLCVGAGSCNTSVITQACP -----------11113333----------------------------------------- KVSFEPIPIHYCAPAGFAILKCNDKKFNGTGPCTNVSTVQCTHGIRPVVSTQLLLNGSLA -------------2222------------------------------------------- EEEIVIRSENFTNNAKTIIVQLNESVVINCTGAGHCNLSKTQWENTLEQIAIKLKEQFGN ---------3333---------------------------------------------11 NKTIIFNPSSGGDPEIVTHSFNCGGEFFYCNSTQLFTWNDRNITLPCRIKQIINMWQEVG 11----------3333------iiii-----3333------------------------- KAMYAPPIRGQIRCSSNITGLLLTRDGGKDTNGTEIFRPGGGDMRDNWRSELYKYKVVKI ------------------------------------------3333-----1111----- E - >TRICHODIENE SYNTHASE; SWP:P13513; PDB:1YYQA; ENFPTEYFLNTTVRLLEYIRYRDSNYTREERIENLHYAYNKAAHHFAQPRQQQLLKVDPK ------------------------------------------------------------ RLQASLQTIVGMVVYSWAKVSKECMADLSIHYTYTLVLDDSKDDPYPTMVNYFDDLQAGR ----------------1111-----------------1111---33331111-------- 
EQAHPWWALVNEHFPNVLRHFGPFCSLNLIRSTLDFFEGCWIEQYNFGGFPGSHDYPQFL ------------33333333----------------------1111---2222------- RRMNGLGHCVGASLWPKEQFNERSLFLEITSAIAQMENWMVWVNDLMSFYKEFDDERDQI -----3333------3333-3333-------------------------1111-3333-- SLVKNYVVSDEISLHEALEKLTQDTLHSSKQMVAVFSDKDPQVMDTIECFMHGYVTWHLC -----------------------------------1111--------------------- DRRFRLSEIYEKVKEEKTEDAQKFCKFYEQAANVGAVSPSEWAYPPVAQLANV 3333---------------------------------3333------------ >PUTATIVE TRANSCRIPTIONAL ; SWP:Q7CP90; PDB:1YYVA; QLREGNLFAEQCPSREVLKHVTSRWGVLILVALRDGTHRFSDLRRGGVSELAQSLQALEQ -----1111----------------------3333---3333------------------ DGFLNRVSYPVVPPHVEYSLTPLGEQVSDVAALADWIELNLPQVLAQRE ------------------------------------------------- >TRANSLATIONALLY CONTROLLE; SWP:TCTP_HUMAN; PDB:1YZ1A; FMIIYRDLISHDEMFSDIYKIREIADGLCLEVEGKMVSRITGVDIVMNHHLQETSFTKEA ------------------------iiii-------------------------------- YKKYIKDYMKSIKGKLEEQRPERVKPFMTGAAEQIKHILANFKNYQFFIGENMNPDGMVA -------------------3333-----------------1111-----33331111--- LLDYREDGVTPYMIFFKDGLEMEKC ----1111-------1111------ >DUAL SPECIFICITY PHOSPHAT; SWP:NA; PDB:1YZ4A; HMGNGMTKVLPGLYLGNFIDAKDLDQLGRNKITHIISIHESPQPLLQDITYLRIPVADTP ---------2222--------------1111---------------------------11 EVPIKKHFKECINFIHCCRLNGGNCLVHSFAGISRSTTIVTAYVMTVTGLGWRDVLEAIK 11----------------1111------------------------------------33 ATRPIANPNPGFRQQLEEFGWASSQKLRRQLEERFGE 331111--3333------------------------- >Probable translation init; SWP:Q9V0E4; PDB:1YZ7A; KAKLQEFKRAQKAENLLKLAAEKLGKDFETAWREVWVPLEEEWGEVYAAFEDAAKDGIDV -3333-----------------------------------------------------11 LKGHVPDEWLPVLKEIIDNYVEVPTVTIDAEFEITVPKPNGVEIIKEALIRARDRANKEK 11---3333----------------------------1111-------------111122 DVEVKFTYLGAPRYRIDITAPDYYKAEEVLESIAEEILRVIKEAGGEATLLRKEKR 22------------------------------------------------------ - >REDESIGNED APO-CYTOCHROME; SWP:Q0SXH8; PDB:1YZAA; ADLEDNDETGNDNGKGGEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKD 
------------------------------------------------------------ FRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTIRAYNQKYG ----------------------3333----3333-3333---1111 >LIPASE/ACYLHYDROLASE; SWP:Q839J6; PDB:1YZFA; MRKIVLFGDSITAGYLDEAVSPVLVDLVKRDIAAMGLEEVAVINAGMPGDTTEDGLKRLN ---------3333-!!!!----------------------------2222---------- KEVLIEKPDEVVIFFGANDASLDRNITVATFRENLETMIHEIGSEKVILITPPYADSGRR ---------------3333-1111------------------3333----------1111 PERPQTRIKELVKVAQEVGAAHNLPVIDLYKAMTVYPGTDEFLQADGLHFSQVGYELLGA -------------------1111--------------3333--1111------------- LIVREIKGRLKPKQA -----3333------ >ADP-RIBOSYLATION FACTOR-L; SWP:Q96KC2; PDB:1YZGA; AKLWSLFCNQEHKVIIVGLDNAGKTTILYQFLMNEVVHTSPTIGSNVEEIVVKNTHFLMW ------------------2222---------%%%%------2222------!!!!----- DIGGQESLRSSWNTYYSNTEFIILVVDSIDRERLAITKEELYRMLAHEDLRKAAVLIFAN ----333333331111----------11111111------------3333---------- KQDMKGCMTAAEISKYLTLSSIKDHPWHIQSCCALTGEGLCQGLEWMT 3333-------------3333------------1111----------- >TRNA (GUANINE-N(7)-)-METH; SWP:P67506; PDB:1YZHA; GATELLEANPQYVVLNPLEAKAKWRDLFGNDNPIHVEVGSGKGAFVSGAKQNPDINYIGI ---------------33332222----------------!!!!3333-3333-------- DIQKSVLSYALDKVLEVGVPNIKLLWVDGSDLTDYFEDGEIDRLYLNFSDPWPKKRHEKR --3333------------------------3333--2222-------------3333111 RLTYKTFLDTFKRILPENGEIHFKTDNRGLFEYSLVSFSQYGKLNGVWLDLHASDFEGNV 1--------------2222-------------------3333-------3333------- TEYEQKFSNKGQVIYRVEAEF ------3333----------- >RAS-RELATED PROTEIN RAB-9; SWP:Q9R0M6; PDB:1YZLA; KSSLFKIILLGDGGVGKSSLMNRYVTNKFDSQLFHTIGVEFLNKDLEVDGHFVTMQIWDT -----------2222--------------------------------------------- AGQERFRSLRTPFYRGSDCCLLTFSVDDSQSFQNLSNWKKEFIYYADVKEPESFPFVILG --3333---33332222-------1111---------------------3333------- NKTDIKERQVSTEEAQAWCKDNGDYPYFETSAKDSTNVAAAFEEAVRRILAT -3333-----------------------------2222----------3333 >FYVE-finger-containing Ra; SWP:Q9H1K0; PDB:1YZMA; GSPLLQQIHNITSFIRQAKAAGRMDEVRTLQENLRQLQDEYDQQQT ------------------1111------------------------ 
>SMALL GTP BINDING PROTEIN; SWP:P20340; PDB:1YZQA; FKLVFLGEQSVGKTSLITRFMYDSFDNTYQATIGIDFLSKTMYLEDRTIRLQLWDTAGQE -------2222--------------------------------1111-----------33 RFRSLIPSYIRDSAAAVVVYDITNVNSFQQTTKWIDDVRTERGSDVIIMLVGNKTDLADK 331111---2222-------1111-----------------!!!!--------3333111 RQVSIEEGERKAKELNVMFIETSAKAGYNVKQLFRRVAAALPGM 1----------------------3333----------3333--- >HYPOTHETICAL PROTEIN; SWP:Q4D3U8; PDB:1YZVA; SRLLKHYGSCKTAFFCCDIQEKFMGRIANSANCVFVANRFAGLHTALGTAHSVYIVTEQY -----1111----------3333---1111-----------------3333--------- PKGLGATSADIRLPPDAHVFSKKRFAMLVPQVMPLVDLPEVEQVVLWGFETHVCILQTAA -------1111--1111-----------33331111-3333--------1111------- ALLDMKKKVVIAVDGCGSQSQGDHCTAIQLMQSWSGDGCYISTSESILMQLLKDASDPVF --1111-----3333------------------3333----------------1111-33 KTIAPLMKQTHPIRI 333333--------- >GFP-LIKE NON-FLUORESCENT ; SWP:Q95W85; PDB:1YZWA; GLLKESMRIKMYMEGTVNGHYFKCEGEGDGNPFAGTQSMRIHVTEGAPLPFAFDILAPCC ----------------iiii-------------------------------33331111- SRTFVHHTAEIPDFFKQSFPEGFTWERTTTYEDGGILTAHQDTSLEGNCLIYKVKVHGTN -------%%%%-3333--------------1111-----------!!!!----------- FPADGPVMKNKSGGWEPSTEVVYPENGVLCGRNVMALKVGDRHLICHHYTSYRSKKAVRA -1111-------------------iiii----------!!!!--------------3333 LTMPGFHFTDIRLQMLRKKKDEYFELYEASVARYSDLPEK ---------------------------------------- >GLUTATHIONE S-TRANSFERASE; SWP:Q9Y2Q3; PDB:1YZXA; LPRTVELFYDVLSPYSWLGFEILCRYQNIWNINLQLRPSLITGIMKDSGNKPPGLLPRKG ---------1111------------1111----------------------3333----- LYMANDLKLLRHHLQIPIHFPKDFLSVMLEKGSLSAMRFLTAVNLEHPEMLEKASRELWM -----------1111-------3333--------------------3333---------- RVWSRNEDITEPQSILAAAEKAGMSAEQAQGLLEKIATPKVKNQLKETTEAACRYGAFGL ----------3333----------3333----1111----------------1111---- PITVAHVDGQTHMLFGSDRMELLAHLLGEKWMGPIPPA -------------------------------------- >HYPOTHETICAL PROTEIN HI10; SWP:P44093; PDB:1YZYA; LGVIADDFTGASDIASFLVENGLSTVQMNGVPTQSLNSKVDAIVISLKSRSNPVNEAIEQ ------------------------------------------------------------ 
SLRAYQWLKENGCTQFYFKYCSTFDSTAKGNIGPVTDALLDELNEDFTVITPALPVNGRT --------1111--------1111--1111-----------------------3333--- IFNGYLFVGDVLLSESGMKNHPITPMVDANLMRLMDAQAKGKTGLVAYADVIKGASRVQE -iiii--!!!!33333333----------------1111-------33331111------ CFAELKAQGYRYAVVDAVDNSQLEVLAEAVADFKLVTGGSGLGAYMAARLSGGKKGTNAF ---------------------------1111------------------------1111- TPTKGKTVVLSGSCSVMTNKQVEKYREKAPHFQLDVEQAIHNENYIEQLYQWVIANLDSE -----------------------3333--------------1111-------1111---- FAPMVYATVPPDALKAIQHQFGVDQASHAIENTFAKLAAKLKQYGVTNFITAGGETSSIV ---------3333----------------------------------------------- VQELGFTGFHIGKQIAPGVPWLKAVEEDIFLALKSGNFGKEDFFEYAQGMFL ---------------2222---------------1111-1111----3333- >DNA repair endonuclease X; SWP:Q92889; PDB:1Z00B; MDSETLPESEKYNPGPQDFLLKMPGVNAKNCRSLMHHVKNIAELAALSQDELTSILGNAA -------3333-----------------------------3333---3333--------- NAKQLYDFIHTSFAEVVSKGKGKK ------------------------ >2-oxo-1,2-dihydroquinolin; SWP:O05935; PDB:1Z02A; ISDARANNAKTQSQYQPYKDAAWGFINHWYPALFTHELEEDQVQGIQICGVPIVLRRVNG -1111--3333----3333-1111---------3333-2222-----iiii------iii KVFALKDQCLHRGVRLSEKPTCFTKSTISCWYHGFTFDLETGKLVTIVANPEDKLIGTTG i-------------3333-----1111-------------------1111--3333---- VTTYPVHEVNGMIFVFVREDDFPDEDVPPLAHDLPFRFPERSEQFPHPLWPSSPSVLDDN --------iiii------11113333--3333------1111----1111----1111-- AVVHGMHRTGFGNWRIACENGFDNAHILVHKDNTIVHAMDWVLPLGLLPTSDDCIAVVED ------------------3333---33331111--------------------------- DDGPKGMMQWLFTDKWAPVLENQELGLKVEGLKGRHYRTSVVLPGVLMVENWPEEHVVQY ------------3333------1111---------------------------2222--- EWYVPITDDTHEYWEILVRVCPTDEDRKKFQYRYDHMYKPLCLHGFNDSDLYAREAMQNF --------------------------------------------------------3333 YYDGTGWDDEQLVATDISPITWRKLASRWNRGIAKPGRGVAGAVKDTSLIFKQTADGKRP 11113333----3333-----------------------2222----------------- GYKVEQI ------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9KQJ1; PDB:1Z05A; DHIKQINAGRVYKLIDQKGPISRIDLSKESELAPASITKITRELIDAHLIHETTVQEAIS 
---------------------------1111-------------1111----------11 RGRPAVGLQTNNLGWQFLSMRLGRGYLTIALHELGGEVLIDTKIDIHEIDQDDVLARLLF 11---------2222-------2222------3333------------------------ EIEEFFQTYAAQLDRVTSIAITLPGLVNSEQGIVLQMPHYNVKNLALGPEIYKATGLPVF --------3333----------------1111---------------------------- VANDTRAWALAEKLFGHSQDVDNSVLISIHHGLGAGIVLDGRVLQGRHGNIGELGHIQID -----------------1111-----------------iiii---111122221111--- PQGKRCHCGNYGCLETVASSQAIRDQVTARIQAGEPSCLATVEEISIEDICAAAADGDPL -----3333---3333--------------1111--1111-------------1111--- AVDVIQQLGRYLGAAIAIVINLFNPEKILIGGVINQAKSILYPSIEQCIREQSLPVYHQD ------------------------------------3333-------------3333--- LKLVESRFYKQATMPGAALIKQALYDGLLLMKVVEG ----------1111---------------------- >RAS-RELATED PROTEIN RAB-3; SWP:O35963; PDB:1Z06A; RIFKIIVIGDSNVGKTCLTYRFCAGRFPDRTEATIGVDFRERAVDIDGERIKIQLWDTAG ---------2222--------------------------------iiii----------- QERFRKSMVQHYYRNVHAVVFVYDMTNMASFHSLPAWIEECKQHLLANDIPRILVGNKCD 33331111---------------1111------------------------------333 LRSAIQVPTDLAQKFADTHSMPLFETSAKNPNDNDHVEAIFMTLA 31111-----------1111----------1111------3333- >RAS-RELATED PROTEIN RAB-2; SWP:Q9UL25; PDB:1Z08A; AYSFKVVLLGEGCVGKTSLVLRYCENKFNDKHITTLGASFLTKKLNIGGKRVNLAIWDTA ----------1111---------------------------------------------- GQPIYYRDSNGAILVYDITDEDSFQKVKNWVKELRKMLGNEICLCIVGNKIDLEKERHVS -----2222-------1111-----------------------------33331111--- IQEAESYAESVGAKHYHTSAKQNKGIEELFLDLCKRMIET --------1111-------1111----------------- >RAS-RELATED PROTEIN RAB-2; SWP:P61019; PDB:1Z0AA; AYAYLFKYIIIGDTGVGKSCLLLQFTDKRFQDLTIGVEFGARMITIDGKQIKLQIWDTAG ------------2222-----------------2222--------iiii--------222 QESFRSITRSYYRGAAGALLVYDITRRDTFNHLTTWLEDARQHSNSNMVIMLIGNKSDLE 23333--33332222-------1111------------------1111-------33331 SRREVKKEEGEAFAREHGLIFMETSAKTASNVEEAFINTAKEIYEK 111--3333-------------------2222-------------- >RAB14, MEMBER RAS ONCOGEN; SWP:P61106; PDB:1Z0FA; 
NYSYIFKYIIIGDMGVGKSCLLHQFTEKKFMADCPHTIGVEFGTRIIEVSGQKIKLQIWD ------------2222--------------------------------iiii-------- TAGQERFRAVTRSYYRGAAGALMVYDITRRSTYNHLSSWLTDARNLTNPNTVIILIGNKA iiii----------1111-------1111--------------11111111-------33 DLEAQRDVTYEEAKQFAEENGLLFLEASAKTGENVEDAFLEAAKKIY 331111-------------------------2222------------ >Rabenosyn-5; SWP:Q9H1K0; PDB:1Z0JB; IEEELLLQQIDNIKAYIFDAKQCGRLDEVEVLTENLRELKHTLAKQKGGTD -3333---------------------------------------1111--- >5'-AMP-ACTIVATED PROTEIN ; SWP:P80386; PDB:1Z0NA; ARPTVFRWTGGGKEVYLSGSFNNWSKLPTRSQNNFVAILDLPEGEHQYKFFVDGQWTHDP ------------------1111--------iiii-----------------iiii---11 SEPIVTSQLGTVNNIIQVKKTDFEVF 11----1111---------------- >HYPOTHETICAL PROTEIN SPY1; SWP:Q99YR7; PDB:1Z0PA; MSYEKEFLKDFEDWVKTQIQVNQLAMATSQEVADERAKDAFIRYESKLDAYEFLLGKFDN ----------------------------3333---------------------------- YKNGKAFHDIPDE 1111-3333---- >PROBABLE INORGANIC POLYPH; SWP:O30297; PDB:1Z0SA; MRAAVVYKTDGHVKRIEEALKRLEVEVELFNQPSEELENFDFIVSVGGDGTILRILQKLK --------------------1111------------1111---------------1111- RCPPIFGINTGRVGLLTHASPENFEVELKKAVEKFEVERFPRVSCSAMPDVLALNEIAVL -------------1111--3333---------------------3333------------ SRKPAKMIDVALRVDGVEVDRIRCDGFIVATQIGSTGYAFSAGGPVVEPYLECFILIPIA --2222-------iiii-------------3333-----1111----1111--------- PFRFGWKPYVVSMERKIEVIAEKAIVVADGQKSVDFDGEITIEKSEFPAVFFKNEKRFRN ---------------------------%%%%----------------------1111333 LFGKVRSIG 3---1111- >PUTATIVE PROTEASE LA HOMO; SWP:O29883; PDB:1Z0WA; YKLFITEGYEVGRVNGLAVIGESAGIVLPIIAEVTPSMEGRVIATGRLQEIAREAVMNVS ---------------------------------------------1111----------- AIIKKYTGRDISNMDVHIQFVGTYEGVEGDSASISIATAVISAIEGIPVDQSVAMTGSLS -----------------------2222----------------------1111------1 VKGEVLPVGGVTQKIEAAIQAGLKKVIIPKDNIDDVLLDAEHEGKIEVIPVSRINEVLEH 111---------------1111------1111------1111----------3333---- VLEDGKKKNRLMSKFKELELAAV ----------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q837P6; PDB:1Z0XA; 
KLSKDTIIAAAFSLLEKSPTLEQLSRKVAKQLGVQAPAIYWYFKNKQALLQSAEAIEEHF -------------------3333-----------3333------------------1111 QEPALCGEWYSDLLAFENYYDLYQQFPCAVAIEIQTVPAYPQRLRHLNQGILREAGFSPE -------3333----------1111--------------------------3333----- THLAVTSLQHLLFGIDATEEKQLVSQVLNGDDYLKEQVLHKQYVSDNELTYEESIQFHSI ----------------------------------------------------3333---- HQKSAFIQAVKTYLDGLQADNTSSSK ---------------------1111- >LEU/ILE/VAL-BINDING PROTE; SWP:P02917; PDB:1Z15A; EDIKVAVVGAMSGPVAQYGDQEFTGAEQAVADINAKGGIKGNKLQIVKYDDACDPKQAVA ------------1111----------------------iiii--------%%%%------ VANKVVNDGIKYVIGHLCSSSTQPASDIYEDEGILMITPAATAPELTARGYQLILRTTGL -----------------3333-3333----------------3333-------------3 DSDQGPTAAKYILEKVKPQRIAIVHDKQQYGEGLARAVQDGLKKGNANVVFFDGITAGEK 333---------------------------------------1111-------------- DFSTLVARLKKENIDFVYYGGYHPEMGQILRQARAAGLKTQFMGPEGVANVSLSNIAGES ---------1111-------------------------------3333-3333-----11 AEGLLVTKPKNYDQVPANKPIVDAIKAKKQDPSGAFVWTTYAALQSLQAGLNQSDDPAEI 11--------11113333---------------1111------------------3333- AKYLKANSVDTVMGPLTWDEKGDLKGFEFGVFDWHANGTATDAK ---1111---1111----1111------------1111------ >REPLICATION PROTEIN A 32 ; SWP:P15927; PDB:1Z1DA; ANGLTVAQNQVLNLIKACPRPEGLNFQDLKNQLKHMSVSSIKQAVDFLSNEGHIYSTVDD -------------------1111-3333----1111----------------------11 DHFKSTDAE 11------- >STILBENE SYNTHASE; SWP:Q9SLV5; PDB:1Z1EA; VSVSGIRKVQRAEGPATVLAIGTANPPNCVDQSTYADYYFRVTNSEHMTDLKKKFQRICE ------------------------------3333------11113333------------ RTQIKNRHMYLTEEILKENPNMCAYKAPSLDAREDMMIREVPRVGKEAATKAIKEWGQPM ------------------3333------------------------------------33 SKITHLIFCTTSGVALPGVDYELIVLLGLDPSVKRYMMYHQGCFAGGTVLRLAKDLAENN 33---------------------------1111--------1111-----------1111 KDARVLIVCSENTSVTFRGPSETDMDSLVGQALFADGAAAIIIGSDPVPEVENPLFEIVS ------------3333----1111-----------------------2222--------- TDQQLVPNSHGAIGGLLREVGLTFYLNKSVPDIISQNINDALSKAFDPLGISDYNSIFWI 
--------1111-----1111-----1111----------------1111--1111---- AHPGGRAILDQVEEKVNLKPEKMKATRDVLSNYGNMSSACVFFIMDLMRKKSLEAGLKTT ------------------3333-------------!!!!-------------1111---- GEGLDWGVLFGFGPGLTIETVVLRSMAI iiii------------------------ >CGMP-DEPENDENT 3',5'-CYCL; SWP:O00408; PDB:1Z1LA; HASDDEYTKLLHDGIQPVAAIDSNFASFTYTPRSLPEDDTSMAILSMLQDMNFINNYKID -1111-----1111--3333-1111-11113333-3333---------------1111-- CPTLARFCLMVKKGYRDPPYHNWMHAFSVSHFCYLLYKNLELTNYLEDIEIFALFISCMC -----------1111------3333---------------3333--3333---------- HDLDHRGTNNSFQVASKSVLAALYSSEGSVMERHHFAQAIAILNTHGCNIFDHFSRKDYQ -2222------------------3333-----------------22221111-------- RMLDLMRDIILATDLAHHLRIFKDLQKMAEVGYDRNNKQHHRLLLCLLMTSCDLSDQTKG ---------------------------------1111-----------------1111-- WKTTRKIAELIYKEFFSQGDLEKAMGNRPMEMMDREKAYIPELQISFMEHIAMPIYKLLQ ----------------------1111---33331111----------------------- DLFPKAAELYERVASNREHWTKVSFTIRGLPSNNSLDF --1111-------------1111-------1111---- >High-molecular-weight cyt; SWP:Q8VUI3; PDB:1Z1NX; PKVDAIVIDTAAVFGKLEQPGVVFYHEKHTTALEKMAKDCTSCHVETEGKLSFKFARTVD -1111---1111--------------------------3333------------------ PTSKNAMAEQYHANCMACHEKVVGSYPTAPQAAECKRCHVGPGVEGATVTPKPSLDLNLH -------------------------1111----3333-------3333------------ GRHVVAEAKRLQVKEDESCKACHHTYDEAQKKLVYAKGEEGSCVYCHKQEPLPSPVDRVV -------------33333333--------------2222--1111--------------- PSTRDASHESCVNCHLSTRKAQTESGPVLCVGCHTAEAQAAWKKTAETPRLFRGQPDATL ----------------------------3333--33331111------------------ LVAGAATANGTVDVNWAAAGPGPVAFDHKAHEGFVGNCVTCHHPTQTGGSLAACGVACHT ------3333----3333-----------3333----3333---1111----1111---- TTGSKDGNFVTTAQSAHQLGVTTSCVGCHTTQANARKECAGCHAPMQKTALSQNSCIQCH ---3333----------------------------3333---1111-----11111111- EAGFPTSGTQTLGKEEREATAAKILAAKDEKPKTVPLENVPEKLTLNYMKGDEWQAAEFP -------------------------1111------3333--------------------- HRKIYQKLVEEAAKSPMANHFHGDALTMCSGCHHNAKPSLNPPKCASCHSKPFQERTANQ 
-----------------------33333333------------3333----1111-1111 PGLKGAFHNQCIGCHQEMQVNPKATDCQGCHKPKNS ----------------------11113333------ >HYPOTHETICAL PROTEIN PA33; SWP:Q9HYR3_PSEAE; PDB:1Z1SA; HMNAKEILVHSLRLLENGDARGWCDLFHPEGVLEFPYAPPGWKTRFEGRETIWAHMRLFP --------------1111-----33331111-------2222-------------1111- EHLTVRFTDVQFYETADPDLAIGEFHGDGVATVSGGKLAQDYISVLRTRDGQILLYRDFW ----------------1111-----------1111-------------iiii-------- NPLRHLEALG ---------- >DISINTEGRIN; SWP:Q5EE07; PDB:1Z1XA; NSVHPCCDPVKCEPREGEHCISGPCCRNCKFLNAGTICKRAMLDGLHDYCTGVTSDCPRN ---1111---------------1111iiii-----------------------------1 RYNH 111- >OOKINETE SURFACE PROTEIN ; SWP:O96555; PDB:1Z1YA; AVTVDTICNGQLVQMSNHFCMCNEGLVHLSENTCEENECETLGACGEFGQCIENPDPAQV --1111----------------2222---1111------------2222------1111- NMYCGCIEGYTLEDTCVLDVCQYNCGESGECIVEYLSEIQSAGCSCAIGVPNPEDECTTG ------2222-------3333----1111------iiii------------1111----- ETACQLCNTDNEVCNVEGVYCQCMEGFTFDENVCLGPHH -------3333----iiii----2222------------ >MINOR TAIL PROTEIN U; SWP:P03732; PDB:1Z1ZA; KHTELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAE ------------------------------1111-------------------------- LHIEVFLPAQVPDSELDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSA -------1111--------------------3333------------------------- DLTYVITYE --------- >YOP PROTEINS TRANSLOCATIO; SWP:P68590; PDB:1Z21A; SAEKTREVLWQQYYASNPPDHAVLEVLATPVREALLARFGQHQGSVVPAIDLPELRSVLQ --------------------------------------1111----3333---------- QFDSFGKRWEAILLQVLEGILPYLSELINKELMILL ------------------------------------ >CRK-ASSOCIATED SUBSTRATE; SWP:Q63767; PDB:1Z23A; GSGREPLELEVAVETLARLQQGVSTTVAHLLDLVGSASGPGGWRSTSEPQEPPVQDLKAA --------------------------------------------------1111------ VAAVHGAVHELLEFARSAVSSATHTSDRTLHAKLSRQLQKMEDVYQTLVVHGQVLDSGRG -----------------------3333-------------------------3333---- GPGFTLDDLDRLVACSRAVPEDAKQLASFLHGNASLLFRRTKA ----3333------------------------3333------- >INSECTICYANIN A FORM; SWP:P00305; PDB:1Z24A; 
GDIFYPGYCPDVKPVNDFDLSAFAGAWHEIAKLPLENENQGKCTIAEYKYDGKKASVYNS ------------------3333------------3333---------------------- FVSNGVKEYMEGDLEIAPDAKYTKQGKYVMTFKFGQRVVNLVPWVLATDYKNYAINYNCD --iiii-----------3333------------!!!!----------------------- YHPDKKAHSIHAWILSKSKVLEGNTKEVVDNVLKTFSHLIDASKFISNDFSEAACQYSTT -3333----------------------------1111---1111------3333------ YSLTGPDRH --------- >RAS-RELATED PROTEIN RAB-2; SWP:P35288; PDB:1Z2AA; VAIKMVVVGNGAVGKSSMIQRYCKGIFTKDYKKTIGVDFLERQIQVNDEDVRLMLWDTAG ---------2222--------------------------------%%%%--------iii QEEFDAITKAYYRGAQACVLVFSTTDRESFEAISSWREKVVAEVGDIPTALVQNKIDLLD i------33332222-------1111----------------------------333311 DSCIKNEEAEGLAKRLKLRFYRTSVKEDLNVSEVFKYLAEKHLQ 11----------------------1111--3333---------- >MALATE DEHYDROGENASE; SWP:Q7CRW4; PDB:1Z2IA; TVLARLDELERFCRAVFLAVGTDEETADAATRAMMHGTRLGVDSHGVRLLAHYVTALEGG ----3333---------------------------------33331111-------3333 RLNRRPQISRVSGFGAVETIDADHAHGARATYAAMENAMALAEKFGIGAVAIRNSSHFGP -------------!!!!----%%%%----------------------------------- AGAYALEAARQGYIGLAFCNSDSFVRLHDGAMRFHGTNPIAVGVPAADDMPWLLDMATSA --------1111--------------2222------------------------------ VPYNRVLLYRSLGQQLPQGVASDGDGVDTRDPNAVEMLAPVGGEFGFKGAALAGVVEIFS -3333-----------2222--1111----3333-------!!!!--------------- AVLTGMRLSFDLAPMGGPDFSTPRGLGAFVLALKPEAFLERDVFDESMKRYLEVLRGSPA -------3333----------------------3333-----------------1111-- REDCKVMAPGDREWAVAAKREREGAPVDPVTRAAFSELAEKFSVSPPTYH 2222---2222--------------------------------------- >NATURAL KILLER CELL RECEP; SWP:Q07763; PDB:1Z2KA; CPDSSEEVVGVSGKPVQLRPSNIQTKDVSVQWKKTEQGSHRKIEILNWYNDGPSWSNVSF ------------------------------------------------------------ SDIYGFDYGDFALSIKSAKLQDSGHYLLEITNTGGKVCNKNFQLLILD ------------------3333-------------------------- >ALLANTOATE AMIDOHYDROLASE; SWP:P77425; PDB:1Z2LA; LITHFRQAIEETLPWLSSFGADPAGGMTRLLYSPEWLETQQQFKKRMAASGLETRFDEVG ---------------3333--1111----------------------3333-----1111 
NLYGRLNGTEYPQEVVLSGSHIDTVVNGGNLDGQFGALAAWLAIDWLKTQYGAPLRTVEV ----------1111------------------3333------------------------ VAMAEEEGSRFPYVFWGSKNIFGLANPDDVRNICDAKGNSFVDAMKACGFTLPNAPLTPR ----------------3333-----3333----------------1111----------- QDIKAFVELHIEQGCVLESNGQSIGVVNAIVGQRRYTVTLNGESNHAGTTPMGYRRDTVY -----------------1111-----------------------------3333------ AFSRICHQSVEKAKRMGDPLVLTFGKVEPRPNTVNVVPGKTTFTIDCRHTDAAVLRDFTQ ------------------------------------------------------------ QLENDMRAICDEMDIGIDIDLWMDEEPVPMNKELVATLTELCEREKLNYRVMHSGAGHDA ----------1111----------------------------1111-----------333 QIFAPRVPTCMIFIPSINGISHNPAERTNITDLAEGVKTLALMLYQLAWQK 3-3333---------2222---1111--3333------------------- >interferon, alpha-inducib; SWP:P05161; PDB:1Z2MA; WDLTVKMLAGNEFQVSLSSSMSVSELKAQITQKIGVHAFQQRLAVHPSGVALQDRVPLAS ------------------------------------3333-------------------- QGLGPGSTVLLVVDKSDEPLSILVRNNKGRSSTYEVRLTQTVAHLKQQVSGLEGVQDDLF ---2222------------------1111-------1111---------------1111- WLTFEGKPLEDQLPLGEYGLKPLSTVFMNLRL ---iiii--11113333---2222-------- >Inositol-tetrakisphosphat; SWP:Q9XYQ1; PDB:1Z2NX; QTVSLFIWLPESKQKTLFISTKNHTQFELNNIIFDVTLSTELPDKEPNAIITKRTHPVGK ----------------------------%%%%---------------------------- MADEMRKYEKDHPKVLFLESSAIHDMMSSREEINALLIKNNIPIPNSFSVKSKEEVIQLL -----------1111----------------------1111------------------1 QSKQLILPFIVKPENAQGTFNAHQMKIVLEQEGIDDIHFPCLCQHYINHNNKIVKVFCIG 111---------------3333-------33331111-----------%%%%------!! 
NTLKWQTRTSLPNVHRCGIKSVDFNNQHLEDILSWPEGVIDKQDIIENSANRFGSKILED !!----------------------1111---11112222---------1111-------3 PILLNLTSEAEMRDLAYKVRCALGVQLCGIDFIKENEQGNPLVVDVNVFPSYGGKVDFDW 333-------------------------------%%%%------------iiii------ FVEKVALCYTE ----------- >LM5-1; SWP:NA; PDB:1Z2QA; GPLGSMGEKQSKGYWQEDEDAPACNGCGCVFTTTVRRHHCRNCGYVLCGDCSRHRAAIPM ----------------3333-----------3333------------3333--------- RGITEPERVCDACYLALRSSNMAG ------------------------ >UBIQUITIN-CONJUGATING ENZ; SWP:P35129; PDB:1Z2UA; HMALKRIQKELQDLGRDPPAQCSAGPVGDDLFHWQATIMGPPESPYQGGVFFLTIHFPTD --------------------------!!!!----------1111-2222--------111 YPFKPPKVAFTTRIYHPNINSNGSICLDILRSQWSPALTISKVLLSICSLLCDPNPDDPL 1--------------11111111---333311113333----------------1111-- VPEIARIYKTDRERYNQLAREWTQKYAM ---------------------------- >VACUOLAR PROTEIN SORTING ; SWP:Q9QZ88; PDB:1Z2WA; MLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQHILCTGNLCTKESYDYLKTLAGDVHIVR ------------------333311112222------------------3333-------- GDFDENLNYPEQKVVTVGQFKIGLIHGHQVIPWGDMASLALLQRQFDVDILISGHTHKFE 2222-1111-------!!!!-----------3333------3333--------------- AFEHENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVVTYVYQLIGDDVKVERIEYK ---iiii------1111------------------!!!!--------!!!!--------- KS -- >PROBABLE TRNA PSEUDOURIDI; SWP:Q8Q0M2; PDB:1Z2ZA; EVPEIEKQIGINLYSTDTTGLGGQLRQEIEDFIVKEITNREEGEEGKYLIVELTKRDWDT --3333---------------------1111----------------------------- HHLTRTLSRILQVSQKRISVAGTKDKRALTTQKISIFDTDASEIEKIHLKDIELKVLGRS -------------3333----------------------33331111------------- RKSVELGDLWGNDFRITVRNIENSPEETEALLKKTTDEILAQGGVPNFFGIQRFGSVRPV -------------------------------------------------3333------- THLVGKAIVEGNFEKAALLYIAEPFPEEPEETKNARQFVKDTLDFKEGLKTYPLRLGHER ------------------------------3333---3333-----------33333333 ANHLIANPEDYSGSFRVLPQNLYRFVHGYQSYIYNIILCRRIEAGIPLNRAVEGDIVCFR -1111-22221111------3333------------------------------------ NEVGLPDSSKTEKVTSETVNANRLLKLGRAFITAPLPGYNTEFASGIPGEIENGVLKELG 
-----------------3333--1111------------------3333----------- VSLEGFNIEKFPESSKGTRREVLLEVKPKFEAGEDELNPGKSKAVLEFLPKGSYATTVLR -1111--3333-----------------------3333---------------3333333 EYKVNPLQ 3---1111 >PURINE NUCLEOSIDE PHOSPHO; SWP:A2E7Y6; PDB:1Z34A; ATPHNSAQVGDFAETVLMCGDPLRAKLIAETYLENPKLVNNVRGIQGYTGTYKGKPISVM --------------------3333----------------2222-------iiii----- GHGMGLPSICIYAEELYSTYKVKTIIRVGTCGAIDMDIHTRDIVIFTSAGTNSKINRIRF ----------------------------------11112222-----------------% MDHDYPATASFDVVCALVDAAKELNIPAKVGKGFSTDLFYNPQTELAQLMNKFHFLAVEM %%%---------------------------------------3333----1111------ ESAGLFPIADLYGARAGCICTVSDHILHHEERQNSFQNMMKIALEAAIKL 3333-----1111-----------1111--3333---------------- >TRNA-SPECIFIC ADENOSINE D; SWP:P68398; PDB:1Z3AA; SEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIM --2222----------------------------%%%%---------11111111----- ALRQGGLVMQNYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVL -------------2222-----------------------------3333--------11 HHPGMNHRVEITEGILADECAALLSDFFRMRRQEIK 11---------------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P52478; PDB:1Z3DA; TTPSRRRLMRDFKKLQEDPPAGVSGAPTEDNILTWEAIIFGPQETPFEDGTFKLSLEFTE -------------------2222----1111---------------2222--------11 EYPNKPPTVKFISKMFHPNVYADGSICLDILQNRWSPTYDVAAILTSIQSLLDEPNPNSP 11--------------11111111---3333----1111-----------1111------ ANSLAAQLYQENRREYEKRVQQIVEQSWLNF -----------3333----------3333-- >REGULATORY PROTEIN SPX; SWP:O31602; PDB:1Z3EA; HMVTLYTSPSCTSCRKARAWLEEHEIPFVERNIFSEPLSIDEIKQILRMTEDGTDEIIST ---------------------1111------1111-----------1111--3333--11 RSKVFQKLNVNVESMPLQDLYRLINEHPGLLRRPIIIDEKRLQVGYNEDEIRRFLPRKV 11--------3333-3333---------------------------3333-1111---- >DNA-directed RNA polymera; SWP:P20429; PDB:1Z3EB; KEKVLEMTIEELDLSVRSYNCLKRAGINTVQELANKTEEDMMKVRNLGRKSLEEVKAKLE 3333---3333-----------1111-----------------2222------------1 ELGLGLR 111---- >RAD54-like; SWP:Q7ZV09; PDB:1Z3IX; 
LGLRRAGVRKALHDPFEDGALVLYEPPAISAHDLIKADKEKLPVHVVVDPVLSKVLRPHQ -------------------------------------1111-------33331111---- REGVKFLWDCVTGRRIENSYGCIMADEMGLGKTLQCITLIWTLLKQSPDCKPEIDKVIVV ---------1111--2222---------------------------1111---------- SPSSLVRNWYNEVGKWLGGRVQPVAIDGGSKDEIDSKLVNFISQQGMRIPTPILIISYET -3333-----------!!!!---------------------------------------- FRLHAEVLHKGKVGLVICDEGHRLKNSDNQTYLALNSMNAQRRVLISGTPIQNDLLEYFS ---33332222---------11111111----------------------33331111-- LVHFVNSGILGTAQEFKKRFEIPILKGRDADASDKDRAAGEQKLQELISIVNRCLIRRTS -----3333------------------------------------------1111----- DILSKYLPVKIEQVVCCNLTPLQKELYKLFLKQAKPVESLQTGKISVSSLSSITSLKKLC 3333------------------------------3333---------------------- NHPALIYEKCLTGEEGFDGALDLFPQNYSTKAVEPQLSGKMLVLDYILAMTRTTTSDKVV -3333--3333-----22223333---------3333----------------------- LVSNYTQTLDLFEKLCRNRRYLYVRLDGTMSIKKRAKIVERFNNPSSPEFIFMLSSKAGG -------------------------------------------3333-------3333-i CGLNLIGANRLVMFDPDWNPANDEQAMARVWRDGQKKTCYIYRLLSTGTIEEKILQRQAH iii-3333----------3333---------2222-------------3333-------- KKALSSCVVDEEQDVERHFSLGELRELFSLNEKTLSDTHDRFRCRRCVNGRQVRPPPDDS ---------------------------------------3333----iiii-----1111 DCTCDLSNWHHCADKRGLRDPVLQASWDAAVSFVFHQRSHEDQR 11113333--------------3333-3333---------3333 >THAUMATIN-LIKE PROTEIN; SWP:NA; PDB:1Z3QA; ATFEIVNRCSYTVWAAAVPGGGRQLNQGQSWTINVNAGTTGGRIWGRTGCSFDGSGRGRC -------------------------2222------2222-------------1111---- QTGDCGGVLSCTAYGNPPNTLAEFALNQFNNLDFFDISLVDGFNVPMDFSPTSGGCRGIR ----%%%%---------------------------------------------------- CAADINGQCPGALKAPGGCNNPCTVFKTDQYCCNSGACSPTDYSQFFKRNCPDAYSYPKD ---3333--3333-2222--3333---3333-3333----3333------1111--1111 DQTTTFTCPGGTNYRVVFCP --------2222-------- >POLYPROTEIN; SWP:Q80J38; PDB:1Z3RA; ISKGLTYTMCDKAKFTWKRAPTDSGHDTVVMEVAFSGTKPCRIPVRAVAHGAPDVDVAML ---1111---1111-------------------------------------3333----- ITPNPTMENNGGGFIEMQLPPGDNIIYVGELKHQWFQKG 
---------------------------!!!!-------- >ANGIOPOIETIN-2; SWP:O15123; PDB:1Z3UA; SFRDCAEVFKSGHTTNGIYTLTFPNSTEEIKAYCDMEAGGGGWTIIQRREDGSVDFQRTW ---3333------------------------------iiii------------------- KEYKVGFGNPSGEYWLGNEFVSQLTNQQRYVLKIHLKDWEGNEAYSLYEHFYLSSEELNY --------1111----------3333----------------------------3333-- RIHLKGLTGTAGKISSISQPGNDFSTKDGDNDKCICKCSQMLTGGWWFDACGPSNLNGMY -------------------------1111-------3333-------------------- YPQRQNTNKANGIKWAAWKGSGYSLKATTMMIRPAD -2222------------------------------- >PUTATIVE CYTIDYLTRANSFERA; SWP:Q8DIJ1; PDB:1Z3XA; FVTPALADLQEQLYNGNEKSQLAASTLSTAGTEGYHLLQEFLKDSATFSPPPAPWIRGQA ---1111-----------------3333----------------1111----1111---- YRLLFHSPEASVQAFLQQHYPQGVIPLRSDRGVDYQELAKLLVAEKFEAADRLTTQKLCE ---1111----------------------------------------------------- LAGPLAQKRRWLYFTEVEQLPIPDLQTIDQLWLAFSLGRFGYSVQRQLWLGCGQNWDRLW ------------33331111---------------iiii--------------------- EKIGWRQGKRWPRYPNEFIWDLSAPRGHLPLTNQLRGVQVLNALLNHPAWTA 1111--!!!!----1111--11112222-----1111---------3333-- >APICAL MEMBRANE ANTIGEN 1; SWP:Q7KQK5; PDB:1Z40A; NPWTEYMAKYDIEEVHGSGIRVDLGEDAEVAGTQYRLPSGKCPVFGKGIIIENSNTTFLT 11111111---------------------iiii-----------------------1111 PVATKDGGFAFPPTEPLMSPMTLDEMRHFYKDNKYVKNLDELTLCSRHAGNMIPDNDKNS ---------------------------1111----1111---------1111-%%%%--- NYKYPAVYDDKDKKCHILYIAAQENNGPRYCFCFRPAKDISFQNYTYLSKNVVDNWEKVC --------------------------------------3333------11111111---- PRKNLQNAKFGLWVDGNCEDIPHVNEFPAIDLFECNKLVFELSASDQPKQYEQHLTDYEK -------------%%%%------------------------------------------- IKEGFKNKNASMIKSAFLPTDRYKSHGKGYNWGNYNTETQKCEIFNVKPTCLINNSSYIA 1111---iiii---1111--1111iiii--------------------------1111-- TTALSHPIEVE -3333------ >Probable NADH-dependent f; SWP:P54550; PDB:1Z41A; ARKLFTPITIKDTLKNRIVSPCYSSHEKDGKLTPFHAHYISRAIGQVGLIIVEASAVNPQ -3333-------------------1111----3333---------------------111 GRITDQDLGIWSDEHIEGFAKLTEQVKEQGSKIGIQLAHAGRKAELEGDIFAPSAIAFDE 
1----------3333-----------1111---------!!!!---------------11 QSATPVESAEKVKETVQEFKQAAARAKEAGFDVIEIHAAHGYLIHEFLSPLSNHRTDEYG 11------------------------------------iiii--------------1111 GSPENRYRFLREIIDEVKQVWDGPLFVRVSASDYTDKGLDIADHIGFAKWKEQGVDLIDC ----------------3333--------------2222------------1111------ SSGALVHADINVFPGYQVSFAEKIREQADATGAVGITDGSAEEILQNGRADLIFIGRELL ------------2222----------------------------1111-------3333- RDPFFARTAAKQLNTEIPAPVQYERGW ----------1111-----3333---- >GAL10 BIFUNCTIONAL PROTEI; SWP:P04397; PDB:1Z45A; SKIVLVTGGAGYIGSHTVVELIENGYDCVVADNLSNSTYDSVARLEVLTKHHIPFYEVDL -------1111----------1111---------------------------------33 CDRKGLEKVFKEYKIDSVIHFAGLKAVGESTQIPLRYYHNNILGTVVLLELMQQYNVSKF 33-----3333--------------3333------------------------------- VFSSSATVYGDATRFPNMIPIPEECPLGPTNPYGHTKYAIENILNDLYNSDKKSWKFAIL ------33333333-------1111-------------------------1111------ RYFNPIGAHPSGLIGEDPLGIPNNLLPYMAQVAVGRREKLYIFRDGTPIRDYIHVVDLAK --------3333-------------------1111------------------------- GHIAALQYLEAYNENEGLCREWNLGSGKGSTVFEVYHAFCKASGIDLPYVLNLTAKPDRA --------33331111--------------3333-------------------------- KRELKWQTELQVEDSCKDLWKWTTENPFGYQLRGVEARFSAEDMRYDARFVTIGAGTRFQ -------------------------1111--2222------2222--------2222--- ATFANLGASIVDLKVNGQSVVLGYENEEGYLNPDSAYIGATIGRYANRISKGKFSLCNKD --------------iiii-------3333-------2222---------------%%%%- YQLTVNNGVNANHSSIGSFHRKRFLGPIIQNPSKDVFTAEYMLIDNEKDTEFPGDLLVTI ------!!!!-%%%%--3333-----------2222---------3333----------- QYTVNVAQKSLEIVYKGKLTAGEATPINLTNHSYFNLNKPYGDTIEGTEIMVRSKKSVDV -----1111----------------------------3333---2222------------ DKNMIPTGNIVDREIATFNSTKPTVLGPKNPQFDCCFVVDENAKPSQINTLNNELTLIVK 1111------------1111-------------------3333------1111------- AFHPDSNITLEVLSTEPTYQFYTGDFLSAGYEARQGFAIEPGRYIDAINQENWKDCVTLK ---1111----------------1111----2222----------333311111111--2 NGETYGSKIVYRFS 222----------- >PUTATIVE ABC-TRANSPORTER ; SWP:NA; PDB:1Z47A; 
GSMTIEFVGVEKIYPGGARSVRGVSFQIREGEMVGLLGPSGSGKTTILRLIAGLERPTKG -------------2222-----------2222------2222------------------ DVWIGGKRVTDLPPQKRNVGLVFQNYALFQHMTVYDNVSFGLREKRVPKDEMDARVRELL ---iiii-11113333------2222----------------1111-------------- RFMRLESYANRFPHELSGGQQQRVALARALAPRPQVLLFDEPFAAIDTQIRRELRTFVRQ ----1111----1111-------------1111-------1111---------------- VHDEMGVTSVFVTHDQEEALEVADRVLVLHEGNVEQFGTPEEVYEKPGTLFVASFIGESN -----------------------------iiii---------------------2222-- VWTRAVQNGRIEVAGAALPVDPAVSEGSEVAVVVRPKDVELQPASEREAHAQVVRSAFKG ------%%%%--iiii----33332222------1111------3333------------ SYSACWIRTKDGEVWEVHVPSADRHRWSPGAWVHMNVTRWFIFPR --------1111-------3333----2222-------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9KBG0; PDB:1Z4EA; HVTIREATEGDLEQMVHMLADDVLGRKRERYEKPLPVSYVRAFKEIKKDKNNELIVACNG -------3333------------3333---------------------1111------!! EEIVGMLQVTFTPYLTYQGSWRATIEGVRTHSAARGQGIGSQLVCWAIERAKERGCHLIQ !!-------------%%%%-----------1111-------------------------- LTTDKQRPDALRFYEQLGFKASHEGLKMHF ---1111-------1111------------ >TOR INHIBITION PROTEIN; SWP:Q2EES9; PDB:1Z4HA; MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKL ------------------------------------------------3333-------- LSRANG ------ >General control of amino ; SWP:Q92830; PDB:1Z4RA; SGIIEFHVIGNSLTPKANRRVLLWLVGLQNVFSHQLPRMPKEYIARLVFDPKHKTLALIK -----------------------------------1111----------1111------i DGRVIGGICFRMFPTQGFTEIVFCAVTSNEQVKGYGTHLMNHLKEYHIKHNILYFLTYAD iii----------3333---------3333-----------------1111--------1 EYAIGYFKKQGFSKDIKVPKSRYLGYIKDYEGATLMECELNPR 111---------------33332222---2222---------- >HEMAGGLUTININ-NEURAMINIDA; SWP:P04850; PDB:1Z4VA; INDNRYINGINQFYFSIAEGRNLTLGPLLNMPSFIPTATTPEGCTRIPSFSLTKTHWCYT --3333---------3333--------------------1111---------3333---- HNVILNGCQSNQFVSMGIIEPTSAGFPFFRTLKTLYLSDGVNRKSCSISTVPGGCMMYCF ---------------------1111-------------------------2222------ VSTQPERDDYFSAAPPEQRIIIMYYNDTIVERIINPPGVLDVWATLNPGTGSGVYYLGWV 
----33333333--------------------------2222-------------iiii- LFPIYGGVIKGTSLWNNQANKYFIPQMVAALCSQNQATQVQNAKSSYYSSWFGNRMIQSG --------2222-----2222---11111111----------1111-----%%%%----- ILACPLRQDLTNECLVLPFSNDQVLMGAEGRLYMYGDSVYYYQRSNSWWPMTMLYKVTIT ----------------------------------!!!!---------------------- FTNGQPSAISAQNVPTQQVPRPGTGDCSATNRCPGFCLTGVYADAWLLTNPSSTSTFGSE -iiii------------------!!!!1111------------------1111--2222- ATFTGSYLNTATQRINPTMYIANNTQIISSQQFGSSGQEAAYGHTTCFRDTGSVMVYCIY ----------------------------------2222---------------------- IIELSSSLLGQFQIVPFIRQVTLS ------------------------ >AEROLYSIN; SWP:P09167; PDB:1Z52A; PVYPDQLRLFSLGQGVCGDKYRPVNREEAQSVKSNIVGMMGQWQISGLANGWVIMGPGYN --1111------2222-2222---------------11111111----%%%%---3333- GEIKPGTASNTWCYPTNPVTGEIPTLSALDIPDGDEVDVQWRLVHDSANFIKPTSYLAHY ----------------------------------3333---------------------- LGYAWVGGNHSQYVGEDMDVTRDGDGWVIRGNNDGGCDGYRCGDKTAIKVSNFAYNLDPD -------1111-2222------------------------3333-------------111 SFKHGDVTQSDRQLVKTVVGWAVNDSDTPQSGYDVTLRYDTATNWSKTNTYGLSEKVTTK 1-------------------------------------------------3333------ NKFKWPLVGETELSIEIAANQSWASQNGGSTTTSLSQSVRPTVPARSKIPVKIELYKADI ---------------------3333----------------------------------- SYPYEFKADVSYDLTLSGFLRWGGNAWYTHPDNRPNWNHTFVIGPYKDKASSIRYQWDKR --------------------------1111-----------------3333-------11 YIPGEVKWWDLNWTIQQNGLSTMQNNLARVLRPVRAGITGDFSAESQFAGNIEIGAPVPL 113333----------------------1111---------------------------- AGLRLEIPLDAQELSGLGFNNVSLSVTPAAN ---------33333333----------1111 >PROBABLE THIOESTERASE; SWP:Q5SJV0; PDB:1Z54A; MESVTRIKVRYAETDQMGVVHHSVYAVYLEAARVDFLERAGLPYHRVEARGVFFPVVELG ---------3333-1111--3333------------------------------------ LTFRAPARFGEVVEVRTRLAELSSRALLFRYRVEREGVLLAEGFTRHLCQVGERAARIPE ----------------------------------iiii------------%%%%----33 DIYRALSVLHLK 33---3333--- >LIGASE INTERACTING FACTOR; SWP:P53150; PDB:1Z56A; ADKLYKDICCVNDSYRNIKESDSSNRNRVEQLARERELLDKLLETRDERTRAMMVTLLNE 
3333--3333-----1111---------------1111-------1111----------- KKKKIRELHEILRQNNI -------1111------ >DNA ligase 4; SWP:Q08387; PDB:1Z56C; SNIFAGLLFYVLSDYVTEDTGIRITRAELEKTIVEHGGKLIYNVILKRHSIGDVRLISCK ----------------------------3333---------------------------- TTTECKALIDRGYDILHPNWVLDCIAYKRLILIEPNYCFNVSQKMRAVAEKRVDCLGDSF ------------------33333333---------------------------------- ENDISETKLSSLYKSQLSLPPMGELEIDSEVRRFPLFIEMKIKLFGGKITDQQSLCNLII ----3333------------------------------3333-3333------------- IPYTDPILRKDCMNEVHEKIKEQIKASDTIPKIARVVAPEWVDHSINENCQVPEEDF --------------3333---3333-------------------1111--------- >DUAL SPECIFICITY PROTEIN ; SWP:P49759; PDB:1Z57A; HCQSGDVLSARYEIVDTLGEGAFGKVVECIDHKAGGRHVAVKIVKNVDRYCEAARSEIQV --2222--------------3333---------iiii----------------------- LEHLNTTDPNSTFRCVQMLEWFEHHGHICIVFELLGLSTYDFIKENGFLPFRLDHIRKMA -------1111------------iiii--------------------------------- YQICKSVNFLHSNKLTHTDLKPENILFVQSDYTEAYNRDERTLINPDIKVVDFGSATYDD --------------------3333----------------------------1111-111 EHHSTLVSTRHYRAPEVILALGWSQPCDVWSIGCILIEYYLGFTVFPTHDSKEHLAMMER 1-------3333----1111---3333--------------------------------- ILGPLPKHMIQKTRKRKYFHHDRLDWDEHSSAGRYVSRACKPLKEFMLSQDVEHERLFDL -----3333-----3333-!!!!---1111-----------3333-----3333------ IQKMLEYDPAKRITLREALKHPFFDLLKK --1111-3333--3333---33331111- >RC-RNASE 3; SWP:Q9DFY7; PDB:1Z5FA; DWETFQKKHLTDTKKVKCDVEMAKALFDCKKTNTFIYALPGRVKALCKNIRDNTDVLSRD --3333----------3333---------------------------------------- AFLLPQCDRIKLPCHYKLSSSTNTICITCVNQLPIHFAGVGSCP -------------------------------------------- >APHA PROTEIN; SWP:Q5MB24; PDB:1Z5GA; TLNPGTNVAKLAEQAPVHWVSVAQIENSLTGRPPMAVGFDIDDTVLFSSPGFWRGKKTYS --------------------------1111----------2222---------------1 PDSDDYLKNPAFWEKMNNGWDEFSIPKEAARQLIDMHVRRGDSIYFVTGRSQTKTETVSK 111-----------------1111------------------------------------ TLADNFHIPAANMNPVIFAGDKPEQNTKVQWLQEKNMRIFYGDSDNDITAARDCGIRGIR ---1111-3333---------1111------------------3333----1111----- 
ILRAANSTYKPLPQAGAFGEEVIVNSEY ---1111------2222----------- >PROCARBOXYPEPTIDASE B; SWP:P09955; PDB:1Z5RA; GHSYEKYNNWETIEAWTKQVTSENPDLISRTAIGTTFLGNNIYLLKVGKPGPNKPAIFMD -----------------------1111--------1111--------------------- CGFHAREWISHAFCQWFVREAVLTYGYESHMTEFLNKLDFYVLPVLNIDGYIYTWTKNRM ---1111----------------2222------------------------------111 WRKTRSTNAGTTCIGTDPNRNFDAGWCTTGASTDPCDETYCGSAAESEKETKALADFIRN 1------2222-----1111----2222-----1111------2222------------- NLSSIKAYLTIHSYSQMILYPYSYDYKLPENNAELNNLAKAAVKELATLYGTKYTYGPGA ---------------------------------------------------------333 TTIYPAAGGSDDWAYDQGIKYSFTFELRDKGRYGFILPESQIQATCEETMLAIKYVTNYV 3-------------1111--------------!!!!-3333------------------- LGHL ---- >E3 SUMO-protein ligase Ra; SWP:P49792; PDB:1Z5SD; SLDVLIVYELTPTAEQKALATKLKLPPTFFCYKNRPDYVSEEEEDDEDFETAVKKLNGKL ------------3333----1111-----3333--------------------3333--- YLDGS ----- >TUBULIN GAMMA-1 CHAIN; SWP:P23258; PDB:1Z5VA; PREIITLQLGQCGNQIGFEFWKQLCAEHGISPEAIVTDRKDVFFYQADDEHYIPRAVLLD -------------------------1111----------3333----------------- LEPRVIHSILNSPYAKLYNPENIYLSGNNWASGFSQGEKIHEDIFDIIDREADGSDSLEG ---3333----1111---1111-------------------------------------- FVLCHSIAGGTGSGLGSYLLERLNDRYPKKLVQTYSVFPNQSDVVVQPYNSLLTLKRLTQ ----------3333-------------------------------3333----------- NADCLVVLDNTALNRIATDRLHIQNPSFSQINQLVSTIMSASTTTLRYPGYMNNDLIGLI --------------------------3333---------1111-----------333333 ASLIPTPRLHFLMTGYTPLTTDQSVRKTTVLDVMRRLLQPKNVMVSTGTNHCYIAILNII 33------------------------------------3333------------------ QGEVDPTQVHKSLQRIRERKLANFIPWGPASIQVALSRKSPYLRVSGLMMANHTSISSLF ------------------------------------------------------3333-- ERTCRQYDKLRKREAFLEQFRKEDMFKDNFDEMDTSREIVQQLIDEYHAATR ----------------------------3333------------33333333 >Ecdysone receptor [Fragme; SWP:A3EZJ4; PDB:1Z5XE; PITPEQEELIHRLVYFQNEYEHPSPEDIKRIVNAAPEEENVAEERFRHITEITILTVQLI -----------------1111--------------------------------------- 
VEFSKRLPGFDKLIREDQIALLKACSSEVMMFRMARRYDAETDSILFATNQPYTRESYTV --33332222------------------------1111---------------------- AGMGDTVEDLLRFCRHMCAMKVDNAEYALLTAIVIFSERPSLSEGWKVEKIQEIYIEALK -----------------1111----------------------3333------------- AYVENRRKPYATTIFAKLLSVLTELRTLGNMNSETCFSLKLKNRKVPSFLEEIWDVV ----------------------------------------------3333------- >Ultraspiracle protein [Fr; SWP:A3EZJ5; PDB:1Z5XU; VSDICQAADRQLYQLIEWAKHIPHFTELPVEDQVILLKSGWNELLIAGFSHRSMSVKDGI ---------------------2222---3333-----------------3333------- MLATGLVVHRNCAHQAGVGAIFDRVLTELVAKMREMKMDKTELGCLRSIVLFNPEAKGLK --------1111-----1111-------------------------------1111---- STQQVENLREKVYAILEEYCRQTYPDQSGRFAKLLLRLPALRSIGLKCLEHLFFFKLVGN -----------------------1111---------------3333----2222------ TSIDSFLLSMLES ------------- >HELICASE OF THE SNF2/RAD5; SWP:NA; PDB:1Z5ZA; KIETNVYCNLTPEQAAMYKAEVENLFNNIDSVTGIKRKGMILSTLLKLKQIVDHPALLKG ---------------------------3333----------------------------- GEQSVRRSGKMIRTMEIIEEALDEGDKIAIFTQFVDMGKIIRNIIEKELNTEVPFLYGEL ---3333--------------1111-------------------------------1111 SKKERDDIISKFQNNPSVKFIVLSVKAGGFGINLTSANRVIHFDRWWNPAVENVIVHKLI --------------3333-------2222----1111----------3333--------- SVGTLEEKIDQLLAFKRSLFKDIISSGDSWITELSTEELRKVIELSVGGY 2222-----------3333---3333-3333---------------2222 >TFIIH basal transcription; SWP:Q13888; PDB:1Z60A; LDAFQEIPLEEYNGERFCYGCQGELKDQHVYVCAVCQNVFCVDCDVFVHDSLHSCPGCI -------3333-------3333----------3333----3333-3333---------- >PRION-LIKE PROTEIN DOPPEL; SWP:Q9QUG3; PDB:1Z65A; MKNRLGTWWVAILCMLLASHLSTVKARGIK ------3333-------33333333----- >HYPOTHETICAL PROTEIN S400; SWP:Q83IZ7; PDB:1Z67A; LFDEVVGAFLKGDAGKYQAILSWVEEQGGIQVLLEKLQSGGLGAILSTWLSNQQRNQSVS ----------!!!!----------1111---------1111------------------- GEQLESALGTNAVSDLGQKLGVDTSTASSLLAEQLPKIIDALSPQGEVQANNDLLSAGEL ------------------------------------------1111--3333-3333--- LKGKLF --1111 >FIBROBLAST ACTIVATION PRO; SWP:Q12884; PDB:1Z68A; 
MRALTLKDILNGTFSYKTFFPNWISGQEYLHQSADNNIVLYNIETGQSYTILSNRTMKSV ------------------------------------------------------3333-- NASNYGLSPDRQFVYLESDYSKLWRYSYTATYYIYDLSNGEFVRGNELPRPIQYLCWSPV -------1111------------------------3333--------------------- GSKLAYVYQNNIYLKQRPGDPPFQITFNGRENKIFNGIPDWVYEEEMLATKYALWWSPNG -------%%%%-----1111---------2222------3333-------------1111 KFLAYAEFNDTDIPVIAYSYYGDEQYPRTINIPYPKAGAKNPVVRIFIIDTTYPAYVGPQ ---------1111----------------------2222-------------3333---- EVPVPAMIASSDYYFSWLTWVTDERVCLQWLKRVQNVSVLSICDFREDWQTWDCPKTQEH ----3333-------------1111-----------------------------3333-- IEESRTGWAGGFFVSTPVFSYDAISYYKIFSDKDGYKHIHYIKDTVENAIQITSGKWEAI -------------------1111--------1111---------3333------------ NIFRVTQDSLFYSSNEFEEYPGRRNIYRISIGSYPPSKKCVTCHLRKERCQYYTASFSDY ----------------%%%%-------------------------3333--------222 AKYYALVCYGPGIPISTLHDGRTDQEIKILEENKELENALKNIQLPKEEIKKLEVDEITL 2--------------------------------------3333----------------- WYKMILPPQFDRSKKYPLLIQVYGGPCSQSVRSVFAVNWISYLASKEGMVIALVDGRGTA ----------3333----------2222---------------------------2222- FQGDKLLYAVYRKLGVYEVEDQITAVRKFIEMGFIDEKRIAIWGWSYGGYVSSLALASGT -------1111-2222-------------------3333--------------------- GLFKCGIAVAPVSSWEYYASVYTERFMGLPTKDDNLEHYKNSTVMARAEYFRNVDYLLIH -------------3333-------------3333---------33331111--------- GTADDNVHFQNSAQIAKALVNAQVDFQAMWYSDQNHGLSGLSTNHLYTHMTHFLKQCFS 1111---3333--------1111-----------3333--------------------- >Coenzyme F420-dependent N; SWP:Q8PZ66; PDB:1Z69A; MKFGIEFVPSDPALKIAYYAKLSEQQGFDHVWITDHYNNRDVYSTLTVLALNTNSIKIGP -----------------------1111--------1111-3333-----1111------- GVTNSYTRNPAITASSIASIAEISGGRAVLGLGPGDKATFDAMGIAWKKPLATTKEAIQA ------------------------------------------------------------ IRDFISGKKVSMDGEMIKFAGAKLAFKAGNIPIYMGAQGPKMLELAGEIADGVLINASHP ----------------------------------------------------------33 KDFEVAVEQIKKGAEKAGRDPSEVDVTAYACFSIDKDPVKAVNAAKVVVAFIVAGSPDLV 
33-----------------1111-----------------------------11113333 LERHGIPVEAKSQIGAAIAKGDFGALMGGLVTPQMIEAFSICGTPDDCMKRIKDLEAIGV -1111-3333---------------------------------------------1111- TQIVAGSPIGPAKEKAIKLIGKEIIAK --------------------------- >FATTY ACID SYNTHESIS PROT; SWP:Q965D7; PDB:1Z6BA; DTSIDIEDIKKILPHRYPFLLVDKVIYMQPNKTIIGLKQVSTNEPFFNGHFPQKQIMPGV ----------------------------2222-----------3333---1111---333 LQIEALAQLAGILCLKSDNNLFLFAGVDGVRWKKPVLPGDTLTMQANLISFKSSLGIAKL 3-----------------------------------2222------------1111---- SGVGYVNGKVVINISEMTFALS -----iiii------------- >VITAMIN K-DEPENDENT PROTE; SWP:P07225; PDB:1Z6CA; KDVDECSLKPSICGTAVCKNIPGDFECECPEGYRYNLKSKSCEDIDECSENMCAQLCVNY ------------%%%%--------------------1111-----1111----------- PGGYTCYCDGKKGFKLAQDQKSCEVVS ---------3333-------------- >PENICILLIN-BINDING PROTEI; SWP:P04287; PDB:1Z6FA; NIKTMIPGVPQIDAESYILIDYNSGKVLAEQNADVRRDPASLTKMMTSYVIGQAMKAGKF 3333---------------------------1111---!!!!------------1111-- KETDLVTIGNDAWATGNPVFKGSSLMFLKPGMQVPVSQLIRGINLQSGNDACVAMADFAA 1111----333311113333--------2222---------------------------- GSQDAFVGLMNSYVNALGLKNTHFQTVHGLDADGQYSSARDMALIGQALIRDVPNEYSIY --------------1111-------------1111-----------------33333333 KEKEFTFNGIRQLNRNGLLWDNSLNVDGIKTGHTDKAGYNLVASATEGQMRLISAVMGGR ------iiii------33331111----------------------!!!!---------- TFKGREAESKKLLTWGFRFFETVNPLKVGKEFASEPVWFGDSDRASLGVDKDVYLTIPRG 3333----------------------2222---------------------------222 RMKDLKASYVLNSSELHAPLQKNQVVGTINFQLDGKTIEQRPLVVLQEIPEGNF 21111---------------2222--------iiii------------------ >GUANYLATE KINASE; SWP:Q8I2M1; PDB:1Z6GA; NIYPLVICGPSGVGKGTLIKKLLNEFPNYFYFSVSCTTRKKREKEKEGVDYYFIDKTIFE ---------2222------------1111------------11112222----------- DKLKNEDFLEYDNYANNFYGTLKSEYDKAKEQNKICLFEMNINGVKQLKKSTHIKNALYI -------------%%%%----3333---------------3333---------------- FIKPPSTDVLLSRLLTRNTENQEQIQKRMEQLNIELHEANLLNFNLSIINDDLTLTYQQL --------------1111--------------------1111------------------ KNYLLNSYIHL 
----------- >BIOTIN/LIPOYL ATTACHMENT ; SWP:Q9R9I3; PDB:1Z6HA; TVSIQMAGNLWKVHVKAGDQIEKGQEVAILESMKMEIPIVADRSGIVKEVKKKEGDFVNE ---------------2222--2222------iiii-----------------2222--22 GDVLLELSNSTQ 22----3333-- >PEPTIDOGLYCAN-RECOGNITION; SWP:Q9GNK5; PDB:1Z6IA; DFVERQQWLAQPPQKEIPDLELPVGLVIALPTNSENCSTQAICVLRVRLLQTYDIESSQK ---3333----------------------------------------------------- CDIAYNFLIGGDGNVYVGRGWNKMGAHMNNINYDSQSLSFAYIGSFKTIQPSAKQLSVTR ---------1111------------11113333--------------------------- LLLERGVKLGKIAPSYRFTASSKLMPSVTDFKADALYASFANWTHWS ------------1111---3333-11111111-33331111-1111- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1Z6MA; ADISVIDATKVNTETGLHIGESNAPVKIEFINVRCPYCRKWFEESEELLAQSVKSGKVER -3333-1111----------1111-------1111------------------------- IIKLFDKEKESLQRGNVHHYIDYSAPEQALSALHKFATQDEWGNLTLEEVATYAEKNLGL --------3333---------1111------------33331111--------------- KEQKDATLVSAVIAEANAAHIQFVPTIIIGEYIFDESVTEEELRGYIEK ----------------------------!!!!--3333----------- >HYPOTHETICAL PROTEIN PA12; SWP:Q9I4A4; PDB:1Z6NA; MASYAELFDIGEDFAAFVGHGLATEQGAVARFRQKLESNGLPSALTERLQRIERRYRLLV -----------------1111--------------------3333--------------- AGEMWCPDCQINLAALDFAQRLQPNIELAIISKGRAEDDLRQRLALERIAIPLVLVLDEE --1111----------------3333---------------1111------------111 FNLLGRFVERPQAVLDGGPQALAAYKAGDYLEHAIGDVLAIIEGAA 1----------------3333------------------------- >FERRITIN LIGHT CHAIN; SWP:NA; PDB:1Z6OA; ADTCYNDVALDCGITSNSLALPRCNAVYGEYGSHGNVATELQAYAKLHLERSYDYLLSAA ------------3333----1111---%%%%----------------------------- YFNNYQTNRAGFSKLFKKLSDEAWSKTIDIIKHVTKRGDKMNFDQHSTMKTERKNYTAEN ----------------------------------1111---1111--------------- HELEALAKALDTQKELAERAFYIHREATRNSQHLHDPEIAQYLEEEFIEDHAEKIRTLAG ------------------------------1111-------------------------- HTSDLKKFITANNGHDLSLALYVFDEYLQKTV ----------%%%%------------------ >FERRITIN LIGHT CHAIN; SWP:NA; PDB:1Z6OM; TQCNVNPVQIPKDWITMHRSCRNSMRQQIQMEVGASLQYLAMGAHFSKDVVNRPGFAQLF 
-----------3333--------------------------------1111--------- FDAASEEREHAMKLIEYLLMRGELTNDVSSLLQVRPPTRSSWKGGVEALEHALSMESDVT ------------------1111-----1111----------------------------- KSIRNVIKACEDDSEFNDYHLVDYLTGDFLEEQYKGQRDLAGKASTLKKLMDRHEALGEF ------------1111-------------------------------1111--------- IFDKKLLGIDV ---3333---- >MLC PROTEIN; SWP:P50456; PDB:1Z6RA; QIKQTNAGAVYRLIDQLGPVSRIDLSRLAQLAPASITKIVHELEAHLVQELGLVVETEAW --------------------3333--1111-3333-------1111-------------- HYLSLRISRGEIFLALRDLSSKLVVEESQELALKDDLPLLDRIISHIDQFFIRHQKKLER -------2222------3333--------------------------------3333--- LTSIAITLPGIIDTENGIVHRPFYEDVKEPLGEALEQHTGVPVYIQHDISAWTAEALFGA -----------------------1111-------------------------------11 SRGARDVIQVVIDHNVGAGVITDGHLLHAGSSSLVEIGHTQVDPYGKRCYCGNHGCLETI 11-----------------------2222------3333---1111--3333---3333- ASVDSILELAQLRLNQSSSLHGQPLTVDSLCQAALRGDLLAKDIITGVGAHVGRILAIVN ------------------------------------------------------------ LFNPQKILIGSPLSKAADILFPVISDSIRQQALPAYSQHISVESTQFSNQGTAGAALVKD --------------------------------3333----------------3333---- AYNGSLLIRLLQG --------1111- >NP95-LIKE RING FINGER PRO; SWP:Q96PU4; PDB:1Z6UA; EAFQLTPQQQHLIREDCQNQKLWDEVLSHLVEGPNFLKKLEQSFMCVCCQELVYQPVTTE ---------------3333---------3333---------1111-------------11 CFHNVCKDCLQRSFKAQVFSCPACRHDLGQNYIMIPNEILQTLLDLFFPGYSKGR 11--------------------------1111---------------22222222 >ADP-RIBOSYLATION FACTOR 4; SWP:P18085; PDB:1Z6XA; TISSLFSRLFGKKQMRILMVGLDAAGKTTILYKLKLGEIVTTIPTIGFNVETVEYKNICF 3333-3333----------------------3333---------2222------------ TVWDVGGQDRIRPLWKHYFQNTQGLIFVVDSNDRERIQEVADELQKMLLVDELRDAVLLL --------------------------------3333-------------3333------- FANKQDLPNAMAISEMTDKLGLQSLRNRTWYVQATCATQGTGLYEGLDWLSNELS ---1111----3333--11113333----------3333-----------3333- >SEPIAPTERIN REDUCTASE; SWP:P35270; PDB:1Z6ZA; HMLGRAVCLLTGASRGFGRTLAPLLASLLSPGSVLVLSARNDEALRQLEAELGAERSGLR -------------------------11112222-------------------3333---- 
VVRVPADLGAEAGLQQLLGALRELPRPKGLQRLLLINNAGSLGDVSKGFVDLSDSTQVNN ------1111-------------------------------------3333--------- YWALNLTSMLCLTSSVLKAFPDSPGLNRTVVNISSLCALQPFKGWALYCAGKAARDMLFQ ----------------------2222--------3333---2222--------------- VLALEEPNVRVLNYAPGPLDTDMQQLARETSVDPDMRKGLQELKAKGKLVDCKVSAQKLL -----1111--------------------------------------------------- SLLEKDEFKSGAHVDFYDK --------2222--1111- >Sulfatase-modifying facto; SWP:Q8NBK3; PDB:1Z70X; LAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGY ----------------------3333---------------------------------- LTEAEKFGDSFVFEGMLSVAAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAV ------------3333-----1111--22221111--1111-1111-------------- AYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGF --------------------iiii----1111---2222----------------1111- QGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYM ----1111---1111--------------------------------------------- HRSYYRYRCAARSQNTPDSSASNLGFRCAADRLP 3333---1111----1111--------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q97RS7; PDB:1Z72A; QDYAFQPGLTVGELLKSSQKDWQAAINHRFVKELFAGTIENKVLKDYLIQDYHFFDAFLS -------------3333------------------------------------------- LGACVAHADKLESKLRFAKQLGFLEADEDGYFQKAFKELKVAENDYLEVTLHPVTKAFQD -----------------------------------------3333--------------- LYSAVASSDYAHLLVLVIAEGLYLDWGSKDLALPEVYIHSEWINLHRGPFFAEWVQFLVD -------------------------1111------3333--------------------- ELNRVGKREDLTELQQRWNQAVALELAFFDIGY -------------------------33333333 >TRANSCRIPTIONAL REGULATOR; SWP:Q9X0C0; PDB:1Z77A; LSKRDAILKAAVEVFGKKGYDRATTDEIAEKAGVAKGLIFHYFKNKEELYYQAYSVTEKL -------------------3333-----------3333---------------------- QKEFENFLKNRNRDIFDFERWIEKKLEYSASHPEEADFLITLVSVDEGLRKRILLDLEKS ---------11113333----------------------3333---------------33 QRVFFDFVREKLKDLDLAEDVTEEIALKFLWFFSGFEEVYLRTYQGKPELLKRDNTLVEE 33---------1111--1111----------------------2222------------- VKVLRILKKGTK ------------ >CONSERVED HYPOTHETICAL PR; SWP:Q9I3J6; PDB:1Z7AA; 
DYPRDLIGYGNNPPHPHWPGDARIALSFVLNYEEGGERCVLHGDKESEAFLSEVAAQPLQ ------!!!!-------2222-----------2222--3333------------------ GVRHSESLYEYGSRAGVWRLLKLFKRRNVPLTVFAVAAAQRNPEVIRAVADGHEICSHGY -------------------------------------3333-------1111-------- RWIDYQYDEAQEREHLEAIRILTELTGQRPVGWYTGRTGPNTRRLVEEGGFLYDSDTYDD --------------------------------------11113333-------------- DLPYWDPASTAEKPHLVIPYTLDTNDRFTQVQGFNNGEQFFQYLKDAFDVLYEEGATAPK -----11113333----------------2222--------------------3333--- LSIGLHCRLIGRPARAALERFIQYAQSHDKVWFARREDIARHWHREHPFQ -----1111--3333---------1111---------------------- >CHORIONIC SOMATOMAMMOTROP; SWP:P01243; PDB:1Z7CA; QTVPLSRLFDHAMLQAHRAHQLAIDTYQEFEETYIPKDQKYSFLSFCFSDSIPTSNLELL ------------------------------------11113333--3333---------- RISLLLIESWLEPVRFLRSMFANNLVYDTSDSDDYHLLKDLEEGIQTLMGRLEQILKQTY -------11113333--1111--------------------------------------- SKFDTDALLKNYGLLYCFRKDMDKVETFLRMVQCRSVEGSC ------------------------------------2222- >ORNITHINE AMINOTRANSFERAS; SWP:Q7RT90; PDB:1Z7DA; KTPEDYINNELKYGAHNYDPIPVVLKRAKGVFVYDVNDKRYYDFLSAYSSVNQGHCHPNI ----------------------------!!!!--1111------%%%%--1111------ LNAMINQAKNLTICSRAFFSVPLGICERYLTNLLGYDKVLMMNTGAEANETAYKLCRKWG -------------------3333------------------------------------- YEVKKIPENMAKIVVCKNNQFSKVPYDDLEALEEELKDPNVCAFIVEPIQGEAGVIVPSD ------2222---------------------------1111---------3333------ NYLQGVYDICKKYNVLFVADEVQTGLGRTGKLLCVHHYNVKPDVILLGKALSGGHYPISA ------------------------iiii----3333-----------!!!!iiii----- VLANDDIMLVIKPGEHGSTYGGNPLAASICVEALNVLINEKLCENAEKLGGPFLENLKRE ---33333333------1111----------------1111------------------- LKDSKIVRDVRGKGLLCAIEFKNELVNVLDICLKLKENGLITRDVHDKTIRLTPPLCITK ---1111-----!!!!-----3333-------------------%%%%------1111-- EQLDECTEIIVKTVKFFD -------------3333- >TETANUS TOXIN LIGHT CHAIN; SWP:P04958; PDB:1Z7HA; PITINNFRYSDPVNNDTIIMMEPPYCKGLDIYYKAFKITDRIWIVPERYEFGTKPEDFNP -------1111-----------1111------------2222-----------3333--- 
PSSLIEGASEYYDPNYLRTDSDKDRFLQTMVKLFNRIKNNVAGEALLDKIINAIPYLGNS ------------1111-------------------------------------------- YSLLDKFDTNSNSVSFNLLEQDPSGATTKSAMLTNLIIFGPGPVLNKNEVRGIVLRVDNK --1111--------------------------------------------------%%%% NYFPCRDGFGSIMQMAFCPEYVPTFDNVIENITSLTIGKSKYFQDPALLLMHELIHVLHG ------------------------------------------------------------ LYGMQVSSHEIIPSKQEIYMQHTYPISAEELFTFGGQDANLISIDIKNDLYEKTLNDYKA --------------------------3333-----3333--------------------- IANKLSQVTSCNDPNIDIDSYKQIYQQKYQFDKDSNGQYIVNEDKFQILYNSIMYGFTEV ----1111----1111----------1111---1111----------------------- ELGKKFNIKTRLSYFSMNHDPVKIPNLLDDTIYNDTEGFNIESKDLKSEYKGQNMRVNTN ----------------------------3333----!!!!3333---%%%%------111 AFRNVD 1----- >Ovomucoid; SWP:P68390; PDB:1Z7KB; VPMDCSRYPNTTSEEGKVMILCNKALNPVCGTDGVTYDNECVLCAHNLEQGTSVGKKHDG ---3333-----1111--------------1111-------------------------- EC -- >UBIQUITIN-ACTIVATING ENZY; SWP:Q02053; PDB:1Z7LA; IPICTLKNFPNAIEHTLQWARDEFEGLFKQPAENVNQYLTDSKFVERTLRLAGTQPLEVL -3333-------------------------------1111-3333--1111--------- EAVQRSLVLQRPQTWGDCVTWACHHWHTQYCNNIRQLLHNFPPDQLTSSGAPFWSGPKRC -------1111--3333------------------------1111-1111----!!!!-- PHPLTFDVNNTLHLDYVMAAANLFAQTYGLTGSQDRAAVASLLQSVQVPEFTPKSGVKIH ------1111---------------1111------------3333--------------- VSDQELVDDSRLEELKATLPSPDKLPGFKMYPIDFEKDDDSNFHMDFIVAASNLRAENYD --------------------3333-----------3333--------------------- ISPADRHKSKLIAGK --------------- >ATP PHOSPHORIBOSYLTRANSFE; SWP:Q02147; PDB:1Z7MA; YLLPEESAEMTLNQVKSLRQIEGRLRKLFSLKNYQEVMPPSFEYTQLYTALESNGKTFNQ -----------------------------1111----------3333------------1 EKMFQFIKHEGQSITLRYDFTLPLVRLYSQIKDSTSARYSYFGKIFRKEKENYQIGIELF 111----1111---------------3333------------------------------ GESADKSELEILSLALQVIEQLGLNKTVFEIGSAKFFQRLCQLADGSTELLTELLLKKDL -------------------------------------------%%%%------------- SGLNAFIEKNNFSKELRGLLKEIFITNELSRLENLVTNTKDDVLISSFDQLKEFSEKLSM 
---------------------3333--------------------------------333 IKPIIIDLGMVPKMDYYTDLMFKAYSSAANQPILSGGRYDQLLSNFQEEAFAIGFCCHMD 3-----1111---1111--------1111---------33333333-------------- TILKALERQEL ------3333- >ATP phosphoribosyltransfe; SWP:Q02129; PDB:1Z7ME; MIKIAITKGRIQKQVTKLLENADYDVEPIRELQIKTKDDLQIIFGKPNDVITFLEHGIVD --------3333-----------------------1111--------------------- IGFVGKDTLDENDFDDYYELLYLKIGQCIFALASYPDFSNKNFQRHKRIASKYPRVTKKY ----3333--------------------------3333---------------------- FAQKQEDIEIIKLEGSVELGPVVGLADAIVDIVETGNTLSANGLEVIEKISDISTRMIVN -1111----------3333-----------------3333-------------------- KSSFKFKKDKIIEMVERLED ---------------1111- >GLUTAREDOXIN; SWP:Q5PSJ1; PDB:1Z7PA; MASKQELDAALKKAKELASSAPVVVFSKTYCGYCNRVKQLLTQVGASYKVVELDELSDGS ----------------1111-------------------------------3333----- QLQSALAHWTGRGTVPNVFIGGKQIGGCDTVVEKHQRNELLPLLQDAAATAKNPAQL -------------------%%%%--------------------------3333---- >HYPOTHETICAL PROTEIN EF06; SWP:Q838C3_ENTFA; PDB:1Z7UA; ATTDKQTSINLALSTINGKWKLSLDELFQGTKRNGELRALDGITQRVLTDRLREEKDGLV ------------1111-2222-----------3333---2222-----------1111-- HRESFNELPPRVEYTLTPEGYALYDALSSLCHWGETFAQKKARLN --------------------------------------------- >CYSTEINE SYNTHASE; SWP:P47998; PDB:1Z7WA; SRIAKDVTELIGNTPLVYLNNVAEGCVGRVAAKLEMMEPCSSVKDRIGFSMISDAEKKGL -----3333----------3333----------33331111------------------- IKPGESVLIEPTSGNTGVGLAFTAAAKGYKLIITMPASMSTERRIILLAFGVELVLTDPA -2222------------------------------33333333----1111------333 KGMKGAIAKAEEILAKTPNGYMLQQFENPANPKIHYETTGPEIWKGTGGKIDGFVSGIGT 3----------------------1111-------------------%%%%---------- GGTITGAGKYLKEQNANVKLYGVEPVESAILSGGKPGPHKIQGIGAGFIPSVLNVDLIDE --------------1111------11111111-----------------11113333--- VVQVSSDESIDMARQLALKEGLLVGISSGAAAAAAIKLAQRPENAGKLFVAIFPSFGERY ----------------------------------------3333-----------33331 LSTVLFDATRKEAEAMTFEA 1111111-----1111---- >Ribonuclease inhibitor; SWP:P13489; PDB:1Z7XW; 
SLDIQSLDIQCEELSDARWAELLPLLQQCQVVRLDDCGLTEARCKDISSALRVNPALAEL ----------------------3333-------------3333----------1111--- NLRSNELGDVGVHCVLQGLQTPSCKIQKLSLQNCCLTGAGCGVLSSTLRTLPTLQELHLS ----------------11111111------------3333------11113333------ DNLLGDAGLQLLCEGLLDPQCRLEKLQLEYCSLSAASCEPLASVLRAKPDFKELTVSNND -----------------3333------------3333----------1111--------- INEAGVRVLCQGLKDSPCQLEALKLESCGVTSDNCRDLCGIVASKASLRELALGSNKLGD ------------------------------3333----------3333------------ VGMAELCPGLLHPSSRLRTLWIWECGITAKGCGDLCRVLRAKESLKELSLAGNELGDEGA -----3333--3333--------------------------3333----2222------- RLLCETLLEPGCQLESLWVKSCSFTAACCSHFSSVLAQNRFLLELQISNNRLEDAGVREL -----1111---------------3333-------------------------------- CQGLGQPGSVLRVLWLADCDVSDSSCSSLAATLLANHSLRELDLSNNCLGDAGILQLVES -----2222------------3333----------------------------------3 VRQPGCLLEQLVLYDIYWSEEMEDRLQALEKDKPSLRVIS 333-----------------------------3333---- >CYCLOPHILIN; SWP:Q7RSH5; PDB:1Z81A; TIIPYYLSNLLTNPSNPVVFMDINLGNNFLGKFKFELFQNIVPKTSENFRQFCTGEYKVN -----3333---1111------------------------------------------%% NLPVGYKNTIFHRVIKEFMIQGGDFINHNGSGSLSIYGEKFDDENFDIKHDKEGLLSMAN %%------------2222------------------------------------------ SGPNTNGCQFFITTKKCEWLDGKNVVFGRIIDNDSLLLLKKIENVSVTPYIYKPKIPINV ----------------3333---------------------1111--------------- VECGEL ------ >GLYCEROL-3-PHOSPHATE DEHY; SWP:NA; PDB:1Z82A; ERFFVLGAGSWGTVFAQLHENGEEVILWARRKEIVDLINVSHTSPYVEESKITVRATNDL ------------------1111---------------------1111-----------33 EEIKKEDILVIAIPVQYIREHLLRLPVKPSVLNLSKGIEIKTGKRVSEIVEEILGCPYAV 33-1111------1111-3333-------------------------------------- LSGPSHAEEVAKKLPTAVTLAGENSKELQKRISTEYFRVYTCEDVVGVEIAGALKNVIAI ---------1111--------------------1111----------------------- AAGILDGFGGWDNAKAALETRGIYEIARFGFFGADQKTFGLAGIGDLVTCNSRYSRNRRF -----3333-------------------------3333-3333--------1111----- GELIARGFNPLKLLESSNQVVEGAFTVKAVKIAKENKIDPISEEVYRVVYEGKPPLQSRD 
---1111------3333---3333-----------------------------3333--- LR -- >galactose-1-phosphate uri; SWP:Q9FK51; PDB:1Z84A; SPELRKDPVTNRWVIFSPRPTDFKSKSPSSCPFCIGREQECAPELFRVPDHDPNWKLRVI ------3333--------1111--------1111--3333-------------------- ENLYPALSRNLETQSRTIVGFGFHDVVIESPVHSIQLSDIDPVGIGDILIAYKKRINQIA -------11113333----------------11113333--------------------- QHDSINYIQVFKNQGASAGASMSHSHSQMMALPVVPPTVSSRLDGTKDYFEETGKCCLCE -3333---------1111-------------------------------------33333 AKSKHFVIDESSHFVSVAPFAATYPFEIWIIPKDHSSHFHHLDDVKAVDLGGLLKLMLQK 333--------------------2222----------1111------------------- IAKQLNDPPYNYMIHTSPLKVTESQLPYTHWFLQIVPQLSGVGGFEIGTGCYINPVFPED ---------------------33331111------------------------------- VAKVMREVSLT ----1111--- >HYPOTHETICAL PROTEIN TM13; SWP:Q9X1A0; PDB:1Z85A; PHLFYGTAQNGEVIFDEREAHHRVVRLKEGDVIEATDGNGFSYTCILKSLKKKTAAAKIV --------iiii----------1111-2222-------------------1111------ KVEEKEKEPTEKLSVVVPIGRWERTRFLIEKCVELGVDEIFFHKFERSQHEISLDKAKIV --------------------3333--------1111--------1111------------ VREAAKQCKRYLFPKVSFLEKLEFSGNVITLDLQNLLDANLEGSITVVVGPEGGFSEKER ----------------------------------3333------------1111------ ELLRSSTTIVLRFETAAILTVGYIALKKQKI ------------------------------- >SERINE PROTEASE HEPSIN; SWP:P05981; PDB:1Z8GA; EPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHSELDVRT ------------------------------1111----------------------3333 AGAAGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPIVGGRDTSL ---------------3333--3333------1111-----------------------22 GRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVFAGAVAQASPHGLQ 221111----iiii---------------1111-1111-3333--------1111----- LGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPVCLPAAGQALVDGKIC --------111133331111------------------1111------2222--2222-- TVTGWGNTQYYGQQAGVLQEARVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQG --------1111--------------3333--3333-----2222----1111----222 DSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEAS 
2---------1111-----------------2222-----3333---------------- GMVTQL ------ >CONSERVED HYPOTHETICAL PR; SWP:O25554; PDB:1Z8MA; MLKLNLKKSFQKDFDKLLLNGFDDSVLNEVILTLRKKEPLDPQFQDHALKGKWKPFRECH ----------------1111--33333333----------1111---------------- IKPDVLLVYLVKDDELILLRLGSHSELF -----------2222-------1111-- >6-DEOXYERYTHRONOLIDE B HY; SWP:Q00441; PDB:1Z8OA; TVPDLESDSFHVDWYRTYAELRETAPVTPVRFLGQDAWLVTGYDEAKAALSDLRLSSDPK ---11111111--------------------%%%%----------------3333--111 KKYPGVEVEFPAYLGFPEDVRNYFATNMGTSDPPTHTRLRKLVSQEFTVRRVEAMRPRVE 1-2222---3333-------------3333------------------------------ QITAELLDEVGDSGVVDIVDRFAHPLPIKVICELLGVDEKYRGEFGRWSSEILVMDPERA ------1111--------1111---------------3333--------------3333- EQRGQAAREVVNFILDLVERRRTEPGDDLLSALIRVQDDDDGRLSADELTSIALVLLLAG ------------------------------------------------------------ FEASVSLIGIGTYLLLTHPDQLALVRRDPSALPNAVEEILRYIAPPETTTRFAAEEVEIG ---------------------------1111---------------------------ii GVAIPQYSTVLVANGAANRDPKQFPDPHRFDVTRDTRGHLSFGQGIHFCMGRPLAKLEGE ii--2222-----------3333--1111-1111-22221111-11111111-------- VALRALFGRFPALSLGIDADDVVWRRSLLLRGIDHLPVRLDG ---------1111----3333-----------------1111 >COXSACKIEVIRUS B4 POLYPRO; SWP:P08292; PDB:1Z8RA; GPYGHQSGAVYVGNYKVVNRHLATHVDWQNCVWEDYNRDLLVSTTTAHGCDTIARCQCTT -----------!!!!---1111-------------1111--------------------- GVYFCASKSKHYPVSFEGPGLVEVQESEYYPKRYQSHVLLATGFSEPGDAGGILRCEHGV -----3333--------------------------------------------------- IGLVTMGGEGVVGFADVRDLLWLEDDAMEQ ----------------1111---------- >DNA PRIMASE; SWP:Q9X4D0; PDB:1Z8SA; MAKKLLPAFQNAERLLLAHMMRSRDVALVVQERIGGRFNIEEHRALAAYIYAFYEEGHEA -----------------------------------------3333------3333----- DPGALISRIPGELQPLASELSLLLIADDVSEQELEDYIRHVLNRPKWLMLKVKEQEKTEA 3333------3333-----------1111----------------3333--------333 ERRKDFLTAARIAKEMIEMKKMLSSS 3------------------------- >ALPHA-HEMOGLOBIN STABILIZ; SWP:Q9NZD4; PDB:1Z8UA; ALLKANKDLISAGLKEFSVLLNQQVFNDALVSEEDMVTVVEDWMNFYINYYRQQVTGEPQ 
--------------------11113333-----------------------------333 ERDKALQELRQELNTLANPFLAKYRDFLKS 3----------------------------- ------------------------------------ >Organic hydroperoxide res; SWP:O34777; PDB:1Z91A; MKLENQLSFLLYASSREMTKQYKPLLDKLNITYPQYLALLLLWEHETLTVKKMGEQLYLD -1111-----------------33331111-----------------------------3 SGTLTPMLKRMEQQGLITRKRSEEDERSVLISLTEDGALLKEKAVDIPGTILGLSKQSGE 333------------------3333-------------3333-1111---1111---!!! DLKQLKSALYTLLETLH !---------------- >Interleukin-2 receptor al; SWP:P01589; PDB:1Z92B; PELCDDDPPEIPHATFKAMAYKEGTMLNCECKRGFRRIKSGSLYMLCTSSWDNQCQCTSS ----------2222-----------------2222--2222------------------- CREPPPWENEATERIYHFVVGQMVYYQCVQGYRALHRGPAESVCKMTHGKTRWTQPQLIC ------------------2222-------------------------------------- TG -- >CONSERVED HYPOTHETICAL PR; SWP:Q7NY36; PDB:1Z94A; PNTIRLHRVLSAPPERVYRAFLDPLALAKWLPPEGFVCKVLEHDARVGGAYKEFLAFASG ------------3333-3333--3333-----2222---------2222----------- QKHAFGGRYLELVPGERIRYTDRFDDAGDITTITLAPLSCGADLSIVQEGIPDAIPPENC ------------2222---------------------1111----------33333333- YLGWQQSLKQLAALVEPD ------------------ >UBA-DOMAIN PROTEIN MUD1; SWP:Q10256; PDB:1Z96A; GLNSKIAQLVSMGFDPLEAAQALDAANGDLDVAASFLL ---------1111------------iiii--------- >CARBONIC ANHYDRASE III; SWP:P07451; PDB:1Z97A; EWGYASHNGPDHWHELFPNAKGENQSPIELHTKDIRHDPSLQPWSVSYDGGSAKTILNNG ----1111333311113333----------3333---3333-------1111-------- KTCRVVFDDTYDRSMLRGGPLPGPYRLRQFHLHWGSSDDHGSEHTVDGVKYAAELHLVHW -----------------!!!!---------------1111-----iiii----------- NPKYNTFKEALKQRDGIAVIGIFLKIGHENGEFQIFLDALDKIKTKGKEAPFTKFDPSSL -11113333--------------------3333-----3333--2222---------111 FPASRDYWTYQGSLTTPPCEECIVWLLLKEPMTVSSDQMAKLRSLLSSAENEPPVPLVSN 1---------------------------------3333---------------------- WRPPQPINNRVVRASFKHHHHHH ------!!!!--------1111- >TRANSLATION INITIATION FA; SWP:P04766; PDB:1Z9BA; NEFELGTRGSSRVDLQEQRSVKTRVSLDDLFEQIKQGEMKELNLIVKADVQGSVEALVAA 
-----------------3333----------------------------1111------- LQKIDVEGVRVKIIHAAVGAITESDISLATASNAIVIGFNVRPDANAKRAAESEKVDIRL 1111-----------------3333--1111----------------------------- HRIIYNVIEEIEAAM --------------- >URIDYLATE KINASE; SWP:P65938; PDB:1Z9DA; EPKYQRILIKLSGEALAGEKGVGIDIPTVQAIAKEIAEVHVSGVQIALVIGGGNLWRGEP -----------3333-----------------------3333---------3333----- AADAGDRVQADYTGLGTVNALVADSLQHYGVDTRVQTAIPQNVAEPYIRGRALRHLEKNR -------------------------------------------------------1111- IVVFGAGIGSPYFSTDTTAALRAAEIEADAILAKNGVDGVYNADPKKDANAVKFDELTHG -------------------------------------------1111------------- EVIKRGLKIDATASTLSDNDIDLVVFNNEAGNIQRVVFGEHIGTTVSNK ----------------------------2222---1111---------- ------------------------------------------------------------ ----------------------------- >MEMBRANE-ASSOCIATED PROST; SWP:Q9N0A4; PDB:1Z9HA; LQLTLYQYKTCPFCSKVRAFLDFHALPYQVVEVNPVLRAEIKFSSYRKVPILVAQEGESS -------1111----------1111--------33331111--------------!!!!- QQLNDSSVIISALKTYLVSGQPLEEIITYYPAMKAVNDQGKEVTEFGNKYWLMLNEKEAQ ----3333-------------33333333-------1111-------1111--------- QVYSGKEARTEEMKWRQWADDWLVHLISPNVYRTPTEALASFDYIVREGKFGAVEGAVAK ----3333-------------3333--3333--------------------3333----- YMGAAAMYLISKRLKSRHRLQDNVREDLYEAADKWVAAVGKDRPFMGGQKPNLADLAVYG ---------------1111-------------------------1111---3333----- VLRVMEGLDAFDDLMQHTHIQPWYLRVERAITEA --1111---------------------------- >Vesicle-associated membra; SWP:Q9Z270; PDB:1Z9LA; HAKHEQILVLDPPSDLKFKGPFTDVVTTNLKLQNPSDRKVCFKVKTTAPRRYCVRPNSGV -----------------------------------------------3333--------- IDPGSIVTVSVLQPFDYDPNEKSKHKFVQTIFAPPNISDEAVWKEAKPDELDSKLRCVFE -2222------------1111------------1111--3333---1111---------- >GAPA225; SWP:Q8N126; PDB:1Z9MA; QDDSQPWTSDETVVAGGTVVLKCQVKDHEDSSLQWSNPAQQTLYFGEKRALRDNRIQLVT -1111--------2222----------%%%%-----1111----!!!!----3333---- STPHELSISISNVALADEGEYTCSIFTMPVRTAKSLVTVLGIPQ -1111--------3333--------------------------- >SUPEROXIDE DISMUTASE [CU-; 
SWP:Q59452; PDB:1Z9NA; EKIVVPVQQLDPQNGNKDVGTVEITESAYGLVFTPKLHDLAHGLHGFHIHEKPSCEPKEK -----------------------------------------------------------% DGKLVAGLGAGGHWDPKQTQKHGYPWSDDAHMGDLPALFVMHDGSATTPVLAPRLKKLAE %%%-----------1111-----1111---1111------1111-------1111-3333 VKGHSLMIHAGGDNHSDHPAPLGGGGPRMACGVIK --------------------iiii----------- >HYPOTHETICAL UPF0124 PROT; SWP:P33644; PDB:1Z9TA; SKLIVPQWPQPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEENRKRLFAAGNLP ----------1111---------------------------------------------- SKPVWLEQVHGKDVLKLTGYASKRADASYSNTPGTVCAVTADCLPVLFCNRAGTEVAAAH -------------------------------2222--------------3333------- AGWRGLCAGVLEETVSCFADNPENILAWLGPAIGPRAFEVGGEVREAFAVDAKASAAFIQ --------------3333--1111---------1111---------------3333---- HGDKYLADIYQLARQRLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRASFIWLI !!!!-------------1111---------3333------3333------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:1Z9VA; HMTFCLETYLQQSGEYEIHMKRAGFRECAAMIEKKARRVVHIKPGEKILGARIIGIPPVP ------------------------3333-------------------------------- IGIDEERSTVMIPYTKPCYGTAVVELPVDPEEIERILEVAEP ------------------------------------------ >CYTOSKELETON ASSEMBLY CON; SWP:P32790; PDB:1Z9ZA; GMERGIVQYDFMAESQDELTIKSGDKVYILDDKKSKDWWMCQLVDSGKSGLVPAQFIEPV --------------1111---2222---------------------------3333---- >POSSIBLE ACYL-[ACYL-CARRI; SWP:O53442; PDB:1ZA0A; DALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGENFAFLGGRDWDPSQSTLPRTITD ------------------1111---3333--3333---3333-----3333--------- ACEILLILKDNDWWGRWLGRWTAEEHLHAIALREYLVVTREVDPVANEDVRVKYTQVETL ------------------------------------1111--3333--3333-------- VYMAFYERCGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLVTHCLDYTRDETI -----------------1111----------------------------------3333- AAIAARAADLDVLGADIEAYRDKLQNVADAGIFGKPQLRQLISDRITAWGLAGEPSLKQF -----------2222-2222-------1111---------------11111111--3333 VT -- >IGG LIGHT CHAIN; SWP:NA; PDB:1ZA6B; QVQLVQSGAEVVKPGASVKISCKASGYTFTDHAIHWVKQNPGQRLEWIGYFSPGNDDFKY 
------------2222-------------------------------------------- NERFKGKATLTADTSASTAYVELSSLRSEDTAVYFCTRSLNMAYWGQGTLVTVSSASTKG 3333---------3333---------3333------------------------------ PSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSL ---------2222----------------------------2222-------3333---- SSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCQPREPQVYTLPPSRDELTKNQV ---------------------3333-------------------------3333------ SLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVF -----------------------------------1111----------33331111--- SCSVMHEALHNHYTQKSLSLSPGK -----1111--------------- >COAT PROTEIN; SWP:P03601; PDB:1ZA7A; QGRAIKAWTGYSVSKWTASCAAAEAKVTSAITISLPNELSSERNKQLKVGRVLLWLGLLP -------2222------------2222--------3333-3333--------------11 SVSGTVKSCVTETQTTAAASFQVALAVADNSKDVVAAMYPEAFKGITLEQLAADLTIYLY 11-------------3333---------1111--------1111---------------- SSAALTEGDVIVHLEVEHVRPTFDDSFTPVY -----2222-----------3333------- >VHL-1; SWP:NA; PDB:1ZA8A; CGESCAMISFCFTEVIGCSCKNKVCYLNSIS ----------1111------iiii--iiii- >FRUCTOSE-BISPHOSPHATE ALD; SWP:P00883; PDB:1ZAIA; PHSHPALTPEQKKELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQ -----------------------2222--------------------------------- LLLTADDRVNPCIGGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNG -111133331111-----3333----1111-3333--1111-------------2222-- ETTTQGLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQN ------2222-------1111------------1111--------------------111 GIVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACT 1-----------------------------------1111-3333----------1111- QKYSHEEIAMATVTALRRTVPPAVTGVTFLSGGQSEEEASINLNAINKCPLLKPWALTFS --------------------3333------iiii-------------------------- YGRALQASALKAWGGKKENLKAAQEEYVKRALANSLACQGKYTPSGQAGAAASESLFISN -3333-------iiii1111----------------1111------------------33 HAY 33- >ADENYLATE KINASE; SWP:P43188; PDB:1ZAKA; ADPLKVMISGAPASGKGTQCELIKTKYQLAHISAGDLLRAEIAAGSENGKRAKEFMEKGQ ----------2222---------------------------------------------- 
LVPDEIVVNMVKERLRQPDAQENGWLLDGYPRSYSQAMALETLEIRPDTFILLDVPDELL --3333-----3333--3333-------------------1111-----------3333- VERVVGRRLDPVTGKIYHLKYSPPENEEIASRLTQRFDDTEEKVKLRLETYYQNIESLLS --3333--------------------3333---------3333--------------333 TYENIIVKVQGDATVDAVFAKIDELLGSILEKKNEMVSST 3--------------------------------------- >Ig heavy chain Mem5 [Frag; SWP:P84751; PDB:1ZANH; QVQLKESGPGLVQPSQTLSLTCTVSGFSLTNNNVNWVRQATGRGLEWMGGVWAGGATDYN ---------------------------3333--------2222--------1111----- SALKSRLTITRDTSKSQVFLKMHSL 1111--------1111--------- >LOC500183 protein; SWP:Q4KM66; PDB:1ZANL; DIQMTQSPASLSASLGETVTIECRASEDIYNALAWYQQKPGKSPQLLIYNTDTLHTGVPS -------------2222-----------%%%%------2222------------222233 RFSGSGSGTQYSLKINSLQSEDVASYFCQHYFGYPRTFGGGTKLELKRADAAPTVSIFPP 33----!!!!--------3333-------------------------------------- SSEQLASGGASVVCLLNNFYPKDISVKWKIDGSERQNGVLDSVTDQDSKDSTYSMSSTLT 33333333---------------------iiii--------------------------- LTKAEYESHNSYTCEVTHKTSTSPVVKSFNRGE -----1111--------3333--------1111 >RIO2 KINASE; SWP:O30245; PDB:1ZARA; NIAELYGKMGKHSWRIMDAIFKNLWDYEYVPLQLISSHARIGEEKARNILKYLSDLRVVQ -----1111----------1111------------------------------1111--- NRQKDYEGSTFTFIGLSLYSLHRLVRSGKVDAIGKLMGEGKESAVFNCYSEKFGECVVKF ------------------------1111-----------1111----------------- HKVKVKEHFSVLAIRSARNEFRALQKLQGLAVPKVYAWEGNAVLMELIDAKELYRVRVEN ------------------------1111----------!!!!---------3333----- PDEVLDMILEEVAKFYHRGIVHGDLSQYNVLVSEEGIWIIDFPQSVEVGEEGWREILERD -------------------------1111---1111-----1111-2222---------- VRNIITYFSRTYRTEKDINSAIDRILQ --------------------------- >L,D-TRANSPEPTIDASE; SWP:Q3Y185; PDB:1ZATA; KEQLASMNAIANVKATYSINGETFQIPSSDIMSWLTYNDGKVDLDTEQVRQYVTDLGTKY ---------1111-----iiii----3333-------%%%%------------------- NTSTNDTKFKSTKRGEVTVPVGTYSWTIQTDSETEALKKAILAGQDFTRSPIVQGGTTAD 1111-------------------------------------------------------- HPLIEDTYIEVDLENQHMWYYKDGKVALETDIVSGKPTTPTPAGVFYVWNKEEDATLKGT 
------------1111-----iiii----------3333--------------------- NGTPYESPVNYWMPIDWTGVGIHDSDWQPEYGGDLWKTRGSHGCINTPPSVMKELFGMVE ------------------------1111---!!!!------------------------2 KGTPVLVF 222----- >DNA LIGASE; SWP:P63973; PDB:1ZAUA; QTAPEVLRQWQALAEEVREHQFRYYVRDAPIISDAEFDELLRRLEALEEQHPELRTPDSP -3333---------------------------------------1111-------1111- TQLVGGAGFATDFEPVDHLERMLSLDNAFTADELAAWAGRIHAEVGDAAHYLCELKIDGV -----------------------------3333-----------%%%%------------ ALSLVYREGRLTRASTRGDGRTGEDVTLNARTIADVPERLTPGDDYPVPEVLEVRGEVFF ----------------!!!!---------------------------------------- RLDDFQALNASLVEEGKAPFANPRNSAAGSLRQKDPAVTARRRLRMICHGLGHVEGFRPA ---------------------3333----------------------------------- TLHQAYLALRAWGLPVSEHTTLATDLAGVRERIDYWGEHRHEVDHEIDGVVVKVDEVALQ ---------1111------------------------------------------33331 RRLGSTSRAPRWAIAYKYPPE 111------------------ >50S RIBOSOMAL PROTEIN L10; SWP:P29394; PDB:1ZAVA; VMLTRQQKELIVKEMSEIFKKTSLILFADFLGFTVADLTELRSRLREKYGDGARFRVVKN -----------------1111--------2222---------------!!!!-------- TLLNLALKNAEYEGYEEFLKGPTAVLYVTEGDPVEAVKIIYNFYKDKKADLSRLKGGFLE -------1111---3333--------------3333----------------------ii GKKFTAEEVENIAKLPSKEELYAMLVGRVKAPITGLVFALSGILRNLVYVLNAIKEKK ii--3333---1111------------------------------------------- >BRO1 PROTEIN; SWP:P48582; PDB:1ZB1A; MKPYLFDLKLKDTEKLDWKKGLSSYLKKSYGSSQWRTFYDEKATSELDHLRNNANGELAP ------------------------------11113333-------------1111----- SSLSEQNLKYYSFLEHLYFRLGSKGSRLKMDFTWYDAEYSSAQKGLKYTQHTLAFEKSCT ---------------------------------------------------3333----- LFNIAVIFTQIARENINEDYKNSIANLTKAFSCFEYLSENFLNSPSVDLQSENTRFLANI ------------1111-----------------------------3333----------- CHAEAQELFVLKLLNDQISSKQYTLISKLSRATCNLFQKCHDFMKEIDDDVAIYGEPKWK -----------------1111---------------------------1111---3333- TTVTCKLHFYKSLSAYYHGLHLEEENRVGEAIAFLDFSMQQLISSLPFKTWLVEFIDFDG ----------------------1111------------------3333---1111----- 
FKETLEKKQKELIKDNDFIYHESVPAVVQVDSIKALDAIKSPTWEKILEPYMQDVANKYD ----------------------------1111----------3333-3333--------- SLYRGII ------- >NEUROTOXIN; SWP:Q60393; PDB:1ZB7A; PVNIKNFNYNDPINNDDIIMMEPFNDPGPGTYYKAFRIIDRIWIVPERFTYKDVYEYYDP -------1111---------------------------2222----------------11 TYLKTDAEKDKFLKTMIKLFNRINSKPSGQRLLDMIVDAIPYLGNASTPPDKFAANVANV 11------------------------------------------11111111----1111 SINKKIIQPGAEDQIKGLMTNLIIFGPGPVLSDNFTDSMIMNGHSPISEGFGARMMIRFC --------------------------------------------3333-----------1 PSCLNVFNNVQENKIFSRRAYFADPALTLMHELIHVLHGLYGIKISNLPITPFMQHSDPV 111--------------------------------------------------------- QAEELYTFGGHDPSVISPSTDMNIYNKALQNFQDIANRLNIVSSAQGSGIDISLYKQIYK -----------1111--------------------------------------------- NKYDFVEDPNGKYSVDKDKFDKLYKALMFGFTETNLAGEYGIKTRYSYFSEYLPPIKTEK -------1111-----------------------------------1111---------1 LLDNTIYTQNEGFNIASKNLKTEFNGQNKAVNKEAYEEISLEHLVIYRIAMCKP 111-------!!!!3333--2222---------------3333----------- >ORGANIC HYDROPEROXIDE RES; SWP:Q9PCF4; PDB:1ZB9A; NSLEKVLYTAIVTATGGRDGSVVSSDNVLNVKLSVPQGLGGPGGSGTNPEQLFAAGYSAF -----------------------1111--------3333--------------------- IGALKFVANKEKVDLPAEPRVEGRVGIGEIPGGFGLVVELRIAVSGMERSMLQTLVDKAH -------------------------------------------2222------------- RVCPYSNATRGNIDVVLILID --------2222--------- >Genome polyprotein; SWP:Q84769; PDB:1ZBA1; TTTTGESADPVTTTVENYGGDTQVQRRHHTDVGFIMDRFVKINSLSPTHVIDLMQTHKHG ---3333-------3333--------3333-3333----------------1111-1111 IVGALLRAATYYFSDLEIVVRHDGNLTWVPNGAPEAALSNTSNPTAYNKAPFTRLALPYT -----1111--------------------22223333--1111----------------- APHRVLATVYDGTNKYSTQLPASFNYGAIQAQAIHELLVRMKRAELYCPRPLLAIKVTSQ --------------------3333------------------------------------ DRYKQKIIAPA ----------- >Genome polyprotein; SWP:Q84769; PDB:1ZBA2; DRLLTTRNGHTTSTTQSSVGVTYGYSTEEDHVAGPNTSGLETRVVQAERFFKKFLFDWTT -------!!!!----------------------3333------3333-----------11 
DKPFGYLTKLELPTDHHGVFGHLVDSYAYMRNGWDVEVSAVGNQFNGGCLLVAMVPEWKA 112222------------------------------------1111-------------- FDTREKYQLTLFPHQFISPRTNMTAHITVPYLGVNRYDQYKKHKPWTLVVMVLSPLTVSN -3333--1111------3333-----------------3333----------------11 TAAPQIKVYANIAPTYVHVAGELPSKE 11------------------------- >Genome polyprotein; SWP:Q84769; PDB:1ZBA3; GIFPVACADGYGGLVTTDPKTADPVYGKVYNPPKTNYPGRFTNLLDVAEACPTFLRFDDG -------2222---1111------------------------3333----------!!!! KPYVVTRADDTRLLAKFDVSLAAKHMSNTYLSGIAQYYTQYSGTINLHFMFTGSTDSKAR -------------------11111111-------1111---------------1111--- YMVAYIPPGVETPPDTPEEAAHCIHAEWDTGLNSKFTFSIPYVSAADYAYTASDTAETTN ---------------33331111------------------------------1111--1 VQGWVCVYQITHGKAENDTLLVSASAGKDFELRLPIDPRTQ 111-----------2222--------1111----------- >Rabphilin-3A; SWP:P47709; PDB:1ZBDB; EELTDEEKEIINRVIARAEKETEQERIGRLVDRLETRKNVAGDGVNRCILCGEQLGLGSA -----3333---------------------------1111---------------iiii- SVVCEDCKKNVCTKCGVETSNNRPHPVWLCKICLEQREVWKRSGAWFFKGFPKQVLPQP -----------3333-------------------------11113333----------- >RIBONUCLEASE H-RELATED PR; SWP:Q9KEI9; PDB:1ZBFA; EIIWESLSVDVGSQGNPGIVEYKGVDTKTGEVLFEREPIPIGTNNMGEFLAIVHGLRYLK -----------------------------------------------------------1 ERNSRKPIYSNSQTAIKWVKDKKAKSTLVRNEETALIWKLVDEAEEWLNTHTYETPILKW 111---------------------------3333-------------------------- QTDKWGEIKADY 3333---1111- >HYPOTHETICAL PROTEIN AF17; SWP:NA; PDB:1ZBMA; SHKIRVAHTPDADDAFFYATHGKVDTWLEIEHVIEDIETLNRKAFNAEYEVTAISAHAYA ----------3333---------------------3333----1111---------3333 LLDDKYRILSAGASVGDGYGPVVVAKSEISLDGKRIAVPGRYTTANLLLKLAVEDFEPVE ------------------------------2222-----1111----------------- PFDRIIQAVLDEEVDAGLLIHEGQITYADYGLKCVLDLWDWWSEQVKLPLPLGLNAIRRD 1111----------------3333---1111----------3333------------111 LSVEVQEEFLRARESIAFAIENPDEAIEYAKYSRGLDRERAKRFAYVNDYTYNPESVDAA 1-------------------------------iiii-----------3333--------- LKKLYEAEAKGLI -------1111-- >HYPOTHETICAL PROTEIN BPP1; SWP:Q7WJT0; 
PDB:1ZBOA; AEIPLFPLSNALFPAGVLRLRVFEIRYLDVRRCIADGSEFGVVVLEQGTEVRRPDGREVL ------------2222-------3333--------------------------------- ARAGTARIDHWEAPPALLELACTGTGRFRLHACTQGKYGLWTGQAEPVPDDAPLEVPPEL ------------------------------------iiii----------------3333 ARSASALGRLIARLQREGVPPHIPAAPFRLDDCGWVADRWAELSLPPADKARLLLLPPLD -------------------1111--------3333----------3333----------- RLREIDAVLAA ----------- >HYPOTHETICAL PROTEIN VPA1; SWP:Q87HD3; PDB:1ZBPA; TQWKNALSEGQLQQALELLIEAIKASPKDASLRSSFIELLCIDGDFERADEQLQSIKLFP -3333--------------------1111-----------3333--------------33 EYLPGASQLRHLVKAAQARKDFAQGAATAKVLGENEELTKSLVSFNLSVSQDYEQVSELA 33--------------------------------3333---------------------- LQIEELRQEKGFLANDTSFSDVRDIDDRLGGYIELFSTAGNYFLVPIASINTLEIKSATS -------------iiii-------------------3333-----3333----------- LLESVWRPVEFDIDGLGEGEGHPTYVDSESDAQKLGRETDWKQIADKEVYLGLGLKCWLV 3333--------2222--------1111-------------------------------- GEALPISDLQNLQVIKELALE ----3333------------- >17-BETA-HYDROXYSTEROID DE; SWP:P51659; PDB:1ZBQA; SPLRFDGRVVLVTGAGAGLGRAYALAFAERGALVVVNDLGGDFKGVGKGSLAADKVVEEI ----2222-------------------1111----------1111--------------- RRRGGKAVANYDSVEEGEKVVKTALDAFGRIDVVVNNAGILRDRSFARISDEDWDIIHRV 1111--------3333----------------------------3333------------ HLRGSFQVTRAAWEHMKKQKYGRIIMTSSASGIYGNFGQANYSAAKLGLLGLANSLAIEG ----------------------------3333---2222--------------------3 RKSNIHCNTIAPNAGSRMTQTVMPEDLVEALKPEYVAPLVLWLCHESCEENGGLFEVGAG 333------------3333------------3333---------1111------------ WIGKLRWERTLGAIVRQKNHPMTPEAVKANWKKICDFENASKPQSIQESTGSIIEVLSKI ----------------2222--3333------------------3333------------ DS -- >CONSERVED HYPOTHETICAL PR; SWP:Q7MXM8; PDB:1ZBRA; KRLFLPEWAPQEAVQLTWPHDRTDWAYLDEVETCFVRIATAILRHERLIVVCPDRKRVFG -----3333----------1111----3333----------------------3333--- LLPPELHHRLYCFELPSNDTWARDHGGISLLADGRPIADFAFNGWGKFAAHHDNLITRRL --33331111----------3333-------iiii----------------1111----- 
HALGLFAEGVTLDNRLAFVLEGGALETDGEGTLLTTDSCLFEPNRNAGLSRTAIIDTLKE ------2222----1111---1111----------3333--11113333----------- SLGVSRVLSLRHGALAGDDTDGHIDTLARFVDTRTIVYVRSEDPSDEHYSDLTAEQELKE ----------------------3333----------------1111-3333--------- LRRPDGQPYRLVPLPAEALYDGADRLPATYANFLIINGAVLVPTYDSHLDAVALSVQGLF --1111-----------------------------2222-------1111-3333-3333 PDREVIGIDCRPLVKQHGSLHCVTQYPQGFIR ----------3333-----3333---2222-- >HYPOTHETICAL PROTEIN PG11; SWP:Q7MVG4; PDB:1ZBSA; ILIGDSGSTKTDWCIAKEGKSLGRFQTSGINPFQQDRNEIDTALRSEVLPAIGQKASSIR ------------------------------------1111-------3333---1111-- AVYFYGAGCTPAKAPLNEALDSLPHCDRIEVAGDLGAARALCGDSEGIACILGTGSNSCL -----2222-------------1111---------------------------------- FDGREIKANVSPLGYILGDEGSGAVLGRLFIGSLLKGQPEGLCEAFLQEYGLTSADIIES ------------------2222-----------1111-2222------------------ VYRKPFPNRFLAGFSPFIAQHLDIPAVYSLVQNSFDDFLVRNVLRYNRPDLPLHFIGSVA -------------3333---33333333--------------3333-------------- FHYREVLSSVIKKRGLTLGSVLQSPEGLIQYHHNNHV 1111-------1111---------------------- >PEPTIDE CHAIN RELEASE FAC; SWP:Q8DU64; PDB:1ZBTA; HNIYDQLQAVEDRYEELREEANSRETVAVYREYKQVVQNIADAQEPELEEAKEELKNSKV ------------------3333-----------------------3333----------- AKEEYEEKLRFLLLPKDPNDDKNIILEIRGAAGGDEAALFAGDLLNYQKYAENQGWKFEV --------1111----1111-------------3333----------------------- EASANGVGGLKEVVAVSGQSVYSKLKYESGAHRVQRVPVTESQGRVHTSTATVLVPEVEE -------------------3333-1111---------3333-----------------33 VEYEIDPKDLRVDIYHAKVATAVRIIHLPTNIKVEQEERTQQKNRDKAKIIRARVADHFA 33---3333------------------1111----------------------------- QIAQDEQDATVGTGDRSERIRTYNFPQNRVTDHRIGLTLQKLDSILSGKLDEVIDALILY ----1111----------------1111----------------1111------------ DQTQKLEELN ---------- >Regulatory protein SIR1; SWP:P21691; PDB:1ZBXB; TEEEYVSPRFLVADGFLIDLAEEKPINPKDPRLLTLLKDHQRAMIDQMNLVKWNDFKKYQ ------3333--iiii---1111---11113333---3333----------33333333- DPIPLKAKTLFKFCKQIKKKFLRGADFKLHTLPMTVLCSCVPILLDDQTVQYLYDDSLEH 
------3333----------------------------------3333------1111-- >Tyrosine-protein phosphat; SWP:P35236; PDB:1ZC0A; TPREVTLHFLRTAGHPLTRWALQRQPPSPKQLEEEFLKIPSNFVSPEDLDIPGHASKDRY -----------------------------------3333-----3333--22221111-1 KTILPNPQSRVCLGRAQSQEDGDYINANYIRGYDGKEKVYIATQGPMPNTVSDFWEMVWQ 111--3333-----1111-------------2222-----------1111---------- EEVSLIVMLTQLRECVHYWPTEEETYGPFQIRIQDMKECPEYTVRQLTIQYQEERRSVKH ----------1111-----------!!!!---------------------!!!!------ ILFSAWPDHQTPESAGPLLRLVAEVEESPETAAHPGPIVVHCSAGIGRTGCFIATRIGCQ ------2222-------------------------------------------------- QLKARGEVDILGIVCQLRLDRGGMIQTAEQYQFLHHTLALYAGQLP --------------------2222-----------------1111- >UBIQUITIN FUSION DEGRADAT; SWP:P53044; PDB:1ZC1A; MFSGFSSFGGGNGFVNMPQTFEEFFRCYPIAMMNDRIRKDDANFGGKIFLPPSALSKLSM ---------3333---------------3333-3333----------------------- LNIRYPMLFKLTANETGRVTHGGVLEFIAEEGRVYLPQWMMETLGIQPGSLLQISSTDVP -------------1111-----------3333--------------2222---------- LGQFVKLEPQSVDFLDISDPKAVLENVLRNFSTLTVDDVIEISYNGKTFKIKILEVKPES ------------------3333------------3333-----iiii------------3 SSKSICVIETDLVTDFAPPVGYVEPDYK 333------------------------- >Exocyst complex component; SWP:O54924; PDB:1ZC3B; GQYLVYNGDLVEYEADHMAQLQRVHGFLMNDCLLVATWLPQRRGMYRYNALYPLDRLAVV --------------1111----------------------------------1111---- NVKDNPPMKDMFKLLMFPESRIFQAENAKIKREWLEVLEETKRALSDKR ------------------------------------------------- >PROBABLE N-ACETYLGLUCOSAM; SWP:Q7NU07; PDB:1ZC6A; PSIRYLIGVDGGGTGTRIRLHASDGTPLAAEGGASALSQGIAKSWQAVLSTLEAAFQQAG -----------3333------1111----------3333-----------------1111 LPAAPASACAIGLGLSGVHNRQWAGEFESQAPGFARLSLATDGYTTLLGAHGGQPGIIVA ----3333------------------------------------------iiii------ LGTGSIGEALYPDGSHREAGGWGYPSGDEASGAWLGQRAAQLTQALDGRHSHSPLTRAVL ----------1111------------------------------1111------------ DFVGGDWQAAWNGRATPAQFARLAPLVLSAARVDPEADALLRQAGEDAWAIARALDPQDE ---------3333---------------3333--------------------1111---- 
LPVALCGGLGQALRDWLPPGFRQRLVAPQGDSAQGALLLLQ ------------3333--------------------3333- >G ALPHA I/12; SWP:P27600; PDB:1ZCAA; RLVKILLLGAGESGKSTFLKQMRIIHGREFDQKALLEFRDTIFDNILKGSRVLVDARDKL ------------------------------3333-------------------------- GIPWQHSENEKHGMFLMAFENKAGLPVEPATFQLYVPALSALWRDSGIREAFSRRSEFQL -----3333------1111-2222-----------------------------3333--- GESVKYFLDNLDRIGQLNYFPSKQDILLARKATKGIVEHDFVIKKIPFKMVDVGGQRSQR ----------1111-1111-----------------------%%%%---------33331 QKWFQCFDGITSILFMVSSSEYDQVLMEDRRTNRLVESMNIFETIVNNKLFFNVSIILFL 1111111----------1111----1111------------------3333--------- NKMDLLVEKVKSVSIKKHFPDFKGDPHRLEDVQRYLVQCFDRKRRNRSKPLFHHFTTAID -3333---------33333333--1111------------1111------------3333 TENIRFVFHAVKDTILQE ------------------ >G ALPHA I/13; SWP:P27601; PDB:1ZCBA; ARLVKILLLGAGESGKSTFLKQMRIIHGQDFDQRAREEFRPTIYSNVIKGMRVLVDAREK ------------------------------------------------------------ LHIPWGDNKNQLHGDKLMAFDTRAPMAAQGMVETRVFLQYLPAIRALWEDSGIQNAYDRR ----------------111133333333------------------------------33 REFQLGESVKYFLDNLDKLGVPDYIPSQQDILLARRPTKGIHEYDFEIKNVPFKMVDVGG 33---1111-------3333---------------------------%%%%--------- WFECFDSVTSILFLVSSSEFDQVLMEDRQTNRLTESLNIFETIVNNRVFSNVSIILFLNK 3333-----------1111----1111------------------3333----------- TDLLEEKVQVVSIKDYFLEFEGDPHCLRDVQKFLVECFRGKRRDQQPLYHHFTTAINTEN -----3333-------1111--1111------------1111-----------1111--- IRLVFRDVKDTILHDNLK ------------------ >GLYCEROPHOSPHODIESTER PHO; SWP:Q8U887; PDB:1ZCCA; MTKIVSHRGANRFAPENTFAAADLALQQGADYIELDVRESADGVLYVIHDETLDRTTNGT ---------1111------------1111----------1111----------------- GPVGHMLSSEIDTLDAGGWFDDRFKGAIVPRLDAYLEHLRGRAGVYIELKYCDPAKVAAL -3333-33333333--33333333--------------2222----------3333---- VRHLGMVRDTFYFSFSEEMRQGLQSIAPEFRRMMTLDIAKSPSLVGAVHHASIIEITPAQ -----3333-----------------1111----3333--1111------------3333 MRRPGIIEASRKAGLEIMVYYGGDDMAVHREIATSDVDYINLDRPDLFAAVRSGMAELLL --1111-------------------------1111--------3333------------- 
>HYPOTHETICAL PROTEIN ATU2; SWP:Q8UC50; PDB:1ZCEA; ANYWLYKSEPFKWSWEQKAKGETGEEWTGVRNYQARNNRAKIGDKGFFYHSNEGLDVVGI --------3333----33333333----------------2222---------------- VEVCALSHPDSTAEGDLKWDCVDIRAVCDPQPVSLKDVKANPKLEKSLVTSRLSVQPVTE ---------1111---------------------------3333--1111---------- EEYLEVCRGGLANPPKSPD ------------------- >L-ASPARAGINASE; SWP:Q6Q4F4; PDB:1ZCFA; NLPNIVILATGGTIAGSAAANTQTTGYKAGALGVETLIQAVPELKTLANIKGEQVASIGS -----------1111----1111-----------3333--3333--------------33 ENMTSDVLLTLSKRVNELLARSDVDGVVITHGTDTLDESPYFLNLTVKSDKPVVFVAAMR 33--------------1111------------------33331111-------------- PATAISADGPMNLYGAVKVAADKNSRGRGVLVVLNDRIGSARFISKTNASTLDTFKAPEE 1111--------------3333-----------iiii--1111----------------- GYLGVIIGDKIYYQTRLDKVHTTRSVFDVTNVDKLPAVDIIYGYQDDPEYMYDASIKHGV ------%%%%--------------------------------------------1111-- KGIVYAGMGAGSVSKRGDAGIRKAESKGIVVVRSSRTGSGIVPPDAGQPGLVADSLSPAK --------------------------------------------1111------------ SRILLMLALTKTTNPAVIQDYFHAY -------------3333-------- >HYPOTHETICAL OXIDOREDUCTA; SWP:P94424; PDB:1ZCHA; MNEVIKSLTDHRSIRSYTDEPVAQEQLDQIIEAVQSAPSSINGQQVTVITVQDKERKKKI -------1111----------------------3333--%%%%----------------- SELAGGQPWIDQAPVFLLFCADFNRAKIALEDLHDFKMEITNGLESVLVGAVDAGIALGT -1111--3333---------------------------1111------------------ ATAAAESLGLGTVPIGAVRGNPQELIELLELPKYVFPLSGLVIGHPADRSAKKPRLPQEA -----1111-------1111-----------2222---------------------3333 VNHQETYLNQDELTSHIQAYDEQMSEYMNKRTNGKETRNWSQSIASYYERLYYPHIREML -------------------------------iiii------------------------- EKQGFKVEK 1111----- >PEROXISOMAL BIFUNCTIONAL ; SWP:P07896; PDB:1ZCJA; SGQAKALQYAFFAEKSANKWSTPSGASWKTASAQPVSSVGVLGLGTMGRGIAISFARVGI -----------33333333--3333-3333------------------------1111-- SVVAVESDPKQLDAAKKIITFTLEKEASRAHQNGQASAKPKLRFSSSTKELSTVDLVVEA ------------------------------1111-------------------------- VFEDMNLKKKVFAELSALCKPGAFLCTNTSALNVDDIASSTDRPQLVIGTHFFSPAHVMR 
-------------------2222---------3333-1111-3333--------1111-- LLEVIPSRYSSPTTIATVMSLSKKIGKIGVVVGNCYGFVGNRMLAPYYNQGFFLLEEGSK ------1111------------1111--------2222----------------1111-3 PEDVDGVLEEFGFKMGPFRVSDLAGLDVGWKIRKGQGLTGPSLPPGTPVRKRGNSRYSPL 333--------------------------------------------------------- GDMLCEAGRFGQKTGKGWYQYDKPLGRIHKPDPWLSTFLSQYREVHHIEQRTISKEEILE ----1111--------------2222-----3333------------------------- RCLYSLINEAFRILEEGMAARPEHIDVIYLHGYGWPRHKGGPMFYAASVGLPTVLEKLQK -------------1111---3333-----------1111--------------------- YYRQNPDIPQLEPSDYLRRLVAQGSPPLKEWQSLAGPHG ----11113333--------1111--3333-1111---- >CALPAIN 1, LARGE [CATALYT; SWP:P07384; PDB:1ZCMA; NAIKYLGQDYEQLRVRCLQSGTLFRDEAFPPVPQSLGYKDLGPNSSKTYGIKWKRPTELL ---2222----------1111----1111--3333---------3333------1111-- SNPQFIVDGATRTDICQGALGDCWLLAAIASLTLNDTLLHRVVPHGQSFQNGYAGIFHFQ -----------1111----------------------------------2222------- LWQFGEWVDVVVDDLLPIKDGKLVFVHSAEGNEFWSALLEKAYAKVNGSYEALSGGSTSE --iiii------------iiii-------1111-----------11113333----3333 AFEDFTGGVTEWYELRKAPSDLYQIILKALERGSLLGCSIDISSVLDMEAITFKKLVKGH -------------1111-1111------------------------2222-1111----- AYSVTGAKQVNYRGQVVSLIRMRNPWGEVEWTGAWSDSSSEWNNVDPYERDQLRVKMEDG -----------iiii--------3333-----2222--3333------------------ EFWMSFRDFMREFTRLEICNL --------------------- >2-DEHYDRO-3-DEOXYPHOSPHOH; SWP:Q8U0A9; PDB:1ZCOA; MKYSKEYDEKTVVKINDVKFGEGFTIIAGPCSIESREQIMKVAEFLAEVGIKVLRGGAFK 33333333------!!!!2222------------------------1111---------- PRTSPYSFQGYGEKALRWMREAADEYGLVTVTEVMDTRHVELVAKYSDILQIGARNSQNF ---1111----------------------------1111-------------3333---- ELLKEVGKVENPVLLKRGMGNTIQELLYSAEYIMAQGNENVILCERGIRTFETATRFTLD -----1111------------------------1111----------------------3 ISAVPVVKELSHLPIIVDPSHPAGRRSLVIPLAKAAYAIGADGIMVEVHPEPEKALSDSQ 333----------------3333-3333----------------------3333------ QQLTFDDFLQLLKELEALGWKG ---------------1111--- >PEPTIDYL-PROLYL CIS-TRANS; SWP:Q9UNP9; PDB:1ZCXA; 
SNPQVYMDIKIGNKPAGRIQMLLRSDVVPMTAENFRCLCTHEKGFGFKGSSFHRIIPQFM ----------!!!!--------------------------1111--2222-----2222- CQGGDFTNHNGTGGKSIYGKKFDDENFILKHTGPGLLSMANSGPNTNGSQFFLTCDKTDW ---------------1111--------------------------------------333 LDGKHVVFGEVTEGLDVLRQIEAQGSKDGKPKQKVIIADCGEYV 3------------3333----11111111--------------- >BIFUNCTIONAL PURINE BIOSY; SWP:Q9X0X6; PDB:1ZCZA; MKRILVSLYEKEKYLDILRELHEKGWEIWASSGTAKFLKSNGIEANDVSTITGFENLLGG ---------3333--------1111-------------1111----3333------%%%% LVKTLHPEIFAGILGPEPRWDVVFVDLYPPPDIDIGGVALLRAAAKNWKKVKPAFDMETL -1111-----------------------------------------3333---------- KLAIEIDDEETRKYLAGMTFAFTSVYDSIRANQFVEGISLAFKREDLQLRYGENPHEKAF ----------------------------------2222---------------1111--- VYGKPAFEILHEGKTISFNNILDAENAWFMAKNLPRMGAVVVKHQSPCGAAIGEDKVEIV ------------------------------1111--------%%%%-------------- KKAIEADDESSFGGILAVNFEMDEEVAKSLKKYLEVIVAPSFTQEAIEVLSKKKVRLLKP ------33332222------------3333-------------------1111------- GDYASWAGKMAFGSLVLSERKYPEGNFELVVGEPLSEKELEDLEFAYRVVEGAKSNAVLI ----------iiii-----------------------------------1111------- AKDGVTVGIGSGQPSRKRAAWIATVMAGEKAKGAVAASDAFFPFPDSLEILAQAGVKAVV -iiii-----------------------3333---------------------------- APLGSIRDEEVIEKARELGITFYKAPSRVFRH ----1111------------------------ >HYPOTHETICAL PROTEIN PF05; SWP:NA; PDB:1ZD0A; HHHGSLEIRTKVGEICISKVWLTDEQINKLFDRFKGDYQVVNAECADKVIFATIIAIKAV -2222------------------------------------------------------- KEGRSIAKTVPGEILVRLSGNRQIKEAIKKVGAKEGENYIVTFGENASALLQKILSTLEI ------------------------------------------------------------ KELELERCDLEYAKKAFEDIA --------------------- >SULFOTRANSFERASE 4A1; SWP:NA; PDB:1ZD1A; GEFESKYFEFHGVRLPPFCRGKMEEIANFPVRPSDVWIVTYPKSGTSLLQEVVYLVSQGE -2222----iiii--1111------------1111------------------------- QLPVLEYPQPGLDIIKELTSPRLIKSHLPYRFLPSDLHNGDSKVIYMARNPKDLVVSYYQ ---1111---33331111----------3333--3333---------------------- FHGTFQEFCRRFMNDKLGYGSWFEHVQEFWEHRMDSNVLFLKYEDMHRDLVTMVEQLARF 
---3333----1111-22223333-----1111-1111---3333--------------- LGVSCDKAQLEALTEHCHQLVDQCCNAEALPVGRGRVGLWKDIFTVSMNEKFDLVYKQKM ---------------------11111111---1111-3333------------------! GKCDLTFDFYL !!!-------- >EPOXIDE HYDROLASE 2, CYTO; SWP:P34913; PDB:1ZD3A; TLRAAVFDLDGVLALPAVFGVLGRTEEALALPRGLLNDAFQKGGPEGATTRLMKGEITLS --------2222----3333-----------2222-----2222-------1111--333 QWIPLMEENCRKCSETAKVCLPKNFSIKEIFDKAISARKINRPMLQAALMLRKKGFTTAI 3--------------------------------------------------1111----- LTNTWLDDRAERDGLAQLMCELKMHFDFLIESCQVGMVKPEPQIYKFLLDTLKASPSEVV --------1111---------3333-----3333------3333----------3333-- FLDDIGANLKPARDLGMVTILVQDTDTALKELEKVTGIQLLNTPAPLPTSCNPSDMSHGY ----3333-------------------------------------------3333----- VTVKPRVRLHFVELGSGPAVCLCHGFPESWYSWRYQIPALAQAGYRVLAMDMKGYGESSA ---1111---------------------3333-------------------2222----- PPEIEEYCMEVLCKEMVTFLDKLGLSQAVFIGHDWGGMLVWYMALFYPERVRAVASLNTP --3333--------------1111----------------------3333---------- FIPANPNMSPLESIKANPVFDYQLYFQEPGVAEAELEQNLSRTFKSLFRASDESVLSMHK ----------------3333-3333-------------------------------3333 VCEAGGLFVNSPEEPSLSRMVTEEEIQFYVQQFKKSGFRGPLNWYRNMERNWKWACKSLG 3333-------------3333----------------3333-1111---------1111- RKILIPALMVTAEKDFVLVPQMSQHMEDWIPHLKRGHIEDCGHWTQMDKPTEVNQILIKW -----------1111---33331111---1111----------3333------------- LDSDARN ------- >DNA POLYMERASE III ALPHA ; SWP:P74750; PDB:1ZD7A; CLSFGTEILTVEYGPLPIGKIVSEEINCSVYSVDPEGRVYTQAIAQWHDRGEQEVLEYEL --1111---------------------------1111----------------------1 EDGSVIRATSDHRFLTTDYQLLAIEEIFARQLDLLTLENIKQTEEALDNHRLPFPLLDAG 111-----1111---1111--------1111---------------1111----1111-- TIKVKVIGRRSLGVQRIFDIGLPQDHNFLLANGAIAAN -----------------------------1111----- >GTP:AMP PHOSPHOTRANSFERAS; SWP:Q9UIJ7; PDB:1ZD8A; LLRAVIMGAPGSGKGTVSSRITTHFELKHLSSGDLLRDNMLRGTEIGVLAKAFIDQGKLI --------2222----------------------------------------3333---- PDDVMTRLALHELKNLTQYSWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEVIKQRLT 
------------1111-------------------------------------------- ARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDDKPETVIKRLKAYEDQTKPVLEY -----1111---3333----2222----------3333---------------------- YQKKGVLETFSGTETNKIWPYVYAFLQTKVPQ -------------3333------1111----- >ADP-RIBOSYLATION FACTOR-L; SWP:Q96BM9; PDB:1ZD9A; SKEEMELTLVGLQYSGKTTFVNVIASGQFNEDMIPTVGFNMRKITKGNVTIKLWDIGGQP -----------2222------------------------------!!!!---------33 RFRSMWERYCRGVSAIVYMVDAADQEKIEASKNELHNLLDKPQLQGIPVLVLGNKRDLPG 3311113333----------11111111------------3333----------3333-- ALDEKELIEKMNLSAIQDREICCYSISCKEKDNIDITLQWLIQHSK -----------3333---------------2222-------1111- >UBIQUITIN-CONJUGATING ENZ; SWP:Q16763; PDB:1ZDNA; ENLPPHIIRLVYKEVTTLTADPPDGIKVFPNEEDLTDLQVTIEGPEGTPYAGGLFRMKLL ----------------------2222----3333-------------1111--------- LGKDFPASPPKGYFLTKIFHPNVGANGEICVNVLKRDWTAELGIRHVLLTIKCLLIHPNP -------------------11111111---333322221111---------3333---33 ESALNEEAGRLLLENYEEYAARARLLTEIHG 33----------------------------- >DIHYDROFOLATE REDUCTASE; SWP:NA; PDB:1ZDRA; MISHIVAMDENRVIGKDNRLPWHLPADLAYFKRVTMGHAIVMGRKTFEAIGRPLPGRDNV --------1111---%%%%----3333-------2222---------------2222--- VVTGNRSFRPEGCLVLHSLEEVKQWIASRADEVFIIGGAELFRATMPIVDRLYVTKIFAS ----1111-----------------1111---------------3333------------ FPGDTFYPPISDDEWEIVSYTPGGKDEKNPYEHAFIIYERK ----------1111-----------1111------------ >STEROIDOGENIC FACTOR 1; SWP:Q13285; PDB:1ZDTA; PNVPELILQLLQLEPDEDQVRARILGSLPDQPAAFGLLCRMADQTFISIVDWARRCMVFK ----------1111----3333-----------------------------------333 ELEVADQMTLLQNCWSELLVFDHIYRQVQHGKEGSILLVTGQEVELTTVATQAGSLLHSL 3------------------------------1111--3333---3333------------ VLRAQELVLQLLALQLDRQEFVCLKFIILFSLDLKFLNNHILVKDAQEKANAALLDYTLC -----------1111-----------------3333------------------------ HYPHSGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRNNLLIEMLQAKQ ---------------------------------1111---------------- >AROMATIC PRENYLTRANSFERAS; SWP:Q4R2T2; PDB:1ZDYA; 
EAADVERVYAAMEEAAGLLGVACARDKIYPLLSTFQDTLVEGGSVVVFSMASGRHSTELD ----------------1111---3333-------3333-------------!!!!----- FSISVPTSHGDPYATVVEKGLFPATGHPVDDLLADTQKHLPVSMFAIDGEVTGGFKKTYA -----3333-----------------3333------------------------------ FFPTDNMPGVAELSAIPSMPPAVAENAELFARYGLDKVQMTSMDYKKRQVNLYFSELSAQ --1111---------1111-----------1111-----------------------333 TLEAESVLALVRELGLHVPNELGLKFCKRSFSVYPTLNWETGKIDRLCFAVISNDPTLVP 3-------------------------1111------------------------------ SSDEGDIEKFHNYATKAPYAYVGEKRTLVYGLTLSPKEEYYKLGAYYHITDVQRGLLKAF ----------------------------------1111---------------------- D - >REDUCTASE, ASSEMBLY PROTE; SWP:O30064_ARCFU; PDB:1ZE0A; HMREHLKLFSLIFSYPDEDKLGKAIALAEGIGLTEIAQTLKQVDIEALQVEYTSLFISSH ---------------------------------3333----------------------- PSVPCPPYQSYFEEGSVYGKASLRAAELYSKYGLNYVYESEPPDHISVELEFLSMNPELL ----------3333----3333-------------------1111----------3333- SDFRDWFLEFAKCVEEKSEIYATFARAFRKFLEK ------------------3333------------ >Outer membrane usher prot; SWP:P30130; PDB:1ZE3D; DLYFNPRFLLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQL ----3333----1111-------------%%%%-------------1111---------- ASMGLNTASVAGMNLLADDACVPLTTMVQDATAHLDVGQQRLNLTIPQAFMSNRAR -----33332222---1111-------2222-----1111------3333------ >SHORT SYNTHETIC D-AMINO A; SWP:NA; PDB:1ZEAH; QIQLVQSGPELKTPGETVRISCKASGYTFTTYGMSWVKQTPGKGFKWMGWINTYSGVPTY ------------2222-----------1111----------------------------- ADDFKGRFAFSLETSASTAYLQINNL 1111---------1111--------- >If kappa light chain [Fra; SWP:A2NHM3; PDB:1ZEAL; DVLMTQTPLSLPVSLGDQASISCKSSQSIVHS -------------2222--------------- >ALKALINE PHOSPHATASE; SWP:P05187; PDB:1ZEDA; IIPVEEENPDFWNREAAEALGAAKKLQPAQTAAKNLIIFLGDGMGVSTVTAARILKGQKK --3333----------------1111--------------2222---------------- DKLGPEIPLAMDRFPYVALSKTYNVDKHVPDSGATATAYLCGVKGNFQTIGLSAAARFNQ ---1111-3333-----------1111------------------2222-------2222 CNTTRGNEVISVMNRAKKAGKSVGVVTTTRVQHASPAGTYAHTVNRNWYSDADVPASARQ 
---2222---------1111---------11113333-------1111-3333------- EGCQDIATQLISNMDIDVILGGGRKYMFRMGTPDPEYPDDYSQGGTRLDGKNLVQEWLAK ----------------------------2222-1111--3333----------------- RQGARYVWNRTELMQASLDPSVTHLMGLFEPGDMKYEIHRDSTLDPSLMEMTEAALRLLS 2222--------------1111-------------3333-3333-------------333 RNPRGFFLFVEGGRIDHGHHESRAYRALTETIMFDDAIERAGQLTSEEDTLSLVTADHSH 31111-----------------------------------3333-3333----------- VFSFGGYPLRGSSIFGLAPGKARDRKAYTVLLYGNGPGYVLKDGARPDVTESESGSPEYR --------22221111-----1111----------------iiii----3333--1111- QQSAVPLDEETHAGEDVAVFARGPQAHLVHGVQEQTFIAHVMAFAACLEPYTACDLAPPA ----------------------2222-----------------1111!!!!--------- G - >HYPOTHETICAL PROTEIN SO44; SWP:Q8E972; PDB:1ZEEA; YNTEAFDEWIRSRFVELNSQLEQLYYQQTDRANVQEVGTELKHTLESEGRELVKALLDEG -------------------------------------3333------------1111--- NTDEGFDSAFDLLGNVGLYAACRRHEITEPTRETTSPLLEASALAHIGASIGVTPRFATA -----------------------------3333------------------------333 HLTTHNRAHNGIYKRFTDLPDEKLFVDYNTKGILAYKRASDALLKIQPLGISHPISHDLL 3-------iiii------3333---------------------3333--11113333--- RVTKQALQDVIESNQQLFNRLDTDRFFYCVRPYYKPYRVGSVVYRGANAGDFAGINVIDL ---------------------3333----3333-----!!!!-----1111--------1 TLGLCFANEASYSQLVDKFLYPEDQQILRECRRPNLDDFLQAKGCIHQDWYQENLKLFIE 111--11113333-1111-----------------------3333--------------- VCELHGQTAIQHHNELVTKYVLLASLERLRDRRAAVLRDDIRTRYYDLKKLKDSLR ----------------3333--------------------------------1111 >INSULIN; SWP:P01315; PDB:1ZEIA; FVNQHLCGSHLVEALYLVCGERGFFYTDKAAKGIVEQCCTSICSLYQLENYCN -------------------3333-------------------------1111- >3-HYDROXYACYL-COA DEHYDRO; SWP:O28262; PDB:1ZEJA; HKVFVIGAGLGRGIAIAIASKHEVVLQDVSEKALEAAREQIPEELLSKIEFTTTLEKVKD -------------------------------------111133331111-----1111-- CDIVEAVFEDLNTKVEVLREVERLTNAPLCSNTSVISVDDIAERLDSPSRFLGVHWNPPH --------------------3333-----------------1111--1111------111 VPLVEIVISRFTDSKTVAFVEGFLRELGKEVVVCKGQSLVNRFNAAVLSEASRIEEGVRA 
1-------1111------------1111-------------------------3333--- EDVDRVWKHHLGLLYTLFGPLGNLDYIGLDVAYYASLYLYKRFGDEKFKPPEWLQEKIKK --------------------------------------------3333--3333---111 GEVGVKAGKGIYEYGPKAYEERVERLKKLLRFLGLE 1--3333-------1111------------------ >HYPOTHETICAL PROTEIN RV28; SWP:NA; PDB:1ZELA; MVVSPAGADRRIPTWASRVVSGLARDRPVVVTKEDLTQRLTEAGCGRDPDSAIRELRRIG ---1111-----3333------------------------1111---------------- WLVQLPVKGTWAFIPPGEAAISPYLPLRSWLARDQNAGFMLAGASAAWHLGYLDRQPDGR ------2222----2222---------------1111----------------------- IPIWLPPAKRLPDGLASYVSVVRIPWNAADTALLAPRPALLVRRRLDLVAWATGLPALGP -----1111--33331111-------33333333--3333-1111------iiii----- EALLVQIATRPASFGPWADLVPHLDDLVADCSDERLERLLSGRPTSAWQRASYLLDSGGE ---------3333--33333333----1111--------22223333--------1111- PARGQALLAKRHTEVMPVTRFTTAHSGESVWAPEYQLVDELVVPLLRVIGK -------1111--------------------3333---------------- >XYLITOL DEHYDROGENASE; SWP:Q8GR61; PDB:1ZEMA; KKFNGKVCLVTGAGGNIGLATALRLAEEGTAIALLDMNREALEKAEASVREKGVEARSYV --2222-----1111----------1111--------------------1111------- CDVTSEEAVIGTVDSVVRDFGKIDFLFNNAGYQGAFAPVQDYPSDDFARVLTINVTGAFH -1111--------------------------------3333------------------- VLKAVSRQMITQNYGRIVNTASMAGVKGPPNMAAYGTSKGAIIALTETAALDLAPYNIRV ---------1111--------3333---2222--------------------3333---- NAISPGYMGPGFMWERQVELQAKVGSQYFSTDPKVVAQQMIGSVPMRRYGDINEIPGVVA ---------------------33331111--------------1111---1111------ FLLGDDSSFMTGVNLPIAGG ---3333------------- >Cation efflux system prot; SWP:P77214; PDB:1ZEQX; METMSEAQPQVISATGVVKGIDLESKKITIHHDPIAAVNWPEMTMRFTITPQTKMSEIKT ----------------------1111---------1111----------1111-----22 GDKVAFNFVQQGNLSLLQDIKVSQ 22--------%%%%---------- >SWI5; SWP:P08153; PDB:1ZFD; DRPYSCDHPGCDKAFVRNHDLIRHKKSHQEKA -------2222------3333---3333---- >INOSINE MONOPHOSPHATE DEH; SWP:P50099; PDB:1ZFJA; SNWDTKFLKKGYTFDDVLLIPAESHVLPNEVDLKTKLADNLTLNIPIITAADTVTGSKAI -1111-------1111----------1111-------1111------------------- 
AIARAGGLGVIHKNSITEQAEEVRKVKRSENGVIIDPFFLTPEHKVSEAEELQRYRISGV ----------------------------------------3333---------------- PIVETLANRKLVGIITNRDRFISDYNAPISEHTSEHLVTAAVGTDLETAERILHEHRIEK ----3333-------3333---------------------2222---------1111--- LPLVDNSGRLSGLITIKDIEKVIEFPHAAKDEFGRLLVAAAVGVTSDTFERAEALFEAGA ----1111----------------1111--1111----------1111-------1111- DAIVIDTAHGHSAGVLRKIAEIRAHFPNRTLIAGNIATAEGARALYDAGVDVVKVGIGPG --------1111---------------------------------1111----------1 SICTTRVVAGVGVPQVTAIYDAAAVAREYGKTIIADGGIKYSGDIVKALAAGGNAVLGSF 111-3333--------------------------------3333----1111-------- AGTDEAPGETEIYQGRKYKTYRGGSIAAKKNKLVPEGIEGRVAYKGAASDIVFQLGGIRS --3333------iiii------------------------------3333---------- GGYVGAGDIQELHENAQFVESGAGLIESHPHDVQITNEAPNYSV -1111--------------------------------------- >LASP-1; SWP:P80171; PDB:1ZFO; MNPNCARCGKIVYPTEKVNCLDKFWHKACF ------------3333----------1111 >PLECTASIN; SWP:Q53I06; PDB:1ZFUA; GFGCNGPWDEDDMQCHNHCKSIKGYKGGYCAKGGFVCKCY -----3333------------2222------%%%%----- >HYPOTHETICAL UPF0213 PROT; SWP:Q9KGL3; PDB:1ZG2A; MNHYVYILECKDGSWYTGYTTDVDRRIKKHASGKGAKYTRGRGPFRLVATWAFPSKEEAM ---------1111----------------------------------------------- RWEYEVKHLSRRKKEQLVSLKGGPYENTTKLSTT ----------------------3333-------- >ISOFLAVANONE 4'-O-METHYLT; SWP:Q29U70; PDB:1ZG3A; GSEESELYHAQIHLYKHVYNFVSSMALKSAMELGIADAIHNHGKPMTLSELASSLKLHPS -3333----------------------------------3333--------------111 KVNILHRFLRLLTHNGFFAKTIVKGKEGDEEEEIAYSLTPPSKLLISGKPTCLSSIVKGA 1-------------------------------------3333------1111-------- LHPSSLDMWSSSKKWFNEDKEQTLFECATGESFWDFLNKDSESSTLSMFQDAMASDSRMF -3333---------------------------------1111------------------ KLVLQENKRVFEGLESLVDVGGGTGGVTKLIHEIFPHLKCTVFDQPQVVGNLTGNENLNF ------33331111-------!!!!---------1111------3333------1111-- VGGDMFKSIPSADAVLLKWVLHDWNDEQSLKILKNSKEAISHKGKDGKVIIIDISIDETS ---1111------------3333----------------3333-------------1111 DDRGLTELQLDYDLVMLTMFLGKERTKQEWEKLIYDAGFSSYKITPISGFKSLIEVYP 
----------------------------------3333-------------------- >CHALCONE REDUCTASE; SWP:Q40309; PDB:1ZGDA; EIPTKVLTNTSSQLKMPVVGMGSAPDFTCKKDTKDAIIEAIKQGYRHFDTAAAYGSEQAL ------1111---------------1111-----------1111------1111------ GEALKEAIELGLVTRDDLFVTSKLWVTENHPHLVIPALQKSLKTLQLDYLDLYLIHWPLS -------------3333-------3333-1111--------------------------- SQPGKFSFPIDVADLLPFDVKGVWESMEESLKLGLTKAIGVSNFSVKKLENLLSVATVLP ----------3333----------------1111-------------------------- AVNQVEMNLAWQQKKLREFCNAHGIVLTAFSPVRKGASRGPNEVMENDMLKEIADAHGKS -------1111---------1111------1111!!!!---3333--------------- VAQISLRWLYEQGVTFVPKSYDKERMNQNLRIFDWSLTKEDHEKIAQIKQNRLIPGPTKP ---------1111--------3333--1111-------------1111------------ GLNDLYDD -1111--- >Putative low molecular we; SWP:P39155; PDB:1ZGGA; MDIIFVCTGNTCRSPMAEALFKSIAEREGLNVNVRSAGVFASPNGKATPHAVEALFEKHI ------1111-------------------------------------------------- ALNHVSSPLTEELMESADLVLAMTHQHKQIIASQFGRYRDKVFTLKEYVTGSHGDVLDPF -------------------------------------3333--3333-----------22 GGSIDIYKQTRDELEELLRQLAKQLKKDRR 223333------------------------ >METHIONYL-TRNA FORMYLTRAN; SWP:Q4CGU0; PDB:1ZGHA; LNIIIATTKSWNIKNAQKFKKENESKYNTTIITNKDELTFEKVKLINPEYILFPHWSWII --------------------1111---------3333-33333333-------------- PKEIFENFTCVVFHTDLPFGRGGSPLQNLIERGIKKTKISAIKVDGGIDTGDIFFKRDLD 3333-------------------------1111--------------------------- LYGTAEEIFRASKIIFNDIPELLTKRPVPQKQEGEATVFQRRKPEQSEISPDFDLEKIYD ------------------------------------------3333---1111------- YIRLDGEGYPRAFIKYGKYRLEFSRASKNGKIIADVEIIEG -----2222------!!!!--------2222---------- >KELCH-LIKE ECH-ASSOCIATED; SWP:Q14145; PDB:1ZGKA; PKVGRLIYTAGGYFRQSLSYLEAYNPSNGTWLRLADLQVPRSGLAGCVVGGLLYAVGGRN ------------------------------------------------iiii-------- NSPDGNTDSSALDCYNPTNQWSPCAPSVPRNRIGVGVIDGHIYAVGGSHGCIHHNSVERY -1111--------------------------------iiii-------!!!!-------- EPERDEWHLVAPLTRRIGVGVAVLNRLLYAVGGFDGTNRLNSAECYYPERNEWRITANTI 
-1111------------------%%%%--------------------1111--------- RSGAGVCVLHNCIYAAGGYDGQDQLNSVERYDVETETWTFVAPKHRRSALGITVHQGRIY --------!!!!------------------------------------------iiii-- VLGGYDGHTFLDSVECYDPDTDTWSEVTRTSGRSGVGVAVT ------------------1111------------------- >TRBC1 protein [Fragment]; SWP:Q8N2T6; PDB:1ZGLM; DSVTQMEGPVTLSEEAFLTINCTYTATGYPSLFWYVQYPGEGLQLLLKATKADDKGSNKG ---------------------------------------------------------iii FEATYRKETTSFHLEKGSVQVSDSAVYFCALSGGDSSYKLIFGSGTRLLVRPDNPDPVYS i------------------3333------------------------------------- VCLFTDFDSQTNDVYITDKTVLDMRSMDFKSNSAVAWSNSDFACANAFNNSIIPEDTF -----------------------1111---------------1111------------ >TRBC1 protein [Fragment]; SWP:Q8N2T6; PDB:1ZGLP; GGGGVTQTPRYLIKTRGQQVTLSCSPISGHRSVSWYQQTPGQGLQFLFEYFNETQRNKGN --------------------------------------------------iiii------ FPGRFSGRQFSNSRSEMNVSTLELGDSALYLCASSLADRVNTEAFFGQGTRLTVVEDLKN ---------1111----------------------------------------------- VFPPEVAVFEPSEAEISHTQATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLKEQ -------------3333-----------------------%%%%---------------3 PALNDSRYSLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAW 333-------------3333-----------------1111------------------- G - >RED FLUORESCENT PROTEIN D; SWP:Q9U6Y8; PDB:1ZGOA; NVIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGHNTVKLKVTKGGPLPFAWDILSPQF ----------------iiii----------1111-----------------33331111- SKVYVKHPADIPDYKKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGCFIYKVKFIGVN -1111--1111-3333--------------1111-----------iiii----------- FPSDGPVMQKKTMGWEASTERLYPRDGVLKGEIHKALKLKDGGHYLVEFKSIYMAKKPVQ -1111-------------------iiii----------1111------------------ LPGYYYVDSKLDITSHNEDYTIVEQYERTEGRHHLFL ----------------1111-------------1111 >MANNOSE/GLUCOSE-SPECIFIC ; SWP:P83304; PDB:1ZGSA; KGMISVGPWGGSGGNYWSFKANHAITEIVIHVKDNIKSISFKDASGDISGTFGGKDPREN ------------------------------------------1111-----------111 EKGDEKKIKIHWPTEYLKSISGSYGDYNGVLVIRSLSFITNLTTYGPFGSTSGGESFSIP 
1-------------------------iiii---------1111----------------- IADSVVVGFHGRAGYYLDALGIFVQPVPHGTISFGPWGGPAGDDAFNFKVGSWIKDIIIY --------------------------2222------------------------------ ADAAINSIAFKDANGHCYGKFGGQDPNDIGVEKKVEIDGNLEHLKSISGTYGNYKGFEVV -----------1111---------1111---------3333------------iiii--- TSLSFITNVTKHGPFGIASGTSFSIPIEGSLVTGFHGKSGYYLDSIGIYVKPRDVEGSIS ------------------------------------------------------------ IGPWGGSGGDPWSYTANEGINQIIIYAGSNIKSVAFKDTSGLDSATFGGVNPKDTGEKNT -------------------------------------1111---------1111------ VSINWPSEYLTSISGTYGQYKFKDVFTTITSLSFTTNLATYGPFGKASATSFSIPIHNNM ------------------------------------------------------------ VVGFHGRAGDYLDAIGIFVKPD ---------------------- >PEROXISOME PROLIFERATOR A; SWP:Q86U60; PDB:1ZGYA; PESADLRALAKHLYDSYIKSFPLTKAKARAILTGKTTDKSPFVIYDMNSLMMGEDKIKFK ------------------------------------------------------------ HITPLQEQSKEVAIRIFQGCQFRSVEAVQEITEYAKSIPGFVNLDLNDQVTLLKYGVHEI -------------------------------------2222------------------- IYTMLASLMNKDGVLISEGQGFMTREFLKSLRKPFGDFMEPKFEFAVKFNALELDDSDLA ---3333--1111---iiii-------1111------------------1111------- IFIAVIILSGDRPGLLNVKPIEDIQDNLLQALELQLKLNHPESSQLFAKLLQKMTDLRQI --------1111---------------------------1111----------------- VTEHVQLLQVIKKTETDMSLHPLLQEIYKDLY -------------------------------- >TorCAD operon transcripti; SWP:P38684; PDB:1ZGZA; PHHIVIVEDEPVTQARLQSYFTQEGYTVSVTASGAGLREIQNQSVDLILLDINLPDENGL ------------------------------------------------------------ LTRALRERSTVGIILVTGRSDRIDRIVGLEGADDYVTKPLELRELVVRVKNLLWRIDQ ------------------------------------------------------1111 >NON-STRUCTURAL POLYPROTEI; SWP:Q9WMX2; PDB:1ZH1A; FFSCQRGYKGVWRGDGIMQTTCPCGAQITGHVKNGSMRIVGPRTCSNTWHGTFPINAYTT ---------------------1111-------iiii-----11113333------1111- GPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTTDNVKCPCQVPAPEFFTEVDG ----------------------------!!!!------------------3333---iii VRLHRYAPACKPLLREEVTFLVGLNQYLVGSQLPCEPEPDVAV i--------------------------2222-1111------- 
>KDP operon transcriptiona; SWP:P21866; PDB:1ZH2A; TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI -------------------3333------------------------------1111--- EFIRDLRQWSAVPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHSQ -----3333------------3333----------------------------------- >LUPUS LA PROTEIN; SWP:P05455; PDB:1ZH5A; GDNEKAALEAKICHQIEYYFGDFNLPRDKFLKEQIKLDEGWVPLEIIKFNRLNRLTTDFN 3333-------------------3333-----1111-iiii-3333-------------- VIVEALSKSKAELEISEDKTKIRRSPSKPLPEVTDEYKNDVKNRSVYIKGFPTDATLDDI -----1111------1111-----1111-----------------------1111----- KEWLEDKGQVLNIQRRTLHKAFKGSIFVVFDSIESAKKFVETPGQKYKETDLLILFKDDY -1111-----------1111--------------------------!!!!-----3333- F - >OXIDOREDUCTASE; SWP:Q9WYE8; PDB:1ZH8A; LRKIRLGIVGCGIAARELHLPALKNLSHLFEITAVTSRTRSHAEEFAKVGNPAVFDSYEE -------------------------1111---------3333------------------ LLESGLVDAVDLTLPVELNLPFIEKALRKGVHVICEKPISTDVETGKKVVELSEKSEKTV 1111----------3333--------1111-----------3333--------------- YIAENFRHVPAFWKAKELVESGAIGDPVFNWQIWVGDENNKYVHTDWRKKPKHVGGFLSD ---3333--------------1111-----------111133331111----2222---- GGVHHAAARLILGEIEWISAVAKDLSPLLGGDFLSSIFEFENGTVGNYTISYSLKGNERF -------------------------3333----------1111----------------- EITGTKGKISISWDKIVLNEEEKVPQENSYQKEFEDFYQVVAEGKPNDLGSPVQALKDLA ---1111----1111--!!!!--------------------------------------- FIEACVRSAGNKVFVSSLL -------iiii--3333-- >HYPOTHETICAL PROTEIN HP12; SWP:O25839; PDB:1ZHCA; MFHEFRDEISVLKANNPHFDKIFEKHNQLDDDIKTAEQQNASDAEVSHMKKQKLKLKDEI -1111----------3333----------------------3333--------------- HSMIIEYREKQKSERA ---------------- >MANNAN-BINDING LECTIN; SWP:Q9RHG4; PDB:1ZHSA; ASYKVNIPAGPLWSNAEAQQVGPKIAAAHQGNFTGQWTTVVESAMSVVEVELQVENTGIH --------------------------1111----------2222---------------- EFKTDVLAGPLWSNDEAQKLGPQIAASYGAEFTGQWRTIVEGVMSVIQIKYTF -------------------------1111----------2222---------- >HYPOTHETICAL PROTEIN ATU0; SWP:Q8UHE1; PDB:1ZHVA; APRIKLKILNGSYGIARLSASEAIPAWADGGGFVSITRTDDELSIVCLIDRIPQDVRVDP 
------------------1111--1111-------------------3333-1111---- GWSCFKFQGPFAFDETGIVLSVISPLSTNGIGIFVVSTFDGDHLLVRSNDLEKTADLLAN -------------------------1111--------3333-----3333---------- AGHSLLLEHHHHHH -------------- >KES1 PROTEIN; SWP:P35844; PDB:1ZHXA; DPSQYASSSSWTSFLKSIASFNGDLSSLSAPPFILSPISLTEFSQYWAEHPELFLEPSFI 33331111---------1111--1111---3333--------------------3333-- NDDNYKEHCLIDPEVESPELARMLAVTKWFISTLKSQYCSRNESLGSEKKPLNPFLGELF 111111111111------------------------------------------2222-- VGKWENKEHPEFGETVLLSEQVSHHPPVTAFSIFNDKNKVKLQGYNQIKASFTKSLMLTV -----11113333----------------------1111-------------1111---- KQFGHTMLDIKDESYLVTPPPLHIEGILVASPFVELEGKSYIQSSTGLLCVIEFSGRGYF ---------!!!!-------------1111-------------1111------------- SGKKNSFKARIYKDSKDSKDKEKALYTISGQWSGSSKIIKANKKEESRLFYDAARIPAEH -------------3333--3333-------1111--------1111-----3333----- LNVKPLEEQHPLESRKAWYDVAGAIKLGDFNLIAKTKTELEETQRELRKEEEAKGISWQR ----3333-1111----------------------------------------------- RWFKDFDYSVTPEEGALVPEKDDTFLKLASALNLSTKNAPSGTLVGDKEDRKEDLSSIHW ------------1111-----------------------222222223333--------- RFQRELWDEEKEIVL ------1111----- >DNA GYRASE SUBUNIT A; SWP:P09097; PDB:1ZI0A; TQEDVVVTLSHQGYVKYQPLSDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGA ---------3333-----------------1111-----1111-----1111-------- RGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAI ---1111-------------------1111-----1111------1111----------- KLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVS --2222--------!!!!-----1111-----3333----------------2222---- LIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQ -------------1111-----3333-----------------------------3333- IMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAE ----3333-----3333----------------2222---------- >CARBOXYMETHYLENEBUTENOLID; SWP:P0A114; PDB:1ZI8A; MLTEGISIQSYDGHTFGALVGSPAKAPAPVIVIAQDIFGVNAFMRETVSWLVDQGYAAVC --2222---1111--------------------------------------1111----- 
PDLYARQAPGTALDPQDERQREQAYKLWQAFDMEAGVGDLEAAIRYARHQPYSNGKVGLV -1111--2222--1111----------1111--------------33331111------- GYSLGGALAFLVASKGYVDRAVGYYGVGLEKQLNKVPEVKHPALFHMGGQDHFVPAPSRQ ------------1111-----------111133331111--------------------- LITEGFGANPLLQVHWYEEAGHSFARTGSSGYVASAAALANERTLDFLVPLQS --------1111----------1111--1111---------------3333-- >ADENYLATE KINASE; SWP:P27142; PDB:1ZIN; MNLVLMGLPGAGKGTQAEKIVAAYGIPHISTGDMFRAAMKEGTPLGLQAKQYMDRGDLVP -------2222-----------------------------------------1111---- DEVTIGIVRERLSKDDCQNGFLLDGFPRTVAQAEALETMLADIGRKLDYVIHIDVRQDVL -------------1111-----------------------1111---------------- MERLTGRRICRNCGATYHLIFHPPAKPGVCDKCGGELYQRADDNEATVANRLEVNMKQMK -------------------------2222----------1111----------------- PLVDFYEQKGYLRNINGEQDMEKVFADIRELLGGLAR --------------------------------3333- >GAMMA CRYSTALLIN E; SWP:P02528; PDB:1ZIRA; GKITFYEDRGFQGRHYECSTDHSNLQPYFSRCNSVRVDSGCWMLYEQPNFTGCQYFLRRG --------%%%%------------3333-------------------%%%%--------- DYPDYQQWMGFSDSVRSCRLIPHSSSHRIRIYEREDYRGQMVEITDDCPHLQDRFHFSDF ---3333---------------------------%%%%---------------------- HSFHVMEGYWVLYEMPNYRGRQYLLRPGEYRRYHDWGAMNARVGSLRRIMDFY ---------------%%%%------------3333------------------ >TRANSCRIPTIONAL REGULATOR; SWP:O66551; PDB:1ZITA; MKRVLVVDDEESITSSLSAILEEEGYHPDTAKTLREAEKKIKELFFPVIVLDVWMPDGDG -------------2222------------------------------------------3 VNFIDFIKENSPDSVVIVITGHGSVDTAVKAIKKGAYEFLEKPFSVERFLLTIKHAFEEY 333-------1111------------33331111-------------------------- S - >CALPAIN 9; SWP:O14815; PDB:1ZIVA; SFEQMRQECLQRGTLFEDADFPASNSSLFYSPQIPFVWKRPGEIVKNPEFILGGATRTDI -----------------3333--3333------------3333-------------1111 CQGELGDCWLLAAIASLTLNQKALARVIPQDQSFGPGYAGIFHFQFWQHSEWLDVVIDDR -------3333-----------3333---------------------------------- LPTFRDRLVFLHSADHNEFWSALLEKAYAKLNGSYEALKGGSAIEAMEDFTGGVAETFQT --------------1111---------------33332222-----------------11 KEAPENFYEILEKALKRGSLLGCFIDTRSAAESEARTPFGLIKGHAYSVTGIDQVSFRGQ 
11--------------------------3333----1111----------------iiii RIELIRIRNPWGQVEWNGSWSDSSPEWRSVGPAEQKRLCHTALDDGEFWMAFKDFKAHFD -----------------2222---3333--3333-1111--------------------- KVEICNLT -------- >TOLL-LIKE RECEPTOR 3; SWP:O15455; PDB:1ZIWA; VSHEVADCSHLKLTQVPDDLPTNITVLNLTHNQLRRLPAANFTRYSQLTSLDVGFNTISK --------------------3333-------------33333333--------------- LEPELCQKLPMLKVLNLQHNELSQLSDKTFAFCTNLTELHLMSNSIQKIKNNPFVKQKNL -33333333----------------33331111------------------1111-1111 ITLDLSHNGLSSTKLGTQVQLENLQELLLSNNKIQALKSEELDIFANSSLKKLELSSNQI --------------------1111-------------3333---1111------------ KEFSPGCFHAIGRLFGLFLNNVQLGPSLTEKLCLELANTSIRNLSLSNSQLSTTSNTTFL ---22223333----------------------1111-----------------111133 GLKWTNLTMLDLSYNNLNVVGNDSFAWLPQLEYFFLEYNNIQHLFSHSLHGLFNVRYLNL 33------------------22221111----------------11112222-------- KRSFTKQLPKIDDFSFQWLKCLEHLNMEDNDIPGIKSNMFTGLINLKYLSLSNSFTSLRT -----------22223333--------------------2222----------------- LTNETFVSLAHSPLHILNLTKNKISKIESDAFSWLGHLEVLDLGLNEIGQELTGQEWRGL ---1111-1111---------------22221111------------------3333--1 ENIFEIYLSYNKYLQLTRNSFALVPSLQRLMLRRVALKNVDSSPSPFQPLRNLTILDLSN 111-------------11111111--------------------1111-1111------- NNIANINDDMLEGLEKLEILDLQHNNLARLWKHANPGGPIYFLKGLSHLHILNLESNGFD ------11112222-----------------1111-----1111-1111----------- EIPVEVFKDLFELKIIDLGLNNLNTLPASVFNNQVSLKSLNLQKNLITSVEKKVFGPAFR --11111111----------------22221111----------------3333--1111 NLTELDMRFNPFDCTCESIAWFVNWINET ----------------------------- >Probable ferredoxin-depen; SWP:P71753; PDB:1ZJ8A; RNEGQWALGHREPLNANEELKKAGNPLDVRERIENIYAKQGFDSIDKTDLRGRFRWWGLY ----1111-------3333-----3333-------3333-3333-3333----------- TQREQGYDGTWTGDDNIDKLEAKYFMMRVRCDGGALSAAALRTLGQISTEFARDTADISD -------3333-3333--------------2222-----------------%%%%----- RQNVQYHWIEVENVPEIWRRLDDVGLQTTEACGDCPRVVLGSPLAGESLDEVLDPTWAIE ---------3333------------------------------22221111---3333-- 
EIVRRYIGKPDFADLPRKYKTAISGLQDVAHEINDVAFIGVNHPEHGPGLDLWVGGGLST -----22221111----------------3333--------------------------- NPMLAQRVGAWVPLGEVPEVWAAVTSVFRDYGYRRLRAKARLKFLIKDWGIAKFREVLET ------------3333-------------------1111-3333---------------- EYLKRPLIDGPAPEPVKHPIDHVGVQRLKNGLNAVGVAPIAGRVSGTILTAVADLMARAG ---------------------------3333--------iiii-------------1111 SDRIRFTPYQKLVILDIPDALLDDLIAGLDALGLQSRPSHWRRNLMACSGIEFCKLSFAE -----------------3333--------1111-----3333-------3333------- TRVRAQHLVPELERRLEDINSQLDVPITVNINGCPNSCARIQIADIGFKGQMIDDGHGGS 3333--------------3333-----------3333--1111----------------- VEGFQVHLGGHLGLDAGFGRKLRQHKVTSDELGDYIDRVVRNFVKHRSEGERFAQWVIRA ---------------------------3333----------------22223333----- EEDDLR 3333-- >TREHALULOSE SYNTHASE; SWP:Q2PS28; PDB:1ZJAA; KPGAPWWKSAVFYQVYPRSFKDTNGDGIGDFKGLTEKLDYLKGLGIDAIWINPHYASPNT ----3333-------3333----------------------------------------- DNGYDISDYREVMKEYGTMEDFDRLMAELKKRGMRLMVDVVINHSSDQHEWFKSSRASKD -------1111-3333-------------1111--------------------3333111 NPYRDYYFWRDGKDGHEPNNYPSFFGGSAWEKDPVTGQYYLHYFGRQQPDLNWDTPKLRE 1-1111------iiii------1111------------------1111---1111----- ELYAMLRFWLDKGVSGMRFDTVATYSKTPGFPDLTPEQMKNFAEAYTQGPNLHRYLQEMH ---------1111-------1111---2222---3333--333311111111-------- EKVFDHYDAVTAGEIFGAPLNQVPLFIDSRRKELDMAFTFDLIRYDRALDRWHTIPRTLA ---1111-----------33333333-3333--------3333----1111--------- DFRQTIDKVDAIAGEYGWNTFFLGNHDNPRAVSHFGDDRPQWREASAKALATVTLTQRGT ----------3333----------1111-3333-----1111------------------ PFIFQGDELGMTNYPFKTLQDFDDIEVKGFFQDYVETGKATAEELLTNVALTSRDNARTP ---2222----------------3333-------1111---------3333--3333--- FQWDDSANAGFTTGKPWLKVNPNYTEINAAREIGDPKSVYSFYRNLISIRHETPALSTGS ------%%%%------------------------1111--------------3333---- YRDIDPSNADVYAYTRSQDGETYLVVVNFKAEPRSFTLPDGMHIAETLIESSSPAAPAAG ----1111---------iiii-----------------2222---------------222 AASLELQPWQSGIYKVK 2-----2222------- >AMINOPEPTIDASE AMPS; SWP:Q8NVU1; PDB:1ZJCA; 
NYKEKLQQYAELLVKVGMNVQPKQPVFIRSSVETLELTHLIVEEAYHCGASDVRVVYSDP --------------------2222------1111-------------------------- TLKRLKFENESVEHFANHEIKSYDVEARMDYVKRGAANLALISEDPDLMDGIDSQKLQAF --------------------3333-------1111---------11112222-------- QQQNARAFKGYMESVQKNQFPWVVAAFPSKAWAKRVYPELSVEEAYIKFIDEVFDIVRID ------------------------------------3333--------------1111-- GNDPVENWRQHIANLSVYAQKLQQKNYHALHYVSEGTDLTVGLAKNHIWEDATSYVNGKE ---------------------------------2222------2222------------- QAFIANIPTEEVFTAPDRNRVDGYVTNKLPLSYNGTIIDQFKLMFKDGEIIDFSAEKGEA ----------------1111------------%%%%---------iiii----------- VLKDLINTDEGSRRLGEVALVPDDSPISNRNTIFYNTLFDENAACHLAIGSAYAFNIQGG ----11113333------------3333-------33331111---------11112222 TEMTVEEKIASGLNDSNVHVDFMIGSSDLTIYGIFEDGSKELVFENGNWASTF -------------------------1111-----1111------iiii-1111 >PYRUVATE KINASE, ISOZYMES; SWP:P14618; PDB:1ZJHA; TFLEHMCRLDIDSPPITARNTGIICTIGPASRSVETLKEMIKSGMNVARLNFSHGTHEYH -----11111111--------------1111---------1111---------------- AETIKNVRTATESFASDPILYRPVAVALDTKGPEIRTGLIKGSGTAEVELKKGATLKITL -------------33331111--------------------------------------- DNAYMEKCDENILWLDYKNICKVVEVGSKIYVDDGLISLQVKQKGADFLVTEVENGGSLG 3333------------111111112222----iiii------------------------ SKKGVNLPGAAVDLPAVSEKDIQDLKFGVEQDVDMVFASFIRKASDVHEVRKVLGEKGKN ------2222------------------1111----------3333--------3333-- IKIISKIENHEGVRRFDEILEASDGIMVARGDLGIEIPAEKVFLAQKMMIGRCNRAGKPV -----------------------------3333----1111------------------- ICATQMLESMIKKPRPTRAEGSDVANAVLDGADCIMLSGETAKGDYPLEAVRMQHLIARE ---------------------------3333-------1111------------------ AEAAIYHLQLFEELRRLAPITSDPTEATAVGAVEASFKCCSGAIIVLTKSGRSAHQVARY -1111----------------------------------------------------111 RPRAPIIAVTRNPQTARQAHLYRGIFPVLCKDPVQEAWAEDVDLRVNFAMNVGKARGFFK 1--------------------2222----------------------------1111--2 KGDVVIVLTGWRPGSGFTNTMRVVPVP 222------------------------ >HYPOTHETICAL PROTEIN PH19; SWP:O59622; PDB:1ZJJA; 
MVAIIFDMDGVLYRGNRAIPGVRELIEFLKERGIPFAFLTNNSTKTPEMYREKLLKMGID -------2222-------2222--------------------------------1111-- VSSSIIITSGLATRLYMSKHLDPGKIFVIGGEGLVKEMQALGWGIVTLDEARQGSWKEVK -3333---------------------------------3333----3333----3333-- HVVVGLDPDLTYEKLKYATLAIRNGATFIGTNPDATLPGEEGIYPGAGSIIAALKVATNV ---------------------1111-------------1111------------------ EPIIIGKPNEPMYEVVREMFPGEELWMVGDRLDTDIAFAKKFGMKAIMVLTGVSSLEDIK -------------------2222-------------------------------333311 KSEYKPDLVLPSVYELIDYLK 11---------33333333-- >MANNAN-BINDING LECTIN SER; SWP:O00187; PDB:1ZJKA; TAHACPYPMAPPNGHVSPVQAKYILKDSFSIFCETGYELLQGHLPLKSFTAVCQKDGSWD -----------------------2222------2222---!!!!---------------- RPMPACSIVDCGPPDDLPSGRVEYITGPGVTTYKAVIQYSCEETFYTMKVNDGKYVCEAD ----------------2222------2222-2222----------------------111 GFWTSSKGEKSLPVCEPVCGLSARTTGGQIYGGQKAKPGDFPWQVLILGGTTAAGALLYD 1---1111----------------------------22221111---------------- NWVLTAAHAVYEQKHDASALDIRMGTLKRLSPHYTQAWSEAVFIHEGYTHDAGFDNDIAL -----33333333--3333--------1111-------------1111------------ IKLNNKVVINSNITPICLPRKEAESFMRTDDIGTASGWGLTQRGFLARNLMYVDIPIVDH ---------1111------111111112222-------1111------------------ QKCTAAYEKPPYPRGSVTANMLCAGLESGGKDSCRGDSGGALVFLDSETERWFVGGIVSW ----1111---------1111---------------2222------1111---------- GSMNCEAGQYGVYTKVINYIPWIENIISDF -------------------------1111- >JINGZHAOTOXIN-VII; SWP:NA; PDB:1ZJQA; GCGGLMAGCDGKSTFCCSGYNCSPTWKWCVYARP --------3333---------------------- >TRNA (GUANOSINE-2'-O-)-ME; SWP:O67577; PDB:1ZJRA; LVLEKRLKRLREVLEKRQKDLIVFADNVKNEHNFSAIVRTCDAVGVLYLYYYHAEGKKAK 3333-------------1111--------------------------------------- INEGITQGSHKWVFIEKVDNPVQKLLEFKNRGFQIVATWLSKESVNFREVDYTKPTVLVV -3333iiii-------------------1111--------1111-1111----------- GNELQGVSPEIVEIADKKIVIPMYGMAQSLNVSVATGIILYEAQRQREEKGMYSRPSLSE --1111-33331111-----------------------------------1111------ EEIQKILKKWAYEDVIK ----------------- >R-SPECIFIC ALCOHOL DEHYDR; SWP:Q84EX5; PDB:1ZK4A; 
SNRLDGKVAIITGGTLGIGLAIATKFVEEGAKVMITGRHSDVGEKAAKSVGTPDQIQFFQ ---2222-------------------1111--------3333---------3333----- HDSSDEDGWTKLFDATEKAFGPVSTLVNNAGIAVNKSVEETTTAEWRKLLAVNLDGVFFG -11113333---------------------------3333-------------------- TRLGIQRMKNKGLGASIINMSSIEGFVGDPSLGAYNASKGAVRIMSKSAALDCALKDYDV ---------------------1111---1111---------------------------- RVNTVHPGYIKTPLVDDLPGAEEAMSQRTKTPMGHIGEPNDIAYICVYLASNESKFATGS -----------3333--2222-----33333333---3333---------3333------ EFVVDGGYTAQ ----iiii--- >F17G ADHESIN SUBUNIT; SWP:Q9RH91; PDB:1ZK5A; AVSFIGSTENDVGPSQGSYSSTHNLPFVYNTGHNIGYQNANVWRISGGFCVGLDGKVDLP ---------------------------------------------iiii----------- VVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYVSTQGLSVH ----iiii-----1111---------3333------------------------------ VRPVILKRNSSAQYSVQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLND --------1111----------------iiii-!!!!--------------- >FOLDASE PROTEIN PRSA; SWP:P24327; PDB:1ZK6A; GKIRASHILVADKKTAEEVEKKLKKGEKFEDLAKEYSTDSSASKGGDLGWFAKEGQMDET ---------------------------3333--------3333iiii----3333----- FSKAAFKLKTGEVSDPVKTQYGYHIIKKTEE ----3333----------------------- >MERCURIC REDUCTASE; SWP:P00392; PDB:1ZK7A; MEPPVQVAVIGSGGAAMAAALKAVEQGAQVTLIERGTIGGTCVNVGCVPSKIMIRAAHIA -----------------------1111-------------3333---------------- HLRRESPFDGGIAATVPTIDRSKLLAQQQARVDELRHAKYEGILGGNPAITVVHGEARFK -----1111-------------------------------------1111---------- DDQSLTVRLNEGGERVVMFDRCLVATGASPAVPPIPGLKESPYWTSTEALASDTIPERLA --------1111----------------------2222---------------------- VIGSSVVALELAQAFARLGSKVTVLARNTLFFREDPAIGEAVTAAFRAEGIEVLEHTQAS ---------------1111-----------1111------------1111---------- QVAHMDGEFVLTTTHGELRADKLLVATGRTPNTRSLALDAAGVTVNAQGAIVIDQGMRTS ----iiii----1111----------------11113333-----1111----1111--- NPNIYAAGDCTDQPQFVYVAAAAGTRAAINMTGGDAALDLTAMPAVVFTDPQVATVGYSE 1111---1111----3333----------1111--------------------------- AEAHHDGIETDSRTLTLDNVPRALANFDTRGFIKLVIEEGSHRLIGVQAVAPEAGELIQT 
---1111--------1111----1111-----------------------2222------ AALAIRNRMTVQELADQLFPYLTMVEGLKLAAQTFNKDVKQLSCCAG ----1111-----1111-----3333-----------3333------ >TRANSCRIPTIONAL REGULATOR; SWP:Q815X4; PDB:1ZK8A; IGLTLQKIVETAAEIADANGVQEVTLASLAQTLGVRSPSLYNHVKGLQDVRKNLGIYGIK -------------------3333------------33333333----------------- KLHNRLEEAAEDKRDEAIHALGEAYVAFVRKHPGLYEATFLRDEEVRKAGDGIVKLCLQV ---------2222------------------------1111------------------3 LQQYGLEGENALHATRGFRSICHGFASIEQQGGFGLPLDLDISLHVLLETFIKGLR 333-------------------------1111------------------------ >TRANSCRIPTION FACTOR RELB; SWP:Q04863; PDB:1ZK9A; LVPRGSHMNTSELRICRINKESGPCTGGEELYLLCDKVQKEDISVVFSTASWEGRADFSQ ---!!!!--3333----------3333-----------1111------1111------33 ADVHRQIAIVFKTPPYEDLEISEPVTVNVFLQRLTDGVCSEPLPFTYLPR 33-------------------------------1111------------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:Q13356; PDB:1ZKCA; SSGLVPRGSGYVRLHTNKGDLNLELHCDLTPKTCENFIRLCKKHYYDGTIFHRSIRNFVI ---------------1111-------3333-------------1111-------2222-- QGGDPTGTGTGGESYWGKPFKDEFRPNLSHTGRGILSMANSGPNSNRSQFFITFRSCAYL ---1111------1111-------1111----------------------------3333 DKKHTIFGRVVGGFDVLTAMENVESDPKTDRPKEEIRIDATTVFVDPYEEADAQIAQERK ---------------------------------------------1111----------- TQLKVAP ------- >DUF185; SWP:Q6N1P6; PDB:1ZKDA; IDQTALATEIKRLIKAAGPPVWRYELCLGHPEHGYYVTRFTTSPEISQFGELLGLWSASV -------------------3333------------------3333--------------- WKAADEPQTLRLIEIGPGRGTADALRALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGI -1111-----------!!!!-----------3333-----------3333------3333 RNIHWHDSFEDVPEGPAVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAAD -------3333------------3333--------1111--------1111--------- PIPGFEALLPPLARLSPPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDVGDTF ---3333--3333---2222---------------------------------------- QAIASHSYADPLQHPGRADLTAHVDFDALGRAAESIGARAHGPVTQGAFLKRLGIETRAL ---------11112222----------------1111----------------------- SLAKATPQVSEDIAGALQRLTGEGRGAGSFKVIGVSDPKIETLVALSDD 
-----------------------iiii---------3333--2222--- >HYPOTHETICAL PROTEIN HP15; SWP:P64665; PDB:1ZKEA; MFEKIRKILADIEDSQNEIEMLLKLANLSLGDFIEIKRGSMDMPKGVNEAFFTQLSEEVE -----------------------------------------------3333--------- RLKELINALNKIKKGLLVFGS -----------3333--iiii >HYPOTHETICAL PROTEIN PA52; SWP:Q9HTY7; PDB:1ZKIA; PAREQISAYSELVGLDPVSLGDGVAEVRLPAAHLRNRGGVHGGALFSLDVTGLACSSSHG -3333---------------2222------1111-1111--------------------1 FDRQSVTLECKINYIRAVADGEVRCVARVLHAGRRSLVVEAEVRQGDKLVAKGQGTFAQL 111-----------------------------1111--------!!!!------------ >EXTENDED-SPECTRUM BETA-LA; SWP:Q99QC1; PDB:1ZKJA; DPLRPVVDASIQPLLKEHRIPGMAVAVLKDGKAHYFNYGVANRESGAGVSEQTLFEIGSV 1111------------------------iiii---------3333--------------- SKTLTATLGAYAVVKGAMQLDDKASRHAPWLKGSAFDSITMGELATYSAGGLPLQFPEEV ------------------11113333-1111-------------------------3333 DSSEKMRAYYRQWAPVYSPGSHRQYSNPSIGLFGHLAASSLKQPFAPLMEQTLLPGLGMH -----------------2222-----------------1111------------1111-- HTYVNVPKQAMASYAYGYSKEDKPIRVNPGMLADEAYGIKTSSADLLRFVKANIGGVDDK ------33331111----1111-------2222--------------------------- ALQQAISLTHQGHYSVGGMTQGLGWESYAYPVTEQTLLAGNSAKVILEANPTAAPREQVL -------1111----!!!!-----------------------3333-------------- FNKTGSTNGFGAYVAFVPARGIGIVMLANRNYPIEARIKAAHAILAQLAG ------1111------3333------------3333-------------- >Peptide corresponding to ; SWP:Q9NQR1; PDB:1ZKKA; KSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYHGDLIEI ------------------------------2222----------2222------------ TDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQTKLHDI ------------3333--------%%%%------------3333---------------i DGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH iii-----------2222---------33331111----- >High-affinity cAMP-specif; SWP:Q13946; PDB:1ZKLA; DYNGQAKCMLEKVGNWNFDIFLFDRLTNGNSLVSLTFHLFSLHGLIEYFHLDMMKLRRFL 3333-------1111---------1111--3333------------1111---------- VMIQEDYHSQNPYHNAVHAADVTQAMHCYLKEPKLANSVTPWDILLSLIAAATHDLDHPG ---11113333---------------------3333------------------------ 
VNQPFLIKTNHYLATLYKNTSVLENHHWRSAVGLLRESGLFSHLPLESRQQMETQIGALI ------1111------%%%%------------------1111------------------ LATDISRQNEYLSLFRSHLDRGDLCLEDTRHRHLVLQMALKCADICNPCRTWELSKQWSE ---3333-----------------1111-----------------3333-3333------ KVTEEFFHQGDIEKKYHLGVSPLCDRHTESIANIQIGFMTYLVEPLFTEWARFSNTRLSQ --------------------22221111-------------------------------- TMLGHVGLNKASWKGLQ ----------------- >GLYCINE CLEAVAGE SYSTEM H; SWP:Q9WY55; PDB:1ZKOA; HLKKKYTKTHEWVSIEDKVATVGITNHAQEQLGDVVYVDLPEVGREVKKGEVVASIESVK ------1111-----!!!!----------------------2222--2222------111 AAADVYAPLSGKIVEVNEKLDTEPELINKDPEGEGWLFKEISDEGELEDLLDEQAYQEFC 1-----------------3333-3333-----1111----------1111---------- AQ -- >HYPOTHETICAL PROTEIN BA10; SWP:NA; PDB:1ZKPA; AKTVVGFWGGFPEAGEATSGYLFEHDGFRLLVDCGSGVLAQLQKYITPSDIDAVVLSHYH ------------2222--------iiii------2222---3333-3333---------1 HDHVADIGVLQYARLITSATKGQLPELPIYGHTFDENGFHSLTHEPHTKGIPYNPEETLQ 1111111--------------------------------3333----------1111--- IGPFSISFLKTVHPVTCFARITAGNDIVVYSADSSYIPEFIPFTKDADLFICECNYAHQE !!!!------------------!!!!---------------1111----------1111- AAKAGHNSTEVASIAKDANVKELLLTHLPHTGNPADLVTEAKQIFSGHITLAHSGYVWNS 3333------------------------------------------------2222---- >THIOREDOXIN REDUCTASE 2, ; SWP:Q9JLT4; PDB:1ZKQA; QQSFDLLVIGGGSGGLACAKEAAQLGKKVAVADYVEPSPRGTKWGLGGTCVNVGCIPKKL ----------------------1111-----------1111------3333--------- MHQAALLGGMIRDAHHYGWEVAQPVQHNWKTMAEAVQNHVKSLNWGHRVQLQDRKVKYFN ------------3333-------------------------------------------- IKASFVDEHTVRGVDKGGKATLLSAEHIVIATGGRPRYPTQVKGALEYGITSDDIFWLKE --------------1111-----------------------2222-----3333------ SPGKTLVVGASYVALECAGFLTGIGLDTTVMMRSIPLRGFDQQMSSLVTEHMESHGTQFL ---------------------1111----------------------------------- KGCVPSHIKKLPTNQLQVTWEDHASGKEDTGTFDTVLWAIGRVPETRTLNLEKAGISTNP ----------1111--------------------------------33331111------ KNQKIIVDAQEATSVPHIYAIGDVAEGRPELTPTAIKAGKLLAQRLFGKSSTLMDYSNVP 
-------1111---1111---3333----------------------------------- TTVFTPLEYGCVGLSEEEAVALHGQEHVEVYHAYYKPLEFTVADRDASQCYIKMVCMREP ---------------3333----3333---------33331111---------------- PQLVLGLHFLGPNAGEVTQGFALGIKCGASYAQVMQTVGIHPTCSEEVVKLHISKRSGLE ------------3333--------1111-----1111------33331111--3333--- PT -- >Major allergen I polypept; SWP:FEL1B_FELCA; PDB:1ZKRA; MEICPAVKRDVDLFLTGTPDEYVEQVAQYKALPVVLENARILKNCVDAKMTEEDKENALS ---3333------------------3333------------------------------- LLDKIYTSPLCVKMAETCPIFYDVFFAVANGNELLLDLSLTKVNATEPERTAMKKIQDCY -------1111-----------------------------1111---------------- VENGLISRVLDGLVMTTISSSKDCMG 11111111-3333--------1111- >GROWTH/DIFFERENTIATION FA; SWP:Q9UK05; PDB:1ZKZA; AGSHCQKTSLRVNFEDIGWDSWIIAPKEYEAYECKGGCFFPLADDVTPTKHAIVQTLVHL ------------3333--3333-------------------------------------- KFPTKVGKACCVPTKLSPISVLYKDDMGVPTLKYHYEGMSVAECGCR -1111-------------------1111------------------- >HYPOTHETICAL PROTEIN PA51; SWP:Q9HTZ1; PDB:1ZL0A; SRPSSDQTWQPIDGRVALIAPASAIATDVLEATLRQLEVHGVDYHLGRHVEARYRYLAGT -------------------------------------1111-----1111---!!!!--- VEQRLEDLHNAFDMPDITAVWCLRGGYGCGQLLPGLDWGRLQAASPRPLIGFSDISVLLS ---------11111111----------3333-1111----3333-------!!!!----- AFHRHGLPAIHGPVATGLGLSPLSAPREQQERLASLASVSRLLAGIDHELPVQHLGGHKQ --1111-------3333------------------------1111--------------- RVEGALIGGNLTALACMAGTLGGLHAPAGSILVLEDVGEPYYRLERSLWQLLESIDARQL --------------1111-1111---2222---------------------11113333- GAICLGSFTDCPRKEVAHSLERIFGEYAAAIEVPLYHHLPSGHGAQNRAWPYGKTAVLEG ------------iiii--3333------1111------------------2222----!! 
NRLRWG !!---- >LIN-7; SWP:P90976; PDB:1ZL8A; GSLNLERDVQRILELMEHVQKTGEVNNAKLASLQQVLQSEFFGAVREVYETVY -----------------------------------1111-------------- >GLUTATHIONE S-TRANSFERASE; SWP:Q09596; PDB:1ZL9A; MVSYKLTYFNGRGAGEVSRQIFAYAGQQYEDNRVTQEQWPALKETCAAPFGQLPFLEVDG ------------3333------------------3333---------1111------iii KKLAQSHAIARFLAREFKLNGKTAWEEAQVNSLADQYKDYSSEARPYFYAVMGFGPGDVE i-------------1111------------------------------------------ TLKKDIFLPAFEKFYGFLVNFLKASGSGFLVGDSLTWIDLAIAQHSADLIAKGGDFSKFP -------------------------------------------------1111--1111- ELKAHAEKIQAIPQIKKWIETRPVTPF --------------------------- >HYPOTENSIVE PHOSPHOLIPASE; SWP:Q8AXY1; PDB:1ZLBA; SLWQFGKMINYVMGESGVLQYLSYGCYCGLGGQGQPTDATDRCCFVHDCCYGKVTGCNPK -------------1111---------------------------------1111---333 IDSYTYSKKNGDVVCGGDNPCKKQICECDRVATTCFRDNKDTYDIKYWFYGAKNCQEKSE 3-------iiii--------------------------1111-3333---3333------ PC -- >PTR NECROSIS TOXIN; SWP:P78737; PDB:1ZLDA; GNIGQVDIDSVILGRPGAIGSWELNNFITIGLNRVNADTVRVNIRNTGRTNRLIITQWDN -2222-----11112222------1111-------1111--------------------- TVTRGDVYELFGDYALIQGRGSFCLNIRSDTGRENWRMQLEN ---------------------------1111----------- >Carboxypeptidase inhibito; SWP:Q5EPH2; PDB:1ZLHB; NECVSKGFGCLPQSDCPQEARLSYGGCSTVCCDLSKLTGCKGKGGECNPLDRQCKELQAE ---1111----3333-3333------------3333--3333------1111----3333 SASCGKGQKCCVWL 33332222------ >DORMANCY SURVIVAL REGULAT; SWP:P95193; PDB:1ZLJA; DPLSGLTDQERTLLGLLSEGLTNKQIADRFLAEKTVKNYVSRLLAKLGERRTQAAVFATE ----------------1111-----------------------------3333------- LKRSRPP ------- >CARDIAC PHOSPHOLAMBAN; SWP:P26678; PDB:1ZLLA; MEKVQYLTRSAIRRASTIEMPQQARQKLQNLFINFCLILICLLLICIIVMLL ---3333---3333---------------------------------3333- >OSTEOCLAST STIMULATING FA; SWP:Q92882; PDB:1ZLMA; GQVKVFRALYTFEPRTPDELYFEEGDIIYITDMSDTNWWKGTSKGRTGLIPSNYVAEQ ---------------1111---2222----------------iiii----1111---- >PETAL DEATH PROTEIN; SWP:Q05957; PDB:1ZLPA; KTTMHRLIEEHGSVLMPGVQDALSAAVVEKTGFHAAFVSGYSVSAAMLGLPDFGLLTTTE 
----------------------------1111---------------------------- VVEATRRITAAAPNLCVVVDGDTGGGGPLNVQRFIRELISAGAKGVFLEDQVWPKKCGHM --------------------!!!!------------------------------------ RGKAVVPAEEHALKIAAAREAIGDSDFFLVARTDARAPHGLEEGIRRANLYKEAGADATF ------3333-------------------------------------------------- VEAPANVDELKEVSAKTKGLRIANMIEGGKTPLHTPEEFKEMGFHLIAHSLTAVYATARA -------------------------2222-----3333-1111----------------- LVNIMKILKEKGTTRDDLDQMATFSEFNELISLESWYEMESKFK ----------------1111--3333------------------ >PLECKSTRIN; SWP:P08567; PDB:1ZM0A; GVIIKQGCLLKQGHRRKNWKVRKFILREDPAYLHYYDPAGAEDPLGAIHLRGCVVTSVEE -------------------------------------------------2222------- ENLFEIITADEVHYFLQAATPKERTEWIKAIQMASR -------1111---------------------1111 >DEOXYNUCLEOSIDE KINASE; SWP:Q9XZT6; PDB:1ZM7A; TKYAEGTQPFTVLIEGNIGSGKTTYLNHFEKYKNDICLLTEPVEKWRNVNGVDLLELMYK -2222-----------2222--------3333------------1111iiii-------- DPKKWAMPFQSYVTLTMLQSHTAPTNKKLKIMERSIFSARYCFVENMRRNGSLEQGMYNT ------------------------------------------------------------ LEEWYKFIEESIHVQADLIIYLRTSPEVAYERIRQRARSEESCVPLKYLQELHELHEDWL -------------------------------------3333---3333------------ IHQRRPQSCKVLVLDAD ----------------- >NUCLEASE; SWP:P38446; PDB:1ZM8A; ISVHLLLGNPSGATPTKLTPDNYLMVKNQYALSYNNSKGTANWVAWQLNSSWLGNAERQD -3333---1111---3333-------3333-----3333---------3333-------- NFRPDKTLPAGWVRVTPSMYSGSGYARGHIAPSADRTKTTEDNAATFLMTNMMPQTPDNN --------1111---33332222--------3333-------3333-3333----3333- RNTWGNLEDYCRELVSQGKELYIVAGPNGSLGKPLKGKVTVPKSTWKIVVVLDSPGSGLE --------------3333---------------2222----------------2222333 GITANTRVIAVNIPNDPELNNDWRAYKVSVDELESLTGYDFLSNVSPNIQTSIESKVDN 3-1111-----------------1111--------------1111-------------- >BACTEROCIN TRANSPORT ACCE; SWP:Q8DP51_STRR6; PDB:1ZMAA; AFLDNIKDLEVTTVVRAQEALDKKETATFFIGRKTCPYCRKFAGTLSGVVAETKAHIYFI ---1111-------------------------1111------------------------ NSEEPSQLNDLQAFRSRYGIPTVPGFVHITDGQINVRCDSSSAQEIKDFAGL 
---3333--------1111----------iiii------------------- >ACETYLXYLAN ESTERASE RELA; SWP:Q97LM8; PDB:1ZMBA; VKSFLLGQSNAGRGFINEVPIYNERIQLRNGRWQTEPINYDRPVSGISLAGSFADAWSQK --------------1111----1111---------------1111--3333-----3333 NQEDIIGLIPCAEGGSSIDEWALDGVLFRHALTEAKFAESSELTGILWHQGESDSLNGNY ------------22223333-3333-------------------------1111------ KVYYKKLLLIIEALRKELNVPDIPIIIGGLGDFLGKERFGKGCTEYNFINKELQKFAFEQ ------------------------------1111---1111------------------- DNCYFVTASGLTCNPDGIHIDAISQRKFGLRYFEAFFNRKHVLEPLINENELLNLNYART -------------3333------------------1111------1111----------- HTKAEKIYIKSDFALGKISYDEFTSELKINNDLE ------------1111------------------ >DIHYDROLIPOYL DEHYDROGENA; SWP:P09622; PDB:1ZMDA; QPIDADVTVIGSGPGGYVAAIKAAQLGFKTVCIEKNETLGGTCLNVGCIPSKALLNNSHY ------------3333-------1111--------------------------------- YHMAHGTDFASRGIEMSEVRLNLDKMMEQKSTAVKALTGGIAHLFKQNKVVHVNGYGKIT ------3333-------------------------------------------------- GKNQVTATKADGGTQVIDTKNILIATGSEVTPFPGITIDEDTIVSSTGALSLKKVPEKMV --------1111--------------------2222--------3333------------ VIGAGVIGVELGSVWQRLGADVTAVEFLGHVGGVGIDMEISKNFQRILQKQGFKFKLNTK --------------------------------2222------------1111-------- VTGATKKSDGKIDVSIEAASGGKAEVITCDVLLVCIGRRPFTKNLGLEELGIELDPRGRI ------3333-------1111------------------------3333-----1111-- PVNTRFQTKIPNIYAIGDVVAGPMLAHKAEDEGIICVEGMAGGAVHIDYNCVPSVIYTHP --1111---1111---1111------------------1111-----3333--------- EVAWVGKSEEQLKEEGIEYKVGKFPFAANSRAKTNADTDGMVKILGQKSTDRVLGAHILG ------------------------3333-------------------------------- PGAGEMVNEAALALEYGASCEDIARVCHAHPTLSEAFREANLAASFGKSINF -3333--------1111-----1111-----3333----------------- >Proline utilization trans; SWP:P25502; PDB:1ZMEC; SVACLSCRKRHIKCPGGNPCQKCVTSNAICEYLEPSKKIVVSTKYLQQLQKDLNDKTEEN -------------------33331111--------------------------------- NRLKALLLER -----3333- >NEUTROPHIL DEFENSIN 4; SWP:P12838; PDB:1ZMMA; VCSCRLVFCRRTELRVGNCLIGGVSFTYCCT ---------1111-------iiii------- >HALOHYDRIN 
DEHALOGENASE; SWP:Q93MS3; PDB:1ZMOA; VIALVTHARHFAGPAAVEALTQDGYTVVCHDASFADAAERQRFESENPGTIALAEQKPER ------1111----------1111------1111------------2222------1111 LVDATLQHGEAIDTIVSNDYIPRPMNRLPLEGTSEADIRQMFEALSIFPILLLQSAIAPL -----1111--------------------2222--------------------------- RAAGGASVIFITSSVGKKPLAYNPLYGPARAATVALVESAAKTLSRDGILLYAIGPNFFN 1111--------1111---1111-------------------3333-------------- NPTYFPTSDWENNPELRERVDRDVPLGRLGRPDEMGALITFLASRRAAPIVGQFFAFTGG -----3333--------------3333---3333------------3333-------iii YLP i-- >DEFENSIN 5; SWP:Q01523; PDB:1ZMPA; ATCYCRTGRCATRESLSGVCEISGRLYRLCCR ----------1111-------iiii------- >DEFENSIN 6; SWP:Q01524; PDB:1ZMQA; AFTCHCRRSCYSTEYSYGTCTVMGINHRFCCL ----------1111-------iiii------- >PHOSPHOGLYCERATE KINASE; SWP:P0A7A1; PDB:1ZMRA; SVIKMTDLDLAGKRVFIRADLNVPVKDGKVTSDARIRASLPTIELALKQGAKVMVTSHLG ---3333--2222------------%%%%---------3333---3333----------- RPTEGEYNEEFSLLPVVNYLKDKLSNPVRLVKDYLDGVDVAEGELVVLENVRFNKGEKKD --2222-3333-----------------------------2222-----11112222--- DETLSKKYAALCDVFVMDAFGTAHRAQASTHGIGKFADVACAGPLLAAELDALGKALKEP --------1111------3333----11113333-------------------------- ARPMVAIVGGSKVSTKLTVLDSLSKIADQLIVGGGIANTFIAAQGHDVGKSLYEADLVDE -----------3333--------------------------1111--!!!!--1111--- AKRLLTTCNIPVPSDVRVATEFSETAPATLKSVNDVKADEQILDIGDASAQELAEILKNA ----------------------1111-----1111-1111----------------1111 KTILWNGPVGVFEFPNFRKGTEIVANAIADSEAFSIAGGGDTLAAIDLFGIADKISYIST ----------3333------------------------------------1111------ GGGAFLEFVEGKVLPAVAMLEERAKK -------------------------- >HALOALCOHOL DEHALOGENASE ; SWP:Q7AUG5; PDB:1ZMTA; STAIVTNVKHFGGMGSALRLSEAGHTVACHDESFKQKDELEAFAETYPQLKPMSEQEPAE ------1111----------1111------3333------------1111---------- LIEAVTSAYGQVDVLVSNDIFAPEFQPIDKYAVEDYRGAVEALQIRPFALVNAVASQMKK --------------------------1111-3333------------------------- RKSGHIIFITSATPFGPWKELSTYTSARAGACTLANALSKELGEYNIPVFAIGPNYLHSE -----------3333---------------------33333333--------------!! 
DSPYFYPTEPWKTNPEHVAHVKKVTALQRLGTQKELGELVAFLASGSCDYLTGQVFWLAG !!-------1111-----------1111--------------3333-1111-------ii GFPMIERWPGMP ii-----2222- >ANTIBODY CABBCII-10:LYS3; SWP:P00698; PDB:1ZMYA; QVQLVESGGGSVQAGGSLRLSCTASGYTIGPYCMGWFRQAPGGEREAVAAINMGGGITYY ------------2222-------------------------------------------- ADSVKGRFTISRDNAKNTVTLQMNSLKPEDTAMYYCAADSTIYASYYECGHGLSTGGYGY 3333----------------------3333------------------------------ DSWGQGTQVTVSS ------------- >PHAGE-RELATED CONSERVED H; SWP:Q7WLM8; PDB:1ZN6A; CSHYQALKDQERRKYFAAHPSAEVPADWPRYGAFIRRPLVPEREAATGRWGIPPGTRPEK --------3333---------------2222---------------------11111111 LAEASKKNTSNARSETAHQLWTFRNAWAKAQHCIIPADAIYEPDWRSGKAVPTRFTRADG ---1111-----3333----------------------------1111--------1111 APLGIAGLWDRYRNAAGEWIDSYTLTINADDDPLFRDYHQAGKEKRVVILPDGAYGDWLT -------------1111-----------1111-3333-------------3333------ APATDTRDFLLPYPADRLVAAAVKL -3333-1111---1111-------- >ADENINE PHOSPHORIBOSYLTRA; SWP:P07741; PDB:1ZN8A; DSELQLVEQRIRSFPDFPTPGVVFRDISPVLKDPASFRAAIGLLARHLKATHGGRIDYIA 3333--3333----------------3333---------------------!!!!----- GLDSRGFLFGPSLAQELGLGCVLIRKRGKLPGPTLWASYSLEYGKAELEIQKDALEPGQR ---3333------------------2222-----------!!!!------1111-2222- VVVVDDLLATGGTMNAACELLGRLQAEVLECVSLVELTSLKGREKLAPVPFFSLLQYE ---------------------1111-----------3333------------------ >CARBONIC ANHYDRASE IV; SWP:P22748; PDB:1ZNCA; WCYEVQAESSNYPCL --3333--------- >MAJOR URINARY PROTEIN; SWP:P11589; PDB:1ZNDA; EEASSTGRNFNVEKINGEWHTIILASDKREKIEDNGNFRLFLEQIHVLEKSLVLKFHTVR ---1111---3333-------------3333-1111-----------------------% DEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLINEKDGETFQLMGLY %%%------------2222-----------------------------iiii-------- GREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRC ------------------1111-1111---1111--- >PLP SYNTHASE; SWP:Q5L3Y2; PDB:1ZNNA; KGGVIMDVVNAEQAKIAEAAGAVAVMALEGGVARMADPTVIEEVMNAVSIPVMAKVRIGH -----------------1111---------------3333----------------2222 
YVEARVLEALGVDYIDESEVLTPADEEFHIDKRQFTVPFVCGCRDLGEAARRIAEGASML -------3333------1111---------3333------------------1111---- RTKGEPGTGNIVEAVRHMRKVNAQIRKVVNMSEDELVAEAKQLGAPVEVLREIKRLGRLP ----2222-----------------------1111----------3333----------- VVNFAAGGVTTPADAALMMHLGADGVFVGSGIFKSENPEKYARAIVEATTHYEDYELIAH ------------------1111------3333------------------1111-----3 LSKGL 333-- >HYPOTHETICAL UPF0244 PROT; SWP:Q9KU27; PDB:1ZNOA; RKIIIASQNPAKVNAVRSAFSTVFPDQEWEFIGVSVPSEVADQPSDEETKQGALNRVRNA ------------------------------------------------------------ KQRHPGAEYYVGLEAGIEENKTFAWIVESDQQRGESRSACLLPPLVLERLLGDVDEVENI ------------------------------------------3333------------33 KQKGGAIGLLTRHHLTRSTVYHQALILALIPFINPEHYPS 33--------------------------3333-3333--- >HYPOTHETICAL PROTEIN ATU3; SWP:Q8U9W0; PDB:1ZNPA; DSAQAIIRELGLEPHPEGGFYHQTFRDKAGGERGHSTAIYYLLEKGVRSHWHRVTDAVEV -------1111---1111------------3333-------------------------- WHYYAGAPIALHLSQDGREVQTFTLGPAILEGERPQVIVPANCWQSAESLGDFTLVGCTV ---------------------------1111--------2222----------------- SPGFAFSSFVAEPGWSPG ----1111---------- >GUANYLATE KINASE; SWP:P0A5I4; PDB:1ZNWA; VGRVVVLSGPSAVGKSTVVRCLRERIPNLHFSVSATTRAPRPGEVDGVDYHFIDPTRFQQ -------------------------1111-----------22222222------------ LIDQGELLEWAEIHGGLHRSGTLAQPVRAAAATGVPVLIEVDLAGARAIKKTMPEAVTVF -1111-------%%%%------------------------------------1111---- LAPPSWQDLQARLIGRGTETADVIQRRLDTARIELAAQGDFDKVVVNRRLESACAELVSL -------------!!!!-------------------3333-------------------- LV -- >ORNITHINE DECARBOXYLASE A; SWP:P54370; PDB:1ZO0A; ILYSDERLNVTEEPTSNDKTRVLSIQCTLTEAKQVTWRAVWNGGGLYIELPAGPLPEGSK ----2222---------------------------------!!!!--------------- DSFAALLEFAEEQLRADHVFICFPKNREDRAALLRTFSFLGFEIVRPGHPLVPKRPDACF ------------------------------------------------------------ MVYTLE ------ >NUCLEAR TRANSPORT FACTOR ; SWP:Q5CQI4; PDB:1ZO2A; SINLNPQFDQIGKQFVQHYYQTFQTNRPALGGLYGPQSMLTWEDTQFQGQANIVNKFNSL ----1111-----------------3333-----1111------------------3333 
NFQRVQFEITRVDCQPSPNNGSIVFVTGDVRIDDGQPLKFSQVFNLMPSGNGGFMIFNDL -------------------------------%%%%------------------------- FRLN ---- >2,2-DIALKYLGLYCINE DECARB; SWP:P16932; PDB:1ZODA; LNDDATFWRNARHHLVRYGGTFEPMIIERAKGSFVYDADGRAILDFTSGQMSAVLGHCHP 1111--------------------------!!!!--1111------------1111---- EIVSVIGEYAGKLDHLFSEMLSRPVVDLATRLANITPPGLDRALLLSTGAESNEAAIRMA ----------------1111----------------2222-------------------- KLVTGKYEIVGFAQSWHGMTGAAASATYSAGRKGVGPAAVGSFAIPAPFTYRPRFERNGA -----------------------1111-----------2222------3333----%%%% YDYLAELDYAFDLIDRQSSGNLAAFIAEPILSSGGIIELPDGYMAALKRKCEARGMLLIL -------------------------------1111----2222--------1111----- DEAQTGVGRTGTMFACQRDGVTPDILTLSKTLGAGLPLAAIVTSAAIEERAHELGYLFYT -------1111--3333-----------1111-------------------1111----1 THVSDPLPAAVGLRVLDVVQRDGLVARANVMGDRLRRGLLDLMERFDCIGDVRGRGLLLG 111----------------1111----------------------1111----------- VEIVKDRRTKEPADGLGAKITRECMNLGLSMNIVQLPGMGGVFRIAPPLTVSEDEIDLGL -----3333--------------------------2222--------1111--------- SLLGQAIERAL ----------- >ALKYL HYDROPEROXIDE-REDUC; SWP:P56876; PDB:1ZOFA; MVVTKLAPDFKAPAVLGNNEVDEHFELSKNLGKNGVILFFWPKDFTFVCPTEIIAFDKRV -2222--------------------3333------------------------------- KDFHEKGFNVIGVSIDSEQVHFAWKNTPVEKGGIGQVSFPMVADITKSISRDYDVLFEEA ---1111---------3333--3333-1111------------1111---------%%%% IALRGAFLIDKNMKVRHAVINDLPLGRNADEMLRMVDALLHFEEHGEVCP ---------1111-------------3333----------3333------ >ESTERASE; SWP:Q3HWU8; PDB:1ZOIA; SYVTTKDGVQIFYKDWGPRDAPVIHFHHGWPLSADDWDAQLLFFLAHGYRVVAHDRRGHG ----1111---------1111-----------------------1111-------2222- RSSQVWDGHDMDHYADDVAAVVAHLGIQGAVHVGHSTGGGEVVRYMARHPEDKVAKAVLI --------------------------2222------------------3333-------- AAVPPLMVQTPGNPGGLPKSVFDGFQAQVASNRAQFYRDVPAGPFYGYNRPGVEASEGII ---------3333----3333-----------3333--3333-1111--2222------- GNWWRQGMIGSAKAHYDGIVAFSQTDFTEDLKGIQQPVLVMHGDDDQIVPYENSGVLSAK ------1111--------------------1111---------------3333------- 
LLPNGALKTYKGYPHGMPTTHADVINADLLAFIRS ---------------3333---------------- >PRESYNAPTIC PROTEIN SAP97; SWP:Q62696; PDB:1ZOKA; EYEEITLERGNSGLGFSIAGGTDNPHIGDDSSIFITKIITGGAAAQDGRLRVNDCILRVN ----------------------------------------------------------ii EADVRDVTHSKAVEALKEAGSIVRLYVKRRKAF ii------3333--------------------- ----------------------------------------------- >ISOCITRATE DEHYDROGENASE; SWP:Q9X0N2; PDB:1ZORA; MEKVKVKNPIVELDGDEMARVMWKMIKEKLILPYLDIQLVYFDLGIKKRDETDDQITIEA ------------------------------3333-------------------------- AKAIKKYGVGVKCATITPDAERVKEYNLKKAWKSPNATIRAYLDGTVFRKPIMVKNVPPL -----------------------------------------------------1111--- VKRWKKPIIIGRHAYGDIYNAVEAKVEGPAEVELVVRNKENKTLLVHKFEGNGVVMAMHN 3333------------3333---------------------------------------- LEKSIRSFAQSCINYAISEKVDIWFATKDTISKVYHAYFKDIFQEEVDKRKEELEKAGVN ---------------------------3333---3333----------------1111-- YRYMLIDDAAAQILRSEGGMLWACMNYEGDIMSDMIASGFGSLGLMTSVLVSPDGVYEFE ------------1111---------------------3333----------1111----- AAHGTVRRHYYRYLKGEKTSTNPTASIFAWTGAIRKRGELDGTPEVCEFADKLEKAVINT ------------1111-------------------------------------------- IESGVITKDLQPFTEPPIDKYVTLEEFIDEVKKNLEKLL ------33331111---------------------1111 >5'-methylthioadenosine / ; SWP:Q8DQ16; PDB:1ZOSA; MKIGIIAAMPEELAYLVQHLDNTQEQVVLGNTYHTGTIASHEVVLVESGIGKVMSAMSVA --------3333----1111-------iiii------%%%%------------------- ILADHFQVDALINTGSAGAVAEGIAVGDVVIADKLAYHDVDVTAFGYAYGQMAQQPLYFE --------------------22222222-------------3333--22222222----- SDKTFVAQIQESLSQLDQNWHLGLIATGDSFVAGNDKIEAIKSHFPEVLAVEMEGAAIAQ ---------11111111---------------------------1111------------ AAHTLNLPVLVIRAMSDNANHEANIFFDEFIIEAGRRSAQVLLAFLKALD --1111-------------1111--------------------------- >MONOMERIC SARCOSINE OXIDA; SWP:P23342; PDB:1ZOVA; STHFDVIVVGAGSMGMAAGYYLAKQGVKTLLVDSFDPPHTNGSHHGDTRIIRHAYGEGRE -----------3333-------1111-------------------------------333 YVPFALRAQELWYELEKETHHKIFTQTGVLVYGPKGGSAFVSETMEAANIHSLEHELFEG 
3--------------1111--------------2222-----------1111------!! KQLTDRWAGVEVPDNYEAIFEPNSGVLFSENCIQAYRELAEAHGATVLTYTPVEDFEVTE !!----1111--1111------------------------1111--------------11 DLVTIKTAKGSYTANKLVVSMGAWNSKLLSKLDVEIPLQPYRQVVGFFECDEAKYSNNAH 11------------------!!!!-----1111-----------------3333-3333- YPAFMVEVENGIYYGFPSFGGSGLKIGYHSYGQQIDPDTINREFGAYPEDEANLRKFLEQ -------1111-------%%%%-------------1111---22223333------3333 YMPGANGELKKGAVCMYTKTPDEHFVIDLHPKYSNVAIAAGFSGHGFKFSSVVGETLAQL -1111--------------1111---------1111-----%%%%3333----------- ATTGKTEHDISIFSLNRDALK ---------111111111111 >3-OXOACYL-[ACYL-CARRIER-P; SWP:Q8NXE2; PDB:1ZOWA; MNVGIKGFGAYAPEKIIDNAYFEQFLDTSDEWISKMTGIKERHWADDDQDTSDLAYEASV -----------------33333333--------------------11113333------- KAIADAGIQPEDIDMIIVATATGDMPFPTVANMLQERLGTGKVASMDQLAACSGFMYSMI ---1111-3333-----------------------1111----------!!!!------- TAKQYVQSGDYHNILVVGADKLSKITDLTDRSTAVLFGDGAGAVIIGEVSEGRGIISYEM --------------------3333-----3333----------------2222------- GSDGTGGKHLYLDKDTGKLKMNGREVFKFAVRIMGDASTRVVEKANLTSDDIDLFIPHQA --11111111--------------------------------1111-3333--------- NIRIMESARERLGISKDKMSVSVNKYGNTSAASIPLSIDQELKNGKLKDDDTIVLVGFGG ---------1111-1111---3333---!!!!---------------2222--------- GLTWGAMTIKWG ------------ >CLM-1; SWP:Q6SJQ7; PDB:1ZOXA; EDPVTGPEEVSGQEQGSLTVQCRYTSGWKDYKKYWCQGVPQRSCKTLVETDASEQLVKKN ------------2222--------3333-----------1111---------------!! 
RVSIRDNQRDFIFTVTMEDLRMSDAGIYWCGITKGGLDPMFKVTVNIGPV !!------------------3333-------------------------- >Succinate dehydrogenase [; SWP:P21912; PDB:1ZOYB; PRIKKFAIYRWDPDKTGDKPHMQTYEIDLNNCGPMVLDALIKIKNEIDSTLTFRRSCREG --------------2222---------1111---3333---------------------- ICGSCAMNINGGNTLACTRRIDTNLDKVSKIYPLPHMYVIKDLVPDLSNFYAQYKSIEPY --------iiii--1111-----3333-------------!!!!----------1111-- LKKKDESQEGKQQYLQSIEEREKLDGLYECILCACCSTSCPSYWWNGDKYLGPAVLMQAY -----1111----------33332222--------1111--------------------- RWMIDSRDDFTEERLAKLQDPFSLYRCHTIMNCTGTCPKGLNPGKAIAEIKKMMATYKE -1111----------1111-----------3333--1111------------1111--- >RNA POLYMERASE II HOLOENZ; SWP:O94503; PDB:1ZP2A; WASSQLTQLFLSTDLESLEPTCLSKDTIYQWKVVQTFGDRLRLRQRVLATAIVLLRRYML -----------------------3333--------------------------------- KKNEEKGFSLEALVATCIYLSCKVEECPVHIRTICNEANDLWSLKVKLSRSNISEIEFEI ----------------------1111---3333---------------3333-------- ISVLDAFLIVHHPYTSLEQAFHDGIINQKQLEFAWSIVNDSYASSLCLMAHPHQLAYAAL ---%%%%-------------------------------------3333--3333------ LISCCNDENTIPKLLDLIKSTDAFKVILCVQRIISIYYFEDIEAAAL ---1111-------33333333-------------------3333-- >5,10-METHYLENETETRAHYDROF; SWP:P00394; PDB:1ZP4A; FFHASQRDALNQSLAEVQGQINVSFQFFPPRTSEMEQTLWNSIDRLSSLKPKFVSVTYGR 3333------------2222--------------------------1111---------- THSIIKGIKDRTGLEAAPHLTCIDATPDELRTIARDYWNNGIRHIVALRGDLPPGPEMYA ------------------------------------------------------------ SDLVTLLKEVADFDISVAAYPEVHPEAKSAQADLLNLKRKVDAGANRAITQFFFDVESYL -------3333--------11113333---------------------------3333-- RFRDRCVSAGIDVEIIPGILPVSNFKQAKKFADMTNVRIPAWMAQMFDGLDDDAETRKLV ------1111------------------------------------2222---------- GANIAMDMVKILSREGVKDFHFYTLNRAEMSYAICHTLGVRP ------------1111-------%%%%--------------- >HYPOTHETICAL PROTEIN ATU3; SWP:Q8UBK1; PDB:1ZP6A; DLGGNILLLSGHPGSGKSTIAEALANLPGVPKVHFHSDDLWGYIKHGRIDPWLPQSHQQN ----------------------------------------3333-------------333 
RIQIAADVAGRYAKEGYFVILDGVVRPDWLPAFTALARPLHYIVLRTTAAEAIERCLDRG 3--3333----3333----------33331111--------------------3333--- GDSLSDPLVVADLHSQFADLGAFEHHVLPVSGKDTDQALQSAINALQSGRFRID -----3333----------!!!!----------1111----------------- >RECOMBINATION PROTEIN U; SWP:P39792; PDB:1ZP7A; TLEDDLNETNKYYLTNQIAVIHKKPTPVQIVNAYFKQSSTTDYNGIYKGRYIDFEAKETK -------------1111-----------------------------%%%%---------- NKTSFPLQNFHDHQIEHMKQVKAQDGICFVIISAFDQVYFLEADKLFYFWDRKEKNGRKS -----1111------------1111--------%%%%----------------------- IRKDELEETAYPISLGYAPRIDYISIIEQLYFS --------------------------------- >PYRUVATE DECARBOXYLASE; SWP:P06672; PDB:1ZPDA; SYTVGTYLAERLVQIGLKHHFAVAGDYNLVLLDNLLLNKNMEQVYCCNELNCGFSAEGYA ------------1111--------1111-------------------------------- RAKGAAAAVVTYSVGALSAFDAIGGAYAENLPVILISGAPNNNDHAAGHVLHHALGKTDY ------------3333------------------------3333---------------- HYQLEMAKNITAAAEAIYTPEEAPAKIDHVIKTALREKKPVYLEIACNIASMPCAAPGPA ------3333--------3333-----------------------1111---------33 SALFNDEASDEASLNAAVDETLKFIANRDKVAVLVGSKLRAAGAEEAAVKFTDALGGAVA 33-------------------------------------1111----------------- TMAAAKSFFPEENALYIGTSWGEVSYPGVEKTMKEADAVIALAPVFNDYSTTGWTDIPDP -1111----1111-------!!!!-2222-----------------3333-%%%%---33 KKLVLAEPRSVVVNGIRFPSVHLKDYLTRLAQKVSKKTGSLDFFKSLNAGELKKAAPADP 33----------iiii----------------------------3333----------11 SAPLVNAEIARQVEALLTPNTTVIAETGDSWFNAQRMKLPNGARVEYEMQWGHIGWSVPA 11-----------11111111------------1111--2222---------2222---- AFGYAVGAPERRNILMVGDGSFQLTAQEVAQMVRLKLPVIIFLINNYGYTIEVMIHDGPY -------3333------3333----------------------------3333----333 NNIKNWDYAGLMEVFNGNGGYDSGAAKGLKAKTGGELAEAIKVALANTDGPTLIECFIGR 3-----3333------2222--------------------------------------11 EDCTEELVKWGKRVAAANSRKPVNK 11----------------------- >PHOSPHORIBOSYL-AMP CYCLOH; SWP:O26347; PDB:1ZPSA; VNILLNFRHNINGEDLIIAVAQDHETGEVLMVAYMNREALRRTLETGTAHYWSTSRGKLW 3333-------------------------------------------------------- 
LKGESSGHVQRVKDVLVDCDGDAVVLKVEQEGGACHTGYRSCFYRSIDGDELKVREDAVK 2222-------------1111-------------1111----------------1111-- VFDP ---- >IRON TRANSPORT MULTICOPPE; SWP:P38993; PDB:1ZPUA; ETHTFNWTTGWDYRNVDGLKSRPVITCNGQFPWPDITVNKGDRVQIYLTNGMNNTNTSMH --------------3333--------iiii--------2222------------------ FHGLFQNGTASMDGVPFLTQCPIAPGSTMLYNFTVDYNVGTYWYHSHTDGQYEDGMKGLF 2222-22221111-2222-----2222-----------------------3333------ IIKDDSFPYDYDEELSLSLSEWYHDLVTDLTKSFMSVYNPTGAEPIPQNLIVNNTMNLTW -----------------------------------1111------------%%%%----- EVQPDTTYLLRIVNVGGFVSQYFWIEDHEMTVVEIDGITTEKNVTDMLYITVAQRYTVLV ------------------------2222------iiii------------2222------ HTKNDTDKNFAIMQKFDDTMLDVIPSDLQLNATSYMVYNKTAALPTQNYVDSIDNFLDDF ----------------3333----1111----------1111---------------333 YLQPYEKEAIYGEPDHVITVDVVMDNLKNGVNYAFFNNITYTAPKVPTLMTVLSSGDQAN 3-------------------------1111-----!!!!---------------!!!!-- NSEIYGSNTHTFILEKDEIVEIVLNNQDTGTHPFHLHGHAFQTIQRDRTYDDALGEVPHS 3333----------------------------------------------3333------ FDPDNHPAFPEYPMRRDTLYVRPQSNFVIRFKADNPGVWFFHCHIEWHLLQGLGLVLVED -1111----------------2222----------------------------------- PFGIQDAHSQQLSENHLEVCQSCSVATEGNAAANTLDLTDLTGENVQHA ------3333----------1111------------------------- >ACT DOMAIN PROTEIN; SWP:P67382; PDB:1ZPVA; KAIITVVGKDKSGIVAGVSGKIAELGLNIDDISQTVLDEYFTAVVSDEKQDFTYLRNEFE ----------2222----------------------!!!!----------3333------ AFGQTLNVKINIQSAAIFE ---1111------3333-- >Putative uncharacterized ; SWP:Q746F4; PDB:1ZPWX; GKRLYAVAYDIPDDTRRVKLANLLKSYGERVQLSVFECYLDERLLEDLRRRARRLLDLGQ ---------------------------------------------------3333-1111 DALRIYPVAGQVEVLGVGPLPE ---------------------- >HYPOTHETICAL PROTEIN NE01; SWP:Q82XT5; PDB:1ZPYA; DGYFEPTQELSDETRDHRAIISLREELEAVDLYNQRVNACKDKELKAILAHNRDEEKEHA -----1111-3333---------------------------3333--------------- ALLEWIRRCDPAFDKELKDYLFTNKPIAH ----------------------------- >GLUTAMYL-TRNA(GLN) AMIDOT; SWP:Q9V0T9; PDB:1ZQ1A; 
RVDEFLKERNINVGDFVRITKEEDGEEVTYEGYIPPYELSAGDTLVLKLENGYNIGIALE ---------------------------------------------------------333 KIRRIEVLERAKVKPEVHFEALIEGKPGLPEVTIIGTGGTIASRIDYETGAVYPAFTAEE 3---------------------------------------------3333---------- LAKALPEIFEVANVKPKLLFNIFSEDKPKHWVKIAHEVAKALNSGDYGVVVAHGTDTGYT ------1111------------33333333-----------------------------3 AAALSFLRNLGKPVVLVGAQRSSDRPSSDAANLICSVRATSEVAEVVVHGETGDTYCLAH 333------------------3333----------------------------------- RGTKVRKHTSRRDAFRSINDVPIAKIWPNGEIEFLRKDYRKRSDEEVEVDDKIEEKVALV ----------3333------------3333------------------------------ KVYPGISSEIIDFLVDKGYKGIVIEGTGLGHTPNDIIPSIERAVEEGVAVCTSQCIYGRV --2222--------------------!!!!--1111------------------------ NLNVYSTGRKLLKAGVIPCEDLPETAYVKLWVLGHTQNLEEVRKLTNYAGEITPYTRFDT ---------------------3333-----1111-------------------------- YLR --- >Glutamyl-tRNA(Gln) amidot; SWP:Q9V0U0; PDB:1ZQ1C; TDKFNYEELGLKVGLEIHRQLDTKKLFSPVPSELSDKVEFTFQRRLRPTMSELGEIDPAA ------------------------------------------------------------ LEEFKKGRVYVYEGNYELTDLVYMDEEPPRGPDREALEVALQIAYLLNAKPVDEVYYMRK --------------1111-3333---------------------1111------------ IVIDGSNVSGFQRTAIIATDGKVETPWGAVGIPTICLEEDAARIIERKDKEVIYRLDRLG ------1111--------------1111---------------------------3333- IPLIEISTTPDIHHPEQAKVVAKFIGDALRATKKVKRGLGTIRQDLNVSIKGGARIEIKG -------------3333--------------------------------2222------- VQELDMIPIIIEREVERQLNLLKIRDELRKRGVKPKDIKEEFYDVTDIFENTKSKIIARV --1111---------------------------3333---------1111---------- IKKGGKVLAIKLPKFRGLIGREIQPGRRLGTEFADRAKKYVPGIFHIDELPNYGISQEEV 1111----------2222-----22223333--------------3333--iiii----- NKVIERLNLSEEDAFVLVAAEEEKAKNALREVIKRAREAIEGVPEETRRALPDGNTEYMR ---------1111-------------------------------------3333------ PLPGKARMYPETDIPPLRIPDDLKKKIKENLPELPQAKVERYVKEYKLDRSLAQTLVDDE ----------3333------------1111------------------3333-------- RDELFEELVSMGVKPSLAASILVVVLKG --3333---------------------- >Homeotic protein bicoid; SWP:Q9UAM0; PDB:1ZQ3P; 
GPRRTRTTFTSSQIAELEQHFLQGRYLTAPRLADLSAKLALGTAQVKIWFKNRRRRHKIQ -------------------3333------------------3333--------------- SDQHKDQS -------- >ORNITHINE CARBAMOYLTRANSF; SWP:Q8P8J2; PDB:1ZQ6A; LKHFLNTQDWSRAELDALLTQAALFKRNKLGSELKGKSIALVFFNPSMRTRTSFELGAFQ -----3333------------------------2222----------------------- LGGHAVVLQPGKDAWPIEFNLGTVMDGDTEEHIAEVARVLGRYVDLIGVRAFPKFVDWSK --------3333---------------------------1111-------------3333 DREDQVLKSFAKYSPVPVINMETITHPCQELAHALALQEHFGTPDLRGKKYVLTWTYHPK 1111-------------------------------------------------------- PLNTAVANSALTIATRMGMDVTLLCPTPDYILDERYMDWAAQNVAESGGSLQVSHDIDSA --------------------------3333------------------------------ YAGADVVYAKSWGALPFFGNWEPEKPIRDQYQHFIVDERKMALTNNGVFSHCLPLRRNVK 2222---------3333---3333------------33331111-----------2222- ATDAVMDSPNCIAIDEAENRLHVQKAIMAALV -------1111--------------------- >HYPOTHETICAL PROTEIN MM04; SWP:Q8PZK8; PDB:1ZQ7A; LTETEGRAAVKLARKTIEIFLSKGKSPRSGVELSPVFEEYRGVFVTLTEGGLLRGCIGHP ---------------------------------3333-----------iiii-------- YPDSTLKEAILDSAISAATRDPRFPTVEQDEKNILVEVTILTQPEKINASPKELPDKVEI --------------------1111---3333------------------33333333--- GKHGLIVKQGYCQGLLLPQVAPENDDSIDFLSHTCKAGLSPDAWVKGAEVYCFEGQIFKE --------!!!!----33333333---------------11111111------------- KEPDGEVIEEKFLEHHH -2222------------ >PROBABLE DIMETHYLADENOSIN; SWP:Q9UNQ2; PDB:1ZQ9A; QHILKNPLIINSIIDKAALRPTDVVLEVGPGTGNMTVKLLEKAKKVVACELDPRLVAELH --------------3333-1111------!!!!-----1111------------------ KRVQGTPVASKLQVLVGDVLKTDLPFFDTCVANLPYQISSPFVFKLLLHRPFFRCAILMF --2222-1111------3333-------------3333---------------------- QREFALRLVAKPGDKLYCRLSINTQLLARVDHLMKVGKNNFRPPPKVESSVVRIEPKNPP -------------1111-------------------3333-------------------- PPINFQEWDGLVRITFVRKNKTLSAAFKSSAVQQLLEKNYRIHCSVHNIIIPEDFSIADK -----------------1111---1111-----------------------1111----- IQQILTSTGFSDKRARSMDIDDFIRLLHGFNAEGIHFS ---------11113333------------3333----- >Tissue factor pathway inh; SWP:P48307; PDB:1ZR0B; 
PTGNNAEICLLPLDYGPCRALLLRYYYDRYTQSCRQFLYGGCEGNANNFYTWEACDDACW ---33331111-----------------1111------------------------1111 RIE --- >H2AFY PROTEIN; SWP:O75367; PDB:1ZR5A; DGFTVLSTKSLFLGQKLNLIHSEISNLAGFEVEAIINPTNADIDLKDDLGNTLEKKGGKE ----------1111--------33331111---------1111----------------- FVEAVLELRKKNGPLEVAGAAVSAGHGLPAKFVIHCNSPVWGADKCEELLEKTVKNCLAL ---------------2222-----3333-----------2222----------------- ADDKKLKSIAFPSIGSGRNGFPKQTAAQLILKAISSYFVSTMSSSIKTVYFVLFDSESIG -1111-----------1111-----------------1111------------------- IYVQEMAKL --------- >GLUCOOLIGOSACCHARIDE OXID; SWP:Q6PW77; PDB:1ZR6A; NSINACLAAADVEFHEEDSEGWDMDGTAFNLRVDYDPAAIAIPRSTEDIAAAVQCGLDAG -------1111----1111----1111--3333--------------------------- VQISAKGGGHSYGSYGFGGEDGHLMLELDRMYRVSVDDNNVATIQGGARLGYTALELLDQ ----------11111111------------------1111----1111------------ GNRALSHGTCPAVGVGGHVLGGGYGFATHTHGLTLDWLIGATVVLADASIVHVSETENAD ---------1111-----------1111----3333--------1111-----1111--- LFWALRGGGGGFAIVSEFEFNTFEAPEIITTYQVTTTWNRKQHVAGLKALQDWAQNTMPR ----------------------------------------------------------11 ELSMRLEINANALNWEGNFFGNAKDLKKILQPIMKKAGGKSTISKLVETDWYGQINTYLY 11----------------------------------------------------1111ii GADLNITYNYDVHEYFYANSLTAPRLSDEAIQAFVDYKFDNSSVRPGRGWWIQWDFHGGK ii------------------------------------------2222----------11 NSALAAVSNDETAYAHRDQLWLWQFYDSIYDYENNTSPYPESGFEFMQGFVATIEDTLPE 111111-1111----1111-----------3333--------3333-------3333-33 DRKGKYFNYADTTLTKEEAQKLYWRGNLEKLQAIKAKYDPEDVFGNVVSVEPIAY 33---3333-1111----------1111----------1111------------- >HUNTINGTIN-INTERACTING PR; SWP:NA; PDB:1ZR7A; GSWTEHKSPDGRTYYYNTETKQSTWEKPDD -------1111------------------- >ZINC FINGER PROTEIN 593; SWP:O00488; PDB:1ZR9A; DPNAEFDPDLPGGGLHRCLACARYFIDSTNLKTHFRSKDHKKRLKQLSVEPYSQEEAERA ---------2222----3333--------------------------------------- AGMGSYV ------- >Heparan sulfate glucosami; SWP:O14792; PDB:1ZRHA; VAPNGSAQQLPQTIIIGVRKGGTRALLEMLSLHPDVAAAENEVHFFDWEEHYSHGLGWYL 
-1111------------2222-----------1111--------1111-----------1 SQMPFSWPHQLTVEKTPAYFTSPKVPERVYSMNPSIRLLLILRDPSERVLSDYTQVFYNH 111---1111-----3333--1111-------1111-----------------------3 MQKHKPYPSIEEFLVRDGRLNVDYKALNRSLYHVHMQNWLRFFPLRHIHIVDGDRLIRDP 333-----3333---%%%%-11113333----------3333-1111------------- FPEIQKVERFLKLSPQINASNFYFNKTKGFYCLRDSGRDRCLHESKGRAHPQVDPKLLNK -----------------1111----3333-------------3333-------------- LHEYFHEPNKKFFELVGRTFDWH ----------------------- >E1B-55KDA-ASSOCIATED PROT; SWP:NA; PDB:1ZRJA; GMDVRRLKVNELREELQRRGLDTRGLKAELAERLQAALSGPSSG --3333----------1111------------------------ >L-2-HALOACID DEHALOGENASE; SWP:Q53464; PDB:1ZRN; YIKGIAFDLYGTLFDVHSVVGRCDEAFPGRGREISALWRQKQLEYTWLRSLMNRYVNFQQ --------2222--------------2222------------------------------ ATEDALRFTCRHLGLDLDARTRSTLCDAYLRLAPFSEVPDSLRELKRRGLKLAILSNGSP --------------------------3333----3333-------1111----------- QSIDAVVSHAGLRDGFDHLLSVDPVQVYKPDNRVYELAEQALGLDRSAILFVASNAWDAT -------11111111------3333-------------------3333------------ GARYFGFPTCWINRTGNVFEEMGQTPDWEVTSLRAVVELF ------------------------------------1111 >ERYTHROCYTE BINDING ANTIG; SWP:Q25735; PDB:1ZROA; NEVLSNCREKRKGMKWDCKKKNDRSNYVCIPDRRIQLCIVNLAIIKTYTKETMKDHFIEA -3333----------------3333-------------3333------------------ SKKESQLLLKKNDNKYNSKFCNDLKNSFLDYGHLAMGNDMDFGGYSTKAENKIQEVFKGA -----------%%%%--------------------------------------------- HGEISEHKIKNFRKKWWNEFREKLWEAMLSEHKNNICKNIPQEELQITQWIKEWHGEFLL ----------------------------3333---------------------------- ERDNRAKLPKSKCKNNALYEACEKECIDPCMKYRDWIIRSKFEWHTLSKEYETQKVPKEN ------------!!!!--11113333---------------------------------- AENYLIKISENKNDAKVSLLLNNCDAEYSKYCDCKHTTTLVKSVLNGNDNTIKEKREHID -----------3333----------------------------111111113333----- LDDFSKFGCDKNSVDTNTKVWECKKPYKLSTKDVCVPPRRQELCLGNIDRIYDKNLLMIK ---------3333-------------1111------3333------3333---------- EHILAIAIYESRILKRKYKNKDDKEVCKIINKTFADIRDIIGGTDYWNDLSNRKLVGKIN 
-----------------11113333----------------------------------1 TNSNYVHRNKQNDKLFRDEWWKVIKKDVWNVISWVFKDKTVCKEDDIENIPQFFRWFSEW 111-----------------------------3333-1111-3333-------------- GDDYCQDKTKMIETLKVECCEDDNCKRKCNSYKEWISKKKEEYNKQAKQYQEYQKGNNYK ----------------------------------------------------------11 MYSEFKSIKPEVYLKKYSEKCSNLNFEDEFKEELHSDYKNKCTMCPEV 111111-----------1111---3333--3333-------------- >E-2/E-2' PROTEIN; SWP:Q9ZFE7; PDB:1ZRRA; SALTIFSVKDPQNSLWHSTNAEEIQQQLNAKGVRFERWQADRDLGAAPTAETVIAAYQHA ---------1111---------------1111----------------3333-------- IDKLVAEKGYQSWDVISLRADNPQKEALREKFLNEHTHGEDEVRFFVEGAGLFCLHIGDE ---------------------1111----1111--------------------------- VFQVLCEKNDLISVPAHTPHWFDMGSEPNFTAIRIFDNPEGWIAQFTGDDIASAYPRLA --------------2222-------------------3333--------3333------ >LACTOPHAGE P2 RECEPTOR BI; SWP:Q71AW2; PDB:1ZRUA; TIKNFTFGSNNDGKLYMMLTGMDYRTIRRKDWSSPLNTALNVQYTNTSIIAGGRYFELLN --------------------------------------------------%%%%------ ETVALKGDSVNYIHANIDLTQTANPVSLSAETANNSNGVDINNGSGVLKVCFDIVTTSGT -----------------3333------------------1111--------------111 GVTSTKPIVQTSTLDSISVNDMTVSGSIDVPVQTLTVEAGNGLQLQLTKKNNDLVIVRFF 1--------------------------------------iiii------%%%%------- GSVSNIQKGWNMSGTWVDRPFRPAAVQSLVGHFAGRDTSFHIDINPNGSITWWGANIDKT ------2222-------3333-----------2222--------1111------------ PIATRGNGSYFIK ------------- >STOMOXYN; SWP:Q8T9R8; PDB:1ZRXA; RGFRKHFNKLVKKVKHTISETAHVAKDTAVIAGSGAAVVAAT ---------------------33333333--------3333- >PROTEIN KINASE C, IOTA; SWP:P41743; PDB:1ZRZA; LGLQDFDLLRVIGRGSYAKVLLVRLKKTDRIYAMKVVKKELVNDDEDIDWVQTEKHVFEQ -------------------------------------------11113333--------3 ASNHPFLVGLHSCFQTESRLFFVIEYVNGGDLMFHMQRQRKLPEEHARFYSAEISLALNY 3331111--------3333----------------1111--------------------- LHERGIIYRDLKLDNVLLDSEGHIKLTDYGMCKEGLRPGDTTSFCGTPNYIAPEILRGED -1111------1111---1111--------------2222-------11113333----- YGFSVDWWALGVLMFEMMAGRSPFDQNTEDYLFQVILEKQIRIPRSMSVKAASVLKSFLN 
-3333-----------------------------1111---------------------- KDPKERLGCLPQTGFADIQGHPFFRNVDWDMMEQKQVVPPFKPVQLPDDDDIVRKIDQSE -----2222-----------3333---33331111-------------33331111---- FEGFEYINPL ---------- >LACTOCOCCUS LACTIS MG1363; SWP:A2RJ45; PDB:1ZS3A; TKLMIDEKYAKELDKAEIDHHKPTAGAMLGHVLSNLFIENIRLTQAGIYAKSPVKCEYLR -3333------------------------------------------------------- EIAQREVEYFFKISDLLLDENEIVPSTTEEFLKYHKFITEDPKAKYWTDEDLLESFIVDF ----------------3333--------------------1111---------------- QAQNMFITRAIKLANKEEKFALAAGVVELYGYNLQVIRNLAGDLGKSVADF -----------------------------------------1111-3333- >REGULATORY PROTEIN CII; SWP:P03042; PDB:1ZS4A; GSHMANKRNEALRIESALLNKIAMLGTEKTAEAVGVDKSQISRWKRDWIPKFSMLLAVLE ------------------------------------3333------------------11 WGVVDDDMARLARQVAAILTNK 11--------------1111-- >NUCLEOSIDE DIPHOSPHATE KI; SWP:Q13232; PDB:1ZS6A; TGAHERTFLAVKPDGVQRRLVGEIVRRFERKGFKLVALKLVQASEELLREHYAELRERPF !!!!-------33331111---------3333-----------3333----3333--333 YGRLVKYMASGPVVAMVWQGLDVVRTSRALIGATNPADAPPGTIRGDFCIEVGKNLIHGS 3-----1111---------2222-----------3333------------1111------ DSVESARREIALWFRADELLCWEDSAGHWLYE --------------1111-----3333----- >HISTOCOMPATIBILITY 2, M R; SWP:Q860W6; PDB:1ZS8A; SHWLKTFRIVIMEPGILEPRFIQVSYVDSIQYQGFDSRSGMQPRAAWMKQEPPEYWKNET --------------------------!!!!--------------3333---3333----- EHAMGASLLARRTLIYMVTENNNKKNDYHTLQEVFGCNVAHDGSFLGGHYGLTYYGYDYI ------------------1111-------------------------------2222--- ILNEDLNSWTTEGKVGGKFNSVTEGWRTYLKGECTERFLRCLDLGKETLLRSDAPRTHVT --3333--------------------------------------3333------------ HKVTVTLRCWALGFYPADITLTWKRDGKNHTQDMELPDTRPAGDGTFQKWAAVVVPFGEE ------------------------------------------------------------ LRYTCHVHHEGLPGPLTLKWG --------------------- >E-1 ENZYME; SWP:Q9UHY7; PDB:1ZS9A; LSVPAEVTVILLDIEGTTTPIAFVKDILFPYIEENVKEYLQTHWEEEECQQDVSLLRKQA ---1111------2222--3333------3333---------3333-------------- EEDAHLDGAVPIPAASGNGVDDLQQIQAVVDNVCWQSLDKTTALKQLQGHWRAAFTAGRK 
1111-2222--------------------------------------------------- AEFFADVVPAVRKWREAGKVYIYSSGSVEAQKLLFGHSTEGDILELVDGHFDTKIGHKVE ---1111-------1111-------------------1111-3333-----3333-1111 SESYRKIADSIGCSTNNILFLTDVTREASAAEEADVHVAVVVRPGNAGLTDDEKTYYSLI 3333---------1111------3333----1111-------2222-------------- TSFSELYL -------- >RHO GUANINE NUCLEOTIDE EX; SWP:Q14155; PDB:1ZSGA; MTDNSNNQLVVRAKFNFQQTNEDELSFSKGDVIHVTRVEEGGWWEGTLNGRTGWFPSNYV --------------------3333-----------------------2222--------- REVKA ----- >HYPOTHETICAL PROTEIN; SWP:Q8IDI8; PDB:1ZSOA; KNTVVRIKAELENVKRLFCDDEYLWIFNIRDSTSSLTRDNIQFRKTDILEIPNSRGTANF -------------------1111-------1111---------1111------------- IKWTEYPKYSTINFVNTKNSCSYEEVNNNEWRDFASFECRGIELIDFFPSNNFIVEDTKG -----------------------3333-----------------------------1111 KLYYDVNLSDQNWCDYNEEHECVGIYNLEYEVN -------1111-----3333------------- >MYOTUBULARIN-RELATED PROT; SWP:Q13614; PDB:1ZSQA; MEEPPLLPGENIKDMAKDVTYICPFTGAVRGTLTVTNYRLYFKSMERDPPFVLDASLGVI ------2222---------------------------------------------3333- NRVEKIGGASSRGENSYGLETVCKDIRNLRFAHKPEGRTRRSIFENLMKYAFPVSNNLPL -------1111----------------------3333----------------1111--3 FAFEYKEVFPENGWKLYDPLLEYRRQGIPNESWRITKINERYELCDTYPALLVVPANIPD 333--------3333--------1111--1111---1111----1111------111133 EELKRVASFRSRGRIPVLSWIHPESQATITRCSQPMVGVSGKRSKEDEKYLQAIMDSNAQ 33-------2222----------------------------------------------- SHKIFIFDARPSVNAVANKAKGGGYESEDAYQNAELVFLDIHNIHVMRESLRKLKEIVYP --------------------------33331111-------------------------- NIEETHWLSNLESTHWLEHIKLILAGALRIADKVESGKTSVVVHSSDGWDRTAQLTSLAM --3333------------------------------------------------------ LMLDGYYRTIRGFEVLVEKEWLSFGHRFQLRVGHGDKNHADADRSPVFLQFIDCVWQMTR ---3333------------------------------1111------------------- QFPTAFEFNEYFLITILDHLYSCLFGTFLCNSEQQRGKENLPKRTVSLWSYINSQLEDFT -------------------------1111-----------------3333-----3333- NPLYGSYSNHVLYPVASMRHLELWVGYYIRWNP 1111------------1111---3333------ >NADP-dependent leukotrien; 
SWP:Q14914; PDB:1ZSVA; SMTKTWTLKKHFVGYPTNSDFELKTSELPPLKNGEVLLEALFLTVDPYMRVAAKRLKEGD ----------------3333-----------2222----------3333-3333--2222 TMMGQQVAKVVESKNVALPKGTIVLASPGWTTHSISDGKDLEKLLTEWPDTIPLSLALGT --------------33332222----------------------111111113333---- VGMPGLTAYFGLLEICGVKGGETVMVNAAAGAVGSVVGQIAKLKGCKVVGAVGSDEKVAY ---------------------------1111----------------------------- LQKLGFDVVFNYKTVESLEETLKKASPDGYDCYFDNVGGEFSNTVIGQMKKFGRIAICGA ----------3333-----------3333------------------------------3 ISTYNRTGPLPPGPPPEIVIYQELRMEAFVVYRWQGDARQKALKDLLKWVLEGKIQYKEY 333--------------------------1111--------------------------- IIEGFENMPAAFMGMLKGDNLGKTIVKA ---3333--------------------- >GLYOXALASE FAMILY PROTEIN; SWP:NA; PDB:1ZSWA; AMYEIKGHHHISMVTKNANENNHFYKNVLGLRRVKMTVNQDDPSMYHLFYGDKTGSPGTE --------------------------------------1111---------11112222- LSFFEIPLVGRTYRGTNAITRIGLLVPSEDSLHYWKERFEKFDVKHSEMTTYANRPALQF -----1111------------------------------------------%%%%----- EDAEGLRLVLLVSNGEKVEHWETWEKSEVPAKHQIQGMGSVELTVRRLDKMASTLTEIFG -1111-------iiii-1111--1111--1111--------------------------- YTEVSRNDQEAIFQSIKGEAFGEIVVKYLDGPTEKPGRGSIHHLAIRVKNDAELAYWEEQ ---------------2222-----------------2222-------------------- VKQRGFHSSGIIDRFYFKSLYFRESNGILFEIATDGPGFTVDGDVEHLGEKLDLPPFLED -1111------------------1111----------1111--3333-------333311 QRAEIEANLAPIEEK 11---1111------ >VOLTAGE-GATED POTASSIUM C; SWP:Q13303; PDB:1ZSXA; FYRNLGKSGLRVSCLGLGTWVTFGGQITDEMAEQLMTLAYDNGINLFDTAEVYAAGKAEV ----!!!!-------------2222--------------1111------3333iiii--- VLGNIIKKKGWRRSSLVITTKIFWGGKAETERGLSRKHIIEGLKASLERLQLEYVDVVFA -----------3333------------1111----------------------------- NRPDPNTPMEETVRAMTHVINQGMAMYWGTSRWSSMEIMEAYSVARQFNLTPPICEQAEY ---1111----------------------------------------------------- HMFQREKVEVQLPELFHKIGVGAMTWSPLACGIVSGKYDSGIPPYSRASLKGYQWLKDKI 11113333------------------1111-----1111---22221111---------- LSEEGRRQQAKLKELQAIAERLGCTLPQLAIAWCLRNEGVSSVLLGASNADQLMENIGAI 
-------------------1111---------11113333-----------------333 QVLPKLSSSIIHEIDSILGNKP 33333----------------- >MITOCHONDRIAL 2-ENOYL THI; SWP:Q9BV79; PDB:1ZSYA; VDLGTENLYFQSMPARVRALVYGHHGDPAKVVELKNLELAAVRGSDVRVKMLAAPINPSD -1111---------------------3333------------1111----------3333 INMIQGNYGLLPELPAVGGNEGVAQVVAVGSNVTGLKPGDWVIPANAGLGTWRTEAVFSE -----------------------------1111---2222------------------11 EALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNASNSGVGQAVIQIAAA 11---------------------------------2222-----1111------------ LGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFFKDMPQPRLALNCVGG ----------------------1111-----3333-----1111---------------- KSSTELLRQLARGGTMVTYGGMAKQPVVASVSLLIFKDLKLRGFWLSQWKKDHSPDQFKE ------11112222-------2222---------1111---------------------- LILTLCDLIRRGQLTAPACSQVPLQDYQSALEASMKPFISSKQILTM --------1111----------3333------1111----------- >Stringent starvation prot; SWP:P45206; PDB:1ZSZC; SSPKRPYYLRGFYDWLVDNSFTPYLVVDATYLGVNVPVEYVKDGQIVLNLSASATGNLQL ---3333---------1111-------1111----------iiii-----3333------ TNDFIQFNQRFKGVSRELYIPMGAALAIYARENGDGMMFEPEEIYDELN 1111------iiii------3333-----------------3333---- >BETA-2-MICROGLOBULIN; SWP:P04223; PDB:1ZT1A; MGPHSLRYFHTAVSRPGLGKPRFISVGYVDDTQFVRFDSDAENPRYEPRVRWMEQVEPEY -------------------------------------------------1111---3333 WERNTQIAKGNEQIFRVNLRTALRYYNQSAGGSHTFQRMYGCEVGSDWRLLRGYEQYAYD --------------------------------------------1111----------ii GCDYIALNEDLKTWTAADMAALITKHKWEQAGDAERDRAYLEGTCVEWLRRYLQLGNRTD ii-----3333------3333-----------------------------3333------ SPKAHVTRHSRPEDKVTLRCWALGFYPADITLTWQLNGEELTQDMELVETRPAGDGTFQK -----------------------------------%%%%--------------------- WASVVVPLGKEQYYTCHVYHQGLPEPLTLRWEP ------2222---------3333---------- >INSULIN-LIKE GROWTH FACTO; SWP:P08833; PDB:1ZT3A; WKEPCRIELYRVVESLAKAQETSGEEISKFYLPNCNKNGFYHSRQCETSMDGEAGLCWCV ------------------3333-------------1111--------------------- YPWNGKRIPGSPEIRGDPNC -------2222--------- >T-CELL SURFACE GLYCOPROTE; SWP:P15813; 
PDB:1ZT4A; RLFPLRCLQISSFANSSWTRTDGLAWLGELQTHSWSNDSDTVRSLKPWSQGTFSDQQWET --------------1111--------!!!!---------------1111!!!!3333--- LQHIFRVYRSSFTRDVKEFAKMLRLSYPLELQVSAGCEVHPGNASNNFFHVAFQGKDILS ----------------------------------------------------iiii---- FQGTSWEPTQEAPLWVNLAIQVLNQDKWTRETVQWLLNGTCPQFVSGLLESGKSELKKQV ------------3333----------3333------------------3333--3333-- KPKAWLSRGPSPGPGRLLLVCHVSGFYPKPVWVKWMRGEQEQQGTQPGDILPNADETWYL ------------------------------------------------------------ RATLDVVAGEAAGLSCRVKHSSLEGQDIVLYW ------3333---------3333--------- >HYPOTHETICAL PROTEIN TM08; SWP:NA; PDB:1ZTCA; HELKILVTGGNVFVPGRLNAHFSTVVYLEHKDRRIIIDPGNLSSDELEEKFSELGISPDD -----------------------------!!!!-------3333------------3333 ITDVLFTHVHLDHIFNSVLFENATFYVHEVYKTKNYLSFGTIVGRIYSKVISSWKNVVLL ---------3333------1111-----3333--3333------------1111------ KGEESLFDEKVKVFHTPWHAREHLSFLLDTENAGRVLITGDITPNRLSYYDIIKGYGSVQ ------%%%%---------1111------------------------------------- VKNFLDRVGRIDLLVFPHDAPLKPEV -------------------------- >HYPOTHETICAL PROTEIN PFU-; SWP:Q8U363; PDB:1ZTDA; SEIDKGLAKFGDSLINFLYSLALTEFLGKPTGDRVPNASLAIALELTGLSKNLRRVDKHA -----------------------------------3333-----33331111-3333--- KGDYAEALIAKAWLMGLISEREAVEIIKKNLYPEVLDFSKKKEAIGRALAPLLVIISERL ---------------------------1111-----3333-------------------- YSSQV 1111- >POLY(RC)-BINDING PROTEIN ; SWP:Q15365; PDB:1ZTGA; ILTIRLLMHGKEVGSIIGKKGESVKRIREESGARINISEGNCPERIITLTGPTNAIFKAF -----------------2222--------------------------------------- AMIIDKLEEDIN ------------ >RIO1 SERINE PROTEIN KINAS; SWP:O28471; PDB:1ZTHA; DLKKIESYLDKLRIKEKDGEERKIYAEVLDGRTLKTLYKLSAKGYITAGGVISTGKEANV --------------3333------------------------------------------ FYADGVFDGKPVAAVKIYRIDEYLYGDERFDPKEKVFIWTEKEFRNLERAKEAGVSVPQP ------iiii----------1111--3333--------------------1111------ YTYKNVLLEFIGEDELPAPTLVELGRELKELDVEGIFNDVVENVKRLYQEAELVHADLSE ------------%%%%---3333--3333-----------------------------33 
YNIYIDKVYFIDGQAVTLRHPAESYLERDVRNIIRFFSKYGVKADFEELKEVKGE 33--------------1111---------------3333-----3333--1111- >FUSION GLYCOPROTEIN; SWP:P06828; PDB:1ZTMA; ITKLQHVGVLVNSPKGMKISQNFETRYLILSLIPKIEDSNSCGDQQIKQYKRLLDRLIIP ---3333---------------------------------1111---------------- LYDGLRLQKDVIVSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNKEIVPSIA --------------3333------------------------------------------ RLGCEAAGLQLGIALTQHYSELTNIFGDNIGSLQEKGIKLQGIASLYRTNITEIFTTSTV ---------------------------1111-------33331111---3333------- DKYDIYDLLFTESIKVRVIDVDLNDYSITLQVRLPLLTRLLNTQIYRVDSISYNIQNREW --------1111------------------------------------------------ YIPLPSHIMTKGAFLGGADVKECIEAFSSYICPSDPGFVLNHEMESCLSGNISQCPRTVV ----------!!!!-----1111--1111---------------------3333------ KSDIVPRYAFVNGGVVANCITTTCTCNGIGNRINQPPDQGVKIITHKECNTIGINGMLFN -3333------------------------------3333-----3333------------ TNKEGTLAFYTPNDITLNNSVALDPIDISIELNKAKSDLEESKEWIRRSNQKLDSI ---------------------------3333------------------------- >BASOPHILIC LEUKEMIA EXPRE; SWP:Q9H3H3; PDB:1ZTPA; EDGFTAEHLAAEAAADDPWLVFDARTTPATELDAWLAKYPPSQVTRYGDPGSPNSEPVGW 3333------------------3333-3333--------1111-11112222-------- IAVYGQGYSPNSGDVQGLQAAWEALQTSGRPITPGTLRQLAITHHVLSGKWLHLAPGFKL ----2222--------------3333---------------------------------- DHAWAGIARAVVEGRLQVAKVSPRAKEGGRQVICVYTDDFTDRLGVLEADSAIRAAGIKC --------------------------------------1111-----------1111--- LLTYKPDVYTYLGIYRANRWHLCPTLYESRFQGSRVLDRANNVEL -----33331111----1111------------------------ >SEGMENTATION POLARITY HOM; SWP:P02836; PDB:1ZTRA; DEKRPRTAFSSEQLARAKREFNENRYLTERRRQQLSSELGLNEAQIKIWFQNKRAKIRRS --------------------1111-3333-3333-------1111--------------- >HYPOTHETICAL PROTEIN YQBG; SWP:P45923; PDB:1ZTSA; MLLITPDELKSYSVFESVKTRPDELLKQDILEATADIILKVGHDFSDAEYIPLPETVRLA --------------3333----------------------------1111---------- LLKLSQFYALINGDESIIKGYTTEKIGDYSYTLGDGSSLQKPDVYALIKDYVKPADPDLE --------------------------------1111-------33331111--------- GIEAKVRMRSILEHHHHHH 
----3333----------- >BACTERIOPHYTOCHROME; SWP:Q9RZA4; PDB:1ZTUA; PLPFFPPLYLGGPEITTENCEREPIHIPGSIQPHGALLTADGHSGEVLQMSLNAATFLGQ ------1111-----333311111111----1111-----------------3333---- EPTVLRGQTLAALLPEQWPALQAALPPGCPDALQYRATLDWPGHLSLTVHRVGELLILEF 3333----3333-------------22221111--------------------------- EPTEPHALRNAFALESAPNLRALAEVATQTVRELTGFDRVLYKFAPDATGEVIAEARREG -----------3333-----------------------------1111---------222 LHAFLGHRFPASDIPAQARALYTRHLLRLTADTRAAAVPLDPVLNTQTNAPTPLGGAVLR 2--2222--3333------1111---------------------3333-----1111--- ATSPHQYLRNGVGSSLSVSVVVGGQLWGLIACHHQTPYVLPPDLRTTLEYLGRLLSLQVQ ---------------------%%%%----------------------------------- VKEAHHHH -------- >Igk-C protein; SWP:Q58EU4; PDB:1ZTXH; QVQLQQSGSELMKPGASVQISCKATGYTFSDYWIEWVKQRPGHGLEWIGDILCGTGRTRY ------------2222-----------3333----------------------------- NEKLKAMATFTADTSSNTAFMQLSSL 3333---------------------- >CHITIN OLIGOSACCHARIDE BI; SWP:Q9KUA3; PDB:1ZU0A; RSELTIVPDFYPTMVRNFNPYLATNLRTTTDFIYEPLVVFNEMKGNTPVFRLAESYKMAD ------------------1111------------------------------------11 DLMSVTFDIRKGVKWSDGEAFTADDVVYSFGLLKAKPELDQRGINKWVTSVEKVDEYKVR 11------------1111---3333----------3333---3333-------------- FRLSEANSNVPYEISLIPIVAEHVWKDVKDPTTFTNENPVGTGPFTVIDTFTPQLYIQCR ------1111---1111---33331111-3333------------------1111----- NPNYWDAANLEVDCLRVPQIANNDQLLGKIVNSELDWTSSFVPDIDRTYAAANPNHHYWY -----3333----------------------------------3333-11111111---- PAAGTQAFMVNFKNPDPAKKEALDNVDFRRAFSMALDRQTIIDIAFYGSGTVNDFASGLG --------------------------------3333-------1111-------1111-3 YAFEAWSDEATHKKYKGFNTYDVEGSKKLLAKAGFKDVNGDGFVETPSGKSFELLIQSPN 33311113333---3333---------------------------1111---------22 GWTDFNNTVQLAVEQLQEVGIKAKARTPEFAVYNQAMLEGTYDVAYTNYFHGADPFTYWN 22--------------1111----------------1111--------------3333-- SGYNSALQSGDGMPRFAMHYFTDKKLDGLLDSFYKTADKNEQLAIAHGIQKIIAENQVTI ---3333--2222-1111-----------1111--------------------------- PVMSGAWMYQYNTTRFTGWWSEENPKGRPSVWAGIPERLLHVLDLKPVK 
--------------------3333-------2222-----1111----- >RNA BINDING PROTEIN ZFA; SWP:Q8AVN9; PDB:1ZU1A; ADEFGNGDALDLPVGKDAVNSLIRENSHIFSDTQCKVCSAVLISESQKLAHYQSRKHANK -------------------------3333------1111-----------1111------ VRRYMAINQGEDSVPAKKFKAAPAEISDGEDRSKCCPVCNMTFSSPVVAESHYIGKTHIK ------------------------------1111--1111----3333------------ NLRLREQ ------- >MITOCHONDRIAL IMPORT RECE; SWP:P82874; PDB:1ZU2A; SMDTETEFDRILLFEQIRQDAENTYKSNPLDADNLTRWGGVLLELSQFHSISDAKQMIQE ------3333-------------------------------------------------- AITKFEEALLIDPKKDEAVWCIGNAYTSFAFLTPDETEAKHNFDLATQFFQQAVDEQPDN -----------1111-----------------------------------------1111 THYLKSLEMTAKAPQLHAEAYKQGLGGSHHHHHH -------3333----------------------- >FTSY; SWP:Q6MTB9; PDB:1ZU4A; PMEKAMLKSAFNFSKDIKKLSKKYKQADDEFFEELEDVLIQTDMGMKMVLKVSNLVRKKT ------------------3333----------------------------------1111 KRDTSFENIKDALVESLYQAYTDNDWYRIDFKENRLNIFMLVGVNGTGKTTSLAKMANYY 1111---------------------------------------2222------------- AELGYKVLIAAADTFRAGATQQLEEWIKTRLNNKVDLVKANKLNADPASVVFDAIKKAKE -------------------------------1111----------3333----------- QNYDLLLIDTAGRLQNKTNLMAELEKMNKIIQQVEKSAPHEVLLVIDATTGQNGVIQAEE ------------3333--------------33331111--------3333---------- FSKVADVSGIILTKMDSTSKGGIGLAIKELLNIPIKMIGVGEKVDDLLAFDIDQYIVHLS 1111---------3333-------------------------1111-------------3 SGFMQ 333-- >Adenylyltransferase thiF; SWP:P30138; PDB:1ZUD1; MNDRDFMRYSRQILLDDIALDGQQKLLDSQVLIIGLGGLGTPAALYLAGAGVGTLVLADD --------------3333------------------3333-------1111--------- DDVHLSNLQRQILFTTEDIDRPKSQVSQQRLTQLNPDIQLTALQQRLTGEALKDAVARAD ---3333---11111111------------------------------------------ VVLDCTDNMATRQEINAACVALNTPLITASAVGFGGQLMVLTPPWEQGCYRCLWPAGVVG -------------------------------!!!!-------------3333-------- PVVGVMGTLQALEAIKLLSGIETPAGELRLFDGKSSQWRSLALRRASGCPVCGG ---------------------------------------------1111----- >Protein thiS; SWP:O32583; PDB:1ZUD2; 
QILFNDQAMQCAAGQTVHELLEQLDQRQAGAALAINQQIVPREQWAQHIVQDGDQILLFQ ---iiii---------------------------%%%%--3333------2222------ VIAGG ----- >PHAGE 434 CRO PROTEIN; SWP:P03036; PDB:1ZUG; MQTLSERLKKRRIALKMTQTELATKAGVKQQSIQLIEAGVTKRPRFLFEIAMALNCDPVW --3333------1111-3333-------3333---1111-----------------3333 LQYGTKRGKAA ----------- >SHIKIMATE KINASE; SWP:P56073; PDB:1ZUHA; QHLVLIGFMGSGKSSLAQELGLALKLEVLDTDMIISERVGLSVREIFEELGEDNFRMFEK -------2222------------------------------------------------- NLIDELKTLKTPHVISTGGGIVMHENLKGLGTTFYLKMDFETLIKRLNQLNNLTQAKELF -----1111--------1111--1111--------------------------------- EKRQALYEKNASFIIDARGGLNNSLKQVLQF -------1111-----1111----------- >HYPOTHETICAL PROTEIN LLAC; SWP:A2RLG8; PDB:1ZUJA; SIDEKYEAEVKKSEIDHHKPTAGAMLSHVLSNIFYEKISLMQAGLYAKSANYRIKFREIA -----------------------------------------3333---3333-------- LKEDEWFYLISEQLLDENELVPTTLDEFVSNHKFIENDPKAKYWTDEALIENFINDFQNQ -----------------------3333----------1111---3333------------ NLFIGRAIKLAQKEEKFSLELAIRKLYGYNLSIIPYFAGELGKTIGEF ----------------3333------------------1111------ >SULFATE ADENYLYLTRANSFERA; SWP:Q87WW0; PDB:1ZUNA; HVDKLTHLKQLEAESIHIIREVAAEFDNPVLYSIGKDSAVLHLARKAFFPGKLPFPVHVD ----------------------------------3333---------------------- TRWKFQEYRFRDQVEEGLDLITHINSAKHTDIKTEGLKQALDKHGFDAAFGGARRDEEKS ---------------------------3333----------------------1111333 RAKERVYSFRDSKHRWDPKNQRPELWNVYNGNVNKGESIRVFPLSNWTELDIWQYIYLEG 3---------1111--3333-------------2222----1111--------------- IPIVPLYFAA ---------- >SULFATE ADENYLYLTRANSFERA; SWP:NA; PDB:1ZUNB; LGQHERKELRFLTCGNVDDGKSTLIGRLLHDSKIGDDLALLVDGLQAITIDVAYRYFSTA ---------------22223333------1111-------------------------33 KRKFIIADTPGHEQYTRNATGASTCDLAIILVDARYGVQTQTRRHSYIASLLGIKHIVVA 33---------3333-----3333--------3333-------------1111------- INKDLNGFDERVFESIKADYLKFAEGIAFKPTTAFVPSALKGDNVVNKSERSPWYAGQSL ---1111------------------------------33332222---1111-------- EILETVEIASDRNYTDLRFPVQYVNRPNLNFRGFAGTLASGIVHKGDEIVVLPSGKSSRV 
3333-----------------------1111------------2222------------- KSIVTFEGELEQAGPGQAVTLTEDEIDISRGDLLVHADNVPQVSDAFDALVWAEEPLPGK ----3333-----2222------------------1111-----------------2222 KYDIKRATSYVPGSIASITHRVDVNTLEEGPASSLQLNEIGRVKVSLDAPIALDGYSSNR -----------------------------------2222---------------3333-- TTGAFIVIDRLTNGTVAAGIIA 1111------------------ >HYPOTHETICAL PROTEIN LOC9; SWP:Q8WVN8; PDB:1ZUOA; GSVQASDRLMKELRDIYRSQSYKTGIYSVELINDSLYDWHVKLQKVDPDSPLHSDLQILK -3333----------------1111------%%%%-----------1111---------- EKEGIEYILLNFSFKDNFPFDPPFVRVVLPVLSGGYVLGGGALCMELLTKQGWSSAYSIE --------------1111---------------%%%%2222---3333-----3333--- SVIMQINATLVKGKARVQFGANKNQYNLARAQQSYNSIVQIH ----------1111---1111----------------3333- >HYPOTHETICAL PROTEIN TM17; SWP:NA; PDB:1ZUPA; HQVRIERAERIESELEEHVGDQTFVEESRFLEEDEQREGEILDQIIFVDGKRRSFVRITT ------------------------------3333-------------------------1 DEGITGIFAELCVGAVIWDREGGTKTLFSPDKPPVKERVLGFSQSFQEEGYEEVGGILFK 111-------------------------3333----------3333-------%%%%--- VVKEGKDAQSIDLYRSLEIEEVRKHDKNILIVKDGPAARELPFEENVGPIGLVKNIGVTE -------------------------------------3333------------------- LSKEDFKKLRFLKKGKRSKFVSSLKKVGAYVKLIDGEGIRGLVRLETYVKDDNQIPYIRK ------3333--2222----------------------2222--------1111------ VFDDLAKTLPHLTADLPNILPIQFLEENLSYYLTDKNYNTRLFAYI -------3333-----------------3333--3333-------- >BZZ1 PROTEIN; SWP:P38822; PDB:1ZUUA; ENKVLYAYVQKDDDEITITPGDKISLVARDTGSGWTKINNDTTGETGLVPTTYIRI -----------1111---2222---------------------------3333--- >GLUTAMATE RACEMASE 1; SWP:P94556; PDB:1ZUWA; MLEQPIGVIDSGVGGLTVAKEIMRQLPKENIIYVGDTKRCPYGPRPEEEVLQYTWELTNY 1111--------3333---------1111------3333--11113333----------- LLENHHIKMLVIACNTATAIALDDIQRSVGIPVVGVIQPGARAAIKVTDNQHIGVIGTEN ------------------------------------------------------------ TIKSNAYEEALLALNPDLKVENLACPLLVPFVESGKFLDQTADEIVKTSLYPLKDTSIDS --------------1111------33333333-----------------3333------- LILGCTHYPILKEAIQRYMGEHVNIISSGDETAREVSTILSYKGLLNQSPIAPDHQFLTT 
----33331111------------------------------------------------ GARDQFAKIADDWFHVECISL --------------------- >green to red photoconvert; SWP:Q5S6Z9; PDB:1ZUXA; MSAIKPDMKINLRMEGNVNGHHFVIDGDGTGKPFEGKQSMDLEVKEGGPLPFAFDILTTA -----------------iiii----------3333---------------------1111 FNRVFAEYPDHIQDYFKQSFPKGYSWERSLTFEDGGICIARNDITMEGDTFYNKVRFHGV -3333---1111-3333--------------1111-----------!!!!---------- NFPANGPVMQKKTLKWEPSTEKMYVRDGVLTGDITMALLLEGNAHYRCDFRTTYKAKEKG --1111-------------------iiii----------2222----------------- VKLPGYHFVDHCIEILSHDKDYNKVKLYEHAVAHSGLPD --------------------------------------- >MYOSIN-5 ISOFORM; SWP:Q04439; PDB:1ZUYA; PMFEAAYDFPGSGSPSELPLKKGDVIYITREEPSGWSLGKLLDGSKEGWVPTAYMKPH -------------1111---2222-------3333---------------3333---- >DOUBLESEX PROTEIN; SWP:P23023; PDB:1ZV1A; QDVFLDYCQKLLEKFRYPWELMPLMYVILKDADANIEEASRRIEEGQYVVNEYSRQHNL -----------------3333--------1111-------------------------- >Regulator of G-protein si; SWP:Q9UGC6; PDB:1ZV4X; LYFQSMNPTAEEVLSWSQNFDKMMKAPAGRNLFREFLRTEYSEENLLFWLACEDLKKEQN -3333----------1111----------------3333--3333--------------- KKVIEEKARMIYEDYISILPKEVSLDSRVREVINRNLLDPNPHMYEDAQLQIYTLMHRDS ----------------------------------------1111---------------- FPRFLNSQIYKSFVESTA ------------------ >LYSOZYME C; SWP:NA; PDB:1ZV5A; VQLVESGGGSVQAGESLRLSCAASGVTYKNYCIGWFRQAPGKDREGVVFINSDGGITYYA --------------------------------------2222--------1111-----3 DSVKGRFTISQDNAKNTVYLQMNSLKPEDTASYYCAAGYRNYGQCATRYWGQGTQVTVS 333--------3333----------3333-----------iiii--------------- >SPIKE GLYCOPROTEIN; SWP:P59594; PDB:1ZV7A; DISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGK 3333---------------------------------- >CELLULOSOMAL SCAFFOLDIN A; SWP:Q7WYN3; PDB:1ZV9A; APTSSIEIVLDKTTASVGEIVTASINIKNITNFSGCQLNKYDPAVLQPVTSSGVAYTKST ---------------2222----------------------3333----1111---1111 PGAGTILNSDFNLRQVADNDLEKGILNFSKAYVSLDDYRTAAAPEQTGTVAVVKFKVLKE --------------------1111------------------------------------ ETSSISFEDTTSVPNAIDGTVLFDWNGDRIQSGYSVIQPAVINLDIKAS 
---------1111---iiii-------------------------3333 >E2 GLYCOPROTEIN; SWP:Q3I5J5; PDB:1ZVAA; ALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDISGGRGGDISGINASVVNIQKEID ------------------------------------1111----2222------------ RLNEVAKNLNESLID --------------- >ALLENE OXIDE CYCLASE; SWP:Q9LS03; PDB:1ZVCA; KVQELSVYEINDLDRHSPKILKNARFGLGDLVPFTNKLYTGDLKKRVGITAGLCVVIEHV ---------------------------------------1111----------------- PEKNGDRFEATYSFYFGDYGHLSVQGPYLTYEDSFLAITGGAGIFEGAYGQVKLQQLVYP ---------------!!!!---------1111---------!!!!--------------- TKLFYTFYLKGLANDLPLELIGTPVPPSKDVEPAPEAKALKPSGVVSNFTN ----------------3333-------1111---3333--1111------- >SMAD UBIQUITINATION REGUL; SWP:Q9HAU4; PDB:1ZVDA; KRDLVQKLKILRQELSQQQPQAGHCRIEVSREEIFEESYRQVKRPKDLWKRLIKFRGEEG -----------------------------3333----------3333-------2222-- LDYGGVAREWLYLLSHELNPYYGLFQYSRDDIYTLQINPDSAVNPEHLSYFHFVGRIGAV ------------------3333-----1111------1111--1111------------1 FHGHYIDGGFTLPFYKQLLGKSITLDDELVDPDLHNSLVWILENDITGVLDHTFCVEHNA 111-------3333--1111---3333--------------------3333--------i YGEIIQHELKPNGKSIPVNEENKKEYVRLYVNWRFLRGIEAQFLALQKGFNEVIPQHLLK iii------2222----------------------2222---------------333311 TFDEKELELIICGLGKIDVNDWKVNTRLKHCTPDSNIVKWFWKAVEFFDEERRARLLQFV 11-----------------------------1111------------------------- TGSSRVPLQGFKALQGAAGPRLFTIHQIDACTNNLPKAHTCFNRIDIPPYESYEKLYEKL ------11111111----------------3333----3333------------------ LTAIEE ------ >3-HYDROXYANTHRANILATE 3,4; SWP:P47096; PDB:1ZVFA; AMFNTTPINIDKWLKENEGLLKPPVNNYCLHKGGFTVMIVGGPNERTDYHINPTPEWFYQ ----------------3333---------------------------------------- KKGSMLLKVVDETDAEPKFIDIIINEGDSYLLPGNVPHSPVRFADTVGIVVEQDRPGGEN ------------------------2222-------------------------------- DKIRWYCSHCRQVVHESELQMLDLGTQVKEAILDFENDVEKRTCFHCKTLNYARPQ ------------------------1111----------3333-------------- >LYSOZYME C; SWP:NA; PDB:1ZVHA; DVQLVESGGGSVQAGGSLRLSCAASGYIASINYLGWFRQAPGKEREGVAAVSPAGGTPYY ------------2222----------3333---------2222----------------- 
ADSVKGRFTVSLDNAENTVYLQMNSLKPEDTALYYCAAARQGWYIPLNSYGYNYWGQGTQ 3333---------1111---------3333--------------11111111-------- VTVS ---- >MN-CADHERIN; SWP:Q8QGH3; PDB:1ZVNA; SGWVWNQFFVLEEYTGTDPLYVGKLHSDMDRGDGSIKYILSGEGAGIVFTIDDTTGDIHA ----------3333------------1111-----------2222--------------- IQRLDREERSQYTLRAQALDRRTGRPMEPESEFIIKIQD ----3333------------------------------- >HYPOTHETICAL PROTEIN VC08; SWP:Q9KTT6; PDB:1ZVPA; SGIKSLELLLQSMSPELMAGDYVFCTVNGALSDYLSLEPIATFREPEGLTLVLEAEKAQQ --3333---1111----------------33333333-------1111----------11 AGLESSALFSLITLTVHSEAVGLTAAFATKLAEHGISANVIAGYYHDHIFVQKEKAQQAL 11-----------------------------1111-------3333-----3333----- QALGEFAQ --3333-- >TRANSFORMING PROTEIN P21/; SWP:P01112; PDB:1ZVQA; MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAG ----------2222------------------1111---------iiii----------- GEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKCDL ----3333--------------1111------------------------------1111 AARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQH --------------1111----------2222----------3333 >MHC CLASS I ANTIGEN; SWP:Q70UQ4; PDB:1ZVSA; GSHSMKYFYTSMSRPGRGQPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWVEQEGPEYW -------------2222----------------------------------11113333- DRETRNMKTETQNAPVNLRTLLRYYNQSEAGSHTLQRMVGCDLGPDGRLLRGYEQYAYDG -------------------------------------------3333------------- KDYIALNEDLRSWTAADVAAQNTQRKWEAADVAESMRAYLEGQCVEWLPRYLEKGKETLQ ------1111------3333-------1111----------------------------- RTDPPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT --------------------------------------iiii-3333------------- FQKWAAVVVPSGEEQRYTCHVQHEGLPKPHTLKWEPHH ------------3333---------------------- >TOPOISOMERASE IV SUBUNIT ; SWP:P20082; PDB:1ZVTA; SEPVTIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTGRSYAID --------1111----------3333---2222--------1111-----1111-----3 PITLPSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDAGYGFVCTFNDLVARNR 333---------3333----2222--------1111--------------3333----11 
AGKALITLPENAHVMPPVVIEDASDMLLAITQAGRMLMFPVSDLPQLSKGKGNKIINIPS 11------2222---------1111-----3333-----3333----------------- AEAARGEDGLAQLYVLPQSTLTIHVGKRKIKLRPEELQKVTGERGRRGTLMRGLQRIDRV --1111------------------!!!!----33333333--2222----2222------ EIDSP ----- >LYSOZYME C; SWP:P00698; PDB:1ZVYA; DVQLVESGGGSVQAGGSLRLSCAASGSTDSIEYMTWFRQAPGKAREGVAALYTHTGNTYY ------------2222-----------1111--------2222----------------- TDSVKGRFTISQDKAKNMAYLRMDSVKSEDTAIYTCGATRKYVPVRFALDQSSYDYWGQG 3333--------3333----------3333-------------3333--3333------- TQVTV ----- ------------------------------------------------------------ ----- >UBIQUITIN; SWP:P61864; PDB:1ZW7A; MQIFVKTLTGATITLEVESSDTIDNVKSKIQAAPGIPPDQQELIFAGKQLEDGRTLSDYN ------1111-------1111---------------1111----------11113333-- IQKESTLHLVLRLRGGHHHHHH ---------------------- >ZINC-RESPONSIVE TRANSCRIP; SWP:P47043; PDB:1ZW8A; DLKCKWKECPESASSLFDLQRHLLKDHVSQDFKHPMEPLACNWEDCDFLGDDTASIVNHI -----1111-----3333------------------------1111-----3333----- NAQH ---- >GAMMA CRYSTALLIN S; SWP:O35486; PDB:1ZWMA; SKTGGKISFYEDRNFQGRRYDCDCDCADFRSYLSRCNSIRVEGGTWAVYERPNFSGHMYI ------------%%%%-----------------------------------%%%%----- LPQGEYPEYQRWMGLNDRLGSCRAVHLSSGGQAKIQVFEKGDFNGQMYETTEDCPSIMEQ -------3333-----------------------------%%%%-----------3333- FHLREIHSCKVVEGTWIFYELPNYRGRQYLLDKKEYRKPVDWGAASPAIQSFRRIVE ---------------------%%%%------------3333---------------- >MAJOR STRUCTURAL SUBUNIT ; SWP:P33553; PDB:1ZWTA; MEQSASDSNKSQNAISEVMSATSAINGLYIGQTSYSGLDSTILLNTSAIPDNYKDTTNKK ----33333333-------------1111------------1111--------------- ITNPFGGELNVGPANNNTAFGYYLTLTRLDKAACVSLATLNLGTSAKGYGVNISGENNIT ---------------3333-----------------1111--------------3333-- SFGNSADQAAKSTAITPAEAATACKNTDSTNKVTYFMK -------------------------------------- >SH3-CONTAINING GRB2-LIKE ; SWP:Q62420; PDB:1ZWWA; TKLDDDFKEMERKVDVTSRAVMEIMTKTIEYLQPNPASRPQAEALLAEAMLKFGRELGDD ----------------------------------3333---------------------- 
CNFGPALGEVGEAMRELSEVKDSLDMEVKQNFIDPLQNLHDKDLREIQHHLKKLEGRRLD ------------------------------------------------------------ FGYKKKRQGKIPDEELRQALEKFDESKEIAESSMFNLLEMDIEQVSQLSALVQAQLEYHK ------2222-3333--------------------------------------------- QAVQILQQVTVRLEERIRQ ------------------- >SPHINGOMYELINASE-C; SWP:Q9RLV9; PDB:1ZWXA; YPGNFKITSHNVYLFSRNIYPNWGQMHRADLIAQADYMKNNDVVILNEAFDTSASHRLLN ---------------3333----3333-------3333---------------------- NLREMYPHQTPVIGRSKHGWDKTEGALEDGGVAVVSQWPIVEKSQHIFQRGGGADRLSNK --3333----------2222-------------------------------!!!!----- GFAYVKIMKNGKPYHIIGTHTQADDSLISKDTSRAIRAEQMQEIQTFIAKKNIPKDEIIF --------iiii------------1111-------------------------1111--- IGGDLNVNYGTDEYHDMLKLLNVSSPANFNGQMATWDPTTNSMLKESYPKAAPEYLDYIF -------2222-------1111-------3333---1111-3333--1111--------- VENGHARPHSWHNKVLHTKSPQWSVKSWFKTYTYQDFSDHYPVVGFTD -1111---------------------!!!!------------------ >HYPOTHETICAL UPF0244 PROT; SWP:Q9KU27; PDB:1ZWYA; VRKIIIASQNPAKVNAVRSAFSTVFPDQEWEFIGVSVPSEVADQPSDEETKQGALNRVRN ------------------------1111-------------------------------- AKQRHPGAEYYVGLEAGIEENKTFAWIVESDQQRGESRSACLLPPLVLERLRQAELGDVD ------------------!!!!-------1111-----------------11113333-- EVFGGGAIGLLTRHHLTRSTVYHQALILALIPFINPEHYP -----------%%%%--------------3333-3333-- >GUANIDINOACETATE N-METHYL; SWP:Q14353; PDB:1ZX0A; PIFAPGENCSPAWGAAPAAYDAADTHLRILGKPVMERWETPYMHALAAAASSKGGRVLEV ---2222-33331111----1111----iiii------------------1111------ GFGMAIAASKVQEAPIDEHWIIECNDGVFQRLRDWAPRQTHKVIPLKGLWEDVAPTLPDG -!!!!-----3333---------------------3333---------33333333---- HFDGILYDTYPLSEETWHTHQFNFIKNHAFRLLKPGGVLTYCNLTSWGELMKSKYSDITI ------------1111-----------------2222-----3333--1111-------- MFEETQVPALLEAGFRRENIRTEVMALVPPADCRYYAFPQMITPLVTKG ---------------3333----------1111---------------- >UBP3-ASSOCIATED PROTEIN B; SWP:P53741; PDB:1ZX2A; SMGVTVQDICFAFLQNYYERMRTDPSKLAYFYASTAELTHTNYPTVKVTGRENINKFFSR -----------------------33333333-1111------------------------ 
NDAKVRSLKLKLDTIDFQYTGHLHKSILIMATGEMFWTGTPVYKFCQTFILLPSSNGSTF --------------------2222-----------------------------1111--- DITNDIIRFI ---------- >HYPOTHETICAL PROTEIN NE02; SWP:Q82XL7; PDB:1ZX3A; EVQQPDPRKNWIENDSGVIYLLESWLKAKSQETGKEISDIFANAVEFNIVLKDWGKEKLE ------------------------------------------------------------ ETNTEYQNQQRKLRKTYIEYYDR ------------------1111- >PLASMID PARTITION PAR B P; SWP:Q38420; PDB:1ZX4A; ALQHSIREIGLRLRKNDGSQKDIAAKEGLSQAKVTRALQAASAPEELVALFPVQSELTFS ----3333------11113333-------3333-----------3333----3333---- DYKTLCAVGDEGNKNLEFDQLIQNISPEINDILSIEAEDEVKNKILRLITKEASLLTDKG -----------------------------1111--------------------------- SKDKSVVTELWKFEDKDRFARKRVKGRAFSYEFNRLSKELQEELDRIGHILRKS ------------------------------------------------------ >MANNOSEPHOSPHATE ISOMERAS; SWP:O30200; PDB:1ZX5A; GELPSFIFQAQENLVERPWGGEWIALLKGFRQSGIGESWEFSAHTSRPSTVLVKGQQLSI ----------------1111-----1111--------------3333-----iiii---- ELFSKHRDELLGRAAEKFSKFPILVRLIDAASPTQVHVHPSDKAAESLGEAEGGVESAWL ----------!!!!3333------------------------------------------ VFNKGKAYAGFKEDVKIEELEEKLKEEDFDFKTLLNTFETTPYDTFVIRPGIPHAGEGLR --------------------3333-------1111-----2222----2222-------- VLEVSSNSTLAYFFNENDWEKVKKVLNTKKVEEFEVKGKKGAETENFGLEVVDVTGTAEI --------------3333--3333-------3333--------1111------------- KTGGVNILYAAEGYFILRGKETADLHRGYSCLVPASTDSFTVESERGKIVRIYLKV -------------------------2222--------------------------- >YPR154WP; SWP:Q06449; PDB:1ZX6A; EYVEALYQFDPQQDGDLGLKPGDKVQLLEKLSPEWYKGSCNGRTGIFPANYVKPAF ------------2222---2222--------1111----iiii----1111----- >HYPOTHETICAL PROTEIN TM13; SWP:NA; PDB:1ZX8A; HRVELLFESGKCVIDLNEYEVVKLLKEKIPFESVVNTWGEEIYFSTPVNVQKENPREVVE ------1111------333-----3333-------------------------------2 IGDVGYWPPGKALCLFFGKTPSDDKIQPASAVNVIGKIVEGLEDLKKIKDGEKVAVRFAS 222----1111-----------------------------333311112222-------- S - ------------------------------------ >SERINE/THREONINE-PROTEIN ; SWP:P15442; PDB:1ZXEA; 
SLRYASDFEEIAVLGQGAFGQVVKARNALDSRYYAIKKIRHTEEKLSTILSEVLLASLNH -3333-----------1111---------------------3333----33333333--1 QYVVRYYAAWLERRNFVKKKSTLFIQEYCENRTLYDLIHSENLNQQRDEYWRLFRQILEA 111-----------------------------3333-----3333--------------- LSYIHSQGIIHRNLKPNIFIDESRNVKIGDFGLAKNAIGTAYVATEVLDGHYNEKIDYSL ----1111------------1111-------------------3333-----3333---- GIIFFEIYPFSTGERVNILKKLRSVSIEFPPDFDDNKKVEKKIIRLLIDHDPNKRPGART -----------------------1111--1111-----------------3333------ LLNSGWLPVKHQDEVIKEALK ----------3333------- >CALC; SWP:Q8KNF0; PDB:1ZXFA; NYDPFVRHSVTVKADRKTAFKTFLEGFPEWWPNNFRTTKVGAPLGVDKKGGRWYEIDEQG -------------------------------11112222-----------------3333 EEHTFGLIRKVDEPDTLVIGWRLNGFGRIDPDNSSEFTVTFVADGQKKTRVDVEHTHFDR ---------------------------------------------------------333 MGTKHAKRVRNGMDKGWPTILQSFQDKIDEEGAKK 3-----------1111------------------- >IMMUNOGLOBULIN G BINDING ; SWP:Q53759; PDB:1ZXGA; MYYLVVNKQQNAFYEVLNMPNLNEDQRNAFIQSLKDDPSQSANVLAEAQKLNDVQAPKA ------------------3333--3333---------1111---------1111----- >IMMUNOGLOBULIN G BINDING ; SWP:P19909; PDB:1ZXHA; MYYLVVNKGQNAFYETLTKAVDAETARNAFIQSLKDDGVQGVWTYDDATKTFTVQA -------------------------------------3333--------------- >HYPOTHETICAL PROTEIN MG37; SWP:P75223; PDB:1ZXJA; DYDIFQGHMANLKSTAKLVKPIQYDEVIEVERIFADPAFIEQHRQRILASFKDAKESALY ----------------------1111---------3333----------------3333- HELTHIVIKDNLFSCAMNAIVGYFEFNIDEAELKNVMEGLKRDEDNTVQAIAEKIIKKAL --------------------1111-------------1111------------------- VFNHLQKEWKVEITDEVVKNVISLYYEKTNQSVREYLDDKQKFEGVRTALLEERMVLETI ------1111------------3333------3333-----------------------1 NHFKFHFNLTGQ 111--------- >CADHERIN-8; SWP:P97291; PDB:1ZXKA; SWVWNQMFVLEEFSGPEPILVGRLHTDLDPGKIKYILSGDGAGTIFQINDITGDIHAIKR ---------3333-------------------------2222------------------ LDREEKAEYTLTAQAVDFETNKPLEPPSEFIIKVQD -3333------------------------------- >DNA TOPOISOMERASE II, ALP; SWP:P11388; PDB:1ZXMA; 
SVERIYQKKTQLEHILLRPDTYIGSVELVTQQMWVYDEDVGINYREVTFVPGLYKIFDEI 3333--------------1111--------------2222---------3333------- LVNAADNKQRDPKMSCIRVTIDPENNLISIWNNGKGIPVVEHKVEKMYVPALIFGQLLTS -----3333-3333--------1111---------------------3333--------- SNYDDDEKKVTGGRNGYGAKLCNIFSTKFTVETASREYKKMFKQTWMDNMGRAGEMELKP -----------------------------------1111--------%%%%--------- FNGEDYTCITFQPDLSKFKMQSLDKDIVALMVRRAYDIAGSTKDVKVFLNGNKLPVKGFR -------------3333-------------------------------iiii-------- SYVDMYLKDKLDETGNSLKVIHEQVNHRWEVCLTMSEKGFQQISFVNSIATSKGGRHVDY -----------3333------------------------------iiii-1111------ VADQIVTKLVDVVKKKNAVKAHQVKNHMWIFVNALIENPTFDSQTKENMTLQPKSFGSTC -------------------3333-------------------3333-----3333----- QLSEKFIKAAIGC ------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8A1P1; PDB:1ZXOA; LIADSGSTKTDWCVVLNGAVIKRLGTKGINPFFQSEEEIQQKLTAVYFYGAGCTPEKAPV -----------------------------3333--------1111--------1111333 LRRAIADSLPVIGNIKANSDLAAAHGLCGQKAGIACILGTGSNSCFYNGKEIVSNISPLG 3----------------------------------------------------------- FILGDEGSGAVLGKLLVGDILKNQLPATLKEEFLKQFDLTPPEIIDRVYRQPFPNRFLAS --------------------------3333---------1111---------3333---- LSPFIAQHLEEPAIRQLVNSFIAFFRRNVQYDYKQYPVHFIGSIAYCYKEILQDAARQTG 33331111----3333-------------------------------------------- IQIGKILQSPEGLIQYHSQLS ----------3333------- >INTERCELLULAR ADHESION MO; SWP:P13598; PDB:1ZXQ; KVFEVHVRPKKLAVEPKGSLEVNCSTTCNQPEVGGLETSLNKILLDEQAQWKHYLVSNIS --------------2222-----------------------------1111--------- HDTVLQCHFTCSGKQESMNSNVSVYQPPRQVILTLQPTLVAVGKSFTIECRVPTVEPLDS ----------iiii--------------------------2222------------1111 LTLFLFRGNETLHYETFGKAAPAPQEATATFNSTADREDGHRNFSCLAVLDLMSRGGNIF -----------------------------------3333------------1111----- HKHSAPKMLEIY ------------ >functional macrophage inf; SWP:Q98158; PDB:1ZXTA; VSYTPNSCCYGFQQHPPPVQILKEWYPTSPACPKPGVILLTKRGRQICADPSKNWVRQLM 1111-------------3333-------1111--------1111-----1111------1 QRLPAIAHH 111------ >AT5G01750 PROTEIN; 
SWP:Q9LZX1; PDB:1ZXUA; GGVVVDPKYCAPYPIDAIVRKDGNFVITDVNGNLLFKVKEPVFGLHDKRVLLDGSGTPVV -----3333-------------------1111---------2222-------1111---- TLREDRWQVFRGGSTDQRDLLYTVKRTKLDVFLGHNKDKRCDFRVKGSWLERSCVVYAGE ----------!!!!-3333-------------1111------------1111-------- SDAIVAQHRKGKDNFSVTVYPNVDYAFIASLVVILDDVNR -------------------2222----------------- >6-PHOSPHOFRUCTOKINASE; SWP:P80019; PDB:1ZXXA; MKRIGILTSGGDAPGMNAAVRAVTRVAIANGLEVFGIRYGFAGLVAGDIFPLESEDVAHL ------------2222-----------1111---------------------3333---1 INVSGTFLYSARYPEFAEEEGQLAGIEQLKKHGIDAVVVIGGDGSYHGALQLTRHGFNSI 111--1111---3333-------------1111--------3333-------1111---- GLPGTIDNDIPYTDATIGYDTACMTAMDAIDKIRDTASSHHRVFIVNVMGRNCGDIAMRV -------------------------------------1111--------!!!!------- GVACGADAIVIPERPYDVEEIANRLKQAQESGKDHGLVVVAEGVMTADQFMAELKKYGDF -1111-----1111--------------1111--------3333---------3333--- DVRANVLGHMQRGGTPTVSDRVLASKLGSEAVHLLLEGKGGLAVGIENGKVTSHDILDLF -------3333-----------------------1111--------%%%%----333311 DESHRGDYDLLKLNADLSR 11----------3333--- >PEPTIDE DEFORMYLASE, MITO; SWP:NA; PDB:1ZXZA; DLPEIVASGDPVLHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRI ---------3333----------1111----------------------3333------- IVLEDTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAV -----3333----3333--------------------------------1111------- ERYLEVVVTGYDRQGKRIEVNASGWQARILQHECDHLDGNLYVDKMVPRTFRTVDNLDLP -----------1111--------------------1111-3333--------3333---- LAEGCPKLGSHH -2222------- >SERINE/THREONINE-PROTEIN ; SWP:P15442; PDB:1ZY4A; SLRYASDFEEIAVLGQGAFGQVVKARNALDSRYYAIKKIRHTEEKLSTILSEVMLLASLN -3333-----------1111---------------------33333333------1111- HQYVVRYYAAWLERRNFVKKKSTLFIQMEYCENGTLYDLIHSENLNQQRDEYWRLFRQIL 1111---------------------------------------3333------------- EALSYIHSQGIIHRDLKPMNIFIDESRNVKIGDFGLAKNVHRAMYVATEVLDGTGHYNEK ------1111------1111---1111-----------111111113333---------- IDMYSLGIIFFEMIYPFSTGMERVNILKKLRSVSIEFPPDFDDNKMKVEKKIIRLLIDHD 
-------------------------------1111--11113333--------------3 PNKRPGARTLLNSGWLPVKHQDEVIKEALKS 333---------------------------- >RNA-specific adenosine de; SWP:P78563; PDB:1ZY7A; SRQPIPSEGLQLHLPQVLADAVSRLVLGKFGDLTDNFSSPHARRKVLAGVVMTTGTDVKD ----------------------------------%%%%1111--------------3333 AKVISVSTGTKCINGEYMSDRGLALNDCHAEIISRRSLLRFLYTQLELYLNNKDDQKRSI -------------3333-----------------------------------3333---- FQKSERGGFRLKENVQFHLYISTSPCGDARIFKARGQLRTKIESGEGTIPVRSNASIQTW ---1111----2222-----------3333---2222----2222--------------- DGVLQGERLLTMSCSDKIARWNVVGIQGSLLSIFVEPIYFSSIILGSLYHGDHLSRAMYQ --1111----------------------3333--------------------------33 RISNIEDLPPLYTLNKPLLSGISNAEARQPGKAPNFSVNWTVGDSAIEVINATTGKDELG 33------2222----------------------------2222-------1111-1111 RASRLCKHALYCRWMRVHGKVPSHLLRSKITKPNVYHESKLAAKEYQAAKARLFTAFIKA --1111-----------11113333--------------------------------111 GLGAWVEKPTEQDQFSLT 1---------1111---- >ALPHA-GALACTOSIDASE; SWP:O33835; PDB:1ZY9A; HEIFGKTFREGRFVLKEKNFTVEFAVEKIHLGWKISGRVKGSPGRLEVLRTKAPEKVLVN --iiii----------3333--------2222---------------------------- NWQSWGPCRVVDAFSFKPPEIDPNWRYTASVVPDVLERNLQSDYFVAEEGKVYGFLSSKI --1111-----1111------33333333---3333-----------2222--------- AHPFFAVEDGELVAYLEYFDVEFDDFVPLEPLVVLEDPNTPLLLEKYAELVGENNARVPK -------iiii------%%%%--------------------------------------- HTPTGWCSWYHYFLDLTWEETLKNLKLAKNFPFEVFQIDDAYEKDIGDWLVTRGDFPSVE -----------!!!!------------1111----------------1111------333 EAKVIAENGFIPGIWTAPFSVSETSDVFNEHPDWVVKENGEPKAYRNWNKKIYALDLSKD 3----1111-------1111-1111-----1111---iiii-----%%%%---------- EVLNWLFDLFSSLRKGYRYFKIDFLFAGAVPGERKKNITPIQAFRKGIETIRKAVGEDSF ------------------------3333----------3333-------------1111- ILGCGSPLLPAVGCVDGRIGPDTAPFWGEHIEDNGAPAARWALRNAITRYFHDRFWLNDP -------3333----------------1111---------------1111---------- DCLILREEKTDLTQKEKELYSYTCGVLDNIIESDDLSLVRDHGKKVLKETLELLGGRPRV ----------------------------------3333-----------3333------- 
QNISEDLRYEIVSSGTLSGNVKIVVDLNSREYHLEKE ------------------------------------- >TRANSCRIPTION REGULATOR, ; SWP:NA; PDB:1ZYBA; FDTLLQLPLFQGLCHEDFTSILDKVKLHFIKHKAGETIIKSGNPCTQLCFLLKGEISIVT --33333333----------3333--------2222---2222----------------- NAKENIYTVIEQIEAPYLIEPQSLFGNTNYASSYVAHTEVHTVCISKAFVLSDLFRYDIF -2222--------------3333------------------------------------- RLNYNIVSNRAQNLYSRLWDEPTLDLKSKIIRFFLSHCEKPQGEKTFKVKDDLARCLDDT --------------3333---------------3333----------------------- RLNISKTLNELQDNGLIELHRKEILIPDAQKLL ---------------------------3333-- >THIOREDOXIN-DEPENDENT PER; SWP:P35705; PDB:1ZYEA; PAVTQHAPYFKGTAVVSGEFKEISLDDFKGKYLVLFFYPLDFTFVCPTEIIAFSDKASEF -----------------------33332222-----------3333-------------- HDVNCEVVAVSVDSHFSHLAWINTPRKNGGLGHMNIALLSDLTKQISRDYGVLLEGPGLA ------------------------3333------------1111---------------- LRGLFIIDPNGVIKHLSVNDLPVGRSVEETLRLVKAFQFVEA -------1111--------1111---------------1111 >METHYLOSOME SUBUNIT PICLN; SWP:P35521; PDB:1ZYIA; QQQPETEAVLNGKGLGTGTLYIAESRLSWLDGSGLGFSLEYPTISLHAVSRDLNAYPREH ----------------------3333----1111-------------------------- LYVMVNAKFGEESKESVAEEEDSDDDVEPIAEFRFVPSDKSALEAMFTAMCECQAL --------------------------------------3333-------------- >HYPOTHETICAL PROTEIN YIHE; SWP:P0C0K4; PDB:1ZYLA; SAFTFQTLHPDTIMDALFEHGIRVDSGLTPLNSYENRVYQFQDEDRRRFVVKFYRPERWT -----------------------------------------------------------3 ADQILEEHQFALQLVNDEVPVAAPVAFNGQTLLNHQGFYFAVFPSVGGRQFEADNIDQME 333-------------------------------!!!!-------------3333----- AVGRYLGRMHQTGRKQLFIHRPTIGLNEYLIEPRKLFEDATLIPSGLKAAFLKATDELIA -----------1111--------------------------------------------- AVTAHWREDFTVLRLHGDCHAGNILWRDGPMFVDLDDARNGPAVQDLWMLLNGDKAEQRM --1111--------------1111----------1111---3333-1111---------- QLETIIEAYEEFSEFDTAEIGLIEPLRAMRLVYYLAWLMRRWADPAFPKNFPWLTGEDYW -------3333----3333---------------------11113333------------ LRQTATFIEQAKVLQEPPLQLTPMY ----------3333----------- >ENZYME I; SWP:P08839; PDB:1ZYMA; 
SGILASPGIAFGKALLLKEDEIVIDRKKISADQVDQEVERFLSGRAKASAQLETIKTKAG -----------------------------3333--------------------------- ETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAAHEVIEGQASALEELDDEY ----3333------3333------------------3333-------------------- LKERAADVRDIGKRLLRNILGLKIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFITD -------------------------1111------------3333--------------- AGGRTSHTSIMARSLELPAIVGTGSVTSQVKNDDYLILDAVNNQVYVNPTNEVIDKMRAV -----3333---1111--------3333--2222-------------------------- QEQVASE 1111--- >SERINE PROTEASE; SWP:Q9EB08; PDB:1ZYOA; SFYSPVKAGDEPASLVAIKSGPTTIGFGCRTKIEDCLLTAHHVWCNSMRPTGLAKAGKQV ------2222-3333----!!!!----------------3333-----------iiii-- SVEDWEISMSSSDKMLDFAIVRVPTHVWSKLGVKSTPLVCPSSKDVITCYGGSSSDCLMS ------------1111-------------------------------------1111--- GVGSSSTSEFTWKLTHTCPTAAGWSGTPLYSSRGVVGMHVGFEEIGKLNRGVNMFYVANY ---------3333-------2222------3333---------2222------------- LLRS ---- >ALPHA-LIKE NEUROTOXIN BMK; SWP:P45697; PDB:1ZYWA; NSVRDAYIADSHNCVYECARNEYCNDLCTKNGAKSGYCQWVGKYGNGCWCIELPDNVPIK -------------------3333-----1111---------------------------- VGGKCH ------ >HISTONE DEACETYLASE-LIKE ; SWP:Q70I53; PDB:1ZZ1A; AIGYVWNTLYGWVDTGTGSLAAANLTARMQPISHHLAHPDTKRRFHELVCASGQIEHLTP ------3333--------------1111------1111-----------11113333--- IAAVAATDADILRAHSAAHLENMKRVSNLPTGGDTGDGITMMGNGGLEIARLSAGGAVEL ----------------------------1111----------2222-------------- TRRVATGELSAGYALVNPPGHHAPHNAAMGFCIFNNTSVAAGYARAVLGMERVAILDWDV ---3333------------11111111-iiii---------------------------- HHGNGTQDIWWNDPSVLTISLHQHLCFPPDSGYSTERGAGNGHGYNINVPLPPGSGNAAY ---------1111---------2222------3333--!!!!---------2222----- LHAMDQVVLPALRAYRPQLIIVGSGFDASMLDPLARMMVTADGFRQMARRTIDCAADICD ----------------------------1111--------------------------ii GRIVFVQEGGYSPHYLPFCGLAVIEELTGVRSLPDPYHEFLAGMGGNTLLDAERAAIEEI ii---------3333-------------------11113333------------------ VPLLADI -3333-- >HYDROXYPROPYLPHOSPHONIC A; SWP:Q56185; PDB:1ZZ6A; 
TASTGFAELLKDRREQVKMDHAALASLLGETPETVAAWENGEGGELTLTQLGRIAHVLGT --------------1111------------3333--------1111---------1111- SIGALTPPAGNDLDDGVIIQMPDERPILKGVYYVYNCLVRTKRAPSLVPLVVDVLTDNPD 3333---------iiii---3333----------------1111-------------333 DAKFNSGNEFLFVLEGEIHMKWGDKEALLPTGASMFVEEHVPHAFTAAKGTGSAKLIAVN 3----------------------------2222----2222------2222--------- F - >GLUCOSE-6-PHOSPHATE ISOME; SWP:Q5SLL6; PDB:1ZZGA; MLRLDTRFLPGFPEALSRHGPLLEEARRRLLAKRGEPGSMLGWMDLPEDTETLREVRRYR -----1111---------------------1111-2222-33333333------------ EANPWVEDFVLIGIGGSALGPKALEAAFNESGVRFHYLDHVEPEPILRLLRTLDPRKTLV --1111-------!!!!------------------------------------3333--- NAVSKSGSTAETLAGLAVFLKWLKAHLGEDWRRHLVVTTDPKEGPLRAFAEREGLKAFAI ---3333-------------------!!!!3333-------------------------- PKEVGGRFSALSPVGLLPLAFAGADLDALLMGARKANETALAPLEESLPLKTALLLHLHR 111111111111-----3333-----------------11113333-3333-------11 HLPVHVFMVYSERLSHLPSWFVQLHDESLGKVDRQGQRVGTTAVPALGPKDQHAQVQLFR 11---------1111-----------------1111------------1111-------- EGPLDKLLALVIPEAPLEDVEIPEVEGLEAASYLFGKTLFQLLKAEAEATYEALAEAGQR ------------------------22221111---------------------------- VYALFLPEVSPYAVGWLMQHLMWQTAFLGELWEVNAFDQPGVELGKVLTRKRLAG --------------------------------------3333------------- >CYTOCHROME C PEROXIDASE; SWP:NA; PDB:1ZZHA; ALREEAKGLFEVIPMQAPVTRDKIDLGAMLFFDPRMSKSGVFSCQSCHNVGLGGVDGLET ---------------------------------1111-----3333--1111-------- SIGHGWQKGPRNAPTALNAVFNVAQFWDGRAPDLAAQAMNNTPENLVATVQSMPGYVEAF ---3333--------2222----------------------------------------- AKAFPGQKDPISFDNFALAVEAFEATLITPNSKFDQWLMGADGAMSADEKAGLKLFIDTG ------------------------------------11111111---------------3 CAACHNGINIGGNGYYPFGVVEKPGRFAVTATADDEYVFRAGPLRNIALTAPYFHSGKVW 333---1111-----------------------3333------2222------1111--- DLREAVSVMANSQLGATLDDTQVDQITAFLGTLTGEQPEVVHPILPVRSAQTPRPEH ------------------------------1111--------------1111----- >HETEROGENEOUS NUCLEAR RIB; SWP:P61978; PDB:1ZZKA; 
MGPIITTQVTIPKDLAGSIIGKGGQRIKQIRHESGASIKIDEPLEGSEDRIITITGTQDQ -----------1111-----2222-----------------1111--------------- IQNAQYLLQNSVKQYSGKFF -------------------- >PUTATIVE DEOXYRIBONUCLEAS; SWP:P39408; PDB:1ZZMA; MICRFIDTHCHFDFPPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVLALAENYQPLY ----------1111--2222---------------------3333----------3333- AALGLHPGMLEKHSDVSLEQLQQALERRPAKVVAVGEIGLDLFGDDPQFERQQWLLDEQL -----33331111----------------------------------------------- KLAKRYDLPVILHSRRTHDKLAMHLKRHDLPRTGVVHGFSGSLQQAERFVQLGYKIGVGG -----------------------------1111----------------1111-----33 TITYPRASKTRDVIAKLPLASLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRREP 33-1111-3333-11111111-----------2222------------------------ ADEIAQALLNNTYTLFNVP ------------------- >RV1677; SWP:O53924; PDB:1ZZOA; TVPAQLQFSAKTLDGHDFHGESLLGKPAVLWFWAPWCPTCQGEAPVVGQVAASHPEVTFV --3333-----1111---33332222-------1111----------------3333--- GVAGLDQVPAMQEFVNKYPVKTFTQLADTDGSVWANFGVTQQPAYAFVDPHGNVDVVRGR ---------------------------1111-3333------------1111-------- MSQDELTRRVTALT ----------3333 >PROTO-ONCOGENE TYROSINE-P; SWP:P00519; PDB:1ZZPA; SGAITKGVVLDSTEALCLAISRNSEQMASHSAVLEAGKNLYSFCVSYVDSIQQMRNKFAF -------------------1111------------------------1111--------- REAINKLENNLRELQICPATAGSGPAATQDFSKLLSSVKEISDIVQRLE --------------------------------------------1111- >DUAL SPECIFICITY PROTEIN ; SWP:Q9Y6W6; PDB:1ZZWA; MAELTPILPFLFLGNEQDAQDLDTMQRLNIGYVINVTTHLPLYHYEKGLFNYKRLPATDS -------1111---3333-----------------------2222--------------1 NKQNLRQYFEEAFEFIEEAHQCGKGLLIHCQAGVSRSATIVIAYLMKHTRMTMTDAYKFV 111-3333-----------1111------------------------------------- KGKRPIISPNLNFMGQLLEFEEDLNNG ---1111----------------1111 >CYTOCHROME B562; SWP:P0ABE7; PDB:256BA; ADLEDNMETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKD ---------------------------------------1111-3333---1111----- FRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYHQKYR ------------------1111------------------------ >APOLIPOPROTEIN A-I; SWP:P02647; PDB:2A01A; 
DEPPQSPWDRVKDLATVYVDVLKDSGRDYVSQFEGSALGKQLNLKLLDNWDSVTSTFSKL ----------3333---------------------------------------------- REQLGPVTQEFWDNLEKETEGLRQEMSKDLEEVKAKVQPYLDDFQKKWQEEMELYRQKVE ------------------------------------------------------------ PLRAELQEGARQKLHELQEKLSPLGEEMRDRARAHVDALRTHLAPYSDELRQRLAARLEA -------------------------------------------3333------------- LKENGGARLAEYHAKATEHLSTLSEKAKPALEDLRQGLLPVLESFKVSFLSALEEYTKKL --------------3333------------------------------------------ NTQ --- >FERRIC-PSEUDOBACTIN 358 R; SWP:P25184; PDB:2A02A; SQEWTLDIPAQSMNSALQALAKQTDTQLLYSPEDIGGLRSSALKGRHDLQSSLRILLQGT ------------------------------3333%%%%---------------------- GLRYQIDGNTVTVTASAAAKDG ------!!!!------------ >FE-SUPEROXIDE DISMUTASE H; SWP:Q4YW77; PDB:2A03A; MAITLPKLKYALNALSPHISEETLSFHYNKHHAGYVNKLNGLIKDTPLANKSLTDILKES ----------1111-------------------------------1111----------- TGAIFNNAAQIWNHSFYWDSMGPNCGGEPHGEIKEKIQEDFGSFNNFKDQFSNVLCGHFG -----------------1111---------3333-------------------------- SGWGWLALNKNNKLVILQTHDAGNPIKENTGIPILTCDVWEHAYYIDYRNDRLSYVKAWW --------1111-------!!!!--1111----------3333----!!!!--------- NLVNWNFANENLKNALN ----------------- >CYSTEINE-RICH SECRETORY P; SWP:P16563; PDB:2A05A; GSCASCPNNCENGLCTNSCDFEDLLSNCESLKTSAGCKHELLKTKCQATCLCEDKIH --1111------------------1111-------33333333---3333------- >Forkhead box protein P2; SWP:O15409; PDB:2A07F; VRPPFTYATLIRQAIMESSDRQLTLNEIYSWFTRTFAYFRRNAATWKNAVRHNLSLHKCF -----------------1111---------------1111-3333-----------3333 VRVENVKGAVWTVDEVEYQKRR --------------33333333 >Hypothetical 41.8 kDa pro; SWP:P32793; PDB:2A08A; AMATAVALYNFAGEQPGDLAFKKGDVITILKKSDSQNDWWTGRTNGKEGIFPANYVRVS --------------2222---2222---------1111-----iiii----1111---- >DER F 13; SWP:Q1M2P5; PDB:2A0AA; MASIEGKYKLEKSEKFDEFLDKLGVGFMVKTAAKTLKPTFEVAIENDQYIFRSLSTFKNT --------------3333-------3333-3333--------------------%%%%-- EAKFKLGEEFEEDRADGKRVKTVIQKEGDNKFVQTQFGDKEVKIIREFNGDEVVVTASCD 
------------------------------------!!!!-------------------- GVTSVRTYKRI ----------- >HPT DOMAIN; SWP:P22763; PDB:2A0B; SKSEALLDIPMLEQYLELVGPKLITDGLAVFEKMMPGYVSVLESNLTAQDKKGIVEEGHK 3333-------------------------------------------------------- IKGAAGSVGLRHLQQLGQQIQSPDLPAWEDNVGEWIEEMKEEWRHDVEVLKAWVAKAT ---------------------3333-3333---------------------------- >PTS SYSTEM, NITROGEN REGU; SWP:Q9K082; PDB:2A0JA; SLIGEILPLSHIVLDMEVGSKKRLFEEAGLLLERESSLSHADVFECLFAREKLGSTGLGQ -3333--3333--------------------------------------3333-----ii GVAIPHGRHAGVKQATGAFIRTREPVGFDAPDGKPVSLIFILLVPENATGEHLEVLSKLA ii------1111-----------------1111--------------3333--------- GKFSQKSIRESLMTVSSAEEVRAILT ----------1111------------ >VOLTAGE-GATED POTASSIUM C; SWP:NA; PDB:2A0LC; QIVLTQSPAIMSASLGDRVTMTCTASSSVSSSYLHWYQQKPGSSPKLWIYSTSNLASGVP -----------------------------------------------------------3 ARFSGSGSGTSYSLTISSMEAEDAATYYCHQFHRSLTFGSGTKLE 333----!!!!---------------------------------- >ARGINASE SUPERFAMILY PROT; SWP:Q4DSA0; PDB:2A0MA; TDDPRLLSLFSAQREEDADIVIIGFPYDEGCVRNGGRAGAKKGPAAFRFFLQRLGSVNNL ----3333-----3333--------------1111---3333-------1111------- ELNVDASHLKLYDAGDITASTLEEAHEKLESKVFTVLARGAFPFVIGGGNDQSAPNGRAM -----1111---------------------------1111-------------------- LRAFPGDVGVINVDSHLDVRPPLQDGRVHSGTPFRQLLEESSFSGKRFVEFACQGSQCGA ---2222---------------1111--1111-------11113333------3333--- LHAQYVRDHQGHLMWLSEVRKKGAVAALEDAFGLTGKNTFFSFDVDSLKSSDMPGVSCPA ------1111---------------------------------1111------------- AVGLSAQEAFDMCFLAGKTPTVMMMDMSELNPLVEEYRSPRVAVYMFYHFVLGFATRP ------------------1111--------3333-------------------1111- >6-PYRUVOYL TETRAHYDROPTER; SWP:NA; PDB:2A0SA; DQIAELLVESPLFSFNCAHFIAFKGFRETLHGHNYNVSLRLRGNIQGDGYVIDFSILKEK ---------3333---------2222-------------------1111----------- VRKVCKQLDHHFILPMYSDVLNIQEVNDNFKITCEDNSEYSFPKRDCVQIPIKHSSTEEI -------------------------!!!!----1111-----3333-------------- GLYILNQLIEEIDLPFLKTRSVNYMEVTVSESPSQKATVHRNI 
-------------------------------1111-------- >INITIATION FACTOR 2B; SWP:Q4Q0R9; PDB:2A0UA; SKPHHATLESIKYTPGSLRLLDQRKLPLETVFDDVLTVEDIWSAIKERVRGAPAIAVSAA -------------2222----3333-----------3333-------------------- LGIAVATQRKAANGELKSGREVQTFLLTSCDFVTSRPTAVNLFNCLRDLKAQVDKLDPTK ----------------------------------------3333--------33333333 AAAEVAQAFVELAEAVYTNDVAFNEGIRHGAAHILAAAKAEGRDKVSILTICNTGALATS ------------------------------------------------------3333-- RYGTALGVVRQLFYDGKLERVYACETRPWNQGARLTVYECVQEDIPCTLICDGAASSLLN ------------1111----------------------------------33333333-- RKIDAVVVGADRICQNGDTANKIGTYNLAVSAKFHGVKLYVAAPTTTLDVKTASGNHVEI -------------1111----2222------------------1111-3333-3333--- EEREPTEITTNLVTKQRVVADGPHLSIWNPVFDITPSELITGGIITEKGVQAPAASAPYY ----3333-------------1111----------3333------3333----------- DIASIIAQA 3333----- >Carbon dioxide concentrat; SWP:P73407; PDB:2A10A; QSAVGSIETIGFPGILAAADAMVKAGRITIVGYIRAGSARFTLNIRGDVQEVKTAMAAGI ------------------------------------%%%%-------------------- DAINRTEGADVKTWVIIPRPHENVVAVLPIDFSPEVEPFREAAE -----2222-----------------------3333-------- >RIBONUCLEASE III; SWP:P66666; PDB:2A11A; IRSRQPLLDALGVDLPDELLSLALTHRSYAYENGGLPTNERLEFLGDAVLGLTITDALFH --------3333------------------1111-------------------------- RHPDRSEGDLAKLRASVVNTQALADVARRLCAEGLGVHVLLGRGEANTGGADKSSILADG -1111-----------------------------3333---------------------- MESLLGAIYLQHGMEKAREVILRLFGPLLDAAPT ------------------------------3333 >AT1G79260; SWP:O64527; PDB:2A13A; PPVHPFVAPLSYLLGTWRGQGEGEYPTIPSFRYGEEIRFSHSGKPVIAYTQKTWKLESGA ---33331111-------------3333-------------------------------- PHAESGYFRPRPDGSIEVVIAQSTGLVEVQKGTYNVDEQSIKLKSDLVGNASKVKEISRE ----------1111-------1111----------1111--------------------- FELVDGKLSYVVRSTTTNPLQPHLKAILDKL ---iiii-------1111------------- >INDOLETHYLAMINE N-METHYLT; SWP:O95050; PDB:2A14A; FTGGDEYQKHFLPRDYLATYYSFDGSPSPEAEMLKFNLECLHKTFGPGGLQGDTLIDIGS ---------------------------------------------2222----------- 
GPTIYQVLAACDSFQDITLSDFTDRNREELEKWLKKEPGAYDWTPAVKFACELEGNSGRW ---3333-3333----------3333----------1111-----------11113333- EEKEEKLRAAVKRVLKCDVHLGNPLAPAVLPLADCVLTLLAMECACCSLDAYRAALCNLA -----------------1111-1111--------------3333---------------1 SLLKPGGHLVTTVTLRLPSYMVGKREFSCVALEKGEVEQAVLDAGFDIEQLLHSPQSYSV 1112222--------------!!!!---------------------------------33 TNAANNGVCCIVARKKP 33--------------- >HYPOTHETICAL PROTEIN RV07; SWP:P71817; PDB:2A15A; TQSPALIASQSSWRCVQAHDREGWLALMADDVVIEDPIGKSVTNPDGSGIKGKEAVGAFF ------------------------11111111--------1111---------------- DTHIAANRLTVTCEETFPSSSPDEIAHILVLHSEFDGGFTSEVRGVFTYRVNKAGLITNM ----1111-------------------------------------------3333----- RGYWNLDMMTFGN ----3333----- >Interferon-induced, doubl; SWP:P19525; PDB:2A19B; AHTVDKRFGMDFKEIELIGSGGFGQVFKAKHRIDGKTYVIKRVKYNNEKAEREVKALAKL --------------------------------------------------------1111 DHVNIVHYNGCWDGFDYDRSKTKCLFIQMEFCDKGTLEQWIEKRRGEKLDKVLALELFEQ -1111--------------------------------------1111------------- ITKGVDYIHSKKLINRDLKPSNIFLVDTKQVKIGDFGLVTSLKNDGKRRSKGTLRYMSPE ------------------3333-------------1111--------------1111333 QISSQDYGKEVDLYALGLILAELLHVCDTAFETSKFFTDLRDGIISDIFDKKEKTLLQKL 3------3333----------------------------1111--3333----------- LSKKPEDRPNTSEILRTLTVWKK ---3333---------------- >Carbon dioxide concentrat; SWP:P72761; PDB:2A1BA; SIAVGMIETRGFPAVVEAADSMVKAARVTLVGYEKIGSGRVTVIVRGDVSGVQASVSAGI --------------------3333------------------------------------ EAANRVNGGEVLSTHIIARPHENLEYVLPIRYTEEVEQFRT --------------------3333-----------3333-- >URIDYLATE KINASE; SWP:P43890; PDB:2A1FA; LSQPIYKRILLKLSGEALQGEDGLGIDPAILDRMAVEIKELVEMGVEVSVVLGGGNLFRG -------------3333--1111----3333----------1111--------3333--- AKLAKAGMNRVVGDHMGMLATVMNGLAMRDSLFRADVNAKLMSAFQLNGICDTYNWSEAI ---1111-------------------------1111----------2222---------- KMLREKRVVIFSAGTGNPFFTTDSTACLRGIEIEADVVLKATKVDGVYDCAKLYKNLSYA ------------!!!!-------------------------------------------- 
EVIDKELKVMDLSAFTLARDHGMPIRVFNMGKPGALRQVVTGTEEGTTICEG ----------------------------3333-------------------- >BRANCHED CHAIN AMINOTRANS; SWP:O15382; PDB:2A1HA; SSFKAADLQLEMTQKPHKKPGPGEPLVFGKTFTDHMLMVEWNDKGWGQPRIQPFQNLTLH ---3333-------------3333-----------------1111--------------1 PASSSLHYSLQLFEGMKAFKGKDQQVRLFRPWLNMDRMLRSAMRLCLPSFDKLELLECIR 111-----------------1111------------------1111-------------- RLIEVDKDWVPDAAGTSLYVRPVLIGNEPSLGVSQPRRALLFVILCPVGAYFPGGSVTPV -----3333---2222-------------------------------------------- SLLADPAFIRAWVGGVGNYKLGGNYGPTVLVQQEALKRGCEQVLWLYGPDHQLTEVGTMN ----3333---22221111-33333333-------1111--------1111--------- IFVYWTHEDGVLELVTPPLNGVILPGVVRQSLLDMAQTWGEFRVVERTITMKQLLRALEE ------1111-----------------------------------------------111 GRVREVFGSGTACQVCPVHRILYKDRNLHIPTMENGPELILRFQKELKEIQYGIRAHEWM 1---------------------%%%%----3333---------------1111---3333 FPV --- >DNA EXCISION REPAIR PROTE; SWP:P07992; PDB:2A1IA; NSIIVSPRQRGNPVLKFVRNVPWEFGDVIPDYVLGQSTCALFLSLRYHNLHPDYIHGRLQ -----3333--3333-----------------------------------1111------ SLGKNFALRVLLVQVDVKDPQQALKELAKMCILADCTLILAWSPEEAGRYLETYKAYEQK ------------------------------------------------------------ PADLLMEKL ---3333-- >DNA excision repair prote; SWP:P07992; PDB:2A1JB; DPADLLMEKLEQDFVSRVTECLTTVKSVNKTDSQTLLTTFGSLEQLIAASREDLALCPGL --------------------1111-------------------------33333333--- GPQKARRLFDVLHEPFLKV --3333------------- >GP32 SINGLE STRANDED DNA ; SWP:Q7Y265; PDB:2A1KA; DKGEWKLKLDASGNGQAVIRFLPAKTDDALPFAILVNHGFKKNGKWYIETCSSTHGDYDS ---------3333------------1111------------%%%%-----1111---111 CPVCQYISKNDLYNTNKTEYSQLKRKTSYWANILVVKDPQAPDNEGKVFKYRFGKKIWDK 1------------------------------------33333333--------3333--- INAMIAVDTEMGETPVDVTCPWEGANFVLKVKQVSGFSNYDESKFLNQSAIPNIDDESFQ -------3333-----1111-------------iiii--1111-------2222------ KELFEQMVDLSEMTSKDKFKSFEELNTKFNQVLGT ---1111-3333--1111----------------- >Phosphatidylinositol tran; SWP:P53812; PDB:2A1LA; 
VLIKEFRVVLPCSVQEYQVGQLYSVAEASKNETGGGEGIEVLKNEPYENDGEKGQYTHKI ------------3333-------------1111iiii----------------------- YHLKSKVPAFVRMIAPEGSLVFHEKAWNAYPYCRTIVTNEYMKDDFFIKIETWHKPDLGT -------3333----2222-------------------3333------------------ LENVHGLDPNTWKTVEIVHIDIADRSQVEPADYKADEDPALFQSVKTKRGPLGPNWKKEL --1111-33331111-----111111113333-11111111-----------1111---- ANTPCPKMCAYKLVTIKFKWWGLQSKVENFIQKQEKRIFTNLHRQLFCWIDKWIDLTMED -------------------2222-------------------------333311113333 IRRMEDETQKELETMRKKGSVRGTSAADA ----------------------------- >POLY(A)-SPECIFIC RIBONUCL; SWP:O95453; PDB:2A1RA; MEIIRSNFKSNLHKVYQAIEEADFFAIDGEFSGISDGPSVGFDTPEERYQKLKKHSMDFL ---3333-1111------------------------------------------------ LFQFGLCTFKYDYTDSKYITKSFNFYVFPKPFNRSSPDVKFVCQSSSIDFLASQGFDFNK --------------------------------3333---------------1111----- VFRNGIPYLNQEEERQLRHAKEQEELNDAVGFSRVIHAIANSGKLVIGHNMLLDVMHTVH ------------------3333-----3333--------3333----------------- QFYCPLPADLSEFKEMTTCVFPRLLDTKLMASTQPFKDIINNTSLAELEKRLKETPFNPP --------3333------------------------------------------------ KVESAEGFPSYQLHEAGYDAYITGLCFISMANYLGSFLSPPKIHVSARSKLIEPFFNKLF ----2222------------------------3333---------11111111------- >CONSERVED HYPOTHETICAL PR; SWP:Q9RRT5; PDB:2A1VA; LQTPMQTVDDLRSVCDELPHSLETFPFDDETLVFKVGYLSKSRMYALTDITQDPLRLSLK -------------3333--------------------1111-------1111-------- VDPERGEELRQAHPQSIAPGYHLNKKHWVTVTLDGTVPAELLGELLRGSYLLVTKKGFTK ------------1111-------3333--------------------------------- AERKELGLPDSLEGGSHH ------------------ >PHYTANOYL-COA DIOXYGENASE; SWP:O14832; PDB:2A1XA; QFQYTLDNLTLEQRKFYEENGFLVIKNLVPDADIQRFRNEFEKICRKEVKPLGLTVMRDV -------------------------------------------1111------------- TISKSEKMITKVQDFQEDKELFRYCTLPEILKYVECFTGPNIMAMHTMLINKPPDPLHQD --------------1111333333333333---3333---------------------33 LHYFPFRPSDLIVCAWTAMEHISRNNGCLVVLPGTHKGSLKPHDYHGIQDEENKARVHLV 33-----3333-----------3333-----2222------------------------- 
MEKGDTVFFHPLLIHGSGQNKTQGFRKAISCHFASADCHYIDVKGTSQENIEKNLKDIWM -2222----1111---------------------1111----2222-3333--3333--- FRARLVKGERTNL ------------- >Regulating synaptic membr; SWP:Q9JIR9; PDB:2A20A; QEQKGDAPTCGICHKTKFADGCGHNCSYCQTKFCARCGGRVSLRSNKVMWVCNLCRKQQE ---------------------------------1111-------------------1111 >VACUOLAR PROTEIN SORTING ; SWP:Q5CNU4; PDB:2A22A; DFGDLVLLIGDLKIPYGAKELPSNFRELLATDKINYVLCTGNVCSQEYVEMLKNITKNVY --------------1111---3333-33333333-----------------1111----- IVSGDLDSAIFNPDPESNGVFPEYVVVQIGEFKIGLMHGNQVLPWDDPGSLEQWQRRLDC ---1111------3333-----------!!!!------1111-2222------------- DILVTGHTHKLRVFEKNGKLFLNPGTATGAFSALTPDAPPSFMLMALQGNKVVLYVYDLR ---------------iiii------1111--3333------------!!!!--------i DGKTNVAMSEFSK iii---------- >V(D)J RECOMBINATION ACTIV; SWP:P21784; PDB:2A23A; GPLGSPEFGYWITCCPTCDVDINTWVPFYSTELNKPAMIYCSHGDGHWVHAQCMDLEERT --------------------3333--------------------------3333---333 LIHLSEGSNKYYCNEHVQIARA 3--1111--------------- >UBIQUITIN LIGASE SIAH1; SWP:Q8IUQ4; PDB:2A25A; PYSCPCPSCKWQGSLDAVMPHLMHQHKSITTLQGEDIVFLATDINVDWVMMQSCFGFHFM -------------3333--------1111------------------------iiii--- LVLEKQQQFFAIVQLIGTRKQAENFAYRLELNGHRRRLTWEATPRSIHEGIATAIMNSDC -----------------33331111--------------------3333-----1111-- LVFDTSIAQLFAENGNLGINVTISMC -------------------------- ------------------------------------------------ >BZZ1 PROTEIN; SWP:P38822; PDB:2A28A; GAMEAIYAYEAQGDDEISIDPGDIITVIRGDDGSGWTYGECDGLKGLFPTSYCK ------------1111---2222-----------------iiii----1111-- >DEATH-ASSOCIATED PROTEIN ; SWP:Q9UIK4; PDB:2A2AA; MEPFKQQKVEDFYDIGEELGSGQFAIVKKCREKSTGLEYAAKFIKKRQSRASRRGVSREE -------3333----------1111----------------------------------- IEREVSILRQVLHHNVITLHDVYENRTDVVLILELVSGGELFDFLAQKESLSEEEATSFI -------1111-1111--------1111-----------3333-1111------------ KQILDGVNYLHTKKIAHFDLKPENIMLLDKNIPIPHIKLIDFGLAHEIEDGVEFKNIFGT --------------------3333-----------------1111---2222-------3 
PEFVAPEIVNYEPLGLEADMWSIGVITYILLSGASPFLGDTKQETLANITSVSYDFDEEF 333-3333------3333----------------1111------------------3333 FSHTSELAKDFIRKLLVKETRKRLTIQEALRHPWITPVDNQQAMVRRESVVNLENFRKQY 1111-------1111---3333---------3333------------------------- VRRR ---- >BACTERIOCIN CURVACIN A; SWP:P0A311; PDB:2A2BA; ARSYGNGVYCNNKKCWVNRGEATQSIIGGMISGWASGLAGM ------------------------33333333--------- >N-ACETYLGALACTOSAMINE KIN; SWP:Q01415; PDB:2A2CA; ATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLPM ----------3333-----------------------------------3333------- AVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIDKTKPLWHNYFLCGLKGIQEHFG -----------------------3333---------------3333----------1111 LSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSERYIG --------------------------------------------------------1111 TEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHFNIR ----3333------2222----------------1111-----------3333------- VMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEICRC ------------1111-3333--3333-------3333---------------------- LGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLLGEL -------------1111--------------------------------1111------- MNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSFLAN --------------------------1111-------------------3333------- VHKAYYQKQSLFATKPGGGALVLLEA -------1111--------------- >Exocyst complex component; SWP:Q9VDE6; PDB:2A2FX; NILWELLHNMRDHYNEVLLQRWVHVFREILDKEQFLPMVVQNTEEYECIIERFPFHSEQL --------------------------------%%%%-----1111--------------- EPKKFPFSRMVPEVYHQAKEFMYACMKFAEELTLSPNEVAAMVRKAANLLLTRSFSGCLS ------------------------------------------------------------ VVFRQPSITLTQLIQIIIDTQYLEKAGPFLDEFVCHMTNTERAMFHVARQDAEKQVGLRI ----3333---------------------------------------------------- CSKIDEFFELSAYDWLPGIASAFITDMISYLKSTFDSFAFKLPHIAQAACRRTFEHIAEK -------11111111----------------------1111------------------- IYSIMYDSTGALTQINLDLMQCEFFAASEPVPGLKEGELSKYFLRNRQLLDLLILE --------3333------------1111---------3333--------------- 
>PYRIDOXAMINE 5'-PHOSPHATE; SWP:P65682; PDB:2A2JA; PEKDGGDLDFDWLDDGWLTLLRRWLNDAQRAGVSEPNAMVLATVADGKPVTRSVLCKILD -1111---33331111------------1111--1111------iiii-----------1 ESGVAFFTSYTSAKGEQLAVTPYASATFPWYQLGRQAHVQGPVSKVSTEEIFTYWSMRPR 111-----1111-----------------3333--------------------1111--- GAQLGAWASQQSRPVGSRAQLDNQLAEVTRRFADQDQIPVPPGWGGYRIAPEIVEFWQGR -------------------------------1111------------------------- ENRMHNRIRVANGRLERLQPGS ----------iiii-------- >M-PHASE INDUCER PHOSPHATA; SWP:P30305; PDB:2A2KA; MELIGDYSKAFLLQTVDGKHQDLKYISPETMVALLTGKFSNIVDKFVIVDCRYPYEYEGG ---1111------------1111-------------1111----------------1111 HIKTAVNLPLERDAESFLLKSPIAPKRVILIFHSEFSSERGPRMCRFIRERDRAVNDYPS -2222-----------------------------------------------1111---- LYYPEMYILKGGYKEFFPQHPNFCEPQDYRPMNHEAFKDELKTFRLKTRSW --------2222---33331111-------1111----------------- >UNKNOWN; SWP:NA; PDB:2A2LA; SLMNKSQQVQTITLAAAQQMAAAVEKKATEINVAVVFSVVDRGGNTLLIQRMDEAFVSSC ----------------------------1111--------1111-------11113333- DISLNKAWSACSLKQGTHEITSAVQPGQSLYGLQLTNQQRIIIFGGGLPVIFNEQVIGAV ----------1111-33333333-2222-2222--%%%%------------%%%%----- GVSGGTVEQDQLLAQCALDCFSALE -------------------3333-- >HYPOTHETICAL PROTEIN BT31; SWP:Q8A309; PDB:2A2MA; RKRTFAIPASRLTGRLTTLKSDVPAADSLFWKLWNGSLDTAVQVLQTDYFKGIAAGTLDP -------3333-!!!!--------1111--------------3333-------------- NAYGSLVQDGYYCFRGRDDYATAATCAQDETLREFFKAKAKSYDEYNETYHQTWHLREAS ---------------------------------------------------------333 GLIPGTDIKDYADYEAYVAGSLASPYCVVLPCEYLWPWIANFLDGYTPTNSLYRFWIEWN 3---------------------3333---3333---------1111-1111---3333-- GGTPNGAYQGNLEQYRDKIDEDKAVEIFNTANYELKVFTSSTILT ---3333----33331111--------------------1111-- >peptidylprolyl isomerase ; SWP:Q96BP3; PDB:2A2NA; QAEGPKRVSDSAIIHTSMGDIHTKLFPVECPKTVENFCVHSRNGYYNGHTFHRIIKGFMI ---------------1111------------------------1111-------2222-- QTGDPTGTGMGGESIWGGEFEDEFHSTLRHDRPYTLSMANAGSNTNGSQFFITVVPTPWL ---1111------1111-------1111----------------------------1111 
DNKHTVFGRVTKGMEVVQRISNVKVNPKTDKPYEDVSIINITVK ------------3333---1111--------------------- >SELENOPROTEIN M; SWP:Q8VHC3; PDB:2A2PA; MTNYRPDWNRLRGLARGRVETCGGCQLNRLKEVKAFVTEDIQLYHNLVMKHLPGADPELV ------3333-----------1111-3333-3333---3333------------------ LLSRNYQELERIPLSQMTRDEINALVQELGFYRKSAPEAQVPPEYLWAPAKPPEEASEHD -------------3333------------------1111--3333--------------- DLEHHHHHH -iiii---- >GLUTATHIONE S-TRANSFERASE; SWP:P09211; PDB:2A2RA; MPPYTVVYFPVRGRCAALRMLLADQGQSWKEEVVTVETWQEGSLKASCLYGQLPKFQDGD -----------!!!!-------1111--------3333----3333-1111------!!! LTLYQSNTILRHLGRTLGLYGKDQQEAALVDMVNDGVEDLRCKYISLIYTNYEAGKDDYV !--------------------------------------------------1111----- KALPGQLKPFETLLSQNQGGKTFIVGDQISFADYNLLDLLLIHEVLAPGCLDAFPLLSAY --3333----------%%%%---------3333-------------22221111------ VGRLSARPKLKAFLASPEYVNLPINGNGKQ ------------------1111-------- >ALPHA-2U-GLOBULIN; SWP:P02761; PDB:2A2UA; EEASSTRGNLDVAKLNGDWFSIVVASNKREKIEENGSMRVFMQHIDVLENSLGFKFRIKE ---3333---3333-------------3333-2222-----------2222--------% NGECRELYLVAYKTPEDGEYFVEYDGGNTFTILKTDYDRYVMFHLINFKNGETFQLMVLY %%%---------------------------------------------iiii-------- GRTKDLSSDIKEKFAKLCEAHGITRDNIIDLTKTDRCL -----------------------1111--3333----- >JINGZHAOTOXIN-XI; SWP:P0C247; PDB:2A2VA; ECRKMFGGCSVDSDCCAHLGCKPTLKYCAWDGTF ---------------------3333--------- >TRYPSIN; SWP:P00761; PDB:2A31A; IVGGYTCAANSIPYQVSLNSGSHFCGGSLINSQWVVSAAHCYKSRIQVRLGEHNIDVLEG -------22221111---------------1111---1111------------1111--- NEQFINAAKIITHPNFNGNTLDNDIMLIKLSSPATLNSRVATVSLPRSCAAAGTECLISG ------------1111--------------------1111-------------------- WGNTKSSGSSYPSLLQCLKAPVLSDSSCKSSYPGQITGNMICVGFLEGGKDSCQGDSGGP -----------------------333333332222-1111-------------------- VVCNGQLQGIVSWGYGCAQKNKPGVYTKVCNYVNWIQQT ---------------------------------3333-- >HYPOTHETICAL PROTEIN; SWP:Q9ZQD5; PDB:2A33A; QKSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSIGLGLVSQAVHDGG ------------------3333-----------1111----------------------- 
RHVIGIIPKTLVGEVRAVADHQRKAEAKHSDAFIALPGGYGTLEELLEVITWAQLGIHDK -------3333---------------1111----------------------1111---- PVGLLNVDGYYNSLLSFIDKAVEEGFISPTAREIIVSAPTAKELVKKLEE ------%%%%-----------------33333333--------------- >HYPOTHETICAL PROTEIN PA40; SWP:Q9HX10; PDB:2A35A; TPKRVLLAGATGLTGEHLLDRILSEPTLAKVIAPARKALAEHPRLDNPVGPLAELLPQLD --------1111------------1111-------------1111-----33333333-- GSIDTAFCCLGTTIKEAGSEEAFRAVDFDLPLAVGKRALEMGARHYLVVSALGADAKSSI ------------3333----------------------1111--------22221111-- FYNRVKGELEQALQEQGWPQLTIARPSLLFGPREEFRLAEILAAPIAGKYHGIEACDLAR ------------1111--------------1111--3333-------------------- ALWRLALEEGKGVRFVESDELRKLGKGS ----1111-------------------- >PROTEIN E(SEV)2B; SWP:Q08012; PDB:2A36A; MEAIAKHDFSATADDELSFRKTQILKILNMEDDSNWYRAELDGKEGLIPSNYIEMKNHD ------------1111---2222-------------------------3333------- >TITIN ISOFORM N2-B; SWP:Q8WZ42; PDB:2A38A; MTTQAPTFTQPLQSVVVLEGSTATFEAHISGFPVPEVSWFRDGQVISTSTLPGVQISFSD -----------------2222-------------------%%%%------2222----ii GRAKLTIPAVTKANSGRYSLKATNGSGQATSTAELLVKAETAPPNFVQRLQSMTVRQGSQ ii--------1111---------1111----------------------------2222- VRLQVRVTGIPTPVVKFYRDGAEIQSSLDFQISQEGDLYSLLIAEAYPEDSGTYSVNATN ------------------iiii----3333----!!!!--------3333---------1 SVGRATSTAELLVQ 111----------- >DE NOVO THREE-HELIX BUNDL; SWP:NA; PDB:2A3DA; MGSWAEFKQRLAAIKTRLQALGGSEAELAAFEKEIAAFESELQAYKGKGNPEVEALRKEA 3333----------------3333---3333--3333-3333------------------ AAIRDELQAYRHN -------3333-- >U1 SMALL NUCLEAR RIBONUCL; SWP:P09012; PDB:2A3JA; TPPHTEPSQVVLITNINPEVPKEKLQALLYALASSQGDILDIVVDLSDDNSGKAYIVFAT ----------------33333333------------------------------------ QESAQAFVEAFQGYPFQGNPLVITFSETPQSQVAED --------------------------------3333 >AMP DEAMINASE; SWP:O80452; PDB:2A3LA; QPDPIAADILRKEPEQETFVRLNVPLEVPTSDEVEAYKCLQECLELRKRYVFQETVAPWE ------------------------------------------------------------ KEEPFAHYPQGKSDHCFEMQDGVVHVFANKDAKEDLFPVADATAFFTDLHHVLKVIAAGN 
-------------------iiii------------------------------3333--- IRTLCHRRLVLLEQKFNLHLMLNADKEFLAQKSAPHRDFYNVRKVDTHVHHSACMNQKHL -----------------------------11113333----------------------- LRFIKSKLRKEPDEVVIFRDGTYLTLREVFESLDLTGYDLNVDLLDVHADKSTFHRFDKF ------------------------------------1111------------------33 NLKYNPCGQSRLREIFLKQDNLIQGRFLGEITKQVFSDLEASKYQMAEYRISIYGRKMSE 33--------3333--------------3333------------------------3333 WDQLASWIVNNDLYSENVVWLIQLPRLYNIYKDMGIVTSFQNILDNIFIPLFEATVDPDS --------1111--3333--------3333--------3333---------3333-3333 HPQLHVFLKQVVGFDLVDDESKPERRPTKHMPTPAQWTNAFNPAFSYYVYYCYANLYVLN ------------------1111----------1111------------------------ KLRESKGMTTITLRPHSGEAGDIDHLAATFLTCHSIAHGINLRKSPVLQYLYYLAQIGLA ---1111--------------------------------3333----------------- MSPLSNNSLFLDYHRNPFPVFFLRGLNVSLSTDDPLQIHLTKEPLVEEYSIAASVWKLSA -3333-------1111---------------------------3333-----------33 CDLCEIARNSVYQSGFSHALKSHWIGKDYYKRGPDGNDIHKTNVPHIRVEFRDTIWKEEM 33-------3333---33333333-1111---1111-3333---3333------------ QQVYLGKAVISDEVVP ---------------- >COG3005: Nitrate/TMAO red; SWP:Q30WH0; PDB:2A3MA; AEAPADGLKMENTKMPVIFNHSSHSSYQCADCHHPVDGKENLAKCATAGCHDVFDKKDKS -------------------333311111111----iiii----1111-------3333-1 VHSYYKIIHDRKATTVATCMSCHLEAAGSDKDLKKELTGCKKSKCHP 111-----------------------!!!!------------3333- >putative glucosamine-fruc; SWP:Q8ZJX7; PDB:2A3NA; LGFNQDEYLTSAREIIAARQKAEQVADEIYQAGFSSLFFASVGGSLAPAINEFAKELTTL ------------------------------------------3333-------------- PVYVEQAAELIHKGNKRLNKDSVVITLSKSGDTKESVAIAEWCKAQGIRVVAITKNADSP --------------11111111-----3333------------1111--------1111- LAQAATWHIPRHKNGVEYEYLLYWLFFRVLSRNNEFASYDRFASQLEILPANLLKAKQKF -1111---------3333-----------------1111----3333------------- DPQADAIASRYHNSDYWVGGAEWGEVYLFSCILEEQWKRTRPVSSAEFFHGALELLEKDV ----------1111-----------------------------3333---3333--1111 PLILVKGEGKCRALDERVERFASKITDNLVVIDPKAYALDGIDDEFRWIAPCVVSTLLVD 
--------1111--------------------3333--22223333-------------- RLAAHFEKYTGHSLDIRRYYRQFDY ------------1111--2222--- >MYOCYTE NUCLEAR FACTOR; SWP:P42128; PDB:2A3SA; ESKPPYSYAQLIVQAISSAQDRQLTLSGIYAHITKHYPYYRTADKGWQNSIRHNLSLNRY ---------------1111-----1111---3333------------------------- FIKVPRSQEEPGKGSFWRIDPASEAKLVEQAFRKRRQRGVS --------------------1111-----1111-------- >SITE-SPECIFIC RECOMBINASE; SWP:O68847; PDB:2A3VA; GSQFLLSVREFMQTRYYAKKTIEAYLHWITRYIHFHNKKHPSLMGDKEVEEFLTYLAVQG ---------------------------------------3333----------------- KVATKTQSLALNSLSFLYKEILKTPLSLEIRFQRSQLERKLPVVLTRDEIRRLLEIVDPK ---------------------------------------------------------333 HQLPIKLLYGSGLRLMECMRLRVQDIDFDYGAIRIWQGKGGKNRTVTLAKELYPHLKEQI 3-3333---------------3333-----------------------3333-------- ALAKRYYDRDLHQKNYGGVWLPTALKEKYPNAPYEFRWHYLFPSFQLSLDPESDVMRRHH ----------------------------1111--3333---------------------- MNETVLQKAVRRSAQEAGIEKTVTCHTLRHSFATHLLEVGADIRTVQEQLGHTDVKTTQI -----------------------3333---------1111-3333--------3333--- YTHSGVLSPLSRL -------3333-- -------------------------------- >Fibrinogen alpha chain [P; SWP:P02671; PDB:2A45G; SACKDSDWPFCSDEDWNYKCPSGCRMKGLIDEVNQDFTNRINKLKNSL -----------3333--------------------------------- >Fibrinogen beta chain [Pr; SWP:P02675; PDB:2A45H; KVERKAPDAGGCLHADPDLGVLCPTGCQLQEALLQQERPIRNSVDELNNNVE ---------------3333--------------------------------- >Fibrinogen gamma chain [P; SWP:P02679; PDB:2A45I; DNCCILDERFGSYCPTTCGIADFLSTYQTKVDKDLQSLED ------3333------------------------------ >GFP-LIKE FLUORESCENT CHRO; SWP:Q9U6Y6; PDB:2A46A; NKFIGDDMKMTYHMDGCVNGHYFTVKGEGNGKPYEGTQTSTFKVTMANGGPLAFSFDILS 3333-------------iiii----------3333----------1111---------11 TVFNRCFTAYPTSMPDYFKQAFPDGMSYERTFTYEDGGVATASWEISLKGNCFEHKSTFH 11-3333---3333-3333--------------1111-----------!!!!-------- GVNFPADGPVMAKKTTGWDPSFEKMTVCDGILKGDVTAFLMLQGGGNYRCQFHTSYKTKK ----1111-------------------iiii----------1111--------------- PVTMPPNHVVEHRIARTDLDKGGNSVQLTEHAVAHIT 
-------------------3333-------------- >DEOXYRIBOSE-PHOSPHATE ALD; SWP:Q7RMC9; PDB:2A4AA; NYTEKFAAWSVICLTDHTFLDENGTEDDIRELCNESVKTCPFAAAVCVYPKFVKFINEKI --------------------1111------------------------3333-------- KQEINPFKPKIACVINFPYGTDSMEKVLNDTEKALDDGADEIDLVINYKKIIENTDEGLK ----------------------------------1111---------------------- EATKLTQSVKKLLTNKILKVIIEVGELKTEDLIIKTTLAVLNGNADFIKTSTGKVQINAT ----------------------3333--------------1111---------------- PSSVEYIIKAIKEYIKNNPEKNNKIGLKVSGGISDLNTASHYILLARRFLSDNFRIGSSS -----------------3333--------------------------------------- LVIKLRKVIS ---------- >CADHERIN-11; SWP:P55288; PDB:2A4CA; SGWVWNQFFVIEEYTGPDPVLVGRLHSDIDSGDGNIKYILSGEGAGTIFVIDDKSGNIHA ----------1111------------1111-----------2222--------------- TKTLDREERAQYTLMAQAVDRDTNRPLEPPSEFIVKVQD ----3333------------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:Q13404; PDB:2A4DA; GVKVPRNFRLLEELEEGQKGVGDGTVSWGLEDDEDMTLTRWTGMIIGPPRTIYENRIYSL ---------------1111-!!!!-------1111---------------1111------ KIECGPKYPEAPPFVRFVTKINMNGVNSSNGVVDPRAISVLAKWQNSYSIKVVLQELRRL ----1111--------------2222-------33333333---1111----------33 MMSKENMKLPQPPEGQCYS 33---1111---------- >SELENOPROTEIN SEP15; SWP:Q9VVJ7; PDB:2A4HA; MASHHHHHHLDQQPAAQRTYAKAILEVCTCKFRAYPQIQAFIQSGRPAKFPNLQIKYVRG ------------------------------33333333-----------1111------- LDPVVKLLDASGKVQETLSITKWNTDTVEEFFETHLAKDGAGKNSYSVVEDADGDDDEDY ------------------------------------------------------------ LRTNRI ------ >3-OXOACYL-[ACYL CARRIER P; SWP:Q53WH2; PDB:2A4KA; GRLSGKTILVTGAASGIGRAALDLFAREGASLVAVDREERLLAEAVAALEAEAIAVVADV 1111-------1111----------1111----------------3333----------- SDPKAVEAVFAEALEEFGRLHGVAHFAGVAHSALPLEAWEKVLRVNLTGSFLVARKAGEV --------------------------11111111-------------------------- LEEGGSLVLTGSVAGLGAFGLAHYAAGKLGVVGLARTLALELARKGVRVNVLLPGLIQTP -2222-------2222--------------------------1111------------33 MTAGLPPWAWEQEVGASPLGRAGRPEEVAQAALFLLSEESAYITGQALYVDGGRSIV 
33---3333-------3333---3333---------1111----------iiii--- >NS3 PROTEASE/HELICASE; SWP:Q91RS4; PDB:2A4RA; APITAYAQQTRGLLGCIITSLTGRDKNQVEGEVQIVSTATQTFLATCINGVCWTVYHGAG -----------------------------------------------iiii---1111-- TRTIASPKGPVIQMYTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADVIPVRRRG -----1111--------1111--------------------------1111--------- DSRGSLLSPRPISYLKGSSGGPLLCPAGHAVGLFRAAVCTRGVAKAVDFIPVENLETTMR ----------33332222------1111-----------iiii--------------111 S 1 >PEROXIREDOXIN DOT5; SWP:P40553; PDB:2A4VA; DVNELEIGDPIPDLSLLNEDNDSISLKKITENNRVVVFFVYPRASTPGSTRQASGFRDNY -----2222--------1111--------------------------------------- QELKEYAAVFGLSADSVTSQKKFQSKQNLPYHLLSDPKREFIGLLGAKKTPLSGSIRSHF --1111-----------------------------1111----------3333------- IFVDGKLKFKRVKISPEVSVNDAKKEVLEVAEKFKE --iiii------------------------------ >MITOMYCIN-BINDING PROTEIN; SWP:O05205; PDB:2A4XA; SARISLFAVVVEDMAKSLEFYRKLGVEIPAEADSAPHTEAVLDGGIRLAWDTVETVRSYD ---------------------1111---3333---------------------------1 PEWQAPTGGHRFAIAFEFPDTASVDKKYAELVDAGYEGHLKPWNAVWGQRYAIVKDPDGN 111----------------------------1111---------1111-------1111- VVDLFAPLPLE -------3333 >GFP-like non-fluorescent ; SWP:Q9GZ28; PDB:2A50A; GSASFLKKTMPFKTTIEGTVNGHYFKCTGKGEGNPFEGTQEMKIEVIEGGPLPFAFHILS -3333--------------iiii-----------1111----------------333311 TSC 11- >GFP-like non-fluorescent ; SWP:Q9GZ28; PDB:2A50B; SKTFIKYVSGIPDYFKQSFPEGFTWERTTTYEDGGFLTAHQDTSLDGDCLVYKVKILGNN 3333---iiii-3333--------------1111-----------!!!!----------- FPADGPVMQNKAGRWEPATEIVYEVDGVLRGQSLMALKCPGGRHLTCHLHTTYRSKKPAS -1111-1111--------------%%%%----------2222---------------333 ALKMPGFHFEDHRIEIMEEVEKGKCYKQYEAAVGRYCDAAPSKLGHN 3---------------------------------------------- >NUCLEOCAPSID PROTEIN; SWP:Q9Q095; PDB:2A51A; LTCFNCGKPGHTARMCRQPRQEGCWNCGSKEHRFAQCPK -----------3333-----------------3333--- ------------------------------------------------------------ ------------------------------------------------------------ ------------- 
>ADP-RIBOSYLATION FACTOR 6; SWP:P62330; PDB:2A5DA; KEMRILMLGLDAAGKTTILYKLKLGQSVTTIPTVGFNVETVTYKNVKFNVWDVGGQDKIR ---------2222-----------------------------%%%%---------33331 PLWRHYYTGTQGLIFVVDCADRDRIDEARQELHRIINDREMRDAIILIFANKQDLPDAMK 111---2222-------11111111------------1111----------3333----- PHEIQEKLGLTRIRDRNWYVQPSCATSGDGLYEGLTWLTSNYK --------1111------------1111----------1111- >L-LYSINE 2,3-AMINOMUTASE; SWP:Q9XBQ8; PDB:2A5HA; NRRYELFKDVSDADWNDWRWQVRNRIETVEELKKYIPLTKEEEEGVAQCVKSLRAITPYY 3333--1111-----------1111------1111---------------------3333 LSLIDPNDPNDPVRKQAIPTALELNKAAADLEDPLHEDTDSPVPGLTHRYPDRVLLLITD 11111111----3333-----1111-1111--1111------2222-------------- CSYCRHCTRRRFAGQSDDSPERIDKAIDYIRNTPQVRDVLLSGGDALLVSDETLEYIIAK ---1111-3333-1111---------------1111--------1111------------ LREIPHVEIVRIGSRTPVVLPQRITPELVNLKKYHPVWLNTHFNHPNEITEESTRACQLL ---1111-------3333-3333-3333--3333----------3333-----------3 ADAGVPLGNQSVLLRGVNDCVHVKELVNKLVKIRVRPYYIYQCDLSLGLEHFRTPVSKGI 333----------2222--3333-------1111-----------22221111------- EIIEGLRGHTSGYCVPTFVVDAPGGGGKTPVPNYVISQSHDKVILRNFEGVITTYSEPIN ----------1111-------2222---------------------1111---------- YTPGCNCDVCTGKKKVHKVGVAGLLNGEGALEPVGLERNK ------3333-------------1111-----22221111 >RAS-RELATED PROTEIN RAB-2; SWP:Q8WUD1; PDB:2A5JA; HSSGLVPRGSYLFKYIIIGDTGVGKSCLLLQFTDKRFQPIGVEFGARMVNIDGKQIKLQI ------2222---------2222---------------------------iiii------ WDTAGQESFRSITRSYYRGAAGALLVYDITRRETFNHLTSWLEDARQHSSSNMVIMLIGN --iiii------33332222-------1111------------------1111------- KSDLESRRDVKREEGEAFAREHGLIFMETSAKTACNVEEAFINTAKEIYRKIQQGLF 33331111--3333-------------------2222--------------1111-- >TRP REPRESSOR BINDING PRO; SWP:NA; PDB:2A5LA; SPYILVLYYSRHGATAEARQIARGVEQGGFEARVRTVPAVSTALYATLEDLKNCAGLALG ------------3333---------1111------------------------------- SPTRFGNASPLKYFLDGTSSLWLTGSLVGKPAAVFTSTASLHGGQETTQLSLLPLLHHGL ---iiii-----------------1111-------------------------------- VLGIPYTPYGASHFAGADGKRSLDEHELTLCRALGKRLAETAGKLGS 
------1111-----1111---------------------------- >N-METHYL-D-ASPARTATE RECE; SWP:Q00959; PDB:2A5SA; NHLSIVTLEEAPFVIVEDIDPLETCVRNTVPCRKFVKINNSTNEGMNVKKCCKGFCIDIL -------------------------!!!!------------------------------- KKLSRTVKFTYDLYLVTNGKHGKKVNNVWNGMIGEVVYQRAVMAVGSLTINEERSEVVDF ------------------------%%%%-------1111-----------3333------ SVPFVETGISVMVSRGTQVTGLSDKKFQRPHDYSPPFRFGTVPNGSTERNIRNNYPYMHQ --------------------111133331111------------------3333------ YMTRFNQRGVEDALVSLKTGKLDAFIYDAAVLNYKAGRDEGCKLVTIGSGYIFATTGYGI -3333-----------1111--------------33332222--------3333------ ALQKGSPWKRQIDLALLQFVGDGEMEELETLWLTGICH --2222-------------------------------- >Cell death protein 4; SWP:P30429; PDB:2A5YB; MLCEIECRALSTAHTRLIHDFEPRDALTYLEGKNIFTEDHSELISKMSTRLERIANFLRI ---------------------3333----------------------------------- YRRQASELGPLIDFFNYNNQSHLADFLEDYIDFAINEPDLLRPVVIAPQFSRQMLDRKLL ---------------1111--------------------------3333----------1 LGNVPKQMTCYIREYHVDRVIKKLDEMCDLDSFFLFLHGRAGSGKSVIASQALSKSDQLI 111-----------------------1111---------2222----------------- GINYDSIVWLKDSGTAPKSTFDLFTDILLMLKSEDDLLNFPSVEHVTSVVLKRMICNALI ---------------1111----------1111---------1111----------3333 DRPNTLFVFDDVVQEETIRWAQELRLRCLVTTRDVEISNAASQTCEFIEVTSLEIDECYD -------------------------------------3333------------------- FLEAYGMPMPEKEEDVLNKTIELSSGNPATLMMFFKSCEPKTFEKMAQLNNKLESRGLVG --1111-------------3333%%%%------3333----------------------- VECITPYSYKSLAMALQRCVEVLSDEDRSALAFAVVMPPGVDIPVKLWSCVIPVEQLDDE ----------------3333------------------------3333------------ VADRLKRLSKRGALLSGKRMPVLTFKIDHIIHMFLKHVVDAQTIANGISILEQRLLEIET ----------------------------------------------1111---------- VIRPEDFPKFMQLHQKFYDSL ---1111---3333------- >HYPOTHETICAL PROTEIN SO29; SWP:Q8ED25; PDB:2A5ZA; TTPGLSPSEKLKLSTLTTSIATSDFYASYDFHSIGLTSANNISLLSTGNISLQNILSEGN -----------3333-----------------2222--iiii----%%%%-------!!! 
HFGVQPIVSSTTANASFLAGLAIFPKESELEVTVYFKTPSAFNPAQLTVIGSTSIGLGIS !----------2222---------3333--------------1111--------!!!!-- DRSGLIIENGNAFGGIVKASAATETGSTYALSTSTWYICKFKLTDDRFKVTLYSDSGTQL -------!!!!------iiii---------------------1111-------1111--- YSYTSTAAFRADNATAHIGFKTQCKTATAGISLISIDLIEFKAKVSATRAKV ---------------------------------------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9WZG9; PDB:2A61A; KQPFERILREICFMVKVEGRKVLRDFGITPAQFDILQKIYFEGPKRPGELSVLLGVAKST ---------------------3333----------------------------------- VTGLVKRLEADGYLTRTPDPADRRAYFLVITRKGEEVIEKVIERRENFIEKITSDLGKEK ------------------1111-------------------------------------- SSKILDYLKELKGVMERNFSKQ ---------------1111--- >REGULATORY PROTEIN CRO; SWP:P03040; PDB:2A63A; MEQRITLKDYAMRFGQTKTAKDLGVQQSAINKWIHAGRKIFLTINADGSVYAEEVKPDPS --------------------3333---------1111----------------------- NKKTTA ------ >ORPHAN NUCLEAR RECEPTOR N; SWP:O00482; PDB:2A66A; ELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKRYTCIENQNCQIDKTQRKRCPYCRF -------------iiii------------------------------1111--------- QKCLSVGMKLEAVRADRMRGGRNKFGPMYKRDRAL --------3333-1111-----1111--------- >ISOCHORISMATASE FAMILY PR; SWP:Q82ZG7; PDB:2A67A; AKNRALLLIDFQKGIESPTQQLYRLPAVLDKVNQRIAVYRQHHAPIIFVQHEETELPFGS -----------1111------------------------1111---------11112222 DSWQLFEKLDTQPTDFFIRKTHANAFYQTNLNDLLTEQAVQTLEIAGVQTEFCVDTTIRA 1111-3333--3333--------1111---------------------1111-------- HGLGYTCLTPKTTSTLDNGHLTAAQIIQHHEAIWAGRFLTFLS 1111-------------------------------1111---- >HYPOTHETICAL PROTEIN TM08; SWP:Q9WZX7; PDB:2A6AA; NVLALDTSQRIRIGLRKGEDLFEISYTGEKKHAEILPVVVKKLLDELDLKVKDLDVVGVG ----------------!!!!--------3333-----------------3333------- IGPGGLTGLRVGIATVVGLVSPYDIPVAPLNSFETAKSCPADGVVLVARRARKGYHYCAV -------------------3333-----------3333---------------------- YLKDKGLNPLKEPSVVSDEELEEITKEFSPKIVLKDDLLISPAVLVEESERLFREKKTIH ----------------------------------------3333---------------1 YYEIEPLYLQK 111-------- >HELIX-TURN-HELIX MOTIF; SWP:NA; PDB:2A6CA; 
HKRSQLLIVLQEHLRNSGLTQFKAAELLGVTQPRVSDLRGKILFSLESLIDITSIGLKVE -------------1111------------------------111--------1111---- INIKD ----- >DNA-DIRECTED RNA POLYMERA; SWP:Q9Z9H6; PDB:2A6HA; MLDSKLKAPVFTVRTQGREYGEFVLEPLERGFGVTLGNPLRRILLSSIPGTAVTSVYIED ---3333--------------------------------------------------111 VLHEFSTIPGVKEDVVEIILNLKELVVRFLNPSLQTVTLLLKAEGPKEVKARDFLPVADV 1-1111-------3333----3333-----3333-------------------------- EIMNPDLHIATLEEGGRLNMEVRVDRGVGYVPAEKHGIKDRINAIPVDAVFSPVRRVAFQ ---1111------------------------3333-----1111---------------- VEDTRLGQRTDLDKLTLRIWTDGSVTPLEALNQAVEILREHLTYFSNPQ ------------------------------------------------- >DNA-directed RNA polymera; SWP:Q8RQE9; PDB:2A6HC; MEIKRFGRIREVIPLPPLTEIQVESYRRALQADVPPEKRENVGIQAAFRETFPIEEEDKG ----------------1111----------33333333---------------------- KGGLVLDFLEYRLGEPPFPQDECREKDLTYQAPLYARLQLIHKDTGLIKEDEVFLGHIPL -------------------3333------------------------------------- MTEDGSFIINGADRVIVSQIHRSPGVYFTPDPARPGRYIASIIPLPKRGPWIDLEVEPNG -3333------------------------------------------------------- VVSMKVNKRKFPLVLLLRVLGYDQETLARELGAYGELVQGLMDESVFAMRPEEALIRLFT -----------1111-------3333-----1111-3333---3333---3333----33 LLRPGDPPKRDKAVAYVYGLIADPRRYDLGEAGRYKAEEKLGIRLSGRTLARFEDGEFKD 33----------3333----------------33333333-------------------- EVFLPTLRYLFALTAGVPGHEVDDIDHLGNRRIRTVGELMTDQFRVGLARLARGVRERML -----------3333--------1111-------3333-----------------1111- MGSEDSLTPAKLVNSRPLEAAIREFFSRSQLSQFKDETNPLSSLRHKRRISALGPGGLTR --------3333----------------3333---------------------------- ERAGFDVRDVHRTHYGRICPVETPEGANIGLITSLAAYARVDELGFIRTPYRRVVGGVVT ----3333--3333---------------------------------------------- DEVVYMTATEEDRYTIAQANTPLEGNRIAAERVVARRKGEPVIVSPEEVEFMDVSPKQVF --------3333-----3333-----------------------1111------3333-- SVNTNLIPFLEHDDANRALMGSNMQTQAVPLIRAQAPVVMTGLEERVVRDSLAALYAEED 3333----3333------------1111-------------------------------- GEVAKVDGNRIVVRYEDGRLVEYPLRRFYRSNQGTALDQRPRVVVGQRVRKGDLLADGPA 
--------------1111------------1111-----------------------111 SENGFLALGQNVLVAIMPFDGYNFEDAIVISEELLKRDFYTSIHIERYEIEARDTKLGPE 1-----------------------------------------------------1111-- RITRDIPHLSEAALRDLDEEGVVRIGAEVKPGDILVGRTSFKGESEPTPEERLLRSIFGE ----------1111---------2222--2222--------------3333--------- KARDVKDTSLRVPPGEGGIVVRTVRLRRGDPGVELKPGVREVVRVYVAQKRKLQVGDKLA -----------------------------------2222--------------3333--- NRHGNKGVVAKILPVEDMPHLPDGTPVDVILNPLGVPSRMNLGQILETHLGLAGYFLGQR 3333---------3333--------------3333------------------------- YISPIFDGAKEPEIKELLAQAFEVYFGKRKGEGFGVDKREVEVLRRAEKLGLVTPGKTPE -----------------------------------------------------2222--- EQLKELFLQGKVVLYDGRTGEPIEGPIVVGQMFIMKLYHMVEDKMHARSTGPYSLITQQP ---------------------------------------3333----------------- LGGKAQFGGQRFGEMEVWALEAYGAAHTLQEMLTLKSDDIEGRNAAYEAIIKGEDVPEPS ------------3333---------3333--------------------1111------- VPESFRVLVKELQALALDVQTLDEKDNPVDIFEGLASKR -----------3333------------------------ >DNA-directed RNA polymera; SWP:Q8RQE8; PDB:2A6HD; KKEVRKVRIALASPEKIRSWSYGEVEKPETINYRTLKPERDGLFDERIFGPIKDYECACG ------------3333-------------------------1111--------------- KYKRQRFEGKVCERCGVEVTKSIVRRYRMGHIELATPAAHIWFVKDVPSKIGTLLDLSAT ----1111---3333------3333--------------3333----------------- ELEQVLYFSKYIVLDPKGAILNGVPVEKRQLLTDEEYRELRYGKQETYPLPPGVDALVKD -------------------------------------------------------3333- GEEVVKGQELAPGVVSRLDGVALYRFPRRVRVEYVKKERAGLRLPLAAWVEKEAYKPGEI -1111-----------------------------------------3333---------3 LAELPEPYLFGDKIVAAIDPEEEVIAEAEGVVHLHEPASILVVKARVYPFEDDVEVSTGD 333--------------------------------------------------------- RVAPGDVLADGGKVKSDVYGRVEVDLVRNVVRVVESYDIDARMGAEAIQQLLKELDLEAL ------------------------------------------------------------ EKELLEEMKHPSRARRAKARKRLEVVRAFLDSGNRPEWMILEAVPVLPPDLRPMVQVDGG ------------------------------------3333--------1111-------- RFATSDLNDLYRRLINRNNRLKKLLAQGAPEIIIRNEKRMLQEAVDALLDNGRRGAPVTN 
----3333----------------3333--3333-------------------------- PGSDRPLRSLTDILSGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKRMALELF ----------------------------------------33333333---3333----- KPFLLKKMEEKGIAPNVKAARRMLERQRDIKDEVWDALEEVIHGKVVLLNRAPTLHRLGI ---------------------11113333--------1111--------------1111- QAFQPVLVEGQSIQLHPLVCEAFNADFDGDQMAVHVPLSSFAQAEARIQMLSAHNLLSPA ---------------3333-------------------3333---------1111----- SGEPLAKPSRDIILGLYYITQVRKEKKGAGLEFATPEEALAAHERGEVALNAPIKVAGRE ----------------3333-----------33331111--------------------- TSVGRLKYVFANPDEALLAVAHGIVDLQDVVTVRYMGKRLETSPGRILFARIVAEAVEDE -----------3333----------1111-----------------------------33 KVAWELIQLDVPQEKNSLKDLVYQAFLRLGMEKTARLLDALKYYGFTFSTTSGITIGIDD 33-1111-----------------------3333-----------------------333 AVIPEEKKQYLEEADRKLLQIEQAYEMGFLTDRERYDQILQLWTETTEKVTQAVFKNFEE 3--1111-----------------1111--3333-------------------------- NYPFNPLYVMAQSGARGNPQQIRQLCGLRGLMQKPSGETFEVPVRSSFREGLTVLEYFIS ----33333333-----3333-------------------------3333----3333-- SHGARKGGADTALRTADSGYLTRKLVDVTHEIVVREADCGTTNYISVPLFQPDEVTRSLR ----------------------------3333---------------------------- LRKRADIEAGLYGRVLAREVEVLGVRLEEGRYLSMDDVHLLIKAAEAGEIQEVPVRSPLT ---------------------------------3333-------------------3333 CQTRYGVCQKCYGYDLSMARPVSIGEAVGIVAAQSIGEPGTQLTMRDITQGLPRVIELFE -------3333-----------2222-------------1111-------3333------ ARRPKAKAVISEIDGVVRIEETEEKLSVFVESEGFSKEYKLPKEARLLVKDGDYVEAGQP -----------------------------------------3333----------2222- LTRGAIDPHQLLEAKGPEAVERYLVEEIQKVYRAQGVKLHDKHIEIVVRQMMKYVEVTDP ----------------3333------------1111------------1111-------- GDSRLLEGQVLEKWDVEALNERLIAEGKTPVAWKPLLMGVTKSALSTKSWLSAASFQNTT ---------3333--------------------------------------------333 HVLTEAAIAGKKDELIGLKENVILGRLIPAGTGSDFVRFTQVVDQKTLKAIEEARKEAVE 3-----1111-------3333--------------------------------------- A - >DNA-directed RNA polymera; SWP:Q8RQE7; PDB:2A6HE; 
AEPGIDKLFGMVDSKYRLTVVVAKRAQQLLRHGFKNTVLEPEERPKMQTLEGLFDDPNAE -------------3333---------------1111---------------3333--333 TWAMKELLTGRLVFGENLVPEDRLQKEMERIYPGE 3----3333-------------------------- >RNA polymerase pricipal s; SWP:Q5SKW1; PDB:2A6HF; KISTSDPVRQYLHEIGQVPLLTLEEEVELARKVEEGMEAIKKLSEITGLDPDLIREVVRA -------------------------------------------------3333------- KILGSARVRHIPGLKETLDPKTVEEIDQKLKSLPKEHKRYLHIAREGEAARQHLIEANLR -----------------------------------3333--------------------- LVVSIAKKYTGRGLSFLDLIQEGNQGLIRAVEKFEYKRRFKFSTYATWWIRQAINRAIAD ----3333---------------------3333-3333---------------------- QARTIRIPVHMVETINKLSRTARQLQQELGREPTYEEIAEAMGPGWDAKRVEETLKIAQE ------------------------------------3333------3333---------- PVSLETPIGDEKDSFYGDFIPDEHLPSPVDAATQSLLSEELEKALSKLSEREAMVLKLRK --------------1111---------3333----------------------------- GLIDGEEVGAFFGVTRERIRQIENKALRKLKYHESRTRKLRDFLD ------1111----3333--------------------------- >GERMLINE ANTIBODY 36-65 F; SWP:NA; PDB:2A6IB; EVQLQQSGAELVRAGSSVKMSCKASGYTFTSYGINWVKQRPGQGLEWIGYINPGNGYTKY ------------2222-----------3333----------------------------- NEKFKGKTTLTVDKSSSTAYMQLRSLTSEDSAVYFCARSVYYGGSYYFDYWGQGTTLTVS 3333----------------------3333------------------------------ SAKTTPPSVYPLAPGSANSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY --------------------------------------%%%%------------------ TLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRD --------3333------------1111----------- >ISHP608 TRANSPOSASE; SWP:Q933Z0; PDB:2A6MA; AVLYKSNHNVVYSCKYHIVWCPKYRRKVLVGAVEMRLKEIIQEVAKELRVEIIEMQTDKD ---------------------2222----------------------------------- HIHILADIDPSFGVMKFIKTAKGRSSRILRQEFNHLKTKLPTLWTNSCFISTVGGAPLNV --------3333-3333---------------3333------------------------ VKQYIENQQN -----1111- >POSSIBLE PHOSPHOGLYCERATE; SWP:Q6MWZ7; PDB:2A6PA; RNHRLLLLRHGETAWSTLGRHTGGTEVELTDTGRTQAELAGQLLGELELDDPIVICSPRR ------------3333---------------------------1111-----------33 RTLDTAKLAGLTVNEVTGLLAEWDYGSYEGLTTPQIRESEPDWLVWTHGCPAGESVAQVN 
33--------------3333----!!!!---3333----11113333--2222------- DRADSAVALALEHMSSRDVLFVSHGHFSRAVITRWVQLPLAEGSRFAMPTASIGICGFEH ------------3333-----------------1111----3333---2222------ii GVRQLAVLGLTGH ii----------- >ANTITOXIN YEFM; SWP:P69346; PDB:2A6QA; GPHMRTISYSEARQNLSATMMKAVEDHAPILITRQNGEACVLMSLEEYNSLEETAYLLRS -----------------------1111--------------------------------- PANARRLMDSIDSLKSGKGTEKDIIE -------------------------- >TOXIN YOEB; SWP:P69348; PDB:2A6SA; MKLIWSEESWDDYLYWQETDKRIVKKINELIKDTRRTPFEGKGKPEPLKHNLSGFWSRRI ---------------1111-----------------11112222----!!!!-------- TEEHRLVYAVTDDSLLIAACRYH ----------------------- >SPAC19A8.12; SWP:O13828; PDB:2A6TA; MSFTNATFSQVLDDLSARFILNLPAEEQSSVERLCFQIEQAHWFYEDFIRAQNDQLPSLG -------------------33333333---3333-------------------------- LRVFSAKLFAHCPLLWKWHEEAFDDFLRYKTRIPVRGAIMLDMSMQQCVLVKGWKASSGW ------------3333--------------------------------------1111-- GFPKGKIDKDESDVDCAIREVYEETGFDCSSRINPNEFIDMTIRGQNVRLYIIPGISLDT ----------------------------1111-1111-----iiii----------1111 RFEISKIEWHNLMDLPTFKMKNKFYMVIPFLAPLKKWIKKRNIANNTTKE ----------3333------3333--3333-------------1111--- >EMP46P; SWP:Q12396; PDB:2A6VA; KWNKGYSLPNLLEVTDQQKELSQWTLGDKVKLEEGRFVLTPGKNTKGSLWLKPEYSIKDA --3333---3333--3333-1111--------iiii------------------------ MTIEWTFRSFGFRGSTKGGLAFWLKQGNEGDSTELFGGSSKKFNGLMILLRLDDKLGESV ----------------------------------%%%%---------------------- TAYLNDGTKDLDIESSPYFASCLFQYQDSMVPSTLRLTYNPLDNHLLKLQMDNRVCFQTR -----------3333------------------------3333-------%%%%------ KVKFMGSSPFRIGTSAINDASKESFEILKMKLYDGVIE --3333------------1111---------------- >EMP47P (FORM2); SWP:P43555; PDB:2A6ZA; GSDASKLSSDYSLPDLINTRKVPNNWQTGEQASLEEGRIVLTSNQNSKGSLWLKQGFDLK --3333-3333-----------1111--!!!!--2222---------------------- DSFTMEWTFRSVGYSGQTDGGISFWFVQDSNIPRDKQLYNGPVNYDGLQLLVDNNGPLGP -------------------------------------%%%%--------------1111- TLRGQLNDGQKPVDKTKIYDQSFASCLMGYQDSSVPSTIRVTYDLEDDNLLKVQVDNKVC 
-------------11113333----------------------1111-------iiii-- FQTRKVRFPSGSYRIGVTAQNGAVNNNAESFEIFKMQFFNGV ----------------------2222---------------- >Complement C3 [Precursor]; SWP:P01024; PDB:2A74B; DEDIIAEENIVSRSEFPESWLWNVEDLKEPPKNGISTKLMNIFLKDSITTWEILAVSMSD -----3333----------------------iiii------------------------- KKGICVADPFEVTVMQDFFIDLRLPYSVVRNEQVEIRAVLYNYRQNQELKVRVELLHNPA ----------------------------2222-------------------------111 FCSLATTKRRHQQTVTIPPKSSLSVPYVIVPLKTGLQEVEVKAAVYHHFISDGVRKSLKV 1----1111--------------------------------------------------- VPE --- >Complement C3 [Precursor]; SWP:P01024; PDB:2A74C; TCNKFDLKVTIKPAPKNTMILEICTRYRGDQDATMSILDISMMTGFAPDTDDLKQLANGV -1111-------------------------------------2222---------1111- DRYISKYELDKAFSDRNTLIIYLDKVSHSEDDCLAFKVHQYFNVELIQPGAVKVYAYYNL ------3333---------------------------------------------1111- EESCTRFYHPEKEDGKLNKLCRDELCRCAEENCFIQKDKVTLEERLDKACEPGVDYVYKT --------1111---------!!!!-------------------------1111------ RLVKVQLSNDFDEYIMAIEQTIKSGSDEVQVGQQRTFISPIKCREALKLEEKKHYLMWGL -----------------------------2222------1111-3333-2222------3 SSDFWGEKPNLSYIIGKDTWVEHWPEEDECQDEENQKQCQDLGAFTESMVVFGCPN 333------------1111------1111--------------------------- >IMMUNOGLOBULIN LIGHT CHAI; SWP:NA; PDB:2A77H; DVKLVESGGGLVKPGGSLRLSCAASGFTFRNYGMSWVRQTPEKRLEWVAAISGNSLYTSY ------------2222-----------3333--------1111--------1111----- PDSVKGRFTISRDNAKNNLYLQMSSLRSEDTALYFCARHDDYPYFFDVWGAGTTVTASSA 3333--------3333----------3333------------------------------ KTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ---------------------------------------%%%%----------------- YTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR ---------1111------------1111---------- >If kappa light chain [Fra; SWP:A2NHM3; PDB:2A77L; DVLMTQSPLSLPVSLGDQASISCRCSQSIVKSNGHTYLEWYLQKPGKSPKLLIYKVSNRF -------------2222-------------1111---------1111------------2 SGVPDRFSGSGSGTDFTLRISRVEAEDLGVYYCFQGSHIPWTFGGGTKLESKRADAAPTV 
2223333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----33331111---------------------iiii--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSPIVKSFNR ----------1111-------------------- >GAMMA-ADAPTIN APPENDAGE D; SWP:P22892; PDB:2A7BA; MIPSITAYSKNGLKIEFTFERSNTNPSVTVITIQASNSTELDMTDFVFQAAVPKTFQLQL ---------iiii-----------1111------------------------1111---- LSPSSSVVPAFNTGTITQVIKVLNPQKQQLRMRIKLTYNHKGSAMQDLAEVNNFPPQSWQ ---------%%%%----------1111------------iiii-----------3333-- >CARB; SWP:Q9XB60; PDB:2A7KA; MVFEENSDEVRVITLDHPNKHNPFSRTLETSVKDALARANADDSVRAVVVYGGAERSFSA ------!!!!--------1111-------------------1111-------2222---- GGDFNEVKQLRSEDIEEWIDRVIDLYQAVLNVNKPTIAAVDGYAIGMGFQFALMFDQRLM --333311113333--------------1111-------------------1111----- ASTANFVMPELKHGIGCSVGAAILGFTHGFSTMQEIIYQCQSLDAPRCVDYRLVNQVVES 1111----3333----------------------------------------------33 SALLDAAITQAHVMASYPASAFINTKRAVNKPFIHLLEQTRDASKAVHK 33-----------1111-------------------------------- >Hypothetical ubiquitin-co; SWP:NA; PDB:2A7LA; SMQKRLQKELLALQNDPPPGMTLNEKSVQNSITQWIVDMEGAPGTLYEGEKFQLLFKFSS 113333-----------2222--1111--------------2222-2222--------11 RYPFDSPQVMFTGENIPVHPHVYSNGHICLSILTEDWSPALSVQSVCLSIISMLS 11----------------11111111---333311113333----------1111 >N-ACYL HOMOSERINE LACTONE; SWP:Q7B8C3; PDB:2A7MA; MTVKKLYFIPAGRCMLDHSSVNSALTPGKLLNLPVWCYLLETEEGPILVDTGMPESAVNN -----------------33331111----------------1111--------3333--1 EGLFNGTFVEGQILPKMTEEDRIVNILKRVGYEPDDLLYIISSHLHFDHAGGNGAFTNTP 1112222-2222-----3333-----------1111---------1111--3333----- IIVQRTEYEAALHREEYMKECILPHLNYKIIEGDYEVVPGVQLLYTPGHSPGHQSLFIET ---3333------33333333----------------2222------------------- EQSGSVLLTIDASYTKENFEDEVPFAGFDPELALSSIKRLKEVVKKEKPIIFFGHDIEQE --------!!!!--33331111---------------------------------33331 KSCRVFPEYI 111------- >HUNTINGTIN INTERACTING PR; SWP:Q9BYW2; PDB:2A7OA; TSSELAKKSKEVFRKEMSQFIVQCLNPYRKPDCKVGRITTTEDFKHLARKLTHGVMNKEL 
3333--------------------3333-1111--------------------------- KYCKNPEDLECNENVKHKTKEYIKKYMQKFGAVYKPKEDT -----1111-----------------3333----1111-- >NEUROTOXIN; SWP:P60277; PDB:2A7TA; GEDGYIADGDNCTYICTFNNYCHALCTDKKGDSGACDWWVPYGVVCWCEDLPTPVPIRGS --------------------------1111---------1111--------3333----- GKCR ---- >SERINE HYDROXYMETHYLTRANS; SWP:P34897; PDB:2A7VA; GWTGQESLSDSDPEMWELLQREKDRQCRGLELIASENFCSRAALEALGSCLNNKYSEGVD ------3333----------------------1111--------------1111------ EIELLCQRRALEAFDLDPAQWGVNVQPYSGSPANLAVYTALLQPHDRIMGYKLNPKTGLI -----------1111-3333---------------------------------------- DYNQLALTARLFRPRLIIAGTSAYARLIDYARMREVCDEVKAHLLADMAHISGLVAAKVI -----------------------------------------------3333--------- PSPFKHADIVTTTTHKTLRGARSGLIFYRKGVKAVDPKTGREIPYTFEDRINFAVFPSLQ -3333---------!!!!-----------------------------3333--------- GGPHNHAIAAVAVALKQACTPMFREYSLQVLKNARAMADALLERGYSLVSGGTDNHLVLV -----------------------------------------------2222--------- DLRPKGLDGARAERVLELVSITANKNTCPGDRSAITPGGLRLGAPALTSRQFREDDFRRV ---------------------------2222--------------3333---3333---- VDFIDEGVNIGLEVKSKTAKLQDFKSFLLKDSETSQRLANLRQRVEQFARAFPMPGFDEH -------------1111------------------------------3333--------- >PHOSPHORIBOSYL-ATP PYROPH; SWP:Q7P0E6; PDB:2A7WA; DVLKNIADTLEARREAAPQSSYVASLFHKGEDAILKKVAEEAAETLASKDKDKLHLVREV ----------------1111---------------------------------------- ADLWFHTVLLTYHGLRPEDVVELHRREG ----------1111-3333--------- >HYPOTHETICAL PROTEIN RV23; SWP:P64983; PDB:2A7YA; MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVNGHETTVYPGSDAVVVT ------------3333-------------------------------------------3 ATEHAEAEKRAAARAGHAAT 333----------------- >PANTOATE--BETA-ALANINE LI; SWP:P0A5R0; PDB:2A84A; IPAFHPGELNVYSAPGDVADVSRALRLTGRRVMLVPTMGALHEGHLALVRAAKRVPGSVV ----2222-----3333--------1111-------------------------2222-- VVSIFVNPMQFGAPDDDLAQLRAEGVEIAFTPTTAAMYPDGLRTTVQPGPLAAELEGGPR ---------2222--------1111-------3333-1111-------3333-!!!!--1 
PTHFAGVLTVVLKLLQIVRPDRVFFGEKDYQQLVLIRQLVADFNLDVAVVGVPTVREADG 111----------------------3333---------------------------1111 LAMSSRNRYLDPAQRAAAVALSAALTAAAHAATAGAQAALDAARAVLDAAPGVAVDYLEL ---3333------------------------------------------1111------- RDIGLGPMPLNGSGRLLVAARLGTTRLLDNIAIEIGT -1111----------------!!!!------------ >THIOREDOXIN REDUCTASE; SWP:P52214; PDB:2A87A; AHHPVRDVIVIGSGPAGYTAALYAARAQLAPLVFEGTSFGGALMTTTDVENYPGFRNGIT ------------------------1111------------1111----------1111-- GPELMDEMREQALRFGADLRMEDVESVSLHGPLKSVVTADGQTHRARAVILAMGAAARYL ------------1111---------------------1111------------------- QVPGEQELLGRGVSSCATCDGFFFRDQDIAVIGGGDSAMEEATFLTRFARSVTLVHRRDE -33331111------333333332222--------------------------------- FRASKIMLDRARNNDKIRFLTNHTVVAVDGDTTVTGLRVRDTNTGAETTLPVTGVFVAIG -------------1111------------------------------------------- HEPRSGLVREAIDVDPDGYVLVQGRTTSTSLPGVFAAGDLVDRTYRQAVTAAGSGCAAAI ----3333------1111------------2222---3333------------------- DAERWLAEHAATG ------------- >CARBONIC ANHYDRASE 2; SWP:P45148; PDB:2A8DA; MDKIKQLFANNYSWAQRMKEENSTYFKELADHQTPHYLWIGCSDSRVPAEKLTNLEPGEL ----------------------------3333---------3333--3333----2222- FVHRNVANQVIHTDFNCLSVVQYAVDVLKIEHIIICGHTNCGGIHAAMADKDLGLINNWL ----------1111---------------------------------------3333--- LHIRDIWFKHGHLLGKLSPEKRADMLTKINVAEQVYNLGRTSIVKSAWERGQKLSLHGWV -------------11111111--------------------------------------- YDVNDGFLVDQGVMATSRETLEISYRNAIARLSILDEENIL -1111-------------------------1111-1111-- >HYPOTHETICAL PROTEIN YKTB; SWP:Q45498; PDB:2A8EA; TQRFTEEDFNTFTIEGLDAREVLKETVRPKLTALGEHFAPTLSALTGDEFPHVAKHARRS ----33333333---3333--3333------------------------------1111- VNPPADSWVAFANSKRGYKKLPHFQIGLWESHVFVWFAIIYESPIKEEYGKLLEVNQETI -----------------1111------------------3333----------------- TKNIPDSFVWSADHTKPGVHKQSEDKEQLKTLFERLQTVKKAELLCGIQLQKEEVLNNNQ ----1111----1111----3333--------------1111------------------ EFLQRIDDAFKQLAFLYRLTQKVTQ ------------------------- >THREONINE ASPARTASE 1; 
SWP:Q9H6P5; PDB:2A8IA; GGFVLVHAGAGYHSESKAKEYKHVCKRACQKAIEKLQAGALATDAVTAALVELEDSPFTN -------------3333--------------------------------------1111- AGGSNLNLLGEIECDASIDGKSLNFGAVGALSGIKNPVSVANRLLCEGQKGKLSAGRIPP ------1111--------3333-------------3333---------3333-------- CFLVGEGAYRWAVDHGIPSCPPNITTRFSLAAFKRNKRKLELGTLDTVGAVVVDHEGNVA ------------1111----3333------------------1111-------1111--- AAVSSGGLALKHPGRVGQAALYGCGCWAENTGAHNPYSTAVSTSGCGEHLVRTILARECS -----------2222-33332222--------------------------1111------ HALQAEDAHQALLETQNKFISSPFLASEDGVLGGVIVLRSCRCTLLVEFLWSHTTESCVG ------------------1111--1111-------------------------------- YSAQDGKAKTHISRLPPGAVAGQSVAIEGGVCRLE -3333--------------2222------------ >THREONINE ASPARTASE 1; SWP:Q9H6P5; PDB:2A8JA; GGFVLVHAGAGYHSESKAKEYKHVCKRACQKAIEKLQAGALATDAVTAALVELEDSPFTN -----------------------------------1111----------------1111- AGMGSNLNLLGEIECDASIMDGKSLNFGAVGALSGIKNPVSVANRLLCEGQKGKLGRIPP -2222--1111--------------------------3333------------------- CFLVGEGAYRWAVDHGIPSCPTVGAVVVDHEGNVAAAVSSGGLALKHPGRVGQAALYGCG ------------1111------------1111----------22222222-33332222- CWAENTGAHNPYSTAVSTSGCGEHLVRTILARECSHALQAEDAHQALLETMQNKFISSPF ------1111---------------1111-------------------------1111-- LASEDGVLGGVIVLRSCLLVEFLWSHTTESMCVGYMSAQDGKAKTHISRLPPGAVAGQSV 1111----------------------------------------------22222222-- AIEGGVCRLE ---------- >COLICIN E5; SWP:P18000; PDB:2A8KA; LKIDQKIRGQMPERGWTEDDIKNTVSNGATGTSFDKRSPKKTPPDYLGRNDPATVYGSPG -----------1111----------------------3333----------------222 KYVVVNDRTGEVTQISDKTDPGWVDDSRIQWGNK 2---------------1111-----1111----- >CYTIDINE AND DEOXYCYTIDYL; SWP:Q8UHJ4; PDB:2A8NA; AERTHFMELALVEARSAGERDEVPIGAVLVLDGRVIARSGNRTRELNDVTAHAEIAVIRM -----------------1111---------iiii---------11111111--------- ACEALGQERLPGADLYVTLEPCTMCAAAISFARIRRLYYGAQDPKGGAVESGVRFFSQPT ---------2222----------------1111---------3333----!!!!1111-- CHHAPDVYSG ---------- >DIHYDROLIPOYL DEHYDROGENA; SWP:P66004; PDB:2A8XA; 
THYDVVVLGAGPGGYVAAIRAAQLGLSTAIVEPKYWGGVCLNVGCIPSKALLRNAELVHI ----------3333-------1111---------22221111------------------ FTKDAKAFGISGEVTFDYGIAYDRSRKVAEGRVAGVHFLKKNKITEIHGYGTFADANTLL -----1111-------3333-------------------1111----------------- VDLNDGGTESVTFDNAIIATGSSTRLVPGTSLSANVVTYEEQILSRELPKSIIIAGAGAI --1111--------------------2222--1111-33331111------------333 GEFGYVLKNYGVDVTIVEFLPRALPNEDADVSKEIEKQFKKLGVTILTATKVESIADGGS 3------1111------------1111------------3333----------------- QVTVTVTKDGVAQELKAEKVLQAIGFAPNVEGYGLDKAGVALTDRKAIGVDDYRTNVGHI -------%%%%----------------------3333-----1111----------1111 YAIGDVNGLLQLAHVAEAQGVVAAETIAGAETLTLGDHRLPRATFCQPNVASFGLTEQQA ---3333----------------------------------------------------- RNEGYDVVVAKFPFTANAKAHGVGDPSGFVKLVADAKHGELLGGHLVGHDVAELLPELTL 1111--------3333---------------------------------3333------- AQRWDLTASELARNVHTHPTSEALQECFHGLVGHINF ------33331111----------------------- >DELTEX PROTEIN; SWP:Q23985; PDB:2A90A; AHAVSVWEFESRGKWLPYSPAVSQHLERAHAKKLTRVLSDADPSLEQYYVNVRTTQESLT ----------iiii-----------------------33331111--------------- IGVRRFYAPSSPAGKGTKWEWSGGSADSNNDWRPYNHVQSIIEDAWARGEQTLDLSNTHI -------1111-1111-------------------------------------3333--- GLPYTINFSNLTQLRQPSGPRSIRRTQQAPYPLVK ----------------------------------- >RECEPTOR TYROSINE-PROTEIN; SWP:P04626; PDB:2A91A; STQVCTGTDMKLRLPASPETHLDMLRHLYQGCQVVQGNLELTYLPTNASLSFLQDIQEVQ ---------!!!!---3333--------2222------------1111--1111------ GYVLIAHNQVRQVPLQRLRIVRGTQLFEDNYALAVLDNGDPLPVTGASPGGLRELQLRSL --------------1111-------2222---------------------------1111 TEILKGGVLIQRNPQLCYQDTILWKDIFHKNNQLALTLIDTNRSRACHPCSPMCKGSRCW ------------1111-1111-3333--1111------------------3333------ GESSEDCQSLTRTVCAGGCARCKGPLPTDCCHEQCAAGCTGPKHSDCLACLHFNHSGICE --1111-------------------3333--1111-------------------iiii-- LHCPALVTYNTDTFESMPNPEGRYTFGASCVTACPYNYLSTDVGSCTLVCPLHNQEVTAE ------------------1111------------2222--1111--------------33 
DGTQKCEKCSKPCARVCYGLGMEHLREVRAVTSANIQEFAGCKKIFGSLAFLPESFDGDP 33----------------22221111-----1111---2222---------3333----1 ASNTAPLQPEQLQVFETLEEITGYLYISAWPDSLPDLSVFQNLQVIRGRILHNGAYSLTL 111----3333-1111--------------3333--3333----------2222------ QGLGISWLGLRSLRELGSGLALIHHNTHLCFVHTVPWDQLFRNPHQALLHTANRPEDECV ---------1111------------1111--11113333---1111--------3333-- GEGLACHQLCAKGHCWGPGPTQCVND ------1111--------3333---- >L-LACTATE DEHYDROGENASE; SWP:Q4PRK9; PDB:2A92A; TPKPKIVLVGSGMIGGVMATLIVQKNLGDVVMFDVVKNMPQGKALDTSHSNVMAYSNCKV ---------------------------------------------------1111----- TGSNSYDDLKGADVVIVTAGFTKAPGKSDKEWNRDDLLPLNNKIMIEIGGHIKNLCPNAF ----33332222-------------------------------------------1111- IIVVTNPVDVMVQLLFEHSGVPKNKIIGLGGVLDTSRLKYYISQKLNVCPRDVNALIVGA ------3333-----------1111-----------------------3333-------- HGNKMVLLKRYITVGGIPLQEFINNKKITDEEVEGIFDRTVNTALEIVNLLASPYVAPAA -1111--3333--iiii3333--------------------------------------- AIIEMAESYLKDIKKVLVCSTLLEGQYGHSNIFGGTPLVIGGTGVEQVIELQLNAEEKTK --------1111------------2222------------1111---------------- FDEAVAETKRMKALIH -----------1111- >BOTULINUM NEUROTOXIN TYPE; SWP:P30996; PDB:2A97A; PVAINSFNYNDPVNDDTILYMQIPYEEKSKKYYKAFEIMRNVWIIPERNTIGTNPSDFDP -------1111-----------22221111--------2222-----------3333--- PASLKNGSSAYYDPNYLTTDAEKDRYLKTTIKLFKRINSNPAGKVLLQEISYAKPYLGND 11112222----1111---------------------------------1111-----11 HTPIDEFSPVTRTTSVNIKLSTNVESSMLLNLLVLGAGPDIFESCCYPVRKLIDPDVVYD 111111----1111-----1111----------------3333---------1111---- PSNYGFGSINIVTFSPEYEYTFNESFIADPAISLAHELIHALHGLYGARGVTYEETIPIR 1111----------------------------------------------3333-----3 LEEFLTFGGQDLNIITSAMKEKIYNNLLANYEKIATRLSEVNSAPPEYDINEYKDYFQWK 333-----3333------------------------3333----1111----------11 YGLDKNADGSYTVNENKFNEIYKKLYSFTESDLANKFKVKCRNTYFIKYEFLKVPNLLDD 11---1111--------------1111----------------------------3333- DIYTVSEGFNIGNLAVNNRGQSIKLNPKIIDS ------!!!!!!!!2222-------3333--- >INOSITOL 1,4,5-TRISPHOSPH; 
SWP:Q96DU7; PDB:2A98A; EDGRILKRFCQCEQRSLEQLMKDPLRPFVPAYYGMVLQDGQTFNQMEDLLADFEGPSIMD ----------------------1111-----------iiii------1111--------- CKMGSRTYLEEELVKARERPRPRKDMYEKMVAVDPGAPTPEEHAQGAVTKPRYMQWRETM --------11111111-----------------2222-3333------------------ SSTSTLGFRIEGIKKADGTCNTNFKKTQALEQVTKVLEDFVDGDHVILQKYVACLEELRE --------------1111----------3333--------iiii---------------- ALEISPFFKTHEVVGSSLLFVHDHTGLAKVWMIDFGKTVALPDHQTLSHRLPWAEGNRED -----3333-------------1111---------------------------------- GYLWGLDNMICLLQGLAQS ---------------1111 >SULFITE OXIDASE; SWP:P07850; PDB:2A9DA; DPFAGDPPRHPGLRVNSQKPFNAEPPAELLAERFLTPNELFFTRNHLPVPAVEPSSYRLR 1111-----3333------------1111-------3333------------3333---- VDGPTLSLSLAELRSRFPKHEVTATLQCAGNRRSEMSRVRPVKGLPWDIGAISTARWGGA ---------------------------1111-----3333-------------------- RLRDVLLHAGFPEELQGEWHVCFEGLDADPGGAPYGASIPYGRALSPAADVLLAYEMNGT ----------------------------3333-------------3333-------iiii ELPRDHGFPVRVVVPGVVGARSVKWLRRVAVSPDESPSHWQQNDYKGFSPCVDWDTVDYR --3333-------22223333----------------1111-------11111111-333 TAPAIQELPVQSAVTQPRPGAAVPPGELTVKGYAWSGGGREVVRVDVSLDGGRTWKVARL 3----------------2222---------------%%%%---------iiii------- MGDKAPPGRAWAWALWELTVPVEAGTELEIVCKAVDSSYNVQPDSVAPIWNLRGVLSTAW -----2222-------------2222---------1111------3333-1111------ HRVRVSVQD --------- >putative malic enzyme ((S; SWP:Q99ZS1; PDB:2A9FA; LKNQLGQLALEQAKTFGGKLEVQPKVDIKTKHDLSIAYTPGVASVSSAIAKDKTLAYDLT -------------1111------------3333------3333--------3333----3 TKKNTVAVISDGTAVLGLGDIGPEAAMPVMEGKAALFKAFAGVDAIPIVLDTKDTEEIIS 333-----------!!!!---3333----------------------------3333--- IVKALAPTFGGINLEDISAPRCFEIEQRLIKECHIPVFHDDQHGTAIVVLAAIFNSLKLL ----3333------------------------------1111------------------ KKSLDEVSIVVNGGGSAGLSITRKLLAAGATKVTVVDKFGIINEQEAAQLAPDIAKVTNR --1111-------------------1111-------1111--1111-------3333--- EFKSGTLEDALEGADIFIGVSAPGVLKAEWISKMAARPVIFAMANPIPEIYPDEALEAGA 
------3333-----------------3333-------------------3333-3333- YIVGTGRSDFPNQINNVLAFPGIFRGALDARAKTITVEMQIAAAKGIASLVPDDALSTTN ------3333----3333------------------------------------------ IIPDAFKEGVAEIVAKSVRSVVL ---------------1111---- >ARGININE DEIMINASE; SWP:P13981; PDB:2A9GA; TKLGVHSEAGKLRKVMVCSPGLAHQRLTPSNCDELLFDDVIWVNQAKRDHFDFVTKMRER --------------------3333---1111-1111------------------------ GIDVLEMHNLLTETIQNPEALKWILDRKITADSVGLGLTSELRSWLESLEPRKLAEYLIG -----------------------3333--1111-1111-------33333333-1111-- GVAADDLPASEGANILKMYREYLGHSSFLLPPLPNTQFTRDTTCWIYGGVTLNPMYWPAR --1111---3333---------------------33331111--------------3333 RQETLLTTAIYKFHPEFANAEFEIWYGDPDKDHGSSTLEGGDVMPIGNGVVLIGMGERSS -3333--------1111----------3333-!!!!---1111----------------- RQAIGQVAQSLFAKGAAERVIVAGLPKSRAAMHLDTVFSFCDRDLVTVFPEVVKEIVPFS --------------------------------1111-------------1111------- LRPDPSSPYGMNIRREEKTFLEVVAESLGLKKLRVVETGREQWDDGNNVVCLEPGVVVGY ------1111--------3333---1111-----------------------2222---1 DRNTYTNTLLRKAGVEVITISASELGRGRGGGHAMTCPIVRDPID 111-------1111---------3333---3333----------- >Potassium channel toxin a; SWP:P13487; PDB:2A9HE; FTNVSCTTSKECWSVCQRLHNTSRGKCMNKKCRCYS --------3333------------------------ >INTERLEUKIN-1 RECEPTOR-AS; SWP:Q8R4K2; PDB:2A9IA; KPLTPSTYIRNLNVGILRKLSDFIDPQEGWKKLAVAIKKPSGDDRYNQFHIRRFEALLQT ---11113333--------------%%%%---------1111------------3333-- GLSPTCELLFDWGTTNCTVGDLVDLLVQIELFAPATLLLPDAVPQ ------------1111----------------------3333--- >n/a; SWP:NA; PDB:2A9MH; QVQLVESGGNLVQPGGSLRLSCAASGFTFGSFSMSWVRQAPGGGLEWVAGLSARSSLTHY ------------2222-----------3333--------2222----------------- ADSVKGRFTISRDNAKNSVYLQMNSLRVEDTAVYYCARRSYDSSGYWGHFYSYMDVWGQG 3333---------1111---------3333---------3333---3333---------- TLVTVS ------ >RESPONSE REGULATOR; SWP:Q9S1K0; PDB:2A9OA; KKILIVDDEKPISDIIKFNMTKEGYEVVTAFNGREALEQFEAEQPDIIILDLMLPEIDGL --------------------1111------------------------------------ EVAKTIRKTSSVPILMLSAKDSEFDKVIGLELGADDYVTKPFSNRELQARVKALLRR 
-----------------------------3333------------------------ >COMPETENCE/DAMAGE-INDUCIB; SWP:Q8UFF5; PDB:2A9SA; MSLFPGDIEELARRIITDFTPLGLMVSTAESCTGGLIAGALTEIAGSSAVVDRGFVTYTN -------------------1111-------1111-----11112222------------- DAKRDMLGVGTETLTTFGAVSRQTALQMAHGALYRSRANFAVAVTGIAGPGGGSAEKPVG ---------3333-------------------1111-----------------3333222 LVHLATKARNGNVLHHEMRYGDIGRTEIRLATVRTALEMLIALNQAG 2------1111------------------------------------ >UBIQUITIN CARBOXYL-TERMIN; SWP:P40818; PDB:2A9UA; SVPKELYLSSSLKDLNKKTEVKPEKISTKSYVHSALKIFKTAEECRLDRDEERAYVLYKY ---------------1111--3333--------------------1111----------- VTVYNLIKKRPDFKQQQDYFHSILGPGNIKKAVEEAERLSESLKLRYEEAEVRKKLEEKD ------1111-------------------------------------------------- RQEEAQRLQQKRQ ---------3333 >GMP SYNTHASE; SWP:NA; PDB:2A9VA; HLKIYVVDNGGQWTHREWRVLRELGVDTKIVPNDIDSSELDGLDGLVLSGGAPNIDEELD -----------3333------1111----------3333--------------3333111 KLGSVGKYIDDHNYPILGICVGAQFIALHFGASVVKAKHPEFGKTKVSVHSENIFGGLPS 1--------------------------1111--------------------!!!!----- EITVWENHNDEIINLPDDFTLAASSATCQVQGFYHKTRPIYATQFHPEVEHTQYGRDIFR ---------------3333-----1111-----------------1111----------- NFIGICASYREIQKE --------------- >Hemoglobin subunit beta-C; SWP:P45721; PDB:2AA1B; VEWTDFERATIKDIFSKLEYDVVGPATLARCLVVYPWTQRYFGKFGNLYNAAAIAQNAMV --------------11111111---------------33331111--------------- SKHGTTILNGLDRAVKNMDDITNTYAELSVLHSEKLHVDPDNFKLLADCLTIVVAARFGS ----------------11113333--------------3333---------------!!! 
AFTGEVQAAFQKFMAVVVSSLGKQYR !------------------1111--- >PUTATIVE N-ACETYLMANNOSAM; SWP:P45425; PDB:2AA4A; MTTLAIDIGGTKLAAALIGADGQIRDRRELPTPASQTPEALRDALSALVSPLQAHAQRVA ------------------1111--------------------------1111-------- IASTGIIRDGSLLALNPHNLGGLLHFPLVKTLEQLTNLPTIAINDAQAAAWAEFQALDGD -------%%%%----3333!!!!------------------------------1111333 ITDMVFITVSTGVGGGVVSGCKLLTGPGGLAGHIGHTLADPHGPVCGCGRTGCVEAIASG 3-----------------%%%%---1111---3333---1111--1111---3333---- RGIAAAAQGELAGADAKTIFTRAGQGDEQAQQLIHRSARTLARLIADIKATTDCQCVVVG ---1111!!!!-----------1111---------------------------------- GSVGLAEGYLALVETYLAQEPAAFHVDLLAAHYRHDAGLLGAALLAQGE 3333-2222-------33333333------------------------- >ALPHA-AMYLASE; SWP:P56271; PDB:2AAA; LSAASWRTQSIYFLLTDRFGRTDNSTTATCNTGNEIYCGGSWQGIIDHLDYIEGMGFTAI -33331111-----3333--1111------3333----------3333----1111---- WISPITEQLPQDTADGEAYHGYWQQKIYDVNSNFGTADNLKSLSDALHARGMYLMVDVVP ------------1111-1111----1111-3333-3333-------3333---------- DHMGYAGNGNDVDYSVFDPFDSSSYFHPYCLITDWDNLTMVEDCWEGDTIVSLPDLDTTE -------3333-3333-----3333--------1111-------------------3333 TAVRTIWYDWVADLVSNYSVDGLRIDSVLEVQPDFFPGYNKASGVYCVGEIDNGNPASDC --------------------------3333-3333-------------------3333-- PYQKVLDGVLNYPIYWQLLYAFESSSGSISNLYNMIKSVASDCSDPTLLGNFIENHDNPR -----------3333--------1111-----------------1111------1111-3 FAKYTSDYSQAKNVLSYIFLSDGIPIVYAGEEQHYAGGKVPYNREATWLSGYDTSAELYT 333--------------------------3333------------3333%%%%------- WIATTNAIRKLAIAADSAYITYANDAFYTDSNTIAMAKGTSGSQVITVLSNKGSSGSSYT ---------------1111--------------------2222---------3333---- LTLSGSGYTSGTKLIEAYTCTSVTVDSSGDIPVPMASGLPRVLLPASVVDSSSLCG --------2222-------------1111------%%%%--------3333----- >ANTI-IDIOTYPIC MONOCLONAL; SWP:NA; PDB:2AABH; DVQLVESGGGLVQPGGSRKLSCAASGFTFSSFGMHWVRQAPEKGLEWVAYISSDSSNIYY ------------2222-----------1111--------2222----------------- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTAMYYCARSNYVGYHVRWYFDVWGAGTTVT 3333----------------------1111------------------------------ 
VSSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL -------------------------------------------%%%%------------- QSDLYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRD -------------11113333--------1111----------- >ANTI-IDIOTYPIC MONOCLONAL; SWP:NA; PDB:2AABL; DIQLTQSPASLAVSLGQRVTISCRASESVEYYGSSLMQWYQQKPGQPPKLLIYAASNVES -------------2222------------------------------------------- GVPARFSGSGSGTDFSLNIHPVEEDDIAMYFCQQSRKIPYTFGGGTKLEIKRADAAPTVS --3333----------------3333---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ----3333---------------------------------------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRGE -----3333------------3333------------ >Ricin [Precursor]; SWP:P02879; PDB:2AAIB; ADVCMDPEPIVRIVGRNGLCVDVRDGRFHNGNAIQLWPCKSNTDANQLWTLKRDNTIRSN --------------2222-----%%%%-2222------------1111---1111---ii GKCLTTYGYSPGVYVMIYDCNTAATDATRWQIWDNGTIINPRSSLVLAATSGNSGTTLTV ii----------------3333----------1111----------------2222---- QTNIYAVSQGWLPTNNTQPFVTTIVGLYGLCLQANSGQVWIEDCSSEKAEQQWALYADGS -----1111----------------2222-----!!!!-----------------1111- IRPQQNRDNCLTSDSNIRETVVKILSCGPASSGQRWMFKNDGTILNLYSGLVLDVRASDP --------------------------33332222----1111------------222233 SLKQIILYPLHGDPNQIWLPLF 33-------------------- >UBIQUITIN CONJUGATING ENZ; SWP:P25865; PDB:2AAK; MSTPARKRLMRDFKRLQQDPPAGISGAPQDNNIMLWNAVIFGPDDTPWDGGTFKLSLQFS --------------------2222----1111-------------1111----------- EDYPNKPPTVRFVSRMFHPNIYADGSICLDILQNQWSPIYDVAAILTSIQSLLCDPNPNS -----------------11111111----1111---3333-----------1111-1111 PANSEAARMYSESKREYNRRVRDVVEQSWT -------------------------3333- >MALONATE SEMIALDEHYDE DEC; SWP:Q9EV83; PDB:2AALA; PLLKFDLFYGRTDAQIKSLLDAAHGAMVDAFGVPANDRYQTVSQHRPGEMVLEDTGLGYG ---------------------------------1111----------------iiii--- RSSAVVLLTVISRPRSEEQKVCFYKLLTGALERDCGISPDDVIVALVENSDADWSFGRGR -1111--------------------------------3333--------1111---%%%% AEFLTGDLV 3333----- >HYPOTHETICAL PROTEIN TM14; SWP:Q9X1D0; PDB:2AAMA; 
EGWFPFDNWLYQLQNADPVEISSSGFEIAVIDYSKDGSESGEYSPEEIKIVDAGVVPVAY --------------------1111-------------3333---------1111------ VNIGQAEDYRFYWKESWYTNTPEWLGEEDPAWPGNYFVKYWYNEWKEIVFSYLDRVIDQG ------1111----3333---1111---1111------1111--------------3333 FKGIYLDRIDSFEYWAQEGVISRRSAARKINFVLEIAEYVRERKPDLIIPQNGENILDFD -------1111----1111---------------------------------33331111 DGQLASTVSGWAVENLFYLKTIPLEENETKSRLEYLIRLNRKGKFILSVDYVDDGSDSFE -3333-------------!!!!-----------------1111----------------- NISRILDYYEKAKRNGCIPYAARSDLELDENVIEGIQPPE ------------1111------1111------2222---- >AURACYANIN A; SWP:Q8RMH6; PDB:2AANA; GPVTIEIGSKGEELAFDKTELTVSAGQTVTIRFKNNSAVQQHNWILVKGGEAEAANIANA ---------!!!!----------2222--------------------------------3 GLSAGPAANYLPADKSNIIAESPLANGNETVEVTFTAPAAGTYLYICTVPGHYPLMQGKL 333--1111-----1111-------2222-------------------2222-------- VVN --- >CALCIUM-DEPENDENT PROTEIN; SWP:Q06850; PDB:2AAOA; NKFKKALRVIAESLSEEEIAGLKEFNIDADKSGQITFEELKAGLKRVGANLKESEILDLQ 3333------1111-------------1111-----------3333-------------3 AADVDNSGTIDYKEFIAATLHLNKIEREDHLFAAFTYFDKDGSGYITPDELQQACEEEEL 333------------------33331111---------1111------------------ RDVDQDNDGRIDYNEFVAQ ---1111------------ >FILAMIN A; SWP:P21333; PDB:2AAVA; CGHVTAYGPGLTHGVVNKPATFTVNTKDAGEGGLSLAIEGPSKAEISCTDNQDGTCSVSY -------1111---2222------------------------------------------ LPVLPGDYSILVKYNEQHVPGSPFTARVTGDD ------------------1111---------- >MINERALOCORTICOID RECEPTO; SWP:P08235; PDB:2AAXA; ALVPQLSTISRALTPSPVMVLENIEPEIVYAGYDSSKPDTAENLLSTLNRLAGKQMIQVV ---------------------1111--------1111----------------------- KWAKVLPGFKNLPLEDQITLIQYSWMSLLSFALSWRSYKHTNSQFLYFAPDLVFNEEKMH -33332222-------------------------------%%%%----1111-------- QSAMYELCQGMHQISLQFVRLQLTFEEYTIMKVLLLLSTIPKDGLKSQAAFEEMRTNYIK ----------------------------------------3333---------------- ELRKMVTKCNNSGQSWQRFYQLTKLLDSMHDLVSDLLEFCFYTFRESHALKVEFPAMLVE ---1111----------------------------------------1111--------- IISDQLPKVESGNAKPLYFHR 
--------1111--------- >THYMIDYLATE SYNTHASE; SWP:P45351; PDB:2AAZA; RSNPDHEEYQYLDLIRRIINVGEVRPDRTGTGTVALFAPPSFRFSLADNTLPLLTTKRVF --1111--------------------3333----------------%%%%---------- LRGVIAELLWFVSGCTDAKMLSSQGVGIWDGNGSKEFLEKVGLGHRREGDLGPVYGFQWR ---------------------1111-1111--------111111112222---------- HFGAEYTDADGDYKGKGVDQLQRVIDTIKNNPTDRRIILSAWNPKDLPLMALPPCHMFCQ 2222---1111-2222--------------1111--------33331111---------- FFVSLPPADSPGSKPKLSCLMYQRSCDLGLGVPFNIASYALLTHMIALITDTEPHEFILQ ------1111--------------------3333-------------------------- MGDAHVYRDHVEPLKTQLEREPRDFPKLKWARSKEEIGDIDGFKVEDFVVEGYKPWGKID ------1111-3333-1111------------3333--1111-3333------------- MKMSA ----- >YAJL; SWP:Q46948; PDB:2AB0A; SASALVCLAPGSEETEAVTTIDLLVRGGIKVTTASVASDGNLAITCSRGVKLLADAPLVE --------2222------------1111---------iiii----1111-------3333 VADGEYDVIVLPGGIKGAECFRDSTLLVETVKQFHRSGRIVAAICAAPATVLVPHDIFPI 1111------------------------------1111-------3333---1111---- GNMTGFPTLKDKIPAEQWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEV -----111111111111---------1111-----1111--------------------- ASQLVMAAGIYNYYE 1111--1111----- >HYPOTHETICAL PROTEIN; SWP:Q9H7C9; PDB:2AB1A; STSPEIASLSWGQKVKGSNTTYKDCKVWPGGSRTWDWRETGTEHSPGVQPADVKEVVEKG ---------2222-2222---------2222----3333---------3333-3333--- VQTLVIGRGSEALKVPSSTVEYLKKHGIDVRVLQTEQAVKEYNALVAQGVRVGGVFHSTC ---------------3333----1111------3333--------1111----------- >ZNF29; SWP:NA; PDB:2AB3A; MVYVCHFENCGRSFNDRRKLNRHKKIHTG ----------------------3333--- >MRNA MATURASE; SWP:Q9ZZW7; PDB:2AB5A; KLNTDNPIYAYIVGLFEGDGWITISKKGKYLLYELGIEHIRDIQLLYKIKNILGIGKVTI --------------------------------------3333------------------ KKLKKDGTIKECKFNVRNKNHLKNIIIPIFNKYPLTNKHYDYLYFKDNLLKDIKYYNDLS ------------------------------------------------------3333-- YYLRPIKPFNTTEDILNKNYFSSWLIGFFEAKSCFSIYKPNKKKTASFEVSNNNEVLAIK ------------------------------------------------------------ SYLKINNNIYNEFNNSKTTKSINDIKNVVFINNNPIKLLGYKKLQYLLFLKDLRTITKYN 
------------------------------1111-----3333-------------3333 NYFKIPSKY --------- ------------------------------- >ENDONUCLEASE III; SWP:P0AB83; PDB:2ABK; MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANT -----------------------------------------------------3333--- PAAMLELGVEGVKTYIKTIGLYNSKAENIIKTCRILLEQHNGEVPEDRAALEALPGVGRK ---------------1111--------------------iiii----------2222--- TANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPGKNVEQVEEKLLKVVPAEFKVDCHHWLI -----------------------------------------------33331111----- LHGRYTCIARKPRCGSCIIEDLCEYKEKVDI ------------33331111----------- >BCL-2 HOMOLOG; SWP:P89884; PDB:2ABOA; SGTYWATLITAFLKTVSKVEELDCVDSAVLVDVSKIITLTQEFRRHYDSVYRADYGPALK --1111-------1111--------1111------------------3333----3333- NWKRDLSKLFTSLFVDVINSGRIVGFFDVGRYVCEEVLCPGSWTEDHELLNDCMTHFFIE 3333----3333------3333-------------------------------------- NNLMNHFPLED -3333------ >FRUCTOSE 1-PHOSPHATE KINA; SWP:Q9KEM5; PDB:2ABQA; IYTVTLNPSIDYIVQVENFQQGVVNRSERDRKQPGGKGINVSRVLKRLGHETKALGFLGG ---------------------------------------------1111----------- FTGAYVRNALEKEEIGLSFIEVEGDTRINVKIKGKQETELNGTAPLIKKEHVQALLEQLT ----------1111---------------------------------3333-----3333 ELEKGDVLVLAGSVPQAPQTIYRSTQIAKERGAFVAVDTSGEALHEVLAAKPSFIKPNHH --2222-----------1111------------------------3333----------- ELSELVSKPIASIEDAIPHVQRLIGEGIESILVSFAGDGALFASAEGFHVNVPSGEVRNS -----------3333-------------------!!!!-----1111------------- VGAGDSVVAGFLAALQEGKSLEDAVPFAVAAGSATAFSDGFCTREEVERLQQQLQRTIKK -------------------3333-----------1111----3333-------------- EG -- >ADENOSINE KINASE; SWP:Q9TVW2; PDB:2ABSA; TGPMRVFAIGNPILDLVAEVPSSFLDEFFLKRGDATLATPEQMRIYSTLDQFNPTSLPGG --------------------3333-1111-2222----33333333---1111------- SALNSVRVVQKLLRKPGSAGYMGAIGDDPRGQVLKELCDKEGLATRFMVAPGQSTGVCAV --------------2222---------3333------------------2222------- LINEKERTLCTHLGACGSFRLPEDWTTFASGALIFYATAYTLTATPKNALEVAGYAHGIP --%%%%------!!!!-----11113333-----------------------------22 
NAIFTLNLSAPFCVELYKDAMQSLLLHTNILFGNEEEFAHLAKVHNLVNKEHAVEVCTGA 22-------3333-----------1111-------------------------------3 LRLLTAGQNTSATKLVVMTRGHNPVIAAEQTADGTVVVHEVGVPVVAAEKIVDTNGAGDA 333-%%%%-----------!!!!-------1111------------3333---2222--- FVGGFLYALSQGKTVKQCIMCGNACAQDVIQHVGFSLSFT --------1111----------------1111-------- >PDX2 PROTEIN; SWP:Q5ND68; PDB:2ABWA; SEITIGVLSLQGDFEPHINHFIKLQIPSLNIIQVRNVHDLGLCDGLVIPGGESTTVRRCC --------1111-------------1111------33331111----------------- AYENDTLYNALVHFIHVLKKPIWGTCAGCILLSKNVENIKLYSNFGNKFSFGGLDITICR -%%%%------------------------1111---------1111-------------- NFNDSFICSLNIISDSSAFKKDLTAACIRAPYIREILSDEVKVLATFSHESYGPNIIAAV ---------------3333------------------3333------------------- EQNNCLGTVFHPELLPHTAFQQYFYEKVKNYKYSLE -!!!!-----1111---------------------- ------------------------------------------------------------ -------------- >HYPOTHETICAL PROTEIN TA07; SWP:Q9HK62; PDB:2ABYA; MQKGLEIAFQTINGLDESLVQALAGVTASDFPDLDIKYNIFLVDLYGQKYFRILFQSKKL ------------%%%%--------------3333----------%%%%------------ SELHPEERKKVREKFDENSRMQYSELMTKYHDLKKQGKIKDRPVKEVHEEYDLWEDPIWQ ---3333------------------------------------------------3333- YI -- >INVERTASE; SWP:Q43866; PDB:2AC1A; NQPYRTGFHFQPPKNWMNDPNGPMIYKGIYHLFYQWNPKGAVWGNIVWAHSTSTDLINWD -1111--------------------iiii------------------------------- PHPPAIFPSAPFDINGCWSGSATILPNGKPVILYTGIDPKNQQVQNIAEPKNLSDPYLRE ---------1111-----------1111---------1111----------1111----- WKKSPLNPLMAPDAVNGINASSFRDPTTAWLGQDKKWRVIIGSKIHRRGLAITYTSKDFL ---1111-----3333--1111---------1111---------!!!!------------ KWEKSPEPLHYDDGSGMWECPDFFPVTRFGSNGVETSSFGEPNEILKHVLKISLDDTKHD ----------------------------------1111--1111----------1111-- YYTIGTYDRVKDKFVPDNGFKMDGTAPRYDYGKYYASKTFFDSAKNRRILWGWTNESSSV --------1111----2222--1111---------------3333--------------- EDDVEKGWSGIQTIPRKIWLDRSGKQLIQWPVREVERLRTKQVKNLRNKVLKSGSRLEVY --------------------3333-----------1111------------2222----- 
GVTAAQADVEVLFKVRDLEKADVIEPSWTDPQLICSKMNVSVKSGLGPFGLMVLASKNLE --1111----------3333----1111----------1111------------------ EYTSVYFRIFKARQNSNKYVVLMCSDQSRSSLKEDNDKTTYGAFVDINPHQPLSLRALID ------------2222----------------3333-----------3333--------- HSVVESFGGKGRACITSRVYPKLAIGKSSHLFAFNYGYQSVDVLNLNAWSMNSAQIS --------iiii------------!!!!----------------------------- >MAP KINASE-INTERACTING SE; SWP:Q9HBH9; PDB:2AC3A; GSTDSFSGRFEDVYQLQEDVLGEGAHARVQTCINLITSQEYAVKIIEKQPGHIRSRVFRE --------3333----------------------------------------3333---- VEMLYQCQGHRNVLELIEFFEEEDRFYLVFEKMRGGSILSHIHKRRHFNELEASVVVQDV ----1111-1111-------------------1111------------------------ ASALDFLHNKGIAHRDLKPENILCEHPNQVSPVKICDFDLGSCGSAEYMAPEVVEAFSEE -------1111------3333-----------------------3333-3333------- ASIYDKRCDLWSLGVILYILLSGYPPFVGRCGSDCGWACPACQNMLFESIQEGKYEFPDK ---------------------------------------------------------333 DWAHISCAAKDLISKLLVRDAKQRLSAAQVLQHPWVQ 31111-------1111---3333---------3333- >PURINE NUCLEOSIDE PHOSPHO; SWP:Q81T09; PDB:2AC7A; SVHIEAKQGEIAESILLPGDPLRAKYIAETFLEDVTCYNNVRGMLGFTGTYKGKRVSVQG 1111--2222---------3333----------------2222-------iiii------ TGMGVPSISIYVNELIQSYGVKNLIRVGTCGAIQKDVKVRDVIIAMTACTDSNMNRLTFP ---------------------------------11112222-----------------22 GFDFAPAANFDLLKKAYDAGTEKGLHVRVGNVLTADVFYRESMDMVKKLGDYGVLAVEME 22-----------------------------------------------1111------- TTALYTLAAKYGVNALSVLTVSDHIQTTFNEMIEIALDAA ---------------------------------------- >PUTATIVE ADENYLATE CYCLAS; SWP:Q87NV8; PDB:2ACAA; QGQFEVELKYRVKNHDAFLNVKQIEHEVFENNQESDWFYDTPQRTLTQQGKSLVLREIQP --------------------1111----------------1111--1111---------- AGIKLWIVKGPEADRCEATNITKLDSAQSLENGYEVIQCSKKIRSIFFVGEFHITLDFLD -----------1111---------------------------------!!!!-------- GFGHFAEFAITDDETALARYRERLVALAQQFHLSEADREHRSYKEILSA ------------------3333-----3333--3333-----3333--- >REPLICASE POLYPROTEIN 1AB; SWP:Q6RD32; PDB:2ACFA; HHHMPVNQFTGYLKLTDNVAIKCVDIVKEAQSANPMVIVNAANIHLKHGGGVAGALNKAT 
---------------1111-----------------------1111-----------111 NGAMQKESDDYIKLNGPLTVGGSCLLSGHNLAKKCLHVVGPNLNAGEDIQLLKAAYENFN 1-----------------2222-----!!!!----------3333--3333-------11 SQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQVYIAVNDKALYEQVVMDYL 11------22221111------------------------------------ ----------------------------------- >Oxysterols receptor LXR-a; SWP:Q9Z0Y9; PDB:2ACLB; VQLSPEQLGMIEKLVAAQQQCNRRSFSDRLRVTPWPIAPDPQSREARQQRFAHFTELAIV ---3333---------------1111------------------3333------------ SVQEIVDFAKQLPGFLQLSREDQIALLKTSAIEVMLLETSRRYNPGSESITFLKDFSYNR -------3333--1111---------------------3333------------------ EDFAKAGLQVEFINPIFEFSRAMNELQLNDAEFALLIAISIFSADRPNVQDQLQVERLQH ---1111-1111----------3333----------------1111-------------- TYVEALHAYVSINHPHDPLMFPRMLMKLVSLRTLSSVHSEQVFALRLQDKKLPPLLSEIW ----------------3333-----------------3333-3333-------------- DV -- >MUCIN-1; SWP:Q16615; PDB:2ACMA; SFFFLSFHISNLQFNSSLEDPSTDYYQELQRDISEMFLQIYKQGGFLGLSNIKFRPG --------------3333----------------------3333------------- ----------------------------------------------- >ACTINIDAIN PRECURSOR; SWP:P00785; PDB:2ACT; LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQ -----3333--------------3333-------------------------------!! 
NTRGCDGGYITDGFQFIINDGGINTEENYPYTAQDGDCDVALQDQKYVTIDTYENVPYNN !!!!!!--3333------------3333----------3333------------------ EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGTEGGVDYWIV ------3333-----------------------------------------iiii----- KNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPVKY ----1111-iiii-------!!!!%%%%---------- >TRITERPENE UDP-GLUCOSYL T; SWP:Q5IFH7; PDB:2ACVA; MSDINKNSELIFIPAPGIGHLASALEFAKLLTNHDKNLYITVFCIKFPGMPFADSYIKSV ----------------------------------1111---------------------- LASQPQIQLIDLPEVEPPPQELLKSPEFYILTFLESLIPHVKATIKTILSNKVVGLVLDF ---1111-----------3333------------------------------------11 FCVSMIDVGNEFGIPSYLFLTSNVGFLSLMLSLKNRQIEEVFDDSDRDHQLLNIPGISNQ 11------3333-------------------3333-1111-----3333----------- VPSNVLPDACFNKDGGYIAYYKLAERFRDTKGIIVNTFSDLEQSSIDALYDHDEKIPPIY -1111-3333---------------1111--------3333-----------1111---- AVGPLLDLKGQPNPKLDQAQHDLILKWLDEQPDKSVVFLCFGSMGVSFGPSQIREIALGL ------------1111-----------11112222------------------------- KHSGVRFLWSNSAEKKVFPEGFLEWMELEGKGMICGWAPQVEVLAHKAIGGFVSHCGWNS -------------3333-2222-----------------------3333----------- ILESMWFGVPILTWPIYAEQQLNAFRLVKEWGVGLGLRVDYRKGSDVVAAEEIEKGLKDL ----------------!!!!---------------------2222--------------- MDKDSIVHKKVQEMKEMSRNAVVDGGSSLISVGKLIDDITG -1111-------------33332222--------------- >G PROTEIN-COUPLED RECEPTO; SWP:P43250; PDB:2ACXA; KARKGKSKKWRQMLQFPHISQCEELRLSLERDYHSLCERNPIGRLLFREFCATRPELSRC ------1111-------3333----3333------------------------------- VAFLDGVAEYEVTPDDKRKACGRNLTQNFLSHTGPDLIPEVPRQLVTNCTQRLEQGPCKD ---------11113333-------------1111---2222-----------1111---- LFQELTRLTHEYLSVAPFADYLDSIYFNRFLQWKWLERQPVTKNTFRQYRVLGKGGFGEV -----------------------------------1111--1111---------1111-- CACQVRATGKMYACKKLEKKRIKKRKGEAMALNEKQILEKVNSRFVVSLAYAYETKDALC ------------------------------------------1111--------1111-- LVLTLMNGGDLKFHIYHMGQAGFPEARAVFYAAEICCGLEDLHRERIVYRDLKPENILLD ----------------------------------------------------3333---1 
DHGHIRISDLGLAVHVPEGQTIKGRVGTVGYMAPEVVKNERYTFSPDWWALGCLLYEMIA 111------1111---2222-------1111------------3333------------- GQSPFQQRKKKIKREEVERLVKEVPEEYSERFSPQARSLCSQLLCKDPAERLGCRGGSAR ----------------------------------------------33332222--!!!! EVKEHPLFKKLNFKRLGAGMLEPPFKPDPQAIYCEPTDQDFYQKFATGSVPIPWQNEMVE ----3333-------1111----------------------------------------- TECFQELNVFGLDGS --------------- >ACYLPHOSPHATASE; SWP:P41500; PDB:2ACY; AEGDTLISVDYEIFGKVQGVFFRKYTQAEGKKLGLVGWVQNTDQGTVQGQLQGPASKVRH -!!!!-------------------------1111-------------------------- MQEWLETKGSPKSHIDRASFHNEKVIVKLDYTDFQIVK ---------1111------------------------- >SULFOTRANSFERASE 1C2; SWP:O75897; PDB:2AD1A; TKRLSVNYVKGILQPTDTCDIWDKIWNFQAKPDDLLISTYPKAGTTWTQEIVELIQNEGD --------iiii--33331111-3333---1111-----2222---------------11 VEKSKRFPFLEMKISGLEQAHAMPSPRILKTHLPFHLLPPSLLEKNCKIIYVARNPKDNM 11------3333---------------------3333-----1111-------------- VSYYHFQRMNKALPAPGTWEEYFETFLAGKVCWGSWHEHVKGWWEAKDKHRILYLFYEDM ---------1111----3333---------22223333-----------------3333- KKNPKHEIQKLAEFIGKKLDDKVLDKIVHYTVGDWKKHFTVAQNERFDEDYKKKMTDTRL ------------1111------3333--------1111---------------------- TFHF ---- >METHANOL DEHYDROGENASE SU; SWP:P38539; PDB:2AD6A; DADLDKQVNTAGAWPIATGGYYSQHNSPLAQINKSNVKNVKAAWSFSTGVLNGHEGAPLV ---------2222--11111111---------33331111-------------------- IGDMMYVHSAFPNNTYALNLNDPGKIVWQHKPKQDASTKAVMCCDVVDRGLAYGAGQIVK !!!!--------------3333--------------3333-------------%%%%--- KQANGHLLALDAKTGKINWEVEVCDPKVGSTLTQAPFVAKDTVLMGCSGAELGVRGAVNA -1111-------------------3333----------!!!!------1111-------- FDLKTGELKWRAFATGSDDSVRLAKDFNSANPHYGQFGLGTKTWEGDAWKIGGGTNWGWY ----------------3333---111111113333--3333---!!!!------------ AYDPKLNLFYYGSGNPAPWNETMRPGDNKWTMTIWGRDLDTGMAKWGYQKTPHDEWDFAG ---1111------------3333------------------------------------- VNQMVLTDQPVNGKMTPLLSHIDRNGILYTLNRENGNLIVAEKVDPAVNVFKKVDLKTGT ----------iiii--------1111------------------3333------------ 
PVRDPEFATRMDHKGTNICPSAMGFHNQGVDSYDPESRTLYAGLNHICMDWEPFMLPYRA ---3333-------------3333----------------------------------22 GQFFVGATLAMYPGPNGPTKKEMGQIRAFDLTTGKAKWTKWEKFAAWGGTLYTKGGLVWY 22--------------1111---------------------------------------- ATLDGYLKALDNKDGKELWNFKMPSGGIGSPMTYSFKGKQYIGSMYGVGGWPGVGLVFDL -----------------------------------%%%%----------3333------- TDPSAGLGAVGAFRELQNHTQMGGGLMVFSL -1111iiii-----1111------------- >Methanol dehydrogenase su; SWP:P38540; PDB:2AD6B; YDGQNCKEPGNCWENKPGYPEKIAGSKYDPKHDPVELNKQEESIKAMDARNAKRIANAKS -------2222----2222---2222------3333------------------------ SGNFVFDVK --------- >POLYPYRIMIDINE TRACT-BIND; SWP:P26599; PDB:2AD9A; GDSRSAGVPSRVIHIRKLPIDVTEGEVISLGLPFGKVTNLLMLKGKNQAFIEMNTEEAAN ----------------------3333----------------3333-------------- TMVNYYTSVTPVLRGQPIYIQFSNHKELKTDSSPNQAR --------------------------------3333-- >POLYPYRIMIDINE TRACT-BIND; SWP:P26599; PDB:2ADBA; DAGMAMAGQSPVLRIIVENLFYPVTLDVLHQIFSKFGTVLKIITFTKNNQFQALLQYADP ------------------------------------------------------------ VSAQHAKLSLDGQNIYNACCTLRIDFSKLTSLNVKYNNDKSRDYTRPDLPSGDSQPSLDQ ---------------%%%%----------------------------------------- TMAAAFG -3333-- >POLYPYRIMIDINE TRACT-BIND; SWP:P26599; PDB:2ADCA; GRIAIPGLAGAGNSVLLVSNLNPERVTPQSLFILFGVYGDVQRVKILFNKKENALVQMAD ---------------------3333-3333------------------------------ GNQAQLAMSHLNGHKLHGKPIRITLSKHQNVQLPREGQEDQGLTKDYGNSPLHRFKKPGS ----------2222------------------------3333-----------------3 KNFQNIFPPSATLHLSNIPPSVSEEDLKVLFSSNGGVVKGFKFFQKDRKMALIQMGSVEE 333---------------33333333-----3333--------3333------------- AVQALIDLHNHDLGENHHLRVSFSKSTI -------2222----------------- >VON WILLEBRAND FACTOR; SWP:NA; PDB:2ADFH; QIQLVQSGPELKKPGETVKISCKASGYTFINYGMNWVKQAPGKGLKWMGWKNTNTGETTY ------------2222-----------1111--------2222----------------- GEEFRGRFAFSLETSVSTAYLQINNLKNEDTATYFCARDNPYYALDYWGQGTTVTVSSAK 1111---------1111---------3333------------------------------ 
TTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLY -----------%%%%-!!!!------------------%%%%-------------%%%%- TLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPR --------3333------------1111---------- >VON WILLEBRAND FACTOR; SWP:NA; PDB:2ADFL; DIQMTQSPSSLSASLGGKVTITCKASQDINKYIAWYQHKPGKGPRLLIHYTSTLQPGIPS -------------2222-----------iiii------2222------------222211 RFSGSGSGRDYSFSISNLEPEDIATYYCLQYDNLRTFGGGTKLEIKRADAAPTVSIFPPS 11----!!!!--------3333-------------------------------------- SEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL ----------------------------iiii---------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFN 3333------------------------- >Q425 FAB LIGHT CHAIN; SWP:NA; PDB:2ADGA; ETTVTQSPASLSVAIGEKVTIRCITSTDIDDDMNWYQQKPGEPPKFFISEGNTLRPGVPS -------------2222-------------------------------------222211 RFSSSGYGTDFVFTIENMLSEDVADYYCLQSDTLPLTFGSGTKLEIKRADAAPTVSIFPP 11----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------------------------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNR -----------------1111---------- >Q425 FAB LIGHT CHAIN; SWP:NA; PDB:2ADGB; EVQLVESGGDLVKPGGSLKLSCAASGFTFSSYGMSWVRQTPDKGLEWVATISSGGSYTYY ------------2222-----------3333--------------------3333----- PDNVKGRFTISRDNAKNTLYLQMSSL 3333---------------------- >CCDA; SWP:Q9S0Z5; PDB:2ADLA; MKQRITVTVDSDSYQLLKAYDVNISGLVSTTMQNEARRLRAERWKVENQEGMVEVARFIE -----------3333--------3333--------------------------------- MNGSFADENKDW ------------ >ADR1; SWP:P07248; PDB:2ADR; RSFVCEVCTRAFARQEHLKRHYRSHTNEKPYPCGLCNRAFTRRDLLIRHAQKIHSGNLGE -------------------3333------------------------------------- >ALPHA-1-SYNTROPHIN; SWP:Q61234; PDB:2ADZA; ASGRRAPRTGLLELRCGAGSGAGGERWQRVLLSLAEDALTVSPADGEPGPEPEPAQLNGA ------------------------------------------------------------ AEPGAAPPQLPEALLLQREVSPYFKNSAGGTSVGWDSPPASPLQRQPSSPGPQPRNLSEA -------------3333------------------------------------------- 
KHVSLKMAYVSRRCTPTDPEPRYLEICAADGQDAVFLRAKDEASARSWAGAIQAQIGT ---------------------------------------------------------- >Membrane-bound lytic mure; SWP:P0A935; PDB:2AE0X; SKPTDRGQQYKDGKFTQPFSLVNQPDAVGAPINAGDFAEQINHIRNSSPRLYGNQSNVYN ---1111-----------------------------------------------3333-- AVQEWLRAGGDTRNMRQFGIDAWQMEGADNYGNVQFTGYYTPVIQARHTRQGEFQYPIYR ----------1111------------------------------------!!!!------ MPPKRGRLSSRAEIYAGALSDKYILAYSNSLMDNFIMDVQGSGYIDFGDGSPLNFFSYAG --3333-------1111--3333------------------------------------- KNGHAYRSIGKVLIDRGEVKKEDMSMQAIRHWGETHSEAEVRELLEQNPSFVFFKPQSFA -------------1111--3333---------1111------------------------ PVKGASAVPLVGRASVASDRSIIPPGTTLLAEVPLLDNNGKFNGQYELRLMVALDVGGAI ---1111---2222----3333-2222---------1111----------------1111 KGQHFDIYQGIGPEAGHRAGWYNHYGRVWVLKTAP !!!!------------------------------- >TROPINONE REDUCTASE-II; SWP:P50163; PDB:2AE2A; AGRWNLEGCTALVTGGSRGIGYGIVEELASLGASVYTCSRNQKELNDCLTQWRSKGFKVE -----2222-------------------------------------------1111---- ASVCDLSSRSERQELMNTVANHFHGKLNILVNNAGIVIYKEAKDYTVEDYSLIMSINFEA ----1111--------------%%%%--------------1111---------------- AYHLSVLAHPFLKASERGNVVFISSVSGALAVPYEAVYGATKGAMDQLTRCLAFEWAKDN ------------------------3333---2222--------------------3333- IRVNGVGPGVIATSLVEMTIQDPEQKENLNKLIDRCALRRMGEPKELAAMVAFLCFPAAS -----------------------------------1111---3333---------3333- YVTGQIIYVDGGLMANCGF ---------iiii------ >ACETYLTRANSFERASE, GNAT F; SWP:Q839D1; PDB:2AE6A; STSLTIRLVAEADWPALHALDQIISLAAYQEKKDETIFVAISGQQLAGFIEVHPPTSLAA ---------3333------3333---3333---------------------------333 HQKQWLLSIGVSPDFQDQGIGGSLLSYIKDAEISGIHKLSLRVATNQEAIRFYEKHGFVQ 3----------1111----------------1111------------------1111--- EAHFKEEFYINGHYCDDYQYAYFI ------------------------ >IMIDAZOLEGLYCEROL-PHOSPHA; SWP:P64373; PDB:2AE8A; IYQKQRTQLNISISDDQSPSHINTGVGFLNHLTLFTFHSGLSLNIEAQGDDHHVTEDIGI -------------------------3333------------------------------- 
VIGQLLLEIKDKKHFVRYGTYIPDETLARVVVDISGRPYLSFNASLSKEKVGTFDTELVE --------3333--------------------------------------!!!!------ EFFRAVVINARLTTHIDLIRGGNTHHEIEAIFKAFSRALGIALTAT ---------------------------------------------- >ARGINASE 1; SWP:P05089; PDB:2AEBA; SRTIGIIGAPFSKGQPRGGVEEGPTVLRKAGLLEKLKEQECDVKDYGDLPFADIPNDSPF -----------------------------------------------------------! QIVKNPRSVGKASEQLAGKVAEVKKNGRISLVLGGDHSLAIGSISGHARVHPDLGVIWVD !!!-----------------------------------------------1111------ AHTDINTPLTTTSGNLHGQPVSFLLKELKGKIPDVPGFSWVTPCISAKDIVYIGLRDVDP ------1111--------3333--3333------2222-------1111----------- GEHYILKTLGIKYFSMTEVDRLGIGKVMEETLSYLLGRKKRPIHLSFDVDGLDPSFTPAT -----------------------------------3333--------1111-3333---- GTPVVGGLTYREGLYITEEIYKTGLLSGLDIMEVNPSLGKTPEEVTRTVNTAVAITLACF ----------------------------------3333-------------------111 GLAREGNHKPIDYL 1-3333-------- >OROTATE PHOSPHORIBOSYLTRA; SWP:Q9A076; PDB:2AEEA; AMTLASQIATQLLDIKAVYLKPEDPFTWASGIKSPIYTDNRVTLSYPKTRDLIENGFVET --------------------1111----iiii--------3333---------------- IKAHFPEVEVIAGTATAGIPHGAIIADKMTLPFAYIRSKPKGNQIEGRVLKGQKMVIIED ----1111-------1111------------------------------2222------- LISTGGSVLDAAAAASREGADVLGVVAIFTYELPKASQNFKEAGIKLITLSNYTELIAVA --------------------------------3333------------------------ KLQGYITNDGLHLLKKFKEDQVNWQ ------------------------- >CALCIUM-GATED POTASSIUM C; SWP:O27564; PDB:2AEFA; RHVVICGWSESTLECLRELRGSEVFVLAEDENVRKKVLRSGANFVHGDPTRVSDLEKANV --------------3333-----------3333--------------1111----11112 RGARAVIVDLESDSETIHCILGIRKIDESVRIIAEAERYENIEQLRMAGADQVISPFVIS 222----------------------------------3333----3333-----3333-- GRLMSRSIDDGYEAMFVQDVLATRRMVEVPIPEGSKLEGVSVLDADIHDVTGVIIIGVGR ---1111----------1111----------2222-----3333--3333---------- GDELIIDPPRDYSFRAGDIILGIGKPEEIERLKNYISALVP --------1111--2222----------------------- >HYPOTHETICAL PROTEIN AGR_; SWP:Q8UKK6; PDB:2AEGA; CNLYREDKDWVSKWAQDAESLINLPAYQNPDQGPIVRNTADGKKQLVHARWGLPSPIFVQ 
---------3333-1111----------2222------1111------------------ KKAAEARADKLKAKGKAFDINELIREPDRGVTNVRKLNLPHWTRWFGVEHRCLVPVTSFA -----------1111--------------------11113333---3333---------- EPDPASKQEGGNVPNAWFARDEAKSLFFAGIHVPQWKSVRKVRDGLTTDDLYGFLTTDPN --1111--------------1111----------------3333---------------- DLVKPIHEKAPVLLLTREETEIWRAPWDEAKHLARPLPNDALIILSREPYGSSIV --3333-------------------33333333----1111-------2222--- >OUTER CAPSID PROTEIN VP4,; SWP:P11196; PDB:2AENA; TVEPVLDGPYQPTTFKPPNDYWLLISSNTNGVVYESTNNNDFWTAVIAVEPHVSQTNRQY ------------------------------------------------------------ ILFGENKQFNVENNSDKWKFFEMFKGSSQGDFSNRRTLTSSNRLVGMLKYGGRVWTFHGE -iiii---------------------1111-------------------iiii------- TPRATTDSSNTADLNNISIIIHSEFYIIPRSQESKCNEYINNGL ------------3333------------3333------------ >NEURAMINIDASE; SWP:Q80DL0; PDB:2AEPA; AEYRNWSKPQCKITGFAPFSKDNSIRLSAGGDIWVTREPYVSCDPDKCYQFALGQGTTLN ----------------------33331111------------------------------ NRHSNDTVHDRTPYRTLLMNELGVPFHLGTKQVCIAWSSSSCHDGKAWLHVCVTGHDENA 1111-------1111-----2222--1111-------------------------1111- TASFIYDGRLVDSIGSWSKKILRTQESECVCINGTCTVVMTDGSASGRADTKILFIEEGK -----iiii----------------------iiii---------------------iiii IVHISPLSGSAQHVEECSCYPRYPGVRCVCRDNWKGSNRPIVDINVKDYSIVSSYVCSGL ------------------------------------------------------------ VGDTPRKNDSSSSSHCLNPNNEEGGHGVKGWAFDDGNDVWMGRTISEKFRSGYETFKVIE -------3333-----------------------!!!!---------------------3 GWSKPNSKLQINRQVIVDRGNRSGYSGIFSVEGKSCINRCFYVELIRGRKQETEVWWTSN 3332222----------1111--------------------------------------- SIVVFCGTSGTYGTGSWPDGADINLMPI ---------------------1111--- >Ig kappa chain V region M; SWP:P84750; PDB:2AEPH; EVKLVESGGGLVQPGGSLSLSCATSGFTFIDYYMSWFRQPPGKALEWLGLIRNK ---------------------------3333--------2222----------- >Ig kappa chain V region M; SWP:P84750; PDB:2AEPL; DILMTQSQKFLSTSVGDRVSVTCKASQNVGTNVAWYQKKPGQSPKPLMYSASYRYSGVPD -------------2222---------------------2222-------1111-2222-- 
RFTGSGSGTDFTLTISNVQSEDLAEYFCQQFNRYPLTFGSGTKLELKRADAAPLNNFYPK ------------------1111-------------------------------------- DTDQDSKDS --------- >HYPOTHETICAL PROTEIN MJ01; SWP:Q57622; PDB:2AEUA; LRLEKARKIILEILNEKGRDALYDLSGLSGGFLIDEKDKALLNTYIGSSYFAEKVNEYGL -----------------3333--------------------------------------- KHLGGDENDKCVGFNRTSSAILATILALKPKKVIHYLPELPGHPSIERSCKIVNAKYFES -----1111--------------------------------------------------- DKVGEILNKIDKDTLVIITGSTMDLKVIELENFKKVINTAKNKEAIVFVDDASGARVRLL -33331111-1111-------3333---------------------------------11 FNQPPALKLGADLVVTSTDKLMEGPRGGLLAGKKELVDKIYIEGTKFGLEAQPPLLAGIY 11--3333---------------------------------------------------- RALKNFNLERIRKAFERAKNFDLSKIEKLNKELKAIDDNINIVYERTPTGFVIKRVYKDD ------3333------------3333------33331111------1111---------- TINIKKLIEIGFNLLKNYGIITITVAGMPGASKSLRIDLTSRDAERIDDNYIIKAIVESI ----------------------3333-----------11113333--------------3 KMAFKS 333--- >COPROPORPHYRINOGEN III OX; SWP:P36551; PDB:2AEXA; EEDELAHRCSSFMAPPVTDLGELRRRPGDMKTKMELLILETQAQVCQALAQVDGGANFSV --------1111------3333---1111------------------------------- DRWERKEGGGGISCVLQDGCVFEKAGVSISVVHGNLSEEAAKQMRSRGKVLKTKDGKLPF ----1111------------------------------------1111-----!!!!--- CAMGVSSVIHPKNPHAPTIHFNYRYFEVEEADGNKQWWFGGGCDLTPTYLNQEDAVHFHR ------------1111-------------------------------------------- TLKEACDQHGPDLYPKFKKWCDDYFFIAHRGERRGIGGIFFDDLDSPSKEEVFRFVQSCA ---------1111-------------3333------------------------------ RAVVPSYIPLVKKHCDDSFTPQEKLWQQLRRGRYVEFNLLYDRGTKFGLFTPGSRIESIL ---3333------1111---------------------------1111-----------1 MSLPLTARWEYMHSPSENSKEAEILEVLRHPRDWVR 111------------1111----------------- >REGULATOR OF G-PROTEIN SI; SWP:P41220; PDB:2AF0A; ADLGTENLYFQSMKPSPEEAQLWSEAFDELLASKYGLAAFRAFLKSEFCEENIEFWLACE ---------3333--------33333333---------------1111------------ DFKKTKSPQKLSSKARKIYTDFIEKEAPKEINIDFQTKTLIAQNIQEATSGCFTTAQKRV -1111------------------2222---------------------1111-------- YSLMENNSYPRFLESEFYQDLCKKPQ 
-------------------------- >Phosphate acetyltransfera; SWP:P38503; PDB:2AF4C; VTFLEKISERAKKLNKTIALPETEDIRTLQAAAKILERGIADIVLVGNEADIKALAGDLD ---------3333--------1111----------------------------3333--- LSKAKIVDPKTYEKKDEYINAFYELRKHKGITLENAAEIMSDYVYFAVMMAKLGEVDGVV 1111---3333--------------3333--------1111---------1111------ SGAAHSSSDTLRPAVQIVKTAKGAALASAFFIISVPDEYGSDGTFLFADSGMVEMPSVED -----3333-----------!!!!----------------iiii---------------- VANIAVISAKTFELLVQDVPKVAMLSYSTKGSAKSKLTEATIASTKLAQELAPDIAIDGE ----------------------------iiii-------------------1111----- LQVDAAIVPKVAASKAPGSPVAGKANVFIFPDLNCGNIAYKIAQRLAKAEAYGPITQGLA --------3333---2222-2222------------------------------------ KPINDLSRGCSDEDIVGAVAITCVQAAAQDK ------------------------------- >Engineered Outer Surface ; SWP:P14013; PDB:2AF5A; NSVSVDLPGEMKVLVSKEKNKDGKYDLIATVDKLELKGTSDKNNGSGVLEGVKADKSKVK -------------------1111-----------------------------1111---- LTISDDLGQTTLEVFKEDGKTLVSKKVTSKDKSSTEEKFNEKGELSEKKITRADKSSTEE ---1111--------3333---------1111-------1111--------1111----- KFNEKGELSEKKITRADKSSTEEKFNEKGEVSEKIITRADGTRLEYTGIKSDGSGKAKEV --------------1111-------1111--------1111--------1111------- LKGYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGEVSVELNDTDSSAATKKTAAWNSGTS ----------3333------!!!!------3333-----------3333-------1111 TLTITVNSKKTKDLVFTKENTITVQQYDSNGTKLEGSAVEITKLDEIKNALK -----%%%%------------------1111----------------3333- >THYMIDYLATE SYNTHASE THYX; SWP:P66930; PDB:2AF6A; ETAPLRVQLIAKTDFLAPPDVPWTTDADGGPALVEFAGRACYQSWSKPNPKTATNAGYLR -----------------1111----------------3333-------3333-------- HIDVGHFSVLEHASVSFYITGISRSCTHELIRHRHFSYSQLSQRYVPEKDSRVVVPPGED -----------------------------------------3333--1111--------- DADLRHILTEAADAARATYSELLAKLEAKFADQPNAILRRKQARQAARAVPNATETRIVV -----------------------------3333-3333----3333----1111------ TGNYRAWRHFIARASEHADVEIRRLAIECLRQLAAVAPAVFADFEVTTLADGTEVATSPL --------------3333------------------33333333----1111-----111 A 1 >GAMMA-CARBOXYMUCONOLACTON; SWP:O26336; PDB:2AF7A; 
ERYRRGEILNRNRKSYTAIRDELEDVAPDLARFVAEFAYGDVYSRGVLDLKTRELLTLAA 3333------------3333---------------------1111---3333-------- LTVLRADDQLKSHVRGALNAGCSKDEIIEVIQAVYAGFPAAINAVLAAKEVFTE -----------------1111---------------3333-------------- >NAG ISOMERASE; SWP:Q8ZKT7; PDB:2AFAA; LKWFNTLSHNRWLEQETDRIFNFGKNAVVPTGFGWLGNKGQIKEEGTHLWITARLHVYSV --2222----------------3333--1111----1111-------------------- AASGRPGAYDLVDHGIKANGALRDKKYGGWYACVNDQGVVDASKQGYQHFFALLGAASAV -------------------3333-----------3333-------------------333 TTGHPEARKLLDYTIEVIEKYFWSEEEQCLESWDEAFSQTEDYRGGNANHAVEAFLIVYD 3--1111----------------3333------1111--------3333----------- VTHDKKWLDRALRIASVIIHDVARNGDYRVNEHFDSQWNPIRDYNKDNPAHRFRAYGGTP ---3333--------------3333%%%%-----1111--11113333----------33 GHWIEWGRLLHLHAALEARFETPPAWLLEDAKGLFHATIRDAWAPDGADGFVYSVDWDGK 33---------------------3333----------------------------1111- PIVRERVRWPIVEAGTAYALYTLTDDSQYEEWYQKWWDYCIKYLDYENGSWWQELDADNK -------------------------------------------------------1111- VTTGKQDIYHLLHCLVIPRLPLAPGLAPAVAAGLLDINA -------3333----3333--------------2222-- >2-KETO-3-DEOXYGLUCONATE K; SWP:NA; PDB:2AFBA; HKVVTFGEILRLSPPDHKRIFQTDSFDVTYGGAEANVAAFLAQLDAYFVTKLPNNPLGDA --------------%%%%3333-------------------------------------- AAGHLRKFGVKTDYIARGGNRIGIYFLEIGASQRPSKVVYDRAHSAISEAKREDFDWEKI -----1111--1111---------------!!!!-------22223333-1111-3333- LDGARWFHFSGITPPLGKELPLILEDALKVANEKGVTVSCDLNYRARLWTKEEAQKVIPF 2222-----3333---1111------------------------3333------------ EYVDVLIANEEDIEKVLGISVEGLNREAYAKIAEEVTRKYNFKTVGITLRESISATVNYW ------------------------------------------------------------ SVVFENGQPHFSNRYEIHIVDRVGAGDSFAGALIYGSLGFDSQKKAEFAAAASCLKHTIP ----iiii-------------2222------------------------------1111- GDFVVLSIEEIEKLASG ----------------- >PROTEIN ASL1650; SWP:NA; PDB:2AFDA; GSHMKTIQPASVEDIQSWLIDQFAQQLDVDPDDIDMEESFDNYDLNSSKALILLGRLEKW ----------------------3333---3333-11113333----3333---------- LGKELNPVLIFNYPTIAQLAKRLGELYL -----3333------------------- >MKI67 
FHA domain-interact; SWP:Q9BYG3; PDB:2AFFB; VDQGPPVCPTFLERRKSQVAELNDDDKDDEIVFKQPI -----------------------3333---------- >Nitrogenase iron protein ; SWP:P00459; PDB:2AFHE; AMRQCAIYGKGGIGKSTTTQNLVAALAEMGKKVMIVGCDPKADSTRLILHSKAQNTIMEM --------------------------1111--------------3333-----------3 AAEAGTVEDLELEDVLKAGYGGVKCVESGGPEPGVGCAGRGVITAINFLEEEGAYEDDLD 333--1111--3333---2222---------2222--------------1111------- FVFYDVLGDVVCGGFAMPIRENKAQEIYIVCSGEMMAMYAANNISKGIVKYANSGSVRLG -----------3333---1111----------------------------1111------ GLICNSRNTDREDELIIALANKLGTQMIHFVPRDNVVQRAEIRRMTVIEYDPKAKQADEY --------2222----------------------------1111-3333-11113333-- RALARKVVDNKLLVIPNPITMDELEELLMEFGIMEVEDESIVGKTAEEV ----------------------------1111-----3333---3333- >GENE RICH CLUSTER, C9 GEN; SWP:O88838; PDB:2AFJA; GSSARQSTPTSQALYSDFSPPEGLEELLSAPPPDLVAQRHHGWNPKDCSENIDVKEGGLC ----------------------33333333------------------------------ FERRPVAQSTDGVRGKRGYSRGLHAWEISWPLEQRGTHAVVGVATALAPLQADHYAALLG -----3333-------------------------------------------3333---- SNSESWGWDIGRGKLYHQSKGLEAPQYPAGPQGEQLVVPERLLVVLDMEEGTLGYSIGGT -----------------------------1111--------------1111-----%%%% YLGPAFRGLKGRTLYPSVSAVWGQCQVRIRYMGERRVEETRRIHRD --1111-3333----------3333--------------------- >SEA RAVEN TYPE II ANTIFRE; SWP:P05140; PDB:2AFPA; QRAGPNCPAGWQPLGDRCIYYETTAMTWALAETNCMKLGGHLASIHSQEEHSFIQTLNAG ---------------------------3333----------------3333--3333--- VVWIGGSACLQAGAWTWSDGTPMNFRSWCSTKPDDVLAACCMQMTAAADQCWDDLPCPAS -----------------------------------3333--------------------- HKSVCAMTF --------- >COBALAMIN BIOSYNTHESIS PR; SWP:Q8EXP7; PDB:2AFRA; QITNLGRNIENKSFSIIDEEAGPHSFAQEEWEVVRRIIHATADFDYKNITKIHPQAIDSG --------------------------3333------------3333------1111---- IQALKKGCPIVCDVQMILSGLNPERLKVYGCKTYCFISDEDVIENAKRKNSTRAIESIQK ---1111------33331111----3333------1111--------------------- ANSFNLLNESIIVIGNAPTALLEIEKLIRQEGIKPALIVGVPVGFVSAKESKESILKLEY 
----1111--------3333---------------------------------------- YNVTSIPYILTMGRKGGSTIAVAILHALLLLSSKRG ------------------------------------ >GLUTAMINYL-PEPTIDE CYCLOT; SWP:Q16769; PDB:2AFWA; ASAWPEEKNYHQPAILNSSALRQIAEGTSISEMWQNDLQPLLIERYPGSPGSYAARQHIM --33333333---------------------------3333----2222----------- QRIQRLQADWVLEIDTFLSQTPYGYRSFSNIISTLNPTAKRHLVLACHYDSKYFSHWNNR --1111--------------1111-----------1111-----------------%%%% VFVGATDSAVPCAMMLELARALDKKLLSLKPDLSLQLIFFDGEEAFLHWSPQDSLYGSRH ---1111----------------3333----------------------1111------- LAAKMASTPHPPGARGTSQLHGMDLLVLLDLIGAPNPTFPNFFPNSARWFERLQAIEHEL ----------2222---3333---------------------1111-------------- HELGLLKDHSLEGRYFQNYSYGGVIQDDHIPFLRRGVPVLHLIPSPFPEVWHTMDDNEEN 1111-----3333-------------3333-1111------------1111-11113333 LDESTIDNLNKILQVFVLEYLHL ----------------------- >BENZALDEHYDE LYASE; SWP:Q9F4L3; PDB:2AG0A; AMITGGELVVRTLIKAGVEHLFGLHGAHIDTIFQACLDHDVPIIDTRHEAAAGHAAEGYA -------------1111--------3333------------------------------- RAGAKLGVALVTAGGGFTNAVTPIANAWLDRTPVLFLTGSGALRDDETNTLQAGIDQVAM ------------!!!!---------------------------------2222------- AAPITKWAHRVMATEHIPRLVMQAIRAALSAPRGPVLLDLPWDILMNQIDEDSVIIPDLV 1111--------3333------------------------3333-----1111------- LSAHGARPDPADLDQALALLRKAERPVIVLGSEASRTARKTALSAFVAATGVPVFADYEG -------------------1111---------3333--------------------3333 LSMLSGLPDAMRGGLVQNLYSFAKADAAPDLVLMLGARFGLNTGHGSGQLIPHSAQVIQV -1111--3333---333311111111-------------3333!!!!----1111----- DPDACELGRLQGIALGIVADVGGTIEALAQATAQDAAWPDRGDWCAKVTDLAQERYASIA --3333-----------------------------------------------------1 AKSSSEHALHPFHASQVIAKHVDAGVTVVADGALTYLWLSEVMSRVKPGGFLCHGYLGSM 111-2222----------11111111----------------1111--------3333-- GVGFGTALGAQVADLEAGRRTILVTGDGSVGYSIGEFDTLVRKQLPLIVIIMNNQSWGAT --------------1111-------33333333-------1111---------------- LHFQQLAVGPNRVTGTRLENGSYHGVAAAFGADGYHVDSVESFSAALAQALAHNRPACIN ---------------------------1111------------------1111------- 
VAVALDPIPPEELI --------3333-- >GANGLIOSIDE GM2 ACTIVATOR; SWP:P17900; PDB:2AG4A; HMSSFSWDNCDEGKDPAVIRSLTLEPDPIVVPGNVTLSVVGSTSVPLSSPLKVDLVLEKE ----------3333---------------------------------------------- VAGLWIKIPCTDYIGSCTFEHFCDVLDMLIPTGEPCPEPLRTYGLPCHCPFKEGTYSLPK ----------iiii----------------2222----3333------------------ SEFVVPDLELPSWLTTGNYRIESVLSSSGKRLGCIKIAASLKGI ----------1111------------iiii-------------- >DEHYDROGENASE/REDUCTASE (; SWP:Q9BUT1; PDB:2AG5A; MGRLDGKVIILTAAAQGIGQAAALAFAREGAKVIATDINESKLQELEKYPGIQTRVLDVT -1111---------------------1111--------3333------2222-----111 KKKQIDQFANEVERLDVLFNVAGFVHHGTVLDCEEKDWDFSMNLNVRSMYLMIKAFLPKM 1-------3333----------------3333-3333----------------------- LAQKSGNIINMSSVASSVKGVVNRCVYSTTKAAVIGLTKSVAADFIQQGIRCNCVCPGTV 1111-----------3333-------------------------1111------------ DTPSLQERIQARGNPEEARNDFLKRQKTGRFATAEEIAMLCVYLASDESAYVTGNPVIID ---------------------33331111----------------3333----------i GGWSLG iii--- >TYROSYL-TRNA SYNTHETASE; SWP:Q57834; PDB:2AG6A; MDEFEMIKRNTSEIISEEELREVLKKDEKSALIGFEPSGKIHLGHYLQIKKMIDLQNAGF -------2222-----------3333-----------------------------1111- DIIILLADLHAYLNQKGELDEIRKIGDYNKKVFEAMGLKAKYVYGSSFQLDKDYTLNVYR -----------1111----------------------------33331111--------- LALKTTLKRARRSMELIAREDENPKVAEVIYPIMQVNPLHYEGVDVAVGGMEQRKIHMLA 3333-------1111---------3333--------11112222-----3333------- RELLPKKVVCIHNPVLTGLDGEGKMSSSKGNFIAVDDSPEEIRAKIKKAYCPAGVVEGNP -----------------1111----3333----1111--------------22222222- IMEIAKYFLEYPLTIKRPEKFGGDLTVNSYEELESLFKNKELHPMDLKNAVAEELIKILE -----------------3333--------------------------------------- PIRKRLL ------- >MACHADO-JOSEPH DISEASE PR; SWP:P54252; PDB:2AGAA; GPLGSMESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTS ---3333----------------------------------------------------- EDYRTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICN --------------------3333----------------3333-----3333------- YKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEAD 
-------------------1111-----1111------------------------3333 QLLQMIRVQQ 3333------ >GANGLIOSIDE GM2 ACTIVATOR; SWP:Q60648; PDB:2AGCA; GGFSWDNCDEGKDPAVIKSLTIQPDPIVVPGDVVVSLEGKTSVPLTAPQKVELTVEKEVA -------%%%%-----------------------------------------------ii GFWVKIPCVEQLGSCSYENICDLIDEYIPPGESCPEPLHTYGLPCHCPFKEGTYSLPTSN ii---------------------------------------------------------- FTVPDLELPSWLSTGNYRIQSILSSGGKRLGCIKIAASLKGR --------3333------------iiii-------------- >Zinc finger protein HRX; SWP:Q03164; PDB:2AGHC; SDDGNILPSDIMDFVLKNTPSMQALGESPES -------3333-------------------- >YVO FAB, LIGHT CHAIN; SWP:NA; PDB:2AGJH; VTLKESGPTLVKPTQTLTLTCTFSGFSLTTTGEGVGWIRQPPGKALEFLAFIYWNDAKRY ----------------------------------------2222---------------- NPSLQSRLTITKDASKKQVVLTLTNLDPVDTATYYCARTSGWDIEFEYWGQGTLVTVSSG ------------3333----------3333------------------------------ SASAPTLFPLVSCENSSPSSTVAVGCLAQDFLPDSITFSWKYKNNSDISSTRGFPSVLRG ------------------------------------------------------------ GKYAATSQVLLPSKDVMQGTDEHVVCKVQHPNGNKEKDVPLPVVI -----------3333------------------------------ >Putative uncharacterized ; SWP:Q6P5S8; PDB:2AGJL; EIVLTQSPGTLSLSPGERATLSCRASETVSNDKVAWYQQKPGQAPRLLIYGASSRATGIP -------------2222-----------2222---------------------------- DRFSGSGSGTDFTLSISGLEPEDFVVYYCQQYASSPRTFGQGTKVEIKRTVAAPSVFIFP -------------------3333------------------------------------- PSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTL ---3333-----------------------iiii-------------------------- TLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC --33331111--------1111------------- >1-(5-phosphoribosyl)-5-[(; SWP:P40545; PDB:2AGKA; TKFIGCIDLHNGEVKQQHPSSYYAKLYKDRDVQGCHVIKLGPNNDDAAREALQESPQFLQ ---------iiii-----3333-------------------------------------- VGGGINDTNCLEWLKWASKVIVTSWLFTKEGHFQLKRLERLTELCGKDRIVVDLSCRKTQ -----1111-3333-------------1111--------------1111----------2 DGRWIVAMNKWQTLTDLELNADTFRELRKYTNEFLIHAGGIDELLVSKLFEWTKDYDDLK 222-----iiii---------------1111---------------------1111---- 
IVYAGGAKSVDDLKLVDELSHGKVDLTFGSSLDIFGGNLVKFEDCCRWNEKQG -------------------iiii-----33331111------------3333- >POLY(BETA-D-MANNURONATE) ; SWP:Q44493; PDB:2AGMA; GSDGEPLVGGDTDDQLQGGSGADRLDGGAGDDILDGGAGRDRLSGGAGADTFVFSAREDS ------------------------------------------------------------ YRTDTAVFNDLILDFEASEDRIDLSALGFSGLGDGYGGTLLLKTNAEGTRTYLKSFEADA --1111---------3333-------------------------1111----------11 EGRRFEVALDGDHTGDLSAANVVFAATGTTTELEVLGDSGTQAGAIV 11---------------------%%%%-------------------- >AROMATIC AMINE DEHYDROGEN; SWP:Q0VKG7; PDB:2AGYA; REVLTGGHSVSAPQENRIYVMDSVFMHLTESRVHVYDYTNGKFLGMVPTAFNGHVQVSND ------------3333-------3333------------------------------111 GKKIYTMTTYHERITRGKRSDVVEVWDADKLTFEKEISLPPKRVQGLNYDGLFRQTTDGK 1-----------------------------------------------1111-------- FIVLQNASPATSIGIVDVAKGDYVEDVTAAAGCWSVIPQPNRPRSFMTICGDGGLLTINL -----------------3333-----3333--------1111-------1111------- GEDGKVASQSRSKQMFSVKDDPIFIAPALDKDKAHFVSYYGNVYSADFSGDEVKVDGPWS 1111------------3333-----------------1111------------------- LLNDEDKAKNWVPGGYNLVGLHRASGRMYVFMHPDGKEGTHKFPAAEIWVMDTKTKQRVA --3333----------------1111----------2222-------------------- RIPGRDALSMTIDQQRNLMLTLDGGNVNVYDISQPEPKLLRTIEGAAEASLQVQFHPVGG ---%%%%------1111------------------------------------------- >Aralkylamine dehydrogenas; SWP:Q0VKG6; PDB:2AGYD; EVNSCDYWRHCAVDGFLCSCCGGTTTTCPPGSTPSPISIGTCHNPHDGKDYLISYHDCCG 1111--3333------3333--------2222---------------------------- KTACGRCQCNTQTRERPGYEFFLHNDVNWCMANENSTFHCTTSVLVGL -----------2222-1111-------1111----------------- >COG0546: PREDICTED PHOSPH; SWP:Q97T51; PDB:2AH5A; TSITAIFFDLDGTLVDSSIGIHNAFTYTFKELGVPSPDAKTIRGFGPPLESSFATCLSKD -----------------------------------------3333------------333 QISEAVQIYRSYYKAKGIYEAQLFPQIIDLLEELSSSYPLYITTTKDTSTAQDAKNLEIH 3--------------3333----2222------1111-----------------111133 HFFDGIYGSSPEAPHKADVIHQALQTHQLAPEQAIIIGDTKFDLGARETGIQKLAITWGF 33-------3333-----------1111-3333------3333----------------- GEQADLLNYQPDYIAHKPLEVLAYFQ 
------1111------3333-3333- >BH1595, UNKNOWN CONSERVED; SWP:Q9KCH6; PDB:2AH6A; DTRVVAYGTTDELNSFVGSAITQLDENTFADIRGELFKIQHELFDCGGDLALPYKAKQEI --------------------1111--------------------------------3333 VDFLEQRIDAYIKEAPELERFILPGGSEAAASLHVCRTIARRAERYVVRLQQEGEINPIV ------------------------------------------------------------ LKYLNRLSDYFFAVARVVNSRLQVPDVEYE ------------------------------ >GREEN FLUORESCENT PROTEIN; SWP:P42212; PDB:2AHAA; SKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGDLTLKFISTTGKLPVPWPTLV -3333---------------iiii-----------1111---------------3333-3 TTFVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIE 3333333---333311113333----------------------------!!!!------ LKGIDFKEDGNILGHKLEYNYNCHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQ ------1111-1111--------------------------------------------- NTPIGDGPVLLPDNHYLSTCSALSKDPNEKRDHMVLLERVTAAGI -------------------------1111---------------- >CHLORIDE INTRACELLULAR CH; SWP:Q9Y696; PDB:2AHEA; EPLIELFVKAGSDGESIGNCPFSQRLFMILWLKGVVFSVTTVDLKRKPADLQNLAPGTHP ----------1111----------------------------1111--------2222-- PFITFNSEVKTDVNKIEEFLEEVLCPPKYLKLSPKHPESNTAGMDIFAKFSAYIKNSRPE ----iiii---------------------------3333------------------333 ANEALERGLLKTLQKLDEYLNSPLSTRKFLDGNEMTLADCNLLPKLHIVKVVAKKYRNFD 3---------------------------1111---------------------------- IPKEMTGIWRYLTNAYSRDEFTNTCPSDKEVEIAYSDVAKRLPSKVPK -3333-------------3333----------1111------------ >UNSATURATED GLUCURONYL HY; SWP:Q9RC92; PDB:2AHFA; MWQQAIGDALGITARNLKKFGDRFPHVSDGSNKYVLNDNTDWTDGFWSGILWLCYEYTGD -------------------!!!!-------------------3333-------------- EQYREGAVRTVASFRERLDRFENLDHHNIGFLYSLSAKAQWIVEKDESARKLALDAADVL ------------------------------------------------------------ MRRWRADAGIIQAWGPKGDPENGGRIIIDCLLNLPLLLWAGEQTGDPEYRRVAEAHALKS -----1111------2222---------3333---------------------------- RRFLVRGDDSSYHTFYFDPENGNAIRGGTHQGNTDGSTWTRGQAWGIYGFALNSRYLGNA -----1111------------------------1111----------------------- 
DLLETAKRMARHFLARVPEDGVVYWDFEVPQEPSSYRDSSASAITACGLLEIASQLDESD -------------11111111----------1111-----------------11111111 PERQRFIDAAKTTVTALRDGYAERDDGEAEGFIRRGSYHVRGGISPDDYTIWGDYYYLEA ---------------------------------------1111----------------- LLRLERGVTGYWYERGR ----------------- >Replicase polyprotein 1ab; SWP:P59641; PDB:2AHME; LKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRK ------------------------------------------------------------ LDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYGTYKNTCDGNTFTYASALWEIQ ------------1111---------2222-------33333333-------%%%%----- QVVDADSKIVQLSEINMDNSPNLAWPLIVTALRAN ---1111---3333-33331111------------ >THAUMATIN-LIKE PROTEIN; SWP:P50694; PDB:2AHNA; ATISFKNNCPYMVWPGTLTSDQKPQLSTTGFELASQASFQLDTPVPWNGRFWARTGCSTD -------------------%%%%----------2222----------------------1 ASGKFVCATADCASGQVMCNGNGAIPPATLAEFNIPAGGGQDFYDVSLVDGFNLPMSVTP 111---------------iiii-------------------------1111--------- QGGTGDCKTASCPANVNAVCPSELQKKGSDGSVVACLSACVKFGTPQYCCTPPQNTPETC --------------3333--3333---1111------3333---3333-------3333- PPTNYSEIFHNACPDAYSYAYDDKRGTFTCNGGPNYAITFCP --3333------1111-----1111----------------- >TRANSLATION INITIATION FA; SWP:Q980A5; PDB:2AHOA; AWPKVQPEVNIGVVGHVDHGKTTLVQAITGIWTSGMTIKLGYAETNIGVCESCKKPEAYV -----------------------1111----------------------1111------- TEPSCKSCGSDDEPKFLRRISFIDAPGHEVLMATMLSGAALMDGAILVVAANEPFPQPQT ---------------------------333333333333-----------------3333 REHFVALGIIGVKNLIIVQNKVDVVSKEEALSQYRQIKQFTKGTWAENVPIIPVSALHKI --------------------3333------------------------------3333-- NIDSLIEGIEEYIKTPYRDLSQKPVMLVIRSFDVNKPGTQFNELKGGVIGGSIIQGLFKV -----------------------------------22223333---------------22 DQEIKVLPGLRVVSYEPIFTKISSIRFGDEEFKEAKPGGLVAIGTYLDPSLTKADNLLGS 22---------------------------------------------3333-----2222 IITLADAEVPVLWNIRIKYNLLERVVGAKEMLKVDPIRAKETLMLSVGSSTTLGIVTSVK -------------------------------------2222-----!!!!---------- KDEIEVELRRPVAVWSNNIRTVISRQIAGRWRMIGWGLVEI 
----------------------------------------- >Translation initiation fa; SWP:Q97Z79; PDB:2AHOB; MIYSRSKLPSEGEILIATVKQVFDYGSYVSLDEYGGLQAFLPWSEVSNIRDVLKENRKVI ---------2222--------------------%%%%----3333--3333--------- VKVIRVDRRKGTVDVSLKKVTDDERRKKNLQWKKIQRLDKILELVSQKLKLSEKDAWEQV --------------------3333-1111------------------------------- AWKLEAKYGDPITAIEKAVKEGEKILIDAGVPEIWVKPLLEEASKHAEERKVKMSGLITV --3333---3333------------3333------3333--------------------- RTNEPLGVEKIKEVISKALENIEQDYESLLNIKIYTIGAPRYRVDVVGTNPKEASEALNQ ------------3333---------------------------------1111------- IISNLIKIGKEENVDISVV ------------------- >GENERAL CONTROL PROTEIN G; SWP:P03069; PDB:2AHPA; RMKQLEDKVEELLKNYHLENEVARLKKLVGER ----------------------------1111 >RNA POLYMERASE SIGMA FACT; SWP:O66858; PDB:2AHQA; TYSLRTFFVRESAEGLTQGELMKLIKEIVENEDKRKPYSDQEIANILKEKGFKVARRTVA --------------------------1111--1111--3333------------------ KYREMLG ------- >PUTATIVE PYRROLINE CARBOX; SWP:Q9A1S9; PDB:2AHRA; AKIGIIGVGKASAIIKGLKQTPHELIISGSSLERSKEIAEQLALPYASHQDLIDQVDLVI ----------3333-3333---------------------------------1111---- LGIKPQLFETVLKPLHFKQPIISAAGISLQRLATFVGQDLPLLRIPNNAQILQSSTALTG ---1111-3333-----------------------------------3333--------- NALVSQELQARVRDLTDSFGSTFDISEKDFDTFTALAGSSPAYIYLFIEALAKAGVKNGI 11113333-----------------3333--------------------------1111- PKAKALEIVTQTVLASASNLKTSSQSPHDFIDAICSPGGTTIAGLELERLGLTATVSSAI -----------------------------------22223333----1111--------- DKTIDKAKSL ---------- >PUTATIVE ENZYME YDIF; SWP:Q8X5X6; PDB:2AHUA; VKPPRINGRVPVLSAQEAVNYIPDEATLCVLGAGGGILEATTLITALADKYKQTQTPRNL -----iiii---------11112222---------22223333----------------- SIISPTGLGDRADRGISPLAQEGLVKWALCGHWGQSPRISDLAEQNKIIAYNYPQGVLTQ -------------!!!!---2222--------3333------1111-------------- TLRAAAAHQPGIISDIGIGTFVDPRQQGGKLNEVTKEDLIKLVEFDNKEYLYYKAIAPDI ----1111------2222-3333---iiii-3333---------%%%%------------ AFIRATTCDSEGYATFEDEVYLDALVIAQAVHNNGGIVQVQKVKKATLHPKSVRIPGYLV 
--------1111---1111------------1111--------2222-1111---3333- DIVVVDPDQSQLYGGAPVNRFISGDFTLDLPLNQRKLVARRALFERKGAVGNVGVGIADG -----1111---------3333------------------3333-2222------1111- IGLVAREEGCADDFILTVETGPIGGITSGANVNTRAILDTSQFDFYHGGGLDVCYLSFAE ---------1111----1111------------------------1111----------- VDQHGNVGVHKFNGKIGTGGFIDISATSKKIIFCGTLTAGSLKTEIADGKLNIVQEGRVK -1111------iiii----3333-----------------------%%%%---------- KFIRELPEITFSGKIALERGLDVRYITERAVFTLKEDGLHLIEIAPGVDLQKDILDKDFT ----------------1111------1111----1111------22223333-3333--- PVISPELKLDERLFIDAAGFVLPEA ---1111--3333------------ >RECEPTOR TYROSINE-PROTEIN; SWP:Q15303; PDB:2AHXA; QSVCAGTENKLSSLSDLEQQYRALRKYYENCEVVMGNLEITSIEHNRDLSFLRSVREVTG ------------------------------------------------3333-------- YVLVALNQFRYLPLENLRIIRGTKLYEDRYALAIFLNYRKDGNFGLQELGLKNLTEILNG -------------1111-------2222----------------------1111------ GVYVDQNKFLCYADTIHWQDIVRNPWPSNLTLVSTGCGRCHKSCTGRCWGPTENHCQTLT ------1111--33333333------3333----------3333-------1111----- RTVCAEQCDGRCYGPYVSDCCHRECAGGCSGPKDTDCFACMNFNDSGACVTQCPQTFVYN 11113333-------3333--1111-------------------iiii------------ PTTFQLEHNFNAKYTYGAFCVKKCPHNFVVDSSSCVRACPSSKMEVEENGIKMCKPCTIC --------1111---!!!!--------------------1111----%%%%--------- PKACDGIGTGSLMSAQTVDSSNIDKFINCTKINGNLIFLVTGIHGDPYNAIEAIDPEKLN -----2222---------1111---2222---------3333----1111----3333-1 VFRTVREITGFLNIQSWPPNMTDFSVFSNLVTIGGRVLYSGLSLLILKQQGITSLQFQSL 111--------------1111--1111-----------------------------1111 KEISAGNIYITDNSNLCYYHTINWTTLFSTINQRIVIRDNRKAENCTAEGMVCNHLCSSD ------------1111-3333---3333-------------33331111----1111--- GCWGPGPDQCLSCRRFSRGRICIESCNLYDGEFREFENDSICVECDPQCEKMEDGLLTCH -----------------!!!!----------------iiii----1111---iiii---- GPGPDNCTKCSHFKDGPNCVEKCPDGLQGANSFIFKYADPDRECHPCHPNCTQGCNGPTS --------------!!!!----------1111------1111-----1111-------11 HDCIYYPWTGH 11--------- >HYPOTHETICAL PROTEIN SO16; SWP:Q8EGA7; PDB:2AI4A; 
GLAQFIKVNVTLENGEPVFIYTDANGQVCQGDITVTQAGTITYLLNDQTLKGLKFVGVGF ------------iiii------1111---------------------------------- VTPFDGIIDAVTISSDGLVQLVDLDKTPGTTKFQFVLSNTANTLLVLSPD -1111--------1111--------------------------------- >BETA-ELICITIN CINNAMOMIN; SWP:P15569; PDB:2AIBA; TACTATQQTAAYKTLVSILSESSFSQCSKDSGYSMLTATALPTNAQYKLMCASTACNTMI ---------------3333--------------1111----------------------- KKIVALNPPDCDLTVPTSGLVLDVYTYANGFSSKCASL ---1111---------------------------1111 >GUANINE NUCLEOTIDE EXCHAN; SWP:Q63K41; PDB:2AICA; TGDAKQAIRHFVDEAVKQVAHARTPEIRQDAEFGRQVYEATLCAIFSEAKDRFCMDPATR -3333-------------------3333-----------------------3333-%%%% AGNVRPAFIEALGDAARATGLPGADKQGVFTPSGAGTNPLYTEIRLRADTLMGAELAARP ---33333333------------------------------------------------- EYRELQPYARQQAIDLVANALPAERSNTLVEFRQTVQTLEATYRRAAQDASRDEKGATNA --3333----------------3333--------------------------1111---- ADGA ---- >Peptide deformylase; SWP:Q8DP79; PDB:2AIEP; SAIERIVKAAHLIDMCDIIREGNPSLRTVAEEVTFPLSDQEIILGEKMMQFLKHSQDPVM -------1111--3333-----3333---------------------------------- AEKMGLRGGVGLAAPQLDISKRIIAVLVPNIEAYDLEAIMYNPKIVSHSVQDAALGEGEG ------------3333--------------------------------------1111-- CLSVDRNVPGYVVRHARVTVDYFDKDGEKHRIKLKGYNSIVVQHEIDHINGIMFYDRINE 1111-------------------1111--------------------1111-3333---- KDPFAVKDGLLILE ------2222---- >RIBOSOMAL PROTEIN L7A; SWP:Q5CWT1; PDB:2AIFA; FPLASPDLNNKIINLVQQACNYKQLRKGANEATKALNRGIAEIVLLAADAEPLEILLHLP ------3333---------1111-----------------------1111-3333----- LVCEDKNTPYVFVRSKVALGRACGVSRPVIAAAITSKDGSSLSSQITELKDQIEQ ---1111-------------1111------------2222----------1111- ------------------------------------------------------------ ------- >METALLO-BETA-LACTAMASE L1; SWP:P52700; PDB:2AIOA; EVPLPQLRAYTVDASWLQPMAPLQIADHTWQIGTEDLTALLVQTPDGAVLLDGGMPQMAS ------------3333---------1111------------------------------- HLLDNMKARGVTPRDLRLILLSHAHADHAGPVAELKRRTGAKVAANAESAVLLARGGSDD ------1111-3333---------1111--------------------------iiii-- 
LHFGDGITYPPANADRIVMDGEVITVGGIVFTAHFMAGHTPGSTAWTWTDTRNGKPVRIA ------------------2222---iiii----------------------iiii----- YADSLSAPGYQLQGNPRYPHLIEDYRRSFATVRALPCDVLLTPHPGASNWDYAAGARAGA --------------1111-------------------------3333---11111111-- KALTCKAYADAAEQKFDGQLAKETAG -------------------------- >PROTEIN C ACTIVATOR; SWP:P09872; PDB:2AIQA; VIGGDECNINEHRFLALVYANGSLCGGTLINQEWVLTARHCDRGNMRIYLGMHNLKVLNK -------11111111----------------------3333------------1111-11 DALRRFPKEKYFCLNTRNDTIWDKDIMLIRLNRPVRNSAHIAPLSLPSNPPSVGSVCRIM 11-----------------------------------1111----------2222----- GWGTITSPNATLPDVPHCANINILDYAVCQAAYKGLAATTLCAGILEGKDTCKGDSGGPL ------------------------3333-----------------------2222----- ICNGQFQGILSVGGNPCAQPRKPGIYTKVFDYTDWIQSIISGNTDATCPP ---------------------------3333------------------- >Aspartate carbamoyltransf; SWP:P0A7F3; PDB:2AIRB; MTHDNKLQVEAIKRGTVIDHIPAQIGFKLLSLFKLTETDQRITIGLNLPSGEMGRKDLIK ---------------------2222-----1111-------------------------- IENTFLSEDQVDQLALYAPQATVNRIDNYEVVGKSRPSLPERIDNVLVCPNSNCISHAEP -----------1111--2222-----------------------------11113333-- VSSSFAVRKRANDIALKCKYCEKEFSHNVVLAN ----------------------------3333- >CYTOCHROME C, TESTIS-SPEC; SWP:P00015; PDB:2AIUA; GDAEAGKKIFVQKCAQCHTVEKGGKHKTGPNLWGLFGRKTGQAPGFSYTDANKNKGVIWS -------------3333---2222-------2222-------2222-------------- EETLMEYLENPKKYIPGTKMIFAGIKKKSEREDLIKYLKQATSS ----------33332222-------------------------- >FRAGMENT OF NUCLEOPORIN N; SWP:Q02630; PDB:2AIVA; GPNENYYISPSLDTLSSYSLLQLRKVPHLVVGHKSYGKIEFLEPVDLAGIPLTSLGGVII -----------3333------------------3333-------------3333------ TFEPKTCIIYANLPNRPKRGEGINVRARITCFNCYPVDKSTRKPIKDPNHQLVKRHIERL -------------------------------------3333----------3333----- KKNPNSKFESYDADSGTYVFIVNHAAEQT ------------1111------------- >Outer membrane protein P6; SWP:P10324; PDB:2AIZP; CSSSNNDAAGNGAAQTFGGYSVADLQQRYNTVYFGFDKYDITGEYVQILDAHAAYLNATP --------------------3333-----------------3333-------------11 
AAKVLVEGNTDERGTPEYNIALGQRRADAVKGYLAGKGVDAGKLGTVSYGEEKPAVLGHD 11------------3333---------------------3333----------------3 EAAYSKNRRAVLAY 333----------- >PROBABLE CADMIUM-TRANSPOR; SWP:Q60048; PDB:2AJ0A; MAEKTVYRVDGLSCTNCAAKFERNVKEIEGVTEAIVNFGASKITVTGEASIQQVEQAGAF ---------------------------3333----------------------------- EHLKIIPEKEA ----------- >HYPOTHETICAL UPF0301 PROT; SWP:Q9KUP8; PDB:2AJ2A; SDIEVGHSNLTNHFLVAPSKDPYFKRSVIYICEHNQDGAGLINAPIDITVGGLKQVDIEP --------------------3333----------1111----------3333-1111--- AYPQSHQENLKKPVFNGGPVSEDRGFILHRPRDHYESSKTDDIAVTTSKDILTVLGTEAE -----3333-----------1111-----------------------3333-----1111 PEGYIVALGYSGWSAGQLEVELTENSWLTIEADPELIFNTPVHEKWQKAIQKLGISPAQ -------------22223333-----------3333----3333-33333333------ >FAB M18, LIGHT CHAIN; SWP:Q6PIH7; PDB:2AJ3A; LQMTQSPSFLSASVGDRVSITCRASQDIQKFLAWYQLTPGDAPKLLMYSASTLQSGVPSR ------------2222---------------------2222------------2222333 FSGSGSGTEFTLTISGLQPEDFATYYCQHLKRYPYTFGQGTKLEISRTVAAPSVFIFPPS 3----------------1111--------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDNKDSTYSLSSTLTL 3333333---------------------iiii---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNRGE ----------------1111------------ >Immunoglobulin heavy vari; SWP:Q6GMX1; PDB:2AJ3B; VQLLESGPGVVKPSETLSLTCTVSGASVNNYYWTWVRQPPGKGLEWIGNVYDSGDTNYNP --------------------------3333--------------------1111------ SLSSRLSLSMDTSKNQFSLRLSSV --1111-----1111--------- >GALACTOKINASE; SWP:P04385; PDB:2AJ4A; VIVPEFNSSAELPRPLAEKCPSIIKKFISAYDAKPDFVARSPGRVNLIGEHIDYCDFSVL ------3333-----------------------------------------1111----- PLAIDFDMLCAVKVLNEKNPSITLINADPKFAQRKFDLPLDGSYVTIDPSVSDWSNYFKC ---------------------------3333-------1111-----3333--------- GLHVAHSFLKKLAPERFASAPLAGLQVFCEGDVPTGSGLSSSAAFICAVALAVVKANMGP -------------3333-----------------------------------------11 GYHMSKQNLMRITVVAEHYVGVNNGGMDQAASVCGEEDHALYVEFKPQLKATPFKFPQLK 11------------3333-----------------2222--------------------- 
NHEISFVIANTLVVSNKFETAPTNYNLRVVEVTTAANVLAATYGVVLLNKGNLRDFMNVY ---------------33333333-----------------1111---------------- YARYHNISTPWNGDIESGIERLTKMLVLVEESLANKKQGFSVDDVAQSLNCSREEFTRDY --------------------------------3333------------------------ LTTSPVRFQVLKLYQRAKHVYSESLRVLKAVKLMTTFTADEDFFKQFGALMNESQASCDK --------------------------------------3333------------------ LYECSCPEIDKICSIALSNGSYGSRLTGAGWGGCTVHLVPGGPNGNIEKVKEALANEFYK -----------------------------------------1111-------------33 VKYPKITDAELENAIIVSKPALGSCLYEL 331111----------------------- >HYPOTHETICAL PROTEIN MW06; SWP:NA; PDB:2AJ6A; HRTLNKDEHNYIKQIANIHETLLSQVESNYKCTKLSIALRYEICSRLEHTNDKIYIYENE ----1111---------------1111------------------1111---------%% GQLIAFIWGHFSNEKSVNIELLYVEPQFRKLGIATQLKIALEKWAKTNAKRISNT %%---------3333---------3333--------------------------- >HYPOTHETICAL PROTEIN BH36; SWP:NA; PDB:2AJ7A; KVIETKYSGKLEVAEDRLIAFDQGIPAFEDEKEFVLLPFAAGTPYYTLQSTKTVDLAFII -------------1111---11112222-----------2222---------1111---- VNPFSFFPEYRVKLPEATIAQLNITNENDVAIFSLLTVKEPFSETTVNLQAPIVINANKQ --33331111---------------3333-----------3333-----------3333- GKQLVLGDTAYNRKQPLFQKELVLAK -------------------------- >ANKYRIN REPEAT FAMILY PRO; SWP:Q5ZSV0; PDB:2AJAA; NLTIHNIENYENDPQLRLIPWILWENLFQHFISANELSLTLSYKEAIHIFLPGTKNEQVR -----3333-----------------------1111------------------------ QLLCLYYAHYNRNAKQLWSDAHKKGIKSEVICFVAAITGCSSALDTLCLLSDEIVKQAEN --------3333-1111---------3333---------3333--1111--------%%% YQAFRLAAENGHLHVLNRLCELAPTEIAIQAENYHAFRLAAENGHLHVLNRLCELAPTEA %------------------------------%%%%-----1111-3333--33331111- TAIQAENYYAFRWAAVGRGHHNVINFLLDCPVLAYAEIHEFEYGEKYVNPFIARHVNRLK ---------------!!!!3333-----------3333-----3333------------- EHDAFKLSNPDGVFDLVTKSECLQGFYLRNLIRRNDEVLLDDIRFLLSIPGIKALAPTAT -------------------------------33333333---------1111-------- IPGDANELLRLALRLGNQGACALLLSIPSVLALTK 2222----------------3333----------- >TELOMERE REPEAT-BINDING P; SWP:Q9LTI6; PDB:2AJEA; 
QRRIRRPFSVAEVEALVQAVEKLGTGRWRDVKLCAFEDADHRTYVDLKDKWKTLVHTAKI --------3333--3333-------------------1111-3333-------------- SPQQRRGEPVPQELLNRVLNAHGYWTQQQMQQLQQNV --%%%%-----3333------------iiii------ >LEUCYL-TRNA SYNTHETASE; SWP:P07813; PDB:2AJGA; EGVEITFNVNDYDNTLTVYTTRPDTFMGCTYLAVAAGHPLAQKAAENNPELAAFIDECRN ---------------------33331111-----1111----3333-------------- TKVAEAEMATMEKKGVDTGFKAVHPLTGEEIPVWAANFVLMEYGTGAVMAVPGHDQRDYE ---11111111------------------------1111-----------3333------ FASKYGLNIKPVILAADGSEPDLSQQALTEKGVLFNSGEFNGLDHEAAFNAIADKLTAMG --------------1111------------------!!!!----------------1111 VGERKV ------ >PYRIDOXAL KINASE; SWP:O00764; PDB:2AJPA; SRVLSIQSHVIRGYVGNRAATFPLQVLGFEIDAVNSVQFSNHTGYAHWKGQVLNSDELQE --------------!!!!------1111-------------1111--------------- LYEGLRLNNMNKYDYVLTGYTRDKSFLAMVVDIVQELKQQNPRLVYVCDPVLGDKWDGEG -----1111-------------------------------1111---------------- SMYVPEDLLPVYKEKVVPLADIITPNQFEAELLSGRKIHSQEEALRVMDMLHSMGPDTVV ----3333-------3333--------------------3333-------1111------ ITSSDLPSPQGSNYLIVLGSQRRRVVMERIRMDIRKVDAVFVGTGDLFAAMLLAWTHKHP ------------------------------------------------------------ NNLKVACEKTVSTLHHVLQRTIQCAKAQAGEGVRPSPMQLELRMVQSKRDIEDPEIVVQA -----------------------------------3333----3333-3333-------- TVL --- >SUGAR KINASE, PFKB FAMILY; SWP:Q9WZT5; PDB:2AJRA; HVLTVTLNPALDREIFIEDFQVNRLYRINDLSKTQSPGGKGINVSIALSKLGVPSVATGF -----------------------------1111---------------1111-------- VGGYGKILVEELRKISKLITTNFVYVEGETRENIEIIDEKNKTITAINFPGPDVTDDVNH -----------33333333-------------------1111------------------ FLRRYKTLSKVDCVVISGSIPPGVNEGICNELVRLARERGVFVFVEQTPRLLERIYEGPE ------3333----------22223333--------1111-------------------- FPNVVKPDLRGNHASFLGVDLKTFDDYVKLAEKLAEKSQVSVVSYEVKNDIVATREGVWL --------2222---iiii---3333---------------------------1111--- IRSKEEIDTSHLLGAGDAYVAGVYYFIKHGANFLEAKFGFASALAATRRKEKYPDLEAIK -------11112222----------------3333-------------------333333 KEYDHFTVERVK 33---------- >L-ARABINOSE ISOMERASE; 
SWP:P08202; PDB:2AJTA; MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEI -3333----------11113333--------------------------------3333- TAICRDANYDDPCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDF ---------3333--------------------------------------1111-3333 MNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCR ----3333---------1111--------1111------------------3333----- FGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMT ----22221111-----------------3333----1111------------------3 PATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQG 3332222------------------------------11112222--------------- YGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAV ---------------------2222--------------2222-----------3333-- EEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLP ----------------------------------------------------------11 KLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDT 11------------------------------------------------------1111 RLPAFKDALRWNEVYYGF -------------3333- >ANTIBODY 7A1 FAB'; SWP:NA; PDB:2AJUH; EVKLSESGPGLVKPSQSLSLTCTVTGYSITTNYAWTWIRQFPGNKLEWMGYIRSSVITRY ------------2222------------------------2222---------------- NPSLKSRISITQDTSKNQFFLQLNSVTTEDTATYYCARYDYYGNTGDYWGQGTSVTVSSA 3333---------1111---------3333---------1111-!!!!------------ KTTPPSVYPLAPGTAALKSSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDL ----------------------------------------iiii---------------- YTLTSSVTVPSSTWPSQTVTCNVAHPASSTKVDKKIVPR ------------------------3333----------- >ANTIBODY 7A1 FAB'; SWP:NA; PDB:2AJUL; DIVITQDELSNPVTSGESVSISCRSSRSLLYKDGRTYLNWFLQRPGQSPQLLIYLMSTRA -------------2222-------------1111---------2222------------2 SGVSDRFSGSGSGTDFTLEISRVKAEDVGVYYCQQFVEYPFTFGSGTKLEIKRADAAPTV 2223333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ----------------------------------iiii--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR 
------33331111--------1111---------- >ADENYLATE KINASE ISOENZYM; SWP:P08760; PDB:2AK3A; GASARLLRAAIMGAPGSGKGTVSSRITKHFELKHLSSGDLLRDNMLRGTEIGVLAKTFID -------------2222-----------------------------------------11 QGKLIPDDVMTRLVLHELKNLTQYNWLLDGFPRTLPQAEALDRAYQIDTVINLNVPFEVI 11--------------33331111----------------1111---------------- KQRLTARWIHPGSGRVYNIEFNPPKTMGIDDLTGEPLVQREDDRPETVVKRLKAYEAQTE ----------1111---3333----2222----------3333----------------- PVLEYYRKKGVLETFSGTETNKIWPHVYAFLQTKLPQRSQETSVTP ------------------1111--------3333------------ >HLA-B35 VARIANT; SWP:NA; PDB:2AK4D; QKVTQAQTEISVVEKEDVTLDCVYETRDTTYYLFWYKQPPSGELVFLIRRNSFDEQNEIS ------------2222----------------------3333--------1111------ GRYSWNFQKSTSSFNFTITASQVVDSAVYFCALSGFYNTDKLIFGTGTRLQVFPNIQNPD --------1111---------3333----------------------------------- PAVYQLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWS ----------------------1111------1111----------1111---------- NKSDFACANAFNNSIIPEDTFFPS -11113333-1111---------- >Dynamin-1; SWP:P21575; PDB:2AKAB; MEDLIPLVNRLQDAFSAIGQNADLDLPQIAVVGGQSAGKSSVLENFVGRDFLPRGSGIVT 1111----------1111--1111---------11113333------------------- RRPLVLQLVNSTTEYAEFLHCKGKKFTDFEEVRLEIEAETDRVTGTNKGISPVPINLRVY -----------------1111--------------------------------------- SPHVLNLTLVDLPGMTKVPVGDQPPDIEFQIRDMLMQFVTKENCLILAVSPANSDLANSD -----------------------1111----------1111-------------3333-- ALKIAKEVDPQGQRTIGVITKLDLMDEGTDARDVLENKLLPLRRGYIGVVNRSQKDIDGK --------1111--------1111-2222----1111----1111----------1111- KDITAALAAERKFFLSHPSYRHLADRMGTPYLQKVLNQQLTNHIRDTLPGLRNKLQSQL ----------------11111111-----------------------1111----1111 -------------------------------- >FERREDOXIN--NITRITE REDUC; SWP:P05314; PDB:2AKJA; RLEPRVEERDGFWVLKEEFRSGINPAEKVKIEKDPMKLFIEDGISDLATLSMEEVDKSKH --------iiii---1111-------------------1111------------------ NKDDIDVRLKWLGLFHRRKHHYGRFMMRLKLPNGVTTSEQTRYLASVIKKYGKDGCADVT ------------------------------2222----------------!!!!------ 
TRQNWQIRGVVLPDVPEIIKGLESVGLTSLQSGMDNVRNPVGNPLAGIDPHEIVDTRPFT ----------3333----------------------------1111-------------- NLISQFVTANSRGNLSITNLPRKWNPCVIGSHDLYEHPHINDLAYMPATKNGKFGFNLLV --------iiii-3333-------------3333--1111-------------------- GGFFSIKRCEEAIPLDAWVSAEDVVPVCKAMLEAFRDLGFRGNRQKCRMMWLIDELGMEA ----1111-----------3333-------------------1111-3333--------- FRGEVEKRMPEQVLERASSEELVQKDWERREYLGVHPQKQQGLSFVGLHIPVGRLQADEM ----3333-------------------------------2222------2222------- EELARIADVYGSGELRLTVEQNIIIPNVENSKIDSLLNEPLLKERYSPEPPILMKGLVAC ----------------------------3333------------------3333------ TGSQFCGQAIIETKARALKVTEEVQRLVSVTRPVRMHWTGCPNSCGQVQVADIGFMGCMT -----1111-------------------------------3333--3333---------- RDENGKPCEGADVFVGGRIGSDSHLGDIYKKAVPCKDLVPVVAEILINQFGAVPR -1111----------------------------3333------------------ >PHNA-LIKE PROTEIN; SWP:Q6N1A7; PDB:2AKKA; MSIEVRDCNGALLADGDNVSLIKDLKLKGSSTVLKRGTMIRGIRLTDSEDEIEGRTDKIK ------3333---2222----------2222---2222---------------------- GLVLRTEFLKKAGS -------------- >PHNA-LIKE PROTEIN PA0128; SWP:NA; PDB:2AKLA; MVSTLPPCPQCNSEYTYEDGALLVCPECAHEWSPNEAATASDDGKVIKDSVGNVLQDGDT ----------------------------------3333----------1111-------- ITVIKDLKVKGSSLVVKVGTKVKNIRLVDGDHDIDCKIDGIGAMKLKSEFVRKVGS --------1111----------------------------------1111------ >GLUTAMATE 5-KINASE; SWP:Q9PJ29; PDB:2AKOA; KRIVVKVGSHVISEENTLSFERLKNLVAFLAKLEKYEVILVTSAAISAGHTKLDIDRKNL -------3333--1111------------------------------------------- INKQVLAAIGQPFLISVYNELLAKFNKLGGQILLTGKDFDSRKATKHAKNAIDINLGILP --------------------3333----------3333---------------1111--- IINENDATAIEEIVFGDNDSLSAYATHFFDADLLVILSDIDGFYDKNPSEFSDAKRLEKI ---------3333--iiii-------1111----------------33331111------ THIKEEWLHGTGGIVTKLKAAKFLLEHNKKFLASGFDLSVAKTFLLEDKQIGGTLFE ---3333-----------------1111----------------------------- >GAMMA ENOLASE; SWP:P09104; PDB:2AKZA; SIEKIWAREILDSRGNPTVEVDLYTAKGLFRAAVPSGASTGIYEALELRDGDKQRYLGKG 
-----------1111---------1111------------1111-------1111iiii- VLKAVDHINSTIAPALISSGLSVVEQEKLDNLMLELDGTENKSKFGANAILGVSLAVCKA ----------------3333-3333-------------1111------------------ GAAERELPLYRHIAQLAGNSDLILPVPAFNVINGGSHAGNKLAMQEFMILPVGAESFRDA ---------------------------------!!!!-------------1111------ MRLGAEVYHTLKGVIKDKYGKDATNVGDEGGFAPNILENSEALELVKEAIDKAGYTEKIV -------------------3333---1111-------3333-------------1111-- IGMDVAASEFYRDGKYDLDFKSPTDPSRYITGDQLGALYQDFVRDYPVVSIEDPFDQDDW -----3333--iiii---1111--3333---------------------------1111- AAWSKFTANVGIQIVGDDLTVTNPKRIERAVEEKACNCLLLKVNQIGSVTEAIQACKLAQ ------1111------3333---------------------1111--------------1 ENGWGVMVSHRSGETEDTFIADLVVGLCTGQIKTGAPCRSERLAKYNQLMRIEEELGDEA 111--------------3333-----------------3333-------------!!!!- RFAGHNFRNPSVLHH --!!!!--------- >ENOLASE 1; SWP:P00924; PDB:2AL1A; AVSKVYARSVYDSRGNPTVEVELTTEKGVFRSIVPSGASTGVHEALEMRDGDKSKWMGKG -----------1111---------1111------------1111-------1111iiii- VLHAVKNVNDVIAPAFVKANIDVKDQKAVDDFLISLDGTANKSKLGANAILGVSLAASRA ---------------------3333--------------------1111----------- AAAEKNVPLYKHLADLSKSKTSPYVLPVPFLNVLNGGSHAGGALALQEFMIAPTGAKTFA -----------------------------------!!!!-------------1111---- EALRIGSEVYHNLKSLTKKRYGASAGNVGDEGGVAPNIQTAEEALDLIVDAIKAAGHDGK ----------------------------1111------------------------2222 VKIGLDCASSEFFKDGKYDLDFKNPNSDKSKWLTGPQLADLYHSLMKRYPIVSIEDPFAE -------3333--iiii---1111---1111----------------------------- DDWEAWSHFFKTAGIQIVADDLTVTNPKRIATAIEKKAADALLLKVNQIGTLSESIKAAQ -------------------3333---------------------1111------------ DSFAAGWGVMVSHRSGETEDTFIADLVVGLRTGQIKTGAPARSERLAKLNQLLRIEEELG --1111--------------3333-----------------3333-------------!! 
DNAVFAGENFHHGDKL !!---!!!!--3333- >TUG LONG ISOFORM; SWP:Q8VBT9; PDB:2AL3A; SAVSVLAPNGRRHTVKVTPSTVLLQVLEDTCRRQDFNPSEYDLKFQRTVLDLSLQWRFAN ------1111-------1111---------------3333--------------3333-- LPNNAKLEMVPVSRSR ---------------- >FOCAL ADHESION KINASE 1; SWP:Q00944; PDB:2AL6A; GAMERVLKVFHYFENSSEPTTWASIIRHGDATDVRGIIQKIVDCHKVKNVACYGLRLSHL -----------------1111-------11113333------1111--1111------11 QSEEVHWLHLDMGVSNVREKFELAHPPEEWKYELRIRYLPKGFLNQFTEDKPTLNFFYQQ 11------1111--------3333-3333------------------------------- VKNDYMLEIADQVDQEIALKLGCLEIRRSYGEMRGNALEKKSNYEVLEKDVGLRRFFPKS --------1111-----------------11111111--------------3333----- LLDSVKAKTLRKLIQQTFRQFANLNREESILKFFEILSPVYRFDKECFKCALGSSWIISV ----------------33331111-------------1111------------------- ELAIGPEEGISYLTDKGANPTHLADFNQVQTIQYSNSEDKDRKGMLQLKIAGAPEPLTVT ------------------------3333---------------------2222------- APSLTIAENMADLIDGYCRLVNGATQSFIIRPQTDDYAEIIDE -------------------1111----------1111------ >ADP-RIBOSYLATION FACTOR-L; SWP:Q9NVJ2; PDB:2AL7A; SKEEMELTLVGLQYSGKTTFVNVIAVGFNMRKVTKGNVTIKIWDIGGQPRFRSMWERYCR -----------2222-------------------iiii---------333311113333- GVNAIVYMIDAADREKIEASRNELHNLLDKPQLQGIPVLVLGNKRDLPNALDEKQLIEKM ---------11111111------------3333----------3333------------- NLSAIQDREICCYSISCKEKDNIDITLQWLIQHSK 3333------------1111----------1111- >STRUCTURAL POLYPROTEIN (P; SWP:P03315; PDB:2ALAA; YEHSTVMPNVVGFPYKAHIERPGYSPLTLQMQVVETSLEPTLNLEYITCEYKTVVPSPYV ---------2222----------------------------------------------- KCCGASECSTKEKPDYQCKVYTGVYPFMWGGAYCFCDSENTQLSEAYVDRSDVCRHDHAS --------------------------1111----------------------3333---- AYKAHTASLKAKVRVMYGNVNQTVDVYVNGDHAVTIGGTQFIFGPLSSAWTPFDNKIVVY -----------------------------------%%%%--------------------- KDEVFNQDFPPYGSGQPGRFGDIQSRTVESNDLYANTALKLARPSPGMVHVPYTQTPSGF ----------2222-2222----------------------------------------- KYWLKEKGTALNTKAPFGCQIKTNPVRAMNCAVGNIPVSMNLPDSAFTRIVEAPTIIDLT 
---------3333--%%%%-----------------------3333--3333-------- CTVATCTHSSDFGGVLTLTYKTNKNGDCSVHSHSNVATLQEATAKVKTAGKVTLHFSTAS ---------------------------------3333----------------------- ASPSFVVSLCSARATCSASCEPPK --------!!!!------------ >PROTEIN DISULFIDE-ISOMERA; SWP:Q5RDG4; PDB:2ALBA; SDVLELTDDNFESRISDTGSAGLMLVEFFAPWCGHCKRLAPEYEAAATRLKGIVPLAKVD ------3333111111111111-----------3333------------1111------3 CTANTNTCNKYGVSGYPTLKIFRDGEEAGAYDGPRTADGIVSHLKKQAGPASV 333-3333--------------iiii-----------------3333--2222 >NHP2/L7AE FAMILY PROTEIN ; SWP:P39990; PDB:2ALEA; MSAPNPKAFPLADAALTQQILDVVQQAANLRQLKKGANEATKTLNRGISEFIIMAADCEP ----1111----------------------------------------------1111-3 IEILLHLPLLCEDKNVPYVFVPSRVALGRACGVSRPVIAASITTNDASAIKTQIYAVKDK 3333333-------------------------------------1111------------ IETLLILEHHHH ------------ >NON-SPECIFIC LIPID TRANSF; SWP:NA; PDB:2ALGA; MITCGQVSSSLAPCIPYVRGGGAVPPACCNGIRNVNNLARTTPDRQAACNCLKQLSASVP ----------3333----------3333---------------------------1111- GVNPNNAAALPGKCGVSIPYKISASTNCATVK ----------------------33333333-- >HYPOTHETICAL PROTEIN PA28; SWP:Q9I042; PDB:2ALIA; QLLHTAHIPVRWGDDSYGHVNNTLYFQYLEEARVAWFETLGIDLEGAAEGPVVLQSLHTY ----------11113333--3333-------------1111------------------- LKPVVHPATVVVELYAGRLGTSSLVLEHRLHTLEDPQGTYGEGHCKLVWVRHAENRSTPV -------------------------------3333------------------------- PDSIRAAIA --------- >ALDEHYDE REDUCTASE; SWP:P14550; PDB:2ALR; AASCVLLHTGQKMPLIGLGTWKSEPGQVKAAVKYALSVGYRHIDCAAIYGNEPEIGEALK ------3333-------------3333------------------3333----------- EDVGPGKAVPREELFVTSKLWNTKHHPEDVEPALRKTLADLQLEYLDLYLMHWPYAFERG ---------3333-------1111-3333------------------------------- DNPFPKNADGTICYDSTHYKETWKALEALVAKGLVQALGLSNFNSRQIDDILSVASVRPA ------1111-------------------------------------------------- VLQVECHPYLAQNELIAHCQARGLEVTAYSPLGVLLEEPVVLALAEKYGRSPAQILLRWQ ------1111-3333--------------11113333----------------------- VQRKVICIPKSITPSRILQNIKVFDFTFSPEEMKQLNALNKNWRYIVPMLTVDGKRVPRD 
------------------1111-------------1111------------iiii----1 AGHPLYPFNDPY 111--------- >UDP-N-acetylmuramoylalani; SWP:Q8DNV6; PDB:2AM1A; KLTIHEIAQVVGAKNDISIFEDTQLEKAEFDSRLIGTGDLFVPLKGARDGHDFIETAFEN ----------------3333----------3333-2222---------3333----3333 GAAVTLSEKEVSNHPYILVDDVLTAFQSLASYYLEKTTVDVFAVTGSNGKTTTKDLAHLL ---------------------------------------------------3333----- STRYKTYKTQGNYNNEIGLPYTVLHPEGTEKLVLEGQDHLGDIHLLSELARPKTAIVTLV -------------------------2222---------2222------------------ GEAHLAFFKDRSEIAKGKQIADGASGSLLLAPADPIVEDYLPIDKKVVRFGQGAELEITD ------------------1111-2222------33331111---------2222------ LVERKDSLTFKANFLEQALDLPVTGKYNATNAIASYVALQEGVSEEQIRLAFQHLELTRN -----------1111-----------------------1111--------3333------ RTEWKKAANGADILSDVYNANPTAKLILETFSAIPANEGGKKIAVLADKELGDQSVQLHN --------------------3333-----1111------------------1111----- QILSLSPDVLDIVIFYGEDIAQLAQLASQFPIGHVYYFKKTEDQDQFEDLVKQVKESLGA -11113333--------1111---------2222------------------------11 HDQILLKGSNSNLAKLVESLEN 11-------------------- >SEPTUM FORMATION PROTEIN ; SWP:Q382A9; PDB:2AMHA; EEIRTMIIGTSSAFRANVLREHFGDRFRNFVLLPPDIDEKAYRAADPFELTESIARAKMK -------------------------------------3333----3333----------- AVLEKARQHPAIALTFDQVVVKGDEVREKPLSTEQCRSFIASYSGGGVRTVATYALCVVG ----3333-------------!!!!--------------------------------222 TENVLVAHNETETFFSKFGDDIVERTLERGACMNSAGGLVVEDEDMSRHVVRIVGTSYGV 2----------------------------3333------1111-3333------------ RGMEPAVVEKLLSQL ---3333----1111 >CALTRACTIN; SWP:P05434; PDB:2AMIA; RVGLTEEQKQEIREAFDLFDTDGSGTIDAKELKVAMRALGFEPKKEEIKKMISEIDKDGS -------------------3333------------------------------------- GTIDFEEFLTMMTAKM ---------------- >MODULATOR OF DRUG ACTIVIT; SWP:P0AEY7; PDB:2AMJA; SSNILIINGAKGQLNDTLTEVADGTLRDLGHDVRIVRADSDYDVKAEVQNFLWADVVIWQ --------------------------1111------------------------------ PGWWGAPWTVKKYIDDVFTEGHGTLYASDGRKYGSGGLVQGKKYLSLTWNAPEAFTEKDQ --------------------2222-------2222---2222----------11111111 
FFHGVGVDGVYLPFHKANQFLGEPLPTFIANDVIKPDVPRYTEEYRKHLVEIFG ------3333--------1111-------------------------------- >SIS DOMAIN PROTEIN; SWP:NA; PDB:2AMLA; HPTTYINEEEECRVILADFQTNAEKLESLVKNGAKEWLILATGSSLNAAQSAKYYIENLA ----3333---------------------1111--------!!!!--------------- DVRITIEEPFNHLYYEKLSSHLDLVIGISQSGQSTSTISALERVKKEASVPVVALTSDVT ------------------1111------3333-------------------------111 SEIAEFADITLDIGSGKERVGYVTKGFTATVLTLLTGLHFAYKTVQIDETRFNNEISAFS 13333------------------------------------------------------- RAIDAIPATIAETEAFYERWQEEFATAPKFTAIGYGPTVGTCKEFETKFSETVRVPSQGL -------------------33331111--------1111--------------------- DLEAFHGPYLEVNPQHRIFFLETASAVTERLVLLRDYESKYTPFTYTVKFGKGEDDRTLV 3333--3333--1111---------------------3333------------------- IPTDLDEYQAPFLILPFQILAHHIAELKGNKLTERIYTDFGVAKSKTKPGDYA -----33333333-----------------1111--1111------------- >PUTATIVE SUPEROXIDE REDUC; SWP:Q9WZC6; PDB:2AMUA; HMKLSDFIKTEDFKKEKHVPVIEAPEKVKKDEKVQIVVTVGKEIPHPNTTEHHIRWIKVF --3333-----33331111---------2222----------------3333-------- FQPDGDPYVYEVGRYEFNAHGESVQGPNIGAVYTEPTVTTVVKLNRSGTIIALSYCNIHG --2222----------------1111---------------------------------- LWESSQKITVEE ------------ >HYPOTHETICAL PROTEIN NE21; SWP:Q82SY3; PDB:2AMWA; MQHLEAVRNILGDVLNLGERKHTLTASSVLLGNIPELDSMAVVNVITALEEYFDFSVDDD ----------------!!!!1111---------33333333---------1111---111 EISAQTFETLGSLALFVEHKLSH 1-1111----------------- >ADENOSINE DEAMINASE; SWP:Q7RMV2; PDB:2AMXA; GLVPRGSEIKFLKKEDVQNIDLNGMSKKERYEIWRRIPKVELHCHLDLTFSAEFFLKWAR ---1111------1111---3333---------1111-------3333------------ KYNLQPNMSDDEILDHYLFTKEGKSLAEFIRKAISVSDLYRDYDFIEDLAKWAVIEKYKE ----1111----------------------------1111-------------------- GVVLMEFRYSPTFVSSSYGLDVELIHKAFIKGIKNATELLNNKIHVALICISDTGHAAAS -------------------------------------1111---------------1111 IKHSGDFAIKHKHDFVGFDHGGREIDLKDHKDVYHSVRDHGLHLTVHAGEDATLPNLNTL 1111------3333------------1111-------1111---------1111--3333 
YTAINILNVERIGHGIRVSESDELIELVKKKDILLEVCPISNLLLNNVKSMDTHPIRKLY ----------------3333-----------------------------3333------1 DAGVKVSVNSDDPGMFLSNINDNYEKLYIHLNFTLEEFMIMNNWAFEKSFVSDDVKSELK 111---------------3333-------------------------------------- ALYF ---- >PHOSPHOMANNOMUTASE 2; SWP:O15305; PDB:2AMYA; PGPALCLFDVDGTLTAPRQKITKEDDFLQKLRQKIKIGVVGGSDFEKVQEQLGNDVVEKY ---------2222-----------------3333------------------1111---- DYVFPENGLVAYKDGKLLCRQNIQSHLGEALIQDLINYCLSYIAKIKLPKKRGTFIEFRN ------------iiii-----3333-----------------1111-------------- GLNVSPIGRSCSQEERIEFYELDKKENIRQKFVADLRKEFAGKGLTFSIGGQISFDVFPD ----3333-----------------------------1111-----------------22 GWDKRYCLRHVENDGYKTIYFFGDKTNDHEIFTDPRTGYSVTAPEDTRRICELLFS 22333333331111--------------3333-3333-----3333---------- >AGGLUTININ; SWP:Q9M6E9; PDB:2AMZA; DPIKFTTGSATPASYNQFIDALRERLTGGLIYGIPVLRDPSTVEKPNQYVTVELSYSDTV ------------------------------%%%%----1111-3333------------- SIQLGIDLTNAYVVAYRAGSESFFFRNAPASASTYLFTGTQQYSLPFDGNYDDLEKWAHQ -----------------!!!!--------3333--------------------------- SRQRISLGLEALRQGIKFLRSGASDDEEIARTLIVIIQMVAEAARFRYVSKLVVISLSNR 1111-----------------------------------------3333----------- AAFQPDPSMLSLENTWEPLSRAVQHTVQDTFPQNVTLINVRQERVVVSSLSHPSVSALAL -----3333-------------1111----------------------11113333---- MLFVCNP ------- >Agglutinin-1 [Precursor]; SWP:Q9M6E9; PDB:2AMZB; ICSSHYEPTVRIGGRDGLCVDVSDNAYNNGNPIILWKCKDQLEVNQLWTLKSDKTIRSKG -------------2222----2222-----------------1111----1111---iii KCLTTYGYAPGNYVMIYDCSSAVAEATYWDIWDNGTIINPKSGLVLSAESSSMGGTLTVQ i----------------3333----------1111----3333--------1111----- KNDYRMRQGWRTGNDTSPFVTSIAGFFKLCMEAHGNSMWLDVCDITKEEQQWAVYPDGSI ----3333-----------------2222--------------33331111---1111-- RPVQNTNNCLTCEEHKQGATIVMMGCSNAWASQRWVFKSDGTIYNLYDDMVMDVKSSDPS ------------------------3333------------------------------33 LKQIILWPYTGNANQMWATLF 33---------1111------ >PUTATIVE KINASE; SWP:Q5PFG7; PDB:2AN1A; 
HFKCIGIVGHTTHEMLYRWLCDQGYEVIVEQQIAHELQLKNVPTGTLAEIGQQADLAVVV --------------------1111------------------------------------ GGDGNMLGAARTLARYDINVIGINRGNLGFLTDLDPDNALQQLSDVLEGRYISEKRFLLE --3333----------------------1111--1111-------1111----------- AQVCQQRISTAINEVVLHPGKVAHMIEFEVYIDETFAFSQRSDGLIISTPTGSTAYSLSA --------------------2222-------iiii--------------1111--3333- GGPILTPSLDAITLVPMFPHTLSARPLVINSSSTIRLRFSHDLEISCDSQIALPIQEGED -----1111-----------1111-----1111-------------!!!!-----2222- VLIRRCDYHLNLIHPKDYSYFNTLSTKLGWSKKLF --------------1111----------------- >PROTEIN PARD; SWP:P22995; PDB:2AN7A; MSRLTIDMTDQQHQSLKALAALQGKTIKQYALERLFPGDADADQAWQELKTMLGNRINDG ---------3333------------3333---3333------------------------ LAGKVSTKSVGEILDEELSGDRA ----------------------- >ATP-DEPENDENT PROTEASE LA; SWP:P0A9M0; PDB:2ANEA; RIEIPVLPLRDVVVYPHMVIPLFVGREKSIRCLEAAMDHDKKIMLVAQKEASTDEPGVND --------------2222--------------------------------------1111 LFTVGTVASILQMLKLPDGTVKVLVEGLQRARISALSDNGEHFSAKAEYL ---------------1111------------------------------- >PLASMEPSIN IV; SWP:O60990; PDB:2ANLA; SENDVIELDDVANLMFYGEGEVGDNHQKFMLIFDTGSANLWVPSKKCNSIGCSTKHLYDS ----------------------1111-----------------------3333------3 SKSKSYEKDGTKVEITYGSGTVRGFFSKDLVTLGYLSLPYKFIEVTDTDDLEPLYTAAEF 333--------------------------------------------------3333--- DGILGLGWKDLSIGSIDPIVVELKNQNKIDQALFTFYLPVHDKHSGYLTIGGIEEKFYEG -------3333------3333--1111--------------------------3333--- ELTYEKLNHDLFWQVDLDVNFGKTSMEKANVIVDSGTSTITAPTSFINKFFKDLNVIKVP ---------------------------------3333-------------3333------ FLPFYITTCNNKDMPTLEFKSANNTYTLEPEYYMEPLLDIDDTLCMLYILPVDIDKNTFI ----------------------------3333---------------------------- LGDPFMRKYFTVFDYDKESIGFAVAKN ----3333------3333--------- >NEURO-ONCOLOGICAL VENTRAL; SWP:P51513; PDB:2ANRA; GSQYFLKVLIPSYAAGSIIGKGGQTIVQLQKETGATIKLSKSKDFYPGTTERVCLIQGTI ----------33333333-2222----------------------2222----------- EALNAVHGFIAEKIREPQNPDRANQVKIIVPNSTAGLIIGKGGATVKAIEQSGAWVQLSQ 
-------------------3333-----------------%%%%---------------- KPLQNRVVTVSGEPEQNRKAVELIIQKIQEDPQ ------------3333----------------- >HYPOTHETICAL PROTEIN TM05; SWP:Q9WZ29; PDB:2ANUA; TEWLLCDFHVHTNSDGHLPLGEVVDLFGKHGVDVVSITDHIVDRRTLEQRKRNGEPLGAI ---------------------------1111----------------------------- TEDKFQDYLKRLWREQKRAWEEYGILIPGVEITNNTDLYHIVAVDVKEYVDPSLPVEEIV 3333----------------------------------------------3333------ EKLKEQNALVIAAHPDRKKLSWYLWANERFKDTFDAWEIANRDDLFNSVGVKKYRYVANS ---1111--------3333--3333--1111---------!!!!---------------- DFHELWHVYSWKTLVKSEKNIEAIKEAIRKNTDVAIYLRK ---3333--------------------------------- >LYSOZYME; SWP:P09963; PDB:2ANXA; MMQISSNGITRLKREEGERLKAYSDSRGIPTIGVGHTGKVDGNSVASGMTITAEKSSELL ------------------------1111-----------iiii--2222----------- KEDLQWVEDAISSLVRVPLNQNQYDAMCSLIFNIGKSAFAGSTVLRQLNLKNYQAAADAF -----------------------------------------------1111------333 LLWKKAGKDPDILLPRRRRERALFLS 3----!!!!-1111------------ >ALDEHYDE DEHYDROGENASE; SWP:P50578; PDB:2AO0A; AASCVLLHTGQKMPLIGLGTWKSEPGQVKAAIKYALTVGYRHIDCAAIYGNELEIGEALQ ------3333---------22222222--------1111------3333----------- ETVGPGKAVPREELFVTSKLWNTKHHPEDVEPALRKTLADLQLEYLDLYLMHWPYAFERG ---2222--3333-------1111-1111------------------------------- DNPFPKNADGTIRYDATHYKDTWKALEALVAKGLVRALGLSNFSSRQIDDVLSVASVRPA ------1111-------------------1111--------------------------- VLQVECHPYLAQNELIAHCQARGLEVTAYSPLGSSDRAWRDPNEPVLLEEPVVQALAEKY ------3333---------1111------11113333--------3333----------- NRSPAQILLRWQVQRKVICIPKSVTPSRILQNIQVFDFTFSPEEMKQLDALNKNLRFIVP ------------1111--------------1111--------------1111-------- MLTVDGKRVPRDAGHPLYPFNDPY -----------1111--------- >29-KDA GALACTOSE-BINDING ; SWP:O96048; PDB:2AO3A; PKFFYIKSELNGKVLDIEGQNPAPGSKIITWDQKKGPTAVNQLWYTDQQGVIRSKLNDFA ----------------2222--2222---------3333-------1111---------- IDASHEQIETQPFDPNNPKRAWIVSGNTIAQLSDRDIVLDIIKSDKEAGAHICAWKQHGG -------------1111-------------1111------2222--2222---------- PNQKFIIESE ---------- >ADAM 10; 
SWP:Q10741; PDB:2AO7A; CCYKLKPGKQCSPSQGPCCTAHCAFKSKTEKCRDDSDCAKEGICNGITALCPASDPKPNF -----2222--3333----1111------------1111--------------------- TDCNRHTQVCINGQCAGSICEKHGLEECTCKELCHVCCMKKMEPSTCASTGSVQWNKYFL -----------------3333------------------22221111-------3333%% GRTITLQPGSPCNDFRGYCDVFMRCR %%----2222--%%%%---1111--- >PHAGE PROTEIN; SWP:Q81ES4; PDB:2AO9A; MMAKLDELKQKLTAKQIQAAYLLVENELMEEEKRTQDEMANELGINRTTLWEWRTKNQDF ------3333-------------------------------------------------- IAFKSEVADSFLAEKREQVYSKLMQLILGPQPSVKAMQLYMQRFGLLTDKKVIEGDL -----------------------------------------1111------------ >DNA MISMATCH REPAIR PROTE; SWP:P44688; PDB:2AORA; MIPQTLEQLLSQAQSIAGLTFGELADELHIPVPIDLKRDKGWVGMLLERALGATAGSKAE -------------1111---------------------1111------1111-2222--- QDFSHLGVELKTLPINAEGYPLETTFVSLAPLVQNSGVKWENSHVRHKLSCVLWMPIEGS ---1111--------1111-------------------3333-----------------3 RHIPLRERHIGAPIFWKPTAEQERQLKQDWEELMDLIVLGKLDQITARIGEVMQLRPKGA 3333333-----------------------------11113333-1111----------- NSRAVTKGIGKNGEIIDTLPLGFYLRKEFTAQILNAFLET 1111------------------------------------ >HISTAMINE N-METHYLTRANSFE; SWP:P50135; PDB:2AOTA; MRSLFSDHGKYVESFRRFLNHSTEHQCMQEFMDKKLPGIIGRIGDTKSEIKILSIGGGAG --33333333-------------------------3333--1111-----------!!!! 
EIDLQILSKVQAQYPGVCINNEVVEPSAEQIAKYKELVAKTSNLENVKFAWHKETSSEYQ -------------2222--------------------1111--1111------------- SRMLEKKELQKWDFIHMIQMLYYVKDIPATLKFFHSLLGTNAKMLIIVVSGSSGWDKLWK --1111-------------3333-----------11112222-------1111------- KYGSRFPQDDLCQYITSDDLTQMLDNLGLKYEYDLLSTMDISDCFIDGNENGDLLWDFLT -3333----------3333--------------------------2222--------111 ETNFNATAPPDLRAELGKDLQEPEFSAKKEGKVLFNNTLSFIVIEA 1----------------3333-------iiii-------------- >PHOSPHOLIPASE A2 HOMOLOG; SWP:P82950; PDB:2AOZA; NLYQLWKMILQETGKNAAPSYGFYGCNCGVGSRGKPKDATDRCCFVHKCCYKALTDCSPK -----------------3333----------------3333---------1111---333 TDSYSYSWKDKTIVCGKNNPCLKQECECDKAVAICLRDNLDTYNKNYKIYPKPLCKKADD 3------------------------------------------3333---3333------ C - >PUTATIVE REGULATOR PROTEI; SWP:NA; PDB:2AP1A; AMYYGFDIGGTKIALGVFDSTRRLQWEKRVPTPHTSYSAFLDAVCELVEEADQRFGVKGS ------------------1111-------------------------------------- VGIGIPGMPETEDGTLYAANVPAASGKPLRADLSARLDRDVRLDNDANCFALSEAWDDEF ----------1111---11111111-------------------------------3333 TQYPLVMGLILGTGVGGGLVLNGKPITGQSYITGEFGHMRLPVDALTLMGFDFPLRRCGC --------------------iiii-------2222------3333----1111----333 GQMGCIENYLSGRGFAWLYQHYYDQSLQAPEIIALWEQGDEQAHAHVERYLDLLAVCLGN 3---3333---------------------------1111--------------------- ILTIVDPDLLVIGGGLSNFTAITTQLAERLPRHLLPVARAPRIERARHGDAGGMRGAAFL -------------3333-3333---33333333-1111----------1111------11 HLTD 11-- >CONSERVED HYPOTHETICAL PR; SWP:Q8NX77; PDB:2AP3A; AHMGIQRPTSTTTDKKEIKAYLKQVDKIKDDEEPIKTVGKKIAELDEKKKKLTEDVNSKD 2222------------------------------------------------3333---- TAVRGKAVKDLIKNADDRLKEFEKEEDAIKKSEQDFKKADNIDNDVKRKEVKQLDDVLKE ----------------------------------3333---------------------- KYKLHSDYAKAYKKAVNSEKTLFKYLNQNDATQQGVNEKSKAIEQNYKKLKEVSDKYTKV ------------------------------------------------------------ LNKVQKEKQDVD ------------ >ACETYLGLUTAMATE KINASE; SWP:P0A4Y6; PDB:2AP9A; IEALPTHIKAQVLAEALPWLKQLHGKVVVVKYGGNATDDTLRRAFAADAFLRNCGIHPVV 
----3333---333333333333------------------------------------- VHGGGPQITALRRLGIEGDFKGGFRVTTPEVLDVARVLFGQVGRELVNLINAHGPYAVGI ----3333---3333-----------------------------------1111------ TGEDAQLFTAVRRSVTVDGVATDIGLVGDVDQVNTAALDLVAAGRIPVVSTLAPDADGVV --2222---------------------------------3333-----------1111-- HNINADTAAAAVAEALGAEKLLLTDIDGLYTRWPDRDSLVSEIDTGTLAQLLPTLELGVP --------------------------------------------------1111------ KVEACLRAVIGGVPSAHIIDGRVTHCVLVELFTDAGTGTKVVRGEGHHHHHH -------------------1111----------------------3333--- >PUTATIVE ESTERASE; SWP:Q8L9J9; PDB:2APJA; SPIPPNQIFILSGQNMAGRGGVFKDHHNNRWVWDKILPPECAPNSSILRLSADLRWEEAH ------------------2222---------------3333--1111---1111------ EPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERGSHLYER -1111-------------------------1111------------3333-2222----- MVKRTEESRKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLNLPSLPIIQ ----------------------1111--------------------------1111---- VAIASGGGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGLSLAQAYLS -----------------------------2222--1111--------------------- NFC --- >HYPOTHETICAL PROTEIN PG08; SWP:Q7MW33; PDB:2APLA; KSTEKKELSHFRLKLETYLNEHFPESGNNPFITARSDEALTAYCDAVAQGFSHPEAESAS ----------------------------------------------1111---------- EVLYQGLHFSRYDTLVSVLEREFEQELPSPLPERLAPILLKNKAIQSVFAKYDLTDDFEA ---2222------------------------3333------------3333---1111-- SPEYEHLYTELTGTIVLLIESNHLPTI --------------------------- >PROTEIN HI1723; SWP:P45344; PDB:2APNA; MIDDMAVPLTFTDAAANKVKSLISEEENTDLKLRVYITGGGCSGFQYGFTFDEKVNDGDL -----------------------1111--------------------------------- TIEKSGVQLVIDPMSLQYLIGGTVDYTEGLEGSRFTVNNPNATSTCGCGSSFSI ----------------------------3333------3333------------ >PROBABLE TRNA PSEUDOURIDI; SWP:Q57612; PDB:2APOA; ELIVKEEVETNWDYGCNPYERKIEDLIKYGVVVVDKPRGPTSHEVSTWVKKILNLDKAGH ----------------1111---------------------------------------- GGTLDPKVTGVLPVALERATKTIPMWHIPPKEYVCLMHLHRDASEEDILRVFKEFTGRIY ----1111-------------3333----------------------------------- 
QRRIRKIHELELLDKDGKDVLFRVKCQSGTYIRKLCEDIGEALGTSAHMQELRRTKSGCF ---------------!!!!-------2222----------1111------------!!!! EEKDAVYLQDLLDAYVFWKEDGDEEELRRVIKPMEYGLRHLKKVVVKDSAVDAICHGADV 3333----------------------------3333-1111-----3333---1111--- YVRGIAKLSKGIGKGETVLVETLKGEAVAVGKALMNTKEILNADKGVAVDVERVYMDRGT 1111--------2222-----1111----------------------------------- YPRM ---- >Ribosome biogenesis prote; SWP:P81303; PDB:2APOB; ERKKCPKCGLYTLKEICPKCGEKTVIPKPPKFSLEDRWGKYRRLKRALKNKN --------------------------------3333-3333----------- >RHIZOPUSPEPSIN; SWP:P06026; PDB:2APR; AGVGTVPMTDYGNDIEYYGQVTIGTPGKKFNLDFDTGSSDLWIASTLCTNCGSGQTKYDP -2222----------------------------------------------1111---33 NQSSTYQADGRTWSISYGDGSSASGILAKDNVNLGGLLIKGQTIELAKREAASFASGPND 331111----------1111-------------iiii-------------3333------ GLLGLGFDTITTVRGVKTPMDNLISQGLISRPIFGVYLGKAKNGGGGEYIFGGYDSTKFK ------1111--2222-----------------------3333-----------3333-- GSLTTVPIDNSRGWWGITVDRATVGTSTVASSFDGILDTGTTLLILPNNIAASVARAYGA ---------1111----------!!!!----------1111--------------1111- SDNGDGTYTISCDTSAFKPLVFSINGASFQVSPDSLVFEEFQGQCIAGFGYGNWGFAIIG -------------1111------iiii----3333-----%%%%---------------3 DTFLKNNYVVFNQGVPEVQIAPVAE 3331111------------------ >CU,ZN SUPEROXIDE DISMUTAS; SWP:P24702; PDB:2APSA; EKLVVQVQQLDPVKGNKDVGTVEITESAYGLVFTPHLHGLAQGLHGFHIHQNPSCEPKEK ------------------------------------------------------------ DGKLVAGLGAGGHWDPKETKQHGYPWSDNAHLGDLPALFVEHDGSATNPVLAPRLKKLDE -------3333---1111-----1111---1111------1111-------1111-3333 VKGHSLMIHEGGDNHSDHPAPLGGGGPRMACGVIK --------------------iiii----------- >T CELL RECEPTOR BETA CHAI; SWP:NA; PDB:2APVA; AAVTQSPRNKVAVTGEKVTLSCQQTNNHNNMYWYRQDTGHGLRLIHYSYGVGNTEKGDIP ------------2222--------------------2222---------2222------- DGYKASRPSHEQFSLILVSATPSQSSVYFCASGVGGTLYFGAGTRLSVL --------1111--------3333---------!!!!------------ >T-CELL RECEPTOR BETA CHAI; SWP:P0A0L5; PDB:2AQ2A; EAAVTQSPRNKVAVTGEKVTLSCQQTNNHNNMYWYRQDTGHGLRLIHYSYGVGNTEKGDI 
-------------2222--------------------2222---------2222------ PDGYEASRPSQEQFSLILESATPSQTSVYFCASGGGGTLYFGAGTRLSVL ---------1111--------3333---------%%%%------------ >Enterotoxin type C-3 [Pre; SWP:P0A0L5; PDB:2AQ2B; SQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNISDKKLKNY -----1111--3333-------3333-----------------1111------------- DKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDNVWWPGKTCMYGGITKHEGNHF ---------------1111----------------------2222---------2222-- DNGNLQNVLVRVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYET -------------iiii----------------------------------3333----- GYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTK ------1111-----------------------1111---3333---------- >5'-D(*AP*TP*CP*CP*TP*CP*C; SWP:P12689; PDB:2AQ4A; KRIVACDDPDFLTSYFAHSRLHHLSAWKANLKDKFLNENIHKYTKITDKDTYIIFHIDFD ----1111----------------------------1111------3333---------- CFFATVAYLCRSSSFSACDFKRDPIVVCHGTKNSDIASCNYVARSYGIKNGMWVSQAEKM -------11111111---1111---------------------1111-1111-----111 LPNGIKLISLPYTFEQFQLKSEAFYSTLKRLNIFNLILPISIDEAVCVRIIPDTLNARLC 12222------------------------------------------------------- EEIRQEIFQGTNGCTVSIGCSDSLVLARLALKMAKPNGYNITFKSNLSEEFWSSFKLDDL ------------------------------------------1111---------11112 PGVGHSTLSRLESTFDSPHSLNDLRKRYTLDALKASVGSKLGMKIHLALQGQDDEESLKI 222--------------------------------------------1111--3333333 LYDPKEVLQRKSLSIDINWGIRFKNITQVDLFIERGCQYLLEKLNEINKTTSQITLKLMR 3--3333-------------------------------------1111------------ RCKDAPIEPPKYMGMGRCDSFSRSSRLGIPTNEFGIIATEMKSLYRTLGCPPMELRGLAL -1111-----2222------------------3333---------1111-3333------ QFNKLVDV -------- >CORONIN-1A; SWP:O89053; PDB:2AQ5A; SSKFRHVFGQPAKADQCYEDVRVSQTTWDSGFCAVNPKFMALIEASGGGAFLVLPLGKTG -1111-------3333--------------------------------------1111-- RVDKNVPLVGHTAPVLDIAWPHNDNVIASGSEDCTVMVWEIPDGGLVLPLREPVITLEGH --1111----------------1111----1111-------2222--------------- TKRVGIVAWHPTAQNVLLSAGDNVILVWDVGTGAAVLTLGPDVHPDTIYSVDWSRDGALI 
------------2222-----------------------3333----------1111--- CTSCRDKRVRVIEPRKGTVVAEKDRPHEGTRPVHAVFVSEGKILTTGFSRMSERQVALWD ---1111------1111---------------------2222------1111-------1 TKHLEEPLSLQELDTSSGVLLPFFDPDTNIVYLCGKGDSSIRYFEITSEAPFLHYLSMFS 111-------------------------------2222---------------------- SKESQRGMGYMPKRGLEVNKEIARFYKLHERKCEPIAMTVPRKSDLFQEDLYPPTAGPDP -----------3333-1111---------------------------3333--------- ALTAEEWLGGRDAGPLLISLKDGYVPPKSR --------------------2222------ >PYRIDOXINE 5'-PHOSPHATE O; SWP:O06553; PDB:2AQ6A; VFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRKLLIQVSIAEPRAKTRNLRRD 3333-----1111--------1111------------1111------3333--------- PRASILVDADDGWSYAVAEGTAQLTPPAAAPDDDTVEALIALYRNIAGEHSDWDDYRQAM --------1111-----------------1111--------------------------- VTDRRVLLTLPISHVYGLPPGMR ------------------2222- >PEPTIDE INHIBITOR; SWP:P0A722; PDB:2AQ9A; MIDKSAFVHPTAIVEEGASIGANAHIGPFCIVGPHVEIGEGTVLKSHVVVNGHTKIGRDN --1111--1111--------2222--2222--1111--2222------------------ EIYQFASIGEVNQDLKYAGEPTRVEIGDRNRIRESVTIHRGTVQGGGLTKVGSDNLLMIN --2222-------3333------------------------3333------------222 AHIAHDCTVGNRCILANNATLAGHVSVDDFAIIGGMTAVHQFCIIGAHVMVGGCSGVAQD 2--2222--------2222--------2222--2222--2222--2222----------- VPPYVIAQGNHATPFGVNIEGLKRRGFSREAITAIRNAYKLIYRSGKTLDEVKPEIAELA ----------------------1111---------------------3333-------33 ETYPEVKAFTDFFARSTRGLIR 333333------1111------ >H/ACA RIBONUCLEOPROTEIN C; SWP:Q6Q547; PDB:2AQAA; HLMYTLGPDGKRIYTLKKVTESGEITKSAHPARFSPDDKYSRQRVTLKKRFGLVPGQ ------1111--------------------------33333333------------- >TRYPTOPHAN HALOGENASE, PR; SWP:P95480; PDB:2AQJA; NKPIKNIVIVGGGTAGWMAASYLVRALQQQANITLIESAAIPRIGVGEATIPSLQKVFFD --------------------------------------------------3333---333 FLGIPEREWMPQVNGAFKAAIKFVNWRKSPDPSRDDHFYHLFGNVPNCDGVPLTHYWLRK 3---33333333------------------1111-------------iiii--------- REQGFQQPMEYACYPQPGALDGKLAPCLSDGTRQMSHAWHFDAHLVADFLKRWAVERGVN 1111---3333----------------1111-----------------------1111-- 
RVVDEVVDVRLNNRGYISNLLTKEGRTLEADLFIDCSGMRGLLINQALKEPFIDMSDYLL -----------1111------1111------------3333-------------1111-- CDSAVASAVPNDDARDGVEPYTSSIAMNSGWTWKIPMLGRFGSGYVFSSHFTSRDQATAD -----------3333---------------------2222-------3333--------- FLKLWGLSDNQPLNQIKFRVGRNKRAWVNNCVSIGLSSCFLEPLESTGIYFIYAALYQLV -------1111----------------!!!!---3333--------3333---------- KHFPDTSFDPRLSDAFNAEIVHMFDDCRDFVQAHYFTTSRDDTPFWLANRHDLRLSDAIK ----3333--------------------------1111---------------------- EKVQRYKAGLPLTTTSFDDSTYYETFDYEFKNFWLNGNYYCIFAGLGMLPDRSLPLLQHR -------------------------3333------------------------3333--- PESIEKAEAMFASIRREAERLRTSLPTNYDYLRSLRD ------------------------------------- >SUPEROXIDE DISMUTASE [CU-; SWP:P15453; PDB:2AQMA; ESTTVKMYEALPTGPGKEVGTVVISEAPGGLHFKVNMEKLTPGYHGFHVHENPSCAPGEK ----------1111------------1111-----------------------------i DGKIVPALAAGGHYDPGNTHHHLGPEGDGHMGDLPRLSANADGKVSETVVAPHLKKLAEI iii-2222------1111-----1111--1111------1111-------1111-3333- KQRSLMVHVGGDNYSDKPEPLGGGGARFACGVIE -------------------iiii----------- >SUPEROXIDE DISMUTASE [CU-; SWP:P57005; PDB:2AQPA; ASIEVKVQQLDPVNGNKDVGTVTITESNYGLVFTPDLQGLSAGLHGFHIHENPSCEPKEK --------------------------1111-----------------------------i EGKLTAGLGAGGHWDPKGAKQHGYPWQDDAHLGDLPALTVLHDGTATNPVLAPRLKHLDD iii-----------1111-----1111---1111------1111-------1111-3333 VRGHSIMIHTGGDNHSDHPAPLGGGGPRMACGVIK --------------------iiii----------- >PUTATIVE OROTIDINE-MONOPH; SWP:Q7RPE4; PDB:2AQWA; HFKTKLKNRRSEVNTCLCIGLDPDEDDIKNFKNEEQNGYKNIKNNNSNNNGIENIIKIGK --------3333-----------------------------------%%%%1111---33 EILLTDGENIQNLSEEDKFFYFFNHFCFYIINNTKEYALVYKNFAFYIPYGSVGINALKN 33---11113333--------------------1111-----3333-1111--------- VFDYLNSNIPTLDKINDIGNTVKNYRKFIFEYLKSDSCTINVYGTNLKDICFDYEKNKYY -----------------3333-------------------------1111---1111--- SAYVLIKTTNKDSFIFQNELSINDKQAYIVADETQKATELKIEQNNEFIGFVVGSNAFEE ---------1111--------iiii3333--------11113333--------1111--- 
KIIRNKFPDSYILSPGIGAQNGDLYKTLKNGYNKDYEKLLINVGRAITKSPDPKKSSESY ---3333----------1111-------------3333-----3333------------- YNQIIQIFKD ---------- >PREDICTED: inositol 1,4,5; SWP:P42335; PDB:2AQXA; MVQWSPFVMSFKKKYPWIQLAGHAGSFKAAANGRILKKHCESEQRCLDRLMADVLRPFVP 3333------------------2222----%%%%------------------1111---- AYHGDVVKDGERYNQMDDLLADFDSPCVMDCKMGVRTYLEEELTKARKKPSLRKDMYQKM -----------------1111-----------------3333------------------ VEVDPEAPTEEEKAQRAVTKPRYMQWRETISSTATLGFRIEGIKKEDGSVNRDFKKTKTR ---1111-------------------1111--------------3333-----1111--- EQVTEAFREFTKGNQNILIAYRDRLKAIRATLEISPFFKCHEVIGSSLLFIHDKKEQAKV ----------%%%%----------------3333-3333-------------1111---- WMIDFGKTTPLPEGQTLQHDVPWQEGNREDGYLSGLDNLIDILTEMSQG -----------------------2222----------------3333-- >TYPE I RESTRICTION ENZYME; SWP:P08957; PDB:2AR0A; NDLVAKLWKLCDNLRDGGVSYQNYVNELASLLFLKCKETGQEAEYLPEGYRWDDLKSRIG -------------------3333------------3333-3333--222233333333-3 QEQLQFYRKLVHLGEDDKKLVQAVFHNVSTTITEPKQITALVSNDSLDWQYFTPRPLIKT 333----------------3333-2222-------------------------3333--- IIHLLKPQPREVVQDPAAGTAGFLIEADRYVKSQTNDLDDLDGDTQDFQIHRAFIGLELV -----------------!!!!--------------iiii--3333--------------- PGTRRLALNCLLHDIEGNLDHGGAIRLGNTLGSDGENLPKAHIVATNPPFGSAAGTNITR ---------1111----3333-------11113333------------------------ TFVHPTSNKQLCFQHIIETLHPGGRAAVVVPDNVLFEGGKGTDIRRDLDKCHLHTILRLP --------------------2222------3333---!!!!------------------- TGIFYAQGVKTNVLFFTKGTVANPNQDKNCTDDVWVYDLRTNPSFGKRTPFTDEHLQPFE -------------------3333----------------------3333--3333----- RVYGEDPHGLSPRTEGEWSFNAEETEVADSEENKNTDQHLATSRWRKFSREWIRTAKSDS -----1111---------%%%%-------3333---3333--------------1111-- LDISWLKDKDPEPDVLAAEAGELVQALSELDALRELGASDEADLQRQLLEEAFGGV ------------3333---------------------------------------- >HYPOTHETICAL PROTEIN; SWP:Q4Q067; PDB:2AR1A; RAEDIHYWLLKSEPHKFSIDDLAKQKTSPWDGVRNYAARNNMRAMSVGDKVLFYHSNTKE 1111--------1111-------------------------11112222----------- 
PGVAGLAEVVRLAYDDFTALDKTSEYFDPKATKEKNPWKMVDVKFVARWDTVLTLHELKS --------------------1111---11111111------------------3333--- RRELQKMALFTQRRLSVQPVSASEYAYILRMNEEQQR 3333--3333-1111---------------------- >PHOSPHOINOSITIDE 3-KINASE; SWP:Q14CQ9; PDB:2AR5A; MDGRIKEVSVFTYHKKYNPDKHYIYVVRILREGQIEPSFVFRTFDEFQELHNKLSIIFPL ------------------------------1111------------------------33 WKLPGFPNRMVLGRTHIKDVAAKRKIELNSYLQSLMNASTDVAECDLVCTFFHGSHH 33--------------------------------11113333-------1111---- >ARABINOSE OPERON REGULATO; SWP:P03021; PDB:2ARCA; DPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVC 1111----------------2222----------------------------iiii---- RPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAH 2222----2222------1111------------11111111--------------3333 QPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAI ----------------------------------------- >WILSON DISEASE ATPASE; SWP:P35670; PDB:2ARFA; AGHMVPRVMRVLLLGDVATLPLRKVLAVVGTAEASSEHPLGVAVTKYCKEELGTETLGYC ------------------------------------------------------------ TDFQAVPGCGIGCKVSNVEGILAHSERPLSAPASHLNEAGSLPAEKDAVPQTFSVLIGNR ----------------3333--------------------------------------33 EWLRRNGLTISSDVSDAMTDHEMKGQTAILVAIDGVLCGMIAIAD 33------------------------------%%%%--------- >HYPOTHETICAL PROTEIN AQ_1; SWP:NA; PDB:2ARHA; AVYEELLKTLENGINSEEGEIRLVRKSQGRFKEEFNFDLSLGSKPLLTLKVFLGRKPYWQ ---------------1111-----------1111-------------------------- PWVEVFGVNPNLRNVFFGSEAERKLYEFLSEHFGRIFVEYFEDKETTYELQKGVPPALSR --------1111---2222---------1111-------1111------1111-3333-- LGFELLKLGYTYFRDWFIPEGLEGGHKIQAEKPKTEEAKKRHLENLKKEFEEFIGKCEDE -----1111---------1111-------------------------------------- GLIKKVKERYNFLE -------------- >T-cell surface glycoprote; SWP:P01731; PDB:2ARJH; QVQLKESGPGLVQPSQTLSLTCTVSGFSLTSNSVHWVRQPPGKGLEWMGGIWGDGDTDYN ---------------------------3333--------2222--------1111----3 SALKSRLSISRDTSKNQVFLKMNSL 333--------1111---------- >T-cell surface glycoprote; SWP:P01731; PDB:2ARJL; 
DIVMTQSPSSLAVSAGERVTLNCKASQNVRNNIAWYQQKPGQSPKLLIYYASYRYTGVPD ----------------------------!!!!------2222------------222233 RFTGDGFGTDFTLAINSVQADDAAFYYCQRIYNSPYTFGAGTKLELIRADAAPTVSIFPP 33----------------3333-------------------------------------- SMEQLTSGGASVVCFVNNFYPRDISVKWKIDGSEQRDGVLDSVTDQDSKDSTYSMSSTLS 33331111---------------------iiii--2222--------------------- LTKVEYERHNLYTCEVVHKTSSSPVVKSFNR -3333------------1111---------- >FLAVODOXIN; SWP:FLAV_AQUAE; PDB:2ARKA; AGKVLVIYDTRTGNTKKAELVAEGARSLEGTEVRLKHVDEATKEDVLWADGLAVGSPTNG ------------3333-----------2222-----3333-------------------- LVSWKKRFFDDVLGDLWGEIDGKIACAFSSSGGWGGGNEVACSILTLNFGFLVFGVTDYV ------------3333----------------22223333-------------------- GKKFTLHYGAVVAGEPRSEEEKEACRRLGRRLAEWVAIFVDGRKELLEKIRKDPARFV ------------------------------------------3333------3333-- >GFP-LIKE NON-FLUORESCENT ; SWP:P83690; PDB:2ARLA; GSVIATQMTYKVYMSGTVNGHYFEVEGDGKGKPYEGEQTVKLTVTKGGPLPFAWDILSPQ -----------------iiii----------1111---------------------1111 CSIPFTKYPEDIPDYVKQSFPEGFTWERIMNFEDGAVCTVSNDSSIQGNCFTYHVKFSGL -3333---1111-3333--------------1111-----------!!!!---------- NFPPNGPVMQKKTQGWEPSSERLFARGGMLIGNNFMALKLEGGGHYLCEFKTTYKAKKPV --1111-------------------iiii----------2222----------------- KMPGYHYVDRKLDVTNHNKDYTSVEQCEISIARKPVVA -----------------1111----------------- >INHIBIN BETA A CHAIN; SWP:P08476; PDB:2ARPA; GLECDGNICCKKQFFVSFKDIGWNDWIIAPSGYHANYCEGECPSLSFHSTVINHYRMRGH ----------------3333--------------------------------11112222 SPFANLKSCCVPTKLRPMSMLYYDDGQNIIKKDIQNMIVEECGCS -----------------------1111------------------ >PLASMINOGEN ACTIVATOR INH; SWP:P05120; PDB:2ARRA; DLCVANTLFALNLFKHLAKASPTQNLFLSPWSISSTMAMVYMGSRGSTEDQMASVLQFNE -----------------3333-------------------1111------------3333 VKIHSSFRSLSSAINASTGNYLLESVNKLFGEKSASFREEYIRLCQKYYSSEPQAVDFLE -------------------------------1111------------------------- CAEEARKKINSWVKTQTKGKIPNLLPEGSVDGDTRMVLVNAVYFKGKWKTPFEKKLFPFR 
--------------1111-------2222-1111-------------------------- VNSAQRTPVQMMYLREKLNIGYIEDLKAQILELPYAGDVSMFLLLPDADVSTGLELLESE -1111-----------------3333---------------------------------- ITYDKLNKWTSKMAEDEVEVYIPQFKLEEHYELRSILRSMGMEDAFNKGRANFSGMSERN -------------------------------------1111-3333------33333333 DLFLSEVFHQAMVDVNEEGTTGRTGHGGPQFVADHPFLFLIMHKITNCILFFGRFSSP -------------------------------------------1111----------- >LIPOATE-PROTEIN LIGASE A; SWP:Q9HKT1; PDB:2ARSA; EGRLLLLETPGNTRSLAYDEAIYRSFQYGDKPILRFYRHDRSVIIGYFQVAEEEVDLDYK --------2222----------33332222---------------11111111--33331 KNGILARRYTGGGAVYHDLGDLNFSVVRSSDDDITSFRTNEAVVNSLRILGLDARPGELN 111--------------1111-----------3333-----------1111--------- DVSIPVNKKTDIAGEKKIGAAGARKGAKLWHAALVHTDLDLSAVLKSTRERVANVTDFVD 3333---1111------------2222-------------------3333---1111--- VSIDEVRNALIRGFSETLHIDFREDTITEKEESLARELFDKKYSTEEWNGLL ---------------1111----------------------11113333--- >HYPOTHETICAL PROTEIN PA43; SWP:NA; PDB:2ARZA; SVEAAKNARELLLKEYRAVLSTHSKKWPGFPFGSVVPYCLDAEGRPLILISRIAQHTHNL --------------------------2222----------1111------1111------ QADPRCSMLVGERGAEDIQAVGRLTLLAEARQLAEEEVAAAAERYYRYFPESADYHRVHD ----------------1111-------------3333--------3333----------- FDFWVLQPVQWRFIGGFGAIHWLAAERVPLANPFAGEAERGMVEHMNSDHAAAIAHYVEL --------------1111-----1111----1111------------------------- AGLPAHAAAQLAGIDTEGFHLRIGQGLHWLPFPAACGNPGAVRQALVQLARAERWPTV --------------1111---------------------------------------- >HYPOTHETICAL PROTEIN PH19; SWP:O59578; PDB:2AS0A; ARVVVDAQAARAIGKGAIVFKKGVVRVEGDIKPGDIVEVYTRGGKFLGKGFANPNSNIVR ------------1111---3333--------2222-----1111--------1111---- IVTKDKDVEINKDLFKRRIKKANEYRKKVLKYTNVYRVYGEADYLPGLIVDRFNDIASLQ ----1111--------------------------------11112222----!!!!---- ISSAGERFKLDVAEAIEVEPGIETVFEKNTGRSRRREGLPEIERVLLGKEKYRTIIQEGR ------------------3333------------1111-------------------!!! 
AKFIVDRGQKTGFFLDQRENRLALEKWVQPGDRVLDVFTYTGGFAIHAAIAGADEVIGID !-------------1111-----3333-2222------!!!!------1111-------- KSPRAIETAKENAKLNGVEDRKFIVGSAFEEEKLQKKGEKFDIVVLDPPAFVQHEKDLKA -------------11113333-----3333-----------------------3333--- GLRAYFNVNFAGLNLVKDGGILVTCSCSQHVDLQFKDIIAAGAKAGKFLKLEPYRTQAPD ------------11112222-------3333--------------------------111 HPILASKDTEYLKCLFLYVEDR 1----3333------------- >SERINE PROTEASE; SWP:Q53782; PDB:2AS9A; MEKNVTQVKDTNNFPYNGVVSFKDATGFVIGKNTIITNKHVSKDYKVGDRITAHPNGDKG ---------------------------------------------2222------!!!!- NGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAKDAKVDDKIKVI -------------------------------1111-3333-----------2222----- GYPLPAQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNNEVIGVVYGGIGKI ----------------------!!!!----------2222---1111-----------22 GSEYNGAVYFTPQIKDFIQKHIEQHHH 22----------------1111----- >TRANSCRIPTION ELONGATION ; SWP:P0A5M2; PDB:2ASBA; STREGEIVAGVIQRDSRANARGLVVVRIGTETKASEGVIPAAEQVPGESYEHGNRLRCYV --2222------------1111-----------------3333-2222--2222------ VGVTRGAREPLITLSRTHPNLVRKLFSLEVPEIADGSVEIVAVAREAGHRSKIAVRSNVA -----------------3333---------3333-----------2222---------22 GLNAKGACIGPMGQRVRNVMSELSGEKIDIIDYDDDPARFVANALSPAKVVSVSVIDQTA 22-------2222---------iiii----------------1111-----------111 RAARVVVPDFQLSLAIGKEGQNARLAARLTGWRIDIRGDAPPPPPG 1------3333-----2222-------------------------- >NEUROTOXIN ALPHA-IT; SWP:P17728; PDB:2ASCA; MVRDAYIAKNYNCVYECFRDAYCNELCTKNGASSGYCQWAGKYGNACWCYALPDNVPIRV ------------------3333-----1111----------------------------- PGKCR ----- >HYPOTHETICAL PROTEIN RV20; SWP:Q10682; PDB:2ASFA; SDDALAFLSERHLALTTLRADNSPHVVAVGFTFDPKTHIARVITTGGSQKAVNADRSGLA ------1111--------1111----------------------2222------------ VLSQVDGARWLSLEGRAAVNSDIDAVRDAELRYAQRYRTPRPNPRRVVIEVQIERVLGSA -----!!!!---------------------------------1111------------33 DLLD 33-- >QUEUINE TRNA-RIBOSYLTRANS; SWP:Q9X1P7; PDB:2ASHA; MEFEVKKTFGKARLGVMKLHHGAVETPVFMPVGTNASVKLLTPRDLEEAGAEIILSNTFH 
--------!!!!------1111---------------2222-----3333---------- LMLKPGVEIIKLHRGLHNFMGWKRPILTDSGGFQVFSLPKIRIDDEGVVFRSPIDGSKVF ---------3333---------------------1111-----3333------------- LNPEISMEVQIALGSDICMVFDHCPVADYEEVKEATERTYRWALRSKKAFKTENQALFGI ------------------------------------------------------------ VQGGIYPDLRRESALQLTSIGFDGYAIGGLSIGEERSLTLEMTEVTVEFLPEDKPRYFMG --!!!!----------------------------3333-------3333-1111------ GGSPELILELVDRGVDMFDSVFPTRIARHGTALTWNGKLNLKASYNKRSLEPVDERCGCY ----------1111------3333---------1111--11111111------1111-33 TCKNFTRSYIHHLFDRGEVLGQILLTIHNINFMISLMKEVRRSIESGTFKELKSKVVEVY 33-----------1111------------------------------------------- S - >ASPARTIC PROTEINASE; SWP:P00799; PDB:2ASI; GSVDTPGYYDFDLEEYAIPVSIGTPGQDFLLLFDTGSSDTWVPHKGCTKSEGCVGSRFFD ----------1111---------------------------------3333--------3 PSASSTFKATNYNLNITYGTGANGLYFEDSIAIGDITVTKQILAYVDNVRGPTAEQSPNA 3331111-------------------------%%%%--------------3333------ DIFLDGLFGAAYPDNTAMEAEYGSTYNTVHVNLYKQGLISSPLFSVYMNTNSGTGEVVFG -----------1111------------------1111----------------------- GVNNTLLGGDIAYTDVMSRYGGYYFWDAPVTGITVDGSAAVRFSRPQAFTIDTGTNFFIM --3333---------------------------------------------1111----- PSSAASKIVKAALPDATETQQGWVVPCASYQNSKSTISIVMQKSGSSSDTIEISVPVSKM -------3333--------------------------------------------3333- LLPVDQSNETCMFIILPDGGNQYIVGNLFLRFFVNVYDFGNNRIGFAPLASAYENE -----------------------------1111----------------3333--- >ARTEMIN; SWP:Q5T4W7; PDB:2ASKA; ARGCRLRSQLVPVRALGLGHRSDELVRFRFCSGSCRRARSPHDLSLASLLGAGALRPPPG -----------3333-------------------3333-----------1111------- SRPVSQPCCRPTRYEAVSFMDVNSTWRTVDRLSATACGCLG --------------------1111----------------- >ASPARTATE RECEPTOR; SWP:P07017; PDB:2ASR; KSFVVSNQLREQQGELTSTWDLMLQTRINLSRSAVRMMMDSSNQQSNAKVELLDSARKTL ------------------------------------------3333-------------- AQAATHYKKFKSMAPLPEMVATSRNIDEKYKNYYTALTELIDYLDYGNTGAYFAQPTQGM ---------3333--3333------------------------------------3333- QNAMGERFAQYALSSEKLYRDI 
---------------------- >Cyclin-dependent kinases ; SWP:P61024; PDB:2ASTC; QIYYSDKYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHYMIHEPEP --------------------33331111-----------3333------------3333- HILLFRRPL --------- >Hepatocyte growth factor-; SWP:P26927; PDB:2ASUB; VVGGHPGNSPWTVSLRNRQGQHFCGGSLVKEQWILTARQCFSSCHMPLTGYEVWLGTLFQ --------1111----1111---------1111---1111-------2222--------- NPQHGEPSLQRVPVAKMVCGPSGSQLVLLKLERSVTLNQRVALICLPPEWYVVPPGTKCE --1111--------------2222-------------1111------2222--2222--- IAGWGETKGTGNDTVLNVALLNVISNQECNIKHRGRVRESEMCTEGLLAPVGACEGDYGG ------%%%%--------------3333--1111---1111------------2222--- PLACFTHNSWVLEGIIIPNRVCARSRWPAVFTRVSVFVDWIHKVM -----%%%%--------------2222-----3333--------- >HYPOTHETICAL PROTEIN AF15; SWP:O28769; PDB:2ASWA; GSSTITRPIIELSNTADKIAEGNLEAEVPHQNRADEIGILAKSIERLRRSLKVAME 3333------------------1111---1111----------------------- >NEUROTOXIN ALPHA-IT; SWP:P17728; PDB:2ATBA; MVRDAYIADDVNCVYECFRDAYCNELCTKNGASSGYCQWAGKYGNACWCYALPDNVPIRV --------1111------3333-----1111---------1111---------------- PGKCR ----- >Aspartate carbamoyltransf; SWP:P0A7F3; PDB:2ATCB; MTHNDKLQVAEIKRGTVINHIPAEIGFKLLSLFKLTETQDRITIGLNLPSGEMGRKDLIK ---------------------------3333----------------------------- IENTFLSEDEVDELALYAPQATVNRINDYEVVGKSRPSLPERNIDVLVCPDSNCISHAEP -----------1111---------------------------------------2222-- VSSSFAVRRADDIALKCKYCEKEFSHNVVLAN -------------------------------- >GLYCOGEN PHOSPHORYLASE, L; SWP:P06737; PDB:2ATIA; NVAELKKSFNRHLHFTLVKDRNVATTRDYYFALAHTVRDHLVGRWIRTQQHYYDKCPKRV -------------------3333------------------------------------- YYLSLEFYMGRTLQNTMINLGLQNACDEAIYQLGLDIEELEEIEEDAGLGNGGLGRLAAC ------------------------------1111-3333-3333---------------- FLDSMATLGLAAYGYGIRYEYGIFNQKIRDGWQVEEADDWLRYGNPWEKSRPEFMLPVHF ----------------------------iiii------1111--------3333------ YGKVEHTNTGTKWIDTQVVLALPYDTPVPGYMNNTVNTMRLWSARAPDYIQAVLDRNLAE ------1111-------------------------------------------------- 
NISRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKASKFTVFDAFPDQVAIQLNDT ---------------3333--------------------------1111----------- HPALAIPELMRIFVDIEKLPWSKAWELTQKTFAYTNHTVLPEALERWPVDLVEKLLPRHL ---------------------------------------1111----------------- EIIYEINQKHLDRIVALFPKDVDRLRRMSLIEEEGSKRINMAHLCIVGSHAVNGVAKIHS --------------------3333-------------------------------3333- DIVKTKVFKDFSELEPDKFQNKTNGITPRRWLLLCNPGLAELIAEKIGEDYVKDLSQLTK --------3333--3333----------1111-----------------3333------3 LHSFLGDDVFLRELAKVKQENKLKFSQFLETEYKVKINPSSMFDVQVKRIHEYKRQLLNC 3331111------------------------------1111---------3333------ LHVITMYNRIKKDPKKLFVPRTVIIGGKAAPGYHMAKMIIKLITSVADVVNNDPMVGSKL ------------1111-------------1111----------------1111--!!!!- KVIFLENYRVSLAEKVIPATDLSEQISTAGTEASGTGNMKFMLNGALTIGTMDGANVEMA --------3333---3333--------2222----------1111-------!!!!---- EEAGEENLFIFGMRIDDVAALDKKGYEAKEYYEALPELKLVIDQIDNGFFSPKQPDLFKD ---3333------3333------------------------------1111--1111--- IINMLFYHDRFKVFADYEAYVKCQDKVSQLYMNPKAWNTMVLKNIAASGKFSSDRTIKEY -----------3333------------------3333------33333333--------- AQNIWNVEPSD ----------- >Voltage-gated potassium c; SWP:P0A334; PDB:2ATKC; SALHWRAAGAATVLLVIVLLAGSYLAVLAERGAPGAQLITYPRALWWSVATATTVGYGDL -3333------------------------2222------3333-------1111------ YPVTLWGRCVAVVVMVAGITSFGLVTAALATWFVGREQERRGH --------------------------------------1111- >HYALURONOGLUCOSAMINIDASE; SWP:P49370; PDB:2ATMA; RVFNIYWNVPTFMCHQYDLYFDEVTNFNIKRNSKDDFQGDKIAIFYDPGEFPALLSLKDG ---------33333333---11111111---2222------------------------- KYKKRNGGVPQEGNITIHLQKFIENLDKIYPNRNFSGIGVIDFERWRPIFRQNWGNMKIH ----iiii1111-------------------1111-------------3333-!!!!--- KNFSIDLVRNEHPTWNKKIELEASKRFEKYARFFMEETLKLAKKTRKQADWGYYGYPYCF -----------1111------------------------------1111---2222---- NMSPNNLVPECDVTAMHENDKMSWLFNNQNVLLPSVYVRQELTPDQRIGLVQGRVKEAVR --1111---------------33331111---------1111------------------ ISNNLKHSPKVLSYWWYVYQDETNTFLTETDVKKTFQEIVINGGDGIIIWGSSSDVNSLS 
----1111----------1111-----------------1111--------3333----- KCKRLQDYLLTVLGPIAINVTEA ----------------------- >T-cell surface glycoprote; SWP:P10300; PDB:2ATPB; LIQTPSSLLVQTNHTAKMSCEVKSISKLTSIYWLRERQDPKDKYFEFLASWSSSKGVLYG ----------2222---------------------------------------------3 ESVDKKRNIILESSDSRRPFLSIMNVKPEDSDFYFCATVGSPKMVFGTGTKLTVV 333---------1111----------3333------------------------- >ACETYLTRANSFERASE, GNAT F; SWP:Q97SR8; PDB:2ATRA; ITIKKQEIVKLEDVLHLYQAVGWTNELEQALSHSLVIYLALDGDAVVGLIRLVGDGFSSV ---------33333333---------3333-----------!!!!--------------- FVQDLIVLPSYQRQGIGSSLKEALGNFKEAYQVQLATEETEKNVGFYRSGFEILSTYDCT -------3333---3333--------1111----------------------3333---- GIWINRE ------- >DIHYDRODIPICOLINATE SYNTH; SWP:P0A6L3; PDB:2ATSA; MFTGSIVAIVTPMDEKGNVCRASLKKLIDYHVASGTSAIVSVGTTGESATLNHDEHADVV -------------1111-------------------------33333333---------- MMTLDLADGRIPVIAGTGANATAEAISLTQRFNDSGIVGCLTVTPYYNRPSQEGLYQHFK ------iiii-------------------1111--------------------------- AIAEHTDLPQILYNVPSRTGCDLLPETVGRLAKVKNIIGIKEATGNLTRVNQIKELVSDD --1111--------3333---------------1111---------------3333-111 FVLLSGDDASALDFMQLGGHGVISVTANVAARDMAQMCKLAAEGHFAEARVINQRLMPLH 1-----3333----1111------3333------------1111---------------- NKLFVEPNPIPVKWACKELGLVATDTLRLPMTPITDSGRETVRAALKHAGLL ----------------1111--------------------------1111-- >RAS-LIKE ESTROGEN-REGULAT; SWP:Q96A58; PDB:2ATVA; AEVKLAIFGRAGVGKSALVVRFLTKRFIWEYDPTLESTYRHQATIDDEVVSMEILDTAGQ ---------2222------------------1111------------------------- EDTIQREGHMRWGEGFVLVYDITDRGSFEEVLPLKNILDEIKKPKNVTLILVGNKADLDH --------------------11113333--------------------------333311 SRQVSTEEGEKLATELACAFYECSACTGEGNITEIFYELCREVRRRRM 11-----------1111------------------------------- >SMALL GTP BINDING PROTEIN; SWP:NA; PDB:2ATXA; SMAHGPGALMLKCVVVGDGAVGKTCLLMSYANDAFPEEYVPTVFDHYAVSVTVGGKQYLL -----------------2222--------------------------------------- GLYDTAGQEDYDRLRPLSYPMTDVFLICFSVVNPASFQNVKEEWVPELKEYAPNVPFLLI 
-----------11113333----------1111--------------------------- GTQIDLRDDPKTLARLNDMKEKPICVEQGQKLAKEIGACCYVECSALTQKGLKTVFDEAI --3333------------------3333-----1111--------1111----------- IAILTP ------ >H. PYLORI PREDICTED CODIN; SWP:O24984; PDB:2ATZA; EELKLIKIDTSHYFEKKPGLGERVDYAGRCFYNKFQRVNALTSSLIQKHLKREIEIAHNL -------------------------iiii-------------------1111-------- ILRNDKVENIVFDYNGRNPERFYHKAQLLLREEGFNFTAYNTKTPGHLHLYVHKGHTELG ------------------------------1111-------------------------- EGERLVKTLSKLAQGLPKEWKVFPSNEWPKEFNILALPYEVFAKERGSSWAK ----------3333--------------1111-------------------- >DNA PRIMASE; SWP:O67465; PDB:2AU3A; MSSDIDELRREIDIVDVISEYLNLEKVGSNYRTNCPFHPDDTPSFYVSPSKQIFKCFGCG --------------------------!!!!------------------1111-------- VGGDAIKFVSLYEDISYFEAALELAKRYGKKLDLEKISKDEKVYVALDRVCDFYRESLLK --------------------------------3333------------------------ NREASEYVKSRGIDPKVARKFDLGYAPSSEALVKVLKENDLLEAYLETKNLLSPTKGVYR --------1111------------------------1111-33331111-----2222-- DLFLRRVVIPIKDPRGRVIGFGGRRIVEDKSPKYINSPDSRVFKKGENLFGLYEAKEYIK 1111--------1111-----------------------33331111-2222-------- EEGFAILVEGYFDLLRLFSEGIRNVVAPLGTALTQNQANLLSKFTKKVYILYDGDDAGRK -----------------1111--------------------------------------- AMKSAIPLLLSAGVEVYPVYLPEGYDPDEFIKEFGKEELRRLINSSGELFETLIKTAREN ---------1111--------2222------------------------------3333- LEEKTREFRYYLGFISDGVRRFALASEFHTKYKVPMEILLMKI -----------1111-------------------3333----- >CONSERVED DOMAIN PROTEIN; SWP:Q82ZV0; PDB:2AU5A; ALILSPNFEYEEITRSFLSNLAFTRGHFTGDISHFSPIVLAEEKDPNWLEEAAGGQGVIV ------3333----------------------------------1111------------ QSLLEDENFSSVEQLKGELARLIRLYFALAKDNLTENQESLYVDLFDKFTFLLLCSDEFI -----3333-------------------1111------------------------3333 YLDS 3333 >INORGANIC PYROPHOSPHATASE; SWP:P0A7A9; PDB:2AU7A; SLLNVPAGKDLPEDIYVVIEIPANADPIKYEIDKESGALFVDQFMSTAMFYPCNYGYINH 3333-------------------------------------------------------- TLSLDGDPVDVLVPTPYPLQPGSVTRCRPVGVLKMTDEAGEDAKLVAVPHSKLSKEYDHI 
-------------------2222-------------1111---------3333-1111-- KDVNDLPELLKAQIAHFFEHYKDLEKGKWVKVEGWENAEAAKAEIVASFERAKNK -3333--------------1111-2222--------------------------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2AUAA; NETEFYAYHIVTRKKHIGQIPFNKNQHNTLYHFFFEREQLNANGEDGIQILNNHYKNDEL ---------------2222---------------------1111-----------%%%%- HINNENAKVVISYDQTIRAARETIVEVRLQEFPEYPSRLSCLYAAKSYEDALKWKALFDS -------------------------------3333-1111------------------11 YNREVLQIVKLRVIGSSFEGDGNLLPKEDGIPFSQKIEQARKYWKGNNELPELLINGEIE 11------------------3333------------------------------------ VVEIIDDF -------- >MYOSIN A TAIL INTERACTING; SWP:Q9NG97; PDB:2AUCA; NGKLRIEDASHNARKLGLAPSSTDEKKIRDLYGDSLTYEQYLEYLTCVHDRDNEELIKFS -------------1111------------------------------------------1 HFDNNSSGFLTKNQKNILTTWGDALTEQEANDALNAFSSEDRINYKLFCEDI 111-------3333-----------3333----------------------- >GROWTH FACTOR RECEPTOR-BO; SWP:Q14449; PDB:2AUGA; IHRSQPWFHHKISRDEAQRLIIQQGLVDGVFLVRDSQSNPKTFVLSMSHGQKIKHFQIIP 333311112222---------1111-2222------------------%%%%-------- VEDDGEMFHTLDDGHTRFTDLIQLVEFYQLNKGVLPCKLKHYCAR --iiii-----iiii-----------3333-!!!!---------- >Growth factor receptor-bo; SWP:Q14449; PDB:2AUHB; ENSLVAMDFSGQKSRVIENPTEALSVAVEEGLAWRKK ------------------------------3333--- >DNA-directed RNA polymera; SWP:Q9KWU6; PDB:2AUJD; LTDEEYRELRYGKQETYPLPAGVDALVKDGEEVVKGQELAPGVVSRMDGVALYRFPRRVR -3333--------------2222----------2222----------------------- VDYLRKERAALRIPLSAWVEKEAYRPGEVLAELSEPYLFRAEESGVVELKDLAEGHLIYL -------------3333-------2222-----------------------!!!!----- RQEEEVVARYFLPAGMTPLVVEGEIVEVGQPLAEGKGLLRLPRHMTAKEVEAEEEGDSVH -!!!!---------------2222--2222-----------1111--------------- LTLFLEWTEPKDYKVAPHMNVIVPEGAKVQAGEKIVAAIDPEEEVIAEAEGVVHLHEPAS -----------------------2222--2222-------3333---------------- ILVVKARVYPFEDDVEVTTGDRVAPGDVLADGGKVKSEIYGRVEVDLVRNVVRVVESY ---------------------------------------------------------- >DNA-DIRECTED RNA POLYMERA; SWP:P0A8T7; PDB:2AUKA; 
GSHMAAAESSIQVKNKGSIKLSNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPY ---3333---------------------1111--------------1111--------22 GAVLAKGDGEQVAGGETVANWDPHTMPVITEVSGFVRFTDMIDGQTITRQTDELTGLSSL 22----2222-------------------------------2222--------------- VVLDSAERTAGGKDLRPALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGD --------3333----------1111----------------2222----2222--2222 TLARIPQES --------- >Probable tRNA pseudouridi; SWP:Q9V1A5; PDB:2AUSC; ADIKREVIVKDDKAETNPKWGFPPDKRPIELHIQYGVINLDKPPGPTSHEVVAWIKRILN ----------1111--1111--1111------------------------------1111 LEKAGHGGTLDPKVSGVLPVALERATRVVQALLPAGKEYVALMHLHGDVPEDKIRAVMKE ----------3333-------!!!!--33331111---------------------1111 FEGEIIQRTRKVYYIEILEIDGRDVLFRVGVEAGTYIRSLIHHIGLALGVGAHMAELRRT --------------------!!!!-------2222------------------------- RSGPFKEDETLVTLHDLVDYYHFWKEDGIEEYIRKAIQPMEKAVEHLPKIWIKDSAVAAV -!!!!--1111----------------------1111--33331111------3333--1 AHGANLTVPGIVKLNAGIKKGDLVAIMTLKDELVALGKAMMSTQEMIERSKGIAVDVEKV 111---3333--------2222-----1111----------------------------- FMPRDWYPKLW --1111----- >Ribosome biogenesis prote; SWP:Q9V0E3; PDB:2AUSD; RIRKCPKCGRYTLKETCPVCGEKTKVAHPPRFSPEDPYGEYRRRLKRELLGIG -----------------------------------1111-------------- >POTENTIAL NAD-REDUCING HY; SWP:Q46505; PDB:2AUVA; MVPKGKYPISVCMGTACFVKGADKVVHAFKEQLKIDIGDVTPDGRFSIDTLRCVGGCALA -------------3333--------------------------------3333---1111 PIVMVGEKVYGNVTPGQVKKILAEY ----!!!!-----------3333-- >HYPOTHETICAL PROTEIN NE04; SWP:Q82X29; PDB:2AUWA; YFFPKLTAVEALAPYRLRTTWSTGEVLEVDVGDILRKIPDLAPILDPEAFARVHIAEWEG --------------------1111-------------333311113333----------- SVEWFDTEFGRDNVYAWAKEQAGEVSHEFGDWHRNNLSLTTAAEALGISRRVSYYRTAHK -------------------1111---------1111-3333------------------- IIPRTIWLACLGWEATRPETKTLPRTLP ---------------------------- >THIOREDOXIN-LIKE PROTEIN ; SWP:Q7R866; PDB:2AV4A; HHMLQHLNSGWAVDQAIVNEDERLVCIRFGHDYDPDCMKMDELLYKVADDIKNFCVIYLV ------------------------------3333--------------1111-------- 
DITEVPDFNTMYELYDPVSVMFFYRNKHMMIDLGTGNNNKINWPMNNKQEFIDIVETIFR 3333-11111111----------iiii--------------------------------- GARKGRGLVISPKDY -1111---------- >RIBONUCLEASE P PROTEIN CO; SWP:Q8U151; PDB:2AV5A; KKRYIAFKVISENQFNKDEIKEAIWNACLRTLGELGTAKAKPWLIKFDETTQTGIIRSDR ---------------3333---------------------------------------11 NHVYDVIFSLTLVSDINGNKAIIKVLGVSGTIKRLKRKFLSQFGWR 113333---------iiii--------------------1111--- >THIOESTERASE; SWP:Q9HU04; PDB:2AV9A; PRPLREQYLHFQPISTRWHDNDIYGHVNNVTYYAFFDTAVNTYLIERGGLDIQGGEVIGL ---3333---------1111-1111--3333----------------------------- VVSSSCDYFAPVAFPQRIEGLRVARLGNSSVQYELALFLEGQREACAAGRFVHVFVERRS ------------------------------------------------------------ SRPVAIPQELRDALAALQSSA --------------1111--- >ACYL CARRIER PROTEIN I, C; SWP:P07854; PDB:2AVAA; AKKETIDKVSDIVKEKLALGADVVVTADSEFSKLGADSLDTVEIVMNLEEEFGINVDEDK -------------3333--------1111----1111--3333------3333---3333 AQDISTIQQAADVIEGLLEKKA ---------------1111--- >CATECHOL-O-METHYLTRANSFER; SWP:Q86VU5; PDB:2AVDA; QCLLPPEDSRLWQYLLSRSMREHPALRSLRLLTLEQPQGDSMMTCEQAQLLANLARLIQA -----1111--------------------------2222----------------1111- KKALDLGTFTGYSALALALALPADGRVVTCEVDAQPPELGRPLWRQAEAEHKIDLRLKPA -------!!!!------11111111---------3333------11113333------33 LETLDELLAAGEAGTFDVAVVDADKENCSAYYERCLQLLRPGGILAVLRVLWRGKVLQPP 33---------2222---------1111-----------2222-------%%%%3333-2 KGDVAAECVRNLNERIRRDVRVYISLLPLGDGLTLAFKI 222------------------------------------ >MYOSIN-BINDING PROTEIN C,; SWP:Q14896; PDB:2AVGA; DDPIGLFVMRPQDGEVTVGGSITFSARVAGASLLKPPVVKWFKGKWVDLSSKVGQHLQLH ------------------------------------------------------------ DSYDRASKVYLFELHITDAQPAFTGGYRCEVSTKDKFDCSNFNLTVHEAM ----1111-----------1111--------------------------- >HEMERYTHRIN-LIKE DOMAIN P; SWP:Q9REU3; PDB:2AVKA; DVLVKWSEDLANLPSIDTQHKRLVDYINDLYRAARRRDMDKAREVFDALKNYAVEHFGYE ------1111-----------------------1111----------------------- ERLFADYAYPEATRHKEIHRRFVETVLKWEKQLAAGDPEVVMTTLRGLVDWLVNHIMKED 
--------1111------------------------------------------------ KKYEAYLRERGVS -------1111-- >ubiquinone/menaquinone bi; SWP:Q9X1A9; PDB:2AVNA; HKLRSWEFYDRIARAYDSYETPKWKLYHRLIGSFLEEYLKNPCRVLDLGGGTGKWSLFLQ -----------1111----------------------------------!!!!------1 ERGFEVVLVDPSKELEVAREKGVKNVVEAKAEDLPFPSGAFEAVLALGDVLSYVENKDKA 111--------------------------3333-------------%%%%---------- FSEIRRVLVPDGLLIATVDNFYTFLQQIEKDAWDQITRFLKTQTTSVGTTLFSFNSYAFK --------2222-------------------3333-------------1111-------3 PEDLDSLEGFETVDIRGIGVEYPDERISEREETIFRLEQELSRDRNIIWKADHIFFVLKK 333---2222------------3333-----------------33333333--------- KR -- >SYNTHETIC CONSENSUS TPR P; SWP:NA; PDB:2AVPA; AEAWYNLGNAYYKQGDYDEAIEYYQKALELDPRSAEAWYNLGNAYYKQGDYDEAIEYYQK -----------1111---------------1111-----------1111----------- ALELDPRS -------- >ADHESION A; SWP:NA; PDB:2AVRX; AASLVGELQALDAEYQNLANQEEARFNEERAQADAARQALAQNEQVYNELSQRAQRLQAE -----------------------------------------------------------3 ANTRFYKSQYQELASKYEDALKKLEAEMEQQKAVISDFEKIQALRAGNL 333-----------------------------------------1111- >DNA POLYMERASE III BETA S; SWP:Q9EVR1; PDB:2AVTA; MIQFSINRTLFIHALNTTKRAISTKNAIPILSSIKIEVTSTGVTLTGSNGQISIENTIPV ------------------1111-----3333-------1111------------------ GLLITSPGAILLEASFFINIISSLPDISINVKEIEQHQVVLTSGKSEITLKGKDVDQYPR --------------------1111----------%%%%----!!!!-------3333--- LQEVSTENPLILKTKLLKSIIAETAFAASLQESRPILTGVHIVLSNHKDFKAVATDSHRM -----------------------3333---33331111---------------------- SQRLITLDNTSADFMVVLPSKSLREFSAVFTDDIETVEVFFSPSQILFRSEHISFYTRLL ------------------3333--------3333-------1111----1111------- EGNYPDTDRLLMTEFETEVVFNTQSLRHAMERAFLISNATQNGTVKLEITQNHISAHVNS ------1111-----------------------------2222----------------- PEVGKVNEDLDIVSQSGSDLTISFNPTYLIESLKAIKSETVKIHFLSPVRPFTLTPGDEE ------------------------3333----1111----------1111-----1111- ESFIQLITPVRT ------------ >TRANSCRIPTIONAL ACTIVATOR; SWP:P11165; PDB:2AVUE; SIVQEARDIQLAMELITLGARLQMLESETQLSRGRLIKLYKELRGSPPPKGMLPFSTDWF 
-3333----------------3333--------------------------------333 MTWEQNVHASMFCNAWQFLLKTGLCNGVDAVIKAYRLYLEQCPQAEEGPLLALTRAWTLV 3-------------------------------------1111------------------ RFVESGLLQLSSCNCCGGNFITHAHQPVGSFACSLC --1111------------------------------ >REGULATORY PROTEIN SDIA; SWP:P07026; PDB:2AVXA; MSDKDFFSWRRTMLLRFQRMETAEEVYHEIELQAQQLEYDYYSLCVRHPVPFTRPKVAFY ---------------------------------3333------------1111------- TNYPEAWVSYYQAKNFLAIDPVLNPENFSQGHLMWNDDLFSEAQPLWEAARAHGLRRGVT -----------11113333-11113333-------3333---3333-------------- QYLMLPERALGFLSFSRCSAREIPILSDELQLKMQLLVRESLMALMRLNDE ----3333------------------3333--------------------- >30S ribosomal protein S21; SWP:P68679; PDB:2AVYU; IKVRENEPFDVALRRFKRSCEKAGVLAEVRRREFYEKPTTERKRAKASAVK ------------1111---------3333----!!!!------3333---- >B AND T LYMPHOCYTE ATTENU; SWP:Q7Z6A9; PDB:2AW2A; CDVQLYIKRQSEHSILAGDPFELECPVKYCANRPHVTWCKLNGTTCVKLEDRQTSWKEEK ------------------------------------------------------------ NISFFILHFEPVLPNDNGSYRCSANFQSNLIESHSTTLYVTDVKHHHHHH ------------1111---------iiii------------3333----- >NADP-DEPENDENT MALIC ENZY; SWP:P48163; PDB:2AW5A; GYLLTRNPHLNKDLAFTLEERQQLNIHGLLPPSFNSQEIQVLRVVKNFEHLNSDFDRYLL 3333-------!!!!-3333---------------------------------------- LMDLQDRNEKLFYRVLTSIEKFMPIVYTPTVGLACQQYSLVFRKPRGLFITIHDRGHIAS ---------------------3333---3333------------------3333--3333 VLNAWPEDVIKAIVVTDGERILGLGDLGCNGMGIPVGKLALYTACGGMNPQECLPVILDV --------------------!!!!------------------------3333-------- GTENEELLKDPLYIGLRQRRVRGSEYDDFLDEFMEAVSSKYGMNCLIQFEDFANVNAFRL ---------1111----------------------------1111--------------- LNKYRNQYCTFNDDIQGTASVAVAGLLAALRITKNKLSDQTILFQGAGEAALGIAHLIVM ---1111----3333--------------------3333--------------------- ALEKEGLPKEKAIKKIWLVDSKGLIVKGQEKEKFAHEHEEMKNLEAIVQEIKPTALIGVA ------------1111---1111-----3333---------------------------- AIGGAFSEQILKDMAAFNERPIIFALSNPTSKAECSAEQCYKITKGRAIFASGSPFDPVT -2222-----------------------3333-----------iiii------------- 
LPNGQTLYPGQGNNSYVFPGVALGVVACGLRQITDNIFLTTAEVIAQQVSDKHLEEGRLY 3333--------1111-----------------3333--------1111----1111--- PPLNTIRDVSLKIAEKIVKDAYQEKTATVYPEPQNKEAFVRSQMYSTDYDQ -3333-----------------------------------1111------- >DNA POLYMERASE III, BETA ; SWP:O06672; PDB:2AWAA; IHFSINKNLFLQALNTTKRAISSKNAIPILSTVKIDVTNEGITLIGSNGQISIENFISQK ----------------3333------3333-------3333----------------333 NEDAGLLITSLGSILLEASFFINVVSSLPDVTLDFKEIEQNQIVLTSGKSEITLKGKDSE 33333-------------------1111----------%%%%----!!!!--------11 QYPRIQEISASTPLILETKLLKKIINETAFAASTQESRPILTGVHFVLSQHKELKTVATD 11-------------------------3333------3333------------------- SHRLSQKKLTLEKNSDDFDVVIPSRSLREFSAVFTDDIETVEIFFANNQILFRSENISFY ----------------------------------3333-------1111----1111--- TRLLEGNYPDTDRLIPTDFNTTITFNVVNLRQSERARLLSSATQNGTVKLEIKDGVVSAH ----------1111------------------------3333----------!!!!---- VHSPEVGKVNEEIDTDQVTGEDLTISFNPTYLIDSLKALNSEKVTISFISAVRPFTLVPA --1111---------------------3333----1111--------------------- DTDEDFQLITPVRT -------------- >UBIQUITIN-CONJUGATING ENZ; SWP:P62253; PDB:2AWFA; GLVPRGSLLLRRQLAELNKNPVEGFSAGLIDDNDLYRWEVLIIGPPDTLYEGGVFKAHLT ---1111--------------2222-----3333-------------1111--------- FPKDYPLRPPKMKFITEIWHPNVDKNGDVCISILHEPPEERWLPIHTVETIMISVISMLA -1111--------------11111111---3333--------1111-------------- DP -- >38 KDA FK-506 BINDING PRO; SWP:Q14318; PDB:2AWGA; GSPEEWLDILGNGLLRKKTLVPGPPGSSRPVKGQVVTVHLQTSLENGTRVQEEPELVFTL -1111--1111------------1111---2222---------1111-----------22 GDCDVIQALDLSVPLMDVGETAMVTADSKYCYGPQGRSPYIPPHAALCLEVTLKTAVD 22---------3333-2222------3333--3333---------------------- >PRGX; SWP:Q04114; PDB:2AWIA; FKIGSVLKQIRQELNYHQIDLYSGISKSVYIKVEADSRPISVEELSKFSERLGVNFFEIL -----------1111-3333-2222-------1111------------------------ NRAGNSVNETGKEKLLISKIFTNPDLFDKNFQRIEPKRLTSLQYFSIYLGYISIAHHYNI ------------------33333333-------3333----------------------- EVPTFNKTITSDLKHLYDKRTTFFGIDCEIVSNLLNVLPYEEVSSIIKPYPIVDSFGKDY 
-3333-----------1111---3333------1111-33333333-------------- DLTIQTVLKNALTISINRNLKEAQYYINQFEHLKTIKNISINGYYDLEINYLKQIYQFLT ------------------------------3333-2222--------------------- DKNIDSYLNAVNIINIFKIIGKEDIHRSLVEELTKISAKEKFTPPKEVTMYYEN -----------------1111----------------1111-----33333333 >GREEN FLUORESCENT PROTEIN; SWP:P42212; PDB:2AWKA; KGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVT 3333---------------iiii----------3333----------------3333-33 TLVQCFSRYPDHMKQHDFFKSAMPEGYVQEMTISFKDDGNYKTRAEVKFEGDTLVNRIEL 333333---111111113333-------------2222-----------!!!!------- KGIDFKEDGNILGHKLEYNYNSHNVYITADKQKNGIKANFKIRHNIEDGSVQLADHYQQN -----1111-1111----------------1111-----------1111----------- TPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGI ------------------------1111---------------- >Maltose/maltodextrin impo; SWP:P68187; PDB:2AWNA; ASVQLQNVTKAWGEVVVSKDINLDIHEGEFVVFVGPSGCGKSTLLRMIAGLETITSGDLF -----------!!!!----------2222------2222--------------------- IGEKRMNDTPPAERGVGMVFQSYALEVINQRVNQVAEVLAIGRTLVAEPSVFLLDEPLSN ---------1111--------------------1111--3333------------1111- LDAALRVQMRIEISRLHKRLGRTMIYVTHDQVEAMTLADKIVVLDAGRVAQVGKPLELYH --------------------------------------------iiii-----3333--- YPADRFVAGFIGSPKMNFLPVKVTATAIDQVQVELPMPNRQQVWLPVESRDVQVGANMSL ----------------------------------------------------2222---- GIRPEHLLPSDIADVILEGEVQVVEQLGNETQIHIQIPSIRQNLVYRQNDVVLVEEGATF --3333------------------------------------------------2222-- AIGLPPERCHLFREDGTACRRLHKEPGVAS ----3333----1111-------------- >IRON SUPER-OXIDE DISMUTAS; SWP:Q4Y2M1; PDB:2AWPA; MAIILPKLKYALNALSPHISEETLNFHYNKHHAGYVNKLNGLIKDTPFATKSLVEIMKES ----------1111----------------------------2222-1111--------- TGAIFNNAAQIWNHSFYWDSMGPNCGGEPHGEIKEKIQEDFGSFNNFKNEFSNVLCGHFG -----------------1111---------3333--------------------1111-- SGWGWLVLNNNNKLVILQTHDAGNPIKDNTGIPILTCDIWEHAYYIDYRNDRPSYVKAWW -------------------!!!!----------------3333----!!!!--------- NLVNWNFANENLKKALQ ----------------- >SYNAPSE ASSOCIATED 
PROTEI; SWP:Q62696; PDB:2AWXA; KIMEIKLIKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIEGGAAHKDGKLQIGDKLLAVN ---------1111-------2222--2222--------2222--------2222----!! SVSLEEVTHEEAVTALKNTSDFVYLKVAKPTS !!------------------------------ >HYPOTHETICAL PROTEIN TM09; SWP:NA; PDB:2AX3A; HKEIDELTIKEYGVDSRILERAGISVVLAEEELGNLSDYRFLVLCGGGNNGGDGFVVARN --------------3333-----------------1111-------------------11 LLGVVKDVLVVFLGKKKTPDCEYNYGLYKKFGGKVVEQFEPSILNEFDVVVDAIFGTGLR 11-------------------------------------33331111------------- GEITGEYAEIINLVNKSGKVVVSVDVPSGIDSNTGKVLRTAVKADLTVTFGVPKIGHILF --------------------------2222-----------------------3333--- PGRDLTGKLKVANIGHPVHLINSINRYVITREVRSLLPERPRDSHKGTYGKVLIIAGSRL 3333---------------1111---------3333--------3333---------333 YSGAPVLSGGSLKVGTGLVKLAVPFPQNLIATSRFPELISVPIDTEKGFFSLQNLQECLE 3-3333-----1111-------------------3333------------3333------ LSKDVDVVAIGPGLGNNEHVREFVNEFLKTLEKPAVIDADAINVLDTSVLKERKSPAVLT 3333------2222---------------------------1111--------------- PHPGEARLVKKTVGDVKYNYELAEEFAKENDCVLVLKSATTIVTDGEKTLFNITGNTGLS -3333-1111-33332222------------------------------------3333- KGGSGDVLTGIAGFIAQGLSPLEASTVSVYLHGFAAELFEQDERGLTASELLRLIPEAIR --------------1111-----------------1111--1111--------------3 RLK 333 >Bifunctional 3'-phosphoad; SWP:O95340; PDB:2AX4A; QAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDN -----3333--------------------2222-----------------------3333 VRHGLNRNLGFSPGDREENIRRIAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAG 1111-1111----------------------------------------------3333- LPFFEIFVDAPLNICESRDVKGLYKRARAGEIKGFTGIDSDYEKPETPERVLKTNLSTVS ------------------1111----1111----2222--------------1111---- DCVHQVVELLQEQNIVPY ----------1111---- >Hypothetical 11.0 kDa pro; SWP:P40554; PDB:2AX5A; MVNVKVEFLGGLDAIFGKQRVHKIKMDKEDPVTVGDLIDHIVSTMINNPNDVSIFIEDDS -------------3333---------------3333---------------3333----- IRPGIITLINDTDWELEGEKDYILEDGDIISFTSTLHGG --------%%%%3333--------2222----------- >COLICIN E7; SWP:Q47112; PDB:2AXCA; 
SNSSVAAPAFGFPALAAPGAGTLGISVSGEALSAAIADIFAALKFSAWGIALYGILPSEI --------2222------iiii----------------------------------3333 AKDDPNSKIVTSLPAETVTNVQVSTLPLDQATVSVTKRVTDVVKDTRQHIAVVAGVPSVP 1111---------3333----3333-1111-------------%%%%------------- VVNAKPTRTPGVFHASFPGVPSLTVSTVKGLPVSTTLPRGITEDKGRTAVPAGFTFGGGS --------2222----2222-------2222------2222-------------2222-- HEAVIRFPKESGQKPVYVSVTDVLTPAQVKQRQDEEKRLQQEWNDAHP -------3333--------------------------------1111- >UBIQUITIN-PROTEIN LIGASE ; SWP:Q9UMT8; PDB:2AXIA; EQETLVRPKPLLLKLLKSVGAQKDTYTMKEVLFYLGQYIMTKRLYDEKQQHIVYCSNDLL ---------------3333--------------------------1111-----111133 GDLFGVPSFSVKEHRKIYTMIYRNLVVVNQQE 33-------3333--------1111------- >SF4 T CELL RECEPTOR BETA ; SWP:Q6GMR4; PDB:2AXJA; DGGITQSPKYLFRKEGQNVTLSCEQNLNHDAMYWYRQDPGQGLRLIYYSQIVNDFQKGDI --------------------------------------------------2222---111 AEGYSVSREKKESFPLTVTSAQKNPTAFYLCASRDRGTEKLFFGSGTQLSVLEDLNKVFP 1--------3333-----3333--------------------------------1111-- PEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLKEQPA --------------------------------------iiii--2222------------ LNDSRYCLSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGR --------------3333--1111-----------3333--------------------- AD -- >DISCREPIN; SWP:P84777; PDB:2AXKA; IDTNVKCSGSSKCVKICIDRYNTRGAKCINGRCTCYP --------11113333--------------------- >WERNER SYNDROME; SWP:Q14191; PDB:2AXLA; MDDSEDTSWDFGPQAFKLLSAVDILGEKFGIGLPILFLRGSNSQRLADQYRRHSLFGTGK 1111------3333---------------3333---------3333---3333-222211 DQTESWWKAFSRQLITEGFLVEVSRYNKFMKICALTKKGRNWLHKANTESQSLILQANEE 11--------------------1111---------3333--------------------- LCPKKLLLPSSKTVSSGTKEHCYN -------------------3333- >6-phosphofructo-2-kinase/; SWP:Q16875; PDB:2AXNA; LELTQSRVQKIWVPVDHRPSLPRSCGPNSPTVIVMVGLPARGKTYISKKLTRYLNWIGVP ------------------------------------------------------------ TKVFNVGEYRREAVKQYSSYNFFRPDNEEAMKVRKQCALAALRDVKSYLAKEGGQIAVFD ----3333----------3333-1111--------------------------------- 
ATNTTRERRHMILHFAKENDFKAFFIESVCDDPTVVASNIMEVKISSPDYKDCNSAEAMD ----------------1111-----------------------11111111--------- DFMKRISCYEASYQPLDPDKCDRDLSLIKVIDVGRRFLVNRVQDHIQSRIVYYLMNIHVQ ---------1111---1111-1111------iiii------------------------- PRTIYLCRHGENEHNLQGRIGGDSGLSSRGKKFASALSKFVEEQNLKDLRVWTSQLKSTI -------------------------------------------------------3333- QTAEALRLPYEQWKALNEIDAGVCEELTYEEIRDTYPEEYALREQDKYYYRYPTGESYQD -3333-------3333----!!!!---------------------------2222----- LVQRLEPVIMELERQENVLVICHQAVLRCLLAYFLDKSAEEMPYLKCPLHTVLKLTPVAY -----------3333----------------------33331111------------!!! GCRVESIYLNVESVCTHRERSNPLMRRNSSS !------------------------------ >HYPOTHETICAL PROTEIN ATU2; SWP:Q8UC14; PDB:2AXOA; AQEAVKGVVELFTSQGCASCPPADEALRKIQKGDVVGLSYHVDYWNYLGWTDSLASKENT -------------1111------------3333--------------------------- ERQYGYRALGRNGVYTPQAILNGRDHVKGADVRGIYDRLDAFKREGQGLNVPVSSKFAGD ------1111----------%%%%---1111-----------1111-------------- EVEIDIGAGNGKADVVVAYFTREQTVDVKKSYWHSVYDVQTVGWDGSPTVKLPASVVAKV ----------------------------------------------------3333---- KKGGCAVLLQTANASGDPAAIVGASILLGNETQLEHH ------------1111--------------------- >HYPOTHETICAL PROTEIN BSU2; SWP:O31896; PDB:2AXPA; TLIILEGPDCCFKSTVAAKLSKELKYPIIKGSSFELAKSGNEKLFEHFNKLADEDNVIID --------------------------------3333------------------------ RFVYSNLVYAKKFKDYSILTERQLRFIEDKIKAKAKVVYLHADPSVIKKRLRVRGDEYIE -3333--3333-------------------1111-------------------------- GKDIDSILELYREVSNAGLHTYSWDTGQWSSDEIAKDIIFLVELEHHHHHH -3333--------------------------------------3333---- >SACCHAROPINE DEHYDROGENAS; SWP:P38999; PDB:2AXQA; GKNVLLLGSGFVAQPVIDTLAANDDINVTVACRTLANAQALAKPSGSKAISLDVTDDSAL ---------3333---------1111----------------1111------1111---- DKVLADNDVVISLIPYTFHPNVVKSAIRTKTDVVTSSYISPALRELEPEIVKAGITVMNE ---1111-------3333------------------------------------------ IGLDPGIDHLYAVKTIDEVHRAGGKLKSFLSYCGGLPAPEDSDNPLGYKFSWSSRGVLLA -------------------------------------3333--1111-----------11 
LRNSAKYWKDGKIETVSSEDLMATAKPYFIYPGYAFVCYPNRDSTLFKDLYHIPEAETVI 11------iiii----33333333------3333--------------11111111---- RGTLRYQGFPEFVKALVDMGMLKDDANEIFSKPIAWNEALKQYLGAKSTSKEDLIASIDS -----2222----------1111---3333----------------------------11 KATWKDDEDRERILSGFAWLGLFSDAKITPRGNALDTLCARLEELMQYEDNERDMVVLQH 11------------------1111------------------------1111-------- KFGIEWADGTTETRTSTLVDYGKVGGYSSMAATVGYPVAIATKFVLDGTIKGPGLLAPYS -----1111-------------2222-3333----------------------------3 PEINDPIMKELKDKYGIYLKEKTVA 333---------------------- >PHOTOSYSTEM Q(B) PROTEIN; SWP:P0A445; PDB:2AXTA; SANLWERFCNWVTSTDNRLYVGWFGVIMIPTLLAATICFVIAFIAAPPVDIDGIREPVSG --3333----1111--------3333---------------------------------- SLLYGNNIITGAVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLIIFHFLLGASCYMGR 1111--3333-------3333----3333------------------------------- QWELSYRLGMRPWICVAYSAPLASAFAVFLIYPIGQGSFSDGMPLGISGTFNFMIVFQAE -----1111---3333----3333-------------3333----3333----------- HNILMHPFHQLGVAGVFGGALFCAMHGSLVTSSLIRETTETESANYGYKFGQEEETYNIV -3333-------------------------------------3333--2222-------- AAHGYFGRLIFQYASFNNSRSLHFFLAAWPVVGVWFTALGISTMAFNLNGFNFNHSVIDA ----------------------------------------------------------33 KGNVINTWADIINRANLGMEVMHERNAHNFPLDLA 33----3333-------------1111-------- >Photosystem II manganese-; SWP:P0A431; PDB:2AXTO; TLTYDDIVGTGLANKCPTLDDTARGAYPIDSSQTYRIARLCLQPTTFLVKEEPKNKRQEA --111122223333---------------------------------------------- EFVPTKLVTRETTSLDQIQGELKVNSDGSLTFVEEDGIDFQPVTVQMAGGERIPLLFTVK ------------------------------------------------------------ NLVASTQPNVTSITTSTDFKGEFNVPSYRTANFLDPKGRGLASGYDSAIALPQAKEEELA -------------3333------------1111-1111---------1111----11111 RANVKRFSLTKGQISLNVAKVDGRTGEIAGTFESEQLSDDDMGAHEPHEVKIQGVFYASI 111-------------------------------------%%%%---------------- EP -- >Photosystem II 12 kDa ext; SWP:Q9F1L5; PDB:2AXTU; EELVNVVDEKLGTAYGEKIDLNNTNIAAFIQYRGLYPTLAKLIVKNAPYESVEDVLNIPG ----3333-----2222--1111----------------------------3333----- 
LTERQKQILRENLEHFTVTEVETALVEGGDRYNNGLYK -----------1111------3333-%%%%-------- >DRAD INVASIN; SWP:Q7BG36; PDB:2AXWA; AELHLESRGGSGTQLRDGAKVATGRIICREAHTGFHVWMNERQVDGRAERYVVQSKDGRH ---------------2222------------------------%%%%-------1111-- ELRVRTGGDGWSPVKGEGGKGVSRPGQEEQVFFDVMADGNQDIAPGEYRFSVGGACVVPQ -------2222----2222----------------------------------------- EKLAAALEHHHHHH -------------- >POLY(RC)-BINDING PROTEIN ; SWP:Q15366; PDB:2AXYA; KNVTLTIRLLHGKEVGSIIGKKGESVKKREESGARINISEGNCPERIITLAGPTNAIFKA -------------------2222------------------------------------- FAIIDKLEE -----3333 >AROMATIC AMINO ACID AMINO; SWP:P95468; PDB:2AY1A; MLGNLKPQAPDKILALMGEFRADPRQGKIDLGVGVYKDATGHTPIMRAVHAAEQRMLETE 1111---------------------------------1111------------------- TTKTYAGLSGEPEFQKAMGELILGDGLKSETTATLATVGGTGALRQALELARMANPDLRV ------11113333--------!!!!-3333-----------------------1111-- FVSDPTWPNHVSIMNFMGLPVQTYRYFDAETRGVDFEGMKADLAAAKKGDMVLLHGCCHN ------3333-------------------------------3333-2222---------- PTGANLTLDQWAEIASILEKTGALPLIDLAYQGFGDGLEEDAAGTRLIASRIPEVLIAAS -------------------------------------3333------------------- CSKNFGIYRERTGCLLALCADAATRELAQGAMAFLNRQTYSFPPFHGAKIVSTVLTTPEL --1111-3333------------------------1111--------------------- RADWMAELEAVRSGMLRLREQLAGELRDLSGSDRFGFVAEHRGMFSRLGATPEQVKRIKE --------------------------------11111111-------------------- EFGIYMVGDSRINIAGLNDNTIPILARAIIEVGV ------1111--3333-3333------------- >DNA POLYMERASE III SUBUNI; SWP:P06710; PDB:2AYAA; MKALEHEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAVCLHLR -----------------------------------------------------------3 SSQRHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 33333333333------------------------------------------------- IIADNNIQ -------- >WRKY TRANSCRIPTION FACTOR; SWP:Q9SI37; PDB:2AYDA; SRIVVHTQTLFDIVNDGYRWRKYGQKSVKGSPYPRSYYRCSSPGCPVKKHVERSSHDTKL ---------------------------2222----------2222-----------3333 LITTYEGKHDHDMPPG ---------------- >1,3-1,4-BETA-D-GLUCAN 4-G; SWP:P23904; 
PDB:2AYH; QTGGSFFEPFNSYNSGTWEKADGYSNGGVFNCTWRANNVNFTNDGKLKLGLTSSAYNKFD --------------------------!!!!----1111---1111---------2222-- CAEYRSTNIYGYGLYEVSMKPAKNTGIVSSFFTYTGPAHGTQWDEIDIEFLGKDTTKVQF -----------------------2222--------3333-----------3333------ NYYTNGVGGHEKVISLGFDASKGFHTYAFDWQPGYIKWYVDGVLKHTATANIPSTPGKIM ---iiii-----------3333---------1111----iiii----------------- MNLWNGTGVDDWLGSYNGANPLYAEYDWVKYTSN --------3333---------------------- >50S RIBOSOMAL PROTEIN L40; SWP:Q980V5; PDB:2AYJA; MPLTDPAKLQIVQQRVFLKKVCRKCGALNPIRATKCRRCHSTNLRLKKKELPTKKG -----------------------------3333----------------------- >U1 SMALL NUCLEAR RIBONUCL; SWP:P43332; PDB:2AYMA; AQTEQPPNQILFLTNLPEETNEMMLSMLFNQFPGFKEVRLVPNRHDIAFVEFTTELQSNA --------------------3333------------------------------------ AKEALQGFKITPTHAMKITFAKK ----2222--------------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P54578; PDB:2AYNA; MELPCGLTNLGNTCYMNATVQCIRSVPELKDALKRYAGALASAQYITAALRDLFDSMDKT ------------3333-------------3333---------3333-------------- SSSIPPIILLQFLHMAFPQFAEKGEQGQYLQQDANECWIQMMRVLQQKLEAIEDDKSLID ----------------3333---------------------------------------- QFFGVEFETTMKCTESEEEEVTKGKENQLQLSCFINQEVKYLFTGLKLRLQEEITKQSPT ------------------------------------------------------------ LQRNALYIKSSKISRLPAYLTIQMVRFFYKEKESVNAKVLKDVKFPLMLDMYELCTPELQ -------------------------------------------------------33333 EKMVSFRSKFKDLEDEPFSFADDIGSNNCGYYDLQAVLTHQGRSSSSGHYVSWVKRKQDE 333---1111---------------------------------1111---------2222 WIKFDDDKVSIVTPEDILRLSGGGDWHIAYVLLYGPR ------------3333--------------------- >GLUTAREDOXIN-LIKE PROTEIN; SWP:O66753; PDB:2AYTA; MLLNLDVRMQLKELAQKEFKEPVSIKLFSQAIGCESCQTAEELLKETVEVIGEAVGQDKI ---------------------------------1111------------------1111- KLDIYSPFTHKEETEKYGVDRVPTIVIEGDKDYGIRYIGLPAGLEFTTLINGIFHVSQRK -----3333--------------------------------!!!!----------1111- PQLSEKTLELLQVVDIPIEIWVFVTTSCGYCPSAAVMAWDFALANDYITSKVIDASENQD ---------3333-----------1111----------------1111------3333-- 
LAEQFQVVGVPKIVINKGVAEFVGAQPENAFLGYIMAVYEKLKREKEQAL --1111--------%%%%--------3333-------------------- >NUCLEOSOME ASSEMBLY PROTE; SWP:P25293; PDB:2AYUA; IQDRLGSLVGQDSGYVGGLPKNVKEKLLSLKTLQSELFEVEKEFQVEMFELENKFLQKYK ---------------1111-----------------------------------3333-- PIWEQRSRIISGQEQPKPEQIAKGQEIVESLNETELLVDEEEQVKGIPSFWLTALENLPI ----------------------------11113333-----------------------3 VCDTITDRDAEVLEYLQDIGLEYLTDGRPGFKLLFRFDSSANPFFTNDILCKTYFYQKEL 333--3333---1111---------------------3333------------------- GYSGDFIYDHAEGCEISWKDNAHNVTVDLEMRKQTIEKITPIESFFNFFDPPKIQNEDQD 3333---------------33331111----------------1111------------- EELEEDLEERLALDYSIGEQLKDKLIPRAVDWFTGAALEFEFE ------------------------3333-3333-----3333- >UBIQUITIN-CONJUGATING ENZ; SWP:NA; PDB:2AYVA; QGLKRINKELNDLSKDPPTNCSAGPVGDDMFHWQATIMGPEDSPYSGGVFFLNIHFPSDY -----------------2222-------1111----------1111----------1111 PFKPPKVNFTTKIYHPNINSQGAICLDILKDQWSPALTISKVLLSISSLLTDPNPDDPLV --------------11111111---333311111111----------------1111--- PEIAHLYKSDRMRYDQTAREWSQKYA -------------------------- >SENSOR KINASE PROTEIN RCS; SWP:P14376; PDB:2AYXA; MGGSGVEGLSGKRCWLAVRNASLCQFLETSLQRSGIVVTTYEGQEPTPEDVLITDEVVSK -------------------3333-----------------------1111---------- KWQGRAVVTFCRRHIGIPLEKAPGEWVHSVAAPHELPALLARIYLIEMESDDPANALPST ------------------1111-------------------------------------- DKAVSDNDDMMILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNHIDIVLSD -----------------------------------------33333333----------- VNMPNMDGYRLTQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVIKQT ------------------------------------------------------------ LTLYAERVRKSRDS -------------- >B2 PROTEIN; SWP:P68831; PDB:2AZ0A; PSKLALIQELPDRIQTAVEAAMGMSYQDAPNNVRRDLDNLHACLNKAKLTVSRMVTSLLE -------------------1111--1111------------------------------- KPSVVAYLEG ---------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:P61136; PDB:2AZ3A; HDERTFVMVKPDGVQRGLIGDIVTRLETKGLKMVGGKFMRIDEELAHEHYAEHEDKPFFD -------------1111---------3333-------------------3333--1111- 
GLVSFITSGPVFAMVWEGADATRQVRQLMGATDAQDAAPGTIRGDYGNDLGHNLIHGSDH ----1111------------------------3333----------------------11 EDEGANEREIALFFDDDELVDWDRDASAWVYE 11---3333-----1111-----3333----- >HYPOTHETICAL PROTEIN EF29; SWP:Q82ZZ3; PDB:2AZ4A; AKTTVTFHSGILTIGGTVIEVAYKDAHIFFDFGTEFRPELDLPDDHIETLINNRLVPELK ----------------------!!!!----------1111----------1111------ DLYDPRLGYEYHGAEDKDYQHTAVFLSHAHLDHSRMINYLDPAVPLYTLKETKMILNSLN ---3333----------------------1111--3333-3333----3333-----111 RKGDFLIPSPFEEKNFTREMIGLNKNDVIKVGEISVEIVPVDHDAYGASALLIRTPDHFI 1-------11111111-------2222---!!!!----------2222------1111-- TYTGDLRLHGHNREETLAFCEKAKHTELLMMEGVSISFPEREPDPAQIAVVSEEDLVQHL -----------3333------------------1111------1111------------- VRLELENPNRQITFNGYPANVERFAKIIEKSPRTVVLEANMAALLLEVFGIEVRYYYAES ----------------3333---------------------------------------- GKIPELNPALEIPYDTLLKDKTDYLWQVVNQFDNLQEGSLYIHSDAQPLGDFDPQYRVFL --33331111------1111----------3333-2222----------3333------- DLLAKKDITFVRLACSGHAIPEDLDKIIALIEPQVLVPIHTLKPEKLENPYGERILPERG -------------------3333-------------------3333--1111-----222 EQIVL 2---- >AZURIN; SWP:P00280; PDB:2AZAA; AQCEATIESNDAMQYDLKEMVVDKSCKQFTVHLKHVGKMAKSAMGHNWVLTKEADKEGVA ---------1111---------1111-------------3333--------3333----- TDGMNAGLAQDYVKAGDTRVIAHTKVIGGGESDSVTFDVSKLTPGEAYAYFCSFPGHWAM ------3333------1111-------2222------1111-2222-------2222--- MKGTLKLSN --------- >PROTEASE RETROPEPSIN; SWP:P03367; PDB:2AZCA; PQITLWKRPLVTIKIGGQLKEALIDTGADDTVLEEMNLPGRWKPKIIGGIGGLIKVRQYD ------------------------1111-------------------------------- QIPIEICGHKAIGTVLIGPTPANIIGRNLLTQIGCTLNF -----iiii-------------------3333------- >TRANSCRIPTION FACTOR DP-1; SWP:Q14186; PDB:2AZEA; FAQECQNLEVERQRRLERIKQKQSQLQELILQQIAFKNLVQRNRHAEQQASRPPPPNSVI --------------------------------------------1111------3333-- HLPFIIVNTSKKTVIDCSISNDKFEYLFNFDNTFEIHDDIEVLKRMGMACGLESGSCSAE ---------3333-------------------------------------3333------ DLKMARSLVPKALEPYVTEMAQGTVGGVF 
----1111-3333------1111------ >Transcription factor E2F1; SWP:Q01094; PDB:2AZEB; GRLEGLTQDLRQLQESEQQLDHLMNICTTQLRLLSEDTDSQRLAYVTCQDLRSIADPAEQ ----------------------------------------1111--3333-1111----- MVMVIKAPPETQLQAVDSSENFQISLKSKQGPIDVFLCPEE ----------------------------------------- -------------------------------------------- >GERANYLGERANYL PYROPHOSPH; SWP:Q97W92; PDB:2AZKA; MSIIEFWLEAKATIDRLIEQFLNSNRDWDLVDISSYILKDGKRFRGTLNMFFTVALGGDI 3333---------------------------33331111--------------1111-33 KDSYGGALAIEILHSASLALDDIVDLDATRRGDKAAWVVYGNRKVIFITNYLIPTALRII 33---------------------------%%%%-3333---------------------- QTSYGDDALNTSIELEKDTSVGALRDMYDNSDYIRTIELKTGSLFKLSTVLSAYASKHYN ---------------------------------------------------------333 TKQQMLDVGKYLGIIYQVIDDFVDYKTKKVEEIDGSAKQLFKYYREGKLEEYVRSVYLEY 3---------------------------3333--3333-33331111------------- KQKYDELISNIPFQSKYLSEIRSLPEFLANGLLKEA ------3333---3333---1111------------ >Putative 5-amino-6-(5-pho; SWP:Q58085; PDB:2AZNA; EKKPYIISNVGTLDGKLATINNDSRISCEEDLIRVHKIRANVDGIVGIGTVLKDDPRLTV -----------1111---1111----------------1111------------------ HKIKSDRNPVRIVVDSKLRVPLNARVLNKDAKTIIATTEDTNEEKEKKIKILEDGVEVVK --------------1111--11111111-------------------------------- CGRGKVDLKKLDILYDKGIKSILLEGGGTLNWGFKEGLVDEVSVYIAPKIFGGKEAPTYV -------3333---1111---------------1111---------------1111---- DGEGFKTVDECVKLELKNFYRLGEGIVLEFKVKK ------3333-----------!!!!--------- >HYPOTHETICAL PROTEIN PA12; SWP:NA; PDB:2AZPA; MQRIRIIDSHTGGEPTRLVIGGFPDLGQGDMAERRRLLGERHDAWRAACILEPRGSDVLV ----------iiii---------------------------3333----------3333- GALLCAPVDPEACAGVIFFNNSGYLGMCGHGTIGLVASLAHLGRIGPGVHRIETPVGEVE --------1111-------1111------------------------------1111--- ATLHEDGSVSVRNVPAYRYRRQVSVEVPGIGRVSGDIAWGGNWFFLVAGHGQRLAGDNLD ---1111-----------------------------------------------1111-- ALTAYTVAVQQALDDQDIRGEDGGAIDHIELFADDPHADSRNFVLCPGKAYDRSPCGTGT -------------1111--1111----------------------1111----------- 
SAKLACLAADGKLLPGQPWRQASVIGSQFEGRYEWLDGQPGGPIVPTIRGRAHVSAEATL -------1111--2222-----1111---------%%%%--------------------- LLADDDPFAWGIRRGS --1111---------- >CATECHOL 1,2-DIOXYGENASE; SWP:Q51433; PDB:2AZQA; VKISHTADIQAFFNQVAGLDHAEGKPRFKQIILRVLQDTARLIEDLEITEDEFWHAVDYL --11113333-------------------------------------------------- NRLGGRNEAGLLAAGLGIEHFLDLLQDAKDAEAGLGGGTPRTIEGPLYVAGAPLAQGEVR ------------------3333-------------------------------------- MDDGTDPGVVMFLQGQVFDANGKPLAGATVDLWHANTQGTYSYFDSTQSEFNLRRRIITD ------------------3333-------------1111-----3333-----------1 AEGRYRARSIVPSGYGCDPQGPTQECLDLLGRHGQRPAHVHFFISAFGHRHLTTQINFAG 111------------------------1111--------------2222--------222 DKYLWDDFAYATRDGLIGELRFVEDAAAARDRGVQGERFAELSFDFRLQGAQSPDAEARS 2-----1111--2222------------------------------------3333---- HRPRALQEG ---2222-- >MUTT/NUDIX FAMILY PROTEIN; SWP:Q836H1; PDB:2AZWA; KTPTFGKREETLTYQTRYAAYIIVSKPENNTMVLVQAPNGAYFLPGGEIEGTETKEEAIH --------3333--------------1111------3333---------!!!!------- REVLEELGISVEIGCYLGEADEYFYSNHRQTAYYNPGYFYVANTWRQLSEPLRTNTLHWV --------------------------3333------------------------------ APEEAVRLLKRGSHRWAVEKWLAAAS --------------------1111-- >ARYL HYDROCARBON RECEPTOR; SWP:P27540; PDB:2B02A; SNVSQPTEFISRHNIEGIFTFVDHRCVATVGYQPQELLGKNIVEFCHPEDQQLLRDSFQQ -------------1111---------------333322223333--3333---------- VVKLKGQVLSVFRFRSKNQEWLWRTSSFTFQNPYSDEIEYIICTNTNV ---2222--------1111----------------------------- >14-3-3 PROTEIN GAMMA; SWP:P61981; PDB:2B05A; VDREQLVQKARLAEQAERYDDMAAAMKNVTELNEPLSNEERNLLSVAYKNVVGARRSSWR -------------1111------------------------------------------- VISSIEQKTSADGNEKKIEMVRAYREKIEKELEAVCQDVLSLLDNYLIKNCSETQYESKV ---------3333---------------------------------3333-1111----- FYLKMKGDYYRYLAEVATGEKRATVVESSEKAYSEAHEISKEHMQPTHPIRLGLALNYSV --------------------------------------------1111------------ FYYEIQNAPEQACHLAKTAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWT -------------------------3333-1111------------------- >MUTT/NUDIX FAMILY 
PROTEIN; SWP:Q97QH6; PDB:2B06A; MSRSQLTILTNICLIEDLETQRVVMQYRAWSGYAFPGGHVENDEAFAESVIREIYEETGL -1111-----------------------------------1111---------------- TIQNPQLVGIKNWPLDTGGRYIVICYKATEFSGTLQSSEEGEVSWVQKDQIPNLNLAYDM -------------------------------------1111-----11111111--2222 LPLMEMMEAPDKSEFFYPRRTEDDWEKKIF --------1111--------1111------ >HYPOTHETICAL PROTEIN MJ07; SWP:Q58193; PDB:2B0AA; EILDLTQTLINFPRPGDPELRIIEKKIDGFIVSEIIMGSHLCTHIDYPKHVGLENRIPFK -------------2222---------iiii-------1111------3333-------22 DGIIKGKGYCISLDDFPGNKLPACDILLIYTGFSKYWGRDEYFEKIPEIPFLDDIIKSNI 22---------3333------------------1111-3333------------1111-- KCVGIDACTIGGFEEHKRLLSNNILIIENLNENLKNLVGKSFYFLGLPLKIFDIDASPIR -------------------1111---------33332222-------------------- CIAILE ------ >PUTATIVE PHOSPHATASE; SWP:NA; PDB:2B0CA; AKMLYIFDLGNVIVDIDFNRVLGAWSDLTRIPLASLKKSFHMGEAFHQHERGEISDEAFA --------2222------------------------1111--------1111-------- EALCHEMALPLSYEQFSHGWQAVFVALRPEVIAIMHKLREQGHRVVVLSNTNRLHTTFWP --------------------------------------1111-----------1111-33 EEYPEIRDAADHIYLSQDLGMRKPEARIYQHVLQAEGFSPSDTVFFDDNADNIEGANQLG 33----1111----3333--------------------3333-------------3333- ITSILVKDKTTIPDYFAKV -------1111-------- >PICORNAIN 3C (PROTEASE 3C; SWP:P03303; PDB:2B0FA; GPNTEFALSLLRKNIMTITTSKGEFTGLGIHDRVCVIPTHAQPGDDVLVNGQKIRVKDKY -------------------1111--------------3333-------iiii-------- KLVDPENINLELTVLTLDRNEKFRDIRGFISEDLEGVDATLVVHSNNFTNTILEVGPVTM ---1111------------------3333---------------3333------------ AGLINLSSTPTNRMIRYDYATKTGQCGGVLCATGKIFGIHVGGNGRQGFSAQLKKQYFVE -----%%%%----------------------2222--------!!!!------3333333 KQ 3- >TALIN-1; SWP:P26039; PDB:2B0HA; GIDPFTMGDPEGSFVDYQTTMVRTAKAIAVTVQEMVTKSNTSPEELGPLANQLTSDYGRL --------1111------------------------3333---1111------------- ASQAKPAAVAAENEEIGAHIKHRVQELGHGCSALVTKAGALQCSPSDVYTKKELIECARR ----3333-----------------------------------1111------------- VSEKVSHVLAALQAGNR ------------3333- >5,10-METHENYLTETRAHYDROME; SWP:Q58194; 
PDB:2B0JA; MKIAILGAGCYRTHAAAGITNFMRACEVAKEVGKPEIALTHSSITYGAELLHLVPDVKEV --------------3333---------------3333---!!!!---------3333--- IVSDPCFAEEPGLVVIDEFDPKEVMEAHLSGNPESIMPKIREVVKAKAKELPKPPKACIH -----1111------------------33331111-----------3333---------- LVHPEDVGLKVTSDDREAVEGADIVITWLPKGNKQPDIIKKFADAIPEGAIVTHACTIPT --3333------------2222------3333---------1111-2222---------- TKFAKIFKDLGREDLNITSYHPGCVPEMKGQVYIAEGYASEEAVNKLYEIGKIARGKAFK -----------3333---------3333-------------------------------- MPANLIGPVCDMCSAVTATVYAGLLAYRDAVTKILGAPADFAQMMADEALTQIHNLMKEK -1111-3333----------------------1111------------------------ GIANMEEALDPAALLGTADSMCFGPLAEILPTALKVLEKHKVVE 33333333-333311113333-!!!!-----------1111--- >GTP-sensing transcription; SWP:P39779; PDB:2B0LA; HHHHHHMSKAVVQMAISSLSYSELEAIEHIFEELDGNEGLLVASKIADRVGITRSVIVNA ---------------1111----------------------------------------- LRKLESAGVIESRSLGMKGTYIKVLNNKFLIELENLKS ----1111-----------------3333--------- >Development and different; SWP:Q8TDY4; PDB:2B0OE; DLTKLLIAEVKSRPGNSQCCDCGAADPTWLSTNLGVLTCIQCSGVHRELGVRFSRMQSLT ------------2222---------------1111--------------3333------- LDLLGPSELLLALNMGNTSFNEVMEAQLPSHGGPKPSAESDMGTRRDYIMAKYVEHRFAR ----3333-3333-----------1111--------1111---------------1111- RCTEPQRLWTAICNRDLLSVLEAFANGQDFGQPLPGPDAQAPEELVLHLAVKVANQASLP ---3333----1111--------1111---------------------------3333-- LVDFIIQNGGHLDAKAADGNTALHYAALYNQPDCLKLLLKGRALVGTVNEAGETALDIAR ----------1111-1111-----------------------------1111-------- KKHHKECEELLEQAQAGTFAFPLH ----------------1111---- >POSSIBLE ADENYL CYCLASE-A; SWP:Q5CS32; PDB:2B0RA; RQVVTNGSPKVELQKDTYLVENHVNCADPITLSEGSIKNKVSVRCSQNSRIIVEQKVNSI -----------------------------------1111--------------------- FIENCVGCIFLVNGVISSIEIVNCDDIKLQMTGIVPTISLDKSNKVNIYTSKEGKNVEVY --------------------------------------------------3333------ SSKSSEMNLLFPGEEEGDWKELAIPEQFVTKYNESKGKLESMVS --------------2222-------------------------- >Envelope glycoprotein gp1; SWP:P05877; PDB:2B0SH; 
EIQLEQSGAEVKKSGESLKISCQTSGYSFSDYWIGWVRQMPGKGLEWMGIFYPGDSDSRY ------------2222-----------3333--------2222----------------- SPSFEGQVTMSADRSTNTAHLQWSSL 3333--------1111---------- >Envelope glycoprotein gp1; SWP:P05877; PDB:2B0SL; QSVLTQPPSASGTPGQRISISCSGTSSN ------------2222------------ >NADP ISOCITRATE DEHYDROGE; SWP:P50216; PDB:2B0TA; AKIIWTRTDEAPLLATYSLKPVVEAFAATAGIEVETRDISLAGRILAQFPERLTEDQKVG ---------------------------1111-----------------1111-3333--- NALAELGELAKTPEANIIKLPNISASVPQLKAAIKELQDQGYDIPELPDNATTDEEKDIL -----------3333----------------------1111------------------- ARYNAVKGSAVNPVLREGNSDRRAPIAVKNFVKKFPHRMGEWSADSKTNVATMDANDFRH ----------3333----------3333--------------1111----------3333 NEKSIILDAADEVQIKHIAADGTETILKDSLKLLEGEVLDGTVLSAKALDAFLLEQVARA ------------------1111-----------2222----------------------- KAEGILFSAHLKATMMKVSDPIIFGHVVRAYFADVFAQYGEQLLAAGLNGENGLAAILSG --------------------------------3333-------------1111-----33 LESLDNGEEIKAAFEKGLEDGPDLAMVNSARGITNLHVPSDVIVDASMPAMIRTSGHMWN 33---3333------------------3333--11111111-3333-------%%%%--1 KDDQEQDTLAIIPDSSYAGVYQTVIEDCRKNGAFDPTTMGTVPNVGLMAQKAEEYGSHDK 111-------------3333--------------1111----------%%%%----1111 TFRIEADGVVQVVSSNGDVLIEHDVEANDIWRACQVKDAPIQDWVKLAVTRSRLSGMPAV -------------1111--------2222------------------------------- FWLDPERAHDRNLASLVEKYLADHDTEGLDIQILSPVEATQLSIDRIRRGEDTISVTGNV ---1111------------------2222------------------------------- LRDYNTDLFPILELGTSAKMLSVVPLMAGGGLFETGAGGSAPKHVQQVQEENHLRWDSLG -------------------------1111-----------3333-------------333 EFLALAESFRHELNNNGNTKAGVLADALDKATEKLLNEEKSPSRKVGEIDNRGSHFWLTK 3----------------3333--------------1111--------------------- FWADELAAQTEDADLAATFAPVAEALNTGAADIDAALLAVQGGATDLGGYYSPNEEKLTN -------------------------------------3333------------------- IMRPVAQFNEIVDAL ----3333---3333 >NUDIX HYDROLASE; SWP:Q82XR9; PDB:2B0VA; KPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQACSREVLEETGHSFL -----------%%%%--------------------------------------------- 
PEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKLDTGIVRAAWFSIDEIRAKQAHRT ------------1111---------------1111--2222------------------3 PLVQCIEDYHAGKRYPLDILQYYDGS 333------------1111------- >GTP-sensing transcription; SWP:P39779; PDB:2B18A; MALLQKTRIINSMLQAAAGKPVNFKEMAETLRDVIDSNIFVVSRRGKLLGYSINQQIEND ------------------------------------------1111-------------- RMKKMLEDRQFPEEYTKNLFNVPETSSNLDINSEYTAFPVENRDLFQAGLTTIVPIIGGG -----------------------------1111-----33333333-------------- ERLGTLILSRLQDQFNDDDLILAEYGATVVGMEIL ----------------------------------- >EXOCYST COMPLEX COMPONENT; SWP:P19658; PDB:2B1EA; TLNSVASVKDLANEASKYEIILQKGINQVGLKQYTQVVHKLDDMLEDIQSREENSEFHGI -3333--------------------3333------------------------------- LTHLEQLIKRSEAQLRVYFISILNSIKPFDPQINITKKMPFPYYEDQQLGALSWILDYFH ---------------------1111----3333--------------------------- GNSEGSIIQDILVGERSKLILKCMAFLEPFAKGSSGMNSYTEALLGFIANEKSLVDDLYS ----------------------3333--------3333---------------------1 QYTESKPHVLSQILSPLISAYAKLFGANLKIVRFGFFSFELVESINDVKKSLRGKELQNY 111----------------------------------------------1111---1111 NLLQDCTQEVRQVTQSLFRDAIDRIIKKANSISTIPSNNGVTEATVDTMSRLRKFSEYKN ---------------------------1111----1111--------------------- GCLGAMDNITRENWLPSNYKEKEYTLQNWEDHNVLLSCFISDCIDTLAVNLERKAQIALM ---------1111------3333------------------------------------1 PNQEPDVANPNSSKNKHKQRIGFFILMNLTLVEQIVEKSELNLMLAGEGHSRLERLKKRY 111-----1111------------------------------1111-------------- ISYMVSDWRDLTANLMDSVFIDSSGKKSKDKEQIKEKFRKFNEGFEDLVSKTKQYKLSDP ------------------3333-------------------------------------- SLKVTLKSEIISLVMPMYERFYSRYKDSFKNPRKHIKYTPDELTTVLNQLVR ------------------------1111--3333------------------ >GENERAL CONTROL PROTEIN G; SWP:P03069; PDB:2B1FA; MKVKQLEDAVEELLSANYHLENAVARLKKLVG -3333--------------------------- >THIOL:DISULFIDE INTERCHAN; SWP:P0AA86; PDB:2B1KA; LESALIGKPVPKFRLESLDNPGQFYQADVLTQGKPVLLNVWATWCPTCRAEHQYLNQLSA ----2222--------1111-------1111----------1111-------------11 
QGIRVVGMNYKDDRQKAISWLKELGNPYALSLFDGDGMLGLDLGVYGAPETFLIDGNGII 11----------------------------------------------------1111-- RYRHAGDLNPRVWEEEIKPLWEKYSKEAA ----------------------------- >SPE31; SWP:Q3Y6U7; PDB:2B1MA; APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVDES -----1111-----------------------------------------------3333 EGCYNGWHYQSFEWVVKHGGIASEADYPYKARDGKCKANEIQDKVTIDNYGVQILSNEST !!!!-----------1111---3333----------3333-------------------- ESEAESSLQSFVLEQPISVSIDAKDFHFYSGGIYDGGNCSSPYGINHFVLIVGYGSEDGV ----------------------1111---------!!!!-----------------iiii DYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGMNYFASYPIIEK ---------1111-iiii--------1111%%%%----------- >MITOGEN-ACTIVATED PROTEIN; SWP:P53779; PDB:2B1PA; NQFYSVEVGDSTFTVLKRYQNLKPIGSGGIVCAAYDAVLDRNVAIKKLSRPFQNQTHAKR -------!!!!--------------------------------------1111------- AYRELVLMKCVNHKNIISLLNVFTPQKTLEEFQDVYLVMELMDANLCQVIQMELDHERMS ------------1111-----------3333-----------------------3333-- YLLYQMLCGIKHLHSAGIIHRDLKPSNIVVKSDCTLKILDFGLTRYYRAPEVILGMGYKE -------------1111------3333---1111----------1111---1111---11 NVDIWSVGCIMGEMVRHKILFPGRDYIDQWNKVIEQLGTPCPEFMKKLQPTVRNYVENRP 11--------------------------------------33331111-------1111- KYAGLTFPKLFPDSLFPADSEHNKLKASQARDLLSKMLVIDPAKRISVDDALQHPYINVW -----3333--3333-------------------------3333--33331111--3333 YDPAEVEAPPPDEREHTIEEWKELIYKEVMN -3333-------------------------- >CALMODULIN-LIKE PROTEIN 5; SWP:Q9NZT1; PDB:2B1UA; ARAGLEDLQVAFRAFDQDGDGHITVDELRRAMAGLGQPLPQEELDAMIREADVDQDGRVN -----------3333---------------3333-------------------------3 YEEFARMLAQE 333-------- >NAPHTHALENE DIOXYGENASE L; SWP:Q9X3R9; PDB:2B1XA; MLSNELRQTLQKGLHDVNSDWTVPAAIINDPEVHDVERERIFGHAWVFLAHESEIPERGD -----------------------3333-----------------------3333--2222 YVVRYISEDQFIVCRDEGGEIRGHLNACRHRGMQVCRAEMGNTSHFRCPYHGWTYSNTGS -----!!!!------1111------------------------------------3333- LVGVPAGKDAYGNQLKKSDWNLRPMPNLASYKGLIFGSLDPHADSLEDYLGDLKFYLDIV 
-----3333-%%%%-3333-----------iiii-----1111------!!!!------- LDRSDAGLQVVGAPQRWVIDANWKLGADNFVGDAYHTMMTHRSMVELGLAPPDPQFALYG ---1111--------------3333--------3333-1111----------1111---- EHIHTGHGHGLGIIGPPPGMPLPEFMGLPENIVEELERRLTPEQVEIFRPTAFIHGTVFP -----iiii--------------%%%%--------------------1111--------- NLSIGNFLMGKDHLSAPTAFLTLRLWHPLGPDKMEVMSFFLVEKDAPDWFKDESYKSYLR -----------1111---------------------------1111-------------- TFGISGGFEQDDAENWRSITRVMGGQFAKTGELNYQMGRGVLEPDPNWTGPGEAYPLDYA --1111--------------1111-3333------2222-----1111------------ EANQRNFLEYWMQLMLAESPL --------------------- >Iron-sulfur protein; SWP:Q9WVZ0; PDB:2B1XB; RVSDTTVREITEWLYMEAELLDAGKYREWLALVTEDLSYVVPIRVTREREAVTDVVEGMT --------------------1111-----11111111----------3333--------- HMDDDADSMEMRVLRLETEYAWAEDPPSRSRHFVTNVRVATGDSEDEFKVTSNLLLYRTR --------------1111--3333------------------------------------ GDVATYDVLSGERTDVLRRAGDSFLMAKRVVLLDQTTIMTHNLALIM -------------------!!!!------------------------ >HYPOTHETICAL PROTEIN ATU1; SWP:Q8UE48; PDB:2B1YA; PNFRYTHYDLKELRAGTTLEISLSSVNNVRLTGANFQRFTELLDFKYLGGVAKKSPIRIA -------------2222-----------------------3333---------------- VPETHWHLIIDAEGHSGLAESSVKLPAQPQATLTRKAS -----------2222----------------------- >ENTEROCHELIN ESTERASE; SWP:Q83SB9; PDB:2B20A; ALKVGSESWWQSKHGPEWQRLNDEFEVTFWWRDPQGSEEYSTIKRVWVYITGVTDHHQQP --2222---1111-----------------------3333------------1111---- QSQRIAGTDVWQWTTQLNANWRGSYCFIPTERDDIFSAPSPDRLELREGWRKLLPQAIAD ----2222---------1111-------------------------------3333---1 PLNPQSWKGGLGHAVSALEPQAPLQPGWDCPQAPEIPAKEIIWKSERLKNSRRVWIFTTG 111-----1111--------------3333---------------1111----------- DVTERPLAVLLDGEFWAQSPVWPVLTSLTHRQQLPPAVYVLIDAIDTTHRAHELPCNADF ---------------------3333-------------------------------3333 WLAVQQELLPLVKVIAPFSDRADRTVVAGQSFGGLSALYAGLHWPERFGCVLSQSGSYWW --------------------3333--------------------------------3333 PHRGGQQEGVLLEKLKAGEVSAEGLRIVLEAGIREPIRANQALYAQLHPIKESIFWRQVD 
-----------------------------------------------1111--------- GGHDALCWRGGLQGLIDLWQPLFH ---3333------------1111- ----------------------------- >HYPOTHETICAL PROTEIN; SWP:Q9H0Q9; PDB:2B25A; RPFQAGELILAETTKFKKLFRLNNFGLLNVPFGKIVGKFPGQILRSSFGKQYMLRRPALE ---2222-----------------------333322222222---1111--------333 DYVVLMKRGTAITFPKDINMILSMMDINPGDTVLEAGSGSGGMSLFLSKAVGSQGRVISF 3--------------------------2222------!!!!----------1111----- EVRKDHHDLAKKNYKHWRDSWKLSHVEEWPDNVDFIHKDISGATFDAVALDMLNPHVTLP --------------------------------------3333-----------3333333 VFYPHLKHGGVCAVYVVNITQVIELLDGIRTCELALSCEKISEVIVRDWLVCLVARPVHW 33333-2222-------3333--------------------------------------- QPGHTAFLVKLRKV -------------- >TELOMERASE REVERSE TRANSC; SWP:O77448; PDB:2B2AA; MLTRKEDLLTVLKQISALKYVSNLYEFLLATEKIVQTSELDTQFQEFLTTTIIASEQNLV ---3333-------3333----------------1111--------------------33 ENYKQMTIKQVIDDSIILLGNKQNYVQQIGTTTIGFYVEYRQTLYSSNFRNLLNIFGEED 33----------------!!!!--3333-----!!!!---3333---------------- FKYFLIDFLVFTKVEQNGYLQVAGVCLNQYFSVQVKQKKWYKNN -------------------------3333-3333---------- >SPERMIDINE SYNTHASE; SWP:Q9U2F0; PDB:2B2CA; KLHKGWFTEFSPDDLGAWPGQAFSLQVKKVLFHEKSKYQDVLVFESTTYGNVLVLDGIVQ ------------------------------------------------------iiii-- ATERDEFSYQEMLAHLPMFAHPDPKRVLIIGGGDGGILREVLKHESVEKVTMCEIDEMVI -3333--------------------------3333--------3333------------- DVAKKFLPGMSCGFSHPKLDLFCGDGFEFLKNHKNEFDVIITDSSYYELLRDALKEDGIL ----------1111-1111-------------------------1111------1111-- SSQGESVWLHLPLIAHLVAFNRKIFPAVTYAQSIVSTYPSGSMGYLICAKNANRDVTTPA -----3333-------------------------1111------------11111111-- RTLTAEQIKALNLRFYNSEVHKAAFVLPQFVKNALE --------1111----------1111---------- >AMMONIUM TRANSPORTER; SWP:O29285; PDB:2B2HA; MSDGNVAWILASTALVMLMVPGVGFFYAGMVRRKNAVNMIALSFISLIITVLLWIFYGYS -------------------------------1111------------------------- VSFGNDISGIIGGLNYALLSGVKGEDLLFMMYQMMFAAVTIAILTSAIAERAKVSSFILL ------iiii---1111-2222!!!!-----------------33332222--------- 
SALWLTFVYAPFAHWLWGGGWLAKLGALDFAGGMVVHISSGFAALAVAMTIGKRAGFEEY -----------------------------------------------------2222--- SIEPHSIPLTLIGAALLWFGWFGFNGGSALAANDVAINAVVVTNTSAAVAGFVWMVIGWI ------------------------1111-------------------------------- KGKPGSLGIVSGAIAGLAAITPAAGFVDVKGAIVIGLVAGIVCYLAMDFRIKKKIDESLD ----------------------1111---------------------------------- AWAIHGIGGLWGSVAVGILANPEVNGYAGLLFGNPQLLVSQLIAVASTTAYAFLVTLILA --------------------3333----3333-3333----------------------- KAVDAAVGLRVSSQEEYVGLDLSQHEEVAYT ------------------------------- >TRANSCRIPTION-REPAIR COUP; SWP:P30958; PDB:2B2NA; ACATLVAEIAERHAGPVVLIAPDMQNALRLHDEISQFTDQMVMNLADWETLPYDSFSPHQ ----------------------------------1111--------------------33 DIISSRLSTLYQLPTMQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQL 33---------3333--------3333------------------2222----------- DSAGYRHVDQVMEHGEYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQRTLEE 1111--------2222---!!!!----2222--------%%%%--------1111----- VEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRDPEHIYQQVSKGTLPAGIEYWQPLFF ----------------------------------1111----1111--2222--3333-- SEPLPPLFSYFPANTLLVNTGDLETSAERFQADTLARFENRGVDPMRPLLPPQSLWLRVD -----3333-----------------------------------------3333------ ELFSELKN ----1111 >Putative uncharacterized ; SWP:Q5XFY8; PDB:2B2XH; EVQLVESGGGLVQPGGSLRLSCAASGFTFSRYTMSWVRQAPGKGLEWVAVISGGGHTYYL ------------2222-----------3333--------2222--------1111----3 DSVEGRFTISRDNSKNTLYLQMNSLRAEDTAVYYCTRGFGDGGYFDVWGQGTLVTVSSAK 333--------3333----------3333---------!!!!------------------ TTPPSVYPLAPSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSSVT -------------------------------%%%%-------------%%%%-------- VPSSTWPSETVTCNVAHPASSTKVDKKIVP -1111------------1111--------- >Putative uncharacterized ; SWP:Q5XFY8; PDB:2B2XL; QIQLTQSPSSLSASVGDRVTITCSASSQVNHMFWYQQKPGKAPKPWIYLTSYLASGVPSR -------------2222--------------------2222------------2222333 FSGSGSGTDYTLTISSLQPEDFATYYCQQWSGNPWTFGQGTKVEIKRADAAPTVSIFPPS 
3----------------1111--------------------------------------3 SEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL 3331111---------------------iiii---------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFNR ----3333--------3333---------- >CHROMODOMAIN-HELICASE-DNA; SWP:O14646; PDB:2B2YA; EFETIERFMDCRIGRKGATGATTTIYAVEADGDPNAGFENKEPGEIQYLIKWKGWSHIHN --------------2222--------------1111---------------22223333- TWETEETLKQQNVRGMKKLDNYKKKDQETKRWLKNASPEDVEYYNCQQELTDDLHKQYQI --------1111--3333------------1111-------------------------- VGRIIAHSNQKSAAGYPDYYCKWQGLPYSECSWEDGALISKKFQACIDEYFSRKK -----------1111-------22221111----33333333------------- >PVIVAX HYPOTHETICAL PROTE; SWP:Q8I5F4; PDB:2B30A; KVEEALKGADIKLLLIDFDGTLFVDKDIKVPSENIDAIKEAIEKGYMVSICTGRSKVGIL 3333-2222--------------------------------------------------- SAFGEENLKKMNFYGMPGVYINGTIVYDQIGYTLLDETIETDVYAELISYLVEKNLVNQT --------------------iiii---1111------------------------1111- IFHRGESNYVTEDNKYADFLQKMYSENRSIIIRHNEMLKYRTMNKLMIVLDPSESKTVIG ---!!!!---1111-1111-------------3333------------------3333-- NLKQKFKNKLTIFTTYNGHAEVTKLGHDKYTGINYLLKHYNISNDQVLVVGDAENDIAML --------------1111-----2222---------------3333------1111---- SNFKYSFAVANATDSAKSHAKCVLPVSHREGAVAYLLKKVFDLK --------1111--------------3333---------1111- >PROTEIN SYNTHESIS INHIBIT; SWP:Q9WY58; PDB:2B33A; MKRFVETDKAPKAIGPYSQAVVVGNMMFVSGQIPIDPETGELVQGTIEEKTERVLENLKA ------1111------------!!!!---------------------------------- ILEAGGFSLKDVVKVTVFTTSMDYFQRVNEVYSRYFGDHRPARSFVAVAQLPRNVEIEIE --1111-3333---------3333--------------------------2222------ AIAVKEG ------- >MAR1 RIBONUCLEASE; SWP:Q20062; PDB:2B34A; ARINPTNSALFVCDLQEKFASNIKYFPEIITTSRRLIDAARILSIPTIVTEQYPKGLGHT ---1111--------1111----------------------------------------- VPTLKEGLAENTPIFDKTKFSMCIPPTEDTLKKVQNVILVGIEAHVCVLQTTYDLLERGL 33331111---------------3333-3333----------1111---------1111- NVHVVVDAVSSRSHTDRHFAFKQMEQAGAILTTSEATILGLVGGSDHPKFKEVQKLILTS 
----1111----------------1111---------------1111-----3333---- APDTGLVPLSKL ------------ >KALATA B8; SWP:NA; PDB:2B38A; CGETCLLGTCYTTGCTCNKYRVCTKDGSVLN ----1111---2222--1111---------- >C3; SWP:Q2UVX4; PDB:2B39A; TPNILRLESEETVVLEAHGGQGTIQVSVTVHDFPAKKQVLSNENTQLNSNNGYLSTVTIK ------------------------------------------------------------ IPASKELKSDKGHKFVTVVATFGNVQVEKVVLISLQSGYLFIQTDKTIYTPGSTVLYRVF ---------------------%%%%----------------------------------- TVDHKLLPVGQTVFITIETPDGIPVKRDSKSSQNQFGILTLSWNIPELVNMGVWKIKAYY ------------------------------------------------------------ EDSPQQVFSAEFEVKEYVLPSFEVQLEPEEKFYYIDDPDGLKVNIIARFLYGEQVDGTAF ------------------------------------------------------------ VIFGVQDGDRRISLTHSLTRVPINDGNGEAILKRQVLLNGVQPSRADALVGKSIYVSATV -------------3333------%%%%------1111-------3333------------ ILQSGSDMVEAERTGIPIVTSPYQIHFTKTPKFFKPAMPFDLMVYVTNPDGSPARHIPVV ----------------------------------2222---------1111--------- TQGSNVQSLTQDDGVAKLSINTQNKRDPLTITVRTKKDNIPEGRQATRTMQALPYNTQGN 1111------------------------------------1111-------------%%% SNNYLHLSVPRVELKPGETLNVNFHLRTDPGEQAKIRYYTYMIMNKGKLLKVGRQYREPG %-------------------------------------------%%%%---------222 QDLVVLPLTITSDFIPSFRLVAYYTLINAKGQREVVADSVWVDVKDSCMGTLVVKNGGKE 2---------1111-------------1111----------------------------- EKHHRPGQQITLKIEADQGARVGLVAVDKGVFVLNKKNKLTQRKIWDVVEKADIGCTPGS ---------------------------3333---3333--3333----1111-------- GRNYAGVFTDAGLTLKTSQGLETQQRADPQCPQSVQLMEKRMDKAGQYSSDLRKCCEDGM ---------------------------------!!!!------3333-!!!!----1111 RDNPMKFPCQRRAQFILQGDACVKAFLDCCEYITQLRQQHSRDGALDDDIIPEEDIISRS --1111-33333333-----------------------1111---------%%%%----- QFPESWLWTVIEDLKQADKNGISTKLMNVFLKDSITTWEILAVSLSDKKGICVADPYEVT -----------------1111--------------------------------------- VMQDFFIDLRLPYSVVRNEQVEIRAILYNYREAENLKVRVELLYNPAFCSLATAKKRHQQ --------------------------------------------3333------------ TITIPARSSVAVPYVIVPLKIGLHEVEVKAAVYNHFISDGVKKTLKVVPEGVRVNKTVAV 
------------------------------------------------------------ RTLNPEHLGQGGVQREEVPAADLSDQVPDTESETKILLQGTPVAQMTEDAIDGERLKHLI ---3333-------------------------------------3333-----3333--- QTPSGCGEQNMIGMTPTVIAVHYLDSTDQWEKFGLEKRQESLELIRKGYTQQLAFRQKSS ---------3333---------------3333-3333-------------3333------ AYAAFQYRPPSTWLTAYVVKVFALAANLIAIDSKDLCETVKWLILEKQKPDGIFQEDGPV ------------------------1111--------------------1111-------- IHQEMIGGFRDTREKDVSLTAFVLIALHEAKDICEAQVNSLGRSIAKAGDFLENHYRELR ---------------------------1111------------------------3333- RPYTVAIAAYALALLGKLEGDRLTKFLNTAKEKNRWEEPNQKLYNVEATSYALLALLARK ------------1111----------3333%%%%-------3333--------------- DYDTTPPVVRWLNEQRYYGGGYGSTQATFMVFQALAQYQKDVPDHKELNLDVSIQLPSRN -3333------1111--%%%%----------------------3333------------- SAVRHRILWESASLLRSEETKENERFTVKAEGKGQGTLSVVTVYHAKLKGKVSCKKFDLR ------------------------------------------------------------ VSIRPAPETVKKPQDAKGSMILDICTKYLGDQDATMSILDISMMTGFSPDVEDLKTLSTG ------------3333---------------------------------------1111- VDRYISKYEMNRDSNKNTLIIYLDKVSHTVEDCLSFKVHQYFNVGLIQPGAVKVYSYYNL -------------------------------------------------------1111- DETCIRFYHPDKEDGMLSKLCHKDTCRCAEENCFTLEDRLDKACEPGVDYVYKTRLIQKK ----------33331111-------33333333---1111-------------------- LEDDFDEYIMVIENIIKSGSDEVQVKQERKFISHIKCREALKLKEGAHYLVWGVSSDLWG ------------------2222-----------3333-3333-----------3333--- EKPKISYIIGKDTWVELWPEAEECQDEENQKQCEDLANFTENMVVFGCPN ---------3333-------3333----3333------------------ >GLUCOSE-BINDING PROTEIN; SWP:Q72KX2; PDB:2B3FA; MKLEIFSWWAGDEGPALEALIRLYKQKYPGVEVINATVTGGAGVNARAVLKTRMLGGDPP ---------!!!!--------------3333--------2222----------------- DTFQVHAGMELIGTWVVANRMEDLSALFRQEGWLQAFPKGLIDLISYKGGIWSVPVNIHR --------------3333-------------3333-------1111iiii---------- SNVMWYLPAKLKGWGVNPPRTWDKFLATCQTLKQKGLEAPLALGENWTQQHLWESVALAV --------------------------------------------3333------------ LGPDDWNNLWNGKLKFTDPKAVRAWEVFGRVLDCANKDAAGLSWQQAVDRVVQGKAAFNI 
------3333----11113333---------11111111--------------------- MGDWAAGYMTTTLKLKPGTDFAWAPSPGTQGVFMMLSDSFGLPKGAKNRQNAINWLRLVG -------------------------2222-------------2222-------------- SKEGQDTSNPLKGSIAARLDSDPSKYNAYGQSAMRDWRSNRIVGSLVHGAVAPESFMSQF -------3333------11113333---------------------------33331111 GTVMEIFLQTRNPQAAANAAQAIADQVGLGRL ---------------------------2222- >METHIONINE AMINOPEPTIDASE; SWP:P53582; PDB:2B3HA; YRYTGKLRPHYPLMPTRPVPSYIQRPDYADHPLGMSESEQALKGTSQIKLLSSEDIEGMR -------------------3333--3333-1111-33331111----------------- LVCRLAREVLDVAAGMIKPGVTTEEIDHAVHLACIARNCYPSPLNYYNFPKSCCTSVNEV -------------11112222-------------1111--3333-%%%%-------!!!! ICHGIPDRRPLQEGDIVNVDITLYRNGYHGDLNETFFVGEVDDGARKLVQTTYECLMQAI -----------2222---------iiii-------------------------------1 DAVKPGVRYRELGNIIQKHAQANGFSVVRSYCGHGIHKLFHTAPNVPHYAKNKAVGVMKS 11122223333---------1111----------------------------------22 GHVFTIEPMICEGGWQDETWPDGWTAVTRDGKRSAQFEHTLLVTDTGCEILTRRLDSARP 22-----------------3333----1111------------------1111------3 HFMS 333- >TRNA ADENOSINE DEAMINASE; SWP:Q7A1Q5; PDB:2B3JA; MTNDIYFMTLAIEEAKKAAQLGEVPIGAIITKDDEVIARAHNLRETLQQPTAHAEHIAIE ------------------1111---------%%%%---------11111111-------- RAAKVLGSWRLEGCTLYVTLEPCVMCAGTIVMSRIPRVVYGADDPKGGCSGSLMNLLQQS ----------2222----------------1111---------3333-------111111 NFNHRAIVDKGVLKEACSTLLTTFFKNLRAN 11----------------------------- >HYPOTHETICAL PROTEIN AF11; SWP:O29141; PDB:2B3NA; EVKMMSLLEEMKGIYSKKGGKVKPFEKFEGELKEGYRFEYEKKLCEIDVAMFGLISGDLN ---------------1111-------------2222--------3333------------ PVHFDEDFASKTRFGGRVVHGMLTTSLVSAAVARLPGTVVLLEQSFRYTSPVRIGDVVRV 1111-------1111----3333--------1111-----------------2222---- EGVVSGVEKNRYTIDVKCYTGDKVVAEGVVKVLIW -------!!!!--------!!!!------------ >Phosphatidylinositol-4-ph; SWP:Q61194; PDB:2B3RA; AVKLSVSYRNGTLFIMVMHIKDLVTEDGADPNPYVKTYLLPDTHKTSKRKTKISRKTRNP --------%%%%-----------------------------1111--------------- 
TFNEMLVYSGYSKETLRQRELQLSVLSAESLRENFFLGGITLPLKDFNLSKETVKWYQLT -----------3333--------------------------------------------- A - >Peptide chain release fac; SWP:P0A7I0; PDB:2B3TB; AKLEALHERHEEVQALLGDAQTIADQERFRALSREYAQLSDVSRCFTDWQQVQQLQVLLL -3333----3333-----1111----------------------1111------------ PKDPDDERNAFLEVRAGTGGDEAALFAGDLFRMYSRYAEARRWRVEIMSASEGEHGGYKE ------------------------------------------------------------ IIAKISGDGVYGRLKFESGGHRVQRVPATESQGRIHTSACTVAVMPELPDAELPDVNPAD -------------3333---------1111------------------1111-------- LRIDTFRSSGAGGQHVNTTDSAIRITHLPTGIVVECQDERSQHKNKAKALSVLGARIHAA ------------3333------------------------3333---------------- EMAKRQRRNSDRNRTYNFPQGRVTDHRINLTLYRLDEVMEGKLDMLIEPIIQEHQADQLA -3333------------1111---3333-------3333---------------3333-- >HYPOTHETICAL PROTEIN YBIA; SWP:P30176; PDB:2B3WA; MPVRAQRIQHVMQDTIINFYSTSDDYGDFSNFAAWPIKVDGKTWPTSEHYFQAQKFLDEK ------------------------1111-1111-----iiii---3333----------- YREEIRRVSSPMVAARMGRDRSKPLRKNWESVKEQVMRKALRAKFEQHAELRALLLATAP ---------------1111------1111---------------3333-------1111- AKLVEHTENDAYWGDGGHGKGKNRLGYLLMELREQLAIEKLEHHHHHH ------------------------------------------------ >IRON-RESPONSIVE ELEMENT B; SWP:P21399; PDB:2B3YA; SNPFAHLAEPLDPVQPGKKFFNLNKLEDSRYGRLPFSIRVLLEAAIRNCDEFLVKKQDIE -1111------3333------3333--3333---3333-------1111-----3333-- NILHWNVTQHKNIEVPFKPARVILQDFTGVPAVVDFAAMRDAVKKLGGDPEKINPVCPAD -------1111--------------1111--------------1111-3333-------- LVIDHSIQVDFNRRADSLQKNQDLEFERNRERFEFLKWGSQAFHNMRIIPPGSGIIHQVN -------------1111------------------------------------------- LEYLARVVFDQDGYYYPDSLVGTDSHTTMIDGLGILGWGVGGIEAEAVMLGQPISMVLPQ ----------iiii---------1111--------------------1111--------- VIGYRLMGKPHPLVTSTDIVLTITKHLRQVGVVGKFVEFFGPGVAQLSIADRATIANMCP ----------1111-----------------2222-----1111-------------333 EYGATAAFFPVDEVSITYLVQTGRDEEKLKYIKKYLQAVGMFRDFNDPSQDPDFTQVVEL 3------------------------------------------11111111--------- 
DLKTVVPCCSGPKRPQDKVAVSDMKKDFESCLGAKQGFKGFQVAPEHHNDHKTFIYDNTE 3333---------1111--3333--------------------3333--------%%%%- FTLAHGSVVIAAITSCTNTSNPSVMLGAGLLAKKAVDAGLNVMPYIKTSLSPGSGVVTYY ---2222-------3333-----------------1111---1111-------3333--- LQESGVMPYLSQLGFDVVGYGCMTCIGNSGPLPEPVVEAITQGDLVAVGVLSGNRNFEGR -1111-----1111--------3333------3333--------------------2222 VHPNTRANYLASPPLVIAYAIAGTIRIDFEKEPLGVNAKGQQVFLKDIWPTRDEIQAVER -1111----------------------3333------------3333------------- QYVIPGMFKEVYQKIETVNESWNALATPSDKLFFWNSKSTYIKSPPFFENLTLDLQPPKS -----------1111--------------------1111------1111----------- IVDAYVLLNLGDSVTTDHISPAGNIARNSPAARYLTNRGLTPREFNSYGSRRGNDAVMAR --------------3333-----------------1111-3333--33331111------ GTFANIRLLNRFLNKQAPQTIHLPSGEILDVFDAAERYQQAGLPLIVLAGKEYGAGSSRD 22221111-3333---------1111------------1111------------------ WAAKGPFLLGIKAVLAESYERIHRSNLVGMGVIPLEYLPGENADALGLTGQERYTIIIPE ---------------------------1111------22223333--------------- NLKPQMKVQVKLDTGKTFQAVMRFDTDVELTYFLNGGILNYMIRKMAK --2222-----1111-----------------1111------------ >RIBOFLAVIN BIOSYNTHESIS P; SWP:P17618; PDB:2B3ZA; MEEYYMKLALDLAKQGEGQTESNPLVGAVVVKDGQIVGMGAHLKYGEAHAEVHAIHMAGA -------------1111--!!!!--------------------2222-3333------33 HAEGADIYVTLEPCSHYGKTPPCAELIINSGIKRVFVAMRDPNPLVAGRGISMMKEAGIE 33----------------------------------------3333-------------- VREGILADQAERLNEKFLHFMRTGLPYVTLKAAASLDGKIATSTGDSKWITSEAARQDAQ ----------------------------------1111---1111--------------- QYRKTHQSILVGVGTVKADNPSLTCRLPNVTKQPVRVILDTVLSIPEDAKVICDQIAPTW ---------------------------------------1111--11111111------- IFTTARADEEKKKRLSAFGVNIFTLETERIQIPDVLKILAEEGIMSVYVEGGSAVHGSFV ---1111--------1111-----------3333-----1111----------------- KEGCFQEIIFYFAPKLIGGTHAPSLISGEGFQSMKDVPLLQFTDITQIGRDIKLTAKPT --------------------------------1111-----------!!!!-------- >protein tyrosine phosphat; SWP:P26045; PDB:2B49A; VLIQFEQLYRKKPGLAITFAKLPQNLDKNRYKDVLPYDTTRVLLQGNEDYINASYVNMEI 
---3333----2222--333333331111-1111--3333-------------------3 PAANLVNKYIATQGPLPHTCAQFWQVVWDQKLSLIVMLTTLTERGRTKCHQYWPDPPDVM 333------------1111-----------------------%%%%-------------- NHGGFHIQCQSEDCTIAYVSREMLVTNTQTGEEHTVTHLQYVAWPDHGVPDDSSDFLEFV -iiii---------1111------------------------------------------ NYVRSLRVDSEPVLVHCSAGIGRTGVLVTMETAMCLTERNLPIYPLDIVRKMRDQRAMMV ---1111-----------------------------1111-------------------- QTSSQYKFVCEAILRVYEE ------------------- >BH3024; SWP:Q9K8I2; PDB:2B4AA; QPFRVTLVEDEPSHATLIQYHLNQLGAEVTVHPSGSAFFQHRSQLSTCDLLIVSDQLVDL ----------3333--------1111-----------------3333------------- SIFSLLDIVKEQTKQPSVLILTTGRLIESSEHNLSYLQKPFAISELRAAIDYHKPS --------3333-----------------------------3333-----1111-- >DIHYDROOROTATE DEHYDROGEN; SWP:Q57U83; PDB:2B4GA; SLKVNILGHEFSNPFMNAAGVLCTTEEDLRRMTESESGSLIGKSCTLAPRTGNPEPRYFG -----iiii--------2222--------------------------------------- LPLGSINSMGLPNLGVDFYLSYAAQTHDYSRKPLFLSMSGLSVEESVEMVKKLVPITKEK 1111----------3333---------3333----------------------------- GTILELNLSCPNVPGKPQVGYDFDTTRTYLQKVSEAYGLPFGVKMPPYFDIAHFDMAAAV ------------2222-3333--------------------------------------- LNDFPLVKFITCVNSIGNGLVIDPANETVVIKPKQGFGGLGGKYVLPTALANVNAFFRRC ---3333-------------------------%%%%-----3333--------------1 PDKLVFGCGGVYSGEEAFLHILAGASMVQVGTALHDEGPIIFARLNKELQEIMTNKGYKT 111--------------------------------------------------------3 LDEFRGRVKTMD 3332222----- >OUTER CAPSID PROTEIN VP4; SWP:Q91HI9; PDB:2B4HA; ANEDIVVSKTSLWKEMQYNRDITIRFKFASSIVKSGGLGYKWSEISFKPANYQYTYTRDG ---------------------------------------------------------iii EEVTAHTTCSVNGMNDFNFNGGSLPTDFVISRYEVIKENSYVYVDYWDDSQAFRNMVYVR i----------------------1111---------1111---------3333------- SLAANLNSVICTGGDYSFALPVGQWPVMTGGAVSLHSAGVTLSTQFTDFVSLNSLRFRFR ------------------------------------------------------------ LTVEEPSFSITRTRVSRLYGLPAANPNNGKEYYEVAGRFSLISLVPS --------------------------iiii----------------- >PC4 and SFRS1-interacting; SWP:O75475; PDB:2B4JC; 
GSSMDSRLQRIHAEIKNSLKIDNLDVNRCIEALDELASLQVTMQQAQKHTEMITTLKKIR -------------------1111------------1111--------------------- RFKVSQVIMEKSTMLYNKFKNM -3333----------------- >GLYCINE BETAINE-BINDING P; SWP:P46922; PDB:2B4LA; DENASAAEQVNKTIIGIDPGSGIMSLTDKAMKDYDLNDWTLISASSAAMTATLKKSYDRK ---------%%%%----1111----------11111111-----------------1111 KPIIITGWTPHWMFSRYKLKYLDDPKQSYGSAEEIHTITRKGFSKEQPNAAKLLSQFKWT ----------3333---------1111------------2222----------1111--- QDEMGEIMIKVEEGEKPAKVAAEYVNKHKDQIAEWTKGVQKVKGDKINLAYVAWDSEIAS ----------1111---------------------2222--------------------- TNVIGKVLEDLGYEVTLTQVEAGPMWTAIATGSADASLSAWLPNTHKAYAAKYKGKYDDI ------------------------------------------------------------ GTSMTGVKMGLVVPQYMKNVNSIEDLKK -------------3333------1111- >GASTRIC INHIBITORY POLYPE; SWP:P09681; PDB:2B4NA; YAEGTFISDYSIAMDKIHQQDFVNWLLAQKGKKNDWKHNITQ ---3333-------------------1111------------ >MYO-INOSITOL HEXAPHOSPHAT; SWP:Q7WUJ1; PDB:2B4PA; MTVTEPVGSYARAERPQDFEGFVWRLDNDGKEALPRNFRTSADALRAPEKKFHLDAAYVP -----2222-11113333---------------------1111-----3333--1111-- SREGMDALHISGSSAFTPAQLKNVAAKLREKTAGPIYDVDLRQESHGYLDGIPVSWYGER -2222----------------------1111-----------------iiii-----222 DWANLGKSQHEALADERHRLHAALHKTVYIAPLGKHKLPEGGEVRRVQKVQTEQEVAEAA 21111-----------------2222--------%%%%-------------------111 GMRYFRIAATNHVWPTPENIDRFLAFYRTLPQDAWLHFHCEAGVGRTTAFMVMTDMLKNP 1--------2222-------------11111111------------------------11 SVSLKDILYRQHEIGGFYYGEFPIKTKDKDSWKTKYYREKIVMIEQFYRYVQENRADGYQ 11---------1111-----------33333333--------------------1111-- TPWSVWLKSHPAKA -------------- >Rhamnolipids biosynthesis; SWP:Q9RPT1; PDB:2B4QA; MHPYFSLAGRIALVTGGSRGIGQMIAQGLLEAGARVFICARDAEACADTATRLSAYGDCQ -3333-2222-------------------1111--------------------------- AIPADLSSEAGARRLAQALGELSARLDILVNNAGTSWGAALESYPVSGWEKVMQLNVTSV ---------------------------------------1111----------------- FSCIQQLLPLLRRSASAENPARVINIGSVAGISAMGEQAYAYGPSKAALHQLSRMLAKEL 
---------------3333--------3333-------1111-----------------1 VGEHINVNVIAPGRFPSRMTRHIANDPQALEADSASIPMGRWGRPEEMAALAISLAGTAG 111-------------3333--1111----------1111---3333----------111 AYMTGNVIPIDGGFHL 1---------iiii-- >Glyceraldehyde-3-phosphat; SWP:Q8IKK7; PDB:2B4RO; ATKLGINGFGRIGRLVFRAAFGRKDIEVVAINDPFMDLNHLCYLLKYDSVHGQFPCEVTH ---------3333-----3333-------------------------------------- ADGFLLIGEKKVSVFAEKDPSQIPWGKCQVDVVCESTGVFLTKELASSHLKGGAKKVIMS %%%%--------------3333-3333--------------3333--------------- APPKDDTPIYVMGINHHQYDTKQLIVSNASCTTNCLAPLAKVINDRFGIVEGLMTTVHAS ----------22223333-3333------------------------------------- TANQLVVDGPSKGGKDWRAGRCALSNIIPASTGAAKAVGKVLPELNGKLTGVAFRVPIGT 1111------2222-3333--3333-------33333333-3333--------------- VSVVDLVCRLQKPAKYEEVALEIKKAAEGPLKGILGYTEDEVVSQDFVHDNRSSIFDMKA --------------3333----------1111----------33332222------1111 GLALNDNFFKLVSWYDNEWGYSNRVLDLAVHITT ---------------------------------- >RNA EDITING COMPLEX PROTE; SWP:Q86MV5; PDB:2B4VA; NPSPDHYAVWGKAIAENNRRVGPEHFRTAIRAQQQLQGLADKWTPDAKVYCCGSVTYGQE -------------------------------------------3333-------1111-2 RGSDLDLACFDDPYPSHEVQAKRTDKLRTVIKRYVPHYLRNNLLGLTEARTPVVKLRFAN 222--------------------------------33331111----------------3 DEKVARARYTPLSEEEDRKARTALLDVRNQCVGDNDVEYIAEKGRDNVEGIRVDRTTYGC 3333333------------------------------------1111------------- RIAIQCTSKEQIEAIGFFPDGKITRGREDYTRDVLDVRFVPEFYRWDISFVGYGVKNSYL -------3333-3333----------1111-----3333--------------------- IRHYLHNGPVAARHTAAVKAWGKATNGALTSYAVTVFIYYLLVTRQVLWVDPWSLPHPAH --------11113333-----1111-------------------------3333--3333 LPRYPDFSPLYDCDPTELGRLLHGFFIFYAHHFDYEREVVSLNRNRRSYRSDIGWNFPQN ---------------------------------3333-----------3333----2222 KKGTFSYNFCIEDPYEDVGTGGLNLVRHLHPAKFQLVKQEFLRAAQCERFLPTNAPEKSI -!!!!---------2222-----1111-----------------------3333---333 LG 3- >HYPOTHETICAL PROTEIN, CON; SWP:Q4QH86; PDB:2B4WA; KQVKAAFEANKRVYESVLLTFKGVDGYDVYNCSVPFSYKGKTHIYGRVEKRDIWAASHVR 
-----------------------2222----------iiii--------1111------- LFEETGKDEFTAVPELSWELEDPYIAKINNEIFGGTRVRILSYYGYFYRGTPDELTYFTR -----2222------------------%%%%-------------------1111------ GPGCKDIRVLQLQDGRLGVFSRPRVASIGFVILNSIDELGAEVIAKAPPLDILNAWGGVN -----------1111-------------------3333-----1111------------- QAYLLSSGKVGCIGHYSYEQSVYVNYAFVLDPQSRAITGAKIIGTKSCYPPCEPKVPLLA ----1111------------------------------------1111------------ DCVFASGIVRSDGKVDLYSGVGDSHEGRITIDYPFKGHGTIIGDLHFP ---------1111-------%%%%----------2222---------- >NAD-DEPENDENT DEACETYLASE; SWP:Q9NXA8; PDB:2B4YA; PSSSADFRKFFAKAKHIVIISGAGVSAESGVGYWRKWQAQDLATPLAFAHNPSRVWEFYH ---------------------33331111----!!!!3333------------------- YRREVGSKEPNAGHRAIAECETRLGKQGRRVVVITQNIDELHRKAGTKNLLEIHGSLFKT -----------------------1111---------------3333-----11111111- RCTSCGVVAENYKSPICPALSGKGAPEPGTQDASIPVEKLPRCEEAGCGGLLRPHVVWFG ----------------3333------2222-----3333-----2222---------222 ENLDPAILEEVDRELAHCDLCLVVGTSSVVYPAAFAPQVAARGVPVAEFNTETTPATNRF 2-------------------------------------3333-----------1111--- RFHFQGPCGTTLPEALA ------33333333--- >CYTOCHROME C; SWP:P62894; PDB:2B4ZA; GDVEKGKKIFVQKCAQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGFSYTDANKNKGITWG -------------3333---2222-------2222-------2222-------------- EETLMEYLENPKKYIPGTKMIFAGIKKKGEREDLIAYLKKATNE ----------33332222---------------------1111- >PEROXISOME PROLIFERATOR A; SWP:Q03181; PDB:2B50A; LKAFSKHIYNAYLKNFNMTKKKARSILTGKASHTAPFVIHDIETLWQAEKGLVWKLPPYK -----------------------------1111--------------------------- EISVHVFYRCQCTTVETVRELTEFAKSIPSFSSLFLNDQVTLLKYGVHEAIFAMLASIVN ---------------------------3333---3333-----------------11111 KDGLLVANGSGFVTREFLRSLRKPFSDIIEPKFEFAVKFNALELDDSDLALFIAAIILCG 111---iiii-----3333--------------------1111----------------- DRPGLMNVPRVEAIQDTILRALEFHLQANHPDAQQLFPKLLQKMADLRQLVTEHAQMMQR -2222------------------------1111--------------------------- IKKTETETSLHPLLQEIYKDM ----1111---------2222 >Cellulosomal scaffolding ; SWP:Q06851; PDB:2B59B; 
GYKVSGYILPDFSFDATVAPLVKAGFKVEIVGTELYAVTDANGYFEITGVPANASGYTLK --------------33333333-------2222------1111----------------- ISRATYLDRVIANVVVTGDTSVSTSQAPIMMWVGDIVKDNSINLLDVAEVIRCFNATKGS --2222-----------------3333-------------------------22222222 ANYVEELDINRNGAINMQDIMIVHKHFGATSSDYDA ---33331111--------------22223333--- >C.BCLI; SWP:NA; PDB:2B5AA; MINEIEIKRKFGRTLKKIRTQKGVSQEELADLAGLHRTYISEVERGDRNISLINIHKICA -------------------1111-------------------1111-------------1 ALDIPASTFFRKMEEEN 111---------1111- ------------------------------------ >ALPHA-AMYLASE; SWP:A1GKL6; PDB:2B5DX; MRGKILIFLHAHLPYVHHPEYDHFLEERWLFEAITETYIPLLMMFDEIEDFRLTMSITPP -----------------3333--1111--------------------------------- LMEMLSSRDLQEKYERHMEKLIELANKEVERTKKEHPLKHKMAKFYREHFEKILNVFRSY -------------------------------1111----------------------111 DGNILEGFKKYQETGKLEIVTCNATHAFLPLYQMYPEVVNAQITVGVKNYEKHMKKHPRG 1----------3333------------33331111------------------------- IWLAECGYYQGLDLYLAQNNVEYFFVDSHAFWFADEQPRYGVYRPIMTPSGVFAFARDPE --2222----------1111------33331111---1111------1111------333 SSEQVWSAAVGYPGDPRYREFYRDIGFDREMEYIKDYIDPSGVRINTGIKYHRITSKSLD 33333-----33331111-----1111--3333-----3333-----------------3 ASQKEYYDIDLAMEAVEEHARDFLHKKESQARRLMDIMGVEPVIVAPFDAELFGHWWFEG 333----3333-------------------------------------3333-------- VFFLKRFFELVNESKDLKLVTASEVIDTLEEVQIATPADSSWGATNDWIYRHLHEMIERM ----------------------------------------------3333---------- IDLSKKYYNSSDPLVERVLNQMLRELFLAQSSDWAFIMTTRTSVQYAENRTKLHIKRFLN ------1111-3333---------------3333-------------------------- LYDQLVSGRIDEEMLRYYEWTDAIFPEINFRVMARDVI ------------------------11113333------ >PROTEIN DISULFIDE-ISOMERA; SWP:P17967; PDB:2B5EA; MQQEAVAPEDSAVVKLATDSFNEYIQSHDLVLAEFFAPWCGHCKNMAPEYVKAAETLVEK -------1111-----1111----------------1111-----------------111 NITLAQIDCTENQDLCMEHNIPGFPSLKIFKNSDVNNSIDYEGPRTAEAIVQFMIKQSQP 1------3333-------------------%%%%---------------------1111- AVAVVADLPAYLANETFVTPVIVQSGKIDADFNATFYSMANKHFNDYDFVSAENADDDFK 
------------------------------------------1111-------1111--- LSIYLPSAMDEPVVYNGKKADIADADVFEKWLQVEALPYFGEIDGSVFAQYVESGLPLGY ----1111---------3333----------------------3333------------- LFYNDEEELEEYKPLFTELAKKNRGLMNFVSIDARKFGRHAGNLNMKEQFPLFAIHDMTE --------------------------------3333---3333--------------111 DLKYGLPQLSEEAFDELSDKIVLESKAIESLVKDFLKGDASPIVKSQEIFENQDSSVFQL 1---------------------------------3333---------------------- VGKNHDEIVNDPKKDVLVLYYAPWCGHCKRLAPTYQELADTYANATSDVLIAKLDHTEND 3333------1111-------1111-----------------------------3333-- VRGVVIEGYPTIVLYPGGKKSESVVYQGSRSLDSLFDFIKENGHFDVDGKALYEEAQEKA ------------------------------------------1111-------------- AEE --- >DIAMINE ACETYLTRANSFERASE; SWP:P21673; PDB:2B5GA; AKFVIRPATAADCSDILRLIKELAKYEQVILTEKDLLEDGFGEHPFYHCLVAEVPKEHWT --------3333---------3333-----------------------------3333-1 PEGHSIVGFAYYFTYDPWIGKLLYLEDFFVSDYRGFGIGSEILKNLSQVARCRCSSHFLV 111---------------------------1111-------------------------- AEWNEPSINFYKRRGASDLSSEEGWRLFKIDKEYLLKAT 1111-------1111-----1111------3333----- >CYSTEINE DIOXYGENASE TYPE; SWP:P21816; PDB:2B5HA; ELLKPRTLADLIRILHELFAGDEVNVEEVQAVLEAYESNPAEWALYAKFDQYRYTRNLVD ----------------1111------------------33333333---1111------- QGNGKFNLMILCWGEGHGSSIHDHTDSHCFLKLLQGNLKETLFDWPDKKSNEMIKKSERT %%%%---------2222------%%%%--------------------------------- LRENQCAYINDSIGLHRVENVSHTEPAVSLHLYSPPFDTCHAFDQRTGHKNKVTMTFHSK -2222----3333----------------------------------------------i FGIRTP iii--- >Interleukin-2 receptor su; SWP:P14784; PDB:2B5IB; SQFTCFYNSRAQISCVWSQTSCQVHAWPDRRRWQQTCELLPVSQASWACNLILGAPDSQK ------------------------------------------------------1111-- LTTVDIVTLRVLCREGVRWRVMAIQDFKPFENLRLMAPISLQVVHVETHRCNISWEISQA -1111----------------------1111----------------------------- SHYFERHLEFEARTLSPGHTWEEAPLLTLKQKQEWICLETLTPDTQYEFQVRVKPLQGEF 3333-----------33333333------------------------------------- TTWSPWSQPLAFRTKP ---------------- >Cytokine receptor common ; SWP:P31785; PDB:2B5IC; 
PLPEVQCFVFNVEYMNCTWQSSSEPQPTNLTLHYWYKNSDNDKVQKCSHYLFSEEITSGC ---------%%%%---------------------------------------%%%%---- QLQKKEIHLYQTFVVQLQDPREPRRQATQMLKLQNLVIPWAPENLTLHKLSESQLELNWN --3333------------1111---------1111------------------------- NRFLNHCLEHLVQYRTDWDHSWTEQSVDYRHKFSLPSVDGQKRYTFRVRSRFNPLCGSAQ ----1111------------------------------3333------------------ HWSEWSHPIHW ----------- >Non-structural protein V; SWP:P11207; PDB:2B5LC; LIETGLNTVEYFTSQQVTGTSSLGKNTIPPGVTGLLTNAPKIAIVPADDKTVPGKPIPNP ------3333--3333------------2222----------------3333------33 LLGLDSTPSTQTVLDLSGKTLPSGSYKGVKLAKFGKENLMTRFIEEPRENPDFKRGRDTG 33------------1111--------------------------------------1111 GFHRREYSIGWVGDEVKVTEWCNPSCSPITAAARRFECTCHQCPVTCSECERDT ------------------------------------------------------ >DAMAGE-SPECIFIC DNA BINDI; SWP:Q16531; PDB:2B5NA; NGIGIHEHASIDLPGIKGLWPLRSDPNRETYDTLVLSFVGQTRVLMLNGEEVEETELMGF ------------------------3333---------2222---------------3333 VDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVA ------------%%%%------------------------------------1111---- VGRALYYLQIHPQELRQISHTEMEHEVACLDITPLGLSPLCAIGLWTDISARILKLPSFE ---------------------------------------------3333----------- LLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQP ------------------------------1111-------------------------- TVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALAN ------------------------------------------------3333-------- NSTLTIGTIDEI ------------ >FERREDOXIN--NADP REDUCTAS; SWP:P31973; PDB:2B5OA; SVPVNIYRPKTPFLGKCIENYELVDEGGSGTVRHVTFDISEGDLRYLEGQSIGIIPPGED -------3333-------------2222----------1111----2222---------1 KNGKPHKLRLYSIASTRHGDMEDNKTVSLCVRQLEYQDPESGETVYGVCSTYLCNLPVGT 111------------1111---------------------------------11112222 DDVKITGPVGKEMLLPDDEDATVVMLATGTGIAPFRAFLWRMFKEQHEDYKFKGKAWLIF -----------------1111--------------------------------------- GVPYTANILYKDDFEKMAAENPDNFRLTYAISREQKTADGGKVYVQSRVSEYADELFEMI 
---33332222-------------------1111--3333---3333------------- QKPNTHVYMCGLKGMQPPIDETFTAEAEKRGLNWEEMRRSMKKEHRWHVEVY -1111------3333------------1111----------1111------- >Antithrombin-III [Precurs; SWP:P01008; PDB:2B5TI; DICTAKPRDIPMNPMCIYRSPEKIPEATNRRVWELSKANSRFATTFYQHLADSKNDNDNI -11111111-----------------------------------------11111111-- FLSPLSISTAFAMTKLGACNDTLQQLMEVFKFDTISEKTSDQIHFFFAKLNCRLYRKANK -------------3333-------------1111-------3333--------------- ASKLVSANRLFGDKSLTFNETYQDISELVYGAKLQPLDFKENAEQSRAAINKWVSNKTEG -------------------------------------3333--------------1111- RITDVIPSEAINELTVLVLVNTIYFKGLWKSKFSPENTRKELFYKADGESCSASMMYQEG ------2222-1111------------------3333----------------------- KFRYRRVAEGTQVLELPFKGDDITMVLILPKPEKSLAKVEKELTPEVLQEWLDELEEMML -------%%%%----------------------------1111--------1111----- CVHMPRFRIEDGFSLKEQLQDMGLVDLFSPEKSKLPGIVAEGRDDLYVSDAFHKAFLEVN -------------------1111-3333------3333---------------------3 EEGSEAAASTAVVIAGRSLNPNRVCFKANRPFLVFIREVPLNTIIFMGRVANPCV 333----------------1111-------------------------------- >COLICIN E3; SWP:P00646; PDB:2B5UA; SAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGPFKFGLWGVALYGVLP -------2222------3333-------------------1111--------------33 SQIAKDDPNMMSKIVTSLPADDITESPVSSLPLDKATVNVNVRVVDDVKDERQNISVVSG 33----------------3333----3333-1111------------------------- VPMSCPVVDAKPTERPGVFTASIPGAPVLNISVNNSTPAVQTLSPGVTNNTDKDVRPAGF --------------2222----------------------------------------11 TQGGNTRDAVIRFPKDSGHNAVYVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAAER 11---------------------------------------------------------- NYERARAELNQANEDVARNQERQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNRFAHD -----------------------------------------------------3333--3 PMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKRS 333--------------------------------------------------------- AENNLNDEKNKPRKGFKDYGHDYHPAPKTENIKGLGDLKPGIPKTPKQNGGGKRKRWTGD -------1111---33331111-----1111----------------------------% 
KGRKIYEWDSQHGELEGYRASDGQHLGSFDPKTGNQLKGPDPKRNIKKYL %%%-------------------------------------3333-3333- >GLUCOSE DEHYDROGENASE; SWP:Q977U7; PDB:2B5WA; MKAIAVKRGEDRPVVIEKPRPEPESGEALVRTLRVGVCGTDHEVIAGGHGGFPEGEDHLV ------2222-------------2222----------3333-------11112222---- LGHEAVGVVVDPNDTELEEGDIVVPTVRRPPASGTNEYFERDQPDMAPDGMYFERGIVGA -----------!!!!--2222---------1111----11113333-2222--------- HGYMSEFFTSPEKYLVRIPRSQAELGFLIEPISITEKALEHAYASRSAFDWDPSSAFVLG ----------3333----11111111-----------------1111------------- NGSLGLLTLAMLKVDDKGYENLYCLGRRDRPDPTIDIIEELDATYVDSRQTPVEDVPDVY --------------3333----------------------------3333-3333----- EQMDFIYEATGFPKHAIQSVQALAPNGVGALLGVPSDWAFEVDAGAFHREMVLHNKALVG -----------3333--3333--2222------------------------1111----- SVNSHVEHFEAATVTFTKLPKWFLEDLVTGVHPLSEFEAAFDDDDTTIKTAIEFSTV ----3333-------1111-------------33333333---1111---------- >YKUV PROTEIN; SWP:O31699; PDB:2B5XA; MKLRQPMPELTGEKAWLNGEVTREQLIGEKPTLIHFWSISCHLCKEAMPQVNEFRDKYQD ---------------------3333------------1111----------------111 QLNVVAVHMPRSEDDLDPGKIKETAAEHDITQPIFVDSDHALTDAFENEYVPAYYVFDKT 1----------1111----------1111-------11113333-------------111 GQLRHFQAGGSGMKMLEKRVNRVLAETE 1-----------3333-------1111- >HOMOSERINE O-ACETYLTRANSF; SWP:P45131; PDB:2B61A; SVQNVVLFDTQPLTLLGGKLSYINVAYQTYGTLNDEKNNAVLICHALTGDAEPYFDDGRD ---------------------------------3333--------1111----------- GWWQNFGAGLALDTDRYFFISSNVLGGCKGTTGPSSINPQTGKPYGSQFPNIVVQDIVKV --11112222--1111-------2222-----1111--------!!!!----3333---- QKALLEHLGISHLKAIIGGSFGGQANQWAIDYPDFDNIVNLCSSIYFSAEAIGFNHVRQA ------------------!!!!---------1111------------------------- VINDPNFNGGDYYEGTPPDQGLSIARLGLTYRTDLQLAKAFGRATKSDGSFWGDYFQVES ---1111iiii1111-------------------------!!!!-----1111------- YLSYQGKKFLERFDANSYLHLLRALDYDPSLGYENVKEALSRIKARYTLVSVTTDQLFKP ---------------------------1111--------3333--------1111---33 IDLYKSKQLLEQSGVDLHFYEFPSDYGHDAFLVDYDQFEKRIRDGLAGN 33--------1111---------1111-3333--1111----------- >COG0778: 
NITROREDUCTASE; SWP:Q97S03; PDB:2B67A; KFLELNKKRHATKHFTDKLVDPKDVRTAIEIATLAPSAHNSQPWKFVVVREKNAELAKLA -----1111---------------------------2222----------------1111 YGSNFEQVSSAPVTIALFTDTDLAKRARKIARVGGANNFSEEQLQYFKNLPAEFARYSEQ -----------------------------------------------------1111--- QVSDYLALNAGLVANLVLALTDQGIGSNIILGFDKSKVNEVLEIEDRFRPELLITVGYTD --------------------1111---------3333-------3333------------ EKLEPSYRLPVDEIIEKR ---------3333----- >DEFENSIN; SWP:Q4GWV4; PDB:2B68A; GFGCPGNQLKCNNHCKSISCRAGYCDAATLWLRCTCTDCNGKK -------3333--------------1111-------------- >UDP-GLUCURONATE DECARBOXY; SWP:Q8NBZ7; PDB:2B69A; RKRILITGGAGFVGSHLTDKLDGHEVTVVDNFFTGRKRNVEHWIGHENFELINHDVVEPL -------1111------------------------33333333--1111-----3333-- YIEVDQIYHLASPASPPNYYNPIKTLKTNTIGTLNLGLAKRVGARLLLASTSEVYGDPEV --------------3333----------------------------------1111---- HPQSEDYWGHVNPIGPRACYDEGKRVAETCYAYKQEGVEVRVARIFNTFGPRHNDGRVVS ---1111-------1111------------------------------------------ NFILQALQGEPLTVYGSGSQTRAFQYVSDLVNGLVALNSNVSSPVNLGNPEEHTILEFAQ -----1111----------------3333------------------------------- LIKNLVGSGSEIQFLSEAQDDPQKRKPDIKKAKLLGWEPVVPLEEGLNKAIHYFRKELEY -----------------2222-------3333---------------------------- QA -- >HYPOTHETICAL PROTEIN EF30; SWP:Q82ZI8; PDB:2B6CA; TLQFQKNPETAAKSAYKHQFVFAGIPAPERQALSKQLLKESHTWPKEKLCQEIEAYYQKT ----------------------------------------1111-----------1111- EREYQYVAIDLALQNVQRFSLEEVVAFKAYVPQKAWWDSVDAWRKFFGSWVALHLTELPT 3333-------33331111------33331111--------------------3333--- IFALFYGAENFWNRRVALNLQLLKEKTNQDLLKKAIIYDRTTEEFFIQKAIGWSLRQYSK ----2222----------1111!!!!-3333-------1111-3333------------- TNPQWVEELKELVLSPLAQREGSKYLAKA ----------------------1111--- >LACTOTRANSFERRIN; SWP:P24627; PDB:2B6DA; YTRVVWCAVGPEEQKKCQQWSQQSGQNVTCATASTTDDCIVLVLKGEADALNLDGGYIYT ---------------------1111----------------------------------- AGKCGLVPVLAENRKSSKHSSLDCVLRPTEGYLAVAVVKKANEGLTWNSLKDKKSCHTAV -1111-----------------3333------------3333---11112222-----22 
DRTAGWNIPMGLIVNQTGSCAFDEFFSQSCAPGADPKSRLCALCAGDDQGLDKCVPNSKE 221111--------------1111------22221111--1111-------2222-3333 KYYGYTGAFRCLAEDVGDVAFVKNDTVWENTNGESTADWAKNLKREDFRLLCLDGTRKPV ----------------------3333----iiii--3333---3333----1111---11 TEAQSCHLAVAPNHAVVSRSDRAAHVEQVLLHQQALFGKNGKNCPDKFCLFKSETKNLLF 111111------------3333------------------1111----1111iiii---- NDNTECLAKLGGRPTYEEYLGTEYVTAIANLKKCSLEACAF 1111------------------------------------- >ADP-RIBOSYLATION FACTOR 5; SWP:P84085; PDB:2B6HA; RGSLFSRIFGKKQMRILMVGLDAAGKTTILYKLKLGEIVTTIPTIGFNVETVEYKNICFT --33331111----------2222------3333---------2222------!!!!--- VWDVGRPLWRHYFQNTQGLIFVVDSNDRERVQESADELQKMLQEDELRDAVLLVFANKQD -----3333--3333--------11111111------------3333----------333 MPNAMPVSELTDKLGLQHLRSRTWYVQATCATQGTGLYDGLDWLSHELSKR 3-------------3333------------3333-----------1111-- >PROTEINASE K; SWP:Q3HUQ2; PDB:2B6NA; ADQPSPTWGIDRIDQRNLPLDNNYHTDYDGSGVTAFVIDTGVLNTHNEFGGRASSGYDFI ------3333-------------------2222---------11111111---------- DND --- >LENS FIBER MAJOR INTRINSI; SWP:Q6J8I9; PDB:2B6OA; RSASFWRAIFAEFFATLFYVFFGLGASLRWAPGPLHVLQVALAFGLALATLVQAVGHISG 1111------------------------------3333---------------3333--- AHVNPAVTFAFLVGSQMSLLRAICYVVAQLLGAVAGAAVLYSVTPPAVRGNLALNTLHPG ---3333--------------------------------3333-------%%%%------ VSVGQATIVEIFLTLQFVLCIFATYDERRNGRLGSVALAVGFSLTLGHLFGMYYTGAGMN ------------------------------------------------------------ PARSFAPAILTRNFTNHWVYWVGPVIGAGLGSLLYDFLLFPRLKSVSERLSILKG 3333----1111---3333--1111--------------------3333--1111 >CYCLOPHILIN-LIKE PROTEIN; SWP:Q7RRM6; PDB:2B71A; LEEKIAYYKMKGHTERGYITIYTNLGDFEVELYWYHSPKTCLNFYTLCEMGFYDNTIFHR ----------------------1111-------3333-------------1111------ VIPNFVIQGGDPTGTGKGGKSIYGEYFEDEINKELKHTGAGILSMSNNGPNTNSSQFFIT -2222-----3333------1111-------1111------------------------- LAPLPHLDGKHTIFARVSKNMTCIENIASVQTTATNKPIFDLKILRTST ---3333-------------------1111--1111------------- >HYPOTHETICAL PROTEIN SMU.; SWP:Q8DUW5; PDB:2B78A; 
MIKLMVGSFAEKKLKRGVQLLSSRDYPNLNLDNQVVQLYSDADIFLGTAYLSKQNKGVGW -------------1111----3333--------------1111----------!!!!--- LISPKKVSLNVTYFIKLFQWSKDKRKNFAHSKLTTAYRLFNQDGDSFGGVTIDCYGDFVL ---------3333-----------3333------------!!!!----------!!!!-- FSWYNSFVYQIRDEIVAAFRQVYPNFLGAYEKIRFNVSAHLYGQEAPEQFLILENGISYN -----------------------------------------------------iiii--- VFLNDGLMTGIFLDQRQVRNELINGSAAGKTVLNLFSYTAAFSVAAAMGGAMATTSVDLA -----------3333---------1111-------------------------------3 KRSRALSLAHFEANHLDMANHQLVVMDVFDYFKYARRHHLTYDIIIIDPPSFEVFSVSKD 333--------1111--1111----------------------------------3333- YHKLIRQGLEILSENGLIIASTNAANMTVSQFKKQIEKGFGKQKHTYLDLQQLPSDFAVN --------11112222-------1111------------!!!!----------3333--1 VQDESSNYLKVFTIKV 1113333--------- >HYPOTHETICAL PROTEIN SMU.; SWP:NA; PDB:2B79A; MKFSFELAVNTKKEDAWTYYSQVNQWFVWEGDLEQISLEGEFTTGQKGKMKMEDMPELAF ---------------3333--1111----2222---------2222-------------- TLVEVRENQCFSDLTATPFGNVLFEHEILENPDGTISLRHSVSLTSDTTEEALAFLKQIF -----2222-------1111----------1111--------------1111-------- ADVPESVGKLKQILET ------------1111 >TYROSINE-PROTEIN KINASE J; SWP:O60674; PDB:2B7AA; QFEERHLKFLQQLGKGNFGSVEMCRYDPLQDNTGEVVAVKKLQHSTEEHLRDFEREIEIL --3333--------------------1111-----------------------------1 KSLQHDNIVKYKGVCYSNLKLIMEYLPYGSLRDYLQKHKERIDHIKLLQYTSQICKGMEY 111-1111-----------------1111--------3333------------------- LGTKRYIHRDLATRNILVENENRVKIGDFGLTKVLPQDKEKVKEPGESPIFWYAPESLTE -----------3333----1111-----1111---1111---------1111-3333--- SKFSVASDVWSFGVVLYELFTYIEKSKSPPAEFMRMIGNDKQGQMIVFHLIELLKNNGRL ------------------1111-3333---------------1111-------1111--- PRPDGCPDEIYMIMTECWNNNVNQRPSFRDLALRVDQIRDQMAG --2222--------------3333-------------------- >PRE-MRNA PROCESSING PROTE; SWP:P33203; PDB:2B7EA; GAMEAEKEFITMLKENQVDSTWSFSRIISELGTRDPRYWMVDDDPLWKKEMFEKYLSNR -----------------------------------3333----3333------------ >HTLV PROTEASE; SWP:P10274; PDB:2B7FA; PVIPLDPARRPVIKAQVDTQTSHPKTIEALLDTGADMTVIPIALFSSNTPLKNTSVLGAG 
-----1111----------------------1111-----3333---------------- GQTQDHFKLTSLPVLIRLPFRTTPIVLTSCLVDTKNNWAIIGRDALQQCQGVLYLP -----------------2222-------------------------1111------ >SCO1 PROTEIN; SWP:P23833; PDB:2B7KA; GKPSLGGPFHLEDMYGNEFTEKNLLGKFSIIYFGFSNCPDICPDELDKLGLWLNTLSSKY ------------1111-----1111---------1111---------------------- GITLQPLFITCDPARDSPAVLKEYLSDFHPSILGLTGTFDEVKNACKKYRVLVDHSIFFY -----------3333---------11113333--------------1111-3333----- LMDPEGQFVDALGRNYDEKTGVDKIVEHVKSYVPA --1111------1111------------1111--- >GLYCEROL-3-PHOSPHATE CYTI; SWP:O05155; PDB:2B7LA; MKRVITYGTYDLLHYGHIELLRRAREMGDYLIVALSTDEFNQIKHKKSYYDYEQRKMMLE ------------------------1111-------------------------------- SIRYVDLVIPEKGWGQKEDDVEKFDVDVFVMGHDWEGEFDFLKDKCEVIYLKRTE -1111----------------1111------1111---3333------------- >PROBABLE NICOTINATE-NUCLE; SWP:O25909; PDB:2B7NA; MEIRTFLERALKEDLGHGDLFERVLEKDFKATAFVRAKQEGVFSGEKYALELLEMTGIEC 1111------3333!!!!-3333---------------------3333-----1111--- VQTIKDKERFKPKDALMEIRGDFSMLLKVERTLLNLLQHSSGIATLTSRFVEALNSHKVR ----2222--2222-------------------------------------3333----- LLDTRKTRPLLRIFEKYSVLNGGASNHRLGLDDALMLKDTHLRHVKDLKSFLTHARKNLP -------2222--------1111--------------33331111---------3333-1 FTAKIEIECESFEEAKNAMNAGADIVMCDNLSVLETKEIAAYRDAHYPFVLLEASGNISL 111---------------3333------------------------1111--------33 ESINAYAKSGVDAISVGALIHQATFIDMHMKMA 33-3333--------33331111---------- >3-deoxy-D-arabino-heptulo; SWP:NA; PDB:2B7OA; ANTVDIPIDQLPSLPPLPTDLRTRLDAALAKPAAQQPTWPADQALARTVLESVPPVTVPS ---------------------------3333------------------1111----333 EIVRLQEQLAQVAKGEAFLLQGGDCAETFDNTEPHIRGNVRALLQAVVLTYGASPVVKVA 3----------1111----------------3333------------------------- RIAGQYAKPRSADIDALGLRSYRGDINGFAPDAAAREHDPSRLVRAYANASAANLVRALT --------------1111-------------3333------------------------- SSGLASLHLVHDWNREFVRTSPAGARYEALATEIDRGLRFSACGVADRNLQTAEIYASHE -3333---------------2222----------------1111--1111---------- 
ALVLDYERALRLSDGDDGEPQLFDLSAHTVWIGERTRQIDGAHIAFAQVIANPVGVKLGP --3333--------------------------1111-1111----3333----------- NTPELAVEYVERLDPHNKPGRLTLVSRGNHKVRDLLPPIVEKVQATGHQVIWQCDPHGNT -------------11112222------11113333-3333---3333------------- HESSTGFKTRHFDRIVDEVQGFFEVHRALGTHPGGIHVEITGENVTLAGRYETACDPRLN --------------------------1111------------------------------ TQQSLELAFLVAELRD ---------------- >DOUBLE-STRANDED RNA-SPECI; SWP:P51400; PDB:2B7TA; PGPVLPKNALMQLNEIKPGLQYMLLSQTGPVHAPLFVMSVEVNGQVFEGSGPTKKKAKLH ------3333------------------------------------------3333---- AAEKALRSFVQFP ------------- >CHARYBDIN; SWP:P84786; PDB:2B7UA; KAMTVKFTVELDIERLTGQTYTDFIKNLRRSLATWYLHGVPVLPLYNQEADPRGFDLKLT ------------1111--------------------iiii-------------------- FRGQVTTVRIHRDDLVLRGYQMQGAGKWLELERPSGHLIEGSELLEFGPSYEELAAAAQQ iiii----------------------------------2222------------------ DILDISYNKNALQDAVSKLAVSTNTRDRARSLIVVSQMFCEATRFVDIANHFAFNLESSE 1111--------------------------------------------------1111-- PVKLPQWMQNDLEKNWVRFSFMILKSNADPCYKFEPQTIYGKIIKTADELLNFLGIVEQH ----3333---3333-------------1111------iiii--------3333------ PDTRSPPCAAG -1111------ >DOUBLE-STRANDED RNA-SPECI; SWP:P51400; PDB:2B7VA; PSGKNPVMILNELRPGLKYDFLSESGESHAKSFVMSVVVDGQFFEGSGRNKKLAKARAAQ ----------3333-----------------------------------3333------- SALATVFNLHL ----------- >FAVIN BETA CHAIN; SWP:P02871; PDB:2B7YA; DEITSFSIPKFRPDQPNLIFQGGGYTTKEKLTLTKAVKNTVGRALYSLPIHIWDSETGNV --------------1111------------------------------------------ ADFTTTFIFVIDAPNGYNVADGFTFFIAPVDTKPQTGGGYLGVFNGKDYDKTAQTVAVEF -------------------------------------1111------------------- DTFYNAAWDPSNGKRHIGIDVNTIKSISTKSWNLQNGEEAHVAISFNATTNVLSVTLLYP ----3333--------------------------2222--------1111---------- N - >Favin; SWP:P02871; PDB:2B7YB; LTGYTLSEVVPLKDVVPEWVRIGFSATTGAEYATHEVLSWTFLSELT ----------3333--------------------------------- >LUCIFERASE-LIKE MONOOXYGE; SWP:Q81B18; PDB:2B81A; EKFANHFGYNRFAKDQLTLGVHIPIENYQFHAPTEKQVELVQKAEQYGFTGVWLRDVLLQ 
------------2222-----------!!!!----------------------------- DPDFGDPATGQIYDIYLTYLASKTEKIAFGTSATVLSLRHPLRVAKEIATLDQLFPERIL 1111-1111-----------1111----------1111----------------2222-- GVSSGDRRADFKALGVSHETRGEKFREAFAYLEEILYKNFPSIQSTLGEVHGANLVPKPS ------3333------3333------------------------1111------------ KRVPTFITGFSQQNEWFAEHGDGWYYPRSPVHQAGAIGQWRELVEDYHPDVFKPFIQPHL ---------%%%%-1111----------3333---------------2222--------- DLSEDPNERPTPIRLGYRTGRKALIELLDIYKSIGVNHLFLALFDGQRPADEVLDELGEE ----1111----2222---------------1111-------1111-------------- VLPHFPAL 3333---- >CLASS B ACID PHOSPHATASE; SWP:P32697; PDB:2B82A; SPSPLNPGTNVARLAEQAPIHWVSVAQIENSLAGRPPMAVGFDIDDTVLFSSPGFWRGKK -----------------------------1111----------2222------------- TFSPESEDYLKNPVFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDAIFFVTGRSPTKTET --11113333-------------1111--------------------------------- VSKTLADNFHIPATNMNPVIFAGDKPGQNTKSQWLQDKNIRIFYGDSDNDITAARDVGAR ------1111-3333---------2222------------------3333----1111-- GIRILRASNSTYKPLPQAGAFGEEVIVNSEY ------1111------2222----------- >CYTOPLASMIC PROTEIN NCK2; SWP:O43639; PDB:2B86A; MTEEVIVIAKWDYTAQQDQELDIKKNERLWLLDDSKTWWRVRNAANRTGYVPSNYVERK ----------------1111----------------------1111------------- >ZTAQ AFFIBODY; SWP:Q70AB8; PDB:2B87A; VDNKFNKELGWATWEIFNLPNLNGVQVKAFIDSLRDDPSQSANLLAEAKKLNDAQAPK 1111------------------3333----------3333------------------ >Protein A [Fragment]; SWP:Q70AB8; PDB:2B87B; VDNKFNKERVIAIGEIMRLPNLNSLQVVAFINSLRDDPSQSANLLAEAKKLNDAQAPK -------------------------------------1111----------------- >CATION-TRANSPORTING ATPAS; SWP:O29777; PDB:2B8EA; DALEVAEKVTAVIFDKTGTLTKGKPEVTDLVPLNGDERELLRLAAIAERRSEHPIAEAIV ----3333-------------------------------------1111----------- KKALEHGIELGEPEKVEVIAGEGVVADGILVGNKRLEDFGVAVSNEVELALEKLEREAKT ---1111-----------2222---iiii---33331111--------------1111-- AVIVARNGRVEGIIAVSDTLKESAKPAVQELKRGIKVGITGDNWRSAEAISRELNLDLVI -----iiii-----------1111------------------------------------ 
AEVLPHQKSEEVKKLQAKEVVAFVGDGINDAPALAQADLGIAVGSGDIVLIRDDLRDVVA ---3333----------------------3333--------------------------3 AIQ 333 >PAS FACTOR; SWP:NA; PDB:2B8IA; IRPYMKALIYETLVNLANQDPEQHATIRQNLYEQLDLPFDKQLALYAGALGPASSGKLEN -3333--------------3333-----------------------------1111---- HEAISNAVDSVVQLLEI ----------------- >HYPOTHETICAL PROTEIN MJ07; SWP:Q58174; PDB:2B8MA; GIEKVYEFKRDAKTKVVEKLVNTEHVQINHIVLPRGEQPKHYSNSYVHLIIIKGETLTLE ----------------------1111-------2222---------------------!! DQEPHNYKEGNIVYVPFNVKLIQNINSDILEFFVVKAPHPKKLNA !!-----2222---------------------------3333--- >GLYCERATE KINASE, PUTATIV; SWP:Q9X1S1; PDB:2B8NA; PESLKKLAIEIVKKSIEAVFPDRAVKETLPKLNLDRVILVAVGKAAWRAKAAYEVLGKKI ---------------------------3333-----------1111---------!!!!- RKGVVVTKYGHSEGPIDDFEIYEAGHPVPDENTIKTTRRVLELVDQLNENDTVLFLLSGG -------2222------------------3333--------------1111--------3 GSSLFELPLEGVSLEEIQKLTSALLKSGASIEEINTVRKHLSQVKGGRFAERVFPAKVVA 333-----2222--------------------------------iiii------------ LVLSDVLGDRLDVIASGPAWPDSSTSEDALKVLEKYGIETSESVKRAILQETPKHLSNVE -----22221111%%%%------------------------------------------- IHLIGNVQKVCDEAKSLAKEKGFNAEIITTSLDCEAREAGRFIASIKEVKFKDRPLKKPA ------------------1111-------------------------------------- ALIFGGETVVHVKGNGIGGRNQELALSAAIALEGIEGVILCSAGTDGTDGPTDAAGGIVD -----------------------------1111----------3333------------1 GSTAKTLKAGEDPYQYLKNNDSYNALKKSGALLITGPTGTNVNDLIIGLIV 111--------3333-1111------1111--------------------- >PROBABLE NUCLEOSIDE DIPHO; SWP:Q5UQL3; PDB:2B8QA; GLQRTLVLIKPDAFERSLVAEIMGRIEKKNFKIVSMKFWSKAPRNLIEQHYKEHSEQSYF --------------------------1111------------3333----3333--1111 NDNCDFMVSGPIISIVYEGTDAISKIRRLQGNILTPGTIRGDLANDIRENLIHASDSEDS -----1111----------------------1111------------------------- AVDEISIWFPET ------------ >THYMIDINE KINASE; SWP:Q9PPP5; PDB:2B8TA; IGWIEFITGPMFAGKTAELIRRLHRLEYADVKYLVFKPKIDTRSIRNIQSRTGTSLPSVE ------------------------3333------------3333---------------- 
VESAPEILNYIMSNSFNDETKVIGIDEVQFFDDRICEVANILAENGFVVIISGLDKNFKG --3333------33331111------3333------------1111----------1111 EPFGPIAKLFTYADKITKLTAICNECGAEATHSLRKIDGKHADYNDDIVKIGCQEFYSAV --!!!!3333--------------------------iiii--1111------3333---- CRHHHKVPNRPYLNSNSEEFIKFFKN 3333--2222---1111--------- >PURINE NUCLEOSIDE PHOSPHO; SWP:NA; PDB:2B94A; EMQRHIKLTPSQTTPVVLVVGDPGRVDKVKMLCDSYVDLAEYKSVECTYKGQKFLCVSHG --------3333-----------------1111---------------iiii-------2 VGSAGCAICFEELMNNGAKVIIRAGSCGSLQPTQMKRGDICICNAAVREDRVSHLMIYSD 222----------1111-------------1111-2222-----------3333---333 FPAVADFEVYDTLNKVAQELEVPVFNGISLSSDLYYPHKIIPTRLEDYSKANVAVVEMEV 3----------------------------------------1111---1111-------- ATLMVMGTLRKVKTGGIFIVDGCPLKWNLVPEKLENMIKISLETCARLAKKY ----------------------1111---------------------3333- >DYNEIN LIGHT CHAIN 2A; SWP:Q9NP97; PDB:2B95A; MAEVEETLKRLQSQKGVQGIIVVNTEGIPIKSTMDNPTTTQYASLMHSFILKARSTVRDI -----------------------3333-------3333------------------3333 DPQNDLTFLRIRSKKNEIMVAPDKDYFLIVIQNPTE ------------3333-------------------- >HYDROPHOBIN II; SWP:P79073; PDB:2B97A; AVCPTGLFSNPLCCATNVLDLIGVDCKTPTIAVDTGAIFQAHCASKGSKPLCCVAPVADQ -----------------%%%%----------------------1111------------- ALLCQKAIGT ------2222 >RIBOFLAVIN SYNTHASE; SWP:Q58584; PDB:2B99A; TKKVGIVDTTFARVDMASIAIKKLKELSPNIKIIRKTVPGIKDLPVACKKLLEEEGCDIV ---------------------------1111--------1111----------------- MALGMPGKAEKDKVCAHEASLGLMLAQLMTNKHIIEVFVHEDEAKDDKELDWLAKRRAEE --------3333---------------------------3333----------------- HAENVYYLLFKPEYLTRMAGKGLRQG ---------------1111------- >E7 PROTEIN; SWP:P06465; PDB:2B9DA; MKQPYAVVASCAYCEKLVRLTVLADHSAIRQLEEMLLRSLNIVCPLCTLQRQ -------------------------------------------3333----- >NOL1/NOP2/SUN DOMAIN FAMI; SWP:Q96P11; PDB:2B9EA; QLPRFVRVNTLKTCSDDVVDYFKRQGFSYQGRASSLDDLRALKGKHFLLDPLMPELLVFP --------3333----------1111--------33331111----------2222---- AQTDLHEHPLYRAGHLILQDRASCLPAMLLDPPPGSHVIDACAAPGNKTSHLAALLKNQG 
----1111-----------3333---------2222-------------------%%%%- KIFAFDLDAKRLASMATLLARAGVSCCELAEEDFLAVSPSDPRYHEVHYILLDPSCSGVR --------------------------------3333-11111111--------------- LHALAGFQQRALCHALTFPSLQRLVYSTCSLCQEENEDVVRDALQQNPGAFRLAPALPAW -----------------1111----------3333-------11112222------1111 PHRGLSTFPGAEHCLRASPETTLSSGFFVAVIERV -------2222------3333-------------- >MITOGEN-ACTIVATED PROTEIN; SWP:P16892; PDB:2B9HA; MPKRIVYNISSDFQLKSLLGEGAYGVVCSATHKPTGEIVAIKKIEPFDKPLFALRTLREI -3333----3333-------------------1111------------3333-------- KILKHFKHENIITIFNIQRPDSFENFNEVYIIQELMQTDLHRVISTQMLSDDHIQYFIYQ -------1111----------3333----------------------------------- TLRAVKVLHGSNVIHRDLKPSNLLINSNCDLKVCDFGLARIIDVEFVATRWYRAPEVMLT ------------------3333---1111------1111---------3333-3333--- SAKYSRAMDVWSCGCILAELFLRRPIFPGRDYRHQLLLIFGIIGTPHSDNDLRCIESPRA ----3333---------------------------------------33331111----- REYIKSLPMYPAAPLEKMFPRVNPKGIDLLQRMLVFDPAKRITAKEALEHPYLQTYHDPN ---1111------3333-1111--------------3333---------33331111111 DEPEGEPIPPSFFEFDHYKEALTTKDLKKLIWNEIFS 1-------33333333--------------------- >ANTIMICROBIAL PEPTIDE LCI; SWP:P82243; PDB:2B9KA; AIKLVQSPNGNFAASFVLDGTKWIFKSKYYDSSKGYWVGIYEVWDRK -----------------%%%%---------3333------------- >PROPHENOLOXIDASE ACTIVATI; SWP:Q9GRW0; PDB:2B9LA; GHMAVVNIFGNASEYIPPGYEAPLGALTALPRCGTGADQGKKVCIVYHRCDGVTNTVTPE -------1111--------------3333-----!!!!-------1111----------- EVINTTGEGIFDIRENANECESYLDVCCGLPPVVPVLKPSFCGIRNERGLDFKITGQTNE -------------2222----1111--------------------1111----------- AEYGEFPWMVAVLKANEEQLVCGGSLIAPSVVLTGAHCVNSYQSNLDAIKIRAGEWDTLT -22221111------------------1111---33331111--------------1111 EKERLPYQERKIRQVIIHSNFNPKTVVNDVALLLLDRPLVQADNIGTICLPQQSQIFDST -----------------1111--------------------1111------2222----- ECFASGWGKKEFGSRHRYSNILKKIQLPTVDRDKCQADLRNTRLGLKFVLDQTFVCAGGE -------3333------------------------------3333-----1111------ QGKDTCTGDGGSPLFCPDPRNPSRYMQMGIVAWGIGCGDENVPGVYANVAHFRNWIDQEM 
--------2222-----1111-----------1111--1111-----3333--------- QAKGLSTTPYVE 1111--3333-- >HUMAN CYCLIN B1; SWP:P14635; PDB:2B9RA; VKDIYAYLRQLEAAQAVRPKYLLGREVTGNMRAILIDWLVQVQMKFRLLQETMYMTVSII -------33331111----1111------------------------------------- DRFMQNNSVPKKMLQLVGVTAMFIASKYEEMYPPEIGDFAFVTDNTYTKHQIRQMEMKIL ----------1111----------3333------3333---------------------- RALNFGLGRPLPLHFLRRASKIGEVDVEQHTLAKYLMELTMLDYDMVHFPPSQIAAGAFS ----------3333-----------------------3333-1111-------------- LALKILDNGEWTPTLQHYLSYTEESLLPVMQHLAKNVVMVNQGLTKHMTVKNKYATSKHA -------------3333----3333-------------1111-------------1111- KISTLPQLNSALVQ 3333---------- >TOPOISOMERASE I-LIKE PROT; SWP:Q9GPZ9; PDB:2B9SA; DLNWWEQENLRIAMKGERRWETLAHNGVLFPPEYEPHGIPIFYDGREFKMTPEEEEVATM --1111-------2222-------------------------iiii-------------- FAVMKEHDYYRMEVFRRNFFESWREILDKRQHPIRRLELCDFEPIYQWHLVQREKKLSRT 3333--3333---------------1111------3333--------------------- KEEKKAIKEKQDAEAEPYRYCVWDGRREQVANFRVEPPGLFRGRGKHPLMGKLKVRVQPE --------------3333----iiii--------------------1111-------333 DITINIGETAEVPVPPAGHKWAAVQHDHTVTWLAMWRDSVAGNMKYVMLAPSSSVKGQSD 3-----1111-----2222-------1111-------------------33333333--- MVKFEKARKLKDKVDDIRASYMEDFKSNDLHVAQRAVAMYFIDRLALRVGNEKGEDEADT ---------1111---------3333---------------------------1111--- VGCCSLRVEHIQLMPDNIVRFDFLGKDSIRYQNDVAVLPEVYALLQRFTRRKSPGMDIFD --11113333--------------2222---------3333-------11111111--11 QLNPTQLNDHLKSFMDGLSAKVFRTYNASITLDRWFKEKPWSTADKLAYFNKANTEVAIL 11--------333322223333-------------------------------------- CNHQKS ------ >DNA topoisomerase I-like ; SWP:Q8WQM6; PDB:2B9SB; KAVSLGTSKINYIDPRIICSWAKAQDVPINKIFSATIQKKFPWAMNAENFDF ---------------------------3333---------1111-------- >PUTATIVE AMINOOXIDASE; SWP:Q6A8X5; PDB:2B9WA; SISKDSRIAIIGAGPAGLAAGMYLEQAGFHDYTILERTDHVGGKCHSPNYHGRRYEMGAI --1111------------------1111-------------!!!!----iiii------- MGVPSYDTIQEIMDRTGDKVDGPKLRREFLHEDGEIYVPEKDPVRGPQVMAAVQKLGQLL 
--1111------------------------1111---3333------------------- ATKYQGYDANGHYNKVHEDLMLPFDEFLALNGCEAARDLWINPFTAFGYGHFDNVPAAYV -11111111-------3333--------1111--3333--11111111--3333------ LKYLDFVTMMSFAKGDLWTWADGTQAMFEHLNATLEHPAERNVDITRITREDGKVHIHTT -------------------11113333----3333---------------%%%%----11 DWDRESDVLVLTVPLEKFLDYSDADDDEREYFSKIIHQQYMVDACLVKEYPTISGYVPDN 11-----------3333--------------1111---------------------3333 MRPERLGHVMVYYHRWADDPHQIITTYLLRNHPDYADKTQEECRQMVLDDMETFGHPVEK -3333----------3333------------1111------------------------- IIEEQTWYYFPHVSSEDYKAGWYEKVEGMQGRRNTFYAGEIMSFGNFDEVCHYSKDLVTR --------------------------1111-%%%%---3333------------------ FFV --- >ARCHEAL EXOSOME RNA BINDI; SWP:O29758; PDB:2BA0A; RKIVLPGDLLSTNPRAAGYGTYVEGGKVYAKIIGLFDQTETHVRVIPLKGRYTPSVGDVV ----2222----1111-2222--iiii-----------1111------------2222-- IGIIREVAANGWAVDIYSPYQAFLPVSENPEMKPNKKPNEVLDIGDAIIAKVLNIDPKMK -------1111-------------33331111----1111--2222---------1111- VTLTMKDRICRPIRFGRIVAINPARVPRVIGKKGSMIKLLKSELDVQIVVGQNGLIWVNG ------3333-----------1111-----2222----------------1111------ DRRKVSIAEEAIYLIEQEAHTEGLTDRVAEFIKRRKAD -------------------------------------- >Probable exosome complex ; SWP:O29757; PDB:2BA0F; KPEKLIVDGLRLDGRKFDELRPIKIEASVLKRADGSCYLEMGKNKVIAAVFGPREVHPRH ------iiii1111--------------------------!!!!---------------- LQDPSKAIIRYRYNMAPFSVEERKRPGPDRRSIEISKVSKEAFEAVIMKELFPRSAIDIF ---------------1111-----------------------1111-33332222----- VEVLQADAGSRTACLNAASVALVDAGVPMKGMITSVAVGKADGQLVLDPMKEEDNFGEAD ------2222------------1111--------------iiii---------------- MPFAFLIRNGKIESIALLQMDGRMTRDEVKQAIELAKKGALQIYEMQREAILRRYIEVGE ------------------------------------------------------------ EMDEIT ------ >Probable exosome complex ; SWP:O29756; PDB:2BA0I; EDILVDIKRDYVLSKLRDNERIDGRGFDEFRKVEIIPNVIEKAEGSALVKLGDTQVVVGV ---------------1111--------------------3333-------!!!!------ KMQPGEPYPDTPDRGVIIVNAELVPLASPTFEPPDENSIELARVVDRGIRESEAVDLSKL 
-------1111--------------------------------------1111--3333- VIEEGEKVWIVFVDIHALDDDGNLLDASALAAIAALMNTKVPAERFDLGEDYLLPVRDLP -----------------------------------------3333--------------- VSVTSLIVGNKYLVDPSREEMSVGDTTLTITTDKDDNVVAMQKSGGYLLDEKLFDELLDV ----------------33333333--------1111------------------------ SINCARKLREKFK -------3333-- >ARCHAEAL EXOSOME RNA BIND; SWP:O30033; PDB:2BA1A; MRFVMPGDRIGSAEEYVKGEGVYEEGGELFAAVAGKLIIKDRVAKVESISPIPEIVKGDV ----2222----------2222--%%%%---------------------------2222- VLGRVVDLRNSIALIEVSSKKGENRGPSNRGIGILHVSNVDEGYVKEISEAVGYLDILKA -------------------2222------------3333--------1111-2222---- RVIGDNLRLSTKEEEMGVLRALCSNCKTEMVREGDILKCPECGRVEKRKISTDYGKGEW ------------1111------------------------------------2222--- >HYPOTHETICAL UPF0134 PROT; SWP:P75103; PDB:2BA2A; GTRYVTHKQLDEKLKNFVTKTEFKEFQTVVMESFAVQNQNIDAQGEQIKELQVEQKAQGK 1111--------3333-------------------------------------------- TLQLILEALQGINKRLDNLES --------------------- --------------------------------------------------- >ENDOCHITINASE (26 KD); SWP:P23951; PDB:2BAA; SVSSIVSRAQFDRMLLHRNDGACQAKGFYTYDAFVAAAAAFPGFGTTGSADAQKREVAAF 3333----------1111-1111-2222--------------2222-------------- LAQTSHETTGGWATAPDGAFAWGYCFKQERGASSDYCTPSAQWPCAPGKRYYGRGPIQLS -----------1111--1111------------------------2222-----1111-- HNYNYGPAGRAIGVDLLANPDLVATDATVGFKTAIWFWMTAQPPKPSSHAVIAGQWSPSG --------------3333-3333------------------------------------- ADRAAGRVPGFGVITNIINGGIECGHGQDSRVADRIGFYKRYCDILGVGYGNNLDCYSQR --1111---3333----------------------------------------------- PFA --- >FIBRINOGEN ALPHA CHAIN; SWP:P02672; PDB:2BAFA; MGTFREEGSVSSGTKQEFHTGKLVTTKGDKELLIDNEKVTSGHTTTTRRSCSKVITKTVT ------------------------------11113333------3333------------ NADGRTETTKEVVKSEDGSDCGDADFDWHHTFPSRGNLDDFFHRDKDDFFTRSSHEFDGR 1111-------------------------------------------------------- TGLAPEFAALGESGSSSSKTSTHSKQFVSSSTTVNRGGSAIESKHF -------------------------------1111----------- >GENOME POLYPROTEIN; SWP:P32540; PDB:2BAIA; 
MATTMEQEICAHSMTFEECPKCSALQYRNGFY --------------33331111----3333-- >YKUI PROTEIN; SWP:O35014; PDB:2BASA; LDPLDILTNIDDVLPYYQAIFSAEEQKVVGYEVLGRILADSEIQSLGPFFLDAGIPEEYK -----11113333-------------------------%%%%-------------3333- LEVDNRIIRQALDRFLEADSDLLIFNQDANLLLDHGESFLELLKEYEAKGIELHRFVLEI ---------------------------3333--iiii---------1111-1111----- TEHNFEGDIEQLYHLAYYRTYGIKIAVDNIGKESSNLDRIALLSPDLLKIDLQALKSPSY 3333---1111-----3333---------------3333------------3333--333 EHVLYSISLLARKIGAALLYEDIEANFQLQYAWRNGGRYFQGYYLVSPSETFLERDVLKQ 3------------------------3333------------3333--------1111--- RLKTEFHQFITHEKKKLETVYEHSEQFYKRVHQAVTSLRKNNLSSDDDFIKKLAEELTDC --------------------------------------2222--------------1111 SFRIYCDEEGDQLTGNVFKQDGEWIYQPEYAEKNWSWRPYFLENIRRNLRKGFFSDLYSD ------1111---------iiii----1111---3333-3333----------------- LETGEIRTFSYPDDQYLFIDLPYSYLYEQDGLI ---------------------33331111---- >PRE-MRNA SPLICING FACTOR ; SWP:P32523; PDB:2BAYA; MLCAISGKVPRRPVLSPKSRTIFEKSLLEQYVKDTGNDPITNEPLSIEEIVEIVPS ---------------------------------------------3333------- >HYPOTHETICAL PROTEIN BSU2; SWP:O34919; PDB:2BAZA; MQIKIKYLDETQTRINKMEQGDWIDLRAAEDVAIKKDEFKLVPLGVAMELPEGYEAHVVP --------1111----------------------2222------------2222------ RSSTYKNFGVIQTNSMGVIDESYKGDNDFWFFPAYALRDTKIKKGDRICQFRIMKKMPAV 11113333---1111----3333-1111--------------2222-------------- DLIEVDRL -------- >cobalamin biosynthesis pr; SWP:O29536_ARCFU; PDB:2BB3A; HIWIVGSGTCRGQTTERAKEIIERAEVIYGSRRALELAGVVDDSRARILRSFKGDEIRRI ---------------------------------------1111----------------- EEGREREVAVISTGDPVAGLGRVLREIAEDVEIKIEPAISSVQVALARLKVDLSEVAVVD -------------------33331111------------------------1111----- CFDAELTELLKYRHLLILADSHFPLERLGKRRVVLLENLCEGERIREGNADSIELESDYT --3333--3333-------1111-------------------------3333-------- IIFVEREV -------- >TRANSCOBALAMIN II; SWP:P20062; PDB:2BB5A; EMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRLSSLQAGTKEDLYLHSLKL --------1111---33333333----1111----------------3333--------- 
GYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVRGHKGDRLVSQLKWFLEDE ----------------------------------1111---------------------- KRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLLYAVEPFHQGHHSVDTAAM --------------------------1111---3333-----3333-------------- AGLAFTCLKRSNFNPGRRQRITMAIRTVREEILKAQTPEGHFGNVYSTPLALQFLMTSPM -------------3333-------------------3333---1111------1111--- PGAELGTACLKARVALLASLQDGAFQNALMISQLLPVLNHKTYIDLIFPDCLAPRVMLEP --------------------------3333------1111-3333----1111------- AAETIPQTQEIISVTLQVLSLLPPYRQSISVLAGSTVEDVLKKAHELGGFTYETQASLSG -------------------------------22223333-----------------1111 PYLTSVMGKAAGEREFWQLLRDPNTPLLQGIADYRPKDGETIELRLVSW -----iiii--------------------1111---2222--------- >TRANSCOBALAMIN II; SWP:Q9XSC9; PDB:2BB6A; NICEITEVDSTLVERLGQRLLPWMDRLSQEQLNPSIYVGLRLSSLQAGAKEAHYLHSLKL 1111----3333-------3333----1111---------------!!!!---------- SYQQSLLRPASNKDDNDSEAKPSMGQLALYLLALRANCEFIGGRKGDRLVSQLKRFLEDE --------1111----------------------1111---------------------- KRAIGHNHQGHPRTSYYQYSLGILALCVHQKRVHDSVVGKLLYAVEHKPHLLQDHVSVDT --------------------------1111---3333---------------3333---- MAMAGMAFSCLELSNLNPKQRNRINLALKRVQEKILKAQTPEGYFGNVYSTPLALQLLMG ----------------3333---------------11113333---3333---------- SLRPSVELGTACLKAKAALQASLQHKTFQNPLMISQLLPVLNQKSYVDLISPDCQAPRAL --------------------3333---------------1111-3333------------ LEPALETPPQAKVPKFIDVLLKVSGISPSYRHSVSVPAGSSLEDILKNAQEHGRFRFRTQ ------------------------------------2222-------------------- ASLSGPFLTSVLGRKAGEREFWQVLRDPDTPLQQGIADYRPKDGETIELRLVGW -1111-----iiii--------------------3333---2222--------- >EPHRIN TYPE-B RECEPTOR 4; SWP:P54760; PDB:2BBAA; HHHHHEETLLNTKLETADLKWVTFPQVDGQWEELSGLDEEQHSVRTYEVCDVQRAPGQAH ----------3333------------2222-------1111-------------1111-- WLRTGWVPRRGAVHVYATLRFTMLECLSLPRAGRSCKETFTVFYYESDADTATALTPAWM --------!!!!------------333322221111----------------1111---- ENPYIKVDTVAAEHLTRKRPGAEATGKVNVKTLRLGPLSKAGFYLAFQDQGACMALLSLH 
------------------2222-------------------------------------- LFYKK ----- >COAT PROTEIN; SWP:Q6Q0J0; PDB:2BBDA; GEIYTETLQQTYAWTAGTNIPIKIPRNNFIRKIRVQLIGSISNSGTAAVTLPSAPFPYNL ------------------------------------------------------------ VQTFNLSYEGSKTLYSVSGTGLGILYYTTKGQNPAYPAPGTSVPASGSVNLNVWEFDLAR --------------------------1111-------2222--2222------------- FPATVQNIILSILTGQAPSGVSINASFYITITYERVTAQEILSEGGLGADGEPLATVLPK -----------------2222--------------------1111--1111--------- VIEIPTFNVPASSAPIHVAYLQPGQIYKRQLVYVINSTSGINNTDPTEYELKIVRGVPTD --------------------------------------!!!!------------------ KIKVSWAALQAENQAEYQVAPYSGASAIIDFRKYFNGDLDLTHAPSDSIEYDLALQNQDN ----------------------1111---3333--------------------------- VYSLYVSYVLPYYDQLAAL -------------3333-- >HYPOTHETICAL PROTEIN SO05; SWP:Q8EJE0; PDB:2BBEA; DYKINQQQIVCVASFLSKEGKTEALIAALASLIPDTRREAGCIRYELNVSRDEPRRVTFV -----------------2222------------3333-1111----------1111---- EKFVDIAAFDEHCAKDAIQHYFHQVMPELVESFHVETYHQVIA ------------------------3333--------------- >DIVALENT CATION TRANSPORT; SWP:Q9WZ31; PDB:2BBHA; PPGTLVYTGKYREDFEIEVNYSIEEFREFKTTDVESVLPFRDSSTPTWINITGIHRTDVV 1111-----------------1111-------3333--3333----------3333---- QRVGEFFGTHPLVLEDILNVHQRPKVEFFENYVFIVLKFTYDKHELESEQVSLILTKNCV ---------3333-----1111------1111-----------------------!!!!- LFQEKIGDVFDPVRERIRYNRGIIRKKRADYLLYSLIDALVDDYFVLLEKIDDEIDVLEE ----------------1111--3333---------------------------------- EVTVQRTHQLKRNLVELRKTIWPLREVLSSLYRDVPPLIE --3333-----------------------------3333- >Methylamine dehydrogenase; SWP:P29894; PDB:2BBKH; DEPRILEAPAPDARRVYVNDPAHFAAVTQQFVIDGEAGRVIGMIDGGFLPNPVVADDGSF -----------1111-----%%%%------------------------------1111-- IAHASTVFSRIARGERTDYVEVFDPVTLLPTADIELPDAPRFLVGTYPWMTSLTPDGKTL ----------------------------------------------1111---1111--- LFYQFSPAPAVGVVDLEGKAFKRMLDVPDCYHIFPTAPDTFFMHCRDGSLAKVAFGTEGT ---------------1111-------------------------1111------------ PEITHTEVFHPEDEFLINHPAYSQKAGRLVWPTYTGKIHQIDLSSGDAKFLPAVEALTEA 
----------1111---------1111-----1111------1111-------------- ERADGWRPGGWQQVAYHRALDRIYLLVDQRDEWRHKTASRFVVVLDAKTGERLAKFEMGH -----------------1111---------1111-------------------------- EIDSINVSQDEKPLLYALSTGDKTLYIHDAESGEELRSVNQLGHGPQVITTADMG -------------------1111-------------------------------- >Methylamine dehydrogenase; SWP:P22619; PDB:2BBKL; TDPRAKWVPQDNDIQACDYWRHCSIDGNICDCSGGSLTNCPPGTKLATASVASCYNPTDG -1111-------1111--3333------3333--------2222------------1111 QSYLIAYRDCCGYNVSGRCPCLNTEGELPVYRPEFANDIIWCFGAEDDAMTYHCTISPIV -----------------------2222-11111111-----2222%%%%----------- GKAS ---- >VIRAL CASP8 AND FADD-LIKE; SWP:Q98325; PDB:2BBRA; SDSKEVPSLPFLRHLLEELDSHEDSLLLFLCHDAAPGCTTVTQALCSLSQQRKLTLAALV -3333----------1111---------1111--2222----------1111-------- EMLYVLQRMDLLKSRFGLSKEGAEQLLGTSFLTRYRKLMVCVGEELDSSELRALRLFACN -----------------------1111--------------1111--------------- LNPSLSTALSESSRFVELVLALENVGLVSPSSVSVLADMLRTLRRLDLCQQLVEYEQQEQ ---3333--1111---------1111--1111---------------------------- ARYRYCLHH --------- >Cystic fibrosis transmemb; SWP:P13569; PDB:2BBSA; STTEVVMENVTAFWEEGFGELFEKAKGTPVLKDINFKIERGQLLAVAGSTGAGKTSLLMM --------------------------------------2222------2222-------- IMGELEPSEGKIKHSGRISFCSQNSWIMPGTIKENIIGVSYDEYRYRSVIKACQLEEDIS -------------------------------------------------------3333- KFAEKDNIVLITLSGGQRARISLARAVYKDADLYLLDSPFGYLDVLTEKEIFESCVCKLM -1111----------------------------------2222--------------111 ANKTRILVTSKMEHLKKADKILILHEGSSYFYGTFSELQNLRPDFSSKLMSFDQFSAERR 1-----------------------iiii--------------3333----1111------ NSILTETLHRFSL ------------- >SUPPRESSOR OF CYTOKINE SI; SWP:O35718; PDB:2BBUA; EYQLVVNAVRKLQESGFYWSAVTGGEANLLLSAEPAGTFLIRDSSDQRHFFTLSVKTQSG ----------------------------3333--2222------!!!!------------ TKNLRIQCEGGSFSLQSDPRSTQPVPRFDCVLKLVHHYMPPPGTPSFSLPPTEPSSEVPE ------------------------2222-----3333----------------------- QPPAQALPGSTPKRAYYIYSGGEKIPLVLSRPLSSN ------------------------------------ >PROTEIN (BLACK BEETLE 
VIR; SWP:P04329; PDB:2BBVA; LTRLSQPGLAFLKCAFAPPDFNTDPGKGIPDRFEGKVVTRKDVLNQSINFTANRDTFILI -----3333-------1111---------------------------------------- APTPGVAYWVADVPAGTFPISTTTFNAVNFPGFNSMFGNAAASRSDQVSSFRYASMNVGI -------------2222--1111--------3333------------------------- YPTSNLMQFAGSITVWKCPVKLSNVQFPVATTPATSALVHTLVGLDGVLAVGPDNFSESF -------------------------------------------1111-----------11 IKGVFSQSVCNEPDFEFSDILEGIQTLPPANVTVATSGQPFNLAAGAEAVSGIVGWGNMD 11------------------------------3333----------1111---------- TIVIRVSAPTGAVNSAILKTWACLEYRPNPNAMLYQFGHDSPPCDEVALQEYRTVARSLP --------2222----------------1111-1111------------------1111- VAVIAAQN ---1111- >ADENYLATE KINASE 4, AK4; SWP:P27144; PDB:2BBWA; KLLRAVILGPPGSGKGTVCQRIAQNFGLQHLSSGHFLRENIKASTEVGEMAKQYIEKSLL ---------2222----------------------------------------3333--- VPDHVITRLMMSELENRRGQHWLLDGFPRTLGQAEALDKICEVDLVISLNIPFETLKDRL -3333-----------1111----------------3333-------------------1 SRRWIHPPSGRVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKPVIE 111---1111---1111----2222----------1111--------------------- LYKSRGVLHQFSGTETNKIWPYVYTLFSNKITPIQSKEAY --1111--------3333--------3333-----1111- ------------------------------------------------- >NADH OXIDASE; SWP:NA; PDB:2BC0A; WGSKIVVVGANHAGTACIKTMLTNYGDANEIVVFDQNSNISFLGGMALWIGEQIAGPEGL -------------------------3333------------------------------- FYSDKEELESLGAKVYMESPVQSIDYDAKTVTALVDGKNHVETYDKLIFATGSQPILPPI --------1111----------------------%%%%---------------------2 KGAEIKEGSLEFEATLENLQFVKLYQNSADVIAKLENKDIKRVAVVGAGYIGVELAEAFQ 222--2222------2222----3333---------3333-------------------- RKGKEVVLIDVVDTCLAGYYDRDLTDLMAKNMEEHGIQLAFGETVKEVAGNGKVEKIITD ---------------2222-3333------------------------------------ KNEYDVDMVILAVGFRPNTTLGNGKIDLFRNGAFLVNKRQETSIPGVYAIGDCATIYDNA ------------------3333------1111----1111---2222---1111------ TRDTNYIALASNAVRTGIVAAHNACGTDLEGIGVQGSNGISIYGLHMVSTGLTLEKAKRL ----------------------1111---------------iiii------------111 
GFDAAVTEYTDNQKPEFIEHGNFPVTIKIVYDKDSRRILGAQMAAREDVSMGIHMFSLAI 1-------------1111------------------------------3333-------- QEGVTIEKLALTDIFFLPHFNKPYNYITMAALGAKD ----33331111----3333---------------- >HLA class II histocompati; SWP:Q5SNZ7; PDB:2BC4A; LQNHTFLHTVYCQDGSPSVGLSEAYDEDQLFFFDFSQNTRVPRLPEFADWAQEQGDAPAI ------------------------!!!!------1111-----33331111--------- LFDKEFCEWMIQQIGPKLDGKIPVSRGFPIAEVFTLKPLEFGKPNTLVCFVSNLFPPMLT --------------3333---------------------2222----------------- VNWQHHSVPVEGFGPTFVSAVDGLSFQAFSYLNFTPEPSDIFSCIVTHEIDRYTAIAYWV ----%%%%----------------------------1111-------------------- PRNALPSD -------- >HLA class II histocompati; SWP:P28068; PDB:2BC4B; FVAHVESTCLLDDAGTPKDFTYCISFNKDLLTCWDPEENKMAPCEFGVLNSLANVLSQHL -----------1111----------%%%%-----------------1111---------- NQKDTLMQRLRNGLQNCATHTQPFWGSLTNRTRPPSVQVAKTTPFNTREPVMLACYVWGF ---------------------------1111----------------------------- YPAEVTITWRKNGKLVMPHSSAHKTAQPNGDWTYQTLSHLALTPSYGDTYTCVVEHIGAP ----------iiii--1111-----------------------------------1111- EPILRDWTPGL ----------- >CHOLESTEROL ESTERASE; SWP:P30122; PDB:2BCE; AKLGSVYTEGGFVEGVNKKLSLFGDSVDIFKGIPFAAAPKALEKPERHPGWQGTLKAKSF -------1111---------3333------------------------------------ KKRCLQATLTQDSTYGNEDCLYLNIWVPQGRKEVSHDLPVMIWIYGGAFLMGLSNYLYDG -------3333-----------------------------------------1111---- EEIATRGNVIVVTFNYRVGPLGFLSTGDSNLPGNYGLWDQHMAIAWVKRNIEAFGGDPDQ ------------------3333-----3333------------------3333---1111 ITLFGESAGGASVSLQTLSPYNKGLIKRAISQSGVGLCPWAIQQDPLFWAKRIAEKVGCP ------------------3333------------1111------3333------1111-- VDDTSKMAGCLKITDPRALTLAYKLPLGSTEYPKLHYLSFVPVIDGDFIPDDPVNLYANA --------------------------------3333---------------333333331 ADVDYIAGTNDMDGHLFVGMDVPAINSNKQDVTEEDFYKLVSGLTVTKGLRGAQATYEVY 111--------11113333---3333------------------3333----------11 TEPWAQDSSQETRKKTMVDLETDILFLIPTKIAVAQHKSHAKSANTYTYLFSQPSRMPIY 11-%%%%-------------------------------------------------3333 
PKWMGADHADDLQYVFGKPFATPLGYRAQDRTVSKAMIAYWTNFARTGDPNTGHSTVPAN 1111--2222---1111----3333-3333------------------3333-------- WDPYTLEDDNYLEINKQMDSNSMKLHLRTNYLQFWTQTYQALPTVTPVVIGF -----3333---------1111---------------3333----------- >PROBABLE D-ALANYL-D-ALANI; SWP:Q7D6F2; PDB:2BCFA; VQPAGSVPIPDGPAQTWIVADLDSGQVLAGRDQNVAHPPASTIKVLLALVALDELDLNST --2222-------------------------1111----3333------------1111- VVADVADTQAECNCVGVKPGRSYTVRQLLDGLLLVSGNDAANTLAHLGGQDVTVAKNAKA ---3333----------2222--------------------------------------- ATLGATSTHATTPSGLDGPGGSGASTAHDLVVIFRAAANPVFAQIIAEPSAFPSDEQLIV 11111111---1111--1111----------------------3333------------- NQDELLQRYPGAIGGKTGYTNAARKTFVGAAARGGRRLVIAYGLVKEGGPTYWDQAATLF --3333--2222--------3333--------%%%%---------2222----------- DWGFALNPQASVGSL ------1111----- >Rab GDP-dissociation inhi; SWP:P39958; PDB:2BCGG; TIDTDYDVIVLGTGITECILSGLLSVDGKKVLHIDKQDHYGGEAASVTLSQLYEKFKQNP ----------------------------------------!!!!---------------- ISKEERESKFGKDRDWNVDLIPKFLMANGELTNILIHTDVTRYVDFKQVSGSYVFKQGKI -----------3333----------1111-----------1111-----------iiii- YKVPANEIEAISSPLMGIFEKRRMKKFLEWISSYKEDDLSTHQGLDLDKNTMDEVYYKFG ----------------------------------33331111--------------1111 LGNSTKEFIGHAMALWTNDDYLQQPARPSFERILLYCQSVARYGKSPYLYPMYGLGELPQ ------------------3333----------------------------2222------ GFARLSAIYGGTYMLDTPIDEVLYKKDTGKFEGVKTKLGTFKAPLVIADPTYFPEKCKST ------1111--------------2222-------1111---------33333333---- GQRVIRAICILNHPVPNTSNADSLQIIIPQSQLGRKSDIYVAIVSDAHNVCSKGHYLAII -----------------%%%%-------3333------------3333---2222----- STIIETDKPHIELEPAFKLLGPIEEKFMGIAELFEPREDGSKDNIYLSRSYDASSHFESM -------3333-----1111-------------------3333----------------- TDDVKDIYFRVTGHPLVLKQRQ ---------------------- >GTP-binding protein YPT1; SWP:P01123; PDB:2BCGY; SEYDYLFKLLLIGNSGVGKSCLLLRFSDDTYTNDYISTIGVDFKIKTVELDGKTVKLQIW -------------2222--------------------------------%%%%------- DTAGQERFRTITSSYYRGSHGIIIVYDVTDQESFNGVKMWLQEIDRYATSTVLKLLVGNK 
-iiii1111------2222-------1111------------------1111-------3 CDLKDKRVVEYDVAKEFADANKMPFLETSALDSTNVEDAFLTMARQIKESMSQQNLNETT 3331111----------------------1111--------------------------3 QKKEDKGNVNLKGQ 333------1111- >Guanine nucleotide-bindin; SWP:P21279; PDB:2BCJQ; RELKLLLLGTGESGKSTFIKQMRIIHGSGYSDEDKRGFTKLVYQNIFTAMQAMIRAMDTL ---------2222-----------------33333333---------------------- KIPYKYEHNKAHAQLVREVDVEKVSAFENPYVDAIKSLWNDPGIQECYDRRREYQLSDST -----3333------11111111----3333------------------3333-----33 KYYLNDLDRVADPSYLPTQQDVLRVRVPTTGIIEYPFDLQSVIFRMVDVGGQRSERRKWI 331111---------------1111--------------------------3333--333 HCFENVTSIMFLVALSEYDQVLVESDNENRMEESKALFRTIITYPWFQNSSVILFLNKKD 3------------1111----3333------------------1111------------- LLEEKIMYSHLVDYFPEYDGPQRDAQAAREFILKMFVDLNPDSDKIIYSHFTCATDTENI ---3333--3333-1111-----3333---------1111-3333-------11113333 RFVFAAVKDTILQLNLK ----------------- >BETA-2-MICROGLOBULIN; SWP:P05534; PDB:2BCKA; GSHSMRYFSTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW -------------2222----------!!!!-----1111--------1111-------- DEETGKVKAHSQTDRENLRIALRYYNQSEAGSHTLQMMFGCDVGSDGRFLRGYHQYAYDG ---------------------------1111------------3333----------iii KDYIALKEDLRSWTAADMAAQITKRKWEAAHVAEQQRAYLEGTCVDGLRRYLENGKETLQ i-----1111-----------------------------------------------111 RTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 1-------------------------------------iiii------------------ FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEPGSGGGLNDIF ------------3333------1111-------------------- >F1845 fimbrial protein [P; SWP:P13719; PDB:2BCMB; TFQASGTTGITTLTVTEECRVQVGNVTATLARSKLKDDTAIGVIGVTALGCNGLQAALQA ------------------------------3333-2222-----------2222------ DPDNYDATNLYMTSRNHDKLNVKLKATDGSSWTYGNGVFYKTEGGNWGGHVGISVDGNQT 1111---------1111--------1111-----iiii--------------------11 DKPTGEYTLNLTGGYWTN 11---------------- >SUCCINYLGLUTAMATE DESUCCI; SWP:Q87Q40; PDB:2BCOA; SLFRQSFLTDTLDVHIVAPAEQVLSNGVQLKLYQRGVLEVIPENPTQETKNIIISCGIHG 
-----3333--------------1111------2222--------1111---------11 DETAPELVDSIIKDIESGFQKVDARCLFIIAHPESTLAHTRFLEENLNRLFDEKEHEPTK 11------------1111---------------------------1111---------33 ELAIADTLKLLVRDFYQDTEPKTRWHLDLHCAIRGSKHYTFAVSPKTRHPVRSKALVDFL 33-----------------3333------------------------------------- DSAHIEAVLLSNSPSSTFSWYSAENYSAQALTELGRVARIGENALDRLTAFDLALRNLIA --------------------------------------2222-3333------------- EAQPEHLSKPCIKYRVSRTIVRLHDDFDFFDDNVENFTSFVHGEVFGHDGDKPLAKNDNE ------------------------------1111------2222---------------- AIVFPNRHVAIGQRAALVCEVKTRFEEGELVYD -----11112222------------iiii---- >DNA POLYMERASE LAMBDA; SWP:Q9UGP5; PDB:2BCQA; HNLHITEKLEVLAKAYSVQGDKWRALGYAKAINALKSFHKPVTSYQEACSIPGIGKRMAE -3333-----------1111------------------------3333------------ KIIEILESGHLRKLDHISESVPVLELFSNIWGAGTKTAQMWYQQGFRSLEDIRSQASLTT ----------3333---3333--------2222--------------------------- QQAIGLKHYSDFLERMPREEATEIEQTVQKAAQAFNSGLLCVACGSYRRGKATCGDVDVL --------3333-------------------33331111--------------------- ITHPDGRSHRGIFSRLLDSLRQEGFLTDDLVSQEENGQQQKYLGVCRLPGPGRRHRRLDI --1111-----3333-----------------3333------------------------ IVVPYSEFACALLYFTGSAHFNRSMRALAKTKGMSLSEHALSTAVVRNTHGCKVGPGRVL ---3333----------------------------------------1111--------- PTPTEKDVFRLLGLPYREPAERDW ---------1111----3333--- >SEPIAPTERIN REDUCTASE; SWP:Q8KES3; PDB:2BD0A; KHILLITGAGKGIGRAIALEFARAARHHPDFEPVLVLSSRTAADLEKISLECRAEGALTD -----------------------33331111---------------------1111---- TITADISDMADVRRLTTHIVERYGHIDCLVNNAGVGRFGALSDLTEEDFDYTMNTNLKGT ----1111-------------------------------3333----------------- FFLTQALFALMERQHSGHIFFITSVAATKAFRHSSIYCMSKFGQRGLVETMRLYARKCNV -----------------------1111---1111---------------33333333--- RITDVQPGAVYTPMWGKVDDEMQALMMMPEDIAAPVVQAYLQPSRTVVEEIILRPTSGDI -----------3333----3333----3333-----------3333--------1111-- >ACP-SYNTHASE; SWP:Q7RB63; PDB:2BDDA; QGHHIIGIGTDILCVNRIYKILEKNINFIKKVLNPFELAEFETQNELKKLAIYVSKKFAA 
-------------3333-------3333-----3333----------------------- KEAILKSMGRLSMNDIEIKNDKYGKPHVYLYGKAKKVAYEMGIVKIFLSISDEKFIIQAQ -----1111--1111-----1111--------------1111------------------ ALAVGSN ------- >CYTOSOLIC IMP-GMP SPECIFI; SWP:Q5ZZB6; PDB:2BDEA; DTHKVFVNRIINRKIKLIGLDDHTLIRYNSKNFESLVYDLVKERLAESFHYPEEIKKFKF ---------------------------------------------------3333----- NFDDAIRGLVIDSKNGNILKLSRYGAIRLSYHGTKQISFSDQKKIYRSIYVDLGDPNYAI 1111-----------------1111--------------------------3333----- DTSFSIAFCILYGQLVDLKDTNPDKPSYQAIAQDVQYCVDKVHSDGTLKNIIIKNLKKYV ---------------------3333----------------------------------- IREKEVVEGLKHFIRYGKKIFILTNSEYSYSKLLLDYALSPFLDKGEHWQGLFEFVITLA --3333-------1111---------3333--------3333-22223333--------- NKPRFFYDNLRFLSVNPENGTTNVHGPIVPGVYQGGNAKKFTEDLGVGGDEILYIGDHIY --3333-----------------------------------------3333--------- GDILRLKKDCNWRTALVVEELGEEIASQIRALPIEKKIGEAAIKKELEQKYVDLCTRSID -----------------3333--------------------------------------- ESSQQYDQEIHDLQLQISTVDLQISRLLQEQNSFYNPKWERVFRAGAEESYFAYQVDRFA -----3333--------------------3333-------1111---------------- CIYEKLSDLLEHSPTYFRANRRLLAHDIDI ----33333333-----------1111--- >KALLIKREIN-4; SWP:Q9Y5K2; PDB:2BDGA; IINGEDCSPHSQPWQAALVMENELFCSGVLVHPQWVLSAAHCFQNSYTIGLGLHSLEADQ -------22221111----------------1111---1111--------------3333 EPGSQMVEASLSVRHPEYNRPLLANDLMLIKLDESVSESDTIRSISIASQCPTAGNSCLV 1111----------1111--2222--------------1111------------------ SGWGLLANGRMPTVLQCVNVSVVSEEVCSKLYDPLYHPSMFCAGGGQDQKDSCNGDSGGP -----1111---------------------------1111-----3333---2222---- LICNGYLQGLVSFGKAPCGQVGVPGVYTNLCKFTEWIEKTVQA -------------------2222-----3333----------- >SMALL INDUCIBLE CYTOKINE ; SWP:NA; PDB:2BDNH; EVQLQQSGAELVKAGASVKLSCPASGLNIKDTYMHWVKQRPEQGLEWIGRIDPANGNTKF ------------2222-----------3333----------------------------- DPKFQGKATITADTSSNTAYLQLSSLTSEDTAVYYCARGVFGFFDYWGQGTTLTVSSAKT 3333---------1111---------3333--------1111------------------ 
TAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYT ----------3333-----------------------%%%%-------------iiii-- LSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIVPR -------3333-----------3333----------- >COPPER HOMEOSTASIS PROTEI; SWP:Q8DYB9; PDB:2BDQA; ILREFCAENLTDLTRLDKAIISRVELCDNLAVGGTTPSYGVIKEANQYLHEKGISVAVIR ---------1111---3333--------3333-----------------1111------- PRGGNFVYNDLELRIEEDILRAVELESDALVLGILTSNNHIDTEAIEQLLPATQGLPLVF ----------------------1111---------1111----------3333------- HAFDVIPKSDQKKSIDQLVALGFTRILLHGSSNGEPIIENIKHIKALVEYANNRIEIVGG -3333-3333-------------------------3333-----------%%%%------ GVTAENYQYICQETGVKQAHGTRIT --1111------------------- >UREIDOGLYCOLATE HYDROLASE; SWP:P59285; PDB:2BDRA; RTLIEPLTKEAFAQFGDVIETDGSDHFINNGSTRFHKLATVETAEPEDKAIISIFRADAQ -------33333333-----2222---%%%%-------------3333------------ DPLTVRLERHPLGSQAFIPLLGNPFLIVVAPVGDAPVSGLVRAFRSNGRQGVNYHRGVWH ---------1111-----------------------1111--------------2222-- HPVLTIEKRDDFLVVDRSGSGNNCDEHYFTEEQLILNPH -----------------------------3333------ >BH3686; SWP:Q9K6P2; PDB:2BDTA; KKLYIITGPAGVGKSTTCKRLAAQLDNSAYIEGDIINHVVGGYRPPWESDELLALTWKNI --------2222---------3333------3333----2222-1111------------ TDLTVNFLLAQNDVVLDYIAFPDEAEALAQTVQAKVDDVEIRFIILWTNREELLRRDALR ------------------------------3333--------------3333--1111-- KKGERCLELVEEFESKGIDERYFYNTSHLQPTNLNDIVKNLKTNPRFIFC --3333------------3333---11111111----------3333--- >phage-related conserved h; SWP:Q7WK92; PDB:2BDVA; CGRIAQKSAPEDYVEILWPNARLVAGPRYNIPPGTRPLTHRLVDQAEALARLPWGYKPHG --------3333------------------------------%%%%-----------111 SSFFINAKLETIERHGWPWKLIGTGRILVPADGWYEWKALDSGPKPAKQPYYIHGDAPLL 1----------------------------------------------------------- FAGLSAWRRGAELDEAHGFAIVTNDALGGMVDVHDRRPVALPPELAREWVDPATPVARAK -------2222--3333------3333----1111---------------3333------ EILRAGLPETAFSWYPVRQEVGSSKYQLPD -1111--3333------------------- >HYPOTHETICAL PROTEIN K11E; SWP:Q9NG91; PDB:2BDWA; 
STKFSDNYDVKEELGKGAFSVVRRCVHKTTGLEFAAKIINTKKLSARDFQKLEREARICR -3333----------------------------------3333----------------- KLQHPNIVRLHDSIQEESFHYLVFDLVTGGELFEDIVAREFYSEADASHCIQQILESIAY ---1111----------------------------1111--------------------- CHSNGIVHRNLKPENLLLASKAKGAAVKLADFGLAIEVNDSEAWHGFAGTPGYLSPEVLK -----------3333------2222------1111--------------1111------- KDPYSKPVDIWACGVILYILLVGYPPFWDEDQHRLYAQIKAGAYDYPSPEWDTVTPEAKS ----3333----------------------------------------1111-------- LIDSMLTVNPKKRITADQALKVPWICNRERVASAIHRQDTVDCLKKFNARRKLKGAILTT --------3333--3333---3333----------------------------------- MIATRNLSN --------- >MEXICAIN; SWP:P84346; PDB:2BDZA; YPESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRS -----------------------3333--------------------------------- HGCDGGYQTTSLQYVVDNGVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEIS !!!!-----------------3333----------3333--------------------- LIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKTYLLLKNSWGPN ---3333----------3333------------------------------------111 WGEKGYIRIKRASGRSKGTCGVYTSSFFPIKG 1-iiii-----------2222----------- >PEPTIDE; SWP:P32361; PDB:2BE1A; NRSLNELSLSDILIAADVEGGLHAVDRRNGHIIWSIEPENFQPLIEIQEPSRLETYETLI ----------------1111----------------3333-------------------- IEPFGDGNIYYFNAHQGLQKLPLSIRQLVSTSPLHLKTNEDEKVYTGSMRTIMYTINMLN ---!!!!-----3333-------------------------------------------- GEIISAFGPGSKNGENMIVIGKTIFELGIHSYDGASYNVTYSTWQQNVLDVPLALQNTFS ------------------------------3333------------3333-3333----- KDGMCIAPFRDKSLLASDLDFRIARWVSPTFPGIIVGLFDVFNDLRTNENILVPHPFNPN ------------------------------------------------------------ KVYLDQTSNLSWFALSSQNFPSLVESAPISRYASSDRWRVSSIFEDETLFKNAIMGVHQI ------1111-----3333----1111--3333-1111-3333----------------- Y - >GTP PYROPHOSPHOKINASE; SWP:Q97QV1; PDB:2BE3A; TLEWEEFLDPYIQAVGELKIKLRGIRKQYRKQNKHSPIEFVTGRVKPIESIKEKARRGIT -----------------------------1111-------------3333---------3 YATLEHDLQDIAGLRVVQFVDDVKEVVDILHKRQDRIIQERDYITHRKASGYRSYHVVVE 
333---------------1111-------3333--------------3333--------- YTVDTINGAKTILAEIQIRTLANFWATIEHSLNYKYQGDFPDEIKKRLEITARIAHQLDE ----1111---------------------------iiii--------------------- EGEIRDDIQEAQALFDP ----------3333--- >HYPOTHETICAL PROTEIN LOC4; SWP:Q5XJX1; PDB:2BE4A; SAFANLDAAGFLQIWQHFDADDNGYIEGKELDDFFRHLKKLQPKDKITDERVQQIKKSFS ---------------3333-------3333-----------1111--------------- AYDATFDGRLQIEELANILPQEENFLLIFRREAPLDNSVEFKIWRKYDADSSGYISAAEL 3333--------------------------------3333-------1111----3333- KNFLKDLFLQHKKKIPPNKLDEYTDAKIFDKNKDGRLDLNDLARILALQENFLLQFKDAS --------1111-----------------3333----33331111--------------- SQVERKRDFEKIFAHYDVSRTGALEGPEVDGFVKDELVRPSISGGDLDKFRECLLTHCDN ----------------1111----------------------3333-------------- KDGKIQKSELALCLGLKHKP -----------1111----- >Voltage-dependent L-type ; SWP:Q13933; PDB:2BE6D; EVTVGKFYATFLIQEYFRKFKKRKEQGLV 3333------------------------- >ASPARTATE CARBAMOYLTRANSF; SWP:P96174; PDB:2BE7A; ANPLFRKHIVSINDISRNELELIVKTAAKLKEQPQPELLKNKVIASCFFEASTRTRLSFE -1111-----3333------------------------2222------------------ TAIQRLGGSVIGFDNAGNTSLAKKGETLADSISVISSYADAFVMRHPQEGAARLASEFSN ---1111-------3333----------------3333---------2222--3333--- VPVINGGDGSNQHPTQTLLDLFSIYETQGRLDNLNIAFVGDLKYGRTVHSLAQALAKFDG -------!!!!----------------------------------3333----------- CKFHFIAPDALAMPEYICDELDEQNISYATYASIEEVVPEIDVLYMTRVQKERFDETEYQ -------3333--3333---------------33333333---------3333-3333-- HMKAGFILSASSLVHAKPNLKVLHPLPRVDEIATDVDKTPYAYYFQQAENGVYAREALLA --1111--333311111111------------3333--3333------------------ LVLNETIGE --------- >Aspartate carbamoyltransf; SWP:P96175; PDB:2BE7D; CNGYVIDHIPSGQGVKILKLFSLTDTKQRVTVGFNLKDLIKVENTEITKSQANQLALLAP ------------------1111-------------------------3333---1111-- NATINIIENFKVTDKHSLTLPNEVENVFPCPNSNCITHGEPVTSSFSIKKTKGNIGLKCK -------%%%%--------------------1111------------------------- YCEKTFSKDIVTE ------3333--- >CALCINEURIN B HOMOLOGOUS ; SWP:O43745; PDB:2BECA; 
IPDGDSIRRETGFSQASLLRLHHRFRALDRNKKGYLSRMDLQQIGALAVNPLGDRIIESF ------------------------------------3333----3333-1111----111 FPDGSQRVDFPGFVRVLAHFRPVEDEDTEKPEPLNSRRNKLHYAFQLYDLDRDGKISRHE 1----------------1111--3333-----1111------------1111-------- MLQVLRLMVGVQVTEEQLENIADRTVQEADEDGDGAVSFVEFTKSLEKMDVEQKMSIRIL ----3333-----3333----------------------------11113333----333 K 3 >DIAMINE ACETYLTRANSFERASE; SWP:Q96F10; PDB:2BEIA; SVRIREAKEGDCGDILRLIRELAEFEKLKISEEALRADGFGDNPFYHCLVAEILGPCVVG -------3333------------------------------------------------- YGIYYFIYSTWKGRTIYLEDIYVPEYRGQGIGSKIIKKVAEVALDKGCSQFRLAVLDWNQ -----------------------1111----------------1111--------3333- RADLYKALGAQDLTEAEGWHFFCFQGEATRKLAG -----1111--3333------------------- >CBP21; SWP:O83009; PDB:2BEMA; HGYVESPASRAYQCKLQLNTQCGSVQYEPQSVEGLKGFPQAGPADGHIASADKSTFFELD ----------------------3333-1111------------22221111-11113333 QQTPTRWNKLNLKTGPNSFTWKLTARHSTTSWRYFITKPNWDASQPLTRASFDLTPFCQF --1111-----------------------------------1111--3333--------- NDGGAIPAAQVTHQCNIPADRSGSHVILAVWDIADTANAFYQAIDVNLSK -iiii------------1111----------------------------- >UBIQUITIN-CONJUGATING ENZ; SWP:NA; PDB:2BEPA; AMANIAVQRIKREFKEVLKSEETSKNQIKVDLVDENFTELRGEIAGPPDTPYEGGRYQLE ----------------------1111-----------------------1111------- IKIPETYPFNPPKVRFITKIWHPNISSVTGAICLDILKDQWAAAMTLRTVLLSLQALLAA ---1111--------------1111--------333311113333-----------1111 AEPDDPQDAVVANQYKQNPEMFKQTARLWAHVYAGA -1111------------------------------- >S2 protein [Fragment]; SWP:Q64FG1; PDB:2BEZF; TSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQ ---------1111----------------------------- >EXTERIOR MEMBRANE GLYCOPR; SWP:P05884; PDB:2BF1A; HMELALNVTESFDAWENTVTEQAIEDVWQLFETSIKPCVKLSPLCIGAGHCNTSIIQESC ------------------------------------------------------------ FRYCAPPGYALLRCNDTNYSGFMPKCSKVVVSSCTRMMETQTSTWFGFNGTRAENRTYIY ------------------------------------3333-------------------- WHGRDNRTIISLNKYYNLTMKCRGAGWCWFGGNWKDAIKEMKQTIVKHPRYTGTNNTDKI 
------------------------------------------------------------ NLTAPRGGDPEVTFMWTNCRGEFLYCKMNWFLNWVEDRDVTNQRPKERHRRNYVPCHIRQ ----------------------------3333---------------------------- IINTWHKVGKNVYLPPREGDLTCNSTVTSLIANIDWTDGNQTNITMSAEVAELYRLELGD ------------------------------------------------1111-3333--- YKLV ---- >TOLUENE-4-MONOOXYGENASE S; SWP:Q00459; PDB:2BF5A; NNVGPIIRAGDLVEPVIETAEIDNPGKEITVEDRRAYVRIAAEGELILTRKTLEEQLGRP --------3333-----------2222------2222----------------------- FNMQELEINLASFAGQIQADEDQIRFYFDKTM -----1111----------1111-----1111 >EXO-ALPHA-SIALIDASE; SWP:Q59310; PDB:2BF6A; VEGAVKTEPVDLFHPGFLNSSNYRIPALFKTKEGTLIASIDARRHGGADAPNNDIDTAVR 2222---------2222-------------1111-------------------------- RSEDGGKTWDEGQIIMDYPDKSSVIDTTLIQDDETGRIFLLVTHFPSKYGFWNAGLGSGF ---iiii-----------%%%%-----------------------22223333------- KNIDGKEYLCLYDSSGKEFTVRENVVYDKDSNKTEYTTNALGDLFKNGTKIDNINSSTAP --iiii------1111-----%%%%--1111-------1111---iiii---1111---- LKAKGTSYINLVYSDDDGKTWSEPQNINFQVKKDWMKFLGIAPGRGIQIKNGEHKGRIVV ---------------iiii-------3333--1111---------------1111----- PVYYTNEKGKQSSAVIYSDDSGKNWTIGESPNDNRKLENGKIINSKTLSDDAPQLTECQV -----1111----------iiii------1111---1111---3333---1111------ VEMPNGQLKLFMRNLSGYLNIATSFDGGATWDETVEKDTNVLEPYCQLSVINYSQKVDGK --1111-------------------iiii---------------------------iiii DAVIFSNPNARSRSNGTVRIGLINQVGTYENGEPKYEFDWKYNKLVKPGYYAYSCLTELS ----------------------------1111--------------------------11 NGNIGLLYEGTPSEEMSYIEMNLKYLESG 11--------------------------- >2-OXOISOVALERATE DEHYDROG; SWP:P12694; PDB:2BFDA; KPQFPGASAEFIDKLEFIQPNVISGIPIYRVMDRQGQIINPSEDPHLPKEKVLKLYKSMT ---1111-------------------------1111---3333----------------- LLNTMDRILYESQRQGRISFYMTNYGEEGTHVGSAAALDNTDLVFGQAREAGVLMYRDYP ------------1111-------2222-------11111111-------3333-1111-3 LELFMAQCYGNISDLGKGRQMPVHYGCKERHFVTISSPLATQIPQAVGAAYAAKRANANR 333-------1111-%%%%--------1111------2222------------------- VVICYFGEGAASEGDAHAGFNFAATLECPIIFFCRNNGYAISTPTSEQYRGDGIAARGPG 
------3333-------------1111------------!!!!3333------3333333 YGIMSIRVDGNDVFAVYNATKEARRRAVAENQPFLIEAMTYRIGSTDHPISRLRHYLLSQ 3-------1111------------------------------------------------ GWWDEEQEKAWRKQSRRKVMEAFEQAERKPKPNPNLLFSDVYQEMPAQLRKQQESLARHL ------------------------------------------------------------ QTYGEHYPLDHFDK --3333-------- >2-oxoisovalerate dehydrog; SWP:P21953; PDB:2BFDB; AHFEYGQTQKMNLFQSVTSALDNSLAKDPTAVIFGEDVAFGGVFRCTVGLRDKYGKDRVF ---------------------------1111------11111111---3333--1111-- NTPLCEQGIVGFGIGIAVTGATAIAEIQFADYIFPAFDQIVNEAAKYRYRSGDLFNCGSL ---------------3333---------33333333------3333-3333-----1111 TIRSPWGCVGHGALYHSQSPEAFFAHCPGIKVVIPRSPFQAKGLLLSCIEDKNPCIFFEP -------------------3333------------------------------------3 KILYRAAAEEVPIEPYNIPLSQAEVIQEGSDVTLVAWGTQVHVIREVASMAKEKLGVSCE 333---------------------------------!!!!-------------------- VIDLRTIIPWDVDTICKSVIKTGRLLISHEAPLTGGFASEISSTVQEECFLNLEAPISRV --------------------------------2222------------3333-------- CGYDTPFPHIFEPFYIPDKWKCYDALRKMINY --------1111----------------1111 >GLGA GLYCOGEN SYNTHASE; SWP:Q9V2J8; PDB:2BFWA; GIDCSFWNESYLTGSRDERKKSLLSKFGMDEGVTFMFIGRFDRGQKGVDVLLKAIEILSS --3333--1111------------1111-----------------------------111 KKEFQEMRFIIIGKGDPELEGWARSLEEKHGNVKVITEMLSREFVRELYGSVDFVIIPSY 13333------------------------1111--------------1111--------- FEPFGLVALEAMCLGAIPIASAVGGLRDIITNETGILVKAGDPGELANAILKALELSRSD ----3333---1111-------!!!!----1111----2222----------1111---- LSKFRENCKKRAMSFS -----------3333- >LOC398457 PROTEIN; SWP:Q6DE08; PDB:2BFXA; RKFTIDDFDIGRPLGKGKFGNVYLAREKQNKFIMALKVLFKSQLEKEGVEHQLRREIEIQ ---1111--------------------1111---------------------------33 SHLRHPNILRMYNYFHDRKRIYLMLEFAPRGELYKELQKHGRFDEQRSATFMEELADALH 33--1111-------------------11113333------------------------- YCHERKVIHRDIKPENLLMGYKGELKIADFGWSVHAPSLRRRMCGTLDYLPPEMIEGKTH ------------3333---1111------1111-------------11113333------ DEKVDLWCAGVLCYEFLVGMPPFDSPSHTETHRRIVNVDLKFPPFLSDGSKDLISKLLRY 
--------------------1111------------------3333-------------- HPPQRLPLKGVMEHPWVKANSRRVLPPVYQ 3333-------------------------- >Inner centromere protein ; SWP:O13024; PDB:2BFXC; IPAWASGNLLTQAIRQQYYKPIDVDRMYGTIDSPKLEELFN -3333-3333------------3333-1111---3333--- >PENICILLIN-BINDING PROTEI; SWP:O70038; PDB:2BG1A; ITYDYLYFTTLAEAQERMYDYLAQRDNVSAKELKNEATQKFYRDLAAKEIENGGYKITTT ----------------------------3333---------------------------- IDQKIHSAMQSAVADYGYLLDDGTGRVEVGNVLMDNQTGAILGFVGGRNYQENQNNHAFD -----------------1111---------------------------3333-------- TKRSPASTTKPLLAYGIAIDQGLMGSETILSNYPTNFANGNPIMYANSKGTGMMTLGEAL ----!!!!----------1111--1111--------1111----!!!!-----------1 NYSWNIPAYWTYRMLRENGVDVKGYMEKMGYEIPEYGIESLPMGGGIEVTVAQHTNGYQT 111------------1111-3333--1111-------1111--2222------------- LANNGVYHQKHVISKIEAADGRVVYEYQDKPVQVYSKATATIMQGLLREVLSSRVTTTFK --iiii-----------1111--------------------------------------- SNLTSLNPTLANADWIGKTGTTNQDENMWLMLSTPRLTLGGWIGHDDNHSLSQQAGYSNN -------3333----------3333--------3333-------1111---1111----- SNYMAHLVNAIQQASPSIWGNERFALDPSVVKSEVLKSTGQKPGKVSVEGKEVEVTGSTV --------------1111--------1111------------------------------ TSYWANKSGAPATSYRFAIGGSDADYQNAWSSIVGS ------------------------------------ >PHOSPHOENOLPYRUVATE-PROTE; SWP:Q8R7R4; PDB:2BG5A; EGLKQLKDLPAETPDGKKVLAANIGTPKDVASALANGAEGVGLFRTEFLYDRNSLPSEEE -33331111---1111---------3333----1111--------3333----------- QFEAYKEVVEKGGRPVTIRTLDIGGDKELPYLDPKENPFLGYRAIRLCLDRPDIFKTQLR -----------------------3333-3333----3333--!!!!-------------- AILRASAYGNVQIYPISSVEEVRKANSILEEVKAELDREGVKYDKEIKVGIVEIPSAAVT -------------------------------------------1111------3333--- ADILAKEVDFFSIGTNDLTQYTLAVDRNEHVKEYYQPFHPAILRLVKVIDAAHKEGKFAA ---3333--------------------333311111111-------------1111---- CGEAGDPLAAVILLGLGLDEFSSATSIPEIKNIIRNVEYEKAKEIAEKALNSEAREIEKK -----3333----1111-----3333-------1111----------------------- DVIKDIG ------- >PRFA; SWP:Q4TVQ0; PDB:2BGCA; 
AQAEEFKKYLETNGIKPKQFHKKELIFNQWDPQEYCIFLYDGITKLTSISENGTIMNLQY --------------------2222---1111------------------1111------- YKGAFVIMSGFIDTETSVGYYNLEVISEQATAYVIKINELKELLSKNLTHFFYVFQTLQK -----------1111--------------------3333--------------------- QVSYSLAKFNDFSINGKLGSICSQLLILTYVYGKETPDGIKITLDNLTMQELGYSSGIAH -----------------------------------1111--------------------3 SSAVSRIISKLKQEKVIVYKNSCFYVQNLDYLKRYAPKLDEWFYLAPATWGKLN 333--------1111----%%%%----33333333------------3333--- >VINORINE SYNTHASE; SWP:Q70PR7; PDB:2BGHA; QMEKVSEELILPSSPTPQSLKCYKISHLDQLLLTCHIPFILFYPNPLDSNLDPAQTSQHL ----------------3333-----3333------------------------------- KQSLSKVLTHFYPLAGRINVNSSVDCNDSGVPFVEARVQAQLSQAIQNVVELEKLDQYLP ------33333333----2222------------------3333------33333333-- SAAYPGGKIEVNEDVPLAVKISFFECGGTAIGVNLSHKIADVLSLATFLNAWTATCRGET -----------1111--------3333--------1111---------------1111-- EIVLPNFDLAARHFPPVDNTPSPELVPDENVVMKRFVFDKEKIGALRAQASKNFSRVQLV -------3333-----2222---------------------------------------- VAYIWKHVIDVTRAKYGAKNKFVVVQAVNLRSRMNPPLPHYAMGNIATLLFAAVDAEWDK -----------------------------1111-----1111------------1111-- DFPDLIGPLRTSLEKTEDDHNHELLKGMTCLYELEPQELLSFTSWCRLGFYDLDFGWGKP 3333-----3333---------------------1111----------1111-------- LSACTTTFPKRNAALLMDTRSGDGVEAWLPMAEDEMAMLPVELLSLVDSDFSK ------------------1111-------------111133331111------ >FERREDOXIN-NADP(H) REDUCT; SWP:Q9L6V3; PDB:2BGIA; PDAQTVTSVRHWTDTLFSFRVTRPQTLRFRSGEFVMIGLLDDNGKPIMRAYSIASPAWDE -----------------------1111-------------1111-----------1111- ELEFYSIKVPDGPLTSRLQHIKVGEQIILRPKPVGTLVIDALLPGKRLWFLATGTGIAPF --------2222----3333-2222------------3333----------------111 ASLMREPEAYEKFDEVIMMHACRTVAELEYGRQLVEALQEDPLIGELVEGKLKYYPTTTR 133333333--------------3333------------------1111----------- EEFHHMGRITDNLASGKVFEDLGIAPMNPETDRAMVCGSLAFNVDVMKVLESYGLREGAN ------------1111-----------3333-------------------1111----11 SEPREFVVEKAFVGEGI 11--------------- >RHIZOME SECOISOLARICIRESI; SWP:Q94KL8; PDB:2BGKA; 
TNRLQDKVAIITGGAGGIGETTAKLFVRYGAKVVIADIADDHGQKVCNNIGSPDVISFVH ---2222-----1111-----------------------------------3333----- CDVTKDEDVRNLVDTTIAKHGKLDIMFGNVGVLSTTPYSILEAGNEDFKRVMDINVYGAF -1111---------------------------------3333------------------ LVAKHAARVMIPAKKGSIVFTASISSFTAGEGVSHVYTATKHAVLGLTTSLCTELGEYGI ---------3333---------1111---2222---------------------3333-- RVNCVSPYIVASPLLTDVFGVDSSRVEELAHQAANLKGTLLRAEDVADAVAYLAGDESKY --------------3333----3333---------------3333---------1111-- VSGLNLVIDGGYTRTNPAFPTALKHGLA --------iiii---------------- >ENDO-B1,4-MANNANASE 5C; SWP:Q840C0; PDB:2BGOA; SWTYTAASASITAPAQLVGNVGELQGAGSAVIWNVDVPVTGEYRINLTWSSPYSSKVNTL ----3333---------%%%%--------------------------------------- VMDGTALSYAFAEATVPVTYVQTKTLSAGNHSFGVRVGSSDWGYMNVHSLKLELLG -%%%%--------------------------------3333--------------- >ALDOSE REDUCTASE; SWP:P23901; PDB:2BGSA; QDHFVLKSGHAMPAVGLGTWRAGSDTAHSVRTAITEAGYRHVDTAAEYGVEKEVGKGLKA -----3333-------------1111--------1111------3333------------ AMEAGIDRKDLFVTSKIWCTNLAPERVRPALENTLKDLQLDYIDLYHIHWPFRLKDGAHM -1111-3333-------3333-3333----------------------------2222-- PPEAGEVLEFDMEGVWKEMENLVKDGLVKDIGVCNYTVTKLNRLLRSAKIPPAVCQMEMH --2222----------------1111---------------------------------2 PGWKNDKIFEACKKHGIHITAYSPLGSSEKNLAHDPVVEKVANKLNKTPGQVLIKWALQR 222---------1111------1111-----1111----------------------111 GTSVIPKSSKDERIKENIQVFGWEIPEEDFKVLCSIKDEKRVLTGEELFVNKTHGPYRSA 1--------------1111-------------1111--------3333----------33 RDVWDHEN 33-%%%%- >XPF ENDONUCLEASE; SWP:Q9YC15; PDB:2BGWA; MLEDPGGRPRVYVDVREERSPVPSILESLGVQVIPKQLPMGDYLVSDSIIVERKTSSDFA -------------3333---------1111------------------------------ KSLFDGRLFEQASRLAEHYETVFIIVEGPPVPRRYRGRERSLYAAMAALQLDYGIRLMNT ------------------------------------------------------------ MDPKGTALVIESLARLSTREGGQRIVIHKKPRLSDVREWQLYILQSFPGIGRRTAERILE -3333----------------2222---------------------2222---------- RFGSLERFFTASKAEISKVEGIGEKRAEEIKKILMTPYK ----3333----3333--2222----------------- 
>N-ACETYLMURAMOYL-L-ALANIN; SWP:P75820; PDB:2BGXA; GIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADDFDSSLATLTDKQVSSHYLVPAVPPRY -----------------------------------------------------------% NGKPRIWQLVPEQELAWHAGISAWRGATRLNDTSIGIELENRGWQKSAGVKYFAPFEPAQ %%%-------3333---------iiii-3333------------------------3333 IQALIPLAKDIIARYHIKPENVVAHADIAPQRKDDPGPLFPWQQLAQQGIGAWPDAQRVN -----------------3333--3333-1111-------------1111----------- FYLAGRAPHTPVDTASLLELLARYGYDVKPDMTPREQRRVIMAFQMHFRPTLYNGEADAE 3333--1111--3333--------------------------------3333-------- TQAIAEALLEKYGQ -------------- >YOAJ; SWP:O34918; PDB:2BH0A; AYDDLHEGYATYTGSGYSGGAFLLDPIPSDEITAINPADLNYGGVKAALAGSYLEVEGPK 3333---------------1111-------------3333-iiii-1111-------111 GKTTVYVTDLYPEGARGALDLSPNAFRKIGNKDGKINIKWRVVKAPITGNFTYRIKEGSS 1---------22222222---3333------------------------------2222- RWWAAIQVRNHKYPVKEYEKDGKWINEKDYNHFVSTNLGTGSLKVRTDIRGKVVKDTIPK -------------------%%%%------------------------1111--------- LPESGTSKAYTVPGHVQFPE -------------------- >General secretion pathway; SWP:P37093; PDB:2BH1X; IRRLPFSFANRFKLVLDWNEDFSQASIYYLAPLSMEALVETKRVVKHAFQLIELSQAEFE ---------1111-----1111-----------3333----------------------- SKLTQVYQ -------- >Cytochrome c-550 [Precurs; SWP:Q00499; PDB:2BH4X; EGDAAKGEKEFNKCKACHMVQAPDGTDIVKGGKTGPNLYGVVGRKIASVEGFKYGDGILE ---------------------1111------------2222-------2222-------- VAEKNPDMVWSEADLIEYVTDPKPWLVEKTGDSAAKTKKTFKLGKNQADVVAFLAQHSPD ----1111-----------------------1111----------------------111 AG 1- >1B11; SWP:P0AG67; PDB:2BH8A; KMTGIVKWFNADKGFGFITPDDGSKDVFVHFSAGSSGAAVRGNPQQGDRVEGKIKSITDF ----------1111------------------------------2222------------ GIFIGLDGGIDGLVHLSDISWAQAEA ----1111------------------ >GLUCOSE-6-PHOSPHATE 1-DEH; SWP:P11413; PDB:2BH9A; VQSDTHIFIIMGASGDLAKKKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEP -----------33333333--------------------------------------111 FFKATPEEKLKLEDFFARNSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTV 1---3333-------------------3333------------1111---------1111 
YEAVTKNIHESCMSQIGWNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKE ---------------------------------------3333--1111----3333--3 MVQNLMVLRFANRIFGPIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQ 3333333----333311111111----------------33333333-3333-------- MLCLVAMEKPASTNSDDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDP --------------------------------3333--------1111-1111-111111 TVPRGSTTATFAAVVLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCK 11-----------------3333-------------------------------%%%%-- RNELVIRVQPNEAVYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFC ------------------------------------------------3333-------- GSQMHFVRSDELREAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGT ------------------3333--3333--------------3333----1111------ YKWVNPHKL --------- >FOOT-AND-MOUTH DISEASE VI; SWP:P03306; PDB:2BHGA; DLQKVGNTKPVELNLDGKTVAICCATGVFGTAYLVPRHLFAEKYDKILDGRATDSDYRVF 3333----------%%%%-----------------3333--------iiii-1111---- EFEIKVKGQDLSDAALVLHRGNKVRDITKHFRDTARKKGTPVVGVVNNADVGRLIFSGEA --------------------------3333------2222-------------------- LTYKDIVVTPGLFAYKAATRAGYAGGAVLAKDGADTFIVGTHSAGGNGVGYCSCVSRSLQ ----------------------2222-------------------iiii----------- KKAH ---- >TYPE IV SECRETION SYSTEM ; SWP:Q7CEG3; PDB:2BHMA; SYDTVMDKYWLSQYVIARETYDWYTLQKDYETVGMLSSPSEGQSYASQFQGDKALDKQYG 3333-----------------1111--------1111--------1111----3333-!! 
SNVRTSVTIVSIVPNGKGIGTVRFAKTTKRGDGETTHWIATIGYQYVNPSLMSESARLTN !!---------------------------------------------1111-3333---1 PLGFNVTSYRVDPEM 111------------ >CYSTEINE SYNTHASE B; SWP:P16703; PDB:2BHTA; MSTLEQTIGNTPLVKLQRMGPDNGSEVWLKLEGNNPAGSVDRAALSMIVEAEKRGRIKPG --3333------------------------33331111-------------------222 DVLIEATSGNTGIALAMIAALKGYRMKLLMPDNMSQERRAAMRAYGAELILVTKEQGMEG 2---------------------------------------------------3333---- ARDLALEMANRGEGKLLDQFNNPDNPKAHYTTTGPEIWQQTGGRITHFVSSMGTTGTITG -----------------3333--------------------------------------- VSEFMREQSKPVTIVGLQPEEGSSIPGIRRWPTEYLPGIFNASLVDEVLDIHQRDAENTM ---3333------------2222--------1111-11111111---------------- RELAVREGIFCGVSSGGAVAGALRVAKANPDAVVVAIICDRGDRYLSTGVFGE -----------------------------------------1111-------- >MALTOOLIGOSYLTREHALOSE TR; SWP:Q9RX51; PDB:2BHUA; SFQTQHDPRTRLGATPLPGGAGTRFRLWTSTARTVAVRVNGTEHVMTSLGGGIYELELPV ------3333-------%%%%-----------------iiii-------iiii------- GPGARYLFVLDGVPTPDPYARFLPDGVHGEAEVVDFGTFDWTDADWHGIKLADCVFYEVH 2222-----iiii---1111--1111--------1111----1111---3333------3 VGTFTPEGTYRAAAEKLPYLKELGVTAIQVMPLAAFDGQRGWGYDGAAFYAPYAPYGRPE 333-3333---------------------------------------1111-3333-333 DLMALVDAAHRLGLGVFLDVVYNHFGPSGNYLSSYAPSYFTDRFSSAWGMGLDYAEPHMR 3--------1111-----------------3333-1111------1111----------- RYVTGNARMWLRDYHFDGLRLDATPYMTDDSETHILTELAQEIHELGGTHLLLAEDHRNL ----------------------3333---------------------------------- PDLVTVNHLDGIWTDDFHHETRVTLTGEQEGYYAGYRGGAEALAYTIRRGWRYEGQFWAV -----------------------------!!!!--------------------------2 KGEEHERGHPSDALEAPNFVYCIQNHDQIGNRPLGERLHQSDGVTLHEYRGAAALLLTLP 222-------33333333------33331111----33332222-----------1111- MTPLLFQGQEWAASTPFQFFSDHAGELGQAVSEGRKKEFDVPDPQAEQTFLNSKLNWAER -----2222---------------------------------1111----1111-3333- EGGEHARTLRLYRDLLRLRREDPVLHNRQRENLTTGHDGDVLWVRTVTGAGERVLLWNLG ----------------------------3333-----!!!!------1111--------- QDTRAVAEVKLPFTVPRRLLLHTEGREDLTLGAGEAVLVG 
----3333-------------1111------2222----- >COMB10; SWP:O24883; PDB:2BHVA; NKLLRTITADKMIPAFLITPISSQIAGKVIAQVESDIFAHMGKAVLIPKGSKVIGYYSNN 3333---2222------------------------------------2222--------- NKMGEYRLDIVWSRIITPHGINIMLTNAYNGLVGELIERNFQRYGVPLLLSTLTNGLLIG ----------------1111---------------------------------------- ITSAFGDYLLMQLMRQSGMGINQVVNQILRDKSKIAPIVVIREGSRVFISPNTDIFFPIP --------3333-------3333------3333--------2222--------------- RENEVIAEFLK %%%%------- >HYPOTHETICAL PROTEIN RV02; SWP:P96398; PDB:2BI0A; IRVGGPYFDDLSKGQVFDWAPGVTLSLGLAAAHQSIVGNRLRLALDSDLCAAVTGPGPLA -2222-1111-2222-------------------------3333---------------- HPGLVCDVAIGQSTLATQRVKANLFYRGLRFHRFPAVGDTLYTRTEVVGLRANSPKPGRA ------------33331111---------------2222----------------2222- PTGLAGLRTTIDRTDRLVLDFYRCALPASPDWKPGAVPGDDLSRIGADAPAPAADPTAHW ----------------------------1111---------1111---------1111-- DGAVFRKRVPGPHFDAGIAGAVLHSTADLVSGAPELARLTLNIAATHHDWRVSGRRLVYG --------------3333-------------------1111-------1111------33 GHTIGLALAQATRLLPNLATVLDWESCDHTAPVHEGDTLYSELHIESAQAHADGGVLGLR 33------------1111---------------2222----------------------- SLVYAVSDSASEPDRQVLDWRFSALQF --------1111--------------- >UDP-GALACTOPYRANOSE MUTAS; SWP:Q48485; PDB:2BI7A; KSKKILIVGAGFSGAVIGRQLAEKGHQVHIIDQRDHIGGNSYDARDSETNVMVHVYGPHI ---------------------1111------------!!!!------------1111--- FHTDNETVWNYVNKHAEMMPYVNRVKATVNGQVFSLPINLHTINQFFSKTCSPDEARALI -----------3333-------------%%%%---------------------------- AEKGDSTIADPQTFEEEALRFIGKELYEAFFKGYTIKQWGMQPSELPASILKRLPVRFNY -----------------------------------------3333-3333---------- DDNYFNHKFQGMPKCGYTQMIKSILNHENIKVDLQREFIVEERTHYDHVFYSGPLDAFYG ------------1111----------1111--------33331111----------1111 YQYGRLGYRTLDFKKFTYQGDYQGCAVMNYCSVDVPYTRITEHKYFSPWEQHDGSVCYKE 1111---------------------------3333-----------1111---------- YSRACEENDIPYYPIRQMGEMALLEKYLSLAENETNITFVGRLGTYRYLDMDVTIAEALK -----2222-------3333---------1111-------3333---------------- TAEVYLNSLTENQPMPVFTVSVR 
--------1111----------- >TEICHOIC ACID PHOSPHORYLC; SWP:Q8DQ62; PDB:2BIBA; QESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWREGIETSYKH ------------------------iiii----------------3333--2222--3333 VLTDRVFRRLKELSVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSE ----------1111----------------1111-----------------3333--111 RLWDNLYGYDKVLQTATETGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIW 1---------------1111-------3333----!!!!-----------1111------ DDNSNSLISVVKVNGKKIYLGGDLDNVHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDF 3333--------iiii----!!!!-11113333--------------------------- IKNLSPSLIVQTSDSLPWKNGVDSEYVNWLKERGIERINAASKDYDATVFDIRKDGFVNI ------------------------------1111------------------1111---1 STSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIEGEWYYFNQTGILLQNQWKK 111-------------1111-----1111----------iiii----1111--------- WNNHWFYLTDSGASAKNWKKIDGIWYYFNKENQMEIGWVQDKEQWYYLDVDGSMKTGWLQ %%%%----1111--------iiii----1111--------%%%%----1111-------- YMGQWYYFAPSGEMKMGWVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHK --------1111----------------1111--------iiii----1111-------- KANDWYFYKTDGSRAVGWIKDKDKWYFLKENGQLLVNGKTPEGYTVDSSGAWLVDVSIEK !!!!----1111--------%%%%----1111-------1111---1111--3333---- S - >PHYTOTOXIC PROTEIN PCF; SWP:NA; PDB:2BICA; EDPLYCQAIGCPTLYSEANLAVSKECRDQGKLGDDFHRCCEEQCGSTTPASA ---------------3333---------------3333-------------- >BID; SWP:P55957; PDB:2BIDA; GSMDCEVNNGSSLRDECITNLLVFGFLQSCSDNSFRRELDALGHELPVLAPQWEGYDELQ -----------------------------------3333--------------------- TDGNRSSHSRLGRIEADSESQEDIIRNIARHLAQVGDSMDRSIPPGLVNGLALQLRNTSR ---------%%%%------3333--------------3333---------------!!!! 
SEEDRNRDLATALEQLLQAYPRDMEKEKTMLVLALLLAKKVASHTPSLLRDVFHTTVNFI ----------------1111-------3333---------3333---------------- NQNLRTYVRSLARNGMD ----3333---1111-- >NITRATE REDUCTASE [NADPH]; SWP:P49050; PDB:2BIIA; PFNSEPPLTKLYDSGFLTPVSLHFVRNHGPVPYVPDENILDWEVSIEGMVETPYKIKLSD -----------3333---3333------------3333---------------------- IMEQFDIYSTPVTMVCAGNRRKEQNMVKKGAGFNWGAAGTSTSLWTGCMLGDVIGKARPS ---------------1111----------------------------------------1 KRARFVWMEGADNPANGAYGTCIRLSWCMDPERCIMIAYQQNGEWLHPDHGKPLRVVIPG 111----------1111------3333--3333-------iiii--3333-------222 VIGGRSVKWLKKLVVSDRPSENWYHYFDNRVLPTMVTPEMAKSDDRWWKDERYAIYDLNL 23333----------------3333-------1111---------11113333------- QTIICKPENQQVIKISEDEYEIAGFGYNGGGVRIGRIEVSLDKGKSWKLADIDYPEDRYR -------2222-----------------iiii---------iiii--------3333--1 EAGYFRLFGGLVNVCDRMSCLCWCFWKLKVPLSELARSKDILIRGMDERMMVQPRTMYWN 111---iiii--3333------------------1111---------------------1 VTSMLNNWWYRVAIIREGESLRFEHPVVANKPGGWMDRVKAEGGDILDNNWGEVD 111-------------!!!!-------2222--------1111-1111-iiii-- >Proto-oncogene serine/thr; SWP:P11309; PDB:2BIKB; PLESQYQVGPLLGSGGFGSVYSGIRVSDNLPVAIKHVEKDRISDWGELPNGTRVPMEVVL -1111---------1111-------------------3333------1111---3333-- LKKVSSGFSGVIRLLDWFERPDSFVLILERPEPVQDLFDFITERGALQEELARSFFWQVL -------------------1111------------------------------------- EAVRHCHNCGVLHRDIKDENILIDLNRGELKLIDFGSGALLKDTVYTDFDGTRVYSPPEW ----------------3333--------------1111-------------1111-3333 IRYHRYHGRSAAVWSLGILLYDMVCGDIPFEHDEEIIGGQVFFRQRVSECQHLIRWCLAL -------------------------------33333333----------------1111- RPSDRPTFEEIQNHPWMQDVLLPQETAEIHLH 3333---------3333--------------- >Peptidyl-prolyl cis-trans; SWP:P30405; PDB:2BITX; GNPLVYLDVDANGKPLGRVVLELKADVVPKTAENFRALCTGEKGFGYKGSTFHRVIPSFM ----------iiii--------------------------1111--2222-----2222- CQAGDFTNHNGTGGKSIYGSRFPDENFTLKHVGPGVLSMANAGPNTNGSQFFICTIKTDW ---------------1111-------------2222---------------------333 LDGKHVVFGHVIEGMDVVKKIESFGSKSGRTSKKIVITDCGQLS 
3--------------------11111111--------------- >SEX COMB ON MIDLEG-LIKE P; SWP:Q9UQR0; PDB:2BIVA; DFHWEEYLKETGSISAPSECFRQSQIPPVNDFKVGMKLEARDPRNATSVCIATVIGITGA --3333----------3333------------2222-----1111------------!!! RLRLRLDGSDNRNDFWRLVDSPDIQPVGTCEKEGDLLQPPLGYQMNTSSWPMFLLKTLNG !----2222--------1111----2222-1111-----------3333--------222 SEMASATLFKKEPPKPPLNNFKVGMKLEAIDKKNPYLICPATIGDVKGDEVHITFDGWSG 2---3333-------------2222-----1111------------!!!!----222233 AFDYWCKYDSRDIFPAGWCRLTGDVLQPPGTS 33----1111----2222-------------- >APOCAROTENOID-CLEAVING OX; SWP:P74334; PDB:2BIWA; QRSYSPQDWLRGYQSQPQEWDYWVEDVEGSIPPDLQGTLYRNGPGLLEIGDRPLKHPFDG ----------1111-----------------1111-------------!!!!---3333- DGMVTAFKFPGDGRVHFQSKFVRTQGYVEEQKAGKMIYRGVFGSQPAGGWLKTIFDLRLK ---------------------------------------1111-----33332222---- NIANTNITYWGDRLLALWEGGQPHRLEPSNLATIGLDDLGGILAEGQPLSAHPRIDPAST ---------%%%%----1111----------------------2222------------- FDGGQPCYVTFSIKSSLSSTLTLLELDPQGKLLRQKTETFPGFAFIHDFAITPHYAIFLQ -iiii---------------------1111---------------------1111----- NNVTLNGLPYLFGLRGAGECVQFHPDKPAQIILVPRDGGEIKRIPVQAGFVFHHANAFEE ---------1111--3333----1111--------------------------------i NGKIILDSICYNSLPQVDTDGDFRSTNFDNLDPGQLWRFTIDPAAATVEKQLMVSRCCEF iii------------------3333-3333------------1111-------------- PVVHPQQVGRPYRYVYMGAAHHSTGNAPLQAILKVDLESGTETLRSFAPHGFAGEPIFVP ---1111----------------------------------------------------- RPGGVAEDDGWLLCLIYKADLHRSELVILDAQDITAPAIATLKLKHHIPYPLHGSWAQT 2222-1111--------------------3333-------------------------- >ACETYLCHOLINE-BINDING PRO; SWP:NA; PDB:2BJ0A; QIRWTLLNQITGESDVIPLSNNTPLNVSLNFKLMNIVEADTEKDQVEVVLWTQASWKVPY --3333----1111--------------------------1111-------------333 YSSLLSSSSLDQVSLPVSKMWTPDLSFYNAIAAPELLSADRVVVSKDGSVIYVPSQRVRF 3--------------3333-------1111--------------1111------------ TCDLINVDTEPGATCRIKVGSWTHDNKQFALITGEEGVVNIAEYFDSPKFDLLSATQSLN ---1111-3333---------------------1111--1111----------------- RKKYSCCENMYDDIEITFAFRKK 
---3333---------------- >NICKEL RESPONSIVE REGULAT; SWP:O58316; PDB:2BJ7A; MELIRFSISIPSKLLEKFDQIIEEIGYENRSEAIRDLIRDFIIRHEWEVGNEEVAGTITI ----------------------------3333----------1111-------------- VYNHDEGDVVKALLDLQHEYLDEIISSLHVHMDEHNCLEVIVVKGEAKKIKMIADKLLSL --1111-----------------------------------------------------2 KGVKHGKLVMTSTGKEL 222-------------- >3-PHOSPHOSHIKIMATE 1-CARB; SWP:P22487; PDB:2BJBA; KTWPAPTAPTPVRATVTVPGSKSQTNRALVLAALAAAQGRGASTISGALRSRDTELMLDA -----------------------------------1111--------------------- LQTLGLRVDGVGSELTVSGRIEPGPGARVDCGLAGTVLRFVPPLAALGSVPVTFDGDQAR ----------!!!!---------2222---!!!!3333---3333------------111 GRPIAPLLDALRELGVAVDGTGLPFRVRGNGSLAGGTVAIDASASSQFVSGLLLSAASFT 1----------1111-------------------------3333--------------11 DGLTVQHTGSSLPSAPHIAMTAAMLRQAGVDIDDSTPNRWQVRPGPVAARRWDIEPDLTN 11-----------------------1111------2222--------------------- AVAFLSAAVVSGGTVRITGWPRVSVQPADHILAILRQLNAVVIHADSSLEVRGPTGYDGF --------1111--------------3333-----1111--------------------- DVDLRAVGELTPSVAALAALASPGSVSRLSGIAHLRGHETDRLAALSTEINRLGGTCRET ---11111111------11112222--------3333-------------1111-----1 PDGLVITATPLRPGIWRAYADHRMAMAGAIIGLRVAGVEVDDIAATTKTLPEFPRLWAEM 111--------------%%%%---------3333-------3333----1111------- VG -- >LACTOSE OPERON REPRESSOR; SWP:P03023; PDB:2BJCA; MKPVTLYDVAEYAGVSVATVSRVVNQASHVSAKTREKVEAAMAELNYIPNRCAQQLAGKQ ---------------3333-------------------------------------1111 SL -- >ACYLPHOSPHATASE; SWP:Q97ZL0; PDB:2BJDA; MLKRMYARVYGLVQGVGFRKFVQIHAIRLGIKGYAKNLPDGSVEVVAEGYEEALSKLLER --------------------------1111-------1111------------------- IKQGPPAAEVEKVDYSFSEYKGEFEDFETY ----3333---------------------- >CHOLOYLGLYCINE HYDROLASE; SWP:P54965; PDB:2BJFA; CTGLALETKDGLHLFGRNMDIEYSFNQSIIFIPRNFKCVNKSNKKELTTKYAVLGMGTIF -------1111------------------------------------------------i DDYPTFADGMNEKGLGCAGLNFPVYVSYSKEDIEGKTNIPVYNFLLWVLANFSSVEEVKE iii-------1111------------------2222---3333----------------1 
ALKNANIVDIPISENIPNTTLHWMISDITGKSIVVEQTKEKLNVFDNNIGVLTNSPTFDW 111-----------------------1111-------3333---------------3333 HVANLNQYVGLRYNQVPEFKLGDQSLTALGQGTGLVGLPGDFTPASRFIRVAFLRDAMIK --3333-1111---------!!!!-------3333------------------------- NDKDSIDLIEFFHILNNVAMVRGSTRTVEEKSDLTQYTSCMCLEKGIYYYNTYENNQINA -1111-3333-----1111-2222--1111------------1111-----3333----- IDMNKENLDGNEIKTYKYNKTLSINHVN -1111-1111------------------ >INOSITOL-1(OR 4)-MONOPHOS; SWP:P20456; PDB:2BJIA; DPWQECMDYAVTLAGQAGEVVREALKNEMNIMVKSSPADLVTATDQKVEKMLITSIKEKY 3333------------------3333---------1111--------------------1 PSHSFIGEESVAAGEKSILTDNPTWIIDPIDGTTNFVHGFPFVAVSIGFVVNKKMEFGIV 111-------1111------------------------------------%%%%------ YSCLEDKMYTGRKGKGAFCNGQKLQVSHQEDITKSLLVTELGSSRTPETVRIILSNIERL -----------2222---iiii--------1111-------------------------1 LCLPIHGIRGVGTAALNMCLVAAGAADAYYEMGIHCWDVAGAGIIVTEAGGVLLDVTGGP 111-----------------1111----------3333-3333---1111----1111-- FDLMSRRVIASSNKTLAERIAKEIQIIPLQRDDED -1111------------------------------ >1-PYRROLINE-5-CARBOXYLATE; SWP:Q5SI02; PDB:2BJKA; MTVEPFRNEPIETFQTEEARRAMREALRRVREEFGRHYPLYIGGEWVDTKERMVSLNPSA ------------------------------1111-------iiii-----------1111 PSEVVGTTAKAGKAEAEAALEAAWKAFKTWKDWPQEDRSRLLLKAAALMRRRKRELEATL ----------------------------3333---------------------------- VYEVGKNWVEASADVAEAIDFIEYYARAALRYRYPAVEVVPYPGEDNESFYVPLGAGVVI -----------------------------------------2222--------------- APWNFPVAIFTGMIVGPVAVGNTVIAKPAEDAVVVGAKVFEIFHEAGFPPGVVNFLPGVG ----------------3333--------1111----------------2222------11 EEVGAYLVEHPRIRFINFTGSLEVGLKIYEAAGRLAPGQTWFKRAYVETGGKDAIIVDET 11-------1111-----------------1111-2222------------------111 ADFDLAAEGVVVSAYGFQGQKCSAASRLILTQGAYEPVLERVLKRAERLSVGPAEENPDL 1-------------------1111------3333-----------1111---3333---- GPVVSAEQERKVLSYIEIGKNEGQLVLGGKRLEGEGYFIAPTVFTEVPPKARIAQEEIFG -----------------------------------------------11111111----- 
PVLSVIRVKDFAEALEVANDTPYGLTGGVYSRKREHLEWARREFHVGNLYFNRKITGALV --------------------------------3333------------------------ GVQPFGGFKLSGTNAKTGALDYLRLFLEMKAVAERF -------!!!!-------3333-1111--------- >TRAFFICKING PROTEIN PARTI; SWP:NA; PDB:2BJNA; GSMADEALFLLLHNEMVSGVYKSAEQGEVENGRCITKLENMGFRVGQGLIERFELDIMKF ------------------------22221111-----------------1111------- ICKDFWTTVFKKQIDNLRTNHQGIYVLQDNKFRLLTQMEHASKYLAFTCGLIRGGLSNLG --------------------2222-------1111---11111111----------1111 IKSIVTAEVSSMPACKFQVMIQK ----------------------- >ORGANIC HYDROPEROXIDE RES; SWP:P80242; PDB:2BJOA; ALFTAKVTARGGRAGHITSDDGVLDFDIVMPNAAAAGQTGTNPEQLFAAGYAACFGGALE ------------------1111-------11111111----------------------- HVAKEQNIEIDSEIEGQVSLMKDESDGGFKIGVTLVVNTKDLDREKAQELVNAAHEFCPY ---1111-------------------------------!!!!------------------ SKATRGNVDVKLELK ---2222-------- >MFP2A; SWP:Q7YXK2; PDB:2BJQA; EFEDTWAYNTIGSPFPDNPVRVKGQQNMYVALWYKFGKPIHGRAWNDNGNVECSFPYNKV ---------2222--------2222---------iiii--------iiii---------- ELTGARDLGGQIQILTATEQDPTEQFKKTGFWYEWRPYKDRVNDQLLQLVRCGQSTPVIM ---3333-----------------------------3333-----------!!!!----- KTKDGKDLLGYIDMSTEVAAVGVSGKSEQVAGGPIQDMLVLFRNVKAPPKGIKIYDDTWL -1111-----------------iiii-----3333------------------------- DLKYRDPFPAARNPIAAGGRKVKSDDGTEMFQYVALWYEHGQPVFGRAYPDSADKTLANF --2222--3333---2222----1111-----------iiii--------1111------ GWGGQENAGAEIGSFQMLVVPDPDILGFEYKWIPYKEAKAGGPFKPLHVGECTPCLLKDA -iiii---3333---------1111-----------------------!!!!------11 NGTERLGNLHMGMEKATAGLAGKDSAVSGPAVGDFLVLCR 11--------1111-----iiii-----3333-------- >MFP2B; SWP:Q7YXJ9; PDB:2BJRA; AKEDTWAFGPIGSPFPDNPVKALGQQNYVALWYKNGRPHGRAWNNGGVIECSFPYNKSEL ---------2222--------2222--------iiii-------iiii------%%%%-- TGVKDLGGQIQVLQYKGNHLSLGYWYNWIKYSDRFDKDKGAELRCGDSFPILWSERPGGA -3333------------3333--------33333333-------!!!!----1111---- LLGYADNKTEIARFSHDGKVDEVSGSALANLIIARELKGGPPYCECEECKSEPPKPIVRV ---------------iiii-----3333------------1111---------------- 
TLNEWADFRCGDPWPTVGTPVRALGRSLDTLPGENPDQYVALWYQSGEPVGRIWNDGGKI --------2222---------2222-----2222----------iiii-------iiii- AACFGWGGHEYRQKIGSIQILYELPEAIRGFDYDWKPFPEAAQFGAKEWIPVHVDHHKGN -----iiii---------------3333--------3333--3333---------1111- ISPAVLIVDGKEILGKADIRNERATIGYGGTEKVLVGPAVHSCVLCRKAKPGCTID -------iiii-------1111-----iiii----!!!!1111------2222--- >PLASMEPSIN II; SWP:P46925; PDB:2BJUA; SSNDNIELVDFQNIMFYGDAEVGDNQQPFTFILDTGSANLWVPSVKCTTAGCLTKHLYDS ----------%%%%--------1111-----------------1111-3333------33 SKSRTYEKDGTKVEMNYVSGTVSGFFSKDLVTVGNLSLPYKFIEVIDTNGFEPTYTASTF 331111--------------------------!!!!-----------3333--------- DGILGLGWKDLSIGSVDPIVVELKNQNKIENALFTFYLPVHDKHTGFLTIGGIEERFYEG -------3333------------1111------------2222----------3333--- PLTYEKLNHDLYWQITLDAHVGNIMLEKANCIVDSGTSAITVPTDFLNKMLQNLDVIKVP --------------------!!!!---------1111-----------1111------22 FLPFYVTLCNNSKLPTFEFTSENGKYTLEPEYYLQHIEDVGPGLCMLNIIGLDFPVPTFI 22-----1111---------1111----3333----33332222---------------- LGDPFMRKYFTVFDYDNHSVGIALAKKNL --------------1111----------- >PSP OPERON TRANSCRIPTIONA; SWP:P37344; PDB:2BJVA; GEANSFLEVLEQVSHLAPLDKPVLIIGERGTGKELIASRLHYLSSRWQGPFISLNCAALN ---------------3333--------2222------------1111-------3333-- ENLLDSELFGHERHPGRFERADGGTLFLDELATAPMMVQEKLLRVIEYGELERVGGPLQV ------------------1111-------3333--------------------------- NVRLVCATNADLPAMVNEGTFRADLLDALAFDVVQLPPLRERESDIMLMAEYFAIQMCRE -------------------------------------33333333--------------- IKLPLFPGFTERARETLLNYRWPGNIRELKNVVERSVYRHGTSDYPLDDIIIDPFKR ---------------------1111---------------------------3333- >MAJOR ALLERGEN API G 1; SWP:P49372; PDB:2BK0A; GVQTHVLELTSSVSAEKIFQGFVIDVDTVLPKAAPGAYKSVEIKGDGGPGTLKIITLPDG -------------3333----------------1111----------2222--------- GPITTMTLRIDGVNKEALTFDYSVIDGDILLGFIESIENHVVLVPTADGGSICKTTAIFH --------------1111---------------------------1111----------- TKGDAVVPEENIKYANEQNTALFKALEAYLIAN -!!!!---------------------------- >NON-HEME IRON-CONTAINING ; SWP:P80725; 
PDB:2BK6A; VDTKEFLNHQVANLNVFTVKIHQIGWYMRGHNFFTLHEKMDDLYSEFGEQMDEVAERLLA -----------------------------1111-------------------------11 IGGSPFSTLKEFLENASVEEAPYTKPKTMDQLMEDLVGTLELLRDEYKQGIELTDKEGDD 11---------------------------------------------------------- VTNDMLIAFKASIDKHIWMFKAFLGKAPLE ---------------------1111-1111 >TITIN HEART ISOFORM N2-B; SWP:Q8WZ42; PDB:2BK8A; GAMVSGQIMHAVGEEGGHVKYVCKIENYDQSTQVTWYFGVRQLENSEKYEITYEDGVAIL -------------2222-----------1111-----!!!!------------iiii--- YVKDITKLDDGTYRCKVVNDYGEDSSYAELFVKGVRE -----3333---------1111---------2222-- >CG9734-PA; SWP:Q9VF15; PDB:2BK9A; MNSDEVQLIKKTWEIPVATPTDSGAAILTQFFNRFPSNLEKFPFRDVPLEELSGNARFRA ----------------------------------33333333-111133331111----- HAGRIIRVFDESIQVLGQDGDLEKLDEIWTKIAVSHIPRTVSKESYNQLKGVILDVLTAA ------------1111-2222--------------1111-------------------11 SSLDESQAATWAKLVDHVYAIIFKAIDDDGNAK 11-------------------1111-1111--- >TAT-INTERACTING PROTEIN T; SWP:Q9BUP3; PDB:2BKAA; EALSKLREDFRMQNKSVFILGASGETGRVLLKEILEQGLFSKVTLIGRRKLTFDEEAYKN ----------1111------1111------------------------------3333-- VNQEVVDFEKLDDYASAFQGHDVGFCCLGTTRGKAGAEGFVRVDRDYVLKSAELAKAGGC ------333311113333------------3333---------------------1111- KHFNLLSSKGADKSSNFLYLQVKGEVEAKVEELKFDRYSVFRPGVLLCDRQESRPGEWLV -------22221111--------------------------------%%%%--------- RKFFGSLPDSWASGHSVPVVTVVRAMLNNVVRPRDKQMELLENKAIHDLGKA -1111--11113333-------------1111-------------------- >Fragile X mental retardat; SWP:Q06787; PDB:2BKDN; MEELVVEVRGSNGAFYKAFVKDVEDSITVAFENNWQPDRQIPFDVRFPPPVGYNKDINES ------------------------------------------------------------ DEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKPAT -------------------------!!!!----------------3333----------- KDTFKIKLDVP 3333------- >ZINC-FINGER PROTEIN NBR1 ; SWP:Q14596; PDB:2BKFA; AMEPQVTLNVTFKNEIQSFLVSDPENTTWADIEAMVKVSFDLNTIQIKYLDEENEEVSIN -----------!!!!-------3333------------------------1111------ SQGEYEEALKMAVKQGNQLQMQVHEG --------------%%%%-------- >SYNTHETIC 
CONSTRUCT ANKYR; SWP:NA; PDB:2BKGA; SDLGKKLLEAARAGQDDEVRILMANGADVNAEDTYGDTPLHLAARVGHLEIVEVLLKNGA --------------------------------1111-------------------1111- DVNALDFSGSTPLHLAAKRGHLEIVEVLLKYGADVNADDTIGSTPLHLAADTGHLEIVEV 1111-1111-3333--------------1111-1111-1111-------1111------- LLKYGADVNAQDKFGKTAFDISIDNGNEDLAEILQ ------1111-1111-------1111-----1111 >UNCONVENTIONAL MYOSIN; SWP:Q29122; PDB:2BKHA; GKPVWAPHPTDGFQVGNIVDIGPDSLTIEPLNQKGKTFLALINQVFPAEEDSKKDVEDNC ----------------------------------------1111------1111----11 SLMYLNEATLLHNIKVRYSKDRIYTYVANILIAVNPYFDIPKIYSSETIKSYQGKSLGTM 11---3333--------1111-----!!!!-----------1111----1111------- PPHVFAIADKAFRDMKVLKLSQSIIVSGESGAGKTENTKFVLRYLTESYGTGQDIDDRIV --3333----------------------2222---------------------------- EANPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKSSVVGGFVSHYLLEKSRICVQGKEER -3333--------1111--------------1111------------3333----2222- NYHIFYRLCAGASEDIRERLHLSSPDNFRYLNRGCTRYFANKETDKQILQNRKSPEYLKA -3333------------------33333333---------333311111111-------- GSLKDPLLDDHGDFIRMCTAMKKIGLDDEEKLDLFRVVAGVLHLGNIDFEEAGCNLKNKS ---------------------1111------------------1111---------3333 TQALEYCAELLGLDQDDLRVSLTTRVMLTTAGGAKGTVIKVPLKVEQANNARDALAKTVY -------------------------------------------3333------------- SHLFDHVVNRVNQCFPFETSSYFIGVLDIAGFEYFEHNSFEQFCINYCNEKLQQFFNERI ------------------------------------------------------------ LKEEQELYQKEGLGVNEVHYVDNQDCIDLIEARLVGILDILDEENRLPQPSDQHFTSAVH -----------------------------------------3333--------------- QKHKDHFRLSIPRKSKLAIHRNIRDDEGFIIRHFAGAVCYETTQFVEKNNDALHMSLESL -----1111-3333---1111--1111-----1111-----22223333----------- ICESRDKFIRELFEFISVGNKFKTQLNLLLDKLRSTGASFIRCIKPNLKMTSHHFEGAQI ---------3333-------------------1111------------------------ LSQLQCSGMVSVLDLMQGGFPSRASFHELYNMYKKYMPDKLARLDPRLFCKALFKALGLN --------------3333--------------3333---3333----------------3 EIDYKFGLTKVFFRPGKFAEFDQIMKSDPDHLAELVKRVNHWLICSRWKKVQWCSLSVIK 333---1111---2222------------------------------------------- LKNKIKY 
------- >AMINOGLYCOSIDE 3'-PHOSPHO; SWP:NA; PDB:2BKKB; SDLGKKLLEAARAGQDDEVRILMANGADVNANDWFGITPLHLVVNNGHLEIIEVLLKYAA ----------------------1111-1111--%%%%------1111-3333---1111- DVNASDKSGWTPLHLAAYRGHLEIVEVLLKYGADVNAMDYQGYTPLHLAAEDGHLEIVEV 1111-1111-------------------1111---------------------1111--- LLKYGADVNAQDKFGKTAFDISIDNGNEDLAEILQK ------3333-1111-3333--1111--3333---- >PROLYL ENDOPEPTIDASE; SWP:Q9X5N2; PDB:2BKLA; SYPATRAEQVVDTLHGVQVADPYRWLEDEKAPEVQTWMTAQNAHAREALAKFPGREALAA -------------iiii---1111---1111-----------------1111-------- RFKELFYTDSVSTPSRRNGRFFYVRTHKDKEKAILYWRQGESGQEKVLLDPNGWSKDGTV ----------------iiii------1111-------------------3333------- SLGTWAVSWDGKKVAFAQKPNAADEAVLHVIDVDSGEWSKVDVIEGGKYATPKWTPDSKG -------1111--------%%%%---------1111----------1111----1111-- FYYEWLPTDPSIKVDERPGYTTIRYHTLGTEPSKDTVVHERTGDPTTFLQSDLSRDGKYL --------111133331111------22223333-------------------1111--- FVYILRGWSENDVYWKRPGEKDFRLLVKGVGAKYEVHAWKDRFYVLTDEGAPRQRVFEVD ----------------2222------------------%%%%-----2222--------3 PAKPARASWKEIVPEDSSASLLSVSIVGGHLSLEYLKDATSEVRVATLKGKPVRTVQLPG 333-3333------------------%%%%------%%%%------1111---------- VGAASNLMGLEDLDDAYYVFTSFTTPRQIYKTSVSTGKSELWAKVDVPMNPEQYQVEQVF ---------------------1111------------------------3333------- YASKDGTKVPMFVVHRKDLKRDGNAPTLLYGYGGFNVNMEANFRSSILPWLDAGGVYAVA --1111---------1111--------------%%%%------3333---1111------ NLRGGGEYGKAWHDAGRLDKKQNVFDDFHAAAEYLVQQKYTQPKRLAIYGGSNGGLLVGA -----1111---1111!!!!---------------1111--1111-----!!!!------ AMTQRPELYGAVVCAVPLLDMVRYHLFGSGRTWIPEYGTAEKPEDFKTLHAYSPYHHVRP ----3333-----------33331111-33333333--3333-------11111111--- DVRYPALLMMAADHDDRVDPMHARKFVAAVQNSPGNPATALLRIEANAGHGGADQVAKAI --------------------------------2222------------1111-------- ESSVDLYSFLFQVLDVQ -----------1111-- >TRUNCATED HEMOGLOBIN FROM; SWP:Q5L1S0; PDB:2BKMA; EQWQTLYEAIGGEETVAKLVEAFYRRVAAHPDLRPIFPDDLTETAHKQKQFLTQYLGGPP -----------------------------11111111----------------1111--- 
LYTAEHGHPMLRARHLRFEITPKRAEAWLACMRAAMDEIGLSGPAREQFYHRLVLTAHHM ---------3333--------------------------------------------333 VNTPDHLD 3------- >SENTRIN-SPECIFIC PROTEASE; SWP:Q96LD8; PDB:2BKRA; MDPVVLSYMDSLLRQSDVSLLDPPSWLNDHIIGFAFEYFANSQFHDSSDHVSFISPEVTQ -------!!!!--------------------------------3333------------- FIKCTSNPAEIAMFLEPLDLPNKRVVFLAINDNSNQAAGGSHWSLLVYLQDKNSFFHYDS --------------33333333-------------------------------------- HSRSNSVHAKQVAEKLEAFLGRKGDKLAFVEEKAPAQQNSYDCGMYVICNTEALCQNFFR 2222-----------------2222----------------------------------- QQTESLLQLLTPAYITKKRGEWKDLIATLAKK ----3333------------------------ >Importin subunit beta-1; SWP:Q06142; PDB:2BKUB; MSTAEFAQLLENSILSPDQNIRLTSETQLKKLSNDNFLQFAGLSSQVLIDENTKLEGRIL -------------------------------------------------1111------- AALTLKNELVSKDSVKTQQFAQRWITQVSPEAKNQIKTNALTALVSIEPRIANAAAQLIA -----1111---3333--------------------------1111-3333--------- AIADIELPHGAWPELMKIMVDNTGAEQPENVKRASLLALGYMCESADALVSSSNNILIAI ------1111-3333--------3333--------------------3333--------- VQGAQSTETSKAVRLAALNALADSLIFIKNNMEREGERNYLMQVVCEATQAEDIEVQAAA -----------------------33333333---------------3333---------- FGCLCKIMSKYYTFMKPYMEQALYALTIATMKSPNDKVASMTVEFWSTICEEEIDIAYEL ----------3333--------------3333---------------------------- AQFPQSPLQSYNFALSSIKDVVPNLLNLLTRQNEDPEDDDWNVSMSAGACLQLFAQNCGN --1111-----3333----------3333------------------------------- HILEPVLEFVEQNITADNWRNREAAVMAFGSIMDGPDKVQRTYYVHQALPSILNLMNDQS -----------------------------1111---------------------1111-- LQVKETTAWCIGRIADSVAESIDPQQHLPGVVQACLIGLQDHPKVATNCSWTIINLVEQL -----------------1111-3333-----------1111------------------1 AEATPSPIYNFYPALVDGLIGAANRIDNEFNARASAFSALTTMVEYATDTVAETSASIST 111---3333-----------1111---%%%%-----------11113333--------- FVMDKLGQTMSVDENQLTLEDAQSLQELQSNILTVLAAVIRKSPSSVEPVADMLMGLFFR -------1111-------------------------------33333333---------- LLEKKDSAFIEDDVFYAISALAASLGKGFEKYLETFSPYLLKALNQVDSPVSITAVGFIA 
------3333------------------3333-3333--------11113333------- DISNSLEEDFRRYSDAMMNVLAQMISNPNARRELKPAVLSVFGDIASNIGADFIPYLNDI ------1111---------33331111---3333------------------3333---- MALCVAAQNTKPENGTLEALDYQIKVLEAVLDAYVGIVAGLHDKPEALFPYVGTIFQFIA ----3333------------------------------1111-33333333--------- QVAEDPQLYSEDATSRAAVGLIGDIAAMFPDGSIKQFYGQDWVIDYIKRTRSGQLFSQAT -------3333----------------------3333----------3333--------- KDTARWAREQQKRQLSL -------------1111 >ALANINE-GLYOXYLATE AMINOT; SWP:P43567; PDB:2BKWA; KSVDTLLIPGPIILSGAVQKALDVPSLGHTSPEFVSIFQRVLKNTRAVFKSAAASKSQPF -------------------3333----1111--------------------3333----- VLAGSGTLGWDIFASNFILSKAPNKNVLVVSTGTFSDRFADCLRSYGAQVDVVRPLKIGE ----3333----------1111---------------------1111---------2222 SVPLELITEKLSQNSYGAVTVTHVDTSTAVLSDLKAISQAIKQTSPETFFVVDAVCSIGC --3333--------------------------------------3333-----1111--- EEFEFDEWGVDFALTASQAIGAPAGLSISLCSSRFDYALNDSKNGHVHGYFSSLRRWTPI ---3333------------------------3333-1111---------11113333--- ENYEAGKGAYFATPPVQLINSLDVALKEILEEGLHKRWDLHRESDWFKDSLVNGLQLTSV ------------------------------------------------------------ SRYPSNSAHGLTAVYVADPPDVIAFLKSHGVVIAGGIHKDIGPKYIRIGHGVTACNKNLP --------------------------------------------------3333------ YKNCFDLIKLALQRK --------------- >GLUCOSAMINE-6-PHOSPHATE D; SWP:O35000; PDB:2BKXA; MKVMECQTYEELSQIAARITADTIKEKPDAVLGLATGGTPEGTYRQLIRLHQTENLSFQN --------------------------1111-------1111----------------111 ITTVNLDEYAGLSSDDPNSYHFYMNDRFFQHIDSKPSRHFIPNGNADDLEAECRRYEQLV 1-------22221111-----------3333---1111----1111-------------- DSLGDTDIQLLGIGRNGHIGFNEPGTSFKSRTHVVTLNEQTRQANARYFPSIDSVPKKAL -------------1111-----22221111--------------3333--3333------ TMGIQTILSSKRILLLISGKSKAEAVRKLLEGNISEDFPASALHLHSDVTVLIDREAASL ------------------3333------3333--3333----1111--------3333-- RP -- >DNA/RNA-BINDING PROTEIN A; SWP:P60849; PDB:2BKYA; SNVVLIGKKPVMNYVLAALTLLNQGVSEIVIKARGRAISKAVDTVEIVRNRFLPDKIEIK ---------3333--------------------!!!!---------------2222---- 
EIRVGSQVVTSQDGRQSRVSTIEIAIRKK ----------1111--------------- >DNA/RNA-binding protein A; SWP:Q97ZF4; PDB:2BKYX; KLNEIVVRKTKNVEDHVLDVIVLFNQGIDEVILKGTGREISKAVDVYNSLKDRLGDGVQL -----------------------1111--------!!!!--------------!!!!--- VNVQTGSEVRDRRRISYILLRLKRVY ---------%%%%------------- >MAJOR PLASMODIAL MYOSIN H; SWP:Q9BJD3; PDB:2BL0A; RRIGEIVKVVQAAARGWVERKHFRQAREKSVSARIIQDNIRAYLEFKNWAWWKLFAKARP ---------------------------------------------1111--------333 LLV 3-- >Myosin regulatory light c; SWP:P08053; PDB:2BL0B; TASADQIQECFQIFDKDNDGKVSIEELGSALRSLGKNPTNAELNTIKGQLNAKEFDLATF --------------1111----3333----3333-------------------------- KTVYRKPIKTPTEQSKEMLDAFRALDKEGNGTIQEAELRQLLLNLGDALTSSEVEELMKE ---------3333--------33331111----------------------------111 VSVSGDGAINYESFVDMLVTGYPLA 1--1111------------------ >Myosin regulatory light c; SWP:Q8WSQ4; PDB:2BL0C; GDDQVSEFKEAFELFDSERTGFITKEGLQTVLKQFGVRVEPAAFNEMFNEADATGNGKIQ ---------------1111--------------------------------1111----- FPEFLSMMGRRMKQTTSEDILRQAFRTFDPEGTGYIPKAALQDALLNLGDRLKPHEFAEF -----------1111---------3333-------------------------------- LGITETEKGQIRYDNFINTMFT ------------33331111-- >MGC83862 PROTEIN; SWP:Q32NN2; PDB:2BL5A; QLQEKLYVPVKEYPDFNFVGRILGPRGLTAKQLEAETGCKIMVRGKGSMRDKKKEEQNRG --------333311113333-----3333------------------------------- KPNWEHLNEDLHVLITVEDAQNRAELKLKRAVEEVKKLLVPAAEGEDSLKKMKLMELAIL -11111111-------------------------------------3333----3333-- NGTYRDANLKSPALH 3333--1111----- >NUCLEOCAPSID PROTEIN P11; SWP:P69732; PDB:2BL6A; QTCYNCGKPGHLSSQCRAPKVCFKCKQPGHFSKQCRS ------------------3333------3333----- >ENTEROCINE A IMMUNITY PRO; SWP:Q47785; PDB:2BL7A; KNAKQIVHELYNDISISKDPKYSDILEVQKVYLKLEKQKYELDPSPLINRLVNYLYFTAY -------------3333-3333---------------1111------------------- TNKIRFTEYQEELIRNSE ------------------ >ENTEROCINE A IMMUNITY PRO; SWP:Q3Y0D8; PDB:2BL8A; NAKQIVHELYNDISISKDPKYSDILEVLQKVYLKLEPSPLINRLVNYLYFTAYTNKIRFT ------------3333-3333--------------------------------------- 
EYQEELIRNLSEIGRTAGINGLYRADYGDKSQF ---------------2222------22221111 >DIHYDROFOLATE REDUCTASE-T; SWP:O02604; PDB:2BL9A; ENLSDVFDIYAICACCKVAPTSAGTKNEPFSPRTFRGLGNKGTLPWKCNSVDMKYFSSVT ----1111-------------------1111--------iiii----------------- TYVDESKYEKLKWKRERYLRMEAKLQNVVVMGRSSWESIPKQYKPLPNRINVVLSKTLTK ---3333--------------------------------3333--2222---------33 EDVKEKVFIIDSIDDLLLLLKKLKYYKCFIIGGAQVYRECLSRNLIKQIYFTRINGAYPC 33--------------------------------------1111---------------- DVFFPEFDESEFRVTSVSEVYNSKGTTLDFLVYSKV -------3333-----------iiii---------- >GMP REDUCTASE I; SWP:P36959; PDB:2BLEA; MPRIDADLKLDFKDVLLRPKRSSLKSRAEVDLERTFTFRNSKQTYSGIPIIVANMDTVGT ----------3333-----------1111-------------------------1111-3 FEMAAVMSQHSMFTAIHKHYSLDDWKLFATNHPECLQNVAVSSGSGQNDLEKMTSILEAV 333----------------------------3333------------------------3 PQVKFICLDVANGYSEHFVEFVKLVRAKFPEHTIMAGNVVTGEMVEELILSGADIIKVGV 333--------1111-------------1111--------3333----1111-------- GPGSVCTTRTKTGVGYPQLSAVIECADSAHGLKGHIISDGGCTCPGDVAKAFGAGADFVM --11113333-------------------1111------------------3333----- LGGMFSGHTECAGEVIRKLKLFYGMSSDTAMNKHGVAEYRASEGKTVEVPYKGDVENTIL -3333--1111------------1111--------------------------------- DILGGLRSTCTYVGAAKLKELSRRATFIRVTQQHNTV ----------------33333333------------- >SULFITE:CYTOCHROME C OXID; SWP:Q9LA16; PDB:2BLFA; ADTVTLPFANGERPLVMYPGKRPLIGLTARPPQLETPFSVFDEGLITPNDAFFVRYHLAG -----------------2222---------------3333-------1111--------- IPLEIDPDAFRLEIKGKVGTPLSLSLQDLKNDFPASEVVAVNQCSGNSRGFVEPRVGGGQ -----3333----------------------------------11113333--------- LANGAMGNARWRGVPLKAVLEKAGVQAGAKQVTFGGLDGPVIPETPDFVKALSIDHATDG --------------3333-------2222------------3333-------3333---- EVMLAYSMNGADLPWLNGYPLRLVVPGYYGTYWVKHLNEITVIDKEFDGFWMKTAYRIPD -------iiii--1111-------22223333----------------1111-------- NACACTEPGKAPTATIPINRFDVRSFITNVENGASVKAGEVPLRGIAFDGGYGITQVSVS 1111--2222--------------------2222-------------------------- 
ADAGKSWTNATLDPGLGKYSFRGWKAVLPLTKGDHVLMCRATNARGETQPMQATWNPAGY -iiii-----------1111----------------------1111---------1111- MRNVVEATRVIAA ------------- >Sulfite:cytochrome c oxid; SWP:Q9LA15; PDB:2BLFB; APLTYELPDETAQLKPAPQPGFEAAQNNCAACHSVDYINTQPPGKGQAFWDAEVQKMIKV ---------------------------------33331111------------------- YHAPVDEADAKAIADYLAKTY --------------------- >PROTEIN YFBG; SWP:P77398; PDB:2BLLA; MRVLILGVNGFIGNHLTERLLREDHYEVYGLDIGSDAISRFLNHPHFHFVEGDISIHSEW ----------------------------------33331111-1111-----1111-333 IEYHVKKCDVVLPLVAIATPIEYTRNPLRVFELDFEENLRIIRYCVKYRKRIIFPSTSEV 3-----------------3333----------------------------------3333 YGMCSDKYFDEDHSNLIVGPVNKPRWIYSVSKQLLDRVIWAYGEKEGLQFTLFRPFNWMG !!!!-----1111------11111111--------------------------------- PRLDNLNAARIGSSRAITQLILNLVEGSPIKLIDGGKQKRCFTDIRDGIEALYRIIENAG ----1111-----------------------2222--------3333----------222 NRCDGEIINIGNPENEASIEELGEMLLASFEKHPLRHHFPPFAGFRVVVEHRKPSIRNAH 2-2222-----1111-----------------1111------------------------ RCLDWEPKIDMQETIDETLDFFLRTVDLTD ----------------------11113333 >PROTEIN YFBG; SWP:P77398; PDB:2BLNA; MKTVVFAYHDMGCLGIEALLAAGYEISAIFTHTDFYGSVARLAAERGIPVYAPDNVNHPL -------------------1111------------------------------------- WVERIAQLSPDVIFSFYYRHLIYDEILQLAPAGAFNLHGSLLPKYRGRAPLNWVLVNGET -----1111-------------33333333------------------3333-------- ETGVTLHRMVKRADAGAIVAQLRIAIAPDDIAITLHHKLCHAARQLLEQTLPAIKHGNIL -----------2222-----------1111-----------------------1111--- EIAQRENEATCFGRRTPDDSFLEWHKPASVLHNMVRAVADPWPGAFSYVGNQKFTVWSSR ----3333-------3333---3333----------------------!!!!-------- VHPHASKAQPGSVISVAPLLIACGDGALEIVTGQAGDGITMQGSQLAQTLGLVQGSRL --------2222----------------------!!!!--------------2222-- >ELONGATION FACTOR G; SWP:P13551; PDB:2BM0A; KVEYDLKRLRNIGIAAHIDAGKTTTTERILYYTGRIHKIATITAAVTTCFWKDHRINIID ----3333--------2222----------3333----------------%%%%------ APGHVDFTIEVERSMRVLDGAIVVFDSSQGVEPQSETVWRQAEKYKVPRIAFANKMDKTG 
------3333---------------1111--1111-------------------3333-- ADLWLVIRTMQERLGARPVVMQLPIGREDTFSGIIDVLRMKAYTYGNDLGTDIREIPIPE ---------------------------1111---------------------------11 EYLDQAREYHEKLVEVAADFDENIMLKYLEGEEPTEEELVAAIRKGTIDLKITPVFLGSA 11--------------3333--------------------------1111---------1 LKNKGVQLLLDAVVDYLPSPLDIPPIKGTTPEGEVVEIHPDPNGPLAALAFKIMADPYVG 111---------------3333-------3333-------1111---------------- RLTFIRVYSGTLTSGSYVYNTTKGRKERVARLLRMHANHREEVEELKAGDLGAVVGLKET ------------2222-------------------1111-------2222---------- ITGDTLVGEDAPRVILESIEVPEPVIDVAIEPKTKADQEKLSQALARLAEEDPTFRVSTH 2222-----------------------------3333----------------------1 PETGQTIISGMGELHLEIIVDRLKREFKVDANVGKPQVAYRETITKPVDVEGKFIRQTGG 111---------------------1111-----------------------------iii RGQYGHVKIKVEPLPRGSGFEFVNAIVGGVIPKEYIPAVQKGIEEAMQSGPLIGFPVVDI i-------------2222-------------3333---------3333------------ KVTLYDGSYHEVDSSEMAFKIAGSMAIKEAVQKGDPVILEPIMRVEVTTPEEYMGDVIGD -------------------------------------------------3333------- LNARRGQILGMEPRGNAQVIRAFVPLAEMFGYATDLRSKTQGRGSFVMFFDHYQEVPKQV -1111--------!!!!-------33332222-------iiii----------------- QEKLIK -1111- >SCAFFOLDING DOCKERIN BIND; SWP:NA; PDB:2BM3A; KASSIELKFDRNKGEVGDILIGTVRINNIKNFAGFQVNIVYDPKVLMAVDPETGKEFTSS --------------2222----------2222---------3333------------111 TFPPGRTVLKNNAYGPIQIADNDPEKGILNFALAYSYIAGYKETGVAEESGIIAKIGFKI 1---------3333--------3333---------------3333--------------- LQKKSTAVKFQDTLSMPGAISGTQLFDWDGEVITGYEVIQPD ------------1111---iiii------------------- >PENTAPEPTIDE REPEAT FAMIL; SWP:O50390; PDB:2BM5A; MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW -------------2222-2222-----------2222-2222----------------22 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 22------2222----------------2222-2222-2222-2222-2222-2222-22 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 22-2222-2222-2222-2222-2222--3333-----2222-----------1111--- AGG --- >CEPHALOSPORIN 
HYDROXYLASE; SWP:O85726; PDB:2BM8A; NDYSRQNFQDLNLFRGLGEDPAYHPPVLTDRPRDWPLDRWAEAPRDLGYSDFSPYQWRGL ---------33332222------------------33331111-------------iiii RMLKDPDTQAVYHDMLWELRPRTIVELGVYNGGSLAWFRDLTKIMGIDCQVIGIDRDLSR ----------------------------!!!!----------1111-----------111 CQIPASDMENITLHQGDCSDLTTFEHLREMAHPLIFIDNAHANTFNIMKWAVDHLLEEGD 1--3333---------333333333333-------------------------------- YFIIEDMIPYWYRYAPQLFSEYLGAFRDVLSMDMLYANASSQLDRGVLRRVA ------3333-------------1111--------11113333--------- >GLUTAMATE DEHYDROGENASE (; SWP:O96940; PDB:2BMAA; DQMNNVYERVMKLDPNQVEFLQAFHEILYSLKPLFMEEPKYLPIIETLSEPERAIQFRVC -------------------------------------3333------------------- WLDDNGVQRKNRCFRVQYNSALGPYKGGLRFHPSVNLSIVKFLGFEQIFKNSLTGLSMGG --1111-------------------------3333-------------3333-------- GKGGSDFDPKGKSDNEILKFCQAFMNELYRHIGPCTDVPAGDIGVGGREIGYLYGQYKKI --------2222--------------3333------------------------------ VNSFNGTLTGKNVKWGGSNLRVEATGYGLVYFVLEVLKSLNIPVEKQTAVVSGSGNVALY -----------3333----3333--------------1111-1111--------3333-- CVQKLLHLNVKVLTLSDSNGYVYEPNGFTHENLEFLIDLKEEKKGRIKEYLNHSSTAKYF -----1111-------3333---3333-------------1111-3333----3333--- PNEKPWGVPCTLAFPCATQNDVDLDQAKLLQKNGCILVGEGANMPSTVDAINLFKSNNII ---3333----------2222--3333--------------------------1111--- YCPSKAANAGGVAISGLEMSQNFQFSHWTRETVDEKLKEIMRNIFIACSENALKYTKNKY --3333---------------1111---3333---------------------------- DLQAGANIAGFLKVAESYIEQGCF ------------------------ >FOLIC ACID SYNTHESIS PROT; SWP:P53848; PDB:2BMBA; SWKRAFLAFGSNIGDRFKHIQMALQLLSREKTVKLRNISSIFESEPMYFKDQTPFMNGCV -----------------------------1111---------------1111-------- EVETLLTPSELLKLCKKIEYEELQRTIDLDIVMFLNSAGEDIIVNEPDLNIPHPRMLERT ---------------------3333----------1111------3333---1111--11 FVLEPLCELISPVHLHPVTAEPIVDHLKQLYDKQHDEDTLWKLVPLPYRSGVEPRFLKFK 113333----1111-------3333-----33333333----------2222-------- TATKTNRITVSPTYIMAIFNATPDSFSDGGEHFADIESQLNDIIKLCKDALYLHESVIID 
------------------------1111-1111--------------------------- VGGCSTRPNSIQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAIKVGV ------2222-------------------------3333----------------1111- DIINDISGGLFDSNMFAVIAENPEICYILSHTRGDISTMNRLAHYENFALGDSIQQEFVH ----11111111---------1111---------11111111-----1111-------%% NTDIQQLDDLKDKTVLIRNVGQEIGERYIKAIDNGVKRWQILIDPGLGFAKTWKQNLQII %%333322221111-----------------1111-3333-----2222----------- RHIPILKNYSFTMNSNNSQVYVNLRNMPVLLGPSRKKFIGHITKDVDAKQRDFATGAVVA --------------iiii-----2222-----2222----------3333-3333----- SCIGFGSDMVRVHDVKNCSKSIKLADAIYKGLE --1111--------------------------- >RAS-RELATED PROTEIN RAB4A; SWP:P20338; PDB:2BMEA; SETYDFLFKFLVIGNAGTGKSCLLHQFIEKKFKDDSNHTIGVEFGSKIINVGGKYVKLQI --------------2222--------------------------------iiii------ WDTAGQERFRSVTRSYYRGAAGALLVYDITSRETYNALTNWLTDARMLASQNIVIILCGN -----3333-------2222-------1111------------------1111------- KKDLDADREVTFLEASRFAQENELMFLETSALTGENVEEAFVQCARKILNKIESGE 33331111-----------1111----------2222------------------- >RNA HELICASE; SWP:Q91H74; PDB:2BMFA; MASIEDNPEIEDDIFRKKRLTIMDLHPGAGKTKRYLPAIVREAIKRGLRTLILAPTRVVA -------------------------------------------------------3333- AEMEEALRGLPIRYQTGREIVDLMCHATFTMRLLSPIRVPNYNLIIMDEAHFTDPASIAA ----1111---------------------------------------------------- RGYISTRVEMGEAAGIFMTATPPGSRDPFPQSNAPIMDEEREIPERSWNSGHEWVTDFKG -------1111------------------------------------------------- KTVWFVPSIKAGNDIAACLRKNGKKVIQLSRKTFDSEYIKTRTNDWDFVVTTDISEMGAN -------3333---------------------------3333---------3333----- FKAERVIDPRRCMKPVILTDGEERVILAGPMPVTHSSAAQRRGRVGRNPKNENDQYIYMG ------------------------------------------------------------ EPLENDEDCAHWKEAKMLLDNINTPEGIIPSMFEPEREKVDAIDGEYRLRGEARKTFVDL -----1111--------3333--3333------1111-----2222-------------- MRRGDLPVWLAYRVAAEGINYADRRWCFDGVKNNQILEENVEVEIWTKEGERKKLKPRWL --------------1111-33333333---1111-------------------------- DARIYSDPLALKEFKEFAAGRK 3333--3333------------ >FAB FRAGMENT OF CATALYTIC; 
SWP:Q58EV6; PDB:2BMKA; DIELTQSPAIMAASPGEKVTITCSATSGVNYMHWFQQKPGTSPKLWIYSTSNLASAVPAR -------------2222--------------------2222------------2222333 FSGSGSGTSYSLTISRMEAEDAATYYCQQRSTYPFTFGGGTKLELKRADAAPTVSIFPPS 3----------------3333--------------------------------------3 SEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTL 3331111---------------------iiii---------------------------- TKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ----1111--------1111------------- >FAB FRAGMENT OF CATALYTIC; SWP:NA; PDB:2BMKB; EVKLQESGGGLVQPGHSLRLSCATSGFTFTDYYMSWVRQPPGKALEWLGLIRNKANGYTK ------------2222------------1111-------2222---------3333---- EYSASVKGRFTISRDNSQSILYLQMNALRAEDSATYYCVRDKGSYGNYEAWFAYWGQGTT ----------------------------3333--------------3333---------- VTVSSAKTTPPSVYPLAPGSQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL -------------------------------------------%%%%--2222------- QSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRD %%%%-------------------------1111----------- >THERMOSTABLE HEMOGLOBIN F; SWP:NA; PDB:2BMMA; MTFYEAVGGEETFTRLARRFYEGVAADPVLRPMYPEEDLGPAEERLRLFLMQYWGGPRTY --------1111-----------1111--1111------------------1111--333 SERRGHPRLRMRHFPYRIGAEERDRWLTHMRAAVDDLALPAHLEQQLWEYLVYAAYAMVN 3------3333-3333-----------------3333------------------3333- VPE --- >OXYGENASE-ALPHA NBDO; SWP:Q8RTL4; PDB:2BMOA; YQNLVSEAGLTQKLLIHGDKELFQHELKTIFARNWLFLTHDSLIPSPGDYVKAKMGVDEV -----2222-----1111---------------------3333--2222-----!!!!-- IVSRQNDGSVRAFLNVCRHRGKTLVHAEAGNAKGFVCGYHGWGYGSNGELQSVPFEKELY ----1111------------------------------------1111----2222---! 
GDAIKKKCLGLKEVPRIESFHGFIYGCFDAEAPPLIDYLGDAAWYLEPTFKYSGGLELVG !!!-3333-----------iiii-----1111------!!!!-3333------------- PPGKVVVKANWKSFAENFVGDGYHVGWTHAAALRAGQSVFSSIAGNAKLPPEGAGLQMTS ---------3333--------3333-1111------------2222---1111------1 KYGSGMGVFWGYYSGNFSADMIPDLMAFGAAKQEKLAKEIGDVRARIYRSFLNGTIFPNN 111--------1111--3333-------------3333--------1111---------- SFLTGSAAFRVWNPIDENTTEVWTYAFVEKDMPEDLKRRVADAVQRSIGPAGFWESDDNE ----------------------------1111---------------------------- NMETMSQNGKKYQSSNIDQIASLGFGKDVYGDECYPGVVGKSAIGETSYRGFYRAYQAHI -----------3333------2222---------------------------------11 SSSNWAEFENASRNWHI 11-----------1111 >Oxygenase-beta NBDO; SWP:Q8RTL3; PDB:2BMOB; MMINTQEDKLVSAHDAEEFHRFFVGHDSDLQQEVTTLLTREAHLLDIQAYKAWLEHFVAP ---33331111-----------1111------------------1111----------11 EIKYQVISRELRSTSERRYQLNDAVNLYNENYQQLKVRVEHQMDPQNWANNPKIRFTRFV 11-----------------------------------------11111111--------- TNVTAAKDKSAPEILHVRSNLILHRARRENQVDVFYATREDKWKRIEGGGIKLVERFVDY ----------1111-------------------------------2222----------- PERIPQTHNLLVFL -------------- >TOXIN BMTX2; SWP:Q9NII5; PDB:2BMT; FTNVSCSASSQCWPVCKKLFGTYRGKCMNSKCRCYS --------1111---------------%%%%----- >FLAVODOXIN; SWP:O25776; PDB:2BMVA; GKIGIFFGTDSGNAEAIAEKISKAIGNAEVVDVAKASKEQFNSFTKVILVAPTAGAGDLQ -------------------------------3333-3333-------------------- TDWEDFLGTLEASDFANKTIGLVGLGDQDTYSETFAEGIFHIYEKAKAGKVVGQTSTDGY ------111133331111--------33331111----------1111--------2222 HFEASKAVEGGKFVGLVIDEDNQDDLTDERISKWVEQVKGSFA ----3333iiii------33333333-----------3333-- >FERREDOXIN--NADP REDUCTAS; SWP:P21890; PDB:2BMWA; DVPVNLYRPNAPFIGKVISNEPLVKEGGIGIVQHIKFDLTGGNLKYIEGQSIGIIPPGVD -------3333-------------2222----------2222----2222---------1 KNGKPEKLRLYSIASTRHGDDVDDKTISLCVRQLEYKHPESGETVYGVCSTYLTHIEPGS 111------------1111-------------------------------------2222 EVKITGPVGKEMLLPDDPEANVIMLAGGTGITPMRTYLWRMFKDAERAANPEYQFKGFSW ----------------1111-----------------------------1111------- 
LVFGVPTTPNILYKEELEEIQQKYPDNFRLTYAISREQKNPQGGRMYIQDRVAEHADQLW ------33332222---------1111------1111--1111---3333---------- QLIKNQKTHTYICGPPPMEEGIDAALSAAAAKEGVTWSDYQKDLKKAGRWHVET ----1111------3333------------1111----------1111------ >ALKYL HYDROPEROXIDASE C; SWP:Q7BHK8; PDB:2BMXA; PLLTIGDQFPAYQLTALIGGDLSKVDAKQPGDYFTTITSDEHPGKWRVVFFWPKDFTFVC ---2222-------------3333----3333-----11112222--------------- PTEIAAFSKLNDEFEDRDAQILGVSIDSEFAHFQWRAQHNDLKTLPFPLSDIKRELSQAA --------------------------------------3333--------1111------ GVLNADGVADRVTFIVDPNNEIQFVSATAGSVGRNVDEVLRVLDALQS ---1111---------1111---------------------------- >RIPENING-ASSOCIATED PROTE; SWP:O22321; PDB:2BMZA; MNGAIKVGAWGGNGGSAFDMGPAYRIISVKIFSGDVVDGVDVTFTYYGKTETRHYGGSGG %%%%-----------------------------------------%%%%----------- TPHEIVLQEGEYLVGMAGEVANYHGAVVLGKLGFSTNKKAYGPFGNTGGTPFSLPIAAGK -------2222-----------iiii---------------------------------- ISGFFGRGGKFLDAIGVYLEP --------------------- >PSI; SWP:Q7JPS0; PDB:2BN5A; GADYSAQWAEYYRSVGKIEEAEAIEKTLKNKQN ----------------3333-----3333---- >CELL DIVISION ACTIVATOR C; SWP:P0AE62; PDB:2BN8A; SYVPRTEPAPPEHAIKMDSFRDVWMLRGKYVAFVLMGESFLRSPAFTVPESAQRWANQIR --------3333-------------iiii------------------3333--------- QEGEVTE ------- >URIDYLATE KINASE; SWP:P0A7F2; PDB:2BNEA; AKPVYKRILLKLSGEALQGTEGFGIDASILDRMAQEIKELVELGIQVGVVIGGGNLFRGA --------------11113333------------------------------1111---- GLAKAGMNRVVGDHMGMLATVMNGLAMRDALHRAYVNARLMSAIPLNGVCDSYSWAEAIS -------------------------------1111------------------------- LLRNNRVVILSAGTGNPFFTTDSAACLRGIEIEANVVLKATKVDGVFTADPPTATMYEQL -1111------!!!!--------------------------------------------- TYSEVLEKELKVMDLAAFTLARDHKLPIRVFNMNKPGALRRVVMGEKEGTLITE -------------3333--------------1111------1111--------- >MB2760; SWP:O33283; PDB:2BNGA; ETTEAIRAVEAFLNALQNEDFDTVDAALGDDLVYENVGFSRIRGGRRTATLLRRQGRVGF ------------------------33331111---2222--------------------- EVKIHRIGADGAAVLTERTDALIIGPLRVQFWVCGVFEVDDGRITLWRDYFDVYDFKGLL 
---------!!!!----------!!!!------------iiii--------3333----- RGLVALVVPS ---------- >RIBONUCLEASE INHIBITOR; SWP:P10775; PDB:2BNH; MNLDIHCEQLSDARWTELLPLLQQYEVVRLDDCGLTEEHCKDIGSALRANPSLTELCLRT ----------3333----3333-------------3333-------1111---------- NELGDAGVHLVLQGLQSPTCKIQKLSLQNCSLTEAGCGVLPSTLRSLPTLRELHLSDNPL ------------1111----------------33333333------1111---------- GDAGLRLLCEGLLDPQCHLEKLQLEYCRLTAASCEPLASVLRATRALKELTVSNNDIGEA -------------1111------------3333--------------------------- GARVLGQGLADSACQLETLRLENCGLTPANCKDLCGIVASQASLRELDLGSNGLGDAGIA --------------------------3333----------3333---------------- ELCPGLLSPASRLKTLWLWECDITASGCRDLCRVLQAKETLKELSLAGNKLGDEGARLLC --3333-----------------3333----------3333------------------- ESLLQPGCQLESLWVKSCSLTAACCQHVSLMLTQNKHLLELQLSSNKLGDSGIQELCQAL ----1111------------3333------1111-----------------------333 SQPGTTLRVLCLGDCEVTNSGCSSLASLLLANRSLRELDLSNNCVGDPGVLQLLGSLEQP 3----------------3333-------------------------------------33 GCALEQLVLYDTYWTEEVEDRLQALEGSKPGLRVIS 33---------------------------------- >MODULATOR PROTEIN RSBR; SWP:P42409; PDB:2BNLA; SNQTVYQFIAENQNELLQLWTDTLKELSEQESYQLTDQVYENISKEYIDILLLSVKDENA ---------------------------1111----------------------3333333 AESQISELALRAVQIGLSKFLATALAEFWKRLYTKNDKESTELIWQIDRFFSPINTEIFN 3----------------------------------------------------------- QYSISWE --3333- >EPOXIDASE; SWP:Q56185; PDB:2BNMA; KTASTGFAELLKDRREQVKMDHAALASLLGETPETVAAWENGEGGELTLTQLGRIAHVLG ---------------1111-------------------1111-1111---------1111 TSIGALTPPAGNDLDDGVIIQMPDERPILKGVRDNVDYYVYNCLVRTKRAPSLVPLVVDV -3333---------iiii---3333------!!!!1111-------3333---------- LTDNPDDAKFNSGHAGNEFLFVLEGEIHMKWGDKENPKEALLPTGASMFVEEHVPHAFTA ---3333-------------------------1111------2222----2222------ AKGTGSAKLIAVNF 2222---------- >T-CELL RECEPTOR ALPHA CHA; SWP:TCB_HUMAN; PDB:2BNUA; QEVTQIPAALSVPEGENLVLNCSFTDSAIYNLQWFRQDPGKGLTSLLLIQSSQREQTSGR -------------------------------------2222--------1111----!!! 
LNASLDKSSGRSTLYIAASQPGDSATYLCAVRPTSGGSYIPTFGRGTSLIVHPYIQNPDP !-----1111---------3333------------------------------------- AVYQLRRSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSN ---------------------3333------1111----------1111----------- KSDFACANAFNNSIIPEDTFFPS ----3333-------1111---- >T-cell receptor beta chai; SWP:TCB_HUMAN; PDB:2BNUB; GVTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPN -----------2222--------------------2222---------2222------22 GYNVSRSTTEDFPLRLLSAAPSQTSVYFCASSYVGNTGELFFGEGSRLTVLEDLKNVFPP 22-----------------3333-----------!!!!--------------3333---- EVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKEQPAL -------------------------------------iiii--2222---------3333 NDSRYALSSRLRVSATFWQDPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRA -------------3333--1111------------------------------------- D - >50S RIBOSOMAL PROTEIN L30; SWP:P29160; PDB:2BO1A; MVDIAFELRKVIDSGKYTLGYRKTVQSLKMGGSKLIIIARNTRPDRKEDLEYYARLSGTP --------------------------------------1111------------------ VYEFEGTNVELGTAVGKPHTVSVVSILDAGESRILALGGKE ------------1111------------!!!!3333----- >MANNOSYLGLYCERATE SYNTHAS; SWP:Q9RFR0; PDB:2BO4A; SLVVFPFKHEHPEVLLHNVRVAAAHPRVHEVLCIGYERDQTYEAVERAAPEISRATGTPV ------------------------1111-------------------------------- SVRLQERLGTLRPGKGDGMNTALRYFLEETQWERIHFYDADITSFGPDWITKAEEAADFG --------------------------------------1111---3333-------1111 YGLVRHYFPRASTDAMITWMITRTGFALLWPHTELSWIEQPLGGELLMRREVAAMLYEDE ----------11113333-----------11111111--1111---------------33 RVRRRSDWGIDTLYTFVTVQQGVSIYECYIPEGKAHRLYGGLDDLRTMLVECFAAIQSLQ 33----------------1111-------3333-----------------------1111 HEVVGQPAIHRQEHPHRVPVHIAERVGYDVEATLHRLMQHWTPRQVELLELFTTPVREGL ------------------3333-------------1111---------11113333---3 RTCQRRPAFNFMDEMAWAATYHVLLEHFQPGDPDWEELLFKLWTTRVLNYTMTVALRGYD 333-----3333----------------2222---------------------3333--- YAQQYLYRMLGRYRYQAALEN ----------------1111- >ATP SYNTHASE OLIGOMYCIN S; SWP:P13621; PDB:2BO5A; 
FAKLVRPPVQIYGIEGRYATALYSAASKQNKLEQVEKELLRVGQILKEPKMAASLLNPYV ------------3333---------3333-3333--3333-------3333--------- KRSVKVKSLSDMTAKEKFSPLTSNLINLLAENGRLTNTPAVISAFSTMMSVHRGEVPCTV ---------1111--------------1111-----3333------1111-3333----- >CARBOXYPEPTIDASE A4; SWP:Q9UI42; PDB:2BO9A; SSNNFNYGAYHSLEAIYHEMDNIAADFPDLARRVKIGHSFENRPMYVLKFSTGKGVRRPA -----1111-----------------1111--------1111------------------ VWLNAGIHSREWISQATAIWTARKIVSDYQRDPAITSILEKMDIFLLPVANPDGYVYTQT -------1111----------------2222----------------------------- QNRLWRKTRSRNPGSSCIGADPNRNWNASFAGKGASDNPCSEVYHGPHANSEVEVKSVVD -1111------1111-----1111----2222-----1111------2222--------- FIQKHGNFKGFIDLHSYSQLLMYPYGYSVKKAPDAEELDKVARLAAKALASVSGTEYQVG -------------------------------1111------------------------- PTCTTVYPASGSSIDWAYDNGIKFAFTFELRDTGTYGFLLPANQIIPTAEETWLGLKTIM 3333-------------1111--------------!!!!-1111---------------- EHVRDNL --1111- >Latexin; SWP:Q9BS40; PDB:2BO9B; MEIPPTNYPASRAALVAQNYINYQQGTPHRVFEVQKVKQASMEDIPGRGHKYRLKFAVEE ---1111-------------------1111------------------------------ IIQKQVKVNCTAEVLYPSTGQETAPEVNFTFEGETGKNPDEEDNTFYQRLKSMKEPLEAQ ------------------------------------------------------------ NIPDNFGNVSPEMTLVLHLAWVACGYIIWQNSTEDTWYKMVKIQTVKQVQRNDDFIELDY ---1111--3333-------------------1111------------------------ TILLHNIASQEIIPWQMQVLWHPQYGTKVKHNSRLPK ------1111-----------1111------------ >Endoglucanase E-2 [Precur; SWP:P26222; PDB:2BOGX; NDSPFYVNPNMSSAEWVRNNPNDPRTPVIRDRIASVPQGTWFAHHNPGQITGQVDALMSA -------1111--------1111---------1111---------2222----------- AQAAGKIPILVVSNAPGRDCGAPSHSAYRSWIDEFAAGLKNRPAYIIVEPDLISLMSSCM --------------------------------------%%%%------22221111---- QHVQQEVLETMAYAGKALKAGSSQARIYFDAGHSAWHSPAQMASWLQQADISNSAHGIAT ---------------------1111------------------------3333------- NTSNYRWTADEVAYAKAVLSAIGNPSLRAVIDTSRNGNGPAGNEWCDPSGRAIGTPSTTN 2222-------------------1111--------1111--------------------- TGDPMIDAFLWIKLPGEADGCIAGAGQFVPQAAYEMAIAA 
--1111-------2222------2222---------3333 >COAGULATION FACTOR X; SWP:P00742; PDB:2BOKA; IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLYQAKRFKVRVGAVHEV -------22221111----1111---------1111---3333----------------- EVVIKHNRFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDWAESTLMTQKTGIVSGFG -----1111--------------------2222--------------1111--------- RTHEKGEQSTRLKMLEVPYVDRNSCKLSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVT -------------------------1111----1111------------2222------- RFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDRSMKT -------------------2222-----3333----------- >Coagulation factor X; SWP:Q5JVE8; PDB:2BOKL; KLCSLDNGDCDQFCHENSVVCSCARGYTLADNGKACIPTGPYPCGKQTLE 3333-------------------2222--1111---------2222---- >SMALL HEAT SHOCK PROTEIN; SWP:Q7YZT0; PDB:2BOLA; SIFPTRDSRDLSSRRRSLIDWEFPQMALVPLDQVFDWAERSRQSLHDDIVNMHRNLFSLE -----------------1111-1111---------------------------------- PFTAMDNAFESVMKEMSAIQPREFHPELEYTQPGELDFLKDAYEVGKDGRLHFKVYFNVK ---------------1111-----1111-----1111--------1111----------- NFKAEEITIKADKNKLVVRAQKSVACGDAAMSESVGRSIPLPPSVDRNHIQATITTDDVL --1111------------------2222-------------33331111----------- VIEAPVNEPNYKAIKLSPEKGLAIQPSEVQERQLAVKNKEGLEIVTAEDGSKKIHLELKV ---------3333---------------------------------3333---------- DPHFAPKDVKVWAKGNKVYVHGVTGHREFYKAFVTPEVVDASKTQAEIVDGLMVVEAPLF 11111111-----!!!!----------------------3333----------------- K - >LIPID KINASE; SWP:P76407; PDB:2BONA; PASLLILNGKSTDNLPLREAIMLLREEGMTIHVRVTWEKGDAARYVEEARKFGVATVIAG -------------------------------------------------3333------- GGDGTINEVSTALIQCEGDDIPALGILPLGTANDFATSVGIPEALDKALKLAIAGDAIAI ------------1111--------------------1111-------------------- DMAQVNKQTCFINMATGGFGTRIALGSVSYIIHGLMRMDTLQPDRCEIRGENFHWQGDAL ----%%%%----------------3333---------------------2222------- VIGIGNGRQAGGGQQLCPNALINDGLLQLRIFTGDEILPALVSTLKSDEDNPNIIEGASS ----------------1111------------------------------1111------ WFDIQAPHDITFNLDGEPLSGQNFHIEILPAALRCRLPPDCPLLRST -------------iiii--------------------1111------ >URACIL-DNA GLYCOSYLASE; 
SWP:Q9RWH9; PDB:2BOOA; PIIPANLPEDWQEALLPEFSAPYFHELTDFLRQERKEYTIYPPAPDVFNALRYTPLGEVK --------------------------------3333------3333-3333---3333-- VLILGQDPYHGPNQAHGLSFSVRPGVRVPPSLRNIYKELTEDIPGFVAPKHGYLRSWAEQ ----------2222---2222-2222----------------2222-------3333--- GVLLLNAVLTVRAGQANSHQGKGWEHFTDAVIKAVNAKEERVVFILWGSYARKKKKLITG -----------2222-1111--3333--------3333----------3333-3333--1 KNHVVIESGHPSPLSEQYFFGTRPFSKTNEALEKAGRGPVEWQLPATVTE 111--------33331111-------------1111-------------- >PROTEIN (E2); SWP:P03122; PDB:2BOPA; SCFALISGTANQVKCYRFRVKKNHRHRYENCTTTWFTVADNGAERQGQAQILITFGSPSQ -----------------------1111-------------!!!!---------------- RQDFLKHVPLPPGMNISGFTASLDF ----------2222----------- >VERSATILE PEROXIDASE VPL2; SWP:O94753; PDB:2BOQA; ATCDDGRTTANAACCILFPILDDIQENLFDGAQCGEEVHESLRLTFHDAIGFSPTLGGGG --1111----3333---------------------------------1111--------- ADGSIIAFDTIETNFPANAGIDEIVSAQKPFVAKHNISAGDFIQFAGAVGVSNCPGGVRI --3333--3333--3333-3333------------------------------2222--- PFFLGRPDAVAASPDHLVPEPFDSVDSILARMGDAGFSPVEVVWLLASHSIAAADKVDPS -------------------1111---------1111---------3333--------333 IPGTPFDSTPEVFDSQFFIETQLKGRLFPGTADNKGEAQSPLQGEIRLQSDHLLARDPQT 3-------1111------3333-----------2222----2222--------------- ACEWQSMVNNQPKIQNRFAATMSKMALLGQDKTKLIDCSDVIPTPPALVGAAHLPAGFSL ----3333------------------22223333---1111-------------222233 SDVEQACAATPFPALTADP 33----3333--------- >SHIGA-LIKE TOXIN IIE B SU; SWP:Q47644; PDB:2BOSA; ADCAKGKIEFSKYNEDNTFTVKVSGREYWTNRWNLQPLLQSAQLTGMTVTIISNTCSSGS -------------1111-----iiii-----3333---------------------2222 GFAEVQFN -------- >EGF-LIKE MODULE CONTAININ; SWP:Q9UHX3; PDB:2BOUA; RGCARWCPQDSSCVNATACRCNPGFSSFSEIITTPMETCDDINECATSCGKFSDCWNTEG -------2222----------2222--------1111-----1111-----------222 SYDCVCSPGYEPVSGAKTFKNESENTCQDVDECSSGQHQCDSSTVCFNTVGSYSCRCRPG 2-----2222-1111-----3333------3333------1111----2222-------- WKPRHGIPNNQKDTVCE ---2222---------- >3-CHLOROCATECHOL 1,2-DIOX; SWP:Q8G9L3; PDB:2BOYA; 
STDRTGNIVGKMIAAINAVIKDEKVSYSEYKASTGWLISVGEKNEWPLFLDVFFEHAIES ----------------------------------------1111---------------- VAAESNRGSQSSIQGPYFIPGAPELSIPYTMPMRDDESGDTLIFRGEVVDQEGAPLADVL --1111-------------------------------------------1111------- LDMWQADAAGEYSFINPTLPDYLFRGKIRTDENGRFTLRTIVPAPYEIPKNGPTGALLAA ------1111-----11112222-------1111--------------1111------11 AGWHAWRPAHLHWIIAKEGYESLTTQLYFENGQWTGSDVANAVKPELLLSLDKIEAQSGP 11--------------2222--------1111-11111111--3333------------- HFETSYKFTLGKV ------------- >AFLATOXIN B1 ALDEHYDE RED; SWP:NA; PDB:2BP1A; RVASVLGTMEMGRRMDAPASAAAVRAFLERGHTELDTAFMYSDGQSETILGGLGLGLGGG -------1111----------------1111------1111iiii----1111--2222- DCRVKIATKANPWDGKSLKPDSVRSQLETSLKRLQCPQVDLFYLHAPDHGTPVEETLHAC ------------iiii-------------------------------11113333----- QRLHQEGKFVELGLSNYASWEVAEICTLCKSNGWILPTVYQGMYNATTRQVETELFPCLR ---1111-------------------------------------11113333-------- HFGLRFYAYNPLAGGLLTGKYKYEDKDGKQPVGRFFGNSWAETYRNRFWKEHHFEAIALV ---------1111-1111---3333-------1111-1111--------3333------- EKALQAAYGASAPSVTSAALRWMYHHSQLQGAHGDAVILGMSSLEQLEQNLAATEEGPLE -------!!!!------------------3333-------------------1111---3 PAVVDAFNQAWHLVAHECPNYFR 333----------3333------ >Capsid protein; SWP:P03641; PDB:2BPA1; SNIQTGAERMPHDLSHLGFLAGQIGRLITISTTPVIAGDSFEMDAVGALRLSPLRRGLAI -----------------------------------2222--------------------- DSTVDIFTFYVPHRHVYGEQWIKFMKDGVNATPLPTVNTTGYIDHAAFLGTINPDTNKIP -----------3333------------1111----------1111-1111---------- KHLFQGYLNIYNNYFKAPWMPDRTEANPNELNQDDARYGFRCCHLKNIWTAPLPPETELS ----------------1111------1111-3333------------------1111--- RQMTTSTTSIDIMGLQAAYANLHTDQERDYFMQRYRDVISSFGGKTSYDADNRPLLVMRS ---------------------------------3333---------3333---------- NLWASGYDVDGTDQTSLGQFSGRVQQTYKHSVPRFFVPEHGTMFTLALVRFPPTATKEIQ ------------1111-------------------------------------------3 YLNAKGALTYTDIAGDPVLYGNLPPREISMKDVFRSGDSSKKFKIAEGQWYRYAPSYVSP 333-----3333---3333---------3333-----1111----2222---------33 
AYHLLEGFPFIQEPPSGDLQERVLIRHHDYDQCFQSVQLLQWNSQVKFNVTVYRNLPTTR 33---------------3333----33331111------------------------111 DSIMTS 1----- >Major spike protein; SWP:P03643; PDB:2BPA2; MFQTFISRHNSNFFSDKLVLTSVTPASSAPVLQTPKATSSTLYFDSLTVNAGNGGFLHCI ------------------------------------------------------------ QMDTSVNAANQVVSVGADIAFDADPKFFACLVRFESSSVPTTLPTAYDVYPLNGRHDGGY --------------------------------------------------------!!!! YTVKDCVTIDVLPRTPGNNVYVGFMVWSNFTATKCRGLVSLNQVIKEIICLQPLK --------------1111---------------------------------1111 >DECTIN-1; SWP:Q6QLQ4; PDB:2BPDA; QSCLPNWIMHGKSCYLFSFSGNSWYGSKRHCSQLGAHLLKIDNSKEFEFIESQTSSHRIN ---2222--!!!!------------------1111-----------------33331111 AFWIGLSRNQSEGPWFWEDGSAFFPNSFQVRNAVPQESLLHNCVWIHGSEVYNQICNTSS ---------1111---1111--------------------------!!!!----1111-- YSICEKE ------- >FE-SUPEROXIDE DISMUTASE; SWP:Q8IAY6; PDB:2BPIA; VITLPKLKYALNALSPHISEETLNFHYNKHHAGYVNKLNTLIKDTPFAEKSLLDIVKESS ---------1111-------------------------------1111------------ GAIFNNAAQIWNHTFYWDSMGPDCGGEPHGEIKEKIQEDFGSFNNFKEQFSNILCGHFGS ------------------------------------------------------------ GWGWLALNNNNKLVILQTHDAGNPIKDNTGIPILTCDIWEHAYYIDYRNDRASYVKAWWN -------1111-------!!!!--1111----------3333----!!!!---------- LVNWNFANENLKKAMQK ----------------- >NADPH-CYTOCHROM P450 REDU; SWP:P16603; PDB:2BPOA; NRDIAQVVTENNKNYLVLYASQTGTAEGFAKAFSKELVAKFNLNVMCADVENYDFESLND --------1111------------------------------------1111--1111-- VPVIVSIFISTYGEGDFPDGAVNFEDFICNAEAGALSNLRYNMFGLGNSTYEFFNGAAKK -----------------2222----------22221111--------------------- AEKHLSAAGAIRLGKLGEADDGAGTTDEDYMAWKDSILEVLKDELHLDEQEAKFTSQFQY -----1111----------3333-------------------1111-------------- TVLNEITDSMSLGEPSAHYLPSHQLNRNADGIQLGPFDLSQPYIAPIVKSRELFSSNDRN ------1111-----33333333----1111------1111------------------- CIHSEFDLSGSNIKYSTGDHLAVWPSNPLEKVEQFLSIFNLDPETIFDLKPLDPTVKVPF ---------------2222----------------------3333-------3333---- 
PTPTTIGAAIKHYLEITGPVSRQLFSSLIQFAPNADVKEKLTLLSKDKDQFAVEITSKYF --------------------3333---3333------------------------1111- NIADALKYLSDGAKWDTVPMQFLVESVPQMTPRYYSISSSSLSEKQTVHVTSIVENFPNP ------------------3333-------------------------------------- ELPDAPPVVGVTTNLLRNIQLAQNNVNIAETNLPVHYDLNGPRKLFANYKLPVHVRRSNF -1111----------------1111-3333-----------%%%%--------------- RLPSNPSTPVIMIGPGTGVAPFRGFIRERVAFLESQKNVSLGKHILFYGSRNTDDFLYQD ----3333-----------------------1111----------------------111 EWPEYAKKLDGSFEMVVAHSRLPNTKKVYVQDKLKDYEDQVFEMINNGAFIYVCGDAKGM 1-----1111------------------3333------------1111--------2222 AKGVSTALVGILSRGKSITTDEATELIKMLKTSGRYQEDVW -------------1111-3333------------------- >ANTHRANILATE PHOSPHORIBOS; SWP:P66992; PDB:2BPQA; PSWPQILGRLTDNRDLARGQAAWAMDQIMTGNARPAQIAAFAVAMTMKAPTADEVGELAG -3333----1111---2222-------1111--3333----------------------- VMLSHAHPLPADTVPDDAVDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSL --1111--------1111-------iiii-------------1111--------1111-- SGGADTLEALGVRIDLGPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPTVFNL -------1111----------------------3333----2222----------3333- LGPLTNPARPRAGLIGCAFADLAEVMAGVFAARRSSVLVVHGDDGLDELTTTTTSTIWRV 3333-1111---------------------1111-------1111--------------- AAGSVDKLTFDPAGFGFARAQLDQLAGGDAQANAAAVRAVLGGARGPVRDAVVLNAAGAI iiii--------1111----3333---------------1111----------------- VAHAGLSSRAEWLPAWEEGLRRASAAIDTGAAEQLLARWVRFGRQ ------1111----------------1111-----------1111 >YUKD PROTEIN; SWP:NA; PDB:2BPSA; GSYIDITIDLKHYNGSVFDLRLSDYHPVKKVIDIAWQAQSVSMPPREGHWIRVVNKDKVF --------------------------3333----------------------3333---- SGECKLSDCGITNGDRLEIL 11113333------------ >Nucleoporin NUP1; SWP:P20676; PDB:2BPTB; NSSFTPSTVPNINFSATNLRPSDIFGANA -------------------33332222-- >14-3-3 BETA/ALPHA; SWP:P31946; PDB:2BQ0A; MDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERNLLSVAYKNVVGARRSSWR -3333-------------------------------3333-------------------- VISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLELLDKYLIPNATQPESKVFYLK 
---------------------------------------------1111----------- MKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYYE -------------!!!!-----------------------1111---------------- ILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTLIMQLLRDNLTLWTS ---------------------3333-3333-------------------- >BASIC CYTOCHROME C3; SWP:P94691; PDB:2BQ4A; PQVPADVVIDHLSNPNAKLEYKVKFSHKAHASLGTDAAACQKCHHKWDGKSEIGGCATEG ---------------3333------33333333--333333331111----------222 CHADTTSFKATEKDPKFLMTAFHSKSPMSCQGCHKEMKTAKKTTGPTACAQCHN 2-------1111-33333333--------------1111--------3333--- >Tartrate-resistant acid p; SWP:P13686; PDB:2BQ8X; ATPALRFVAVGDWGGVPNAPFHTAREMANAKEIARTVQILGADFILSLGDNFYFTGVQDI ----------------------------------------------------------11 NDKRFQETFEDVFSDRSLRKVPWYVLAGNHDHLGNVSAQIAYSKISKRWNFPSPFYRLHF 11------3333--3333------------1111-------11113333----------- KIPQTNVSVAIFMLDTVTLCGNSDDFLSQQPERPRDVKLARTQLSWLKKQLAAAREDYVL -2222---------3333-----1111--------------------------------- VAGHYPVWSIAEHGPTHCLVKQLRPLLATYGVTAYLCGHDHNLQYLQDENGVGYVLSGAG ----------3333--------3333-1111----------------1111--------- NFMDPSKRHQRKVPNGYLRFHYGTEDSLGGFAYVEISSKEMTVTYIEASGKSLFKTRLPR ------1111---2222------1111-------------------3333---------- RARP ---- >NEURONAL MIGRATION PROTEI; SWP:O43602; PDB:2BQQA; AKKVRFYRNGDRYFKGIVYAVSSDRFRSFDALLADLTRSLSDNINLPQGVRYIYTIDGSR -------2222----------3333--3333----------1111---------1111-- KIGSMDELEEGESYVCSSDNFFDDVEYTKNVNPNWS ---3333-2222-------------------3333- >INORGANIC PYROPHOSPHATASE; SWP:P56153; PDB:2BQXA; EVSHDADSLCVVIEISKHSNIKYELDKESGALMVDRVLYGAQNYPANYGFVPNTLGSDGD ----1111-------2222------------------------------------1111- PVDALVLSDVAFQAGSVVKARLVGVLNMEDESGMDEKLIALPIDKIDPTHSYVKDIDDLS ------------2222-------------1111---------3333---3333-3333-- KHTLDKIKHFFETYKDLEPNKWVKVKGFENKESAIKVLEKAIKAYQ ------------1111-2222---------------------3333 >14-3-3 PROTEIN EPSILON; SWP:P62258; PDB:2BR9A; DREDLVYQAKLAEQAERYDEMVESMKKVAGMDVELTVEERNLLSVAYKNVIGARRASWRI 
------------------------------------------------------------ ISSIEQKEENKGGEDKLKMIREYRQMVETELKLICCDILDVLDKHLIPAANTGESKVFYY -------------3333-----------------------------1111---------- KMKGDYHRYLAEFATGNDRKEAAENSLVAYKAASDIAMTELPPTHPIRLGLALNFSVFYY -----------------------------------------1111--------------- EILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWT ----------------------3333-3333------------------- >BIFUNCTIONAL POLYNUCLEOTI; SWP:Q96T60; PDB:2BRFA; GRLWLESPPGEAPPIFLPSDGQALVLGRGPLTQVTDRKCSRTQVELVADPETRTVAVKQL -------2222-------%%%%-------1111--3333----------1111------- GVNPSTTGQELKPGLEGSLGVGDTLYLVNGLHPLTLRWEE -----------2222----2222----%%%%--------- >ARABIDOPSIS THALIANA GENO; SWP:Q9LS02; PDB:2BRJA; KVQELSVYEINELDRHSPKILKNAFSLFGLGDLVPFTNKLYTGDLKKRVGITAGLCVVIE -----------------------------------------1111--------------- HVPEKKGERFEATYSFYFGDYGHLSVQGPYLTYEDSFLAITGGAGIFEGAYGQVKLQQLV -3333------------!!!!---------1111---------!!!!------------- YPTKLFYTFYLKGLANDLPLELTGTPVPPSKDIEPAPEAKALEPSGVISNYTN ------------------3333-------1111---------3333------- >FILAMIN A; SWP:P21333; PDB:2BRQA; GGAHKVRAGGPGLERAEAGVPAEFSIWTREAGAGGLAIAVEGPSKAEISFEDRKDGSCGV -3333----3333---2222-------1111---------------------1111---- AYVVQEPGDYEVSVKFNEEHIPDSPFVVPVASPS ---------------%%%%-2222---------- >Putative uncharacterized ; SWP:A0A5E3; PDB:2BRRH; VQLEQSGPELKKPGETVKISCKASGYTFTNYGMNWVKQAPGKGLKWMGWINTYTGEPTYA -----------2222-----------1111--------2222-----------------1 DDFKERFAFSLETSASAAYLQINNLKNEDTATYFCARDYYGSTYPYYAMDYWGQGTTVTV 111--------3333----------3333------------------------------- SSAKTTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQ --------------2222--!!!!------------------%%%%-------------i SDLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRGG iii---------3333------------1111------------ >Putative uncharacterized ; SWP:A0A5E3; PDB:2BRRL; ENVLTQSPAIMSASPGEKVTMTCRASSSVSSSYLHWYQQKSGASPKLWIYSTSNLASGVP -------------2222------------3333------2222------------22221 
ARFSGSGSGTSYSLTISSVEAEDAATYYCQQYSGYPYTFGGGTKLEIKRADAAPTVSIFP 111----!!!!--------1111------------------------------------- PSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTL ------------------------------iiii-------------------------- TLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC --3333------------1111------------- >NAD(P) transhydrogenase s; SWP:P0AB67; PDB:2BRUC; AEETAELLKNSHSVIITPGYGMAVAQAQYPVAEITEKLRARGINVRFGIHPVAGRLPGHM -------3333--------3333-----------------------------------33 NVLLAEAKVPYDIVLEMDEINDDFADTDTVLVIGANDTVNPAAQDDPKSPIAGMPVLEVW 33-------3333----------------------------------------------- KAQNVIVFKRSMNTGYAGVQNPLFFKENTHMLFGDAKASVDAILKAL --------------------3333-3333-----3333--------- >URIDYLATE KINASE; SWP:NA; PDB:2BRXA; HMRIVFDIGGSVLVPENPDIDFIKEIAYQLTKVSEDHEVAVVVGGGKLARKYIEVAEKFN --------1111--------------------------------------------1111 SSETFKDFIGIQITRANAMLLIAALREKAYPVVVEDFWEAWKAVQLKKIPVMGGTHPGHT ------------------------!!!!-------3333----1111------------- TDAVAALLAEFLKADLLVVITNVDGVYTADPKKDPTAKKIKKMKPEELLEIVGKSVIDPL -----------------------------11111111------3333------------- AAKIIARSGIKTIVIGKEDAKDLFRVIKGDHNGTTIEP ---------------3333------1111--------- >NEDD9 INTERACTING PROTEIN; SWP:Q8VDP3; PDB:2BRYA; TNPAHDHFETFVQAQLCQDVLSSFQGLCRALGVESGGGLSQYHKIKAQLNYWSAKSLWAK ----------------------------1111-------------1111-3333------ LDKRASQPVYQQGQACTNTKCLVVGAGPCGLRAAVELALLGARVVLVEKRIKFSRHNVLH ------3333iiii-1111----------------------------------------- LWPFTIHDLRALGAKKFYGRFCTGTLDHISIRQLQLLLLKVALLLGVEIHWGVKFTGLQP -3333-------3333-1111-!!!!---------------------------------- PPRKGSGWRAQLQPNPPAQLASYEFDVLISAAGGKFVPEGFTIREMRGKLAIGITANFVN --2222--------------------------1111-2222------------------- GRTVEETQVPEISGYNQKFFQSLLKATGIDLENIVYYKDETHYFVMTAKKQCLLRLGVLR --3333-----------------------------------------------1111--- QDLSETDQLLGKANVVPEALQRFARAAADFATHGKLGKLEFAQDARGRPDVAAFDFTSMM ----3333--3333-------------------1111------1111------------- 
RAESSARVQEKHGARLLLGLVGDCLVEPFWPLGTGVARGFLAAFDAAWMVKRWAEGAGPL ----------iiii-------1111---3333-3333----------------------- EVLAERESLYQLLSQTSPENMHRNVAQYGLDPATRYPNLNLRAVTPNQVQDLYDMMDKE -----------1111-3333---3333---3333----------33333333------- >QUINOL-FUMARATE REDUCTASE; SWP:P17412; PDB:2BS2A; MKVQYCDSLVIGGGLAGLRAAVATQQKGLSTIVLSLIPVKRSHSAAAQGGMQASLGNSKM -------------3333------3333----------11113333------------111 SDGDNEDLHFMDTVKGSDWGCDQKVARMFVNTAPKAIRELAAWGVPWTRIHKGDRMAIIN 1---------------------------------------1111---------------- AQKTTITEEDFRHGLIHSRDFGGTKKWRTCYTADATGHTMLFAVANECLKLGVSIQDRKE --------1111-------------------!!!!------------------------- AIALIHQDGKCYGAVVRDLVTGDIIAYVAKGTLIATGGYGRIYKNTTNAVVCEGTGTAIA ------%%%%--------------------------------------1111-------- LETGIAQLGNMEAVQFHPTPLFPSGILLTEGCRGDGGILRDVDGHRFMPDYEPEKKELAS 1111-----1111-------------------1111----1111--3333----!!!!-- RDVVSRRMIEHIRKGKGVQSPYGQHLWLDISILGRKHIETNLRDVQEICEYFAGIDPAEK -------------------1111------3333-3333-----------------1111- WAPVLPMQHYSMGGIRTDYRGEAKLKGLFSAGEAACWDMHGFNRLGGNSVSEAVVAGMIV -----------------1111---2222---3333----!!!!-2222------------ GEYFAEHCANTQVDLETKTLEKFVKGQEAYMKSLVESKGTEDVFKIKNRMKDVMDDNVGI ------------------------------------------------------------ FRDGPHLEKAVKELEELYKKSKNVGIKNKRLHANPELEEAYRVPMMLKVALCVAKGALDR ------------------3333-------------------------------------- TESRGAHNREDYPKRDDINWLNRTLASWPNPEQTLPTLEYEALDVNEMEIAPGYRGYGAK ---!!!!-1111-------------------------------3333------------- GNYIENPLSVKRQEEIDKIQSELEAAGKDRHAIQEALMPYELPAKYKARNERLGDK -----3333---------------------------------3333---------- >Fumarate reductase iron-s; SWP:P17596; PDB:2BS2B; MGRMLTIRVFKYDPQSAVSKPHFQEYKIEEAPSMTIFIVLNMIRETYDPDLNFDFVCRAG ------------1111-------------------------------3333--------- ICGSCGMMINGRPSLACRTLTKDFEDGVITLLPLPAFKLIKDLSVDTGNWFNGMSQRVES --------iiii--3333-3333-----------------!!!!---------------- WIHAQKEHDISKLEERIEPEVAQEVFELDRCIECGCCIAACGTKIMREDFVGAAGLNRVV 
--------1111----------------------3333--------1111---------- RFMIDPHDERTDEDYYELIGDDDGVFGCMTLLACHDVCPKNLPLQSKIAYLRRKMVSVNM -1111---------------11111111---------1111-------------1111-- >F17A-G ADHESIN; SWP:Q99003; PDB:2BSCA; AVSFIGSTENDVGPSLGSYSLPFVYTRNKIGYQNANVWHISKGFCVGLDGKVDLPVVGSL -------------------------!!!!------------------------------i DGQSIYGLTEEVGLLIWMGDTKYSRGTAMSGNSWENVFSGWCVGANTASTQGLSVRVTPV iii-----1111---------1111----------------------------------- ILKRNRYSVQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLND ---------------------iiii-------------------- >RECEPTOR BINDING PROTEIN; SWP:NA; PDB:2BSED; VQLQESGGGLVQAGGSLRLSCTASRRTGSNWCMGWFRQLAGKEPELVVALNFDYDMTYYA -----------2222-----------------------2222---------1111----3 DSVKGRFTVSRDSGKNTVYLQMNSLKPEDTAIYYCAARSGGFSSNRELYDGWGQGTQVTV 333----------------------1111------------------------------- SS -- >SIGMA C CAPSID PROTEIN; SWP:Q992I2; PDB:2BSFA; SLESTASHGLSFSPPLSVADGVVSLDMDPYFCSQRVSLTSYSAEAQLMQFRWMARGTNGS ---1111-----------%%%%-----1111----------------------------- SDTIDMTVNAHCHGRRTDYMMSSTGNLTVTSNVVLLTFDLSDITHIPSDLARLVPSAGFQ ------------!!!!----------------------3333--------1111--3333 AASFPVDVSFTRDSATHAYQAYGVYSSSRVFTITFPTGGDGTANIRSLTVRTGIDT -----------%%%%-----------1111-------------------------- >CHAPERONE PROTEIN SYCT; SWP:P0C2V9; PDB:2BSJA; TTFTELMQQLFLKLGLNHQVNENDVYTFEVDGHIQVLIACYHQQWVQLFSELGADLPTND --------------------1111----------------%%%%---------------- NLFGEHWPAHVQGRLDGKSILWSQQSLVGLDIDEMQAWLERFIDDIEQRKEPQNTKFQPN --------------iiii--------------------------------3333------ STSPILFI -------- ------------------------------------------------------------ ------------- >Mitochondrial import inne; SWP:P62072; PDB:2BSKB; LEVEADYNRTSACHRKCVPPHYKEAELSKGESVCLDRCVSKYLDIHERGKKLTELSQDE ---------------------------------------------------3333---- >HYPOTHETICAL PROTEIN INVO; SWP:Q9HZQ0; PDB:2BSNA; GSHPLPIPSLLIAGIGCRRGCSAEHLRALLERTLGEHGRSLAELDALASIDGKRDEPGLR -----------------2222-------------1111-3333----------------- 
QLATLLERPVHFLAPAVLHDYEPRLLSPSAVALRETGCSSVAEAAALALAERLGGGRADL -------------333333331111----------------------------------- LGAKRSDDRASIALARLLTER ------1111----------- >Trafficking protein A; SWP:Q5F881; PDB:2BSQE; ASVVIRNLSEATHNAIKFRARAAGRSTEAEIRLILDNIAKAQQTVRLGSMLASIGQEIGG -------------------------------------------------------1111- VELEDVRGR --------- >PURINE NUCLEOSIDE PHOSPHO; SWP:Q8T9Z7; PDB:2BSXA; MDNLLRHLKISKEQITPVVLVVGDPGRVDKIKVVCDSYVDLAYNREYKSVECHYKGQKFL ----------1111---------1111---3333---------!!!!------iiii--- CVSHGVGSAGCAVCFEELCQNGAKVIIRAGSCGSLQPDLIKRGDICICNAAVREDRVSHL -----------------3333-------------------2222-----------3333- LIHGDFPAVGDFDVYDTLNKCAQELNVPVFNGISVSSDMYYPNKIIPSRLEDYSKANAAV ------------------------------------------------3333-1111--- VEMELATLMVIGTLRKVKTGGILIVDGCPFKWDELVPHQLENMIKIALGACAKLATKYAL ---------------------------1111---3333---------------------- E - >DEOXYURIDINE 5'-TRIPHOSPH; SWP:P03195; PDB:2BSYA; CPHIRYAFQNDKLLLQQASVGRLTLVNKTTILLRPMKTTTVDLGLYARPPEGHGLMLWGS ---------1111-----iiii---------------------------2222------- TSRPVTSHVGIIDPGYTGELRLILQNQRRYNSTLRPSELKIHLAAFRYATPQMGPINHPQ ------------1111------------------2222---------------------- YPGDVGLDVSLPKDLALFPHQTVSVTLTVPPPSIPHHRPTIFGRSGLAMQGILVKPCRWR 2222-----------------------------2222----------------------1 RGGVDVSLTNFSDQTVFLNKYRRFCQLVYLHKHHLTSFYSPHSDAGVLGPRSLFRWASCT 111---------------2222--------3333-----1111-----3333-------- FEEVPSLAM ---3333-- >ARYLAMINE N-ACETYLTRANSFE; SWP:Q98D42; PDB:2BSZA; PFDLDAYLARIGYTGPRNASLDTLKALHFAHPQAIPFENIDPFLGRPVRLDLAALQDKIV -----------------------------------------1111--------------1 LGGRGGYCFEHNLLFMHALKALGFEVGGLAARVLWGQSEDAITARSHMLLRVELDGRTYI 111----------------1111------------------------------iiii--- ADVGFGGLTLTAPLLLEPGREQKTPHEPFRIVEADDHFRLQAAIGGDWRSLYRFDLQPQY ----3333-----------------------------------iiii------------3 EVDYSVTNYFLSTSPTSHFLSSVIAARAAPDRRYALRGNRLSIHHLTEQTEIATAADLAD 333----------11111111-------1111----!!!!-------------------- 
TLQGLLGIIIPDRTAFEAKVRETKIVE ---1111-------------1111--- >ADRENODOXIN 1; SWP:P00257; PDB:2BT6A; GDKITVHFINRDGETLTTKGKIGDSLLDVVVQNNLDIDGFGACEGTLACSTCHLIFEQHI ---------1111-------2222------1111--22221111-----1111---3333 FEKLEAITDEENDMLDLAYGLTDRSRLGCQICLTKAMDNMTVRVP 1111----------1111--------3333---3333-------- >LECTIN; SWP:Q8XXK6; PDB:2BT9A; SSVQTAATSWGTVPSIRVYTANNGKITERCWDGKGWYTGAFNEPGDNVSVTSWLVGSAIH ---------------------iiii-----------------------------!!!!-- IRVYASTGTTTTEWCWDGNGWTKGAYTSTN ------!!!!-------------------- >Trypsin inhibitor 3; SWP:P10293; PDB:2BTCI; RVCPKILMECKKDSDCLAECICLEHGYCG -----------3333-!!!!--1111--- >PTS-DEPENDENT DIHYDROXYAC; SWP:P76014; PDB:2BTDA; SLSRTQIVNWLTRCGDIFSTESEYLTGLDREIGDADHGLNMNRGFSKVVEKLPAIADKDI ------------------------------------------------------1111-- GFILKNTGMTLLSSVGGASGPLFGTFFIRAAQATQARQSLTLEELYQMFRDGADGVISRG ------------------3333-------------------------------------- KAEPGDKTMCDVWVPVVESLRQSSEQNLSVPVALEAASSIAESAAQSTITMQARKGRASY --2222-3333------------1111--------------------1111----3333- LGERSIGHQDPGATSVMFMMQMLALAAKE !!!!2222----------------1111- >DIHYDROLIPOYLLYSINE-RESID; SWP:P0AFG6; PDB:2BTGA; QNNDALSPAIRRLLAEHNLDASAIKGTGVGGRLTREDVEKWLAKA -------------------3333----------3333-------- >CARBON STORAGE REGULATOR ; SWP:Q47620; PDB:2BTIA; LILTRRVGETLIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSQP -----2222--------------!!!!-------3333---3333-----1111-- >TRIOSEPHOSPHATE ISOMERASE; SWP:P00943; PDB:2BTMA; RKPIIAGNWKMNGTLAEAVQFVEDVKGHVPPADEVISVVCAPFLFLDRLVQAADGTDLKI ------------------------1111--3333-------3333-------2222---- GAQTMHFADQGAYTGEVSPVMLKDLGVTYVILGHSERRQMFAETDETVNKKVLAAFTRGL ------------2222------------------3333-----------------1111- IPIICCGESLEEREAGQTNAVVASQVEKALAGLTPEQVKQAVIAYEPIWAIGTGKSSTPE ------------1111-3333--------222233331111-----3333---------- DANSVCGHIRSVVSRLFGPEAAEAIRIQYGGSVKPDNIRDFLAQQQIDGALVGGASLEPA ------------------3333-----------1111---3333--------1111---- SFLQLVEAGRH ------3333- >TUBULIN BTUBA; 
SWP:Q8GCC5; PDB:2BTOA; VNNTIVVSIGQAGNQIAASFWKTVCLEHGIDPLTGQTAPGVAPRGNWSSFFSKLGESGSY ---------------------------------------------3333----------- VPRAIMVDLEPSVIDNVKATSGSLFNPANLISRTEGAGGNFAVGYLGAGREVLPEVMSRL --------------------!!!!-3333-------%%%%3333--3333---------- DYEIDKCDNVGGIIVLHAIGGGTGSGFGALLIESLKEKYGEIPVLSCAVLPSPQVSSVVT ----------------------3333---------------------------------3 EPYNTVFALNTLRRSADACLIFDNEALFDLAHRKWNIESPTVDDLNLLITEALAGITASM 333-------------------------------------3333---------------- RFEISLRELLTNLVPQPSLHFLMCAFAPLTPPDELGIEEMIKSLFDNGSVFAACSPMEGR ---------------1111----------------3333------1111-----3333-- FLSTAVLYRGIPLADAALAAMREKLPLTYWIPTAFKIGYVEQPGISHRKSMVLLANNTEI -----------3333------1111------------------3333------------- ARVLDRICHNFDKLWQRKAFANWYLNEGMSEEQINVLRASAQELVQSYQVAEE --------------1111-3333------------------------------ >14-3-3 PROTEIN TAU; SWP:NA; PDB:2BTPA; EKTELIQKAKLAEQAERYDDMATCMKAVTEQGAELSNEERNLLSVAYKNVVGGRRSAWRV ----------------------------3333---------------------------- ISSIEQKTKKLQLIKDYREKVESELRSICTTVLELLDKYLIANATNPESKVFYLKMKGDY ---------------------------------------3333----------------- FRYLAEVACGDDRKQTIDNSQGAYQEAFDISKKEMQPTHPIRLGLALNFSVFYYEILNNP ----1111!!!!-----------------------1111--------------------- ELACTLAKTAFDEAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTS ----------------3333------------------------- >Tubulin BtubB [Fragment]; SWP:Q8GCC1; PDB:2BTQB; REILSIHVGQCGNQIADSFWRLALREHGLTEAGTLKESNMEVFFHKVRDGKYVPRAVLVD -----------------------------1111--------------------------- LEPQLFDESSIVRKIPGAANNWARGYNVEGEKVIDQIMNVIDSAVEKTKGLQGFLMTHSI -----------------!!!!--------------------------------------- GGGSGSGLGSLILERLRQAYPKKRIFTFSVVPSPLISDSAVEPYNAILTLQRILDNADGA -------------------1111---------1111--1111------------------ VLLDNEALFRIAKAKLNRSPNYMDLNNIIALIVSSVTASLRFPGKLNTDLSEFVTNLVPF ---3333------------------------------1111-------3333-------- PGNHFLTASFAPMNFPDLARETFAQDNFTAAIDWQQGVYLAASALFRGDDVDENMATIRK 
--------------3333------1111---------------------------3333- SLNYASYMPASGGLKLGYAETAPEGFASSGLALVNHTGIAAVFERLIAQFDIMFDNHAYT ----3333---------------------------------------------1111-33 HWYENAGVSRDMMAKARNQIATLAQSYRDAS 331111---------------------3333 >PHOSPHORIBOSYL-AMINOIMIDA; SWP:Q81ZH0; PDB:2BTUA; EAGYEAVSRMKKHVQTTMRKEVLGGFGGMFDLSKFALEEPVLVSGTDGVGTKLMLAFMAD ------------3333--3333---------------------------3333------- KHDTIGIDAVAMCVNDIVVQGAEPLFFLDYIACGKAEPSKIENIVKGISEGCRQAGCALI ---3333----------1111-----------------3333------------------ GGETAEMPGMYSTEEYDLAGFTVGIVDKKKIVTGEKIEAGHVLIGLASSGIHSNGYSLVR --------------------------1111---1111----------------------- KVLLEDGELIYGRLELPLGEELLKPTKIYVKPILELLKNHEVYGMAHITGGGFIENIPRM -----------------------------------3333---------2222----3333 LPEGIGAEIELGSWKIQPIFSLLQEVGKLEEKEMFNIFNMGIGMVVAVKEEDAKDIVRLL -2222----1111-----------1111-3333-----iiii------3333-------- EEQGETARIIGRTVQGAGVTFN 3333------------------ >VP3 CORE PROTEIN; SWP:P56582; PDB:2BTVA; VDFTVPDVQQILDDIKALAAEQVYKIVKVPSTSFRHIVTQSRDRVLRVDTYYEEMSQVGD ----------------3333-----------------------------3333------- VITEDEPEKFYSTIIKKVRFIRGKGSFILHDIPARDHRGMEVAEPEVLGVEFKNVLPVLT ----------------------1111----------%%%%---3333-----3333---- AEHRAMIQNALDGSIIENGNVATRDVDVFIGACSEPIYRIYNRLQGYIEAVQLQELRNSI ----------------------------------3333---------------------- GWLERLGQRKRITYSQEVLTDFRRQDMIWVLALQLPVNPQVVWDVPRSSIANLIMNIATC -------1111----3333---3333-------------3333-2222------------ LPTGEYIAPNPRISSITLTQRITTTGPFAILTGSTPTAQQLNDVRKIYLALMFPGQIILD ---------3333--------------3333----------------------------- LKIDPGERMDPAVRMVAGVVGHLLFTAGGRFTNLTQNMARQLDIALNDYLLYMYNTRVQV -----3333--------------------------------------------------- NYGPTGEPLDFQIGRNQYDCNVFRADFATGTGYNGWATIDVEYRDPAPYVHAQRYIRYCG -------------------1111--3333----------------------------iii IDSRELINPTTYGIGMTYHCYNEMLRMLVAAGKDSEAAYFRSMLPFHMVRFARINQIINE i--------------------------------3333----------------------- 
DLHSVFSLPDDMFNALLPDLIAGAHQNADPVVLDVSWISLWFAFNRSFEPTHRNEMLEIA --------3333-----------------------3333--------------1111--- PLIESVYASELSVMKVDMRHLSLMQRRFPDVLIQARPSHFWKAVLNDSPEAVKAVMNLSH ----------------------3333---------3333--------------------- SHNFINIRDMMRWVLLPSLQPSLKLVLEEEAWAAANDFEDLMLTDQVYMHRDMLPEPRLD -----3333----------------------------1111------------------- DIERFRQEGFYYTNMLEAPPEIDRVVQYTYEIARLQANMGQFRAALRRIMDDDDWVRFGG 3333----------------1111------------1111---------1111------- VLRTVRVKFFDARPPDDILQGLPFSYDTNEKGGLSYATIKYATETTIFYLIYNVEFSNTP --------------3333------------------------------------111133 DSLVLINPTYTMTKVFINKRIVERVRVGQILAVLNRRFVAYKGKMRIMDITQSLKMGTKL 33------------------------11113333-------1111----1111------- AAPTV ----- >ALR0975 PROTEIN; SWP:Q8YY76; PDB:2BTWA; LSPNLIGFNSNEGEKLLLTSRSREDFFPLSQFVTQVNQAYCGVASIIVLNSLGINAPETA -3333-----------1111--33333333------3333---------1111-----33 QYSPYRVFTQDNFFSNEKTKAVIAPEVVARQGTLDELGRLIASYGVKVKVNHASDTNIED 33------3333---3333---------------------3333-------3333----- FRKQVAENLKQDGNFVIVNYLRKEIGQERGGHISPLAAYNEQTDRFLIDVSRYKYPPVWV -------1111---------3333----------------1111------3333------ KTTDLWKANTVDSVSQKTRGFVFVS ------------1111--------- >ACETYLGLUTAMATE KINASE; SWP:Q9X2A4; PDB:2BTYA; MRIDTVNVLLEALPYIKEFYGKTFVIKFGGSAMKQENAKKAFIQDIILLKYTGIKPIIVH ------------------2222-------3333-3333---------------------- GGGPAISQMMKDLGIEPVFKNGHRVTDEKTMEIVEMVLVGKINKEIVMNLNLHGGRAVGI ----------1111-----------------------------------1111------- CGKDSKLIVAEKETKHGDIGYVGKVKKVNPEILHALIENDYIPVIAPVGIGEDGHSYNIN --2222------------------------------1111----------3333------ ADTAAAEIAKSLMAEKLILLTDVDGVLKDGKLISTLTPDEAEELIRDGTVTGGMIPKVEC ---------------------------iiii-----3333----3333--!!!!------ AVSAVRGGVGAVHIINGGLEHAILLEIFSRKGIGTMIKELEG ----1111-------1111----------------------- >PYRUVATE DEHYDROGENASE KI; SWP:Q15119; PDB:2BTZA; GSAPKYIEHFSKFSPSPLSMKQFLDFGSSNACEKTSFTFLRQELPVRLANIMKEINLLPD ---------1111---------------------------------------------33 
RVLSTPSVQLVQSWYVQSLLDIMEFLDKDPEDHRTLSQFTDALVTIRNRHNDVVPTMAQG 33-------------------3333----------3333--------1111--------- VLEYKDTYGDDPVSNQNIQYFLDRFYLSRISIRMLINQHTLIFDKHIGSIDPNCNVSEVV ----------------------------------------------!!!!----3333-- KDAYDMAKLLCDKYYMASPDLEIQEINAANSKQPIHMVYVPSHLYHMLFELFKNAMRATV --------------------------1111---------3333----------------1 ESHESSLILPPIKVMVALGEEDLSIKMSDRGGGVPLRKIERLFSYMYSTAPGYGLPISRL 111-------------------------------33333333------------------ YAKYFQGDLQLFSMEGFGTDAVIYLKALSTDSVERLPVYNKSAWRHYQTIQEAGDWCV -------------2222----------3333--------33331111----------- >ALR0975 PROTEIN; SWP:Q8YY76; PDB:2BU3A; LSPNLIGFNSNEGEKLLLTSRSREDFFPLSQFVTQVNQAYCGVASIIVLNSLGINAYRVF -1111-----------------33333333------1111---------1111------- TQDNFFSTKAVIAPEVVARQGTLDELGRLIASYGVKVKVNHASDTNIEDFRKQVAENLKQ 3333---3333-3333--------------1111------3333------------1111 DGNFVIVNYLRKEIGQERGGHISPLAAYNEQTDRFLIDVSRYKYPPVWVKTTDLWKANTV ---------3333----------------1111------3333----------------- DSVSQKTRGFVFVSK -1111---------- >MALES-ABSENT ON THE FIRST; SWP:O02193; PDB:2BUDA; GSHMDPLMQKIDISENPDKIYFIRREDGTVHRGQVLQSRTTENAAAPDEYYVHYVGLNRR ---------------3333-----3333------------3333-------------333 LDGWVGRHRISDNADDLGGITVLPAPPLAPDQ 3----3333---3333---------------- >ACETYLGLUTAMATE KINASE; SWP:Q9HTN2; PDB:2BUFA; TLSRDDAAQVAKVLSEALPYIRRFVGKTLVIKYGGNAMESEELKAGFARDVVLMKAVGIN --------------------3333------------11111111---------------- PVVVHGGGPQIGDLLKRLSIESHFIDGMRVTDAATMDVVEMVLGGQVNKDIVNLINRHGG -------------------------------3333------------------------- SAIGLTGKDAELIRAKKLTVTRQIIDIGHVGEVTGVNVGLLNMLVKGDFIPVIAPIGVGS -----1111-------------------------------------------------11 NGESYNINADLVAGKVAEALKAEKLMLLTNIAGLMDKQGQVLTGLSTEQVNELIADGTIY 11-----3333------------------------1111--------------3333--- GGMLPKIRCALEAVQGGVTSAHIIDGRVPNAVLLEIFTDSGVGTLISNRKRH 3333--------------------3333-3333------------------- >SERINE/THREONINE-PROTEIN ; SWP:O75716; PDB:2BUJA; 
NLYFQGHMVIIDNKHYLFIQKLGEFSYVDLVEGLHDGHFYALKRILCHEQQDREEAQREA ----------%%%%----------------------------------3333-------- DMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLKDKGNFLTEDQIL -------1111----------!!!!---------1111--------3333---------- WLLLGICRGLEAIHAKGYAHRDLKPTNILLGDEGQPVLMDLGSMNQACIHVEGSRQALTL -----------------------1111---1111-------------------------- QDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQKGDSVA --------1111-3333-----------------------------------1111-333 LAVQNQIPQSPRHSSALWQLLNSMMTVDPHQRPHIPLLLSQLEALQPPAPG 33333----------------------3333----------3333------ >COAT PROTEIN; SWP:P03606; PDB:2BUKA; TMRAVKRMINTHLEHKRFALINSGNTNATAGTVQNLSNGIIQGDDINQRSGDQVRIVSHK --3333--1111---------------1111-----3333-------------------- LHVRGTAITVSQTFRFIWFRDNMNRGTTPTVLEVLNTANFMSQYNPITLQQKRFTILKDV -----------------------------1111-----1111---1111----------- TLNCSLTGESIKDRIINLPGQLVNYNGATAVAASNGPGAIFMLQIGDSLVGLWDSSYEAV ------------------------------3333-------------------------- YTDA ---- >PROTOCATECHUATE 3,4-DIOXY; SWP:P20371; PDB:2BURA; ELKETPSQTGGPYVHIGLLPKQANIEVFEHNLDNNLVQDNTQGQRIRLEGQVFDGLGLPL ----------1111----3333---------------1111------------1111--- RDVLIEIWQADTNGVYPSQADTQGKQVDPNFLGWGRTGADFGTGFWSFNTIKPGAVPGRK ----------1111---1111-------------------------------------%% GSTQAPHISLIIFARGINIGLHTRVYFDDEAEANAKDPVLNSIEWATRRQTLVAKREERD %%-----------2222---------1111-------3333---3333-1111------- GEVVYRFDIRIQGENETVFFDI ---------------------- >Protocatechuate 3,4-dioxy; SWP:P20372; PDB:2BURB; IIWGAYAQRNTEDHPPAYAPGYKTSVLRSPKNALISIAETLSEVTAPHFSADKFGPKDND ---------1111-----33333333-------------3333------3333-1111-- LILNYAKDGLPIGERVIVHGYVRDQFGRPVKNALVEVWQANASGRYRHPNDQYIGAMDPN -----------------------1111--------------------1111------111 FGGCGRMLTDDNGYYVFRTIKPGPYPWRNRINEWRPAHIHFSLIADGWAQRLISQFYFEG 1--------1111---------------------------------3333-------222 DTLIDSCPILKTIPSEQQRRALIALEDKSNFIEADSRCYRFDITLRGRRATYFENDLT 
23333--------------1111---3333---------------------------- >REGULATOR OF G-PROTEIN SI; SWP:Q08116; PDB:2BV1A; KDVLSAAEVMQWSQSLEKLLANQTGQNVFGSFLKSEFSEENIEFWLACEDYKKTESDLLP ----------1111-------------------1111------------3333-3333-- CKAEEIYKAFVHSDAAKQINIDFRTRESTAKKIKAPTPTCFDEAQKVIYTLMEKDSYPRF -----------1111--------------3333---1111---------------3333- LKSDIYLNLLNDLQ ----------3333 >CIONA BETAGAMMA-CRYSTALLI; SWP:NA; PDB:2BV2A; GKIILFEDVEFGGKKLELETSVSDLNVHGFNDIVSSIIVESGTWFVFDDEGFSGPSYKLT --------%%%%-------------1111--------------------%%%%------- PGKYPNPGSWGGNDDELSSVKQQ -----3333-------------- >LECTIN CV-IIL; SWP:Q7NX84; PDB:2BV4A; AQQGVFTLPARINFGVTVLVNSAATQHVEIFVDNEPRAAFSGVGTGDNNLGTKVINSGSG -------------------------------%%%%---------------------!!!! NVRVQITANGRQSDLVSSQLVLANKLNLAVVGSEDGTDMDYNDSIVILNWPLG -------iiii----------%%%%---------------------------- >Tyrosine-protein phosphat; SWP:P54829; PDB:2BV5A; SPSRVLQAEELHEKALDPFLLQAEFFEIPMNFVDPKEYDIPGLVRKNRYKTILPNPHSRV -----------------------1111------3333--22221111-1111--3333-- CLTSPDPDDPLSSYINANYIRGYGGEEKVYIATQGPIVSTVADFWRMVWQEHTPIIVMIT -----1111-1111-------2222-----------1111-------------------- NIEEMNEKCTEYWPEEQVAYDGVEITVQKVIHTEDYRLRLISLKSGTEERGLKHYWFTSW 3333---------------iiii---------1111--------!!!!------------ PDQKTPDRAPPLLHLVREVEEAAQQEGPHCAPIIVHSAGIGRTGCFIATSICCQQLRQEG ----3333---------------1111--------------------------------- VVDILKTTCQLRQDRGGMIQTCEQYQFVHHVMSLYEKQLSH --------------2222----------------------- >HTH-TYPE TRANSCRIPTIONAL ; SWP:P0C1S0; PDB:2BV6A; NLKEQLCFSLYNAQRQVNRYYSNKVFKKYNLTYPQFLVLTILWDESPVNVKKVVTELALD 3333-3333-----------------1111------------------3333-------- TGTVSPLLKREQVDLIKRERSEVDQREVFIHLTDKSETIRPELSNASDKVASASSLSQDE ----------1111---------1111-----------33331111-----1111-3333 VKELNRLLGKVIHAF --------------- >C-PHYCOCYANIN ALPHA SUBUN; SWP:Q6B8L6; PDB:2BV8A; MKTPITEAIASADSQGRFLSNGELQSINGRYQRATASLEAARSLTSNAERLISGAAQSVY ------------1111-------------------------------------------- 
SKFPYTTQMQGPNYAADATGKAKCARDIGYYLRMVTYCLVVGATGPMDEYLIAGLSEINR ---3333---3333-------------------------------------2222----1 SFELSPSWYIEALEYIKDSHALSGQAANEANTYLDYAINALS 111-3333------------------------------1111 >C-phycocyanin beta subuni; SWP:Q6B8L7; PDB:2BV8B; MLDAFAKVVAQADARGEFLSNTQLDALANMIAEGNKRLDIVNRINSNASAIVSNSARALF ------------1111-------------------------------------------- AEQPQLIQPGGAYTNRRMAACLRDMEIVLRYVSYAEIAGDSSVLDDRCLNGLRETYQALG --3333-2222-------------------------------------2222-------- TPGSSVAVAIEKMKEASVSDANDSSGTPSGDCSSLSAELGTYFDRAASAVS -3333------------------2222--------------------1111 ------------------------------------------------------------ ------------------------------------------------------------ ----------------- >GLUTAMINE SYNTHETASE 1; SWP:P0A590; PDB:2BVCA; KTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDKSVFDDGLAFDGSSIRGFQS -3333-----1111---------3333-------3333--3333------11112222-1 IHESDMLLLPDPETARIDPFRAAKTLNINFFVHDPFTLEPYSRDPRNIARKAENYLISTG 111-------3333--------------------------1111------------3333 IADTAYFGAEAEFYIFDSVSFDSRANGSFYEVDAISGWWNTGAATEADGSPNRGYKVRHK -----------------------1111------11111111----1111--------222 GGYFPVAPNDQYVDLRDKMLTNLINSGFILEKGHHEVGSGGQAEINYQFNSLLHAADDMQ 2----------------------1111--------------------------------- LYKYIIKNTAWQNGKTVTFMPKPLFGDNGSGMHCHQSLWKDGAPLMYDETGYAGLSDTAR ----------1111-------------------------iiii----1111%%%%----- HYIGGLLHHAPSLLAFTNPTVNSYKRLVPGYEAPINLVYSQRNRSACVRIPITGSNPKAK --------3333-------33333333----------------------------3333- RLEFRSPDSSGNPYLAFSAMLMAGLDGIKNKIEPQAPVDKDLYELPPEEAASIPQTPTQL -----------3333-------------------------3333-----1111-----33 SDVIDRLEADHEYLTEGGVFTNDLIETWISFKRENEIEPVNIRPHPYEFALYYDV 33--------33332222--3333--------------------3333---1111 >6-HYDROXY-D-NICOTINE OXID; SWP:Q8GAG1; PDB:2BVFA; KLATPLSIQGEVIYPDDSGFDAIANIWDGRHLQRPSLIARCLSAGDVAKSVRYACDNGLE -------------1111------------------------------------------- ISVRSGGHNPNGYATNDGGIVLDLRLMNSIHIDTAGSRARIGGGVISGDLVKEAAKFGLA 
-----------3333--------1111------1111----11113333----------- AVTGMHPKVGFCGLALNGGVGFLTPKYGLASDNILGATLVTATGDVIYCSDDERPELFWA -----11113333-------1111----3333--------1111-----1111------- VRGAGPNFGVVTEVEVQLYELPRKMLAGFITWAPSVSELAGLLTSLLDALNEMADHIYPS ---3333---------------------------1111------------1111------ VFVGVDENRAPSVTVCVGHLGGLDIAERDIARLRGLGRTVSDSIAVRSYDEVVALNAEVG -----1111-----------------------1111------------------------ SFEDGMSNLWIDREIAMPNARFAEAIAGNLDKFVSEPASGGSVKLEIEGMPFGNPKRTPA ----------------------------3333----1111-------------1111--- RHRDAMGVLALAEWSGAAPGSEKYPELARELDAALLRAGVTTSGFGLLNNNSEVTAEMVA --------------3333----------------------------3333---------- EVYKPEVYSRLAAVKREYDPENRFRHNYNIDPE ------------------1111----------- >TOXIN B; SWP:P18177; PDB:2BVLA; MSLVNRKQLEKMANVRFRTQEDEYVAILDALEEYHNMSENTVVEKYLKLKDINSLTDICI --------------2222--------------11111111-------------------- DTYKKSGRNKALKKFKEYLVTEVLELKNNNLTPVEKNLHFVWIGGQINDTAINYINQWKD --1111------------------------------------------------------ VNSDYNVNVFYDSNAFLINTLKKTVVESAINDTLESFRENLNDPRFDYNKFFRKRMEIIY -1111------1111-----------------------3333------------------ DKQKNFINYYKAQREENPELIIDDIVKTYLSNEYSKEIDELNTYIEESLNKITQNSGNDV ----------------3333--------------------------------------11 RNFGEFKNGESFNLYEQELVERWNLAAASDILRISALKEIGGMYLDVDMLPGIQPDLFES 11------1111---------------------------------1111----3333111 IEKPSSVTVDFWEMTKLEAIMKYKEYIPEYTSEHFDMLDEEVQSSFESVLASKSDKSEIF 1--3333-------------------2222-3333-------------------3333-- SSLGDMEASPLEVKIAFNSKGIINQGLISVKDSYCSNLIVKQIENRYKILNNSLNPAISE --------1111-----1111--------2222--------------------------- DNDFNTTTNTFIDSIMAEANADNGRFMMELGKYLRVGFFPDVKTTINLSGPEAYAAAYQD -------------------1111------1111-2222----3333-------------- LLMFKEGSMNIHLIEADLRNFEISKTNISQSTEQEMASLWSFDDARAKAQFEEYKRNYFE -------------3333-1111-3333----3333------------------------- GAL --- >BETA-2-MICROGLOBULIN; SWP:P18465; PDB:2BVPA; GSHSMRYFYTAMSRPGRGEPRFIAVGYVDDTQFVRFDSDAASPRMAPRAPWIEQEGPEYW 
-------------2222----------!!!!-----1111--------1111-------- DGETRNMKASAQTYRENLRIALRYYNQSEAGSHIIQVMYGCDVGPDGRLLRGHNQYAYDG ---------------------------1111------------1111----------iii KDYIALNEDLSSWTAADTAAQITQRKWEAARVAEQLRAYLEGLCVEWLRRYLENGKETLQ i-----3333-----------------------------------------------111 RADPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 1-------------------------------------%%%%-3333------------- FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRW ---------22221111-----1111-------- >Prothrombin [Precursor]; SWP:P00734; PDB:2BVRH; IVEGSDAEIGMSPWQVMLFRKSPQELLCGASLISDRWVLTAAHCLLYPPWDKNFTENDLL -------22221111-------------------------3333--3333----1111-- VRIGKHSRTRYERNIEKISMLEKIYIHPRYNWRENLDRDIALMKLKKPVAFSDYIHPVCL -----------2222-----------1111---------------------1111----- PDRETAASLLQAGYKGRVTGWGNLKEKGQPSVLQVVNLPIVERPVCKDSTRIRITDNMFC ----------------------------------------------1111----1111-- AGYKPDEGKRGDACEGDSGGPFVMKSPFNNRWYQMGIVSWGEGCDRDGKYGFYTHVFRLK ---3333------2222----------------------------2222-----3333-- KWIQKVIDQFG ----------- >BETA-1,4-MANNANASE; SWP:Q9XCV5; PDB:2BVYA; TIAIVDADATAETRSLLSYLDGVRGEGILFGHQHTTSFGLTTGPTDGTTSDVKNVTGDFP --------------------3333------------------------------------ AVFGWDTLIIEGNERPGLAENTRDENIALFADYIRKADAIGGVNTVSAHVENFVTGGSFY -----------------3333------------------------------------111 DTSGDTLRAVLPGGSHHAELVAYLDDIAELADASRRDDGTLIPIVFRPWHENAGSWFWWG 1---3333--2222---------------------1111-----------------1111 AAYGSPGEYQELYRFTVEYLRDVKGVSNFLYAWGPGGGFGGNRDVYLRTYPGDAFVDVLG --------------------------------------iiii----1111-1111----- LDTYDSTGSDAFLAGLVADLRMIAEIADEKGKVSAFTEFGVSGGVGTNGSSPAQWFTKVL ----------------------------------------2222-1111----------- AAIKADPVASRNAYMETWANFDAGQHFVPVPGDALLEDFQAYAADPFTLFASEVTGAFDR -------1111----------3333-------1111--------3333-3333--1111- TVAAAPAQPVVHIASPADGARVASAPTTVRVRVGGTDVQSVTVEVAQVVDTLDLAYDGAL ----------------2222---------------------------------------- WWTAPWSPYTVTATATTAAGTLDVTNEVAAAL 
----------------1111------------ >10-FORMYLTETRAHYDROFOLATE; SWP:NA; PDB:2BW0A; MKIAVIGQSLFGQEVYCHLRKEGHEVVGVFTVPDKDGKADPLGLEAEKDGVPVFKYSRWR ----------------------------------------------1111---------- AKGQALPDVVAKYQALGAELNVLPFCSQFIPMEIISAPRHGSIIYHPSLLPRHRGASAIN iiii---------3333-------------3333---1111------------------- WTLIHGDKKGGFSIFWADDGLDTGDLLLQKECEVLPDDTVSTLYNRFLFPEGIKGMVQAV --1111----------------------------1111---------------------- RLIAEGKAPRLPQPEEGATYEGIQKKETAKINWDQPAEAIHNWIRGNDKVPGAWTEACEQ --1111--------2222------3333----------------------------%%%% KLTFFNSTLNTSGLVPEGDALPIPGAHRPGVVTKAGLILFGNDDKMLLVKNIQLEDGKMI ----------2222--------2222------1111-----------------1111--- LASNFFK 3333--- >BYPASS OF FORESPORE C; SWP:O05391; PDB:2BW2A; AEVEHYEPLQVHVQLEKVYLDGDVSIEHKHEKVFSMDDFWAAYAGWTLVEQKKGYVLFRK ------------------3333--------------------3333-------------- QMDDISPLSKVNGYIGVSDNGVISTFHGRPEPASEPIQSFFQIDLERLESHMQKNLLKGI -----3333--------1111---------1111---------3333-1111-------- PFRTKAEFEDVIEHMKTYSG ---3333------------- >TRANSPOSASE; SWP:Q25438; PDB:2BW3A; SHQSRELKTVSADCKKEAIEKCAQWVVRDCRPFSAVSGSGFIDIKFFIKVKAEYGEHVNV -------------------------------3333---3333----------------33 EELLPSPITLSRKVTSDAKEKKALIGREIKSAVEKDGASATIDLWTDNYIKRNFLGVTLH 33------------------------------1111------------------------ YHENNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKFVTDRGANVVKSL --!!!!-----------1111--------------1111---1111-----------111 ANNIRINCSSHLLSNVLENSFEETPELNPILACKNIVKYFKKANLQHRLRSSLKSECPTR 1----------------------3333-------------11113333---------333 WNSTYTLRSILDNWESVIQILSEAGETQRIVHINKSIIQTVNILDGFERIFKELQTCSSP 3-3333---3333--------1111--1111----------------------------- SLCFVVPSILKVKEICSPDVGDVADIAKLKVNIIKNVRIIWEENLSIWHYTAFFFYPPAL 3333---------1111-1111------------------3333-3333------3333- HQQEKVAQIKEFCLSKEDLELINRSSFNELSATQLNQDISTTSFFFPQLTQNNSREPPVC ------------------------1111-----------3333----------------- PSDEFEFYRKEIVILSEDFKVEWWNLNSKKYPKLSKLALSLLSIPASSAASERTFSLAGN 
-------1111----1111--------3333---------------1111---------- IITEKRNRIGQQTVDSLLFLNSFYKNFCK ---3333---------------------- >COPPER-CONTAINING NITRITE; SWP:P25006; PDB:2BW4A; VDISTLPRVKVDLVKPPFVHAHDQVAKTGPRVVEFTMTIEEKKLVIDREGTEIHAMTFNG -3333-----------------------------------------1111-------iii SVPGPLMVVHENDYVELRLINPDTNTLLHNIDFHAATGALGGGALTQVNPGEETTLRFKA i--------2222--------1111-------3333-%%%%3333---2222-------- TKPGVFVYHCAPEGMVPWHVTSGMNGAIMVLPRDGLKDEKGQPLTYDKIYYVGEQDFYVP -----------2222----------------1111--1111------------------- KDEAGNYKKYETPGEAYEDAVKAMRTLTPTHIVFNGAVGALTGDHALTAAVGERVLVVHS -1111------3333--------3333------iiii----!!!!----2222------- QANRDTRPHLIGGHGDYVWATGKFRNPPDLDQETWLIPGGTAGAAFYTFRQPGVYAYVNH ---------2222-----11111111-----------2222------------------- NLIEAFELGAAGHFKVTGEWNDDLMTSVVKPASM --------------------3333---------- >ENDOGLUCANASE; SWP:O33897; PDB:2BW8A; TVELCGRWDARDVAGGRYRVINNVWGAETAQCIEVGLETGNFTITRADHDNGNNVAAYPA -----1111---%%%%----------------------------------!!!!------ IYFGCHWGACTSNSGLPRRVQELSDVRTSWTLTPITTGRWNAAYDIWFSPVTNSGNGYSG -----iiii---------3333-------------------------------1111222 GAELMIWLNWNGGVMPGGSRVATVELAGATWEVWYADWDWNYIAYRRTTPTTSVSELDLK 2------------------------iiii----------------------------333 AFIDDAVARGYIRPEWYLHAVETGFELWEGGAGLRSADFSVTVQKL 3-----1111--1111------------------------------ >UBIQUITIN-LIKE PROTEIN DS; SWP:NA; PDB:2BWBA; LDPEERYEHQLRQLNDMGFFDFDRNVAALRRSGGSVQGALDSLLNG -3333---------1111-------------iiii----------- >UBIQUITIN-LIKE PROTEIN DS; SWP:NA; PDB:2BWFA; DMSLNIHIKSGQDKWEVNVAPESTVLQFKEAINKANGIPVANQRLIYSGKILKDDQTVES ---------!!!!------1111---------------3333----iiii--11113333 YHIQDGHSVHLVKSQP ---2222--------- >ADENYLATE KINASE 5; SWP:Q9Y6K8; PDB:2BWJA; GFMEDLRKCKIIFIIGGPGSGKGTQCEKLVEKYGFTHLSTGELLREELASESERSKLIRD -----1111-------2222-------------------------------3333----- IMERGDLVPSGIVLELLKEAMVASLGDTRGFLIDGYPREVKQGEEFGRRIGDPQLVICMD -1111---3333-----------2222--------------------------------- 
CSADTMTNRLLQMSRSSLPVDDTTKTIAKRLEAYYRASIPVIAYYETKTQLHKINAEGTP ----------1111-----------------------------3333------------- EDVFLQLCTAIDSIFL ---------------- >ANGIOGENIN; SWP:P21570; PDB:2BWKA; DSRYTKFLTQHHDAKPKGRDDRYCERMMKRRSLTSPCKDVNTFIHGNKSNIKAICGANGS ----------------------------1111--------------3333-33331111- PYRENLRMSKSPFQVTTCKHTGGSPRPPCQYRASAGFRHVVIACENGLPVHFDESFFS --------------------------------------------iiii----3333-- >5-AMINOLEVULINATE SYNTHAS; SWP:P18079; PDB:2BWNA; DYNLALDKAIQKLHDEGRYRTFIDIEREKGAFPKAQWNRPDGGKQDITVWCGNDYLGMGQ ---------------------------2222-------1111-----------1111111 HPVVLAAMHEALEAVGAGSGGTRNISGTTAYHRRLEAEIAGLHQKEAALVFSSAYNANDA 13333----------------3333---3333---------------------------- TLSTLRVLFPGLIIYSDSLNHASMIEGIKRNAGPKRIFRHNDVAHLRELIAADDPAAPKL --3333--2222----11113333--------------2222-------33333333--- IAFESVYSMDGDFGPIKEICDIAEEFGALTYIDEVHAVGMYGPRGAGVAERDGLMHRIDI ------------------------------------2222-------------3333--- FNGTLAAYGVFGGYIAASARMVDAVRSYAPGFIFSTSLPPAIAAGAQASIAFLKTAEGQK ----------------------------3333-----------------------3333- LRDAQQMHAKVLKMRLKALGMPIIDHGSHIVPVVIGDPVHTKAVSDMLLSDYGVYVQPIN ----------------3333---------------------------------------- FPTVPRGTERLRFTPSPVHDLKQIDGLVHAMDLLW ----2222-------1111---------------- >REGULATING SYNAPTIC MEMBR; SWP:Q9JIS1; PDB:2BWQA; QFLSGQLSIKLWFDKVGHQLIVTILGAKDLPSREDGRPRNPYVKIYFLPDRSDKNKRRTK --------------------------------1111---------------3333----- TVKKTLEPKWNQTFIYSPVHRREFRERMLEITLWDQSEFLGEILIELETALLDDEPHWYK -------------------33331111------------------3333----------- LQ -- >PSATHYRELLA VELUTINA LECT; SWP:NA; PDB:2BWRA; SVVVISQALPVPTRIPGVADLVGFGNGGVYIIRNSLLIQVVKVINNFGYDAGGWRVEKHV -----3333---------------1111-------------------3333---1111-- RLLADTTGDNQSDVVGFGENGVWISTNNGNNTFVDPPKMVLANFAYAAGGWRVEKHIRFM --------------------------------------------1111---1111----- ADLRKTGRADIVGFGDGGIYISRNNGGGQFAPAQLALNNFGYAQGWRLDRHLRFLADVTG 
--------------1111-------iiii-----------3333--1111---------- DGLLDVVGFGENQVYIARNSGNGTFQPAQAVVNNFCIGAGGWTISAHPRVVADLTGDRKA ---------1111----------------------1111---1111-------------- DILGFGVAGVYTSLNNGNGTFGAVNLVLKDFGVNSGWRVEKHVRCVSSLTNKKVGDIIGF -----3333----------------------3333--1111--------3333------- GDAGVYVALNNGNGTFGPVKRVIDNFGYNQGWRVDKHPRFVVDLTGDGCADIVGFGENSV 1111----------------------3333--1111------------------------ WACMNKGDGTFGPIMKLIDDMTVSKGWTLQKTVRYAANLYL ---------------------1111--3333---------- >AMINOPEPTIDASE P; SWP:P15034; PDB:2BWVA; SEISRQEFQRRRQALVEQMQPGSAALIFAAPEVTRSADSEYPYRQNSDFWYFTGFNEPEA ---------------1111----------------!!!!--------------------- VLVLIKSDDTHNHSVLFNRVRDLTAEIWFGRRLGQDAAPEKLGVDRALAFSEINQQLYQL ----------------------------------1111-1111-----3333-----333 LNGLDVVYHAQGEYAYADVIVNSALEKLRKGSRQNLTAPATMIDWRPVVHEMRLFKSPEE 3--------2222-----------------3333-------------------------- IAVLRRAGEITAMAHTRAMEKCRPGMFEYHLEGEIHHEFNRHGARYPSYNTIVGSGENGC ----------------------22223333---------1111-----------!!!!-- ILHYTENEEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTQAQREIYDIVLESLETSL ----------2222---------iiii--------1111--------------------- RLYRPGTSILEVTGEVVRIMVSGLVKLGILKGDVDELIAQNAHRPFFMHGLSHWLGLDVA ---2222---------------------------------3333---------------- DVGVYGQDRSRILEPGMVLTVEPGLYIAPDAEVPEQYRGIGIRIEDDIVITETGNENLTA ------%%%%---2222----------1111--3333-------------1111----11 SVVKKPEEIEALMVAARKQ 11-------------1111 >XRP2 PROTEIN; SWP:O75695; PDB:2BX6A; KVDPKDYMFSGLKDETVGRLPGTVAGQQFLIQDCENCNIYIFDHSATVTIDDCTNCIIFL --3333-------------2222iiii--------------------------------- GPVKGSVFFRNCRDCKCTLACQQFRVRDCRKLEVFLCCATQPIIESSSNIKFGCFQWYYP ----------------------------------------------------------11 ELAFQFKDAGLSIFNNTWSNIHDFTPVLNWSLLPEDAVVQDYVPIPTTEELKAVRVSTEA 11----1111-1111-1111-------------11113333------3333-------11 NRSIVPISRGQRQKSSDESCLVVLFAGDYTIANARKLIDEMVGKGFFLVQTKEVSMKAED 11------!!!!--------------1111-----------1111--------------- 
AQRVFREKAPDFLPLLNKGPVIALEFNGDGAVEVCQLIVNEIFNGTKMFVSESKETASGD ----!!!!33331111-----------2222-----------2222------3333---- VDSFYNFADIQ ----------- >TRYPTOPHAN RNA-BINDING AT; SWP:O31466; PDB:2BX9A; MVIATDDLEVACPKCERAGEIEGTPCPACSGKGVILTAQGYTLLDFIQKHLNK ---1111-----1111------------------------------------- >AMINE OXIDASE [FLAVIN-CON; SWP:P21397; PDB:2BXRA; HMFDVVVIGGGISGLSAAKLLTEYGVSVLVLEARDRVGGRTYTIRNEHVDYVDVGGAYVG ---------------------1111------------!!!!----3333----------- PTQNRILRLSKELGIETYKVNVSERLVQYVKGKTYPFRGWNPIAYLDYNNLWRTIDNMGK ---3333----------------------------------3333------------333 EIPTDAPWEAQHADKWDKMTMKELIDKICWTKTARRFAYLFVNINVTSEPHEVSALWFLW 3-11111111--------------------------------------3333-------- YVKQCGGTTRIFSVGQERKFVGGSGQVSERIMDLLGDQVKLNHPVTHVDQSSDNIIIETL --------3333-------2222-----------!!!!--------------------11 NHEHYECKYVINAIPPTLTAKIHFRPELPAERNQLIQRLPMGAVIKCMMYYKEAFWKKKD 11------------33331111------------3333---------------3333--- YCGCMIIEDEDAPISITLDDTKPDGSLPAIMGFILARKADRLAKLHKEIRKKKICELYAK ---------------------1111----------------33333333----------- VLGSQEALHPVHYEEKNWCEEQYSGGCYTAYFPPGIMTQYGRVIRQPVGRIFFAGTETAT ---3333-------------1111--------2222---3333-----------1111-- KWSGYMEGAVEAGERAAREVLNGLG ------------------------- >NUCLEOCAPSID PROTEIN; SWP:P69598; PDB:2BXXA; HMSSGNASWFQAIKAKKLNTPPPKFEGSGVPDNENIKPSQQHGYWRRQARFKPGKGGRCP ------------------------------------3333-------------1111--- VPDAWYFYYTGTGPAADLNWGDTQDGIVWVAAKGADTKSRSNQGTRDPDKFDQYPLRFSD --------2222--33332222-2222----22223333-------3333-------111 GGPDGNFRWDFIPL 1--3333------- >BLUE LIGHT SENSING; SWP:NA; PDB:2BYCA; FMDELVSLTYRSRVRLADPVADIVQIMRASRVRNLRLGITGILLYNGVHFVQTIEGPRSA 1111-------------------------------------------------------- CDELFRLISADPRHQEILAFDLEPITARRFPDWSMRIVSRKELRALAPDLERLDLSGPED ----------1111---------------1111-------------1111------1111 VAELHRTIAASLSRGDA ------------1111- >CHANNEL ASSOCIATED PROTEI; SWP:Q15700; PDB:2BYGA; 
FQSMTVVEIKLFKGPKGLGFSIAGGVGNQHIPGDNSIYVTKIIDGGAAQKDGRLQVGDRL -------------1111-------2222--2222--------2222--------2222-- LMVNNYSLEEVTHEEAVAILKNTSEVVYLKVGKPTTIY --iiii-------------------------------- >CHRAC-14; SWP:Q9V452; PDB:2BYKA; MKSSMDTGLITNEVLFLMTKCTELFVRHLAGAAYTEEFGQRPGEALKYEHLSQVVNKNKN ---------------------------------3333!!!!-----3333-------333 LEFLLQIVPQKI 31111------- >CG13399-PA, isoform A; SWP:Q9V444; PDB:2BYKB; PNAVIGRLIKEALPESASVSKEARAAIARAASVFAIFVTSSSTALAHKQNHKTITAKDIL --3333-------1111--3333---------------------1111------------ QTLTELDFESFVPSLTQDLEVYRKVVKEK ---11113333------------------ >SOLUBLE ACETYLCHOLINE REC; SWP:Q8WSF8; PDB:2BYNA; ANLMRLKSDLFNRSPMYPGPTKDDPLTVTLGFTLQDIVKADSSTNEVDLVYYEQQRWKLN --------------------1111----------------------------------11 SLMWDPNEYGNITDFRTSAADIWTPDITAYSSTRPVQVLSPQIAVVTHDGSVMFIPAQRL 11--3333%%%%-----3333-------------------------1111---------- SFMCDPTGVDSEEGATCAVKFGSWVYSGFEIDLKTDTDQVDLSSYYASSKYEILSATQTR -----2222-3333------------1111-----------11111111----------- QVQHYSCCPEPYIDVNLVVKFRERR ----3333----------------- >LIPOPROTEIN LPPX; SWP:P65306; PDB:2BYOA; SDPALLAEIRQSLDATKGLTSVHVAVRTTGKVDSLLGITSADVDVRANPLAAKGVCTYND ---------------1111---------------iiii-------------------iii EQGVPFRVQGDNISVKLFDDWSNLGSISELSTSRVGVTQLLSGVTNLQAQGTEVIDGIST i-------!!!!----!!!!----------------------------------iiii-- TKITGTIPASSVKMLDPGAKSARPATVWIAQDGSHHLVRASIDLGSGSIQLTQSKWNEPV -------333333331111------------------------1111-------2222-- NVD --- >3-OXOACYL-[ACYL-CARRIER-P; SWP:P0A953; PDB:2BYWA; MKRVVITGLGIVSSIGNNQQEVLASLREGRSGITFSQELKDSGMRSHVWGNVKLDTTGLI ------------1111---------------------------------------2222- DRKVVRFMSDASIYAFLSMEQAIADAGLSPEAYQNNPRVGLIAGSGGGSPRFQVFGADAM 33331111---------------1111-3333---1111--------------------- RGPRGLKAVGPYVVTKAMASGVSACLATPFKIHGVNYSISSACATSAHCIGNAVEQIQLG -11113333---3333-1111-3333-1111---------!!!!---------------- KQDIVFAGGGEELCWEMACEFDAMGALSTKYNDTPEKASRTYDAHRDGFVIAGGGGMVVV 
-------------3333----1111-----33331111-2222----------------- EELEHALARGAHIYAEIVGYGATSDGADMVAPSGEGAVRCMKMAMHGVDTPIDYLNSHGT ------1111----------------------------------2222------------ STPVGDVKELAAIREVFGDKSPAISATAAMTGHSLGAAGVQEAIYSLLMLEHGFIAPSIN -3333-----------!!!!-------------!!!!----------------------- IEELDEQAAGLNIVTETTDRELTTVMSNSFGFGGTNATLVMRKLKD ----1111-------------------------------------- >GTP CYCLOHYDROLASE II; SWP:P0A7I7; PDB:2BZ1A; QLKRVAEAKLPTPWGDFLVGFEELATGHDHVALVYGDISGHTPVLARVHSECLTGDALFS -----------1111------------------------------------3333----- LRCDCGFQLEAALTQIAEEGRGILLYHRQEGRNIGLLNKIRAYALQDQGYDTVEANHQLG -----------------------------%%%%------------1111-------1111 FAADERDFTLCADFKLLGVNEVRLLTNNPKKVEILTEAGINIVERVPLIVG ------3333----1111---------3333-------------------- >Coagulation factor VII [P; SWP:P08709; PDB:2BZ6H; IVGGKVCPKGECPWQVLLLVNGAQLCGGTLINTIWVVSAAHCFDKIKNWRNLIAVLGEHD -------22221111-----------------------33331111-1111--------1 LSEHDGDEQSRRVAQVIIPSTYVPGTTNHDIALLRLHQPVVLTDHVVPLCLPERTFSERT 111---------------11112222----------------1111-------------3 LAFVRFSLVSGWGQLLDRGATALELMVLNVPRLMTQDCLQQSRKVGDSPNITEYMFCAGY 333------------------------------------------------1111----- SDGSKDSCKGDSGGPHATHYRGTWYLTGIVSWGQGCATVGHFGVYTRVSQYIEWLQKLMR -------3333--------iiii--------------2222-----3333-------111 SEPRPGVLLRAPFP 1------------- >Coagulation factor VII [P; SWP:P08709; PDB:2BZ6L; ICVNENGGCEQYCSDHTGTKRSCRCHEGYSLLADGVSCTPTVEYPCGKIPILE 3333-iiii----------------2222--3333-------------3333- >SH3-DOMAIN KINASE BINDING; SWP:Q96B97; PDB:2BZ8A; VEAIVEFDYQAQHDDELTISVGEIITNIRKEDGGWWEGQINGRRGLFPDNFVREIKK ------------1111---2222----------------!!!!----1111------ ------------------------------------------------------------ -- >KIAA0252 PROTEIN; SWP:Q92541; PDB:2BZEA; VSLPEELNRVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETA -------------------1111--33332222--------------------------- KVYQLGGTRTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTDEIN 
-------------------------3333------------------------------- KKELSIKEALN ----------- >THIOPURINE S-METHYLTRANSF; SWP:P51580; PDB:2BZGA; EVQKNQVLTLEEWQDKWVNGKTAFHQEQGHQLLKKHLDTFLKGKSGLRVFFPLCGKAVEK 1111-----------------33331111-----------2222---------------- WFADRGHSVVGVEISELGIQEFFTEQNLSYSEEPITEIPGTKVFKSSSGNISLYCCSIFD --1111----------------------------3333-------3333-------1111 LPRTNIGKFDIWDRGALVAINPGDRKCYADTFSLLGKKFQYLLCVLSYDPTKHPGPPFYV 1111-----------1111-3333-------1111-------------1111-------- PHAEIERLFGKICNIRCLEKVDAFEERHKSWGIDCLFEKLYLLTEK 3333--------------------33331111-------------- >PROPIONYL-COA CARBOXYLASE; SWP:P96885; PDB:2BZRA; DIHTTAGKLAELHKRREESLHPVGEDAVEKVHAKGKLTARERIYALLDEDSFVELDALAK --------------------1111-----------------------2222----1111- HRSTNFNLGEKRPLGDGVVTGYGTIDGRDVCIFSQDATVFGGSLGEVYGEKIVKVQELAI ----iiii----2222--------iiii-------1111%%%%----------------- KTGRPLIGINDGAGARIQEGVVSLGLYSRIFRNNILASGVIPQISLIMGAAAGGHVYSPA ---------------3333-----------------2222------------------11 LTDFVIMVDQTSQMFITGPDVIKTVTGEEVTMEELGGAHTHMAKSGTAHYAASGEQDAFD 11-----2222-------------------3333-------------------------- YVRELLSYLPPNNSTDAPRYQAAAPTGPIEENLTDEDLELDTLIPDSPNQPYDMHEVITR -----1111--1111------------3333--33333333-----1111--------33 LLDDEFLEIQAGYAQNIVVGFGRIDGRPVGIVANQPTHFAGCLDINASEKAARFVRTCDC 33-------11113333------iiii-------1111iiii----------------11 FNIPIVMLVDVPGFLPGTDQEYNGIIRRGAKLLYAYGEATVPKITVITRKAYGGAYCVMG 11--------------3333---------------------------------------- SKDMGCDVNLAWPTAQIAVMGASGAVGFVYRQQIDKLRLRLQQEYEDTLVNPYVAAERGY 1111-------1111--------------------------------------------- VGAVIPPSHTRGYIGTALRLLERKKKHGNVPL -----3333----------------------- >FIBER PROTEIN 2; SWP:P16883; PDB:2BZUA; SLTTIWSISPTPNCSIYETQDANLFLCLTKNGAHVLGTITIKGLKGALREMHDNALSLKL ------------------------------!!!!----------!!!!------------ PFDNQGNLLNCALESSTWRYQETNAVASNALTFMPNSTVYPRNKTAHPGNMLIQISPNIT --1111-------3333---1111-----1111-------2222---------------- 
FSVVYNEINSGYAFTFKWSAEPGKPFHPPTAVFCYITEQ --------------------2222--------------- >CRK-LIKE PROTEIN; SWP:P46109; PDB:2BZYA; PVFAKAIQKRVPCAYDKTALALEVGDIVKVTRMNINGQWEGEVNGRKGLFPFTHVKIFDP ---------------1111---2222----------------!!!!----3333------ QNP --- >3-OXOACYL-(ACYL-CARRIER P; SWP:Q8I2S7; PDB:2C07A; KENYYYCGENKVALVTGAGRGIGREIAKMLAKSVSHVICISRTQKSCDSVVDEIKSFGYE -----------------------------3333---------------------1111-- SSGYAGDVSKKEEISEVINKILTEHKNVDILVNNAGITRDNLFLRMKNDEWEDVLRTNLN ------1111-------------------------------3333--------------- SLFYITQPISKRMINNRYGRIINISSIVGLTGNVGQANYSSSKAGVIGFTKSLAKELASR -------------------------3333---2222--------------------3333 NITVNAIAPGFISSISEQIKKNIISNIPAGRMGTPEEVANLACFLSSDKSGYINGRVFVI --------------------------3333---3333---------1111---------- DGGLSP iiii-- >ZINC BINDING ALCOHOL DEHY; SWP:Q8N4Q0; PDB:2C0CA; GVDLGTENLYFQSMMQKLVVTRLSPNFREAVTLSRDCPVPLPGDGDLLVRNRFVGVNASD -------------------------3333-------------1111----------1111 INYSAGRYDPSVKPPFDIGFEGIGEVVALGLSASARYTVGQAVAYMAPGSFAEYTVVPAS -----3333---------------------3333---2222----------------333 IATPVPSVKPEYLTLLVSGTTAYISLKELGGLSEGKKVLVTAAAGGTGQFAMQLSKKAKC 3-------33333333----------------2222-----1111--------------- HVIGTCSSDEKSAFLKSLGCDRPINYKTEPVGTVLKQEYPEGVDVVYESVGGAMFDLAVD ---------------1111-----3333----------1111--------!!!!----11 ALATKGRLIVIGFISGYQTPTGLSPVKAGTLPAKLLKKSASVQGFFLNHYLSKYQAAMSH 112222-------1111-1111-----1111--------------33333333------- LLEMCVSGDLVCEVDLGDLSPEGRFTGLESIFRAVNYMYMGKNTGKIVVELPH ----------------1111-----------------1111------------ >THIOREDOXIN PEROXIDASE 2; SWP:Q9BKL4; PDB:2C0DA; LVTKKAYNFTAQGLNKNNEIINVDLSSFIGQKYCCLLFYPLNYTFVCPTEIIEFNKHIKD 2222----------1111-----33332222----------3333-3333---------- FENKNVELLGISVDSVYSHLAWKNMPIEKGGIGNVEFTLVSDINKDISKNYNVLYDNSFA -------------------------3333------------1111---1111--%%%%-- LRGLFIIDKNGCVRHQTVNDLPIGRNVQEVLRTIDSIIHVDTSGEVCP -------1111--------1111------------------------- >WINDBEUTEL PROTEIN; SWP:O44342; 
PDB:2C0GA; CTGCVDLDELSFEKTVERFPYSVVKFDIASPYGEKHEAFTAFSKSAHKATKDLLIATVGV 2222---3333------------------------------------------------- KDYGELENKALGDRYKVDDKNFPSIFLFKGNADEYVQLPSHVDVTLDNLKAFVSANTPLY --!!!!------1111----------------------3333------------------ IGRDGCIKEFNEVLKNYANIPDAEQLKLIEKLQAKQEQLTDPEQQQNARAYLIYMRKIHE --222233331111-3333----------------1111--------------------- VGYDFLEEETKRLLRLKAGKVTEAKKEELLRKLNILEVFRVHKVTKTA ---------------1111---------------3333---------- >MANNAN ENDO-1,4-BETA-MANN; SWP:Q8WPJ2; PDB:2C0HA; AAVRLSVSGTNLNYNGHHIFLSGANQAWVNYARDFGHNQYSKGKSTFESTLSDMQSHGGN -------!!!!--iiii------------2222--%%%%3333-----------1111-- SVRVWLHIEGESTPEFDNNGYVTGIDNTLISDMRAYLHAAQRHNILIFFTLWNGAVKQST ----------------1111-----1111-----------1111-------------111 HYRLNGLMVDTRKLQSYIDHALKPMANALKNEKALGGWDIMNEPEGEIKPGESSSEPCFD 1---------------------------1111-----------------------3333- TRHLSGSGAGWAGHLYSAQEIGRFVNWQAAAIKEVDPGAMVTVGSWNMKADTDAMGFHNL -1111--2222------------------------1111--------1111--iiii--- YSDHCLVKAGGKQSGTLSFYQVHTYDWQNHFGNESPFKHSFSNFRLKKPMVIGEFNQEHG -----------1111-----------%%%%-1111----3333------------3333i AGMSSESMFEWAYTKGYSGAWTWSRTDVSWNNQLRGMQHLKSRTDHGQVQFGL iii--------------------3333----------1111------------ >Trafficking protein parti; SWP:O75865; PDB:2C0JB; ADTVLFEFLHTEMVAELWKMSLSVLEGMGFRVGQALGERLPRETLAFREELDVLKFLCKD -3333-------------------------------1111333322223333-------- LWVAVFQKQMDSLRTNHQGTYVLQDNSFPLLLGLQYLEEAPKFLAFTCGLLRGALYTLGI ---------------------------1111-333333331111-----------1111- ESVVTASVAALPVCKFQVVIPKS ----------------------- >HEMOGLOBIN; SWP:O96457; PDB:2C0KA; MNSEEVNDIKRTWEVVAAKMTEAGVEMLKRYFKKYPHNLNHFPWFKEIPFDDLPENARFK ----------------------------------333311113333--111111113333 THGTRILRQVDEGVKALSVDFGDKKFDDVWKKLAQTHHEKKVERRSYNELKDIIIEVVCS -------------11112222---------------3333--3333-------------- CVKLNEKQVHAYHKFFDRAYDIAFAEMAKM ------------------------------ >A197; SWP:Q6Q0L5; PDB:2C0NA; 
RTLFFIPSGSVRLPLIDFLVKNDIEYVILSRRNHVAVQREIALDFLEKDYDTLAFLDEDV ---------------------------------3333-------------------1111 VPIEIDFQKVEAKFNEGYDVVCGYYYLKTLRGYSVYRKDWEKEIFDGEVNGCGLGFTFIK -------------1111---------1111------------------------------ REFLEKIKRPAFLAIGEDVYFFSTHKPRTYALSSLKAYHFIDERLALSPDRKLILQNDHV -3333----------3333------------1111------------1111--------- ARIKHHH ------- >PHOSPHOSERINE AMINOTRANSF; SWP:Q59196; PDB:2C0RA; SERAYNFNAGPAALPLEVLERAQAEFVDYQHTGMSIMEMSHRGAVYEAVHNEAQARLLAL ----------------------------%%%%--3333-1111---------------11 LGNPTGYKVLFIQGGASTQFAMIPMNFLKEGQTANYVMTGSWASKALKEAKLIGDTHVAA 11------------3333----------2222-----------------3333------- SSEASNYMTLPKLQEIQLQDNAAYLHLTSNETIEGAQFKAFPDTGSVPLIGDMSSDILSR --1111-----3333----------------------------!!!!------------- PFDLNQFGLVYAGAQKNLGPSGVTVVIVREDLVAESPKHLPTMLRYDTYVKNNSLYNTPP --3333----------------------3333----11111111-----1111------- SFGIYMVNEVLKWIEERGGLEGVQQANRKKASLIYDAIDQSGGFYRGCVDVDSRSDMNIT ----------------------------------------iiii-----1111------- FRLASEELEKEFVKASEQEGFVGLKGHRSVGGLRASIYNAVPYESCEALVQFMEHFKRSR --------------------------1111-------11113333--------------- G - >CONSERVED DOMAIN PROTEIN; SWP:Q81XQ9; PDB:2C0SA; MNVTKLNDRIEAKKKELIYLVEKYGFTHHKVISFSQELDRLLNLLIELKTKKKRYSLLEH ------------------------1111-------------------------3333--- HHHH ---- >NOVW; SWP:NA; PDB:2C0ZA; HMRLRPLGIEGVWEITPERADPRGVFLDWYHVDRFAEAIGRPLRLAQANLSVSVRGVVRG --------2222--------1111-----------------------------2222--- IHFVDVPPGQAKYVTCVRGAVFDVVVDLRVGSPTYGCWEGTRLDDVSRRAVYLSEGIGHG ----------------------------2222-2222----------------2222--- FCAISDEATLCYLSSGTYDPATEHGVHPLDPELAIDWPTGTPLLSPRDQDALLLAEARDA ------------------3333----1111--------------3333---------111 GLLPTYATCQ 1---3333-- >MEMBRANE COPPER AMINE OXI; SWP:Q16853; PDB:2C10A; CQLFADLSREELTAVMRFLTQRLGPGLVDAAQARPSDNCVFSVELQLPPKAAALAHLDRG -1111------------------1111-3333-1111----------------------- SPPPAREALAIVFFGRQPQPNVSELVVGPLPHPSYMRDVTVERHGGPLPYHRRPVLFQEY 
---------------------------------------3333-----1111-------- LDIDQMIFNRELPQASGLLHHCCFYKHRGRNLVTMTTAPRGLQSGDRATWFGLYYNISGA ----------3333----------------------------2222-------------- GFFLHHVGLELLVNHKALDPARWTIQKVFYQGRYYDSLAQLEAQFEAGLVNVVLIPDNGT -1111-------------3333-------iiii--------------------------- GGSWSLKSPVPPGPAPPLQFYPQGPRFSVQGSRVASSLWTFSFGLGAFSGPRIFDVRFQG 1111-------------------------!!!!------------------------iii ERLVYEISLQEALAIYGGNSPAAMTTRYVDGGFGMGKYTTPLTRGVDCPYLATYVDWHFL i--------------------3333---3333---1111---2222--1111-------- LESQAPKTIRDAFCVFEQNQGLPLRRHHSDLYSHYFGGLAETVLVVRSMSTLLNDYVWDT ------------------------------------------------------------ VFHPSGAIEIRFYATGYISSAFLFGATGKYGNQVSEHTLGTVHTHSAHFKVDLDVAGLEN --1111----------------------------2222---------------------- WVWAEDMVFVPMAVPWSPEHQLQRLQVTRKLLEMEEQAAFLVGSATPRYLYLASNHSNKW -------------1111----------------3333---2222-------------111 GHPRGYRIQMLSFAGEPLPQNSSMARGFSWERYQLAVTQRKEEEPSSSSVFNQNDPWAPT 1-----------------3333------------------1111----1111--3333-- VDFSDFINNETIAGKDLVAWVTAGFLHIPHAEDIPNTVTVGNGVGFFLRPYNFFDEDPSF -3333------------------------1111-----2222--------------3333 YSADSIYFRGDQDAGACEVNPLACLPQAAACAPDLPAFSHGGFSH -1111---33331111-----11113333---------------- >NITROALKANE OXIDASE; SWP:Q8X1D8; PDB:2C12A; VDFKLSPSQLEARRHAQAFANTVLTKASAEYSTQKDQLSRFQATRPFYREAVRHGLIKAQ -------------------------3333-1111---------------------3333- VPIPLGGTMESLVHESIILEELFAVEPATSITIVATALGLMPVILCDSPSLQEKFLKPFI -3333--------------------------------------------------3333- SGEGEPLASLMHSEPNGTANWLQKGGPGLQTTARKVGNEWVISGEKLWPSNSGGWDYKGA -------------11111111-2222---------!!!!---------2222-------- DLACVVCRVSDDPSKPQDPNVDPATQIAVLLVTRETIANNKKDAYQILGEPELAGHITTS -----------1111--11113333-----------11111111-----------1111- GPHTRFTEFHVPHENLLCTPGLKAQGLVETAFAMSAALVGAMAIGTARAAFEEALVFAKS -----------3333--------------------------------------------- DTRGGSKHIIEHQSVADKLIDCKIRLETSRLLVWKAVTTLEDEALEWKVKLEMAMQTKIY 
-%%%%------------------------------------33333333----------- TTDVAVECVIDAMKAVGMKSYAKDMSFPRLLNEVMCYPLFDGGNIGLRRRQMQRVMALED ------------------------------------3333-----------------111 YEPWAATYGS 11111----- >CARBOXYPEPTIDASE B; SWP:Q3T905; PDB:2C1CA; LPYDNYQELEVIDEYLDYIGEKYPDVATVVNAAESFEGRPIKYIKISTTNFE ----------------------------------1111-------------- >SOXX; SWP:O33434; PDB:2C1DA; DPVEDGLVIETDSGPVEIVTKTAPPAFLADTFDTIYSGWHFRDDSTRDLERDDFDNPAMV ----------1111----------3333--------3333-----------3333----- FVDRGLDKWNAAMGVNGESCASCHQGPESMAGLRAVMPRVDEHTGKLMIMEDYVNACVTE -------------1111-3333-----------1111-----------3333-------- RMGLEKWGVTSDNMKDMLSLISLQSRGMAVNVKIDGPAAPYWEHGKEIYYTRYGQLEMSC -------1111-------------2222------!!!!---------------1111-33 ANCHEDNAGNMIRADHLSQGQINGFPTYRLKDSGMVTAQHRFVGVRDTRAETFKAGSDDF 33----2222-!!!!------1111----1111-----------1111-----2222--- KALELYVASRGNGLSVEGVSVRH --------1111----------- >SoxX protein; SWP:Q9LCV0; PDB:2C1DB; CETAPKEVVYVEGAVEASLTGAPGNPEEGVRIMTTNALGNCVACHQIGALPDVEFPGTIA ---1111---iiii--------------------1111-1111---3333---------- PPLDGAGDRWTEAQLRGIVANAKMTFEGTFMPAFYKVDGFVRPGDGFSGKAGAEPLAPIL --2222----3333------3333-2222---------------!!!!------------ NAQQIEDVVAFLVTLKE ----------------- >BIFUNCTIONAL ENDO-1,4-BET; SWP:Q5YB84; PDB:2C1FA; MKFTVGNGQNQHKGVNDGFSYEIWLDNTGGNGSMTLGSGATFKAEWNAAVNRGNFLARRG ---------------iiii------------------!!!!--------2222------- LDFGSQKKATDYDYIGLDYAATYKQTASASGNSRLCVYGWFQNRGLNGVPLVEYYIIEDW -------1111-------------------------------2222-------------- VDWVPDAQGKMVTIDGAQYKIFQMDHTGPTINGGSETFKQYFSVRQQKRTSGHITVSDHF -------------iiii------------1111---------------------3333-- KEWAKQGWGIGNLYEVALNAEGWQSSGVADVTLLDVYTT ---1111-------------------------------- >DELTA-AMINOLEVULINIC ACID; SWP:Q59334; PDB:2C1HA; VHRPRRLRRTAALRNLVQENTLTVNDLVFPLFVMPGTNAVEEVSSMPGSFRFTIDRAVEE ---3333--1111-1111----1111----------------1111------3333---- CKELYDLGIQGIDLFGIPEQKTEDGSEAYNDNGILQQAIRAIKKAVPELCIMTDVALDPF 
----1111-------------111133331111------------1111-------1111 TPFGHDGLVKDGIILNDETVEVLQKMAVSHAEAGADFVSPSDMMDGRIGAIREALDETDH 1111-------------------------------------------------------1 SDVGILSYAAKYASSFYGPFRDALHSAPQFGDKSTYQMNPANTEEAMKEVELDIVEGADI 111--------------33331111-------3333--11113333-------------- VMVKPGLAYLDIVWRTKERFDVPVAIYHVSGEYAMVKAAAAKGWIDEDRVMMESLLCMKR -----1111------------------------------1111----------------- AGADIIFTYYAKEAAKKLR -------1111-------- >PEPTIDOGLYCAN GLCNAC DEAC; SWP:Q8DP63; PDB:2C1IA; FEQKIESLKKEKDDQLSEGNQKEHFRQGQAEVIAYYPLQGEKVISSVRELINQDVKDKLE ----------------2222------!!!!------------------------------ SKDNLVFYYTEQEESGLKGVVNRNVTKQIYDIEETEKTSLGKVHLTEDGQPFTLDQLFSD ----------------2222-------------------------1111---3333---- ASKAKEQLIKELTSFDLSAWNFDYKDSQIILYEIALPVSAFFDVIQSSYLLEKDAALYQS ---------------3333-----%%%%--------33333333-3333----------- YFDKKHQKVVALTFNDGPNPATTPQVLETLAKYDIKATFFVLGKNVSGNEDLVKRIKSEG ---1111-----------1111-------------------33332222-------1111 HVVGNHSWSHPILSQLSLDEAKKQITDTEDVLTKVLGSSSKLMRPPYGAITDDIRNSLDL -----------3333-----------------------------2222------1111-- SFIMWDVDSLDWKSKNEASILTEIQHQVANGSIVLMHDIHSPTVNALPRVIEYLKNQGYT ---------3333---------------2222----------------------1111-- FVTIPEMLNTRLKAHELYYSRDE --3333--33332222---1111 >RESTRICTION ENDONUCLEASE; SWP:Q9F4C9; PDB:2C1LA; MNFFSLHPNVYATGRPKGLIGMLENVWVSNHTPGEGTLYLISGFSNYNGGVRFYETFTEH -------------------------------2222--------------3333------- INQGGRVIAILGGSTSQRLSSRQVVEELLNRGVEVHIINRKRILHAKLYGTSNNLGESLV 1111------------------------1111--------------------3333---- VSSGNFTGPGMSQNIEASLLLDNNTTQSMGFSWNDMISEMLNQNWHIHNMTNATDASPGW ------3333----------------------------1111-------22221111333 NLLYDERTTNLTLDETERVTLIVTLGHADTARIQAAPGTTAGQGTQYFWLSKDSYDFFPP 3---1111-----3333---------3333-----22221111----------1111--- LTIRNRRGTKATYSSLINMNYIDINYTDTQCRVTFEAENNFDFRLGTGKLRYTGVAKSND -----2222-------------------------------------3333------2222 
IAAITRVGDSDYELRIIKQGTPEHSQLDPYAVSFIGNRGKRFGYISNEEFGRIIGVTF -----------------2222-----3333---------------------------- >Nucleoporin 50 kDa; SWP:Q9JIH2; PDB:2C1MB; MAKRVAEKELTDRNWDEEDEVEEMGTFSVASEEVMKNRAVKKAKRR ----------1111----------------33331111-------- >IGK-C PROTEIN; SWP:A0A5D9; PDB:2C1OA; ELVMTQSPLTLSVTIGQPASISCKSSQSLLYS -------------2222--------------- >IGK-C PROTEIN; SWP:A0A5D9; PDB:2C1PA; ELVMTQSPLTLSVTIGQPASISCKSSQSLLYSNGKTYLNWLLQRPGQSPKRLIYLVSKLD -------------2222-------------1111---------2222------------- SGDPDRFTGSGSGTDFTLKISRVEAEDLGIYYCVQGSHFPPTFGAGTKLELKRADAAPTV -----------------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ------------------------------------------------------------ SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNE ------3333------------1111------------ >IGK-C PROTEIN; SWP:NA; PDB:2C1PB; VQLQQSGAELVRPGTSVKLSCKASGYSFTNYWMNWLRQRPGQGLDWIGMIHPSDSETRLN -----------2222-----------1111--------2222-----------------3 QKFKDKATLTVDRSSSTAYIQLSSPTSEDSAVYYCARDDYDGAFWGQGTLVTVSAAKTTP 333---------1111---------1111---------1111------------------ PSVYPLAPGSAAQTNSMVTLGCLVKGYFPAPVTVTWNSGSLSSGVHTFPAVLQSDLYTLS -----------------------------------%%%%-------------iiii---- SSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC -----1111------------1111------------ >BIOTIN BINDING PROTEIN A; SWP:NA; PDB:2C1SA; RKCELQGLWRNELGSNMTISALDVAGTFSGSYQTAVTATNKQILVSPLKGAQQPPGTKGQ 2222------1111--------1111---------------------------------- QPTFGFTVQWQFADSTTVFVGQCFVDRRGKEMLEMAWLLREEVPSRKDTWKATRVGTNVF -------------------------1111---------------33331111-------- TRV --- >Nucleoporin NUP2; SWP:P32499; PDB:2C1TC; AKRVADAQIQRETYDSNTKVASSAVMNRRKIAMPKR ---------3333----------------------- >DI-HAEM CYTOCHROME C PERO; SWP:A1B0G0; PDB:2C1VA; AIDNGALREEAKGVFEAIPEKMTAIKQTEDNPEGVPLTAEKIELGKVLFFDPRMSSSGLI ---------------------------3333-------------------11113333-- SCQTCHNVGLGGVDGLPTSIGHGWQKGPRNAPTMLNAIFNAAQFWDGRAADLAEQAKGPV 
3333--1111----------2222---------2222------1111---3333------ QAGVEMSNTPDQVVKTINSMPEYVEAFKAAFPEEADPVTFDNFAAAIEQFEATLITPNSA -1111-------------------------1111-------------------------- FDRFLAGDDAAMTDQEKRGLQAFMETGCTACHYGVNFGGQDYHPFGLIAKPGAEVLPAGD ---11111111---------------1111---1111--------------3333-1111 TGRFEVTRTTDDEYVFRAAPLRNVALTAPYFHSGVVWELAEAVKIMSSAQIGTELTDQQA -3333---1111--------2222------1111-------------------------- EDITAFLGTLTGEQPVIDHPILPVRTGTTPLPTPM -------1111--------------1111------ >UDP-GLUCOSE FLAVONOID 3-O; SWP:O22304; PDB:2C1XA; NPHVAVLAFPFSTHAAPLLAVVRRLAAAAPHAVFSFFSTSQSNASIFQCNIKSYDISDGV ----------------------------1111---------------1111--------- PEGYVFAGRPQEDIELFTRAAPESFRQGMVMAVAETGRPVSCLVADAFIWFAADMAAEMG ---------------------------------------------1111----------- VAWLPFWTAGPNSLSTHVYIDEIREKIGVSGIQGREDELLNFIPGMSKVRFRDLQEGIVF -------------------------------2222----1111--11113333-2222-- GNLNSLFSRMLHRMGQVLPKATAVFINSFEELDDSLTNDLKSKLKTYLNIGPFNLITGCL -1111-----------3333--------1111-------------------1111----- QWLKERKPTSVVYISFGTVTTPPPAEVVALSEALEASRVPFIWSLRDKARVHLPEGFLEK -3333-----------------3333-------------------333311112222--- TRGYGMVVPWAPQAEVLAHEAVGAFVTHCGWNSLWESVAGGVPLICRPFFGDQRLNGRMV 1111--------------3333---------------------------!!!!------- EDVLEIGVRIEGGVFTKSGLMSCFDQILSQEKGKKLRENLRALRETADRAVGPKGSSTEN ---------2222--------------------------------------2222----- FITLVDLVSKPKDV ------1111---- >UDP-GLUCOSE 4-EPIMERASE; SWP:Q81K34; PDB:2C20A; NSILICGGAGYIGSHAVKKLVDEGLSVVVVDNLQTGHEDAITEGAKFYNGDLRDKAFLRD ------11113333----------------------3333----------1111------ VFTQENIEAVMHFAADSLVGVSMEKPLQYYNNNVYGALCLLEVMDEFKVDKFIFSSTAAT --------------------------------------------------------3333 YGEVDVDLITEETMTNPTNTYGETKLAIEKMLHWYSQASNLRYKIFRYFNVAGATPNGII ---------3333---------------------3333----------------1111-- GEDHRPETHLIPLVLQVALGQREKIMMFGDDYNTPDGTCIRDYIHVEDLVAAHFLGLKDL --------------------------------------------3333------------ 
QNGGESDFYNLGNGNGFSVKEIVDAVREVTNHEIPAEVAPRRAGDPARLVASSQKAKEKL ---------------------------1111----------------------------- GWDPRYVNVKTIIEHAWNWHQKQPNGYEK -------3333-----------1111--- >TRYPANOTHIONE-DEPENDENT G; SWP:Q4FWG9; PDB:2C21A; SRRMLHTMIRVGDLDRSIKFYTERLGMKVLRKWDVPEDKYTLVFLGYGPEMSSTVLELTY ----------------------------------3333----------3333-------- NYGVTSYKHDEAYGHIAIGVEDVKELVADMRKHDVPIDYEDESGFMAFVVDPDGYYIELL 2222--------------------------1111----------------1111------ NEKTMMEKAEADMKEQGTA ------------------- >Dihydroflavonol 4-reducta; SWP:P93799; PDB:2C29D; ETVCVTGASGFIGSWLVMRLLERGYTVRATVRDPTNVKKVKHLLDLPKAETHLTLWKADL ------3333----------1111--------11111111--1111-3333-------11 ADEGSFDEAIKGCTGVFHVATPMDFESKDPENEVIKPTIEGMLGIMKSCAAAKTVRRLVF 11-----3333-----------------3333---------------------------- TSSAGTVNIQEHQLPVYDESCWSDMEFCRAKKMTAWMYFVSKTLAEQAAWKYAKENNIDF --3333-----------1111-----------2222------------------------ ITIIPTLVVGPFIMSSMPPSLITALSPITGNEAHYSIIRQGQFVHLDDLCNAHIYLFENP -----------------3333---------3333------------------------11 KAEGRYICSSHDCIILDLAKMLREKYPEYNIPTEFKGVDENLKSVCFSSKKLTDLGFEFK 11-----------------------3333-----22221111----------3333---- YSLEDMFTGAVDTCRAKGLLPPSH --------------1111------ >SENSOR HISTIDINE KINASE; SWP:Q9WZV7; PDB:2C2AA; MENVTESKELERLKRIDRMKTEFIANISHELRTPLTAIKAYAETIYNSLGELDLSTLKEF -----------------------------------------------1111-----3333 LEVIIDQSNHLENLLNELLDFSRLERKSLQINREKVDLCDLVESAVNAIKEFASSHNVNV -----------------------------------------------------1111--- LFESNVPCPVEAYIDPTRIRQVLLNLLNNGVKYSKKDAPDKYVKVILDEKDGGVLIIVED ------------------------------11111111-----------iiii------- NGIGIPDHAKDRIFEQFYRVDTGLGLAITKEIVELHGGRIWVESEVGKGSRFFVWIPKDR -----3333-----2222-----!!!!------1111-------2222------------ A - >RAS-RELATED C3 BOTULINUM ; SWP:P60763; PDB:2C2HA; AIKCVVVGDGAVGKTCLLISYTTNAFPGDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRP --------2222------------------------%%%%-----------1111----- 
LSYPQTDVFLICFSLVSPASFENVRAKWYPEVRHHCPHTPILLVGTKLDLRDDKDTIERL --2222-------1111-----------------------------3333---------- RDKKLAPITYPQGLAMAREIGSVKYLECSALTQRGLKTVFDEAIRAVL 1111-------------------------1111--------------- >RV0130; SWP:P96807; PDB:2C2IA; RTFESVADLAAAAGEKVGQSDWVTITQEEVNLFADATGDHQWIHVDPERAAAGPFGTTIA ---------1111---------------------------3333--------1111---- HGFTLALLPRLQHQYTVKGVKLAINYGLNKVRFPAPVPVGSRVRATSSLVGVEDLGNGTV ---1111------------------------------2222--------------iiii- QATVSTTVEVEGSAKPACVAESIVRYV ---------1111-------------- >DNA-BINDING STRESS RESPON; SWP:Q9RZN1; PDB:2C2JA; TEDLKKSVQALQNTLTELQALQLQTKQAHWNVSGTLWYTLHELLQDHYEGISKFADDVAE ---------------------------------1111----------------------- RQLSVGASSDGRAITIVAASRLPEIPGGFLDDAQVIQFFTYQYETVGQRIHQRVGDVEKV --1111------------------------------------------------------ DPTTANLLQEVEHIIEKYQWQMRAFLQNTPTDPNTGFDINNGKPVP ----------------------------1111---3333iiii--- >MALONYL COA-ACYL CARRIER ; SWP:Q8IVS2; PDB:2C2NA; MGQCSVLLFPGQGSQVVGMGRGLLNYPRVRELYAAARRVLGYDLLELSLHGPQETLDRTV ---------------22221111--2222----------------------3333--333 HCQPAIFVASLAAVEKLHHLQPSVIENCVAAAGFSVGEFAALVFAGAMEFAEGLYAVKIR 3-----------------------1111-----!!!!----------------------- AEAMQEASEAVPSGMLSVLGQPQSKFNFACLEAREHCKSLGIENPVCEVSNYLFPDCRVI --------------------1111-------------1111------------2222--- SGHQEALRFLQKNSSKFHFRRTRMLPVSGAFHTRLMEPAVEPLTQALKAVDIKKPLVSVY ------------3333----------------33331111------1111---------- SNVHGHRYRHPGHIHKLLAQQLVSPVKWEQTMHAIYERKKGRGFPQTFEVGPGRQLGAIL ---------3333------3333---------------2222------------------ KSCNMQAWKSYSAVDVL ----3333--------- >G/U MISMATCH-SPECIFIC DNA; SWP:Q9RWF4; PDB:2C2QA; VPDLTGSGEYLVPDVLQPGLTLVLVGTAPSGISARARAYYANPENKFWRTLHAVGLTPRQ --1111-----------------------------------1111--------------- LVPQEYATLPQYGLGLTDVAKRHSGVAAALPGEAWRPDELRRKVEHYRPRIVAFTSKRGA -11111111----------------3333-3333-------------------------- SETLGVPTGKLPYGPQPQPLDWPAETELWVLPSTSPLGHNHFRLEPWQALGDRVRELRGA 
------3333------------1111---------------------------------- AEA --- >DNA-BINDING STRESS RESPON; SWP:Q9RS64; PDB:2C2UA; GGADHADAAHLGTVNNALVNHHYLEEKEFQTVAETLQRNLATTISLYLKFKKYHWDIRGR --3333---------1111-!!!!----------------------------------11 FFRDLHLAYDEFIAEIFPSIDEQAERLVALGGSPLAAPADLARYSTVQVPQETVRDARTQ 11---------------------------------------------------------- VADLVQDLSRVGKGYRDDSQACDEANDPVTADMYNGYAATIDKIRWMLQAIMDDERLD ----------------------1111-----------------------11111111- >STIP1 homology and U box-; SWP:Q9WUD1; PDB:2C2VS; DIPDYLCGKISFELMREPCITPSGITYDRKDIEEHLQRVGHFNPVTRSPLTQEQLIPNLA --3333--------------1111--------------------------3333------ MKEVIDAFISENGWV --------------- >METHYLENETETRAHYDROFOLATE; SWP:O50385; PDB:2C2XA; GAIMLDGKATRDEIFGDLKQRVAALDAAGRTPGLGTILVGDDPGSQAYVRGKHADCAKVG --------------------------------------------------------1111 ITSIRRDLPADISTATLNETIDELNANPDCTGYIVQLPLPKHLDENAALERVDPAKDADG --------1111--------------1111---------3333-----333333333333 LHPTNLGRLVLGTPAPLPCTPRGIVHLLRRYDISIAGAHVVVIGRGVTVGRPLGLLLTRR ----------------------------1111--2222-------1111---------11 SENATVTLCHTGTRDLPALTRQADIVVAAVGVAHLLTADMVRPGAAVIDVGVSRTDDGLV 11-------1111-33333333--------------3333-2222---------1111-- GDVHPDVWELAGHVSPNPGGVGPLTRAFLLTNVVELAERR --------------------3333---------------- >SERINE/THREONINE-PROTEIN ; SWP:Q9NQU5; PDB:2C30A; VTHEQFKAALRMVVDQGDPRLLLDSYVKIGEGSTGIVCLAREKHSGRQVAVKMMDLRKQQ --------3333-----3333--------------------------------------- RRELLFNEVVIMRDYQHFNVVEMYKSYLVGEELWVLMEFLQGGALTDIVSQVRLNEEQIA 3333-------1111-1111--------!!!!-----------33333333--------- TVCEAVLQALAYLHAQGVIHRDIKSDSILLTLDGRVKLSDFGFCAQISKDVPKRKLVGTP -----------------------1111---1111------1111---3333-------33 YWMAPEVISRSLYATEVDIWSLGIMVIEMVDGEPPYFSDSPVQAMKRLRDSPPPKLKNSH 33-3333------3333------------------1111------------------333 KVSPVLRDFLERMLVRDPQERATAQELLDHPFLLQTGLPECLVPLIQLY 3--------3333---3333---------3333----3333-------- >OXALYL-COA DECARBOXYLASE; SWP:P40149; PDB:2C31A; 
VELTDGFHVLIDALKMNDIDTMYGVVGIPITNLARMWQDDGQRFYSFRHEQHAGYAASIA --------------1111------------------------------3333-------- GYIEGKPGVCLTVSAPGFLNGVTSLAHATTNCFPMILLSGSSEREIVDLQQGDYEEMDQM ------------------------------------------33331111-------333 NVARPHCKASFRINSIKDIPIGIARAVRTAVSGRPGGVYVDLPAKLFGQTISVEEANKLL 3-3333--------3333------------------------3333-------------- FKPIDPAPAQIPAEDAIARAADLIKNAKRPVIMLGKGAAYAQCDDEIRALVEETGIPFLP ------------3333-------1111------------------------1111----- MGMAKGLLPDNHPQSAAATRAFALAQCDVCVLIGARLNWLMQHGKGKTWGDELKKYVQID ---2222-1111---1111------------------3333%%%%3333----------- IQANEMDSNQPIAAPVVGDIKSAVSLLRKALKGAPKADAEWTGALKAKVDGNKAKLAGKM -1111-----------------------1111-----1111-----------------11 TAETPSGMMNYSNSLGVVRDFMLANPDISLVNEGANALDNTRMIVDMLKPRKRLDSGTWG 11--2222---3333-------------------3333------------------1111 VMGIGMGYCVAAAAVTGKPVIAVEGDSAFGFSGMELETICRYNLPVTVIIMNNGGIYKGN -------------------------3333----------1111----------------- EADPQPGVISCTRLTRGRYDMMMEAFGGKGYVANTPAELKAALEEAVASGKPCLINAMID ----2222-1111----------1111--------------------------------1 PDAGVE 111--- >INHIBITOR OF CYSTEINE PEP; SWP:NA; PDB:2C34A; HMIAPLSVKDNDKWVDTHVGKTTEIHLKGNPTTGYMWTRVGFVGKDVLSDEILEVVCKYT -----------------------------1111-----2222------------------ PTPSSTPMVGVGGIYVVLVKPRKRGHHTLELVYTRPFEGIKPENERYTLHLNVK ----------------------------------3333--1111---------- >DNA-DIRECTED RNA POLYMERA; SWP:O15514; PDB:2C35A; EEDASQLIFPKEFETAETLLNSEVHMLLEHRKQQNESAEDEQELSEVFMKTLNYTARFSR ---------3333------------------------------------------1111- FKNRETIASVRSLLLQKKLHKFELACLANLCPETAEESKALIPSLEGRFEDEELQQILDD --------------------------------------------2222------------ IQTKRSFQY 3333----- >DNA-directed RNA polymera; SWP:P62487; PDB:2C35B; MFYHISLEHEILLHPRYFGPNLLNTVKQKLFTEVEGTCTGKYGFVIAVTTIDNIGAGVIQ -------------3333----------------2222----------------------- PGRGFVLYPVKYKAIVFRPFKGEVVDAVVTQVNKVGLFTEIGPMSCFISRHSIPSEMEFD 
-------------------2222---------1111----!!!!----3333-3333--- PNSNPPCYKTMDEDIVIQQDDEIRLKIVGTRVDKNDIFAIGSLMDDYLGLV -----------------2222------------------------------ >GLYCOPROTEIN D HSV-1; SWP:Q69091; PDB:2C36A; PVLDQLTDPPGVRRVYHIQAGLPDPFQPPSLPITVYYAVLERACRSVLLNAPSEAPQIVR --------2222-----------------------------1111--------3333-11 GASEDVRKQPYNLTIAWFRMGGNCAIPITVMEYTECSYNKSLGACPIRTQPRWNYYDSFS 113333------------------------------11112222-----------1111- AVSEDNLGFLMHAPAFETAGTYLRLVKINDWTEITQFILEHRAKGSCKYALPLRIPPSAC --1111--------1111---------!!!!---------------1111-----3333- LSPQAYQQGVTVDSIGMLPRFIPENQRTVAVYSLKIAGWHGPKAPYTSTLLPPELAPEDP ----------3333-------3333---------1111---------------------- EDSALLEDPVGTVAPQIPPNWHIPSIQDAATPYC -----------------1111---3333--1111 >PPIASE; SWP:Q9Y7F6; PDB:2C3BA; SMSQVFFDVEYAPVGTAETKVGRIVFNLFDKDVPKTAKNFRELCKRPAGEGYRESTFHRI ------------2222------------------------------2222-2222----- IPNFMIQGGDSRKHDKKGILSMAQFFITTAVTSWLDGKHVVFGEVADEKSYSVVKEIEAL --------------1111-------------3333----------------------111 GSSSGSVRSNTRPKIVNCGEL 11111---------------- >HYALURONIDASE, PHAGE ASSO; SWP:Q9A0M7; PDB:2C3FA; LRVQFKRMKAAEWARSDVILLESEIGFETDTGFARAGDGHNRFSDLGYISPLDYNLLTNK --------33331111----2222-----------------3333-------3333---- PNIDGLATKVETAQKLQQKADKETVYTKAESKQELDKKLNLKGGVMTGQLKFKPATGGAV -3333---------3333--3333----------1111-3333----------------- NIDLSSTRGAGVVVYSDNDTSDGPLMSLRTGKETFNQSALFVDYKGTTNAVNIAMRQPTT ---1111------------------------1111------------------------- PNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPSIGADYDKNAAALSIDIVKKTN ------------1111------------------------11111111---------222 GAGTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGFYAKETSQIDGNLKLKDPTAN 2-----------1111---------%%%%-----1111--------------------11 DHAATKAYVDKAISELKKLILK 11-------------------- >ALPHA-AMYLASE G-6; SWP:Q9KFR4; PDB:2C3GA; GHMASGLTIYFKKPDSWGTPHLYYYDTNPKVDEPTWSEAPEMEHYEGDWYTHTIEGVESV -------------3333-----------------1111-------!!!!----------- RLLFKDRGTNQWPGPGEPGFFRDQDGWFDGEWHVDRPG 
-------------2222--------------------- >GLUTATHIONE S-TRANSFERASE; SWP:P30711; PDB:2C3NA; GLELYLDLLSQPCRAVYIFAKKNDIPFELRIVDLIKGQHLSDAFAQVNPLKKVPALKDGD ------3333----------1111--------33333333-3333--1111------!!! FTLTESVAILLYLTRKYKVPDYWYPQDLQARARVDEYLAWQHTTLRRSCLRALWHKVMFP !-------------1111-3333----------------3333----------------- VFLGEPVSPQTLAATLAELDVTLQLLEDKFLQNKAFLTGPHISLADLVAITELMHPVGAG ------------------------------!!!!-1111---3333----------1111 CQVFEGRPKLATWRQRVEAAVGEDLFQEAHEVILKAKDFPPADPTIKQKLMPWVLAMIR ---2222------------------------33331111-------------------- >ALPHA-AMYLASE G-6; SWP:Q9KFR4; PDB:2C3VA; DATDITIYYKTGWTHPHIHYSLNQGAWTTLPGVPLTKSEEGVKVTIEAEEGSQLRAAFNN ---------------------%%%%-----------------------2222-------- GSGQWDNNQGRDYDFSSGVHTLADGRILSGTP ------%%%%------------iiii------ >INOSINE-URIDINE PREFERRIN; SWP:Q81QM4; PDB:2C40A; MKKVYFNHDGGVDDLVSLFLLLQMDNVELTGVSVIPADCYLEPAMSASRKIIDRFGKNTI -----------------------1111--------------------------------- EVAASNSRGKNPFPKDWRMHAFYVDALPILNESGKVVTHVAAKPAHHHLIETLLQTEEKT -------------3333---------33333333-------------------------- TLLFTGPLTDLARALYEAPIIENKIKRLVWMGGTFRTAGNVHEPEHDGTAEWNSFWDPEA -------3333--------3333---------------------------3333------ VARVWEANIEIDLITLESTNQVPLTIDIREQWAKERKYIGIDFLGQCYAIVPPLYYLWDV --------------33331111--3333----1111----------3333---------- LTAAFVGKADLAKVQTINSIVHTYGPSQGRTVETDDGRPVHVVYDVNHDRFFDYITRLAK -------1111-------------1111-----3333--------------------333 KV 3- >DPS FAMILY DNA-BINDING ST; SWP:Q8DG54; PDB:2C41A; TTTLKEQVLTTLKREQANAVVMYLNYKKYHWLTYGPLFRDLHLLFEEQGSEVFAMIDELA ----------------------------------1111---------------------- ERSLMLDGQPVADPADYLKVATVTPSSGQLTVKQMIEEAIANHELIITEMHQDAEIATEA ---1111-----33331111-------------------------------------111 GDIGTADLYTRLVQTHQKHRWFLKEFLAKGDGLVS 1-----------------------1111------- >AMINOADIPATE-SEMIALDEHYDE; SWP:Q9NRN7; PDB:2C43A; LYFQGHMEGVRWAFSCGTWLPSRAEWLLAVRSIQPEEKERIGQFVFARDAKAAMAGRLMI 
--------------3333-----------1111-------1111-3333----------- RKLVAEKLNIPWNHIRLQRTAKGKPVLAKDNPYPNFNFNISHQGDYAVLAAEPELQVGID ----------1111-----1111------------------------------------- IMKTSFPGRGSIPEFFHIMKRKFTNKEWETIRSFKDEWTQLDMFYRNWALKESFIKAIGV ------------------1111-------------------------------------- GLGFELQRLEFDLSPLNLDIGQVYKETRLFLDGEEEKEWAFEESKIDEHHFVAVALRKPT ----3333----------------------iiii-1111--------------------- QRQFTILNFNDLMSSAVPMTPEDPSFWDCFCFTEEIPIRN -------3333-1111------3333-----3333----- >ASPARTATE 1-DECARBOXYLASE; SWP:A1QXI4; PDB:2C45A; MLRTMLKSKIHRATVTCADLHYVGSVTIDADLMDAADLLEGEQVTIVDIDNGARLVTYAI ---------------------------------1111----------------------- TGERGSGVIGINGAAAHLVHPGDLVILIAYATMDDARARTYQPRIVFVDAYNKP --2222-------3333------------------3333---------1111-- >MRNA CAPPING ENZYME; SWP:NA; PDB:2C46A; SMAHNKIPPRWLNCPRRGQPVAGRFLPLKTMLGPRYDSQVAEENRFHPSMLSNYLKSLKV -------2222---------%%%%--------333311113333--3333---------- KMGLLVDLTNTRFYDRNDIEKEGIKYIKLQCKGHGECPTTENTETFIRLCERFPELIGVH ------------------3333----------iiii------------------------ CTHGFNRTGFLICAFLVEKMDWSIEAAVATFAQARPPGIYKGDYLKELFRRYGDIEEAPP -----------------------------------------------------3333--- PPLLPDWCFEDDED ----3333--1111 >CASEIN KINASE 1 GAMMA 2 I; SWP:P78368; PDB:2C47A; PNFRVGKKIELRLGKNLYTNEYVAIKLEPIKSRAPQLHLEYRFYKQLSATEGVPQVYYFG ----------------------------1111---3333--------------------- PGKYNAMVLELLGPSLEDLFDLCDRTFTLKTVLMIAIQLITRMEYVHTKSLIYRDVKPEN --------------------1111--------------------------------3333 FLVGRPGTKRQHAIHIIDFGLAKEYIDPETKKHIPYREHKSLTGTARYMSINTHLGKEQS ----2222-1111-----1111----------------------3333-----------3 RRDDLEALGHMFMYFLRGSLPWQGLKADTLKERYQKIGDTKRATPIEVLCENFPEEMATY 333----------------1111-----------------11113333-22223333--- LRYVRRLDFFEKPDYDYLRKLFTDLFDRSGFVFDYEYDWAGKPLPTPI ---11111111-------------------------1111-------- >BARNASE MCOEETI FUSION; SWP:P00648; PDB:2C4BA; VINTFDGVADYLQTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREGK 
---------------------------------3333-1111-2222--------1111- LPGKSGRTWREADINYTSGFRNSDRILYSSDWLIYKTTDAYQTFTKIRSSSMGVCPKILK ---2222---------------------1111------iiii--------2222------ KCRRDSDCLAGCVCGPNGFCGS ---1111-2222--1111---- >SUGAR KINASE MJ0406; SWP:Q57849; PDB:2C4EA; GGKMEKITCVGHTALDYIFNVEKFPEPNTSIQIPSARKYYGGAAANTAVGIKKLGVNSEL ---------------------------------------------------1111----- LSCVGYDFKNSGYERYLKNLDINISKLYYSEEEETPKAWIFTDKDNNQITFFLWGAAKHY ----1111---------1111--1111---------------1111-------!!!!-33 KELNPPNFNTEIVHIATGDPEFNLKCAKKAYGNNLVSFDPGQDLPQYSKEMLLEIIEHTN 33---------------------------2222------!!!!1111--------1111- FLFMNKHEFERASNLLNFEIDDYLERVDALIVTKGSKGSVIYTKDKKIEIPCIKAGKVID ------------------33333333-------!!!!-----1111-------------- PTGAGDSYRAGFLSAYVKGYDLEKCGLIGAATASFVVEAKGCQTNLPTWDKVVERLEKH 1111-------------------------------1111-1111----------3333- >AVIDIN; SWP:NA; PDB:2C4IA; TQPTFGFTVNWKFSESTTVFTGQCFIDRNGKEVLKTMWLLRSSVNDIGDDWKATRVGINI ---------------------------iiii--------------33331111------- FTRLKCSLTGKWTNDLGSNMTIGAVNSRGEFTGTYITAVTATSNEIKESPLHGTQNTIGS -------------1111--------1111------------------------------- TTVFTGQCFIDRNGKEVLKTMWLLRSSVNDIGDDWKATRVGINIFTRLSARKCSLTGKWT ----------1111---------------33331111----------------------- NDLGSNMTIGAVNSRGEFTGTYITAVTATSNEIKESPLHGTQNTINKRTQPTFGFTVNWK 1111--------1111---------------------------2222------------- FS -- >GLUTATHIONE S-TRANSFERASE; SWP:P28161; PDB:2C4JA; PMTLGYWNIRGLAHSIRLLLEYTDSSYEEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNLP ---------!!!!-------1111-----------------3333--1111--------- YLIDGTHKITQSNAILRYIARKHNLCGESEKEQIREDILENQFMDSRMQLAKLCYDPDFE ---!!!!-------------1111-------------------------------11111 KLKPEYLQALPEMLKLYSQFLGKQPWFLGDKITFVDFIAYDVLERNQVFEPSCLDAFPNL 111-----------------!!!!-1111---3333-------------11111111--- KDFISRFEGLEKISAYMKSSRFLPRPVFSKMAVWGNK ----------3333----3333------3333----- >PHOSPHORIBOSYL PYROPHOSPH; SWP:Q14558; PDB:2C4KA; 
GYRVFSANSTAACTELAKRITERLGAELGKSVVYQETNGETRVEIKESVRGQDIFIIQTI ---------------------1111----------1111--------------------- PRDVNTAVMELLIMAYALKTACARNIIGVIPYFPYSKQSKMRKRGSIVCKLLASMLAKAG --3333----------------------------3333---------------------- LTHIITMDLHQKEIQGFFSFPVDNLRASPFLLQYIQEEIPNYRNAVIVAKSPDAAKRAQS ----------33333333---------------1111---3333------3333------ YAERLRLGLAVIHPPITVVGDVGGRIAIIVDDIIDDVESFVAAAEILKERGAYKIYVMAT ------------------------------------3333-------------------- HGILSAEAPRLIEESSVDEVVVTNTVPHEVQKLQCPKIKTVDISLILSEAIRRIHNGESM ---------------------------3333---3333---------------------3 AYLFR 333-- >GLYCOGEN PHOSPHORYLASE; SWP:Q8KQ56; PDB:2C4MA; QPLPAALVGSHVRAAAGTPADLATDRKFWTGLSRAVQERIADDWERTREAYGAARQQHYF ------------------3333----------------------------1111------ SAEFLMGRALLNNLTNLGLVDEAAAATRELGHELTDILEIENDAALGNGGLGRLAACFLD ---------------------------1111-3333-3333------------------- SAVTQDYPVTGYGLLYRFGLFRQSFNEGFQVEKPDPWREEEYPFTIRRASDQLVVCFDDM -------------------------iiii------------1111--3333--------- KTRAIPYDMPITGYGTHNVGTLRLWKAEPWEEFDYDAFNAQRFTDAIIERERVSDICRVL ------------2222--------------------------3333-------------- YPNDTTYEGKKLRVRQQYFFTSASLQAMIQDHLAHHKDLSNFAEFHSVQLNDTHPVLAIP --------------------------------------11111111------1111---- ELMRLLMDEHDMGWEESWAIVSKTFAYTNHTVLTEALEQWDEQIFQQLFWRVWEIIAEID ------------------------------------------------------------ RRFRLERAADGLDEETINRMAPIQHGTVHMAWIACYAAYSINGVAALHTEIIKAETLADW -----------------------%%%%--------------------------------- YALWPEKFNNKTNGVTPRRWLRMINPGLSDLLTRLSGSDDWVTDLDELKKLRSYADDKSV ---3333----------1111------------1111-3333----3333---------- LEELRAIKAANKQDFAEWILERQGIEIDPESIFDVQIKRLHEYKRQLMNALYVLDLYFRI ---------------------------1111---------1111---------------- KEDGLTDIPARTVIFGAKAAPGYVRAKAIIKLINSIADLVNNDPEVSPLLKVVFVENYNV -------------------1111----------------1111--3333----------- SPAEHILPASDVSEQISTAGKEASGTSNMKFMMNGALTLGTMDGANVEIVDSVGEENAYI 
-----3333--------2222---3333---1111-------!!!!-------1111--- FGARVEELPALRESYKPYELYETVPGLKRALDALDNGTLNDNNSGLFYDLKHSLIHGYGK ---11113333----3333---------------------%%%%--------------11 DASDTYYVLGDFADYRETRDRMAADYASDPLGWARMAWINICESGRFSSDRTIRDYATEI 11-11113333------------------------------------------------- WKLEPTPAV --------- >PROTEIN NAGD; SWP:P0AF24; PDB:2C4NA; MTIKNVICDIDGVLMHDNVAVPGAAEFLHGIMDKGLPLVLLTNYPSQTGQDLANRFATAG ---------------!!!!-2222--------------------------------1111 VDVPDSVFYTSAMATADFLRRQEGKKAYVVGEGALIHELYKAGFTITDVNPDFVIVGETR ---3333-----------1111-----------------1111----------------- SYNWDMMHKAAYFVANGARFIATNPDTHGRGFYPACGALCAGIEKISGRKPFYVGKPSPW -------------1111-----------------------------------------33 IIRAALNKMQAHSEETVIVGDNLRTDILAGFQAGLETILVLSGVSSLDDIDSMPFRPSWI 33---------3333------1111-----1111-----------33331111------- YPSVAEIDVI --3333---- >UBIQUITIN-CONJUGATING ENZ; SWP:NA; PDB:2C4PA; SMALKRIQKELSDLQRDPPAHCSAGPVGDDLFHWQATIMGPPDSAYQGGVFFLTVHFPTD ------------1111----------!!!!----------1111-------------111 YPFKPPKIAFTTKIYHPNINSNGSICLDILRSQWSPALTVSKVLLSICSLLCDPNPDDPL 1--------------11111111---1111----33333333------------1111-- VPDIAQIYKSDKEKYNRHAREWTQKYAM -------33333333------------- >3-DEHYDROQUINATE DEHYDRAT; SWP:NA; PDB:2C4WA; HMKILVIQGPNLNMLGHRDPRLYGMVTLDQIHEIMQTFVKQGNLDVELEFFQTNFEGEII --------2222-2222--3333----------------1111----------------- DKIQESVGSEYEGIIINPGAFSHTSIAIADAIMLAGKPVIEVHLTNIQAREEFRKNSYTG ---3333----------!!!!----------1111----------1111-3333------ AACGGVIMGFGPLGYNMALMAMVNILAEMKAFQEAQKNNP ----------3333-------------------------- >ENDOGLUCANASE; SWP:P71140; PDB:2C4XA; PENQAPKAIFTFSPEDPVTDENVVFNASNSIDEDGTIAYYVWDFGDGYEGTSTTPTITYK -----------------2222-----1111------------------------------ YKNPGTYKVKLIVTDNQGASSSFTATIKVTSATGDNSKFNFEDGTLGGFTTSGTNATGVV --------------1111----------------------1111-iiii----------- VNTTEKAFKGERGLKWTVTSEGEGTAELKLDGGTIVVPGTTTFRIWIPSGAPIAAIQPYI ------------------------------------2222-------------------- 
PHTPDWSEVLWNSTWKGYTVKTDDWNEITLTLPEDVDPTWPQQGIQVQTIDEGEFTIYVD --1111--------------------------11111111-------------------- AIDWLE ------ >Nuclear receptor coactiva; SWP:Q15788; PDB:2C52B; PTTVEGRNDEKALLEQLVSFLSGKDETELAELDRALGIDKLVQGGGLDVLSKLVPRGSL ---1111----------------3333---3333---1111------------------ >PROTEIN P6; SWP:NA; PDB:2C55A; LQSRPEPTAPPEESFRFGEETTTPSQKQEPIDKELYPLASLRSLFGSDPSSQ ----------------1111-------------------------------- >GDP-MANNOSE-3', 5'-EPIMER; SWP:Q93VR3; PDB:2C5AA; TYKELEREQYWPSENLKISITGAGGFIASHIARRLKHEGHYVIASDWKKNEHMTEDMFCD -1111-----3333-------1111----------1111--------------1111--- EFHLVDLRVMENCLKVTEGVDHVFNLAADMGGMGFIQSNHSVIMYNNTMISFNMIEAARI ----------------2222--------------1111---------------------- NGIKRFFYASSACIYPEFKQLETTNVSLKESDAWPAEPQDAFGLEKLATEELCKHYNKDF ----------3333-3333---------3333---------------------------- GIECRIGRFHNIYGPFGTWKGGREKAPAAFCRKAQTSTDRFEMWGDGLQTRSFTFIDECV -------------2222------------------------------------------- EGVLRLTKSDFREPVNIGSDEMVSMNEMAEMVLSFEEKKLPIHHIPGPEGVRGRNSDNNL -------------------------------3333------------------------- IKEKLGWAPNMRLKEGLRITYFWIKEQIEKEKAKGSDVSLYGSSKVVGTQAPVQLGSLRA ------------------------------------3333-------------2222--- ADGK ---- >Tyrosine-protein kinase r; SWP:P30530; PDB:2C5DC; ESPFVGNPGNITGARGLTGTLRCQLQVQGEPPEVHWLRDGQILELADSTQTQVPLGEDEQ -------------------------------------iiii------------------- DDWIVVSQLRITSLQLSDTGQYQCLVFLGHQTFVSQPGYVGLEGLPYFLEEPEDRTVAAN ---------------3333----------------------------------------- TPFNLSCQAQGPPEPVDLLWLQDAVPLATAPGHGPQRSLHVPGLNKTSSFSCEAHNAKGV ---------------------%%%%----2222----------------------3333- TTSRTATITVLP ------------ >T-SNARE affecting a late ; SWP:Q03322; PDB:2C5IT; DPFQQVVKDTKEQLNRINNYITRHNTAGDDDQEEEIQDILKDVEETIVDLDRSIIVKRDE -3333------------------------------------------------------- NEDVSGREAQVKNIKQQLDALKLRFDRRIQEST ---3333-------------------------- >1-phosphatidylinositol-4,; SWP:Q9P212; PDB:2C5LC; 
EESFFVQVHDVSPEQPRTVIKAPRVSTAQDVIQQTLCKAKYSLSILSNPNPSDYVLLEEV -----------1111-------1111------------1111-------3333------- VKDKSSQRVLLDQECVFQAQSKWKGAGKFILKLKEQV ----------1111-----1111---------3333- >CTP SYNTHASE; SWP:NA; PDB:2C5MA; SMKYILVTGGVISGIGKGIIASSVGTILKSCGLHVTSIKIDPYINIDATKDNNLTTGKIY ----------------------------1111---------------------------- QYVINKERKGDYLGKTVQVVPHITDAIQEWVMRQALIPEPQVCVIELGGTVGDIESMPFI -----------%%%%----------------------------------------3333- EAFRQFQFKVKRENFCNIHVSLVPQPSSTGEQKTKPTQNSVRELRGLGLSPDLVVCRCSN ----------3333------------------------------1111------------ PLDTSVKEKISMFCHVEPEQVICVHDVSSIYRVPLLLEEQGVVDYFLRRLDL --3333---1111---------------1111----33333333-------- >RRAA-LIKE PROTEIN YER010C; SWP:P40011; PDB:2C5QA; SDLQKLQRFSTCDISDGLLNVYNIPTGGYFPNLTAISPPQNSSIVGTAYTVLFAPIDDPR -----1111--------------1111---------------------------1111-- PAVNYIDSVPPNSILVLALEPHLQSQFHPFIKITQAMYGGLMSTRAQYLKSNGTVVFGRI ----1111-2222------3333---------------3333------------------ RDVDEHRTLNHPVFAYGVGSCAPKAVVKAVGTNVQLKILTSDGVTQTIPGDYIAGDNNGI -3333-----------------1111-------------1111------------1111- VRIPVQETDISKLVTYIEKSIEVDLLVSEDIKNGIPAKQAQNDRRSVLKKYI ---1111-----------------------1111---------33333333- >EARLY PROTEIN P16.7; SWP:P16517; PDB:2C5RA; NLSACEVAVLDLYEQSNIRIPSDIIEDLVNQRLQSEQEVLNYIETQRTYWKLENQKKLYR --3333-------1111---3333----------------------------1111---- GSLK ---- >PROBABLE THIAMINE BIOSYNT; SWP:Q81KU0; PDB:2C5SA; YEYILVRYGEMGKNRSKFVSTLKDNVKFKLKKFPNIKIDATHDRMYIQLNGEDHEAVSER ---------------------------1111-3333----1111----iiii-------- LKDVFGIHKFNLAMKVPSELEDIKKGALAAFLQVKGDVKTFKITVHRSYKHFPMRTMELL ------------------------------------------------1111--3333-- PEIGGHILENTEDITVDVHNPDVNVRVEIRSGYSYIMCDERMGAGGLPVGVGGKVMVLLS ------1111---------------------------------------1111------- GGIDSPVAAYLTMKRGVSVEAVHFHSPPFTSERAKQKVIDLAQELTKYCKRVTLHLVPFT ---------------------------------------------1111----------- 
EVQKTINKEIPSSYSMTVMRRMMMRITERIAEERNALAITTGESLGQVASQTLDSMHTIN ----------3333-----------------1111-----------1111--------33 EVTNYPVIRPLITMDKLEIIKIAEEIGTYDISIRPYKPKREKANRFEAKYDFTPLIDEAV 33------1111-----------1111---3333------------1111---------1 ANKETMVLQTVE 111--------- >RNA LIGASE; SWP:P00971; PDB:2C5UA; SQELFNNLELCKDSQRKFFYSDDVSASGRTYRIFSYNYASYSDWLLPDALECRGIFEDGE --------1111-3333-------1111-----------3333--2222--------!!! KPVRIASRPEKFFNLNENPFTNIDLNDVDYILTKEDGSLVSTYLDGDEILFKSKGSIKSE !------------22223333--1111-----------------!!!!----------33 QALANGILNINHHRLRDRLKELAEDGFTANFEFVAPTNRIVLAYQEKIILLNVRENETGE 33------1111----------------------1111---------------------- YISYDDIYKDATLRPYLVERYEIDSPKWIEEAKNAENIEGYVAVKDGSHFKIKSDWYVSL ------------3333-------------------------------------------- HSTKSSLDNPEKLFKTIIDGASDDLKAYADDEYSYRKIEAFETTYLKYLDRALFLVLDCH ----1111-------------------1111----------------------------- NKHCGKDRKTYAEAQGVAKGAGDHLFGIISLYQGYDSQEKVCEIEQNFLKNYKKFIPEGY ------3333------------1111---3333--------------33331111-2222 >Penicillin-binding protei; SWP:Q8DR59; PDB:2C5WB; SNYPAYMDNYLKEVINQVEEETGYNLLTTGMDVYTNVDQEAQKHLWDIYNTDEYVAYPDD ---1111-3333------------------------------------------------ ELQVASTIVDVSNGKVIAQLGARHQSSNVSFGINQAVETNRDWGSTMKPITDYAPALEYG ----------------------------------1111----!!!!-------------- VYESTATIVHDEPYNYPGTNTPVYNWDRGYFGNITLQYALQQSRNVPAVETLNKVGLNRA ---1111--------2222-----1111-----------1111----------------- KTFLNGLGIDYPSIHYSNAISSNTTESDKKYGASSEKMAAAYAAFANGGTYYKPMYIHKV ----1111------3333--------------------------1111------------ VFSDGSEKEFSNVGTRAMKETTAYMMTDMMKTVLTYGTGRNAYLAWLPQAGKTGTSNYTD -3333---------------------------------1111-1111------------- EEIENHIKTSQFVAPDELFAGYTRKYSMAVWTGYSNRLTPLVGNGLTVAAKVYRSMMTYL -----------------------------------3333---3333-------------- SEGSNPEDWNIPEGLYRNGEFVFKN ------------------------- >SET DOMAIN PROTEIN 2; SWP:P46995; PDB:2C5ZA; VSQSQRLEHNWNKFFASFVPNLIKKNPQSKQFDHENIKQCAKDIVKILTTKELKKDSSRA 
-------------------3333--3333------------------------------- PPDDLTKGKRHKVKEFINSYMDKIILKKKQKKA ----------------------------1111- >HUMAN MITOGEN-ACTIVATED P; SWP:Q99759; PDB:2C60A; SDVRIKFEHNGERRIIAFSRPVKYEDVEHKVTTVFGQPLDLHYNNELSILLKNQDDLDKA --------iiii------------------------------------------------ IDILDRSSSKSLRILLLS ------------------ >A-TYPE ATP SYNTHASE NON-C; SWP:Q60187; PDB:2C61A; GPLIFVEKTEPVGYNEIVNIKMGDGTVRRGQVLDSSADIVVVQVFFTGETLKLPASVDLL ------------2222-----1111------------------------------1111- GRILSGSGEPRDGGPRIVPDQLLDINGAAMNPYARLPPKDFIQTGISTIDGTNTLVRGQK ----1111-------------------------------------3333------2222- LPIFSASGLPHNEIALQIARQASVPGSESAFAVVFAAMGITNEEAQYFMSDFEKTGALER -----2222--------------2222-------------------------1111---- AVVFLNLADDPAVERIVTPRMALTAAEYLAYEHGMHVLVILTDITNYAEALRQMGAARNE ------11113333-----------------------------------1111!!!!--- VPGRRGYPGYMYTDLATLYERAGIVKGAKGSVTQIPILSMPGDDITHPIPDLSGYITEGQ 2222---3333------3333---2222--------------3333-----33331111- IVVARELHRKGIYPPINVLPSLSRLMNSGIGAGKTREDHKAVSDQMYAGYAEGRDLRGLV -------1111------1111-1111----2222-1111--------------------- AIVGKEALSERDTKFLEFADLFEDKFVRQGRNENRTIEDTLEIGWQILTHLPENQLGRID ---1111----------------------1111--------------333311111111- NKYIQKYHPAHR -------1111- >14-3-3 PROTEIN ETA; SWP:Q04917; PDB:2C63A; DREQLLQRARLAEQAERYDDMASAMKAVTELNEPLSNEDRNLLSVAYKNVVGARRSSWRV ----------------------------3333---------------------------- ISSIEQKTMADGNEKKLEKVKAYREKIEKELETVCNDVLSLLDKFLIKNCNDFQYESKVF ----------------------------------------------1111---------- YLKMKGDYYRYLAEVASGEKKNSVVEASEAAYKEAFEISKEQMQPTHPIRLGLALNFSVF -----------3333----------------------------1111------------- YYEIQNAPEQACLLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTS ------------------------3333-1111-------------------- >UBIQUITIN-PROTEIN LIGASE ; SWP:Q00987; PDB:2C6AA; SFEEDPEISLADYWKCTSCNEMNPPLPSHCNRCWALRENWLPEDKG ----3333-------------------------------------- >DUFFY RECEPTOR, ALPHA FOR; SWP:P22545; PDB:2C6JA; 
KCNDKRKRGERDWDCPAEKDICISDRRYQLCMKELTNLITFLKLNLKRKLMYDAAVEGDL ---------------1111----3333-----------3333------------------ LLKKNNYQYNKEFCKDIRWGLGDFGDIIMGTNMEGVENNLRSIFGTDEKAKQDRKQWWNE --1111------------------------------11113333---------------- SKEHIWRAMMFSLRSRLKEKFVWICKKDVPQIYRWIREWGRDYMSELPKEQGKLNEKCAS -----------3333---------------1111-------------------------- KLYYNNMAICMLPLCHDACKSYDQWITRKKKQWDVLSTKFSSVKKTNIATAYDILKQELN ----3333--------3333---------------------3333---------1111-- GFKEATFENEINKRDNLYNHLCPCVV ---------1111------------- >ANGIOTENSIN-CONVERTING EN; SWP:P12821; PDB:2C6NA; LDPGLQPGNFSADEAGAQLFAQSYNSSAEQVLFQSVAASWAHDTNITAENARRQEEAALL ------------1111-------------------------3333--------------- SQEFAEAWGQKAKELYEPIWQNFTDPQLRRIIGAVRTLGSANLPLAKRQQYNALLSNMSR ---------------------------11113333----1111----------------- IYSTAKVCLPNKTATCWSLDPDLTNILASSRSYAMLLFAWEGWHNAAGIPLKPLYEDFTA -----------------------------------------------3333--------- LSNEAYKQDGFTDTGAYWRSWYNSPTFEDDLEHLYQQLEPLYLNLHAFVRRALHRRYGDR ---1111----------------1111-------3333---------------------- YINLRGPIPAHLLGDMWAQSWENIYDMVVPFPDKPNLDVTSTMLQQGWNATHMFRVAEEF --------1111--1111-----1111---------------3333-------------- FTSLELSPMPPEFWEGSMLEKPADGREVVCHASAWDFYNRKDFRIKQCTRVTMDQLSTVH ---------------------------------------------------3333----- HEMGHIQYYLQYKDLPVSLRRGANPGFHEAIGDVLALSVSTPEHLHKIGLLDRVTNDTES -----------------------3333--------------------------------- DINYLLKMALEKIAFLPFGYLVDQWRWGVFSGRTPPSRYNFDWWYLRTKYQGICPPVTRN ------------3333------------1111--3333---------------------- ETHFDAGAKFHVPNVTPYIRYFVSFVLQFQFHEALCKEAGYEGPLHQCDIYRSTKAGAKL ---3333-3333-------------------------------1111------------- RKVLQAGSSRPWQEVLKDMVGLDALDAQPLLKYFQPVTQWLQEQNQQNGEVLGWPEYQWH -----!!!!-3333------------3333------------------------------ PPLPDNYPEGID ------------ >GMP REDUCTASE 2; SWP:Q9P2T1; PDB:2C6QA; SLDFKDVLLRPKRSTLKSRSEVDLTRSFSFRNSKQTYSGVPIIAANMDTVGTFEMAKVLC 
--3333-----------3333-------------------------1111---------- KFSLFTAVHKHYSLVQWQEFAGQNPDCLEHLAASSGTGSSDFEQLEQILEAIPQVKYICL -----------------------1111------------------------3333----- DVANGYSEHFVEFVKDVRKRFPQHTIMAGNVVTGEMVEELILSGADIIKVGIGPGSVCTT --------------------1111----------------1111----------1111-3 RKKTGVGYPQLSAVMECADAAHGLKGHIISDGGCSCPGDVAKAFGAGADFVMLGGMLAGH 333------------------1111----------3333----1111------3333--1 SESGGELIERDGKKYKLFYGMSSEMAMKKYAGGVAEYRASEGKTVEVPFKGDVEHTIRDI 111------%%%%------1111-----------1111---------------------- LGGIRSTCTYVGAAKLKELSRRTTFIRV --------------33333333------ >CLEC1B PROTEIN; SWP:Q9P126; PDB:2C6UA; SPCDTNWRYYGDSCYGFFRHNLTWEESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWV ---2222------------------------1111--------------1111------- GLSRQKSNEVWKWEDGSVISENMFEFLEDGKGNMNCAYFHNGKMHPTFCENKHYLMCERK -----2222---1111---33331111---1111-----iiii----1111--------- AG -- >FORKHEAD BOX PROTEIN K2; SWP:Q01167; PDB:2C6YA; DSKPPYSYAQLIVQAITMAPDKQLTLNGIYTHITKNYPYYRTADKGWQNSIRHNLSLNRY ------------------1111--------------33333333-------------111 FIKVPRSQEEPGKGSFWRIDPASESKLIEQAFRKRRPR 1-----1111---------3333------1111----- >NG,NG-DIMETHYLARGININE DI; SWP:P56965; PDB:2C6ZA; TFGRATHVVVRALPESLAQQALRRTKGDEVDFARAERQHQLYVGVLGSKLGLQVVQLPAD 2222---------3333------------------------------------------1 ESLPDCVFVEDVAVVCEETALITRPGAPSRRKEADMMKEALEKLQLNIVEMKDENATLDG 111-33333333---!!!!-------1111-----------1111-------1111---1 GDVLFTGREFFVGLSKRTNQRGAEILADTFKDYAVSTVPVVDALHLKSFCSMAGPNLIAI 111-----------1111-----------1111-----------1111------------ GSSESAQKALKIMQQMSDHRYDKLTVPDDTAANCIYLNIPSKGHVLLHRTPEEYPESAKV -------------1111----------3333------------------3333-3333-- YEKLKDHMLIPVSNSELEKVDGLLTCSSVLINK ---1111------3333-------1111----- >ELONGATION FACTOR TU-A; SWP:Q5SHN6; PDB:2C78A; KPHVNVGTIGHVDHGKTTLTAALTYVAAAENPNVEVKDYGDIDKAPEERARGITINTAHV ----------2222------------33331111---3333------------------- EYETAKRHYSHVDCPGHADYIKNMITGAAQMDGAILVVSAADGPMPQTREHILLARQVGV 
----------------3333-------1111-------3333--1111-------1111- PYIVVFMNKVDMVDDPELLDLVEMEVRDLLNQYEFPGDEVPVIRGSALLALEQMHRNPKT --------3333------------------1111-1111-----------------1111 RRGENEWVDKIWELLDAIDEYIPTPVRDVDKPFLMPVEDVFTITGRGTVATGRIERGKVK 22223333---------------------------------------------------2 VGDEVEIVGLAPETRKTVVTGVEMHRKTLQEGIAGDNVGVLLRGVSREEVERGQVLAKPG 222-----------------------------2222---------1111-2222---222 SITPHTKFEASVYVLKKEEGGRHTGFFSGYRPQFYFRTTDVTGVVQLPPGVEMVMPGDNV 2--------------3333-------2222-----!!!!--------2222---2222-- TFTVELIKPVALEEGLRFAIREGGRTVGAGVVTKILE ------------2222-----iiii------------ >PROGESTERONE RECEPTOR; SWP:P06401; PDB:2C7AA; PQKICLICGDEASGCHYGVLTCGSCKVFFKRAMEGQHNYLCAGRNDCIVDKIRRKNCPAC ---------------iiii------------------------------3333------- RLRKCCQAGMVLGGRKFK -----1111--------- >CARBOXYLESTERASE; SWP:Q5G935; PDB:2C7BA; LSIAASPQELRRQVEEQSRLLTAAVQEPIAETRDVHIPVSGGSIRARVYFPKKAAGLPAV -3333---------------------------------2222------------------ LYYHGGGFVFGSIETHDHICRRLSRLSDSVVVSVDYRLAPEYKFPTAVEDAYAALKWVAD -----iiii--3333---------------------------2222-------------- RADELGVDPDRIAVAGDSAGGNLAAVVSILDRNSGEKLVKKQVLIYPVVNTGVPTASLVE --1111-1111------------------------------------------------- FGVAETTSLPIELVWFGRQYLKRPEEAYDFKASPLLADLGGLPPALVVTAEYDPLRDEGE 1111-----3333-----------3333-----1111------------1111------- LYAYKKASGSRAVAVRFAGVHGFVSFYPFVDAGREALDLAAASIRSGLQPS -----1111----------22221111------------------1111-- >ALPHA-L-ARABINOFURANOSIDA; SWP:Q4CJG5; PDB:2C7FA; KKARMTVDKDYKIAEIDKRIYGSFVEHLGRAVYDGLYQPGNSKSDEDGFRKDVIELVKEL -------1111------1111------!!!!---------11111111------------ NVPIIRYPGGNFVSNYFWEDGVGPVEDRPRRLDLAWKSIEPNQVGINEFAKWCKKVNAEI ---------1111---3333---3333--------------------------1111--- MMAVNLGTRGISDACNLLEYCNHPGGSKYSDMRIKHGVKEPHNIKVWCLGNAMDGPWQVG --------------------------1111------------------------1111-- HKTMDEYGRIAEETARAMKMIDPSIELVACGSSSKDMPTFPQWEATVLDYAYDYVDYISL 
--3333---------------1111--------1111-------------1111------ HQYYGNKENDTADFLAKSDDLDDFIRSVIATCDYIKAKKRSKKDIYLSFDEWNVWYHSNN -----3333----------------------------------------------1111- EDANIMQNEPWRIAPPLLEDIYTFEDALLVGLMLITLMKHADRIKIACLAQLINVIAPIV ---------------------------------------3333----------------- TERNGGAAWRQTIFYPFMHASKYGRGIVLQPVINSPLHDTSKHEDVTDIESVAIYNEEKE ----------1111---------------------------------------------- EVTIFAVNRNIHEDIVLVSDVRGMRLLEHIVLEHQDLKIRNSVNGEEVYPKNSDKFDDGI -----------------------------------1111--------------------- LTSMLRRASWNVIRIG ---------------- >RETINOBLASTOMA-BINDING PR; SWP:Q7Z6E9; PDB:2C7HA; GPLGSMSCVHYKFSSKLNYDTVTFDGLHISLCDLKKQIMGREKLKAADCDLQITNAQTKE ------------1111----------------------------1111------------ EYTDDNALIPKNSSVIVRRIPIGGVK ---1111--1111------------- >Phycoerythrocyanin beta c; SWP:P00313; PDB:2C7LB; MLDAFSRVVEQADKKGAYLSNDEINALQAIVADSNKRLDVVNRLTSNASSIVANAYRALV --3333------1111-------------------------------------------- AERPQVFNPGGPCFHHRNQAACIRDLGFILRYVTYSVLAGDTSVMDDRCLNGLRETYQAL --3333-2222------------------------------3333--------------- GTPGDAVASGIKKMKEAALKIANDPNGITKGDCSQLMSELASYFDRAAAAVA ---1111------------3333---------3333-----------1111- >RAB GUANINE NUCLEOTIDE EX; SWP:Q9UJ41; PDB:2C7NA; LCKKGCGYYGNPAWQGFCSKCWREEYHKARQKQIQEDWELAERLQREEEEAFASSQ -3333-----3333------------------------------------------ >MODIFICATION METHYLASE HH; SWP:P05102; PDB:2C7PA; MIEIKDKQLTGLRFIDLFAGLGGFRLALESCGAECVYSNEWDKYAQEVYEMNFGEKPEGD ------1111--------!!!!------1111---------------------------1 ITQVNEKTIPDHDILCAGFPCQAFSISGKQKGFEDSRGTLFFDIARIVREKKPKVVFMEN 111-3333------------11111111--!!!!11113333------------------ VKNFASHDNGNTLEVVKNTMNELDYSFHAKVLNALDYGIPQKRERIYMICFRNDLNIQNF 3333--%%%%-----------------------1111--------------3333----- QFPKPFELNTFVKDLLLPDSEVEHLVIDRKDLVMTNQEIEQTTPKTVRLGIVGKGGQGER ----------3333---33333333---1111-------------------!!!!2222- IYSTRGIAITLSAYGGGIFAKTGGYLVNGKTRKLHPRECARVMGYPDSYKVHPSTSQAYK 
--1111--------------------iiii----------1111-1111----------- QFGNSVVINVLQYIAYNIGSSLNFKPY -1111---------------------- >RECEPTOR-TYPE TYROSINE-PR; SWP:Q15262; PDB:2C7SA; MPAIRVADLLQHINLMKTSDSYGFKEEYESFFEGQSASWDVAKKDQNRAKNRYGNIIAYD ----3333-------------------1111-------3333-33331111-1111--33 HSRVILQPVDPSSDYINANYIDGYQRPSHYIATQGPVHETVYDFWRMIWQEQSACIVMVT 33-------1111-----------------------1111-------------------- NLVEVGRVKCYKYWPDDTEVYGDFKVTCVEMEPLAEYVVRTFTLERRGYNEIREVKQFHF --------------------!!!!---------1111--------2222----------- TGWPDHGVPYHATGLLSFIRRVKLSNPPSAGPIVVHCSAGAGRTGCYIVIDIMLDMAERE --------------------------1111------------------------------ GVVDIYNCVKALRSRRINMVQTEEQYIFIHDAILEACLCGETAIPVCEF -----------3333----------------------------3333-- >PTERIDINE REDUCTASE; SWP:NA; PDB:2C7VA; EAPAAVVTGAAKRIGRAIAVKLHQTGYRVVIHYHNSAEAAVSLADELNKERSNTAVVQAD ----------------------3333---------------------------------- LTNSNVLPASCEEIINSCFRAFGRCDVLVNNASAFYPTPLVQGKTVETQVAELIGTNAIA ---1111----------------------------------------------------- PFLLTMSFAQRQKGSSNLSIVNLCDAMVDQPMAFSLYNMGKHALVGLTQSAALELAPYGI ------------------------1111--------------------------3333-- RVNGVAPGVSLLPVAMGEEEKDKWRRKVPLGRREASAEQIADAVIFLVSGSAQYITGSII ------------1111-------3333--------3333---------1111-------- KVDGGLSLVHA --iiii----- >VASCULAR ENDOTHELIAL GROW; SWP:Q8TEV2; PDB:2C7WA; KVVSWIDVYTRATCQPREVVVPLTVELMGTVAKQLVPSCVTVQRCGGCCPDDGLECVPTG ------3333----------------2222------------------------------ QHQVRMQILMIRYPSSQLGEMSLEEHSQCECRPKKK ------------------------------------ >3-KETOACYL-COA THIOLASE 2; SWP:Q56WD9; PDB:2C7YA; DSAAYQRTSLYGDDVVIVAAHRTPLCKSKRGNFKDTYPDDLLAPVLRALIEKTNLNPSEV ------------------------------1111--3333-3333----------1111- GDIVVGTVLAPGSQRASECRMAAFYAGFPETVAVRTVNRQCSSGLQAVADVAAAIKAGFY ----------!!!!---------1111-3333-----------3333-------1111-- DIGIGAGLESMTTNPKFAQAQNCLLPMGVTSENVAQRFGVSRQEQDQAAVDSHRKAAAAT ------------------------------------------------------------ 
AAGKFKDEIIPVKTKLVDPKTGDEKPITVSVDDGIRPTTTLASLGKLKPVFKKDGTTTAG --1111-------------------------11111111----1111----------111 NSSQVSDGAGAVLLMKRSVAMQKGLPVLGVFRTFAAVGVDPAIMGIGPAVAIPAAVKAAG 1--------------------------------------3333----------------- LELDDIDLFEINEAFASQFVYCRNKLGLDPEKINVNGGAMAIGHPLGATGARCVATLLHE -3333------------------1111-1111-11113333----1111----------- MKRRGKDCRFGVVSMCIGTGMGAAAVFERGD ----3333---------1111---------- >GLUTAMINE-2-DEOXY-SCYLLO-; SWP:Q8G8Y2; PDB:2C81A; WPEWPQHSDRTRRKIEEVFQSNRWAISGYWTGEESMERKFAKAFADFNGVPYCVPTTSGS ------------------------1111-------------------------------- TALMLALEALGIGEGDEVIVPSLTWIATATAVLNVNALPVFVDVEADTYCIDPQLIKSAI -------1111-2222--------3333----1111---------------11113333- TDKTKAIIPVHLFGSMANMDEINEIAQEHNLFVIEDCAQSHGSVWNNQRAGTIGDIGAFS 1111--------------------------------1111----%%%%2222-------- CQQGKVLTAGEGGIIVTKNPRLFELIQQLRADSRVYCDDSSELMHGDMQLVKKGDIQGSN -3333-------------------------%%%%----3333-2222------------- YCLSEFQSAILLDQLQELDDKNAIREKNAMFLNDALSKIDGIKVMKRPPQVSRQTYYGYV ----------------------------------33332222-----3333--------- FRFDPVKFGGLNADQFCEILREKLNMGTFYLHPPYLPVHKNPLFCPWTKNRYLKSVRKTE ---3333iiii---------------3333------33333333333333333333--33 AYWRGLHYPVSERASGQSIVIHHAILLAEPSHLSLLVDAVAELARKFCV 331111------3333------3333--3333-----------3333-- >1-DEOXY-D-XYLULOSE 5-PHOS; SWP:P64012; PDB:2C82A; GRLRVVVLGSTGSIGTQALQVIADNPDRFEVVGLAAGGAHLDTLLRQRAQTGVTNIAVAD --------1111------------3333-------------------------------- EHAAQRVGDIPYHGSDAATRLVEQTEADVVLNALVGALGLRPTLAALKTGARLALANKES ----3333---------------------------3333-------3333------3333 LVAGGSLVLRAARPGQIVPVDSEHSALAQCLRGGTPDEVAKLVLTASGGPFRGWSAADLE --------33332222------------------3333----------1111--3333-- HVTPEQAGAHPTWSMGPMNTLNSASLVNKGLEVIETHLLFGIPYDRIDVVVHPQSIIHSM --1111------------------------------------1111-----3333----- VTFIDGSTIAQASPPDMKLPISLALGWPRRVSGAAAACDFHTASSWEFEPLDTDVFPAVE --1111------------------------2222-----------------3333----- 
LARQAGVAGGCMTAVYNAANEEAAAAFLAGRIGFPAIVGIIADVLHAADQWAVEPATVDD --------!!!!--------------1111--3333-----------3333-----3333 VLDAQRWARERAQRAVSGM ------------------- >Mono-ADP-ribosyltransfera; SWP:P15879; PDB:2C8EE; TYQEFTNIDQAKAWGNAQYKKYGLSKSEKEAIVSYTKSASEINGKLRQNKGVINGFPSNL ------------------1111----------------3333------iiii1111---- IKQVELLDKSFNKMKTPENIMLFRGDDPAYLGTEFQNTLLNSNGTINKTAFEKAKAKFLN --------------------------3333-1111-----1111-------------222 KDRLEYGYISTSLMNVSQFAGRPIITKFKVAKGSKAGYIDPISAFAGQLNMLLPRHSTYH 2--------------3333-----------2222----3333--2222------------ IDDMRLSSDGKQIIITATMM ------1111---------- >LIPOATE-PROTEIN LIGASE A; SWP:Q9HKT1; PDB:2C8MA; MEGRLLLLETPGNTRMSLAYDEAIYRSFQYGDKPILRFYRHDRSVIIGYFQVAEEEVDLD ---------1111---------------2222---------------11111111----- YMKKNGIMLARRYTGGGAVYHDLGDLNFSVVRSSDDMDITSMFRTMNEAVVNSLRILGLD --1111---------------1111-----------------------------1111-- ARPGELNDVSIPVNKKTDIMAGEKKIMGAAGAMRKGAKLWHAAMLVHTDLDMLSAVLKER -------1111---1111--!!!!---------2222----------------------- VANVTDFVDVSIDEVRNALIRGFSETLHIDFREDTITEKEESLARELFDKKYSTEEWNMG --1111--------------------------------------------11113333-- L - >CYTOCHROME C-L; SWP:P14774; PDB:2C8SA; SQGKEGGRDTPAVKKFLETGENLYIDDKSCLRNGESLFATSCSGCHGHLAEGKLGPGLND ---------------------1111---------------------1111---------- NYWTYPSNTTDVGLFATIFGGANGMMGPHNENLTPDEMLQTIAWIRHLYTGPKQDAVWLN ----3333-------------------------------------------333311113 DEQKKAYTPYKQGEVIPKDAKGQCKPLDE 3331111---2222--1111--------- >AFLATOXIN B1 ALDEHYDE RED; SWP:Q8CG76; PDB:2C91A; PLRPATVLGTMEMGRRMDASASAASVRAFLERGHSELDTAFMYCDGQSENILGGLGLGLG ---------1111----------------1111------3333iiii----1111--222 SGDCTVKIATKANPWEGKSLKPDSIRSQLETSLKRLQCPRVDLFYLHAPDHSTPVEETLC 2-------------iiii-------------------------------11113333--- ACHQLHQEGKFVELGLSNYASWEVAEICTLCKSNGWILPTVYQGMYNATTRQVEAELLPC ----------------------------------------------11113333------ LRHFGLRFYAYNPLAGGLLTGKYKYEDKDGKQPVGRFFGNNWAETYRNRFWKEHHFEAIA 
-----------1111-1111---1111-------1111-1111--------3333----- LVEKALQTTYGTNAPRMTSAALRWMYHHSQLQGTRGDAVILGMSSLEQLEQNLAATEEGP ---------!!!!------------------3333-------------------1111-- LEPAVVEAFDQAWNMVAHECPNYFR ---------------3333------ >6,7-DIMETHYL-8-RIBITYLLUM; SWP:P66034; PDB:2C92A; DASGVRLAIVASSWHGKICDALLDGARKVAAGCGLDDPTVVRVLGAIEIPVVAQELARNH -1111-------------------------1111----------3333------------ DAVVALGVVIRGQTPHFDYVCDAVTQGLTRVSLDSSTPIANGVLTTNTEEQALDRAGLPT --------------3333----------------------------------1111-111 SAEDKGAQATVAALATALTLRELRAHS 1-------------------------- >ADENYLATE KINASE 1; SWP:P00568; PDB:2C95A; SMEEKLKKTNIIFVVGGPGSGKGTQCEKIVQKYGYTHLSTGDLLRSEVSSGSARGKKLSE -----1111-------2222---------------------------------------- IMEKGQLVPLETVLDMLRDAMVAKVNTSKGFLIDGYPREVQQGEEFERRIGQPTLLLYVD -1111---------------------------------3333------------------ AGPETMTQRLLKRGETSGRVDDNEETIKKRLETYYKATEPVIAFYEKRGIVRKVNAEGSV ------------------1111-------------------------------------- DSVFSQVCTHLDALL --------------- >RECEPTOR-TYPE TYROSINE-PR; SWP:P28827; PDB:2C9AA; ETFSGGCLFDEPYSTCGYSQSEGDDFNWEQVNTLTKPTSDPWPSGSFLVNASGRPEGQRA -------------1111--------------3333---------------2222------ HLLLPQLKENDTHCIDFHYFVSSKSNSPPGLLNVYVKVNNGPLGNPIWNISGDPTRTWNR -------------------------------------%%%%------------------- AELAISTFWPNFYQVIFEVITSGHQGYLAIDEVKVLGHPCTRTPHFLRIQNVEVNAGQFA ------------------------------------------------------------ TFQCSAIGRTVAGDRLWLQGIDVRDAPLKEIKVTSSRRFIASFNVVNTTKRDAGKYRCIR -------------------2222------------------------------------- TEGGVGISNYAELVVK ---------------- >PERIDININ-CHLOROPHYLL A P; SWP:O76183; PDB:2C9EA; DAIADASKRFSDATYPIAEKFDWGGSSAIAKYIADASAGNPRQAALAVEKLLEVGLTMDP ---------------3333--1111---------1111----------------1111-- KLVRAAVEAHSKALDSAKKNAKLMASKEDFAAVNEALARMIASADKQKFAALRTAFPESR ---------------11111111-----------------1111------3333------ ELQGKLFAGNNAFEAEKAYDSFKALTSAVRDASINGAKAPVIAEDGPVGRAAKKFSEATY 
------1111-----------------------iiii---------------------33 PIMDKLDWGKSPEISKYIETASAKNPKMMADGIDKTLEVALTMNQNAINDAVFAHVRAIK 3311111111----------3333---------------1111----------------- GALNTPGLVAERDDFARVNLALAKMIATADPAKFKALLTAFPGNADLQMALFAANNPEQA -1111%%%%---------------3333------3333-------------1111----- KAAYETFVALTSAVASS ----------------- >GREEN FLUORESCENT PROTEIN; SWP:Q9GPI6; PDB:2C9IA; MYPSIKETMRVQLSMEGSVNYHAFKCTGKGEGKPYEGTQSLNITITEGGPLPFAFDILSH -2222----------------------------1111--------------------333 AFIKVFAKYPKEIPDFFKQSLPGGFSWERVSTYEDGGVLSATQETSLQGDCIICKVKVLG 3--------1111-------3333--------1111-----------!!!!--------- TNFPANGPVMQKKTCGWEPSTETVIPRDGGLLLRDTPALMLADGGHLSCFMETTYKSKKE ---1111---------------------------------1111---------------- VKLPELHFHHLRMEKLNISDDWKTVEQHESVVASYSQVPSKLGHN ------------------1111----------------------- >GREEN FLUORESCENT PROTEIN; SWP:Q5ZQQ5; PDB:2C9JA; NLSVSVYMKGNVNNHEFEYDGIGGGDPNSGQFSLKTKLRGGKPLPFSYDIITMGFFRAFT -----------iiii----------3333---------------------1111-3333- KYPEGIADYFKGSFPEAFQWNRRIEFEDGGVINMSSDITYKDKVLHGDVWALGVNFPPNG --1111--3333-------------1111---------------------------1111 PVMKNEIVMEEPAEETLTAKNGVLVGFCPKAYLLKDGSYYYGHMTTFYRSKKSGQPLPGF -------------------iiii----------1111--------------1111----- HFIKHRLVKTKVEPGFKMVEQAEYATAHVCDLP ------------2222----------------- >PESTICIDAL CRYSTAL PROTEI; SWP:P16480; PDB:2C9KA; GELSAYTIVVGTVLTGFGFTTPLGLALIGFGTLIPVLFPAQDQSNTWSDFITQTKNIIKK --------------------1111---3333--------!!!!----------3333--- EIASTYISNANKILNRSFNVISTYHNHLKTWENNPNPQNTQDVRTQIQLVHYHFQNVIPE -----------------------------------------3333--------------- LVNSCPPNPSDCDYYNILVLSSYAQAANLHLTVLNQAVKFEAYLKNNTAIDYYPVLTKAI --------3333----------------------------3333-------3333----- EDYTNYCVTTYKKGLNLIKTTPDSNLDGNINWNTYNTYRTKMTTAVLDLVALFPNYDVGK -----------------1111-3333---------------------------------- YPIGVQSELTREIYQVLNFEESPYKYYDFQYQEDSLTRRPHLFTWLDSLNFYEKAQTTPN -----------------3333-3333---------------------------------- 
NFFTSHYNMFHYTLDNISQKSSVFGNHNVTDKLKSLGLATNIYIFLLNVISLDNKYLNDY -----------2222-----------------------2222-----------%%%%--- NNISKMDFFITNGTRLLEKELTAGSGQITYDVNKNIFGLPILKRREETLFPTYDNYSHIL ---------------------------------------------------1111----- SFIKSLSIPATYKTQVYTFAWTHSSVDPKNTIYTHLTTQIPAVKANSLGTASKVVQGPGH -------3333------------------------------1111---3333-------- TGGDLIDFKDHFKITCQHSNFQQSYFIRIRYASNGSANTRAVINLSIPGVAELGMALNPT -----------------------------------1111-------2222---------- FSGTDYTNLKYKDFQYLEFSNEVKFAPNQNISLVFNRSDVYTNTTVLIDKIEFLPITR ---------3333------------2222-----------1111-------------- >Trans-activator protein B; SWP:P03206; PDB:2C9LY; MLEIKRYKNRVAARKSRAKFKQLLQHYREVAAAKSSENDRLRLLLKQMCPSLDVDSIIPR ------------------------------------------------11113333---- TPD --- >RUVB-LIKE 1; SWP:Q9Y265; PDB:2C9OA; TTKTQRIASHSHVKGLGLDESGLAKQAASGLVGQENAREACGVIVELIKSKKMAGRAVLL ------1111--------1111-----iiii---------------------2222---- AGPPGTGKTALALAIAQELGSKVPFCPMVGSEVYSTEIKKTEVLMENFRRAIGLRIKETK -------------------3333-----3333---------------------------- EVYEGEVTELTPCHVIIGLKTAKGTKQLKLDPSIFESLQKERVEAGDVIYIEANSGAVKR --------------------1111-------------------2222------------- QGRCDTYATEFDLEAEEYVPLPKGDVHKKKEIIQDVTLHDLDVANGEINKVVNKYIDQGI ---3333-3333------------------------3333---------------1111- AELVPGVLFVDEVHMLDIECFTYLHRALESSIAPIVIFASNRGNCVIRGTEDITSPHGIP -----------3333--------------1111-------------2222----2222-3 LDLLDRVMIIRTMLYTPQEMKQIIKIRAQTEGINISEEALNHLGEIGTKTTLRYSVQLLT 3331111----------------------------------------------------- PANLLAKINGKDSIEKEHVEEISELFYDAKSSAKILAD ------1111----3333-------------------- >COPPER RESISTANCE PROTEIN; SWP:P12376; PDB:2C9QA; HPKLVSSTPAEGSEGAAPAKIELHFSENLVTQFSGAKLVMTAMPGMEHSPMAVKAAVSGG ---------2222----------------3333---------2222-------------- GDPKTMVITPASPLTAGTYKVDWRAVSSDTHPITGSVTFKVK -1111---------------------1111------------ >SUPPRESSOR OF CYTOKINE SI; SWP:O14508; PDB:2C9WA; 
QAARLAKALRELGQTGWYWGSMTVNEAKEKLKEAPEGTFLIRDSSHSDYLLTISVKTSAG -----------3333---!!!!--------11112222-------1111-------1111 PTNLRIEYQDGKFRLDSIICVKSKLKQFDSVVHLIDYYVQMCKHLYLTKPLYTSAPSLQH --------iiii--------3333---------------1111----------------- LCRLTINKCTGAIWGLPLPTRLKDYLEEYKFQV -----------3333----------3333---- >ADENYLATE KINASE ISOENZYM; SWP:P54819; PDB:2C9YA; GIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSELGKKLKATMDAGKLV ------------3333-----------------------3333----------------- SDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKEKLDSVIEFSIPDSL --------------3333------------------------------------------ LIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNEKALKIRLQAYHTQT -----------1111--------------------------------------------- TPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATC -------1111-----3333------------------ >NUCLEOCAPSID PROTEIN; SWP:Q4ZJS4; PDB:2CA1A; HMKADEMAHRRYCKRTIPPNYRVDQVFGPRTKGKEGNFGDDKMNEEGIKDGRVTAMLNLV --3333----3333-------3333--------------------!!!!-------1111 PSSHACLFGSRVTPKLQLDGLHLRFEFTTVVPCDDPQFDNYVKICDQCVDG ----------------1111-----------3333---------------- >MXIH; SWP:P0A223; PDB:2CA5A; DDGTQTLQGELTLALDKLAKNPSNPQLLAEYQSKLSEYTLYRNAQSNTVKVIKDVDAAIL --------------------1111------------------------------------ EH -- >RAN GTPASE-ACTIVATING PRO; SWP:P41391; PDB:2CA6A; ARFSIEGKSLKLDAITTEDEKSVFAVLLEDDSVKEIVLSGNTIGTEAARWLSENIASKKD ----2222--------3333----3333--------------------------1111-- LEIAEFSDIFTGRVKDEIPEALRLLLQALLKCPKLHTVRLSDNAFGPTAQEPLIDFLSKH -------------1111--------------1111----------3333----------1 TPLEHLYLHNNGLGPQAGAKIARALQELAVNKKAKNAPPLRSIICGRNRLENGSMKEWAK 111-----------------------------------------------3333------ TFQSHRLLHTVKMVQNGIRPEGIEHLLLEGLAYCQELKVLDLQDNTFTHLGSSALAIALK ------------------3333-------33331111--------------------333 SWPNLRELGLNDCLLSARGAAAVVDAFSKLENIGLQTLRLQYNEIELDAVRTLKTVIDEK 31111---------------------1111---------------3333----------- MPDLLFLELNGNRFSEEDDVVDEIREVFSTRGRGELDELDDMEE 1111----2222--1111-------------------------- >PUTATIVE 
NICKEL-RESPONSIV; SWP:O25896; PDB:2CA9A; DSIIRFSVSLQQNLLDELDNRIIKNGYSSRSELVRDMIREKLVNPNDESKIAVLVVIYDH ---------------------------------------3333---------------11 HQRELNQRMIDIQHASGTHVLCTTHIHMDEHNCLETIILQGNSFEIQRLQLEIGGLRGVK 11--------------------------1111-----------------------2222- FAKLTKASSFEY -------3333- >RUSTICYANIN; SWP:P24930; PDB:2CAKA; ALDTTWKEATLPQVKAMLQKDTGKVSGDTVTYSGKTVHVVAAAVLPGFPFPSFEVHGKKN -----------------1111-----------------------2222------%%%%-- PTLDIPGGATVDVTFINTNKGFGHSFDITQKVPPYAVMPVIDPIVAGTGFSPVPKDGKFG -----2222---------2222--------------------------------%%%%-- YTNFTWHPTAGTYYYVCQIPGHAATGMFGKIIVK ------------------22221111-------- >RUSTICYANIN; SWP:P24930; PDB:2CALA; LDTTWKEATLPQVKAMLEKDTGKVSGDTVTYSGKTVHVVAAAVLPGFPFPSFEVHDKKNP ---------------1111-----!!!!---------------2222------%%%%--- TLEIPAGATVDVTFINTNKGFGHSFDITKKGPPYAVMPVIDPIVAGTGFSPVPKDGKFGY ----2222---------2222--------------------------------iiii--- TDFTWHPTAGTYYYVCQIPGMAATGMFGKIVVK -----------------22221111-------- >HUMAN PHOSPHATE BINDING P; SWP:NA; PDB:2CAPA; DIDGGGATLPEKLYLTPDVLTAGFAPYIGVGSGKGKIAFLENKYNQFGTDTTKNVHWAGS -------1111----2222-2222--------------1111-3333------------- DSKLTATQLATYAADKEPGWGKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGRI ----------------3333---------------------------------------- ADWSGITGAGRSGPIQVVYRAESSGTTELFTRFLNAKCTTQPGTFAVTTVFANSYSLGLT -----2222----------------------------------------3333-111133 PLAGAVAAIGSDGVMAALNDTTVAEGRITYISPDFAAPTLAGLDDATKVARTGKGVVSGV 33-----------------------------3333---3333--------------iiii AVEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVFGATTGGGVVAYPDSGYPILGFTD -------3333-3333------3333--3333---------iiii--------------- LIFSECYANATQTGQVRDFFTKHYGTSANDNAAIEANAFVPLPSNWKAAVRASFLTASNA ---------------------1111---------1111-----------------3333- LSIGNTNVCNGKGRPE -22223333------- >CANINE PARVOVIRUS EMPTY C; SWP:Q11213; PDB:2CAS; GVGISTGTFNNQTEFKFLENGWVEITANSSRLVHLNMPESENYRRVVVNNMDKTAVNGNM 1111---------------------------------------------------22221 
ALDDIHAQIVTPWSLVDANAWGVWFNPGDWQLIVNTMSELHLVSFEQEIFNVVLKTVSES 111----------------3333------------------------------------- ATQPPTKVYNNDLTASLMVALDSNNTMPFTPAAMRSETLGFYPWKPTIPTPWRYYFQWDR -----------1111------1111----------------1111--------------- TLIPSHTGTSGTPTNIYHGTDPDDVQFYTIENSVPVHLLRTGDEFATGTFFFDCKPCRLT --------------------1111-----1111------3333----------------- HTWQTNRALGLPPFLNSLPQSEGATNFGDIGVQQDKRRGVTQMGNTNYITEATIMRPAEV ----3333------------------------1111------1111---3333------- GYSAPYYSFEASTQGPFKTPIAAGRGGAQTDENQAADGNPRYAFGRQHGQKTTTTGETPE ------------------------------3333----------3333--1111------ RFTYIAHQDTGRYPEGDWIQNINFNLPVTNDNVLLPTDPIGGKTGINYTNIFNTYGPLTA ------------3333----3333----------1111-iiii---3333-----1111- LNNVPPVYPNGQIWDKEFDTDLKPRLHVNAPFVCQNNCPGQLFVKVAPNLTNEYDPDASA ------------------------------------------------------1111-- NMSRIVTYSDFWWKGKLVFKAKLRASHTWNPIQQMSINVDNQFNYVPSNIGGMKIVYEKS ----------------------------------------1111---1111--------- QLAPRKLY -------- >VACUOLAR PROTEIN SORTING ; SWP:NA; PDB:2CAYA; HYWHYVETTSSGQPLLREGEKDIFIDQSVGLYHGKSKILQRQRGRIFLTSQRIIYIDDAK 1-------1111----2222------------!!!!-2222------------------1 PTQNSLGLELDDLAYVNYSSGFLTRSPRLILFFKDPSSSTEFVQLSFRKSDGVLFSQATE 111-----3333------------------------------------------------ RALENILTE --------- >GLUCOSAMINE-FRUCTOSE-6-PH; SWP:Q8U3U3; PDB:2CB0A; KTLTEIKQTPKGIIKADESFNQVKDKIRLPRRILYLGCGSSHFLAKLLAVTNHGGTGVAL ----------------------1111------------3333------------------ PCSEFLYSKEAYPIGKPELVVGISRSGETTEVLLALEKINTPKLGISAYESSLTRACDYS -------3333------------3333-------3333---------------1111--- LVVPTIEESVVTHSFTAFYFAYLQLLRHSYGLPLLEATEVAKATEKALEYENYIKEIVED ---------------------------1111----------------------------- FDFQNVIFLGSGLLYPVALEASLKKEAIFWSEAYPTFEVRHGFKAIADENTLVVLAQELF ----------!!!!-----------------------------11111111--------- EWHKKLVNEFKGQRARVLLISNSQQEFGQDYSIEVPRLSKDATPIPYLPVVQLLSYYKAV ----------1111------------------------3333-3333------------1 ARGLNPDNPRFLD 111-33332222- 
>O-ACETYL HOMOSERINE SULFH; SWP:Q76K51; PDB:2CB1A; MEYTTLAVLAGLPEDPHGAVGLPIYAVAAYGFKTLEEGQERFATGEGYVYARQKDPTAKA -3333---2222--1111-------------------------------3333------- LEERLKALEGALEAVVLASGQAATFAALLALLRPGDEVVAAKGLFGQTIGLFGQVLSLMG --------------------------------2222--------3333------3333-- VTVRYVDPEPEAVREALSAKTRAVFVETVANPALLVPDLEALATLAEEAGVALVVDNTFG -------------11111111-----------------------------------3333 AAGALCRPLAWGAHVVVESLTWASGHGSVLGGAVLSRETELWRNYPQFLQPWEALRARCF iiii--3333-----------3333--------------3333-3333--3333!!!!-- PERVRTLGLSLCGMALSPFNAYLLFQGLETVALRVARMSETARFLAERLQGHPKVKALRY ------------------------------------------------1111-------1 PGLPEDPAHRNARKYLASGGPILTLDLGDLERASRFLGAIRLLKAANLGDARTLLVHPWT 111--11113333-------------------------------------------3333 TTHSRLKEEARLQAGVTPGLVRVSVGLEDPLDLLALFEEALEAV 1111-------1111-1111--------------------1111 >SULFUR OXYGENASE REDUCTAS; SWP:P29082; PDB:2CB2A; PKPYVAINMAELKNEPKTFEMFASVGPKVMVTARHPGFVGFQNHIQIGILPFGNRYGGAK --------------3333-------33333333-1111-----------------1111- MDMTKESSTVRVLQYTFWKDWKDHEEMHRQNWSYLFRLCYSCASQMIWGPWEPIYEIIYA -------------------3333------------------3333--------------- NMPINTEMTDFTAVVGKKFAEGKPLDIPVISQPYGKRVVAFAEHSVIPGKEKQFEDAIVR ------3333--------11113333------iiii----------2222---------- TLEMLKKAPGFLGAMVLKEIGVSGIGSMQFGAKGFHQVLENPGSLEPDPNNVMYSVPEAK -------2222------------3333--------------------1111---3333-- NTPQQYIVHVEWANTDALMFGMGRVLLYPELRQVHDEVLDTLVYGPYIRILNPMMEGTFW ---------------------3333-------------1111--------------3333 REYLNE ------ >PEPTIDOGLYCAN-RECOGNITION; SWP:Q9VXN9; PDB:2CB3A; LSAIIPRSSWLAQKPMDEPLPLQLPVKYVVILHTATESSEKRAINVRLIRDMQSFHIESR 1111-3333--------------------------------------------------- GWNDIAYNFLVGCDGNIYEGRGWKTVGAHTLGYNRISLGISFIGCFMKELPTADALNMCR -----------------------------2222--------------------------- NLLARGVEDGHISTDYRLICHCQCNSTESPGRRLYEEIQTWPHFYNIEEEE -------------------3333----------------------3333-- >MOSQUITOCIDAL TOXIN; SWP:Q03988; PDB:2CB4A; 
NSPKDNTWIQAASLTWLDSSLLYQLISTRIPSFASPNGLHREQTIDSNTGQIQIDNEHRL -----3333----3333--------3333-33331111----------------3333-- LRWDRRPPNDIFLNGFIPRVTNQNLSPVEDTHLLNYLRTNSPSIFVSTTRARYNNLGLEI ------3333-----------------3333----------------------1111--- TPWTPHSANNNIIYRYEIFAPGGIDINASFSRNHNPFPNEDQITFPGGIRPEFIRSTYEY ----1111----------------3333--1111---1111---2222--1111------ HNGEIVRIWINPNFINPSTLNDVSGPSNISKVFWHENHSEGNNDSYNQDFDFAPNGEIPN iiii------1111-33331111--------------1111-----1111-3333----- NNLLNNNSLNVIQ ------------- >BLEOMYCIN HYDROLASE; SWP:Q13867; PDB:2CB5A; SSSGLNSEKVAALIQKLNSDPQFVLAQNVGTTHDLLDICLKRATVQRAQHVFQHAVPQEG ----------------------------1111-3333----------------------- KPITNQKSSGRSWIFSCLNVMRLPFMKKLNIEEFEFSQSYLFFWDKVERCYFFLSAFVDT ----------3333---------------------------------------------- AQRKEPEDGRLVQFLLMNPANDGGQWDMLVNIVEKYGVIPKKCFPESYTTEATRRMNDIL 1111-1111----11111111---3333-----------3333---3333---------- NHKMREFCIRLRNLVHSGATKGEISATQDVMMEEIFRVVCICLGNPPETFTWEYRDKDKN -------------------------------------------------------1111- YEKIGPITPLEFYREHVKPLFNMEDKICLVNDPRPQHKHNKLYTVEYLSNMVGGRKTLYN ----------------3333-3333--------3333-------2222--2222------ NQPIDFLKKMVAASIKDGEAVWFGCDVGKHFNSKLGLSDMNLYDHELVFGVSLKNMNKAE --3333--------1111-------1111---1111--1111--3333------------ RLTFGESLMTHAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCMTDEWFSEYVYE -1111--------------------------------1111-iiii-------------- VVVDRKHVPEEVLAVLEQEPIILPAWDPMGALA ---3333-33333333-------1111------ >ACYL-COA-BINDING PROTEIN; SWP:P07108; PDB:2CB8A; SQAEFEKAAEEVRHLKTKPSDEEMLFIYGHYKQATVGDINTERPGMLDFTGKAKWDAWNE ---------3333-------------------------------1111---------333 LKGTSKEDAMKAYINKVEELKKKYGI 3------------------------- >FENGYCIN SYNTHETASE; SWP:Q45563; PDB:2CB9A; SAAGEQHVIQLNQQGGKNLFCFPPISGFGIYFKDLALQLNHKAAVYGFHFIEEDSRIEQY -----------------------3333---------1111------------1111---- VSRITEIQPEGPYVLLGYSAGGNLAFEVVQAMEQKGLEVSDFIIVDAYKKDQSITADAYL --------------------------------1111------------------------ 
PEAVRETVMQKKRCYQEYWAQLINEGRIKSNIHFIEAGIQTETSGAMVLQKWQDAAEEGY -------------------------------------------3333----1111----- AEYTGYGAHKDMLEGEFAEKNANIILNILDKI -------3333---3333----------1111 >HYALURONIDASE; SWP:Q8XL08; PDB:2CBIA; VLVPNLNPTPENLEVVGDGFKITSSINLVGEEEADENAVNALREFLTANNIEINSENDPN -----------------------------3333-------------1111---------- STTLIIGEVDDDIPELDEALNGTTAENLKEEGYALVSNDGKIAIEGKDGDGTFYGVQTFK ------------3333--------11112222-----iiii------------------3 QLVKESNIPEVNITDYPTVSARGIVEGFYGTPWTHQDRLDQIKFYGENKLNTYIYAPKDD 333iiii--------------------------------------1111-------1111 PYHREKWREPYPESEMQRMQELINASAENKVDFVFGISPGIDIRFDGDAGEEDFNHLITK 1111-3333--3333-----------1111--------3333------------------ AESLYDMGVRSFAIYWDDIQDKSAAKHAQVLNRFNEEFVKAKGDVKPLITVPTEYDTGAM ----1111-----------------------------------------------3333- VSNGQPRAYTRIFAETVDPSIEVMWTGPGVVTNEIPLSDAQLISGIYNRNMAVWWNYPVT -%%%%------------1111--------------------------------------1 DYFKGKLALGPMHGLDKGLNQYVDFFTVNPMEHAELSKISIHTAADYSWNMDNYDYDKAW 111------------1111--------------3333------------3333------- NRAIDMLYGDLAEDMKVFANHSTRMDNKTWAKSGREDAPELRAKMDELWNKLSSKEDASA -------!!!!------------------------------------------------- LIEELYGEFARMEEACNNLKANLPEVALEECSRQLDELITLAQGDKASLDMIVAQLNEDT -----------------------3333--------------------------------- EAYESAKEIAQNKLNTALSSFAVISEKVAQSFIQEALSFDLTLI ----------------1111----1111-3333--11113333- >RIBONUCLEASE Z; SWP:P0A8V0; PDB:2CBNA; AMNLIFLGTSAGVPTRTRNVTAILLNLQHPTQSGLWLFDCGEGTQHQLLHTAFNPGKLDK ----------------------------------------2222---1111--3333--- IFISHLHGDHLFGLPGLLCSRSMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIV ------1111------------------------2222--------1111---------- EIGAGEILDDGLRKVTAYPLEHPLECYGYRIEEHDAPGALNAQALKAAGVPPGPLFQELK ------------------------------------------------------------ AGKTITLEDGRQINGADYLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITM ------1111---1111--------------------33333333-----------3333 
EAKANSRGHSSTRQAATLAREAGVGKLIITHVSSRYDDKGCQHLLRECRSIFPATELAND ----1111------------------------33333333------------------22 FTVFNV 22---- >NEOCARZINOSTATIN; SWP:P0A3R9; PDB:2CBOA; AAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYWVAQWARVDTGVWAYNPADNSSVTADAN -------------2222---------2222----------2222---3333------111 GSASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISFNHH 1---------------1111------3333--------1111------------- >CUCUMBER BASIC PROTEIN; SWP:P00303; PDB:2CBP; AVYVVGGSGGWTFNTESWPKGKRFRAGDILLFNYNPSMHNVVVVNQGGFSTCNTPAGAKV ---2222-------33332222--2222------3333----------------2222-- YTSGRDQIKLPKGQSYFICNFPGHCQSGMKIAVNAL --------------------22221111-------- >NEOCARZINOSTATIN; SWP:P0A3R9; PDB:2CBQA; APTATVTPSSGLSDGTVVKVAGAGLQAGTAYWVYQRAAVDTGVHASNPADLSSVTADANG ------------2222---------2222----------2222---3333------1111 SASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISF ---------------1111------3333--------1111---------- >NEOCARZINOSTATIN; SWP:P0A3R9; PDB:2CBTA; APTATVTPSSGLSDGTVVKVAGAGLQAGTAYWVAQSAWVDTGVYASNPADISSVTADANG -------------------------2222----------2222---3333---------- SASTSLTVRRSFEGFLWDGTRWGTVDCTTAACHVTLRDALSNGPEGVAISF -------------------------------------1111---------- >ATP-DEPENDENT CLP PROTEAS; SWP:P0A526; PDB:2CBYA; SLTDSVYERLLSERIIFLGSEVNDEIANRLCAQILLLAAEDASKDISLYINSPGGSISAG ---------3333----------------------------------------------- MAIYDTMVLAPCDIATYAMGMAASMGEFLLAAGTKGKRYALPHARILMHQPIAIQAEQFA -----------------------------11112222---1111---------------- VIKKEMFRLNAEFTGQPIERIEADSDRDRWFTAAEALEYGFVDHIITRA ------------------------1111--------------------- >MULTIDRUG RESISTANCE-ASSO; SWP:P33527; PDB:2CBZA; NSITVRNATFTWARSDPPTLNGITFSIPEGALVAVVGQVGCGKSSLLSALLAEMDKVEGH ------------3333-----------2222------2222-------1111-------- VAIKGSVAYVPQQAWIQNDSLRENILFGCQLEEPYYRSVIQACALLPDLEILPSGDRTEI -------------------------iiii-----------1111-3333----------- GEKGVNLSGGQKQRVSLARAVYSNADIYLFDDPLSAVDAHVGKHIFENVIGPKGMLKNKT 
2222---------------------------1111---------------1111-1111- RILVTHSMSYLPQVDVIIVMSGGKISEMGSYQELLARDGAFAEFLRTYASH -------1111---------iiii--------------------------- >BETA-LACTAMASE; SWP:Q59517; PDB:2CC1A; APIDDQLAELERRDNVLIGLYAANLQSGRRITHRPDEMFAMCSTFKGYVAARVLQMAEHG 3333-----------------------------1111-------------------1111 EISLDNRVFVDADALVPNSPVTEARAGAEMTLAELCQAALQRSDNTAANLLLKTIGGPAA --1111----3333----3333--2222-------------------------------- VTAFARSVGDERTRLDRWEVELNSAIPGDPRDTSTPAALAVGYRAILAGDALSPPQRGLL -----1111----------------2222------------------------------- EDWMRANQTSSMRAGLPEGWTTADKTGSGDYGSTNDAGIAFGPDGQRLLLVMMTRSQAHD ----------3333--2222---------iiii--------------------------1 PKAENLRPLIGELTALVLPSLL 111--3333-------3333-- >PROTEIN VIRB8; SWP:P17798; PDB:2CC3A; GPHMTQEEAVVNASLWEYVRLRESYDADTAQYAYDLVSNFSAPMVRQNYQQFFNYPNPTS -------------------------3333--------1111------------------- PQVILGKHGRLEVEHIASNDVTPGVQQIRYKRTLIVDGKMPMASTWTATVRYEKVTSLPG 3333!!!!-------------2222----------2222-------------------33 RLRLTNPGGLVVTSYQTSEDTVSN 331111------------------ ------------------------------------------------------------ ---- >PEROXIDASE/CATALASE T; SWP:Q08129; PDB:2CCAA; MKYPVEGGGNQDWWPNRLNLKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDIEEVMT --3333--3333-1111---1111--3333---111133333333------------111 TSQPWWPADYGHYGPLFIRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKAR 1-1111-2222-------------------------11111111-33333333------- RLLWPVKKKYGKKLSWADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWL ---------!!!!---------------1111-------------------------222 GDERYSGKRDLENPLAAVQMGLIYVNPEGPNGNPDPMAAAVDIRETFRRMAMNDVETAAL 2-----------------2222---1111iiii-3333---------1111--------- IVGGHTFGKTHGAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVVWTNTP --------------1111---3333-3333--------!!!!!!!!------------11 TKWDNSFLEILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLR 11-------1111-------1111------%%%%2222--------------33333333 VDPIYERITRRWLEHPEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAV 
-------------------------------1111-3333--1111----3333------ SHDLVGEAEIASLKSQIRASGLTVSQLVSTAWAAASSFRGSDKRGGANGGRIRLQPQVGW ----------------1111-------------1111---------22221111-33333 EVNDPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAGHNITV 3331111--------------------------------------------1111----- PFTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEMT ---------3333-33333333-----1111--------3333------1111------- VLVGGLRVLGANYKRLPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKDGSG ------1111-2222-2222-----------------1111---------------1111 KVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKFVQDFVAAWDKVMNLDRFDVR ------33333333----------1111-----------------1111-3333- >Cyclin-A2; SWP:P20248; PDB:2CCHB; NEVPDYHEDIHTYLREMEVKCKPKVGYMKKQPDITNSMRAILVDWLVEVGEEYKLQNETL ---1111-----------1111-11111111------------------------3333- HLAVNYIDRFLSSMSVLRGKLQLVGTAAMLLASKFEEIYPPEVAEFVYITDDTYTKKQVL ---------3333---3333---------------------3333--1111--------- RMEHLVLKVLTFDLAAPTVNQFLTQYFLHQQPANCKVESLAMFLGELSLIDADPYLKYLP ---------%%%%-------------1111----------------3333--------33 SVIAGAAFHLALYTVTGQSWPESLIRKTGYTLESLKPCLMDLHQTYLKAPQHAQQSIREK 33------------------3333------3333-------------3333-------11 YKNSKYHGVSLLNPPETLNL 113333-3333--------- >THYMIDYLATE KINASE; SWP:P65248; PDB:2CCJA; SAFITFEGPEGSGKTTVINEVYHRLVKDYDVIMTREPGGVPTGEEIRKIVLEGNDMDIRT --------2222------------------------------------------------ EAMLFAASRREHLVLKVIPALKEGKVVLCDRYIDSSLAYQGYARGIGVEEVRALNEFAIN -------------------------------3333-----------------------ii GLYPDLTIYLNVSAEVGRERIIKNSNRLDQEDLKFHEKVIEGYQEIIHRFKSVNADQPLE ii---------------------------------------------------------- NVVEDTYQTIIKYLEKI ------------1111- >CELLULOSOMAL SCAFFOLDING ; SWP:Q06851; PDB:2CCLA; GVVVEIGKVTGSVGTTVEIPVYFRGVPSKGIANCDFVFRYDPNVLEIIGIDPGDIIVDPN -----------2222-----------1111----------3333--------1111---3 PTKSFDTAIYPDRKIIVFLFAEDSGTGAYAITKDGVFAKIRATVKSSAPGYITFDEVGGF 333-------1111------------1111------------------------------ ADNDLVEQKVSFIDGGVNVGNATPTKLEH 
-1111------------1111-------- >Endo-1,4-beta-xylanase Y ; SWP:P51584; PDB:2CCLB; LGVNGDGTINSTDLTMLKRSVLRAITLTDDAKARADVDKNGSINAADVLLLSRYLLRVI 22---------------------------------1111----3333------------ >CALEXCITIN; SWP:O76764; PDB:2CCMA; AAHQLSDFQRNKILRVFNTFYDCNHDGVIEWDDFELAIKKICNLHSWPTDGKKHNEARAT ---------------------1111----3333---------1111-1111--------- LKLIWDGLRKYADENEDEQVTKEEWLKMWAECVKSVEKGESLPEWLTKYMNFMFDVNDTS ------------1111-------------------1111------------------333 GDNIIDKHEYSTVYMSYGIPKSDCDAAFDTLSDGGKTMVTREIFARLWTEYFVSNDRGAK 3-------------1111--------------iiii-------------------1111- GNHLFGTLKL 1111------ >PEPTIDE N-GLYCANASE HOMOL; SWP:Q96IV0; PDB:2CCQA; GSASPAVAELCQNTPETFLEASKLLLTYADNILRNPNDEKYRSIRIGNTAFSTRLLPVRG 3333------------------------------11111111--1111------1111-- AVECLFEMGFEEGETHLIFPKKASVEQLQKIRDLIAIER -------------------1111-----------1111- >HELIX POMATIA AGGLUTININ; SWP:Q2F1K8; PDB:2CCVA; RVQSGKIDCGDDAGWAKVPSDDPGRDNTRELAKNITFASPYCRPPVVLLSITQLDVEQSQ ----------3333-------1111-------------------------------1111 NLRVIARLYSVSPSGFKASCYTWHNTKVYSMSI -----------1111-------!!!!------- >AZURIN II; SWP:P56275; PDB:2CCWA; AQCEATVESNDAMQYNVKEIVVDKSCKQFTMHLKHVGKMAKVAMGHNLVLTKDADKQAVA ----------------------1111-------------3333--------3333----- TDGMGAGLAQDYVKAGDTRVIAHTKVIGGGESDSVTFDVSKIAAGENYAYFCSFPGHWAM ------3333---2222----------2222------3333-2222-------2222111 MKGTLKLGS 1-------- >CYTOCHROME C; SWP:P00152; PDB:2CCYA; QSKPEDLLKLRQGLMQTLKSQWVPIAGFAAGKADLPADAAQRAENMAMVAKLAPIGWAKG ---------------------------1111--------------------3333--222 TEALPNGETKPEAFGSKSAEFLEGWKALATESTKLAAAAKAGPDALKAQAAATGKVCKAC 2--2222--3333-1111-------------------3333------------------- HEEFKQD ------- --------------------------- >CYTOCHROME P450 MONOOXYGE; SWP:O87605; PDB:2CD8A; VLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKD --3333--3333---------1111-------3333--------3333-----3333--3 WRNSTTPLTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDA 
333-----3333-----3333------------3333----------------------- MLAAPDGRADLMESLAWPLPITVISELLGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAE ---------3333-----------------3333-3333--------------------- MSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAHILLVAGHETTVNLIA -----------2222---------------1111-3333---------1111-------- NGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGD ------------------1111----------------------------iiii--2222 TVLVVLADAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLE -----------3333--1111-1111-----1111-11111111---------------- RCPDLALDVSPGELVWYPNPMIRGLKALPIRW ---------3333-----1111---------- >GLUCOSE DEHYDROGENASE; SWP:O93715; PDB:2CDCA; MKAIIVKPPNAGVQVKDVDEKKLDSYGKIKIRTIYNGICGADREIVNGKLGKDFLVLGHE ------------------3333----------------3333--1111------------ AIGVVEESYHGFSQGDLVMPVNRRGCGICRNCLVGRPDFCETGEFGEAGIHKMDGFMREW ------------2222---------------11113333-------2222---------- WYDDPKYLVKIPKSIEDIGILAQPLADIEKSIEEILEVQKRVPVWTCDDGTLNCRKVLVV ---3333----33331111-------------------3333----1111-1111----- GTGPIGVLFTLLFRTYGLEVWMANRREPTEVEQTVIEETKTNYYNSSNGYDKLKDSVGKF ------------------------------------1111-----1111----------- DVIIDATGADVNILGNVIPLLGRNGVLGLFGFSTSGSVPLDYKTLQEIVHTNKTIIGLVN -----------3333-3333-2222-----------------------1111-------- GQKPHFQQAVVHLASWKTLYPKAAKMLITKTVSINDEKELLKVLREKEHGEIKIRILWE -3333--------3333---3333--------1111-----------2222-------- >TCR 5E; SWP:NA; PDB:2CDFA; NQVEQSPQFLSIQEGENLTVYCNSSSVFSSLQWYRQEPGEGPVLLVTVVTGGEVKKLKRL ------------------------------------2222---------2222---!!!! 
TFQFGDARKDSSLHITAAQPGDTGLYLCAGADRGSTLGRLYFGRGTQLTVWPDIQNPDPA ----1111----------3333---------3333------------------------- VYQLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSNK --------------------1111------1111----------1111------------ SDFACANAFNN ---3333---- >TCR 5E; SWP:NA; PDB:2CDFB; EADIYQTPRYLVIGTGKKITLECSQTMGHDKMYWYQQDPGMELHLIHYSYGVNSTEKGDL -------------2222--------------------1111---------2222----33 SSESTVSRIRTEHFPLTLESARPSHTSQYLCASSEFRDGNEKLFFGSGTQLSVLEDLNKV 33-------3333--------3333-----------%%%%---------------3333- FPPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKEQ ----------3333--------------------------iiii--2222---------1 PALNDSRYALSSRLRVSATFWQDPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAW 111-------------------1111-----------3333------------------- GRAD ---- >TCR 5E; SWP:NA; PDB:2CDGA; QALSIQEGENATMNCSYKTSINNLQWYRQNSGRGLVHLILIRSNEREKHSGRLRVTLDTS -----2222--------------------%%%%----------------!!!!------- KKSSSLLITASRAADTASYFCAPFDRGSTLGRLYFGRGTQLTVWPDIQNPDPAVYQLRDS -----------3333--------------------------------------------- KSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSNKSDFACAN -----------------------1111---------3333-------------------- AFNN ---- >ADENYLATE KINASE; SWP:NA; PDB:2CDNA; HMRVLLLGPPGAGKGTQAVKLAEKLGIPQISTGELFRRNIEEGTKLGVEAKRYLDAGDLV --------2222------------------------------------------------ PSDLTNELVDDRLNNPDAANGFILDGYPRSVEQAKALHEMLERRGTDIDAVLEFRVSEEV 3333----------3333-----------------------1111--------------- LLERLKGRGRADDTDDVILNRMKVYRDETAPLLEYYRDQLKTVDAVGTMDEVFARALRAL ---------1111----------------------3333------------------111 GK 1- >BETA-AGARASE 1; SWP:Q6DN99; PDB:2CDPA; STASIAVEAENFNAVGGTFSDGQAQPVSVYTVNGNTAINYVNQGDYADYTIAVAQAGNYT -------3333--------------------iiii------2222--------------- ISYQAGSGVTGGSIEFLVNENGSWASKTVTAVPNQGWDNFQPLNGGSVYLSAGTHQVRLH -------------------iiii------------1111--------------------- GAGSNNWQWNLDKFTLSN ------------------ >ASPARTOKINASE; SWP:Q9LYU8; PDB:2CDQA; 
KGITCVMKFGGSSVASAERMKEVADLILTFPEESPVIVLSAMGKTTNNLLLAGEKAVSCG ---------3333--3333----------1111---------------------3333-3 VSNASEIEELSIIKELHIRTVKELNIDPSVILTYLEELEQLLKGIAMMKELTLRTRDYLV 3331111----------------------------------------------------- SFGECLSTRIFAAYLNTIGVKARQYDAFEIGFITTDDFTNGDILEATYPAVAKRLYDDWM -------------------------3333-------3333-------------------- HDPAVPIVTGFLGKGWKTGAVTTLGRGGSDLTATTIGKALGLKEIQVWKDVDGVLTCDPT ---------------------------------------------------------333 IYKRATPVPYLTFDEAAELAYFGAQVLHPQSMRPAREGEIPVRVKNSYNPKAPGTIITKT 31111------3333------------3333-3333---------1111----------- RDMTKSILTSIVLKRNVTMLDIASTRMLGQVGFLAKVFSIFEELGISVDVVATSEVSISL --2222-----------------3333--------------1111--------------- TLDPSKLWSRELIQQELDHVVEELEKIAVVNLLKGRAIISLIGNVQHSSLILERAFHVLY ---3333-----3333-------3333----------------3333------------- TKGVNVQMISQGASKVNISFIVNEAEAEGCVQALHKSFFESGDLSELLIQ ----------------------3333------------------------ >NADPH OXIDASE; SWP:Q9F1X5; PDB:2CDUA; MKVIVVGCTHAGTFAVKQTIADHPDADVTAYEMNDNISFLSGIALYLGKEIKNNDPRGLF ----------------------1111-------------------1111-2222--1111 YSSPEELSNLGANVQMRHQVTNVDPETKTIKVKDLITNEEKTEAYDKLIMTTGSKPTVPP -------1111------------3333--------------------------------- IPGIDSSRVYLCKNYNDAKKLFEEAPKAKTITIIGSGYIGAELAEAYSNQNYNVNLIDGH 2222-1111--------------3333--------------------1111--------- ERVLYKYFDKEFTDILAKDYEAHGVNLVLGSKVAAFEEVDDEIITKTLDGKEIKSDIAIL ---3333-3333--------1111----------------------1111---------- CIGFRPNTELLKGKVAMLDNGAIITDEYMHSSNRDIFAAGDSAAVHYNPTNSNAYIPLAT -------3333------1111----1111---1111---1111----1111------333 NAVRQGRLVGLNLTEDKVKDMGTQSSSGLKLYGRTYVSTGINTALAKANNLKVSEVIIAD 3-----------------------------iiii-------------------------- NYRPEFMLSTDEVLMSLVYDPKTRVILGGALSSMHDVSQSANVLSVCIQNKNTIDDLAMV ---1111-----------------------------3333-------1111-3333---- DMLFQPQFDRPFNYLNILGQAAQAQADKAH ----3333---------------------- >CARDIOTOXIN CTX I; SWP:P60304; PDB:2CDX; 
LKCNKLIPIASKTCPAGKNLCYKMFMMSDLTIPVKRGCIDVCPKNSLLVKYVCCNTDRCN -------------------------3333----------------1111------2222- >SUPEROXIDE DISMUTASE [MN]; SWP:NA; PDB:2CDYA; AYTLPQLPYAYDALEPHIDARTMEIHHTKHHQTYVDNANKALEGTEFADLPVEQLIQQLD ---------1111---------------------------------11113333111111 RVPADKKGALRNNAGGHANHSMFWQIMGQGQANQPSGELLDAINSAFGSFDAFKQKFEDA 113333-----------------1111--------------------------------- AKTRFGSGWAWLVVKDGKLDVVSTANQDNPLMGEAIAGVSGTPILGVDVWEHAYYLNYQN --------------iiii------!!!!3333-----------------3333----!!! RRPDYLAAFWNVVNWDEVSKRYAAAK !--------1111---------1111 >CYTOCHROME C6; SWP:Q93VA3; PDB:2CE0A; LDIQRGATLFNRACAACHDTGGNIIQPGATLFTKDLERNGVDTEEEIYRVTYFGKGRMPG --------------3333iiii---2222-------1111--------------!!!!-- FGEKCTPRGQCTFGPRLQDEEIKLLAEFVKFQADQGWPT -1111-3333----------------------1111--- >GTPase HRas [Precursor]; SWP:P01112; PDB:2CE2X; MTEYKLVVVGAGGVGKSALTIQLIQNHFVDECDPTIEDSYRKQVVIDGETCLLDILDTAG ----------2222------------------1111---------iiii----------- QEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKSDL ----------------------1111------------------------------3333 AARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQH -----3333-----1111---------------------------- >CELL DIVISION PROTEIN FTS; SWP:Q9WZ49; PDB:2CE7A; YKPSGNKRVTFKDVGGAEEAIEELKEVVEFLKDPSKFNRIGARMPKGILLVGPPGTGKTL ---------3333------------------------1111-----------2222---- LARAVAGEANVPFFHISGSDFVELFVGVGAARVRDLFAQAKAHAPCIVFIDEIDAVGRHD ----------------111122222222------------1111---------------3 EREQTLNQLLVEMDGFDSKEGIIVMAATNRPDILDPALLRPGRFDKKIVVDPPDMLGRKK 333-------------3333---------1111-3333-2222----------------- ILEIHTRNKPLAEDVNLEIIAKRTPGFVGADLENLVNEAALLAAREGRDKITMKDFEEAI -----1111--------------22223333------------1111------------- DRVILISPAEKRIIAYHEAGHAVVSTVVPNGEPVHRISIIKYLVSRNELLDKLTALLGGR 1111-----------------------1111----------------------------- AAEEVVFGDVTSGAANDIERATEIARNMVCQLGMSEELGPLAWGKLRNYSEEVASKIDEE -----------1111--------------------------------------------- 
VKKIVTNCYERAKEIIRKYRKQLDNIVEILLEKETIEGDELRRILSE ------------------------------------3333------- >THYROXINE-BINDING GLOBULI; SWP:P05543; PDB:2CEOA; YKMSSINADFAFNLYRRFTVETPDKNIFFSPVSISAALVMLSFGACCSTQTEIVETLGFN -----------------------------------------1111!!!!----------3 LTDTPMVEIQHGFQHLICSLNFPKKELELQIGNALFIGKHLKPLAKFLNDVKTLYETEVF 333----------------------------------3333------------------- STDFSNISAAKQEINSHVEMQTKGKVVGLIQDLKPNTIMVLVNYIHFKAQWANPFDPSKT -----3333----------1111----------3333------------------3333- EDSSSFLIDKTTTVQVPMMHQMEQYYHLVDMELNCTVLQMDYSKNALALFVLPKEGQMES -----------------------------------------------------2222--- VEAAMSSKTLKKWNRLLQKGWVDLFVPKFSISATYDLGATLLKMGIQHAYSENADFSGLT -----3333----------------------------33331111-33331111-3333- EDNGLKLSNAAHKAVLHIGEKGTEAAAVPENTFLHPIIQIDRSFMLLILERSTRSILFLG ------------------3333-------------------------------------- KVVNPTE ------- >ARGINASE; SWP:P53608; PDB:2CEVA; KPISIIGVPMDLGQTRRGVDMGPSAMRYAGVIERLERLHYDIEDLGDIPIGKAERLHEQG ----------1111---3333-------------3333----------------3333-- DSRLRNLKAVAEANEKLAAAVDQVVQRGRFPLVLGGDHSIAIGTLAGVAKHYERLGVIWY 1111-------------------------------------------3333--------- DAHGDVNTAETSPSGNIHGMPLAASLGFGHPALTQIGGYSPKIKPEHVVLIGVRSLDEGE -------3333------------1111--3333-2222-----1111------------- KKFIREKGIKIYTMHEVDRLGMTRVMEETIAYLKERTDGVHLSLDLDGLDPSDAPGVGTP --------------------------------1111--------1111-3333------- VIGGLTYRESHLAMEMLAEAQIITSAEFVEVNPILDERNKTASVAVALMGSLFGEKLM -------------------------------3333-%%%%----------1111---- >PROTEIN HI0146; SWP:P44542; PDB:2CEYA; ADYDLKFGMNAGTSSNEYKAAEMFAKEVKEKSQGKIEISLYPSSQLGDDRAMLKQLKDGS -----------1111----------------iiii------%%%%--------------- LDFTFAESARFQLFYPEAAVFALPYVISNYNVAQKALFDTEFGKDLIKKMDKDLGVTLLS ------33333333---3333-2222---------------------------------- QAYNGTRQTTSNRAINSIADMKGLKLRVPNAATNLAYAKYVGASPTPMAFSEVYLALQTN ----------------33332222------------------------3333-------- 
AVDGQENPLAAVQAQKFYEVQKFLAMTNHILNDQLYLVSNETYKELPEDLQKVVKDAAEN -------3333----3333----------------------3333--------------- AAKYHTKLFVDGEKDLVTFFEKQGVKITHPDLVPFKESMKPYYAEFVKQTGQKGESALKQ --------------------1111---------------------------3333----- IEAINP 2222-- >CINNAMYL ALCOHOL DEHYDROG; SWP:Q53ZN0; PDB:2CF5A; AERKTTGWAARDPSGILSPYTYTLRETGPEDVNIRIICCGICHTDLHQTKNDLGMSNYPM -----------3333------------1111----------------1111--------- VPGHEVVGEVVEVGSDVSKFTVGDIVGVGCLVGCCGGCSPCERDLEQYCPKKIWSYNDVY -------------------------------------3333---3333-----------1 INGQPTQGGFAKATVVHQKFVVKIPEGMAVEQAAPLLCAGVTVYSPLSHFGLKQPGLRGG 111-------------1111--------1111---------------------------- ILGLGGVGHMGVKIAKAMGHHVTVISSSNKKREEALQDLGADDYVIGSDQAKMSELADSL ------------------------------------1111-----1111--1111----- DYVIDTVPVHHALEPYLSLLKLDGKLILMGVINNPLQFLTPLLMLGRKVITGSFIGSMKE ------------33333333------------------3333------------------ TEEMLEFCKEKGLSSIIEVVKMDYVNTAFERLEKNDVRYRFVVDVEGSNLDA --------1111--------3333-------------------1111----- >DPR; SWP:Q4A3W3; PDB:2CF7A; ASFSPRPDSKAVLNQAVADLSVAHSILHQVHWYMRGRGFMIWHPKMDEYMEEIDGYLAEM -----------------------------------2222---3333-------------- SERLITLGGAPFSTLKEFSENSQLKEVLGDYNVTIEEQLARVVEVFRYLAALFQKGFDVS ----1111---------------------------------------------------- DEEGDSVTNDIFNVAKASIEKHIWMLQAELGQAPKL ------------------------------------ >THYMIDYLATE SYNTHASE; SWP:O41156; PDB:2CFAA; MSAKLISVTKPVVEGVNTAEELIAYAARVSNPENQKTASGLLKYIRHKHWSIFETAFMTL ------------2222----------------------------11113333-------- ELKTSRGIAAQVLRHRSFHFQEFSQTWWATEQEKLYAQSMELYNKALEKGIAKECARFIL --------------3333-------3333-----------------1111-3333-1111 PLSTPTTIYMSGTIRDWIHYIELRTSNGTQREHIDLANACKEIFIKEFPSIAKALDWVH 1111------------------------------------------------1111--- >GLUTAMATE-1-SEMIALDEHYDE ; SWP:Q8DLK8; PDB:2CFBA; PIVFDHVKGAHIWDVDGNQYIDYVGSWGPAIVGHAHPEVIDALHAALEKGTSFGAPCLLE -------!!!!--1111------%%%%--1111-----------3333--------3333 
NILAEMVIAAVPSVEMVRFVNSGTEACMAVLRLMRAYTQREKVIKFEGCYHGHAATLTAP ------------------------------------------------------------ YNDLEAVSRLFEQYPNDIAGVILEPVVGNAGFIPPDAGFLEGLRELTKQYGALLVFDEVM -------------1111------------------2222--------1111--------- TGFRIAYGGAQEKFGVTPDLTTLGKVIGGGLPVGAYGGRAEIMKMVAPAGPTLSGNPLAM 2222-1111---------------1111-----------3333----------------- TAGIKTLEILSRPGSYEHLDRITGKLVQGLLDAAREFGHEVCGGHISGMFGLFFTAGPVT -----------2222--------------------------------------------- NYEQAKQSDLKKFAAFHRGMLEQGIYLAPSQFEAGFTSLAHTEADIERTIAAARTVLSQL 33333333---------------------1111----1111------------------- >ALLERGEN; SWP:O93970; PDB:2CFEA; MSNVFFDITKNGAPLGTIKFKLFDDVVPKTAANFRALCTGEKGFGYAGSHFHRVIPDFML ---------iiii--------------------------1111--2222-----2222-- QGGDFTAGNGTGGKSIYGAKFADENFQLKHNKPGLLSMANAGPNTNGSQFFITTVVTSWL --------------1111--------------------------------------3333 DGKHVVFGEVIDGMNVVKAIEAEGSGSGKPRSRIEIAKCGVC ------------3333----11111111-------------- >THERMOSTABLE DNA LIGASE; SWP:P56709; PDB:2CFMA; RYLELAQLYQKLEKTTKLIKTRLVADFLKKVPDDHLEFIPYLILGEVFPEWDERELGVGE -----------1111------------11111111------1111---1111-------- KLLIKAVAATGIDAKEIEESVKDTGDLGESIALAVKKKKQKSFFSQPLTIKRVYQTLVKV ------------------3333--------------3333-------------------- AETTGEGSQDKKVKYLADLFDAEPLEAKYLARTILGTRTGVAEGLLRDAIAAFHVKVELV ------------------------------------------------------------ ERAYLTSDFGYVAKIAKLEGNEGLAKVQVQLGKPIKPLAQQAASIRDALLEGGEAEFEIK -----------------------1111--2222--------------------------- YDGARVQVHKDGSKIIVYSRRLENVTRAIPEIVEALKEAIIPEKAIVEGELVAIGENGRP ----------!!!!----1111--3333--------------------------1111-- LPFQYVLRRFRRKHNIEEEKIPLELNLFDVLYVDGQSLIDTKFIDRRRTLEEIIKQNEKI -3333---------3333-------------------1111------------------- KVAENLITKKVEEAEAFYKRALEGHEGLAKRLDAVYEPGNRGKKWLKIKPTENLDLVIIG ---------3333-----------------1111--2222-------------------- AEWGEGRRAHLFGSFILGAYDPETGEFLEVGKVGSGFTDDDLVEFTKLKPLIIKEEGKRV ----!!!!-------------1111----------------------3333----!!!!- 
WLQPKVVIEVTYQEIQKSPKYRSGFALRFPRFVALRDDKGPEDADTIERIAQLYELQEKK --------------------3333---------------1111----------------1 GKVES 111-- >GLUTAMYL-TRNA SYNTHETASE; SWP:Q8DLI5; PDB:2CFOA; TVRVRLAPSPTGNLHIGTARTAVFNWLYARHRGGKFILRIEDTDRERSRPEYTENILEGL --------------------------------------------3333------------ QWLGLTWDEGPYFQSDRLDLYRQAIQTLLDKGLAYYCYCTPEELEALRAEQKAKGQAPRY ------------3333-------------------------------------------- DNRHRHLTPEEQAAFEAAGRTPVIRFKIEDDRQIEWQDLVRGRVSWQGADLGGDMVIARA -1111----------1111---------1111--------------3333---------- APRGEIGYPLYNLVVVVDDIAMGITDVIRGEDHIGNTPKQILLYEALGATPPNFAHTPLI -2222-----3333---------------33333333-------1111------------ LNSTGQKLSKRDGVTSISDFRAMGYLAPALANYMTLLGWSPPEGVGELFTLDLAAKHFSF --------1111---3333-----------------------------------111133 ERINKAGARFDWDKLNWLNRQYIQQLEPEEFLAELIPLWQGAGYAFDEERDRPWLFDLAQ 33--------3333--------1111-------------1111---3333---------- LLQPGLNTLREAIDQGAVFFIPSVTFDSEAMAQLGQPQSATILAYLLEHLPAEPALTVAM -------3333-11111111-----------33331111------3333----------- GQQLIQQAAKAAGVKKGATMRTLRAALTGAVHGPDLMAAWQILHQRGWDEPRLAAALKQA -----------------------------------------------------------3 QTTS 333- >LACTOSE PERMEASE; SWP:P02920; PDB:2CFQA; MYYLKNTNFWMFGLFFFFYFFIMGAYFPFFPIWLHDINHISKSDTGIIFAAISLFSLLFQ --11113333---3333----3333-1111-----1111----3333------------- PLFGLLSDKLGLRKYLLWIITGMLVMFAPFFIFIFGPLLQYNILVGSIVGGIYLGFCFNA ---------------------1111----------------------------------- GAPAVEAFIEKVSRRSNFEFGRARMFGCVGWALGASIVGIMFTINNQFVFWLGSGCALIL -------3333--1111----------3333-----------------------1111-- AVLLFFAKTDAPSSATVANAVGANHSAFSLKLALELFRQPKLWFLSLYVIGVSCTYDVFD ----------------------------------33333333---------------333 QQFANFFTSFFATGEQGTRVFGYVTTMGELLNASIMFFAPLIINRIGGKNALLLAGTIMS 3-----3333-------------------------------------3333--------- VRIIGSSFATSALEVVILKTLHMFEVPFLLVGCFKYITSQFEVRFSATIYLVCFCFFKQL -----------3333-1111---------------------3333--------------- 
AMIFMSVLAGNMYESIGFQGAYLVLGLVALGFTLISVFTLSGPGPLSLLRRQVNEVA ---1111--------------------------1111----------------3333 >HUMAN PROTEIN TYROSINE PH; SWP:Q12913; PDB:2CFVA; MKLIRVENFEAYFKKQQADSNCGFAEEYEDLKLVGISQPKYAAELAENRGKNRYNNVLPY ----1111--------------------1111--1111-3333-33331111-------3 DISRVKLSVQTHSTDDYINANYMPGYHSKKDFIATQGPLPNTLKDFWRMVWEKNVYAIIM 333--------3333---------3333----------1111--------1111------ LTEYWPSAQDYGDITVAMTSEIVLWTIRDFTVKNIQESHPLRQFHFTTTDLLINFRYLVR ----------!!!!---------------------------------------------- DYPILVHCSAGVGRTGTFIAIDRLIYQIENENTVDVYGIVYDLRMHRPLMVQTEDQYVFL ----------------------------------------------------3333---- NQCVLDIVR --------- >HTH-TYPE TRANSCRIPTIONAL ; SWP:P96582; PDB:2CFXA; MKLDQIDLNIIEELKKDSRLSMRELGRKIKLSPPSVTERVRQLESFGIIKQYTLEVDQKK -------------------------------------------1111---------3333 LGLPVSCIVEATVKNADYERFKSYIQTLPNIEFCYRIAGAACYMLKINAESLEAVEDFIN -------------%%%%-------1111-------------------------------- KTSPYAQTVTHVIFSEIDTK -3333--------------- >THIOREDOXIN REDUCTASE 1; SWP:Q16881; PDB:2CFYA; SYDYDLIIIGGGSGGLAAAKEAAQYGKKVMVLDFVTPTPLGTRWGLGGTCVNVGCIPKKL ----------------------1111-----------1111-------3333-------- MHQAALLGQALQDSRNYGWKVEETVKHDWDRMIEAVQNHIGSLNWGYRVALREKKVVYEN ------------3333-----------------------------------1111----- AYGQFIGPHRIKATNNKGKEKIYSAERFLIATGERPRYLGIPGDKEYCISSDDLFSLPYC ------2222----1111----------------------2222-----3333------- PGKTLVVGASYVALECAGFLAGIGLDVTVMVRSILLRGFDQDMANKIGEHMEEHGIKFIR --------------------1111------------------------------------ QFVPIKVEQIEAGTPGRLRVVAQSTNSEEIIEGEYNTVMLAIGRDACTRKIGLETVGVKI ---------------------------------------------------3333----- NEKTGKIPVTDEEQTNVPYIYAIGDILEDKVELTPVAIQAGRLLAQRLYAGSTVKCDYEN ----------------1111---3333--------------------------------- VPTTVFTPLEYGACGLSEEKAVEKFGEENIEVYHSYFWPLEWTIPSRDNNKCYAKIICNT -------------------------3333---------33333333------------33 KDNERVVGFHVLGPNAGEVTQGFAAALKCGLTKKQLDSTIGIHPVCAEVFTTLSVTKRSG 
33------------------------1111--------------33333333---3333- ASIL ---- >REGULATORY PROTEIN ASNC; SWP:P0ACI6; PDB:2CG4A; YLIDNLDRGILEALMGNARTAYAELAKQFGVSPETIHVRVEKMKQAGIITGARIDVSPKQ ---3333---------1111------------------------------------3333 LGYDVGCFIGIILKSAKDYPSALAKLESLDEVTEAYYTTGHYSIFIKVMCRSIDALQHVL --------------3333----------3333---------------------------- INKIQTIDEIQSTETLIVLQNPIMRTIKP ---1111---------------------- >Fatty acid synthase; SWP:P49327; PDB:2CG5B; QRDLVEAVAHILGIRDLAAVNLDSSLADLGLDALMSVEVRQTLERELNLVLSVREVRQLT --------------------1111------------------------------------ LRKLQELSSKA ----------- >FIBRONECTIN; SWP:Q5MD86; PDB:2CG7A; AEETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRISCTIANRCHEGGQSYKIGD --------------2222-----!!!!-----------------1111--iiii--2222 TWRRPHETGGYMLECVCLGNGKGEWTCKPI -----3333---------iiii-------- >DIHYDRONEOPTERIN ALDOLASE; SWP:P59657; PDB:2CG8A; MDQLQIKDLEMFAYHGLFPSEKELGQKFIVSAILSYDMTKAATVHYGELCQQWTTWFQET -----------------3333----------------3333--------------1111- SEDLIETVAYKLVERTFESYPLVQEMKLELKKPWAPVHLSLDTCSVTIHRRKQRAFIALG ---3333------------3333--------1111------------------------- SNMGDKQANLKQAIDKLRARGIHILKESSVLASFANQVVEVETWLPAQDLLETLLAIESE ---------------------------------------------3333----------- LGRIDLDLLFVEDQILYTDDLILPHPYIAERLFVLESLQEIAPHFIHPILKQPIRNLYDA ----------!!!!---1111---1111-------------1111-------3333---- L - >RHAMNULOKINASE; SWP:Q8X899; PDB:2CGLA; TFRNCVAVDLGASSGRVLARYERECRSLTLREIHRFNNGLHSQNGYVTWDVDSLESAIRL ----------1111-------3333-----------------iiii-------------- GLNKVCAAGIAIDSIGIDTWGVDFVLLDQQGQRVGLPVAYRDSRTNGLAQAQQQLGKRDI -----1111------------------1111-------11111111-------------- YQRSGIQFLPFNTLYQLRALTEQQPELIPHIAHALLPDYFSYRLTGKNWEYTNATTTQLV --------1111-----------33333333------------------3333-3333-- NINSDDWDESLLAWSGANKAWFGRPTHPGNVIGHWICPQGNEIPVVAVASHDTASAVIAS -----------------3333--------------------------------------- PLNGSRAAYLSSGTWSLGFESQTPFTNDTALAANITNEGGAEGRYRVLKNIGLWLLQRVL 
---1111--------------------------------2222----------------- QERQINDLPALIAATQALPACRFIINPNDDRFINPDECSEIQAACREAQPIPESDAELAR 1111----------1111-------11111111--------------------------- CIFDSLALLYADVLHELAQLRGEDFSQLHIVGGGCQNTLLNQLCADACGIRVIAGPVEAS -------------------------------3333------------------------- TLGNIGIQLTLDELNNVDDFRQVVSTTANLTTFTPNPDSEIAHYVALIHS -----------------------------------1111-------3333 >INNER NUCLEAR MEMBRANE PR; SWP:Q9Y2U8; PDB:2CH0A; GSPEFRWTKEEEETRQMYDMVVKIIDVLRSHNEACQENKDLQPYMPIPHVRDSLIQPHDR -------3333--------3333----------3333--------33333333---1111 KKMKKVWDRAVDFLAANESRVRTETRRIGGADFLVWRWIQPSASCDKILVIPSKVWQGQA 1111333333333333-----------iiii----------------------------- FHLDRRLERPHRD ------------- >3-HYDROXYKYNURENINE TRANS; SWP:Q4LAM2; PDB:2CH1A; KFTPPPASLRNPLIIPEKIMMGPGPSNCSKRVLTAMTNTVLSNFHAELFRTMDEVKDGLR -----3333--------------------------------1111--------------- YIFQTENRATMCVSGSAHAGMEAMLSNLLEEGDRVLIAVNGIWAERAVEMSERYGADVRT ---------------3333----------2222--------------------------- IEGPPDRPFSLETLARAIELHQPKCLFLTHGDSSSGLLQPLEGVGQICHQHDCLLIVDAV ---1111---------------------------------2222----1111-------- ASLCGVPFYMDKWEIDAVYTGAQVLGAPPGITPISISPKALDVIRNRRTKSKVFYWDLLL -2222---3333-------------------------------1111-----3333---- LGNYWGCYDEPKRYHHTVASNLIFALREALAQIAEEGLENQIKRRIECAQILYEGLGKMG --------------------------------------------------------1111 LDIFVKDPRHRLPTVTGIMIPKGVDWWKVSQYAMNNFSLEVQGGLGPTFGKAWRVGIMGE ------3333-1111------------------------------1111--------!!! 
CSTVQKIQFYLYGFKESLKATHPDYIF !--------------------1111-- >NAGK PROTEIN; SWP:NA; PDB:2CH5A; FMAAIYGGVEGGGTRSEVLLVSEDGKILAEADGLSTNHWLIGTDKCVERINEMVNRAKRK ---------------------1111-----------3333-------------------- AGVDPLVPLRSLGLSLSGGDQEDAGRILIEELRDRFPYLSESYLITTDAAGSIATATPDG ---2222----------1111--------------1111--------------------- GVVLISGTGSNCRLINPDGSESGCGGWGHMMGDEGSAYWIAHQAVKIVFDSIDNLEAAPH ---------------1111--------1111-2222------------------------ DIGYVKQAMFHYFQVPDRLGILTHLYRDFDKCRFAGFCRKIAEGAQQGDPLSRYIFRKAG ----------1111--33333333-1111---------------1111------------ EMLGRHIVAVLPEIDPVLFQGKIGLPILCVGSVWKSWELLKEGFLLALTQGRAQNFFSSF ---------3333--3333-1111------3333-3333---------3333-------- TLMKLRHSSALGGASLGARHIGHLLPMDYSANAIAFYSYTFS ------------------1111-----3333----------- >METHYL-ACCEPTING CHEMOTAX; SWP:Q9X0M7; PDB:2CH7A; GSHMKDVQTETFSVAESIEEISKANEEITNQLLGISKEMDNISTRIESISASVQETTAGS 3333-------------------------------------------------------- EEISSATKNIADSAQQAASFADQSTQLAKEAGDALKKVIEVTRMISNSAKDVERVVESFQ ----------------------------------------------------------33 KGAEEITSFVETINAIAEQTNLLALNAAIEAARAGEAGRGFAVVADEIRKLAEESQQASE 33---------------------------------3333--------------------- NVRRVVNEIRSIAEDAGKVSSEITARVEEGTKLADEADEKLNSIVGAVERINEMLQNIAA ------------------------------------------------------------ AIEEQTAAVDEITTAMTENAKNAEEITNSVKEVNARLQEISASTEEVTSRVQTIRENVQM ----------------------------------------------------------11 LKEIVARYK 11------- >33 KDA EARLY PROTEIN; SWP:P03228; PDB:2CH8A; VTAFLGERVTLTSYWRRVSLGPEIEVSWFKLGPGEEQVLIGRMHHDVIFIEWPFRGFFDI ---2222------------!!!!--------------------%%%%---3333------ HRSANTFFLVVTAANISHDGNYLCRMKLGETEVTKQEHLSVVKPLTLSVHSERSQFPDFS --!!!!--------3333---------!!!!----------------------------- VLTVTCTVNAFPHPHVQWLMPGGVMKEKDGSLSVAVDLSLPKPWHLPVTCVGKNDKEEAH --------------------------1111-----------------------!!!!--- GVYVSGYL -------- >CYSTATIN F; SWP:NA; PDB:2CH9A; TCSQDLNSRVKPGFPKTIKTNDPGVLQAARYSVEKFNNCTNDMFLFKESRITRALVQIVK 
------------------1111--------------1111-------------------- GLKYMLEVEIGRTTCKKNQHLRLDDCDFQTNHTLKQTLSCYSEVWVVPWLQHFEVPVLRC ---------------------3333-----3333------------1111---------- HHHHHH ------ >RABPHILIN-3A; SWP:P47709; PDB:2CHDA; TLGALEFSLLYDQDNSNLQCTIIRAKGLKPMDSNGLADPYVKLHLLPGASKSNKLRTKTL -----------3333----------------1111---------------1111------ RNTRNPVWNETLQYHGITEEDMQRKTLRISVCDEDKFGHNEFIGETRFSLKKLKANQRKN ----------------------------------1111----------3333-2222--- FNICLERVI --------- >REPLICATION FACTOR C SMAL; SWP:O28219; PDB:2CHGA; FEIWVEKYRPRTLDEVVGQDEVIQRLKGYVERKNIPHLLFSGPPGTGKTATAIALARDLF --3333-----3333---------------------------2222-------------- GENWRDNFIEMNASDERGIDVVRHKIKEFARTAPIGGAPFKIIFLDEADALTADAQAALR 1111----------1111----------------iiii--------1111---------- RTMEMYSKSCRFILSCNYVSRIIEPIQSRCAVFRFKPVPKEAMKKRLLEICEKEGVKITE -----------------3333-33331111------------------------------ DGLEALIYISGGDFRKAINALQGAAAIGEVVDADTIYQITATA -------3333-------------3333----------3333- >PROTEIN RSC3288; SWP:Q8XUA5; PDB:2CHHA; AQQGVFTLPANTSFGVTAFANAANTQTIQVLVDNVVKATFTGSGTSDKLLGSQVLNSGSG -------------------------------%%%%---------------------!!!! 
AIKIQVSVNGKPSDLVSNQTILANKLNFAMVGSEDGTDNDYNDGIAVLNWPLG -------iiii------------------------------------------ >GLUCOSAMINIDASE; SWP:Q89ZI2; PDB:2CHOA; LQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELLGMLISIGEKGDKSVRKYSR ------------------------3333--------------------22221111-111 QIPDHKEGYYLSVNEKEIVLAGNDERGTYYALQTFAQLLKDGKLPEVEIKDYPSVRYRGV 1---2222-----1111-----------------3333-iiii----------------- VEGFYGTPWSHQARLSQLKFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELV ---------------------1111-------1111------3333-------------- AVANENEVDFVWAIHPGQDIKWNKEDRDLLLAKFEKMYQLGVRSFAVFFDDISGEGTNPQ ---1111--------3333------------------1111------------3333--- KQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGDKLNPSIQIMWTGD ------------1111--------------3333-1111----------1111------- RVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHLLLGPVYGNDTTIAKEMSGFVTN ------------------------------1111---------------3333------- PMEHAESSKIAIYSVASYAWNPAKYDTWQTWKDAIRTILPSAAEELECFAMHNSDLGPNG ----3333------------3333--------------3333----------------11 HGYRREESMDIQPAAERFLKAFKEGKNYDKADFETLQYTFERMKESADILLMNTENKPLI 11-----3333----------1111------------------------1111------- VEITPWVHQFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMFYIDQTSNQNPYQP ------------------------------------------------------------ GVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATTDYMPHKMNLPLQVKANRVLISPVE -------------------------1111--------------------!!!!------- IELDAIYPGENIQINFRLSAGLQKAPVKFVRFQFVLTIEKK ---------------------%%%%---------------- >CHLOROMUCONATE CYCLOISOME; SWP:P05404; PDB:2CHR; MKIDAIEAVIVDVPTKRPIQMSITTVHQQSYVIVRVYSEGLVGVGEGGSVGGPVWSAECA ------------------------------------------------------------ ETIKIIVERYLAPHLLGTDAFNVSGALQTMARAVTGNASAKAAVEMALLDLKARALGVSI -----------3333---11113333--3333---------------------1111--- AELLGGPLRSAIPIAWTLASGDTKRDLDSAVEMIERRRHNRFKVKLGFRSPQDDLIHMEA ---------------------------------1111------------3333------3 LSNSLGSKAYLRVDVNQAWDEQVASVYIPELEALGVELIEQPVGRENTQALRRLSDNNRV 333-----------iiii-------------1111--------1111------------- 
AIMADESLSTLASAFDLARDRSVDVFSLKLCNMGGVSATQKIAAVAEASGIASYGGTMLD ----3333--------------------3333---------------------------- STIGTSVALQLYSTVPSLPFGCELIGPFVLADTLSHEPLEIRDYELQVPTGVGHGMTLDE 3333-------1111---------3333-------------iiii--------------- DKVRQYARVS ---------- >ENTEROCHELIN UPTAKE PERIP; SWP:Q9PMU4; PDB:2CHUA; LPISMSDEGDSFLVKDSLGENKIPKNPSKVVILDLGILDTFDALKLNDKVVGVPAKNLPK ---------------1111--------------3333--------3333----3333-11 YLQQFKNKPSVGGVQQVDFEAINALKPDLIIISGRQSKFYDKLKEIAPTLFVGLDNANFL 111111----------------1111------11111111--------------3333-- SSFENNVLSVAKLYGLEKEALEKISDIKNEIEKAKSIVDEDKKALIILTNSNKISAFGPQ --------------------------------------1111-------!!!!----222 SRFGIIHDVLGINAVDENIKGKSINSEFILEKNPDYIFVVDRNVILGNKERAQGILDNAL 2---------------------------------------3333-------------333 VAKTKAAQNKKIIYLDPEYWYLASGNGLESLKTMILEIKNAVK 3-----1111------------iiii-----------3333-- >CYTOPLASMIC PROTEIN NCK1; SWP:P16333; PDB:2CI9A; GPLGSPWYYGKVTRHQAEMALNERGHEGDFLIRDSESSPNDFSVSLKAQGKNKHFKVQLK ----3333-----------------2222--------1111------------------i ETVYCIGQRKFSTMEELVEHYKKAPIFTSEQGEKLYLVKHLS iii--!!!!-------------------1111---------- >CYTOPLASMIC PROTEIN NCK2; SWP:O43639; PDB:2CIAA; SEWYYGNVTRHQAECALNERGVEGDFLIRDSESSPSDFSVSLKASGKNKHFKVQLVDNVY 1111-----------------2222--------1111------------------%%%%- CIGQRRFHTMDELVEHYKKAPIFTSEHGEKLYLVRALQ -!!!!---------1111------1111---------- >FERRITIN HEAVY CHAIN; SWP:P02794; PDB:2CIHA; TSQVRQNYHQDSEAAINRQINLDLYASYVYLSMSYYFDRDDVALKNFAKYFLHQSHEERE -1111---------------------------------1111------------------ HAEKLMKLQNQRGGRIFLQDIQKPDCDDWESGLNAMECALHLEKNVNQSLLELHKLATDK ---------1111--------------------------------------------111 NDPHLCDFIETHYLNEQVKAIKELGDHVTNLRKMGAPESGLAEYLFDKHTLG 1--------------------------------------------------- >HEXOSE-6-PHOSPHATE MUTARO; SWP:Q03161; PDB:2CIRA; PIKETDKEVVLTHPADETTSVHILKYGATVYSWKLKSEEQLWLSTAAKLDGSKPVRGGIP ----1111----3333---------%%%%-----iiii-----1111------------- 
LVFPVFGKNSTDEHLSKLPQHGLARNSTWEFLGQTKENPPTVQFGLKPEIANPELTKLWP -------------3333-22221111--------------------3333---------- MDYLLILTVELGSDYLKTAIEVENTSSSKELKFNWLFHTYFRIEDIEGTMVSNLAGMKLY --------------------------------------------1111-----2222--- DQLLKESYVDKHPVVTFNQETDVIYQNVSAERAIQIVDKGVQIHTLKRYNLPDTVVWNPW ----------------------------1111-----%%%%------------------- IEKSQGMADFEPKTGYQQMICIEPGHVHDFISLAPGKKWNAYQLLKE 3333--1111-11111111--------------2222---------- >ENDOGLUCANASE H; SWP:P16218; PDB:2CITA; LKIGAWVGTQPSESAIKSFQELQGRKLDIVHQFINWSTDFSWVRPYADAVYNNGSILMIT --------------------------------------3333--------1111------ WEPWEYNTVDIKNGKADAYITRMAQDMKAYGKEIWLRPLHAANGDWYPWAIGYSSRVNTN --1111-------1111-----------------------1111--1111--3333---- ETYIAAFRHIVDIFRANGATNVKWVFNVNCDNVGNGTSYLGHYPGDNYVDYTSIDGYNWG --------------11113333-----------2222--1111-3333------------ TTQSWGSQWQSFDQVFSRAYQALASINKPIIIAEFASAEIGGNKARWITEAYNSIRTSYN --3333----3333--------1111--------------------------------11 KVIAAVWFHENKETDWRINSSPEALAAYREAIGA 11---------------------------3333- >IMPORT INNER MEMBRANE TRA; SWP:P53220; PDB:2CIUA; SGDTQLFNRAVSMVEKNKDIRSLLQCDDGITGKERLKAYGELITNDKWTRNRPIVSTKKL ---------------------1111---1111---------------------------- DKEGRTHHYMRFHVESKKKIALVHLEAKESKQNYQPDFINMYVDVPGEKRYYLIKPKLHP 1111-----------1111-------------------------2222------------ VSN --- >CHLOROPEROXIDASE; SWP:P04963; PDB:2CIWA; EPGSGIGYPYDNNTLPYVAPGPTDSRAPCPALNALANHGYIPHDGRAISRETLQNAFLNH 11111111------------1111----3333---1111--1111--------------- MGIANSVIELALTNAFVVCEYVTGSDCGDSLVNLTLLAEPHAFEHDHSFSRKDYKQGVAN ---3333----------------------------1111---------------2222-1 SNDFIDNRNFDAETFQTSLDVVAGKTHFDYADMNEIRLQRESLSNELDFPGWFTESKPIQ 111------------------2222-----------------------2222-------- NVESGFIFALVSDFNLPDNDENPLVRIDWWKYWFTNESFPYHLGWHPPSPAREIEFVTSA ------------1111-3333------------------3333---------3333---- SSAVLAASVTSTPSSLPSGAIGPGAEAVPLSFASTMTPFLLATNAPYYAQDPTLGPND 
----------------2222------------1111------------------1111 >INVERTASE INHIBITOR; SWP:O49908; PDB:2CJ4A; NNLVETTCKNTPNYQLCLKTLLSDKRSATGDITTLALIMVDAIKAKANQAAVTISKLRHS ------------------------3333----------------------------1111 NPPAAWKGPLKNCAFSYKVILTASLPEAIEALTKGDPKFAEDGMVGSSGDAQECEEYFKG --3333-------------------------------------------------1111- SKSPFSALNIAVHELSDVGRAIVRNLL ---------------------3333-- >SERYL-TRNA SYNTHETASE; SWP:NA; PDB:2CJAA; KLQFNLKAYFKKDAIAALFEEANSTLLTRGAPEGQGAKVTEWKLRIELTLQSGRYVRVHD ------------111111113333--------------------------------3333 AIFRLRKQLAEALGKKYKIGIRGIEVESFIIKVPADHELRLKVPYIKSENIEGGIQLELE ------------------------------------------2222----3333------ VGEAEKNRVPDRILTLLEEKIEAAQYGAKAEHWNLLWQREPEHPFKEDPTQAKEGWLKRG -3333------------------------------------------3333--------- SSRGQWIHGPQSARIFRTFEKIVLEELLEPLGYREIFPKLVTWEVWKSGHAKGVYPEIYY ----------------------------1111---------3333--------3333--- VCPPQTRDPDYWEEVADYYKVTHEVPTKLIKEKIAEPIGGCYAQCPPFWYVAGETLPNEE -------3333-------------------1111----------3333--------3333 IPVKVFDRSGTSHRYESGGIHGIERVDEFHRIEIVWIGTKEEVLKCAEELHDRYHIFNDI ---------------------3333----------------------------------- LDIEWRKARVNTVGTTDYEACLPYRGPDGEWLEFQNVSINGDKYPKGFNVKLQSGDELWS ---------------------33331111----------!!!!--------3333----- GCSGVGLERWAAVFLAQKGLDPANWPEEFRNRVGEPKGIRFL --------------------3333-3333------------- >L-LYSINE-EPSILON AMINOTRA; SWP:P63509; PDB:2CJGA; TTPDRVHEVLGRSMLVDGLDIVLDLTRSGGSYLVDAITGRRYLDMFTFVASSALGMNPPA -3333-----------------------!!!!-------------%%%%--------333 LVDDREFHAELMQAALNKPSNSDVYSVAMARFVETFARVLGDPALPHLFFVEGGALAVEN 3------------------3333------------------1111--------------- ALKAAFDWKSRHNQAHGIDPALGTQVLHLRGAFHGRSGYTLSLTNTKPTITARFPKFDWP -------------1111-1111------2222----3333------33332222------ RIDAPYMRPGLDEPAMAALEAEALRQARAAFETRPHDIACFVAEPIQGEGGDRHFRPEFF ---------------------------------2222----------1111----3333- AAMRELCDEFDALLIFDEVQTGCGLTGTAWAYQQLDVAPDIVAFGKKTQVCGVMAGRRVD 
-------1111------------1111--3333-----------!!!!-------!!!!- EVADNVFAVPSRLNSTWGGNLTDMVRARRILEVIEAEGLFERAVQHGKYLRARLDELAAD ----11112222------------------------------------------------ FPAVVLDPRGRGLMCAFSLPTTADRDELIRQLWQRAVIVLPAGADTVRFRPPLTVSTAEI 1111------!!!!-------------------------------------1111----- DAAIAAVRSALPVVT -----------3333 >RADIALIS; SWP:Q58FS3; PDB:2CJJA; GRPWSAKENKAFERALAVYDKDTPDRWANVARAVEGRTPEEVKKHYEILVEDIKYIESGK -------------------1111------------------------------------- VPF --- >NUCLEAR POLYADENYLATED RN; SWP:Q99383; PDB:2CJKA; KESCKMFIGGLNWDTTEDNLREYFGKYGTVTDLKIMKDPATGRSRGFGFLSFEKPSSVDE 3333-------3333--------3333--------------------------3333--- VVKTQHILDGKVIDPKRAIPRDEQDKTGKIFVGGIGPDVRPKEFEEFFSQWGTIIDAQLM -------iiii------------------------2222--------3333--------- LDKDTGQSRGFGFVTYDSADAVDRVCQNKFIDFKDRKIEIKRAEPRH 1111-3333--------3333-------------------------- >SECRETED CHITINASE; SWP:Q8CK55; PDB:2CJLA; FVVSEAQFDQMFPSRNSFYTYSGLTAALSAYPGFSNTGSDTVKKQEAAAFLANVGHETGG -----------11113333-------33331111-------------------------- LVYVVEQNTANYPHYCDASQPYGCPAGNDKYYGRGPVQLSWNFNYKAAGDALGIDLLNNP -------33333333-1111---3333-------1111----------------3333-3 DLVQNDSAVAWKTGLWYWNTQTGPGTMTPHDAMVNGAGFGETIRSINGSLECDGGNPGQV 333------------------!!!!------------3333------1111iiii----- QSRIDNYERFTQLLGVEPGGNLSC ------------------------ >EPOXIDE HYDROLASE; SWP:Q41415; PDB:2CJPA; KKIEHKMVAVNGLNMHLAELGEGPTILFIHGFPELWYSWRHQMVYLAERGYRAVAPDLRG ---------iiii---------------------3333--------1111-------222 YGDTTGAPLNDPSKFSILHLVGDVVALLEAIAPNEEKVFVVAHDWGALIAWHLCLFRPDK 2------111111113333------------1111---------------------3333 VKALVNLSVHFSKRNPKMNVVEGLKAIYGEDHYISRFQVPGEIEAEFAPIGAKSVLKKIL --------------1111----------1111-3333----------------------- TYRDPAPFYFPKGKGLEAIPDAPVALSSWLSEEELDYYANKFEQTGFTGAVNYYRALPIN ----------2222-1111---33333333----------------3333--3333---- WELTAPWTGAQVKVPTKFIVGEFDLVYHIPGAKEYIHNGGFKKDVPLLEEVVVLEGAAHF 
---3333-------------111133332222------------1111-----------3 VSQERPHEISKHIYDFIQKF 333----------------- >RNA-DIRECTED RNA POLYMERA; SWP:Q96662; PDB:2CJQA; VIREHNKWILKKVRHQGNLNTKKTLNPGKLSEQNIYNNQIGTIMTEAGIRLEKLPVVTDT --1111-3333-------------------------3333---------3333------- KSFHEAIRDKIDKNENQQSPGLHDKLLEIFHTIAQPSLRHTYSDVTWEQLEAGVNRKGAA ------------------2222-------1111-1111-----------1111-1111-- GFLEKKNVGEVLDSEKHLVEQLIRDLKTGRKIRYYETAIPKNPRVIQYPEAKTRLAITKV -------------------------1111------------------------------- MYNWVKQQPVVIPGYEGKTPLFNIFNKVRKEWDLFNEPVAVSFDTKNWDTQVTSRDLRLI ---1111----222211111111-------1111------------3333--3333---- GEIQKYYYRKEWHKFIDTITDHMVEVPVITADGEVYIRNGQRGSGQPDTSAGNSMLNVLT --------3333---------3333----1111---------3333-------------- MMYAFCESTGVPYKSFNNRVARIHVCGDDGFLITEKGLGLKFANNGMQILHEAGKPQKIT -----------3333----------!!!!---------------------1111-----2 EGERMKVAYRFEDIEFCSHTPVPVRWSDNTSSYMAGRDTAVILSKMATRLGTIAYEKAVA 222------3333--%%%%------1111----------------------1111----- FSFLLMYSWNPLVRRICLLVLSQQPETTPSTQTTYYYKGDPIGAYKDVIGKNLCELKRTG ------1111---------11113333------------------------3333----- FEKLANLNLSLSTLGIWSKHTSKRIIQDCVTIGKEEGNWLVNADRLISSKTGHLYIPDKG -----------1111-----3333--------------------3333------------ YTLQGK ------ >UNC-13 HOMOLOG A; SWP:NA; PDB:2CJTA; VMSLLCVGVKKAKFDGAQEKFNTYVTLKVQNVKSTTIAVRGSQPSWEQDFMFEINRLDLG ----------------1111--------iiii---------------------------- LTVEVWNKGLIWDTMVGTVWIPLRTIRQSNEEGPGEWLTLDSGTKDPTFHRILLDAHFE ---------------------3333---------------------------------- >NQ16-113.8 ANTI-PHOX ANTI; SWP:Q65ZQ7; PDB:2CJUH; EVKLVESGGGLVQPGGSLRLSCATSGFTFTNYYMNWVRQPPGKALEWLVSIRNKA ------------2222------------1111----------------------- ------------------------------------------------------- >GTP-BINDING PROTEIN GEM; SWP:P55040; PDB:2CJWA; MTYYRVVLIGEQGVGKSTLANIFAGVHDSEVLGEDTYERTLMVDGESATIILLDMWENEW ----------2222---------------3333---------%%%%------------33 LHDHCMQVGDAYLIVYSITDRASFEKASELRIQLRRARQTEDIPIILVGNKSDLVRREVS 
33-3333---------1111-----------------1111---------3333------ VSEGRAAVVFDKFIETSAAVQHNVKELFEGIVRQVRLRRDSKEKNERRLAYQKR -------1111-----------------------------3333---------- >RING-HYDROXYLATING DIOXYG; SWP:Q65AT1; PDB:2CKFA; MSGDTTLVDTVNASQSRQVFWDRDVYDLEIERIFSRAWLMLGHKSLLPKPGDFITTYMAE ---------------3333-----------------------3333--2222-----!!! DKIILSHQSDGTFRAFINSCTHRGNQICHADSGNAKAFVCNYHGWVYGQDGSLVDVPLES !------3333------------------------------------1111--------- RCYHNKLDKQELAAKSVRVETYKGFIFGCHDPEAPSLEDYLGEFRFYLDTIWEGGGAGLE --%%%%-3333----------iiii-----1111-------------------------- LLGPPMKSLLHCNWKVPVENFVGDGYHVGWTHAAALGQIGGPLAGLAGNRADDLGLQFTT ------------3333--------3333-1111-------3333-2222----------- RHGHGFGVIDNAAAAIHRKGDGWNKYLEDTRGEVRRKFGADRERLYVGHWNGAIFPNCSF -----------1111------------------------1111-1111------------ LYGTNTFKIWHPRGPHEIEVWTYTMVPSDADPATKSAIQREATRTFGTAGTLESDDGENM -------------1111---------1111---------------------3333----- SSATYVNRGVITRDGMMNSTMGVGYEGPHPVYPGIVGISFIGETSYRGFYRFWKEMIDAP ---3333--3333------2222------------------------------------- DWASVKANDDNWDSVFTNRNFWNEKLNA 3333---3333-3333-1111--1111- >Ring-hydroxylating dioxyg; SWP:Q65AT0; PDB:2CKFB; QVPVTPDVHYAVEAHYRAEVRLLQTGQYREWLHGMVAEDIHYWMPIYEQRFVRDRRPDPT ------------------------------------1111----------3333-----1 PDDAAIYNDDFEELKQRVERLYSGQVWMEDPPSKIRYFVSNVEAFEAENGELDVLSNILV 111--------------3333----3333------------------iiii--------- YRNRRQTEVTVHTLGREDKLRQDGNGFKVFRRKLILDARVTQDKNLYFFC -------------------------------------------------- >SENTRIN-SPECIFIC PROTEASE; SWP:Q9P0U3; PDB:2CKGA; EFPEITEEMEKEIKNVFRNGNQDEVLSEAFRLTITRKDIQTLNHLNWLNDEIINFYMNML -------------1111---1111----iiii---------------------------- MERSKEKGLPSVHAFNTFFFTKLKTAGYQAVKRWTKKVDVFSVDILLVPIHLGVHWCLAV -----2222------3333-------3333333322221111---------!!!!----- VDFRKKNITYYDSMGGINNEACRILLQYLKQESIDKKRKEFDTNGWQLFSKKSQIPQQMN --1111------------------------------------2222-------------3 GSDCGMFACKYADCITKDRPINFTQQHMPYFRKRMVWEILHRKLL 
333-----------1111-----3333------------------ >ULILYSIN; SWP:Q8TL28; PDB:2CKIA; REIVKIPVVVHVVWNEEEENISDAQIQSQIDILNKDFRKLNSDVSQVPSVWSNLIADLGI ---------------3333---------------------1111---33331111----- EFFLATKDPNGNQTTGITRTQTSVTFFTTSDEVKFASSGGEDAWPADRYLNIWVCHVLKS -------1111----------------111111111111-----1111-----------1 EIGQDILGYAQFPGGPAETDGVVIVDAAFGTTGTALPPFDKGRTATHEIGHWLNLYHIWG 111------------3333-----1111---!!!!---------------1111--1111 DELRFEDPCSRSDEVDDTPNQADPNFGAPSYPHVSCSNGPNGDFNYDYVDDKCVFTQGQA --11111111----1111----------------%%%%-----------3333------- TRVNACLDGPRSSFLA --------1111---- >KIN17; SWP:O60870; PDB:2CKKA; TARTDYWLQPEIIVKIITKKLGEKYHKKKAIVKEVIDKYTAVVKMIDSGDKLKLDQTHLE ------------------11111111----------------------------3333-- TVIPAPGKRILVLNGGYRGNEGTLESINEKTFSATIVIETGPLKGRRVEGIQYEDISKLA ----2222------1111----------1111--------1111-------3333----- >POLYCOMB GROUP RING FINGE; SWP:P25916; PDB:2CKLA; RIKITELNPHLMCVLCGGYFIDATTIIECLHSFCKTCIVRYLETSKYCPICDVQVHKTRP --33333333------------------------------3333--------------33 LLNIRSDKTLQDIVYKLVPGLFKNEMKRRRDFYAAHPS 33---------------2222----------------- >E3 ubiquitin-protein liga; SWP:Q9CQJ4; PDB:2CKLB; KTWELSLYELQRTPQEAITDGLEIVSLHSELMCPICLDMLKNTMTTKECLHRFCADCIIT -----3333-----------------1111-----------------------3333--- ALRSGNKECPTCRKKLVSKRSLRPDPNFDALISKIY -----------------1111--------------- ------------------------------------------------------------ ----------------------------------- >RNA-DIRECTED RNA POLYMERA; SWP:Q69014; PDB:2CKWA; DEFQWKGLPVVKSGLDVGGMPTGTRYHRSPAWPEEQPGETHAPAPFGSGDKRYTFSQTEM ----iiii--------------------1111---2222-------1111---------- LVNGLKPYTEPTAGVPPQLLSRAVTHVRSYIETIIGTHRSPVLTYHQACELLERTTSCGP ------------------------------------------------33331111--11 FVQGLKGDYWDEEQQQYTGVLANHLEQAWDKANKGIAPRNAYKLALKDELRPIEKNKAGK 11--1111-----------------------1111--------------------1111- RRLLWGCDAATTLIATAAFKAVATRLQVVTPMTPVAVGINMDSVQMQVMNDSLKGGVLYC 
--------------------------1111-----2222-----------3333------ LDYSKWDSTQNPAVTAASLAILERFAEPHPIVSCAIEALSSPAEGYVNDIKFVTRGGLPS ----3333--------------1111--3333--------------!!!!--------11 GMPFTSVVNSINHMIYVAAAILQAYESHNVPYTGNVFQVETIHTYGDDCMYSVCPATASI 11-----------------------1111-----1111------!!!!-----3333--- FHTVLANLTSYGLKPKPTNTPVFLKRTFTQTPHGIRALLDITSITRQFYWLKANRTSDPS --------1111----------%%%%----1111-----33333333----------111 SPPAFDRQARSAQLENALAYASQHGPVMFDTVRQIAIKTAQGEGLVLVNTNYDQALATYN 1-------------------3333------------------------------------ AWFIGGT ------- >DNA-DIRECTED RNA POLYMERA; SWP:P47076; PDB:2CKZA; MKVLEERNAFLSDYEVLKFLTDLEKKHLWDQKSLAALRPYNHPELQGITRNVVNYLSIN ------------------------1111--------------3333--------1111- >PUTATIVE LAMINARINASE; SWP:Q874E3; PDB:2CL2A; ATYHLEDNWVGSAFLSTFTHEAIADPTHGRVNYVDQATALAKNLTYASGDTLILRADHTT ---------!!!!-----------1111-------------------------------- TLSPSGPGRNSVRIRSIKTYTTHVAVFDVRHMPQGCGTWPAAWETDEGDWPNGGEVDIIE --1111----------------------------2222-------3333-1111------ GVNDQSPNAMTLHTGANCAMPASRTMTGHATNNNCDVNTDGNTGCGVQAPTANSYGPSFN -%%%%------------------------------3333%%%%-------1111-----1 ANGGGWYAMERTNSFIKVWFFPRNAGNVPNDIASGPATINTDNWGTPTAFFPNTNCDIGS 111--------1111------1111---3333-------3333-------------3333 HFDANNIIINLTFCGDWAGQASIFNGAGCPGSCVDYVNNNPSAFANAYWDIASVRVYQ ----------------111133331111-------------1111------------- >CATECHOL O-METHYLTRANSFER; SWP:P22734; PDB:2CL5A; GDTKEQRILRYVQQNAKPGDPQSVLEAIDTYCTQKEWAMNVGDAKGQIMDAVIREYSPSL ----------------2222----------------------3333-------------- VLELGAYCGYSAVRMARLLQPGARLLTMEMNPDYAAITQQMLNFAGLQDKVTILNGASQD -----!!!!------11112222-----------------------1111------3333 LIPQLKKKYDVDTLDMVFLDHWKDRYLPDTLLLEKCGLLRKGTVLLADNVIVPGTPDFLA 3333-----------------3333--------1111--2222----------------- YVRGSSSFECTHYSSYLEYMKVVDGLEKAIYQGPS 11111111--------2222--------------- >DPS-LIKE PROTEIN; SWP:P95855; PDB:2CLBA; QEPKVVGVEILEKSGLDIKKLVDKLVKATAAEFTTYYYYTILRHLTGEGEGLKEIAEDAR 
----3333--3333----------------------------------3333-------- LEDRLHFELTQRIYELGGGLPRDIRQLADISACSDAYLPENWKDPKEILKVLLEAEQCAI -------------1111-----------------------3333---------------- RTWKEVCDTYGKDPRTYDLAQRILQEEIEHEAWFLELLYGRPSGH --------2222--------------------------------- >AFLATOXIN B1 ALDEHYDE RED; SWP:O95154; PDB:2CLPA; RPATVLGAMEMGRRMDAPTSAAVTRAFLERGHTEIDTAFVYSEGQSETILGGLGLRLGGS -------1111----------------1111------1111iiii----1111------- DCRVKIDTKAIPLFGNSLKPDSLRFQLETSLKRLQCPRVDLFYLHMPDHSTPVEETLRAC ------------!!!!-------------------------------11113333----- HQLHQEGKFVELGLSNYAAWEVAEICTLCKSNGWILPTVYQGMYNAITRQVETELFPCLR ---1111----------3333-----------------------11113333-------- HFGLRFYAFNPLAGGLLTGKYKYEDKDGKQPVGRFFGNTWAEMYRNRYWKEHHFEGIALV ---------1111-1111---1111-------1111-1111--------3333------- EKALQAAYGASAPSMTSATLRWMYHHSQLQGAHGDAVILGMSSLEQLEQNLAAAEEGPLE --------1111-----------------3333---------3333-------------3 PAVVDAFNQAWHLVAHECPNYFR 333----------3333------ >MITOGEN-ACTIVATED PROTEIN; SWP:Q99683; PDB:2CLQA; LLEYDYEYDENGDRVVLGKGTYGIVYAGRDLSNQVRIAIKEIPERDSQPLHEEIALHKHL --------1111--------------------------------------------1111 KHKNIVQYLGSFSENGFIKIFMEQVPGGSLSALLRSKWGPLKDNEQTIGFYTKQILEGLK -1111--------iiii-----------------------1111---------------- YLHDNQIVHRDIKGDNVLINTYSGVLKISDFGTSKRLAGETFTGTLQYMAPEIIDKGPRG ------------3333----------------------------3333-3333---3333 YGKAADIWSLGCTIIEMATGKPPFYELGEPQAAMFKVGMFKVHPEIPESMSAEAKAFILK -3333----------------2222---3333--------------1111--------33 CFEPDPDKRACANDLLVDEFLKV 33--1111---------3333-- >RHO-RELATED GTP-BINDING P; SWP:Q92730; PDB:2CLSA; VVARCKLVLVGDVQCGKTAMLQVLAKDCYPETYVPTVFENYTACLETEEQRVELSLWDTS -----------2222--------------------------------------------- GSPYYDNVRPLCYSDSDAVLLCFDISRPETVDSALKKWRTEILDYCPSTRVLLIGCKTDL -3333-------2222-------1111-3333-------------1111---------33 RTDLSTLMELSHQKQAPISYEQGCAIAKQLGAEIYLEGSAFTSEKSIHSIFRTASMLCL 33------------------------------------3333----------------- >ATP SYNTHASE 
B CHAIN, MIT; SWP:P13619; PDB:2CLYA; GEFADKLNEQKIAQLEEVKQASIKQIQDAIDMEKSQQALVQKRHYLFDVQRNNIAMALEV -----------------------------------------3333--------------- TYRERLHRVYREVKNRLDYHISVQNMMRQKEQEHMINWVEKRVVQ -----------------------------------------1111 >ATP synthase D chain, mit; SWP:P13620; PDB:2CLYB; KLALKTIDWVAFGEIIPRNQKAVANSLKSWNETLTSRLATLPEKPPAIDWAYYKANVAKA ----------------1111----------------3333-------------------- GLVDDFEKKFNALKVPIPEDKYTAQVDAEEKEDVKSCAEFLTQSKTRIQEYEKELEKMRN ------------------------------------------------------------ >TYROSINE-PROTEIN PHOSPHAT; SWP:P18031; PDB:2CM2A; HHEMEKEFEQIDKSGSWAAIYQDIRHEASDFPCRVAKLPKNKNRNRYRDVSPFDHSRIKL --------------------------------3333-3333-----1111--3333---- HQEDNDYINASLIKMEEAQRSYILTQGPLPNTCGHFWEMVWEQKSRGVVMLNRVMEKGSL ---------------3333---------1111-----------------------%%%%- KCAQYWPQKEEKEMIFEDTNLKLTLISEDIKSYYTVRQLELENLTTQETREILHFHYTTW --------1111------------------1111-------------------------- PDFGVPESPASFLNFLFKVRESGSLSPEHGPVVVHCSAGIGRSGTFCLADTCLLLMDKRK ----------------------1111---------------------------------- DPSSVDIKKVLLEMRKFRMGLIQTADQLRFSYLAVIEGAKFIMG -1111--------3333--------------------------- >RABPHILIN-3A; SWP:P47709; PDB:2CM5A; ALYEEEIEERGKILVSLYSTQQGGLIVGIIRCVHLAADANGYSDPFVKLWLKPDKAKHKT --1111-------------1111--------------1111------------------- QIKKKTLNPEFNEEFFYDIKHSDLAKKSLDISVWDYDIGKSNDYIGGCQLGISAKGERLK -------------------3333---------------------------1111------ HWYECLKNKDKKIERWHQLQN --------------------- >MALATE DEHYDROGENASE; SWP:P61889; PDB:2CMD; MKVAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSG ------1111--------------2222-------3333------1111----------- EDATPALEGADVVLISAGVRRKPGMDRSDLFNVNAGIVKNLVQQVAKTCPKACIGIITNP --33332222-----------22223333-------------------1111-------3 VNTTVAIAAEVLKKAGVYDKNKLFGVTTLDIIRSNTFVAELKGKQPGEVEVPVIGGHSGV 333---------------1111----------------------1111---------111 TILPLLSQVPGVSFTEQEVADLTKRIQNAGTEVVEAKAGGGSATLSMGQAAARFGLSLVR 
1---33332222------------------------iiii-------------------- ALQGEQGVVECAYVEGDGQYARFFSQPLLLGKNGVEERKSIGTLSAFEQNALEGMLDTLK 1111--------------------------1111-------------------------- KDIALGQEFVNK ------------ >HYPOTHETICAL PROTEIN 5; SWP:P59636; PDB:2CMEA; VPPALHLVDPQIQLTITADPKVYPIILRLGSNLSLSMARRNLDSLEARAFQSTPIVVQMT --------3333------------------------------------------------ KLATTEELPDEFVVVTAK ---3333----------- >IGKC protein; SWP:Q6GMX8; PDB:2CMRH; QLVQSGAEVRKPGASVKVSCKASGDTFSSYAISWVRQAPGQGLEWMGGIIPIFGTANYAQ ----------2222--------3333-----------2222--------3333-----33 AFQGRVTITANESTSTAYMELSSLRSEDTAIYYCARDNPTLLGSDYWGAGTLVTVSSAST 33--------3333----------1111-------------------------------- KGPSVFPLAGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSSVVTV -----------------------------%%%%--2222-------3333---------- PSSTQTYICNVNHKPSNTKVDKRV -------------1111------- >IGKC protein; SWP:Q6GMX8; PDB:2CMRL; DIQMTQSPSTLSASIGDRVTITCRASEGIYHWLAWYQQKPGKAPKLLIYKASSLASGAPS -------------2222---------------------2222------------222233 RFSGSGSGTDFTLTISSLQPDDFATYYCQQYSNYPLTFGGGTKLEIKRTVAAPSVFIFPP 33----------------3333-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 33331111---------------------------------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKS -33333333--------1111------- >PUTATIVE PEPTIDYL-ARGININ; SWP:O24890; PDB:2CMUA; SLKRLAEFEKIQAILAFPHEFSDWAYCIKEARESFLNIIQTIAKHAKVLVCVHTNDTIGY -----1111-------------3333---------------1111-------11113333 ELKNLPGVEIAKVDTNDTWARDFGAISIENHGVLECLDFGFNGWGLKYPSNLDNQVNFKL -1111-------------3333-------iiii---------%%%%-----3333----- KSLGFLKHPLKTPYVLEGGSIESDGAGSILTNTQCLLEKNRNPHLNQNGIETLKKELGAK 1111------------1111-----------3333--11111111--------------- QVLWYSYGYLKGDDTDSHTDTLARFLDKDTIVYSACEDKNDEHYTALKKQEELKTFKKLD ---------2222----1111-----1111-------1111-----------1111-111 KTPYKLIPLEIPKAIFDENQQRLPATYVNFLLCNDALIVPTYNDPKDALILETLKQHTPL 1---------------1111------------!!!!-------1111------1111--- 
EVIGVDCNTLIKQHGSLHCVTQLYEGG --------3333----3333------- >CASEIN KINASE I ISOFORM G; SWP:Q9HCP0; PDB:2CMWA; MRVGKKIGCGNFGELRLGKNLYTNEYVAIKLEPIKSRAPQLHLEYRFYKQLGSAGEGLPQ -2222----1111-------------------1111------------------------ VYYFGPGKYNAMVLELLGPSLEDLFDLCDRTFTLKTVLMIAIQLLSRMEYVHSKNLIYRD -------------------------1111------------------------------- VKPENFLIGRQGNKKEHVIHIIDFGLAKEYIDPETKKHIPYREHKSLGTARYMSINTHLG -3333----3333-1111-----1111---------------------3333-3333--- KEQSRRDDLEALGHMFMYFLRGSLPWQGLKADTLKERYQKIGDTKRNTPIEALCENFPEE ---3333----------------1111--------------------------2222--- MATYLRYVRRLDFFEKPDYEYLRTLFTDLFEKKGYTFDYAYDWVGRPIPTPVGS -------11111111---------------1111------3333---------- >HYPOTHETICAL 13.2 KDA PRO; SWP:P20220; PDB:2CMXA; TLNSYKAEIYKILEKKGELTLEDILAQFEISVPSAYNIQRALKAICERHPDECEVQYKNR --3333-------------3333-------------------------1111-----333 KTTFKWIK 3------- >SPIKE GLYCOPROTEIN; SWP:P04883; PDB:2CMZA; KFTIVFPHNQKGNWKNVPSNYHYCPSSSDLNWHNDLIGTAIQVKMPKSHKAIQADGWMCH -----------------1111--1111-3333---------------------------- ASKWVTTCDFRWYGPKYITQSIRSFTPSVEQCKESIEQTKQGTWLNPGFPPQSCGYATVT --------------------------------------1111------------------ DAEAVIVQVTPHHVLVDEYTGEWVDSQFINGKCSNYICPTVHNSTTWHSDYKVKGLCDSN ------------------------3333iiii-------------------3333--111 LISMDITFFSEDGELSSLGKEGTGFRSNYFAYETGGKACKMQYCKHWGVRLPSGVWFEMA 1------------3333---------3333----2222----%%%%----3333------ DKDLFAAARFPECPEGSSISAPSQTSVDVSLIQDVERILDYSLCQETWSKIRAGLPISPV -----3333-------------3333-------------------------------333 DLSYLAPKNPGTGPAFTIINGTLKYFETRYIRVDIAAPILSRMVGMISGTTTERELWDDW 31111-------------iiii------------------------2222---------- APYEDVEIGPNGVLRTSSGYKFPLYMIGHGMLDSDLHLSSKAQVFEHPH --!!!!--2222---1111------------------------------ >CYTOSOLIC 5'-NUCLEOTIDASE; SWP:Q9H0P0; PDB:2CN1A; NPTRVEEIICGLIKGGAAKLQIITDFDMTLSRFSYKGKRCPTCHNIIDNCKLVTDECRKK ----------------1111-----2222-----iiii-----------1111------- 
LLQLKEKYYAIEVDPVLTVEEKYPYMVEWYTKSHGLLVQQALPKAKLKEIVAESDVMLKE ----------1111-----3333-------------------3333----1111------ GYENFFDKLQQHSIPVFIFSAGIGDVLEEVIRQAGVYHPNVKVVSNFMDFDETGVLKGFK ---------1111------------------1111--1111---------3333------ GELIHVFNKHDGALRNTEYFNQLKDNSNIILLGDSQGDLRMADGVANVEHILKIGYLNDR ----1111--------------3333--------3333-1111----------------3 VDELLEKYMDSYDIVLVQDESLEVANSILQKIL 333----3333---------------------- >BETA-1,4-XYLOGLUCAN HYDRO; SWP:Q70DK5; PDB:2CN2A; VTSVPYKWDNVVIGGGGGFPGIVFNETEKDLIYARADIGGAYRWDPSTETWIPLLDHFQD ---------------------------2222-----------------------3333-- EYSYYGVESIATDPVDPNRVYIVAGYTNDWLPNGAILRSTDRGETWEKTILPFKGGNPGR 3333-----------3333---------------------iiii---------------- SGERLAIDPNDNRILYLGTRCGNGLWRSTDYGVTWSKVESFPNPGTIIGVVWVVFDKSSS -------------------iiii------iiii----1111--------------1111- TPGNPTKTIYVGVADKNESIYRSTDGGVTWKAVPGQPKGLLPHHGVLASNGLYITYGDGK 2222----------3333------iiii----2222-----------1111--------- GQVWKFNTRTGEWIDITPIPYSSSDNRFCFAGLAVDRQNPDIIVTSNAWWPDEYIFRSTD -------------------1111------------1111--------------------i GGATWKNIWEWGYPERILHYEIDISAAPWLDWGTEKQLPEINPKLGWIGDIEIDPFNSDR iii--------------------1111---%%%%-------------------1111--- YVTGATIYGCDNLTDWDRGGKVKIEVKATGIEECAVLDLVSPPEGAPLVSAVGDLVGFVH -----------33331111--------2222----------------------------- DDLKVGPKKHVPSYSSGTGIDYAELVPNFALVAKAVKKISFSYDGGRNWFQPPNEAPNSV -1111-----1111--------1111-----------------iiii------------- GGGSVAVAADAKSVIWTPENASPAVTTDNGNSWKVCTNLGGAVVASDRVNGKKFYAFYNG -------1111----------------iiii----2222----------1111----iii KFYISTDGGLTFTDTKAPQLPKSVNKIKAVPGKEGHVWLAAREGGLWRSTDGGYTFEKLS i-----iiii-------------------2222-------!!!!------iiii----11 NVDTAHVVGFGKAAPGQDYAIYITGKIDNVLGFFRSDDAGKTWVRINDDEHGYGAVDTAI 11-----------2222---------%%%%-------iiii------1111--------- TGDPRVYGRVYIATNGRGIVYGEPAS -----2222----------------- >BETA-1,4-XYLOGLUCAN HYDRO; SWP:Q70DK5; PDB:2CN3A; VTSVPYKWDNVVIGGGGGFMPGIVFNETEKDLIYARAAIGGAYRWDPSTETWIPLLDHFQ 
----------------------------2222--------------1111-----33333 MDEYSYYGVESIATDPVDPNRVYIVAGMYTNDWLPNMGAILRSTDRGETWEKTILPFKMG 3331111----------3333--------------------------------------- GNMPGRSMGERLAIDPNDNRILYLGTRCGNGLWRSTDYGVTWSKVESFPNPGTYIYDPNF ----1111------------------iiii------iiii----1111--------1111 DYTKDIIGVVWVVFDKSSSTPGNPTKTIYVGVADKNESIYRSTDGGVTWKAVPGQPKGLL 1111----------3333-2222----------3333------iiii----2222----- PHHGVLASNGMLYITYGDTCGPYDGNGKGQVWKFNTRTGEWIDITPIPYSSSDNRFCFAG ------1111-------------------------------------3333--------- LAVDRQNPDIIMVTSMNAWWPDEYIFRSTDGGATWKNIWEWGMYPERILHYEIDISAAPW ---1111----------------------iiii---------------------1111-- LDWGTEKQLPEINPKLGWMIGDIEIDPFNSDRMMYVTGATIYGCDNLTDWDRGGKVKIEV -%%%%--------------------1111----------------33331111------- KATGIEECAVLDLVSPPEGAPLVSAVGDLVGFVHDDLKVGPKKMHVPSYSSGTGIDYAEL -2222------------------------------1111------1111--------111 VPNFMALVAKADLYDVKKISFSYDGGRNWFQPPNEAPNSVGGGSVAVAADAKSVIWTPEN 1----------------------iiii--------------------1111--------- ASPAVTTDNGNSWKVCTNLGMGAVVASDRVNGKKFYAFYNGKFYISTDGGLTFTDTKAPQ -------iiii----22222222-------1111----iiii------------------ LPKSVNKIKAVPGKEGHVWLAAREGGLWRSTDGGYTFEKLSNVDTAHVVGFGKAAPGQDY ----------2222-------!!!!------iiii-------------------2222-- MAIYITGKIDNVLGFFRSDDAGKTWVRINDDEHGYGAVDTAITGDPRVYGRVYIATNGRG --------%%%%-------iiii------1111--------------2222--------- IVYGEPAS -------- >SERINE/THREONINE-PROTEIN ; SWP:O96017; PDB:2CN5A; HMSVYPKALRDEYIMSKTLGSGACGEVKLAFERKTCKKVAIKIISKRNVETEIEILKKLN -----33331111-------------------1111-------------------1111- HPCIIKIKNFFDAEDYYIVLELMEGGELFDKVVGNKRLKEATCKLYFYQMLLAVQYLHEN 1111----------------------3333--%%%%------------------------ GIIHRDLKPENVLLSSQEEDCLIKITDFGHSKILGETSLMRTLCGTPTYLAPEVLVSVGT -------3333----------------1111--------------3333-3333-33332 AGYNRAVDCWSLGVILFICLSGYPPFSEHRTQVSLKDQITSGKYNFIPEVWAEVSEKALD 222-----------------------------------1111----33331111------ 
LVKKLLVVDPKARFTTEEALRHPWLQDEDMKRKFQDLLSEENE --------3333---------3333-------------3333- >NADH-DEPENDENT NITRATE RE; SWP:P17571; PDB:2CND; GRIHCRLVAKKELSRDVRLFRFSLPSPDQVLGLPIGKHIFVCATIEGKLCMRAYTPTSMV ---------------------------------2222---------------------11 DEIGHFDLLVKVYFKNEHPKFPNGGLMTQYLDSLPVGSYIDVKGPLGHVEYTGRGSFVIN 11---------------1111---------3333-------------------------- GKQRNARRLAMICGGSGITPMYQIIQAVLRDQPEDHTEMHLVYANRTEDDILLRDELDRW -------------!!!!-3333-----3333---------------1111---------- AAEYPDRLKVWYVIDQVKRPEEGWKYSVGFVTEAVLREHVPEGGDDTLALACGPPPMIQF ---3333-----------3333---------3333------------------------- AISPNLEKMKYDMANSFVVF -------------------- >PHOSPHORIBOSYLAMINOIMIDAZ; SWP:P27616; PDB:2CNQA; SITKTELDGILPLVARGKVRDIYEVDAGTLLFVATDRISAYDVIMENSIPEKGILLTKLS ------iiii---------------1111----------%%%%-----2222-------- EFWFKFLSNDVRNHLVDIAPGKTIFDYLPAKLSEPKYKTQLEDRSLLVHKHKLIPLEVIV ----1111----------22223333--3333----33332222---------------- RGYITGSAWKEYVKTGTVHGLKQPQGLKESQEFPEPIFTPSTKAEHDENISPAQAAELVG -----------------iiii------2222----------------------------- EDLSRRVAELAVKLYSKCKDYAKEKGIIIADTKFEFGIDEKTNEIILVDEVLTPDSSRFW ----------------------------------------------------3333---- NGASYKVGESQDSYDKQFLRDWLTANKLNGVNGVKMPQDIVDRTRAKYIEAYETLTGSKW 3333-------------------11112222----------------------------- S - >SAFA PILUS SUBUNIT; SWP:Q8ZRK4; PDB:2CO3A; GSQKSVDIVFSSPQDLTVSLIPVSGLKAGKNAPSAKIAKLVVNSTTLKEFGVRGISNNVV -------------------------------2222------------------------- DSTGTAWRVAGKNTGKEIGVGLSSDSLRRSDSTEKWNGVNWMTFNSNDTLDIVLTGPAQN 1111-------------------3333--------iiii--------------------- VTADTYPITLDVVGY --------------- >VIRAL PROTEIN F93; SWP:Q6Q0J9; PDB:2CO5A; KYMRINYYIILKVLVINGSRLEKKRLRSEILKRFDIDISDGVLYPLIDSLIDDKILREEE --3333---------------3333-------------3333--------1111------ APDGKVLFLTEKGMKEFEELHEFFKKIVCHHH -------------------------------- >Putative fimbriae assembl; SWP:Q8ZRK3; PDB:2CO6B; LNSATKLFSVKLGATRVIYHAGTGATLSVSNPQNYPILVQSSVKAADKSSPAPFLVMPPL 
-------------------3333---------------------1111------------ FRLEANQQSQLRIVRTGGDMPTDRETLQWVCIKAVPPENEPGATLDLNLSINACDKLIFR ---2222----------------------------------------------------- PDAVKGTPEDVAGNLRWVETGNKLKVENPTPFYMNLASVTVGGKPITGLEYVPPFADKTL 3333--33333333-----!!!!-----------------iiii---------------- NMPGSHGDIEWRVITDFGGESHPFHYVL --------------1111---------- >Putative fimbriae assembl; SWP:Q8ZRK3; PDB:2CO7B; ATKLFSVKLGATRVIYHAGTAGATLSVSNPQNYPILVQSSVKAADKSSPAPFLVMPPLFR ----------------3333----------------------1111-------------- LEANQQSQLRIVRTGGDMPTDRETLQWVCIKAVPPETLDLNLSINACDKLIFRPDAVKGT -2222------------------------------------------------1111--3 PEDVAGNLRWVETGNKLKVENPTPFYMNLASVTVGGKPITGLEYVPPFADKTLNHGDIEW 3331111-----!!!!-----------------iiii----------------------- RVITDFGGESHPFHYVLK ---1111----------- >NEDD9 interacting protein; SWP:Q8TDZ2; PDB:2CO8A; GSSGSSGQHQEAGAGDLCALCGEHLYVLERLCVNGHFFHRSCFRCHTCEATLWPGGYEQH -------------------------1111---------3333------------------ PGDGHFYCLQHLPQTDSGPSSG ---------------------- >THYMUS HIGH MOBILITY GROU; SWP:Q8R4H0; PDB:2CO9A; GSSGSSGKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDGLG -----------------------3333----3333-3333---3333------------3 EEQKQVYKKKTEAAKKEYLKQLAAYRASLVSKSYTDSGPSSG 333-------3333---------33331111------3333- >PROTEIN KINASE C, D2 TYPE; SWP:Q9BZL6; PDB:2COAA; GSSGSSGTLREGWVVHYSNKDTLRKRHYWRLDCKCITLFQNNTTNRYYKEIPLSEILTVE ---------------------------------------------------3333----- SAQNFSLVPPGTNPHCFEIVTANATYFVGEMPGGTPGGPSGQGAEAARGWETAIRQALMS --------------------------------------------3333------------ GPSSG ----- >LCOR PROTEIN; SWP:Q96JN0; PDB:2COBA; GSSGSSGRGRYRQYNSEILEEAISVVMSGKMSVSKAQSIYGIPHSTLEYKVKERLGTLKN --------------3333-------------3333-------3333-------------- PPKKKMKLMR ---------- >FYVE, RHOGEF AND PH DOMAI; SWP:Q5JSP0; PDB:2COCA; GSSGSSGSLLCGPLRLSESGETWSEVWAAIPMSDPQVLHLQGGSQDGRLPRTIPLPSCKL -----------------%%%%---------3333-------------------3333--- 
SVPDPEERLDSGHVWKLQWAKQSWYLSASSAELQQQWLETLSTAAHSGPSSG ---3333-----------%%%%-------3333------------------- >CENTAURIN-DELTA 1; SWP:Q8WZ64; PDB:2CODA; GSSGSSGKVKSGWLDKLSPQGKRMFQKRWVKFDGLSISYYNNEKEMYSKGIIPLSAISTV ----------------------------------------------------3333---- RVQGDNKFEVVTTQRTFVFRVEKEEERNDWISILLNALKSQSLTSQSQASGPSSG --------------------------------------3333------------- >DEOXYNUCLEOTIDYLTRANSFERA; SWP:P04053; PDB:2COEA; GSSGSSGTGALMASSPQDIKFQDLVVFILEKKMGTTRRALLMELARRKGFRVENELSDSV --------------------------------------------------------1111 THIVAENNSGSDVLEWLQAQKVQVSSQPELLDVSWLIECIGAGKPVEMTGKHQLSGPSSG -------------------------------3333--------------3333------- >PROTEIN KIAA1914; SWP:Q8N4X5; PDB:2COFA; GSSGSSGLETSSYLNVLVNSQWKSRWCSVRDNHLHFYQDRNRSKVAQQPLSLVGCEVVPD -------%%%%------%%%%--------%%%%------%%%%----------------- PSPDHLYSFRILHKGEELAKLEAKSSEEMGHWLGLLLSESGSGPSSG ------------iiii------------------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q72IY7; PDB:2COHA; ETVSFKAGDVILYPGVPGPRDRAYRVLEGLVRLEAVDEEGNALTLRLVRPGGFFGEEALF -----2222--------1111---------------1111--------2222-------- GQERIYFAEAATDVRLEPLPENPDPELLKDLAQHLSQGLAEAYRRIERLATQRLKNRAAA -----------------------------------------------------3333--- LLELSETPLAHEEEGKVVLKATHDELAAAVGSVRETVTKVIGELAREGYIRSGYGKIQLL ------------iiii------------------------------------%%%%---- DLKGLKELAESRG -------1111-- >BRANCHED CHAIN AMINOTRANS; SWP:P54687; PDB:2COIA; VGTFKAKDLIVTPATILKEKPDPNLVFGTVFTDHMLTVEWSSEFGWEKPHIKPLQNLSLH ----3333-----------------2222------------------------------1 PGSSALHYAVELFEGLKAFRGVDNKIRLFQPNLNMDRMYRSAVRATLPVFDKEELLECIQ 111--------------------------------------------------------- QLVKLDQEWVPYSTSASLYIRPTFIGTEPSLGVKKPTKALLFVLLSPVGPYFNPVSLWAN -----3333--------------------------------------------------3 PKYVRAWKGGTGDCKMGGNYGSSLFAQCEAVDNGCQQVLWLYGEDHQITEVGTMNLFLYW 333---22221111-33331111------3333---------1111-------------- INEDGEEELATPPLDGIILPGVTRRCILDLAHQWGEFKVSERYLTMDDLTTALEGNRVRE 
-1111------------------------------------------------------- MFGSGTACVVCPVSDILYKGETIHIPTMENGPKLASRILSKLTDIQYGREERDWTIVL --------------------------1111---------------------3333--- >POLY [ADP-RIBOSE] POLYMER; SWP:P09874; PDB:2COKA; GSSGSSGDKPLSNMKILTLGKLSRNKDEVKAMIEKLGGKLTGTANKASLCISTKKEVEKM --------3333------------------------------3333-------------- NKKMEEVKEANIRVVSEDFLQDVSASTKSLQELFLAHILSSWGAEVKSGPSSG --3333---------------3333---------------------------- >NIN ONE BINDING PROTEIN; SWP:Q8BW10; PDB:2CONA; GSSGSSGVREARSYILRCHGCFKTTSDMNRVFCGHCGNKTLKKVSVTINDDGTLHMHFSR ------------------------------------------------3333-------- NPKVLNPRGLRYSSGPSSG ------------------- >Lipoamide acyltransferase; SWP:P11182; PDB:2COOA; GSSGSSGHQEIKGRKTLATPAVRRLAMENNIKLSEVVGSGKDGRILKEDILNYLEKQTGA ------------------3333---3333--3333----------3333------3333- ILPPSGPSSG ---------- >ACYL-COENZYME A BINDING D; SWP:Q9BR61; PDB:2COPA; GSSGSSGLAELFEKAAAHLQGLIQVASREQLLYLYARYKQVKVGNCNTPKPSFFDFEGKQ -------3333------3333-11113333-----3333------------33333333- KWEAWKALGDSSPSQAMQEYIAVVKKLDPGWNPQIPEKKGKEASGPSSG -----------3333-------3333----------------------- >NEW ANTIGEN RECEPTOR VARI; SWP:Q8JJ25; PDB:2COQA; ARVDQTPRIATKETGESLTINCVLRDTACALDSTNWYRTKLGSTKEQTISIGGRYSETVD ------------2222-----------------------2222--------!!!!----- EGSNSASLTIRDLRVEDSGTYKCKAYRRCAFNTGVGYKEGAGTVLTVK 1111---------1111------------------------------- >PINCH PROTEIN; SWP:P48059; PDB:2CORA; GSSGSSGEKARGLGKYICQKCHAIIDEQPLIFKNDPYHPDHFNCANCGKELTADARELKG -------------------------------%%%%--1111----------1111----- ELYCLPCHDKMGVSGPSSG ------------------- >SERINE/THREONINE PROTEIN ; SWP:Q7TSJ6; PDB:2COSA; GSSGSSGVNRQMLQELVNAGCDQEMAGRALKQTGSRSIEAALEYISKMSGPSSG ----------------3333-3333-------------------3333------ >ZINC FINGER PROTEIN 435; SWP:Q9H4T2; PDB:2COTA; GSSGSSGRSEWQQRERRRYKCDECGKSFSHSSDLSKHRRTHTGEKPYKCDECGKAFIQRS ------------------------------------3333------------------33 HLIGHHRVHTGSGPSSG 33-3333---------- >ECT2 PROTEIN; 
SWP:Q07139; PDB:2COUA; GSSGSSGFKVPPFQDCILSFLGFSDEEKHSMEEMTEMQGGSYLPVGDERCTHLIVEENTV -----------------------3333-------------------3333-----3333- KDLPFEPSKKLFVVKQEWFWGSIQMDARAGETMYLYEKANTPESGPSSG -------3333----3333---1111---3333---------------- >Beta-1,3-xylanase; SWP:Q8RS40; PDB:2COVD; PPENCQDDFNFNYVSDQEIEVYHVDKGWSAGWNYVCLNDYCLPGNKSNGAFRKTFNAVLG -3333-------------------------------%%%%------iiii-------222 QDYKLTFKVEDRYGQGQQILDRNITFTTQVCN 2----------iiii----------------- >KINESIN-LIKE PROTEIN KIF1; SWP:Q9NQT8; PDB:2COWA; GSSGSSGQALASDSEEADEVPEWLREGEFVTVGAHKTGVVRYVGPADFQEGTWVGVELDL ------------------------2222-------------------------------- PSGKNDGSIGGKQYFRCNPGYGLLVRPSRVRRATSGPSSG -----------------2222----3333----------- >DYNACTIN-1; SWP:Q14203; PDB:2COYA; GSSGSSGMAQSKRHVYSRTPSGSRMSAEASARPLRVGSRVEVIGKGHRGTVAYVGATLFA ------------------------------------------------------------ TGKWVGVILDEAKGKNDGTVQGRKYFTCDEGHGIFVRQSQIQVFEDSGPSSG -------------------iiii-------------3333------------ >CENTROSOME-ASSOCIATED PRO; SWP:Q5VT06; PDB:2COZA; GSSGSSGVEHEQQVTESPSLASVPTADELFDFHIGDRVLIGNVQPGILRFKGETSFAKGF ------------------------------------------------------------ WAGVELDKPEGNNNGTYDGIAYFECKEKHGIFAPPQKISHIPENFDDYVDINEDEDSGPS ----------------iiii-------------3333----------------------- SG -- >CLIPR-59 PROTEIN; SWP:Q96DZ5; PDB:2CP0A; GSSGSSGGNLMLSALGLRLGDRVLLDGQKTGTLRFCGTTEFASGQWVGVELDEPEGKNDG ----------3333----------%%%%-------------------------------- SVGGVRYFICPPKQGLFASVSKISKAVDASGPSSG -!!!!-------------3333------------- >CLIP-115; SWP:Q9UDT6; PDB:2CP2A; GSSGSSGAAEVGDDFLGDFVVGERVWVNGVKPGVVQYLGETQFAPGQWAGVVLDDPVGKN --------------------------%%%%------------------------------ DGAVGGVRYFECPALQGIFTRPSKLTRQPSGPSSG ---iiii-------------3333----------- >CLIP-115; SWP:Q9UDT6; PDB:2CP3A; GSSGSSGLRLGDRVLVGGTKTGVVRYVGETDFAKGEWCGVELDEPLGKNDGAVAGTRYFQ ----------------------------------------------------iiii---- CPPKFGLFAPIHKVIRIGSGPSSG ---------3333----------- >RESTIN; SWP:P30622; PDB:2CP5A; 
GSSGSSGMSMLKPSGLKAPTKILKPGSTALKTPTAVVAPVEKTISSEKASSTPSSETQEE ------------------------------------------------------------ FVDDFRVGERVWVNGNKPGFIQFLGETQFAPGQWAGIVLDEPIGKNDGSVAGVRYFQCEP ------------%%%%---------------------------------iiii------- LKGIFTRPSKLTRKVSGPSSG ------3333----------- >RESTIN; SWP:P30622; PDB:2CP6A; GSSGSSGATPPISNLTKTASESISNLSEAGSIKKGERELKIGDRVLVGGTKAGVVRFLGE ------------------------------------------------------------ TDFAKGEWCGVELDEPLGKNDGAVAGTRYFQCQPKYGLFAPVHKVTKIGFPSTTPAKAKA -----------------------iiii-------------3333---------------- NAVRRVMATTSASLKRSPSASSLSSMSSVASSVSSRPSRTGLLTETSGPSSG ----------------3333-------------------------------- >NEXT TO BRCA1 GENE 1 PROT; SWP:Q14596; PDB:2CP8A; GSSGSSGQTAALMAHLFEMGFCDRQLNLRLLKKHNYNILQVVTELLQLSGPSSG ----------------------3333-------%%%%----------------- ------------------------------------------------------------ ---- >KIAA0657 PROTEIN; SWP:O75147; PDB:2CPCA; GSSGSSGTDVSSWIVYPSGKVYVAAVRLERVVLTCELCRPWAEVRWTKDGEEVVESPALL --------------------------------------1111------------------ LQKEDTVRRLVLPAVQLEDSGEYLCEIDDESASFTVTVTEPPVRIIYSGPSSG ---------------3333---------------------------------- >APOBEC-1 STIMULATING PROT; SWP:Q9NQ94; PDB:2CPDA; GSSGSSGDEDTMSSVKILYVRNLMLSTSEEMIEKEFNNIKPGAVERVKKIRDYAFVHFSN -----------------------33333333-----3333-------------------- REDAVEAMKALNGKVLDGSPIEVTLAKPVDKDSSGPSSG --------3333---%%%%-------------------- >RNA-BINDING PROTEIN EWS; SWP:Q01844; PDB:2CPEA; GSSGSSGDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKE ------------------------------------------------------------ TGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSGPSSG -------------3333-------2222-iiii------------2222---- >RNA BINDING MOTIF PROTEIN; SWP:Q8R3C6; PDB:2CPFA; GSSGSSGLFIKNLNFSTTEETLKGVFSKVGAIKSCTISKKKNKAGVLLSMGFGFVEYKKP -------------3333--------3333------------3333--------------- EQAQKALKQLQGHTVDGHKLEVRISERATKPASGPSSG 3333---1111---iiii-------------------- ------------------------------------------- 
>RNA BINDING MOTIF PROTEIN; SWP:Q8R3C6; PDB:2CPHA; GSSGSSGQVPKKQTTSKILVRNIPFQANQREIRELFSTFGELKTVRLPKKMTGTGAHRGF -----------------------1111--------3333--------------------- GFVDFITKQDAKKAFNALCHSTHLYGRRLVLEWADSEVTVQSGPSSG -----------------------%%%%-------------------- >CCR4-NOT TRANSCRIPTION CO; SWP:Q8BT14; PDB:2CPIA; GSSGSSGASVRVVQKNLVFVVGLSQRLADPEVLKRPEYFGKFGKIHKVVINNSTSYAGSQ -----------------------3333-------11113333------------------ GPSASAYVTYIRSEDALRAIQCVNNVVVDGRTLKASLGTTKYCSYSGPSSG -----------------3333------iiii-----------3333----- >Non-POU domain-containing; SWP:Q99K48; PDB:2CPJA; GSSGSSGGEKTFTQRSRLFVGNLPPDITEEEMRKLFEKYGKAGEVFIHKDKGFGFIRLET ---3333----------------------------3333---------1111-------- RTLAEIAKVELDNMPLRGKQLRVRFACHSASLTSGPSSG ---------------iiii-------------------- >SPERM-ASSOCIATED ANTIGEN ; SWP:O75391; PDB:2CPMA; GSSGSSGQKVEFRKRMEKEVSDFIQDSGQIKKKFQPMNKIERSILHDVVEVAGLTSFSFG -------------------------------------------------3333------- EDDDCRYVMIFKKEFAPSDEELDSYRRGSGPSSG -3333-------1111--------------3333 >TAR RNA-BINDING PROTEIN 2; SWP:Q15633; PDB:2CPNA; GSSGSSGPVSPQQSECNPVGALQELVVQKGWRLPEYTVTQESGPAHRKEFTMTCRVERFI ----------------3333------1111------------------------------ EIGSGTSKKLAKRNAAAKMLLRVSGPSSG ----------------------------- >Fragile X mental retardat; SWP:P51114; PDB:2CPQA; GSSGSSGTKQLAAAFHEEFVVREDLMGLAIGTHGSNIQQARKVPGVTAIELDEDTGTFRI ---------------------3333------%%%%------------------------- YGESADAVKKARGFLEFVEDFIQVPSGPSSG ---3333-----3333--------------- >EXOSOME COMPONENT 10; SWP:Q01780; PDB:2CPRA; GSSGSSGKPIFTDESYLELYRKQKKHLNTQQLTAFQLLFAWRDKTARREDESYGYVLPNH ------------%%%%3333---------3333-------------------3333---- MMLKIAEELPKEPQGIIACCNPVPPLVRQQINEMHLLIQQAREMPLLKSEVAAGVKKSSG -----------3333-3333-----3333------------------------------- PSSG ---- >VACUOLAR SORTING PROTEIN ; SWP:O75351; PDB:2CPTA; GSSGSSGMSSTSPNLQKAIDLASKAAQEDKAGNYEEALQLYQHAVQYFLHVVKYEAQGDK ---------------------------------------------------------333 
AKQSIRAKCTEYLDRAEKLKEYLKNKEKKAQKPVKEGQPSPADEKGNDSDGSGPSSG 3-------------------------------------------------------- >CBL-INTERACTING PROTEIN S; SWP:Q8TF42; PDB:2CPWA; GSSGSSGRNRQQRPGTIKHGSALDVLLSMGFPRARAQKALASTGGRSVQTACDWLFSHSG --------------------3333-3333------------------------------- PSSG ---- >HYPOTHETICAL PROTEIN FLJ1; SWP:Q96IZ5; PDB:2CPXA; GSSGSSGEEIRKIPMFSSYNPGEPNKVLYLKNLSPRVTERDLVSLFARFQEKKGPPIQFR ---------------------------------33333333-----3333---------- MMTGRMRGQAFITFPNKEIAWQALHLVNGYKLYGKILVIEFGKNKKQRSSGPSSG -------------------------------%%%%-------------------- >RNA-BINDING PROTEIN 12; SWP:Q9NTZ6; PDB:2CPYA; GSSGSSGEGDVNSAKVCAHITNIPFSITKMDVLQFLEGIPVDENAVHVLVDNNGQGLGQA -----------------------333333333333------3333-----3333------ LVQFKNEDDARKSERLHRKKLNGREAFVHVVTLEDMREIEKNPPAQGKSGPSSG -----3333----1111---iiii--------------3333------------ >CUG TRIPLET REPEAT RNA-BI; SWP:Q92879; PDB:2CPZA; GSSGSSGLTQQSIGAAGSQKEGPEGANLFIYHLPQEFGDQDLLQMFMPFGNVVSAKVFID ---------------------------------3333--------3333----------- KQTNLSKCFGFVSYDNPVSAQAAIQSMNGFQIGMKRLKVQLKRSKNDSKSGPSSG ---------------3333-------2222------------------------- >Eukaryotic translation in; SWP:O75821; PDB:2CQ0A; GSSGSSGPNRRADDNATIRVTNLSEDTRETDLQELFRPFGSISRIYLAKDKTTGQSKGFA -----------------------1111333311113333--------------------- FISFHRREDAARAIAGVSGFGYDHLILNVEWAKPSTNSGPSSG ----------------2222-%%%%------------------ >PTB-LIKE PROTEIN L; SWP:Q969N9; PDB:2CQ1A; GSSGSSGDKMDGAPSRVLHIRKLPGEVTETEVIALGLPFGKVTNILMLKGKNQAFLELAT ---------------------------3333-----3333-------------------3 EEAAITMVNYYSAVTPHLRNQPIYIQYSNHKELKTSGPSSG 333--------------%%%%-------------------- >HYPOTHETICAL PROTEIN LOC9; SWP:Q8N989; PDB:2CQ2A; GSSGSSGAKHTLLRHEGIETVSYATQSLVVANGGLGNGVSRNQLLPVLEKCGLVDALLMP ----------3333-------------------3333--1111-----3333-------- PNKPYSFARYRTTEESKRAYVTLNGKEVVDDLGQKITLYLNFVEKVQWSGPSSG -----------3333-------2222---3333--------------------- >RNA-BINDING PROTEIN 9; SWP:O43251; PDB:2CQ3A; 
GSSGSSGNSESKSTPKRLHVSNIPFRFRDPDLRQMFGQFGKILDVEIIFNERGSKGFGFV -----------------------3333--------3333--------------------- TFENSADADRAREKLHGTVVEGRKIEVNNATARVMTNSGPSSG -------------------%%%%-------------------- >RNA BINDING MOTIF PROTEIN; SWP:Q86U06; PDB:2CQ4A; GSSGSSGKSPVREPVDNLSPEERDARTVFCMQLAARIRPRDLEDFFSAVGKVRDVRIISD ----------------------3333-----------3333--1111------------- RNSRRSKGIAYVEFCEIQSVPLAIGLTGQRLLGVPIIVQASQAEKNRLSGPSSG ---------------1111-----------iiii-----3333--3333----- >CYSTEINE-RICH SECRETORY P; SWP:P16562; PDB:2CQ7A; GSSGSSGSCQYQDLLSNCDSLKNTAGCEHELLKEKCKATCLCESGPSSG --------------------3333-3333-------------------- >10-FORMYLTETRAHYDROFOLATE; SWP:O75891; PDB:2CQ8A; GSSGSSGFFKGAASSVLELTEAELVTAEAVRSVWQRILPKVLEVEDSTDFFKSGAASVDV ---------------------3333-------------------1111------------ VRLVEEVKELCDGLELENEDVYMASTFGDFIQLLVRKLRGDDEESGPSSG --------1111----3333------------------------------ >RUVB-LIKE 2; SWP:Q9Y230; PDB:2CQAA; GSSGSSGKEETEIIEGEVVEIQIDRPATGTGSKVGKLTLKTTEMETIYDLGTKMIESLTK ----------------------------------------%%%%--------3333--11 DKVQAGDVITIDKATGKISKLGRSFTRARSGPSSG 11--------------------------------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:Q9UNP9; PDB:2CQBA; GMATTKRVLYVGGLAEEVDDKVLHAAFIPFGDITDIQIPLDYETEKHRGFAFVEFELAED --------------11113333-11113333----------------------------- AAAAIDNMNESELFGRTIRVNLAKPMRIKESGPSSG ------------%%%%-------------------- >ARGININE/SERINE-RICH SPLI; SWP:P62995; PDB:2CQCA; GSSGSSGNRANPDPNCCLGVFGLSLYTTERDLREVFSKYGPIADVSIVYDQQSRRSRGFA -------------3333----------3333----3333--------------------- FVYFENVDDAKEAKERANGMELDGRRIRVSGPSSG ------33333333-------iiii---------- >RNA-BINDING REGION CONTAI; SWP:Q9H0Z9; PDB:2CQDA; GMHGSQKDTTFTKIFVGGLPYHTTDASLRKYFEGFGDIEEAVVITDRQTGKSRGYGFVTM -------------------------------3333------------------------- ADRAAAERACKDPNPIIDGRKANVNLAYLGAKPRSLQTGFAIGVSGPSSG ---33333333-----iiii------3333-------------------- >KIAA1064 PROTEIN; SWP:Q9UPT8; PDB:2CQEA; 
GSSGSSGELPKKRELCKFYITGFCARAENCPYMHGDFPCKLYHTTGNCINGDDCMFSHDP -----------------3333-----1111---------3333----3333--------- LTEETRELLDKMLADDAEAGAEDEKEVEELKKSGPSSG ---3333------------------------------- >RNA-BINDING PROTEIN LIN-2; SWP:Q9H9Z2; PDB:2CQFA; GSSGSSGDRCYNCGGLDHHAKECKLPPQPKKCHFCQSISHMVASCPLKAQQGPSAQGSGP ------------------3333------------------3333-3333----------- SSG --- >TAR DNA-BINDING PROTEIN-4; SWP:Q13148; PDB:2CQGA; GSSGSSGVKRAVQKTSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFG -----------------------------------3333--------------------- FVRFTEYETQVKVMSQRHMIDGRWCDCKLPNSKQSQDSGPSSG -------------------iiii-------3333--------- >IGF-II MRNA-BINDING PROTE; SWP:NA; PDB:2CQHA; SGMNKLYIGNLSPAVTADDLRQLFGDRKLPLAGQVLLKSGYAFVDYPDQNWAIRAIETLS -----------1111---------1111------------------------------22 GKVELHGKIMEVDYSVSKKLRSSGPSSG 22--iiii-------------------- >NUCLEOLYSIN TIAR; SWP:Q01085; PDB:2CQIA; GMMEDDGQPRTLYVGNLSRDVTEVLILQLFSQIGPCKSCKMITEHTSNDPYCFVEFYEHR -----------------3333--------------------------------------- DAAAALAAMNGRKILGKEVKVNWATTPSSQKSGPSSG -------------iiii-------------------- >U3 small nucleolar ribonu; SWP:Q9NV31; PDB:2CQJA; GSSGSSGRRLPTVLLKLRMAQHLQAAVAFVEQGHVRVGPDVVTDPAFLVTRSMEDFVTWV ------------------------------------!!!!---3333--3333------- DSSKISGPSSG -3333------ >C-MPL BINDING PROTEIN; SWP:Q71RC2; PDB:2CQKA; GSSGSSGAVSTEDLKECLKKQLEFCFSRENLSKDLYLISQMDSDQFIPIWTVANMEEIKK -----------------------------------------------3333--------- LTTDPDLILEVLRSSPMVQVDEKGEKVRPSHKRCISGPSSG --------------------3333----------------- >60S RIBOSOMAL PROTEIN L9; SWP:P32969; PDB:2CQLA; GMKTILSNQTVDIPENVDITLKGRTVIVKGPRGTLRRDFNHINVELSLLGKKKKRLRVDK --------------------------------------3333------------------ WWGNRKELATVRTICSHVQNMIKGVTLGSGPSSG ----3333-------------------------- >RIBOSOMAL PROTEIN L17 ISO; SWP:Q9NRX2; PDB:2CQMA; GSSGSSGLLRNLLTGLVRHERIEAPWARVDEMRGYAEKLIDYGKLGDTNERAMRMADFWL ------------------------3333-------------3333--------------- 
TEKDLIPKLFQVLAPRYKDQTGGYTRMLQIPNRSLDRAKMAVIEYKGNCLPPLPLPSGPS -3333-------33333333---------------------------------------- SG -- >FORMIN-BINDING PROTEIN 3; SWP:O75400; PDB:2CQNA; GSSGSSGMKRKESAFKSMLKQAAPPIELDAVWEDIRERFVKEPAFEDITLESERKRIFKD ------------------------------3333----11113333---3333------- FMHVLEHECQHSGPSSG ----------------- >NUCLEOLAR PROTEIN OF 40 K; SWP:73921227; PDB:2CQOA; GMNSGRPETMENLPALYTIFQGEVAMVTDYGAFIKIPGCRKQGLVHRTHMSSCRVDKPSE --------------2222---------------------------3333-------1111 IVDVGDKVWVKLIGREMKNDRIKVSLSMKVVNQGTGKDLDPNNVIIESGPSSG -----------------1111------1111--------33333333------ >RNA-BINDING PROTEIN 12; SWP:Q8R4X3; PDB:2CQPA; GSSGSSGASSGKPGPTIIKVQNMPFTVSIDEILDFFYGYQVIPGSVCLKYNEKGMPTGEA -----------------------3333------1111----2222-----3333------ MVAFESRDEATAAVIDLNDRPIGSRKVKLVLGSGPSSG -------------------------------------- >DNAJ HOMOLOG SUBFAMILY C ; SWP:Q96KC8; PDB:2CQQA; GSSGSSGAPEWTEEDLSQLTRSMVKFPGGTPGRWEKIAHELGRSVTDVTTKAKQLKDSVT ---------------------3333-2222--------------33333333--3333-- CSPGMVSGPSSG ------------ >DNAJ HOMOLOG SUBFAMILY C ; SWP:Q96KC8; PDB:2CQRA; GSSGSSGSLRKERARSAEEPWTQNQQKLLELALQQYPRGSSDCWDKIARCVPSKSKEDCI -----------3333----------------3333----------3333----------- ARYKLLVSGPSSG ------------- >CELLOBIOSE PHOSPHORYLASE; SWP:O66264; PDB:2CQSA; MRYGHFDDAAREYVITTPHTPYPWINYLGSEQFFSLLSHQAGGYSFYRDAKMRRLTRYRY -------1111------------------------------------------------- NNIPADAGGRYLYVNDGGDVWTPSWLPVKADLDHFEARHGLGYSRITGERNGLKVETLFF ---------------iiii--1111--------------2222------iiii------- VPLGENAEVQKVTVTNTSDAPKTATLFSFVEFCLWNAQDDQTNYQRNLSIGEVEVEQDGP -2222-----------------------------------------1111---------- HGSAIYHKTEYRERRDHYAVFGVNTRADGFDTDRDTFVGAYNSLGEASVPRAGKSADSVA -------2222---------------------3333--22223333---3333------- SGWYPIGSHSVAVTLQPGESRDLVYVLGYLENPDEEKWADDAHQVVNKAPAHALLGRFAT ---------------2222-------------1111---1111----------------- SEQVDAALEALNSYWTNLLSTYSVSSTDEKLDRMVNIWNQYQCMVTFNMSRSASFFETGI 
---------------------------3333----------------------------- GRGMGFRDSNQDLLGFVHLIPERARERIIDIASTQFADGSAYHQYQPLTKRGNNDIGSGF ----------3333-33333333------------1111-------1111---------- NDDPLWLIAGVAAYIKESGDWGILDEPVPFDNEPGSEVPLFEHLTRSFQFTVQNRGPHGL --3333-----------------------%%%%-----3333-------------1111- PLIGRADWNDCLNLNCFSTTPGESFQTTENQAGGVAESVFIAAQFVLYGAEYATLAERRG --------1111-----------3333-----------------------------1111 LADVATEARKYVDEVRAAVLEHGWDGQWFLRAYDYYGNPVGTDAKPEGKIWIEPQGFAVM ---------------------------------1111----3333--------------- AGIGVGEGPDDADAPAVKALDSVNEMLGTPHGLVLQYPAYTTYQIELGEVSTYPPGYKEN -2222--1111-----------------1111-----------1111-1111-2222--- GGIFCHNNPWVIIAETVVGRGAQAFDYYKRITPAYREDISDTHKLEPYVYAQMIAGKEAV ---3333------------------------33333333----------------1111- RAGEAKNSWLTGTAAWNFVAVSQYLLGVRPDYDGLVVDPQIGPDVPSYTVTRVARGATYE 2222--------------------------1111-------3333--------iiii--- ITVTNSGAPGARASLTVDGAPVDGRTVPYAPAGSTVRVEVTV -------2222-----iiii----------2222-------- >PEROXISOMAL D3,D2-ENOYL-C; SWP:NA; PDB:2CQUA; GSSGSSGMNRTAMRASQKDFENSMNQVKLLKKDPGNEVKLKLYALYKQATEGPCNMPKPG ---------%%%%----------------------------------------------3 VFDLINKAKWDAWNALGSLPKEAARQNYVDLVSSLSPSLESSSQVEPGTDSGPSSG 333----------3333--3333---------3333-------------------- >Myosin light chain kinase; SWP:Q9UIT9; PDB:2CQVA; GSSGSSGPQIIQFPEDQKVRAGESVELFGKVTGTQPITCTWMKFRKQIQESEHMKVENSE ------------------------------------------%%%%-------------- NGSKLTILAARQEHCGCYTLLVENKLGSRQAQVNLTVVDKPDPPAGTPSGPSSG ----------3333---------------------------------------- >SUSHI DOMAIN CONTAINING 2; SWP:Q9DBX3; PDB:2CQWA; GSSGSSGIPGPGFTAGAQGSCSLRCGAQDGLCSCHPTCSGLGTCCEDFLDYCLEILPSSG ----------------------------3333--1111------11111111-------- SMMGGKDFVVQHLKWTDPSGPSSG ------------------------ >LAG1 LONGEVITY ASSURANCE ; SWP:Q9D6K9; PDB:2CQXA; GSSGSSGGIKDSPVNKVEPNDTLEKVFVSVTKYPDEKRLKGLSKQLDWSVRKIQCWFRHR -------------------------1111-----3333---3333---3333-------- RNQDKPSGPSSG ------------ 
>Propionyl-CoA carboxylase; SWP:P05165; PDB:2CQYA; GSSGSSGDKIESKLLAKKAEVNTIPGFDGVVKDAEEAVRIAREIGYPVMIKASAGGGGKG --------------------------------3333---------------3333----- MRIAWDDEETRDGFRLSSQEAASSFGDDRLLIEKFIDNPRHISGPSSG ------------------------------------------------ >177AA LONG HYPOTHETICAL P; SWP:O58085; PDB:2CQZA; MIEKILLVQTLKRLPRMGWLIKGVQEPESIADHSFGVAFITLVLADVLEKRGKRIDVEKA ---------3333-----------------------------------1111-------- LKMAIVHDLAEAIITDIPLSAQEFVDKDKAEALVFKKVFPEFYELYREYQECSSPEAQLV -----------------3333-----------------3333------3333-------- RIADKLDMILQAYQYELSGNKNLDEFWEAIEEIKRLELSKYLEDILNSVGRLK ---------------1111---3333-----33333333---------3333- >NUCLEAR MIGRATION PROTEIN; SWP:O35685; PDB:2CR0A; GSSGSSGKPNLGNGADLPNYRWTQTLAELDLAVPFRVSFRLKGKDVVVDIQRRHLRVGLK ----------------1111---------------------3333--------------- GQPPVVDGELYNEVKVEESSWLIEDGKVVTVHLEKINKMEWWNRLVTSDPEINTKSGPSS -----------------------%%%%--------------------------------- G - >SPECKLE-TYPE POZ PROTEIN; SWP:O43791; PDB:2CR2A; GSSGSSGKVVKFSYMWTINNFSFCREEMGEVIKSSTFSSGANDKLKWCLRVNPKGLDEES -------------------1111---------------------------------3333 KDYLSLYLLLVSCPKSEVRAKFKFSILNAKGEETKAMESQRAYRFVQGKDWGFKKFIRRD ---------------------------1111--------------2222--------333 FLLDEANGLLPDDKLTLFCEVSVVQDSVNISGQSGPSSG 3---3333------------------------------- >BASIC FIBROBLAST GROWTH F; SWP:P11362; PDB:2CR3A; GSSGSSGVEVESFLVHPGDLLQLRCRLRDDVQSINWLRDGVQLAESNRTRITGEEVEVQD -------------------------------------%%%%------------------- SVPADSGLYACVTSSPSGSDTTYFSVNVSDALPSGPSSG --------------------------------------- >SH3 DOMAIN-BINDING PROTEI; SWP:P78314; PDB:2CR4A; GSSGSSGEDYEKVPLPNSVFVNTTESCEVERLFKATSPRGEPQDGLYCIRNSSTKSGKVL ---------------3333-----3333--------------2222-------------- VVWDETSNKVRNYRIFEKDSKFYLEGEVLFVSVGSMVEHYHTHVLPSHQSLLLRHPYGYT ----1111---------%%%%--------------------------------------- SGPSSG -3333- >REPRODUCTION 8; SWP:Q9QZ49; PDB:2CR5A; 
GSSGSSGEVPDLPEEPSETAEEVVTVALRCPNGRVLRRRFFKSWNSQVLLDWMMKVGYHK -----------------------------1111-----------3333----------11 SLYRLSTSFPRRALEVEGGSSLEDIGITVDTVLNVEEKEQSSQSGPSSG 11------------------3333------------------------- >KIAA1556 PROTEIN; SWP:Q5VST9; PDB:2CR6A; GSSGSSGLVQGRRVHIIEDLEDVDVQEGSSATFRCRISPANYEPVHWFLDKTPLHANELN -------------------------2222------------------------------- EIDAQPGGYHVLTLRQLALKDSGTIYFEAGDQRASAALRVTEKPSVFSRSGPSSG -----------------3333---------------------------------- >PAIRED AMPHIPATHIC HELIX ; SWP:Q62141; PDB:2CR7A; GSSGSSGVHVEDALTYLDQVKIRFGSDPATYNGFLEIMKEFKSQSIDTPGVIRRVSQLFH --------------------3333----------------1111--1111--------11 EHPDLIVGFNAFLPSGPSSG 113333-------------- >MDM4 PROTEIN; SWP:O15151; PDB:2CR8A; GSSGSSGSEDEWQCTECKKFNSPSKRYCFRCWALRKDWYSDCSKLTHSGPSSG ---------------------3333---------------------------- >POLY [ADP-RIBOSE] POLYMER; SWP:P09874; PDB:2CR9A; GSSGSSGKSEKRMKLTLKGGAAVDPDSGLEHSAHVLEKGGKVFSATLGLVDIVKGTNSYY -----------------------3333-1111-----%%%%------------------- KLQLLEDDKENRYWIFRSWGRVGTVIGSNKLEQMPSKEDAIEHFMKLYEEKTGNAWHSKN ----------------------------------------------1111---------- FTKYPKKFYPLEISGPSSG ------------------- >HOMEOBOX PROTEIN HOX-B13; SWP:Q92826; PDB:2CRAA; GSSGSSGRKKRIPYSKGQLRELEREYAANKFITKDKRRKISAATSLSERQITIWFQNRRV --------------3333----------------------------3333--------33 KEKKSGPSSG 33-------- >NUCLEAR RECEPTOR BINDING ; SWP:NA; PDB:2CRBA; GSSGSSGMEGPLNLAHQQSRRADRLLAAGKYEEAISCHRKATTYLSEAMKLTESEQAHLS -------------------------1111-----------------3333----3333-- LELQRDSHMKQLLLIQERWKRAKREERLKAHSGPSSG ------------------------------------- >Ubiquitin conjugating enz; SWP:Q9BYM8; PDB:2CRCA; GSSGSSGPVGWQCPGCTFINKPTRPGCEMCCRARPEAYQVPASYQPSGPSSG --------------------1111----------3333-------------- >HEF-LIKE PROTEIN; SWP:Q9NQ75; PDB:2CREA; GSSGSSGLLARALYDNCPDCSDELAFSRGDILTILEQHVPESEGWWKCLLHGRQGLAPAN -------------------3333---------------3333-------%%%%----333 RLQILSGPSSG 3---------- >RAN BINDING PROTEIN 
3; SWP:Q9H6Z4; PDB:2CRFA; GSSGSSGTARKCLLEKVEVITGEEAESNVLQMQCKLFVFDKTSQSWVERGRGLLRLNDMA ---------------3333--------------------3333----------------- STDDGTLQSRLVMRTQGSLRLILNTKLWAQMQIDKASEKSIHITAMDTEDQGVKVFLISA ---------------------------1111----------------------------- SSKDTGQLYAALHHRILALRSRVESGPSSG ------------------1111-------- >METASTASIS ASSOCIATED PRO; SWP:Q924K8; PDB:2CRGA; GSSGSSGMEEWSASEACLFEEALEKYGKDFNDIRQDFLPWKSLTSIIEYYYMWKTTDRYV ------------------------------------------3333------1111---- QQKRSGPSSG ---------- >VAV PROTO-ONCOGENE; SWP:P15498; PDB:2CRHA; GSSGSSGKAEAEQNWWEGPPQDLSVHLWYAGPMERAGAESILANRSDGTFLVRQRVKDAA ---------------------3333----------------3333--------------- EFAISIKYNVEVKHIKIMTAEGLYRITEKKAFRGLTELVEFYQQNSLKDCFKSLDTTLQF -------%%%%--------iiii-----------3333---1111-33333333------ PFKEPEKRTISRSGPSSG 3333-------------- >SWI/SNF-related matrix-as; SWP:Q9Z104; PDB:2CRJA; GSSGSSGPKAPVTGYVRFLNERREQIRTRHPDLPFPEITKMLGAEWSKLQPAEKQRYLDE ------------3333----3333-----1111----3333--3333------------- AEKEKQQYLKELWAYQQSEAYKVCTESGPSSG --3333-----------3333----------- >COPPER CHAPERONE FOR SUPE; SWP:O14618; PDB:2CRLA; GSSGSSGMASDSGNQGTLCTLEFAVQMTCQSCVDAVRKSLQGVAGVQDVEVHLEDQMVLV ------------------------------------------------------------ HTTLPSQEVQALLEGTGRQAVLKGMGSGQLQNSGPSSG ----3333-----3333--------------------- >Fibronectin type-III doma; SWP:Q9Y2H6; PDB:2CRMA; GSSGSSGVVEFTTCPDKPGIPVKPSVKGKIHSHSFKITWDPPKDNGGATINKYVVEMAEG --%%%%------------------------------------------------------ SNGNKWEMIYSGATREHLCDRLNPGCFYRLRVYCISDGGQSAVSESLLVQTPAVSGPSSG ------------------------------------------------------------ >UBASH3A PROTEIN; SWP:P57075; PDB:2CRNA; GSSGSSGSSPSLLEPLLAMGFPVHTALKALAATGRKTAEEALAWLHDHCNDPSLDDPISG ----------3333--3333----------------3333----------1111------ PSSG 3333 >REGULATORY PROTEIN CRO; SWP:P03036; PDB:2CRO; QTLSERLKKRRIALKMTQTELATKAGVKQQSIQLIEAGVTKRPRFLFEIAMALNCDPVWL -----------1111-----------------------------------1111------ QYGT ---- 
>REGULATOR OF G-PROTEIN SI; SWP:O15539; PDB:2CRPA; GSSGSSGPEKPAKTQKTSLDEALQWRDSLDKLLQNNYGLASFKSFLKSEFSEENLEFWIA -----------------3333-3333-----3333------------------------- CEDYKKIKSPAKMAEKAKQIYEEFIQTEAPKEVNIDHFTKDITMKNLVEPSLSSFDMAQK ---1111-3333-------------------------------1111------------- RIHALMEKDSLPRFVRSEFYQELISGPSSG ---------3333----3333--------- >MITOCHONDRIAL TRANSLATION; SWP:Q9CZD5; PDB:2CRQA; GSSGSSGPKTGPTMTKELVFSSNIGQHDLDTKSKQIQQWIEKKYHVQVTIKRRKDAEQSE --------------------33333333----------3333----------------33 EETEEIFNQILQTMPDIATFSSRPKAIRGGTASMCVFRHLSKKEEKSGPSSG 33-----------1111---------iiii---------------------- >STROMAL MEMBRANE-ASSOCIAT; SWP:Q8IYB5; PDB:2CRRA; GSSGSSGKAQKLNEQHQLILSKLLREEDNKYCADCEAKGPRWASWNIGVFICIRCAGIHR -------3333----3333------3333-----------------------3333--33 NLGVHISRVKSVNLDQWTAEQIQCMQDMGNTKARLLYEANLPENFRRPQTDQAVEFFIRD 331111-----------3333---------3333---------------3333------- KYEKKKYYDKNAIAISGPSSG -1111---3333--------- >PROGRAMMED CELL DEATH PRO; SWP:O14737; PDB:2CRUA; GSSGSSGLRRQRLAELQAKHGDPGDAAQQEAKHREAEMRNSILAQVLDQSARARLSNLAL -----------------------3333----------------------------3333- VKPEKTKAVENYLIQMARYGQLSEKVSEQGLIEILKKVSQQTEKTTTVKFNRSGPSSG ---------------------------------11113333----------------- >TRANSLATION INITIATION FA; SWP:Q91YJ5; PDB:2CRVA; GSSGSSGYPIGEASILATFTVTEGKKKIPVADCRVQKGQLERHKKFKLIRNGQVIWKGSL ------------------------------------------------------------ TSLKHHKDDISVIKTGMDCGLSLDEEKVEFKPGDQVICYEENKVPTKTSWDPGFSGPSSG -------------2222-------3333-------------------------------- >ADP-ribosylation factor G; SWP:Q9NP61; PDB:2CRWA; GSSGSSGMGDPSKQDILTIFKRLRSVPTNKVCFDCGAKNPSWASITYGVFLCIDCSGSHR ---------------------33331111---------------1111-----3333-33 SLGVHLSFIRSTELDSNWSWFQLRCMQVGGNASASSFFHQHGCSTNDTNAKYNSRAAQLY 333333------------3333--------3333----3333----1111---------- REKIKSLASQATRKHGTDLWLDSSGPSSG ----------------------------- >KIN OF IRRE-LIKE PROTEIN ; SWP:Q8IZU9; PDB:2CRYA; 
GSSGSSGTLTVNGPPIISSTQTQHALHGEKGQIKCFIRSTPPPDRIAWSWKENVLESGTS -----------------------------------------------------------% GRYTVETISTEEGVISTLTISNIVRADFQTIYNCTAWNSFGSDTEIIRLKEQGSEMSGPS %%%------3333----------3333----------3333------------------- SG -- >Fibronectin type-III doma; SWP:Q9Y2H6; PDB:2CRZA; GSSGSSGPPGPCLPPRLQGRPKAKEIQLRWGPPLVDGGSPISCYSVEMSPIEKDEPREVY ---------------------1111----------iiii----------3333------- QGSEVECTVSSLLPGKTYSFRLRAANKMGFGPFSEKCDITTAPGSGPSSG -------------------------3333--------------------- >HEMATOPOIETIC SH2 DOMAIN ; SWP:Q96JZ2; PDB:2CS0A; GSSGSSGGQLAQDGVPEWFHGAISREDAENLLESQPLGSFLIRVSHSHVGYTLSYKAQSS -----------------------3333----1111------------------------- CCHFMVKLLDDGTFMIPGEKVAHTSLDALVTFHQQKPIEPRRELLTQPCRQKDSGPSSG --------1111-------------------------3333------------------ >PMS1 PROTEIN HOMOLOG 1; SWP:P54277; PDB:2CS1A; GSSGSSGIKKPMSASALFVQDHRPQFLIENPKTSLEDATLQIEELWKTLSEEEKLKYEEK --------------------------3333---------------1111-3333--3333 ATKDLERYNSQMKRAIEQESQMSLKDSGPSSG 3333-------3333----------------- >POLY [ADP-RIBOSE] POLYMER; SWP:P09874; PDB:2CS2A; GSSGSSGGSKAEKTLGDFAAEYAKSNRSTCKGCMEKIEKGQVRLSKKMVDPEKPQLGMID -------------------------------------------------11113333--- RWYHPGCFVKNREELGFRPEYSASQLKGFSLLATEDKEALKKQLPGVKSEGKRKGDEVDG ------------3333-33333333--3333---------3333---------------- VDEVAKKKSGPSSG -------------- >PROTEIN C14ORF4; SWP:Q9H1B7; PDB:2CS3A; GSSGSSGSPMANSGPLCCTICHERLEDTHFVQCPSVPSHKFCFPCSRESIKAQGATGEVY --------------------------3333-----3333--1111-3333---1111--- CPSGEKCPLVGSNVPWAFMQGEIATILSGPSSG --------------------------------- >PROTEIN C12ORF2; SWP:Q8NHQ8; PDB:2CS4A; GSSGSSGMELKVWVDGVQRIVCGVTEVTTCQEVVIALAQAIGRTGRYTLIEKWRDTERHL -------------iiii-------33333333--------------------%%%%---- APHENPIISLNKWGQYASDVQLILRRTGPSGPSSG 1111------------3333--------------- >Tyrosine-protein phosphat; SWP:P29074; PDB:2CS5A; GSSGSSGNGGIPHDNLVLIRMKPDENGRFGFNVKGGYDQKMPVIVSRVAPGTPADLCVPR 
-----------------------3333--------3333--------------------- LNEGDQVVLINGRDIAEHTHDQVVLFIKASCERHSGELMLLVRPNAVYDVVEESGPSSG ---------%%%%------3333----3333---------------------------- >PNEUMOCOCCAL HISTIDINE TR; SWP:Q97QM8; PDB:2CS7A; QGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSG -----1111---1111------------!!!!----1111--------------- >KIAA0535 PROTEIN; SWP:O60284; PDB:2CS8A; GSSGSSGPELKCPVIGCDGQGHISGKYTSHRTASGCPLAAKRQKENPLNGASLSWKLNKQ ---------------------3333------3333------------------------- ELPHCPLPGCNGLGHVNNVFVTHRSLSGCPLNAQVIKKGKVSSGPSSG ------3333----3333------3333-3333--------------- >TOPOISOMERASE V; SWP:Q977W1; PDB:2CSBA; LVYDAEFVGSEREFEEERETFLKGVKAYDGVLATRYLERSSSAKNDEELLELHQNFILLT ----------------------------------------3333---------------- GSYACSIDPTEDRYQNVIVRGVNFDERVQRLSTGGSPARYAIVYRRGWRAIAKALDIDEE --3333-1111-------iiii--------1111---------2222-----1111-333 DVPAIEVRAVKRNPLQPALYRILVRYGRVDLPVTVDEVPPEAGEFERLIERYDVPIDEKE 3--------1111--------------3333--3333------------1111------- ERILEILRENPWTPHDEIARRLGLSVSEVEGEKDPESSGIYSLWSRVVVNIEYDERTAKR -----33331111-----------3333------------------3333---------- HVKRRDRLLEELYEHLEELSERYLRHPLTRRWIVEHKRDIRRYLEQRIVECALKLQDRYG -------------------3333-------------3333-1111--------------- IREDVALCLARAFDGSISIATTPYRTLKDVCPDLTLEEAKSVNRTLATLIDEHGLSPDAA ----------1111----------------1111-------------------------- DELIEHFESIAGILATDLEEIERYEEGRLSEEAYRAAVEIQLAELTKKEGVGRKTAERLL ------------1111-------3333--3333-----------3333------------ RAFGNPERVKQLAREFEIEKLASVEGVGERVLRSLVPGYASLISIRGIDRERAERLLKKY ------------1111-------2222--------2222-----2222------------ GGYSKVREAGVEELREDGLTDAQIRELKGLK ------------------------------- >DNA-BINDING PROTEIN SATB2; SWP:Q9UPW6; PDB:2CSFA; GSSGSSGPIKVDGANINITAAIYDEIQQEMKRAKVSQALFAKVAANKSQGWLCELLRWKE ------------------3333-------------3333--------3333--------- NPSPENRTLWENLCTIRRFLNLPQHERDVIYEEESSGPSSG ----------------------3333--------------- >PUTATIVE CYTOPLASMIC PROT; SWP:Q8ZQM7; PDB:2CSGA; 
TPFTHETLPADPKAAIRQMKQALRAQIGDVQAVFDRLSATIAARVAEINDLKAQGQPVWP ------------------------------------------------------------ IIPFSELAMGNISDATRAEVKRRGCAVIKGHFPREQALAWDQSMLDYLDKNHFDEVYKGP ------1111-------------------------------------------------- GDNFFGTLSASRPEIYPVYWSQAQMQARQSEEMALAQSFLNRLWQVEHDGKRWFNPDISI --1111--------------3333-----1111------3333----------------- IYPDRIRRRPPGTTSKGLGAHTDSGALERWLLPAYQQVFASVFNGNVEQYDPWNAAHRTD ---------2222------------3333----------------3333-1111--1111 VEEYTVDNTTKCSVFRTFQGWTALSDMLPGQGLLHVVPIPEAMAYILLRPLLDDVPEDEL -----2222------------------2222----------------3333----1111i CGVAPGRVLPISEQWHPLLMAALTSIPPLEAGDSVWWHCDVIHSVAPVENQQGWGNVMYI iii2222----3333-3333---------2222----1111------------------- PAAPMCEKNLAYARKVKAALETGASPGDFPREDYETTWEGRFTLRDLNIHGKRALGI -------------------------3333---1111------3333------1111- >ZINC FINGER PROTEIN 297B; SWP:O43298; PDB:2CSHA; GSSGSSGDKLYPCQCGKSFTHKSQRDRHMSMHLGLRPYGCGVCGKKFKMKHHLVGHMKIH ------------3333------------3333----------------333333331111 TGIKPYECNICAKRFMWRDSFHRHVTSCTKSYEAAKAEQNTTEASGPSSG -----------------3333---------3333---------------- >RIM BINDING PROTEIN 2; SWP:O15034; PDB:2CSIA; GSSGSSGRRMVALYDYDPRESSPNVDVEAELTFCTGDIITVFGEIDEDGFYYGELNGQKG -------------------------3333----------------3333-----%%%%-- LVPSNFLEEVSGPSSG --1111---------- >TJP2 PROTEIN; SWP:NA; PDB:2CSJA; GSSGSSGMEEVIWEQYTVTLQKDSKRGFGIAVSGGRDNPHFENGETSIVISDVLPGGPAD -------------------------!!!!-----------3333------------3333 GLLQENDRVVMVNGTPMEDVLHSFAVQQLRKSGKIAAIVVKRPRKVQVAPLSGPSSG -----------iiii-----3333---3333-------------------------- >SORTING NEXIN 12; SWP:Q9UMY4; PDB:2CSKA; GSSGSSGSNFLEIDIFNPQTVGVGRARFTTYEVRMRTNLPIFKLKESCVRRRYSDFEWLK --------------------------------------3333---------3333----- NELERDSKIVVPPLPGKALKRQLPFRGDEGIFEESFIEERRQGLEQFINKIAGHPLAQNE 1111---------------------------------------------3333------- RCLHMFLQEEAIDRNYVPGKSGPSSG --3333-------------------- >PLECKSTRIN; SWP:P08567; PDB:2CSOA; 
GSSGSSGRSIRLPETIDLGALYLSMKDTEKGIKELNLEKDKKIFNHCFTGNCVIDWLVSN ------------11113333--3333------------%%%%------------------ QSVRNRQEGLMIASSLLNEGYLQPAGDMSKSAVDGTAENPFLDNPDAFYYFPDSGFFCEE -----3333------------------3333------------1111------------- NSGPSSG ------- >RIM BINDING PROTEIN 2; SWP:O15034; PDB:2CSPA; GSSGSSGVEFSTLPAGPPAPPQDVTVQAGVTPATIRVSWRPPVLTPTGLSNGANVTGYGV ------------------------------1111-------------------------- YAKGQRVAEVIFPTADSTAVELVRLRSLEAKGVTVRTLSAQGESVDSAVAAVPPELLVPP -----------1111-----3333-1111---------3333----------3333---- TPHPSGPSSG ---------- >RIM BINDING PROTEIN 2; SWP:O15034; PDB:2CSQA; GSSGSSGTDPGAEELPARIFVALFDYDPLTMSPNPDAAEEELPFKEGQIIKVYGDKDADG --------------------------3333----3333------2222--------3333 FYRGETCARLGLIPCNMVSEIQADDEEMMDQSGPSSG -----%%%%----3333-------------------- >Regulating synaptic membr; SWP:Q86UR5; PDB:2CSSA; GSSGSSGVTWQPSKEGDRLIGRVILNKRTTMPKDSGALLGLKVVGGKMTDLGRLGAFITK ------------3333--------------------------------3333-------- VKKGSLADVVGHLRAGDEVLEWNGKPLPGATNEEVYNIILESKSEPQVEIIVSRPSGPSS ---------------------%%%%-2222------------------------------ G - >ASPARTATE AMINOTRANSFERAS; SWP:P00504; PDB:2CSTA; AASIFAAVPRAPPVAVFKLTADFREDGDSRKVNLGVGAYRTDEGQPWVLPVVRKVEQLIA --1111---------------------1111---------1111---------------- GDGSLNHEYLPILGLPEFRANASRIALGDDSPAIAQKRVGSVQGLGGTGALRIGAEFLRR -1111-----33333333---------1111--1111----------------------- WYNGNNNTATPVYVSSPTWENHNSVFMDAGFKDIRTYRYWDAAKRGLDLQGLLDDMEKAP ------------------3333----1111-----------1111----------11112 EFSIFILHACAHNPTGTDPTPDEWKQIAAVMKRRCLFPFFDSAYQGFASGSLDKDAWAVR 222------------------------------------------------3333----- YFVSEGFELFCAQSFSKNFGLYNERVGNLSVVGKDEDNVQRVLSQMEKIVRTTWSNPPSQ ----------------11111111-------------------------1111------- GARIVATTLTSPQLFAEWKDNVKTMADRVLLMRSELRSRLESLGTPGTWNHITDQIGMFS ----------------------------------------1111-----3333------- FTGLNPKQVEYMIKEKHIYLMASGRINMCGLTTKNLDYVAKSIHEAVTKIQ 
--------------------1111--3333-3333---------------- >457AA LONG HYPOTHETICAL P; SWP:O58493; PDB:2CSUA; LDYFFNPKGIAVIGASNDPKKLGYEVFKNLKEYKKGKVYPVNIKEEEVQGVKAYKSVKDI -3333------------1111--------1111--------1111--iiii----3333- PDEIDLAIIVVPKRFVKDTLIQCGEKGVKGVVIITAGFGETGEEGKREEKELVEIAHKYG -----------3333----------------------1111------------------- RIIGPNCVGINTHVDLNATFITVAKKGNVAFISQSGALGAGIVYKTIKEDIGFSKFISVG ----------3333---------------------------------------------- NADVDFAELEYLADTEEDKAIALYIEGVRNGKKFEVAKRVTKKKPIIALKAGSWKIYEAA ----3333--3333---------------3333-----3333------------------ FKQSGVLVANTIDELSARAFSQPLPRGNKVAITNAGGPGVLTADELDKRGLKLATLEEKT -1111-----3333----------------------3333-----3333----------- IEELRSFLPPAAVKNPVDIASARGEDYYRTAKLLLQDPNVDLIAICVVPTFAGTLTEHAE ------------------1111--------------1111-------------------- GIIRAVKEVNNEKPVLAFAGYVSEKAKELLEKNGIPTYERPEDVASAAYALVEQAKNVGI -------------------3333-------1111-------------------------- >TRIPARTITE MOTIF PROTEIN ; SWP:Q14134; PDB:2CSVA; GSSGSSGQLLEPIRDFEARKCPVHGKTMELFCQTDQTCICYLCMFQEHKNHSTVTVEEAK ---------------------------------------3333-----------3333-3 AEKETESGPSSG 333--------- >UBIQUITIN LIGASE PROTEIN ; SWP:O76064; PDB:2CSWA; GSSGSSGVTGDRAGGRSWCLRRVGMSAGWLLLEDGCEVTVGRGFGVTYQLVSKICPLMIS ---------------------2222-------2222------------------3333-- RNHCVLKQNPEGQWTIMDNKSLNGVWLNRARLEPLRVYSIHQGDYIQLGVPLENKENAEY --------3333--------------%%%%---------------------1111----- EYEVTEEDWETIYPCLSPKSGPSSG -------11113333---------- >ZINC FINGER PROTEIN 183-L; SWP:Q8IZP6; PDB:2CSYA; GSSGSSGGSEEEEIPFRCFICRQAFQNPVVTKCRHYFCESCALEHFRATPRCYICDQPTG ------------------------------1111------------------------%% GIFNPAKELMAKLQKSGPSSG %%------------------- >SYNAPTOTAGMIN-LIKE PROTEI; SWP:Q96C24; PDB:2CSZA; GSSGSSGLLEIKRKGAKRGSQHYSDRTCARCQESLGRLSPKTNTCRGCNHLVCRDCRIQE --------------------------------------3333------------------ SNGTWRCKVCSGPSSG %%%%--3333------ >NON-SMC ELEMENT 1 HOMOLOG; SWP:Q8WV22; PDB:2CT0A; 
GSSGSSGRETYPDAVKICNICHSLLIQGQSCETCGIRMHLPCVAKYFQSNAEPRCPHCND --------------------------------------1111------------------ YWPHEIPKSGPSSG -------------- >TRANSCRIPTIONAL REPRESSOR; SWP:P49711; PDB:2CT1A; GSSGSSGRTHSGEKPYECYICHARFTQSGTMKMHILQKHTENVAKFHCPHCDTVIARKSD --------------------------3333---------------------------333 LGVHLRKQHSYSGPSSG 3---------------- >TRIPARTITE MOTIF PROTEIN ; SWP:Q13049; PDB:2CT2A; GSSGSSGNLDALREVLECPICMESFTEEQLRPKLLHCGHTICRQCLEKLLASSINGVRCP -------------------------3333-----3333---3333--------------- FCSKITRITSLTQLTDNLTVLKSGPSSG ---------3333----3333------- >VINEXIN; SWP:O60504; PDB:2CT3A; GSSGSSGTPYRAMYQYRPQNEDELELREGDRVDVMQQCDDGWFVGVSRRTQKFGTFPGNY -------------------3333---------------------------------3333 VAPVSGPSSG --3333---- >CDC42-INTERACTING PROTEIN; SWP:Q15642; PDB:2CT4A; GSSGSSGGHCVAIYHFEGSSEGTISMAEGEDLSLMEEDKGDGWTRVRRKEGGEGYVPTSY --------------------------------------------------------3333 LRVTSGPSSG ---------- >ZINC FINGER BED DOMAIN CO; SWP:O96006; PDB:2CT5A; GSSGSSGSKVWKYFGFDTNAEGCILQWKKIYCRICMAQIAYSGNTSNLSYHLEKNHPEEF ----------3333-----33333333-----1111---------3333----------- CEFVKSNSGPSSG ------------- >sh3 domain-binding glutam; SWP:Q9UJC5; PDB:2CT6A; GSSGSSGMVIRVFIASSSGFVAIKKKQQDVVRFLEANKIEFEEVDITMSEEQRQWMYKNV -------------------3333-----------1111------3333-------1111- PPEKKPTQGNPLPPQIFNGDRYCGDYDSFFESKESNTVFSFLGLKSGPSSG 3333--------------------3333---3333---------------- >RING FINGER PROTEIN 31; SWP:Q96EP0; PDB:2CT7A; GSSGSSGALFHKKLTEGVLMRDPKFLWCAQCSFGFIYEREQLEATCPQCHQTFCVRCKRQ ------------------------------------------------------------ WEEQHRGRSCEDFQNWKRMNSGPSSG -3333--------------------- >METHIONYL-TRNA SYNTHETASE; SWP:O67298; PDB:2CT8A; MTLMKKFYVTTPIYYVNDVPHLGHAYTTIAADTIARYYRLRDYDVFFLTGTDEHGLKIQK --------------------3333--------------------------------3333 KAEELGISPKELVDRNAERFKKLWEFLKIEYTKFIRTTDPYHVKFVQKVFEECYKRGDIY --1111------------------------------------------------------ 
LGEYEKEPSYFFRLSKYQDKLLELYEKNPEFIQPDYRRNEIISFVKQGLKDLSVTRPRSR ------------3333-----------1111--3333----------------------- VKWGIPVPFDPEHTIYVWFDALFNYISALEDKVEIYWPADLHLVGKDILRFHTVYWPAFL ---------1111--3333--3333------3333---------3333------------ MSLGYELPKKVFAHGWWTVEGKKMSKTLGNVVDPYEVVQEYGLDEVRYFLLREVPFGQDG -------------------------1111------------------------------- DFSKKAILNRINGELANEIGNLYSRVVNMAHKFLGGEVSGARDEEYAKIAQESIKNYENY --3333-----------------------------------------------------3 MEKVNFYKAIEEILKFTSYLNKYVDEKQPWALNKERKKEELQKVLYALVDGLFVLTHLLY 333------------------------3333----------------------------- PITPNKMKEALQMLGEKEFLKELKPYSKNTYKLGERKILFPKREG ----3333------------------------------------- >CALCIUM-BINDING PROTEIN P; SWP:P61023; PDB:2CT9A; LLRDEELEEIKKETGFSHSQITRLYSRFTSLDKGENGTLSREDFQRIPELAINPLGDRII -------------------------------1111----3333----3333-1111---- NAFFSEGEDQVNFRGFMRTLAHFRPIEEPLNSRSNKLHFAFRLYDLDKDDKISRDELLQV 11112222-----------3333-----1111-------3333-1111-----------3 LRMMVGVNISDEQLGSIADRTIQEADQDGDSAISFTEFVKVLEKVDVEQKMSIRFLHKLA 333--1111-----------------------------3333---3333----------- AALEH ----- >ZINC FINGER PROTEIN 512; SWP:Q96ME7; PDB:2CTDA; GSSGSSGRIRKEPPVYAAGSLEEQWYLEIVDKGSVSCPTCQAVGRKTIEGLKKHMENCKQ -------------------3333------------------------------------- EMFTCHHCGKQLRSLAGMKYHVMANHNSLPSGPSSG ------------------------------------ >VIGILIN; SWP:Q00341; PDB:2CTEA; GSSGSSGDIVARLQTQASATVAIPKEHHRFVIGKNGEKLQDLELKTATKIQIPRPDDPSN -----------------------33333333------------1111------1111--- QIKITGTKEGIEKARHEVLLISAEQDKRSGPSSG ------3333-------------3333------- >VIGILIN; SWP:Q00341; PDB:2CTFA; GSSGSSGEPEKLGQALTEVYAKANSFTVSSVAAPSWLHRFIIGKKGQNLAKITQQMPKVH ----------------------------------1111-3333------------3333- IEFTEGEDKITLEGPTEDVSVAQEQIEGMVKDLINRSGPSSG --------------3333------------------------ >CYTOCHROME C3; SWP:P00131; PDB:2CTHA; APKAPADGLKMEATKQPVVFNHSTHKSVKCGDCHHPVNGKEDYRKCGTAGCHDSMDKKDK 
--------------------333311113333----iiii----1111-------1111- SAKGYYHVMHDKNTKFKSCVGCHVEVAGADAAKKKDLTGCKKSKCHE 11113333----------------------------------3333- >VIGILIN; SWP:Q00341; PDB:2CTJA; GSSGSSGSIQKDLANIAEVEVSIPAKLHNSLIGTKGRLIRSIMEECGGVHIHFPVEGSGS -----------------------3333----------------3333-------3333-- DTVVIRGPSSDVEKAKKQLLHLAEEKQTKSGPSSG -------3333------------------------ >VIGILIN; SWP:Q00341; PDB:2CTKA; GSSGSSGKEALEALVPVTIEVEVPFDLHRYVIGQKGSGIRKMMDEFEVNIHVPAPELQSD -----------------------33333333-----3333-------------3333--- IIAITGLAANLDRAKAGLLERVKELQAEQEDRALRSFKSGPSSG ------------------------------3333---------- >VIGILIN; SWP:Q00341; PDB:2CTLA; GSSGSSGEQEDRALRSFKLSVTVDPKYHPKIIGRKGAVITQIRLEHDVNIQFPDKDDGNQ ---------33331111------3333--------------------------3333--- PQDQITITGYEKNTEAARDAILRIVGELEQMSGPSSG ---------------------------3333------ >VIGILIN; SWP:Q00341; PDB:2CTMA; GSSGSSGRIVGELEQMVSEDVPLDHRVHARIIGARGKAIRKIMDEFKVDIRFPQSGAPDP -----------3333--------11113333-----1111-------------1111-11 NCVTVTGLPENVEEAIDHILNLEEEYLADSGPSSG 11-----3333--------------3333------ >NOVEL PROTEIN; SWP:NA; PDB:2CTOA; GSSGSSGMPNRKASRNAYYFFVQEKIPELRRRGLPVARVADAIPYCSSDWALLREEEKEK ---------------3333------------------3333-3333-3333--------- YAEMAREWRAAQGKDPGPSEKQKPVFTSGPSSG --------------------------------- >DNAJ HOMOLOG SUBFAMILY B ; SWP:Q9NXW2; PDB:2CTPA; GSSGSSGDYYEILGVSRGASDEDLKKAYRRLALKFHPDKNHAPGATEAFKAIGTAYAVLS ------------------------------3333-3333--3333------------333 NPEKRKQYDQFGSGPSSG 33333------------- >DNAJ HOMOLOG SUBFAMILY C ; SWP:Q9UKB3; PDB:2CTQA; GSSGSSGMDAILNYRSEDTEDYYTLLGCDELSSVEQILAEFKVRALECHPDKHPENPKAV --------------------3333----3333------------3333-3333--3333- ETFQKLQKAKEILTNEESRARYDHWRRSQMSMPFQQWEALNDSVKTSGPSSG ---------------------------------------------------- >DNAJ HOMOLOG SUBFAMILY B ; SWP:Q9UBS3; PDB:2CTRA; GSSGSSGSYYDILGVPKSASERQIKKAFHKLAMKYHPDKNKSPDAEAKFREIAEAYETLS -------3333----11113333------------------3333--------------- 
DANRRKEYDTLGHSAFTSGKGQSGPSSG ------------3333------------ >CITRATE SYNTHASE; SWP:P00889; PDB:2CTS; ASSTNLKDILADLIPKEQARIKTFRQQHGNTAVGQITVDMMYGGMRGMKGLVYETSVLDP ----3333-------------------3333-----3333-------------------- DEGIRFRGYSIPECQKMLPKAKGGEEPLPEGLFWLLVTGQIPTEEQVSWLSKEWAKRAAL ------------------------------------------3333--------1111-- PSHVVTMLDNFPTNLHPMSQLSAAITALNSESNFARAYAEGIHRTKYWELIYEDCMDLIA 3333----------------------------3333-1111-3333-------------- KLPCVAAKIYRNLYREGSSIGAIDSKLDWSHNFTNMLGYTDAQFTELMRLYLTIHSDHEG -----------------------1111-------------3333--------1111---- GNVSAHTSHLVGSALSDPYLSFAAAMNGLAGPLHGLANQEVLVWLTQLQKEVGKDVSDEK ----------3333--3333-------1111----------------------------- LRDYIWNTLNSGRVVPGYGHAVLRKTDPRYTCQREFALKHLPHDPMFKLVAQLYKIVPNV -------3333---2222----------------------1111---------------- LLEQGKAKNPWPNVDAHSGVLLQYYGMTEMNYYTVLFGVSRALGVLAQLIWSRALGFPLE ------------3333------------1111--------------------1111---- RPKSMSTDGLIKLVDSK ----------------- >DNAJ HOMOLOG SUBFAMILY A ; SWP:Q96EY1; PDB:2CTTA; GSSGSSGMELTFNQAAKGVNKEFTVNIMDTCERCNGKGNEPGTKVQHCHYCGGSGMETIN -------------------------------1111------------------------- TGPFVMRSTCRRCGGRGSIIISPCVVCRGAGQAKQKKRSGPSSG --------------------------%%%%-------------- >ZINC FINGER PROTEIN 483; SWP:Q8TF39; PDB:2CTUA; GSSGSSGKRQKIHLGDRSQKCSKCGIIFIRRSTLSRRKTPMCEKCRKDSCQEAALNKDEG -----------------------------------------3333--------------- NESGKKTSGPSSG ------------- >DNAJ HOMOLOG SUBFAMILY C ; SWP:P60904; PDB:2CTWA; GSSGSSGRQRSLSTSGESLYHVLGLDKNATSDDIKKSYRKLALKYHPDKNPDNPEAADKF -----------------3333----11113333------------3333----------- KEINNAHAILTDATKRNIYDKYGSLGLYVAEQFGEENVNTYFVSGPSSG ----------------------3333--3333-3333------------ >ALPHA-COBRATOXIN; SWP:P25671; PDB:2CTX; IRCFITPDITSKDCPNGHVCYTKTWCDAFCSIRGKRVDLGCAATCPTVKTGVDIQCCSTD -------------1111---------1111------------------2222------22 NCNPFPTRKRP 22--------- >O-ACETYL-L-HOMOSERINE SUL; SWP:Q93I77; PDB:2CTZA; 
MRFETLQLHAGYEPEPTTLSRQVPIYPTTSYVFKSPEHAANLFALKEFGNIYSRIMNPTV -33331111-----3333-----------------------1111-------11113333 DVLEKRLAALEGGKAALATASGHAAQFLALTTLAQAGDNIVSTPNLYGGTFNQFKVTLKR ----------------------------------2222--------3333---------- LGIEVRFTSREERPEEFLALTDEKTRAWWVESIGNPALNIPDLEALAQAAREKGVALIVD --------11113333-11111111----------------------------------- NTFGMGGYLLRPLAWGAALVTHSLTKWVGGHGAVIAGAIVDGGNFPWEGGRYPLLTEPQP 1111iiii--3333--------3333-------------------------3333---33 GYHGLRLTEAFGELAFIVKARVDGLRDQGQALGPFEAWVVLLGMETLSLRAERHVENTLH 33---3333-!!!!---------------------------------------------- LAHWLLEQPQVAWVNYPGLPHHPHHDRAQKYFKGKPGAVLTFGLKGGYEAAKRFISRLKL -------1111----1111------------iiii--------1111-------1111-- ISHLANVGDTRTLAIHPASTTHSQLSPEEQAQAGVSPEMVRLSVGLEHVEDLKAELKEAL ---------------33331111-------1111-1111--------3333-----1111 A - >INOSINE-5'-MONOPHOSPHATE ; SWP:O58045; PDB:2CU0A; KFVEKLEKAIKGYTFDDVLLIPQATEVEPKDVDVSTRITPNVKLNIPILSAAMDTVTEWE 3333-1111----3333-----------1111------1111----------1111---- MAVAMAREGGLGVIHRNMGIEEQVEQVKRVKRAEKYKNAVRDENGELLVAAAVSPFDIKR -----------------------------1111--1111--1111--------1111--- AIELDKAGVDVIVVDTAHAHNLKAIKSMKEMRQKVDADFIVGNIANPKAVDDLTFADAVK ------------------------------3333-----------33331111------- VGIGPGSICTTRIVAGVGVPQITAVAMVADRAQEYGLYVIADGGIRYSGDIVKAIAAGAD -----1111-3333-----------------3333-----------3333----1111-- AVMLGNLLAGTKEAPGKEVIINGRKYKQYRGMGSLGAMMKYMKTRKFVPEGVEGVVPYRG ----3333--3333------iiii------11113333---------------------- TVSEVLYQLVGGLKAGMGYVGARNIRELKEKGEFVIITHAGIKESHPHDIIITNEAPN -----------------1111---------------------3333------------ >putative mannose-1-phosph; SWP:Q5SHI0; PDB:2CU2A; MKTYALVMAGGRGERLWPLSREDRPKPFLPLFEGKTLLEATLERLAPLVPPERTLLAVRR ------------3333----11113333--2222----------3333-3333-----11 DQEAVARPYADGIRLLLEPLGRDTAGAVLLGVAEALKEGAERLLVLPADHYVGDDEAYRE 11-1111----------------------------------------------------- ALATMLEAAEEGFVVALGLRPTRPETEYGYIRLGPREGAWYRGEGFVEKPSYAEALEYIR 
---------2222-----------------------!!!!-------------------- KGYVWNGGVFAFAPATMAELFRRHLPSHHEALERLLAGASLEEVYAGLPKISIDYGVMEK ------------3333--------3333------1111------1111---3333-3333 AERVRVVLGRFPWDDVGNWRALERVFSQDPHENVVLGEGRHVALDTFGCVVYADRGVVAT -----------------3333-------3333---------------------------- LGVSGLVVAKVGDEVLVVPKDWAREVREVVKRLEA ----------!!!!----3333------------- >UNKNOWN FUNCTION PROTEIN; SWP:Q5SKG8; PDB:2CU3A; VWLNGEPRPLEGKTLKEVLEEGVELKGVAVLLNEEAFLGLEVPDRPLRDGDVVEVVALQG --iiii---2222----------3333----!!!!--1111------2222--------- >CONSERVED HYPOTHETICAL PR; SWP:Q5SHL7; PDB:2CU5A; MEGVVRLEVPTPEEGFVNITRKVEAALSGHTGLVYLFVPHTTCGLTVQEGADPTVAQDLL --------------------------%%%%------------------------------ GRLAELAPRHRPQDRHLEGNSHAHLKSLLTGVHLLLLAEKGRLRLGRWQQVFLAEFDGPR ----------3333-33333333---------------iiii---1111----------- VREVWVRLL --------- >DTDP-4-KETO-L-RHAMNOSE RE; SWP:Q53W28; PDB:2CU6A; PLEAQAWALLEAVYDPELGLDVVNLGLIYDLVVEPPRAYVRTLTTPGCPLHDSLGEAVRQ ---------------1111----------------------------------------- ALSRLPGVEEVEVEVTFEPPWTLARLSEKA ----2222-------------3333-3333 >KIAA1915 PROTEIN; SWP:Q5VVJ2; PDB:2CU7A; GSSGSSGYSVKWTIEEKELFEQGLAKFGRRWTKISKLIGSRTVLQVKSYARQYFKNKVKC ------------------------------3333-------------------------- GLDKETPNQKTG 3333-------- >CYSTEINE-RICH PROTEIN 2; SWP:P52943; PDB:2CU8A; GSSGSSGMASKCPKCDKTVYFAEKVSSLGKDWHKFCLKCERCSKTLTPGGHAEHDGKPFC -------------------3333---%%%%--1111-----------------iiii--- HKPCYATLFGSGPSSG ---------------- >HISTONE CHAPERONE CIA1; SWP:O74515; PDB:2CU9A; MSIVNILSVNVLNNPAKFSDPYKFEITFECLEPLKSDLEWKLTYVGSATSQSYDQILDTL ----------------1111--------------------------2222---------- LVGPIPIGINKFVFEADPPNIDLLPQLSDVLGVTVILLSCAYEDNEFVRVGYYVNNEMEG -------------------3333--3333------------iiii------------222 LNLQEMDDAEIKKVKVDISKVWRSILAEKPRVTRFNIQWDN 23333-3333------3333-----1111--------!!!! 
>CUA; SWP:P98052; PDB:2CUAA; AGKLERVDPTTVRQEGPWADPAQAVVQTGPNQYTVYVLAFAFGYQPNPIEVPQGAEIVFK -------3333----11113333----------------------------2222----- ITSPDVIHGFHVEGTNINVEVLPGEVSTVRYTFKRPGEYRIICNQYCGLGHQNMFGTIVV -----------2222------2222----------------------1111--------- KE -- >CYTOPLASMIC PROTEIN NCK1; SWP:P16333; PDB:2CUBA; GSSGSSGDPGERLYDLNMPAYVKFNYMAEREDELSLIKGTKVIVMEKCSDGWWRGSYNGQ -----------------------------3333---2222-------3333-----iiii VGWFPSNYVTEEGDSPLGDHVGSGPSSG ----3333-------------------- >SH3 DOMAIN CONTAINING RIN; SWP:Q8BZT2; PDB:2CUCA; GSSGSSGNMFVALHTYSAHRPEELDLQKGEGIRVLGKYQDGWLKGLSLLTGRTGIFPSDY --------------------------------------------------------3333 VIPVSGPSSG ---------- >SRC-LIKE-ADAPTER; SWP:Q13239; PDB:2CUDA; GSSGSSGPLPNPEGLDSDFLAVLSDYPSPDISPPIFRRGEKLRVISDEGGWWKAISLSTG ------------------------------------2222-------!!!!--------- RESYIPGICVARVSGPSSG -----3333---------- >PAIRED BOX PROTEIN PAX6; SWP:P26367-2; PDB:2CUEA; GSSGSSGQRNRTSFTQEQIEALEKEFERTHYPDVFARERLAAKIDLPEARIQVWFSNRRA --------------3333--------------3333--3333----3333------3333 KWRREEKLRNQRRQSGPSSG --------3333-------- >FLJ21616 PROTEIN; SWP:Q6NT76; PDB:2CUFA; GSSGSSGRGSRFTWRKECLAVMESYFNENQYPDEAKREEIANACNAVIQKPGKKLSDLER -------------------------------------------------2222--3333- VTSLKVYNWFANRRKEIKRRANIAAILESSGPSSG ----------------------------------- >MKIAA0962 PROTEIN; SWP:Q80TN4; PDB:2CUGA; GSSGSSGILQSLSALDFDPYRVLGVSRTASQADIKKAYKKLAREWHPDKNKDPGAEDRFI -------------------------3333-------------11111111---------- QISKAYEILSNEEKRTNYDHYGSGPSSG --------------------%%%%---- >TENASCIN-X; SWP:P22105; PDB:2CUHA; GSSGSSGPDGPTQLRALNLTEGFAVLHWKPPQNPVDTYDIQVTAPGAPPLQAETPGSAVD -------------------%%%%-------------------------------1111-- YPLHDLVLHTNYTATVRGLRGPNLTSPASITFTTGLEAPRDLEAKEVTPSGPSSG ------------------------------------------------------- >TENASCIN-X; SWP:P22105; PDB:2CUIA; GSSGSSGSRPRLSQLSVTDVTTSSLRLNWEAPPGAFDSFLLRFGVPSPSTLEPHPRPLLQ 
-----------------------------------------------3333--------- RELMVPGTRHSAVLRDLRSGTLYSLTLYGLRGPHKADSIQGTARTLSGPSSG -----3333------------------------------------------- >TRANSCRIPTIONAL ADAPTOR 2; SWP:Q8CHV6; PDB:2CUJA; GSSGSSGIDSGLSPSVLMASNSGRRSAPPLNLTGLPGTEKLNEKEKELCQVVRLVPGAYL ----------------------------------2222---3333----1111-3333-3 EYKSALLNECHKQGGLRLAQARALIKIDVNKTRKIYDFLIREGYITKA 333-------------3333--1111-3333----------------- >GLYCERATE DEHYDROGENASE/G; SWP:Q5SMG6; PDB:2CUKA; MRVLVTRTLPGKALDRLRERGLEVEVHRGLFLPKAELLKRVEGAVGLIPTVEDRIDAEVM -----------3333--1111-----------3333----2222-----1111------- DRAKGLKVIACYSVGVDHVDLEAARERGIRVTHTPGVLTEATADLTLALLLAVARRVVEG ---------------1111-----1111-------1111-------------1111---- AAYARDGLWKAWHPELLLGLDLQGLTLGLVGMGRIGQAVAKRALAFGMRVVYHARTPKPL ---1111-----1111-----2222----------------------------------- PYPFLSLEELLKEADVVSLHTPLTPETHRLLNRERLFAMKRGAILLNTARGALVDTEALV -----------------------3333--------11112222------3333------- EALRGHLFGAGLDVTDPEPLPPGHPLYALPNAVITPHIGSAGRTTRERMAEVAVENLLAV 1111----------------111133331111-----1111------------------1 LEGREPPNPVV 111-------- >Glucose-inhibited divisio; SWP:Q5SH33; PDB:2CULA; AAYQVLIVGAGFSGAETAFWLAQKGVRVGLLTQSLDAVMMPFLPPKPPFPPGSLLERAYD ----------3333-------1111--------1111------------2222------1 PKDERVWAFHARAKYLLEGLRPLHLFQATATGLLLEGNRVVGVRTWEGPPARGEKVVLAV 111----------------1111------------!!!!-----3333-----------! 
GSFLGARLFLGGVVEEAGRLSEASYPDLLEDLSRLGFRFVEREGEVPPGYRVRYLAFHPE !!!------!!!!-----2222-----------------------------------333 EWEEKTFRLKRLEGLYAVGLCVREGDYARMSEEGKRLAEHLLHEL 3-------3333------1111----------------------- >TENASCIN-X; SWP:Q6IPK3; PDB:2CUMA; GSSGSSGLEAPRDLEAKEVTPRTALLTWTEPPVRPAGYLLSFHTPGGQTQEILLPGGITS -------------------1111------------------------------------- HQLLGLFPSTSYNARLQAMWGQSLLPPVSTSFTTGGLRISGPSSG --------------------------------------------- >PHOSPHOGLYCERATE KINASE; SWP:O58965; PDB:2CUNA; MFRLEDFNFHNKTVFLRVDLNSPMKDGKIISDARFKAVLPTIRYLIESGAKVVIGTHQGK --3333--2222------------iiii-------------------------------2 PYSEDYTTTEEHARVLSELLDQHVEYIEDIFGRYAREKIKELKSGEVAILENLRFSAEEV 222-----------------------------------11112222-----1111-3333 KNKPIEECEKTFLVKKLSKVIDYVVNDAFATAHRSQPSLVGFARIKPMIMGFLMEKEIEA ---33331111-----3333-------3333----33331111----------------- LMRAYYSKDSPKIYVLGGAKVEDSLKVVENVLRRERADLVLTGGLVANVFTLAKGFDLGR -------------------------------1111----------------3333----- KNVEFMKKKGLLDYVKHAEEILDEFYPYIRTPVDFAVDYKGERVEIDLLSENRGLLHQYQ ------------------------3333----------iiii----1111---------- IMDIGKRTAEKYREILMKARIIVANGPMGVFEREEFAIGTVEVFKAIADSPAFSVLGGGH ---------------1111----------3333-------------------------33 SIASIQKYGITGITHISTGGGAMLSFFAGEELPVLRALQISYEKF 33--1111-----------------1111---------------- >SKELETAL MUSCLE LIM-PROTE; SWP:Q13642; PDB:2CUPA; GSSGSSGCVECRKPIGADSKEVHYKNRFWHDTCFRCAKCLHPLANETFVAKDNKILCNKC -----------------------%%%%--1111---------1111----%%%%--3333 TTREDSPKCKGCFKAIVAGDQNVEYKGTVWHKDCFSGPSSG ----------------3333----------3333------- >FOUR AND A HALF LIM DOMAI; SWP:Q13643; PDB:2CUQA; GSSGSSGPCYENKFAPRCARCSKTLTQGGVTYRDQPWHRECLVCTGCQTPLAGQQFTSRD -------------------------------%%%%--3333-----------------%% EDPYCVACFGELFASGPSSG %%--3333------------ >SKELETAL MUSCLE LIM-PROTE; SWP:Q13642; PDB:2CURA; GSSGSSGCVKCNKAITSGGITYQDQPWHADCFVCVTCSKKLAGQRFTAVEDQYYCVDCYK ---------------------%%%%--1111----------------------------- NFVSGPSSG 
--------- >MALONYL COA-[ACYL CARRIER; SWP:Q5SL77; PDB:2CUYA; MYAALFPGQGSHRVGMGRALYEASPAAKEVLDRAEAALPGLLKLMWEGPEEALTLTENQQ -------2222-2222---------------------2222-------3333--1111-- PALLAAGYAAYRAFLEAGGKPPALAAGHSLGEWTAHVAAGTLELEDALRLVRLRGRYMQE ----------------------------3333---------------------------- AVPVGEGAMAAVLKLPLEEIQKALEGLEGVEIANLNAPEQTVISGRRQAVEEAAERLKER --2222---------3333----2222------------------------------111 RARVVFLPVSAPFHSSLMAPARKRLAEDLAQVPLRRPRFPVYSNVTARPEEDPERIRALL 1-------------1111---------3333----------------------------- LEQITAPVRWVEILRDMEARGVKRFLEFGSGEVLKGLVLRTLKEAEALSVQDPDSLRKAL -3333------------------------------------1111--------------- EVERA ----- >SEED STORAGE PROTEIN; SWP:NA; PDB:2CV6A; SVSRGKNNPFYFNSDRWFHTLFRNQFGHLRVLQRFDQRSKQMQNLENYRVVEFNSKPNTL -------1111------------1111-------33331111--1111------------ LLPHHADADFLLVVLNGRAVLTLVNPDGRDSNILEQGHAQKIPAGTTFFLVNPDDEENLR ------------------------1111------2222----2222-------------- IIKLAVPVNNPHRFQDFFLSSTEAQQSYLQGFSKNILEASFDSDIKEISRVLFGSQQEGV ---------1111-------------3333------------------------------ IVELKREQIRELTKHADQPFNLRNQKPIYSNKLGRWFEITPEKNPQLRDLDMFIRSVDMK --------------------1111------1111-----3333-3333-----------2 EGSLLLPHYNSKAIVILVINEGKANIELVGQREESWEVQRYRAELSEDDVFIIPATYPVA 222--------------------------------------------------2222--- INATSNLNFFAFGINAENNQRNFLAGEKDNVISEIPTEVLDVTFPASGEKVQKLIKKQSE ---------------2222----------3333------1111---3333---------- SQFVDA ------ >TRNA-SPLICING ENDONUCLEAS; SWP:Q975R3; PDB:2CV8A; IGELVKDKILIKNIEDARLIYKGYYGKPIGSELILSLIEGVYLVKKGKLEIVSNGERLDF ----!!!!-----------------------------------1111------------- ERLYQIGVTQIPRFRILYSVYEDLREKGYVVRSGIKYGADFAVYTIGPPYLVIALDENSQ ----------2222----------1111------1111-----------------1111- ISSNEILGFGRVSKELILGIVNLTNGKIRYIFKWLK -1111------------------------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SKL8; PDB:2CV9A; RVLFIGDVAEPGLRAVGLHLPDIRDRYDLVIANGENAARGKGLDRRSYRLLREAGVDLVS 
------------------33333333-------1111iiii----------3333----- LGNHAWDHKEVYALLESEPVVRPLNYPPGTPGKGFWRLEVGGESLLFVQVGRIFDPLDDP -1111--------1111-----33332222---------iiii----------------- FRALDRLLEEEKADYVLVEVHAEATSEKALAHYLDGRASAVLGTHTHVPTLDATRLPKGT -----------------------3333------2222------------------1111- LYQTDVGTGTYHSIIGGEVETFLARFLTGRPQPFRAAQGKARFHATELVFEGGRPVAISP -------------iiii---------------------------------iiii------ YVWEEP ------ >PROBABLE THIOL-DISULFIDE ; SWP:Q5SKQ0; PDB:2CVBA; LQYPELPLESPLIDAELPDPRGGRYRLSQFHEPLLAVVFCNHCPYVKGSIGELVALAERY ------------------1111---1111----------------1111----------2 RGKVAFVGINANDYEKYPEDAPEKAAFAEEHGIFFPYLLDETQEVAKAYRALRTPEVFLF 222-------------33333333---------------1111----------------- DERRLLRYHGRVNDNPKDPSKVQSHDLEAAIEALLRGEEPPLKEAPAIGCTIKWRPGNEP 1111----------33331111-----------1111-----------------2222-- EVRIG ----- >HIGH-MOLECULAR-WEIGHT CYT; SWP:P24092; PDB:2CVCA; EKRADLIEIGAMERFGKLDLPKVAFRHDQHTTAVTGMGKDCAACHKSKDGKMSLKFMRLD ---------1111---------------------1111-3333----iiii--------- DNSAAELKEIYHANCIGCHTDLAKAGKKTGPQDGECRSCHNPKPSAASSWKEIGFDKSLH ----------------------1111-----11113333--------------------- YRHVASKAIKPVGDPQKNCGACHHVYDEASKKLVWGKNKEDSCRACHGEKPVDKRPALDT --1111-----------3333------1111----2222--3333------!!!!----- AAHTACISCHMDVAKTKAETGPVNCAGCHAPEAQAKFKVVREVPRLDRGQPDAALILPVP -------------1111------3333---3333-------------------------- GKDAPREMKGTMKPVAFDHKAHEAKANDCRTCHHVRIDTCTACHTVNGTADSKFVQLEKA ----------------------------3333-------3333-11113333-------- MHQPDSMRSCVGCHNTRVQQPTCAGCHGFIKPTKSDAQCGVCHVAAPGFDAKQVEAGALL --1111-------------3333--1111-----3333-------22223333---1111 NLKAEQRSQVAASMLSARPQPKGTFDLNDIPEKVVIGSIAKEYQPSEFPHRKIVKTLIAG --3333--------1111-------1111-------1111-------------------- IGEDKLAATFHIEKGTLCQGCHHNSPASLTPPKCASCHGKPDRPGLKAAYHQQCMGCHDR !!!!--------11111111------------3333------------------------ MKIEKPANTACVDCHKERAK ------1111-1111----- >GLUTATHIONE-REQUIRING PRO; SWP:O60760; PDB:2CVDA; 
PNYKLTYFNMRGRAEIIRYIFAYLDIQYEDHRIEQADWPEIKSTLPFGKIPILEVDGLTL -----------1111------------------3333-------1111------iiii-- HQSLAIARYLTKNTDLAGNTEMEQCHVDAIVDTLDDFMSCFPWAEKKQDVKEQMFNELLT ----------2222-----------------------33331111-3333---------- YNAPHLMQDLDTYLGGREWLIGMSVTWADFYWEICSTTLLVFKPDLLDNHPRLVTLRKKV -------------!!!!-------------------------11111111---------- QAIPAVANWIKRRPQTKL ------------------ >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SJF5; PDB:2CVEA; SLTLADKVVYEEEIQKSRFIAKAAPVASEEEALAFLAENREPEATHNGHAYKIGLLYRFS -------------iiii-----------------------1111--------!!!!---- DDGEPSGTAGRPILHAIEAQGLDRVAVLVVRYFGGVKLGAGGLVRAYGGVAAEALRRAPK iiii2222---------------------------------------------------- VPLVERVGLAFLVPFAEVGRVYALLEARALKAEETYTPEGVRFALLLPKPEREGFLRALL -------------3333--------1111-------1111-------3333--------- DATRGQVALE --iiii---- >DNA REPAIR AND RECOMBINAT; SWP:P95547; PDB:2CVHA; MLSTGTKSLDSLLGGGFAPGVLTQVYGPYASGKTTLALQTGLLSGKKVAYVDTEGGFSPE ---------------------------2222----------3333--------------- RLVQMAETRGLNPEEALSRFILFTPSDFKEQRRVIGSLKKTVDSNFALVVVDSITAHYRA ------1111-3333---------------------3333--1111-------------- EENRSGLIAELSRQLQVLLWIARKHNIPVIVINQVHFDSRTEMTKPVAEQTLGYRCKDIL ----------------------------------------------------3333---- RLDKLPKPGLRVAVLERHRFRPEGLMAYFRITERGIEDVE ------2222---------------------1111----- >75AA LONG HYPOTHETICAL RE; SWP:O73983; PDB:2CVIA; MVTAFILMVTAAGKEREVMEKLLAMPEVKEAYVVYGEYDLIVKVETDTLKDLDQFITEKI ----------2222----------3333-----------------------------333 RKMPEIQMTSTMIAILEHHHHHH 3-3333----------1111--- >THIOREDOXIN REDUCTASE REL; SWP:Q5SLC3; PDB:2CVJA; WDVIVVGGGPSGLSAALFLARAGLKVLVLDGGRSKVKGVSRVPNYPGLLDEPSGEELLRR ---------------------------------1111-------2222------------ LEAHARRYGAEVRPGVVKGVRDGGVFEVETEEGVEKAERLLLCTHKDPTLPSLLGLTRRG -----1111--------------------3333----------!!!!-----------!! 
AYIDTDEGGRTSYPRVYAAGVARGKVPGHAIISAGDGAYVAVHLVSDLRGEPYKDHAL !!---1111---2222---3333----------------------------------- >THIOREDOXIN; SWP:Q72HU9; PDB:2CVKA; KPIEVTDQNFDETLGQHPLVLVDFWAEWCAPCRIAPILEEIAKEYEGKLLVAKLDVDENP -----3333---1111---------11113333-----------2222------3333-3 KTARYRVSIPTVILFKDGQPVEVLVGAQPKRNYQAKIEKHLP 333------------iiii---------3333----3333-- >PROTEIN TRANSLATION INITI; SWP:Q5SM06; PDB:2CVLA; EAVKTDRAPAAIGPYAQAVKAGGFVFVSGQIPLAPDGSLVEGDIRVQTERVENLKAVLEA --------------------iiii---------1111---------------------11 AGSGLSRVVQTTCFLADEDFPGFNEVYARYFTPPYPARATVAVKALPRGVRVEVACVALA 11-1111-------------------3333---------------2222----------- E - >PUTATIVE SEMIALDEHYDE DEH; SWP:Q6AV34; PDB:2CVOA; KSGEEVRIAVLGASGYTGAEIVRLLANHPQFRIKVMTADRKAGEQFGSVFPHLITQDLPN ----------------------------------------22223333-1111------- LVAVKDADFSNVDAVFCLPHGTTQEIIKGLPQELKIVDLSADFRLRDINEYAEWYGHSHR --3333--------------------1111----------1111---------------- APELQQEAVYGLTEVLRNEIRNARLVANPGCYPTSIQLPLVPLIKAKLIKVSNIIIDAKS 33331111-----------1111------------------------------------- GVSGAGRGAKEANLYTEIAEGIHAYGIKGHRHVPEIEQGLSEAAESKVTISFTPNLICMK 3333-----33333333------------1111--------------------------- RGMQSTMFVEMAPGVTANDLYQHLKSTYEGEEFVKLLNGSSVPHTRHVVGSNYCFMNVFE -----------22223333--------1111------!!!!--33332222--------- DRIPGRAIIISVIDNLVKGASGQAVQNLNLMMGLPENTGLQYQPLFP --2222-------1111------------1111-1111--------- >Ribonucleoside-diphosphat; SWP:P21524; PDB:2CVXA; TLAARIAISNLHKQTTKQFSKVVEDLYRYVNAATGKPAPMISDDVYNIVMENKDKLNSAI -3333------1111--3333-----------------------------------3333 VYDRDFQYSYFGFKTLERSYLLRINGQVAERPQHLIMRVALGIHGRDIEAALETYNLMSL 3333---------------------------------------!!!!------------- KYFTHASPTLFNAGTPKPQMSSCFLVAMKEDSIEGIYDTLKECALISKTAGGIGLHIHNI -----------2222------------------------------3333-------1111 RSTGSYIAGTNGTSNGLIPMIRVFNNTARYVDQGGNKRPGAFALYLEPWHADIFDFIDIR -2222-----------3333--------------------------1111----3333-- 
KNHGKEEIRARDLFPALWIPDLFMKRVEENGTWTLFSPTSAPGLSDCYGDEFEALYTRYE ----3333-1111------3333-------------3333--1111-------------- KEGRGKTIKAQKLWYSILEAQTETGTPFVVYKDACNRKSNQKNLGVIKSSNLCCEIVEYS --------3333--------------------------1111--------1111------ APDETAVCNLASVALPAFIETSEDGKTSTYNFKKLHEIAKVVTRNLNRVIDRNYYPVEEA 1111-----------1111-------------------------------------3333 RKSNMRHRPIALGVQGLADTFMLLRLPFDSEEARLLNIQIFETIYHASMEASCELAQKDG ---------------------1111-1111------------------------------ PYETFQGSPASQGILQFDMWDQKPYGMWDWDTLRKDIMKHGVRNSLTMAPMPTASTSQIL -1111--3333---3333------------------------------------------ GYNECFEPVTSNMYSFQVVNPYLLRDLVDLGIWDEGMKQYLITQNGSIQGLPNVPQELKD ---------------------------1111--3333----------2222--------- LYKTVWEISQKTIINMAADRSVYIDQSHSLNLFLRAPTMGKLTSMHFYGWKKGLKTGMYY ---1111-------------1111-------------3333--------3333------- LRTQ ---- >3-HYDROXYISOBUTYRATE DEHY; SWP:Q5SLQ6; PDB:2CVZA; EKVAFIGLGAGYPAGHLARRFPTLVWNRTFEKALRHQEEFGSEAVPLERVAEARVIFTCL ---------------3333---------3333-------------33333333------- PTTREVYEVAEALYPYLREGTYWVDATSGEPEASRRLAERLREKGVTYLDAPVSGGTSGA ------------3333-2222-------------------3333---------------- EAGTLTVLGGPEEAVERVRPFLAYAKKVVHVGPVGAGHAVKAINNALLAVNLWAAGEGLL -----------------33331111-------2222------------------------ ALVKQGVSAEKALEVINASSGRSNATENLIPQRVLTRAFPKTFALGLLVKDLGIAGVLDG --1111--------111111113333-------1111------3333------------- EKAPSPLLRLAREVYEAKRELGPDADHVEALRLLERWGGVEIR ---------------------11113333-------------- >SN4M; SWP:NA; PDB:2CW1A; MRKKLDLKKFVEDKNQEYAARALGLSQKLIEEVLKRGLPVYVETNKDGNIKVYITQDGIT -----3333-3333-------------------1111------------------iiii- QPFPP ----- >SUPEROXIDE DISMUTASE 1; SWP:Q8ISJ0; PDB:2CW2A; SVTGPFQCPPLPYVKNALEPHMSAETLTYHHDKHHQTYVDTLNSIAAENSTIASKTLEQI -------------1111------------------------------------------- IKTETGKPFNQAAQVYNHTFFFNNLAPNGGGEPTGKIAELITRDFGSFEKFKEDFSAAAV ---------------------11111111------------------------------- 
GHFGSGWVWLIADDGKLKIVQGHDAGNPIRESKTPLMNIDVWEHAYYIDYRNARAQYVKN ----------------------!!!!3333-----------3333----!!!!------- YWNLVNWDFVNDNVAKAGI ------------------- >IRON SUPEROXIDE DISMUTASE; SWP:Q8ISI9; PDB:2CW3A; PSSGLRMTLPYGLEALEPVISAATVDFHYNKHHQGYIQKLLDATGLPESRINLKSLVTLG -----------1111-------------------------------3333---------- PDRAGENVFNAAGQIYNHNMYWLSMVPTSGSGRHVPPRLLKLIRARWGNVDEMKENFMRK -11113333------------1111-2222------------------------------ ATALFGSGWIWLVWDTRERRLDLVGTKDAHSPLSEDAGKIPLFTCDVWEHAYYLDYQHDR ---------------1111-----------3333-------------33333333!!!!- AAYLTRWWSLINWEFADSNL -----3333-------1111 >BACTERIAL FLUORINATING EN; SWP:Q5SLF5; PDB:2CW5A; RPVYFLSDFGLEDPYVAVVKAVLAEAPGPAVVDLAHALPPQDLRRAAYALFEALPYLPEG ---------1111--------1111-------------2222----------3333-222 AVVLAVVDPGVGTARRAVAALGRWTYVGPDNGLFTLAWLLDPPRRAFLLEPPGRDVFAPA 2------1111--------------------11113333-------------1111---- AAHLALGLPPEGLGPEVPVETLARLPLALTEGPEGEVLTFDRFGNAITTLLRAPVGGFVE --------3333-----3333-------------------1111---------2222--- VGGRRVPVRRTFGEVPEGAPVAYLGSAGLLEVAVNRGSAREALGLKEGPVRLL iiii------1111-2222-----1111-----2222---1111--------- >HYDROXYMETHYLGLUTARYL-COA; SWP:P35914; PDB:2CW6A; TLPKRVKIVEVGPRDGLQNEKNIVSTPVKIKLIDMLSEAGLSVIETTSFVSPKWVPQMGD ----------------1111------------------------------33333333-- HTEVLKGIQKFPGINYPVLTPNLKGFEAAVAAGAKEVVIFGAASELFTKKNINCSIEESF ----------2222---------------1111---------------------3333-- QRFDAILKAAQSANISVRGYVSCALGCPYEGKISPAKVAEVTKKFYSMGCYEISLGDTIG ----------1111--------1111-------------------1111-------1111 VGTPGIMKDMLSAVMQEVPLAALAVHCHDTYGQALANTLMALQMGVSVVDSSVAGLGGCP ------------------3333------1111---------1111-------------11 YAQGASGNLATEDLVYMLEGLGIHTGVNLQKLLEAGNFICQALNRKTSSKVAQATC 11------------------------------------------------------ >ENDONUCLEASE PI-PKOII; SWP:P77933; PDB:2CW8A; SILPEEWLPVLEEGEVHFVRIGELIDREENAGKVKREGETEVLEVSGLEVPSFNRRTNKA --1111-----iiii--------------3333---!!!!-------------------- 
ELKRVKALIRHDYSGKVYTIRLKSGRRIKITSGHSLFSVRNGELVEVTGDELKPGDLVAV ---------------------1111-----1111-----iiii----1111--------- PRRLELPERNHVLNLVELLLGTPEEETLDIVTIPVKGKKNFFKGLRTLRWIFGEEKRPRT ------------------111111111111-------------------1111------3 ARRYLRHLEDLGYVRLKKIGYEVLDWDSLKNYRRLYEALVENVRYNGNKREYLVEFNSIR 333-----1111-------------------------------------------3333- DAVGIPLKELKEWKIGTLNGFRRKLIEVDESLAKLLGYYVSEGYARKQRNPKNGWSYSVK -333333333333----------------------------------------------- LYNEDPEVLDDERLASRFFGKVRRGRNYVEIPKKIGYLLFENCGVLAENKRIPEFVFTSP --------------3333---------------------------1111---3333---3 KGVRLAFLEGYFIGDGDVHPNKRLRLSTKSELLANQLVLLLNSVGVSAVKLGHDSGVYRV 333-------------------------------------3333---------------- YINEELPFVKLDKKKNAYYSHVIPKEVLSEVFGKVFQKNVSPQTFRKVEDGRLDPEKAQR -----1111--3333--3333-------------------3333---------3333--- LSWLIEGDVVLDRVESVDVEDYDGYVYDLSVEDNENFLVGFGLVYAHN 3333-----------------------------------2222----- >TRANSLOCASE OF INNER MITO; SWP:O43615; PDB:2CW9A; NAFIRASRALTDKVTDLLGGLFSKTESEVLTEILRVDPAFDKDRFLKQCENDIIPNVLEA ---------------11111111-------------1111------------3333---- ISGELDILKDWCYEATYSQLAHPIQQAKALGLQFHSRILDIDNVDLAGKVEQGPVLIITF ---------------------------1111------------------1111------- QAQLVVVRNPKGEVVEGDPDKVLRLYVWALCRDQDELNPYAAWRLLDISASSTEQI --------1111-----1111-----------1111-1111--------------- >SINGLE-STRAND BINDING PRO; SWP:Q5SLP9; PDB:2CWAA; RGLNRVFLIGALATRPDRYTPAGLAILDLTLAGQDLLREVSWYHRVRLLGRQAEWGDLLD -------------------1111--------------------------3333-2222-2 QGQLVFVEGRLEYRQSELQIRADFLDPLDDRGKERAEDSRGQPRLRAALNQVFLGNLTRD 222--------------------------2222----1111------------------- PELRYTPQGTAVARLGLAVNERERTHFVEVQAWRDLAEWAAELRKGDGLFVIGRLVNDSW -----1111------------------------------11112222------------- TSSSGERRFQTRVEALRLERPTR ----------------------- >chimera of Immunoglobulin; SWP:Q96S82; PDB:2CWBA; GSQWQPQLQQLRDMGIQDDELSLRALQATGGDIQAALELIFAGGAP -----------1111--3333-----------3333---------- >ADP-RIBOSYLGLYCOHYDROLASE; 
SWP:Q5SMG9; PDB:2CWCA; KERQDRRLGAFLGLAVGDALGAQVEGLPKGTFPEVREMKGGGPHRLPPGFWTDDTSQALC 3333-------------------22222222----------1111-2222---------- LAESLLQRGFDPKDQMDRYLRWYREGYATRRALERYAATGDPYAGDEAGAGNGPLMRLAP ---------------------------------------------1111----3333--- LVLAYENHPDLLSLARRAARTTHGAREALEATEVLAWLLREALRGAPKEALLALEPFRGA ----1111---------------------------------1111--------3333--- DLHPALRRVVEGGFWEAPEEGPGYAPGTLAAALWAFARGRDFEEGMRLAVNLGGDADTVG -----------1111--------------------------------------------- AVYGQLAGAYYGLGAIPGRWLRALHLREEMEALALALYRMSMAS -----------3333-3333------------------3333-- >low molecular weight phos; SWP:Q72JF6; PDB:2CWDA; DRPVRVLFVCLGNICRSPMAEGIFRKLLKERGLEDRFEVDSAGTGAWHVGEPMDPRARRV --------------------------------1111---------1111----------- LEEEGAYFPHVARRLTREDVLAYDHILVMDRENLEEVLRRFPEARGKVRLVLEELGGGEV ----------------------------------------3333-----1111------- QDPYYGDLEDFREVYWTLEAALQAFLDRHG --11113333-------------------- >PUTATIVE ENDONUCLEASE; SWP:Q9YBU8; PDB:2CWJA; APKPVGPYSQAVESGCFMFVSGQIPINPETGALEEGGFKESAKRALDNLKAIVEGAGYSM --------------------------1111---------------------3333----- DDIVKVTVYITDISRFSEFNEVYREYFNRPYPARAVVGVAALPLGAPLEVEAVLYT -----------2222-3333---------------------2222----------- >MANGANESE-FREE PSEUDOCATA; SWP:Q5SM21; PDB:2CWLA; MFLRIDRLQIELPMPKEQDPNAAAAVQALLGGRFGEMSTLMNYMYQSFNFRGKKALKPYY -------------------------------1111--------------------3333- DLIANIATEELGHIELVAATINSLLAKNPGKDLEEGVDPESAPLGFAKDVRNAAHFIAGG ---------------------------2222------333311113333--3333----- ANSLVMGAMGEHWNGEYVFTSGNLILDLLHNFFLEVAARTHKLRVYEMTDNPVAREMIGY ------1111---1111---------------------------1111------------ LLVRGGVHAAAYGKALESLTGVEMTKMLPIPKIDNSKIPEAKKYMDLGFHRNLYRFSPED ----------------------3333-------3333----------1111--------- YRDLGLIWKGASPEDGTEVVVVDGPPTGGPVFDAGHDAAEFAPEFHPGELYEIAKKLYE --3333-------------------------------1111-------------3333- >TRANSALDOLASE; SWP:Q93092; PDB:2CWNA; SGSALDQLKQFTTVVADTGDFNAIDEYKPQDATTNPSLILAAAQMPAYQELVEEAIAYGK 
-------3333--------11113333-----------------3333------------ KLGGPQEEQIKNAIDKLFVLFGAEILKKIPGRVSTEVDARLSFDKDAMVARARRLIELYK ------------------------1111---------3333------------------1 EAGVGKDRILIKLSSTWEGIQAGKELEEQHGIHCNMTLLFSFAQAVACAEAGVTLISPFV 111-3333----------------------------------------1111-------- GRILDWHVANTDKKSYEPQGDPGVKSVTKIYNYYKKFGYKTIVMGASFRNTGEIKALAGC -----------------1111----------------------------3333------- DFLTISPKLLGELLKDNSKLAPALSVKAAQTSDSEKIHLDEKAFRWLHNEDQMAVEKLSD ------------------------33331111---------------------------- GIRKFAADAIKLERMLTERMF --------------------- >RNA SILENCING SUPPRESSOR; SWP:Q08545; PDB:2CWOA; MKFFLKDGETSRALSRSESLLRRVKELGTNSQQSEISECVDEFNELASFNHLLVTVEHRE -----------------------1111----3333------------------------- WMEQRIGEMLKEIRAFLKVRVVTPMHKETASDTLNAFLEEYCRITGLAREDALREKMRKV -----------------1111-1111---------------------------------- KSVVLFHHSELLKFEVTENMFSYTELLKLNLSLRVISSQILGMAI ----------------1111--3333------------------- >METRS RELATED PROTEIN; SWP:O58023; PDB:2CWPA; MELYDVDEFWKFQMKVGLVKKAEKIKRTKKLIKLIVDFGNEERTIVTGIADQIPPEELEG ----33333333------------------------------------1111-3333222 KKFIFVVNLKPKKFSGVESQGMLILAETEDGKVYLIPVPEEVPVGARVW 2------------iiii----------1111-------33332222--- >HYPOTHETICAL PROTEIN TTHA; SWP:NA; PDB:2CWQA; RTHERVLQAAENLGEGLPRAIPLLAEKAPGLLLEHGRSWTYAPEKGALDEKTRTLILLGI ------------!!!!-3333--------------------------------------- ALATGSEACVKAAHRAKRLGLSKEALLETLKIARQAQANAVLGHAAPLLEVL ---------------------------------------------3333--- >CHITINASE; SWP:Q8U1H5; PDB:2CWRA; PVSGSLEVKVNDWGSGAEYDVTLNLDGQYDWTVKVKLAPGATVGSFWSANKQEGNGYVIF -------------------------------------2222------------2222--- TPVSWNKGPTATFGFIVNGPQGDKVEEITLEINGQVI --1111-------------------------iiii-- >ALGINATE LYASE A1-II'; SWP:Q75WP3; PDB:2CWSA; AAPGKNFDLSHWKLQLPDANTTEISSANLGLGYTSQYFYTDTDGAMTFWAPTTGGTTANS -3333------------1111-------1111--1111--1111------1111------ SYPRSELREMLDPSNSKVNWGWQGTHTMKLSGKTVQLPSSGKIIVAQIHGIMDDGTNAPP 
-----------1111----------------------3333----------1111----- LVKAVFQDGQLDMQVKQNSDGTGSDVHNYFTGIKLGDLYNMEIRVTDGVAYVTMNGDTRS ---------------------------------2222----------------iiii--- VDFVGKDAGWKNLKYYFKAGNYVQDNTSTGGSAIAKLYSLSVSHSNL -3333----1111-------------1111----------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SM75; PDB:2CWYA; VPDWEEVLGLWRAGRYYEVHEVLEPYWLKATGEERRLLQGVILLAAALHQRRLGRPGLRN ----------1111------------------------------------1111------ LRKAEARLEGLPCPLGLDWRSLLQEARRRLGA -------2222------3333----------- >THIOESTERASE FAMILY PROTE; SWP:Q5SJP1; PDB:2CWZA; RPIPEGYEAVFETVVTPETVRFEELGPVHPVYATYWVKHELAGRKIILPFLEEGEEGIGS ---2222-------------------------3333----------3333-2222----- YVEARHLASALPGRVRVVARHEKTEGNRVYARVEAYNELGDLIGVGRTEQVILPKAKVEA ------------------------!!!!--------1111-------------------- LFRRLKERWEAER ------------- >HYPOTHETICAL PROTEIN APE0; SWP:NA; PDB:2CX1A; HLWARLVGLARLEARALSKKERRSLLERLKPYYTRIPFSEKADLRLVKARTDSGEYEIIT ----------------------------3333------1111--------1111------ VDGVPCLFEWSDGRIYPTLQCLKAFGVDWLKGVVLVDKGAAIALAKGAHLIPGVVGVEGS iiii-----1111-------------1111-------------1111---1111------ FTRGDVVAALYHETRTPVVGVAEVDSSALEKLYREKARGRAVRRVHRLGDALWELAQEVG -2222------1111-----------------1111----------2222---------- K - >BACTERIOFERRITIN COMIGRAT; SWP:NA; PDB:2CX4A; GLVELGEKAPDFTLPNQDFEPVNLYEVLKRGRPAVLIFFPAAFSPVCTKELCTFRDKAQL ---2222--------1111--------3333---------2222---------------1 EKANAEVLAISVDSPWCLKKFKDENRLAFNLLSDYNREVIKLYNVYHEDLKGLKVAKRAV 111------------------------------1111---1111-----iiii------- FIVKPDGTVAYKWVTDNPLNEPDYDEVVREANKIAGELV ----------------1111--------------3333- >A PUTATIVE TRANS-EDITING ; SWP:Q5SHN1; PDB:2CX5A; SLSPSARRVQGALETRGFGHLKVVELPASTRTAKEAAQAVGAEVGQIVKSLVFVGEKGAY -------------11113333-----1111------------3333--------1111-- LFLVSGKNRLDLGKATRLVGGPLRQATPEEVRELTGFAIGGVPPVGHNTPLPAYLDEDLL ----1111-----------------------------2222-----------------11 GYPEVWAAGGTPRALFRATPKELLALTGAQVADLKEG 11--------1111----------------------- 
>HYPOTHETICAL PROTEIN YHCO; SWP:P64618; PDB:2CX6A; NIYTFDFDEIESQEDFYRDFSQTFGLAKDKVRDLDSLWDVLNDVLPLPLEIEFVHLGEKT ------1111-3333--------------------------------------------- RRRFGALILLFDEAEEELEGHLRFNVRH -----------------iiii------- >STEROL CARRIER PROTEIN 2; SWP:Q5SL92; PDB:2CX7A; ELFTEAWAQAYCRKLNESEAYRKAASTWEGSLALAVRPDPKAGFPKGVAVVLDLWHGACR ----------------------1111-------------1111-----------iiii-- GAKAVEGEAEADFVIEADLATWQEVLEGRLEPLSALRGLLELKKGTIAALAPYAQAAQEL ------------------------1111--3333-----------33333333------- VKVAREVA ---1111- >LEUCYL/PHENYLALANYL-TRNA-; SWP:P0A8P1; PDB:2CXAA; RLVQLSRHSIAFPSPEGALREPNGLLALGGDLSPARLLAYQRGIFPWFSPGDPILWWSPD -------------3333----2222-------3333---1111-----2222-------- PRAVLWPESLHISRSKRFHKRSPYRVTNYAFGQVIEGCASDGTWITRGVVEAYHRLHELG -----1111-------3333-----------------11111111-----------1111 HAHSIEVWREDELVGGYGVAQGTLFCGESFSRENASKTALLVFCEEFIGHGGKLIDCQVL --------!!!!--------!!!!-----------------------1111--------- NDHTASLGACEIPRRDYLNYLNQRLGRLPNNFWVPRCLFSP ----1111--------------------1111--------- >NUSA; SWP:Q9YAU4; PDB:2CXCA; ITLEELRYISVFHSITGVTAYRCIVDEENNRLIFLVSEGEAGRAIGRGGRLIKLLREALG --------------------------1111------2222-----2222----------- KNIEVVEYSSDLERIVKNLFPGVKIESINVRERNGVKQVVIKVSEDDKGAAIGKGGKNVK --------------------------------iiii-------------3333iiii--- RARLVLSKLFGVEKVVIR ------------------ >Probable brix-domain ribo; SWP:Q9YC08; PDB:2CXHA; GYRILVTTSRRPSPRIRSFVKDLSATIPGAFRFTRGHYSEELAREAIIRGADRIVVVGER ------------3333------11112222-----------------------------i RGNPGIIRVYAVEGPERPDNIVSFIVKGVSLSRERRWGLPSLRGGEVLVARPLDSGVAVE iii---------------------------3333-------------------------- FADAFVIAFHARLKPPEAAGYVEAVIESLDARTVAVTFRYGGAPVGPLRLGKPAEVK ---------------------------------------%%%%-------------- >PHENYLALANYL-TRNA SYNTHET; SWP:O73984; PDB:2CXIA; PKFDVSKSDLERLIGRSFSIEEWEDLVLYAKCELDDVWEENGKVYFKLDSKDTNRPDLWS ------------------3333---3333----------------------11111111- 
AEGVARQIKWALGIEKGLPKYEVKKSNVTVYVDEKLKDIRPYGVYAIVEGLRLDEDSLSQ --------------------------------3333------------------------ IQLQEKIALTFGRRRREVAIGIFDFDKIKPPIYYKAAEKTEKFAPLGYKEETLEEILEKH ---------1111----------3333----------1111---2222------------ EKGREYGHLIKDKQFYPLLIDSEGNVLSPPIINSEFTGRVTTDTKNVFIDVTGWKLEKVL ---------1111-------1111----------------1111----------3333-- ALNVVTALAERGGKIRSVRVVYKDFEIETPDLTPKEFEVELDYIRKLSGLELNDGEIKEL --------1111---------1111----------------------------------- LEKYEVEISRGRAKLKYPAFRDDIHARDILEDVLIAYGY --------iiii-----1111---3333-------3333 >S100 CALCIUM-BINDING PROT; SWP:P97352; PDB:2CXJA; MAAETLTELEAAIETVVSTFFTFAGREGRKGSLNINEFKELATQQLPHLLKDVGSLDEKM ---------------------3333----------3333--------------------- KTLDVNQDSELRFSEYWRLIGELAKEVRKEKALGIRKK ---------------------3333------------- >CALMODULIN BINDING TRANSC; SWP:Q9Y6Y1; PDB:2CXKA; SSGVTDYSPEWSYPEGGVKVLITGPWQEASNNYSCLFDQISVPASLIQPGVLRCYCPAHD ------------3333--------------------iiii-------2222--------- TGLVTLQVAFNNQIISNSVVFEYKSG ---------%%%%------------- >GLUCOSE-6-PHOSPHATE ISOME; SWP:P06745; PDB:2CXNA; MAALTRNPQFQKLLEWHRANSANLKLRELFEADPERFNNFSLNLNTNHGHILVDYSKNLV -3333--------------3333---------11111111--------------1111-- SKEVMQMLVELAKSRGVEAARDNMFSGSKINYTEDRAVLHVALRNRSNTPIKVDGKDVMP -----------------------1111-----------3333--3333----iiii---- EVNRVLDKMKSFCQRVRSGDWKGYTGKSITDIINIGIGGSDLGPLMVTEALKPYSKGGPR ---------------3333---1111----------!!!!----------33332222-- VWFVSNIDGTHIAKTLASLSPETSLFIIASKTFTTQETITNAETAKEWFLEAAKDPSAVA ---------------11113333------3333---------------------333311 KHFVALSTNTAKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDHFEQLLSGA 11----------------3333----1111111111111111------------------ HWMDQHFLKTPLEKNAPVLLALLGIWYINCYGCETHALLPYDQYMHRFAAYFQQGDMESN ----------3333---------------------------3333--------------- GKYITKSGARVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGL ----1111------------------1111-3333--------------------%%%%- 
HHKILLANFLAQTEALMKGKLPEEARKELQAAGKSPEDLEKLLPHKVFEGNRPTNSIVFT -----------------------------1111--------3333--------------- KLTPFILGALIAMYEHKIFVQGIMWDINSFDQWGVELGKQLAKKIEPELEGSSAVTSHDS -------------------------------1111----------3333----------- STNGLISFIKQQRDTKL -----------1111-- >PROBABLE GTP-BINDING PROT; SWP:O57939; PDB:2CXXA; ATIIFAGRSNVGKSTLIYRLTGKKVRRGKRPGVTRKIIEIEWKNHKIIDPGFGFGLPKEV -------2222---------------------1111-----!!!!--------------- QERIKDEIVHFIEDNAKNIDVAVLVVDGKAAPEIIKRWEKRGEIPIDVEFYQFLRELDIP --------------3333--------3333--------1111------------1111-- TIVAVNKLDKIKNVQEVINFLAEKFEVPLSEIDKVFIPISAKFGDNIERLKNRIFEVIRE ------3333-----------------3333------------2222------------- R - >BAF250B SUBUNIT; SWP:Q5JRD2; PDB:2CXYA; GEKITKVYELGNEPERKLWVDRYLTFEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLA ------1111--1111----------1111--------!!!!------------------ QVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFECKIERGEEPPPEVF ------------1111------------------------------------- >PROBABLE ACETYLTRANSFERAS; SWP:Q5SJ05; PDB:2CY2A; VRIRRAGLEDLPGVARVLVDTWRATYRGVVPEAFLEGLSYEGQAERWAQRLKTPTWPGRL ------3333---------------2222-----3333--------------1111---- FVAESESGEVVGFAAFGPDRASGFPGYTAELWAIYVLPTWQRKGLGRALFHEGARLLQAE ----1111---------------2222---------1111-----------------111 GYGRLVWVLKENPKGRGFYEHLGGVLLGEREIELGGAKLWEVAYGFDLGGHKW 1-------1111-------1111----------!!!!---------------- >CYTOCHROME C3; SWP:P00136; PDB:2CY3; ADAPGDDYVISAPEGMKAKPKGDKPGALQKTVPFPHTKHATVECVQCHHTLEADGGAVKK ----1111----2222------------------333311113333-11111111----1 CTTSGCHDSLEFRDKANAKDIKLVENAFHTQCIDCHKALKKDKKPTGPTACGKCHTTN 111----------3333--33333333----------------------3333----- >epidermal growth factor r; SWP:Q8R5F8; PDB:2CY5A; ADVSQYHVNHLVTFCLGEEDGVHTVEDASRKLAVDSQGRVWAQELLRVSPSQVTLLDPVS ----------------1111--------------1111----------1111-------- KEELESYPLDAIVRCDAVPRGRSRSLLLLVCQEPERAQPDVHFFQGLLLGAELIREDIQG -------3333---------------------3333----------1111---------- ALQNYR ------ >CYSTEINE PROTEASE APG4B; SWP:Q9Y4P1; 
PDB:2CY7A; TLTYDTLRFAEFEDFPETSEPVWILGRKYSIFTEKDEILSDVASRLWFTYRKNFPAIGGT ----3333---------------------3333--------3333----------2222- GPTSDTGWGCMLRCGQMIFAQALVCRHLGRDWRWTQRKRQPDSYFSVLNAFIDRKDSYYS ----2222--------------------1111--------3333----1111-1111--- IHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWSSLAVHIAMDNTVVMEEIRRLCR ------3333---2222-------------3333-----------%%%%-3333------ TSVPCSPWRPLVLLIPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVG ------------------------3333-----11111111------2222-------!! EELIYLDPHTTQPAVEGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQ !!-------------------1111---------1111---------------------- QVKKLSLLPMFELVEQQPDVLNLSLDSSDVERL ---3333-------------------------- >D-PHENYLGLYCINE AMINOTRAN; SWP:NA; PDB:2CY8A; SILNDYKRKTEGSVFWAQRARSVMPDGAFDPHGLFISDAQGVHKTDVDGNVYLDFFGGHG ---------------------------------------!!!!--1111------2222- ALVLGHGHPRVNAAIAEALSHGVQYAASHPLEVRWAERIVAAFPSIRKLRFTGSGTETTL -1111-------------1111------3333----------3333-------------- LALRVARAFTGRRMILRFEGTTANTLLIRPDDIEGMREVFANHGSDIAAFIAEPVGSHFG --------------------3333----2222----------1111----------%%%% VTPVSDSFLREGAELARQYGALFILDEVISGFRVGNHGMQALLDVQPDLTCLAKASAGGL ----------------1111-----------------3333-----------!!!!iiii PGGILGGREDVMGVLSRGSFTGNPITAAAAIAAIDTILEDDVCAKINDLGQFAREAMNHL --------3333------------------------------------------------ FARKGLNWLAYGRFSGFHLMPGLPPNTTDTGSITRAEVARPDVKMIAAMRMALILEGVDI -1111-------!!!!-------1111--3333--------------------------- GGRGSVFLSAQHEREHVEHLVTTFDRVLDRLADENLLSWQGT --------11113333---------------1111------- >THIOESTERASE SUPERFAMILY ; SWP:Q9CQR4; PDB:2CY9A; KVFKVPGFDVLEKVTLVSAAPEKLICEKVEEQHTNKLGTLHGGLTATLVDSISTALCTPG -----------------------------3333------------3333----------- VSVDNITYSPAKIGEEIVITAHILKQGKTLAFASVDLTNKTTGKLIAQGRHTKHLG -------------------------------------------------------- >TYROSYL-TRNA SYNTHETASE; SWP:Q9YA64; PDB:2CYAA; VDVEERFNRIARNTVEIVTEEELKGLLASGARIKGYIGYEPSGVAHIGWLVWMYKVKDLV 
3333------2222---------------------------------------------1 EAGVDFSVLEATWHAYINDKLGGDMDLIRAAARIVRRVMEAAGVPVERVRFVDAEELASD 111------------------------------------1111-3333------------ KDYWGLVIRVAKRASLARVRRALAEEAEVDASKLIYPLMQVSDIFYMDLDIALGGMDQRK ---------3333-3333------3333--------------------------1111-- AHMLARDVAEKLGRKKPVAIHTPIISSLQGPVKMSKSKPETAVFVVDSDDDIRRKIRKAY ---------1111------------------------3333--1111------------- CPAKQVQGNPVLEIARYILFARDGFTLRVDVEYTSYEELERDYTDGRLHPLDLKNAVAES -22222222--------11112222----------------------------------- LIEVVRPIRGAVLGDPAMKRALEAIEGK -----------1111------------- >TYROSYL-TRNA SYNTHETASE; SWP:O29482; PDB:2CYBA; DITEKLRLITRNAEEVVTEEELRQLIETKEKPRAYVGYEPSGEIHLGHMMTVQKLMDLQE ---------2222-------------------------------3333----------11 AGFEIIVLLADIHAYLNEKGTFEEIAEVADYNKKVFIALGLDESRAKFVLGSEYQLSRDY 11------------1111------------------1111---------33331111--- VLDVLKMARITTLNRARRSMDEVSRRKEDPMVSQMIYPLMQALDIAHLGVDLAVGGIDQR -------1111------1111---------3333-----------1111------1111- KIHMLARENLPRLGYSSPVCLHTPILVGLDGQKMSSSKGNYISVRDPPEEVERKIRKAYC --------3333---------------1111-----------1111-------------- PAGVVEENPILDIAKYHILPRFGKIVVERDAKFGGDVEYASFEELAEDFKSGQLHPLDLK 22222222---------------------3333--------------------------- IAVAKYLNMLLEDARKRLG ------------------- >TYROSYL-TRNA SYNTHETASE; SWP:O58739; PDB:2CYCA; MDIEERINLVLKKPTEEVLTVENLRHLFEIGAPLQHYIGFEISGYIHLGTGLMAGAKIAD ----------------------------------------------3333---------- FQKAGIKTRVFLADWHSWINDKLGGDLEVIQEVALKYFKVGMEKSIEVMGGDPKKVEFVL -1111----------------%%%%---------------------1111-3333----3 ASEILEKGDYWQTVIDISKNVTLSRVMRSITIMGRQMGEAIDFAKLIYPMMQVADIFYQG 333-------------3333----------1111-------3333-----------1111 VTIAHAGMDQRKAHVIAIEVAQKLRYHPIVHEGEKLKPVAVHHHLLLGLQEPPKWPIESE ------1111---------1111-------iiii-------------------------- EEFKEIKAQMKMSKSKPYSAVFIHDSPEEIRQKLRKAFCPAREVRYNPVLDWVEYIIFRE ----------3333-3333--1111--------------2222----------------- 
EPTEFTVHRPAKFGGDVTYTTFEELKRDFAEGKLHPLDLKNAVAEYLINLLEPIRRYFEK ---------3333---------------1111--3333---------------------- HPEPLELMRSV -------1111 >CONSERVED HYPOTHETICAL PR; SWP:Q5SH84; PDB:2CYEA; EGFPVRVRVDVRFRDLDPLGHVNNAVFLSYELARIRYFQRIDWLEEGHFVVAREVDYLRP -----------3333-1111--3333-----------11113333iiii----------- ILLGDEVFVGVRTVGLGRSSLREHLVTANGESAAKGLGVLVWLEGGRPAPLPEAIRERIR -2222----------------------iiii------------iiii------------- ALEGRP ------ >BETA-1, 3-GLUCANANSE; SWP:O22317; PDB:2CYGA; IGVCYGMLGNNLPPPSEVVSLYKSNNIARMRLYDPNQAALQALRNSNIQVLLDVPRSDVQ ----------------------1111----------------2222--------1111-- SLASNPSAAGDWIRRNVVAYWPSVSFRYIAVGNELIPGSDLAQYILPAMRNIYNALSSAG ----1111--------3333------------------1111--------------1111 LQNQIKVSTAVDTGVLGTSYPPSAGAFSSAAQAYLSPIVQFLASNGAPLLVNVYPYFSYT --------------------3333----3333---------------------3333-11 GNPGQISLPYALFTASGVVVQDGRFSYQNLFDAIVDAVFAALERVGGANVAVVVSESGWP 11-------1111--------!!!!---------------------1111---------- SAGGGAEASTSNAQTYNQNLIRHVGGGTPRRPGKEIEAYIFEMFNENQKAGGIEQNFGLF ----1111--------------3333-3333--------------1111--3333----- YPNKQPVYQISF 1111-------- >HYPOTHETICAL PROTEIN PH15; SWP:O59174; PDB:2CYJA; MKIEEVRFGLVKIDGKEFDHDIVIYPSGRIERRMKEISKKKHGTSHKLDPEELEKYLVED ------2222--iiii--------1111--------------------3333-1111--- FDVLLVGTGIYGMLSLLPESKKLVEDKEVIEKPTKEALKLLEELWGKKRILAIIHVTC --------1111-----------1111----------------2222----------- >UBIQUITIN-CONJUGATING ENZ; SWP:UB2G2_HUMAN; PDB:2CYXA; MAALKRLMAEYKQLTLNPPEGIVAGPMNEENFFEWEALIMGPEDTCFEFGVFPAILSFPL ---------------------------3333-------------1111----------11 DYPLSPPKMRFTCEMFHPNIYPDGRVCISILHAPGDDPMGYESSAERWSPVQSVEKILLS 11--------------11111111---3333-2222-----3333---1111-------- VVSMLAEPNDESGANVDASKMWRDDREQFYKIAKQIVQKSLGL ------------------------------------------- >Putative HTH-type transcr; SWP:O59188; PDB:2CYYA; RVPLDEIDKKIIKILQNDGKAPLREISKITGLAESTIHERIRKLRESGVIKKFTAIIDPE -----------------1111-----------3333--------3333---------333 
ALGYSLAFILVKVKAGKYSEVASNLAKYPEIVEVYETTGDYDVVKIRTKNSEELNNFLDL 3------------2222----------3333----------------------------- IGSIPGVEGTHTIVLKTHKETTELPIK ---2222-------------------- >NITRILE HYDRATASE SUBUNIT; SWP:P13448; PDB:2CZ1A; ENAAPAQAPVSDRAWALFRALDGKGLVPDGYVEGWKKTFEEDFSPRRGAELVARAWTDPE --------------------3333---2222----------------------------- FRQLLLTDGTAAVAQYGYLGPQGEYIVAVEDTPTLKNVIVCSLSTAWPILGLPPTWYKSF ------------3333-----------------------------3333----3333--- EYRARVVREPRKVLSEMGTEIASDIEIRVYDTTAETRYMVLPQRPAGTEGWSQEQLQEIV --------------1111---1111----------------------2222----3333- TKDCLIGVAIPQVP 3333---------- >Nitrile hydratase subunit; SWP:P13449; PDB:2CZ1B; MDGVHDLAGVQGFGKVPHTVNADIGPTFHAEWEHLPYSLMFAGVAELGAFSVDEVRYVVE --33332222--------2222-------1111-------------------------11 RMEPRHYMMTPYYERYVIGVATLMVEKGILTQDELESLAGGPFPLSRPSESEGRPAPVET 11----1111-3333--------------------------------------------- TTFEVGQRVRVRDEYVPGHIRMPAYCRGRVGTISHRTTEKWPFPDAIGHGRNDAGEEPTY ---2222---------------3333-------------------3333----------- HVKFAAEELFGSDTDGGSVVVDLFEGYLEPA ----3333-!!!!----------1111---- >MALEYLACETOACETATE ISOMER; SWP:Q9WVL0; PDB:2CZ2A; GKPILYSYFRSSCSWRVRIALALKGIDYEIVPINLIKDGGQQFTEEFQTLNPKQVPALKI -------1111-------------------------iiii11113333-----------i DGITIVQSLAIEYLEETRPIPRLLPQDPQKRAIVRISDLIASGIQPLQNLSVLKQVGQEN iii---3333---------------------------------3333----------111 QQWAQKVITSGFNALEKILQSTAGKYCVGDEVSADVCLVPQVANAERFKVDLSPYPTISH 1----------------3333------------3333--------1111--1111----- INKELLALEVFQVSHPRRQPDTPAELR -------33331111---11113333- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SKX7; PDB:2CZ4A; DLVPLKLVTIVAESLLEKRLVEEVKRLGAKGYTITPARGEGSRGIRSVDWEGQNIRLETI ------------3333--------1111---------------1111-3333-------- VSEEVALRILQRLQEEYFPHYAVIAYVENVWVVRGEKYV ----------------3333-------------3333-- >TT0972 PROTEIN; SWP:Q72IR6; PDB:2CZ8A; GKVYKKVELVGTSEEGLEAAIQAALARARKTLRHLDWFEVKEIRGTIGEAGVKEYQVVLE -----------------------------------------------1111--------- 
VGFRLEE ------- >PROBABLE GALACTOKINASE; SWP:O58107; PDB:2CZ9A; MIKVKSPGRVNLIGEHTDYTYGYVMPMAINLYTKIEAEKHGEVILYSEHFGEERKFSLND ----------------1111---------------------------1111-----1111 LRKENSWIDYVKGIFWVLKESDYEVGGIKGRVSGNLPLGAGLSSSASFEVGILETLDKLY ------------------1111--------------2222-----------------111 NLKLDSLSKVLLAKKAENEFVGVPCGILDQFAVVFGREGNVIFLDTHTLDYEYIPFPKDV 1-----------------------------------2222-------------------- SILVFYTGVRSSEYAERKHIAEESLKILGKGSSKEVREGELSKLPPLHRKFFGYIVRENA -------------------------------3333-33331111---------------- RVLEVRDALKEGNVEEVGKILTTAHWDLAKNYEVSCKELDFFVERALKLGAYGARLTGAG --------1111----------------------------------1111---------- FGGSAIALVDKEDAETIGEEILREYLKRFPWKARHFIVEPSDGVGI ---------3333--------------------------------- >GLYCERALDEHYDE-3-PHOSPHAT; SWP:O59494; PDB:2CZCA; MKVKVGVNGYGTIGKRVAYAVTKQDDMELIGITKTKPDFEAYRAKELGIPVYAASEEFIP ----------3333---------1111-----------------1111------3333-- RFEKEGFEVAGTLNDLLEKVDIIVDATPGGIGAKNKPLYEKAGVKAIFQGGEKADVAEVS -----------33331111--------2222-----------------11113333---- FVAQANYEAALGKNYVRVVSCNTTGLVRTLSAIREYADYVYAVMIRRAADPNDTKRGPIN -----33332222-------------------3333-------------1111------- AIKPTVEVPSHHGPDVQTVIPINIETMAFVVPTTLMHVHSVMVELKKPLTKDDVIDIFEN ---------33333333------------------------------------------- TTRVLLFEKEKGFDSTAQIIEFARDLHREWNNLYEIAVWKESINIKGNRLFYIQAVHQES -------3333----------------2222-------3333---!!!!-------1111 DVIPENIDAIRAMFELADKWDSIKKTNKSLGILK ---------------------------1111--- >OROTIDINE 5'-PHOSPHATE DE; SWP:O58462; PDB:2CZDA; MIVLALDVYEGERAIKIAKSVKDYISMIKVNWPLILGSGVDIIRRLKEETGVEIIADLKL --------------------1111------------------------------------ ADIPNTNRLIARKVFGAGADYVIVHTFVGRDSVMAVKELGEIIMVVEMSHPGALEFINPL --------------1111-------1111------3333----------3333---3333 TDRFIEVANEIEPFGVIAPGTRPERIGYIRDRLKEGIKILAPGIGAQGGKAKDAVKAGAD ----------------------------------------------2222----1111-- YIIVGRAIYNAPNPREAAKAIYDEIR ----3333------------------ >HYPOTHETICAL PROTEIN TTHA; 
SWP:Q5SI12; PDB:2CZLA; ALRLGFSPPNDTFIFYALVHGRVESPVPLEPVLEDVETLNRWALEGRLPLTKLSYAAYAQ -----------------1111-------------------3333---------3333--- VRDRYVALRSGGALGRGVGPLVVARGPLQALEGLRVAVPGRHTTAYFLLSLYAQGFVPVE 1111--------------------------2222-----1111----------------- VRYDRILPMVAQGEVEAGLIIHESRFTYPRYGLVQVVDLGAWWEERTGLPLPLGAILARR -1111---------------!!!!----1111--------------------------33 DLGEGLIRALDEAVRRSVAYALAHPEEALDYMRAHAQELSDEVIWAHVHTYVNAFSLDVG 33---------------------------------1111-------------3333---- EEGERAVARLFAEAEARGLAAPSPRPLFV --------------1111----------- >BUD EMERGENCE PROTEIN 1; SWP:P29366; PDB:2CZOA; KSAKLVDGELLVKASVESFGLEDEKYWFLVCCELSNGKTRQLKRYYQDFYDLQVQLLDAF ---------------------------------1111----------------------- PAEAGKLRDAGGQWSKRIMPYIPGPVPYVTNSITKKRKEDLNIYVADLVNLPDYISRSEM 3333------------------------------------------3333---3333-33 VHSLFVVLNN 33-------- >CUTINASE-LIKE PROTEIN; SWP:Q874E9; PDB:2CZQA; ATSSACPQYVLINTRGTGEPQGQSAGFRTMNSQITAALSGGTIYNTVYTADFSQNSAAGT --3333---------2222----3333-----------------------3333-3333- ADIIRRINSGLAANPNVCYILQGYSQGAAATVVALQQLGTSGAAFNAVKGVFLIGNPDHK -------------1111--------------------------------------11112 SGLTCNVDSNGGTTTRNVNGLSVAYQGSVPSGWVSKTLDVCAYGDGVCDTAHGFGINAQH 222----1111-1111---1111------1111--------2222-----------3333 LSYPSDQGVQTMGYKFAVNKLGGSA -3333-------------1111--- >TBP-INTERACTING PROTEIN; SWP:NA; PDB:2CZRA; MYLSPGTKKVYTQVRYLDDYHWEIEGSTITGIHKKSNVKVVIDVAKNREEADSLAGKDVN -11----------1111------------------------------------------- GIHIVAIPDNGVFYIKNGSFVLTYRYLKATLADINDHIVWSGFKVVEDNGKLVQEDVYEY ---------------iiii---1111--------1111---------iiii--------- LGAALVNHIKNNALAGQDYIFWQFYKCEECGKYVDIENLEAHLREHGIKLHEKSEEHYEV -------------2222-----------------3333-----1111-3333-------- FELNFREGKVFDKFGGEVPMDKFSSEAREFIKEVLS ------------------1111-------------- >CYTOCHROME C, PUTATIVE; SWP:Q748S4; PDB:2CZSA; VRTKKVPLDTNHKRFYDAFAQGAGKLDLDRQCVECHHEKPGGIPFPKNHPVKPADGPMRC 
--------3333------1111--------3333-----------2222---------11 LFCHKFKLEH 11-------- >PROSTAGLANDIN-H2 D-ISOMER; SWP:O09114; PDB:2CZTA; FQQDKFLGRWYSAGLASNSSWFREKKAVLYMAKTVVAPSTEGGLNLTSTFLRKNCETKIM -3333-------------3333---1111---------1111------------------ VLQPAGAPGHYTYSSPHSGSIHSVSVVEANYDEYALLFSRGTKGPGQDFRMATLYSRTQT ------2222-------------------1111----------2222------------- LKDELKEKFTTFSKAQGLTEEDIVFLPQPDKCIQE -3333-------3333--3333------------- >Ribonuclease P protein co; SWP:O59150; PDB:2CZVC; RKLKTLPPTLRDKNRYIAFEIISDGDFTKDEVKELIWKSSLEVLGETGTAIVKPWLIKFD ------3333-------------------------------------------------- PNTKTGIVRSDREYVEYLRFALMLVSEFNGKRLIIRTLGVSGTIKRLKRKFLAKYGWK 1111------1111-------1111--iiii--------------------3333--- >50S RIBOSOMAL PROTEIN L7A; SWP:P62009; PDB:2CZWA; YVKFEVPKELAEKALQAVEIARDTGKIRKGTNETTKAVERGQAKLVIIAEDVDPEEIVAH -----------------------------------------------------3333--- LPPLCEEKEIPYIYVPSKKELGAAAGIEVAAASVAIIEPGKARDLVEEIAMKVRELMK --------------------------------------!!!!------------1111 >V-TYPE ATP SYNTHASE SUBUN; SWP:P74903; PDB:2D00A; MVPVRMAVIADPETAQGFRLAGLEGYGASSAEEAQSLLETLVERGGYALVAVDEALLPDP ------------------1111------------------------------3333---- ERAVERLMRGRDLPVLLPIAGLKEAFQGHDVEGYMRELVRKTIGFDIKL 33333333------------3333------------------------- >NEOCULIN ACIDIC SUBUNIT; SWP:Q6F495; PDB:2D04A; DSVLLSGQTLYAGHSLTSGSYTLTIQNNCNLVKYQHGRQIWASDTDGQGSQCRLTLRSDG ----2222--2222---!!!!----1111---------------------------1111 NLIIYDDNNMVVWGSDCWGNNGTYALVLQQDGLFVIYGPVLWPLGLNGCRS -----1111-------------------1111------------------- >Curculin [Precursor]; SWP:P19667; PDB:2D04B; DNVLLSGQTLHADHSLQAGAYTLTIQNKCNLVKYQNGRQIWASNTDRRGSGCRLTLLSDG ----------2222-----------1111-----iiii------2222--------1111 NLVIYDHNNNDVWGSACWGDNGKYALVLQKDGRFVIYGPVLWSLGPNGCRR -----1111-------------------1111------------------- >RIBONUCLEASE HIII; SWP:Q6L6Q4; PDB:2D0BA; SNYVIQADQQLLDALRAHYEGALSDRLPAGALFAVKRPDVVITAYRSGKVLFQGKAAEQE 
------------------2222-----2222-----2222----3333------------ AAKWISGASASNETADHQPSALAAHQLGSLSAIGSDEVGTGDYFGPIVVAAAYVDRPHIA -----------------3333-----1111--------1111------------3333-- KIAALGVKDSKQLNDEAIKRIAPAIMETVPHAVTVLDNPQYNRWQRSGMPQTKMKALLHN --------3333--------------------------------1111------------ RTLVKLVDAIAPAEPEAIIIDEFLKRDSYFRYLSDEDRIIRERVHCLPKAESVHVSVAAA ------------------------3333--1111--------------3333-------- SIIARYVFLEEMEQLSRAVGLLLPKGAGAIVDEAAARIIRARGEEMLETCAKLHFANTKK ---------------------------3333-----------3333-----3333----- ALAIAKRR -------- >DEHYDROGENASE; SWP:O58256; PDB:2D0IA; MRPKVGVLLKMKREALEELKKYADVEIILYPSGEELKGVIGRFDGIIVSPTTKITREVLE ------------------3333----------------3333------3333--333311 NAERLKVISCHSAGYDNIDLEEATKRGIYVTKVSGLLSEAVAEFTVGLIINLMRKIHYAD 11------------1111-----1111------!!!!----------------------- KFIRRGEWESHAKIWTGFKRIESLYGKKVGILGMGAIGKAIARRLIPFGVKLYYWSRHRK --1111-----------------2222-----------------3333-----------3 VNVEKELKARYMDIDELLEKSDIVILALPLTRDTYHIINEERVKKLEGKYLVNIGRGALV 333-1111----------------------1111----33333333---------1111- DEKAVTEAIKQGKLKGYATDVFEKEPVREHELFKYEWETVLTPHYAGLALEAQEDVGFRA -----------------------------3333-----------1111------------ VENLLKVLRGEVPEDLVNKEVLEVRPIENVKML ------1111--1111-3333----3333---- >Galactosylgalactosylxylos; SWP:Q9NPZ5; PDB:2D0JA; QLPTIYAITPTYSRPVQKAELTRLANTFRQVAQLHWILVEDAAARSELVSRFLARAGLPS -------------1111---------3333----------------------1111---- THLHVPTPRRGLPRATEQRNAGLAWLRQRHQHQRAQPGVLFFADDDNTYSLELFQEMRTT -------------------------------------------1111--3333--3333- RKVSVWPVGLVGGRRYERPLVENGKVVGWYTGWRADRPFAIDMAGFAVSLQVILSNPKAV ----------%%%%-------iiii--------1111----1111----------1111- FKRRGSQPGMQESDFLKQITTVEELEPKANNCTKVLVWHTRTEKVNLANEPKYHLDTVKI --------------3333--3333-----%%%%-------------1111---------- EV -- >DIHYDROFOLATE REDUCTASE; SWP:P0ABQ4; PDB:2D0KA; AISLIAALAVDRVIGNENALPWNLPADLAWFKRNTLNKPVIYGRHTWESIGRPLPGRKNI --------2222-----------3333-------2222---------------2222--- 
ILSSQPGTDDRVTWVKSVDEAIAAAGDVPEIFVIGGGRVYEQFLPKAQKLYLTHIDAEVE ----------------------3333----------------3333-------------- GDTHFPDYEPDDWESVFSEFHDADAQNSHSYSFEILERR --------3333--------------------------- >diol dehydratase-reactiva; SWP:O68195; PDB:2D0OA; MRYIAGIDIGNSSTEVALATLDEAGALTITHSALAETTGIKGTLRNVFGIQEALALVARG ---------------------3333--------------2222----------------- AGIAVSDISLIRINEATPVIGDVAMETITETIITESTMIGHNPKTPGGAGLGTGITITPQ ---1111--------------------------%%%%--------------------333 ELLTRPADAPYILVVSSAFDFADIASVINASLRAGYQITGVILQRDDGVLVSNRLEKPLP 3--------------3333--------------------------------1111----- IVDEVLYIDRIPLGMLAAIEVAVPGKVIETLSNPYGIATVFNLSPEETKNIVPMARALIG ------1111------------2222---1111----------------------1111- NRSAVVVKTPSGDVKARAIPAGNLELLAQGRSVRVDVAAGAEAIMKAVDGCGRLDNVTGE ---------------------------iiii----1111--------1111--------- SGTNIGGMLEHVRQTMAELTNKPSSEIFIQDLLAVDTSVPVSVTGGLAGEFSLEQAVGIA ----------------------3333----------------2222-------------- SMVKSDRLQMAMIAREIEQKLNIDVQIGGAEAEAAILGALTTPGTTRPLAILDLGAGSTD --------3333-----------------------------2222--------------- ASIINPKGDIIATHLAGAGDMVTMIIARELGLEDRYLAEEIKKYPLAKVESLFHLRHEDG ----3333------------------------------------------1111--1111 SVQFFSTPLPPAVFARVCVVKADELVPLPGDLALEKVRAIRRSAKERVFVTNALRALRQV ---------3333---------------33333333------------------------ SPTGNIRDIPFVVLVGGSSLDFEVPQLVTDALAHYRLVAGRGNIRGSEGPRNAVATGLIL 11113333--------1111---------1111---------2222-------------- SWHKEF ------ >Diol dehydratase-reactiva; SWP:O68196; PDB:2D0OB; HSAPAIAIAVIDGCDGLWREVLLGIEEEGIPFRLQHHPAGEVVDSAWQAARSSPLLVGIA ----------%%%%1111-------1111-----------------------1111---- CDRHMLVVHYKNLPASAPLFTLMHHQDSQAHRNTGNNAARLVKGIPFR ---------22221111-----1111--------------1111---- >CYTOCHROME C; SWP:Q76IQ6; PDB:2D0SA; DEALAKAKGCMACHAIDKKLVGPSYKDVAKKYTEADVPKLVEKVKKGGAGVWGPVPMPPH --------1111--------------------3333-------------1111------1 PQVAEADIEKIVRWVLTLK 111-----------1111- >INDOLEAMINE 2,3-DIOXYGENA; SWP:P14902; 
PDB:2D0TA; SKEYHIDEEVGFALPNPQENLPDFYNDWMFIAKHLPDLIESGQLRERVEKLNMLSIDHLT ---------!!!!--------3333-------------1111-----1111----1111- DHKSQRLARLVLGCITMAYVWGKGHGDVRKVLPRNIAVPYCQLSKKLELPPILVYADCVL --------------------!!!!--------3333-----------------3333-11 ANWKKKDPNKPLTYENMDVLFSFRDGDCSKGFFLVSLLVEIAAASAIKVIPTVFKAMQMQ 11----1111--3333-------22223333-----------33331111---------- ERDTLLKALLEIASCLEKALQVFHQIHDHVNPKAFFSVLRIYLSGWKGNPQLSDGLVYEG -----------------------3333-----------3333------1111-----222 FWEDPKEFAGGSAGQSSVFQCFDVLLGIQQTAGGGHAAQFLQDMRRYMPPAHRNFLCSLE 2----------3333-------------1111----------3333------------11 SNPSVREFVLSKGDAGLREAYDACVKALVSLRSYHLQIVTKYILIPASQGGTDLMNFLKT 11---------------------------------------------------------- VRSTTEKSLLKEG -----1111---- >METHANOL DEHYDROGENASE LA; SWP:Q4AE26; PDB:2D0VA; NDKLIELSNSNENWVMPGKNYDSNNYSTSTQINVDNVKQLKHAWSFSTGELHGHEGAPLV ---------1111--22223333---------33331111-------------------- IGDVMYVHSSFPNKTFALDLNDPGHILWQHSPKQDPAARSVACCDLVNRGLAYWPGDDKT !!!!--------------1111--------------3333-------------------- PSLIIKTQLDGHLVALNAKTGEEFWKVENGDIKVGQTLTQAPYVVHDLAIVGSSGAELGV -------1111-------------------1111--------------------3333-- RGHVTAYNVRTGEQAWRYYATGPDAEIGLADDFNSANPHYGQKGLGTATWEGDAWKIGGG ----------------------3333---11111111------1111---!!!!------ TNWGWYAYDPAANLIYYGSGNPAPWNETMRPGDNKWTMTITARDADTGKMKFGYQKTPHD ---------1111------------3333----2222----------------------- EWDFAGVNVIMLSEQTDKTGKKRKLLTHPDRNGIVYTLDRENGDLISADKLDDTVNVFKT ----------------1111---------1111------------------3333----- VDLKTGLPVRDPEYGTRMDHKGTDICPSAMGYHNQGHDSYDPQKQLFFMGINHICMDWEP --1111----3333-------------3333----------1111--------------- FMLPYRAGQFFVGATLWMYPGPKGDRQNYLGLGQIKAYNAITNEYKWQHMERFSVWGGTL -----2222-----------1111------------------------------------ ATAGNLVFYGTLDGFLKARNSDTGELVWKHKLPSGVIGYPMTYEHKGVQYIAVMSGVGGW ----------1111------------------------------iiii----------33 PGVGLVFDLQDPTAGLGAVGAFKNLQNYTQMGGSLEVFSLDGKNPYDDVNVGEYEKG 
33--1111--1111iiii3333-1111----------------11111111------ >Methanol dehydrogenase sm; SWP:Q4AE23; PDB:2D0VB; YDGTHCKAPGNCWEPKPGFPEKIAGSKYDPKHDPKELNKQVESRKGEEERNANRAEHFKK -------2222----2222---2222------3333------------------------ TGKWVYDVKK ---------- >CYTOCHROME CL; SWP:Q4AE24; PDB:2D0WA; AQEVFRNTVTGEALDVEGQAPKEGRDTPAVKQFMQTGVDPYVEVAGCLPKGEEIYLESCS --------------3333--------------------1111-3333------------- GCHGHIGEGKVGPGLNDSYWTYPKNTTDKGLFETIFGGANGMMGPHGQDLELDNMLKLIA ---1111---------------1111-------------!!!!--3333----------- WIRHIQKDDVADADWLSDEQKKNFKPFDIKAWEATGKAAAEKAQCKIS ---------1111--------------------------1111----- >HYPOTHETICAL PROTEIN PH12; SWP:O58996; PDB:2D13A; VGLADVAVLYSGGKDSNYALYWALKSGLRVRYLVSMVSENNVELTSLQARALGIPIIKGF -----------------------------------------1111--------------- TEKEKEVEDLKNVLEGLKVDGIVAGALASRYQKERIENVARELGLKVYTPAWEKDPYQYM -------------1111-----------------------1111----1111-------- LEIIKLGFKVVFVAVSAYGLNESWLGRELNYKNLEELKKLSEKYGIHIAGEGGEFETFVL ---3333-------------1111----------------------1111---------- DMPFFKAKIVIDDAEKFWDGLSGKFIIKRAHLEWK -1111------------------------------ >HYPOTHETICAL PROTEIN PH19; SWP:O59581; PDB:2D16A; MVRIEVIDIEKPEGVEVIIGQGNFSIFTVDDLARALLTAVPGIKFGIAMNEAKPQLTRYT -----------2222--------3333--------11112222--------1111----- GNDPELEALAAKNAVKIGAGHVFVILMKNAYPINVLNTIKNHPAVAMIYGASENPFQVIV ------------------2222--------3333-------1111--------------- AETELGRAVIGVVDGKAANKIETDEQKKERRELVEKIGYKID --3333----------------------------3333---- >ISOCITRATE DEHYDROGENASE; SWP:P33197; PDB:2D1CA; PLITTETGKKMHVLEDGRKLITVIPGDGIGPECVEATLKVLEAAKAPLAYEVREAGASVF ----1111-----1111---------!!!!-------------------------33331 RRGIASGVPQETIESIRKTRVVLKGPLETPVGYGEKSANVTLRKLFETYANVRPVREFPN 1113333-3333---------------------------------------------222 VPTPYAGRGIDLVVVRENVEDLYAGIEHMQTPSVAQTLKLISWKGSEKIVRFAFELARAE 2-1111------------------------1111-------------------------- GRKKVHCATKSNIMKLAEGTLKRAFEQVAQEYPDIEAVHIIVDNAAHQLVKRPEQFEVIV 
---------3333------------------1111----------------1111----- TTNMNGDILSDLTSGLIGGLGFAPSANIGNEVAIFEAVHGSAPKYAGKNVINPTAVLLSA ----------------------------1111---------1111--------------- VMMLRYLEEFATADLIENALLYTLEEGRVLTGDVVGYDRGAKTTEYTEAIIQNLGKTPRK ------------------------------3333----------------1111------ TQVRGYKPFRLPQVDGAIAPIVPRSRRVVGVDVFVETNLLPEALGKALEDLAAGTPFRLK ------------3333-----------------------------------2222----- MISNRGTQVYPPTGGLTDLVDHYRCRFLYTGEGEAKDPEILDLVSRVASRFRWMHLEKLQ ---iiii----------------------------------------------------- EFDGEPGFTKAQGED -iiii-----2222- >PHYCOCYANOBILIN:FERREDOXI; SWP:Q55891; PDB:2D1EA; LSLTNSSLMPTLNPMIQQLALAIAASWQSLPLKPYQLPEDLGYVEGRLEGEKLVIENRCY --1111-3333---------------1111------2222-------------------- QTPQFRKMHLELAKVGKGLDILHCVMFPEPLYGLPLFGCDIVAGPGGVSAAIADLSPTQS -1111-----------------------3333-----------1111-----------11 DRQLPAAYQKSLAELGQPEFEQQRELPPWGEIFSEYCLFIRPSNVTEEERFVQRVVDFLQ 11----------3333----------3333---1111----------------------- IHCHQSIVAEPLSEAQTLEHRQGQIHYCQQQQKNDKTRRVLEKAFGEAWAERYMSQVLFD ---3333----------------------------------------------------- VIQ --- >THREONINE SYNTHASE; SWP:P66902; PDB:2D1FA; QPWPGVIAAYRDRLPVGDDWTPVTLLEGGTPLIAATNLSKQTGCTIHLKVEGLNPTGSFK ----3333-3333------------------------------------33331111-33 DRGMTMAVTDALAHGQRAVLCASTGNTSASAAAYAARAGITCAVLIPQGKIAMGKLAQAV 33---------1111---------3333-----------------------3333----1 MHGAKIIQIDGNFDDCLELARKMAADFPTISLVNSVNPVRIEGQKTAAFEIVDVLGTAPD 111------------------------------1111----------------------- VHALPVGNAGNITAYWKGYTEYHQLGLIDKLPRMLGTQAAGAAPLVLGEPVSHPETIATA ----------------------1111------------11113333-----------333 IRIGSPASWTSAVEAQQQSKGRFLAASDEEILAAYHLVARVEGVFVEPASAASIAGLLKA 3----------------------------------------------3333--------- IDDGWVARGSTVVCTVTGNGLKDPDTALKDMPSVSPVPVDPVAVVEKLG ------2222--------3333-----------------33331111-- >ACID PHOSPHATASE; SWP:Q2A5P7; PDB:2D1GA; NSKPNDYGTLQKLFNNANTLKTTTPIKHVVIIFQENNSFDRYFGMYPNAKNPEGEPKFVA 
---------------3333-----------------------1111-----2222----- KENTPNVNGLTKQLLENNPNTKNPYRLDRNFQPCSQNHEYHQEISSFNGGLMNKFVEHGG 2222------3333--------------------------------%%%%----3333-- HDNDTYKQNCDGQVMGYYDGNTVTALWNYAQNFALNDNTFGTTFGPSTPGALNLVAGANG ---------2222-----1111-------------------------------------- PAMSPSGNLENIENNYIIDDPNPYYDDCSYGTSKSGDTNTAVAKITDGYNIGHYLTQKGI ---1111!!!!-%%%%---------11113333---1111---------3333------- TWGWFQGGFKPTSYSGKTAICDAMSTNKFGVKSRDYIPHHEPFNYWKETSNPHHLAPSDD --------------!!!!--------1111------1111-----3333-1111----33 KYIGSNDQANHQYDISEFWKALDQNNMPAVSYLKAPGYQDGHGGYSNPLDEQEWLVNTIN 33----3333---3333----1111----------3333--------------------- RIQQSKDWDSTAIIIIYDDSDGDYDHVYSPKSQFSDIKGRQGYGPRLPMLVISPYAKANY ----1111-----------------------1111-2222------------1111---- VDHSLLNQASVLKFIEYNWGIGSVSKYSNDKYSNNILNMFDFNKEQKTLKLILDPKTGLV ------1111------1111----11113333---1111--------------------- M - >109aa long hypothetical t; SWP:Q96ZE4; PDB:2D1HA; KEKLESKKDEIRCCYKITDTDVAVLLKVEIEKPITSEELADIFKLSKTTVENSLKKLIEL --3333---------------------3333--------------------------111 GLVVRTKTPKYYYSISSNILEKIRNDLLNCAKRELAAT 1----------------3333------------1111- >METASTASIS SUPPRESSOR PRO; SWP:MTSS1_MOUSE; PDB:2D1LA; HEVIEKECSALGGLFQTIISDKGSYPVWEDFINKAGKLQSQLRTTVVAAAAFLDAFQKVA ------------------------------------------------------------ DATNTRGGTREIGSALTRCRHRSIEAKLRQFSSALIDCLINPLQEQEEWKKVANQLDKDH -1111!!!!--------------------------------------------------- AKEYKKARQEIKKKSSDTLKLQKKAKKVDAQGRGDIQPQLDSALQDVNDKYLLLEETEKQ ---------------------------1111----------------------------- AVRKALIEERGRFCTFISLRPVIEEEISLGEITHLQTISEDLKSLTDPHKLPSSSEQ ----------------------------3333----------11111111-3333-- >HYPOTHETICAL UPF0163 PROT; SWP:P45532; PDB:2D1PA; SMRFAIVVTGPAYGTQQASSAFQFAQALIADGHELSSVFFYREGVYNANQLTSPASDEFD ----------------------------1111----------------1111--1111-- LVRAWQQLNAQHGVALNICVAAALRRGVVDETEAGRLGLASSNLQQGFTLSGLGALAEAS 
-----------------------1111------------------------3333----- LTCDRVVQF --------- >Protein tusC; SWP:P45531; PDB:2D1PB; MKRIAFVFSTAPHGTAAGREGLDALLATSALTDDLAVFFIADGVFQLLPGQKPDAVLARD ---------------------------1111-------------1111-----1111--- YIATFKLLGLYDIEQCWVCAASLRERGLDPQTPFVVEATPLEADALRRELANYDVILRF 3333-------------------1111-1111-----------------1111------ >Protein tusB; SWP:P45530; PDB:2D1PC; MLHTLHRSPWLTDFAALLRLLSEGDELLLLQDGVTAAVDGNRYLESLRNAPIKVYALNED -------3333------11112222----!!!!----2222-----1111---------- LIARGLTGQISNDIILIDYTDFVRLTVKHPSQMAW -11111111-3333--------------------- >LUCIFERIN 4-MONOOXYGENASE; SWP:P13129; PDB:2D1SA; DENIVVGPKPFYPIEEGSAGTQLRKYMERYAKLGAIAFTNAVTGVDYSYAEYLEKSCLGK 3333-------------------------------------------------------- ALQNYGLVVDGRIALCSENCEEFFIPVIAGLFIGVGVAPTNEIYTLRELVHSLGISKPTI -------2222--------1111-----------------11113333------------ VFSSKKGLDKVITVQKTVTTIKTIVILDSKVDYRGYQCLDTFIKRNTPPGFQASSFKTVE ----1111---------1111-----------iiii-----------22223333----- VDRKEQVALIMNSSGSTGLPKGVQLTHENIVTRFSHARDPIYGNQVSPGTAVLTVVPFHH -3333-----------------------------------------2222------3333 GFGMFTTLGYLICGFRVVMLTKFDEETFLKTLQDYKCTSVILVPTLFAILNKSELLNKYD 3333----------------------------1111------3333--------3333-- LSNLVEIASGGAPLSKEVGEAVARRFNLPGVRQGYGLTETTSAIIITPEGDDKPGASGKV 3333-------------------1111---------1111-------2222-2222---- VPLFKAKVIDLDTKKSLGPNRRGEVCVKGPMLMKGYVNNPEATKELIDEEGWLHTGDIGY 2222-------------------------------------------1111--------- YDEEKHFFIVDRLKSLIKYKGYQVPPAELESVLLQHPSIFDAGVAGVPDPVAGELPGAVV -1111------3333---!!!!--3333-------1111--------------------- VLESGKNMTEKEVMDYVASQVSNAKRLRGGVRFVDEVPKGLTGKIDGRAIREILKKPV --2222----------1111-3333-1111---------------------------- >TRANSCRIPTIONAL REGULATOR; SWP:P37478; PDB:2D1VA; SSNEIHIGSLVIFPDAYVVSKRDETIELTHREFELLHYLAKHIGQVMTREHLLQTVWGYD ------!!!!---1111---iiii-----------------2222------------111 YFGDVRTVDVTVRRLREKIEDNPSHPNWIVTRRGVGYYLRNPE 1--------------------3333------2222-------- 
>CORTACTIN ISOFORM A; SWP:Q96H99; PDB:2D1XA; NDLGITAVALYDYQAAGDDEISFDPDDIITNIEMIDDGWWRGVCKGRYGLFPANYVELRQ 1111------------1111---2222----------------iiii----1111----- >HYPOTHETICAL PROTEIN TT03; SWP:Q5SLC4; PDB:2D1YA; GLFAGKGVLVTGGARGIGRAIAQAFAREGALVALCDLRPEGKEVAEAIGGAFFQVDLEDE --2222-------------------------------3333--------------3333- RERVRFVEEAAYALGRVDVLVNNAAIAAPGSALTVRLPEWRRVLEVNLTAPMHLSALAAR ------------------------------3333-------------------------- EMRKVGGGAIVNVASVQGLFAEQENAAYNASKGGLVNLTRSLALDLAPLRIRVNAVAPGA 3333----------1111---------------------------3333----------- IATEAVLEAIARRDWEDLHALRRLGKPEEVAEAVLFLASEKASFITGAILPVDGGMTASF -------33333333---1111---3333---------3333----------iiii---- >ENDO-1,4-BETA-D-XYLANASE; SWP:Q7SI98; PDB:2D1ZA; AESTLGAAAAQSGRYFGTAIASGKLGDSAYTTIASREFNMVTAENEMKIDATEPQRGQFN ---------1111-------1111--------------------11113333--2222-- FSAGDRVYNWAVQNGKQVRGHTLAWHSQQPGWMQSLSGSTLRQAMIDHINGVMGHYKGKI -----------1111--------------3333----------------------2222- AQWDVVSHAFSDDGSGGRRDSNLQRTGNDWIEVAFRTARAADPAAKLCYNDYNIENWTWA --------------------3333--1111-----------1111----------1111- KTQGVYNMVRDFKQRGVPIDCVGFQSHFNSGSPYNSNFRTTLQNFAALGVDVAITELDIQ ----------------------------3333-------------1111---------22 GASSSTYAAVTNDCLAVSRCLGITVWGVRDTDSWRSGDTPLLFNGDGSKKAAYTAVLNAL 22--------------1111--------3333--3333-----1111-----------11 NGGGQIKGVGSGRCLDVPNASTTDGTQVQLYDCHSATNQQWTYTDAGELRVYGDKCLDAA 11--------------2222--2222-----------------1111------------- GTGNGTKVQIYSCWGGDNQKWRLNSDGSIVGVQSGLCLDAVGGGTANGTLIQLYSCSNGS --2222-----------------1111------------2222--2222---------11 NQRWTRT 11----- ------------------------------ >General secretion pathway; SWP:P31742; PDB:2D28C; EQRSAETRIVEALLERRRLKDTDLVRARQESGGLLALLGRLGLVSERDHAETCAEVLGLP --------------------------------------1111------------------ LVDARQLGDTPPEEVQGLSLRFLKQFHLCPVGERDGRLDLWIADPYDDYAIDAVRLATGL --3333---------------------------iiii------1111------------- PLLLHVGLRSEIDDLIERWYG --------------------- 
>ACYL-COA DEHYDROGENASE; SWP:Q5SGZ2; PDB:2D29A; GLWFEEGAEERQVLGPFREFLKAEVAPGAAERDRTGAFPWDLVRKLAEFGVFGALVPEAY -1111-------------------3333---------------------1111---3333 GGAGLSTRLFARMVEAIAYYDGALALTVASHNSLATGHILLAGSEAQKEAFLPKLASGEA ------------------------------------------------------------ LGAWGLTEPGSGSDAAALKTKAEKVEGGWRLNGTKQFITQGSVAGVYVVMARTDPPPSPE -------1111--1111-------2222-----------------------------333 RKHQGISAFAFFRPERGLKVGRKEEKLGLTASDTAQLILEDLFVPEEALLGERGKGFYDV 3-2222----------------------1111------------1111---2222----- LRVLDGGRIGIAAMAVGLGQAALDYALAYAKGREAFGRPIAEFEGVSFKLAEAATELEAA ----------------------------------iiii----3333-------------- RLLYLKAAELKDAGRPFTLEAAQAKLFASEAAVKACDEAIQILGGYGYVKDYPVERYWRD ----------1111----------------------------------3333-------- ARLTRIGEGTSEILKLVIARRLLEAV --1111-------------------- >SUFA PROTEIN; SWP:P77667; PDB:2D2AA; NPQDFAWQGLTLTPAAAIHIRELVAKQPGMVGVRLGVKQTGCAGFGYVLDSVSEPDKDDL -------------------------------------------------------3333- LFEHDGAKLFVPLQAMPFIDGTEVDFVREGLNQIFKFHNPKAQNECGCGESFGV ---iiii----1111---2222------!!!!------1111------------ >SUFC PROTEIN; SWP:Q5SH92; PDB:2D2EA; SQLEIRDLWASIDGETILKGVNLVVPKGEVHALMGPNGAGKSTLGKILAGDPEYTVERGE -------------------------2222------2222-----------3333------ ILLDGENILELSPDERARKGLFLAFQYPVVPGVTIANFLRLALQAKLGREVGVAEFWTKV --iiii-1111------------------------------------------------- KKALELLDWDESYLSRYLNEGEKKRNEILQLLVLEPTYAVLDETDSGLDIDALKVVARGV ------------------------------------------1111-------------- NAMRGPNFGALVITHYQRILNYIQPDKVHVMMDGRVVATGGPELALELEAKGYEWLKEK 11111111--------3333-----------iiii------------------------ >GLYCERALDEHYDE 3-PHOSPHAT; SWP:Q55245; PDB:2D2IA; MTIRVAINGFGRIGRNFLRCWFGRQNTDLEVVAINNTSDARTAAHLLEYDSVLGRFNADI --------------------1111------------------------------------ SYDENSITVNGKTMKIVCDRNPLNLPWKEWDIDLVIESTGVFVTAEGASKHIQAGAKKVL --------------------3333-3333--------------3333------------- ITAPGKAEGVGTYVIGVNDSEYRHEDFAVISNASCTTNCLAPVAKVLHDNFGIIKGTMTT 
-------------22221111-1111---------------------------------- THSYTLDQRILDASHRDLRRARAAAVNIVPTTTGAAKAVALVIPELKGKLNGIALRVPTP ----1111----------33331111-------33333333-3333-------------- NVSVVDLVVQVEKPTITEQVNEVLQKASQTTMKGIIKYSDLPLVSSDFRGTDESSIVDSS ---------------3333----------1111----------11112222------333 LTLVMDGDLVKVIAWYDNEWGYSQRVVDLAELAARKSG 3------------------------------------- >PHOSPHOTRIESTERASE; SWP:Q93LD7; PDB:2D2JA; TGDLINTVRGPIPVSEAGFTLTHEHICGSSAGFLRAWPEFFGSRKALAEKAVRGLRHARA ------1111--3333-------------2222----3333-----------------11 AGVQTIVDVSTFDIGRDVRLLAEVSRAADVHIVAATGLWFDPPLSMRMRSVEELTQFFLR 11--------1111----------------------------3333-------------- EIQHGIEDTGIRAGIIKVATTGKATPFQELVLKAAARASLATGVPVTTHTSASQRDGEQQ -----!!!!-----------------------------------------3333------ AAIFESEGLSPSRVCIGHSDDTDDLSYLTGLAARGYLVGLDRMPYSAIGLEGNASALALF ----1111-3333----1111----------1111------1111-2222---------- GTRSWQTRALLIKALIDRGYKDRILVSHDWLFGFSSYVTNIMDVMDRINPDGMAFVPLRV ---------------11111111--------------2222-------3333-------- IPFLREKGVPPETLAGVTVANPARFLSPT ----1111--------------------- >GIANT HEMOGLOBIN, A1(B) G; SWP:Q7M419; PDB:2D2MA; VCNRLEQILVKTQWAQSYGEAENRAAFSRDLFSELFNIQGSSRALFSGVGVDDMNSAAFT --3333----------------3333------------33331111---3333------- AHCLRVTGALNRLISQLDQQATINADLAHLAGQHASRNLDASNFAAMGQAVMSVVPTHLD ------------3333---------------1111----3333----------3333--- CFNQHAWGECYERIASGISG ---------------1111- >Extracellular giant hemog; SWP:Q7M413; PDB:2D2MB; DCTSLNRLLVKRQWAEAYGEGTNRELLGNRIWEDLFANMPDARGLFSRVNGNDIDSSEFQ -------------------!!!!----------1111-33333333---3333----333 AHSLRVLGGLDMCVASLDDVPVLNALLARLNSQHDSRGIPAAGYPAFVASAISAVRATVG 3--------------3333-----------1111-----1111----------------- ARSFDNDAWNSCMNQIVSGISG ----------------3333-- >Extracellular giant hemog; SWP:Q7M418; PDB:2D2MC; SCCSSEDRANVMHNWDAAWSAAYSDRRVALAQAVFASLFSRDAAAQGLFSGVSADNPDSA ---3333----------------------------------33333333---3333--33 
DFRAHCVRVVNGLDVAINMLNDPAVLNEQLAHLSAQHQARAGVAAAHFDVMAEAFAEVMP 33--------------1111-----------------------3333----------333 QVSSCFSSDSWNRCFARIANGISAGL 3---------------------2222 >Extracellular giant hemog; SWP:Q5KSB7; PDB:2D2MD; ECCSRGDAEVVISEWDQVFNAAMAGSSESAVGVAIFDAFFASSGVSPSMFPGGGDSNNPE ---3333------------------------------------------2222-1111-- FLAQVSRVVSGADIAINSLTNRATCDSLLSHLNAQHRAISGVTGAAVTHLSQAISSVVAQ ---------------------3333-----------------3333-------------- VLPSAHIDAWEYCMAYIAAGIGAGL -33333333------------2222 >UNDECAPRENYL PYROPHOSPHAT; SWP:Q1CS42; PDB:2D2RA; STLKHLAIIMDGNGRWAKLKNKARAYGHKKGVKTLKDITIWCANHKLECLTLYAFEVDFL ------------3333-1111----------------------------------3333- MKMLKKYLKDERSTYLDNNIRFRAIGDLEGFSKELRDTILQLENDTRHFKDFTQVLALNY ---------------1111--------------------------3333----------- GSKNELSRAFKSLLESPPSNISLLESLENEISNRLDTRNLPEVDLLLRTGGEMRLSNFLL -----------------1111---------11111111----------------%%%%33 WQSSYAELFFTPILWPDFTPKDLENIISDFYKRVR 33-----------3333------------------ >EXOCYST COMPLEX COMPONENT; SWP:P38261; PDB:2D2SA; DMSSTAQRLKFLDEGVEEIDIELARLRFESAVETLLDIESQLEDLSLMLLNLISLKIEQR ---3333-----------------------------------3333-------------- REAISSKLSQSILSSNEIVHLKSGTENMIKLGLPEQALDLFLQNRSNFIQDLILQIVDNP ----------------------------3333---------------------------- TNYLTQLAVIRFQTIKKTVEDFQDIFKELGAKISSILVDWCSDEVDNHFKLIDKQLLNLS -------------------------2222--3333------------------------1 PGSIKSSRKQIDGLKAVGLDFVYKLDEFIKKNSDKIR 111-----------------3333-------3333-- >2-DEOXY-SCYLLO-INOSOSE SY; SWP:Q9S5E2; PDB:2D2XA; TTKQICFADRCFNFAFGEHVLESVESYIPRDEFDQYIMISDSGVPDSIVHYAAEYFGKLA ------!!!!--------33333333--1111--------11113333------------ PVHILRFQGGEEYKTLSTVTNLQERAIALGANRRTAIVAVGGGLTGNVAGVAAGMMFRGI ---------3333-------------1111-1111-------------------2222-- ALIHVPTTFLAASDSVLSIKQAVNLTSGKNLVGFYYPPRFVFADTRILSESPPRQVKAGM -------------1111-------1111-------------------11113333----- CELVKNMLILKEFTEDDLNSANVYSPKQLETFINFCISAKMSVLSEDIYEKKKGLIFEYG 
-------------3333-1111------------------------1111-3333--222 HTIGHAIELAEQGGITHGEAIAVGMIYAAKIANRMNLMPEHDVSAHYWLLNKIGALQDIP 2-------1111--------------------1111--3333-----------1111--- LKSDPDSIFHYLIHEDNLGMILLSGVGKPAMYNQTLLTPVRKTLIKEVIREGL ---3333-----------------2222---%%%%-----3333--------- >CYTIDINE DEAMINASE; SWP:Q81LT6; PDB:2D30A; MNSKQLIQEAIEARKQAYVPYSKFQVGAALLTQDGKVYRGCNVENASYGLCNCAERTALF ------------3333---------------1111-----------3333---------- KAVSEGDKEFVAIAIVADTKRPVPPCGACRQVMVELCKQDTKVYLSNLHGDVQETTVGEL --1111-------------------------------1111--------------3333- LPGA 2222 >HYPOTHETICAL NADH-DEPENDE; SWP:Q974C9; PDB:2D37A; MAEVIKSIMRKFPLGVAIVTTNWKGELVGMTVNTFNSLSLNPPLVSFFADRMKGNDIPYK --------1111----------iiii-----------------------33333333333 ESKYFVVNFTDNEELFNIFALKPVKERFREIKYKEGIGGCPILYDSYAYIEAKLYDTIDV 3----------3333-------3333---------2222---1111-------------! GDHSIIVGEVIDGYQIRDNFTPLVYMNRKYYKLSS !!!----------------------iiii------ >FICOLIN-1; SWP:O00602; PDB:2D39A; EFPRNCKDLLDRGYFLSGWHTIYLPDCRPLTVLCDMDTDGGGWTVFQRRMDGSVDFYRDW ---------1111----------1111----------iiii------------------- AAYKQGFGSQLGEFWLGNDNIHALTAQGSSELRVDLVDFEGNHQFAKYKSFKVADEAEKY --------1111------------------------------------------3333-- KLVLGAFVGGSAGNSLTGHNNNFFSTKDQDNDVSSSNCAEKFQGAWWYADCHASNLNGLY ---------33331111-2222---1111----------1111-----------1111-- LMGANGINWSAAKGYKYSYKVSEMKVRPA -----------2222-------------- >GLUTAMINE SYNTHETASE; SWP:P38562; PDB:2D3AA; CLTDLVNLNLSDTTEKIIAEYIWIGGSGMDLRSKARTLPGPVTDPSKLPKWNYDGSSTGQ 333311113333------------1111---------------3333------1111--- APGEDSEVILYPQAIFKDPFRRGNNILVMCDCYTPAGEPIPTNKRYSAAKIFSSPEVAAE -3333----------------!!!!--------1111--1111-----------3333-- EPWYGIEQEYTLLQKDTNWPLGWPIGGFPGPQGPYYCGIGAEKSFGRDIVDAHYKACLYA -------------------2222-----------2222-1111--3333----------- GINISGINGEVMPGQWEFQVGPSVGISSGDQVWVARYILERITEIAGVVVTFDPKPIPGD -----------2222--------!!!!--------------------------------- 
WNGAGAHTNYSTESMRKEGGYEVIKAAIEKLKLRHKEHIAAYGEGNERRLTGRHETADIN -----------3333-2222-----------------3333-2222------%%%%-333 TFSWGVANRGASVRVGRETEQNGKGYFEDRRPASNMDPYVVTSMIAETTIVWK 3------1111---------------------11113333------------- >VTS1 PROTEIN; SWP:Q08831; PDB:2D3DA; SMNPKSLTDPKLLKNIPMWLKSLRLHKYSDALSGTPWIELIYLDDETLEKKGVLALGARR --3333--3333--------11113333---333333331111-----1111-------- KLLKAFGIVIDYKERDLIDRSAY ------------1111--3333- >General control protein G; SWP:P58772; PDB:2D3EA; DKVEELLSKNYHLENEVARLKKLLERAEERAELSEGKCAELEEELKTVTNNLKSLEAQAE ------------------------------------------------------------ KYSQKEDKYEEEIKVLSDKLKEAETRAEFAERSVTKLEKSIDDLEDELYAQKLKYKAISE --3333------------------------------------------------------ ELDHALNDMT ---------- >WNT INHIBITORY FACTOR-1; SWP:Q9Y5W5; PDB:2D3JA; GSHMLDQQEESLYLWIDAHQARVLIGFEEDILIVSEGKMAPFTHDFRKAQQRMPAIPVNI ----------------------------------iiii------------------3333 HSMNFTWQAAGQAEYFYEFLSLRSLDKGIMADPTVNVPLLGTVPHKASVVQVGFPCLGKQ ------------------------------------------------------------ DGVAAFEVDVIVMNSEGNTILQTPQNAIFFKTCLQAE -------------1111-------------------- >GLUCAN 1,4-ALPHA-MALTOHEX; SWP:P19571; PDB:2D3NA; TNGTMMQYFEWYLPNDGNHWNRLNSDASNLKSKGITAVWIPPAWKGASQNDVGYGAYDLY ---------1111-----------------1111-------------1111------111 DLGEFNQKGTVRTKYGTRSQLQAAVTSLKNNGIQVYGDVVMNHKGGADATEMVRAVEVNP 1-----%%%%--1111------------1111--------------------------11 NNRNQEVTGEYTIEAWTRFDFPGRGNTHSSFKWRWYHFDGVDWDQSRRLNNRIYKFRGHG 11------------------1111---------1111------1111----------222 KAWDWEVDTENGNYDYLMYADIDMDHPEVVNELRNWGVWYTNTLGLDGFRIDAVKHIKYS 2-------2222----------1111---------------------------1111333 FTRDWINHVRSATGKNMFAVAEFWKNDLGAIENYLQKTNWNHSVFDVPLHYNLYNASKSG 3------------------------------------%%%%-----------------ii GNYDMRNIFNGTVVQRHPSHAVTFVDNHDSQPEEALESFVEEWFKPLAYALTLTREQGYP ii-3333-222233333333------11112222------3333---------------- SVFYGDYYGIPTHGVPAMRSKIDPILEARQKYAYGKQNDYLDHHNIIGWTREGNTAHPNS 
--3333---3333----3333--------------------------------3333--- GLATIMSDGAGGSKWMFVGRNKAGQVWSDITGNRTGTVTINADGWGNFSVNGGSVSIWVN ------------------1111------1111--------1111---------------- K - >LECTIN ALPHA CHAIN; SWP:P81517; PDB:2D3PA; ADTIVAVELDTYPNTDIGDPNYQHIGINIKSIRSKATTRWNVQDGKVGTAHISYNSVAKR -------------1111-------------------------2222--------3333-- LSAIVSYPGGSSATVSYDVDLNNILPEWVRVGLSASTGLYKETNTILSWSFTSKLKTNST -------------------3333------------------------------------- ADAQSLHFTFNQFSQNPKDLILQGDASTDSDGNLQLTRVSNGSPQSNSVGRALYYAPVHV ----------------1111--------1111-------!!!!----------------- WDKSAVVASFDATFTFLIKSTDSDIADGIAFFIANTDSSIPHGSGGRLLGLFPDAN -1111-----------------------------1111------!!!!-------- >DECOLORIZING PEROXIDASE; SWP:Q8WZK8; PDB:2D3QA; TILPLNNIQGDILVGMKKQKERFVFFQVNDATSFKTALKTYVPERITSAAILISDPSQQP ---1111-3333------------------------------------------3333-- LAFVNLGFSNTGLQALGITDDLGDAQFPDGQFADAANLGDDLSQWVAPFTGTTIHGVFLI -------------1111--------33333333-3333--3333---------------- GSDQDDFLDQFTDDISSTFGSSITQVQALSGSARPGDQAGHEHFGFLDGISQPSVTGWET ---3333-----------!!!!------------!!!!---1111--------------- TVFPGQAVVPPGIILTGRDGDTGTRPSWALDGSFMAFRHFQQKVPEFNAYTLANAIPANS --2222----33332222----------2222---------------------------- AGNLTQQEGAEFLGARMFGRWKSGAPIDLAPTADDPALGADPQRNNNFDYSDTLTDETRC --------------------1111-1111-----3333--1111-----1111------- PFGAHVRKTNPRQDLGGPVDTFHAMRSSIPYGPETSDAELASGVTAQDRGLLFVEYQSII ----------!!!!---------------------3333-------------------33 GNGFRFQQINWANNANFPFSKPITPGIEPIIGQTTPRTVGGLDPLNQNETFTVPLFVIPK 33-----------1111----------------------------3333----------- GGEYFFLPSISALTATIAA ---------------3333 >leukocyte immunoglobulin-; SWP:Q6PI73; PDB:2D3VA; LSKATLWAEPGSVISRGNSVTIRCQGTLEAQEYRLVKEGSPEPWDTQNPLEPKNKARFSI --------------2222--------1111------2222-------------------- PSTEHHAGRYRCYYYSPAGWSEPSDPLELVVTGFYNKPTLSAVTLQCGSRLRFDRFILTE --1111---------1111----------------------------------------- 
EKLSWTLDSQLTPSGQFQALFPVGPVTPSHRWLRCYGSRRHILQVWSEPSDLLEIPV -----------1111-----------3333--------3333--------------- >PROBABLE ATP-DEPENDENT TR; SWP:P77499; PDB:2D3WA; MLSIKDLHVSVEDKAILRGLSLDVHPGEVHAIMGPNGSGKSTLSATLAGREDYEVTGGTV ----------iiii----------2222-------------------------------- EFKGKDLLALSPEDRAGEGIFMAFQYPVEIPGVSNQFFLQTALNAVRSYRGQETLDRFDF -iiii1111-3333---------------2222--------------1111--------- QDLMEEKIALLKMPEDLLTRSVNVGFSGGEKKRNDILQMAVLEPELCILDESDSGLDIDA ---------------3333---3333-3333----------------------------- LKVVADGVNSLRDGKRSFIIVTHYQRILDYIKPDYVHVLYQGRIVKSGDFTLVKQLEEQG -------------------------3333----------iiii-----1111--1111-- YGWL 3333 >URACIL-DNA GLYCOSYLASE; SWP:Q5SJ65; PDB:2D3YA; MDREAFVQTLTACRLCPRLVAWREEVVGRKRAFRGEPYWARPVPGFGDPEARILLFGLAP --------3333-----------3333--3333--------------1111--------- GAHGSNRTGRPFTGDASGAFLYPLLHEAGLSSKPESLPGDDLRLYGVYLTAAVRCAPPKN 1111-----2222-----------------------2222----------------2222 KPTPEELRACARWTEVELGLLPEVRVYVALGRIALEALLAHFGLRKSAHPFRHGAHYPLP --------------------1111--------------------3333---2222----- GGRHLLASYHVSRQNTQTGRLTREMFLEVLMEAKRLAGL ---------------1111---------------1111- >PUTATIVE GENTISATE 1,2-DI; SWP:Q8X655; PDB:2D40A; TPNANCAPAYWNYQEIRPLLVLENPALRGQSSITATLYAGLQLIPGEVAPSHRHNQSALR -----------3333------------------1111----------------------- FIVEGKGAFTAVDGERTPNEGDFILTPQWRWHDHGNPGDEPVIWLDGLDLPLVNILGCGF ------------------2222-------------------------------------- AEDYPQQPVTRKEGDYLPRYAANLPLRHQTGNSSPIFNYRYDRSREVLHDLTRLGDADEW -----------22223333--------------------3333-------1111------ DGYKRYVNPVTGGYPPSGAFLQLLPKGFASRVARTTDSTIYHVVEGSGQVIIGNETFSFS ------------------------2222-------------------------------2 AKDIFVVPTWHGVSFQTTQDSVLFSFSDRPVQEALGLFREARY 222----2222----------------3333-1111------- >NON-TOXIC CRYSTAL PROTEIN; SWP:Q6L5X8; PDB:2D42A; AIINLLRELEIYGMQYANSHQYTYGSSYSDDTNPIRIAGLDARIPDPIVTDPVNHIVLDR -----------------1111--------1111--------------------------- 
RIITNTTSNSLEGVFSFSNAYTSRTSSQTRDGVTAGTNITGKYFANLFFEQVGLSGRIAF ------------------------------------------------3333-------- EGAVTNENKYTLDATQDFRDSQTIRVPPFHRATGVYTLEQGAFEKMTVLECVVSGNGIIR ------------------------------------------------------------ YYRTLPDNSYTEIVQRVNIIDVLQANGTPGFTISKEQNRAYFTGEGTISGQIGLQTFIDV -----%%%%-------------------------1111---------------------- VIEPLPGHA ----2222- >calcium channel, voltage-; SWP:O00305; PDB:2D46A; GSHMYDNLYLHGIEDSEAGSADSYTSRPSDSDVSLEEDREAIRQEREQQAAIQLERAKSK -----------------------------------------3333--------------- P - >INTERLEUKIN-4; SWP:P05112; PDB:2D48A; HKCDITLQEIIKDLNSLTEQKTLCTELTVTDIFAASKNTTEKETFCRAATVLRQFYSHHE 1111-----------1111--3333-----1111------------------------11 KDTRCLGATAQQFHRHKQLIRFLKRLDRNLWGLAGLNSCPVKEANQSTLENFLERLKTIM 11---------------------------------------------------------- REKYSKCSS ----1111- >MALATE DEHYDROGENASE; SWP:Q9YEA1; PDB:2D4AA; MITILGAGKVGMATAVMLMMRGYDDLLLIARTPGKPQGEALDLAHAAAELGVDIRISGSN ------------------------------------------------------------ SYEDMRGSDIVLVTAGIGEQLLEANANTMADLAEKIKAYAKDAIVVITTNPVDAMTYVMY 33332222--------------------------3333-1111-------3333------ KKTGFPRERVIGFSGILDSARMAYYISQKLGVSFKSVNAIVLGMHGQKMFPVPRLSSVGG ------1111---3333---------------3333---------1111--------iii VPLEHLMSKEEIEEVVSETVNAGAKITELRGYSSNYGPAAGLVLTVEAIKRDSKRIYPYS i3333-------------1111------------3333---------------------- LYLQGEYGYNDIVAEVPAVIGKSGIERIIELPLTEDEKRKFDEAVQAVKKLVETLPPQLR ----2222------------1111-------------------------------3333- E - >5-carboxymethyl-2-hydroxy; SWP:Q5SJP9; PDB:2D4EA; YADRVAGISWETIEEVRRRLKERPALHFIAGEFVPSESGETFPSLDPATNEVLGVAARGG ----------------------------iiii---3333------3333----------- EREVDRAAKAAHEAFQRWSRTKAKERKRYLLRIAELIEKHADELAVMECLDAGQVLRIVR ----------------3333-3333-----------------------------3333-- AQVARAAENFAFYAEYAEHAMEDRTFPVDRDWLYYTVRVPAGPVGIITPWNAPLMLSTWR -----------33331111-------------------------------------3333 
IAPALAFGNTVVLKPAEWSPFTATKLAEILKEADLPPGVFNLVQGFGEEAGAALVAHPLV ----1111-------11113333------------2222------3333-------1111 PLLTLTGETETGKIVMRNAADHLKRLSPELGGKSPALVFADADLERALDAVVFQIFSFNG -------3333------3333-----------------1111------------------ ERCTASSRLLVEEKIFEDFVGKVVERARAIRVGHPLDPETEVGPLIHPEHLQRVLGYVEA -1111------3333-----------1111---3333----------------------- GKREGARLLVGGERAKTSFRGEDLSRGNYLLPTVFVGENHMKIAQEEIFGPVLVAIPFKD -1111------------------3333----------11111111--------------- EEEALRKANDTKYGLAAYVFTRDLERAHRLALELEAGMVYLNSHNVRHLPTPFGGVKGSG -----------------------------------------------3333----!!!!- DRREGGTYALDFYTDLKTIALPLRPPHVPKFGK ----!!!!3333--------------------- >HYPOTHETICAL PROTEIN BSU1; SWP:O31629; PDB:2D4GA; KYGIVLFPSKKLQDLANSYRKRYDPSYSLIPPHLTLRASFECAEEKADQLVSHLRNIAKE --------3333-----------1111---------------1111-------------- SHPLVLKTKYSSFAPVNNVIYIKAEPTEELKTLNEKLYTGVLAGEQEYNFVPHVTVGQNL ---------------------------------1111-!!!!------------------ SDDEHSDVLGQLKQEVSHEEIVDRFHLLYQLENGSWTVYETFLLG ------------------------------1111----------- >DU; SWP:P07570; PDB:2D4NA; KQPISKLTRATPGSAGLDLCSTSHTVLTPEMGPQALSTGIYGPLPPNTFGLILGRSSITM --3333----1111-------------3333-------------2222------------ KGLQVYPGVIDNDYTGEIKIMAKAVNNIVTVSQGNRIAQLILLPLIETDNKVQ ----------1111-----------------2222------------------ >HYPOTHETICAL PROTEIN TTHA; SWP:Q72J89; PDB:2D4PA; MRFRPFTEEDLDRLNRLAGKRPVSLGALRFFARTGHSFLAEEGEEPMGFALAQAVWQGEA ------3333---3333!!!!---------1111-------------------------- TTVLVTRIEGRSVEALRGLLRAVVKSAYDAGVYEVALHLDPERKELEEALKAEGFALGPL ---------------------------1111--------3333-------1111------ VLAVRVLGSR ---------- >NEUROFIBROMIN; SWP:P21359; PDB:2D4QA; KEEFKALKTLSIFYQAGTSKAGNPIFYYVARRFKTGQINGDLLIYHVLLTLKPYYAKPYE --33333333--------1111------3333------3333--------3333------ IVVDLTHTGPSNRFKTDFLSKWFVVFPGFAYDNVSAVYIYNCNSWVREYTKYHERLLTGL ----2222-1111-------1111--3333----------------------33331111 KGSKRLVFIDCPGKLAEHIEHEQQKLPAATLALEEDLKVFHNALKLAHKDTKVSIKVGST 
--1111--------------------3333-1111------------------------- AVQVTSAERTKVLGQSVFLNDIYYASEIEEICLVDENQFTLTIANQGTPLTFMHQECEAI -----------iiii--------3333-------1111----2222-------------- VQSIIHIRTRWELSQPD -----------1111-- >HYPOTHETICAL PROTEIN TTHA; SWP:Q72K65; PDB:2D4RA; PEVRAERYIPAPPERVYRLAKDLEGLKPYLKEVESLEVVAREGARTRSRWVAVAGKKVRW -----------3333------333333331111--------!!!!--------------- LEEEEWDDENLRNRFFSPEGDFDRYEGTWVFLPEGEGTRVVLTLTYELTIPIFGGLLRKL -------1111-------------------------------------------1111-- VQKLQENVESLLKGLEERVLAASS -------------------1111- >METHYL-ACCEPTING CHEMOTAX; SWP:P02942; PDB:2D4UA; GPLGSGGLFFNALKNCKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN ------------------------------------------------------------ IGSGSTVAELMESASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELI ---------------------------1111--1111----------------------- QLLGAGKINEFFDQPTQGYQDGFEKQYVAYMEQNDRLHDIAVSDNNA --1111----------------------------------------- >ISOCITRATE DEHYDROGENASE; SWP:Q8GAX0; PDB:2D4VA; THIQKPATGSPLTLLNGVLQVPDQPIIPFIEGDGIGCDVTPAMRSVVDAAVAKVYGGQRQ -----1111------------------------3333-----------------iiii-- IAWMELFAGQKAVQLYGEGQYLPDETMAAIREYKVAIKGPLETPVGGGIRSLNVAMRQDL ----------------2222--3333---------------------------------- DLYVCLRPVRYFEGTPSPMRHPEKVDMVIFRENSEDIYAGIEWPAGSPEAEKIIRFLREE -----------2222-----3333-----------1111----2222------------- MGVTKIRFPDSSAIGIKPVSTEGSERLIRRTIQYALEHGKPSVSLVHKGNIMKFTEGGFR -------3333------------------------------------3333--------- DWGYALAEREFAGRVFTWRQKAAISKAEGKAAGQKAEQQAIADGKLIIKDVIADNFLQQI ----------2222--------------------------1111---------------- LLRPEDYSVVATLNLNGDYVSDALAAEVGGIGMAPGANLSDTHAIFEATHGTAPDIAGQG -------------------------1111-----------------------3333---- KANPSSLILSAVMMLEHLGWGEAAQAIVAAMNATIAAGEVTGDLAALRGDVPALSTTEFT ---------------1111---------------1111--33331111------------ AALIRRF ---1111 >GLYCEROL KINASE; SWP:NA; PDB:2D4WA; ADYVLAIDQGTTSSRAIVFDHSGEIYSTGQLEHDQIFPRAGWVEHNPEQIWNNVREVVGL 
---------1111------1111---------------2222------------------ ALTRGNLTHEDIAAVGITNQRETAVVWDKTTGKPVYNAIVWQDTRTQKIVDELGGDEGAE -------3333----------------------------11111111----------111 KYKSIVGLPLATYFSGPKIKWILDNVEGAREKAEKGDLLFGNTDTWVLWNMTGGTEGGVH 13333-----3333-----------2222-----------------------!!!!---- VTDVTNASRTMLMDLDTLSWREDIAADMGIPLSMLPDIRSSSEVYGHGRPRGLVPGVPIA --3333-------------------1111-3333--------------1111-2222--- GILGDQQAATFGQACFEVGQAKNTYGTGNFLLLNTGTEKVMSKNGLLTTVCYKIGDAPAV ----------1111--2222---------------------------------!!!!--- YALEGSIAVTGSLVQWLRDNLGMFEDAPDVEWLAGKVQDNGGAYFVPAFSGLFAPYWRPD -----------------------3333-33331111---iiii---2222---------- ARGALVGLTRYVNRNHIARAALEATAFQSREVVDAMNADSGVDLTELRVDGGMVANELLM --------3333--------------------------------------3333------ QFQADQLGVDVVRPKVAETTALGAAYAAGIAVGFWKGEQDVIDNWAEDKRWSPSMESGER -----------------------------------------1111--------------- ERLYRNWKKAVTKTMEWVDEDVE ----------------------- >FLAGELLAR HOOK-ASSOCIATED; SWP:P16326; PDB:2D4XA; VVLSQAQAQNSQYALARTFATQKVSLEESVLSQVTTAIQTAQEKIVYAGNGTLSDDDRAS ---------------------------------------------11111111------- LATDLQGIRDQLMNLANSTDGNGRYIFAGYKTEAAPFDQATGGYHGGEKSVTQQVDSART -------------------1111----!!!!----------------------------- MVIGHTGAQIFNSITSNAVPEPDGSDSEKNLFVMLDTAIAALKTPVEGNNVEKEKAAAAI -----3333-----1111--1111----------------1111-2222----------- DKTNRGLKNSLNNVLTVRAELGTQLSELSTLDSL ------------------------------3333 >FLAGELLAR HOOK-ASSOCIATED; SWP:P0A1J5; PDB:2D4YA; DAFITNQLRGAQNQSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLVSNAEDP ---------------------------------1111------------------1111- AARQALIGKAEGLVNQFKTTDQYLRDQDKQVNIAIGSSVAQINNYAKQIANLNDQISRMT ------------------------------------------------------------ NDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGSTARQLAAVPSSADPTR ------------------------2222-----1111----!!!!--------3333--- TTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDQTRNTLGQLALAFADAFNAQHTK ---------------3333----------------------------------------- 
GYDADGNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQATDYKIVFDGTDWQVT --1111----------------1111-1111--------1111----------------- RTADNTTFTATKDADGKLEIDGLKVTVGTGAQKNDSFLLKPVSNAIVDMNVKVTNEAEIA ------------1111---------------2222------11111111-----3333-- MASESKLSDNRNGQALLDLQNSNVVGGNKTFNDAYATLVSDVGNKTSTLKTSSTTQANVV ------------------1111--iiii-------------------------------- KQLYKQQQS --------- >CHLORIDE CHANNEL PROTEIN; SWP:P21564; PDB:2D4ZA; LSWSSANKYNIQVGDIMVRDVTSIASTSTYGDLLHVLRQTKLKFFPFVDTPDTNTLLGSI ------------------------1111-------------------------------- DRTEVEGLLQRRISAYRRQPAAAAEADEEFEEMLTLEEIYRWEQREKNVVVNFETCRIDQ ---------------------------------------------1111----------- SPFQLVEGTSLQKTHTLFSLLGLDRAYVTSMGKLVGVVALAEIQAAIEG -----3333--------------------iiii---------------- >PENTAKETIDE CHROMONE SYNT; SWP:NA; PDB:2D51A; GPGMSSLSNSLPLMEDVQGIRKAQKADGTATVMAIGTAHPPHIFPQDTYADVYFRATNSE --11111111----------------------------------3333------111111 HKVELKKKFDHICKKTMIGKRYFNYDEEFLKKYPNITSYDEPSLNDRQDICVPGVPALGT 111111------1111---------333311113333----------------------- EAAVKAIEEWGRPKSEITHLVFCTSCGVDMPSADFQCAKLLGLHANVNKYCIYMQGYAGG ------------3333---------------3333--------1111---------3333 TVMRYAKDLAENNRGARVLVVCAELTIMGLRAPNETHLDNAIGISLFGDGAAALIIGSDP ------------2222----------1111---11113333------------------- IIGVEKPMFEIVCTKQTVIPNTEDVIHLHLRETGMMFYLSKGSPMTISNNVEACLIDVFK 2222-----------------1111-----1111-------------------------1 SVGITPPEDWNSLFWIPHPGGRAILDQVEAKLKLRPEKFRAARTVLWDYGNMVSASVGYI 111-------------------------------1111-------------!!!!----- LDEMRRKSAAKGLETYGEGLEWGVLLGFGPGITVETILLHSLPL --------1111----iiii------------------------ >ASABF; SWP:P90683; PDB:2D56A; AVDFSSCARMDVPGLSKVAQGLCISSCKFQNCGTGHCEKRGGRPTCVCDRCGR -------------3333----------1111---------------------- >ALLOGRAFT INFLAMMATORY FA; SWP:P55008; PDB:2D58A; KAQQEERLDEINKQFLDDPKYSSDEDLPSKLEGFKEKYMEFDLNGNGDIDIMSLKRMLEK 3333-------------3333--1111----------3333--1111-----------11 
LGVPKTHLELKKLIGEVSSGSGETFSYPDFLRMMLGKRSAILKMILM 11--------------------------------------------- >HYPOTHETICAL PROTEIN PH11; SWP:O58836; PDB:2D59A; TRPIDGLTDEDIREILTRYKKIALVGASPKPERDANIVMKYLLEHGYDVYPVNPKYEEVL -----------------------------11113333---------------1111--ii GRKCYPSVLDIPDKIEVVDLFVKPKLTMEYVEQAIKKGAKVVWFQYNTYNREASKKADEA ii----3333------------3333-------------------------------111 GLIIVANRCMMREHERLLGEK 1-------------------- >METHIONYL-TRNA SYNTHETASE; SWP:P23395; PDB:2D5BA; MEKVFYVTTPIYYVNAEPHLGHAYTTVVADFLARWHRLDGYRTFFLTGTDEHGETVYRAA ------------1111--------------------1111-------------------- QAAGEDPKAFVDRVSGRFKRAWDLLGIAYDDFIRTTEERHKKVVQLVLKKVYEAGDIYYG ---------------------------------1111--------------1111----- EYEGLYCVSCERFYTEKELVEGLCPIHGRPVERRKEGNYFFRMEKYRPWLQEYIQENPDL --------------3333-iiii------------------3333-----------1111 IRPEGYRNEVLAMLAEPIGDLSISRPKSRVPWGIPLPWDENHVTFVWFDALLNYVSALDY --3333-------------------3333------1111-----------33333333-- PEGEAYRTFWPHAWHLIGKDILKPHAVFWPTMLKAAGIPMYRHLNVGGFLLGPDGRKMSK ---3333-3333-----3333------------------------------1111---33 TLGNVVDPFALLEKYGRDALRYYLLREIPYGQDTPVSEEALRTRYEADLADDLGNLVQRT 33--------------------------2222---------------------------- RAMLFRFAEGRIPEPVAGEELAEGTGLAGRLRPLVRELKFHVALEEAMAYVKALNRYINE -------%%%%------!!!!------3333---1111---------------------- KKPWELFKKEPEEARAVLYRVVEGLRIASILLTPAMPDKMAELRRALGLKEEVRLEEAER -33333333-----------------------------------1111-----3333--- WGLAEPRPIPEEAPVLFPKK -------------------- >SHIKIMATE 5-DEHYDROGENASE; SWP:Q5SJF8; PDB:2D5CA; MLRFAVLGHPVAHSLSPAMHAFALESLGLEGSYEAWDTPLEALPGRLKEVRRAFRGVNLT ----------1111----------1111----------3333------3333-------- LPLKEAALAHLDWVSPEAQRIGAVNTVLQVEGRLFGFNTDAPGFLEALKAGGIPLKGPAL -----3333--------------------iiii---------------1111-------- VLGAGGAGRAVAFALREAGLEVWVWNRTPQRALALAEEFGLRAVPLEKAREARLLVNATR ---------------------------3333-----------------1111-------2 VGLEDPSASPLPAELFPEEGAAVDLVYRPLWTRFLREAKAKGLKVQTGLPMLAWQGALAF 
222-1111---1111-----------------------1111------------------ RLWTGLLPDPSGMEEAARRAL -----------------1111 >GLYCININ A3B4 SUBUNIT; SWP:NA; PDB:2D5FA; NECQLNNLNALEPDHRVESEGGLIETWNSQHPELQCAGVTVSKRTLNRNGLHLPSYSPYP 1111--------------1111-----1111---------------2222---------- QMIIVVQGKGAIGFAFPGCPETFEKPQLQDSHQKIRHFNEGDVLVIPPGVPYWTYNTGDE ---------------2222-------------------2222----2222---------- PVVAISLLDTSNFNNQLDQNPRVFYLAGNPDIEHPETMQEGGSVLSGFSKHFLAQSFNTN --------1111---------------------3333-----3333--------1111-- EDTAEKLRSPDDERKQIVTVEGGLSVISPKWGVEENICTMKLHENIARPSRADFYNPKAG -------------------2222------------3333--------3333----1111- RISTLNSLTLPALRQFGLSAQYVVLYRNGIYSPHWNLNANSVIYVTRGKGRVRVVNQGNA -----33333333------------2222------------------------------- VFDGELRRGQLLVVPQNFVVAEQGGEQGLEYVVFKTHHNAVSSYIKDVFRAIPSEVLSNS ------2222----2222------1111--------------------1111-------- YNLGQSQVRQLKYQGNSGPLVNP ----------------------- >DPS FAMILY PROTEIN; SWP:Q1Y2H0; PDB:2D5KA; SNQQDVVKELNQQVANWTVAYTKLHNFHWYVKGPNFFSLHVKFEELYNEASQYVDELAER -3333---------------------------1111------------------------ ILAVGGNPVGTLTECLEQSIVKEAAKGYSAEQMVEELSQDFTNISKQLENAIEIAGNAGD -1111--------------------------------------------------1111- DVSEDMFIGMQTSVDKHNWMFKSYLSLE ---------------------------- >DIPEPTIDYL AMINOPEPTIDASE; SWP:Q7MUW6; PDB:2D5LA; EFYNFYPEVGLQWMGDNYVFIEGDDLVFNKTTRFSAADLNALMPPSFRTLDAGRGLVVLF ---------------------------------------1111-------3333------ TQGGLVGFDMLARKVTYLFDTNEETASLDFSPVGDRVAYVRNHNLYIARGGKLGEGMSRA --------------------iiii1111--3333------iiii-------2222----- IAVTIDGTETLVYGQAVHQREFGIEKGTFWSPKGSCLAFYRMDQSMVKPTPIVDYHPLEA -------1111-----%%%%iiii------1111---------1111------------- ESKPLYYPMAGTPSHHVTVGIYHLATGKTVYLQTGEPKEKFLTNLSWSPDENILYVAEVN --------2222------------------------1111-------1111--------3 RAQNECKVNAYDAETGRFVRTLFVETDKHYVEPLHPLTFLPGSNNQFIWQSRRDGWNHLY 333------------------------------------2222--------1111----- LYDTTGRLIRQVTKGEWEVTNFAGFDPKGTRLYFESTEASPLERHFYCIDIKGGKTKDLT 
-------------------------1111-------3333---------1111------- PESGMHRTQLSPDGSAIIDIFQSPTVPRKVTVTNIGKGSHTLLEAKTGYAMPEIRTGTIM ----------------------1111---------------------------------- AADGQTPLYYKLTMPLHFDPAKKYPVIVYVYGGPHAQLVTKTWRSSVGGWDIYMAQKGYA 1111----------22221111------------------------iiii---3333--- VFTVDSRGSANRGAAFEQVIHRRLGQTEMADQMCGVDFLKSQSWVDADRIGVHGWSYGGF -------------------2222------------------11111111----------- MTTNLMLTHGDVFKVGVAGGPVIDWNRYEIMYGERYFDAPQENPEGYDAANLLKRAGDLK --------3333-------------------3333---3333--------33333333-- GRLMLIHGAIDPVVVWQHSLLFLDACVKARTYPDYYVYPSHEHNVMGPDRVHLYETITRY -------1111---3333------------------------------------------ FTDHL ----- >FLAVOREDOXIN; SWP:Q4W5X6; PDB:2D5MA; MKKSLGARTLAYPTPLFLVGTYDRDSRPNIMAAAWAGICCSQPPSIAVSLRKATYTYRSI ----------------------1111------------------------3333------ TERGAFTISIPSRAYVRHADYAGIYSGENEDKFASLGLTPVPGEHVDAPYVGEFPMAIEL -----------3333-----1111--------------------------3333------ KLIHQIEHTQFIGEIMDVKVDESCLRDDGLPDINKVDPVIFAPVSREYYAVGEFLAKAFS --------------------3333-1111---3333-----------------------3 AGK 333 >CCR4-NOT TRANSCRIPTION CO; SWP:Q9UIV1; PDB:2D5RA; RICEVWACNLDEEMKKIRQVIRKYNYVAMDTEFPGVVARPIGEFRSNADYQYQLLRCNVD -----3333-----------1111----------------------------------11 LLKIIQLGLTFMNEQGEYPPGTSTWQFNFKFNLTEDMYAQDSIELLTTSGIQFKKHEEEG 11----------1111---------------1111-----------1111---------- IETQYFAELLMTSGVVLCEGVKWLSFHSGYDFGYLIKILTNSNLPEEELDFFEILRLFFP ------------------------------------------------------------ VIYDVKYLMKSCKNLKGGLQEVAEQLELERIGPQHQAGSDSLLTGMAFFKMREMFFEDHI ---3333----1111--------------------------------------------- DDAKYCGHLYGL 33332222---- >Protein Tob1; SWP:P50616; PDB:2D5RB; HMQLEIQVALNFIISYLYNKLPRRRVNIFGEELERLLKKKYEGHWYPEKPYKGSGFRCIH ----------------2222--------------------2222-3333-22221111-- IGEKVDPVIEQASKESGLDIDDVRGNLPQDLSVWIDPFEVSYQIGEKGPVKVLYVD ------------------3333-1111--------2222-----1111-------- >HEPATOCYTE NUCLEAR FACTOR; SWP:P70512; PDB:2D5VA; 
MEEINTKEVAQRITTELKRYSIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFR -----------------------------------------------3333--------- RMWKWLQEPEFQRMSALRLPRLVFTDVQRRTLHAIFKENKRPSKELQITISQQLGLELST -------------3333------------------------------------------- VSNFFMNARRRSLDK --------------- >PEPTIDE ABC TRANSPORTER, ; SWP:Q5SHU6; PDB:2D5WA; MGPQDNSLVIGASQEPRVLAGDFLRVISNQAIKSEIEQYLFAPFIGFNADSQNFPVLATE ---------------------1111----------------------1111--------- VPTLENGRLRVTDIGGGKKRLEMDITIRPDAKWSDGRPITTEDVAFYFEVGKAKGMPVLN ---1111-------iiii---------1111-1111----------------2222---- PDFWERVNVRIKDARNFTLIFEPAYYYDTYGPINTYAPKHIMGPEWERVKAAARGLDPDK ------------------------1111---------3333-----------1111---- DAEKLNELYRNFFLKFATPQALNRGAMVYSGPFKLKRWVPGNSIEMERNPNFPIKPEGGE --------------------------------------2222------1111---22221 SKYVQKVVYRFIQNTNSLLVAVIGGSIDATSSVSLTFDQGRSPQLVRRAPGRFDIWFVPG 111--------------------------------3333-----11112222-------- AIWEHIDINKFENCQVVKDLGLNDKRTRQAILHALNREGLVKAFFDGLQPVAHTWIAPVN ----------1111------1111-------1111---------------------3333 PLFNPNVKKYEFDLKKAEALLAEMGWRKGPDGILQRTVNGRTVRFEIEYVTTAGNVVRER ---------------------1111---1111-----iiii----------2222----- TQQFFAEDLKKIGIAVKINNAPSAVVFADEFIQRASECKWTGMFEFAWVSNLQEDGSLFQ ---------1111--------3333--3333--3333------------------3333- YKNLNTGAIMVPTKENNYQGQNIGGWRNDEFDRLTSQAVLEFDPERRKQLFWRAQEIWAE ------------3333-----1111------------1111------------------- ELPALPLYFRANPYVVRKGLVNYVASAYSGGYGYPGWNAWEIGWESRGAVKKWDQAKYAL ----------------2222-------2222------1111--3333-------3333-- ST -- >HEMOGLOBIN ALPHA SUBUNIT; SWP:P01958; PDB:2D5XA; VLSAADKTNVKAAWSKVGGHAGEYGAEALERMFLGFPTTKTYFPHFDLSHGSAQVKAHGK ----------------!!!!---------------3333---1111--2222-------- KVGDALTLAVGHLDDLPGALSNLSDLHAHKLRVDPVNFKLLSHCLLSTLAVHLPNDFTPA -----------1111------------------3333----------------1111--- VHASLDKFLSSVSTVLTSKYR --------------1111--- >Hemoglobin subunit beta; SWP:P02062; PDB:2D5XB; 
VQLSGEEKAAVLALWDKVNEEEVGGEALGRLLVVYPWTQRFFDSFGDLSNPGAVMGNPKV --------------11113333---------------33331111--------------- KAHGKKVLHSFGEGVHHLDNLKGTFAALSELHCDKLHVDPENFRLLGNVLVVVLARHFGK ----------------1111------------------3333---------------!!! DFTPELQASYQKVVAGVANALAHKYH !--------------------3333- >multiple sugar-binding tr; SWP:O57933; PDB:2D62A; IGAEVKLINIWKRFGDVTAVKDLSLEIKDGEFLVLLGPSGCGKTTTLRIAGLEEPTRGQI -------------!!!!----------2222------2222------------------- YIEDNLVADPEKGVFVPPKERDVAVFQSYALYPHTVYDNIAFPLKLRKVPKQEIDKRVRE -%%%%----1111---3333-----------------------1111------------- VAELGLTELLNRKPRELSGGQRQRVALGRAIIRRPKVFLDEPLSNLDAKLRVKRAELKKL -----1111---1111------------------------1111---------------- QRQLGVTTIYVTHDQVEATGDRIAVNKGELQQVGTPDEVYYKPVNTFVAGFIGSPPNFLD -------------3333--------iiii------------------------------- ATITDDGFLDFGEFKLKLLQDQFEVLEEENVGKEVIFGIRPEDVHDASFTHIDVPEENTV ---1111-------------------1111---------3333--1111----2222--- KATVDIIENLGGEKIVHLRRGNISFTAKFPKESKVREGDEVSVVFDKKIHIFRKDTEKAI -------------------!!!!------1111--2222--------------------- F - >FOP; SWP:O95684; PDB:2D68A; VNESLKKFLNTKDGRLVASLVAEFLQFFNLDFTLAVFQPETSTLEGRENLARDLGIIEAE -------------------------1111-3333-----------------1111---11 GTVGGPLLLEVIRRW 11---3333--1111 >RIBULOSE BISPHOSPHATE CAR; SWP:O58677; PDB:2D69A; MMVLRMKVEWYLDFVDLNYEPGRDELIVEYYFEPNGVSPEEAAGRIASESSIGTWTTLWK ---------3333--1111--1111----------------------------------- LPEMAKRSMAKVFYLEKHGEGYIAKIAYPLTLFEEGSLVQLFSAVAGNVFGMKALKNLRL -1111------------!!!!-------3333-2222---------3333-1111----- LDFHPPYEYLRHFKGPQFGVQGIREFMGVKDRPLTATVPKPKMGWSVEEYAEIAYELWSG -----33331111--------------------------------------------111 GIDLLKDDENFTSFPFNRFEERVRKLYRVRDRVEAETGETKEYLINITGPVNIMEKRAEM 1------1111--1111------------------------------------------- VANEGGQYVMIDIVVAGWSALQYMREVTEDLGLAIHAHRAMHAAFTRNPRHGITMLALAK -1111------3333-----------------------2222-----1111--------- AARMIGVDQIHTGTAVGKMAGNYEEIKRINDFLLSKWEHIRPVFPVASGGLHPGLMPELI 
------------------------------------!!!!-----------3333----- RLFGKDLVIQAGGGVMGHPDGPRAGAKALRDAIDAAIEGVDLDEKAKSSPELKKSLREVG -----------3333--1111-------------------33333333-----------1 LSKA 111- >GLUTAMYL-TRNA(GLN) AMIDOT; SWP:O26802; PDB:2D6FA; SYQGRARKFLESASIDVGDMVLVEKPDVTYEGMVLDRADDADDRHIVLKLENGYNIGVEI ---------------2222------------------3333--------3333------- SDARIELLEKGSAAEDPELPDVSIISTGGTVASIIDYRTGAVHPAFTADDLLRANPELLD ------------------------------------------------------3333-- IANIRGRAVFNILSENMKPEYWVETARAVYGEIKDGADGVVVAHGTDTMHYTSAALSFML ------------3333-3333----------3333------------3333--------- RTPVPVVFTGAQRSSDRPSSDASLNIQCSVRAATSEIAEVTVCMHATMDDLSCHLHRGVK -------------3333--------------1111---------------------1111 VRKMHTSRRDTFRSMNALPLAEVTPDGIKILEENYRKRGSDELELSDRVEERVAFIKSYP -----------------------1111----------3333------------------- GISPDIIKWHLDEGYRGIVIEGTGLGHCPDTLIPVIGEAHDMGVPVAMTSQCLNGRVNMN --3333----------------------3333-------1111------3333------- VYSTGRRLLQAGVIPCDDMLPEVAYVKMCWVLGQTDDPEMAREMMRENIAGEINERTSIA --------1111-----------------1111------------------------111 YFRG 1--- >Glutamyl-tRNA(Gln) amidot; SWP:O26803; PDB:2D6FC; MDWEKVGLKMGLEIHQQLDTESKLFCPCRTELTDSEPDHDIVRNLRPTAFEEAMRKLHFH -3333-----------------------------------------------3333---- YENYHEETCLVEADEEPPHPLNPEALEIAVTIALLLNMRVVDEFHTMRKQVIDGSNTGGF --------3333---------------------1111----------------------- QRTGLVATDGHLETPQGTVKIENLCLEEDAARRIRETGDGVVFRLDRLGIPLVEITTDPS -------------1111---------------------------3333------------ MSDPQQLREVAYQIGQILRSTRVKRGLGTIRQDLNISIRDGARVEVKGVQDLDLIPEIVE --3333-------------------------------2222---------3333------ REVKRQLSLVEIRDTLQERGAVVEDKIFDVSEVFADTESRIISSAESVLAVKLRGFDGLI ----------------1111----------3333----3333----------2222--11 GVEIQPGRRLGTEMADYAKKRGVSGIFHTDELPAYGITEEEVRGLRDAVGASQGDAVVMV 11------3333---------------------%%%%3333------------------- AHERVTAENALREVIRRAEMAIQGVPEETRKALPDGNTQYLRPLPTSSRMYLETDIPLFR 
--3333------------3333------------------------------3333---- IEDDLLEGIRRNLPELPSEKKERIMRDYGLSEDLASQLVKRNLVDEFDTTVIASLLAYTL -3333-------------------------------------3333------1111---- RELRR 3333- >LECTIN, GALACTOSE BINDING; SWP:NA; PDB:2D6MA; SALFSAQSPYINPIIPFTGPIQGGLQEGLQVTLQGTTKSFAQRFVVNFQNSFNGNDIAFH --------------------2222-2222------------------------------- FNPRFEEGGYVVCNTKQNGQWGPEERKMQMPFQKGMPFELCFLVQRSEFKVMVNKKFFVQ -----iiii-------iiii------------2222--------1111----iiii---- YQHRVPYHLVDTIAVSGCLKLSFITFQTQ -----3333-------------------- >PUTATIVE TETR FAMILY REGU; SWP:Q9ADP7; PDB:2D6YA; EATKARIFEAAVAEFARHGIAGARIDRIAAEARANKQLIYAYYGNKGELFASVLEKKLDL -------------------1111-----------3333---------------------- AISVPVDPDDIEGWIDRLLDYHAAHPELLRLLFWEGEYGTAELPHEAERQEHYARKVAAV ------1111---------------------------!!!!------------------- RDGQERGVITDAIPAPDLLFLLVAANWAVVVPQKRILVGGGDAGTDGLRDSIKKAARRIV -------------------------3333------------------------------- DR -- >ALPHA-GLUCOSIDASE SUSB; SWP:P71094; PDB:2D73A; QQKLTSPDNNLVMTFQVDSKGAPTYELTYKNKVVIKPSTLGLELKKEDNTRTDFDWVDRR -----1111--------1111-------%%%%--------------------1111---- DLTKLDSKTNLYDGFEVKDTQTATFDETWQPVWGEEKEIRNHYNELAVTLYQPMNDRSIV 11113333-------------------------------------------3333----- IRFRLFNDGLGFRYEFPQQKSLNYFVIKEEHSQFGMNGDHIAFWIPGDYDTQEYDYTISR ------------------1111-------------------------------------3 LSEIRGLMKEAITPNSSQTPFSQTGVQTALMMKTDDGLYINLHEAALVDYSCMHLNLDDK 33333331111------------------------------------------------- NMVFESWLTPDAKGDKGYMQTPCNTPWRTIIVSDDARNILASRITLNLNEPCKIADAASW ----------1111--------------------333311113333-------1111--- VKPVKYIGVWWDMITGKGSWAYTDELTSVKLGETDYSKTKPNGKHSANTANVKRYIDFAA -----------------------------2222-3333---------------------1 AHGFDAVLVEGWNEGWEDWFGNSKDYVFDFVTPYPDFDVKEIHRYAARKGIKMMMHHETS 111-----------3333---------------1111---------1111--------%% ASVRNYERHMDKAYQFMADNGYNSVKSGYVGNIIPRGEHHYGQWMNNHYLYAVKKAADYK %%------------------------------------1111--------------1111 
IMVNAHEATRPTGICRTYPNLIGNESARGTEYESFGGNKVYHTTILPFTRLVGGPMDYTP -----------------1111----------1111---1111--33333333-------- GIFETHCNKMNPANNSQVRSTIARQLALYVTMYSPLQMAADIPENYERFMDAFQFIKDVA ------33331111---------------------------3333--------------- LDWDETNYLEAEPGEYITIARKAKDTDDWYVGCTAGENGHTSKLVFDFLTPGKQYIATVY -----------2222-------2222-----------------------2222------- ADAKDADWKENPQAYTIKKGILTNKSKLNLHAANGGGYAISIKEVKDKSEAKGLKRL --2222--------------------------2222----------11112222--- >Translation initiation fa; SWP:Q8U3I5; PDB:2D74B; IDYYDYEKLLEKAYQELPENVKHHKSRFEVPGALVTIEGNKTIIENFKDIADALNRDPQH -1111-33331111-------------------------------------------111 LLKFLLREIATAGTLEGRRVVLQGRFTPYLIANKLKKYIKEYVICPVCGSPDTKIIKRDR 1-------------------------3333-------------------1111------- FHFLKCEACGAETPIQH ----------------- >Rab11 family-interacting ; SWP:O75154; PDB:2D7CC; VSRDELEAIQKQEEINFRLQDYIDRIIVAIETNPSILEVK -3333---------------------------3333---- >UVRABC SYSTEM PROTEIN B; SWP:P37954; PDB:2D7DA; DRFELVSKYQPQGDQPKAIEKLVKGIQEGKKHQTLLGATGTGKTFTVSNLIKEVNKPTLV -----------!!!!----------------------2222------------------- IAHNKTLAGQLYSEFKEFFPNNAVEYFVSYYDYYQPEAYVPQTDTFIEKDASINDEIDKL ------------------1111------------------1111---------------- RHSATSALFERRDVIIIASVSCIYGLGSPEEYREMVVSLRTEMEIERNELLRKLVDIQYA ------1111----------1111--------1111------------------1111-- RNDIDFQRGTFRVRGDVVEIFPASRDEHCVRVEFFGDEIERIREVDALTGEILGDRDHVA -------------!!!!----1111----------------------------------- IFPASHFVTRAEKMEKAIQNIEKELEEQLKVMHENGKLLEAQRLEQRTRYDLEMMREMGF --------------------------------1111------------------------ CSGIENYSRHLTLRPPGSTPYTLLDYFPDDFMIVVDESHVTIPQVRGMFNGDQARKQVLV 2222-----1111-2222---3333-----------3333-------------------1 DHGFRLPSALDNRPLRFEEFEKHMHNIVYVSATPGPYEIEHTDEMVEQIIRPTGLLDPLI 111--3333------3333-1111--------------------------1111------ DVRPIEGQIDDLIGEIQARIERNERVLVTTLTKKMSEDLTDYLKEIGIKVNYLHSEIKTL ----2222-----------1111--------------------1111------1111--- 
ERIEIIRDLRLGKYDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAA -----------------------2222-3333------1111--------------1111 RNAEGRVIMYADKITKSMEIAINETKRRREQQERFNEEHGITPKTINKKERQKVVEQMEH ------------------------------------------------------------ EMKEAAKALDFERAAELRDLL --------------------- >PRIMOSOMAL PROTEIN N'; SWP:P17888; PDB:2D7EA; MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGIVVSVSDASELPLNEL --------------------2222-------------------------------3333- KAVVEVLDSEPVFTHSVWRLLLWAADYYHHPIGDVLFHALPILLR -------------------------1111-3333-------3333 >POLYPEPTIDE N-ACETYLGALAC; SWP:Q86SR1; PDB:2D7IA; GQKLKDWHDKEAIRRDAQRVGNGEQGRPYPMTDAERVDQAYRENGFNIYVSDKISLNRSL ---------------1111-2222-------3333--3333---------11111111-- PDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFS ----3333-------------------------------------3333----------- DREHLKKPLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWL -3333----------3333----------------------------------------3 PPLLDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKAD 333------1111------------------2222-----------------3333---1 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 111-------------------1111--1111----3333---------------1111- VGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLR -----------------3333---------!!!!3333---3333--------------- SSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEG 1111-----------1111----------------------------------------- CVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFDAISHTSPVTLYDCHSMKGNQLW -iiii-3333--------1111-----1111----------------------------- KYRKDKTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHTNSTVLEKFNRN --1111---------------------------------------3333------- >WD REPEAT AND HMG-BOX DNA; SWP:O75717; PDB:2D7LA; GSSGSSGRPKTGFQMWLEENRSNILSDNPDFSDEADIIKEGMIRFRVLSTEERKVWANKA --------------------------------3333-------3333-3333------33 KGETASEGTEAKKRKSGPSSG 33---------%%%%------ >FILAMIN-C; SWP:Q14315; PDB:2D7MA; 
GSSGSSGAHDASKVRASGPGLNASGIPASLPVEFTIDARDAGEGLLTVQILDPEGKPKKA ---------3333----33333333----------------------------------- NIRDNGDGTYTVSYLPDMSGRYTITIKYGGDEIPYSPFRIHALPTGDASSGPSSG ------------------------------------------------------- >FILAMIN-C; SWP:Q14315; PDB:2D7NA; GSSGSSGLRPFNLVIPFAVQKGELTGEVRMPSGKTARPNITDNKDGTITVRYAPTEKGLH -----------------------------3333--------------------------- QMGIKYDGNHIPGSPLQFYVDAINSRHSGPSSG --------------------------------- >FILAMIN-C; SWP:Q14315; PDB:2D7OA; GSSGSSGAINSRHVSAYGPGLSHGMVNKPATFTIVTKDAGEGGLSLAVEGPSKAEITCKD -----------------3333--------------------------------------- NKDGTCTVSYLPTAPGDYSIIVRFDDKHIPGSPFTAKITGDDSMRSGPSSG -----------------------%%%%------------------------ >FILAMIN-C; SWP:Q14315; PDB:2D7PA; GSSGSSGSDDARRLTVTSLQETGLKVNQPASFAVQLNGARGVIDARVHTPSGAVEECYVS --------3333------------------------------------------------ ELDSDKHTIRFIPHENGVHSIDVKFNGAHIPGSPFKIRVGEQSQAGSGPSSG --%%%%------------------%%%%------------------------ >FILAMIN-C; SWP:Q14315; PDB:2D7QA; GSSGSSGAGDPGLVSAYGPGLEGGTTGVSSEFIVNTLNAGSGALSVTIDGPSKVQLDCRE ---------3333----3333--------------3333--------------------- CPEGHVVTYTPMAPGNYLIAIKYGGPQHIVGSPFKAKVTGPRLSGSGPSSG --------------------------------------------------- >anti polyhydroxybutyrate ; SWP:NA; PDB:2D7TH; QVQLVQSGAEVKKPGASVKVSCKASGYTFTGNYMHWVRQAPGQGLEYMGWINPKSGDTNY ------------2222-----------1111--------2222----------------- AQKFQGRVTMTRDTSISTVYMEVRRLRSDDTAVYYCATGWWGMDVWGQGTLVTVSS 3333----------------------3333-------------------------- >anti polyhydroxybutyrate ; SWP:NA; PDB:2D7TL; DIVMTQSPSSLSASVGDRVTITCRASQNINNYLHWYQHEPGKAPKLLIYAASNLQGGVTS -------------2222---------------------2222------------2222-- RFSGSGSGTDFTLTISTLQPEDFATYYCLQTHAYPLTFGGGTKVDIKRAA ------------------1111---------------------------- >ADENYLOSUCCINATE SYNTHETA; SWP:O58187; PDB:2D7UA; MPSVIVVGGQWGDEGKGSIVAYLSLHDEPEIIARGGVGTNAGHSVVINGKKYAVRQIPTG ----------------------------------------------iiii---------3 
FMQTKARLLIGAGVLVDPEVFFHELEQLKDFNVKDRVGIDYRCAIIEEKHKQSGCGPANA 333-------1111--3333-------33333333-----------3333---------- DRVMRKAKQAKDVKELEPYLTDVAQEINDALDEGSLVLVEGTQGFGLSLYYGTYPYVTSK 3333----111133331111-3333------------------3333------------- DVTASSVAADVGIGPTRVDEVIVVFKSFPTRVGAGPFPTEMPMEEADRLGLVEYGTVTGR -------------1111-------------------1111------3333---------- RRRVGWFDFEMARYSARINGATMLAVTMLDKYDKEAFGVTDYDKLPRKAKEFIEEIEERV ----------------1111---------3333--2222-3333---------------- GVPVGLIKTGPELEHIIDRRD -----------3333------ >HYPOTHETICAL PROTEIN VCA0; SWP:Q9KMK9; PDB:2D7VA; SSEHSAIVTWKRKDSEAFTDNQYSRAHTWEFDGGSKILASASPHVVPVPLSVEANVDPEE ------------1111-1111---------1111-------1111------3333----- AFVAALSSCHLVFLSIAAKQRYLVESYTDNAVGILGKNSKGKTSVTKVVLRPQVVFSGTS -------------------------------------1111------------------- KPTLQQLEKHHLAHENCFIANSVETEVVTEII -----------------3333----------- >PHB DEPOLYMERASE; SWP:Q4W9V8; PDB:2D81A; TALPAFNVNPNSVSVSGLASGGYMAAQLGVAYSDVFNVGFGVFAGGPYDCARNQYYTSCM --------1111-----------------------3333-------2222-----3333% YNGYPSITTPTANMKSWSGNQIASVANLGQRKIYMWTGSSDTTVGPNVMNQLKAQLGNFD %%%-------------2222---3333----------1111---3333-------3333- NSANVSYVTTTGAVHTFPTDFNGAGDNSCSLSTSPYISNCNYDGAGAALKWIYGSLNARN 3333------------------2222-1111----------------------------- TGTLSGSVLSFAQSGSYGANGMDTTGYLYVPQSCASGATVCSLHVALHGCLQSYSSIGSR -------------!!!!-2222--------3333--------------22223333!!!! 
FIQNTGYNKWADTNNMIILYPQAIPDYTIHAIWNGGVLSNPNGCWDWVGWYGSNADQIGG -----33333333-----------------------------------1111-1111--- VQMAAIVGQVKQIVSGFQ -------------1111- >L-PLASTIN; SWP:P13796; PDB:2D85A; GSSGSSGNDDIIVNWVNETLREAEKSSSISSFKDPKISTSLPVLDLIDAIQPGSINYDLL -------1111---------1111----------------3333-----------3333- KTENLNDDEKLNNAKYAISMARKIGARVYALPEDLVEVNPKMVMTVFACLMGKGMKRVSG -------3333---------------------------3333--------3333------ PSSG ---- >VAV-3 PROTEIN; SWP:Q9UKW4; PDB:2D86A; GSSGSSGMEPWKQCAQWLIHCKVLPTNHRVTWDSAQVFDLAQTLRDGVLLCQLLNNLRAH --------11113333--------11111111-----33331111---3333-------- SINLKEINLRPQMSQFLCLKNIRTFLTACCETFGMRKSELFEAFDLFDVRDFGKVIETLS --3333----%%%%3333--------------------------------3333--3333 RLSRTPIALATGIRPFPSGPSSG 3333---3333------------ >SMOOTHELIN SPLICE ISOFORM; SWP:P53814; PDB:2D87A; GSSGSSGIKQMLLDWCRAKTRGYEHVDIQNFSSSWSDGMAFCALVHNFFPEAFDYGQLSP ------3333------1111------------------------33333333--333333 QNRRQNFEVAFSSAETHADCPQLLDTEDMVRLREPDWKCVYTYIQEFYRCLVQKGLVKTK 33-3333----------------------------3333--------------------- KSSGPSSG -------- >PROTEIN MICAL-3; SWP:Q7RTP6; PDB:2D88A; GSSGSSGVARSSKLLGWCQRQTDGYAGVNVTDLTMSWKSGLALCAIIHRYRPDLIDFDSL ---------------------------------3333----3333------33333333- DEQNVEKNNQLAFDIAEKELGISPIMTGKEMASVGEPDKLSMVMYLTQFYEMFKDSGPSS 3333------3333-----------------------------------1111------- G - >EHBP1 PROTEIN; SWP:Q8NDI1; PDB:2D89A; GSSGSSGPNASQSLLVWCKEVTKNYRGVKITNFTTSWRNGLSFCAILHHFRPDLIDYKSL -------------------1111-------------33333333------1111-3333- NPQDIKENNKKAYDGFASIGISRLLEPSDMVLLAIPDKLTVMTYLYQIRAHFSSGPSSG ------3333-----3333-------3333------3333------------------- >PROBABLE L-THREONINE 3-DE; SWP:O58389; PDB:2D8AA; EKVAIKTKPGYGAELVEVDVPKPGPGEVLIKVLATSICGTDLHIYEWNEWAQSRIKPPQI -----------------------2222--------------------3333--------- GHEVAGEVVEIGPGVEGIEVGDYVSVETHIVCGKCYTKIFGVDTDGVFAEYAVVPAQNIW ------------------2222----------------2222------------3333-- 
KNPKSIPPEYATLQEPLGNAVDTVLAGPISGKSVLITGAGPLGLLGIAVAKASGAYPVIV --3333333311113333----------2222---------------------------- SEPSDFRRELAKKVGADYVINPFEEDVVKEVDITDGNGVDVFLEFSGAPKALEQGLQAVT --------------------1111-3333--1111------------------------2 PAGRVSLLGLYPGKVTIDFNNLIIFKALTIYGITGRHLWETWYTVSRLLQSGKLNLDPII 222--------------3333--1111---------------------3333---3333- THKYKGFDKYEEAFELRAGKTGKVVFL --------------------------- >TWINFILIN-1; SWP:Q91YR1; PDB:2D8BA; GSSGSSGEVQTDVSVDTKHQTLQGVAFPISRDAFQALEKLSKKQLNYVQLEIDIKNETII --------------------------------------3333------------------ LANTENTELRDLPKRIPKDSARYHFFLYKHSHEGDYLESVVFIYSMPGYTCSIRERMLYS -------33331111----------------iiii----------------3333----- SCKSPLLEIVERQLQMDVIRKIEIDNGDELTADFLYDEVHSGPSSG --1111--------------------33331111---3333----- >Phosphatidylcholine:ceram; SWP:Q8VCQ6; PDB:2D8CA; GSSGSSGMLSARTMKEVVYWSPKKVADWLLENAMPEYCEPLEHFTGQDLINLTQEDFKKP --------------------1111-3333111133333333-----------3333---- PLYRVSSDNGQRLLDMIETLKMEHHMEAHKNSGPSSG -------%%%%---------3333------------- >phospho-2-dehydro-3-deoxy; SWP:Q72LN8; PDB:2D8DA; ERIQALRKEVDRVNREILRLLSERGRLVQEIGRLQTELGLPHYDPKREEEMLAYLTAENP ------------------------------------------------------------ GPFPDETIRKLFKEIFKASL ----------------1111 >SH3YL1 PROTEIN; SWP:Q9Y3V5; PDB:2D8HA; GSSGSSGHERVGNLNQPIEVTALYSFEGQQPGDLNFQAGDRITVISKTDSHFDWWEGKLR -----------------------------1111-------------------------%% GQTGIFPANYVTMNSGPSSG %%----3333---------- >T-cell lymphoma invasion ; SWP:Q59GK8; PDB:2D8IA; GSSGSSGEIEICPKVTQSIHIEKSDTAADTYGFSLSSVEEDGIRRLYVNSVKETGLASKK ------------------------3333-----------iiii----------------- GLKAGDEILEINNRAADALNSSMLKDFLSQPSLGLLVRTYPELEEGVESGPSSG ----------%%%%3333------------------------------------ >FYN-RELATED KINASE; SWP:Q8BPC1; PDB:2D8JA; GSSGSSGQYFVALFDYQARTAEDLSFRAGDKLQVLDTSHEGWWLARHLEKKGTGLGQQLQ -------------------3333---2222------------------------------ GYIPSNYVAEDSGPSSG ---3333---------- >SYNAPTOTAGMIN VII; SWP:Q62747; PDB:2D8KA; 
GSSGSSGSRENLGRIQFSVGYNFQESTLTVKIMKAQELPAKDFSGTSDPFVKIYLLPDKK -----------------------------------------3333--------------- HKLETKVKRKNLNPHWNETFLFEGFPYEKVVQRILYLQVLDYDRFSRNDPIGEVSIPLNK -------------------------33333333-----------------------3333 VDLTQMQTFWKDLKPSGPSSG -3333---------------- >DNA-REPAIR PROTEIN XRCC1; SWP:P18887; PDB:2D8MA; GSSGSSGEPRRPRAGPEELGKILQGVVVVLSGFQNPFRSELRDKALELGAKYRPDWTRDS ---------------3333---2222------------------------------1111 THLICAFANTPKYSQVLGLGGRIVRKEWVLDCHRMRRRLPSQRYLMAGPGSSSEEDEASH ---------1111--------------------------3333----------------- SGGSGPSSG --------- >RECOVERIN; SWP:P35243; PDB:2D8NA; GNSKSGALSKEILEELQLNTKFSEEELCSWYQSFLKDCPTGRITQQQFQSIYAKFFPDTD ---1111------------------------------1111--------------1111- PKAYAQHVFRSFDSNLDGTLDFKEYVIALHTTAGKTNQKLEWAFSLYDVDGNGTISKNEV ------------1111----------------------3333-----3333--------- LEIVAIFKITPEDVKLLPDDENTPEKRAEKIWKYFGKNDDDKLTEKEFIEGTLANKEILR -------------11111111----------------1111------------------- LIQFEPQKVKEK ------------ >ZINC FINGER MYND DOMAIN C; SWP:O75800; PDB:2D8QA; GSSGSSGLEAVAPERPRCAYCSAEASKRCSRCQNEWYCCRECQVKHWEKHGKTCVLAAQG ------------------------------------------------3333-------- DRAKSGPSSG ---------- >THAP DOMAIN-CONTAINING PR; SWP:Q9H0W7; PDB:2D8RA; GSSGSSGMPTNCAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLC ------------------------------------------------------------ SKHFEASCFDLTGQTRRLKMDAVPTIFDFCTHISGPSSG -----1111---------1111----------------- >CELLULAR MODULATOR OF IMM; SWP:Q5T0T0; PDB:2D8SA; GSSGSSGTSITPSSQDICRICHCEGDDESPLITPCHCTGSLHFVHQACLQQWIKSSDTRC -------------------------3333-----------------3333---------- CELCKYEFIMETKLSGPSSG -------------------- >RING FINGER PROTEIN 146; SWP:Q9NTX7; PDB:2D8TA; GSSGSSGNTAPSLTVPECAICLQTCVHPVSLPCKHVFCYLCVKGASWLGKRCALCRQEIP -------------------------------------3333------------------3 EDFLDSGPSSG 333-------- >UBIQUITIN LIGASE TRIM63; SWP:Q969Q1; PDB:2D8UA; GSSGSSGHPMCKEHEDEKINIYCLTCEVPTCSMCKVFGIHKACEVAPLQSVFQGQKTESG 
----------3333-----------------3333-----------3333---------- PSSG ---- >ZINC FINGER FYVE DOMAIN-C; SWP:Q9DAZ9; PDB:2D8VA; GSSGSSGLPWCCICNEDATLRCAGCDGDLYCARCFREGHDNFDLKEHQTSPYHPRRPCQE ----------------------1111--------3333----3333-------------- HSGPSSG ------- >PROTEIN PINCH; SWP:P48059; PDB:2D8XA; GSSGSSGCHQCGEFIIGRVIKAMNNSWHPECFRCDLCQEVLADIGFVKNAGRHLCRPCHN ---------------------%%%%-----------------------%%%%-------- REKASGPSSG ---------- >EPLIN PROTEIN; SWP:Q53GG0; PDB:2D8YA; GSSGSSGMKFQAPARETCVECQKTVYPMERLLANQQVFHISCFRCSYCNNKLSLGTYASL --------------------------------%%%%------------------------ HGRIYCKPHFNQLFKSKGNYDEGFGSGPSSG ------------------------------- >FOUR AND A HALF LIM DOMAI; SWP:Q4G0X1; PDB:2D8ZA; GSSGSSGCVQCKKPITTGGVTYREQPWHKECFVCTACRKQLSGQRFTARDDFAYCLNCFC ---------------------%%%%--1111-------------------------3333 DLYASGPSSG ---------- >PDZ DOMAIN CONTAINING PRO; SWP:Q9JIL4; PDB:2D90A; GSSGSSGRVVVIKKGSNGYGFYLRAGPEQKGQIIKDIEPGSPAEAAGLKNNDLVVAVNGK -------------------------------------------3333---------iiii SVEALDHDGVVEMIRKGGDQTTLLVLDKEAESIYSLSGPSSG -1111--------3333------------------------- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2D92A; GSSGSSGELALWSPEVKIVELVKDCKGLGFSILDYQDPLDPTRSVIVIRSLVADGVAERS ---------------------------------------1111----------------- GGLLPGDRLVSVNEYCLDNTSLAEAVEILKAVPPGLVHLGICSGPSSG -----------%%%%--------------------------------- >RAP GUANINE NUCLEOTIDE EX; SWP:Q8TEU7; PDB:2D93A; GSSGSSGDDDIEQLLEFMHQLPAFANMTMSVRRELCSVMIFEVVEQAGAIILEDGQELDS ----------------3333-1111--3333---1111---------------------- WYVILNGTVEISHPDGKVENLFMGNSFGITPTLDKQYMHGIVRTKVDDCQFVCIAQQDYW ------------3333----------------------------------------3333 RILNHVEKSGPSSG -------------- >NUCLEAR FACTOR NF-KAPPA-B; SWP:Q00653; PDB:2D96A; GSSGSSGPGLSLGDTALQNLEQLLDGPEAQGSWAELAERLGLRSLVDTYRQTTSPSGSLL ---3333-----3333---------3333---3333-33331111---3333---3333- RSYELAGGDLAGLLEALSDMGLEEGVRLLRGPETRDKLPSTEVSGPSSG ---3333------------------------------------------ >General 
transcription fac; SWP:Q9UHL9; PDB:2D99A; GSSGSSGLRKMVEEVFDVLYSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAE ------3333----------------------333333333333---------------- YDPKALMAILEHSHRIRFKLKRPSSGPSSG -----------3333--------------- >MYB-RELATED PROTEIN B; SWP:P48972; PDB:2D9AA; GSSGSSGKVKWTHEEDEQLRALVRQFGQQDWKFLASHFPNRTDQQCQYRWLRVLSGPSSG --------------------------333333333333---------------------- >GENERAL TRANSCRIPTION FAC; SWP:P78347; PDB:2D9BA; GSSGSSGMSVDAVEIETLRKTVEDYFCFCYGKALGKSTVVPVPYEKMLRDQSAVVVQGLP --------1111------------------------------------------------ EGVAFKHPENYDLATLKWILENKAGISFIIKRPFLEPKKHVGGSGPSSG ------1111------------3333----------------------- >SIGNAL-REGULATORY PROTEIN; SWP:O00241; PDB:2D9CA; GSSGSSGELQVIQPEKSVSVAAGESATLRCAMTSLIPVGPIMWFRGAGAGRELIYNQKEG -------------------------------------------------------3333- HFPRVTTVSELTKRNNLDFSISISNITPADAGTYYCVKFRKGSPDDVEFKSGAGTELSVR -3333------------------------------------------------------- AKPSAPVVSGSGPSSG ---------------- >BAG FAMILY MOLECULAR CHAP; SWP:Q9UL15; PDB:2D9DA; GSSGSSGSILKIEKVLKRMREIKNELLQAQNPSELYLSSKTELQGLIGQLDEVSLEKNPC --------3333-----------------------------------3333--------- IREARRRAVIEVQTLITYIDLKESGPSSG ------------------3333------- >PEREGRIN; SWP:Q99JV4; PDB:2D9EA; GSSGSSGFLILLRKTLEQLQEKDTGNIFSEPVPLSEVPDYLDHIKKPMDFFTMKQNLEAY -----3333---------33331111-------1111--3333-------------1111 RYLNFDDFEEDFNLIVSNCLKYNAKDTIFYRAAVRLREQGGAVLRQARRQAEKMGSGPSS ---3333----------------3333------------------------3333----- G - >YY1-ASSOCIATED FACTOR 2; SWP:Q8IY57; PDB:2D9GA; GSSGSSGDEGYWDCSVCTFRNSAEAFKCMMCDVRKGTSTRKPRPVSQSGPSSG ---------------------3333---------------------------- >ZINC FINGER PROTEIN 692; SWP:Q59EV5; PDB:2D9HA; GSSGSSGLQCEICGFTCRQKASLNWHQRKHAETVAALRFPCEFCGKRFEKPDSVAAHRSK ---------------------------33333333------------------------- SHPALLLAPQESSGPSSG -1111------------- >NEDD4-BINDING PROTEIN 2; SWP:Q86UW6; PDB:2D9IA; GSSGSSGQNVLDLHGLHVDEALEHLMRVLEKKTEEFKQNGGKPYLSVITGRGNHSQGGVA 
---------------------------------------------------3333----- RIKPAVIKYLISHSFRFSEIKPGCLKVMLKSGPSSG ------------------------------------ >REGULATOR OF G-PROTEIN SI; SWP:P49802; PDB:2D9JA; GSSGSSGSQQRVKRWGFGMDEALKDPVGREQFLKFLESEFSSENLRFWLAVEDLKKRPIK -------------------3333--3333-----------3333---------3333333 EVPSRVQEIWQEFLAPGAPSAINLDSKSYDKTTHNVKEPGRYTFEDAQEHIYKLMKSDSY 3-3333-----------1111---3333------3333---11113333----3333-33 PRFIRSSAYQELLSGPSSG 3333333333--------- >FLN29 GENE PRODUCT; SWP:O14545; PDB:2D9KA; GSSGSSGHEETECPLRLAVCQHCDLELSILKLKEHEDYCGARTELCGNCGRNVLVKDLKT ---------------------------3333-3333-----------------3333--3 HPEVCGREGSGPSSG 333------------ >Zinc finger CCCH-type dom; SWP:Q8IWR0; PDB:2D9MA; GSSGSSGQYCWQHRFPTGYFSICDRYMNGTCPEGNSCKFAHGNAELHEWEERRDALKMKL ------------------------------------------------------3333-- NKASGPSSG --------- >Cleavage and polyadenylat; SWP:O95639; PDB:2D9NA; GSSGSSGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYSKFGECSNKECPFLHI --------------3333----!!!!---------------------------------- DPESKIKDCPWSGPSSG ------------3333- >DNAJ (HSP40) HOMOLOG, SUB; SWP:Q9NVM6; PDB:2D9OA; GSSGSSGQGTPKLKLKWKCKKEDESKGGYSKDVLLRLLQKYGEVLNLVLSSKKPGTAVVE -------------------------------------3333------------------- FATVKAAELAVQNEVGLVDNPLKISWLEGQPQDASGPSSG -------1111-----3333-------------------- >POLYADENYLATE-BINDING PRO; SWP:Q9H361; PDB:2D9PA; GSSGSSGDRITRYQVVNLYVKNLDDGIDDERLRKAFSPFGTITSAKVMMEGGRSKGFGFV -----------------------3333--------3333--------------------- CFSSPEEATKAVTEMNGRIVATKPLYVALAQRKEERQSGPSSG --------------2222------------------------- >Granulocyte colony-stimul; SWP:Q99062; PDB:2D9QB; CGHISVSAPIVHLGDPITASCIIKQNCSHLDPEPQILWRLGAELQPGGRQQRLSDGTQES ------------------------------------------------------------ IITLPHLNHTQAFLSCSLNWGNSLQILDQVELRAGYPPAIPHNLSCLMNLTTSSLICQWE ------------------------------------------------------------ PGPETHLPTSFTLKSFKSRGNCQTQGDSILDCVPKDGQSHCSIPRKHLLLYQNMGIWVQA 
------------------!!!!---------------------3333------------- ENALGTSMSPQLCLDPMDVVKLEPPMLRTMDPQAGCLQLSWEPWQPGLHINQKCELRHKP --------------3333-------------------------3333------------- QRGEASWALVGPLPLEALQYELCGLLPATAYTLQIRCIRWPLPGHWSDWSPSLELRTTE ----------------------------------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q7MXL1; PDB:2D9RA; SPIEFDAIIRQVPDDAAYVEIPFDVKTVYGKGRVRVNATFDGYPYTGYIVRGLPCHILGL -----------------------3333------------iiii----------------- RQDIRRAIGKQPGDSVYVTLLPL ----------2222--------- >CBL E3 UBIQUITIN PROTEIN ; SWP:CBL_MOUSE; PDB:2D9SA; GSSGSSGQLSSEIERLMSQGYSYQDIQKALVIAHNNIEMAKNILREFSGPSSG --------3333--------------------%%%%----------------- >TUDOR DOMAIN-CONTAINING P; SWP:Q91W18; PDB:2D9TA; GSSGSSGKVWKPGDECFALYWEDNKFYRAEVEALHSSGMTAVVKFTDYGNYEEVLLSNIK ----------------------------------3333----------------1111-- PVQTEAWVRDPNSGPSSG ------------------ >CHROMOBOX PROTEIN HOMOLOG; SWP:Q14781; PDB:2D9UA; GSSGSSGEQVFAAECILSKRLRKGKLEYLVKWRGWSSKHNSWEPEENILDPRLLLAFQKK ---------------------%%%%----------3333----3333------------3 EHEKEVQNSGPSSG 333----------- >Pleckstrin homology domai; SWP:Q9QYE9; PDB:2D9VA; GSSGSSGLVRGGWLWRQSSILRRWKRNWFALWLDGTLGYYHDETAQDEEDRVVIHFNVRD %%%%--------------1111---------3333------------------------- IKVGQECQDVQPPEGRSRDGLLTVNLREGSRLHLCAETRDDAIAWKTALMEANSTPAPAG ---3333---------1111-----3333------------------------------- ATVPSGPSSG ---------- >DOCKING PROTEIN 2; SWP:O60496; PDB:2D9WA; GSSGSSGMGDGAVKQGFLYLQQQQTFGKKWRRFGASLYGGSDCALARLELQEGPEKPRRC ------------------------------------------------------------ EAARKVIRLSDCLRVAEAGGEASSPRDTSAFFLETKERLYLLAAPAAERGDWVQAICLLA -------3333---------------------------------3333------------ FSGPSSG ------- >OXYSTEROL BINDING PROTEIN; SWP:Q9BXB4; PDB:2D9XA; GSSGSSGENVYGYLMKYTNLVTGWQYRFFVLNNEAGLLEYFVNEQSRNQKPRGTLQLAGA -------------------3333-------------------3333----------2222 VISPSDEDSHTFTVNAASGEQYKLRATDAKERQHWVSRLQICTQHHTEAIGKNNSGPSSG 
---------------3333--------3333--------------3333----------- >Pleckstrin homology domai; SWP:Q9Y2H5; PDB:2D9YA; GSSGSSGNAPVTKAGWLFKQASSGVKQWNKRWFVLVDRCLFYYKDEKEESILGSIPLLSF --------------------------------------------3333------------ RVAAVQPSDNISRKHTFKAEHAGVRTYFFSAESPEEQEAWIQAMGEAARVQSGPSSG -----1111-----------------------3333--------3333--------- >PROTEIN KINASE C, NU TYPE; SWP:O94806; PDB:2D9ZA; GSSGSSGMVKEGWMVHYTSRDNLRKRHYWRLDSKCLTLFQNESGSKYYKEIPLSEILRIS -----------------3333------------------------------3333----- SPRDFTNISQGSNPHCFEIITDTMVYFVGENNGDSSHNPVLAATGVGLDVAQSWEKAIRQ ----------------------------------------3333---------------- ALMSGPSSG --------- >130-kDa phosphatidylinosi; SWP:Q9ULH1; PDB:2DA0A; GSSGSSGYGSEKKGYLLKKSDGIRKVWQRRKCSVKNGILTISHATSNRQPAKLNLLTCQV ----------------------------------%%%%---------------3333--- KPNAEDKKSFDLISHNRTYHFQAEDEQDYVAWISVLTNSKEEALTMAFSGPSSG ------------------------3333---------------1111------- >ALPHA-FETOPROTEIN ENHANCE; SWP:Q15911; PDB:2DA1A; GSSGSSGKRPRTRITDDQLRVLRQYFDINNSPSEEQIKEMADKSGLPQKVIKHWFRNTLF --------------3333-----3333-----3333----------3333---------- KERQSGPSSG ---------- >ALPHA-FETOPROTEIN ENHANCE; SWP:Q15911; PDB:2DA2A; GSSGSSGRSSRTRFTDYQLRVLQDFFDANAYPKDDEFEQLSNLLNLPTRVIVVWFQNARQ --------------------------------3333----------3333---------3 KARKSGPSSG 333------- >ALPHA-FETOPROTEIN ENHANCE; SWP:Q15911; PDB:2DA3A; GSSGSSGGTGGEEPQRDKRLRTTITPEQLEILYQKYLLDSNPTRKMLDHIAHEVGLKKRV ------------------------1111-------3333--------------------- VQVWFQNTRARERKSGPSSG -------------------- >HYPOTHETICAL PROTEIN DKFZ; SWP:Q96NK7; PDB:2DA4A; GSSGSSGALQDRTQFSDRDLATLKKYWDNGMTSLGSVCREKIEAVATELNVDCEIVRTWI --------------------------1111----3333-------------3333----- GNRRRKYRLMGIEVSGPSSG ------3333---------- >ZINC FINGERS AND HOMEOBOX; SWP:Q9H4I2; PDB:2DA5A; GSSGSSGPTKYKERAPEQLRALESSFAQNPLPLDEELDRLRSETKMTRREIDSWFSERRK --------------3333------3333-------------------------------3 KVNAEETKKSGPSSG 333------------ >HEPATOCYTE NUCLEAR 
FACTOR; SWP:P35680; PDB:2DA6A; GSSGSSGRNRFKWGPASQQILYQAYDRQKNPSKEEREALVEECNRAECLQRGVSPSKAHG ----------------3333-----------3333------------------1111333 LGSNLVTEVRVYNWFANRRKEEAFRQKLAMDAYSSNSGPSSG 31111-3333----------1111-1111------------- >ZINC FINGER HOMEOBOX PROT; SWP:O60315; PDB:2DA7A; GSSGSSGSPINPYKDHMSVLKAYYAMNMEPNSDELLKISIAVGLPQEFVKEWFEQRKVYQ ------------3333------3333----3333----------3333------------ YSNSRSGPSSG ----------- >SH3-DOMAIN KINASE BINDING; SWP:Q8R550; PDB:2DA9A; GSSGSSGDYCKVIFPYEAQNDDELTIKEGDIVTLINKDCIDVGWWEGELNGRRGVFPDNF -------------------3333-------------------------%%%%----3333 VKLLSGPSSG ---------- >ABSENT IN MELANOMA 1 PROT; SWP:Q9Y4K1; PDB:2DADA; GSSGSSGQIHLFSEPQFQGHSQSFEETTSQIDDSFSTKSCRVSGGSWVVYDGENFTGNQY --------------%%%%-------------%%%%-----------------%%%%---- VLEEGHYPCLSAMGCPPGATFKSLRFISGPSSG ---------3333-------------------- >KIAA0733 PROTEIN; SWP:NA; PDB:2DAEA; GSSGSSGQIDFQVLHDLRQKFPEVPEVVVSRCMLQNNNNLDACCAVLSQESTRYLYGEGD ---------3333-------3333-------3333----3333--------1111----- LNFSDDSGISGPSSG --------------- >FLJ35834 PROTEIN; SWP:Q8NA54; PDB:2DAFA; GSSGSSGQESVEDSLATVKVVLIPVGQEIVIPFKVDTILKYLKDHFSHLLGIPHSVLQIR ---------------------------------1111--3333---------3333---- YSGKILKNNETLVQHGVKPQEIVQVEIFSTNPDLYPVRRIDGLTDVSQIITVSGPSSG %%%%--11113333----------------3333------------------------ >UBIQUITIN CARBOXYL-TERMIN; SWP:P45974; PDB:2DAGA; GSSGSSGLDESVIIQLVEMGFPMDACRKAVYYTGNSGAEAAMNWVMSHMDDPDFANPLIL --------3333---3333---------------------------33331111------ PGSSGPGSSGPSSG -------------- >UBIQUILIN-3; SWP:Q9H347; PDB:2DAHA; GSSGSSGHFQVQLEQLRSMGFLNREANLQALIATGGDVDAAVEKLRQSSGPSSG ---------3333----------------------------------------- >UBIQUITIN ASSOCIATED DOMA; SWP:Q9BSL1; PDB:2DAIA; GSSGSSGDAVELFKKANAMLDEDEDERVDEAALRQLTEMGFPENRATKALQLNHMSVPQA ---------------------------------------------------------333 MEWLIEHAEDPTIDTPLSGPSSG 3----3333-3333--------- >KIAA0977 PROTEIN; SWP:Q53SF7; PDB:2DAJA; 
GSSGSSGEKTVRVVINFKKTQKTIVRVSPHASLQELAPIICSKCEFDPLHTLLLKDYQSQ -------------------------------3333------1111-3333---------- EPLDLTKSLNDLGLRELYAMDVNRESGPSSG ---33333333-------------------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P45974; PDB:2DAKA; GSSGSSGPPEDCVTTIVSMGFSRDQALKALRATNNSLERAVDWIFSHIDDLDAEAAMSGP ----------------3333-------------------------3333----------- SSG --- >PROTEIN KIAA0794; SWP:O94888; PDB:2DALA; GSSGSSGGSAASSALKGLIQQFTTITGASESVGKHMLEACNNNLEMAVTMFLDGGGSGPS -----------3333--------------------3333%%%%----------------- SG -- >ETEA PROTEIN; SWP:NA; PDB:2DAMA; GSSGSSGAPEERDLTQEQTEKLLQFQDLTGIESMDQCRHTLEQHNWNIEAAVQDRLNEQE --------------------------------3333----------3333---------- GSGPSSG ------- >TRANSCRIPTION FACTOR ETV6; SWP:P41212; PDB:2DAOA; GSSGSSGCRLLWDYVYQLLSDSRYENFIRWEDKESKIFRIVDPNGLARLWGNHKNRTNMT ----------3333-3333-33333333---3333------3333------1111----- YEKMSRALRHYYKLNIIRKEPGQRLLFRFMKTPDEIMSGRTDRLEHLESQELSGPSSG ----------3333-----------------3333-------3333------------ >WHSC1L1 PROTEIN, ISOFORM ; SWP:Q9BZ95; PDB:2DAQA; GSSGSSGKLHYKQIVWVKLGNYRWWPAEICNPRSVPLNIQGLKHDLGDFPVFFFGSHDYY ------------------------------1111-3333--------------------- WVHQGRVFPYVEGDKSFAEGQTSINKTFKKALEEAAKRFQELKASGPSSG -------------------------------------------------- >PDZ AND LIM DOMAIN PROTEI; SWP:Q96HC4; PDB:2DARA; GSSGSSGDQDTLVQRAEHIPAGKRTPMCAHCNQVIRGPFLVALGKSWHPEEFNCAHCKNT -----------------------------------------%%%%--------------- MAYIGFVEEKGALYCELCYEKFFASGPSSG --------%%%%------------------ >ZINC FINGER MYM-TYPE PROT; SWP:Q9UJ78; PDB:2DASA; GSSGSSGQPTAQQQLTKPAKITCANCKKPLQKGQTAYQRKGSAHLFCSTTCLSSFSSGPS -----------------------------------------------3333-3333---- SG -- >POSSIBLE GLOBAL TRANSCRIP; SWP:Q5TB71; PDB:2DATA; GSSGSSGSPNPPKLTKQMNAIIDTVINYKDSSGRQLSEVFIQLPSRKELPEYYELIRKPV ----------3333---------------------3333---------3333-------- DFKKIKERIRNHKYRSLGDLEKDVMLLCHNAQTFNLEGSQIYEDSIVLQSVFKSARQSGP -3333---1111---3333-----------3333-------------------3333--- SSG --- 
>MYOSIN-BINDING PROTEIN C,; SWP:Q00872; PDB:2DAVA; GSSGSSGILFIEKPQGGTVKVGEDITFIAKVKAEDLLRKPTIKWFKGKWMDLASKAGKHL -------------------2222---------------------------3333------ QLKETFERHSRVYTFEMQIIKAKDNFAGNYRCEVTYKDKFDSCSFDLEVHESTGTTPNID ------3333------------1111---------------------------------- SGPSSG ------ >RWD DOMAIN CONTAINING PRO; SWP:Q9UIY3; PDB:2DAWA; GSSGSSGMSASVKESLQLQLLEMEMLFSMFPNQGEVKLEDVNALTNIKRYLEGTREALPP ---------------------------------------1111-----3333-------- KIEFVITLQIEEPKVKIDLQVTMPHSYPYLALQLFGRSSELDRHQQLLLNKGLTSYIGTF ------------------------------------------------------------ DPGELCVCAAIQWLQDNSASYFLNRKLVSGPSSG -----3333--------3333------------- >PROTEIN C21ORF6; SWP:P57060; PDB:2DAXA; GSSGSSGEQAEAQLAELDLLASMFPGENELIVNDQLAVAELKDCIEKKTMEGRSSKVYFT ------3333---------3333------------------------------------- INMNLDVSDEKMAMFSLACILPFKYPAVLPEITVRSVLLSRSQQTQLNTDLTAFLQKHCH -------%%%%------------------------3333--------------------- GDVCILNATEWVREHASGYVSRDTSSSGPSSG ----3333--------3333------------ >RING FINGER PROTEIN 25; SWP:Q96BH1; PDB:2DAYA; GSSGSSGEEDWVLPSEVEVLESIYLDELQVIKGNGRTSPWEIYITLHPATAEDQDSQYVC ------------3333---3333---------%%%%------------------------ FTLVLQVPAEYPHEVPQISIRNPRGLSDEQIHTILQVLGHVAKAGLGTAMLYELIEKGKE -------3333------------------------------------------------- ILSGPSSG -------- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2DAZA; GSSGSSGDAFTDQKIRQRYADLPGELHIIELEKDKNGLGLSLAGNKDRSRMSIFVVGINP ----------3333----3333-----------1111---------3333--------11 EGPAAADGRMRIGDELLEINNQILYGRSHQNASAIIKTAPSKVKLVFIRNEDAVNQMASG 113333------------%%%%-----3333------------------1111------- PSSG ---- >253AA LONG HYPOTHETICAL P; SWP:O58277; PDB:2DB0A; DIREALANGEHLEKILIMAKYDESVLKKLIELLDDDLWTVVKNAISIIMVIAKTREDLYE -----1111---------------------3333--------------------3333-- PMLKKLFSLLKKSEAIPLTQEIAKAFGQMAKEKPELVKSMIPVLFANYRIGDEKTKINVS ------------------------------------------------------------ YALEEIAKANPMLMASIVRDFMSMLSSKNREDKLTALNFIEAMGENSFKYVNPFLPRIIN 
---------------------1111--------------11111111---3333------ LLHDGDEIVRASAVEALVHLATLNDKLRKVVIKRLEELNDTSSLVNKTVKEGISRLLLL 1111--------------3333--3333--------------------------3333- >HETEROGENEOUS NUCLEAR RIB; SWP:NA; PDB:2DB1A; GMMLGPEGGEGYVVKLRGLPWSCSIEDVQNFLSDCTIHDGVAGVHFIYTREGRQSGEAFV -------------------11113333----3333----1111----------------- ELESEDDVKLALKKDRESMGHRYIEVFKSHRTEMDWVLKHSGPNSASGPSSG ---3333---3333----------------3333----------%%%%---- >KIAA0890 PROTEIN; SWP:Q7L2E3; PDB:2DB2A; GSSGSSGASRDLLKEFPQPKNLLNSVIGRALGISHAKDKLVYVHTNGPKKKKVTLHIKWP ----------3333---3333----------3333------------------------- KSVEVEGYGSKKIDAERQAAAAACQLFKGWGLLGPRNELFDAAKYRVLADRFGSGPSSG -----------------------------------------3333-------------- >ATP-DEPENDENT RNA HELICAS; SWP:P09052; PDB:2DB3A; YIPPEPSNDAIEIFSSGIASGIHFSKYNNIPVKVTGSDVPQPIQHFTSADLRDIIIDNVN --------3333--------1111--------------------1111-----------1 KSGYKIPTPIQKCSIPVISSGRDLMACAQTGSGKTAAFLLPILSKLLEDPHELELGRPQV 111--------------1111-------22223333------------------------ VIVSPTRELAIQIFNEARKFAFESYLKIGIVYGGTSFRHQNECITRGCHVVIATPGRLLD ------------------------------------------------------------ FVDRTFITFEDTRFVVLDEADRMLDMGFSEDMRRIMTHVTMRPEHQTLMFSATFPEEIQR -1111---1111--------3333-1111--------1111-------------3333-- MAGEFLKNYVFVAIGIVGGACSDVKQTIYEVNKYAKRSKLIEILSEQADGTIVFVETKRG -3333----------2222-1111-------1111------------------------- ADFLASFLSEKEFPTTSIHGDRLQSQREQALRDFKNGSMKVLIATSVASRGLDIKNIKHV --------1111------11113333-------------------1111----3333--- INYDMPSKIDDYVHRIGRTGRVGNNGRATSFFDPEKDRAIAADLVKILEGSGQTVPDFLR --------------------iiii--------33333333--------1111---3333- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2DB5A; GSSGSSGLGNEDFNSVIQQMAQGRQIEYIDIERPSTGGLGFSVVALRSQNLGKVDIFVKD -----------3333-----iiii------------------------------------ VQPGSVADRDQRLKENDQILAINHTPLDQNISHQQAIALLQQTTGSLRLIVAREPVHTKS ---------------------%%%%--3333-3333------------------------ STSGPSSG -------- >SH3 AND CYSTEINE RICH DOM; SWP:Q96MF2; 
PDB:2DB6A; GSSGSSGEPPKLVNDKPHKFKDHFFKKPKFCDVCARMIVLNNKFGLRCKNCKTNIHEHCQ ---------------------------------------%%%%------------11113 SYVEMQRCSGPSSG 333----------- >Hairy/enhancer-of-split r; SWP:NA; PDB:2DB7A; SGGYFDAHALADYRSLGFRECLAEVARYLSIIEGLDASDPLRVRLVSHLNNYASQR !!!!-------------------------------1111----------------- >TRIPARTITE MOTIF PROTEIN ; SWP:Q9C026; PDB:2DB8A; GSSGSSGPVPATPILQLEECCTHNNSATLSWKQPPLSTVPADGYILELDDGNGGQFREVY ---------------------------------1111----------------------- VGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYSKTLVLQTSEGSGPSSG -------------------------------------------------- >Smooth muscle cell associ; SWP:NA; PDB:2DBAA; GSSGSSGMTVSGPGTPEPRPATPGASSVEQLRKEGNELFKCGDYGGALAAYTQALGLDAT --------------------------3333-------------3333------------1 PQDQAVLHRNRAACHLKLEDYDKAETEASKAIEKDGGDVKALYRRSQALEKLGRLDQAVL 111----------------------------3333--------------3333------- DLQRCVSLEPKNKVFQEALRNISGPSSG ---3333----3333------------- >Putative HTH-type transcr; SWP:O57802; PDB:2DBBA; KLDRVDMQLVKILSENSRLTYRELADILNTTRQRIARRIDKLKKLGIIRKFTIIPDIDKL ---------------1111------------------------------------3333- GYMYAIVLIKSKVPSDADKVISEISDIEYVKSVEKGVGRYNIIVRLLLPKDIKDAENLIS ------------3333-------1111--------------------------------- EFLQRIKNAENVEVILISEVRKFEII -3333--------------------- >UNNAMED PROTEIN PRODUCT; SWP:Q78Y63; PDB:2DBCA; GSSGSSGKFGELREISGNQYVNEVTNAEKDLWVVIHLYRSSVPMCLVVNQHLSVLARKFP ----------------3333---1111-----------3333----------------33 ETKFVKAIVNSCIEHYHDNCLPTIFVYKNGQIEGKFIGIIECGGINLKLEELEWKLSEVG 33------------------------------------3333-11113333--------- AIQSDLEENSGPSSG --------------- >NUCLEAR FACTOR NF-KAPPA-B; SWP:P19838; PDB:2DBFA; GSSGSSGDMKQLAEDVKLQLYKLLEIPDPDKNWATLAQKLGLGILNNAFRLSPAPSKTLM --------33333333---------------3333------3333---3333-------- DNYEVSGGTVRELVEALRQMGYTEAIEVIQAASSSGPSSG --------3333----------3333-------------- >MYELOID CELL NUCLEAR DIFF; SWP:P41218; PDB:2DBGA; GSSGSSGMVNEYKKILLLKGFELMDDYHFTSIKSLLAYDLGLTTKMQEEYNRIKITDLME 
---------3333-----------3333------3333----3333-------------- KKFQGVACLDKLIELAKDMPSLKNLVNNLRKEKSKVASGPSSG ----3333----------3333--------------------- >Tumor necrosis factor rec; SWP:O75509; PDB:2DBHA; GSSGSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIFDDMLHFLNPEELRVIEEIP -------------------------------------3333-3333--3333-------- QAEDKLDRLFEIIGVKSQEASQTLLDSVYSHLPDLLSGPSSG -3333-------1111---------------3333------- >Proto-oncogene tyrosine-p; SWP:Q12866; PDB:2DBJA; GSSGSSGWILASTTEGAPSVAPLNVTVFLNESSDNVDIRWMKPPTKQQDGELVGYRISHV ----------------------------------------------3333---------- WQSAGISKELLEEVGQNGSRARISVQVHNATCTVRIAAVTRGGVGPFSDPVKIFIPAHSG --2222------------------------------------------------------ PSSG ---- >SH3-CONTAINING GRB2-LIKE ; SWP:Q99962; PDB:2DBMA; GSSGSSGPCCRALYDFEPENEGELGFKEGDIITLTNQIDENWYEGMLHGHSGFFPINYVE ----------------------------------------------%%%%---------- ILVALPHSGPSSG ------------- >HYPOTHETICAL PROTEIN YBIU; SWP:P75791; PDB:2DBNA; MASTFTSDTLPADHKAAIRQMKHALRAQLGDVQQIFNQLSDDIATRVAEINALKAQGDAV 3333-------------------------------------------------1111--- WPVLSYADIKAGHVTAEQREQIKRRGCAVIKGHFPREQALGWDQSMLDYLDRNRFDEVYR ----3333---------------------------------------------3333--- PEIYPIYWSQAQMQARQSEEMANAQSFLNRLWTFESDGKQWFNPDVSVIYPDRIRRRPPG ----------------------------1111---%%%%---1111-----------222 TTSKGLGAHTDSGALERWLLPAYQRVFANVFNGNLAQYDPWHAAHRTEVEEYTVDKCSVF 2------------3333---------1111---3333-11112222-------------- RTFQGWTALSDMLPGQGLLHVVPIPEAMAYVLLRPLLDDVPEDELCGVAPGRVLPVSEQW ------------2222----------------1111----1111iiii2222----3333 HPLLIEALTSIPKLEAGDSVWWHCDVIHSVAPVENQQGWGNVMYIPAAPMCEKNLAYAHK -3333---------2222----1111---------------------------------- VKAALEKGASPGDFPREDYETNWEGRFTLADLNIHGKRALGMD ----------1111---1111------3333------1111-- >D-TYROSYL-TRNA(TYR) DEACY; SWP:O66742; PDB:2DBOA; MRAVIQRVKKSWVEVDGKVVGSINEGLNVFLGVRKGDTEEDIEKLVNKILNLRIFEDERG --------------iiii-------------------3333---------------1111 
KFQYSVLDIKGEILVVSQFTLYANVKKGRRPSFEEAEEPKRAKELYEKFVDKIKESGLKV ----3333------------------------3333------------------------ ETGIFGAMMDVFIENWGPVTIIIDSREI -----------------------1111- >GLYOXYLATE REDUCTASE; SWP:O58320; PDB:2DBQA; KPKVFITREIPEVGIKLEDEFEVEVWGDEKEIPREILLKKVKEVDALVTLSERIDKEVFE ----------------1111--------------------1111---------------- NAPKLRIVANYAVGYDNIDIEEATKRGIYVTNTPDVLTDATADLAFALLLATARHVVKGD -1111---------1111-----1111-------1111---------------------- RFVRSGEWKKRGVAWHPKWFLGYDVYGKTIGIIGLGRIGQAIAKRAKGFNRILYYSRTRK --1111---------1111-----2222----------------3333------------ EEVERELNAEFKPLEDLLRESDFVVLAVPLTRETYHLINEERLKLKKTAILINIARGKVV ------------------------------3333----3333---1111------1111- DTNALVKALKEGWIAGAGLDVFEEEPYYNEELFKLDNVVLTPHIGSASFGAREGAELVAK ----------------------------3333-----------1111------------- NLIAFKRGEIPPTLVNREVIKIRKPGF -----------------3333------ >HYPOTHETICAL PROTEIN TTHC; SWP:Q5SGN2; PDB:2DBSA; RLAELDGVLQYLLEADLLRELPPTYRLVLLPLDEPEVAAQALAWAEAPNPEGWPSVYALF ----------------3333----------11113333---------------------- LQGRPIRLLLLGKEVEVA iiii-------------- >GTP-BINDING PROTEIN; SWP:Q5SJ29; PDB:2DBYA; LAVGIVGLPNVGKSTLFNALTRANALAANYPFATIDKNVGVVPLEDERLYALQRTFAKGE -----------------------1111---3333-1111------------------!!! 
RVPPVVPTHVEFVDIAGLVKGAHKGEGLGNQFLAHIREVAAIAHVLRCFPDPLEDAEVVE !-------------------------2222----3333------------3333------ TELLLADLATLERRLERLRKEARADRERLPLLEAAEGLYVHLQEGKPARTFPPSEAVARF ---------------------11111111------------1111-3333---------- LKETPLLTAKPVIYVANVAEEDLPDGRGNPQVEAVRRKALEEGAEVVVVSARLEAELAEL -----1111---------3333---2222---------------------------3333 SGEEARELLAAYGLQESGLQRLARAGYRALDLLTFFTAGEKEVRAWTVRRGTKAPRAAGE ---------1111-------------------------3333--------------3333 IHSDERGFIRAEVIPWDKLVEAGGWARAKERGWVRLEGKDYEVQDGDVIYVLF ----1111------3333-------------------1111--2222------ >PROBABLE AMIDASE; SWP:Q5SHD3; PDB:2DC0A; MDLLEAKRLLETGRTTPLALLEEALERAKAFQDRNALAYLDEEAARKEALALTEELRRGQ ------------------------------3333---------------------1111- VRGPLHGLPLTVKDLFPVKGMPTRAGTKAPLPPLPEEARAVRRLREAGALLFAKTNHEIA --1111-----------2222--%%%%--------------------------------- LGITGENPWTGPVRNAVDPSRQAGGSSGGSAVAVALGIGLASLGTDTGGSIRIPAGFNGV ------3333----3333------------------------------------------ VGFKPSYGRVSLEGALPLSRSTDHAGPLTRSVRDAHFLTEILAGESIPLEGVQNPVFGVP -----2222--2222---1111-------------------------------------3 LDFLEGRLGVEVRKAFTRLLEDLPALRAEVREVSLPLEGVYEVYTRLVRYEAARIHEKAL 3332222--------------3333-----------2222-------------------- KEHPEGFSPQVREALLAGLALTEKDYRDAVAEREALRLELVKALRGVDALLLPVQPLPAP --1111-------------------------------------2222------------- PLGTEEVELESGRKGHREAFITLTLPFSLLGVPTLALPFAKVEGPVGLQVVGAYGEDGKV 2222----1111---------------1111---------------------2222---- LALGGWLEARLG -------1111- >L-ASPARTATE DEHYDROGENASE; SWP:O28440; PDB:2DC1A; MLVGLIGYGAIGKFLAEWLERNGFEIAAILDVRGEHEKMVRGIDEFLQREMDVAVEAASQ -----------------------------------1111------1111----------- QAVKDYAEKILKAGIDLIVLSTGAFADRDFLSRVREVCRKTGRRVYIASGAIGGLDAIFS ---------------------3333-----------------------!!!!-------- ASELIEEIVLTTRKNWRQFGRKGVIFEGSASEAAQKFPKNLNVAATLSIASGKDVKVRLV 3333-----------3333----------------------------------------- ADEVEENIHEILVRGEFGEMEIRVRNRPMRENPKTSYLAALSVTRILRNLKEGLVV 
--------------1111----------3333---3333----------------- >golgi associated PDZ and ; SWP:Q9HD26; PDB:2DC2A; PIRKVLLLKEDHEGLGISITGGKEHGVPILISEIHPGQPADRCGGLHVGDAILAVNGVNL ---------3333--------3333---------------3333----------iiii-1 RDTKHKEAVTILSQQRGEIEFEVVYVA 111--------3333------------ >CYTOGLOBIN; SWP:Q8WWM9; PDB:2DC3A; EELSEAERKAVQAMWARLYANCEDVGVAILVRFFVNFPSAKQYFSQFKHMEDPLEMERSP -------------------------------------------1111------------- QLRKHACRVMGALNTVVENLHDPDKVSSVLALVGKAHALKHKVEPVYFKILSGVILEVVA ------------------1111---------------------3333------------- EEFASDFPPETQRAWAKLRGLIYSHVTAAYKEVGWVQQVPNATTPPATLPSS 1111--------------------------1111------1111-------- >165AA LONG HYPOTHETICAL P; SWP:O58740; PDB:2DC4A; MEIEVKFRVNFEDIKRKIEGLGAKFFGIEEQEDVYFELPSPKLLRVRKINNTGKSYITYK ------------------3333-------------------------------------- EILDKRNEEFYELEFEVQDPEGAIELFKRLGFKVQGVVKKRRWIYKLNNVTFELNRVEKA ---------------------------1111---------------!!!!---------- GDFLDIEVITSNPEEGKKIIWDVARRLGLKEEDVEPKLYIELIN ------------------------1111-3333----3333--- >GLUTATHIONE S-TRANSFERASE; SWP:Q80W21; PDB:2DC5A; PTLGYWDIRGLAHAIRLFLEYTDSSYEEKRYTGDAPDYDQSQWLNEKFKLGLDFPNLPYL --------!!!!----------------------3333-3333--1111----------- IDGSHKITQSNAILRYLGRKHNLCGETEEERIRVDILENQLDNRVLARLCYNADFEKLKP -!!!!-------------1111-----------------------------1111----- GYLEQLPGRLYSEFLGKRPWFAGDKITFVDFIAYDVLERNQVFEAKCLDAFPNLKDFIAR --------------!!!!-1111---3333-------------11111111--------- FEGLKKISDYKTSRFLPRPFTKATWGSN -----------3333------------- >CATHEPSIN B; SWP:P07688; PDB:2DCCA; LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNVNVEVSAEDMLTC -----3333-11113333---------3333----------------------------- CGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGE -3333-!!!!--3333--------------2222-------------------------- GDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLL ----------2222--3333-----------------------------------3333- YKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGI 
--------------------------iiii---------1111-iiii-------2222- ESEIVAGMPCT ----------- >KIAA1915 PROTEIN; SWP:Q5VVJ2; PDB:2DCEA; GSSGSSGHEEEELKPPEQEIEIDRNIIQEEEKQAIPEFFEGRQAKTPERYLKIRNYILDQ ---------------------------333311111111--------------------- WEICKPKYLNKTSVRPGLKNCGDVNCIGRIHTYLELIGAINFGCEQAVYNR ---------3333-3333----3333------------------------- >6-AMINOHEXANOATE-DIMER HY; SWP:Q59710; PDB:2DCFA; STGQHPARYPGAAAGEPTLDSWQEPPHNRWAFAHLGEMVPSAAVSRRPGHALARLGAIAA --------22222222-1111----333333333333-------------------3333 QLPDLEQRLEQTYTDAFLVLRGTEVVAEYYRAGFAPDDRHLLMAVSKSLCGTVVGALVDE -1111---------------!!!!------22221111---!!!!--------------- GRIDPAQPVTEYVPELAGSVYDGPSVLQVLDMQISIDYNEDYVDPASEVQTHDRSAGWRT ---11113333-1111--1111------------------1111---------------- RRHGDPADTYEFLTTLRGDGSTGEFQYCSANTDVLAWIVERVTGLRYVEALSTYLWAKLD -2222----------------------3333-----------------------1111-- ADRDATITVDTTGFGFANGGVSCTARDLARVGRMMLDGGVAPGGRVVSEDWVRRVLAGGS ---------1111----------3333-------1111--1111---------------3 HEAMTDKGFTNTFPDGSYTRQWWCTGNERGNVSGIGIHGQNLWLDPLTDSVIVKLSSWPD 333--3333---1111--%%%%----1111---------------1111----------- PDTEHWHRLQNGILLDVSRALDAV ------------------1111-- >PUTATIVE HOMING ENDONUCLE; SWP:NA; PDB:2DCHX; HMVWDYLCGLIAADGHLDEEGYITILQKDRRFIDKIVALLKSAEIKISSLFYDKGAGVWK -----------------1111--------------------------------------- IKVKDERLYRYLVNNGVIPGKVLRPPSSAVDPLWYIIGFIDGDGWVEQVVKRAGDKSYYY -----------1111-----------3333----------------------%%%%---- IRIGIKTKSKELRDWIAQTLNDLGIRASRADKSDGYEVHIDGVEAWRLVPHLQNPTHLER --------------------1111-------1111------------3333--3333--- AQSVKDNRLSLLF -3333----1111 >XYLANASE J; SWP:Q9RC94; PDB:2DCKA; AITSNEIGTHDGYDYEFWKDSGGSGSMTLNSGGTFSAQWSNVNNILFRKGKKFDETQTHQ ---------iiii-----------------!!!!-----------------------333 QIGNMSINYGATYNPNGNSYLTVYGWTVDPLVEFYIVDSWGTWRPPGGTPKGTINVDGGT 3------------------------------------------------------iiii- YQIYETTRYNQPSIKGTATFQQYWSVRTSKRTSGTISVSEHFRAWESLGMNMGNMYEVAL 
------------1111--------------------3333-----1111----------- TVEGYQSSGSANVYSNTLTIGATRVEAESMTKGGPYTSNITSPFNGVALYANGDNVSFNH -------------------------1111---------------------2222------ SFTKANSSFSLRGASNNSNMARVDLRIGGQNRGTFYFGDQYPAVYTINNINHGIGNQLVE --------------------------iiii------------------------------ LIVTADDGTWDAYLDYLEIR -------------------- >HYPOTHETICAL UPF0166 PROT; SWP:O59172; PDB:2DCLA; VEVEHWNTLRLRIYIGENDKWEGRPLYKVIVEKLREMGIAGATVYRGIYGFGTDLPIIVE ---------------1111-iiii------------------------------------ VVDRGHNIEKVVNVIKPMIKDGMITVEPTIVLWVGTQEE ---------------3333-------------------- >HYPOTHETICAL FRUCTOKINASE; SWP:Q96XN9; PDB:2DCNA; AKLITLGEILIEFNALSPGPLRHVSYFEKHVAGSEANYCVAFIKQGNECGIIAKVGDDEF -------------------3333-------------------1111-------------- GYNAIEWLRGQGVDVSHMKIDPSAPTGIFFIQRHYPVPLKSESIYYRKGSAGSKLSPEDV --------1111--1111------------------2222------22221111-3333- DEEYVKSADLVHSSGITLAISSTAKEAVYKAFEIASNRSFDTNIRLKLWSAEEAKREILK 3333----------------------------------------1111------------ LLSKFHLKFLITDTDDSKIILGESDPDKAAKAFSDYAEIIVMKLGPKGAIVYYDGKKYYS ------------------------3333----3333--------3333----%%%%---- SGYQVPVEDVTGAGDALGGTFLSLYYKGFEMEKALDYAIVASTLNVMIRGDQENLPTTKD ---------2222-----------1111----------------1111------------ IETFLREM -------- >TACHYSTATIN-B1; SWP:P0C1Z8; PDB:2DCVA; YVSCLFRGARCRVYSGRSCCFGYYCRRDFPGSIFGTCSRRNF -----2222--1111----2222------------------- >TACHYSTATIN-B2; SWP:P0C1Z9; PDB:2DCWA; YITCLFRGARCRVYSGRSCCFGYYCRRDFPGSIFGTCSRRNF -----------1111----2222------------------- >ENDO-1,4-BETA-XYLANASE A; SWP:P18429; PDB:2DCYA; ASTDYWQNWTDGGGIVNAVNGSGGNYSVNWSNTGNFVVGKGWTTGSPFRTINYNAGVWAP ---------------------!!!!--------------------1111----------- NGNGYLTLYGWTRSPLIEYYVVDSWGTYRPTGTYKGTVKSDGGTYDIYTTTRYNAPSIDG ---------------------------------------iiii-------------1111 DRTTFTQYWSVRQSKRPTGSNATITFSNHVNAWKSHGMNLGSNWAYQVMATEGYQSSGSS ------------------------3333-----1111----------------------- NVTVW ----- >THIOCYANATE HYDROLASE ALP; SWP:O66187; 
PDB:2DD5A; PVWDRTHHAKMATGIGDPQCFKGMAGKSKFNVGDRVRIKDLPDLFYTRTMTYTRGATGTI -----------2222--3333-1111----2222---------------3333------- VRLVYESPAAEDEAFGNEENVEWFYSIVFAQKDLWPEYSDTFANDTLETEIPERYLEKA --------33331111-------------3333-11113333---------3333---- >Thiocyanate hydrolase sub; SWP:O66186; PDB:2DD5B; SSIREEVHRHLGTVALMQPALHQQTHAPAPTEITHTLFRAYTRVPHDVGGEADVPIEYHE ------------3333------------3333-----------33332222--------- KEEEIWELNTFATCECLAWRGVWTAEERRRKQNCDVGQTVYLGMPYYGRWLLTAARILVD -----------------1111-------------------------------------11 KQFVTLTELHNKIVEMRERVASGQGLGEYLPP 11-----------------------!!!!--- >Thiocyanate hydrolase sub; SWP:O66188; PDB:2DD5C; EVSDFEILEMAVRELAIEKGLFSAEDHRVWKDYVHTLGPLPAARLVAKAWLDPEYKKLCI ----------------1111-------------1111----------------------- EDGVEASKAVGVNWVTSPPTQFGTPSDYCNLRVLADSPTLKHVVVCTLSYPRPILGQSPE --33333333-------3333--1111---------1111----------3333----33 WYRSPNYRRRLVRWPRQVLAEFGLQLPSEVQIRVADSNQKTRYIVMPVRPEGTDGWTEDQ 33-----------------1111---1111----------------------2222---- LAEIVTRDCLIGVAVPKPGITVNAKRPVLKANRPV -11113333-------2222--------------- >GREEN FLUORESCENT PROTEIN; SWP:Q2MHN7; PDB:2DD7A; TTFKIESRIHGNLNGEKFELVGGGVGEEGRLEIEMKTKDKPLAFSPFLLSHCMFYHFASF ------------iiii----------2222------------------1111-3333--- PKGTKNIYLHAATNGGYTNTRKEIYEDGGILEVNFRYTYEFNKIIGDVECIGHGFPSQSP 2222------1111----------1111-----------2222------------11111 IFKDTIVKSCPTVDLMLPMSGNIIASSYARAFQLKDGSFYTAEVKNNIDFKNPIHESFSK 111------------------------------1111-----------------3333-- SGPMFTHRRVEETHTKENLAMVEYQQVFNSAPRD ---------------------------------- >IGHM protein; SWP:Q6PJF1; PDB:2DD8H; VQLQQSGAEVKKPGSSVKVSCKASGGTFSSYTISWVRQAPGQGLEWMGGITPILGIANYA -----------2222-------1111------------2222-----------------3 QKFQGRVTITTDESTSTAYMELSSL 333--------3333---------- >PSEUDECIN; SWP:Q8AVA3; PDB:2DDBA; NYQKEIVDKHNALRRSVKPTARNMLQMKWNSHAAQNAKRWADRCTFAHSPPNTRTVGKLR ---------------------------------------3333------3333--!!!!- 
CGENIFMSSQPFPWSGVVQAWYDEIKNFVYGIGAKPPGSVIGHYTQVVWYKSHLIGCASA ------------3333-------------------1111-3333----1111-------- KCSSSKYLYVCQYCPAGNIRGSIATPYKSGPPCADCPSACVNRLCTNPCNYNNDFSNCKS ---------------------3333-----2222-1111-iiii---------------- LAKKSKCQTEWIKKKCPASCFCHNKII ----%%%%--------3333-1111-- >PHOTOCONVERTIBLE FLUORESC; SWP:Q53UG8; PDB:2DDDA; VSVITSEMKIEVRMEGAVNGHKFVITGKGSGQPFEGIQNVDLTVIEGGPLPFAFDILTTA -----------------iiii----------3333---------------------1111 FNRVFVKYPEEIVDYFKQSFPEGYSWERSMSYEDGGICLATNNITMKKDGSNCFVNEIRF --------1111-3333--------------1111------------------------- DGVNFPANGPVMQRKTVKWESSTEKMYVRDGVLKGDVNMALLLQGGGHYRCDFRTTYKAK -----1111-------------------iiii----------1111-------------- KVVQLPDYHFVDHLMEITSHDKDYNKVKLYEHAKAHSGLPRLA --------------------1111------------------- >ADAM 17; SWP:P78536; PDB:2DDFA; PDPMKNTCKLLVVADHRFYRYMGRGEESTTTNYLIELIDRVDDIYRNTAWDNAGFKGYGI -1111-----------------%%%%----------------------1111-------- QIEQIRILKSPQEVKPGEKHYNMAKSYPNEEKDAWDVKMLLEQFSFDIAEEASKVCLAHL --------------2222-1111-----1111-----------------3333------- FTYQDFDMGTLGLAYGGSPHGGVCPKAYYSPVGKKNIYLNSGLTSTKNYGKTILTKEADL ------iiii--------------------3333-------------%%%%--3333--- VTTHELGHNFGAEHDPDGLAECAPNEDQGGKYVMYPIAVSGDHENNKMFSQCSKQSIYKT ---------------11111111-1111---1111-------1111-------------- IESKAQECFQERS ------------- >ACYL-COA OXIDASE; SWP:P07872; PDB:2DDHA; MNPDLRKERASATFNPELITHILDGSPENTRRRREIENLILNDPDFQHEDYNFLTRSQRY --3333--1111---------------------------11111111--3333------- EVAVKKSATMVKKMREYGISDPEEIMWFKNSVHRGHPEPLDLHLGMFLPTLLQATAEQQE --------------------------------iiii-1111------------------- RFFMPAWNLEITGTYAQTEMGHGTHLRGLETTATYDPKTQEFILNSPTVTSIKWWPGGLG ------------------1111--3333-------------------3333----2222- KTSNHAIVLAQLITQGECYGLHAFVVPIREIGTHKPLPGITVGDIGPKFGYEEMDNGYLK -------------%%%%-------------------2222----------1111------ MDNYRIPRENMLMKYAQVKPDGTYVKPLMVFVRSFLVGNAAQSLSKACTIAIRYSAVRRQ 
------1111--------1111-------------------------------------- SEIKQSEPEPQILDFQTQQYKLFPLLATAYAFHFVGRYMKETYLRSELPELHALTAGLKA ---1111----------------------------------------------------- FTTWTANAGIEECRMACGGHGYSHSSGIPNIYVTFTPACTFEGENTVMMLQTARFLMKIY -----------------3333---------------1111-------------------- DQVRSGKLVGGMVSYLNDLPSVDINSLEGLTEAYKLRAARLVEIAAKNLQTHVSHRKSKE --1111----1111-1111---3333---------------------------------- VAWNLTSVDLVRASEAHCHYVVVKVFSDKLPKIQDKAVQAVLRNLCLLYSLYGISQKGGD ----------------------------3333---------------------------- FLEGSIITGAQLSQVNARILELLTLIRPNAVALVDAFDFKDMTLGSVLGRYDGNVYENLF -1111--------------------3333----------3333--33331111------- EWAKKSPLNKTEVHESYHKHLK -----3333----3333----- >WAP, follistatin/kazal, i; SWP:Q7LDW0; PDB:2DDIA; EAEAEFTDACVLPAVQGPCRGWEPRWAYSPLLQQCHPFVYGGCEGNGNNFHSRESCEDAC -------3333------------------1111--------------------------- PVVDHHHHHH ---------- >PYRIDOXINE KINASE; SWP:P40191; PDB:2DDMA; KSRALQADIVAVQSQVVYGSVGNSIAVPAIKQNGLNVFAVPTVLLSNTPHYDTFYGGAIP --------------------!!!!-----3333--------------3333--------- DEWFSGYLRALQERDALRQLRAVTTGYMGTASQIKILAEWLTALRKDHPDLLIMVDPVIG ----------------1111---------------------------1111--------- DIDSGIYVKPDLPEAYRQYLLPLAQGITPNIFELEILTGKNCRDLDSAIAAAKSLLSDTL --------1111-------3333-----------------------------1111---- KWVVVTSQEMQVVVVTADSVNVISHSRVKTDLKGTGDLFCAQLISGLLKGKALTDAVHRA ---------------1111---------------------------1111---------- GLRVLEVMRYTQQHESDELILPPL -----------1111--------- >If kappa light chain [Fra; SWP:A2NHM3; PDB:2DDQH; KVKLQESGPELVKPGASVKMSCKASGYTFTSYVMHWVKQKPGQGLEWIGYINPYNDGTKY ------------2222-----------1111----------------------------- NEKFKGKATLTSDKSSSTAYMELSSLTSEDSAVYYCAPYGGYWGQGTTVTVSSAKTTAPS 3333----------------------3333--------3333------------------ VYPLAPVCGTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVT -------------------------------iiii------------------------- VTSSTWPSQSITCNVAHPASSTKVDKKIEPR -3333-----------3333----------- >SPHINGOMYELIN 
PHOSPHODIES; SWP:Q81HW0; PDB:2DDRA; NDTLKVMTHNVYMLSTNLYPNWGQTERADLIGAADYIKNQDVVILNEVFDNSASDRLLGN --------------3333----3333--------1111---------------------- LKKEYPNQTAVLGRSSGSEWDKTLGNYSSSTPEDGGVAIVSKWPIAEKIQYVFAKGCLSN -3333-----2222--3333---------------------------------------- KGFVYTKIKKNDRFVHVIGTHLQAESPASVRTNQLKEIQDFIKNKNIPNNEYVLIGGDMN ------------------------------------------3333-1111--------- VNKINAENNNDSEYASMFKTLNASVPSYTGHTATWDATTNSIAKYNFPDSPAEYLDYIIA --2222--11113333-------------------1111-------1111---------- SKDHANPSYIENKVLQPKSPQWTVTSWFQKYTYNDYSDHYPVEATISM 1111---------------------%%%%------------------- >REELIN; SWP:Q64FW1; PDB:2DDUA; LTLKPGYVLQFKLNIGCTSQFSSTAPVLLQYSHDAGMSWFLVKEGCFPASAGKGCEGNSR ----------------------------------------------1111-----!!!!- ELSEPTVYYTGDFEEWTRITIAIPRSLASSKTRFRWIQEVPPFGLDGVYISEPCPSYCSG --------2222-----------------------------------------2222--- HGDCISGVCFCDLGYTAAQGTCVSNTPNHSEMFDRFEGKLSPLWYKITGGQVGTGCGTLN ----iiii---2222--iiii-------------------1111---------1111--- DGRSLYFNGLGKREARTVPLDTRNIRLVQFYIQIGSKTSGITCIKPRARNEGLVVQYSND ---------------------1111--------------1111----------------i NGILWHLLRELDFMSFLEPQIISIDLPREAKTPATAFRWWQPQHGKHSAQWALDDVLISR iii--------1111-----------3333-------------3333------------- L - >BETA-1,3-XYLANASE; SWP:NA; PDB:2DDXA; LDGVLVPESGILVSVGQDVDSVNDYASALGTIPAGVTNYVGIVNLDGLNSDADAGAGRNN iiii----------------------------------------2222------------ IAELANAYPTSALVVGVSMNGEVDAVASGRYNANIDTLLNTLAGYDRPVYLRWAYEVDGP -------1111-------iiii--------------------1111---------11111 WNGHSPSGIVTSFQYVHDRIIALGHQAKISLVWQVASYCPTPGGQLDQWWPGSEYVDWVG 111---------------------1111---------3333----3333--3333----- LSYFAPQDCNWDRVNEAAQFARSKGKPLFLNESTPQRYQVADLTYSADPAKGTNRQSKTS ----------------------------------2222-3333----------------- QQLWDEWFAPYFQFMSDNSDIVKGFTYINADWDSQWRWAAPYNEGYWGDSRVQANALIKS -----------------3333---------33333333------------1111------ NWQQEIAKGQYINHSETLFETLGY -------1111---1111-1111- >190AA LONG HYPOTHETICAL P; 
SWP:O57962; PDB:2DDZA; SMELLIIKERRIDYDGSAIRSHWAYRNFGILGDSLVVFRGKCNVKVEEMVDIEDLRLRKE ---------------11112222---------------------3333------------ IKGDDMVHYILELFWHPDILLASSLQKLLIARLVELLWNYGIEASRRGDDIYVNGRKLSI -------------------------------------1111-----!!!!--%%%%---- SIATVSPVSIKIHIGLNVKTVGVPPGVDAIGLEELGIDPTEFMERSAKALVEEIEKVRKD -----------------------------------------------------------1 SLKVRWVT 111----- >Alpha-(1,6)-fucosyltransf; SWP:Q9BYC5; PDB:2DE0X; LGKDHEILRRRIENGAKELWFFLQSELKKLKNLEGNELQRHADEFLLDLGHHERSIMTDL -----------------------------------3333--------------------- YYLSQTDGAGDWREKEAKDLTELVQRRITYLQNPKDCSKAKKLVCNINKGCGYGCQLHHV --1111-3333------------------------3333--------------------- VYCFMIAYGTQRTLILESQNWRYATGGWETVFRPVSETCTDRSGISTGHWSGEVKDKNVQ -------1111---------3333--1111---------------------3333----- VVELPIVDSLHPRPPYLPLAVPEDLADRLVRVHGDPAVWWVSQFVKYLIRPQPWLEKEIE -----3333------------1111---3333---------------------------- EATKKLGFKHPVIGVHVRRTEAAFHPIEEYMVHVEEHFQLLARRMQVDKKRVYLATDDPS ---1111------------------3333-----------3333-------------333 LLKEAKTKYPNYEFISDNSISWSAGLHNRYTENSLRGVILDIHFLSQADFLVCTFSSQVC 3-------3333-----------33333333-------------1111-----1111--- RVAYEIMQTLHPDASANFHSLDDIYYFGGQNAHNQIAIYAHQPRTADEIPMEPGDIIGVA ------1111---1111---------2222--------------1111------------ GNHWDGYSKGVNRKLGRTGLYPSYKVREKIETVKYPTYPE -----------3333------------------------- >DIBENZOTHIOPHENE DESULFUR; SWP:P54997; PDB:2DE3A; DTLTYSNSPVPNALLTASESGFLDAAGIELDVLSGQQGTVHFTYDQPAYTRFGGEIPPLL -----------------------1111------3333----------------------- SEGLRAPGRTRLLGITPLLGRQGFFVRDDSPITAAADLAGRRIGVSASAIRILRGQLGDY -----2222-----------------1111---33332222---------------!!!! 
LELDPWRQTLVALGSWEARALLHTLEHGELGVDDVELVPISSPGVDVPAEQLEESATVKG -------------------------1111-1111-------2222-------------33 ADLFPDVARGQAAVLASGDVDALYSWLPWAGELQATGARPVVDLGLDERNAYASVWTVSS 33-----------1111-------------------------33331111---------- GLVRQRPGLVQRLVDAAVDAGLWARDHSDAVTSLHAANLGVSTGAVGQGFGADFQQRLVP 3333-----------------3333----------------3333-------3333---- RLDHDALALLERTQQFLLTNNLLQEPVALDQWAAPEFLNNSLNR -----------------1111------3333---------1111 >TERMINAL OXYGENASE COMPON; SWP:Q84II6; PDB:2DE6A; MANVDEAILKRVKGWAPYVDAKLGFRNHWYPVMFSKEINEGEPKTLKLLGENLLVNRIDG ----3333---3333-----1111---------3333-2222-----iiii------iii KLYCLKDRCLHRGVQLSVKVECKTKSTITCWYHAWTYRWEDGVLCDILTNPTSAQIGRQK i-------------3333-----1111-------------------1111--3333---- LKTYPVQEAKGCVFIYLGDGDPPPLARDTPPNFLDDDMEILGKNQIIKSNWRLAVENGFD --------iiii-----------3333--2222-1111-----------3333------1 PSHIYIHKDSILVKDNDLALPLGFAPGGDRKQQTRVVDDDVVGRKGVYDLIGEHGVPVFE 1113333---------------------3333-------1111------1111------- GTIGGEVVREGAYGEKIVANDISIWLPGVLKVNPFPNPDMMQFEWYVPIDENTHYYFQTL --iiii------------------------------1111---------1111------- GKPCANDEERKKYEQEFESKWKPMALEGFNNDDIWAREAMVDFYADDKGWVNEILFESDE ----------------------------3333---------1111-3333-----3333- AIVAWRKLASEHNQGIQTQAHVSGLEHHH -----------------3333-------- >Ferredoxin component of c; SWP:Q8GI16; PDB:2DE6D; IWLKVCAASDMQPGTIRRVNRVGAAPLAVYRVGDQFYATEDTCTHGIASLSEGTLDGDVI ------3333-2222---------------------------1111--3333---!!!!- ECPFHGGAFNVCTGMPASSPCTVPLGVFEVEVKEGEVYVAGEKK --------------------------------%%%%-------- >Carnitine O-palmitoyltran; SWP:P18886; PDB:2DEBA; DDYLQHSIVPTHYQDSLPRLPIPKLEDTKRYLNAQKPLLDDSQFRRTEALCKNFETGVGK ------------3333-------3333-------3333---------------------- ELHAHLLAQDKQNKHTSYISGPWFDYLTARDSIVLNFNPFAFNPDPKSEYNDQLTRATNL ------------11111111-----3333------------------------------- TVSAVRFLKTLQAGLLEPEVFHLNPSKSDTDAFKRLIRFVPPSLSWYGAYLVNAYPLDSQ -----------------------3333--------3333-3333---------------3 
YFRLFNSTRIPRPNRDELFTDTKARHLLVLRKGHFYVFDVLDQDGNIVNPLEIQAHLKYI 333-----------------3333------iiii-------1111---3333-------1 LSDSSPVPEFPVAYLTSENRDVWAELRQKLIFDGNEETLKKVDSAVFCLCLDDFPKDLIH 111-------33331111------------1111-------------------------- LSHTLHGDGTNRWFDKSFNLIVAEDGTAAVHFEHSWGDGVAVLRFFNEVFRDSTQTPAIT ------------3333------1111-------1111----------------------1 PQSQPAATNSSASVETLSFNLSGALKAGITAAKEKFDTTVKTLSIDSIQFQRGGKEFLKK 111-----3333---------------------------1111---------------11 KQLSPDAVAQLAFQAFLRQYGQTVATYESCSTAAFKHGRTETIRPASIFTKRCSEAFVRD 11-----------------------------3333------------------------1 PSKHSVGELQHAECSKYHGQLTKEAAGQGFDRHLYALRYLATARGLNLPELYLDPAYQQN 111--------------------------------------1111---3333-3333--- HNILSTSTLNSPAVSLGGFAPVVPDGFGIAYAVHDDWIGCNVSSYSGRNAREFLHCVQKC ----------1111-------------------1111-------1111------------ LEDIFDALEGKAIKT ------1111----- >PROBABLE DIPHTHINE SYNTHA; SWP:O58456; PDB:2DEKA; MVLYFIGLGLYDERDITVKGLEIAKKCDYVFAEFYTSLMAGTTLGRIQKLIGKEIRVLSR -----------1111--------1111-----------1111----------------33 EDVELNFENIVLPLAKENDVAFLTPGDPLVATTHAELRIRAKRAGVESYVIHAPSIYSAV 33---1111--3333-----------1111--3333------------------333333 GITGLHIYKFGKSATVAYPEGNWFPTSYYDVIKENAERGLHTLLFLDIKAEKRMYMTANE 33---3333----------!!!!--3333--------------------1111------- AMELLLKVEDMKKGGVFTDDTLVVVLARAGSLNPTIRAGYVKDLIREDFGDPPHILIVPG -----------------1111------------------33331111------------- KLHIVEAEYLVEIAGAPREILRVNV ------------------------- >441AA LONG HYPOTHETICAL N; SWP:NA; PDB:2DEOA; AKNIVYVAQIKGQITSYTYDQFDRYITIAEQDNAEAIIIELDTPGGRADANIVQRIQQSK --------------3333-----------1111--------------------------- IPVIIYVYPPGASAASAGTYIALGSHLIAAPGTSIGACRTNYFIAYIKSLAQESGRNATI --------2222---------1111----2222--------------------------- AEEFITKDLSLTPEEALKYGVIEVVARDINELLKKSNGKTKIPVNGRYVTLNFTNVEVRY -------------------------------------------iiii------------- LAPSFKDKLISYITD --------------- >THERMOSTABLE CELLOXYLANAS; SWP:P40942; PDB:2DEPA; 
DIPSLAEAFRDYFPIGAAIEPGYTTGQIAELYKKHVNMLVAENAMKPASLQPTEGNFQWA ---3333-1111-------3333-------------------11113333--2222---- DADRIVQFAKENGMELRFHTLVWHNQTPDWFFLDKEGKPMVEETDPQKREENRKLLLQRL ---------1111--------------3333--1111-3333------------------ ENYIRAVVLRYKDDIKSWDVVNEVIEPNDPGGMRNSPWYQITGTEYIEVAFRATREAGGS -------------------------3333iiii--------!!!!--------------- DIKLYINDYNTDDPVKRDILYELVKNLLEKGVPIDGVGHQTHIDIYNPPVERIIESIKKF ---------1111--------------1111-----------------3333-------- AGLGLDNIITELDMSIYSWNDRSDYGDSIPDYILTLQAKRYQELFDALKENKDIVSAVVF 1111-------------1111--------3333-----------------1111------ WGISDKYSWLNGFPVKRTNAPLLFDRNFMPKPAFWAIVDP ---33333333-------------1111------------ >TRNA; SWP:P25745; PDB:2DERA; AKKVIVGMSGGVDSSVSAWLLQQQGYQVEGLFMKNWEEDDGEEYCTAAADLADAQAVCDK -----------3333------1111--------------3333----------------- LGIELHTVNFAAEYWDNVFELFLAEYKAGRTPNPDILCNKEIKFKAFLEFAAEDLGADYI ------------------------------------------------------------ ATGHYVRRADVDGKSRLLRGLDSNKDQSYFLYTLSHEQIAQSLFPVGELEKPQVRKIAED ----------iiii-------3333-3333333333333333--1111-3333------- LGLVKFREFLGRYLPAQPGKIITVDGDEIGEHQGLMYHTLGQRKGLGIGGTKEGTEEPWY ----------------------1111--------11112222------------------ VVDKDVENNILVVAQGHEHPRLMSVGLIAQQLHWVDREPFTGTMRCTVKTRYRQTDIPCT -----1111---------1111-------------------------------------- VKALDDDRIEVIFDEPVAAVTPGQSAVFYNGEVCLGGGIIEQRLPLPV ------------------------------------------------ >Protein-arginine deiminas; SWP:Q9UM07; PDB:2DEXX; GTLIRVTPEQPTHAVCVLGTLTQLDICSSAPTSFSINASPGVVVDITWPLDPGVEVTLTM ----------------2222------1111--------3333--------1111------ KAASGSTGDQKVQISYYGPKTPPVKALLYLTAVEISLCADITRTGKQRTWTWGPCGQGAI -----2222--------1111------------------1111---------1111---- LLVNCDRDNLESSAMDCEDDEVLDSEDLQDMSLMTLSTKTPKDFFTNHTLVLHVARSEMD --------------1111-----33331111---------1111----------333311 KVRVFQATCSVVLGPKWPSHYLMVPGGKHNMDFYVEALAFPDTDFPGLITLTISLLDTSN 11-----------1111------------------------1111--------------1 
LELPEAVVFQDSVVFRVAPWIMTPNTQPPQEVYACSIFENEDFLKSVTTLAMKAKCKLTI 111--------------------1111--------------------------------- CPEEENMDDQWMQDEMEIGYIQAPHKTLPVVFDSPRNRGLKEFPIKRVMGPDFGYVTRGP -3333%%%%-3333--------1111-----------11113333----2222------- QTGGISGLDSFGNLEVSPPVTVRGKEYPLGRILFGDSCYPSNDSRQMHQALQDFLSAQQV -----1111-1111-------iiii-1111----------1111---3333--------- QAPVKLYSDWLSVGHVDEFLSFVPAPDRKGFRLLLASPRSCYKLFQEQQNEGHGEALLFE -------3333---1111------------------------------11111111--22 GIKKKKQQKIKNILSNKTLREHNSFVERCIDWNRELLKRELGLAESDIIDIPQLFKLKEF 22------------------------------------1111-3333----------222 SKAEAFFPNMVNMLVLGKHLGIPKPFGPVINGRCCLEEKVCSLLEPLGLQCTFINDFFTY 2-------3333---!!!!----------iiii----------3333------------3 HIRHGEVHAGTNVRRKPFSFKWWNMVP 333--3333-----------3333--- >PEPTIDE YY; SWP:P10082; PDB:2DEZA; YPIKPEAPGEDASPEELNRYYASLRHYLNLVTRQRY -----------------------------1111--- >STRUCTURAL POLYPROTEIN VP; SWP:Q6S9I7; PDB:2DF7A; IVPFIRSLLMPTTGPASIPDDTLEKHTLRSETSTYNLTVGDTGSGLIVFFPGFPGSIVGA ---------3333--------------------------1111------1111------- HYTLQSNGNYKFDQMLLTAQNLPASYNYCRLVSRSLTVRSSTLPGGVYALNGTINAVTFQ ----1111------------3333------------------------------------ GSLSELTDVSYNGLMSATANINDKIGNVLVGEGVTVLSLPTSYDLGYVRLGDPIPAIGLD -3333----33331111--1111-----3333---------------------------1 PKMVATCDSSDRPRVYTITAADDYQFSSQYQSGGVTITLFSANIDAITSLSIGGELVFHT 111---------------------------2222-------------------------- SVHGLALDATIYLIGFDGTTVITRAVASDNGLTTGIDNLMPFNLVIPTNEITQPITSIKL --------------1111----------------------------3333---------- EIVTSKSGGQAGDQMSWSASGSLAVTIHGGNYPGALRPVTLVAYERVATGSVVTVAGVSN --------------------------2222-2222------------2222--------- FELIPNPELAKNLVTEYGRFDPGAMNYTKLILSERERLGIKTVWPTREYTDFREYFMEVA ------3333----------1111-----------1111--------------------- >325AA LONG HYPOTHETICAL P; SWP:O58246; PDB:2DF8A; MKTLIEIKQTPDGIIKADKVFNKVKDKISLPNRILYLGCGSSHFLSKLLAMVTNMHGGLG ----------3333---------1111------------3333----------1111--- 
IALPCSEFLYSKETYPIGEVELAVGISRSGETTEILLALEKINVKKLGITTRESSLTRMC ---3333---3333------------3333-3333---1111--------------1111 DYSLVVPAIEESVVMTHSFTSFYFAYLQLLRYSYGLPPLNAGEISKATEKSLEYERYIRE -------------------------------1111------------------------- IVESFDFQNIIFLGSGLLYPVALEASLKMKEMSIFWSEAYPTFEVRHGFKAIADEKTLVV --------------!!!!-----------------------------3333--1111--- LMVEEPFEWHEKLVKEFKNQGAKVLVISNSPQDLGQDYSIELPRLSKDANPIPYLPIVQL ------3333-------1111------------------------3333-3333------ LSYYKAVSRGLNPDNPRFLDKVVRW ------1111-33332222------ >HYPOTHETICAL UPF0271 PROT; SWP:Q53WG6; PDB:2DFAA; MKVDLNADAGESYGAFAYGHDREIFPLVSSANLACGFHGGSPGRILEAVRLAKAHGVAVG ------------!!!!---33331111-------------------------1111---- AHPGFPDLVGFGRREMALSPEEVYADVLYQIGALSAFLKAEGLPLHHVKPHGALYLKACR -------1111---------------------------1111------------------ DRETARAIALAVKAFDPGLPLVVLPGTVYEEEARKAGLRVVLEAFPERAYLRSGQLAPRS ---------------1111----2222------1111-------1111--3333---111 MPGSWITDPEEAARRALRMVLEGKVEALDGGEVAVRADTLCIHPNAPEVARAVREALEQA 1-------------------------------------------3333---------111 GVEVRAF 1------ >ENDO-1,4-BETA-XYLANASE 2; SWP:P36217; PDB:2DFBA; TIQPGTGYNNGYFYSYWNDGHGGVTYTNGPGGQFSVNWSNSGNFVGGKGWQPGTKNKVIN --------iiii-----------------!!!!--------------------------- FSGSYNPNGNSYLSVYGWSRNPLIEYYIVENFGTYNPSTGATKLGEVTSDGSVYDIYRTQ -----------------------------------1111---------iiii-------- RVNQPSIIGTATFYQYWSVRRNHRSSGSVNTANHFNAWAQQGLTLGTMDYQIVAVEGYFS -----1111--------------------3333-----1111------------------ SGSASITVS --------- >MALATE DEHYDROGENASE; SWP:P40926; PDB:2DFDA; NAKVAVLGASGGIGQPLSLLLKNSPLVSRLTLYDIAHTPGVAADLSHIETKAAVKGYLGP -------1111------------3333-----------------1111----------33 EQLPDCLKGCDVVVIPAGVPRKPGMTRDDLFNTNATIVATLTAACAQHCPEAMICVIANP 33----2222-----------22223333-------------------1111-------3 VNSTIPITAEVFKKHGVYNPNKIFGVTTLDIVRANTFVAELKGLDPARVNVPVIGGHAGK 333---------1111--1111----3333--------------3333---------!!! 
TIIPLISQCTPKVDFPQDQLTALTGRIQEAGTEVVKAKAGAGSATLSMAYAGARFVFSLV !---3333-----------------------------iiii------------------- DAMNGKEGVVECSFVKSQETECTYFSTPLLLGKKGIEKNLGIGKVSSFEEKMISDAIPEL -1111--------------------------1111------------------------- KASIKKGEDFVKTL ---------3333- >DIADENOSINETETRAPHOSPHATA; SWP:Q83SQ2; PDB:2DFJA; ATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRL -------------------1111------------------------------!!!!--- VLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHA -------------------3333-------1111----------------1111---111 GITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFT 1-1111-----------------1111---------------3333-3333--------- RMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIY -----------------3333------3333--3333---------3333-----2222- ALDTGCCWGGSLTCLRWEDKQYFVQPS ----3333------------------- >COLLYBISTIN II; SWP:Q9QX73; PDB:2DFKA; CLCLGRPLQNRDQMRANVINEIMSTERHYIKHLKDICEGYLKQCRKRRDMFSDEQLKVIF ----------------------------------------------1111---------- GNIEDIYRFQMGFVRDLEKQYNNDDPHLSEIGPCFLEHQDGFWIYSEYCNNHLDACMELS -----------------111133331111-3333---3333------------------- KLMKDSRYQHFFEACRLLQQMIDIAIDGFLLTPVQKICKYPLQLAELLKYTAQDHSDYRY ------------------------3333-------------------11111111----- VAAALAVMRNVTQQINERKRRLENIDKIAQWQASVLDWEGDDILDRSSELIYTGEMAWIY -------------------------------1111------1111--------------- QPYGRNQQRVFFLFDHQMVLCKKDLIRRDILYYKGRIDMDKYEVIDIEDGRDDDFNVSMK 2222---------2222---------1111-------3333------------------- NAFKLHNKETEEVHLFFAKKLEEKIRWLRAFREERKMVQEDEKIGFEISENQKRQAAMTV -------------------3333----------------------------------333 RKASK 3---- >DNA REPAIR AND RECOMBINAT; SWP:Q55075; PDB:2DFLA; TINDLPGISQTVINKLIEAGYSSLETLAVASPQDLSVAAGIPLSTAQKIIKEARDALDIR 3333----3333--3333--------1111-------------3333-------3333-- FKTALEVKKERMNVKKISTGSQALDGLLAGGIETRTMTEFFGEFGSGKTQLCHQLSVNVQ --3333---1111-----------------------------22223333---------- 
LPPEKGGLSGKAVYIDTEGTFRWERIENMAKALGLDIDNVMNNIYYIRAINTDHQIAIVD -3333-------------------------------33331111------------3333 DLQELVSKDPSIKLIVVDSVTSHFRAEYPGRENLAVRQQKLNKHLHQLTRLAEVYDIAVI 3333--------------1111--------1111------------------1111---- ITNQVMHTLYHVPGIRIQLKKSRGNRRIARVVDAPHLPEGEVVFALTEEGIRDA ----------------------------------------------3333---- >probable 2-hydroxyhepta-2; SWP:Q72KR1; PDB:2DFUA; MKILRFNEGRWGVLEGELVLETDGPGGNPTGRRYDLASVTLLPPATPTKIVCVGRNYRLP --------------!!!!-----2222-------3333---------------------- KEPGLFLKGPNALARPGNPRDPWGTAEPVPYPFFTEELHYEGELAVVVGDRMRHVPPEKA ---------3333----11111111------------------------------3333- LDHVLGYTVAVDITARDVQKKDLQWVRAKSADKFLPLGPWLETDLNPQDTWVRTYVNGTL 1111-------------3333---1111--2222-----------1111------iiii- RQEGHTSQMIFSVAEILSYISTFMTLEPLDVVLTGTPEGVGALRPGDRLEVAVEGVGTLF ----3333------------------2222-------------2222-----2222---- TLIGPKEERPW ----------- >SALT-TOLERANT GLUTAMINASE; SWP:NA; PDB:2DFWA; MRHPIPDYLASLVTELGAVNPGETAQYIPVLAEADPDRFGIALATPTGRLHCAGDADVEF ---------------------------3333-------------1111------------ TIQSASKPFTYAAALVDRGFAAVDRQVGLNPSGEAFNELSLEAESHRPDNAINAGALAVH ----------------------3333----------3333--3333-------------1 QLLVGPEASRKERLDRAVEISLLAGRRLSVDWETYESEAVSDRNLSLAHLRSYGVLQDSA 111-1111-------------3333-------------------------1111------ EEIVAGYVAQCAVLVTVKDLAVGACLATGGIHPMTGERMLPSIVARRVVSVTSSGYDAAG -------------------------1111------------------------------- QWLADVGIPAKSGVAGGVLGALPGRVGIGVFSPRLDEVGNSARGVLACRRLSEDFRLHLD ------------1111-----2222----------1111--------------------- GDSLGGTAVRFVEREGDRVFLHLQGVIRFGGAEAVLDALTDLRTKPGTGWDAAVYPRWQE ---!!!!-------------------------------3333------------------ AAADRAALSAATGGGAVHEAAAAAA -----------1111---------- >PERIPLASMIC BINDING PROTE; SWP:Q9AJF5; PDB:2DFZA; KPDKLVVWENADDGVQLNNTKKWAGEFTKKTGIQVEVVPVALLKQQEKLTLDGPAGKGAD ----------------------------------------1111--------1111---- LVTWPHDRLGEAVTKGLLQPIQVDNSVKNQFDDVAMKALTYGGKLYGLPKAIESVALIYN 
----3333----1111-------3333---------1111%%%%---------------1 KKLMGQVPATYDELFQYAKANNKPDEQKYGVLFEANNFYYTYFLFAAKGAAVFKEQDGTL 111-------------------3333-------11113333----1111----------- DPNEIGLNSPEAVQGMNEVQKWFTEARLPQSLKADTVNGLFKSGKVAAVINGPWAIKDYQ 3333----------------------------3333----1111-------3333----3 AAGINVGVAPLPKIDGKDAQTFIGVKGWYLSAYSKYPKYATELMQFLTSKEALASRFKET 333----------iiii-------------1111-------------------------- GEIPPQKELLNDPMIKNNPVVNGFAKQASKGVPMPSIPEMGVVWEPINNAHTFVAQGKQT -----3333--3333--3333------1111-----1111-------------------- PEQALNDAVKIMKEKIQTMKQ --------------3333--- >DRP35; SWP:Q2FDH3; PDB:2DG1A; QDLPTLFYSGKSNSAVPIISESELQTITAEPWLEISKKGLQLEGLNFDRQGQLFLLDVFE --------!!!!-------3333------------------------1111------111 GNIFKINPETKEIKRPFVSHKANPAAIKIHKDGRLFVCYLGDFKSTGGIFAATENGDNLQ 1----------------------------1111------!!!!---------1111---- DIIEDLSTAYCIDDMVFDSKGGFYFTDFRGYSTNPLGGVYYVSPDFRTVTPIIQNISVAN -----------------1111---------------------1111-------------- GIALSTDEKVLWVTETTANRLHRIALEDDGVTIQPFGATIPYYFTGHEGPDSCCIDSDDN ----1111------------------1111---2222------------------1111- LYVAMYGQGRVLVFNKRGYPIGQILIPGRDEGHMLRSTHPQFIPGTNQLIICSNDIEMGG ----2222------1111-------2222-------------2222---------1111- GSMLYTVNGFAKGHQSFQFQL ---------------3333-- >GAMMA-GLUTAMYLTRANSPEPTID; SWP:P18956; PDB:2DG5A; EEDVFHPVRAKQGVASVDATATQVGVDILKEGGNAVDAAVAVGYALAVTHPQAGNLGGGG ----------------------------1111---------------------------- FLIRSKNGNTTAIDFREAPAKATRDFLDDQGNPDSKKSLTSHLASGTPGTVAGFSLALDK ----1111----------1111-----1111--3333---1111---------------- YGTPLNKVVQPAFKLARDGFIVNDALADDLKTYGSEVLPNHENSKAIFWKEGEPLKKGDT ---3333-----------------------------1111---------iiii--2222- LVQANLAKSLEIAENGPDEFYKGTIAEQIAQEQKNGGLITKEDLAAYKAVERTPISGDYR -----------------3333----------------------1111-----------ii GYQVYSPPPSSGGIHIVQILNILENFDKKYGFGSADAQIAEAEKYAYADRSEYLGDPDFV ii--------------------1111-3333--3333------------------1111- KVPWQALTNKAYAKSIADQIDINKAKPSSEIRPGKLAPYE 
--3333---------3333-1111--3333-----1111- >Gamma-glutamyltranspeptid; SWP:P18956; PDB:2DG5B; TTHYSVVDKDGNAVAVTYTLNTTFGTGIVAGESGILLNNQDDFSAKPGVPNVYGLVGGDA -------1111----------2222----!!!!------------2222-1111----11 NAVGPNKRPLSSSPTIVVKDGKTWLVTGSPGGSRIITTVLQVVNSIDYGLNVAEATNAPR 11-2222-----------iiii--------!!!!-------------------------- FHHQWLPDELRVEKGFSPDTLKLLEAKGQKVALKEAGSTQSIVGPDGELYGASDPRSVDD ------------------------1111---------------1111------3333--- LTAGY ----- >PUTATIVE TRANSCRIPTIONAL ; SWP:O86531; PDB:2DG6A; RLADLSKRSGVSTATIKYYLREGLLPPGYDEDHLRRLRLVRALIQVGKVPVATAREVLGH -----------------------------------------------------------1 VDDDSLGRTVRLGAALWALPQDAEPDEADPAVAAARVEVDRLLELLGWETSRELAPLSPV 111---3333-----1111------3333-------------------3333-3333--- HRSLVVAVAALRRLDYPWDAELAPYGELEVARRDLDFETHASEAEKVEAVAAAVLFQPVL -----------1111---3333-3333-------------11113333-33331111--- RALHRLAQEEESARRYGIELE --------------------- >PUTATIVE TRANSCRIPTIONAL ; SWP:Q9RK42; PDB:2DG7A; ARWDPGAEQRLKRAALELYSEHGYDNVTVTDIAERAGLTRRSYFRYFPDKREVLFGGSEL 1111------------------3333-------1111-33331111----3333------ LPPAVARAVLAADPGAAPLTAVLDASQVGAQLVAQVEGAAQRRAVIDASPELQERERTKS ------------1111----------1111-----2222--------------------- AAISRAVQDALVRRQVDADTAELVAQLATVAFGSAFRRWIDAEGHADFGSCLDTVTDRLR -----------1111--------------------------iiii--------------- AVLTG ----- >putative tetR-family tran; SWP:Q93J02; PDB:2DG8A; DPQRRERILAATLDLIAEEGIARVSHRRIAQRAGVPLGSMTYHFTGIEQLLREAFGRFTD -------------------3333------------3333--------------------- HIVAVFDEHLGAAADRDEAREAVADLVHELSEDSQRDLVLTQELYTLAARQPAYRELTHE --------3333--------------------------------------3333------ WMRRSRVHLEKHFDPGTARQLDALIEGLTLHRALAREPHGRALTLEAIARITTTD -------------------------------1111--------------1111-- >BETA-GLUCOSIDASE; SWP:Q1XH05; PDB:2DGAA; GPVFTKLKPWQIPKRDWFDKDFLFGASTSAYQIEGAWNEDGKGPSTWDHFCHTYPERISD -------1111--3333-1111------3333------iiii-----------3333111 MTNGDVAANSYHLYEEDVKALKDMGMKVYRFSISWSRILPDGTGKVNQAGIDYYNKLINS 
1----!!!!------------------------3333-1111------------------ LIDNDIVPYVTIWHWDTPQALEDKYGGFLNRQIVDDYKQFAEVCFKNFGDRVKNWFTFNE -1111--------------------!!!!------------------------------- PHTYCCFSYGEGIHAPGRCSPGMDCAVPEGDSLREPYTAGHHILLAHAEAVQLFKARYNM -------------------2222-------1111------------------------11 HGDSKIGMAFDVMGYEPYQDSFLDDQARERSIDYNMGWFLEPVVRGDYPFSMRSLIGDRL 11-----------------------------------------------------!!!!- PMFTKEEQEKLASSCDIMGLNYYTSRFSKHVDMSPDFTPTLNTDDAYASSETTGSDGNDI ----------2222-------------------1111---3333---------1111--- GPITGTYWIYMYPKGLTDLLLIMKEKYGNPPVFITENGIADVEGDESMPDPLDDWKRLDY -----1111--3333--------------------------2222----1111------- LQRHISAVKDAIDQGADVRGHFTWGLIDNFEWSLGYSSRFGLVYIDKNDGNKRKLKKSAK -----------1111---------------!!!!-----------1111----------- WFSKFNSVPKP ----------- >HYPOTHETICAL PROTEIN PURS; SWP:Q5SI58; PDB:2DGBA; PRYQATLLIELKKGILDPQGRAVEGVLKDLGHPVEEVRVGKVLEIVFPAENLLEAEEKAK -----------2222------------1111----------------------------- AGALLANPVEVYALEALKELP --------------------- ------------------------------------------------- >223aa long hypothetical a; SWP:Q974K3; PDB:2DGDA; PGGRGRIGVILPANNAGEYDLWKAPEGVSIHSTRKPTKGCEPENVEEFEKELKYSYSLLA -1111------1111---------2222-------------------------------- EVSDIIIYGRTYGTHKHAHVIKRVIKDVVIPEESVYELLKKLNVRKLWIGTPYIKERTLE ----------2222----------2222-3333------1111----------------- EVEWWRNKGFEIVGYDGLGKIRGIDISNTPIFTIYRLVKRHLNEVLKADAVYIACTALST ----3333---------------------1111----------3333-------3333-3 YEAVQYLHEDLDPVVSENAAAWEALNKLKIKAKLPGF 333------------3333------------------ >HYPOTHETICAL PROTEIN EBHA; SWP:Q931R6; PDB:2DGJA; AGQLQHGIDDENATKQTQKYRDAEQSKKTAYDQAVAAAKAILNKQDKAAVDRALQQVTST ------------------------------------------------------------ KDALNGDAKLAEAKAAARQNLGTLNHITNAQRTALEGQINQATTVDGVNTVKTNANTLDG --------------------1111------------------------------------ ANSLQGAINDKDATLRNQNYLDADESKRNAYTQAVTAAEGILNKQTGGNTSKADVDNALN -----1111----------1111------------------------------------- 
AVTRAKAALNGAENLRNAKTSATNTINGLPNLTQLQKDNLKHQVEQAQNVVGVNGVKDKG -----1111----------------1111----------------------------333 NLEH 3--- >GLUTAMATE DECARBOXYLASE B; SWP:P69910; PDB:2DGKA; SKRFPLHEMRDDVAFQIINDELYLDGNARQNLATFCQTWDDENVHKLMDLSINKNWIDKE ---------------------------1111-----------------1111--1111-- EYPQSAAIDLRCVNMVADLWHAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRWRKRMEAA -----------------1111---1111-----------------------------111 GKPTDKPNLVCGPVQICWHKFARYWDVELREIPMRPGQLFMDPKRMIEACDENTIGVVPT 1---------------------------------2222--------11111111------ FGVTYTGNYEFPQPLHDALDKFQADTGIDIDMHIDAASGGFLAPFVAPDIVWDFRLPRVK -----------------------------------------3333-1111--3333---- SISASGHKFGLAPLGCGWVIWRDEEALPQELVFNVDYLGGQIGTFAINFSRPAGQVIAQY ----1111--------------3333-3333-----1111-------------------- YEFLRLGREGYTKVQNASYQVAAYLADEIAKLGPYEFICTGRPDEGIPAVCFKLKDGEDP ---------------------------3333----------1111---------2222-- GYTLYDLSERLRLRGWQVPAFTLGGEATDIVVMRIMCRRGFEMDFAELLLEDYKASLKYL -----------1111--------------------------3333--------------- SDHPKLQGIAQQNSFKHT --1111------------ >Cytotoxic granule-associa; SWP:Q8CII5; PDB:2DGOA; GSSGSSGQKKDTSNHFHVFVGDLSPEITTEDIKAAFAPFGRISDARVVKDMATGKSKGYG -----------------------11113333-----1111-------------------- FVSFFNKWDAENAIQQMGGQWLGGRQIRTNWATRKPPAPKSTYESNTKQSGPSSG ------3333------2222----------------------------------- >BRUNO-LIKE 4, RNA BINDING; SWP:Q5R8W7; PDB:2DGPA; GSSGSSGMKDHDAIKLFIGQIPRNLDEKDLKPLFEEFGKIYELTVLKDRFTGMHKGCAFL -------------------------1111------------------------------- TYCERESALKAQSALHEQKTLPGMNRPIQVKPADSESRGGSGPSSG ---3333--------------------------------------- >BRUNO-LIKE 6, RNA BINDING; SWP:Q7TN33; PDB:2DGQA; GSSGSSGVPMKDHDAIKLFVGQIPRGLDEQDLKPLFEEFGRIYELTVLKDRLTGLHKGCA ---------------------------3333--3333----------------------- FLTYCARDSALKAQSALHEQKTLPGMNRPIQVKPAASEGRGESGPSSG ------3333------------2222---------------------- >RING FINGER AND KH DOMAIN; SWP:Q86XN8; PDB:2DGRA; 
GSSGSSGGQTTIQVRVPYRVVGLVVGPKGATIKRIQQRTHTYIVTPGRDKEPVFAVTGMP ----------------33333333----------------------------------11 ENVDRAREEIEAHITLRSGPSSG 11--------3333--------- >DAZ-ASSOCIATED PROTEIN 1; SWP:Q96EP5; PDB:2DGSA; GSSGSSGSKSNKIFVGGIPHNCGETELREYFKKFGVVTEVVMIYDAEKQRPRGFGFITFE -------------------------------3333----------3333----------- DEQSVDQAVNMHFHDIMGKKVEVKRAEPRDSKSSGPSSG --------3333---%%%%-------------------- >RNA-BINDING PROTEIN 30; SWP:Q9BQ04; PDB:2DGTA; GSSGSSGKASTKLHVGNISPTCTNQELRAKFEEYGPVIECDIVKDYAFVHMERAEDAVEA ------------------11113333--------------------------3333---- IRGLDNTEFQGKRMHVQLSTSRLRTASGPSSG -------------------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:O60506; PDB:2DGUA; GSSGSSGMAKVKVLFVRNLANTVTEEILEKAFSQFGKLERVKKLKDYAFIHFDERDGAVK ------------------------------------------------------------ AMEEMNGKDLEGENIEIVFAKPPDQKRKERKAQRQAASGPSSG ---------iiii------------------------------ >HETEROGENEOUS NUCLEAR RIB; SWP:P52272; PDB:2DGVA; GSSGSSGACQIFVRNLPFDFTWKMLKDKFNECGHVLYADIKMENGKSKGCGVVKFESPEV ----------------3333--------3333----------%%%%-------------- AERACRMMNGMKLSGREIDVRIDRNASGPSSG -------2222-%%%%---------------- >PROBABLE RNA-BINDING PROT; SWP:Q9Y4C8; PDB:2DGWA; GSSGSSGTTCHTVKLRGAPFNVTEKNVMEFLAPLKPVAIRIVRNAHGNKTGYIFVDFSNE ------------------------------------------------------------ EEVKQALKCNREYMGGRYIEVFREKSGPSSG ----3333---------------3333---- >KIAA0430 PROTEIN; SWP:Q9Y4F3; PDB:2DGXA; GSSGSSGNGADVQVSNIDYRLSRKELQQLLQEAFARHGKVKSVELSPHTDYQLKAVVQME -----------------33333333----------------------------------- NLQDAIGAVNSLHRYKIGSKKILVSLATGASGPSSG -3333------------------------------- >MGC11102 PROTEIN; SWP:Q9BSC1; PDB:2DGYA; GSSGSSGEHIVPSNQQQIVRVLRTPGNNLHEVETAQGQRFLVSMPSKYRKNIWIKRGDFL ------------3333-----------------1111-------1111------2222-- IVDPIEEGEKVKAEISFVLCKDHVRSLQKEGFWPEAFSEVAEKHNSGPSSG ----------------------------------------3333------- >YPL069C; SWP:NA; PDB:2DH4A; TKNKMEAKIDELINNDPVWSSQNESLISKPYNHILLKPGKNFRLNLIVQINRVMNLPKDQ 
-----------1111--------------------------------------------- LAIVSQIVELLHNSSLLIDDIEDNAPLRRGQTTSHLIFGVPSTINTANYMYFRAMQLVSQ ---------------------------iiii-3333-------------------3333- LTTKEPLYHNLITIFNEELINLHRGQGLDIYWRDFLPEIIPTQEMYLNMVMNKTGGLFRL ------------------------------------------------------------ TLRLMEALSPSSHHGHSLVPFINLLGIIYQIRDDYLNLKDFQMSSEKGFAEDITEGKLSF --------------------------------------------------3333------ PIVHALNFTKTKGQTEQHNEILRILLLRTSDKDIKLKLIQILEFDTNSLAYTKNFINQLV ------------------------3333-------------------------------- NMIKNDNENKYLPDELLYIIDHLSEL --1111---------------3333- >NUCLEOLYSIN TIAR; SWP:Q01085; PDB:2DH7A; GSSGSSGQKKDTSNHFHVFVGDLSPEITTEDIKSAFAPFGKISDARVVKDMATGKSKGYG -----------------------11113333----3333--------------------- FVSFYNKLDAENAIVHMGGQWLGGRQIRTNWATRKPPAPSGPSSG ----------------2222------------------------- >DAZ-ASSOCIATED PROTEIN 1; SWP:DAZP1_HUMAN; PDB:2DH8A; GMNNSGADEIGKLFVGGLDWSTTQETLRSYFSQYGEVVDCVIMKDKTTNQSRGFGFVKFK --------1111------3333--3333--3333-------------------------- DPNCVGTVLASRPHTLDGRNIDPKPCTPRGMQPSGPSSG 1111-----------%%%%-------------------- >FLJ20171 PROTEIN; SWP:Q6NXG1; PDB:2DHAA; GSSGSSGGGTSNEVAQFLSKENQVIVRMRGLPFTATAEEVVAFFGQHCPITGGKEGILFV -------------------------------11113333----------2222------- TYPDGRPTGDAFVLFACEEYAQNALRKHKDLLGKRYIELFRSTAAEVQQVLNRFSSASGP ----------------3333-----------%%%%----------------3333----- SSG --- >TRNA SELENOCYSTEINE ASSOC; SWP:Q5R462; PDB:2DHGA; GSSGSSGPEYSLFVGDLTPDVDDGMLYEFFVKVYPSCRGGKVVLDQTGVSKGYGFVKFTD ---------------------3333----3333-----------3333-----------3 ELEQKRALTECQGAVGLGSKPVRLSVAIPKASRVKPVESGPSSG 333----------------------------------------- >Pleckstrin homology domai; SWP:Q9QZC7; PDB:2DHIA; GSSGSSGFVKSGWLLRQSTILKRWKKNWFDLWSDGHLIYYDDQTRQSIEDKVHMPVDCIN -------------------------------1111------3333--------------- IRTGHECRDIQPPDGKPRDCLLQIVCRDGKTISLCAESTDDCLAWKFTLQDSRTSGPSSG ---3333----------1111-----------------------------3333------ >TBC1 DOMAIN FAMILY MEMBER; 
SWP:Q9BYX2; PDB:2DHKA; GSSGSSGKKLCGYLSKFGGKGPIRGWKSRWFFYDERKCQLYYSRTAQDANPLDSIDLSSA --------------------------------------------3333-------3333- VFDCKADAEEGIFEIKTPSRVITLKAATKQAMLYWLQQLQMKRWEFHNSPPAPSGPSSG ------3333-----------------3333---------------------------- >PROTEIN BOLA; SWP:P0ABE2; PDB:2DHMA; GSSGSSGMMIRERIEEKLRAAFQPVFLEVVDESYRHNVPAGSESHFKVVLVSDRFTGERF ----------------3333-------------------------------3333----- LNRHRMIYSTLAEELSTTVHALALHTYTIKEWEGLQDTVFASPPCRG 3333-------------------------3333-------------- >poly (ADP-ribose) polymer; SWP:Q96K72; PDB:2DHXA; GSSGSSGGVAVEVRGLPPAVPDELLTLYFENRRRSGGGPVLSWQRLGCGGVLTFREPADA ----------------3333------------------------------------3333 ERVLAQADHELHGAQLSLRPAPPRAPARLLLQGLPPGTSGPSSG --3333-------------------------------------- >CUE DOMAIN-CONTAINING PRO; SWP:Q9NWM3; PDB:2DHYA; GSSGSSGRPARQVRRLEFNQAMDDFKTMFPNMDYDIIECVLRANSGAVDATIDQLLQMNL ----------------------------33333333----------3333---------- ESGPSSG ------- >Rap guanine nucleotide ex; SWP:Q5R9B2; PDB:2DHZA; GSSGSSGDEIFCRVYMPDHSYVTIRSRLSASVQDILGSVTEKLQYSEEPAGREDSLILVA ---------------1111-------------------3333------------------ VSSSGEKVLLQPTEDCVFTALGINSHLFACTRDSYEALVPLPEEIQVSPGDTEISGPSSG ----------------3333----------11113333---------------------- >Activating signal cointeg; SWP:Q9H1I8; PDB:2DI0A; GSSGSSGMCGVELDSLISQVKDLLPDLGEGFILACLEYYHYDPEQVINNILEERLAPTLS -------------------------------------------------1111--1111- QLDRNLDREMN ----------- >CELL DIVISION PROTEIN FTS; SWP:O67077; PDB:2DI4A; TISPKEKEKIAIHEAGHALMGLVSDDDDKVHKISIIKHIYDKKDLYNKILVLLGGRAAEE --3333------------------------------------------------------ VFFGKDGITTGAENDLQRATDLAYRMVSMWGMSDKVGPIAIRRTAVDTSPDLLREIDEEV ---1111-3333---------------------3333----------------------- KRIITEQYEKAKAIVEEYKEPLKAVVKKLLEKETITCEEFVEVFKLYGIELKDKCK --------------------------------------------1111-------- >BK158_1; SWP:Q6UW63; PDB:2DI7A; GSSGSSGETGGERQLSPEKSEIWGPGLKADVVLPARYFYIQAVDTSGNKFTSSPGEKVFQ 
----------------3333-----------------------1111------------- VKVSAPEEQFTRVGVQVLDRKDGSFIVRYRMYASYKNLKVEIKFQGQHVAKSPYILKGSG -------------------------------------------%%%%------------- PSSG ---- >FILAMIN-B; SWP:O75369; PDB:2DI8A; GSSGSSGIGDARRAKVYGRGLSEGRTFEMSDFIVDTRDAGYGGISLAVEGPSKVDIQTED ---------3333----------------------1111--------------------- LEDGTCKVSYFPTVPGVYIVSTKFADEHVPGSPFTVKISGEGRVKSGPSSG 3333-------------------%%%%------------------------ >FILAMIN-B; SWP:O75369; PDB:2DI9A; GSSGSSGDVTYDGHPVPGSPYTVEASLPPDPSKVKAHGPGLEGGLVGKPAEFTIDTKGAG ------------------1111-------3333----3333------------------- TGGLGLTVEGPCEAKIECSDNGDGTCSVSYLPTKPGEYFVNILFEEVHIPGSPFKADIEM -------------------------------------------%%%%------------- PFDPSSGPSSG ----------- >FILAMIN-B; SWP:O75369; PDB:2DIAA; GSSGSSGPFDPSKVVASGPGLEHGKVGEAGLLSVDCSEAGPGALGLEAVSDSGTKAEVSI ---------3333----3333---2222---------------------1111------- QNNKDGTYAVTYVPLTAGMYTLTMKYGGELVPHFPARVKVEPAVDTSSGPSSG --1111-------------------iiii------------------------ >FILAMIN-B; SWP:O75369; PDB:2DIBA; GSSGSSGHFPARVKVEPAVDTSRIKVFGPGIEGKDVFREATTDFTVDSRPLTQVGGDHIK ---------------------------3333----------------------------- AHIANPSGASTECFVTDNADGTYQVEYTPFEKGLHVVEVTYDDVPIPNSPFKVAVTEGCQ ----1111--------------------------------%%%%---------------- PSSGPSSG -------- >FILAMIN-B; SWP:O75369; PDB:2DICA; GSSGSSGGCQPSRVQAQGPGLKEAFTNKPNVFTVVTRGAGIGGLGITVEGPSESKINCRD ---------3333----3333--------------------------------------- NKDGSCSAEYIPFAPGDYDVNITYGGAHIPGSPFRVPVKDVVDPS --------------------------------------------- >TRIPARTITE MOTIF PROTEIN ; SWP:Q9HCM9; PDB:2DIDA; GSSGSSGESLCPQHHEALSLFCYEDQEAVCLICAISHTHRAHTVVPLSGPSSG -------------------------------3333------------------ >AMYLASE; SWP:O82839; PDB:2DIEA; TNGTMMQYFEWHLPNDGNHWNRLRDDAANLKSKGITAVWIPPAWKGTSQNDVGYGAYDLY ------------------------------1111-------------3333-3333-111 DLGEFNQKGTVRTKYGTRSQLQGAVTSLKNNGIQVYGDVVMNHKGGADGTEMVNAVEVNR 
1-----%%%%--1111------------1111--------------------------11 SNRNQEISGEYTIEAWTKFDFPGRGNTHSNFKWRWYHFDGTDWDQSRQLQNKIYKFRGTG 11------------------3333---------3333--------------------222 KAWDWEVDIENGNYDYLMYADIDMDHPEVINELRNWGVWYTNTLNLDGFRIDAVKHIKYS 2-------2222----------1111---------------------------1111333 YTRDWLTHVRNTTGKPMFAVAEFWKNDLAAIENYLNKTSWNHSVFDVPLHYNLYNASNSG 3---------------------------------------------------------ii GYFDMRNILNGSVVQKHPIHAVTFVDNHDSQPGEALESFVQSWFKPLAYALILTREQGYP ii-3333-222233331111------33332222------3333---------------- SVFYGDYYGIPTHGVPSMKSKIDPLLQARQTYAYGTQHDYFDHHDIIGWTREGDSSHPNS --3333---3333----3333--------------------------------3333--- GLATIMSDGPGGNKWMYVGKHKAGQVWRDITGNRSGTVTINADGWGNFTVNGGAVSVWVK ------------------1111------1111--------1111---------------- Q - >LAMIN-B RECEPTOR; SWP:Q14739; PDB:2DIGA; GSSGSSGMPSRKFADGEVVRGRWPGSSLYYEVEILSHDSTSQLYTVKYKDGTELELKEND -------------2222---------------------------------------3333 IKSGPSSG -------- >TFIIH basal transcription; SWP:P32780; PDB:2DIIA; GSSGSSGFKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNSGPSS ------------3333--------------------1111--3333-------------- G - >Proline-serine-threonine ; SWP:O43586; PDB:2DILA; GSSGSSGAQEYRALYDYTAQNPDELDLSAGDILEVILEGEDGWWTVERNGQRGFVPGSYL --------------------------------------3333-----iiii----3333- EKLSGPSSG --------- >ZINC FINGER SWIM DOMAIN-C; SWP:Q8NEG5; PDB:2DIPA; GSSGSSGLEEFKNSSKLVAAAEKERLDKHLGIPCNNCKQFPIEGKCYKCTECIEYHLCQE -----------------------------------------------------------3 CFDSYCHLSHTFTFREKRNQKWRSLEKRADEVSGPSSG 333--3333----------------------------- >TUDOR AND KH DOMAIN-CONTA; SWP:Q9Y2W6; PDB:2DIQA; GSSGSSGRSLQLDKLVNEMTQHYENSVPEDLTVHVGDIVAAPLPTNGSWYRARVLGTLEN --------3333---------3333----------------------------------- GNLDLYFVDFGDNGDCPLKDLRALRSDFLSLPFQAIECSLARIASGPSSG ----------------3333----3333---------------------- >THUMP DOMAIN-CONTAINING P; SWP:Q9NXG2; PDB:2DIRA; GSSGSSGKAFLEDMKKYAETFLEPWFKAPNKGTFQIVYKSRNNSHVNREEVIRELAGIVC 
-------3333----------1111----------------------3333--------- TLNSENKVDLTNPQYTVVVEIIKAVCCLSVVKSGPSSG --3333---------------%%%%------------- >UNNAMED PROTEIN PRODUCT; SWP:Q5RBZ7; PDB:2DISA; GSSGSSGNCRLFIGGIPKMKKREEILEEIAKVTEGVLDVIVYASAADKMKNRGFAFVEYE ----------------33333333------------------------------------ SHRAAAMARRKLMPGRIQLWGHQIAVDWAEPEIDVDEDVMETVSGPSSG 33331111---1111---iiii-----------3333------------ >HIV TAT SPECIFIC FACTOR 1; SWP:Q5RB63; PDB:2DITA; GSSGSSGGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQIRKLLLFD --------------------------3333------------------------------ RHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQSGPSSG ------------33331111--------iiii-------------------- >KIAA0430 PROTEIN; SWP:Q9Y4F3; PDB:2DIUA; GSSGSSGCHTLLYVYNLPANKDGKSVSNRLRRLSDNCGGKVLSITGCSAILRFINQDSAE -----------------33333333-------------------!!!!--------3333 RAQKRMENEDVFGNRIIVSFTPKNRELCETSGPSSG -11111111--------------------------- >TRNA SELENOCYSTEINE ASSOC; SWP:NA; PDB:2DIVA; GSSGSSGMAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIRNRLTGIPAGYCFVEFA ---------------------3333----------------------------------- DLATAEKCLHKINGKPLPGATPAKRFKLNYATYSGPSSG --------------------------------------- >PUTATIVE RNA-BINDING PROT; SWP:Q80TJ3; PDB:2DIWA; GSSGSSGDNMEAVKTFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFI --------------------3333------3333----------3333------------ QKCKPEYKVPGLYVIDSIVRQSRHQFGQEKDVFAPRFSNNIISTFQNLYRCPGDDKSKIV ---3333-------------------------3333-----------3333---3333-- RVLNLWQKNNVFKSEIIQPLLDMAAGSGPSSG ------------3333-----3333------- >Interferon-inducible doub; SWP:O75569; PDB:2DIXA; GSSGSSGKTPIQVLHEYGMKTKNIPVYECERSDVQIHVPTFTFRVTVGDITCTGEGTSKK --------1111----3333---------------------------------------- LAKHRAAEAAINILKANASGPSSG ------------------------ >THIOREDOXIN DOMAIN-CONTAI; SWP:Q8NBS9; PDB:2DIZA; GSSGSSGTVLALTENNFDDTIAEGITFIKFYAPWCGHCKTLAPTWEELSKKEFPGLAGVK -------------------3333------------3333-------3333--2222---- IAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 
--------33333333----------------------------------------- >THIOREDOXIN-RELATED TRANS; SWP:Q561W0; PDB:2DJ0A; GSSGSSGYIKYFNDKTIDEELERDKRVTWIVEFFANWSNDCQSFAPIYADLSLKYNCTGL --------------3333----------------11113333------------------ NFGKVDVGRYTDVSTRYKVSTSPLTKQLPTLILFQGGKEAMRRPQIDKKGRAVSWTFSEE -----333333333333-----------------%%%%--------3333-------333 NVIREFNLNELSGPSSG 3--33333333------ >PROTEIN DISULFIDE-ISOMERA; SWP:P08003; PDB:2DJ1A; GSSGSSGDDDLEVKEENGVWVLNDGNFDNFVADKDTVLLEFYAPWCGHCKQFAPEYEKIA ---------------iiii---3333----------------11113333---------- STLKDNDPPIAVAKIDATSASMLASKFDVSGYPTIKILKKGQAVDYDGSRTQEEIVAKVR ---------------3333-----1111----------iiii--------3333------ EVSQPDWTPPPEVTSGPSSG ---3333------------- >PROTEIN DISULFIDE-ISOMERA; SWP:P08003; PDB:2DJ2A; GSSGSSGVTLSLTKDNFDDVVNNADIILVEFYAPWCGHCKKLAPEYEKAAKELSKRSPPI ------------1111---3333-------------3333------------1111---- PLAKVDATEQTDLAKRFDVSGYPTLKIFRKGRPFDYNGPREKYGIVDYMIEQSGSGPSSG -----33333333-1111------------------------------------------ >PROTEIN DISULFIDE-ISOMERA; SWP:P08003; PDB:2DJ3A; GSSGSSGPVKVVVGKTFDAIVMDPKKDVLIEFYAPWCGHCKQLEPIYTSLGKKYKGQKDL ------------1111---------------------3333------------------- VIAKMDATANDITNDQYKVEGFPTIYFAPSGDKKNPIKFEGGNRDLEHLSKFIDEHATKR -----3333----------------------3333------------------------- SRTKEELSGPSSG ------------- >FILAMIN-B; SWP:O75369; PDB:2DJ4A; GSSGSSGVVDPSKVKIAGPGLGSGVRARVLQSFTVDSSKAGLAPLEVRVLGPRGLVEPVN ---------1111----3333---------------1111----------3333------ VVDNGDGTHTVTYTPSQEGPYMVSVKYADEEIPRSPFKVKVLPTYDAS --------------------------%%%%------------------ >SIROHYDROCHLORIN COBALTOC; SWP:O29537; PDB:2DJ5A; GMRRGLVIVGHGSQLNHYREVMELHRKRIEESGAFDEVKIAFAARKRRPMPDEAIREMNC -----------------------------3333--------------------------- DIIYVVPLFISYGLHVTEDLPDLLGFPRGRGIKEGEFEGKKVVICEPIGEDYFVTYAILN ------------3333--------------------%%%%-------1111--------- SVFRIG ------ >HYPOTHETICAL PROTEIN PH06; SWP:O58368; PDB:2DJ6A; 
MKSRIIVRTSFDAAHAHGHTFFLEVAIEGEIKNGYVMDFLELRKIVEEITKELDHRNLNN -------------------------------iiii---------------------3333 IFENPTTENIALWIGERIRDKLPPYVKLKRVVLWEGKDNGVELEW ------------------1111----------------------- >ACTIN-BINDING LIM PROTEIN; SWP:O94929; PDB:2DJ7A; GSSGSSGKPIKIRGPSHCAGCKEEIKHGQSLLALDKQWHVSCFKCQTCSVILTGEYISKD --------------------------------%%%%----------------------%% GVPYCESDYHAQFGSGPSSG %%-----3333--------- >PROTEIN CBFA2T1; SWP:NA; PDB:2DJ8A; GSSGSSGINQQEDSSESCWNCGRKASETCSGCNTARYCGSFCQHKDWEKHHHICSGPSSG --------------------------------------3333-1111------------- >MIDLINE-2; SWP:Q9UJV3; PDB:2DJAA; GSSGSSGVEPVPDTHLRGITCLDHENEKVNMYCVSDDQLICALCKLVGRHRDHQVASLND ---------------------------------1111----------------------- RFEKLKQTLEMNLTNLVKSGPSSG ------------------------ >POLYCOMB GROUP RING FINGE; SWP:Q9BYE7; PDB:2DJBA; GSSGSSGNLSELTPYILCSICKGYLIDATTITECLHTFCKSCIVRHFYYSNRCPKCNIVV -------------3333---------------------3333--1111------------ HQTQPLSGPSSG ------------ >DIPEPTIDYL-PEPTIDASE 1; SWP:P53634; PDB:2DJFA; DTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHF ------3333--------------11113333---------------------------- TIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKV -----------%%%%-----------------1111-------1111----------- >Dipeptidyl-peptidase 1 [P; SWP:P53634; PDB:2DJFB; LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS -----1111iiii-------------3333--------------%%%%------------ CSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGG -----!!!!--3333------------3333-------------------------2222 FYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHT --------------------------3333----------- >Dipeptidyl-peptidase 1 [P; SWP:P53634; PDB:2DJFC; PFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAV ----------------------------------3333-iiii--------%%%%----- AATPIPKL -------- >PROTEIN DISULFIDE-ISOMERA; SWP:P55059; PDB:2DJJA; GPLGSEGPVTVVVAKNYNEIVLDDTKDVLIEFYAPWCGHCKALAPKYEELGALYAKSEFK 
------------3333------------------------------------------11 DRVVIAKVDATANDVPDEIQGFPTIKLYPAGAKGQPVTYSGSRTVEDLIKFIAENGKYKA 11------1111-------------------1111---------------1111------ A - >PROTEIN DISULFIDE-ISOMERA; SWP:P55059; PDB:2DJKA; GPLGSPLIGEIGPETYSDYMSAGIPLAYIFAETAEERKELSDKLKPIAEAQRGVINFGTI -----------1111--------------------------------------------- DAKAFGAHAGNLNLKTDKFPAFAIQEVAKNQKFPFDQEKEITFEAIKAFVDDFVAGKIEP ----3333-1111-----------------------------3333------3333---- SIKSEPIPEKQEG ------------- >HOMEOBOX PROTEIN DLX-5; SWP:P56178; PDB:2DJNA; GSSGSSGRKPRTIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQVKIWFQNKRS --------------3333-------3333----3333---------3333------3333 KIKKSGPSSG ---------- >HYPOTHETICAL PROTEIN SB14; SWP:Q9N012; PDB:2DJPA; GSSGSSGCSPVRERRLEHQLEPGDTLAGLALKYGVTMEQIKRANRLYTNDSIFLKKTLYI --------------------11113333-------3333-----------3333------ PILTEPRDLFNSGPSSG ----------------- >SH3 DOMAIN CONTAINING RIN; SWP:Q8BZT2; PDB:2DJQA; GSSGSSGPRAKALCNYRGKNPGDLKFNKGDVILLRRQLDENWYQGEINGVSGIFPASSVE ----------------------------------------------%%%%-----3333- VISGPSSG -------- >ZINC FINGER BED DOMAIN-CO; SWP:Q9BTP6; PDB:2DJRA; GSSGSSGSEAWEYFHLAPARAGHHPNQYATCRLCGRQVSRGPGVNVGTTALWKHLKSMHR -------3333-------------------------------------3333-------- EELEKSGHGQSGPSSG ---------------- >EPHRIN TYPE-B RECEPTOR 1; SWP:P54762; PDB:2DJSA; GSSGSSGPSTVPIMHQVSATMRSITLSWPQPEQPNGIILDYEIRYYEKEHNEFNSSMARS -------------------1111-----------------------33333333------ QTNTARIDGLRPGMVYVVQVRARTVAGYGKFSGKMCFQTLTDSGPSSG -----------------------3333--------------------- >UNNAMED PROTEIN PRODUCT; SWP:Q9H6Y5; PDB:2DJTA; GSSGSSGQASGHFSVELVRGYAGFGLTLGGGRDVAGDTPLAVRGLLKDGPAQRCGRLEVG --------------------------------1111--------------3333------ DLVLHINGESTQGLTHAQAVERIRAGGPQLHLVIRRPLSGPSSG -----%%%%----------------------------------- >RECEPTOR-TYPE TYROSINE-PR; SWP:P10586; PDB:2DJUA; GSSGSSGPKPPIDLVVTETTATSVTLTWDSGNSEPVTYYGIQYRAAGTEGPFQEVDGVAT 
------------------------------------------------------------ TRYSIGGLSPFSEYAFRVLAVNSIGRGPPSEAVRARTGEQSGPSSG ---------------------3333-------------3333---- >METHIONYL-TRNA SYNTHETASE; SWP:P56192; PDB:2DJVA; GSSGSSGTTAKPQQIQALMDEVTKQGNIVRELKAQKADKNEVAAEVAKLLDLKKQLAVAE --------------------------------1111------------------------ GKPPEAPKGKKKKSGPSSG ------------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q5SK07; PDB:2DJWA; MITAFVLIRPRGNRVQALGEAIAELPQVAEVYSVTGPYDLVALVRLKDVEELDDVVTQGI ----------1111----------1111-------------------3333--------1 LSLEGVERTETLLAFRAYPR 111----------------- >SMAD UBIQUITINATION REGUL; SWP:Q9HAU4; PDB:2DJYA; GPLGSGPLPPGWEIRNTATGRVYFVDHNNRTTQFTDPRLSAN -----------------------------------3333--- -------------------------------------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:O43390; PDB:2DK2A; GSSGSSGDPEVMAKVKVLFVRNLATTVTEEILEKSFSEFGKLERVKKLKDYAFVHFEDRG -------3333----------------------------------------------333 AAVKAMDEMNGKEIEGEEIEIVLAKPPDKKRSGPSSG 3------------%%%%-------------------- >E3 UBIQUITIN-PROTEIN LIGA; SWP:Q9ULT8; PDB:2DK3A; GSSGSSGVRSQVLKYMVPGARVIRGLDWKWRDQDGSPQGEGTVTGELHNGWIDVTWDAGG ----------3333---------------------------------%%%%----3333- SNSYRMGAEGKFDLKLAPGYSGPSSG ------1111---------------- >PRE-MRNA-SPLICING FACTOR ; SWP:NA; PDB:2DK4A; GSSGSSGTSSNPVLELELAEEKLPMTLSRQEVIRRLRERGEPIRLFGETDYDAFQRLRKI ---------------------------3333-----1111-------------------- EILTPEVNKGSGPSSG ---------------- >DNA-directed RNA polymera; SWP:Q9H1D9; PDB:2DK5A; GSSGSSGDSQNAGKMKGSDNQEKLVYQIIEDAGNKGIWSRDVRYKSNLPLTEINKILKNL --------------------3333--------3333-----3333---3333-------- ESKKLIKAVKSVAASKKKVYMLYNLSGPSSG ------------------------------- >DNA-directed RNA polymera; SWP:Q921X6; PDB:2DK8A; GSSGSSGPDADPVEIENRIIELCHQFPHGITDQVIQNEMPHIEAQQRAVAINRLLSMGQL -------------------------3333-3333-------------------------- DLLRSNTGLLYRIKDSGPSSG --------------------- >NEDD9-interacting protein; SWP:Q8TDZ2; PDB:2DK9A; 
MGSAGTQEELLRWCQEQTAGYPGVHVSDLSSSWADGLALCALVYRLQPGLLEPSELQGLG -----------------------------3333-------------1111-3333----- ALEATAWALKVAENELGITPVVSAQAVVAGSDPLGLIAYLSHFHSAFKSM -------------------------------3333--------------- >PHOSPHOACETYLGLUCOSAMINE ; SWP:Q9P4V2; PDB:2DKAA; MSIEQTLSQYLPSHPKPQGVTFTYGTAGFRMKADKLDYVTFTVGIIASLRSKYLQGKTVG ---------3333-----------1111---1111------------------iiii--- VMITASNPPEDNGVKVVDPLGSMLESSWEKYATDLANASPSPEKNSLVEVIKNLVSDLKI -------1111------1111---3333---------------------------1111- DLSIPANVVIARDSRESSPALSMATIDGFQSVPNTKYQDFGLFTTPELHYVTRTLNDPDF 1111----------1111----------1111------------------------3333 GKPTEDGYYSKLAKSFQEIYTIEKIDITIDAANGVGAPKIQELLEKYLHKEISFTVVNGD ------------------1111--------%%%%-------------3333--------1 YKQPNLLNFDCGADYVKTNQKLPKNVKPVNNKLYASFDGDADRLICYYQNNDNKFKLLDG 1111111---------------2222-----------1111--------1111------- DKLSTLFALFLQQLFKQIDPTKISLNIGVVQTAYANGSSTKYVEDVLKIPVRCTPTGVKH --------------11111111---------11113333--------------------- LHHEAENFDIGVYFEANGHGTVIFNPEAEKKIFDYKPNNDNEAKAIKVLQNFSQLINQTV --------------1111--------------------------------------3333 GDAISDLLAVLIVVHYLKLSPSDWDNEYTDLPNKLVKVFKTTNAERLVPKGMQDEIDKLV -----------------------1111---------------%%%%--2222-------- AQYPNGRSFVRASAVRVYAEADTQNNVEELSKAVSELVK --2222---------------------------3333-- >METALLO-BETA-LACTAMASE SU; SWP:Q5SLP1; PDB:2DKFA; RIVPFGAAREVTGSAHLLLAGGRRVLLDCGFQGKEEARNHAPFGFDPKEVDAVLLTHAHL -------------------%%%%---------11111111-----3333----------- DHVGRLPKLFREGYRGPVYATRATVLLEIVLEDALKVDEPFFGPEDVEEALGHLRPLEYG ---------1111-------------------------------------1111------ EWLRLGALSLAFGQAGHLPGSAFVVAQGEGRTLVYSGDLGNREKDVLPDPSLPPLADLVL -----------------2222--------------------------------------- AEGTYGDRPHRPYRETVREFLEILEKTLSQGGKVLIPTFAVERAQEILYVLYTHGHRLPR ---------------------------1111----------------------------- APIYLDSPAGRVLSLYPRLVRYFSEEVQAHFLQGKNPFRPAGLEVVEHTEASKALNRAPG --------------33333333------------------------------3333---- 
PVVLAGSGLAGGRILHHLKHGLSDPRNALVFVGYQPQGGLGAEIIARPPAVRILGEEVPL -----------------------3333--------2222-3333--------%%%%---- RASVHTLGGFSGHAGQDELLDWLQGEPRVVLVHGEEEKLLALGKLLALRGQEVSLARFGE ----------------------------------------------1111---------- GVPV ---- >3-HYDROXYBENZOATE HYDROXY; SWP:Q6SSJ6; PDB:2DKHA; MQFHLNGFRPGNPLIAPASPLAPAHTEAVPSQVDVLIVGCGPAGLTLAAQLAAFPDIRTC ---1111----1111---3333-------------------------------1111--- IVEQKEGPMELGQADGIACRTMEMFEAFEFADSILKEACWINDVTFWKPDPGQPGRIARH -------------------------1111--------------------1111------- GRVQDTEDGLSEFPHVILNQARVHDHYLERMRNSPSRLEPHYARRVLDVKVDHGAADYPV ------2222--------3333-----------1111--------------1111----- TVTLERCDAAHAGQIETVQARYVVGCDGARSNVRRAIGRQLVGDSANQAWGVMDVLAVTD -------3333-----------------------1111---------------------- FPDVRYKVAIQSEQGNVLIIPREGGHLVRFYVEMDNITVEQLIATAQRVLHPYKLEVKNV 1111-------1111------2222----------------------------------- PWWSVYEIGQRICAKYDDVVDAVATPDSPLPRVFIAGDACHTHSPKAGQGMNFSMQDSFN ------------------------1111---------------3333-!!!!3333---- LGWKLAAVLRKQCAPELLHTYSSERQVVAQQLIDFDREWAKDPKEFQKYFEQHGRFTAGV -------1111------------------------------1111--------------- GTHYAPSLLTGQAKHQALASGFTVGMRFHSAPVVRVCDAKPVQLGHCGKADGRWRLYAFA ------1111----333311112222----------------3333-------------- AQNDLAQPESGLLALCRFLEGDAASPLRRFTPAGQDIDSIFDLRAVFPQAYTEVALETLP 1111--3333-----------11113333--22221111----------1111-3333-3 ALLLPPKGQLGMIDYEKVFSPDLKNAGQDIFELRGIDRQQGALVVVRPDQYVAQVLPLGD 333----1111-------------22223333--------------1111------1111 HAALSAYFESFMRA -------------- >TRINUCLEOTIDE REPEAT CONT; SWP:Q9HCJ0; PDB:2DKLA; GSSGSSGGMKTSGKQDEAWIMSRLIKQLTDMGFPREPAEEALKSNNMNLDQAMSALLEKK --------------------------------------------%%%%3333---3333- VDVDKRGLGVTDHNGMAAKSGPSSG ------------------------- >COLLAGEN ALPHA-1(XX) CHAI; SWP:Q9P218; PDB:2DKMA; GSSGSSGPLPPPRALTLAAVTPRTVHLTWQPSAGATHYLVRCSPASPKGEEEEREVQVGR --------------------1111------------------------------------ PEVLLDGLEPGRDYEVSVQSLRGPEGSEARGIRARTPTSGPSSG 
-------------------------------------------- >3-ALPHA-HYDROXYSTEROID DE; SWP:Q59718; PDB:2DKNA; SVIAITGSASGIGAALKELLARAGHTVIGIDRGQADIEADLSTPGGRETAVAAVLDRCGG ------1111----------1111---------------------------------%%% VLDGLVCCAGVGVTAANSGLVVAVNYFGVSALLDGLAEALSRGQQPAAVIVGSIAATQPG %----------1111------------------------1111--------------222 AAELPMVEAMLAGDEARAIELAEQQGQTHLAYAGSKYAVTCLARRNVVDWAGRGVRLNVV 2--3333-------------------3333--------------------1111------ APGAVETPLLQASKADPRYGESTRRFVAPLGRGSEPREVAEAIAFLLGPQASFIHGSVLF ---------------------------3333---3333---------1111--------- VDGGMDALMRAKTF ---3333--1111- >CASPASE-3; SWP:P42574; PDB:2DKOA; SGISLDNSYKMDYPEMGLCIIINNKNFHKSTGMTSRSGTDVDAANLRETFRNLKYEVRNK ---------------------------3333----2222-----------1111------ NDLTREEIVELMRDVSKEDHSKRSSFVCVLLSHGEEGIIFGTNGPVDLKKITNFFRGDRC --------------1111-1111-----------2222--1111--33333333-11111 RSLTGKPKLFIIQACRGTELDCGIET 111----------------------- >Caspase-3 [Precursor]; SWP:P42574; PDB:2DKOB; ASGVDDDMACHKIPVEADFLYAYSTAPGYYSWRNSKDGSWFIQSLCAMLKQYADKLEFMH -------------1111--------2222----------------------1111----- ILTRVNRKVATEFESFSFDATFHAKKQIPCIVSMLTKELYFYH ------------------3333--------------------- >Pleckstrin homology domai; SWP:Q9HAU0; PDB:2DKPA; GSSGSSGKRSNSIKRNPNAPVVRRGWLYKQDSTGMKLWKKRWFVLSDLCLFYYRDEKEEG ---------------3333--------------------------%%%%-----3333-- ILGSILLPSFQIALLTSEDHINRKYAFKAAHPNMRTYYFCTDTGKEMELWMKAMLDAALV -----3333------3333-------------------------3333--------3333 QTSGPSSG -------- >KIAA1075 PROTEIN; SWP:Q7Z5T9; PDB:2DKQA; GSSGSSGMSTAADLLRQGAACSVLYLTSVETESLTGPQAVARASSAALSCSPRPTPAVVH ----------3333-----------------------3333------------------- FKVSAQGITLTDNQRKLFFRRHYPVNSITFSSTDPQDRRWTNPDGTTSKIFGFVAKKPGS ---1111----------------1111------3333-------------------1111 PWENVCHLFAELDPDQPAGAIVTFITKVLLGQRKSGPSSG ------------33333333-------------------- >LIN-7 HOMOLOG B; SWP:Q9HAP6; PDB:2DKRA; GSSGSSGVVELPKTDEGLGFNIMGGKEQNSPIYISRVIPGGVADRHGGLKRGDQLLSVNG 
-------------3333----------------------------------------%%% VSVEGEQHEKAVELLKAAQGSVKLVVRSGPSSG %-----3333-------------------3333 >RING finger and CHY zinc ; SWP:Q9CR50; PDB:2DKTA; GSSGSSGGVRNLAQGPRGCEHYDRACLLKAPCCDKLYTCRLCHDTNEDHQLDRFKVKEVQ -------------------------------------------1111------------- CINCEKLQHAQQTCEDCSTLFGEYYCSICHLFDKDKRQYHCESCGICRIGPKEDFFHCLK -----------------------------------------1111-----3333----33 CNLCLTTNLRGKHKCIESGPSSG 33---3333-------------- >KIAA1556 PROTEIN; SWP:Q5VST9; PDB:2DKUA; GSSGSSGANCFTEELTNLQVEEKGTAVFTCKTEHPAATVTWRKGLLELRASGKHQPSQEG ----------------------------------------------------------!! LTLRLTISALEKADSDTYTCDIGQAQSRAQLLVQGRRSGPSSG !!--------3333----------------------------- >HYPOTHETICAL PROTEIN KIAA; SWP:Q9ULI0; PDB:2DKWA; GSSGSSGNTLRELRLFLRDVTKRLATDKRFNIFSKPVSDYLEVIKEPMDLSTVITKIDKH -------3333------------3333---1111--------------------3333-- NYLTAKDFLKDIDLICSNALEYNPDKDPGDKIIRHRACTLKDTAHAIIAAELDPEFNKLC ----3333---------------------------------------------------- EEIKESGPSSG --3333----- >SAM pointed domain-contai; SWP:NA; PDB:2DKXA; GSSGSSGLKDIETACKLLNITADPMDWSPSNVQKWLLWTEHQYRLPPMGKAFQELAGKEL ---------------1111---1111---3333-----1111-----33331111----1 CAMSEEQFRQRSPLGGDVLHAHLDIWKSAASGPSSG 111-3333------------3333------------ >HYPOTHETICAL PROTEIN LOC6; SWP:Q9H706; PDB:2DKZA; GSSGSSGPWQPPADLSGLSIEEVSKSLRFIGLSEDVISFFVTEKIDGNLLVQLTEEILSE --------------------------3333--3333----1111-33333333------- DFKLSKLQVKKIMQFINGSGPSSG ----3333-----------3333- >SAM AND SH3 DOMAIN-CONTAI; SWP:O94885; PDB:2DL0A; GSSGSSGGGLTEICRKPVSPGCISSVSDWLISIGLPMYAGTLSTAGFSTLSQVPSLSHTC ----------------------------------1111-----------1111---3333 LQEAGITEERHIRKLLSAARLFKLPPGPEAMSGPSSG -------3333-------1111--------------- >SPARTIN; SWP:Q8N0X7; PDB:2DL1A; GSSGSSGEPAEIKIIREAYKKAFLFVNKGLNTDELGQKEEAKNYYKQGIGHLLRGISISS ----------3333------------------3333------------------------ KESEHTGPGWESARQMQQKMKETLQNVRTRLEILEKGLATSLQNDLQEVPSGPSSG 
------3333-----------------------3333------------------- >SORBIN AND SH3 DOMAIN-CON; SWP:Q9BX66; PDB:2DL3A; GSSGSSGRPARAKFDFKAQTLKELPLQKGDIVYIYKQIDQNWYEGEHHGRVGIFPRTYIE -------------------3333-----------------------%%%%----1111-- LLSGPSSG -------- >PROTEIN STAC; SWP:Q99469; PDB:2DL4A; GSSGSSGNTYVALYKFVPQENEDLEMRPGDIITLLEDSNEDWWKGKIQDRIGFFPANFVQ -------------------3333-------------------------------3333-- RLSGPSSG -------- >KIAA0769 PROTEIN; SWP:O94868; PDB:2DL5A; GSSGSSGTLRNYPLTCKVVYSYKASQPDELTIEEHEVLEVIEDGDMEDWVKARNKVGQVG --------%%%%-------------1111------------------------3333--- YVPEKYLQFPTSSGPSSG --3333------------ >CHROMODOMAIN-HELICASE-DNA; SWP:Q9HCK8; PDB:2DL6A; GSSGSSGEPNHLDVDLETRIPVINKVDGTLLVGEDAPRRAELEMWLQGHPEFAVDPRFLA --------------3333--------------3333-3333-3333--------3333-- YMEDRRKQKWQRCKKNNSGPSSG --3333----------------- >KIAA0769 PROTEIN; SWP:O94868; PDB:2DL7A; GSSGSSGVCFVKALYDYEGQTDDELSFPEGAIIRILNKENQDDDGFWEGEFNGRIGVFPS --------------------3333--------------------------%%%%------ VLVEELSSGPSSG ------------- >SLIT-ROBO RHO GTPASE-ACTI; SWP:O75044; PDB:2DL8A; GSSGSSGEPIEAIAKFDYVGRTARELSFKKGASLLLYQRASDDWWEGRHNGIDGLIPHQY ---------------------3333-----------------------%%%%----1111 IVVQDTSGPSSG ------------ >LEUCINE-RICH REPEAT-CONTA; SWP:Q9HBW1; PDB:2DL9A; GSSGSSGPFIMDAPRDLNISEGRMAELKCRTPPMSSVKWLLPNGTVLSHASRHPRISVLN -----------------------------------------------3333--------- DGTLNFSHVLLSDTGVYTCMVTNVAGNSNASAYLNVSSGPSSG ---------1111---------3333----------------- >397AA LONG HYPOTHETICAL P; SWP:NA; PDB:2DLAA; MMIMLDPFSEKAKELLKGFGSINDFMDAIPKIVSVDDVIERIRVVKNEKLIDKFLDQDNV -----1111-------1111-------3333--3333---------33333333------ MDLAQFYALLGALSYSPYGIELELVKKANLIIYSERLKRKKEIKPEEISIDVSTAIEFPT ----------1111-----------------------------1111-----------33 EDVRKIERVYGKIPEYTMKISDFLDLVPDEKLANYYIYEGRVYLKREDLIRIWSKAFERN 33------------------------11113333---iiii------------------- VERGVNMLYEIRDELPEFYRKVLGEIQAFAEEEFGRKFGEIQ --------1111------------------------------ 
>YOPT; SWP:O34498; PDB:2DLBA; AGYLNNIALNLEIVLKNKADSPEVSETLVTRICENLLLSKEVSFLKADGSVENFKLSDEY -----------------------------------1111------1111----------- EITNTEELP --------- >D-LACTATE DEHYDROGENASE; SWP:P30901; PDB:2DLDA; MTKVFAYAIRKDEEPFLNEWKEAHKDIDVDYTDKLLTPETAKLAKGADGVVVYQQLDYTA ---------3333-----------------------33333333--------------33 DTLQALADAGVTKMSLRNVGVDNIDMDKAKELGFQITNVPVYSPNAIAEHAAIQAARVLR 33---------------------------------------------------------- QDKRMDEKMAKRDLRWAPTIGREVRDQVVGVVGTGHIGQVFMRIMEGFGAKVIAYDIFKN ----------------------3333---------------------------------- PELEKKGYYVDSLDDLYKQADVISLHVPDVPANVHMINDKSIAEMKDGVVIVNCSRGRLV -3333------3333--------------3333----33331111----------3333- DTDAVIRGLDSGKIFGFVMDTYEDEVGVFNKDWEGKEFPDKRLADLIDRPNVLVTPHTAF -3333---------------------------2222---------------------111 YTTHAVRNMVVKAFNNNLKLINGEKPDSPVALNKNKF 1------------------1111-------------- ------------------------------------------------------------ -------------------------------------------- >FILAMIN-B; SWP:O75369; PDB:2DLGA; GSSGSSGRAPSVATVGSICDLNLKIPEINSSDMSAHVTSPSGRVTEAEIVPMGKNSHCVR ------------------------11111111------1111------------------ FVPQEMGVHTVSVKYRGQHVTGSPFQFTVGPLGEGGSGPSSG --------------%%%%------------------------ >RECEPTOR-TYPE TYROSINE-PR; SWP:P23468; PDB:2DLHA; GSSGSSGPVLTQTSEQAPSSAPRDVQARMLSSTTILVQWKEPEEPNGQIQGYRVYYTMDP ----------------------------------------------------------33 TQHVNNWMKHNVADSQITTIGNLVPQKTYSVKVLAFTSIGDGPLSSDIQVITQTGSGPSS 333333------------------------------3333-------------------- G - >NOVEL PROTEIN; SWP:Q9BU19; PDB:2DLKA; GSSGSSGMPCDFPGCGRIFSNRQYLNHHKKYQHIHQKSFSCPEPACGKSFNFKKHLKEHM ------------------------------1111---------3333------------- KLHSDTRDYICEFSGPSSG -3333------3333---- >INTERFERON REGULATORY FAC; SWP:Q15306; PDB:2DLLA; GSSGSSGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQDYNREEDAALFKAWA ----------------------------3333----------------3333-------- LFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISDPYKVYRIVPESGPSS 
-------------3333-----------1111--3333---------------------- G - >VINEXIN; SWP:NA; PDB:2DLMA; GSSGSSGKAARLKFDFQAQSPKELTLQKGDIVYIHKEVDKNWLEGEHHGRLGIFPANYVE -------------------3333---------------1111------------3333-- VLSGPSSG -------- >THYROID RECEPTOR-INTERACT; SWP:Q15654; PDB:2DLOA; GSSGSSGEGCYVATLEKCATCSQPILDRILRAMGKAYHPGCFTCVVCHRGLDGIPFTVDA ----------------------------------------------------------33 TSQIHCIEDFHRKFASGPSSG 33------------------- ------------------------------------------------------------ ------------------------- >GLI-KRUPPEL FAMILY MEMBER; SWP:Q99K15; PDB:2DLQA; GSSGSSGVECPTCHKKFLSKYYLKVHNRKHTGEKPFECPKCGKCYFRKENLLEHEARNCM -------------------------3333------------------------------- NRSEQVFTCSVCQETFRRRMELRLHMVSHTGEMPYKCSSCSQQFMQKKDLQSHMIKLHSG ------------------------3333-------------------------------- PSSG ---- >RHO GUANINE NUCLEOTIDE EX; SWP:O15085; PDB:2DLSA; GSSGSSGVQRCVIIQKDQHGFGFTVSGDRIVLVQSVRPGGAAMKAGVKEGDRIIKVNGTM ---------------------------------------3333------------iiii- VTNSSHLEVVKLIKSGAYVALTLLGSSSGPSSG --------------------------------- ------------------------------------------------------------ ---------------------------------------------- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2DLUA; GSSGSSGPETVCWGHVEEVELINDGSGLGFGIVGGKTSGVVVRTIVPGGLADRDGRLQTG ---------------------------------------------------3333----- DHILKIGGTNVQGMTSEQVAQVLRNCGNSVRMLVARDPAGDISVTSGPSSG -----%%%%-----3333--------------------------------- >DOCKING PROTEIN 2, ISOFOR; SWP:O60496; PDB:2DLWA; GSSGSSGHKEFAVTMRPTEASERCHLRGSYTLRAGESALELWGGPEPGTQLYDWPYRFLR ---------------------3333-----------------------------3333-- RFGRDKVTFSFEAGRRCVSGEGNFEFETRQGNEIFLALEEAISAQKNSGPSSG ----3333--------3333---------3333----------3333------ >UBX DOMAIN-CONTAINING PRO; SWP:O94888; PDB:2DLXA; GSSGSSGIDKKLTTLADLFRPPIDLMHKGSFETAKECGQMQNKWLMINIQNVQDFACQCL ----------------3333-3333-------------3333-----------3333--- NRDVWSNEAVKNIIREHFIFWQVYHDSEEGQRYIQFYKLGDFPYVSILDPRTGQKLVEWH 
---3333----------------3333--------------------------------- QLDVSSFLDQVTGFLGEHGQLDGLSSSSGPSSG --3333--------------------------- >FYN-RELATED KINASE; SWP:Q8BPC1; PDB:2DLYA; GSSGSSGAEDRSLQAEPWFFGAIKRADAEKQLLYSENQTGAFLIRESESQKGDFSLSVLD ---------3333--3333----------------------------------------i EGVVKHYRIRRLDEGGFFLTRRKVFSTLNEFVNYYTTTSDGLCVKLEKPCLKIQVSGPSS iii----------------3333-----3333-3333----------------------- G - >TYROSINE-PROTEIN KINASE T; SWP:P42681; PDB:2DM0A; GSSGSSGNKITNLEIYEWYHRNITRNQAEHLLRQESKEGAFIVRDSRHLGSYTISVFMGA ------------3333---------------3333----------3333----------- RRSTEAAIKHYQIKKNDSGQWYVAERHAFQSIPELIWYHQHNAAGLMTRLRYPVGLMGSS ---------------1111-----------1111---3333------------------- GPSSG ----- >SORTILIN-RELATED RECEPTOR; SWP:Q92673; PDB:2DM4A; GSSGSSGPDAPRNLQLSLPREAEGVIVGHWAPPIHTHGLIREYIVEYSRSGSKMWASQRA ------3333-------------------------------------------------- ASNFTEIKNLLVNTLYTVRVAAVTSRGIGNWSDSKSITTIKGSGPSSG ------------------------------------------------ >NADP-dependent leukotrien; SWP:LTB4D_CAVPO; PDB:2DM6A; MVKAKSWTLKKHFQGKPTQSDFELKTVELPPLKNGEVLLEALFLSVDPYMRIASKRLKEG -----------------1111-----------2222----------3333--3333---- AVMMGQQVARVVESKNSAFPAGSIVLAQSGWTTHFISDGKGLEKLLTEWPDKLPLSLALG ---------------33332222--------------------------3333------- TIGMPGLTAYFGLLEVCGVKGGETVLVSAAAGAVGSVVGQIAKLKGCKVVGAAGSDEKIA ----------------------------1111---------------------------- YLKQIGFDAAFNYKTVNSLEEALKKASPDGYDCYFDNVGGEFLNTVLSQMKDFGKIAICG --1111-----3333-----------1111--------------3333--2222------ AISVYNRMDQLPPGPSPESIIYKQLRIEGFIVYRWQGDVREKALRDLMKWVLEGKIQYHE -1111-3333----------1111------3333--3333----------1111------ HVTKGFENMPAAFIEMLNGANLGKAVVTA ----3333-------1111---------- >KIAA1556 PROTEIN; SWP:Q8NHN3; PDB:2DM7A; GSSGSSGPARFTQDLKTKEASEGATATLQCELSKVAPVEWKKGPETLRDGGRYSLKQDGT --------------------2222---------------------------------!!! 
RCELQIHDLSVADAGEYSCMCGQERTSATLTVRALPARFTEGSGPSSG !--------1111----------------------------------- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2DM8A; GSSGSSGPATCPIVPGQEMIIEISKGRSGLGLSIVGGKDTPLNAIVIHEVYEEGAAARDG ------------------------------------------------------------ RLWAGDQILEVNGVDLRNSSHEEAITALRQTPQKVRLVVYRDEAHYRDEESGPSSG --2222----%%%%-----3333--1111--------------------------- >V-TYPE ATP SYNTHASE SUBUN; SWP:O57724; PDB:2DM9A; EIISSVLEEVKRRLETMSEDEYFESVKALLKEAIKELNEKKVRVMSNEKTLGLIASRIEE ------------3333-------------------------------------------- IKSELGDVSIELGETVDTMGGVIVETEDGRIRIDNTFEARMERFEGEIRSTIAKVLFG ----3333-----------------1111----------------------------- >FILAMIN-B; SWP:O75369; PDB:2DMBA; GSSGSSGTGDASKCLATGPGIASTVKTGEEVGFVVDAKTAGKGKVTCTVLTPDGTEAEAD ----------3333---3333--------------------------------------- VIENEDGTYDIFYTAAKPGTYVIYVRFGGVDIPNSPFTVMATDGEVTAVEEAPVNACPSG ---1111-------------------%%%%------------------------------ PSSG ---- >FILAMIN-B; SWP:O75369; PDB:2DMCA; GSSGSSGIPGSPFTAKITDDSRRCSQVKLGSAADFLLDISETDLSSLTASIKAPSGRDEP ---------------------------2222------------3333-----1111---- CLLKRLPNNHIGISFIPREVGEHLVSIKKNGNHVANSPVSIMVVQSEIGDSGPSSG ------%%%%------------------iiii------------------------ >ZINC FINGER PROTEIN 64, I; SWP:Q9NPA5; PDB:2DMDA; GSSGSSGPHKCEVCGKCFSRKDKLKTHMRCHTGVKPYKCKTCDYAAADSSSLNKHLRIHS ----------3333-----3333--3333-------------------3333---3333- DERPFKCQICPYASRNSSQLTVHLRSHTGDSGPSSG ---------------3333---3333---------- >PHD FINGER PROTEIN 3; SWP:Q92576; PDB:2DMEA; GSSGSSGSADQIRQSVRHSLKDILMKRLTDSNLKVPEEKAAKVATKIEKELFSFFRDTDA -------3333---------------3333-----3333-------------3333---- KYKNKYRSLMFNLKDPKNNILFKKVLKGEVTPDHLIRMSPEELASKELAAWRRRSGPSSG --------3333--3333--33333333-----3333-1111-----3333--------- >RING FINGER PROTEIN 25; SWP:Q96BH1; PDB:2DMFA; GSSGSSGEEDWVLPSEVEVLESIYLDELQVIKGNGRTSPWEIYITLHPATAEDQDSQYVC ------------------3333----------------------------3333------ FTLVLQVPAEYPHEVPQISIRNPRGLSDEQIHTILQVLGHVAKAGLGTAMLYELIEKGKE 
------------------------------------------------------------ ILTDNNIPHGQSGPSSG -3333------------ >KIAA1228 PROTEIN; SWP:A0FGR8; PDB:2DMGA; GSSGSSGSPLGQIQLTIRHSSQRNKLIVVVHACRNLIAFSEDGSDPYVRMYLLPDKRRSG -------3333---------1111---------------3333----------------- RRKTHVSKKTLNPVFDQSFDFSVSLPEVQRRTLDVAVKNSGGFLSKDKGLLGKVLVALAS -----------------------33331111----------------------------- EELAKGWTQWYDLTEDSGPSSG ---------------------- >MYOFERLIN; SWP:Q9NZM1; PDB:2DMHA; GSSGSSGMLRVIVESASNIPKTKFGKPDPIVSVIFKDEKKKTKKVDNELNPVWNEILEFD ----------------------------------%%%%---------------------- LRGIPLDFSSSLGIIVKDFETIGQNKLIGTATVALKDLTGDQSRSLPYKLISLLNEKGQD iiii--1111-------3333------------3333-----------------1111-- TGATIDLVIGYDPPSGPSSG -------------------- >TEASHIRT HOMOLOG 3; SWP:Q63HK5; PDB:2DMIA; GSSGSSGKLYGSIFTGASKFRCKDCSAAYDTLVELTVHMNETGHYRDDNHETDNNNPKRW ------------------------------3333-------------------------- SKPRKRSLLEMEGKEDAQKVLKCMYCGHSFESLQDLSVHMIKTKHYQKVSGPSSG ----------------------------------------11113333------- >MIDLINE 2 ISOFORM 2; SWP:Q9UJV3; PDB:2DMKA; GSSGSSGEGLDYLTAPNPPSIREELCTASHDTITVHWISDDEFSISSYELQYTIFTGQAN ----------------------1111---------------3333--------------3 FISLYNSVDSWMIVPNIKQNHYTVHGLQSGTRYIFIVKAINQAGSRNSEPTRLKTNSQPF 333---3333-------------------------------------------------- KSGPSSG ------- >PROTEIN DISULFIDE-ISOMERA; SWP:Q922R8; PDB:2DMLA; GSSGSSGAVSGLYSSSDDVIELTPSNFNREVIQSDGLWLVEFYAPWCGHCQRLTPEWKKA -------------3333--------3333-1111-------------3333--------- ATALKDVVKVGAVNADKHQSLGGQYGVQGFPTIKIFGANKNKPEDYQGGRTGEAIVDAAL -------------33333333-----------------3333----------3333---- SALRSGPSSG --3333---- >PROTEIN DISULFIDE-ISOMERA; SWP:P30101; PDB:2DMMA; GSSGSSGFDGNLKRYLKSEPIPESNDGPVKVVVAENFDEIVNNENKDVLIEFYAPWCGHC --------------------------------33333333-------------3333333 KNLEPKYKELGEKLSKDPNIVIAKMDATANDVPSPYEVRGFPTIYFSPANKKLNPKKYEG 3-3333-------1111---------3333--------------------3333------ GRELSDFISYLQREATSGPSSG --3333--------3333---- 
>HOMEOBOX PROTEIN TGIF2LX; SWP:Q8IUE1; PDB:2DMNA; GSSGSSGKKRKGNLPAESVKILRDWMYKHRFKAYPSEEEKQMLSEKTNLSLLQISNWFIN ---------------3333-------1111-----3333---3333---3333-----33 ARRRILPDMLQQRRNDPSGPSSG 33--3333--------------- >NEUTROPHIL CYTOSOL FACTOR; SWP:P19878; PDB:2DMOA; GSSGSSGEAHRVLFGFVPETKEELQVMPGNIVFVLKKGNDNWATVMFNGQKGLVPCNYLE --------------------------2222----------------%%%%----3333-- PVSGPSSG -------- >ZINC FINGERS AND HOMEOBOX; SWP:Q9Y6X8; PDB:2DMPA; GSSGSSGAYPDFAPQKFKEKTQGQVKILEDSFLKSSFPTQAELDRLRVETKLSRREIDSW -------------------------------3333---3333------------------ FSERRKLRDSMEQAVLDSMGSGKSGPSSG --------%%%%----------------- >LIM/HOMEOBOX PROTEIN LHX9; SWP:Q9NQ69; PDB:2DMQA; GSSGSSGKRMRTSFKHHQLRTMKSYFAINHNPDAKDLKQLAQKTGLTKRVLQVWFQNARA ----------------3333---3333--------------------------------- KFRRNLLRQENGGVSGPSSG -------------------- >HOMEOBOX PROTEIN OTX2; SWP:P80206; PDB:2DMSA; GSSGSSGRRERTTFTRAQLDVLEALFAKTRYPDIFMREEVALKINLPESRVQVWFKNRRA --------------3333----------------------------------------33 KCRQQQQQQQNGGQSGPSSG 33------------------ >HOMEOBOX PROTEIN BARH-LIK; SWP:Q9HBU1; PDB:2DMTA; GSSGSSGGEPGTKAKKGRRSRTVFTELQLMGLEKRFEKQKYLSTPDRIDLAESLGLSQLQ ------------------------------------------3333----------3333 VKTWYQNRRMKWKKSGPSSG ---------3333------- >HOMEOBOX PROTEIN GOOSECOI; SWP:P56915; PDB:2DMUA; GSSGSSGRRHRTIFTDEQLEALENLFQETKYPDVGTREQLARKVHLREEKVEVWFKNRRA -----------------------------------------1111--------------- KWRRSGPSSG -----3333- >ITCHY HOMOLOG E3 UBIQUITI; SWP:Q96J02; PDB:2DMVA; GSSGSSGLPPGWEQRVDQHGRVYYVDHVEKRTTWDRPSGPSSG ----------------1111----------------------- >SYNAPTOBREVIN-LIKE 1 VARI; SWP:NA; PDB:2DMWA; GSSGSSGMAILFAVVARGTTILAKHAWCGGNFLEVTEQILAKIPSENNKLTYSHGNYLFH -----------------------------------------------------%%%%--- YICQDRIVYLCITDDDFERSRAFNFLNEIKKRFQTTYGSRAQTAPPYAMNSEFSSVLAAQ ---%%%%------3333--------------------3333------------------- LKHHSSGPSSG ----------- >DNAJ HOMOLOG SUBFAMILY B ; SWP:Q8NHS0; PDB:2DMXA; 
GSSGSSGMANYYEVLGVQASASPEDIKKAYRKLALRWHPDKNPDNKEEAEKKFKLVSEAY -----------------1111-----------------3333------------------ EVLSDSKKRSLYDRAGCDSWRAGGGASGPSSG ------------3333---------------- >SPERMATID PERINUCLEAR RNA; SWP:Q96SI9; PDB:2DMYA; GSSGSSGRKILDSKAIDLMNALMRLNQIRPGLQYKLLSQSGPVHAPVFTMSVDVDGTTYE -----------------------------------------------------%%%%--- ASGPSKKTAKLHVAVKVLQAMGYPTGFDADISGPSSG ------------------------------------- >INAD-LIKE PROTEIN; SWP:Q8NI35; PDB:2DMZA; GSSGSSGGSDSSLFETYNVELVRKDGQSLGIRIVGYVGTSHTGEASGIYVKSVIPGSAAY ------------------------------------------------------------ HNGHIQVNDKIVAVDGVNIQGFANHDVVEVLRNAGQVVHLTLVRRKTSSSTSPLEPPSDR -------------iiii------------------------------------------- GTVSGPSSG --------- >ZINC FINGERS AND HOMEOBOX; SWP:Q9H4I2; PDB:2DN0A; GSSGSSGASIYKNKKSHEQLSALKGSFCRNQFPGQSEVEHLTKVTGLSTREVRKWFSDRR ---------------3333----------------------------------------- YHCRNLKGSRSGPSSG ---------------- >GENERAL TRANSCRIPTION FAC; SWP:P78347; PDB:2DN4A; GSSGSSGLRKQVEELFERKYAQAIKAKGPVTIPYPLFQSHVEDLYVEGLPEGIPFRRPST ----------------------------------3333---------------------- YGIPRLERILLAKERIRFVIKKHELLNSTREDLSGPSSG -----------3333-------3333------------- >General transcription fac; SWP:Q9UHL9; PDB:2DN5A; GSSGSSGLREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNT ---------------------------------------1111----------------- YDIHRLEKILKAREHVRMVIINQSGPSSG ----------------------------- >KIAA0640 PROTEIN; SWP:Q9UH65; PDB:2DN6A; GSSGSSGVLKQGYMMKKGHRRKNWTERWFVLKPNIISYYVSEDLKDKKGDILLDENCCVE -----------------3333--------------------------------3333--- SLPDKDGKKCLFLVKCFDKTFEISASDKKKKQEWIQAIHSTIHLLKLGSSGPSSG ----%%%%----------------------------------3333--------- >RECEPTOR-TYPE TYROSINE-PR; SWP:P10586; PDB:2DN7A; GSSGSSGPGRPTMMISTTAMNTALLQWHPPKELPGELLGYRLQYCRADEARPNTIDFGKD %%%%-----------------------------------------1111--------333 DQHFTVTGLHKGTTYIFRLAAKNRAGLGEEFEKEIRTPEDLSGPSSG 3---------------------------------------------- >ACETYL-COA 
CARBOXYLASE 2; SWP:O00763; PDB:2DN8A; GSSGSSGTCVFEKENDPTVLRSPSAGKLTQYTVEDGGHVEAGSSYAEMEVMKMIMTLNVQ -------------------------------------------------%%%%------- ERGRVKYIKRPGAVLEAGCVVARLELDDPSKVHPSGPSSG ---------------------------------------- >DNAJ HOMOLOG SUBFAMILY A ; SWP:Q96EY1; PDB:2DN9A; GSSGSSGDYYQILGVPRNASQKEIKKAYYQLAKKYHPDTNKDDPKAKEKFSQLAEAYEVL -------3333-------------------------3333--3333---3333------- SDEVKRKQYDAYGSGPSSG ------------------- >UNNAMED PROTEIN PRODUCT; SWP:Q14DR8; PDB:2DNAA; GSSGSSGPSHSLQAPEVRFSKEMECLQAMGFVNYNANLQALIATDGDTNAAIYKLKSSQG --------------33333333-------------------1111---------3333-- FSGPSSG ------- >PYRUVATE DEHYDROGENASE PR; SWP:O00330; PDB:2DNCA; GSSGSSGIKILMPSLSPTMEEGNIVKWLKKEGEAVSAGDALCEIETDKAVVTLDASDDGI -----------------------------2222------------3333----------- LAKIVVEEGSKNIRLGSLIGLIVEEGEDWKHVSGPSSG ------2222---------------------------- >Dihydrolipoyllysine-resid; SWP:P10515; PDB:2DNEA; GSSGSSGQKVPLPSLSPTMQAGTIARWEKKEGDKINEGDLIAEVETDKATVGFESLEECY -----------------------------2222------------3333----------- MAKILVAEGTRDVPIGAIICITVGKPEDIEAFKNYTLDSSAASGPSSG -------------2222-------33333333---------------- >DOUBLECORTIN DOMAIN-CONTA; SWP:Q9UHG0; PDB:2DNFA; GSSGSSGRKPLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKITLRSGAV --------------------2222----------3333--3333---3333---3333-- HRLYTLEGKLVESGAELENGQFYVAVGRDKFKKLPYGELLFDSGPSSG ----1111---------------------------3333--------- >EUKARYOTIC TRANSLATION IN; SWP:Q9WUK2; PDB:2DNGA; GSSGSSGKELPTEPPYTAYVGNLPFNTVQGDIDAIFKDLSIRSVRLVRDKDTDKFKGFCY -----------------------------------3333--------------------- VEFDEVDSLKEALTYDGALLGDRSLRVDIAEGRKQDKSGPSSG -------------------!!!!-------------------- >BRUNO-LIKE 5, RNA BINDING; SWP:Q86VW6; PDB:2DNHA; GSSGSSGSESRGGRDRKLFVGMLNKQQSEEDVLRLFQPFGVIDECTVLRGPDGSSKGCAF -----------------------------------3333--------------------- VKFSSHTEAQAAIHALHGSQTMPGASSSLVVKFADTDKESGPSSG --------------------------------------------- >BRUNO-LIKE 4, RNA BINDING; 
SWP:Q9BQ96; PDB:2DNKA; GSSGSSGCLRQPPSHRKLFVGMLNKQQSEDDVRRLFEAFGNIEECTILRGPDGNSKGCAF -----------------------------3333----------------3333------- VKYSSHAEAQAAINALHGSQTMPGASSSLVVKFADTDKESGPSSG ---------------2222-------------------------- >cytoplasmic polyadenylati; SWP:Q5T390; PDB:2DNLA; GSSGSSGSRKVFVGGLPPDIDEDEITASFRRFGPLVVDWPHKAESKSYFPPKGYAFLLFQ ----------------------3333---3333--------------------------- EESSVQALIDACLEEDGKLYLCVSSPTIKDKPVQIRPWNLSDSDFVMDSGPSSG -3333------------------------------------------------- >SRP46 SPLICING FACTOR; SWP:Q6PF01; PDB:2DNMA; GSSGSSGPDVDGMITLKVDNLTYRTSPDSLRRVFEKYGRVGDVYIPREPHTKAPRGFAFV ---------------------3333----11113333----------------------- RFHDRRDAQDAEAAMDGAELDGRELRVQVARYGRRDLSGPSSG -------------------%%%%-------------------- >RNA-BINDING PROTEIN 12; SWP:Q9NTZ6; PDB:2DNNA; GSSGSSGKPLPINPDDLYVSVHGMPFSAMENDVRDFFHGLRVDAVHLLKDHVGRNNGNGL ------------3333--------1111--3333---------------1111------- VKFLSPQDTFEALKRNRMLMIQRYVEVSPATERQWVAAGGHITSGPSS ----333333331111---%%%%-------3333-3333--------- >TRINUCLEOTIDE REPEAT CONT; SWP:Q5SZQ7; PDB:2DNOA; GSSGSSGSRGEDRKLFVGMLGKQQTDEDVRKMFEPFGTIDECTVLRGPDGTSKGCAFVKF --------------------33333333----3333------------------------ QTHAEAQAAINTLHSSRTLPGASSSLVVKFADTEKESGPSSG --3333------------------------------------ >RNA-BINDING PROTEIN 14; SWP:Q96PK6; PDB:2DNPA; GSSGSSGNTWKIFVGNVSAACTSQELRSLFERRGRVIECDVVKDYAFVHMEKEADAKAAI -----------------11113333----------------------------------- AQLNGKEVKGKRINVELSTKGQKKSGPSSG -------iiii------------------- >RNA-BINDING PROTEIN 4B; SWP:RBM4B_HUMAN; PDB:2DNQA; GMVKLFIGNLPREATEQEIRSLFEQYGKVLECDIIKNYGFVHIEDKTAAEDAIRNLHHYK --------------------3333----------%%%%------3333------------ LHGVNINVEASKNKSKASSGPSSG %%%%-------------------- >SYNAPTOJANIN-1; SWP:O43426; PDB:2DNRA; GSSGSSGGTVLVSIKSSLPENNFFDDALIDELLQQFASFGEVILIRFVEDKMWVTFLEGS -----------------1111------------------------------------333 SALNVLSLNGKELLNRTITIALKSPSGPSSG 3----1111---%%%%--------------- 
>Chromodomain protein, Y c; SWP:NA; PDB:2DNTA; GSSGSSGMASEELYEVERIVDKRKNKKGKTEYLVRWKGYDSEDDTWEPEQHLVNCEEYIH ------------------------3333-----------1111----3333--------- DFNRRHTEKQKESGPSSG ------------------ >SH3 MULTIPLE DOMAINS 1; SWP:Q5TCZ1; PDB:2DNUA; GSSGSSGEEKYVTVQPYTSQSKDEIGFEKGVTVEVIRKNLEGWWYIRYLGKEGWAPASYL --------------------3333-----------------------%%%%--------- KKAKDSGPSSG ----------- >CHROMOBOX PROTEIN HOMOLOG; SWP:Q9QXV1; PDB:2DNVA; GSSGSSGERVFAAEALLKRRIRKGRMEYLVKWKGWSQKYSTWEPEENILDARLLAAFESG -------------------------------------------3333--3333--3333- PSSG ---- >ACYL CARRIER PROTEIN; SWP:O14561; PDB:2DNWA; GSSGSSGMPPLTLEGIQDRVLYVLKLYDKIDPEKLSVNSHFMKDLGLDSLDQVEIIMAME -----------3333-----------33333333-------------3333--------- DEFGFEIPDIDAEKLMCPQEIVDYIADKKDVYESGPSSG ----------3333--3333----3333----------- >SYNTAXIN-12; SWP:Q86Y82; PDB:2DNXA; GSSGSSGQLRDFSSIIQTCSGNIQRISQATAQIKNLMSQLGTKQDSSKLQENLQQLQHST ----------3333-----------------------3333----3333----------- NQLAKETNELLKELGSLPLPLSTSEQRQQRLQKERLMNDFSAALNNFQAVQRRVSEKEKE -------------3333------------------------------------------- SIARSGPSSG ---------- >HETEROGENEOUS NUCLEAR RIB; SWP:P52272; PDB:2DO0A; GSSGSSGALQAGRLGSTVFVANLDYKVGWKKLKEVFSMAGVVVRADILEDKDGKSRGIGT -----------------------3333----------------------3333------- VTFEQSIEAVQAISMFNGQLLFDRPMHVKMDERALPKGDFFPPERPQQSGPSSG ---------------2222-%%%%------------------------------ >NUCLEAR PROTEIN HCC-1; SWP:P82979; PDB:2DO1A; SGSSGVELHKLKLAELKQECLARGLETKGIKQDLIHRLQAYLEEHAESGPSSG ------3333------------------------------------------- >TRANSCRIPTION ELONGATION ; SWP:O00267; PDB:2DO3A; GSSGSSGEFPAQELRKYFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVL ------------------2222-------------------------------------1 PRDLQLCSE 111------ >UPF0301 PROTEIN HD_1794; SWP:Q7VKS7; PDB:2DO8A; MFGNLQGKFIIATPEMDDEYFDRTVIYICEHNDNGTIGVIINTPTDLSVLELLTRMDFQM ----2222-----------------------3333------------------------- AKPRIYTQDQMVLNGGPVNQDRGFIVHSKTDHEFTHSYKVTDDITLTTSGDVLDSFGTQT 
------------------3333------------------3333-----3333------- APEKFIVCLGCSTWKPHQLEQEIAQNYWLLSEANNQTLFETSYLDRWVEANEMLGISGIL --------------2222----------------3333---------3333--------- APAGRALE -------- >NACHT-, LRR- AND PYD-CONT; SWP:Q8CCN1; PDB:2DO9A; GSSGSSGMALARANSPQEALLWALNDLEENSFKTLKFHLRDVTQFHLARGELESLSQVDL ------------------------------3333----3333-----iiii3333----- ASKLISMYGAQEAVRVVSRSLLAMNLMELVDYLNQVCLNDYREIYREHVSGPSSG -------------------3333--------3333-------------------- >PROACTIVATOR POLYPEPTIDE; SWP:P07602; PDB:2DOBA; GSLPCDICKDVVTAAGDLKDNATEEEILVYLEKTCDWLPKPNSASCKEIVDSYLPVILDI -----------------1111-------------1111---------------------1 IKGESRPGEVCSALNLCES 111---------------- >NEURAL CELL ADHESION MOLE; SWP:O15394; PDB:2DOCA; GSSGSSGQEYILALADVPSSPYGVKIIELSQTTAKVSFNKPDSHGGVPIHHYQVDVKEVA ------------------------------------------------------------ SEIWKIVRSHGVQTMVVLNNLEPNTTYEIRVAAVNGKGQGDYSKIEIFQTLPVSGPSSG ----------------------------------3333--------------------- >TRANSCRIPTION ELONGATION ; SWP:NA; PDB:2DODA; GSSGSSGARERAIVPLEARMKQFKDMLLERGVSAFSTWEKELHKIVFDPRYLLLNPKERK --------------3333------------------3333---------1111-3333-- QVFDQYVKTRAEEERRSGPSSG -----------3333------- >TRANSCRIPTION ELONGATION ; SWP:O14776; PDB:2DOEA; GSSGSSGEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMR ----------3333------------------1111--------33333333---3333- EDLFKQYIEKIAKNLDSSGPSSG ----------------------- >TRANSCRIPTION ELONGATION ; SWP:O14776; PDB:2DOFA; GSSGSSGDREREQHKREEAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWESGSLLERE -----------3333-----------------------------333333333333---- EKEKLFNEHIEALTKKKRESGPSSG ------------3333--------- >CELL DIVISION CONTROL PRO; SWP:P06704; PDB:2DOQA; SELLEEQKQEIYEAFSLFDNNDGFLDYHELKVAKALGFELPKREILDLIDEYDSEGRHLK -------------------------3333-----------3333---------------3 YDDFYIVGEKILKRDPLDEIKRAFQLFDDDHTGKISIKNLRRVAKELGETLTDEELRAIE 333----3333---3333-----33333333-------------1111----------33 EFDLDGDGEINENEFIAICTD 33------------------- >Protein 
SFI1; SWP:Q12369; PDB:2DOQD; PLGSNEEANRFANQAKLRVQEAVFYIWSDKTLKYSQANDEAESFRNTWLLFRSFQQWITL -----3333--------------------------------------------------- TQTFKEQSRLADQAFLNKFRK --------------------- >probable N-succinyldiamin; SWP:Q5SLF1; PDB:2DOUA; VPEPSVFLVVDEAKRKARERGVGLIDLSIGSTDLPPPEAPLKALAEALNDPTTYGYCLKS --------------------------------------------3333-3333----333 CTLPFLEEAARWYEGRYGVGLDPRREALALIGSQEGLAHLLLALTEPEDLLLLPEVAYPS 3--------------------1111--------------------2222--------333 YFGAARVASLRTFLIPLREDGLADLKAVPEGVWREAKVLLLNYPNNPTGAVADWGYFEEA 3----------------1111--1111--------------------------------- LGLARKHGLWLIHDNPYVDQVYEGEAPSPLALPGAKERVVELFSLSKSYNLAGFRLGFAL --------------1111---------33332222-----------11113333------ GSEEALARLERVKGVIDFNQYAGVLRMGVEALKTPKEVVRGYARVYRERALGMAEALKGV -----------3333-----3333------1111-------------------------- LSLLPPRATMYLWGRLPEGVDDLEFGLRLVERGVALAPGRGFGPGGKGFVRIALVRPLEE ----------------2222--------3333-----3333--1111------------- LLEAAKRIREAL ------------ >TRIOSEPHOSPHATE ISOMERASE; SWP:P36186; PDB:2DP3A; PARRPFIGGNFKCNGSLDFIKSHVAAIAAHKIPDSVDVVIAPSAVHLSTAIAANTSKQLR --------------------------1111--1111------3333-------------- IAAQNVYLEGNGAWTGETSVEMLQDMGLKHVIVGHSERRRIMGETDEQSAKKAKRALEKG -------------2222------1111-----------------------------1111 MTVIFCVGETLDERKANRTMEVNIAQLEALGKELGESKMLWKEVVIAYEPVWSIGTGVVA ------------------------------------33331111-----1111------- TPEQAEEVHVGLRKWFVEKVAAEGAQHIRIIYGGSANGSNCEKLGQCPNIDGFLVGGASL --------------------3333------------1111------1111-----3333- KPEFMTMIDILTKTR --------------- >HYPOTHETICAL PROTEIN TTHA; SWP:NA; PDB:2DP9A; YMERPKLGLIVREPYASLIVDGRKVWEIRRRKTRHRGPLGIVSGGRLIGQADLVGVEGPF ------------------1111--------------------%%%%-------------- SVEELLAHQEKHLAEEAFLRAYAKDEPLYAWVLENAFRYEKPLHVPRRPGRVMFVDLSEV 3333---3333---3333----%%%%------------------------------1111 RW -- >CHITINASE-3-LIKE PROTEIN ; SWP:Q6TMG6; PDB:2DPEA; YKLICYYTSWSQYREGDGSCFPDAIDPFLCTHVIYSFANISNNEIDTWEWNDVTLYDTLN 
-------1111---!!!!--1111-1111-----------%%%%----1111-------- TLKNRNPKLKTLLSVGGWNFGPERFSAIASKTQSRRTFIKSVPPFLRTHGFDGLDLAWLY -11111111-------11113333------------------------------------ PGRRDKRHLTTLVKEMKAEFIREAQAGTEQLLLSAAVSAGKIAIDRGYDIAQISRHLDFI -1111-------------------------------------------33333333---- SLLTYDFHGAWRQTVGHHSPLFAGNEDASSRFSNADYAVSYMLRLGAPANKLVMGIPTFG --------%%%%-----------------------------------1111--------- RSFTLASSKTDVGAPVSGPGVPGRFTKEKGILAYYEICDFLHGATTHRFRDQQVPYATKG ----------2222-------------2222-------3333-------1111-----!! NQWVAYDDQESVKNKARYLKNRQLAGAMVWALDLDDFRGTFCGQNLTFPLTSAVKDVLAE !!-----------------1111-------1111-3333-----------------1111 V - >DNA POLYMERASE IOTA; SWP:Q9UNA4; PDB:2DPIA; SSRVIVHVDLDCFYAQVEMISNPELKDKPLGVQQKYLVVTCNYEARKLGVKKLMNVRDAK ---------------------3333--------!!!!-----3333----------3333 EKCPQLVLVNGEDLTRYREMSYKVTELLEEFSPVVERLGFDENFVDLTEMVEKRLQQLQS --1111---------------------3333-----------------------3333-- DELSAVTVSGHVYNNQSINLLDVLHIRLLVGSQIAAEMREAMYNQLGLTGCAGVASNKLL 3333-------2222---1111-------------------------------------- AKLVSGVFKPNQQTVLLPESCQHLIHSLNHIKEIPGIGYKTAKCLEALGINSVRDLQTFS ---1111---------3333---------3333------------1111----------- PKILEKELGISVAQRIQKLSFGEDNSPVILSGPPQSFSEEDSFKKCSSEVEAKNKIEELL ------------------1111------------------------------3333---- ASLLNRVCQDGRKPHTVRLIIRRYSSEKHYGRESRQCPIPSHVIQVMTPMVDILMKLFRN -------3333----------------------------3333--3333------3333- MTLLSVCFCNLK ------------ >SODIUM/CALCIUM EXCHANGER ; SWP:P23685; PDB:2DPKA; HGIPVSKIFFEQGTYQCLENCGTVALTIIRRGGDLTNTVFVDFRTEDGTANAGSDYEFTE -----------------1111--------------------------------------- GTVVFKPGETQKEIRVGIIDDDIFEEDENFLVHLSNVKVSSESALACLGSPSTATVTIFD -----2222--------------------------------------------------- DDHA ---- >GMP SYNTHASE [GLUTAMINE-H; SWP:O59072; PDB:2DPLA; MDWGRFVEEKVREIRETVGDSKAIIALSGGVDSSTAAVLAHKAIGDRLHAVFVNTGFLRK -----------------!!!!----------------------!!!!-----------22 
GEPEFVVKTFRDEFGMNLHYVDAQDRFFSALKGVTDPEEKRKIIGRVFIEVFEEVAKKIG 22----------------------------2222-------------------------- AEYLIQGTIAPLNLKLIEPLRDLYKDEVRELAKFLGLPEKIYNRMPFPGPGLAVRVIGEV -----------------1111----------------3333------11111111----- TPEKIRIVREANAIVEEEVERAGLRPWQAFAVLLGVKTVGVQGDIRAYKETIAVRIVESI -------------------1111------------------------------------- DGMTANAMNVPWEVLQRIAFRITSEIPEVGRVLYDITNKPPATIEFE ----------3333-----------3333------------------ >ADENINE-SPECIFIC METHYLTR; SWP:P04043; PDB:2DPMA; TLQPFTKWTGGKRQLLPVIRELIPKTYNRYFEPFVGGGALFFDLAPKDAVINDFNAELIN -------22223333---1111------------!!!!---------------------- CYQQIKDNPQELIEILKVHQEYNSKEYYLDLRSADRDERIDMMSEVQRAARILYMLRVNF -------------------------------3333--3333-----------------22 NGLYRVNSKNQFNVPYGRYKNPKIVDEELISAISVYINNNQLEIKVGDFEKAIVDVRTGD 22----1111--------------------------1111-------3333-11112222 FVYFDPPYIPLFTSYTHEGFSFADQVRLRDAFKRLSDTGAYVMLSNSSSALVEELYKDFN -------------------------------------------------------1111- IHYVEGKISEIIVTNYEK ------------------ >GLYCEROL KINASE; SWP:Q53W24; PDB:2DPNA; FLLALDQGTTSSRAILFTLEGRPVAVAKREFRQLYPKPGWVEHDPLEIWETTLWAAREVL -----------------1111---------------2222-------------------- RRAGAEAGEVLALGITNQRETTLLWDRKTGKPLHNAIVWQDRRTTPLCEALRAKGLEPLF -----3333----------------------------33331111------3333----- RERTGLLFDPYFSGTKLVWLLENVPGLKARAEGGGVAFGTVDTWLIWNLTGGKVHATDPT --------3333------------------3333---------------iiii----333 NASRTLLFNLHTLAWDPELLEALGIPAALLPEVRPSDGDFGETLPELLGAPVPIRGVLGD 3-------------------------1111----1111-----3333------------- QQAALFGQAALGGGEGKCTYGTGAFLLLNTGKRPVLSEKGLLATVAWSLGGRATYALEGS -----------2222--------------!!!!--------------------------- LFVAGAAVGWLKEVGLIRESAEVEALAASVEDTGDVYFVPAFTGLGAPYWDPYARGTLLG ----------------------------------------1111---------------- LTRGTSRAHLARAALEGVAFQVRDVVLAMEEEAGVRLKVLKADGGMAQNRLFLKIQADLL -1111--------------------------------------3333----------333 GVPVAVPEVTETTALGAALMAGVGAGALSPEDVAGRFREAERFLPTMPEGRREALYRRWR 
3---------------------------33333333------------------------ EAVERAKGWARE ------------ >L-GULONATE 3-DEHYDROGENAS; SWP:P14755; PDB:2DPOA; GDVLIVGSGLVGRSWAMLFASGGFRVKLYDIEPRQITGALENIRKEMKSLQQSGSLKGSL -------------------1111--------3333---------------1111------ SAEEQLSLISSCTNLAEAVEGVVHIQECVPENLDLKRKIFAQLDSIVDDRVVLSSSSSCL -----1111----3333-2222--------------------1111-------------- LPSKLFTGLAHVKQCIVAHPVNPPYYIPLVELVPHPETSPATVDRTHALMRKIGQSPVRV 3333-2222--1111-------3333--------1111------------1111------ LKEIDGFVLNRLQYAIISEAWRLVEEGIVSPSDLDLVMSDGLGMRYAFIGPLETMHLNAE ---2222------------------------------1111---1111------------ GMLSYSDRYSEGMKRVLKSFGSIPEFSGATVEKVNQAMCKKVPADPEHLAARREWRDECL ----------------1111------------------------3333------------ KRLAKLKRQM ---------- >GTP-BINDING PROTEIN RAD; SWP:P55042; PDB:2DPXA; SVYKVLLLGAPGVGKSALARIFGGVEHTYDRSIVVDGEEASLMVYDIWLPGHCMAMGDAY ---------2222---------------------iiii---------------------- VIVYSVTDKGSFEKASELRVQLRRARDVPIILVGNKSDLVRSREVSVDEGRACAVVFDCK ----1111---------------------------33331111----------------- FIETSAALHHNVQALFEGVVRQIRLR -----1111----------------- >FLAGELLUM-SPECIFIC ATP SY; SWP:P26465; PDB:2DPYA; VRRYGRLTRATGLVLEATGLQLPLGATCIIERQDGPETKEVESEVVGFNGQRLFLMPLEE ------------------------------------------------------------ VEGILPGARVYARKQLPLGPALLGRVLDGGGKPLDGLPAPDTLETGALITPPFNPLQRTP ------------------3333-----1111----------------------------- IEHVLDTGVRAINALLTVGRGQRMGLFAGSGVGKSVLLGMMARYTRADVIVVGLIGERGR --------3333------2222-----------------------------------333 EVKDFIENILGPDGRARSVVIAAPADVSPLLRMQGAAYATRIAEDFRDRGQHVLLIMDSL 3----------3333--------11113333---------------1111---------- TRYAMAQREIALAIGEPPATKGYPPSVFAKLPALVERAGNGIHGGGSITAFYTVLTEGDD -----------1111--------3333-------3333---------------------- QQDPIADSARAILDGHIVLSRRLAEAGHYPAIDIEASISRAMTALITEQHYARVRLFKQL --------3333--------3333---------1111-1111------------------ LSSFQRNRDLVSVGAYAKGSDPMLDKAITLWPQLEAFLQQGIFERADWEDSLQALDLIFP 
----1111-------------------1111---------1111---------------- TV -- >AMINOPEPTIDASE N; SWP:P04825; PDB:2DQ6A; QAKYRHDYRAPDYQITDIDLTFDLDAQKTVVTAVSQAVRHGASDAPLRLNGEDLKLVSVH ---1111-----------------3333-------------1111--------------- INDEPWTAWKEEEGALVISNLPERFTLKIINEISPAANTALEGLYQSGDALCTQCEAEGF iiii-------2222------------------33333333-----!!!!--------33 RHITYYLDRPDVLARFTTKIIADKIKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCY 33------1111----------3333----------------%%%%-----------333 LFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEERFGLEY 3----------------1111---------22221111---------------------- DLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTG -------------------2222---3333---1111----------------------- NRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRINNVRTMRGLQFAEDASPMAHPIR ------3333------------------------------------------1111---- PDMVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAM -----3333------------------------------------2222----------- EDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATPDQAEKQPLHIPF -------3333-------------------1111------------1111---------- AIELYDNEGKVIPLQKGGHPVNSVLNVTQAEQTFVFDNVYFQPVPALLCEFSAPVKLEYK -----1111------iiii----------------------------2222--------- WSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLD -------------------------------------1111-----3333---------- EKIDPALAAEILTLPSVNEMAELFDIIDPIAIAEVREALTRTLATELADELLAIYNANYQ ---------1111-------1111-------------------------------1111- SEYRVEHEDIAKRTLRNACLRFLAFGETHLADVLVSKQFHEANNMTDALAALSAAVAAQL ----------------------1111-----------------------------11111 PCRDALMQEYDDKWHQNGLVMDKWFILQATSPAANVLETVRGLLQHRSFTMSNPNRIRSL 111----------11113333------1111-1111---------11111111------- IGAFAGSNPAAFHAEDGSGYLFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDAKRQEKMR -------3333--1111-----------3333---------3333-3333---------- AALEQLKGLENLSGDLYEKITKALA -----1111---3333--------- >Proto-oncogene tyrosine-p; SWP:P06241; PDB:2DQ7X; KDVWEIPRESLQLIKRLGNGQFGEVWMGTWNGNTKVAIKTLKPGTMSPESFLEEAQIMKK 
-1111-3333---------1111------%%%%-------------3333-------111 LKHDKLVQLYAVVSEEPIYIVTEYMNKGSLLDFLKDGEGRALKLPNLVDMAAQVAAGMAY 1-1111--------------------------------3333-3333------------- IERMNYIHRDLRSANILVGNGLICKIADFGLARLIEDNETARQGAKFPIKWTAPEAALYG -----------3333----%%%%-----1111---------------3333--------- RFTIKSDVWSFGILLTELVTKGRVPYPGMNNREVLEQVERGYRMPCPQDCPISLHELMIH --3333-----------------------3333------------------1111----- CWKKDPEERPTFEYLQSFLEDY ----3333--------3333-- >Deoxyguanosinetriphosphat; SWP:Q72LL5; PDB:2DQBA; RFSREALLELEASRLAPYAQKARDTRGRAHPEPESLYRTPYQKDRDRILHTTAFRRLEYK ---------------1111-3333------------------------------------ TQVLPGWAYYRTRLTHTLEVAQVSRSIARALGLNEDLTEAIALSHDLGHPPFGHTGEHVL -------------------------------------------1111------------- NALQDHGGFEHNAQALRILTHLEVRYPGFRGLNLTYEVLEGIATHEAGQGTLEAQVVDLS ----------------------------------3333---------------------- DAIAYAAHDLDDGFRAGLLHPEELKEVELLQALALEEGLDLLRLPELDRRVLVRQLLGYF -------------------33333333--------------------------------- ITAAIEATHRRVEEAGVQSAEAVRRHPSRLAALGEEAEKALKALKAFLERFYRHPEVLRE --------------------------------------------------1111------ RRKAEAVLEGLFAAYTRYPELLPREVQAKIPEEGLERAVCDYIAGTDRFALEAYRRLSP -----------------1111-3333-------------------3333-----1111- >Ighg protein; SWP:Q91Z05; PDB:2DQTH; VKLVESGGGLVKPGGSLKLSCAASGFTFSNYAMSWVRQTPEKRLEWVVSISSGGSIYYLD -----------2222-----------3333--------1111----------------11 SVKGRFTVSRDNARNILYLQMTSL 11---------------------- >Similar to Ig gamma-1, ch; SWP:Q5I0J0; PDB:2DQUH; EVKLVESGGGLVKPGGSLKLSCAASGFTFSNYAMSWVRQTPEKRLEWVVSISSGGSIYYL ------------2222-----------3333--------1111--------1111----3 DSVKGRFTVSRDNARNILYLQMTSLRSEDTAMYFCARVSHYDGSRDWYFDVWGAGTSVTV 333----------------------1111-----------2222---------------- SSAKTTPPSVYPLAPGSAANSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ---------------1111---------------------%%%%-------------iii LYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPRDC i---------1111-----------3333------------- >Kappa light chain C_regio; 
SWP:Q65ZC0; PDB:2DQUL; DVLMTQTPLSLPVSLGDQASISCRSSQTIVHSNGDTYLDWFLQKPGQSPKLLIYKVSNRF -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDLGVYYCFQGSHVPPTFGGGTKLEIKRADAAPTV 2223333----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -----33331111---------------------iiii--2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ------3333------------1111--------3333- >DIHYDROPTEROATE SYNTHASE; SWP:Q5SLV2; PDB:2DQWA; VRTLWLRDRALDLDRVRLLGVLNLTPRALERAREMVAEGADILDLGAEEEEKRRLLPVLE -----------------------------------1111----------3333------- AVLSLGVPVSVDTRKPEVAEEALKLGAHLLNDVTGLRDERMVALAARHGVAAVVMHMPVP --3333----------------3333---------------------------------- DPATMMAHARYRDVVAEVKAFLEAQARRALSAGVPQVVLDPGFGFGKLLEHNLALLRRLD 11111111---------------------1111--------2222--------------- EIVALGHPVLVGLSRKRTIGELSGVEDPAQRVHGSVAAHLFAVMKGVRLLRVHDVRAHRE ------------2222----------1111------------------------------ ALGVWEALYG ---3333--- >386AA LONG HYPOTHETICAL S; SWP:O59033; PDB:2DR1A; EFEEAFKEVYEMVKPKYKLFTAGPVACFPEVLEIMKVQMFSHRSKEYRKVHMDTVERLRE -3333---3333---------------------1111---1111---------------- FLEVEKGEVLLVPSSGTGIMEASIRNGVSKGGKVLVTIIGAFGKRYKEVVESNGRKAVVL --------------3333----------2222-------1111-------1111------ EYEPGKAVKPEDLDDALRKNPDVEAVTITYNETSTGVLNPLPELAKVAKEHDKLVFVDAV --2222----------1111----------------------------1111-------- SAMGGADIKFDKWGLDVVFSSSQKAFGVPPGLAIGAFSERFLEIAEKMPERGWYFDIPLY -2222-----1111---------1111----------------3333------------- VKYLKEKESTPSTPPMPQVFGINVALRIIEKMGGKEKWLEMYEKRAKMVREGVREIGLDI ------------------------------------------------------------ LAEPGHESPTITAVLTPPGIKGDEVYEAMRKRGFELAKGYGSVKEKTFRIGHMGYMKFED --2222----------22223333----------------1111------------3333 IQEMLDNLREVINELKKQKGI --------------------- >UPF0273 PROTEIN PH0284; SWP:O58022; PDB:2DR3A; RRVKTGIPGVDEILHGGIPERNVVLLSGGPGTGKTIFSQQFLWNGLKMGEPGIYVALEEH 
------2222-1111---2222------2222-------------1111----------3 PVQVRQNMAQFGWDVKPYEEKGMFAMVDAFTAGIGKEYEKYIVHDLTDIREFIEVLRQAI 333----1111------------------3333--------------------------- RDINAKRVVVDSVTTLYINKPAMARSIILQLKRVLAGTGCTSIFVSQVSGFGPGVEHGVD -----------3333-11111111-----------------------------3333--- GIIRLDLDEIDGELKRSLIVWKMRGTSHSMRRHPFDITDKGIIVYPDKVLKR ---------iiii---------2222-----------1111---1111---- >WATER-SOLUBLE CHLOROPHYLL; SWP:O04797; PDB:2DREA; NDEEPVKDTNGNPLKIETRYFIQPASDNNGGGLVPANVDLSHLCPLGIVRTSLPYQPGLP -------1111-----------------------------------------1111---- VTISTPSSSEGNDVLTNTNIAITFDAPIWLCPSSKTWTVDSSSEEKYIITGGDPKSGESF ------------------------------------------1111------1111---- FRIEKYGNGKNTYKLVRYDNGEGKSVGSTKSLWGPALVLNDNAFPIKFREVD ------------------------------1111------------------ >361AA LONG HYPOTHETICAL D; SWP:O57784; PDB:2DRHA; MKAQELGIKIGVFKPGKRNKITDVKGVKVGHVTLIKGKGKLIPGKGPVRTGVTAILPHEG ---1111--------1111----------------------2222--------------- NIYKEKVLAGAFVMNGYSKPVGLIQLWELGTIETPIILTNTLSIGTAVEGLLDYILEENE 3333-----------------------------------1111----------------- DIGVTTGSVNPLVLECNDSYLNDIRGRHVKREHVVEAIKRADEDFEEGAVGAGTGMSAFE 2222-------------3333-3333---3333----1111--------!!!!----%%% FKGGIGSASRIVEIEGKKYTVGALVLSNFGRREDLTIAGVPVGLELKNWPGRGSIIMIIA %------------%%%%-------------1111--iiii---1111------------- TDAPLTGRQLNRVAKRAIVGLARTGGYAYNGSGDIAVAFSTANRIKHYEKEVIEIKALPD -----3333------------1111---1111-------------1111---------33 SVISPLFKATAEAVEEAIINSLLEARTMDGRDNHVRYALPKEELLRIMRRYGRL 33-------------------1111-----%%%%--------------1111-- >D-RIBOSE-BINDING PROTEIN; SWP:P02925; PDB:2DRI; KDTIALVVSTLNNPFFVSLKDGAQKEADKLGYNLVVLDSQNNPAKELANVQDLTVRGTKI --------------------------------------%%%%----------1111---- LLINPTDSDAVGNAVKMANQANIPVITLDRQATKGEVVSHIASDNVLGGKIAGDYIAKKA --------3333------------------------------------------------ GEGAKVIELQGIAGTSAARERGEGFQQAVAAHKFNVLASQPADFDRIKGLNVMQNLLTAH -----------2222--------------------------%%%%-----------1111 
PDVQAVFAQNDEMALGALRALQTAGKSDVMVVGFDGTPDGEKAVNDGKLAATIAQLPDQI ------------------------------------------------------------ GAKGVETADKVLKGEKVQAKYPVDLKLVVKQ ------------------------------- >PROTEIN (TRAMTRACK DNA-BI; SWP:P17789; PDB:2DRPA; FTKEGEHTYRCKVCSRVYTHISNFCRHYVTSHKRNVKVYPCPFCFKEFTRKDNMTAHVKI -------------------------------------------------3333------- IHK --- >chimera of CD48 antigen a; SWP:P08921; PDB:2DRUA; FQDQSVPNVNAITGSNVTLTILKHPLASYQRLTWLHTTNQKILEYFPNGKKTVFESVFKD -----------2222----------------------------------------1111- RVDLDKTNGALRIYNVSKEDRGDYYMRMLHETEDQWKITMEVYEMVSKPMIYWECSNATL ----------------3333---------------------------------------- TCEVLEGTDVELKLYQGKEHLRSLRQKTMSYQWTNLRAPFKCKAVNRVSQESEMEVVNCP ---------------!!!!--------------------------1111----------- >D-AMINO ACID AMIDASE; SWP:Q9LCC8; PDB:2DRWA; SDLNNAIQGILDDHVARGVVGVSLALCLPGEETSLYQSGYADKFNKMPMTGDHLFRIASC --------------1111---------2222-----------1111---1111------- TKSFIATGLHLLVQDGTVDLDEPITRWFPDLPKAAQMPVRILLNHRSGLPDFETSMPMIS ------------------1111--1111--2222---3333----------1111----- DKSWTAQEIVDFSFRHGVQKEPWHGMEYSNTGYVLAGMIIAHETGKPYSDHLRSRIFAPL --------------------2222---------------------------------111 GMKDTWVGTHETFPIEREARGYMHAAADDENPQWDVSGAGDPVDGVWDSTEWFPLSGANA 1-----3333---3333--------1111-------------iiii-------3333!!! AGDMVSTPRDIVKFLNALFDGRILDQKRLWEMKDNIKPAFFPGSNTVANGHGLLLMRYGS !----------------1111---3333------------2222-------------!!! 
SELKGHLGQIPGHTSIMGRDEETGAALMLIQNSGAGDFESFYLKGVNEPVDRVLEAIKNS !--------2222-----------------------1111------------------11 RS 11 >29-KDA GALACTOSE-BINDING ; SWP:O96048; PDB:2DS0A; PKFFYIKSELNGKVLDIGGQNPAPGSKIITWDQKKGPTAVNQLWYTDQQGVIRSKLNDFA ----------------2222---------------1111-------1111---------- IDASHEQIETQPFDPNNPKRAWIVSGNTIAQLSDRDNVLGVIKSDKGASAHICAWKQHGG -------------1111-------------1111-------------------------- PNQKFIIESE ---------- >TRIPARTITE MOTIF PROTEIN ; SWP:Q9H8W5; PDB:2DS4A; GSSGSSGEVDPAKCVLQGEDLHRAREKQTASFTLLCKDAAGEIMGRGGDNVQVAVVPKDK ----------1111-----1111--------------1111------------------- KDSPVRTMVQDNKDGTYYISYTPKEPGVYTVWVCIKEQHVQGSPFTVTVRRKH ----------------------------------%%%%--------------- >ATP-dependent Clp proteas; SWP:Q8ZRC0; PDB:2DS5A; GKLLYCSFCGKSQHEVRKLIAGPSVYICDECVDLCNDIIREEI -----------1111---------------------------- >ADP-SUGAR PYROPHOSPHATASE; SWP:Q9UKK9; PDB:2DSCA; KQYIISEELISEGKWVKLEKTTYMDPTGKTRTWESVKRTTRKEQTADGVAVIPVLQRTLH ------------------------1111----------------------------1111 YECIVLVKQFRPPMGGYCIEFPAGLIDDGETPEAAALRELEEETGYKGDIAECSPAVCMD ----------3333------------2222-----------------------------1 PGLSNCTIHIVTVTINGDDAENARPKPKPGDGEFVEVISLPKNDLLQRLDALVAEEHLTV 111------------11111111-----------------3333---------------- DARVYSYALALKHAN --------------- >PYRIMIDINE-NUCLEOSIDE (TH; SWP:Q5SHF9; PDB:2DSJA; MNPVAFIREKREGKKHRREDLEAFLLGYLRDEVPDYQVSAWLMAAFLRGLDPEETLWLTE ---------1111--------------1111--3333----------------------- TMARSGKVLDLSGLPHPVDKHSSGGVGDKVSLVVGPILAASGCTFAKMSGRGLAHTGGTI -1111-----1111--------------3333--------------------!!!!---- DKLESVPGWRGEMTEAEFLERARRVGLVIAAQSPDLAPLDGKLYALRDVTATVESVPLIA -----2222----------------------3333-3333-------------------- SSIMSKKLAAGARSIVLDVKVGRGAFMKTLEEARLLAKTMVAIGQGAGRRVRALLTSMEA -------3333---------------------------------1111------------ PLGRAVGNAIEVREAIEALKGEGPGDLLEVALALAEEALRLEGLDPALARKALEGGAALE -----------------1111------------------1111---------1111---- 
KFRAFLEAQGGDPRAVEDFSLLPLAEEHPLRAEREGVVREVDAYKVGLAVLALGGGRKRK ------1111--3333--1111------------------------------------22 GEPIDHGVGVYLLKKPGDRVERGEALALVYHRRRGLEEALGHLREAYALGEEAHPAPLVL 22--1111------2222--2222------------------------------------ EAI --- >CHITINASE; SWP:Q8U1H5; PDB:2DSKA; GPNANPIPEHFFAPYIDMSLSVHKPLVEYAKLTGTKYFTLAFILYSSVYNGPAWAGSIPL ----------------1111----3333------------------1111---%%%%-11 EKFVDEVRELREIGGEVIIAFGGAVGPYLCQQASTPEQLAEWYIKVIDTYNATYLDFDIE 11--------1111-------------3333----------------------------- AGIDADKLADALLIVQRERPWVKFSFTLPSDPGIGLAGGYGIIETMAKKGVRVDRVNPMT ------------------1111-------------------------------------- MDYYWTPSNAENAIKVAENVFRQLKQIYPEKSDEEIWKMIGLTPMIGVNDDKSVFTLEDA ---------------------------1111-----------------1111-------- QQLVDWAIQHKIGSLAFWSVDRDHPGPTGEVSPLHRGTNDPDWAFSHVFVKFMEAFGYTF ------------------1111----2222----------2222--------3333---- >HYPOTHETICAL PROTEIN YQAI; SWP:P45906; PDB:2DSMA; MVENPMVINNWHDKLTETDVQIDFYGDEVTPVDDYVIDGGEIILRENLERYLREQLGFEF ----------------------1111---1111----------3333------------- KNAQLEHHHHHH ------------ >Insulin-like growth facto; SWP:P05019; PDB:2DSPI; PETLCGAELVDALQFVCGDRGFYFNKPTQTGIVDECCFRSCDLRRLEMYCAPLKPAK -----------------3333-------------------------1111------- >Insulin-like growth facto; SWP:P08833; PDB:2DSQG; EPCRIELYRVVESLAKASKFYLPNCNKNGFYHSRQCETSMEAGLCWCVYPWNGKRIPGSP -------------------------3333--------------------------2222- EIRGDPNCQIYFN ------------- >Insulin-like growth facto; SWP:P22692; PDB:2DSRG; GSCQSELHRALERLAASQSRTHEDLYIIPIPNCDRNGNFHPKQCHPALDGQRGKCWCVDR --------------------3333---------1111----------iiii--------- KTGVKLPGGLEPKGELDCH -----------3333---- >HYPOTHETICAL PROTEIN TTHA; SWP:Q5SI36; PDB:2DSTA; RRAGYLHLYGLNLVFDRVGKGPPVLLVAEEASRWPEALPEGYAFYLLDLPGYGRTEGPRM -------iiii-------------------------------------2222-------- APEELAHFVAGFAVMMNLGAPWVLLRGLGLALGPHLEALGLRALPAEGVEVAEVLSSKLS -------------1111---------3333------1111-------------------- YG -- >RUBREDOXIN; 
SWP:P00270; PDB:2DSXA; MDIYVCTVCGYEYDPAKGDPDSGIKPGTKFEDLPDDWACPVCGASKDAFEKQ -------------3333-3333--22221111-1111-------1111---- >AT-RICH DNA-BINDING PROTE; SWP:Q5SHS3; PDB:2DT5A; KVPEAAISRLITYLRILEELEAQGVHRTSSEQLGGLAQVTAFQVRKDLSYFGSYGTRGVG ------------------------------------------------1111---2222- YTVPVLKRELRHILGLNRKWGLCIVGMGRLGSALADYPGFGESFELRGFFDVDPEKVGRP --------------1111--------------3333----3333--------3333---- VRGGVIEHVDLLPQRVPGRIEIALLTVPREAAQKAADLLVAAGIKGILNFAPVVLEVPKE 2222---33331111------------3333--------------------------111 VAVENVDFLAGLTRLSFAILNPKWREEMMG 1-------------------11113333-- >SPLICING FACTOR 3 SUBUNIT; SWP:Q15459; PDB:2DT6A; GEVRNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNPNDPYHAYYRHKVSEFKEGKA -----------------3333-------33331111--3333------------------ QEPS ---- >Splicing factor 3 subunit; SWP:Q15459; PDB:2DT7B; GAQVIQETIVPKEPPPEFEFIADPPSISAFDLDVVKLTAQFVARNGRQFLTQLMQKEQRN ------------------------------------------------------1111-3 YQFDFLRPQHSLFNYFTKLVEQYTK 333---3333--------------- >RAL GUANINE NUCLEOTIDE EX; SWP:A2AUQ2; PDB:2DTCA; PTEGPLRRKTLLKEGRKPALSSWTRYWVVLSGATLLYYGAKSLRGTDRKHYKSTPGKKVS ------------iiii------------------------------3333---------- IVGWVQLPDDPEHPDIFQLNNPDKGNVYKFQTGSRFHAILWHKHLDDACKSSRP 2222-----3333-------1111-----------------------1111--- >DIPHTHERIA TOXIN REPRESSO; SWP:P33120; PDB:2DTR; LVDTTEMYLRTIYELEEEGVTPLRARIAERLEQSGPTVSQTVARMERDGLVVVASDRSLQ ---------------------------------------------1111----1111--- MTPTGRTLATAVMRKHRLAERLLTDIIGLDINKVHDEACRWEHVMSDEVERRLVKVLKDV -----------------------------1111------3333----------------- SRSPFGNPIPGLDELGV --1111----3333--- >LACTATE OXIDASE; SWP:Q44467; PDB:2DU2A; MNNNDIEYNAPSEIKYIDVVNTYDLEEEASKVVPHGGFNYIAGASGDEWTKRANDRAWKH -1111---------------3333----1111------------!!!!---------333 KLLYPRLAQDVEAPDTSTEILGHKIKAPFIMAPIAAHGLAHTTKEAGTARAVSEFGTIMS 3------------------iiii------------3333-1111---------------- ISAYSGATFEEISEGLNGGPRWFQIYMAKDDQQNRDILDEAKSDGATAIILTADSTVSGN 
-1111----------iiii----------------------------------------- RDRDVKNKFVYPFGMPIVQRYLRGTAEGMSLNNIYGASKQKISPRDIEEIAGHSGLPVFV --------------3333----!!!!---3333-3333----3333-------------- KGIQHPEDADMAIKRGASGIWVSNHGARQLYEAPGSFDTLPAIAERVNKRVPIVFDSGVR ----3333----1111-------%%%%-------3333--------iiii---------- RGEHVAKALASGADVVALGRPVLFGLALGGWQGAYSVLDYFQKDLTRVMQLTGSQNVEDL 3333----1111------3333---------------------------1111--33331 KGLDLFDNPYGYEY 111-------1111 >O-PHOSPHOSERYL-TRNA SYNTH; SWP:O30126; PDB:2DU3A; MKFDPQKYRELAEKDFEAAWKAGKEILAERSPNELYPRVGFSFGKEHPLFATIQRLREAY -----------------------1111---1111-------------------------- LSIGFSEVVNPLIVEDVHVKKQFGREALAVLDRCFYLATLPKPNVGISAEKIRQIEAITK 1111-------------------3333---1111---------------1111------- REVDSKPLQEIFHRYKKGEIDGDDLSYLIAEVLDVDDITAVKILDEVFPEFKELKPISST -------3333-1111-----------------------33331111-3333-------- LTLRSHMTTGWFITLSHIADKLPLPIKLFSIDRCFRREQGEDATRLYTYFSASCVLVDEE -----3333--------1111--------------------1111--------------- LSVDDGKAVAEALLRQFGFENFRFRKDEKRSKYYIPDTQTEVFAFHPKLVGSSTKYSDGW --------------1111--------333311112222-------3333------1111- IEIATFGIYSPTALAEYDIPYPVMNLGLGVERLAMILYGYDDVRKMVYPQIHGEIKLSDL -----------------------------------------3333---3333-------- DIAREIKVKEVPQTAVGLKIAQSIVETAEKHASEPSPCSFLAFEGEMMGRNVRVYVVEEE --1111------------------------1111------------%%%%---------- ENTKLCGPAYANEVVVYKGDIYGIPKTKKWRSFFEEGVPTGIRYIDGFAYYAARKVEEAA ------1111------iiii------3333------------------------------ MREQEEVKVKARIVENLSDINLYIHENVRRYILWKKGKIDVRGPLFVTVKAEIE ---------------3333-------------1111------------------ >D-AMINO-ACID OXIDASE; SWP:P14920; PDB:2DU8A; MRVVVIGAGVIGLSTALCIHERYHSVLQPLDIKVYADRFTPLTTTDVAAGLWQPYLSDPN --------3333------------------------------3333-------------- NPQEADWSQQTFDYLLSHVHSPNAENLGLFLISGYNLFHEAIPDPSWKDTVLGFRKLTPR 3333-----------1111-11113333----------------1111---------333 ELDMFPDYGYGWFHTSLILEGKNYLQWLTERLTERGVKFFQRKVESFEEVAREGADVIVN 
33333---------------3333--------1111--------------1111------ CTGVWAGALQRDPLLQPGRGQIMKVDAPWMKHFILTHDPERGIYNSPYIIPGTQTVTLGG -!!!!3333--3333-----------3333-------1111----------1111----- IFQLGNWSELNNIQDHNTIWEGCCRLEPTLKNARIIGERTGFRPVRPQIRLEREQLRTGP -----------3333-----------1111------------------------------ SNTEVIHNYGHGGYGLTIHWGCALEAAKLFGRILEEKKLS ----------!!!!3333---------------------- >MS0616; SWP:Q8R2U6; PDB:2DUKA; TRTYDREGFKKRAACLCFRSEQEDEVLLVSSSRYPDQWIVPGGGMEPEEEPGGAAVREVY -------------------3333----------1111------------3333------- EEAGVKGKLGRLLGIFENQDRKHRTYVYVLTVTEILEDWEDSVNIGRKREWFKVEDAIKV --------------------------------------1111------------------ LQCHKPVHAEYLEKLKLG -----------1111--- >N(2),N(2)-dimethylguanosi; SWP:O59493; PDB:2DULA; LIEVQEGKAKILIPPVFYNPRMALNRDIVVVLLNILNPKIVLDALSATGIRGIRFALETP -----!!!!---------3333-----------------------!!!!----------- AEEVWLNDISEDAYELMKRNVMLNFDGELRESKGRAILKGEKTIVINHDDANRLMAERHR --------------------1111------------------------------------ YFHFIDLDPFGSPMEFLDTALRSAKRRGILGVTATDGAPLCGAHPRACLRKYLAVPLRGE ------------11113333----------------3333------------------11 LCHEVGTRILVGVIARYAAKYDLGIDVILAYYKDHYFRAFVKLKDGARKGDETLEKLGYI 11---------------3333--------------------------------------- YFDDKTGKFELEQGFLPTRPNAYGPVWLGPLKDEKIVSKMVKEAESLSLARKKQALKLLK -------------------------------------------1111------------- MIDQELDIPLFYDTHAIGRRLKIETKKVEEIISALREQGYEATRTHFSPTGIKTSAPYEV -----------------------------------1111-----3333------------ FIETIKR ------- >HYPOTHETICAL PROTEIN PH08; SWP:O58553; PDB:2DUMA; MFRKVLFPTDFSEGAYRAVEVFEKRNKMEVGEVILLHVIDEGTLEELMELKDIKEKLKEE -----------------------------------------3333------3333--333 ASRKLQEKAEEVKRAFRAKNVRTIIRFGIPWDEIVKVAEEENVSLIILPSRGKLSHEFLG 3----------------------------3333-----1111------------------ STVMRVLRKTKKPVLIIKEVDEN ----------------------- >DNA POLYMERASE MU; SWP:Q9H980; PDB:2DUNA; GSSGSSGSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFRVLDACSSEATHVVMEETSA 
----------3333----1111---3333-----------------3333--------33 EEAVSWQERRMAAAPPGCTPPALLDISWLTESLGAGQPVPVECRHRLEVAGPRKGPLSPA 33------------2222------3333-------------3333--------------- WMPAYACSGPSSG ------------- >NITRITE REDUCTASE; SWP:NA; PDB:2DV6A; HAPVVFTLRTGIAEGRMVYIGVGGDIDHKINPTLVVHEGETVQVNLVNGEGAQHDVVVDQ ------------iiii---------2222-------2222-------------------- YAARSAIVNGKNASSTFSFVASKVGEFNYYCSIAGHRQAGMEGNIQVLPGNRAEMKSSGA ---------2222-------------------22223333-------------------- DITRDPADLPGPIGPRQAKTVRIDLETVEVKGQLDDNTTYTYWTFNGKVPGPFLRVRVGD ----1111--------------------------2222------iiii--------2222 TVELHLKNHKDSLMVHSVDFHGATGPGGAAAFTQTDPGEETVVTFKALIPGIYVYHCATP --------1111-------1111-22223333---2222--------------------- SVPTHITNGMYGLLLVEPEGGLPQVDREFYVMQGEIYTVKSFGTSGEQEMDYEKLINEKP -----------------1111-------------------2222---------------- EYFLFNGSVGSLTRSHPLYASVGETVRIFFGVGGPNFTSSFHVIGEIFDHVYSLGSVVSP -------2222---------2222------------------2222-----2222----- PLIGVQTVSVPPGGATIVDFKIDRAGRYILVDHALSRLEHGLVGFLNVDGPKNDSIMHEG ----------2222---------------------3333--------------3333--- PA -- >UPF0130 PROTEIN APE0816; SWP:Q9YDV3; PDB:2DVKA; DPGAEKVLARINRPSKIVSTSSCTGRITLIEGEAHWLRVAYKTHHPISRSEVERVLRRGF 2222---------1111------------------------------------------- TNLWLKVTGPILHLRVEGWQCAKSLLEAARRNGFKHSGVISIAEDSRLVIEIMSSQSMSV ------------------------------------------1111-------------- PLVMEGARIVGDDALDMLIEKANTILVESRIGLDTFSREVEELVEC ---iiii-----------------------------------3333 >ACYL-COA DEHYDROGENASE; SWP:Q5SH14; PDB:2DVLA; LTQEQRLVLDAVRRVAREVLYPLAPEYDRKAEYPWPQLKALAELGLLGMTTPEEWGGVGL -------------------3333----1111-------------1111---3333----- DSVTWALALEELAAADPSVAVIVSVTSGLPQYMLLRFGSEAQKRRYLVPLARGEWIGAFC ----------------------------3333---------------------------- LTEPQAGSDAKSLRAEARRVKGGFVLNGVKSWITSAGHAHLYVVMARTEKGISAFLVEKG --1111--3333-------2222-----------2222---------1111------222 TPGLSFGRPEEKMGLHAAHTAEVRLEEVFVPEENLLGEEGRGLAYALAGLDSGRVGVAAQ 
2-------------3333------------3333---2222------------------- AVGIARGAFEIAKAYAEEREQFGKKLKEHQAIAFKIADMHVKIAAARALVLEAARKKDRG --------------------iiii3333----------------------------1111 ERFTLEASAAKLFASAAAVEVTREAVQVLGGYGYHRDYRVERYYRDAKVTEIYEGTSEIQ ----------------------------------3333----------1111-------- RLVIARELYR ------1111 >Thermophilic reversible g; SWP:Q60GU1; PDB:2DVTA; MQGKVALEEHFAIPETLQDSAGFVPGDYWKELQHRLLDIQDTRLKLMDAHGIETMILSLN ------------3333-------------------------------------------- APAVQAIPDRRKAIEIARRANDVLAEECAKRPDRFLAFAALPLQDPDAATEELQRCVNDL --3333------------------------1111-------1111--------------- GFVGALVNGFSQEGDGQTPLYYDLPQYRPFWGEVEKLDVPFYLHPRNPLPQDSRIYDGHP ------------!!!!-------3333---------------------33333333--11 WLLGPTWAFAQETAVHALRLMASGLFDEHPRLNIILGHMGEGLPYMMWRIDHRNAWVKLP 11-3333----------------1111-1111----%%%%-3333--------3333--- PRYPAKRRFMDYFNENFHITTSGNFRTQTLIDAILEIGADRILFSTDWPFENIDHASDWF -------3333---------2222-------------3333------------------- NATSIAEADRVKIGRTNARRLFKLD -------------------1111-- >26S PROTEASOME NON-ATPASE; SWP:Q9Z2X2; PDB:2DVWA; GCVSNIMICNLAYSGKLDELKERILADKSLATRTDQDSRTALHWACSAGHTEIVEFLLQL -----3333------------------3333---1111---------------------- GVPVNDKDDAGWSPLHIAASAGRDEIVKALLVKGAHVNAVNQNGCTPLHYAASKNRHEIA -------1111-3333--------------1111-1111-1111-3333--1111----- VMLLEGGANPDAKDHYDATAMHRAAAKGNLKMVHILLFYKASTNIQDTEGNTPLHLACDE --------1111-1111-3333--------------1111------1111-------111 ERVEEAKFLVTQGASIYIENKEEKTPLQVAKGGLGLILKRLAEGEEASM 1--------1111------1111-3333--------------------- >Psmc4 protein; SWP:Q8K3E0; PDB:2DVWB; MDRRQKRLIFSTITSKMNLSEEVDLEDYVARPDKISGADINSICQESGMLAVRENRYIVL -------------1111--1111----1111--------------------1111----3 AKDFEKAYKTVIK 333---------- >PUTATIVE EXPORTED PROTEIN; SWP:Q7W0A0; PDB:2DVZA; AYPSKAIRVIVPFAPGGSTDIIARLVTQRSQELGQPVVENKGGAGGAIGASEAARAEPDG -------------2222-------------------------%%%%------1111---- YTLSIATVSTAVNPACRPKDLPYDPIKDFQPVTNFANTANVVAVNPKFPAKDFKGFLEEL 
------3333---3333------3333-----------------1111------------ KKNPGKYSYGSSGTCGVLHLGESFKATGTDIVHVPYKGSGPAVADAVGGQIELIFDNLPS --2222------11113333------------------------------------3333 SPQIQAGKLRAAIAWPTRIDAIKDVPTFADAGFPVLNQPVWYGLLAPKGTPDVVNKLRDA -3333-------------3333----3333--3333----------2222---------- AVVALKDPKVIKALDDQGSAPSGNTPEEFAKEIKEQYDWAQDVVKKQNIKLD ---------------------------------------------------- >433aa long hypothetical p; SWP:O58056; PDB:2DWCA; VVMIKLRDELGTATTDSAQKILLLGSGELGKEIAIEAQRLGVEVVAVDRYANAPAMQVAH -----------2222-------------------------------------3333---- RSYVGNMMDKDFLWSVVEREKPDAIIPEIEAINLDALFEFEKDGYFVVPNARATWIAMHR -----1111-------------------------------1111-----3333------- ERLRETLVKEAKVPTSRYMYATTLDELYEACEKIGYPCHTKAIMSYFVKGPEDIPKAWEE -------------------------------------------------3333---1111 EKIIVEEHIDFDVEVTELAVRHFDENGEIVTTFPKPVGHYQIDGDYHASWQPAEISEKAE -----------------------1111--------------------------------- REVYRIAKRITDVLGGLGIFGVEMFVKGDKVWANEVSPRPHDTGMVTLASHPPGFSEFAL --------------------------!!!!----------3333-------2222----- HLRAVLGLPIPGEWVDGYRLFPMLIPAATHVIKAKVSGYSPRFRGLVKALSVPNATVRLF --------------iiii--------------------------3333------------ GKPEAYVGRRLGIALAWDKDVEVAKRKAEMVAHMIELRTRSSDWHD -----2222----------------------1111----------- >PROTEIN RUFY3; SWP:Q9D394; PDB:2DWKA; ANERNLNAKLSIKGLIESALNLGRTLDSDYAPLQQFFVVEHCLKHGLKANKSFWGPLELV --------------------------1111-----------1111-----!!!!----33 EKLVPEAAEITASVKDLPGLKTPVGRGRAWLRLALQKKLSEYKALINKKELLSEFYEVNA 33-3333---------2222------------------3333--1111--------1111 LEEEGAIIAGLLVGLNVIDANFCKGEDLDSQV --------------1111-----3333----- ------------------------------------------------- >RHO GUANINE NUCLEOTIDE EX; SWP:Q9NR80; PDB:2DX1A; LAINELISDGSVVCAEALWDHVTDDQELGFKAGDVIEVDATNREWWWGRVADGEGWFPAS -----------------------1111---2222-----------------------333 FVRLRVNQDAQSSKDQRTNVINEILSTERDYIKHLRDICEGYVRQCRKRADFSEEQLRTI 3----------------------------------------------------------- 
FGNIEDIYRCQKAFVKALEQRFNRERPHLSELGACFLEHQADFQIYSEYCNNHPNACVEL ------------------1111---3333----------3333----------------- SRLTKLSKYVYFFEACRLLQKIDISLDGFLLTPVQKICKYPLQLAELLKYTHPQHRDFKD -33333333-----------------------------3333-----1111---3333-- VEAALHAKNVAQLINERKRRLENIDKIAQWQSSIEDWEGEDLLVRSSELIYSGELTRVTQ ----------------------------------------3333---------------- PQAKSQQRFFLFDHQLIYCKKDLLRRDVLYYKGRLDDGLEVVIKNAFRLHSHLLCTRKPE -----------2222------2222--------------------------------333 QKQRWLKAFAREREQVQ 3---------------- >HYPOTHETICAL PROTEIN TTHA; SWP:Q72GI6; PDB:2DX6A; ARIRVVQGDITEFQGDAIVNAANNYLKLGAGVAGAILRKGGPSIQEECDRIGKIRVGEAA --------3333----------1111----3333--------------------2222-- VTGAGNLPVRYVIHAAVLGDEPASLETVRKATKSALEKAVELGLKTVAFPLLGTGVGGLP ---!!!!--------------------------------1111-------22223333-- VEAVARVLEEIKKAPDTLEVTLYGYREEDAEAIRRAL ----------11113333---------------1111 >PROTEIN YBAK; SWP:A2UKU4; PDB:2DXAA; TPAVKLLEKNKISFQIHTYEHDPAETNFGDEVVKKLGLNPDQVYKTLLVAVNGDKHLAVA -------1111----------3333-------------3333--------iiii------ VTPVAGQLDLKKVAKALGAKKVEADPVAQRSTGYLVGGISPLGQKKRLPTIIDAPAQEFA ----------------------------------2222--------------3333---- TIYVSGGKRGLDIELAAGDLAKILDAKFADIARR -------2222----------------------- >NUCLEOSIDE DIPHOSPHATE KI; SWP:O58429; PDB:2DXEA; ETERTLVIIKPDAVVRGLIGEIISRFEKKGLKIVGMKMIWIDRELAEKHYEEHREKPFFK --------------------------3333-------------------3333--1111- ALIDYITKTPVVVMVLEGRYAVEVVRKMAGATDPKDAAPGTIRGDFGLEVSDAICNVIHA ----1111------------------------3333------------------------ SDSKESAEREISLFFKPEELFEYPRAADWFYKKG ---------------1111-----11111111-- >ACETYLTRANSFERASE; SWP:Q8UD96; PDB:2DXQA; DAISLRAAGPGDLPGLLELYQVLNPSDPELTTQEAGAVFAALAQPGLTIFVATENGKPVA ---------1111----------1111---3333---------2222------iiii--- TATLLIVPNLTRAARPYAFIENVVTLEARRGRGYGRTVVRHAIETAFGANCYKVLLTGRH ----------%%%%-----------1111-----------------1111---------- DPAVHAFYESCGFVQNKTGFQIRQD --------1111------------- >BIOTIN--[ACETYL-COA-CARBO; SWP:O57883; 
PDB:2DXUA; MLGLKTSIIGRRVIYFQEITSTNEFAKTSYLEEGTVIVADKQTMGHGALNRKWESPEGGL -------2222--------------------2222------------%%%%--------- WLSIVLSPKVPQKDLPKIVFLGAVGVVETLKEFSIDGRIKWPNDVLVNYKKIAGVLVEGK ----------33331111------------1111------------%%%%---------! GDKIVLGIGLNVNNKVPNGATSMKLELGSEVPLLSVFRSLITNLDRLYLNFLKNPMDILN !!!-------------2222-3333------3333------------------1111--- LVRDNMILGVRVKISFEGIAEDIDDFGRLIIRLDSGEVKKVIYGDVSLRFL -----------------------1111-----3333--------------- >ADENINE PHOSPHORIBOSYLTRA; SWP:Q325C7; PDB:2DY0A; TATAQQLEYLKNSIKSIQDYPKPGILFRDVTSLLEDPKAYALSIDLLVERYKNAGITKVV ----------1111----------------3333----------------1111------ GTEARGFLFGAPVALGLGVGFVPVRKPGKLPRETISETYDLEYGTDQLEIHVDAIKPGDK ---3333------------------2222-----------1111------1111-2222- VLVVDDLLATGGTIEATVKLIRRLGGEVADAAFIINLFDLGGEQRLEKQGITSYSLVPFP ---------------------1111-----------1111------1111---------- GH -- >CHROMO DOMAIN PROTEIN 1; SWP:P32657; PDB:2DY7A; QPEDFHGIDIVINHRLKTSLEEGKVLEKTVPDLNNCKENYEFLIKWTDESHLHNTWETYE -----------------%%%%------------------------------------333 SIGQVRGLKRLDNYCKQFIIE 3-------------------- ------------------------------------------------------------ --------- >PROBABLE 16S RRNA-PROCESS; SWP:Q5SJH5; PDB:2DYIA; MRLVEIGRFGAPYALKGGLRFRGEPVVLHLERVYVEGHGWRAIEDLYRVGEELVVHLAGV ------------------------3333------2222----------!!!!----2222 TDRTLAEALVGLRVYAEVADLPPLEEGRYYYFALIGLPVYVEGRQVGEVVDILDAGAQDV ------1111------3333----2222-33332222---iiii----------!!!!-- LIIRGVGERLRDRAERLVPLQAPYVRVEEGSIHVDPIPGLFD --------3333------1111-----1111-----2222-- >RIBOSOME-BINDING FACTOR A; SWP:Q5SJV1; PDB:2DYJA; GKAHLEAQLKRALAEEIQALEDPRLFLLTVEAVRLSKDGSVLSVYVEAFREEEGALRALS ----------------1111-3333----------1111--------------------- RAERRLVAALARRVRMRRLPRLEFLPWRASP ---------1111------------3333-- >GTP-BINDING PROTEIN; SWP:Q5SIH8; PDB:2DYKA; MHKVVIVGRPNVGKSSLFNRLLKKRSDLKEGVVETDRGRFLLVDTGGLWSGDKWEKKIQE --------2222----------------------3333------3333------------ 
KVDRALEDAEVVLFAVDGRAELTQADYEVAEYLRRKGKPVILVATKVDDPKHELYLGPLY ---1111-----------------------------------------3333---3333- GLGFGDPIPTSSEHARGLEELLEAIWERLP -----------1111----------1111- >AUTOPHAGY PROTEIN 5; SWP:Q12380; PDB:2DYOA; HMNDIKQLLWNGELNVLVSIDPSFLMKGSPREIAVLRIRVPRETYLVNYMPLIWNKIKSF --------------------3333-22223333-----------3333--------3333 LSFDPEKYFWFEHNKTPIPWNYPVGVLFDCLAGKSAVKDVLTFLRIHLVMGDSLPPTIIP ------------iiii---------------!!!!-------------------2222-- IASSKTQAEKFWFHQWKQVCFILNGSSKAIMSLSVNEARKFWGSVITRNFQDFIEISNKI --------------------------3333---------------------------333 SSSRPRHIPLIIQTSRTSGTFRISQPTISMTGVNPTLKDIEGDILDVKEGDVMVICQGIE 3----------------------------2222--33333333--3333------iiii- IPWHMLLYDLYSKLRSFDGFLYITLVPIK -1111----------1111---------- >Autophagy protein 16; SWP:Q03818; PDB:2DYOB; DSMDDLLIRRLTDRNDKEAHLNELFQDNSGAIGGNI ----------------11113333---1111----- >AUTOPHAGY-RELATED PROTEIN; SWP:ATG3_YEAST; PDB:2DYTA; TFLTTGQITPEEFVQAGDYLCHMFPTWKWNEESSDISYRDFLPKNKQFLIIRKVPCDKRA 3333-------------------3333-----1111--11111111------------33 EQCIDDIDELIQDMEIKMAQERYYDLYIAYSTSYRVPKMYIVGFNSNGSPLSPEQMFEDI 33------------------------------------------1111---3333-1111 SADYRTKTATIEKLPFYKNSVLSVSIHPCKHANVMKILLDKVRVVRQRRRKELQEEQELD 11111111-----1111------------------------------------------- GVGDWDSLRVDQYLIVFLKFITSVTPSIQHDYT --------3333-------3333-1111----- >FORMAMIDASE; SWP:O25836; PDB:2DYUA; GFLVAAIQFPVPIVNSRKDIDHNIESIIRTLHATKAGYPGVELIIFPEYSTQGLNTAKWL -------------------------------------1111-----2222----1111-- SEEFLLDVPGKETELYAKACKEAKVYGVFSIMERNPDSNKNPYNTAIIIDPQGEIILKYR 3333--------------------------------1111---------1111------- KLFPWNPIEPWYPGDLGMPVCEGPGGSKLAVCICHDGMIPELAREAAYKGCNVYIRISGY -----------------------iiii------3333---------1111---------- STQVNDQWILTNRSNAWHNLMYTVSVNLAGYDNVFYYFGEGQICNFDGTTLVQGHRNPWE ----------------1111------------------------1111--------2222 IVTGEIYPKMADNARLSWGLENNIYNLGHRGYVAKPGGEHDAGLTYIKDLAAGKYKLPWE 
------3333--------11113333----33332222-----------1111---1111 DHMKIKDGSIYGYPTTGGRFGK -------3333----------- >UPF0076 PROTEIN PH0854; SWP:O58584; PDB:2DYYA; MKEVIFTENAPKPIGPYSQAIKAGNFLFIAGQIPIDPKTGEIVKGDIKDQTRQVLENIKA ----------------------!!!!---------------------------------- ILEAAGYSLNDVIKVTVYLKDNDFAKMNEVYAEYFGESKPARVAVEVSRLPKDVLIEIEA --1111-1111-------------3333-3333----------------2222------- IAYKE ----- >432aa long hypothetical d; SWP:Q973K9; PDB:2E0IA; MDCIFIFRRDLRLEDNTGLNYALSECDRVIPVFIADPRQLINNPYKSEFAVSFMINSLLE -----------------------------------3333---1111-------------- LDDELRKKGSRLNVFFGEAEKVVSRFFNKVDAIYVNEDYTPFSISRDEKIRKVCEENGIE --------------------------1111------------------------------ FKAYEDYLLTPKSLFHHRNFTSFYNEVSKVKVREPETMEGSFDVTDSSMNVDFLLTFKKI ----------3333------------------------------3333-33333333--- ESPLFRGGRREGLYLLHRNVDFRRRDYPAENNNYRLSPHLKFGTISMREAYYTQKGKEEF -1111---------1111--3333--3333-----------------------1111--- VRELYWRDFFTLLAYYNPHVFGHCYRREYDNISWENNESYFEAWKEGRTGYPIIDAGMRM -----------------3333----3333-------3333-------------------- LNSTGYINGRVRMLVAFFLVKVLFVDWRWGERYFATKLVDYDPAINNGNWQWIASTGVDY -------------------------------------1111------------------- MFRVFNPWKQQEKFDPEAKFIKEWVEELKDVPPSIIHSIYKTKVPGYPSPIVNWLERVNY --------------1111-3333-3333---3333--1111--2222------------- VKSEYKNV -------- >PRECORRIN-2 C20-METHYLTRA; SWP:Q8KFD9; PDB:2E0NA; GSIISVSLGPGDPGLITVKALSQLREADVIYYPGTVSASGAVTSVALDILKEFDLDPSKL -----------3333---------------------1111---------1111--3333- RGMLVPMSYAANYASMAEEVQAGRRVAVVSVGDGGFYSTASAIIERARRDGLDCSMTPGI --------3333--------------------1111--3333------------------ PAFIAAGSAAGMPLALQSDSVLVLAQIDEIGELERALVTHSTVVVMKLSTVRDELVSFLE ----------------------------3333---3333-----------1111------ RYAKPFLYAEKVGMAGEFITMEVDALRSRAIPYFSLLVCSPHCRQSTLS ----------2222-------33331111------------3333---- >216AA LONG HYPOTHETICAL A; SWP:O57848; PDB:2E1BA; MINMTRKLYYEDAYLKEAKGRVLEIRDNAILLDQTIFYPTGGGQPHDRGTINGVEVLDVY 
-------3333%%%%-------------------------iiii------%%%%------ KDEEGNVWHVVKEPEKFKVGDEVELKIDWDYRYKLMRIHTGLHLLEHVLNEVLGEGNWQL -1111-----------------------3333---------------------------- VGSGMSVEKGRYDIAYPENLNKYKEQIISLFNKYVDEGGEVKIWWEGDRRYTQIRDFEVI ------------------3333-------------------------------------- PCGGTHVKDIKEIGHIKKLKRSSIGRGKQRLEMWLE ------------------------iiii-------- >WERNER SYNDROME ATP-DEPEN; SWP:Q14191; PDB:2E1FA; QPVISAQEQETQIVLYGKLVEARQKHANKMDVPPAILATNKILVDMAKMRPTTVENVKRI --------------------------------3333----------------3333---2 DGVSEGKAAMLAPLLEVIKHFCQTNSVQTDLFSS 222------------------------------- >PEX; SWP:O66226; PDB:2E1NA; MDFEDIYRFFQDPPPHYLSKELAVCYVLAVLRHEDSYGTELIQHLETHWPNYRLSDTVLY -3333-3333--------------------------------------1111-------- TALKFLEDEQIISGYWKKVEGRGRPRRMYQLAQANDDRSRDLAQLWERYL ------------------2222---------------------------- >HOMEOBOX PROTEIN PRH; SWP:Q03014; PDB:2E1OA; GSSGSSGKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQVKTWFQNRRA --------------------------------3333------------------------ KWRRSGPSSG --3333---- >TK-SUBTILISIN; SWP:P58502; PDB:2E1PA; NTIRVIVSVDKAKFNPHEVLGIGGHIVYQFKLIPAVVVDVPANAVGKLKKMPGVEKVEFD ------------------1111------------------11113333--2222------ HQAVLLGKPSWLGGGSTQPAQTIPWGIERVKAPSVWSITDGSVSVIQVAVLDTGVDYDHP -----------------------3333----33333333---3333---------1111- DLAANIAWCVSTLRGKVSTKLRDCADQNGHGTHVIGTIAALNNDIGVVGVAPGVQIYSVR -3333------2222----3333---------------------------1111------ VLDARGSGSYSDIAIGIEQAILGPDGVADKDGDGIIAGDPDDDAAEVISMSLGGPADDSY --1111----------------1111--1111---2222--------------------- LYDMIIQAYNAGIVIVAASGNEGAPSPSYPAAYPEVIAVGAIDSNDNIASFSNRQPEVSA -----------------------------3333---------1111--3333-------- PGVDILSTYPDDSYETLMGTAMATPHVSGVVALIQAAYYQKYGKILPVGTFDDISKNTVR -------------------------------------------------1111------- GILHITADDLGPTGWDADYGYGVVRAALAVQAALG ---1111-----------!!!!------------- >ENZYME IIA; SWP:P23532; PDB:2E2AA; 
MNREEMTLLGFEIVAYAGDARSKLLEALKAAENGDFAKADSLVVEAGSCIAEAHSSQTGM ------------------------------1111-------------------------- LAREASGEELPYSVTMMHGQLHLMTTILLKDVIHHLIELYKRGA -------------------------------------------- >UBIQUITIN CONJUGATING ENZ; SWP:Q95044; PDB:2E2C; MTTSKERHSVSKRLQQELRTLLMSGDPGITAFPDGDNLFKWVATLDGPKDTVYESLKYKL -------------------------2222---------------------1111------ TLEFPSDYPYKPPVVKFTTPCWHPNVDQSGNICLDILKENWTASYDVRTILLSLQSLLGE ----1111--------------11111111---333311111111---------3333-- PNNASPLNAQAADMWSNQTEYKKVLHEKYKTAQSDK ------------3333-------------------- >Metalloproteinase inhibit; SWP:P16368; PDB:2E2DC; CSCSPVHPQQAFCNADIVIRAKAVNKKEVDSGNDIYGNPIKRIQYEIKQIKMFKGPDQDI ---------------------------------1111----------------------- EFIYTAPAAAVCGVSLDIGGKKEYLIAGKAEGNGNMHITLCDFIVPWDTLSATQKKSLNH -------3333--------------------2222---1111---3333----------- RYQMGCECKITRCPMIPCYISSPDECLWMDWVTEKNINGHQAKFFACIKRSDGSCAWYRG ----1111-------------1111---3333------3333------------------ >HEXOKINASE; SWP:Q96Y14; PDB:2E2OA; MMIIVGVDAGGTKTKAVAYDCEGNFIGEGSSGPGNYHNVGLTRAIENIKEAVKIAAKGEA ---------3333---------------------3333---------------------- DVVGMGVAGLDSKFDWENFTPLASLIAPKVIIQHDGVIALFAETLGEPGVVVIAGTGSVV --------------------3333-------------------iiii------------- EGYNGKEFLRVGGRGWLLSDDGSAYWVGRKALRKVLKMMDGLENKTILYNKVLKTINVKD -------------------2222----------------------3333----------- LDELVMWSYTSSCQIDLVASIAKAVDEAANEGDTVAMDILKQGAELLASQAVYLARKIGT ----------1111--------------1111---------------------------- NKVYLKGGMFRSNIYHKFFTLYLEKEGIISDLGKRSPEIGAVILAYKEVGCDIKKLISD ------3333-------------1111------------------------3333---- >DNA LIGASE 4; SWP:P49917; PDB:2E2WA; GSSGSSGKISNIFEDVEFCVMSGTDSQPKPDLENRIAEFGGYIVQNPGPDTYCVIAGSEN ----------1111-------------11113333-1111-------------------- IRVKNIILSNKHDVVKPAWLLECFKTKSFVPWQPRFMIHMCPSTKEHFAREYD ---------------3333-3333--------1111----1111--------- >PEROXIDASE; SWP:Q12575; PDB:2E3BA; 
SVTCPGGQSTSNSQCCVWFDVLDDLQTNFYQGSKCESPVRKILRIVFHDAIGFSPALTAA ---1111----3333------------1111----3333---------1111-----111 GQFGGGGADGSIIAHSNIELAFPANGGLTDTIEALRAVGINHGVSFGDLIQFATAVGMSN 1--------3333----11111111----------------------------------- CPGSPRLEFLTGRSNSSQPSPPSLIPGPGNTVTAILDRMGDAGFSPDEVVDLLAAHSLAS 2222----------------------1111------------------------3333-- QEGLNSAIFRSPLDSTPQVFDTQFYIETLLKGTTQPGPSLGFAEELSPFPGEFRMRSDAL ----1111-------1111--33333333-----------2222----2222-------- LARDSRTACRWQSMTSSNEVMGQRYRAAMAKMSVLGFDRNALTDCSDVIPSAVSNNAAPV ---1111----3333------------------22223333---3333------------ IPGGLTVDDIEVSCPSEPFPEIATASGPLPSLAPAP -iiii3333----1111------------------- >UTP--GLUCOSE-1-PHOSPHATE ; SWP:P0AEP3; PDB:2E3DA; NTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSS --------------3333-1111--1111------3333------------------111 KNSIENHFDTSFELEAMLERQLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGD 1----1111--33331111-----------1111-------------------------- EPVAVILPDVILDEYESDLSQDNLAEMIRRFDETGHSQIMVEPVADVTAYGVVDCKGVEL ------1111--1111-1111------------------------3333-----iiii-- APGESVPMVGVVEKPKADVAPSNLAIVGRYVLSADIWPLLAKTPPEIQLTDAIDMLIEKE 2222-----------3333-------------11113333----------------1111 TVEAYHMKGKSHDCGNKLGYMQAFVEYGIRHNTLGTEFKAWLEEEM ---------------------------------------------- >BETA-GLUCOSIDASE; SWP:Q25BW5; PDB:2E3ZA; AAKLPKSFVWGYATAAYQIEGSPDKDGREPSIWDTFCKAPGKIADGSSGDVATDSYNRWR ----3333------3333------iiii----------22221111----!!!!------ EDVQLLKSYGVKAYRFSLSWSRIIPKGGRSDPVNGAGIKHYRTLIEELVKEGITPFVTLY ------1111--------1111-11111111-----------------1111-------- HWDLPQALDDRYGGWLNKEEAIQDFTNYAKLCFESFGDLVQNWITFNEPWVISVMGYGNG ----3333----!!!!-------------------1111--------------------- IFAPGHVSNTEPWIVSHHIILAHAHAVKLYRDEFKEKQGGQIGITLDSHWLIPYDDTDAS ------------------------------------------------------------ KEATLRAMEFKLGRFANPIYKGEYPPRIKKILGDRLPEFTPEEIELVKGSSDFFGLNTYT -------------------------------!!!!-----------2222---------- 
THLVQDGGSDELAGFVKTGHTRADGTQLGTQSDMGWLQTYGPGFRWLLNYLWKAYDKPVY ---------3333--------1111--------3333--3333----------------- VTENGFPVKGENDLPVEQAVDDTDRQAYYRDYTEALLQAVTEDGADVRGYFGWSLLDNFE -------2222---3333-----------------------------------------! WAEGYKVRFGVTHVDYETQKRTPKKSAEFLSRWFKEHIEE !!!------------------------------------- >Amyloid beta A4 precursor; SWP:O00213; PDB:2E45A; DSFWNPNAFETDSDLPAGWMRVQDTSGTYYWHIPTGTTQWEPPGRASPSQ -%%%%------------------3333-------------------1111 >TRYPTOPHAN HALOGENASE; SWP:Q8KHZ8; PDB:2E4GA; MSGKIDKILIVGGGTAGWMAASYLGKALQGTADITLLQAPDIPTLGVGEATIPNLQTAFF ---------------------------2222--------------------1111----- DFLGIPEDEWMRECNASYKVAIKFINWRTAGEGTSEARELDGGPDHFYHSFGLLKYHEQI 1111-3333--1111------------------------%%%%-------------iiii PLSHYWFDRSYRGKTVEPFDYACYKEPVILDANRSPRRLDGSKVTNYAWHFDAHLVADFL -----------------3333---3333---------1111----------3333----- RRFATEKLGVRHVEDRVEHVQRDANGNIESVRTATGRVFDADLFVDCSGFRGLLINKAME ----------------------1111------1111------------3333-------- EPFLDMSDHLLNDSAVATQVPHDDDANGVEPFTSAIAMKSGWTWKIPMLGRFGTGYVYSS ----------------------3333---------------------2222-------33 RFATEDEAVREFCEMWHLDPETQPLNRIRFRVGRNRRAWVGNCVSIGTSSCFVEPLESTG 33----------------1111------------------------3333--------33 IYFVYAALYQLVKHFPDKSLNPVLTARFNREIETMFDDTRDFIQAHFYFSPRTDTPFWRA 33---------1111-1111--------------------------1111---------1 NKELRLADGMQEKIDMYRAGMAINAPASDDAQLYYGNFEEEFRNFWNNSNYYCVLAGLGL 111--------------------------33331111---1111--3333---------- VPDAPSPRLAHMPQATESVDEVFGAVKDRQRNLLETLPSLHEFLRQQH ------3333-------------------------------------- >METABOTROPIC GLUTAMATE RE; SWP:P31422; PDB:2E4UA; RREIKIEGDLVLGGLFPINEKGTGTEECGRINEDRGIQRLEAMLFAIDEINKDNYLLPGV ----------------------!!!!----------------------11111111---- KLGVHILDTCSRDTYALEQSLEFVRASLIPLLIAGVIGGSYSSVSIQVANLLRLFQIPQI --------%%%%-------33333333-----------------------3333------ SYASTSAKLSDKSRYDYFARTVPPDFYQAKAMAEILRFFNWTYVSTVASEGDYGETGIEA 
-----3333--------------------------------------------------- FEQEARLRNICIATAEKVGRSNIRKSYDSVIRELLQKPNARVVVLFMRSDDSRELIAAAN ----3333----------------------------3333-------------------1 RVNASFTWVASDGWGAQESIVKGSEHVAYGAITLELASHPVRQFDRYFQSLNPYNNHRNP 111-------3333--333322223333------------3333---111133333333- WFRDFWEQKFQCSLQQVCDKHLAIDSSNYEQESKIMFVVNAVYAMAHALHKMQRTLCPQT ------------------1111--1111---1111---------------------1111 TKLCDAMKILDGKKLYKEYLLKIQFTAPFNPNKGADSIVKFDTFGDGMGRYNVFNLQQTG ---3333-----------1111-------------------1111--------------- GKYSYLKVGHWAETLSLDVDSIHWSRNSVPTSQCSDPCAPNEMKNMQPGDVCCWICIPCE -----------------3333--------------------------------------1 PYEYLVDEFTCMDCGPGQWPTADLSGCYNLPE 111---1111----2222--1111-------- >PROTEIN SET; SWP:SET_HUMAN; PDB:2E50A; MSAQAAKVSKKELNSDETSEKEQQEAIEHIDEVQNEIDRLNEQASEEILKVEQKYNKLRQ ------------------------------------------------------------ PFFQKRSELIAKIPNFWVTTFVNHPQVSALLGEEDEEAMHYLTRVEVTEFEDIKSGYRID ------------2222----------3333--------1111------------------ FYFDENPYFENKVLSKEFHSSKSTEIKWKSGKDMFFTWFTADELGEVIKDDIWPNPLQYY ----------------------------------3333-------------33333333- LV -- >WERNER SYNDROME ATP-DEPEN; SWP:Q3TZ13; PDB:2E6MA; NLPFLEFPGSIVYSYEASDCSFLSEDISMRLSDGDVVGFDMEWPPPGKRSRVAVIQLCVS ---------------------------11112222------------------------1 ESKCYLFHISSMSVFPQGLKMLLENKSIKKAGVGIEGDQWKLLRDFDVKLESFVELTDVA 111-----1111------------3333-----3333----------------------- NEKLKCAETWSLNGLVKHVLGKQLLKDKSIRCSNWSNFPLTEDQKLYAATDAYAGLIIYQ -1111---------------------3333---1111----------------------- KLGNLG ------ >5-methyltetrahydrofolate ; SWP:Q46389; PDB:2E7FA; MLIIGERINGMFGDIKRAIQERDPAPVQEWARRQEEGGARALDLNVGPAVQDKVSAMEWL --------3333----------------------3333---------------------- VEVTQEVSNLTLCLDSTNIKAIEAGLKKCKNRAMINSTNAEREKVEKLFPLAVEHGAALI ---3333-----------------3333------------3333---------------- GLTMNKTGIPKDSDTRLAFAMELVAAADEFGLPMEDLYIDPLILPANVAQDHAPEVLKTL ----3333------------------------3333--------11111111-------- 
QQIKMLADPAPKTVLGLSNVSQNCQNRPLINRTFLAMAMACGLDAAIADACDEALIETAA --1111---------3333-2222--------------1111------1111-------- TAEILLNQTVYCDSFVKMFKTR ---1111--------------- >RAB GUANINE NUCLEOTIDE EX; SWP:P17065; PDB:2E7SA; LEEQLNKSLKTIASQKAAIENYNQLKEDYNTLKRELSDRDDEVKRLREDIAKENELRTKA -3333---------------------------------------1111------------ EEEADKLNKEVEDLTASLFDEANNLVADAREKYAIEILNKRLTEQLREKDLL ------1111------------------------------------1111-- >ACETYLENE HYDRATASE AHY; SWP:Q71EW5; PDB:2E7ZA; KKHVVCQSCDINCVVEAEVKADGKIQTKSISEPHPTTPPNSICMKSVNADTIRTHKDRVL --------3333-------1111----------------------------1111----- YPLKNVGSKRGEQRWERISWDQALDEIAEKLKKIIAKYGPESLGVSQTEINQQSEYGTLR --------2222--------------------------3333-----3333---iiii-- RFMNLLGSPNWTSAMYMCIGNTAGVHRVTHGSYSFASFADSNCLLFIGKNLSNHNWVSQF -------------------------------------1111------------------- NDLKAALKRGCKLIVLDPRRTKVAEMADIWLPLRYGTDAALFLGMINVIINEQLYDKEFV ------1111----------3333---------2222------------1111------- ENWCVGFEELKERVQEYPLDKVAEITGCDAGEIRKAAVMFATESPASIPWAVSTDMQKNS -------------1111----------------------------------1111-1111 CSAIRAQCILRAIVGSFVNGAEILGAPHSDLVPISKIQMHEALPEEKKKLQLGTETYPFL --------------1111---------1111-3333--3333--------2222---111 TYTGMSALEEPSERVYGVKYFHNMGAFMANPTALFTAMATEKPYPVKAFFALASNALMGY 133331111---------------------------------------------3333-- ANQQNALKGLMNQDLVVCYDQFMTPTAQLADYVLPGDHWLERPVVQPNWEGIPFGNTSQQ ---------1111----------3333----------1111------------------- VVEPAGEAKDEYYFIRELAVRMGLEEHFPWKDRLELINYRISPTGMEWEEYQKQYTYMSK ----!!!!-----------11113333-------------3333--33333333------ LPDYFGPEGVGVATPSGKVELYSSVFEKLGYDPLPYYHEPLQTEISDPELAKEYPLILFA -------------1111-------------------------3333-3333--------- GLREDSNFQSCYHQPGILRDAEPDPVALLHPKTAQSLGLPSGEWIWVETTHGRLKLLLKH ---1111!!!!----3333---------------1111-2222-----3333-------- DGAQPEGTIRIPHGRWCPEQEGGPETGFSGAMLHNDAMVLSDDDWNLDPEQGLPNLRGGI 33332222--------3333--3333--------3333----3333-------------- LAKAYKC ------- 
>ENDO-BETA-N-ACETYLGLUCOSA; SWP:P36911; PDB:2EBN; TTKANIKLFSFTEVNDTNPLNNLNFTLKNSGKPLVDMVVLFSANINYDAANDKVFVSNNP ------------3333-----------3333-----------------1111-------- NVQHLLTNRAKYLKPLQDKGIKVILSILGNHDRSGIANLSTARAKAFAQELKNTCDLYNL -------3333-----1111-----------------------------------1111- DGVFFDDEYSAYQTPPPSGFVTPSNNAAARLAYETKQAMPNKLVTVYVYSRTSSFPTAVD ----------------2222------------------1111------!!!!------%% GVNAGSYVDYAIHDYGGSYDLATNYPGLAKSGMVMSSQEFNQGRYATAQALRNIVTKGYG %%3333-------2222---111122223333---------------------------- GHMIFAMDPNRSNFTSGQLPALKLIAKELYGDELVYSNTPYSKDW -------1111---------------------------------- >EBOLA VIRUS ENVELOPE GLYC; SWP:Q05320; PDB:2EBOA; GLRQLANETTQALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCAIEPHDWT -------------------------------------1111-3333----1111------ KNITDKIDQIIHDF -------------- >136AA LONG HYPOTHETICAL T; SWP:Q974H8; PDB:2EC2A; MEYKSTRHAKYLCNYHFVWIPKYRRKVLTGEVAEYTKEVLRTIAEELGCEVLALEVMPDH -----1111----------------------3333------------------------- IHLFVNCPPRYAPSYLANYFKGKSARLILKKFQELKKSTNGKLWTRSYFVSTSGNVSSET -------1111-----------------1111---------------------------- IKKYIEEQWA ----1111-- >ZINC FINGERS AND HOMEOBOX; SWP:Q9UKY1; PDB:2ECBA; GSSGSSGPDFTPQKFKEKTAEQLRVLQASFLNSSVLTDEELNRLRAQTKLTRREIDAWFT ------------------3333-------3333-----3333--------3333------ EKKKSKALKEEKMEIDESNAGSSSGPSSG --3333----------------------- >HOMEOBOX AND LEUCINE ZIPP; SWP:Q8IX15; PDB:2ECCA; GSSGSSGKRKTKEQLAILKSFFLQCQWARREDYQKLEQITGLPRPEIIQWFGDTRYALKH ----------------------------1111----------3333-------------- GQLKWFRDNASGPSSG ---------------- ------------------------------------------------- >PROTEIN (EUKARYOTIC TRANS; SWP:IF5A_METJA; PDB:2EIFA; MVIIMPGTKQVNVGSLKVGQYVMIDGVPCEIVDISVSKPGKHGGAKARVVGIGIFEKVKK -----------3333-2222---iiii--------------------------------- EFVAPTSSKVEVPIIDRRKGQVLAIMGDMVQIMDLQTYETLELPIPEGIEGLEPGGEVEY ----1111-----------------!!!!-------------------22222222---- IEAVGQYKITRVI --iiii------- >ENDONUCLEASE V; SWP:P04418; 
PDB:2END; TRINLTLVSELADQHLMAEYRELPRVFGAVRKHVANGKRVRDFKISPTFILGAGHVTFFY ------1111-----------------------1111-3333---------22223333- DKLEFLRKRQIELIAECLKRGFNIKDTTVQDISDIPQEFRGDYIPHEASIAISQARLDEK -----------------1111----------33333333--------------------- IAQRPTWYKYYGKAIYA ---3333--iiii---- >ENDOGLUCANASE V; SWP:P43316; PDB:2ENG; ADGRSTRYWDCCKPSCGWAKKAPVNQPVFSCNANFQRITDFDAKSGCEPGGVAYSCADQT -------------1111--------------1111----1111-1111-------1111- PWAVNDDFALGFAATSIAGSNEAGWCCACYELTFTSGPVAGKKMVVQSTSTGSNHFDLNI ----1111--------222233332222--------1111-------------------2 PGGGVGIFDGCTPQFGGLPGQRYGGISSRNECDRFPDALKPGCYWRFDWFKNADNPSFSF 222-------3333-------------333311113333----33331111--------- RQVQCPAELVARTGCRRNDDGNFPAV -----3333-------1111------ >HORSE MILK LYSOZYME; SWP:P11376; PDB:2EQL; KVFSKCELAHKLKAQEMDGFGGYSLANWVCMAEYESNFNTRAFNGKNANGSSDYGLFQLN -----------3333-1111---3333--------%%%%---------------1111-3 NKWWCKDNKRSSSNACNIMCSKLLDENIDDDISCAKRVVRDPKGMSAWKAWVKHCKDKDL 333----------1111-3333--------------33333333----3333--2222-- SEYLASCNL -1111---- >REGULATORY PROTEIN LEU3; SWP:P08638; PDB:2ER8A; KRKFACVECRQQKSKCDAHERAPEPCTKCAKKNVPCILKRDFRRTYKRARNEAIEKRFKE -----------------------------1111-----1111------------------ LTRTLTNL -------- >ODORANT BINDING PROTEIN; SWP:Q8T6S0; PDB:2ERBA; TPRRDAEYPPPELLEALKPLHDICLGKTGVTEEAIKKFSDEEIHEDEKLKCYMNCLFHEA ----1111-------------------------------------------------111 KVVDDNGDVHLEKLHDSLPSSMHDIAMHMGKRCLYPEGETLCDKAFWLHKCWKQSDPKHY 1--1111-------11113333-------1111-----------------------1111 FLV --- >THROMBOSPONDIN-1; SWP:P07996; PDB:2ERFA; GGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRT -------------3333-----------1111------3333------------------ EKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVE ------------------------1111---------------------iiii------- EALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGV ----------------!!!!-----------------3333----3333----------- NDNFQGVLQNVRFVFGTTPEDILRNKGCS 
----------------------------- >CIRCULIN B; SWP:P56879; PDB:2ERIA; CGESCVFIPCISTLLGCSCKNKVCYRNGVIP ------------3333---iiii-------- >MATING PHEROMONE ER-1; SWP:P10774; PDB:2ERL; DACEQAAIQCVESACESLCTEGEDRTGCYMYIYSNCPPYV ------11113333-------------------------- >VASCULAR APOPTOSIS-INDUCI; SWP:Q9DGB9; PDB:2EROA; SNLTPEQQRYLNAKKYVKLFLVADYIMYLKYGRNLTAVRTRMYDIVNVITPIYHRMNIHV ------------------------------%%%%----------------1111------ ALVGLEIWSNTDKIIVQSSADVTLDLFAKWRATDLLSRKSHDNAQLLTGINFNGPTAGLG ----------------------------------1111---------------------- YLGGICNTMYSAGIVQDHSKIHHLVAIAMAHEMGHNLGMDHDKDTCTCGTRPCVMAGALS 2222--------------------------------------1111------1111---- CEASFLFSDCSQKDHREFLIKNMPQCILKKPLKTDVVSPAVCGNYFVEVGEECDCGSPRT ------------------------1111---1111------------2222-----3333 CRDPCCDATTCKLRQGAQCAEGLCCDQCRFKGAGTECRAAKDECDMADVCTGRSAECTDR --1111-------2222----1111iiii--2222------1111--------------- FQRNGQPCKNNNGYCYNGKCPIMADQCIALFGPGATVSQDACFQFNREGNHYGYCRKEQN --------%%%%---iiii------------2222---3333-1111------------- TKIACEPQDVKCGRLYCFPNSPENKNPCNIYYSPNDEDKGMVLPGTKCADRKACSNGQCV -----3333-----------1111--------3333-2222-2222-------------- DVTTPY 1111-- >ATAXIN-2-BINDING PROTEIN ; SWP:Q9NWB1; PDB:2ERRA; NTENKSQPKRLHVSNIPFRFRDPDLRQMFGQFGKILDVEIIFNERGSKGFGFVTFENSAD ----------------1111--------3333----------3333-------------- ADRAREKLHGTVVEGRKIEVNNATARVM ------------%%%%------------ ------------------------------------------------------------ ------ >SERINE PROTEASE INHIBITOR; SWP:Q95P16; PDB:2ERWA; NPCACFRNYVPVCGSDGKTYGNPCMLNCAAQTKVPGLKLVHEGRCQRSNVEQF 1111---------1111----------------2222---------2222--- >GTP-BINDING PROTEIN DI-RA; SWP:Q96HU8; PDB:2ERXA; NDYRVAVFGAGGVGKSSLVLRFVKGTFRESYIPTVEDTYRQVISCDKSICTLQITDTTGS ---------2222----------------------------------------------- HQFPAMQRLSISKGHAFILVYSITSRQSLEELKPIYEQICEIKGSIPIMLVGNKCDESPS ---------------------1111----------------------------3333111 REVQSSEAEALARTWKCAFMETSAKLNHNVKELFQELLNLEKRRTVSL 
1--3333----------------1111----------3333------- >RAS-RELATED PROTEIN R-RAS; SWP:P62070; PDB:2ERYA; SMQEKYRLVVVGGGGVGKSALTIQFIQSYFVTDYDPTIEDSYTKQCVIDDRAARLDILDT ------------%%%%------------------1111---------iiii--------- AFGAMREQYMRTGEGFLLVFSVTDRGSFEEIYKFQRQILRVKDRDEFPMILIGNKADLDH --------------------1111------------------------------3333-- QRQVTQEEGQQLARQLKVTYMEASAKIRMNVDQAFHELVRVIRKFQE ------------------------1111---------------1111 >REGULATOR OF G-PROTEIN SI; SWP:P49758; PDB:2ES0A; MPSQQRVKRWGFSFDEILKDQVGRDQFLRFLESEFSSENLRFWLAVQDLKKQPLQDVAKR -------3333-3333---------------1111-------------11113333---- VEEIWQEFLAPGAPSAINLDSHSYEITSQNVKDGGRYTFEDAQEHIYKLMKSDSYARFLR ---------2222-------------------iiii1111-------------------- SNAYQDLLL -3333---- >COLD SHOCK PROTEIN CSPB; SWP:P32081; PDB:2ES2A; MLEGKVKWFNSEKGFGFIEVEGQDDVFVHFSAIQGEGFKTLEEGQAVSFEIVEGNRGPQA -------------------2222-----3333---------2222--------------- ANVTKEA ------- >Lipase chaperone; SWP:Q05490; PDB:2ES4D; MPLPAALPGALAGSHAPRLPLAAGGRLARTRAVREFFDYLTAQGELTPAALDALVRREIA -------3333----------1111----------------1111--------------- AQLDGSPAQAEALGVWRRYRAYFDALAVLGDKLDPAAMQLALDQRAALADRTLGEWAEPF -----3333--------------3333-------------------------!!!!-333 FGDEQRRQRHDLERIRIANDTLSQKAARLAALDAQLTPDERAQQAALHAQQDAVTKIADL 3-----------------------------------3333-------------------- QKAGATPDQMRAQIAQTLGPEAAARAAQMQQDDEAWQTRYQAYAAERDRIAAQGLAPQDR 1111---------------------------------------------1111------- DARIAQLRQQTFTAPGEAIRAASLDRGAG -------------2222------1111-- >putative thiol-disulfide ; SWP:NA; PDB:2ES7A; FSALWQRLLTRGWQPVEASVGDGVILLSSDPRRSDNPVIAELLREFPQFDWQVAVADLEQ --------1111---------------------------------------------333 SEAIGDRFNVRRFPATLVFTDGALSGIHPWAELLTLRSIVD 3---3333--------------------------------- >PUTATIVE CYTOPLASMIC PROT; SWP:Q8ZRJ2; PDB:2ES9A; TAIEKALDFIGGNTSASVPHSDESTAKGILKYLHDLGVPVSPEVVVARGEQEGWNPEFTK ------------3333-----------------1111----------------------- KVAGWAEKVASGNRILIKNPEYFSTYQEQLKELVLEH 
------------------------------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q8NEJ0; PDB:2ESBA; SGLSQITKSLYISNGVAANNKLMLSSNQITMVINVSVEVVNTLYEDIQYMQVPVADSPNS -------------3333-------1111---------------2222---------1111 RLCDFFDPIADHIHSVEMKQGRTLLHCAAGVSRSAALCLAYLMKYHAMSLLDAHTWTKSC 3333------------1111---------------------------------------- RPIIRPNSGFWEQLIHYEFQLFGKNTVHMVSSPVGMIPDIYE 1111---------------------------1111--3333- >CONSERVED HYPOTHETICAL PR; SWP:Q9X035; PDB:2ESHA; RGGRGFRGWWLASTILLLVAEKPSHGYELAERLAEFGIEIPGIGHMGNIYRVLADLEESG -------------------------3333----------2222----------------- FLSTEWDTTVSPPRKIYRITPQGKLYLREILRSLEDMKRRIETLEERIKRVLQE ------------------------------------------------------ >UBIQUITIN-CONJUGATING ENZ; SWP:P62837; PDB:2ESKA; AMKRIHKELNDLARDPPAQCSAGPVGDDMFHWQATIMGPNDSPYQGGVFFLTIHFPTDYP ---------------------------1111-------2222-2222--------1111- FKPPKVAFTTRIYHPNINSNGSICLDILRSQWSPALTISKVLLSICSLLCDPNPDDPLVP -------------11111111---333311111111----------------1111---- EIARIYKTDREKYNRIAREWTQKYAM -------------------------- >PEPTIDYL-PROLYL CIS-TRANS; SWP:P45877; PDB:2ESLA; RGPSVTAKVFFDVRIGDKDVGRIVIGLFGKVVPKTVENFVALATGEKGYGYKGSKFHRVI --------------!!!!--------------------------1111--2222-----2 KDFMIQGGDITTGDGTGGVSIYGETFPDENFKLKHYGIGWVSMANAGPDTNGSQFFITLT 222----------------1111------------------------------------- KPTWLDGKHVVFGKVIDGMTVVHSIELQATDGHDRPLTNCSIINSGKIDVKTPFVVEIAD -3333-------------------1111--1111-----------------------222 W 2 >PROBABLE TRANSCRIPTIONAL ; SWP:Q9I641; PDB:2ESNA; PLLRRLDLNLLLVFDALYRHRNVGTAASELAISASAFSHALGRLRQGLDDELFLRQGNRM -1111------------------------------------------------------- QPTQRAEHLAAAVAAALRALGEGLEEWRPFVPGQSQRTFVFAATDYTAFALLPPLMNRLQ -----------------------3333---3333-------------------------- HSAPGVRLRLVNAERKLSVEALASGRIDFALGYDEEHERLPEGIQAHDWFADRYVVVARR --1111----------------------------%%%%---------------------- DHPRLAGAPTLEGYLAERHAVVTPWNEDSGVIDRLLARSGLRREVAVQLPTVLAALFLAG 
-------------1111-------------------1111------------------11 STDFLLTAPRHAARALAEAAGLALYPAPFDIPPYVLRLYSHVQGRDAHAWMIGQLKGLD 11------3333-----1111-------------------------------------- >METHYLTRANSFERASE; SWP:Q5XAY9; PDB:2ESRA; VRGAIFNIGPYFNGGRVLDLFAGSGGLAIEAVSRGSAAVLVEKNRKAQAIIQDNIITKAE ---------------------!!!!---------------------------------11 NRFTLLKEAERAIDCLTGRFDLVFLDPPYAKETIVATIEALAAKNLLSEQVVVCETDKTV 11---------3333--------------------------1111--1111-----3333 LLPKEIATLGIWKEKIYGISKVTVYVNEGHHHHHH -----!!!!-----------------1111----- >ACYL-ACP THIOESTERASE; SWP:Q8A611; PDB:2ESSA; SEENKIGTYQFVAEPFHVDFNGRLTGVLGNHLLNCAGFHASDRGFGIATLNEDNYTWVLS 3333---------1111-1111------------------1111---------------- RLAIELDEPYQYEKFSVQTWVENVYRLFTDRNFAVIDKDGKKIGYARSVWAINLNTRKPA ---------2222-----------1111--------1111-------------------- LHGGSIVDYICDEPCPIEKPSRIKVTSNQPVATLTAKYSDIDINGHVNSIRYIEHILDLF ----3333----------------------------3333-1111-----------3333 PIELYQTKRIRRFEAYVAESYFGDELSFFCDEVSENEFHVEVKKNGSEVVCRSKVIFE 3333----------------2222---------1111--------------------- ------ >BETA-2-MICROGLOBULIN; SWP:NA; PDB:2ESVE; GVTQFPSHSVIEKGQTVTLRCDPISGHDNLYWYRRVMGKEIKFLLHFVKESKQDESGMPN -----------2222--------2222--------------------!!!!---1111-3 NRFLAERTGGTYSTLKVQPAELEDSGVYFCASSQDRDTQYFGPGTRLTVLEDLKNVFPPE 333-----------------3333---------------------------3333----- VAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKEQPALN ------------------------------------iiii--2222---------1111- DSRYALSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWGRAD ------------3333--1111-----------3333----------------------- >LUNG SURFACTANT PROTEIN C; SWP:NA; PDB:2ESYA; SPPDYSAAPRGRFGIPFFPVHLKRLLILLLL ----------------3333----------- >OXALATE OXIDASE 1; SWP:P45850; PDB:2ET1A; TDPDPLQDFCVADLDGKAVSVNGHTCKPMSEAGDDFLFSSKLTKAGNTSTPNGSAVTELD --------------1111---------3333---1111-1111------3333------3 VAEWPGTNTLGVSMNRVDFAPGGTNPPHIHPRATEIGMVMKGELLVGILGSLDSGNKLYS 
3333333------------2222---------------------------1111------ RVVRAGETFVIPRGLMHFQFNVGKTEAYMVVSFNSQNPGIVFVPLTLFGSDPPIPTPVLT ---2222----2222---------------------------------------3333-- KALRVEAGVVELLKSKFAGGS -----3333-------1111- >(3R)-HYDROXYACYL-COA DEHY; SWP:P22414; PDB:2ET6A; SPVDFKDKVVIITGAGGGLGKYYSLEFAKLGAKVVVNDLKAADVVVDEIVKNGGVAVADY ----2222---------------------------------------------------- NNVLDGDKIVETAVKNFGTVHVIINNAGILRDASMKKMTEKDYKLVIDVHLNGAFAVTKA -3333----------------------------3333----------------------- AWPYFQKQKYGRIVNTSSPAGLYGNFGQANYASAKSALLGFAETLAKEGAKYNIKANAIA -----------------3333---2222--------------------3333-------- PLARSRMTESIMPPPMLEKLGPEKVAPLVLYLSSAENELTGQFFEVAAGFYAQIRWERSG ----3333--------11113333---------3333---------iiii---------- GVLFKPDQSFTAEVVAKRFSEILDYDDSRKPEYLKNQYPFMLNDYATLTNEARKLPANDA ------33333333----3333----11111111---------3333---1111-----2 SGAPTVSLKDKVVLITGAGAGLGKEYAKWFAKYGAKVVVNDFKDATKTVDEIKAAGGEAW 222---------------------------1111------------------1111---- PDQHDVAKDSEAIIKNVIDKYGTIDILVNNAGILRDRSFAKMSKQEWDSVQQVHLIGTFN ----3333-----------------------------3333------------------- LSRLAWPYFVEKQFGRIINITSTSGIYGNFGQANYSSSKAGILGLSKTMAIEGAKNNIKV ---------------------3333---2222--------------------3333---- NIVAPHAETAMKNLYHADQVAPLLVYLGTDDVPVTGETFEIGGGWIGNTRWQRAKGAVSH ---------------3333---------1111---------iiii--------------- DEHTTVEFIKEHLNEITDFTTDTENPKSTTESSMAILSAVGG ----3333-----3333------------------------- >Transient receptor potent; SWP:Q9WUD2; PDB:2ETBA; FDRDRLFSVVSRGVPEELTGLLEYLRWNSKYLTDSAYTEGSTGKTCLMKAVLNLQDGVNA -------------33332222---------11111111----------------iiii11 CIMPLLQIDKDSGNPKPLVNAQCTDEFYQGHSALHIAIEKRSLQCVKLLVENGADVHLRA 11--------------3333----3333---------1111-------------1111-- CGRFFQKHQGTCFYFGELPLSLAACTKQWDVVTYLLENPHQPASLEATDSLGNTVLHALV -3333------------------1111----------------1111-1111-------- MIADNSPENSALVIHMYDGLLQMGARLCPTVQLEEISNHQGLTPLKLAAKEGKIEIFRHI 
---------------------------11111111--1111------------------- LQREFSGAAAHH ------------ >LEMA PROTEIN; SWP:Q9X056; PDB:2ETDA; HLVLEQEVQEYSQIQNQLQRRADLIPNLVETVGYAAHEEILEEIANARALIGATPQESAQ ----------------------------------1111-----------1111------- ADAELSSALSRLLAIAENYPNLADANFRQLDELAGTENRIAVARRDYNEAVYNTAIGFEE ------------------3333------------------------------3333---- QYFEAP ------ >SULFOTRANSFERASE 1C1; SWP:O00338; PDB:2ETGA; KLKEVEGTLLQPATVDNWSQIQSFEAKPDDLLICTYPKAGTTWIQEIVDMIEQNGHPFIE ----iiii------------1111--1111-----2222------------------111 WARPPQPSGVEKAKAMPSPRILKTHLSTQLLPPSFWENNCKFLYVARNAKDCMVSYYHFQ 1-----------1111----------3333-33331111--------------------- RMNHMLPDPGTWEEYFETFINGKVVWGSWFDHVKGWWEMKDRHQILFLFYEDIKRDPKHE --1111-----------------2222-----------1111------------------ IRKVMQFMGKKVDETVLDKIVQETSFEKMKENFMRKGTVGDWKNHFTVAQNERFDEIYRR -----1111------------------------------3333----------------- KMEGTSINFSMEL -2222-------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9WZS3; PDB:2ETHA; HDALEIFKTLFSLVRFSSYLPSNEEISDKTTELYAFLYVALFGPKKKEIAEFLSTTKSNV -3333---------3333------3333-------------------------------- TNVVDSLEKRGLVVREDPVDRRTYRVVLTEKGKEIFGEILSNFESLLKSVLEKFSEEDFK -------1111---------------------------------------1111------ VVSEGFNRVEALSRE ---------1111-- >RIBONUCLEASE HII; SWP:Q9X017; PDB:2ETJA; GIDELYKKEFGIVAGVDEAGRGCLAGPVVAAAVVLEKEIEAKRERLLDEIEKAAVGIGIA 3333---------------1111------------------------------------- SPEEIDLYNIFNATKLANRALENLSVKPSFVLVDGKGIELSVPGTCLVKGDQKSKLIGAA ------------------------------------------------3333-------- SIVAKVFRDRLSEFHRYPQFSFHKHKGYATKEHLNEIRKNGVLPIHRLSFEPVLELLTDD ------------333311113333iiii--------------111111113333------ LLREFFEKGLISENRFERILNLLGAR -----1111----------------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P09936; PDB:2ETLA; MQLKPMEINPEMLNKVLSRLGVAGQWRFVDVLGLEEESLGSVPAPACALLLLFPLTAQHE --------------------------------------3333------------------ NFRKKQIEELKGQEVSPKVYFMKQTIGNSCGTIGLIHAVANNQDKLGFEDGSVLKQFLSE 
-----------11113333-------------------1111------2222------11 TEKMSPEDRAKCFEKNEAIQAAHDAVAQEGQCRVDDKVNFHFILFNNVDGHLYELDGRMP 11-----------------------3333------------------%%%%----1111- FPVNHGASSEDTLLKDAAKVCREFTEREQGEVRFSAVALCKAA --------1111------------------------------- >anti-cleavage anti-GreA t; SWP:Q8VQD7; PDB:2ETNA; REVKLTKAGYERLMKQLEQERERLQEATKILQELMESSDDYDDSGLEAAKQEKARIEARI ------------------------------------------------------------ DSLEDVLSRAVILEEGTGEVIGLGSVVELEDPATGERLSVQVVSPAEASVLENPMKISDA -------------------------------------------3333----------111 SPMGKALLGHRVGDVLSLDTPKGKKEFRVVAIHG 1---1111--2222-------------------- >RHO-ASSOCIATED PROTEIN KI; SWP:Q13464; PDB:2ETRA; SFETRFEKMDNLLRDPKSEVNSDCLLDGLDALVYDLDFPALRKNKNIDNFLSRYKDTINK --------------1111-------------------3333------------------- IRDLRMKAEDYEVVKVIGRGAFGEVQLVRHKSTRKVYAMKLLSKFEMIKRSDSAFFWEER ------1111-------------------------------------------------- DIMAFANSPWVVQLFYAFQDDRYLYMVMEYMPGGDLVNLMSNYDVPEKWARFYTAEVVLA -------1111-------------------1111----1111------------------ LDAIHSMGFIHRDVKPDNMLLDKSGHLKLADFGTCMKMNKEGMVRCDTAVGTPDYISPEV --------------3333---1111------1111---1111---------3333----- LKSQGGDGYYGRECDWWSVGVFLYEMLVGDTPFYADSLVGTYSKIMNHKNSLTFPDDNDI ---3333---3333----------------------3333------3333---------- SKEAKNLICAFLTDREVRLGRNGVEEIKRHLFFKNDQWAWETLRDTVAPVVPDLSSDIDT -------------3333-----333311111111----1111-------------11111 SNFDDLEEDKGEEETFPIPKAFVGNQLPFVGFTYYSNRRY 111--------------------1111-2222-------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2ETSA; HNLVESLIYEVNNQQNFENVKSQQQDHDFYQTVKPYTEHIDSILNEIKLHREFIIEVPYN --------------------3333---3333---------------------3333---- SRKFELLIANIEQLSVECHFKRTSRKLFIEKLKSVQYDLQNILDGVT -------------------11113333-------------------- >SORTING NEXIN-22; SWP:Q96L94; PDB:2ETTA; GHHHHHHLELEVHIPSVGPEAEGPRQSPEKSHMVFRVEVLCSGRRHTVPRRYSEFHALHK ----------------------------------------iiii------3333------ RIKKLYKVPDFPSKRLPNWRTRGLEQRRQGLEAYIQGILYLNQEVPKELLEFLRLRHFPT 
3333--------------3333--------------------------------2222-- DPKASNWG -------- >iron(III) ABC transporter; SWP:Q9WY32; PDB:2ETVA; HDLLGREVEPNVNRIVAVGPGALRLIAYLATDVVGVEDFELRPYGRPYILAYPELLPSVG -1111-------------2222--------------3333-----3333--3333----- PGGPGLPDLESLITLQPDVVFITYVDRTADIQETGIPVVVLSYGNLGTFEDEDLFRSIEL ------------------------------------------------------------ AGILGREERAHEVVDFIRAQEDLVTRSEGVESPTVYVGGIGYGAHGIDSTEAYPPFVVLH --------------------------2222----------------------33331111 ARNVVDELGEGHFIDPELLVWNPEYIFIDENGLSLVLDDYSHREFYESLSAVRGVYGILP --1111----------------------3333----------------3333-------- YNYYTTNIGTALADAYFIGKVLYPERFTDIDPEEADEIYEFLLGRVYGEAEQFGGFGIDL ----------------------333311113333-----------3333----------1 PSGRILRGTW 111--2222- >MEDIATOR OF DNA DAMAGE CH; SWP:Q14676; PDB:2ETXA; APKVLFTGVVDARGERAVLALGGSLAGSAAEASHLVTDRIRRTVKFLCALGRGIPILSLD ------------------1111-----3333----------------------------- WLHQSRKAGFFLPPDEYVVTDPEQEKNFGFSLQDALSRARERRLLEGYEIYVTPGVQPPP -------------1111--------1111-------------1111------1111---- PQMGEIISCCGGTYLPSMPRSYKPQRVVITCPQDFPHCSIPLRVGLPLLSPEFLLTGVLK -------1111-----------2222----33331111---------------------- QEAKPEAFVLSPLE ----3333--1111 >ADENYLATE KINASE; SWP:P16304; PDB:2EU8A; MNLVLMGLPGAGKGTQGERIVEDYGIPHISTGDMFRAAMKEETPLGLEAKSYIDKGELVP -------2222-----------------------------------------1111---- DEVTIGIVKERLGKDDCERGFLLDGFPRTVAQAEALEEILEEYGKPIDYVINIEVDKDVL -------------1111--------------------------------------3333- MERLTGRRICSVCGTTYHLVFNPPKTPGICDKDGGELYQRADDNEETVSKRLEVNMKQTQ ---1111------------------2222----------1111-----------1111-- PLLDFYSEKGYLANVNGQRDIQDVYADVKDLLGGLK -------------------------------2222- >DUAL SPECIFICITY PROTEIN ; SWP:P49761; PDB:2EU9A; VEDDKEGHLVCRIGDWLQERYEIVGNLGEGTFGKVVECLDHARGKSQVALKIIRNVGKYR ---1111---------%%%%---------3333---------%%%%---------3333- EAARLEINVLKKIKEKDKENKFLCVLMSDWFNFHGHMCIAFELLGKNTFEFLKENNFQPY ----------------1111------------iiii------------------%%%%-- 
PLPHVRHMAYQLCHALRFLHENQLTHTDLKPENILFVNSEFETLYNEHSCEEKSVKNTSI -----------------------------3333--------------------------- RVADFGSATFDHEHHTTIVATRHYRPPEVILELGWAQPCDVWSIGCILFEYYRGFTLFQT ----1111-1111-------3333-----------3333--------------------- HENREHLVMMEKILGPIPSHMIHRTRKQKYFYKGGLVWDENSSDGRYVKENCKPLKSYML -----------------3333-----3333-iiii---1111-----------3333--- QDSLEHVQLFDLMRRMLEFDPAQRITLAEALLHPFFAGLTPEERS -------------------3333--3333---3333---3333-- >MENAQUINONE-SPECIFIC ISOC; SWP:P38051; PDB:2EUAA; QSLTTALENLLRHLSQEIPATPGIRVIDIPFPLKDAFDALSWLASQQTYPQFYWQQRNGD 3333--------1111--------------------------3333---------3333- EEAVVLGAITRFTSLDQAQRFLRQHPEHADLRIWGLNAFDPSQGNLLLPRLEWRRCGGKA ------------------------3333-----------3333------------!!!!- TLRLTLFSESSLQHDAIQAKEFIATLVSIKPLPGLHLTTTREQHWPDKTGWTQLIELATK ----------------------1111---------------------------------- TIAEGELDKVVLARATDLHFASPVNAAAMMAASRRLNLNCYHFYMAFDGENAFLGSSPER ------------------------------------------------------------ LWRRRDKALRTEALAGTVANNPDDKQAQQLGEWLMADDKNQRENMLVVEDICQRLQADTQ ----!!!!-------------------------1111-----------------3333-- TLDVLPPQVLRLRKVQHLRRCIWTSLNKADDVICLHQLQPTAAVAGLPRDLARQFIARHE -----------1111-------------------------3333---------------- PFTREWYAGSAGYLSLQQSEFCVSLRSAKISGNVVRLYAGAGIVRGSDPEQEWQEIDNKA ---!!!!-------3333------------!!!!---------1111------------- AGLRTLLQ --3333-- >TRANSCRIPTIONAL REGULATOR; SWP:P23836; PDB:2EUBA; RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHIPDIAIVDLGLPDEDGLS ------------------------------------------------------------ LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVARQALRRNSQ -----1111--------------------1111---------3333------3333- >HYPOTHETICAL PROTEIN YFMB; SWP:O34626; PDB:2EUCA; QYFSPEQQYNAWIVSDLVKQIFHKRAGCSPGIHELAVFAEEHFHIDIDFVFSIINIGDIE -------------------3333-------3333-------------------------- FALTDEIEKKLSGYLSTLLPYVTADFETSKANAHAFLSAAYHLFV --------------33331111-----------3333-------- ------------ >PROBABLE ACETYLTRANSFERAS; SWP:Q9HX01; PDB:2EUIA; 
RIVQATLEHLDLLAPLFVKYREFYGLSYPESSRKFLEKRLRRKESVIYLALADEEDRLLG -----3333------------1111--------------1111----------------- FCQLYPSFSSLSLKRVWILNDIYVAEEARRQLVADHLLQHAKQARETHAVRRVSTSVDNE ---------1111-----------3333----------------1111-------1111- VAQKVYESIGFREDQEFKNYTLPISDELS 3333------------------------- >CYTOCHROME C PEROXIDASE; SWP:P00431; PDB:2EUTA; LVHVASVEKGRSYEDFQKVYNAIALKLREDDEYDNYIGYGPVLVRLAWHISGTWDKHDNT -------22223333--------------1111%%%%-------------1111------ GGSYGGTYRFKKEFNDPSNAGLQNGFKFLEPIHKEFPWISSGDLFSLGGVTAVQEMQGPK --1111---------3333----------------3333--------------1111--- IPWRCGRVDTPEDTTPDNGRLPDADKDAGYVRTFFQRLNMNDREVVALMGAHALGKTHLK ----------1111---------------------1111-----------3333---333 NSGYEGPGGAANNVFTNEFYLNLLNEDWKLEKNDANNEQWDSKSGYMMLPTDYSLIQDPK 3-------------------------------1111-----1111--------------- YLSIVKEYANDQDKFFKDFSKAFEKLLENGITFPKDAPSPFIFKTLEEQGL -------------------------1111----1111-------3333--- >HYPOTHETICAL PROTEIN RV12; SWP:Q11055; PDB:2EV1A; ANIDDLLGDLGGTARAERAKLVEWLLEQGITPDEIRATNPPLLLATRHLVGDDGTYVSAR 33331111-----------------1111-----1111--1111---1111--------- EISENYGVDLELLQRVQRAVGLARVDDPDAVVHMRADGEAAARAQRFVELGLNPDQVVLV --------------------------1111---3333----------1111--------- VRVLAEGLSHAAEAMRYTALEAIMRPGATELDIAKGSQALVSQIVPLLGPMIQDMLFMQL ------------------------2222---------------3333------------- RHMME ----- >55 KDA ERYTHROCYTE MEMBRA; SWP:Q00013; PDB:2EV8A; VRLIQFEKVTEEPMGITLKLNEKQSCTVARILHGGMIHRQGSLHVGDEILEINGTNVTNH ------------------------------------1111---2222----%%%%----- SVDQLQKAMKETKGMISLKVIPNQQ 3333--------------------- >TAK1 KINASE - TAB1 CHIMER; SWP:O43318; PDB:2EVAA; SLHMIDYKEIEVEEVVGRGAFGVVCKAKWRAKDVAIKQIESESERKAFIVELRQLSRVNH -----3333-------------------%%%%---------------------3333--- PNIVKLYGACLNPVCLVMEYAEGGSLYNVLHGAEPLPYYTAAHAMSWCLQCSQGVAYLHS --------------------3333---------------------------------111 MQPKALIHRDLKPPNLLLVAGGTVLKICDFGTGSAAWMAPEVFEGSNYSEKCDVFSWGII 
1----------3333----iiii----------3333-3333------------------ LWEVITRRKPFDEIGGPAFRIMWAVHNGTRPPLIKNLPKPIESLMTRCWSKDPSQRPSME ----------1111--3333----1111-----22223333-----1111-3333----- EIVKIMTHLMRYFPGADEPLQYPCQHSLPPGEDGRVEPYVDFAEFYRLWSVDHGE ---------1111-1111-----------------------3333---------- >METHYLMALONYL-COA DECARBO; SWP:O59021; PDB:2EVBA; MVVSENVVSAPMPGKVLRVLVRVGDRVRVGQGLLVLEAMKMENEIPSPRDGVVKRILVKE ---------------------2222--2222------%%%%-----------------22 GEAVDTGQPLIELG 22--2222------ >HYPOTHETICAL PROTEIN PSPT; SWP:Q87UR7; PDB:2EVEA; AYWLKSEPDEFSISDLQRLGKARWDGVRNYQARNFLRTAEGDEFFFYHSSCPEPGIAGIG ------3333----------------------------2222------------------ KIVKTAYPDPTALDPDSHYHDAKATTEKNPWSALDIGFVDIFKNVLGLGYLKQQSQLEQL --------3333-1111---11111111------------------3333---3333--3 PLVQKGSRLSVPVTAEQWAAILALRL 3332222---------------3333 >COG0791: Cell wall-associ; SWP:NA; PDB:2EVRA; KLGEYQCLADLNLFDSPECTRLATQSASGRHLWVTSNHQNLAVEVYLCEDDYPGWLSLSD ---------------3333-------2222--------%%%%--------------3333 FDSLQPATVPYQAATFSESEIKKLLAEVIAFTQKAQQSNYYLWGGTVGPNYDCSGLQAAF 1111-------------------------------------2222-------3333---- ASVGIWLPRDAYQQEGFTQPITIAELVAGDLVFFGTSQKATHVGLYLADGYYIHSSGKDQ 1111-----------------3333-2222-----------------iiii--------- GRDGIGIDILSEQGDAVSLSYYQQLRGAGRVFKSYEPQRR ----------1111-------1111--------------- >HYPOTHETICAL PROTEIN HP02; SWP:O25006; PDB:2EVVA; KTFEVIQTDSKGYLDAKFGGNAPKAFLNSNGLPTYSPKISWQKVEGAQSYALELIDHDAQ --------1111--333311113333-1111------------2222--------3333- KVCGPFVHWVVGNIAHNVLEENASDKRIVQGVNSLTQGFIRSPLNESEKQRSNLNNSVYI -------------------2222---------------1111-----------1111--- GPPPNGDHHYLIQVYALDIPKLALKAPFFLGDLHDKRNHIIAIGRKEFLYKQFV ----------------------------3333---------------------- >11S GLOBULIN BETA SUBUNIT; SWP:P13744; PDB:2EVXA; ACRLENLRAQDPVRRAEAEAGFTEVWDQDNDEFQCAGVNMIRHTIRPKGLLLPGFSNAPK --------------------------1111---------------2222----------- LIFVAQGFGIRGIAIPGCAETYQTDLRFKDQHQKIRPFREGDLLVVPAGVSHWMYNRGQS 
--------------2222--------------------2222----2222---------- DLVLIVFADTRNVANQIDPYLRKFYLAGRPEQVERGVEKSGNIFSGFADEFLEEAFQIDG -----------3333--------------------------3333--3333--1111--- GLVRKLKGEDDERDRIVQVDEDFEVLLETICTLRLKQNIGRSERADVFNPRGGRISTANY -------1111----------------3333-----------------1111------33 HTLPILRQVRLSAERGVLYSNAMVAPHYTVNSHSVMYATRGNARVQVVDNFGQSVFDGEV 333333------------2222--------------------------1111-------- REGQVLMIPQNFVVIKRASDRGFEWIAFKTNDNAITNLLAGRVSQMRMLPLGVLSNMYRI 2222----2222------1111-------------------------------------- SREEAQRLKYGQQEMRVLSPGR 3333------------------ >HYPOTHETICAL PROTEIN ACIA; SWP:Q6FF54; PDB:2EW0A; YLTHRCLIAPPEADDFFANTVIYLARHDEEGAQGIIINRPAGIQIKELLNDLDIDADNVN -2222--------3333----------1111------------------1111--1111- PHEVLQGGPLRPEAGFVLHTGQPTWHSSIAVGENVCITTSKDILDAIAHNEGVGRYQIAL ----------3333--------------------------3333--1111---------- GYASWGKNQLEDEIARGDWLICDADDLIFNLPYDDRWDAAYKKIGVDRTWLAS -----2222----1111--------------1111------3333-------- >RAS-RELATED PROTEIN RAB-3; SWP:Q15771; PDB:2EW1A; DYDFLFKIVLIGNAGVGKTCLVRRFTQGLFPPGQGATIGVDFMIKTVEINGEKVKLQIWD ------------2222--------------------------------iiii-------- TAGQERFRSITQSYYRSANALILTYDITCEESFRCLPEWLREIEQYASNKVITVLVGNKI ---3333-3333-3333--------11113333-------------------------33 DLAERREVSQQRAEEFSEAQDMYYLETSAKESDNVEKLFLDLACRLISEAR 331111-----------1111-------1111---------------3333 >2-DEHYDROPANTOATE 2-REDUC; SWP:Q831Q5; PDB:2EW2A; KIAIAGAGAGSRLGILHQGGNDVTLIDQWPAHIEAIRKNGLIADFNGEEVVANLPIFSPE --------------------------------------------iiii---------333 EIDHQNEQVDLIIALTKAQQLDAFKAIQPITEKTYVLCLLNGLGHEDVLEKYVPKENILV 3-3333----------3333----1111--1111-------------1111--3333--- GITWTAGLEGPGRVKLLGDGEIELENIDPSGKKFALEVVDVFQKAGLNPSYSSNVRYSIW ---------2222--------------3333----------------------------- RKACVNGTLNGLCTILDCNIAEFGALPVSESLVKTLISEFAAVAEKEAIYLDQAEVYTHI -------------------------1111---------------1111-----------3 VQTYDPNGIGLHYPSYQDLIKNHRLTEIDYINGAVWRKGQKYNVATPFCALTQLVHGKEE 
333-11111111--------------3333---------------3333----------1 LLGAK 111-- >SH3-CONTAINING GRB2-LIKE ; SWP:Q99963; PDB:2EW3A; MDQPCCRGLYDFEPENQGELGFKEGDIITLTNQIDENWYEGMIHGESGFFPINYVEVIVP ---------------2222---2222------------------------1111------ >(S)-1-PHENYLETHANOL DEHYD; SWP:Q5P5I4; PDB:2EW8A; QRLKDKLAVITGGANGIGRAIAERFAVEGADIAIADLVPAPEAEAAIRNLGRRVLTVKCD 1111-------------------------------------------1111--------1 VSQPGDVEAFGKQVISTFGRCDILVNNAGIYPLIPFDELTFEQWKKTFEINVDSGFLMAK 111-------------------------------3333---------------------- AFVPGMKRNGWGRIINLTSTTYWLKIEAYTHYISTKAANIGFTRALASDLGKDGITVNAI -------------------3333-----3333-----------------1111------- APSLVNMLQAIPRLQVPLDLTGAAAFLASDDASFITGQTLAVDGGMVRH -----1111------3333------------1111-------------- >COPPER-TRANSPORTING ATPAS; SWP:P35670; PDB:2EW9A; MAPQKCFLQIKGMTCASCVSNIERNLQKEAGVLSVLVALMAGKAEIKYDPEVIQPLEIAQ -------------------------1111-------------------1111-------- FIQDLGFEAAVMEDYAGSDGNIELTITGMTCASCVHNIESKLTRTNGITYASVALATSKA ------------------------------3333----------3333------3333-- LVKFDPEIIGPRDIIKIIEEIGFHASLAQ ---------3333---------------- >CONSERVED HYPOTHETICAL PR; SWP:Q48R10; PDB:2EWCA; TIRRYDVNEDRGHTGLVEAGDFYYLNYCVGNVGQDIESQINGAFDEERRLALVGLTLDAV --------1111------!!!!--------2222----------------1111-3333- VQDCLFRDVWNIPVEKIKERFNGRYPARKSIQTEFAHHGGPQGLLFQVDGVAYSKH -------1111-----1111iiii---------------2222------------- >LACTATE DEHYDROGENASE,; SWP:Q5CYZ2; PDB:2EWDA; MIERRKIAVIGSGQIGGNIAYIVGKDNLADVVLFDIAEGIPQGKALDITHSMVMFGSTSK ----------------------------------------------------1111---- VIGTDDYADISGSDVVIITASIPGRPKDDRSELLFGNARILDSVAEGVKKYCPNAFVICI -----33332222--------------------------------------1111----- TNPLDVMVSHFQKVSGLPHNKVCGMAGVLDSSRFRTFIAQHFGVNASDVSANVIGGHGDG -----------------1111----3333---------------3333---------111 MVPATSSVSVGGVPLSSFIKQGLITQEQIDEIVCHTRIAWKEVADNLKTGTAYFAPAAAA 1--3333--iiii3333------------------------------------------- 
VKMAEAYLKDKKAVVPCSAFCSNHYGVKGIYMGVPTIIGKNGVEDILELDLTPLEQKLLG ------1111-------------%%%%-----------1111------------------ ESINEVNTISKVLDNAP ----------------- >NICKING ENDONUCLEASE N.BS; SWP:Q9AM79; PDB:2EWFA; MAKKVNWYVSCSPRSPEKIQPELKVLANFEGSYWKGVKGYKAQEAFAKELAALPQFLGAF --------------3333--------1111-----3333-------------3333---- STRDRVAPMKTYGFVFVDEEGYLRITEAGKMLANNRRPKDVFLKQLVKWQYPSFQHKGKE 3333-----1111----1111----------1111-----------------3333-333 YPEEEWSINPLVFVLSLLKKVGGLSKLDIAMFCLTATNNNQVDEIAEEIMQFRNEREKIK 33333---------------------------1111-1111------------------- GQNKKLEFTENYFFKRFEKIYGNVSHKSKIETKMRNARDVADATTRYFRYTGLFVARGNQ -------------------------3333---------------------------!!!! LVLNPEKSDLIDEIISSSKVVKNYTRVEEFHEYYGNPSLPQFSFETKEQLLDLAHRIRDE ---1111----------------------------1111--1111--------------- NTRLAEQLVEHFPNVKVEIQVLEDIYNSLNKKVDVETLKDVIYHAKELQLELKKKKLQAD -----1111-------------------3333---------------------------- FNDPRQLEEVIDLLEVYHEKKNVIEEKIKARFIANKNTVFEWLTWNGFIILGNALEYKNN -------------3333----------1111----------------------------- FVIDEELQPVTHAAGNQPDMEIIYEDFIVLGEVTTSKGATQFKMESEPVTRHYLNKKKEL ---1111------1111------------------------------------------- EKQGVEKELYCLFIAPEINKNTFEEFMKYNIVQNTRIIPLSLKQFNMLLMVQKKLIEKGR -------------------------------------------------------1111- RLSSYDIKNLMVSLYRTTIECERKYTQIKAGLEETLNNWVVDKEVRF -----------------------3333-------------------- >MAJOR CARBOXYSOME SHELL P; SWP:P45689; PDB:2EWHA; GIALGMIETRGLVPAIEAADAMTKAAEVRLVGRQFVGGGYVTVLVRGETGAVNAAVRAGA ------------------------------------------------------------ DACERVGDGLVAAHIIARVHSEVENILPKAPQA --1111-------------33331111------ -------------------------------------------------------- >PUTATIVE AGMATINE DEIMINA; SWP:Q8DW17; PDB:2EWOA; AKRIKNTTPKQDGFRPGEFEKQKQIWLWPWRNDNWRLGAKPAQKAFLEVAEAISEFEPVS ----------------3333----------3333-%%%%------------3333----- LCVPPLQYENALARVSELGSHNIRIIETNDDAWIRDCGPTFLVNDKGDLRAVDWEFNAWG ---3333-------------------------3333------------------------ 
GLVDGLYFPWDQDALVARKVCEIEGVDSYKTKDFVLEGGSIHVDGEGTVLVTECLLHPSR ---------3333-----------------2222-------------------1111--- NPHLTKEDIEDKLKDYLNCVKVLWVKDGIDPYETNGHIDDVACFIRPGEVACIYTDDKEH ----3333---------------------------------------------------1 PFYQEAKAAYDFLSQQTDAKGRPLKVHKCVTKEPCYLQEAATIDYVEGEAIASYLNFLIV 111---------1111----------------------3333----------1111---2 NGGIILPQYGDENDQLAKQQVQEFPDRKVVGVRTEEIAYGGGNIHCITQQQPATL 222------------------------------33331111--3333-------- >HYPOTHETICAL PROTEIN TM10; SWP:Q9X0A5; PDB:2EWRA; IRPEYLRVLRKIYDRLKNEKVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFS -3333----------1111------------1111------------3333-------11 EFVSKKVRFSSTEKICSHFGELIIDGIKVEIGDIRKRLEDGTWEDPVDLNKYKRFVETHG 11---------1111--------iiii----------1111------3333--------- KIPVLSLEYEYQAYLKLGRVEKAETLRKWLNERK ------------------------------1111 >PANTOTHENATE KINASE; SWP:Q6G7I0; PDB:2EWSA; MKVGIDAGGTLIKIVQEQDNQRTFKTELTKNIDQVVEWLNQQQIEKLCLTGGNAGVIAEN -----------------%%%%------3333-------1111--------1111--1111 INIPAQIFVEFDAASQGLGILLKEQGHDLADYIFANVGTGTSLHYFDGQSQRRVGGIGTG ----------------------1111---------------------------------- GGMIQGLGYLLSQITDYKQLTDMAQHGDRNTIDLKVRHIIPGDLTAANFGHVLHHLDADF ----------------------3333--1111--3333--1111------11111111-- TPSNKLAAVIGVVGEVVTTMAITVAREFKTENIVYIGSSFHNNALLRKVVEDYTVLRGCK -------------------------------------1111------------------- PYYVENGAFSGAIGALYLEK ---2222--------1111- >PUTATIVE DNA-BINDING PROT; SWP:Q7WZG6; PDB:2EWTA; MSSEYAKQLGAKLRAIRTQQGLSLHGVEEKSQGRWKAVVVGSYERGDRAVTVQRLAELAD -----------------1111-------1111---------------------------- FYGVPVQELLP ----3333--- >A2,3-SIALYLTRANSFERASE, A; SWP:Q9CP67; PDB:2EX0A; MKTITLYLDPASLPALNQLMDFTQNNEDKTHPRIFGLSRFKIPDNIITQYQNIHFVELKD ----------------------1111----------1111--11111111--------%% NRPTEALFTILDQYPGNIELNIHLNIAHSVQLIRPILAYRFKHLDRVSIQQLNLYDDGSM %%----------------------3333--------------3333-------------- EYVDLEKEENKDISAEIKQAEKQLSHYLLTGKIKFDNPTIARYVWQSAFPVKYHFLSTDY 
---33332222-------------------------3333----------------3333 FEKAEFLQPLKEYLAENYQKMDWTAYQQLTPEQQAFYLTLVGFNDEVKQSLEVQQAKFIF ---3333------!!!!------3333---------------------1111-------- TGTTTWEGNTDVREYYAQQQLNLLNHFTQAEGDLFIGDHYKIYFKGHPRGGEINDYILNN -----------3333-------------1111----1111------1111---------- AKNITNIPANISFEVLMMTGLLPDKVGGVASSLYFSLPKEKISHIIFTSNKQVKSKEDAL -------3333-------------------3333---3333--------1111-3333-- NNPYVKVMRRLGIIDESQVIFWDSLKQLGGGL --------------3333--3333-------- >PENICILLIN-BINDING PROTEI; SWP:P24228; PDB:2EX2A; NVDEYITQLPAGANLALMVQKVGASAPAIDYHSQQMALPASTQKVITALAALIQLGPDFR 33333333-2222-------2222-------1111---!!!!-------------1111- FTTTLETKGNVENGVLKGDLVARFGADPTLKRQDIRNMVATLKKSGVNQIDGNVLIDTSI -----------iiii-----------11113333-----------------------333 FASHDKAPGWPWNDMTQCFSAPPAAAIVDRNCFSVSLYSAPKPGDMAFIRVASYYPVTMF 3-----22223333--3333-------%%%%----------2222------3333----- SQVRTLPRGSAEAQYCELDVVPGDLNRFTLTGCLPQRSEPLPLAFAVQDGASYAGAILKY ---------3333----------%%%%--------------------------------- ELKQAGITWSGTLLRQTQVNEPGTVVASKQSAPLHDLLKIMLKKSDNMIADTVFRMIGHA --1111--------------------------3333------------------------ RFNVPGTWRAGSDAVRQILRQQAGVDIGNTIIADGSGLSRHNLIAPATMMQVLQYIAQHD --------------------------!!!!--------1111--3333------------ NELNFISMLPLAGYDGSLQYRAGLHQAGVDGKVSAKTGSLQGVYNLAGFITTASGQRMAF ----3333--2222-1111-----11112222-------2222--------1111----- VQYLSGYAVEPADQRNRRIPLVRFESRLYKDIYQNN ---------1111--1111----------------- >DNA terminal protein; SWP:P03681; PDB:2EX3B; ANMRYQFEKNAYGVVASKAKIAEIERNTKEVQRLVDEKIKAMKDKEYYATGINRPHDFDF ---------1111---------------------------------------------33 SKVRSYSRLRTLEESMEMRTDPQYYEKKMIQLQLNFIKSVEGSFNSFDAADELIEELKKI 33-----------------------------------------------------3333- PPDDFYELFLRISEISGNTVENVEGNVYKILSYLEQYRRGDF 3333-3333-3333----------------------1111-- >ADRENAL GLAND PROTEIN AD-; SWP:NA; PDB:2EX4A; GSTSEVIEDEKQFYSKAKTYWKQIPPTVDGMLGGYGHISSIDINSSRKFLQRFLRKTGTS 
-3333--------------3333------1111-3333------------1111------ CALDCGAGIGRITKRLLLPLFREVDMVDITEDFLVQAKTYLGEEGKRVRNYFCCGLQDFT ------!!!!------3333---------3333--------3333---------3333-- PEPDSYDVIWIQWVIGHLTDQHLAEFLRRCKGSLRPNGIIVIKDNMAQEGILDDVDSSVC -------------3333-------------11112222---------------------- RDLDVVRRIICSAGLSLLAEERQENLPDEIYHVYSFALR ----------1111------------1111--------- >DNA ENDONUCLEASE I-CEUI; SWP:P32761; PDB:2EX5A; ILKPGEKLPQDKLEELKKINDAVKKTKNFSKYLIDLRKLFQIDEVQVTSESKLFLAGFLE --2222------------------------------------------------------ GEASLNISTKKLATSKFGLVVDPEFNVTRHVNGVKVLYLALEVFKTGRIRHKSGSNATLV -----------1111--------------3333------------------2222----- LTIDNRQSLEEKVIPFYEQYVVAFSSPEKVKRVANFKALLELFNNDAHQDLEQLVNKILP --------------------3333---------------------1111----------- IWDQMRKQQGQSNEGFPNLEAAQDFAR --1111-2222---------------- >NFED SHORT HOMOLOG; SWP:O58204; PDB:2EXDA; RRETTDIGGGKYTFELKGKVGKVVKIAEDHYLVEVEGDKWIAYSDEKLSLGDRVMVVDVD --------------------------3333----%%%%----------2222-------- GLKLKVKRIPPQLE ----------3333 >BETA-D-XYLOSIDASE; SWP:Q68HB3; PDB:2EXHA; KIKNPILTGFHPDPSICRVGDDYYIAVSTFEWFPGVRIYHSKDLKNWRLVARPLNRLSQL ------------------!!!!------!!!!-----------------------3333- NMIGNPDSGGVWAPHLSYSDGKFWLIYTDVKVVEGQWKDGHNYLVTCDTIDGAWSDPIYL -22222222---------%%%%-------------------------------------- NSSGFDPSLFHDEDGRKYLVNMYWDHRVDHHPFYGIVLQEYSVEQKKLVGEPKIIFKGTD -----------1111-----------1111------------1111------------33 LRITEGPHLYKINGYYYLLTAEGGTRYNHAATIARSTSLYGPYEVHPDNPLLTSWPYPRN 33-----------------------1111--------1111------------1111--- PLQKAGHASIVHTHTDEWFLVHLTGRPLPREGQPLLEHRGYCPLGRETAIQRLEWKDGWP -----------------------------22221111----1111----------%%%%- YVVGGNGPSLEIDGPSVEEVSWEKDYDEKDDFDGDTLNHHFQTLRIPLGEDIATLKARPG -------------------------------------1111-------3333-----222 HLRLYGRESLTSRFTQAFVARRWQHFHFVAETKVSFRPTTFQQSAGLVNYYNTQNWTTLQ 2-------1111---------------------------1111--------1111----- 
ITWHEEKGRILELMTCDHLVVDQPLRGREIVVPDDIEYVYLRVTVQATTYKYSYSFDGMN ----------------iiii--1111------1111---------!!!!----------- WIDLPVTFESYKLSDDYIKSRAAFTGAFVGMHCRDGSGQNNYADFDYFLYKEL --------3333-33333333-------------1111--------------- >HYPOTHETICAL PROTEIN BOR1; SWP:Q7WNU7; PDB:2EXNA; MSTTAYQPIAECGATTQSEAAAYQKRWLVANDAGQWLNRDLCPRLAEVSVELRMGYLVLK ---------3333------1111-------1111---3333------------------- APGMLRLDIPLDVIEDDDSVRYQMLVGEQTVDVVDEGELAAAWISNHAGVPCRILKVHPD ------------------------------------------------------------ MAEVRWPSLE ---------- >CYTOKININ DEHYDROGENASE 7; SWP:Q9FUJ1; PDB:2EXRA; LNIQGEILCGGAAADIAGRDFGGNCVKPLAVVRPVGPEDIAGAVKAALRSDKLTVAARGN ---------------11113333------------------------------------- GHSINGQAAEGGLVVDSTTAENHFEVGYLSGGDATAFVDVSGGALWEDVLKRCVSEYGLA --------2222----1111--------------------1111---------------- PRSWTDYLGLTVGGTLSNAGVSGQAFRYGPQTSNVTELDVVTGNGDVVTCSEIENSELFF --------------1111---1111----3333--------1111--------------- SVLGGLGQFGIITRARVLLQPAPDVRWIRVVYTEFDEFTQDAEWLVSQKNESSFDYVEGF ----iiii------------------------------------11112222-------- VFVNGADPVNGWPTVPLHPDHEFDPTRLPQSCGSVLYCLELGLHYRDSDSNSTIDKRVER ----------3333---1111--1111-1111-------------1111----------- LIGRLRFNEGLRFEVDLPYVDFLLRVKRSEEIAKENGTWETPHPWLNLFVSKRDIGDFNR -1111--2222--------------------------1111---------3333------ TVFKELVKNGVNGPLVYPLLRSRWDDRTSVVIPEEGEIFYIVALLRFVPPCAKVSSVEKV ------1111---------3333-3333--------------------1111-------- AQNQEIVHWCVKNGIDYKLYLPHYKSQEEWIRHFGNRWSRFVDRKAFDPAILSPGQKIFN ----------1111-------------------!!!!---------------3333---- RSL --- >TRANSCRIPTION INITIATION ; SWP:P32914; PDB:2EXUA; SERACMLCGIVQTTNEFNRDGCPNCQGIFEEAGVSTMECTSPSFEGLVGMCKPTKSWVAK ----------------------------------3333-------------3333----- WLSVDHSIAGMYAIKVDGRLPAEVVELLPHYKPRDGSGSATIWGVRCRPGKEKELIRKLL ---1111-------------33331111-------------------2222--------- KKKFNLDRAMGKKKLKILSIFQRDNYTGRIYIEAPKQSVIEKFCNGVPDIYISQKLLIPV ----------------------3333---------3333----2222---1111----33 
QELPLLLKPNLE 333333------ >HSCARG PROTEIN; SWP:NA; PDB:2EXXA; DKKLVVVFGGTGAQGGSVARTLLEDGTFKVRVVTRNPRKKAAKELRLQGAEVVQGDQDDQ --------3333-----------------------1111----------------11113 VIMELALNGAYATFIVTNYWESCSQEQEVKQGKLLADLARRLGLHYVVYSGLENIKKLTA 333---2222-------3333------------------1111---------------ii GRLAAAHFDGKGEVEEYFRDIGVPMTSVRLPCYFENLLSHFLPQKAPDGKSYLLSLPTGD ii--1111----------1111----------3333---------1111--------!!! VPMDGMSVSDLGPVVLSLLKMPEKYVGQNIGLSTCRHTAEEYAALLTKHTRKVVHDAKMT !-----3333----------33332222-------------------------------- PEDYEKLGFPGARDLANMFRFYALRPDRDIELTLRLNPKALTLDQWLEQHKGDFNL ---1111-2222---------1111-----------1111---------------- >PROBABLE TRNA PSEUDOURIDI; SWP:Q7LWY0; PDB:2EY4A; EVRRILPADIKREVLIKDENAETNPDWGFPPEKRPIEMHIQFGVINLDKPPGPTSHEVVA -----3333--------1111--1111--1111--------------------------- WIKKILNLEKAGHGGTLDPKVSGVLPVALEKATRVVQALLPAGKEYVALMHLHGDVPEDK ---1111----------1111-------!!!!-3333-1111------------------ IIQVMKEFEGEIIQRPPLRSAVKRRLRTRKVYYIEVLEIEGRDVLFRVGVEAGTYIRSLI ----3333-------------------------------!!!!----------------- HHIGLALGVGAHMSELRRTRSGPFKEDETLITLHDLVDYYYFWKEDGIEEYFRKAIQPME --------------------!!!!--1111---------------------------333 KAVEHLPKVWIKDSAVAAVTHGADLAVPGIAKLHAGIKRGDLVAIMTLKDELVALGKAMM 3-1111------------1111---1111--------2222-----1111---------- TSQEMLEKTKGIAVDVEKVFMPRDWYPKL ---------------------3333---- >Small nucleolar rnp simil; SWP:Q8U029; PDB:2EY4C; MKRLGKVLHYAKQGFLIVRTNWVPSLNDRVVDKRLQFVGIVKDVFGPVKMPYVAIKPKVS ----------1111----------2222---1111-----------3333---------- NPEIYVGEVLYVD ----2222----- >Ribosome biogenesis prote; SWP:Q8U1R4; PDB:2EY4E; RIRKCPKCGRYTLKEVCPVCGEKTKVAHPPRFSPEDPYGEYRRRWKREVLGI --------------------------------3333---------------- >ALPHA-ACTININ 1; SWP:P12814; PDB:2EYIA; AGHMEKQQRKTFTAWCNSHLRKAGTQIENIEEDFRDGLKLMLLLEVISGERLAKPERGKM -------------------1111-----1111-1111----------------------- RVHKISNVNKALDFIASKGVKLVSIGAEEIVDGNVKMTLGMIWTIILRFAIQDISVEETS 
---------------1111----------1111-----------------1111-iiii- AKEGLLLWCQRKTAPYKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNL ------------3333--------3333-------------3333-3333-1111----- NTAFDVAEKYLDIPKMLDAEDIVGTARPDEKAIMTYVSSFYHAFSGAQEFLEPG ---------------------------------------------3333----- >NKT15; SWP:Q6PIZ8; PDB:2EYSA; NQVEQSPQSLIILEGKNCTLQCNYTVSPFSNLRWYKQDTGRGPVSLTIMTFSENTKSNGR ------------2222---------------------2222--------1111------- YTATLDADTKQSSLHITASQLSDSASYICVVSDRGSTLGRLYFGRGTQLTVWPDIQNPDP ------1111---------3333------------1111--------------------- AVYQLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSN ---------------------3333------1111------------------------- KSDFACANAFNNSIIPEDTFFP 33333333-1111--1111--- >TRBV19 protein; SWP:Q6GMR4; PDB:2EYSB; DIYQTPRYLVIGTGKKITLECSQTMGHDKMYWYQQDPGMELHLIHYSYGVNSTEKGDLSS -----------2222--------------------2222--------------------- ESTVSRIRTEHFPLTLESARPSHTSQYLCASSGLRDRGLYEQYFGPGTRLTVTEDLKNVF -------3333--------3333------------%%%%---------------1111-- PPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKEQP ---------------------------------------iiii--2222---------11 ALNDSRYALSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWG 11-------------------1111-----------1111-------------------- RAD --- >TWITCHING MOTILITY PROTEI; SWP:NA; PDB:2EYUA; PEFKKLGLPDKVLELCHRKGLILVTGPTGSGKSTTIASIDYINQTKSYHIITIEDPIEYV -3333---33333333-------------------------------------------- FKHKKSIVNQREVGEDTKSFADALRAALREDPDVIFVGERDLETVETALRAAETGHLVFG ------------------------------------------------------------ TLHTNTAIDTIHRIVDIFPLNQQEQVRIVLSFILQGIISQRLLPKIGGGRVLAYGLLIPN --------------33333333-------------------------------------- TAIRNLIRENKLQQVYSLQSGQAETGQTNQTLYKLYKQGLITLEDAEASPDPKELERIR -----------------------------------1111--3333-------------- >v-crk sarcoma virus CT10 ; SWP:P46108; PDB:2EYVA; GAMGDSEERSSWYWGRLSRQEAVALLQGQRHGVFLVRDSSTSPGDYVLSVSENSRVSHYI -------1111------3333------------------------------%%%%----- 
INSSGPRPPVPPSPAQPPPGVSPSRLRIGDQEFDSLPALLEFYKIHYLDTTTLIEPVSRS ------------------------------------------------------------ RQGR ---- >TYROSINE PHENOL-LYASE; SWP:P31013; PDB:2EZ2A; MNYPAEPFRIKSVETVSMIPRDERLKKMQEAGYNTFLLNSKDIYIDLLTDSGTNAMSDKQ ---------------------------------3333-3333------------------ WAGMMMGDEAYAGSENFYHLERTVQELFGFKHIVPTHQGRGAENLLSQLAIKPGQYVAGN --1111-------------------------------3333----------2222----- MYFTTTRYHQEKNGAVFVDIVRDEAHDAGLNIAFKGDIDLKKLQKLIDEKGAENIAYICL ---------------------3333-1111---1111-------------3333------ AVTVNLAGGQPVSMANMRAVRELTEAHGIKVFYDATRCVENAYFIKEQEQGFENKSIAEI ----1111----------------1111--------------------2222-------- VHEMFSYADGCTMSGKKDCLVNIGGFLCMNDDEMFSSAKELVVVYEGMPSYGGLAGRDME ----1111--------1111--------------------3333---1111--------- AMAIGLREAMQYEYIEHRVKQVRYLGDKLKAAGVPIVEPVGGHAVFLDARRFCEHLTQDE -----------------------------1111-------------------11111111 FPAQSLAASIYVETGVRSMERGIISAGRNNVTGEHHRPKLETVRLTIPRRVYTYAHMDVV ---------------------3333----------------------------------- ADGIIKLYQHKEDIRGLKFIYEPKQLRFFTARFDYI ---------1111------------3333------- >E3 ubiquitin-protein liga; SWP:Q9VVI3; PDB:2EZ5W; GPLGSGEEEPLPPRWSMQVAPNGRTFFIDHASRRTTWIDPRNGRAS -------------------3333----------------------- >RIBONUCLEASE III; SWP:O67082; PDB:2EZ6A; MLEQLEKKLGYTFKDKSLLEKALTHVSYSKKEHYETLEFLGNALVNFFIVDLLVQYSPNK 3333----------3333------3333-------------------------------- REGFLSPLKAYLISEEFFNLLAQKLELHKFIRIKRGKINETIIGDVFEALWAAVYIDSGR 3333-----------------33333333---------3333------------------ DANFTRELFYKLFKEDILSAIKEGRVKKDYKTILQEITQKRWKERPEYRLISVEGPHHKK -------------------------------------------------------1111- KFIVEAKIKEYRTLGEGKSKKEAEQRAAEELIKLLEES -------!!!!--------------------------- >PYRUVATE OXIDASE; SWP:P37063; PDB:2EZ9A; TNILAGAAVIKVLEAWGVDHLYGIPGGSINSIMDALSAERDRIHYIQVRHEEVGAMAAAA -------------1111--------1111-------1111-------------------- DAKLTGKIGVCFGSAGPGGTHLMNGLYDAREDHVPVLALIGQFGTTGMNMDTFQEMNENP 
-------------------------------------------3333----2222--333 IYADVADYNVTAVNAATLPHVIDEAIRRAYAHQGVAVVQIPVDLPWQQIPAEDWYASANS 3-------------1111-----------1111-------3333-----1111---3333 YQTPLLPEPDVQAVTRLTQTLLAAERPLIYYGIGARKAGKELEQLSKTLKIPLMSTYPAK --------------------1111--------1111-------------------3333- GIVADRYPAYLGSANRVAQKPANEALAQADVVLFVGNNYPFAEVSKAFKNTRYFLQIDID ---3333------------------1111---------11111111-1111--------3 PAKLGKRHKTDIAVLADAQKTLAAILAQVSERESTPWWQANLANVKNWRAYLASLEDKQE 333----------------------1111------------------------------- GPLQAYQVLRAVNKIAEPDAIYSIDVGDINLNANRHLKLTPSNRHITSNLFATMGVGIPG ----------------1111------3333---------1111----------------- AIAAKLNYPERQVFNLAGDGGASMTMQDLATQVQYHLPVINVVFTNCQYGWIKDEQEDTN -------1111-------------3333---------------------3333------- QNDFIGVEFNDIDFSKIADGVHMQAFRVNKIEQLPDVFEQAKAIAQHEPVLIDAVITGDR ------------------1111-------3333--------------------------- PLPAEKLRLDSAMSSAADIEAFKQRYEAQDLQPLSTYLKQFGLDD --1111---1111--------------1111-3333--1111--- >TRANSPOSASE; SWP:P07636; PDB:2EZH; SEFDEDAWQFLIADYLRPEKPAFRKCYERLELAAREHGWSIPSRATAFRRIQQLDEAMVV ----------------3333----------------------3333---3333-3333-- ACREG ----- >TRANSPOSASE; SWP:P07636; PDB:2EZK; MIARPTLEAHDYDREALWSKWDNASDSQRRLAEKWLPAVQAADEMLNQGISTKTAFATVA ------------------------------------------------------------ GHYQVSASTLRDKYYQVQKFAKPDWAAALVDGR ----------------11113333---3333-- >TYPE II RESTRICTION ENZYM; SWP:O52512; PDB:2EZVA; MHQDYRELSLDELESVEKQTLRTIVQALQQYSKEAKSIFETTAADSSGEVIVLAEDITQY ---3333----------------------------------------3333--------- ALEVAETYPINRRFAGFIDYKRVRWLPSPHGLLPQVLLVDAKASTEKNRDTLQRSQLPMD -1111----------------------1111---------------------1111---- AEFRNTSSGEVVTMEAGVIPHLMLQSANDGVLPAVTTSIFVHFYYRELKEGRYRELKSIY ------------------------------------------------------------ VLSLPHARLKQRYNPDPDTSFFGAGKHSPARGEVARIRVYFDRLKEACPWRLQELHYSAD -----3333------1111--------3333----------------3333--------- SEYTQPRWRDLNDAGHEVTKEFLFLER -----------1111------------ 
>cAMP-dependent protein ki; SWP:P00514; PDB:2EZWA; SLRECELYVQKHNIQALLKDSIVQLCTARPERPMAFLREYFEKLEKEEAK -3333--------3333-------3333----1111------3333---- >UDP-N-ACETYLMURAMATE--L-A; SWP:P17952; PDB:2F00A; NTQQLAKLRSIVPERRVRHIHFVGIGGAGGGIAEVLANEGYQISGSDLAPNPVTQQLNLG -------1111------------1111---------1111-------------------- ATIYFNHRPENVRDASVVVVSSAISADNPEIVAAHEARIPVIRRAELAELRFRHGIAIAG -------33332222-----33331111--------------33333333---------- THGKTTTTAVSSIYAEAGLDPTFVNGGLVKAAGVHARLGHGRYLIAEADESDASFLHLQP ----------------------------3333----------------------3333-- VAIVTNIEADHDTYQGDFENLKQTFINFLHNLPFYGRAVCVDDPVIRELLPRVGRQTTTY -------------iiii-----------33331111---3333-----3333-------- GFSEDADVRVEDYQQIGPQGHFTLLRQDKEPRVTLNAPGRHNALNAAAAVAVATEEGIDD --1111---------!!!!------2222------------------------3333--- EAILRALESFQGTGRRFDFLGEFPLEPVNGKSGTALVDDYGHHPTEVDATIKAARAGWPD ------------------------------------------3333--------2222-- KNLVLFQPHRFTRTRDLYDDFANVLTQVDTLLLEVYPAGEAPIPGADSRSLCRTDPILVP ---------33333333-------------------------22223333---------- DPARVAELAPVLTGNDLILVQGAGNIGKIARSLAEIKLKPQ -------1111-----------!!!!-------1111---- >STREPTAVIDIN; SWP:P22629; PDB:2F01A; EAGITGTWYNQLGSTFIVTAGADGALTGTYESAVGNAESRYVLTGRYDSAPATDGSGTAL 3333-----1111-------1111------------------------------------ GWTVAWKNNYRNAHSATTWSGQYVGGAEARINTQWLLTSGTTEANAWKSTLVGHDTFTKV -----------------------------------------33331111----------- K - >TAGATOSE-6-PHOSPHATE KINA; SWP:NA; PDB:2F02A; SLIVTVTNPSIDISYLLDHLKLDTVNRTSQVTKTPGGKGLNVTRVIHDLGGDVIATGVLG ------------------------------------3333------3333---------- GFHGAFIANELKKANIPQAFTSIKEETRDSIAILHEGNQTEILEAGPTVSPEEISNFLEN -----------1111-------------------%%%%---------------------- FDQLIKQAEIVTISGSLAKGLPSDFYQELVQKAHAQEVKVLLDTSGDSLRQVLQGPWKPY ----1111---------22221111--------1111--------3333----------- LIKPNLEELEGLLGQDFSENPLAAVQTALTKPFAGIEWIVISLGKDGAIAKHHDQFYRVK --------------------------------2222------!!!!-----!!!!----- 
IPTIQAKNPVGSGDATIAGLAYGLAKDAPAAELLKWGAAGANAQERTGHVDVENVKKHLN --------2222-----------1111-------------3333---------------- IQVVEIAKEGHHH -------1111-- >PAIRED AMPHIPATHIC HELIX ; SWP:Q62141; PDB:2F05A; ESDSVEFNNAISYVNKIKTRFLDHPEIYRSFLEILHTYQKEQLHTKGRPFRGMSEEEVFT --------------------1111------------------------------------ EVANLFRGQEDLLSEFGQFLPEAKR --------------3333--3333- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2F06A; AVAKQLSIFLENKSGRLTEVTEVLAKENINLSALCIAENADFGILRGIVSDPDKAYKALK ------------------------1111-------------------------------1 DNHFAVNITDVVGISCPNVPGALAKVLGFLSAEGVFIEYYSFANNNVANVVIRPSNDKCI 111---------------2222--------1111---------!!!!------------- EVLKEKKVDLLAASDLYKL ---1111----3333---- >HYPOTHETICAL PROTEIN YDHA; SWP:P28224; PDB:2F09A; MQTDTLEYQCDEKPLTVKLNNPRQEVSFVYDNQLLHLKQGISASGARYTDGIYVFWSKGD -------------------3333------------------------------------- EATVYKRDRIVLNNCQLQNPQR ---------------------- >Segment polarity protein ; SWP:P51142; PDB:2F0AA; IITVTLNEKYNFLGISIVGQSNERGDGGIYIGSIKGGAVAADGRIEPGDLLQVNDINFEN ---------------------3333--------------------2222---!!!!---- SNDDAVRVLRDIVHKPGPIVLTVAKLE -----------3333------------ >PHAGE TP901-1 ORF49 (BPP); SWP:Q9G096; PDB:2F0CA; TINDDLEAINSELTSGGNVVHKTGDETIAGKKTFTGNVEVNGSLTLPTKSWSGELGGGII ----------3333-----------------------------------------iiii- LSLRKKGTTVEYSIGGEISSSILANSNLVNRSVPNEFCPRNRCSLVGHMVGGWNAFHIDI -----!!!!------------------------3333-----------2222-------- PSSGVCQWFGPTASSGTPRGTGTYPID 3333----------------------- >5'-AMP-ACTIVATED PROTEIN ; SWP:O43741; PDB:2F15A; QARPTVIRWSEGGKEVFISGSFNNWSTKIPLIKSHNDFVAILDLPEGEHQYKFFVDGQWV -------------------3333-------------------------------iiii-- HDPSEPVVTSQLGTINNLIHVKKSDFEVF -1111----1111----------2222-- >IMIDAZOLEGLYCEROL-PHOSPHA; SWP:P34047; PDB:2F1DA; GRIGEVKRVTKETNVSVKINLDGTGVADSSSGIPFLDHMLDQLASHGLFDVHVRATGDVH ---------3333--------------------------------------------333 IDDHHTNEDIALAIGTALLKALGERKGINRFGDFTAPLDEALIHVSLDLSGRPYLGYNLE 
3--------------------!!!!------------!!!!------------------- IPTQRVGTYDTQLVEHFFQSLVNTSGMTLHIRQLAGENSHHIIEATFKAFARALRQATET ---------3333------------------------3333------------------- DPR --- >PROTEIN APAG; SWP:Q8PP26; PDB:2F1EA; RYRVEVEVSPRFLAHQSTPDEGRYAFAYSIRIQNAGAVPARLVARHWQITDGNGRTEQVD ------------3333-3333-----------------------------1111------ GEGVVGEQPWLRPGEAFHYTSGVLLETEQGQMQGHYDMVADDGTEFIAPIAAFVLS ---iiii----2222------------------------1111------------- >ACETOLACTATE SYNTHASE ISO; SWP:P00894; PDB:2F1FA; ARRILSVLLENESGALSRVIGLFSQRGYNIESLTVAPTDDPTLSRMTIQTVGDEKVLEQI -----------2222-------------------------------------3333---- EKQLHKLVDVLRVSELGQGAHVEREIMLVKIQASGYGRDEVKRNTEIFRGQIIDVTPSLY ------1111----1111----------------3333-----------------1111- TVQLAGTSGKLDAFLASIRDVAKIVEVARSGVVGLSRGDKIMR -----------------1111---------------!!!!--- >PREPHENATE DEHYDROGENASE; SWP:P73906; PDB:2F1KA; MKIGVVGLGLIGASLAGDLRRRGHYLIGVSRQQSTCEKAVERQLVDEAGQDLSLLQTAKI --------3333-------1111----------------1111---------1111---- IFLCTPIQLILPTLEKLIPHLSPTAIVTDVASVKTAIAEPASQLWSGFIGGHPAGTAAQG -----1111-------3333-1111--------3333-------2222-----------3 IDGAEENLFVNAPYVLTPTEYTDPEQLALRSVLEPLGVKIYLCTPADHDQAVAWISHLPV 333---1111--------11113333------3333------------------------ MVSAALIQACAGEKDGDILKLAQNLASSGFRDTSRVGGGNPELGTMMATYNQRALLKSLQ ---------1111--------------------3333----------------------- DYRQHLDQLITLISNQQWPELHRLLQQTNGDRDKYVE -------------------------------3333-- >16S RRNA PROCESSING PROTE; SWP:Q9HXQ0; PDB:2F1LA; DLVVIGKIVSVYGIRGEVKVYSFTDPLDNLLDYRRWTLRRDGEIRQAELVRGRLHGKVLA -------------------------11111111------%%%%----------------- AKLKGLDDREEARTFTGYEICIPRSELPSYYWHQLEGLKVIDQGRQLLGVIDHLLETGAN --2222-33331111-------3333----33332222---1111--------------- DVVVKPCAGSLDDRERLLPYTGQCVLSIDLAAGERVDWDADF ------1111----------3333-----1111-----3333 >ACRIFLAVINE RESISTANCE PR; SWP:P31223; PDB:2F1MA; TTELPGRTSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDSAKGDL 
-----------------------------2222--2222--------------------- AKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETARINLA -----------------1111--------------------------------------- YTKVTSPISGRIGKSNVTEGALVQNGQATALATVQQLDPIYVDVTQSSNDKAKVSLITSD -----------------2222--2222-------------------3333---------- GIKFPQDGTLEFSDVTVDQTTGSITLRAIFPNPDHTPGFVRARLE -------------------------------1111---------- >CYTOLETHAL DISTENDING TOX; SWP:Q46669; PDB:2F1NA; TDLTDFRVATWNLQGASATTESKWNINVRQLISGENAVDILAVQEAGSPPSTAVDTGRVI -3333----------1111----------1111----------------1111------- PSPGIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRVNLALVSNRRADEVFVLSPVRQGG --------------------------------------------------------2222 RPLLGIRIGNDAFFTAHAIAMRNNDAPALVEEVYNFFRDSRDPVHQALNWMILGDFNREP -------!!!!-------------------------1111-3333--------------- ADLEMNLTVPVRRASEIISPAAATQTSQRTLDYAVAGNSVAFRPSPLQAGIVYGARRTQI --3333-----1111---------1111----------------------2222------ SSDHFPVGVSRR ------------ >molybdopterin-guanine din; SWP:O28031; PDB:2F1RA; LILSIVGTSDSGKTTLITRMMPILRERGLRVAVVKRKDSWKIYNSGADVVIASPVKLAFI ------------------------------------------------------------ RRVSEEEGNDLDWIYERYLSDYDLVITEGFSKAGKDRIVVVKKPEEVEHFRQGRILAVVC ---3333-----------1111-------1111---------33331111---------- DERVDGHKWFRRDEVERIAEFILSLLRE ----------1111----------1111 >conserved hypothetical pr; SWP:Q8A8E9; PDB:2F20A; CFHNSSAKAIKVAARYGRQSDVVEIYQSILDEQYHVNAFTFPRYPIITSSDEVQVFNWGL -------3333--1111--3333--3333-------1111-------------------- IPFWVRSEEDATEIRKTLNARADTIFEKPSFREPIKKRCIVPSTGYFEWRHEGANKIPYY -1111-3333--3333----3333---3333--------------------!!!!----- IYVKDEPIFSAGIYDRWLDKDTGEEHETFSIITTDTNSLTDYIDNTKHRPAILTQEEEEK -----------------------------------------------------1111-33 WLNPSLSKAEIASLLKPFDTEKDAYVIRNDFLKKSPNDPTIVQRALE 331111-----1111---3333-------3333-1111-1111---- >BH3987; SWP:Q9RC77; PDB:2F22A; GDTNGVLYAANTNALAKEIPESKWDIQLIPELGTLRKLFIHIVRVRDVYRDGLKTGSIKF ------------3333---3333-----1111---------------------------- 
PGRLASDEHRLLDELERSEELVFEFKQTTFNSIKGENYLSIELLGTVIQHEGIHQGQYYV ------------------------------------------------------------ ALKQSGINLPKQWVQDW ---------3333---- >Anti-cleavage anti-greA t; SWP:Q72JT8; PDB:2F23A; REVKLTKAGYERLMQQLERERERLQEATKILQELMESSDDYDDSGLEAAKQEKARIEARI ---------------------------------1111----------------------- DSLEDILSRAVILEEGSGEVIGLGSVVELEDPLSGERLSVQVVSPAEANVLDTPMKISDA ---------------------2222------------------3333----------111 SPMGKALLGHRVGDVLSLDTPKGKREFRVVAIHG 1-----22222222-----1111----------- >GLUTAMYL-TRNA(GLN) AMIDOT; SWP:P63488; PDB:2F2AA; MSIRYESVENLLTLIKDKKIKPSDVVKDIYDAIEETDPTIKSFLALDKENAIKKAQELDE --1111--------------3333-----------3333--------------------- LQAKDQMDGKLFGIPMGIKDNIITNGLETTCASKMLEGFVPIYESTVMEKLHKENAVLIG ----------2222---------2222-----3333------------------------ KLNMDEFAMGGSTETSYFKKTVNPFDHKAVPGGSSGGSAAAVAAGLVPLSLGSDTGGSIR ----2222----1111------1111---------------1111--------------- QPAAYCGVVGMKPTYGRVSRFGLVAFASSLDQIGPLTRNVKDNAIVLEAISGADVNDSTS -------------2222--2222---1111-----------------------1111--- APVDDVDFTSEIGKDIKGLKVALPKEYLGEGVADDVKEAVQNAVETLKSLGAVVEEVSLP --------1111---2222----3333-1111--------------------------11 NTKFGIPSYYVIASSEASSNLSRFDGIRYGYHSKEAHSLEELYKMSRSEGFGKEVKRRIF 11----------------1111-------------------------------------- LGTFALSSGYYDAYYKKSQKVRTLIKNDFDKVFENYDVVVGPTAPTTAFNLGEEIDDPLT ------2222---------------------------------------22221111--- MYANDLLTTPVNLAGLPGISVPCGQSNGRPIGLQFIGKPFDEKTLYRVAYQYETQYNLHD --1111-------------------iiii--------2222----------------111 VYEKL 11111 >Aspartyl/glutamyl-tRNA(As; SWP:P64201; PDB:2F2AB; ETVIGLEVHVELKTDSKMFSPSPAHFGAEPNSNTNVIDLAYPGVLPVVNKRAVDWAMRAA ----------------1111--------------3333--2222---------------- MALNMEIATESKFDRKNYFYPDNPKAYQISQFDQPIGENGYIDIEVDGETKRIGITRLHM 1111---------------1111----------------------iiii----------- EEDAGKSTHKGEYSLVDLNRQGTPLIEIVSEPDIRSPKEAYAYLEKLRSIIQYTGVSDVK -----------------1111--------------3333--------------------3 
MEEGSLRCDANISLRPYGQEKFGTKAELKNLNSFNYVRKGLEYEEKRQEEELLNGGEIGQ 333---------------------------------1111-------------------- ETRRFDESTGKTILMRVKEGSDDYRYFPEPDIVPLYIDDAWKERVRQTIPELPDERKAKY ------3333------------------1111-------------1111----------- VNELGLPAYDAHVLTLTKEMSDFFESTIEHGADVKLTSNWLMGGVNEYLNKNQVELLDTK ------------1111--------------------------------------3333-- LTPENLAGMIKLIEDGTMSSKIAKKVFPELAAKGGNAKQIMEDNGLVQ -3333----------------3333----------------------- >Aspartyl/glutamyl-tRNA(As; SWP:P68807; PDB:2F2AC; KVTREEVEHIANLARLQISPEETEEMANTLESILDFAKQNDSADTEGVEPTYHVLDLQNV --------------------------------------3333--2222------------ LREDKAIKGIPQELALKNAKETEDGQFKVPTI ----------------------%%%%------ >AQUAPORIN AQPM; SWP:Q9C4Z5; PDB:2F2BA; MVSLTKRCIAEFIGTFILVFFGAGSAAVTLMIASGGTSPNPFNIGIGLLGGLGDWVAIGL --------------------------------2222---3333-2222-!!!!------- AFGFAIAASIYALGNISGCHINPAVTIGLWSVKKFPGREVVPYIIAQLLGAAFGSFIFLQ ------------1111-------------1111--3333--------------------- CAGIGAATVGGLGATAPFPGISYWQAMLAEVVGTFLLMITIMGIAVDERAPKGFAGIIIG ----------%%%%---2222-------------------------11112222------ LTVAGIITTLGNISGSSLNPARTFGPYLNDMIFAGTDLWNYYSIYVIGPIVGAVLAALTY ---------1111------3333--------------33333333--------------- QYLTS ----- >CYCLIN HOMOLOG; SWP:Q01043; PDB:2F2CA; LNRAKIDSTTMKDPRVLNNLKLRELLLPKFTSLWEIQTEVTVDNRTILLTWMHLLCESFE ------3333--1111---------------2222------------------------- LDKSVFPLSVSILDRYLCKKQGTKKTLQKIGAACVLIGSKIRTVKPMTVSKLTYLSFTNL -1111---------1111----1111---------------------1111--------- ELINQEKDILEALKWDTEAVLATDFLIPLCNALKIPEDLWPQLYEAASTTICKALIQPNI ------------%%%%----1111-----------3333-------------33331111 ALLSPGLICAGGLLTTIETDNTNCRPWTCYLEDLSSILNFSTNTVRTVKDQVSEAFSLYD ---------------------------1111----------------------------3 LEIL 333- >PA1607; SWP:Q9I3B4; PDB:2F2EA; TSHKQASCPVARPLDVIGDGWSLIVRDAFEGLTRFGEFQKSLGLAKNILAARLRNLVEHG --1111-3333----------------1111----------------------------- VVAVPAESGSHQEYRLTDKGRALFPLLVAIRQWGEDYFFAPDESHVRLVERDSGQPVPRL 
-----------------3333------------------1111----------------- QVRAGDGSPLAAEDTRVSRD ---1111---3333------ >CYTOLETHAL DISTENDING TOX; SWP:O87120; PDB:2F2FA; PSEPSNFMTLMGQNGALLTVWALAKRNWLWAYPNIYSQDFGNIRNWKIEPGKHREYFRFV --3333-----1111-----------------33331111-------------------- NQSLGTCIEAYGNGLIHDTCSLDKLAQEFELLPTDSGAVVIKSVSQGRCVTYNPVSPTYY ----------!!!!------11111111-----1111----------------------- STVTLSTCDGATEPLRDQTWYLAPPVLEATAVN ------------1111----------------- >Cytolethal distending tox; SWP:Q7DK11; PDB:2F2FC; DPTTYPDVELSPPPRISLRSLLTAQPIKNDHYDSHNYLSTHWELIDYKGKEYEKLRDGGT ----3333------------------------1111-1111----------3333iiii- LVQFKVVGAAKCFAFPGEGTTDCKDIDHTVFNLIPTNTGAFLIKDALLGFCMTSHDFDDL ---------------------33331111------1111----------------2222- RLEPCGISVSGRTFSLAYQWGILPPFGPSKILRP --------2222--3333---------------- >SEED MATURATION PROTEIN P; SWP:Q9ASY9; PDB:2F2GA; GVIDTWIDKHRSIYTAATRHAFVVSIRDGSVDLSSFRTWLGQDYLFVRRFVPFVASVLIR --------------------------iiii------------------------------ ACKDSGESSDEVVLGGIASLNDEIEWFKREGSKWDVDFSTVVPQRANQEYGRFLEDLSSE ------1111--------------------------3333-----------------333 VKYPVITAFWAIEAVYQESFAHCLEDGNKTPVELTGACHRWGNDGFKQYCSSVKNIAERC 33333---------------------1111------------------------------ LENASGEVLGEAEDVLVRVLELEVAFWESRG 1111--------------------------- >PUTATIVE FAMILY 31 GLUCOS; SWP:P31434; PDB:2F2HA; MKISDGNWLIQPGLNLIHPLQVFEVEQQDNEMVVYAAPRDVRERTWQLDTPLFTLRFFSP ----------2222-------------!!!!-----------3333-------------- QEGIVGVRIEHFQGALNNGPHYPLNILQDVKVTIENTERYAEFKSGNLSARVSKGEFWSL 2222--------------------------------1111----!!!!------------ DFLRNGERITGSQVKNNGYVQDTNNQRNYMFERLDLGVGETVYGLGERFTALVRNGQTVE ---%%%%-----2222--------------------2222-------------2222--- TWNRDGGTSTEQAYKNIPFYMTNRGYGVLVNHPQCVSFEVGSEKVSKVQFSVESEYLEYF ------------------------------------------------------------ VIDGPTPKAVLDRYTRFTGRPALPPAWSFGLWLTTSFTTNYDEATVNSFIDGMAERNLPL ------------------------3333-------------------------1111--- 
HVFHFDCFWMKAFQWCDFEWDPLTFPDPEGMIRRLKAKGLKICVWINPYIGQKSPVFKEL -----1111-2222---------------------1111-----------3333------ QEKGYLLKRPDGSLWQWDKWQPGLAIYDFTNPDACKWYADKLKGLVAMGVDCFKTDFGER --------1111--------2222---------------------1111----------- IPTDVQWFDGSDPQKMHNHYAYIYNELVWNVLKDTVGEEEAVLFARSASVGAQKFPVHWG ------1111--3333---------------1111-3333--------2222-------- GDCYANYESMAESLRGGLSIGLSGFGFWSHDIGGFENTAPAHVYKRWCAFGLLSSHSRLH -------------------1111-------2222-------------------------- GSKSYRVPWAYDDESCDVVRFFTQLKCRMMPYLYREAARANARGTPMMRAMMMEFPDDPA ------1111-3333----------------------------------3333-1111-- CDYLDRQYMLGDNVMVAPVFTEAGDVQFYLPEGRWTHLWHNDELDGSRWHKQQHGFLSLP 1111----------------3333------------------------------1111-- VYVRDNTLLALGNNDQRPDYVWHEGTAFHLFNLQDGHEAVCEVPAADGSVIFTLKAARTG --------------------1111---------2222-------1111----------!! NTITVTGAGEAKNWTLCLRNVVKVNGLQDGSQAESEQGLVVKPQGNALTITLH !!---------------2222-------------1111--------------- >KALATA-B1; SWP:P56254; PDB:2F2IA; CGETCVGGTCNTPGCTCSWDKCTRNGLPV -----------------%%%%--iiii-- >KALATA-B1; SWP:P56254; PDB:2F2JA; CGETCVGGTCNTPGCTCSKNKCTRNGLPV -----------2222--------iiii-- >Peptidoglycan-recognition; SWP:Q8SXQ7; PDB:2F2LX; MVILKVAEWGGRPAKRMLDAQQLPINRVVISHTAAEGCESREVCSARVNVVQSFHMDSWG ----3333---------------------------------------------------- WDHIGYNFLVGGDGRVYEGRGWDYVGAHTKGYNRGSIGISFIGTFTTRKPNERQLEACQL ----------1111--------------22222222------------------------ LLQEGVRLKKLTTNYRLYGHRQLSATESPGEELYKIIKKWPHWSHE ------------------3333-----------------1111--- >ACETYL-COA ACETYLTRANSFER; SWP:P24752; PDB:2F2SA; SSGLVPRGSEVVIVSATRTPIGSFLGSLSLLPATKLGSIAIQGAIEKAGIPKEEVKEAYM ----------------------2222-11113333----------3333-3333------ GNVLQGGEGQAPTRQAVLGAGLPISTPCTTINKVASGMKAIMMASQSLMCGHQDVMVAGG ----2222---------1111-1111---------------------1111--------- MESMSNVPYVMNRGSTPYGGVKLEDLIVKDGLTDVYNKIHMGSCAENTAKKLNIARNEQD --3333----------2222-------------------3333----------------- 
AYAINSYTRSKAAWEAGKFGNEVIPVTVTVVKEDEEYKRVDFSKVPKLKTVFQKENGTVT ----------------1111--------------3333--3333-----1111------3 AANASTLNDGAAALVLMTADAAKRLNVTPLARIVAFADAAVEPIDFPIAPVYAASMVLKD 333--------------------------------------33331111----------- VGLKKEDIAMWEVNEAFSLVVLANIKMLEIDPQKVNINGGAVSLGHPIGMSGARIVGHLT ---3333---------3333----------1111-11113333---11113333------ HALKQGEYGLASICNGGGGASAMLIQKL ---------------------------- >RHO-ASSOCIATED PROTEIN KI; SWP:Q28021; PDB:2F2UA; QRKLEALIRDPRSPINVESLLDGLNSLVLDLDFPALRKNKNIDNFLNRYEKIVKKIRGLQ -333333333333--------------------3333--3333------------3333- MKAEDYDVVKVIGRGAFGEVQLVRHKASQKVYAMKLLSKFEMIKRSDSAFFWEERDIMAF -3333---------1111------------------------1111-------------- ANSPWVVQLFCAFQDDKYLYMVMEYMPGGDLVNLMSNYDVPEKWAKFYTAEVVLALDAIH --1111-------------------3333----1111----------------------- SMGLIHRDVKPDNMLLDKHGHLKLADFGTCMKMDETGMVHCDTAVGTPDYISPEVLKSQG ---------3333---1111------1111----------------3333-------111 GDGYYGRECDWWSVGVFLFEMLVGDTPFYADSLVGTYSKIMDHKNSLCFPEDAEISKHAK 1----------------------------------------3333----1111------- NLICAFLTDREVRLGRNGVEEIKQHPFFKNDQWNWDNIRETAAPVVPELSSDIDSSNFDD --------33332222-333311111111-------3333--------------1111-- IEVETFPIPKAFVGNQLPFIGFTYYR -------------1111-2222---- >DIAPHANOUS PROTEIN HOMOLO; SWP:O08808; PDB:2F31A; SAMMYIQELRSGLRDMHLLSCLESLRVSLNNNPVSWVQTFGAEGLASLLDILKRLHDEKN -------1111-------------------------------------------1111-- YDSRNQHEIIRCLKAFMNNKFGIKTMLETEEGILLLVRAMDPAVPNMMIDAAKLLSALCI --------------1111-------3333------3333-3333-------------111 LPQPEDMNERVLEAMTERAEMDEVERFQPLLDGLKSGTSIALKVGCLQLINALITPAEEL 1-----------------------1111--11113333---------------3333-33 DFRVHIRSELMRLGLHQVLQELREIENEDMKVQLCVFDEQGDEDFFDL 33-------------3333----------------------------- >CALBINDIN; SWP:CALB1_RAT; PDB:2F33A; MAESHLQSSLITASQFFEIWLHFDADGSGYLEGKELQNLIQELLQARKKAGLELSPEMKT 3333-------3333-----1111--------3333------------------3333-- FVDQYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCEEFMKTWRKYDTDHSGFIET 
-------1111------1111-----------------3333---1111----------- EELKNFLKDLLEKANKTVDDTKLAEYTDLMLKLFDSNNDGKLELTEMARLLPVQENFLLK ------------------3333---------------------3333----3333-3333 FQGIKMCGKEFNKAFELYDQDGNGYIDENELDALLKDLCEKNKQELDINNISTYKKNIMA ----------------------------------------------3333--------11 LSDGGKLYRTDLALILSAGDN 11iiii-33333333------ >GLUTAMATE RECEPTOR, IONOT; SWP:P22756; PDB:2F34A; RTLIVTTILEEPYVMYRKSDKPLYGNDRFEGYCLDLLKELSNILGFLYDVKLVPDGKYGA -----------------------!!!!-------------------------1111---- QNDKGEWNGMVKELIDHRADLAVAPLTITYVREKVIDFSKPFMTLGISILYRKGTPIDSA -1111-----------------------3333--------------------------33 DDLAKQTKIEYGAVRDGSTMTFFKKSKISTYEKMWAFMSSRQQSALVKNSDEGIQRVLTT 331111-------2222-----------3333---------------------------- DYALLMESTSIEYVTQRNCNLTQIGGLIDSKGYGVGTPIGSPYRDKITIAILQLQEEGKL -------------33331111----------------2222------------------- HMMKEKWWRGN ----------- >Transient receptor potent; SWP:Q9Y5S1; PDB:2F37A; PNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCLKAVLNLKDGV ------------1111--1111-----------11111111----------3333-iiii NACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVKLLVENGANVHA 1111--------------3333----3333---------------------1111-1111 RACGRFFQKGGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQATDSQGNTVLHAL ----1111----------------1111----------------1111-1111------- VISDNSAENIALVTSYDGLLQAGARLCPTVQLEDIRNLQDLTPLKLAAKEGKIEIFRHIL -----1111-----------------11111111--1111-------------------- QREF 1111 >Thrombin inhibitor infest; SWP:Q95P16; PDB:2F3CI; DCACPRVLHRVCGSDGNTYSNPCTLDCAKHEGKPDLVQVHEGPCDP ------------1111----------------1111---------- >DNA-directed RNA polymera; SWP:P52434; PDB:2F3IA; MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI ---------------------1111-------------------3333------------ ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG ---------3333--------------------------------------------iii LLMRLQGDANNLHGFEVDSRVYLLMKKLAF i------3333------------------- >SH3 AND MULTIPLE ANKYRIN ; 
SWP:Q9JLU4; PDB:2F3NA; MLQLWSKFDVGDWLESIHLGEHRDRFEDHEIEGAHLPALTKEDFVELGVTRVGHRENIER 3333----------11113333----1111-33331111--------------------- ALRQL ----- >PYRUVATE FORMATE-LYASE 2; SWP:O28823; PDB:2F3OA; DRIEKLIKKVSKPARLSVERCRLYTESMKQTEGEPMIIRQAKALKHVLENIPIQILDSEL 3333------------------------1111-----------------------2222- IVGTMLPNPPGAIIFPEGVGLRIINELDSLPNRETNRLMVDEEDAKVLREEIAPYWQRKT --------------3333-------3333-----------3333-----------22223 IEAFAFPLMPDIMQILYTGSVFVLTEIAGISHVAVNYPYLLRRGFRWFLEESERRIRALE 333-3333-----------------1111----------1111----------------1 ESGVYEGEKYSFYQAAKIVSEAVINYGLRYSKLAEELAESEDGERREELLKIAEICRKVP 111--------------------------------------------------------- AEKPETFWEAVQFVWLVQSALHQENYEQAISMGRIDQYLYPFFKKDIGEGRINRELAFDI ---------------------------------3333--------2222----------- LANLWIKTNEIVPAFDSLLEQYFSGQATNQAVTIGGCDIYGNDATNELTYLMLEVTDRLR ---------------3333----------------------------------------- LRQPNVHVRINKGSPESFLKRLAEAISSGCNNLALFFDDAAVKALKNAEVDDRDALNYTT ----------1111-------------------------------1111----------- DGCVEIAPFGNSFTSSDAALINVAKALEYALNEGVDLQFGYEFGAKTEKPKFLEDLLEKL -------------------------------%%%%----------------3333----- REQVSHIVKLVVRGSNVLSYANAEVKPTPLLSLCVEDCFEKGVDVSRGGARYNFTGIQAV ---------------------------3333-----3333---1111------------- GIADVGDSLVAIEGALNAGYSMDDIVEACRKNFVGYEKLHKLLLQSPKYGNDDDAADKYT 3333----------------3333--------2222----------------3333---- KMVLEWYCEEVNRHRNFRGGKFAAGCYPMTTNVGFGFFTSALPSGRKSGEPLNPGVSPST ----------1111-1111---------------3333---1111--------!!!!-22 GMDREGVTAVINSASKLSYENLPNGASLTINLSSDVLGEKGDAVIEALIKSSMELGVMHV 22---------------3333-------------3333---3333-------1111---- QFNILKEDLLRKAQQEPEKYRWLLVRVAGWSAYFVELSRPVQEEVIRRISCRI -----1111-3333-3333-------------3333---------1111---- >CALMODULIN; SWP:P62158; PDB:2F3YA; QLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGT --------------3333----------------1111---------------1111--- IDFPEFLTMMARKMKDTDSEEEIREAFRVFDKDGNGYISAAELRHVMTNLGEKLTDEEVD 
---------------1111-----------1111-------------1111--------- EMIREADIDGDGQVNYEEFVQMMTAK ------1111---------------- >HYPOTHETICAL PROTEIN PF14; SWP:Q8U0X6; PDB:2F40A; SKWIKFTTNLTPEEAKIVQYELSTRDEFYRVFINPYAKVAEVVIDDSKVNIEELKEKLKG --------------------3333-------------------------------1111- EVIEEKEITLQELI -------------- >TRANSCRIPTION FACTOR FAPR; SWP:O34835; PDB:2F41A; EVIGEIIDLELDDQAISILEIKQEHVFSRNQIARGHHLFAQANSLAVAVILALTASADIR ---------2222--------1111--1111----------------------------- FTRQVKQGERVVAKAKVTAVEKEKGRTVVEVNSYVGEEIVFSGRFDMY -----2222-------------------------!!!!---------- >STIP1 HOMOLOGY AND U-BOX ; SWP:Q7ZTZ6; PDB:2F42A; AKKKRWNSIEEKRISQENELHAYLSKLILAEKERELDDSKHDKYLMDMDELFSQVDEKRK ---3333-------------------------------3333---------------333 KREIPDYLCGKISFELMREPCITPSGITYDRKDIEEHLQRVGHFDPVTRSPLTQDQLIPN 3---1111--------------1111---------3333-------------3333---- LAMKEVIDAFIQENGWVE -------------3333- >HYPOTHETICAL PROTEIN; SWP:Q9JY57; PDB:2F46A; KAILKLDEHLYISPQLTKADAEQIAQLGIKTIICNRPDREEESQPDFAQIKQWLEQAGVT ------1111------3333--------------------1111---------1111--- GFHHQPVTARDIQKHDVETFRQLIGQAEYPVLAYCRTGTRCSLLWGFRRAAEGPVDEIIR -------3333------------1111--------------------------------- RAQAAGVNLENFRERLDNARV --1111--1111----1111- >diphosphate--fructose-6-p; SWP:O51052; PDB:2F48A; SLFKQERQKYIPKLPNILKKDFNNISLVYGENTEAIQDRQALKEFFKNTYGLPIISFTEG ------1111----3333--1111-------------3333------------------- ESSLSFSKALNIGIILSGGPAPGGHNVISGVFDAIKKFNPNSKLFGFKGGPLGLLENDKI ----------------------3333------------1111-------33331111--- ELTESLINSYRNTGGFDIVSSGRTKIETEEHYNKALFVAKENNLNAIIIIGGDDSNTNAA --3333---2222--3333--------------------1111----------------- ILAEYFKKNGENIQVIGVPKTIDADLRNDHIEISFGFDSATKIYSELIGNLCRDAMSTKK ------1111----------1111---------2222----------------------- YWHFVKLMGRSASHVALECALKTHPNICIVSEEVLAKKKTLSEIIDEMVSVILKRSLNGD -------------------------------------------------------1111- NFGVVIVPEGLIEFIPEVKSLMLELCDIFDKNEGEFKGLNIEKMKEIFVAKLSDYMKGVY 
-------1111---3333-------------33331111--------------------1 LSLPLFIQFELIKSILERDPHGNFNVSRVPTEKLFIEMIQSRLNDMKKRGEYKGSFTPVD 111---------------1111---33333333-------------1111---------- HFFGYEGRSAFPSNFDSDYCYSLGYNAVVLILNGLTGYMSCIKNLNLKPTDWIAGGVPLT ---1111-----------------------1111---------11113333------333 MLMNMEERYGEKKPVIKKALVDLEGRPFKEFVKNRDKWALNNLYLYPGPVQYFGSSEIVD 3------%%%%----------1111-----------------------------3333-- EITETLKLELF -------1111 >DUAL SPECIFICITY PROTEIN ; SWP:DUS3_HUMAN; PDB:2F4DA; ELSVQDLNDLLSDGSGYSLPSQPNEVTPRIYVGNASVAQDIPKLQKLGITHVLNAAEGRS --------------------------2222-----3333-----1111-----------1 FMHVNTNANFYKDSGITYLGIKANDTQEFNLSAYFERAADFIDQALAQKNGRVLVHCREG 111---33332222-----------33333333--------------2222--------- YSRSPTLVIAYLMMRQKMDVKSALSIVRQNREIGPNDGFLAQLCQLNDRLAKEGKLKP ---------------------------------------------------------- >ATFKBP42; SWP:Q9LDC0; PDB:2F4EA; GNVPPKVDSEAEVLDEKVSKQIIKEGHGSKPSKYSTCFLHYRAWTKNSQHKFEDTWHEQQ --------------1111-------------2222------------------------- PIELVLGKEKKELAGLAIGVASMKSGERALVHVGWELAYGKEGNFSFPNVPPMADLLYEV ----2222-1111------11112222------3333--1111----------------- EVIGFDETKEG ----------- >HYPOTHETICAL PROTEIN TM09; SWP:NA; PDB:2F4IA; FDPRYARELWFLQDNEGLGYDAVEVLNTLDENPELAHQFAVVGVSNYRYYIIQGVGEIVE ------------------------------------------------------------ IDDGILVVRENRVPDLFLSNHIFGNGIVNATGIAEDFDRIIDFNLTATELNIVEEVVNSF ---------------------------------------------------------333 LQLSGAGSVGSLVRFIAVFTLLDEEIYPIEAIPLYLEIQ 3------2222---------------------------- >VILLIN-1; SWP:P02640; PDB:2F4KA; LSDEDFKAVFGMTRSAFANLPLWQQHLKEKGLF ----------------1111-------1111-- >ACETAMIDASE, PUTATIVE; SWP:Q9WXX3; PDB:2F4LA; KVVPAQRCVYSFSANAPVEEVYPGEQVVFETLDALGSKVNPATGPVFVNGVKPGDTLKVR ---3333--------------2222-------1111-----------22222222----- IKRIELPRRGIVTGKGFGVLGDEVEGFHTKELEIEKWAVLFDGVRIPIHPVGVIGVAPQE -------------2222--1111-----------1111--!!!!---------------- GEYPTGTAHRHGGNDTKEITENVTVHLPVFQEGALLALGDVHATGDGEVCVSACEVPAKV 
---1111-1111---3333-----------2222-------------1111--------- VVEIDVSKEEIKWPVVETNDAYYIIVSLPDIEEALKEVTRETVWFIQRRKTIPFTDAYLA ----------------------------------------------------3333---- SLSVDVGISQLVNPAKTAKARIPKYIFT ------------------------1111 >PEPTIDE N-GLYCANASE; SWP:Q9JI78; PDB:2F4MA; GDSTILKVLQSNIQHVQLYENPVLQEKALTCIPVSELKRKAQEKLFRARKLDKGTNVSDE ---------------3333--------3333---------------11113333------ DFLLLELLHWFKEEFFRWVNNIVCSKCGGETRSRDEALLPNDDELKWGAKNVENHYCDAC --------------------------------------------1111---------111 QLSNRFPRYNNPEKLLETRCGRCGEWANCFTLCCRALGFEARYVWDYTDHVWTEVYSPSQ 1---------------------------------1111-------1111--------111 QRWLHCDACEDVCDKPLLYEIGWGKKLSYIIAFSKDEVVDVTWRYSCKHDEVMSRRTKVK 1------1111---1111---------------1111---3333--------1111---- EELLRETINGLNKQRQLSLSESRRKELLQRIIVELVEFISPKTPRPGLEHHHHHH ---------------1111-----------------1111----11111111--- >HYPOTHETICAL PROTEIN MJ16; SWP:Q59045; PDB:2F4NA; DDILDIITLTTDFGTNEGYVGAKGRILNILKKYNKDAKIIDISHEIKPFNIYHGAYVLLT -----------------3333-------------------------2222---------- AIPYFPPSVHVAVIDPTRKSIVIETKSGYYLVGPDNGLFTYVAEKLGIKRIIKIDEERGR 3333--------------------1111--------1111-------------------- DVYAVVGAEILINNGYDGEELDEVKIDETKKRVIHIDRFGNIITNIKTFKTIIKIRHKNG ------------------------------------1111----------------3333 IEKIIKCKFVKSYFEEKNNFICLINSEGFLEISKFDNASKLLNVDYLDEIEIE -----------33333333-----1111-----------1111-2222----- >HYPOTHETICAL PROTEIN TM10; SWP:Q9X0A3; PDB:2F4PA; VDDIFERGSKGSSDFFTGNVWVKLVTDENGVFNTQVYDVVFEPGARTHWHSHPGGQILIV -----------3333-----------1111-----------2222------1111----- TRGKGFYQERGKPARILKKGDVVEIPPNVVHWHGAAPDEELVHIGISTQVHLGPAEWLGS --------2222-----2222----2222-------------------3333-------- VTEEEYRKATEGK ---------2222 >TYPE I TOPOISOMERASE, PUT; SWP:Q9RWH8; PDB:2F4QA; PSRTELLAEEYLRREPQKFRLARIARLAVPPAYQDVYVSPDAENELQAFGRDAAGRLQYR ---------------------3333----1111-------3333-------1111----- YHPDFVQAGALKKWQRLTRFAGALPTLKVATTADLRASGLPPRKVALTRLLHVARFRVTY 
-3333----------------------------1111---3333---------------- GLSTLRQRHVVVDGNTVTFRFKGKHGVSQHKATSDRTLAANQKLLDLPGPWLFQTVDAGG 1111-3333---!!!!------2222----------------3333----------3333 GERRRIHSTELNAYLREVIGPFTAKDFRTWGGTLLAAEYLAQQGTESSERQAKKVLVDCV ----------------------3333---------------------------------- KFVADDLGNTPAVTRGSYICPVIFDRYLDGKVLDDYEPRTERQEAELEGLTRSEGALKRL --------------------------1111-3333---------1111------------ ESERT 1111- >UBIQUITIN-CONJUGATING ENZ; SWP:Q5T7L0; PDB:2F4WA; TATQRLKQDYLRIKKDPVPYICAEPLPSNILEWHYVVRGPEMTPYEGGYYHGKLIFPREF -----------------2222----1111-------------1111----------1111 PFKPPSIYMITPNGRFKCNTRLCWNPAWSVSTILTGLLSFMVEKGPTLGSIETSDFTKRQ ------------------------3333-----------1111---2222---1111--- LAVQSLAFNLKDKVFCELFPEVVEEIK --------------------------- >TGTWINSCAN_2721 - E2 DOMA; SWP:NA; PDB:2F4ZA; REQARLLKELADIQQLGVSAQIVGGDIHRWRGFIAGPLGTPYEGGHFTLDIVIPPDYPYN ----------------------iiii-------------1111----------1111--- PPKMKFVTKIWHPNISSQTGAICLDILKHEWSPALTIRTALLSIQAMLADPVPTDPQDAE -----------1111--------333311113333-----------1111-3333----- VAKMMIENHPLFVQTAKLWTETFAK ------------------------- >THIOREDOXIN; SWP:Q8IEV4; PDB:2F51A; SDPIVHFNGTHEALLNRIKEAPGLVLVDFFATWCGPCQRLGQILPSIAEANKDVTFIKVD -----------------3333---------1111----------------1111-----3 VDKNGNAADAYGVSSIPALFFVKKEGNEIKTLDQFVGADVSRIKADIEKFK 333-----1111------------!!!!----------------------- >TRAV20 protein; SWP:Q6PIZ8; PDB:2F53D; KQEVTQIPAALSVPEGENLVLNCSFTDSAIYNLQWFRQDPGKGLTSLLLIPFWQREQTSG -------------2222---------------------2222--------1111----!! 
RLNASLDKSSGRSTLYIAASQPGDSATYLCAVRPTSGGSYIPTFGRGTSLIVHPYIQNPD !!----3333----------3333------------------------------------ PAVYQLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWS -------3333-----------------------------------1111---------- NKSDFACANAFN -----3333--- >SERINE/THREONINE-PROTEIN ; SWP:NA; PDB:2F57A; MVSHEQFRAALQLVVSPGDPREYLANFIKIGEGSTGIVCIATEKHTGKQVAVKKMDLRKQ ----------3333----3333---------------------------------1111- QRRELLFNEVVIMRDYHHDNVVDMYSSYLVGDELWVVMEFLEGGALTDIVTHTRMNEEQI -----------------1111--------!!!!-----------3333------------ ATVCLSVLRALSYLHNQGVIHRDIKSDSILLTSDGRIKLSDFGFCAQVSKEVPKRKLVGT ------------------------3333---1111------1111---3333-------- PYWMAPEVISRLPYGTEVDIWSLGIMVIEMIDGEPPYFNEPPLQAMRRIRDSLPPRVKDL 1111------------------------------2222--------------------33 HKVSSVLRGFLDLMLVREPSQRATAQELLGHPFLKLAGPPSCIVPLMRQ 33---------------3333--3333---3333----33333333--- >6,7-DIMETHYL-8-RIBITYLLUM; SWP:Q57DY1; PDB:2F59A; APHLLIVEARFYDDLADALLDGAKAALDEAGATYDVVTVPGALEIPATISFALDGADNGG ---------------------------1111---------3333---------------- TEYDGFVALGTVIRGETYHFDIVSNESCRALTDLSVEESIAIGNGILTVENEEQAWVHAR -----------------------------------1111--------------------3 REDKDKGGFAARAALTMIGLRKKFGA 333----------------------- >TRANSPOSASE, PUTATIVE; SWP:Q97Y68; PDB:2F5GA; ELKSTRHTKYLCNYHFVWIPKHRRNTLVNEIAEYTKEVLKSIAEELGCEIIALEVMPDHI -------------------------------------------------------1111- HLFVNCPPRYAPSYLANYFKGKSARLILKKFPQLNKGKLWTRSYFVATAGNVSSEVIKKY ------1111-------------------------------------------------- IEEQWRKEGE ---------- >METALLOTHIONEIN-3; SWP:P25713; PDB:2F5HA; KSCCSCCPAECEKCAKDCVCKGGEAAEAEAEKCSCCQ -----------1111-----------------3333- >MORTALITY FACTOR 4-LIKE P; SWP:Q9UBU8; PDB:2F5JA; VKVKIPEELKPWLVDDWDLITRQKQLFYLPAKKNVDSILEDYANYKKSYAVNEVVAGIKE -----3333--------------------------------------------------- YFNVMLGTQLLYKFERPQYAEILADHPDAPMSQVYGAPHLLRLFVRIGAMLAYTPLDEKS -----------3333--------------3333-------3333------------3333 
LALLLNYLHDFLKYLAKNSATLFSASDYEVAPPEYHR -----------------------3333----3333-- >MORF-RELATED GENE 15 ISOF; SWP:Q4R7Y9; PDB:2F5KA; DPKPKFQEGERVLCFHGPLLYEAKCVKVAIKDKQVKYFIHYSGWNKNWDEWVPESRVLKY ------2222--------------------iiii------22223333----3333---- VDTNLQKQRELQKANQEQYAEGK ----------------------- >Putative uncharacterized ; SWP:Q7LYW4; PDB:2F5TX; AIWRSRSFDEAIEMFRESLYSAKNEVIVVTPSEFFETIREDLIKTLERGVTVSLYIDKIP ------------------1111--------3333-1111------1111----------- DLSEFKGKGNFFVRQFYKLNHLIGMTDGKEVVTIQNATFDSIGPPSFKSTYPEIIFSQYS -3333--------------------%%%%------3333--------------------- LIIEIFKESTLEKEIIGNPKDIRFFAMFHAVDFVKNHLKNRNIYAEITGKNLESGRLETL -----1111--------1111---------------3333-------------------- TGRVVGYTLSLREAVNNIHLETENGVVKVGGMFAVIEDYESTEIKFIMGGSRS ----------1111-------1111-----2222-------------2222-- >VIRION PROTEIN UL25; SWP:P10209; PDB:2F5UA; AEMEVQIVRNDPPLRYDTNLPVDLLHMVYAGRGATGSSGVVFGTWYRTIQDRTITDFPLT -------3333-----------------------!!!!---------------------1 TRSADFRDGRMSKTFMTALVLSLQACGRLYVGQRHYSAFECAVLCLYLLYRNTHGRAPVT 111---%%%%------------1111---------------------------------3 FGDLLGRLPRYLACLAAVIGTEGGRPQYRYRDDKLPKTQFAAGGGRYEHGALASHIVIAT 333---------------------------3333-------------22221111----- LMHHGVLPAAPGDVPVAHHDDINRAAAAFLSRGHNLFLWEDQTLLRATANTITALGVIQR -1111-----------1111----------------1111-------------------- LLANGNVYADRLNNRLQLGMLIPGAVSGSDSGAIKSGDNNLEALCANYVLPLYRADPAVE ---------1111---3333-----------------------------------11113 LTQLFPGLAALCLDAQAGRRRVVDMSSGARQAALVRLTALELINRTPTPVGEVIHAHDAL 333---------3333---------3333-------------------3333-------- AIQYEQGLGLLAQQARIGLGSNTKRFSAFNVSSDYDMLYFLCLGFIPQYL -----------------3333-----1111-------------------- >PYRANOSE 2-OXIDASE; SWP:Q8J136; PDB:2F5VA; MDIKYDVVIVGSGPIGCTYARELVGAGYKVAMFDIGEIDSGLKIGAHKKNTVEYQKNIDK ------------3333-------1111---------------222211113333--3333 FVNVIQGQLMSVSVPVNTLVVDTLSPTSWQASTFFVRNGSNPEQDPLRNLSGQAVTRVVG ------------------------3333--------iiii33331111-1111----222 
GMSTHWTCATPRFDREQRPLLVKDDADADDAEWDRLYTKAESYFQTGTDQFKESIRHNLV 21111--------3333-------------------------------1111-------- LNKLAEEYKGQRDFQQIPLAATRRSPTFVEWSSANTVFDLQNRPNTDAPEERFNLFPAVA -------iiii---------------------3333--------3333------------ CERVVRNALNSEIESLHIHDLISGDRFEIKADVYVLTAGAVHNTQLLVNSGFGQLGRPNP ------1111----------------------------3333-----1111-------33 TNPPELLPSLGSYITEQSLVFCQTVMSTELIDSVKSDMTIRGTPGELTYSVTYTPGASTN 33----1111----------------------1111------2222-------2222--- KHPDWWNEKVKNHMMQHQEDPLPIPFEDPEPQVTTLFQPSHPWHTQIHRDAFSYGAVQQS ----------------1111----1111---------3333------------------- IDSRLIVDWRFFGRTEPKEENKLWFSDKITDAYNMPQPTFDFRFPAGRTSKEAEDMMTDM -3333------------3333---------1111-------------------------- CVMSAKIGGFLPGSLPQFMEPGLVLHLGGTHRMGFDEKEDNCCVNTDSRVFGFKNLFLGG ----------2222-----2222------------3333-----1111-2222------- CGNIPTAYGANPTLTAMSLAIKSCEYIKQNFTPSPFT --------------------------1111------- >BUGD; SWP:Q7WGE2; PDB:2F5XA; YPERPVNVVPFAAGGPTDNVARSLAESRPTLGETVVVENKGGAGGTIGTTQVARAQPDGY -----------2222--------------------------%%%%------1111----- SILLHAGFSTAPSLYKNPGYEPYTSFEPIGLVVDVPTIIARGDFPPNNIKELAEYVKKNA -----33333333-------1111----------------1111--------------33 DKISLANAGIGAASHLCGTLVEALGVNLLTIPYKGTAPANDLLGKQVDLCDQTTNTTQQI 33------22223333-----1111----------3333--1111------333333331 TSGKVKAYAVTSLKRVPTLPDLPTDESGYKGFEVGIWHGWAPKGTPKPVVDKLVKSLQAG 111------------1111-----11112222---------2222--------------- LADPKFQERKQLGAEVLTNEANPEALQAKVKQQVPQWAELFKKAGVEKQ ---------1111---1111---------------------1111---- >REGULATOR OF G-PROTEIN SI; SWP:P49796; PDB:2F5YA; MRYRQITIPRGKDGFGFTICCDSPVRVQAVDSGGPAERAGLQQLDTVLQLNERPVEHWKC ------------------------------2222---------------!!!!------- VELAHEIRSCPSEIILLVWRMV ------1111------------ >Pyruvate dehydrogenase pr; SWP:O00330; PDB:2F60K; EHIPGTLRFRLSPAARNILEKHSLDASQGTATGPRGIFTKEDALKLVQLKQTGKILEHHH --22221111--------------1111-----iiii----------------------- >NUCLEOSIDE 2-DEOXYRIBOSYL; SWP:Q57VC7_9TRYP; PDB:2F62A; 
HHHHHHRKIYIAGPAVFNPDGASYYNKVRELLKKENVPLIPTDNEATEALDIRQKNIQIK -------------3333---------------1111---1111---------------11 DCDAVIADLSPFRGHEPDCGTAFEVGCAAALNKVLTFTSDRRNREKYGSGVDKDNLRVEG 11--------------------------1111------------1111---1111----- FGLPFNLLYDGVEVFDSFESAFKYFLANFPS ------------------------------- >Protein SRN2; SWP:Q99176; PDB:2F66C; SKKYGDIALKKKLEQNTKKLDEESSQLETTTRSIDSADDLDQFIKNYLDIRTQYHLRREK --3333------------------------------------------------------ LATWD 1111- >COLLAGEN ADHESIN; SWP:NA; PDB:2F68X; HGSARDISSTNVTDLTVSPSKIEDGGKTTVKMTFDDKNGKIQNGDMIKVAWPTSGTVKIE ----------------------2222---------1111--2222--------------- GYSKTVPLTVKGEQVGQAVITPDGATITFNDKVEKLSDVSGFAEFEVQGRNLTQTNTSDD ---------iiii-------1111-----1111-----------------1111---333 KVATITSGNKSTNVTVHKSSSVFYYKTGDMLPEDTTHVRWFLNINNEKSYVSKDITIKDQ 3-----!!!!--------------------3333----------1111------------ IQGGQQLDLSTLNINVTGTHSNYYSGQSAITDFEKAFPGSKITVDNTKNTIDVTIPQGYG -------3333----------------3333-----2222-----1111------3333- SYNSFSINYKTKITNEQQKEFVNNSQAWYQEHGKEEVNGKSFNHTVHNINANAGIEGTVK --------------1111------------------------------------------ >TAF10 peptide, Acetyl-Ser; SWP:Q8WTS6; PDB:2F69A; HGVCWIYYPDGGSLVGEVNEDGEMTGEKIAYVYPDERTALYGKFIDGEMIEGKLATLMST -------1111-------1111----------1111--------iiii------------ EEGRPHFELMPGNSVYHFDKSTSSCISTNALLPDPYESERVYVAESLISSAGEGLFSKVA iiii------------------------1111--3333---------2222--------- VGPNTVMSFYNGVRITHQEVDSRDWALNGNTLSLDEETVIDVPEPYNHVSKYCASLGHKA -------------------11113333-------1111---------3333----1111- NHSFTPNCIYDMFVHPRFGPIKCIRTLRAVEADEELTVAYGYDHSPPEAPEWYQVELKAF ------------------------------2222-------------------------- QATQ ---- >TOXIN A; SWP:P16154; PDB:2F6EA; YYFEPNTAIGANGYKIIDNKNFYFRNGLPQIGVFKGPNGFEYFAPANTDANNIEGQAIRY ----------------%%%%----iiii----------------2222%%%%2222---- QNRFLHLLGNIYYFGNNSKAVTGWQTINGNMYYFMPDTAMAAAGGLFEIDGVIYFFGVDG ------iiii----1111--------iiii------------------iiii----1111 VKAPG ----- >Myosin-2; SWP:P19524; PDB:2F6HX; 
NATQINEELYRLLEDTEILNQEITEGLLKGFEVPDAGVAIQLSKRDVVYPARILIIVLSE 3333-----------------------1111-----------3333-------------- MWRFGLTKQSESFLAQVLTTIQKVVTQLKGNDLIPSGVFWLANVRELYSFVVFALNSILT -1111-------------------1111-1111--------------------------- EETMTDEEYKEYVSLVTELKDDFEALSYNIYNIWLKKLQKQLQKKAINAVVISESLPGFS ----3333-----------------------------------------------2222- AEYTMDDILTFFNSIYWCMKSFHIENEVFHAVVTTLLNYVDAICFNELIMKRNFLSWKRG ---3333---------------------------------------3333---------- LQLNYNVTRLEEWCKTHGLTDGTECLQHLIQTAKLLQVRKYTIEDIDILRGICYSLTPAQ --------------111111113333--------------------------1111---- LQKLISQYQVADYESPIPQEILRYVADIVKKEAALSSSGSIFITPETGPFTDPFSLIKTR ----------2222---3333------------------------------3333----- KFDQVEAYIPAWLSLPSTKRIVDLVAQQVV ---------3333-------------3333 >ATP-DEPENDENT CLP PROTEAS; SWP:O97252; PDB:2F6IA; HMDIKDMKKDVKLFFFKKRIIYLTDEINKKTADELISQLLYLDNINHNDIKIYINSPGGS -----------------------------------------3333--------------- INEGLAILDIFNYIKSDIQTISFGLVASMASVILASGKKGKRKSLPNCRIMIHQPLGNAF ---------------------------------11112222---1111-----1111--- QTKEILYLKKLLYHYLSSFTNQTVETIEKDSDRDYYMNALEAKQYGIIDEVIETKLPHPY ------------------------------1111-----------------------111 FN 1- >METAL-DEPENDENT HYDROLASE; SWP:Q88TJ2; PDB:2F6KA; SKIDFHTHYLPTSYVEALKRHVPGDPDGWPTPEWTPQLTLNFRDNDISYSILSLSSPHVN ------------------------2222------3333---------------------- FGDKAETIRLVEAANDDGKSLAQQYPDQLGYLASLPIPYELDAVKTVQQALDQDGALGVT ------------------------------------------------------------ VPTNSRGLYFGSPVLERVYQELDARQAIVALHPNEPAILPKNVDIDLPVPLLGFFDTTTF ----iiii22221111------1111-----------------22223333--------- INLKYHFFEKYPNIKVIIPHAGAFLGIVDDRIAQYAQKVYQVDVYDVHHVYFDVAGAVLP --111133331111----%%%%-33333333-----------3333-------------- RQLPTLSLAQPEHLLYGSDIPYTPLDGSRQLGHALATTDLLTNEQKQAIFYDNAHRLLTE ---------1111----------------------------------------------- ------------------------------------------------------------ ----- >Vacuolar protein sorting-; SWP:Q02767; PDB:2F6MB; 
MDISQLFHDEVPLFDNSITSKDKEVIETLSEIYSIVITLDHVEKAYLKDSIDDTQYTNTV -3333---------1111---------------------------1111--3333----- DKLLKQFKVYLNSQNKEEINKHFQSIEAFADTYNITASNAITRLERG ------------iiii-3333-------------------------- >PEROXISOMAL 3,2-TRANS-ENO; SWP:O75521; PDB:2F6QA; GFETLVVTEDGITKIFNRPKKKNAINTEYHEIRALKAASKDDSIITVLTGNGDYYSSGND --------iiii-----3333---------------3333-------------------- LDIPPGGVEEKAKNNAVLLREFVGCFIDFPKPLIAVVNGPAVGISVTLLGLFDAVYASDR ---1111-----------------------------------3333--1111-----111 ATFHTPFSHLGQSPEGCSSYTFPKISPAKATELIFGKKLTAGEACAQGLVTEVFPDSTFQ 1----3333-----%%%%---3333-3333--------------1111------1111-- KEVWTRLKAFAKLPPNALRISKEVIRKREREKLHAVNAEECNVLQGRWLSDECTN ---------1111------------1111---------------------3333- >BIFUNCTIONAL COENZYME A S; SWP:Q9DBL7; PDB:2F6RA; ALYQIQLLKDQRILGNLLQPPNERPELPSGLYVLGLTGISGSGKSSVAQRLKNLGAYIID ------------2222-------11111111-------2222---------1111----- SDHLGHRAYAPGGPAYQPVVEAFGTDILHKDGTINRKVLGSRVFGNKKQMKILTDIVWPV ---------2222----------3333-1111----------2222-------------- IAKLAREEMDVAVAKGKTLCVIDAAMLLEAGWQSMVHEVWTVVIPETEAVRRIVERDGLS ------------1111-------111111113333-------------------1111-- EAAAQSRLQSQMSGQQLVEQSNVVLSTLWESHVTQSQVEKAWNLLQKRLP -----------------1111--------3333------------1111- >CELL FILAMENTATION PROTEI; SWP:NA; PDB:2F6SA; HHLDRQSLEKAKHLIQSGLIDTIEVGTIKGLQEIHRFLFEGLYEFAGKIRDKNIAKGNFR --------------33333333----------------22221111---------!!!!- FANCLYLDLILPRIESPQNNFNQIVEKYVENIAHPFLEGNGRATRIWLDLLLKKELKKIV --3333------------------------3333-------------------------- LWDRIDKAAYLSAERSPVNDLEIKTLLKKHLSSNTNDPLTLIKGITQSYYYEGLG 3333---------------------------------------------1111-- >(S)-3-O-Geranylgeranylgly; SWP:O29844; PDB:2F6UA; MRWRKWRHITKLDPDRTNTDEIIKAVADSGTDAVMISGTQNVTYEKARTLIEKVSQYGLP -1111-------1111----------------------1111-----------1111--- IVVEPSDPSNVVYDVDYLFVPTVLNSADGDWITGKHAQWVRMHYENLQKFTEIIESEFIQ ----------------------1111--1111----------3333-------------- 
IEGYIVLNPDSAVARVTKALCNIDKELAASYALVGEKLFNLPIIYIEYSGTYGNPELVAE -------1111------------------------------------------------- VKKVLDKARLFYGGGIDSREKAREMLRYADTIIVGNVIYEKGIDAFLETLP -1111------------------------------3333------------ >FATTY ACID-BINDING PROTEI; SWP:P07148; PDB:2F73A; SMSFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVI ----------------------------3333------------!!!!------!!!!-- QNEFTVGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNTMTLGDIV ----2222---------------------------iiii------!!!!------!!!!- FKRISKRI -------- >Gag polyprotein; SWP:P04022; PDB:2F76X; MGQELSQHERYVEQLKQALKTRGVKVKYADLLKFFDFVKDTCPWFPQEGTIDIKRWRRVG -------------------1111---------------------------3333------ DCFQDYYNTFGPEKVPVTAFSYWNLIKELIDKKEVNPQVM ---------------------------------------- >HTH-TYPE TRANSCRIPTIONAL ; SWP:O68014; PDB:2F7AA; RIASVEKTIRIGFVGSLLFGLLPRIIHLYRQAHPNLRIELYEMGTKAQTEALKEGRIDAG ---3333---------1111------------1111-------3333------------- FGRLKISDPAIKRTLLRNERLMVAVHASHPLNQMKDKGVHLNDLIDEKILLYPSSPKPNF -------1111--------------1111----------33331111------------- STHVMNIFSDHGLEPTKINEVREVQLALGLVAAGEGISLVPASTQSIQLFNLSYVPLLDP --------1111------------------1111------3333----2222------11 DAITPIYIAVRNMEESTYIYSLYETIRQIYAYEGFTEPPNWLEHHHHH 11-----------------------------------2222------- >HTH-TYPE TRANSCRIPTIONAL ; SWP:P07774; PDB:2F7BA; QTLRIGYVSSLLYGLLPEIIYLFRQQNPEIHIELIECGTKDQINALKQGKIDLGFGRLKI -------3333---------------3333-------3333------------------- TDPAIRRIVLHKEQLKLAIHKHHHPNQFAATGVHLSQIIDEPMLLYPVSQKPNFATFIQS -1111--------------1111-----3333-33331111------------------- LFTELGLVPSKLTEIREIQLALGLVAAGEGVCIVPASAMDIGVKNLLYIPILDDDAYSPI -3333-------------------1111------3333--------------1111---- SLAVRNMDHSNYIPKILACVQEVFATHHIRPLIE ----1111----------------1111------ >NICOTINATE PHOSPHORIBOSYL; SWP:Q830Y8; PDB:2F7FA; TYADDSLTLHTDMYQINMMQTYWELGRADLHAVFECYFREMPFNHGYAIFAGLERLVNYL ------1111-3333-------11111111----------2222---------------1 
ENLTFTESDIAYLREVEEYPEDFLTYLANFEFKCTVRSALEGDLVFNNEPLIQIEGPLAQ 111------------------------------------2222----------------- CQLVETALLNMVNFQTLIATKAARIKSVIGDDPLLEFGTRRAQELDAAIWGTRAAYIGGA -------------------------------------3333------------------- DATSNVRAGKIFGIPVSGTHAHSLVQSYGNDYEAFMAYAKTHRDCVFLVDTYDTLKAGVP --------------------33333333---------3333------------------- SAIRVAREMGDKINFLGVRIDSGDMAYISKRVREQLDEAGFTEAKIYASNDLDENTILNL --------!!!!-----------3333---------11111111--------3333---- KMQKSKIDVWGVGTKLITAYDQPALGAVFKLVSIEGEDGQMKDTIKLSSNAEKVTTPGKK ------------3333--1111-------------1111----------1111------- QVWRITRKSDKKSEGDYVTLWNEDPRQEEEIYMFHPVHTFINKYVRDFEARPVLQDIFVE -------------------11113333----------3333-----------------ii GKRVYELPTLDEIKQYAKENLDSLHEEYKRDLNPQKYPVDLSTDCWNHKMNLLEKVRKDV ii----------------------3333-----------------------------111 KH 1- >455AA LONG HYPOTHETICAL P; SWP:Q976E4; PDB:2F7LA; MGKLFGTDGVRGIVNKELTPELVLKLSKAIGTFFGKNSKILVGRDVRAGGDMLVKIVEGG ----------------------------------------------1111---------- LLSVGVEVYDGGMAPTPALQYAVKTLGYDGGVVITASHNPAPYNGIKVVDKDGIEIRREK -----------------------------------!!!!1111------1111---3333 ENEIEDLFFTERFNTIEWSSLTTEVKREDRVISTYVNGILSHVDIEKIKKKNYKVLIDPA ----------------3333-------------------1111----3333-------%% NSVGALSTPLVARALGCKIYTINGNLDPLFSARQPEPTFDSLKETAEVVKTLKVDLGVAH %%---------------------------3333----3333--------1111------- DGDADRAIFIDSEGRVQWGDRSGTLLSYWASVKNPKAIKKIVTAVSSSSLVEEYLSKYNI 1111------1111---3333--------33333333------11113333--3333--- QVDWTKVGSVDIAHKVADENALAGFEENGGFMYPPHQYVRDGAMSFALMLELLANENVSS -------------------------1111---3333------------------------ AELFDRLPKYYLVKTKVDLKPGLMVEEIYKKILEVYSTSSVKAITIDGVKIIGKDFWFLV -----------------------3333--------------------------------- RKSGTEPIIRIMAEAKDENVANNLVNELKKIVEGK ----------------------------------- >RAS-RELATED PROTEIN RAB-2; SWP:O00194; PDB:2F7SA; DYDYLIKLLALGDSGVGKTTFLYRYTDNKFNPKFITTVGIDFREKRVVYNAGKAFKVHLQ 
------------------------------------------------------------ LWDTAGQERFRSLTTAFFRDAMGFLLMFDLTSQQSFLNVRNWMSQLQANAYCENPDIVLI ----------------------------1111-----------------3333------- GNKADLPDQREVNERQARELADKYGIPYFETSAATGQNVEKAVETLLDLIMKRMEQCVE --11111111-----------1111----------------------------3333-- >MOS1 TRANSPOSASE; SWP:O61446; PDB:2F7TA; WVPHELNERQMERRKNTCEILLSRYKRKSFLHRIVTGDEKWIFFVNPKKTMLCVWWDQSG -----------------------------1111-----------------------1111 VIYYELLKPGETVNAARYQQQLINLNRALQRKRPEYQKRQHRVIFLHDNAPSHTARAVRD --------------------------------3333-------------3333------- TLETLNWEVLPHAAYSPDLAPSDYHLFASMGHALAEQRFDSYESVKKWLDEWFAAKDDEF ---------------11113333----------1111---3333--------11113333 YWRGIHKLPERWEKCVASDGKYFE ---------------1111----- >AECTYLCITRULLINE DEACETYL; SWP:NA; PDB:2F7VA; HMTDLLASTLEHLETLVSFDTRNPPRAIAAEGGIFDYLRAQLPGFQVEVIDHGDGAVSLY -------------------------------------11112222-------iiii---- AVRGTPKYLFNVHLDTVPDSPHWSADPHVMRRTEDRVIGLGVCDIKGAAAALVAAANAGD -------------------------1111---------2222------------1111-- GDAAFLFSSDEEANDPRCIAAFLARGLPYDAVLVAEPTMSEAVLAHRGISSVLMRFAGRA ----------------------1111-----------%%%%------------------- GDPAASALHQAMRWGGKALDHVESLAHARFGGLTGLRFNIGRVDGGIKANMIAPAAELRF -1111-----------------1111---iiii--------------1111--------- GFRPLPSMDVDGLLATFAGFADPAAAHFEETFRGPSLPSGDIARAEERRLAARDVADALD ----2222------------------------------------------------1111 LPIGNAVDFWTEASLFSAGGYTALVYGPGDIAQAHTADEFVTLAQLQRYVESVNRIINGS -----------3333-1111---------3333--------------------------- >MOLYBDENUM COFACTOR BIOSY; SWP:Q8EKM7; PDB:2F7WA; SKAKIGIVTVSDRASAGIYEDISGKAIIDTLNDYLTSEWEPIYQVIPDEQDVIETTLIKA ------------------------------------------------------------ DEQDCCLIVTTGGTGPAKRDVTPEATEAVCDRPGFGELRAESLKFVPTAILSRQTAGLRG ----------------3333------------3333---------1111---------!! 
DSLIVNLPGKPKSIRECLDAVFPAIPYCIDLEGPYLECNEAVIKPFRP !!------------------3333--------------3333------ >HMG-COA SYNTHASE; SWP:Q9M6U3; PDB:2F82A; AKNVGILAMDIYFPPTCVQQEALEAHDGASKGKYTIGLGQDCLAFCTELEDVISMSFNAV -----------------------------2222--1111-------1111---------- TSLLEKYKIDPKQIGRLEVGSETVIDKSKSIKTFLMQLFEKCGNTDVEGVDSTNACYGGT ----1111-3333----------------3333-33333333-----------!!!!--- AALLNCVNWVESNSWDGRYGLVICTDSAVYAEGPARPTGGAAAIAMLIGPDAPIVFESKL -----------1111-----------------1111--------------------1111 RGSHMAHVYDFYKPNLASEYPVVDGKLSQTCYLMALDSCYKHLCNKFEKLEGKEFSINDA --------------1111-------------------------------------3333- DYFVFHSPYNKLVQKSFARLLYNDFLRNASSIDEAAKEKFTPYSSLSLDESYQSRDLEKV --------3333------------11113333-------3333---3333---------- SQQLAKTYYDAKVQPTTLVPKQVGNMYTASLYAAFASLVHNKHSDLAGKRVVMFSYGSGS ------------1111---------!!!!------------11112222----------- TATMFSLRLCENQSPFSLSNIASVMDVGGKLKARHEYAPEKFVETMKLMEHRYGAKEFVT ---------------------------------------------------2222----- SKEGILDLLAPGTYYLKEVDSLYRRFYGKK -222233332222----------------- >OROTIDINE MONOPHOSPHATE D; SWP:Q8IJH3; PDB:2F84A; MGFKVKLEKRRNAINTCLCIGLDPDEKDIENFMKNEKENNYNNIKKNLKEKYINNVSIKK -------------------------------------------------3333-----11 DILLKAPDNIIREEKSEEFFYFFNHFCFYIINETNKYALTFKMNFAFYIPYGSVGIDVLK 11---3333----1111----------------3333------3333-11113333---- NVFDYLYELNIPTILDMKINDIGNTVKNYRKFIFEYLKSDSCTVNIYMGTNMLKDICYDE ---------------------3333--------------------1111--3333----1 EKNKYYSAFVLVKTTNPDSAIFQKNLSLDNKQAYVIMAQEALNMSSYLNLEQNNEFIGFV 111-------------1111--3333------------------------1111------ VGANSYDEMNYIRTYFPNCYILSPGIGAQNGDLHKTLTNGYHKSYEKILINIGRAITKNP -1111----------1111-------1111-------------3333-----3333---- YPQKAAQMYYDQINAILKQNMES -------------------1111 >Calcium/calmodulin depend; SWP:Q9U6Q0; PDB:2F86B; NDSEKAQKQDIVRVTQTLLDAISCKDFETYTRLCDTSMTCFEPEALGNLIEGIEFHRFYF ----------------------------------1111---3333--------3333--- DGNRKNQVHTTMLNPNVHIIGEDAACVAYVKLTQFLDRNGEAHTRQSQESRVWSKKQGRW 
------------------------------------------------------------ VCVHVHRST --------- >GLUTATHIONE PEROXIDASE 1; SWP:P07203; PDB:2F8AA; MQSVYAFSARPLAGGEPVSLGSLRGKVLLIENVASLGGTTVRDYTQMNELQRRLGPRGLV ----------1111------1111------------1111-------------3333--- VLGFPCNQFGHQENAKNEEILNSLKYVRPGGGFEPNFMLFEKCEVNGAGAHPLFAFLREA --------%%%%---1111----------iiii-------------1111---------- LPAPSDDATALMTDPKLITWSPVCRNDVAWNFEKFLVGPDGVPLRRYSRRFQTIDIEPDI ---1111------3333------1111----------1111------11113333----- EALLS ----- >HYPOTHETICAL PROTEIN LMO1; SWP:Q71Z85; PDB:2F8LA; ANEATQELFQVLDNTAIILQNELEISYLEAVYETGENLFQKEVLQKEEKQLKLQASYESI ------------------------------------------------------------ ELENFSNEEIRKGLQLALLKGKHGIQVNHQTPDSIGFIVAYLLEKVIQKKKNVSILDPAC 3333---------------------3333--3333------------------------! GTANLLTTVINQLELKGDVDVHASGVDVDDLLISLALVGADLQRQKTLLHQDGLANLLVD !!!----------3333----------------------------------1111----- PVDVVISDLPVGYYPDDENAKTFELCREEGHSFAHFLFIEQGRYTKPGGYLFFLVPDAFG ---------------3333---1111-------------------2222----------- TSDFAKVDKFIKKNGHIEGIIKLPETLFKSQARKSILILEKADVDVKPPKEVLLANLSSL 3333-------------------3333--------------------------------- TDPSVTAPILAEIENWFK -3333---------1111 >RIBOSE 5-PHOSPHATE ISOMER; SWP:NA; PDB:2F8MA; LKKIVAYKAVDEYVQSNMTIGLGTGSTVFYVLERIDNLLKSGKLKDVVCIPTSIDTELKA ------------------------3333-------------------------------- RKLGIPLTTLEKHSNIDITIDGTDEIDLNLNLIKGRGGALVREKLVASSSSLLIIIGDES ----------3333------------1111----1111--------1111-------333 KLCTNGLGMTGAVPIEILTFGYEKIIENLLKIYTLKGCTYKIRKRNGEIFITDNKNYIVD 3-----------------2222---------1111---------%%%%---1111----- FFFTEPIQDLLETCTRIKMTTGVVDHGIFVNMTNVALISKHDGTVLTLNK -------------------2222----------------1111------- >ALKALINE THERMOSTABLE END; SWP:O30700; PDB:2F8QA; QPFAWQVASLADRYEESFDIGAAVEPHQLNGRQGKVLKHHYNSIVAENAMKPISLQPEEG --1111-------1111-------1111----------------------3333---222 VFTWDGADAIVEFARKNNMNLRFHTLVWHNQVPDWFFLDEEGNPMVEETNEAKRQANKEL 
2-------------------------------3333--1111-3333------------- LLERLETHIKTVVERYKDDVTAWDVVNEVVDDGTPNERGLRESVWYQITGDEYIRVAFET ---------------1111----------------1111---------!!!!-------- ARKYAGEDAKLFINDYNTEVTPKRDHLYNLVQDLLADGVPIDGVGHQAHIQIDWPTIDEI -----1111-------1111--------------1111------------1111------ RTSMEMFAGLGLDNQVTELDVSLYGWPPRPAFPTYDAIPQERFQAQADRYNQLFELYEEL -------1111----------------------3333--------------------111 DADLSSVTFWGIADNHTWLDDRAREYNDGVGKDAPFVFDPNYRVKPAFWRIID 1-----------3333----------iiii--------1111--33331111- >Recombining binding prote; SWP:Q06330; PDB:2F8XC; RPPPKRLTREAMRNYLKERGDQTVLILHAKVAQKSYGNEKRFFCPPPCVYLMGSGWKKKK ------------------------------------------------------------ EQMERDGCSEQESQPCAFIGIGNSDQEMQQLNLEGKNYCTAKTLYISDSDKRKHFMLSVK --------1111----------------------------------3333---------- MFYGNSDDIGVFLSKRIKVISKPSKKKQSLKNADLCIASGTKVALFNRLRSQTVSTRYLH ------------------------------------------------%%%%-------- VEGGNFHASSQQWGAFFIHLLDDDESEGEEFTVRDGYIHYGQTVKLVCSVTGMALPRLII -iiii------------------------------------------------------- RKVDKQTALLDADDPVSQLHKCAFYLKDTERMYLCLSQERIIQFQATPCPKEPNKEMIND ---%%%%----------------------------------------------------- GASWTIISTDKAEYTFYEGMGPVLAPVTPVPVVESLQLNGGGDVAMLELTGQNFTPNLRV ----------------------------------------!!!!---------------- WFGDVEAETMYRCGESMLCVVPDISAFREGWRWVRQPVQVPVTLVRNDGIIYSTSLTFTY -!!!!-------1111------3333-------------------1111----------- TPEP ---- ------------------------------------------------------- >Notch homolog 1, transloc; SWP:P46531; PDB:2F8YA; AVISDFIYQGASLHNQTDRTGETALHLAARYSRSDAAKRLLEASADANIQDNMGRTPLHA ---iiii----1111------------------------------1111-1111------ AVSADAQGVFQILIRNRATDLDARMHDGTTPLILAARLAVEGMLEDLINSHADVNAVDDL ---------------33331111-1111-------------------1111-1111-111 GKSALHWAAAVNNVDAAVVLLKNGANKDMQNNREETPLFLAAREGSYETAKVLLDHFANR 1------------------------1111-1111-------------------1111--- DITDHMDRLPRDIAQERMHHDIVRLLDEYNLVRSP ---1111-3333----------------------- 
>HEPATOPANCREAS TRYPSIN; SWP:Q52V24; PDB:2F91A; IVGGTDATLGEFPYQLSFQETFIIGGFFSFHFCGASIYNENYAITAGHCVYGGDDDDYYE -------22221111--------------------------------------------- ENNPSGLQIVAGELDMSVNEGSEQIITVSKIILHENFDYNLLDNDISLLKLSGSLTFNDN ------------------------------------------------------------ VAPIALPEQGHTATGDVIVTGWGTTSEGGNTPDVLQKVTVPLVSDEDCRADYGAADEILD ------------------------------------------------------------ SMICAGVPEEGGGKDSCQG ------------------- >Farnesyl pyrophosphate sy; SWP:P14324; PDB:2F94F; DVYAQEKQDFVQHFSQIVRVLTEDEMGHPEIGDAIARLKEVLEYNAIGGKYNRGLTVVVA 3333-----------------3333--3333----------------------------- FRELVEPRKQDADSLQRAWTVGWCVELLQAFFLVADDIMDSSLTRRGQICWYQKPGVGLD -----1111-----------------------------------iiii-33332222--- AINDANLLEACIYRLLKLYCREQPYYLNLIELFLQSSYQTEIGQTLDLLTAPQGNVDLVR -------------------1111----------------------------2222-3333 FTEKRYKSIVKYKTAFYSFYLPIAAAMYMAGIDGEKEHANAKKILLEMGEFFQIQDDYLD ---------------------------1111----------------------------- LFGDPSVTGKIGTDIQDNKCSWLVVQCLQRATPEQYQILKENYGQKEAEKVARVKALYEE ---3333-----3333-----------1111----------------------------- LDLPAVFLQYEEDSYSHIMALIEQYAAPLPPAVFLGLARKIY -----------------------------3333--------- >RIBONUCLEASE T; SWP:Q9HY82; PDB:2F96A; RHPARRFRGYLPVVVDVETGGFNSATDALLEIAATTVGDEKGFLFPEHTYFFRIEPFEGA ------iiii------------3333------------1111--------------2222 NIEPAALEFTGIKLDHPLRAVQEEAALTEIFRGIRKALKANGCKRAILVGHNSSFDLGFL -------3333-1111----------------------1111------------------ NAAVARTGIKRNPFHPFSSFDTATLAGLAYGQTVLAKACQAAGEFDNREAHSARYDTEKT ------------------------------------------------------------ AELFCGIVNRWKEGGW ---------------- >AKLANONIC ACID METHYL EST; SWP:O52646; PDB:2F99A; RSEQIAAVRRMVEAYNTGKTDDVADYIHPEYMNPGTLEFTSLRGPELFAINVAWVKKTFS -------------------1111----1111-33331111-------------------1 EEARLEEVGIEERADWVRARLVLYGRHVGEMVGMAPTGRLFSGEQIHLLHFVDGKIHHHR 111---------!!!!--------------iiii-----------------%%%%----- DWPDYQGTYRQLGEPWPETEH ---------1111-------- 
------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ -------------------- >PRE-MRNA BRANCH SITE PROT; SWP:Q9Y3B4; PDB:2F9DA; RLPPEVNRILYIRNLPYKITAEEMYDIFGKYGPIRQIRVGNTPETRGTAYVVYEDIFDAK --3333---------1111---------1111---------1111--------------- NACDHLSGFNVCNRYLVVLYYNANRAFQKMDTKKKEEQLKLLKEKYGINTDPPK -----2222------------3333-11113333-------------------- ------------------------------------- >FIRST MANNOSYL TRANSFERAS; SWP:O30192; PDB:2F9FA; VETSKFKFKCYGDFWLSVNRIYPEKRIELQLEVFKKLQDEKLYIVGWFSKGDHAERYARK --1111---------------3333-----------1111--------2222-------- IKIAPDNVKFLGSVSEEELIDLYSRCKGLLCTAKDEDFGLTPIEAASGKPVIAVNEGGFK ----1111---------------------------------------------------- ETVINEKTGYLVNADVNEIIDAKKVSKNPDKFKKDCFRRAKEF ---2222----------------33331111-------3333- >PTS SYSTEM, IIA COMPONENT; SWP:Q831B0; PDB:2F9HA; WKQATVTEIGKHAIDDSEKIILFGETATDTLKQHAVIQSFPEKDQVTLAEGDHLKIGDTN ---------1111-1111-----111133331111----1111-----2222---!!!!- YTITKVGSFANSNLQSIAHSTLIFADAPTDEDDVIRNGVYLTPHQLPKITIGTTIDYLV -----------------------------3333-1111-----------2222------ >acetyl-coenzyme A carboxy; SWP:Q7A557; PDB:2F9IA; MLDFEKPLFEIRNKIESLQEEIDMLEASLERETKKIYTNLKPWDRVQIALERPTTLDYIP -1111--------1111-3333-------------1111-3333-----111-3333333 YIFDSFMELHGDRNFRDDPAMIGGIGFLNGRAVTVIGQQRGKDTKDNIYRNFGMAHPEGY 3----------------1111------iiii------------------%%%%--3333- RKALRLMKQAEKFNRPIFTFIDTKGAYPGKAAEERGQSESIATNLIEMASLKVPVIAIVI -----------------------------------------------1111--------- GEGGSGGALGIGIANKVLMLENSTYSVISPEGAAALLWKDSNLAKIAAETMKITAHDIKQ ----3333-----------1111----------------3333---------------11 LGIIDDVISEPLGGAHKDIEQQALAIKSAFVAQLDSLESLSRDEIANDRFEKFRNIGSYI 11--------22223333----------------1111--------------1111---- E - >Acetyl-CoA 
carboxylase tr; SWP:Q7A557; PDB:2F9IB; PAGIMTKCPKCKKIMYTKELAENLNVCFNCDHHIALTAYKRIEAISDEGSFTEFDKGMTS --------------------1111------------------11112222----1111-- ANPLDFPSYLEKIEKDQQKTGLKEAVVTGTAQLDGMKFGVAVMDSRFRMGSMGSVIGEKI -11112222-----------------------iiii-------3333%%%%--------- CRIIDYCTENRLPFILFSASGGARMQEGIISLMQMGKTSVSLKRHSDAGLLYISYLTHPT -----------------------3333------------------1111----------- TGGVSASFASVGDINLSEPKALIGFAGRRVIEQTINEKLPDDFQTAEFLLEHGQLDKVVH --33333333-------2222------------------1111------1111------3 RNDMRQTLSEILKIHQEVTK 333----------------- >RAB11B, MEMBER RAS ONCOGE; SWP:Q15907; PDB:2F9LA; MYDYLFKVVLIGDSGVGKSNLLSRFTRNEFNLSTIGVEFATRSIQVDGKTIKAQIWDTAG ------------2222-----------------------------iiii----------- QERYRRITSAYYRGAVGALLVYDIAKHLTYENVERWLKELRDHADSNIVIMLVGNKSDLR -1111--33332222-------1111------------------1111-------33331 HLRAVPTDEARAFAEKNNLSFIETSALDSTNVEEAFKNILTEIYRIVSQKQIA 111--3333-----1111-------1111----------------3333---- >CYTOCHROME P450 2D6; SWP:P10635; PDB:2F9QA; PPGPLPLPQNTPYCFDQLRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPP ----------3333--------------3333----------------1111-------- VPITQILGFGPRSQGVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAA ---------------1111----------------------------------------- FANHSGRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREV ---iiii-------------------------1111------------------------ LNAVPVDRHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAK -----------3333-------------------3333------------------1111 GNPESSFNDENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVR -------3333------------------------------------------------- RPEMGDQAHMPYTTAVIHEVQRFGDIVPLGMTHMTSRDIEVQGFRIPKGTTLITNLSSVL --33331111----------------1111----------iiii----------333311 KDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQH 11-----1111-3333--1111----33331111-11111111----------------- FSFSVPTGQPRPSHHGVFAFLVSPSPYELCAVPR -----3333------------------------- >THIOL-DISULFIDE OXIDOREDU; SWP:P35160; PDB:2F9SA; 
EGSDAPNFVLEDTNGKRIELSDLKGKGVFLNFWGTWCEPCKKEFPYANQYKHFKSQGVEI -----------1111---33332222-------1111-----------33331111---- VAVNVGESKIAVHNFKSYGVNFPVVLDTDRQVLDAYDVSPLPTTFLINPEGKVVKVVTGT ---------------1111-------1111---1111----------1111--------- TESIHDYNLIKPG -------1111-- >PANTOTHENATE KINASE; SWP:Q9HWC1; PDB:2F9WA; SILELDCGNSLIKWRVIEGAARSVAGGLAESDDALVEQLTSQQALPVRACRLVSVRSEQE -----------------------------------------1111--------------- TSQLVARLEQLFPVSALVASSGKQLAGVRNGYLDYQRLGLDRWLALVAAHHLAKKACLVI ------------------------iiii-----1111----------------------- DLGTAVTSDLVAADGVHLGGYICPGTLRSQLRTHTRRIRYDDAEARRALASLQPGQATAE -----------1111------------3333----------------------------- AVERGCLLLRGFVREQYAACELLGPDCEIFLTGGDAELVRDELAGARIPDLVFVGLALAC --------------------------------1111--33331111---3333------- PIE --- >CHEMOTAXIS PROTEIN CHEC; SWP:NA; PDB:2F9ZC; HMKKVIGIGEYAVMKNPGVIVTLGLGSCVAVCMRDPVAKVGAMAHVMLPDSGGKTDKPGK ------2222----------------------------------------------3333 YADTAVKTLVEELKKMGAKVERLEAKIAGGASMFESKGMNIGARNVEAVKKHLKDFGIKL ------------------3333-----------------3333----------1111--- LAEDTGGNRARSVEYNIETGKLLVRKVLEIKEI ----------------1111------------- >PROBABLE TRANSCRIPTIONAL ; SWP:P16684; PDB:2FA1A; HMNAQARFSQNLLDQGSHPTSEKLLSVLRPASGHVADALGITEGENVIHLRTLRRVNGVA ----------1111---1111--------------------2222----------iiii- LCLIDHYFADLTLWPTLQRFDSGSLHDFLREQTGIALRRSQTRISARRAQAKECQRLEIP ---------1111--3333------------------------------!!!!-1111-2 NMSPLLCVRTLNHRDGESSPAEYSVSLTRADMIEFTMEH 222-------------------------1111------- >THIOREDOXIN II; SWP:TRX2_YEAST; PDB:2FA4A; MVTQLKSASEYDSALASGDKLVVVDFFATWCGPCKMIAPMIEKFAEQYSDAAFYKLDVDE ---------------------------1111----------------1111-----1111 VSDVAQKAEVSSMPTLIFYKGGKEVTRVVGANPAAIKQAIASNV -------------------iiii----------------1111- >TRANSCRIPTIONAL REGULATOR; SWP:NA; PDB:2FA5A; HPVLLNLEQFLPYRLSVLSNRISGNIAKVYGDRYGAIPEWRVITILALYPGSSASEVSDR -----3333---------------------------------------2222-------- 
TADKVAVSRAVARLLERGFIRRSLALSPAGRQVYETVAPLVNEEQRLSVFSAEEQQTLER ------------------------------------------------------------ LIDRLAKDGLPRA --------3333- >HYPOTHETICAL PROTEIN ATU0; SWP:Q8UIR5; PDB:2FA8A; TKPRIAIRYCTQCNWLLRAGWAQEILQTFASDIGEVSLIPSTGGLFEITVDGTIIWERKR ----------1111--------------1111---------%%%%----iiii---3333 DGGFPGPKELKQRIRDLIDPERDLG ------------------1111--- >PHOSPHOENOLPYRUVATE CARBO; SWP:P21642; PDB:2FAFA; LSTSLSALPAAARDFVEEAVRLCRPREVLLCDGSEEEGKELLRGLQDDGVLHPLPKYDNC ----3333---------------------------------------------3333--- WLARTDPRDVARVESKTVLVTPEQSDAVPPPPPSGGPPQLGNWMSPNAFQAAVQERFPGC -----1111---3333------3333-----1111---------------------2222 MAGRPLYVIPFSMGPPTSPLAKLGVQVTDSPYVVLSMRIMTRVGPAVLQRLDDDFVRCLH 2222----------1111-----------------------------1111--------- SVGRPLPLTEPLVSSWPCDPSRVLVAHIPSERRIVSFGSGYGGNSLLGKKCFALRIASRM ------------%%%%--3333-----3333----------33333333----------- AQQQGWLAEHMLILGVTSPSGEKRYMAAAFPSACGKTNLAMMTPSLPGWRIHCVGDDIAW -----------------1111-----------2222-3333----2222----------- MKFDDEGRLRAINPERGFFGVAPGTSSRTNPNAMATIARNTIFTNVGLRSDGGVYWDGLD ---1111--------------22223333-------------------1111---2222- EPTEPGVTYTSWLGKPWKHGDPEPCAHPNSRFCAPADQCPIMDPRWDDPEGVPIDAIIFG ---2222---1111---2222-----1111----111111111111-3333--------- GRRPRGVPLVVEAFGWRHGVFMGSAMRSLMHDPFAMRPFFGYNAGRYLEHWLSTGLRSNA -----------------------------------1111-----------------2222 RLPRLFHVNWFLRDNEGRFVWPGFGHNARVLAWIFGRIQGRDTARPTPIGWVPKEGDLDL ----------------------!!!!--------------------1111---2222--2 GGLPGVDYSQLFPMEKGFWEEECRQLREYYGENFGADLPRDVMAELEGLEERVRKM 222---3333-----------------------!!!!-3333----------1111 >PROBABLE ATP-DEPENDENT DN; SWP:Q9I1X7; PDB:2FAOA; RAATAGVRISHPQRLIDPSIQASKLELAEFHARYADLLLRDLRERPVSLVRGPDGIGGEL 1111------1111--3333-----------------33331111------1111----- FFQKHAARLKIPGIVQLDPALDPGHPPLLQIRSAEALVGAVQMGSIEFHTWNASLANLER ----------2222---33332222----------------------------3333--- PDRFVLDLDPDPALPWKRMLEATQLSLTLLDELGLRAFLKTSGGKGMHLLVPLERRHGWD 
--------------3333------------1111-------------------------- EVKDFAQAISQHLARLMPERFSAVSGPRNRVGKIFVDYLRNSRGASTVAAYSVRAREGLP -------------------------11112222----11112222---2222---2222- VSVPVFREELDSLQGANQWNLRSLPQRLDELAGDDPWADYAGTRQRISAAMRRQL -----33331111------1111-----------1111-1111------------ >Ig heavy chain V region S; SWP:P01755; PDB:2FATH; GVKLQQSGPEVVKPGASVKISCKASGYSFTNFYIHWVKQRPGQGLEWIGWIFHGSDNTEY ------------2222------------1111-------2222----------------- NEKFKDKATLTADTSSSTAYMQLSSLTSEDSAVYFCARWGPHWYFDVWGQGTTVTVSSAK 3333----------------------3333---------1111----------------- TTPPSVYPLAPGNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLYTLSSS ---------------------------------%%%%--2222-------iiii------ VTVPSSTWPSETVTCNVAHPASSTKVDKKIAAAG ---1111------------1111----------- >Igk-C protein; SWP:Q58EV6; PDB:2FATL; DIVLTQSPDITAASLGQKVTITCSASSSVSYMHWYQQKSGTSPKPWIFEISKLASGVPAR -------------2222--------------------2222------------2222333 FSGSGSGTSYSLTISSMEAEDAAIYYCQQWNYPFTFGGGTKLEIKRADAAPTVSIFPPSS 3----!!!!--------3333--------------------------------------- EQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLT ---------------------------iiii----------------------------- KDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ---3333--------1111------------- >VACUOLAR PROTEIN SORTING ; SWP:O75436; PDB:2FAUA; GFFGPIEIDIVLNDGETRKMAEMKTEDGKVEKHYLFYDGESVSGKVNLAFKQPGKRLEHQ --3333-----1111---------1111--------2222-------------------- GIRIEFVGQIELFNDKSNTHEFVNLVKELALPGELTQSRSYDFEFMQVEKPYESYIGANV -------------1111---------------------------------------1111 RLRYFLKVTIVRRLTDLVKEYDLIVHQLATYPDVNNSIKMEVGIEDLHIEFEYNKSKYHL ----------------------------------------------------------11 KDVIVGKIYFLLVRIKIQHMELQLIKKEITGIGPSTTTETETIAKYEIMDGSIPIRLFLA 11------------------------------1111---------------------333 GYDPTPTMRDVNKKFSVRYFLNLVLVDEEDRRYFKQQEIILWRKAP 3----------1111-----------1111---------------- >Ubiquitin-like containing; SWP:Q96T88; PDB:2FAZA; SMWIQVRTMDGRQTHTVDSLSRLTKVEELRRKIQELFHVEPGLQRLFYRGKQMEDGHTLF 
-------1111---------1111---------------3333----iiii------333 DYEVRLNDTIQLLVRQS 3---2222--------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FB0A; SIRLNVFVRVNETNREKAIEAAKELTACSLKEEGCIAYDTFESSTRRDVFICETWQNAEV ----------3333-----------------1111----------1111----------- LAAHEKTAHFAQYVGIIQELAEKLEKFEF ----------------------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FB1A; NYYSSNPTFYLGIDCIIFGFNEGEISLLLLKRNFEPAGEWSLGGFVQKDESVDDAAKRVL 1111----------------iiii----------------------1111---------- AELTGLENVYEQVGAFGAIDRDPGERVVSIAYYALININEYDRELVQKHNAYWVNINELP -----------------11113333-----------3333--------------1111-- ALIFDHPEVDKAREKQKASVEPIGFNLLPKLFTLSQLQSLYEAIYGEPDKRNFRKRVAED ----3333--33333333---3333----------------------------------- FIEKTDKIDKLGSKRGAALYKFNGKAYRKDPKFKL -----------1111-------------------- >Ig heavy chain V-III regi; SWP:P01772; PDB:2FB4H; EVQLVQSGGGVVQPGRSLRLSCSSSGFIFSSYAMYWVRQAPGKGLEWVAIIWDDGSDQHY ------------2222-----------1111--------2222--------1111----- ADSVKGRFTISRNDSKNTLFLQMDSLRPEDTGVYFCARDGGHFC 3333--------3333----------1111-------------- >Putative uncharacterized ; SWP:Q6GMW6; PDB:2FB4L; QSVLTQPPSASGTPGQRVTISCSGTSSNI ------------2222------------- >HYPOTHETICAL MEMBRANE SPA; SWP:NA; PDB:2FB5A; SNAMHEWGLSEELKIQTKQMIEIAEKELSIMRNAIDKEDECILCKMEDIHHMLANVQTLA ---------------------------------3333----------------------- ATYYIQAYLSPYTESSSFITTAIQHLSARKHGALIVVERNETLEALIQTGTTLNAHLTAP --------3333------------------------------1111-------------- LLESIFYPGNPLHDGAVLVKNNHIVSAANILPLTKSTEVDPELGTRHRAAIGLSEKSDAL ------2222---------!!!!----------------1111---------3333---- ILVVSEETGRTSFALNGILYTISL --------------iiii------ >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FB6A; ASANDKLTILWTTDNKDTVFNLAYALNSKNRGWWKHINIILWGASVKLVANDTQVQTEIL -1111-------------------------------------3333-------------- ELQSGITIEACQDCCENFGVASIITNLGITVRYGIPLTEYLKNGEKILSI -1111-------------------1111------------1111------ >SM-LIKE PROTEIN, LSM-14_N; SWP:Q7SXR4; PDB:2FB7A; 
PYIGSKISLISKAEIRYEGILYTIDTENSTVALAKVRSFGTEDRPTDRPIAPRDETFEYI ----------1111-----------1111------------------------------- IFRGSDIKDLTVCEPPKPIM ---1111------------- >B-Raf proto-oncogene seri; SWP:P15056; PDB:2FB8A; DWEIPDGQITVGQRIGSGSFGTVYKGKWHGDVAVKMLNVTAPTPQQLQAFKNEVGVLRKT -------------------------------------------------------3333- RHVNILLFMGYSTKPQLAIVTQWCEGSSLYHHLHIIETKFEMIKLIDIARQTAQGMDYLH ------------------------------------------------------------ AKSIIHRDLKSNNIFLHEDLTVKIGDFGLATVSGSILWMAPEVIRMQDKNPYSFQSDVYA ---------3333-------------1111----1111-3333----------------- FGIVLYELMTGQLPYSNINNRDQIIFMVGRGYLSPDLSKVRSNCPKAMKRLMAECLKKKR --------------1111-----------------1111-3333---------------3 DERPLFPQILASIELLARS 333-----------3333- >D-ALANINE:D-ALANINE LIGAS; SWP:NA; PDB:2FB9A; MEFMRVLLIAGGVSPEHEVSLLSAEGVLRHIPFPTDLAVIAQDGRWLLGEKALTALEAKA -------------1111-----------------------1111----3333--1111-- APEGEHPFPPPLSWERYDVVFPLLHGRFGEDGTVQGFLELLGKPYVGAGVAASALCMDKD -------------1111---------------------1111------------------ LSKRVLAQAGVPVVPWVAVRKGEPPVVPFDPPFFVKPANTGSSVGISRVERFQDLEAALA ------1111---------2222-----------------%%%%------3333------ LAFRYDEKAVVEKALSPVRELEVGVLGNVFGEASPVGEVRYEAPFYDYETKYTPGRAELL ------------------------------------------------------------ IPAPLDPGTQETVQELALKAYKVLGVRGMARVDFFLAEGELYLNELNTIPGFTPTSMYPR ------------------------------------iiii------------1111---- LFEAGGVAYPELLRRLVELALT --1111-----------3333- >GLUCOAMYLASE GLU1; SWP:P08017; PDB:2FBAA; AYPSFEAYSNYKVDRTDLETFLDKQKEVSLYYLLQNIAYPEGQFNNGVPGTVIASPSTSN -1111----------------------------1111-1111-----2222--------- PDYYYQWTRDSAITFLTVLSELEDNNFNTTLAKAVEYYINTSYNLQRTSNPSGSFDDENH --------------------------------------------1111-33331111%%% KGLGEPKFNTDGSAYTGAWGRPQNDGPALRAYAISRYLNDVNSLNEGKLVLTDSGDINFS %-------1111--------------------------------iiii--1111------ STEDIYKNIIKPDLEYVIGYWDSTGFDLWEENQGRHFFTSLVQQKALAYAVDIAKSFDDG --------------------------1111------------------------1111-- 
DFANTLSSTASTLESYLSGSDGGFVNTDVNHIVENPDLLQQNSRQGLDSATYIGPLLTHD -------------------3333-----------3333---------3333--------1 IGESSSTPFDVDNEYVLQSYYLLLEDNKDRYSVNSAYSAGAAIGRYPEDVYNGDGSSEGN 111------1111------------------1111----------1111----------- PWFLATAYAAQVPYKLAYDAKSASNDITINKINYDFFNKYIVDLSTINSAYQSSDSVTIK --------------------1111-----3333---------3333-1111--------2 SGSDEFNTVADNLVTFGDSFLQVILDHINDDGSLNEQLNRYTGYSTGAYSLTWSSGALLE 222-------------------------1111---------------------------- AIRLRNKVKALA -------1111- >LYSOZYME 1; SWP:Q7YT16; PDB:2FBDA; KTFTRCSLAREMYALGVPKSELPQWTCIAEHESSYRTNVVGPTNSNGSNDYGIFQINNYY ------------1111-3333-----------%%%%-------1111----1111----- WCQPSNGRFSYNECHLSCDALLTDNISNSVTCARKIKSQQGWTAWSTWKYCSGSLPSIND ---1111----1111-3333--------------------33331111---------333 CF 3- >PREDICTED: SIMILAR TO RET; SWP:NA; PDB:2FBEA; GPLGSPEFQVDTFDVDTANNYLIISEDLRSFRSGDLSQNRKEQAERFDTALCVLGTPRFT ----3333-----3333-1111--1111--------------1111-------------- SGRHYWEVDVGTSQVWDVGVCKESVNRQGKIELSSEHGFLTVGCREGKVFAASTVPTPLW ---------!!!!--------1111--------1111----------------------- VSPQLHRVGIFLDVGRSIAFYNVSDGCHIYTFIEIPVCEPWRPFFAHKRGSQDDQSILSI -1111---------------------------------------------1111------ CSVIN ----- >TRANSCRIPTIONAL REGULATOR; SWP:Q9HYQ4; PDB:2FBHA; YFGTLLAQTSRAWRAELDRRLSHLGLSQARWLVLLHLARHRDSPTQRELAQSVGVEGPTL --------------------3333----3333---------------------------- ARLLDGLESQGLVRRLAVAEDRRAKHIVLTPKADVLIADIEAIAASVRNDVLTGIDESEQ -------1111------------------3333----------------1111------- ALCQQVLLRILANLENR ----------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q9HWP6; PDB:2FBIA; RPSLTLTLLQAREAAMSFFRPSLNQHGLTEQQWRVIRILRQQGEMESYQLANQACILRPS --------------3333-----1111--------------------------------- MTGVLARLERDGIVRRWKAPKDQRRVYVNLTEKGQQCFVSMSGDMEKNYQRIQERFGEEK --------1111---------1111----------------------------------- LAQLLELLNELKKIKP ---------3333--- >Ig heavy chain V region J; SWP:P01810; PDB:2FBJH; EVKLLESGGGLVQPGGSLKLSCAASGFDFSKYWMSWVRQAPGKGLEWIGEIHPDSGTINY 
------------2222-----------3333--------2222----------------- TPSLKDKFIISRDNAKNSLYLQMSKVRSEDTALYYCARLHYYGYNAYWGQGTLVTVSAES 3333---------1111---------3333---------2222----------------- ARNPTIYPLTLPPALSSDPVIIGCLIHDYFPSGTMNVTWGKSGKDITTVNFPPALASGGR ------------------------------------------------------------ YTMSNQLTLPAVECPEGESVKCSVQHDSNPVQELDVNCSG ---------3333-2222-------!!!!----------- >TRANSCRIPTIONAL REGULATOR; SWP:Q9RV71; PDB:2FBKA; DTAALLERIRSDWARLNHGPSAGPMLTLLLLERLHAALGREIERTYAASGLNAAGWDLLL -----------------------------------------33333333--3333----- TLYRSAPPEGLRPTELSALAAISGPSTSNRIVRLLEKGLIERREASIRLTPQGRALVTHL ------3333----3333-----3333-------1111---------------------- LPAHLATTQRVLAPLSAQEQRTLEELAGRMLAGLEQ --------3333---2222------------1111- >HYPOTHETICAL PROTEIN NE14; SWP:Q82UI9; PDB:2FBLA; TEIERKFLVATFPDGELHAVPLRQGYLTTPTDSIELRLRQQGTEYFMTLKSQEYEIQIDV ----------------------------1111--------!!!!---------------- TQFEMLWPATEGRRVEKTRYSGKLPDGQLFELDVFAGHLSPLMLVEVEFLSEDAAQAFIP ---------2222----------1111--------!!!!-----------11111111-- PPWFGEEVTEDKRYKNKALALSIP 3333---11111111--------- >Y chromosome chromodomain; SWP:Q9Y6F8; PDB:2FBMA; TYRDIVVKKEDGFTQIVLSTRSTEKNALNTEVIKEIVNALNSAAADDSKLVLFSAAGSVF -----------------------%%%%--------------------------------- CCGLDFGYFVKHLRNNRNTASLEMVDTIKNFVNTFIQFKKPIVVSVNGPAIGLGASILPL -----3333-3333-3333-------------------------------------3333 CDLVWANEKAWFQTPYTTFGQSPDGCSSITFPKMMGKASANEMLIAGRKLTAREACAKGL ------1111----3333-----iiii----------------------------1111- VSQVFLTGTFTQEVMIQIKELASYNPIVLEECKALVRCNIKLELEQANERECEVLRKIWS -----3333-----------1111------------------------------------ SAQGIESMLKI -------1111 >70 KDA PEPTIDYLPROLYL ISO; SWP:Q8I4V8; PDB:2FBNA; AKKSIYDYTDEEKVQSAFDIKEEGNEFFKKNEINEAIVKYKEALDFFIHTEEWDDQILLD ---3333--------------------1111-------------1111-1111------- KKKNIEISCNLNLATCYNKNKDYPKAIDHASKVLKIDKNNVKALYKLGVANMYFGFLEEA ------------------------------------2222-------------------- KENLYKAASLNPNNLDIRNSYELCVNKLKEARK 
----------1111------------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q9HZJ9; PDB:2FBQA; AQSETVERILDAAEQLFAEKGFAETSLRLITSKAGVNLAAVNYHFGSKKALIQAVFSRFL ---------------------1111-----------3333-------------------- GPFCASLEKELDRRQAKPEAQHATLEDLLHLLVSQAMAVKPRSGNDLSIFMRLLGLAFSQ ----------------1111---------------1111------------------111 SQGHLRKYLEEVYGKVFRRYMLLVNEAAPKLPPIELFWRVHFMLGAAAFSMSGIKALRAM 1-----------------------1111-------------------------------- AETDFGVNTSTEQVMHLMVPFFAAGMRAESGID ------------------------1111----- >WERNER SYNDROME HELICASE; SWP:Q14191; PDB:2FBYA; HHSVFEDDLPFLEFTGSIVYSYDASDCSFLSEDISMSLSDGDVVGFDMEWPPLYNRGKLG --3333---------------------------3333-2222------------iiii-- KVALIQLCVSESKCYLFHVSSMSVFPQGLKMLLENKAVKKAGVGIEGDQWKLLRDFDIKL ---------1111-----1111------------3333-----3333------------- KNFVELTDVANKKLKCTETWSLNSLVKHLLGKQLLKDKSIRCSNWSKFPLTEDQKLYAAT -----------1111---------------------3333---1111------------- DAYAGFIIYRNLEILD ---------------- >50S RIBOSOMAL PROTEIN L7A; SWP:Q9YAX7; PDB:2FC3A; PIYVRFEVPEDLAEKAYEAVKRARETGRIKKGTNETTKAVERGLAKLVVIAEDVDPPEIV 1111----------------------------------------------------3333 MHLPLLCDEKKIPYVYVPSKKRLGEAAGIEVAAASVAIIEPGDAETLVREIVEKVKELRA ----------------------------------------!!!!---------------- KAGV ---- >TARGET OF EGR1, MEMBER 1; SWP:Q25NF8; PDB:2FC6A; GSSGSSGCCLPPATHRPHPTSICDNFSAYGWCPLGPQCPQSHDISGPSSG -------------------------------1111--------------- >ZZZ3 PROTEIN; SWP:Q9Y4U0; PDB:2FC7A; GSSGSSGQQMQAESGFVQHVGFKCDNCGIEPIQGVRWHCQDCPPEMSLDFCDSCSDCLHE --------------------------------------------------3333------ TDIHKEDHQLEPIYRSSGPSSG ----3333-------------- >NCL PROTEIN; SWP:Q9BQ02; PDB:2FC8A; GSSGSSGPNARSQPSKTLFVKGLSEDTTEETLKESFDGSVRARIVTDRETGSSKGFGFVD --------------------------------3333------------------------ FNSEEDAKAAKEAMEDGEIDGNKVTLDWAKPKGEGGSGPSSG -----------1111---%%%%-------------------- >NCL PROTEIN; SWP:Q9BQ02; PDB:2FC9A; GSSGSSGNSTWSGESKTLVLSNLSYSATEETLQEVFEKATFIKVPQNQNGKSKGYAFIEF 
-----------------------333333333333-----------3333---------- ASFEDAKEALNSCNKREIEGRAIRLELQGPRGSPNSGPSSG ----3333---------%%%%-------------------- >TRNA (GUANINE-N(7)-)-METH; SWP:O34522; PDB:2FCAA; WDDFLAENADIAISNPADYKGKWNTVFGNDNPIHIEVGTGKGQFISGMAKQNPDINYIGI --------------333322223333------------!!!!---------1111----- ELFKSVIVTAVQKVKDSEAQNVKLLNIDADTLTDVFEPGEVKRVYLNFSDPWPKKRHEKR --3333---------------------3333-----2222-------------3333111 RLTYSHFLKKYEEVMGKGGSIHFKTDNRGLFEYSLKSFSEYGLLLTYVSLDLHNSNLEGN 1-------------------------------------------------1111------ IMTEYEEKFSALGQPIYRAEVEWRT --3333------------------- >FC GAMMA RIIB; SWP:P31994; PDB:2FCBA; APPKAVLKLEPQWINVLQEDSVTLTCRGTHSPESDSIQWFHNGNLIPTHTQPSYRFKANN ----------------2222--------------------%%%%-1111---------33 NDSGEYTCQTGQTSLSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRCHSWKDKPLVKVTF 33-------1111------------------------2222-------2222-------- FQNGKSKKFSRSDPNFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTITVQAPA -iiii----------------3333---------!!!!--------------- >MULTIPLE PDZ DOMAIN PROTE; SWP:O75970; PDB:2FCFA; QSMQPRRVELWREPKSLGISIVGGRGIFIKHVLEDSPAGKNGTLKPGDRIVEVDGMDLRD --------------------------------------------2222----iiii-111 ASHEQAVEAIRKAGNPVVFMVQSIISTRL 1--------1111---------------- >C-termainl SH2 domain fro; SWP:P08487; PDB:2FCIA; GSPGIHESKEWYHASLTRAQAEHMLMRVPRDGAFLVRKRNEPNSYAISFRAEGKIKHCRV -----------------3333--------------------------------------- QQEGQTVMLGNSEFDSLVDLISYYEKHPLYRKMKLRYPINEENSS ---------------3333----------!!!!-------3333- >SMALL TOPRIM DOMAIN PROTE; SWP:Q5KVJ9; PDB:2FCJA; MRRVEKVIIVEGRSDKQKVAAVLNEPVVIVCTNGTISDARLEELADELEGYDVYLLADAD -------------------1111----------------------1111----------- EAGEKLRRQFRRMFPEAEHLYIDRAYREVAAAPIWHLAQVLLRARFDVRIESLM -------------3333-----3333-3333----------1111-----1111 >ribosomal-protein-serine ; SWP:Q9KQV9; PDB:2FCKA; TPDFQIVTQRLQLRLITADEAEELVQCIRQSQTLHQWVDWFSQQEAEQFIQATRLNWVKA 1111---1111-----1111-------1111-3333------------------------ 
EAYGFGVFERQTQTLVGVAINEFYHTFNASLGYWIGDRYQRQGYGKEALTALILFCFERL -----------------------3333--------33331111----------------- ELTRLEIVCDPENVPSQALALRCGANREQLAPNRFLYAGEPKAGIVFSLIP ---------1111-------1111------------%%%%----------- >HYPOTHETICAL PROTEIN TM10; SWP:Q9X0A5; PDB:2FCLA; MIRPEYLRVLRIYDRLNEVNWVVTGSLSFALQGVPVEVHDIDIQTDEEGAYEIERIFSEF ------------1111-------------1111------------3333-------1111 VSVRFSSTEICSHFGELIIDGIVEIMGDIRRLEDGTWEDPVDLNYRFVETHGMIPVLSLE ------------------iiii---------1111--------------iiii------- YEYQAYLLGRVEAETLRWLNER ------------------1111 >recombination protein U (; SWP:Q5KXY4; PDB:2FCOA; GMTLEDDLNATNEYYRERGIAVIHKKPTPVAYFRQASTTDYNGVYRGKYIDFEAKETKNK --------------------------------------------iiii------------ TAFPLKNFHAHQIRHMEQVVAHGGICFAILRFSLLNETYLLDASHLIAWWNKQEAGGRKS ---3333-3333-------1111--------1111------3333------3333----- IPKQEIERHGHSIPLGYQPRIDYISVVDNVYFTR -3333----------------------------- >FLAVODOXIN; SWP:P14070; PDB:2FCR; KIGIFFSTSTGNTTEVADFIGKTLGAKADAPIDVDDVTDPQALKDYDLLFLGAPTWNTGA -----------------------!!!!-----3333--3333--------------2222 DTERSGTSWDEFLYDKLPEVDMKDLPVAIFGLGDAEGYPDNFCDAIEEIHDCFAKQGAKP ------3333-----3333--2222--------33331111------------------- VGFSNPDDYDYEESKSVRDGKFLGLPLDMVNDQIPMEKRVAGWVEAVVSETGV ----1111-----1111iiii-------------------------------- >SYRINGOMYCIN BIOSYNTHESIS; SWP:Q9RBY6; PDB:2FCTA; KKFALTAEQRASFEKNGFIGPFDAYSPEEMKETWKRTRLRLLDRSAAAYQDLDATNIANY 1111---------------------------------------1111------------- DRHLDDDFLASHICRPEICDRVESILGPNVLCWRTEFFPKYPGDEGTDWHQADTFANASG 3333------3333-3333---------------------2222-----------3333- KPQIIWPENEEFGGTITVWTAFTDANIANGCLQFIPGTQNSMNYDETKRMTYEPDANNSV ------1111---------------3333-----2222---------------------- VKDGVRRGFFGYDYRQLQIDENWKPDEASAVPMQMKAGQFIIFWSTLMHASYPHSGESQE -iiii---iiii3333---1111--3333------2222----1111------------- MRMGFASRYVPSFVHVYPDSDHIEEYGGRISLEKYGAVQVIGDETPEYNRLVTHTTRGKK ----------3333----------iiii---1111----------3333-----1111-- FEAV ---- 
------------------------------------------------------------ ---------------------------------------------- >AVIRULENCE PROTEIN AVRPTO; SWP:Q8RSY1; PDB:2FD4A; KLAALDPIASQFSQLRTISKALGFKDAADDVTHCLFGGELSLSNPDQQVIGLAGNPTDTS ----------3333-------------------1111---1111---------------- QPYSDLAFMDMKKLAQFLAGKPEHPMTRETLNAENIAKYAFRIVP -----------------1111----------33331111------ >TRANSCRIPTIONAL REGULATOR; SWP:Q9HZ91; PDB:2FD5A; MSDKKTQTRARILGAATQALLERGAVEPSVGEVMGAAGLTVGGFYAHFQSKDALMLEAFE ----------------------------3333--1111--3333---------------- QLLGKRRELLGELDPGLSGKERRALAAAFYLSRKHRDAQVDAGCPLPATLAEVARLPEGF ---------11111111-----------------1111------333333331111---- REVLSRHVEIMVTSLAESPEETDVALADLVLMIGGLALARALGPGELSDRVLRAAKQAVN ------------1111-3333----------------------------------1111- >Igh-6 protein; SWP:Q4V9V8; PDB:2FD6H; GVKLQQSGPEVVKPGASVKISCKASGYSFTNFYIHWVKQRPGQGLEWIGWIFHGSDNTEY ------------2222-----------1111--------2222----------1111--- NEKFKDKATLTADTSSSTAYMQLSSL 3333--------3333---------- >Putative uncharacterized ; SWP:Q52L64; PDB:2FD6L; DIVLTQSPDITAASLGQKVTITCSASSSVSYMHWYQQKSGTSPKPWIFEISKLASGVPAR -------------2222--------------------%%%%------------2222333 FSGSGSGTSYSLTISSMEAEDAAIYYCQQWNYPFTFGGGTKLEIKRADAAPTVSIFPPSS 3----!!!!--------3333--------------------------------------- EQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLT --1111---------------------iiii----------------------------- KDEYERHNSYTCEATHKTSTSPIVKSFNRNEAKA ---1111--------3333--------3333--- >Fibroblast growth factor ; SWP:P55075; PDB:2FDBM; FTQHVREQSLVTDQLSRRLIRTYQLYSRTSGKHVQVLANKRINAMAEDGDPFAKLIVETD -------3333-------------------------1111------22221111------ TFGSRVRVRGAETGLYICMNKKGKLIAKSNGKGKDCVFTEIVLENNYTALQNAKYEGWYM -%%%%--------------1111-------------------1111--------1111-- AFTRKGRPRKGSKTRQHQREVHFMKRLPR --1111---3333-1111----------- >GAG-POL POLYPROTEIN; SWP:Q7SPG9; PDB:2FDDA; PQITLWQRPIVTIKIGGQQREALLDTGADDTVLEDINLPGRWKPKIIGGVGGFVKVRQYD 
--------------%%%%------1111--------------------2222-------- QVPIEICGHKVIGTVLVGPTPANIIGRNLMTQIGCTLNF -----iiii----------------3333-1111----- >ALKYLATED DNA REPAIR PROT; SWP:P05050; PDB:2FDIA; LAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQ -2222-------1111--------------------1111-----------------111 GYLYSPIDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDK 1-----------------------------11111111----------2222-------- DEPDLRAPIVSVSLGLPAIFQFGGLKRNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKA ---1111-----------------------------2222-----3333----------- GFHPLTIDCRYNLTFRQAGK -------------------- >FERREDOXIN; SWP:P00198; PDB:2FDN; AYVINEACISCGACEPECPVNAISSGDDRYVIDADTCIDCGACAGVCPVDAPVQA ----3333---3333--1111-------------------3333--1111----- >Uncharacterized protein A; SWP:Y2331_ARCFU; PDB:2FDOB; HPAYVFSKESFLKFLEGHLEDDVVVVVSSDVTDFCKKLSESVGEKEYCFAEFAFPADIFD -------------------1111--------------------------------1111- ADEDEIDEKYAIVFVEKEKLSEAGRNAIR ---------------3333-----1111- >CONSERVED HYPOTHETICAL PR; SWP:Q8UH93; PDB:2FDRA; GFDLIIFDCDGVLVDSEIIAAQVESRLLTEAGYPISVEEGERFAGTWKNILLQVESEASI --------2222-----------------------3333--------------------- PLSASLLDKSEKLLDRLERDVKIIDGVKFALSRLTTPRCICSNSSSHRLDLTKVGLKPYF --3333-----------------2222-3333------------3333---11113333- APHIYSAKDLGADRVKPKPDIFLHGAAQFGVSPDRVVVVEDSVHGIHGARAAGRVIGFTG -----3333-----------------1111-3333--------------1111------- ASHTYPSHADRLTDAGAETVISRQDLPAVIAAAE 11111111----1111------------------ >OROTIDINE-MONOPHOSPHATE-D; SWP:Q4Z4C3; PDB:2FDSA; HFKTKLKNRRNEVNTCLCIGLDPDEDDIKNFMRNEEKNGYKNVKNNMNNNNRIENVIKIG -------------------------------------%%%%-----------3333---3 KEILLTDEENIENLSEEDKFFYFFNHFCFYIINNTKEYALIYKMNFAFYIPYGSVGINAL 333---33331111--------------------3333------3333-1111------- KNVFDYLNSMNIPTMLDMKINDIGNTVKNYRKFIFEYLKSDSCTINVYMGTSMLKDICFD -------1111-----------3333--------------------33333333------ YEKNKYYSAYVLIKTTNKDSFIFQNELSINDKQAYIVMAEETQKMATDLKIDQNNEFIGF 
1111------------1111--------!!!!------------------3333------ VVGSNAFEEMKIIRNKFPDSYILSPGIDLYKTLKNGYNKDYEKLLINVGRAITKSPNPKK --1111----------1111-------------------3333-----3333-------- SSESYYNQIIQIFKDIEN ------------------ >CYTOCHROME P450 2A6; SWP:P11509; PDB:2FDVA; KGKLPPGPTPLPFIGNYLQLNTEQMYNSLMKISERYGPVFTIHLGPRRVVVLCGHDAVRE -------------!!!!---3333-------3333--------!!!!------------- ALVDQAEEFSGRGEQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERI -----3333-----3333---iiii------------------------2222------- QEEAGFLIDALRGTGGANIDPTFFLSRTVSNVISSIVFGDRFDYKDKEFLSLLRMMLGIF -------------iiii-------------------------1111-------------- QFTSTSTGQLYEMFSSVMKHLPGPQQQAFQLLQGLEDFIAKKVEHNQRTLDPNSPRDFID -----------------1111-3333------------------------1111------ SFLIRMQEEEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVEAKVHE ------1111-1111--------------------------------------------- EIDRVIGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGT ------------33331111------------------------------%%%%--2222 EVYPMLGSVLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIGKRNCFGEGLARMEL ----3333---1111--1111---11111111----11111111---------------- FLFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR -----------------3333------------------------ >CATHEPSIN K; SWP:P43235; PDB:2FDZA; APDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKATGALLNLAPQNLVDCVSEN -----3333--------------3333---------------------3333----1111 DGCGGGYMTNAFQYVQRNR !!!!--------------- >SMALL MYRISTOYLATED PROTE; SWP:Q5SDH5; PDB:2FE0A; MGCGASSENSSVTYVNGRPTFVGEEVTKGFEKDNGLLFRIVNKKKKQWAYYNDTTQYEMH ------------------------------------------------------------ VLVTFNEDCDIKALGKTKLEQQENGEWVASVVVYPCETEMFIEGRVNGFKSKMDALPLSE -------------!!!!----3333----------------------------------- EYRQHQAEKDK ------3333- >CONSERVED HYPOTHETICAL PR; SWP:Q8ZZP3; PDB:2FE1A; MELVVDASAIAALYVPEERSEQAERAVSQAQELHTLDLAAYEVANDLWKHARRGLLREDE -----------------------------------3333-----------1111------ ASNMLEELWEFFKALKVHSYAEVLKDAFALALKHGVTVYDAAYVALAEKIGGKLLTLDRQ 
-----------1111---3333--------------3333-------------------- LAEKFPALVT ----1111-- >PEROXIDE OPERON REGULATOR; SWP:P71086; PDB:2FE3A; HELKEALETLKETGVRITPQRHAILEYLVNSMAHPTADDIYKALEGKFPNMSVATVYNNL ----------1111-----------------------------33331111--------- RVFRESGLVKELTYGDASSRFDFVTSDHYHAICENCGKIVDFHYPGLDEVEQLAAHVTGF ---1111------!!!!------------------------------------------- KVSHHRLEIYGVCQECSKKEN --------------------- >PRESYNAPTIC PROTEIN SAP10; SWP:Q92796; PDB:2FE5A; SMTIMEVNLLKGPKGLGFSIAGGIGNQHIPGDNSIYITKIIEGGAAQKDGRLQIGDRLLA -----------1111-------2222--2222--------2222--------2222---- VNNTNLQDVRHEEAVASLKNTSDMVYLKVAKPGS !!!!------------------------------ >PROBABLE N-ACETYLTRANSFER; SWP:Q9I640; PDB:2FE7A; LEIRPAVPADAEQILAFIIELADYERARHEVVTDVEGIRRSLFAEGSPTRALMCLSEGRP ------3333----------------3333----------1111-----------iiii- IGYAVFFYSYSTWLGRNGIYLEDLYVTPEYRGAGRRLLRELAREAVANDCGRLEWSVLDW -----------1111-----------1111---------------1111--------111 NQPAIDFYRSIGALPQDEWVRYRLDGEALRKMAE 1-------1111---------------------- >REPLICASE POLYPROTEIN 1AB; SWP:P59641; PDB:2FE8A; MEVKTIKVFTTVDNTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLP ----------------------11113333------iiii-1111--1111--------- SDDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLA ----------------3333----------1111----iiii-----%%%%--------- LQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLE 1111---------------3333---------------2222-----------1111-11 SAKRVLNVVCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQES 11-------------------3333-------3333--------3333------------ SFVMMSAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGP --------------2222---------!!!!---------------!!!!---------- VTDVFYKETSYTTTI --------------- >2-hydroxy-3-keto-5-methyl; SWP:O31667; PDB:2FEAA; TTRKPFIICDFDGTITNDNIINIKTFAPPEWALKDGVLSKTLSIKEGVGRFGLLPSSLKE ----------2222---3333------3333-------------------33333333-- EITSFVLEDAKIREGFREFVAFINEHEIPFYVISGGDFFVYPLLEGIVEKDRIYCNHASF 
------------2222-----------------------33332222-1111-------- DNDYIHIDWPHSCKGTCSNQCGCCKPSVIHELSEPNQYIIIGDSVTDVEAAKLSDLCFAR -------------!!!!----------------2222------3333---1111------ DYLLNECREQNLNHLPYQDFYEIRKEIENVKEVQEWLQNK ---------------------------------------- ------------------------------------------------------------ ------------------------------------------ >HYPOTHETICAL PROTEIN PA22; SWP:Q9I1R6; PDB:2FEFA; LTLDNRLAEALPLWRNLARTDRAPRRNIDLADWKADWRELIAALDRFSRSHGYRQPFAAQ -----3333-------1111------------------------------33331111-- GHAALENAWAWGQAAENASTLLLKAIDRGLAGAELRSIYLETAALWLDYSRLLGAARDSL ------------------------------------------------------------ REQGETAPALAPRTGQYPFALQLLAGVLLDAQELIPALVEEVLQFDTDRLLDYLGAAALG ----------3333------------11113333--------%%%%-------------- LTSASEETFHPRPFGQLRAFFEEASDAQALAPYLQSQYREFFQLSPKAQKKTRRLTGPYA -------------------------3333-----------3333-----3333---1111 WGWWAEVSALGVLYGWDDGVLRASPHYLGDLVDYARARGD !!!!----3333-----3333--1111------------- >CD2-ASSOCIATED PROTEIN; SWP:Q9Y5K6; PDB:2FEIA; MRQCKVLFEYIPQNEDELELKVGDIIDINEEVEEGWWSGTLNNKLGLFPSNFVKELELEH -------------3333---2222------------------------3333-------- >Low molecular weight prot; SWP:P0AAB2; PDB:2FEKA; MFNNILVVCVGNICRSPTAERLLQRYHPELKVESAGLGALVGKGADPTAISVAAEHQLSL --------------------------1111---------2222----------1111--2 EGHCARQISRRLCRNYDLILTMEKRHIERLCEMAPEMRGKVMLFGHWDNECEIPDPYRKS 222-------------------3333-------3333-----1111-%%%%--------3 RETFAAVYTLLERSARQWAQALNAEQV 333------------------------ >3-CARBOXY-CIS,CIS-MUCONAT; SWP:Q2HNZ1; PDB:2FELA; SLSPFEHPFLSGLFGDSEIIELFSAKADIDAMIRFETALAQAEAEASIFADDEAEAIVSG --3333---3333--33331111--------------------1111--3333------- LSEFAADMSALRHGVAKDGVVVPELIRQMRAAVAGQAADKVHFGATSQDVIDTSLMLRLK -----------------------------1111--33332222--3333----------- MAAEIIATRLGHLIDTLGDLASRDGHKPLTGYTRMQAAIGITVADRAAGWIAPLERHLLR -----------------------3333-----%%%%---------3333----------- LETFAQNGFALQFGGAAGTLEKLGDNAGAVRADLAKRLGLADRPQWHNQRDGIAEFANLL 
-------------------33331111------------------1111----------- SLVTGTLGKFGQDIALMAEIGSEIRLSNPVNAETLVTLARFNAVQISALHQSLVQEQERS -------------------------------------------------1111------3 GAGWMLEWLTLPQMVTATGTSLLVAERLAAQIDRLGA 333-------------------------3333----- >CATABOLITE CONTROL PROTEI; SWP:P25144; PDB:2FEPA; TTTVGVIIPDISSIFYSELARGIEDIATMYKYNIILSNSDQNMEKELHLLNTMLGKQVDG ---------1111--------------1111-------%%%%-----------1111--- IVFMGGNITDEHVAEFKRSPVPIVLAASVEEQEETPSVAIDYEQAIYDAVKLLVDKGHTD -----------------------------3333--------------------1111--- IAFVSGPMAEPINRSKKLQGYKRALEEANLPFNEQFVAEGDYTYDSGLEALQHLMSLDKK ------3333---------------1111---3333-----------------1111--- PTAILSATDEMALGIIHAAQDQGLSIPEDLDIIGFDNTRLSLMVRPQLSTVVQPTYDIGA -------------------1111--------------3333------------------- VAMRLLTKLMNKEPVEEHIVELPHRIELRKSTK --------1111--------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8UGZ9; PDB:2FEXA; TRIAIALAQDFADWEPALLAAAARSYLGVEIVHATPDGPVTSGGLKVTPDTSYDALDPVD -----------1111------------------------------------1111-3333 IDALVIPGGLSWEKGTAADLGGLVKRFRDRDRLVAGICAAASALGGTGVLNDVAHTGNAL -----------1111------------1111----------------1111-------33 ASHKAYPAYRGEAHYRDQPRAVSDGGVVTAAGSAPVSFAVEILKSLGLFGPEAEAELQIF 33---1111-3333---------iiii---1111---------1111-----------33 AAEHR 33--- >STEROIDOGENIC FACTOR 1; SWP:P33242; PDB:2FF0A; DELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNNKHYTCTESQSCKIDKTQRKRCPFCR --------------iiii------------------------------3333-------- FQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRDRALKQQKKA ---------3333----------1111--------------- >PROBABLE REGULATORY PROTE; SWP:P14737; PDB:2FF4A; EKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSG ---------------iiii------------------2222----------------111 ARASIHSYVSNLRKLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAA 1---------------1111-3333----3333-----1111----------------11 AGRFEQASRHLSAALREWRGPVLDDLRDFQFVEPFATALVEDKVLAHTAKAEAEIACGRA 11------------3333----1111--3333----------------------111133 
SAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLGIDPGPTL 33-----------1111------------------------------------------- RALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAA ------1111--------------------11113333---------------------- TRIGRLHDNDIVLDSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGD -----1111-----1111--------------------1111--%%%%--------2222 HIRICDHEFTFQISAGTHG ---!!!!------------ >Alpha-hemolysin transloca; SWP:P08716; PDB:2FF7A; HHDITFRNIRFRYKPDSPVILDNINLSIKQGEVIGIVGRSGSGKSTLTKLIQRFYIPENG -------------3333-----------2222------22223333---1111------- QVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSIIDNISLANPGMSVEKVIYAAKLAGA ---iiii3333------------------1111--------------------------- HDFISELREGYNTIVGEQGAGLSGGQRQRIAIARALVNNPKILIFDEATSALDYESEHVI ------1111-----2222----------------1111--------------------- MRNMHKICKGRTVIIIAHRLSTVKNADRIIVMEKGKIVEQGKHKELLSEPESLYSYLYQL -------2222-------33331111------iiii------------------------ QSD --- >OROTIDINE 5-MONOPHOSPHATE; SWP:NA; PDB:2FFCA; NLKIKLQKRRDEVNTCLCIGLDPDEADIKSFMQSEKQNGYQSVKKNLSNSGELFAPQMGG ------------------------------------------------------------ QMLLATPPKEAQEKDEFFYFFNHFCFYIINETKEYALAYKMNFAFYLPYGSLGVDVLKNV --------3333-------------------3333------3333-1111---------- FDYLHHLNVPTILDIKMNDIGNTVKHYRKFIFDYLRSDSCTANIYMGTQMLRDICLDEEC -------------------3333--------------------33333333-----1111 KRYYSTFVLVKTTNADSHIFQNRLSLDGKEAYVVIAEEVQKMAKQLHLEENGEFVGFVVG -------------1111--------iiii3333--------------3333--------1 ANCYDEIKKIRELFPDCYILAPGVGAQKGDLRKMLCNGYSKNYEKVLINVGRAITKSGSP 111----------1111-------1111-------------3333-----3333------ QQAAREYHQQIKEVLAEL ------------------ >LPPG:FO 2-PHOPSPHO-L-LACT; SWP:Q8PVT6; PDB:2FFEA; IIFSGGTGTPKLLDGLKEILPEEELTVVVNTAEDLWVSGNLISPDLDTVLYLFSDQIDRK -------3333---3333--3333------------iiii-------------------- RWWGIENDTFGTYERKELGIEEGLKLGDRDRATHIIRSNIIRDGASLTDSTVKLSSLFGI ---------3333--1111----------------------------------------- 
KANILPSDDPVSTYIETAEGIHFQDFWIGKRGEPDVRGVDIRGVSEASISPKVLEAFEKE ----------------1111-3333-----------------3333-------------- ENILIGPSNPITSIGPIISLPGRELLKKKKVVAVSPIIGNAPVSGPAGKLPACGIEVSSG -----------------------3333-----------------1111------------ VAEYYQDFLDVFVFDERDRADEFAFERLGCHASRADTLTSTEKSKELAEIVVQAFLEHHH --1111-------------------1111------------------------------- H - >YKUJ; SWP:O34588; PDB:2FFGA; SQLGIITRLQSLQETAEAANEPQRYFEVNGEKICSVKYFEKNQTFELTVFQKGEKPNTYP ---------------------------iiii-------------------2222------ FDNIDVSIEIFELLQLE ----------------- >2-pyrone-4,6-dicarboxylic; SWP:Q88M75; PDB:2FFIA; LHLTAIDSHAHVFSRGLNLASQRRYAPNYDAPLGDYLGQLRAHGFSHGVLVQPSFLGTDN ----------------------------------------1111--------1111---- RYLLSALQTVPGQLRGVVLERDVEQATLAEARLGVRGVRLNLGQDPDLTGAQWRPLLERI -----------------------3333---1111------------1111--3333---- GEQGWHVELHRQVADIPVLVRALQPYGLDIVIDHFGRPDARRGLGQPGFAELLTLSGRGK -----------33333333---3333-------iiii----------------------- VWVKVSGIYRLQGSPEENLAFARQALCALEAHYGAERLWGSDWPHTQHESEVSFGSAVEQ ------1111---1111----------------1111----------------------- FEALGCSAQLRQALLLDTARALFGFELE -3333-3333----------1111---- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FFJA; CPSCLLGRVYYEAKLVTDDEDLISQCVDESLKILAENINAHLATRIHRRVYEILGVEDPY ---------------------------------------------------------111 AEVKARANEVARQVLPLAKEIVEGSDDPFKTAVIVSIVGNNFDYVVEEEFRDFLKRKVQE 1------------------------------------1111------------------- GLKINDTERIKELSSGKVVYLTDNAGEIFFDTLLKEIKRRCEKLTAVVRGRPIISDATIE -------------------------3333-----------------------!!!!---- DARLARVDKIADELLTNGKGAIGIIDELPDETRKALEEADLIVAKGANYECLSLKPIAFL -----3333-------------------------------------3333---------- LTAKCEPVARDIGVNVGDVAKVVE --------------2222------ >RABBITPOX ENCODED CC CHEM; SWP:O10647; PDB:2FFKA; MPASLQQSSSSSSSCTEEENKHHMGIDVIIKVTKQDQTPTNDKICQSVTEITESESDPDP ---------1111----------------------------------------------- EVESEDDSTSVEDVDPPTTYYSIIGGGLRMNFGFTKCPQIKSISESADGNTVNARLSSVS 
-----------------------------------------------!!!!--------- PGQGKDSPAITHEEALAMIKDCEVSIDIRCSEEEKDSDIKTHPVLGSNISHKKVSYEDII ------------------------------------------------------------ GSTIVDTKCVKNLEFSVRIGDMCKESSELEVKDGFKYVDGSASKGATDDTSLIDSTKLKA -----3333-----------------1111-------iiii-------1111-------- CV -- >Small-inducible cytokine ; SWP:P13236; PDB:2FFKB; APMGSDPPTACCFSYTARKLPRNFVVDYYETSSLCSQPAVVFQTAASAQVCADPSESWVQ --------------------3333----------------------------3333---- EYVYDLELN ----3333- >DICER; SWP:NA; PDB:2FFLA; AMHALGHCCTVVTTRGPSHWLLLLDTHLGTLPGFKVSAGRGLPAAEVYFEAGPRVSLSRT ------------1111-------------------------------------------- DATIVAVYQSILFQLLGPTFPASWTEIGATMPHNEYTFPRFISNPPQFATLAFLPLLSPT ----------------1111--3333-----1111--1111----------------111 SPLDLRALMVTAQLMCDAKRLSDELSASLHGRMVATPEISWSLYVVLGIDSTQTSLSYFT 1------------------3333-----2222---1111----------33331111--- RANESITYMRYYATAHNIHLRAADLPLVAAVRLDDLKDHQIPAPGSDDLAPKLRFLPPEL 2222-----------------1111------3333--------------1111---1111 CLLLPDEFDLIRVQALQFLPEIAKHICDIQNTICALDKSFPDCGRIGGERYFAITAGLRL ----3333---------------------------1111--------------------- DQGRGRGLAGWRTPFGPFGVSHTDVFQRLELLGDAVLGFIVTARLLCLFPDASVGTLVEL --------!!!!---2222-----------------------------1111-------- KMELVRNEALNYLVQTLGLPQLAEFSNNLKSKTWADMYEEIVGSIFTGPNGIYGCEEFLA 1111--3333--------3333----------3333-----------3333--------- KTLMSPEHSKTACPDAVTKASKRVCMGEAGAHEFRSLVDYACEQGISVFCSSRVSTMFLE ----1111-----3333------------------------------------------- RLRDIPAEDMLDWYRLGIQFSHRSGLSGPGGVVSVIDIMTHLARGLWLGSPGFYVEQPPT -11113333------------1111--------3333----------------------- IPVLYIYHRSVQCPVLYGSLTTGPVASKVLALYEKILAYESSGGSKHIAAQTVSRSLAVP ---3333-----3333-------------------------------------------- IPSGTIPFLIRLLQIALTPHVYQKLELLGDAFLKCSLALHLHALHPTLTEGALTRMRQSA --------------------------------------------1111--------3333 ETNSVLGRLTKRFPSVVSEVIIESHPKIQPDSKVYGDTFEAILAAILLACGEEAAGAFVR ---------1111---------------1111---------------------------- 
EHVLPQVVADA --3333----- >SAV1430; SWP:Q99U58; PDB:2FFMA; KIISISETPNHNTKITLSESREGTSDTYTKVDDSQPAFINDILKVEGVKSIFHVDFISVD -------------------------------3333------------------------- KENDANWETVLPKVEAVFE -11113333-----3333- >RAS-RELATED PROTEIN RAB-6; SWP:Q9NRW1; PDB:2FFQA; KFKLVFLGEQSVGKTSLITRFMYDSFDNTYQATIGIDFLSKTMYLEDRTVRLQLWDTAGQ --------2222--------------------------------%%%%-----------1 ERFRSLIPSYIRDSTVAVVVYDITNLNSFQQTSKWIDDVRTERGSDVIIMLVGNKTDLAD 11111113333----------1111-----------------!!!!--------3333-- KRQITIEEGEQRAKELSVMFIETSAKTGYNVKQLFRRVASALLE ------------------------1111---------------- >HYPOTHETICAL PROTEIN PA12; SWP:Q9I4D2; PDB:2FFSA; QFEHLVQVNDRTLPVLDRLQLWEGLVCRAREPQYFVVGLERFEILVDDGDRLHRRLYLPG ------------------------------3333-----------------------222 LVVEDEVVLKAPDSAHYSIKPSAEVAGGSLDTIEEPEPGSLFVRFAYCTRYLQPDELPYD 2--------------------1111-----------2222-------------------3 AFVKQAYIADVETIATIRDRFGA 333-------------------- ------------------------------------------------------------ ------------------------ >POLYPEPTIDE N-ACETYLGALAC; SWP:Q10471; PDB:2FFUA; KVRWPDFNQEAYVGGTMVRSGQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD --3333-3333-------22221111--------33331111------3333-------- LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGALLGKIEKVR -----------------------------3333------------3333-1111-2222- VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIID -----------------1111----------------------------1111------- VINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMD --------------------1111------------1111-1111--------------- KFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSG ----------1111----3333------1111-----1111-------------2222-- TVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVY -------------------------3333---------------------3333-----3 PELRVPDHQDIAFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMD 333--------------!!!!---iiii2222----------!!!!----1111---!!! 
LCLTVVDRAPGSLIKLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEV !-------2222-------------------%%%%---2222-----1111--------- CGPALSQQWKFTLN ---1111------- >MIDLINE-1; SWP:O15344; PDB:2FFWA; QKASVSGPNSPSETRRERAFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECL --------------------------------3333-----------3333-----3333 KATHPNKKPFTGHRLIEP ------------------ >Ferritin light chain; SWP:P02792; PDB:2FFXJ; SSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDRDDVALEGVSHFFRELAEEKRE -1111------------------------------------------------------- GYERLLKMQNQRGGRALFQDIKKPAEDEWGKTPDAMKAAMALEKKLNQALLDLHALGSAR ---------1111--------------------------------------------111 TDPHLCDFLETHFLDEEVKLIKKMGDHLTNLHRLGGPEAGLGEYLFERLTLKH 1------------------------------1111------------------ >BETA-LACTAMASE; SWP:P00811; PDB:2FFYA; APQQINDIVHRTITPLIEQQKIPGMAVAVIYQGKPYYFTWGYADIAKKQPVTQQTLFELG ------------------------------iiii----------1111---1111---!! SVSKTFTGVLGGDAIARGEIKLSDPTTKYWPELTAKQWNGITLLHLATYTAGGLPLQVPD !!------------1111--11113333-3333-3333--------------------33 EVKSSSDLLRFYQNWQPAWAPGTQRLYANSSIGLFGALAVKPSGLSFEQAMQTRVFQPLK 33-------------------------3333--------3333-------------1111 LNHTWINVPPAEEKNYAWGYREGKAVHVSPGALDAEAYGVKSTIEDMARWVQSNLKPLDI --------33331111----iiii------2222---------------------1111- NEKTLQQGIQLAQSRYWQTGDMYQGLGWEMLDWPVNPDSIINGSDAKIALAARPVKAITP ----------1111----!!!!-----------------3333----------------- PTPAVRASWVHKTGATGGFGSYVAFIPEKELGIVMLANKNYPNPARVDAAWQILNALQ ---------------1111------1111------------3333------------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FG1A; AEILYIKGDATAPIGSGVKVITHICNDIGGWGKGFVLALSKKWKPEEAYRQWYKSQEEFT --------1111---------------------3333----------------------2 LGAVQFVNVENKLYVANIGQHGIYKDSKGLPPIRYDAVRQCLKEVALFTIAHKASVHPRI 222------2222------------1111-----------------------------22 GCGLAGGKWELEQIIKEELITKEIAVTVYDL 223333-3333-------3333--------- >RAS-RELATED PROTEIN RAB-3; SWP:Q13636; PDB:2FG5A; IRELKVCLLGDTGVGKSSIVCRFVQDHFDHNISPTIGASFMTKTVPCGNELHKFLIWDTA 
----------2222--------------1111---------------------------- GQERFHSLAPMYYRGSAAAVIVYDITKQDSFYTLKKWVKELKEHGPENIVMAIAGNKCDL -33331111---2222-------11113333-------------------------3333 SDIREVPLKDAKEYAESIGAIVVETSAKNAINIEELFQGISRQIP ------3333----------------1111--------------- >5-NITROIMIDAZOLE ANTIBIOT; SWP:NA; PDB:2FG9A; GKTIVIEDKQRIESIILQADACFVGITDLEGNPYVVPNFGYENDTLYLHSGPEGGKIELQ ---------------------------1111----------%%%%---------333333 RNNNVCITFSLGHKLVYQHCSYSRSESACRGKVEFIEDEEKRHALDIIRHYTKDQFSYSD 33---------------------------------------------------------- PAVRNVKVWKVPVDQTGKVFGLAE ------------------------ >ACETOLACTATE SYNTHASE, SM; SWP:Q9WZ19; PDB:2FGCA; IREHLVSLVHNKPGVRKVANLFARRGFNISSITVGESETPGLSRLVIVKGDDKTIEQIEK -----------2222-----------------------2222--------1111------ QAYKLVEVVKVTPIDPLPENRVEREALIKVRFDEDKQEIFQLVEIFRGKIIDVSREGAII ----3333--------3333---------------------------------3333--- EITGARSKVEAFINLLPQKQVEEIARTGIVANRWNV ----3333----11113333---------------- >probable acetolactate syn; SWP:NA; PDB:2FGDA; HRHIISLLENEAGALSRVAGLFSARGYNIESLSVAPTEDPTLSRTLVTNGPDEIVEQITK ---------------------3333-------------3333--------3333------ QLNKLIEVVKLIDLSSEGYVERELLVKVRAVGKDREEKRLADIFRGNIIDVTNELYTIEL ----1111----1111--------------!!!!-------------------------- TGTRSKLDGFLQAVDCNLILEIARTGVSGLSRGERVLKL --------------3333-----------------3333 >ZINC METALLOPROTEASE (INS; SWP:Q9LJL3; PDB:2FGEA; DEAEKLGFEKVSEEFISECKSKAILFKHKKTGCEVSVSNEDENKVFGVVFRTPPKDSTGI ----------------1111---------------------------------------- PHILQHSVLCGSRKYPVKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDV -----3333--1111---3333-------------------------------------- YLDAVFFPKCVDDAHTFQQEGWHYELNDPSEDISYKGVVFNEKGVYSQPDNILGRIAQQA --------1111---------------1111------------1111------------- LSPENTYGVDSGGDPKDIPNLTFEEFKEFHRQYYHPSNARIWFYGDDDPVHRLRVLSEYL -11111111333333333333-------------3333---------3333---3333-- DFEASPSPNSSKIKFQKLFSEPVRLVEKYPAGRDGDLKKKHLCVNWLLSEKPLDLQTQLA 
-----3333--------------------------3333--------------3333--- LGFLDHLLGTPASPLRKILLESGLGEALVSSGLSDELLQPQFGIGLKGVSEENVQKVEEL ---------1111------3333--------------------------1111------- IDTLKKLAEEGFDNDAVEASNTIEFSLRENNTGSFPRGLSLLQSISKWIYDDPFEPLKYT -------------------------------!!!!3333-------3333-3333----- EPLKALKTRIAEEGSKAVFSPLIEKLILNNSHRVTIEQPDPEKATQEEVEEKNILEKVKA --------------3333--------1111------------------------------ ATEEDLAELARATEELKLKQETPDPPEALRCVPSLNLGDIPKEPTYVPTEVGDINGVKVL ------------------1111--33333333---1111--------------%%%%--- RHDLFTNDIIYTEVVFDIGSLKHELLPLVPLFCQSLLEGTKDLTFVQLNQLIGRKTGGIS -----%%%%--------111133331111----------1111----------------- VYPLTSSVRGKDEPCSKIIVRGKSAGRADDLFNLNCLLQEVQFTDQQRFKQFVSQSRARE -------2222-------------1111-------------------------------- NRLRGSGHGIAAARDALNIAGWSEQGGLSYLEFLHTLEKKVDEDWEGISSSLEEIRRSLL ------------------3333-------------------------------------- ARNGCIVNTADGKSLTNVEKSVAKFLDLLPENPSGGLVTWDGRLPLRNEAIVIPTQVNYV -2222--------------------1111------------------------------- GKAGNIYSTGYELDGSAYVISKHISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSYLSYRD ----3333---------------------------------------------------- PNLLKTLDIYDGTGDFLRGLDVDQETLTKAIIGTIGDVDSYQLPDAKGYSSLLRHLLGVT ----------------1111---------------------------------------- DEERQRKREEILTTSLKDFKDFAQAIDVVRDKGVAVAVASAEDIDAANNERSNFFEVKK ----------11113333--------3333----------------------------- >HYPOTHETICAL PROTEIN RV26; SWP:P65033; PDB:2FGGA; SEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQVGVGLARLDPADEPVAQIGDELAIARA ----------------------------%%%%---------------------------- LSDLANQLFALTSSDIEASTHQP ----------------------- >FERREDOXIN; SWP:Q9I6D2; PDB:2FGOA; SLKITDDCINCDVCEPECPNGAISQGEEIYVIDPNLCTECVGHYDEPQCQQVCPVDCIPL ----1111---3333--1111-----------3333-%%%%-----3333---------- DDANVESKDQLMEKYRKITGK 1111----------------- >PUTATIVE PERIPLASMIC PROT; SWP:Q9PI85; PDB:2FGSA; EYTLDKAHTDVGFKIKHLQISNVKGNFKDYSAVIDFDPASAEFKKLDVTIKIASVNTENQ ----1111------------------------------------------3333------ 
TRDNHLQQDDFFKAKKYPDMTFTMKKYEKIDNEKGKMTGTLTIAGVSKDIVLDAEIGGVA -------1111-3333--------------1111--------iiii-------------- KGKDGKEKIGFSLNGKIKRSDFKFATSTSTITLSDDINLNIEVEANEKE -1111------------3333---11113333----------------- >Two-component system yycF; SWP:Q794W0; PDB:2FGTA; HKIEKTTQKLSETVRPRDMFIHDDGAHYKVDDNALYEEIWSDLPHWDVKGIKDISDQYDK --------3333----------iiii---------------3333--------1111--- AGFKSWFYGIGGSEAKLDLQFSDTIPIDIFQTLFKWSNQSFEYSSFDHILIPFNETKANK ---------!!!!---------------------------1111---------------- KIYLVSYSKQLILEVTVESANYRNIMNDLKNRQSNMPAFSLFSIGSKKEFLLPNKPLTMD ------1111----------------------1111------------------------ KKEFVTESIKTNTFKQALFSDPSIVRENVLTDGISRLDVNLSQRQVQFQQLVQSTSYQTG ---------33333333---3333----------------1111---------------- ELIKKSQKYLEDTGSWTDHYQFFNINDSQQLSFYIFMDQIPVINSTAKPFGATSAITVQW ------------------------------------%%%%-------------------- ANDDILSYKRPNYSLGTNSETELMGGSEVKMLLSKQTAYDTDKIDQIFLAYQLLEPVWAM -------------------------3333--3333----3333----------------- KVNGKIVPITKDL ------------- >H52 FAB (HEAVY CHAIN); SWP:NA; PDB:2FGWH; EVQLVESGGGLVQPGGSLRLSCATSGYTFTEYTMHWMRQAPGKGLEWVAGINPKNGGTSH ------------2222-----------3333----------------------------- NQRFMDRFTISVDKSTSTAYMQMNSLRAEDTAVYYCARWRRYFDVWGQGTLVTVSSASTK 3333---------1111---------3333------------------------------ GPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS ------------------------------------%%%%-------------3333--- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSCD -------3333----------------------------- >PUTATIVE THIOREDOXIN; SWP:Q82SJ4; PDB:2FGXA; MNNQVEPRKLVVYGREGCHLCEEMIASLRVLQKKSWFELEVINIDGNEHLTRLYNDRVPV ------------------------------3333---------3333------------- LFAVNEDKELCHYFLDSDVIGAYLS ------------------------- >CARBOXYSOME SHELL POLYPEP; SWP:O85042; PDB:2FGYA; HPACITLPERTCRHPLTDLEANEQLGRCEDSVKNRFDRVIPFLQVVAGIPLGLDHVTRVQ ---1111------1111----------------------------1111----------- ELAQSSLGHTLPEELLKDNWISGHNLKGIFGYATAKALTAATEQFSRDDSASAIGFFLDC 
-----------3333--------------------------------------------- GFHAVDISPCADGRLKGLLPYILRLPLTAFTYRKAYAGSMFDIEDDLAQWEKNELRRYRE -----------1111-3333---------------2222--------------------- GVPNTADQPTRYLKIAVYHFSTSDPTHSGCAAHGSNDRAALEAALTQLMKFREAVENAHC ----1111------------1111-----3333--------------------------% CGASIDILLIGVDTDTDAIRVHIPDSKGFLNPYRYVDNTVTYAQTLHLAPDEARVIIHEA %%%---------------------1111--1111--------1111-------------- ILNANRSDGWAKGNGVASEGMRRFIGQLLINNLSQIDYVVNRHGGRYPPNDIGHAERYIS --1111--1111-----------------------------------1111--------- VGDGFDEVQIRNLAYYAHLDTVEENAIDVDVGITIFTKLNLSRGLPIPIAIHYRYDPNVP ---------2222-------3333---------------3333------------1111- GSRERTVVKARRIYNAIKERFSSLDEQNLLQFRLSVQAQDIGSPIEEVASA ------------------------1111-----------2222-------- >Hypothetical 16.0 kDa pro; SWP:Q04773; PDB:2FH0A; GENSAPVGAAIANFLEPQALERLSRVALVRRDRAQAVETYLKKLIATNNVTHKITEAEIV -------3333-------------------------------------------3333-- SILNGIAKQQNSQNNSKIIFE ------3333----------- >GELSOLIN; SWP:P06396; PDB:2FH1A; MDDDGTGQKQIWRIEGSNKVPVDPATYGQFYGGDSYIILYNYRHGGRQGQIIYNWQGAQS --------------!!!!----1111-----------------iiii---------1111 TQDEVAASAILTAQLDEELGGTPVQSRVVQGKEPAHLMSLFGGKPMIIYKGGTSREGGQT ----------------1111--------2222-3333---iiii---------------- APASTRLFQVRANSAGATRAVEVLPKAGALNSNDAFVLKTPSAAYLWVGTGASEAEKTGA ------------1111---------3333-1111-----1111-----1111-------- QELLRVLRAQPVQVAEGSEPDGFWEALGGKAAYRTSPRLKDKKMHPPRLFACSNKIGRFV --------------2222-3333-1111-------3333--------------1111--- IEEVPGELMQEDLATDDVMLLDTWDQVFVWVGKDSQEEEKTEALTSAKRYIETDTPITVV --------3333-1111--------------11113333-----------3333------ KQGFEPPSFVGWFLGWDDDYW 2222-33331111-------- >Signal recognition partic; SWP:P08240; PDB:2FH5A; HSMVDFFTIFSKGGLVLWCFQGVSDSCTGPVNALIRSVLLQETHEALTLKYKLDNQFELV ----------3333-----------------------1111-------------1111-- FVVGFQKILTLTYVDKLIDDVHRLFRDKYRTEIQQQSALSLLNGTFDFQNDFLRLLREAE ------3333----------------11111111--33331111---------------- ESSK 3333 >Signal 
recognition partic; SWP:P47758; PDB:2FH5B; RAVLFVGLCDSGKTLLFVRLLTGQYRDTQTSITDSSAIYKVNNNRGNSLTLIDLPGHESL -------2222-------------------------------3333----------1111 RFQLLDRFKSSARAVVFVVDSAAFQREVKDVAEFLYQVLIDSMALKNSPSLLIACNKQDI -------3333--------33331111-----------------------------3333 AMAKSAKLIQQQLEKELNTLRVTRSPAQLGKKGKEFEFSQLPLKVEFLECSAKSADIQDL ------------------3333--------1111--3333-------------------- EKWLAKIA -------- >RECEPTOR-TYPE TYROSINE-PR; SWP:Q13332; PDB:2FH7A; MLSHPPIPIADMAEHTERLKANDSLKLSQEYESIDPGQQFTWEHSNLEVNKPKNRYANVI -------3333----------%%%%-----1111-------3333-33331111-1111- AYDHSRVILQPIEGIMGSDYINANYVDGYRRQNAYIATQGPLPETFGDFWRMVWEQRSAT -3333------2222-1111---------------------1111--------------- IVMMTRLEEKSRIKCDQYWPNRGTETYGFIQVTLLDTIELATFCVRTFSLHKNGSSEKRE --------%%%%--------------!!!!---------------------2222----- VRQFQFTAWPDHGVPEYPTPFLAFLRRVKTCNPPDAGPIVVHCSAGVGRTGCFIVIDAML ------------------------------------------------------------ ERIKPEKTVDVYGHVTLMRSQRNYMVQTEDQYSFIHEALLEAVGCGNTEVPARSLYAYIQ -----------------11112222----------------3333-----3333------ KLAQVEPGEHVTGMELEFKRLAFISANLPCNKFKNRLVNIMPYESTRVCLQPIRGVEGSD ----------------------3333-11111111-1111--3333-----------111 YINASFIDGYRQQKAYIATQGPLAETTEDFWRMLWENNSTIVVMLTKLREMGREKCHQYW 1---------------------1111-----------------------%%%%------- PAERSARYQYFVVDPMAEYNMPQYILREFKVTDARDGQSRTVRQFQFTDWPEQGVPKSGE -------!!!!---------1111------------------------------------ GFIDFIGQVHKTKEQFGQDGPISVHCSAGVGRTGVFITLSIVLERMRYEGVVDIFQTVKM ------------------------------------------------------------ LRTQRPAMVQTEDEYQFCYQAALEYLGS 1111------------------------ >DNA REPAIR PROTEIN RHP9/C; SWP:P87074; PDB:2FHDA; HSRRSFKNRVLAFFKGYPSFYYPATLVAPVHSAVTSSIYKVQFDDATSTVNSNQIKRFFL ----3333----------------------------------1111----1111------ KKGDVVQSTRLGKIKHTVVKTFRSTNEQLSLIAVDALNNDVILAHGEIEVTVPISTIYVA 2222---1111------------------3333-1111--------------3333---3 PVNIRRFQGRDLSFSTLKDKFEETS 3331111-----3333--------- >GLUTATHIONE 
S-TRANSFERASE; SWP:P56598; PDB:2FHEA; PAKLGYWKIRGLQQPVRLLLEYLGEKYEEQIYERDDGEKWFSKKFELGLDLPNLPYYIDD --------------------------------1111------1111------------11 KCKLTQSLAILRYIADKHGMIGTTSEERARVSMIEGAAVDLRQGISRISYQPKFEQLKEG 11-------------1111-------------------------------1111------ YLKDLPTTMKMWSDFLGKNPYLRGTSVSHVDFMVYEALDAIRYLEPHCLDHFPNLQQFMS ---------------!!!!-1111---3333---------333311111111-------- RIEALPSIKAYMESNRFIKWPLNGWHAQFGGGDAPP ----------1111---------1111--------- >PULLULANASE; SWP:P07206; PDB:2FHFA; DVVVRLPDVAVPGEAVQASARQAVIHLVDIAGITSSTPADYATKNLYLWNNETCDALSAP -------------------------------1111-----1111------3333------ VADWNDVSTTPTGSDKYGPYWVIPLTKESGCINVIVRDGTNKLIDSDLRVSFSDFTDRTV --3333--------1111------------------------------------1111-- SVIAGNSAVYDSRADAFRAAFGVALADAHWVDKTTLLWPGGENKPIVRLYYSHSSKVAAD --2222-----3333--1111----------1111--3333------------------1 SNGEFSDKYVKLTPTTVNQQVSMRFPHLASYPAFKLPDDVNVDELLQGETVAIAAESDGI 111---------------------1111--------111133331111-------1111- LSSATQVQTAGVLDDTYAAAAEALSYGAQLTDSGVTFRVWAPTAQQVELVIYSADKKVIA ------------------------------1111------1111--------1111---- SHPMTRDSASGAWSWQGGSDLKGAFYRYAMTVYHPQSRKVEQYEVTDPYAHSLSTNSEYS --------------------2222----------1111--------1111---2222--- QVVDLNDSALKPEGWDGLTMPHAQKTKADLAKMTIHESHIRDLSAWDQTVPAELRGKYLA ---111111112222-----------------------3333-1111---1111--3333 LTAQESNMVQHLKQLSASGVTHIELLPVFDLATVNEFSDKVADIQQPFSRLCEVNSAVKS --1111---------1111-----------------3333--11113333-----3333- SEFAGYCDSGSTVEEVLTQLKQNDSKDNPQVQALNTLVAQTDSYNWGYDPFHYTVPEGSY 1111-1111-----------11111111-----------------------------111 ATDPEGTARIKEFRTMIQAIKQDLGMNVIMDVVYNHTNAAGPTDRTSVLDKIVPWYYQRL 1-----3333---------------------------------11111111-2222---- NETTGSVESATCCSDSAPEHRMFAKLIADSLAVWTTDYKIDGFRFDLMLYHPKAQILSAW --------3333----1111---------------------------------------- ERIKALNPDIYFFGEGWDSNQSDRFEIASQINLKGTGIGTFSDRLRDAVRGGGPFDSGDA ------1111----------3333----33332222----------------1111!!!! 
LRQNQGVGSGAGVLPNELTTLSDDQARHLADLTRLGMAGNLADFVLIDKDGAVKRGSEID -----3333--------------------------1111-1111---1111---3333-- YNGAPGGYAADPTEVVNYVSKHDNQTLWDMISYKAAQEADLDTRVRMQAVSLATVMLGQG iiii------1111---------------------1111-----------33331111-- IAFDQQGSELLRSKSFTRDSYDSGDWFNRVDYSLQDNNYNVGMPRSSDDGSNYDIIARVK ----2222-----iiii-------1111----------------33331111------11 DAVATPGETELKQMTAFYQELTALRKSSPLFTLGDGATVMKRVDFRNTGADQQTGLLVMT 11-------------------------3333-----------------11112222---- IDDGMQAGASLDSRVDGIVVAINAAPESRTLQDFAGTSLQLSAIQQAAGDRSLASGVQVA ---1111----1111--------------------------------!!!!1111----1 ADGSVTLPAWSVAVLELPQGESQGAGLPVSSK 111----------------------------- >Proteasome (Beta subunit); SWP:O33245; PDB:2FHGH; TTIVALKYPGGVVMAGDRRSTQGNMISGRDVRKVYITDDYTATGIAGTAAVAVEFARLYA -------3333-------------------------------------3333-------- VELEHYEKLEGVPLTFAGKINRLAIMVRGNLAAAMQGLLALPLLAGYDIHASDPQSAGRI ------------------------------3333-------------1111--------- VSFDAAGGWNIEEEGYQAVGSGSLFAKSSMKKLYSQVTDGDSGLRVAVEALYDAADDDSA ---1111--------------3333-------3333-----------------1111111 TGGPDLVRGIFPTAVIIDADGAVDVPESRIAELARAIIESRS 1----1111--------3333--------------------- >20S PROTEASOME, ALPHA AND; SWP:NA; PDB:2FHH1; SPEQAMRERSELARKGIARAKSVVALAYAGGVLFVAENPSRSLQKISELYDRVGFAAAGK ---------------------------1111----------------------------- FNEFDNLRRGGIQFADTRGYAYDRRDVTGRQLANVYAQTLGTIFTEQAKPYEVELCVAEV 3333------------------3333---------------------------------- AHYGETKRPELYRITYDGSIADEPHFVVMGGTTEPIANALKESYAENASLTDALRIAVAA --------------1111------------------------------3333-------- LRAGSLGVASLEVAVLDANRPRRAFRRITGSALQALLVDQ 3333------------3333---------3333------- >20S PROTEASOME, ALPHA AND; SWP:NA; PDB:2FHH2; TTIVALKYPGGVVMAGDRRSTQGNMISGRDVRKVYITDDYTATGIAGTAAVAVEFARLYA -------3333-------------------------------------3333-------- VELEHYEKLEGVPLTFAGKINRLAIMVRGNLAAAMQGLLALPLLAGYDIHASDPQSAGRI -------------------------------3333------------------------- 
VSFDAAGGWNIEEEGYQAVGSGSLFAKSSMKKLYSQVTDGDSGLRVAVEALYDAADDDSA ---1111------------1111---------3333---------------------111 TGGPDLVRGIFPTAVIIDADGAVDVPESRIAELARAIIESRS 1----1111--------1111----3333------------- >PROBABLE ACYLPHOSPHATASE; SWP:O35031; PDB:2FHMA; MLQYRIIVDGRVQGVGFRYFVQMEADKRKLAGWVKNRDDGRVEILAEGPENALQSFVEAV -------------------------1111-------1111-------------------3 KNGSPFSKVTDISVTESRSLEGHHRFSIVYS 333---------------------------- >METHYLASE, PUTATIVE; SWP:Q831P8; PDB:2FHPA; ARVISGEYGGRRLKALDGDNTRPTTDKVKESIFNIGPYFDGGALDLYSGSGGLAIEAVSR -----1111--------------------------------------!!!!--------- GDKSICIEKNFAALKVIKENIAITKEPEKFEVRKDANRALEQFYEEKLQFDLVLLDPPYA ---------3333--------33333333--------------1111----------111 KQEIVSQLEKLERQLLTNEAVIVCETDKTVKLPETIGTLKKTRETVYGITQVTIYRQ 1---------1111--1111------1111-----!!!!-------!!!!------- >PUTATIVE GENERAL STRESS P; SWP:Q8A7U5; PDB:2FHQA; TKTKEKAVELLQKCEVVTLASVNKEGYPRPVPSKIAAEGISTIWSTGADSLKTIDFLSNP ----------1111--------1111--------------------1111---------- KAGLCFQEKGDSVALGEVEVVTDEKLKQELWQDWFIEHFPGGPTDPGYVLLKFTANHATY -------------------------------1111---1111--1111------------ WIEGTFIHKKL -%%%%------ >VIRAL MACROPHAGE INFLAMMA; SWP:Q98157; PDB:2FHTA; SWHRPDKCCLGYQKRPLPQVLLSSWYPTSQLCSKPGVIFLTKRGRQVCADKSKDWVKKLM -----------------3333-------1111--------3333-----3333------- QQLPVTAR -------- >SPM-1; SWP:Q8G9Q0; PDB:2FHXA; DHVDLPYNLTATKIDSDVFVVTDRDFYSSNVLVAKMLDGTVVIVSSPFENLGTQTLMDWV -----%%%%-----2222-----------------1111--------------------- AKTMKPKKVVAINTHFHLDGTGGNEIYKKMGAETWSSDLTKQLRLEENKKDRIKAAEFYK --------------------1111---1111----------------------------- NEDLKRRILSSHPVPADNVFDLKQGKVFSFSNELVEVSFPGPAHSPDNVVVYFPKKKLLF --------1111--------3333-----iiii--------------------------- GGMIKPKELGYLGDANVKAWPDSARRLKKFDAKIVIPGHGEWGGPEMVNKTIKVAEKAVG -----------1111-1111---------------------------------------- EMRL ---- >COLICIN-E5 IMMUNITY PROTE; SWP:P13476; PDB:2FHZA; KLFEHTVLYDSGDAFFELKGNASMKLSPKAAIEVCNEAAKKGLWILGIDGGHWLNPGFRI 
---1111---3333-1111-------------------1111------------------ DSSASWTYDMPEEYKSKIPENNRLAIENIKDDIENGYTAFIITLKM 3333---------1111---------------1111---------- >CONSERVED DOMAIN PROTEIN; SWP:Q8CZ42; PDB:2FI0A; VVMDNIIDVSIPVAEVVDKHPEVLEILVELGFKPLANPLMRNTVGRKVSLKQGSKLAGTP -----------3333----3333--3333--3333-3333---1111------------3 MDKIVRTLEANGYEVIGLD 333-----1111------- >HYDROLASE, HALOACID DEHAL; SWP:Q97RK1; PDB:2FI1A; MKYHDYIWDLGGTLLDNYETSTAAFVETLALYGITQDHDSVYQALKVSTPFAIETFAPNL -----------------------------1111--------------------------2 ENFLEKYKENEARELEHPILFEGVSDLLEDISNQGGRHFLVSHRNDQVLEILEKTSIAAY 222-----------------2222-------1111---------------------3333 FTEVVTSSSGFKRKPNPESMLYLREKYQISSGLVIGDRPIDIEAGQAAGLDTHLFTSIVN -----3333---------------1111---------3333----1111----------- LRQVLDI ------- >ZINC FINGER PROTEIN 42; SWP:P28698; PDB:2FI2A; GSDPGPEAARLRFRCFHYEEATGPQEALAQLRELCRQWLRPEVRSKEQMLELLVLEQFLG ------------------3333-------------------------------------- ALPPEIQARVQGQRPGSPEEAAALVDGLRREPGG ---------------------------------- >OUTER MEMBRANE PROTEIN; SWP:Q8RIU4; PDB:2FI9A; HFPGRAPIDAYGNGGFRFADMSHRGSIICIPSGIYGIDMTGPVPTQEDISRVLEESDQIE -----------------%%%%--------1111-----------3333------3333-- VLLIGTGVELLRLPEELRVLLWEKRISSDTMSTGAAVRTFNVLLAEDRAVAALLFAVE ---------------------1111------------------1111----------- >ACETYLTRANSFERASE; SWP:Q833M5; PDB:2FIAA; MKIRVADEKELPMILQFLTEVKAYMDVVGITQWTKDYPSQGDIQEDITKKRLYLLVHEEM ------1111-3333------------------1111-3333----1111------!!!! 
IFSMATFCMEQEQDFVWLKRFATSPNYIAKGYGSLLFHELEKRAVWEGRRKMYAQTNHTN -----------------------33331111-------------------------1111 HRMIRFFESKGFTKIHESLQMNRLDFGSFYLYVKELE -------1111--------22221111---------- >FIBRINOGEN; SWP:P02679; PDB:2FIBA; VQIHDITGKDCQDIANKGAKQSGLYFIKPLKANQQFLVYCEIDGSGNGWTVFQKRLDGSV ---------3333-1111----------1111----------1111-------------- DFKKNWIQYKEGFGHLSPTGTTEFWLGNEKIHLISTQSAIPYALRVELEDWNGRTSTADY ----------------1111-------------1111----------------------- AMFKVGPEADKYRLTYAYFAGGDAGDAFDGFDFGDDPSDKFFTSHNGMQFSTWDNDNDKF ------3333-----------3333-1111--------3333--2222---1111----- EGNCAEQDGSGWWMNKCHAGHLNGVYYQGGTYSKASTPNGYDNGIIWATWKTRWYSMKKT --------------------------2222--3333----------3333---------- TMKIIPFNRL -----3333- >T-CELL SURFACE GLYCOPROTE; SWP:P11609; PDB:2FIKA; YTFRCLQMSSFANRSWSRTDSVVWLGDLQTHRWSNDSATISFTKPWSQGKLSNQQWEKLQ ------------------------!!!!-----1111------1111!!!!--------- HMFQVYRVSFTRDIQELVKYPIEIQLSAGCEMYPGNASESFLHVAFQGKYVVRFWGTSWQ ---------------------------------------------iiii----------- TVPGAPSWLDLPIKVLNADQGTSATVQMLLNDTCPLFVRGLLEAGKSDLEKQEKPVAWLS -22223333--------------------------------------1111--------- SVPSSAHGHRQLVCHVSGFYPKPVWVMWMRGDQEQQGTHRGDFLPNADETWYLQATLDVE -----2222--------------------!!!!-1111---------------------- AGEEAGLACRVKHSSLGGQDIILYW ---2222-----1111--------- >TUBBY RELATED PROTEIN 1; SWP:O00294; PDB:2FIMA; PREFVLRPAPQGRTVRCRLTRDKYPSYFLHLDTEKKVFLLAGRKRKRSKTANYLISIDPT 3333-----2222--------------------------------------------111 NNFIGKLRSNLLGNRFTVFDNGQNPQRGYSTNVASLRQELAAVIYERRMTVIIPGMSAEN 1--------3333----------333311113333---------------------1111 ERVPIRPRNASDGLLVRWQNKTLESLIELHNKPPVWNDDSGSYTLNFQGRVTQASVKNFQ ----------------------1111-----------3333-----iiii----1111-- IVHADDPDYIVLQFGRVAEDAFTLDYRYPLCALQAFAIALSSFD --1111---------------------------------3333- >LATE GENES ACTIVATOR; SWP:P03682; PDB:2FIPA; PKTQRGIYHNLKESEYVASNTDVTFFFSSELYLNKFLDGYQEYRKKFNKKIERVAVTPWN 
--1111---3333----------------------------------------------- MDMLADITFYSEVEKRGFHAWLKGDNATWREVHVYALRIMTKPNTLDWSRIQKPR ---------------------iiii-------------1111------------- >PUTATIVE TAGATOSE 6-PHOSP; SWP:Q7ACL2; PDB:2FIQA; KTLIARHKAGEHIGICSVCSAHPLVIEAALAFDRNSTRKVLIEATSNQVNQFGGYTGTPA ------1111-------------------3333-----------3333-1111------- DFREFVFAIADKVGFARERIILGGDHLGPNCWQQENVDAAEKSVELVKAYVRAGFSKIHL ---------------3333---------1111---3333--------------------- DASSCAGDPIPLAPETVAERAAVLCFAAESVATDCQREQLSYVIGTEVPVVHITHVEDAA ----2222----3333--------------------1111-------------------- NTLRTHQKAFIARGLTEALTRVIAIVVQPGVEFDHSNIIHYQPQEAQALAQWIENTRVYE ----------1111-3333----------------------3333--------------- AHSTDYQTRTAYWELVRDHFAILKVGPALTFALREAIFALAQIEQELIAPENRSGCLAVI ---2222------------------3333-------------------3333-------- EEVLDEPQYWKKYYRTGFNDSLLDIRYSLSDRIRYYWPHSRIKNSVETVNLQGVDIPLGI ------1111-----------------3333----3333----------3333------- SQYLPKQFERIQSGELSAIPHQLIDKIYDVLRAYRYGCAE ------------------3333------------------ >CONSERVED HYPOTHETICAL PR; SWP:Q8UIJ7; PDB:2FIUA; AKGYWIAQVDVRDSERYKDYVSTAKPAFERFGANFLARGGSVTELEGTARARNVVIEFPS ------------3333------------1111---------------------------- VQHAIDCYNSPEYQAAAKIRQEVADAEIVEGIG --------------------------------- >GCN5-related N-acetyltran; SWP:NA; PDB:2FIWA; VSTPALRPYLPEDAAVTAAIFVASIEQLTADDYSEEQQEAWASAADDEAKFAARLSGQLT ---------3333---------------1111---------3333---------1111-- LIATLQGVPVGFASLKGPDHIDLYVHPDYVGRDVGTTLIDALEKLAGARGALILTVDASD ----iiii-----------------1111-----------------1111--------33 NAAEFFAKRGYVAKQRNTVSINGEWLANTTTKSL 33----1111----------iiii---------- >PROTEIN FDHE HOMOLOG; SWP:Q9HV00; PDB:2FIYA; PHLHQPSRDLFARRGERLLQLAEGHPMGDYLRLVAGLCRLQQALLDNPPALAPLDPERLR ------1111-----------2222----------------------------------- KSREHGMPPLAYDLLVREGAWLPWLDALLAGYPAPANAAVGAALEQLREAEEGQRKAWAI --1111------------1111------1111---------------------------- ALLSGQFDLLPAALVPFLGAALQVAWSHWLLGLEEGAVVETESRTLCPACGSPPMAGMIR 
-11111111-3333---------------1111--------------------------- QTGLRYLSCSLCACEWHYVRIKCSHCEESKHLAYLSLEHGQPAEKAVLRAETCPSCQGYL ------------------------------------------------------------ KQFYLEFDRHADALADDLASLALDMRLAEDGYLRRSPNLLLAPGG ---333311113333----------------------1111---- >JUVENILE HORMONE ESTERASE; SWP:Q7M4E5; PDB:2FJ0A; EEVVVRTESGWIRGLKRRAEGNKSYASFRGVPYAKQPLGELRFKELQPLEPWQDELDATQ ------1111----------------------------1111------------------ EGPVCQQTDVLYGRIMRPRGMSEACIHANIHVPYYALPRDGLPVLVFIHGGGFAFGSGDS -----------!!!!-----------------3333--------------iiii------ DLHGPEYLVSKDVIVITFNYRLNVYGFLSLNSTSVPGNAGLRDMVTLLKWVQRNAHFFGG ----3333---------------1111--------------------------3333--- RPDDVTLMGQSAGAAATHILSLSKAADGLFRRAILMSGTSSSAFFTTNPVFAQYINKLFV 1111------------------1111------------1111------------------ TNIGITATDPEEIHQKLIEMPAEKLNEANRFLLEQFGLTTFFPVVESPINGVTTILDGDP 1111-------------------------------------------------------- EQLIAKGRGKHIPLIIGFTDAECEIFRRQFEQIDIVSKIKENPGILVPLSVLFSSAPDTV --------1111----------3333---------------1111--3333--------- AEITKAMHEKYFKKSVDMEGYIELCTDSYFMYPAISLAIKRARSNGAPVYLYQFSFDGDY ------------------------------------------------------------ SVFREVNHLNFEGAGHIEDLTYVFRTNSMLGGHASFPPHDKDDHMKYWMTSFITNFMKYS 33331111------22221111---1111------------------------------- NPVTDAKLWPEVRADNLRYQDIDTPDVYQNVKPHSEQRDMLDFFDSIYNW ----1111----3333-------2222------------------1111- >MAJOR PRION PROTEIN; SWP:Q95211; PDB:2FJ3A; LGGYMLGSAMSRPLIHFGNDYEDRYYRENMYRYPNQVYYRPVDQYSNQNSFVHDCVNITV ---------------------------1111----------------------------- KQHTVTTTTKGENFTETDIKIMERVVEQMCITQYQQESQAAYQRA --------------3333--------------------------- >HYPOTHETICAL UPF0346 PROT; SWP:O31864; PDB:2FJ6A; MKSFYHYLLKYRHPKPKDSISEFANQAYEDHSFPKTSTDYHEISSYLELNADYLHTMATF -------------------------33331111----------------3333--3333- DEAWDQYESEVHGRLEHHHHHH ---------------------- >BOWMAN-BIRK TYPE TRYPSIN ; SWP:P12940; PDB:2FJ8A; KRPWKCCDEAVCTRSIPPICTCMDEVFECPKTCKSCGPSMGDPSRRICQDQYVGDPGPIC 
-----------------------------1111-----%%%%------------------ RPWECCDKAICTRSNPPTCRCVDEVKKCAPTCKTCLPSRSRPSRRVCIDSYFGPVPPRCT ----------------------------1111--------1111---------------- >ANTIGEN TPF1; SWP:P16665; PDB:2FJCA; PDARAIAAICEQLRQHVADLGVLYIKLHNYHWHIYGIEFKQVHELLEEYYVSVTEAFDTI -3333------------------------------1111--------------------- AERLLQLGAQAPASMAEYLALSGIAEETEKEITIVSALARVKRDFEYLSTRFSQTQVLAA ------------------------------------------------------------ ESGDAVTDGIITDILRTLGKAIWMLGATLKA ------------------------------- >IGHG1 protein; SWP:Q569F4; PDB:2FJFH; EVQLVESGGGLVQPGGSLRLSCAASGFTISDYWIHWVRQAPGKGLEWVAGITPAGGYTYY ------------2222-----------1111--------2222--------2222----- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCARFVFFLPYAMDYWGQGTLVTVSS 1111--------3333----------1111------------------------------ ASTKGPSVFPLAPSSGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSL -----------------------------------%%%%--2222--------------- SSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC ------3333------------1111------------ >IGHG1 protein; SWP:Q569F4; PDB:2FJHH; EVQLVESGGGLVQPGGSLRLSCAASGFTINASWIHWVRQAPGKGLEWVGAIYPYSGYTNY ---------------------------3333--------2222--------3333----- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCARWGHSTSPWAMDYWGQGTLVTVS 3333--------3333----------1111---------3333----------------- SASTKGPSVFPLAPSSGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS ------------------------------------%%%%--2222-------3333--- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPKSC -------3333---------------------------- >IGKC protein; SWP:Q6GMX8; PDB:2FJHL; DIQMTQSPSSLSASVGDRVTITCRASQVIRRSLAWYQQKPGKAPKLLIYAASNLASGVPS -------------2222-------------------------------------222233 RFSGSGSGTDFTLTISSLQPEDFATYYCQQSNTSPLTFGQGTKVEIKRTVAAPSVFIFPP 33----------------1111-------------------------------------- SDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLT 3333-------------------------iiii--------------------------- LSKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC -3333------------1111------------- >Exocyst complex component; 
SWP:P32844; PDB:2FJI1; GSHMGDKEKETLFKDYLNLIVVKMTEWIGNLEKAEFDVFLERSTPPHSDSDGLLFLDGTK ------------------------------------------------1111-------- TCFQMFTQQVEVAAGTNQAKILVGVVERFSDLLTKRQKNWISKISEEIKKQINYNHKYDI -----------------3333---------------------------------1111-- DPESITPEDECPGGLVEYLIAVSNDQMKAADYAVAISSKYGKLVSKVYEKQITNHLEGTL 3333-3333-------------------------------1111---------------- DGFAEVAQCSSLGLITLMFDDLRKPYQEIFSKTWYMGSQAQQIADTLDEYLLDIKPQMNS -----------------3333-3333-22223333------------------3333-33 VLFVNFIDNVIGETIIKFLTALSFEHSFKNKNNKFLEAMKRDFEIFYQLFVKVLDGNESK 33------------------3333------%%%%-------------------2222--- DTLITQNFTVMEFFMDLSCEPIDSILDIWQKYLEVYWDSRIDLLVGILKCRKDVSSSERK --------------------3333-----------1111-----------1111------ KIVQQATEMLHEYRRNMEANGVDREPTLMRRFVLEFEKQ --------------------------1111--------- >U1 SMALL NUCLEAR RIBONUCL; SWP:P43332; PDB:2FJJA; MEMLPNQTIYINNLNEKIKKEELKKSLYAIFSQFGQILDIVALKTLKMRGQAFVIFKEIG --------------------------------------------1111------------ SASNALRTMQGFPFYDKPMQIAYSKSDSDIVAKIKGTFKERPKK --------2222-------------3333--------------- >FRUCTOSE-BISPHOSPHATE ALD; SWP:Q703I2; PDB:2FJKA; LVTGLEILRKARAEGYGVGAFNTNNEFTQAILEAAEEKSPVILALSEGAKYGGRALTRVV ---------------------------------------------3333---3333---- ALAQEARVPVAVHLDHGSSYESVLKALREGFTSVIDKSHEDFETNVRETKRVVEAAHAVG -3333---------------------1111------3333----------------1111 VTVEAELGRLAGIEEHVAVDEKDALLTNPEEARIFERTGADYLAVAIGTSHGAYKGKGRP ------------------------------------------------------------ FIDHPRLARIAKLVPAPLVLHGASAVPQELVERFRAAGGEIGEASGIHPEDIKKAISLGI ------------------------------------------------------------ AKINTDTDLRLAFTALVRETLGKNPKEFDPRKYLGPAREAVKEVVKSRELFGSVGRA -----------------------1111-3333----------------11112222- >1-phosphatidylinositol-4,; SWP:P10686; PDB:2FJLA; SIKNGILYLEDPVNHEWYPHYFVLTSSKIYYSEETSSDQGNEDEEEPKEASGSTELHSSL ------------------------------------------------------------ EVLFQGPNPAILEPEREHLDENSPLGDLLRGVLDVPACQIAIRPEGKNNRLFVFSISMPS 
3333-------------------------------------------------------- VAQWSLDVAADSQEELQDWVKKIREVAQTA --------------------------1111 >REPRESSOR PROTEIN CI; SWP:P08707; PDB:2FJRA; DSLGWSNVDVLDRICEAYGFSQKIQLANHFDIASSSLSNRYTRGAISYDFAAHCALETGA ---------------------3333--1111-3333------------------------ NLQWLLTGEGEAFVNNRESSDAKRIEGFTLSEEILKSDKQLSVDAQFFTKPLTDGMAIRS ------------------------------%%%%---------3333------------% EGKIYFVDKQASLSDGLWLVDIKGAISIRELTKLPGRKLHVAGGKVPFECGIDDIKTLGR %%%------------------iiii-------------------------1111------ VVGVYSEVN --------- >ADENYLYL CYCLASE CLASS IV; SWP:Q8ZGU9; PDB:2FJTA; SEHFVGKYEVELKFRVMDLTTLHEQLVAQKATAFTLNNHEKDIYLDANGQDLADQQISMV --------------------------------------------------3333------ LREMNPSGIRLWIVKGPGAEREASNIEDVSKVQSMLATLGYHPAFTIEKQRSIYFVGKFH ------------------------------------1111---------------!!!!- ITVDHLTGLGDFAEIAIMTDDATELDKLKAECRDFANTFGLQVDQQEPRSYRQLLGF --------------------1111------------1111-3333------------ >Phospholipase C, beta 2 v; SWP:Q59F77; PDB:2FJUB; PKVKAYLSQGERFIKWDDETTVASPVILRVDPKGYYLYWTYQSKEMEFLDITSIRDTRFG ---3333-----------------------1111------1111-----1111-----!! 
KFAKMPKSQKLRDVFNMDFPDNSFLLKTLTVVSGPDMVDLTFHNFVSYKENVGKAWAEDV !!-------------1111---3333---------------------------------- LALVKHPLTANASRSTFLDKILVKLKMQLNSEGKIPVKNFFQMFPADRKRVEAALSACHL --11113333--3333-------------1111--3333----------------1111- PKGKNDAINPEDFPEPVYKSFLMSLCPRPEIDEIFTSYMTKEHLTKFINQKQRDQGLIDK --------3333---------------33333333------------------------- YEPSGQLSPEGMVWFLCGPENSVLAQDKLLLHHDMTQPLNHYFINSSHNTYLTAGQFSGL -----------------3333-----3333-------1111-------3333-------- SSAEMYRQVLLSGCRCVELDCWKGKPPDEEPIITHGFTMTTDIFFKEAIEAIAESAFKTS ---------1111------------------------------3333-------1111-- PYPIILSFENHVDSPRQQAKMAEYCRTIFGDMLLTEPLEKFPLKPGVPLPSPEDLRGKIL ----------------------------!!!!-----1111--2222---33332222-- IKNKKEESGNLDEEEIKKMQSDEGTAGLEVTAYEEMSSLVNYIQPTKFVSFEFSAQKNRS -----------3333---3333----------33331111------------------11 YVISSFTELKAYDLLSKASVQFVDYNKRQMSRIYPKGTRMDSSNYMPQMFWNAGCQMVAL 11---------------------------------3333-----------1111------ NFQTMDLPMQQNMAVFEFNGQSGYLLKHEFMRRPTTLSITVISGQFLSERSVRTYVEVEL 1111-------------%%%%------3333----------------------------- FGLPGDPKRRYRTKLSPSTNSINPVWKEEPFVFEKILMPELASLRVAVMEEGNKFLGHRI --1111-------------------------------3333------------------- IPINALNSGYHHLCLHSESNMPLTMPALFIFLEMKD -3333-----------1111---------------- >PROTEIN E6; SWP:P03126; PDB:2FK4A; AMSYSLYGTTLEQQYNKPLSDLLIRCINCQKPLSPEEKQRHLDKKQRFHNIRGRWTGRCM 3333---3333-3333-3333-----------------------------2222--3333 SCSRSSR ------- >FUCULOSE-1-PHOSPHATE ALDO; SWP:Q72HM7; PDB:2FK5A; RARLYAAFRQVGEDLFAQGLISATAGNFSVRTKGGFLITKSGVQKARLTPEDLLEVPLEG ---------------1111--!!!!------3333--------3333-1111-------- PIPEGASVESVVHREVYRRTGARALVHAHPRVAVALSFHLSRLRPLDLEGQHYLKEVPVL --22221111-------------------------3333--------------------- APKTVSATEEAALSVAEALREHRACLLRGHGAFAVGLKEAPEEALLEAYGLMTTLEESAQ --------------------------2222------------------------------ ILLYHRLWQGAGPAL --------1111--- >METHOXY MYCOLIC ACID SYNT; SWP:Q79FX8; PDB:2FK8A; 
IQAHYDVSDDFFALFQDPTRTYSCAYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDI 3333-------1111-3333--------1111------------3333---2222----- GCGWGTTMRRAVERFDVNVIGLTLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVD -!!!!------------------------------1111-----------3333------ RIVSIEAFEHFGHENYDDFFKRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARF ------3333-3333------------1111--------------3333----------- IKFIVTEIFPGGRLPSTEMMVEHGEKAGFTVPEPLSLRPHYIKTLRIWGDTLQSNKDKAI --------2222------------1111-------------------------------- EVTSEEVYNRYMKYLRGCEHYFTDEMLDCSLVTYLKPGAAA -----------------------------------1111-- >PROTEIN KINASE C, ETA TYP; SWP:Q8NE03; PDB:2FK9A; TMKFNGYLRVRIGEAVGLQPTRWSLRHSLFKKGHQLLDPYLTVSVDQVRVGQTSTKQKTN --------------------3333--1111--------------iiii------------ KPTYNEEFCANVTDGGHLELAVFHETPLGYDHFVANCTLQFQELLRTTGASDTFEGWVDL -----------------------------------------------!!!!--------- EPEGKVFVVITLT ------------- >PUTATIVE NUDIX HYDROLASE ; SWP:P65556; PDB:2FKBA; STEWVDIVNEENEVIAQASREQRAQCLRHRATYIVVHDGGKILVQRRTETKDFLPGLDAT --------1111------33331111---------------------------------- AGGVVQADEQLLESARREAEEELGIAGVPFAEHGQFYFEDKNCRVWGALFSCVSHGPFAL -----2222------------------------------1111----------------- QEDEVSEVCWLTPEEITARCDEFTPDSLKALALWKRN 1111--------------3333--------------- >R.HINP1I RESTRICTION ENDO; SWP:Q5I6E6; PDB:2FKCA; MNLVELGSKTAKDGFKNEKDIADRFENWKENSEAQDWLVTMGHNLDEIKSVKAVVLSGYK -------------------------------------------3333------------- SDINVQVLVFYKDALDIHNIQVKLVSNKRGFNQIDKHWLAHYQEMWKFDDNLLRILRHFT --------3333-------------------------3333------------------- GELPPYHSNTKDKRRMFMTEFSQEEQNIVLNWLEKNRVLVLTDILRGRGDFAAEWVLVAQ ----------------3333----------------------------1111-------- KVSNNARWILRNINEVLQHYGSGDISLSPRGSINFGRVTIQRKGGDNGRETANMLQFKID ---------------------------1111---!!!!------%%%%------------ PTELFDI --1111- >PROTEIN YJBR; SWP:P0AF50; PDB:2FKIA; MTISELLQYCMAKPGAEQSVHNDWKATQIKVEDVLFAMVKEVENRPAVSLKTSPELAELL -------3333------------------------------%%%%--------------- 
RQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASYQQAVNLLPEEKRKLLVQL -------------------------------------------1111-------3333 >BASEPLATE STRUCTURAL PROT; SWP:P10928; PDB:2FKKA; LYVSQGPGVDISGDVNLTDFDKIGWPNVEAVQSYQREFNAVSNIFDTIYPIGTIYENAVN ---------------------------------------3333------2222------- PNNPVTYMGFGSWKLFGQGKVLVGWNEDISDPNFALNNNDLDSGGNPSHTAGGTGGSTSV ----------------2222-------1111-----1111-1111--------------- TLENTNLPATETDEEVLIVDENGSVIVYTKYREAKASTNSTHTPPTSITNIQPYITVYRW --3333-------------1111---------------1111------------------ IRIA ---- >UROCANATE HYDRATASE; SWP:P25503; PDB:2FKNA; SIRANRGTELECLGWEQEAVLRMLRNNLDPEVAEKPEDLIVYGGIGKAARDWDAFHAIEH ----------------------------3333--3333---------------------- SLKTLKNDETLLVQSGKPVGMFRTHPQAPRVLLANSVLVPKWADWEHFHELEKKGLMMYG -------------%%%%-------1111----------3333------------------ QMTAGSWIYIGSQGILQGTYETFAELARQHFGGSLKGTLTLTAGLGGMGGAQPLSVTMNE --1111----3333----------------iiii2222---------3333-----1111 GVVIAVEVDEKRIDKRIETKYCDRKTASIEEALAWAEEAKLAGKPLSIALLGNAAEVHHT ----------------1111--------------------------------3333---- LLNRGVKIDIVTDQTSAHDPLIGYVPEGYSLDEADRLRQDTPELYVRLAKQSMKKHVEAM ---------------3333------2222------------------------------- LAFQQKGSIVFDYGNNIRQVAKDEGLENAFDFPGFVPAYIRPLFCEGKGPFRWAALSGDP ---1111--------3333--11111111----------33331111-------3333-- ADIYRTDALLKELFPTNKALHRWIDMAQEKVTFQGLPSRICWLGYGERKKMGLAINELVR -------------1111------------------------------------------- TGELKAPVVIGRDHLDCGSVASPNRETEAMKDGSDAVGDWAVLNALVNTAAGASWVSFHH ---------------------1111----11111111----------------------- GGGVGMGYSLHAGMVAVADGSELADERLARVLTSDPGMGIIRHADAGYERAVEVAKEQDI ----2222---------------------1111-3333---------------------- IVPM ---- >BACTERIOFERRITIN; SWP:P22759; PDB:2FKZA; MKGDKIVIQHLNKILGNELIAINQYFLHARMYEDWGLEKLGKHEYHESIDEMKHADKLIK --------------------------------1111------------------------ RILFLEGLPNLQELGKLLIGEHTKEMLECDLKLEQAGLPDLKAAIAYCESVGDYASRELL ------------------------------------3333--------1111-------- 
EDILESEEDHIDWLETQLDLIDKIGLENYLQSQMD ----------------------------------- >RED FLUORESCENT PROTEIN Z; SWP:Q8T4U4; PDB:2FL1A; SAHGLTDDMTMHFRMEGCVDGHKFVIEGNGNGNPFKGKQFINLCVIEGGPLPFSEDILSA -iiii-------------iiii----------3333-----------------3333111 AFNRLFTEYPEGIVDYFKNSCPAGYTWHRSFRFEDGAVCICSADITVNVRENCIYHESTF 1-3333---1111-3333--------------1111------------1111-------- YGVNFPADGPVMKKMTTNWEPSCEKIIPINSQKILKGDVSMYLLLKDGGRYRCQFDTIYK -----1111-------------------3333------------1111------------ AKTEPKEMPDWHFIQHKLNREDRSDAKNQKWQLIEHAIASRSALP ------------------------1111----------------- >SPERMINE/SPERMIDINE ACETY; SWP:NA; PDB:2FL4A; GMEIHFEKVTSDNRKAVENLQVFAEQQAFIESMAENLKESDQFPEWESAGIYDGNQLIGY ---------3333---1111-2222-----------------1111------!!!!---- AMYGRWQDGRVWLDRFLIDQRFQGQGYGKAACRLLMLKLIEKYQTNKLYLSVYDTNSSAI ------------------3333------------------------------1111---- RLYQQLGFVFNGELDTNGERVMEWTHQ ---1111-------1111--------- >IGHG1 protein; SWP:Q6PJA4; PDB:2FL5H; EVQLVESGGGLVQPGESLKLSCTASGFSLSNYYMTWVRQAPGKGLEWVTNIRPDETEKFY ---------------------------3333--------2222----------------- SDSVKGRFTVSRDNARNSLFNSMSLQ 3333--------3333---------- >IGL@ protein; SWP:Q5FWF9; PDB:2FL5L; SYELKQPPSVSVSPGQTARITCSGDVLPKKYAYWYQERSGQAPVLVVYEDSGRPSEIPER ------------2222-------1111------------------------------333 FSGSSSGTKATLTISGAQVEDEADYYCYSDISNGYPLFGGGTKLSVGQPKAAPSVTLFPP 3----------------1111--------------------------------------- SSEELQANKATLVCLISDFYPGAVTVAWKADSSPIKAGVETTTPSKQSNNKYAASSYLSL 33331111---------------------iiii--2222--------------------- TPEQWKSHRSYSCQVTHEGSTVEKTVAPT 3333------------------------- >REGULATORY PROTEIN SIR3; SWP:P06701; PDB:2FL7A; LDGWQVIITDDQGRVIENVFLKRISDGLSFGKGESVIFNDNVTETYSVYLIHEIRLVVEI 2222-------------------------------------------------------- WVFSYLRWFELKPKLYYEQFRPDLIKEDHPLEFYKDKFFNEVNKSELYLTAELSEIWLKD ------1111--------------1111-3333---------1111----------3333 FIAVGQILPESQWIEDRDFLVRYACEPTAEKFVPIDIFQIIRRVKEMEPKQSDEYLKRVS 
--------3333-2222--------1111----------------------------111 VPV 1-- >DNA ENDONUCLEASE I-MSOI; SWP:Q8WKW7; PDB:2FLDA; TLQPTEAAYIAGFLDGDGSIYALLIPRPDYKDIKYQVSLAISFIQRKDKFPYLQDIYDQL --3333--------------------3333---------------3333--------111 GKRGNLRKDRGDGIADYRIIGSTHLSIILPDLVPYLRIKKKQANRILHIINLYPQAQKNP 1--------------------3333--33333333-1111------------1111---- SKFLDLVKIVDDVQNLNKRADELKSTNYDRLLEEFLKAGKI --------------11111111-------------1111-- >CYTOKININ-SPECIFIC BINDIN; SWP:Q9ZWP8; PDB:2FLHA; MVKEFNTQTELSVRLEALWAVLSKDFITVVPKVLPHIVKDVQLIEGDGGVGTILIFNFLP -------------------------1111-------------------2222------11 EVSPSYQREEITEFDESSHEIGLQVIEGGYLSQGLSYYKTTFKLSEIEEDKTLVNVKISY 11-------------1111--------!!!!----------------------------- DHVTPTKTSQSTLMYLRRLERYLS ------------------------ >RIBULOSE-PHOSPHATE 3-EPIM; SWP:Q5XDX2; PDB:2FLIA; TLKIAPSILAADYANFASELARIEETDAEYVHIDIMDGQFVPNISFGADVVASMRKHSKL ------3333-1111-------3333---------------------------1111--- VFDCHLMVVDPERYVEAFAQAGADIMTIHTESTRHIHGALQKIKAAGMKAGVVINPGTPA ---------3333---------------1111-----------1111-------111133 TALEPLLDLVDQVLIMTVNPGFGGQAFIPECLEKVATVAKWRDEKGLSFDIEVDGGVDNK 3311111111-----------------3333-----------1111-----------111 TIRACYEAGANVFVAGSYLFKASDLVSQVQTLRTALN 1----3333------3333------------------ >NITRIC OXIDE SYNTHASE; SWP:Q5KZC5; PDB:2FLQA; RQHDEQLMTKAEQFIIASYRELGKSEQEIKRRVNEIRWEVEQTGTYRHTYEELSYGAKMA --3333-------------------3333------------------------------- WRHSNRCIGRLFWQSLHVIDAREAVTEEEVFSYLFHHIEVATNGGKIRPTITIFRPNGEV ---1111----3333-----1111--------------3333-------------iiii- RIWNHQLIRYAGYETEEGIIGDSSSLTFTRACEQLGWKGEKTPFDVLPLVIQVGGQKPVW --------------------------------1111------------------------ TPIPKELVLEVPIEHPEFPWFRDLQLKWYAVPIISDMCLEIGGIRYMAAPFNGWYMGTEI ---1111-------1111---1111---------------iiii---------------- GARNFADDYRYNMLPKVASCMGLDTNSNASLWKDKALVELNIAVLYSYKKAGVSIVDHHT --------------------------3333------------------------------ 
AARQFQLFEQQEKAAGRHVTGDWTWLIPPLSPATTHIFHRSYDNTMMLPNFFYQDRPYE ------------1111-----3333-----11113333-----------------1111 >CIS-3-CHLOROACRYLIC ACID ; SWP:Q6VPE5; PDB:2FLTA; PVYMVYVSQDRLTPSAKHAVAKAITDAHRGLTGTQHFLAQVNFNEQPAGNVFLGGVQQGG -------2222-----------------------3333--------2222--iiii---- DTIFVHGLHREGRSADLKGQLAQRIVDDVSVAAEIDRKHIWVYFGEMPAQQMVEYGR -----------------------------------3333--------3333--iiii >SURFACE PRESENTATION OF A; SWP:P0A1N0; PDB:2FM8A; MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKDDDVWIWAQLGA -------------------3333-----------------------------------11 DSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFSTALN 11--------------33331111---------%%%%-------3333------------ GFYNYLEVFSRSLM ----------1111 >CELL INVASION PROTEIN SIP; SWP:Q56027; PDB:2FM9A; PQLEDFPALIKQASLDALFKCGKDAEALKEVFTNSNNVAGKKAIMEFAGLFRSALNATSD -3333-------------1111-----------------------------------111 SPEAKTLLMKVGAEYTAQIIKDGLKEKSAFGPWLPETKKAEAKLENLEKQLLDIIKNNTG 1-----------------------------1111-------------------------- GELSKLSTNLVMQEVMPYIASCIEHNFGCTLDPLTRSNLTHLVDKAAAKAVEALDMCHQK ------------------------------------------------------------ HLEMQTLIPLLLRNVFAQIPA -3333----------1111-- >AMYLOID BETA A4 PROTEIN P; SWP:Q6P0P4; PDB:2FMAA; EACKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPL ----------------------------1111--------------------------- >HYDROPHOBIN; SWP:Q04571; PDB:2FMCA; ATTIGPNTCSIDDYKPYCCQSMSGPAGSPGLLNLIPVDLSASLGCVVGVIGSQCGASVKC ------------------------------------------------2222-------- CKDDVTNTGNSFLIINAANCVA -----1111------------- >LECTIN; SWP:P42088; PDB:2FMDA; ANSVCFTFTDFESGQQDLIFQGDASVGSNKALQLTKVDSKGNPQGGSVGRALYTAPIRLW -----------2222-----------1111-------1111------------------- QSSSLVASFETTFTFSISQGSSTPADALTFFIASPDTKIPSGSGGRLLGLFGSSNDNGVV 3333-----------------------------1111--2222!!!!------------- SVEFDTYPNTDIGDPNYRHIGIDVNSIRSKAASKWDWQNGKTATAHISYNSASKRLSVVS --------3333------------------------------------------------ 
SYPNSSPVVVSFDVELNNVPWVRVGFSATTGQYTQTNNILAWSFRSSLMG --------------3333-------------------------------- >MUTT/NUDIX FAMILY PROTEIN; SWP:Q830S2; PDB:2FMLA; QFASKAEEKNYYERQASLAEFLTWYHQQELPEYEKPSLTVDVLLCYNKEADQLKVLLIQR -----------------------3333--------------------1111--------- KGHPFRNSWALPGGFVNRNESTEDSVLRETKEETGVVISQENIEQLHSFSRPDRDPRGWV ----2222--------1111------------------3333------------3333-- VTVSYLAFIGEEPLIAGDDAKEVHWFNLERHGQHITLSHEDVEITLDLKTAASLGKDTLA ----------------1111----------!!!!----!!!!-----------------! FDHSEIIIKAFNRVVDKEHEPQVLQVLGKDFTITEARKVFAKFLGVDYRSIDHSNFKKAT !!!-----------------33331111------------------3333-3333----1 QYFEELGEPSKIYQLK 111------------- >5'-D(*GP*CP*TP*GP*AP*TP*G; SWP:P06746; PDB:2FMPA; TLNGGITDMLTELANFEKNVSQAIHKYNAYRKAASVIAKYPHKIKSGAEAKKLPGVGTKI ---------------------------------------------3333----------- AEKIDEFLATGKLRKLEKIRQDDTSSSINFLTRVSGIGPSAARKFVDEGIKTLEDLRKNE ---------------------------------2222--------1111--3333---11 DKLNHHQRIGLKYFGDFEKRIPREEMLQMQDIVLNEVKKVDSEYIATVCGSFRRGAESSG 11------------3333----------------------1111---------------- DMDVLLTHPSFTSESTKQPKLLHQVVEQLQKVHFITDTLSKGETKFMGVCQLPSKNDEKE -------3333------------------1111---------------------2222-- YPHRRIDIRLIPKDQYYCGVLYFTGSDIFNKNMRAHALEKGFTINEYTIRPLGVTGVAGE -----------1111-------------------------------------3333---- PLPVDSEKDIFDYIQWKYREPKDRSE -----3333--1111----3333--- >FMR1 PROTEIN; SWP:Q06787; PDB:2FMR; ASRFHEQFIVREDLMGLAIGTHGANIQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKA ----------3333---------3333--------------1111--------------- RSFLE ----- >HIV-1 TAT INTERACTIVE PRO; SWP:Q9Z2G9; PDB:2FMUA; HLPKLREDFKMQNKSVFILGASGETGKVLLKEILGQNLFSKVTLIGRRKLTFYKNVNQEV ---------3333------1111------------------------------------- VDFEKLDVYASAFQGHDVGFCCLGTKAGAEGFVRVDRDYVLKSAELAKAGGCKHFNLLSS -111133333333----------------------------------1111--------2 RGADKSSSFLYLQVKGEVEAKVEELKFDRLSVFRPGVLLCDSWASGYAVPVVTVVRAMLN 2221111--------------3333----------------3333--------------- 
NLVSPSSGQMELLENKAILHLGKD 1111-------------------- >carbon monoxide oxidation; SWP:NA; PDB:2FMYA; ATQMRLTDTNLLEVLNSEEYSGVLKEFREQRYSKKAILYTPNTERNLVFLVKSGRVRVYL ---------3333---3333-3333-------2222------------------------ AYEDKEFTLAILEAGDIFCTHTRAFIQAMEDTTILYTDIRNFQNIVVEFPAFSLNMVKVL -3333-------2222---------------------3333---3333------------ GDLLKNSLTIINGLVFKDARLRLAEFLVQAAMDTGLKVPQGIKLELGLNTEEIALMLGTT -------------------------------------2222------------------- RQTVSVLLNDFKKMGILERVNQRTLLLKDLQKLKEFSS -----------1111-----1111----33333333-- >SALICYLATE SYNTHETASE, IR; SWP:Q9X9I8; PDB:2FN0A; KISEFLHEEQWLPTISGVLRQFAEEECYVYERPPCWYLGKGCQARLHINADGTQATFIDD -------1111-------------------------------------1111------33 AGEQKWAVDSIADCARRFMAHPQVKGRRVYGQVGFNFAAHARGIAFNAGEWPLLTLTVPR 33------------------1111------------------------------------ EELIFEKGNVTVYADAPLAVDTALNGEAYKQQVARAVAEIRRGEYVKVIVSRAIPLPSRI ----------------------2222-------------1111----------------- DMPATLLYGRQANTPVRSFMFRQEGREALGFSPELVMSVTGNKVVTEPLAGTRDRMGNPE ---------1111---------iiii-------------!!!!----------------- HNKAKEAELLHDSKEVLEHILSVKEAIAELEAVCLPGSVVVEDLMSVRQRGSVQHLGSGV ----------------------------------2222-----------!!!!------- SGQLAENKDAWDAFTVLFPSITASGIPKNAALNAIMQIEKTPRELYSGAILLLDDTRFDA ----1111-----------1111-------------------!!!!-------------- ALVLRSVFQDSQRCWIQAGAGIIAQSTPERELTETREKLASIAPYLMV ----------------------1111---------------3333--- >RAS-RELATED PROTEIN R-RAS; SWP:P10301; PDB:2FN4A; PPPSETHKLVVVGGGGVGKSALTIQFIQSYFVSDYDPTIEDSYTKICSVDGIPARLDILD -------------2222------------------1111---------iiii-------- TAGQEEFGAMREQYMRAGHGFLLVFAINDRQSFNEVGKLFTQILRVKDRDDFPVVLVGNK ---------3333------------1111------------------------------1 ADLESQRQVPRSEASAFGASHHVAYFEASAKLRLNVDEAFEQLVRAVRKYQEQ 1111111-----------1111-------1111-------------------- >ribose ABC transporter, p; SWP:Q9X053; PDB:2FN9A; GKMAIVISTLNNPWFVVLAETAKQRAEQLGYEATIFDSQNDTAKESAHFDAIIAAGYDAI --------------------------1111-------%%%%------------------- 
IFNPTDADGSIANVKRAKEAGIPVFCVDRGINARGLAVAQIYSDNYYGGVLAGEYFVKFL -------1111------1111--------------------------------------- KEKYPDAKEIPYAELLGILSAQPTWDRSNGFHSVVDQYPEFKMVAQQSAEFDRDTAYKVT ---1111----------1111------------33333333-------%%%%-------- EQILQAHPEIKAIWCGNDAMALGAMKACEAAGRTDIYIFGFDGAEDVINAIKEGKQIVAT ------3333------------------11111111--------------1111------ IMQFPKLMARLAVEWADQYLRGERSFPEIVPVTVELVTRE ------------------1111------------------ >CONSERVED HYPOTHETICAL PR; SWP:Q97Y08; PDB:2FNAA; GLFDTSPKDNRKDFFDREKEIEKLKGLRAPITLVLGLRRTGKSSIIKIGINELNLPYIYL ---------3333----------3333---------2222-------------------- DLRKFEERNYISYKDFLLELQKEINKLVKRLPSLLKALKNIQGIVIGNEIKFNRLSFANL --1111------------------------1111---1111------------------- LESFEQASKDNVIIVLDEAQELVKLRGVNLLPALAYAYDNLKRIKFISGSEGLLYDYLRV ----1111----------------1111------------1111--------------11 EDPESPLFGRAFSTVELKPFSREEAIEFLRRGFQEADIDFKDYEVVYEKIGGIPGWLTYF 11--1111---------------------------------------------------- GFIYLDNKNLDFAINQTLEYAKKLILKEFENFLHGREIARKRYLNIRTLSKCGKWSDVKR ------------------------------1111-3333-------1111---3333--- ALELEEGIEISDSEIYNYLTQLTKHSWIIKEGEKYCPSEPLISLAFS ----------3333-----------------------------1111 >FIBRONECTIN; SWP:P11276; PDB:2FNBA; MRGSEVPQLTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSSVG --------1111------------------------------------------------ YYTVTGLEPGIDYDISVITLINGGESAPTTLTQQT ----------------------------------- >maltose ABC transporter, ; SWP:Q9S5Y1; PDB:2FNCA; KLTIWCSEKQVDILQKLGEEFKAKYGIPVEVQYVDFGSIKSKFLTAAPQGQGADIIVGAH ------3333------------------------3333--------1111--------33 DWVGELAVNGLIEPIPNFSDLKNFYDTALKAFSYGGKLYGVPYAMEAVALIYNKDYVDSV 33----1111-------1111--------1111iiii---------------3333---- PKTMDELIEKAKQIDEEYGGEVRGFIYDVANFYFSAPFILGYGGYVFKETPQGLDVTDIG ---------------1111--------11113333----1111------1111-1111-- LANEGAVKGAKLIKRMIDEGVLTPGDNYGTMDSMFKEGLAAMIINGLWAIKSYKDAGINY ----------------1111--1111--------1111-------3333----1111--- 
GVAPIPELEPGVPAKPFVGVQGFMINAKSPNKVIAMEFLTNFIARKETMYKIYLADPRLP --------2222-------------1111------------1111--------------- ARKDVLELVKDNPDVVAFTQSASMGTPMPNVPEMAPVWSAMGDALSIIINGQASVEDALK -3333-1111----------3333------1111-------------1111--3333--- EAVEKIKAQIEKGSHHHHH ------------1111--- >MULTIPLE PDZ DOMAIN PROTE; SWP:Q5VZ62; PDB:2FNEA; MPQCKSITLERGPDGLGFSIVGGYGSPHGDLPIYVKTVFAKGAASEDGRLKRGDQIIAVN -----------1111----------1111---------2222--------2222----ii GQSLEGVTHEEAVAILKRTKGTVTLMVLSSDETSV ii-2222---------------------------- >Ras association domain-co; SWP:Q5EBH1; PDB:2FNFX; PRVLAERGEGHRFVELALRGGPGWCDLCGREVLRQALRCANCKFTCHSECRSLIQLDCR -------------------------3333---------3333----33331111----- >AGR_PAT_752P; SWP:NA; PDB:2FNOA; HEDGNTFDLYYWPVPFRGQLIRGILAHCGCSWDEHDVDAIEGLDCGAEKQPVAFGPPVLI ---------------1111------1111----------------3333----------- DRERNFAISQPAIAIYLGERLDILPATVEGRTLSAKIVNDANDVLDELTLNGGREWTPEK -1111-------------1111-------------------------------------- WQEFVPRLQKWIRIFADTGARNGLSAASGFLGTEKIGVADIVTAILWTTVADRFPAIKGI -------------------1111-1111--------3333-------------------- IEDTSPIIWGLSRRVVATAPLAALNSKSFEEYGNAYCGGEIEKSLRKVAS -------------------------------!!!!--------------- >ALLENE OXIDE SYNTHASE-LIP; SWP:O16025; PDB:2FNQA; AIYNVEVETGDREHAGTDATITIRITGAKGRTDYLKLDKWFHNDFEAGSKEQYTVQGFDV -----------2222--------------------------------------------- GDIQLIELHSDGGGYWDPDWFVNRVIIISSTQDRVYSFPCFRWVIKDMVLFPGEATLPFN --------------------------------------------------------1111 EVPAIVSEQRQKELEQRKLTYQWDYVSDDMPGNIKAKTHDDLPRDVQFTDEKSRSYQESR ---3333------------------------------1111-3333-------------- KAALVNLGIGSLFTMFENWDSYDDYHILYRNWILGGTPNMADRWHEDRWFGYQFLNGANP 11111111-3333-------333333331111----------3333-------------- VILTRCDALPSNFPVTNEHVNASLDRNLDEEIKDGHIYIVDFKVLVGAKSYADIRYCAAP ---------1111--1111--------3333----------3333--------------- LALFYVNKLGHLMPIAIQINQEPGPENPIWTPHEENEHDWMMAKFWLGVAESNFHQLNTH -----------------------3333---------3333-------------------- 
LLRTHLTTESFALSTWRNLASAHPVFKLLQPHIYGVLAIDTIGRKELIGSGGIVDQSLSL -------------------33333333-----2222------------2222-----111 GGGGHVTFMEKCFKEVNLQDYHLPNALKKRGVDDPSKLPGFYYRDDGLALWEAIETFIGE 1-----------1111-3333-3333-1111--3333----------------------- IIAIFYKNDDDVKRDNEIQSWIYDVHKNGWRVNPGHQDHGVPASFESREQLKEVLTSLVF -----------------------------------------------3333--------- TFSCQHAAVNFSQKDHYGFTPNAPAVLRHPPPKKKGEATLQSILSTLPSKSQAAKAIATV ---------1111-----3333------------------1111---------------- YILTKFSEDERYLGNYSATAWEDKDALDAINRFQDKLEDISKKIKQRNENLEVPYIYLLP -------------------------------------------------------1111- ERIPNGTAI --------- >AMINOTRANSFERASE; SWP:O25130; PDB:2FNUA; MKEFAYSEPCLDKEDKKAVLEVLNSKQLTQGKRSLLFEEALCEFLGVKHALVFNSATSAL ---------------------1111----------------------------------- LTLYRNFSEFSADRNEIITTPISFVATANMLLESGYTPVFAGIKNDGNIDELALEKLINE ----------1111-----------------1111--------1111--11113333-11 RTKAIVSVDYAGKSVEVESVQKLCKKHSLSFLSDSSHALGSEYQNKKVGGFALASVFSFH 11------2222----------------------1111----%%%%-------------1 AIKPITTAEGGAVVTNDSELHEKMKLFRSHGMLKKDFFEGEVKSIGHNFRLNEIQSALGL 111--------------------------------1111--------------------- SQLKKAPFLMQKREEAALTYDRIFKDNPYFTPLHPLLKDKSSNHLYPILMHQKFFTCKKL -----------------------2222-----3333--------------33331111-- ILESLHKRGILAQVHYKPIYQYQLYQQLFNTAPLKSAEDFYHAEISLPCHANLNLESVQN -----------------3333------------3333------------1111------- IAHSVLKTFESFK ------------- >Protein lag-3; SWP:Q09260; PDB:2FO1D; EDEPTIGDLNAFHSGEELHRQRSELARANYEKARPEMIANQRAVTAHLFNRYTEDEERKR -------1111------------------------------------3333----3333- VEQ --- >Protein lin-12 [Precursor; SWP:P14585; PDB:2FO1E; RTRKRRINASVWPPENEESPIKLHTEAAGSYAITEPITRESVNIIDPRHNRTVLHWIASN -3333--------------------------------3333------------------- SSAEKSEDLIVHEAKECIAAGADVNADCDENTPLLAVLARRRRLVAYLKAGADPTIYNKS ---------------3333--------------------------------------111 ERSALHQAAANRDFGVYLNSTKLKGDIEELDRNGTALIVAHNEGRDQVASAKLLVEKGAK 
1------------------3333-------1111----3333------------------ VDYDGAARKDSEKYKGRTALHYAAQVSNPIVKYLVGEKGSNKDKQDEDGKTPILAAQEGR ------------------------------3333------1111-----------33333 IEVVYLIQQGASVEAVDATDHTARQLAQANNHHNIVDIFDRCR 333------------------------1111-3333--1111- >UBIQUITIN-CONJUGATING ENZ; SWP:Q4Y037; PDB:2FO3A; YRIQKELHNFLNNPPINCTLDVHPNNIRIWIVKYVGLENTIYANEVYKLKIIFPDDYPLK --------------2222----1111----------2222-2222--------1111--- PPIVYFLQKPPKHTHVYSNGDICLSLLGDDYNPSLSISGLVLSIISMLS ------------11111111---333311111111-------------- >CYSTEINE PROTEINASE EP-B ; SWP:P25250; PDB:2FO5A; DLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTA ------3333--------------3333-------------------------------- DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP --!!!!------------------3333----------3333-----------------2 ANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKA 222-------1111----------------------------------------1111-- YWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTY --------1111-iiii--------1111%%%%----------- >UBIQUITIN CARBOXYL-TERMIN; SWP:Q93009; PDB:2FOJA; TSWRSEATFQFTVERFSRLSESVLSPPCFVRNLPWKIMVMPRFQKSVGFFLQCNAESDST 1111----------3333-----------%%%%--------------------1111--- SWSCHAQAVLKIINYRDDEKSFSRRISHLFFHKENDWGFSNFMAWSEVTDPEKGFIDDDK -------------33331111---------1111---------3333--1111---%%%% VTFEVFVQADAPHGVAW ----------------- >FOKI RESTRICTION ENDONUCL; SWP:P14870; PDB:2FOKA; IRTFGWVQNPGKFENLKRVVQVFDRNSKVHNEVKNIKIPTLVKESKIQKELVAIMNQHDL ------------------------------------3333-------------------- IYTYKELVGTGTAPCDAIIQATIADQGKGYIDNWSSDGFLRWAHALGFIEYINKSDSFVI -------------------------------------------1111------------- TDVGLAYSKSADGSAIEKEILIEAISSYPPAIRILTLLEDGQHLTKFDLGKNLGFSGESG ----------2222---------------------1111-----33333333-------- FTSLPEGILLDTLANAMPKDKGEIRNNWEGSSDKYARMIGGWLDKLGLVKQGKKEFIIPT ------------11111111-----------------------1111------------- NKEFISHAFKITGEGLKVLRRAKGSTKFTRVPKRVYWEMLATNLTDKEYVRTRRALILEI 
--------------------3333-----------1111------3333----------- LIKAGSLKIEQIQDNLKKLGFDEVIETIENDIKGLINTGIFIEIKGRFYQLKDHILQFVI -------3333------------1111--------1111-----!!!!------------ PNRLVKSELEEKKSELRHKLKYVPHEYIELIEIARNSTQDRILEMKVMEFFMKVYGYRGK -----------------------3333--------1111--------------------- HLGGSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPIGQADEMQRYVEENQTRNKHINP ------------------------------------------------1111--333311 NEWWKVYPSSVTEFKFLFVSGHFKGNYKAQLTRLNHITNCNGAVLSVEELLIGGEMIKAG 11-33333333--------------3333----------------3333----------- TLTLEEVRRKFNNGEINF -------3333------- >RAS-RELATED PROTEIN RAB-1; SWP:P62820; PDB:2FOLA; YDYLFKLLLIGDSGVGKSCLLLRFADDTYTESYISTIGVDFKIRTIELDGKTIKLQIWDY -----------22223333----------------------------iiii--------1 RGAHGIIVVYDVTDQESFNNVKQWLQEIDRYASENVNKLLVGNKCDLTTKKVVDYTTAKE 111-------1111------------------1111-------33331111--------- FADSLGIPFLETSAKNATNVEQSFMTMAAEIKKRM -------------1111------------------ >POLYPROTEIN; SWP:Q91H74; PDB:2FOMA; GSHMLEADLELERAADVRWEEQAEISGSSPILSISIKNEEEEQTLG -------------------3333-----------------3333-- >PEROXISOMAL ACYL-COA OXID; SWP:Q5D8D3; PDB:2FONA; GVDYLADERKKAGFDVDEMKIVWAGSRHDFELTDRISKLVASDPGFSKEGRTMLPRKELF --1111--1111------------------------------1111-2222--------- KNTLRKAAYAWKRIIELRLSQEEATMLRRYVDEPAFTDLHWGMFIPAIKGQGTDKQQEKW --------------1111---------1111---3333--------------3333---- LPLAYKMQIIGCYAQTELGHGSNVQGLETTATFDPQTDEFVIHSPTLTSSKWWPGGLGKV ---1111---------1111--3333-------1111--------1111----2222--- STHAVVYARLITDGKDYGVNGFIVQLRSLEDHKPLPGVTVGDIGMKFGNGAYNSMDNGVL -----------iiii-------------------2222-----------!!!!------- SFDHVRIPRDQMLMRVSQVTKEGKYVQSDIPRQLLYGTMVYVRQSIVADASLAMSRAVCI -------1111--------1111-------3333-------------------------- ATRYSAVRRQFGSQNGGQETQVIDYKTQQNRLFPLLASAYAFRFVGEWLKWLYTDVTQRL --------------------3333------------------------------------ AANDFSTLPEAHACTAGLKSLTTSATADGIEECRKLCGGHGYLCSSGLPELFAVYVPACT ----1111-----------------------------3333-1111-------------- 
YEGDNVVLQLQVARFLMKTISQLGTGKKPVGTVSYMGRIEHLMQCRSDVKQAEDWLKPSA -------------------1111------!!!!1111-3333--------3333--3333 VLEAFEARSARMSVACAKNLSKFENQEEGFAELAADLVEAAVAHCQLIVVSKYIEKLQQN -------------------------------------------------------1111- IPGKGVKQQLEVLCGIYSLFILHKHQGDFLGTGYITSKQGSLANDQLRALYSQLRPNAVS --2222-----------------------1111---------------------1111-- LVDAFNYTDHYLGSILGRYDGNVYPKLYEAAWKDPLNKSDIADGFHEYIRPLLK -3333--3333--33331111-------3333-3333-------------3333 >CARBONIC ANHYDRASE 1; SWP:P00915; PDB:2FOYA; WGYDDKNGPEQWSKLYPIANGNNQSPVDIKTSETKHDTSLKPISVSYNPATAKEIINVGH ---33333333-1111-1111--------1111---1111-------1111--------- SFHVNFEDNDNRSVLKGGPFSDSYRLFQFHFHWGSTNEHGSEHTVDGVKYSAELHVAHWN ----------------!!!!---------------3333-----iiii------------ SAKYSSLAEAASKADGLAVIGVLMKVGEANPKLQKVLDALQAIKTKGKRAPFTNFDPSTL -----33331111----------------3333-----3333--2222---------111 LPSSLDFWTYPGSLTHPPLYESVTWIICKESISVSSEQLAQFRSLLSNVEGDNAVPMQHN 1----------------------------------------------------------- NRPTQPLKGRTVRASF ------iiii------ >ADP-RIBOSYLHYDROLASE LIKE; SWP:Q9NX46; PDB:2FOZA; MASLSRFRGCLAGALLGDCVGSFYEAHDTVDLTSVLRHVQSLEPDPEALYYTDDTAMARA --------------------3333---------------1111----------------- LVQSLLAKEAFDEVDMAHRFAQEYKKDPDRGYGAGVVTVFKKLLNPKCRDVFEPARAQFN --------------------------------3333--------1111-11113333%%% GKGSYGNGGAMRVAGISLAYSSVQDVQKFARLSAQLTHASSLGYNGAILQALAVHLALQG %------3333-3333-----------------------3333-------------1111 ESSSEHFLKQLLGHMEDLEGDAQSVLDARELGMEERPYSSRLKKIGELLDQASVTREEVV ----------------1111---------------3333--------1111--------- SELGNGIAAFESVPTAIYCFLRCMEPDPEIPSAFNSLQRTLIYSISLGGDTDTIATMAGA -------1111---------------11113333----------1111------------ IAGAYYGMDQVPESWQQSCEGYEETDILAQSLHRVFQKS ------3333--------2222----------------- >CHORISMATE MUTASE; SWP:O07746; PDB:2FP1A; GTSQLAELVDAAAERLEVADPVAAFKWRAQLPIEDSGRVEQQLAKLGEDARSQHIDPDYV --1111--------------------------------------------1111------ 
TRVFDDQIRATEAIEYSRFSDWKLNPASAPPEPPDLSASRSAIDSLNNRMLSQIWSHWSL ------------------------3333------------------------------33 LSAPSCAAQLDRAKRDIVRSRHLDSLYQRALTTATQSYCQALPPA 331111------------1111----------1111--------- >CASPASE NC; SWP:Q9XYF4; PDB:2FP3A; ALTPYVGVVDGPEVKKSKKIHGGDSAILGTYKMQSRFNRGVLLMVNIMDYPDQNRRRIGA --------------------------------------------------------2222 EKDSKSLIHLFQELNFTIFPYGNVNQDQFFKLLTMVTSSSYVQNTECFVMVLMTHGNSVE -----------1111-------------------------------------------!! GKEKVEFRDGSVVDMQKIKDHFQTAKCPYLVNKPKVLMFPFASTNVPSLADTLVCYANTP !!----1111---3333------33331111----------------------------- GYVTHRDLDTGSWYIQKFCQVMADHAHDTDLEDILKKTSEAVGNKRTKKGSMQTGAYDNL ------------------------3333-------------------------------- GFNKKLYFNPGFFN --------2222-- >Succinyl-CoA ligase [GDP-; SWP:O19069; PDB:2FP4A; SYTASRKHLYVDKNTKVICQGFTGKQGTFHSQQALEYGTNLVGGTTPGKGGKTHLGLPVF 3333-------1111-----1111---------------------2222----%%%%--- NTVKEAKEQTGATASVIYVPPPFAAAAINEAIDAEVPLVVCITEGIPQQDMVRVKHRLLR -------------------3333----------------------------------111 QGKTRLIGPNCPGVINPGECKIGIMPGHIHKKGRIGIVSRSGTLTYEAVHQTTQVGLGQS 1--------------2222------3333------------3333-------1111---- LCVGIGGDPFNGTDFTDCLEIFLNDPATEGIILIGEIGGNAEENAAEFLKQHNSGPKSKP ------------------------1111--------------------------1111-- VVSFIAGLTAPPGRRMGAGAIIAGGKGGAKEKITALQSAGVVVSMSPAQLGTTIYKEFEK ------1111------------iiii----------1111-----1111---------11 RKML 11-- >Succinyl-CoA ligase [GDP-; SWP:P53590; PDB:2FP4B; MNLQEYQSKKLMSDNGVKVQRFFVADTANEALEAAKRLNAKEIVLKAQILAGGRGKGVFS ---3333-----1111------------------------------------3333--11 SGLKGGVHLTKDPEVVGQLAKQMIGYNLATKQTPKEGVKVNKVMVAEALDISRETYLAIL 11---------3333-------2222---11111111----------------------- MDRSCNGPVLVGSPQGGVDIEEVAASNPELIFKEQIDIIEGIKDSQAQRMAENLGFLGPL -3333---------------------3333------3333-----------------111 QNQAADQIKKLYNLFLKIDATQVEVNPFGETPEGQVVCFDAKINFDDNAEFRQKDIFAMD 1-----------------------------1111-----------333311113333--- 
DKSENEPIENEAAKYDLKYIGLDGNIACFVNGAGLAMATCDIIFLNGGKPANFLDLGGGV -11113333--3333----------------------------1111------------- KESQVYQAFKLLTADPKVEAILVNIFGGIVNCAIIANGITKACRELELKVPLVVRLEGTN --------------3333------------------------------------------ VHEAQNILTNSGLPITSAVDLEDAAKKAVASVT --------3333--------------------- >POLYPROTEIN; SWP:P06935; PDB:2FP7A; ETDMWIERTADITWESDAEITGSSERVDVRLDDDGNFQLM --------------1111-------------1111----- >Genome polyprotein; SWP:P06935; PDB:2FP7B; TTGVYRIMTSYQAGAGVMVEGVFHTLWHTTKGAALMSGEGRLDPYWGSVKEDRLCYGGPW ------------------%%%%---3333iiii---1111--------1111-------- KLQHKWNGHDEVQMIVVEPGKNVKNVQTKPGVFKTPEGEIGAVTLDYPTGTSGSPIVDKN -----------------2222-------------1111---------3333------111 GDVIGLYGNGVIMPNGSYISAIVQGER 1-----------1111----------- >STRICTOSIDINE SYNTHASE; SWP:P68175; PDB:2FP8A; KEILIEAPSYAPNSFTFDSTNKGFYTSVQDGRVIKYEGPNSGFVDFAYASPYWNKAFCEN -----------------3333------1111------3333--------11113333222 STDAEKRPLCGRTYDISYNLQNNQLYIVDCYYHLSVVGSEGGHATQLATSVDGVPFKWLY 2-3333-------------------------------1111---------iiii------ AVTVDQRTGIVYFTDVSTLYDDRGVQQIMDTSDKTGRLIKYDPSTKETTLLLKELHVPGG --------------------1111-----------------3333--------------- AEVSADSSFVLVAEFLSHQIVKYWLEGPKKGTAEVLVKIPNPGNIKRNADGHFWVSSSEE ---1111------1111---------1111-----------------1111--------1 LDGNMHGRVDPKGIKFDEFGNILEVIPLPPPFAGEHFEQIQEHDGLLYIGTLFHGSVGIL 1111111---------1111----------------------%%%%-------------- VY -- >C-jun-amino-terminal kina; SWP:Q9R237; PDB:2FPEA; SEQTHRAIFRFVPRHEDELELEVDDPLLVELQAEDYWYEAYNRTGARGVFPAYYAIEVTK --------------1111---2222-------1111--------------1111------ >YlmH protein; SWP:Q04JA7; PDB:2FPHX; GIYQHFSIEDRPFLDKGMEWIKKVEDSYAPFLTPFINPHQEKLLKILAKTYGLACSSSGE ------3333--------------------------------------1111----3333 FVSSEYVRVLLYPDYFQPEFSDFEISLQEIVYSNKFEYLTHAKILGTVINQLGIERKLFG ------------3333--3333---------------------------------1111- DILVDEERAQIMINQQFLLLFQDGLKKIGRIPVSLEERPFTEKID -------------3333---------------------3333--- >YWMB; 
SWP:O32277; PDB:2FPNA; LTPLAQAEGERQDVSIDKWTLHAKQNLSLTEKEFYQKVQRLKQEYRQYDWVIAREDKIKA -3333----1111-------------------------------3333------------ IGTYTDKKNRTSFRLQLVTTLKKHNPTSYLLYEQSLETPDSWNDTYEQFERETLGIFQEK ------1111-------------------------------------------------- VVIFTCLNGHLDDNNIVLQKKANQLLNEFQVEHVVEPNFVSISAFTDEWEEYITSKHKNL ----------------3333---------------------------------------- QIALRSHTVTVGTPIVTT ------------------ >METHYLASE YHHF; SWP:P0ADX9; PDB:2FPOA; GQIRIIGGQWRGRKLPVPDSTDRVRETLFNWLAPVIVDAQCLDCFAGSGALGLEALSRYA -------1111--------------------11112222------!!!!----------- AGATLIEDRAVSQQLIKNLATLKAGNARVVNSNASFLAQKGTPHNIVFVDPPFRRGLLEE ----------------------------------1111---------------2222--- TINLLEDNGWLADEALIYVESEVENGLPTVPANWSLHREKVAGQVAYRLYQREAQ ---------------------3333-----1111-------!!!!---------- >BOTULINUM NEUROTOXIN D LI; SWP:P19321; PDB:2FPQA; FMTWPVKDFNYSDPVNDNDILYLRIPQNKLITTPVKAFMITQNIWVIPERFSSDTNPSLS ---------1111-----------3333------------2222---------------- KPPRPTSKYQSYYDPSYLSTDEQKDTFLKGIIKLFKRINERDIGKKLINYLVVGSPFMGD ------1111---1111------------------------------------------3 SSTPEDTFDFTRHTTNIAVEKFENGSWKVTNIITPSVLIFGPLPNILDYTASLTQSNPSF 3331111----3333-------iiii------------------1111---------111 EGFGTLSILKVAPEFLLTFSDVGKSIFCMDPVIALMHELTHSLHQLYGINIPSDKRIRPQ 1----------1111-------1111-------------------------3333----- VSEGFFSQDGPNVQFEELYTFGGLDVEIIPQIERSQLREKALGHYKDIAKRLNNINKTIP ----------------------3333---------------------------------1 SSWISNIDKYKKIFSEKYNFDKDNTGNFVVNIDKFNSLYSDLTNVMSEVVYSSQYNVKNR 1111111--------1111---1111---------------------------------- THYFSRHYLPVFANILDDNIYTIRDGFNLTNKGFNIENSGQNIERNPALQKL -1111--------1111-------!!!!-2222--%%%%---3333------ >HISTIDINE BIOSYNTHESIS BI; SWP:Q9S5G5; PDB:2FPRA; SQKYLFIDRDGTLISEPDFQVDRFDKLAFEPGVIPQLLKLQKAGYKLVMITNQDGLGTQS --------2222----------1111---2222-------1111--------2222-333 FPQADFDGPHNLMMQIFTSQGVQFDEVLICPHLPADECDCRKPKVKLVERYLMDRANSYV 
33333------------1111-----------3333--------33333333-3333--- IGDRATDIQLAENMGINGLRYDRETLNWPMIGEQLT ---3333----------------------------- >TRYPTASE BETA-2; SWP:P20231; PDB:2FPZA; IVGGQEAPRSKWPWQVSLRVHGPY -------11111111--------- >ACYL CARRIER PROTEIN; SWP:O77077; PDB:2FQ0A; LKSTFDDIKKIISKQLSVEEDKIQMNSNFTKDLGADSLDLVELIMALEEKFNVTISDQDA ------------------3333-----------------------------------333 LKINTVQDAIDYIEKNNKQ 33333-------3333--- >ISOCHORISMATASE; SWP:P0ADI4; PDB:2FQ1A; KLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANIAALR --------3333----------3333--------33331111------------------ DYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTVLVKW ---1111------------3333--3333---!!!!-3333---1111--1111------ RYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALADFSRD --3333--------1111---------1111---------1111-----1111------- EHLMSLKYVAGRSGRVVMTEELLPAPIPASKAALREVILPLLDESDEPFDDDNLIDYGLD -----------------3333----------------3333-------11113333---3 SVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLSRE 333------3333-111133333333------------- >TRANSCRIPTION REGULATORY ; SWP:P32591; PDB:2FQ3A; SKWFNLEKIHSIEVQSLPEFFTNRIPSKTPEVYMRYRNFMVNSYRLNPNEYFSVTTARRN 33331111-3333---3333----1111------------------3333---------- VSGDAAALFRLHKFLTKWGLINYQV ------------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q81BJ4; PDB:2FQ4A; RNIETQKAILSASYELLLESGFKAVTVDKIAERAKVSKATIYKWWPNKAAVVDGFLSAAA --------------------3333----------------3333--3333---------- ARLPVPDTGSALNDILIHATSLANFLISREGTIINELVGEGQFDSKLAEEYRVRYFQPRR ----------------------------3333-------3333----------------- LQAKQLLEKGIKRGELKENLDIELSIDLIYGPIFYRLLVTGEKLDDSYVHDLVINAFEGI ----------1111---------------------------------------------- RLR --- >CYSTATHIONINE BETA-LYASE; SWP:P06721; PDB:2FQ6A; KLDTQLVNAGRSKKYTLGAVNSVIQRASSLVFDSVEAKKHATRNRANGELFYGRRGTLTH -------22223333iiii----------------------1111------3333----- FSLQQAMCELEGGAGCVLFPCGAAAVANSILAFIEQGDHVLMTNTAYEPSQDFCSKILSK --------1111------------------11112222----11113333------3333 
LGVTTSWFDPLIGADIVKHLQPNTKIVFLESPGSITMEVHDVPAIVAAVRSVVPDAIIMI --------1111--3333--1111----------------------------1111---- DNTWAAGVLFKALDFGIDVSIQAATKYLVGHSDAMIGTAVCNARCWEQLRENAYLMGQMV --1111----3333--------33333333------------1111-------1111--- DADTAYITSRGLRTLGVRLRQHHESSLKVAEWLAEHPQVARVNHPALPGSKGHEFWKRDF ---------3333----------------------1111----1111--2222------- TGSSGLFSFVLKKKLNNEELANYLDNFSLFSMAYSWGGYESLILANQPEHIAAIRPQGEI ----------------------3333------------------------11112222-- DFSGTLIRLHIGLEDVDDLIADLDAGFARIV ------------------------------- >HYPOTHETICAL PROTEIN TA09; SWP:Q9HJM9; PDB:2FQHA; MSEVNIVVNGREAGSKSKGCALCGATWGDYHADFLGEDLFFCCDICAAEFMNMMDEAFKH --------%%%%---33333333----------1111---------------3333---- TARHNVDELHIDGNYQLGRNVLLKNGEDRLRFYVKFGPGAVIKEFKITD -------------1111------------3333------33333333-- >PHOSPHOPROTEIN; SWP:P04880; PDB:2FQMA; DWKQPELESDEHGKTLRLTLPEGLSGEQKSQWMLTIKAVVQSAKHWNLAECTFEASGEGV -----------------------------------------1111-1111---------- IIKKR ----- >HYPOTHETICAL PROTEIN BP22; SWP:Q7VWF8; PDB:2FQPA; GKRPGAIPTVQIDNERVKVTEWRFPPGGETGWHRHSDYVVVPTTGPLLLETPEGSVTSQL ------------------------2222----------------------1111------ TRGVSYTRPEGVEHNVINPSDTEFVFVEIEIKA 2222----------------------------- >MEMBRANE LIPOPROTEIN TMPC; SWP:P29724; PDB:2FQXA; GDFVVGMVTDSGDIDDKSFNQQVWEGISRFAQENNAKCKYVTASTDAEYVPSLSAFADEN -------------------------------1111---------3333--------1111 MGLVVACGSFLVEAVIETSARFPKQKFLVIDAVVQDRDNVVSAVFGQNEGSFLVGVAAAL ---------------------1111-----------1111-------------------- KAKEAGKSAVGFIVGMELGMMPLFEAGFEAGVKAVDPDIQVVVEVANTFSDPQKGQALAA --1111-----------!!!!--------------1111--------------------- KLYDSGVNVIFQVAGGTGNGVIKEARDRRLNGQDVWVIGVDRDQYMDGVYDGSKSVVLTS --1111--------3333-------------------------3333------------- MVKRADVAAERISKMAYDGSFPGGQSIMFGLEDKAVGIPEENPNLSSAVMEKIRSFEEKI ---------------------2222----3333--------1111--------------- VSKEIVVPVRSARMMN ----------1111-- >ERYTHROMYCIN SYNTHASE, ER; SWP:Q03131; PDB:2FR1A; 
DEVSALRYRIEWRPTGAGEPARLDGTWLVAKYAGTADETSTAAREALESAGARVRELVVD 3333-------------------------------------------1111--------- ARCGRDELAERLRSVGEVAGVLSLLAVDEAEPEEAPLALASLADTLSLVQAMVSAELGCP -----------------------1111-------11113333----------1111---- LWTVTESAVATGPFERVRNAAHGALWGVGRVIALENPAVWGGLVDVPAGSVAELARHLAA -----------1111---3333-------------3333-------22223333------ VVSGGAGEDQLALRADGVYGRRWVRAAAPATDDEWKPTGTVLVTGGTGGVGGQIARWLAR --------------------------------------------1111------------ RGAPHLLLVSRSGPDADGAGELVAELEALGARTTVAACDVTDRESVRELLGGIGDDVPLS -----------!!!!2222-------1111--------1111-------11111111--- AVFHAAATLDDGTVDTLTGERIERASRAKVLGARNLHELTRELDLTAFVLFSSFASAFGA ------------1111-----------------------1111---------3333---2 PGLGGYAPGNAYLDGLAQQRRSDGLPATAVAWGTWAFRRHGVIEMPPETACRALQNALDR 222-----------------1111-------------1111------------------- AEVCPIVIDVRWDRFLLAYTAQRPTRLFDEIDDARR ----------------3333-------11111111- >HYPOTHETICAL PROTEIN RV27; SWP:O07216; PDB:2FR2A; DLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHVGKPFLTYTQQTRAVADGKP --33331111-------------1111--------------------------------- LHSETGYLRVCRPGCVELVLAHPSGITEIEVGTYSVTGDVIELELSTRADGSIGLAPTAK -----------2222------1111-----------!!!!---------------1111- EVTALDRSYRIDGDELSYSLQMRAVGQPLQDHLAAVLHRQR -----------!!!!--------iiii-------------- >CYTIDINE DEAMINASE; SWP:P56389; PDB:2FR5A; EPEHVQRLLLSSREAKKSAYCPYSRFPVGAALLTGDGRIFSGCNIENACYPLGVCAERTA -------------3333----------------1111-----------3333-------- IQKAISEGYKDFRAIAISSDLQEEFISPCGACRQVMREFGTDWAVYMTKPDGTFVVRTVQ ----1111---------------------------3333---------1111-----333 ELLPASFGPEDLQKIQ 3------3333----- >PUTATIVE CYTOCHROME P450; SWP:Q6N8N2; PDB:2FR7A; HGAGVPHLGIDPFALDYFADPYPEQETLREAGPVVYLDKWNVYGVARYAEVYAVLNDPLT -2222------------------------------------------------------- FCSSRGVGLSDFKKEKPWRPPSLILEADPPAHTRTRAVLSKVLSPATMKRLRDGFAAAAD --1111-----------------------3333--------------------------- AKIDELLARGGNIDAIADLAEAYPLSVFPDAMGLKQEGRENLLPYAGLVFNAFGPPNELR 
---------------1111------------------3333-3333--3333-------- QSAIERSAPHQAYVAEQCQRPNLAPGGFGACIHAFSDTGEITPEEAPLLVRSLLSAGLDT ------3333--------3333-2222------3333----3333--------------- TVNGIAAAVYCLARFPDEFARLRADPSLARNAFEEAVRFESPVQTFFRTTTRDVELAGAT ------------------------1111---------------------------iiii- IGEGEKVLMFLGSANRDPRRWDDPDRYDITRKTSGHVGFGSGVHMCVGQLVARLEGEVVL -----------3333-3333--1111-1111-22221111-11111111----------- AALARKVAAIEIAGPLKRRFNNTLRGLESLPIQLTPA ------------------------------------- >NAD(P)H-FLAVIN OXIDOREDUC; SWP:NA; PDB:2FREA; TNSNNRQSEYPVDPLFLDRWSPRAFDGSPPKEHLLTILDAAHWAPSASNHQPWRFVYAHK --%%%%------3333-------------3333-------1111-2222---------11 DSEDWPLFVELLEGNQKWAKNASVLLFVISRDHTISHEGEKKPSATHSFDAGAAWFSLAQ 11------11113333--1111-------------1111--------------------- AHLLGYHAHGGGIFKDRIVEKLDIPDGFKVEAGVAIGTLTDKSILPDDLAEREVPSKRVP -1111--------------1111-2222------------3333-----1111------3 LADVAFEGRFTGKAD 333------------ >MYOGLOBIN; SWP:P68082; PDB:2FRFA; GLSDGEWQQVLNVWGKVEADIAGHGQEVLIRLFTGHPETLEKFDKFKHLKTEAEMKASED -----------------------------------333311111111------------- LKKHGTVVLTALGGILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISDAIIHVLHSKHP ---------------1111--3333--------------3333---------------22 GDFGADAQGAMTKALELFRNDIAAKYKELGFQ 22------------------------------ >Trem-like transcript 1 pr; SWP:Q86YW5; PDB:2FRGP; GSLPEVLQAPVGSSILVQCHYRLQDVKAQKVWCRFLPEGCQPLVSSAVDRRAPAGRRTFL ---------2222--------3333----------3333-------2222---------- TDLGGGLLQVEMVTLQEEDAGEYGCMVDGARGPQILHRVSLNILPP ---iiii--------1111---------1111-------------- >STAPHYLOCOCCAL ACCESSORY ; SWP:Q7A1N5; PDB:2FRHA; GSHMAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDII ------------------------------------------------------------ NHLNYKQPQVVKAVKILSQEDYFDKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITE -----3333--------------------------------------------------- ANNEIEL ------- >HYPOTHETICAL PROTEIN PH07; SWP:O58523; PDB:2FRNA; PEELVKLLPKRWVRIGDVLLLPPELEPYKHRIAEVYAEVLGVKTVLRKGYELLYGSDTVT 
--------------!!!!------------------------------------------ VHVENGIKYKLDVAKIFSPANVKERVRAKVAKPDELVVDFAGIGHLSLPIAVYGKAKVIA ---iiii----1111--3333----------1111-----!!!!---------------- IEKDPYTFKFLVENIHLNKVEDRSAYNDNRDFPGENIADRILGYVVRTHEFIPKALSIAK ---3333--------11113333----1111----------------------------2 DGAIIHYHNTVPEKLPREPFETFKRITKEYGYDVEKLNELKIKRYAPGVWHVVLDLRVFK 222--------3333------------1111----------------------------- S - >CYTOPLASMIC PROTEIN NCK2; SWP:O43639; PDB:2FRWA; IPAFVKFAYVAEREDELSLVKGSRVTVMEKCSDGWWRGSYNGQIGWFPSNYVLEEVD ------------2222-------------------------------3333------ >HYPOTHETICAL PROTEIN YEBU; SWP:P76273; PDB:2FRXA; YFPDAFLTQREAPFDDFLAACQRPLRRSIRVNTLKISVADFLQLTAPYGWTLTPIPWCEE --3333-------3333--1111--------3333---------3333------1111-- GFWPLGSTAEHLSGLFYIQEASSLPVAALFADGNAPQRVDVAAAPGSKTTQISARNNEGA ---3333-3333-------3333-------%%%%-------------------------- ILANEFSASRVKVLHANISRCGISNVALTHFDGRVFGAAVPEFDAILLDAPCSGEGVVRK ------3333--------1111----------3333-----------------1111333 DPDALKNWSPESNQEIAATQRELIDSAFHALRPGGTLVYSTCTLNQEENEAVCLWLKETY 3--------------------------11112222----------1111----------1 PDAVEFLPLGDLFPGANKALTEEGFLHVFPQIYDCEGFFVARLRKTQAIPALPAPKYKVG 111-----1111--3333--1111----1111---------------------------- NFPFSPVKDREAGQIRQAATGVGLNWDENLRLWQRDKELWLFPVGIEALIGKVRFSRLGI --------------------------3333------------33331111---------- KLAETHNKGYRWQHEAVIALASPDNNAFELTPQEAEEWYRGRDVYPQAAPVADDVLVTFQ -----!!!!-------------------------------------------------%% HQPIGLAKRIGSRLKNSYPRELVRDGKL %%----------------3333------ >PSD-1; SWP:NA; PDB:2FS1A; MEAVDANSLAQAKEAAIKELKQYGIGDYYIKLINNAKTVEGVESLKNEILKALPTE --------------------------------3333--3333-------------- >PHENYLACETIC ACID DEGRADA; SWP:P76084; PDB:2FS2A; SLSHKAWQNAHAYENDACAKALGIDIISDEGFAVVTTVTAQLNGHQSCHGGQLFSLADTA 3333---------------1111-----2222----------1111--3333-------- FAYACNSQGLAAVASACTIDFLRPGFAGDTLTATAQVRHQGKQTGVYDIEIVNQQQKTVA -------------------------2222-----------1111--------1111---- 
LFRGKSHR -------- >CELLULAR RETINOIC ACID-BI; SWP:P29373; PDB:2FS6A; NFSGNWKIIRSENFEELLKVLGVNVMLRKIAVAAASKPAVEIKQEGDTFYIKTSTTVRTT ------------------1111----------1111--------!!!!------------ EINFKVGEEFEEQTVDGRPCKSLVKWESENKMVCEQKLLKGEGPKTSWTRELTNDGELIL ----2222-----1111----------1111---------------------1111---- TMTADDVVCTRVYVRE ---!!!!--------- >BROMODOMAIN PHD FINGER TR; SWP:Q12830; PDB:2FSAA; DTKLYCICKTPYDESKFYIGCDRCQNWYHGRCVGILQSEAELIDEYVCPQCQSTEDATVL -----1111---1111------------3333---33331111----------------- TPLTEKDYEGLKRVLRSLQAHKAWPFLEPVDPNDAPDYYGVIKEPDLATEERVQRRYYEK ----------------------1111----3333--1111-----3333---1111---3 LTEFVADTKIFDNCRYYNPSDSPFYQCAEVLESFFVQKLKGFK 333---------------1111--------------1111--- >PREPROTEIN TRANSLOCASE SE; SWP:SECA_ECOLI; PDB:2FSHA; RNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFAVVR -3333---------------3333----------------------3333---------- EASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVN -----------3333-----1111------22223333---------1111--------- DYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNEYGFDYLRDNMA ---------------1111------2222--------------------------1111- FSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSQNENQTLASITFQNYFRLYE -3333--------------------1111-------------------------3333-- KLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKER ------------------------------------------------------------ TAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAAVTIATN ------------------------1111------3333-------1111-2222------ MAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTERHESR --------2222------------------------------------------------ RIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVTK ------3333-iiii--------33331111---------3333---------------- AIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSETINSIREDVFK ------------------------------------------------------------ ATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRERILAQSIE 
-------22223333--------------------------------------------- VYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRGYAQKDPKQEYKR ----------------------------------------1111---------------- ESFSMFAAMLESLKYEVISTLSKVQVRMPEE ------------------------------- >HYPOTHETICAL PROTEIN TA05; SWP:NA; PDB:2FSJA; MVVVGLDVGYGDTKVIGVDGKRIIFPSRWAVTETESWKIPVLSTDGGQTKFIYGKYASGN --------3333-----iiii-----------------------iiii-----1111--- NIRVPQGDGRLASKEAFPLIAAALWESGIHNPVDLVIGSGTPLGTFDLEVKAAKEALENK ---------11113333-------3333-------------3333--------------- VLTVTGPEGEVRQFNITRLIMRPQGVGAALYLLNQGIIEQQPGYGVVIDVGSRTTDVLTI -----2222-----------------------------------------1111------ NLMDMEPVVELSFSLQIGVGDAISALSRKIAKETGFVVPFDLAQEALSHPVMFRQKQVGG -------3333---------------------------------3333----%%%%---- PEVSGPILEDLANRIIENIRLNLRGEVDRVTSLIPVGGGSNLIGDRFEEIAPGTLVKIKP 3333----------------------1111------3333-----3333-2222----33 EDLQFANALGYRDAAERS 33---------------- >ATU0111 PROTEIN; SWP:Q8UJ27; PDB:2FSQA; VSKDLDYISTANHDQPPRHLGSRFSAEGEFLPEPGNTVVCHLVEGSQTESAIVSTRQRFL -3333---3333----1111----1111--------------2222-------------- DPEASQLAFTPVSSLHTVFQGVIESRRALPYWPQTLPLDTPIDAVTDYYRDRLSTFPTLP -3333-----3333--------1111------333311113333--------1111---- AFNRVTGLRPVGVKGATAEDDSIVALWRDTFADFFGYRHPDHDTYEFHITLSYIVSWFEP --------1111----3333------------------1111----------------33 ECLPRWQALDEELEKLRVAAPVIQRPPAFCEFKDNHFKELVVFD 33------------------------------------------ >ACETYLTRANFERASE; SWP:Q8UCP8_AGRT5; PDB:2FSRA; SIPTLRTERLTLRPLAADFPAYRDFASPRSTGVGGPYDLPSTWGVFCHDLANWHFFGHGA ------1111----------------33331111-------------------------- LIDLGETGECIGQIGINHGPLFPEKELGWLLYEGHEGRGYAAEAAVALRDWAFETLNLPT ---!!!!-----------1111---------2222------------------------- LVSYVSPQNRKSAAVAERIGGTLDPLAPRSDPEDLVYRYHQ -----1111--------------1111---1111------- >FORMATE DEHYDROGENASE; SWP:O93968; PDB:2FSSA; AKIVLVLYDAGKHAADEEKLYGCTENKLGIANWLKDQGHELITTSDEEGGNSVLDQHIPD ----------3333--1111--3333--------1111------------------3333 
ADIIITTPFHPAYITKERIDKAKKLKLVVVAGVGSDHIDLDYINQTGKKISVLEVTGSNV ------1111-----------1111---------1111----------------2222-- VSVAEHVVMTMLVLVRNFVPAHEQIINHDWEVAAIAKDAYDIEGKTIATIGAGRIGYRVL ------------------------1111--33333333---2222-------3333---- ERLVPFNPKELLYYDYQALPKDAEEKVGARRVENIEELVAQADIVTVNAPLHAGTKGLIN --1111---------------------------3333-1111---------1111----3 KELLSKFKKGAWLVNTARGAICVAEDVAAALESGQLRGYGGDVWFPQPAPKDHPWRDMRN 33311112222------3333----------------------------11113333--1 KYGAGNAMTPHYSGTTLDAQTRYAQGTKNILESFFTGKFDYRPQDIILLNGEY 111--------1111------------------3333----3333---iiii- >Mitogen-activated protein; SWP:Q16539; PDB:2FSTX; RPTFYRQELNKTIWEVPERYQNLSPVGGSVCAAFDTKTGLRVAVKKLSRPFQSIIHAKRT ----------------3333---------------------------------------- YRELRLLKHMKHENVIGLLDVFTPARSLEEFNDVYLVTHLMGADLNQKLTDDHVQFLIYQ -----------1111-----------3333------------------------------ ILRGLKYIHSADIIHRDLKPSNLAVNEDCELKILDATRWYRAPEIMLNWMHYNQTVDIWS --------1111------3333---1111--------11113333--------------- VGCIMAELLTGRTLFPGTDHIDQLKLILRLVGTPGAELLKKISSESARNYIQSLTQMPKM ------------------3333------------33331111--------1111------ NFANVFIGANPLAVDLLEKMLVLDSDKRITAAQALAHAYFAQYHDPDDEPVADPYDQSLE 3333-2222--------------3333--333311111111---3333-------3333- SRDLLIDEWKSLTYDEVISFVPPP ------------------------ >PG_0823 PROTEIN; SWP:Q7M7B7; PDB:2FSWA; RKISDEECPVRKSQIFAGKWTLLIIFQINRRIIRYGELKRAIPGISEKLIDELKFLCGKG ----11113333---------------!!!!----------2222--------------- LIKKKQYPEVPPRVEYSLTPLGEKVLPIIDEIAKFGENL -------------------3333---------------- >COG0607: RHODANESE-RELATE; SWP:P95198; PDB:2FSXA; SYAGDITPLQAWEMLSDNPRAVLVDVRCEAEWRFVGVPDLSSLGREVVYVEWATSDGTHN -----------------1111------------------3333----------1111--- DNFLAELRDRIPRPVIFLCRSGNRSIGAAEVATEAGITPAYNVLDGFEGHLDAEGHRGAT ----------------------3333------1111------2222-----1111----- GWRAVGLPWRQG 3333-------- >TDP-FUCOSAMINE ACETYLTRAN; SWP:NA; PDB:2FT0A; VRASIEPLTWENAFFGVNSAIVRITSEAPLLTPDALAPWSRVQAKIAASNTGELDALQQL 
-------------------------------3333-----------3333-------111 GFSLVEGEVDLALPVNNVSDSGAVVAQETDIPALRQLASAAFAQSRFRAPWYAPDASGRF 1-------------------------3333----------------------1111---- YAQWIENAVRGTFDHQCLILRAASGDIRGYVSLRELNATDARIGLLAGRGAGAELMQTAL -----------1111------3333-----------1111-------2222--------- NWAYARGKTTLRVATQMGNTAALKRYIQSGANVESTAYWLYR ---1111--------1111-------1111------------ >BIGLYCAN; SWP:P21809; PDB:2FT3A; AMCPFGCHCHLRVVQCSDLGLKAVPKEISPDTTLLDLQNNDISELRKDDFKGLQHLYALV ----------------------------3333---------------1111-1111---- LVNNKISKIHEKAFSPLRKLQKLYISKNHLVEIPPNLPSSLVELRIHDNRIRKVPKGVFS -------------1111--------------------3333---------------1111 GLRNMNCIEMGGNPLENSGFEPGAFDGLKLNYLRISEAKLTGIPKDLPETLNELHLDHNK ---------------1111----------------------------3333--------- IQAIELEDLLRYSKLYRLGLGHNQIRMIENGSLSFLPTLRELHLDNNKLSRVPAGLPDLK ----1111----------------------1111--------------------3333-- LLQVVYLHTNNITKVGVNDFCPVGFGVKRAYYNGISLFNNPVPYWEVQPATFRCVTDRLA ------------------------------------------1111-33331111--111 IQF 1-- >AZURIN; SWP:P00282; PDB:2FT6A; CSVDIQGNDQMQFNTNAITVDKSCKQFTVNLSHPGNLPKNVMGHNWVLSTAADMQGVVTD -------1111---------3333-------------3333--------3333------- GMASGLDKDYLKPDDSRVIAHTKLIGSGEKDSVTFDVSKLKEGEQYMFFCTPHPMKGTLT ----3333------1111-------2222-----------2222-----1111------- LK -- >FATTY ACID-BINDING PROTEI; SWP:P81400; PDB:2FTBA; PFNGTWQVYSQENYEAFLRAVGLPEDIINVAKDINPIIEIQQNGDNFVVTSKTPNQSVTN ------------------3333--------1111------------------1111---- SFTIGKEAEITSMGGKKIKCTVVLEGGKLVSKTDQFSHIQEVKGNEMVETLTVGGATLIR --2222-----2222---------iiii--------------!!!!------iiii---- RSKRV ----- >HYDROXYMETHYLGLUTARYL-COA; SWP:Q9I2A0; PDB:2FTPA; MNLPKKVRLVEVGPRDGLQNEKQPIEVADKIRLVDDLSAAGLDYIEVGSFVSPKWVPQMA ----------------3333-----------------1111----------33333333- GSAEVFAGIRQRPGVTYAALAPNLKGFEAALESGVKEVAVFAAASEAFSQRNINCSIKDS -----------1111---------------1111-------------------------- LERFVPVLEAARQHQVRVRGYISCVLGCPYDGDVDPRQVAWVARELQQMGCYEVSLGDTI 
-----------1111--------1111-------------------1111---------- GVGTAGATRRLIEAVASEVPRERLAGHFHDTYGQALANIYASLLEGIAVFDSSVAGLGGC --------------3333-1111------1111---------1111-------%%%%--- PYAKGATGNVASEDVLYLLNGLEIHTGVDMHALVDAGQRICAVLGKSNGSRAAKALLAKA --iiii-------------1111------------------------------------- >BH0200; SWP:Q9KGB0; PDB:2FTRA; ENVKLIALYEQPEDKQAFDEHYFNTHAPLTRKIPGLRDKVTRIVGSPGESKFYLCEYYDD --------------------------------2222------------------------ HESLQQARTDEGKASGKDAKFAGKLLTLIGEED ---------------------!!!!-------- >GEPHYRIN; SWP:Q03555; PDB:2FTSA; MSPFPLTSMDKAFITVLEMTPVLGTEIINYRDGMGRVLAQDVYAKDNLPPFPASVKDGYA ----------------------------33332222------------------------ VRAADGPGDRFIIGESQAGEQPTQTVMPGQVMRVTTGAPIPCGADAVVQVEDTELIRESD -3333-----------2222------2222----3333--1111----3333-------- DGTEELEVRILVQARPGQDIRPIGHDIKRGECVLAKGTHMGPSEIGLLATVGVTEVEVNK --------------2222---2222--2222---2222---------------------- FPVVAVMSTGNELLNPEDDLLPGKIRDSNRSTLLATIQEHGYPTINLGIVGDNPDDLLNA ---------1111-3333--2222-------------1111------------------- LNEGISRADVIITSGGVSMGEKDYLKQVLDIDLHAQIHFGRVFMKPGLPTTFATLDIDGV --------------------------------------------------------iiii RKIIFALPGNPVSAVVTCNLFVVPALRKMQGILDPRPTIIKARLSCDVKLDPRPEYHRCI ---------------------------1111----------------------------- LTWHHQEPLPWAQSTGNQMSSRLMSMRSANGLLMLPPKTEQYVELHKGEVVDVMVIGRL ---2222--------------------------------------2222---------- >DIHYDROPYRIMIDINE AMIDOHY; SWP:Q86LT2; PDB:2FTWA; TGTILIKNGTVVNDDRYFKSDVLVENGIIKEISKNIEPKEGIKVVDATDKLLLPGGIDTH ------------1111--------iiii------------------2222---------- THFQLPFMGTVSVDDFDIGTQAAVAGGTTFIIDFVIPTRGQSLLEAYDQWKKWADEKVNC ------iiii---------------------------2222------------3333--- DYSLHVAITWWSEQVSREMEILVKERGVNSFCFMAYKNSFMVTDQEMYHIFKRCKELGAI -----------------------------------2222--------------------- AQVHAENGDMVFEGQKKMLEMGITGPEGHELSRPEALEAEATNRAIVIADSVCTPVYIVH ------------------1111--3333-------------------------------- 
VQSIGAADVICKHRKEGVRVYGEPIAAGLGVDGSHMWNHDWRHAAAFVMGPPIRPDPRTK ------------------------3333----3333-------------------1111- GVLMDYLARGDLDCVGTDNCTFCADQKAMGKDDFTKIPNGVNGVEDRMSIVWENGVNTGK ----------------------3333-1111-3333------3333---------1111- LTWCQFVRATSSERARIFNIYPRKGRIDVGCDGDIVIWDPNQSKTISKDTHHHAVDFNIF ---------------------------2222---------------3333-------111 EGIKVTGIAVTTIVAGNIVWSDNKLSCVKGSGRFVPRPPFGPVFDGIEQRDKVRNELLRK 1------------iiii---%%%%---2222---------3333------------1111 VDR --- >Hypothetical 24.6 kDa pro; SWP:P40014; PDB:2FTXA; NDAAEVALYERLLQLRVLPGASDVHDVRFVFGDDSRCWIEVAHGDHVIGNSHPALDPKSR ------------------------------------------------------------ ATLEHVLTVQGDLAAFLVVARDLLASL ----------------------3333- >Kinetochore protein SPC24; SWP:Q04477; PDB:2FTXB; ANENILKLKLYRSLGVILDLENDQVLINRKNDGNIDILPLDNNLSDFYKTKYIWERLGK -3333-------------3333------------------3333-----------1111 >GERANYLTRANSTRANSFERASE; SWP:NA; PDB:2FTZA; EVEERIREILRPGWDLLTEEAMLYSATVGGRIRPLLVLTLGEDLGVEEELLDVAVAVELF ------------------------------------------------------------ HTASLIHDDLPPIDNADFRRGPSCHRTYGEDIALLAGDGLFFLAFSQISIGNSIFEEFSE ---------3333----------3333--------------------------------- TAYLLLGEAMDVEFERRMEVSQEMVERMYAFTGALFAFCFSAPFILGDHTMLLGEFGVAF ------------------------------------------3333-------------- QIYDDLDILGSFEVTLVVGIQAREMADYYEEVLGIESEGLFRTLFLLELQMVEER -----------------------------------1111----------3333-- >CYCLOPHILIN, PUTATIVE; SWP:Q8I402; PDB:2FU0A; PKSAIIYTTMGDIHISLFYKECKKTVQNFSVHSINGYYNNCIFHRVIKHFMVQTGDPSGD -------1111------------------------1111-------2222-----3333- GTGGESIWGNEFEDEFFDHLNHSKPFMVSMANCGPNTNGSQFFITTVPCPWLDFKHTVFG -----1111-------1111----------------------------3333-------- KVTQGSKIVLDIEKVRTDKRDKPLEDIKILNIKIN ----3333---1111--1111-------------- >HYPOTHETICAL PROTEIN SPY2; SWP:Q99XL4; PDB:2FU2A; PSEKEILDALSKVYSEQVIQADDYFRQAIFELASQLEKEGSSLLATKIDSLINQYILTHQ ----------------3333--------------------------------------%% FDAPKSIFDLSRLVKTK %%-3333---------- >FERRIC 
UPTAKE REGULATION ; SWP:P0A9A9; PDB:2FU4A; DNNTALKKAGLKVTLPRLKILEVLQEPDNHHVSAEDLYKRLIDMGEEIGLATVYRVLNQF ------1111---------------3333------------------------------- DDAGIVTRHNFEGGKSVFELT ----------2222------- >Ras-related protein Rab-8; SWP:P55258; PDB:2FU5C; KTYDYLFKLLLIGDSGVTFISTIGIDFKIRTIELDGKRIKLQIWDTTTAYYRGAMGIMLV -----------------3333------------iiii----------------------- YDITNEKSFDNIRNWIRNIEEHASADVEKMILGNKVNDKRQVSKERGEKLALDYGIKFME -1111----------3333-----------------------3333-----1111----- TSAINVENAFFTLARDIKAKMDKNWK -------------------------- >PHOSPHOMANNOMUTASE 1; SWP:NA; PDB:2FUEA; RVLCLFDVDGTLTPARQKIDPEVAAFLQKLRSRVQIGVVGGSDYCKIAEQLGDGDEVIEK -------2222--2222------------1111-------------------!!!!---- FDYVFAENGTVQYKHGRLLSKQTIQNHLGEELLQDLINFCLSYALLRLPKKRGTFIEFRN -----%%%%----iiii-----3333---------------------------------- GLNISPIGRSCTLEERIEFSELDKKEKIREKFVEALKTEFAGKGLRFSRGGISFDVFPEG ----3333-------------------------------2222--------------222 WDKRYCLDSLDQDSFDTIHFFGNETSPGGNDFEIFADPRTVGHSVVSPQDTVQRCREIFF 2------------------------2222-------1111-------------------3 PET 333 >LARGE T ANTIGEN; SWP:P03070; PDB:2FUFA; KVEDPKDFPSELLSFLSHAVFSNRTLACFAIYTTKEKAALLYKKIEKYSVTFISRHNSYN --------33331111--1111-----------------------1111--------iii HNILFFLTPHRHRVSAINNYAQKLCTFSFLICKGVNKEYLYSALTRDPFSVIEESLPGGL i-----------3333--------------------33333333-----------2222- KEHD ---- >CONSERVED HYPOTHETICAL PR; SWP:Q8PBH4; PDB:2FUJA; KILARVPISVRWRDMDSMGHVNNAKYISYLEEARVRWMLGVEGVAMTDRIAPVVAATNVN ----------3333-1111----------------------------------------- YKRPLVWPNDILVELFVERLGSSSVTIGHRILDQKDEGVLYSDGNVVVVWIDTQTGKS --------------------1111--------3333---------------------- >XC6422 PROTEIN; SWP:Q4V016; PDB:2FUKA; NPLFPTESAALTLDGPVGPLDVAVDLPEPDVAVQPVTAIVCHPLSTEGGSMHNKVVTMAA --------------1111-------------------------3333--1111------- RALRELGITVVRFNFRSVGTSAGSFDHGDGEQDDLRAVAEWVRAQRPTDTLWLAGFSFGA --3333--------2222-------iiii----------------1111----------- 
YVSLRAAAALEPQVLISIAPPAGRWDFSDVQPPAQWLVIQGDADEIVDPQAVYDWLETLE --------------------2222-----------------------3333---1111-- QQPTLVRMPDTSHFFHRKLIDLRGALQHGVRRWLPATP -----------1111--------------1111----- >EUKARYOTIC TRANSLATION IN; SWP:P38431; PDB:2FULA; SPEFVNSELTQLDEYGEWILEQAGEDKENLPSDVELYKKAAELDVLNDPKIGCVLAQCLF -----------------------1111-----------------1111------------ DEDIVNEIAEHNAFFTKILVTPEYEKNFMGGIERFLGLEHKDLIPLLPKILVQLYNNDII 1111--1111-----------------------------33331111------------- SEEEIMRFGTKSSKKFVPKEVSKKVRRAAKPFITWLETAEL ----------------------------------------- >HYPOTHETICAL PROTEIN PA33; SWP:Q9HYP4; PDB:2FUPA; PDSPTLLDLFAEDIGHANQLLQLVDEEFQALERRELPVLQQLLGAKQPLQQLERNGRARA -----------------------------------3333------3333----------- EILREAGVSLDREGLARYARERADGAELLARGDELGELLERCQQANLRNGRIANQASTGS ---1111-----------1111------------------------------1111---- LLNILR ------ >HEPARINASE II PROTEIN; SWP:Q46080; PDB:2FUQA; TKADVVWKDVDGVSMPIPPKTHPRLYLREQQVPDLKNRMNDPKLKKVWADMIKMQEDWKP ---------iiii--------------33331111------1111-------3333--33 ADIPEVKDFRFYFNQKGLTVRVELMALNYLMTKDPKVGREAITSIIDTLETATFKPAGDI 33---------------3333------------3333---------------------33 SRGIGLFMVTGAIVYDWCYDQLKPEEKTRFVKAFVRLAKMLECGYPPVKDKSIVGHASEW 33---------------1111-3333--------------3333----------1111-- MIMRDLLSVGIAIYDEFPEMYNLAAGRFFKEHLVARNWFYPSHNYHQGMSYLNVRFTNDL --------------------------------------3333------------------ FALWILDRMGAGNVFNPGQQFILYDAIYKRRPDGQILAGGDVDYSRKKPKYYTMPALLAG ----------------3333------11111111-------------------------- SYYKDEYLNYEFLKDPNVEPHCKLFEFLWRDTQLGSRKPDDLPLSRYSGSPFGWMIARTG 1111-------3333---3333--------1111----1111------------------ WGPESVIAEMKVNEYSFLNHQHQDAGAFQIYYKGPLAIDAGSYTGSSGGYNSPHNKNFFK -1111------------!!!!--2222-----------------33331111-----111 RTIAHNSLLIYDPKETFSSSGYGGSDHTDFAANDGGQRLPGKGWIAPRDLKEMLAGDFRT 11111------1111---1111-------------------------------------- GKILAQGFGPDNQTPDYTYLKGDITAAYSAKVKEVKRSFLFLNLKDAKVPAAMIVFDKVV 
-----------------------3333--------------------------------- ASNPDFKKFWLLHSIEQPEIKGNQITIKRTKNGDSGMLVNTALLPDAANSNITSIGGKGK --1111--------------!!!!------iiii-----------1111-------2222 DFWVFGTNYTNDPKPGTDEALERGEWRVEITPKKAAAEDYYLNVIQIADNTQQKLHEVKR ---iiii------22221111---------------------------1111-------- IDGDKVVGVQLADRIVTFSKTSETVDRPFGFSVVGKGTFKFVMTDLLPGTWQVLKDGKIL --2222----!!!!----1111--------------------------------iiii-- YPALSAKGDDGALYFEGTEGTYRFLR ------3333---------------- >HYPOTHETICAL PROTEIN; SWP:Q9HIG7; PDB:2FURA; RASYSDEDLVALDRNFTCTVSFIDGGIPYAIPLASEGKTIYLHGSKSRIYGILKTGQLIA -----------1111--------%%%%--------!!!!--------------------- ISLLEINGIVLAKEIKNNSINYVSALIFGRPYEIDDTEKKIEVFRLLTEKLVKGRWDNSI -------------3333----------------------------------2222----- KPSYEDLNGVFVFAVKPETFSKARTGPPHDTSTDDIWSGVLPIQHTISEAGENAPEYVKS --33331111----------------------------------------1111333311 LYGKRIFI 11------ >PHOSPHOGLUCOMUTASE; SWP:Q8ZQW9; PDB:2FUVA; AIHNRAGQPAQQSDLINVAQLTAQYYVLKPEAGNAEHAVKFGTSGHRGSAGRHSFNEPHI --1111----3333--------3333----22221111----------1111---3333- LAIAQAIAEERAKNGITGPCYVGKDTHALSEPAFISVLEVLAANGVDVIVQENNGFTPTP -----------1111-----------3333-----------1111------%%%%--333 AVSNAILVHNKKGGPLADGIVITPSHNPPEDGGIKYNPPNGGPADTNVTKVVEDRANALL 3--------1111----------!!!!1111------1111---3333------------ AGGLQGVKRISLDAAASGHVKAVDLVQPFVEGLADIVDAAIQKAGLTLGVDPLGGSGIEY ----------3333------------------1111-----3333------%%%%----- WKRIAEHYKLNLTLVNDQVDQTFRFHLDKDGAIRDCSSECAAGLLALRDKFDLAFANDPD -----1111----------1111----1111---1111----33333333-------111 YDRHGIVTPAGLNPNHYLAVAINYLFQHRPLWGKDVAVGKTLVSSAIDRVVNDLGRKLVE 1------1111------------1111-11111111----1111-------1111----- VPVGFKWFVDGLFDGSFGFGGEESAGASFLRFDGTPWSTDKDGIICLLAAEITAVTGKNP ---3333--------------1111-----1111-------------------------- QEHYNELAARFGAPSYNRLQASATSAQKAALSKLSPEVSASTLAGDPITARLTAAPGNGA -----------------------------3333---------iiii-------------- SIGGLKVTDNGWFAARPSGTEDAYKIYCESFLGEEHRKQIEKEAVEIVSEVLKNA 
-------1111-------------------------------------------- >RCD1 REQUIRED FOR CELL DI; SWP:Q4R347; PDB:2FV2A; REKIYQWINELSSPETRENALLELSKKRESVPDLAPMLWHSFGTIAALLQEIVNIYPSIN ------------3333----------33331111------2222----------3333-- PPTLTAHQSNRVCNALALLQCVASHPETRSAFLAAHIPLFLYPFLHTVSKTRPFEYLRLT --------------------33331111-------------3333--------------- SLGVIGALVKTDEQEVINFLLTTEIIPLCLRIMESGSELSKTVATFILQKILLDDTGLAY ------1111--3333--------------------3333-------------------- ICQTYERFSHVAMILGKMVLQLSKEPSARLLKHVVRCYLRLSDNPRAREALRQCLPDQLK --------------------1111-------------------------------3333- DTTFAQVLKDDTTTKRWLAQLVKNLQE -11111111------------------ >Deformed epidermal autore; SWP:NA; PDB:2FV6A; SCVNCGREAMSECTGCHKVNYCSTFCQRKDWKDHQHICGQSA ----------------------3333---3333--------- >RIBOKINASE; SWP:Q9H477; PDB:2FV7A; VAAVVVVGSCMTDLVSLTSRLPKTGETIHGHKFFIGFGGKGANQCVQAARLGAMTSMVCK ----------------------2222----------------------1111-------- VGKDSFGNDYIENLKQNDISTEFTYQTKDAATGTASIIVNNEGQNIIVIVAGANLLLNTE --------------1111---------------------3333-------!!!!------ DLRAAANVISRAKVMVCQLEITPATSLEALTMARRSGVKTLFNPAPAIADLDPQFYTLSD --------1111---------3333--------1111--------------33331111- VFCCNESEAEILTGLTVGSAADAGEAALVLLKRGCQVVIITLGAEGCVVLSQTEPEPKHI ------------------3333--------1111-------!!!!-----1111------ PTEKVKAVDTTGAGDSFVGALAFYLAYYPNLSLEDMLNRSNFIAAVSVQAAGTQSSYPYK ---------2222--------------1111---------------1111--3333--33 KDLPLTLF 33-3333- >RHO-RELATED GTP-BINDING P; SWP:P62745; PDB:2FV8A; IRKKLVVVGDGACGKTCLLIVFSKDEFPFENYVADIEVDGKQVELALWDTAGQEDYDRLR ---------2222------------------------iiii--------2222------- PLSYPDTDVILMCFSVDSPDSLENIPEKWVPEVKHFCPNVPIILVANKKDLRSDEHVRTE ---2222-------1111------------------2222---------3333------- LARMKQEPVRTDDGRAMAVRIQAYDYLECSAKTKEGVREVFETATRAALQKRYGS -1111-------------------------1111--------------------- >ENDOGLUCANASE; SWP:Q9X0D9; PDB:2FVGA; GYLKELSPGVSGDEGKVRDFIKSKIEGLVDNLYTDVLGNLIALKRGRDSSKKLLVSAHDE 
-3333----2222-----------1111------1111---------------------- VGFVVSKIEKDGKVSFLPVGGVDPRILPGKVVQVKNLKGVIGYRPPRFENLRIDFGFSSA --------1111----------33332222---%%%%---------3333--------33 DEAKKYVSIGDYVSFVSDYIEKNGRAVGKAFDDRAGCSVLIDVLESGVSPAYDTYFVFTV 33-----2222----------iiii----------------------------------- QEESAVVVEQLKPTCAIVVETTTAGDNPELEERKWATHLGDGPAITFYHRGYVIPKEIFQ ---3333-------------------3333-------2222-------------3333-- TIVDTAKNNDIPFQKRRTYGVPAGVISTPARYIHSPNSIIDLNDYENTKKLIKVLVEEGK ------1111------------------------------------------------33 IVEVVS 331111 >UREASE GAMMA SUBUNIT; SWP:P0A676; PDB:2FVHA; RLTPHEQERLLLSYAAELARRRRARGLRLNHPEAIAVIADHILEGARDGRTVAELMASGR ----------------------1111------------------------------3333 EVLGRDDVMEGVPEMLAEVQVEATFPDGTKLVTVHQPIA ---3333-22223333--------1111----------- >GERANYLGERANYL PYROPHOSPH; SWP:O95749; PDB:2FVIA; ETVQRILLEPYKYLLQLPGKQVRTKLSQAFNHWLKVPEDKLQIIIEVTEMLHNASLLIDD -------------3333------------------------------------------- IEDNSKLRRGFPVAHSIYGIPSVINSANYVYFLGLEKVLTLDHPDAVKLFTRQLLELHQG -------iiii-3333----------------------33331111-------------- QGLDIYWRDNYTCPTEEEYKAMVLQKTGGLFGLAVGLMQLFSDYKEDLKPLLNTLGLFFQ --------------------------------------1111------------------ IRDDYANLHSKSFCEDLTEGKFSFPTIHAIWSRPESTQVQNILRQRTENIDIKKYCVHYL --------------3333--------------3333------------------------ EDVGSFEYTRNTLKELEAKAYKQIDARGGNPELVALVKHLSKMFK ------------------------1111------------1111- >DIHYDROPYRIMIDINASE; SWP:Q9P903; PDB:2FVKA; PIYDLIIKNGIICTASDIYAAEIAVNNGKVQLIAASIDPSLGSEVIDAEGAFITPGGIDA -------------------------iiii--------3333------iiii--------- HVHVDEPLKLLGDVVDTMEHATRSAVAGGTTTVVAFSTQDVSKKGPSALAESVKLDVDEY -----1111------------------------------3333-1111-----------1 SEQTLYCDYGLHLILFQIEKPSVEARELLDVQLQAAYNDYGVSSVMFMTYPGLQISDYDI 111------------------3333------------------------2222------- MSAMYATRKNGFTTMLHAENGDMVKWMIEALEEQGLTDAYYHGVSRPSIVEGEATNRAIT -------------------------------1111--3333------------------- 
LATTMDTPILFVHVSSPQAAEVIKQAQTKGLKVYAETCPQYALLSDAITRCHGVGIDLSS -------------------------------------3333----3333-------3333 ISESPFTNPDDRFIGSKYICSPPIRPEGTQKSIWKGMNNGTFTIVGSDHCSYNYYEKTST ---33333333--------------22223333-------------------------11 ASKHRAFDPENNKNGEFRYIPNGLPGVCTRMPLLYDYGYLRGNLTSMMKLVEIQCTNPAK 11-----3333----3333------3333---------1111------------------ VYGMYPQKGSILPGVSDADLVIWYPDDSKKEYNSKPKLITNKLMEHNCDYTPFEGIEIKN -----------2222----------------3333----3333-------1111------ WPRYTIVKGKIVYKEGEILKENADGKYLKRGKSFMCTPKNEWVTEWRPKYE ------iiii---iiii-3333----------1111-----------1111 >ALDO-KETO REDUCTASE FAMIL; SWP:P17516; PDB:2FVLA; MDPKYQRVELNDGHFMPVLGFGTYAPPEVPRNRAVEVTKLAIEAGFRHIDSAYLYNNEEQ -3333----1111------------33333333------------------3333-3333 VGLAIRSKIADGSVKREDIFYTSKLWCTFFQPQMVQPALESSLKKLQLDYVDLYLLHFPM --------------3333-------1111-1111-------------------------- ALKPGETPLPKDENGKVIFDTVDLSATWEVMEKCKDAGLAKSIGVSNFNYRQLEMILNKP -----------1111-------------------1111-----------------1111- GLKYKPVCNQVECHPYLNQSKLLDFCKSKDIVLVAHSALGTQRHKLWVDPNSPVLLEDPV -------------1111---------1111------1111---3333-1111-1111--- LCALAKKHKRTPALIALRYQLQRGVVVLAKSYNEQRIRENIQVFEFQLTSEDMKVLDGLN --------------------1111---------------1111------------1111- RNYRYVVMDFLMDHPDYPFSDEY -------1111--1111------ >CONSERVED HYPOTHETICAL PR; SWP:Q6N5Y9; PDB:2FVTA; MAQRSEIPHFPRTAAIDAYGKGGFYFAGMSHQGSLLFLPDAVWGWDVTKPEQIDRYSLQR -------------------2222--------------3333------------3333--- VFDNANAIDTLIVGTGADVWIAPRQLREALRGVNVVLDTMQTGPAIRTYNIMIGERRRVA ---3333-----------------------1111-------------------------- AALIAVPLEHHHHHH ------1111----- >Diphosphoinositol polypho; SWP:O95989; PDB:2FVVA; QTRTYDGDGYKKRAACLCFRSESEEEVLLVSSSRHPDRWIVPGGGMEPEEEPSVAAVREV -----1111-----------1111-------3333-----------2222---------- CEEAGVKGTLGRLVGIFENQERKHRTYVYVLIVTEVLEDWEDSVNIGRKREWFKIEDAIK -------------------1111------------------------------------- VLQYHKPVQASYFET ------33331111- >D-GALACTOSE-BINDING PERIP; SWP:A1AD10; 
PDB:2FVYA; DTRIGVTIYKYDDNFMSVVRKAIEQDAKAAPDVQLLMNDSQNDQSKQNDQIDVLLAKGVK ---------1111----------------1111------%%%%-----------1111-- ALAINLVDPAAAGTVIEKARGQNVPVVFFNKEPSRKALDSYDKAYYVGTDSKESGIIQGD -------3333--------1111-----------------1111-----3333------- LIAKHWAANQGWDLNKDGQIQFVLLKGEPGHPDAEARTTYVIKELNDKGIKTEQLQLDTA ------------1111-----------2222--------------1111----------i MWDTAQAKDKMDAWLSGPNANKIEVVIANNDAMAMGAVEALKAHNKSSIPVFGVDALPEA iii-------------1111---------------------11113333----------- LALVKSGALAGTVLNDANNQAKATFDLAKNLADGKGAADGTNWKIDNKVVRVPYVGVDKD ------------------------------1111-1111------iiii--------333 NLAEF 31111 >INOSITOL MONOPHOSPHATASE ; SWP:O14732; PDB:2FVZA; WEECFQAAVQLALRAGQIIRKALTEETETDHLVEDLIISELRERFPSHRFIAEEAKCVLT 3333---------------3333---------------------1111------------ HSPTWIIDPIDGTCNFVHRFPTVAVSIGFAVRQELEFGVIYHCTEERLYTGRRGRGAFCN ------------------------------%%%%-----------------2222----- GQRLRVSGETDLSKALVLTEIGPKRDPATLKLFLSNMERLLHAKAHGVRVIGSSTLALCH ----------3333-----------3333------------------------------- LASGAADAYYQFGLHCWDLAAATVIIREAGGIVIDTSGGPLDLMACRVVAASTREMAMLI --------------3333-3333---1111----3333---1111--------------- AQAL ---- >TESTIS-SPECIFIC CHROMODOM; SWP:Q9Y6F7; PDB:2FW2A; YRDIVVKKEDGFTQIVLSTRSTEKNALNTEVIKEMVNALNSAAADDSKLVLFSAAGSVFC --------2222----------%%%%---------------------------------- CGLDFGYFVRHLRNDRNTASLEMVDTIKNFVNTFIQFKKPIVVSVNGPAIGLGASILPLC ----3333-----------------------------------------------3333- DLVWANEKAWFQTPYTTFGQSPDGCSSITFPKMMGKASANEMLIAGRKLTAREACAKGLV -----1111----3333-----%%%%----------------------------1111-- SQVFLTGTFTQEVMIQIKELASYNAIVLEECKALVRCNIKLELEQANERECEVLRKIWSS -------------------1111-----------3333------------------1111 AQGIESMLKYVENKIDEF ------------------ >DHC, DIHEME CYTOCHROME C; SWP:Q3J4W3; PDB:2FW5A; ALVVTDPLTRTECSACHMAYPAALLPARSWTALMADLPNHFGEDASLDEASRGQIESYLV ----------1111------3333-3333------3333iiii----------------- ANAADSSGAGRALRGLVQTDTPLRISELPWFKRKHADEVSPRMLEKARSMSNCAACHTGA 
--1111---3333---1111---1111-------1111----------333333331111 ERGLF ----- >THIOL:DISULFIDE INTERCHAN; SWP:P36655; PDB:2FWHA; HLNFTQIKTVDELNQALVEAKGKPVMLDLYADWCVACKEFEKYTFSDPQVQKALADTVLL -------------------2222-------1111---------1111----1111----- QANVTANDAQDVALLKHLNVLGLPTILFFDGQGQEHPQARVTGFMDAETFSAHLRDR ---3333----------------------1111--3333------------------ >U6 SNRNA-ASSOCIATED SM-LI; SWP:Q5CXX3; PDB:2FWKA; GNIILPLALIDKCIGNRIYVVMKGDKEFSGVLRGFDEYVNMVLDDVQEYGFRVMVNRLET ------------2222-------------------------------------------- ILLSGNNVAMLVPGGDP ---1111---------- >2,3-dihydro-2,3-dihydroxy; SWP:P15047; PDB:2FWMX; MDFSGKNVWVTGAGKGIGYATALAFVEAGAKVTGFDQAFTQEQYPFATEVMDVADAAQVA --2222-----3333----------1111----------------------3333----- QVCQRLLAETERLDALVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFNLFQQTMNQFRR ------1111----------------1111------------------------------ QRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRCNVVSPGSTRPQ ----------3333---2222--------------------3333------------333 EIANTILFLASDLASHITLQDIVVDGGSTLGA 3---------3333----------iiiiiiii >DHC, DIHEME CYTOCHROME C; SWP:Q3J4W3; PDB:2FWTA; LVVTDPLTRTECSACHMAYPAALLPARSWTALMADLPNHFGEDASLDEASRGQIESYLVA ---------1111------3333-3333------3333iiii------------------ NAADSSGTPLRISELPWFKRKHADEVSPRMLEKARSMSNCAACHTGAERGLF -1111-----1111------3333-----------333333331111----- >SODIUM/CALCIUM EXCHANGER ; SWP:P23685; PDB:2FWUA; HAGIFTFEEPVTHVSESIGIMEVKVLRTSGARGNVIVPYKTIEGTARGGGEDFEDTCGEL -----------------------------------------------iiii--------- EFQNDEIVKTISVKVIDDEEYEKNKTFFLEIGEPRLVEMSEKKGGFTITEEYDDKQPLTS --3333------------------------------------------------------ KEEEERRIAEMGRPILGEHTKLEVIIEESYEFKSTVD --3333--3333------------------------- >HYPOTHETICAL PROTEIN MTUB; SWP:A1KGU6; PDB:2FWVA; AAVERAKATAARNIPAFDDLPVPADTANLREGADLNNALLALLPLVGVWRGEGEGRGPDG -----------------------------------33331111-------------1111 DYRFGQQIVVSHDGGDYLNWESRSWRLTATGDYQEPGLREAGFWRFVAIELLLAHSAGYV ---------------------------1111----------------------------- 
ELFYGRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDLAYVEERVDADGGLVPH ----------------------1111------------2222---------1111----- LSARLSRFVG ---------- >HEMOLYSIN II REGULATORY P; SWP:Q7X506; PDB:2FX0A; SREQTENILKAAKKKFGERGYEGTSIQEIAKEAKVNVAASYYFNGKENLYYEVFKKYGLA 3333----------------1111----------------1111--------------33 NELPNFLEKNQFNPINALREYLTVFTTHIKENPEIGTLAYEEIIKESARLEKIKPYFIGS 33-------%%%%---------------------3333--------1111---1111--- FEQLKEILQEGEKQGVFHFFSINHTIHWITSIVLFPKFKKFIDSDLVSRIISALTDK --------------------------------------------------------- >LIPASE; SWP:A0HXJ4; PDB:2FX5A; APLPDTPGAPFPAVANFDRSGPYTVSSQSEGPSCRIYRPRDLGQGGVRHPVILWGNGTGA ---------------1111-----------------------2222----------2222 GPSTYAGLLSHWASHGFVVAAAETSNAGTGREMLACLDYLVRENDTPYGTYSGKLNTGRV 3333-------3333-----------1111------------------1111---1111- GTSGHSQGGGGSIMAGQDTRVRTTAPIQPYTLGLGHDSASQRRQQGPMFLMSGGGDTIAF -----------------1111-----------iiii--3333----------1111---3 PYLNAQPVYRRANVPVFWGERRYVSHFEPVGSGGAYRGPSTAWFRFQLMDDQDARATFYG 333---------------------1111----!!!!--------------33333333-- AQCSLCTSLLWSVERRGL --3333------------ >Envelope glycoprotein gp1; SWP:P05880; PDB:2FX7H; QVQLVQSGAEVKRPGSSVTVSCKASGGSFSTYALSWVRQAPGRGLEWMGGVIPLLTITNY ------------2222-----------------------2222---------1111---- APRFQGRITITADRSTSTAYLELNSLRPEDTAVYYCAREGTTGWGWLGKPIGAFAHWGQG 3333----------------------3333------------------------------ TLVTVSSASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTF ------------------------------------------------iiii-------- PAVLQSSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK ------------------1111------------1111---------- >Putative uncharacterized ; SWP:Q6P5S8; PDB:2FX7L; EIVLTQSPGTQSLSPGERATLSCRASQSVGNNKLAWYQQRPGQAPRLLIYGASSRPSGVA -------------2222-----------2222-------2222------------2222- DRFSGSGSGTDFTLTISRLEPEDFAVYYCQQYGQSLSTFGQGTKVEVKRTVAAPSVFIFP -------------------1111------------------------------------- PSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTL 
-3333-------------------------%%%%-------------------------- TLSKADYEKHKVYACEVTHQGLSSPVTKSFNRGE --33331111--------1111--------2222 >PROTEASE PRODUCTION REGUL; SWP:P11065; PDB:2FXAA; PPYDVKEALVFTQKAQLSKALWKSIEKDWQQWLKPYDLNINEHHILWIAYQLNGASISEI --------------------------------3333-------------1111------- AKFGVHVSTAFNFSKKLEERGYLRFSKRTYVQLTEEGTEVFWSLLEEFDPTRNAVFKGSQ -1111------------1111----------------------3333-3333-------- PLYHLFGKFPEVAECIRHIYGDDFEIFETS ---------------3333-33333333-- >POL PROTEIN; SWP:NA; PDB:2FXDA; PQITLWKRPIVTVKIGGQLKEALLDTGADDTVIEEMNLPGKWKPKIIGGIGGFIKVRQYD --------------iiii------1111--------------------1111-------- QIIIEIAGHKAIGTVLVGPTPFNVIGRNLMTQIGATLNF -----iiii----------------3333-1111----- >POL PROTEIN; SWP:O38566; PDB:2FXEA; PQITLWKRPIVTVKIGGQLKEALLDTGADDTVIEEMNLPGKWKPKMIGGIGGFIKVRQYD --------------iiii------1111--------------------1111-------- QIIIEIAGHKAIGTVLVGPTPVNIIGRNLLTQIGATLNF -----iiii-------------------3333------- ------------------------------------------------------------ ------------------------------------------------------------ --- >SINGLE-STRAND BINDING PRO; SWP:Q9KH06; PDB:2FXQA; RGLNQVFLIGTLTARPDMRYTPGGLAILDLNLAGQDAFTDESGQEREVPWYHRVRLLGRQ --------------------1111---------------1111----------------- AEMWGDLLEKGQLIFVEGRLEYRSEVQVRAEFIDPLEGRRRALNQVILMGNLTRDPDLRY ---1111-2222------------------------------------------------ TPQGTAVVRLGLAVNERRTHFLEVQAWRELAEWASELRKGDGLLVIGRLVNDSWTSSSGE 1111-----------------------------11112222--------------1111- RRFQTRVEALRLERPTR ----------------- >ACTIN, ALPHA SKELETAL MUS; SWP:P68135; PDB:2FXUA; ETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQGVDSYVGDEAQSKRGILTLKYP -----------------2222---------------3333-------------------- IEGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFETFNVPA -------------------------3333-------22223333---------------- MYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMK -----------1111-------------------iiii-3333----------------- 
ILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKSYELPDGQVITIG --3333------------------------------------1111----1111------ NERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIAD -----3333----1111-------------11111111-----------1111-2222-- RMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVH ----------1111------1111-------------11111111---------3333-- >S1A STEM-LOOP RNA; SWP:Q15376; PDB:2FY1A; MVEADHPGKLFIGGLNRETNEKMLKAVFGKHGPISEVLLIKDRTSKSRGFAFITFENPAD --------------%%%%---------3333----------%%%%--------------- AKNAAKDMNGKSLHGKAIKVEQAKKPSFQSGGRRRPPASSRNRSPSGS ------------------------------------------------ >CHOLINE O-ACETYLTRANSFERA; SWP:P28329; PDB:2FY2A; SEESGLPKLPVPPLQQTLATYLQCMRHLVSEEQFRKSQAIVQQFGAPGGLGETLQQKLLE ------------3333--------1111-----------------2222----------- RQEKTANWVSEYWLNDMYLNNRLALPVNSSPAVIFARQHFPGTDDQLRFAASLISGVLSY ------1111-------3333--------------------3333--------------- KALLDSHSIPTGQPLCMKQYYGLFSSYRLPGHTQDTLVAQMPEPEHVIVACCNQFFVLDV ---1111---------3333------------------------------%%%%------ VINFRRLSEGDLFTQLRKIVKMASNAAARLPPIGLLTSDGRSEWAEARTVLVKDSTNRDS -%%%%-------------------3333-----3333----------------------- LDMIERCICLVCLDAPGGVELSDTHRALQLLHGGGYSKNGANRWYDKSLQFVVGRDGTCG ---1111------------------------iiii---1111-3333------1111--- VVCEHSPFDGIVLVQCTEHLLKHMTQPELVRSPMVPLPAPRRLRWKCSPEIQGHLASSAE ----3333-------------3333----------------------3333--------- KLQRIVKNLDFIVYKFDNYGKTFIKKQKCSPDAFIQVALQLAFYRLHRRLVPTYESASIR -----1111---------------1111------------------------------33 RFQEGRVDNIRSATPEALAFVRAVTDHKAAVPASEKLLLLKDAIRAQTAYTVMAITGMAI 33-----------------------3333------------------------1111--- DNHLLALRELARAMCAALPEMFMDETYLMSNRFVLSTSQVPTTTEMFCCYGPVVPNGYGA ------------------3333------1111---------------------1111--- CYNPQPETILFCISSFHSCAATSSSKFAKAVEESLIDMRDLCSLL ----1111-------3333-------------------------- >PEPTIDE METHIONINE SULFOX; SWP:Q9JWM8; PDB:2FY6A; VPHTLSTLKTADNRPASVYLKKDKPTLIKFWASWCPLCLSELGQTEKWAQDAKFSSANLI 
3333-----1111-3333--1111-------1111---------------3333------ TVASPGFLHEKKDGDFQKWYAGLNYPKLPVVTDNGGTIAQSLNISVYPSWALIGKDGDVQ ---2222------------1111-1111----2222-----------------1111--- RIVKGSINEAQALALIRDPNADL -----------------1111-- >BETA-1,4-GALACTOSYLTRANSF; SWP:P15291; PDB:2FY7A; RGSASLPACPEESPLLVGPMLIEFNMPVDLELVAKQNPNVKMGGRYAPRDCVSPHKVAII ------------------------------------1111-------------------- IPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVINQAGDTIFNRAKLLNVGFQEALKDYDY ---------------------1111----------------------------3333--- TCFVFSDVDLIPMNDHNAYRCFSQPRHISVAMDKFGFSLPYVQYFGGVSALSKQQFLTIN -------------------------------3333-----1111---------------- GFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAVVGTTRHIRNPQRFDRIAHTKETMLSDG --------------------1111------3333--------------------1111-3 LNSLTYQVLDVQRYPLYTQITVDIGTPS 333----------1111----------- >PUTATIVE TRANSITION STATE; SWP:P39758; PDB:2FY9A; MKSIGVVRKVDELGRIVMPIELRRALDIAIKDSIEFFVDGDKIILKKYKPHGVC -3333-------------3333-------------------------------- >PHOSPHOSERINE AMINOTRANSF; SWP:A2VGI0; PDB:2FYFA; HLEIPTAIKPRDGRFGSGPSKVRLEQLQTLTTTAAALFGTSHRQAPVKNLVGRVRSGLAE ----3333-------------------33331111-22221111---------------1 LFSLPDGYEVILGNGGATAFWDAAAFGLIDKRSLHLTYGEFSAKFASAVSKNPFVGEPII 111-2222-------3333----------------------------------------- ITSDPGSAPEPQTDPSVDVIAWAHNETSTGVAVAVRRPEGSDALVVIDATSGAGGLPVDI ---2222------1111--------------------2222----------2222---33 AETDAYYFAPQKNFASDGGLWLAIMSPAALSRIEAIAATGRWVPDFLSLPIAVENSLKNQ 33---------1111---------------------3333---3333-------3333-- TYNTPAIATLALLAEQIDWLVGNGGLDWAVKRTADSSQRLYSWAQERPYTTPFVTDPGLR --------------------1111----------------------1111-----3333- SQVVGTIDFVDDVDAGTVAKILRANGIVDTEPYRKLGRNQLRVAMFPAVEPDDVSALTEC ---------1111---------1111------1111---------33333333------- VDWVVERL ----1111 >REPLICASE POLYPROTEIN 1AB; SWP:P59641; PDB:2FYGA; HHHHHNSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMD ----3333-------------------------------------------------111 
QESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRNTVCTVCGMWK 1---3333------------3333---2222----3333-------------------22 GYGCSCDQ 22------ >PUTATIVE INTEGRAL MEMBRAN; SWP:Q8U4Q3; PDB:2FYHA; MRAFIAIDVSESVRDALVRAQDYIGSKEAKIKFVERENFHITLKFLGEITEEQAEEIKKI ---------3333---------------------1111-----------3333------- LEKIAKKYKKHEVNVRGIGVFPNPNYVRVIWAGVENDEIIKKIAKEIDDELAKLGFKKEG ---1111-----------------------------3333-----------3333----- NFVAHITLGRVKFVKDKLGLAMKLKELANEDFGSFIVEAIELKKSTLTPKGPIYETLARF ----------------3333----1111-------------------3333--------- ELSEHHHHHH ---------- >HTH-TYPE TRANSCRIPTIONAL ; SWP:Q47083; PDB:2FYIA; SHMTNDTSGVLTIATTHTQARYSLPEVIKAFRELFPEVRLELIQGTPQEIATLLQNGEAD ---1111---------------------------1111---------------------- IGIASERLSNDPQLVAFPWFRWHHSLLVPHDHPLTQISPLTLESIAKWPLITYRQGITGR ----------1111-----------------3333-----33333333-----2222-33 SRIDDAFARKGLLADIVLSAQDSDVIKTYVALGLGIGLVAEQSSGEQEEENLIRLDTRHL 33----3333-----------------------------1111-------------1111 FDANTVWLGLKRGQLQRNYVWRFLELCNAGLSVEDIKRQVMES ----------2222--3333----------------------- >Low-density lipoprotein r; SWP:Q07954; PDB:2FYJA; SARTCPPNQFSCASGRCIPISWTCDLDDDCGDRSDESASCAYPTCFPLTQFTCNNGRCIN -----1111------------------------1111---------------3333---- INWRCDNDNDCGDNSDEAGCSH ---------------------- >ENOLASE; SWP:P0A6P9; PDB:2FYMA; SKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLG ------------1111---------2222------------------------1111iii KGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKA i----------------22223333-------------1111------------------ AAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKE --1111--------11112222------------!!!!-------------1111----- AIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDI ----------------1111-----1111------------------------------- TLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGF ------3333--iiii--1111------------------------------1111---- 
AYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKD ---------------3333---3333---1111-------1111--------------11 AGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAP 11--------------3333-----------------3333-------------!!!!-- YNGRKEIKGQA --33332222- >CHYMOTRYPSIN-LIKE CYSTEIN; SWP:Q83883; PDB:2FYQA; APPTLWSRVTKFGSGWGFWVSPTVFITTTHVVPTGVKEFFGEPLSSIAIHQAGEFTQFRF -33333333--!!!!------------3333-------iiii3333-----!!!!----- SKKMRPDLTGMVLEEGCPEGTVCSVLIKRDSGELLPLAVRMGAIASMRIQGRLVHGQSGM ----3333---------2222-------3333----------------iiii-------- LLTIPGDCGAPYVHKRGNDWVVCGVHAAATKSGNTVVCAVQAGEGETALE ---3333--------!!!!-------------------------1111-- >PROTEIN ARGININE N-METHYL; SWP:O60678; PDB:2FYTA; FSSYGHYGIHEEMLKDKIRTESYRDFIYQNPHIFKDKVVLDVGCGTGILSMFAAKAGAKK 3333------------3333---------33332222------!!!!------------- VLGVDQSEILYQAMDIIRLNKLEDTITLIKGKIEEVHLPVEKVDVIISEWMGYFLLFESM -----------------11111111------1111--------------------22223 LDSVLYAKNKYLAKGGSVYPDICTISLVAVSDVNKHADRIAFWDDVYGFKMSCMKKAVIP 333---------2222-----------------------------iiii---3333-333 EAVVEVLDPKTLISEPCGIKHIDCHTTSISDLEFSSDFTLKITRTSMCTAIAGYFDIYFE 3------3333-----------3333-3333----------------------------2 KNCHNRVVFSTGPQSTKTHWKQTVFLLEKPFSVKAGEALKGKVTVHKNKKDPRSLTVTLT 222--------1111--3333--------------------------3333--------- LNNSTQTYGLQ %%%%------- >Ubiquinol-cytochrome c re; SWP:P07552; PDB:2FYUK; MLTRFLGPRYRQLARNWVPTASLWGAVGAVGLVWATDWRLILDWVPYINGKFK --------------------------------------3333----------- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2FYWA; ALASEVIQAYEAFCPQEFSEGDSRGLQIGTLDKGIQRVVALDIREETVAEAIEKGVDLII -3333---------1111------------------------------------------ VKHAPIFRPIKDLLASRPQNQIYIDLIKHDIAVYVSHTNIDIVENGLNDWFCQLGIEETT -------------1111---------1111------3333--2222-------------- YLQETGPERGIGRIGNIQPQTFWELAQQVKQVFDLDSLRVHYQEDDLQKPISRVAICGGS -----2222---------------------------------1111-------------- GQSFYKDALAKGADVYITGDIYYHTAQDLSDGLLALDPGHYIEVIFVEKIAALLSQWKED 
1111----1111----------------1111-------3333----------------- KGWSIDILPSQASTNPFHHI -------------------- >TRANSPOSASE, PUTATIVE; SWP:Q9RXX8; PDB:2FYXA; MKKGRGYVYKLEYHLIWATKYRHQVLVDEVADGLKDILRDIATQNGLELVALEVMPDYVH ---2222-----------2222----!!!!------------1111-------------- LLLGATPQHVIPDFVKALKGASARRMFSAFPHLKQPHWGGNLWNPSYCVLTVSEHTRAQI -----1111-------------------------3333--------------1111---- QQYIENQHAA ---1111--- >v-SNARE component of the ; SWP:Q12255; PDB:2FZ0A; MKRFNVSYVEVIKNGETISSCFQPFQKNENYGTITSANEQITPVIFHNLIMDMVLPKVVP ------------%%%%-------------------------------------1111--- IKGNKVTKMSMNLIDGFDCFYSTDDHDPKTVYVCFTLVDIPKILPIRILSGLQEYESNAT --------------------------1111------3333-----------3333----- NELLSSHVGQILDSFHEELVEYRNQTLNS --3333----------------------- >DNA REPAIR PROTEIN RAD25; SWP:O29889; PDB:2FZ4A; IAEIYYERGTIVVKGDAHVPHAKFDSRSGTYRALAFRYRDIIEYFESNGIEFVDNAADPI ------%%%%--------2222-----------3333--------1111----------- PTPYFDAEISLRDYQEKALERWLVDKRGCIVLPTGSGKTHVAMAAINELSTPTLIVVPTL ----------------------------------------------------------33 ALAEQWKERLGIFGEEYVGEFSGRIKELKPLTVSTYDSAYVNAEKLGNRFMLLIFDEVHH 33-------11113333----3333-----------------3333-------------- LPAESYVQIAQMSIAPFRLGLTATFE ---------1111------------- >FLAVODOXIN; SWP:P00321; PDB:2FZ5A; MVEIVYWSGTGNTEAMANEIEAAVKAAGADVESVRFEDTNVDDVASKDVILLGCPAMGSE ------------------------1111------3333-33331111------------- ELEDSVVEPFFTDLAPKLKGKKVGLFGSYGWGSGEWMDAWKQRTEDTGATVIGTAIVNEM --3333-------3333------------------------------------------- PDNAPECKELGEAAAKA -------------1111 >HYDROPHOBIN-1; SWP:P52754; PDB:2FZ6A; NGNGNVCPPGLFSNPQCCATQVLGLIGLDCKVPSQNVYDGTDFRNVCAKTGAQPLCCVAP ---1111---------------------------------------3333---------- VAGQALLCQTAV ------------ >HYPOTHETICAL PROTEIN; SWP:Q8U1L6; PDB:2FZFA; GLPIEKMADFSLEELLGMAIKAEIGAREFYKSLAEKIKIEALKEKINWLAEEEKKHEALL --33331111----------------------3333--------------1111------ RKLYSQMFPGKEVVFPKEHIGPELQPVARELEKVQDIIDLIRWAMKAEEIAAEFYLKLEE 
-------2222---------------------3333------------------------ MVKEEEKKRLMRYLADMERGHYYTLRAEYELLLNWEMY ---3333------------------------------- >DIHYDROFOLATE REDUCTASE; SWP:P16184; PDB:2FZIA; MNQQKSLTLIVALTTSYGIGRSNSLPWKLKKEISYFKRVTSFVPTFDSFESMNVVLMGRK -------------1111--------------------------3333------------- TWESIPLQFRPLKGRINVVITRNESLDLGNGIHSAKSLDHALELLYRTYGSESSVQINRI -11113333--2222----------------------------------1111------- FVIGGAQLYKAAMDHPKLDRIMATIIYKDIHCDVFFPLKFRDKEWSSVWKKEKHSDLESW --------------1111--------------------11111111------3333---- VGTKVPHGKINEDGFDYEFEMWTRDL -----------%%%%----------- >DIHYDROFOLATE REDUCTASE; SWP:P00375; PDB:2FZJA; VRPLNCIVAVSQNMGIGKNGDLPWPPLRNEFKYFQRMTTTSSVEGKQNLVIMGRKTWFSI -----------------iiii-----3333------------2222----------1111 PEKNRPLKDRINIVLSRELKEPPRGAHFLAKSLDDALRLIEQPELASKVDMVWIVGGSSV 3333--2222------------2222---------------3333--------------- YQEAMNQPGHLRLFVTRIMQEFESDTFFPEIDLGKYKLLPEYPGVLSEVQEEKGIKYKFE -------------------------------1111------2222------iiii----- VYEKKD ------ >DNA REPAIR PROTEIN RAD25,; SWP:O29889; PDB:2FZLA; HLAKYTIKRIFVPLAEDERVEYEKREKVYKQFLRARGITLRRAEDFNKIVMASGYDERAY ---------------3333--------------1111-3333-------1111------- EALRAWEEARRIAFNSKNKIRKLREILERHRKDKIIIFTRHNELVYRISKVFLIPAITHR -----------------------------1111-------3333---------------- TSREEREEILEGFRTGRFRAIVSSQVLDEGIDVPDANVGVIMSGSGSAREYIQRLGRILR ------------------------------------------------------------ PSKGKKEAVLYELISRG ----------------- >RING FINGER PROTEIN 41 IS; SWP:Q9H4P4; PDB:2FZPA; SSGLVPRGSTIEYNEILEWVNSLQPARVTRWGGMISTPDAVLQAVIKRSLVESGCPASIV -----2222----------1111------1111-----------------1111-3333- NELIENAHERSWPQGLATLETRQMNRRYYENYVAKRIPGKQAVVVMACENQHMGDDMVQE -------3333-1111----------3333------2222-----33333333-1111-- PGLVMIFAHGVEEI -------------- >ATP-DEPENDENT CLP PROTEAS; SWP:P0A6G7; PDB:2FZSA; LVPMVIERSFDIYSRLLKERVIFLTGQVEDHMANLIVAQMLFLEAENPEKDIYLYINSPG ------------------------------------------------------------ 
GVITAGMSIYDTMQFIKPDVSTICMGQAASMGAFLLTAGAKGKRFCLPNSRVMIHQPLGG -----------------------------------11112222---1111---------- YQGQATDIEIHAREILKVKGRMNELMALHTGQSLEQIERDTERDRFLSAPEAVEYGLVDS --------------------------------------1111------------------ ILTHRN ------ >HYPOTHETICAL PROTEIN TM06; SWP:Q9WZF7; PDB:2FZTA; NIDEIERKIDEAIEKEDYETLLSLLNKRKELEGLPKDKLSEILEKDRKRLEIIEKRKTAL ---------------------------3333----------------------------- FQEINVIREARSSLQK ------------1111 >PUTATIVE ARSENICAL RESIST; SWP:NA; PDB:2FZVA; AMRLRHLSDPDSLPALDKSFAIERPALGLAPDAPPVRILLLYGSLRARSFSRLAVEEAAR --------111111113333---1111--------------------------------- LLQFFGAETRIFDPSDLPLPDQVQSDDHPAVKELRALSEWSEGQVWCSPERHGQITSVMK --1111------------22222222------------------------%%%%------ AQIDHLPLEMAGIRPTQGRTLAVMQVSGGSQSFNAVNTLRLLGRWMRMFTIPNQSSIAKA --1111---%%%%--2222---------------------------------------33 FQEFDAAGRMKPSPYYDRIADVMEELVRFTALVRPHREALTDRYSERKAAGHVI 33--1111------------------------1111-3333------------- >ALCOHOL DEHYDROGENASE CLA; SWP:P11766; PDB:2FZWA; ANEVIKCKAAVAWEAGKPLSIEEIEVAPPKAHEVRIKIIATAVCHTDAYTLSGADPEGCF -------------------------------------------3333--3333-1111-- PVILGHLGAGIVESVGEGVTKLKAGDTVIPLYIPQCGECKFCLNPKTNLCQKIRVTQGKG ---------------2222---2222------------3333-1111------------- LMPDGTSRFTCKGKTILHYMGTSTFSEYTVVADISVAKIDPLAPLDKVCLLGCGISTGYG -1111-----iiii----%%%%---------1111----11111111------------- AAVNTAKLEPGSVCAVFGLGGVGLAVIMGCKVAGASRIIGVDINKDKFARAKEFGATECI --------2222-------------------------------1111----1111----- NPQDFSKPIQEVLIEMTDGGVDYSFECIGNVKVMRAALEACHKGWGVSVVVGVAASGEEI 3333---3333--------------------------1111------------------- ATRPFQLVTGRTWKGTAFGGWKSVESVPKLVSEYMSKKIKVDEFVTHNLSFDEINKAFEL ------1111------%%%%--3333--------------3333-----3333------- MHSGKSIRTVVKI ------------- >MITOGEN-ACTIVATED PROTEIN; SWP:P45983; PDB:2G01A; FYSVEIGDSTFTVLKRYQNLKPIGSGAQGIVCAAYDAILERNVAIKKLSRPFQNQTHAKR -------------1111------------------1111--------------------- 
AYRELVLMKCVNHKNIIGLLNVFTPQKSLEEFQDVYIVMELMDANLCQVIQMELDHERMS ------------1111--------------------------------1111-------- YLLYQMLCGIKHLHSAGIIHRDLKPSNIVVKSDCTLKILDFGLARTAGTSFMMEPEVVTR -----------------------3333-------------------------------33 YYRAPEVILGMGYKENVDLWSVGCIMGEMVCHKILFPGRDYIDQWNKVIEQLGTPCPEFM 33-3333--------3333--------------------3333-11111111-------3 KKLQPTVRTYVENRPKYAGYSFEKLFPDVLFPADSEHNKLKASQARDLLSKMLVIDASKR 333-----------------3333---3333-----------------1111-------- ISVDEALQHPYINVWYDPSEAEAPPPKIPDKQLDEREHTIEEWKELIYKEVMDLEH --3333----------3333------------------------------------ >HYPOTHETICAL PROTEIN NMA0; SWP:Q7DDR9; PDB:2G03A; KSIDEQSLHNARRLFESGDIDRIEVGTTAGLQQIHRYLFGGLYDFAGQIREDNISKGGFR --------------11111111----------------22221111---------!!!!- FANAYLKEALVKIEQPERTFEEIIAKYVENIAHPFLEGNGRSTRIWLDLVLKKNLKKVVN -----------3333--------------3333--------------------------3 WQNVSKTLYLQAERSPVNDLELRFLLKDNLTDDVDNREIIFKGIEQSYYYEG 333----------3333---------------11113333------3333-- >PROBABLE FATTY-ACID-COA R; SWP:O53867; PDB:2G04A; GGPLAGVKVIELGGIGPGPHAGMVLADLGADVVRVRRPGGLTMPSEDRDLLHRGKRIVDL -1111--------------------1111-------1111----11111111-------- DVPQAMLELAAKADVLLDCFRPGTCERLGIGPDDCASVNPRLIFARITGWGQDGPLASTA ----33333333----------3333------3333--1111-----------3333--- GHDINYLSQTGALAAFGYADRPPMPPLNLVADFGGGSMLVLLGIVVALYERERSGVGQVV -33333333-3333---------------------------------------------- DAAMVDGVSVLAQMMWTMKGIGSLRDQRESFLLDGGAPFYRCYETSDGKYMAVGAIEPQF ------------------------------------1111----------------3333 FAALLSGLGLSAADVPTQLDVAGYPQMYDIFAERFASRTRDEWTRVFAGTDACVTPVLAW ----------3333--33331111---------1111---------2222--------33 SEAANNDHLKARSTVITAHGVQQAAPAPRFSRTPAGPVRPPPAAATPIDEINW 33-------1111---------------------------------------- >CYTOSOLIC 5'-NUCLEOTIDASE; SWP:Q9D020; PDB:2G09A; AVHLKPEFQKSSVRIKNPTRVEEIICGLIKGGAAKLQIITDFDTLSRFSYNGKRCPTCHN -----11113333-------------------1111-------------iiii---3333 
IIDNCKLVTDECRRKLLQLKEQYYAIEVDPVLTVEEKFPYVEWYTKSHGLLIEQGIPKAK ----1111------------------------3333---------------3333-3333 LKEIVADSDVLKEGYENFFGKLQQHGIPVFIFSAGIGDVLEEVIRQAGVYHSNVKVVSNF -------------3333-----1111------------------------1111------ DFDENGVLKGFKGELIHVFNKHDGALKNTDYFSQLKDNSNIILLGDSQGDLRADGVANVE --1111----------1111--------------3333--------3333--1111---- HILKIGYLNDRVDELLEKYDSYDIVLVKEESLEVVNSILQKTL ----------3333-3333------------------------ >FEEM; SWP:NA; PDB:2G0BA; SMTPRKVARILVAPNERDAARRIVRTTYEAQGYAIDESFATFLEGPSATTFGLFNGEVLY ------------3333----------------------------1111------iiii-- GTISIINDGAQGLPMDSIYAVELAAWRGEGKKLAEVVQFAMDHTLYEAVASPFEAASLFT --------11113333--3333----1111-----------3333-----1111------ MVLTYALETHIDYLCISINPKHDTFYSLLGFTQIGALKHYGTVNAPAIARALYVPEWRSQ ------------------3333------------------1111--------33331111 TLLAQFM 3333--- >ATP-DEPENDENT RNA HELICAS; SWP:P42305; PDB:2G0CA; MKLYFNGGKKAVDFVGTIAKIDGVSADDIGIITIMDNASYVEILNGKGPHVLKVMKNTTV --------------------22223333------1111-----%%%%------------- QLKVNKAN -------- >NISIN BIOSYNTHESIS PROTEI; SWP:Q03202; PDB:2G0DA; KKNIKRNVEKIIAQWDERTRKDFGELTLSTGLPGIILMLAELKNKDNSKIYQKKIDNYIE -----------------33333333----------------3333--------------- YIVSKLSTYGLLTGSLYSGAAGIALSILHLREDDEKYKNLLDSLNRYIEYFVREKIEGFN --------------1111--------3333---3333------------------11113 LENITPPDYDVIEGLSGILSYLLLINDEQYDDLKILIINFLSNLTKENNGLISLYIKSEN 333-3333-------------1111-3333--------------------------3333 QMSQSESEMYPLGCLNMGLAHGLAGVGCILAYAHIKGYSNEASLSALQKIIFIYEKFELE ---------1111--------------------1111----------------------! 
RKKQFLWKDGLVADELKKEKVIREASFIRDAWCYGGPGISLLYLYGGLALDNDYFVDKAE !!!---------------------------33333333---------1111--------- KILESAMQRKLGIDSYMICHGYSGLIEICSLFKRLLNTKKFDSYMEEFNVNSEQILEEYG ---------2222-------------------------1111------------------ DESGTGFLEGISGCILVLSKFEYSINFTYWRQALLLFDDFLKGG 1111------------------------3333----1111---- >HYPOTHETICAL PROTEIN SMU.; SWP:Q8DUQ5; PDB:2G0IA; SIQATFIRRKGILESVELTGHASGEYGFDIVCAAVSTLSNLVNALEVLADCTVSLQDEFD --------iiii------------------------------------------------ GGYKIDLSYITNKSDEKVQLLFEAFLLGITNLAENSPEFVTAKITQ ------1111-1111--------------------3333------- >AT5G39720.1 PROTEIN; SWP:Q9FIX2; PDB:2G0QA; CSSDSLQLHNVFVYGSFQDPDVINVMLDRTPEIVSATLPGFQRFRLKGRLYPCIVPSEKG --------------1111--------------------------------------1111 EVHGKVLMGVTSDELENLDAVEGNEYERVTVGIVREDNSEKMAVKTYMWINKADPDMFGE ------------------------------------------------------1111-- WNFEEWKRLHKKKFIETFKKIMECKKKPQ 3333----------------3333----- >CONSERVED HYPOTHETICAL PR; SWP:Q9WZQ3; PDB:2G0TA; HDLWKLYQPGTPAAIVAWGQLGTAHAKTTYGLLRHSRLFKPVCVVAEHEGKASDFVKPVR -3333--2222-----2222--3333---------------------22223333----- YDVPVVSSVEKAKEGAEVLIIGVSNPGGYLEEQIATLVKKALSLGDVISGLHFSQQTEFL ------------------------------------------------------3333-- KIAHENGTRIIDIRIPPLELDVLRGGIYRKKIKVVGVFGTDCVVGKRTTAVQLWERALEK -----------3333---------3333-----------------------------111 GIKAGFLATGQTGILIGADAGYVIDAVPADFVSGVVEKAVLKLEKTGKEIVFVEGQGALR 1---------------------3333-3333------------1111----------111 HPAYGQVTLGLLYGSNPDVVFLVHDPSRDHFESFPEIPKKPDFEEERRLIETLSNAKVIG 1-----------------------1111--2222----------------1111------ GVSLNGGFETDLPVYDPFNTDDLDELERAVW ---------------1111------3333-- >TYPE III SECRETION SYSTEM; SWP:Q63K18; PDB:2G0UA; MSNPPTPLLADYEWSGYLTGIGRAFDDGVKDLNKQLQDAQANLTKNPSDPTALANYQMIM ---------------------11111111------------------------------- SEYNLYRNAQSSAVKSMKDIDSSILEHHHHHH ---------------3333------------- >LMO2234 PROTEIN; SWP:Q8Y542; PDB:2G0WA; 
KCPITISSYTLGTEVSFPKRVKVAAENGFDGIGLRAENYVDALAAGLTDEDLRILDEHNK --------1111------------1111-------------------3333----1111- VTEVEYITQWGTAEDRTAEQQKKEQTTFHARLFGVKHINCGLLEKIPEEQIIVALGELCD -----------3333---------------1111------------3333---------3 RAEELIIGLEFPYSGVADLQAAWRVAEACGRDNAQLICDTWHWARANQTAESIKNVPADR 333---------------------------1111---------1111-333322221111 IVSIQLCDVHETPYKELREESLHDRLAPGEGYGDTVGFAKILKEHGVNPRVGVEVISDSV --------------------------2222-----------------------------3 ATGLEYAALKVYNATKKVLDEAWPEISPR 333-------------------3333--- >ACTIVATED MET ONCOGENE; SWP:P08581; PDB:2G15A; LLQNTVHIDLSALNPELVQAVQHVVIGPSSLIVHFNEVIGRGHFGCVYHGTLLDNDGKKI -1111----3333-------1111--3333---1111-!!!!-1111------------- HCAVKSLNRITDIGEVSQFLTEGIIMKDFSHPNVLSLLGICLRSEGSPLVVLPYMKHGDL ------------------------------1111--------------------1111-- RNFIRNETHNPTVKDLIGFGLQVAKGMKYLASKKFVHRDLAARNCMLDEKFTVKVADFGL ------------------------------1111------3333---1111------!!! ARDMYDKEYYSVHNKTGAKLPVKWMALESLQTQKFTTKSDVWSFGVLLWELMTRGAPPYP !-1111----11112222--1111--------------------------1111------ DVNTFDITVYLLQGRRLLQPEYCPDPLYEVMLKCWHPKAEMRPSFSELVSRISAIFSTFI --1111----1111-----22223333-----1111-3333------------------- G - >N-ACETYL-GAMMA-GLUTAMYL-P; SWP:Q8ZKL8; PDB:2G17A; ALNTLIVGASGYAGAELVSYVNRHPHTITALTVSAQSNDAGKLISDLHPQLKGIVDLPLQ -------1111-------------------------1111--1111-3333--------- PSDVRDFSADVDVVFLATAHEVSHDLAPQFLQAGCVVFDLSGAFRVNDRAFYEKYYGFTH --3333------------------------1111-------------3333--------- QYPELLEQAVYGLAEWNVDKLNTANLIAVPGCYPTAAQLSLKPLIDGGLLDLTQWPVINA -3333-------3333-3333---------3333----------1111--3333------ TSGVSGAGRKAAISNSFCEVSLQPYGVFTHRHQPEIAVHLGAEVIFTPHLGNFPRGILET ----3333---33333333------22223333--------------------------- ITCRLKAGVTHAQVADVLQKAYGDKPLVRLYDKGVPALKNVVGLPFCDIGFAVQGEHLIV -----2222------------1111-----------33332222---------!!!!--- VATEDNLLKGAAAQAVQCANIRFGFAETQSLI ----1111-----------------1111--- >PHYCOCYANOBILIN:FERREDOXI; SWP:Q93TN0; PDB:2G18A; 
ISLTSIPSLREQQHPLIRQLADCIEEVWHQHLDLSPYHLPAELGYVEGRLEGEKLTIENR --------3333---------------------------3333------iiii------- CYQTPQFRKMHLELAKVGNMLDILHCVMFPRPEYDLPMFGCDLVGGRGQISAAIADLSPV ---1111-----------------------3333-----------iiii----------- HLDRTLPESYNSALTSLNTLNFSQPRELPEWGNIFSDFCIFVRPSSPEEEAMFLGRVREF 3333------------------------1111---1111--------------------- LQVHCQGAIAASPVSAEQKQQILAGQHNYCSKQQQNDKTRRVLEKAFGVDWAENYMTTVL ------------------------------------------------------------ FDLPE ----- >30S RIBOSOMAL PROTEIN S24; SWP:Q9HJ79; PDB:2G1DA; MDLIIKEKRDNPILKRKEIKYVLKFDSSRTPSREEIKELIAKHEGVDKELVIVDNNKQLT ----------------------------------------------3333---------- GKHEIEGYTKIYADKPSAMLYEPDYELIRNGLKQKEAK ---------------------1111------------- >HYPOTHETICAL PROTEIN TA08; SWP:Q9HJR9; PDB:2G1EA; MVTVRYYATLRPITKKKEETFNGISKISELLERLKVEYGSEFTKQMYDGNNLFKNVIILV -----------------------------------33333333--------3333----- NGNNITSMKGLDTEIKDDDKIDLFPPVAGG ---3333--------3333------3333- >PYRUVATE DECARBOXYLASE; SWP:Q12629; PDB:2G1IA; SEITLGRYLFERLKQVEVQTIFGLPGDFNLSLLDNIYEVPGMRWAGNANELNAAYAADGY -------------1111--------111133333333-2222------------------ ARLKGMSCIITTFGVGELSALNGIAGSYAEHVGVLHVVGVPSVSHHTLGNGDFTVFHRMS -------------3333----------------------------------1111----- SNISETTAMITDINTAPAEIDRCIRTTYVSQRPVYLGLPANLVDLTVPASLLDTPIDLSL -----------3333-----------------------3333------3333-------- KPNDPEAEEEVIENVLQLIKEAKNPVILADACCSRHDAKAETKKLIDLTQFPAFVTPMGK ---3333----------------------11111111------------------3333- GSIDEKHPRFGGVYVGTLSSPAVKEAVESADLVLSVGALLTKNIVEFHSDYTKIRSATFP ---1111-------!!!!--------1111-----------------------!!!!--- GVQMKFALQKLLTKVADAAKGYKPVPVPSEPEHNEAVADSTPLKQEWVWTQVGEFLREGD --------------33333333---------------1111--3333--3333---2222 VVITETGTSAFGINQTHFPNNTYGISQVLWGSIGFTTGATLGAAFAAEEIDPKKRVILFI ------3333--1111---------------2222---------------1111------ GDGSLQLTVQEISTMIRWGLKPYLFVLNNDGYTIERLIHGETAQYNCIQNWQHLELLPTF 
33333333-------1111-------------3333------3333-----3333--111 GAKDYEAVRVSTTGEWNKLTTDEKFQDNTRIRLIEVMLPTMDAPSNLVKQAQLTAATNAK 1-----------------11113333------------1111------------------ >KINESIN-LIKE PROTEIN KIF1; SWP:O43896; PDB:2G1LA; STPHLVNLNEDPLMSECLLYHIKDGVTRVGQVDMDIKLTGQFIREQHCLFRSIPQPDGEV ------------3333-----------------------1111-----------1111-- VVTLEPCEGAETYVNGKLVTEPLVLKSGNRIVMGKNHVFRFNH ------2222---iiii--------2222-------------- >DNA ADENINE METHYLASE; SWP:P0AEE8; PDB:2G1PA; KNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRYILADINSDLISL -------22223333----1111----------!!!!3333------------------- YNIVKMRTDEYVQAARELFVPETNCAEVYYQFREEFNKSQDPFRRAVLFLYLNRYGYNGL ---------------11113333---------------------------------%%%% CRYNLRGEFNVPFGRYKKPYFPEAELYHFAEKAQNAFFYCESYADSMARADDSSVVYCDP ---1111-------------------------1111-----3333-11111111------ PYAPLNSFTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRAKLHVVKKVDELLA -----------------------1111--------------1111--------------- LYKP ---- >HYPOTHETICAL PROTEIN TM10; SWP:A1GH50; PDB:2G1UA; KQKSKYIVIFGCGRLGSLIANLASSSGHSVVVVDKNEYAFHRLNSEFSGFTVVGDAAEFE ------------3333-------1111------------33331111-------111133 TLKECGMEKADMVFAFTNDDSTNFFISMNARYMFNVENVIARVYDPEKIKIFEENGIKTI 33---3333-----------------------------------3333----1111---- CPAVLMIEKVKEFIIGS ----------------- >PHENOXAZINONE SYNTHASE; SWP:Q53692; PDB:2G23A; APGELTPFAAPLTVPPVLRPASDEVTRETEIALRPTWVRLHPQLPPTLMWGYDGQVPGPT -----------------------3333-------------1111-------%%%%----- IEVRRGQRVRIAWTNRIPKGSEYPVTSVEVPLGPPGTPAPNTEPGRGGVEPNKDVAALPA ---2222----------2222-----------------1111----------3333---- WSVTHLHGAQTGGGNDGWADNAVGFGDAQLSEYPNDHQATQWWYHDHAMNITRWNVMAGL -----2222--3333--1111--2222--------------------2222----3333- YGTYLVRDDEEDALGLPSGDREIPLLIADRNLDTDEDGRLNGRLLHKTVIVQQSNPETGK -------33331111--!!!!-------------1111----------------3333-- PVSIPFFGPYTTVNGRIWPYADVDDGWYRLRLVNASNARIYNLVLIDEDDRPVPGVVHQI ------------iiii------------------------------1111---------- 
GSDGGLLPRPVPVDFDDTLPVLSAAPAERFDLLVDFRALGGRRLRLVDKGPGAPAGTPDP -1111----------3333-----2222-------1111----------11112222--1 LGGVRYPEVMEFRVRETCEEDSFALPEVLSGSFRRMSHDIPHGHRLIVLTPPGTKGSGGH 111---------------------------------1111----------1111--iiii PEIWEMAEVEQVPAEGVIQVTGADGRTKTYRRTAATFNDGLGFTIGEGTHEQWTFLNLSP -------------2222----1111----------1111------2222----------- ILHPMHIHLADFQVLGRDAYDASGFDLALGGTRTPVRLDPDTPVPLAPNELGHKDVFQVP ---------------------1111-------------1111----1111---------- GPQGLRVMGKFDGAYGRFMYHCHLLEHEDMGMMRPFVVMPPEALKFD ---------------------------------------33331111 >NITRATE TRANSPORT PROTEIN; SWP:P73452; PDB:2G29A; NAPEVTTAKLGFIALTDAAPLIIAKEKGFYAKYGMPDVEVLKQASWGTTRDNLVLGSASG --------------1111------1111----------------------------1111 GIDGAHILTPMPYLITMGTVTDGKPTPMYILARLNVNGQGIQLGNNYKDLKVGTDAAPLK -------3333------1111----------------------33331111----3333- EAFAKVTDPKVAMTFPGGTHDMWIRYWLAAGGMEPGKDFSTIVVPPAQMVANVKVNAMES 3333----------2222----------1111-2222-------1111----1111---- FCVGEPWPLQTVNQGVGYQALTTGQLWKDHPEKAFGMRADWVDQNPKAAKALLMAVMEAQ ---------------------3333----------------------------------- QWCDQAENKEEMCQILSKREWFKVPFEDIIDRSKGIYNFGNGQETFEDQEIMQKYWVDNA ----3333---------3333---3333-----------iiii----3333-----%%%% SYPYKSHDQWFLTENIRWGYLPASTDTKAIVDKVNREDLWREAAQALEVPADQIPSSPSR -------------------------------------------------1111------- GIETFFDGITFDPENPQAYLDSLKI ----1111---1111----1111-- >Putative molybdenum cofac; SWP:Q6NJA6; PDB:2G2CA; HIKSAIIVVSDRISTGTRENKALPLLQRLSDYSYELISEVVVPEGYDTVVEAIATALKQG ------------------------------------------------------------ ARFIITAGGTGIRAKNQTPEATASFIHTRCEGLEQQILIHGGLSRGIVGVTGRDDHAALI ------------1111------1111---------------------------------- VNAPSSSGGITDTWAVISPVIPNIFEGLDA ----------------3333---------- >ATP:COBALAMIN ADENOSYLTRA; SWP:P64803; PDB:2G2DA; TDARLVAYADCDEANAAIGAALALGHPDTQITDVLRQIQNDLFDAGADLSTPIVENPKHP ------------------------------------------------------------ 
PLRIAQSYIDRLEGWCDAYNAGLPALKSFVLPGGSPLSALLHVARTVVRRAERSAWAAVD ----3333----------3333-------------------------------------- AHPEGVSVLPAKYLNRLSDLLFILSRVANPDGDVLWRPGG -1111-----------------------1111-------- >ATP-DEPENDENT RNA HELICAS; SWP:Q9UHL0; PDB:2G2JA; LTLNNIRQYYVLCEHRKDKYQALCNIYGSITIGQAIIFCQTRRNAKWLTVEMIQDGHQVS -----------------------1111-------------------------1111---- LLSGELTVEQRASIIQRFRDGKEKVLITTNVCARGIDVKQVTIVVNFDLPVPDYETYLHR ------3333---------------------------1111------------------- IGRTKGLAFNMIEVDELPSLMKIQDHFNSSIKQLNAED ------------3333---------------------- >EUKARYOTIC TRANSLATION IN; SWP:P55010; PDB:2G2KA; LSVNVNRSVMDQFYRYKMPRLIAKVEGKGNGIKTVIVNMVDVAKALNRPPTYPTKYFGCE -----1111------------------!!!!-------------------1111------ LGAQTQFDVKNDRYIVNGSHEANKLQDMLDGFIKKFVLCPECENPETDLHVNPKKQTIGN --------------------3333--------3333----------------1111---- SCKACGYRGMLDTHHKLCTFILKNPPENSDSGTGKK ------------------------------------ >TRANSTHYRETIN-LIKE PROTEI; SWP:P76341; PDB:2G2NA; NILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYR ---------------------------------------1111----------------- VVFKTGDYFKKQNLESFFPEIPVEFHINKVNEHYHVPLLLSQYGYSTYRGS ---------1111---------------1111------------------- >GLUTAREDOXIN-2; SWP:P68460; PDB:2G2QA; KNVLIIFGKPYCSICENVSDAVEELKSEYDILHVDILSFFLKDGDSSRGTLIGNFAAHLS --------------------33331111-------------3333------------333 NYIVSIFKYNPQTKQAFVDINKSLDFTKTDKSLVNLEILKSEIEKATYGVWP 3------------------3333-1111-3333--3333------------- >GREEN-FLUORESCENT ANTIBOD; SWP:NA; PDB:2G2RA; DVVMTQTPLTLSVTIGQPASISCRSSQSLLYINGKTHLNWLLQRPGQSPKRLIYLVSKLD -------------2222-------------1111---------2222------------2 SGVPDRFSGSGSGTDFTLKISRVEAEDLGIYFCLQSTHFPLTFGAGTKLELKRADAAPTV 2223333----------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPRDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ----------------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC ----------3333---------1111-------3333- 
>GREEN-FLUORESCENT ANTIBOD; SWP:NA; PDB:2G2RB; QVQLQQSGPVLVKPGTSLKMSCKASGYTFTAYYMNWMKQSHGKRLEWIAVINPYNGFTTY ------------2222-----------1111--------%%%%----------------- NQKFKGKATLTVDKSSNTAYMDLNSLTSEDSAVYYCVPYDYAADRVYWGHGTLVTVSTAK 3333---------1111---------3333----------1111---------------- TTAPSVYPLAPVCGGTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSGLY ---------------------------------------iiii----------------- TLSSSVTVTSNTWPSQTITCNVAHPASSTKVDKKIEPRG -----------------------3333------------ >Beta-lactamase inhibitory; SWP:P35804; PDB:2G2UB; AGVMTGAKFTQIQFGMTRQQVLDIAGAENCETGGSFGDSIHCRGHAAGDYYAYATFGFTS ------------2222---------3333---!!!!----------!!!!---------- AAADAKVDSKSQEKLLAPSAPTLTLAKFNQVTVGMTRAQVLATVGQGSCTTWSEYYPAYP -1111----------------------11112222---------1111-------1111- STAGVTLSLSCFDVDGYSSTGFYRGSAHLWFTDGVLQGKRQWDLV -2222-------1111---------------iiii---------- >HYPOTHETICAL PROTEIN PP52; SWP:Q88CH6; PDB:2G2XA; AYWLKSEPDELSIEALARLGEARWDGVRNYQARNFLRASVGDEFFFYHSSCPQPGIAGIA ------3333----------------------------2222------------------ RITRAAYPDPTALDPESHYHDAKATTDKNPWSAVDVAHVQTFPRVLELGRLKQQAGLVEL ---------33333333---11111111-------------------------1111--3 PLVQKGSRLSVPVTPEQWAVIVALRL 3332222--------------3333- >AP-2 COMPLEX SUBUNIT BETA; SWP:P63010; PDB:2G30A; GGYVAPKAVWLPAVKAKGLEISGTFTHRQGHIYMEMNFTNKALQHMTDFAIQFNKNSFGV -----------3333iiii--------iiii------------------------1111- IPSTPLAIHTPLMPNQSIDVSLPLNTLGPVMKMEPLNNLQVAVKNNIDVFYFSCLIPLNV ------------2222------------------1111------3333--------3333 LFVEDGKMERQVFLATWKDIPNENELQFQIKECHLNADTVSSKLQNNNVYTIAKRNVEGQ --------------------1111--------------------1111--------%%%% DMLYQSLKLTNGIWILAELRIQPGNPNYTLSLKCRAPEVSQYIYQVYDSILKN --------1111-----------------------3333-------------- >RETICULON-4; SWP:Q9NQC3; PDB:2G31A; RIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDD -----------3333------------------------3333-3333------------ >TRYPTOPHANYL-TRNA SYNTHET; SWP:Q9WYW2; PDB:2G36A; 
HMRILSGMRPTGKLHIGHLVGALENWVKLQEEGNECFYFVADWHALTTHYDDVSKLKEYT --------------3333-----------------------3333---1111-------- RDLVRGFLACGIDPEKSVIFVQSGVKEHAELALLFSMIVSVSRLERVPTYKEIDLSTAGF -------1111-3333----3333----------1111--------3333---------- LIYPVLQAADILIYKAEGVPVGEDQVYHIELTREIARRFNYLYDEVFPEPEAILSRVPKL -----------1111------3333----------------------------------- PGTDGRKMSKSYGNIINLEISEKELEQTILRMMTDPARVRRSDPGNPENCPVWKYHQAFD --------3333----------------------3333-1111--3333----------- ISEEESKWVWEGCTTASIGCVDCKKLLLKNMKRKLAPIWENFRKIDEDPHYVDDVIMEGT ------------1111-------------------------------2222--------- KKAREVAAKTMEEVRRAMNLMF ---------------------- >proline dehydrogenase/del; SWP:Q72IB8; PDB:2G37A; LAYRSFVLGVAGHPQVERLIKHRAKGLVRRYVAGETLEEALKAAEALEREGVHAILDLLG ---------3333----------------------------------1111--------- EMVRTEEEARAFQRGLLELVWALAGKPWPKYISLKLTQLGLDLSEDLALALLREVLREAE ----------------------2222--------3333-3333----------------1 PRGVFVRLDMEDSPRVEATLRLYRALREEGFSQVGIVLQSYLYRTEKDLLDLLPYRPNLR 111--------3333-----------1111-----------3333------3333----- LVKGAYREPKEVAFPDKRLIDAEYLHLGKLALKEGLYVAFATHDPRIIAELKRYTEAMGI --------3333-------------------1111------------------------- PRSRFEFQFLYGVRPEEQRRLAREGYTVRAYVPYGRDWYPYLTRRIAERPEN 1111-----2222--------------------------------3333--- >PE FAMILY PROTEIN; SWP:Q7TYL3; PDB:2G38A; PEALTVAATEVRRIRDRAIQSDAQVAPMTTAVRPPAADLVSEKAATFLVEYARKYRQTIA 3333--------------------3333-------------------------------- AAAVVLEEFAHALTTGA ----------------- >PPE FAMILY PROTEIN; SWP:Q79FE1; PDB:2G38B; AFEAYPPEVNSANIYAGPGPDSMLAAARAWRSLDVEMTAVQRSFNRTLLSLMDAWAGPVV 1111-3333--------------------------------------------------- MQLMEAAKPFVRWLTDLCVQLSEVERQIHEIVRAYEWAHHDMVPLAQIYNNRAERQILID -------------------------------------------3333------------- NNALGQFTAQIADLDQEYDDFWDEDGEVMRDYRLRVSDALSKLTPWKAPPPIA -1111----------------------------------1111---------- >ACETYL-COA HYDROLASE; SWP:Q9HTC2; PDB:2G39A; 
RDRVRLPSLLDKVSAAEAADLIQDGTVGSGFTRAGEAKAVPQALARAKERPLRISLTGAS -----33331111-----1111--------iiii-----------3333----------- LGNDLDKQLTEAGVLARRPFQVDSTLRKAINAGEVFIDQHLSETVEQLRNHQLKLPDIAV -%%%%----1111----------------1111------1111----------------- IEAAAITEQGHIVPTTSVGNSASFAIFAKQVIVEINLAHSTNLEGLHDIYIPTYRPTRTP ------1111-------!!!!--------------11111111----------------- IPLTRVDDRIGSTAIPIPPEKIVAIVINDQPDSPSTVLPPDGETQAIANHLIDFFKREVD ----1111---------3333-------------------------------------11 AGRSNSLGPLQAGIGSIANAVCGLIESPFENLTYSEVLQDSTFDLIDAGKLRFASGSSIT 11-1111--------------1111-------------3333------------------ LSPRRNADVFGNLERYKDKLVLRPQEISNHPEVVRRLGIIGINTALEFDIYGNVNSTHVG --------333333331111---------3333---------------1111-------- GTKNGIGGSGDFARNAHLAIFVTKSIAKGGNISSVVPVSHVDHTEHDVDILVTEQGLADL -----!!!!---1111-----------iiii------------3333-----3333---2 RGLAPRERARVIIENCVHPSYQAPLLDYFEAACAKGGHTPHLLREALAWHLNLEERGHLA 222--------------3333----------3333------3333--------------- G - >ACETYLTRANSFERASE; SWP:Q7CXI0; PDB:2G3AA; NFVLSDVADAEAEKAIRDPLVAYNLARFGESDKRDLNITIRNDDNSVTGGLVGHTARGWL -----------------------------------------1111----------iiii- YVQLLFVPEARGQGIAPKLLAAEEEARKRGCGAYIDTNPDALRTYERYGFTKIGSLGPLS ----------------------------------------------------------11 SGQSITWLEKRF 11---------- >PUTATIVE TETR-FAMILY TRAN; SWP:Q5Z2L0; PDB:2G3BA; SERRDAILKASATAIAQRGIRGLRVNDVAEVAGVSPGLLYYHFKDRIGLLEAALNYINDR 3333--------------3333-3333-------3333---------------------- ARAYRSEGEGGDSARDRLTRSLLGEIQDRPEVVENSLAWNELRASAVYEEALRDPLARTT -----2222-------------3333----------------------3333-------- AAWVSEIADAIVQAQATGEISRSLDPQPTAVTTALVEGLSGRWLCKEISTEDARSHLLGA --------------------3333------------------1111-------------- IDVVS ----- >IMIDAZOLONEPROPIONASE; SWP:P42084; PDB:2G3FA; PKQIDTILINIGQLLTMESSGPRAGKSMQDLHVIEDAVVGIHEQKIVFAGQKGAEAGYEA --------------------------1111-----------iiii-------1111---- DEIIDCSGRLVTPGLVDPHTHLVFGGSREKEMNLKLQGISYLDILAQGGGILSTVKDTRA 
-----%%%%------------------1111-------------1111-3333------- ASEEELLQKAHFHLQRMLSYGTTTAEVKSGYGLEKETELKQLRVAKKLHESQPVDLVSTF ------------------------------------------------------------ MGAHAIPPEYQNDPDDFLDQMLSLLPEIKEQELASFADIFTETGVFTVSQSRRYLQKAAE ------3333-------------------------------2222--------------- AGFGLKIHADEIDPLGGAELAGKLKAVSADHLVGTSDEGIKKLAEAGTIAVLLPGTTFYL ---------------------1111------1111------------------------- GKSTYARARAMIDEGVCVSLATDFNPGSSPTENIQLIMSIAALHLKMTAEEIWHAVTVNA -----------1111--------------------------------------------- AYAIGKGEEAGQLKAGRSADLVIWQAPNYMYIPYHYGVNHVHQVMKNGTIVVNR -------------2222----------33333333----------iiii----- >ALPHA-GLUCOSIDASE; SWP:O59645; PDB:2G3MA; ILKIYENKGVYKVVIGEPFPPIEFPLEQKISSNKSLSELGLTIVQQGNKVIVEKSLDLKE ------%%%%------------------------3333------------------1111 HIIGLGEKAFELDRKRKRYVMYNVDAGAYKKYQDPLYVSIPLFISVKDGVATGYFFNSAS ----------------------------------------------iiii---------- KVIFDVGLEEYDKVIVTIPEDSVEFYVIEGPRIEDVLEKYTELTGKPFLPPMWAFGYMIS ---------1111------------------3333---------------3333------ RYSYYPQDKVVELVDIMQKEGFRVAGVFLDIHYMDSYKLFTWHPYRFPEPKKLIDELHKR -----3333--------1111--------3333-%%%%-------------------111 NVKLITIVDHGIRVDQNYSPFLSGMGKFCEIESGELFVGKMWPGTTVYPDFFREDTREWW 1----------------------2222---1111------3333-----3333------- AGLISEWLSQGVDGIWLDMNEPTDFSRAIEIRDVLSSLPVQFRDDRLVTTFPDNVVHYLR -------1111---------------------------------3333---1111---ii GKRVKHEKVRNAYPLYEAMATFKGFRTSHRNEIFILSRAGYAGIQRYAFIWTGDNTPSWD ii--33333333-------------1111-----------2222-------------333 DLKLQLQLVLGLSISGVPFVGCDIGGFQGRNFAEIDNSMDLLVKYYALALFFPFYRSHKA 3-----------1111------2222----------------------1111-------1 TDGIDTEPVFLPDYYKEKVKEIVELRYKFLPYIYSLALEASEKGHPVIRPLFYEFQDDDD 111---3333---------------------------------------3333-111133 MYRIEDEYMVGKYLLYAPIVSKEESRLVTLPRGKWYNYWNGEIINGKSVVKSTHELPIYL 33---------------------------------------------------------- REGSIIPLEGDELIVYGETSFKRYDNAEITSSSNEIKFSREIYVSKLTITSEKPVSKIIV 
--------%%%%----------1111---------------------------------% DDSKEIQVEKTMQNTYVAKINQKIRGKINLE %%%--------2222---------------- ------------------------------------------- >TUMOR SUPPRESSOR P53-BIND; SWP:Q12888; PDB:2G3RA; SFVGLRVVAKWSSNGYFYSGKITRDVGAGKYKLLFDDGYECDVLGKDILLCDPIPLDTEV -2222---------------------iiii----1111-----3333-------2222-- TALSEDEYFSAGVVKGHRKESGELYYSIEKEGQRKWYKRMAVILSLEQGNRLREQYGLG ---1111------------iiii------iiii----3333---3333----------- >CAG PATHOGENICITY ISLAND ; SWP:Q75XJ1; PDB:2G3VA; KLIESLQENELLNTDEKKKIIDQIKTHDFFKQHTNKGALDKVLRNYKDYRAVIKSIGVDK --3333-------------------------------------------3333------- FKKVYRLLESETELLHAIAENPNFLFSKFDRSILGIFLPFFSKPIFKSIREDSQIELYGT --------------------1111--3333--33333333-3333--3333--------- KLPLLKLFVTDEENFYANLKTIEQYNDYVRDL 3333-----3333-3333--3333----1111 >HYPOTHETICAL PROTEIN XAC2; SWP:Q3BSD9; PDB:2G3WA; TATVRRAELQISDDRGYYANHSLTLAQHPSETDERLVRLLAFALFADDRLEFGRGLSNDD ---------------------------11113333-----------1111---------- EPDLWRRDYTGDPDLWIDLGQPDESRVRKACNRSREAVVIGYGGQATETWWKKHANAGRY -------1111--------------------------------3333------------- RNLRVIELDSQATEALGALIQRGRFDVIIQDGEVQLADHGSVTLTPVRQAPAE ----------------1111---------iiii---1111------------- >GTP-BINDING PROTEIN GEM; SWP:P55040; PDB:2G3YA; NTYYRVVLIGEQGVGKSTLANIFAGVHDSMDSDLGEDTYERTLMVDGESATIILLDMWEN ----------2222-------------3333-------------iiii--------3333 KGENEWLHDHCMQVGDAYLIVYSITDRASFEKASELRIQLRRARQTEDIPIILVGNKSDL ----------------------1111------------33333333----------3333 VRCREVSVSEGRACAVVFDCKFIETSAAVQHNVKELFEGIVRQVRLRRD 1111--3333----------------1111--------------3333- >CONSERVED HYPOTHETICAL PR; SWP:Q9RT57; PDB:2G40A; PLSRAEILHQFEDRILDYGAAYTHVSAAELPGAIAKALGNARRVIVPAGIPAPWLTVGMD -------------------------3333--------!!!!-----11113333-2222- VLRDEPPLSHAELDRADAVLTGCAVAISETGTIILDHRADQGRRALSLIPDFHICVVRED ---------------------------1111------1111-3333-----------111 QIVQTVREGVEAVAASVREGRPLTWLSGGSGVHGPRRLQVIVVG 
1---------------1111----------1111---------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P45974; PDB:2G45A; VRQVSKHAFSLKQLDNPARIPPCGWKCSKCDMRENLWLNLTDGSILCGRRYFDGSGGNNH ----1111------------------1111--------------------1111------ AVEHYRETGYPLAVKLGTITPDGADVYSYDEDDMVLDPSLAEHLSHFGIDMLKMQKT --------------2222-1111-----1111----1111---3333--1111---- >INSULIN-DEGRADING ENZYME; SWP:Q5T5N2; PDB:2G47A; PAIKRIGNHITKSPEDKREYRGLELANGIKVLLISDPTTDKSSAALDVHIGSLSDPPNIA ------------1111--------1111-------1111------------11111111- GLSHFCQHMLFLGTKKYPKENEYSQFLSEHAGSSNAFTSGEHTNYYFDVSHEHLEGALDR --------1111-3333--------------------------------1111------- FAQFFLCPLFDESCKDREVNAVDSEHEKNVMNDAWRLFQLEKATGNPKHPFSKFGTGNKY -3333---------------------1111-----------111111111111----333 TLETRPNQEGIDVRQELLKFHSAYYSSNLMAVCVLGRESLDDLTNLVVKLFSEVENKNVP 3-----1111---------------3333---------------------1111------ LPEFPEHPFQEEHLKQLYKIVPIKDIRNLYVTFPIPDLQKYYKSNPGHYLGHLIGHEGPG ---------3333------------------------3333-----------1111-222 SLLSELKSKGWVNTLVGGQKEGARGFMFFIINVDLTEEGLLHVEDIILHMFQYIQKLRAE 2-----------------------------------3333-------------------- GPQEWVFQECKDLNAVAFRFKDKERPRGYTSKIAGILHYYPLEEVLTAEYLLEEFRPDLI -----------------------------------1111-3333--1111---------- EMVLDKLRPENVRVAIVSKSFEGKTDRTEEWYGTQYKQEAIPDEVIKKWQNADLNGKFKL ---11113333------3333--------------------3333---1111--1111-- PTKNEFIPTNFEILPLEKEATPYPALIKDTAMSKLWFKQDDKFFLPKACLNFEFFSPFAY ----------------1111---------3333----------------------1111- VDPLHCNMAYLYLELLKDSLNEYAYAAELAGLSYDLQNTIYGMYLSVKGYNDKQPILLKK ---------------------3333-------------1111------------------ IIEKMATFEIDEKRFEIIKEAYMRSLNNFRAEQPHQHAMYYLRLLMTEVAWTKDELKEAL ----------------------------11113333----------------------33 DDVTLPRLKAFIPQLLSRLHIEALLHGNITKQAALGIMQMVEDTLIEHAHTKPLLPSQLV 33----------------------------------------------------3333-- RYREVQLPDRGWFVYQQRNEVHNNCGIEIYYQTDMQSTSENMFLELFCQIISEPCFNTLR -------2222------------------------------------------------- 
TKEQLGYIVFSGPRRANGIQGLRFIIQSEKPPHYLESRVEAFLITMEKSIEDMTEEAFQK ---------------iiii----------------------------------------- HIQALAIRRLDKPKKLSAECAKYWGEIISQQYNFDRDNTEVAYLKTLTKEDIIKFYKEML --------------3333--------1111--1111-------33333333--------- AVDAPRRHKVSVHVLAREMDSCPVVGNLSQAPALPQPEVIQNMTEFKRGLPLFPLVKPHI 1111-----------1111----------------------------------------- NFMA ---- >BROMODOMAIN-CONTAINING PR; SWP:A2CFH3; PDB:2G4AA; EQLKHCNVILKELLSKKHAAYAWPFYKPVDASALGLHDYHDIIKHPMDLSTVKRKMENRD ---------------1111---1111---3333--------------------------- YRDAQEFAADVRLMFSNCYKYNPPDHDVVAMARKLQDVFEFRYAKMPD -----------------------------------------3333--- >DNA POLYMERASE GAMMA SUBU; SWP:Q9UHN1; PDB:2G4CA; SEALLEICQRRHFLSGSKQQLSRDSLLSGCHPGFGPLGVELRKNLAAEWWTSVVVFREQV ---------------------3333----------------------------------- FPVDALHHKPGPLLPGDSAFRLVSAETLREILQDKELSKEQLVTFLENVLKTSGKLRENL ---------------------------------------11113333-1111-------- LHGALEHYVNCLDLVNKRLPYGLAQIGVCFHPVKSIGEKTEASLVWFTPPRTSNQWLDFW --------1111------------------------------------3333-------- LRHRLQWWRKFAMSPSNFSSSDCQDEEGRKGNKLYYNFPWGKELIETLWNLGDHELLHMY --------1111-3333------------------------------------------- PGNVSKLHGRDGRKNVVPCVLSVNGDLDRGMLAYLYDSFQRKVLKLHPCLAPIKVALDVG --1111-----------------------------1111-------1111---------- RGPTLELRQVCQGLFNELLENGISVWPGYLETMQSSLEQLYSKYDEMSILFTVLVTETTL ---3333-3333------------------------1111---------------1111- ENGLIHLRSRDTTMKEMMHISKLKDFLIKYISSAKNV -----------------------3333---------- >MOLYBDOPTERIN BIOSYNTHESI; SWP:Q7U129; PDB:2G4RA; TRSARIVVVSSRAAAGVYTDDCGPIIAGWLEQHGFSSVQPQVVADGNPVGEALHDAVNAG ------------1111-----------------------------3333-------3333 VDVIITSGGTGISPTDTTPEHTVAVLDYVIPGLADAIRRSGLPKVPTSVLSRGVCGVAGR ------------1111-----------------------------3333----------- TLIINLPGSPGGVRDGLGVLADVLDHALEQIAG -----------------------------1111 >PYRUVATE KINASE ISOZYMES ; SWP:P11974; PDB:2G50A; IQTQQLHAAMADTFLEHKCRLDIDSAPITARNTGIICTIGPASRSVETLKEMIKSGMNVA 
--%%%%3333-------11111111--------------3333----------------- RMNFSHGTHEYHAETIKNVRTATESFASDPILYRPVAVALDTKGPEIRTGLIKGSGTAEV -------------------------3333------------------------------- ELKKGATLKITLDNAYMEKCDENILWLDYKNICKVVDVGSKVYVDDGLISLQVKQKGPDF --2222------3333------------1111----2222----%%%%------------ LVTEVENGGFLGSKKGVNLPGAAVDLPAVSEKDIQDLKFGVEQDVDMVFASFIRKAADVH ------------------1111--------------------------------3333-- EVRKILGEKGKNIKIISKIENHEGVRRFDEILEASDGIMVARGDLGIEIPAEKVFLAQKM -------1111--------------------------------------3333------- IIGRCNRAGKPVICATQMLESMIKKPRPTRAEGSDVANAVLDGADCIMLSGETAKGDYPL ------------------3333---------------------------3333------- EAVRMQHLIAREAEAAMFHRKLFEELARASSQSTDLMEAMAMGSVEASYKCLAAALIVLT -------------1111----------1111-----------------1111-------- ESGRSAHQVARYRPRAPIIAVTRNHQTARQAHLYRGIFPVVCKDPVQEAWAEDVDLRVNL ---------1111--------------------2222----------------------- AMNVGKARGFFKKGDVVIVLTGWRPGSGFTNTMRVVPVP -----------2222--------1111------------ >PHOSPHOLIPASE A2 VRV-PL-V; SWP:P59071; PDB:2G58A; SLLEFGKMILEETGKLAIPSYSSYGCYCGWGGKGTPKDATDRCCFVHDCCYGNLPDCNPK -------------------------------------3333----------------333 SDRYKYKRVNGAIVCEKGTSCENRICECDKAAAICFRQNLNTYSKKYMLYPDFLCKGELK 3-------------------------------------3333-3333----1111----- C - >6A7 FAB LIGHT CHAIN; SWP:NA; PDB:2G5BA; DIVMSQSPSSLAVSAGERVTMTCKSSQSLFNSKTRR -------------2222------------------- >6A7 FAB LIGHT CHAIN; SWP:NA; PDB:2G5BB; EVNLVESGGGLVQPGGSLRLSCATSGFTFIDNYMSWVRQPPGKALEWLGFIRNKVN ---------------------------3333------------------------- >PREPHENATE DEHYDROGENASE; SWP:O67636; PDB:2G5CA; QNVLIVGVGFGGSFAKSLRRSGFKGKIYGYDINPESISKAVDLGIIDEGTTSIAKVEDFS --------------------------------3333---------------3333-1111 PDFVLSSPVRTFREIAKKLSYILSEDATVTDQGSVKGKLVYDLENILGKRFVGGHPIAGT -------3333------3333--1111-------------------!!!!---------- EKSGVEYSLDNLYEGKKVILTPTKKTDKKRLKLVKRVWEDVGGVVEYSPELHDYVFGVVS ---3333---1111--------1111---------------------3333--------- 
HLPHAVAFALVDTLIHSTPEVDLFKYPGGGFKDFAKSDPIWRDIFLENKENVKAIEGFEK -----------------1111-----11113333--------------3333-------- SLNHLKELIVREAEEELVEYLKEVKIKREI --------1111------------------ >GNA33; SWP:Q9L6H1; PDB:2G5DA; PDRPAGIPDPAGTTVAGGGAVYTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQD ---------2222---iiii-----11112222-----------------33332222-- VCAQAFQTPIHSFQAKRFFERYFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPI ---1111---3333---------------iiii-------------------3333---- YGIPDDFISVPLPLVRIRQTGKNSGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQ ---1111----------------------3333--------------------------- INGGALDGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYM ---1111---------------------------1111---------------------- ADKGYLKLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELDGPVGALGTPLMGEYAG ----------------------3333---------------------3333---2222-- AIDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAG --3333-2222------------------------1111------------------333 KQKTTGYVWQLLPNGMKPEYRP 3--------------------- >COG0147: Anthranilate/par; SWP:Q7D785; PDB:2G5FA; SSSIPMPAGVNPADLAAELAAVVTESVDEDYLLYECDGQWVLAAGVQAMVELDSDELRVI -------------------1111------------iiii--------------------- RDGVTRRQQWSGRPGAALGEAVDRLLLETDQAFGWVAFEFGVHRYGLQQRLAPHTPLARV iiii-------------------------------------3333-3333---------- FSPRTRIMVSEKEIRLFDAGIRHREAIDRLLATGVREVPQSRSVDVSDDPSGFRRRVAVA ---------3333-----------------------------------1111-------- VDEIAAGRYHKVILSRCVEVPFAIDFPLTYRLGRRHNTPVRSFLLQLGGIRALGYSPELV ---1111-------------------------1111----------iiii---------- TAVRADGVVITEPLAGTRALGRGPAIDRLARDDLESNSKEIVEHAISVRSSLEEITDIAE ---1111----------------------------------------------------2 PGSAAVIDFMTVRERGSVQHLGSTIRARLDPSSDRMAALEALFPAVTASGIPKAAGVEAI 222-----------!!!!-----------1111------1111-1111----------33 FRLDECPRGLYSGAVVMLSADGGLDAALTLRAAYQVGGRTWLRAGAGIIEESEPEREFEE 33-------2222-----1111-------------iiii---------1111-------- TCEKLSTLTPYLVAR -------1111---- >Putative iron transport p; SWP:Q0PBW2; PDB:2G5GX; 
PLQENKDFYILDTHTQKKISFEDILELLKADVILLGEKHDEVKHKISQVIFNALEGNLSS --2222-------------3333---3333-------1111-----------------11 QNINFDVALELASTEQNHLDKAFKNKKTIKANELTNALNWDKVWKWKDYEQFVNVVFYSK 11---------3333---------1111-1111--1111-33333333-------1111- SKILGANLSRSEITSIYNGAQPLKGYVSTTNEVKKQLFDIISLSHKLNPEENKELLDKLV ------------------------------------------------------------ EIQQFKDRRADVLVHHVNKVLLLAGSYHTSKKIGIPLHIQDFKSSKKIVVVNLSYGEIDL ------------------------3333------------------------------33 KDSDYVLIYKG 33--------- >Neurabin-2; SWP:O35274; PDB:2G5MB; GHMELFPVELEKDSEGLGISIIGMGAGADMGLEKLGIFVKTVTEGGAAHRDGRIQVNDLL ------------3333--------------------------33333333----1111-- VEVDGTSLVGVTQSFAASVLRNTKGRVRFMIGRERPGEQSEVAQLIQQTLEQE -----------------3333-------------------------------- >RIBOSOME-INACTIVATING PRO; SWP:P85101; PDB:2G5XA; RPSWTVDSDSAKYSSFLDSLREEFGRGTPKVCNIPVTKKANNDKFVLVNLVLPFNRNTIT ------------------------------%%%%---1111------------------- LAFRASDAYLVGFQDRDSKTNKLRANFFSDEYRALSGKYKSIFTDAEVLAPALPCASTYT ---------------------------11113333--3333-1111-------------- DLQNKAGVSREKLSLGVSSLQTAFTAVYGKVFTGKNVAKFALISIQMVAEAARFKYIEDQ --------3333--------------2222------------------------------ VINRGMYSSFEAGARITLLENNWSKISEQYHKSCKLGGGQFTEEEMKLGLLLYN -----------------------------------------3333--------- >ANTI-FLAG M2 FAB LIGHT CH; SWP:NA; PDB:2G60H; EVQLQQSGGELAKPGASVKMSCKSSGYTFTAYAIHWAKQAAGAGLEWIGYIAPAAGAAAY ------------2222-----------3333----------------------------- NAAFKGKATLAADKSSSTAYMAAAALTSEDSAVYYCARAAAAGADYWGQGTTLTVSSAKT 3333--------3333----------3333------------------------------ TPPSVYPLAPSMVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTV ------------------------------------------------------------ TSSPRPSETVTCNVAHPASSTKVDKKIV -3333----------3333--------- >If kappa light chain [Fra; SWP:A2NHM3; PDB:2G60L; DVLMTQAPLTLPVSLGDQASISCRSSQAIVHANGNTYLEWYLQKPGQSPALLIYKVANRF -------------2222-------------1111---------2222------------2 
SGVPDRFSGSGSGTDFTLKISRVEAEDLGVYYCFQGAHAPYTFGGGTKLEIKRADAAPTV 2221111----------------1111--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM ------3333------------------------iiii---------------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKS ------3333----------------------- >protein phosphatase 2A, r; SWP:Q15257; PDB:2G62A; NLYFQSRNFIIPKKEIHTVPDMGKWKRSQAYADYIGFILTLNEGVKGKKLTFEYRVSEAI -----------------3333---------------------1111--1111-------- EKLVALLNTLDRWIDETPPVDQPSRFGNKAYRTWYAKLDEEAENLVATVVPTHLAAAVPE -------------1111----------3333--------------3333-33333333-- VAVYLKESVGNSTRIDYGTGHEAAFAAFLCCLCKIGVLRVDDQIAIVFKVFNRYLEVMRK ----1111---1111-----------------------3333------------------ LQKTYRMEPAGLDDFQFLPFIWGSSQLIDHPYLEPRHFVDEKAVNENHKDYMFLECILFI ----------------3333------2222---3333---------3333---------- TEMKTGPFAEHSNQLWNISAVPSWSKVNQGLIRMYKAECLEKFPVIQHFKFGSLLPIHPV ------3333----------------------------11113333-------------- TS -- >PUTATIVE 6-PYRUVOYL TETRA; SWP:O02058; PDB:2G64A; MFRMPIVTMERVDSFSAAHRLHSEKLSDAENKETFGKCNNSNGHGHNYVWKVKLRGEVDP ----------------------1111-------------1111----------------- TSGMVYDLAKLKKEMSLVLDTVDHRNLDKDVEFFKTTVSTSENVAIYMFEKLKSVMSNPS -----------------3333----3333-3333-----------------3333--333 VLYKVTIEETPKNIFTYKGS 3--------1111------- >RAS-RELATED PROTEIN RAB-2; SWP:Q9ULW5; PDB:2G6BA; DFYDVAFKVMLVGDSGVGKTCLLVRFKDGAFLAGTFISTVGIDFRNKVLDVDGVKVKLQM -------------2222---------------------------------iiii------ WDTAGQAYYRDAHALLLLYDVTNKASFDNIQAWLTEIHEYAQHDVALMLLGNKVDSAHER -------3333--------1111------------------1111--------------- VVKREDGEKLAKEYGLPFMETSAKTGLNVDLAFTAIAKELKR --3333-----1111-------1111---------------- >Rho guanine nucleotide ex; SWP:O55043; PDB:2G6FX; GPLGSVVRAKFNFQQTNEDELSFSKGDVIHVTRVEEGGWWEGTHNGRTGWFPSNYVREI ----------------1111---2222----------------iiii----1111---- >INHIBITOR OF GROWTH PROTE; SWP:Q9ESK4; PDB:2G6QA; EPTYCLCNQVSYGEMIGCDNEQCPIEWFHFSCVSLTYKPKGKWYCPKCRGDN 
----1111-----------1111-----3333------------3333---- >Uncharacterized protein, ; SWP:Q97H28; PDB:2G6TA; YKCLIWGVNDEYTLAYDKLLFEISKGNLSIEALISKDKYAKYIDGKEVIDKTEISNYEFD ----------------------1111----------------%%%%---33331111--- YIIIFNKERYSDIKNEALELGIPERKILNGKFFFISNFDFKRYCKLIENPITIISDDCWG -----------------1111-3333---1111--------------------------- GLVSSYLGFKFNSPFINFYIHNDDYIKFLENDYYLEQELKVEQEGNVYSCTPKGSLGTGD --------------------3333--------3333------------------------ NKIILNFNHQASFAEAKNDWDERKTRINKKNLFVKLIKDDNEKLVKRFDNLPYKNKVCFH ---------------------3333--1111----------------1111--------- PKPKYKSVAFFPRYIWRCINYAARTSNSNLEQYTDSWLEKSCDILKLCGEEDFIREK ----1111--3333--------------3333-----3333-3333----------- >RIBOFLAVIN BIOSYNTHESIS P; SWP:RIBD_ECOLI; PDB:2G6VA; EYYARALKLAQRGRFTTHPNPNVGCVIVKDGEIVGEGYHQRAGEPHAEVHALRAGEKAKG ----------1111--------------------------3333-----------3333- ATAYVTLEPCPCCDALIAAGVARVVASQDPNPQVAGRGLYRLQQAGIDVSHGLSEAEQLN ------------3333-------------------------------------3333--- KGFLKRRTGFPYIQLKLGASLDGRTAESQWITSPQARRDVQLLRAQSHAILTSSATVLAD -------------------------------------------1111------------- DPALTVRWSELDEQTQALYPQQNLRQPIRIVIDSQNRVTPVHRIVQQPGETWFARTQEDS ------3333--3333---1111---------1111--11111111-------------- REWPETVRTLLIPEHKGHLDLVVLQLGKQQINSIWVEAGPTLAGALLQAGLVDELIVYIA ---1111-------------3333--1111----------------1111---------- PKLLGSDARGLCTLPGPQFKFKEIRHVGPDVCLHLVGA --------------------------!!!!-------- >GREEN FLUORESCENT PROTEIN; SWP:Q6WV12; PDB:2G6YA; AMEIECRITGTLNGVEFELVGGGEGTPEQGRMTNKMKSTKGALTFSPYLLSHVMFYHFGT -----------iiii----------3333--------------------1111-3333-- YPSGYENPFLHAINNGGYTNTRIEKYEDGGVLHVSFSYRYEAGRVIGDFKVMGTGFPEDS -2222-33333333-----------1111-----------2222------------1111 VIFTDKIIRSNATVEHLHPMGDNDLDGSFTRTFSLRDGGYYSSVVDSHMHFKSAIHPSIL 1111----------------1111----------1111-----------------3333- QNGGPMFAFRRVEEDHSNTELGIVEYQHAFKTP --------------------------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q16690; PDB:2G6ZA; 
GSHMGPVEILPFLYLGSAYHASKCEFLANLHITALLNVSRRTSEACMTHLHYKWIPVEDS ---------1111---3333---------------------------------------1 HTADISSHFQEAIDFIDCVREKGGKVLVHSEAGISRSPTICMAYLMKTKQFRLKEAFDYI 111-1111---------------------------------------------------- KQRRSMVSPNFGFMGQLLQYESEILPS ---3333-------------------- >PHENYLETHANOLAMINE N-METH; SWP:P11086; PDB:2G72A; APGQAAVASAYQRFEPRAYLRNNYAPPRGDLCNPNGVGPWKLRCLAQTFATGEVSGRTLI ----------1111------------1111--1111------------3333-------- DIGSGPTVYQLLSACSHFEDITMTDFLEVNRQELGRWLQEEPGAFNWSMYSQHACLIEGK ---!!!!3333--1111-------------------11112222---------------- GECWQDKERQLRARVKRVLPIDVHQPQPLGAGSPAPLPADALVSAFCLEAVSPDLASFQR ---------------------1111-1111----------------3333---------- ALDHITTLLRPGGHLLLIGALEESWYLAGEARLTVVPVSEEEVREALVRSGYKVRDLRTY ---3333--2222--------------!!!!----------------------------- IMPAHLQTGVDDVKGVFFAWAQKV --3333------------------ >IGG HEAVY CHAIN; SWP:Q6PJF1; PDB:2G75A; VQLQQSGAEVKKPGSSVKVSCKASGGTFSSYTISWVRQAPGQGLEWMGGITPILGIANYA -----------2222-------3333-!!!!-------2222--------1111-----3 QKFQGRVTITTDESTSTAYMELSSLRSEDTAVYYCARDTVMGGMDVWGQGTTVTVSSAST 333--------1111----------3333---------1111------------------ KGPSVFPLAPSSKSTSGGTSALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLY -------------------------------------%%%%--2222-------3333-- SLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK -----------------------3333----------- >IGL@ protein; SWP:Q8N355; PDB:2G75B; SYELTQPPSVSVAPGKTARITCGGNNIGSKSVHWYQQKPGQAPVLVVYDDSDRPSGIPER --------------------------3333-------2222----------------333 FSGSNSGNTATLTISRVEAGDEADYYCQVWDSSSDYVFGTGTKVTVLGQPKANPTVTLFP 3----!!!!--------1111--------------------------------------- PSSEEFQANKATLVCLISDFYPGAVTVAWKADGSPVKAGVETTKPSKQSNNKYAASSYLS -33331111---------------------iiii-------------1111--------- LTPEQWKSHRSYSCQVTHEGSTVEKTVAPTE -33331111--------iiii---------- >D-3-PHOSPHOGLYCERATE DEHY; SWP:O43175; PDB:2G76A; LRKVLISDSLDPCCRKILQDGGLQVVEKQNLSKEELIAELQDCEGLIVRSATKVTADVIN 
---------------------------------------1111----------------- AAEKLQVVGRAGTGVDNVDLEAATRKGILVMNTPNGNSLSAAELTCGMIMCLARQIPQAT --------------1111----------------1111---------------------- ASMKDGKWERKKFMGTELNGKTLGILGLGRIGREVATRMQSFGMKTIGYDPIISPEVSAS --1111--3333-----2222-------3333-----------------1111-----11 FGVQQLPLEEIWPLCDFITVHTPLLPSTTGLLNDNTFAQCKKGVRVVNCARGGIVDEGAL 11----33333333----------3333----333311112222------2222------ LRALQSGQCAGAALDVFTEEPPRDRALVDHENVISCPHLGASTKEAQSRCGEEIAVQFVD ------------------------3333-1111-----1111------------------ MV -- >ENDONUCLEASE I; SWP:Q2XSK9; PDB:2G7EA; FSHAKNEAVKIYRDHPVSFYCGCEIRWQGKKGIPDLESCGYQVRKNENRASRIEWEHVVP ---------1111--------------!!!!---3333-------3333----------3 AWQFGHQLQCWQQGGRKNCTRTSPEFNQMEADLHNLTPAIGEVNGNRSNFSFSQWNGIDG 333-1111-----------------------3333------------------------- VTYGQCEMQVNFKERTAMPPERARGAIARTYLYMSEQYGLRLSKAQNQLMQAWNNQYPVS --!!!!-----1111----3333------------------------------------- EWECVRDQKIEKVQGNSNRFVREQCPN -----------------33331111-- >RHA04620, PUTATIVE TRANSC; SWP:Q0S9X7; PDB:2G7GA; LDRERIAEAALELVDRDGDFRPDLARHLNVQVSSIYHHAKGRAAVVELVRHRVVREIDGS -------------------------1111-33333333---------------3333-33 AFERLPWDEAFSEWARSYRAAFSRHPTAIRLLATETVRDPGSLSVYHSAAAGLRGAGFPD 33----------------------1111-------------------------3333-11 DHIAVITAAENFLLGAALDAAAPEVIEADSTTTDDALTRALAAAPRGPERAEQAFELGLA 11----------------1111------------------------3333---------- ALLAGFHHLLQECG ----------1111 >Methylated-DNA--protein-c; SWP:Q58924; PDB:2G7HA; MIIQIEEYFIGMIFKGNQLVRNTIPLRREEIFNFMDGEVVSNPEDEHLKVAEIILKLYFA ---------------------------11111111------------------------- EIDDKKVRELISYKLEVPEFTKKVLDIVKDIEFGKTLTYGDIAKKLNTSPRAVGMALKRN -------1111---------3333-----------------3333--------------- PLPLIIPCHRVVAKNSLGGYSYGLDKKKFILERERLNMVSFKFNKVY ------3333--------------3333----3333----------- >COMPLEMENT FACTOR H; SWP:P08603; PDB:2G7IA; GKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLEGNKRITCRNGQWSEPPKCLHP 
--------2222----------2222------1111----------iiii---------- CVISREIMENYNIALRWTAKQKLYSRTGESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYP ----------------1111-----2222------2222--2222-------iiii---- TCAK ---- >PUTATIVE CYTOPLASMIC PROT; SWP:Q57KX2; PDB:2G7JA; MYLRPDEVARVLEKAGFTVDVVTNKTYGYRRGENYVYVNREARMGRTALIIHPRLKDRSS ---3333---------------1111----!!!!----1111------------3333-- SLADPASDIKTCDHYQNFPLYLGGETHEHYGIPHGFSSRIALERYLNGLFGD -------------------------------------3333------1111- >TETR-FAMILY TRANSCRIPTION; SWP:Q93JG8; PDB:2G7LA; KPALSRRWIVDTAVALRAEGLEKVTRRLAQELDTGPASLYVYVANTAELHAAVLDALLGE --------------------3333-----1111-33333333-------------1111- VDLTGAEDWREQLRAVLTSYTLVLFAHPQLARSALVARPSGENYLRLVERVLELLARSGA ------------------------------------------------------------ PGAQVAWGVDKLLQDATATAAEQATSATVRALRDADEATHPAIASHPLLVAGSAHDRLRW -------------------1111------------3333-------3333---------- SFDVLVNGITRTPVPGPA ------------------ >PROTEIN TRAM; SWP:P10026; PDB:2G7OA; AFNQTEFNKLLLECVVKTQSSVAKILGIESLSPHVSGNSKFEYANMVEDIREKVSSEMER -------------------------------1111--3333------------------- FFPKNDDE -------- >Mucosa-associated lymphoi; SWP:Q9UDY8; PDB:2G7RA; TLNRLREPLLRRLSELLDQAGWRRLAELAGLSCLDLEQCSLKVLEPEGSPSLCLLKLMGE 1111-3333-----------3333-------3333---------1111------------ KGCTVTELSDFLQAMEHTEVLQLLSP ---3333--------------1111- >TRANSCRIPTIONAL REGULATOR; SWP:Q8UIL4; PDB:2G7SA; NPQSKADDILQCARTLIIRGGYNSFSYADISQVVGIRNASIHHHFPSKSDLVCKLVSQYR ---------------3333-1111------------3333-3333--------------- QEAEAGIAELEKNISDPLEQLRAYIGYWEGCIADATHPFCVCALLASEIPVLPETVVLEV -------------------------------------------33333333--------- RAHFRSLSDWLTAVLERGIAQGRLVLTGTARANAEIFATVHGALSARAHGDAATFGAITR --------------------------------------------3333--1111------ PLERITA ------- >TRANSCRIPTIONAL REGULATOR; SWP:O33539; PDB:2G7UA; DRDYIQSIERGFAVLLAFDAQRPNPTLAELATEAGLSRPAVRRILLTLQKLGYVAGSGGR ----3333------33333333----------------------------------iiii WSLTPRVLSIGQHYSESHALIEAAPRLLEVAEKTQESASLGVLDGADVVYAARVPVRRIS 
----3333--1111-----------------------------!!!!------------- INVSVGTRVPAYATSGRALLAWAPADVVERVVAESTFQKLGPETIGTAAELERELAKVRE ---2222--3333--3333----3333----1111------------------------- QGFALTSEELEKGLISLAAPVHDAGGTVVGVVACSTSSARNTPAQFREQAVPCVLAAAAA ----------2222--------1111----------3333-------------------- LSADGFA ------- >CONSERVED HYPOTHETICAL PR; SWP:P67372; PDB:2G7ZA; AGTIKIVTDSSITIEPELIKALDITVVPLSVIDSKLYSDNDLKEEGHFLSLKASKSLPKT -------------------------------------3333--22223333--------- SQPPVGLFAETYENLVKKGVTDIVAIHLSPALSGTIEASRQGAEIAEAPVTVLDSGFTDQ ---3333--------1111---------3333-------------------------!!! AKFQVVEAAKAKAGASLNEILAAVQAIKSKTELYIGVSTLENLVKGGRIGRVTGLNVKVV !---------3333-------------1111------------1111-1111-------- ALKNDELKTLVKGRGNKTFTKWLDSYLAKNSHRPIAEIAISYAGEASLALTLKERIAAYY --!!!!-----------------------3333----------------------1111- NHSISVLETGSIIQTHTGEGAFAVVRYE ---------------------------- >PROTEIN UTR4; SWP:P32626; PDB:2G80A; DNYSTYLLDIEGTVCPISFVKETLFPYFTNKVPQLVQQDTRDSPVSNILSQFHIDNKEQL ---------2222--3333--------------------1111------3333------- QAHILELVAKDVKDPILKQLQGYVWAHGYESGQIKAPVYADAIDFIKRKKRVFIYSSGSV ------------------------------------------------------------ KAQKLLFGYVQDPNAPAHDSLDLNSYIDGYFDINTSGKKTETQSYANILRDIGAKASEVL -----------3333-------3333-----3333--11113333---------3333-- FLSDNPLELDAAAGVGIATGLASRPGNAPVPDGQKYQVYKNFETL -----------3333--------2222--------------1111 >CATIONIC TRYPSIN; SWP:NA; PDB:2G81I; PCCDRCECTKSIPPQCRCSDVRLNSCHSACKSCACTFSIPAQCFCGDINDFCYKPC --------------------------1111-------------------------- >GLYCERALDEHYDE-3-PHOSPHAT; SWP:NA; PDB:2G82A; MKVGINGFGRIGRQVFRILHSRGVEVALINDLTDNKTLAHLLKYDSIYHRFPGEVAYDDQ --------3333-------1111----------------------------------111 YLYVDGKAIRATAVKDPKEIPWAEAGVGVVIESTGVFTDADKAKAHLEGGAKKVIITAPA 1--iiii--------3333-3333--------------3333--3333------------ KGEDITIVMGVNHEAYDPSRHHIISNASTTNSLAPVMKVLEEAFGVEKALMTTVHSYTNQ -------22223333-3333--------3333---------------------------- 
RLLDLPHKDLRARAAAINIIPTTGAAKATALVLPSLKGRFDGMALVPTATGSISDITALL -------------1111------3333--11111111----------------------- KREVTAEEVNAALKAAAEGPLKGILAYTEDEIVLQDIVMDPHSSIVDAKLTKALGNMVKV ------------------1111----------33332222------3333---!!!!--- FAWYDNEWGYANRVADLVELVLRKG ------------------------- >Cytidine and deoxycytidyl; SWP:Q82Y41; PDB:2G84A; MNDALHIGLPPFLVQANNEPRVLAAPEARMGYVLELVRANIAADGGPFAAAVFERDSGLL ------------------------------------------------------------ IAAGTNRVVPGRCSAAHAEILALSLAQAKLDTHDLSADGLPACELVTSAEPCVMCFGAVI --------11111111-----------------1111----------------------- WSGVRSLVCAARSDDVEAIGFDEGPRPENWMGGLEARGITVTTGLLRDAACALLREYNAC -----------33333333---------------1111---------------------- NGVIYNARC -----3333 >CHORISMATE SYNTHASE; SWP:P63611; PDB:2G85A; MLRWITAGESHGRALVAVVEGMVAGVHVTSADIADQLARRRLGYGFERDAVTVLSGIRHG --------1111--------------------------1111---------------%%% STLGGPIAIEIGNTEWPKWETVMAADPVDPAELADVARNAPLTRPRPGHADYAGMLKYGF %-------------3333----------333311111111-----2222----------- DDARPVLERASARETAARVAAGTVARAFLRQALGVEVLSHVISIGASAPYEGPPPRAEDL ----------3333-----------------------------!!!!-------3333-3 PAIDASPVRAYDKAAEADMIAQIEAAKKDGDTLGGVVEAVALGLPVGLGSFTSGDHRLDS 333--1111-------------------------------------------3333---- QLAAAVMGIQAIKGVEIGDGFQTARRRGSRAHDEMYPGPDGVVRSTNRAGGLEGGMTNGQ --------2222----!!!!3333--3333----------------3333--iiii---- PLRVRAAMKPISTVPRALATVDLATGDEAVAIHQRSDVCAVPAAGVVVETMVALVLARAA ---------------------------------------3333----------------- LEKFGGDSLAETQRNIAAYQRSVADR -------------------------- >Outer surface protein A; SWP:Q45040; PDB:2G8CO; NSVSVDLPGSMKVLVSKSSNADGKYDLIATVDALELSGTSDKNNGSGVLEGVKADASKVK -------------------1111-----------------------------1111---- LTISDDLGQTTLEVFKSDGSTLVSKKVTSKDKSSTEEKFNEKGEVSEKIITRADGTRLEY ---1111--------1111---------1111-------1111--------1111----- TGIKSDGSGKAKEVLKGYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGAVSVELNDTDSS ---1111-------2222------3333------!!!!------1111-----------3 
AATKKTAAWNSGTSTLTITVNSKKTKDLVFTSSNTITVQQYDSNGTSLEGSAVEITKLDE 333----------------%%%%-------1111-------3333-----------3333 IKNALK --1111 >CALPAIN-1 CATALYTIC SUBUN; SWP:P97571; PDB:2G8JA; NAIKYLGQDYENLRARCLQNGVLFQDDAFPPVSHSLGFKELGPNSSKTYGIKWKRPTELL ---2222------------------1111--3333------11113333-----1111-- SNPQFIVDGATRTDICQGALGDCWLLAAIASLTLNETILHRVVPYGQSFQEGYAGIFHFQ -----------1111----------------------------------2222------- LWQFGEWVDVVVDDLLPTKDGKLVFVHSAQGNEFWSALLEKAYAKVNGSYEALSGGCTSE --%%%%------------iiii-------1111-----------11113333----3333 AFEDFTGGVTEWYDLQKAPSDLYQIILKALERGSLLGCSINISDIRDLEAITFKNLVRGH -------------1111-1111-------1111------------------1111----- AYSVTDAKQVTYQGQRVNLIRMRNPWGEVEWKGPWSDNSYEWNKVDPYEREQLRVKMEDG -----------iiii--------3333-----2222--3333------------------ EFWMSFRDFIREFTKLEICNLT ---------------------- >287AA LONG HYPOTHETICAL P; SWP:O59272; PDB:2G8LA; HKVQYECLTCANQCQRIVEATQDDIRRRAILAAKLLAKEYNENAIPAIAGSLIFLELYKF ---1111---------------------------------11113333------------ LGNDDPFIEYKLKSEEARKVADIIKRKLKLDFELAVKLAIIGNVIDFSVGFSPEDLEEEV ----1111--------------------------------3333-1111----------- EKLKDKLYIDDSKELFEEVKRAENILYITDNVGEHYFDAILIEKIREISNAEVYIAGKEG --------------------------------3333--------3333------------ PIINDATVEDLKRAGLEKLGKVISTGTRIVGVPLKLVSREFEAFNKADVIIAKGQGNFET -!!!!---------3333--------------1111-3333--3333------------- LSEINDSRIFFLLKAKCPAVARELKVPKGALVCRNK 3333----------------------2222------ >THYMIDYLATE SYNTHASE; SWP:P0A884; PDB:2G8MA; KQYLELMQKVLDEGTQKNDRTGTGTLSIFGHQMRFNLQDGFPLVTTKRCHLRSIIHELLW ------------------1111-------------1111----------3333------- FLQGDTNIAYLHENNVTIWDEWADENGDLGPVYGKQWRAWPTPDGRHIDQITTVLNQLKN 1111-------1111-1111---1111--------------1111--------------- DPDSRRIIVSAWNVGELDKMALAPCHAFFQFYVADGKLSCQLYQRSCDVFLGLPFNIASY 1111--------11111111-------------iiii----------------------- ALLVHMMAQQCDLEVGDFVWTGGDTHLWSNHMDQTHLQLSREPRPLPKLIIKRKPESIFD --------1111---------------1111------1111---------------1111 
YRFEDFEIEGYDPHPGIKAPVAI -1111------------------ >GLUCOSE/SORBOSONE DEHYDRO; SWP:P75804; PDB:2G8SA; ATVNVEVLQDKLDHPWALAFLPDNHGLITLRGGELRHWQAGKGLSAPLSGVPDVWAHGQG ---------------------%%%%----3333-----2222--------------!!!! GLLDVVLAPDFAQSRRIWLSYSEVGDDGKAGTAVGYGRLSDDLSKVTDFRTVFRQPKLST -------1111----------------------------1111----------------- GNHFGGRLVFDGKGYLFIALGENNQRPTAQDLDKLQGKLVRLTDQGEIPDDNPFIKESGV ---------------------%%%%-----1111--------------1111-1111--- RAEIWSYGIRNPQGANPWSNALWLNEHGPRGGDEINIPQKGKNYGWPLATWGINYSGFKI 3333-----------1111-------------------2222-----------3333--1 PEAKGEIVAGTEQPVFYWKDSPAVSGAFYNSDKFPQWQQKLFIGALKDKDVIVSVNGDKV 111----2222----------------------3333--------1111------!!!!- TEDGRILTDRGQRIRDVRTGPDGYLYVLTDESSGELLKVSPR ------3333---------1111------------------- >MALATE/L-LACTATE DEHYDROG; SWP:P30178; PDB:2G8YA; SGHRFDAQTLHSFIQAVFRQGSEEQEAKLVADHLIAANLAGHDSHGIGFPSYVRSWSQGH -----------------------------------------3333----------1111- LQINHHAKTVKEAGAAVTLDGDRAFGQVAAHEAALGIEKAHQHGIAAVALHNSHHIGRIG ------------!!!!----%%%%3333-----------------------------333 YWAEQCAAAGFVSIHFVSVVGIPVAPFHGRDSRFGTNPFCVVFPRKDNFPLLLDYATSAI 3-----1111---------------2222------------------------------- AFGKTRVAWHKGVPVPPGCLIDVNGVPTTNPAVQESPLGSLLTFAEHKGYALAACEILGG 3333-----------------1111----3333-----------!!!!------------ ALSGGKTTHQETLQTSPDAILNCTTIIINPELFGAPDCNAQTEAFAEWVKASPHDDDKPI --------3333---1111---------3333--1111----------1111--1111-- LLPGEWEVNTRRERQKQGIPLDAGSWQAICDAARQIGPEETLQAFCQQLAS -2222----------------------------1111-------------- >NICOTINAMIDE PHOSPHORIBOS; SWP:Q80Z29; PDB:2G95A; EFNILLATDSYKVTHYKQYPPNTSKVYSYFECREKVKYEETVFYGLQYILNKYLKGKVVT --1111--3333-1111------------------------------------------- KEKIQEAKEVYREHFQDDVFNERGWNYILEKYDGHLPIEVKAVPEGSVIPRGNVLFTVEN ------------1111---------------------------2222------------- TDPECYWLTNWIETILVQSWYPITVATNSREQKKILAKYLLETSGNLDGLEYKLHDFGYR -33333333-------------------------------------2222-------333 
GVSSQETAGIGASAHLVNFKGTDTVAGIALIKKYYGTKDPVPGYSVPAAEHSTITAWGKD 3-------------3333-------------------------------3333----111 HEKDAFEHIVTQFSSVPVSVVSDSYDIYNACEKIWGEDLRHLIVSRSTEAPLIIRPDSGN 1-------------------------------------333311111111---------3 PLDTVLKVLDILGKKFPVSENSKGYKLLPPYLRVIQGDGVDINTLQEIVEGMKQKKWSIE 333-----------------1111----1111--------3333-------------333 NVSFGSGGALLQKLTRDLLNCSFKCSYVVTNGLGVNVSKKGRLSLHRTPAGTFVTLEEGK 3-----3333----3333-----------iiii--------------1111-----iiii GDLEEYGHDLLHTVFKNGKVTKSYSFDEVRKNAQLN ---------------iiii-----------1111-- >SUCCINYLGLUTAMATE DESUCCI; SWP:Q9KSL4; PDB:2G9DA; KSLFRQSFLFDSLDLDHPVAQTVRTEQGVTLKLHQRGVLEVIPAQTDAATKNVISCGIHG ------3333--------------1111------2222---------------------- DETAPELLDKWIDDIVSGFQPVAERCLFIAHPQATVRHVRFIEQNLNRLFDDKPHTPSTE ------------------------------3333-----------------------333 LAIADNLKVLLRQFFANTDEHSRWHLDLHCAIRGSKHYSFAVSPKARHPVRSRSLQFIEQ 3-----------------3333-----------------------------3333----- AHIEAVLSNAPSSTFSWYSAEHYAAQALTLELGQVARLGENLLDRLLAFDLARDLISRHK ---------------3333----------------------3333--------------- PEHLPRKSVYRVSRTIVRLHDDFDFRFSDDVENFTAFHGEVFGHDGDKPLAKNEGEAIVF ------------------------------------------------------------ PNRKVAIGQRAALVCKVNTRYEDDQLVYD ---------------------iiii---- >Enterotoxin type I [Fragm; SWP:Q52T95; PDB:2G9HD; IGVGNLRNFYTKHDYIDLKGLIDKNLPSANQLEFSTGINDLISESNNWDEISKFKGKKLD ----------------------------------------------3333-1111----- IFGIDYNGPCKSKYMYGGATLSGQYLNSARKIPINLWVNGKHKTISTDKISTNKKLVTAQ -------2222--------------------------iiii-----3333---------- EIDVKLRRYLQEEYNIYGHNSTGKGKEYGYKSKFYSGFNKGKVLFHLNDEKSFSYDLFYT --------------1111------1111---1111------------------------! 
GDGVPVSFLKIYEDNKIIESEKFHLDVEISYVD !!!3333-3333------1111----------- >F420-0:GAMMA-GLUTAMYL LIG; SWP:O28028; PDB:2G9IA; RVEVFPVEGLPLIKEGDDLAELISSRVRFEDGDVLVVCSTVISKAEGRIRRLEEFNPSER -------------2222----1111----2222-----------------3333------ AKEIAARIGKPAEFVQAVLEESEEVLLDFPFLLVKAKFGNVCVNAGIDASNVEEGSLLLP ----1111----------1111-------------3333---%%%%--11111111---- PLDPDGSAEKLRRRILELTGKRVGVIITDTNGRCFRRGVVGFAIGISGVKAKDWVTVECV --------------------------------2222----------------------33 ADEIAAFANLLGGIPAVVVRGLNVAGEGSEEIYRSEEEDVIRRCLKRCL 33-----3333-----------------------3333-------1111 >PHYCOERYTHRIN; SWP:P84861; PDB:2G9MA; QRAAARLEAAEKLGSNHEAVVKEAGDACFSKYGYNKNPGEAGENQEKINKCYRDIDHYMR -3333-----------1111----3333-----3333-----------3333-------- LINYTLVVGGTGPLDEWGIAGAREVYRTLNLPSAAYIAAFVFTRDRLCAPRDMSAQAGVE ---1111----3333------1111-1111-3333-------------------3333-- FCTALDYLINSLS ------------- >EUKARYOTIC INITIATION FAC; SWP:P60842; PDB:2G9NA; GVIESNWNEIVDSFDDMNLSESLLRGIYAYGFEPSAIQQRAILPCIGYDVIAQAQSGTGT ------------3333--------------------------3333-------------- ATFAISILQQIELDLATQALVLAPTRELAQQIQVVMALGDYMGASCHACIGNVRAEVQLQ -------11111111-----------------------1111-----------3333--- MEAPHIIVGTPGRVFDMLNRRYLSPYIMFVLDEADEMLSRGFDQIYDIFQLNSNTQVVLL -----------------1111----------------1111----------1111----- SATMPSDVLEVTFMRDPIRILV ---------------------- >COPPER-TRANSPORTING ATPAS; SWP:ATP7A_HUMAN; PDB:2G9OA; NDSTATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVVYNASSVTPESLRKA ------------!!!!-----------1111-----3333-------------------- IEAVSPGLYRVSITSEV ----2222--------- >CONSERVED HYPOTHETICAL PR; SWP:A1QSJ9; PDB:2G9WA; KLTRLGDLERAVDHLWSRTEPQTVRQVHEALSARRDLAYTTVAVLQRLAKKNLVLQIRAH 3333---------3333--------------------3333-------1111-------- RYAPVHGRDELVAGLVDALAQAEDSGSRQAALVHFVERVGADEADALRRALAELEA -----------------3333----------------------------------- >THIAMINE PYROPHOSPHOKINAS; SWP:Q59N99; PDB:2G9ZA; SELIEQVIEQPDSLIISPPSYNHIQPFVYLHNVLLILNQKITIDLISLWKKCEIIVCADG 
------------------------1111--------------------1111-------- GANSLYEYFNLQRSDYIPDYIVGDFDSISPDVKTYYESHGSKIIRQSSQYYNDFTKSIHC -----3333--3333---------------------1111-------------------- IQLHYQLNHTKENWFESIDEVDGLAKLWNGLNNSSDVVVDIDITIYVLNAIGGRFDQTVQ ------3333--3333---------------1111------------------3333--- SINQLYIMNEDYPKVTVFFITTNDIIFLLKKGVNYISYKNRLMFHKDNGSSPTPTCGLLP -----------1111-----1111---------------3333----------------- LSNKTPIILNSYGLKYDMRNWKTEMLGQVSSSNRISGETGFIVECSDDIVMNIEIDV -----------------------2222------------------------------ >PROTEIN OF UNKNOWN FUNCTI; SWP:Q3MFD8; PDB:2GA1A; GNKKTQLLEVIAALPEELVDQALNYVQLQNPIQITPGVCGGQARIRNTRIPVWTLVAYRQ ----------11113333----------------1111iiii--2222--3333----11 QGAPDKELLANYPGLTAEDLSAAWHYYEQNPEQIDREIAQD 11-3333----3333-------------------------- >FRATAXIN HOMOLOG, MITOCHO; SWP:Q07540; PDB:2GA5A; MESSTDGQVVPQEVLNLPLEKYHEEADDYLDHLLDSLEELSEAHPDCIPDVELSHGVMTL ------------1111-------------------------------------2222--- EIPAFGTYVINKQPPNKQIWLASPLSGPNRFDLLNGEWVSLRNGTKLTDILTEEVEKAIS ---------------------------------iiii---------3333---------- KSQ --- >HYPOTHETICAL 39.9 KDA PRO; SWP:P43591; PDB:2GA8A; VDTHKLADDVLQLLDNRIEDNYRVCVILVGSPGSGKSTIAEELQIINEKYHTFLSEHPNV ----------------1111----------2222------------------3333---- IEVNDRLKPMVNLVDSLKTLQPNKVAEMIENQGLFKDHVEDVNFQPVKYSAEETAVVARG ----1111-----1111-----------------1111--1111-------------111 GTANAIRIAADSINIAQIVPMDGFHLSRRCLDLFKDPQTAHKRRGSPSTFDSNNFLQLCK 11111--------------3333---33331111-33333333--1111----------- ILAKTSLKVSTSSVFEKLSKTFSQTIPDIFVPGFNHALKDPTPDQYCISKFTRIVILEGL -----------------1111-1111----------------------1111-------- YLLYDQENWKKIYKTLADTGALLVYKIDIDYEATEERVAKRHLQSGLVTTIAEGREKFRS ------3333-----3333----------------------------------------- NDLLNGRDIDNHLIKVDNIVHIRNDH 3333-----1111------------- >Poly(A) polymerase cataly; SWP:P23371; PDB:2GA9D; NITLKIIETYLGRVPSVNEYHMLKSQARNIQKITVFNKDIFVSLVKKNKKRFFSDVNTSA ----------------------3333--------------------------1111---- 
SEIKDRILSYFSKQTQTYNIGKLFTIIELQSVLVTTYTDILGVLTINVTSMEELARDMLN -----------3333-------------------------------3333---------1 SMNVAVVSSLVKNVNKLMEEYLRRHNKSCICYGSYSLYLINPNIRYGDIDILQTNSRTFL 111----------------3333-----------------3333---------------- IDLAFLIKFITGNNIILSKIPYLRNYMVIKDENDNHIIDSFNIRQDTMNVVPKIFIDNIY -------------------1111-------1111-------------1111----%%%%- IVDPTFQLLNMIKMFSQIDRLEDLSKDPEKFNARMATMLEYVRYTHGIVFDGKRNNMPMK --------------------------3333------------------------------ CIIDENNRIVTVTTKDYFSFKKCLVYLDENVLSSDILDLNADTSCDFESVTNSVYLIHDN ----1111-----3333-----------------------------!!!!-------%%% IMYTYFSNTILLSDKGKVHEISARGLCAHILLYQMLTSGEYKQCLSDLLNSMMNRDKIPI %------------2222-----------------1111-----------1111------- YSHTERDKKPGRHGFINIEKDIIVF -----------------1111---- >HETEROTETRAMERIC SARCOSIN; SWP:Q3ZDQ8; PDB:2GAGA; MSKPQRLSAEQSSRARINREEALSLTVDGAKLSAFRGDTVASALLANGVRRAGNSLYLDR -------33331111--1111-----iiii------------------------------ PRGIFAAGVEEPNALVTVSARHEQDIDESMLPATTVPVTEDLNATLLSGLGVLDPTKDPA -------1111----------1111------3333---2222------------------ YYDHVHVHTDVLVVGAGPAGLAAAREASRSGARVMLLDERAEAGGTLLDTAGEQIDGMDS --------------------------3333-------------!!!!-------iiii33 SAWIEQVTSELAEAEETTHLQRTTVFGSYDANYLIAAQRRTVHLDGPSGPGVSRERIWHI 33-----------1111------------%%%%-------1111----2222-------- RAKQVVLATGAHERPIVFENNDRPGIMLAGAVRSYLNRYGVRAGARIAVATTNDSAYELV -----------------------------------------------------3333--- RELAATGGVVAVIDARSSISAAAAQAVADGVQVISGSVVVDTEADENGELSAIVVAELDE ---1111-------------------1111--------------1111----------11 ARELGGTQRFEADVLAVAGGFNPVVHLHSQRQGKLDWDTTIHAFVPADAVANQHLAGAMT 11----------------------33331111-----------------2222----111 GRLDTASALSTGAATGAAAATAAGFATVARTPQALETALGETRPVWLVPSVSGDDAVNYK 1-------------------1111-------------------------1111-3333-- FHFVDLQRDQTVADVLRATGAGMKSVEHIKRYTSISTANDQGKTSGVAAIGVIAAVLGIE -----1111---------1111-----------222211111111--------------- NPAAIGTTTFRAPYTPVAFAALAGRNRGDQLDPARITAMHSWHLSHGAEFEDVGQWKRPW 
----------------------!!!!!!!!------1111---1111-----!!!!---- YYPQAGETMDQAVYRESKAVRDSVGMLDATTLGKIEIRGKDAAEFLNRIYTNGYTKLKVG ---2222---------------------1111------1111-----------1111222 MGRYGVMCKADGMIFDDGVTLRLAEDRFLLHTTTGGAADVLDWLEEWLQTEWPDLDVTCT 2-------1111-----------1111-----3333---------------3333----- SVTEQLATVAVVGPRSRDVIAKLASTVDVSNEGFKFMAFKDVVLDSGIEARISRISFSGE -1111-------1111-------1111--3333-2222-----3333------------- LAFEIAVPAWHGLRVWEDVYAAGEEFNITPYGTETMHVLRAEKGFIIVGQDTDGTVTPQD -------1111-----------3333--------------1111--2222------3333 AGMEWVVSKLKDFIGNRSYSRADNAREDRKQLVSVLPVDKSLRLPEGAALVASDALASEG --3333------2222----3333--------------1111--2222---1111--iii ITPMEGWVTSSYDSPNLGRTFGLALIKNGRNRIGEVLKTPVGDQLVDVVVSETVLYDPEG i-------------1111---------33332222---------------------1111 SRRDG -1111 >Heterotetrameric sarcosin; SWP:Q3ZDR0; PDB:2GAGB; DLLPEHPEFLWANPEPKKSYDAIIVGGGGHGLATAYFLAKNHGITNVAVLEKGWLAGGNM ---------------------------------------------------------333 ARNTTIIRSNYLWDESAGIYEKSLKLWEQLPEDLEYDFLFSQRGVLNLAHTLGDVRESVR 3------------------------------1111------------------------- RVEANKLNGVDAEWLDPSQVKEACPIINTSDDIRYPVMGATWQPRAGIAKHDHVAWAFAR ----1111---------------3333---------------1111-------------- KANEMGVDIIQNCEVTGFIKDGEKVTGVKTTRGTIHAGKVALAGAGHSSVLAEMAGFELP --1111-------------------------------------!!!!------------- IQSHPLQALVSELFEPVHPTVVMSNHIHVYVSQAHKGELVMGAGIDSYNGYGQRGAFHVI ---------------------------------3333------------------3333- QEQMAAAVELFPIFARAHVLRTWGGIVDTTMDASPIISKTPIQNLYVNCGWGTGGFKGTP ----------3333---------------1111--------2222-----!!!!3333-- GAGFTLAHTIANDEPHELNKPFSLERFETGHLIDEHGAAAVAH ---------------3333---3333---------3333---- >Heterotetrameric sarcosin; SWP:Q3ZDQ7; PDB:2GAGC; TPLRHSPAEHLDTVMDAASVAGRVELREIAFTTQISLRCAPGTQAHAALAAATGAGLPAK -----1111----------2222----------------2222-------3333-----2 VGEVAGEAQGTAVLWLAPDEFLATSAENTELGGVLSAALGDAPGQVVDLSANRSVLELTG 222---3333-----------------1111-------!!!!------1111-------1 
PDAPLVLRKSCPADLHPRAFAVNQAIVTSVANIPVLLWRTGEQAWRIMPRASFTEHTVHW 1113333--------3333----------%%%%----------------3333------- LVDAMSEFAS ----3333-- >Heterotetrameric sarcosin; SWP:Q3ZDQ9; PDB:2GAGD; MMLIDCPNCGPRNENEFKYGGEAHVAYPADPHALSDKQWSRYLFYRQNKKGIFAERWVHA ------------1111-------------1111-------------------------11 AGCRKWFNALRDTVTYEFKAIYPAGAPRPEI 11--------------------1111----- >DNA TOPOISOMERASE I; SWP:P46799; PDB:2GAIA; KYIVVESPAKAKTIKSILGNEYEVFASMGHIIDLPKSKFGVDLEKDFEPEFAVIKGKEKV ------------------3333-------------------1111--------2222--- VEKLKDLAKKGELLIASDMDREGEAIAWHIARVTNTLGRKNRIVFSEITPRVIREAVKNP -----------------------------------2222---------------3333-- REIDMKKVRAQLARRILDRIVGYSLSPVLWRNFKSNLSAGRVQSATLKLVCDREREILRF ------------------------------------------------------------ VPKKYHRITVNFDGLTAEIDVKEKKFFDAETLKEIQSIDELVVEEKKVSVKKFAPPEPFK -----------iiii-------------------1111---------------------- TSTLQQEAYSKLGFSVSKTMMIAQQLYEGVETKDGHIAFITYMRTDSTRVSDYAKEEARN -------------------------------1111------------------------- LITEVFGEEYVGAHEAIRPTNVFMTPEEAGKYLNSDQKKLYELIWKRFLASQMKPSQYEE ------3333----------111133331111-----------------1111------- TRFVLRTKDGKYRFKGTVLKKIFDGYEKVWKTERNTGEFPFEEGESVKPVVVKIEEQETK ------1111-------------!!!!--------------2222--------------- PKPRYTEGSLVKEMERLGIGRPSTYASTIKLLLNRGYIKKIRGYLYPTIVGSVVMDYLEK --------------------1111----------------iiii---------------- KYSDVVSVSFTAEMEKDLDEVEQGKKTDKIVLREFYESFSSVFDRNDRIVVDFPTNQKCS --3333---------------------------------11111111-----------11 CGKEMRLSFGKYGFYLKCECGKTRSVKNDEIAVIDDGKIFL 11------------------------1111----iiii--- >BETA-1,6-N-ACETYLGLUCOSAM; SWP:Q09324; PDB:2GAKA; RHLELNVNCTKILQGDPEEIQKVKLEILTVQFKKRPRWTPHDYINMTRDCASFIRTRKYI -----------1111---------------3333----3333--11113333-------- VEPLTKEEVGFPIAYSIVVHHKIEMLDRLLRAIYMPQNFYCIHVDRKAEESFLAAVQGIA ----3333--------------------------1111------11113333-------- SCFDNVFVASQLESVVYASWTRVKADLNCMKDLYRMNANWKYLINLCGMDFPIKTNLEIV 
--1111---------2222---------------------------1111---------- RKLKCSTGENNLETEKMPPNKEERWKKRYAVVDGKLTNTGIVKAPPPLKTPLFSGSAYFV -----iiii--------11113333------iiii------------------------- VTREYVGYVLENENIQKLMEWAQDTYSPDEFLWATIQRIPEVPGSFPSSNKYDLSDMNAI --------------------------3333--------2222------3333--3333-- ARFVKWQYFEGDVSNGAPYPPCSGVHVRSVCVFGAGDLSWMLRQHHLFANKFDMDVDPFA -----1111--1111-----------iiii---3333-3333----------1111---- IQCLDEHLRRKALE -------------- >182AA LONG HYPOTHETICAL P; SWP:O58467; PDB:2GANA; EGVKKIKNPSTVKDELLELFRIYRSTNGKYPALEWVKRKPNPNDFNGFREVYEPFLKFRL -------------------------iiii---1111----1111---------------- SQEFDELYTYQKDNRIIGTIALVYKRIKEKGIWWVPEELNEKVGLIEFFVVDPEFQGKGI -----------%%%%----------3333--111133331111--------1111----- GSTLLEFAVKRLRSLGKDPYVVTFPNLEAYSYYYKKGFREIRYKEFVILKFNHKKFQLE ------------1111-------1111-------------------------3333--- >GTP-BINDING PROTEIN SAR1A; SWP:Q9NR31; PDB:2GAOA; SSVLQFLGLYKKSGKLVFLGLDNAGKTTLLHMLTSEELTIAGMTFTTFDLRVWKNYLPAI ----11112222--------2222-----3333------!!!!-------3333-3333- NGIVFLVDCADHSRLVESKVELNALMTDETISNVPILILGNKIDRTDAISEEKLREIFGL -------11111111------------3333----------3333--------------2 YGQTTGKGNVTLKELNARPMEVFMCSVLKRQGYGEGFRWLSQYID 222-------3333------------1111----------1111- >ISOFLAVONE REDUCTASE; SWP:P52575; PDB:2GASA; TENKILILGPTGAIGRHIVWASIKAGNPTYALVRKTITAANPETKEELIDNYQSLGVILL --------1111-3333-----------------------3333--------1111---- EGDINDHETLVKAIKQVDIVICAAGRLLIEDQVKIIKAIKEAGNVKKFFPSEFGLDVDRH --1111-------1111----------3333------------------------1111- DAVEPVRQVFEEKASIRRVIEAEGVPYTYLCCHAFTGYFLRNLAQLDATDPPRDKVVILG ----3333--------------------------3333-1111----------------- DGNVKGAYVTEADVGTFTIRAANDPNTLNKAVHIRLPKNYLTQNEVIALWEKKIGKTLEK -----------------------3333--------1111--------------------- TYVSEEQVLKDIQESSFPHNYLLALYHSQQIKGDAVYEIDPAKDIEASEAYPDVTYTTAD --------------------------------1111---3333--3333-3333------ EYLNQFV -3333-- >GATA-1; SWP:P17678; PDB:2GATA; 
KRAGTVCSNCQTSTTTLWRRSPMGDPVCNACGLYYKLHQVNRPLTMRKDGIQTRNRKVSS -2222---------------1111-------------------1111------------- KGKKRR ------ >TRANSCRIPTIONAL REGULATOR; SWP:Q7MAW7; PDB:2GAUA; LGHLLRDVWSLLNEEERELLDKEIQPFPCKKASTVFSEGDIPNNLFYLYEGKIKILRRFH 3333-3333--------------------2222---2222-------------------- ISRIVKPGQFFGMRPYFAEETCSSTAIAVENSKVLAIPVEAIEALLKGNTSFCRYFLKAL -----2222--------------------------------------------------- AKELGYAERRTVTLTQKHVRGRLAETLLILKENFGFENDGATLSIYLSREELATLSNMTV ------------------------------------1111-------------1111--- SNAIRTLSTFVSERMLALDGKRIKIIDCDRLQKTARSG ----------1111----!!!!---------------- >HYPOTHETICAL PROTEIN ATU0; SWP:Q8UIQ3; PDB:2GAXA; FDTKIAVILRDDLAVWQKLNVTAFLSGIVAQTGEIIGEPYRDGAGNVYNPLSIQPIVVAT ---------1111------------------3333------1111--------------- DQEALRKIHQRSLERDITTSLYIEEFATGHDAANRQVFSHFSPDTAKVVGALRADRKIVD ------------1111---------------------11113333--------------- KITKGAKLHA ---------- >ASPARTATE AMINOTRANSFERAS; SWP:Q9X224; PDB:2GB3A; FSDRVLLTEESPIRKLVPFAEAKKRGVRIHHLNIGQPDLKTPEVFFERIYENKPEVVYYS -3333-----3333---33333333----------------------------------- HSAGIWELREAFASYYKRRQRVDVKPENVLVTNGGSEAILFSFAVIANPGDEILVLEPFY 11113333----------------3333-------------------2222--------- ANYNAFAKIAGVKLIPVTRREEGFAIPQNLESFINERTKGIVLSNPCNPTGVVYGKDERY ---------------------%%%%-111111111111----------------3333-- LVEIAERHGLFLIVDEVYSEIVFRGEFASALSIESDKVVVIDSVSKFSACGARVGCLITR ---------------1111---------3333--1111----------1111-------- NEELISHAKLAQGRLAPPLLEQIGSVGLLNLDDSFFDFVRETYRERVETVLKKLEEHGLK ---------------------------11113333-------------------1111-- RFTKPSGAFYITAELPVEDAEEFARWLTDFNDGETTVAPLRGFYLTPGLGKKEIRIACVL ---------------------------------------1111--22221111------- EKDLLSRAIDVLEGLKFCS ------------------- >THIOPURINE S-METHYLTRANSF; SWP:O55060; PDB:2GB4A; DAEVQKNQVLTLEDWKEKWVTRHISFHQEQGHQLLKKHLDTFLKGQSGLRVFFPLCGKAI 11111111---------------33331111-----------2222-------------- 
EMKWFADRGHTVVGVEISEIGIREFFAEQNLSYTEEPLAEIAGAKVFKSSSGSISLYCCS -------------------------------------1111-------1111-------3 IFDLPRANIGKFDRIWDRGALVAINPGDHDRYADIILSLLRKEFQYLVAVLSYDPTKHAG 3331111------------1111-3333--------1111-------------1111--- PPFYVPSAELKRLFGTKCSMQCLEEVDALEERHKAWGLDYLFEKLYLLTEK -----3333--------------------33333333-------------- >R.ECL18KI; SWP:O87963; PDB:2GB7A; RLSPGEFKTLISKERKSHFITPFALVYKTFCDLGYDQKNSDYFLNNPSEYIIAMRKNCWK ---------------------------------3333-3333------------------ EFEPFEKEFTTRMLSYLIDEERIKDMSPYDAIRDFTMEYPTHIYDLALSNTQSRRSRAGK -------------3333-33332222---------------------------------- EFESILELLMMGAGIPVDVQGAINQIGKLVDLVMPGVVQYTSNKRNTMLISAKTTLRERW ------------------1111----3333-------------2222--------!!!!- QEVPEEVNRTGIREMYLATLDDSFSEETINILYEANVVVVTTVENKNFKYKNNNRVLTFE -------------------------------------------------1111------- DMLQSAMELSRKWNNVSYTDSEKEEIQQSILKQIEKYSDFPYVVNYYRNRLSA --------3333----------------------1111--------------- >DIPEPTIDYL PEPTIDASE 4; SWP:P14740; PDB:2GBCA; RRTYTLADYLKNTFRVKSYSLRWVSDSEYLYKQENNILLFNAEHGNSSIFLENSTFEIFG --------1111--------------------%%%%------------------1111-- DSISDYSVSPDRLFVLLEYNYVKQWRHSYTASYSIYDLNKRQLITEEKIPNNTQWITWSQ --------1111-------------------------3333------------------- EGHKLAYVWKNDIYVKIEPHLPSHRITSTGKENVIFNGINDWVYEEEIFGAYSALWWSPN --------%%%%-----3333---------2222-------------------------- GTFLAYAQFNDTGVPLIEYSFYSDESLQYPKTVWIPYPKAGAVNPTVKFFIVNTDSLSST -----------------------1111-----------2222-----------1111--- TTTIPMQITAPASVTTGDHYLCDVAWVSEDRISLQWLRRIQNYSVMAICDYDKTTLVWNC ----------1111-------------1111----------------------------- PTTQEHIETSATGWCGRFRPAEPHFTSDGSSFYKIVSDKDGYKHICQFQKDRKPEQVCTF -1111--------------------1111--------1111------------------- ITKGAWEVISIEALTSDYLYYISNEYKEMPGGRNLYKIQLTDHTNKKCLSCDLNPERCQY ------------------------22221111------3333-------33333333--- YSVSLSKEAKYYQLGCRGPGLPLYTLHRSTDQKELRVLEDNSALDKMLQDVQMPSKKLDF 
-----1111-------------------1111---------------------------- IVLNETRFWYQMILPPHFDKSKKYPLLIDVYAGPCSQKADAAFRLNWATYLASTENIIVA --%%%%------------1111-----------------------3333----------- SFDGRGSGYQGDKIMHAINKRLGTLEVEDQIEAARQFLKMGFVDSKRVAIWGWSYGGYVT ---2222---3333---2222--3333----------------1111------------- SMVLGSGSGVFKCGIAVAPVSRWEYYDSVYTERYMGLPTPEDNLDHYRNSTVMSRAENFK --1111--------------------------------1111--------3333--1111 QVEYLLIHGTADDNVHFQQSAQISKALVDAGVDFQAMWYTDEDHGIASSTAHQHIYSHMS --------1111---------------1111-----------1111-------------- HFLQQCFSLR ---------- >ORPHAN NUCLEAR RECEPTOR N; SWP:P22736; PDB:2GBDA; NLLTSLVRAHLDSGPSTAKLDYSKFQELVLPHFGKEDAGDVQQFYDLLSGSLEVIRKWAE ---------------------1111-----------3333------------------11 KIPGFAELSPADQDLLLESAFLELFILRLAYRSKPGEGKLIFCSGLVLHRLQCARGFGDW 11-3333----------------------------------3333---3333-----333 IDSILAFSRSLHSLLVDVPAFACLSALVLITDRHGLQEPRRVEELQNRIASCLKEHVAAV 3----------1111-3333-----------------3333------------------- AGPASCLSRLLGKLPELRTLCTQGLQRIFYLKLEDLVPPPPIIDKIFMDT -----3333------------------------------------3333- >UBIQUITIN; SWP:P62988; PDB:2GBJA; MQIFVKTLTGGGGGGGGGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLED -------------------------1111---------------3333----iiii--11 GRTLSDYNIQKESTLHLVLRL 113333---2222-------- >UBIQUITIN; SWP:P62988; PDB:2GBKA; MQIFVKTLTQVRELVGGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG ------------------------1111---------------3333----iiii--333 RTLSDYNIQKESTLHLVLRLRG 33333---2222---------- >CIRCADIAN CLOCK PROTEIN K; SWP:Q5N594; PDB:2GBLA; EHQAIAKMRTMIEGFDDISHGGLPIGRSTLVSGTSGTGKTLFSIQFLYNGIIEFDEPGVF -------------3333------2222------2222----------------------- VTFEETPQDIIKNARSFGWDLAKLVDEGKLFILDASPDPEGQEVVGGFDLSALIERINYA ------------3333-------------------------------------------- IQKYRARRVSIDSVTSVFQQYDASSVVRRELFRLVARLKQIGATTVMTTERIEEYGPIAR ------------------------------------------------------------ YGVEEFVSDNVVILRNVLEGERRRRTLEILKLRGTSHMKGEYPFTITDHGINIFPLGAMR 
--3333------------iiii---------2222-----------1111----1111-- LTQRSSNVRVSSGVVRLDEMCGGGFFKDSIILATGATGTGKTLLVSRFVENACANKERAI ------------------1111-------------------------------------- LFAYEESRAQLLRNAYSWGMDFEEMERQNLLKIVCAYPESAGLEDHLQIIKSEINDFKPA ---------------1111-----------------3333-------------------- RIAIDSLSALARGVSNNAFRQFVIGVTGYAKQEEITGLFTNTSDQFMGAHSITDSHIITD -----3333-----3333------------1111------------------1111---- TIILLQYVEIRGEMSRAINVFKMRGSWHDKAIREFMISDKGPDIKDSFRNFERIISGSPT ----------------------1111-----------3333------1111--1111--- RITVDEKSELSRIVRGVQEKGPES ------------------------ >UBIQUITIN; SWP:Q867C3; PDB:2GBMA; MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGGGGGGGGGIPPDQQRLIFAGKQLED ------1111-------1111---------3333----------1111----iiii---- GRTLSDYNIQKESTLHLVLRL --3333---2222-------- >UPF0358 PROTEIN EF2458; SWP:Q831P3; PDB:2GBOA; DEGISKKFAIQLLEDDAERIKLIRNQKNSLCISQCKAFEEVVDTQYGFSRQVTYATRLGI -------------------------1111------------------------------- LTNDEGHRLLSDLERELNQ ------------------- >HYPOTHETICAL PROTEIN RPA0; SWP:Q6ND56; PDB:2GBSA; VAYWLVKSEPSVWSWDQQVAKGAAGEAWTGVRNHSAKLHMVAMRRGDRAFYYHSNEGKEI --------3333-----3333-----------------3333------------------ VGIAEIIREAYPDPTDASGKFVCVDIKADKPLKTPVTLAAVKAEPRLADMALMKYSRLSV ------------1111---------------------------3333--3333-1111-- QPVTAEEWKLVCKMGGLLEHHHHHH ------------1111--------- >BIPHENYL 2,3-DIOXYGENASE ; SWP:A2TC87; PDB:2GBWA; TLVDTVNASQSRQVFWDEDVYALEIERIFSRAWLMLGHESLVPKPGDFITTYMAEDKVIL ------------1111---------------------3333--2222-----!!!!---- SHQSDGTFRAFINSCSHRGNQICHADSGNAKAFVCNYHGWVFGQDGSLVDVPLESRCYHN --1111------------------------------------1111------3333-%%% SLDKQKLAAKSVRVETYKGFIFGCHDPEAPSLEDYLGEFRYYLDTIWEGAGGGMELLGPP %-3333----------iiii-----1111------------------!!!!--------- MKSLLQCNWKVPAENFIGDGYHVGWTHAAALSQIGGELAGLAGNRADIPFDDLGLQFTTR -------3333-------3333--1111--------3333222211113333-------- HGHGFGVIDNAAAGLHIKREGWTKFLEDTRGEVRRKFGPERERLYLGHWNCSIFPNCSFL 
----------1111------------------------1111-1111------------- YGTNTFKIWHPRGPHEIEVWTYTIVPRDADPATKSMIQREAIRTFGTAGTLESDDGENMS -------------------------1111---------------------3333------ SATYINRGVITRNGRMNSTMGVGYEGPHPVYPGIVGISFIGETSYRGFYRFWKEMIDAPD ---1111-3333------2222-------------------------------------3 WASVKANDDTWDSVFPNRNFWNEKLNAAE 333-1111-1111---1111--------- >Biphenyl/naphthalene diox; SWP:A2TC88; PDB:2GBWB; QIPVTPDVHYDIEAHYRAEVRMFQTGQYREWLQGMVAEDIHYWMPIYEQRLTRDRRPDPT ----------------------1111----------1111----------3333-----1 PDDAAIYNDDFGELKQRVERLYSGQVWMEDPPSKIRYFVSNVEAFEAGNGELDVLSNILV 111--------------3333----3333------------------iiii--------- YRNRRQTEVTVHTLGREDKLRRDGNGFKVFRRKLILDARVTQDKNLYFFC ----------------------!!!!------------------------ >OLIGORIBONUCLEASE; SWP:Q8P8S1; PDB:2GBZA; NDRLIWIDLEMTGLDTDRDSIIEIATIVTDAQLNVLAEGPELAIAHSLETLEAMDEWNRN -----------------------------1111-------------33331111------ QHRRSGLWQRVLDSQVTHAQAEAQTVAFLGEWIRAGASPMCGNSICQDRRFLHRQMSRLE ----------------------------1111-----------3333------------- RYFHYRNLDVSTIKELARRWAPAVASGFAKSSAHTALSDVRDSIDELRHYRQFMGTLGG --------------------33331111------3333----------------3333- >Cytochrome c, class I [Pr; SWP:A1B6D4; PDB:2GC7D; APQFFNIIDGSPLNFDDAMEEGRDTEAVKHFLETGENVYNEDPEILPEAEELYAGMCSGC --------------1111------------------1111-3333--------------- HGHYAEGKIGPGLNDAYWTYPGNETDVGLFSTLYGGATGQMGPMWGSLTLDEMLRTMAWV -1111--------------3333--------------!!!!------------------- RHLYTGDPKDASWLTDEQKAGFTPFQP ------33331111----1111----- >P-COUMARIC ACID DECARBOXY; SWP:Q88RY7; PDB:2GC9A; TKTFKTLDDFLGTHFIYTYDNGWEYEWYAKNDHTVDYRIHGGVAGRWVTDQKADIVLTEG -----33332222-----1111--------------------2222-----------222 IYKISWTEPTGTDVALDFPNEKKLHGTIFFPKWVEEHPEITVTYQNEHIDLEQSREKYAT 2------3333-------1111--------3333--3333---33333333--------- YPKLVVPEFANITYGDAGQNNEDVISEAPYKEPNDIRNGKYFDQNYHRLNK --------------------3333------------------1111-1111 >ATERF1; SWP:O80337; PDB:2GCC; 
KHYRGVRQRPWGKFAAEIRDPAKNGARVWLGTFETAEDAALAYDRAAFRMRGSRALLNFP ----------------------------------3333----------1111-------- LRV --- >CATION-TRANSPORTING ATPAS; SWP:P73241; PDB:2GCFA; MAQTINLQLEGMRCAACASSIERAIAKVPGVQSCQVNFALEQAVVSYHGETTPQILTDAV -----------------------33331111-------------------------1111 ERAGYHARVLKQQ 1111--------- >GLYOXYLATE REDUCTASE/HYDR; SWP:Q9UBQ7; PDB:2GCGA; RLMKVFVTRRIPAEGRVALARAADCEVEQWDSDEPIPAKELERGVAGAHGLLCLLSDHVD -----------3333------1111-------------------2222-----1111--- KRILDAAGANLKVISTMSVGIDHLALDEIKKRGIRVGYTPDVLTDTTAELAVSLLLTTCR --------------------1111-----1111-------1111-------------111 RLPEAIEEVKNGGWTSWKPLWLCGYGLTQSTVGIIGLGRIGQAIARRLKPFGVQRFLYTG 1----------------1111-----2222-------------33333333--------- RQPRPEEAAEFQAEFVSTPELAAQSDFIVVACSLTPATEGLCNKDFFQKMKETAVFINIS ---33333333-----------------------3333--------11111111------ RGDVVNQDDLYQALASGKIAAAGLDVTSPEPLPTNHPLLTLKNCVILPHIGSATHRTRNT 1111----------------------------33331111---------1111------- MSLLAANNLLAGLRGEPMPSELKL -----------1111--------- >PROBABLE ALPHA-METHYLACYL; SWP:Q7U0J6; PDB:2GCIA; AGPLSGLRVVELAGIGPGPHAAMILGDLGADVVRIDRPISRDAMLRNRRIVTADLKSDQG -1111--------------------1111------------3333--------1111--- LELALKLIAKADVLIEGYRPGVTERLGLGPEECAKVNDRLIYARMTGWGQTGPRSQQAGH -------1111-------2222------3333-------------------1111----3 DINYISLNGILHAIGRGDERPVPPLNLVGDFGGGSMFLLVGILAALWERQSSGKGQVVDA 333-----3333---1111-----------1111-------------------------- AMVDGSSVLIQMMWAMRATGMWTDTRGANMLDGGAPYYDTYECADGRYVAVGAIEPQFYA ----------------1111--------------1111----1111-------------- AMLAGLGLDAAELPPQNDRARWPELRALLTEAFASHDRDHWGAVFANSDACVTPVLAFGE --------3333--11111111--------------3333----------------3333 VHNEPHIIERNTFYEANGGWQPMPAPRFSRTASSQPRPPAATIDIEAVLTDWDG 11113333---------------------------------------------- >Hypothetical 63.0 kDa pro; SWP:Q04636; PDB:2GCLA; VAGDAIVSFQDVFFTTPRGRYDIDIYKNSIRLRGKTYEYKLQHRQIQRIVSLPKADDIHH -1111------------------------------------3333--------3333--- 
VAIEPPLRQGQTTYPFLVLQFQKDEETEVQLNLEDEDYEENYKDKLKKQYDAKTHIVLSH --------!!!!---------1111----------------------------------- VLKGLTDRRVIVPGEYKSKYDQCAVSCSFKANEGYLYPLDNAFFFLTKPTLYIPFSDVSV -----------------1111-------!!!!---------------------3333--- NISRRTFDLEVVLRSNRGSTTFANISKEEQQLLEQFLKSKNLRVKN -------------%%%%--------3333--------1111----- >PUTATIVE HYDROXYACYLGLUTA; SWP:Q9C8L4; PDB:2GCUA; MKLLFRQLFENESSTFTYLLADVSHPDKPALLIDPVDKTVDRDLKLIDELGLKLIYAMNT ----------1111----------1111-------3333--------------------- HVHADHVTGTGLLKTKLPGVKSVISKASGSKADLFLEPGDKVSIGDIYLEVRATPGHTAG ------------------------3333--------2222---!!!!------------- CVTYVTGEGADQPQPRMAFTGDAVLIRGCGRTDFQEGSSDQLYESVHSQIFTLPKDTLIY --------1111-------!!!!----------%%%%------------11111111--- PAHDYKGFEVSTVGEEMQHNPRLTKDKETFKTIMSNLNLSYPKMIDVAVPANMVGLQDVP ---------------------1111--------1111----1111----3333------- SQA --- >FERROUS IRON TRANSPORT PR; SWP:Q57IW9; PDB:2GCXA; MQFTPDSAWKITGFSRDISPAYRQKLLSLGMLPGSSFHVVRVAPLGDPVHIETRRVSLVL ------------------------3333-------------------------------- RKKDLALIELEAVAQ --3333--------- >CHARGED MULTIVESICULAR BO; SWP:Q9Y3E7; PDB:2GD5A; PKELVNEWSLKIRKERVVDRQIRDIQREEEKVKRSVKDAAKKGQKDVCIVLAKEIRSRKA ------------------------------------------------------------ VSKLYASKAHNSVLGKNQLAVLRVAGSLQKSTEVKAQSLVKIPEIQATRELSKEKAGIIE --------------------------------------------------3333------ AEEIDRILFEI --3333----- >HYPOTHETICAL PROTEIN YYAP; SWP:P37508; PDB:2GD9A; GTNNLKQRRIILDLAVTLDGFIEGKNGEVDWCIDPDGFTDFLNQIDTILYGRKSFDLWGQ ----------------1111--------3333---------1111--------------- YKELWKLVHSKKKYVFSRTQNEIDNQAIFINDNILEEVNKLKKNPGKDIWLYGGASLITT --33331111------------------------------3333---------3333--- FINLGLVDEFRLSIHPVVLGEGKPLFIDVKQRINLKVNTRTFSSGVVQIVYHW -1111------------------------------------3333-------- >LECTIN; SWP:NA; PDB:2GDFA; ADTIVAVELDSYPNTDIGDPNYPHIGIDIKSIRSKSTARWNMQTGKVGTVHISYNSVAKR ---------------3333-----------------------2222--------3333-- 
LSAVVSYSGSSSTTVSYDVDLNNVLPEWVRVGLSATTGLYKETNTILSWSFTSKLKTENS ------2222---------1111------------------------------------- LHFSFHKFSQNPKDLILQGDAFTDSDGNLELTKVQGNSVGRALFYAPVHIWEKSAVVASF -----------1111--------1111------------------------3333----- DATFTFLIKSPDREPADGITFFIANTDTSIPSGSGGRLLGLFPDAN ------------------------1111--2222!!!!-------- >MACROPHAGE MIGRATION INHI; SWP:P34884; PDB:2GDGA; PMFIVNTNVPRASVPEGFLSELTQQLAQATGKPAQYIAVHVVPDQLMTFSGTNDPCALCS ---------3333-2222--------------3333------------iiii-------- LHSIGKIGGAQNRNYSKLLCGLLSDRLHISPDRVYINYYDMNAANVGWNGSTFA -----------------------------1111--------1111--iiii--- >LEGHEMOGLOBIN (OXY); SWP:P02240; PDB:2GDM; GALTESQAALVKSSWEEFNANIPKHTHRFFILVLEIAPAAKDLFSFLKGTSEVPQNNPEL --------------------------------------3333-1111------------- QAHAGKVFKLVYEAAIQLEVTGVVVTDATLKNLGSVHVSKGVADAHFPVVKEAILKTIKE -------------------------------------1111-1111-------------- VVGAKWSEELNSAWTIAYDELAIVIKKEMDDAA -!!!!---------------------------- >BETA-LACTAMASE; SWP:P0A5I6; PDB:2GDNA; DLADRFAELERRYDARLGVYVPATGTTAAIEYRADERFAFCSTFKAPLVAAVLHQNPLTH --------------------------------1111---!!!!-------------3333 LDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANLLLADLGGPT -------3333------3333-1111-----------------------3333--- >YITF; SWP:NA; PDB:2GDQA; LVKIVRIETFPLFHRLEKPYGDANGFKRYRTCYLIRIITESGIDGWGECVDWLPALHVGF ---------------------1111-------------1111---------3333----- TKRIIPFLLGKQAGSRLSLVRTIQKWHQRAASAVSMALTEIAAKAADCSVCELWGGRYRE -------22221111----------------------------1111----1111----- EIPVYASFQSYSDSPQWISRSVSNVEAQLKKGFEQIKVKIGGTSFKEDVRHINALQHTAG -------------1111-----------1111---------------------------3 SSITMILDANQSYDAAAAFKWERYFSEWTNIGWLEEPLPFDQPQDYAMLRSRLSVPVAGG 333-----%%%%-----------------------------3333----1111------1 ENMKGPAQYVPLLSQRCLDIIQPDVMHVNGIDEFRDCLQLARYFGVRASAHAYDGSLSRL 111-3333----1111-------1111--------------1111--------------- YALFAQACLPPWSKMKNDHIEPIEWDVMENPFTDLVSLQPSKGMVHIPKGKGIGTEINME 
-----1111---------------------3333------iiii------!!!!------ IVNRYKWDGSAYE --1111------- >GLUTATHIONE S-TRANSFERASE; SWP:Q59721; PDB:2GDRA; MKLYYSPGACSLSPHIALREAGLNFELVQVDLASKKTASGQDYLEVNPAGYVPCLQLDDG -----2222---------------------------1111-3333-1111------1111 RTLTEGPAIVQYVADQVPGKQLAPANGSFERYHLQQWLNFISSELHKSFSPLFNPASSDE ----3333--------3333----2222--------------------3333-3333--- WKNAVRQSLNTRLGQVARQLEHAPYLLGDQLSVADIYLFVVLGWSAYVNIDLSPWPSLQA -----------------1111---1111---------------3333----3333----- FQGRVGGREAVQSALRAEGLIK ---------------1111--- >NAD+-dependent 15-hydroxy; SWP:NA; PDB:2GDZA; AHMVNGKVALVTGAAQGIGRAFAEALLLKGAKVALVDWNLEAGVQCKAALHEQFEPQKTL ---2222-----------------------------------------------1111-- FIQCDVADQQQLRDTFRKVVDHFGRLDILVNNAGVNNEKNWEKTLQINLVSVISGTYLGL ----1111---------------------------------------------------- DYMSKQNGGEGGIIINMSSLAGLMPVAQQPVYCASKHGIVGFTRSAALAANLMNSGVRLN ---1111-----------3333---1111------------------------------- AICPGFVNTAILESIEKEENMGQYIEYKDHIKDMIKYYGILDPPLIANGLITLIEDDALN ----------------3333!!!!1111-------------3333----------1111- GAIMKITTSKGIHFQDYGSKENLYFQ -------------------1111--- >PROBABLE ACETYLTRANSFERAS; SWP:Q8UD38; PDB:2GE3A; DTVTIKPIRAEHVESFHRALDAVSRERKYLSFLEAPPLEAVRAFVLDIENDHPQFVAIAD --------3333-----------3333--------------------1111-------ii GDVIGWCDIRRQDRATRAHCGTLGGILPAYRNKGLGARLRRTLDAAHEFGLHRIELSVHA ii-----------1111---------3333---3333---------------------11 DNARAIALYEKIGFAHEGRARDAVSIDGHYIDSLNAIIFG 11-------------------------------------- >NUCLEOCAPSID PROTEIN; SWP:P32923; PDB:2GE7A; RYCKRTIPPGYKVDQVFGPRTKGKEGNFGDDKMNEEGIKDGRVTAMLNLVPSSHACLFGS 3333---22221111--------2222--------!!!!3333---1111---------- RVTPKLQPDGLHLKFEFTTVVPRDDPQFDNYVKICDQCVDGVGTRPK ------1111-----------1111---------------2222--- >TYROSINE-PROTEIN KINASE B; SWP:Q06187; PDB:2GE9A; TEAEDSIEMYEWYSKHMTRSQAEQLLKQEGKEGGFIVRDSSKAGKYTVSVFAKSTGDPQG --------------------------------------------------------1111 
VIRHYVVCSTPQSQYYLAEKHLFSTIPELINYHQHNSAGLISRLKYPVSQQNKNAPST -----------------3333------------------------------------- >NUCLEOCAPSID PROTEIN; SWP:P32923; PDB:2GECA; RPPKVGSSGNASWFQAIKAKKLNSPQPKFEGSGVPDNENLKTSQQHGYWRRQARFKPGKG ------------------------------------33333333---------------- RRKPVPDAWYFYYTGTGPAADLNWGDSQDGIVWVAAKGADVKSRSNQGTRDPDKFDQYPL ------------2222--11112222-2222----22223333-------3333------ RFSDGGPDGNFRWDFIPL -1111--1111------- >HYPOTHETICAL PROTEIN; SWP:Q6MZU5; PDB:2GEEA; PRGSHMEVPQPTDLSFVDITDSSIGLRWTPLNSSTIIGYRITVVAAGEGIPIFEDFVDSS ---------------------------------------------------------111 VGYYTVTGLEPGIDYDISVYTVKNGGESTPTTLTQQTAVPPPTDLRFTNIGPDTMRVTWA 1--------2222----------------------------------------------- PPPSIDLTNFLVRYSPVKNEEDVAELSISPSDNAVVLTNLLPGTEYVVSVSSVYEQHEST ---------------1111---------1111---------------------------- PLRGRQKT -------- >PROTEASE VP4; SWP:Q8AZM0; PDB:2GEFA; DLPISLLQTLAYKQPLGRNSRIVHFTDGALFPVVAFGDNHSTSELYIAVRGDHRDLSPDV ----------------------------------------------------33333333 RDSYALTGDDHKVWGATHKFNVKTRTDLTILPVADVFWRADGSADVDVVWNDPAVAGQSS ------1111----------------------------1111---------------111 SIALALASSLPFVPKAAYTGCLSGTNVQPVQFGNLKARAAHKIGLPLVGTQDGGEDTRIC 1---------------------!!!!-------------3333------1111------- TLDDAADHAFDSES -------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q9I2Q9; PDB:2GENA; RKDEILQAALACFSEHGVDATTIEIRDRSGASIGSLYHHFGNKERIHGELYLAGIGQYAA -----------------1111--------------------------------------- LLEAGFARARSAEETVRLLVTSYIDWVVANPDWARFILHSRGRVEAGELGERLRADNQAH ----3333--------------------------------------1111---------- FARIHAALAGYRAEGLFREPDDCFASVVIGPAHDLARQWLAGRTRVALADCRELLAQVAW -----------1111-----------------------1111----3333---------- DSVRAA ------ >PANTOTHENATE KINASE; SWP:P63810; PDB:2GEVA; SEPSPYVEFDRRQWRALRMSTPLALTEEELVGLRGLGEQIDLLEVEEVYLPLARLIHLQV -----------------!!!!---------1111-------------------------- AARQRLFAATAEFLGEPQQNPDRPVPFIIGVAGSVAVGKSTTARVLQALLARWDHHPRVD 
---------------------------------2222-----------3333-------- LVTTDGFLYPNAELQRRNLMHRKGFPESYNRRALMRFVTSVKSGSDYAAPVYSHLHYDII --3333------------1111--1111-------------------------1111--2 PGAEQVVRHPDILILEGLNVLQTGPTLMVSDLFDFSLYVDARIEDIEQWYVSRFLAMRTT 222-------------3333-------3333----------------------------3 AFADPESHFHHYAAFSDSQAVVAAREIWRTINRPNLVENILPTRPRATLVLRKDADHSIN 3331111--3333--------------------------33331111------1111--- RLRLRKL ------- >SNOL; SWP:Q9RN64; PDB:2GEXA; STTANKERCLEVAAWNRWDVSGVVAHWAPDVVHYDDEDKPVSAEEVVRRNSAVEAFPDLR --------------11113333-1111----------------------------1111- LDVRSIVGEGDRVLRITCSATHQGVFGIAPTGRKVRWTYLEELRFSEAGKVVEHWDVFNF --------!!!!---------------------------------3333----------3 SPLFRDLGVVPDGLKLAAALEH 333---%%%%------------ >ACLR PROTEIN; SWP:Q1XDX7; PDB:2GEYA; SMAERKALCLEMVAAWNRWDLSGIIKHWSPDIVHYSEDNEVSSADMVKLMEGGLKAFPDL ---------------11113333-11111111---%%%%-----------------1111 QLEVKSIMAEEDRVALRITVTATHQGEFMGVQPTGQRVSWHLVEELRFVDGKVVEHWDVI ---------!!!!--------------iiii-----------------iiii-------- NMRPLLVRLGKLPDVPKVVLEASAKLAAALEHHHHHH --------------------------------1111- >L-ASPARAGINASE ALPHA SUBU; SWP:Q9ZSD6; PDB:2GEZA; GGWSIALHGGAGDIPFSLPPERRKPREEGLRHCLQIGVEALKAQKPPLDVVELVVRELEN ------------------3333-------------------------------------- IEHFNAGIGSVLTNSGTVEMEASIMDGNTMKCGAVSGLSTVLNPISLARLVMDKTPHIYL 1111--2222--1111--------------------------3333-------------- AFQGAQDFAKQQGVETVDSSHLITAENVERLKLAIEAN ---------1111----3333----------------- >L-asparaginase [Precursor; SWP:Q9ZSD6; PDB:2GEZB; TVGCVAVDSHGNLASATSTGGLVNKMVGRIGDTPLIGAGTYANELCAVSATGKGEEIIRA -------1111----------22222222--1111-------3333------3333-111 TVARDVAALMEFKGLSLKEAADFVIHERTPKGTVGLIAVSAAGEIAMPFNTTGMFRACAT 1--------------3333----------2222------1111----------------1 EDGYSEIAIWPTT 111---------- >GTP-BINDING PROTEIN DI-RA; SWP:O95057; PDB:2GF0A; QSNDYRVVVFGAGGVGKSSLVLRFVKGTFRDTYIPTIEDTYRQVISCDKSVCTLQITDTT -----------2222---------------1111------------%%%%---------! 
GSHQFPAMQRLSISKGHAFILVFSVTSKQSLEELGPIYKLIVQIKGSVEDIPVMLVGNKC !!!--------------------1111---------------11111111---------- DETQREVDTREAQAVAQEWKCAFMETSAKMNYNVKELFQELLTLETRRNMSLN ---------------------------1111----------3333-------- >3-HYDROXYISOBUTYRATE DEHY; SWP:P31937; PDB:2GF2A; MPVGFIGLGNMGNPMAKNLMKHGYPLIIYDVFPDACKEFQDAGEQVVSSPADVAEKADRI -------------------1111--------3333-3333------------3333---- ITMLPTSINAIEAYSGANGILKKVKKGSLLIDSSTIDPAVSKELAKEVEKMGAVFMDAPV ------------------3333--2222--------3333--------1111-------- SGGVGAARSGNLTFMVGGVEDEFAAAQELLGCMGSNVVYCGAVGTGQAAKICNNMLLAIS ------------------3333------3333---------2222--------------- MIGTAEAMNLGIRLGLDPKLLAKILNMSSGRCWSSDTYNPVPGVMDGVPSANNYQGGFGT -----------1111------------11113333-----2222---3333%%%%----- TLMAKDLGLAQDSATSTKSPILLGSLAHQIYRMMCAKGYSKKDFSSVFQFLREEET ---------------------------------1111-1111-------------- >MONOMERIC SARCOSINE OXIDA; SWP:P40859; PDB:2GF3A; STHFDVIVVGAGSMGMAAGYQLAKQGVKTLLVDAFDPPHTNGSHHGDTRIIRHAYGEGRE -----------3333-------1111----------------------------111111 YVPLALRSQELWYELEKETHHKIFTKTGVLVFGPKGESAFVAETMEAAKEHSLTVDLLEG 11------------3333---------------2222---------------------!! 
DEINKRWPGITVPENYNAIFEPNSGVLFSENCIRAYRELAEARGAKVLTHTRVEDFDISP !!----------1111------------------------1111--------------11 DSVKIETANGSYTADKLIVSMGAWNSKLLSKLNLDIPLQPYRQVVGFFESDESKYSNDID 11----1111----------!!!!--------------------------1111-3333- FPGFMVEVPNGIYYGFPSFGGCGLKLGYHTFGQKIDPDTINREFGVYPEDESNLRAFLEE -------1111-------iiii-------------1111---2222-------------- YMPGANGELKRGAVCMYTKTLDEHFIIDLHPEHSNVVIAAGFSGHGFKFSSGVGEVLSQL -1111--------------1111------1111--------iiii3333----------- ALTGKTEHDISIFSINRPALKESLQ ---------333311113333---- >PROTEIN VNG1086C; SWP:Q9HQM9; PDB:2GF4A; HKDELLELHEQVNIKDQFLGFDHVDETAFAAYEELDVEPSHVHKSKSEHKHAVFLLGNAL -----------------1111---111133333333-1111------------------- AAASEDEFSSAGRISKREELADDAS ------------------------- >FADD PROTEIN; SWP:Q13158; PDB:2GF5A; SDPFLVLLHSVSSSLSSSELTELKYLCLGRVGKRKLERVQSGLDLFSMLLEQNDLEPGHT -3333-----1111-----------------33333333-33333333-1111------3 ELLRELLASLRRHDLLRRVDDFEAGAAAGAAPGEEDLCAAFNVICDNVGKDWRRLARQLK 333----------------------------3333--------3333---3333--1111 VSDTKIDSIEDRYPRNLTERVRESLRIWKNTEKENATVAHLVGALRSCQMNLVADLVQEV -3333----3333---3333-----------------------------3333------- QQARDLQNRSG ----------- >CONSERVED HYPOTHETICAL PR; SWP:Q97WD2; PDB:2GF6A; ENIEYVFEDVVRIYDTDAQGIAHYAAYYRFFTNTIEKFIKEKVGIPYPIVNENLWFVIAE -3333------1111-1111--3333------------------------1111------ SHAIYHRPVKLGDKLTVLLNPKILSNKTIKFEFKVLKDGELTTEGYVIQIAINPKIWKST ---------2222-----------------------%%%%-------------------- EPKEIDKLSIK -33333333-- >JUMONJI DOMAIN-CONTAINING; SWP:O75164; PDB:2GF7A; QSITAGQKVISKHKNGRFYQCEVVRLTTETFYEVNFDDGSFSDNLYPEDIVSQDCLQFGP ---2222-----1111-------------------1111------3333----3333--- PAEGEVVQVRWTDGQVYGAKFVASHPIQMYQVEFEDGSQLVVKRDDVYTLDEELP -2222-----1111-------------------1111-----3333--1111--- >RAS-RELATED PROTEIN RAB-3; SWP:O95716; PDB:2GF9A; LVPRGSDYMFKLLLIGNSSVGKTSFLFRYADDSFTPAFVSTVGIDFKVKTVYRHDKRIKL --2222----------2222--------------------------------%%%%---- 
QIWDTAGQERYRTITTAYYRGAMGFLLMYDIANQESFAAVQDWATQIKTYSWDNAQVILV -------------3333-2222-------1111------------------1111----- GNKCDLEDERVVPAEDGRRLADDLGFEFFEASAKENINVKQVFERLVDVICEKMNE --3333------3333----------------1111----------------1111 >JUMONJI DOMAIN-CONTAINING; SWP:O75164; PDB:2GFAA; ALQSITAGQKVISKHKNGRFYQCEVVRLTTETFYEVNFDDGSFSDNLYPEDIVGPPAEGE -----2222-----1111-------------------1111------1111-----2222 VVQVRWTDGQVYGAKFVASHPIQMYQVEFEDGSQLVVKRDDVYTLDE -----1111-------------------1111-----3333------ >IGG2A CNJ206 FAB (HEAVY C; SWP:NA; PDB:2GFBA; QIQMTQSPSSLSASLGERVSLTCRASQEISGYLSWLQQKPDGTIKRLIYAASTLDSGVPK ----------------------------iiii------1111----------------11 RFSGSRSGSDYSLTISSLESEDFADYYCLQYASSPYTFGGGTKLEILRGGAAPTVSIFPP 11----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT ------------------------------------------------------------ LTKDEYERHNSYTCEATHKTSTSPIVKSFNRNEC -33331111--------3333------------- >IGG2A CNJ206 FAB (HEAVY C; SWP:NA; PDB:2GFBB; DVKLVESGGGLVQPGGSRKLSCAASGFTFSSFGMHWVRQAPEKGLEWVAYISSGSSTIYY ------------2222-----------3333--------------------1111----- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTAMYYCARGDYYGSRGAYWGQGTLVTVSAK -1111-------1111----------3333-------1111-----3333---------- TTAPSVYPLAPVCGDTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPAVLQSDLY ---------------------------------------iiii----------------- TLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKKIEPRG -----------------------3333------------ >BH2851; SWP:Q9K901; PDB:2GFGA; GGKEIEIERKTLVSKETFKRLISQLHIGEGDFKLQRNHYFETDDFQLKKQSSALRIREKE ----------------------1111-3333----------1111--1111-------%% AIFTFTLKQPHPAGLLETNQTLSKQEAKLALESAHFPSGEVDALRDLSIPISQLKHIGTL %%------------------------------------------1111-3333------- STSRAEISYEQGILCLDHSSYLGIEDYEIEFEGTSEEHATVTFQEILKTFSISQVPTENK --------!!!!--------iiii-----------------------1111--------- IQRFFSKKE --------- >haloacid dehalogenase-lik; SWP:NA; PDB:2GFHA; 
HMGLSRVRAVFFDLDNTLIDTAGASRRGMLEVIKLLQSKYHYKEEAEIICDKVQVKLSKE -------------2222--------------------1111---------------1111 CFHCITDVRTSHWEEAIQETKGGADNRKLAEECYFLWKSTRLQHMILADDVKAMLTELRK -----------------------------------------1111--------------- EVRLLLLTNGDRQTQREKIEACACQSYFDAIVIGGEQKEEKPAPSIFYHCCDLLGVQPGD -----------------------3333-----3333--------------------3333 CVMVGDTLETDIQGGLNAGLKATVWINKSGRVPLTSSPMPHYMVSSVLELPALLQSIDCK ---------------1111-------3333---------------3333-------1111 VSMS ---- >HTH-type transcriptional ; SWP:Q0S7R9; PDB:2GFNA; IVDHDERRRALADAVLALIAREGISAVTTRAVAEESGWSTGVLNHYFGSRHELLLAALRR ----------------------3333---------------------------------- AGDIQGDRYRTILDEEGAGPIEKLRNITASILPLDERRLATRVFLFFYAEGTARGEIAAF --------------2222----------1111--3333---------------------- LARWRGVVRESVVAAQREGTVSTDLDADAVTVALVALTDGLALQAILDPVVKAISAEDAA ---------------------1111----------------------333311111111- ARCVDAAVRR ---------- >UBIQUITIN CARBOXYL-TERMIN; SWP:P40818; PDB:2GFOA; IRNLNPVFGGSGPALTGLRNLGNTCYNSILQCLCNAPHLADYFNRNCYQDDINRSNLLGH 1111-------2222-------------------------------3333--1111---i KGEVAEEFGIIKALWTGQYRYISPKDFKITIGKINDQFAGYSQQDSQELLLFLDGLHEDL iii---------1111------------------3333-------------------111 NKADNDHLDDFKAAEHAWQKHKQLNESIIVALFQGQFKSTVQCLTCHKKSRTFEAFYLSL 1---3333---------------------------------------------------- PLASTSKCTLQDCLRLFSKEEKLTDNNRFYCSHCRARRDSLKKIEIWKLPPVLLVHLKRF --------3333--3333------1111---3333------------------------- SYDGRWKQKLQTSVDFPLENLDLSQYVIGPKNNLKKYNLFSVSNHYGGLDGGHYTAYCKN ----------------------3333---------------------3333--------- AARQRWFKFDDHEVSDISVSSVKSSAAYILFYTSL 1111-----!!!!----3333--3333-------- >UPF0204 PROTEIN PH0006; SWP:Y006_PYRHO; PDB:2GFQA; HKVITTKVDKASNINKLIENFGFKETEYVFEGNPVYKRGDVLILTTNDEIYYDYLDREIE -----1111-----3333-----------iiii----!!!!----------2222----- NQLGFKPEIIAFASRHSSKQKLPALTTHVTGNWGKAYGGKDESFAVAIPSAKLSLLKSEL -----------------------------------------------3333------111 
NDLGWTVCYEATHHGPTELEVPSFFIEIGSSEEEWINDRAGEIIAETIIYVLDNYEKGRS 1-----------------------------3333------------------------11 KFKVALGIGGGHYAPKQTKRALEGDLAFGHILPKYAQPVSRDVIKALNRFGEKVEAIYVD 11--------1111------------------3333---3333-3333-----------3 WKGSRGETRQLAKSLAQELGLEFIKDG 333------------------------ >3-OXOACYL-[ACYL-CARRIER-P; SWP:P0AAI5; PDB:2GFVA; KRRVVVTGLGMLSPVGNTVESTWKALLAGQSGISLIDHFDTSAYATKFAGLVKDFNCEDI ------------1111---------1111-----------1111------------1111 ISRKEQRKMDAFIQYGIVAGVQAMQDSGLEITEENATRIGAAIGSGIGGLGLIEENHTSL -33331111---------------3333---3333------------------------- MNGGPRKISPFFVPSTIVNMVAGHLTIMYGLRGPSISIATAQTSGVHNIGHAARIIAYGD ---3333-111111111111-------------------!!!!----------------- ADVMVAGGAEKASTPLGVGGFGAARALSTRNDNPQAASRPWDKERDGFVLGDGAGMLVLE -------------3333----1111----11111111-2222------------------ EYEHAKKRGAKIYAELVGFGMSSDAYHMTSPPENGAGAALAMANALRDAGIEASQIGYVN -----1111----------------------1111----------------3333----- AHGTSTPAGDKAEAQAVKTIFGEAASRVLVSSTKSMTGHLLGAAGAVESIYSILALRDQA -----3333--------------3333------------!!!!----------------- VPPTINLDNPDEGCDLDFVPHEARQVSGMEYTLCNSFGFGGTNGSLIFKKI --------------------------------------------------- >GENOME POLYPROTEIN; SWP:NA; PDB:2GG1A; TYTVCDKTKFTWKRAPTDSGHDTVVMEVGFSGTRPCRIPVRAVAHGVPEVNVAMLITPNP -----1111--------------------------------------------------- TMENNGGGFIEMQLPPGDNIIYVGDLNHQWFQK ----------------------!!!!------- >METHIONINE AMINOPEPTIDASE; SWP:P0AE18; PDB:2GGCA; AISIKTPEDIEKMRVAGRLAAEVLEMIEPYVKPGVSTGELDRICNDYIVNEQHAVSACLG --------------------------3333-2222----------------------222 YHGYPKSVCISINEVVCHGIPDDAKLLKDGDIVNIDVTVIKDGFHGDTSKMFIVGKPTIM 2----------!!!!------1111--2222---------iiii--------------22 GERLCRITQESLYLALRMVKPGINLREIGAAIQKFVEAEGFSVVREYCGHGIGRGFHEEP 22-------------111122223333---------1111-------------------- QVLHYDSRETNVVLKPGMTFTIEPMVNAGKKEIRTMKDGWTVKTKDRSLSAQYEHTIVVT ------1111----2222-------------------------1111------------- DNGCEILTLRKDDTIPAIISHDE 
-----11111111---------- >CENTRIN-2; SWP:P41208; PDB:2GGMA; ELTEEQKQEIREAFDLFDADGTGTIDVKELKVARALGFEPKKEEIKKISEIDKEGTGKNF -------------33331111----333333331111----------------------- GDFLTVTQKSEKDTKEEILKAFKLFDDDETGKISFKNLKRVAKELGENLTDEELQEIDEA -------------------------1111-------------1111---3333---3333 DRDGDGEVSEQEFLRIKKTSLY --------3333---------- >PROTO-ONCOGENE C-CRK; SWP:Q64010; PDB:2GGRA; GPIYARVIQKRVPNAYDKTALALEVGELVKVTKINVSGQWEGECNGKRGHFPFTHVRLLD -------------1111------2222--------------------------------- QQN --- >SCO1 PROTEIN HOMOLOG, MIT; SWP:O75880; PDB:2GGTA; LLGGPFSLTTHTGERKTDKDYLGQWLLIYFGFTHCPDVCPEELEKMIQVVDEIDSITTLP ---------1111---33332222-------1111------------------------- DLTPLFISIDPERDTKEAIANYVKEFSPKLVGLTGTREEVDQVARAYRVYYSPGPKDEDE ---------3333-3333--------------------------1111------------ DYIVDHTIIMYLIGPDGEFLDYFGQNKRKGEIAASIATHMRPYR -------------1111------33333333--------1111- >GUANYLYL CYCLASE-ACTIVATI; SWP:O95843; PDB:2GGZA; WYRTFMMEYPSGLQTLHEFKTLLGLQGLNQKANKHIDQVYNTFDTNKDGFVDFLEFIAAV ---3333-3333-----------------3333-------3333---------------- NLIMQEKMEQKLKWYFKLYDADGNGSIDKNELLDMFMAVQALNGQQTLSPEEFINLVFHK -------------------1111---------------1111------------------ IDINNDGELTLEEFINGMAKDQDLLEIVYKSFDFSNVLRVICNGK ---------3333---------3333--3333------------- >ARTEMIN; SWP:O60609; PDB:2GH0A; DSDLCLKFAMLCTLNDKCDRLRKAYGEACSGPHCQRHVCLRQLLTFFEKAAEPHAQGLLL ------------------------------------------------------------ CPCAPNDRGCGERRRNTIAPNCALPPVAPNCLELRRLCFSDPLCRSRLVDFQTHCHPMDI ---1111------1111-----------------------------------------33 LGTCATEQSRCLRAYLGLIGTAMTPNFVSNVNTSVALSCTCRGSGNLQEECEMLEGFFSH 33-------------1111---------------------2222---------------- NPCLTEAIAAKMRFHSQLFS -------------------- >METHYLTRANSFERASE; SWP:Q81E32; PDB:2GH1A; YLKNTRDLYYNDDYVSFLVNTVWKITKPVHIVDYGCGYGYLGLVLPLLPEGSKYTGIDSG ----3333---------------------------!!!!---------2222-------- ETLLAEARELFRLLPYDSEFLEGDATEIELNDKYDIAICHAFLLHTTPETLQKIHSVKKG 
----------3333---------1111--------------3333-3333---1111222 GKIICFEPHWISNASYLLDGEKQSEFIQLGVLQKLFESDTQRNGKDGNIGKIPIYLSELG 2-------3333-----22223333--------------------1111-------1111 VKNIECRVSDKVNFLDSNHHNDKNDLYQSLKEEGIAGDPGDKQQFVERLIARGLTYDNAL ------------------------------1111---------------1111------- AQYEAELRFFKALHLHSSLVYAPNKITFGEIEC -------------1111---------------- >CAPSID PROTEIN; SWP:P36285; PDB:2GH8A; EIVTEEQGTVVQQQPAPAPTALATLATASTGKSVEQEWMTFFSYHTSINWSTVESQGKIL -----------------------------------1111-----------3333------ YSQALNPSINPYLDHIAKLYSTWSGGIDVRFTVSGSGVFGGKLAALLVPPGVEPIESVSM -----3333-------1111---------------3333-----------------3333 LQYPHVLFDARQTEPVIFTIPDIRKTLFHSMDETDTTKLVIMVYNELINPYENGVENKTT --------1111-----------------3333--------------------------- CSITVETRPSADFTFALLKPPGSLIKHGSIPSDLIPRNSAHWMGNRWWSTISGFSVQPRV ---------3333------2222-1111---------3333--3333------------- FQSNRHFDFDSTTTGWSTPYYVPIEIKIQGKVGSNNKWFHVIDTDKALVPGIPDGWPDTT ------------------------------------------------------------ IPDETKATNGNFSYGESYRAGSTTIKPNENSTHFKGTYICGTLSTVEIPENDEQQIKTEA --------------3333-------1111----2222-----------------3333-- EKKSQTMYVVTADFKDTIVKPQHKISPQKLVVYFDGPEKDLTMSATLSPLGYTLVDEQPV ----------------------------------------------------------22 GSVSSRVVRIATLPEAFTQGGNYPIFYVNKIKVGYFDRATTNCYNSQILMTSQRLAEGNY 223333-----------------------------------------3333--------- NLPPDSLAVYRITDSSSQWFDIGINHDGFSYVGLSDLPNDLSFPLTSTFMGVQLARVKLA ------------------------3333-------------------------1111--- SKVK ---- >MALTOSE/MALTODEXTRIN-BIND; SWP:Q5SHS8; PDB:2GH9A; KITVWTHFGGPELEWLKEQARTFERTSGTKVEVVEVPFAEIKQKFILGAPQGQAADLVVT ------------------------------------1111--------1111-------- VPHDWVGEMAQAGVLEPVGKYVTQAYLADLQGVAVEAFTFGGRLMGLPAFAESVALIYNK -3333------------3333-----1111-----1111iiii----------------- KYVKEPPRTWEEFLALAQKLTTGATFGFLYNIGDPYFNFGFFKAFGAENVFAKDAKGNLD ------------------------------11113333---------------------1 PTKLLIGGEVGEKALQFIKDLRFKYNLVPEGVDYGVADGAFKDGALAMILNGPWALGDYK 
111-------------------------2222-------------------3333----1 KAKVDFGIAPFPVPPGAKNPWGPFLGVQGVVVNAYSKNKTQAVNFAKTLVTGRNLVAFNQ 111----------2222---------------1111---------3333----------- AGGRIPVSKSAVKQLEKDPVVAGFSKVFPLGAPMPNIPEMGKVWGPWGNAISLAIQRPDS ------------1111----------3333------3333----------------1111 NVKKIVEDMVAEIKKAIG ------------------ >maltose ABC transporter, ; SWP:Q9X0T1; PDB:2GHAA; MQPKLTIWCSEKQVDILQKLGEEFKAKYGVEVEVQYVNFQDIKSKFLTAAPEGQGADIIV ---------3333------------------------3333--------1111------- GAHDWVGELAVNGLIEPIPNFSDLKNFYETALNAFSYGGKLYGIPYAMEAIALIYNKDYV ---------1111-------1111--------1111iiii-------------------- PEPPKTMDELIEIAKQIDEEFGGEVRGFITSAAEFYYIAPFIFGYGGYVFKQTEKGLDVN --------------------iiii------11111111----1111------1111-111 DIGLANEGAIKGVKLLKRLVDEGILDPSDNYQIMDSMFREGQAAMIINGPWAIKAYKDAG 1------------------------1111--------1111-------3333----1111 IDYGVAPIPDLEPGVPARPFVGVQGFMVNAKSPNKLLAIEFLTSFIAKKETMYRIYLGDP -----------2222-------------1111------------1111------------ RLPSRKDVLELVKDNPDVVGFTLSAANGIPMPNVPQMAAVWAAMNDALNLVVNGKATVEE ----11113333-----------3333------3333-------------1111------ ALKNAVERIKAQIQ -------------- >Ascorbate peroxidase; SWP:Q43758; PDB:2GHCX; SGKSYPTVSADYQKAVEKAKKKLRGFIAEKRCAPLMLRLAAHSAGTFDKGTKTGGPFGTI -------------------------------------------1111-1111-----333 KHPAELAHSANNGLDIAVRLLEPLKAEFPILSYADFYQLAGVVAVEVTGGPEVPFHPGRE 3-3333-3333-----------3333-3333--------------1111----------- DKPEPPPEGRLPDATKGSDHLRDVFGKAMGLTDQDIVALSGGHTIGAAHKERSGFEGPWT ------------1111--------------------------------1111-------- SNPLIFDNSYFTELLSGEKEGLLQLPSDKALLSDPVFRPLVDKYAADEDAFFADYAEAHQ ------------------2222------3333-3333----------------------- KLSELGFAD ---2222-- >ZINC FINGERS AND HOMEOBOX; SWP:Q9UKY1; PDB:2GHFA; MAHHHHHHNQQNKKVEGGYECKYCTFQTPDLNMFTFHVDSEHPNVVLNSSYVCVECNFLT -------2222------------------------------------------1111--- KRYDALSEHNLKYHPGEENFKLTMVKRNNQTIFEQTINDLTF -3333------------------------------------- >TRANSPORT PROTEIN; 
SWP:Q7RBT4; PDB:2GHIA; LESFSLTSHEKKFGVNIEFSDVNFSYPKQTNHRTLKSINFFIPSGTTCALVGHTGSGKST -------------------------1111-------------2222------22223333 IAKLLYRFYDAEGDIKIGGKNVNKYNRNSIRSIIGIVPQDTILFNETIKYNILYGKLDAT ----------------iiii1111-----3333------------------33333333- DEEVIKATKSAQLYDFIEALPKKWDTIVGNKGMKLSGGERQRIAIARCLLKDPKIVIFDE -------------------1111------------------------------------- ATSSLDSKTEYLFQKAVEDLRKNRTLIIIAHRLSTISSAESIILLNKGKIVEKGTHKDLL -----------------------------------1111------iiii----------- KLNGEYAEMWNMQSG --------------- >50S RIBOSOMAL PROTEIN L20; SWP:O67086; PDB:2GHJA; LAKGYRGQRSRSYRRAKEAVRALYYQYRDRKLRKREFRRLWIARINAAVRAYGLNYSTFI ---------3333------------------------------------1111------- NGLKKAGIELDRKILADAVRDPQAFEQVVNKVKEALQVQ -3333-----3333------------------------- >DNA-directed RNA polymera; SWP:Q9KWU6; PDB:2GHOD; KEVRKVRIALASPEKIRSWSYGEVEKPETINYRTLKPERDGLFDERIFGPIKDYECACGK -------------------------------------------3333------------- YKRQRFEGKVCERCGVEVTRSIVRRYRMGHIELATPAAHIWFVKDVPSKIGTLLDLSATE --------------------3333------------------------------------ LEQVLYFNKYIVLDPKGAVLDGVPVEKRQLLTDEEYGGIDARMGAEAIQELLKELDLEKL -1111--------------------------------------3333---------3333 ERELLEEMKHPSRARRAKARKRLEVVRAFLDSGNRPEWMILEAVPVLPPDLRPMVQVDGG -1111---------------3333-3333-------3333-------------------- RFATSDLNDLYRRLINRNNRLKKLLAQGAPEIIIRNEKRMLQEAVDAVIDNGRRGSPVTN ----------------------3333---------------------------------- PGSERPLRSLTDILSGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKRMALELF -----------3333iiii----------------------------------3333--- KPFLLKKMEEKAFAPNVKAARRMLERQRDIKDEVWDALEEVIHGKVVLLNRAPTLHRLGI ----------------33333333-------------------------------3333- QAFQPVLVEGQSIQLHPLVCEAFNADFDGDQMAVHVPLSSFAQAEARIQMLSAHNLLSPA ---------------------------------------------1111-3333---333 SGEPLAKPSRDIILGLYYITQVRKEKKGAGMAFATPEEALAAYERGEVALNAPIVVAGRE 3----------------------------------------------------------- 
TSVGRLKFVFANPDEALLAVAHGLLDLQDVVTVRYLGRRLETSPGRILFARIVGEAVGDE -3333-------------3333--------------------------------3333-- KVAQELIQMDVPQEKNSLKDLVYQAFLRLGMEKTARLLDALKYYGFTLSTTSGITIGIDD -------------------------------------------------1111----333 AVIPEEKQRYLEEADRKLRQIEQAYEMGFLTDRERYDQVIQLWTETTEKVTQAVFKNFEE 3--1111----------1111-----------3333------------------------ NYPFNPLYVMAQSGARGNPQQIRQLCGMRGLMQKPSGETFEVPVRSSFREGLTVLEYFIS ----------------------------------------------3333---------- SHGARKGGADTALRTADSGYLTRKLVDVAHEIVVREADCGTTNYISVPLFQMDEVTRTLR 3333-------1111--------------------------------------------- LRKRSDIESGLYGRVLAREVEALGRRLEEGRYLSLEDVHFLIKAAEAGEVREVPVRSPLT ---------------------iiii-------------------1111------------ CQTRYGVCQKCYGYDLSMARPVSIGEAVGVVAAESIGEPGTQLTMRTTQGLPRVIELFEA ----------------------2222----------1111---------3333------- RRPKAKAVISEIDGVVRIEEGEDRLSVFVESEGFSKEYKLPKDARLLVKDGDYVEAGQPL ------------------------------------------------------------ TRGAIDPHQLLEAKGPEAVERYLVDEIQKVYRAQGVKLHDKHIEIVVRQMLKYVEVTDPG ------------------------------------------------------------ DSRLLEGQVLEKWDVEALNERLIAEGKVPVAWKPLLMGVTKSALSTKSWLSAASFQNTTH -------------------------------------------------3333------- VLTEAAIAGKKDELIGLKENVILGRLIPAGTGSDFVRFTQVVDQRTLKAIEEARKE --------------------1111-------------------------------- >U4/U6 SNRNA-ASSOCIATED SP; SWP:P49960; PDB:2GHPA; TTVLVKNLPKSYNQNKVYKYFKHCGPIIHVDVADSLKKNFRFARIEFARYDGALAAITKT --------1111--------1111----------1111----------3333------22 HKVVGQNEIIVSHLTECTLWTNFPPSYTQRNIRDLLQDINVVALSIRLPRRFAYIDVTSK 22---------------------1111---------1111-------------------- EDARYCVEKLNGLKIEGYTLVTKVSNPLTLEGREIIRNLSTELLDENLLRESFEGFGSIE ---------2222-iiii-----------2222------3333---------3333---- KINIPAGQKFNNCCAFVFENKDSAERALQNRSLLGNREISVSLADKKP -------------------3333--3333------------------- >HOMOSERINE O-SUCCINYLTRAN; SWP:Q72X44; PDB:2GHRA; EENIFVTKERAETQDIRALKIAILNLPTKQETEAQLLRLIGNTPLQLDVHLLHESSFYKT 
------------1111---------------------1111------------------3 FRDIENEKFDGLIITGAPVETLSFEEVDYWEELKRIEYSKTNVTSTLHICWGAQAGLYHH 3331111-----------111133331111------3333-------------------- YGVQKYPLKEKFGVFEHEVREQHVKLLQGFDELFFAPHSRHTEVRESDIREVKELTLLAN -----------------------3333------------------------1111----- SEEAGVHLVIGQEGRQVFALGHSEYSCDTLKQEYERDRDKGLNIDVPKNYFKHDNPNEKP -----------iiii------11111111---------------------22221111-- LVRWRSHGNLLFSNWLNYYVYQET ------------------------ >AGR_C_1268P; SWP:Q7D0W3; PDB:2GHSA; LATVFPFAGRVLDETPLLGEGPTFDPASGTAWWFNILERELHELHLASGRKTVHALPFGS -------------------------1111------1111------1111----------- ALAKISDSKQLIASDDGLFLRDTATGVLTLHAELESDLPGNRSNDGRHPSGALWIGTGRK -----1111----1111-----------------1111---------3333------111 AETGAGSIYHVAKGKVTKLFADISIPNSICFSPDGTTGYFVDTKVNRLRVPLDARTGLPT 12222------iiii----------------1111------3333--------------- GKAEVFIDSTGIKGGDGSVCDAEGHIWNARWGEGAVDRYDTDGNHIARYEVPGKQTTCPA --------2222--------1111-----2222------1111----------------- FIGPDASRLLVTSAREHLDDDAITANPQHGLTFELGIEVKGRFEPLYRL --1111--------2222-------1111-------------------- >DNA-DIRECTED RNA POLYMERA; SWP:Q9GZU7; PDB:2GHTA; QYLLPEAKAQDSDKICVVINLDETLVHSSFKPVNNADFIIPVEIDGVVHQVYVLKRPHVD -------3333---------2222-------------------iiii--------2222- EFLQRMGELFECVLFTASLAKYADPVADLLDKWGAFRARLFRESCVFHRGNYVKDLSRLG ------------------3333--------1111------3333---iiii---3333-- RDLRRVLILDNSPASYVFHPDNAVPVASWFDNMSDTELHDLLPFFEQLSRVDDVYSVLRQ -1111------33331111----------------3333--------1111-3333---- >SPIKE GLYCOPROTEIN; SWP:NA; PDB:2GHVC; TNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCF ----3333--------3333-----------3333---------------3333--1111 SNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYK ----------333311112222-3333------1111---------1111-1111----- YRYLRHGKLRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVL ---------2222--------1111----------------------------------- SFE --- >SPIKE GLYCOPROTEIN; SWP:NA; PDB:2GHWB; 
EVQLVQSGGGVVQPGKSLRLSCAASGFAFSSYAMHWVRQAPGKGLEWVAVISYDGSNKYY ------------2222-----------3333--------2222--------1111----- ADSVKGRFTISRDNSKNTLYLQMNSLRAEDTAVYYCARDRSYYLDYWGQGTLVTVSSGGS 3333----------------------3333------------------------------ ETTLTQSPATLSLSPGERATLSCRASQSVRSNLAWYQQKPGQAPRPLIYDASTRATGIPD -------------2222-----------!!!!------2222------------222233 RFSGSGSGTDFTLTISRLEPEDFAVYYCQQRSNWPPTFGQGTKVEVKSGLVP 33----------------1111------------------------------ >GLUTAMYL-TRNA(GLN) AMIDOT; SWP:Q9X0Z9; PDB:2GI3A; GIDLDFRKLTIEECLKLSEEEREKLPQLSLETIKRLDPHVKAFISVRENVSVEKKGKFWG ----3333-333311113333----------------------------------1111- IPVAIKDNILTLGRTTCASRILENYESVFDATVVKKKEAGFVVVGKANLDEFAGSSTERS ------------------3333----------33333333---------2222---1111 AFFPTRNPWDLERVPGGSSGGSAAAVSAGVVAALGSDTGGSVRQPASLCGVVGYKPTYGL ------1111---------------3333---------------------------2222 VSRYGLVAFASSLDQIGPITKTVRDAAILEIISGRDENDATTVNRKVDFLSEIEEGVSGK --2222---1111----------------------1111----------1111------- FAVPEEIYEHDIEEGVSERFEEALKLLERLGAKVERVKIPHIKYSVATYYVIAPAEASSN ---3333-------------------------------1111------------------ TRNVGFGEEVRRRIIGTFTLSAAYYEAYFNKAKVRRKISDELNEVLSQYDAILTPTSPVT ----------------------------3333------------3333------------ AFKIGEIKDPLTYYLDIFTIPANLAGLPAISVPFGFSNNLPVGVQVIGRRFADGKVFRIA --2222---------1111-----------------%%%%--------2222-------- RAIEKNSPYNENGFPLPEVKA ------1111----------- >POSSIBLE PHOSPHOTYROSINE ; SWP:Q0P8Z8; PDB:2GI4A; MKKILFICLGNICRSPMAEFIMKDLVKKANLEKEFFINSAGTSGEHDGEGMHYGTKNKLA ------------------------------1111-----------------3333---33 QLNIEHKNFTSKKLTQKLCDESDFLITMDNSNFKNVLKNFTNTQNKVLKITDFSPSLNYD 33------------3333------------------------3333--3333-------- EVPDPWYSGNFDETYKILSLACKNLLVFLSKHHHHH ---3333--3333----------------------- >IMMUNOGLOBULIN B1 BINDING; SWP:Q56193; PDB:2GI9A; MQYKLILNGKTLKGETTTEAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTE ----------------------------------1111-------3333------- >MITOCHONDRIAL RNA-BINDING; SWP:Q952G2; 
PDB:2GIAA; WRRPSLAQQRARRAQLPPAFDVVHWNDEDISRGHLLRVLHRDTFVVLDYHRQARMLTEEG -------------------------33331111-------iiii----------3333-- NKAERVVSVMLPAVYTARFLAVLEGRSEKVEVHSRYTNATFTPNPAAPYTFTLKCTSTRP ---------------------------------1111------1111------------- DETFEWTVEFDVAESLMLQRFLTQALHYNTGFAR ------------------------------1111 >GBP21; SWP:P90629; PDB:2GIAB; SLPKFEIHDVRDDPAEGTMTRVAVDGKLLLISQYPQLGPRKVDPNDLSPQFDADRRISVR ------------3333--------!!!!--------------1111-----1111----- LRHVDLAYLVGVCKERVPRHRMETKAYTLDFEKSAQGYHLHGKVHRVASQRMEDWSVKFD -3333------------------1111------1111----------------------! NHFAVTLEHFLESALDESFGFRQHYA !!!-------------1111------ >NUCLEOCAPSID PROTEIN; SWP:P59595; PDB:2GIBA; NVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSG 3333-------1111----3333--!!!!11113333-------------------3333 TWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFPP -----------3333----------------1111-- >NUCLEOCAPSID PROTEIN; SWP:P03521; PDB:2GICA; SVTVKRIIDNTVIVPKLPANEDPVEYPADYFRKSKEIPLYINTTKSLSDLRGYVYQGLKS -------------------------33331111------------3333----------- GNVSIIHVNSYLYGALKDIRGKLDKDWSSFGINIGKAGDTIGIFDLVSLKALDGVLPDGV ---3333----------------------------------------------------- SDASRTSADDKWLPLYLLGLYRVGRTQMPEYRKKLMDGLTNQCKMINEQFEPLVPEGRDI --------3333---------1111---3333-----3333------------------- FDVWGNDSNYTKIVAAVDMFFHMFKKHECASFRYGTIVSRFKDCAALATFGHLCKITGMS --3333-------------33331111--------1111--------------------1 TEDVTTWILNREVADEMVQMMLPGQEIDKADSYMPYLIDFGLSSKSPYSSVKNPAFHFWG 111-1111-----------------1111---3333-1111--------1111------- QLTALLLRSTRARNARQPDDIEYTSLTTAGLLYAYAVGSSADLAQQFCVGDNKYTPDDST ----1111-3333----------------------------------------------- GGLTTNAPPQGRDVVEWLGWFEDQNRKPTPDMMQYAKRAVMSLQGLREKTIGKYAKSEFD ------------------------------------------------------------ K - >PLASTOCYANIN; SWP:P0C178; PDB:2GIMA; METYTVKLGSDKGLLVFEPAKLTIKPGDTVEFLNNKVPPHNVVFDAALNPAKSADLAKSL ---------1111-----------2222--------------------1111-----111 
SHKQLLMSPGQSTSTTFPADAPAGEYTFYCEPHRGAGMVGKITVAG 1------2222------1111---------1111------------ >PROBABLE HISTONE ACETYLTR; SWP:Q9H7Z6; PDB:2GIVA; KYVDKIHIGNYEIDAWYFSPFPEDYGKQPKLWLCEYCLKYMKYEKSYRFHLGQCQWRQPP ---------------------3333----------------------------------- GKEIYRKSNISVHEVDGKDHKIYCQNLCLLAKLFLDHTLYFDVEPFVFYILTEVDRQGAH ------!!!!-----3333-------------------11111111--------3333-- IVGYFSKEKESPDGNNVACILTLPPYQRRGYGKFLIAFSYELSKLESTVGSPEKPLSDLG ----------1111--------3333-----------------1111------------- KLSYRSYWSWVLLENLRSIKDLSQMTSITQNDIISTLQSLNMVKYWKGQHVICVTPKLVE -------------------------------------1111----iiii-----3333-- EHLPPITVDSVCLKWAP --------3333----- >INWARD RECTIFIER POTASSIU; SWP:P35561; PDB:2GIXA; SHGRSRFVKKDGHCNVQFINVGEKRNETLVFSHNAVIAMRDGKLCLMWRVGNLQKSHLVE --------1111---------------------------iiii----------------- AHVRAQLLKSRITSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSK ------------1111------------3333-----------------1111-111133 QDIDNADFEIVVILEGMVEATAMTKQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRF 331111-------------------------1111-------------1111---3333- HKTYEVPNTPLCSARDLAEKKYILSN -----3333----------------- >GLYCOPROTEIN E; SWP:P04488; PDB:2GIYA; HVRGVTVRMETPEAILFSPGETFSTNVSIHAIAHDDQTYSMDVVWLRFDVPTSCAEMRIY -----------------2222-----------------------------1111-----3 ESCLYHPQLPECLSPADAPCAASTWTSRLAVRSYAGCSRTNPPPRCSAEAHMEPVPGLAW 333--1111-------!!!!-----------------1111-3333--------2222-- QAASVNLEFRDASPQHSGLYLCVVYVNDHIHAWGHITISTAAQYRNAVVEQPLDIEGRG ------------3333---------%%%%----------3333----------1111-- >CYCLOVIOLACIN O14; SWP:NA; PDB:2GJ0A; GSIPACGESCFKGKCYTPGCSCSKYPLCAKN --3333----------2222----------- >WSV230; SWP:Q91LD0; PDB:2GJ2A; ATFQTDADFLLVGDDTSRYEEVMKTFDTVEAVRKSDLDDRVYMVCLKQGSTFVLNGGIEE ---------------1111------1111--------1111-----2222---------- LRLLTGDSTLEIQPMIVPT --11111111--------- >NITROGEN FIXATION REGULAT; SWP:Q4IU07; PDB:2GJ3A; LLPEIFRQTVEHAPIAISITDLKANILYANRAFRTITGYGSEEVLGKNESILSNGTTPRL 
-3333------------------------------------3333--3333--3333333 VYQALWGRLAQKKPWSGVLVNRRKDKTLYLAELTVAPVLNEAGETIYYLGMHRDTSELH 3------3333-----------1111-------------1111---------------- >GLYCOGEN PHOSPHORYLASE, M; SWP:P00489; PDB:2GJ4A; QISVRGLAGVENVTELKKNFNRHLHFTLVKDRNVATPRDYYFALAHTVRDHLVGRWIRTQ -3333------------------------------------------3333--------- QHYYEKDPKRIYYLSLEFYMGRTLQNTMVNLALENACDEATYQLGLDMEELEEIEEDAGL ----------------------------1111---------1111-3333-3333----- GNGGLGRLAACFLDSMATLGLAAYGYGIRYEFGIFNQKICGGWQMEEADDWLRYGNPWEK ----------------1111-------------------iiii------1111--1111- ARPEFTLPVHFYGRVEHTSQGAKWVDTQVVLAMPYDTPVPGYRNNVVNTMRLWSAKAPNG -3333------------1111--------------------------------------- YIQAVLDRNLAENISRVLYPNDNFFEGKELRLKQEYFVVAATLQDIIRRFKSNFDAFPDK -----------------------------------------------------1111--- VAIQLNDTHPSLAIPELMRVLVDLERLDWDKAWEVTVKTCAYTNHTVLPEALERWPVHLL -------1111------------------------------------1111--------- ETLLPRHLQIIYEINQRFLNRVAAAFPGDVDRLRRMSLVEEGAVKRINMAHLCIAGSHAV -------------------------2222------------------------------- NGVARIHSEILKKTIFKDFYELEPHKFQNKTNGITPRRWLVLCNPGLAEIIAERIGEEYI ----------------------3333----------1111----------------3333 SDLDQLRKLLSYVDDEAFIRDVAKVKQENKLKFAAYLEREYKVHINPNSLFDVQVKRIHE ---------1111---------------------------------------------33 YKRQLLNCLHVITLYNRIKKEPNKFVVPRTVMIGGKAAPGYHMAKMIIKLITAIGDVVNH 33------------------1111-------------1111---------------1111 DPVVGDRLRVIFLENYRVSLAEKVIPAADLSEQISTAGTEASGTGNMKFMLNGALTIGTM 3333-------------------3333--------2222----------1111------- DGANVEMAEEAGEENFFIFGMRVEDVDRLDQRGYNAQEYYDRIPELRQIIEQLSSGFFSP !!!!-------3333-------------------3333-----------------1111- KQPDLFKDIVNMLMHHDRFKVFADYEEYVKCQERVSALYKNPREWTRMVIRNIATSGKFS -1111-----------11113333---------------------------33333333- SDRTIAQYAREIWGVEPSRQRLP ----------------------- >Putative uncharacterized ; SWP:Q2YDB4; PDB:2GJ6D; KEVEQNSGPLSVPEGAIASLNCTYSDRGSQSFFWYRQYSGKSPELIMSIYSNGDKEDGRF 
------------2222---------1111--------2222------------------- TAQLNKASQYVSLLIRDSQPSDSATYLCAVTTDSWGKLQFGAGTQVVVTPDIQNPDPAVY -----1111---------1111----------1111------------------------ QLRDSKSSDKSVCLFTDFDSQTNVSQSKDSDVYITDKTVLDMRSMDFKSNSAVAWSNKSD ------------------1111-------------------------------------- FACANAFNNSIIPEDTFFPS -------------------- >Putative uncharacterized ; SWP:Q2YDB4; PDB:2GJ6E; GVTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPN -----------------------------------1111---------2222-------- GYNVSRSTTEDFPLRLLSAAPSQTSVYFCASRPGLAGGRPEQYFGPGTRLTVTEDLKNVF -------3333--------1111--------------------------------1111- PPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVSTDPQPLKEQP ---------3333--------------------------iiii--1111----------- ALNDSRYALSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWG -1111----------------1111----------------------------------- RAD --- >TRNA MODIFICATION GTPASE ; SWP:P25522; PDB:2GJ8A; GKVVIAGRPNAGKSSLLNALAGREAAIVTDIAGTTRDVLREHIHIDGPLHIIDTAGLREA ------------------------------------------------------------ SDEVERIGIERAWQEIEQADRVLFVDGTTTDAVDPAEIWPEFIARLPAKLPITVVRNKAD ---------------1111------3333----3333-----33331111-------333 ITGETLGSEVNGHALIRLSARTGEGVDVLRNHLKQS 3--------iiii------1111------------- >THIAZOLE BIOSYNTHETIC ENZ; SWP:P32318; PDB:2GJCA; HLNSTPVTHCLSDIVKKEDWSDFKFAPIRESTVSRAMTSRYFKDLDKFAVSDVIIVGAGS 1111----1111----1111--------3333---------------------------- SGLSAAYVIAKNRPDLKVCIIESSVAPGGGSWLGGQLFSAMVMRKPAHLFLQELEIPYED ------------1111-----------!!!!---iiii---------------------- EGDYVVVKHAALFISTVLSKVLQLPNVKLFNATCVEDLVTRPPTTVAGVVTNWTLVTQAQ --------3333-----------1111--------------------------------- CCMDPNVIELAGYKNDGTRDLSQKHGVILSTTGHDFGAFCAKRIVDIDQNQKLGGMKGLD ------------------------------------------------------------ MNHAEHDVVIHSGAYAGVDNMYFAGMEVAELDGLNRMGPTFGAMALSGVHAAEQILKHFA 3333--------------------3333-------------------------------- A - >UBIQUITIN-CONJUGATING ENZ; SWP:P50623; PDB:2GJDA; 
SLCLQRLQEERKKWRKDHPFGFYAKPVKKADGSMDLQKWEAGIPGKEGTNWAGGVYPITV ------------------2222------1111--1111-------2222-2222------ EYPNEYPSKPPKVKFPAGFYHPNVYPSGTICLSILNEDQDWRPAITLKQIVLGVQDLLDS --1111---------2222-11111111---33331111--1111---------3333-- PNPNSPAQEPAWRSFSRNKAEYDKKVLLQAKQYSK -1111-------------------------1111- >HYPOTHETICAL PROTEIN PP43; SWP:Q88EQ6; PDB:2GJGA; FNESDAPQPPKVLSTPLEIAANLRQLQESHDPLIITFHDRSHRFQSYVVHVDRESNTLAL --1111--------3333----------------------------------1111---- DEIPRDGEKFIENGEHFRVEGFHDGVRIAWECDHALKISEVDGHRCYSGPLPQEVTYHQR ----------1111--------iiii--------------iiii---------------- RNAFRAALKLSQLVDIILDGAHLKGNGARGKLLDISATGCKLRFEGNVEDRLQLGQVYER -------------------1111------------1111--------3333-2222---- FKAGNPLGLVDTVELRHLHYEERINTTFAGVRFHNLSGQAQRKIESFVYQLQREARRFDK ---------------------1111-----------3333-------------------- DDY --- >A21 SINGLE-CHAIN ANTIBODY; SWP:NA; PDB:2GJJA; ADIVLTQTPSSLPVSVGEKVTMTCKSSQTLLYSNNQKNYLAWYQQKPGQSPKLLISWAFT --------------2222---------------------------2222----------- RKSGVPDRFTGSGSGTDFTLTIGSVKAEDLAVYYCQQYSNYPWTFGGGTRLEIKRGEVQL -22221111----------------3333------------------------------- QQSGPEVVKTGASVKISCKASGYSFTGYFINWVKKNSGKSPEWIGHISSSYATSTYNQKF --------2222-----------3333-----------------------------3333 KNKAAFTVDTSSSTAFMQLNSLTSEDSADYYCVRSGNYEEYAMDYWGQGTSVTVS ---------1111---------3333-----------1111-------------- >HYPOTHETICAL PROTEIN PA10; SWP:Q9I4V0; PDB:2GJLA; VFRTRFTETFGVEHPIMQGGMQWVGRAEMAAAVANAGGLATLSALTQPSPEALAAEIARC -------1111--------------3333-------------1111-------------1 RELTDRPFGVNLTLLPTQKPVPYAEYRAAIIEAGIRVVETAGNDPGEHIAEFRRHGVKVI 111---------------------------1111------------------1111---- HKCTAVRHALKAERLGVDAVSIDGFECAGHPGEDDIPGLVLLPAAANRLRVPIIASGGFA ------------1111-------3333---------3333-------------------- DGRGLVAALALGADAINMGTRFLATRECPIHPAVKAAIRAADERSTDLIMRSLRNTARVA ------------------3333--3333-------------1111----3333------- RNAISQEVLAIEARGGAGYADIAALVSGQRGRQVYQQGDTDLGIWSAGMVQGLIDDEPAC 
-----------------3333-3333-3333-------1111------3333------33 AELLRDIVEQARQLVRQRLEGMLA 33-----------------3333- >ALPHA-AMYLASE; SWP:NA; PDB:2GJPA; TNGTMMQYFEWHLPNDGQHWNRLRDDASNLRNRGITAIWIPPAWKGTSQNDVGYGAYDLY ------------------------------1111-------------1111-3333-111 DLGEFNQKGTVRTKYGTRSQLESAIHALKNNGVQVYGDVVMNHKGGADATENVLAVEVNP 1-----iiii--1111------------1111--------------------------11 NNRNQEISGDYTIEAWTKFDFPGRGNTYSDFKWRWYHFDGVDWDQSRQFQNRIYKFRGDG 11------------------3333---------1111------1111----------222 KAWDWEVDSENGNYDYLMYADVDMDHPEVVNELRRWGEWYTNTLNLDGFRIDAVKHIKYS 2-------2222----------1111---------------------------1111333 FTRDWLTHVRNATGKEMFAVAEFWKNDLGALENYLNKTNWNHSVFDVPLHYNLYNASNSG 3---------------------------------------------------------ii GNYDMAKLLNGTVVQKHPMHAVTFVDNHDSQPGESLESFVQEWFKPLAYALILTREQGYP ii-3333-222233333333------33332222------3333---------------- SVFYGDYYGIPTHSVPAMKAKIDPILEARQNFAYGTQHDYFDHHNIIGWTREGNTTHPNS --3333---3333----3333--------------------------------1111--- GLATIMSDGPGGEKWMYVGQNKAGQVWHDITGNKPGTVTINADGWANFSVNGGSVSIWVK ------------------3333------1111--------1111---------------- R - >RECEPTOR-TYPE TYROSINE-PR; SWP:Q16827; PDB:2GJTA; SMNPVQLDDFDAYIKDMAKDSDYKFSLQFEELKLIGLDIPHFAADLPLNRCKNRYTNILP -----1111----------%%%%--------11111111-3333-33331111-1111-- YDFSRVRLVSMNEEEGADYINANYIPGYNSPQEYIATQGPLPETRNDFWKMVLQQKSQII 3333---------2222---------1111----------3333--------1111---- VMLTQCNEKRRVKCDHYWPFTEEPIAYGDITVEMISEEEQDDWACRHFRINYADEMQDVM -------%%%%---------------!!!!---------1111--------!!!!----- HFNYTAWPDHGVPTANAAESILQFVHMVRQQATKSKGPMIIHCSAGVGRTGTFIALDRLL ------------------------------3333-------------------------- QHIRDHEFVDILGLVSEMRSYRMSMVQTEEQYIFIHQCVQLMWMKKKQQFCISDV -----------------3333---------------------------1111--- >252AA LONG HYPOTHETICAL P; SWP:O58732; PDB:2GJUA; MVYVAVLANIAGNFPALTAALEKIEELKEEGYEIEKYYILGNIVGLFPYPREVIEAIKNL --------------------------1111-----------------------------3 
AKTSNVKVIRGKYDQLIAMSDPHAGDPKYIDKLEIPDHLKATLKYTWEKLGHEGREYLRD 333-------------3333------3333---------------------------111 LPIYLVDKIGKNEIFGVYGSPVNPFDGEILPDQPTSYYEAIMRPVKEYEMLLVASPRYPL 1-------!!!!-------3333----------3333----3333--------3333--- DAMTMYGRVVCPGSIGFPPAREHKATFALVDAETLKVKFIEVEYDKKIIEDRIKLEKLPE ---1111----------------------------------------------------- EVIKILYHGGKA ------------ >PUTATIVE CYTOPLASMIC PROT; SWP:NA; PDB:2GJVA; AMETLSVIHTVANRLRELNPDMDIHISSTDAKVYIPTGQQVTVLIHYCGSVFAEPENTDA --3333------------1111-------3333--------------------------- TVQKQLIRISATVIVPQISDAINALDRLRRSLGGIELPDCDRPLWLESEKYIGDAANFCR -------------------------------2222-2222-------------2222--- YALDMTASTLFIAEQ --------------- >BETA-HEXOSAMINIDASE ALPHA; SWP:P06865; PDB:2GJXA; LWPWPQNFQTSDQRYVLYPNNFQFQYDVSSAAQPGCSVLDEAFQRYRDLLFGTLEKNVLV -----------------3333-----3333------------------------------ VSVVTPGCNQLPTLESVENYTLTINDDQCLLLSETVWGALRGLETFSQLVWKSAEGTFFI ------------1111--------3333------3333--------------3333---- NKTEIEDFPRFPHRGLLLDTSRHYLPLSSILDTLDVMAYNKLNVFHWHLVDDPSFPYESF -------------------------3333---------------------3333------ TFPELMRKGSYNPVTHIYTAQDVKEVIEYARLRGIRVLAEFDTPGHTLSWGPGIPGLLTP ------------------------------1111------------1111---2222--- CYSGSEPSGTFGPVNPSLNNTYEFMSTFFLEVSSVFPDFYLHLGGDEVDFTCWKSNPEIQ -----------------3333--------------------------------------- DFMRKKGFGEDFKQLESFYIQTLLDIVSSYGKGYVVWQEVFDNKVKIQPDTIIQVWREDI 3333-----------------------1111---------1111---1111--------- PVNYMKELELVTKAGFRALLSAPWYLNRISYGPDWKDFYVVEPLAFEGTPEQKALVIGGE -----------3333-----11113333-------------1111---33331111---- ACMWGEYVDNTNLVPRLWPRAGAVAERLWSNKLTSDLTFAYERLSHFRCELLRRGVQAQP --------3333-3333-3333--------3333-----------------1111----- LNVGFCEQEFEQ ------------ >CATALYTIC ELIMINATION ANT; SWP:NA; PDB:2GK0A; ELVMTQTPLSLPVSLGDQASISCRSSQSIVHSNGNTYLEWYLQKPGQSPKLLIYKVSNRF ------------------------------------------------------------ SGVPDRFSGSGSGTDFTLKINRVEAEDLGVYYCFQGSHLPPTFGGGTKLEIKRADAAPTV 
---3333----------------3333--------------------------------- SIFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSM -------3333-----------------------------2222---------------- SSTLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNRN --------3333------------------------- >CATALYTIC ELIMINATION ANT; SWP:NA; PDB:2GK0B; QVQLKESGPGLVAPSQSLSITCTVSGFSLTNYGVDWVRQPPGKGLEWVGVIWSGGSTNYN ------------1111-------------------------------------------1 SALMSRLSISKDNSKSQVFLKMNSLQTDDTAVYYCAKHWGYGMDHWGQGTTVTVSSAKTT 111--------3333--------------------------------------------- PPSVYPLAPGCGDTTGSSVTLGCLVKGYFPEPVTVTWNSGSLSSSVHTFPALLQSGLYTM -------------------------------------iiii------------------- SSSVTVPSSTWPSQTVTCSVAHPASSTTVDKKLEPR ---------------------3333----------- >Carcinoembryonic antigen-; SWP:Q5UB49; PDB:2GK2A; AQLTTESMPFNVAEGKEVLLLVHNLPQQLFGYSWYKGERVDGNRQIVGYAIGTQQATPGP ------------2222------------------------3333--------------11 ANSGRETIYPNASLLIQNVTQNDTGFYTLQVIKSDLVNEEATGQFHVY 11------1111-------1111---------1111------------ >PUTATIVE CYTOPLASMIC PROT; SWP:Q8ZLF9; PDB:2GK3A; LKVLFIGESWHIHIHSKGYDSFTSSKYEEGATWLLECLRKGGVDIDYPAHTVQIAFPESI --------------------------------------1111-----3333--------- DELNRYDVIVISDIGSNTFLLQNETFYQLKIKPNALESIKEYVKNGGGLLIGGYLSFGIE --------------3333----------------------------------1111-222 AKANYKNTVLAEVLPVILDGDDRVEKPEGICAEAVSPEHPVVNGFSDYPVFLGYNQAVAR 2--33331111--------------3333------1111--2222--------------1 DDADVVLTINNDPLLVFGEYQQGKTACFSDCSPHWGTQQFSWPFYTDLWVNTLQFIARK 111-----%%%%------------------------3333-2222-------------- >CONSERVED HYPOTHETICAL PR; SWP:Q97QI1; PDB:2GK4A; AKILVTSGGTSEAIDSVRSITNHSTGHLGKIITETLLSAGYEVCLITTKRALKPEPHPNL ------------------------------------1111-------1111-----1111 SIREITNTKDLLIEQERVQDYQVLIHSAVSDYTPVYTGLEEVQASSNLKEFLSKQNHQAK ----------------3333---------------------1111----1111-3333-- ISSTDEVQVLFLKKTPKIISLVKEWNPTIHLIGFKLLVDVTEDHLVDIARKSLIKNQADL -3333------------3333----1111------------------------------- 
IIANDLTQISADQHRAIFVEKNQLQTVQTKEEIAELLLEKIQAYH ----1111--------------------------------1111- >REGULATOR OF NONSENSE TRA; SWP:Q92900; PDB:2GK6A; GSRYEDAYQYQNIFGPLVKLEADYDKKLKESQTQDNITVRWDLGLNKKRIAYFTLPLMQG ----------------------------1111-----------1111------------- DEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIAIELRSSVGAPVEVTHNFQVDFVWKSTS -------------------------3333-------------3333-------------- FDRMQSALKTFAVDETSVSGYIYHKLLGHEVEDVIIKCQLPKRFTAQGLPDLNHSQVYAV -------------1111-------1111-----------------2222----------- KTVLQRPLSLIQGPPGTGKTVTSATIVYHLARQGNGPVLVCAPSNIAVDQLTEKIHQTGL -------------2222--------------------------------------1111- KVVRLCAKSREAIDSPVSFLALHNQIRNMDSMPELQKLQQLKDLSSADEKRYRALKRTAE ------3333------3333--------33333333--------3333------------ RELLMNADVICCTCVGAGDPRLAKMQFRSILIDESTQATEPECMVPVVLGAKQLILVGDH ------------3333--3333-----------1111-3333---1111---------11 CQLGPVVMCKKAAKAGLSQSLFERLVVLGIRPIRLQVQYRMHPALSAFPSNIFYEGSLQN 11------33331111---------1111------------3333--------%%%%--- GVTAADRVKKGFDFQWPQPDKPMFFYVTQGQEEIASSGTSYLNRTEAANVEKITTKLLKA --3333------------------------------------------------------ GAKPDQIGIITPYEGQRSYLVQYMQFSGSLHTKLYQEVEIASVDAFQGREKDFIILSCVR --1111---------------------------3333----33332222----------- IGFLNDPRRLNVALTRARYGVIIVGNPKALSKQPLWNHLLNYYKEQKVLVEGPLNNLRES -----3333--1111----------3333-----------------------1111---- LM -- >PI(4)P 5-kinase type II g; SWP:Q8TBX8; PDB:2GK9A; DPLVGVFLWGVAHSINELSQVPPPVMLLPDDFKASSKIKVNNHLFHRENLPSHFKFKEYC --3333-----------1111------3333----------------------------- PQVFRNLRDRFGIDDQDYLVSLTRNPPSESEFLISYDRTLVIKEVSSEDIADMHSNLSNY -------3333-------3333------------1111-------------3333----- HQYIVKCHGNTLLPQFLGMYRVSVDNEDSYMLVMRNMFSHRLPVHRKYDLKGTLRDMDNK ------%%%%-------------%%%%---------------------------3333-- NQKVYIKKIFLEKLKRDVEFLVQLKIMDYSLLLGIHDIEVYFMGLIDILHPEQYAKRFLD --------3333---------1111------------------------33333333--- FIT --- >DIAMINOPIMELATE EPIMERASE; SWP:P44859; PDB:2GKEA; 
MQFSKMHGLGNDFVVVDGVTQNVFFTPETIRRLANRHCGIGFDQLLIVEAPYDPELDFHY -------iiii-----------------------------------------3333---- RIFNADGSEVSQCGNGARCFARFVTLKGLTNKKDISVSTQKGNMVLTVKDDNQIRVNMGE ---1111-----------------1111--------------------1111-------- PIWEPAKIPFTANKFEKNYILRTDIQTVLCGAVSMGNPHCVVQVDDIQTANVEQLGPLLE ---3333---------------1111-------------------3333-3333------ SHERFPERVNAGFMQIINKEHIKLRVYERGAGETQACGSGACAAVAVGIMQGLLNNNVQV -1111------------1111---------------------------1111-------- DLPGGSLMIEWNGVGHPLYMTGEATHIYDGFITL -1111-------2222------------------ >RESPONSE REGULATOR HOMOLO; SWP:O68522; PDB:2GKGA; KKILIVESDTALSATLRSALEGRGFTVDETTDGKGSVEQIRRDRPDLVVLAVDLSAGQNG --------------------1111-------3333-------------------iiii-- YLICGKLKKDDDLKNVPIVIIGNPDGFAQHRKLKAHADEYVAKPVDADQLVERAGALIGF ---------1111---------3333------1111------------------------ PE -- >NUCLEASE; SWP:Q924Q1; PDB:2GKIA; EVQLQQSGPELVKPGASVKMSCKASGYTFTSYVMHWVKQKPGQGLEWIGYINPYNDGTKY ---------------------------1111----------------------------- NEKFKGKATLTSDKSSSTAYMELSSLTSEDSAVYYCARGAYKRGYAMDYWGQGTSVTVSS 3333--------3333----------1111------------iiii-------------- DLVMSQSPSSLAVSAGEKVTMSCKSSQSLFNSRTRKNYLAWYQQKPGQSPKLLIYWASTR ------------------------------------------------------------ ESGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCKQSYYHMYTFGSGTKLEIKHH 22223333----------------3333--------------------------- >HEMOGLOBIN-LIKE PROTEIN H; SWP:P0A592; PDB:2GKMA; GLLSRLRKREPISIYDKIGGHEAIEVVVEDFFVRVLADDQLSAFFSGTNMSRLKGKQVEF -----1111----------------------------11113333--------------- FAAALGGPEPYTGAPMKQVHQGRGITMHHFSLVAGHLADALTAAGVPSETITEILGVIAP --1111-------------2222------------------1111--------------3 LAVDVTS 333---- >HYPOTHETICAL PROTEIN NMB0; SWP:Q9K0T5; PDB:2GKPA; TFNQEQDYWAGYKANERALIIQTWSGFGRYAPDHLYPPHILPLDTDNETLGTTVLQALAN --2222--------1111---------------------------------------111 SRTFVYDSPEDQDFFDTEKIRQRYEDWVAKLCGNLGYKTRRALFKNSVDIWLHNGCLKIS 1---2222--------------------------------------------iiii---- 
PSRHVKLEAWDAIDADDVILSLDNSPEEIGAGLKLALSRCR --------------------3333------------1111- >BIFUNCTIONAL SAT/APS KINA; SWP:O67174; PDB:2GKSA; IKYLKSIQISQRSVLDLELLAVGAFTPLDRFMGEEDYRNVVESMRLKSGTLFPIPITLPM 1111------------------1111-------------------3333----------- EKEIAKDLKEGEWIVLRDPKNVPLAIMRVEEVYKWNLEYEAKNVLGTTDPRHPLVAEMHT 333311112222-----1111---------------------------3333-----111 WGEYYISGELKVIQLPKYYDFPEYRKTPKQVREEIKSLGLDKIVAFQTRNPMHRVHEELT 1-------------------3333------------------------------------ KRAMEKVGGGLLLHPVVGLTKPGDVDVYTRMRIYKVLYEKYYDKKKTILAFLPLAMRMAG --------------------2222------------------1111-------------- PREALWHGIIRRNYGATHFIVGRDHASPGKDSKGKPFYDPYEAQELFKKYEDEIGIKMVP -----------1111--------2222---1111-----------------3333----- FEELVYVPELDQYVEINEIRENFLKQGRKLPEWFTRPEVAEILAETYVPKHKQGFCVWLT ------1111--------3333-1111---3333-3333---------1111-------- GLPCAGKSTIAEILATMLQARGRKVTLLDGDVVRTHLSRGLGFSKEDRITNILRVGFVAS -2222-------------1111---------------2222------------------- EIVKHNGVVICALVSPYRSARNQVRNMMEEGKFIEVFVDAPVEVCEERDVKGLYKKAGFT --1111------------------11112222--------11111111---3333---22 GVDDPYEPPVAPEVRVDTTKLTPEESALKILEFLKKEGFIKD 22--------------1111---------------------- >PUTATIVE DEHYDRATASE PROT; SWP:NA; PDB:2GL5A; SLKITSIEVFDCELKKRDQTMSSYNPVLIRVNTDSGLSGIGEVGLAYGAGAKAGVGIIRD ------------3333-3333-----------1111------------------------ LAPLIVGEDPLNIEKIWEFFFRKTFWGMGGGNVFYAGMSAIDIALWDIKGKYLGVPVYQL 333322223333------------3333--3333---------------------3333- LGGKTNEKLRTYASQLQFGWGDKNHILVTPEEYAEAARAALDDGYDAIKVDPLEIDRNGD ---------------1111---------------------1111-----------1111- DCVFQNRNRNYSGLLLADQLKMGEARIAAMREAMGDDADIIVEIHSLLGTNSAIQFAKAI 3333---------------------------------------%%%%------------3 EKYRIFLYEEPIHPLNSDNMQKVSRSTTIPIATGERSYTRWGYRELLEKQSIAVAQPDLC 333------------------------------1111-3333----1111-------111 LCGGITEGKKICDYANIYDTTVQVHVCGGPVSTVAALHMETAIPNFIIHEHHTNAMKASI 1------------3333----------------------------------1111-3333 
RELCTHDYQPENGYYVAPEQPGLGQELNDEVVKEYLAYVIK 3333------iiii--------------33331111----- >CREATINE KINASE; SWP:Q6P8J7; PDB:2GL6A; RLFPPSADYPDLRKHNNCMAECLTPAIYAKLRNKVTPNGYTLDQCIQTGVDNPTVGMVAG ---3333---------3333----------1111-1111--------3333--------- DEESYEVFADLFDPVIKLRHNGYDPRVMKHTTDLDASKRVRTGRSIRGLSLPPACTRAER 3333---------------iiii3333-------3333-------2222----------- REVENVAITALEGLKGDLAGRYYKLSEMTEQDQQRLIDDHFLFDKPVSPLLTCAGMARDW ----------1111!!!!-----3333---------1111-----------1111-2222 PDARGIWHNYDKTFLIWINEEDHTRVISMNMKRVFERFCRGLKEVERLIQERGWEFMWNE 2222----1111-----------------3333----------------1111------- RLGYILTCPSNLGTGLRAGVHVRIPKLSKDPRFSKILENLRLQKRGTGGVDTAAVADVYD -------3333--------------33331111------------1111-----%%%%-- ISNIDRIGRSEVELVQIVIDG --------------------- >Transcription factor 7-li; SWP:Q9NQB0; PDB:2GL7B; LGANDELISFKDEGEQEERDLADVKSSLVN ------------------3333----1111 >RETINOIC ACID RECEPTOR RX; SWP:P48443; PDB:2GL8A; DMPVERILEAELAVEAADKQLFTLVEWAKRIPHFSDLTLEDQVILLRAGWNELLIASFSH --3333-----1111-----3333------2222-------------------------1 RSVSVQDGILLATGLHVHRSSAHSAGVGSIFDRVLTELVSKMKDMQMDKSELGCLRAIVL 111--------------3333---------------------1111-3333--------- FNPDAKGLSNPSEVETLREKVYATLEAYTKQKYPEQPGRFAKLLLRLPALRSIGLKCLEH -1111----3333-------------------3333------------------------ LFFFKLIGDT ---------- >NEURABIN-1; SWP:O35867; PDB:2GLEA; GHMVHEWSVQQVSHWLVGLSLDQYVSEFSAQNISGEQLLQLDGNKLKALGMTSSQDRALV --1111--1111---3333-3333--3333-------1111-----1111---------- KKKLKEMKMSLEKA -----3333----- >PROBABLE M18-FAMILY AMINO; SWP:Q9WYJ9; PDB:2GLFA; KERKNVWHHRKKEEIEAFSKEYEFSKAKTERTVKEIKRILDESGFVPLEDFAGDPNTVYA ----3333--3333---------------------------------------------- VNRGKAIAAFRVVDDLKRGLNLVVAHIDSPRLDFKPNPLIEDEQIALFKTHYYGGIKKYH -----------------------------------------%%%%------------333 WLSIPLEIHGVLFKNDGTEIEIHIGDKPEDPVFTIPDLLPHLDKEDAKISEKFKGENLLI 3------------1111---------3333--------3333-----3333--3333--- AGTIPLSGEEKEAVKTNVLKILNEYGITEEDFVSGEIEVVPAFSPREVGDRSLIGAYGQD 
-----------3333------------3333---------------------------22 DRICAYTALRALLSANPEKSIGVIFFDKEEIGSDGNTGAKARFYLKALRQILKQGAKDSE 22---------1111------------3333---------------------------33 FVLDEVLENTSVISGDVCAAVNPPYKDVHDLHNAPKLGYGVALVKYTGARGKYSTNDAHA 33----------------------1111-1111--2222--------------------- EFVARVRKVLNEQGVIWQVATLGKVDQGGGGTIAKFFAERGSDVIDGPALLGHSPFEISS ----------1111------------------33333333-------------------- KADLFETYVAYRSLEKL ----------------- -------------------------------- >CALCITONIN-1; SWP:P01263; PDB:2GLHA; CSNLSTCVLGKLSQELHKLQTYPRTNTGSGTP --------------------3333-------- >ZINC FINGER PROTEIN GLI1; SWP:P08151; PDB:2GLIA; ETDCRWDGCSQEFDSQEQLVHHINSEHIHGERKEFVCHWGGCSRELRPFKAQYMLVVHMR -----2222------------------------------------------3333----- RHTGEKPHKCTFEGCRKSYSRLENLKTHLRSHTGEKPYMCEHEGCSKAFSNASDRAKHQN ----------------------------3333-------------------3333----- RTHSNEKPYVCKLPGCTKRYTDPSSLRKHVKTVHG ---------------------3333---------- >PROBABLE M18-FAMILY AMINO; SWP:Q97K30; PDB:2GLJA; LLKEYKNAWDKYDDKQLKEVFALGDRFKNFISNCKTERECVTELIKTAEKSGYRNIEDIL ----------------------------1111-------3333----------------- AKGETLKEGDKVYANNRGKGLIMFLIGKEPLYTGFKILGAHIDSPRLDLKQNPLYEDTDL ------2222-----iiii----------3333--------------------------- AMLETHYYGGIKKYQWVTLPLAIHGVIVKKDGTIVNVCVGEDDNDPVFGVSDILVHLASE ----------------------------1111---------1111--------3333333 QLEKKASKVIEGEDLNILIGSIPLKDGEEKQKVKHNIMKILNEKYDISEEDFVSAELEIV 3--3333----------------%%%%-----------------------3333------ PAGKARDYGFDRSMVMGYGQDDRICAYTSFEAMLEMKNAKKTCITILVDKEEVGSIGATG --------1111-------------------------------------3333------- MQSKFFENTVADIMSDELKLRKALYNSEMLSSDVSAAFDPNYPNVMEKRNSAYLGKGIVF --3333-33333333-----3333--------------11111111------2222---- NKYTGSRGKSGCNDANPEYIAELRRILSKESVNWQTAELGKVDQGGGGTIAYILAEYGMQ -------------------------------------------------3333-3333-- VIDCGVALLNHAPWEISSKADIYETKNGYSAFLNN ----------------------------------- >XYLOSE ISOMERASE; SWP:P24300; PDB:2GLKA; 
MNYQPTPEDRFTFGLWTVGWQGRDPFGDATRRALDPVESVRRLAELGAHGVTFHDDDLIP -----3333----1111--------------------------1111------1111--2 FGSSDSEREEHVKRFRQALDDTGMKVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTI 222--------------------------------3333--------------------- RNIDLAVELGAETYVAWGGREGAESGGAKDVRDALDRMKEAFDLLGEYVTSQGYDIRFAI ------1111-------1111---1111-------------------------------- EPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK ----------------------1111-3333----------1111--------------- LFHIDLNGQNGIKYDQDLRFGAGDLRAAFWLVDLLESAGYSGPRHFDFKPPRTEDFDGVW --------------------------------------------------3333------ ASAAGCMRNYLILKERAAAFRADPEVQEALRASRLDELARPTAADGLQALLDDRSAFEEF -----------------------------------3333---3333------33331111 DVDAAAARGMAFERLDQLAMDHLLGARG ----3333-------------------- >(3R)-hydroxymyristoyl-acy; SWP:Q5G940; PDB:2GLLA; LQSQFFIEHILQILPHRYPMLLVDRITELQANQKIVAYKNITFNEDVFNGHFPNKPIFPG ---------3333----------------2222-----------3333---2222---33 VLIVEGMAQSGGFLAFTSLWGFDPEIAKTKIVYFMTIDKVKFRIPVTPGDRLEYHLEVLK 33--------------------3333--------------------2222---------- HKGMIWQVGGTAQVDGKVVAEAELKAMIAERE -------------iiii--------------- >BRINKER CG9653-PA; SWP:Q9XTN4; PDB:2GLOA; GSRRIFTPHFKLQVLESYRNDNDCKGNQRATARKYNIHRRQIQKWLQCESNLRSSVANN -----------------------2222----------3333------------------ >92AA LONG HYPOTHETICAL PR; SWP:O73971; PDB:2GLWA; MDVLAKFHTTVHRIGRIIIPAGTRKFYGIEQGDFVEIKIVKYEGEEPKEGTFTARVGEQG ------------iiii-------------2222---------!!!!-----------%%% SVIIPKALRDVIGIKPGEVIEVLLLGHYKPRN %---3333------2222-------------- >1,5-ANHYDRO-D-FRUCTOSE RE; SWP:Q2I8V6; PDB:2GLXA; NRWGLIGASTIAREWVIGAIRATGGEVVSMMSTSAERGAAYATENGIGKSVTSVEELVGD --------------------1111------------------1111------33331111 PDVDAVYVSTTNELHREQTLAAIRAGKHVLCEKPLAMTLEDAREMVVAAREAGVVLGTNH ----------1111--------1111---------------------------------- HLRNAAAHRAMRDAIAEGRIGRPIAARVFHAVYLPPHLQGWRLERPEAGGGVILDITVHD 3333-------------1111-------------3333-3333-3333----3333---- 
ADTLRFVLNDDPAEAVAISHSAGMGKEGVEDGVMGVLRFQSGVIAQFHDAFTTKFAETGF -----1111-----------------------------3333------------------ EVHGTEGSLIGRNVMTQKPVGTVTLRNAEGESQLPLDPANLYETALAAFHSAIEGHGQPS --------------------------3333---------------------1111----- ATGEDGVWSLATGLAVVKAAATGQAAEIETGL -------------------------------- >similar to Formylmethanof; SWP:Q193J3; PDB:2GLZA; VEKTPWELVIDFHGHTCPDIALGYRIAQLAQREGIRPAPDSECLVKAYTQSCALDAIQVL -------------------------------------1111---------3333------ NKATIGRHALIIEETHRYYQFHFTGTQDIHQFTVSPAVLDHLETLRHPDLSPRERQNKVL ---3333---------------2222----------------1111-------------- EGVQYVLTLEESAFCHYDKIPGQLSKI -----11113333-------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8P9Y3; PDB:2GM2A; MPLNQEHPDYTYALRAADGRHAKVNEQILQQSFILMPDELVEHWPVPSLGQLQPAHMDAV ------------------------------------------------11113333---- LALNPAVILLGTGERQQFPSTDVLAACLTRGIGLEAMTNAAAARTYNVLASEGRRVALAM ---------------------------3333----------------------------- IVGGLEHHHHHH ------------ >UNKNOWN PROTEIN; SWP:Q8LGG8; PDB:2GM3A; PTKVVAVNASTIKDYPNPSISCKRAFEWTLEKIVRSNTSDFKILLLHVQVSIYASPEDFR ---------------------------------!!!!-----------------3333-- DRQSNKAKGLHLLEFFVNKCHEIGVGCEAWIKTGDPKDVICQEVKRVRPDFLVVGSRGLG ------------------------------------------------------------ TVSAFCVKHAECPVTIKRNADETPSDPADD ------------------1111---3333- >TRANSPOSON GAMMA-DELTA RE; SWP:P03012; PDB:2GM5A; ALFGYARVSTSQQSLDIQVRALKDAGVKANRIFTDKADRKGLDLLRKVKEGDVILVKKLD ----------------------1111-1111-----------------2222-----111 HLGRDTADIQLIKEFDAQGVSIRFIDDGISTDSYIGKVVTILSAVAQAERQRILERTNE 1---3333-------1111-----3333-1111-------------------------- >CYSTEINE DIOXYGENASE TYPE; SWP:Q46R41; PDB:2GM6A; SLAPLREFITGLSALLDEQPGEARILREGGALLARLVARDDWLPDAFAQPHPEYYQQLLH ---------------1111------------------------3333------------- CDSAERFSIVSFVWGPGQRTPIHDHTVWGLIGLRGAEYSQPFVLDGSGRPVLHGEPTRLE -1111---------2222--------------------------1111-----------2 PGHVEAVSPTVGDIHRVHNAYDDRVSISIHVYGANIGGVRRSVYTEAGERKPFISGYSNP 
222-------------------------------3333------1111------------ YLPNPWDRSK ---------- >TENA HOMOLOG/THI-4 THIAMI; SWP:NA; PDB:2GM8A; HGVTGELRRRADGIWQRILAHPFVAELYAGTLPMEKFKYYLLQDYNYLVNFAKALSLAAS --------------------------------3333------------------------ RAPSVDLMKTALELAYGTVTGEMANYEALLKEVGLSLRDAAEAEPNRVNVSYMAYLKSTC ------------------------------1111-------------------------- ALEGFYQCMAALLPCFWSYAEIAERHGGKLRENPVHVYKKWASVYLSPEYRGLVERLRAV ---------------------------3333----------3333--------------- LDSSGLSAEELWPYFKEASLYELEFWQAAYEGH 1111--3333----------------------- >GRANULOCYTE-MACROPHAGE CO; SWP:P04141; PDB:2GMFA; RSPSPSTQPWEHVNAIQEARRLLNLSRDTAAEMNETVEVISEMFDLQEPTCLQTRLELYK ---1111-----------------------3333----------3333------------ QGLRGSLTKLKGPLTMMASHYKQHCPPTPETSCATQIITFESFKENLKDFLLVIPFDCWE ----3333-----------------------------------------1111------- P - >HYPOTHETICAL PROTEIN PF06; SWP:NA; PDB:2GMGA; AHHHHHHGSATRREKIIELLLEGDYSPSELARILDMRGKGSKKVILEDLKVISKIAKREG --------------11111111---3333---------------------------1111 MVLLIKPAQCRKCGFVFKAEINIPSRCPKCKSEWIEEPRFKLERK --------------------------------------------- >RIBOSOMAL LARGE SUBUNIT P; SWP:P32684; PDB:2GMLA; DLVLIALNKPVGIVSTTEDGERDNIVDFVNHSKRVFPIGRLDKDSQGLIFLTNHGDLVNK -----------------!!!!--3333--------------1111--------3333--- ILRAGNDHEKEYLVTVDKPITEEFIRGSAGVPILGTVTKKCKVKKEAPFVFRITLVQGLN ---3333----------------------------------------------------- RQIRRCEHFGYEVKKLERTRINVSLSGIPLGEWRDLTDDELIDLFKLIENSS -33333333---------------22222222----3333----1111---- >HYPOTHETICAL PROTEIN EF00; SWP:Q47787; PDB:2GMQA; GKEIAIQEKDLTLQWRGNTGKLVKVRLKNTRAEWYNKQITEENIQEITTLNIIKNGKSLA ------3333----2222----------3333-1111--33333333------%%%%--- LEVYPEKSIYVKPRINVPVFFIKTPINRGVFEEIFG ---1111----------------------------- >Putative pyridoxamine 5-p; SWP:NA; PDB:2GMSA; HMINYPLASSTWDDLEYKAIQSVLDSKMFTMGEYVKQYETQFAKTFGSKYAVMVSSGSTA ------------3333-------------------------------------------- 
NLLMIAALFFTKKPRLKKGDEIIVPAVSWSTTYYPLQQYGLRVKFVDIDINTLNIDIESL ------1111------2222--------1111---------------------------- KEAVTDSTKAILTVNLLGNPNNFDEINKIIGGRDIILLEDNCESMGATFNNKCAGTFGLM ----3333-------iiii----------iiii-------1111----%%%%2222---- GTFSSFYSHHIATMEGGCIVTDDEEIYHILLCIRAHGWTRNLPKKNKVTGVKSDDQFEES -----2222-----------------------------1111------------3333-- FKFVLPGYNVRPLEMSGAIGIEQLKKLPRFISVRRKNAEYFLDKFKDHPYLDVQQETGES ------------3333----------------------------1111------------ SWFGFSFIIKKDSGVIRKQLVENLNSAGIECRPIVTGNFLKNTDVLKYFDYTVHNNVDNA ---------------3333--------------!!!!3333-3333-------------- EYLDKNGLFVGNHQIELFDEIDYLREVLK ----------------------------- >HYPOTHETICAL PROTEIN ATU0; SWP:Q8UI08; PDB:2GMYA; KTRINYAKASPEAFKAVMALENYVQSSGLEHRFIHLIKLRASIINGCAFCVDMHVKESRH ----3333---------------1111-------------------------------11 DGLSEQWINLMSVWRESPVYTEQERALLGWVDAVTKIAETGAPDDAFETLRAHFSDEEIV 11-3333--33331111-------------------3333---------3333------- KITVAIGAINTWNRIAVGFRSQHPVEA --------------------------- >THREONINE DEHYDRATASE CAT; SWP:P11954; PDB:2GN0A; ASTYDLPVAIEDILEAKKRLAGKIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFIRGA --------3333----------------------------------33332222------ FNKLSSLTEAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSA ---33333333--------------------------------2222------------- EVVLHGDNFNDTIAKVSEIVETEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVP --------------------------------------------------1111------ IGGGGLIAGIAIAIKSINPTIKVIGVQAENVHGMAASYYTGEITTHRTTGTLADGCDVSR -----------------1111------3333---------------------3333---- PGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVITEGAGALACAALLSGKLDSHIQ ---------------------------------------3333---------1111---- NRKTVSIISGGNIDLSRVSQITG ----------------------- >UDP-GLCNAC C6 DEHYDRATASE; SWP:O25511; PDB:2GN4A; QNMLDNQTILITGGTGSFGKCFVRKVLDTTNAKKIIVYSRDELKQSEMAMEFNDPRMRFF ---2222-----1111-------------------------------------1111--- IGDVRDLERLNYALEGVDICIHAAALKHVPIAEYNPLECIKTNIMGASNVINACLKNAIS 
--1111-----1111------------3333----------------------------- QVIALSTDKAANPINLYGATKLCSDKLFVSANNFKGSSQTQFSVVRYGNVVGSRGSVVPF ------1111--------------------3333---------------2222------- FKKLVQNKASEIPITDIRMTRFWITLDEGVSFVLKSLKRMHGGEIFVPKIPSMKMTDLAK ---------------1111---------------3333---------------------1 ALAPNTPTKIIGIRPGEKLHEVMIPKDESHLALEFEDFFIIQPTISFQTPKDYTLTKLHE 111----------2222-------11111111-----------------------1111- KGQKVAPDFEYSSHNNNQWLEPDDLLKLL ----------------------------- >SLIT-ROBO RHO GTPASE-ACTI; SWP:Q91Z69; PDB:2GNCA; PEFAIAKFDYVGRSARELSFKKGASLLLYHRASEDWWEGRHNGIDGLVPHQYIVV -------------1111---2222--------1111----iiii----3333--- >VASCULAR ENDOTHELIAL GROW; SWP:P52584; PDB:2GNNA; DSNTKGWSEVLKGSECKPRPIVVPVSETHPELTSQRFNPPCVTLMRCGGCCNDESLECVP -----------1111--------3333----1111-----------------1111---- TEEVNVTMELLGGMQRLSFVEHKKCDCRPRFT -------------------------------- >DNA polymerase III, gamma; SWP:Q9WZM9; PDB:2GNOA; DQLETLKRIIEKSEGISILINGEDLSYPREVSLELPEYVEKFPPKASDVLEIDPEGENIG --------------------------------------------3333------------ IDDIRTIKDFLNYSPELYTRKYVIVHDCERTQQAANAFLKALEEPPEYAVIVLNTRRWHY --------------------------3333---------3333--1111-------1111 LLPTIKSRVFRVVVNVPKEFRDLVKEKIGDLWEELPLLERDFKTALEAYKLGAEKLSGLE -33331111-------3333---------3333-3333---------------------- SLKVLETEKLLKKVLSKGLEGYLACRELLERFSKVESKEFFALFDQVTNTITGKDAFLLI -11113333---1111-------------------3333--------------------- QRLTRIILHENTWESVEDQKSVSFLDSILRVKIANLNNKLTLNILAIHRERKR --------------3333-------3333--3333-3333------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q97SS7; PDB:2GNPA; NFDTNFKLENYVKEKYSLESLEIIPNEFDDTPTILSERISQVAAGVLRNLIDDNKIGFSW 3333----------------------1111-----------------3333--------- GKSLSNLVDLIHSKSVRNVHFYPLAGGPSHIHAKYHVNTLIYESRKFHGECTFNATIVQE -------------------------------3333------------------------- NKLLADGILQSRYFENLKNSWKDLDIAVVGIGDFSNKGKHQWLDLTEDDFKELTKVKTVG -------11111111----1111-----------33333333-----------1111--- 
EICCRFFDSKGKEVYENLQERTIAISLEDLKNIPQSLAVAYGDTKVSSILSVLRANLVNH -%%%%--1111---33331111-------1111--------1111--------------- LITDKNTILKVLEEDGD ----------------- >CONSERVED HYPOTHETICAL PR; SWP:Q97WQ4; PDB:2GNRA; KEGSLLRWYDVEAERYEYTVGPAGEQFFNGLKQNKIIGSKCSKCGRIFVPARSYCEHCFV ------------------------------1111-------------------------- KIENYVEINKDEAYVDSYTIIYNDDEGNKLAQPVYIALIRFPNIEGGLLCYAEGNVKVGA --------3333-----------1111-------------2222------------2222 KAKILSFQWPLRVKVD ---------------- >NON-SYMBIOTIC HEMOGLOBIN ; SWP:O04986; PDB:2GNVA; AVAVSFSEEQEALVLKSWAILKKDSANIALRFLLKIFEVAPSASQMFSFLRNSDVPLEKN -------------------------------------------------3333--3333- PKLKTHAMSVFVMTCEAAAQLRKAGKVTVRDTTLKRLGATHLKYGVGDAHFEVVKFALLD -----------------------------------------1111--------------- TIKEEVPADMWSPAMKSAWSEAYDHLVAAIKQEMKPAE ------3333-------------------3333----- >HYPOTHETICAL PROTEIN; SWP:Q6P1I3; PDB:2GNXA; PHLSEQLCFFVQAREIADFYEKYALSTQKFINTEELVSTLDTILRKYSPLESSFQLEVGV ----------3333----------1111-------------------------------- LSHLLKAQAQISEWKFLPSLVTLHNAHTKLQSWGQTFEKQRPPHLFLWLKLKTLLAKFSF ----------1111----------------------------3333-------------- YFHEALSRQTTASEKALTAKANPDLFGKISSFIRKYDAANVSLIFDQYPAVVSLPSDRPV ----------3333--3333---------------------------------------- HWPNVIITDRASDLNSLEKVVHFYDDKVQSTYFLTRPEPHFTIVVIFESKKSERDSHFIS 3333---1111--------------1111--------1111-------------3333-- FLNELSLALKNPKVFASLK ---------3333------ >KUNITZ-TYPE SERINE PROTEA; SWP:Q6VEQ7; PDB:2GO2A; SSVVVDTNGQPVSNGADAYYLVPVSHGHAGLALAKIGNEAEPRAVVLDPHHRPGLPVRFE -----1111--------------------------------------------------- SPLRINIIKESYFLNIKFGPSSSDSGVWDVIQQDPIGLAVKVTDTKSLLGPFKVEKEGEG --------3333-------1111---------------------------------!!!! 
YKIVYYPERGQTGLDIGLVHRNDKYYLAVKDGEPCVFKIRKAT -------1111---------%%%%-----2222---------- >UDP-3-O-[3-hydroxymyristo; SWP:O67648; PDB:2GO3A; GLEKTVKEKLSFEGVGIHTGEYSKLIIHPEKEGTGIRFFKNGVYIPARHEFVVHTNHSTD ------------------------------2222-----iiii----3333--------- LGFKGQRIKTVEHILSVLHLLEITNVTIEVIGNEIPILDGSGWEFYEAIRKNILNQNREI --iiii------------------------------!!!!-------3333--------- DYFVVEEPIIVEDEGRLIKAEPSDTLEVTYEGEFKNFLGRQKFTFVEGNEEEIVLARTFA ---------------------------------------------22221111------- FDWEIEHIKKVGLGKGGSLKNTLVLGKDKVYNPEGLRYENEPVRHKVFDLIGDLYLLGSP 1111----1111-11113333----------1111-------------------1111-- VKGKFYSFRGGHSLNVKLVKELAKKQ -------------------------- >HYDROLASE, HALOACID DEHAL; SWP:Q97NG6; PDB:2GO7A; KTAFIWDLDGTLLDSYEAILSGIEETFAQFSIPYDKEKVREFIFKYSVQDLLVRVAEDRN -------2222------------------------------------------------- LDVEVLNQVRAQSLAEKNAQVVLPGAREVLAWADESGIQQFIYTHKGNNAFTILKDLGVE ----------------3333-------------1111---------------------33 SYFTEILTSQSGFVRKPSPEAATYLLDKYQLNSDNTYYIGDRTLDVEFAQNSGIQSINFL 33-----1111--------------------3333------3333--------------- ESTYEGNHRIQALADISRIFETK ---1111----3333-3333--- >HYPOTHETICAL PROTEIN YQJZ; SWP:P54563; PDB:2GO8A; DFLSKTPEPPYYAVIFSSVKSGETAERVSLAADQPGFLGVESVREADGRGITVSYWDSDA ---------------------1111--------2222---------------------33 INHWRHHTYESYAVRVAKVDRQRLFQE 33------------------------- >IMIDAZOLONEPROPIONASE; SWP:Q8U8Z6; PDB:2GOKA; ATALWRNAQLATLNPAMDGIGAVENAVIAVRNGRIAFAGPESDLPDDLSTADETTDCGGR -------------1111!!!!---------%%%%-----3333-3333--------iiii WITPALIDCHTHLVFGGNRAMEFEMRLNGATYEEIAKAGGGIVSSVRDTRALSDEVLVAQ -------------------------1111------------------------------- ALPRLDTLLSEGVSTIEIKSGYGLDIETELKMLRVARRLETLRPVRIVTSYLAAHATPAD ------------------------------------3333-----------------111 YKGRNADYITDVVLPGLEKAHAEGLADAVDGFCEGIAFSVKEIDRVFAAAQQRGLPVKLH 1-------------------1111---------1111-------------1111------ AEQLSNLGGAELAASYNALSADHLEYLDETGAKALAKAGTVAVLLPGAFYALREKQLPPV 
-------------1111------1111--------------------------------- QALRDAGAEIALATDCNPGTSPLTSLLLTMNMGATLFRMTVEECLTATTRNAAKALGLLA -----------------------------------------------------1111--- ETGTLEAGKSADFAIWDIERPAELVYRIGFNPLHARIFKGQKVS -----2222----------3333--------------iiii--- >MATRIX PROTEIN P17 (MA); SWP:P12497; PDB:2GOLA; SVLSGGELDKWEKIRLRPGGKKQYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQ ----------------2222----3333--------1111-3333--------------- PSLQTGSEELRSLYNTIAVLYCVHQRIDVKDTKEALDKIEEE 3333---1111-----------1111---------------- >FIBRINOGEN-BINDING PROTEI; SWP:P68798; PDB:2GOMA; IKKEQKLIQAQNLVREFEKTHTVSAHRKAQKAVNLVSFEYKVKKMVLQERIDNVLKQGLV --------------------------------11113333--------------3333-- R - >TRILOBED PROTEASE; SWP:Q8U3Y3; PDB:2GOPA; TFAKFAYLSDPRTKGELVAYVLTKANLKDNKYENTIVIENLKNNARRFIENATMPRISPD -------------!!!!----------------------------------------111 GKKIAFMRANEEKKVSEIWVADLETLSSKKILEAKNIRSLEWNEDSRKLLIVGFKRREVP 1---------1111----------------------------1111-------------- AWEKTTFWIFDTESEEVIEEFEKPRFSSGIWHRDKIVVNVPHREIIPQYFKFWDIYIWED -----------------------2222----!!!!-----------------------ii GKEEKMFEKVSFYAVDSDGERILLYGKPEKKYMSEHNKLYIYDGKEVMGILDEVDRGVGQ ii------------------------------------------------1111------ AKIKDGKVYFTLFEEGSVNLYIWDGEIKPIAKGRHWIMGFDVDEIVVYLKETATRLRELF ---iiii------iiii----------------------------------1111----- TWDGEEKQLTDYNDPIFAKL ----------1111------ >OXIDOREDUCTASE, FMN-BINDI; SWP:Q8EEC8; PDB:2GOUA; MTQSLFQPITLGALTLKNRIVMPPMTRSRASQPGDVANHMMAIYYAQRASAGLIVSEGTQ -3333-----!!!!-------------------------------1111----------- ISPTAKGYAWTPGIYTPEQIAGWRIVTEAVHAKGCAIFAQLWHVGRVTHPDNIDGQQPIS -1111--2222------------------3333----------!!!!-3333iiii---- SSTLKAENVKVFVDNGSDEPGFVDVAVPRAMTKADIAQVIADYRQAALNAMEAGFDGIEL --------------------------------------------------1111------ HAANGYLINQFIDSEANNRSDEYGGSLENRLRFLDEVVAALVDAIGAERVGVRLAPLTTL --%%%%3333--1111---------3333----------------3333---------11 
NGTVDADPILTYTAAAALLNKHRIVYLHIAEVDWDDAPDTPVSFKRALREAYQGVLIYAG 11----3333--------3333----------!!!!------------------------ RYNAEKAEQAINDGLADMIGFGRPFIANPDLPERLRHGYPLAEHVPATLFGGGEKGLTDY ----------1111-------3333-------------------3333-----2222--- PTYQA ----- >HEME-BINDING PROTEIN 1; SWP:Q9R257; PDB:2GOVA; NSLFGSVETWPWQVLSTGGKEDVSYEERACEGGKFATVEVTDKPVDEALREAMPKIMKYV -------------------------------------------3333-----------33 GGTNDKGVGMGMTVPVSFAVFPNEDGSLQKKLKVWFRIPNQFQGSPPAPSDESVKIEERE 33-1111---------------1111------------3333--------1111------ GITVYSTQFGGYAKEADYVAHATQLRTTLEGTPATYQGDVYYCAGYDPPMKPYGRRNEVW -------------3333-----------2222----%%%%-------------------- LVKA ---- >UBIQUITIN-LIKE PROTEIN 3; SWP:O95164; PDB:2GOWA; SSNVPADMINLRLILVSGKTKEFLFSPNDSASDIAKHVYDNWPMDWEEEQVSSPNILRLI --------------1111-------1111-------------------11111111---- YQGRFLHGNVTLGALKLPFGKTTVMHLVARETLPEPNSQGQRNREKTGESNCCVIL ----------3333---2222----------------------------------- >ADENOSINE PHOSPHOSULFATE ; SWP:O05927; PDB:2GOYA; FDLPALASSLADKSPQDILKAAFEHFGDELWISFSGAEDVVLVDMAWKLNRNVKVFSLDT -------------------------------------------------1111------- GRLHPETYRFIDQVREHYGIAIDVLSPDPRLLEPLVKEKGLFSFYRDGHGECCGIRKIEP ---------------------------33333333------3333------3333----- LKRKLAGVRAWATGQRRDQSPGTRSQVAVLEIDGAFSTPEKPLYKFNPLSSMTSEEVWGY ---3333--------3333-------------3333-3333-----1111---------- IRMLELPYNSLHERGYISIGCEPCTRPVLPNQHEREGRWWWE -1111---3333--------3333----22223333--3333 >6-PHOSPHOGLUCONATE DEHYDR; SWP:NA; PDB:2GP4A; HHSVVQSVTDRIIARSKASREAYLAALNDARNHLLKQEVGSVAQVAGVPCDGVTQGQPGE ------------------------------------------------------------ LSLLSREVIAATAVGLSHNFDGALLLGICDKIVPGLLIGALSFGHLPLFVPAGPGKVDRA -3333----------3333-----------------------3333-------------- QLLEAEAQSYHSAGTCTFYGQLLEVGLQLPGSSFVNPDDPLREALNKAAKQVCRLTELGT --1111--3333----------------2222---1111----------------3333- QYSPIGEVVNEKSIVNGIVALLATGGSTNLTHIVAAARAAGIIVNWDDFSELSDAVPLLA 
---3333--------------1111------------1111---3333------------ RVYPNGHADINHFHAAGGAFLIKELLDAGLLHEDVNTVAGYGLRRYTQEPKLLDGELRWV ------------------------------------1111------------iiii---- DGPTVSLDTEVLTSVATPFQNNGGLKLLKGNLGRAVIKVSAVQPQHRVVEAPAVVIDDQN -------------3333------------3333-----11113333-----------333 KLDALFKSGALDRDCVVVVKGQGPKANGPELHKLTPLLGSLQDKGFKVALTDGRSGASGK 3-------1111----------3333---------------------------------- VPAAIHLTPEAIDGGLIAKVQDGDLIRVDALTGELSLLVSDTELATRTATEIDLRHSRYG ---------1111-3333---------------------3333----------1111--- GRELFGVLRSNLSSPETGARSTSAIDELY 3333---3333--3333------3333-- >3-OXOACYL-[ACYL-CARRIER-P; SWP:P63456; PDB:2GP6A; HMVTGKAFPYVVVTGIAMTTALATDAETTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGH ------------------------------------------------------------ LLEEFDHQLTRIELRRMGYLQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELV ----1111-33331111---------------3333----1111-----------3333- FSYDDMRARGMKAVSPLTVQKYMPNGAAAAVGLERHAKAGVMTPVSACASGAEAIARAWQ ---3333-------111133331111-------------------!!!!----------- QIVLGEADAAICGGVETRIEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFGEG -1111---------------3333---------------3333-----1111-------- GALLLIETEEHAKARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGLAP -----------------------------------------------------1111-11 GDIDHVNAHATGTQVGDLAEGRAINNALGGNRPAVYAPKSALGHSVGAVGAVESILTVLA 11----------1111--------------------3333-----1111----------- LRDQVIPPTLNLVNLDPEIDLDVVAGEPRPGNYRYAINNSFGFGGHNVAIAFGRY ---------------1111----------------------2222---------- >BIFUNCTIONAL PROTEIN PUTA; SWP:Q0TJ53; PDB:2GPEA; GTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSD ---------------------1111---------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9I169; PDB:2GPFA; MTSVFDRDDIQFQVVVNHEEQYSIWPEYKEIPQGWRAAGKSGLKKDCLAYIEEVWTDMRP ----------------1111-----1111------------------------------- LSLRQHMDKAAG --1111------ >MITOGEN-ACTIVATED PROTEIN; SWP:P63086; PDB:2GPHA; EMVRGQVFDVGPRYTNLSYIGEGAYGMVCSAYDNLNKVRVAIKKISPFEHQTYCQRTLRE 
----------------------------------------------1111---------- IKILLRFRHENIIGINDIIRAPTIEQMKDVYIVQDLMETDLYKLLKCQHLSNDHICYFLY --3333--1111----------3333---------------------------------- QILRGLKYIHSANVLHRDLKPSNLLLNTTCDLKICDFGLARVADPDHDHTGFLTEYVATR -------------------3333---1111------1111---1111---3333-----1 WYRAPEIMLNSKGYTKSIDIWSVGCILAEMLSNRPIFPGKHYLDQLNHILGILGSPSQED 1113333-------------------------------------------------3333 LNCIINLKARNYLLSLPHKNKVPWNRLFPNADSKALDLLDKMLTFNPHKRIEVEQALAHP 1111--------1111------3333-1111--------------3333---------33 YLEQYYDPSDEPIAEAPFKFDMELDDLPKEKLKELIFEETARFQP 33----1111-----------------3333------1111---- >CONSERVED HYPOTHETICAL PR; SWP:Q33UM0; PDB:2GPIA; GNQSIIFTEQLTWDVQLSAIHFTAQQQGVIDCYIGQKVLEHLAAEKINNSEQALSLFEQF --------------1111------------------------------------------ RFDIEEQAEKLIEQEAFDVQGHIQVERVD -----------1111--1111-------- >SIDEROPHORE-INTERACTING P; SWP:Q2ZSW7; PDB:2GPJA; PAPRELEVIRSTYITPHLRITLGGAGLAGFPADQESAYIKLLFPQAGERPLRTYTIRQQR -----------------------1111------2222-------2222------------ DDEIDVDFVLHDTDGPASSWAKTAQVGELIQIGGPGLKKLINFEADWFLLAGDTALPAIS --------------3333--11112222-------------------------------- VNLAKLPNNAVGYAVIEVLSEADIQPLVHPEHVELHWVINPEADPEGRPLVERIAQLPWL ------1111---------3333------1111----------1111------1111--- AGEPAVWIACEFNSRALRRHFKQAHALPKSHFYTSSYWKIGCNEGEHKLVKQEDEQLENG ----------3333-------------3333-------2222------------------ >GLUCOSE-PERMEASE IIA COMP; SWP:P45618; PDB:2GPR; MWFFNKNLKVLAPCDGTIITLDEVEDEVFKERMLGDGFAINPKSNDFHAPVSGKLVTAFP -------------------3333--3333--1111-----------------------11 TKHAFGIQTKSGVEILLHIGLDTVSLDGNGFESFVTQDQEVNAGDKLVTVDLKSVAKKVP 11------1111-------------%%%%------------2222--------------- SIKSPIIFTNNGGKTLEIVKMGEVKQGDVVAILK ------------------------2222------ >3-dehydroquinate dehydrat; SWP:Q9SQT8; PDB:2GPTA; VKNPSLICAPVMADSIDKMVIETSKAHELGADLVEIRLDWLKDFNPLEDLKTIIKKSPLP ------------------------------------3333-------------------- 
TLFTYRPKWEGGQYEGDENERRDVLRLAMELGADYIDVELQVASEFIKSIDGKKPGKFKV ------1111------------------1111------3333-----1111---1111-- IVSSHNYQNTPSVEDLDGLVARIQQTGADIVKIATTAVDIADVARMFHITSKAQVPTIGL -----------------------1111--------------------------------- VMGERGLMSRILCSKFGGYLTFGTLDSSKVSAPGQPTIKDLLDLYNFRRIGPDTKVYGII --33333333-3333----------1111--2222----------3333-1111------ GKPVSHSKSPIVHNQAFKSVDFNGVYVHLLVDNLVSFLQAYSSSDFAGFSCTIPHKEAAL ---11113333-------------------------------3333----------3333 QCCDEVDPLAKSIGAVNTILRRKSDGKLLGYNTDCIGSISAIEDGLTVVVIGAGGAGKAL ----------------------------------3333----3333-------------- AYGAKEKGAKVVIANRTYERELAEAIGALSLTDLDNYEDGMVLANTTSMGMQPNVEETPI ---------------------1111-----1111-------------2222--1111--- SKDALKHYALVFDAVYTPRITRLLREAEESGAITVSGSEMFVRQAYEQFEIFTGLPAPKE 33331111-------------------1111----------------------------- LYWQIMSKYGSRENLYFQ ------------------ >ESTROGEN-RELATED RECEPTOR; SWP:P62508; PDB:2GPUA; YNKIVSHLLVAEPEKIYAMPDPTVPDSDIKALTTLCDLADRELVVIIGWAKHIPGFSTLS --------1111--------1111----------------------------2222---- LADQMSLLQSAWMEILILGVVYRSLSFEDELVYADDYIMDEDQSKLAGLLDLNNAILQLV ---------------------1111--------1111----------------------- KKYKSMKLEKEEFVTLKAIALANSDSMHIEDVEAVQKLQDVLHEALQDYEAGQHMEDPRR --------3333--------1111-----------------------------3333--- AGKMLMTLPLLRQTSTKAVQHFYNIKLEGKVPMHKLFLEMLE ------------------------------------------ >O-METHYLTRANSFERASE; SWP:Q9KDE1; PDB:2GPYA; EERLKHYLEKQIPARDQYIEQEREAHEQQVPIDLLGESLLHLLKAAPARILEIGTAIGYS --------1111---3333-------------3333------------------!!!!33 AIRAQALPEATIVSIERDERRYEEAHKHVKALGLESRIELLFGDALQLGEKLELYPLFDV 33-3333----------------------11113333------3333-3333-------- LFIDAAKGQYRRFFDYSPVRPGGLILSDNVLFQWLLEHPQYDTRIFPVGDGIAISIKR ---1111------------2222-----1111--1111---------!!!!------- >TRANSTHYRETIN-LIKE PROTEI; SWP:Q5PGA5; PDB:2GPZA; MILSVHILDQQTGKPAPGVEVVLEQKKDGWTQLNTGHTDQDGRIKALWPEKAAAPGDYRV --------------------------------------1111------------------ 
IFKTGQYFESKKLDTFFPEIPVEFHISKTNEHYHVPLLLSQYGYSTYRGS --------1111---------------1111------------------- >CHAPERONE PROTEIN HTPG; SWP:P0A6Z3; PDB:2GQ0A; AQALWTRNKSEITDEEYKEFYKHIAHDFNDPLTWSHNRVEGKQEYTSLLYIPSQAPWDMW --3333-3333--------------------------------------------1111- NRDHKHGLKLYVQRVFIMDDAEQFMPNYLRFVRGLIDSSDLPLNVSREILQDSTVTRNLR 1111-------iiii------11111111------------1111--------------- NALTKRVLQMLEKLAKDDAEKYQTFWQQFGLVLKEGPAEDFANQEAIAKLLRFASTHTDS ------------------------------3333-----3333----1111--------- SAQTVSLEDYVSRMKEGQEKIYYITADSYAAAKSSPHLELLRKKGIEVLLLSDRIDEWMM ----------11112222----------------3333-3333---------33333333 NYLTEFDGKPFQSVSK -----%%%%---1111 >FRUCTOSE-1,6-BISPHOSPHATA; SWP:P0A993; PDB:2GQ1A; KTLGEFIVEKQHEFSHATGELTALLSAIKLGAKIIHRDINKLDLFANEKLKAALKARDIV ------------%%%%--------------------------------------1111-- AGIASEEEDEIVVFEGCEHAKYVVLDPLDGSSNIDVNVSVGTIFSIYRRVTPVGTPVTEE ----1111-----2222--------------3333----------------------333 DFLQPGNKQVAAGYVVYGSSTLVYTTGCGVHAFTYDPSLGVFCLCQERRFPEKGKTYSIN 3----1111---------------------------1111-------------------3 EGNYIKFPNGVKKYIKFCQEEDKSTNRPYTSRYIGSLVADFHRNLLKGGIYLYPSTASHP 3331111---------1111-----------------------------------3333- DGKLRLLYECNPAFLAEQAGGKASDGKERILDIIPETLHQRRSFFVGNDHVEDVERFIRE ----------------1111--------3333----1111-------------------- FPDA ---- >CONSERVED HYPOTHETICAL PR; SWP:NA; PDB:2GQBA; MSIFGKIMSAIFGDSAAASPGGAQAPATTGAAGTAPTAPQPTAAPSIDVAPILDKAVKAK -3333----1111----------------------------------------------- GEKLEWRTSIVDLMKALDIDSSLSARKELAKELGYSGDMNDSASMNIWLHKQVMSKLVAN ----3333---------------------------------------------------- GGKLPPEIKH ----1111-- >RHOMBOID INTRAMEMBRANE PR; SWP:Q9HZC2; PDB:2GQCA; MSAVQVLKFPLSVDLAGFVGLLRRLNVPHRVSEESGQQVLWVPDERLAEQVRELYRRYPE ---------1111--------3333-------------------3333-----1111111 GDPQATLEAA 1--------- >3-OXOACYL-[ACYL-CARRIER-P; SWP:Q8NXE1; PDB:2GQDA; NKRVVITGMGALSPIGNDVKTTWENALKGVNGIDKITRIDTEPYSVHLAGELKNFNIEDH 
------------1111------------------------1111-----------3333- IDKKEARRMDRFTQYAIVAAREAVKDAQLDINENTADRIGVWIGSGIGGMETFEIAHKQL -333311113333------------------3333------------------------- MDKGPRRVSPFFVPMLIPDMATGQVSIDLGAKGPNGATVTACATGTNSIGEAFKIVQRGD ---3333-1111---------------------------!!!!----------------- ADAMITGGTEAPITHMAIAGFSASRALSTNDDIETACRPFQEGRDGFVMGEGAGILVIES ---------------------1111------1111--2222------------------- LESAQARGANIYAEIVGYGTTGDAYHITAPAPEGEGGSRAMQAAMDDAGIEPKDVQYLNA ------------------------------2222----------------3333------ HGTSTPVGDLNEVKAIKNTFGEAAKHLKVSSTKSMTGHLLGATGGIEAIFSALSIKDSKV --------------------1111---------------3333----------------- APTIHAVTPDPECDLDIVPNEAQDLDITYAMSNSLGFGGHNAVLVFKKFEA ---------3333-------------------------------------- >HYPOTHETICAL PROTEIN HI09; SWP:P44941; PDB:2GQFA; SQYSENIIIGAGAAGLFCAAQLAKLGKSVTVFDNGKKIGRKILSGGGFCNFTNLEVTPAH ----------------------1111-------------3333%%%%---------3333 YLSQNPHFVKSALARYTNWDFISLVAEQGITYHEKELGQLFCDEGAEQIVELKSECDKYG ----1111--------3333---------------iiii--------------------- AKILLRSEVSQVERIQNDEKVRFVLQVNSTQWQCKNLIVATGGLSPGLGATPFGYQIAEQ -----------------1111-----!!!!---------------1111--3333---11 FGIPVIPPRASLVPFTYRETDKFLTALSGISLPVTITALCGKSFYNQLLFTHRGISGPAV 11---------------3333---1111---------1111---------1111------ LQISNYWQPTESVEIDLLPNHNVEEEINQAKQSSPKQLKTILVRLLPKKLVELWIEQGIV -------2222--------------------------33331111-3333----1111-- QDEVIANISKVRVKNLVDFIHHWEFTPNGTEGYRTAEVTGGVDTKVISSKTESNQVSGLY ---3333------------------------1111-------3333------3333---- FIGEVLDVTGWLGGYNFQWAWSSAYACALSISRQ --3333------------------------1111 >KIAA1556 PROTEIN; SWP:NA; PDB:2GQHA; GSSGSSGARFTEGLRNEEAMEGATATLQCELSKAAPVEWRKGLEALRDGDKYSLRQDGAV -------------------2222-----------------%%%%---------------- CELQIHGLAMADNGVYSCVCGQERTSATLTVRALPARFIEDSGPSSG --------3333----------------------------------- >ZINC FINGER PROTEIN KIAA1; SWP:Q96KM6; PDB:2GQJA; GSSGSSGPGGPEEQWQRAIHERGEAVCPTCNVVTRKTLVGLKKHMEVCQKLQDALKCQHC 
------------1111-1111------------1111----------------------- RKQFKSKAGLNYHTMAEHSAKPSDAEASEGGESGPSSG -------------------------------------- >Phosphoribosylaminoimidaz; SWP:P0A7D7; PDB:2GQRA; QKQAELYRGKAKTVYSTENPDLLVLEFRNDTSAGDGARIEQFDRKGVNNKFNYFISKLAE ------------------1111-----------1111----2222--------------- AGIPTQERLLSDTECLVKKLDVPVECVVRNRAAGSLVKRLGIEEGIELNPPLFDLFLKND ----------1111------------------!!!!------2222-------------- AHDPVNESYCETFGWVSKENLARKELTYKANDVLKKLFDDAGLILVDFKLEFGLYKGEVV -----3333-1111----------------------------------------iiii-- LGDEFSPDGSRLWDKETLEKDKDRFRQSLGGLIEAYEAVARRLGVQLD -----3333---------------1111------------1111---- >UDP-N-ACETYLENOLPYRUVYLGL; SWP:NA; PDB:2GQTA; EFMRVERVLLKDYTTLGVGGPAELWTVETREELKRATEAPYRVLGNGSNLLVLDEGVPER --------3333-3333-----------------1111--------------1111---- VIRLAGEFQTYDLKGWVGAGTLLPLLVQEAARAGLSGLEGLLGIPAQVGGAVKMNAGTRF -----1111--1111------3333------------1111----------------333 GEMADALEAVEVFHDGAFHVYCPEELGFGYRKSHLPPGGIVTRVRLKLKERPKEEILRRM 33333--------iiii----3333----------2222--------------------- AEVDRARKGQPKRKSAGCAFKNPPGQSAGRLIDERGLKGLRVGDAMISLEHGNFIVNLGQ ----1111--------------2222------11112222-!!!!--1111--------- ARAKDVLELVRRVQEELPLELEWEVWP -3333---------------------- >DEBLOCKING AMINOPEPTIDASE; SWP:Q81HB5; PDB:2GREA; HHTKETELIKELVSIPSPSGNTAKIINFIENYVSEWNVETKRNNKGALILTVKGKNDAQH -3333------------2222---------1111--------1111-------------- RLLTAHVDTLGAVKEIKPDGRLSLSIGGFRWNSVEGEYCEIETSSGKTYTGTILHIEVRI ----------------3333---------33332222-----1111-------------- DERVFSADEVRELGIEVGDFVSFDPRVQITESGYIKSRHLDDKVSVAILLKLIKRLQDEN ----------1111---------------1111-----------------------1111 VTLPYTTHFLISNNEGGNSNIPEETVEYLAVDGALGDGSDEYTVSICAKDSSGPYHYALR ---------------------1111------------------------1111------- KHLVELAKTNHIEYKVDIYPYYRAGFDVKHALIGAGIDSSHAFERTHESSIAHTEALVYA --------------------------------------2222------------------ YVSNLIE ------- >NSP3; SWP:P59641; PDB:2GRIA; 
APIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNEKCSVYTVESGTEVTEFACVVAE -------------------------------3333--------------11111111--- AVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDE --3333-----3333----3333--------1111----------------- >DEPHOSPHO-COA KINASE; SWP:COAE_THEMA; PDB:2GRJA; HVIGVTGKIGTGKSTVCEILKNKYGAHVVNVDRIGHEVLEEVKEKLVELFGGSVLEDGKV -------2222---------------------------------------3333------ NRKKLAGIVFESRENLKKLELLVHPLKKRVQEIINKTSLIVIEAALLKRGLDQLCDHVIT -------1111-----------3333-------1111--------33333333------- VVASRETILKRNREADRRLKFQEDIVPQGIVVANNSTLEDLEKKVEEVKLVW -----------1111--33331111--------------------------- >EVM001; SWP:Q9JFS0; PDB:2GRKA; SFASSCTEEENNHHMGIDVIIKVTKQDQTPTNDKICQSVTEVTESEDDGVSEEVVKGDPT -------------------------1111------------------------------- TYYTVVGGGLRMNFGFTKCPQIKSISESADGNTVNARLSSVSPMYGIESPAITHEEALAM ------%%%%-------------------!!!!--------3333--------------- INDCAVSINIKCSEEEKDSNIKTHPVLGSNISHKKVRYEDIIGSTIVDIKCVKDLEFSVR ------------------------------------------------------------ IGDMCKEASELEVKDGFKYIDGSVSEGATDDTSLIDSTKLKACVGSLV -------------------iiii------------3333--------- >UBIQUITIN-CONJUGATING ENZ; SWP:P63279; PDB:2GRRA; MSGIALSRLAQERKAWRKDHPFGFVAVPTKNPDGTMNLMNWECAIPGKKGTPWEGGLFKL --------------------2222------1111--1111-------2222-2222---- RMLFKDDYPSSPPKCKFEPPLFHPNVYPSGTVCLSILEEDKDWRPAITIKQILLGIQELL ----1111--------------11113333---11111111--3333-----------11 NEPNIQSPAQAEAYTIYCQNRVEYEKRVRAQAKKFAP 11-3333-------------------------1111- >Ran GTPase-activating pro; SWP:P46060; PDB:2GRRB; PADVSTFLAFPSPEKLLRLGPKSSVLIAQQTDTSDPEKVVSAFLKVSSVFKDEATVRMAV -------------------1111-----------------------1111---------- QDAVDALMQKAFNSSSFNSNTFLTRLLVHMGLLKSEDKVKAIANLYGPLMALNHMVQQDY ----------1111-------------1111----------------------3333111 FPKALAPLLLAFVTKPNSALESCSFARHSLLQTLYKV 13333-----------3333-------------1111 >LPQW; SWP:NA; PDB:2GRVA; APTQIIMAIDSIGPGFNPHLLSDQSPVNAAIASLVLPSSFRPVPDPTSPTGSRWELDTTL 
----------------11111111--------------------3333------------ LESAEVTQENPFTVTYKIRPEAQWTDNAPIAADDYWYLWRQMVSQPGVVDPAGYDLITGV ------------------3333-1111-----------------------3333------ QSVEGGKQAVVTFSQPYPAWRELFNDILPAHIVKDIPGGFGAGLARAMPVTGGQFRVETI ---iiii---------1111--------33331111-----1111--------------- DPQRDEILLARNDRFWSVPAKPDLVLFRRGGAPAALADSIRNGDTQVAQVHGGAATFAQL -1111------1111--------------------------------------------- SAIPDVRTARIVTPRVMQLTLRAQQPKLADPQVRKAILGLIDVDLLASVGAGDDNTVTLA --2222---------------11111111--------1111------------------- QAQVRSPSDPGYVPTAPPAMTRDDALELLRDAGYVSEPVPPPRERIVKDGVPLTIVLGVA -----1111--------------------1111--------------iiii--------3 SNDPTSVAVANTAADQLRNVGIDASVLALDPVALYGDALVNNRVDAVVGWRQAGGDLATV 333--------------1111-----------------1111------------------ LASRYGCRALEAQAPSNITGICDRSIQPRIDAALDGTDDIADVIQAVEPRLWNMATVLPI -----3333-------------3333-------------------------3333----- LQDTTIVAAGPSVQNVSLTGAVPVGIVGDAGDWTKT ---------1111-------3333--1111------ >KINESIN-LIKE PROTEIN KIF2; SWP:NA; PDB:2GRYA; EIMCMIRDFRGSLDYRPLPIDEHRICVCVRKRPLNKKETQMKDLDVITIPSKDVVMVHEP -----33331111---------------------3333---------------------- KQKVDLTRYLENQTFRFDYAFDDSAPNEMVYRFTARPLVETIFERGMATCFAYGQTGSGK --1111-------------------3333-----3333-3333-----------2222-- THTMGGSKGIYALAARDVFLMLKKPNYKKLELQVYATFFEIYSGKVFDLLNRKTKLRVLE -----------------------3333--------------iiii-----%%%%-----2 DGKQQVQVVGLQEREVKCVEDVLKLIDIGNSCRTHSSRSHAVFQIILRRKGKLHGKFSLI 222----2222------3333-------------1111---------------------- DLAGNERDRQTRLEGAEINKSLLALKECIRALGRNKPHTPFRASKLTQVLRDSFIGENSR ----------------------------------3333-1111-3333-1111------- TCMIATISPGMASCENTLNTLRYANRV ---------3333----------1111 >Phospholipid hydroperoxid; SWP:P36969; PDB:2GS3A; TENLYFQSMRCARSMHEFSAKDIDGHMVNLDKYRGFVCIVTNVASQGGKTEVNYTQLVDL -----3333----3333----1111-----1111------------1111---------- HARYAECGLRILAFPCNQFGKQEPGSNEEIKEFAAGYNVKFDMFSKICVNGDDAHPLWKW ---3333-----------%%%%------------1111------------1111------ 
MKIQPKGKGILGNAIKWNFTKFLIDKNGCVVKRYGPMEEPLVIEKDLPHYF ---1111-----------------1111------1111333333333333- >PROTEIN YCIF; SWP:P21362; PDB:2GS4A; KTIEDVFIHLLSDTYSAEKQLTRALAKLARATSNEKLSQAFHAHLEETHGQIERIDQVVE ---------------------------1111----------------------------- SESNLKIKRKCVAEGLIEEANEVIESTEKNEVRDAALIAAAQKVEHYEIASYGTLATLAE -3333-----------------1111---------------------------------1 QLGYRKAAKLLKETLEEEKATDIKLTDLAINNVNKK 111--------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q6NEA9; PDB:2GS5A; FADRLFNAERNEPAPGVLVAAPSESEDFARSVILIIEHSEYATFGVNLASRSDVAVFNVI -3333-------------------3333----------3333------------3333-3 PEWVPCVTKPQALYIGGPLNQQSVVGVGVTAQGVDAARVDNLTRLANRLVVNLGADPEEI 3331111------------1111-------222233331111---!!!!--111133333 KPLVSGRLFAGHAEWAPGQLAQEIENGDWFVAPALPSDVTAPGSVDVWGDVRRQPPLPLY 333------------2222----1111-------3333---11113333------3333- STFPV ----- >MEVALONATE PYROPHOSPHATE ; SWP:NA; PDB:2GS8A; ADPNVITVTSYANIAIIKYWGKENQAKIPSTSSISLTLENFTTTSVSFLPDTATSDQFYI -----------------------3333----------------------1111------i NGILQNDEEHTKISAIIDQFRQPGQAFVKETQNNPTAAGLSSSSSGLSALVKACDQLFDT iii------------3333--2222---------3333--3333-----------1111- QLDQKALAQKAKFASGSSSRSFFGPVAAWDKDSGAIYKVETDLKAILVLNAAKKPISSRE --------------!!!!--------------------------------------3333 GKLCRDTSTTFDQWVEQSAIDYQHLTYLKTNNFEKVGQLTEANALAHATTKTANPPFSYL -------1111----------------1111-----------------1111-------- TKESYQAEAVKELRQEGFACYFTDAGPNVKVLCLEKDLAQLAERLGKNYRIIVSKTKDLP ---------------------------------3333--------1111----------- DV -- >HYPOTHETICAL PROTEIN TT13; SWP:Q5SL11; PDB:2GS9A; DPFASLAEAYEAWYGTPLGAYVIAEEERALKGLLPPGESLLEVGAGTGYWLRRLPYPQKV 1111--11113333----------------1111----------!!!!-3333------- GVEPSEALAVGRRRAPEATWVRAWGEALPFPGESFDVVLLFTTLEFVEDVERVLLEARRV ---------3333-3333-----3333--------------------------------- LRPGGALVVGVLEALSPWAALYRRLGEKGVLPWAQARFLAREDLKALLGPPEAEGEAVFL -2222-------1111---------------3333----3333----------------- APEAHPPYEEADLAGRRAGNRPALYLGRWR 
1111-------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q8PD29; PDB:2GSCA; RPHERLDAWRDSMELVEMIYRLTEVFPDQERYGLTAQLRRAAVSIPSNIAEGAARDYSRF 3333--3333------------11113333------------------------------ LSIARGSLSELDTQVQIAARLGYSRSEDDQSVRRQVDLVFAKLTALMNALRRR ------------------1111---3333------------------------ >NAD-DEPENDENT FORMATE DEH; SWP:O08375; PDB:2GSDA; AKVVCVLYDDPINGYPTSYARDDLPRIDKYPDGQTLPTPKAIDFTPGALLGSVSGELGLR ----------1111---------------1111-----------2222---1111----3 KYLESQGHELVVTSSKDGPDSELEKHLHDAEVIISQPFWPAYLTAERIAKAPKLKLALTA 3331111----------1111--------------3333-----------1111------ GIGSDHVDLQAAIDNNITVAEVTYCNSNSVAEHVVMMVLGLVRNYIPSHDWARNGGWNIA ---1111--------------2222---------------1111-------1111--333 DCVARSYDVEGMHVGTVAAGRIGLRVLRLLAPFDMHLHYTDRHRLPEAVEKELNLTWHAT 3-------2222-------3333------3333--------------------------3 REDMYGACDVVTLNCPLHPETEHMINDETLKLFKRGAYLVNTARGKLCDRDAIVRALESG 3333333----------3333----333311112222------1111------------- RLAGYAGDVWFPQPAPNDHPWRTMPHNGMTPHISGTSLSAQTRYAAGTREILECYFEGRP ---------------11113333---------11113333-------------------- IRDEYLIVQGGGLAGVGAHSYSKGNATGGSEEAAKYEKL -3333---%%%%-!!!!---------2222--3333--- >DIHYDROPYRIMIDINASE-RELAT; SWP:Q16555; PDB:2GSEA; DRLLIKGGKIVNDDQSFYADIYMEDGLIKQIGENLIVPGGVKTIEAHSRMVIPGGIDVHT -----------1111--------%%%%------------------iiii----------- RFQMPDQGMTSADDFFQGTKAALAGGTTMIIDHVVPEPGTSLLAAFDQWREWADSKSCCD -----iiii----3333------------------------------------------- YSLHVDISEWHKGIQEEMEALVKDHGVNSFLVYMAFKDRFQLTDCQIYEVLSVIRDIGAI ----------2222---------------------------------------------- AQVHAENGDIIAEEQQRILDLGITGPEGHVLSRPEEVEAEAVNRAITIANQTNCPLYITK ------------------1111--3333-1111--------------------------- VMSKSSAEVIAQARKKGTVVYGEPITASLGTDGSHYWSKNWAKAAAFVTSPPLSPDPTTP -------------1111-------3333----1111-------------------1111- DFLNSLLSCGDLQVTGSAHCTFNTAQKAVGKDNFTLIPEGTNGTEERMSVIWDKAVVTGK -----------------------------11111111-----3333---------3333- 
MDENQFVAVTSTNAAKVFNLYPRKGRIAVGSDADLVIWDPDSVKTISAKTHNSSLEYNIF ---------------------------2222---------------1111-------111 EGMECRGSPLVVISQGKIVLEDGTLHVTEGSGRYIPRKPFPDFVYKRIKARSRLA 1------------iiii---iiii---2222---------3333------1111- >EPHRIN TYPE-A RECEPTOR 3; SWP:P29320; PDB:2GSFA; TYVDPHVHEFAKELDATNISIDKVVGAGEFGEVCSGRLKLPSKKEISVAIKTLKVGYTEK ------3333----3333---------1111--------1111----------2222--- QRRDFLGEASIMGQFDHPNIIRLEGVVTKSKPVMIVTEYMENGSLDSFLRKHDAQFTVIQ ----------3333--1111-------------------1111--------2222----- LVGMLRGIASGMKYLSDMGYVHRDLAARNILINSNLVCKVSDFGLSRVPIRWTSPEAIAY ---------------1111------3333---1111------1111--3333-------- RKFTSASDVWSYGIVLWEVMSYGERPYWEMSNQDVIKAVDEGYRLPPPMDCPAALYQLML ---3333-----------1111--2222-------------------2222--------- DCWQKDRNNRPKFEQIVSILDKLIRNPGSLKIITSSNLLLD 1111-3333-------------------------------- >IMMUNOGLOBULIN (KAPPA) LI; SWP:Q66JS7; PDB:2GSIA; DIVLTQSPASLAVSLGQPATISCGASKSVRT -------------2222-------------- >Thermonuclease [Precursor; SWP:Q8NXI6; PDB:2GSIB; VQLQQSGAELVRSGASVKLSCTASGFNIKDYYMYWVKLRPEQGLEWIGWIDPENGDTEYV -----------------------------------------------------------3 PTFQGKVTMTADTSSNTAYLQLSSL 333---------------------- >PROTEIN PPL-2; SWP:NA; PDB:2GSJA; GGIVVYWGQNGGEGTLTSTCESGLYQIVNIAFLSQFGGGRRPQINLAGHCDPANNGCRTV ---------1111-----------------------iiii-----!!!!---%%%%---- SDGIRACQRRGIKVMLSIGGGAGSYSLSSVQDARSVADYIWNNFLGGRSSSRPLGDAVLD -------1111----------------------------------------1111----- GVDFDIEHGGAYYDALARRLSEHNRGGKKVFLSAAPQCPFPDQSLNKALSTGLFDYVWVQ ---------2222-------1111-----------------3333-3333---------- FYNNPQCEFNSGNPSNFRNSWNKWTSSFNAKFYVGLPASPEAAGSGYVPPQQLINQVLPF ---3333--3333-------------------------1111------------------ VKRSPKYGGVMLWDRFNDLKTKYSSKIKPSV ---1111---------------33333333- >HYPOTHETICAL PROTEIN; SWP:Q8RIL0; PDB:2GSLA; DFSKDIRDYSGLELAFLGDAIWELEIRKYYLQFGYNIPTLNKYVKAKVNAKYQSLIYKKI ---1111--3333-------------------------------1111------------ 
INDLDEEFKVIGKRAKNIKTFPRSCTVEYKEATALEAIIGAYLLKKEEEIKKIINIVIKG 11113333---------------------------------1111--------1111111 EL 1- >Cytochrome c oxidase subu; SWP:Q03736; PDB:2GSMB; LEIIGRPQPGGTGFQPSASPVATQIHWLDGFILVIIAAITIFVTLLILYAVWRFHEKRNK -------2222-------------------------------------------3333-- VPARFTHNSPLEIAWTIVPIVILVAIGAFSLPVLFNQQEIPEADVTVKVTGYQWYWGYEY -----------------------------------------------------------3 PDEEISFESYMIGSPATGGDNRMSPEVEQQLIEAGYSRDEFLLATDTAMVVPVNKTVVVQ 333-------22221111-------------1111-3333-------------------- VTGADVIHSWTVPAFGVKQDAVPGRLAQLWFRAEREGIFFGQCSELCGISHAYMPITVKV -----------3333------2222----------------------1111--------- VSEEAYAAWLEQHHHH ----------1111-- >PHOSPHODIESTERASE-NUCLEOT; SWP:Q8PIS1; PDB:2GSOA; TPHALLLISIDGLRADMLDRGITPNLSHLAREGVRARWMAPSYPSLTFPNHYTLVTGLRP ---------22221111-----------------------------------------33 DHHGIVHNSMRDPTLGGFWLSKSEAVGDARWWGGEPVWVGVENTGQHAATWSWPGSEAAI 33----------------11113333-3333----3333--1111-------2222---i KGVRPSQWRHYQKGVRLDTRVDAVRGWLATDGAQRNRLVTLYFEHVDEAGHDHGPESRQY iii---------------------------!!!!-------------------1111--- ADAVRAVDAAIGRLLAGMQRDGTRARTNIIVVSDHGMAEVAPGHAISVEDIAPPQIATAI ----------------------3333--------------2222--1111--3333---- TDGQVIGFEPLPGQQAAAEASVLGAHDHYDCWRKAELPARWQYGSHPRIPSLVCQMHEGW ----------2222------------------3333-3333----1111-------2222 DALFPDKLAKRAQRGTRGSHGYDPALPSMRAVFLAQGPDLAQGKTLPGFDNVDVYALMSR ---3333---------------11111111------1111---------3333------- LLGIPAAPNDGNPATLLPALRM -----------33333333--- >GLUTATHIONE S-TRANSFERASE; SWP:P46088; PDB:2GSQ; PKYTLHYFPLMGRAELCRFVLAAHGEEFTDRVVEMADWPNLKATMYSNAMPVLDIDGTKM ----------!!!!-------1111--------333311111111---------iiii-- SQSMCIARHLAREFGLDGKTSLEKYRVDEITETLQDIFNDVVKIKFAPEAAKEAVQQNYE ------------------------------------------------------------ KSCKRLAPFLEGLLVSNGGGDGFFVGNSMTLADLHCYVALEVPLKHTPELLKDCPKIVAL ------------------------------------3333------111111113333-- RKRVAECPKIAAYLKKRPVRDF ---------------------- >CLASS 
PI GST GLUTATHIONE ; SWP:P80031; PDB:2GSRA; PPYTITYFPVRGRCEAMRMLLADQDQSWKEEVVTMETWPPLKPSCLFRQLPKFQDGDLTL ----------!!!!-------1111--------3333-3333--1111------!!!!-- YQSNAILRHLGRSFGLYGKDQKEAALVDMVNDGVEDLRCKYATLIYTNYEAGKEKYVKEL -3333-------------------------------------------3333------33 PEHLKPFETLLSQNQGGQAFVVGSQISFADYNLLDLLRIHQVLNPSCLDAFPLLSAYVAR 33-----------%%%%---------3333-------------11111111--------- LSARPKIKAFLASPEHVNRPINGNGKQ -------------3333---------- >HYPOTHETICAL PROTEIN YVFG; SWP:P71066; PDB:2GSVA; ELFSVPYFIENLKQHIENQSEDKIHANSYYRSVVSTLVQDQLTKNAVVLKRIQHLDEAYN ----------------------3333----------1111-------------------- KVKRG ----- >3C-LIKE PROTEINASE; SWP:P59641; PDB:2GT7A; SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICLNPNYEDLLIRKSNHS ---------33331111----!!!!---------------1111-------3333-3333 FLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNGSPSGV ----!!!!---------!!!!--------1111--------2222-------iiii---- YQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGKFYGPF -----1111------2222--------!!!!----------1111--------------- VDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQD --------------------------1111-1111-------------3333-------- HVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQCSGVTF ------------------------------iiii-iiii--------------------- >HYPOTHETICAL PROTEIN YPJD; SWP:P42979; PDB:2GTAA; SDKTKDIQAEVDRYIGQFKEGYFSPLAARLTEELGELAREVNHRYGEKPKKATEDDKSEE -----------------3333--3333--------------------------------- EIGDVLFVLVCLANSLDISLEEAHDRVHKFNT ------------3333---------------- >TYPE III PANTOTHENATE KIN; SWP:NA; PDB:2GTDA; MDPMYLLVDVGNTHSVFSITEDGKTFRRWRLSTGVFQTEDELFSHLHPLLGDAMREIKGI ---------------------------------------------1111!!!!1111--- GVASVVPTQNTVIERFSQKYFHISPIWVKAKNGCVKWNVKNPSEVGADRVANVVAFVKEY -----3333-------------------------------3333---------------- GKNGIIIDMGTATTVDLVVNGSYEGGAILPGFFMMVHSLFRGTAKLPLVEVKPADFVVGK ------------------------------------------------------------ 
DTEENIRLGVVNGSVYALEGIIGRIKEVYGDLPVVLTGGQSKIVKDMIKHEIFDEDLTIK -------------------------------------1111---3333-----1111--- GVYHFCFG -------- >PROACTIVATOR POLYPEPTIDE; SWP:P07602; PDB:2GTGA; DVYCEVCEFLVKEVTKLIDNNKTEKEILDAFDKMCSKLPKSLSEECQEVVDTYGSSILSI -3333------------1111-------33331111--3333------------------ LLEEVSPELVCSMLHLCS -----3333--1111--- >REPLICASE POLYPROTEIN 1AB; SWP:P16342; PDB:2GTIA; SSLENVVYNLVNAGHFDGRAGELPCAVIGEKVIAKIQNEDVVVFKNNTPFPTNVAVELFA ---------------------------!!!!----%%%%-----------3333----11 KRSIRPHPELKLFRNLNIDVCWSHVLWDYAKDSVFCSSTYKVCKYTDLQCIESLNVLFDG 11-----------1111-----------3333----------3333----1111----11 RDNGALEAFKKCRNGVYINTTKIKSLSIKGPQRADLNGVVVEKVGDSDVEFWFAVRKDGD 11------1111----------1111---------iiii----%%%%---------iiii DVIFSRTGSLEPSHARGTIFTQSRLLSSFTPRSEEKDFDLDDDVFIAKYSLQDYAFEHVV ------------------------1111------3333--------1111---------- YGSFNQKIIGGLHLLIGLARRQQKSNLVIQEFVTYDSSIHSYLITDENSGSSKSVCTVID -------------3333---3333----------------------1111---------- LLLDDFVDIVKSLNLKCVSKVVNVNVDFKDFQFLWCNEEKVTF -3333----1111------------iiii-------------- >Hemoglobin linker chain L; SWP:Q9GV76; PDB:2GTLM; RFQYLVKNQNLHIDYLAKKLHDIEEEYNKLTHDVDKKTIRQLKARISNLEEHHCDEHESE --1111----------------------------3333-------1111-----2222-- CRGDVPECIHDLLFCDGEKDCRDGSDEDPETCSLNITHVGSSYTGLATWTSCEDLNPDHA ---------1111-------1111---3333--33332222------------------- IVTITAAHRKSFFPNRVWLRATLSYELDEHDHTVSTTQLRGFYNFGKRELLLAPLKGQSE ---------1111----------------------------------------------- GYGVICDFNLGDDDHADCKIVVPSSLFVCAHFNAQRY ------------------------------------- >Extracellular hemoglobin ; SWP:Q2I743; PDB:2GTLN; LDPRLGANAFLIIRLDRIIEKLRTKLDEAEKIDPEHFVSEIDARVTKIEGTHCEKRTFQC ---------------------------3333-------------3333-----2222--- GGNEQECISDLLVCDGHKDCHNAHDEDPDVCDTSVVKAGNVFSGTSTWHGCLAREDHVTR -------------------1111---3333-3333-2222-------------------- ITITASKRRKFFTARIWLRALVESELERHGENVTSSFNAKGYYNFASRRLILLPTDDHDD 
--------1111--------------------------------1111------------ HLAVVCSFNRGDNERAECHRVTEATLHQCADLFVTLEEHD ---------------------------------------- >Extracellular hemoglobin ; SWP:Q2I742; PDB:2GTLO; QSHDEIIDKLIERTNKITTSISHVESLLDDRLDPKRIRKAGSLRHRVEELEDPSCDEHEH ---1111--------------------3333-3333----------3333---------- QCGGDDPQCISKLFVCDGHNDCRNGEDEKDCTLPTKAGDKFIGDVCFDHCTKRRPEHMTL ----------1111-------3333-------------------------1111------ AFESSSIAAFFTPIADLHVHIEIESETDEDESEVSMPADGEYSFADHRLTIHPPEEDGLG -------1111---------------3333------------3333-------------- LVGEFDGYNFDRFVGHIVHELSEEVCAEFIFHRKK ----------------------------------- >AMINOPEPTIDASE N; SWP:Q9JYV4; PDB:2GTQA; TVHYLKDYQTPAYHILKTDLHFDINEPQTVVKSRLTVEPQRVGEPLVLDGSAKLLSVKIN ---3333---------------------------------2222--------------ii GAAADYVLEGETLTIAGVPSERFTVEVETEILPAENKSLGLYASGGNLFTQCEPEGFRKI ii-----------------------------3333--------iiii--------3333- TFYIDRPDVSKFTTTIVADKKRYPVLLSNGNKIDGGEFSDGRHWVKWEDPFSKPSYLFAL -----1111---------3333---------------1111------------3333--- VAGDLAVTEDYFTTSGRNVKIEFYTTEADKPKVGFAVESLKNAKWDETRFGLEYDLDIFV -------------------------33331111--------------------------- VAVGDFNGAENKGLNIFNTKFVLADSRTATDTDFEGIESVVGHEYFHNWTGNRVTCRDWF ----------2222---3333---3333-----------------------------333 QLSLKEGLTVFRDQEFSGDRASRAVRRIENIRLLRQHQFPEDAGPTAHPVRPASYEENNF 3-------------------------------------3333-1111------------- YTTVYEKGAEVVRYHTLLGEEGFQKGKLYFQRHDGQAVTCDDFRAAADANGINLDQFALW --------------------------------2222--3333-----1111--3333--- YSQAGTPVLEAEGRLKNNIFELTVKQTVPPTPDTDKQPIPVKVGLLNRNGEAVAFDYQGK ----------------------------------------------1111------2222 RATEAVLLLTEAEQTFLLEGVTEAVVPSLLRGFSAPVHLNYPYSDDDLLLLLAHDSDAFT -----------------------------2222--------------------------- RWEAAQTLYRRAVAANLATLSDGVELPKHEKLLAAVEKVISDDLLDNAFKALLLGVPSEA ---------------------------------------------------1111----- ELWDGAENIDPLRYHQAREALLDTLAVHFLPKWHELNRQAAKQENQSYEYSPEAAGWRTL 
--2222---3333-------------1111-------------%%%%---3333------ RNVCRAFVLRADPAHIETVAEKYGEAQNTHEWGILSAVNGNESDTRNRLLAQFADKFSDD -------------------------------------1111-3333----------1111 ALVDKYFALVGSSRRSDTLQQVRTALQHPKFSLENPNKARSLIGSFSRNVPHFHAEDGSG --------------1111---------11111111----------------1111----- YRFIADKVIEIDRFNPQVAARLVQAFNLCNKLEPHRKNLVKQALQRIRAQEGLSKDVGEI ----------------------------1111-----------------2222------- VGKILD ------ >CHROMODOMAIN Y-LIKE PROTE; SWP:Q9Y232; PDB:2GTRA; RYRDIVVRKQDGFTHILLSTKSSENNSLNPEVMREVQSALSTAAADDSKLVLLSAVGSVF ---------iiii----------%%%%--------------------------------- CCGLDFIYFIRRLTDDRKRESTKMAEAIRNFVNTFIQFKKPIIVAVNGPAIGLGASILPL --------------------------------------------------!!!!--3333 CDVVWANEKAWFQTPYTTFGQSPDGCSTVMFPKIMGGASANEMLLSGRKLTAQEACGKGL ------3333----3333-----%%%%----------------------------1111- VSQVFWPGTFTQEVMVRIKELASCNPVVLEESKALVRCNMKMELEQANERECEVLKKIWG -----3333-----------1111------------3333-----------------111 SAQGMDSMLKYLQRKIDE 1-1111------------ >HYPOTHETICAL PROTEIN HP00; SWP:O24902; PDB:2GTSA; VQDTEEVREFVGHLERFKELLREEVNSLSNHFHNLESWRDARRDKFSEVLDNLKSTFNEF ---------------------------------------3333----------------- DEAAQEQIAWLKERIR ---------------- >Uncharacterized protein M; SWP:Q57696; PDB:2GTVX; MIEKLAEIRKKIDEIDNKILKARWPWAEKLIAERNSLAKDVAEIKNQLGIPINDPEREKY --3333-------------------------------------3333------------- IYDRIRKLCKEHNVDENIGIKIFQRLIEHNKALQKQYLEETLEH -------------------------------------------- >NONSTRUCTURAL PROTEIN 2; SWP:Q9PY93; PDB:2GU0A; AELACFVSFSLTEDKVVWYPINKKAVQTMLCAKVEKDQRSNYYDTILYGVAPPPEFRNRF -1111-----------------------------3333---------------------- KTNERYGLDYESDQYTELVNLLADTLNMVSMPTEKFQFDIVKTVVQVRHLENLLCRIKDV --------1111--------------3333-3333-3333------------------11 NDILNANVKLRVKAVMIACNLVNETETTPLTESNDIVYQDSYFTITKLDYSNHKLLPLMA 113333----------1111------3333-------------------1111------- DEYKITINTKTDIPDRNQTAFAAYIRYNFNKFAAISHGKRHWRLVLHSQLMSHAERLDRK 
-------------3333-----------1111-------------1111-3333------ IKSDKYDDGDMAFVHPGWKTCIGQLCGGTTFEVAKTSLYSIKPSKTVRTATNKIESDLIS ----------1111----------1111-33333333------3333---------3333 M - >ZINC PEPTIDASE; SWP:Q9KUL5; PDB:2GU1A; RIHYMVKVGDTLSGIFAQLGVPYSILQKILSVDLDHLQLDMIQPGEELELMMDDMGQLSR ------2222------1111-----------3333--3333-2222------1111---- LIYHMSIVEKAIYTRENDGSFSYDFQEISGEWREILFSGEINGSFSVSARRVGLTSSQVA ---------------1111------------------------------1111------- NITQVMKDKIDFSRSLRADRFDILVKQQYLGEHNTGNSEIKAISFKLAKGDVSAFLAEDG -----------------------------!!!!-------------1111------1111 RFYDRAGNSLERAFNRYPVDKAYRQITSGFNPKRKHPVTGRVVPHNGTDFATPIGAPVYS ---1111------------3333-----------------------------2222---- TGDGKVIVVRKHPYAGNYLVIEHNSVYKTRYLHLDKILVKKGQLVKRGQKIALAGATGRL ---------------------------------------2222--2222----------- TGPHLHFEVLVRNRPVDAMKADLP ----------iiii--1111---- >ASPA PROTEIN; SWP:Q9R1T5; PDB:2GU2A; CVAEEPIKKIAIFGGTHGNELTGVFLVTHWLKNGAEVHRAGLEVKPFITNPRAVEKCTRY ---------------------------------3333-2222-----------1111--- IDCDLNRVFDLENLSKESEDLPYEVRRAQEINHLFGPKNSDDAYDVVFDLHNTTSNGCTL ---1111--3333----11113333-----------2222-------------------- ILGDSGNDFLIQFHYIKTCAPLPCSVYLIEHPSLKYATTRSIAKYPVGIEVGPQPHGVLR ---11113333-------------------33331111-3333-----------2222-- ADILDQRRLKHALDFIQRFNEGKEFPPCAIDVYKIEKVDYPRNESGDVAAVIHPNLQDQD ------------------------------------------1111---------2222- WKPLHPGDPVFVSLDGKVIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTKLTLNAKSIRST ----1111----1111-----------------33331111------------------- >YPMB PROTEIN; SWP:P54396; PDB:2GU3A; KEEGHEAAAAEAKKETDLAHVDQVETFVGKEKYYVVKGTDKKGTALYVWVPADKKAKILS -2222----------------------------------1111---------1111---- KEAKEGISEDKAAKIIKDEGLVSKQKEVHLAREGNVLLWEVTYLDKEGQYSLSYVDFTTG -3333-----------1111------------!!!!--------1111------------ KILKNITP -------- >TETRACENOMYCIN POLYKETIDE; SWP:Q8PBM3; PDB:2GU9A; QYATLELNNAFKVLFSLRQVQAAEMVIAPGDREGGPDNRHRGADQWLFVVDGAGEAIVDG 
----------------%%%%-------2222--------------------------!!! HTQALQAGSLIAIERGQAHEIRNTGDTPLKTVNFYHPPAYDAQGEPLPAGE !----2222-------------------------------1111------- >GRIFFITHSIN; SWP:P84801; PDB:2GUDA; SLTHRKFGGSGGSPFSGLSSIAVRSGSYLDAIIIDGVHHGGSGGNLSPTFTFGSGEYISN ---------------------------------iiii---------------2222---- MTIRSGDYIDNISFETNMGRRFGPYGGSGGSANTLSNVKVIQINGSAGDYLDSLDIYYEQ ---------------1111----------------------------------------- Y - >PUTATIVE TETR-FAMILY TRAN; SWP:Q0S8Y3; PDB:2GUHA; RTAEQSRSLIVDAAGRAFATRPYREITLKDIAEDAGVSAPLIIKYFGSKEQLFDALVDFR ----------------1111-3333------------3333-------------1111-- AAAEIVFSGPLDGLGERVSFARPLEPYKPLSLNILFSGPSEESSRKLRANYSAQIDALAE ----1111--2222----------11113333--------------------------11 RLPGRDARLRAELVSLTGLAVRRKQEHATGTPEEVVAHYAPLVQELLDGG 11----------------------1111---------------------- >DNA POLYMERASE III EPSILO; SWP:Q3Z5E8; PDB:2GUIA; ITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHVYLKPDRLVDPEAFG ------------------1111------------%%%%-----------------3333- VHGIADEFLLDKPTFAEVADEFMDYIRGAELVIHNAAFDIGFMDYEFSLLKRDIPKTNTF ----33331111-3333--------2222-----3333---------3333----1111- CKVTDSLAVARKMFPGKRNSLDALCARYEIDNSKRTLHGALLDAQILAEVYLAMTG -------------2222--------------1111-----------------1111 >PHAGE-LIKE ELEMENT PBSX P; SWP:P54332; PDB:2GUJA; AQNTISGKEGRLFLDGEEAHIKTFEANVEKNKSEVNIGRRTGHKTTGANGTGTATFYKVT ------------------------------------------------------------ SKFVLLDYVKKGSDPYFTLQAVLDDQSSGRGTERVTLYDVNFDSAKIASLDEEEVPFTFE ------------------------3333---------------3333------------- DFDVPEKL -------- >HYPOTHETICAL PROTEIN PG18; SWP:Q7MTT4; PDB:2GUKA; QTLNSDLRVFHHIYEFEKGVRSVLATLANDDIPYAEERLRSRQIPYFAQPTPNTERTNLF ---------------------------3333--------1111-------1111------ FGCKECEAIRLFVSGRSLNSLTPEEDFIIGALGYDICRQCERYCRRK --3333------22221111--------------------------- >GLYCOPROTEIN B; SWP:P06437; PDB:2GUMA; ANFYVCPPPTGATVVQFEQPRRCPTRPEGQNYTEGIAVVFKENIAPYKFKATMYYKDVTV ------------------------------------------------------------ 
SQVWFGHRYSQFMGIFEDRAPVPFEEVIDKINAKGVCRSTAKYVRNNLETTAFHRDDHET -----------------------3333-----------------%%%%----2222---- DMELKPANAATRTSRGWHTTDLKYNPSRVEAFHRYGTTVNCIVEEVDARSVYPYDEFVLA ----------------------------2222--------------------------11 TGDFVYMSPFYGYREGSHTEHTTYAADRFKQVDGFYARDLAPTTRNLLTTPKFTVAWDWV 11-----1111----3333-----1111-------------------------------- PKRPSVCTMTKWQEVDEMLRSEYGGSFRFSSDAISTTFTTNLTEYPLSRVDLGDCIGKDA -1111-----------------%%%%-----1111----------3333---3333---- RDAMDRIFARRYNATHIKVGQPQYYQANGGFLIAYQPLLSNTVERIKTTSSIEFARLQFT --------------------------2222---------------------3333----- YNHIQRHVNDMLGRVAIAWCELQNHELTLWNEARKLNPNAIASVTVGRRVSARMLGDVMA ---------------------------------3333----------------------- VSTCVPVAADNVIVQNSMRISSRPGACYSRPLVSFRYEDQGPLVEGQLGENNELRLTRDA -------3333--------1111--------------1111--------%%%%------- IEPCTVGHRRYFTFGGGYVYFEEYAYSHQLSRADITTVSTFIDLNITMLEDHEFVPLEVY -------------!!!!----%%%%-----3333-------------------------- TRHEIKDSGLLDYTEVQRRNQLHDLRFADIDTVIHA -----3333--------------------------- >ROK FAMILY PROTEIN; SWP:Q97NB0; PDB:2GUPA; TIATIDIGGTGIKFASLTPDGKILDKTSISTPENLEDLLAWLDQRLSEQDYSGIASVPGA -------1111------1111--------------------------------------- VNQETGVIDGFSAVPYIHGFSWYEALSSYQLPVHLENDANCVGLSELLAHPELENAACVV -------------1111---3333-3333---------------3333-1111------- IGTGIGGAIINGRLHRGRHGLGGEFGYTTLAPAEKLNNWSQLASTGNVRYVIEKSGHTDW ---------iiii-------2222-------------3333------------------- DGRKIYQEAAAGNILCQEAIERNRNLAQGLLNIQYLIDPGVISLGGSISQNPDFIQGVKK --------1111---------------------------------3333----------- AVEDFVDAYEEYTVAPVIQACTYHADANLYGALVNWLQEEKQW -----3333--------------1111---------------- >ARC/MEDIATOR, Positive co; SWP:Q96RN5; PDB:2GUTA; GAMGQETDWRSTAFRQKLVSQIEDAMRKAGVAHSKSSKDMESHVFLKAKTRDEYLSLVAR -------1111--------------------------------------3333------- LIIHFRDIHNKKSQASV ----------------- -------------------------------------------------------- >AMP NUCLEOSIDASE; SWP:NA; PDB:2GUWA; 
LTPEQALDRLEELYEQSVNALREAIADYVDNGTLPDPHARLNGLFVYPSLSVTTTVTRPA -----1111------------------------------1111--------------333 LFRAYLLEQLNLVYHDYGAHIAVEASHHEIPYPYVIDGSALTLDRSMSAGLTRHFPTTEL 3-----1111--------------------3333-------------------------- AQHFDARRVDFSLARLRHYTGTPVEHFQPFVLFTNYTRYVDEFVRWGCSQILDPDSPYIA ----------------------3333----------------------33331111---- LSCAGGIWITAEEAISDLAWKKHQMPAWHLVTADGQGITLVNIGVGPSNAKTICDHLAVL --3333-----------3333----------1111--------------------3333- RPDVWLMIGHCGGLRESQAIGDYVLAHAYLRDDHVLDAVLPPDIPIPSIAEVQRALYDAT --------------33332222-----------1111---1111---------------- KAVSGMPGEEVKQRLRTGTVVTTDDRNWELRYSASALRFNLSRAVAIDMESATIAAQGYR --------3333--------------3333--3333------------------------ FRVPYGTLLCVSDKPLHGEIKLPGQANRFYEGAISEHLQIGIRAIDLLRAE -------------1111---------3333--------------------- >ALPHA-AMYLASE A; SWP:Q76CT3; PDB:2GUYA; ATPADWRSQSIYFLLTDRFARTDGSTTATCNTADQKYCGGTWQGIIDKLDYIQGMGFTAI -33331111-----3333--1111------3333------------------1111---- WITPVTAQLPQTTAYGDAYHGYWQQDIYSLNENYGTADDLKALSSALHERGMYLMVDVVA ------------1111-1111----1111-3333-------------1111--------- NHMGYDGAGSSVDYSVFKPFSSQDYFHPFCFIQNYEDQTQVEDCWLGDNTVSLPDLDTTK -------3333-3333-----3333--------1111----------------------- DVVKNEWYDWVGSLVSNYSIDGLRIDTVKHVQKDFWPGYNKAAGVYCIGEVLDGDPAYTC 3333----------------------3333-1111-------------------3333-- PYQNVMDGVLNYPIYYPLLNAFKSTSGSMDDLYNMINTVKSDCPDSTLLGTFVENHDNPR -1111------3333--------1111-----------------1111-----------3 FASYTNDIALAKNVAAFIILNDGIPIIYAGQEQHYAGGNDPANREATWLSGYPTDSELYK 333--------------------------3333------------3333----------- LIASANAIRNYAISKDTGFVTYKNWPIYKDDTTIAMRKGTDGSQIVTILSNKGASGDSYT ---------------1111--------------------2222---------1111---- LSLSGAGYTAGQQLTEVIGCTTVTVGSDGNVPVPMAGGLPRVLYPTEKLAGSKICS --------2222-------------1111------%%%%-----33332222---- >Mitochondrial import inne; SWP:Q07914; PDB:2GUZA; GFLKGGFDPKMNSKEALQILNLTENTLTKKKLKEVHRKIMLANHPDKGGSPFLATKINEA 
----------------------3333-----------------3333------------- KDFLEKRGISK ----------- >Mitochondrial import inne; SWP:NA; PDB:2GUZB; MTLDESCKILNIEESKGDLNMDKINNRFNYLFEVNDKEKGGSFYLQSKVYRAAERLKWEL ------------3333-------------------------------------------- AQREK -3333 >PROBABLE ACYLPHOSPHATASE; SWP:P0AB65; PDB:2GV1A; MSKVCIIAWVYGRVQGVGFRYTTQYEAKRLGLTGYAKNLDDGSVEVVACGEEGQVEKLMQ --------------------3333-3333---------------------3333------ WLKSGGPRSARVERVLSEPHHPSGELTDFRIR ------3333---------------------- >Protein SFI1; SWP:Q12369; PDB:2GV5C; GPLGSKLNDILHVYEKSKERELQSQLFNAWRNRFCFYTEECNIQAISKRNYQLEKVLKKF ------------------------------------------------------------ RERLLEIVKSEE -------3333- >MONOOXYGENASE; SWP:Q9HFE4; PDB:2GV8A; LPTIRKIAIIGAGPSGLVTAKALLAEKAFDQVTLFERRGSPGGVWNYTSTLSNKLPVPST -----------------------3333--------------!!!!--------------- NPILTTEPIVGPAALPVYPSPLYRDLQTNTPIELGYCDQSFKPQTLQFPHRHTIQEYQRI 3333------1111--------1111----3333-1111--2222--------------- YAQPLLPFIKLATDVLDIEKKDGSWVVTYKGTKAGSPISKDIFDAVSICNGHYEVPYIPN --1111--------------!!!!--------2222------------------------ IKGLDEYAKAVPGSVLHSSLFREPELFVGESVLVVGGASSANDLVRHLTPVAKHPIYQSL 2222------2222--3333--33332222------------------------------ LGGGDIQNESLQQVPEITKFDPTTREIYLKGGKVLSNIDRVIYCTGYLYSVPFPSLAKLK -------1111----------1111---2222--------------------3333---- SPETKLIDDGSHVHNVYQHIFYIPDPTLAFVGLALHVVPFPTSQAQAAFLARVWSGRLKL 3333-----------2222--3333----------------------------------- PSKEEQLKWQDELFSLSGANNYHSLDYPKDATYINKLHDWCKQATPVLEEEFPSPYWGEK ---------------iiii---------------------3333---------------- ERSIRENWSIRAKFFGIE ------------------ >DNA POLYMERASE; SWP:P04292; PDB:2GV9A; GPTQRHTYYSECDEFRFIAPRVLDEDAPPEKRAGVHDGHLKRAPKVYCGGDERDVLRVGS -------------------3333----1111----------------------1111--- GGFWPRRSRLWGGVDHAPAGFNPTVTVFHVYDILENVEHAYGMRAAQFHARFMDAITPTG --------------------------------------33331111-------------- TVITLLGLTPEGHRVAVHVYGTRQYFYMNKEEVDRHLQCRAPRDLCERMAAALRESPGAS 
--------1111-----------------------------------------------% FRGISADHFEAEVVERTDVYYYETRPALFYRVYVRSGRVLSYLCDNFCPAIKKYEGGVDA %%%-1111----------------------------------------------1111-- TTRFILDNPGFVTFGWYRLKPGRNNTLAQPRAPMAFGTSSDVEFNCTADNLAIEGGMSDL ----1111-----------------------1111-----------1111---------- PAYKLMCFDIECKAGGEDELAFPVAGHPEDLVIQISCLLYDLSTTALEHVLLFSLGSCDL -----------------1111--3333--------------------------------- PESHLNELAARGLPTPVVLEFDSEFEMLLAFMTLVKQYGPEFVTGYNIINFDWPFLLAKL --------1111----------------------------------3333---------- TDIYKVPLDGYGRMNGRGVFRVWDKIKVNGMVNIDMYGIITDKIKLSSYKLNAVAEAVLK -------1111----------------2222----------------------------- DKKKDLSYRDIPAYYATGPAQRGVIGEYCIQDSLLVGQLFFKFLPHLELSAVARLAGINI ------3333-------3333-------------------------------------33 TRTIYDGQQIRVFTCLLRLADQKGFILPDTQVLDPTSGFHVNPVVVFDFASLYPSIIQAH 33-----3333------------------------------------------------- NLCFSTLSLRADAVAHLEAGKDYLEIEVGGRRLFFVKAHVRESLLSILLRDWLAMRKQIR --3333-------------1111------------------------------------- SRIPQSSPEEAVLLDKQQAAIKVVCNSVYGFTGVQHGLLPCLHVAATVTTIGREMLLATR -3333------------------3333-3333---------------------------- EYVHARWAAFEQLLADFPEAADMRAPGPYSMRIIYGDTDSIFVLCRGLTAAGLTAMGDKM ------------------3333----------------------2222------------ ASHISRALFLPPIKLECEKTFTKLLLIAKKKYIGVIYGGKMLIKGVDLVRKNNCAFINRT -----------------------------------2222----------3333------- SRALVDLLFYDDTVSGAAAALAERPAEEWLARPLPEGLQAFGAVLVDAHRRITDPERDIQ -------------------------1111-----3333-------------------333 DFVLTASIKDRIPYVIVAQKLLVSELAEDPAYAIAHGVALNTDYYFSHLLGAACVTFKAL 3--------------------------------1111----3333-----------3333 FGNNAKITESLLKRFIPEVWH %%%%------3333--1111- >AGR_L_2016P; SWP:Q7CTE6; PDB:2GVHA; KPAQHGATTRLIDIVFPGDTNHHGTLFGGTGLALDRVAFIAATRFGRTPFVTASCERIDF ---------------2222----------------------------------------- RQPARIGHIVEFTARPVKAGRRSLTVEVEVAETIIGRQQHTCTRGIFHVAIPEGEDAASY -------------------------------------------------------3333- 
VLPELLTEETPDAVTVEIVFPDQANSAGRFGGEAIAYTKAAFVAASRYCGKLVVLASSER -------------------3333-1111-3333--------------------------- IDFARAIEIGEIVEAQAHVERVGRSSSIQTKLWSENLLTGERHITATGHFTVAVDRPATI -------1111-----------1111---------------------------------- >CONSERVED HYPOTHETICAL PR; SWP:Q9HJ63; PDB:2GVIA; EKLNFGIPEWAFEFHGHKCPYPGYRAGSYALKIAGLEKEKDHRTYLLSESPEDNGCFNDG ---iiii3333--------------------------------------1111------- AQAATGCTYGKGLFSLLGYGKLALILYRPGRKAIRVHVRNSFDELSTRASDFFRYRKQGY -------3333----------------2222-------3333-------------3333- EPSEIPAGAIDPVLEWISSLEDEEIFEYREIDGFTFEPVKKNGAKVRCDVCGEYTYEADA 3333-3333-------11113333-------------------------------3333- KLLNGKPVCKPDYYG --iiii--3333--- >NICOTINAMIDE PHOSPHORIBOS; SWP:P43490; PDB:2GVJA; FNILLATDSYKVTHYKQYPPNTSKVYSYFECREYEETVFYGLQYILNKYLKGKVVTKEKI -3333--3333-3333------------------------3333---------------- QEAKDVYKEHFQDDVFNEKGWNYILEKYDGHLPIEIKAVPEGFVIPRGNVLFTVENTDPE ---------------------------%%%%--------2222--------------333 CYWLTNWIETILVQSWYPITVATNSREQKKILAKYLLETSGNLDGLEYKLHDFGYRGVSS 3--1111--3333-----------------------------2222-------3333--- QETAGIGASAHLVNFKGTDTVAGLALIKKYYGTKDPVPGYSVPAAEHSTITAWGKDHEKD ----------3333-----3333----------------------3333----1111--- AFEHIVTQFSSVPVSVVSDSYDIYNACEKIWGEDLRHLIVSRSTQAPLIIRPDSGNPLDT ----------------------------------333311111111-------------- VLKVLEILGKKFPVTENSKGYKLLPPYLRVIQGDGVDINTLQEIVEGKQKWSIENIAFGS -------1111-------------1111-----------------------3333----- GGGLLQKLTRDLLNCSFKCSYVVTNGLGINVFKDPVADPNKRSKKGRLSLHRTPAGNFVT 3333----3333-----------%%%%-------11111111----------1111---- LEEGKGDLEEYGQDLLHTVFKNGKVTKSYSFDEIRKNAQLNIE ---3333-------------iiii-----3333--1111---- >HEME PEROXIDASE; SWP:Q8A8E8; PDB:2GVKA; FGGHIPQDVAGKQGENVIFIVYNLTDSPDTVDKVKDVCANFSAIRSRNRFPDQFSCTGFG iiii------------------------------------------3333---------- ADAWTRLFPDKGKPKELSTFSEIKGEKYTAVSTPGDLLFHIRAKQGLCFEFASILDEKLK -------1111--1111-----------------------------------------22 
GAVVSVDETHGFRYDGKAIIGFVDGTENPAVDENPYHFAVIGEEDADFAGGSYVFVQKYI 22---------------1111---1111-----3333----3333--2222--------- HDVAWNALPVEQQEKVIGRHKFNDVELSDEEKPGNAHNAVTNIGDDLKIVRANPFANTSK --3333---------------------3333-1111--1111-%%%%----------111 GEYGTYFIGYASTFSTTRRLENFIGSPAGNTDRLLDFSTAITGTLFFVPSYDLLGELGE 1-------------------------2222-3333--------------------1111 >CHEMOSENSORY PROTEIN CSP-; SWP:O76476; PDB:2GVSA; EEKYTTKYDNVNLDEILANDRLLNKYVQCLLEDDESNCTADGKELKSVIPDALSNECAKC -------%%%%33333333---------------11113333------------%%%%-- NEKQKEGTKKVLKHLINHKPDVWAQLKAKYDPDGTYSKKYEDREKELHQ -------------------3333-------1111-3333---------- >MITOCHONDRIAL PRECURSOR P; SWP:P07213; PDB:2GW1A; EKDKYALALKDKGNQFFRNKKYDDAIKYYNWALELKEDPVFYSNLSACYVSVGDLKKVVE 3333----------------3333------3333---3333--------3333-3333-- MSTKALELKPDYSKVLLRRASANEGLGKFADAMFDLSVLSLNGDFNDASIEPMLERNLNK -----1111--3333--------------------------------------------- QAMSKLKEKNLPSVTSMASFFGIFKPELTFANYDESNEADKELMNGLSNLYKRSPESYDK -----1111---3333----1111------------------------------------ ADESFTKAARLFEEQLDKNNEDEKLKEKLAISLEHTGIFKFLKNDPLGAHEDIKKAIELF --------------1111----3333---------------------------------- PRVNSYIYMALIMADRNDSTEYYNYFDKALKLDSNNSSVYYHRGQMNFILQNYDQAGKDF -3333--------------3333-----3333---3333--------------------3 DKAKELDPENIFPYIQLACLAYRENKFDDCETLFSEAKRKFPEAPEVPNFFAEILTDKND 3331111---3333-----3333------------------------------------- FDKALKQYDLAIELENKLDGIYVGIAPLVGKATLLTRNPTVENFIEATNLLEKASKLDPR 3333-------------------------------------3333------------111 SEQAKIGLAQMKLQQEDIDEAITLFEESADLARTMEEKLQAITFAEAAKVQQRIRSDPVL 13333---------------------------------------------------3333 AKKIQET -3333-- >PEPTIDYL-PROLYL CIS-TRANS; SWP:NA; PDB:2GW2A; RPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHYKSCLFH ---------iiii---------3333-------------1111----------2222--- RVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNAAFLLSMANRGKDTNGSQFFI --2222-----1111-------1111---------------------------------- 
TTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELIP ----3333-------------------1111--1111---------------- >Leukocyte immunoglobulin-; SWP:Q8N423; PDB:2GW5A; IPKPTLWAEPDSVITQGSPVTLSCQGSLEAQEYRLYREKITRIRPELVKNGQFHIPSITW --------------------------1111-------------3333-iiii-------3 EHTGRYGCQYYSRARWSELSDPLVLVMTGAYPKPTLSAQPSPVVTSGGRVTLQCESQVAF 333--------%%%%-----------------------------2222------------ GGFILCKEQCLNSQPHARGSSRAIFSVGPVSPNRRWSHRCYGYDLNSPYVWSSPSDLLEL ---------------1111-----------1111---------1111------------- LVPG ---- >TRNA-SPLICING ENDONUCLEAS; SWP:Q8WW01; PDB:2GW6A; SEDAWMGTHPKYLEMMELDIGDATQVYVAFLVYLDLMESKSWHEVNCVGLPELQLICLVG --------------------------------------------------1111------ TEIEGEGLQTVVPTPITASLSHNRIREILKASRKLQGDPDLPMSFTLAIVESDSTIVYYK --------------1111--------------------------------1111------ LTD --- >PII SIGNAL TRANSDUCTION P; SWP:NA; PDB:2GW8A; PMKKIEAIVKPFKLDDVREALTEIGITGMTVSEVKGFGRVDFLPKIKIELVLADDAVERA ---------1111--------1111---------------------------3333---- IDVIVEVARSGKIGDGKIFVLPVEEAIRIRTGERSDAAV -----------2222--------------------1111 >GLUTAMATE CYSTEINE LIGASE; SWP:O23736; PDB:2GWDA; EPLTREDLIAYLASGCKSKEKWRIGTEHEKFGFEVNTLRPMKYDQIAELLNSIAERFEWE ------------3333-3333--------------------------------------- KVMEGDKIIGLKQGKQSISLEPGGQFELSGAPLETLHQTCAEVNSHLYQVKAVAEEMGIG ---!!!!-----!!!!----1111------------------------------1111-- FLGMGFQPKWRREDIPTMPKGRYDIMRNYMPKVGSLGLDMMLRTCTVQVNLDFSSEADMI ----------3333--------------3333---------------------------- RKFRAGLALQPIATALFANSPFTEGKPNGFLSMRSHIWTDTDKDRTGMLPFVFDDSFGFE ----------------------iiii---------------3333---3333-3333--- QYVDYALDVPMYFAYRNGKYVDCTGMTFRQFLAGKLPCLPGELPTYNDWENHLTTIFPEV -----3333------iiii---2222---------3333------------1111----- RLKRYMEMRGADGGPWRRLCALPAFWVGLLYDEDVLQSVLDLTADWTPAEREMLRNKVPV --------------3333------------------------1111-------------- TGLKTPFRDGLLKHVAEDVLKLAKDGLERRGYKEVGFLNAVTEVVRTGVTPAENLLEMYN !!!!--!!!!3333-------------------33333333------------------- 
GEWGQSVDPVFQELLY 1111--3333------ >UBIQUITIN CARBOXYL-TERMIN; SWP:P40818; PDB:2GWFA; SGAITAKELYTMMTDKNISLIIMDARRMQDYQDSCILHSLSVPEEAISPGVTASWIEAHL -----------------------------------2222---3333-2222-----1111 PDDSKDTWKKRGNVEYVVLLDWFSSAKDLQIGTTLRSLKDALFKWESKTVLRNEPLVLEG ---------1111-----------3333-2222------------------------222 GYENWLLCYPQYTTNA 2-------1111---- >4-OXALOMESACONATE HYDRATA; SWP:Q6N0R4; PDB:2GWGA; IIDIHGHYTTAPKALEDWRNRQIAGIKDPSVPKVSELKISDDELQASIIENQLKKQERGS ----------------------3333-3333-3333---------------33331111- DLTVFSPRAGDFNVSSTWAAICNELCYRVSQLFPDNFIGAALPQSPGVDPKTCIPELEKC --------------------------------1111--------2222------------ VKEYGFVAINLNPDPSGGHWTSPPLTDRIWYPIYEKVELEIPAIHVSTGAHYLNADTTAF -------------3333------11111111-----1111-------------------- QCVAGDLFKDFPELKFVIPHGGGAVPYHWGRFRGLAQEKKPLLEDHVLNNIFFDTCVYHQ -----1111-1111----%%%%-3333--------------3333--------------- PGIDLLNTVIPVDNVLFASEIGAVRGIDPRTGFYYDDTKRYIEASTILTPEEKQQIYEGN ----------3333-------------------11113333------------------- ARRVYPRLDAALKAKGKLEH -------------------- >SULFOTRANSFERASE 1C2; SWP:O75897; PDB:2GWHA; RLSVNYVKGILQPTDTCDIWDKIWNFQAKPDDLLISTYPKAGTTWTQEIVELIQNEGDVE ------iiii--3333------1111--1111-----2222-------------iiii33 KSKRAPTHQRFPFLEMKIPSLGSGLEQAHAMPSPRILKTHLPFHLLPPSLLEKNCKIIYV 33---3333---1111-2222--------------------3333-----1111------ ARNPKDNMVSYYHFQRMNKALPAPGTWEEYFETFLAGKVCWGSWHEHVKGWWEAKDKHRI -----------------3333-----------------22223333-------3333--- LYLFYEDMKKNPKHEIQKLAEFIGKKLDDKVLDKIVHYTSFDVMKQNPMANYSSIPAEIM ----------------------------------------3333--1111-11113333- DHSISPFMRKGAVGDWKKHFTVAQNERFDEDYKKKMTRLTFHFQF 3333---------3333-------------------3333----- >65 KDA VIRULENCE PROTEIN; SWP:P74846; PDB:2GWMA; ESKQIQALRYYSAQGYSVINKYLRGDDYPETQAKETLLSRDYLSTNEPSDEEFKNAMSVY -------------3333------------------------------------------- INDIAEGLSSLPETDHRVVYRGLKLDKPALSDVLKEYTTIGNIIIDKAFMSTSPDKAWIN --------------------------3333--------2222------------------ 
DTILNIYLEKGHKGRILGDVAHFKGEAEMLFPPNTKLKIESIVNCGSQDFASQLSKLRLS --------2222----!!!!---------------------------------1111--- DDATADTNRIKRIINMRVLN -11113333----------- >DIHYDROOROTASE; SWP:NA; PDB:2GWNA; AKILLRNALITNEGKTFPGSVIDGAFISRIIEGELPADDNLSADEVIECSGLRLFPGCID -----------iiii-------!!!!---------1111%%%%-----2222-------- DQVHFREPGLTHKATIASESRAAVAGGVTSFDPNTNPPTTWERLLEKRQIGADTAWANYG ---------3333----------------------------------------------- FFFGGTNDNIDEIKRVDKHLVPGLLFLGSSTGNLVDNKETLEKIFGECDLLIATHCEKEE ------------11113333---------------------------------------- IIRANKEHYKAKYGNDLDIHFHPLIRSEEACYRSSAEAVELAERNARLHILHLSTEKELS -----------------3333---------------------------------333311 LFRNDIPTAQKRITSEVCVHHLWFSDTDYGRLGNRIKWNPAIKKESDREALRAAVRNGRI 11----3333-------3333---3333-------------------------------- DIIATDHAPHLLREKEGSCLQAASGGPLVQHSLLALLELCNQGIFSIEEIVSKTAHIPAT ----------3333---1111------3333--------1111----------------1 LFAIEKRGYIRPGYYADLVLVDPSSPHTVSADNILSLCGWSPFEGFTFSHSVAYTFVNGC 111-------2222---------------3333--3333-1111------------iiii LAYAKGRLAESRPTVHPLFFN ---iiii-------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q9UII6; PDB:2GWOA; PPTLASLQRLLWVRQAATLNHIDEVWPSLFLGDAYAARDKSKLIQLGITHVVNAAAGKFQ -------------------------2222---------------------------1111 VDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLPVARYIRAALSVPQGRVLVHCAMGVSR ---33333333-----------1111-3333---------11113333------------ SATLVLAFLMICENMTLVEAIQTVQAHRNICPNSGFLRQLQVLDNRLGR ------------------------------------------------- >DNA-BINDING RESPONSE REGU; SWP:P0A5Z4; PDB:2GWRA; DTRQRILVVDDDASLAELTIVLRGEGFDTAVIGDGTQALTAVRELRPDLVLLDLLPGNGI -----------3333-------1111-------3333----------------------- DVCRVLRADSGVPIVLTAKTDTVDVVLGLESGADDYIKPFKPKELVARVRARLRRNDDEP -----------------1111-------1111-----------------1111------- AELSIADVEIDVPAHKVTRNGEQISLTPLEFDLLVALARKPRQVFTRDVLLEQVWGYRAD ----!!!!---1111---iiii--------------3333-------------------- TRLVNVHVQRLRAKVEKDPENPTVVLTVRGVGYKAGPP -----------------3333------2222------- 
>NIF3-RELATED PROTEIN; SWP:Q730P7; PDB:2GX8A; IPNGHEIISLFESMYPKHLAMEGDKIGLQIGALNKPVRHVLIALDVTEEVVDEAIQLGAN ---------------3333----------------------------------------- VIIAHHPLIFNPLKAIHTDKAYGKIIEKCIKNDIAIYAAHTNVDVAKGGVNDLLAEALGL ----------------1111---------1111-------3333-2222------1111- QNTEVLAPTYAEEMKKVVVFVPVTHAEEVRKALGDAGAGHIGNYSHCTFSSEGTGTFVPQ ---------------------3333--------1111---!!!!---------------- QLERVEEVRIETIIPASLQRKVIKAMVTAHPYEEVAYDVYPLDNKGETLGLGKIGYLQEE --------------3333------------------------------------------ MTLGQFAEHVKQSLDVKGARVVGKLDDKVRKVAVLGGDGNKYINQAKFKGADVYVTGDMY -----------1111--------1111-----------3333------------------ YHVAHDAMMLGLNIVDPGHNVEKVMKQGVQKQLQEKVDAKKLNVHIHASQLHTDPFIFV -----------------3333----------------1111------------------ >NS1 EFFECTOR DOMAIN; SWP:Q6LD08; PDB:2GX9A; TASVPASRYLTDTLEESRDWSLIPKQKVAGPLCIRDQAIDKNIILKANFSVIFDRLETLI ------------3333------------!!!!---1111------------%%%%----- LLRAFTEEGAIVGEISPLPSLPGHTAEDVKNAVGVLIGGLEWNDNTVRVSETLQRFAWRS -----1111--------3333-------------------1111----------1111-- >REPLICATION PROTEIN E1; SWP:P03116; PDB:2GXAA; TEKFDFGTMVQWAYDHKYAEESKIAYEYALAAGSDSNARAFLATNSQAKHVKDCATMVRH ----3333-----1111-------------3333------1111---------------- YLRAETQALSMPAYIKARCKLATGEGSWKSILTFFNYQNIELITFINALKLWLKGIPKKN ------------------3333----3333----------3333-----------2222- CLAFIGPPNTGKSMLCNSLIHFLGGSVLSFANHKSHFWLASLADTRAALVDDATHACWRY -----------3333-------------3333------3333------------------ FDTYLRNALDGYPVSIDRKHKAAVQIKAPPLLVTSNIDVQAEDRYLYLHSRVQTFRFEQP ----3333-----------------------------3333---3333------------ CTDESGEQPFNITDADWKSFFVRLWGRLDL ------------3333-----11111111- >HYPOTHETICAL PROTEIN YYBH; SWP:P37496; PDB:2GXFA; EQQLKDIISACDLAIQNEDFDTLNYYSEDAVLVVKPGIARGKEEIKKAFITIANYFNHHI --------------11113333-------------------------------------- VPTQGKILLEAGDTVLVLSQTLLDERRATYVFKKNAQGEWLCVIDNSYGTDLIG ----------------------------------3333---------!!!!--- >146aa long hypothetical t; SWP:Q96ZY1; PDB:2GXGA; 
ENRIQIMSTIAKIYRAMSRELNRRLGELNLSYLDFLVLRATSDGPKTMAYLANRYFVTQS -----------------------3333-------------1111---------------- AITASVDKLEEMGLVVRVRDREDRRKILIEITEKGLETFNKGIEIYKKLANEVTGDLSED ----------------------3333---------------------------3333--- EVILVLDKISKILKRIEEIS -------------------- >HEAT RESISTANT RNA DEPEND; SWP:O07897; PDB:2GXQA; MEFKDFPLKPEILEALHGRGLTTPTPIQAAALPLALEGKDLIGQARTGTGKTLAFALPIA -3333-----------1111--------------1111-------22223333------- ERLAPSQERGRKPRALVLTPTRELALQVASELTAVAPHLKVVAVYGGTGYGKQKEALLRG -------2222------------------------1111-----------------3333 ADAVVATPGRALDYLRQGVLDLSRVEVAVLDEADEMLSMGFEEEVEALLSATPPSRQTLL ---------------------1111-----------1111--------11113333---- FSATLPSWAKRLAERYMKNPVLINVIK --------------------------- >ANGIOPOIETIN-1 RECEPTOR; SWP:Q02763; PDB:2GY5A; AMDLILINSLPLVSDAETSLTCIASGWRPHEPITIGRDFEALMNQHQDPLEVTQDVTREW -------------------------------------3333------------------- AKKVVWKREKASKINGAYFCEGRVRGEAIRIRTMKMRQQASFLPATLTMTVDKGDNVNIS ------2222-------------iiii---------3333-------------------- FKKVLIKEEDAVIYKNGSFIHSVPRHEVPDILEVHLPHAQPQDAGVYSARYIGGNLFTSA ---------------------------------------3333-------11113333-- FTRLIVRRCEAQKWGPECNHLCTACMNNGVCHEDTGECICPPGFMGRTCEKACELHTFGR ---------2222------------%%%%--1111-----------------------11 TCKERCSGQEGCKSYVFCLPDPYGCSCATGWKGLQCNEACHPGFYGPDCKLRCSCNNGEM 11-----1111----------------2222-2222----2222-2222-----1111-- CDRFQGCLCSPGWQGLQCEREGIPRMTPKIVDLPDHIEVNSGKFNPICKASGWPLPTNEE ---------2222-1111---------------------------------1111-1111 MTLVKPDGTVLHPKDFNHTDHFSVAIFTIHRILPPDSGVWVCSVNTVAGMVEKPFNISVK ----1111------------------------3333---------1111----------- VLP --- >FERRITIN L SUBUNIT; SWP:P02791; PDB:2GYDA; SQIRQNYSTEVEAAVNRLVNLYLRASYTYLSLGFYFDRDDVALEGVCHFFRELAEEKREG 1111---------------------------------1111------------------- AERLLKMQNQRGGRALFQDLQKPSQDEWGTTLDAMKAAIVLEKSLNQALLDLHALGSAQA --------1111------------------------------------------------ 
DPHLCDFLESHFLDEEVKLIKKMGDHLTNIQRLVQAGLGEYLFERLTL ------------------------------1111-------------- >YCFI, PUTATIVE STRUCTURAL; SWP:Q6N4M9; PDB:2GYQA; MGFFSRDIQTMEDLLLHGLRDIYYAEQQITKALPKMIEQATNRDLSQGLTSHLEETQKQI ---2222----------------------------------3333--------------- ERLDQVFKKLGQKPSGVNCPAIDGLIKEADETAGEIADKTVLDAAIVANAQAVEHYEIAR --------------------------------1111------------------------ YGTLIAWAEELGHDDIVRFLTTNLNEEKAANTKLNTVALRAS --------1111------------------------------ >CYTOKINE RECEPTOR COMMON ; SWP:Q6NSJ8; PDB:2GYSA; EETIPLQTLRCYNDYTSHITCRWADTQDAQRLVNVTLIRRVNEDLLEPVSCDLSDDMPWS -------------------------3333------------1111--------------- ACPHPRCVPRRCVIPCQSFVVTDVDYFSFQPDRPLGTRLTVTLTQHVQPPEPRDLQISTD -------------------1111------------------3333--------------! QDHFLLTWSVALHWLSPGDLEFEVVYKRLQDSWEDAAILLSNTSQATLGPEHLMPSSTYV !!!------------1111--------1111-----------------3333-------- ARVRTRLAPGSRLSGRPSKWSPEVCWDSQPGDEAQPQNLECFFDGAAVLSCSWEVRKEVA -------1111----------------------------------------------333 SSVSFGLFYKPSAVLLREEECSPVLREGLGSLHTRHHCQIPVPDPATHGQYIVSVQPRRA 3------------------------------------------1111------------- EKHIKSSVNIQMAPPSLQVTDSYSLRWETDHTFEIQYRKDTATWKDSKTETLQNAHSMAL ----1111------------------------------1111------------------ PALEPSTRYWARVRVRTSRTGYNGIWSEWSEARSWDT ------------------1111--------------- >ASPARTATE BETA-SEMIALDEHY; SWP:Q8DQ00; PDB:2GZ1A; GYTVAVVGATGAVGAQMIKMLEESTLPIDKIRYLASARSAGKSLKFKDQDITIEETTETA -------1111---------1111-----------1111------!!!!-------1111 FEGVDIALFSAGSSTSAKYAPYAVKAGVVVVDNTSYFRQNPDVPLVVPEVNAHALDAHNG 2222-------------------1111-------1111-1111---333333331111-- IIACPNCSTIQMMVALEPVRQKWGLDRIIVSTYQAVSGAGMGAILETQRELREVLNDGVK -----3333-------------------------3333---------------------3 PCDLHAEILPSGGDKKHYPIAFNALPQIDVFTDNDYTYEEMKMTKETKKIMEDDSIAVSA 333-------3333-----------------1111-------------11111111---- TCVRIPVLSAHSESVYIETKEVAPIEEVKAAIAAFPGAVLEDDVAHQIYPQAINAVGSRD ----------------------------------2222----3333----33332222-- 
TFVGRIRKDLDAEKGIHMWVVSDNLLKGAAWNSVQIAETLHERGLVRPTAELKFELK -----------1111-------1111--------------1111------------- >HYPOTHETICAL PROTEIN ATU1; SWP:Q8UGI5; PDB:2GZ4A; SPRAWQRMLSGRRLDLLDPSPLDVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVET -------1111--------1111------------------------------------- IFCRMCPGATPDEMQMALLHDAPEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLP -----1111---------1111--------33331111----------------1111-- PHASRELKDRIKKADTVAAFFEATELAGFSTAEAQKFFGLPRGITRDMFDIIPLPSTEAQ --------------------------------------------1111------------ RLFIARFEAIETLRVTRTGG -------------------- >N-ACETYL-D-GLUCOSAMINE 2-; SWP:Q3M763; PDB:2GZ6A; KNLQALAQLYKNALLNDVLPFWENHSLDSEGGYFTCLDRQGKVYDTDKFIWLQNRQVWTF ---------------------------1111------1111------------------- SMLCNQLEKRENWLKIARNGAKFLAQHGRDDEGNWYFALTRGGEPLVQPYNIFSDCFAAM -----------------------------1111------1111----------------- AFSQYALASGEEWAKDVAMQAYNNVLRRTRPMKALAVPMILANLTLEMEWLLPQETLENV ------------------------1111-----3333----------3333-3333---- LAATVQEVMGDFLDQEQGLMYENVAPDGSHIDCFEGRLINPGHGIEAMWFIMDIARRKND --------------3333------1111----3333---3333------------1111- SKTINQAVDVVLNILNFAWDNEYGGLYYFMDAAGHPPQQLEWDQKLWWVHLESLVALAMG ------------------------------1111----1111------------------ YRLTGRDACWAWYQKMHDYSWQHFADPEYGEWFGYLNRRGEVLLNLKGGKWKGCFHVPRA --------------------------1111------1111-------------------- MYLCWQQFEALS ------------ >TYPE IV SECRETION SYSTEM ; SWP:Q8FXK7; PDB:2GZAA; ASVNFHLEPLRPWLDDPQITEVCVNRPGEVFCERASAWEYYAVPNLDYEHLISLGTATAR ------3333--1111---------2222----%%%%-----1111-------------- FVDQDISDSRPVLSAILPMGERIQIVRPPACEHGTISVTIRKPSFTRRTLEDYAQQGFFK ----------------1111-----------2222---------------------1111 HVRPMSKSLTPFEQELLALKEAGDYMSFLRRAVQLERVIVVAGETGSGKTTLMKALMQEI --------------------------------------------------------1111 PFDQRLITIEDVPELFLPDHPNHVHLFYPVTAATLLRSCLRMKPTRILLAELRGGEAYDF 1111------------1111----------------3333-------------------- INVAASGHGGSITSCHAGSCELTFERLALMVLQNRQGRQLPYEIIRRLLYLVVDVVVHVH 
------------------3333------------3333---------------------- NGVHDGTGRHISEVWYDPNTKRAL ----------------3333---- >Rab11 family-interacting ; SWP:Q7L804; PDB:2GZDC; RRKDTHIRELEDYIDNLLVRVEETPSILRVPYEP -------------------------1111----- >UPF0301 PROTEIN SO3346; SWP:Q8EBZ9; PDB:2GZOA; MESLQNHFLIAMPSLDDTFFERTVIYLCEHDEKGAMGLVINKPLGIEVNSLLEQMDLPTE ------------------------------------------------------------ QVSADLAMGSQVLMGGPVSQDRGFVLHTSQPYWANSTELGSGLMLTTSRDVLTAIGSKRS ------------------1111-------------------------------3333--- PDKFLVALGYAGWSKNQLEQELADNSWLTIPADHALLFDINHEDRWQQASRSLGFEAWQL -----------3333-----------------3333------------------------ STQAGHA ------- >PHOSPHATIDYLETHANOLAMINE-; SWP:Q4Y719; PDB:2GZQA; GGPPTIEELKREKIIPHVFPDENVDLTVDYISFKSGKEVNHGNILDLAGTGSVPRNIKFS ---------------------------------2222--iiii---2222---------- EEPPEDYCYILFIDPDFPSRRRPDGRDYVHWAVSGIKSKELVKGTDKNCITLLPYVGPSI ---2222-----------33331111---------------iiii1111----------- KKGTGLHRISFILSLVKEENKGNVTGVPLYRGEHYITRVKFNNCQSAYNVIQNDKIVGFN 2222------------3333---2222----3333------------------------- WCQRRK ------ >IROE PROTEIN; SWP:Q6KD95; PDB:2GZSA; PNIADKGSVFYHFSATSFDSVDGTRHYRVWTAVPNTTAPASGYPILYLDGNAVDRLDDEL 3333---1111--------1111---------------1111------------------ LKQLSEKTPPVIVAVGYQTNLPFDLNSRAYDYTPAAESRKTDLHRKSGGSNNFRQLLETR ---3333--------------------------3333----------------------- IAPKVEQGLNIDRQRRGLWGHSYGGLFVLDSWLSSSYFRSYYSASPSLGRGYDALLSRVT -----2222--1111-----------------------------33331111------11 AVEPLQFCTKHLAIEGSAGVLSKIHTTLTILKDKGVNAVFWDFPNLGHGPFNASFRQALL 1122221111---------------------1111-------22223333---------- DISGE 1111- >PRKCA-BINDING PROTEIN; SWP:Q9NRD5; PDB:2GZVA; SMVPGKVTLQKDAQNLIGISIGGGAQPCLYIVQVFDNTPAALDGTVAAGDEITGVNGRSI -----------1111-------------------------------2222----iiii-2 KGKTKVEVAKMIQEVKGEVTIHYNKLQYYKV 222---------------------------- >PUTATIVE TATD RELATED DNA; SWP:Q1Y6Z0; PDB:2GZXA; LIDTHVHLNDEQYDDDLSEVITRAREAGVDRFVVGFNKSTIERAKLIDEYDFLYGIIGWH 
------11111111----------1111---------------------1111------3 PVDAIDFTEEHLEWIESLAQHPKVIGIGEGLDYHWDKSPADVQKEVFRKQIALAKRLKLP 3331111-------------1111------------------------------------ IIIHNREATQDCIDILLEEHAEEVGGIHSFSGSPEIADIVTNKLNFYISLGGPVTFKNAK ----------------11113333--------------------------3333------ QPKEVAKHVSERLLVETDAPYLSPHPYRGKRNEPARVTLVAEQIAELKGLSYEEVCEQTT --------------------------2222--3333---------1111----------- KNAEKLFN -------- >THIOREDOXIN; SWP:P14949; PDB:2GZYA; MAIVKATDQSFSAETSEGVVLADFWAPWCGPCKMIAPVLEELDQEMGDKLKIVKIDVDEN ------3333-3333----------1111-3333---------------------33333 QETAGKYGVMSIPTLLVLKDGEVVETSVGFKPKEALQELVNKHL 333---------------iiii------------------1111 >METHYLTRANSFERASE 10 DOMA; SWP:Q9BVG7; PDB:2H00A; VSLNFKDPEAVRALTCTLLREDFGLSIDIPLERLIPTVPLRLNYIHWVEDLIGHQDSDKS ---3333----------------------3333----------------------1111- TLRRGIDIGTGASCIYPLLGATLNGWYFLATEVDDMCFNYAKKNVEQNNLSDLIKVVKVP ----------3333-------------------------------11113333------1 QKTLLMDALKEESEIIYDFCMCNPPFFGITEIMAEGGELEFVKRIIHDSLQLKKRLRWYS 111-3333-----------------------3333----------------!!!!----- CMLGKKCSLAPLKEELRIQGVPKVTYTEFCQGRTMRWALAWSFYD ----1111--------1111----------!!!!----------- >2-CYS PEROXIREDOXIN; SWP:Q86SB2; PDB:2H01A; QGQAPSFKAEAVFGDNTFGEVSLSDFIGKKYVLLYFYPLDFTFVCPSEIIALDKALDSFK ---------------------33332222----------------3333----------- ERNVELLGCSVDSKFTHLAWKKTPLSQGGIGNIKHTLISDISKSIARSYDVLFNESVALR -----------------------3333------------1111---1111---------- AFVLIDKQGVVQHLLVNNLALGRSVDEILRLIDALQHHEKYGDVCPANWQKG -----1111------!!!!-3333---------------------------- >NEUREXIN-1-ALPHA; SWP:Q28146; PDB:2H0BA; EEYIATFKGSEYFCYDLSQNPIQSSSDEITLSFKTLQRNGLLHTGKSADYVNLALKNGAV ----------------1111-------------------------!!!!------iiii- SLVINLGSGAFEALVEPVNGKFNDNAWHDVKVTRNLRQVTISVDGILTTTGYTQEDYTLG ----------------------------------!!!!---------------------- SDDFFYVGGSPSTADLPGSPVSNNFGCLKEVVYKNNDVRLELSRLAKQGDPKKIHGV -----------33332222-------------------------------------- 
>TRANSTHYRETIN-LIKE PROTEI; SWP:O32142; PDB:2H0EA; GKLTTHILDLTCGKPAANVKIGLKRLGESIKEVYTNNDGRVDVPLLAGEELSGEYVEFHA -----------------------------------1111--------3333--------- GDYFASKNAADQPFLTIVTVRFQLADPDAHYHIPLLLSPFGYQVYRGS ----1111-----------------1111------------------- >NADPH-FLAVIN OXIDOREDUCTA; SWP:Q7BGI8; PDB:2H0UA; DREQVVALQHQRFAAKKYDPNRRISQKDWEALVEVGRLAPSSIGLEPWKLLLKNASHFVI -----3333---------1111-------------1111-2222---------------- YLARKGVTYDSDYVKKVHEVKKRDYDTNSRFAQIIKNFQENDKLNSERSLFDWASKQTYI -------1111--------------1111------------------------------- QANAAALGIDSCPIEGYDQEKVEAYLEEKGYLNTAEFGVSVACFGYRNQEITPKTRWKTE --------------------------------3333---------------------333 VIYEVIE 3------ >CITRATE SYNTHASE; SWP:P20901; PDB:2H12A; STATISVDGKSAEMPVLSGTLGPDVIDIRKLPAQLGVFTFDPGYGETAACNSKITFIDGD ---------------------------1111---------2222--------------11 KGVLLHRGYPIAQLAENASYEEVIYLLLNGELPNKAQYDTFTNTLTNHTLLHEQIRNFFN 11---iiii---------3333----------------------------------3333 GFRRDAHPMAILCGTVGALSAFYPANRDLAAMRLIAKIPTIAAWAYKYTQGEAFIYPRND --1111-----------3333--------------------------1111------111 LNYAENFLSMMFARMSEPYKVNPVLARAMNRILILHADHEQNASTSTVRLAGSTGANPFA 1------------1111----------------------------------1111----- CIAAGIAALWGPAHGGANEAVLKMLARIGKKENIPAFIAQVKDKNSGVKLMGFGHRVYKN ----------1111---------------3333---------1111---2222------- FDPRAKIMQQTCHEVLTELGIKDDPLLDLAVELEKIALSDDYFVQRKLYPNVDFYSGIIL ----------------1111-----------------------1111---1111------ KAMGIPTSMFTVLFAVARTTGWVSQWKEMIEEPGQRISRPRQLYIGAPQRDYVPLAKR 1111-3333----------------------2222------------------3333- >WD-REPEAT PROTEIN 5; SWP:P61964; PDB:2H14A; KPNYALKFTLAGHTKAVSSVKFSPNGEWLASSSADKLIKIWGAYDGKFEKTISGHKLGIS ----------------------1111------1111------------------------ DVAWSSDSNLLVSASDDKTLKIWDVSSGKCLKTLKGHSNYVFCCNFNPQSNLIVSGSFDE ----1111----------------1111------------------3333---------- SVRIWDVKTGKCLKTLPAHSDPVSAVHFNRDGSLIVSSSYDGLCRIWDTASGQCLKTLID 
------1111------------------1111------1111------------------ DDNPPVSFVKFSPNGKYILAATLDNTLKLWDYSKGKCLKTYTGHKNEKYCIFANFSVTGG -----------1111----------------1111------------------------- KWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVVISTACHPTENIIASAALENDKTIKLWK -------------------------------------------------3333------- SDC --- >ADP-RIBOSYLATION FACTOR-L; SWP:Q9Y689; PDB:2H16A; QEHKVIIVGLDNAGKTTILYQFSMNEVVHTNVEEIVINNTRFLMWDIGWNTYYTNTEFVI ---------2222------3333-1111--------!!!!----------3333------ VVVDSTDRERISVTREELYKMLAHEDLRKAGLLIFANKQDVKECMTVAEISQFLKLTSIK ---1111----------------3333----------3333-------------1111-- DHQWHIQACCALTGEGLCQGLEWMMSRLK -------------2222-------1111- >ADP-RIBOSYLATION FACTOR-L; SWP:Q9Y689; PDB:2H17A; EHKVIIVGLDNAGKTTILYQFSMNEVVHTSPTIGSNVEEIVINNTRFLMWDIGGQESLRS --------2222---------2222----------------!!!!--------3333--- SWNTYYTNTEFVIVVVDSTDRERISVTREELYKMLAHEDLRKAGLLIFANKQDVKECMTV -----2222-------11111111------------1111----------3333------ AEISQFLKLTSIKDHQWHIQACCALTGEGLCQGLEWMMSR -------3333------------1111------------- >TRAFFICKING PROTEIN B; SWP:Q9RF91; PDB:2H1CA; MILLDTNVISEPLRPQPNERVVAWLDSLILEDVYLSAITVAELRLGVALLLNGKKKNVLH ----3333-3333-----------11113333--------------1111---------- ERLEQSILPLFAGRILPFDEPVAAIYAQIRSYAKTHGKEIAAADGYIAATAKQHSLTVAT ----------2222---------------------------------------------- RDTGSFFAADVAVFNPWHL ------1111----1111- >CARBOXYLESTERASE; SWP:Q81AD5_BACCR; PDB:2H1IA; KHVFQKGKDTSKPVLLLLHGTGGNELDLLPLAEIVDSEASVLSVRGNVLENGPRFFRRLA --------3333-------2222--1111------3333--------------------2 EGIFDEEDLIFRTKELNEFLDEAAKEYKFDRNNIVAIGYSNGANIAASLLFHYENALKGA 222--3333---------------1111-1111--------------------------- VLHHPVPRRGQLANLAGKSVFIAAGTNDPICSSAESEELKVLLENANANVTHWENRGHQL -------------------------------3333--------1111--------!!!!- TGEVEKAKEWYDKAF --------------- ---------------------------------------------------------- >OLIGOENDOPEPTIDASE F; SWP:NA; PDB:2H1NA; AKFSEFRYERPDIAQLQASFQEALDSFRRAGSAALQHEAKRINELRRRYSTANLCHIRHT 
-3333------------------------------------------------------- IDTNDEFYKKEQDFFDETEPVVKGLVNDYYRALVSSPFRAELEQVWGKQLFALAETQLKT -1111---------------------------1111-------------------3333- YAPVIVEDLQKENKLASEYTKLIASAKIFEGEERTLAQLQPFVESPDRARQRASEARFSF -3333-----------------1111--iiii--33333333------------------ FKDYEKELDELYDELVHVRTAIARKLGFQNFVELGYARLGRTDYNADVAGYRRQVKTHIV ------------------------------------1111-------------------- PLAAKLRERQRQRIQVEKLYYYDEPFFPTGNPTPKGDADWIVQNGRQYEELSPETGEFFR -----------3333----1111---1111------------------------------ YVEHELDLVAKKGKAGGGYCTYIDDYKAPFIFSNFTGTSGDIDVLTHEAGHAFQVYESRH -1111-----2222---------1111--------------------------------- YDIPEYNWPTLEACEIHSSEFFTWPWELFFGEDADKYRFAHLSDALLFLPYGVAVDEFQH --3333---3333------3333-------3333-------------------------- AVYENPDTPAERKSVWRNIEKAYLPTRDYADHDYLERGGFWQRQGHIYTDPFYYIDYTLA -------3333------------1111-%%%%-----------3333------3333--- QVCAFQFWKRAQEDRASAWRDYVALCRLGGSRPFTELVKSANLQSPFADGAVASVVGHIE -------------------------3333---------------3333------------ RWLDSVDDKAL ------3333- >2H1; SWP:NA; PDB:2H1PH; DVKLVESGGGLVKLGGSLKLSCAASGFTFSSYFLSWVRQTPEKRLELVATINSNGDKTYH ---------------------------3333--------1111----------------- PDTMKGRFTISRDNAKNTLYLQMSSLKSEDTALYYCARRDSSASLYFDYWGQGTTLTVSS -3333-------1111----------1111---------1111----------------- AKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSD ----------------------------------------%%%%-------------!!! 
LYTLSSSVTVPSSTWPSETVTCNVAHPASSTKVDKKIVPR !------------------------3333----------- >HYPOTHETICAL PROTEIN; SWP:Q422F5_DESHA; PDB:2H1QA; GWEIYDAINGIPEDFLVDELVCGTTHSVIRSGNGVGLGPNRPFETRPLTQNLLGLPLRVA -3333--11113333-------1111----!!!!-------------33332222----- AGCVKSWNYVEASIGLAAINAYYNNPQVAREHGVIFSDANDPFISQNEVKGKKVGVVGHF --1111---------------1111----1111-------3333-3333----------2 PHLESLLEPICDLSILEWSPEEGDYPLPASEFILPECDYVYITCASVVDKTLPRLLELSR 222---3333----------1111-3333---3333--------3333--------1111 NARRITLVGPGTPLAPVLFEHGLQELSGFVKDNARAFRIVAGAEKVKIYSAGQKVTIKK --------1111--3333------------------------------1111------- >DIMETHYLADENOSINE TRANSFE; SWP:Q8ILT8; PDB:2H1RA; HLLKNPGILDKIIYAAKIKSSDIVLEIGCGTGNLTVKLLPLAKKVITIDIDSRMISEVKK ----3333----------1111------!!!!-33333333---------3333------ RCLYEGYNNLEVAIKTVFPKFDVCTANIPYKISSPLIFKLISHRPLFKCAVLMFQKEFAE --1111----------------------3333---------------------------- RMLANVGDSNYSRLTINVKLFCKVTKVCNVNRSSFNPPPKVDSVIVKLIPKESSFLTNFD ----2222------------------------------------------3333---333 EWDNLLRICFSRKRKTLHAIFKRNAVLNMLEHNYKNWCTLNKQVPVNFPFKKYCLDVLEH 3----------111133331111------------------------------------- LDMCEKRSINLDENDFLKLLLEFNKKGIHFF --11113333-3333--------1111---- >HYPOTHETICAL PROTEIN; SWP:Q9I2B5; PDB:2H1TA; SRDRLYTWAGLWRSPSSSWEALRLEDDQAESQLRAPDERSGLPYQLDYRLRWDADWHLRE ----------------------------------------------------1111---- AVFHVESETGVRKLHLLADGRGHWQDGDGEALPAFDGCLDIDIWPSPFTNTFPIRRLGLA ------1111---------------1111--3333-----------3333---------2 DGQRAEIRALYIEAPALEPRSRQAYTRLDASHYLYENLEGSAFKAVLLVDEQGLVIDYPG 222-------------------------1111-----------------1111----222 LFQRL 2---- >FERROCHELATASE; SWP:P32396; PDB:2H1VA; SRKKMGLLVMAYGTPYKEEDIERYYTHIRRGRKPEPEMLQDLKDRYEAIGGISPLAQITE ----------------3333------1111----------------1111-3333----- QQAHNLEQHLNEIQDEITFKAYIGLAHIEPFIEDAVAEMHKDGITEAVSIVLAPHFSTFS --------------------------------------------------------3333 VQSYNKRAKEEAEKLGGLTITSVESWYDEPKFVTYWVDRVKETYASMPEDERENAMLIVS 
-------------------------1111--------------33333333--------- AHSLPEKIKEFGDPYPDQLHESAKLIAEGAGVSEYAVGWQSEGNTPDPWLGPDVQDLTRD --------1111------------------------------------------------ LFEQKGYQAFVYVPVGFVADHLEVLYDNDYECKVVTDDIGASYYRPEMPNAKPEFIDALA --------------------3333------------------------!!!!-------- TVVLKKLGR ----1111- >RIBULOSE-1,5 BISPHOSPHATE; SWP:Q43088; PDB:2H21A; LSPAVQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA ------------1111------------1111---------2222-----1111--1111 VAASEIGRVCSELKPWLSVILFLIRERSREDSVWKHYFGILPQETDSTIYWSEEELQELQ 1111-33331111---------------------------------3333-3333---22 GSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPVTLDDFFWAFGILRSRAFSRLNL 22------------------------3333--------3333------------------ VVVPMADLINHSAGVTTEDHAYEVYLFSLKSPLSVKAGEQVYIQYDLNKSNAELALDYGF --3333-----3333--------------------2222------1111----------- IEPNENRHAYTLTLEISESDPFFDDKLDVAESNGFAQTAYFDIFYNRTLPPGLLPYLRLV ---1111---------1111-3333------------------2222--2222------- ALGGTDAFLLESLFRDTIWGHLELSVSRDNEELLCKAVREACKSALAGYHTTIEQDRELK --333333333333-------------------------------1111---------11 EGNLDSRLAIAVGIREGEKMVLQQIDGIFEQKELELDQLEYYQERRLKDLGLCGENGDIL 11------------------------------1111------------------------ ENLY ---- >T-CELL SURFACE GLYCOPROTE; SWP:P29016; PDB:2H26A; FQGPTSFHVIQTSSFTNSTWAQTQGSGWLDDLQIHGWDSDSGTAIFLKPWSKGNFSDKEV ----------------1111--------!!!!------3333-----1111!!!!----- AELEEIFRVYIFGFAREVQDFAGDFQMKYPFEIQGIAGCELHSGGAIVSFLRGALGGLDF ---------------------3333-----------------------------%%%%-- LSVKNASCVPSPEGGSRAQKFCALIIQYQGIMETVRILLYETCPRYLLGVLNAGKADLQR ---%%%%---3333---------3333-----------------------------1111 QVKPEAWLSSGPSPGPGRLQLVCHVSGFYPKPVWVMWMRGEQEQQGTQLGDILPNANWTW --------------2222--------------------!!!!-1111------------- YLRATLDVADGEAAGLSCRVKHSSLEGQDIILYWRNPI --------11112222-----1111------------- >RNA POLYMERASE SIGMA E FA; SWP:P0AGB6; PDB:2H27A; SHMLSEELRQIVFRTIESLPEDLRMAITLRELDGLSYEEIAAIMDCPVGTVRSRIFRARE 
---------------1111---------------------------3333---------- AIDNKVQPLIR -----1111-- >HYPOTHETICAL PROTEIN YEEU; SWP:P76364; PDB:2H28A; DRPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGIRGLFSDADAYHLDQAFPLLKQLELL ---------------------!!!!---1111---------------------------3 TSGELNPRHQHTVTLYAKGLTCKADTLSSCDYVYLAVYPTPEKNLE 333--1111-------2222-----%%%%-------------1111 >Probable nicotinate-nucle; SWP:Q5HFG7; PDB:2H29A; MKKIVLYGGQFNPIHTAHMIVASEVFHELQPDEFYFLPSFMSPLKKHHDFIDVQHRLTMI -----------------------------------------2222--------------- QMIIDELGFGDICDDEIKRGGQSYTYDTIKAFKEQHKDSELYFVIGTDQYNQLEKWYQIE ---------------------------------------------3333--3333--333 YLKEMVTFVVVNRDKNSQNVENAMIAIQIPRVDISSTMIRQRVSEGKSIQVLVPKSVENY 31111---------------3333------------------------------------ IKGEGLYE -------- >TIGHT JUNCTION PROTEIN ZO; SWP:Q07157; PDB:2H2BA; GSHMIWEQHTVTLHRAPGFGFGIAISGGRDNPHFQSGETSIVISDVLKGGPAEGQLQEND ------------------!!!!-----1111---------------2222-2222-2222 RVAMVNGVSMDNVEHAFAVQQLRKSGKNAKITIRRKKGGGWRRTTYL ----iiii-----3333------------------------------ >COMM DOMAIN-CONTAINING PR; SWP:Q8N668; PDB:2H2MA; MAAGELEGGKPLSGLLNALAQDTFHGYPGITEELLRSQLYPEVPPEEFRPFLAKMRGILK ---------3333-3333------------1111-33333333----3333----1111- SIASADMDFNQLEAFLTAQTKKQGGITSDQAAVISKFWKSHKTKIRES -----------1111-----------3333--3333------------ >Low affinity immunoglobul; SWP:P06734; PDB:2H2TB; CNTCPEKWINFQRKCYYFGKGTKQWVHARYACDDMEGQLVSIHSPEEQDFLTKRASHTGS ----2222--!!!!------------------1111-------------------1111- WIGLRNLDLKGEFIWVDGSHVDYSNWAPGEPTSDCVMMRGSGRWNDAFCDRKLGAWVCDR ------%%%%----1111--------2222--------1111-----1111--------- LATC ---- >HOMOSERINE O-SUCCINYLTRAN; SWP:Q9WZY3; PDB:2H2WA; PINVPSGLPAVKVLAKEGIFVMTEKIRPLEILILNLMPDKIKTEIQLLRLLGNTPLQVNV ----1111------1111-------------------------------1111------- TLLYTETHKPKHTPIEHILKFYTTFSAVKDRKFDGFIITGAPVELLPFEEVDYWEELTEI -------------3333------33331111-----------111133331111------ MEWSRHNVYSTMFICWAAQAGLYYFYGIPKYELPQKLSGVYKHRVAKDSVLFRGHDDFFW 
--3333------------------------------------------3333-------- APHSRYTEVKKEDIDKVPELEILAESDEAGVYVVANKSERQIFVTGHPEYDRYTLRDEYY ---------33331111------------------1111-------11111111------ RDIGRNLKVPIPANYFPNDDPTKTPILTWWSHAHLFFSNWLNYCIYQKT --1111---------22223333---------------------1111- >UBIQUITIN-CONJUGATING ENZ; SWP:Q8IDP1; PDB:2H2YA; KPSRTVEKHIKTKYNLGNANYRIQKELNNFLKNPPINCTIDVHPSNIRIWIVQYVGLENT -----------------3333-------------2222----2222-------------1 IYANEVYKIKIIFPDNYPLKPPIVYFLQKPPKHTHVYSNGDICLSVLGDDYNPSLSISGL 111-----------------------------11113333---3333----1111----- ILSIISMLSSAKE ------------- >PEPTIDE METHIONINE SULFOX; SWP:P14930; PDB:2H30A; ATVPHTSTKTADNRPASVYLKKDKPTLIKFWASWCPLCLSELGQAEKWAQDAKFSSANLI -3333----1111---1111-----------1111---------------3333------ TVASPGFLHEKKDGEFQKWYAGLNYPKLPVVTDNGGTIAQNLNISVYPSWALIGKDGDVQ ---2222------------1111-1111----2222-----------------1111--- RIVKGSINEAQALALIRNPNADLGSLKHS -----------------111133331111 >MULTIFUNCTIONAL PROTEIN A; SWP:P22234; PDB:2H31A; LNIGKKLYEGKTKEVYELLDSPGKVLLQSKGKAAISNKITSCIFQLLQEAGIKTAFTRKC ------------------------------------------------------------ GETAFIAPQCEIPIEWVCRRIATGSFLKRNPGVKEGYKFYPPKVELFFKDDANNDPQWSE ------------------------3333-----2222---------------------11 EQLIAAKFCFAGLLIGQTEVDISHATQAIFEILEKSWLPQNCTLVDKIEFGVDVTTKEIV 113333---iiii-----------------------3333-------------------- LADVIDNDSWRLWPSGPEGLQVKKNFEWVAERVELLLKSESQCRVVVLGSTSDLGHCEKI -----1111---------------3333----3333-------------1111------- KKACGNFGIPCELRVTSAHKGPDETLRIKAEYEGDGIPTVFVAVAGRSNGLGPVSGNTAY --3333----------3333------------1111--------------3333------ PVISCPPLTPDWGVQDVWSSLRLPSGLGCSTVLSPEGSAQFAAQIFGLSNHLVWSKLRAS -----------33333333----------------------------------------- ILNTWISLKQADKKIRECNL --------------3333-- >SERINE/THREONINE-PROTEIN ; SWP:P72001; PDB:2H34A; GPYRLRRLVGRGGGDVYEAEDTVRERIVALKLSETLSSDPVFRTRQREARTAGRLQEPHV --------------------------------------------------1111--1111 
VPIHDFGEIDGQLYVDRLINGVDLAALRRQGPLAPPRAVAIVRQIGSALDAAHAAGATHR ----------------------3333--------------------------1111---- DVKPENILVSADDFAYLVDFGIGTLYYAPERFSEYRADIYALTCVLYECLTGSPPYQGDQ --1111-----------------33333333----------------------------- LSVGAHINQAIPRPSTVRPGIPVAFDAVIARGAKNPEDRYVTCGDLSAAAHAALA ---------------------3333---------1111------------1111- >Transcriptional activator; SWP:A0R8D8; PDB:2H3GX; MIFVLDVGNTNAVLGVFEEGELRQHWRMETDRHKTEDEYGMLVKQLLEHEGLSFEDVKGI -----------------iiii---------1111-------------1111-3333---- IVSSVVPPIMFALERMCEKYFKIKPLVVGPGIKTGLNIKYENPREVGADRIVNAVAGIHL -----3333-------------------2222---------3333-3333---------- YGSPLIIVDFGTATTYCYINEEKHYMGGVITPGIMISAEALYSIEITKPSSVVGKNTVSA -------------------1111------------------------------------- MQSGILYGYVGQVEGIVKRMKEEAKQEPKVIATGGLAKLISEESNVIDVVDPFLTLKGLY --------------------1111---------1111---1111------1111------ MLYERNA ------- >HYPOTHETICAL PROTEIN PA43; SWP:Q9HW42; PDB:2H3JA; MSALQPSRSYRITGYSPAISNGYRQRLFSMGLLPGAALRVVRIAPLGDPIQVETRQTSLA ---------------1111%%%%--------------------3333------------- LRRKDLALLTLVPLD -33331111------ >HAPTOGLOBIN-BINDING SURFA; SWP:Q6G8J7; PDB:2H3KA; ADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAE ---3333---3333---------------------------------------------- VELDINTASTWKKFEVYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQID ------1111---------------------3333---------%%%%------------ DGEETNYDYTKLVFAKPIYNDPSL ------------------------ >LAP2 PROTEIN; SWP:Q96RT1; PDB:2H3LA; GSHMGHELAKQEIRVRVEKDPELGFSISGGVGGRGNPFRPDDDGIFVTRVQPEGPASKLL --2222-----------------------2222-----1111-----------1111--- QPGDKIIQANGYSFINIEHGQAVSLLKTFQNTVELIIVREVSS 2222----iiii------------------------------- >CGMP-SPECIFIC 3',5'-CYCLI; SWP:O76074; PDB:2H44A; EETRELQSLAAAVVPSAQTLKITDFSFSDFELSDLETALCTIRMFTDLNLVQNFQMKHEV --3333--1111---3333----1111-1111-------------------1111----- LCRWILSVKKNYRKNVAYHNWRHAFNTAQCMFAALKAGKIQNKLTDLEILALLIAALSHD 
--------11111111-------------------11113333--------------111 LDHRGVNNSYIQRSEHPLAQLYCHSIMEHHHFDQCLMILNSPGNQILSGLSIEEYKTTLK 1-----------1111------------------------22221111------------ IIKQAILATDLALYIKRRGEFFELIRKNQFNLEDPHQKELFLAMLMTACDLSAITKPWPI ------------------------1111--1111----------------3333------ QQRIAELVATEFFDQGDRERKELNIEPTDLMNREKKNKIPSMQVGFIDAICLQLYEALTH ---------------------------333311111111--------------------- VSEDCFPLLDGCRKNRQKWQALAEQQ -1111--------------------- >YONK PROTEIN; SWP:O31947; PDB:2H4OA; ASKKVHQINVKGFFDDVEVTEQTKEAEYTYDFKEILSEFNGKNVSITVKEENELPVKGVE ------------------------------------1111-------------------- >HETEROCHROMATIN-ASSOCIATE; SWP:O73790_CHICK; PDB:2H4PA; VSASIGNFTVDLFNKLNETNRDKNIFFSPWSISSALALTYLAAKGSTAREMAEVLHFTEA ---------------------------------------1111----------------- HEQAENIHSGFKELLTAFNKPRNNYSLRSANRIYVEKTYALLPTYLQLSKKYYKAEPQKV ---3333---------1111---------------1111--------------------- NFKTAPEQSRKEINTWVEKQTESKIKNLLSSDDVKATTRLILVNAIYFKAEWEVKFQAEK --------------------%%%%-----1111-1111------------------3333 TSIQPFRLSKNKSKPVKMMYMRDTFPVLIMEKMNFKMIELPYVKRELSMFILLPDDIKDG --------1111------------------1111-------2222--------------- TTGLEQLERELTYERLSEWADSKMMTETLVDLHLPKFSLEDRIDLRDTLRNMGMTTAFTT ---------------------3333------------------------1111-333311 NADFRGMTDKKDLAISKVIHQSFVAVDEKGTEAAAATAVIIS 11------------------------3333------------ >THIOESTERASE SUPERFAMILY ; SWP:Q9NPJ3; PDB:2H4UA; RNFERVLGKITLVSAAPGKVICEMKVEEEHTNAIGTLHGGLTATLVDNISTMALLCTERG --33331111-----2222-------3333-1111------------------------- APGVSVDMNITYMSPAKLGEDIVITAHVLKQGKTLAFTSVDLTNKATGKLIAQGRHTKHL ----------------2222---------------------------------------- G - >RECEPTOR-TYPE TYROSINE-PR; SWP:P23470; PDB:2H4VA; YFQSMKQFVKHIGELYSNNQHGFSEDFEEVQRCTADMNITAEHSNHPENKHKNRYINILA -----------------%%%%-------------------3333-33331111-1111-- YDHSRVKLRPLPHSDYINANYVDGYNKAKAYIATQGPLKSTFEDFWRMIWEQNTGIIVMI 3333--------1111---------------------1111------------------- 
TNLVEKGRRKCDQYWPTENSEEYGNIIVTLKSTKIHACYTVRRFSIRNTKERVVIQYHYT ----iiii--------------!!!!---------1111--------------------- QWPDMGVPEYALPVLTFVRRSSAARMPETGPVLVHCSAGVGRTGTYIVIDSMLQQIKDKS ---------------------3333----------------------------------- TVNVLGFLKHIRTQRNYLVQTEEQYIFIHDALLEAILG ----------11112222-------------------- >DNA-3-METHYLADENINE GLYCO; SWP:Q9KC25; PDB:2H56A; HRYFSTDSPEVKTIVAQDSRLFQFIEIAGEVQLPTKPNPFQSLVSSIVEQQLSIKAASAI ----1111----------------------------------------22223333---- YGRVEQLVGGALEKPEQLYRVSDEALRQAGVSKRKIEYIRHVCEHVESGRLDFTELEGAE -------------33331111-----1111---------------1111----------3 ATTVIEKLTAIKGIGQWTAEFFSLGRLDVLSVGDVGLQRGAKWLYGNGEGDGKKLLIYHG 333-------22223333------------1111-------------------------- KAWAPYETVACLYLWKAAGTFAEEYRSLEELLHH --------------------------3333---- >ADP-RIBOSYLATION FACTOR-L; SWP:Q9H0F7; PDB:2H57A; EVHVLCLGLDNSGKTTIINKLKPSNAQSQNILPTIGFSIEKFKSSSLSFTVFDMSGQGRY --------2222------11113333------------------------------3333 RNLWEHYYKEGQAIIFVIDSSDRLRMVVAKEELDTLLNHPDIKHRRIPILFFANKMDLRD 3333--3333--------11113333------------1111------------3333-- AVTSVKVSQLLCLENIKDKPWHICASDAIKGEGLQEGVDWLQDQI -----------3333---------------2222----------- >KINESIN-LIKE PROTEIN KIFC; SWP:Q9BVG8; PDB:2H58A; NIRVIARVRPVTKEDGEGPEATNAVTFDADDDSIIHLLHKGKPVSFELDKVFSPQASQQD -----------3333--3333-----------------iiii----------1111---- VFQEVQALVTSCIDGFNVCIFAYGQTGAGKTYTMEGTAENPGINQRALQLLFSEVQEKAS 33333333-3333-----------2222--------3333--------------111133 DWEYTITVSAAEIYNEVLRDLLGKEPQEKLEIRLCPDGSGQLYVPGLTEFQVQSVDDINK 33-----------%%%%--1111-----------1111-----2222------------- VFEFGHTNRTTEFTNLNEHSSRSHALLIVTVRGVDCSTGLRTTGKLNLVDLAGSERVGSR ------33331111-----1111------------------------------------- LREAQHINKSLSALGDVIAALRSRQGHVPFRNSKLTYLLQDSLSGDSKTLMVVQVSPVEK --------------------1111----3333------3333-!!!!----------333 NTSETLYSLKFAERVR 3--------------- >ALPHA-LYTIC PROTEASE; SWP:P00778; PDB:2H5CA; ANIVGGIEYSINNASLCSVGFSVTRGATKGFVTAGHCGTVNATARIGGAVVGTFAARVFP 
------------------------!!!!-----1111-2222------------------ GNDRAWVSLTSAQTLLPRVANGSSFVTVRGSTEAAVGAAVCRSGRTTGYQCGTITAKNVT ---------3333-------!!!!----------2222---------------------- ANYAEGAVRGLTQGNACMGRGDSGGSWITSAGQAQGVMSGGNVQSNGNNCGIPASQRSSL ------------------2222---------------------1111-11113333---- FERLQPILSQYGLSLVTG ----3333---------- >DENMOTOXIN; SWP:Q06ZW0; PDB:2H5FA; VGLPHGFCIQCNRKTWSNCSIGHRCLPYHMTCYTLYKPDENGEMKWAVKGCARMCPTAKS -----------1111--1111----2222-----------------------------22 GERVKCCTGASCNSD 22--------1111- >DELTA 1-PYRROLINE-5-CARBO; SWP:P54886; PDB:2H5GA; PTVEQQGEARSGGRLATLEPEQRAEIIHHLADLLTDQRDEILLANKKDLEEAEGRLAAPL --------------11113333-----------------------------2222----- LKRLSLSTSKLNSLAIGLRQIAASSQDSVGRVLRRTRIAKNLELEQVTVPIGVLLVIFES 1111--------------------1111----------2222------------------ RPDCLPQVAALAIASGNGLLLKGGKEAAHSNRILHLLTQEALSIHGVKEAVQLVNTREEV 3333--------1111-------3333-------------3333--3333----3333-- KIDLIIPRGSSQLVRDIQKAAKGIPVGHSEGICHYVDSEASVDKVTRLVRDSKCEYPAAC ------------------------------------11111111-----------1111- NALETLLIHRDLLRTPLFDQIIDLRVEQVKIHAGPKFASKSLRTEYGDLELCIEVVDNVQ --------3333------------1111------3333---------------------- DAIDHIHKYGSSHTDVIVTEDENTAEFFLQHVDSACVFWNASTRFSDGYRFGLGAEVGIS -----------------------------------------1111-3333---------- TSRIHARGPVGLEGLLTTKWLLRGKDHVVSDFSEHGSLKYLHENLPIPQRN ----------3333-------------3333-1111--------------- >HYPOTHETICAL PROTEIN PG_1; SWP:Q7MVF6; PDB:2H5NA; MGLGRQSLNIMTFSGQELTAIIKMAKSMVMADGKIKPAEIAVMTREFMRFGILQDQVDLL -----------------------------1111--3333------3333---1111---- LKASDSIEASQAVALIARMDEERKKYVASYLGVIMASDGDIDDNELALWTLISTLCGLPT -3333--3333----1111-------------1111iiii-------------------- MTVMEAINNMKN ------------ >MORANGE; SWP:Q5S3G8; PDB:2H5OA; MAIIKEFMRFKVRMEGSVNGHEFEIEGEGEGRPYEGFQTAKLKVTKGGPLPFAWDILSPQ -----------------iiii----------1111-----------------33331111 FSKAYVKHPADIPDYFKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGT 
-3333---1111-3333--------------1111-----------iiii---------- NFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKMRLKLKDGGHYTSEVKTTYKAKKPV --1111-------------------iiii----------1111----------------- QLPGAYIVGIKLDITSHNEDYTIVEQYERAEGRH -----------------1111------------- >Holliday junction ATP-dep; SWP:P66744; PDB:2H5XA; MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEARLITAMIVREDSMTLY --------------------iiii------333311112222----------1111---- GFPDGETRDLFLTLLSVSGVGPRLAMAALAVHDAPALRQVLADGNVAALTRVPGIGKRGA ----------------2222-------------------------------2222----- ERMVLELRDKVGVAVRSPVVEALVGLGFAAKQAEEATDTVLAANHDATTSSALRSALSLL ------------------------------------------------------------ GKA --- >PROBABLE GLOBAL TRANSCRIP; SWP:Q8R569; PDB:2H60A; KMKKIVDAVIKYKDSSSGRQLSEVFIQLPSRKELPEYYELIRKPVDFKKIKERIRNHKYR --------------1111-3333------3333-3333--------------3333---- SLNDLEKDVMLLCQNAQTFNLEGSLIYEDSIVLQSVFTSVRQKIE --------------------------------------------- >BILIVERDIN REDUCTASE A; SWP:P53004; PDB:2H63A; MRKFGVVVVGVGRAGSVRMRDLRNPHPSSAFLNLIGFVSRRELGSIDGVQQISLEDALSS -------------------------3333------------------------------3 QEVEVAYICSESSSHEDYIRQFLNAGKHVLVEYPMTLSLAAAQELWELAEQKGKVLHEEH 333-------3333--------1111---------------------------------3 VELLMEEFAFLKKEVVGKDLLKGSLLFTAGPLEEERFGFPAFSGISRLTWLVSLFGELSL 333-----------1111--------------3333--3333------------------ VSATLEERKQYMKMTVCLETEKKSPLSWIEEKGPGLKRNRYLSFHFKSGSLENVPNVGVN -------------------1111----------------------1111----------- KNIFLKDQNIFVQKLLGQFSEKELAAEKKRILHCLGLAEEIQKY -------------1111--------------------------- >INSULIN A CHAIN; SWP:NA; PDB:2H67B; FVNQALCGSDLVEALYLVCGERGFFYTKPT -------3333--------1111------- >CHLOROPHENOL REDUCTION GE; SWP:Q18R04; PDB:2H6BA; VEGLGKDFCGAIIPDNFFPIEKLRNYTQMGLIRDFAKGSAVIMPGEEITSMIFLVEGKIK -------iiii--------------3333------2222---1111-------------- LDIIFEDGSEKLLYYAGGNSLIGKLYPTGNNIYATAMEPTRTCWFSEKSLRTVFRTDEDM ----1111--------2222---------------------------------------- 
IFEIFKNYLTKVAYYARQVAEMNTYNPTIRILRLFYELCSSQGKRVGDTYEITMPLSQKS -------------------------3333----------------!!!!----------- IGEITGVHHVTVSRVLACLKRENILDKKKNKIIVYNLGELKHLSEQTSVDKLAAALDHH -------------------1111-------------------1111---------1111 >CHLOROPHENOL REDUCTION GE; SWP:Q9LAS2; PDB:2H6CA; FFPIEKLRNYTDMGIIREFAKGSAIIMPGEDTTSMIFLMDGKIKLDIIFEDGSEKLLYYA ---3333--3333------2222------------------------------------- GSNSLIGRLYPTGNNIYATAMEQTRTCWFSEECLRVIFRTDEDMIFEIFKNYLTKVAYYA 2222-------------------------3333--------3333--------------- RQVAEINTYNPTIRILRLFYELCSSQGKRVGDTYEITMPLSQKSIGEITGAHHVTVSKVL --1111---3333----------------!!!!------------------3333----- ACLKKENILDKKKNKFIVYNLEELKHLS -----------1111----33333333- >5'-AMP-activated protein ; SWP:P54646; PDB:2H6DA; RVKIGHYVLGDTLGVGTFGKVKIGEHQLTGHKVAVKILNRQKIRSLDVVGKIKREIQNLK ---!!!!--------1111--------------------------------------333 LFRHPHIIKLYQVISTPTDFFMVMEYVSGGELFDYICKHGRVEEMEARRLFQQILSAVDY 3--1111--------1111----------------------------------------- CHRHMVVHRDLKPENVLLDAHMNAKIADFGLSNMMSDAAPEVISGRLYAGPEVDIWSCGV -----------3333---1111-------3333-----3333------------------ ILYALLCGTLPFDDEHVPTLFKKIRGGVFYIPEYLNRSVATLLMHMLQVDPLKRATIKDI -------------------------------1111--------------3333------- REHEWFKQDLPSYLFP ---3333---3333-- >PROTEIN FARNESYLTRANSFERA; SWP:P49354; PDB:2H6FA; FVSLDSPSYVLYRDRAEWADIDPVPQNDGPNPVVQIIYSDKFRDVYDYFRAVLQRDERSE --1111----33331111------------------------------------------ RAFKLTRDAIELNAANYTVWHFRRVLLKSLQKDLHEEMNYITAIIEEQPKNYQVWHHRRV ------------1111-----------1111----------------1111--------- LVEWLRDPSQELEFIADILNQDAKNYHAWQHRQWVIQEFKLWDNELQYVDQLLKEDVRNN -------1111------3333---------------1111-1111----------1111- SVWNQRYFVISNTTGYNDRAVLEREVQYTLEMIKLVPHNESAWNYLKGILQDRGLSKYPN -----------------------------------1111-----------33331111-- LLNQLLDLQPSHSSPYLIAFLVDIYEDMLENQCDNKEDILNKALELCEILAKEKDTIRKE -------3333----------------1111-----------------------3333-- YWRYIGRSLQSKHST --------------- >Protein 
farnesyltransfera; SWP:P49356; PDB:2H6FB; SSPVWSEPLYSLRPEHARERLQDDSVETVTSIEQAKVEEKIQEVFSSYKFNHLVPRLVLQ --1111--3333----1111--%%%%-------------------1111!!!!------- REKHFHYLKRGLRQLTDAYECLDASRPWLCYWILHSLELLDEPIPQIVATDVCQFLELCQ ---------------33331111--------------1111---------------1111 SPEGGFGGGPGQYPHLAPTYAAVNALCIIGTEEAYDIINREKLLQYLYSLKQPDGSFLMH 1111----2222----------------------3333---------11111111----2 VGGEVDVRSAYCAASVASLTNIITPDLFEGTAEWIARCQNWEGGIGGVPGMEAHGGYTFC 222--------------------11112222----11113333----2222--------- GLAALVILKRERSLNLKSLLQWVTSRQMRFEGGFQGRCNKLVDGCYSFWQAGLLPLLHRA ---------3333----------1111---------2222-------------------- LHAQGDPALSMSHWMFHQQALQEYILMCCQCPAGGLLDKPGKSRDFYHTCYCLSGLSIAQ -11111111---------------------1111----2222------------------ HFGSGAMLHDVVLGVPENALQPTHPVYNIGPDKVIQATTYFLQKPVPGFE ---!!!!-------3333----------------------1111-2222- >HYPOTHETICAL PROTEIN; SWP:O30132; PDB:2H6LA; KVFEFEVGKGFLLRLDYGKDLVRQIEEFLEEKGIHAAHISAIGAVRSAVIGYYDQEKKEY ---------------2222----------------------------------------- VKKELEPLEILSLSGNVSKDSKPFCHIHVLLGKDGEVYGGHLFSAEVFACEVFVLPLSGE ------------------%%%%----------!!!!------------------------ APERAFDEQTGLFLWLE -------1111------ >TRIOSEPHOSPHATE ISOMERASE; SWP:Q58923; PDB:2H6RA; MVIVINYKTYNESIGNRGLEIAKIAEKVSEESGITIGVAPQFVDLRMIVENVNIPVYAQH ---------1111!!!!-----------------------11113333------------ IDNINPGSHTGHILAEAIKDCGCKGTLINHSEKRMLLADIEAVINKCKNLGLETIVCTNN --------2222-33333333--------1111--3333--------------------3 INTSKAVAALSPDCIAVEPPEVVEGTVRAVKEINKDVKVLCGAGISKGEDVKAALDLGAE 333---3333------------3333-------1111----------------1111--- GVLLASGVVKAKNVEEAIRELIK ----3333--------------- >5-HYDROXYISOURATE HYDROLA; SWP:NA; PDB:2H6UA; LSPLSTHVLNIAQGVPGANMTIVLHRLDPVSSAWNILTTGITNDDGRCPGLITKENFIAG ------------------------------------------1111------3333---- VYKMRFETGKYWDALGETCFYPYVEIVFTITNTSQHYHVPLLLSRFSYSTYRGS ------------1111-------------------------------------- >3-HYDROXYISOBUTYRATE DEHY; SWP:Q9I5I6; PDB:2H78A; 
KQIAFIGLGHGAPATNLLKAGYLLNVFDLVQSAVDGLVAAGASAARSARDAVQGADVVIS -----------------1111----------------1111------------------- LPASQHVEGLYLDDDGLLAHIAPGTLVLECSTIAPTSARKIHAAARERGLALDAPVSGGT --1111------22223333---------------------------------------- AGAAAGTLTFVGGDAEALEKARPLFEAGRNIFHAGPDGAGQVAKVCNNQLLAVLIGTAEA ------------------------------------------------------------ ALGVANGLEAKVLAEIRRSSGGNWALEVYNPWPGVENAPASRDYSGGFAQLAKDLGLAQE ---1111-3333------11113333-----2222--3333%%%%--------------- AAQASASSTPGSLALSLYRLLLKQGYAERDFSVVQKLFDPTQ ---------------------11111111-----3333---- >THRA PROTEIN; SWP:Q6FH41; PDB:2H79A; ARGSHMEEMIRSLQQRPEPTPEEWDLIHIATEAHRSTNAQGSHWKQRRKFLPDDIGQSPI ----------1111-----------------------2222----------1111----- VSMPDGDKVDLEAFSEFTKIITPAITRVVDFAKKLPMFSELPEDQIILLKGCCMEIMSLR --1111----------------------------3333---------------------- AAVRYDPESDTLTLSGEMAVKREQLKNGGLGVVSDAIFELGKSLSAFNLDDTEVALLQAV 1111--1111---------------------------------3333------------- LLMSTDRSGLLVDKIEKSQEAYLLAFEHYVNHRKHNIPHFWPKLLMKVTDLRMIGAHASR ------------------------------3333-------------------------- FLHKVEPTELFPPLFLEVFEDQ ------3333------------ >CORE-BINDING FACTOR, ML1-; SWP:Q7Z4J5; PDB:2H7BA; SARQLSKLKRFLTTLQQFGNDISPEIGERVRTLVLGLVNSTLTIEEFHSKLQEATNFPLR -33331111--------1111---------------------3333-------------- PFVIPFLKANLPLLQRELLHAARLAKQNPAQYLAQHEQLLLDAS --1111---3333--33333333----1111------------- >LIVER CARBOXYLESTERASE 1; SWP:P23141; PDB:2H7CA; SSPPVVDTVHGKVLGKFVSLEGFAQPVAIFLGIPFAKPPLGPLRFTPPQPAEPWSFVKNA -------1111----------------------------!!!!----------------- TSYPPMCTQDPKAGQLLSELFTNRKENIPLKLSEDCLYLNIYTPADLTKKNRLPVMVWIH ---------------------------------------------1111----------- GGGLMVGAASTYDGLALAAHENVVVVTIQYRLGIWGFFSTGDEHSRGNWGHLDQVAALRW -%%%%--3333----------------------1111----3333--------------- VQDNIASFGGNPGSVTIFGESAGGESVSVLVLSPLAKNLFHRAISESGVALTSVLVKKGD ---3333---1111------------------3333------------11111111---- 
VKPLAEQIAITAGCKTTTSAVMVHCLRQKTEEELLETTLKMKFLSLDLQGDPRESQPLLG ---------1111-----------3333----------------------1111------ TVIDGMLLLKTPEELQAERNFHTVPYMVGINKQEFGWLIPMLMSYPLSEGQLDQKTAMSL ----------33331111--------------1111----------1111---------- LWKSYPLVCIAKELIPEATEKYLGGTDDTVKKKDLFLDLIADVMFGVPSVIVARNHRDAG ---3333---3333-------------3333-------------------------1111 APTYMYEFQYRPSFSSDMKPKTVIGDHGDELFSVFGAPFLKEGASEEEIRLSKMVMKFWA -----------111111111111--2222---1111------------------------ NFARNGNPNGEGLPHWPEYNQKEGYLQIGANTQAAQKLKDKEVAFWTNLFAK ---------2222------1111-------------2222------------ >CATHEPSIN S; SWP:P25774; PDB:2H7JA; ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE ------3333---------!!!!-----------------------------------33 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYG 33--!!!!-------------------3333-----------3333-----------222 REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEY 2---------------------3333---------1111---------------iiii-- WLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI -------3333-iiii------%%%%-1111------- >ENOYL-[ACYL-CARRIER-PROTE; SWP:P0A5Y6; PDB:2H7MA; TGLLDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFDRLRLIQRITDRLPAKAPLL -1111----------1111---------1111-----------------1111------- ELDVQNEEHLASLAGRVTEAIGAGNKLDGVVHSIGFMPQTGMGINPFFDAPYADVSKGIH --1111---------------2222------------3333----3333----------- ISAYSYASMAKALLPIMNPGGSIVGMDFDPSRAMPAYNWMTVAKSALESVNRFVAREAGK ------------3333-2222-----------------------------------3333 YGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLLEEGWDQRAPIGWNMKDATPVAKT -----------------------1111-----------------1111-1111------- VCALLSDWLPATTGDIIYADGGAHTQLL --------1111---------1111--- >PROTEIN KINASE YPKA; SWP:Q05608; PDB:2H7OA; RITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTMLVALDKAEREVDKDQLKSFNSL --------------------1111-------------------3333------------- ILKTYRVIEDYVKGNFMLSIVEPSLQRIQKHLDQTHSFSDIGSLVRAHKHLETLLEVLVT --------------3333-----------1111---3333-------------------- 
LSQPVSSETYGFLNRLAEAKITLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQSLQ ------------------------------------------------------------ RFDSTRPVVKFGTEQYTAIHRQMMAAHAAITLQEVSEFTDDMRNFTVDSIPLLIQLGRSS -------------3333-----------------1111---------------------- LMDEHLVEQREKLRELTTIAERLNRLEREW ------------------------------ >CHAGASIN; SWP:Q966X9; PDB:2H7WA; HKVTKAHNGATLTVAVGELVEIQLPSNPTTGFAWYFEGGTKESPNESMFTVENKYFPPDS ---3333-------2222--------3333-----2222-----3333---------333 KLLGAGGTEHFHVTVKAAGTHAVNLTYMRPWTGPSHDSERFIVYLKAN 32222-----------------------1111--1111---------- >IRDITOXIN; SWP:A0S864; PDB:2H7ZA; QAVGPPYTLCFECNRMTSSDCSTALRCYRGSCYTLYRPDENCELKWAVKGCAETCPTAGP -------------3333---------------------1111----------------11 NERVKCCRSPRCNDD 11------2222--- >Irditoxin subunit B [Prec; SWP:A0S865; PDB:2H7ZB; QAKGPPYTLCFECNRETCSNCFKDNRCPPYHRTCYTLYRPDGNGEMKWAVKGCAKTCPTA -------------3333----------2222----------------------------- QPGESVQCCNTPKCNDY 2222------2222--- >STEELY1; SWP:Q55E72; PDB:2H84A; NNSFVLGIGISVPGEPISQQSLKDSISNDFSDKAETNEKVKRIFEQSQIKTRHLVRDYTK --------------------------------------------------------3333 PENSIKFRHLETITDVNNQFKKVVPDLAQQACLRALKDWGGDKGDITHIVSVTSTGIIIP 11111111---------------------------------3333--------------- DVNFKLIDLLGLNKDVERVSLNLMGCLAGLSSLRTAASLAKASPRNRILVVCTEVCSLHF 3333---1111-1111--------1111----------11113333---------3333- SNTDGGDQMVASSIFADGSAAYIIGCNPRIEETPLYEVMCSINRSFPNTENAMVWDLEKE ----3333--------------------1111----------------1111-----333 GWNLGLDASIPIVIGSGIEAFVDTLLDKAKLQTSTAISAKDCEFLIHTGGKSILMNIENS 3-----1111----1111----------1111------1111----------------11 LGIDPKQTKNTWDVYHAYGNMSSASVIFVMDHARKSKSLPTYSISLAFGPGLAFEGCFLK 11-3333--------------3333-------1111------------------------ NVV --- >PUTATIVE ORF1AB POLYPROTE; SWP:Q6VA80; PDB:2H85A; QSLENVAYNVVNKGHFDGHAGEAPVSIINNAVYTKVDGIDVEIFENKTTLPVNVAFELWA ---------------------------%%%%----iiii-------------------11 KRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKPTESACSSL 
11------3333------------------------------1111----11111111-- TVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIGESVKTQFNYF ----1111--------------------2222----------iiii-------------- KKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKLEGYAFEHIVYGD --iiii-------------3333------------------------2222--------- FSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQTGSSKCGCSVIDLLL ----------3333---3333--------------------------------------- GDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYPKL -----3333-------------iiii--------iiii-------- >SUCCINATE DEHYDROGENASE F; SWP:Q8K2B3; PDB:2H88A; TQYPVVDHEFDAVVVGAGGAGLRAAFGLSEAGFNTACVTKLFPTRSHTVAAQGGINAALG ----------------------------1111---------11113333----------- NMEDDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPAAVIELENYGMPFSRTEEGKIYQRA -----3333--------------------------------1111-----1111------ FGGQSLQFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRYDTSYFVEYFALDLLMENGECR 2222--iiii-------------------------1111---------------iiii-- GVIALCIEDGTIHRFRAKNTVIATGGYGRTYFSCTSAHTSTGDGTAMVTRAGLPCQDLEF ------------------------------------1111--------1111----3333 VQFHPTGIYGAGCLITEGCRGEGGILINSQGERFMERYAPVAKDLASRDVVSRSMTIEIR -------------------1111----1111--3333----!!!!-------------11 EGRGCGPEKDHVYLQLHHLPPQQLATRLPGISETAMIFAGVDVTKEPIPVLPTVHYNMGG 11---1111------11113333------------------1111--------------- IPTNYKGQVITHVNGEDKVVPGLYACGEAASASVHGANRLGANSLLDLVVFGRACALTIA ---1111-----%%%%---2222---3333----!!!!-2222---------------11 ETCKPGEPVPSIKPNAGEESVANLDKLRFADGTIRTSEARLNMQKTMQSHAAVFRTGSIL 11-2222-----11113333--------------3333---------------------- QEGCEKLSQIYRDLAHLKTFDRGIVWNTDLVETLELQNLMLCALQTIYGAEARKESRGAH --------------------------------------------------------!!!! 
AREDYKLRIDEFDYSKPLQGQQKRPFEEHWRKHTLSYVDVKSGKVTLKYRPVIDRTLNEE -1111------------2222---1111-----------------------------333 DCSSVPPAIRSY 3----------- >INSULIN-LIKE 3; SWP:NA; PDB:2H8BB; PTPEMREKLCGHHFVRALVRVCGGPRWSTEA ---------!!!!------------------ >HEMOGLOBIN ALPHA SUBUNIT; SWP:P80043; PDB:2H8FA; SLSDKDKAAVRALWSKIGKSADAIGNDALSRMIVVYPQTKTYFSHWPDVTPGSPHIKAHG ----------------3333----------------------1111---2222------- KKVMGGIALAVSKIDDLKTGLMELSEQHAYKLRVDPANFKILNHCILVVISTMFPKEFTP ----------1111--------------------3333----------------3333-- EAHVSLDKFLSGVALALAERYR ---------------1111--- >Hemoglobin subunit beta; SWP:P80044; PDB:2H8FB; VEWTDKERSIISDIFSHMDYDDIGPKALSRCLIVYPWTQRHFSGFGNLYNAEAIIGNANV ------------------3333----------------1111------------------ AAHGIKVLHGLDRGVKNMDNIAATYADLSTLHSEKLHVDPDNFKLLSDCITIVLAAKMGH ----------3333--11113333--------------3333---------------!!! AFTAETQGAFQKFLAVVVSALGKQYH !------------------1111--- >5'-METHYLTHIOADENOSINE NU; SWP:Q9T0I8; PDB:2H8GA; ILRPISSVVFVIAMQAEALPLVNKFGLSETTDSPLGKGLPWVLYHGVHKDLRINVVCPGR -------------3333------------------------------!!!!--------- DAALGIDSVGTVPASLITFASIQALKPDIIINAGTCGGFKVKGANIGDVFLVSDVVFHDR --------------------------------------3333--2222------------ RIPIPMFDLYGVGLRQAFSTPNLLKELNLKIGRLSTGDSLDMSTQDETLIIANDATLKDM --------------------------------------------------1111------ EGAAVAYVADLLKIPVVFLKAVTDLVDGDKPTAEEFLQNLTVVTAALEGTATKVINFING 3333--------------------1111-----------------------------222 RNLSDL 23333- >SULT1C3 SPLICE VARIANT D; SWP:Q6IMI6; PDB:2H8KA; FNIMEVDGVPTLILSKEWWEKVCNFQAKPDDLILATYPKSGTTWMHEILDMILNQLIKTH ---------------------------3333----------3333--------------- LPSHLIPPSIWKENCKIVYVARNPKDCLVSYYHFHRMASFMPDPQNLEEFYEKFMSGKVV -1111--------------------------------3333----3333---------22 GGSWFDHVKGWWAAKDMHRILYLFYEDIKKDPKREIEKILKFLEKDISEEILNKIIYHTS 22-----------3333------3333-------------1111---------------- FDVMKQNPMTNYTTLPTSIMDHSISPFMRKGMPGDWKNYFTVAQNEEFDKDYQKKMAGST 
------1111-----3333------------22221111--------------1111--- LTFRT ----- >PROTEIN DISULFIDE-ISOMERA; SWP:P30101; PDB:2H8LA; PASVPLRTEEEFKKFISDKDASIVGFFDDSFSEAHSEFLKAASNLRDNYRFAHTNVESLV ----------------------------1111------------1111-------3333- NEYDDNGEGIILFRPSHLTNKFEDKTVAYTEQKTSGKIKKFIQENIFGICPHTEDNKDLI --------------3333----------------------------------3333---- QGKDLLIAYYDVDYEKNAKGSNYWRNRVVAKKFLDAGHKLNFAVASRKTFSHELSDFGLE ------------------------3333---------------------3333-1111-- STAGEIPVVAIRTAKGEKFVQEEFSRDGKALERFLQDYFDGNLKRYL ------------1111--------1111------------------- >GERANYLTRANSTRANSFERASE; SWP:Q8UBX7; PDB:2H8OA; DAQMTNFETRLRENAAKTEALLGHLLSGEARADEITRPQNLLEAMRHGVLNGGKRLRPFL -----------------------1111---2222--------------1111-------- VIESVALLGGDAEAGLHVGAALECLHCYSLVHDDLPAMDDDDLRRGQPTVHRKFDEATAI -----1111-------------------------3333-----iiii--3333------- LAGDSLLTLAFDIIASDDNPLAAERKAALVISLARAAGIGGMAGGQALDLAAEKKAPDED ---------------1111------------------1111------------------- GIITLQAMKTGALLRFACEAGAIIAGSNQAERQRLRLFGEKIGLSFQLADDLLDLTKGTL ----------------------------------------------------------33 VALRGEAWAREKLQEQVAEASELLAPYGEKAAILIAAARFIAE 33---------------------3333---------------- >XENOBIOTIC REDUCTASE A; SWP:Q88NF7; PDB:2H8ZA; SALFEPYTLKDVTLRNRIAIPPMCQYMAEDGMINDWHHVHLAGLARGGAGLLVVEATAVA 3333----!!!!----------------iiii---------------------------1 PEGRITPGCAGIWSDAHAQAFVPVVQAIKAAGSVPGIQIAHAGRKASANRPWEGDDHIAA 111--1111-------------------1111---------!!!!----1111-----11 DDTRGWETIAPSAIAFGAHLPKVPREMTLDDIARVKQDFVDAARRARDAGFEWIELHFAH 11--------------!!!!--------------------------1111---------- GYLGQSFFSEHSNKRTDAYGGSFDNRSRFLLETLAAVREVWPENLPLTARFGVLEYDGRD ---3333---------1111----------------3333-1111--------------- EQTLEESIELARRFKAGGLDLLSVSVGFTIPDTNIPWGPAFMGPIAERVRREAKLPVTSA --------------1111-------------------2222---------1111------ WGFGTPQLAEAALQANQLDLVSVGRAHLADPHWAYFAAKELGVEKASWTLPAPYAHWLE ------------1111-------3333-----------1111--3333--33331111- 
>CYTIDYLATE KINASE; SWP:Q1YBX2; PDB:2H92A; AINIALDGPAAAGKSTIAKRVASELSMIYVDTGAMYRALTYKYLKLNKTEDFAKLVDQTT ----------------------1111----3333-------------------------- LDLTYKADKGQCVILDNEDVTDFLRNNDVTQHVSYVASKEPVRSFAVKKQKELAAEKGIV -----1111-----%%%%-1111---3333---------------------1111----- MDGRDIGTVVLPDADLKVYMIASVEERAERRYKDNQLRGIESNFEDLKRDIEARDQYDMN ----3333--1111---------------------------------------------- REISPLRKADDAVTLDTTGKSIEEVTDEILAMVSQI --------1111--------3333--------1111 >CO dehydrogenase/acetyl-C; SWP:Q3ACS3; PDB:2H9AA; PPVALIKVGKGEKVLEIGHETVLFRHDKRFEHPCGLAILVEDTLSEGEIKERVEKINKLV ---------iiii----------3333--------------------------------- FDRVGQMHSVNLVALKGSSQDAATFAKAVATAREVTDLPFILIGTPEQLAAALETEGANN --iiii----------3333-----------------------------------3333- PLLYAATADNYEQMVELAKKYNVPLTVSAKGLDALAELVQKITALGYKNLILDPQPENIS ------1111-------------------------------------------------- EGLFYQTQIRRLAIKKLFRPFGYPTIAFALDENPYQAVMEASVYIAKYAGIIVLNTVEPA -----------------3333------------------------------------333 DILPLITLRLNIYTDPQKPIAVEPKVYEILNPGPDAPVFITTNFSLTYFCVAGDVEGARI 3-------------3333--------------1111-------3333------------- PAYILPVDTDGTSVLTAWAAGKFTPEKIAQFLKESGIAEKVNHRKAILPGGVAVLSGKLQ --------%%%%3333-------------------3333---------3333----3333 ELSGWEILVGPRESSGINSFIKQ ------------3333------- >CO dehydrogenase/acetyl-C; SWP:Q3ACS0; PDB:2H9AB; VEVLKEKWNSKVVEVTLGTGDKTVTLGGDSTLPFLTFEGEMPNPPRFALEVFDTPPTDWP ------------------!!!!---------2222------------------------3 DILVEPFKDVINDPVAWAKKCVEYGADIVALRLVSAHPDGQNRSGAELAEVCKAVADAID 3333333--1111--------1111------------------------------3333- VPLMIIGCGVEEKDAEIFPVIGEALSGRNCLLSSATKDNYKPIVATCMVHGHSVVASAPL ---------3333---------1111---------1111--------------------- DINLSKQLNIMIMEMNLAPNRIIMDPLIGALGYGIEYSYSIIERMRLGALTGDKILAMPV ------------1111-1111--------2222-------------------1111---- VCFIGQEAWKAKEAKDPEVAEWGDYALRAIHWETVTTVALIQAGGHLFVMRHPKSLAEVK ---3333---3333----3333-------------------------------------- EHLKRIL ------- 
>SALICYLATE BIOSYNTHESIS P; SWP:Q51507; PDB:2H9DA; MKTPEDCTGLADIREAIDRIDLDIVQALGRRMDYVKAASRFERVAAMLPERARWAEENGL --3333-------------------------------11113333----------1111- DAPFVEGLFAQIIHWYIAEQIKYW ------------------3333-- >Anti-coagulant protein C2; SWP:Q16938_ANCCA; PDB:2H9EC; CGENEKYDDKKCKYDGVECVCEEGFYRNKDDKCVSAEDCELDNMDFIYPGTR -2222--------2222-----------------33333333---------- >HYPOTHETICAL PROTEIN; SWP:Q9I5E5; PDB:2H9FA; AHPPQIRIPATYLRGGTSKGVFFRLEDLPESCRVPGEARDRLFRVIGSPDPYAAHIDGGG --------------!!!!-----3333-3333-----------3333--1111------- ATSSTSKCVILSKSSQPGHDVDYLYGQVSIDKPFVDWSGNCGNLSTGAGAFALHAGLVDP -3333----------2222---------------------3333--------------33 ARIPEDGICEVRIWQANIGKTIIAHVPVSGGQVQETGDFELDGVTFPAAEIVLEFLDPSD 33--------------------------iiii--------2222---------------- GGAIFPTGNLVDDLEVPGVGTFKATINAGIPTVFVNAEEIGYRGTELREEINGDPQQLAR ----1111-------2222----------------3333-------33331111------ FERIRVAGALRGLIKTPEEAATRQHTPKIAFVAPPRDYRTASGKLVAAGDIDLLVRALSG ---------------33331111----------------3333---3333---------- KLHHAGTAAVAIGTAAAIPGTLVNLAAGGGERSAVRFGHPSGTLRVGAEASQANGEWTVT ------------------------1111----------1111----------iiii---- KAISRSARILEGWVRVPGDAF ----------------1111- >Tumor necrosis factor rec; SWP:O14763; PDB:2H9GB; EVQLVESGGGLVQPGGSLRLSCAASGFSIGKSGIHWVRQAPGKGLEWVAVIYPHDGNTAY ------------2222-----------3333--------2222----------------- ADSVKGRFTISADTSKNTAYLQMNSL 3333---------1111--------- >ACETYLCHOLINESTERASE; SWP:P21836; PDB:2HA2A; EGREDPQLLVRVRGGQLRGIRLKAPGGPVSAFLGIPFAEPPVGSRRFMPPEPKRPWSGVL ----1111---1111--------1111--------------!!!!--------------- DATTFQNVCYQYVDTLYPGFEGTEMWNPNRELSEDCLYLNVWTPYPRPASPTPVLIWIYG ----------------2222---1111--------------------------------- GGFYSGAASLDVYDGRFLAQVEGAVLVSMNYRVGTFGFLALPGSREAPGNVGLLDQRLAL %%%%-----3333---------------------------2222-----3333------- QWVQENIAAFGGDPMSVTLFGESAGAASVGMHILSLPSRSLFHRAVLQSGTPNGPWATVS -----3333---1111------------------3333---------------------- 
AGEARRRATLLARLVGCPNDTELIACLRTRPAQDLVDHEWHVLPQESIFRFSFVPVVDGD ------------1111----------11113333-1111--------------------- FLSDTPEALINTGDFQDLQVLVGVVKDEGSYFLVYGVPGFSKDNESLISRAQFLAGVRIG --------------1111--------11113333--2222-------------------- VPQASDLAAEAVVLHYTDWLHPEDPTHLRDAMSAVVGDHNVVCPVAQLAGRLAAQGARVY 1111-------------3333-------------------------------1111---- AYIFEHRASTLTWPLWMGVPHGYEIEFIFGLPLDPSLNYTTEERIFAQRLMKYWTNFART -------1111--3333--22223333--33333333----------------------- GDPNDPRDSKSPQWPPYTTAAQQYVSLNLKPLEVRRGLRAQTCAFWNRFLPKLLSA ----3333---------3333----------------------------------- >TAR (HIV-1) RNA LOOP BIND; SWP:Q13395; PDB:2HA8A; LGKSISRLIVVASLIDKPTNLGGLCRTCEVFGASVLVVGSLQCISDKQFQHLSVSAEQWL -----------1111------------------------3333---------iiii---- PLVEVKPPQLIDYLQQKKTEGYTIIGVEQTAKSLDLTQYCFPEKSLLLLGNEREGIPANL -----3333--------1111--------1111-3333------------------3333 IQQLDVCVEIPQQGIIRSLNVHVSGALLIWEYTRQQLLS 1111-------------------------------1111 >UPF0210 PROTEIN SP0239; SWP:Q97ST4; PDB:2HA9A; ADIRQVTETIAIEEQNFDIRTITGISLLDCIDPDINRAAEKIYQKITTKAANLVAVGDEI --------------------------3333------------------------------ AAELGIPIVNKRVSVTPISLIGAATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGY ----------------3333-----------------------------------1111- QKGDEILINSIPRALAETDKVCSSVNIGSTKSGINTAVADGRIIKETANLSDGVAKLVVF ----------------------------------------------------3333---- ANAVEDNPFAAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKKTAFKITR ---3333------2222------------------3333--------------------- IGQLVGQASERLGVEFGIVDLSLAPTDSVARVLEEGLETVGTHGTTAALALLNDQVKKGG ---------3333----------------------------------------------- VACNDEGIAAVQNGSLNLEKLEATAICSDIAIPEDTPAETIAAIADEAAIGVINKTTAVR --------------------------------1111------------------------ IIPKGKEGDIEFTAPVKVNGASSVDFISRGGQIPAPI ----------------------33333333------- >PUTATIVE TRANSLATION REPR; SWP:Q9KUP8; PDB:2HAFA; SDIEVGHSMNLTNHFLVAMPSMKDPYFKRSVIYICEHNQDGAMGLMINAPIDITVGGMLK 
-----------------------3333--------------------------------3 QVDIEPAYPQSHQENLKKPVFNGGPVSEDRGFILHRPRDHYESSMKMTDDIAVTTSKDIL 333---------3333----------1111-------------------------3333- TVLGTEAEPEGYIVALGYSGWSAGQLEVELTENSWLTIEADPELIFNTPVHEKWQKAIQK -2222------------------------1111-------3333---------3333333 LGISP 3---- >PROTEASE; SWP:POL_FIVPE; PDB:2HAHA; GTTTTLEKRPEILIFVNGYPIKFLLDTGADITVLNRRDFQVKNSIENGRQMIGGIGGFIR ---------------iiii------1111-----3333--!!!!---------------- GTNYINVHLEIRDENYKTQCIFGNVCVLEDNSTPVNILGRDNMIKFNIRLVM ------------1111----------------------3333-1111----- >HEPATITIS C VIRUS NS5B RN; SWP:P26663; PDB:2HAIA; SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGQRQKKVTFDRLQVL -----------------------3333-----3333----3333---------------- DDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSKYGYGAKDVRNLSSRAVNHIH ----------------------------11111111----------------3333---- SVWKDLLEDTVTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVS ---------------------------3333----------------------------- TLPQVVMGSSYGFQYSPGQRVEFLVNTWKSKKNPMGFSYDTRFDSTVTENDIRVEESIYQ ------!!!!1111-------------1111-----------3333------------33 CCDLAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKASA 33----------------3333----1111------------1111-------------- ACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPPGDPPQPEYDLE --------------!!!!------------------------1111-----------333 LITSSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPVNSWLGNIIMYAPTLWAR 3-----------1111------------------------------------11113333 MILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEI ---------------1111-----iiii----3333-----------1111--------- NRVASCLRKLGVPPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAA -----------------------------------------1111-----------3333 SQLDLSGWFVAGYSGGDIYH ------------2222---- >SERINE/THREONINE-PROTEIN ; SWP:Q9P0L2; PDB:2HAKA; PHIGNYRLQKTIGKGNFAKVKLARHVLTGREVAVKIIDKTQLNPTSLQKLFREVRIMKIL --!!!!--------------------------------1111--------------1111 
NHPNIVKLFEVIETEKTLYLVMEYASGGEVFDYLVAHGRMKEKEARAKFRQIVSAVQYCH -------------3333------------------------------------------- QKYIVHRDLKAENLLLDGDMNIKIADFGFSNEFTVGSPPYAAPELFQGKKYDGPEVDVWS ---------3333---1111-----2222-1111-------3333---------3333-- LGVILYTLVSGSLPFDGQNLKELRERVLRGKYRIPFYMSTDCENLLKKLLVLNPIKRGSL ------------------3333------------1111--------------1111---- EQIMKDRWMNVGHEEEELKPYTEPDPDFNDTKRIDIMVTMGFARDEINDALINQKYDEVM ------3333---------------------------1111---------1111------ ATYILLGRK ----1111- >HEPATITIS A PROTEASE 3C; SWP:P13901; PDB:2HALA; STLEIAGLVRKNLVQFGVGEKNGSVRWVMNALGVKDDWLLVPSHAYKFEKDYEMMEFYFN ---------1111------2222-----------!!!!---33331111-3333------ RGGTYYSISAGNVVIQSLDVGFQDVVLMKVPTIPKFRDITQHFIKKGDVPRALNRLATLV iiii----1111-----------------1111-----1111--33333333-------- TTVNGTPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAWRGKGEGLPGMCGGALVSSN --iiii--------------------3333-----------------2222--------1 QSIQNAILGIHVAGGNSILVAKLVTQEMFQNI 111-----------%%%%------3333---- >CYCLOPHILIN; SWP:Q9U9R3; PDB:2HAQA; EPEVTAKVYFDVMIDSEPLGRITIGLFGKDAPLTTENFRQLCTGEHGFGYKDSIFHRVIQ -------------------------------------------1111--2222------- NFMIQGGDFTNFDGTGGKSIYGEKFADENLNVKHFVGALSMANAGPNTNGSQFFITTAPT ------------------1111------------2222---------------------1 PWLDGRHVVFGKVLDGMDVVLRIEKTKTNSHDRPVKPVKIVASGEL 111------------3333---1111---%%%%------------- >PUTATIVE NAD(P)H-FLAVIN O; SWP:Q9A120; PDB:2HAYA; IHHQIQQALHFRTAVRVYKEEKISDEDLALILDAAWLSPSSIGLEGWRFVVLDNKPIKEE ----------------------------------1111-2222----------------- IKPFAWGAQYQLETASHFILLIAEKHARYDSPAIKNSLLRRGIKEGDGLNSRLKLYESFQ 33331111-------------------1111-------1111------3333-------- KEDDADNPRALFDWTAKQTYIALGNTAALLGIDTCPIEGFHYDKVNHILAKHNVIDLEKE -------------------------3333--------------------1111--3333- GIASLSLGYRLRDPKHAQVRKPKEEVSVVK ---------------------3333----- >NEURAL CELL ADHESION MOLE; SWP:P13592; PDB:2HAZA; SHMDTPSSPSIDQVEPYSSTAQVQFDEPEATGGVPILKYKAEWRAVGEEVWHSKWYDAKE 
--------------------------------------------2222------------ ASMEGIVTIVGLKPETTYAVRLAALNGKGLGEISAASEFKTQPV -------------------------1111--------------- >REVERSE TRANSCRIPTASE/RIB; SWP:P03355; PDB:2HB5A; HGTRPDLTDQPLPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKALPAGTSAQRAE ---1111--------------------iiii-------------------2222------ LIALTQALKAEGKKLNVYTDSRYAFATAHLTSEGKEIKNKDEILALLKALFLPKRLSIIH ---------2222-------------1111-----------------3333--------- CGHSAEARGNRADQAARKAAITETPDTS ---------------------------- >HEMOGLOBIN (DEOXY); SWP:P02216; PDB:2HBG; GLSAAQRQVIAATWKDIAGADNGAGVGKKCLIKFLSAHPQMAAVFGFSGASDPGVAALGA --------------------iiii-------------3333-3333--1111-------- KVLAQIGVAVSHLGDEGKMVAQMKAVGVRHKGYGNKHIKAQYFEPLGASLLSAMEHRIGG ---------1111---------------3333------3333---------------!!! KMNAAAKDAWAAAYADISGALISGLQS !-------------------------- >EXOSOME COMPLEX EXONUCLEA; SWP:Q12149; PDB:2HBJA; GMVEKPQLKFKSPIDNSESHPFIPLLKEKPNALKPLSESLRLVDDDENNPSHYPHPYEYE ----3333-------------------------------------3333-----111133 IDHQEYSPEILQIREEIPSKSWDDSVPIWVDTSTELESMLEDLKNTKEIAVDLEHHDYRS 33----1111----------1111------------------------------------ YYGIVCLMQISTRERDYLVDTLKLRENLHILNEVFTNPSIVKVFHGAFMDIIWLQRDLGL -----------3333------1111-----------1111------3333---------- YVVGLFDTYHASKAIGLPRHSLAYLLENFANFKTSKKYQLADWRIRPLSKPMTAAARADT -----------------------------------------------------------1 HFLLNIYDQLRNKLIESNKLAGVLYESRNVAKRRFEYSKYRPLTPSSEVYSPIEKESPWK 111-----------1111-----------1111---3333-------------------- ILMYQYNIPPEREVLVRELYQWRDLIARRDDESPRFVMPNQLLAALVAYTPTDVIGVVSL -1111---------------------------3333--3333----------3333---3 TNGVTEHVRQNAKLLANLIRDALRNIKNTN 333--------------------------- >HYPOTHETICAL PROTEIN (NP_; SWP:Q9A395; PDB:2HBOA; AIPEGFSQLNWSRGFGRQIGPLFEHREGPGQARLAFRVEEHHTNGLGNCHGGLSFADAWG --2222---------------------2222-------1111------------------ RIISLQKSYSWVTVRLCDFLSGAKLGDWVEGEGELISEEDLFTVRGRIWAGERTLITGTG -----------------------2222----------------------!!!!------- 
VFKALSARKPRPGELAY ----------2222--- >EGL NINE HOMOLOG 1; SWP:Q9GZT9; PDB:2HBTA; LPALKLALEYIVPCMNKHGICVVDDFLGKETGQQIGDEVRALHDTGKFTDGQLVSQKSDS ------------------------1111--------------1111-------------1 SKDIRGDKITWIEGKEPGCETIGLLMSSMDDLIRHCNGKLGSYKINGRTKAMVACYPGNG 111------------2222--------------1111--!!!!----------------- TGYVRHVDNPNGDGRCVTCIYYLNKDWDAKVSGGILRIFPEGKAQFADIEPKFDRLLFFW ---------------------------3333--------------------2222----- SDRRNPHEVQPAYATRYAITVWYFDADERARAKVKYLKGVRVEL -3333--------------------------------------- >2-amino-3-carboxymuconate; SWP:Q83V25; PDB:2HBVA; KPRIDMHSHFFPRISEQEAAKFDANHAPWLQVSAKGDTGSIMMGKNNFRPVYQALWDPAF --------------------------------3333------!!!!-----3333----- RIEEMDAQGVDVQVTCATPVMFGYTWEANKAAQWAERMNDFALEFAAHNPQRIKVLAQVP -----1111--------3333-1111---------------------------------3 LQDLDLACKEASRAVAAGHLGIQIGNHLGDKDLDDATLEAFLTHCANEDIPILVHPWDMM 333-----------1111---------!!!!1111----------1111----------- GGQRMKKWMLPWLVAMPAETQLAILSLILSGAFERIPKSLKICFGHGGGSFAFLLGRVDN --1111%%%%-----------------11111111-3333----%%%%-3333------- AWRHRDIVREDCPRPPSEYVDRFFVDSAVFNPGALELLVSVMGEDRVMLGSDYPFPLGEQ ----33331111--3333------------------------1111-------------- KIGGLVLSSNLGESAKDKIISGNASKFFNIN 22223333-----------------1111-- >NLP/P60 PROTEIN; SWP:Q3M7N3; PDB:2HBWA; SGEYQCLAALNLYDSPECTSLATQAAVGRHLQVTSNQQGAAVEVCLCEDDYPGWLSLGDL --------------3333-------2222--------!!!!--------------33331 GLLKPATVLYQAKSFSESEIKKLLPGAIAFTQKAQQSNYYLWGGTVGPNYDCSGLQAAFV 111-------------------------------------2222-------3333----1 SVGIWLPRDAYQQEAFTQAITIDELAPGDLVFFGTPVKATHVGLYLGDGCYIHSSGKAQG 111-----------------3333-2222-----3333--------iiii---------- RDGIGIDILSEQGDVVSRSYYQQLRGAGRVVKSYKPQRH ---------1111-------1111--------------- >RECEPTOR-TYPE TYROSINE-PR; SWP:P23467; PDB:2HC1A; NRKTSCPIKINQFEGHFMKLQADSNYLLSKEYEELKDVGRNQSCDIALLPENRGKNRYNN --------3333----------%%%%------11112222---3333-33331111-111 ILPYDATRVKLSGGSDYINASYIPGNNFRREYIVTQGPLPGTKDDFWKMVWEQNVHNIVM 
1--3333-----------------1111----------1111------------------ VTQCVEKGRVKCDHYWPADQDSLYYGDLILQMLSESVLPEWTIREFKICGEEQLDAHRLI -----iiii---------------!!!!-------------------------------- RHFHYTVWPDHGVPETTQSLIQFVRTVRDYINRSPGAGPTVVHCSAGVGRTGTFIALDRI ---------------3333-----------1111-------------------------- LQQLDSKDSVDIYGAVHDLRLHRVHMVQTECQYVYLHQCVRDVLRARKLR ---------------------------------------------3333- >HYPOTHETICAL PROTEIN YVYC; SWP:P39737; PDB:2HC5A; LNIERLTTLQPVWDRYDTQIHNQKDNDNEVPVHQVSYTNLAEMVGEMNKLLEPSQVHLKF ------------------3333-------------3333-----------1111------ ELHDKLNEYYVKVIEDSTNEVIREIPPKRWLDFYAAMTEFLGLFVDEKKLEHHHHHH ---------------1111------1111------------------3333------ >CATION-TRANSPORTING ATPAS; SWP:O29777; PDB:2HC8A; EAIKKLVGLQAKTAVVIRDGKEIAVPVEEVAVGDIVIVRPGEKIPVDGVVVEGESYVDES ------1111-------iiii----3333-2222----2222----------------33 MISGEPVPVLKSKGDEVFGATINNTGVLKIRATRVGGETLLAQIVKLVEDAMG 33---------2222--2222-------------!!!!--------------- >LEUCINE AMINOPEPTIDASE 1; SWP:P34629; PDB:2HC9A; TQVLVRNGIQAVGDGLTSLIIVGKKSVLKNVTFEGKFKEVAQKFVTDGDSWNSMISRIPA ---------------------------1111----------------------------- SGRHPLHYELAHLITVPDASSRGNTPTNAHSIYKELKPINYPEDTKNVHFVLFAEYPDVL --------------------111133333333---3333--1111---------3333-- SHVAAIARTFCKFSMKTSGIRELNVNIDVVCDKLTNEDAVFLTDLSESVRETARLIDTPA ------1111--------------------1111------------------------33 NILTTDALVDEAVKVGNATGSKITVIRGEELLKAGFGGIYHVGKAGPTPPAFVVLSHEVP 33--------------1111------!!!!------------1111------------22 GSTEHIALVGKGVVYDTGGLQIKTKTGMPNMKRDMGGAAGMLEAYSALVKHGFSQTLHAC 22---------------------11112222----------------------------- LCIVENNVSPIANKPDDIIKMLSGKTVEINNTDAEGRLILADGVFYAKETLKATTIFDMA --------1111-2222---3333------1111-------------------------- TLTGAQAWLSGRLHGAAMTNDEQLENEIIKAGKASGDLVAPMLFAPDLFFGDLKSSIADM --3333--------------------------------------3333--1111------ KNSNLGKMDGPPSAVAGLLIGAHIGFGEGLRWLHLDIAAPAEVGDRGTGYGPALFSTLLG 
-----------------------%%%%---------------!!!!------------33 KYTSVPMLK 33--3333- >HUMAN CHEMOKINE HCC-2; SWP:Q16663; PDB:2HCC; HFAADCCTSYISQSIPCSLMKSYFETSSECSKPGVIFLTKKGRQVCAKPSGPGVQDCMKK ---------------1111-------3333--------1111--------2222------ LKPYSI ------ >HYDROLASE, HALOACID DEHAL; SWP:Q8KBS5; PDB:2HCFA; SRTLVLFDIDGTLLKVESNRRVLADALIEVYGTEGSTDFSGKDGAIIYEVLSNVGLERAE --------2222--------------------------2222--------3333------ IADKFDKAKETYIALFRERARREDITLLEGVRELLDALSSRSDVLLGLLTGNFEASGRHK --------------------3333---2222------1111-----------3333---- LKLPGIDHYFPFGAFADDALDRNELPHIALERARRTGANYSPSQIVIIGDTEHDIRCARE -11113333-----3333--1111----------------3333-------------333 LDARSIAVATGNFTEELARHKPGTLFKNFAETDEVLASILT 3-------------3333----------------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q8BTR5; PDB:2HCMA; SLGTSEAAPPPFARVAPALFIGNARAAGATELLVRAGITLCVNVSRQQPGPRAPGVAELR ---------------2222----3333-------------------------2222---- VPVFDDPAEDLLTHLEPTCAAMEAAVRDGGSCLVYCKNGRSRSAAVCTAYLMRHRGHSLD -----1111-3333-----------1111------------------------------- RAFQMVKSARPVAEPNLGFWAQLQKYEQTLQAQAILPRE ---------1111-----------------1111----- >RNA-DIRECTED RNA POLYMERA; SWP:POLG_KUNJM; PDB:2HCNA; LVNGVVRLLSKPWDTITTKAPEPPEGVKYVLNETTNWLWAFLAREKRPRMCSREEFIRKV ----3333-3333---------------------------1111---------------- NSNAAQWRSAREAVEDPKFWEMVDEEREAHLRGECHTCRAIWFMWLGARFLEFEALGFLN -----------3333-3333-------1111----------------------------- EDHWLGRKNSGGGVEGLGLQKLGYILREVGTRPGGRIYADDTAGWDTRITRADLENEAKV --11113333---22223333------3333------------3333-----------33 LELLDGEHRRLARAIIELTYRSGQVVTYALNTFTNLAVQLVRMMEGEGVIGPDDVEKLTK 33------------1111----3333------------------1111--1111----22 GKGPKVRTWLSENGEERLSRMAVSGDDCVVKPLDDRFATSLHFLNAMSKVRKDIQEWKPS 22---------------1111--!!!!-------3333------1111------1111-- TGWYDWQQVPFCSNHFTELIMKDGRTLVTPCRGQDELVGRARISPNVRDTACLAKSYAQM ----1111--iiii------1111--------3333---3333----------------- WLLLYFHRRDLRLMANAICSAVPVNWVPTGRTTWSIHAGGEWMTTEDMLEVWNRVWIEEN 
----1111--------------1111-------------1111-------------1111 EWMEDKTPVEKWSDVPYSGKREDIWCGSLIGTRARATWAENIQVAINQVRSIIGDEKYVD ----------3333---------1111-2222---------------------------- YMSSLK 1111-- >3-ISOPROPYLMALATE DEHYDRA; SWP:LEUD_STRMU; PDB:2HCUA; SMEEFTIYTGTTVPLMNDNIDTDQILPKQFLKLIDKKGFGKYLMYEWRYLDNNYTENPDF --------------------------3333---1111-3333-3333---------1111 IFNQPEYREASILITGDNFGAGSSREHAAWALADYGFKVIVAGSFGDIHYNNDLNNGILP 11111111----------------3333-------------------------------- IIQPKEVRDKLAKLKPTDEVTVNLFEQKIYSPVGDFSFDIDGEWKHKLLNGLD ----------11111111-----1111---1111------------------- >ALCOHOL DEHYDROGENASE 1; SWP:P00330; PDB:2HCYA; SIPETQKGVIFYESHGKLEYKDIPVPKPKANELLINVKYSGVCHTDLHAWHGDWPLPVKL ------------2222------------1111----------3333-------------- PLVGGHEGAGVVVGMGENVKGWKIGDYAGIKWLNGSCMACEYCELGNESNCPHADLSGYT ---------------1111---2222-------------3333---33331111-2222- HDGSFQQYATADAVQAAHIPQGTDLAQVAPILCAGITVYKALKSANLMAGHWVAISGAAG -------------------22223333-3333----------1111-2222-----1111 GLGSLAVQYAKAMGYRVLGIDGGEGKEELFRSIGGEVFIDFTKEKDIVGAVLKATDGGAH ----------------------2222----1111-------------------------- GVINVSVSEAAIEASTRYVRANGTTVLVGMPAGAKCCSDVFNQVVKSISIVGSYVGNRAD -------------1111--2222-------2222-------------------------- TREALDFFARGLVKSPIKVVGLSTLPEIYEKMEKGQIVGRYVVDTSK ----------------------------------------------- >Expansin-B1 [Precursor]; SWP:P58738; PDB:2HCZX; KVPPGNITTNYNGKWLTARATWYGQPNGAGAPDNGGACGIKNVNLPPYSGMTACGNVPIF ------------------------------1111-3333--3333--%%%%--------- KDGKGCGSCYEVRCKEKPECSGNPVTVYITDMNYEPIAPYHFDLSGKAFGSLAKPGLNDK -----------------------------------------------3333--2222--- IRHCGIMDVEFRRVRCKYPAGQKIVFHIEKGCNPNYLAVLVKYVADDGDIVLMEIQDKLS ------------------2222-------------------------------------- AEWKPMKLSWGAIWRMDTAKALKGPFSIRLTSESGKKVIAKDVIPANWRPDAVYTSNVQF ---------!!!!------------------3333-------------2222-------- Y - >PROTEASE NS2-3 (P23); SWP:P27958; PDB:2HD0A; 
SHMQASLLKVPYFVRVQGLLRICALARKIAGGHYVQMAIIKLGALTGTYVYNHLTPLRDW -----------------------1111-2222-----------1111---3333-3333- AHNGLRDLAVAVEPVVFSRMETKLITWGADTAACGDIINGLPVSARRGQEILLGPADGMV -11113333------------------3333------iiii--------------22221 SKGWRLL 111---- >PHOSPHODIESTERASE 9A; SWP:O76083; PDB:2HD1A; PTYPKYLLSPETIEALRKPTFDVWLWEPNEMLSCLEHMYHDLGLVRDFSINPVTLRRWLF ---3333----------11111111----------------------------------- CVHDNYRNNPFHNFRHCFCVAQMMYSMVWLCSLQEKFSQTDILILMTAAICHDLDHPGYN --1111------3333------------1111---------------------------- NTYQINARTELAVRYNDISPLENHHCAVAFQILAEPECNIFSNIPPDGFKQIRQGMITLI --------------%%%%----------------11111111--3333------------ LATDMARHAEIMDSFKEKMENFDYSNEEHMTLLKMILIKCCDISNEVRPMEVAEPWVDCL ---3333---------------1111-----------------3333-3333-------- LEEYFMQSDREKSEGLPVAPFMDRDKVTKATAQIGFIKFVLIPMFETVTKLFPMVEEIML ------------------11111111---------------------------------- QPLWESRDRYEELKRIDDAMKELQKK -------------------------- >ETHANOLAMINE UTILIZATION ; SWP:P0AEJ9; PDB:2HD3A; KLAVVTGQIVCTVRHHGLAHDKLLVEIDPQGNPDGQCAVAIDNIGAGTGEWVLLVSGSSA --------------3333----------------------------2222------3333 RQAHKSETSPVDLCVIGIVDEVVSGGQVIFHKL -----1111--------------iiii------ >UBIQUITIN CARBOXYL-TERMIN; SWP:O75604; PDB:2HD5A; QGLAGLRNLGNTCFMNSILQCLSNTRELRDYCLQRLYMR -------------------------3333---------- >UPF0310 PROTEIN PH1033; SWP:O58764; PDB:2HD9A; MTYWICITNRENWEVIKRHNVWGVPKKHKNTLSRVKPGDKLVIYVRQEKDKEGNLLEPKI ------------------------3333-3333--2222----------1111------- VGIYEVTSEPYVDFSRIFKPHRGGKETYPYRVKIKPIKIGEINFKPLINDLKFIKNKKRW -------------------------------------------11111111----3333- SMHFFGKAMRELPEEDYKLIEKLLL ------------------------- >HNF3/FH TRANSCRIPTION FAC; SWP:Q63245; PDB:2HDCA; VKPPYSYIALITMAILQSPQKKLTLSGICEFISNRFPYYREKFPAWQNSIRHNLSLNDCF --------------3333------------------3333-------------------- VKIPREPGNPGKGNYWTLDPQSEDMFDNGSFLRRRKR -------------------------3333-------- >ENGRAILED HOMEODOMAIN; SWP:P02836; PDB:2HDDA; 
RTAFSSEQLARLKREFNENRYLTERRRQQLSSELGLNEAQIKIWFKNKRAKIKKS ----3333-----------------------1111-3333--------------- >SMALL INDUCIBLE CYTOKINE ; SWP:NA; PDB:2HDLA; SKCKCSRKGPKIRYSDVKKLEMKPKYPHCEEKMVIITTKSVSRYRGQEHCLHPKLQSTKR ------------3333---------1111------------1111---------3333-- FIKWYNAWNEKRRVYEE ----------3333--- >PHOSPHOGLYCOLATE PHOSPHAT; SWP:Q88YA8; PDB:2HDOA; TYQALFDIDGTLTNSQPAYTTVREVLATYGKPFSPAQAQKTFPAAEQATELGIAASEFDH -------2222-------------3333---------------3333-1111-3333--- FQAQYEDVASHYDQIELYPGITSLFEQLPSELRLGIVTSQRRNELESGRSYPFRAVTISA ----------3333---2222-------3333--------------------------11 DDTPKRKPDPLPLLTALEKVNVAPQNALFIGDSVSDEQTAQAANVDFGLAVWGDPNADHQ 11---------------1111-3333-----------------------33333333--- KVAHRFQKPLDILELF -------3333----- >UBIQUITIN-PROTEIN LIGASE ; SWP:Q00987; PDB:2HDPA; SLPLNAIEPCVICQGRPKNGCIVHGKTGHLMACFTCAKKLKKRNKPCPVCRQPIQMIVLT -----------------------iiii--------------------------------- YFP --- >SH2-B PH domain containin; SWP:Q9WVM5; PDB:2HDVA; SDQPLSGYPWFHGMLSRLKAAQLVLEGGTGSHGVFLVRQSETRRGECVLTFNFQGKAKHL ---33331111---------------!!!!2222--------2222------iiii---- RLSLNAAGQCRVQHLHFQSIFDMLEHFRVHPIPDVVLVSYVPS ----1111-----------------1111-------------- >HYPOTHETICAL PROTEIN PA22; SWP:Q01609; PDB:2HDWA; NMQLQLTQEWDKTFPLSAKVEHRKVTFANRYGITLAADLYLPKNRGGDRLPAIVIGGPFG ----------------1111--------1111-------------------------222 AVKEQSSGLYAQTMAERGFVTLAFDPSYTGESGGQPRNVASPDINTEDFSAAVDFISLLP 2-------------1111-------2222-----------3333--------------11 EVNRERIGVIGICGWGGMALNAVAVDKRVKAVVTSTMYDMTRVMSKGYNDSVTLEQRTRT 111111-----!!!!-------1111---------------------%%%%--------- LEQLGQQRWKDAESGTPAYQPPYNELKGGEAQFLVDYHDYYMTPRGYHPRAVNSGNAWTM ------------------------------3333--------3333-1111-------11 TTPLSFMNMPILTYIKEISPRPILLIHGERAHSRYFSETAYAAAAEPKELLIVPGASHVD 11---------1111------------1111-------------------------3333 LYDRLDRIPFDRIAGFFDEHL ---1111-------------- >SELENOCYSTEINE LYASE; SWP:Q96I15; PDB:2HDYA; 
ERKVYMDYNATTPLEPEVIQAMTKAMWEAWGNPSSPYSAGRKAKDIINAARESLAKMIGG ------3333---------------------1111------------------------- KPQDIIFTSGGTESNNLVIHSVVKHFHANQTGAKPHFITSSVEHDSIRLPLEHLVEEQVA 3333---------------------------------------1111------------- AVTFVPVSKVSGQTEVDDILAAVRPTTRLVTIMLANNETGIVMPVPEISQRIKALNQERV --------------3333-11111111--------------------------------1 AAGLPPILVHTDAAQALGKQRVDVEDLGVDFLTIVGHKFYGPRIGALYIRGLGEFTPLYP 111---------1111------3333--------1111------------2222------ MLFGGGQERNFRPGTENTPMIAGLGKAAELVTQNCEAYEAHMRDVRDYLEERLEAEFGQK ------%%%%-----------------------------------------------333 RIHLNSQFPGTQRLPNTCNFSIRGPRLQGHVVLAQCRVLMASVGAACHSDHGDQPSPVLL 3--11112222------------1111-----1111----------3333---------1 SYGVPFDVARNALRLSVGRSTTRAEVDLVVQDLKQAVAQLEDQ 111-33331111-----1111---------------------- >DISCS LARGE HOMOLOG 2; SWP:Q15700; PDB:2HE2A; MEPRKVVLHKGSTGLGFNIVGGEDGEGIFVSFILAGGPADLSGELQRGDQILSVNGIDLR ----------------------iiii-------2222--------2222----iiii-22 GASHEQAAAALKGAGQTVTIIAQYQPEDYARFEAKIHETSV 22--------3333-----------------1111------ >GLUTATHIONE PEROXIDASE 2; SWP:P18283; PDB:2HE3A; MIAKSFYDLSAINLDGEKVDFNTFRGRAVLIENVASLCGTTTRDFTQLNELQCRFPRRLV ----1111----1111---33332222----------1111------------------- VLGFPCNQFGHQENCQNEEILNSLKYVRPGGGYQPTFTLVQKCEVNGQNEHPVFAYLKDK --------%%%%---1111----------iiii-------------1111---------- LPYPYDDPFSLMTDPKLIIWSPVRRSDVAWNFEKFLIGPEGEPFRRYSRTFPTINIEPDI ---1111------3333------1111----------1111------11113333----- KRLLK -1111 >NA(+)/H(+) EXCHANGE REGUL; SWP:Q15599; PDB:2HE4A; SMLRPRLCHLRKGPQGYGFNLHSDKSRPGQYIRSVDPGSPAARSGLRAQDRLIEVNGQNV ------------1111-------------------2222--1111-2222----iiii-1 EGLRHAEVVASIKAREDEARLLVVGPSTRL 111--------------------------- >BAND 4.1-LIKE PROTEIN 3; SWP:Q9Y2J2; PDB:2HE7A; KSMQCKVILLDGSEYTCDVEKRSRGQVLFDKVCEHLNLLEKDYFGLTYRDAENQKNWLDP --------1111-------11113333------1111---1111-----1111-----11 AKEIKKQVRSGAWHFSFNVKFYPPDPAQLSEDITRYYLCLQLRDDIVSGRLPCSFVTLAL 
113333------------------3333--3333-----------1111----------- LGSYTVQSELGDYDPDECGSDYISEFRFAPNHTKELEDKVIELHKSHRGMTPAEAEMHFL -------------1111-11111111------------------1111------------ ENAKKLSMYGVDLHHAKDSEGVEIMLGVCASGLLIYRDRLRINRFAWPKVLKISYKRNNF -33331111--------1111-------3333----%%%%-----3333------!!!!- YIKIRPGEFEQFESTIGFKLPNHRAAKRLWKVCVEHHTFFRLL ------1111--------------------------------- >NK-TUMOR RECOGNITION PROT; SWP:P30414; PDB:2HE9A; SPQCHFDIEINREPVGRIMFQLFSDICPKTCKNFLCLCSGEKGLGKTTGKKLCYKGSTFH ---------iiii--------------------------1111----------2222--- RVVKNFMIQGGDFSEGNGKGGESIYGGYFKDENFILKHDRAFLLSMANRGKHTNGSQFFI --2222-----1111-------1111---------------------------------- TTKPAPHLDGVHVVFGLVISGFEVIEQIENLKTDAASRPYADVRVIDCGVLA ----3333-------------------1111--1111--------------- >KIF2C PROTEIN; SWP:Q99661; PDB:2HEHA; NWEFARMIKEFRATLECHPLTMTDPIEEHRICVCVRKRPLNKQELAKKEIDVISIPSKCL --------------------1111--------------------1111--------1111 LLVHEPKLKVDLTKYLENQAFCFDFAFDETASNEVVYRFTARPLVQTIFEGGKATCFAYG --------1111---------------1111---------33333333------------ QTGSGKTHTMGKGIYAMASRDVFLLKNQPCYRKLGLEVYVTFFEIYNGKLFDLLNKKAKL 22223333---------------1111-1111-------------iiii-----%%%%-- RVLEDGKQQVQVVGLQEHLVNSADDVIKMIDMGSACRNSSRSHACFQIILRAKGRMHGKF ----1111---2222----------------------1111------------------- SLVDLAGNEGAEINKSLLALKECIRALGQNKAHTPFRESKLTQVLRDSFIGENSRTCMIA --------------------------1111----1111------3333------------ TISPGISSCEYTLNTLRYADRVKE ----3333----------1111-- >RAS-RELATED PROTEIN RAB-5; SWP:P61020; PDB:2HEIA; ICQFKLVLLGESAVGKSSLVLRFVKGQFHEYQESTIGAAFLTQSVCLTVKFEIWDTAGQE ----------2222-----------------------------------------2222- RYHSLAPMYYRGAQAAIVVYDITNQETFARAKTWVKELQRQASPSIVIALAGNKADLANK -1111----2222-------1111------------------1111-------3333111 RMVEYEEAQAYADDNSLLFMETSAKTAMNVNDLFLAIAKKLPK 1-----------1111-------1111----------1111-- >ALDO-KETO REDUCTASE FAMIL; SWP:Q9CX32; PDB:2HEJA; HCVILNDGNFIPVLGFGTALPLECPKSKAKELTKIAIDAGFHHFDSASVYNTEDHVGEAI 
----1111------------11113333--------1111------3333---------- RSKIADGTVRREDIFYTSKVWCTSLHPELVRASLERSLQKLQFDYVDLYLIHYPMALKPG ---1111--3333-------1111-1111------------------------------- EENFPVDEHGKLIFDRVDLCATWEAMEKCKDAGLTKSIGVSNFNYRQLEMILNKPGLKYK ------1111-------------------1111--------------------2222--- PVCNQVECHPYLNQMKLLDFCKSKDIVLVAYGVLGTQRYGGWVDQNSPVLLDEPVLGSMA --------1111---------1111------1111---2222-1111-1111-------- KKYNRTPALIALRYQLQRGIVVLNTSLKEERIKENMQVFEFQLSSEDMKVLDGLNRNMRY ---------------1111-----------------1111----------1111------ IPAAIFKGHPNWPFLDEY --3333--1111------ >HYPOTHETICAL PROTEIN; SWP:O67745; PDB:2HEKA; MIKEFSDPLYGFVRVGEAGLRLIDSFPFQRLRYVKQLGLAYLVFPSAQHTRFEHSLGVYH -------------------------33333333-1111-----1111------------- ITERICESLKVKEKELVKLAGLLHDLGHPPFSHTTEVLLPRERSHEDFTERVIKETEIYE ----------------------1111-------33333333------------------- ILKQDYSHEDIERLVRITLGKPEDEEEKLLSEIITGEFGSDRMDYLRRDAYFCGVSYGFF ------------------------------------------------------------ DYDRLISTLRVYENKVVVDESGLRALENFLISRYFMYVQVYFHKVVRILSIHLVEFLKKL -----1111--%%%%---3333-----------------1111----------------- ISQEDFTDINNFLRLNDAFVISELFKRKAFREDFERIFQRKHFKTLLSTENYEKFSETKE -------33331111-----------3333------------------------------ RLLEKFPQEKVRFDEVEKEVYGGNIYVLSSEGLKKAHELSPLIASLKPIKLYRIYVDRQL ------3333------------------1111--3333--3333------------3333 WEKARSELK -----1111 >EPH RECEPTOR A4; SWP:Q80VZ2_MOUSE; PDB:2HELA; AKEIDASCIKIEKVIGVGEFGEVCSGRLKVEICVAIKTLKYTDKQRRDFLSEASIMGQFD ----1111---------------------------------11113333-----1111-- HPNIIHLEGVVTKCKPVMIITEYMENGSLDAFLRKNDGRFTVIQLVGMLRGIGSGMKYLS -----------------------1111--------2222--------------------- DMSAVHRDLAARNILVNSNLVCKVSDFGMSRVKIPIRWTAPEAIAYRKFTSASDVWSYGI ---------3333---1111------1111----1111-3333----------------- VMWEVMSYGERPYWDMSNQDVIKAIEEGYRLPPPMDCPIALHQLMLDCWQKERSDRPKFG ----1111--2222----------1111-----2222---------1111-3333----- QIVNMLDKLIRNPNSL -----------3333- >Z-DNA BINDING PROTEIN 1; SWP:Q9QY24; 
PDB:2HEOA; DNLEQKILQVLSDDGGPVAIFQLVKKCQVPKKTLNQVLYRLKKEDRVSSPSPKYWSIGG ------------------3333-------------------1111-----2222----- ------------------------------------------ >YORP PROTEIN; SWP:O31898; PDB:2HEQA; MAGDPLPKYWSYPVGLAVEINNNARYGCPHHVGRKGKIIEHLHSATYDYAVSDETGDITY ------------2222-----------2222---------------------1111---- FKEHELTPLKGGLAYVLEHHHHHH -3333------------------- >FARNESYL PYROPHOSPHATE SY; SWP:Q5CR09; PDB:2HERA; YDYTDFINYYDKFKVIVYNVLKKLPLNDEIRKPVIEYYLNCIDYNVKKGKHIRGKILVLI ----------------1111--------33333333--------1111---3333---33 SSLSSAYSNIKRDSIYLLGWVVEAIQALILIADDIMDSGKFRRGAPCWYIVHGQSNAIND 33--------3333--------------------1111--------3333---------- IFFLKMLSLSLIFELSSVFGNDIVMKIQKIYNESIFFTVLGQHLDLSYFDLSKADKISER -------------3333--------------------------------------3333- YFSMVEMKTSRYTFYMPVFFGLTLSEIQVSSAQLNLIEAILYKLGEFYQVHNDVSDYLFN ----------------------3333---------------------------------- DSNADDICRFKLTWPLQKSFEIADEEMKLKISENYGKNSSLVKDCYNLLKINEHYLEYQR ------------3333-------3333------2222------------11111111--- NALDYLIKLVKDITDDSLQKVFIHLIHQISELITN --------------3333----------------- >Tumor necrosis factor lig; SWP:P23510; PDB:2HEVF; RIQSIKVQFTEYKKEKGFILTSQKEDEIMKVQDNSVIINCDGFYLISLKGYFSQEVDISL -----------------------2222----%%%%------------------------- HYQKDEEPLFQLKKVRSVNSLMVASLTYKDKVYLNVTTDNTSLDDFHVNGGELILIHQNP --1111--------------------2222--------3333----------------22 GEFCVL 22---- >Tumor necrosis factor lig; SWP:P43488; PDB:2HEWF; PPIQRLRGAVTRCEDGQLFISSYKNEYQTMEVQNNSVVIKCDGLYIIYLKGSFFQEVKID -------------iiii---------------%%%%------------------------ LHFREDHNPISIPMLNDGRRIVFTVVASLAFKDKVYLTVNAPDTLCEHLQINDGELIVVQ ---1111-------3333-----------2222--------------------------- LTPGYCAP -------- >Tumor necrosis factor rec; SWP:P43489; PDB:2HEYR; LHCVGDTYPSNDRCCHECRPGNGMVSRCSRSQNTVCRPCGPGFYNDVVSSKPCKPCTWCN ---------%%%%-----2222------1111-------2222----------------3 LRSGSERKQLCTATQDTVCRCRAGTQPLDSYKPGVDCAPCPPGHFSPGDNQACKPWTNCT 
333--------1111------2222---------------2222---%%%%--------- LAGKHTLQPASNSSDAIC ----------1111---- >BILE SALT HYDROLASE; SWP:Q9KK62; PDB:2HF0A; CTGVRFSDDEGNTYFGRNLDWSFSYGETILVTPRGYHYDTVFGAGGKAKPNAVIGVGVVM --------------------------------3333-----------------------% ADRPMYFDCANEHGLAIAGLNFPGYASFVHEPVEGTENVATFEFPLWVARNFDSVDEVEE %%%-------1111------------------2222---3333----------------3 TLRNVTLVSQIVPGQQESLLHWFIGDGKRSIVVEQMADGMHVHHDDVDVLTNQPTFDFHM 333--------1111--------------------1111-----1111------3333-- ENLRNYMCVSNEMAEPTSWGKASLTAWGAGVGMHGIPGDVSSPSRFVRVAYTNAHYPQQN ---1111-----------!!!!-------3333--------------------------- DEAANVSRLFHTLGSVQMVDGMAKMGDGQFERTLFTSGYSSKTNTYYMNTYDDPAIRSYA ------------------2222--1111------------1111-----1111------- MADYDMDSSELISVAR ----1111-------- >TETRAACYLDISACCHARIDE-1-P; SWP:Q7NSS5; PDB:2HF1A; DAKFLEILVCPLCKGPLVFDKSKDELICKGDRLAFPIKDGIPLESEARELAPEEEVKLE --------------------1111----1111-----iiii-3333----33333333- >SUGAR PHOSPHATASE SUPH; SWP:P75792; PDB:2HF2A; LSVKVIVTDDGTFLNDAKTYNQPRFAQYQELKKRGIKFVVASNNQYYQLISFFPELKDEI --------------1111--3333-------1111---------333333331111---- SFVAENGALVYEHGKQLFHGELTRHESRIVIGELLKNFVACGLQSAYVSENAPEAFVALA ----iiii---iiii--------------------------1111---1111-------- KHYHRLKPVKDYQEIDDVLFKFSLNLPDEQIPLVIDKLHVALDGIKPVTSGFGFIDLIIP ----------1111------------3333-----------2222-----2222----11 GLHKANGISRLLKRWDLSPQNVVAIGDSGNDAELKARYSFAGNAAENIKQIARYATDDNN 11---------------3333------1111--------------------------111 HEGALNVIQAVLDNTSPFN 1------------------ >Probable hydrogenase nick; SWP:Q57884; PDB:2HF9A; KDILKANKRLADKNRKLLNKHGVVAFDFMGAIGSGKTLLIEKLIDNLKDKYKIACIAGDV ------------------1111--------2222------------3333---------- IAKFDAERMEKHGAKVVPLNTGKECHLDAHLVGHALEDLNLDEIDLLFIENVGNLICPAD ---------1111-------!!!!----------3333-1111------------3333- FDLGTHKRIVVISTTEGDDTIEKHPGIMKTADLIVINKIDLADAVGADIKKMENDAKRIN ------------1111--3333--3333---------3333-1111-------------1 PDAEVVLLSLKTMEGFDKVLEFIEKSVKEVK 
111---------2222----------1111- >HYDROGENASE-1 OPERON PROT; SWP:P19931; PDB:2HFDA; MSNDTPFDALWQRMLARGWTPVSESRLDDWLTQAPDGVVLLSSDPKRTPEVSDNPVMIGE --------------1111----3333----------------------------3333-- LLREFPDYTWQVAIADLEQSEAIGDRFGVFRFPATLVFTGGNYRGVLNGIHPWAELINLM ----3333------------------------------%%%%------------------ RGLVEPQQERAS ------------ >CB2 FAB, LIGHT CHAIN; SWP:Q6GMX8; PDB:2HFFA; IQMTQSPSSLSASVGDRVTITCRASQDVSTAVAWYQQKPGKAPKLLIYSASFLYSGVPSR ------------2222---------------------2222------------2222333 FSGSGSGTDFTLTISSLQPEDFATYYCQQSRITPPTFGQGTKVEIKRTVAAPSVFIFPPS 3----------------1111--------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTL 3333333---------------------%%%%---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNRGEC 3333------------1111------------- >Immunglobulin heavy chain; SWP:Q0ZCH0; PDB:2HFFB; EVQLVESGGGLVQPGGSLRLSCAASGFTISSNSIHWVRQAPGKGLEWVAWITPSDGNTDY ------------2222-----------3333--------2222----------------- ADSVKGRFTISADTSKNTAYLQMNSLRAEDTAVYYCARRVCYAGGMDYWGQGTLVTVSSA 1111--------3333----------3333------------------------------ STKGPSVFPLAPSGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYSLSS ---------------------------------%%%%--2222-------1111------ VVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK ----3333------------1111---------- >GENESIS; SWP:Q63245; PDB:2HFH; MVKPPYSYIALITMAILQSPQKKLTLSGICEFISNRFPYYREKFPAWQNSIRHNLSLNDC -------3333----3333-----3333----3333---33331111------------- FVKIPREPGNPGKGNYWTLDPQSEDMFDNGSFL --------------------------------- >TYPE I POLYKETIDE SYNTHAS; SWP:Q9ZGI2; PDB:2HFKA; AGMFRALFRQAVEDDRYGEFLDVLAEASAFRPQFASPEACSERLDPVLLAGGPTDAEGRA ---------------------------1111----3333--------------------- VLVGCTGTAANGGPHEFLRLSTSFQEERDFLAVPLPGYGTGGTALLPADLDTALDAQARA --------1111111133333333------------------------------------ ILRAAGDAPVVLLGHAGGALLAHELAFRLERAHGAPPAGIVLVDPYPPGHQEPIEVWSRQ ----!!!!--------------------------------------1111---------- 
LGEGLFAGELEPMSDARLLAMGRYARFLAGPRPGRSSAPVLLVRASEPLGDWQEERGDWR -----1111-----------------1111----------------------1111---- AHWDLPHTVADVPGDHFTMMRDHAPAVAEAVLSWLDAIEG ----------------3333-------------------- >SYNECHOCYSTIS PHOTORECEPT; SWP:P74295; PDB:2HFNA; SLYRLIYSSQGIPNLQPQDLKDILESSQRNNPANGITGLLCYSKPAFLQVLEGECEQVNE -----------11113333------------1111------------------------- TYHRIVQDERHHSPQIIECMPIRRRNFEVWSMQAITVNDLSTEQVKTLVLKYSGFTTLRP -------1111---------------3333------------------3333------33 SAMDPEQCLNFLLDIAKIYELSDNFFLDL 33-------------1111---------- >HUMAN TISSUE FACTOR; SWP:P13726; PDB:2HFT; SGTTNTVAAYNLTWKSTNFKTILEWEPKPVNQVYTVQISTKSGDWKSKCFYTTDTECDLT ----------------iiii---------------------------------------- DEIVKDVKQTYLARVFSYPAGNVESTEPLYENSPEFTPYLETNLGQPTIQSFEQVGTKVN -----1111---------------------------3333--------------!!!!-- VTVEDERTLVRRNNTFLSLRDVFGKDLIYTLYYWKSSGKKTAKTNTNEFLIDVDKGENYC -----------------3333-!!!!---------------------------------- FSVQAVIPSRTVNRKSTDSPVECMG ------3333--------------- >HYPOTHETICAL PROTEIN RPA1; SWP:NA; PDB:2HFVA; MGSSHHHHHHSSGRENLYFQGHLRELLRTNDAVLLSAVGALLDGADIGHLVLDQNMSILE ------------------------------------------1111-------------- GSLGVIPRRVLVHEDDLAGARRLLTDAGLAHELRSDD ------------3333--------------------- >6-DEOXYERYTHRONOLIDE B SY; SWP:Q5UNP4; PDB:2HG4A; EEKLRRYLKRTVTELDSVTARLREVEHRAGEPIAIVGACRFPGDVDSPESFWEFVSGGGD ------------------------------------------------------1111-- AIAEAPADRGWEPDPDARLGGLAAAGDFDAGFFGISPREALADPQQRILEISWEALERAG -----------------------1111-3333----------3333----------1111 HDPVSLRGSATGVFTGVGTVDYGPRPDEAPDEVLGYVGTGTASSVASGRVAYCLGLEGPA -33332222---------------3333-3333----33333333--------------- TVDTACSSGLTALHLAESLRRDECGLALAGGVTVSSPGAFTEFRSQGGLAADGRCKPFSK ---!!!!-----------1111-------------------1111----3333--2222- AADGFGLAEGAGVLVLQRLSAARREGRPVLAVLRGSAVNQDGASNGLTAPSGPAQQRVIR ---------------------------------------------3333----------- RALENAGVRAGDVDYVEAHGTGTRLGDPIEVHALLSTYGAERDPDDPLWIGSVKSNIGHT 
--------3333----------3333-----------3333------------------! QAAAGVAGVKAVLALRHGEPRTLHFDEPSPQIEWAVSVVSQARSWPAGERPRRAGVSSFG !!!-3333---------------------------------------------------1 ISGTNAHVIVEEAPEADGPVPLVLSGRDEQARAQAGRLADHLAREPRNSLRDTGFTLATR 111------------------------3333-------------3333------------ RSAWEHRAVVVGDRDEALAGLRAVADGRIADRTATGQARTRRGVAVFPGQGAQWQGARDL ------------3333-------1111--------------------------------- LRESQVFADSIRDCERALAPHVDWSLTDLLSGARPLDRVDVVQPALFAVVSLAALWRSHG -----------------3333--------1111----1111------------------- VEPAAVVGHSQGEIAAAHVAGALTLEDAAKLVAVRSRVLRRLGGQGGASFGLGTEQAAER --------!!!!-----------3333----------33332222--------------- IGRFAGALSIASVNGPRSVVVAGESGPLDELIAECEAEAHKARRIPVDYASHSPQVESLR 3333--------------------------------------------------3333-- EELLTELAGISPVSADVALYSTTTGQPIDTATDTAYWYANLREQVRFQDATRQLAEAGFD ------2222------------------3333----------------------1111-- AFVEVSPHPVLTVGIEATLDSALPADAGACVVGTLRRDRGGLADFHTALGEAYAQGVEVD -----------------------------------2222-3333-------3333----- WSPAFADARPVELPVYPFQRQRYWLPI 3333----------------------- >HYPOTHETICAL PROTEIN; SWP:Q9I4L2; PDB:2HG6A; MSITSTDICQAADALKGFVGFNRKTGRYIVRFSEDSFGMDVADDSITPTSEFVWSSVRDD ----------1111------------------1111-----3333--3333--------- VMRLGREQLQILLEQNINERLNIGEPLLVYLRRQDLPEITAQRQLR ------------------3333------------------------ >PHAGE-LIKE ELEMENT PBSX P; SWP:P54342; PDB:2HG7A; MILYDAIMYKYPNAVSRKDFELRNDGNGSYIEKWNLRAPLPTQAELETWWEELQKNPPYE ----------1111---------------------------------------------- >CONSERVED PROTEIN MTH1368; SWP:O27421; PDB:2HGAA; QPDGVQIDSVVPGSPASKVLTPGLVIESINGMPTSNLTTYSAALKTISVGEVINITTDQG -------------1111---2222----2222----------3333-2222-----1111 TFHLKTGRNPNNSSRAYMGIRTSNHLRVRDSVASVLGDTLPFA ------------------------------------------- >YJCQ PROTEIN; SWP:O31639; PDB:2HGCA; KLRYAILKEIFEGNTPLSENDIGVTEDQFDDAVNFLKREGYIIGVHYSDDRPHLYKLGPE -------------------3333------------------------------------- LTEKGENYLKENGTWSKA ------3333-------- 
>TRANSCRIPTION FACTOR IIIA; SWP:NA; PDB:2HGHA; MYVCHFENCGKAFKKHNQLKVHQFSHTQQLPYECPHEGCDKRFSLPSRLKRHEKVHAGYP -----2222---------------1111-------2222---------------3333-- CKKDDSCSFVGKTWTLYLKHVAECHQD ---1111-------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:P52597; PDB:2HGMA; NSADSANDGFVRLRGLPFGCTKEEIVQFFSGLEIVPNGITLPVDPEGKITGEAFVQFASQ ----------------22223333----3333---------------------------- ELAEKALGKHKERIGHRYIEVFKSSQEEVRSY ------3333---------------------- >HETEROGENEOUS NUCLEAR RIB; SWP:P52597; PDB:2HGNA; TGHCVHMRGLPYKATENDIYNFFSPLNPVRVHIEIGPDGRVTGEADVEFATHEEAVAAMS ----------1111------------------------------------3333------ KDRANMQHRYIELFLNSTTGA --------------------- >GLUTATHIONE SYNTHETASE; SWP:P48637; PDB:2HGSA; TNWGSLLQDKQQLEELARQAVDRALAEGVLLRTSQEPTSSEVVSYAPFTLFPSLVPSALL --3333------------------1111----3333------------------------ EQAYAVQMDFNLLVDAVSQNAAFLEQTLSSTIKQDDFTARLFDIHKQVLKEGIAQTVFLG -----------------------------3333--------------------------- LNRSDYMFQRSADGSPALKQIEINTISASFGGLASRTPAVHRHVLSVLSKTKEAGKILSN -----------------------------3333------------1111----1111--- NPSKGLALGIAKAWELYGSPNALVLLIAQEKERNIFDQRAIENELLARNIHVIRRTFEDI 3333----------33331111-----------------------1111------3333- SEKGSLDQDRRLFVDGQEIAVVYFRDGYMPRQYSLQNWEARLLLERSHAAKCPDIATQLA -------------iiii-----------3333---------------------------- GTKKVQQELSRPGMLEMLLPGQPEAVARLRATFAGLYSLDVGEEGDQAIAEALAAPSRFV ----------2222----2222-------1111---------------------1111-- LKPQREGGGNNLYGEEMVQALKQLKDSEERASYILMEKIEPEPFENCLLRPGSPARVVQC -----------------------11113333------------------2222------- ISELGIFGVYVRQEKTLVMNKHVGHLLRTKAIEHADGGVAAGVAVLDNPYPV ------------------------------1111----1111---------- >Major prion protein [Prec; SWP:P10279; PDB:2HH0H; VQLLEQSGAELVKPGASVKLSCTASGFNIEDSYIHWVKQRPEQGLEWIGRIDPEDGETKY ------------------------------------------------------------ APKFQGKATITADTSSNTAYLHLRRLTSEDTAIYYCGRGAYYIKEDFWGQGTTLTVSSAS -1111--------1111---------1111------------1111-------------- 
TKGPSVFPLAPSSKAGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLYS ------------------------------------%%%%--2222-------------- LSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPA -------3333-------------------------- >Major prion protein [Prec; SWP:P10279; PDB:2HH0L; LVMTQTPSSLSASLGERVSLTCRASQDIGNNLNWIQQKPDGTIKRLIYATSSLDSGVPKR -------------------------------------1111------------2222111 FSGSRSGSDYSLTISSLESEDFADYYCLQHDTFPLTFGGGTKLEIKRTVAAPSVFIFPPS 1----------------1111--------------------------------------3 DEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTL 333-------------------------%%%%---------------------------- SKADYEKHKVYACEVTHQGLSSPVTKSFNR ------------------------------ ----------------------------- >BH3980 PROTEIN; SWP:Q9K5V7; PDB:2HH6A; SFIEKIGSLNDKREWKAEARAKALPKEYHHAYKAIQKYWTSGGPTDWQDTKRIFGGILDL 3333----------------3333---------------1111----------------- FEEGAAEGKKVTDLTGEDVAAFCDELKDTKTWDKYRTKLNDSIGRD ----1111-1111-----------------------------!!!! >HYPOTHETICAL PROTEIN CSOR; SWP:P71543; PDB:2HH7A; ELTAKKRAALNRLKTVRGHLDGIVRMLESDAYCVDVMKQISAVQSSLERANRVMLHNHLE --------------------------1111------------------------------ TCFSTAVLDGHGQAAIEELIDAVKF -------------------1111-- >HYPOTHETICAL PROTEIN YDFO; SWP:P76156; PDB:2HH8A; DQVVIFKQIFDKVRNDLNYQWFYSELKRHNVSHYIYYLATENVHIVLKNDNTVLLKGLKN ------------------------------------------------------------ IVSVKFSKDRHLIETTSNKLKSREITFQEYRRNLAKAGVFRWVTNIHEQKRYYYTFDNSL --------------------------------------------3333------1111-- LFTESIQ ------- >HYPOTHETICAL PROTEIN RPA3; SWP:Q6N3S9; PDB:2HHGA; GPQTITRGIKALDEANSSIETLTTADAIALHKSGASDVVIVDIRDPREIERDGKIPGSFS -------3333---1111------------------------------------2222-- CTRGLEFWIDPQSPYAKPIFQEDKKFVFYCAGGLRSALAAKTAQDGLKPVAHIEGGFGAW ----3333-1111---3333-------------3333---------------2222---- RDAGGPIE 1111---- >IMMUNOGENIC PROTEIN MPT64; SWP:P0A5Q4; PDB:2HHIA; PKTYCEELKGTDTGQACQIQMSDPAYNINISLPSYYPDQKSLENYIAQTRDKFLSAATSS ----1111-----------------------------3333------------------- 
TPREAPYELNITSATYQSAIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITY ----------------------------------------------------------33 DTLWQADTDPLPVVFPIVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPG 33---------------------1111-----1111---3333-----1111-------- ELLPEAAGPTQVLVPRSAIDSMLA --------------33333333-- >BISPHOSPHOGLYCERATE MUTAS; SWP:P07738; PDB:2HHJA; SKYKLIMLRGEGAWNKENRFCSWVDQKLNSEGMEEARNCGKQLKALNFEFDLVFTSVLNR --------------------!!!!-------------------1111------------- SIHTAWLILEELGQEWVPVESSWRLNERHYGALIGLNREQMALNHGEEQVRLWRRSYNVT -------------1111----3333----!!!!--------------------------- PPPIEESHPYYQEIYNDRRYKVCDVPLDQLPRSESLKDVLERLLPYWNERIAPEVLRGKT ----1111-----11111111----3333-------------------------1111-- ILISAHGNSSRALLKHLEGISDEDIINITLPTGVPILLELDENLRAVGPHQFLGDQEAIQ ---------------1111-33331111------------1111---------------- AAIKKVEDQGKVKQ -------1111--- >CTD SMALL PHOSPHATASE-LIK; SWP:O15194; PDB:2HHLA; QVIPIPSPPAKYLLPEVTVLDYGKKCVVIDLDETLVHSSFKPISNADFIVPVEIDGTIHQ -----------------3333---------2222-------------------iiii--- VYVLKRPHVDEFLQRMGQLFECVLFTASLAKYADPVADLLDRWGVFRARLFRESCVFHRG -----2222-------------------3333------------------3333---iii NYVKDLSRLGRELSKVIIVDNSPASYIFHPENAVPVQSWFDDMTDTELLDLIPFFEGLSR i---3333---3333------333311111111--------1111--------------- >INOSITOL MONOPHOSPHATASE; SWP:P29218; PDB:2HHMA; WQECMDYAVTLARQAGVVEAIKNEMNVMLKSSPVDLVTATDQKVEKLISSIKEKYPSHSF ------------------3333---------1111----------------1111----- IGEESVAAGEKSILTDNPTWIIDPIDGTTNFVHRFPFVAVSIGFAVNKKIEFGVVYSVEG -----1111------------------------------------%%%%----------- KMYTARKGKGAFCNGQKLQVSQQEDITKSLLTELGSSRTPETVRVLSNMEKLFCIPVHGI -----2222---iiii--------1111----------3333---------1111----- RSVGTAAVNMCLVATGGADAYYEMGIHCWDVAGAGIIVTEAGGVLMDVTGGPFDLMSRRV --------------------------3333--------1111----1111---1111--- IAANNRILAERIAKEIQVIPLQRDDE -------------------------- >POLY(A) POLYMERASE; SWP:P29468; PDB:2HHPA; QKVFGITGPVSTVGATAAENKLNDSLIQELKKEGSFETEQETANRVQVLKILQELAQRFV 
3333--------------------------1111-------------------------- YEVSKKKNMSDGMARDAGGKIFTYGSYRLGVHGPGSDIDTLVVVPKHVTREDFFTVFDSL ----1111----------------3333----1111--------1111------------ LRERKELDEIAPVPDAFVPIIKIKFSGISIDLICARLDQPQVPLSLTLSDKNLLRNLDEK ---1111-----------------iiii--------------1111-------2222--- DLRALNGTRVTDEILELVPKPNVFRIALRAIKLWAQRRAVYANIFGFPGGVAWAMLVARI -------------1111------------------------3333--------------- CQLYPNACSAVILNRFFIILSEWNWPQPVILKPIEDGPLQVRVWNPKIYAQDRSHRMPVI ---1111-------------------------------------3333-3333------- TPAYPSMCATHNITESTKKVILQEFVRGVQITNDIFSNKKSWANLFEKNDFFFRYKFYLE ---------1111---------------------1111--3333-----1111------- ITAYTRGSDEQHLKWSGLVESKVRLLVMKLEVLAGIKIAHPFTKPFESSYCCPTEDDYEM -----------------------------1111----------------------3333- IQDKYGSHKTETALNALKPKAYLSTMYIGLDFNKEKVDIHIPCTEFVNLCRSFNEDYGDH ------3333-------------------------------------------3333--- KVFNLALRFVKGYDLPDEVFDENEKRPS ----------3333-3333--------- >DNA POLYMERASE I; SWP:Q5KWC1; PDB:2HHVA; KKMAFTLADRVTEEMLADKAALVVEVVEENYHDAPIVGIAVVNEHGRFFLRPETALADPQ -----------3333---------------------------1111----33331111-- FVAWLGDETKKKSMFDSKRAAVALKWKGIELCGVSFDLLLAAYLLDPAQGVDDVAAAAKM ------1111--------------1111-----------------3333----------- KQYEAVRPDEAVYGKGAKRAVPDEPVLAEHLVRKAAAIWELERPFLDELRRNEQDRLLVE -------3333---!!!!----3333-----------------------1111------- LEQPLSSILAEMEFAGVKVDTKRLEQMGKELAEQLGTVEQRIYELAGQEFNINSPKQLGV --------------------------------------------------1111------ ILFEKLQLPVLKKTKTGYSTSADVLEKLAPYHEIVENILHYRQLGKLQSTYIEGLLKVVR ---1111-------------3333---3333-----------------------3333-- PDTKKVHTIFNQALTQTGRLSSTEPNLQNIPIRLEEGRKIRQAFVPSESDWLIFAADYSQ --------------1111---------------3333----------2222--------- IELRVLAHIAEDDNLMEAFRRDLDIHTKTAMDIFQVSEDEVTPNMRRQAKAVNFGIVYGI ------------------1111---------1111-3333-------------------- SDYGLAQNLNISRKEAAEFIERYFESFPGVKRYMENIVQEAKQKGYVTTLLHRRRYLPDI ------------------------------------------------1111------11 
TSRNFNVRSFAERMAMNTPIQGSAADIIKKAMIDLNARLKEERLQAHLLLQVHDELILEA 11---------------------------------------------------------- PKEEMERLCRLVPEVMEQAVTLRVPLKVDYHYGSTWYDAK 1111-3333-------------------------3333-- >PYRIDOXAMINE 5'-PHOSPHATE; SWP:Q2ZZ07; PDB:2HHZA; ELKDIHILEDKVGVFATLDEYGNPHARHAHITAANEEGIFFTSPETHFYDQLGDQRVATA 3333--------------1111------------1111---------------------- ISEEGYLIQVVRVEGTARPVENDYLKTVFADNPYYQHIYKTQVFQIYAGHGFYHSLTQGH --------------------33333333!!!!----------------------3333-- KYIFSIGEVRAL ------------ >PUTATIVE PHOSPHOGLYCOLATE; SWP:Q1GA24; PDB:2HI0A; GKYKAAIFDDGTILDTSADLTSALNYAFEQTGHRHDFTVEDIKNFFGSGVVVAVTRALAY ----------------------------1111-----3333------------------1 EAGSSRESLVAFGTKDEQIPEAVTQTEVNRVLEVFKPYYADHCQIKTGPFPGILDLKNLR 111-33331111-1111--3333--------------------------2222------1 QKGVKLAVVSNKPNEAVQVLVEELFPGSFDFALGEKSGIRRKPAPDTSECVKVLGVPRDK 111---------------------2222-------2222-----------------1111 CVYIGDSEIDIQTARNSEDEIAVNWGFRSVPFLQKHGATVIVDTAEKLEEAILGE --------------1111---------------1111------------------ >4-HYDROXYTHREONINE-4-PHOS; SWP:P58718; PDB:2HI1A; ETKTVAITGDPAGIGPEIIVKALSEDGLNGAPLVVIGCLATLKRLQAKGITPNVELRAIE ---------1111-----------2222-----------------1111----------- RVAEARFAPGIIHVIDEPLAQPEALEAGKVQAQAGDLAYRCVKRATELALRGDVQAIATA 3333---2222---------3333------------------------1111-------- PLNKEALHLAGHNYPGHTELLATLTHSRDYAVLYTDKLKVIHVSTHIALRKFLDTLSTAR ----------------------1111--------1111---------------------- VETVIGIADTFLKRVGYVKPRIAVAGVNPHAGENGLFGDEETRILTPAITDARAKGDVYG ------------1111-----------2222-%%%%----------------1111---- PCPPDTVFLQAYEGQYDVVAYHDQGHIPLKLLGYDGVNITAGLPFIRTSADHGTAFDIAW --1111------------------------------------------------3333-- TGKAKSESAVSIKLAQLA ------------------ >FIMBRIAL PROTEIN; SWP:P02974; PDB:2HI2A; FTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGKWPEN ----------------3333---------------------------------------3 NTSAGVASSPTDIKGKYVKEVEVKNGVVTATMLSSGVNNEIKGKKLSLWARRENGSVKWF 
333-----1111-----------iiii-------------2222---------------- CGQPVTRTDDDTVADAKDGKEIDTKHLPSTCRDNFDAK ----------------------3333-1111--1111- >HOMEODOMAIN-ONLY PROTEIN; SWP:Q8R1H0; PDB:2HI3A; MSAQTVSGPTEDQVEILEYNFNKVNKHPDPTTLCLIAAEAGLTEEQTQKWFKQRLAEWRR ---------3333----------------------------------------------- SEGLPSECRSVTD ------------- >CYTOCHROME P450 1A2; SWP:P05177; PDB:2HI4A; RVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSRLDT --2222-------------3333------------------------------------- IRQALVRQGDDFKGRPDLYTSTLITDGQSLTFSTDSGPVWAARRRLAQNALNTFSIASDP -----1111--------3333--!!!!--------------------------------- ASSSSCYLEEHVSKEAKALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESS ---------------------------------------------------!!!!-1111 DEMLSLVKNTHEFVETASSGNPLDFFPILRYLPNPALQRFKAFNQRFLWFLQKTVQEHYQ -----------------222211113333----------------------------333 DFDKNSVRDITGALFKHSKKGPRASGNLIPQEKIVNLVNDIFGAGFDTVTTAISWSLMYL 3-1111-----------1111--iiii--3333--------------------------- VTKPEIQRKIQKELDTVIGRERRPRLSDRPQLPYLEAFILETFRHSSFLPFTIPHSTTRD ------------------------33331111-----------3333------------- TTLNGFYIPKKCCVFVNQWQVNHDPELWEDPSEFRPERFLTADGTAINKPLSEKMMLFGM --iiii-----------------3333--3333-3333--1111----3333-------- GKRRCIGEVLAKWEIFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHVQARRFS 1111-------------------------------------------------------- >UPF0107 PROTEIN AF0055; SWP:O30181; PDB:2HI6A; VKFACRAITRGRAEGEALVTKEYISFLGGIDKETGIVKEDCEIKGESVAGRILVFPGGKG -----------------------------------------------2222--------- STVGSYVLLNLRKNGVAPKAIINKKTETIIAVGAAMAEIPLVEVRDEKFFEAVKTGDRVV ----------------------------------------------3333---------- VNADEGYVELIE ------------ >6-PHOSPHO-1-FRUCTOKINASE; SWP:O15648; PDB:2HIGA; VTSKLVKAHRAMLNSVTQEDLKVDRLPGADYPNPSKKYSSRTEFRDKTDYIMYNPRPRDN ----------------3333------------111133331111---------------- PVSVSPLLCELAAARSRIHFNPTETTIGIVTCGGICPGLNDVIRSITLTGINVYNVKRVI ---------------------1111----------2222--------------------- 
GFRFGYWGLSKKGSQTAIELHRGRVTNIHHYGGTILGSSRGPQDPKEMVDTLERLGVNIL ----3333---3333-----3333--1111---1111------3333------------- FTVGGDGTQRGALVISQEAKRRGVDISVFGVPKTIDNDLSFSHRTFGFQTAVEKAVQAIR -------------------1111----------1111-------2222------------ AAYAEAVSANYGVGVVKLMGRDSGFIAAQAAVASAQANICLVPENPISEQEVMSLLERRF -----1111----------------------3333------3333-------------33 CHSRSCVIIVAEGFGQDWGRGGYDASGNKKLIDIGVILTEKVKAFLKANKSRYPDSTVKY 33--------11111111-----1111-----3333------------3333-------- IDPSYMIRACPPSANDALFCATLATLAVHEAMAGATGCIIAMRHNNYILVPIKVATSVRR --3333------------------------------------%%%%----3333------ VLDLRGQLWRQVREITVDLG --1111-------------- >HIGH POTENTIAL IRON SULFU; SWP:P04168; PDB:2HIPA; EPRAEDGHAHDYVNEAADASGHPRYQEGQLCENCAFWGEAVQDGWGRCTHPDFDEVLVKA ----2222%%%%---3333--1111222233331111----2222----1111-----11 EGWCSVYAPAS 11--------- >HYPOTHETICAL PROTEIN YDHR; SWP:P0ACX3; PDB:2HIQA; ATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKS --------------------------3333-2222--------1111------------- ALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQ ----------3333---------------------- >THERMOSTABLE DNA LIGASE; SWP:Q980T8; PDB:2HIVA; MEFKVIAEYFDKLEKISSRLQLTALLADLLSKSDKTIIDKVVYIIQGKLWPDFLGYPELG -----------3333--------------11113333-----3333----3333------ IGEKFLIKAISIATNTDENSVENLYKTIGDLGEVARRLKSKQKESLTVDEVYSTLSKVAL --------------------------------------1111------------------ TTGEGSRDLKIRLLAGLLKKADPLEAKFLVRFVEGRLRVGIGDATVLDAMAIAFGGGQSA --1111--------------------------------------------------3333 SEIIERAYNLRADLGNIAKIIVEKGIEALKTLKPQVGIPIRPMLAERLSNPEEILKKMGG --------------------------3333-------------------------1111- NAIVDYKYDGERAQIHKKEDKIFIFSRRLENITSQYPDVVDYVSKYIEGKEFIIEGEIVA -------------------------1111--3333------------------------- IDPESGEMRPFQELMHRKRKSDIYEAIKEYPVNVFLFDLMYYEDVDYTTKPLEARRKLLE ---------3333---1111--------------------------11113333-----1 SIVKPNDYVKIAHHIQANNVEDLKSFFYRAISEGGEGVMVKAIGKDAIYQAGARGWLWIK 111----------------------------------------1111--2222------- 
LKRDYQSEMADTVDLVVVGGFYGKGKRGGKISSLLMAAYNPKTDSFESVCKVASGFSDEQ -----------------------!!!!-------------1111---------------- LDELQKKLMEIKRDVKHPRVNSKMEPDIWVEPVYVAEIIGSEITISPLHTCCQDVVEKDA ----------------1111-------------------------------2222-2222 GLSIRFPRFIRWRDDKSPEDATTTDEILEMYNKQPKK ------------11111111----------3333--- >HYPOTHETICAL PROTEIN; SWP:Q97RI5; PDB:2HIYA; ATRYALLVRGINVGKNKVVAELRQELTNLGLEKVESYINSGNIFFTSIDSKAQLVEKLET ------------------------------------------------------------ FFAVHYPFIQSFSLLSLEDFEAELENLPAWWSRDLARKDFLFYTEGLDVDQVIATVESLE -----1111-------------1111-3333------------2222--------1111- LKDEVLYFGKLGIFWGKFSEESYSKTAYHKYLLKVPFYRHITIRNAKTFDKIGQLKK --------1111------33331111333333331111------------------- >PUTATIVE CITRATE LYASE, A; SWP:Q8DUC1; PDB:2HJ0A; ENKLGRDIPRKYANQYGVFEELAHIKSYKESSRQVKPVKPSDDKLLSSIHEAIEKTRLKD --------------------------------------1111-----------1111--- GTISFHHHFREGDYVNVLDEIAKGIKDISIAPSSIANVHEPLIDHIKNGVVTNITSSGLR ------1111-------------------------3333--------------------- DKVGAAISEGIENPVIIRSHGGRARAIATDDIHIDVAFLGAPSSDAYGNANGTRGKTTCG ------1111-------------------------------------------------- SLGYAIDAKYADQVVIVTDTLVPYPNTPISIPQTDVDYIVVVDAIGDPEGIAKGATRYTK -33333333----------------------1111------------1111--------- NPKELLIAEYAAKVITSSPYYKEGFSFQTGTGGASLAVTRFREQIKDDIKANFALGGITN --------------1111--------------3333------------------------ AVELLEEGLVDKILDVQDFDHPSAVSLDRNAEKHYEIDANYASPLSKGSVINQLDICVLS ------------------------------------------------3333-------- ALEVDTNFNVNVTGSDGVIRGASGGHCDTAFAAKSLVISPLVRGRIPTFVDKVNTVITPG -------------1111-----!!!!----------------!!!!------------33 TSVDVVVTEVGIAINPNRPDLIEYFKDLKVPQLTIEELKEKAYAIVGNPQPIQYGDKIVA 33-----1111---1111------1111-----3333----1111--------------- LIEYRDGSLIDVVRNVLE ---1111----------- >HYPOTHETICAL PROTEIN; SWP:Q4QNE7; PDB:2HJ1A; NQINIEIAYAFPERYYLKSFQVDEGITVQTAITQSGILSQFPEIDLSTNKIGIFSRPIKL ----------1111--------2222----------333333331111----------11 TDVLKEGDRIEIYRPLL 11--2222--------- 
>SULFHYDRYL OXIDASE ERV1P; SWP:Q8LC15; PDB:2HJ3A; PVTKEDLGRATWTFLHTLAAQYPEKPTRQQKKDVKELMTILSRMYPCRECADHFKEILRS -----------------3333--------------------------------------- NPAQAGSQEEFSQWLCHVHNTVNRSLGKLVFPCERVDARWG -----------------------1111-------3333--- >INTERFERON-INDUCED 17 KDA; SWP:P05161; PDB:2HJ8A; DEPLSILVRNNKGRSSTYEVRLTQTVAHLKQQVSGLEGVQDDLFWLTFEGKPLEDQLPLG --------------------3333-----------------------------3333333 EYGLKPLSTVFMNLR 3---1111------- >QUORUM-SENSING ANTIACTIVA; SWP:Q20HX4; PDB:2HJDA; FELRPVIGLTRGLSSADIETLTANAIRLHRQLLEKADQLFQVLPDDIKIGTAAGGEQHLE --3333---2222--------------------------1111---3333---------- YIEAMIEMHAQMSAVNTLVGLLGFIPKVS ----------------------------- >AUTOINDUCER 2 SENSOR KINA; SWP:P54302; PDB:2HJEA; SKQQTSALIHNIFDSHFAAIQIHHDSNSKSEVIRDFYTDRDTDVLNFFFLSIDQSDPSHT ---------------------------------3333------------------1111- PEFRFLTDHKGIIWDDGNAHFYGVNDLILDSLANRVSFSNNWYYINVMTSIGSRHMLVRR ----------------3333----------------------------1111-------- VPILDPSTGEVLGFSFNAVVLDNNFALMEKLKSESNVDNVVLVANSVPLANSLIGDEPYN --------------------2222--------1111-------!!!!------------3 VADVLQLLVIETPIVVNAVTTELCLLTVQD 333------------%%%%----------- >GTP-BINDING PROTEIN ENGA; SWP:P50743; PDB:2HJGA; KPVVAIVGRPNVGKSTIFNRIAGERISRIYSSAEWLNYDFNLIDPFLAQIRQQAEIADEA --------------------------------3333------------------------ DVIIFVNGREGVTAADEEVAKILYRTKKPVVLAVNKLDNNIYDFYSLGFGEPYPISGTHG ------3333-----------3333-----------------3333-------------- LGLGDLLDAVAEHFKNIPETKYNEEVIQFCLIGRPNVGKSSLVNALGEERVIVSVDTSFT ------------3333------3333-------2222----------3333--------- YNQQEFVIVDTAGRKKGKVYETTEKYSVLRALKAIDRSEVVAVVLDGEEGIIEQDKRIAG iiii-----3333--------------------------------3333--3333----- YAHEAGKAVVIVVNKWDAVDKDESTKEFEENIRDHFQFLDYAPILFSALTKKRIHTLPAI --1111--------3333---1111----------1111-----------2222------ IKASENHSLRVQTNVLNDVIDAVANPTPTHNGSRLKIYYATQVSVKPPSFVVFVNDPELH -----1111-----------3333-----iiii----------------------3333- FSYERFLENRIRDAFGFEGTPIKIFARARK 
----------------2222---------- >HYPOTHETICAL PROTEIN YKFF; SWP:P75677; PDB:2HJJA; GPFTRRQAQAVTTTYSNITLEDDQGSHFRLVVRDTEGRMVWRAWNFEPDAGEGLNRYIRT --------------1111---------------1111-----------3333-----%%% SGIRTD %----- >MAINTENANCE OF PLOIDY PRO; SWP:P40484; PDB:2HJNA; PVLTTNVTDFNYTPSHQKPFLDIKQIVETLGSEGVAVKLPRGEDENEWLAVHCVDFYNQI -------1111------------------3333------2222----------------- NLYGSITEFCSPQTCPRIATNEYEYLWAFQKGQPPVSVSAPKYVECLRWCQDQFDDESLF -33333333-3333-----1111------2222----------------------1111- PSKVTGTFPEGFIQRVIQPILRRLFRVYAHIYCHHFNEILELNLQTVLNTSFRHFCLFAQ --1111--2222---------------------------1111----------------- EFELLRPADFGPLLELVELRD -----3333!!!!3333---- >PHOSPHONOPYRUVATE HYDROLA; SWP:Q84G06; PDB:2HJPA; MTKNQALRAALDSGRLFTAMAAHNPLVAKLAEQAGFGGIWGSGFELSASYAVPDANILSM ----------3333---------------------------------1111--------- STHLEMMRAIASTVSIPLIADIDTGFGNAVNVHYVVPQYEAAGASAIVMEDKTFPKDTQE ---------------------!!!!----------------------------------- LVRIEEFQGKIAAATAARADRDFVVIARVEALIAGLGQQEAVRRGQAYEEAGADAILIHS -------------------3333------3333--------------------------- RQKTPDEILAFVKSWPGKVPLVLVPTAYPQLTEADIAALSKVGIVIYGNHAIRAAVGAVR ------------------------1111----------1111------------------ EVFARIRRDGGIREVDAALPSVKEIIELQGDERMRAVEARYLK --------------3333------------------------- >MALATE DEHYDROGENASE; SWP:Q5CYZ3; PDB:2HJRA; MRKKISIIGAGQIGSTIALLLGQKDLGDVYMFDIIEGVPQGKALDLNHCMALIGSPAKIF ----------3333-------1111----------------------------------- GENNYEYLQNSDVVIITAGVPRKPNMTRSDLLTVNAKIVGSVAENVGKYCPNAFVICITN ---33332222-----------22223333-------------------1111------- PLDAMVYYFKEKSGIPANKVCGMSGVLDSARFRCNLSRALGVKPSDVSAIVVGGHGDEMI 3333-----------1111-----------------------3333---------1111- PLTSSVTIGGILLSDFVEQGKITHSQINEIIKKTAFGGGEIVELLKTGSAFYAPAASAVA -3333--iiii3333-1111---------------------------------------- MAQAYLKDSKSVLVCSTYLTGQYNVNNLFVGVPVVIGKNGIEDVVIVNLSDDEKSLFSKS ----1111-------------%%%%-----------1111-------------------- VESIQNLVQDLKS ------------- >USG-1 
PROTEIN HOMOLOG; SWP:O87014; PDB:2HJSA; QPLNVAVVGATGSVGEALVGLLDERDFPLHRLHLLASAESAGQRGFAESSLRVGDVDSFD --------1111----------1111-------------2222--iiii-----3333-3 FSSVGLAFFAAAAEVSRAHAERARAAGCSVIDLSGALEPSVAPPVVSVNAERLASQAAPF 333--------3333--------1111-------1111----------33331111---- LLSSPCAVAAELCEVLAPLLATLDCRQLNLTACLSVSSLGREGVKELARQTAELLNARPL ----------------3333---------------------------------------- EPRLFDRQIAFNLLAQVGAVDAEGHSAIERRIFAEVQALLGERIGPLNVTCIQAPVFFGD --------------------1111---------------!!!!----------------- SLSVTLQCAEPVDLAAVTRVLDATKGIEWVGEGDYPTVVGDALGQDETYVGRVRAGQADP -----------------------2222---iiii--3333-2222-------------11 CQVNLWIVSDNVRKGAALNAVLLGELLIKHYL 11------------------------------ >ATP-DEPENDENT RNA HELICAS; SWP:P42305; PDB:2HJVA; TTRNIEHAVIQVREENKFSLLKDVLMTENPDSCIIFCRTKEHVNQLTDELDDLGYPCDKI ------------3333----------------------------------1111------ HGGMIQEDRFDVMNEFKRGEYRYLVATDVAARGIDIENISLVINYDLPLEKESYVHRTGR 1111----------------------33332222---------------3333--1111- TGRAGNKGKAISFVTAFEKRFLADIEEYIGFEIQKIEA --iiii--------1111-------------------- >D-PSICOSE 3-EPIMERASE; SWP:Q8U6Q7; PDB:2HK0A; KHGIYYSYWEHEWSAKFGPYIEKVAKLGFDIIEVAAHHINEYSDAELATIRKSAKDNGII -----3333--3333-3333--------------33331111------------1111-- LTAGIGPSKTKNLSSEDAAVRAAGKAFFERTLSNVAKLDIHTIGGALHSYWPIDYSQPVD -------1111------------------------1111------1111----------- KAGDYARGVEGINGIADFANDLGINLCIEVLNRFENHVLNTAAEGVAFVKDVGKNNVKVL ------------------3333---------3333------------------1111--- DTFHNIEEDSFGDAIRTAGPLLGHFHTGESNRRVPGKGRPWHEIGLALRDINYTGAVIEP 3333-------------!!!!-------1111-2222-----------1111-------- FVKTGGTIGSDIKVWRDLSGGADIAKDEDARNALAFSRFVLGG ---------1111-----%%%%3333----------------- >TYROSINE-PROTEIN KINASE H; SWP:P08631; PDB:2HK5A; AQKPWEKDAWEIPRESLKLEKKLGAGQFGEVWMATYNKHTKVAVKTMKPGSMSVEAFLAE --1111-1111-1111---------1111------------------2222-3333---- ANVMKTLQHDKLVKLHAVVTKEPIYIITEFMAKGSLLDFLKSDEGSKQPLPKLIDFSAQI 
--------1111------------------1111--------3333--3333-------- AEGMAFIEQRNYIHRDLRAANILVSASLVCKIADFGLARVIEDNEYTAREGAKFPIKWTA -----------------3333---1111------1111---%%%%---1111--3333-3 PEAINFGSFTIKSDVWSFGILLMEIVTYGRIPYPGMSNPEVIRALERGYRMPRPENCPEE 333------3333-----------1111----2222--------1111-----1111333 LYNIMMRCWKNRPEERPTFEYIQSVLDDF 3----3333--3333----------1111 >OUTER SURFACE PROTEIN A; SWP:P14013; PDB:2HKDA; NSVSVDLPGSMKVLVSKSSNADGKYDLIATVDALELSGTSDKNNGSGVLEGVKADASKVK ------------------------------%%%%------------------1111---- LTISDDLGQTTLEVFKSDGSTLVSKKVTSKDKSSTEEKFNEKGELSEKKITRADKSSTEE ---1111--------1111---------1111-------1111--------1111----- KFNEKGELSEKKITRADKSSTEEKFNEKGELSEKKITRADKSSTEEKFNEKGEVSEKIIT --1111--------1111-------1111--------1111-------1111-------- RADGTRLEYTGIKSDGSGKAKEVLKGYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGEVS 1111--------1111-------2222------3333------!!!!------1111--- VELNDTDSSAATKKTAAWNSGTSTLTITVNSKKTKDLVFTSSNTITVQQYDSNGTSLEGS --------3333----------------%%%%-------1111-------1111------ AVEITKLDEIKNALK -----3333--1111 >TYPE II DNA TOPOISOMERASE; SWP:O05207; PDB:2HKJA; LSPAEFFKRNPELAGFPNPARALYQTVRELIENSLDATDVHGILPNIKITIDLIDDARQI ---------3333------------------------3333--------------1111- YKVNVVDNGIGIPPQEVPNAFGRVLYSSKYVNRQTRGMYGLGVKAAVLYSQMHQDKPIEI ------------3333-3333-----------------3333------------------ ETSPVNSKRIYTFKLKIDINKNEPIIVERGSVENTRGFHGTSVAISIPGDWPKAKSRIYE ---2222---------------------------2222-----------3333------- YIKRTYIITPYAEFIFKDPEGNVTYYPRLTNKIPKPPQEVKPHPYGVDREEIKILINNLK --------1111-----1111---------------------3333---------1111- RDYTIKEFLVNEFQSIGDTTADKILELAGLKPNKKVKNLTEEEITRLVETFKKYEDFRSP ------------------------------11113333---------------------- SADSLSVIGEDLIELGLKKIFNPDFAASITRKPKAYQGHPFIVEAGVAFGGSIPVGEEPI -1111------------------------------iiii----------!!!!------- VLRYANKIPLIYDEKSDVIWKVVEELDWKRYGIESDQYQMVVMVHLCSTKIPYKSAGKES ----iiii----3333----------3333------------------------2222-- 
IAEVEDIEKEIKNALMEVARKLKQYLSEKRKEQEAKKKLLA ---3333---------------------------------- >PUTATIVE TRANSCRIPTIONAL ; SWP:Q83Q96; PDB:2HKTA; ATLTEDDVLEQLDAQDNLFSFKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSG ---1111---------3333--------------3333-------------3333-2222 PLDDIDVALRLIYALGKDKWLYADITHFSQYWHYLNEQDETPGFADDITWDFISNVNSIT ---3333-----------------------------------1111-----33333333- RNATLYDALKAKFVWSEARFSGVKTALTLAVTTTLKELT --3333-----------------------------1111 >A PUTATIVE TRANSCRIPTIONA; SWP:Q0SB15; PDB:2HKUA; QTRDALFTAATELFLEHGEGVPITQICAAAGAHPNQVTYYYGSKERLFVEVACAAVLRAG -----------------33333333-------3333------------------------ KRAEDDAATAETVGDYTEKLVGSLLGPGAPSVELFTSALTGRRSELRDLITDTLRTLHSS -----3333----------------1111-------------3333-------------- GEVALIRTLRTGWQLRAGIDVESKAFWSAIFGLVIQKTATGESFGYSLEEAVAVIFANLQ ------------------------------------------iiii-------------- IPETVRNT -------- >HYPOTHETICAL PROTEIN; SWP:Q41IB9; PDB:2HKVA; GTDWQQALDRHVGVGVRTTRDLIRLIQPEDWDKRPISGKRSVYEVAVHLAVLLEADLRIA ----------------------33333333-----2222-------------------11 TGATADEAQFYAVPVLPEQLVDRLDQSWQYYQDRLADFSTETTYWGVTDSTTGWLLEAAV 11-3333--1111--1111-----------------------1111-------------- HLYHHRSQLLDYLNLLGYDIKLDLF -------------1111-------- >RIBONUCLEASE 7; SWP:Q9H1E1; PDB:2HKYA; MKPKGMTSSQWFKIQHMQPSPQACNSAMKNINKHTKRCKDLNTFLHEPFSSVAATCQTPK -----------------------------------------------3333--------- IACKNGDKNCHQSHGPVSLTMCKLTSGKYPNCRYKEKRQNKSYVVACKPPQKKDSQQFHL ------------------------------------------------------------ VPVHLDRVL --------- >THREONYL-TRNA SYNTHETASE; SWP:Q9UZ14; PDB:2HL0A; MRVLLIHSDYIEYEVKDKALKNPEPISEDMKRGRMEEVLVAFISVEKVDEKNPEEVSLKA --------------------------3333---------------3333----------- IEEISKVAEQVKAENVFVYPFAHLSSELAKPSVAMDILNRVYQGLKERGFNVGKAPFGYY --------------------1111---------------------1111----------- KAFKISCKGHPLAELSRTIVPEE ---------1111---------- >ATP SYNTHASE ALPHA CHAIN,; SWP:P07251; PDB:2HLDA; NLNETGRVLAVGDGIARVFGLNNIQAEELVEFSSGVKGMALNLEPGQVGIVLFGSDRLVK 
-----------iiii-----11112222---1111--------1111-------3333-2 EGELVKRTGNIVDVPVGPGLLGRVVDALGNPIDGKGPIDAAGRSRAQVKAPGILPRRSVH 222-------------3333-----1111------------------------------- EPVQTGLKAVDALVPIGRGQRELIIGDRQTGKTAVALDTILNQKRWNNGSDESKKLYCVY ------3333------2222---------------------33331111-1111------ VAVGQKRSTVAQLVQTLEQHDAMKYSIIVAATASEAAPLQYLAPFTAASIGEWFRDNGKH -----3333--------11113333------11113333--------------------- ALIVYDDLSKQAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSEKEGSGSLTA -------------------1111----%%%%----------1111-----1111------ LPVIETQGGDVSAYIPTNVISITDGQIFLEAELFYKGIRPAINVGLSVSRVGSAAQVKAL ------iiii--------1111--------------------3333--33331111---- KQVAGSLKLFLAQYREVAAFAQSDLDASTKQTLVRGERLTQLLKQNQYSPLATEEQVPLI ----------------3333--------------------3333-------3333----- YAGVNGHLDGIELSRIGEFESSFLSYLKSNHNELLTEIREKGELSKELLASLKSATESFV -----1111--3333--------------------------------------------- AT -- >ATP synthase subunit beta; SWP:P00830; PDB:2HLDD; STPITGKVTAVIGAIVDVHFEQSELPAILNALEIKTPQGKLVLEVAQHLGENTVRTIAMD -----------!!!!-----------2222-------------------%%%%------- GTEGLVRGEKVLDTGGPISVPVGRETLGRIINVIGEPIDERGPIKSKLRKPIHADPPSFA -22222222-------------1111-----1111----------------------333 EQSTSAEILETGIKVVDLLAPYARGGKIGLFGGAGVGKTVFIQELINNIAKAHGGFSVFT 3-----------3333------2222------2222-------------3333------- GVGERTREGNDLYREMKETGVINLEGESKVALVFGQMNEPPGARARVALTGLTIAEYFRD ----3333---------------------------33333333----------------- EEGQDVLLFIDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGLLQERITTTKKGSVT ----------------------3333------%%%%1111------3333---3333--- SVQAVYVPADDLTDPAPATTFAHLDATTVLSRGISELGIYPAVDPLDSKSRLLDAAVVGQ ------22221111-----1111-----------1111-----------11113333--- EHYDVASKVQETLQTYKSLQDIIAILGMDELSEQDKLTVERARKIQRFLSQPFAVAEVFT --------------------------3333----------------3333--11113333 GIPGKLVRLKDTVASFKAVLEGKYDNIPEHAFYMVGGIEDVVAKAEKLAA -----------------------11113333-----3333---------- >ATP synthase gamma chain,; SWP:P38077; PDB:2HLDG; 
ATLKEVEMRLKSIKNIEKITKTMKIVASTRLSKAEKAKISAKKMDEAEQLFYKNAETKNK ------------------------------------------------------------ ELIVAITSDKGLCGSIHSQLAKAVRRHLNDQPNADIVTIGDKIKMQLLRTHPNNIKLSIN -------------------------1111-1111----------------1111------ GIGKDAPTFQESALIADKLLSVMKAGTYPKISIFYNDPVSSLSFEPSEKPIFNAKTIEQS -----------------------3333------------1111----------------- PSFGKFEIDTDANVPRDLFEYTLANQMLTAMAQGYAAEISARRNAMDNASKNAGDMINRY -1111--------3333------------------------------------------- SILYNRTRQAVITNELVDIITGASS --------------------1111- >ATP synthase delta chain,; SWP:Q12165; PDB:2HLDH; KLQFALPHETLYSEVTQVNLPAKSGRIGVLANHVPTVEQLLPGVVEVMEGSNSKKFFISG -----3333---------------------------------------!!!!-------- GFATVQPDSQLCVTAIEAPLESFSENIKNLLAEAKKNVSSSREAAEAAIQVEVLENLQSV -----1111---------------3333------3333------------------1111 >HYPOTHETICAL PROTEIN; SWP:Q88R33; PDB:2HLJA; GPALITYRTTVQEDWVDYNGHLRDAFYLLIFSYATDALDRIGLDADSRGQSGNSLFTLEA -----------1111-1111------------------1111-1111--iiii------- HINYLHEVKLGTEVWVQTQILGFDRKRLHVYHSLHRAGFDEVLAASEQLLHVDLQSAPFG --------2222-----------------------2222--------------------- HTTVCRLNHLVEQQEGAQAPQYGRTIKLPA -------------1111------------- >BONE MORPHOGENETIC PROTEI; SWP:NA; PDB:2HLRA; ALCAFKDPYNGTILCSKGSTCYGLWNLVKQGCWSHIGDPQECHYEECVVTYRFCCCSTDL ---------------1111------------------1111----------------222 CNVNFTE 2------ >PROTEIN DISULFIDE OXIDORE; SWP:Q9YDZ4; PDB:2HLSA; ARYYVLDLSEDFRRELRETLAEMVNPVEVHVFLSKSGCETCEDTLRLMKLFEEESPTRNG -------------------1111--------------1111----------------iii GKLLKLNVYYRESDSDKFSEFKVERVPTVAFLGGEVRWTGIPAGEEIRALVEVIMRLSED i-----------------------------!!!!--------!!!!----------1111 ESGLEDATKEALKSLKGRVHIETIITPSCPYCPYAVLLAHMFAYEAWKQGNPVILSEAVE -----------1111----------3333-3333------------1111---------- AYENPDIADKYGVMSVPSIAINGYLVFVGVPYEEDFLDYVKSAAEGRLTVKG 1111---3333---------iiii------------------1111------ >HYPOTHETICAL PROTEIN ATU2; SWP:Q8UD29; PDB:2HLYA; 
GYFEGMLIKQTDYFRIYRVINSLLISQNADPASASMYFSTFGAFILQQHYKVKAVPKGGL -----------------------3333--------------------------------- AAYNLGGTVLLFADHREYVTGAGENFHCWVEADGWAIDFMAPAFSEGTDALAVPAKMFQR ----iiii--------------1111-----iiii--1111-3333--3333-------- PLSAMAASINDLGQSGDFFYRSEPEATARRFADWHKQAMIGDMASVAANWFRKSPKQMAA 3333---1111--2222-----------11113333------------------------ SLSVTDRDGKARTVPLTGEMLTGAW -----1111---------------- >KETOHEXOKINASE; SWP:P50053; PDB:2HLZA; GSQILCVGLVVLDVISLVDKYPKEDSEIRCLSQRWQRGGNASNSCTILSLLGAPCAFGSA ----------------------2222---------------------------------- PGHVADFVLDDLRRYSVDLRYTVFQTTGSVPIATVIINEASGSRTILYYDRSLPDVSATD ------------1111--1111---------------3333---------------3333 FEKVDLTQFKWIHIEGRNASEQVKLQRIDAHNTRQPPEQKIRVSVEVEKPREELFQLFGY 11113333-----------------------11111111----------------3333- GDVVFVSKDVAKHLGFQSAEEALRGLYGRVRKGAVLVCAWAEEGADALGPDGKLLHSDAF -----------1111----------3333-2222-----!!!!-----3333-------- PPPRVVDTLGAGDTFNASVIFSLSQGRSVQEALRFGCQVAGKKCGLQGFDGIV -------2222-----------1111----------------1111--1111- >Pyrin domain-containing p; SWP:Q8WXC3; PDB:2HM2Q; MGTKREAILKVLENLTPEELKKFKMKLGTVPLREGFERIPRGALGQLDIVDLTDKLVASY --3333-----1111---------3333------------1111---------------- YEDYAAELVVAVLRDMRMLEEAARLQRAA ------------3333--3333--1111- >Probable tRNA (5-methylam; SWP:Q97T38; PDB:2HMAA; SDNSKTRVVVGSGGVDSSVTALLLKEQGYDVIGIFKNWDDTDCTATEDYKDVVAVADQIG -3333-------------------1111-------------------------------- IPYYSVNFEKEYWDRVFEYFLAEYRAGRTPNPDVCNKEIKFKAFLDYAITLGADYVATGH -----------------------1111---3333--------------1111-------- YARVARDEDGTVHLRGVDNGKDQTYFLSQLSQEQLQKTFPLGHLEKPEVRRLAEEAGLST ------3333-------3333-33331111333311111111--3333-----1111111 AKKKDSTGICFIGEKNFKNFLSNYLPAQPGRTVDGRDGEHAGLYYTIGQRGGLGIGGDNA 1-------3333---3333-3333-------1111----------2222--2222----- PWFVVGKDLSKNILYVGQGFYHDSLSTSLEASQVHFTREPEEFTLECTAKFRYRQPDSKV --------1111------1111-----------------------------1111----- 
TVHVKGEKTEVIFAEPQRAITPGQAVVFYDGEECLGGGLIDNAYRDGQVCQYI --------------------2222-----!!!!-----------%%%%----- >DIHYDRODIPICOLINATE SYNTH; SWP:Q8U6Y1_AGRT5; PDB:2HMCA; TASIFSGVIPALTPCRQDRTPDFDALVRKGKELIADGSAVVYCGSGDWPLLTDEQREGVE --1111---------1111--------------1111---------3333-3333----- RLVKAGIPVIVGTGAVNTASAVAHAVHAQKVGAKGLVIPRVLSRGSVIAAQKAHFKAILS --1111-------------------------------------1111------------- AAPEIPAVIYNSPYYGFATRADLFFALRAEHKNLVGFKEFGGPADRYAAENITSRDDEVT -1111------3333---------------1111-------3333--------------- LIGVDTAVVHGFVNCGATGAITGIGNVLPKEVIHLCKLSQAAAKGDADARARALELEQAL ---1111---------------3333---------------1111--------------- AVLSSFDEGPDLVLYFKYVLKGDKEYTLHFNETDALTDSQRGYVEAQFKLFNSWYADWSK -33331111-333333331111-1111---1111-------------------------- >PROBABLE ASPARTOKINASE; SWP:Q57991; PDB:2HMFA; TTVMKFGGTSVGSGERIRHVAKIVTKRKKEDDDVVVVVSAMSEVTNALVEISQQALDVRD -------3333---------------3333------------------------------ IAKVGDFIKFIREKHYKAIEEAIKSEEIKEEVKKIIDSRIEELEKVLIGVAYLGELTPKS ------------------------------------------------------------ RDYILSFGERLSSPILSGAIRDLGEKSIALEGGEAGIITDNNFGSARVKRLEVKERLLPL --------------------1111------3333------------------3333---- LKEGIIPVVTGFIGTTEEGYITTLGRGGSDYSAALIGYGLDADIIEIWTDVSGVYTTDPR 1111-----------1111-----2222-----------------------------111 LVPTARRIPKLSYIEAMELAYFGAKVLHPRTIEPAMEKGIPILVKNTFEPESEGTLITND 11111--------------111133331111----1111------1111----------- MEMSDSIVKAISTIKNVALINIFGAGMVGVSGTAARIFKALGEEEVNVILISQGSSETNI ---1111-----------------------------------------------1111-- SLVVSEEDVDKALKALKREFGDFLNNNLIRDVSVDKDVCVISVVGAGMRGAKGIAGKIFT -----1111-------------------------------------3333---------- AVSESGANIKMIAQGSSEVNISFVIDEKDLLNCVRKLHEKFIEK --1111-------------------3333--------------- >SUPPRESSOR OF CYTOKINE SI; SWP:O35718; PDB:2HMHA; SEYQLVVNAVRKLQESGFYWSAVTGGEANLLLSAEPAGTFLIRDSSDQRHFFTLSVKTQS -------------------------------11112222-------1111-------111 GTKNLRIQEGGSFSLQSDPRSTQPVPRFDVLKLVHHYMPPQAYYIYKIPLVLSRPLSSN 
1-------iiii-----1111-------------1111--------------------- >HIV-1 REVERSE TRANSCRIPTA; SWP:NA; PDB:2HMIC; DIQMTQTTSSLSASLGDRVTISCSASQDISSYLNWYQQKPEGTVKLLIYYTSSLHSGVPS -------------2222-----------%%%%------1111------------222233 AFSGSGSGTDYSLTISNLEPEDFATYYCQQYSKFPWTFGGGTKLEIKRADAAPTVSIFPP 33----------------1111-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVAWAIDGSAAANGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------------------------------------- LTADEYEAANSYTCAATHKTSTSPIVKSFNANEC ---------------------------------- >YUAA PROTEIN; SWP:O32080; PDB:2HMVA; NKQFAVIGLGRFGGSIVKELHRMGHEVLAVDINEEKVNAYASYATHAVIANATEENELLS ---------3333--------------------------1111-------1111----33 LGIRNFEYVIVAIGANIQASTLTTLLLKELDIPNIWVKAQNYYHHKVLEKIGADRIIHPE 331111--------------------3333------------------------------ KDMGVKIAQSLSDENVLNY ------------------- >HEMERYTHRIN; SWP:P02246; PDB:2HMZA; GFPIPDPYCWDISFRTFYTIVDDEHKTLFNGILLLSQADNADHLNELRRCTGKHFLNEQQ ----------3333---------------------------------------------- LMQASQYAGYAEHKKAHDDFIHKLDTWDGDVTYAKNWLVNHIKTIDFKYRGKI --11111111------------------------------------1111--- >PROTEIN MIOC; SWP:MIOC_ECOLI; PDB:2HNAA; MADITLISGSTLGGAEYVAEHLAEKLEEAGFTTETLHGPLLEDLPASGIWLVISSTHGAG ------------------------------------------------------------ DIPDNLSPFYEALQEQKPDLSAVRFGAIGIGSREYDTFCGAIDKLEAELKNSGAKQTGET -----3333----------1111------------------------------------- LKINILDHDIPEDPAEEWLGSWVNLLK --------------------------- >HYPOTHETICAL PROTEIN; SWP:Q97PP5; PDB:2HNGA; ANLKREQEFVSQYHFDARNFEWENENGAPETKVDVNFQLLQHDQENQVTSLIVILSFIVF ------------------3333---------------------1111------------1 DKFVISGTISQVNHIDGRIVNEPSELNQEEVETLARPCLNLNRLTYEVTEIALDLPGINL 111------------------3333-----------3333----------1111------ EF -- >DNA POLYMERASE III ALPHA ; SWP:P10443; PDB:2HNHA; MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGLVKFYGAGHGA ------------1111---------------1111-----------1111---------- 
GIKPIVGADFNVQCDLLGDELTHLTVLAANNTGYQNLTLLISKAYQRGYGAAGPIIDRDW -------------3333---------------------------3333-3333---1111 LIELNEGLILLSGGRMGDVGRSLLRGNSALVDECVAFYEEHFPDRYFLELIRTGRPDEES ----2222-----1111------------------------2222---------2222-- YLHAAVELAEARGLPVVATNDVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYM -------------------------1111--------------1111-------1111-- RSEEEMCELFADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGL ---------1111----------3333---------------!!!!-------------- EERLAFLFPDEEERLKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQWSKDNGVPVGP ------------------------------------------------------------ GRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMPDFDVDFCMEKRDQVIEHVAD -!!!!------1111----3333---3333--1111----------3333---------- MYGRDAVSQIITFGTMAAKAVIRDVGRVLGHPYGFVDRISKLIPPDPGMTLAKAFEAEPQ --1111--------------------1111-3333---3333---1111--------333 LPEIYEADEEVKALIDMARKLEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQ 3-------3333--------2222----------------3333------1111------ FDKSDVEYAGLVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDML -----------------------------------3333-----3333-----------1 QRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRPGPLQSGMVDNFIDRKHGREE 111-2222------------------3333--3333----3333---------------- ISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGYTLGGADMLRRAMGKKKPEEM ----3333-3333----1111----3333-----------------------------33 AKQRSVFAEGAEKNGINAELAMKIFDLVEKFAGYGFNKSHSAAYALVSYQTLWLKAHYPA 33---------1111----------------1111------------------------- EFMAAVMTADMDNTEKVVGLVDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKG ---------1111-----------1111------1111-------1111--------222 VGEGPIEAIIEARNKGGYFRELFDLCARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMN 2-------------------3333------------------1111-------3333--- SLGDALKAAD ------3333 >Prothrombin B-chain [Frag; SWP:Q69EZ7; PDB:2HNTE; EKISMLEKIYIHPRYNWRENLDRDIALMKLKKPVAFSDYIHPVCLPDRETAASLLQAGYK -----------1111-------------------------------3333-----2222- GRVTGWG ------- >FATTY ACID-BINDING PROTEI; SWP:FABPA_HUMAN; PDB:2HNXA; 
MCAFVGTWKLVSSENFDDYMKEVGVGFATRKVAGMAKPNMIISVNGDVITIKSESTFKNT 11------------------------------3333--------!!!!------3333-- EISFILGQEFDEVTADDRKVKSTITLDGGVLVHVQKWDGKSTTIKRKREDDKLVVECVMK ----2222-----1111---------iiii------%%%%--------!!!!------ii GVTSTRVYERA ii--------- >TYPE 4 FIMBRIAL BIOGENESI; SWP:Q9HXJ2; PDB:2HO1A; KGRDEARDAYIQLGLGYLQRGNTEQAKVPLRKALEIDPSSADAHAALAVVFQTEEPKLAD -----------------11113333-----------1111-------------------- EEYRKALASDSRNARVLNNYGGFLYEQKRYEEAYQRLLEASQDTLYPERSRVFENLGLVS ---------1111-----------1111--------------1111-3333--------- LQKKPAQAKEYFEKSLRLNRNQPSVALEADLLYKEREYVPARQYYDLFAQGGGQNARSLL --------------------------------1111------------------------ LGIRLAKVFEDRDTAASYGLQLKRLYPGSLEYQEFQAEK -------------------------1111---------- >OXIDOREDUCTASE, GFO/IDH/M; SWP:Q97PV8; PDB:2HO3A; MLKLGVIGTGAISHHFIEAAHTSGEYQLVAIYSRKLETAATFASRYQNIQLFDQLEVFFK ---------3333-----------------------------1111-------------- SSFDLVYIASPNSLHFAQAKAALSAGKHVILEKPAVSQPQEWFDLIQTAEKNNCFIFEAA ----------3333--------1111-----------3333--------1111------3 RNYHEKAFTTIKNFLADQVLGADFNYAKYSSKGGALMDLGIYPLYAAVRLFGKANDATYH 333-3333-------------------------3333----------------------- AQQLDNSIDLNGDGILFYPDYQVHIKAGKNITSNLPCEIYTTDGTLTLNTIEHIRSAIFT ---1111----------1111-------------------1111---------------- DHQGNQVQLPIQQAPHTMTEEVAAFAHMIQQPDLNLYQTWLYDAGSVHELLYTMRQTAGI ---------------1111----------------------------------------- RF -- >Haloacid dehalogenase-lik; SWP:Q6PEB2; PDB:2HO4A; ALKAVLVDLNGTLHIAVPGAQEALKRLRATSVVRFVTNTTKETKKDLLERLKKLEFEISE ----------------2222-------------------------------1111---33 DEIFTSLTAARNLIEQKQVRPLLLDDRALPEFTGVQTQDPNAVVIGLAPEHFHYQLLNQA 33----------------------33331111---------------1111--------- FRLLLDGAPLIAIHKARYYKRKDGLALGPGPFVTALEYATDTKAVVGKPEKTFFLEALRD ---1111-------------1111---------------------------------333 ADCAPEEAVIGDDCRDDVDGAQNIGLGILVKTGKYKAADEEKINPPPYLTCESFPHAVDH 3--3333-----3333----3333-------!!!!22221111----------------- ILQHLL ------ 
>N-ACETYLGLUCOSAMINE KINAS; SWP:Q9X0V1; PDB:2HOEA; ISRILKRIKSPVSRVELAEELGLTKTTVGEIAKIFLEKGIVVEEKDSPRPTKSLKISPNC --3333-----------------3333-----------------------------3333 AYVLGIEVTRDEIAACLIDASNILAHEAHPLPSQSDREETLNVYRIIDRAKDEKLGSKLS --------1111----------------------------------------1111---- ALTVAAPGPIDTERGIIIDPRNFPLSQIPLANLLKEKYGIEVWVENDADGAVGEKWYTKR ----------------------1111-------------------3333----------- DDSFAWILTGKGIGAGIIIDGELYRGENGYAGEIGYTRVFNGNEYVFLEDVCNENVVLKH ------------------iiii------------------------3333---------3 VLSGFSLAEARDSGDVRVKEYFDDIARYFSIGLLNLIHLFGISKIVIGGFFKELGENFLK 333------3333-3333------------------------------3333-------- KIKIEVETHLLYKHSVDSFSKVQEPVIAFGAAVHALENYLERVTTS ---------------------------------------------- >HD DOMAIN PROTEIN; SWP:Q836G9; PDB:2HONA; MTIPYKEQRLPIEKVFRDPVHNYIHVQHQVILDLINSAEVQRLRRIKQLGTSSFTFHGAE ---3333------------------------------3333--------3333--2222- HSRFSHSLGVYEITRRICEIFQRNYSVERLGENGWNDDERLITLCAALLHDVGHGPYSHT -------------------------3333-1111-1111-----------1111------ FEHIFDTNHEAITVQIITSPETEVYQILNRVSADFPEKVASVITKQYPNPQVVQMISSQI ------------------1111-----33331111------1111-------3333---- DADRMDYLLRDAYFTGTEYGTFDLTRILRVIRPYKGGIAFAMNGMHAVEDYIVSRYQMYV ------------11113333--3333-1111--1111---3333---------------- QVYFHPVSRGMEVILDHLLHRAKELFENPEFDYDLQASLLVPFFKGDFTLQEYLKLDDGV ------------------------------------3333------------1111---- LSTYFTQWMDVPDSILGDLAKRFLMRKPLKSATFTNEKESAATIADLPYDFYRPNKDRHR ---------------------------------------3333--33333333------- TDGSLVELATVSPLVAALAGQSQGDERFPKEMQGNKKHYDLFDETYREFSSYIHNGALVL ----------------------------------------------11113333------ >SEGMENTATION POLARITY HOM; SWP:P02836; PDB:2HOSA; RTAFSSEQLARLKREFNENRYLTERRRQQLSSELGLNEAQVKGWFKNMRAKIKKST ------------------------------------3333------------3333 >ALLIIN LYASE 1; SWP:Q01594; PDB:2HOXA; KMTWTMKAAEEAEAVANINCSEHGRAFLDGIISEGSPKCECNTCYTGPDCSEKIQGCSAD --1111--------1111--------1111--iiii-----2222--------------- 
VASGDGLFLEEYWKQHKEASAVLVSPWHRMSYFFNPVSNFISFELEKTIKELHEVVGNAA --------3333----3333----1111---------%%%%------------------- AKDRYIVFGVGVTQLIHGLVISLSPNMTATPDAPESKVVAHAPFYPVFREQTKYFDKKGY 1111------3333------1111-33331111-----------3333------------ VWAGNAANYVNVSNPEQYIEMVTSPNNPEGLLRHAVIKGCKSIYDMVYYWPHYTPIKYKA ----3333-----3333-------------------2222-----11113333------- DEDILLFTMSKFTGHSGSRFGWALIKDESVYNNLLNYMTKNTEGTPRETQLRSLKVLKEV -------3333---3333------------------------------------------ VAMVKTQKGTMRDLNTFGFKKLRERWVNITALLDQSDRFSYQELPQSEYCNYFRRMRPPS ------2222---3333--------------1111------------------------- PSYAWVKCEWEEDKDCYQTFQNGRINTQNGVGFEASSRYVRLSLIKTQDDFDQLMYYLKD ---------3333---------------3333---1111--------------------- MVKAK ----- >Glutamate-1-semialdehyde ; SWP:P24630; PDB:2HOYA; FKTIKSDEIFAAAQKLMPGGVSSPVRAFKSVGGQPIVFDRVKDAYAWDVDGNRYIDYVGT ------------11112222--3333-1111----------!!!!--1111------%%% WGPAICGHAHPEVIEALKVAMEKGTSFGAPCALENVLAEMVNDAVPSIEMVRFVNSGTEA %--1111-------------1111------3333----------3333------------ CMAVLRIMRAYTGRDKIIKFEGCYHGHADMANTLTTPYNDLEAVKALFAENPGEIAGVIL -------------------2222---------------------------2222------ EPIVGNSGFIVPDAGFLEGLREITLEHDALLVFDEVMTGFRIAYGGVQEKFGVTPDLTTL ------------2222--------1111---------2222-1111--1111-------- GKIIGGGLPVGAYGGKREIMQLVAPAGPMYQAGTLSGNPLAMTAGIKTLELLRQPGTYEY -1111-----------3333------------1111-----------------2222--- LDQITKRLSDGLLAIAQETGHAACGGQVSGMFGFFFTEGPVHNYEDAKKSDLQKFSRFHR ------------------------------------------33331111---------- GMLEQGIYLAPSQFEAGFTSLAHTEEDIDATLAAARTVMSAL -------------------1111------------------- >IDS-EPIMERASE; SWP:Q1L4E3; PDB:2HP0A; HFTTKLAEKVVSAWKAKISQPALKAAQDGVIDTVAAALGGVTEHSVQVALKYVAATGGSG ------------3333-----------------------1111----------3333--- DSKLWGVNQRSNFDAAFVNGAAHAIDFDDSFPVRGHPSSSLVPAIFAVGEHVGANGHNCL ---2222-------------1111------------3333-------------------- KSYVLGIEVVATLGRAVGKGHYLAGWHPTSTLGVFGATTAAALLLGADEEQLRNAWGIAA 
-------------------3333-------------------1111-----------333 SNSCGIIKNFGTTKPHTGSAARNGVLSAWLSQSFTGCQTVFDDAEGILAYGAQPGPELFN 3----3333-----------------------------11111111--------3333-- AQKFGTPWAIIAPGLYKKSWPSCYANHKPLAGLFAIKEHGLTGQDISHVDVGFLPGVEKP --2222-3333-----------3333----------1111-3333--------2222333 LLYDPRTTEEAKFSIEANIGAALLDGEVSLASFEIEHLDRPARAAKKVTRFDPSETTFSG 3-----3333----3333----------3333--3333--------------------11 TTGYTDIVVHTADGKIERRIEATPGSLEDPDDAHLERKFKDCTAWPFGESGLLFDRLRSL 11--------1111--------2222--------------1111-2222--------333 TADQGIKTVQP 3---3333--- >Glutamate-1-semialdehyde ; SWP:P24630; PDB:2HP1A; FKTIKSDEIFAAAQKLMPGGVSSPVRAFKSVGGQPIVFDRVKDAYAWDVDGNRYIDYVGT -----------3333-2222--3333-1111----------!!!!--1111------%%% WGPAICGHAHPEVIEALKVAMEKGTSFGAPCALENVLAEMVNDAVPSIEMVRFVNSGTEA %--1111-------------1111------3333----------3333------------ CMAVLRIMRAYTGRDKIIKFEGCYHGHADMFLVKAGSGVATLGLPSSPGVPKKTTANTLT -------------------2222----3333------3333---------3333------ TPYNDLEAVKALFAENPGEIAGVILEPIVGNSGFIVPDAGFLEGLREITLEHDALLVFDE -2222----------2222------------------2222--------1111------- VMTGFRIAYGGVQEKFGVTPDLTTLGKIIGGGLPVGAYGGKREIMQLVAPAGPMYQAGTL --2222-1111--1111---------3333-----------3333------------111 SGNPLAMTAGIKTLELLRQPGTYEYLDQITKRLSDGLLAIAQETGHAACGGQVSGMFGFF 1-----------------2222-------------------------------------- FTEGPVHNYEDAKKSDLQKFSRFHRGMLEQGIYLAPSQFEAGFTSLAHTEEDIDATLAAA -------33331111-----------------------------1111------------ RTVMSAL ---1111 >T-CELL SURFACE GLYCOPROTE; SWP:Q4ZG17; PDB:2HP4A; SQFRVSPLDRTWNLGETVELKCQVLLSNPTSGASWLFQPRGAAASPTFLLYLNQNKPKAA ------------2222-------------------------------------------2 EGLDTQRFSGKRLGDTFVLTLSDFRRENEGYYFCSALSNSIMYFSHFVPVFLPA 2221111-----!!!!--------1111---------iiii------------- >FLAGELLAR MOTOR SWITCH PR; SWP:Q9WZE6; PDB:2HP7A; PSKFSKEQLRTFQMIHENFGRALSTYLSGRLRTFVDVEISIDQLTYEEFIRSVMIPSFIV -------------------------------------------------1111------- 
IFTGDVFEGSAIFEMRLDLFYTMLDIIMGGPPNRPPTEIETSIMRKEVTNMLTLLAQAWS ---1111---------3333---------------------------------------- DFQYFIPSIENVETNPQFVQIVPPNEIVLLVTASVSWGEFTSFINVCWPFSLLEPLLEK --------------3333----1111----------!!!!--------3333-3333-- >ABC transporter, periplas; SWP:Q9WYF8; PDB:2HPGA; VFGAKYTLRFGHVLAPGEPYHQAFLKWAKAVEEKTNGDVRIEVFPSSQLGVEEDIIEQIR --------------2222--------------1111-----------------3333--- GAPVGWNTDSARLGYVKDIGVNLAYFIDFGAKTPEEAIEVLKKIKQSPTQKWLKELEQRF --------3333----------2222---------------------------------- GIKVLSFYWVQGYRHFVTNKPIRKPEDLNGLRIRTPGAPAWQESIRSLGAIPVAVNFGEI -----------------------33332222------3333--------------3333- YTAVQTRAVDGAELTYANVYNGGLYEVLKYSETGHFLLINFEIVSADWFNSLPKEYQKII ---1111---------------3333----------------------3333-------- EEEDKAGIEVSLKIKELEEEYKQKCIEKGAVIPASEIDKEAFEKAKQAYKNLGLENALNQ -------------------------1111---3333--3333------------------ LIKEVKGE -------- >DNA POLYMERASE III ALPHA ; SWP:Q9XDH5; PDB:2HPIA; LKFAHLHQHTQFSLLDGAAKLQDLLKWVKETTPEDPALAMTDHGNLFGAVEFYKKATAMG ---------1111--------------------------------3333-------1111 VKPIIGYEAYVAAESRFDRKRGGYFHLTLLAKDFTGYQNLVRLASRAYLEGFYEKPRIDR --------------1111------------------------------------------ EILREHAQGLIALSGCLGAEIPQFILQDRLDLAEARLNEDLSIFGDRFFIEIQNHGLPEQ ------2222-----1111-----1111---------------!!!!------------- KKVNQVLKEFARKYGLGMVATNDGHYVRKEDARAHEVLLAIQSKTTLDDPERWRFPCDEF ---------------------------3333---------3333---------------- YVKTPEEMRAMLPEAEWGDEPFDNTVEIARMCDVDLPIGDKMVYRIPRFPLGRTEAQYLR ------------1111--3333-------------------------------3333--- ELTFLGLLRRYPDRITEAFYREVLRLLDERALAEALARVEEKAWEELRKREWTAEAILHR -----------------------11113333---3333-3333----------------- ALYELSVIERMGFPGYFLIVQDYINWARGHGVSVGPGRGSAAGSLVAYAVGITNIDPLRF ---------------------------------------1111------------1111- GLLFERFLNPERVSMPDIDTDFSDRERDRVIQYVRERYGEDKVAQIGTFGSLASKAALKD --3333--1111--------------------------1111------------------ 
VARVYGIPHKKAEELAKLIPVQFGKPKPLQEAELRAEMEKDERIRQVIEVAMRLEGLNRH --1111--------1111---------3333--3333---3333---------3333--- ASVHAAGVVIAAEPLTDLVPLMRDQEGRPVTQYDMGAVEALGLLKMDFLGLRTLTFLDEA -------------3333------1111--------------------------------- RRIVKESKGVELDYDRLPLDDPKTFELLSRGETKGVFQLESGGMTATVRGLKPRRLEDII ------------1111----3333---1111-----------------------3333-- ALVSLYRPGPMEHIPTYIRRHHGQEPVSYAEFPHAEKYLRPILDETYGIPVYQEQIMQIA --------3333----------------33331111--3333---iiii--3333----- SQVAGYSLGEADLLRRAMGKKRVEEMQKHRERFVRGAKERGVPEEEANRLFDMLEAFANY --------------------------------------------------------3333 GFNKSHAAAYSLLSYQTAYVKAHYPVEFMAALLSVERHDSDKVAEYIRDARALGIPVLPP -----------------------------------1111--------------------- DVNRSGFDFKVVGEEILFGLSAVKNVGEMAARAILEERERGGPFKSLGDFLKRLPEQVVN 1111--------------33331111----------------------------3333-- KRALESLVKAGALDAFGDRARLLASLEPLLRWAAETRERGRSGLVGLFAEVEEPPLVEAS ------------1111----------------------1111------------------ PLDEITMLRYEKEALGIYVSGHPVLRYPGLREVASCTIEELSEFVRELPGKPKVLLSGMV ---------------------3333-----------3333-3333--------------- EEVRFTLSDETGALEVVKEDIPLLVLAEVERVLAQAVWTLEEVLEAPKALEVEVDHALLD --------------------------------------3333------------------ EKGARLKSLLDEHPGSLPVYLRVLGPFGEALFALREVRVGEEALGLLEAEGYRAYLVPDR ----------------------------------------------------------33 EVF 33- >Coagulation factor II var; SWP:Q53H04; PDB:2HPQP; CVPDRGQQYQGRLAVTTHGLPCLAWASAQAKALSKHQDFNSAVQLVENFCRNPDGDEEGV ---------------1111----1111-1111-1111----------------------- WCYVAGKPGDFGYCDLNYC ------2222--------- >coelenterazine-binding pr; SWP:P05938; PDB:2HPSA; EITESERAYHLRKKTRQRVDVTGDGFISREDYELIAVRIAKIAKLSAEKAEETRQEFLRV ----------------3333---------------------------------------- ADQLGLAPGVRISVEEAAVNATDSLLKKGEEKAAVIQSLIYDCIDTDKDGYVSLPEFKAF ------2222-------------1111!!!!----3333--3333--------------- LQAVGPDLTDDKAITCFNTLDFNKNGQISRDEFLVTVNDFLFGLEETALANAFYGDLVD ----3333------------1111---------------------------1111---- >NOSL PROTEIN; 
SWP:O68481; PDB:2HPUA; KAQIFLEGSPAPLFFSQVRDAIAYARGPEQIAPILVIYVNDMGAAGATWDQPGDGNWIAA -----2222-------3333--------3333---------------3333-------33 DKAFYVVGSARRGGMGAPEAVPFSSRDEAAAFVLAEGGQVLALADITDAMVLTPVETGSE 33----------1111-------------------------3333-3333---------- PRADDE ------ >FMN-DEPENDENT NADH-AZORED; SWP:Q831B2; PDB:2HPVA; SKLLVVKAHPLTKEESRSVRALETFLASYRETNPSDEIEILDVYAPETNPEIDEELLSAW -----------3333-----------------1111-----11113333----------- GALRAGAAFETLSENQQQKVARFNELTDQFLSADKVVIANPWNLNVPTRLKAWVDTINVA -------1111------------------3333--------%%%%--------3333-22 GKTFQYTAEGPKPLTSGKKALHIQSNGGFYEGKDFASQYIKAILNFIGVDQVDGLFIEGI 22----1111----------------------------------1111-----------3 DHFPDRAEELLNTATKATEYGKTF 3331111-----------3333-- >GREEN FLUORESCENT PROTEIN; SWP:NA; PDB:2HPWA; TEGAKLFEKEIPYITELEGDVEGMKFIIKGEGTGDATTGTIKAKYICTTGDLPVPWATIL 33331111------------iiii-----------1111-------1111----3333-3 SSLVFCFAKYPRHIADFFKSTQPDGYSQDRIISFDNDGQYDVKAKVTYENGTLYNRVTVK 3333333---1111-3333--------------2222-----------iiii-------- GTGFKSNGNILGMRVLYHSPPHAVYILPDRKNGGMKIEYNKAFDVMGGGHQMARHAQFNK ----1111-1111---------------3333------------2222------------ PLGAWEEDYPLYHHLTVWTSFGKDPDDDETDHLTIVEVIKAVDLETYR -----------------------1111-----------------3333 >GLUCOSE/RIBITOL DEHYDROGE; SWP:Q4CFD1; PDB:2HQ1A; MQLKGKTAIVTGSSRGLGKAIAWKLGNMGANIVLNGSPASTSLDATAEEFKAAGINVVVA 1111---------------------1111-------1111----------1111------ KGDVKNPEDVENMVKTAMDAFGRIDILVNNAWDDVLNTNLKSAYLCTKAVSKIMLKQKSG ------------------------------------------------------------ KIINITSQANYAASKAGLIGFTKSIAKEFAAKGIYCNAVAPGIIKTDMTDVLPDKVKEMY ----------------------------3333-------------3333----------3 LNNIPLKRFGTPEEVANVVGFLASDDSNYITGQVINIDGGL 3331111----------------3333----------iiii >PUTATIVE HEME/HEMOGLOBIN ; SWP:Q8X5N8; PDB:2HQ2A; SMNHYTRWLELKEQNPGKYARDIAGLMNIREAELAFARVTHDAWRMHGDIREILAALESV --------------2222------1111-3333--1111----------------3333- GETKCICRNEYAVHEQVGTFTNQHLNGHAGLILNPRALDLRLFLNQWASVFHIKENTARG 
--------1111---------------------2222-----3333----------1111 ERQSIQFFDHQGDALLKVYATDNTDMAAWSELLARFITDENTPLELKAVRADATVVEQEW --------1111--------1111-----------------------------------1 RAMTDVHQFFTLLKRHNLTRQQAFNLVADDLACKVSNSALAQILESAQQDGNEIMVFVGN 111-1111-------------------3333----1111--------------------1 RGCVQIFTGVVEKVVPMKGWLNIFNPTFTLHLLEESIAEAWVTRKPTSDGYVTSLELFAH 111-------------iiii------------1111----------------------11 DGTQIAQLYGQRTEGEQEQAQWRKQIASLIP 11----------2222----------1111- >HYPOTHETICAL PROTEIN PH15; SWP:O59278_PYRHO; PDB:2HQ4A; MQCEEKLEVFENGFKDEKFNVEVKFYGNDARKVLLAMIYELYLPEYGREYVYPFECAKEF ------------------------------------------------------------ WNIYLEGEEIQDQLKPIKFTSEQVIKKLQEEIKKIKPPLEIKIEEAKIYKTKEGYLAVGN -----3333--------------------------------3333-----1111------ YFILDPRGRLFIFNKPSIANKILKYIWKW --------------3333---3333---- >SEROLOGICALLY DEFINED COL; SWP:Q6UX04; PDB:2HQ6A; VPRGSEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRVVPGFI -2222-----------1111------------------------1111-------2222- VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADE ----1111-----------------1111----------------------------333 LNNKHTIFGKVTGDTVYNMLRLSEVDIDDDERPHNPHKIKSCEVLFNPFD 3----------1111----3333----1111---------------1111 >Protein, related to gener; SWP:Q97DI6; PDB:2HQ7A; IDEKFLIESNELVESSKIVVGTNGENGYPNIKARLKHDGLKKFWLSTNTSTRVERLKKNN ---3333-----1111-------2222----------!!!!------------------- KICLYFVDDNKFAGLLVGTIEILHDRASKELWTDGCEIYYPLGIDDPDYTALCFTAEWGN ------------------------3333------3333-1111--1111----------- YYRHLKNITFKIDEI --%%%%----3333- >MLL6688 PROTEIN; SWP:Q988L5; PDB:2HQ9A; GLVRTLSALECTKVLTANRVGRLACAKDGQPYVVPLYYAYSDAHLYAFSPGKKIEWRANP --------------------------iiii----------!!!!------3333------ RVSVQVDEHGQGRGWKSVVVDGRYEELPDLIGHKLQRDHAWSVLSKHTDWWEAPHVFFRI ----------!!!!---------------3333------------3333----------- LIEQVSGREASE ------------ >CYCLOPHILIN; SWP:Q4QBH1; PDB:2HQJA; MTNPKVFFDISIDNKAAGRIVMELYADTVPKTAENFRALCTGEKGKGRSGKPLHYKSSVF 
-----------------------------------------1111-3333----2222-- HRVIPNFMIQGGDFTRGNGTGGESIYGTTFRDESFSGKAGRHTGLGCLSMANAGPNTNGS -----------------------1111--------!!!!----2222------------- QFFICTAATPWLDGKHVVFGRVIDGLDVVKKVERLGSSSGKTRSRIVVSDCGEVAADKS --------3333------------3333----11113333------------------- >CYAN FLUORESCENT CHROMOPR; SWP:Q9U6Y3; PDB:2HQKA; GVIKPDMKIKLKMEGNVNGHAFVIEGEGEGKPYDGTNTINLEVKEGAPLPFSYDILTTAF ----------------iiii----------1111---------------------1111- NRAFTKYPDDIPNYFKQSFPEGYSWERTMTFEDKGIVKVKSDISMEEDSFIYEIHLKGEN 3333---1111-3333-------------------------------------------- FPPNGPVMQKKTTGWDASTERMYVRDGVLKGDVKHKLLLEGGGHHRVDFKTIYRAKKAVK -1111-------------------iiii----------2222------------------ LPDYHFVDHRIEILNHDKDYNKVTVYESAVARN ----------------1111------------- >GU4 NUCLEIC-BINDING PROTE; SWP:P46672; PDB:2HQTA; SDLVTKFESLIYPVSFTKEQSAQAAQWESVLKSGQIQPHLDQLNLVLRDNTFIVSTLYPT -----1111---1111-------------------1111-------1111-1111----- STDVHVFEVALPLIKDLVASSKDVKSTYTTYRHILRWIDYMQNLLEVSSTDKLEI ------------------------------------------1111-2222---- >AGR_C_4470P; SWP:Q7CX01; PDB:2HQVA; AYPSIAAQKNDDDRQARALAALAEKPEAIAAKAEVAPAEILAILPQGAAVSAPADRFDAI --------------------------3333-----3333-11112222----3333---- WNERGWGEILIVQTGDIVLEVPGHLPEGTESHGWFNIHGDSPIGGHIKKDNCAAITFVDR -------------1111-------------iiii-------------1111--------- GFHGRRSCSVWFNAAGGAFKIFVRRDENKELLAGQLAKFEELRDGFR ------------1111---------1111--------------1111 >P100 CO-ACTIVATOR TUDOR D; SWP:Q7KZF4; PDB:2HQXA; TQFQKLMENMRNDIASHPPVEGSYAPRRGEFCIAKFVDGEWYRARVEKVESPAKIHVFYI -------------------2222---2222-----1111-----------1111------ DYGNREVLPSTRLGTLSPAFSTRVLPAQAT --------3333----33333333------ >CONSERVED HYPOTHETICAL PR; SWP:Q8A1H2; PDB:2HQYA; MIPFKDITLADRDTITAFTMKSDRRNCDLSFSNLCSWRFLYDTQFAVIDDFLVFKFWAGE -------3333-------1111---3333-------3333-------%%%%------!!! 
QLAYMMPVGNGDLKAVLRKLIEDADKEKHNFCMLGVCSNMRADLEAILPERFIFTEDRAY !-----------------------1111-------------------2222------111 ADYIYLRSDLATLKGKKFQAKRNHINRFRNTYPDYEYTPITPDRIQECLDLEAEWCKVNN 1-------------3333-------------1111-----3333---------------3 CDQQEGTGNERRALIYALHNFEALGLTGGILHVNGKIVAFTFGMPINHETFGVHVEKADT 333------------------3333-------iiii----------------------11 SIDGAYAMINYEFANRIPEQYIYINREEDLGIEGLRKAKLSYQPVTILEKYMACLKDH 11-----------11113333-------iiii-------1111--------------- >COMPLEMENT C3 BETA CHAIN; SWP:P01024; PDB:2HR0A; SPMYSIITPNILRLESEETMVLEAHDAQGDVPVTVTVHDFPGKKLVLSSEKTVLTPATNH ------------------------------------------------------3333-- MTFTIPANRFKSEKGRNKFVTVQATFGTQVVEKVVLVSLQSGYLFIQTDKTIYTPGSTVL ------------------------------------------------------------ YRIFTVNHKLLPVGRTVMVNIENPEGIPVKQDSLSSQNQLGVLPLSWDIPELVNMGQWKI ------1111------------1111---------------------------------- RAYYENSPQQVFSTEFEVKEYVLPSFEVIVEPTEKFYYIYNEKGLEVTITARFLYGKKVE ---3333------------------------------1111-----------1111---- GTAFVIFGIQDGEQRISLPESLKRIPIEDGSGEVVLSRKVLLDGVQNLRAEDLVGKSLYV -----------------1111------%%%%---------1111----33332222---- SATVILHSGSDMVQAERSGIPIVTSPYQIHFTKTPKYFKPGMPFDLMVFVTNPDGSPAYR ------------------------------1111-----------------3333----- VPVAVQGEDTVQSLTQGDGVAKLSINTHPSQKPLSITVRTKKQELSEAEQATRTMQALPY ---------------1111--------------------------1111----------- STVGNSNNYLHLSVLRTELRPGETLNVNFLLRMDRAHEAKIRYYTYLIMNKGRLLKAGRQ -------------------2222-------------1111---------iiii------- VREPGQDLVVLPLSITTDFIPSFRLVAYYTLIGASGQREVVADSVWVDVKDSCVGSLVVK --2222------------------------------------------------------ SGQSEDRQPVPGQQMTLKIEGDHGARVVLVAVDKGVFVLNKKNKLTQSKIWDVVEKADIG ---------------------2222----------1111--------------1111--- CTPGSGKDYAGVFSDAGLTFTSSSGQQTAQRAELQCPQPAA ---------3333---------------------------- >HYPOTHETICAL PROTEIN; SWP:Q8KAL8; PDB:2HR2A; KPLKEVVGAYLALSDAQRQLVAGEYDEAAANCRRAEISHTPPEEAFDHAGFDAFCHAGLA 
3333-------------------------------3333-1111---------------- EALAGLRSFDEALHSADKALHYFNRRGELNQDEGKLWISAVYSRALALDGLGRGAEAPEF ---------------------3333--11113333-------------1111-1111--- KKVVEIEERKGETPGKEREVAIDRIAQLGA -----3333---2222----------3333 >PROBABLE TRANSCRIPTIONAL ; SWP:NA; PDB:2HR3A; PTNQDLQLAAHLRSQVTTLTRRLRREAQADPVQFSQLVVLGAIDRLGGDVTPSELAAAER -3333------------------1111-----------------------------1111 RSSNLAALLRELERGGLIVRHARTRVSLSSEGRRNLYGNRAKREEWLVRAHACLDESERA 3333--------1111-------------------------------------------- LLAAAGPLLTRLAQFEE -----------1111-- >INSULIN RECEPTOR; SWP:P06213; PDB:2HR7A; PGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLL -------------11111111-----------------3333-----1111--------- FRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNN --2222------1111--------!!!!------1111----1111------------11 ELCYLATIDWSRILDSVEDNHIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCW 11--11113333----1111----3333-------2222--------------------- THSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPP 1111-----3333-----1111---1111--------------------%%%%------- PYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTP ----%%%%--------------3333----------%%%%-----2222----------- CLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEE ----------2222-----33333333------------------3333-----1111-- ISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQ ---------3333--3333----------------------1111----3333------- GKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDKASCE -------1111-------------2222-1111------1111--- >GLUTAMYL-TRNA SYNTHETASE,; SWP:P46655; PDB:2HRAA; MPSTLTINGKAPIVAYAELIAARIVNALAPNSIAIKLVDDKKAPAAKLDDATEDVFNKIT -------1111-----------------2222---------------!!!!--------- SKFAAIFDNGDKEQVAKWVNLAQKELVIKNFAKLSQSLETLDSQLNLRTFILGGLKYSAA --3333------------------1111--------------1111---1111------- DVACWGALRSNGMCGSIIKNKVDVNVSRWYTLLEMDPIFGEAHDFLSKSLLELKKSANVG ----------3333----1111-------------3333--------------------- >CARBONYL REDUCTASE 
[NADPH; SWP:O75828; PDB:2HRBA; SRVALVTGANRGIGLAIARELCRQFSGDVVLTARDVARGQAAVQQLQAEGLSPRFHQLDI ----------------------------------------------3333--------11 DDLQSIRALRDFLRKEYGGLNVLVNNAAVAFKSDDPMPFDIKAEMTLKTNFFATRNMCNE 11-----------------------------1111------------------------- LLPIMKPHGRVVNISSLQCLRAFENCSEDLQERFHSETLTEGDLVDLMKKFVEDTKNEVH 3333-2222-------------1111---------1111--------------------3 EREGWPNSPYGVSKLGVTVLSRILARRLDEKRKADRILVNACCPGPVKTDMDGKDSIRTV 333----------------------------3333--------------1111------- EEGAETPVYLALLPPDATEPQGQLVHDKVVQNW -------------1111--------%%%%---- >FERROCHELATASE; SWP:Q7KZA3; PDB:2HRCA; RKPKTGILMLNMGGPETLGDVHDFLLRLFLDRDLMTLPIQNKLAPFIAKRLTPKIQEQYR ----------------3333------------------3333-----------------1 RIGGGSPIKIWTSKQGEGMVKLLDELSPNTAPHKYYIGFRYVHPLTEEAIEEMERDGLER 111-----------------------3333------------------------------ AIAFTQYPQYSCSTTGSSLNAIYRYYNQVGRKPTMKWSTIDRWPTHHLLIQCFADHILKE ----------1111---------------------------------------------- LDHFPLEKRSEVVILFSAHSLPMSVVNRGDPYPQEVSATVQKVMERLEYCNPYRLVWQSK 111111111111---------33331111---------------1111------------ VGPMPWLGPQTDESIKGLCERGRKNILLVPIAFTSDHIETLYELDIEYSQVLAKECGVEN ------------------1111--------------3333-------------------- IRRAESLNGNPLFSKALADLVHSHIQSNELCSKQLTLSCPLCVNPVCRETKSFFTSQQL ------!!!!---------------------3333---1111----------------- >PHOSPHOENOLPYRUVATE-PROTE; SWP:P23533; PDB:2HROA; QIKGIAASDGVAIAKAYLLVEPDLSFDNESVTDTDAEVAKFNGALNKSKVELTKIRNNAE ------------------------------------------------------------ KQLGADKAAIFDAHLLVLEDPELIQPIEDKIKNESVNAAQALTDVSNQFITIFESMDNEY -------------------1111--------1111------------------------- MAERAADIRDVSKRVLAHILGVELPNPVVIIGNDLTPSDTAQLNKEYVQGFVTNIGSHSA ------------------------------------------------------------ IMSRSLEIPAVGTKSITEEVEAGDIVVDDVLPSDEVIAEYQEKRENFFKDKQELQKLRDA --------------3333-----------------------3333-----------1111 ESVTADGHHVELAANIGTPNDLPGVIENGAEGIGLYRTEFLYMGRDQMPTEEEQFEAYKA 
---1111----------1111----1111--------3333------------------- VLEAMKGKRVVVRTLDIGGDKELPYLDLPEEMNPFLGYRAIRLCLDQPEIFRPQLRALLR -----------------1111-----------3333--!!!!-11111111--------- ASVFGKLNIMFPMVATIQEFRDAKALLEEERANLKNEGYEVADDIELGIMVEIPSTAALA 1111-----------3333---------------1111--------------3333--33 DIFAKEVDFFSIGTNDLIQYTMAADRMSERVSYLYQPYNPAILRLVKQVIEASHAEGKWT 33----------------------1111--3333-1111--------------1111--- GMCGEMAGDQTAIPLLLGLGLDEFSMSATSILKARRLIRSLNESEMKELSERAVQCATSE ---3333-------------------1111-----------3333--------------- EVVDLVEEYTK ----------- >Protease [Fragment]; SWP:Q8Q4A6; PDB:2HRPH; DVQLVESGGGLVQPGGSRKLSCAASGFTFMRFGMHWVRQAPEKGLEWVAYISSGSSTIYY ------------2222-----------3333--------------------1111----- ADTVKGRFTISRDNPKNTLFLQMTSLRSEDTALYYCARSGGIERYDGTYYVMDYWGQGTS 1111--------3333----------1111------------------------------ VTVSSAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPA ---------------------------------------------%%%%----------- VLQSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRD ---------------3333-----------3333------------ >Protease [Fragment]; SWP:Q8Q4A6; PDB:2HRPL; DTVLTQSPASLAVSLGQRATISCRASESVDYYGKSFMNWFQQKPGQPPKLLIYAASNQGS -------------2222-------------iiii--------2222-------------- GVPARFSGSGSGTDFSLHIHPMEEDDSAMYFCQQSKEVPWTFGGGTKLEIKRADAAPTVS --3333----------------1111---------------------------------- IFPPSSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMS ---------------------------------iiii----------------------- STLTLTKDEYERHNSYTCEATHKTSTSPIVKSFNR -----3333------------1111---------- >2A CYSTEINE PROTEINASE; SWP:P04936; PDB:2HRVA; GPSDMYVHVGNLIYRNLHLFNSEMHESILVSYSSDLIIYRTNTVGDDYIPSCDCTQATYY 1111----!!!!---3333---1111----3333-------------------------- CKHKNRYFPITVTSHDWYEIQESEYYPKHIQYNLLIGEGPCEPGDCGGKLLCKHGVIGIV -1111------------------------------------2222------3333----- TAGGDNHVAFIDLRHFHCA -----------3333---- >NUCLEOSIDE-DIPHOSPHATE-SU; SWP:Q8UBW2; PDB:2HRZA; 
GRENLYFQGHIAIIGAAGVGRKLTQRLVKDGSLGGKPVEKFTLIDVFQPEAPAGFSGAVD --------------1111--------------1111------------------------ ARAADLSAPGEAEKLVEARPDVIFHLAAIVSGEAELDFDKGYRINLDGTRYLFDAIRIAN ----1111-------1111----------------------------------------- GKDGYKPRVVFTSSIAVFGAPLPYPIPDEFHTTPLTSYGTQKAICELLLSDYSRRGFFDG --------------------------1111------------------------------ IGIRLPTICIRPGKPNAAASGFFSNILREPLVGQEAVLPVPESIRHWHASPRSAVGFLIH -----------------11113333---------------3333---------------- GAIDVEKVGPRRNLSPGLSATVGEQIEALRKVAGEKAVALIRREPNEIRCEGWAPGFEAK ---3333---------------------------3333-----------1111------- RARELGFTAESSFEEIIQVHIEDELGGSLK --1111------------------iiii-- >HIV-1 PROTEASE V32I MUTAN; SWP:P03366; PDB:2HS1A; PQITLWKRPLVTIKIGGQLKEALLDTGADDTIIEEMSLPGRWKPKMIGGIGGFIKVRQYD --------------!!!!------1111--------------------1111-------- QIIIEIAGHKAIGTVLVGPTPVNIIGRNLLTQIGATLNF -----iiii----------------3333-1111----- >PUTATIVE TRANSCRIPTIONAL ; SWP:Q0SB06; PDB:2HS5A; TSRTTRVAGILRDAIIDGTFRPGARLSEPDICAALDVSRNTVREAFQILIEDRLVAHELN --------------------2222------------------------------------ RGVFVRVPTAEDITELYICRRVVECAGVNGFDPATGDLSRVAEALDLADERYAVEDWTGV ---------------------------------------------------1111----- GTADIHFHSALASLNNSNRIDELRSVWNEARLVFHVDDAHRFHGPYLTRNHEIYDALAAG --------------------------------------------------------1111 NTEAAGQLLKTYLEDAEAQILGAYR ------------------------- >12-OXOPHYTODIENOATE REDUC; SWP:NA; PDB:2HSAA; NPLFSPYKMGKFNLSHRVVLAPMTRCRALNNIPQAALGEYYEQRATAGGFLITEGTMISP -1111---!!!!----------------%%%%-3333--------2222---------11 TSAGFPHVPGIFTKEQVREWKKIVDVVHAKGAVIFCQLWHVGRASHEVYQPAGAAPISST 11--------------------------------------!!!!-33332222------- EKPISNRWRILMPDGTHGIYPKPRAIGTYEISQVVEDYRRSALNAIEAGFDGIEIHGAHG -----------1111------------------------------------------iii YLIDQFLKDGINDRTDEYGGSLANRCKFITQVVQAVVSAIGADRVGVRVSPAIDHLDAMD i-3333---------1111-3333----------------3333-----1111-%%%%-- SNPLSLGLAVVERLNKIQLHSGSKLAYLHVTQPRYVAYGQTEAGRLGSEEEEARLMRTLR 
-------------------------------------1111-1111-------------- NAYQGTFICSGGYTRELGIEAVAQGDADLVSYGRLFISNPDLVMRIKLNAPLNKYNRKTF --------------------------------3333-------------------3333- YTQDPVVGYTDYPFLQ -----2222------- >HYPOTHETICAL UPF0332 PROT; SWP:O29944; PDB:2HSBA; GDELELRIRKAEKLVQDAKKEFEGLYERCCSTAYYAFHAAKALLGYGRDSKTHRGTIYLI ------------------------------------------------------------ WECREELGLSDDDCSKLSRAFDLREESDYGIYKEVSKDLAIKILKDAEIFVQKAKNAVNK ----1111-----------------------------------------------1111- NR -- >PUTATIVE PEPTIDASE M23; SWP:Q9HXK8; PDB:2HSIA; SFIMRLLNKPVPGGVAVVDLGEEGPPPRAFYQGKPVLVVREEGRRWIAVVGIPLSTKPGP ----------2222----------------iiii------iiii--------1111---- QKLEVRAATGNHEERFSVGSKPEDLKRIERELAEQTAAYRRFSPGLPSNLMLDKPVDGPL ------1111-------------3333----------1111------------------- SSPFPHSGLDFAVPAGTPIKAPAAGKVILIGDYFFNGKTVFVDHGQGFISMFCHLSKIDV -------------2222---------------------------iiii------------ KLGQQVPRGGVLGKVGATGRATGPHMHWNVSLNDARVDPAIFIGAFQ 2222--2222---------------------%%%%--3333------ >PUTATIVE PLATELET ACTIVAT; SWP:Q97PY9; PDB:2HSJA; AVQLLENWLLKEQEKIQTKYRHLNHISVVEPNILFIGDSIVEYYPLQELFGTSKTIVNRG -------------------------------------3333---3333------------ IRGYQTGLLLENLDAHLYGGAVDKIFLLIGTNDIGKDVPVNEALNNLEAIIQSVARDYPL 2222---------1111----------------1111--------------------111 TEIKLLSILPVNEREEYQQAVYIRSNEKIQNWNQAYQELASAYQVEFVPVFDCLTDQAGQ 1------------3333---!!!!-------------------------3333--1111- LKKEYTTDGLHLSIAGYQALSKSLKDYLY -1111------------------3333-- >METHIONYL-TRNA SYNTHETASE; SWP:P00958; PDB:2HSNA; MSFLISFDKSKKHPAHLQLANNLKIALALEYASKNLKPEVDNDNAAMELRNTKEPFLLFD ------------------------------------------------------------ ANAILRYVMDDFEGQTSDKYQFALASLQNLLYHKELPQQHVEVLTNKAIENYLVELKEPL -------------1111----------3333----------------------------- TTTDLILFANVYALNSSLVHSKFPELPSKVHNAVALAKKH ---------------------------------------- >NOVEL PREDICTED PHOSPHATA; SWP:Q0I1W8; PDB:2HSZA; GTQFKLIGFDLDGTLVNSLPDLALSINSALKDVNLPQASENLVTWIGNGADVLSQRAVDW 
----------2222----------------1111----3333------------------ ACKQAEKELTEDEFKYFKRQFGFYYGENLCNISRLYPNVKETLEALKAQGYILAVVTNKP -----------------------------------2222-------1111---------3 TKHVQPILTAFGIDHLFSELGGQSLPEIKPHPAPFYYLCGKFGLYPKQILFVGDSQNDIF 333-----11113333----1111-------3333---------3333------------ AAHSAGCAVVGLTYGYNYNIPIAQSKPDWIFDDFADILKITQ ----------------%%%%3333--------3333-1111- >GLUTAREDOXIN-2; SWP:Q9NS18; PDB:2HT9A; LATAPVNQIQETISDNCVVIFSKTSCSYCTMAKKLFHDMNVNYKVVELDLLEYGNQFQDA 11113333--------------1111--------------------33331111------ LYKMTGERTVPRIFVNGTFIGGATDTHRLHKEGKLLPLVHQCYL --------------iiii-----------1111-----3333-- >PUTATIVE ENZYME RELATED T; SWP:Q8ZPV9_SALTY; PDB:2HTAA; HKIFALPVIEQLTPVLSRRQLDDLDLIVVDHPQVKASFALQGAHLLSWKPVGEEEVLWLS --1111------1111----!!!!------1111---------------2222------1 NNTPFKTGVALRGGVPICWPWFGPAAQQGLPSHGFARNLPWALKAHNEDDNGVMLTFELQ 111--2222-----------------2222----1111----------1111-------- SSEATRKYWPHDFTLLARFKVGKTCEIELEAHGEFATTSALHSYFNVGDIANVKVSGLGD -3333-------------------------------------------3333-------- RFIDKVNDAKEGVLTDGIQTFPDRTDRVYLNPEACSVIHDATLNRTIDVVHHHHLNVVGW ------%%%%------------------------------1111---------------- NPGPALSVSMGDMPDDGYKTFVCVETVYATAPQQATEEKPSRLAQTICVAKR ---------11111111----------------------------------- >Predicted flavin-nucleoti; SWP:Q1GBW8; PDB:2HTDA; GKKLNTNKLTEEQVNLFKNNLVYLATVDADGNPQVGPKGSTVLDPSHLQYLEKTKGEAYE ---------------------------1111------------1111------------- NIKRGSKVALVAADVPSHTAVRVLATAEVHEDDDYAKKVLAKTEFPNAFVVNLNIEEVFA -1111---------1111---------------------1111-1111------------ >Vacuolar protein-sorting-; SWP:Q86VN1; PDB:2HTHB; RFVWTSGLLEINETLVIQQRGVRIYDGEEKIKFDAGTLLLSTHRLIWRDQKNHECCMAIL -------------------------!!!!-----------1111----2222-------3 LSQIVFIEEQAAGIGKSAKIVVHLHPAPPNKEPGPFQSSKNSYIKLSFKEHGQIEFYRRL 333--------------------------------------------------------- SEEMTQRRW --------- >BH0577 PROTEIN; SWP:Q9KFA8; PDB:2HTIA; 
ECKDEKKITEFLNKARTGFLGLSTNDQPYVIPLNFVWHNHAIYFHGASEGRKIKIEANPE ---3333------------------------------------------3333-3333-- VCFTICEDLAYSVIIFGTIEPVSAIEEGTEAQQLDKYVPSLGSRTAIYKISCRERTAKVN ---------------------------------3333----------------------- EP -- >P FIMBRIAL REGULATORY PRO; SWP:Q47193; PDB:2HTJA; MKNEILEFLNRHNGGKTAEIAEALAVTDYQARYYLLLLEKAGMVQRSPLRRGMATYWFLK 3333---------------------------------3333------------------- GEKQAGQSCSSTTLEHHHHHH ------2222----------- >THIAZOLE BIOSYNTHESIS PRO; SWP:Q5SKG7; PDB:2HTMA; MDTWKVGPVELKSRLILGSGKYEDFGVMREAIAAAKAEVVTVSVRRVEGLLEALEGVRLL -----!!!!---------------------------------------3333-------- PNTAGARTAEEAVRLARLGRLLTGERWVKLEVIPDPTYLLPDPLETLKAAERLIEEDFLV --2222-3333------------------------------------------1111--- LPYMGPDLVLAKRLAALGTATVMPLAAPIGSGWGVRTRALLELFAREKASLPPVVVDAGL --------------3333---------2222---11113333----1111---------- GLPSHAAEVMELGLDAVLVNTAIAEAQDPPAMAEAFRLAVEAGRKAYLAGPMRP -3333----3333------3333-----------------------1111---- >BACTERIOFERRITIN; SWP:P0ABD3; PDB:2HTNA; MKGDTKVINYLNKLLGNELVAINQYFLHARMFKNWGLKRLNDVEYHESIDEMKHADRYIE --------------------------------1111------------------------ RILFLEGLPNLQDLGKLNIGEDVEEMLRSDLALELDGAKNLREAIGYADSVHDYVSRDMM ------------------------------------------------------------ IEILRDEEGHIDWLETELDLIQKMGLQNYLQAQIREEG ------------------------------1111---- >NEURAMINIDASE; SWP:Q07599; PDB:2HTUA; TYMNNTEAICDAKGFAPFSKDNGIRIGSRGHIFVIREPFVSCSPIECRTFFLTQGSLLND ---------------------3333-----------------1111-------------1 KHSNGTVKDRSPFRTLMSVEVGQSPNV 111----------------2222---- >NEURAMINIDASE; SWP:Q6XV46; PDB:2HTVA; VIHYSSGKDLCPVKGWAPLSKDNGIRIGSRGEVFVIREPFISCSINECRTFFLTQGALLN ----------------------3333-----------------1111----------222 DKHSNGTVKDRSPFRTLMSCPIGVAPSPSNSRFESVAWSATACSDGPGWLTIGITGPDAT 2----------1111-----2222--3333--------------------------3333 AVAVLKYNGIITDTLKSWKGNIMRTQESECVCQDEFCYTLITDGPSDAQAFYKILKIKKG ------iiii----------------------iiii---------------------iii 
KIVSVKDVDAPGFHFEECSCYPSGENVECVCRDNWRGSNRPWIRFNSDLDYQIGYVCSGV i--------2222----------------------------------------------- FGDNPRPMDSTGSCNSPINNGKGRYGVKGFSFRYGDGVWIGRTKSLESRSGFEMVWDANG ------------------iiii-----------!!!!-------------------1111 WVSTDKDSNGVQDIIDNDNWSGYSGSFSIRGETTGRNCTVPCFWVEMIRGQPKEKTIWTS ---------------1111----------3333--------------------------- GSSIAFCGVNSDTTGWSWPDGALLPFDI ---------------------------- >NEURAMINDASE; SWP:Q6DPL2; PDB:2HTYA; VKLAGNSSLCPINGWAVYSKDNSIRIGSKGDVFVIREPFISCSHLECRTFFLTQGALLND ---------------------3333-------------------------------2222 KHSNGTVKDRSPHRTLMSCPVGEAPSPYNSRFESVAWSASACHDGTSWLTIGISGPDNGA -------------------2222--1111--------------------------1111- VAVLKYNGIITDTIKSWRNNILRTQESECACVNGSCFTVMTDGPSNGQASYKIFKMEKGK -----iiii----------------------iiii---------------------%%%% VVKSVELDAPNYHYEECSCYPNAGEITCVCRDNWHGSNRPWVSFNQNLEYQIGYICSGVF --------2222---------iiii----------------------------------- GDNPRPNDGTGSCGPVSSNGAYGVKGFSFKYGNGVWIGRTKSTNSRSGFEMIWDPNGWTE ----------------2222----------!!!!-------------------------- TDSSFSVKQDIVAITDWSGYSGSFVQHPELTGLDCIRPCFWVELIRGRPKESTIWTSGSS ------------3333----------3333------------------------------ ISFCGVNSDTVGWSWPDGAELPFTI ------------------------- >Histone H3.2; SWP:P84233; PDB:2HUEB; ALIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVALFEDTNLCAIHAKRVTI --------------3333----------------------------------1111---- MPKDIQLARRIRGER 3333----------- >Histone H4; SWP:P62799; PDB:2HUEC; KVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGVLKVFLENVIRDAVTYTEHAKRK -3333--------------1111-------------------------------1111-- TVTAMDVVYALKRQGRTLYGFG -----------3333------- >ALANINE GLYOXYLATE AMINOT; SWP:Q3LSM4; PDB:2HUFA; MEYKVTPPAVLREPLVTPNKLLMGPGPSNAPQRVLDAMSRPILGHLHPETLKIMDDIKEG -------3333------------------------1111----1111------------- VRYLFQTNNIATFCLSASGHGGMEATLCNLLEDGDVILIGHTGHWGDRSADMATRYGADV -----------------3333----------2222-------3333-------------- RVVKSKVGQSLSLDEIRDALLIHKPSVLFLTQGDSSTGVLQGLEGVGALCHQHNCLLIVD 
-----2222---------------------------------2222----1111------ TVASLGGAPMFMDRWEIDAMYTGSQVLGAPPGITPVSFSHRAVERYKRRNTKVKVYYWDM ---2222---3333-------------------------------1111-----3333-- SLVGDYWGCFGRPRIYHHTISSTLLYGLREAIAMACEEGLPALIARHEDCAKRLYRGLQD ----1111--------------------------------------------------11 AGFELYADPKDRLSTVTTIKVPQGVDWLKAAQYAMKTYLVEISGGLGPTAGQVFRIGLMG 11-----1111-1111-----2222--------------------!!!!---------!! QNATTERVDRVLQVFQEAVAAVKP !!-----------------1111- >PUTATIVE DNA MISMATCH REP; SWP:Q8A5Q9; PDB:2HUHA; RQPEVRGGDTLNVFLAYVPEDAKATTPFEAYLVNDSNYYLYYTYLSAEGKAWNNRSHGLV ----2222------------1111-----------------------!!!!--------- EPNTKLLLEEFTKDVLNEERVAVQLIAFKDGKPAAIKPAVSVELRIDTVKFYKLHTFSAS -------------3333-----------------------------3333--3333---- DFFEEPALIYDIVKDDVPAKQVYV -------------iiii-2222-- >LIN2004 PROTEIN; SWP:Q92AB8_LISIN; PDB:2HUJA; LIRTEQLLLQNEKNWELYLSNREEEKPFDFYKDKPFVDEAKRCADDFLELAIPWVNTERP ----------------------------3333---------------------------- PYLGELQLRQACDNVQTAVSAFNGRSFYKHFLDHYQSTKYTLTRVRDFLKRKEES ----------------------1111----------------------------- >336aa long hypothetical d; SWP:O58151; PDB:2HUNA; SMKLLVTGGMGFIGSNFIRYILEKHPDWEVINIDKLGYGSNPANLKDLEDDPRYTFVKGD -------1111-------------1111--------2222----1111--1111-----1 VADYELVKELVRKVDGVVHLAAESHVDRSISSPEIFLHSNVIGTYTLLESIRRENPEVRF 1113333---1111-----------3333--3333-------------------1111-- VHVSTDEVYGDILKGSFTENDRLMPSSPYSATKAASDMLVLGWTRTYNLNASITRCTNNY -----3333--------1111--------------------------------------- GPYQFPEKLIPKTIIRASLGLKIPIYGTVRDWLYVEDHVRAIELVLLKGESREIYNISAG 22221111--------1111-------------3333----------------------- EEKTNLEVVKIILRLMGKGEELIELVEDRPGHDLRYSLDSWKITRDLKWRPKYTFDEGIK -------------1111--3333-----------------3333---------------- KTIDWYLKNEWWWKPLVDERILHPTPWKL --------33333333--3333--1111- >INOSITOL OXYGENASE; SWP:Q9QXN5; PDB:2HUOA; FRNYTSGPLLDRVFTTYKLMHTHQTVDFVSRKRIQYGSFSYKKMTIMEAVGMLDDLVDES -----------------------------------1111------------------111 
DPDVDFPNSFHAFQTAEGIRKAHPDKDWFHLVGLLHDLGKIMALWGEPQWAVVGDTFPVG 1---------------------3333--------1111-3333----3333--------- CRPQASVVFCDSTFQDNPDLQDPRYSTELGMYQPHCGLENVLMSWGHDEYLYQMMKFNKF ---1111-3333-111111113333---!!!!----3333---------------1111- SLPSEAFYMIRFHSFYPWHTGGDYRQLCSQQDLDMLPWVQEFNKFDLYTKCPDLPDVESL --3333------------------1111------------------1111-----3333- RPYYQGLIDKYCPGTLSW ------------------ >RAS-RELATED PROTEIN RAB-4; SWP:Q86YS6; PDB:2HUPA; YDFLFKLVLVGDASVGKTCVVQRFKTGAFDFTMKTLEIQGKRVKLQIWDTAGQERFRTIT -----------2222----------------------iiii--------22221111--- QSYYRSANGAILAYDITKRSSFLSVPHWIEDVRKYAGSNIVQLLIGNKSDLSELREVSLA -3333---------1111------------------1111-------33331111----- EAQSLAEHYDILCAIETSAKDSSNVEEAFLRVATELIMRHGG ------1111--------1111---------------1111- >HYPOTHETICAL PROTEIN; SWP:Q836T6; PDB:2HV2A; TTKRVKKGKEEKEFDLVIYAFNQEPTAERQERFEKLLSHTQSYGFLIDEQLTSQVATPFQ -------3333-------1111---3333-------1111------%%%%---------- VNFHGVRYPAGIGYVASYPEYRGEGGISAIKELADLAKQKVALSYLAPFSYPFYRQYGYE -----------------3333---3333--------1111---------33333333--- QTFEQAEYTIKTEDWPRVKRVPGTIKRVSWADGKEVIKDVYLENQRAHSGGVIRETWWLD ----------3333--------------3333-----------3333------------- YTLNRASKPNNQAIYYSSEGKAEGYVIYRIAAGTFEIVEWNYLTNTAFKALAGFIGSHSG ----3333--------1111----------iiii-----------------------111 SVQSFHWINGFAGKDLNDLPTPAASVKILPYARIVELQTFLEKYPFQSGEKETYSLEIED 1--------------1111----------------------------------------1 SYGPWNEGIWTITIDEQGKATVTKGAATAALKADIQTWTQLFLGYRSAETLSFYERLQGD 1111111---------------------------------1111---------------- ATIAQRLGQRLVKGPILEDYF -------1111---------- >SUPEROXIDE REDUCTASE; SWP:O58810; PDB:2HVBA; MLKETIRSGDWEKHVPVIEYEREGDLVKVEVSVGKEIPHPNTPEHHIAWIELYFHPEGGQ 3333------------------!!!!---------------3333--------------- FPILVGRVEFTNHSDPLTEPRAVFFFKTSKKGKLYALSYCNIHGLWENEVQLE ----------------------------------------------------- >ADENYLOSUCCINATE LYASE; SWP:Q8WSJ9; PDB:2HVGA; HLKNISPIDGRYKKACGELSAFFSEHALIKHRIIVEVRWLLFLNEEELFFEKVTDHSVEV 
1111-33331111----3333--------------------------------3333--- LNQIATNITDSDIARVKAIEHDVKAVEYFVKEKLKNSKREDLLKIKEYVHYLCTSEDINN 3333------------3333-3333--------1111-------1111-22223333--- VAYATCLKACLNDVVIPCLEKILKLKDLAVEYSHVPLLSRTHGQPASSTTFGKEANFYAR -------------------------------1111-----iiii-----3333------- IHHHVGVIRRVKVCAKFNGAVGNFNAHKVASKDTDWVNTIGLFLKKHFNLTYSIYCTQIQ -------1111-------------------1111-------------------------- DHDYICELCDGLARANGTLIDLCVDIWLYISNNLLKLKSSTPHKVNPIDFENAEGNLHIA -----------------------------1111--------------------------- NAFFKLFSSKLPTSRLQRDLSDSTVLRNIGSSLAYCLIAYKSVLKGLNKIDIDRRNLEEE ---------3333-!!!!----------------------------1111---------- LNQNWSTLAEPIQIVKRHNYVDAYEELKQFTRGKVIDQKIQEFIKTKCAFLPQDVVDQLL ------------------------------2222--3333-------3333-------11 ELTPATYTGYADYLAKNVERLS 113333-!!!!------3333- >HEVAMINE; SWP:P23472; PDB:2HVM; GGIAIYWGQNGNEGTLTQTCSTRKYSYVNIAFLNKFGNGQTPQINLAGHCNPAAGGCTIV ---------1111-----------------------iiii-----!!!!---%%%%---- SNGIRSCQIQGIKVMLSLGGGIGSYTLASQADAKNVADYLWNNFLGGKSSSRPLGDAVLD -------1111----------------------------------------1111----- GIDFDIEHGSTLYWDDLARYLSAYSKQGKKVYLTAAPQCPFPDRYLGTALNTGLFDYVWV ------------------------1111--------------3333-3333--------- QFYNNPPCQYSSGNINNIINSWNRWTTSINAGKIFLGLPAAPEAAGSGYVPPDVLISRIL ----3333--2222--------------------------3333--------------33 PEIKKSPKYGGVMLWSKFYDDKNGYSSSILDSV 333333------------------33331111- >SPLICING FACTOR, ARGININE; SWP:Q16629; PDB:2HVZA; MKVYVGNLGTGAGKGELERAFSYYGPLRTVWIARNPPGFAFVEFEDPRDAEDAVRGLDGK ------------------------------------------------------------ VICGSRVRVELSTGMPRRSRFDRPPARRKLLEVLFNGPLEH -%%%%------------------------------------ >ENOYL-COA HYDRATASE; SWP:P30084; PDB:2HW5A; ANFEYIIAEKRGKNNTVGLIQLNRPKALNALCDGLIDELNQALKIFEEDPAVGAIVLTGG -----------2222--------1111---------------------3333-------1 DKAFAAGADIKEMQNLSFQDCYSSKFLKHWDHLTQVKKPVIAAVNGYAFGGGCELAMMCD 111-----33331111-----1111-33333333---------------------3333- 
IIYAGEKAQFAQPEILIGTIPGAGGTQRLTRAVGKSLAMEMVLTGDRISAQDAKQAGLVS ----1111----3333-------1111--------------------------------- KICPVETLVEEAIQCAEKIASNSKIVVAMAKESVNAAFEMTLTEGSKLEKKLFYSTFATD ---3333-----------1111-----------3333----------------3333--- DRKEGMTAFVEKRKANFKDQ --------1111-------- >MAP kinase-interacting se; SWP:Q9BUB5; PDB:2HW6A; SLPGKFEDMYKLTSELLGEGAYAKVQGAVSLQNGKEYAVKIIEKQAGHSRSRVFREVETL ----3333----------------------------------1111------------33 YQCQGNKNILELIEFFEDDTRFYLVFEKLQGGSILAHIQKQKHFNEREASRVVRDVAAAL 33------------------------------3333--------3333------------ DFLHTKGIAHRDLKPENILCESPEKVSPVKICDFDLGSAPEVVEVFTDQATFYDKRCDLW -------------3333----3333----------------------------------- SLGVVLYIMLSGYPPFGKYEFPDKDWAHISSEAKDLISKLLVRDAKQRLSAAQVLQHPWV ---------------------3333------------------3333----------333 QG 3- >HYPOTHETICAL PROTEIN ATU1; SWP:Q8UF59; PDB:2HWJA; IYEPRLSRIAIDKLRPTQIAVGFREVELKRKEWRETRDFLGNHIVPVVAGPKDRAYLIDH ---------1111----------------------------------------------- HHLVLALSKEGVEHVLTSEVAKFSHLGKDEFWSVDHRNLIYPFDAQGLRRQSGDIPKNIH ----------------------1111--------1111-----1111---3333---333 DLEDDPFRSLAGALRAGGYAKVIIPFSEFGWADFLRRRIDRDLLSDSFDDALAEAKLAKS 3------------------------3333----1111--3333-------------1111 REARHLPGWCGVE 1111-2222---- >HELICASE NSP2; SWP:P27282; PDB:2HWKA; DVFQNKANVCWAKALVPVLKTAGIDMTTEQWNTVDYFETDKAHSAEIVLNQLCVRFFGLD 1111---------------1111---3333----3333-----3333------------3 LDSGLFSAPTVPLSIRNNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNT 3331111-----------------------------------1111-------------- GTLRNYDPRINLVPVNRRLPHALVLHHNEHPQSDFSSFVSKLKGRTVLVVGEKLSVPGKM -------------1111---------------------1111-------------2222- VDWLSDRPEATFRARLDLGIPGDVPKYDIIFVNVRTPYKYHHYQQCEDHAIKLSMLTKKA ------1111----3333--1111---------------------------------333 CLHLNPGGTCVSIGYGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRK 3---2222-------------------------------------1111----------- ARTHNPYKLSSTLTNIYTGS -------------------- >DNA-BINDING RESPONSE REGU; 
SWP:Q836C2; PDB:2HWVA; SHMTIGDLTIHPDAYMVSKRGEKIELTHREFELLYYLAKHIGQVMTREHLLQTVWGYDYF ----!!!!----------iiii-----------------2222------------1111- GDVRTVDVTVRRLREKIEDSPSHPTYLVTRRGVGYYLRNPE -------------------1111------2222-------- >TELOMERASE-BINDING PROTEI; SWP:Q86US8; PDB:2HWWA; MELEIRPLFLVPDTNGFIDHLASLARLLESRKYILVVPLIVINELDGLAKGAGGYARVVQ -------------3333--------------------3333------------------- EKARKSIEFLEQRFESRDSCLRALTSRGNELESIAFRSENNDDLILSCCLHYCKDKAKDF -------------11111111---1111--------------------3333---3333- MIRLLREVVLLTDDRNLRVKALTRNVPVRDIPAFLTWAQV ---------------------1111--------------- >PROTEIN SMG5; SWP:Q9UPR3; PDB:2HWYA; PYLVPDTQALCHHLPVIRQLATSGRFIVIIPRTVIDGLDLLKEHPGARDGIRYLEAEFKK ---------------------------------------------------------111 GNRYIRCQLYKILDSCKQLTLAQLPLDNPSVLSGALQAAAHASVDIKNVLDFYKQW 1------------------1111-----------3333--1111------------ >PUTATIVE DNA-BINDING PROT; SWP:Q57K43; PDB:2HX0A; HHNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEATTS --------------2222---------------------------------2222----- LTGTFEVISLNGTLELTGEHLHLAVSDPYGVLGGHPGCTVRTTLELVIGELPALTFSRQP --------------------------1111--------------------1111------ CAISGYDELHISSRL --------------- >Predicted sugar phosphata; SWP:Q11S56; PDB:2HX1A; GQIESFKSLLPKYKCIFFDAFGVLKTYNGLLPGIENTFDYLKAQGQDYYIVTNDASRSPE ----33333333-------2222-------2222-------------------------- QLADSYHKLGLFSITADKIISSGITKEYIDLKVDGGIVAYLGTANSANYLVSDGIKLPVS ------111133333333------------------------33333333-2222--333 AIDDSNIGEVNALVLLDDEGFNWFHDLNKTVNLLRKRTIPAIVANTDNTYPLTKTDVAIA 3-33331111-----------3333----------------------------------- IGGVATIESILGRRFIRFGKPDSQFFAYDLRQKEISKREILVGDTLHTDILGGNKFGLDT -----------------------------1111--1111--------------1111--- ALVLTGNTRIDDAETKIKSTGIVPTHICESAVIEL --------3333----------------------- >HYPOTHETICAL PROTEIN; SWP:Q7V4A7; PDB:2HX5A; NPENWLLLRRVVRFGDTDAAGVHFHQLFRWCHESWEESLESYGLNPADIFPGSRKSEVTP 3333--------3333-1111-3333------------------3333------------ 
EVALPIIHCQADFRRPIHTGDALAELRPERLNPNSFQVHFEFRCEEQIAAHALIRHLAIN -----------------2222----------1111--------iiii------------- AQTRHRCALPEGIDRWLEASG -----------------1111 >RIBONUCLEASE; SWP:P13312; PDB:2HX6A; MTINTEVFIRRNKLRRHFESEFRQINNEIREASKAAGVSSFHLKYSQALLDRAIQREIDE ---------------------------------------------3333-------2222 TYVFELFHKIKDHVLEVNEFLSMPPRPDIDEDFIDGVEYRPGRLEITDGNLWLGFTVCKP -----------------------------------------------iiii--------- NEKFKDPSLQCRMAIINSRRLPGKASKAVIKTQ --------------------------------- >HYPOTHETICAL PROTEIN; SWP:Q9JY98; PDB:2HXJA; QETALGAALKSAVQTSKKKQTEIADHIYGKYDVFKRFKPLALGIDQDLIAALPQYDAALI ---------------------------------1111---2222-------1111----- ARVLANHCRRPRYLKALARGGKRFDLNNRFKGEVTPEEQAIAQNHPFVQQALQQ ------------------------1111-------------------------- >URACIL-DNA GLYCOSYLASE; SWP:P13051; PDB:2HXMA; MEFFGESWKKHLSGEFGKPYFIKLMGFVAEERKHYTVYPPPHQVFTWTQMCDIKDVKVVI -----------3333------------------------3333-3333---3333----- LGQDPYHGPNQAHGLCFSVQRPVPPPPSLENIYKELSTDIEDFVHPGHGDLSGWAKQGVL -------2222---2222---------------------1111-------33331111-- LLNAVLTVRAHQANSHKERGWEQFTDAVVSWLNQNSNGLVFLLWGSYAQKKGSAIDRKRH --------2222-1111--------------------------------1111--1111- HVLQTAHPSPLSVYRGFFGCRHFSKTNELLQKSGKKPIDWKEL --------3333----2222----------1111----1111- >PUTATIVE TETR-FAMILY TRAN; SWP:Q9K463; PDB:2HXOA; LSRERIVGAAVELLDTVGERGLTFRALAERLATGPGAIYWHITGKAELLGAATDAVVTAA -----------------3333------------11113333------------------- VTAGPTGAADSPQDAVRAVALGLWDATEAHPWLATQLATQLSRTPWGTVAPRIFESLGRQ ------1111---------------------------------1111------------- VQAGVPEAHWFTASSALHYILGAAGQNAANDEFLDTVSTAWEGLDPDAYPFTRAVADQVR -----3333-------------------------------11113333------333322 GHDDREQFLAGITLVLTGITALHRP 22-----------------1111-- >DUAL SPECIFICITY PROTEIN ; SWP:Q99956; PDB:2HXPA; SFPVQILPNLYLGSARDSANLESLAKLGIRYILNVTPNLPNFFEKNGDFHYKQIPISDHW ------2222------------------------------1111-1111-------3333 SQNLSRFFPEAIEFIDEALSQNCGVLVHSLAGVSRSVTVTVAYLMQKLHLSLNDAYDLVK 
11111111-------------------------------------1111----------- RKKSNISPNFNFMGQLLDFERSLR ------------------------ >HTH-TYPE TRANSCRIPTIONAL ; SWP:P27111; PDB:2HXRA; SLRIAVTPTFTSYFIGPLADFYARYPSITLQLQESQEKIEDLCRDELDVGIAFAPVHSPE ------3333--------------1111-------------1111------------111 LEAIPLLTESLALVVAQHHPLAVHEQVALSRLHDEKLVLLSAEFATREQIDHYCEKAGLH 1--------------1111-1111---33331111-----1111----------1111-- PQVVIEANSISAVLELIRRTSLSTLLPAAIATQHDGLKAISLAPPLLERTAVLLRRKNSW --------------------------33333333-------------------------- QTAAAKAFLHALDKCA ------------1111 >L-FUCONATE DEHYDRATASE; SWP:Q8P3K2; PDB:2HXTA; RTIIALETHDVRFPTSRELDGSDAMNPDPDYSAAYVVLRTDGAEDLAGYGLVFTIGRGND --------------33332222--------------------3333---------2222- VQTAAVAALAEHVVGLSVDKVIADLGAFARRLTNDSQLRWLGPEKGVMHMAIGAVINAAW ------------222233331111-------11113333--------------------- DLAARAANKPLWRFIAELTPEQLVDTIDFRYLSDALTRDEALAILRDAQPQRAARTATLI ----1111------1111-----1111-2222---------------3333--------- EQGYPAYTTSPGWLGYSDEKLVRLAKEAVADGFRTIKLKVGANVQDDIRRCRLARAAIGP --------33331111------------1111---------------------------- DIAMAVDANQRWDVGPAIDWMRQLAEFDIAWIEEPTSPDDVLGHAAIRQGITPVPVSTGE -------%%%%--------33333333---------1111------------------11 HTQNRVVFKQLLQAGAVDLIQIDAARVGGVNENLAILLLAAKFGVRVFPHAGGVGLCELV 11--------------------1111--------------------------2222---- QHLAMADFVAITGKMEDRAIEFVDHLHQHFLDPVRIQHGRYLAPEVPGFSAEMHPASIAE --------------1111-------3333-------%%%%-------------------- FSYPDGRFWVEDLA ----------3333 >Diaminohydroxyphosphoribo; SWP:Q9X2E8_THEMA; PDB:2HXVA; HTFKRAIELAKKGLGRVNPNPPVGAVVVKDGRIIAEGFHPYFGGPHAERAIESARKKGED 3---------1111--------------iiii--------2222-3333----------- LRGATLIVTLEPCDHHGKTPPCTDLIIESGIKTVVIGTRDPNPVSGNGVEKFRNHGIEVI 2222-------------------------------------3333--------------- EGVLEEEVKKLCEFFITYVTKKRPFVALKYASTLDGKIADHRGDSKWITDKLRFKVHERN --------------------------------1111---1111-2222-1111------- IYSAVLVGAGTVLKDNPQLTCRLKEGRNPVRVILDRKGVLSGKVFRVFEENARVIVFTES 
----------------------1111--------1111------3333-----------3 EEAEYPPHVEKALSDCSVESILRNLYERDIDSVLVEGGSKVFSEFLDHADVVFGFYSTKI 333--1111----------------1111---------------1111------------ FGKGLDVFSGYLSDVSVPPKFKVVNVEFSDSEFLVERPC -------1111--1111-----------!!!!------- >BARSTAR; SWP:P11540; PDB:2HXXA; KKAVINGEQIRSISDLHQTLKKELALAEYYGENLDALDALTGVEYPLVLERQFEQSKQLT -----3333------------1111-1111--3333------------------------ ENGAESVLQVFREAKAEGADITIILS -------------------------- >RV0805; SWP:O06629; PDB:2HY1A; PRPDYVLLHISDTHLIDADDRLGELLEQLNQSGLRPDAIVFTGDLADKGEPAAYRKLRGL -----------------------------3333--------------------------- VEPFAAQLGAELVWVMGNHDDRAELRKFLLDEAPSMAPLDRVCMIDGLRIIVLDTSVPGH ----------------1111------------------------!!!!--------2222 HHGEIRASQLGWLAEELATPAPDGTILALHHPPIPSVLDMAVTVELRDQAALGRVLRGTD --------------------1111-------------33331111--3333----2222- VRAILAGHLHYSTNATFVGIPVSVASATCGCNLVHVYPDTVVHSVIP ----------------iiii--------------------------- >PUTATIVE SULFURTRANSFERAS; SWP:O87896; PDB:2HY5A; MKFALQINEGPYQHQASDSAYQFAKAALEKGHEIFRVFFYHDGVNNSTRLTTPPQDDRHI ---------------------------1111--------!!!!----------1111--- VNRWAELAEQYELDMVVCVAAAQRRGIVDEGEASRNGKDATNIHPKFRISGLGQLVEAAI -------------------------------------------1111------------- QADRLVVFGD ---------- >Intracellular sulfur oxid; SWP:O87897; PDB:2HY5B; VKKFMYLNRKAPYGTIYAWEALEVVLIGAAFDQDVCVLFLDDGVYQLTRGQDTKGIGMKN --------------------------3333--------------1111----3333---- FSPTYRTLGDYEVRRIYVDRDSLEARGLTQDDLVEIAFEDMETEEEFDNIVEVIDSARVS 33331111---------------1111-3333--------1111---------------- ELMNESDAVFSF --1111------ >DsrH; SWP:O87898; PDB:2HY5C; SILHTVNKSPFERNSLESCLKFATEGASVLLFEDGIYAALAGTRVESQVTEALGKLKLYV --------1111-----------2222------------2222----------------- LGPDLKARGFSDERVIPGISVVDYAGFVDLTTECDTVQAWL -----1111-3333--------------------------- >PROTEIN MAGO NASHI HOMOLO; SWP:P61326; PDB:2HYIA; SDFYLRYYVGHKGKFGHEFLEFEFRPDGKLRYANNSNYKNDVMIRKEAYVHKSVMEELKR 
------------1111--------1111---------%%%%---------3333------ IIDDSEITKEDDALWPPPDRVGRQELEIVIGDEHISFTTSKIGSLIDVNQSKDPEGLRVF -----3333--1111---3333-------!!!!----------3333-----1111---- YYLVQDLKCLVFSLIGLHFKIKPI ------------------------ >Protein CASC3; SWP:O15234; PDB:2HYID; LDDDEDRKNPAYIPRKGLFFEHDLRGRWEHDKFREDEQAPKSRQELIALYGYDIRS -11111111-------3333---------11113333---------------3333 >PUTATIVE TETR-FAMILY TRAN; SWP:Q8CJS4; PDB:2HYJA; AEAQATRGRILGRAAEIASEEGLDGITIGRLAEELEMSKSGVHKHFGTKETLQISTLDKA ---------------------1111------------33333333--------------- FVDFWHRVVEPALAEPPGLRRLRAVCANSVGYLEEPLLPGGCLLTAALSEYDGRPGRVRD --------3333-------------------------1111---------1111------ AVAEVWSRWREQLRADLTAAVDKGELPAGFDVEQALFEIVAAGLALNAAMQLQHDRTAAD --------------------------1111------------------------3333-- RARRAIERALAQS -------1111-- >TETR-FAMILY TRANSCRIPTION; SWP:Q6D668; PDB:2HYTA; RTRAEEETRATLLATARKVFSERGYADTSDDLTAQASLTRGALYHHFGDKKGLLAAVVEQ -3333-------------------1111----------2222------------------ IDAEDERLQAISDTAEDDWEGFRCRCRAYLEALEPEIQRIVLRDARAVLGGASPDSQRHC ------------------------------------------------------------ VESQRLIDNLIRQGVVAEADPQALASLIYGSLAEAAFWIAEGEDGNARLAQGVAALELLL --------------------------------------3333-----------------3 RGLLVKPR 333----- >ANNEXIN A2; SWP:P07355; PDB:2HYVA; NFDAERDALNIETAIKTKGVDEVTIVNILTNRSNAQRQDIAFAYQRRTKKELASALKSAL ----------------2222-------1111----------------------------- SGHLETVILGLLKTPAQYDASELKASMKGLGTDEDSLIEIICSRTNQELQEINRVYKEMY ---------3333--------------!!!!-3333------------------------ KTDLEKDIISDTSGDFRKLMVALAKGRRAEDGSVIDYELIDQDARDLYDAGVKRKGTDVP --3333--1111--------------------------------------3333------ KWISIMTERSVPHLQKVFDRYKSYSPYDMLESIRKEVKGDLENAFLNLVQCIQNKPLYFA ------------------3333-------------------------------------- DRLYDSMKGKGTRDKVLIRIMVSRSEVDMLKIRSEFKRKYGKSLYYYIQQDTKGDYQKAL -------------3333------1111--------------------------------- LYLCGGDD -------- >SPLICING FACTOR U2AF 65 K; SWP:P26368; PDB:2HZCA; 
GPLGSARRLYVGNIPFGITEEAMMDFFNAQMRLGGLTQAPGNPVLAVQINQDKNFAFLEF --3333--------2222------------------------------------------ RSVDETTQAMAFDGIIFQGQSLKIRRP -------33332222-iiii------- >TRANSCRIPTIONAL ENHANCER ; SWP:P28347; PDB:2HZDA; MDNDAEGVWSPDIEQSFQEALSIYPPCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRT -3333---------------------------3333--------------1111------ RKQVSSHIQVLARRKSRDLVPR --------------1111---- >GLUTAREDOXIN-1; SWP:GLRX1_ECTVM; PDB:2HZFA; QMAEEFVQQRLANNKVTIFVKYTCPFCRNALDILNKFSFKRGAYEIVDIKEFKPENELRD ------3333-------------3333------3333--2222----------3333--- YFEQITGGKTVPRIFFGKTSIGGYSDLLEIDNMDALGDILSSIGVLRT ---------------!!!!---------------------1111---- >Mandelate racemase/mucona; SWP:Q3HKK5; PDB:2HZGA; SLKIDAVDLFYLSMPEVTDAADGSQDALLVRVAAGGHIGWGECEAAPLPSIAAFVCPKSH ---------------------1111--------iiii----------------------1 GVCRPVSDSVLGQRLDGPDDIARIAALVGYNSMDLLQAPHMLSGIEMALWDLLGRRLSAP 111-3333-2222---3333-----------1111-3333-------------------3 AWALLGYSASHGKRPYASLLFGDTPQETLERARAARRDGFAAVKFGWGPIGRGTVAADAD 333--------------------------------1111-------!!!!---------- QIMAAREGLGPDGDLMVDVGQIFGEDVEAAAARLPTLDAAGVLWLEEPFDAGALAAHAAL ------------------%%%%---------------1111--------1111------1 AGRGARVRIAGGEAAHNFHMAQHLMDYGRIGFIQIDCGRIGGLGPAKRVADAAQARGITY 111--------1111-3333---------------------------------------- VNHTFTSHLALSASLQPFAGLEADRICEYPAAPQQLALDITGDHIRPDAEGLIRAPEAPG --------------3333--3333---------3333----------1111-------!! 
LGLQVAASALRRYLVETEIRIGGQLIYRTPQLE !!------------------iiii--------- >PROTO-ONCOGENE TYROSINE-P; SWP:P00519; PDB:2HZIA; DKWEMERTDITMKHKLGGGQYGEVYEGVWKKYSLTVAVKTLKEDTMEVEEFLKEAAVMKE 1111-1111-------iiii---------1111------------------------111 IKHPNLVQLLGVCTREPPFYIITEFMTYGNLLDYLRECNRQEVNAVVLLYMATQISSAME 1-1111--------------------------------3333------------------ YLEKKNFIHRDLAARNCLVGENHLVKVADFGLSRLMTGDTYTAHAGAKFPIKWTAPESLA ------------3333----%%%%-----------1111----2222--3333-3333-- YNKFSIKSDVWAFGVLLWEIATYGMSPYPGIDLSQVYELLEKDYRMERPEGCPEKVYELM ----3333-----------1111----22221111----1111-----2222-------- RACWQWNPSDRPSFAEIHQAFETMFQES -1111-3333------------------ >RNA POLYMERASE II MEDIATO; SWP:P34162; PDB:2HZMA; GKSAVIFVERATPATLTELKDALSNSILSVRDPWSIDFRTYRCSIKNLSKLMYSITFHHH -----------3333-------3333---------------------------------- GRQTVLIKDNSAMVTTAAAADIPPALVFNGSSTGVPESIDTILSSKLSNIWMQRQLIKGD -------%%%%------1111-----1111----------------1111---------- AGETLILDGLTVRLVNLFSSTGFKGLLIELQADEAGEFETKIAGIEGHLAEIRAKEYKTS ------2222--------1111-----------3333------------1111------- SDSLGPDTSNEICDLAYQYVRALEL ------------------------- >RNA polymerase II mediato; SWP:P32585; PDB:2HZMB; VQQLSLFGSIGDDGYDLLISTLTTISGNPPLLYNSLCTVWKPNPSYDVEPNRIKLSKEVP ----------3333----------------------------3333-------------3 FSYLIDEKPLNFRILKSFESCSPWSLQISDIRSVSMQTIAETIILSSAGKNSSVSSLMNG 333-------3333---------------------------------------------- LGYVFEFQYLTIGVKFFMKHGLILELQKIWQIEEAGNSQITSGGFLLKAYINVSRGTDID ------------------iiii----------1111--1111------------------ RINYTETVLMNLKKELQGYIELSVPDRQSMDSRVAHGNILIAAALEH -------------1111--------------3333------------ >PUTATIVE HTH-TYPE TRANSCR; SWP:O34533; PDB:2HZTA; SLVEATLEVIGGKWKVILHLTHGKKRTSELKRLPNITQKLTQQLRELEADGVINRIVYNQ -----3333--2222----1111------------------------1111--------- VPPKVEYELSEYGRSLEGILDLAWGANHINR ----------3333---3333---------- >ACETYLTRANSFERASE, GNAT F; SWP:Q831Y9; PDB:2I00A; LTLKPVEEEHIDQFNELLSYVFQVTEADIEESGFENKRAFIKSKQPILELSKVFGWFHEN 
-------1111------------------------3333--------1111--------- QLISQIAIYPCEVNIHGALYKMGGVTGVGTYPEYANHGLMKDLIQTALEEMRQDKQWISY --------------iiii------------3333-------------------------- LFPYNIPYYRRKGWEIMSDKLSFKIRDTQLPKTVPVPGMIERLAVDHPDVFDVYARFARQ ---------1111------------3333--------------1111------------- NHGALIRSAFNWEEYWRFENEEERTAAVYYGANQEPLGVLFYWVADEVFHIKEMFYLNQE 2222---3333--1111--3333-------1111----------%%%%------------ ARNGLWNFITAHFSMVYWVKGDIYKNEPLAFLLEDSQIKESIEPYYMARIVDVKAFLENF -----------1111-------------3333------------------------1111 PFESTAKPFHFVVKDPVAEWNNGIFGLIWDENDQVTITDEPLGTAVHLDIQTLTCLVMNY -----------------1111--------1111--------------------------- RRPSYLHRIERIDTDKETLNSLERIFPDQEAYFSDYF ------1111--------------------------- >GENERAL STRESS PROTEIN OF; SWP:NA; PDB:2I02A; TDRTQEIQKLHELIKNIDYGFTTVDDDGSLHSYPSKSGDEATLWFFTYAGSHKVTEIEHH -------------1111-------1111-------------------1111--------- EQVNVSFSSPEQQRYVSISGTSQLVKDRNKRELWKPELQTWFPKGLDEPDIALLKVNINQ ---------1111-------------33331111----1111-!!!!1111--------- VNYWDSTSSFKPQTISF ----3333--------- >PEPTIDE E6; SWP:Q4L1J5; PDB:2I04A; SIHTKLRKSSRGFGFTVVGGDEPDEFLQIKSLVLDGPAALDGKMETGDVIVSVNDTCVLG --------1111---------1111-------------------2222------------ HTHAQVVKIFQSIPIGASVDLELCR ---------11112222-------- >PHOSPHOPENTOMUTASE; SWP:Q8DTU0; PDB:2I09A; STFNRIHLVVLDSVGIGAAPDANNFSNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNI ----------2222----1111----iiii-1111--------------------1111- PRDTPLKTVPAENHPTGYVTKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEIIS -----1111----------------------------------------1111-3333-- KIEKFSGRKVIREANKPYSGTGELIIYTSADPVLQIAAHEDVIPLDELYRICEYARSITL ----------3333------------------------3333-3333------------- ERPALLGRIIARPTANRHDYALSPFAPTVLNKLADAGVSTYAVGKINDIFNGSGITNDMG ---------------------------------1111--------------2222----- HNKSNSHGVDTLIKTMGLSAFTKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDERLPEI -----------------3333----------------1111------------------- IAAMKVDDLLLITADHGNDPTYAGTDHTREYVPLLAYSPSFTGNGVLPVGHYADISATIA 
1111--------------1111---------------1111---------3333------ DNFGVDTAMIGESFLDKLI -------------3333-- >PROTEIN KINASE C-BETA II; SWP:P05771; PDB:2I0EA; LTDFNFLMVLGKGSFGKVMLSERKGTDELYAVKILKKDVVIQDDDVECTMVEKRVLALPG 3333--------1111-------------------3333---------------1111-- KPPFLTQLHSCFQTMDRLYFVMEYVNGGDLMYHIQQVGRFKEPHAVFYAAEIAIGLFFLQ -1111--------1111-----------------------3333---------------- SKGIIYRDLKLDNVMLDSEGHIKIADFGMCKENIWDGVTTKFCGTPDYIAPEIIAYQPYG ---------3333---1111------1111----2222-------11113333------3 KSVDWWAFGVLLYEMLAGQAPFEGEDEDELFQSIMEHNVAYPKSMSKEAVAICKGLMTKH 333--------------------------------------3333--------------3 PGKRLGCGPEGERDIKEHAFFRYIDWEKLERKEIQPPYKPKACGRENFDRFFTRHPPVLP 33322221111---11111111------1111---------------------------- PDQEVIRNIDQSEFEGFFVNSEFLKP ---------1111------------- >PHOSPHATE TRANSPORT SYSTE; SWP:P0A3Y7; PDB:2I0MA; QFDLELHELEQSFLGLGQLVLETASKALLALASKDKEMAELIINKDHAINQGQSAIELTC ------------------------------1111-------------------------- ARLLALPQVSDLRFVISIMSSCSDLERMGDHMAGIAKAVLQLKENQLAEEQLHQMGKLSL 3333----3333------------------------3333-------------------- SMLADLLVAFPLHQASKAISIAQKDEQIDQYYYALSKEIIGLMKDQESIPNGTQYLYIIG --------3333--------------------------3333-----3333--------- HLERFADYIANICERLVYLETGELVDL --------------------------- >CLASS VII UNCONVENTIONAL ; SWP:NA; PDB:2I0NA; MGPLGSPEFAKYARALKDYNVSDTSLLPFKRNDIITITFKDQENKWFMGQLNGKEGSFPV ----------------------------------------------------------33 DHVEILLSDVPPPQPVHPVA 33------------------ >SER/THR PHOSPHATASE; SWP:Q7PP01_ANOGA; PDB:2I0OA; SLGAYLSEPLTTKDSSDESNEFLASGSSSMQGWRISQEDAHNCILNFDDQCSFFAVYDGH -!!!!--------------1111------------------------2222--------- GGAEVAQYCSLHLPTFLKTVEAYGRKEFEKALKEAFLGFDATLLQEKVIEELKVLSGDAE -------------3333---3333-----------------------------3333--- PGKDSGCTAVVALLHGKDLYVANAGDSRCVVCRNGKALEMSFDHKPEDTVEYQRIEKAGG --------------!!!!--------------iiii--------1111-------1111- RVTLDGRVNGGLNLSRAIGDHGYKMNKSLPAEEQMISALPDIEKITVGPEDEFMVLACDG 
--1111-iiii---------1111-33331111--------------1111------333 IWNFMTSEQVVQFVQERINKPGMKLSKICEELFDHCLNMTAIIVQFK 31111-----------1111---3333-------------------- >HYPOTHETICAL PROTEIN PF11; SWP:Q8U1T8; PDB:2I0XA; MYIVVVYDVGVERVNKVKKFLRMHLNWVQNSVFEGEVTLAEFERIKEGLKKIIDENSDSV --------------------1111----2222---------------3333--------- IIYKLRSMPPRETLGIEKNPIEEI -------------------3333- >CFMS TYROSINE KINASE; SWP:P07333; PDB:2I0YA; VRWKIIESYESYTFIEKWEFPRNNLQFGKTLGAGAFGKVVEATAFGLGKEDAVLKVAVKM ---------------1111-3333-----------------------1111--------- LKSTAHADEKEALMSELKIMSHLGQHENIVNLLGACTHGGPVLVITEYCCYGDLLNFLRR -1111--------------------1111-------------------1111-----111 KRQLSSRDLLHFSSQVAQGMAFLASKNCIHRDVAARNVLLTNGHVAKIGDFGLARDIMND 1--------------------------------3333---2222------3333-33333 SNYIVKGARLPVKWMAPESIFDCVYTVQSDVWSYGILLWEIFSLGLNPYPGILVNSKFYK 333-------3333-3333------3333-------------------2222-------- LVKDGYQMAQPAFAPKNIYSIMQACWALEPTHRPTFQQICSFLQEQAQE ----------1111---------1111-1111----------------- >NAD(FAD)-UTILIZING DEHYDR; SWP:Q816V9; PDB:2I0ZA; MHYDVIVIGGGPSGLMAAIGAAEEGANVLLLDKGNKLGRKLAISGGGRCNVTNRLPLDEI ---------------------1111-------------3333-%%%%------------- VKHIPGNGRFLYSAFSIFNNEDIITFFENLGVKLKEEDHGRMFPVSNKAQSVVDALLTRL ------3333--3333-----------1111------iiii--3333------------- KDLGVKIRTNTPVETIEYENGQTKAVILQTGEVLETNHVVIAVGGKSVPQTGSTGDGYAW ------------------%%%%-----3333----------------3333---3333-- AEKAGHTITELFPTEVPILSNEPFIRDRSLQGLALRDINLSVLNAIISHKMDMLFTHFGL -1111----------------3333----2222----------------------1111- SGPAALRCSQFVVKALKKFKTNTIQMSIDALPEENSEQLFQRMLKQMKEDPKKGIKNVLK ------------------------------33333333-----------11113333--- GYVPERYFLFLLEKNEIDGSEQAGQVSHEKIRALVKDFKEFTVNVNGTQSIEKAFVTGGG ------------1111-11111111------------------------3333------- VSVKEINPKEMSSKFTNGLYFCGEVLDIHGYTGGYNITSALVTGRIAGTTAGENAK -3333-----------------3333------------------------------ >PUTATIVE TETR TRANSCRIPTI; SWP:Q0RX74; PDB:2I10A; 
DDQVALQTAELFWRQGYEGTSITDLTKALGINPPSLYAAFGSKRDLFEKTLDRYCERTLQ ----------------11113333------------------------------------ LEEAVRPTAHEAVLDFLTGRVEVFTGCTVQAGLASGEPHHEIVDLLTAAREQRQTVLDRF ------------------------------------------------------------ EKALADGDLPAGTDCTALARYVAAVYGLSVEAASGAPREELTAAAILAAQVVP -------------------------------1111-------------1111- >AART; SWP:Q07230; PDB:2I13A; FSRSDHLAEHQRTHKPYKCPECGKSFSDKKDLTRHQRTHTGEKPYKCPECGKSFSQRANL ---3333--3333---------------------3333---------------------- RAHQRTHTGEKPYACPECGKSFSQLAHLRAHQRTHTGEKPYKCPECGKSFSREDNLHTHQ ---3333----------------3333--------------------------------- RTHTGEKPYKCPECGKSFSRRDALNVHQRTH --------------------------1111- >NICOTINATE-NUCLEOTIDE PYR; SWP:Q8TZS9; PDB:2I14A; MKRFYIANEDEIKAGKTTDVYFLRTKKILEVKNIRKKVLADVTTTSLPNNWRWGVLVGVE --------------11113333------------------------2222---------3 EVAKLLEGIPVNVYAMPEGTIFHPYEPVLQIEGDYADFGIYETALLGMLSQASGIATAAL 333-------------2222-------------333311113333--------------- RIKIAAKFKPVYSFGIRHMHPAIAPMIDRAAFIGGCDGVSGVLGAEMMGEKAVGTMPHAL -----%%%%-----1111-3333--------3333------3333--------------- IITVGDQVKAWKYFDEVIEEEVPRIALVDTFYDEKVEAVMAAEALGKKLFAVRLDTPSSR ------------------3333----------------------------------3333 RGNFRKIIEEVRWELKVRGYDWVKIFVSGGLDEEKIKEIVDVVDAFGVGGAIASAKPVDF --------------------------------------1111------3333-------- ALDIVEVEGKPIAKRGKLSGRKQVYRCENGHYHVVPANKKLERCPVCNAKVEPLLKPIIE ------iiii---2222---------1111-----3333--------------------- NGEIVVEFPKAREIREYVLEQAKKFNLEI ----------------------1111--- >HYPOTHETICAL PROTEIN MG29; SWP:P75364; PDB:2I15A; MKPQLLALKQFVQTEFEKVDFETFRQNFNRCLEREQSTLLIYEDDDYDDQSFFLKPMLSD -------------------3333----------------1111------3333------- AFFISSEVVKQLDLPKGDVKSCCQSFYEALTLFISALAITKGVDVGRYHQQLGKRFGVLT --------------------------------------1111------------------ VY -- >FARNESYL PYROPHOSPHATE SY; SWP:Q86C09; PDB:2I19A; MPMQMFMQVYDEIQMFLLEELELKFDMDPNRVRYLRKMMDTTCLGGKYNRGLTVIDVAES 
1111-------------------------------------------3333--------- LLSLSPNNNGEEDDGARRKRVLHDACVCGWMIEFLQAHYLVEDDIMDNSVTRRGKPCWYR ------3333------3333-------------------------------%%%%-3333 HPDVTVQCAINDGLLLKSWTHMMAMHFFADRPFLQDLLCRFNRVDYTTAVGQLYDVTSMF 1111-----------------------1111-----------------------1111-- DSNKLDPDVSQPTTTDFAEFTLSNYKRIVKYKTAYYTYLLPLVMGLIVSEALPTVDMGVT 3333-1111-------11113333----3333------------------1111------ EELAMLMGEYFQVQDDVMDCFTPPERLGKVGTDIQDAKCSWLAVTFLAKASSAQVAEFKA ----------------------3333-------1111--------3333----------- NYGSGDSEKVATVRRLYEEADLQGDYVAYEAAVAEQVKELIEKLRLCSPGFAASVETLWG -------1111---------3333----------------------------------11 KTYKRQK 11----- >DNA DAMAGE-INDUCIBLE PROT; SWP:P40087; PDB:2I1AA; QVPMLYINIEINNYPVKAFVDTGAQTTIMSTRLAKKTGLSRMIDKRFIIGRIHQAQVKIE ----------iiii------1111--------------3333------------------ TQYIPCSFTVLDTDIDVLIGLDMLKRHLACVDLKENVLRIAEVETSFLSEAEIP ------------------------1111----1111---!!!!-----3333-- >MOESIN; SWP:NA; PDB:2I1JA; KSMNVRVTTMDAELEFAIQQTTTGKQLFDQVVKTIGLREVWFFGLQYTDSKGDLTWIKLY ------------------1111----------------3333------1111-------- KKVMQQDVKKENPLQFKFRAKFYPEDVADELIQEITLKLFYLQVKNAILSDEIYCPPETS -1111--------------------3333------------------1111--------- VLLASYAVQARHGDHNPAVHGPGFLANDRLLPQRVTDQHKMSREEWEQSITNWWQEHRGM ---------------1111-22221111-------1111-------------33332222 LREDAMMEYLKIAQDLEMYGVNYFEIRNKKNTELWLGVDALGLNIYEKDDKLTPKIGFPW ---------------1111--------1111-------3333----1111--------33 SEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDV 33------!!!!------3333----------------------------1111------ QQMKAQAREEKLAKQAQREKLQLEIAARERAEKKQQEYQDRLRQMQEEMERSQANLLEAQ ----------------1111--------------------------------------33 DMVEDARRKQDEAAAALLAATTPQHHHVAERESGGGDLARGPDDLVDPVADRRTLAERNE 333333-----------1111-1111---------------1111--3333--3333--- RLHNQLKALKQDLARSCDETKETAMDKIHRENVRQGRDKYKTLREIRKGNTKRRVDQFEN -----------3333--1111-----------1111--------1111-3333----111 M 1 >MACROPHAGE 
COLONY-STIMULA; SWP:P07333; PDB:2I1MA; QVRWKIIESYNSYTFIDPTQLPYNEKWEFPRNNLQFGKTLGAGAFGKVVEATAFGLGKED ----------------3333----1111-3333---------1111-------------- AVLKVAVKMLKSTAHADEKEALMSELKIMSHLGQHENIVNLLGACTHGGPVLVITEYCCY ----------3333--------------------1111-------------------111 GDLLNFLRRKSRVLETDSTASTRDLLHFSSQVAQGMAFLASKNCIHRDVAARNVLLTNGH 1----------3333----------------------------------3333---2222 VAKIGDFGLARDIMNDSNYIVKGNARLPVKWMAPESIFDCVYTVQSDVWSYGILLWEIFS ------3333-33331111--------3333-3333------3333-------------- LGLNPYPGILVNSKFYKLVKDGYQMAQPAFAPKNIYSIMQACWALEPTHRPTFQQICSFL -----2222---------1111-----11113333-----1111-3333----------- QEQAQEDRRER ----------- >DISCS, LARGE HOMOLOG 3; SWP:Q5JUW7; PDB:2I1NA; MFKYEEIVLERGNSGLGFSIAGGIDNPHVPDDPGIFITKIIPGGAAAMDGRLGVNDCVLR -----------1111-------1111--2222--------2222--------2222---- VNEVDVSEVVHSRAVEALKEAGPVVRLVVRRRQPPPEETSV !!!!-----3333---------------------------- >NICOTINATE PHOSPHORIBOSYL; SWP:Q9HJ28; PDB:2I1OA; MNVFNTASDEDIKKGLASDVYFERTISAIGDKCNDLRVAMEATVSGPLDTWINFTGLDEV -----------1111---------------3333---------------------3333- LKLLEGLDVDLYAIPEGTILFPRDANGLPVPFIRVEGRYCDFGMYETAILGFICQASGIS ---2222-------2222-----1111----------33333333--------------- TKASKVRLAAGDSPFFSFGIRRMHPAISPMIDRSAYIGGADGVSGILGAKLIDQDPVGTM ---------!!!!-----3333-3333--------------------------------- PHALSIMLGDEEAWKLTLENTKNGQKSVLLIDTYMDEKFAAIKIAEMFDKVDYIRLDTPS 3333------------------------------------------------------33 SRRGNFEALIREVRWELALRGRSDIKIMVSGGLDENTVKKLREAGAEAFGVGTSISSAKP 33---------------1111--------------------1111------3333----- FDFAMDIVEVNGKPETKRGKMSGRKNVLRCTSCHRIEVVPANVQEKTCICGGSMQNLLVK ---------iiii---2222-------------------1111----------------- YLSHGKRTSEYPRPKEIRSRSMKELEYFK --iiii-----------------3333-- >Low-density lipoprotein r; SWP:P98158; PDB:2I1PA; GAMVLNCTSAQFKCADGSSCINSRYRCDGVYDCRDNSDEAGCPTRPPG -------3333--3333----1111----------3333--------- >DNA REPAIR AND RECOMBINAT; SWP:O73948; PDB:2I1QA; 
LTDLPGVGPSTAEKLVEAGYIDFMKIATATVGELTDIEGISEKAAAKMIMGARDLCDLGF 3333-----------1111------1111------------------------------- KSGIDLLKQRSTVWKLSTSSSELDSVLGGGLESQSVTEFAGVFGSGKTQIMHQSCVNLQN -!!!!---1111------------1111---------------------------33331 PEFLFYDEEAVSKGEVAQPKAVYIDTEGTFRPERIMQMAEHAGIDGQTVLDNTFVARAYN 111---3333-3333---------------3333-----1111-3333-1111------- SDMQMLFAEKIEDLIQEGNNIKLVVIDSLTSTFRNEYTGRGKLAERQQKLGRHMATLNKL ------------------------------------------------------------ ADLFNCVVLVTNQVSAKAEQAIGGHIVGHAATFRFFVRKGKGDKRVAKLYDSPHLPDAEA -1111-----------------------------------!!!!---------------- IFRITEKGIQD ----1111--- >HYPOTHETICAL PROTEIN; SWP:Q8PRU3; PDB:2I1SA; TFEKVYHLKLSIKGITPQIWRRIQVPENYTFLDLHKAIQAVMDWEDYHLHEFEMVNPKTG -------------------------1111------------------------------- MLDKIGAEGDPLVSEKKAKLSDYFTLENKEALYTYDFGDNWQVKVRLEKILPRKEGVEYP -------------3333-3333--1111-------------------------2222--- ICTAGKRAAVPEDSGGVWGYEEMLEVLKDSEHEEYEDTVLWLGDDFDPEYFDPKDVSF ----------2222----------33331111----------11111111-1111--- >THIOREDOXIN; SWP:P0A616; PDB:2I1UA; SATIKVTDASFATDVLSSNKPVLVDFWATWCGPCKMVAPVLEEIATERATDLTVAKLDVD ------3333----1111---------11113333------------3333------333 TNPETARNFQVVSIPTLILFKDGQPVKRIVGAKGKAALLRELSDVVPN 3-----1111----------iiii------------------------ >RECEPTOR-TYPE TYROSINE-PR; SWP:Q16849; PDB:2I1YA; TGHMILAYMEDHLRNRDRLAKEWQALCAYQAEPNTCATAQGEGNIKKNRHPDFLPYDHAR -3333--------------------1111------3333-33331111-1111--1111- IKLKVESSPSRSDYINASPIIEHDPRMPAYIATQGPLSHTIADFWQMVWESGCTVIVMLT ---33331111------------1111---------1111-------------------- PLVEDGVKQCDRYWPDEGASLYHVYEVNLVSEHIWCEDFLVRSFYLKNVQTQETRTLTQF ---------------------!!!!----------3333--------------------- HFLSWPAEGTPASTRPLLDFRRKVNKCYRGRSCPIIVHCSDGAGRTGTYILIDMVLNRMA ----------------------------------------------------------11 KGVKEIDIAATLEHVRDQRPGLVRSKDQFEFALTAVAEEVNAILKAL 11-----------------------------------------1111 >NEW ANTIGEN RECEPTOR PBLA; SWP:NA; PDB:2I24N; 
RVDQTPQRITKETGESLTINCVVRDSRCVLSTGYWYRKPPGSRNEESISDGGRYVETVNR -----------2222-----------------------2222--------!!!!-----1 GSKSFSLRINDLTVKDSGTYRCKPESRYGSYDAVCAALNDQYGGGTVVTVNAA 111---------3333-----------3333-33333333------------- >NEW ANTIGEN RECEPTOR ANCE; SWP:NA; PDB:2I27N; RVDQTPQTITKETGESLTINCVLRDSNCALSSTYWYRKKSGSTNEESISKGGRYVETVNS -----------2222-----------------------2222--------!!!!-----1 GSKSFSLRINDLTVEDSGTYRCKPESRDAECAALNDQYGGGTVVTVNAA 111---------1111-----------3333------------------ >YOPX PROTEIN; SWP:O34401; PDB:2I2LA; NTAYRVWDGEQHYWDDEGLSLIIKSNGDWTLKRLYTDVLVPVVDSTNRNAALWGAKVRGK ------------1111-------1111----------------3333---------iiii FIYDRSIVKITSDDKESSDVCEVKFSDGVFQVDVSKYDVTAVGWVEYATIEVIGDVYQNP -------------------------%%%%-----------1111----------333311 ELLEGVK 11----- >EIF4G-LIKE PROTEIN; SWP:Q5EAQ1; PDB:2I2OA; EDYKIQSFDLETQKLLKTALKDPGSVDLEKVSSVIVDQSLKDQVFSREAGRICYTIVQAE ----3333-------------3333------------11113333--------------- AKQTNGSVFRRNLLNRLQQEFKAREETRKRSTQEWVCLVSFICNIFDYLKVNNPVALVHP ------------------------------------------------------1111-- VYDCLFRLAQSDALKNEEEVDCLVLQLHRIGDQLEKNVQLDELFNLLRDGFLLQEDLSSG ---------1111-----------------------3333-------------------- RLLLLEILEFRAGGWKLSDTAQKYYY ----------1111------------ >COFILIN; SWP:P78929; PDB:2I2QA; SFSGVKVSPECLEAFQELKLGKSLRYVVFKMNDTKTEIVVEKKSTDKDFDTFLGDLPEKD 3333---------------------------1111------------3333-1111---- CRYAIYDFEFNLGEGVRNKIIFISWSPDVAPIKSKMVYSSSKDTLRRAFTGIGTDIQATD --------------------------1111--------------3333------------ FS -- >METHYLTRANSFERASE 1; SWP:P94921; PDB:2I2XA; AKRYTSMAYANADEMTFGVSKYPVKAGLDLEIGAGYTIPEINYAPRPEAGASKEKLIKEY ----------3333-------------------------------3333----------- ERITTDVMERMVQVGFPAIILETEHVQQMSNNPSWGAEVAHAQKTIMEKYHDEYGIKCAL -----------1111----------3333------------------------------- RHTIGDIRENREFLQLRGDKYSVFLEAFEQCAENGADLLSVESMGGKEVFDYAVLRNDIP ---------3333----1111----------------------2222----3333----- 
GLLYSIGCLGSIDMELIWTDISKIAKKTGTISAGDTDCAQANTAMFIGGGLLNKNLAHTI ------------------------------------3333---------1111---3333 AVIARAISAPRSLVAYEAGAVGPGKDCGYENIIVKAITGMPMTMEGKTSTCAHSDVMGNL ---------------1111-----1111-----------------1111---------33 VMQCCDCWSNESVEYHGEFGGTTVQCWSETLAYDCALMNTALETKNDKVLRDLMMLSDRY 33---------------11113333----------------1111------------111 RDPQAYMLAYDNAYRVGQSIVKDGDNIYLRAKNAAIECCNIIEEGAAGKLELSRFETKAL 133331111----------3333------------------------------------- ADAKAALEALPDDMDKFMDDCLTKYKSEVKVFKPENYGF ------1111------------------1111--1111- >Methyltransferase 1; SWP:P94920; PDB:2I2XB; MLDFTEASLKKVLTRYNVALEKALTPEEAAEELYPKDELIYPIAKAIFEGEEDDVVEGLQ --------3333--3333------------1111-----3333-------3333------ AAIEAGKDPIDLIDDALMVGMGVVIRLYDEGVIFLPNVMMSADAMLEGIEYCKENSGATP -------33331111------------------3333-----------3333-------- KTKGTVVCHVAEGDVHDIGKNIVTALLRANGYNVVDLGRDVPAEEVLAAVQKEKPIMLTG ----------2222----------------------------3333----1111------ TALMTTTMYAFKEVNDMLLENGIKIPFACGGGAVNQDFVSQFALGVYGEEAADAPKIADA ---1111-----------------------1111-------1111----3333------- IIAGTTDVTELREKFHKH ------3333-------- >PROTEIN N1; SWP:VN01_VACCC; PDB:2I39A; HMTLLIRYILWRNDNDQTYYNDDFKKLMLLDELVDDGDVCTLIKNMRMTLSDGPLLDRLN ----------------------------3333-------------------------111 QPVNNIEDAKRMIAISAKVARDIGERSEIRWEESFTILFRMIETYFDDLMIDLYG 1----------------------3333--------------3333---------- >HUMAN CANCER-RELATED NTPA; SWP:Q5TDE9; PDB:2I3BA; ARHVFLTGPPGVGKTTLIHKASEVLKSSGVPVDGFYTEEVRQGGRRIGFDVVTLSGTRGP -------------------------1111------------%%%%-------1111---- LSRVGLEPPPGKRECRVGQYVVDLTSFEQLALPVLRNADCSSGPGQRVCVIDEIGKMELF ----------------------3333-----3333-------------------1111-- SQLFIQAVRQTLSTPGTIILGTIPVPKGKPLALVEEIRNRKDVKVFNVTKENRNHLLPDI ----------3333------------------33333333-----------1111----- VTCVQSSRK --------- >ASPARTOACYLASE; SWP:P45381; PDB:2I3CA; EHIQKVAIFGGTHGNELTGVFLVKHWLENGAEIQRTGLEVKPFITNPRAVKKCTRYIDCD 
------------1111-------------3333--------------------------1 LNRIFDLENLGKKSEDLPYEVRRAQEINHLFGPKDSEDSYDIIFDLHNTTSNGCTLILED 111--3333----11113333-----------2222------------------------ SRNNFLIQFHYIKTSLAPLPCYVYLIEHPSLKYATTRSIAKYPVGIEVGPQPQGVLRADI --3333---------------------3333---3333---------------------- LDQRKIKHALDFIHHFNEGKEFPPCAIEVYKIIEKVDYPRDENGEIAAIIHPNLQDQDWK ----------------------------------------1111------3333--2222 PLHPGDPFLTLDGKTIPLGGDCTVYPVFVNEAAYYEKKEAFAKTTKLTLNAKSIRC --3333---1111-----------------3333---------------------- >HYPOTHETICAL PROTEIN ATU1; SWP:Q8UED4; PDB:2I3DA; PEVIFNGPAGRLEGRYQPSKEKSAPIAIILHPHPQFGGTNNQIVYQLFYLFQKRGFTTLR ------1111----------------------3333---------------1111----- FNFRSIGRSQGEFDHGAGELSDAASALDWVQSLHPDSKSCWVAGYSFGAWIGQLLRRPEI --2222------------------------1111----------------------1111 EGFSIAPQPNTYDFSFLAPCPSSGLIINGDADKVAPEKDVNGLVEKLKTQKGILITHRTL -------1111--3333--------------------------------2222------- PGANHFFNGKVDELGECEDYLDRRLNGELVPEP ---1111---3333---------1111------ >G-RICH; SWP:Q90306; PDB:2I3EA; GSHMELPLFFGWFLLPEEEERIKCATMDFLKTLDTLEAFKEHISEFTGEAEKEVDLEQYF ------------------------------------------------------3333-- QNPLQLHCTTKFCDYGKAEGAKEYAELQVVKESLTKSYELSVTALIVTPRTFGARVALTE --------------------------------3333-----------3333--------- AQVKLWPEGADKEGVAPALLPSVEALPAGSRAHVTLGCSAGVETVQTGLDLLEILALQKE --1111-3333----33333333-------------------3333-------------- GKEGTQVEMDLGTLTYLSEGRWFLALREPINADTTFTSFSED --------1111----3333---------------------- >GLYCOLIPID TRANSFER-LIKE ; SWP:NA; PDB:2I3FA; DFGIIVILWKQVTVKEDGKVPLEPFLTAAKEVLRVVDAFGSGFRIVKNDIAGNIKKLYRA 3333----------3333-----------------33333333---------------11 NQTVHAETLQELIIAENSPDGLATVALLWLKRAFQFIASFLRRLVVTDKSLEQCVTEAYN 11---------------1111--------------------------------------- CTLRPCHSAVIQKVFWGGVKLAPSRERFYRKLHPDLNIAKAKIEEFLIELHDPLCCIVQF --3333------------1111-------------------------------------- FFQRELEDQCWGDEVYQRKDSSEWLK ------------3333---1111--- >BACULOVIRAL IAP 
REPEAT-CO; SWP:Q96CA5; PDB:2I3HA; GPAFPGMGSEELRLASFYDWPLTAEVPPELLAAAGFFHTGHQDKVRCFFCYGGLQSWKRG ---3333-------1111---3333--------------------------------222 DDPWTEHAKWFPGCQFLLRSKGQEYINNIH 2---------1111---------------- >GAMMA-GLUTAMYLTRANSFERASE; SWP:Q9HJH4; PDB:2I3OA; FRSRPNALSQRSVIASSSELASLAGRDILKRGGNIFDAALAVSALCVTQNNLCGLGGDLF ----------------------------1111---------------------1111--- ALIRDENGQIDLNGSGQASRAVSIDYYESGLTKIPERGPYAAITVPGIAGSWDEIFRKFA ----1111----------11113333-----------1111------------------- TDIADILEPAIRTASAGFPITQNYSDSIARSAPVIGQYRGWSSIFPNGSVPVAGEILKQP ------------------------------333311113333---------2222----- DLAESFRLSEEGFRSFYDGSLADIIIAGLEGTGSPLSDRDLRVYRPLIGKPVFTDLDEFR -----------3333-----------1111-------------------------!!!!- IYETSPNSQGITVIEWIRGESHGYDSRTWEAKIEDIFETEEAYDKRRKITDPSYNGLPKR -------------------1111-1111-----------------1111-3333------ DHNDIGDTTYFSISDSEGRSVSIIQSNYGFGSGIVPKGTGFVLQNRGSYFTLQRDHPNAL --------------1111-----------------2222-------1111--1111---- PGKRTFHTLAACVEKEHDLYASLGSGGDIQPQVQQILEILKDNTDPQAILDKPRWTEPYT --------------iiii--------11113333---3333---3333-----------1 IYEAPGAVYVESEELYRNVSKQISGRKVVLRDVSQEFGTAQITTLIRGDVVVGAADPRGD 111------------------------------3333--------2222------1111- GIAIPYS ------- >Spindle assembly checkpoi; SWP:P47074; PDB:2I3TB; MKPEKIDCNFKLIYCELEFSLEEVLAISRNVYKRV --------1111----------------------- >SERINE-THREONINE PHOSPHAT; SWP:Q8WPN9; PDB:2I44A; DVPPTIHVPLPPTSYPAFDAAIFTDIGGRKHQEDRFTLCPQLVPGRDDCAFFGVFDGTVG --------------1111-------!!!!-------------2222-------------- DFASENVKDLVVPQLISSPAWQEVTELRSDVPATEVDEKLPQLLDQAVDDYKNADNELVK -----3333-----------------------3333------------------------ CEQLNKDYASSTSVTAVLAKGFVAVGHLGDSRIAGVETPNGLNCEFLTVDHKPDPHEKLR -1111-------------iiii---------------1111------------------- IRNGGSVEYLHNHNNKPFIRGGDFSFRKSRGEQPQLQYSRAFGGKDLKYGLSNQPDVRVV ---------2222-------1111-------------------1111------------- RVTPQHRVILATDGLWDVSAAQAVEIAQARQEGRNPAQALVETLAEQQSRNQSADNITAT 
--1111-----3333--------------1111--------------1111--------- VFFK ---- >HYPOTHETICAL PROTEIN; SWP:Q9JXU4; PDB:2I45A; ETINLKQHLAAIKEYWQPEIINRHGFQFHLVKLLGDYGWHTHSDKVLFAVEGDAVDFADG ----------------------!!!!---------------------------------- GSTIREGEAVVPKSVSHRPRSENGCSLVLIELS ----2222------------------------- >ADRENOCORTICAL DYSPLASIA ; SWP:Q96AP0; PDB:2I46A; SGRLVLRPWIRELILGSETPSSPRAGQLLEVLQDAEAAVAGPSHAPDTSDVGATLLVSDG -----------------------------------------------3333--------- THSVRCLVTREALDTSDWEEKEFGFRGTEGRLLLLQDCGVHVQVAEGGAPAEFYLQVDRF ---------------------------2222--------------iiii----------- SLLPTEQPRLRVPGCNQDLDVQKKLYDCLEEH -------------3333--------------- >BICARBONATE TRANSPORTER; SWP:Q55460; PDB:2I49A; MPETANIKLGYIPIVEAAPLIIAQEKGFFAKYGMTGVEVSKQANWASARDNVTIGSQGGG -------------3333------1111--1111----------------------1111- IDGGQWQMPMPHLITEGIITNGNKVPMYVLAQLITQGNGIAVAPMHEGKGVNLDITKAAD -------------------iiii-------------------3333--------3333-- YIKGFNKTNGRKFKAAHTFPNVNQDFWIRYWFAAGGVDPDTDIDLLAVPPAETVQGMRNG --------------------------------1111------------------------ TMDAFSTGDPWPYRIVTENIGYMAGLTAQIWPYHPEEYLAIRADWVDKNPKATKALLKGI ---------------1111------3333------------------------------- MEAQQWIDDPKNRPEVVQIVSGRNYFNVPTTILESPFKGQYTMGDGQPAIDDFQKGPLYW --------3333---------1111---3333-3333------iiii----3333----- KDGIGNVSYPYKSHDLWFLTESIRWGFHKNAIPDLDTAQKIIDKVNREDLWREAATEAGF ----------3333--------1111-3333----------------------------3 TADIPSSTSRGVETFFDGITFDPANPSAYLQSLAIKKV 333-----------1111---1111----1111----- >THIOREDOXIN; SWP:A1YAC5; PDB:2I4AA; SEHTLAVSDSSFDQDVLKASGLVLVDFWAEWCGPCKMIGPALGEIGKEFAGKVTVAKVNI -------3333----1111---------1111----------------iiii------33 DDNPETPNAYQVRSIPTLMLVRDGKVIDKKVGALPKSQLKAWVESAQ 33-----1111----------iiii------------------3333 >ATP-DEPENDENT RNA HELICAS; SWP:O00571; PDB:2I4IA; MVEATGNNCPPHIESFSDVEMGEIIMGNIELTRYTRPTPVQKHAIPIIKEKRDLMACAQT --------------1111-------------------3333-----------------22 
GSGKTAAFLLPILSQIYSDGPGEALRAMKENGRYGRRKQYPISLVLAPTRELAVQIYEEA 22-----------------------------1111------------------------- RKFSYRSRVRPCVVYGGADIGQQIRDLERGCHLLVATPGRLVDMMERGKIGLDFCKYLVL ---2222-------------------1111------------------------------ DEADRMLDMGFEPQIRRIVEQDTMPPKGVRHTMMFSATFPKEIQMLARDFLDEYIFLAVG ------1111---------------2222------------------------------- TSENITQKVVWVEESDKRSFLLDLLNATGKDSLTLVFVETKKGADSLEDFLYHEGYACTS ------------1111--------1111-----------------------1111----- IHGDRSQRDREEALHQFRSGKSPILVATAVAARGLDISNVKHVINFDLPSDIEEYVHRIG -1111----------------------33332222-----------------------11 RTGRNLGLATSFFNERNINITKDLLDLLVEAKQEVPSWLENMAYEHHY 11-----------33331111-------1111---3333-3333---- >SORTING NEXIN-1; SWP:Q13596; PDB:2I4KA; QFDLTVGITDPEKIGDGMNAYVAYKVTTQTSLPLFRSKQFAVKRRFSDFLGLYEKLSEKH --------------------------------------------3333------------ SQNGFIVPPPPEKSLIGMTKVKVGKEDSSSAEFLEKRRAALERYLQRIVNHPTMLQDPDV -----------------------------3333-----------------3333------ REFLEKEE --1111-- >PROLINE-TRNA LIGASE; SWP:Q6N5P6_RHOPA; PDB:2I4LA; HMRLSRFFLPILKENPKEAEIVSHRLMLRAGMLRQEAAGIYAWLPLGHRVLKKIEQIVRE --3333---------1111--------1111-----2222-------------------- EQNRAGAIELLMPTLQLADLWRESGRYDAYGPEMLRIADRHKRELLYGPTNEEMITEIFR ----------------3333-1111-----3333----1111------------------ AYIKSYKSLPLNLYHIQWKFRDEQRPRFGVMRGREFLMKDAYSFDVDEAGARKSYNKMFV ----3333-------------------!!!!----------------------------- AYLRTFARMGLKAIPMRAETGPIGGDLSHEFIVLAETGESGVYIDRDVLNLPVPDENVDY ------1111----------3333----------1111-------3333-----111111 DGDLTPIIKQWTSVYAATEDVHEPARYESEVPEANRLNTRGIEVGQIFYFGTKYSDSMKA 11-3333---------------1111-----3333--------------!!!!--1111- NVTGPDGTDAPIHGGSYGVGVSRLLGAIIEACHDDNGIIWPEAVAPFRVTILNLKQGDAA ---1111--------------------------1111---3333----------2222-- TDAACDQLYRELSAKGVDLYDDTDQRAGAKFATADLIGIPWQIHVGPRGLAEGKVELKRR ------------1111-----------------------------33331111------- SDGARENLALADVVAR --------1111---- >V-TYPE ATP SYNTHASE SUBUN; SWP:O29102; 
PDB:2I4RA; LAVVGDPDFTIGFLAGISDIYEVTSDEEIVKAVEDVLKRDDVGVVIKQEYLKKLPPVLRR -----33333333----------------------1111-------111111113333-- EIDEKVEPTFVSVG -3333--------- >UNCHARACTERIZED CONSERVED; SWP:NA; PDB:2I51A; SLAPWRGAIAHALHRNRSLVYARYLQLATVQPNGRPANRTLVFRGFLEDTNQLRFITDTR --1111-------1111---1111------1111------------2222-------111 SAKADQIQQQPWAEICWYFPNTREQFRAGDLTLISSDDSHQDLQPARIAWQELSDAARLQ 1--------------------------------------3333------1111------1 FGWPYPGKPRIKESGAFEPSPPDPIEPVPNFCLLLLDPVQVDHLELRGEPQNRWLYHRND 111-2222----3333-----------1111---------------------------11 QQEWSSEAINP 11--------- >HYPOTHETICAL PROTEIN; SWP:Q6L2J9; PDB:2I52A; SLYDPAEKYFNCTDIQRAFFEAGIKLGAIFHQYTGIPVNSENASMAEEFIERSTMIQPFV ---1111-------------------------2222--3333--------------2222 ENVRISINNVYSYSSLNEKMLHAEVLINYNGKKVLGVLNYDEGLDYPVMYAKEVL ----------------3333--------iiii---------1111---------- >CYCLIN K; SWP:O75909; PDB:2I53A; SANLDHTKPCWYWDKKDLAHTPSQLEGLDPATEARYRREGARFIFDVGTRLGLHYDTLAT -------------3333------1111---------------------1111-------- GIIYFHRFYMFHSFKQFPRYVTGACCLFLAGKVEETPKKCKDIIKTARSLLNDVQFGQFG ------------3333----------------------3333---------33333333- DDPKEEVMVLERILLQTIKFDLQVEHPYQFLLKYAKQLKGDKNKIQKLVQMAWTFVNDSL ----------------------------------1111--3333--------------11 CTTLSLQWEPEIIAVAVMYLAGRLCKFEIQEWTSKPMYRRWWEQFVQDVPVDVLEDICHQ 113333---------------------3333--------1111------3333------- ILDLYSQGKQQMPH 3333---------- >L-RHAMNOSE ISOMERASE; SWP:Q75WH8; PDB:2I57A; FRIAQDVVARENDRRASALKEDYEALGANLARRGVDIEAVTAKVEKFFVAVPSWGVGTGG ---3333-----------------------1111-3333---3333-----1111----- TRFARFPGTGEPRGIFDKLDDCAVIQQLTRATPNVSLHIPWDKADPKELKARGDALGLGF 3333-------------------------------------------------------- DAMNSNTFSDAPGQAHSYKYGSLSHTNAATRAQAVEHNLECIEIGKAIGSKALTVWIGDG ----------2222---11111111-3333------------------------------ SNFPGQSNFTRAFERYLSAMAEIYKGLPDDWKLFSEHKMYEPAFYSTVVQDWGTNYLIAQ --2222-----------------11111111----------------------------- 
TLGPKAQCLVDLGHHAPNTNIEMIVARLIQFGKLGGFHFNDSKYGDDDLDAGAIEPYRLF --1111----1111-2222---------1111-----------------2222------- LVFNELVDAEARGVKGFHPAHMIDQSHNVTDPIESLINSANEIRRAYAQALLVDRAALSG ------------------------------------------------3333-------- YQEDNDALMATETLKRAYRTDVEPILAEARRRTGGAVDPVATYRASGYRARVAAERPAS ----------------3333----------1111------------------------- >PHOSPHOMETHYLPYRIMIDINE K; SWP:P39610; PDB:2I5BA; MHKALTIAGSDSSGGAGIQADLKTFQEKNVYGMTALTVIVAMDPNNSWNHQVFPIDTDTI ----------1111-----------1111----------------%%%%------3333- RAQLATITDGIGVDAMKTGMLPTVDIIELAAKTIKEKQLKNVVIDPVMVCKGANEVLYPE ----------------------------------1111-------------------333 HAQALREQLAPLATVITPNLFEASQLSGMDELKTVDDMIEAAKKIHALGAQYVVITGGGK 3-------3333---------------------------------1111-------!!!! LKHEKAVDVLYDGETAEVLESEMIDTPYTHGAGCTFSAAVTAELAKGAEVKEAIYAAKEF ----------------------------2222-----------1111------------- ITAAIKESFPLNQYVGPTKHSALRLNQQS ----1111---1111---1111------- >HYPOTHETICAL PROTEIN MM_2; SWP:Q8PU52_METMA; PDB:2I5EA; ARAVIPYKKAGAKSRLSPVLSLQEREEFVELLNQVISSLKGAGIEQVDILSPSVYGLEET --------22221111-----------------------1111----------2222--- EARVLLDEKDLNEALNRYLKEAEEPVLIVADLPLLSPEHIKEISSTEKDVCIVPGKGGGT -------------------------------1111--------------------%%%%- NALFIKNPSKYRVKYYGSSFLTHCSIATDSGQDFEIYDSFAGTDIDEPEDLVELLIHGKG ------3333-----------------1111---------------3333---------- AAKDYIESKFRLEVKKGRVGLVPL ------------------------ >AMIDOHYDROLASE; SWP:Q9HTG8; PDB:2I5GA; LSPAELHADSIVIDGLIIAKWNRELFEDRKGGLTAANCTVSVWEGFQATVNNITASNKLI -----3333-------------------1111---------------------------- RDNSDLVIPVRSTADIRKAKEQGKTGILYGFQNAHAFEDQIGYVEVFKQLGVGIVQCYNT --1111-----3333------------------3333--3333----------------- QNLVGTGCYERDGGLSGFGREIVAENRVGICDLSHVGSKTSEEVILESKKPVCYSHCLPS ----------------3333------------1111----------------------11 GLKEHPRNKSDEELKFIADHGGFVGVTFAPFLKKGIDSTIDDYAEAIEYVNIVGEDAIGI 11--1111--------------------1111-!!!!-3333-----------1111--- 
GTDFTQGHGHDFFEWLTHDKGYARRLTNFGKIVNPLGIRTVGEFPNLTETLLKRGPERVV ----2222---------2222------------------3333----------------- RKVGENWVRVLRDVWGE ----------------- >HYPOTHETICAL PROTEIN AF15; SWP:O28741; PDB:2I5HA; EDYAYVLDFMPYGHPDDKRPIHRREPLAQVVGERNFTLLEVSIRKGKQPLVMDRVYIGKG ---------11111111--3333--------------------2222--2222------- ERDVVYKIKRRLRYEDLTPAAKTELPYVIEHIIKQDEKKYVDFFNDSITTRMHQLELLPG ------------3333-----------------------3333---------3333-222 VGKKMMWAIIEERKKRPFESFEDIAQRVKGIQRPEKLIVSRIIYEIKNPQTKYKLFTA 2--------------------------2222----------------1111------- >UPF0249 PROTEIN EF_3048; SWP:P59745; PDB:2I5IA; SNKKLIINADDFGYTPAVTQGIIEAHKRGVVTSTTALPTSPYFLEAESARISAPTLAIGV ----------2222----------------------1111-3333-------1111---- HLTLTLNQAKPILPREVPSLVDEAGYFWHQSIFEEKVNLEEVYNEWDAQIISFKSGRRPD -----2222-------1111-1111---33333333------------------------ HIDSHHNVHGKNKKLLGVALALARKYQLPLRNASRSIETKDYLELYQDVRTPDELYQFYD ---222211113333--------------------33333333--!!!!---------!! KAISTETILQLLDVVCSEGEVFEINCHPAFIDTILQNQSGYCPRIREVEILTSQEVKEAI !!-----------1111-------------------------3333-------------- EERGILLANYESLA 1111---------- >Photosynthetic reaction c; SWP:P07173; PDB:2I5NC; CFEPPPATTTQTGFRGLSMGEVLHPATVKAKKERDAQYPPALAAVKAEGPPVSQVYKNVK -------------2222------------------------------------------- VLGNLTEAEFLRTMTAITEWVSPQEGCTYCHDENNLASEAKYPYVVARRMLEMTRAINTN -1111--------------------1111--1111------------------------- WTQHVAQTGVTCYTCHRGTPLPPYVRYLEPTLPLNNRETPTHVERVETRSGYVVRLAKYT 3333!!!!--3333-iiii---------------1111--333333331111-------i AYSALNYDPFTMFLANDKRQVRVVPQTALPLVGVSRGKERRPLSDAYATFALMMSISDSL iii-----3333-----------------------!!!!--3333------------111 GTNCTFCHNAQTFESWGKKSTPQRAIAWWGIRMVRDLNMNYLAPLNASLPASRLGRQGEA 1-1111--3333---!!!!------------------------3333--3333-1111-- PQADCRTCHQGVTKPLFGASRLKDYPELGPIK ---3333-iiii--%%%%--33333333---- >Reaction center protein H; SWP:P06008; PDB:2I5NH; YHGALAQHLDIAQLVWYAQWLVIWTVVLLYLRREDRREGYPLVEPEDGQVYELPYPKTFV 
2222-%%%%------------------------1111--------33331111------- LPHGGTVTVPRRRPETRELKLAQTDGFEGAPLQPTGNPLVDAVGPASYAERAEVVDATVD 1111----------------------1111------3333--!!!!-----------111 GKAKIVPLRVATDFSIAEGDVDPRGLPVVAADGVEAGTVTDLWVDRSEHYFRYLELSVAG 1-----33331111--2222--2222---1111------------------------222 SARTALIPLGFCDVKKDKIVVTSILSEQFANVPRLQSRDQITLREEDKVSAYYAGGLLYA 2------3333---1111------33331111---------------------------- TPERAESLL 3333----- >Reaction center protein L; SWP:P06009; PDB:2I5NL; ALLSFERKYRVRGGTLIGGDLFDFWVGPYFVGFFGVSAIFFIFLGVSLIGYAASQGPTWD --11111111-------!!!!----!!!!---3333----------------1111---3 PFAISINPPDLKYGLGAAPLLEGGFWQAITVCALGAFISWMLREVEISRKLGIGWHVPLA 333------3333-----1111--------------------------1111-------- FCVPIFMFCVLQVFRPLLLGSWGHAFPYGILSHLDWVNNFGYQYLNWHYNPGHMSSVSFL --------------------3333-----------------11113333----------- FVNAMALGLHGGLILSVANPGDGDKVKTAEHENQYFRDVVGYSIGALSIHRLGLFLASNI --------------------%%%%--------------------3333------------ FLTGAFGTIASGPFWTRGWPEWWGWWLDIPFWS -----------------3333----11111111 >Glyceraldehyde-3-phosphat; SWP:Q01077; PDB:2I5PO; MVSIAINGFGRIGRLVLRIALERKNIDVVAINDPFISVDYAAYMFKYDSTHGKYKGEVSH ----------------------1111------1111------------------------ DGSNLIINGKKVAVFQEKDPATLPWGKLGVDIAVDSTGVFKELDSAQKHIDAGAKKVVIT ------------------3333-------------------3333----1111------- APSKTAPMFVVGVNEDKYNGEKIVSNASCTTNCLAPIAKIINDEFGIEEGLMTTVHSITS ---------22223333------------------------------------------- GNIIPSSTGAAKAVGKVLPELQGKLTGMAFRVPTTDVSVVDLTVKLVKAATYDEIKAAVK --------33333333-3333--------------------------------------- KVSEGKLKDVVGYTEDAVVSSDFLGDTHSTIFDAAAGIQLSPKFVKLVAWYDNEYGYSTR ----1111----------33332222-------3333----------------------- VVDLVEHVA --------- >PROTEIN C7ORF24; SWP:O75223; PDB:2I5TA; EESFLYFAYGSNLLTERIHLRNPSAAFFCVARLQDFKLDFGNSQGKTSQTWHGGIATIFQ ---------1111--------1111-----------------iiii-------------- SPGDEVWGVVWKNKSNLNSLDEQEGVKSGYVVIEVKVATQEGKEITCRSYLTNYESAPPS 
2222--------3333--------3333----------1111------------------ PQYKKIICGAKENGLPLEYQEKLKAIEPNDYTGKVSEEIEDIIKK ----------1111--------1111------------------- >DNAD DOMAIN PROTEIN; SWP:Q830E8; PDB:2I5UA; NAISIWENNGFGLSSKTTDFDYWISDFEKIGASQKEAEQLIVKAIEIAIDANARNYNYIN -----1111----3333----------1111----------------------------- AILKDWEQRGFKS ------------- >INSECT TOXIN 2; SWP:Q26292; PDB:2I61A; MDGYIKRRDGCKVACLIGNEGCDKECKAYGGSYGYCWTWGLACWCEGLPDDKTWKSETNT ------1111----------------1111-------2222-------1111--3333-- CG -- >NICOTINAMIDE N-METHYLTRAN; SWP:O55239; PDB:2I62A; FTSKDTYLSHFNPRDYLEKYYSFGSRHCAENEILRHLLKNLFKIFCLGAVKGELLIDIGS --3333-----------------------------------------------------! GPTIYQLLSACESFTEIIVSDYTDQNLWELQKWLKKEPGAFDWSPVVTYVCDLEGNRMKG !!!1111-3333----------3333----------2222--3333-----1111----- PEKEEKLRRAIKQVLKCDVTQSQPLGGVSLPPADCLLSTLCLDAACPDLPAYRTALRNLG -----------------1111-1111--------------3333-------------333 SLLKPGGFLVMVDALKSSYYMIGEQKFSSLPLGWETVRDAVEEAGYTIEQFEVISQNYSS 3--2222--------------!!!!---------------------------------11 TTSNNEGLFSLVGRKPG 11--------------- >PUTATIVE ACETYLTRANSFERAS; SWP:Q9HV14; PDB:2I6CA; QLSHRPAETGDLETVAGFPQDRDELFYCYPKAIWPFSVAQLAAAIAERRGSTVAVHDGQV -------3333---1111----------1111------------1111-------iiii- LGFANFYQWQHGDFCALGNVAPAARGLGVARYLIGVENLAREQYKARLKISCFNANAAGL ---------2222-------3333----------------------------3333---- LLYTQLGYQPRAIAERHDPDGRRVALIQDKPLEP ---1111----------1111------------- >RNA METHYLTRANSFERASE, TR; SWP:Q7MW92; PDB:2I6DA; ALSANQIKFLRSLRERKYRLREQAFAVEGPKLVGELPFYRCRLVGTAALRAVSTPHDAEV -----------------------------------1111---------1111-------- VELPESFDFKRISTQTTPQPLAVFDLPAEPEPVVEGLTLLLDGVQDPGNVGTILRTADWF ---33333333------------------------------------------------- GIRHVWLGTGSADVFSPKVVQASGALARVQPTPLKNTVDTLAYFRRQGIPVYGAFLDGQS -------2222-11113333---1111-----------------1111-----------3 LYEAPLPNFTEPAILVLGSEGRGISPEVAAEITDRLTIPASGLSVESLNVAIATAILCSE 333----1111-------3333--33331111---------------------------- 
WRRRS -1111 >HYPOTHETICAL PROTEIN; SWP:Q9RXE3; PDB:2I6EA; PYRAGWIHFTNVAPILDSLELPPGVTAITGVPTQNAALLSGEVDIANVSAVEFIRHADTL --------3333---------2222-----3333---------------------3333- AALPDFSVAVLGPVYSVNLFHTCPLPELRRVALTSQSASVALLEVLLRQKGLSPVLERAE -----------------------3333--------------------------------- GTAESLLAAGYDGVLRIGDDALREWYGVVGPLTPERTTSLPHTGRGITVTDLAQEWFDLT -33331111----------------------------------iiii------------- GHPFTFAVWAYRKDNPPPAALLQAREARRRGIGHLAEVSQRHAEKLGLPERVVQHYLWNF -----------3333---------------3333----------------------3333 RYHLEAPDRLGLREFADLAVPGHAELTF ---------------------------- >PUTATIVE METHYLTRANSFERAS; SWP:Q8ZPC3; PDB:2I6GA; GTVRDENYFTEYGLTRTHSDVLAAAVVAPGRTLDLGCGNGRNSLYLAANGYDVTAWDNPA ------------------------------------!!!!-----------------333 SANLERIAAEGLDNLQTDLVDLNTLTFDGEYDFILSTVVFLEAQTIPGLIANQRCTPGGY 3------11111111-----3333-----------------3333-------1111---- NLIVAADFPFAFEGELRRYYEGWDLYNEDVGLRFATLARTA -----------------1111-------------------- >HYPOTHETICAL PROTEIN ATU0; SWP:Q8UJ18; PDB:2I6HA; KNDTAALAADIVDFWKKAGPDKWFDKDAAFDNHFHDRFRDAHFAAARRELDGWLEGAESS -1111-------------3333----------------------1111------------ LALLLLDQFPRNCFRGTAHYATDPLARFFADEAIRRGHDQAVSEDLRVFFYLPFSHAEDI ---------33332222--1111-------------3333--3333----3333------ AAQQRACDLNQPLGGLYLHHAEEHRDIVERFGRFPHRNGILLRETTPEERQYLEEG ----------3333-------------------3333-1111----------1111 >Sulfolobus solfataricus p; SWP:Q97VZ7; PDB:2I6JA; MYWVRRKTIGGSGLPYTENEILEWRKEGVKRVLVLPEDWEIEESWGDKDYYLSILKKNGL ----2222--------3333----3333---------------------------1111- QPLHIPIPDGGVPSDSQFLTIMKWLLSEKEGNLVHCVGGIGRTGTILASYLILTEGLEVE -------2222------------------------------------------------- SAIDEVRLVRPGAVQTYEQEMFLLRVEGMRKSWLKNIYSNS -----33332222---------------------------- >MITOGEN-ACTIVATED PROTEIN; SWP:Q16659; PDB:2I6LA; GFDLGSRYMDLKPLGLVFSAVDNDCDKRVAIKKIVLTDPQSVKHALREIKIIRRLDHDNI ---------------------------------------------------1111-1111 
VKVFEILGPSGSQLTDLTELNSVYIVQEYMETDLANVLEQGPLLEEHARLFMYQLLRGLK -------1111--------------------------1111--3333------------- YIHSANVLHRDLKPANLFINTEDLVLKIGDFGLARIMHLSEGLVTKWYRSPRLLLSPNNY --1111------3333----1111------1111-----3333--1111------1111- TKAIDMWAAGCIFAEMLTGKTLFAGAHELEQMQLILESIPVVHEEDRQELLSVIPVYIRN 3333---------------------------------------------1111------- DMTEPHKPLTQLLPGISREAVDFLEQILTFSPMDRLTAEEALSHPYMSIYSF 1111---3333-2222-------1111---3333---------33331111- >UBIQUITIN-CONJUGATING ENZ; SWP:NA; PDB:2I6TA; VNKITVVGGGELGIACTLAISAKGIADRLVLLDLSATMDLEIFNLPNVEISKDLSASAHS --------------------1111--------------------2222----3333---- KVVIFTVNSQSYLDVVQSNVDMFRALVPALGHYSQHSVLLVASQPVEIMTYVTWKLSTFP ---------------------------------1111----------------------3 ANRVIGIGCNLDSQRLQYIITNVLKAQTSGKEVWVIGEQGEDKVLTWSGQEEVVSHTSQV 333------------------------3333----------------------------- QLSNRAMELLRVKGQRSWSVGLSVADMVDSIVNNKKKVHSVSALAKGYYDINSEVFLSLP ------1111--------------------1111----------2222------------ CILGTNGVSEVIKTTLEDTVTEKLQSSASSIHSLQQQLKL ---1111--------------------------3333--- >GENERAL SECRETION PATHWAY; SWP:P45777; PDB:2I6VA; QEIFQYVRLSQVKRDDKVLGYRVSPGKDPVLFESIGLQDGDMAVALNGLDLTDPNVMNTL -3333--------!!!!-----------3333-----2222----iiii1111------- FQSMNEMTEMSLTVERDGQQHDVYIQF --3333---------iiii-------- >HYDROLASE, HALOACID DEHAL; SWP:Q7MWA6; PDB:2I6XA; AIRNIVFDLGGVLIHLNREESIRRFKAIGVADIEELDPKGLFLDLESGRKSEEEFRTELS -------------------------11111111--------------------------- RYIGKELTYQQVYDALLGFLEEISAEKFDYIDSLRPDYRLFLLSNTNPYVLDLASPRFLP ----------------3333-------------3333---------------------33 SGRTLDSFFDKVYASCQGKYKPNEDIFLEIADSGKPEETLFIDDGPANVATAERLGFHTY 33-3333------3333-----------------1111--------------1111---- CPDNGENWIPAITRLLREQ --2222-------3333-- >HYPOTHETICAL PROTEIN; SWP:Q97YD5_SULSO; PDB:2I71A; GASIVFSTIGNPKGYQKVTYEIDGEKFESNVSVLALRDLLKVDKTVVILGISVADVYNCK ----------1111-------iiii-----3333---1111--------3333-1111-- 
YADYRSCKECIIQNSKNDLGISESYVVAPNVYQKFKGKPDHYFTYIYYHSLRILEKEGIN -------------------------------!!!!--3333------------------- EVFIDTTHGINYGVLAKEAIQLAVSAYAAKSEKEVKVSLYNSDPVGKDVSDTVKLHEIEA -----1111-------------------1111-----------22223333--------- IKISPLSGLKYVTYQILNKDKNFFNKIFSDSVNAIPRFATALDNGLFIYLSEKDSSLHLK ---------------11111111----2222----------1111--------------- RLEDDLSKDPLLTPSENEINVVYKDKYALSHALFYVISRFSGNVDLDTLRHYAETYADKV --------------1111------------------3333-------------------- TRAIIENEVDKIEKYQGSERKLLGEYRILYAHGGLPYAGTYVYKEKDKVYVTYGDKIDEI -----------3333------3333----1111--1111-------------!!!!---- ERQI 1111 >PNGASE; SWP:Q9JI78; PDB:2I74A; ERKEILFIPSENEKISKQFHLRYDIVRDRYIRVSDNNTNISGWENGVWKMESIFRKVEKD ------------------------1111---1111------3333--------------- WNMVYLARKEGSSFAYISWKFECGSAGLKVDTVSIRTSSQSFESGSVRWKLRSETAQVNL --------2222----------3333---------------!!!!-------1111---- LGDKNLRSYNDFSGATEVTLEAELSRGDGDVAWQHTQLFRQSLNDSGENGLEIIITFNDL ---------1111---------------1111---------1111--------------- >TYROSINE-PROTEIN PHOSPHAT; SWP:P29074; PDB:2I75A; SLRESMIQLAEGLITGTVLTQFDQLYRKKPGMTMSCAKLPQNISKNRYRDISPYDATRVI ---------------------3333---2222---1111--1111--1111--3333--- LKGNDYINANYINMEIPSIINQYIACQGPLPHTCTDFWQMTWEQGSSMVVMLTTQVERGR -----------------------------1111--------------------------- VKCHQYWPEPTGSSSYGCYQVTCHSEEGNTAYIFRKMTLFNQEKNESRPLTQIQYIAWPD --------2222---!!!!----------------------------------------- HGVPDDSSDFLDFVCHVRNKRAGKEEPVVVHCSAGIGRTGVLITMETAMCLIECNQPVYP ------------------------------------3333-----------1111---33 LDIVRTMRDQRAMMIQTPSQYRFVCEAILKVYE 33----3333-------------------1111 >HYPOTHETICAL PROTEIN; SWP:Q9X250; PDB:2I76A; VLNFVGTGTLTRFFLECLKIGYILSRSIDRARNLAEVYGGKAATLEKHPEVVFVIVPDRY --------------------------------------------------------3333 IKTVANHLNLGDAVLVHCSGFLSSEIFKKSGRASIHPNFSFLEKALEKDQIVFGLEGDER ---1111---------------3333---------------1111--------------- GLPIVKKIAEEISGKYFVIPSEKKKAYHLAAVIASNFPVALAYLSKRIYTLLGLDEPELL 
-------------------1111------------3333----------1111--3333- IHTLKGVADNIKKRVECSLTGPVKRGDWQVVEEERREYEKIFGNTVLYDEIVKLLREVAE --------3333--1111--3333--3333------------------------------ SERR ---- >ACETYLTRANSFERASE, GNAT F; SWP:Q97NS4; PDB:2I79A; MEYELLIREAEPKDAAELVAFLNRVSLETDFTSLDGDGILLTSEEMEIFLNKQASSDNQI ----------3333----------1111------3333---3333--------------- TLLAFLNGKIAGIVNITADQRKRVRHIGDLFIVIGKRYWNNGLGSLLLEEAIEWAQASGI -----iiii-----------3333----------3333---------------------- LRRLQLTVQTRNQAAVHLYQKHGFVIEGSQERGAYIEEGKFIDVYLMGKLI --------3333-------1111---------------------------- >CALPAIN 13; SWP:Q6MZZ7; PDB:2I7AA; SGLVPGSDIDATQLQGLLNQELLDMFSLDECRSLVALMELKVNGRLDQEEFARLWKRLVH -----------------------------------1111--------------------- YQHVFQKVQTSPGVLLSSDLWKAIENTDFLRGIFISRELLHLVTLRYSDSVGRVSFPSLV ----------2222-1111-------3333------------------1111-------- CFLMRLEAMAKTFRNLSKDGKGLYLTEMEWMSLVMYN ------------------------------------- >SPERMIDINE SYNTHASE; SWP:Q8II73; PDB:2I7CA; KKWFSEFSIMWPGQAFSLKIKKILYETKSKYQNVLVFESTTYGKVLVLDGVIQLTEKDEF -------3333------------------------------------iiii---3333-- AYHEMMTHVPMTVSKEPKNVLVVGGGDGGIIRELCKYKSVENIDICEIDETVIEVSKIYF ---------1111-----------3333--------3333-------------------1 KNISCGYEDKRVNVFIEDASKFLENVTNTYDVIIVDSSDPIGPAETLFNQNFYEKIYNAL 111-----1111--------3333-----------------1111-----------1111 KPNGYCVAQCESLWIHVGTIKNMIGYAKKLFKKVEYANISIPTYPCGCIGILCCSKTDTG 1111-------3333-------------------------1111%%%%------------ LTKPNKKLESKEFADLKYYNYENHSAAFKLPAFLLKEIENI ---------3333------3333--1111-3333---1111 >FERREDOXIN COMPONENT OF D; SWP:A2TC31; PDB:2I7FA; NKLRLCQVASVKDGEPVAVYQEKMPALAVYNVDGEVFVTDNLCTHGNAMLTDGYQDGTII ------3333-2222-----1111-------iiii-------------3333---!!!!- ECPFHGGSFDIATGAAKAFPCQIPIKTYPVTIEDGWVCIDQP --------------------------------iiii------ >MONOOXYGENASE; SWP:Q8UD21; PDB:2I7GA; GELGLYTFADVNPNPADGRGPEGARRLRELLEEIELADQVGLDVFGLGEHHRPDYVVSSP --------------1111-------------------1111----------1111---33 
STVLAAAAVKTKNIRLTSAVSVLSSDDPVRVFQQFSTVDLLSNGRAEIAGRGSFIESYPL 33----1111-----------3333-----------------------------3333-- FGYDLEDYDVLFAEKLDLLLALREQEVVTWSGTKHPAINGRGVYPRPLQERLPVWIAVGG ---3333----------------------------------------------------- TPQSVARAGAGLPVALAIIGGEYRRFAPLFDLYHEAARRAGQEKTKLRTSINVHGFIADT 3333-----------------3333-----------------1111-------------- TDKAADQFYGPQAEVNRIGRERGWGPTNRAHFDAARGPEGNLFLGEPELVAEKIIKAHGV ------------------------------------1111-------------------- FKNDRFLLQAIGLPHDQIRGIELYGTKVAPLVRKELT -------------3333-------------------- >NITROREDUCTASE-LIKE FAMIL; SWP:Q81HL8; PDB:2I7HA; ATTYTSIANVIKERRSVRTFTDKAVEKDLLIELLNDATWAPNHKHREPWNCKLYIGEGRK -----------------------------------------2222---------!!!!-- KLVDAVLNSFTEEERAKRGKILSDRFLSTPAQIVVYNEDPRQIQRDEDYAATCAFQNFQL ------1111-------------------------------------------------- LAWERGLGCVWKSGGLNYNPLFIEGIGLTRGQRIVGILHIGYFDKAPEGKARTPITEKEI --1111-------3333------1111--------------------------3333--- IEG --- >PANTOTHENATE KINASE 1; SWP:Q8TE04; PDB:2I7NA; FPWFGMDIGGTLVKLVYFEPKDLKSIRKYLTSNTAYGKTGIRDVHLELKNLTMRKGNLHF -----------------------3333-3333----------3333-------------- IRFPSCAMHRFIQMGCATGGGAFKFEEDFLHKLDELDCLIQGLLYVDSVGFNGKPECYYF ---3333-----------------3333-------------------------------- ENPTNPELCQKKPYCLDNPYPMLLVNMGSGVSILAVYSKDNYKRVTGTSLGGGTFLGLCC -1111--------------------------------1111------------------- LLTGCETFEEALEMAAKGDSTNVDKLVKDIYGGDYERFGLQGSAVASSFGNMMSKEKRDS --------------11113333---3333-----3333--1111----1111--3333-- ISKEDLARATLVTITNNIGSIARMCALNENIDRVVFVGNFLRINMVSMKLLAYAMDFWSK --------------------------------------1111--------------1111 GQLKALFLEHEGYFGAVGALLELFK -------1111---------3333- >PANTOTHENATE KINASE 3; SWP:Q9H999; PDB:2I7PA; VPRGSPWFGMDIGGTLVKLSYFEPIDITAEEEQEEVESLKSIRKYLTSNTGIRDVHLELK ------------------------------------------------------1111-- DLTLFGRRGNLHFIRFPTQDLPTFIQMGRDKNFSTLQTVLCATGGGAYKFEKDFRTIGNL ---iiii---------3333--------11111111-------1111-----3333---- 
HLHKLDELDCLVKGLLYIDSVQAECYYFANASEPERCQKMPFNLDDPYPLLVVNIGSGVS ----------------3333---------3333--------------------------- ILAVHSKDNYKRVTGTSLGGGTFLGLCSLLTGCESFEEALEMASKGDSTQADKLVRDIYG ------------------3333-------------------3333-3333---3333--- GDYERFGLPGWAVASSFGNMIYKEKRESVSKEDLARATLVTITNNIGSVARMCAVNEKIN --------1111--2222------------------------------------------ RVVFVGNFLRVNTLSMKLLAYALDYWSKGQLKALFLEHEGYFGAVGALLGLPN ------1111----------------iiii-----1111--------1111-- >CHOLINE KINASE ALPHA; SWP:P35790; PDB:2I7QA; EQPEPRTRRRAYLWCKEFLPGAWRGLREDEFHISVIMLFQCSLPDTTATLGDEPRKVLLR -------------------!!!!---3333-------------1111------------- LYEAMVLESVMFAILAERSLGPKLYGIFPQGRLEQFIPSRRLDTEELSLPDISAEIAEKM ---------------1111--------1111-----------3333-------------- ATFHGMKMPFNKEPKWLFGTMEKYLKEVLRIKFTEESRIKKLHKLLSYNLPLELENLRSL --1111--------------------3333------------------------------ LESTPSPVVFCHNDCQEGNILLLEGRENSEKQKLMLIDFEYSSYNYRGFDIGNHFCEWMY ---------------3333---2222------------1111---3333-------1111 DYSYEKYPFFRANIRKYPTKKQQLHFISSYLPAFQNDFENLSTEEKSIIKEEMLLEVNRF ------------3333------------------1111---------------------- ALASHFLWGLWSIVQAKISSIEFGYMDYAQARFDAYFHQKRKLG -------------------------------------------- >CONSERVED DOMAIN PROTEIN; SWP:Q97RR3; PDB:2I7RA; NLNQLDIIVSNVPQVCADLEHILDKKADYANDGFAQFTIGSHCLLSQNHLVPLENFQSGI --------------------------------------!!!!------------------ IIHIEVEDVDQNYKRLNELGIKVLHGPTVTDWGTESLLVQGPAGLVLDFYRK ----------------3333---------1111--------iiii------- >Cleavage and polyadenylat; SWP:CPSF3_HUMAN; PDB:2I7VA; EESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDL -------------------------iiii--------1111!!!!--------1111--- LLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKLYTETDLEESMDKI ------11111111----------------------1111----------------1111 ETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN -----------------------2222------iiii----------------------- IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDE 
---------1111---------------------1111----------3333-------- YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNANNPFVFKHISNLKSMDHFDDIGPSVVM ----3333-----------------11113333-1111--------3333---------- ASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKM --1111-----------1111-----------2222-------------3333------- SVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVH ------------------------------------------------------------ NPRNTEAVTLNFR --2222------- >PROTEIN CFT2; SWP:CFT2_YEAST; PDB:2I7XA; MTYKYNCCDDGSGTTVGSVVRFDNVTLLIDPGWNPSKVSYEQCIKYWEKVIPEIDVIILS ---------------------------------3333---------11111111------ QPTIECLGAHSLLYYNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLD --11111111-------------------------------------------1111--3 LEDIEISFDHIVPLKYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRW 333----1111---2222-------iiii---------2222------------------ NHTRDNILNAASILDATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSS ----------------------------------------------------------!! DGSVIIPVDMSGKFLDLFTQVHELLFESQVPVLILSYARGRTLTYAKSMLEWLSPSLLKT !!------1111-3333------------------1111---------3333-------- WENRNNTSPFEIGSRIKIIAPNELSKYPGSKICFVSEVGALINEVIIKVGNSEKTTLILT -----------!!!!----11113333---------------------1111-------- KPSFECASSLDKILEIVEQDEDGKSFLCDNYISIDTIKEEPLSKEETNFDNLDYLKIDKT ---1111---------1111-------------------------------3333----- LSKRTISTVNVQLKCSVVILNLQSLVDQRSASIIWPSLKSRKIVLSAPKQIQNEEITAKL --------------------------3333---3333----------3333--------- IKKNIEVVNMPLNKIVEFS ------------------- >2-CYS PEROXIREDOXIN; SWP:NA; PDB:2I81A; PTYVGKEAPFFKAEAVFGDNSFGEVNLTQFIGKKYVLLYFYPLDFTFVCPSEIIALDKAL --2222----------1111-----33332222--------------------------- DAFHERNVELLGCSVDSKYTHLAWKKTPLAKGGIGNIKHTLLSDITKSISKDYNVLFDDS ---1111--------------------3333------------1111---------%%%% VSLRAFVLIDMNGIVQHLLVNNLAIGRSVDEILRIIDAIQHHEKYGDVCPANWQKGKVSM ---------1111--------1111------------------------2222------- KPSEEGVAQYLST --------1111- >RIBOSOMAL LARGE SUBUNIT P; SWP:P0AA37; PDB:2I82A; 
ENYNPPQEPWLVILYQDDHIVVNKPSGLLSVPGRLEEHKDSVTRIQRDYPQAESVHRLDA ------------------------2222------3333-------1111----------- TSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQK --------------------1111---------------------------3333----- VCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHLALGHPILGDRFYASPEAR ------------------1111-----------222233333333--2222----3333- AAPRLLLHAELTITHPAYGNSTFKAPADF ----------------------------- >D-ALANINE-D-ALANINE LIGAS; SWP:Q5HEB7; PDB:2I87A; KENICIVFGGKSAEHEVSILTAQNVLNAIDKDKYHVDIIYITNDGDWRKQNNITAEIKST -------------------------11111111--------1111-------------33 DELHLENGEALEISQLLKESSSGQPYDAVFPLLHGPNGEDGTIQGLFEVLDVPYVGNGVL 33-!!!!------3333--1111------------------------------------- SAASSMDKLVMKQLFEHRGLPQLPYISFLRSEYEKYEHNILKLVNDKLNYPVFVKPANLG ---------------1111----------------------------------------- SSVGISKCNNEAELKEGIKEAFQFDRKLVIEQGVNAREIEVAVLGNDYPEATWPGEVVKD -2222---------------3333------------------------------------ VAFVQLQIPADLDEDVQLTLRNMALEAFKATDCSGLVRADFFVTEDNQIYINETNAMPGF ----------------------------1111-----------1111------------- TAFSMYPKLWENMGLSYPELITKLIELAKERHQDKQKNKYKIDRS 1111------1111---------------------------1111 >UNCHARACTERIZED CONSERVED; SWP:Q035U5; PDB:2I8DA; GSLAEWYQRIPTPDDLTRVESLFANQAQFPQLKLEFKWNQPFTDHGTFIGFNPSKKHLAV ------1111---------------3333-------%%%%---%%%%------1111--- AIEPQTTRFIPQIDKAGYDHSQIIRFPWHKPLDEQLIHDLIAYTIDQKKDATTFWQR --3333--------------------1111---------------1111-------- >HYPOTHETICAL PROTEIN DIP2; SWP:Q6NEK4; PDB:2I8GA; VHDSALPFDALPPPQGREGFEECPYLDSQWVADTNGQRTGQGVDTRFDTPACVFWSYPEA -%%%%-1111--1111-------------------------------------------- PQATVVRHPSEEEAIRVVDWAAPIDTTEPAEEPDGWSGGRAGHEEGAVYAVQKGPVAVVV ----------------------1111------2222------1111------!!!!---- WSNQQQSLKAELAKEAIARLGL ---------------------- >HYDROGENASE 3 MATURATION ; SWP:HYCI_ECOLI; PDB:2I8LA; MTDVLLCVGNSMMGDDGAGPLLAEKCAAAPKGNWVVIDGGSAPENDIVAIRELRPTRLLI ------------------------------%%%%----%%%%3333----1111------ 
VDATDMGLNPGEIRIIDPDDIAEMFMMTTHNMPLNYLIDQLKEDIGEVIFLGIQPDIVGF --------2222----3333---------------------------------------- YYPMTQPIKDAVETVYQRLEGWEGNGGFAQLAVEEE -----------------1111---%%%%---%%%%- >MU-CRYSTALLIN HOMOLOG; SWP:Q5HYB7; PDB:2I99A; SRVPAFLSAAEVEEHLRSSSLLIPPLETALANFSSGPEGGVMQPVRTVVPVTKHRGYLGV -----------------3333--------------3333-----------1111------ MPAYSAAEDALTTKLVTFYEDRGITSVVPSHQATVLLFEPSNGTLLAVMDGNVITAKRTA -----1111-------------3333---------------------------------- AVSAIATKFLKPPSSEVLCILGAGVQAYSHYEIFTEQFSFKEVRIWNRTKENAEKFADTV -----------------------3333---------------------3333-------- QGEVRVCSSVQEAVAGADVIITVTLATEPILFGEWVKPGAHINAVGASRPDWRELDDELM --------3333-2222--------------3333-2222-------------------- KEAVLYVDSQEAALKESGDVLLSGAEIFAELGEVIKGVKPAHCEKTTVFKSLGMAVEDTV -----------------3333--------3333--------1111--------3333--- AAKLIYDSWSSG ------------ >UROKINASE-TYPE PLASMINOGE; SWP:P00749; PDB:2I9AA; NCDCLNGGTCVSNKYFSNIHWCNCPKKFGGQHCEIDKSKTCYEGNGHFYRGKASTDTMGR ----iiii----1111--------1111-1111---------!!!!---------1111- PCLPWNSATVLQQTYHAHRSDALQLGLGKHNYCRNPDNRRRPWCYVQVGLKPLVQECMVH ---11113333----1111-3333----------1111---------------------- DCA --- >HYPOTHETICAL PROTEIN RPA1; SWP:Q6N8L4; PDB:2I9CA; KLDLHQTTQDLVALFAKVTVEQDDALLGNQISRFNRLFGVAEIADELKARDGDQRTALLS -------------------------1111--------------------2222---3333 LFEYPNQVRLQAAKLTLAVAPVKAREQLEAIVSSKWFPQAGDAGCLDLLDDGTFKPK 1111---------1111--------------3333--3333---------------- >CHLORAMPHENICOL ACETYLTRA; SWP:Q8A336; PDB:2I9DA; KQIIDIENWERKENFNFFRHFQNPQLSITSEVECGGARQRAKAAGQSFFLHYLYAVLRAA ----33331111-----1111-------------------------3333---------- NEIPEFRYRIDPDGRVVLYDTIDLSPIKIKENGKFFTTRFPYHNDFDTFYQEARLIIDAI ----------1111---------------1111-----------------------1111 PEDGDPYAAENEEVADGDYGLILLSATPDLYFTSITGTQEKRSGNNYPLLNAGKAIIREG ----1111------------------1111----------1111-------------iii RLVPIATIHHGFIDGHHLSLFYKKVEDFLK i-------3333-3333------------- >TRIOSEPHOSPHATE ISOMERASE; SWP:Q8MPF2; 
PDB:2I9EA; ARKFVVGGNWKMNGDKKQINEIIGFLKSGPLNQDTEVVVGVPAIYLELVRTCVPASIGVA --------------3333-------------1111------3333--------3333--- AQNCYKVPKGAFTGEISPAMIKDVGADWVILGHSERRQIFGESDELIAEKVCHALESGLK -----------2222------1111-----------------------------1111-- VIACIGETLEEREAGKTEEVVFRQTKAIAAKVNDWSNVVIAYEPVWAIGTGKTATPQQAQ ----------------------------1111--1111-----3333------------- DVHKALRQWICENIDAKVGNSIRIQYGGSVTAANCKELASQPDIDGFLVGGASLKPEFVD ------------------------------3333------1111-----3333------- IINARQ 1111-- >HYPOTHETICAL PROTEIN; SWP:O25234; PDB:2I9IA; ERMKTSSEHVTPLDFNYPIHIVQAPQNHHVVGILTPRIQVSDNLKPYIDKFQDALINQIQ ----------------------------------------33331111------------ TIFEKRGYQVLRFQDEKALNAQDKRKIFSVLDLKGWVGILEDLKMNLKDPNNPNLDTLVD --------------3333------------------------------1111-------- QSSGSVWFNFYEPESNRVVHDFAVEVGTFQAMTYTYKHNNSGGLNSSNSIIHEYLEKNKE --------------------------------------1111--3333--3333------ DAIHKILNRMYAVVMKKAVTELTKENIDKYREAIDRMKGFK ----------------------------------------- >CYTOSINE/GUANINE DEAMINAS; SWP:Q97MB6; PDB:2I9UA; NLKIFKGNLIFTKTSDKFTIMKDSYIVVIDGKIASVSSNLPDKYKGNPIIDFRNNIIIPG ----------------------------iiii--------3333-------!!!!----- MNDLHAHASQYKNLGIGMDKELLPWLNNYTFPEEAKFLNVDYAKKTYGRLIKDLIKNGTT ------11111111---------------------------------------------- RVALFATLHKDSTIELFNMLIKSGIGAYVGKVNMDYNCPDYLTENYITSLNDTEEIILKY --------------------------------------1111------------------ KDKSNIVKPIITPRFVPSCSNELMDGLGKLSYKYRLPVQSHLSENLDEIAVVKSLHKKSN --------------3333-------------------------------------1111- FYGEVYDKFGLFGNTPTLMAHCIHSSKEEINLIKRNNVTIVHCPTSNFNLGSGMMPVRKY -----3333--------------------------------------1111--------- LNLGINVVLGSDISAGHTCSLFKVIAYAIQNSKIKWQESGKKDMFLSTSEAFYMATKKGG 1111----------------------------------%%%%-----------------3 SFFGKVGSFEEGYDFDALVINDSNLYPEDYDLTERLERFIYLGDDRNIMKRYVCGNEIF 333------2222--------11113333--------------3333-----iiii--- >HYPOTHETICAL PROTEIN; SWP:Q4FPZ7; PDB:2I9WA; 
LFSIQTCPCQINPALNAVSTPLLYQDCCQPYHDGLYNQAIRADTAEHLRTRYSAFVLVKP 3333--3333-1111-------3333-3333------------3333------------- EYIVKTTLPAQQDLLDIKAIENWAKETDWAGLEVVAHTPKLSKRHAQVEFKAYFKTPDGL ---111133331111----------------------------------------3333- QAHHELSTFVKIKNKANSDASWYFLDPTVSSVTQKQPCICGSGEKFKRCCGYI --------------------------------1111-1111---3333----- >PUTATIVE SEPTATION PROTEI; SWP:SP5G_STAES; PDB:2I9XA; AKVTDVRLRKIQTDGRKALVSITLDEAFVIHDLRVIEGNSGLFVAPSKRTPDGEFRDIAH -----------------------%%%%----------1111--------1111------- PINSDRQEIQDAVKVYDETD --------------3333-- >major latex protein-like ; SWP:Q9SSK9; PDB:2I9YA; TEASSLVGKLETDVEIKASADKFHHMFAGKPHHVSKASPGNIQGCDLHEGDWGTVGSIVF --1111-----------------------------------------------2222--- WNYVHDGEAKVAKERIEAVEPDKNLITFRVIEGDLMKEYKSFLLTIQVTPKPGGPGSIVH ----iiii-----------------------3333-----------------!!!!---- WHLEYEKISEEVAHPETLLQFCVEVSKEIDEHLLAEE --------3333-3333-------------------- >Putative HTH-type transcr; SWP:Q8U2H1; PDB:2IA0A; HLDDLDRNILRLLKKDARLTISELSEQLKKPESTIHFRIKKLQERGVIERYTIILGEQLK ---------------1111-----------3333---------------------3333- PKHLALIVLEVGKPEDFLERYISYISSTLSALPGVLFVAKSGEDKIIALVGKNNKDELVK -------------------------------2222------------------------- FIEENITSIPNLKHIQIFPITEIKKGEDLTGFLAEV -----1111----------------1111------- >BH3703 PROTEIN; SWP:Q9K6M5; PDB:2IA1A; SLEKQIESYYQEIAQLIIDMIPEEWAEVRFYAQEDHDGWKIFFFHYLSASSDEWTKDIDI -----------------3333-------------1111-----------------33333 RDVIKVPQDEFMEKYNELSFCISDFRKDYAEAFGEPWMSFQMTFYASGKFNIDFYYDKNP 333---3333----------------------------------3333------------ FDTFLTRLAWQYEHFGTIPDSFYKETLNEYLEEKAQGKRYPFLEPLHHH ---------------------------------------3333------ >PUTATIVE TRANSCRIPTIONAL ; SWP:Q2F1F8; PDB:2IA2A; YVQSLARGLAVIRCFDHRNQRRTLSDVARATDLTRATARRFLLTLVELGYVATDGSAFWL ---------------1111---------1111---------------------------- TPRVLELGYSYLSSLSLPEVAQPHLEKLSHKVHESSSVSILDGADIVYVARVPVSRITVG 3333---333311113333----------------------!!!!--------------- 
ITIGTRLPAYATSGRVLLAGLPDDELDAYLEKLDIQRLTERTITARDELKAAILAVRADG -2222--3333------1111-----------------1111------------------ ICVLDQELEAGLRSAAPIRGASGLTVAAVNISTPAARYSLEDLHSDLIPSLRVTATDIEQ --------2222-------1111----------3333-3333------------------ DLATVNR ------- >TAIL LYSOZYME, PUTATIVE; SWP:Q74EH6; PDB:2IA7A; AVLSSAEEDIAESIRIILGTARGERVRPDFGCGIHDRVFSVINTTTLGLIENEVKEALIL --------------------2222--1111-3333-------3333-------------- WEPRIELLSVTASPREAAEGRLLIDIEYRVRSTNTRFNLVYPFYLKESA -1111--------1111-------------------------------- ------------------------------------------------------------ ------------------------------ >Azurin; SWP:P00281; PDB:2IAAC; ACDVSIEGNDSMQFNTKSIVVDKTCKEFTINLKHTGKLPKAAMGHNVVVSKKSDESAVAT --------1111---------3333-------------3333--------3333------ DGMKAGLNNDYVKAGDERVIAHTSVIGGGETDSVTFDVSKLKEGEDYAFFCSFPGHWSIM -33333333---2222----------2222-----------2222--------------- KGTIELGS -------- >HYPOTHETICAL PROTEIN; SWP:Q825J7; PDB:2IABA; TTPPARTAKQRIQDTLNRLELDVDAWVSTAGADGGAPYLVPLSYLWDGETFLVATPAASP ------------------------------1111---------------------1111- TGRNLSETGRVRLGIGPTRDLVLVEGTALPLEPAGLPDGVGDTFAEKTGFDPRRLTTSYL ----------------2222-----------1111-2222----------3333------ YFRISPRRVQAWREANELSGRELRDGEWLVTD -------------33332222--iiii----- >PTS SYSTEM, IIA COMPONENT; SWP:Q838I6; PDB:2IACA; KPKLILSHGRAEETLASTQIVGELADAAIVSTAEDGLSGTQAKLAAILKEAGNVPTLVLA ---------------------1111------1111------------------------- DLKGGTPCNVAAGTYPQLRVVAGLNLAAIEAAVSPVENVDELAAYLTQIGQSAVTTIDLP -2222---------1111--------------------------------3333------ ELT --- >MHC CLASS II I-AD; SWP:P04228; PDB:2IADA; IEADHVGFYGTTVYQSPGDIGQYTHEFDGDELFYVDLDKKKTVWRLPEFGQLILFEPQGG ---------------2222-------iiii---------------3333----------- LQNIAAEKHNLGILTKRSNFTPATNEAPQATVFPKSPVLLGQPNTLICFVDNIFPPVINI ---------------1111-------------------2222------------------ TWLRNSKSVTDGVYETSFLVNRDHSFHKLSYLTFIPSDDDIYDCKVEHWGLEEPVLKHWS ---%%%%-------------1111------------1111-------1111--------- SADLVPR 
------- >HYPOTHETICAL PROTEIN SDHL; SWP:Q5WYB1; PDB:2IAFA; SSHTVGPLAANAFLQLLEQKNLFDKTQRVKVELYGSLALTGKGHGTDKAILNGLENKAPE ---3333---------------3333-------------------------3333----- SIPRHEILDSNLLNLAGKKEIPFHEATDFLFLQKELLPKHSNGRFSAFDGNANLLIEQVY -------3333--2222------3333----1111-3333-------------------- YSIGGGFITTEEDFDK ---------3333--- >PROSTACYCLIN SYNTHASE; SWP:Q16647; PDB:2IAGA; RTRRPGEPPLDLGSIPWLGYALDFGKDAASFLTRMKEKHGDIFTILVGGRYVTVLLDPHS ---2222----------!!!!------3333---------------iiii------3333 YDAVVWEPRTRLDFHAYAIFLMERIFDVQLPHYSPSDEKARMKLTLLHRELQALTEAMYT 1111---1111----------------------3333----3333--------------- NLHAVLLGDATEAGSGWHEMGLLDFSYSFLLRAGYLTLYGIEALPRTHESQAQDRVHSAD ------------------------------------------------------------ VFHTFRQLDRLLPKLARGSLSVGDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLH -------3333----------------------------33331111----3333----- LEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLLKNPEALAAVRGELESILWQTTL -1111-3333-------------------------------------------------- PQKVLDSTPVLDSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLS 3333----------------------------------1111-----2222----3333- PQRDPEIYTDPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNS ---1111--1111-1111--1111-------iiii------1111!!!!----------- IKQFVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRPH -----------------1111-----3333--------------------- >PUTATIVE TRANSCRIPTIONAL ; SWP:Q9XA31_STRCO; PDB:2IAIA; GTAKTPETLLSVAVQVFIERGYDGTSEHLSKAAGISKSSIYHHVTGKEELLRRAVSRALD ---------------------1111----------3333--------------------- ELFGILDEEHARVGTAAERLEYVVRRVEVLAELPYVTLLLRVRGNTGTERWALERRREFD --3333-1111---------------3333--------1111------------------ HRVAALLKDAAAEGDVRADVEVRLATRLVFGINSIVEWYRPESGVSGAGEREVVDAVARL ----------1111------------------3333------------------------ VFGGLRK ------- >BULLOUS PEMPHIGOID ANTIGE; SWP:Q91ZU8; PDB:2IAKA; QIRKPLLKSSLLDQNLTEEEVNMKFVQDLLNWVDEMQVQLDRTEWGSDLPSVESHLENHK ------3333-------------------------------------------------- 
NVHRAIEEFESSLKEAKISEIQMTAPLKLSYTDKLHRLESQYAKLLNTSRNQERHLDTLH ------------------3333-------------------------------------- NFVTRATNELIWLNEKEESEVAYHAELMRELEQKEESIKAVQEIAEQLLLENHPARLTIE --------------3333--------3333------------------------------ AYRAAMQTQWSWILQLCQ ------------------ >HYPOTHETICAL PROTEIN; SWP:Q88V95; PDB:2IAYA; GAYTTTVKLDGDTKTYTLSPTVKKYTLDLGFVKGRSGAFSFERSLDPTSPYQAAFKLKTV --------2222------11113333-------3333-----------2222-------- NADLTGFKTTVTGNGVQRANIFKNDAHPEAVEQLRYILANFIERDILTTD 1111-------1111------2222----------------1111----- >HYPOTHETICAL PROTEIN SP13; SWP:Q97Q59; PDB:2IAZA; GSNIYDSANELSRGLRGLPEYKAVKAAKDAIAADAEASKIFTDYLAFQEEIQKLAPDASF -----------------------------------------------3333--------- QAKEGFGKQIQGNSLLSEFFTKQQQLAIYLSDIEKIVFEPVSELL -------------------------------------33331111 >CONSERVED HYPOTHETICAL AL; SWP:O05815; PDB:2IB0A; SEGSADNAALCDALAVEHATIYGYGIVSALSPPGVNFLVADALKQHRHRRDDVIVMLSAR -------------------------------3333----------------------111 GVTAPIAAAGYQLPMQVSSAADAARLAVRMENDGATAWRAVVEHAETADDRVFASTALTE 1-----------------3333-------------------------------------- SAVMATRWNRVLGAWPITAAFP -----------1111------- >CHROMO PROTEIN; SWP:A0AQQ8; PDB:2IB5A; ISDNVRIKLYEGTVNNHHFCEAEGEGKPYEGTQENIKVTKGGPLPFSFDILTPNCSVAIT -------------%%%%----------1111---------------33331111-1111- KYTSGIPDYFKQSFPEGFTWERTTIYEDGAYLTTQQETKLDGNCLVYNIKILGCNFPPNG --iiii-3333--------------1111---------------------------1111 PVQKKTQGWEPCCERYTRDGVLCGQTLALKCADGNHLTCHLRTTYRSKKAAKALQPPFHF -----------------iiii---------1111---------------3333------- SDHRPEIVKVSENGTLFEQHESSVARYCQTCPSKLGHN -----------%%%%----------------------- >URICASE; SWP:Q068V7; PDB:2IBAA; SAVKAARYGKDNVRVYKVHKDEKTGVQTVYEMTVCVLLEGEIETSYTKADNSVIVATDSI -----------------------------------------3333-----1111-3333- KNTIYITAKQNPVTPPELFGSILGTHFIEKYNHIHAAHVNIVCHRWTRMDIDGKPHPHSF --------------3333------------1111----------------iiii------ IRDSEEKRNVQVDVVEGKGIDIKSSLSGLTVLKSTNSQFWGFLRDEYTTLKETWDRILST 
--------------2222--------------------------1111------------ DVDATWQWKNFSGLQEVRSHVPKFDATWATAREVTLKTFAEDNSASVQATMYKMAEQILA -------------------3333------------------------------------- RQQLIETVEYSLPNKHYFEIDLSWHKGLQNTGKNAEVFAPQSDPNGLIKCTVGRS -1111----------------3333-----!!!!--------------------- >POSSIBLE TRANSCRIPTIONAL ; SWP:Q0S7V2; PDB:2IBDA; GKSGRRTELLDIAATLFAERGLRATTVRDIADAAGILSGSLYHHFDSKESVDEILRGFLD ---------------------1111------1111-33333333--3333---------- DLFGKYREIVASGLDSRATLEALVTTSYEAIDASHSAVAIYQDEVKHLVANERFTYLSEL --------------------------------------------1111--3333------ NTEFRELWGVLEAGVKDGSFRSDIDVELAFRFLRDTAWVAVRWYRPGGSVTVDTVAKQYL ---------------------------------------3333-2222------------ SIVLDGLASP ---------- >Protein hedgehog [Precurs; SWP:Q02936; PDB:2IBGE; YPLVLKQTIPNLSEYTNSASGPLEGVIRRDSPKFKDLVPNYNRDILFRDRLMSKRCKEKL ---2222-----1111-----------33333333------1111--------------- NVLAYSVMNEWPGIRLLVTESWDEDYHHGQESLHYEGRAVTIATSDRDQSKYGMLARLAV -------------------------------3333--------11113333--------- EAGFDWVSYVSRRHIYCSVKSD ----------3333-------- >CYTOCHROME B5; SWP:P49096; PDB:2IBJA; SEDVKYFTRAEVAKNNTKDKNWFIIHNNVYDVTAFLNEHPGGEEVLIEQAGKDATEHFED -------3333-----1111----iiii---1111---111133331111--------33 VGHSSDAREMMKQYKVGELVAEERSN 33-------3333------3333--- >FIBRITIN; SWP:Q76VI8; PDB:2IBLA; DIVLNDLPFVDGPPAEGQSRISWIKNGEEILGADTQYGSEGSMNRPTVSVLRNVEVLDKN --------------2222------2222------------3333---------------- IGILKTSLETANSDIKTIQEAGYIPEAPRDGQAYVRKDGEWVLLSTFL -----------------1111---------------%%%%--3333-- >INOSITOL OXYGENASE; SWP:Q9UGB7; PDB:2IBNA; SDRVFTTYKLHTHQTVDFVRSKHAQFGGFSYKKTVEAVDLLDGLVDESDDFPNSFHAFQT -------------------------1111---------1111------------------ AEGIRKAHPDKDWFHLVGLLHDLGKVLALFGEPQWAVVGDTFPVGCRPQASVVFCDSTFQ -------1111--------------3333---3333------------1111-3333-11 DNPDLQDPRYSTELGYQPHCGLDRVLSWGHDEYYQVKFNKFSLPPEAFYIRFHSFYPWHT 11----3333----------3333------------1111---3333--1111------- 
GRDYQQLCSQQDLALPWVREFNKFDLLPDVDKLRPYYQGLIDKYCPGILSW ---3333-3333--------3333----3333------------------- >HYPOTHETICAL PROTEIN SP21; SWP:Q97N67; PDB:2IBOA; KASIALQVLPLVQGIDRIAVIDQVIAYLQTQEVTVVTPFETVLEGEFDELRILKEALEVA -------------3333----------3333-----1111-----3333----------- GQEADNVFANVKINVGEILSIDEKLEK --------------------------- >HEMAGGLUTININ; SWP:Q6DQ34; PDB:2IBXA; DQICIGYHANNSTEQVDTIMEKNVTVTHAQDILEKTHNGKLCDLDGVKPLILRDCSVAGW -----------------1111----------------------%%%%----!!!!----- LLGNPMCDEFINVPEWSYIVEKANPVNDLCYPGDFNDYEELKHLLSRINHFEKIQIIPKS ------3333-----------------------------------------------333 SWSSHEASLGVSSACPYQGKSSFFRNVVWLIKKNSTYPTIKRSYNNTNQEDLLVLWGIHH 31111------------------1111-----%%%%------------------------ PNDAAEQTKLYQNPTTYISVGTSTLNQRLVPRIATRSKVNGQSGRMEFFWTILKPNDAIN ---------------------1111-------------%%%%------------------ FESNGNFIAPEYAYKIVKKGDSTIMKSELEYGNCNTKCQTPMGAINSSMPFHNIHPLTIG ---------------------------------------1111----------------- ECPKYVKSNRLVLATGLRNSP --------------------- >CYSTEINE DIOXYGENASE TYPE; SWP:Q16878; PDB:2IC1A; VLKPRTLADLIRILHQLFAGDEVNVEEVQAIMEAYESDPTEWAMYAKFDQYRYTRNLVDQ -----3333----------------------------33333333--------------% GNGKFNLMILCWGEGHGSSIHDHTNSHCFLKMLQGNLKETLFAWPDKKSNEMVKKSERVL %%%---------2222------%%%%---------------------------------- RENQCAYINDSIGLHRVENISHTEPAVSLHLYSPPFDTCHAFDQRTGHKNKVTMTFHSKF 2222----1111----------------------------------------------ii GIRTP ii--- >CG9211-PA; SWP:Q9VM64; PDB:2IC2A; GSTYPPTPPNVTRLSVLRWVPRNDGLPIVIFKVQYRVGNWQTTNDNIPYGKPKWNSELGK -------------------------------------------------------3333- SFTASVTDLKPQHTYRFRILAVYSNNDNKESNTSAKFYLQP ----------------------1111--------------- ------------------------------------------------------------ ----------- >MALTOSE TRANSACETYLASE; SWP:Q75TD0; PDB:2IC7A; MKSEKEKMLAGHLYNPADLELVKERERARRLVRLYNETLETEYDKRTGLLKELFGSTGER -------1111---1111----------------11111111------------------ LFIEPNFRCDYGYNIHVGENFFMNFDGVILDVCEVRIGDHCFIGPGVHIYTATHPLDPHE 
-----------1111--------------------------------------------- RNSGLEYGKPVVIGHNVWIGGRAVINPGVTIGDNAVIASGAVVTKDVPANAVVGGNPAKV 3333---------------2222--2222--------2222------2222--------- IKWLK ----- >V-set and immunoglobulin ; SWP:Q9Y279; PDB:2ICCA; GRPILEVPESVTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGSDPVTIFLRDSSGDHIQ -------------2222----------2222---------------------1111---- QAKYQGRLHVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTPDGNQVVRDKITELRVQK 3333---------2222--------1111-----------1111--------------- >LIN2918 PROTEIN; SWP:Q926X2; PDB:2ICGA; GASSFLEEVDRLITLSGITFHASGTGTPELIKIYQDALGNEFPETYKLFLEKYGTLTFNG -1111----------------------------------------------------iii VSFYGISKRGLSAASIPDVKFATEQARTFGDINKEIIKNSGYGSIFSIDTSIIGSEGEPV i-----1111------------------------------2222----1111-------- IVETNLSFKDNTEKKVVANSFGEFLLEEIELSLTDL ----1111---------------------------- >PUTATIVE ATTH; SWP:Q82US3; PDB:2ICHA; LAPVVPGKALEFPQDFGAHNDFRIEWWYVTGWLETPTGKPLGFQITFFRTASHFAPDQLI ------------1111--1111------------1111---------------------- IAHVALSDPAIGKLQHDQKIARAGFDLAYARTGNTDVKLDDWIFVRETDGRYRTRIEAED -------3333--------------3333---------!!!!----1111-------111 FTLTFILTPSQPLLQGENGFSRKGPGAPQASYYYSEPHLQVSGIINRQGEDIPVTGTAWL 1---------------iiii-----1111-----------------%%%%---------- DREWSSEYLDPNAAGWDWISANLDDGSALAFQIRGKDDSKIWAYAALRDASGHTRLFTPD ----------------------1111--------1111----------1111-----333 QVSFHPIRTWRSARTQAVYPVATRVLTGETEWQITPLDDQELDSRASAGAVYWEGAVTFT 3-------------------------!!!!------------------------------ RDGQPAGRGYELTGYV ---------------- >ISOPENTENYL-DIPHOSPHATE D; SWP:Q13907; PDB:2ICJA; LDKQQVQLLAEMCILIDENDNKIGAETKKNCHLNENIEKGLLHRAFSVFLFNTENKLLLQ ----------------1111------------33331111-----------1111----- QRSDAKITFPGCFTNTCCSHPLSNPAELEESDALGVRRAAQRRLKAELGIPLEEVPPEEI --------2222-----------3333---%%%%----------------3333-3333- NYLTRIHYKAQSDGIWGEHEIDYILLVRKNVTLNPDPNEIKSYCYVSKEELKELLKKAAS -----------------------------------3333------------------111 GEIKITPWFKIIAATFLFKWWDNLNHLNQFVDHEKIYRM 
1---------------------11113333--------- >ADENINE DEAMINASE; SWP:Q837K0; PDB:2ICSA; DYDLLIKNGQTVNGPVEIAIKEKKIAAVAATISGSAKETIHLEPGTYVSAGWIDDHVHCF ----------1111------%%%%------------------2222-------------- EKALYYDYPDEIGVKKGVTTVIDAGTTGAENIHEFYDLAQQAKTNVFGLVNISKWGIVAQ -------3333-1111-----------3333------3333-----------1111---- DELADLSKVQASLVKKAIQELPDFVVGIARSRTVIGDNGITPLELAKQIQQENQEIPLVH ----------------------------------!!!!-3333---------%%%%---- IGSAPPHLDEILALEKGDVLTHCFNGKENGILDQATDKIKDFAWQAYNKGVVFDIGHGTD ------3333----2222--------1111------------------------------ SFNFHVAETALREGKAASISTDIYIRNRENGPVYDLATTEKLRVVGYDWPEIIEKVTKAP ----------1111--------------------3333---------------------- AENFHLTQKGTLEIGKDADLTIFTIQAEEKTLTDSNGLTRVAKEQIRPIKTIIGGQIYDN -----1111---2222-----------------1111---------------iiii---- >ANTITOXIN HIGA; SWP:P67699; PDB:2ICTA; KANHPRPGDIIQESLDELNVSLREFARAEIAPSTASRLLTGKAALTPEAIKLSVVIGSSP ------------------------------------------------------------ QWLNLQNAWSLAEAEKTVDVSRLRRLVTQ ------------3333---1111------ >HYPOTHETICAL PROTEIN YEDK; SWP:P76318; PDB:2ICUA; GRFAQSQTREDYLALLAEDIERDIPYDPEPIGRYNVAPGTKVLLLSERDEHLHLDPVFWG -------3333-1111-3333--------------------------%%%%--------- YAPGPPLINARVETAATSRFKPLWQHGRAICFADGWFEWKQPFFIYRADGQPIFAAIGST ----------33331111----------------------------1111---------- PFERGDEAEGFLIVTAAADQGLVDIHDRRPLVLSPEAAREWRQEISGKEASEIAASGCVP 1111--------------!!!!-------------------1111--------------3 ANQFSWHPVSRAVGNVKNQGAELIQPV 333-----------3333---1111-- >Superantigen [Precursor]; SWP:Q48898; PDB:2ICWG; MKLRVENPKKAQKHFVQNLNNVVFTNKELEDIYNLSNKEETKEVLKLFKLKVNQFYRHAF ---------------1111-------------1111-3333-3333-------------- GIVNDYNGLLEYKEIFNMMFLKLSVVFDTQRKEANNVEQIKRNIAILDEIMAKADNDLSY -----3333-----------------------1111------------------------ FISQNKNFQELWDKAVKLTKEMKIKLKGQKLDLRDGEVAINKVRELFGSDKNVKELWWFR -------------------3333-----------------------11113333------ SLLVKGVYLIKRYYEGDIELKTTSDFAKAVFED ------------11113333---3333------ 
>T-cell receptor beta chai; SWP:TVB5_MOUSE; PDB:2ICWJ; EAAVTQSPRNKVAVTGEKVTLSCNQTNNHNNMYWYRQDTGHELRLIHYSYGAGSTEKGDI -------------2222--------------------2222---------2222------ PDGYKASRPSQENFSLILESATPSQTSVYFCASGGGGTLYFGAGTRLSVLSSA 2222-----1111--------3333---------------------------- >Probable UTP-glucose-1-ph; SWP:Q9M9P3; PDB:2ICYA; TENLPQLKSAVDGLTEMSESEKSGFISLVSRYLIEWSKIQTPTDEIVVPYEKMTPVSQDV -------------1111-----------------1111----3333--3333-------- AETKNLLDKLVVLKLNGGLGTTMGCTGPKSVIEVRDGLTFLDLIVIQIENLNNKYGCKVP ------1111----------1111---3333---iiii---------------------- LVLMNSFNTHDDTHKIVEKYTNSNVDIHTFNQSKYPRVVADEFVPWPSKGKTDKEGWYPP -----1111------33331111----------------1111-3333----1111---- GHGDVFPALMNSGKLDTFLSQGKEYVFVANSDNLGAIVDLTILKHLIQNKNEYCMEVTPK ------------------1111-------1111-------------1111---------- TLADVKGGTLISYEGKVQLLEIAQVPDEHVNEFKSIEKFKIFNTNNLWVNLKAIKKLVEA 3333--------iiii----3333-----3333--------------------------- DALKMEIIPNPKEVDGVKVLQLETAAGAAIRFFDNAIGVNVPRSRFLPVKASSDLLLVQS ------------------------33331111---------3333--------------- DLYTLVDGFVTRNKARTNPSNPSIELGPEFKKVATFLSRFKSIPSIVELDSLKVSGDVWF -----iiii---3333----------3333------1111-----1111----------- GSSIVLKGKVTVAAKSGVKLEIPDRAVVENKNINGP --------------2222----2222---------- >EXORIBONUCLEASE 2; SWP:P30850; PDB:2ID0A; NPLLAQLKQQLHSQTPRAEGVVKATEKGFGFLEVDAQKSYFIPPPQKKVHGDRIIAVIHS ------------------------------------------3333-------------- EKERESAEPEELVEPFLTRFVGKVQGKNDRLAIVPDHPLLKDAIPCRAARGLNHEFKEGD ------------------------------------1111-------------------- WAVAERRHPLKGDRSFYAELTQYITFGDDHFVPWWVTLARHNLEKEAPDGVATELDEGLV -------3333--------------1111------------------------------- REDLTALDFVTIDSASTEDDDALFAKALPDDKLQLIVAIADPTAWIAEGSKLDKAAKIRA ----------------------------------------1111------3333------ FTNYLPGFNIPLPRELSDDLCSLRANEVRPVLACRTLSADGTIEDNIEFFAATIESKAKL ----2222----3333-------------------------------------------- VYDQVSDWLENTGDWQPESEAIAEQVRLLAQICQRRGEWRHNHALVFKDRPDYRFILGEK 
---------------------------------------------------------111 GEVLDIVAEPRRIANRIVEEAIAANICAARVLRDKLGFGIYNVHGFDPANADALAALLKT 1-----------------------------3333------------3333---3333--- HGLHVDAEEVLTLDGFCKLRRELDAQPTGFLDSRIRRFQSFAEISTEPGPHFGLGLEAYA -----------3333------3333--------------------------1111----- TWTSPIRKYGDINHRLLKAVIKGRPQDEITVQAERRRLNRAERDVGDWLYARFLKDKAGT ---33333333--------------1111------------------------3333--- DTRFAAEIVDISRGGRVRLVDNGAIAFIPAPFLHAVRDELVCSQENGTVQIKGETVYKVT -----------1111-------------3333---3333-----------iiii---222 DVIDVTIAEVRETRSIIARPVA 2--------------------- >HYPOTHETICAL PROTEIN; SWP:Q7P0P8; PDB:2ID1A; EIQEISKLAIEALEDIKGKDIIELDTSKLTSLFQRIVATGDSNRQVKALANSVQVKLKEA 1111-------3333----------1111----------------------------111 GVDIVGSEGHESGEWVLVDAGDVVVHVLPAVRDYYDIEALWGGQKPSFAVGAAKPWS 1------------------!!!!----3333-----3333-------------1111 >LEUCINE RICH REPEAT NEURO; SWP:Q96FE5; PDB:2ID5A; TGCPPRCECSAQDRAVLCHRKRFVAVPEGIPTETRLLDLGKNRIKTLNQDEFASFPHLEE ---2222-----------------------1111-----------------1111----- LELNENIVSAVEPGAFNNLFNLRTLGLRSNRLKLIPLGVFTGLSNLTKLDISENKIVILL -----------22222222--------------------2222----------------2 DYMFQDLYNLKSLEVGDNDLVYISHRAFSGLNSLEQLTLEKCNLTSIPTEALSHLHGLIV 2221111---------1111-------2222----------------3333---1111-- LRLRHLNINAIRDYSFKRLYRLKVLEISHWPYLDTMTPNCLYGLNLTSLSITHCNLTAVP ------------------1111-------1111---11112222---------------3 YLAVRHLVYLRFLNLSYNPISTIEGSMLHELLRLQEIQLVGGQLAVVEPYAFRGLNYLRV 3331111--------------------3333----------------22222222----- LNVSGNQLTTLEESVFHSVGNLETLILDSNPLACDCRLLWVFRRRWRLNFNRQQPTCATP -----------1111--3333-------------3333---1111----!!!!------1 EFVQGKEFKDFPDVLLPNYFTCRRARIRDRKAQQVFVDEGHTVQFVCRADGDPPPAILWL 111---3333-----2222------------------2222------------------- SPRKHLVLTVFPDGTLEVRYAQVQDNGTYLCIAANAGGNDSMPAHLHVRS 3333------3333-------1111---------3333------------ >HYPOTHETICAL PROTEIN; SWP:Q6NA67; PDB:2IDAA; MTMGCRHVAGIRTVTPSALGCEECLKIGSPWVHLRICRTCGHVGCCDDSPHKHATRHFHA 
-------1111---------3333---------------------3333----------- TGHPIIEGYDPPEGWGWCYVDEVMFDLSDRMTPHNGPIPRYV ------------------------------------------ >3-OCTAPRENYL-4-HYDROXYBEN; SWP:P0AAB4; PDB:2IDBA; YNDLRDFLTLLEQQGELKRITLPVDPHLEITEIADRTLRAGGPALLFENPKGYSPVLCNL -------------------------------------------------2222------- FGTPKRVAGGQEDVSALREVGKLLAFLKEPEPPKGFRDLPTKRLRGAPCQQKIVSGDDVD ------------3333----------------3333-----------------------3 LNRIPITCWPEDAAPLITWGLTVTRGPHKERQNLGIYRQQLIGKNKLIRWLSHRGGALDY 333-----1111------------------------------------------3333-- QEWCAAHPGERFPVSVALGADPATILGAVTPVPDTLSEYAFAGLLRGTKTEVVKCISNDL ------------------------------------------------------------ EVPASAEIVLEGYIEQGETAPEGPYGDHTGYYNEVDSFPVFTVTHITQREDAIYHSTYTG --1111--------------------3333------------------------------ RPPDEPAVLGVALNEVFVPILQKQFPEIVDFYLPPEGCSYRLAVVTIKKQYAGHAKRVGV -----------3333------3333--------3333-------------22223333-- WSFLRQFYTKFVIVCDDDVNARDWNDVIWAITTRDPARDTVLVENTPIDYLDFASPVSGL ---1111------------1111------------3333---------1111----2222 GSKGLDATNKWPGETQREWGRPIKKDPDVVAHIDAIWDELAIF -----------------------------------3333---- >HYPOTHETICAL PROTEIN AF01; SWP:O30077; PDB:2IDGA; TIGRAKVYATLSKIFYHLFYDEAIPKDCREIIEKFGEIDFNLRSVLVRELRGSVLIKDPQ ------------------------3333----1111------333311113333----33 SLAEVYESVKDFYERYGFQASELHADHIAVELAFSKLVEREISLAQQKEEELYKIRAAQH 33---3333----1111------1111--------------------------------- RFIKAHLQPLVKNLPSAPLLNFVRDFVREDAKYLY ----------1111--------------------- >HYPOTHETICAL PROTEIN; SWP:Q97QU3; PDB:2IDLA; AMIQAVFERAEDGELRSAEITGHAESGEYGLDVVCASVSTLAINFINSIEKFAGYEPILE ---------1111----------------3333--------------------------- LNEDEGGYLMVEIPKDLPSHQREMTQLFFESFFLGMANLSENYSEFVQTRVIT ---iiii----------3333---------------------1111------- >Hot; SWP:Q71T70; PDB:2IDOB; YDWNIAAKSQEERDKVNVDLAASGVAYKERLNIPVIAEQVAREQPENLRTYFMERLRHYR ---3333----------------------------3333-----1111------------ QLSLQLPKGSDPAYQ 3333---11111111 >AMICYANIN; SWP:A1BBA1; 
PDB:2IDQA; DKATIPSESPFAAAEVADGAIVVDIAKMKYETPELHVKVGDTVTWINREAMPHNVHFVAG -----------3333-2222-----%%%%--------2222----------------222 VLGEAALKGPMMKKEQAYSLTFTEAGTYDYHCTPHPFARGKVVVE 2-----------2222---------------3333---------- >Cob(I)yrinic acid a,c-dia; SWP:Q96EY8; PDB:2IDXA; DDQVFEAVGTTDELSSAIGFALELVTEKGHTFAEELQKIQCTLQDVGSALATFKAGPILE ---------------------1111-----------------------------3333-- LEQWIDKYTSQLPPLTAFILPSGGKISSALHFCRAVCRRAERRVVPLVQMGETDANVAKF --------1111-------------------------------3333------3333--- LNRLSDYLFTLARYAAMKEGNQEKIYMK ----------------1111-------- >FLUORESCENT PROTEIN DRONP; SWP:Q5TLG6; PDB:2IE2A; SVIKPDMKIKLRMEGAVNGHPFAIEGVGLGKPFEGKQSMDLKVKEGGPLPFAYDILTTVF ----------------iiii-----------1111--------------------1111- NRVFAKYPENIVDYFKQSFPEGYSWERSMNYEDGGICNATNDITLDGDCYIYEIRFDGVN -------1111-3333--------------1111-----------!!!!----------- FPANGPVMQKRTVKWEPSTEKLYVRDGVLKGDVNMALSLEGGGHYRCDFKTTYKAKKVVQ -1111-------------------iiii----------2222------------------ LPDYHFVDHHIEIKSHDKDYSNVNLHEHAEAHS ----------------1111------------- >Serine/threonine-protein ; SWP:P67775; PDB:2IE4C; FTKELDQWIEQLNECKQLSESQVKSLCEKAKEILTKESNVQEVRCPVTVCGDVHGQFHDL -----------1111-----------------3333------------------------ MELFRIGGKSPDTNYLFMGDYVDRGYYSVETVTLLVALKVRYRERITILRGNHESRQITQ ---------------------------3333----------1111-----1111-3333- VYGFYDECLRKYGNANVWKYFTDLFDYLPLTALVDGQIFCLHGGLSPSIDTLDHIRALDR -----------------------3333------%%%%-----------------3333-- LQEVPHEGPMCDLLWSDPDDRGGWGISPRGAGYTFGQDISETFNHANGLTLVSRAHQLVM --------------------------3333--------------1111----------11 EGYNWCHDRNVVTIFSAPNYCYRCGNQAAIMELDDTLKYSFLQFDPAP 11----%%%%--------2222-----------1111----------- >ANNEXIN A5; SWP:P14668; PDB:2IE7A; ALRGTVTDFSGFDGRADAEVLRKAMKGLGTDEDSILNLLTARSNAQRQQIAEEFKTLFGR -------------------------------------1111------------------- DLVNDMKSELTGKFEKLIVALMKPSRLYDAYELKHALKGAGTDEKVLTEIIASRTPEELR -------------------11113333------3333!!!!------------------- 
AIKQAYEEEYGSNLEDDVVGDTSGYYQRMLVVLLQANRDPDTAIDDAQVELDAQALFQAG ------------------1111-------------------------------------1 ELKWGTDEEKFITILGTRSVSHLRRVFDKYMTISGFQIEETIDRETSGNLENLLLAVVKS 111---------------------------------3333-------------------- IRSIPAYLAETLYYAMKGAGTDDHTLIRVIVSRSEIDLFNIRKEFRKNFATSLYSMIKGD --------------------------------1111------------------------ TSGDYKKALLLLCGGEDD ------------------ >PYRUVATE DEHYDROGENASE E1; SWP:P0AFG8; PDB:2IEAA; ISNYINTIPVEEQPEYPGNLELERRIRSAIRWNAIMTVLRASKKDLELGGHMASFQSSAT --------3333----------------------------3333-------3333----- IYDVCFNHFFRARNEQDGGDLVYFQGHISPGVYARAFLEGRLTQEQLDNFRQEVHGNGLS -------------3333-------1111------------------1111--1111---- SYPHPKLMPEFWQFPTVSMGLGPIGAIYQAKFLKYLEHRGLKDTSKQTVYAFLGDGEMDE ---33331111--------------------------------1111------------3 PESKGAITIATREKLDNLVFVINCNLQRLDGPVTGNGKIINELEGIFEGAGWNVIKVMWG 333-------1111-------------------1111----------1111--------3 SRWDELLRKDTSGKLIQLMNETVDGDYQTFKSKDGAYVREHFFGKYPETAALVADWTDEQ 333--------------------------1111--------1111-----1111------ IWALNRGGHDPKKIYAAFKKAQETKGKATVILAHTIKGYGMGDAAMDGVRHIRDRFNVPV 1111-3333--------------------------2222-!!!!---------------- SDADIEKLPYITFPEGSEEHTYLHAQRQKLHGYLPSRQPNFTEKLELPSLQDFGALLEEQ 33331111-----2222----------1111-----------------33333333---- SKEISTTIAFVRALNVMLKNKSIKDRLVPIIADEARTFGMEGLFRQIGIYSPEDEKGQIL ----------------------1111--------3333-3333----------------- QEGINELGAGCSWLAAATSYSTNNLPMIPFYIYYSMFGFQRIGDLCWAAGDQQARGFLIG -----------------3333------------33333333--------1111------- GTSGRTTLNGEGLQHEDGHSHIQSLTIPNCISYDPAYAYEVAVIMHDGLERMYGEKQENV ---111111111111---3333----1111-----------------------3333--- YYYITTLNENYHMPAMPEGAEEGIRKGIYKLETIEGSKGKVQLLGSGSILRHVREAAEIL ----------------2222-------------------------!!!!----------- AKDYGVGSDVYSVTSFTELARDGQDCERWNMLHPLETPRVPYIAQVMNDAPAVASTDYMK --------------------------------1111----3333---------------- LFAEQVRTYVPADDYRVLGTDGFGRSDSRENLRHHFEVDASYVVVAALGELAKRGEIDKK 
-----3333------------------------1111----------------------- VVADAIAKFNIDADKVNPRLA ------1111-1111-3333- >UNCHARACTERIZED PROTEIN C; SWP:Q8TX89; PDB:2IECA; LSDRERAIFEAGITLGAIYHQFCGTPVSPGTAEEVAKCIERAALLQPCVIDARVEVDVSS ---------------------2222--------------------2222---------33 EDTDNYGGYTEVSGRNLRVTIVTRCGEWEAVGKLEFIEELNYPLMWVEEIRRV 33--3333----3333--------!!!!--------3333------------- >Probable ABC transporter ; SWP:P42400; PDB:2IEEA; TGWEQIKDKGKIVVATSGTLYPTSYHDTDSGSDKLTGYEVEVVREAAKRLGLKVEFKEGI ---------------------------1111----------------1111--------- DGLTAVNSGQVDAAANDIDVTKDREEKFAFSTPYKYSYGTAIVRKDDLSGIKTLKDLKGK --------------------3333----------------------------33332222 KAAGAATTVYEVARKYGAKEVIYDNATNEQYLKDVANGRTDVILNDYYLQTLALAAFPDL --------------------------------------------------------1111 NITIHPDIKYPNKQALVKKSNAALQKKNEALKESKDGSLTKLSKQFFNKADVSKKIDADV ----1111---------1111------------1111---------iiii1111------ QDVD ---- >HYPOTHETICAL PROTEIN TT00; SWP:Q72LM7; PDB:2IELA; ARYLVVAHRTAKSPELAAKLKELLAQDPEARFVLLVPAVPPPGWVYNEVRRRAEEEAAAA -------1111---------------1111------------------------------ KRALEAQGIPVEEAKAGDISPLLAIEEELLAHPGAYQGIVLSTLPPGLSRWLRLDVHTQA ----1111-----------------------2222---------2222----------33 ERFGLPVIHVIA 33---------- >MUSCLE-SPECIFIC KINASE RE; SWP:Q62838; PDB:2IEPA; LPKAPVITTPLETVDALVEEVATFMCAVESYPQPEISWTRNKILIKLFDTRYSIRENGQL ----------------2222-------------------%%%%--1111-----%%%%-- LTILSVEDSDDGIYCCTANNGVGGAVESCGALQVKMKPKITRPPINVKIIEGLKAVLPCT ------1111---------------------------------------2222------- TMGNPKPSVSWIKGDSALRENSRIAVLESGSLRIHNVQKEDAGQYRCVAKNSLGTAYSKL --------------------1111--3333-------1111---------1111------ VKLEVEV ------- >SPIKE GLYCOPROTEIN; SWP:Q6Q1S2; PDB:2IEQA; AINNIVASFSSVNDAITQTAEAIHTVTIALNKIQDVVNQQGSALNHLTSQLTYLNLSSEL ------------------------------------------------------------ KQLEAKTASLFQTTVELQGLIDQINSTY -----------------------1111- >INOSITOL POLYPHOSPHATE MU; SWP:P07250; PDB:2IEWA; 
GLLIFKPAFPQELEFYKAIQGDAPLCSWMPTYLGVLNESKQYLVLENLLYGFSKPNILDI -----------------------3333-------------------3333---------- KLGKTLYDSKASLEKRERMKRVSETTTSGSLGFRICGMKIQKNPSVLNQLSLEYYEEEAD -------1111---------------3333------------333311113333------ SDYIFINKLYGRSRTDQNVSDAIELYFNNPHLSDARKHQLKKTFLKRLQLFYNTMLEEEV ----------11113333----------1111---------------------3333--- RMISSSLLFIYEGDPERWELLNDVDKLMRDDFIDSLSSMSLIDFAHSEITPGKGYDENVI --------------------%%%%-------------------------2222--3333- EGVETLLDIFMKFLEHHH -------------2222- >DEPHOSPHO-COA KINASE; SWP:O67792; PDB:2IF2A; KRIGLTGNIGCGKSTVAQFRELGAYVLDADKLIHSFYRKGHPVYEEVVKTFGKGILDEEG -------------------3333----------3333----------------1111--- NIDRKKLADIVFKDEEKLRKLEEITHRALYKEIEKITKNLSEDTLFILEASLLVEKGTYK ---------------------------1111----------------------1111111 NYDKLIVVYAPYEVCKERAIKRGSEEDFERRWKKQPIEEKVKYADYVIDNSGSIEETYKQ 1-----------------------------3333-33331111----------------- VKKVYEELTR ---------- >ZINC FINGER AND BTB DOMAI; SWP:O95365; PDB:2IF5A; IGIPFPDHSSDILSGLNEQRTQGLLCDVVILVEGREFPTHRSVLAACSQYFKKLFTSQQN ----1111-----------------------iiii------------------------- VYEIDFVSAEALTALMDFAYTATLTVSTANVGDILSAARLLEIPAVSHVCADLLD --------------------------3333------------------------- >HYPOTHETICAL PROTEIN YIIX; SWP:Q8X778; PDB:2IF6A; WQPQTGDIIFQISRSSQSKAIQLATHSDYSHTGMLVMRNKKPYVFEAVGPVKYTPLKQWI ---2222-------1111-------------------%%%%------------------- AHGEKGKYVVRRVEGGLSVEQQQKLAQTAKRYLGKPYDFSFSWSDDRQYCSEVVWKVYQN --2222------2222-----------33332222--1111------------------- ALGMRVGEQQKLKEFDLSNPLVQAKLKERYGKNIPLEETVVSPQAVFDAPQLTTVAKEWP ----------3333---------------!!!!-1111----------3333-------- LF -- >SLAM FAMILY MEMBER 6; SWP:Q96DU3; PDB:2IF7A; LTPLMVNGILGESVTLPLEFPAGEKVNFITWLFNETSLAFIVPHETKSPEIHVTNPKQGK --------------------------------!!!!--------------------3333 RLNFTQSYSLQLSNLKMEDTGSYRAQISTKTSAKLSSYTLRILRQLRNIQVTNHSQNMTC ----1111-------1111----------------------------------------- 
ELHLTCSVEDADDNVSFRWEALGNTLSSQPNLTVSWDPRISSEQDYTCIAENAVSNLSFS --------------------%%%%------------1111-------------------- VSAQKLCE -3333--- >HYPOTHETICAL PROTEIN SMU.; SWP:Q8DW21; PDB:2IFAA; SNFLDLQKQRRSIYALGKTVDLSKAELVALIQNAIKQAPSAFNSQTSRALVLFGQDSQDF -3333----------------------------------2222----------------- WNKIAYSELEKVTPAEAFAGTKAKLESFAAGVGTILLFEDQAVVRNLEENFPLYAENFQP -------------1111---------------------------------33331111-- WSEQAHGIALYAIWLALAEQNIGSVQHYNPLVDAQVAEKYDLPTNWKRAQIPFGSIEAPA -----------------1111---------------------3333-------------- GEKEFADQERFKVFGDL -----3333-------- >TRANSLATION INITIATION FA; SWP:P02999; PDB:2IFEA; VIQVKEIKFRPGTDEGDYQVKLRSLIRFLEEGDKAKITLRFRGREMAHQQIGMEVLNRVK ---------2222---------------3333----------------------3333-- DDLQELAVVESFPTKIEGRQMIMVLAPKKKQ ------------------------------- ---------------------------------------------- >THIOREDOXIN; SWP:THIO_HUMAN; PDB:2IFQA; MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHSLSEKYSNVIFLEVDVD -----------------!!!!-------1111----------------1111-----111 DCQDVASEEVKCMPTFQFFKKGQKVGEFSGANKEKLEATINELV 1------------------iiii--------------------- >SCYTALIDOPEPSIN B; SWP:P15369; PDB:2IFRA; TVESNWGGAILIGSDFDTVSATANVPSASGGSSAAGTAWVGIDGDTCQTAILQTGFDWYG ------------------------------1111------------------------11 DGTYDAWYEWYPEVSDDFITISEGDSIQMSVTATSDTSGSATLENLTTGQKVSKSFSNES 11-------------------2222----------------------------------- SGSLCRTNAEFIIEDFEECNSNGSDCEFVPFASFSPAVEFTDCSVTSDGESVSLDDAQIT -------------------1111-----------------------iiii---1111--- QVIINNQDVTDCSVSGTTVSCSYV ---%%%%----------------- >Wiskott-Aldrich Syndrome ; SWP:WASIP_HUMAN; PDB:2IFSA; GSESRFYFHPISDLPPPEPYVQTTKSYPSKLARNESRGGLVPRGSGGSLFSFLGKKCVTM ---------3333----------------------------------------------- SSAVVQLYAADRNCMWSKKCSGVACLVKDNPQRSYFLRIFDIKDGKLLWEQELYNNFVYN -----------------------------1111--------------------------- SPRGYFHTFAGDTCQVALNFANEEEAKKFRKAVTDLLGRRQRKSEKRRD ------------------------------------------------- >PUTATIVE 
METHYLASE HI0767; SWP:P44869; PDB:2IFTA; GEVRIIAGLWRGRKLPVLDRVKETLFNWLPYIHQSECLDGFAGSGSLGFEALSRQAKKVT -------1111-------------------3333-----------------1111----- FLELDKTVANQLKKNLQTLKCSSEQAEVINQSSLDFLKQPQNQPHFDVVFLDPPFHFNLA ---------------------1111------33331111--------------------- EQAISLLCENNWLKPNALIYVETEKDKPLITPENWTLLKEKTTGIVSYRLYQNLE -------------2222--------------1111-------------------- >GAMMA-SNAP; SWP:Q5BJK3; PDB:2IFUA; AAQKISEAHEHIAKAEKYLKTSFKWKPDYDSAASEYAKAAVAFKNAKQLEQAKDAYLQEA -------------------------------------------1111------------- EAHANNRSLFHAAKAFEQAGLKDLQRPEAVQYIEKASVYVENGTPDTAAALDRAGKLEPL ---1111--------------1111-3333---------11113333------------- DLSKAVHLYQQAAAVFENEERLRQAAELIGKASRLLVRQQKFDEAAASLQKEKSYKEENY ----------------------------------------------------------33 PTCYKKCIAQVLVQLHRADYVAAQKCVRESYSIPGFSGSEDCAALEDLLQAYDEQDEEQL 33--------------------------33332222------------------------ LRVCRSPLVTYDNDYAKLAISLKVP -----3333--3333---1111--- >HYPOTHETICAL PROTEIN; SWP:Q471R3; PDB:2IFXA; GIRLLYLLVKPAGSDETFRAECLRHYESHDVPGLHKYEVRLVAEQPHVPFFDIGHVDAIG ---------------------------1111----------------------------- ECWFKDDAAYATYASDIRKAWFEHGKTFIGQLKPFRTAPVAG ------------------------------------------ >2,3-bisphosphoglycerate-i; SWP:Q81X77; PDB:2IFYA; RKPTALIILDGFGLREETYGNAVAQAKKPNFDGYWNKFPHTTLTACGEAVGLPEGQMGNS -----------------2222-1111-3333--------------!!!!---------33 EVGHLNIGAGRIVYQSLTRVNVAIREGEFDKNETFQSAIKSVKEKGTALHLFGLLSDGGV 33-------------3333----111111113333------------------------- HSHMNHMFALLRLAAKEGVEKVYIHAFLDGRDVGPKTAQSYIDATNEVIKETGVGQFATI --3333------------------------------------------------------ SGRYYSMDRDKRWDRVEKCYRAMVNGEGPTYKSAEECVEDSYANGIYDEFVLPSVIVNED -3333------------------------------------1111--------------- NTPVATINDDDAVIFYNFRPDRAIQIARVFTNGDFREFDRGEKVPHIPEFVCMTHFSETV -------2222---------------------------------------------1111 DGYVAFKPMNLDNTLGEVVAQAGLKQLRIAETEKYPHVTFFFSGGREAEFPGEERILINS 
------------------------------3333---------------2222------- PKVATYDLKPEMSIYEVTDALVNEIENDKHDVIILNFANCDMVGHSGMMEPTIKAVEATD -----3333-!!!!----------1111---------3333--3333------------- ECLGKVVEAILAKDGVALITADHGNADEELTSEGEPMTAHTTNPVPFIVTKNDVELREDG ----------1111----------3333-------------------------------- ILGDIAPTMLTLLGVEQPKEMTGKTIIK 11113333---------3333------- >GROUP III TRUNCATED HAEMO; SWP:Q0PB48; PDB:2IG3A; MKFETINQESIAKLMEIFYEKVRKDKDLGPIFNNAIGTSDEEWKEHKAKIGNFWAGMLLG ------------------------------------------------------------ EGDYNGQPLKKHLDLPPFPQEFFEIWLKLFEESLNIVYNEEMKNVILQRAQMIASHFQNM ------------------3333-------------------------------------- LYKYGGH ------- >NIMC/NIMA FAMILY PROTEIN; SWP:Q97G05_CLOAB; PDB:2IG6A; ALEFLKECGVFYLATNEGDQPRVRPFGAVFEYEGKLYIVSNNTKKCFKQIQNPKVEISGN ----------------!!!!-----------iiii-----11113333-----------1 KKGQWIRLTGEVANDDRREVKELALEAVPSLKNYSVDDGIFAVLYFTKGEGTICSFKGEN 111-------------3333--------------1111---------------------- ETFSL ----- >CHOLINE/ETHANOLAMINE KINA; SWP:Q9Y259; PDB:2IG7A; RDAERRAYQWCREYLGGAWRRVQPEELRVYPVSGGLSNLLFRCSLPDHLPSVGEEPREVL ---------------!!!!---3333---------------------------------- LRLYGAILQGVDSLVLESVFAILAERSLGPQLYGVFPEGRLEQYIPSRPLKTQELREPVL ----3333---------------1111--------1111-----------3333------ SAAIATKAQFHGEPFTKEPHWLFGTERYLKQIQDLPNLLEYSLKDEGNLRKLLESTPSPV --------3333---------3333------1111-3333--3333------1111---- VFCHNDIQEGNILLLSLLVDFEYSSYNYRGFDIGNHFCEWVYDYTHEEWPFYKARPTDYP -------3333---------1111---3333-----------------------3333-- TQEQQLHFIRHYLAEAKKGETLSQEEQRKLEEDLLVEVSRYALASHFFWGLWSILQASST --------------1111------------------------------------3333-- IEFGYLDYAQSRFQFYFQQKGQLTS ------------------------- >HYPOTHETICAL PROTEIN PA34; SWP:Q9HYB0; PDB:2IG8A; MTAVRRIRAAALPDLPDASWSNALLVGEELVMSGMTAHPATRQAAERGAALDAHAQALVV --------3333--1111-------!!!!---------------1111------------ LGKVKALLEAAGGHVGNLYKLNVYVTRIADKDAIGRARQEFFAGQGTFPASTLVEVSGLV --------1111--3333--------3333------------------------------ 
FPELLVEIDAWARLDIDLANCD 1111--------11113333-- >PROTEIN G; SWP:P06654; PDB:2IGD; MTPAVTTYKLVINGKTLKGETTTKAVDAETAEKAFKQYANDNGVDGVWTYDDATKTFTVT ---------------------------------------1111----------------- E - >RETINOBLASTOMA-ASSOCIATED; SWP:O14777; PDB:2IGPA; DPRPLNDKAFIQQCIRQLCEFLTENGYAHNVSMKSLQAPSVKDFLKIFTFLYGFLCPSYE ---1111---------------1111-----3333--------------------1111- LPDTKFEEEVPRIFKDLGYPFALSKSSMYTVGAPHTWPHIVAALVWLIDCIKIH --2222--------1111-----------1111--------------------- >EUCHROMATIC HISTONE METHY; SWP:NA; PDB:2IGQA; ERIVSRDIARGYERIPIPCVNAVDSEPCPSNYKYVSQNCVTSPMNIDRNITHLQYCVCID ------1111--------------------------------------1111-------- DCSSSNCMCGQLSMRCWYDKDGRLLPEFNMAEPPLIFECNHACSCWRNCRNRVVQNGLRA ---3333---1111----1111--11113333-------3333--1111---3333---- RLQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREEDSYLFDLVYCIDARFY --------------------2222--------------1111--1111------------ GNVSRFINHHCEPNLVPVRVFMAHQDLRFPRIAFFSTRLIEAGEQLGFDYGERFWDIKGS ---1111------------------3333-----------2222------3333------ CRCGSPKCRHS ----1111--- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2IGSA; INIYQNPGQSLANIYKGFARQCNPGFVFPEAQTIEAWDIPLRLHPEFIPGGDISKADQQY ----------------------1111-----------------3333-%%%%1111---- STLLAQEIANGVTIGFRVNEKERVCNVEILPLLTSAQNLDRIKARFGSGYLDRFKGSPNV -------------------------------------------------3333------- YPTDVGFSTDASGGISQESGLLVSYGVNLRTLTPGTWQATLPEDIKALVGPGVGLRLDAP 1111-----1111--3333--------3333-----------------3333---1111- NFSDVFNTIKSGLRYTTAVTLLLAYFAAIGS ------------------------------- >SAM DEPENDENT METHYLTRANS; SWP:Q8UIF7_AGRT5; PDB:2IGTA; GQRTGELPAEHVPVILESSGAGDFHLIDSGNGLKLEQYGDYRVVRPEAQALWRPLVPDRV --------------------!!!!-----%%%%----!!!!-----1111------3333 WQNADAIFTGDTGGRWRFPKEALGETWPLSLLGVEFLGRFTAFRHVGVFPEQIVHWEWLK 1111-------------3333---------iiii--------------3333-------- NAVETADRPLKVLNLFGYTGVASLVAAAAGAEVTHVDASKKAIGWAKENQVLAGLEQAPI ----------------!!!!------1111------------------------1111-- 
RWICEDAKFIQREERRGSTYDIILTDPPKFGRGTHGEVWQLFDHLPLLDICREILSPKAL ---------------------------------------3333--------11111111- GLVLTAYSIRASFYSHELRETRGAGGVVASGELVIREAGLDGKTPGRVLSTSLFSRWEPK -------33333333--------------------------------------------- >FAB HEAVY CHAIN; SWP:Q99LC4; PDB:2IH3A; QVQLQQPGAELVKPGASVKLSCKASGYTFTSDWIHWVKQRPGHGLEWIGEIIPSYGRANY ------------2222------------1111-------2222---------1111---- NEKIQKKATLTADKSSSTAFMQLSSLTSEDSAVYYCARERGDGYFAVWGAGTTVTVSSAK 1111---------1111---------3333------------------------------ TTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVLQSDLY -----------1111-----------------------%%%%--2222------------ TLSSSVTVPSSSWPSETVTCNVAHPASSTKVDKKIVPRD --------3333-----------3333------------ >Igk-C protein; SWP:Q58EU4; PDB:2IH3B; DILLTQSPAILSVSPGERVSFSCRASQSIGTDIHWYQQRTNGSPRLLIKYASESISGIPS -------------2222---------------------2222------------222233 RFSGSGSGTDFTLSINSVESEDIANYYCQQSNRWPFTFGSGTKLEIKRADAAPTVSIFPP 33----------------3333-------------------------------------- SSEQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLT 3333-------------------------iiii--2222--------------------- LTKDEYERHNSYTCEATHKTSTSPIVKSFNRN -----1111--------1111----------- >Voltage-gated potassium c; SWP:P0A334; PDB:2IH3C; LHWRAAGAATVLLVIVLLAGSYLAVLAERGAPGAQLITYPRALWWACETATTVYGDLYPV ---------------------------2222-----------------1111-------- TLWGRLVAVVVMVAGITSFGLVTAALATWFVGREQER ------------------------------------- >LACCASE-1; SWP:Q70KY3; PDB:2IH8A; EPTCNTPSNRACWSDGFDINTDYEVSTPDTGVTQSYVFNLTEVDNWMGPDGVVKEKVMLI -----1111----222211113333----------------------1111--------% NGNIMGPNIVANWGDTVEVTVINNLVTNGTSIHWHGIHQKDTNLHDGANGVTECPIPPKG %%%--------2222------------------2222-22221111-2222--------- GQRTYRWRARQYGTSWYHSHFSAQYGNGVVGTIQINGPASLPYDIDLGVFPITDYYYRAA --------------------!!!!1111-------------------------------- DDLVHFTQNNAPPFSDNVLINGTAVNPNTGEGQYANVTLTPGKRHRLRILNTSTENHFQV -------------------iiii----------------2222----------------- 
SLVNHTMTVIAADMVPVNAMTVDSLFLAVGQRYDVVIDASRAPDNYWFNVTFGGQAACGG -2222------!!!!------------2222---------------------%%%%---- SLNPHPAAIFHYAGAPGGLPTDEGTPPVDHQCLDTLDVRPVVPRSVPVNSFVKRPDNTLP -----------2222--------------%%%%--------------1111--1111--- VALDLTGTPLFVWKVNGSDINVDWGKPIIDYILTGNTSYPVSDNIVQVDAVDQWTYWLIE --------------iiii----11113333---------3333----------------- NDPEGPFSLPHPMHLHGHDFLVLGRSPDVPAASQQRFVFDPAVDLARLNGDNPPRRDTTM -1111------------------------1111------33331111------------- LPAGGWLLLAFRTDNPGAWLFHCHIAWHVSGGLSVDFLERPADLRQRISQEDEDDFNRVC -2222-----------------------------------33331111------------ DEWRAYWPTNPYPKIDSGL -----3333---------- >Regulator of G-protein si; SWP:O43665; PDB:2IHBB; LKSTAKWAASLENLLEDPEGVKRFREFLKKEFSEENVLFWLACEDFKKMQDKTQMQEKAK -----3333-------------------1111-------------3333----------- EIYMTFLSSKASSQVNVEGPHPLMFQKLQDQIFNLMKYDSYSRFLKSDLFL -------1111---------1111--------------------------- >TRANSCRIPTION REGULATOR P; SWP:O14867; PDB:2IHCA; SVFAYESSVHSTNVLLSLNDQRKKDVLCDVTIFVEGQRFRAHRSVLAACSSYFHSRIVGQ ---------------------------------%%%%-------------------1111 ELNITLPEEVTVKGFEPLIQFAYTAKLILSKENVDEVCKCVEFLSVHNIEESCFQFL ------33333333---------------3333----------------33333333 >REGULATOR OF G-PROTEIN SI; SWP:P57771; PDB:2IHDA; STEEATRWADSFDVLLSHKYGVAAFRAFLKTEFSEENLEFWLACEEFKKTRSTAKLVSKA ----3333---------------------1111-------------1111---------- HRIFEEFVDVQAPREVNIDFQTREATRKNLQEPSLTCFDQAQGKVHSLMEKDSYPRFLRS --------2222----------------3333-1111---------------3333---- KMYLDLL 3333--- >PYROPHOSPHATE SYNTHASE; SWP:NA; PDB:2IHIA; FFRNMYDKYRDAFLSHLNEYSLEEEIKEHISKYYKLLFDYNCLGGKNNRGILVILIYEYV 3333------------1111---------------------------------------- KINSSEWEKAACLAWCIEILQAAFLVADDIMDKGEMRRNKYCWYLLKDVETKNAVNDVLL -----------------------------1111---%%%%-33331111-3333------ LYNSIYKLIEIYLRNESCYVDVIATFRDATLKTIIGQHLDTNIFSDKYSDAHREIDVNNI ------------1111-------------------------11113333------1111- NVPQPVIDINMINFGVYKNIVIHKTAYYSFFLPIVCGMLLAGIAVDNLIYKKIEDISMLM 
-------3333--------------------------------1111------------- GEYFQIHDDYLDIFGSDIQNNKLTWPLIKTFELCSEPDKIKIVKNYGNNLACVKVIDSLK ----------------3333----------------------------3333-------- IRKHYESYEKAQKAKILSAINELHHEGIEYVLKYLLEI -------------------1111--------------- >JAPANESE QUAIL EGG WHITE ; SWP:P00701; PDB:2IHL; KVYGRCELAAAMKRHGLDKYQGYSLGNWVCAAKFESNFNTQATNRNTDGSTDYGILQINS ------------11112222---3333--------%%%%------1111----1111-33 RWWCNDGRTPGSRNLCNIPCSALLSSDITASVNCAKKIVSDVHGMNAWVAWRNRCKGTDV 33------2222-1111-3333------------------1111----------222233 NAWIRGCRL 33-2222-- >DNA POLYMERASE MU; SWP:Q9JIW4; PDB:2IHMA; SMPAYACQRPSPLTHHNTLLSEALETLAEAAGFEANEGRLLSFSRAASVLKSLPCPVASL ----1111-----------------------1111-----------------------11 SQLHGLPYFGEHSTRVIQELLEHGTCEEVKQVRCSERYQTMKLFTQVFGVGVKTANRWYQ 11--------------------------------------------2222--------11 EGLRTLDELREQPQRLTQQQKAGLQYYQDLSTPVRRADAEALQQLIEAAVRQTLPGATVT 11--3333-------------------3333---3333---------------2222--- LTGGFRRGKLQGHDVDFLITHPEEGQEVGLLPKVMSCLQSQGLVLYHQYHRSHDVFERSF --3333----------------22222222------------------------------ CILGLPQPTWKAVRVDLVVTPSSQFPFALLGWTGSQFFERELRRFSRQEKGLWLNSHGLF --------------------33333333--------------------------1111-- DPEQKRVFHATSEEDVFRLLGLKYLPPEQRNA -----------------1111----3333--- >Bacterial peptide chain r; SWP:Q72GJ6; PDB:2IHR1; LAQRLEGLGGIFDIPQKETRLKELERRLEDPSLWNDPEAARKVSQEAARLRRTVDTFRSL 3333--------11111111-3333----------3333-1111---------------- ESDLQGLLELMEELPAEEREALKPELEEAAKKLDELYHQTLLNFPHAEKNAILTIQPGAG 3333----1111-----11111111--------------------3333----------- GTEACDWAEMLLRMYTRFAERQGFQVEVVDLTPGPEAGIDYAQILVKGENAYGLLSPEAG 3333--------------------------------------------------1111-- VHRLVRPSPFDASGRRHTSFAGVEVIPEVDEEVEVVLKPEELRIDVMRASGPGGQGVNTT -------1111--------------------------1111------------------- DSAVRVVHLPTGITVTCQTTRSQIKNKELALKILKARLYELERKKREEELKALRGEVRPI ------------------------------------------------3333-------- 
EWGSQIRSYVLDKNYVKDHRTGLMRHDPENVLDGDLMDLIWAGLEWKAGRR -----------------------------1111--3333------------ >CG2944-PF, ISOFORM F; SWP:A1Z6E2; PDB:2IHSA; LQADFVKPARIDILLDMPPASRDLQLKHSWNSEDRSLNIFVKEDDKLTFHRHPVAQSTDC -------3333--1111-------------1111-1111--1111--------------- IRGKVGLTKGLHIWEIYWPTRQRGTHAVVGVCTADAPLHSVGYQSLVGSTEQSWGWDLGR ------------------1111----------1111---------22221111------- NKLYHDSKNCAGVTYPAILKNFLVPDKFLVALDMDEGTLSFIVDQQYLGIAFRGLRGKKL -----3333---------------------------------%%%%--------2222-- YPIVSAVWGHCEITMRYIGGLVD -------2222------------ >NUCLEOCAPSID (NC) PROTEIN; SWP:Q77YH0; PDB:2IHXA; GRARGLCYTCGSPGHYQAQCPKKRKSGNSRERCQLCNGMGHNAKQCRKRD --2222---------33331111------------------3333----- >SON OF SEVENLESS HOMOLOG ; SWP:Q07889; PDB:2II0A; QMRLPSADVYRFAEPDSEENIIFEGIPIIKAGTVIKLIERLTYHMYADPNFVRTFLTTYR -----1111-------1111-------------------------------------333 SFCKPQELLSLIIERFEIPEPEPTEADRIAIENGDQPLSAELKRFRKEYIQPVQLRVLNV 3----------------------------------------------------------- CRHWVEHHFYDFERDAYLLQRMEEFIGTVRGKAMKKWVESITKIIQRKKIAFQSSPPTVE ---------3333------------1111-3333-------------------------- WHISRPGHIETFDLLTLHPIEIARQLTLLESDLYRAVQPSELVGSVWTKEDKEINSPNLL ----222211111111-----------------1111333322221111-3333------ KMIRHTTNLTLWFEKCIVETENLEERVAVVSRIIEILQVFQELNNFNGVLEVVSAMNSSP ----------------1111---------------------------------------- VYRLDHTFEQIPSRQKKILEEAHELSEDHYKKYLAKLRSINPPCVPFFGIYLTNILKTEE 1111---1111----------------%%%%-----1111-------3333--------- GNPEVLKRHGKELINFSKRRKVAEITGEIQQYQNQPYCLRVESDIKRFFENLNPMGNSME -------iiii----------------3333-----------------11113333---- KEFTDYLFNKSLEIEPRNPKPLPRFPKKYSYPLKSPGVRPSNP ------------------------------------------- >ACETAMIDASE; SWP:Q9KGN3; PDB:2II1A; GIRLSNENTIFFDKENVPIASCQSGDTVIFETKDCFSDQITNEEQALTSIDFNRVNPATG ------------1111------2222-------1111----11111111-1111------ PLYVEGARRGDLEIEILDIKVGKQGVTAAPGLGALGESLNSPTTKLFPIEGDDVVYSTGL ---22222222-----------------22221111-------------!!!!---1111 
RLPLQPIGVIGTAPPGEPINNGTPGPHGGNLDTKDIKPGTTVYLPVEVDGALLALGDLHA -------------------3333-1111----11112222-------2222--------- AGDGEILICGVEIAGTVTLKVNVKKERFPLPALKTDTHFTIASAETLDAAAVQATKNATF ----3333---------------------------------------------------- LANRTALSIEEAGLLSGAGDLYVSQIVNPLKTARFSLALHYFEKLGV -------3333------------------------------------ >Lipoamide acyltransferase; SWP:P11181; PDB:2II3A; GKDRTEPVKGFHKAMVKTMSAALKIPHFGYCDEVDLTELVKLREELKPIAFARGIKLSFM --------!!!!---------1111--------------------3333-1111------ PFFLKAASLGLLQFPILNASVDENCQNITYKASHNIGIAMDTEQGLIVPNVKNVQIRSIF -------------3333-----------------------------------3333---- EIATELNRLQKLGSAGQLSTNDLIGGTFTLSNIGSIGGTYAKPVILPPEVAIGALGTIKA ------------------3333---------3333------------------------- LPRFNEKGEVCKAQIMNVSWSADHRIIDGATVSRFSNLWKSYLENPAFMLLDLK ----1111------------------------------------33333333-- >SENSORY RHODOPSIN TRSANSD; SWP:Q8YSC3_ANASP; PDB:2IIAA; IGRTCWAIAEGYIPPETVCILNAGDEDAHVEITIYYSDKEPVGPYRLTVPARRTKHVRFN ---------------------------------------------------------333 DLNDPAPIPHDTDFASVIQSNVPIVVQHT 3---------------------------- >L-AMINO-ACID OXIDASE; SWP:P81382; PDB:2IIDA; RNPLAECFQENDYEEFLEIARNGLKATSNPKHVVIVGAGMAGLSAAYVLAGAGHQVTVLE -1111--------------------------------------------1111------- ASERPGGRVRTYRNEEAGWYANLGPMRLPEKHRIVREYIRKFDLRLNEFSQENDNAWYFI -----!!!!-----1111----------3333-------1111---------1111---% KNIRKKVGEVKKDPGLLKYPVKPSEAGKSAGQLYEESLGKVVEELKRTNCSYILNKYDTY %%%-----------1111---3333------------------------------1111- STKEYLIKEGDLSPGAVDMIGDLLNEDSGYYVSFIESLKHDDIFAYEKRFDEIVDGMDKL -------------------------3333-----------------------2222---- PTAMYRDIQDKVHFNAQVIKIQQNDQKVTVVYETLSKETPSVTADYVIVCTTSRAVRLIK -------3333----------------------------------------33331111- FNPPLLPKKAHALRSVHYRSGTKIFLTCTTKFWEDDGIHGGKSTTDLPSRFIYYPNHNFT ---------------------------------1111---------3333--------11 NGVGVIIAYGIGDDANFFQALDFKDCADIVFNDLSLIHQLPKKDIQSFCYPSVIQKWSLD 
11--------!!!!-1111----------------1111-3333-----------3333- KYAMGGITTFTPYQFQHFSDPLTASQGRIYFAGEYTAQAHGWIDSTIKSGLRAARDVNLA ----------2222-----------!!!!---3333-----------------------3 SEN 333 >INTEGRATION HOST FACTOR; SWP:IHFA_ECOLI; PDB:2IIEA; MASTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGGSGGLTKAEMSEYLFDKL ---------------11113333----------------------------------111 GLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGRNPKTGEDIPITAR 1-3333---------------1111----2222--------------------------- RVVTFRPGQKLKSRVENAGGGERIEIRGFGSFSLHYRAPRTGRNPKTGDKVELEGKYVPH -------------------------2222------------------------------- FKPGKELRDRANIYGGSGHHHHHH ------------------------ >PROTO-ONCOGENE TYROSINE-P; SWP:P06239; PDB:2IIMA; GSPLQDNLVIALHSYEPSHDGDLGFEKGEQLRILEQSGEWWKAQSLTTGQEGFIPFNFVA -3333-------------2222---2222-------------------------3333-- KA -- >NICOTINAMIDE N-METHYLTRAN; SWP:NNMT_HUMAN; PDB:2IIPA; MESGFTKDTYLSHFNPRDYLEKYYKSAESQILKHLLKNLFKIFCLDGVKGDLLIDIGSGP ------3333-------------------------------------------------- TIYQLLSACESFKEIVVTDYSDQNLQELEKWLKAAPAAFDWSPVVTYVCDLEGNRVKGPE -33333333-------------------------1111--3333-----1111------- KEEKLRQAVKQVLKCDVTQSQPLGAVPLPPADCVLSTLCLDAACPDLPTYCRALRNLGSL -----3333------1111-1111--------------3333-------------3333- LKPGGFLVIMDALKSSYYMIGEQFSSLPLGREAVEAAVKEAGYTIEWFEVISQSYSSTMA -2222--------------!!!!--------------------------------1111- NNEGLFSLVARKL ------------- >MELANIN BIOSYNTHESIS PROT; SWP:Q8EIU4; PDB:2IIZA; NPREQLGVCAEGNLHSVYLFNANDNVESQLRPCIANVAQYIYELTDQYSDSAFNGFVAIG ----1111-----------------3333---------------3333-----------1 ANYWDSLYPESRPELKPFPAQEGNREAPAIEYDLFVHLRCDRYDILHLVANEISQFEDLV 11133333333----------!!!!----------------------------------- ELVEEERGFRFDSRDLTGFVDGTENPKGRHRQEVALVGSEDPEFKGGSYIHVQKYAHNLS --------------3333---1111-!!!!----------3333---------------- KWHRLPLKKQEDIIGRTKQDNIEYESEDKPLTSHIKRVNLKDENGKSIEILRQSPYGSLK -11113333--------1111---3333-1111--------1111------------333 
EQGLFISTCRTPDHFEKLHSVFGDGAGNHDHLHFTSALTGSSFFAPSLDFLQFD 3---------1111--------------------------------3333---- >TOXIC SHOCK SYNDROME TOXI; SWP:NA; PDB:2IJ0C; GAVVSQHPSMVIVKSGTSVKIECRSLDTNIHTMFWYRQFPKQSLMLMATSHQGFNAIYEQ -------------2222---------------------2222--------2222----22 GVVKDKFLINHASPTLSTLTVTSAHPEDSGFYVCSALAGSGSSTDTQYFGPGTQLTVL 223333------3333--------1111------------------------------ >CYTOCHROME P450 BM3; SWP:P14779; PDB:2IJ2A; KEMPQPKTFGELKNLPLLNTDKPVQALMKIADELGEIFKFEAPGRVTRYLSSQRLIKEAC --------!!!!-3333------------------------2222--------------- DESRFDKNLSQALKFVRDFAGDGLFTSWTHEKNWKKAHNILLPSFSQQAMKGYHAMMVDI 3333------------------3333-11113333-----3333-3333-1111------ AVQLVQKWERLNADEHIEVPEDMTRLTLDTIGLCGFNYRFNSFYRDQPHPFITSMVRALD -------11112222-------------------------1111---------------- EAMNKLNPDDPAYDENKRQFQEDIKVMNDLVDKIIADRKASGEQSDDLLTHMLNGKDPET ------11111111---------------------------------------------- GEPLDDENIRYQIITFLIAGHETTSGLLSFALYFLVKNPHVLQKAAEEAARVLVDPVPSY ------------------------------------------------------------ KQVKQLKYVGMVLNEALRLWPTAPAFSLYAKEDTVLGGEYPLEKGDELMVLIPQLHRDKT ------------------------------------------2222------3333---- IWGDDVEEFRPERFENPSAIPQHAFKPFGNGQRACIGQQFALHEATLVLGMMLKHFDFED ----1111---11113333-2222-1111!!!!-1111---------------------1 HTNYELDIKETLTLKPEGFVVKAKSKKIPL 111------------2222----------- >URIDYLATE KINASE; SWP:O28237; PDB:2IJ9A; MKVVLSLGGSVLSNESEKIREFAKTIESVAQQNQVFVVVGGGKLAREYIKSARELGASET ----------------------------------------------------1111---- FCDYIGIAATRLNAMLLISAIPSAAKKVPVDFMEAEELSKLYRVVVMGGTFPGHTTDATA --------------------1111------------------------------------ ALLAEFIKADVFINATNVDGVYSADPKSDTSAVKYDRLSPQQLVEIVSRGTNVVIDLLAA ------------------------------------------------------------ KIIERSKIKTYVILGTPENIMKAVKGEAVGTVIA ---------------------------------- >ARYLAMINE N-ACETYLTRANSFE; SWP:P18440; PDB:2IJAA; GSDIEAYLERIGYKKSRNKLDLETLTDILQHQIRAVPFENLNIHCGDADLGLEAIFDQVV 
---------------------------------------3333----------------- RRNRGGWCLQVNHLLYWALTTIGFETTLGGYVYSTPAKKYSTGIHLLLQVTIDGRNYIVD ------3333------------------------3333-------------iiii----- AGSGRSYQWQPLELISGKDQPQVPCVFRLTEENGFWYLDQIRREQYIPNEEFLHSDLLED ---!!!!-------2222---3333------iiii-------------3333--1111-- SKYRKIYSFTLKPRTIEDFESNTYLQTSPSSVFTSKSFCSLQTPDGVHCLVGFTLTHRRF --------------3333----3333-11111111---------------!!!!------ NYKDNTDLIEFKTLSEEEIEKVLKNIFNISLQRKLVPKHGDRFFTI --------------3333---------------------------- >Guanine nucleotide-releas; SWP:P27671; PDB:2IJES; GPLGSALEIAEQLTLLDHLVFKSIPYEEFFGQGWMKAEKYERTPYIMKTTKHFNHVSNFI --------------------1111-------11111111--------------------- ASEIIRNEDISARASAIEKWVAVADICRCLHNYNAVLEITSSINRSAIFRLKKTWLKVSK ----------------------------------------3333---11113333----- QTKSLLDKLQKLVSSDGRFKNLRESLRNCDPPCVPYLGMYLTDLVFIEEGTPNYTEDGLV ---------------%%%%----------------3333---------------1111-- NFSKMRMISHIIREIRQFQQTTYKIDPQPKVIQYLLDESFMLDEESLYESSLLIEPK ---------------------------------11111111---------------- >Cryptochrome DASH, chloro; SWP:Q84KJ5; PDB:2IJGX; MNDHIHRVPALTEEEIDSVAIKTFERYALPSKRKGKGVTILWFRNDLRVLDNDALYKAWS -2222-----------------------------------------------------11 SSDTILPVYCLDPRLFHTTHFFNFPKTGALRGGFLMECLVDLRKNLMKRGLNLLIRSGKP 11---------3333-------------------------------1111--------33 EEILPSLAKDFGARTVFAHKETCSEEVDVERLVNQGLKRVNSTKLELIWGSTMYHKDDLP 33----------------------------------------------------3333-- FDVFDLPDVYTQFRKSVEAKCSIRSSTRIPLSLGPTPSVDDWGDVPTLEKLGVEPQEVTR -3333---3333-----------------------------------3333--------- GMRFVGGESAGVGRVFEYFWKKDLLKVYKETRNGMLGPDYSTKFSPWLAFGCISPRFIYE -------------------111111111111------3333------------------- EVQRYEKERVANNSTYWVLFELIWRDYFRFLSIKCGNSLFHLGGPRNVQGKWSQDQKLFE ------------------------------------33331111---------------- SWRDAKTGYPLIDANMKELSTTGFMSNRGRQIVCSFLVRDMGLDWRMGAEWFETCLLDYD --------3333-------------------------------------------11113 
PCSNYGNWTYGAGVGNDPREDRYFSIPKQAQNYDPEGEYVAFWLQQLRRLPKEKRHWPGR 333--------------1111------------1111------3333---3333----33 LMYMDTVVPLKH 33---------- >MOLYBDENUM-BINDING TRANSC; SWP:Q8UCD4; PDB:2IJLA; KRLPLKPVLRIDFPPGERLGHGKVELQLIAETGSISAAGRADSYRRAWLLVDALNHFRQP ----------------------------------------------------1111---- VICSQRGAALTVFGAELLERYRGEERNEALREDIDWLEANRNPQ --------------------------3333-------------- >14-3-3 PROTEIN; SWP:Q5CSF3; PDB:2IJPA; NYKDVIKVLTENSLILLLAGSLRNRVTSIRNSLKSIKSQEEKLRKEKSLNNEFIQVIEDI 3333---3333--------------------------------------3333------- KRDFEESILLESEDVIRIIDDNLLMYSEEGARAFCIKLKGDLMRYKAEILKDEEKNQCIK ----------------------1111---------------------------------- QAVEFYEDALQRERSFLEKYPSDPLYLATILNYTILKYDLLGNPEGAMKFANRAIQAAEN ------------------3333----------------1111----------------11 SRSDQFSENTEKLLKILRDNVSQWEQGCSGLLTSAFF 11---------------------1111---------- >HYPOTHETICAL PROTEIN; SWP:Q5V3A0; PDB:2IJQA; GNPSGWRTDGQWEHETLRRAVVHGVRLYNSGEFHESHDCFEDEWYNYGRGNTESKFLHGM --22221111-----------------1111-----------1111-------------- VQVAAGAYKHFDFEDDDGMRSLFRTSLQYFRGVPNDYYGVDLLDVRTTVTNALSDPSALH -----------------------------2222--2222-----------3333----22 GWQIRLDGEYPTCRPEDIEFAESLE 22---iiii----3333-------- >HYPOTHETICAL PROTEIN API9; SWP:Q6EVP2; PDB:2IJRA; ALEWCKNRLEITGRSVFVDIQQWVTGEEVPLYRHAIQQSIRLFLAGCAGILKPVKCEYPP -------------3333------------3333------------------------333 FPRLVSHGTGSAVASNLAFQHWLDLLLKDAVLDGDTIRQIDRIYLQSGIASVKWETIPEG 3------------------------1111------------------3333--------- ARQIITQLARQYPDWFGVASWSSHINGADCWTKLGVQEHACNCDLIIPTRLAIELNGNSQ -------------------------------1111--------------------!!!!- LLTGVSTTHDLYSHLYGAWPSGQNIIWQRDRINSLRLDFDSPSYPPSAELGELSAVFDCE -2222-3333--------------------1111------------3333----1111-- IRHWYQEPVNGIRGYDCYDRGDHVDSGEYGAGPF -------1111-------iiii------------ >PROBABLE M18-FAMILY AMINO; SWP:Q9HYZ3; PDB:2IJZA; LIDFLKASPTPFHATASLARRLEAAGYRRLDEGGRYYVTRNDSSLIAIRLGSPLESGFRL 
---3333-------------------------------!!!!------------------ VGAHTDSPCLRVKPNPEIARNGFLQLGVEVYGGALFAPWFDRDLSLAGRVTFRANGKLES -------------------iiii------------------------------------- RLVDFRKAIAVIPNLNPINAQNELPPIIAQLAPGETADVVLDYELSFYDTQSAAVVGLND ------------------------------------------------------------ EFIAGARLDNLLSCHAGLEALLNAEGDENCILVCTDHEEVGSCSHCGADGPFLEQVLRRL ---------------------1111-------------1111------------------ LPEGDAFSRAIQRSLLVSADNAHGVHPNYADRHDANHGPALNGGPVIKINSNQRYATNSE -------3333----------------------------------------------111 TAGFFRHLCQDSEVPVQSFVTRSDMGCGSTIGPITASQVGVRTVDIGLPTFAMHSIRELA 111113333------------3333-----------3333-------------------- GSHDLAHLVKVLGAFYASS --3333------------- >Regulator of G-protein si; SWP:O15492; PDB:2IK8B; SFDLLLSSKNGVAAFHAFLKTEFSEENLEFWLACEEFKKIRSATKLASRAHQIFEEFICS -3333--------------1111------------1111--------------------- EAPKEVNIDHETRELTRMNLQTATATCFDAAQGKTRTLMEKDSYPRFLKSPAYRDLAAQA -----------------------1111-----------------------------3333 >HYPOTHETICAL PROTEIN NMB1; SWP:Q7DDI9; PDB:2IKBA; SDKFNQFINRVLSHEGGYANHPKDPGGETNWGITKRTAQANGYNGSRATREQAISIYRKA --------------------11111111-iiii--------------------------- FWERYRADQPEAVAFQFFDACVNHGYGNAARLQRAAGVPDDGVIGAVSLKAINSLPENDL --1111---------------------------1111----------------------- LLRFNAERLVFYTKLGTFTSFGKGWVRRVAQNLIHASA ----------------------------------1111 >RNA URIDYLYL TRANSFERASE; SWP:Q381M1; PDB:2IKFA; PSPAVVGRSLVNSFKQFVSRHVDATYRLVLDCVAAVDPLMRLYTFGSTVVYGVHEKGSDV --------------1111------------------1111-----3333-----2222-- DFVVLNKTDVEDGKGGDAATQVAKGLQADILAKLARVIRQKHLSWNVEEVRRTRVPVVRV -----3333--------------------------------3333--------------- KGGGAVDFDITAYRRNGVRNSALLRAYFEQNPPCRWLSMSIKRWSKQTGLNASVIGGSIT ------------------------------3333----------------1111------ SYGFNLMVVYYLLQRNHLQFVPPSTIDVSRVEPLPPHLPLEEPADEGLELGTQVLDFLHF ------------1111-----3333-3333--------------iiii------------ FLHEFDSDKQVISLNRPGITTKEELDWTKSAEDFARMNGEKVHYQWCIEDPYELNLNVGR 
-----1111-----------3333---3333-----%%%%----------------1111 NVTPLKRDFLRRHLEKARDTALLTIV ---------------3333-%%%%-- >HYPOTHETICAL TRANSCRIPTIO; SWP:O32152; PDB:2IKKA; GTENLYFQHHVLSHDIIPASKPIAEKLQIQPESPVVELKRILYNDDQPLTFEVTHYPLDL -----------------------------2222-----------------------3333 FPGIDTFIADGVSHDILKQQYKVVPTHNTKLLNVVYAQQEESKYLDCDIGDALFEIDKTA 222211112222-----------------------------------2222--------- FTSNDQPIYCSLFLHTNRVTFTIN -2222---------1111------ >DNA-BINDING TRANSCRIPTION; SWP:P0ACP1; PDB:2IKSA; GRTRSIGLVIPDLENTSYTRIANYLERQARQRGYQLLIACSEDQPDNERCIEHLLQRQVD -----------------------------1111-------%%%%3333------1111-- AIIVSTSLPPEHPFYQRWANDPFPIVALDRALDREHFTSVVGADQDDAELAEELRKFPAE --------11113333-------------------------------------------- TVLYLGALPELSVSFLREQGFRTAWKDDPREVHFLYANSYEREAAAQLFEKWLETHPPQA -------3333-------------1111-------------------------------- LFTTSFALLQGVDVTLRRDGKLPSDLAIATFGDNELLDFLQCPVLAVAQRHRDVAERVLE ------------------------------------------------------------ IVLASLDEPRKPKPGLTRIKRNLYRRGVLSRS ---------------------------1111- >RAB12; SWP:Q6IQ22; PDB:2IL1A; PADFKLQVIIIGSRGVGKTSLMERFTDSTVGVDFKIKTVELRGKKIRLQIWDTAGQERFN ------------2222------------2222--------iiii-----------3333- SITSAYYRSAKGIILVYDITKKETFDDLPKWMKMIDKYASEDAELLLVGNKLDCETDREI 33333333---------11113333--------------1111-------33331111-- TRQQGEKFAQQITGMRFCEASAKDNFNVDEIFLKLVDDILKKM -----------2222------1111-3333---------1111 >HYPOTHETICAL PROTEIN; SWP:Q2FVU2; PDB:2IL5A; NVENEHVEVEIEKLYKFSPELVYEAWTKKDLLKQWFTSARTNKEIEADVKEGGKYRIVDQ 1111-------------3333-3333-1111------3333--------2222------- QRNGKVNVIEGIYESLVDEYVKTIGPSETQDVIEVEFFERETGGTQLFYYRSLVEKERRF -iiii---------------------------------------------------2222 TNLEYKQKKKEYHDAVHGFELFDKYHVIETSTQQ ---------------------------1111--- >INTERLEUKIN-10; SWP:P22301; PDB:2ILK; TQSENSCTHFPGNLPNMLRDLRDAFSRVKTFFQMKDQLDNLLLKESLLEDFKGYLGCQAL -----------------------3333-------------------------1111---- 
SEMIQFYLEEVMPQAENQDPDIKAHVNSLGENLKTLRLRLRRCHRFLPCENKSKAVEQVK -------------3333-1111--------------------%%%%3333---------- NAFNKLQEKGIYKAMSEFDIFINYIEAYMTMKIRN ------3333------------------------- >TITIN; SWP:Q8BUJ0; PDB:2ILLA; MAPHFKEELRNLNVRYQSNATLVCKVTGHPKPIVKWYRQGKEIIADGLKYRIQEFKGGYH --------------2222-------------------iiii------------------- QLIIASVTDDDATVYQVRATNQGGSVSGTASLEVEVPAKIHLPKTLEGMGAVHALRGEVV -------3333---------1111---------------------iiii-----2222-- SIKIPFSGKPDPVITWQKGQDLIDNNGHYQVIVTRSFTSLVFPNGVERKDAGFYVVCAKN -----------------------------------------1111-3333---------1 RFGIDQKTVELDVAD 111------------ >FANCONI ANEMIA GROUP E PR; SWP:Q9HB96; PDB:2ILRA; AESLELPKAIQDQLPRLQQLLKTLEEAPPVELQLLHECSPSQMDLLCAQLQLPQLSDLGL ----------------------1111--33333333--------------3333------ LRLCTWLLALSPDLSLSNATVLTRSLFLGRILSLTSSASRLLTTALTSFAAKYTYPVCSA ------1111--------------------1111-------------------------- LLDPVLQAPGTGPAQTELLCCLVKMESLEPDAQVLMLGQILELPWKEETFLVLQSLLERQ -----------3333---------1111-----------1111----------------- VEMTPEKFSVLMEKLCKTTSMAYAKLMLTVMTKYQANITETQRLGLAMALEPNTTFLRKS ---------------------------------3333----------1111--------- LKAALKHLG -----3333 >2',3'-CYCLIC-NUCLEOTIDE 3; SWP:P13233; PDB:2ILXA; GSHMFLPLYFGWFLTKKSSETLRKAGQVFLEELGNHKAFKKELRHFISGDEPKEKLDLVS ---------------3333-------------------3333-3333---------3333 YFGKRPPGVLHCTTKFCDYGKATGAEEYAQQDVVRRSYGKAFKLSISALFVTPKTAGAQV ----------------%%%%-------------3333--------------1111----- VLNEQELQLWPSDLDKPSSSESLPPGSRAHVTLGCAADVQPVQTGLDLLEILQQVKGGSQ ------11111111-----------1111------11113333----------------- GEEVGELPRGKLYSLGKGRWMLSLAKKMEVKAIFTGYYG ------1111----------------------------- >NICOTINATE PHOSPHORIBOSYL; SWP:Q7MXV0; PDB:2IM5A; IIRSILDTDLYKFTTGYAYAKLFPRAYGEFRFIDRNRQGFTEEFAELVRGEIRAMAALSL ---1111-3333----------1111-------1111-----------------1111-- TRDEKEFLQRELPYLPPIYIDFLDGFRFDPEEVTVSIDAQGHLDIRAQGLLYRVTLWETP -----------33333333---------1111-----1111--------33333333--- 
ILAVISELYYRFIGAEPDWKQVEEVTRSKGELMREHRATFSIFGMRRRFSLEVEDRVTDI ----------1111-----------------------------3333------------- LKQYAGESLFGTSNVHLAHKHGLRVSGTHPHEWIQFHGAIYGYKMANYVAMEDWINVYDG ----!!!!---------------------3333---------1111-----------iii DLGTVLTDTYTTDVFMRNFSKKHAMLFTSLRHDSGDPEIFIEKAVRRYEELRVDPKIKYI i--------------1111-----------------------------1111-3333--- IFSDSLTPQRAIEIQKLCAGRIKASFGIGTNLTNDVGGGVEPLNIVMKLWKCKMTAKDDW -----------------2222-------3333----iiii--------------1111-- HYCVKLSDVDGKHTGEPEEILLAMNTLGI ----------------------------- >HYPOTHETICAL PROTEIN YPPE; SWP:P50833; PDB:2IM8A; LSQTLLETEQIEVAEKGADRYQEGKNSNHSYDFFETIKPAVEENDELAARWAEGALELIK 3333-------------------------------------------------------- VRRPKYVHKEQIEAVKDNFLELVLQSYVHHIHKKRFKDITESVLYTLHAVKDEIAREDSR --------------------------------------------------------1111 >HYPOTHETICAL PROTEIN; SWP:Q5ZY11; PDB:2IM9A; NSAAIEEQANSSIRKLYHTLNTTSMADRISQISAYFKGTKYILGSLGEGPNARYDQFPRY ------------------1111-----------1111-----------2222-------- RVDGFDCDTYVNTVLSLALANSLESFQECLKHTRYKNGKRSYINRNHFTSIDWNNYNQKR ----------------------------------2222--3333---------------- GLLKDITFSIRNEKKQPVALYANALINKPQWYNHKTIDTIRLQKQDKNEQEKRLVELKAK -----3333--1111----------------11113333--------------------- GKTLETSLSNVPYIPFTALFSENKPNLHLFSQIPNGAVIEIIRPNWDLRQQIGTELDISH 1111----------3333-2222---3333---2222----------3333--------- LGFAIWINNELFFRQASSQYGKVVDVSLIDYLDKARSSPTIKGINIQVVLPEKPVCQLF ------%%%%-------1111-------------------------------------- >DUAL SPECIFICITY PROTEIN ; SWP:Q9BVJ7; PDB:2IMGA; GVQPPNFSWVLPGRLAGLALPRLPAHYQFLLDLGVRHLVSLTERGPPHSDSCPGLTLHRL ----------2222--------3333-------------------2222--1111----- RIPDFCPPAPDQIDRFVQIVDEANARGEAVGVHCALGFGRTGTLACYLVKERGLAAGDAI --2222--3333-----------1111-----------3333------------------ AEIRRLRPGSIETYEQEKAVFQFYQRTK ---------------------------- >HYPOTHETICAL PROTEIN UNP ; SWP:Q5LQD5; PDB:2IMHA; SLTFSILAHDPETGAIGGAAATGSLCVGGWVLRGDLNAGSASQGAAPSTFWGEEVLQHLR 
----------1111------------3333----1111---------3333-------11 DGSHPEDAVNHVTSQDSGRAYRQLAADLLGNAAAFTGSENQDIKGSVTFASGIASGNLGD 11-------------1111-------1111------1111-------------------3 NSVLGATEAFVASDLTFERRLLAALIAAEGAGGLLSAALVLHPDRPPVTLRIDYHPDNPI 333-------------------------1111---------1111--------------- GALEQLYQKATTGDYADWARQVPVLSDKERILDEGHHHHHH -----------------1111--1111-------------- >HYPOTHETICAL PROTEIN DUF1; SWP:Q4KBL6; PDB:2IMJA; AQVRPPLPPFTRESAIEKIRLAEDGWNSRDPERVSLAYTLDTQWRNRAEFAHNREEAKAF -------------------------1111-----1111-------!!!!----------- LTRKWAKELDYRLIKELWAFTDNRIAVRYAYEWHDDSGNWFRSYGNENWEFDEQGLARRF --1111--------------!!!!---------------------------1111----- ACINDPIKAQERKFHWPLGRRPDDHPGLSELGLEHH -------3333----------1111-3333------ >HYPOTHETICAL PROTEIN; SWP:O28442; PDB:2IMLA; LRLADFGFTDGINEIIAITENEDGSWNAAPIGIIVEDSSSDTAKAKLYRNRTRANLERSG -3333---------------1111------------1111-------------------- VLFANVTDDALVFAVSSFGNLNDDWYASPNPPIIKGAAWCRFEAERSGVAHLKLTDGEII --------------3333----1111-------2222--------iiii----------- EKRVRAINRGLSAVIEALVHATRYVAIKSDERRKELLERIHYYREIVQKCGSEREKRAFE ----------------------3333---------------------------------- IIEKIGEG --1111-- >IGA-KAPPA MCPC603 FV (LIG; SWP:Q6KB05; PDB:2IMN; DIVMTQSPSSLSVSAGERVTMSCKSSQSLLYKDGKNFLAWYQQKPGQPPKLLIYGASTRE -------------2222-------------1111---------2222------------2 SGVPDRFTGSGSGTDFTLTISSVQAEDLAVYYCQNDHSYPLTFGAGTKLELKR 2221111----------------1111-------------------------- >LACTALDEHYDE DEHYDROGENAS; SWP:P25553; PDB:2IMPA; VPVQHPMYIDGQFVTWRGDAWIDVVNPATEAVISRIPDGQAEDARKAIDAAERAQPEWEA --------iiii---------------------------3333----------------- LPAIERASWLRKISAGIRERASEISALIVEEGGKIQQLAEVEVAFTADYIDYMAEWARRY -3333------------------------------------------------1111--- EGEIIQSDRPGENILLFKRALGVTTGILPWNFPFFLIARKMAPALLTGNTIVIKPSEFTP --------2222-------------------3333---------1111-------1111- NNAIAFAKIVDEIGLPRGVFNLVLGRGETVGQELAGNPKVAMVSMTGSVSAGEKIMATAA 
---------------2222-----------------1111-----------------333 KNITKVLELGGKAPAIVMDDADLELAVKAIVDSRVINSGQVCNCAERVYVQKGIYDQFVN 3----------------1111-----------1111%%%%-----------1111----- RLGEAMQAVQFGNPAERNDIAMGPLINAAALERVEQKVARAVEEGARVAFGGKAVEGKGY ----3333----3333-------------------------1111--------------- YYPPTLLLDVRQEMSIMHEETFGPVLPVVAFDTLEDAISMANDSDYGLTSSIYTQNLNVA ----------33331111--------------3333------------------------ MKAIKGLKFGETYINRENFEAMQGFHAGWRKSGIGGADGKHGLHEYLQTQVVYLQS --------------------1111-------------------1111--------- >HYPOTHETICAL PROTEIN DR_0; SWP:Q9RW45; PDB:2IMRA; HTPRLLTCDVLYTGAQSPGGVVVVGETVAAAGHPDELRRQYPHAAEERAGAVIAPPPVNA ----------------------------------------1111---------------- HTHLDMSAYEFQALPYFQWIPEVVIRGRHLRGVAAAQAGADTLTRLGAGGVGDIVWAPEV ------3333---3333-------1111---------------1111---------3333 MDALLAREDLSGTLYFEVLNPFPDKADEVFAAARTHLERWRRLERPGLRLGLSPHTPFTV ------1111-----------3333--------------3333-2222-------1111- SHRLMRLLSDYAAGEGLPLQIHVAEHPTELEMFRTGGGPLWDNRMPALYPHTLAEVIGRE -------------------------3333----------1111-3333---3333----- PGPDLTPVRYLDELGVLAARPTLVHMVNVTPDDIARVARAGCAVVTCPRSNHHLECGTFD -1111----------3333----------------------------------------- WPAFAAAGVEVALGTDSVASGETLNVREEVTFARQLYPGLDPRVLVRAAVKGGQRVVGTP ----1111--------3333----3333--------1111-------------------- FLRRGETWQEGFRWELSRDL --2222--33333333---- >APOPTOSIS REGULATOR BAK; SWP:Q16611; PDB:2IMSA; LPSASEEQVAQDTEEVFRSYVFYRHQQEQEAEGVAAPADPEVTLPLQPSSTGQVGRQLAI -------------------------------!!!!-----------1111---------- IGDDINRRYDSEFQTLQHLQPTAENAYEYFTKIATSLFESGINWGRVVALLGFGYRLALH ---------------1111----------------1111--------------------- VYQHGLTGFLGQVTRFVVDFLHHCIARWIAQRGGWVAAL -1111------------------------1111--1111 >UFM1-CONJUGATING ENZYME 1; SWP:Q9Y3C8; PDB:2IN1A; DEATRRVVSEIPVLKTNAGPRDRELWVQRLKEEYQSLIRYVENNKNADNDWFRLESNKEG -------1111-----------3333------------------1111--------1111 TRWFGKCWYIHDLLKYEFDIEFDIPITYPTTAPEIAVPELDGKTAKYRGGKICLTDHFKP 
----------iiii-------------3333-----3333------2222----1111-- LWARNVPKFGLAHLALGLGPWLAVEIPDLIQKGVIQHKEK ---------3333-----------------------%%%% >HYPOTHETICAL PROTEIN; SWP:Q82S11; PDB:2IN3A; EKPVLWYIADPCSWCWGFAPVIENIRQEYSAFLTVKIPGGTNTPLLPEKRAQILHHWHSV ----------------------------1111---------------------------- HITTGQPFTFENALPEGFIYDTEPACRGVVSVSLIEPEKVFPFFAAIQRAFYVGQEDVAQ ---------2222-2222-----------------3333-----------------1111 LAILKKLAVDLGIPESRFTPVFQSDEAKQRTLAGFQRVAQWGISGFPALVVESGTDRYLI 3333----1111-3333-----------------------------------!!!!---- TTGYRPIEALRQLLDTWLQQHG ---------------------- >HYPOTHETICAL LIPOPROTEIN ; SWP:P75884; PDB:2IN5A; HSQQSVDTFRASLFDNQVADQQIQALPYSTYLRLNEGQRIFVVLGYIEQEQSKWLSQDNA ------------------33331111-------%%%%----------%%%%----1111- LVTHNGRLLKTVKLNNNLLEVTNSGQDPLRNALAIKDGSRWTRDILWSEDNHFRSATLSS ---iiii---------------1111----3333-2222---------%%%%-------- TFSFAGLETLNIAGRNVLCNVWQEEVTSTRPEKQWQNTFWVDSATGQVRQSRQLGAGVIP -----------%%%%---------------------------------------2222-- VETFLKPAPL ---------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2INBA; DVFHQVVKIALEKDGWQITNDPLTISVGGVNKLIAAEREGEKIAVEVKSFLERSSAISEF -----------1111-----------iiii-------iiii------------------- HTALGQFINYRGALRRRQPERVLYLAVPLTTYKTFFQLDFPKEIAENQVKLIYDVEQEVI ------------3333-1111-----------------3333--1111------1111-- FQWIN ----- >TOUB PROTEIN; SWP:O87798; PDB:2INCA; SMLKREDWYDLTRTTNWTPKYVTENELFPEEMSGARGISMEAWEKYDEPYKITYPEYVSI ---3333--3333---------3333--3333--iiii3333------------------ QREKDSGAYSIKAALERDGFVDRADPGWVSTMQLHFGAIALEEYAASTAEARMARFAKAP ---------------33333333------------------------------------- GNRNMATFGMMDENRHGQIQLYFPYANVKRSRKWDWAHKAIHTNEWAAIAARSFFDDMMM ----------------------33331111--------1111------------------ TRDSVAVSIMLTFAFETGFTNMQFLGLAADAAEAGDHTFASLISSIQTDESRHAQQGGPS -------------------------------1111-------------3333-------- LKILVENGKKDEAQQMVDVAIWRSWKLFSVLTGPIMDYYTPLESRNQSFKEFMLEWIVAQ --------------------------------3333----1111---------------- 
FERQLLDLGLDKPWYWDQFMQDLDETHHGMHLGVWYWRPTVWWDPAAGVSPEEREWLEEK -----1111---1111--------------------3333-------------------- YPGWNDTWGQCWDVITDNLVNGKPELTVPETLPTICNMCNLPIAHTPGNKWNVKDYQLEY --3333------------11113333--------------------!!!!---------i EGRLYHFGSEADRWCFQIDPERYKNHTNLVDRFLKGEIQPADLAGALMYMSLEPGVMGDD iii------------33331111---------1111-----------1111--------1 AHDYEWVKAYQ 11133331111 >Toluene, o-xylene monooxy; SWP:O87802; PDB:2INCB; ALKPLKTWSHLAGNRRRPSEYEVVSTNLHYFTDNPERPWELDSNLPMQTWYKKYCFDSPL -------1111------------------11111111----1111---------1111-- KHDDWNAFRDPDQLVYRTYNLLQDGQESYVQGLFDQLNDRGHDQMLTREWVETLARFYTP ---3333--1111---------------------------3333---------------- ARYLFHALQMGSVYIHQIAPASTITNCATYETADHLRWLTHTAYRTRELANCYPDVGFGK ---------------1111---------------------------------1111---- RERDVWENDPAWQGFRELIEKALIAWDWGEAFTAINLVTKPAVEEALLQQLGSLAQSEGD --------3333---------1111------------------------------1111- TLLGLLAQAQKRDAERHRRWSSALVKMALEKEGNREVLQKWVAKWEPLADKAIEAYCSAL ------------------------------2222---------------------3333- PDGENAIVEAKSASRYVRQMMG ------------------1111 >TouB protein; SWP:O87799; PDB:2INCC; TFPIMSNFERDFVIQLVPVDTEDTMDQVAEKCAYHSINRRVHPQPEKILRVRRHEDGTLF -------2222--------1111------------2222----1111------------- PRGMIVSDAGLRPTETLDIIFMD 11113333---2222-------- >UROPORPHYRINOGEN DECARBOX; SWP:P32395; PDB:2INFA; TFNETFLKAARGEKADHTPVWYMRQAGRSQPEYRKLKEKYGLFEITHQPELCAYVTRLPV --------1111--------------3333------------------3333------33 EQYGVDAAILYKDIMTPLPSIGVDVEIKNGIGPVIDQPIRSLADIEKLGQIDPEQDVPYV 33----------11113333--------------------33331111---3333-3333 LETIKLLVNEQLNVPLIGFSGAPFTLASYMTEGGPSKNYNKTKAFMYSMPDAWNLLMSKL ------------------------------------------------------------ ADMIIVYVKAQIKAGAKAIQIFDSWVGALNQADYRTYIKPVMNRIFSELAKENVPLIMFG -----------1111-------1111-----------------------1111------- VGASHLAGDWHDLPLDVVGLDWRLGIDEARSKGITKTVQGNLDPSILLAPWEVIEQKTKE --3333---3333-------1111-----1111----------3333--3333------- 
ILDQGMESDGFIFNLGHGVFPDVSPEVLKKLTAFVHEYSQNKKM ---1111------------1111--------------------- >PHENOL HYDROXYLASE COMPON; SWP:Q84AQ2; PDB:2INPA; KKLNLKDKYQYLTRDMAWEPTYQDKKDIFPEEDFEGIKITDWSQWEDPFRLTMDAYWKYQ ------------3333-------3333-----1111----1111-------3333----- AEKEKKLYAIFDAFAQNNGHQNISDARYVNALKLFISGISPLEHAAFQGYSKVGRQFSGA --------------------------------------3333------------------ GARVACQMQAIDELRHSQTQQHAMSHYNKHFNGLHDGPHMHDRVWYLSVPKSFFDDARSA -------------------------3333--------------3333------------- GPFEFLTAISFSFEYVLTNLLFVPFMSGAAYNGDMATVTFGFSAQSDEARHMTLGLEVIK ----------------3333---------------------------------------- FILEQHEDNVPIVQRWIDKWFWRGFRLLSLVSMMMDYMLPNKVMSWSEAWEVYYEQNGGA -----1111--------------------------------------------------- LFKDLERYGIRPPKYQDVANDAKHHLSHQLWTTFYQYCQATNFHTWIPEKEEMDWMSEKY ----3333----2222-------------------------------------------- PDTFDKYYRPRYEYLAKEAAAGRRFYNNTLPQLCQVCQIPTIFTEKDAPTMLSHRQIEHE --------------------------------------------2222----------ii GERYHFCSDGCCDIFKHEPEKYIQAWLPVHQIYQGNCEGGDLETVVQKYYHINIGEDNFD ii-----3333------33331111---------------------------2222---- YVGSPDQKHWLSIK 2222---------- >PHENOL HYDROXYLASE COMPON; SWP:NA; PDB:2INPC; PIRHTYGHIARRFGDKPATRYQEASYDIEAKTNFHYRPQWDSEHTLNDPTRTAIRMEDWC -------------------------------------1111------1111------111 AVSDPRQFYYGAYVGNRAKMQESAETSFGFCEKRNLLTRLSEETQKQLLRLLVPLRHVEL 1--1111-3333-------------------1111-1111-----------3333----- GANMNNAKIAGDATATTVSQMHIYTGMDRLGIGQYLSRIALMIDGSTGAALDESKAYWMD --------------3333-------------------------%%%%------------- DEMWQPMRKLVEDTLVVDDWFELTLVQNILIDGMMYPLVYDKMDQWFESQGAEDVSMLTE 3333-------------------------------------------1111--3333--- FMRDWYKESLRWTNAMMKAVAGESETNRELLQKWIDHWEPQAYEALKPLAEASVGIDGLN -------------------3333------------------------------------- EARAELSARLKKFELQSR --------3333------ >Phenol hydroxylase compon; SWP:Q84AQ1; PDB:2INPE; SVNALYDYKFEPKDKVENFHGMQLLYVYWPDHLLFCAPFALLVQPGMTFSALVDEILKPA 
-----------11111111------------------------1111-----------11 TAAHPDSAKADFLNAEWLLNDEPFTPKADASLKEQGIDHKSMLTVTTPGLKGMANAGY 11-3333---3333----iiii----11113333---2222-----2222--%%%%-- >PHENOL HYDROXYLASE COMPON; SWP:NA; PDB:2INPL; QLVFIVFQDNDDSRYLAEAVMEDNPDAEMQHQPAMIRIQAEKRLVINRETMEEKLGRDWD ----------3333--------------------------------3333---------3 VQEMLINVSIAGNVDEDHFILEW 3333333---------------- >INULIN FRUCTOTRANSFERASE; SWP:Q3SAG3; PDB:2INUA; PNTYDVTTWRIKAHPEVTAQSDIGAVINDIIADIKQRQTSPDARPGAAIIIPPGDYDLHT ----3333--1111---3333------------------1111----------------- QVVVDVSYLTIAGFGHGFFSRSILDNSNPTGWQNLQPGASHIRVLTSPSAPQAFLVKRAG -----------------------1111-2222--------------3333---------- DPRLSGIVFRDFCLDGVGFTPGKNSYHNGKTGIEVASDNDSFHITGGFVYLEHALIVRGA ---------------------1111----------------------------------- DALRVNDNIAECGNCVELTGAGQATIVSGNHGAGPDGVTLLAENHEGLLVTGNNLFPRGR ---------------------------------1111----------------------- SLIEFTGCNRCSVTSNRLQGFYPGLRLLNGCKENLITANHIRRTNEGYPPFIGRGNGLDD -----------------------------------------------3333--------- LYGVVHIAGDNNLISDNLFAYNVPPANIAPAGAQPTQILIAGGDANVVALNHVVSDVASQ -----------------------3333--2222--------------------------- HVVLDASTTHSKVLDSGTASQITSYSSDTAIRPTP ----3333---------1111-------------- >PUTATIVE STRUCTURAL PROTE; SWP:Q83JN9; PDB:2INWA; TLPGTTPPDDNHDRPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGIRGRFSDVDAYHLD ----------1111-------------------!!!!---1111---------------- QAFPLLKQLELLTGGELNPRHQHTVTLYAKGLTCEADTLGSCGYVYLAVYPTPAA --3333-----1111--1111-------iiii-----%%%%-------------- >ASF1A PROTEIN; SWP:Q69DB9; PDB:2IO5A; MAKVQVNNVVVLDNPSPFYNPFQFEITFECIEDLSEDLEWKIIYVGSAESEEYDQVLDSV ----------------3333--------------------------33331111------ LVGPVPAGRHMFVFQADAPNPGLIPDADAVGVTVVLITCTYRGQEFIRVGYYVNNEYTET -------------------3333-3333------------%%%%--------------33 ELRENPPVKPDFSKLQRNILASNPRVTRFHINWE 33--------1111-----1111----------- >Bifunctional glutathionyl; SWP:P0AES0; PDB:2IO8A; APFGTLLGYAPGGVAIYSSDYSDDAVFRSYIDDEYMGHKWQCVEFARRFLFLNYGVVFTD 
-2222----2222----------3333---!!!!-------------------------- VGMAWEIFSLRFLREVVNDNILPLQAFPNGSPRAPVAGALLIWDKGGEFKDTGHVAIITQ --33331111-----------------2222----2222--------------------- LHGNKVRIAEQNVIHSPLPQGQQWTRELEMVVENGCYTLKDTFDDTTILGWMIQTEDTEY ------------------2222-----------------------------------222 SLPQPEIAGELLKISGARLENKGQFDGKWLDEKDPLQNAYVQANGQVINQDPYHYYTITE 2------3333-----------1111----1111----------------1111------ SAEQELIKATNELHLMYLHATDKVLKDDNLLALFDIPKILWPRLRLSWQRRRHHMITGRM --------------------------33333333--3333----------1111------ DFCMDERGLKVYEYNADSASCHTEAGLILERWAEQGYKGNGFNPAEGLINELAGAWKHSR ----1111----------------------------------1111-------------- ARPFVHIMQDKDIEENYHAQFMEQALHQAGFETRILRGLDELGWDAAGQLIDGEGRLVNC -----------3333-----------1111--------3333--3333------------ VWKTWAWETAFDQIREVEFAAVPIRTGHPQNEVRLIDVLLRPEVLVFEPLWTVIPGNKAI -----3333----3333----------------33331111-------3333-1111--- LPILWSLFPHHRYLLDTDFTVNDELVKTGYAVKPIAGRCGSNIDLVSHHEEVLDKTSGKF -------2222----------------------1111%%%%-----1111---------1 AEQKNIYQQLWCLPKVDGKYIQVCTFTVGGNYGGTCLRGDESLVIKKESDIEPLIVVK 111------------iiii--------iiii--------------1111--------- >CELLULAR TUMOR ANTIGEN P5; SWP:P02340; PDB:2IOIA; QKTYQGNYGFHLGFLQSGTAKSVMCTYSPPLNKLFCQLAKTCPVQLWVSATPPAGSRVRA -----1111----------1111-----1111----2222------------2222---- MAIYKKSQHMTEVVRRCPHHERCSDGDGLAPPQHLIRVEGNLYPEYLEDRQTFRHSVVVP -----3333-------3333----------1111-------------------------- YEPPEAGSEYTTIHYKYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPG -------------------1111---iiii---------1111----------------- RDRRTEE ------- >HYPOTHETICAL PROTEIN AF_1; SWP:O29056; PDB:2IOJA; GLSVEEIREAVSGEYLIEPREEKVEQVVIGASPQSALRYLREARNAALVTGGDRSDLLLT --------------------------------------3333--------1111------ ALEPNVRCLILTGNLEPVQLVLTKAEERGVPVILTGHDTLTAVSRLESVFGRTRIRG -----------%%%%--3333-----------------------------1111--- >CHAPERONE PROTEIN HTPG; SWP:P0A6Z3; PDB:2IORA; HMKGQETRGFQSEVKQLLHLMIHSLYSNKEIFLRELISNASDAADKLRFRALSNPDLYEG 
------------------------3333---------------------3333----%%% DGELRVRVSFDKDKRTLTISDNGVGMTRDEVIDHLGTIAKSGTKSFLESLGSDQAKDSQL %----------1111------------------------2222---1111---------- IGQFGVGFYSAFIVADKVTVRTRAAGEKPENGVFWESAGEGEYTVADITKEDRGTEITLH 1111--1111-------------22223333----------------------------- LREGEDEFLDDWRVRSIISKYSDHIALPVEIE -22221111-----------3333-------- >PERIPLASMIC SUGAR-BINDING; SWP:Q8RD41; PDB:2IOYA; KTIGLVISTLNNPFFVTLKNGAEEKAKELGYKIIVEDSQNDSSKELSNVEDLIQQKVDVL -----------3333----------------------%%%%------------------- LINPVDSDAVVTAIKEANSKNIPVITIDRSANGGDVVCHIASDNVKGGEMAAEFIAKALK -----------------1111-----------------------------------1111 GKGNVVELEGIPGASAARDRGKGFDEAIAKYPDIKIVAKQAADFDRSKGLSVMENILQAQ ----------1111----------------1111-------%%%%--------------- PKIDAVFAQNDEMALGAIKAIEAANRQGIIVVGFDGTEDALKAIKEGKMAATIAQQPALM ------------------------------------------------------------ GSLGVEMADKYLKGEKIPNFIPAELKLITKENVQ ----------1111--------------3333-- >PROBABLE PHENAZINE-SPECIF; SWP:Q9HWH2; PDB:2IP2A; NLAAARNLIQVVTGEWKSRCVYVATRLGLADLIESGIDSDETLAAAVGSDAERIHRLMRL --------------------------------1111------------------------ LVAFEIFQGDTRDGYANTPTSHLLRDVEGSFRDMVLFYGEEFHAAWTPACEALLSGTPGF -1111-----1111---33331111------------------1111------------- ELAFGEDFYSYLKRCPDAGRRFLLAMKASNLAFHEIPRLLDFRGRSFVDVGGGSGELTKA ----------------------------------1111---2222------!!!!----- ILQAEPSARGVMLDREGSLGVARDNLSSLLAGERVSLVGGDMLQEVPSNGDIYLLSRIIG ----1111------2222-----------1111-------3333-------------333 DLDEAASLRLLGNCREAMAGDGRVVVIERTISASEPSPMSVLWDVHLFMACAGRHRTTEE 3-----------------2222--------------3333-------------------- VVDLLGRGGFAVERIVDLPMETRMIVAARA ------------------%%%%-------- >CLASS A NONSPECIFIC ACID ; SWP:Q934J6; PDB:2IPBA; MQPFHSPEESVNSQFYLPPPPGNDDPAFRYDKEAYFKGYAIKGSPRWKQAAEDADISVEN -----1111--3333------1111-------------1111------------------ IARIFSPVVGAKINPKDTPETWNMLQNLLKMGGYYATASAKKYYMRTRPFVLFNHSTCRP ------3333---3333-----------------1111--------------------33 
EDENTLRKDGSYPSGHDAYSTLLALVLSQARPERAQELARRGWEFGQSRVICGAHWQSDV 33-3333-----------------------3333---------------------3333- DAGRYVGAVEFARLQTIPAFQKSLAKVREELNDKNNLLS --------------------------------3333--- >ACLACINOMYCIN OXIDOREDUCT; SWP:Q0PCD7; PDB:2IPIA; ALVKVDRVDRRYQDLVTRGFNGRFRGRPDVVYVVHTADQVVDAVNQAMAAGQRIAVRSGG --------1111--------3333-----------------------1111--------- HCFEGFVDDPAVRAVIDMSQMRQVFYDSGKRAFAVEPGATLGETYRALYLDWGVTIPAGV -11111111--------1111--------------3333--------------------- CPQVGVGGHVLGGGYGPLSRRDGVVADHLYAVEVVVVDASGRARKVVATSAADDPNRELW 1111-----1111--1111----1111----------1111---------1111------ WAHTGGGGGNFGIVTRYWFRTPGATGTDPSQLLPKAPTSTLRHIVTWDWSALTEEAFTRI ---------------------------3333----------------3333--------- IDNHGAWHQSNSAAGTPYASMHSVFYLNSRAAGQILLDIQIDGGLDGAEALLNDFVAAVN ---------------1111----------------------1111--------------1 EGTGVEPAVQRSTEPWLRATLANKFDTGGFDRTKSKGAYLRKPWTAAQAATLYRHLSADS 111---------------1111-------------------------------------- QVWGEVSLYSYGGKVNSVPETATATAQRDSIIKVWMSATWMDPAHDDANLAWIREIYREI -----------!!!!---1111-------------------3333--------------- FATTGGVPVPDDRTEGTFINYPDVDLVDERWNTSGVPWYTLYYKGNYPRLQKVKARWDPR 1111------1111---1111---------------------!!!!-----------111 DVFRHALSVRPP 1----------- >Putative uncharacterized ; SWP:Q8Z1C5; PDB:2IPQX; LSSTELGDLFWSWLRDGLREGDIPVNTADACVHLTCGFVFISVPGVFFLFLKSHGRKEQV -3333------------1111-----1111----iiii-----------3333------- QAAFEKRKHRVSDSRRFWQCCLYEEPGGRGRYKKLTGYLIKSEIYNGNFPDDSLFLKVI -----------iiii-------------------------------------------- >RRNA 2'-O-METHYLTRANSFERA; SWP:P22087; PDB:2IPXA; VEPHRHEGVFICRLVTKNLVPGESVYGEKRVSISKIEYRAWNPFRSKLAAAILGGVDQIH -----2222----------2222------------------3333--------------- IKPGAKVLYLGAASGTTVSHVSDIVGPDGLVYAVEFSHRSGRDLINLAKKRTNIIPVIED -2222------!!!!3333------1111-------3333------33331111-----3 ARHPHKYRLIAVDVIFADVAQPDQTRIVALNAHTFLRNGGHFVISIKANCIDSTASAEAV 333-----------------1111------------2222------1111---------- 
FASEVKKQQENKPQEQLTLEPYERDHAVVVGVYRP -------1111--------3333------------ >IRON-RESPONSIVE ELEMENT-B; SWP:Q01059; PDB:2IPYA; SNPFAYLAEPLDPAQPGKKFFNLNKLDYSRYGRLPFSIRVLLEAAVRNCDKFLVKKEDIE -1111------1111------3333-333311113333-------1111-----3333-- NILNWNVTQHMNIEVPFKPARVILQDFTGVPSVVDFAAMRDAVKKLGGDPEKINPICPVD ------3333--------------!!!!---------------1111-1111-------- LVIDFERNRERFEFLKWGSKAFRNMRIIPPGSGIIHQVNLEYLARVVFDQDGYYYPDSLV ----3333--------------------2222--------1111-----%%%%------- GTDSHTTMIDGLGVLGWGVGGIEAEAVMLGQPISMVLPQVIGYRLMGKPHPLVTSTDIVL --11113333----------------1111-------------------1111------- TITKHLRQVGVVGKFVEFFGPGVAQLSIADRATIANMCPEYGATATFFPVDEVSIKYLVQ ----3333--2222-----3333-------------3333------------------11 TGRDESKVKQIRKYLQAVGMFRDYSDPSQDPDFTQVVELDLKTVVPCCSGPKRPQDKVAV 11--------------------11113333---------3333---------3333--33 SDMKKDFESCLGAKQGFKGFQVAPDHHNDHKTFIYNDSEFTLSHGSVVIAAITSSTNTSN 33--------------------1111----------------2222-------------- PSVMLGAGLLAKKAVDAGLNVKPYVKTSLSPGSGVVTYYLRESGVMPYLSQLGFDVPLPE --------------1111---1111--------3333---1111-----1111-----33 PVVEAITQGDLVAVGVLSGNRNFEGRVHPNTRANYLASPPLVIAYAIAGTIRIDFEKEPL 33-------------------------1111----------------------3333--- GTNAKGQQVFLRDIWPTREEIQAVERQYVIPGMFTEVYQKIETVNASSDKLYLWNPKSTY ---------3333-----------------------------------------1111-- IKSPPFFENLTLDLQPPKSIVDAYVLLNLGDSVTTDHISPAGNIARNSPAARYLTNRGLT ----1111-------------------------3333-------1111------1111-3 PREFNSYGSRRGNDAIMARGTFANIRLLNRFLNKQAPQTIHLPSGETLDVFDAAERYQQE 333--33331111----1111--1111-3333---------------------------- GHPLIVLAGKEYGSGSSRDWAAKGPFLLGIKAVLAESYERIHRSNLVGMGVIPLEYLPGE -----------------1111----1111-----------------1111------2222 NADSLGLTGRERYTIIIPENLTPRMHVQVKLDTGKTFQAVIRFDTDVELTYFHNGGILNY ------------------------------1111-------------------------- MIRKMAK ------- >PROTEIN PHOSPHATASE 2C KA; SWP:Q8N3J5; PDB:2IQ1A; ISLENVGCASQIGKRKENEDRFDFAQLTDEVLYFAVYDGHGGPAAADFCHTHMEKCIMDL 
-3333----------------------1111----------------------------1 LPKEKNLETLLTLAFLEIDKAFSSHARLATLLTSGTTATVALLRDGIELVVASVGDSRAI 111-------------------------3333---------------------------- LCRKGKPMKLTIDHTPERKDEKERIKKCGGFVAWNSLGQPHVNGRLAMTRSIGDLDLKTS --iiii--------3333-------1111-----1111---iiii---------1111-- GVIAEPETKRIKLHHADDSFLVLTTDGINFMVNSQEICDFVNQCHDPNEAAHAVTEQAIQ -------------3333----------3333---------1111--------------11 YGTEDNSTAVVVPFGAW 11-----------1111 >PUTATIVE PREPHENATE DEHYD; SWP:Q99SX2; PDB:2IQ8A; QLYYLGPKGTFSYLACRQYFSENEATFQPKSNLFEVIKAVADDDTSIGVVPIENSIEGTI ----------------1111---------------------------------------- NIVADALAQQDVFAHGEIRLDINFALYGNGTDSISDIKKVYSIAPAISQTTNYIHQHQFD --------------------------------3333------------------1111-- YDYVDSTIQSLTKIENGVAAIAPLGSGEAYGFTPIDTHIEDYPHNVTRFLVIKNQQQFDQ ---------1111-2222------3333-------------------------------- NATSLFLITPHDKPGLLASVLNTFALFNINLSWIESRPLKTQLGYRFFVQADSAITTDIK -----------------------3333--------------------------------- KVIAILETLDFKVEIGAFN ------------------- >FANCONI ANEMIA GROUP F PR; SWP:Q9NPI8; PDB:2IQCA; EDSLMKTQAELLLERLQEVRPARFLSSLWERLPQNNFLKVIAVALLQPGSQVLVHWLLGN 3333---------1111--------------------------1111--3333------- SEVFAAFCRALPAGLLTLVTSRHPALSPVYLGLLTDWGQRLHYDLQKGIWVGTESQDVPW ----------------------3333------------------1111-----1111--- EELHNRFQSLCQAPPPLKDKVLTALETCKAQDGDFEVPGLSIWTDLLLALRSG --------1111--3333-----------1111-------------------- >HYPOTHETICAL PROTEIN XCC0; SWP:Q8PCT0; PDB:2IQIA; ATIYAPTVRVTPNPAWPQVSWQLLVAKPSAARIIDSPRINVRPTPGELQVYHGAGWAQPA ------------1111--------------3333---------1111------------- TDMLEDSVVRAFEDSGKIAAVARISDYKLAIDVRRFESDYAGQSLPAATIELNAKLLHSS ---------------------------------------%%%%----------------- DQRVVASRTFTVARPSSSTDTAAVAAAFEQALTQVTTELVGWTLITGQQDSQT -------------------3333------------------------------ >STROMAL MEMBRANE-ASSOCIAT; SWP:Q8WU79; PDB:2IQJA; RYQAVLANLLLEEDNKFCADCQSKGPRWASWNIGVFICIRCAGIHRNLGVHISRVKSVNL 
-----------3333---------------3333----------11113333-------- DQWTQEQIQCMQEMGNGKANRLYEAYLPETFRRPQIDPAVEGFIRDKYEKKKYMDRSL -----------------------11113333-------------------11113333 >FRUCTOSE-BISPHOSPHATE ALD; SWP:ALF1_PORGI; PDB:2IQTA; ANEQLQQRQAPGFVGALDQSGGSTPKALKAYGIQPDAYQSEEEFDLIHQRTRITSPAFAT -------------------3333-----1111-3333--3333-----------3333-- GKIIGVILFERTRGKIEGPTADFLWEKRHIVPFLKVDKGLQDEANGVQLKPFPELGKLCE --------3333-------------------------------%%%%----1111----- EAVGYHVFGTKRSVIKQANEQGIRDIVEQQFQWGKEILSHGLVPILEPEVDIHCPEKAKA --1111-------------------------------1111---------1111------ EEILKRELLAQLDKTEPVLKITIPTVDNFYKEIIEHPLRVVALSGGYSREQANELLSRNH -------------------------2222--------------iiii3333-------22 GVIASFSRALVEGLSARQTDAEFNALEASIEDVYQASIK 22----3333----------------------------- >Interferon regulatory fac; SWP:P23906; PDB:2IRFG; RMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAIHTGK -----------3333-2222----1111-------1111---3333---------1111- HQPGIDKPDPKTWKANFRCAMNSLPDIEEVKDRSIKKGNNAFRVYRMLP -----------------------------3333---------------- >mitogen-activated protein; SWP:Q7QD46; PDB:2IRMA; SWTDDLKVCNQTGVGEAINQIYKDDGRRCEGYESRDKKCLCISDNNTSLYAILSGHNGVT -3333-----------------3333-----------------3333------------- VAENALQEMAAELLLGQLNVCNTDEAVKELIRQSFMSVEKGYFDSINPHVATKTAIQLHL -----------------1111--------------------------------------- SVLQKLDSLNNALSVGSSAVLALIHRSHLYLGNIGNCRALLCKTDEHDTLTVTQLSVDHN ---3333----------------------------------------------------3 LLNAEEAARLFRLGLMAQNFEGVPLYSTRCIGNYLGKAGYKDCNFLSSATAEPVIFEPEI 333-------1111-33332222---------3333--3333---3333----------- VGGIQITPACRFLVLMSSGLCRALHEIFPGDASTGNRELVRMISEEFQNQSTLGGVAQSV ------3333----------------------------------1111---3333----- VHRIVQAHHDTYMQLVEEHRSVTFNSRDDVTLLIRNFNY --------------------------------------- >PUTATIVE DNA LIGASE-LIKE ; SWP:P71571; PDB:2IRUA; SASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHIAGRPATRKRWPNGVDQPA 1111------1111--3333-----------------33332222------1111----- 
FFEKQLALSAPPWLSRATVAHRSGTTTYPIIDSATGLAWIAQQAALEVHVPQWRFVAEPG ------33331111------1111--------3333------------------------ SGELNPGPATRLVFDLDPGEGVMMAQLAEVARAVRDLLADIGLVTFPVTSGSKGLHLYTP ------------------2222---------------3333------------------- LDEPVSSRGATVLAKRVAQRLEQAMPALVTSTMTKSLRAGKVFVDWSQNSGSKTTIAPYS ------------------------3333-----11112222----11111111---2222 LRGRTHPTVAAPRTWAELDDPALRQLSYDEVLTRIARDGDLLERLD -------------3333--1111----------------1111--- >DNA HELICASE II; SWP:P03018; PDB:2IS6A; MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVENCSPYSIMAV -3333-3333------------------------------------------3333---- TFTNKAAAEMRHRIGQLMGTSQGGMWVGTFHGLAHRLLRAHHMDANLPQDFQILDSEDQL ---------------------2222----------------3333--------------- RLLKRLIKAMNLDEKQWPPRQAMWYINSQKDEGLRPHHIQSYGNPVEQTWQKVYQAYQEA -------1111--3333------------1111-3333---------------------- CDRAGLVDFAELLLRAHELWLNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDT -------3333-----------------------------3333---------------- GKVMIVGDDDQSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENN -------1111--3333-------------2222-------------------------- NGRLGKKLWTDGADGEPISLYCAFNELDEARFVVNRIKTWQDNGGALAECAILYRSNAQS ----------------------------------------1111-3333----------- RVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIVNRNDDAAFERVVNTPTRGIGD -------1111----------------------------11113333------------- RTLDVVRQTSRDRQLTLWQACRELLQEKALAGRAASALQRFMELIDALAQETADMPLHVQ -------------------------------------------------1111------- TDRVIKDSGLRTMYEQEKGEKGQTRIENLEELVTATRQFSYNEEDLMPLQAFLSHAALEA ------------------3333------------------------3333--------33 GEGQADTWQDAVQLMTLHSAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAYVG 33-------------33332222----------------3333----------------- VTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEECVEEVRLRATVSRPVSH -----------------iiii-------------3333---------------- >CATALASE; SWP:Q3LSM1; PDB:2ISAA; SKKLTTAAGCPVAHNQNVQTAGKRGPQLLQDVWFLEKLAHFDREVIPERRHAKGSGAYGT -----1111------------1111--1111---------1111---------------- 
FTVTHDITKYTKAKIFSDIGKKTDMFARFSTVAGERGAADAERDIRGFSLKFYTEEGNWD ------3333--3333-2222-------------1111---------------------- LAGNNTPVFFLRDPLKFPDLNHAVKRDPRTNMRSAKNNWDFWTSLPEALHQVTIVMSDRG ------------3333----------------------------3333--------1111 IPATYRHMHGFGSHTFSFINSDNERYWVKFHFVSQQGIKNLSDAEAGELVGNDRESHQRD ---1111------------1111----------1111----------------------- LLDSIDNQDFPKWTLKVQIMPEADAATVPYNPFDLTKVWPHKDYPLIEVGEFELNRNPQN ----1111------------3333------1111-----3333----------------3 YFAEVEQAAFNPANVVPGISFSPDKMLQGRLFAYGDAQRYRLGVNHQHIPVNAPRCPVHS 333-1111--3333-2222-----------------------1111--3333-------- YHRDGAMRVDGNFGSTLGYEPNDQGQWAEQPDFSEPPLNLDGAAAHWDHREDEDYFSQPG ----------%%%%---------------3333--------------1111--------- DLFGLMTAEKQAILFDNTARNLNGVPKEIQLRHVTHCYKADPAYGEGIGKLLGFDISEYN --1111-------------1111--3333-----------3333----------3333-- S - >FUMARASE; SWP:O29167_ARCFU; PDB:2ISBA; HVEYELRTPLVKDQILKLKVGDVVYITGEIFTARDEAHARALEWEEGKELPFSFDKGVVY ----------333333332222-------------------------------2222--- HCGPLVKKNDEWRVVSAGPTTSARNPFTPKILEKVECGIIGKGGSEEVVEARGKAAYFAF --------------------33331111---1111------------------------- TGGAGALAASIKKVKGVVWEDLGPEAVWLLEVERFGPCIVAIDAHGNSLYRR -----3333---------3333-------------------------1111- >NYSGXRC-8828Z, PHOSPHATAS; SWP:NA; PDB:2ISNA; KKVITVNEWYTTTVAATLGRRPTDEDAILVSAPPNVRIKAVFDGHAGEATSQYCAKHAAK -----------------!!!!--------------------------------------- HLGKLSEFTFAEVKKACLSLDAEIIRKLGPKHVAGSTGIIVAIERLSAPVVENVVGREIV -1111-------------------------------------------------!!!!-- PRFVPLEKLIQEEEEAVGRYPRVPDVQQKTIPAGSFLVTAINIGDSRATLIHSDGGLTRL ----3333-----------------------2222----------------1111----- SKDHKPNHPTEASRIEKAGGSVETFDVPRVDGVLALSRAFGDSDFKNPNLPPEEQKVIAV ----1111-------1111---------2222---------3333-----1111------ PDVRQFYALSSDLLLLACDGVYEPSGDWAYVRDLTVAEQRSKGDLEEVAARVDYAYDNSQ --------1111-----3333-----------------1111------------------ DNISVLVAFHNQEVEHPTAVYKVV ------------------------ >PYRIDOXAL BIOSYNTHESIS LY; SWP:Q9WYU4; 
PDB:2ISSA; HMEIKKGTWIIKKGFAEMFKGGVIMDVTSAEQAKIAEEAGAVAVMALERVPADIRKEGGV -------3333---33332222-------------------------------------- ARMASIAKIREIMEAVSIPVMAKVRIGHIAEAKILEELGVDFIDESEVLTPADDRFHINK ------------------------2222------------------------------33 HEFKVPFVCGARDLGEALRRIAEGAAMIRTKGEAGTGNVVEAVKHMRRVMEQIKQVTKME 33------------------1111-----------------------------------3 DEELVAYGKEIGAPVELLREVKRLGRLPVVNFAAGGVATPADAALMMMLGADGVFVGSGI 333-----1111-3333---------------------3333----1111------3333 FKSKDPRKMAKAMVLAVTYWDNPRILLKISEDIGEPMRGLD ------------------1111-----1111---------- >Glutamine amidotransferas; SWP:Q9WYU3; PDB:2ISSD; HMKIGVLGVQGDVREHVEALHKLGVETLIVKLPEQLDMVDGLILPGGESTTMIRILKEMD -------------------------------3333------------3333-----1111 MDEKLVERINNGLPVFATCAGVILLAKRIKQEKLGVLDITVERNAYGRQVESFETFVEIP -------------------------------------------11113333-------33 AVGKDPFRAIFIRAPRIVETGKNVEILATYDYDPVLVKEGNILACTFHPELTDDLRLHRY 33------------------1111-----iiii-----!!!!-----3333---3333-- FLEMV -1111 >PUTATIVE FRUCTOSE-1,6-BIS; SWP:O97447; PDB:2ISWA; PLCTLRQMLGEARKHKYGVGAFNVNNMEQIQGIMKAVVQLKSPVILQCSRGALKYSDMIY ---3333----------------------------------------------1111--- LKKLCEAALEKHPDIPICIHLDHGDTLESVKMAIDLGFSSVMIDASHHPFDENVRITKEV -----------1111------------------1111-------1111------------ VAYAHARSVSVEAELGTLVQLTEPQDAKKFVELTGVDALAVAIGTSHGAYKFKSRLAIDR ----1111------------------------------------------------3333 VKTISDLTGIPLVMHGSSSVPKDVKDMINKYGGKMPDAVGVPIESIVHAIGEGVCKINVD ----------------------------------1111---3333--------------- SDSRMAMTGAIRKVFVEHPEKFDPRDYLGPGRDAITEMLIPKIKAFGSAGHAGDYKVVSL -----------------3333-3333---------------------2222-------33 EEAKAWY 33----- >IRON-DEPENDENT REPRESSOR ; SWP:A2VL53; PDB:2ISYA; NELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRH 1111-------------1111---3333---------------------------1111- LELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEARWEHVMSEDVERRLVKVLNN -------------------------------1111-3333-1111----------1111- 
PTTSPFGNPIPGLVELGVASENLYFQ ---1111----3333----------- >HYPOTHETICAL PROTEIN; SWP:Q46J75; PDB:2IT9A; GIKKEGPGWRIIFDSSRDNFSTLIGGETWAIELDKSEWKILVEVVELCDQYKLVKEQLGD -----2222----1111--------1111------------------------1111--- EDITLELERRPWLAILNGDQYGWNLRLILSAFNRGAEVYWPRHVTNNVVNARSWD ------------------1111--------------------3333--------- >TRNA-(MS(2)IO(6)A)-HYDROX; SWP:Q88KV1; PDB:2ITBA; LIPEIDAFLGCPTPDAWIEAALADQETLLIDHKNCEFKAASTALSLIAKYNTHLDLINSR ---3333------3333------------------------------------3333--- LAREELVHHEQVLRLKRRGVPLRPVSAGRYASGLRRLVRAHEPVKLVDTLVVGAFIEARS ---------------1111---------------1111---------------------- CERFAALVPHLDEELGRFYHGLLKSEARHYQGYLKLAHNYGDEADIARCVELVRAAEELI ------3333-------------------------------------------------- QSPDQELRFHSGIPQ --------------- >IRON-REGULATED SURFACE DE; SWP:Q2FZE9; PDB:2ITEA; ATSQPINFQVQKDGSSEKSHDDYQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQE -----------2222----------------%%%%--------3333-------1111-- LATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIVVPQINYNHRYTTHLEFEKAIPTLA ---------1111--------2222---------------------------------- >XYLULOSE KINASE; SWP:P09099; PDB:2ITMA; MYIGIDLGTSGVKVILLNEQGEVVAAQTEKLTVSRPHPLWSEQDPEQWWQATDRAMKALG -------1111------1111---------------2222---3333------------- DQHSLQDVKALGIAGQMHGATLLDAQQRVLRPAILWNDGRCAQECTLLEARVPQSRVITG ----1111---------------1111-------1111--3333-------1111----- NLMMPGFTAPKLLWVQRHEPEIFRQIDKVLLPKDYLRLRMTGEFASDMSDAAGTMWLDVA ---3333------------3333----------------------------3333---11 KRDWSDVMLQACDLSRDQMPALYEGSEITGALLPEVAKAWGMATVPVVAGGGDNAAGAVG 11-------1111-3333-----1111----------1111-----------------11 VGMVDANQAMLSLGTSGVYFAVSEGFLSKPESAVHSFCHALPQRWHLMSVMLSAASCLDW 11--2222--------------------------------------------3333---- AAKLTGLSNVPALIAAAQQADESAEPVWFLPYLSPQAKGVFFGLTHQHGPNELARAVLEG --1111--1111----11111111--------------------1111------------ VGYALADGMDVVHACGIKPQSVTLIGGGARSEYWRQMLADISGQQLDYRTGGDVGPALGA ------------1111---------3333------------------------------- 
ARLAQIAANPEKSLIELLPQLPLEQSHLPDAQRYAAYQPRRETFRRLYQQLLPLMA --------11113333-------------3333-----------------3333-- >EUKARYOTIC TRANSLATION IN; SWP:P55010; PDB:2IU1A; LERTIEERVNILFDFVKKKKEEGVIDSSDKEIVAEAERLDVKAMGPLVLTEVLFNEKIRE -----------------------3333-------------3333----------1111-- QIKKYRRHFLRFCHNNKKAQRYLLHGLECVVAMHQAQLISKIPHILKEMYDADLLEEEVI --------3333----------------------33331111-------1111------- ISWSEKASKKYVSKELAKEIRVKAEPFIKWLKEAEEESSGGEEEDEDENIEVVYSKLES --1111----------------------------------------------------- >DIHYDROXYACETONE KINASE; SWP:Q9CIW0; PDB:2IU4A; NEIPEEMLKGIDLTYPQLTYLPETGILYDNTYNEKTVPIISGGGSGHEPAHVGYVGSGML --------------1111--2222----1111-------------------11112222- AAAVTGPLFIPPKSKNILKAIRQVNSGKGVFVIIKNFEADLKEFNEAIKEARTEGIDVRY ------2222--3333-----------------------------------1111----- IVSHDDISVNAYNFHKRHRGVAGTILLHKILGAFAKEGGSIDEIEQLALSLSPEIYTLGV ---------1111-------3333----------1111------------3333------ ALAPVHFPHQKTSFVLAEDEVSFGIGI -------%%%%-----1111------- >HYPOTHETICAL PROTEIN YCEG; SWP:Q9CIV9; PDB:2IU5A; MEKSIITQKIIAKAFKDLMQSNAYHQISVSDIMQTAKIRRQTFYNYFQNQEELLSWIFEN -1111-----------------1111------------33333333-------------- DFAELINDNSDYYGWQNELLLLLRYLDENQIFYQKIFVIDKNFEHFFLIQWENLLDKVIF ---------------------------------------1111----------------- DQEKKSDYHWSDLEKSFICRYNAAAICAITRESIIRGNSLEKLYSQIVNLLLAQIKIFES ---------------------------------------3333-----------3333-- >ALKALINE PHOSPHATASE; SWP:Q9KWY4; PDB:2IUCA; KTPKNVILLISDGAGLSQISSTFYFKEGTPNYTQFKNIGLIKTSSSREDVTDSASGATAF ----------2222----3333-------3333--------------------------- SCGIKTYNAAIGVADDSTAVKSIVEIAALNNIKTGVVATSSITHATPASFYAHALNRGLE ------2222---1111----------1111---------11113333-------1111- EEIAMDMTESDLDFFAGGGLNYFTSRKDKKDVLAILKGNQFTINTTGLTDFSSIASNRKM ----3333----------3333---1111-3333--1111---------33331111--- GFLLADEAMPTMESGRGNFLSAATDLAIQFLSKDNSAFFIMSEGSQIDWGGHANNASYLI ----------3333---------------1111-----------------1111------ 
SEINDFDDAIGTALAFAKKDGNTLVIVTSDHETGGFTLAAKKNKREYSDYTSIGPSFSTG ------------------------------------------------3333-------- GHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKWNQ ------------22221111---3333------------- >CATALASE; SWP:NA; PDB:2IUFA; QQFLSQFYLNDQDVYLTSNVGGPIQDENSLSAGQRGATLLQDFIFREKIQRFDHERVPER 33331111---------1111-----------------1111---------1111----- AVHARGTGAHGTFTSYGDWSNLTAASFLSAEGKETPMFTRFSTVAGSRGSADTARDVHGF ------------------1111--3333-2222-----------------1111------ ATRFYTDEGNFDIVGNNIPVFFIQDAILFPDLIHAVKPRGDNQIPQAATAHDSAWDFFSQ ------------------------3333-------------------------------- QPSVLHTLLWAAGHGIPRSFRHVNGFGVHTFRLVTDDGKTKLVKFHWKGLQGKASFVWEE 3333-------3333---1111------------1111---------------------- AQQTAGKNADFRQDLFQSIQAGRFPEWELGVQIMQEQDQLKFGFDLLDPTKIVPEELVPV -------1111-------1111------------1111-1111-1111-----3333--- TILGKMQLNRNPNYFAETEQVMFQPGHIVRGVDFTEDPLLQGRLFSYLDTQLNRHGGPNF ------------3333-1111--3333-2222--------------------1111--33 EQLPINRPRAPIHNNNRDGAGQMFIPLDPNAYSPNTENKGSPKQANETVGKGFFTAPERT 333333-----------------------------1111------1111------1111- ASGKLQRTLSTTFENNWSQPRLFWNSLVNAQKEFIVDARFETSNVSSSVVRDDVIIQLNR ---------1111----------1111--------------1111------------333 ISDNLATRVASAIGVEAPKPNSSFYHDNTTAHIGAFGEKLAKLDGLKVGLLASVNKPASI 3--------3333--------1111-----------------2222------1111---- AQGAKLQVALSSVGVDVVVVAERANNVDETYSASDAVQFDAVVVADGAEGLFGADSFTVE ---------3333----------2222--3333-1111------222211111111---- PSAGSGASTLYPAGRPLNILLDAFRFGKTVGALGSGSDALESGQISSERQGVYTGKNAGD -1111------2222-------------------------1111-3333----------- AFAKDIKSGLSTFKFLDRFAVDE --------------1111----- >Phosphatidylinositol 3-ki; SWP:P27986; PDB:2IUGA; NNMSLQNAEWYWGDISREEVNEKLRDTADGTFLVRDASTKMHGDYTLTLRKGGNNKLIKI ---33331111----3333--------2222-------3333--------iiii------ FHRDGKYGFSDPLTFSSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQ --iiii----------------------3333-1111------------- >LIPOXYGENASE L-5; SWP:Q43446; PDB:2IUJA; 
GQKIKGTMVVMQKNVLDINSITSVALDTVTFLASSISIQLISATKADGGKGKVGKATNLR -----------3333--------------------------------------------- GKITLPTIGAKEEAYDAQFDWDSDFGIPGAFYIKNYMQNEFYLKSLILEDIPNHGTIHFI ----------------------3333------------------------2222------ CNSWVYNSKHYKTDRIFFANNTYLPSETPAPLVKYREEELKNVRGDGTGERKEWDRIYDY ------1111------------------3333-------------------1111----- DVYNDLGDPDKGEKYARPVLGGSALPYPRRGRTGRGKTRKDPNSEKPGDFVYLPRDEAFG -------3333--------------------------1111------------1111--- HLKSSDFLAYGIKSVAQDVLPVLTDAFDGNLLSLDFDNFAEVRKLYEGGVTLPTNFLSNI --3333--------------------1111-------3333-3333-------------- TPIPIIKELFRTDGEQFLKYPPPKVMQVDKSAWMTDEEFARETIAGLNPNVIKIIEEFPL --3333----------------1111--1111--3333-1111----------------- SSKLDTQAYGDHTCIITKEHLEPNLGGLTVEQAIQNKKLFILDHHDYLIPYLRKINANTT ----3333---------1111-------3333-1111-------333311113333---- KTYATRTIFFLKNDGTLTPLAIELSKPHPQGEEYGPVSEVYVPSSEGVEAYIWLLAKAYV -----------1111------------11111111-----------3333---------- VVNDACYHQIISHWLNTHAVVEPFVIATNRHLSVVHPIYKLLFPHYRDTMNINSLARKSL -----------------1111-----------1111------1111-------------- VNADGIIEKTFLWGRYSLEMSAVIYKDWVFTDQALPNDLVKRGVAVKDPSAPHGVRLLIE -2222-----1111--------3333--3333---------------3333--------- DYPYASDGLEIWDAIKSWVEEYVSFYYKSDEELQKDPELQAWWKELVEVGHGDLKDKPWW ---------------------3333---1111------------------1111--1111 QKMQTREELVEASATLIWIASALHAAVNFGQYPYGGLILNRPTISRRFMPEKGSPEYDAL ------------------------------3333--3333-------------3333--- AKNPEKEFLKTITGKKETLIDLTIIEILSRHASDEFYLGQRDGGDYWTSDAGPLEAFKRF -------------------------3333--1111------------------------- GKNLEEIEKKLIEKNNDETLRNRYGPAKMPYTLLYPSSEEGLTFRGIPNSISI ------------1111---3333-3333---1111------------------ >SEED LIPOXYGENASE; SWP:P24095; PDB:2IUKA; GQKIKGTVVLMPKNVLDFNAITSIVIDTATSFLGRNISMQLISATQTDGSGNGKVGKEVY -----------3333------------3333----------------------------- LEKHLPTLPTLGARQDAFSIFFEWDASFGIPGAFYIKNFMTDEFFLVSVKLEDIPNHGTI ------------------------------------------------------------ 
EFVCNSWVYNFRSYKKNRIFFVNDTYLPSATPAPLLKYRKEEFEVLRGDGTGKRKDFDRI ---------3333--------------11113333-------------------1111-- YDYDVYNDLGNPDGGDPRPILGGCSIYPYPLRVRTGRERTRTDPNSEKPGEVYVPRDENF ----------3333----------------------------------------1111-- GHLKSSDFLTYGIKSLSHDVIPLFKSAIFQLRVTSSEFESFEDVRSLYEGGIKLPTDILS ---3333--------------------------------3333-----------3333-- QISPLPALKEIFRTDGENVLQFPPPHVAKVSKSGVMTDEEFAREVIAGVNPNVIRRLQEF -3333--3333-------------------------3333-1111--------------- PPKSTLDPTLYGDQTSTITKEQLEINMGGVTVEEALSTQRLFILDYQDAFIPYLTRINSL ------3333--------33333333iiii----------------333311113333-- PTAKAYATRTILFLKDDGTLKPLAIELSKPHPDGDNLGPESIVVLPATEGVDSTIWLLAK --------------1111-------------------------------3333------- AHVIVNDSGYHQLVSHWLNTHAVMEPFAIATNRHLSVLHPIYKLLYPHYRDTININGLAR -----------------------------------3333-----33332222-------- QSLINADGIIEKSFLPGKYSIEMSSSVYKNWVFTHQALPADLVKRGLAIEDPSAPHGLRL -------3333--3333--------3333--3333-------1111----3333------ VIEDYPYAVDGLEIWDAIKTWVHEYVSLYYPTDAAVQQDTELQAWWKEAVEKGHGDLKEK --------------------------------3333-----------------1111--3 PWWPKKQTTEDLIQSCSIIVWTASALHAAVNFGQYPYGGLILNRPTLARRFIPAEGTPEY 333---------------------------1111-----3333-------------1111 DEMVKNPQKAYLRTITPKFETLIDLSVIEILSRHASDEIYLGERETPNWTTDKKALEAFK 3333------------3333--------3333---------------------------- RFGSKLTGIEGKINARNSDPSLRNRTGPVQLPYTLLHRSSEEGLTFKGIPNSISI --------------3333333311111111---1111------------------ >DNA TRANSLOCASE FTSK; SWP:P46889; PDB:2IUSA; LPSLDLLTPPTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARIS --1111------------------------------------------------111111 NLSRDLARSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVV 11---1111-----------2222------------------------------1111-- LGKDIAGEPVVADLAKMPHLLVAGTTGSGASVGVNAMILSMLYKAQPEDVRFIMIDPKML ---1111-----3333------------------------1111-3333----------1 ELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADR 111-2222---------------------------------------------------- 
MMRPIPDPYWHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLAT ------1111-----------------3333------------------1111------- QRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVR ---1111------------------------------1111--iiii------------- VHGAFVRDQEVHAVVQDWKARGRPQYVDGITSD -----------------3333-------1111- >DNA TRANSLOCASE FTSK; SWP:Q9I0M3; PDB:2IUTA; LPPLSLLDPAEVKQKSYSPESLEAMSRLLEIKLKEFGVEVSVDSVHPGPVITRFEIQPAA --3333--------------------------3333------------------------ GVKVSRISNLAKDLARSLAVISVRVVEVIPGKTTVGIEIPNEDRQMVRFSEVLSSPEYDE ---------------1111---------2222---------------3333----3333- HKSTVPLALGHDIGGRPIITDLAKMPHLLVAGTTGSGKSVGVNAMLLSILFKSTPSEARL -----------1111-----3333--------2222------------1111-1111--- IMIDPKMLELSIYEGIPHLLCPVVTDMKEAANALRWSVAEMERRYRLMAAMGVRNLAGFN ------------2222-------------------------------------------- RKVKDAEEAGTPLTDPLFRRESPDDEPPQLSTLPTIVVVVDEFADMMMIVGKKVEELIAR ------1111----1111---3333-------------------------3333------ IAQKARAAGIHLILATQRPSVDVITGLIKANIPTRIAFQVSSKIDSRTILDQGGAEQLLG -----1111----------3333-33331111---------------------1111--i HGDMLYLPPGTGLPIRVHGAFVSDDEVHRVVEAWKLRGAPDYIEDILAG iii-------------------3333-------1111-------1111- >ALKYLATED REPAIR PROTEIN ; SWP:Q96Q83; PDB:2IUWA; SHMRVIDREGVYEISLSPTGVSRVCLYPGFVDVKEADWILEQLCQDVPWKQRTGIREDIT -------------------------------------------------------%%%%- YQQPRLTAWYGELPYTYSRITMEPNPHWHPVLRTLKNRIEENTGHTFNSLCNLYRNEKDS -------------11113333----------------------------------1111- VDWHSDDEPSLGRCPIIASLSFGATRTFEMRKKPPYVERVKIPLDHGTLLIMEGATQADW --------1111--------------------------------2222------------ QHRVPKEYHSREPRVNLTFRTVYP ------------------------ >GLYCOSYLTRANSFERASE; SWP:Q93KV2; PDB:2IUYA; PLKVALVNIPLRVPGSDAWISVPPQGYGGIQWVVANLDGLLELGHEVFLLGAPGSPAGRP ------------2222------------------------1111--------------22 GLTVVPAGEPEEIERWLRTADVDVVHDHSGGVIGPAGLPPGTAFISSHHFTTRPVNPVGC 22--------------1111--------------22222222--------------2222 
TYSSRAQRAHCGGGDDAPVIPIPVDPARYRSAADQVAKEDFLLFGRVSPHKGALEAAAFA --------1111-1111-------3333--3333-------------3333--------- HACGRRLVLAGPAWEPEYFDEITRRYGSTVEPIGEVGGERRLDLLASAHAVLASQAVTGP 1111---------------------3333-----------------------------33 WGGIWCEPGATVVSEAAVSGTPVVGTGNGCLAEIVPSVGEVVGYGTDFAPDEARRTLAGL 33-------3333---1111-------!!!!--3333-------------------1111 PASDEVRRAAVRLWGHVTIAERYVEQYRRLLAGATWK ------------------------------------- >Formate dehydrogenase H; SWP:P07658; PDB:2IV2X; MKKVVTVCPYCASGCKINLVVDNGKIVRAEAAQGKTNQGTLCLKGYYGWDFINDTQILTP ---------------------iiii--------3333----3333---3333-------- RLKTPMIRRQRGGKLEPVSWDEALNYVAERLSAIKEKYGPDAIQTTGSSRGTGNETNYVM --------------------------------------1111------------------ QKFARAVIGTNNVDCCARVHGPSVAGLHQSVGNGAMSNAINEIDNTDLVFVFGYNPADSH -------------------------3333---------33331111--------3333-- PIVANHVINAKRNGAKIIVCDPRKIETARIADMHIALKNGSNIALLNAMGHVIIEENLYD ------------------------3333---------2222------------1111--- KAFVASRTEGFEEYRKIVEGYTPESVEDITGVSASEIRQAARMYAQAKSAAILWGMGVTQ ----------------3333-3333--1111------------------------!!!!- FYQGVETVRSLTSLAMLTGNLGKPHAGVNPVRGQNNVQGACDMGALPDTYPGYQYVKDPA ----------------------------------------1111-1111-%%%%3333-- NREKFAKAWGVESLPAHTGYRISELPHRAAHGEVRAAYIMGEDPLQTDAELSAVRKAFED --------------------3333------------------1111---1111------- LELVIVQDIFMTKTASAADVILPSTSWGEHEGVFTAADRGFQRFFKAVEPKWDLKTDWQI -----------3333--------------------1111--------------------- ISEIATRMGYPMHYNNTQEIWDELRHLCPDFYGATYEKMGELGFIQWPCRDTSDADQGTS -----1111------3333--------1111---3333--------------3333---- YLFKEKFDTPNGLAQFFTCDWVAPIDKLTDEYPMVLSTVREVGHYSCRSMTGNCAALAAL --------1111----------------3333--------1111-----3333--3333- ADEPGYAQINTEDAKRLGIEDEALVWVHSRKGKIITRAQVSDRPNKGAIYMTYQWWPEYK -------------------2222---------------------2222---------111 YCAVRVEPIADQRAAEQYVIDEYNKLKTRLREAALA 1----------------------------------- >PROTOPORPHYRINOGEN OXIDAS; SWP:P56601; PDB:2IVDA; 
MNVAVVGGGISGLAVAHHLRSRGTDAVLLESSARLGGAVGTHALAGYLVEQGPNSFLDRE -------------------1111--------------------iiii------------3 PATRALAAALNLEGRIRAADPAAKRRYVYTRGRLRSVPASPPAFLASDILPLGARLRVAG 333----11111111--------------iiii----------1111------------3 ELFSRRAPEGVDESLAAFGRRHLGHRATQVLLDAVQTGIYAGDVEQLSVAATFPMLVKME 333----2222-------------------------------3333-------------- REHRSLILGAIRAQKAQRQAGTAPKLSGALSTFDGGLQVLIDALAASLGDAAHVGARVEG --------------------------------11113333-------!!!!--------- LAREGWRLIIEEHGRRAELSVAQVVLAAPAHATAKLLRPLDDALAALVAGIAYAPIAVVH -----------%%%%---------------------3333-------1111--------- LGFDAGTLPAPDGFGFLVPAEEQRRMLGAIHASTTFPFRAEGGRVLYSCMVGGARQPGLV ---2222-----------3333--------3333-3333-iiii--------11113333 EQDEDALAALAREELKALAGVTARPSFTRVFRWPLGIPQYNLGHLERVAAIDAALQRLPG ----------------------------------------2222---------3333--- LHLIGNAYKGVGLNDCIRNAAQLADALVA ----3333----------------3333- >ETHYLBENZENE DEHYDROGENAS; SWP:Q5P5I0; PDB:2IVFA; EDIYRKEWKWDKVNWGSHLNICWPQGSCKFYVYVRNGIVWREEQAAQTPACNVDYVDYNP 33333333--------------------------%%%%-------------1111----- LGCQKGSAFNNNLYGDERVKYPLKRVGKRGEGKWKRVSWDEAAGDIADSIIDSFEAQGSD --3333---3333-1111---------2222------3333----------------333 GFILDAPHVHAGSIAWGAGFRMTYLMDGVSPDINVDIGDTYMGAFHTFGKMHMGYSADNL 3-------1111-----------------------------------------------1 LDAELIFMTCSNWSYTYPSSYHFLSEARYKGAEVVVIAPDFNPTTPAADLHVPVRVGSDA 111--------3333--------------------------1111---------2222-- AFWLGLSQVMIDEKLFDRQFVCEQTDLPLLVRMDTGKFLSAEDVDGGEAKQFYFFDEKAG ----------1111-----------1111------------------1111-----3333 SVRKASRGTLKLDFMPALEGTFSARLKNGKTIQVRTVFEGLREHLKDYTPEKASAKCGVP -------------------------3333---------------1111------------ VSLIRELGRKVAKKRTCSYIGFSSAKSYHGDLMERSLFLAMALSGNWGKPGTGAFAWAYS ----------1111-------3333----------------1111---2222-------- DDNMVYLGVMSKPTAQGGMDELHQMAEGFNKRTLEADPTSTDEMGNIEFMKVVTSAVGLV 3333--------33331111----------------1111-------------------- 
PPAMWLYYHVGYDQLWNNKAWTDPALKKSFGAYLDEAKEKGWWTNDHIRPAPDKTPQVYM 3333------3333111111111111-----------1111--1111---1111------ LLSQNPMRRKRSGAKMFPDVLFPKLKMIFALETRMSSSAMYADIVLPCAWYYEKHEMTTP ----3333-2222-------3333-----------3333----------2222------- CSGNPFFTFVDRSVAPPGECREEWDAIALILKKVGERAAARGLTEFNDHNGRKRRYDELY 3333----------------------------------1111-----1111---1111-- KKFTMDGHLLTNEDCLKEMVDINRAVGVFAKDYTYEKFKKEGQTRFLSMGTGVSRYAHAN ----iiii---------------------1111------------------3333----- EVDVTKPIYPMRWHFDDKKVFPTHTRRAQFYLDHDWYLEAGESLPTHKDTPMVGGDHPFK --1111----------------1111-------------------------2222----- ITGGHPRVSIHSTHLTNSHLSRLHRGQPVVHMNSKDAAELGIKDGDMAKLFNDFADCEIM ---------!!!!11113333----------------1111-2222-------------- VRTAPNVQPKQCIVYFWDAHQYKGWKPYDILLIGMPKPLHLAGGYEQFRYYFMNGSPAPV -------2222------11112222-3333--------------1111--2222------ TDRGVRVSIKKA -2222------- >Beta-subunit of ethylbenz; SWP:Q5P5I1; PDB:2IVFB; KRQLVTVIDLNKCLGCQTCTVACKNIWTKRPGTEHMRWNNVTTYPGKGYPRDYERKGGGF --------3333-----------------2222------------------1111----- LRGEPQPGVLPTLIDSGDDFQFNHKEVFYEGKGQTVHFHPTSKSTGKDPAWGYNWDEDQG %%%%-------3333-------3333-----!!!!------------------1111--- GGKWPNPFFFYLARMCNHCTNPACLAACPTGAIYKREDNGIVLVDQERCKGHRHCVEACP ---------------------------1111---------------------------11 YKAIYFNPVSQTSEKCILCYPRIEKGIANACNRQCPGRVRAFGYLDDTTSHVHKLVKKWK 11--------------%%%%--1111--3333--1111-----1111------------- VALPLHAEYGTGPNIYYVPPMGARGFGEDGEITDKTRIPLDVLEGLFGPEVKRVLAVLHT --------------------------1111--------3333-----1111--------- ERENMRAGRGSELMDLLISKKWSDRFGGFTNDPLTQS ----1111------------3333-iiii-------- >Gamma-subunit of ethylben; SWP:Q5P5I2; PDB:2IVFC; MKAKRVPGGKELLLDLDAPIWAGAESTTFEMFPTPLVMVKEVSPFLALSEGHGVIKRLDV -----------1111--3333-------------3333-33333333------------- AALHNGSMIALRLKWASEKHDKIVDLNSFVDGVGAMFPVARGAQAVTMGATGRPVNAWYW ------------------------1111-----------22223333--2222------- KANANEPMEIVAEGFSAVRRMKDKAGSDLKAVAQHRNGEWNVILCRSMATGDGLAKLQAG 
1111---------1111----%%%%----------iiii-----------2222---222 GSSKIAFAVWSGGNAERSGRKSYSGEFVDFEILK 2---------3333--!!!!-------------- >PROTO-ONCOGENE TYROSINE-P; SWP:P07949; PDB:2IVSA; EDPKWEFPRKNLVLGKTLGEGEFGKVVKATAFHLKGRAGYTTVAVKMLKENASPSELRDL -3333--3333----------------------iiii-----------1111-------- LSEFNVLKQVNHPHVIKLYGACSQDGPLLLIVEYAKYGSLRGFLRESRKVGPGYLRALTM ------1111-1111-------------------1111---------------------- GDLISFAWQISQGMQYLAEMKLVHRDLAARNILVAEGRKMKISDFGLSRDVYEEDSYVKR ---------------------------3333---2222------1111------------ SQGRIPVKWMAIESLFDHIYTTQSDVWSFGVLLWEIVTLGGNPYPGIPPERLFNLLKTGH -----3333-3333---------------------1111----22223333----1111- RMERPDNCSEEMYRLMLQCWKQEPDKRPVFADISKDLEKMMVKR ----1111---------1111-1111------------------ >PILP PILOT PROTEIN; SWP:A1KS87; PDB:2IVWA; ETDKKGENAPDTKRIKETLEKFSLENMRYVGILKSGQKVSGFIEAEGYVYTVGVGNYLGQ -----------------3333-3333---------------------------------- NYGRIESITDDSIVLNELIEDSTGNWVSRKAELLLNSSDKNTEQAAAPAAEQN --------------------3333----------------------------- >CYCLIN-T2; SWP:O60583; PDB:2IVXA; ASSRWFFTREQLENTPSRRCGVEADKELSCRQQAANLIQEMGQRLNVSQLTINTAIVYMH -3333------------1111---------------------1111-------------- RFYMHHSFTKFNKNIISSTALFLAAKVEEQARKLEHVIKVAHACLHPLEPLLDTKCDAYL ------3333---------------1111----------------1111---1111---- QQTRELVILETIMLQTLGFEITIEHPHTDVVKCTQLVRASKDLAQTSYFMATNSLHLTTF ------------------------3333------------------------------33 CLQYKPTVIACVCIHLACKWSNWEIPVSTDGKHWWEYVDPTVTLELLDELTHEFLQILEK 33--3333-------------------1111-3333--1111------------------ TPNRLKKIRNWRANQA ---------------- >HYPOTHETICAL PROTEIN SSO1; SWP:Q97YC2; PDB:2IVYA; AMLYLIFYDITDDNLRNRVAEFLKKKGLDRIQYSVFMGDLNSSRLKDVEAGLKIIGNRKK -----------------------1111--------------------------1111--- LQEDERFFILIVPITENQFRERIVIGYS -1111-------------1111------ >CHITIN DEACETYLASE; SWP:Q6DWK3; PDB:2IW0A; VPVGTPILQCTQPGLVALTYDDGPFTFTPQLLDILKQNDVRATFFVNGNNWANIEAGSNP ------------------------1111------------------------1111---- 
DTIRRMRADGHLVGSHTYAHPDLNTLSSADRISQMRQLEEATRRIDGFAPKYMRAPYLSC ---------------------3333-----------------------------2222-- DAGCQGDLGGLGYHIIDTNLDTKDYENNKPETTHLSAEKFNNELSADVGANSYIVLSHDV --------1111----------3333--1111--------------3333-------111 HEQTVVSLTQKLIDTLKSKGYRAVTVGECLGDAPENWYKAHHHHHH 13333-----------1111-------1111-3333---------- >LIPOPOLYSACCHARIDE CORE B; SWP:P25740; PDB:2IW1A; IVAFCLYKYFPFGGLQRDFMRIASTVAARGHHVRVYTQSWEGDCPKAFELIQVPVKSHTN --------------------------1111--------------3333------------ HGRNAEYYAWVQNHLKEHPADRVVGFNKMPGLDVYFAADVCYAEKVAQEKGFLYRLTSRY --------------------------------------------------3333------ RHYAAFERATFEQGKSTKLMMLTDKQIADFQKHYQTEPERFQILPPGIYPDRKYSEQIPN -----------2222---------------------3333--------33331111-222 SREIYRQKNGIKEQQNLLLQVGSDFGRKGVDRSIEALASLPESLRHNTLLFVVGQDKPRK 2-----1111-1111--------3333---------1111----1111------------ FEALAEKLGVRSNVHFFSGRNDVSELMAAADLLLHPAYQEAAGIVLLEAITAGLPVLTTA ---------3333-----------------------------3333---1111-----33 VCGYAHYIADANCGTVIAEPFSQEQLNEVLRKALTQSPLRMAWAENARHYADTQDLYSLP 33-----------------------------------------------1111----333 EKAADIITGG 3--------- >XAA-PRO DIPEPTIDASE; SWP:P12955; PDB:2IW2A; GPSFWLGNETLKVPLALFALNRQRLCERLRKNPAVQAGSIVVLQGGEETQRYCTDTGVLF ------!!!!---------------------11112222-----------!!!!------ RQESFFHWAFGVTEPGCYGVIDVDTGKSTLFVPRLPASHATWMGKIHSKEHFKEKYAVDD -----------------------------------1111--------------------- VQYVDEIASVLTSQKPSVLLTLRGVNTDSGSVCREASFDGISKFEVNNTILHPEIVECRV --3333-----1111----------------------2222----------------333 FKTDMELEVLRYTNKISSEAHREVMKAVKVGMKEYELESLFEHYCYSRGGMRHSSYTCIC 3---------------------------22223333------------------------ GSGENSAVLHYGHAGAPNDRTIQNGDMCLFDMGGEYYCFASDITCSFPANGKFTADQKAV -!!!!-------1111------2222---------%%%%--------1111--------- YEAVLRSSRAVMGAMKPGVWWPDMHRLADRIHLEELAHMGILSGSVDAMVQAHLGAVFMP ---------------22223333------------------------------------- HGLGHFLGIDVHDVGGYPEGVERIDEPGLRSLRTARHLQPGMVLTVEPGIYFIDHLLDEA 
-----------------2222-------1111------2222------------------ LADPARASFFNREVLQRFRGFGGVRIEEDVVVTDSGIELLTCVPRTVEEIEACMAGCD --33331111-33331111-------------1111---------------------- >REST corepressor 1; SWP:Q9UKL0; PDB:2IW5B; RKPPKGMFLSQEDVEAVSANATAATTVLRQLDMELVSVKRQIQNIKQTNSALKEKLDGGI ---2222------------------------------------------------22221 EPYRLPEVIQKCNARWTTEEQLLAVQAIRKYGRDFQAISDVIGNKSVVQVKNFFVNYRRR 111-------------------------------------------------------11 FNIDEVLQEWEAE 11----------- >GLUTAMINE CYCLOTRANSFERAS; SWP:O81226; PDB:2IWAA; RPSSRVYIVEVLNEFPHDPYAFTQGLVYAENDTLFESTGLYGRSSVRQVALQTGKVENIH -----------------1111--------%%%%------2222-------1111------ KMDDSYFGEGLTLLNEKLYQVVWLKNIGFIYDRRTLSNIKNFTHQMKDGWGLATDGKILY --1111-------iiii----2222----------------------------------- GSDGTSILYEIDPHTFKLIKKHNVKYNGHRVIRLNELEYINGEVWANIWQTDCIARISAK -------------------------iiii----------iiii----2222--------- DGTLLGWILLPNLRKKLIDEGFRDIDVLNGIAWDQENKRIFVTGKLWPKLFEIKLHLVRH -----------------1111-------------1111-----2222------------- RIPDGYIERHCLNL --2222-------- >METHICILLIN RESISTANCE ME; SWP:P0A0B0; PDB:2IWBA; DKYETNVSYKKLNQLAPYFKGFDGSFVLYNEREQAYSIYNEPESKQRYSPNSTYKIYLAL -----------33333333-----------1111-----3333------!!!!------- MAFDQNLLSLNHTEQQWDKHQYPFKEWNQDQNLNSSMKYSVNWYYENLNKHLRQDEVKSY --1111--2222-----------3333---------1111-------3333--------- LDLIEYGNEEISGNENYWNESSLKISAIEQVNLLKNMKQHNMHFDNKAIEKVENSMTLKQ -----!!!!------1111----------------------------------1111--- KDTYKYVGKTGTGIVNHKEANGWFVGYVETKDNTYYFATHLKGEDNANGEKAQQISERIL 3333----------%%%%------------------------------------------ KEMELI 1111-- >SERINE/THREONINE-PROTEIN ; SWP:Q9P1W9; PDB:2IWIA; YRLGPLLGKGGFGTVFAGHRLTDRLQVAIKVIPRNRVLVTCPLEVALLWKVGAGGGHPGV ---------1111----------------------------------------------- IRLLDWFFMLVLERPLPAQDLFDYITEKGPLGEGPSRCFFGQVVAAIQHCHSRGVVHRDI -------------------3333------------------------------------- KDENILIDLRRGCAKLIDFGSGALLHDEPYTDFDGTRVYSPPEWISRHQYHALPATVWSL 
3333--------------1111--------------11113333---------------- GILLYDMVCGDIPFERDQEILEAELHFPAHVSPDCCALIRRCLAPKPSSRPSLEEILLDP ---------------------------11113333----------3333--3333---33 WMQT 33-- >NITROUS OXIDE REDUCTASE; SWP:P94127; PDB:2IWKA; ADGSVAPGKLDDYYGFWSSGQTGEMRILGIPSMRELMRVPVFNRCSATGWGQTNESIRIH -----2222---------!!!!--------------------------2222-------- QRTMTEKTKKQLAANGKKIHDNGDLHHVHMSFTDGKYDGRYLFMNDKANTRVARVRCDVM 1111--------1111-------------------------------------------- KTDAILEIPNAKGIHGMRPQKWPRSNYVFCNGEDEAPLVNDGSTMTDVATYVNIFTAVDA -------------------------------------------11111111--------- DKWEVAWQVKVSGNLDNCDADYEGKWAFSTSYNSEMGMTLEEMTKSEMDHVVVFNIAEIE --------------------------------3333--3333------------------ KAIKAGQYEEINGVKVVDGRKEAKSLFTRYIPIANNPHGCNMAPDRKHLCVAGKLSPTVT --1111----iiii-----3333-------------------1111-------------- VLDVTKFDALFYDNAEPRSAVVAEPELGLGPLHTAFDGRGNAYTSLFLDSQVVKWNIDEA --33333333-----1111---------------------------3333---------- IRAYAGEKINPIKDKLDVQYQPGHLKTVMGETLDAANDWLVCLCKFSKDRFLNVGPLKPE --1111--------------------2222-1111-----------!!!!---------- NDQLIDISGDKMVLVHDGPTFAEPHDAIAVSPSILPNIRSVWDRNDPLWAETRKQAEADE ------------------------------33331111----11111111------1111 VDIDEWTEAVIRDGNKVRVYMTSVAPSFSQPSFTVKEGDEVTVIVTNLDEIDDLTHGFTM -1111-------!!!!-------------------2222-----------2222-----2 GNHGVAMEVGPQQTSSVTFVAANPGVYWYYCQWFCHALHMEMRGRMFVEP 222------2222----------------------1111----------- >MULTIPLE PDZ DOMAIN PROTE; SWP:O75970; PDB:2IWNA; SMSETFDVELTKNVQGLGITIAGYIGDPSGIFVKSITKSSAVEHDGRIQIGDQIIAVDGT ------------1111--------------------2222--------2222----iiii NLQGFTNQQAVEVLRHTGQTVLLTLMRRGETSV -1111--------1111---------------- >MULTIPLE PDZ DOMAIN PROTE; SWP:O75970; PDB:2IWOA; LRTVEMKKSLGISIAGGVGSPLGDVPIFIAMMHPTGVAAQTQKLRVGDRIVTICGTSTEG -------------------1111---------1111--3333--2222----%%%%-222 MTHTQAVNLLKNASGSIEMQVVAGGDVSETSV 2----------------------%%%%----- >CENTAURIN GAMMA 1; SWP:Q99490; PDB:2IWRA; 
SMRSIPELRLGVLGDARSGKSSLIHRFLTGSYQVLEKTESEQYKKEMLVDGQTHLVLIRE --------------1111------------------------------iiii-------- EAGAPDAKFSGWADAVIFVFSLEDENSFQAVSRLHGQLSSLRGEGRGGLALALVGTQDRI -----3333-----------1111--------------------------------1111 SASSPRVVGDARARALADMKRCSYYETATYGLNVDRVFQEVAQKVVTLRKQQQL 1111------------1111-------1111----------------------- >THIOREDOXIN H ISOFORM 2; SWP:Q7XZK2; PDB:2IWTA; EVISVHSLEQWTMQIEEANTAKKLVVIDFTASWCGPSRIMAPVFADLAKKFPNAVFLKVD ------------------1111--------11113333---------------------3 VDELKPIAEQFSVEAMPTFLFMKEGDVKDRVVGAIKEELTAKVGLHAAA 333-------------------iiii---------3333---------- >OUTER MEMBRANE PROTEIN G; SWP:P76045; PDB:2IWVA; NDWHFNIGAMYEIENVEGYGEDMDGLAEPSVYFNAANGPWRIALAYYQEGPVDYSAGKRG -----------------------------------------------------3333--- TWFDRPELEVHYQFLENDDFSFGLTGGFRNYGYHYVDEPGKDTANMQRWKIAPDWDVKLT ----------------3333-----------------2222------------------1 DDLRFNGWLSMYKFANDLNTTGYADTRVETETGLQYTFNETVALRVNYYLERGFNMDDSR 111-------------3333------------------1111--------------1111 NNGEFSTQEIRAYLPLTLGNHSVTPYTRIGLDRWSNWDWQDDIEREGHDFNRVGLFYGYD -----------------!!!!--------------1111--------------------- FQNGLSVSLEYAFEWQDHDEGDSDKFHYAGVGVNYSF ------------------------------------- >ATP-DEPENDENT MOLECULAR C; SWP:P02829; PDB:2IWXA; MASETFEFQAEITQLMSLIINTVYSNKEIFLRELISNASDALDKIRYKSLSDPKQLETEP -----------------------------------------------3333--3333--- DLFIRITPKPEQKVLEIRDSGIGMTKAELINNLGTIAKSGTKAFMEALSAGADVSMIGQF ---------1111----------------------------------1111-33333333 GVGFYSLFLVADRVQVISKSNDDEQYIWESNAGGSFTVTLDEVNERIGRGTILRLFLKDD -3333--------------1111----------------------------------111 QLEYLEEKRIKEVIKRHSEFVAYPIQLVVTKEVE 1--------------------------------- >3-OXOACYL-[ACYL-CARRIER-P; SWP:Q9NWU1; PDB:2IWZA; SRLHRRVVITGIGLVTPLGVGTHLVWDRLIGGESGIVSLVGEEYKSIPCSVAAYVPRGSD ---------------1111---------------------3333---------------2 EGQFNEQNFVSKSDIKSMSSPTIMAIGAAELAMKDSGWHPQSEADQVATGVAIGMGMIPL 
222-3333--33331111----------------------------------------33 EVVSETALNFQTKGYNKVSPFFVPKILVNMAAGQVSIRYKLKGPNHAVSTACTTGAHAVG 33-----------1111-111133331111-------------------!!!!------- DSFRFIAHGDADVMVAGGTDSCISPLSLAGFSRARALSTNSDPKLACRPFHPKRDGFVMG -------------------------------1111------3333--2222--------- EGAAVLVLEEYEHAVQRRARIYAEVLGYGLSGDAGHITAPDPEGEGALRCMAAALKDAGV --------------1111----------------------1111---------------- QPEEISYINAHATSTPLGDAAENKAIKHLFKDHAYALAVSSTKGATGHLLGAAGAVEAAF 3333---------------------------3333------3333---!!!!-------- TTLACYYQKLPPTLNLDCSEPEFDLNYVPLKAQEWKTEKRFIGLTNSFGFGGTNATLCIA -------------------3333-------------------------2222-------- GL -- >DNA POLYMERASE SLIDING CL; SWP:P57766; PDB:2IX2A; FKIVYPNAKDFFSFINSITNVTDSIILNFTEDGIFSRHLTEDKVLMAIMRIPKDVLSEYS -----------------------------1111------1111--------1111----- IDSPTSVKLDVSSVKKILSKASKATIELTDSGLKIIIRDEKSGAKSTIYIKNLAVNFTTD -----------------1111--------------------------------------- ESVLNVIAADVTLVGEEMRISTEEDKIKIEAGRYVAFLMKDKPLKELSIDTSASSSYSAE ----------------------!!!!------------2222------------------ MFKDAVKGLRGFSAPTMVSFGENLPMKIDVEAVSGGHMIFWIAPRL ------1111---------------------1111----------- >DNA polymerase sliding cl; SWP:Q97Z84; PDB:2IX2B; MKAKVIDAVSFSYILRTVGDFLSEANFIVTKEGIRVSGIDPSRVVFLDIFLPSSYFEGFE -----------------------------3333------1111--------3333----- VSQEKEIIGFKLEDVNDILKRVLKDDTLILSSNESKLTLTFDGEFTRSFELPLIQVESTQ ------------------11112222---------------------------------- PPSVNLEFPFKAQLLTITFADIIDELSDLGEVLNIHSKENKLYFEVIGDLSTAKVELSTD -------------------------1111--------iiii------1111-------11 NGTLLEASGADVSSSYGMEYVANTTKMRRASDSMELYFGSQIPLKLRFKLPQEGYGDFYI 11------------------------1111--------------------%%%%------ APRAD ----- >DNA polymerase sliding cl; SWP:P57765; PDB:2IX2C; NIRLINMKVVYDDVRVLKDIIQALARLVDEAVLKFKQDSVELVALDRAHISLISVNLPRE -----------------------------------1111------1111--------333 MFKEYDVNDEFKFGFNTQYLMKILKVAKRKEAIEIASESPDSVIINIIGSTNREFNVRNL 
3---------------------3333-2222-------1111------------------ EVSEQEIPEINLQFDISATISSDGFKSAISEVSTVTDNVVVEGHEDRILIKAEVEVEFGL -------------------------------------------1111------------- QDLEFSKESKNSYSAEYLDDVLSLTKLSDYVKISFGNQKPLQLFFNMEGGGKVTYLLAPK ---------------------3333----------2222-------1111---------- V - >3-OXOACYL-[ACYL-CARRIER-P; SWP:Q8L3X9; PDB:2IX4A; RRVVVTGLGMVTPLGRGVETTWRRLIDGECGIRGLTLDDLKMKSFDEETKLYTFDQLSSK -----------1111--------------------3333--1111--------1111--- VAAFVPYGSNPGEFDEALWLNSKAVANFIGYAVCAADEALRDAEWLPTEEEEKERTGVSI ---------2222-3333---1111----------------------------------- GGGIGSICDIVEAAQLICEKRLRRLSPFFIPKILVNMASGHVSMKYGFQGPNHAAVTACA --------------------3333-111111111111-------------------!!!! TGAHSIGDATRMIQFGDADVMVAGGTESSIDALSVAGFSRSRALSTKFNSSPQEASRPFD --------------------------------------1111-----11111111-2222 CDRDGFVIGEGSGVIVLEEYEHAKRRGAKIYAELCGYGMSGDAHHITQPPEDGKGAVLAM -------------------------------------------------1111------- TRALRQSGLCPNQIDYVNAHATSTPIGDAVEARAIKTVFSEHATSGTLAFSSTKGATGHL ---------3333----------3333---------------1111------3333---! 
LGAAGAVEAIFSILAIHHGVAPMTLNVKNPDPIFDKRFMPLTTSKKMLVRTAMSNSFGFG !!!---------------------------3333-----------------------222 GTNASLLFASI 2---------- >ACYL-COENZYME A OXIDASE 4; SWP:Q96329; PDB:2IX5A; KSSYFDLPPMEMSVAFPQATPASTFPPCTSDYYHFNDLLTPEEQAIRKKVRECMEKEVAP ---1111---3333------3333----------3333-------------------333 IMTEYWEKAEFPFHITPKLGAMGVAGGSIKGYGCPGLSITANAIATAEIARVDASCSTFI 3----------3333---1111---!!!!-iiii-------------------------- LVHSSLGMLTIALCGSEAQKEKYLPSLAQLNTVACWALTEPDNGSDASGLGTTATKVEGG ---------------------------------------1111--3333-------2222 WKINGQKRWIGNSTFADLLIIFARNTTTNQINGFIVKKDAPGLKATKIPNKIGLRMVQNG ---------2222--------------------------2222----------3333--- DILLQNVFVPDEDRLPGVNSFQDTSKVLAVSRVMVAWQPIGISMGIYDMCHRYLKERKQF ---------3333-1111-3333-----------------------------------%% GAPLAAFQLNQQKLVQMLGNVQAMFLMGWRLCKLYETGQMTPGQASLGKAWISSKARETA %%1111----------------------------1111---------------------- SLGRELLGGNGILADFLVAKAFCDLEPIYTYEGTYDINTLVTGREVTGIASFKPAT ---3333-----3333----------3333-------------------------- >Myosin-Va; SWP:Q99104; PDB:2IX7C; ADKLRAACIRIQKTIRGWLLRKRYLCMQRAAITVQRYVRGYQARCYAKFLRRTKAATT -----------------------------------------------------1111- >ANTIGEN PEPTIDE TRANSPORT; SWP:P36370; PDB:2IXEA; GSLAPLNMKGLVKFQDVSFAYPNHPNVQVLQGLTFTLYPGKVTALVGPNGSGKSTVAALL --------------------1111-------------2222------2222-------11 QNLYQPTGGKVLLDGEPLVQYDHHYLHTQVAAVGQEPLLFGRSFRENIAYGLTRTPTMEE 11----------iiii3333----------------------------2222-------- ITAVAMESGAHDFISGFPQGYDTEVGETGNQLSGGQRQAVALARALIRKPRLLILDNATS ----------------1111------iiii---------------3333---------11 ALDAGNQLRVQRLLYESPEWASRTVLLITQQLSLAERAHHILFLKEGSVCEQGTHLQLME 11--------------3333----------33331111------iiii------------ RGGCYRSMVEA ----------- >DTDP-4-DEHYDRORHAMNOSE 3,; SWP:Q9HU21; PDB:2IXKA; SMSMKATRLAIPDVILFEPRVFGDDRGFFFESYNQRAFEEACGHPVSFVQDNHSRSARGV ----------3333---------3333-----------------------------2222 LRGLHYQIRQAQGKLVRATLGEVFDVAVDLRRGSPTFGQWVGERLSAENKRQMWIPAGFA 
------------------------------1111-2222------3333------2222- HGFVVLSEYAEFLYKTTDFWAPEHERCIVWNDPELKIDWPLQDAPLLSEKDRQGKAFADA --------------------3333----1111---------------3333----3333- DCFP ---- >SERINE/THREONINE-PROTEIN ; SWP:Q15257; PDB:2IXMA; FIIPKKEIHTVPDMGKWKRSQAYADYIGFILTLNEGVKGKKLTFEYRVSEAIEKLLALLN ---------3333---------------------------1111---------------- TLDRWIDETPPVDQPSRFGNKAYRTWYAKLDEEAENLVATVVPTHLAAAVPEVAVYLKES ------------------------------------------33331111------1111 VGNSTRIDYGTGHEAAFAAFLCCLCKIGVLRVDDQIAIVFKVFNRYLEVMRKLQKTYRME ---1111-----------------1111--1111-------------------------- PAGSQGVWGLDDFQFLPFIWGSSQLIDHPYLEPRHFVDEKAVNENHKDYMFLECILFITE ----!!!!------3333------2222---3333---------3333------------ MKTGPFAEHSNQLWNISAVPSWSKVNQGLIRMYKAECLEKFPVIQHFKFGSLLPIHPVTS ----3333-------3333-----------------11113333---------------- >SERINE/THREONINE-PROTEIN ; SWP:Q12461; PDB:2IXNA; PEKRLLTPDDMKLWEESPTRAHFTKFIIDLAESVKGHENSQYKEPISESINSMMNLLSQI --------3333-3333--------------1111--1111-----3333---------- KDITQKHPVIKRFGKVEFRDFYDEVSRNSRKILRSEFPSLTDEQLEQLSIYLDESWGNKR -3333---------3333-----3333---------3333-----------1111---11 RIDYGSGHELNFMCLLYGLYSYGIFNLSNDSTNLVLKVFIEYLKIMRILETKYWLEPAGS 11--3333-----------1111--3333------------------------------- HGVWGLDDYHFLPFLFGAFQLTTHKHLKPISIHNNELVEMFAHRYLYFGCIAFINKVKSS ----------3333----1111-----3333-------------3333------------ ASLRWHSPMLDDISGVKTWSKVAEGMIKMYKAEVLSKLPIMQHFYFSEFLPC -3333-------1111-3333----------------3333-----3333-- >SERINE/THREONINE-PROTEIN ; SWP:P40454; PDB:2IXOA; SLDRVDWPHATFSTPVKRIFDTQTTLDFQSSLAIHRIKYHLHKYTTLISHCSDPDPHATA --------------------3333------------------------------1111-- SSIAMVNGLMGVLDKLAHLIDETPPLPGYGNLACREWHHKLDERLPQWLQEMLPSEYHEV --3333-----------------------------------------------3333111 VPELQYYLGNSFGSSTRLDYGTGHELSFMATVAALDMLGMFPHHRGADVFLLFNKYYTIM 1------1111------------------------1111-22223333------------ RRLILTYTLEPAGLDDHFHLVYILGSSQWQLLDAQAPLQPREILDKSLVREYKDTNFYCQ 
----1111----------3333------11111111--3333-------1111------- GINFINEVKMGPFEEHSPILYDIAVTVPRWSKVCKGLLKMYSVEVEKKFPVVQHFWFGTG -----------3333--------------------------------33331111----- FFPWVNI ------- >SDAI RESTRICTION ENDONUCL; SWP:NA; PDB:2IXSA; NDIDETAATIDTARALLKSFGFEAQRHNVRSAVTLLALAGLKPGDHWADSTTPRLGVQKI ---3333----------1111-3333---------------22223333----------- MDWSGAYWAKPYATGSREDFRKKTLRQWVDNGFAVLNPDNLNIATNSQLNEYCLSDEAAQ ------------1111------------1111----3333---1111------------- AIRSYGTDAFESALVDFLSKASDTVRARAEALRAAMISVDLADGDEFLLSPAGQNPLLKK -1111-1111-------1111-----------1111-----2222------!!!!----- MVEEFMPRFAPGAKVLYIGDTRGKHTRFEKRIFEETLGLTFDPHGRMPDLVLHDKVRKWL ----3333-2222----------------------------1111--------------- FLMEAVKSKGPFDEERHRTLRELFATPVAGLVFVNCFENREAMRQWLPELAWETEAWVAD ------3333-----------11111111---------333333331111-------111 DPDHLIHLNGSRFLGPYER 1--------3333------ >Small ubiquitin-related m; SWP:P63165; PDB:2IY1B; EYIKLKVIGQDSSEIHFKVKMTTHLKKLKESYCQRQGVPMNSLRFLFEGQRIADNHTPKE --------1111-------1111----------1111-3333----iiii--11113333 LGMEEEDVIEVYQEQTGGHSTVC ---2222---------------- >SUBA; SWP:Q6EZC2; PDB:2IY9A; KPWYFDAIGLTETTMSLTDKNTPVVVSVVDSGVAFIGGLSDSEFAKFSFTQDGSPFPVKK -1111-----333311111111-------------!!!!----------1111------- SEALYIHGTAMASLIASRYGIYGVYPHALISSRRVIPDGVQDSWIRAIESIMSNVFLAPG ------------------------3333-------------------------1111222 EEKIINISGGQKGVSASVWTELLSRMGRNNDRLIVAAVGNDGADIRKLSAQQRIWPAAYH 2------------------------------------------3333-3333-------- PVSSVNKKQDPVIRVAALAQYRKGETPVLHGGGITGSRFGNNWVDIAAPGQNITFLRPDA ---------------------2222--------------2222-------------1111 KTGTGSGTSEATAIVSGVLAAMTSCNPRATATELKRTLLESADKYPSLVDKVTEGRVLNA ---------------------33331111---------------33331111%%%%---- EKAISMFCK --------- >APPA, ANTIREPRESSOR OF PP; SWP:Q3J677; PDB:2IYGA; GSDLVSCSYRSLAAPDLTLRDLLDIVETSQAHNARAQLTGALFYSQGVFFQWLEGRPAAV -------------1111---------------------------iiii------------ AEVMTHIQRDRRHSNVEILAEEPIAKRRFAGWHMQLSCSEADMRSLGLA 
---------1111---------------2222----------------- >RIBONUCLEOSIDE-DIPHOSPHAT; SWP:NA; PDB:2IYHA; GVEDEPLLRENPIFPIEYHDIWQMYKKAEASFWTAEEVDLSKDIQHWESLKPEERYFISH -3333-----------------------1111-1111--1111--3333-3333------ VLAFFAASDGIVNENLVERFSQEVQITEARCFYGFQIAMENIHSEMYSLLIDTYIKDPKE --------------------------------------------------------3333 REFLMPCVKKKADWALRWIGDKEATYGERVVAFAAVEGIFFSGSFASIFWLKKRGLMPGL --------3333--------------------------1111--------1111------ TFSNELISRDEGLHCDFACLMFKHLVHKPSEERVREIIINAVRIEQEFLTEALPVKLIGM ---------------------1111----------------------------3333--- NCTLMKQYIEFVADRLMLELGFSKVFRVENPFDFM -----------------1111-------------- >REGULATOR OF NONSENSE TRA; SWP:Q92900; PDB:2IYKA; LPIHACSYCGIHDPACVVYCNTSKKWFCNGRGNTSGSHIVNHLVRAKCKEVTLHKDGPLG -1111-------1111--------------!!!!---------1111------1111--- ETVLECYNCGCRNVFLLGFIPAKADSVVVLLCRQPCASQSSLKDINWDSSQWQPLIQDRC ------------1111------------------1111---%%%%--3333-----%%%% FLSWLVKIPSEQEQLRARQITAQQINKLEELWKEN -3333--------3333--------------3333 >SHIKIMATE KINASE; SWP:P0A4Z2; PDB:2IYVA; APKAVLVGLPGSGKSTIGRRLAKALGVGLLDTDVAIEQRTGRSIADIFATDGEQEFRRIE --------22223333-------------------------------------------- EDVVRAALADHDGVLSLGGGAVTSPGVRAALAGHTVVYLEISAAEGVRRTGGNTVRPLLA -----------------1111---------2222-------------1111------111 GPDRAEKYRALMAKRAPLYRRVATMRVDTNRRNPGAVVRHILSRLQVPSPSEAATLEHH 1-----------------------------------------------1111--1111- >6-PHOSPHOGLUCONATE DEHYDR; SWP:P96789; PDB:2IZ1A; MAQANFGVVGMAVMGKNLALNVESRGYTVAIYNRTTSKTEEVFKEHQDKNLVFTKTLEEF -----------3333-------------------3333-------1111------3333- VGSLEKPRRIMLMVQAGAATDATIKSLLPLLDIGDILIDGGNTHFPDTMRRNAELADSGI --------------------------3333-2222--------3333------------- NFIGTGVSGGEKGALLGPSMMPGGQKEAYDLVAPIFEQIAAKAPQDGKPCVAYMGANGAG -------------------------------------------------------!!!!- HYVKMVHNGIEYGDMQLIAESYDLLKRILGLSNAEIQAIFEEWNEGELDSYLIEITKEVL ---------------------------------------------1111--------333 
KRKDDEGEGYIVDKILDKAGNKGTGKWTSESALDLGVPLPLITESVFARYISTYKDERVK 3--------3333------------------------------------3333------- ASKVLSGPALDFSGDKKEVIEKIRKALYFSKIMSYAQGFAQLRKASEEFDWDLPYGTIAQ -----------------------------------------------------------1 IWRAGCIIRAEFLQNITDAFDKDSELENLLLDDYFVDITKRYQEAVRDVVSLAVQAGTPI 111------1111---------1111-3333----------------------------3 PTFTSAISYYDSYRSENLPANLIQAQRDYFGAHTYERTDKAGIFHYDWYT 333--------1111----------------------------------- >BETA-MICROSEMINOPROTEIN; SWP:P08118; PDB:2IZ3A; SCYFIPNEGVPGDSTRKCMDLKGNKHPINSEWQTDNCETCTCYETEISCCTLVSTPVGYD -------------------3333------------------------------------3 KDNCQRIFKKEDCKYIVVEKKDPKKTCSVSEWII 333---------------3333------------ >BETA-MICROSEMINOPROTEIN; SWP:O02826; PDB:2IZ4A; QCYFIPNQSLKPNECQDLKGVSHPLNSVWKTKDCEECTCGQDAISCCNTAAIPTGYDTNK ----------1111--1111----------1111----------------------1111 CQKILNKKTCTYTVVEKKDPGKTCDVTGWVL ---------------3333------------ >MOLYBDENUM COFACTOR CARRI; SWP:Q8RV61; PDB:2IZ6A; GRKPIIGVMGPGKADTAENQLVMANELGKQIATHGWILLTGGRSLGVMHEAMKGAKEAGG ----------------3333-----------1111--------------------1111- TTIGVLPGISDAVDIPIVTGLGSARDNINALSSNVLVAVGMGPGTAAEVALALKAKKPVV ---------3333---------------3333--------------------1111---- LLGTQPEAEKFFTSLDAGLVHVAADVAGAIAAVKQLLAK ---------------3333-------------------- >FLAP STRUCTURE-SPECIFIC E; SWP:Q980U8; PDB:2IZOA; MDLVKDVKRELSFSELKGKRVSIDGYNALYQFLAAIRQPPLMDSQGRVTSHLSGLFYRTI -----------3333---------------------------1111-------------- NILEEGVIPIYVFDGSNIMVEESKKLLRAMGIPIVQAPSEGEAEAAYLNKLGLSWAAASQ ------------------------------------------------------------ DYDAILFGAKRLVRNLTIYVEIKPELIETEILLKKLGITREQLIDIGILIGTDYNPDGIR -3333----------------------3333--------------------3333---22 GIGPERALKIIKKYGKIIDEIRGLFLNPQVVKPEALDLNEPNGEDIINILVYEHNFSEER 22---------------33333333-----------------1111-----1111-3333 VKNGIERLTKAIKEAKGASRQTGLDRWF ----------------3333--1111-- >PUTATIVE MEMBRANE ANTIGEN; SWP:Q63K37; PDB:2IZPA; 
AGARAMTDDDLRAAGVDRRVPEQKLGAAIDEFASLRLPDRIDGRFVDGRRANLTVFDDAR ----------3333--3333-----------1111-----iiii---------------- VAVRGHARAQRNLLERLETELLGGTLDTAGDEGGIQPDPILQGLVDVIGQGKSDIDAYAT ------------------------------------------------------------ IVEGLTKYFQSVADVMSKLQDYISAKDDKNMKIDGGKIKALIQQVIDHLPTMQLPKGADI ----------------------------------------------------------33 ARWRKELGDAVSISDSGVVTINPDKLIKMRDSLPPDGTVWDTARYQAWNTAFSGQKDNIQ 33-----3333-----------------------2222---------------------- NDVQTLVEKYSHQNSNFDNLVKVLSGAISTLT -------------------------------- >CASEIN KINASE I ISOFORM G; SWP:Q9Y6M4; PDB:2IZRA; VLMVGPNFRVGKKIGCNFGELRLGKNLYTNEYVAIKLEPMKSRAPQLHLEYRFYKQLGSG ----------------iiii------------------1111-----------------2 DGIPQVYYFGPCGKYNAMVLELLGPSLEDLFDLCDRTFSLKTVLMIAIQLISRMEYVHSK 222--------!!!!----------------1111------------------------- NLIYRDVKPENFLIGRPGNKTQQVIHIIDFALAKEYIDPETKKHIPYREHKSLTGTARYM -------3333----3333-1111-----1111-----------------------1111 SINTHLGKEQSRRDDLEALGHMFMYFLRGSLPWQGLKADTLKERYQKIGDTKRATPIEVL ----------3333----------------1111---------------------3333- CENFPEMATYLRYVRRLDFFEKPDYDYLRKLFTDLFDRKGYMFDYEYDWIGKQLPTPV 2222---------11111111---------------1111------1111-------- >SUPPRESSOR OF CYTOKINE SI; SWP:Q8WXH5; PDB:2IZVA; NLYFQSMLVPDLLQINNNPCYWGVMDKYAAEALLEGKPEGTFLLRDSAQEDYLFSVSFRR --------1111-----1111------------22222222-------1111-------% YSRSLHARIEQWNHNFSFDAHDPCVFHSPDITGLLEHYKDPSACMFFEPLLSTPLIRTFP %%%--------iiii---1111--------3333-11111111-1111------------ FSLQHICRTVICNCTTYDGIDALPIPSSMKLYLKEYHYKSKVR ----------3333-33331111--------1111-------- >AKAP-IS; SWP:P13861; PDB:2IZXA; IPPGLTELLQGYTVEVLRQQPPDLVEFAVEYFTRLREAR -2222---------------------------------- >PYRROLINE-5-CARBOXYLATE R; SWP:P32322; PDB:2IZZA; SMSVGFIGAGQLAFALAKGFTAAGVLAAHKIMASSPDMDLATVSALRKMGVKLTPHNKET --------------------------3333----------------3333-----3333- VQHSDVLFLAVKPHIIPFILDEIGADIEDRHIVVSCAAGVTISSIEKKLSAFRPAPRVIR 
-----------3333-------3333-3333-----2222--------3333-------- CMTNTPVVVREGATVYATGTHAQVEDGRLMEQLLSSVGFCTEVEEDLIDAVTGLSGSGPA ---3333-----------11113333--------1111-----3333-------1111-- YAFTALDALADGGVKMGLPRRLAVRLGAQALLGAAKMLLHSEQHPGQLKDNVSSPGGATI -------------1111--------------------------3333-3333-2222--- HALHVLESGGFRSLLINAVEASCIRTRELQSM -------------------------------- >30S ribosomal protein S2; SWP:P80371; PDB:2J00B; VKELLEAGVHFGHERKRWNPKFARYIYAERNGIHIIDLQKTMEELERTFRFIEDLAMRGG ------------------3333--------------3333--------------1111-- TILFVGTKKQAQDIVRMEAERAGMPYVNQRWLGGMLTNFKTISQRVHRLEELEALFASPE ------------3333------------------33333333-----------3333333 IEERPKKEQVRLKHELERLQKYLSGFRLLKRLPDAIFVVDPTKEAIAVREARKLFIPVIA 3----------------3333-------------------3333---------------- LADTDSDPDLVDYIIPGNDDAIRSIQLILSRAVDLIIQARGGVVEPSPSYALVQE ------------------------------------------------------- >30S ribosomal protein S3; SWP:P80372; PDB:2J00C; GNKIHPIGFRLGITRDWESRWYAGKKQYRHLLLEDQRIRGLLEKELYSAGLARVDIERAA ---------------------------3333---------3333-3333----------- DNVAVTVHVAKPGVVIGRGGERIRVLREELAKLTGKNVALNVQEVQNPNLSAPLVAQRVA ----------3333--2222--------------------------1111---------- EQIERRFAVRRAIKQAVQRVMESGAKGAKVIVSGRIGGAEQARTEWAAQGRVPLHTLRAN ----------------------------------2222--------------1111---- IDYGFALARTTYGVLGVKAYIFLGEVI ---------1111-------------- >30S ribosomal protein S4; SWP:P80373; PDB:2J00D; GRYIGPVCRLCRREGVKLYLKGERCYSPKCAMERRPYPPGQHGQKRARRPSDYAVRLREK ------------------------------3333-------------------------- QKLRRIYGISERQFRNLFEEASKKKGVTGSVFLGLLESRLDNVVYRLGFAVSRRQARQLV --3333--------------------------------------1111------------ RHGHITVNGRRVDLPSYRVRPGDEIAVAEKSRNLELIRQNLEAMKGRKVGPWLSLDVEGM -------------3333-----------3333-----------------1111------- KGKFLRLPDREDLALPVNEQLVIEFYSR ---------3333--------------- >30S ribosomal protein S5; SWP:Q5SHQ5; PDB:2J00E; DFEEKMILIRRTARMQAGGRRFRFGALVVVGDRQGRVGLGFGKAPEVPLAVQKAGYYARR 
------------------------------------------------------------ NMVEVPLQNGTIPHEIEVEFGASKIVLKPAAPGTGVIAGAVPRAILELAGVTDILTKELG ----------------------------------------------3333---------- SRNPINIAYATMEALRQLRTKADVERLRKGE -------------3333------3333---- >30S ribosomal protein S8; SWP:Q5SHQ2; PDB:2J00H; MLTDPIADMLTRIRNATRVYKESTDVPASRFKEEILRILAREGFIKGYERVDVDGKPYLR ---------------------------------------1111----------------- VYLKYGPRRQGPDPRPEQVIHHIRRISKPGRRVYVGVKEIPRVRRGLGIAILSTSKGVLT ---------------------------1111----1111----iiii------------- DREARKLGVGGELICEVW ------------------ >30S ribosomal protein S10; SWP:Q5SHN7; PDB:2J00J; KIRIKLRGFDHKTLDASAQKIVEAARRSGAQVSGPIPLPTRVRRFTVIRGPFKHKDSREH ---------------------11111111------------------------------- FELRTHNRLVDIINPNRKTIEQLMTLDLPTGVEIEIKTV ----------------------1111------------- >30S ribosomal protein S11; SWP:P80376; PDB:2J00K; KRQVASGRAYIHASYNNTIVTITDPDGNPITWSSGGVIGYKGSRKGTPYAAQLAALDAAK -------------1111------1111------3333-----3333-3333--------- KAMAYGMQSVDVIVRGTGAGREQAIRALQASGLQVKSIVDDTPVPHNGCRPKKKFRKAS --1111--------------33333333-----------------------3333---- >30S ribosomal protein S12; SWP:Q5SHN3; PDB:2J00L; PTINQLVRKGREKVRKKSKVPALKGAPFRRGVCTVVRTVTPKKPNSALRKVAKVRLTSGY ------------------------------------------------------------ EVTAYIPGEGHNLQEHSVVLIRGGRVKDLPGVRYHIVRGVYDAAGVKDRKKSRSKYGTKK ---------------------------------------!!!!----------------- PKEAA ----- >30S ribosomal protein S14; SWP:Q5SHQ1; PDB:2J00N; ARKALIEKAKRTPKFKVRAYTRCVRCGRARSVYRFFGLCRICLRELAHKGQLPGVRKASW --------------3333---------------------------3333--2222----- >30S ribosomal protein S16; SWP:Q5SJH3; PDB:2J00P; MVKIRLARFGSKHNPHYRIVVTDARRKRDGKYIEKIGYYDPRKTTPDWLKVDVERARYWL ----------------------3333---------------------------------1 SVGAQPTDTARRLLRQAGVFRQEA 111---1111-------------- >30S ribosomal protein S17; SWP:Q5SHP7; PDB:2J00Q; PKKVLTGVVVSDKMQKTVTVLVERQFPHPLYGKVIKRSKKYLAHDPEEKYKLGDVVEIIE 
------------------------------------------------------------ SRPISKRKRFRVLRLVESGRMDLVEKYLIRRQNYESLSKR ---------------------------------1111--- >30S ribosomal protein S19; SWP:Q5SHP2; PDB:2J00S; SLKKGVFVDDHLLEKVLELNAKGEKRLIKTWSRRSTIVPEMVGHTIAVYNGKQHVPVYIT --------1111-------------------1111--2222------------------- ENMVGHKLGEFAPTRTYRG ------3333--------- >30S ribosomal protein S20; SWP:P80380; PDB:2J00T; RNLSALKRHRQSLKRRLRNKAKKSAIKTLSKKAIQLAQEGKAEEALKIMRKAESLIDKAA ---------------------------------------------------------333 KGSTLHKNAAARRKSRLMRKVRQLLEAAGAPLIGGGLSA 3-----3333----------------------------- >50S ribosomal protein L28; SWP:P60494; PDB:2J011; SGKRPIVANSIQRRGKAKREGGVGKKTTGISKRRQYPNLQKVRVRVAGQEITFRVAASHI ----------------------------------------------------------33 PKVYELVERAKGLKLEGLSPKEIKKELLK 33--3333--------------------- >50S ribosomal protein L29; SWP:Q5SHP6; PDB:2J012; EARKLSPVELEKLVREKKRELMELRFQASIGQLSQNHKIRDLKRQIARLLT -----------------3333------------------------------ --------------------------------------------- ------------------------------------------------- >50S ribosomal protein L35; SWP:Q5SKU1; PDB:2J018; PKMKTHKGAKKRVKITASGKVVAMKTGKRHLNWQKSGKEIRQKGRKFVLAKPEAERIKLL ------3333----------------------------3333---------3333----- LPYE ---- >50S ribosomal protein L2; SWP:P60405; PDB:2J01D; AVKKFKPYTPSRRFMTVADFSEITKTEPEKSLVKPLKKTGGRNNQGRITVRFRGGGHKRL ------------------------------------------------------------ YRIIDFKRWDKVGIPAKVAAIEYDPNRSARIALLHYVDGEKRYIIAPDGLQVGQQVVAGP -------3333------------1111--------3333-------------------33 DAPIQVGNALPLRFIPVGTVVHAVELEPKKGAKLARAAGTSAQIQGREGDYVILRLPSGE 33---------1111-----------2222-----------------!!!!----3333- LRKVHGECYATVGAVGNADHKNIVLGKAGRSRWLGRRPHVRGAAMNPVDHPHGGGEGRAP ----3333--------3333-------------------------3333----------- RGRPPASPWGWQTKGLKTRKRRKPSSRFIIAR ------1111---------------------- >50S ribosomal protein L3; SWP:Q5SHN8; PDB:2J01E; MKGILGVKVGMTRIFRDDRAVPVTVILAGPCPVVQRRTPEKDGYTAVQLGFLPQNPKRVN 
-------------------------------------3333------------------- RPLKGHFAKAGVEPVRILREIRDFNPEGDTVTVEIFKPGERVDVTGTSKGRGFAGVMKRW -------------1111-------------------2222-------------------- NFAGGPDSHGAHKIHRHPGSIGNRKTPGRVYKGKKMAGHYGAERVTVMNLEVVDVIPEEN ------------------------------------------------------------ LLLVKGAVPGPNGGLVIVRETKKAA ------------------------- >50S ribosomal protein L4; SWP:Q5SHN9; PDB:2J01F; MKEVAVYQIPVLSPSGRRELAADLPAEINPHLLWEVVRWQLAKRRRGTASTKTRGEVAYS --------------------1111-----------------1111--------3333--- GRKIWPQKHTGRARHGDIGAPIFVGGGVVFGPKPRDYSYTLPKKVRKKGLAMAVADRARE -------------------1111------------------3333----------3333- GKLLLVEAFAGVNGKTKEFLAWAKEAGLDGSESVLLVTGNELVRRAARNLPWVVTLAPEG -----------------------1111----------------------1111---1111 LNVYDIVRTERLVMDLDAWEVFQNRIGG ----------------3333-------- >50S ribosomal protein L6; SWP:Q5SHQ3; PDB:2J01H; PKGVSVEVAPGRVKVKGPKGELEVPVSPEMRVVVEEGVVRVERPSDERRHKSLHGLTRTL ----------------------------------------------33333333------ IANAVKGVSEGYSKELLIKGIGYRARLVGRALELTVGFSHPVVVEPPEGITFEVPEPTRV ---------------------------!!!!---------------2222---------- RVSGIDKQKVGQVAANIRAIRKPSAYHEKGIYYAGEPVRL -----------------3333------------------- >50S ribosomal protein L9; SWP:Q5SLQ1; PDB:2J01I; MKVILLEPLENLGDVGQVVDVKPGYARNYLLPRGLAVLATESNLKALEARIRAQAKRLAE --------2222------------------1111-----------------11111111- RKAEAERLKEILENLTLTIPVRAGETKIYGSVTAKDIAEALSRQHGVTIDPKRLALEKPI -------1111---------------------3333---------------3333----- KELGEYVLTYKPHPEVPIQLKVSVVV -------------------------- >50S ribosomal protein L13; SWP:P60488; PDB:2J01N; MKTYVPKQVEPRWVLIDAEGKTLGRLATKIATLLRGKHRPDWTPNVAMGDFVVVVNADKI ------------------------------------------1111-------------- RVTGKKLEQKIYTRYSGYPGGLKKIPLEKMLATHPERVLEHAVKGMLPKGPLGRRLFKRL ------------------------------------------1111-------------- KVYAGPDHPHQAQRPEKLE ------------------- >50S ribosomal protein L14; SWP:Q5SHP8; PDB:2J01O; 
MIQPQTYLEVADNTGARKIMCIRVLKGSNAKYATVGDVIVASVKEAIPRGAVKEGDVVKA ---------------------------------2222----------------------- VVVRTKKEIKRPDGSAIRFDDNAAVIINNQLEPRGTRVFGPVARELREKGFMKIVSLAPE ----------1111-------------1111---------------3333---------- VL -- >23S RIBOSOMAL RNA; SWP:NA; PDB:2J01P; DLRPNPGANKRRKRVGRGPGSGHGKTATRGHKGQKSRSGGLKDPRRFEGGRSTTLMRLPK ------------------------------------------------------------ RGMQGQVPGEIKRPRYQGVNLKDLARFEGEVTPELLVRAGLLKKGYRLKILGEGEAKPLK -----------------------3333--------------------------------- VVAHAFSKSALEKLKAAGGEPVLLEA ----------3333------------ >50S ribosomal protein L16; SWP:P60489; PDB:2J01Q; RMKYRKQQRGRLKGATKGGDYVAFGDYGLVALEPAWITAQQIEAARVAMVRHFRRGGKIF -------------------------------------3333--------3333------- IRIFPDKPYTKKPLEVRMGKGKGNVEGYVAVVKPGRVMFEVAGVTEEQAMEALRIAGHKL --------------------------------------------3333-------3333- PIKTKIVRRDAYDEAQ ---------------- >50S ribosomal protein L20; SWP:P60491; PDB:2J01U; PRAKTGVVRRRKHKKILKLAKGYWGLRSKSFRKARETLFAAGNYAYAHRKRRKRDFRRLW ------------------------3333-------------------------------- IVRINAACRQHGLNYSTFIHGLKKAGIEVDRKNLADLAVREPQVFAELVERAKAAQG -------3333--------------------33333333------------------ >50S ribosomal protein L21; SWP:P60492; PDB:2J01V; MFAIVKTGGKQYRVEPGLKLRVEKLDAEPGATVELPVLLLGGEKTVVGTPVVEGASVVAE ------%%%%----------------------------------------2222------ VLGHGRGKKILVSKFKAKVQYRRKKGHRQPYTELLIKEIRG ----------------------------------------- >50S ribosomal protein L23; SWP:Q5SHP0; PDB:2J01X; TAYDVILAPVLSEKAYAGFAEGKYTFWVHPKATKTEIKNAVETAFKVKVVKVNTLHVRGK -----------33333333---------------------1111---------------- KKRLGRYLGKRPDRKKAIVQVAPGQKIEALEGL ----------------------------2222- >50S ribosomal protein L24; SWP:Q5SHP9; PDB:2J01Y; RVKMHVKKGDTVLVASGKYKGRVGKVKEVLPKKYAVIVEGVNIVKKAVRVSPKYPQGGFI ------------------------------------------------------------ EKEAPLHASKVRPICPACGKPTRVRKKFLENGKKIRVCAKC ------3333------------------------------- >RAS GTPASE-ACTIVATING PRO; SWP:P20936; 
PDB:2J05A; SHRRRVRAILPYTKVPDTDEISFLKGDFIVHNELEDGWWVTNLRTDEQGLIVEDLVEEVG --------------2222-----2222------------------------3333----- R - >BETA-1,3-N-ACETYLGLUCOSAM; SWP:O09008; PDB:2J0AA; ELQLGDIFIAVKTTWAFHRSRLDLLLDTWVSRIRQQTFIFTDSPDERLQERLGPHLVVTQ --3333-------3333----------3333-3333---------------!!!!----- CALSCKMAAEFDAFLVSGLRWFCHVDDDNYVNPKALLQLLKTFPQDRDVYVGKPSLFWFA --------------3333-------1111----------33331111------------- TGGAGFCINRQLALKMVPWASGSHFVDTSALIRLPDDCTVGYIIECKLGGRLQPSPLFHS 1111--------------1111-----3333----------------------------- HLETLQLLGAAQLPEQVTLSYGVFEGKLNVIKLPGPFSHEEDPSRFRSLHCLLYPDTPW ---3333-33331111-------%%%%----------33333333--------3333-- >6-PHOSPHOGLUCONOLACTONASE; SWP:Q9GRG6; PDB:2J0EA; SFKPTISVHATPQELSAAGCRKIVEIIEASGSQQWPLSIALAGGSTPKMTYARLHDEHLN ----------3333----------------1111---------3333------------- LLREKRALRFFMGDERMVPADSTDSNYNMAREVLLHDIPDDLVFPFDTSAVTPSAEATSA --1111------------1111------------11111111-----33333333----- DAMRVAEAYGKQLASLLPLKSVGEAGPKVPVFDVVLLGLGSDGHTASIFPGSQAEKETDG ----------------------2222-------------1111-----22223333---- KVVVSVGFPSETMKPKVWRVTLSPATIMQARNVIVLATGAEKKWVVDGILADTAHKAPVA ---------1111-------------1111--------1111---------------333 RFLRGCEGNVSFLLDKEIAENLA 33333----------3333---- >SERINE/THREONINE-PROTEIN ; SWP:O96013; PDB:2J0IA; SHEQFRAALQLVVDPGDPRSYLDNFIKIGEGSTGIVCIATVRSSGKLVAVKKMDLRKQQR -------3333-----1111----------1111-------------------1111--3 RELLFNEVVIMRDYQHENVVEMYNSYLVGDELWVVMEFLEGGALTDIVTHTRMNEEQIAA 333-----1111---1111--------!!!!-------1111------------------ VCLAVLQALSVLHAQGVIHRDIKSDSILLTHDGRVKLSDFGFCAQVSKEVPRRKLVGTPY ----------------------3333---1111------1111---3333-------333 WMAPELISRLPYGPEVDIWSLGIMVIEMVDGEPPYFNEPPLKAMKMIRDNLPPRLKNLHK 3----1111-----------------------2222--------------------3333 VSPSLKGFLDRLLVRDPAQRATAAELLKHPFLAKAGPPASIVPLMRQNR ---------------3333--3333---3333----333333333333- >INVASIN IPAD; SWP:P18013; PDB:2J0NA; DINEQYLKVYEHAVSSYTQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALEELK 
--1111-----------------------3333--------------------------- EKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQKNGGYVVSINMTPIDNMLKSLDNL 1111--------------------------------!!!!-------------------- GGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQKYSNANSIFDNLVKV --------------------------------------------------- >HEMIN TRANSPORT PROTEIN H; SWP:P31517; PDB:2J0PA; SIYEQYLQAKADNPGKYARDLATLMGISEAELTHSRVSHDAKRLKGDARALLAALEAVGE 3333--------11113333--1111-------1111----------------3333--- VKAITRNTYAVHEQMGRYENQHLNGHAGLILNPRNLDLRLFLNQWASAFTLTEETRHGVR ------1111---------------------2222-----3333----------1111-- HSIQFFDHQGDALHKVYVTEQTDMPAWEALLAQFITTENPELQLEPATDEAVDAEWRAMT ------1111--------1111---------------------------------1111- DVHEFFQLLKRNNLTRQQAFRAVGNDLAYQVDNSSLTQLLNIAQQEQNEIMIFVGNRGCV 3333-------------------1111----1111------------------------- QIFTGMIEKVTPHQDWINVFNQRFTLHLIETTIAESWITRKPTKDGFVTSLELFAADGTQ ------------!!!!------------3333----------1111--------1111-- IAQLYGQRTEGQPEQTQWRDQIARLNNK --------2222---------1111--- >ATP-DEPENDENT RNA HELICAS; SWP:P38919; PDB:2J0SA; EDMTKVEFETSEEVDVTPTFDTMGLREDLLRGIYAYGFEKPSAIQQRAIKQIIKGRDVIA --1111----1111----3333-----------3333----3333------1111----- QSQSGTGKTATFSISVLQCLDIQVRETQALILAPTRELAVQIQKGLLALGDYMNVQCHAC --2222----------11113333-----------------------1111--------- IGGTNVGEDIRKLDYGQHVVAGTPGRVFDMIRRRSLRTRAIKMLVLDEADEMLNKGFKEQ ----3333-----------------------------1111--------3333------- IYDVYRYLPPATQVVLISATLPHEILEMTNKFMTDPIRILVKRDELTLEGIKQFFVAVER ---3333--------------3333--3333----------1111--1111--------3 EEWKFDTLCDLYDTLTITQAVIFCNTKRKVDWLTEKMREANFTVSSMHGDMPQKERESIM 333-------3333-----------------------1111------1111--------- KEFRSGASRVLISTDVWARGLDVPQVSLIINYDLPNNRELYIHRIGRSGRYGRKGVAINF -------------3333-----1111----------3333----3333-iiii------- VKNDDIRILRDIEQYYSTQIDEMPMNVADLI -!!!!-------------------------- >Protein CASC3; SWP:O15234; PDB:2J0ST; HLDDDEDRKNPAYIPRKGLFFEHDLRGQEGRWEHDKFREDEQAP --11111111-------3333------------11113333--- 
>Metalloproteinase inhibit; SWP:P01033; PDB:2J0TD; CTCVPPHPQTAFCNSDLVIRAKFVGTPEVNQTTLYQRYEIKMTKMYKGFQALGDAADIRF --------3333------------------------------------------------ VYTPAMESVCGYFHRSHNRSEEFLIAGKLQDGLLHITTCSFVAPWNSLSLAQRRGFTKTY -----3333---------------------------1111---3333-----------33 TVGC 33-- >RAC-LIKE GTP-BINDING PROT; SWP:NA; PDB:2J0VA; HMSVSKFIKCVTVGDGAVGKTCMLICYTSNKFPTDYIPTVFDNFSANVAVDGQIVNLGLW --------------2222------------------------------------------ DTAGQEDYSRPLSYRGADIFVLAFSLISKASYENVLKKWMPELRRFAPNVPIVLVGTKLD ----------3333----------1111------------------1111-------333 LRDDKGYLADHTNVITSTQGEELRKQIGAAAYIECSSKTQQNVKAVFDTAIKVVLQP 3------1111-------------------------1111----------------- >LYSINE-SENSITIVE ASPARTOK; SWP:P08660; PDB:2J0WA; EIVVSKFGGTSVADFDAMNRSADIVLSDANVRLVVLSASAGITNLLVALAEGLEPGERFE --------1111-----------11111111-------2222------1111-------- KLDAIRNIQFAILERLRYPNVIREEIERLLENITVLAEAAALATSPALTDELVSHGELMS -----------1111--------------------------------------------- TLLFVEILRERDVQAQWFDVRKVMRTNDRFGRAEPDIAALAELAALQLLPRLNEGLVITQ ------------------3333--------------3333---------3333------- GFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVPGIYTTDPRVVSAAKRIDE --------------2222-----------------------------11111111----- IAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRAGGTLVCNKTENPPLFRAL ---------111133331111--------------1111--------------------- ALRRNQTLLTLHSLNMLHSRGFLAEVFGILARHNISVDLITTSEVSVALTLDTTGSTSTG ------------------------------------------!!!!-------------- DTLLTQSLLMELSALCRVEVEEGLALVALIGNDLSKACGVGKEVFGVLEPFNIRMICYGA ----3333------------------------3333--------1111------------ SSHNLCFLVPGEDAEQVVQKLHSNLFE ---------3333-------------- >FIBER PROTEIN; SWP:Q64823; PDB:2J12A; RTLWTTPDTSPNCTIAQDKDSKLTLVLTKCGSQILANVSLIVVAGKYHIINNKTNPKIKS -----------------------------!!!!-----------1111--33331111-- FTIKLLFNKNGVLLDNSNLGKAYWNFRSGNSNVSTAYEKAIGFMPNLVAYPKPSNSKKYA -------1111--1111----------!!!!--------3333----------------- 
RDIVYGTIYLGGKPDQPAVIKTTFNQETGCEYSITFNFSWSKTYENVEFETTSFTFSYIA --------22221111-------------------------------------------- QE -- >POLYSACCHARIDE DEACETYLAS; SWP:Q81Z49; PDB:2J13A; MAYTNTPHNWGIAGKLYTDLLQKNGGFYLGDTKKKDIYLTFDNGYENGYTGKILDVLKEK ------------------------------------------------------------ KVPATFFVTGHYIKTQKDLLLRMKDEGHIIGNHSWSHPDFTAVNDEKLREELTSVTEEIK -----------------------1111-----------1111------------------ KVTGQKEVKYVRPPRGVFSERTLALTKEMGYYNVFWSLAFLDWIHPGSILLLHAISKDNA ------------2222----------1111--------------2222-------1111- EALAKIIDDLREKGYHFKSLDDLVKSN ----------1111------------- >HYALURONIDASE; SWP:Q8XL08; PDB:2J1AA; NPRTVKITASSEETSGENAPASFASDGDMNTFWHSKWSSPAHEGPHHLTLELDNVYEINK -------------------3333----1111----------------------------- VKYAPRQDSKNGRITGYKVSVSLDGENFTEVKTGTLEDNAAIKFIEFDSVDAKYVRLDVT ---------2222----------------------------------------------- DSVSDQGRGKFATAAEVNVHG ------2222----------- >FICOLIN-2; SWP:Q6IS69; PDB:2J1GA; GPRTCKDLLDRGHFLSGWHTIYLPDCRPLTVLCDMDTDGGGWTVFQRRVDGSVDFYRDWA --------1111----------1111----------iiii-------------------- TYKQGFGSRLGEFWLGNDNIHALELRTDLVDFEDNYQFAKYRSFKVADEAEKYNLVLGAF -------3333-------------------1111-------------3333--------- VEGSAGDSLTFSTKDQDNDLNTGNCAVMFQGAWWYHTSNLNGRYLRGTHGSFANGINWKS --3333------1111--------3333----------1111------------------ GKGYNYSYKVSEMKVRP ----------------- >GERANYLGERANYL PYROPHOSPH; SWP:Q43133; PDB:2J1PA; PISYIIRKADSVNKALDSAVPLREPLKIHEARYSLLAGGKRVRPVLCIAACELVGGEESL 3333-----------------------------1111--------------1111-3333 APAACAVEIHTSLIHDDLPCDNDDLRRGKPTNHKVYGEDVAVLAGDALLSFAFEHLASAT -------------------------%%%%-3333-------------------------- SSEVSPARVVRAVGELAKAIGTEGLVAGQVVDISLDLNNVGLEHLKFIHLHKTAALLEAS 33333333------------1111-----------3333--------------------- AVLGGIIGGGSDEEIERLRKFARCIGLLFQVVDDILDVTKKLTYPKLGLEKSREFAEKLN ------------------------------------------3333-------------- TEARDQLLGFDSDKVAPLLALANYIANRQN ----1111--3333----------1111-- >FUCOLECTIN-RELATED PROTEI; 
SWP:Q97N96; PDB:2J1VA; KFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTRADNPSWWEVDLKK --%%%%-3333---------%%%%------------1111-------------------- MDKVGLVKIYNRTDAETQRLSNFDVILYDNNRNEVAKKHVNNLSGESVSLDFKEKGARYI ---------------3333---------1111-------------------iiii----- KVKLLTSGVPLSLAEVEVFRES ---------------------- >CELLULAR TUMOR ANTIGEN P5; SWP:P04637; PDB:2J21A; SVPSQKTYQGSYGFRLGFLHSVTCTYSPALNKLFCQLAKTCPVQLWVDSTPPPGTRVRAM ---------1111--------------1111----2222------------2222----- AIYKQSQHMTEVVRRCPHHERCSDSDGLAPPQHLIRVEGNLRAEYLDDRNTFRHSVVVPY ----3333-------3333----------1111------1111----------------- EPPEVGSDCTTIHYNYMCYSSCMGGMNRRPILTIITLEDSSGNLLGRDSFEVRVCACPGR ---2222-----------1111---iiii---------1111------------------ DWRTEEEN -------- >FUCOLECTIN-RELATED PROTEI; SWP:Q97N96; PDB:2J22A; LSNIALTKETRQSSTDYNGFSRLAVDGNKNGDYGHHSVTHTKEDSPSWWEIDLAQTEELE --3333---------%%%%3333--------1111------------------------- KLIIYNRTDAEIQRLSNFDIIIYDSNDYEVFTQHIDSLESNNLSIDLKGLKGKKVRISLR ----------3333---------1111-------------------iiii---------- SAGIPLSLAEVEVYTYK 2222------------- >THIOREDOXIN; SWP:NA; PDB:2J23A; GSVQVISSYDQFKQVTGGDKVVVIDFWATWCGPCKMIGPVFEKISDTPAGDKVGFYKVDV ---------------------------11113333-----------3333--------33 DEQSQIAQEVGIRAMPTFVFFKNGQKIDTVVGADPSKLQAAITQHSA 33-------------------iiii------------------1111 >TRIOSEPHOSPHATE ISOMERASE; SWP:P04789; PDB:2J27A; SKPQPIAAANWKCNGSQQSLSELIDLFNSTSINHDVQCVVASTFVHLAMTKERLSHPKFV --------------------------1111------------3333---------1111- IAAQNAIAKSGAFTGEVSLPILKDFGVNWIVLGHSERRAYYGETNEIVADKVAAAVASGF ------------2222------1111-------3333------------------3333- MVIACIGETLQERESGRTAVVVLTQIAAIAKKLKKADWAKVVIAYEAVWAIGTGKVATPQ ------------1111------------3333-33331111-----3333---------- QAQEAHALIRSWVSSKIGADVAGELRILYGGSVNGKNARTLYQQRDVNGFLVGGASLKPE ---------------------------------3333-3333-1111-----3333-111 FVDIIKATQ 1----1111 >FIBER PROTEIN; SWP:Q65914; PDB:2J2JA; APITLWTGPGPSINGFINDTPVIRCFICLTRDSNLVTVNASFVGEGGYRIVSPTQSQFSL 
----------------%%%%------------------------!!!!---1111----- IMEFDQFGQLMSTGNINSTTTWGEKPWGNNTVQPRPSHTWKLCMPNREVYSTPAATISRC ----1111--------3333-----2222-------3333-------------------- GLDSIAVDGAPSRSIDCMLIINKPKGVATYTLTFRFLNFNRLSGGTLFKTDVLTFTYVGE ----1111-1111-----------!!!!---------1111------------------- NQ -- >CATALASE; SWP:A2A136; PDB:2J2MA; KKLTTNQGVPIGDNQNSRTAGRRGPTLLEDYQLIEKIAHFDRERVPERVVHARGFGAHGV ----1111------------------1111---------1111----------------- FKVKNSMKKYTKAAFLQEEGTEVPVFARFSTVIHGTHSPETLRDPRGFSVKFYTEEGNWD ------3333--3333-2222-------------2222---------------------- FVGNNLPVFFIRDAMKFPDMVHSLKPDPRTNIQDPDRYWDFMTLRPESTNMLMHIFTDEG ------------3333---------------------------------33331111333 IPASYRKMRGSSVHSFKWVNAHGNTVYIKLRWVPKEGVHNLSADEATEVQGKDFNHASND 3--1111------------1111----------1111----------------------- TFQAIENGDFPEWDLFVQVLDPADVENFDFDPLDATKDWFEDVIPFQHVGTMTLNKNVDN --------------------1111------1111-----3333----------------3 YFAETESVGFNPGVLVPGMLPSEDKLLQGRLFSYSDTQRHRIGPNYQQLPINCPFAQVNN 333-------1111-2222----3333-----------------33333333-------- YQRDGAMPFKQQTSSVNYEPNRYQDEPKQTPEYTEDTQPLHDDIHGRLEIEKTNNFGQAG ----------------------1111---3333--------------------------- EVYRRMTEEEQMALLNNLVNDLQQVRHENTVLLAICNFYRADASLGEKLSEALNVDIKPF --1111--------------3333--------------------------------3333 >ZINC FINGER PROTEIN HRX; SWP:Q03164; PDB:2J2SA; GGSVKKGRRSRRCGQCPGCQVPEDCGVCTNCLDKPKFGGRNIKKQCCKMRKCQNLQWMPS ---------------3333--------3333---1111--------1111---------- KAYLQKQAKAVK ------------ >CHAPERONE PROTEIN PAPD; SWP:P15319; PDB:2J2ZA; AVSLDRTRAVFDGSEKSMTLDISNDNKQLPYLAQAWIENENQEKIITGPVIATPPVQRLE -----------1111-----------------------1111-----------------2 PGAKSMVRLSTTPDISKLPQDRESLFYFNLREIPPRSEKANVLQIALQTKIKLFYRPAAI 222--------3333-----------------------------------------3333 KTRPNEVWQDQLILNKVSGGYRIENPTPYYVTVIGLGGSEKQAEEGEFETVMLSPRSEQT --2222-1111------------------------------------------------- VKSANYNTPYLSYINDYGGRPVLSFICNGSRCSVKKE 
--------------1111---------!!!!------ >PAP fimbrial minor pilin ; SWP:P07111; PDB:2J2ZB; RAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDLQNGFSGPERKFSLRLRNCEFNSQGGN ------------------1111-------------------------------------3 LFSDSRIRVTFDGVRGETPDKFNLSGQAKGINLQIADVRGNIARAGKVMPAIPLTGNEEA 333--------------1111----------------------2222------------- LDYTLRIVRNGKKLEAGNYFAVLGFRVDYE ------------------------------ >NADP-DEPENDENT OXIDOREDUC; SWP:Q39172; PDB:2J3HA; MTATNKQVILKDYVSGFPTESDFDFTTTTVELRVPEGTNSVLVKNLYLSCDPYMRIRMGK ------------------1111----------------------------3333-1111- QAYTPGQPIQGYGVSRIIESGHPDYKKGDLLWGIVAWEEYSVITPMTHAHFKIQHTDVPL ---2222--------------11112222-----------------------------11 SYYTGLLGMPGMTAYAGFYEVCSPKEGETVYVSAASGAVGQLVGQLAKMMGCYVVGSAGS 11----------------------2222-----1111-3333-----------------3 KEKVDLLKTKFGFDDAFNYKEESDLTAALKRCFPNGIDIYFENVGGKMLDAVLVNMNMHG 333-----------------------------1111--------!!!!----11112222 RIAVCGMISQYNLENQEGVHNLSNIIYKRNRIQGFVVSDFYDKYSKFLEFVLPHIREGKI ------1111----------3333-1111------33333333-----------1111-- TYVEDVADGLEKAPEALVGLFHGKNVGKQVVVVARE --------3333-3333--1111------------- >PROLYL-TRNA SYNTHETASE; SWP:Q831W7; PDB:2J3MA; MKQSKMLIPTLEVLSHQILLRAGYIRQVAAGIYSYLPLANRVLEKLKTIMREEFEKIDAV -3333--------------1111-----2222----------------------1111-- EMLMPALLPAELWKESGRYETYGPNLYRLKDRNDRDYILGPTHEETFTELIRDEINSYKR --------3333-11111111-3333----1111----------------------3333 LPLNLYQIQTKYRDEKRSRSGLLRGREFIMKDGYSFHADEASLDQSYRDYEKAYSRIFER -------------------!!!!-----------------------------------11 CGLEFRAIIGDGGKDSKEFMAISEIGEDTICYSTESDYAANLEMATSLYTPKKSHETQLD 11--------------------1111---------------1111--------------- LEKIATPEVGTIAEVANFFEVEPQRIIKSVLFIADEEPVMVLVRGDHDVNDVKLKNFLGA -----2222------------3333--------%%%%------1111------------- DFLDEATEEDARRVLGAGFGSIGPVNVSEDVKIYADLAVQDLANAIVGANEDGYHLTNVN -----------------3333------3333-------1111--------2222------ PDRDFQPISYEDLRFVQEGDPSPDGNGVLAFTKGIEIGHIFKLGTRYSDAMGATVLDENG 
-1111-----------2222-1111-------------------------------1111 REKSVIMGCYGIGVSRLLSAIVEQNADERGINWPTGIAPFDLHVVQMNVKDEYQTKLSQE --------------------------3333---2222----------1111--------- VEAMMTEAGYEVLVDDRNERAGVKFADADLIGCPIRITVGKKAVDGVVEVKIKRTGEMLE -------------------3333----------------1111---------1111---- VRKEELESTLSILM -3333--------- >Trafficking protein parti; SWP:Q5NCF2; PDB:2J3TC; TVHNLYLFDRNGVCLHYSEWHRKKQAGIPKEEEYKLMYGMLFSIRSFVSKMSPLDMKDGF --------1111----------------3333---------------------------- LSFQTSRYKLHYYETPTGIKVVMNTDLGVGPIRDVLHHIYSALYVEFVVKNPLCPLGQTV ----1111------1111-------3333------------------1111--------- QSELFRSRLDSYVRSLPFFSAR ---------------1111--- >Trafficking protein parti; SWP:Q9Y296; PDB:2J3TD; AIFSVYVVNKAGGLIYQLDSYEAEKTFSYPLDLLLKLHDERVLVAFGQRDGIRVGHAVLA --------1111-------------------------%%%%------------------- INGMDVNGRYTADGKEVLEYLGNPANYPVSIRFGRPRLTSNEKLMLASMFHSLFAIGSQL iiii--!!!!1111--------3333-------------------------------111 SPEQGSSGIEMLETDTFKLHCYQTLTGIKFVVLADPRQAGIDSLLRKIYEIYSDFALKNP 1------------1111------1111-------1111--------------------11 FYSLEMPIRCELFDQNLKLALEVAEKAGTFG 111111-----------------1111---- >TRAFFICKING PROTEIN PARTI; SWP:NA; PDB:2J3WA; EMSGSFYFVIVGHHDNPVFEMEFLPPGKAESKDDHRHLNQFIAHAALDLVDENMWLSNNM -----------1111------------------------------------3333----- YLKTVDKFNEWFVSAFVTAGHMRFIMLHDVRQEDGIKNFFTDVYDLYIKFAMNPFYEPNS -------!!!!------1111-------------------------------11112222 PIRSSAFDRKVQFLGKKHLLS ---3333-------------- >TRAFFICKING PROTEIN PARTI; SWP:NA; PDB:2J3WB; KTEVSVSAFALLFSEMVQYCQSRVYSVSELQARLADMGQGVGASLLDVLVMREKNGKRET ----3333---------------------------------------------iiii--- KVLNILLFIKVNVWKALFGKEADKLEQANDDDKTYYIIEKEPLINAYISVPKENSTLNCA ---------------------------1111----------3333--------------- AFTGGIVEAILTHSGFPAKVTVHWHKGTTLMIKFDESVIARDKALDGR -----------1111---------------------------1111-- >C2 TOXIN COMPONENT I; SWP:O69275; PDB:2J3XA; PIIKEPIDFINKPESEAKEWGKEEEKRWFTKLNNLEEVAVNQLKNKEYKTKIDNFSTDIL 
--------22223333------------1111---------------------------- FSSLTAIEIMKEDENQNLFDVERIREALLKNTLDRDAIGYVNFTPKELGINFSIRDVELD ----------1111-----------------------------3333----------%%% RDISDETLDKVRQQIINQEYTKFSFISLGLNDNSINESVPVIVKTRVPTTFDYGVLNDKE %-------------2222------------3333-1111------------------111 TVSLLLNQGFSIIPESAIITTIKGKDYILIEGSLSQELDFYNKGSEAWGAENYGDYISKL 1--------------------iiii--------------!!!!-3333----11111111 SHEQLGALEGYLHSDYKAINSYLRNNRVPNNDELNKKIELISSALSVKPIPQTLIAYRRV ----------------------1111-----------------1111------------- DGIPFDLPSDFSFDKKENGEIIADKQKLNEFIDKWTGKEIENLSFSSTSLKSTPSSFSKR -3333--1111-----iiii--------------2222---------------3333--- RFIFRLRLSEGAIGAFIYGFSGFQDEQEILLNKNSTFKIFRITPITSIINRVTKMTQVVI --------2222-------2222--------------------------1111------- DAEGIQNKEI ---------- >GUANYLATE KINASE; SWP:Q5HGM3; PDB:2J41A; EKGLLIVLSGPSGVGKGTVRKRIFEDPSTSYKYSISMTTRQMREGEVDGVDYFFKTRDAF ----------2222-----------3333-------------22222222---------- EALIKDDQFIEYAEYVGNYYGTPVQYVKDTMDEGHDVFLEIEVEGAKQVRKKFPDALFIF ---1111-------iiii-----------------------3333-------1111---- LAPPSKEVEMMNLYDYVVVNDEVELAKNRIQCIVEAEHLKRERVEAK -----1111-1111--------------------------------- >SPYDX; SWP:Q99XX8; PDB:2J43A; ASHHLRHFKTLPAGESLGSLGLWVWGDVDQPSKDWPNGAITTKAKKDDYGYYLDVPLAAK -----------22223333---------------------------3333---------- HRQQVSYLINNKAGENLSKDQHISLLTPKNEVWIDENYHAHAYRPLKEGYLRINYHNQSG ----------1111--------------------1111--------2222------3333 HYDNLAVWTFKDVKTPTTDWPNGLDLSHKGHYGAYVDVPLKEGANEIGFLILDKSKTGDA -2222-----------------------------------2222--------3333-333 IKVQPKDYLFKELDNHTQVFVKDTDPKVYNNPYYID 3----------3333--------------------- >ALKALINE AMYLOPULLULANASE; SWP:Q97SQ7; PDB:2J44A; DNYFRIHVKKLPEENKDAQGLWTWDDVEKPSENWPNGALSFKDAKKDDYGYYLDVKLKGE --------------3333----------------3333--1111--3333---------- QAKKISFLINNTAGKNLTGDKSVEKLVPKMNEAWLDQDYKVFSYEPQPAGTVRVNYYRTD --------------------------1111-----1111--------------------- 
GNYDKKSLWYWGDVKNPSSAQWPDGTDFTATGKYGRYIDIPLNEAAREFGFLLLDESKGD ------------------------------------------------------1111-- VKIRKENYKFTDLKNHSQIFLKDDDESIYTNPYYVHD ------------------------------------- >TWO-COMPONENT SENSOR KINA; SWP:Q9KHI5; PDB:2J48A; AGHILLLEEEDEAATVVCEMLTAAGFKVIWLVDGSTALDQLDLLQPIVILMAWPPPDQSC --------------------------------3333---3333----------3333--- LLLLQHLREHQADPHPPLVLFLGEPPVDPLLTAQASAILSKPLDPQLLLTTLQGLCPPN ----------------------------------------------------3333--- >URIDYLATE KINASE; SWP:Q97ZE2; PDB:2J4JA; MNIILKISGKFFDEDNVDNLIVLRQSIKELADNGFRVGIVTGGGSTARRYIKLAREIGIG -------333333333333-----------1111--------------------1111-- EAYLDLLGIWASRLNAYLVMFSLQDLAYMHVPQSLEEFIQDWSHGKVVVTGGFQPGQSTA --------------------1111-----------------1111--------------- AVAALVAEASSSKTLVVATNVDGVYEKDPRIYADVKLIPHLTTQDLRKILEELLDPLAIK -------1111----------------1111----------------------------- IVERSKIRVIVMNYRKLNRIIDILKGEEVSSIIEPV ------------33331111--1111---------- >MITOGEN-ACTIVATED PROTEIN; SWP:Q15750; PDB:2J4OA; SWTDDLPLCHLSGVGSASNRSYSADGKGTESHPPEDSWLKFRSENNCFLYGVFNGYDGNR 3333-------------------------------------------------------- VTNFVAQRLSAELLLGQLNAEHAEADVRRVLLQAFDVVERSFLESIDDALAEKASLQSQL ---------1111-------------------------------------------1111 PEGVPQHQLPPQYQKILERLKTLEREISGGAMAVVAVLLNNKLYVANVGTNRALLCKSTV ---------3333----------------------------------------------- DGLQVTQLNVDHTTENEDELFRLSQLGLDAGKIKQVGIICGQESTRRIGDYKVKYGYTDI ------------3333-------1111-----------------------1111-3333- DLLSAAKSKPIIAEPEIHGAQPLDGVTGFLVLMSEGLYKALEAAHGPGQANQEIAAMIDT --1111----------------2222-------------------2222----------3 EFAKQTSLDAVAQAVVDRVKRIHSDTFASGGERARFCPRHEDMTLLVRNFGYPLGE 333----------------------------3333--------------------- >ANGIOGENIN-4; SWP:Q80Z85; PDB:2J4TA; NERYEKFLRQHYDAKPQGRDDRYCESMMKERKLTSPCKDVNTFIHGTKKNIRAICGKKGS -------------------3333------------------------3333--------- PYGENFRISNSPFQITTCTHSRGSPWPPCGYRAFKDFRYIVIACEDGWPVHFDESFISP 
--------------------------------------------iiii----3333--- >Putative uncharacterized ; SWP:Q5XFY8; PDB:2J4WH; EVQLVESGGGLVKPGGSLKLSCAASGFIFSDYYMYWVRQTPEKRLEWVATISDGNSYTYY ------------2222-----------1111--------1111----------------- VDSVKGRFTISRDNAKNNLYLQMSSLKSEDTAIYYCARDGPTDSSGYGGFGYWGQGTLVT 3333---------1111---------3333----------3333---------------- VSEAKTTPPSVYPLAPGSAAQTNSMVTLGCLVKGYFPEPVTVTWNSGSLSSGVHTFPAVL --------------------------------------------iiii------------ QSDLYTLSSSVTVPSSPRPSETVTCNVAHPASSTKVDKKIVPRDC -----------------------------1111------------ >Putative uncharacterized ; SWP:Q5XFY8; PDB:2J4WL; SVLSQSPAILSASPGEKVTMTCRARSSVSYMHWYQQKSGSSPKPWIHATSNLASGVPARF ------------2222------------------------------------22223333 SGSGSGTSYSLTISRVEAEDAATYYCQQWSSHPPTFGSGTKLEIKRADAAPTVSIFPPSS ----------------1111---------------------------------------- EQLTSGGASVVCFLNNFYPKDINVKWKIDGSERQNGVLNSWTDQDSKDSTYSMSSTLTLT -----------------------------------------------------------3 KDEYERHNSYTCEATHKTSTSPIVKSFNRNEC 3333333--------1111------------- >Rho-GTPase activating pro; SWP:Q8NI19_HUMAN; PDB:2J59M; AAKEGWLHFRPLVPWKQMYVVLRGHSLYLYKDKREQPISVNACLIDISYSETKRKNVFRL -------------------------------3333------------------------- TTSDCECLFQAEDRDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNL -1111-------------------3333------------------------- >30S RIBOSOMAL PROTEIN S6; SWP:O66474; PDB:2J5AA; HYKTLRYYETVFAVKPTLSEEEMKKKFEQVKEFIKQKGGEILYEEDWGMRQLAYPIQKFN -------------------------------------------------------%%%%- NARYFLVQFKTENPQLPNELDFQLKIDEDVIRWLNIQIKESEVKKN ------------1111----------3333--------1111---- >ALR4455 PROTEIN; SWP:Q8YNV6; PDB:2J5GA; QPEYFTKYENLHFHRDENGILEVRMHTNGSSLVFTGKTHREFPDAFYDISRDRDNRVVIL -3333--1111----1111-------iiii---------------------1111----- TGSGDAWMAEIDFPSLGDVTNPREWDKTYWEGKKVLQNLLDIEVPVISAVNGAALLHSEY --!!!!-----3333--11113333-------------1111----------------33 ILTTDIILASENTVFQDMPHLNAGIVPGDGVHILWPLALGLYRGRYFLFTQEKLTAQQAY 
33-------1111-----1111-------3333--------------1111--------1 ELNVVHEVLPQSKLMERAWEIARTLAKQPTLNLRYTRVALTQRLKRLVNEGIGYGLALEG 111------1111-----------1111-------------------------------- ITATDLRNT ----3333- --------------------------------------- >P-HYDROXYCINNAMOYL COA HY; SWP:O69762; PDB:2J5IA; TYEGRWKTVKVEIEDGIAFVILNRPEKRNAMSPTLNREMIDVLETLEQDPAAGVLVLTGA -2222------------------3333---------------------1111-------- GEAWTAGMDLKEYFREVDAGPEILQEKIRREASQWQWKLLRMYAKPTIAMVNGWCFGGGF --------------------1111------------------------------------ SPLVACDLAICADEATFGLSEINWGIPPGNLVSKAMADTVGHRQSLMYIMTGKTFGGQKA -----------1111----3333-----!!!!---------------------------- AEMGLVNESVPLAQLREVTIELARNLLEKNPVVLRAAKHGFKRCRELTWEQNEDYLYAKL ----------3333-----------3333--------------11113333--------- DQSRLLDT -------- >APICAL MEMBRANE ANTIGEN 1; SWP:Q9BIM8; PDB:2J5LA; IFISDDKDSLKCPCDPEMVSQSTCRFFVCKCVER -----3333------------------------- >MREC PROTEIN; SWP:Q8Y6Y4; PDB:2J5UA; TENQHLKERLEELAQLESEVADLKKENKDLKESLDITDSIRDYDPLNASVISRNPTNWND --------------------------------------1111-----------3333--- QVEIDKGSSDGVKPDMAVTTPSGLIGKVTTTGAKSATVELLTSSDVKNRVSAKVQGKENA ------1111--2222---1111----------------1111-1111------------ FGIINGYDSDTKLLELKQLPYDMKFKKGQKVVTSGLGGKFPAGIFIGTIEKVETDKMGLS -------------------1111-----------1111----------------3333-- QTAFIKPGADMYDLNHVTVLKRSAEAGTTDD ------------------------------- >GLUTAMATE 5-KINASE; SWP:PROB_ECOLI; PDB:2J5VA; DSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGY --------------iiii------------------1111-------------------- PELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLR -----------------------------1111--------3333--------------- ALLDNNVVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQMSTKLQAADVA --1111-------3333-3333-------------------------------------- CRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGA ----------1111------------------------3333------------------ TAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQ 
-----------3333--------2222-----1111---------------------333 EIDAILGYEYGPVAVHRDDMITR 3-3333---------3333---- >CERULOPLASMIN; SWP:CERU_HUMAN; PDB:2J5WA; KEKHYYIGIIETTWDYASDHGEKKLISVDTEHSNIYLQNGPDRIGRLYKKALYLQYTDET ------------------------2222---1111------------------------- FRTTIEKPVWLGFLGPIIKAETGDKVYVHLKNLASRPYTFHSHGITYYKEHEGAIYPDNT -------1111------------------------------------3333--------- TDFQRADDKVYPGEQYTYMLLATEEQSPGEGDGNCVTRIYHSHIDAPKDIASGLIGPLII 33333333--2222--------3333------------------3333------------ CKKDSLDKEKEKHIDREFVVMFSVVDENFSWYLEDNIKTYCSEPEKVDKDNEDFQQSNRM ------%%%%2222-----------33331111---------3333-1111----1111- YSVNGYTFGSLSGLSMCAEDRVKWYLFGMGNEVDVHAAFFHGQALTNKNYRIDTINLFPA --iiii----------2222----------3333-----2222---%%%%-------222 TLFDAYMVAQNPGEWMLSCQNLNHLKAGLQAFFQVQECNKSSSKDNIRGKHVRHYYIAAE 2----------------------------------------------------------- EIIWNYAPSGIDIFTKENLTAPGSDSAVFFEQGTTRIGGSYKKLVYREYTDASFTNRKER ------3333-------1111--3333-----------------------3333------ GPEEEHLGILGPVIWAEVGDTIRVTFHNKGAYPLSIEPIGVRFNKNNEGTYYSPNVPPSA 33333333-----------------------------------1111-----------33 SHVAPTETFTYEWTVPKEVGPTNADPVCLAKMYYSAVDPTKDIFTGLIGPMKICKKGSLH 33-------------3333--1111------------3333-3333-------------1 ANGRQKDVDKEFYLFPTVFDENESLLLEDNIRMFTTAPDQVDKEDEDFQESNKMHSMNGF 111----------------33331111---------3333-1111----1111---iiii MYGNQPGLTMCKGDSVVWYLFSAGNEADVHGIYFSGNTYLWRGERRDTANLFPQTSLTLH %%%%------2222----------3333-----2222---%%%%-------2222----- MWPDTEGTFNVECLTTDHYTGGMKQKYTVNQCRRQSEDSTFYLGERTYYIAAVEVEWDYS ------------------------------------------------------------ PQREWEKELHHLQEQNVSNAFLDKGEFYIGSKYKKVVYRQYTDSTFRVPVERKAEEEHLG ------------------1111--------------------3333------11113333 ILGPQLHADVGDKVKIIFKNMATRPYSIHAHGVQTESSTVTPTLPGETLTYVWKIPERSG --------2222-------------------------------2222--------1111- AGTEDSACIPWAYYSTVDQVKDLYSGLIGPLIVCRRPNPRRKLEFALLFLVFDENESWYL -1111------------3333-3333--------------------------33331111 
DDNIKTYSDHPEKVNKDDEEFIESNKMHAINGRMFGNLQGLTMHVGDEVNWYLMGMGNEI ---------3333-1111----1111---iiiiiiii------2222----------111 DLHTVHFHGHSFQYKHRGVYSSDVFDIFPGTYQTLEMFPRTPGIWLLHCHVTDHIHAGME 1-----2222----2222---------2222----------------------------- TTYTVLQNE --------- >FICOLIN-3; SWP:O75636; PDB:2J5ZA; CQEGPRNCRELLSQGATLSGWYHLCLPEGRALPVFCDMDTEGGGWLVFQRRQDGSVDFFR -------------------------1111----------iiii----------------- SWSSYRAGFGNQESEFWLGNENLHQLTLQGNWELRVELEDFNGNRTFAHYATFRLLGEVD ----------3333-------------------------1111-------------3333 HYQLALGKFSEGTAGDSLSLHSGRPFTTYDADHDSSNSNCAVIVHGAWWYASCYRSNLNG -----------1111--3333------1111-------------------------1111 RYAVSEAAAHKYGIDWASGRGVGHPYRRVRMMLR ----3333-------1111-2222---------- >BTRK; SWP:Q4H4E6; PDB:2J66A; DQAEITALTKRFETPFYLYDGDFIEAHYRQLRSRTNPAIQFYLSLKANNNIHLAKLFRQW -------------------------------11111111----3333----------111 GLGVEVASAGELALARHAGFSAENIIFSGPGKKRSELEIAVQSGIYCIIAESVEELFYIE 1-------------------3333----------------1111---------------- ELAEKENKTARVAIRINPDKSFTAIKMGGVPRQFGMDESMLDAVMDAVRSLQFTKFIGIH ------------------------------------3333----------1111------ VYTGTQNLNTDSIIESMKYTVDLGRNIYERYGIVCECINLGGGFGVPYFEKALDIGKITR ------------------------------------------------------------ TVSDYVQEARDTRFPQTTFIIESGRYLLAQAAVYVTEVLYRKASKGEVFVIVDGGMHHHA -----------------------33331111------------iiii-------333333 ASPMEYIPLEKVTIAGPLCTPEDCLGKDVHVPALYPGDLVCVLNSGAYGLSFSPVHFLGH 33-----------------3333-----------2222-----------11111111--- PTPIEILKRNGSYELIRRKGTADDIVATQLQ --------iiii--------3333-1111-- >TOLL LIKE RECEPTOR 10; SWP:Q9BXR5; PDB:2J67A; LKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDSILICLYESYFDPGKSISENIVSFIEK -------------3333---------------------3333--3333---------111 SYKSIFVLSPNFVQNEWCHYEFYFAHHNHIILILLEPIPFYCIPTRYHKLKALLEKKAYL 1--------------33333333---------------3333-3333------------- EWPKDRRKCGLFWANLRAAIN ----3333------------- >BACTERIAL DYNAMIN-LIKE PR; SWP:NA; PDB:2J69A; 
QVATDRFIQDLERVAQVRSEMSVCLNKLAETINKAELAGDSSSGKLSLERDIEDITIASK -----------------------------------------------3333--------- NLQQGVFRLLVLGDMKRGKSTFLNALIGENLLPSDVNPCTAVLTVLRYGPEKKVTIHFND ------------------------------------------------------------ GKSPQQLDFQNFKYKYTIDPAEAKKLEQEKKQAFPDVDYAVVEYPLTLLQKGIEIVDSPG ---------------------------------3333--------3333----------- LNDTEARNELSLGYVNNCHAILFVMRASQPCTLGERRYLENYIKGRGLTVFFLVNAWDQV ------3333---3333--------3333--3333-------2222---------11113 RESLIDPDDVEELQASENRLRQVFNANLAEYCTVEGQNIYDERVFELSSIQALRRRLKNP 333--1111------------------3333-------3333----------------11 QADLDGTGFPKFMDSLNTFLTRERAIAELRQVRTLARLACNHTREAVARRIPLLEQDVNE 11-2222------------------------------------------3333----333 LKKRIDSVEPEFNKLTGIRDEFQKEIINTRDTQARTISESFRSYVLNLGNTFENDFLRYQ 3------3333-----------------------------------------3333---- PELNLFDFLSSGKREAFNAALQKAFEQYITDKSAAWTLTAEKDINAAFKELSRSASQYGA -----11113333----------------------------------------------- SYNQITDQITEKLTGKDVEEDNSPGWAKWAMGLLSAGFDWKNILLNYFTVIGIGGIITAV -----------------------3333-----------3333----3333------3333 TGILLGPIGFALLGLGVGFLQADQARRELVKTAKKELVKHLPQVAHEQSQVVYNAVKECF 3333-3333-3333---------------------------------------------- DSYEREVSKRINDDIVSRKSELDNLVKQKQTREINRESEFNRLKNLQEDVIAQLQKIEAA ----------------------------------3333---------------------- YSNLLAYYSHH -------1111 >PROTEIN TRM112; SWP:P53738; PDB:2J6AA; MKFLTTNFLKCSVKACDTSNDNFPLQYDGSKCQLVQDESIEFNPEFLLNIVDRVDWPAVL -3333-------3333-----------3333-----1111----------3333------ TVAAELGNNALPPTKPSFPSSIQELTDDDMAILNDLHTLLLQTSIAEGEMKCRNCGHIYY ---1111-------------3333------------------------------------ IKNGIPNLLLPPHLVH -iiii----------- >AFV3-109; SWP:NA; PDB:2J6BA; MLYILNSAILPLKPGEEYTVKAKEITIQEAKELVTKEQFTSAIGHQATAELLSSILGVNV ------------------------------------------------------------ PMNRVQIKVTHGDRILAFMLKQRLPEGVVVKTTEELEKIGYELWLFEIQ ---------2222-----------2222--------------------- >CONKUNITZIN-S2; SWP:NA; PDB:2J6DA; 
ARPKDRPSYCNLPADSGSGTKPEQRIYYNSAKKQCVTFTYNGKGGNGNNFSRTNDCRQTC -------3333----------------------------------------3333----- QYPVG ----- >CD2-ASSOCIATED PROTEIN; SWP:Q9Y5K6; PDB:2J6FA; VDYIVEYDYDAVHDDELTIRVGEIIRNVKKLQEEGWLEGELNGRRGMFPDNFVKEIKR ------------1111---2222---------2222----%%%%----1111------ >ALDEHYDE DEHYDROGENASE FA; SWP:P49419; PDB:2J6LA; TLLINQPQYAWLKELGLREENEGVYNGSWGGRGEVITTYCPANNEPIARVRQASVADYEE -33331111---1111-------------------------------------------- TVKKAREAWKIWADIPAPKRGEIVRQIGDALREKIQVLGSLVSLEMGKILVEGVGEVQEY ----------3333---------------------------------------------- VDICDYAVGLSRMIGGPILPSERSGHALIEQWNPVGLVGIITAFNFPVAVYGWNNAIAMI ---------1111---------2222---------------------------------- CGNVCLWKGAPTTSLISVAVTKIIAKVLEDNKLPGAICSLTCGGADIGTAMAKDERVNLL ---------1111---------------1111-1111----------------1111--- SFTGSTQVGKQVGLMVQERFGRSLLELGGNNAIIAFEDADLSLVVPSALFAAVGTAGQRC ----------------1111---------------11113333----------%%%%-11 TTARRLFIHESIHDEVVNRLKKAYAQIRVGNPWDPNVLYGPLHTKQAVSMFLGAVEEAKK 11------3333-----------1111---1111------------------------11 EGGTVVYGGKVMDRPGNYVEPTIVTGLGHDASIAHTETFAPILYVFKFQNEEEVFAWNNE 11-------------------------11113333------------------------- VKQGLSSSIFTKDLGRIFRWLGPKGSDCGIVNVNIPTSGAEIGGAFGGEKHTGGGRESGS ---------------------1111----------1111---------!!!!-------- DAWKQYMRRSTCTINYS 3333------------- >PULLULANASE; SWP:O33840; PDB:2J73A; FTETTIVVHYHRYDGKYDGWNLWIWPVEPVSQEGKAYQFTGEDDFGKVAVVKLPMDLTKV -----------1111-2222----------------------1111-------------- GIIVRLNEWQAKDVAKDRFIEIKDGKAEVWILQGVEEIFYEKP ----------------------%%%%-----2222-------- >BETA-GLUCOSIDASE A; SWP:Q08638; PDB:2J78A; VKKFPEGFLWGVATASYQIEGSPLADGAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWK ----1111------3333---1111-------------22222222----!!!!------ EDIEIIEKLGVKAYRFSISWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHW ------------------1111-1111-------------------1111---------- DLPFALQLKGGWANREIADWFAEYSRVLFENFGDRVKNWITLNEPWVVAIVGHLYGVHAP 
--333311113333---------------------------------------------- GMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEPASEKEEDIRAVRFM ------------------------3333-1111--------------------------- HQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKDDMSEIQEKIDFVGLNYYSGHLVKF -------------------------3333---3333--3333------------------ DPDAAKVSFVERDLPKTAMGWEIVPEGIYWILKKVKEEYNPPEVYITENGAAFDDVVSED 1111------------1111---3333------------------------------111 GRVHDQNRIDYLKAHIGQAWKAIQEGVPLKGYFVWSLLDNFEWAEGYSKRFGIVYVDYST 1---------------------1111---------------!!!!--------------- QKRIVKDSGYWYSNVVKNNGLED ----------------------- >CYTOCHROME C NITRITE REDU; SWP:Q72EF3; PDB:2J7AA; GCSDVSTELKTPVYKTKLTAEEIRNSAFKPEFPKQYASYERNDETTVMTEYKGSVPFNKN -----------------------3333-3333--------3333----1111-----111 DNVNPLPEGYRHAQPYLKNLWLGYPFMYEYREARGHTYAIQDFLHIDRINRYAEKGGLPA 1------------11113333--1111-------3333---11113333----------- TCWNCKTPKMMEWVKESGDGFWAKDVNEFRDKIDMKDHTIGCATCHDPQTMELRITSVPL 3333------------!!!!111111111111--------3333---------------- TDYLVSQGKDPKKLPRNEMRALVCGQCHVEYYFNGPTMGVNKKPVFPWAEGFDPADMYRY ----1111-1111-3333--3333----------1111-2222----1111-3333---- YDKHGDLQVKGFEGKFADWTHPASKTPMIKAQHPEYETWINGTHGAAGVTCADCHMSYTR --------2222---------------------3333-1111-3333--3333------- SDDKKKISSHWWTSPMKDPEMRACRQCHSDKTPDYLKSRVLFTQKRTFDLLLAAQEVSVK -------------111111111111--3333----------------------------- AHEAVRLANEYQGAKAAGYDDLMIQAREMVRKGQFFWDYVSAENSVGFHNPAKALDTLAQ -------1111----1111--------------------3333-iiii------------ SQQFSQKAIDLAMEATQYGIGKDLSGDIKTIVPPILKMNRKLQQDPEFMKTHKWFQYLPV -----------------1111-----3333--------3333---3333--1111----- LPKADQVWDGQKRLV --------!!!!--- >NapC/NirT cytochrome c fa; SWP:Q72EF4; PDB:2J7AC; KLVLGGATLGVVALATVAFGMKYTDQRPFCTSCHIMNPVGVTHKLSGHANISCNDCHAPH --1111--1111--------------3333--3333---------1111--1111----- NLLAKLPFKAIAGARDVYMNTLGHPGDLILAGMETKEVVNANCKACHTMTNVEVASMEAK 3333-------------------------------------------3333---1111-- KYCTDCHRNVQHMRMKPISTREVAD 
-1111-1111%%%%--3333----- >RNA-DEPENDENT RNA POLYMER; SWP:Q9Y7G6; PDB:2J7NA; HAPVVAARLRNIWPKFPKWLHEAPLAVAWEVTRLFMHCKVDLEDESLGLKYDPSWSTARD ----------------3333-------------------------------3333----3 VTDIWKTLYRLDAFRGKPFPEKPPNDVFVTAMTGNFESKGSAVVLSAVLDYNPDNSPTAP 333--------------------3333------%%%%!!!!------------------- LYLVKLKPLMFEQGCRLTRRFGPDRFFEILIPSPTSTSPSVPPVVSKQPAAVEEVIQWLT ---------------------1111-------1111--------------3333------ MGQHSLVGRQWRAFFAKDAGPKPIIKERVHFFAETGITFRPDVFQRTEFKVSQMLDWLLQ -----iiii--------------------------------------------------3 LDNNTWQPHLKLFSRIQLGLSKTYAIMTLEPHQIRHHKTDLLSPSGTGEVMNDGVGRMSR 3333333-------3333-----------3333---------1111-----2222----- SVAKRIRDVLGLGDVPSAVQGRFGSAKGMWVIDVDDTGDEDWIETYPSQRKWECDFVDKH ----------------------!!!!------1111---------3333--------333 QRTLEVRSVASELKSAGLNLQLLPVLEDRARDKVKMRQAIGDRLINDLQRQFSEQKHALN 3-----------------3333-------------------------------------- RPVEFRQWVYESYSSRATRVSHGRVPFLAGLPDSQEETLNFLMNSGFDPKKQKYLQDIAW 3333-----1111--------------!!!!-----------1111-1111--------- DLQKRKCDTLKSKLNIRVGRSAYIYMIADFWGVLEENEVHVGFSSKFRDEEESFTLLSDC ------------------------------------------------!!!!-------- DVLVARSPAHFPSDIQRVRAVFKPELHSLKDVIIFSTKGDVPLAKKLSGGDYDGDMAWVC ----------1111--------3333---------------3333-%%%%---------- WDPEIVDGFVNAEMPLEPDLSRYLKKDKTTFKQLMASHGTGSAAKEQTTYDMIQKSFHFA -33331111----------3333-----------1111--------------------11 LQPNFLGMCTNYKERLCYINNSVSNKPAIILSSLVGNLVDQSKQGIVFNEASWAQLRREL 11---------------------------------11113333----------------- LGGALSLPDPMYKSDSWLGRGEPTHIIDYLKFSIARPAIDKELEAFHNAMKAAKDTEDGA ---------1111---------------------------1111---------------- HFWDPDLASYYTFFKEISDKSRSSALLFTTLKNRIGEVEKEYGRDPYPVRVNQVYEKWCA ---3333--------------------------------------------------111 ITPSKVIRLLELSFLADREMNTWALLRASTAFKLYYHKSPKFVWQMAGRQLAYIKAQMTS 1----------3333-1111------------------------------------1111 RPGEGAPALMTAFMYAGLMPDKKFTKQYVARLEGD 2222------33331111------------1111- >UBIQUITIN; 
SWP:P68198; PDB:2J7QA; KIVRASRDQSAPVYGPRAGSQCSNCFTFLHTCYLGIDPVLDTTSLDAVLDSGARLDAIAD -------11111111--------------------3333--------------------- EKVKRQALTDHPYRLGTEIPTVIETPAGITGHALSRPFNGTAETQDLGGYKCLGILDFLT ---3333-------3333------3333------------------iiii---------- YARGKPLPVYIIVTVGVHTRGVIVARGATYVFDPHTTDLSAEAAVYVCDDFTEAISALSF -1111---------!!!!------1111--------1111----------------1111 FTEIGDFYYDAVLVYFTRCRTTLISPSELLVQIDQYKDPDIDASVS ---1111------------------------------11113333- >SERINE/THREONINE-PROTEIN ; SWP:O94804; PDB:2J7TA; HVRRDLDPNEVWEIVGELGDGAFGKVYKAKNKETGALAAAKVIEEELEDYIVEIEILATC ------1111----------1111---------------------3333----------- DHPYIVKLLGAYYHDGKLWIMIEFCPGGAVDAIMLELDRGLTEPQIQVVCRQMLEALNFL -1111-------------------3333-------------------------------- HSKRIIHRDLKAGNVLMTLEGDIRLADFGVSAKNLKTLQKIGTPYWMAPEVVMCETMKDT 1111------3333---3333---------------------3333--------1111-- PYDYKADIWSLGITLIEMAQIEPPHHELNPMRVLLKIAKSDPPTLLTPSKWSVEFRDFLK --3333----------------2222--------------------3333---------- IALDKNPETRPSAAQLLEHPFVSSITSNKALRELVAEAKAEVMEE -----3333--3333------1111--3333---------3333- >RNA DEPENDENT RNA POLYME; SWP:A1XTB9; PDB:2J7UA; MDVIGERIKRIKEEHNSTWHYDDENPYKTWAYHGSYEVSSMINGVVKLLTKPWDVVPMVT 3333---------------------------------------------3333------- QMAMTDTTPFGQQRVFKEKVDTRTPRPLPGTRKVMEITAEWLWRTLGRNKRPRLCTREEF -----------3333-----------------------------1111------------ TKKVRTNDSAKAAVEDEEFWKLVDRERELHKLGKCGSCVSRAIWYMWLGARYLEFEALGF -----------------------------1111--------------------------- LNEDHWFSRENSYSGVEGEGLHKLGYILRDISKIPGGAMYADDTAGWDTRITEDDLHNEE -1111---3333---22223333-------1111-----------3333----------- KIIQQMDPEHRQLANAIFKLTYQNKVVKVQRPTPTGTVMDIISRKDQRGSGQVGTYGLNT --1111---------------------------------------------2222----- FTNMEAQLVRQMEGEGVLTKADLENPHLLEKKITQWLETKGVERLKRMAISGDDCVVKPI ------------1111----------------------------1111--!!!!------ DDRFANALLALNDMGKVRKDIPQWQPSKGWHDWQQVPFCSHHFHELIMKDGRKLVVPCRP 
-3333--3333----------1111-------1111-iiii------1111--------3 QDELIGRARISQGAGWSLRETACLGKAYAQMWSLMYFHRRDLRLASNAICSAVPVHWVPT 333---1111-------------------------1111--------------1111--- SRTTWSIHAHHQWMTTEDMLTVWNRVWIEENPWMEDKTPVTTWENVPYLGKREDQWCGSL -----1111-1111-------------1111----------3333---------1111-2 IGLTSRATWAQNIPTAIQQVRSLIGNEEFLDYM 222------------------------------ >ESTROGEN RECEPTOR BETA; SWP:Q62986; PDB:2J7YA; TLSPEQLVLTLLEAEPPNVLVSFTEASMMMSLTKLADKELVHMIGWAKKIPGFVELSLLD -----------1111----------------------------------2222------- QVRLLESCWMEVLMVGLMWRSIDHPGKLIFAPDLVLDRDEGKCVEGILEIFDMLLATTSR ------------------1111-2222---2222--3333---2222------------- FRELKLQHKEYLCVKAMILLNSSMYPLAESSRKLTHLLNAVTDALVWVIAKSGISSQQQS --------------------3333------------------------------------ VRLANLLMLLSHVRHISNKGMEHLLSMKCKNVVPVYDLLLEMLNAH ------------------------------------------3333 >STROMAL CELL-DERIVED FACT; SWP:P48061; PDB:2J7ZA; KPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIVARLKNNNRQVCIDPKLKWIQE -3333-------------3333--------1111-----------------1111----- YLEKALNK ---1111- >STIV B116; SWP:Q6Q0K9; PDB:2J85A; GKVFLTNAFSINLKEFPTTITIDKLDEEDFCLKLELRLEDGTLINAIGHDSTINLVNTLC -------------------------------------1111------------------- GTQLQKNRVEVKNEGDEALIIISQRLEEGKVLSDKEIKDYRQGKISFYEVWHHH ------------1111-----------------------1111----------- >METHIONINE SULFOXIDE REDU; SWP:Q6QPJ4; PDB:2J89A; PTIPQGPDDDLPAPGQQFAQFGAGCFWGVELAFQRVPGVTKTEVGYTQGLLHNPTYEDVT -3333-------2222-------------------2222---------------3333-- GTTNHNEVVRVQYDPKECSFDTLIDVLWARHDPTTLNRQGNDVGTQYRSGIYYYTPEQEK -------------1111-3333----------------!!!!------------------ AAKESLERQQKLLNRKIVTEILPAKKFYRAEEYHQQYLAKGGRFGFMQSAEKGCNDPIRC ---------1111-----------------3333-3333-----------2222----11 YG 11 >HISTONE-LYSINE N-METHYLTR; SWP:P38827; PDB:2J8AA; SCEIVVYPAQDSTTTNIQDISIKNYFKKYGEISHFEAFNDPNSALPLHVYLIKYANDAAK ---------------------------------------------------------333 AAFSAVRKHESSGCFIMGFKFEVILNKHSILNNIISKFVEINVKKLQKLQENLK 
3-------------------------%%%%------------------------ >NP275-NP276; SWP:NA; PDB:2J8KA; HMDVEKLRQLYAAGERDFSIVDLRGAVLENINLSGAILHGAMLDEANLQQANLSRADLSG -----------------2222-2222-2222-2222-2222-2222------2222-222 ATLNGADLRGANLSKADLSDAILDNAILEGAILDEAVLNQANLKAANLEQAILSHANIRE 2-2222-2222-2222-2222-2222-2222-2222-2222-2222-2222-2222-222 ADLSEANLEAADLSGADLAIADLHQANLHQAALERANLTGANLEDANLEGTILEGG 2-2222-2222-2222-2222-2222-2222-2222-2222-2222-2222-2222 >ACETYLTRANSFERASE PA4866 ; SWP:Q9HUU7; PDB:2J8MA; SASIRDAGVADLPGILAIYNDAVGNTTAIWNETPVDLANRQAWFDARARQGYPILVASDA -------3333------------------------3333--------1111-------11 AGEVLGYASYGDWRPFEGFRGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVA 11-------------3333----------1111--------------------------- AIESGNAASIGLHRRLGFEISGQMPQVGQKFGRWLDLTFMQLNLDPTRSAP --1111-------1111------------%%%%-----------1111--- >CLEAVAGE STIMULATION FACT; SWP:P33240; PDB:2J8PA; HMTPQDHEKAALIMQVLQLTADQIAMLPPEQRQSILILKEQIQKSTGAP --3333--3333---------------11113333----3333------ >CLEAVAGE AND POLYADENYLAT; SWP:O43809; PDB:2J8QA; SMYIQQTKPLTLERTINLYPLTNYTFGTKEPLYEKDSSVAARFQRMREEFDKIGMRRTVE --------3333-------1111------------------------------------- GVLIVHEHRLPHVLLLQLGTTFFKLPGGELNPGEDEVEGLKRLMTEILGRQDQDWVIDDC ------%%%%--------------------1111-------------------------- IGNWWRPNFEPPQYPYIPAHITKPKEHKKLFLVQLQEKALFAVPKNYKLVAAPLFELYDN -----------------2222----------------------1111-----3333---- APGYGPIISSLPQLLSRFNFIYN 3333--3333----3333----- >ACRIFLAVINE RESISTANCE PR; SWP:P31224; PDB:2J8SA; MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 3333--------------------------------------------2222-------- VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ -----1111-------------------------2222---------------3333-33 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 331111-----------------1111---------------------2222-------- QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL -------------1111---------------------------2222------------ 
TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL -------------3333---3333----------------iiii---------2222--- DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ ------------3333-2222--------------------------------------- NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM --33333333-3333--------1111--------------------------------- AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL ------3333----------------------3333-----3333--------------- SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR --------------------2222-----------------------------3333-11 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 11------------------------------------2222------------------ KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD -3333----------3333-1111--------3333---1111----------------- AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG -------------3333-------------------------------3333-------- LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR -------------------------------------------iiii--------3333- MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA -1111-------1111---3333---------------iiii---------1111----- MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS ------3333---------!!!!---------3333-------------1111--3333- VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL ----------------1111---3333--------------------------------- IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF ---------------------------1111------------------------3333- FVPVFFVVVRRRFSRKNEDIEHSH ------------------------ >ACRIFLAVINE RESISTANCE PR; SWP:NA; PDB:2J8SD; GSDLGKKLLEAARAGRDDEVRILMANGADVNAADVVGWTPLHLAAYWGHLEIVEVLLKNG -----------------------1111-1111-1111-------1111------------ ADVNAYDTLGSTPLHLAAHFGHLEIVEVLLKNGADVNAKDDNGITPLHLAANRGHLEIVE ------1111-3333--1111--------1111-1111-1111-3333--1111-3333- VLLKYGADVNAQDKFGKTAFDISINNGNEDLAEILQ 
-------1111-1111-3333--------------- >URACIL-DNA GLYCOSYLASE; SWP:Q777D9; PDB:2J8XA; ENLLLPDLWLDFLQLSPIFQRKLAAVIACVRRLRTQATVYPEEDMCMAWARFCDPSDIKV -------------------------------3333------1111--1111--1111--- VILGQDPYHGGQANGLAFSVAYGFPVPPSLRNIYAELHRSLPEFSPPDHGCLDAWASQGV --------------------2222----------------1111-------33331111- LLLNTILTVQKGKPGSHADIGWAWFTDHVISLLSERLKACVFMLWGAKAGDKASLINSKK ---------2222-1111--3333----------------------3333-3333-3333 HLVLTSQHPSPLAQNSTRKSAQQKFLGNNHFVLANNFLREKGLGEIDWRL ---------3333---------------3333------------------ >PHYCOERYTHROCYANIN ALPHA ; SWP:P00309; PDB:2J96A; MKTPLTEAIAAADLRGSYLSNTELQAVFGRFNRARAGLEAARAFANNGKKWAEAAANHVY --3333-------------3333---1111------------------------------ QKFPYTTQMQGPQYASTPEGKAKCVRDIDHYLRTISYCCVVGGTGPLDDYVVAGLKEFNS --3333----------3333------------------------3333-----3333333 ALGLSPSWYIAALEFVRDNHGLTGDVAGEANTYINYAINALS 3---3333---------------------------------- >CHLORIDE CHANNEL PROTEIN ; SWP:P51795; PDB:2J9LA; HKTLAMDVMKPRRNDPLLTVLTQDSMTVEDVETIISETTYSGFPVVVSRESQRLVGFVLR ---3333----------------------------------------3333--------- RDLIISIENARKKQDGVVSTSIIYFTEHSPPLPPYTPPTLKLRNILDLSPFTVTDLTPME ---------1111----1111--------------------1111--------1111333 IVVDIFRKLGLRQCLVTHNGRLLGIITKKDVLKHIAQMANFNEFLEV 3----------------iiii--------------------3333-- >THYMIDINE KINASE; SWP:NA; PDB:2J9RA; HMYLINQNGWIEVICGSMFSGKSEELIRRVRRTQFAKQHAIVFKPCVKAVPVSASKDIFK --------------------------------3333-----------------3333333 HITEEMDVIAIDEVQFFDGDIVEVVQVLANRGYRVIVAGLDQDFRGLPFGQVPQLMAIAE 3-3333------3333-3333-------1111----------1111--!!!!-------- HVTKLQAVCSACGSPASRTQRLIDGEPAAFDDPIILVGASESYEPRCRHCHAVPTKQ ----------------------iiii--1111------1111----3333------- >VACUOLAR PROTEIN SORTING-; SWP:Q02767; PDB:2J9UA; FNAKYVAEATGNFITVMDALKLNYNAKDQLHPLLAELLISINRVTRDDFENRSKLIDWIV -------------------1111--3333-------------------2222-------- RINKLSIGDTLTETQIRELLFDLELAYKSFYALL -1111----------------------------- >Vacuolar protein-sorting-; 
SWP:Q06696; PDB:2J9UB; VSTWVCPICMVSNETQGEFTKDTLPTPICINCGVPADYELTKSSINC -------------------1111-------------33333333--- >VPS28-PROV PROTEIN; SWP:Q28GA6; PDB:2J9WA; HHMGNLNRCIADIVSLFITVMDKLRLEIRAMDEIQPDLRELMETMNRMSHLPPDFEGREK -----------------------1111--1111--------------33331111----- VSQWLQKLSSMSASDELDDSQVRQMLFDLESAYNAFNRFLH -------11111111-------------------------- >T-CELL SURFACE GLYCOPROTE; SWP:A0N0P4; PDB:2JA4A; AFQPKVQSRLVGGSSICEGTVEVRQGAQWAALCDSSSARSSLRWEEVCREQQCGSVNSYR --------------1111------------------------------------------ VLDAGDPTSRGLFCPHQKLSQCHELWERNSYCKKVFVTCQD --2222-----------1111-------------------- >EXOSOME COMPLEX EXONUCLEA; SWP:Q08285; PDB:2JA9A; KRYIPSVNDFVIGVIIGTFSDSYKVSLQNFSSSVSLSYMAFPNASKKNRPTLQVGDLVYA -----2222---------------------------1111----3333----2222---- RVCTAEKELEAEIECFDSTTGRDAGFGILEDGMIIDVNLNFARQLLFNNDFPLLKVLAAH -----2222-------------iiii---------------------1111--------- TKFEVAIGLNGKIWVKCEELSNTLACYRTIMECCQKNDTAAFKDIAKRQFKEILT -------1111--------------------------3333------1111---- >GLUTAREDOXIN-1; SWP:P25373; PDB:2JACA; MVSQETIKHVKDLIAENEIFVASKTYCPYSHAALNTLFEKLKVPRSKVLVLQLNDMKEGA -------------1111------1111----------------3333----33331111- DIQAALYEINGQRTVPNIYINGKHIGGNDDLQELRETGELEELLEPIL -------------------iiii----------------33333333- >L-AMINO ACID OXIDASE; SWP:Q8VPD4; PDB:2JAEA; DLIGKVKGSHSVVVLGGGPAGLCSAFELQKAGYKVTVLEARTRPGGRVWTARGGSEETDL ----------------------------1111------------!!!!---2222---11 SGETQKCTFSEGHFYNVGATRIPQSHITLDYCRELGVEIQGFGNQNANTFVNYQSDTSLS 11-------2222---------1111-------------------1111-------1111 GQSVTYRAAKADTFGYMSELLKKATDQGALDQVLSREDKDALSEFLSDFGDLSDDGRYLG ---------------------------1111----------------1111-1111---- SSRRGYDSEPGAGLNFGTEKKPFAMQEVIRSGIGRNFSFDFGYDQAMMMFTPVGGMDRIY 3333-------!!!!----------------11113333--1111------2222----- YAFQDRIGTDNIVFGAEVTSMKNVSEGVTVEYTAGGSKKSITADYAICTIPPHLVGRLQN -------3333------------1111------iiii-------------33333333-- 
NLPGDVLTALKAAKPSSSGKLGIEYSRRWWETEDRIYGGASNTDKDISQIMFPYDHYNSD ---------3333-------------------------------3333-------2222- RGVVVAYYSSGKRQEAFESLTHRQRLAKAIAEGSEIHGEKYTRDISSSFSGSWRRTKYSE ----------11111111-------------------3333----------33332222- SAWANWAGSATPEYEKLLEPVDKIYFAGDHLSNAIAWQHGALTSARDVVTHIHERVAQE -----2222------3333-!!!!---3333--2222---------------------- >CLAVULANIC ACID DEHYDROGE; SWP:Q9LCV7; PDB:2JAHA; SALQGKVALITGASSGIGEATARALAAEGAAVAIAARRVEKLRALGDELTAAGAKVHVLE 1111---------------------1111--------------------1111------- LDVADRQGVDAAVASTVEALGGLDILVNNAGILLGPVEDADTTDWTRIDTNLLGLYTRAA -1111-------------------------------2222-----------------333 LPHLLRSKGTVVQSSIAGRVNVRNAAVYQATKFGVNAFSETLRQEVTERGVRVVVIEPGT 3-------------3333---2222--------------------3333----------- TDTELRGHITHTATKEYEQRISQIRKLQAQDIAEAVRYAVTAPHHATVHEIFIRPTDQV ---3333---3333---3333------3333-----------1111--------1111- >NG,NG-DIMETHYLARGININE DI; SWP:Q5VWX2; PDB:2JAJA; AFGRATHAVVRALPESLGQHALRSGEEVDVARAERQHQLYVGVLGSKLGLQVVELPADES 2222---------3333----------------------------------------111 LPDCVFVEDVAVVCEETALITRPGAPSRRKEVDMMKEALEKLQLNIVEMKDENATLDGGD 1-33333333---!!!!-------3333-----------1111-------1111---111 VLFTGREFFVGLSKRTNQRGAEILADTFKDYAVSTVPVADGLHLKSFCSMAGPNLIAIGS 1-----------1111-----------3333-------iiii1111-----2222----- SESAQKALKIMQQMSDHRYDKLTVPDDIAANCIYLNIPNKGHVLLHRTPEEYPESAKVYE -----------1111----------3333------------------3333-3333---- KLKDHMLIPVSMSELEKVDGLLTCCSVLINKK -1111------3333-----3333-------- >SERINE/THREONINE-PROTEIN ; SWP:Q13362; PDB:2JAKA; IRDVPADQEKLFIQKLRQCCVLFDFVSDPLSDLKWKEVKRAALSEMVEYITHNRNVITEP ---------------------------------------------------------333 IYPEVVHMFAVNMFRTLPPPTLEAAWPHLQLVYEFFLRFLESPDFQPNIAKKYIDQKFVL 3---------------------1111---------------111133333333------- QLLELFDSEDPRERDFLKTTLHRIYGKFLGLRAYIRKQINNIFYRFIYETEHHNGIAELL ---------------------------3333----------------------------- EILGSIINGFALPLKEEHKIFLLKVLLPLHKVKSLSVYHPQLAYCVVQFLEKDSTLTEPV 
------1111----3333-------------1111-----------------3333---- VMALLKYWPKTHSPKEVMFLNELEEILDVIEPSEFVKIMEPLFRQLAKCVSSPHFQVAER ------------3333---------3333-3333--------------1111-3333--- ALYYWNNEYIMSLISDNAAKILPIMFP -3333------------11113333-- >POLYPEPTIDE; SWP:Q96NX5; PDB:2JAMA; NIRKTFIFMEVLGSGAFSEVFLVKQRLTGKLFALKCIKKSSLENEIAVLKKIKHENIVTL 3333-------------------------------------------------1111--- EDIYESTTHYYLVMQLVSGGELFDRILERGVYTEKDASLVIQQVLSAVKYLHENGIVHRD -----1111------------------------------------------1111----- LKPENLLYLTPEENSKIMITDFGLSKMEQNGIMSTACGTPGYVAPEVLAQKPYSKAVDCW -3333------1111-------1111----1111---------1111------------- SIGVITYILLCGYPPFYEETESKLFEKIKEGYYEFESPFWDDISESAKDFICHLLEKDPN -------------2222----------------------1111--------------333 ERYTCEKALSHPWIDGNTALHRDIYPSVSLQIQKNFAKS 3--3333---3333----------3333-------1111 >DEOXYGUANOSINE KINASE; SWP:Q93IG4; PDB:2JAQA; MKIAIFGTVGAGKSTISAEISKKLGYEIFKEPVEENPYFEQYYKDLKKTVFKMQIYMLTA -------2222--------------------33331111-33333333------------ RSKQLKNIIFDRTLLEDPIFMKVNYDLNNVDQTDYNTYIDFYNNVVLENKLSFDIVIYLR ------------3333--------1111-----------------3333----------- VSTKTAISRIKKRGRSEELLIGEEYWETLNKNYEEFYKQNVYDFPFFVVDAELDVKTQIE --------------3333---3333--------------1111----------------- LIMNKLNSI -----1111 >SERINE/THREONINE-PROTEIN ; SWP:P51955; PDB:2JAVA; SRAEDYEVLYTIGTGSYGRCQKIRRKSDGKILVWKELDYGSMTEAEKQMLVSEVNLLREL -1111---------3333--------------------11113333--------1111-- KHPNIVRYYDRIIDTLYIVMEYCEGGDLASVITKGTKERQYLDEEFVLRVMTQLTLALKE -1111-----------------1111---------------------------------- CHRRSDLKPANVFLDGKQNVKLGDFGLARILGTPYYMSPEQMNRMSYNEKSDIWSLGCLL -------3333------------------------------------3333--------- YELCALMPPFTAFSQKELAGKIREGKFRRIPYRYSDELNEIITRMLNLKDYHRPSVEEIL -------------3333-------------3333---------1111-3333-------- ENPLILEHHHHHH -33333333---- >PHOSPHATE REGULON TRANSCR; SWP:P0AFJ5; PDB:2JBAA; ARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLAWMLPGGSG 
---------------------1111------------1111-------------2222-- IQFIKHLRRESMTRDIPVVMLTARGEEEDRVRGLETGADDCITKPFSPKELVARIKAVMR -----11113333-----------------2222-------------------------- RISPM ----- >HHGP; SWP:Q9NRG1; PDB:2JBHA; EAPDYGRGVVIMDDWPGYDLNLFTYPQHYYGDLEYVLIPHGIIVDRIERLAKDIMKDIGY ---3333----1111---3333---3333----------------------------333 SDIMVLCVLKGGYKFADLVEHLKNISRNSDRFVSMKVDFIRLKMQIIGGDDLSTLAGKNV 3--------1111-----------------------------------------2222-- LIVEDVVGTGRTMKALLSNIEKYKPNMIKVASLLVKRTRSDGFRPDYAGFEIPNLFVVGY ----------------------------------------------------------ii ALDYNEYFRDLNHICVINEHGKEKYRV ii-iiii1111-----------1111- >P-HYDROXYPHENYLACETATE HY; SWP:Q6Q272; PDB:2JBRA; RLVYTHAQTPDVSGVSMLEKIQQILPQIAKNAESAEQLRRVPDENIKLLKEIGLHRAFQP -----------------------------------------3333-------3333---3 KVYGGLEMSLPDFANCIVTLAGACAGTAWAFSLLCTHSHQIAMFSKQLQDEIWLKDPDAT 333----------------3333-----------------1111-----------1111- ASSSIAPFGKVEEVEGGIILNGDYGWSSGCDHAEYAIVGFNRFDADGNKIYSFGVIPRSD --------------------------2222-------------1111---------3333 YEIVDNWYAQAIKSSGSKMLKLVNVFIPEYRISKAKDMMEGKSAGFGLYPDSKIFYTPYR -----------1111------------1111----1111---1111--1111-------3 PYFASGFSAVSLGIAERMIEAFKEKQRNRVRAYTGANVGLATPALMRIAESTHQVAAARA 333----------------------1111--------1111------------------- LLEKTWEDHRIHGLNHQYPNKETLAFWRTNQAYAVKMCIEAVDRLMAAAGATSFMDNSEL --------------------------------------------3333-3333-1111-- QRLFRDAHMTGAHAYTDYDVCAQILGRELMGMEPDPTMV -------3333-3333-----------1111---3333- >2,6-DIHYDROXY-PSEUDO-OXYN; SWP:Q93NG6; PDB:2JBWA; KPEDEDNWGRLILDGVSYSDVGARDRPKEITWFDYWSLANEYEQEAERKVALGHDLSAGE 3333------------3333--11113333--3333-------------1111------- LLSAALCAQYAQFLWFDERRQKGQARKVELYQKAAPLLSPPAERHELVVDGIPPVYVRIP ----------------3333-------------3333-----------iiii-------- EGPGPHPAVILGGLESTKEESFQENLVLDRGATATFDGPGQGEFEYKRIAGDYEKYTSAV ----------------3333-------1111------2222--1111----3333----- VDLLTKLEAIRNDAIGVLGRSLGGNYALKSAACEPRLAACISWGGFSDLDYWDLETPLTK 
------11111111-------------------3333-----------1111-------- ESWKYVSKVDTLEEARLHVHAALETRDVLSQIACPTYILHGVHDEVPLSFVDTVLELVPA ----1111------------111111111111--------1111--------------33 EHLNLVVEKDGDHCCHNLGIRPRLEADWLYDVLVAGKKVAPTKGWPLEH 33---------222211113333-------------------------- >HOLO-[ACYL-CARRIER-PROTEI; SWP:O86785; PDB:2JBZA; HMSIIGVGIDVAEVERFGAALERTPALAGRLFLESELLLPGGERRGVASLAARFAAKEAL -----------------------33333333-3333--3333------------------ AKALGAPAGLLWTDAEVWVEAGGRPRLRVTGTVAARAAELGVASWHVSLSHDAGIASAVV -1111-----1111-----3333----------------------------%%%%----- IAEG ---- >O-ACETYLSERINE SULFHYDRYL; SWP:P29848; PDB:2JC3A; MNTLEQTIGNTPLVKLQRIGPDNGSEIWVKLEGNNPAGSVKDRAALSMIVEAEKRGEIKP --3333------------------------11111111--------------1111---- GDVLIEATSGNTGIALAMIAALKGYRMKLLMPDNMSQERRAAMRAYGAELILVTKEQGME ------------------------------------------3333-------3333--- GARDLALAMSERGEGKLLDQFNNPDNPYAHYTTTGPEIWRQTSGRITHFVSSMGTTGTIT ------------------1111-------------------%%%%--------------- GVSRFLREQEKPVTIVGLQPEEGSSIPGIRRWPAEYMPGIFNASLVDEVLDIHQNDAENT --------------------2222-2222--------11111111--------------- MRELAVREGIFCGVSSGGAVAGALRVARATPGAIVVAIICDRGDRYLSTGVFGE -----------------------------2222---------11111111---- >EXODEOXYRIBONUCLEASE III; SWP:Q9K100; PDB:2JC4A; MKITTWNVNSLNVRLPQVQNLLADNPPDILVLQELKLDQDKFPAAALQMMGWHCVWSGQK -------------------------------------1111-33333333---------- TYNGVAIVSRSVPQDVHFGLPALPDDPQRRVIAATVSGVRVINVYCVNGEALDSPKFKYK -------------------1111------------iiii-----------1111------ EQWFAALTEFVRDEMTRHGKLVLLGDFNIAPADADCYDPEKWHEKIHCSSVERQWFQNLL -------------------------------3333--3333------------------- DLGLTDSLRQVHPEGAFYTWFDYRGAMFQRKLGLRIDHILVSPAMAAALKDVRVDLETRA -----3333-------------22223333---------------1111------3333- LERPSDHAPVTAEFDW ---------------- >EXODEOXYRIBONUCLEASE; SWP:A1IPH9; PDB:2JC5A; MLKIISANVNGIRSAYKKGFYEYIAASGADIVCVQELKAQEADLSADMKNPHGMHGHWHC ------------------3333-----------------3333-1111-2222------- 
AEKRGYSGVAVYSKRKPDNVQIGMGIEEFDREGRFVRCDFGRLSVISLYLPSGSSAEERQ -------------------------3333----------!!!!---------3333---- QVKYRFLDAFYPMLEAMKNEGRDIVVCGDWNIAHQNIDLKNWKGNQKNSGFLPEEREWIG ----------------1111--------------3333--33331111------------ KVIHKLGWTDMWRTLYPDVPGYTWWSNRGQAYAKDVGWRIDYQMVTPELAAKAVSAHVYK ---------------1111------------1111-----------3333---------- DEKFSDHAPLVVEYDYAAE ------------------- >BETA-LACTAMASE OXA-24; SWP:NA; PDB:2JC7A; HISSQQHEKAIKSYFDEAQTQGVIIIKEGKNLSTYGNALARANKEYVPASTFKMLNALIG ---------------------------!!!!------3333------!!!!--------- LENHKATTNEIFKWDGKKRTYPMWEKDMTLGEAMALSAVPVYQELARRTGLELMQKEVKR ------1111----------3333------------------------------------ VNFGNTNIGTQVDNFWLVGPLKITPVQEVNFADDLAHNRLPFKLETQEEVKKMLLIKEVN --!!!!----1111--------------------1111--------------------ii GSKIYAKSGWGMGVTPQVGWLTGWVEQANGKKIPFSLNLEMKEGMSGSIRNEITYKSLEN ii------------------------1111-----------2222--------------- LGII ---- >CYTOSOLIC PURINE 5'-NUCLE; SWP:P49902; PDB:2JC9A; TSWSDRLQNAADMPANMDKHALKKYRREAYHRVFVNRSLAMEKIKCFGFDMDYTLAVYKS -3333----3333--------------3333--------3333-------2222------ PEYESLGFELTVERLVSIGYPQELLSFAYDSTFPTRGLVFDTLYGNLLKVDAYGNLLVCA ---------------1111--3333----1111-----------------1111------ HGFNFIRGPETREQYPNKFIQRDDTERFYILNTLFNLPETYLLACLVDFFTNCPRYTSCE !!!!--3333-1111-----3333--------3333----------------3333--11 TGFKDGDLFMSYRSMFQDVRDAVDWVHYKGSLKEKTVENLEKYVVKDGKLPLLLSRMKEV 11--!!!!------------------------------3333----1111---------- GKVFLATNSDYKYTDKIMTYLFDFPHGPKPGSSHRPWQSYFDLILVDARKPLFFGEGTVL -------------------1111-----2222---3333--------------------- RQVDTKTGKLKIGTYTGPLQHGIVYSGGSSDTICDLLGAKGKDILYIGDHIFGDILKSKK -------------------2222-----3333-------1111------3333------- RQGWRTFLVIPELAQELHVWTDKSSLFEELQSLDIFLAQRRIKKVTHDMDMCYGMMGSLF -----------------------------------------------------1111111 RSGSRQTLFASQVMRYADLYAASFINLLYYPFSYLFRAAHVLMPHES 1!!!!---------------------11111111--------1111- >5-FORMYLTETRAHYDROFOLATE ; SWP:NA; PDB:2JCBA; 
EKLRLRKQIIEHMNSLSKERYTTLSEQIVFSLYEQKEWAEAKTIGITLSMENEVNTYPII ------------1111-------------------3333----------!!!!------- EKAWKEGKRVVVPKCNKETRTMSFRQISNFDQLETVYMNLREPIPALTEEVNADEIDLQI ---1111---------------------1111-----------3333----1111----- VPGVAYTERGERIGYGGGYYDRYLVHYKGKTLSLAYSFQMVEHIPVEPFDKNVEKIITEK ------1111------------3333---------3333-------1111-------111 GTMVKN 1----- >GLUCOSE-RESISTANCE AMYLAS; SWP:P46828; PDB:2JCGA; VTIYDVAREASVSMATVSRVVNGNPNVKPSTRKKVLETIERLGYRPNAVTTTVGVIIPDI -3333--1111--------11111111--------------------------------- SNIFYAELARGIEDIASMYKYNIILSNSDQNQDKQLHLLNNMLGKQVDGIIFMSGNVTEE -1111-----------------------------------3333-------------333 HVEELKKSPVPVVLAASIESTNQIPSVTIDYEQAAFDAVQSLIDSGHKNIAFVSGTLEEP 31111-------------3333--------------------1111---------3333- INHAKKVKGYKRALTESGLPVRDSYIVEGDYTYDSGIEAVEKLLEEDEKPTAIFVGTDEM -------------3333----1111-----------------1111-------------- ALGVIHGAQDRGLNVPNDLEIIGFDNTRLSTMVRPQLTSVVQPMYDIGAVAMRLLTKYMN --------------------------3333---------------------------111 KETVDSSIVELPHRIEFRQSTK 1--------------------- >CD44 ANTIGEN; SWP:Q3U8S1; PDB:2JCQA; QIDLNVTCRYAGVFHVEKNGRYSISRTEAADLCQAFNSTLPTMDQMKLALSKGFETCRYG ---------iiii----%%%%------------1111------------1111------- FIEGNVVIPRIHPNAICAANHTGVYILVTSNTSHYDTYCFNASAPPEEDCTSVTDLPNSF -2222--------1111iiii-------------------1111-----------1111- DGPVTITIVNRDGTRYSKKGEYRTHQEDID ---------1111-----------3333-- >LAMININ SUBUNIT ALPHA-1; SWP:P19137; PDB:2JD4A; APLAQPELCAVDTAPGYVAGAHQFGLSQNSHLVLPLQQSDVRKRLQVQLSIRTFASSGLI ----------------------------------------1111---------------- YYVAHQNQMDYATLQLQEGRLHFMFDLGKGRTKVSHPALLSDGKWHTVKTEYIKRKAFMT ----1111--------iiii---------------------------------------- VDGQESPSVTVVGKATTLDVERKLYLGGLPSHYRARNIGTITHSIPACIGEIMVNGQQLD iiii-------------------------1111--------------------iiii--1 KDRPLSASAVDRCYVVAQEGTFFEGSGYAALVKEGYKVRLDLQITLEFRTTSKNGVLLGI 111----------------------------3333------------------------- 
SSAKVDAIGLEIVDGKVLFHVNNGAGRITATYQPRAARALCDGKWHTLQAHKSKHRIVLT ------------iiii-------------------1111--------------------- VDGNSVRAEHSTSADTNDPIYVGGYPAHIKQNSLSSRASFRGCVRNLRLSQVQSLDLSRA iiii---------------------1111--------------------------3333- FDLQGVFPHSCPGPEP ------2222------ >YECBM32; SWP:A1JSS7; PDB:2JDAA; TAQIVAVTASGYDSEKGHVPANIADGDVKTRWAASGESWVQLELDKEQSIENILIVPFKP -------------1111-3333----1111----------------------------11 TERKLKFSIFYSNDGKNWQPLAEGLETSSADKNGEKLTFTPVTAKYIKLDTFGTDVNNWS 11---------------------------------------------------------- AINEIAINSAAALPSRAIK ------------------- >GLYPHOSATE N-ACETYLTRANSF; SWP:NA; PDB:2JDCA; IEVKPINAEDTYELRHRILRPNQPIEACMFESDLLRGAFHLGGYYGGKLISIASFHQAEH ------3333---------1111--11113333-2222------iiii-----------1 SELQGQKQYQLRGMATLEGYREQKAGSSLIKHAEEILRKRGADLLWCNARTSASGYYKKL 111-------------2222-----------------1111--------3333----111 GFSEQGEVFDTPPVGPHILMYKRIT 1------------------------ >ATP SYNTHASE SUBUNIT ALPH; SWP:P19483; PDB:2JDIA; DLEETGRVLSIGDGIARVHGLRNVQAEEMVEFSSGLKGMSLNLEPDNVGVVVFGNDKLIK -----------iiii-----11112222---3333--------1111-------3333-2 EGDIVKRTGAIVDVPVGEELLGRVVDALGNAIDGKGPIGSKARRRVGLKAPGIIPRISVR 222-------------3333-----1111-----------------------1111---- EPMQTGIKAVDSLVPIGRGQRELIIGDRQTGKTSIAIDTIINQKRFNDGTDEKKKLYCIY ----------------2222----------3333-------------------------- VAIGQKRSTVAQLVKRLTDADAMKYTIVVSATASDAAPLQYLAPYSGCSMGEYFRDNGKH ---------------------3333------11113333--------------------- ALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKMNDAFGGGSLTA -------------------1111---2222-1111------3333----3333------- LPVIETQAGDVSAYIPTNVISITDGQIFLETELFYKGIRPAINVGLSVSRVGSAAQTRAM ------%%%%-----------------------3333------3333-33331111---- KQVAGTMKLELAQYREVAAFAQFGSDLDAATQQLLSRGVRLTELLKQGQYSPMAIEEQVA ------------33333333----------------------1111-------------- VIYAGVRGYLDKLEPSKITKFENAFLSHVISQHQALLGKIRTDGKISEESDAKLKEIVTN ----1111-11111111------------------------------------------- FLAGFEA ---1111 >ATP synthase 
subunit beta; SWP:P00829; PDB:2JDID; TTGRIVAVIGAVVDVQFDEGLPPILNALEVQGRETRLVLEVAQHLGESTVRTIAMDGTEG --------!!!!----------2222-------------------%%%%--------222 LVRGQKVLDSGAPIRIPVGPETLGRIMNVIGEPIDERGPIKTKQFAAIHAEAPEFVEMSV 2-----------------1111-----1111----------------------3333--- EQEILVTGIKVVDLLAPYAKGGKIGLFGGAGVGKTVLIMELINNVAKAHGGYSVFAGVGE ------------------2222------2222-------------1111----------- RTREGNDLYHEMIESGVINLKDATSKVALVYGQMNEPPGARARVALTGLTVAEYFRDQEG 3333--------------------------------3333-------------------- QDVLLFIDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGTMQERITTTKKGSITSVQ -------------------3333------iiii1111------3333---3333------ AIYVPADDLTDPAPATTFAHLDATTVLSRAIAELGIYPAVDPLDSTSRIMDPNIVGSEHY ---22221111-----3333--------3333--------1111--11113333------ DVARGVQKILQDYKSLQDIIAILGMDELSEEDKLTVSRARKIQRFLSQPFQVAEVFTGHL ---------------3333----3333-3333-----------3333--1111------- GKLVPLKETIKGFQQILAGEYDHLPEQAFYMVGPIEEAVAKADKLAE --------------------11113333-----3333---------- >ATP synthase gamma chain,; SWP:ATPG_BOVIN; PDB:2JDIG; ATLKDITRRLKSIKNIQKITKSMKMVAAAKYARAERELKPARVYGVGLIIGVSSDRGLCG ----------------------------------------------------------!! 
AIHSSVAIIGVGDKIRSILTFKEVGRRPPTFGDASVIALELSIIFNRFRSVISYKTEYSL !!-------------3333----------3333----3333---------------3333 ANIIYYSLKESTTSEQSARMTAMDNASKNASEMIDKLTLTFNRTRQAVITKELIEIISGA ------------------------------------------------------------ AALD ---- >ATP synthase delta chain,; SWP:ATPD_BOVIN; PDB:2JDIH; SFTFASPTQVFFNQVDVPTLRPGLVVVFVSSGSQLLAEEAVTLDMLDLGAAKANLEKAQS ----------------------------------------------------------33 ELLGAADEATRAEIQIRIEANEALVKAL 33---------------------3333- >IMPORTIN ALPHA-1 SUBUNIT; SWP:Q5R909; PDB:2JDQA; TSDMIEMIFSKSPEQQLSATQKFRKLLSKEPNPPIDEVISTPGVVARFVEFLKRKENCTL 3333------------------------------3333--2222---------1111--- QFESAWVLTNIASGNSLQTRIVIQAGAVPIFIELLSSEFEDVQEQAVWALGNIAGDSTMC --------------3333----------------------------------1111---- RDYVLDCNILPPLLQLFSKQNRLTMTRNAVWALSNLCRGKSPPPEFAKVSPCLNVLSWLL ----1111------------------------------------33333333------11 FVSDTDVLADACWALSYLSDGPNDKIQAVIDAGVCRRLVELLMHNDYKVVSPALRAVGNI 11--------------1111---------1111-----1111---3333----------- VTGDDIQTQVILNCSALQSLLHLLSSPKESIKKEACWTISNITAGNRAQIQTVIDANIFP ----------1111-------3333----------------------------1111--- ALISILQTAEFRTRKEAAWAITNATSGGSAEQIKYLVELGCIKPLCDLLTVMDSKIVQVA ------------------------------------1111-----3333----------- LNGLENILRLGEQEAKRNGTGINPYCALIEEAYGLDKIEFLQSHENQEIYQKAFDLIEHY ----------------------1111---------------------------------- FGTE ---- >Polymerase [Fragment]; SWP:Q7TGX1; PDB:2JDQD; SAVLRGFLILGKEDRRYGPALSINELSNLAKGEKANVLIGQGDVVLVMKRKRDSQTATKR 3333---------3333----333311112222--------------------------- IRM --- >EXOSOME COMPLEX EXONUCLEA; SWP:Q9UXC2; PDB:2JE6A; HMSSTPSNQNIIPIIKKESIVSLFEKGIRQDGRKLTDYRPLSITLDYAKKADGSALVKLG ------------3333----1111---------1111----------1111-------!! 
TTMVLAGTKLEIDKPYEDTPNQGNLIVNVELLPLAYETFEPGPPDENAIELARVVDRSLR !!-------------1111------------33331111--------------------3 DSKALDLTKLVIEPGKSVWTVWLDVYVLDYGGNVLDACTLASVAALYNTKVYKVEQHISV 333--3333---2222-------------------------------------------- NKNEVVGKLPLNYPVVTISVAKVDKYLVVDPDLDEESIMDAKISFSYTPDLKIVGIQKSG ----------------------!!!!-----33331111--------1111--------- KGSMSLQDIDQAENTARSTAVKLLEELKKHLGI --------------------------------- >Probable exosome complex ; SWP:Q9UXC2; PDB:2JE6B; ERPKLILDDGKRTDGRKPDELRSIKIELGVLKNADGSAIFEMGNTKAIAAVYGPKEMHPR ------1111-1111-1111---------------------!!!!-------------33 HLSLPDRAVLRVRYHMTPFSTDERKNPAPSRREIELSKVIREALESAVLVELFPRTAIDV 33-1111---------1111----------------------------11112222---- FTEILQADAGSRLVSLMAASLALADAGIPMRDLIAGVAVGKADGVIILDLNETEAMWGEA ---------3333----------1111--------------iiii--------------- DMPIAMMPSLNQVTLFQLNGSMTPDEFRQAFDLAVKGINIIYNLEREALKSKYV ------3333---------------------------------------1111- >Probable exosome complex ; SWP:Q9UXC4; PDB:2JE6I; QEIVLQPRSIVVPGELLAEGEFQIPWSPYILKINSKYYSTVVGLFDVKDTQFEVIPLEGS -----------2222-----------1111--!!!!-----------!!!!--------- FYYPKINDIVIGLVEDVEIYGWVVDIKAPYKAYLPASNLLGRSINVGEDLRRYLDVGDYV ----2222--------------------------3333----------------2222-- IARIENFDRSIDPVLSVKGKDLGRVSNGIVIDIMPVKVPRVIGKNKSMYETLTSKSIFVA -------1111----------------------3333------%%%%----3333----- NNGRIWAFSEEILIEAIRKIENESHIK -----------333333331111---- >BETA-MANNOSIDASE; SWP:Q8AAK6; PDB:2JE8A; NDTSEVMLLDTGWEFSQSGTEKWMPATVPGTVHQDLISHELLPNPFYGMNEKKIQWVENE ----------------2222----------------1111---1111--3333-3333-- DWEYRTSFIVSEEQLNRDGIQLIFEGLDTYADVYLNGSLLLKADNMFVGYTLPVKSVLRK ----------3333--------------------iiii------1111-----3333--- GENHLYIYFHSPIRQTLPQYASNGFNYPADNDHHEKHLSVFSRKAPYSYGWDWGIRMVTS ------------3333------------1111-----3333---3333------------ GVWRPVTLRFYDIATISDYYVRQLSLTDENARLSNELIVNQIVPQKIPAEVRVNVSLNGT --------------------------3333--------------------------iiii 
TVTEVKQQVTLQPGINHITLPAEVTNPVRWMPNGWGTPTLYDFSAQIACGDRIVAEQSHR -------------------------------2222-------------!!!!-------- IGLRTIRVVNEKDKDGESFYFEVNGIPMFAKGANYIPQDALLPNVTTERYQTLFRDMKEA ------------1111------iiii--------------3333-------------111 NMNMVRIWGGGTYENNLFYDLADENGILVWQDFMFACTPYPSDPTFLKRVEAEAVYNIRR 1-------------3333------------------------------------------ LRNHASLAMWCGNNEILEALKYWGFEKKFTPEVYQGLMHGYDKLFRELLPSTVKEFDSDR 1111-------------------3333-----------------------------1111 FYVHSSPYLANWGRPESWGTGDSHNWGVWYGKKPFESLDTDLPRFMSEFGFQSFPEMKTI ----------11111111----------1111-3333------------------33333 AAFAAPEDYQIESEVMNAHQKSSIGNSLIRTYMERDYIIPESFEDFVYVGLVLQGQGMRH 333-3333-11113333-------3333-------------------------------- GLEAHRRNRPYCMGTLYWQLNDSWPVVSWSSIDYYGNWKALHYQAKRAFAPVLINPIQQN -----1111-----------------------1111----------1111--------%% DSLSVYLISDRLDTMEQMTLEMKVVDFDGKTLGKKIQVHSLEVPANTSKCVYRAKLDGWL %%-----------------------1111--------------------------2222- TPEDCRRSFLKLILKDKSGHQVAESVHFFRKTKDLQLPPTSVSYQMKQTDGKCELTLFSS 3333-----------1111-----------3333--------------2222-------- MLAKDIFIETPLQGARYSDNFFDLLPGERKKVIITSPRIKKGEELPVNIKHIRETYK -----------2222---------2222-------11112222-------3333--- >RV1873; SWP:A2VIY8; PDB:2JEKA; DPFDLKRFVYAQAPVYRSVVEELRAGRKRGHWMWFVFPQLRGLGSSPLAVRYGISSLEEA 11113333-----------------------3333----2222----------------- QAYLQHDLLGPRLHECTGLVNQVQGRSIEEIFGPPDDLKLCSSMTLFARATDANQDFVAL -------------------3333---3333----------------1111---------- LAKYYGGGEDRRTVALLAVT ----iiii------1111-- >Phosphocarrier protein HP; SWP:P0AA04; PDB:2JELH; QVQLAQSGPELVRPGVSVKISCKGSGYTFTTYAMHWVKQSHAKSLEWIGLISTYSGYTNY ------------2222-----------1111----------------------------- NQKFKGKATMTVDKSSSTAYMELARL 3333---------1111--------- >URIDINE-CYTIDINE KINASE 1; SWP:Q9HA47; PDB:2JEOA; MRPFLIGVSGGTASGKSTVCEKIMELLGQNEVEQRQRKVVILSQDRFYKVLTAEQKAKAL ----------2222-------------1111-3333------1111------------11 KGQYNFDHPDAFDNDLMHRTLKNIVEGKTVEVPTYDFVTHSRLPETTVVYPADVVLFEGI 
11--11113333------------------------------------------------ LVFYSQEIRDMFHLRLFVDTDSDVRLSRRVLRDVRDLEQILTQYTTFVKPAFEEFCLPTK 111133331111------------------1111-----------------------333 KYADVIIPRGVDNMVAINLIVQHIQDILNGDI 3--------3333------------------- >XYLOGLUCANASE; SWP:NA; PDB:2JEPA; ADASQIVSEMGAGWNLGNQLEAAVNGTPNETAWGNPTVTPELIKKVKAAGFKSIRIPVSY -3333--3333-------1111-iiii-1111------3333----1111---------! LNNIGSAPNYTINAAWLNRIQQVVDYAYNEGLYVIINIHGDGYNSVQGGWLLVNGGNQTA !!!-----------------------3333------------1111-----1111----- IKEKYKKVWQQIATKFSNYNDRLIFESMNEVFDGNYGNPNSAYYTNLNAYNQIFVDTVRQ ---------------11113333----------------------------------111 TGGNNNARWLLVPGWNTNIDYTVGNYGFTLPTDNYRSSAIPSSQKRIMISAHYYSPWDFA 1!!!!--------2222-1111----------111133331111-----------3333- GEENGNITQWGATSTNPAKKSTWGQEDYLESQFKSMYDKFVTQGYPVVIGEFGSIDKTSY ----------1111-3333--------------------3333-------------3333 DSSNNVYRAAYAKAVTAKAKKYKMVPVYWDNGHNGQHGFALFNRSNNTVTQQNIINAIMQ 1111---------------1111-----------2222---------------------1 GMQ 111 >AGMATINE DEIMINASE; SWP:Q837U5; PDB:2JERA; AKRIVGSTPKQDGFRMPGEFEPQEKVWMIWPERPDNWRDGGKPVQEAFTNVAKAISQFTP -------3333------1111-----------1111-%%%%------------------- MNVVVSQQQFQNCRRQLPPEITVYEMSNNDAWVRDCGPSFVINDHGEIRGVDWTFNAWGG -----3333--------3333----------3333--------------------%%%%- LVDGLYFPWDQDDLVAQKICEIEHVDSYRTDDFVLEGGSFHVDGQGTVLTTEMCLLSEGR --------3333--------1111-----2222--1111-----------3333--1111 NPQLSKEAIEQKLCDYLNVEKVLWLGDGIDPEETNGHVDDVACFIAPGEVACIYTEDQNS 1111-------------------------1111------------2222-------1111 PFYEAAQDAYQRLLKMTDAKGRQLKVHKLCCPVKNVTIKGSFKIDFVEGTMPREDGDICI -----------------1111-----------------1111----2222---2222--- ASYMNFLITNDGVIVPQYGDENDRLALEQVQTMFPDKKIVGVNTVEVVYGGGNIHITQQE -3333---!!!!-------1111----------1111------33331111--------- PKRVG ----- >FATTY ACID SYNTHASE; SWP:P49327; PDB:2JFKA; QSMRLLRASGRTPEAVQKLLEQGLRHSQDLAFLSMLNDIAAVPATAMPFRGYAVLGGERG -------------------------1111-------------3333-------------- 
GPEVQQVPAGERPLWFICSGMGTQWRGMGLSLMRLDRFRDSILRSDEAVKPFGLKVSQLL ------------------------2222-------3333---------------3333-- LSTDESTFDDIVHSFVSLTAIQIGLIDLLSCMGLRPDGIVGHSLGEVACGYADGCLSQEE ---1111----------------------1111---------3333-------------- AVLAAYWRGQCIKEAHLPPGAMAAVGLSWEECKQRCPPGVVPAHNSKDTVTISGPQAPVF ------------------------------------2222-------------------- EFVEQLRKEGVFAKEVRTGGMAFHSYFMEAIAPPLLQELKKVIREPKPRSARWLSTSIPE ------1111-------iiii---33331111-----3333--------3333-----33 AQWHSSLARTSSAEYNVNNLVSPVLFQEALWHVPEHAVVLEIAPHALLQAVLKRGLKPSC 33--3333-----------------33331111------------1111-------3333 TIIPLMKKDHRDNLEFFLAGIGRLHLSGIDANPNALFPPV ------2222--------------1111---3333----- >FRUCTOSE 1-PHOSPHATE KINA; SWP:Q6GIU3; PDB:2JG5A; MIYTVTFNPSIDYVIFTNDFKIDGLNRATATYKFAGGKGINVSRVLKTLDVESTALGFAG --------------------2222----------------------1111---------- GFPGKFIIDTLNNSAIQSNFIEVDEDTRINVKLKTGQETEINAPGPHITSTQFEQLLQQI -----------1111--------------------------------------------1 KNTTSEDIVIVAGSVPSSIPSDAYAQIAQITAQTGAKLVVDAEKELAESVLPYHPLFIKP 1111111--------11111111--------------------------3333------- NKDELEVMFNTTVNSDADVIKYGRLLVDKGAQSVIVSLGGDGAIYIDKEISIKAVNPQGK --------------------------1111-------!!!!-----1111---------- VVNTVGSGDSTVAGMVAGIASGLSIEKAFQQAVACGTATAFDEDLATRDAIEKIKSQVTI ---2222-----------1111----------------1111----3333----1111-- SVLDGE ------ >DNA-3-METHYLADENINE GLYCO; SWP:Q99TJ7; PDB:2JG6A; MNECAFGTKDPVYLNYHDHVWGQPLYDSKALFKLLALESQHAGLSWLTILKKKEAYEEAF 3333------------------------------------22223333-----------% YDFEPEKVAQMTAQDIDRLMTFPNIVHHRKKLEAIVNQAQGYLKIEQAYGSFSKFLWSYV %%%-3333-----------------------------------------------3333i NGKPKDLQYEHASDRITVDDTATQLSKDLKQYGFKFLGPVTVFSFLEAAGLYDAHLKDCP iii-------1111---------------1111----------------------1111- SKPKHN ------ >EUKARYOTIC TRANSLATION IN; SWP:O60573; PDB:2JGBA; KAVVPGPAEHPLQYNYTFWYSRRTPGRPTSSQSYEQNIKQIGTFASVEQFWRFYSHMVRP -----1111-------------------------1111--------------3333--33 
GDLTGHSDFHLFKEGIKPMWEDDANKNGGKWIIRLRKGLASRCWENLILAMLGEQFMVGE 33----------2222--33331111---------2222-----------1111---!!! EICGAVVSVRFQEDIISIWNKTASDQATTARIRDTLRRVLNLPPNTIMEYKTHTDSIKMP !--------------------1111-----------------2222-------------- GPQRLLF ------- >2-OXOGLUTARATE DEHYDROGEN; SWP:Q0T6W4; PDB:2JGDA; DTNVKQVKVLQLINAYRFRGHQHANLDPLGLWQQDDLDPSFHDLTEDFQETFNVGSFAET ------------------3333----1111--------3333----1111---!!!!--- MKLGELLEALKQTYCGPIGAEYMHITSTEEKRWIQQRIESRATFNSEEKKRFLSELTAAE --------------------------3333--------------3333------------ GLERYLGAKFPGRFSLEGGDALIPMLKEMIRHAGNSGTREVVLGMAHRGRLNVLVNVLGK ----------------------------------------------2222---------- KPQDLFDEFAGKHHLGTGDVKYHMGFSSDFQTDGGLVHLALAFNPSHLEIVSPVVIGSVR -------1111--------1111--------1111------------------------- ARLDRLDEPSSNKVLPITIHGDAAVTGQGVVQETLNMSKARGYEVGGTVRIVINNQYCTD --1111---3333--------------------------2222---------------33 IGKMVQAPIFHVNADDPEAVAFVTRLALDFRNTFKRDVFIDLVCYRRHQPLMYQKIKKHP 33----------1111-------------------------------------------- TPRKIYADKLEQEKVATLEDATEMVNLYRDALDAGDCVVAEWRPMNMHSFTWSPYLNHEW ----------------3333------------------1111---33331111-----11 DEEYPNKVEMKRLQELAKRISTVPEAVEMQSRVAKIYGDRQAMAAGEKLFDWGGAENLAY 11---------------1111--3333---------------1111-------------- ATLVDEGIPVRLSGEDSGRGTFFHRHAVIHNQSNGSTYTPLQHIHNGQGAFRVWDSVLSE ---1111--------33331111---------------3333--2222------------ EAVLAFEYGYATAEPRTLTIWEAQFGDFANGAQVVIDQFISSGEQKWGRMCGLVMLLPHG -------------1111-------3333-------------3333--------------- YEGQGPEHSSARLERYLQLCAEQNMQVCVPSTPAQVYHMLRRQALRGMRRPLVVMSPKSL ----1111---3333-1111%%%%---------------------------------333 LRHPLAVSSLEELANGTFLPAIGEIDELDPKGVKRVVMCSGKVYYDLLEQRRKNNQHDVA 3-1111----------------------3333--------3333---------------- IVRIEQLYPFPHKAMQEVLQQFAHVKDFVWCQEEPLNQGAWYCSQHHFREVIPFGASLRY ------------------3333------------1111-3333---------3333---- AGRPASASPAVGHMSVHQKQQQDLVNDALNVE -------------------------------- >COMPLEMENT FACTOR H; SWP:P08603; 
PDB:2JGWA; LRKCYFPYLENGYNQNHGRKFVQGKSIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRV ---------------2222-----------------2222----------3333------ K - >UMP SYNTHASE; SWP:P11172; PDB:2JGYA; SMELSFGARAELPRIHPVASKLLRLMQKKETNLCLSADVSLARELLQLADALGPSICMLK ----3333---1111------------------------------------3333----- THVDILNDFTLDVMKELITLAKCHEFLIFEDRKFADIGNTVKKQYEGGIFKIASWADLVN -1111----3333-----------------------3333----------3333------ AHVVPGSGVVKGLQEVGLPLHRGCLLIAEMSSTGSLATGDYTRAAVRMAEEHSEFVVGFI --1111-----------1111----------2222----------------1111----- SGSRVSMKPEFLHLTPGVQLEAYNSPQEVIGKRGSDIIIVGRGIISAADRLEAAEMYRKA -------3333------------------------------------------------- AWEAYLSRL -----3333 >PHOSPHORIBOSYL PYROPHOSPH; SWP:NA; PDB:2JI4A; GLVLFSANSNSSCMELSKKIAERLGVEMGKVQVYQEPNRETRVQIQESVRGKDVFIIQTV ---------3333----------------------1111---------2222-------- SKDVNTTIMELLIMVYACKTSCAKSIIGVIPYFPYSKQSIVSKLLASMMCKAGLTHLITM ---------------------------------------3333------3333------- DLHQKEIQGFFNIPVDNLRASPFLLQYIQEEIPDYRNAVIVAKSPASAKRAQSFAERLRL -----3333----------3333----------1111------3333--------1111- GIAVIHPITVVGDVGGRIAIIVDDIIDDVDSFLAAAETLKERGAYKIFVMATHGLLSSDA -------------2222------------3333--------------------------- PRRIEESAIDEVVVTNTIPHEVQKLQCPKIKTVDISMILSEAIRRIHNGESMSYLFRNIG ---3333-------------------3333-------------------------1111- LD -- >HYPOTHETICAL PROTEIN; SWP:Q20595; PDB:2JM3A; GSMPTTCGFPNCKFRSRYRGLEDNRHFYRIPKRPLILRQRWLTAIGRTEETVVSQLRICS -------------1111-------------------------1111-1111-------11 AHFEGGEKKEGDIPVPDPTVDKQIKIELPPK 11--------------1111----------- >RELAXIN RECEPTOR 1; SWP:Q9HBX9; PDB:2JM4A; GSQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADEDNCG -------2222----------3333----------3333---- >REGULATOR OF G-PROTEIN SI; SWP:Q9NS28; PDB:2JM5A; SMVSPEEAVKWGESFDKLLSHRDGLEAFTRFLKTEFSEENIEFWIACEDFKKSKGPQQIH --------3333--------------------1111-------------3333------- LKAKAIYEKFIQTDAPKEVNLDFHTKEVITNSITQPTLHSFDAAQSRVYQLMEQDSYTRF 
-------------------------------3333---1111------------------ LKSDIYLDLMEGRP -------------- >DNA BINDING DOMAIN/TRANSC; SWP:Q1DDV9_MYXXD; PDB:2JMLA; HMTLRIRTIARMTGIREATLRAWERRYGFPRPLRSEGNNYRVYSREEVEAVRRVARLIQE ----3333-----3333------------------------------------------- EGLSVSEAIAQVKTEPPRE -3333-------------- >PARKIN; SWP:Q8NI42; PDB:2JMOA; GHMGEEQYNRYQQYGAEECVLQMGGVLCPRPGCGAGLLPEPDQRKVTCEGGNGLGCGFAF ------------3333-------------------------------------------- CRECKEAYHEGECSAVFEAS -1111--------------- >THIAMINE-TRIPHOSPHATASE; SWP:Q8JZL3; PDB:2JMUA; SAQGLIEVERKFAPGPDTEERLQELGATLEHRVTFRDTYYDTSELSLMLSDHWLRQREGS -----------------------------------------11113333-------%%%% GWELKCPGVTGVSGPHNEYVEVTSEAAIVAQLFELLGSGEQKPAGVAAVLGSLKLQEVAS ------------------------------------------------------------ FITTRSSWKLALSGAHGQEPQLTIDLDSADFGYAVGEVEAMVHEKAEVPAALEKIITVSS -------------3333--------------------------3333------------- MLGVPAQEEAPAKLMVYLQRFRPLDYQRLLEAASSGEATGDSAS ----------------------------3333------------ >PROTEIN CGL2762; SWP:Q8NM20; PDB:2JN6A; MPTKTYSEEFKRDAVALYENSDGASLQQIANDLGINRVTLKNWIIKYGSNHNVQGTTPSA -------------------3333-----------------------------------11 AVSEAEQIRQLKKENALQRARTRHPAESCLEHHHHHH 11-----------------------3333-------- >PROTEIN YFJZ; SWP:P52141; PDB:2JN7A; MSNTTWGLQRDITPRLGARLVQEGNQLHYLADRASITGKFSDAECPKLDVVFPHFISQIE ----------------------!!!!---3333-------1111-------------333 SMLTTGELNPRHAQCVTLYHNGFTCEADTLGSCGYVYIAVYPTQR 3--1111------------iiii-----iiii------------- >PUTATIVE SECRETED PROTEIN; SWP:Q7CR88; PDB:2JNAA; MKKRIIAAALLATVASFSTLAAEQVSKQEISHFKLVKVGTINVSQSGGQISSPSDLREKL ------------------------------------------------------------ SELADAKGGKYYHIIAAREHGPNFEAVAEVYNDATKLEHHHHHH ----1111-----------!!!!--------------------- >CORTICOTROPIN-RELEASING F; SWP:Q60748; PDB:2JNCA; YCHRTTIGNFSGPYTYCNTTLDQIGTCWPQSAPGALVERPCPEYFNGIKYNTTRNAYREC ------------------------------------------------------------ LENGTWASRVNYSHCEPI 3333------3333---- >CULLIN-7; SWP:Q14999; 
PDB:2JNGA; MRSEFASGNTYALYVRDTLQPGMRVRMLDDYEEISAGDEGEFRQSNNGVPPVQVFWESTG 11111111----------------------!!!!2222------------------3333 RTYWVHWHMLEILGFEE -----3333-------- >REGULATOR OF G-PROTEIN SI; SWP:O43566; PDB:2JNUA; SMTEEQPVASWALSFERLLQDPLGLAYFTEFLKKEFSAENVTFWKACERFQQIPASDTQQ --------3333--------------------1111-------------33333333--- LAQEARNIYQEFLSSQALSPVNIDRQAWLGEEVLAEPRPDMFRAQQLQIFNLMKFDSYAR -------------1111-------1111-3333-----------------------3333 FVKSPLYRECLLAEAE 1111--3333------ >KLENOW FRAGMENT; SWP:P00582; PDB:2KFNA; MISYDNYVTILDEETLKAWIAKLEKAPVFAFDTETDSLDNISANLVGLSFAIEPGVAAYI --3333----------------1111------------3333----------2222---- PVAHDYLDAPDQISRERALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTM -----2222------------------3333--------------1111----------- LESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAEDAD ------1111------------------3333----1111-3333--------------- VTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPKVLHNHSEELTLRL ---------3333-------------3333------------------------------ AELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPSTSEEVLEELALDYPLPKVI ----------------------33333333--------------33333333-------- LEYRGLAKLKSTYTDKLPLMINPKTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGR ---------------3333-----------------1111-------1111----3333- RIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLE --------2222---------------------------1111--------------333 TVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLEYMERTRA 3---------------1111-11111111------------------3333--------- QAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQGTAADIIKRAMIAVDAWL ---------1111----1111----------------3333------------------- QAEQPRVRMIMQVHDELVFEVHKDDVDAVAKQIHQLMENCTRLDVPLLVEVGSGENWDQA ------------!!!!-----1111------------------------------3333- H - >KINESIN; SWP:P56536; PDB:2KINA; ADPAECSIKVMCRFRPLNEAEILRGDKFIPKFKGEETVVIGQGKPYVFDRVLPPNTTQEQ ---------------------------------------!!!!---------1111---- VYNACAKQIVKDVLEGYNGTIFAYGQTSSGKTHTMEGKLHDPQLMGIIPRIAHDIFDHIY 
------------1111---------2222--------1111-----------------11 SMDENLEFHIKVSYFEIYLDKIRDLLDVSKTNLAVHEDKNRVPYVKGCTERFVSSPEEVM 111111-----------%%%%--1111----------1111------------------- DVIDEGKANRHVAVTNMNEHSSRSHSIFLINIKQENVETEKKLSGKLYLVDLAGSEKV ----------2222-3333-1111---------------------------------- >Kinesin heavy chain isofo; SWP:P28738; PDB:2KINB; AKNINKSLSALGNVISALAEGTKTHVPYRDSKMTRILQDSLDGNCRTTIVICCSPSVFNE ----3333------------------1111--------1111-------------3333- AETKSTLMFGQRAKTIKNTVSVNLELTAEEWKKKYEKEKE ------------1111------------------------ >APO-LACTATE DEHYDROGENASE; SWP:LDHX_MOUSE; PDB:2LDX; STVKEQLIQNLVPEDKLSRCKITVVGVGDVGMACAISILLKGLADELALVDADTDKLRGE -----------------------------------3333--------------------- ALDLQHGSLFLSTPKIVFGKDYNVSANSKLVIITAGARMVSGQTRLDLLQRNVAIMKAIV --------------------3333------------------------------------ PGVIQNSPDCKIIVVTNPVDILTYVVWKISGFPVGRVIGSGCNLDSARFRYLIGEKLGVN -3333-1111-----------------------------!!!!---------3333---- PTSCHGWVLGEHGDSSVPIWSGVNVAGVTLKSLNPAIGTDKNKQHWKNVHKQVVEGGYEV ------------------3333-----------1111--------3333----------3 LDMKGYTSWAIGLSVTDLARSILKNLKRVHPVTTLVKGFHGIKEEVFLSIPCVLGESGIT 333-----3333---------1111-----------------------------1111-- DFVKVNMTAEEEGLLKKSADTLWNMQKNLEL ------------------------------- >LYMPHOID ENHANCER-BINDING; SWP:P27782; PDB:2LEFA; MHIKKPLNAFMLYMKEMRANVVAESTLKESAAINQILGRRWHALSREEQAKYYELARKER --------------------------------------3333------------------ QLHMQLYPGWSARDNYGKKKKRKREK ------2222----2222-------- >HEMOGLOBIN V (CYANO MET); SWP:P02208; PDB:2LHB; PIVDTGSVAPLSAAEKTKIRSAWAPVYSTYETSGVDILVKFFTSTPAAQEFFPKFKGLTT ----------------------------3333------------333311111111---- ADELKKSADVRWHAERIINAVDDAVASMDDTEKMSMKLRNLSGKHAKSFQVDPEYFKVLA ------------------------1111-3333------------------3333----- AVIADTVAAGDAGFEKLMSMICILLRSAY -------2222------------1111-- >SPERM LYSIN; SWP:P04552; PDB:2LISA; HYVEPKFLNKAFEVALKVQIIAGFDRGLVKWLRVHGRTLSTVQKKALYFVNRRYMQTHWA 
----------------------------------1111---------------------- NYMLWINKKIDALGRTPVVGDYTRLGAEIGRRIDMAYFYDFLKDKNMIPKYLPYMEEINR ---------1111----3333---------------------1111-----3333----- MRPADVPVKYM -3333------ >n/a; SWP:P02867; PDB:2LTNA; TETTSFLITKFSPDQQNLIFQGDGYTTKEKLTLTKAVKNTVGRALYSSPIHIWDRETGNV --------------1111------------------------------------------ ANFVTSFTFVINAPNSYNVADGFTFFIAPVDTKPQTGGGYLGVFNSAEYDKTTQTVAVEF ------------------------------------!!!!---------1111------- DTFYNAAWDPSNRDRHIGIDVNSIKSVNTKSWKLQNGEEANVVIAFNAATNVLTVSLTYP ----3333-3333---------------------2222---------------------- N - >Lectin [Fragment]; SWP:Q84TR3; PDB:2LTNB; VTSYTLSDVVSLKDVVPEWVRIGFSATTGAEYAAHEVLSWSFHSELSG ----------3333---------------------------------- >INOSINE-URIDINE NUCLEOSID; SWP:Q27546; PDB:2MASA; AKKIILDCDPGLDDAVAILLAHGNPEIELLAITTVVGNQTLAKVTRNAQLVADIAGITGV ----------3333---------1111------------3333----------------- PIAAGCDKPLVRKIMTAGHIHGESGMGTVAYPAEFKNKVDERHAVNLIIDLVMSHEPKTI ----------------1111-3333-----------------3333-------------- TLVPTGGLTNIAMAARLEPRIVDRVKEVVLMGGGYHEGNATSVAEFNIIIDPEAAHIVFN -------------------3333---------------------3333---------111 ESWQVTMVGLDLTHQALATPPILQRVKEVDTNPARFMLEIMDYYTKIYQSNRYMAAAAVH 1-------3333-----------------------------------------------3 DPCAVAYVIDPSVMTTERVPVDIELTGKLTLGMTVADFRNPRPEHCHTQVAVKLDFEKFW 333--------------------------2222--------------------------- GLVLDALERIGDP ------------- >Ig lambda chain V-II regi; SWP:P01709; PDB:2MCG1; SALTQPPSASGSLGQSVTISCTGTSSDVGGYNYVSWYQQHAGKAPKVIIYEVNKRPSGVP ----------------------------1111---------------------------- DRFSGSKSGNTASLTVSGLQAEDEADYYCSSYEGSDNFVFGTGTKVTVLGQPKANPTVTL -------------------1111------------------------------------- FPPSSEELQANKATLVCLISDFYPGAVTVAWKADGSPVKAGVETTKPSKQSNNKYAASSY -----3333-----------------------iiii------------------------ LSLTPEQWKSHRSYSCQVTHEGSTVEKTVAPTECS -----3333-------------------------- >MACROMOMYCIN; SWP:P01549; PDB:2MCM; 
APGVTVTPATGLSNGQTVTVSATGLTPGTVYHVGQCAVVEPGVIGCDATTSTDVTADAAG ------------2222---------2222----------2222----1111-----1111 KITAQLKVHSSFQAVVGADGTPWGTVNCKVVSCSAGLGSDSGEGAAQAITFA ----------------1111------------------1111---------- >Genome polyprotein; SWP:P12296; PDB:2MEV1; GVENAEKGVTENTDATADFVAQPVYLPENQTKVAFFYDRSSPIGAFAVKSGSLESGFAPF ---3333------3333---------------------------------3333------ SNKACPNSVILTPGPQFDPAYDQLRPQRLTEIWGNGNEETSEVFPLKTKQDYSFCLFSPF -----------------1111--------------------------------------- VYYKCDLEVTLSPHTSGAHGLLVRWCPTGTPTKPTTQVLHEVSSLSEGRTPQVYSAGPGT --------------------------2222--------22223333----------1111 SNQISFVVPYNSPLSVLPAVWYNGHKRFDNTGDLGIAPNSDFGTLFFAGTKPDIKFTVYL --------------------------3333--------------------1111------ RYKNMRVFCPRPTVFFPWPTSGDKIDMT ---------------------------- >Genome polyprotein; SWP:P12296; PDB:2MEV2; ENLSDRVSQDTAGNTVTNTQSTVGRLVGYGTVHDGEHPASCADTASEKILAVERYYTFKV --------------------------2222-------1111-------3333-------- NDWTSTQKPFEYIRIPLPHVLSGEDGGVFGATLRRHYLVKTGWRVQVQCNASQFHAGSLL ---33332222------3333-1111------1111---------------1111----- VFMAPEYPTLDVFAMDNRWSKDNLPNGTRTQTNRKGPFAMDHQNFWQWTLYPHQFLNLRT ---------1111-------2222-----3333-----1111-3333---------3333 NTTVDLEVPYVNIAPTSSWTQHASWTLVIAVVAPLTYSTGASTSLDITASIQPVRPVFNG -----------------3333----------------2222------------------- LRHEVLSRQ --------- >Genome polyprotein; SWP:P12296; PDB:2MEV3; SPIPVTIREHAGTWYSTLPDSTVPIYGKTPVAPANYMVGEYKDFLEIAQIPTFIGNKVPN -------1111---1111---------------1111-----33331111---------- AVPYIEASNTAVKTQPLAVYQVTLSCSCLANTFLAALSRNFAQYRGSLVYTFVFTGTAMM -----------1111----------3333-------3333----------------3333 KGKFLIAYTPPGAGKPTSRDQAMQATYAIWDLGLNSSYSFTVPFISPTHFRMVGTDQANI -----------------33331111---------------------------------!! 
TNVDGWVTVWQLTPLTYPPGCPTSAKILTMVSAGKDFSLKMPISPAPWSPQ !!---------------2222------------3333-------------- >Genome polyprotein; SWP:P12296; PDB:2MEV4; SEGNEGVIINNFYSNQYQNSIDLSANATGSDPPKTYGQFSNLLSGAVNAFSNMLPLLA -------------3333--------1111----------------------------- >MYOHEMERYTHRIN; SWP:P02247; PDB:2MHR; GWEIPEPYVWDESFRVFYEQLDEEHKKIFKGIFDCIRDNSAPNLATLVKVTTNHFTHEEA ----------3333---------------------------------------------- MMDAAKYSEVVPHKKMHKDFLEKIGGLSAPVDAKNVDYCKEWLVNHIKGTDFKYKGKL --1111-----------------1111------------------------1111--- >CD7 METALLOTHIONEIN-2; SWP:P02795; PDB:2MHU; MDPNCSCAAGDSCTCAGSCKCKECKCTSCK -------------------------3333- >MYOGLOBIN; SWP:P02144; PDB:2MM1; GLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKFDRFKHLKSEDEMKASED --------------3333-----------------33333333--3333----------- LKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISEAIIQVLQSKHP ---------------1111--3333--------------3333----------------- GDFGADAQGAMNKALELFRKDMASNYKELGFQG ----3333------------------1111--- >METHANE MONOOXYGENASE REG; SWP:P27356; PDB:2MOBA; SNAVVLVLMKSDEIDAIIEDIVLKGGKAKNPSIVVEDKAGFWWIKADGAIEIDAAEAGEL -------------3333------1111--------------------------3333--- LGKPFSVYDLLINVSSTVGRAYTLGTKFTITSEL ---------------------------------- >CD7 METALLOTHIONEIN-2A; SWP:P18055; PDB:2MRB; MDPNCSCAAAGDSCTCANSCTCKACKCTSCK --------------------------1111- >MANNOSE-BINDING PROTEIN-A; SWP:Q9Z294; PDB:2MSBA; KKFFVTNHERMPFSKVKALCSELRGTVAIPRNAEENKAIQEVAKTSAFLGITDEVTEGQF -----------3333-----1111-------------------------------2222- MYVTGGRLTYSNWKKDEPNDHGSGEDCVTIVDNGLWNDISCQASHTAVCEFP -1111--------2222---!!!!------1111-----1111--------- >MUSASHI1; SWP:Q61474; PDB:2MSSA; KIFVGGLSVNTTVEDVKHYFEQFGKVDDAMLMFDKTTNRHRGFGFVTFESEDIVEKVCEI -----------3333-----------------------------------3333-3333- HFHEINNKMVECKKA --------------- >MYOSIN; SWP:P13538; PDB:2MYSA; DAEMAAFGEAAPYLRSEKERIEAQNPFDASSVFVVHPKQSFVGTIQSEGGVTVTEGGETL -------3333--------3333------------------------------------- 
TVKEDQVFSMNPPYDIEDMAMMTHLHEPAVLYNLERYAAWMIYTYSGLFCVTVNPYWLPV -----------------1111-----3333------1111------------------11 YNPVVLAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGESGAGKTVNTRVIQYF 11--3333------------------------------------1111-3333------- ATIAASGEGTLEDQIISANPLLEAFGNATVRNDNSSRFGFIRIHFGATGKLASADIETYL ---------3333--------3333----------------------------------- LESRVTFQLPAERSYHIFYQIMSNPELIDMLLITTNPYDYHYVSEGEITVPSIDDQEELM --3333--1111------------33331111---11113333------1111------- ATDSAIDILGFSADETAIYLTGAVMHYGNLKFQQREEQAEPDGTEVADAAYLMGLNSAEL -----------3333-----------1111--------------3333--1111------ LKALCYPRVGVGNEAVTGETVSEVHNSVGALAAVYEMFLWMVIRINQQLDTKQPRQYFIG -------------------3333------------------------------------- VLDIAGFEIFDFNSFEQLCINFTNELQQFFNHHMFVLEQEEYEGIEWEFIDFGMDLAACI -------------3333----------------------------------3333----- ELIEPMGIFSILEEECMFPKATDTSFNLYDEHLGKSNNFQKPKPAAEAHFSLVHYAGTVD ---------------------3333---1111---1111--------------3333--- YNISGWLENDPLNETVIGLYQSSVTLALLFATYQTVSALFRENLNLMANLRSTHPHFVRC ----3333--------------------------3333---------------------- IIPNETTPGAMEHELVLHQLRCNGVLEGIRICRKGFPSRVLYADFKQRYRVLNASAMDSK ----------------------------------------3333----3333-------- KASEKLLGGGDVDHTQYAFGHTVFFAGLLGLLEEMRDDLAEIITATQARCRGFLMRVEYR -----3333------------------3333----------------------------- AMVERRESIFCIQYNVRSFMNVHWPWMLFFIPLLK -----------------------3333-------- >NAD-DEPENDENT FORMATE DEH; SWP:P33160; PDB:2NACA; AKVLCVLYDDPVDGYPKTYARDDLPKIDHYPGGQTLPTPKAIDFTPGQLLGSVSGELGLR ----------1111---------------2222------------------1111----- KYLESNGHTLVVTSDKDGPDSVFERELVDADVVISQPFWPAYLTPERIAKAKNLKLALTA ---1111----------1111--------------1111-----------1111------ GIGSDHVDLQSAIDRNVTVAEVTYCNSISVAEHVVMMILSLVRNYLPSHEWARKGGWNIA ---------------------2222--------------------------1111--333 DCVSHAYDLEAMHVGTVAAGRIGLAVLRRLAPFDVHLHYTDRHRLPESVEKELNLTWHAT 3-------2222-------3333------1111--------------------------3 
REDMYPVCDVVTLNCPLHPETEHMINDETLKLFKRGAYIVNTARGKLCDRDAVARALESG 3333333----------3333----333311112222------1111---------1111 RLAGYAGDVWFPQPAPKDHPWRTMPYNGMTPHISGTTLTAQARYAAGTREILECFFEGRP ---------------11113333---------1111------------------1111-- IRDEYLIVQGGALA -3333---%%%%-- >PERIPLASMIC NITRATE REDUC; SWP:P81186; PDB:2NAPA; RPEKWVKGVCRYCGTGCGVLVGVKDGKAVAIQGNPNNHNAGLLCLKGSLLIPVLNSKERV ------------3333--------------------1111---3333--3333------- TQPLVRRHKGGKLEPVSWDEALDLMASRFRSSIDMYGPNSVAWYGSGQCLTEESYVANKI -------2222-------------------------1111-----1111----------- FKGGFGTNNVDGNPRLCMASAVGGYVTSFGKDEPMGTYADIDQATCFFIIGSNTSEAHPV ------------3333--------------------3333------------3333---- LFRRIARRKQVEPGVKIIVADPRRTNTSRIADMHVAFRPGTDLAFMHSMAWVIINEELDN -----------3333---------3333---------2222------------1111--3 PRFWQRYVNFMDAEGKPSDFEGYKAFLENYRPEKVAEICRVPVEQIYGAARAFAESAATM 3331111----1111---------------3333-------------------------- SLWCMGINQRVQGVFANNLIHNLHLITGQICRPGATSFSLTGQPNACGGVRDGGALSHLL ---------1111------------------2222--------------------1111- PAGRAIPNAKHRAEMEKLWGLPEGRIAPEPGYHTVALFEALGRGDVKCMIICETNPAHTL %%%%3333-------------2222---------------1111----------1111-- PNLNKVHKAMSHPESFIVCIEAFPDAVTLEYADLVLPPAFWCERDGVYGCGERRYSLTEK -----------1111-------11113333---------!!!!------1111------- AVDPPGQCRPTVNTLVEFARRAGVDPQLVNFRNAEDVWNEWRMVSKGTTYDFWGMTRERL ----!!!!-3333------1111-3333----3333------1111-----3333----- RKESGLIWPCPSEDHPGTSLRYVRGQDPCVPADHPDRFFFYGKPDGRAVIWMRPAKGAAE -----------1111----2222---11111111-----1111----------------- EPDAEYPLYLTSMRVIDHWHTATMTGKVPELQKANPIAFVEINEEDAARTGIKHGDSVIV --3333--------3333!!!!-----3333-----------3333-------------- ETRRDAMELPARVSDVCRPGLIAVPFFDPKKLVNKLFLDATDPVSREPEYKICAARVRKA -1111------------2222------1111-3333------------3333-------- >KINESIN MOTOR NCD; SWP:P20480; PDB:2NCDA; LRQRTEELLRCNEQQAAELETCKEQLFQSNMERKELHNTVMDLRGNIRVFCRIRPPLESE ----3333------------------------------------------------1111 
ENRMCCTWTYHDESTVELQSIDAQAKSKMGQQIFSFDQVFHPLSSQSDIFEMVSPLIQSA ---------------------3333---------------1111---------------1 LDGYNICIFAYGQTGSGKTYTMDGVPESVGVIPRTVDLLFDSIRGYRNLGWEYEIKATFL 111---------2222----------------------------3333------------ EIYNEVLYDLLSNEQKDMEIRMAKNNKNDIYVSNITEETVLDPNHLRHLMHTAKMNRATA -------------------------1111------------------------------- STAGNERSSRSHAVTKLELIGRHAEKQEISVGSINLVDLAGSESPNINRSLSELTNVILA ---3333----------------------------------------------------- LLQKQDHIPYRNSKLTHLLMPSLGGNSKTLMFINVSPFQDCFQESVKSLRFAASVNSC 1111--------------3333---------------1111---------1111---- >GTP BINDING PROTEIN (G25K; SWP:P25763; PDB:2NGRA; MQTIKCVVVGDGAVGKTCLLISYTTNKFPSEYVPTVFDNYAVTVMIGGEPYTLGLFDTAG ----------2222-------------------------------iiii----------- QEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEITHHCPKTPFLLVGTQIDLR 3333-------2222-------1111------------------1111--------1111 DDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSALTQKGLKNVFDEAILAALEPP --------------------------1111------------------------1111-- EPKKSRRCVLL 2222------- >Thyroid hormone receptor ; SWP:P10828; PDB:2NLLB; DELCVVCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTRNQCQE --------------iiii-------------11113333--------------1111--- CRFKKCIYVGMATDLVLDDSKRLAKRKLIEENREKRRREELEK -----------1111--3333-----------------3333- >SHIKIMATE DEHYDROGENASE; SWP:Q9X5C9; PDB:2NLOA; DSILLGLIGQGLDLSRTPAMHEAEGLAQGRATVYRRIDTLGSRASGQDLKTLLDAALYLG -----------1111----------1111--------------2222------------- FNGLNITHPYKQAVLPLLDEVSEQATQLGAVNTVVIDATGHTTGHNTDVSGFGRGMEEGL -----------3333---------------------1111-------------------1 PNAKLDSVVQVGAGGVGNAVAYALVTHGVQKLQVADLDTSRAQALADVINNAVGREAVVG 111----------!!!!--------------------3333------------------- VDARGIEDVIAAADGVVNATPMGMPAHPGTAFDVSCLTKDHWVGDVVYMPIETELLKAAR --2222---1111----------1111-----3333-1111------------------1 ALGCETLDGTRMAIHQAVDAFRLFTGLEPDVSRMRETFLSL 111----3333--------------------------1111 >ENDOGLUCANASE; SWP:Q54331; PDB:2NLRA; 
DTTICEPFGTTTIQGRYVVQNNRWGSTAPQCVTATDTGFRVTQADGSAPTNGAPKSYPSV -----1111---%%%%--------------------------------1111-------- FNGCHYTNCSPGTDLPVRLDTVSAAPSSISYGFVDGAVYNASYDIWLDPTARTDGVNQTE ----iiii-2222-----1111-------------------------------------- IMIWFNRVGPIQPIGSPVGTASVGGRTWEVWSGGNGSNDVLSFVAPSAISGWSFDVMDFV ----------------------iiii----------------------------3333-- RATVARGLAENDWYLTSVQAGFEPWQNGAGLAVNSFSSTVET ---1111--1111--------------2222----------- >XISI PROTEIN-LIKE; SWP:Q3M6F6; PDB:2NLVA; GDKLVKYQELVKKLLTNYASDDVSDQDVEVQLILDTERNHYQWNVGWQGLNRIYRCVIHF ------------------1111--1111-------------------!!!!--------- DIKDGKIWLQQNLTDRNPAEELVGVPREDIVLGLQAPYKRQYTDYGVA --iiii-------------------3333--111133331111----- >Eukaryotic translation in; SWP:Q86UM1; PDB:2NLWA; GDVLKDRPQEADGIDSVIVVDNVPQVGPDRLEKLKNVIHKIFSKFGKITNDFYPEEDGKT --------------------------22223333--------1111---------iiii- KGYIFLEYASPAHAVDAVKNADGYKLDKQHTFRVNLFTDFDKYMT --------------------------------------------- >Divergent polysaccharide ; SWP:Q9KCS7; PDB:2NLYA; MKRAAIIIDDFGGDVKGVDDFLTGEIPVTVAVMPFLEHSTKQAEIAQAAGLEVIVHMPLE --------------22223333-------------1111-------1111---------- PKPSGITSNLSVGEVKSRVRKAFDDIPYAVGLNNHMGSKIVENEKIMRAILEVVKEKNAF ------1111----------------------------1111------------1111-- IIDSGTSPHSLIPQLAEELEVPYATRSIFLDNTHSSRKEVIKNMRKLAKKAKQGSEPIGI --------------------------------------------------1111------ GHVGVRGDETYAGIRSMLDEFQAESIQLVPVSQLLP -------1111--1111------------3333--- >CEPHALOSPORIN ACYLASE; SWP:Q9KEI5; PDB:2NLZA; VMFDPQSYPYPSRRNVVYAKNGMVATSQPLAAQAGLDILKAGGNAIDAAIATATALTVLE -----------------------------------------------------------1 PTSNGIGSDAFALVWTKGKLHGLNGSGRAPMSLTMEAVKAKGYEQELPPYGVIPVTVPGA 111------------%%%%----------1111-----1111---------1111----- PGAWAELAKMYGNLPLAASLAPAIRYAEEGYPVTPTLAKYWKAAYDRVKTEWTDDVYQPW --------------3333--------------------------3333-----3333--- FDTFAPKGRAPRVGEVWRSQGHADTLRSIAESNGESFYRGELADQIHAFFDKHGGYLTKE 
----1111---2222------------------------------------------333 DLACYRPEWVEPISIDYRGYRVWEIPPNGQGLVALEALNIVKGFEFYHKDTVDTYHKQIE 31111----------------------------------3333----------------- AMKLAFVDGMKYVTEPSDMSVSVEQLLSDEYATERRKEIGEQALTPEPGTPTVYLATADG --------------3333---3333---------3333--------------------11 DGNMVSFIQSNYMGFGSGVVVPGTGIAMQNRGHNFSLDPNHDNALKPGKRTYHTIIPGFL 11----------!!!!----2222-------3333--1111----2222----------- TKNDQPIGPFGVMGGFMQPQGHMQVMMNTIDFGLNPQAALDAPRWQWTNGKQVQVEPTFP -%%%%--------!!!!--------------------------------------1111- VDIAQALVRRGHKIQVVLDEGAFGRGQIIWRDPTTGVLAGGTEPRTDGQVAAWEGH -------1111-------2222--------------------1111---------- >Probable 3-oxacyl-(Acyl-c; SWP:Q9S274; PDB:2NM0A; MSRSVLVTGGNRGIGLAIARAFADAGDKVAITYRSGEPPEGFLAVKCDITDTEQVEQAYK ----------------------1111---------------------1111--------- EIEETHGPVEVLIANAGVTKDQLMSEEDFTSVVETNLTGTFRVVKRANRAMLRAKKGRVV -------------------3333-3333-------------------------------- LISSVVGLLGSAGQANYAASKAGLVGFARSLARELGSRNITFNVVAPGFVDTQRANIVSQ ------------------------------------------------------------ VPLGRYARPEEIAATVRFLASDDASYITGAVIPVDGGLGMG 3333---3333---------3333----------iiii--- >NUMB PROTEIN; SWP:P16554; PDB:2NMBA; SKPHQWQADEEAVRSATCSFSVKYLGCVEVFESRGMQVCEEALKVLRQSRRRPVRGLLHV ----3333---3333--------------------------------------------- SGDGLRVVDDETKGLIVDQTIEKVSFCAPDRNHERGFSYICRDGTTRRWMCHGFLACKDS -------------------3333------------------------------------- GERLSHAVGCAFAVCLERKQRRTRAAA ---3333111133331111-------- >ENHANCER OF RUDIMENTARY H; SWP:P84090; PDB:2NMLA; SHTILLVQPTKRPEGRTYADYESVNECMEGVCKMYEEHLKRMNPNSPSITYDISQLFDFI -----------3333---------------------------1111-----3333----3 DDLADLSCLVYRADTQTYQPYNKDWIKEKIYVLLRRQAQQ 333---------1111------------------------ >14 KDA PHOSPHOHISTIDINE P; SWP:Q5T5S3; PDB:2NMMA; AVADLALIPDVDIDSDGVFKYVLIRVHSAPESKEIVRGYKWAEYHADIYDKVSGDQKQGC ---3333-------------------------------1111-3333--------1111- DCECLGGGRISHQSQDKKIHVYGYSAYGPAQHAISTEKIKAKYPDYEVTWA 
-------------1111-------------------------3333----- >CYSTATHIONINE GAMMA-LYASE; SWP:P32929; PDB:2NMPA; GFLPHFQHFATQAIHVGQDPEQWTSRAVVPPISLSTTFKQGNPTRNCLEKAVAALDGAKY -----2222-----222233333333---------------------------1111--- CLAFASGLAATVTITHLLKAGDQIICMDDVYGGTNRYFRQVASEFGLKISFVDCSKIKLL --------------33332222------------------3333--------3333---- EAAITPETKLVWIETPTNPTQKVIDIEGCAHIVHKHGDIILVVDNTFMSPYFQRPLALGA ----1111---------------------------------------------3333--- DISMYSATKYMNGHSDVVMGLVSVNCESLHNRLRFLQNSLGAVPSPIDCYLCNRGLKTLH -----33333333----------------------------------------------- VRMEKHFKNGMAVAQFLESNPWVEKVIYPGLPSHPQHELVKRQCTGCTGMVTFYIKGTLQ -------------------1111----1111--11113333------------------- HAEIFLKNLKLFTLAESLGGFESLAELPAIMTHASVLKNDRDVLGISDTLIRLSVGLEDE -----1111-----------------3333--1111----------1111---------- EDLLEDLDQALKAAHP -----------1111- >CMRF35-LIKE-MOLECULE 1; SWP:Q8TDQ1; PDB:2NMSA; GIPQITGPTTVNGLERGSLTVQCVYRSGWETYLKWWCRGAIWRDCKILVKTSGSEQEVKR -------------2222-----------1111--------3333---------------! 
DRVSIKDNQKNRTFTVTMEDLMKTDADTYWCGIEKTGNDLGVTVQVTIDPAP !!!-----1111---------1111--------------------------- >HYPOTHETICAL PROTEIN YQGQ; SWP:P54494; PDB:2NN4A; LNTFYDVQQLLKTFGHIVYFGDRELEIEFMLDELKELYMNHMIEKEQWARAAAVLRKELE -----------1111----------------------1111------------------- QT -- >GALECTIN-3; SWP:Q6IBA7; PDB:2NN8A; PLIVPYNLPLPGGVVPRMLITILGTVKPNANRIALDFQRGNDVAFHFNPRFNENNRRVIV ---------2222-2222--------------------!!!!----------%%%%---- CNTKLDNNWGREERQSVFPFESGKPFKIQVLVEPDHFKVAVNDAHLLQYNHRVKKLNEIS ----iiii------------2222--------1111----iiii----------1111-- KLGISGDIDLTSASYTMI ------------------ >SULFUR COVALENTLY-BINDING; SWP:Q8RLX2; PDB:2NNCA; SASKLDDAIAAKFGSLPIQESTAIQIKAPEIAENGAFVPVTVATSIPGATNISIFTPANF --------------------------------2222---------1111------3333- SPMVASFDVLPRMKPEVSLRMRMAKTENLVVVVQAGGKLYRAVREVKVTI -------------------------------------------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q9I3B8; PDB:2NNNA; YRLDDQIGFILRQANQRYAALFANGIGNGLTPTQWAALVRLGETGPCPQNQLGRLTADAA -1111-----------------------------------------------1111---- TIKGVVERLDKRGLIQRSADPDDGRRLLVSLSPAGRAELEAGLAAAREINRQALAPLSLQ ---------1111------1111------------------------------------- EQETLRGLLARLI --------1111- ------------------------------- >REGULATORY PROTEIN E2; SWP:P03120; PDB:2NNUA; SMETLCQRLNVCQDKILTHYENDSTDLRDHIDYWKHMRLECAIYYKAREMGFKHINHQVV -------------------------3333------------------1111---%%%%-- PTLAVSKNKALQAIELQLTLETIYNSQYSNEKWTLQDVSLEVYLTAPTGCIKKHGYTVEV -------------------------1111----3333----------------------- QFDGDICNTMHYTNWTHIYICEEASVTVVEGQVDYYGLYYVHEGIRTYFVQFKDDAEKYS -iiii----------------!!!!--------1111----iiii-----3333------ KNKVWEVHAGGQVILCPTSVF --------------------- >SUPEROXIDE DISMUTASE [CU-; SWP:P00441; PDB:2NNXA; ATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFRVQEFGDNTAGCTSA -----------------------1111----------------------------3333- GPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVH ----1111----1111---1111------1111--------------11112222----- 
EKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ ----%%%%--3333------------------- ------------------------------------------------------------ --------------------------------------- >NICKEL-BINDING PERIPLASMI; SWP:P33590; PDB:2NOOA; DEITTAWPVNVGPLNPHLATPNQMFAQSMVYEPLVKYQADGSVIPWLAKSWTHSEDGKTW --------------1111-1111-3333---------1111------------1111--- TFTLRDDVKFSNGEPFDAEAAAENFRAVLDNRQAHAALELANQIVDVKALSKTELQITLK ---------1111-----------------333333333333--------1111------ SAYYPFLQELALPAPFRFIAPSQFKNHETMNGIKAPIGTGPWILQESKLNQYDVFVRNEN --1111--1111-------3333-%%%%1111---------------2222------111 YWGEKPAIKKITFNVIPDPTTRAVAFETGDIDLLYGNEGLLPLDTFARFSQNPAYHTQLS 1-----------------------------------3333-----------3333----- QPIETVMLALNTAKAPTNELAVREALNYAVNKKSLIDNALYGTQQVADTLFAPSVPANLG ----------1111-3333------------------1111----------1111----- LKPSQYDPQKAKALLEKAGWTLPAGKDIREKNGQPLRIELSFIGTDALSKSMAEIIQADM ---------------1111---2222----%%%%--------1111------------33 RQIGADVSLIGEEESSIARQRDGRFGMIFHRTAGAPADPHAFLSSMRVPSHADFQAQQGL 33----------3333---1111----------!!!!-------1111------1111-1 ADKPLIDKEIGEVLATHDETQRQALYRDILTRLHDEAVYLPISYISMMVVSKPELGNIPY 111------------------------------------------------3333----- APIATEIPFEQIKP --3333-3333--- >PHOSPHOLIPASE A2; SWP:P00609; PDB:2NOTA; NLVQFSYLIQCANHGRRPTRHYMDYGCYCGWGGSGTPVDELDRCCKIHDDCYSDAEKKGC ------------iiii-3333--------------------------------------- SPKMSAYDYYCGENGPYCRNIKKKCLRFVCDCDVEAAFCFAKAPYNNANWNIDTKKRCQ 3333-------1111------------------------------3333---3333--- >DNA TOPOISOMERASE 4 SUBUN; SWP:P72525; PDB:2NOVA; ALPDIRDGLKPVQRRILYSMNKDSNTFDKSYRKSAKSVGNIMGNFHPHGDSSIYDAMVRM ---------3333-------1111-3333---3333------------------------ SQNWKNREILVEMHGNNGSMDGDPPAAMRYTEARLSEIAGYLLQDIEKKTVPFAWNFDDT --------------------------3333-----3333---2222-------------- EKEPTVLPAAFPNLLVNGSTGISGYATDIPPHNLAEVIDAAVYMIDHPTAKIDKLMEFLP -----------3333-----------------------------------3333------ GPDFPTGAIIQGRDEIKKAYETGKGRVVVRSKTEIEKLKGGKEQIVITEIPYEINKANLV 
-----------3333-----------------------iiii--------2222------ KKIDDVRVNNKVAEVRDELRIAIDANTELVLNYLFKYTDLQINYNFNMVAIDNFTPRQVG ---------------------------------------------------iiii----- IVPILSSYIAHRREVILARSRFDKEKAEKRLHIVEGLIRVISILDEVIALIRASENKADA -------------------------------------------3333--3333--33333 KENLKVYDFTEEQAEAIVTLQLYRLTNTDVVVLQEEEAELREKIAMLAAIIGDERTMYNL 333-------------11111111-33333333-------------3333---------- MKKELREVKKKFATPRLSSL -------------------- >TRYPTOPHAN 2,3-DIOXYGENAS; SWP:Q1LK00; PDB:2NOXA; RDMSYGDYLGLDQILSAQHPLSPDHNEMLFIVQHQTTELWMKLMLHELRAARDGVKSDQL ---------3333----------1111---------------------------1111-- QPAFKMLARVSRIMDQLVQAWNVLATMTPPEYSAMRPYLGASSGFQSYQYREIEFILGNK -----------------------1111-------3333!!!!3333-------------- NAAMLRPHAHRPEHLELVETALHTPSMYDEAIRLMARRGFQIDPEVVERDWTQPTQYNAS 33333333---------------------------1111---1111---1111------- VEAAWLEVYRNPSAHWELYELGEKFVDLEDAFRQWRFRHVTTVERVIGFKRGTGGTEGVS ----------3333--------------------------------!!!!-3333----- YLRRMLDVVLFPELWKLRTDL ---3333---3333-3333-- >PUTATIVE TETR-FAMILY REGU; SWP:Q9RCV4; PDB:2NP3A; ILTAARVCFYGTKENLFLQALELPGKIEEAITAAAQGGLDGIGERVVRAHLSVWDDVSSR -----------3333---3333----------1111--2222-------------33333 PALTVRSAARLRETATGILARALGGVITGEDALRTSVATQLVGLARYVAHLEPLASADTD 333------------------------------------------------------333 TVARHYGRAVQAIVTDR 3-----------1111- >TRANSCRIPTIONAL REGULATOR; SWP:Q0S914; PDB:2NP5A; TSPERLAAALFDVAAESGLEGASVREVAKRAGVSIGAVQHHFSTKDEFAFALRTLVDKLL -----------------3333----------------3333--3333------------- ARLSEVERGGDPARALFAASQLLPLDEARSREAHVAAFAVRAATSPSLAEIRRKTLFTIR --1111-------------1111----------------3333----------------- TGLSAVLIGIGTPEAETRAALLLATVDGLALDAIGSPALYPPEYLEHALDIQIGILQGAD -----------------------------------3333-3333-----------2222- VVP --- >SELENOPROTEIN W; SWP:P63300; PDB:2NPBA; MALAVRVVYSGACGYKPKYLQLKEKLEHEFPGCLDICGEGTPQVTGFFEVTVAGKLVHSK ------------------------3333-2222------------------%%%%----1 KRGDGYVDTESKFRKLVTAIKAALAQCQ 
111-----3333---------------- >PROTEIN CLP1; SWP:Q08685; PDB:2NPIA; TGDNEWHKLVIPKGSDWQIDLKAEGKLIVKVNSGIVEIFGTELAVDDEYTFQNWKFPIYA -----------2222------2222------------iiii------------------- VEETELLWKCPDLTTNTITVKPNHTKYIYNLHFLEKIRSNFEGPRVVIVGGSQTGKTSLS ---------1111----------------------------------------------- RTLCSYALKFNAYQPLYINLDPQQPIFTVPGCISATPISDILDAQLPTWGQSLTSGATLL --------------------3333------------------1111-------------- HNKQPVKNFGLERINENKDLYLECISQLGQVVGQRLHLDPQVRRSGCIVDTPSISQLDEN ------------3333------------------------------------1111-333 LAELHHIIEKLNVNILVLCSETDPLWEKVKKTFGPELGNNNIFFIPKLDGVSAVDDVYKR 3-------1111-------------------------3333------2222--------- SLQRTSIREYFYGSLDTALSPYAIGVDYEDLTIWKPSNVFDNEVGRVELFPVTITPSNLQ -------------1111---------3333------------------------111122 HAIIAITFAERRADQATVIKSPILGFALITEVNEKRRKLRVLLPVPGRLPSKAILTSYRY 22-------1111---3333---------------------------------------- LE -- >Coxsackievirus and adenov; SWP:Q5R764; PDB:2NPLX; GSSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPTSWLAEMTSSVISV ---------------------------------------1111-------3333------ KNASSEYSGTYSCTVRNRVGSDQCLLRLNVVPPSNK ------------------------------------ >14-3-3 DOMAIN CONTAINING ; SWP:Q5CUW0; PDB:2NPMA; MSDSVNARESNVYMAKLAEQAERYDEMAKYMKDVVEARQEELTVEERNLLSVAYKNAVGS 3333-------------------------------------------------------- RRSSWRIISSVEQKEHSRNAEDASKMCGKYRSKVEAELTDICNDILTMLDKHLIPTATSP ---------------1111----------------------------------1111--- DSKVFYFKMKGDYHRYISEFSTGDSKQSSAEDALKAYKDATVVAKDLEPTHPIRLGLALN -------------------------------------------11111111--------- FSVFHYEILNEPRAAIDMAKEAFEMAIEQLDKLSEDCYKDSTLIMQLLRDNLTLWTA ----------------------------3333-3333----------------1111 >PUTATIVE COBALAMIN SYNTHE; SWP:NA; PDB:2NPNA; ARTIYVIGIGTGSPEFLTLQAISGLRHAQAIVALDQKSDLLALRQKIVDTHAPGTPIYAV ------------3333-----------------------------------2222----- TDEEEVRRWHAERAHLLASTIRERTPDDGAVAFLVWGDPSLYDSTLRIIEHRNLEDLHAD --3333-------------------1111--------3333------------------- 
VKVIPGITAVQVLTAEHGILINRIGEAIHITTGRNLPETSAKDRRNCVVLDGKTAWQDVA ----------------------2222-----333311113333----------3333--- TEHTYWWGAFLGTEQQVLRKGYVHEIGAQVAELKQQLRTEHGWIDTYLLRELD 1111-----2222--------3333---------------------------- >ACETYLTRANSFERASE; SWP:Q0P9D1; PDB:2NPOA; RTEKIYIYGGHGLVCEDVAKNMGYKECIFLDDFKGMKFESTLPKYDFFIAIGNNEIRKKI -------------------------------11111111--------------------- YQKISENGFKIVNLIHKSALISPSAIVEENAGILIMPYVVINAKAKIEKGVILNTSSVIE ----1111-------------1111----------------2222--------2222--2 HECVIGEFSHVSVGAKCAGNVKIGKNCFLGINSCVLPNLSLADDSILGGGATLVKNQDEK 222--------2222--------------2222--------2222--2222--------- GVFVGVPAKRMEG ------------- >MEROZOITE SURFACE PROTEIN; SWP:Q26183; PDB:2NPRA; TMSSEHTCIDTNVPDNAACYRYLDGTEEWRCLLTFKEEGGKCVPASNVTCKDNNGGCAPE ---------------------1111------1111-------------3333-------- AECKMTDSNKIVCKCTKEGSEPLFEGVFCS -----1111--------------------- ------------------------------------------------------------ --- >Syntaxin 13; SWP:O70319; PDB:2NPSB; GSMRETAIQQLEADILDVNQIFKDLAMMIHDQGDLIDSIEANVESSEVHVERASDQLQRA -1111------------------------------------------------------- AYYQKKSR -------- >Vesicle transport through; SWP:Q9JI51; PDB:2NPSC; GSMRAHLLDNTERLERSSRRLEAGYQIAVETEQIGQEMLENLSHDRERIQRARERLRETD ------------------------------------------------------------ ANLGKSSRILTGMLRRIIQ --------------3333- ------------------------------------------------------------ --- >MITOGEN-ACTIVATED PROTEIN; SWP:Q13163; PDB:2NPTA; SMALGPFPAMQVLVIRIKIPNSGAVDWTVHSQLLFRDVLDVIGQVLPEATTTAFEYEDED ------------------2222-----------3333--------1111--------111 GDRITVRSDEEMKAMLSYYYSTVMEQQVNGQLIEPLQIFPRA 1-------------------------1111------------ >Mitogen-activated protein; SWP:Q9Y2U5; PDB:2NPTB; NDVRVKFEHRGEKRILQFPRPVKLEDLRSKAKIAFGQSMDLHYTNNELVIPLTTQDDLDK --------iiii----------3333------------------!!!!------------ AVELLDRSIHMKSLKILLVING ----3333-------------- >Uncharacterized ABC trans; SWP:Q57399; PDB:2NQ2C; 
NKALSVENLGFYYQAENFLFQQLNFDLNKGDILAVLGQNGCGKSTLLDLLLGIHRPIQGK ---------------------------2222----------------------------- IEVYQSIGFVPQFFSSPFAYSVLDIVLMGRSTHINTFAKPKSHDYQVAMQALDYLNLTHL -----------------------------1111-1111--3333--------11111111 AKREFTSLSGGQRQLILIARAIASECKLILLDEPTSALDLANQDIVLSLLIDLAQSQNMT ---1111-------------------------1111------------------------ VVFTTHQPNQVVAIANKTLLLNKQNFKFGETRNILTSENLTALFHLPMFEQQAQYKESFF ------3333-------------------3333---------------------iiii-- THFVPLYKTLL ------3333- >ITCHY HOMOLOG E3 UBIQUITI; SWP:Q96J02; PDB:2NQ3A; SLTMKSQLQITVISAKLKENKWFGPSPYVEVTVDGQSKKTEKCNNTNSPKWKQPLTVIVT --------------------------------iiii-----------------------1 PVSKLHFRVWSHQTLKSDVLLGTAALDIYETLKSNNMKLEEVVVTLQLGGDKEPTETIGD 111-------------------------------%%%%--------------3333---- LSICLDGLQLE ----------- >5-methyltetrahydropteroyl; SWP:Q8CWX6; PDB:2NQ5A; LTKVSSLGYPRLGENREWKKLIEAYWAGKVSKNDLFAGAKELRLDFLKKQLNAGLDLIPV --------------------------------------------------1111------ GDFSLYDHILDLSVQFNIIPKRFAKEPIDIDLYFAIARGNKENVASSMKKWFNTNYHYIV -------------------3333----------------1111-------!!!!------ PEWSKQRPKLNNNRLLDLYLEAREVVGDKAKPVITGPITYVALSTGVEDFTAAVKSLLPL -------------------------!!!!-----------1111---------------- YKQVFTELVKAGASYIQVDEPIFVTDEGKDYLQAAKAVYAYFAKEVPDAKFIFQTYFEGL ------------------------3333-----------------1111----------- IDSQVLSQLPVDAFGLDFVYGLEENLEAIKTGAFKGKEIFAGVIDGRNIWSSDFVKTSAL -33331111----------------------1111------------------------- LETIEEQSAALTIQPSCSLLHVPVTTKNETDLDPVLRNGLAFADEKLTEVKRLAEHLDGR ----1111----------1111---1111---33331111---------------1111- EDPAYDLHIAHFDALQAADFRNVKLEDLSRVATKRPSDFAKRRDIQQEKLHLPLLPTTTI -3333------------1111-----3333------------------------------ GSFPQDAEYKQFIQAEIERWIRIQEDLDLDVLVHGEFERVDMVEFFGQKLAGFTTTKFGW -----------------------------------1111---33331111---------- VQSYGSRAVKPPIIYGDVQHLEPITVEETVYAQSLTDRPVKGMLTGPITITNWSFERTDI ---!!!!-------------------------1111--------------3333------ 
PRDQLFNQIGLAIKDEIKLLENAGIAIIQVDEAALREGLPLRKSKQKAYLDDAVHAFHIA ---------------------------------3333----3333--------------- TSSVKDETQIHTHMCYSKFDEIIDAIRALDADVISILGIGLGVYDIHSPRVPTKEEVVAN ----3333----------1111----3333--------------1111----3333---- IERPLRQLSPTQFWVNPDCGLKTRQEPETIAALKVLVAATKEVRQK --------------------1111---------------------- >CALPAIN 8; SWP:A2CEN9; PDB:2NQAA; NALKYLGQDFKTLRQQCLDSGVLFKDPEFPAPSALGYTQGIIWKRPTELPSPQFIVGGAT ---2222------------------1111--3333---------3333-----------3 RTDIQGGLGDCWLLAAIASLTLNEELLYRVVPRDQDFQENYAGIFHFQFWQYGEWVEVVI 333-----------------------------------------------%%%%------ DDRLPTKNGQLLFLHSEQGNEFWSALLEKAYAKLNGYEALAGGSTVEGFEDFTGGISEFY ------iiii-------1111-----------11113333-----2222-2222------ DLKKPPANLYQIIRKALAGSLLGCSIDVYSAAEAEAITSQKLVKSHAYSVTGVEEVNFQG 3333-------------------------3333----1111----------------iii HPEKLIRLRNPWGEEWSGAWSDDAPEWNHIDPRRKEELDKKVEDGEFWMSLSDFVRQFSR i--------3333----2222--3333--------------------------------- LEICN ----- >ISOMERASE/LACTONIZING ENZ; SWP:Q8UJL8; PDB:2NQLA; MNSPIATVEVFTLTQPRKVPYLGALREGEVVNPNGYIVRKGNRTVYPTFDRSVLVRMTTE --------------------3333-------2222-----------------------11 AGTVGWGETYGIVAPGAVAALINDLLAGFVIGRDASDPSAVYDDLYDMMRVRGYTGGFYV 11----------------------3333-2222---------------3333-------- DALAALDIALWDIAGQEAGKSIRDLLGGGVDSFPAYVSGLPERTLKARGELAKYWQDRGF -----------------------1111----------------------------1111- NAFKFATPVADDGPAAEIANLRQVLGPQAKIAADMHWNQTPERALELIAEMQPFDPWFAE -----33331111------------1111---------------------3333------ APVWTEDIAGLEKVSKNTDVPIAVGEEWRTHWDMRARIERCRIAIVQPEMGHKGITNFIR ---3333------1111-------1111--------------------3333-------- IGALAAEHGIDVIPHATVGAGIFLAASLQASSTLSMLKGHEFQHSIFEPNRRLLDGDMDC ---------------------------------1111-----333333331111------ REGRYHLPSGPGLGVRPSEAALGLIERI iiii--------------3333------ >GAMMA-GLUTAMYLTRANSPEPTID; SWP:O25743; PDB:2NQOA; YPPIKNTKVGLALSSHPLASEIGQKVLEEGGNAIDAAVAIGFALAVVHPAAGNIGGGGFA 
--------------------------1111-----------------3333--------- VIHLANGENVALDFREKAPLKATKNMFLDKQGNVVPKLSEDGYLAAGVPGTVAGMEAMLK ---1111-----------1111--1111------2222---3333--------------- KYGTKKLSQLIDPAIKLAENGYAISQRQAETLKEARERFLKYSSSKKYFFKKGHLDYQEG -----3333------------------------------------------%%%%--222 DLFVQKDLAKTLNQIKTLGAKGFYQGQVAELIEKDMKKNGGIITKEDLASYNVKWRKPVV 2-----------------------------------1111-------1111--------- GSYRGYKIISMSPPSSGGTHLIQILNVMENADLSALGYGASKNIHIAAEAMRQAYADRSV --iiii-------------------------3333-2222-------------------- YMGDADFVSVPVDKLINKAYAKKIFDTIQPDTVTPSSQIKPGMGQLH ---3333-----------------1111------3333-2222---- >Gamma-glutamyltranspeptid; SWP:O25743; PDB:2NQOB; TTHYSVADRWGNAVSVTYTINASYGSAASIDGAGFLLNNEMDDFSIKPGNPNLYGLVGGD -------1111----------2222----2222-----3333----2222-1111----- ANAIEANKRPLSSMSPTIVLKNNKVFLVVGSPGGSRIITTVLQVISNVIDYNMNISEAVS ----2222------------%%%%---------1111----------------------- APRFHMQWLPDELRIEKFGMPADVKDNLTKMGYQIVTKPVMGDVNAIQVLPKTKGSVFYG ---------------2222---------3333---------------------------- STDPRK ------ >N-ACETYL-GAMMA-GLUTAMYL-P; SWP:P63562; PDB:2NQTA; VANATKVAVAGASGYAGGEILRLLLGHPAYADGRLRIGALTAATSAGSTLGEHHPHLTPL ----------1111-------------3333-------------22223333-3333111 AHRVVEPTEAAVLGGHDAVFLALPHGHSAVLAQQLSPETLIIDCGADFRLTDAAVWERFY 1-------3333----------1111-333311111111------1111----------- GSSHAGSWPYGLPELPGARDQLRGTRRIAVPGYPTAALLALFPALAADLIEPAVTVVAVS -----------1111-3333------------------------1111------------ GTSGAGRAATTDLLGAEVIGSARAYNIAGVHRHTPEIAQGLRAVTDRDVSVSFTPVLIPA --1111---3333-3333--------iiii1111------3333---------------- SRGILATCTARTRSPLSQLRAAYEKAYHAEPFIYLMPEGQLPRTGAVIGSNAAHIAVAVD --------------3333--------1111------2222--33332222---------- EDAQTFVAIAAIDNLVKGTAGAAVQSMNLALGWPETDGLSVVGVAP 1111--------1111-----------------1111--------- >CBS DOMAIN PROTEIN; SWP:Q7MXD1; PDB:2NQWA; LPFKVLGDGSYLFEGKTSLSDVRHYLDLPENAFGELGDEVDTLSGLFLEIKQELPHVGDT -----1111----1111-----------1111--3333-----------------2222- 
AVYEPFRFQVTQDKRRIIEIKIFPFE -------------------------- >HYPOTHETICAL PROTEIN; SWP:Q882E2; PDB:2NR3A; NAEALQLNSTEVRILGCLIEKQATNPETYPLTLNALVIACNQKTSRDPVNLTQGQVGQSL ------------------------1111-------------------------------- RALEGRGLTRLVGSRADRWEHKVDKGLELVPAQVILTGLLLLRGPQTVSELLTRSNRHDF ---1111-----3333--------1111------------------------1111---- EDSEQVVHQLERLIARGLATLVPRQSGQREDRYHLIGDPEDLQD -------------1111--------------------3333--- >CONSERVED HYPOTHETICAL PR; SWP:Q8PVV4; PDB:2NR4A; DLSSFGIREGISEIIASTGFEHPNAAPIGIVMKGERPFVRLFKGSHTWENVLKEKCLASN 3333--------------1111-------------------2222--------------- VVYDPILFVRSTFSDLVPSEFEYVDGEFKFPVLKEAIAWVVFECINLRNTDQSLVADLVP ----------------1111------------1111------------------------ LNAGFNERNIKELPVPNRGFNAVLEATVHATRYQLTGEEKYLELIRHYESLASKCGGDAE -----3333--------------------------------------------------- KKAMKLIYEAL ----------- >HYPOTHETICAL PROTEIN SO26; SWP:Q8EDS4; PDB:2NR5A; TKKERIAIQRSAEEALGKLKAIRQLCGAEDSDQEVEIWTNRIKELEDWLWGESPIA ------------------------2222------------------------1111 >SECRETION ACTIVATOR PROTE; SWP:Q7MXB3_PORGI; PDB:2NR7A; AANVKLLLPYILKWEGGFVHDPADAGGATNKGVTIATWKRVGYDKDGDGDIDVEDLKLLT --3333---------------1111-----------2222----2222------3333-- DDDVLNRVLKPFYWDRWKADLIESQKVANILVDWVWGSGKYGIVIPQRILGVQADGIVGN --------------11111111----------------3333------------------ KTLQAVNSADPDELFESIFDARREFLEDITARSIKKYEDSIGRKATERELLRHTNKRFLR -----------------------------------------------------3333--- GWLNRLEDIRKL --------1111 >KINESIN-LIKE PROTEIN KIF9; SWP:Q9HAQ2; PDB:2NR8A; KKVHAFVRVKPTDDFAHEMIRYGDDKRSIDIHLKKDIRRGVVNNQQTDWSFKLDGVLHDA ----------------------3333-----------3333------------------- SQDLVYETVAKDVVSQALDGYNGTIMCYGQTGAGKTYTMMGATENYKHRGILPRALQQVF ----------------1111---------2222-----------3333------------ RMIEERPTHAITVRVSYLEIYNESLFDLLSTLPYVGPSVTPMTIVENPQGVFIKGLSVHL -33331111-----------%%%%--1111-----3333-------1111--2222---- TSQEEDAFSLLFEGETNRIIASHTMNKNSSRSHCIFTIYLEAHKYITSKINLVDLAGSEY 
--3333----------------1111--1111---------------------------- INKSLSFLEQAIIALGDIPFRQCKLTHALKDSLGGNCNMVLVTNIYGEAAQLEETLSSLR -3333-------------1111------3333---------------3333--------- FASRMKLV ---3333- >TRANSCRIPTIONAL ACTIVATOR; SWP:Q5HW73; PDB:2NRHA; LLLCDIGNSNANFLDKYFTLNIDQFLEFIFYINVNEHLKEHLKNQKNFINLEPYFLFDTI --------------------33331111------1111------3333--3333------ YQGLGIDRIAACYTIEDGVVVDAGSAITIDIIHLGGFILPGIANYKKIYSHISPFNTQVS -----------1111----------------------------------------1111- LDAFPQKTMDALSYGVFKGIYLLIKDAAQNKKLYFTGGDGQFLANYFDHAIYDKLLIFRG -------------------------1111-------1111------------1111---- MKKIIKENPNLL ------------ >HBL B PROTEIN; SWP:Q9REG6; PDB:2NRJA; LSEIEQTNNGDTALSANEARKETLQKAGLFAKSNAYSYLIKNPDVNFEGITINGYVDLPG -3333---1111----3333-------------------------------iiii----- RIVQDQKNARAHAVTWDTKVKKQLLDTLNGIVEYDTTFDNYYETVEAINTGDGETLKEGI ------------------------------------------------------------ TDLRGEIQQNQKYAQQLIEELTKLRDSIGHDVRAFGSNKELLQSILKNQGADVDADQKRL ----------------------------------------------1111---------- EEVLGSVNYYKQLEGFNVKGAILGLPIIGGIIVGVARDNLGKLEPLLAELRQTVDYKVTL ---33333333------------------------1111-1111----1111-------- NRVVGVAYSNINEHKALDDAINALTYSTQWHDLDSQYSGVLGHIENAAQKADQNKFKFLK ----------------------------------------------3333-11113333- PNLNAAKDSWKTLRTDAVTLKEGIKELKVET -----------------------1111---- >HYPOTHETICAL PROTEIN GRPB; SWP:NA; PDB:2NRKA; IVTEYQPAWVEQFEEEAQALKQILKENCLKVEHIGSTSVPNLAAKPIIDFLVIVEEIEKV -----3333--------------!!!!-------11112222-------------33333 DLLQWEFERIGYEYGEFGLSGRRYLRKGPIKRTHHVHIYQFDNTQEILRHLAFRNYLREN 333----1111----iiii--------------------11113333------------- PAIATTYGTLKKQLAQAHPDSIDKYKDAFIKKIEKEALKKYWE -------------111133333333-------------3333- >HYPOTHETICAL PROTEIN ORF-; SWP:Q9UXC9; PDB:2NRQA; NQAIISVFIHETEDYNKIVNTIESFFSPLISNSKKNVTTAQGHYGNKIIILEYRFDRKSG ---------3333--------------3333------------------------3333- EQFFKIILEKIETSELLILTTSHIDGSKLYLRFDKQYLIAEHRLVLKEGDDVIKCIISFN 
-----------------3333--------------------------------------- TSNIKEEIKKLVNSRI ---------------- >UVRABC SYSTEM PROTEIN C; SWP:Q9WYA3; PDB:2NRRA; MEALEELMKLLNMKDFPYRIEGIDISHLYTVASLVVFEDGFPKKGDYRRYKIDDYESIRT -------------------------------------iiii-3333-------------- VVKRRYSKHPLPNLLFVDGGIGQVNAAIEALKEIGKDCPVVGLATVVFENREIHLPHDHP -------------------------------1111------------%%%%----1111- VLRLLVQIRDETHRFAVSY ---------------1111 >HYPOTHETICAL PROTEIN; SWP:Q0S2K3; PDB:2NS0A; TVSDRELEECIRALLDARADSASICPSDVARAVAPDDWRPLEPVREAAGRLADAGEVEVT --------------11112222--3333-----11113333------------------- QKGAVVDPRSARGPIRIRWTRTD iiii--3333------------- >Nitrogen regulatory prote; SWP:P0AC55; PDB:2NS1B; SMKLVTVIIKPFKLEDVREALSSIGIQGLTVTEVKGFGRQKGHAELYRGAEFSVNFLPKV ---------3333--------1111---------------------!!!!---------- KIDVAIADDQLDEVIDIVSKAAYTGKIGDGKIFVAELQRVIRIRTGEADEAAL ------3333---------------2222-------------------3333- >SPINDLIN-1; SWP:Q9Y657; PDB:2NS2A; RRNIVGCRIQHGWKEGNGPVTQWKGTVLDQVPVNPSLYLIKYDGFDCVYGLELNKDERVS ---2222-------!!!!------------1111-------2222------33333333- ALEVLPDRVATSRISDAHLADTMIGKAVEHMFETEDGSKDEWRGMVLARAPVMNTWFYIT --------------------3333----------------------------1111---- YEKDPVLYMYQLLDDYKEGDLRIMPDPGEVVDSLVGKQVEYAKEDGSKRTGMVIHQVEAK 1111-----------1111--------------2222-----1111----------1111 PSVYFIKFDDDFHIYVYDLVK -------1111---------- >MOBILIZATION PROTEIN A; SWP:Q60198; PDB:2NS6A; AIYHLTAKTGSGGQSARAKADYIQREGKYARDMDEVLHAESGHMPEFVERPADYWDAADL ---------------------1111!!!!---3333--------3333-3333------- YERANGRLFKEVEFALPVELTLDQQKALASEFAQHLTGAERLPYTLAIHAGGGENPHCHL ----------------3333---------------------------------------- MISERINDGIERPAAQWFKRYNGKTPEKGGAQKTEALKPKAWLEQTREAWADHANRALER ------------3333-----33331111-----11113333----------------11 AGH 11- >HYPOTHETICAL PROTEIN APE2; SWP:Q9Y9R3_AERPE; PDB:2NS9A; AGVWGLKVRYEGSFEVSKTPEEVFEFLTDPKRFSRAFPGFKSVEVEDGSFTIELRLSLGP ----------------------------333333332222-----iiii----------- 
LRGDARVRASFEDLEKPSKATVKGSGRGAGSTLDFTLRFAVEPSGGGSRVSWVFEGNVGG -------------------------------------------iiii------------3 LAASMGGRVLDSLARRMINDVISGVKRELGEA 333----------------------------- >CASPASE RECRUITMENT DOMAI; SWP:Q9Y239; PDB:2NSNA; SHPHIQLLKSNRELLVTHIRNTQCLVDNLLKNDYFSAEDAEIVCACPTQPDKVRKILDLV ------------------------------------------------------------ QSKGEEVSEFFLYLLQQLADAYVDLRPWLLEIGLE -----------------11111111-3333----- >E3 UBIQUITIN-PROTEIN LIGA; SWP:Q96PU5; PDB:2NSQA; GEPVYGLSEDEGESRILRVKVVSGIDLAKKASDPYVKLSLYVADENRELALVQTKTIKKT ---------1111----------------------------------------------- LNPKWNEEFYFRVNPSNHRLLFEVFDENRLTRDDFLGQVDVPLSHLPTEDPYTFKDFLLR -------------3333------------------------------------------- PRSHKSRVKGFLRLKMAYMP --1111-------------- >PROGRAMMED CELL DEATH PRO; SWP:Q61823; PDB:2NSZA; QPVNHLVKEIDMLLKEYLLSGDISEAEHCLKELEVPHFHHELVYEAIVMVLESTGESAFK ------------------------------33331111---------------------- MILDLLKSLWKSSTITIDQMKRGYERIYNEIPDINLDVPHSYSVLERFVEECFQAGIISK ---------1111------------------------2222-----------1111---- QLRDLCPSR -----3333 >GLUCOSYLCERAMIDASE; SWP:P04062; PDB:2NT0A; ARPCIPKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANH ------------------1111----------2222------3333-------------- TGTGLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIR ---------1111-------------------1111-------------1111------- VPMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPWT --------------------1111------------------------------------ SPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPSAGL -11111111----------2222-----------------1111----------3333-- LSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVVLTDPE 2222--------------------------1111---------1111-------111133 AAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVRLGSWDRG 33----------1111--------------1111-----------1111---2222---- MQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTFYKQPMFYHL -------------------------1111---------------3333------------ 
GHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVPLTIKDPAVGFL --3333-2222-------------------1111-------------------1111--- ETISPGYSIHTYLWHRQ ----------------- >COBALAMIN ADENOSYLTRANSFE; SWP:Q50EJ2; PDB:2NT8A; KIYTKNGDKGQTRIIGKQILYKNDPRVAAYGEVDELNSWVGYTKSLINSHTQVLSNELEE ----1111------------1111-------------------111133331111----- IQQLLFDCGHDLATPADDERHSFKFKQEQPTVWLEEKIDNYTQVVPAVKKFILPGGTQLA ----------11111111------------------------------------------ SALHVARTITRRAERQIVQLMREEQINQDVLIFINRLSDYFFAAARYANYLEQQPDMLYR -------------------------------------------------1111------- N - >ENOYL-[ACP] REDUCTASE; SWP:NA; PDB:2NTVA; AGLLEGKRILVSGIITDSSIAFHIAKVAQEAGAQLVLTGFDRLRLIQRIADRLPDKAPLI ---2222--------1111-----------------------------3333-------- ELDVQNEEHLATLAERVTAEIGEGNKLDGVVHSIGFMPQTGMGTNQFFDAPYEDVSKGIH --1111---------------2222------------3333----3333-3333------ ISTYSYASLAKALLLIMNSGGSIVGMDFDPTRAMPAYNWMTVAKSALESVNRFVAREAGK ------------3333-2222---------------!!!!-------------------- YGVRSNLVAAGPIRTLAMSAIVGGAFGEEAGAQMQLLEEGWDQRAPIGWNMKDPTPVAKT -----------------------1111-----------------1111-1111------- VCALLSEWLPATTGSIIYADGGASTQLL --------1111-------iiii----- >EMB|CAB41934.1; SWP:Q9LV40_ARATH; PDB:2NTXA; SERQQADEKDRFAKLLLGEDSGGGKGVSSALALSNAITNLAASIFGEQKLQPPQDRQARW ----------------iiii------------------------!!!!----3333---- KKEIDWLLSVTDHIVEFVPSEIVTRQRGDLLNIPALRKLDALIDTLDNFRGHNEFWYVLP --------------------------1111--------------3333------------ PVKVPPGGLSEPSRRLYFQKDSVTQVQKAAAINAQVLSEEIPESYIDSLPKNGRASLGDS ----1111-3333----------------------------33331111--3333----- IYKSITEEWFDPEQFLALDSTEHKVLDLKNRIEASVVIWKRKSLEKRELFEERAETILVL ---1111-------------3333--------------1111------------------ LKQKFPGLPQSSLDISKIQFNKDVGQAVLESYSRILESLAYTVSRIEDVLYTDTLALKQT ----1111----------------------------------------------3333-- >PERIPLASMIC DIVALENT CATI; SWP:Q9PFN8; PDB:2NUHA; SDVYLIFSTCPDLPSAEIISRVLVQERLAACVTQLPGAVSTYRWQGKIETTQEIQLLIKT ------------------------------------------------------------ 
NAVHVNAAITRLCALHPYRLPEAIAVQVSVGLPEYLTWINTEID 3333--------1111---------------------------- >THIOESTERASE SUPERFAMILY; SWP:Q28QX3; PDB:2NUJA; LPPYHTPLPAETLRALSIPAPWTFGLADRVRFGELDAIGHVNHTAYLRWYESFRLPFLKA -------------1111-------------3333-1111--1111--------------- RHVTDYGPTSPRLVLKQVHCTYLAEGGEDYVITGRVSNFRTTSFTEFACWRLGDAVECTS ------3333-------------------------------------------------- EGSAVVVLLNRDGSGRYPIPEAGRASFVTEDGVLAA ---------1111----------------------- >GLUTAMINE AMIDOTRANSFERAS; SWP:P37528; PDB:2NV0A; MLTIGVLGLQGAVREHIHAIEACGAAGLVVKRPEQLNEVDGLILPGGESTTMRRLIDTYQ --------------------1111-------33331111-----------------1111 FMEPLREFAAQGKPMFGTCAGLIILAKEIAPHLGLLNVVVERNSFGRQVDSFEADLTIKG --------1111---------------------------------1111--------222 LDEPFTGVFIRAPHILEAGENVEVLSEHNGRIVAAKQGQFLGCSFHPELTEDHRVTQLFV 2-----------------1111-----iiii-----!!!!-----1111----------- EMVEEYKQKAL ----------- >PYRIDOXAL BIOSYNTHESIS LY; SWP:P37527; PDB:2NV1A; TERVKRGMAEMQKGGVIMDVINAEQAKIAEEAGAVAVMALERAGGVARMADPTIVEEVMN 3333-------2222--------------1111--------------------------- AVSIPVMAKARIGHIVEARVLEAMGVDYIDESEVLTPADEEFHLNKNEYTVPFVCGCRDL ----------2222-----------------3333---------3333------------ GEATRRIAEGASMLRTKGEPGTGNIVEAVRHMRKVNAQVRKVVAMSEDELMTEAKNLGAP ------------------2222-----------------------1111----------- YELLLQIKKDGKLPVVNFAAGGVATPADAALMMQLGADGVFVGSGIFKSDNPAKFAKAIV --------------------------------1111-------3333------------- EATTHFTDYKLIAELSKEL -------------1111-- >UPF0066 PROTEIN AF_0241; SWP:O29998; PDB:2NV4A; ILKPIGVVKSPFKTQNDAPRQGRFSDAVSEIAIFDEYADGLHKIENLRHIIVLYWDKASR -------------1111---3333---------33331111-3333-------------- DKLRVVPPGETEERGVFTTRSPSRPNPIGLCVVEILEVERNRLKVRWLDALDGSPVIDIK ------2222----1111--------------------------------2222------ KYSPEIDCVNQ --3333----- >PTPRD, PHOSPHATASE; SWP:Q8R169; PDB:2NV5A; SHPPIPILELADHIERLKANDNLKFSQEYESIDPGQQFTWEHSNLEVNKPKNRYANVIAY -----3333-------------------3333-------3333-33331111-1111--3 
DHSRVLLSAIEGIPGSDYVNANYIDGYRKQNAYIATQGSLPETFGDFWRMIWEQRSATVV 333------2222-1111---------------------3333----------------- MMTKLEERSRVKCDQYWPSRGTETHGLVQVTLLDTVELATYCVRTFALYKNGSSEKREVR ------%%%%--------------!!!!---------1111------------------- QFQFTAWPDHGVPEHPTPFLAFLRRVKTCNPPDAGPMVVHCSAGVGRTGCFIVIDAMLER -------------------------1111------------------------------- IKHEKTVDIYGHVTLMRAQRNYMVQTEDQYIFIHDALLEAVTC ---------------11112222-------------------- >ARGININE DECARBOXYLASE, A; SWP:Q84527; PDB:2NVAA; MNSVVNNILKAHPHQTKSFYVSSPKIVEDLIDQWTILFPRVTPHYAVKCNNDEVLLKTMC -----------1111----------------------1111----3333----------1 DKNVNFDCASSSEIKKVIQIGVSPSRIIFAHTMKTIDDLIFAKDQGVDIATFDSSFELDK 111--------------3333-3333---------------------------------- IHTYHPNCKMILRIRCDDPNATVQLGNKFGANEDEIRHLLEYAKQLDIEVIGISFHVGSG ----1111---------1111---3333---1111--------1111------------- SRNPEAYYRAIKSSKEAFNEAISVGHKPYILDIGGGLHADIDLSTYMSDYINDAIKDFFP --3333--------------------------------------3333------------ EDTVTIVAEPGRFFAEHYSVLATQVIGKRVRDGLYEYFFNESTYGGFSNVIFEKSVPTPQ 1111------33331111------------iiii-------1111--3333--------- LLRDVPDDEEYVPSVLYGCTCDGVDVINHNVALPELHIGDWVYFPSWGAYTNVLTTSFNG -----1111------------3333-----------2222----------3333--2222 FGEYDVYYI --------- >Ubiquinol-cytochrome c re; SWP:Q3IY09; PDB:2NVGA; LASIFVDVSSVEPGVQLTVKFLGKPIFIRRRTEADIELGRSVQLGQLVDTNARNANIDAG -------11112222-----iiii--------------11113333-------1111111 AEATDQNRTLDEAGEWLVMWGVCTHLGCVPIGGVSGDFGGWFCPCHGAHYDSAGRIRKGP 1--3333---3333------------------------------------1111------ APENLPIPLAKFIDETTIQLG --------------------- >INTERLEUKIN-1 BETA; SWP:P01584; PDB:2NVHA; APVRSLNCTLRDSQQKSLVMSGPYELKALHLQGQDMEQQVVFSMSFVQGEESNDKIPVAL -----------1111-----------------3333---------------1111----- GLKEKNLYLSCVLKDDKPTLQLESVDPKNYPKKKMEKRFVFNKIEINNKLEFESAQFPNW -2222--------%%%%--------3333------3333------%%%%----3333--- YISTSQAENMPVFLGGTKGGQDITDFTMQFVS -------------------------------- >FDXN ELEMENT EXCISION CON; SWP:Q3MD55; 
PDB:2NVMA; DKLTHYRHTIQEIIKKYYDLSNSLPDTVGDRLIIDEQRDQYLWLCCGWDGKKRVQHIILY -----------------------1111--------1111---------!!!!-------- LQIQNGKIWIEEDSTNLAIVDELVAGIPQTDIILGFHHPSKRG ---%%%%-------iiii33331111-3333--33333333-- >HYPOTHETICAL PROTEIN; SWP:Q5MZF1; PDB:2NVNA; GRILREGAGWRLGWDETAHRYPGLVGTTDWAVELTAAEADFCRLVQQLAETIAAIAPELP ------2222----3333--------1111----3333-------------3333----- EERLQIEAESALLWLEAEGFADAYELRLILASDRRVEACWPAAAVPALVAATHTLKGF -------------------1111-----------------3333------3333---- >HYPOTHETICAL PROTEIN; SWP:Q0TU05; PDB:2NVPA; SLSTNELKEIVRKIGKDLSGKIEDKKLQELFYNCFINTDTTVEVSEGDAFVITGDIPAWL ------------------1111----------------------2222------------ RDSTSQVEHYLPFVKEYPELKAIFTGLINRQVKCIFIDPYANAFNKEPNGQKWDNDITKD ------3333---3333--------------------1111------------------- SPWVWERKYEIDSLCYPVRLIHKYWKESGDETFFNDDIKKAFNIIDLWRVEQYHREKSDY 1111-----3333------------3333------------------------3333--- SFQRLNCSVTDTLSHEGLGTPVTYTGTWSGFRPSDDACEYGYLIPANFAVVALRYISEIA -------------------------------1111-------3333-------------- EKVYKDEELKEKADSLREEIDNAIEKHGKVYKEGFGEVYAYETDGGNYNFDDANVPSLLS --------------------------------------------------------3333 IPYLEYKGIEDEVYQNTRKFILSKNNRFFFEGKAAKGIGSPHTPDQYIWHIALSQGLTTN -3333--1111-----------3333-----3333----11112222---33331111-- NQEEIDQLIKLLKETDAGTGYHEGFHVDDPTKFTRDWFAWSNSLFSHFIYEKVINKKLEH 3333-----------iiii------1111------------------------------- H - >HISTONE DEACETYLASE 7A; SWP:Q8WUI4; PDB:2NVRA; TLPFTTGLIYDSVMLKHQCSCGDNSRHPEHAGRIQSIWSRLQERGLRSQCECLRGRKASL ----------3333----33333333---------------11111111---------33 EELQSVHSERHVLLYGTNPLSRLKLDNGKLAGLLAQRMFVMLPCGGVGVDTDTIWNELHS 333333-----------------------------------1111--------------- SNAARWAAGSVTDLAFKVASRELKNGFAVVRPPGHHADHSTAMGFCFFNSVAIACRQLQQ ---------------------------------1111-----iiii-------------- QSKASKILIVDWDVHHGNGTQQTFYQDPSVLYISLHRHDDGNFFPGSGAVDEVGAGSGEG -2222------------------1111----------%%%%-------1111---1111- 
FNVNVAWAGGLDPPMGDPEYLAAFRIVVMPIAREFSPDLVLVSAGFDAAEGHPAPLGGYH ---------------------------------------------1111----1111--- VSAKCFGYMTQQLMNLAGGAVVLALEGGHDLTAICDASEACVAALLGNRVDPLSEEGWKQ ---------------%%%%-------------------------------33333333-- KPNLNAIRSLEAVIRVHSKYWGCMQR ---------------3333-3333-- >ACETYL-COA HYDROLASE/TRAN; SWP:Q7MVN7; PDB:2NVVA; ALRFITAEEAAEFVHHNDNVGFSGFTPAGNPKVVPAAIAKRAIAAHEKGNPFKIGFTGAS -----3333-11112222--------2222--3333--------3333------------ TGARLDGVLAQADAVKFRTPYQSNKDLRNLINNGSTSYFDLHLSTLAQDLRYGFYGKVDV -1111----1111-----------------1111-------3333--------------- AIIEVADVTEDGKILPTTGVGILPTICRLADRIIVELNDKHPKEIGHDLCEPLDPPARRE --------1111-------------------------33331111--------------- LPVYTPSDRIGKPYVQVDPAKIVGVVRTSEPNDESDFAPLDPVTQAIGDNVAAFLVSEKA ----1111---------3333--------------------------------------- GRIPKDFLPLQSGVGNVANAVLGALGDNPDIPAFNYTEVIQDAVIALKKGRIKFASGCSL ---1111--------------------1111---------3333---------------- SVSRSVIQDIYANLDFFKDKILLRPQEYSNNPEIVRRLGVITINTALEADIFGNINSTHV ------------33331111----3333---3333--------------1111-----22 SGTRNGIGGSGDFTRNSYVSIFTTPSVKDGKISSFVPVAHHDHSEHSVKVIISEWGVADL 22----!!!!---1111----------iiii------------1111-----3333---2 RGKNPRERAHEIIDKCVHPDYRPLLRQYLELGVKGQTPQNLDCCFAFHQELAKSGDRNVR 222--------------3333----3333----------3333----------------3 WEDY 333- >Galactose/lactose metabol; SWP:GAL80_KLULA; PDB:2NVWA; LANNNKRSKLSTVPSSRPIRVGFVGLTSGKSWVAKTHFLAIQQLSSQFQIVALYNPTLKS ----1111----1111---------------------------3333------------- SLQTIEQLQLKHATGFDSLESFAQYKDIDMIVVSVKVPEHYEVVKNILEHSSQNLNLRYL ---------1111-----------1111-------3333--------------------- YVEWALAASVQQAEELYSISQQRANLQTIICLQGRKSPYIVRAKELISEGCIGDINSIEI ----------------------1111-----3333------------------------- SGNGGWYGYERPMRSPEYLYDIESGVNLISNSFGHTIDVLQYITGSYFQKINAMISNNIP ---------------3333-3333--3333------------------------------ TQFLLDGKRTKETISKTCPDHLLFQGILENGKVPVSCSFKGGTPVKKLTKNLVIDIHGTK 
----------------------------2222---------------------------- GDLKIEGDSNLVLYFYGIKNGEEQTMEVFHLRNYNSVVGNILRIYESIADYHFLKFDKQG ----------------------------------------------------------!! FRFEGFPTFKDAIILHRLIDAVFRSDKEEKTLDVSKIMI !!---------------------------------1111 >PLYB; SWP:NA; PDB:2NW0A; GYIVDMSKWNGSPDWDTAKGQLDLVIARVQDGSNYVDPVYKDYVAAMKARNIPFGSYAFC ------3333----33331111--------------1111-------------------- RFVSVEDAKVEARDFWNRGDKDSLFWVADVEVTTMSDMRAGTQAFIDELYRLGAKKVGLY -------------------1111--------------------------3333------- VGHHKYEEFGAAQIKCDFTWIPRYGAKPAYPCDLWQYDEYGQVPGIGKCDLNRLNGDKSL -2222-11111111----------------------------2222------------33 DWFTGKGEE 33------- >ELS4 TCR ALPHA CHAIN; SWP:Q6P4G7; PDB:2NW2A; QNIDQPTEMTATEGAIVQINCTYQTSGFNGLFWYQQHAGEAPTFLSYNVLDGLEEKGRFS -----------2222--------------------------------------------- SFLSRSKGYSYLLLKELQMKDSASYLCAVQAGGSYIPTFGRGTSLIVHPYIQNPDPAVYQ ----1111---------3333--------------------------------------- LRDSKDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSNKSDFAC ---------------3333--------------------1111---------------33 ANAFNNSIIPEDTFFPS 33-1111---------- >ELS4 TCR ALPHA CHAIN; SWP:NA; PDB:2NW2B; DAGITQSPRHKVTETGTPVTLRCHQTENHRYMYWYRQDPGHGLRLIHYSYGVKDTDKGEV -------------2222--------------------2222---------2222------ SDGYSVSRSKTEDFLLTLESATSSQTSVYFCATGTGDSNQPQHFGDGTRLSILEDLNKVF ---------1111--------3333-----------------------------1111-- PPEVAVFEPSEAEISHTQKATLVCLATGFFPDHVELSWWVNGKEVHSGVCTDPQPLKEQP ---------------------------------------iiii--2222---------11 ALNDSRYALSSRLRVSATFWQNPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEAWG 11-------------3333--1111-----------1111-------------------- RAD --- >METHIONINE AMINOPEPTIDASE; SWP:Q8SR45; PDB:2NW5A; CILLNQAEELPIEFLPKDGVYGKGKLFDSRNMEIENFTESDILQDARRAAEAHRRARYRV ---------------------------1111----------------------------1 QSIVRPGITLLEIVRSIEDSTRTLLKGERNNGIGFPAGMSMNSCAAHYTVNPGEQDIVLK 111-2222----------------2222%%%%--------!!!!------2222-----1 
EDDVLKIDFGTHSDGRIMDSAFTVAFKENLEPLLVAAREGTETGIKSLGVDVRVCDIGRD 111---------iiii----------3333------------------22223333---- INEVISSYEVEIGGRMWPIRPISDLHGHSISQFRIHGGISIPAVNNRDTTRIKGDSFYAV ---3333----%%%%---------------2222-------------------------- ETFATTGKGSIDDRPPCSHFVLNTYKSRKLFNKDLIKVYEFVKDSLGTLPFSPRHLDYYG ---------------------------------------------!!!!--33333333- LVKGGSLKSVNLLTMMGLLTPYPPLNDIDGCKVAQFEHTVYLSEHGKEVLTRGDDY -2222----------------------2222-----------3333------3333 >TRYPTOPHAN 2,3-DIOXYGENAS; SWP:Q8PDA8; PDB:2NW8A; EGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQHQTSELWLKLLAHELRAAIVHLQR -1111--------1111---------1111---------------------------111 DEVWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLL 1-------------------------1111333311113333---3333----------- GNKNPQMLQVFAYDPAGQARLREVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVA ---33333333----3333----1111-----------1111---3333---3333---- DDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRHMRTVMRVIGFKRGTGGSS 3333----------3333-------------------------------!!!!-3333-- GVGFLQQALALTFFPELFDVRTSVGV -----3333----3333-3333---- ------------------------------------------------------------ ---------------- >Lysozyme C [Precursor]; SWP:P61626; PDB:2NWDX; KVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRATNYNAGDRSTDYGIFQIN ------------11112222---3333--------%%%%---------------1111-3 SRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRDPQGIRAWVAWRNRCQNRD 333------2222-1111-3333------------------3333----3333--2222- VRQYVQGCGV 33332222-- >CARBOHYDRATE KINASE; SWP:Q8UE86; PDB:2NWHA; MKKILVLGGAHIDRRGMIETETAPGASNPGSWMEEAGGGGFNAARNLSRLGFEVRIIAPR ----------------------2222---------------------1111--------- GGDVTGEVVAEAARQAGVEDTPFTFLDRRTPSYTAILERDGNLVIALADMDLYKLFTPRR -------------1111-------1111---------1111--------3333---3333 LKVRAVREAIIASDFLLCDANLPEDTLTALGLIARACEKPLAAIAISPAKAVKLKAALGD ---------1111---------------------------------3333----1111-- IDILFMNEAEARALTGVRDWPNILRKAGLSGGVVTRGASEVVAFNGTEKAILHPPLIREV ----------------1111-----------------------------------1111- 
KDVTGAGDAMASGYLAAIAEGKTIREALRQGAAAAAITVQSSFATSQDLSKDSVEAMLGL -----------------1111-------------------11113333--------3333 VPQAEML ------- >HYPOTHETICAL PROTEIN; SWP:O28875; PDB:2NWIA; LNAIHRILMTTDGSITAIIEAVTQKKVEVETLEQKIIRADRELAELLEIDEGDEVNYRVV -------------------------------------------------2222------- YLRANGEIYAKAISFTPLKRLENSFREDLMRADIPIGKIMRKHNIEARREIRWSRVEEAD ----------------1111---------------------------------------- LALAKELGIADRRVISRNYNIIHRGKVLINITEFFPMERF ----------------------iiii---------3333- >GLUTAMATE SYMPORT PROTEIN; SWP:O59010; PDB:2NWLA; YPVLIKILIGLILGAIVGLILGHYGYAHAVHTYVKPFGDLFVRLLKMLVMPIVFASLVVG ---------------------1111----------------------------------- AASISPARLGRVGVKIVVYYLLTSAFAVTLGIIMARLFNPGAGIHLAVGGQQFPPLVHIL ----------------------------------------------------------33 LDIVPTNPFGALANGQVLPTIFFAIILGIAITYLMNSENEKVRKSAETLLDAINGLAEAM 33------------------------------3333------------------------ YKIVNGVMQYAPIGVFALIAYVMAEQGVHVVGELAKVTAAVYVGLTLQILLVYFVLLKIY -----1111---------------------!!!!-----------------------111 GIDPISFIKHAKDAMLTAFVTRSSSGTLPVTMRVAKEMGISEGIYSFTLPLGATINMDGT 1-3333-----------------3333-------------3333-----3333---3333 ALYQGVCTFFIANALGSHLTVGQQLTIVLTAVLASIGTAGVPGAGAIMLAMVLHSVGLPL ------------------------------------------------------------ TDPNVAAAYAMILGIDAILDMGRTMVNVTGDLTGTAIVAKTE ------------------------------------------ >PROBABLE SHORT-CHAIN DEHY; SWP:Q9HUQ6; PDB:2NWQA; SSTLFITGATSGFGEACARRFAEAGWSLVLTGRREERLQALAGELSAKTRVLPLTLDVRD -------1111----------1111-------------------1111--------3333 RAASAAVDNLPEEFATLRGLINNAGLALGTDPAQSCDLDDWDTVDTNIKGLLYSTRLLLP ------11113333-----------------3333------------------------- RLIAHGAGASIVNLGSVAGKWPYPGSHVYGGTKAFVEQFSLNLRCDLQGTGVRVTNLEPG --3333---------3333---2222----------------33332222---------- LCEGAHPIQPEDIAETIFWINQPAHLNINSLEIPVSQSWAGFAIH --------3333----------1111-------3333-------- >UPF0165 PROTEIN AF_2212; SWP:O28071; PDB:2NWTA; MPKIIEAVYENGVFKPLQKVDLKEGERVKIKLELKVEPIDLGEPVSVEEIKKIRDGTWMS 
---------iiii----------------------------------1111--------- SLEHHHHHH --------- >UPF0201 PROTEIN SSO1042; SWP:Q97Z89; PDB:2NWUA; DKVVVAEVRPSEDVNKVLSAISNFFDFEKNTGIIDILVLEARTLKSLLKFHRVLRNERIL --------11113333--------------------------------------1111-- DSARKYLKGIEGNTIAFIHKQAAAVGVLSFVAIKFYIEYQNPKEIVDWLAPKTAHGVPLW ----------------------1111--------------3333---------iiii--- DNPVPPD ------- >XISI PROTEIN-LIKE; SWP:Q3M7V9; PDB:2NWVA; DNVAEYRKLIKQVLTEYDNLSRQSPETNYETCLVFDENHDNYLWLAVDWQGSKRIKYTYV --------------------1111-------------------------!!!!------- HIRIKNEKIYIEEDYTEEGIATELRLGVTNNDIVLAFHPPDVRKFTDFATA ----%%%%-------11113333-----3333--111111111111----- >HYPOTHETICAL PROTEIN YPSA; SWP:P50838; PDB:2NX2A; SLKVLAITGYKPFELGIFKQDDKALYYIKKAIKNRLIAFLDEGLEWILISGQLGVELWAA ----------3333--------3333-------------1111----------------- EAAYDLQEEYPDLKVAVITPFYEQEKNWKEPNKEQYEAVLAQADYEASLTHRPYESPLQF ------1111------------1111-------------1111----------------- KQKNQFFIDKSDGLLLLYDPEKEGSPKYMLGTAEKRREQDGYPIYFITMDDLRVTVEE ------------------3333-1111--------3333------------------- >TRANSCRIPTIONAL REGULATOR; SWP:Q0SCU8; PDB:2NX4A; GVPKLVDHDERRRSITAAAWRLIAARGIEAANMRDIATEAGYTNGALSHYFAGKDEILRT ---------------------------3333---------------3333--3333---- SYEHISEATDRRIAEALGDATGLDALRILCREVMPINEEQLLEARIAASLWPRAMYDEQM ----------------!!!!-------------------------------3333--333 AATNRRTMDNWREQMAIFLEQAREEGSVGDIDVTIVVEQLLNMMMGMQILGVLTPGETSS 3----------------------------------------------------------- ERQLEMLEQFVAAL -------------- >OXALOACETATE DECARBOXYLAS; SWP:Q6A1F6; PDB:2NX9A; AIKRVGVTDVVLRDAHQSLFATRLRIDDMLPIAQQLDQIGYWSLECWGGATFDSCIRFLG ---------1111------%%%%-11113333-3333----------!!!!--------- EDPWQRLRLLKQAMPNTPLQMLLRGQNLLGYRHYADDVVDTFVERAVKNGMDVFRVFDAM ---------------------------------------------------------111 NDVRNMQQALQAVKKMGAHAQGTLCYTTSPVHNLQTWVDVAQQLAELGVDSIALKDMAGI 13333--------1111-----------1111------------1111-------1111- LTPYAAEELVSTLKKQVDVELHLHCHSTAGLADMTLLKAIEAGVDRVDTAISSMSGTYGH 
--------------------------1111---------1111-------3333-!!!!- PATESLVATLQGTGYDTGLDIAKLEQIAAYFRDVRKKYHAFEGMMKGSDARILVAQVPGG -------1111-1111-----------------33333333-------3333-----333 MLTNMESQLKQQNALDKLDLVLEEIPRVREELGFLPLVTPTSQIVGTQAVINVVLGERYK 3------------3333------------------------------------------- TITKETSGVLKGEYGKTPAPVNTELQARVLAGAEAITCRPADLIAAEMPTLQDRVLQQAK -----------1111-----------------------3333------------------ EQHITLAENAIDDVLTIALFDQVGWKFLANR ------------------------------- >BROMODOMAIN-CONTAINING PR; SWP:Q15059; PDB:2NXBA; RKTNQLQYMQNVVVKTLWKHQFAWPFYQPVDAIKLNLPDYHKIIKNPMDMGTIKKRLENN ---------------333311111111---3333--1111-------------------- YYWSASECMQDFNTMFTNCYIYNKPTDDIVLMAQALEKIFLQKVAQMPQEE -----------------------1111----------------1111---- >RIBOSOMAL PROTEIN L11 MET; SWP:Q84BQ9; PDB:2NXCA; MWVYRLKGTLEALDPILPGLFDGGARGLWEREGEVWAFFPAPVDLPYEGVWEEVGDEDWL --------33333333----1111------%%%%--------------------3333-- EAWRRDLKPALAPPFVVLAPWHTWEGAEIPLVIEPGGHHETTRLALKALARHLRPGDKVL ---1111-----------1111-------------------------------2222--- DLGTGSGVLAIAAEKLGGKALGVDIDPMVLPQAEANAKRNGVRPRFLEGSLEAALPFGPF ---!!!!------------------3333--------1111--------33333333--- DLLVANLYAELHAALAPRYREALVPGGRALLTGILKDRAPLVREAMAGAGFRPLEEAAEG -------------------11112222-------1111--------1111--------!! EWVLLAYGR !!------- >PUTATIVE DIMETAL PHOSPHAT; SWP:Q7T291; PDB:2NXFA; DPVFTFGLIADVQYADIEDGENYLRTRRRYYRGSADLLRDAVLQWRRERVQCVVQLGDII ---------------------1111----------------------------------- DGHNRRRDASDRALDTVAELDACSVDVHHVWGNHEFYNFSRPSLLSSRLNSAQGSDLIGD --3333--------------3333-------3333-----------1111------3333 DIYAYEFSPAPNFRFVLLDAYDLSVIGREEESEKHTHSWRILTQHNHNLQDLNLPPVSVG ---------2222-----1111------2222---------------3333--------3 LEQRFVKFNGGFSEQQLQWLDAVLTLSDHKQERVLIFSHLPVHPCAADPICLAWNHEAVL 333--1111---------------------------------1111-3333---3333-- SVLRSHQSVLCFIAGHDHDGGRCTDSSGAQHITLEGVIETPPHSHAFATAYLYEDRVKGR -----3333-------3333----3333-------3333-1111---------------! 
GRVEDLTITYS !!!-------- >50S ribosomal protein L11; SWP:P36238; PDB:2NXNB; MKKVVAVVKLQLPAGKATPAPPVGPALGQHGANIMEFVKAFNAATANMGDAIVPVEITIY ------------2222-----------1111-----------1111-------------1 ADRSFTFVTKTPPASYLIRKAAGLEKGAHKPGREKVGRITWEQVLEIAKQKMPDLNTTDL 111---------3333-------------------------------------------3 EAAARMIAGSARSMGVEVV 333--3333---------- >HYPOTHETICAL PROTEIN SCO4; SWP:Q9L0T8; PDB:2NXOA; TRPRVGHIQFLNCLPLYWGLARTGTLLDFELTKDTPEKLSEQLVRGDLDIGPVTLVEFLK ---------3333-------11111111--------------1111-------------- NADDLVAFPDIAVGCDGPVMSCVIVSQVPLDRLDGARVALGSTSRTSVRLAQLLLSERFG 3333------------------------3333--------3333---------------- VQPDYYTCPPDLSLMAAVLIGDAALRANMIDGPRYGLDVHDLGALWKEWTGLPFVFAVWA ------------------------------3333-------------------------- ARRDYAEREPVITRKVHEAFLASRNLSLEEVEKVAEQAARWEAFDEDTLAKYFTTLDFRF ------------------------------------------------------------ GAPQLEAVTEFARRVGPTTGFPADVKVELLKP 3333----------3333-------------- >TRANSCRIPTION INITIATION ; SWP:Q15542; PDB:2NXPA; QPDVSAVLSAYNQQGDPTYEEYYSGLKHFIECSLDCHRAELSQLFYPLFVHYLELVYNQH -----------------------------1111----------------------1111- ENEAKSFFEKFHGDQECYYQDDLRVLSSLTKKEHKGNETLDFRTSKFVLRISRDSYQLLK -----------11111111------3333-3333-----11111111------------- RHLQEKQNNQIWNIVQEHLYIDIFD -1111-------------------- >ENVELOPE GLYCOPROTEIN GP1; SWP:P01730; PDB:2NY1A; EVVLVNVTENFNMWKNDMVEQMHEDICSLWDQSLKPCVKLTPLCVGAGSCNTSVITQACP -----------1111--------------------------------------------- KVSFEPIPIHYCAPAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPVVSSQLLLNGSLA -------------2222------------------------------------------- EEEVVIRSVNFTDNAKTIIVQLNTSVEINCTGAGHCNIARAKWNNTLKQIASKLREQFGN ---------3333---------------------------------------------11 NKTIIFKQSSGGDPEIVTHWFNCGGEFFYCNSTQLFNSTWFNGSDTITLPCRIKQIINMW 11----------3333------iiii-----3333------------------------- CKVGKAMYAPPISGQIRCSSNITGLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYK -------------------------------------------------------1111- VVKIE ----- >T-cell surface 
glycoprote; SWP:P01730; PDB:2NY1B; KKVVLGKKGDTVELTCTASQKKSIQFHWKNSNQIKILGNQGSFLTKGPSKLNDRADSRRS ------2222-------------------1111------!!!!-----1111-----111 LWDQGNFPLIIKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSDTHLLQGQSLTLTLES 11111---------3333-------%%%%-------------------2222-------- PPGSSPSVQCRSPRGKNIQGGKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIVVLAFQ 2222-------1111--------------1111---------iiii-------------- K - >ENVELOPE GLYCOPROTEIN GP1; SWP:NA; PDB:2NY1C; DIVMTQSPATLSVSPGERATLSCRASESVSSDLAWYQQKPGQAPRLLIYGASTRATGVPA -------------2222---------------------2222------------222233 RFSGSGSGAEFTLTISSLQSEDFAVYYCQQYNNWPPRYTFGQGTRLEIKRTVAAPSVFIF 33----------------1111-------------------------------------- PPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSST --33333333---------------------iiii------------------------- LTLSKADYEKHKVYACEVTHQGLSSPVTKSFNRG ---3333------------1111----------- >ENVELOPE GLYCOPROTEIN GP1; SWP:NA; PDB:2NY1D; EVQLVESGAEVKKPGSSVKVSCKASGDTFIRYSFTWVRQAPGQGLEWMGRIITILDVAHY ------------2222-----------1111--------2222---------1111---- APHLQGRVTITADKSTSTVYLELRNLRSDDTAVYFCAGVYEGEADEGEYDNNGFLKHWGQ 3333---------1111---------1111------------3333-------------- GTLVTVSSASTKGPSVFPLAPGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQ ------------------------------------------%%%%--2222-------1 SSGLYSLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEPK 111----------1111------------1111---------- >PERIPLASMIC NITRATE REDUC; SWP:P33937; PDB:2NYAA; EAIKWDKAPCRFCGTGCGVLVGTQQGRVVACQGDPDAPVNRGLNCIKGYFLPKIMYGKDR ------------3333-------iiii------1111--iiii-3333-1111------- LTQPLLRMKNGKYDKEGEFTPITWDQAFDVMEEKFKTALKEKGPESIGMFGSGQWTIWEG -------------1111-------------------------3333-------------- YAASKLFKAGFRSNNIDPNARHCMASAVVGFMRTFGMDEPMGCYDDIEQADAFVLWGANM -------------------1111-3333--------------3333------------33 AEMHPILWSRITNRRLSNQNVTVAVLSTYQHRSFELADNGIIFTPQSDLVILNYIANYII 33------------1111------------1111---------2222------------1 QNNAINQDFFSKHVNLRKGATDIGYGLRPTHPLEKAAKNPGSDASEPMSFEDYKAFVAEY 
111---3333-----------------33333333---2222--------------3333 TLEKTAEMTGVPKDQLEQLAQLYADPNKKVISYWTMGFNQHTRGVWANNLVYNLHLLTGK ---------------------3333---------3333---------------------- ISQPGCGPFSLTGQPSACGTAREVGTFAHRLPADMVVTNEKHRDICEKKWNIPSGTIPAK --2222--------------------1111-iiii1111-------------2222---- IGLHAVAQDRALKDGKLNVYWTMCTNNMQAGPNINEERMPGWRDPRNFIIVSDPYPTVSA -----------1111----------3333---3333-------1111---------3333 LAADLILPTAMWVEKEGAYGNAERRTQFWRQQVQAPGEAKSDLWQLVQFSRRFKTEEVWP ----------!!!!---------------------------------------3333--1 EDLLAKKPELRGKTLYEVLYATPEVSKFPVSELAEDQLNDESRELGFYLQKGLFEEYAWF 11133331111----3333--3333---3333-1111-------------------3333 GRGHGHDLAPFDDYHKARGLRWPVVNGKETQWRYSEGNDPYVKAGEGYKFYGKPDGKAVI 2222------3333----------iiii----------11112222------1111---- FALPFEPAAEAPDEEYDLWLSTGRVLEHWHTGSMTRRVPELHRAFPEAVLFIHPLDAKAR ------------------------1111!!!!-333311111111-------33331111 DLRRGDKVKVVSRRGEVISIVETRGRNRPPQGLVYMPFFDAAQLVNKLTLDATDPLSKET -----------1111--------------2222------11111111------------- DFKKCAVKLEK ----------- >SUPEROXIDE DISMUTASE [FE]; SWP:P0AGD3; PDB:2NYBA; SFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSE ---------1111----------------------------2222-2222----1111-- GGVFNNAAEVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGS ----------------1111---------------------------------------- GWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWAL -------1111-------!!!!3333-----------3333----!!!!------3333- VNWEFVAKNLAA ------------ >NUCLEAR PROTEIN SNF4; SWP:P12904; PDB:2NYCA; THFLKIPIGDLNIITQDNMKSCQMTTPVIDVIQMLTQGRVSSVPIIDENGYLINVYEAYD 3333---1111-----------11113333-----3333-------1111------3333 VLGLIKGGLSLSVGEALMRRSYTCTKNDKLSTIMDNIRKARVHRFFVVDDVGRLVGVLTL ---------------3333-----1111--------------------1111-------- SDILKYILLG ------3333 >UPF0135 PROTEIN SA1388; SWP:Y1388_STAAN; PDB:2NYDA; PMKIADLMTLLDHHVPFSTAESWDNVGLLIGDEDVEVTGVLTALDCTLEVVNEAIEKGYN ---------------3333-1111-------1111------------------------- 
TIISHHPLIFKGVTSLKANGYGLIIRKLIQHDINLIAMHTNLDVNPYGVNMMLAKVMGLK ------------------!!!!------1111------3333--1111------1111-- NISIINNQQDVYYKVQEFMIDAYQKSRAEQLIKQTPVFDFIEIKQTSLYGLGVMAEVDNQ ---------------------3333----------------------------------- MTLEDFAADIKSKLNIPSVRFVGESNQKIKRIAIIGGSGIGYEYQAVQQGADVFVTGDIK -----------------------1111--------------------------------- HHDALDAKIHGVNLIDINHYSEYVMKEGLKTLLMNWFNIEKINIDVEASTINTDPFQYI -----------------3333----------------1111------------------ >NOSTOC PUNCTIFORME PHENYL; SWP:NA; PDB:2NYFA; SIVTVGDRNLTIDEVVNVARHGTQVRLTDNADVIRGVQASCDYINNAVETAISREQAAEL ----------3333------------------------------3333------------ QTNLIWFLKSGAGNKLSLADVRAAMLLRANSHLYGASGIRLELIQRIETFLNAGVTPHVY -----1111-------3333----------1111-----3333----------------- EFGSIGDLVPLSYITGALIGLDPSFTVDFDGKEMDAVTALSRLGLPKLQLQPKEGLAMMN -------3333----------3333---------------1111---------------- GTSVMTGIAANCVYDAKVLLALTMGVHALAIQGLYGTNQSFHPFIHQCKPHPGQLWTADQ -------------------------------------33333333--------------- MFSLLKDSSLVREEQDRYSLRCLAQFIGPIVDGVSEITKQIEVEMNSVTDNPLIDVENQV --1111----------3333----------------------------------3333-- SYHGGNFLGQYVGVTMDRLRYYIGLLAKHIDVQIALLVSPEFSNGLPPSLVGNSDRKVNM --------3333--------------------------3333----2222--3333---! 
GLKGLQISGNSIMPLLSFYGNSLADRFPTHAEQFNQNINSQGYISANLTRRSVDIFQNYM !!!-------------------1111-1111iiii------------------------- AIALMFGVQAVDLRTYKMKGHYDARTCLSPNTVQLYTAVCEVVGKPLTSVRPYIWNDNEQ ----------------------------3333---------------1111----1111- CLDEHIARISADIAGGGLIVQAVEHIFSSL 3333-------------3333-33333333 >YOKD PROTEIN; SWP:O32003; PDB:2NYGA; LKKIVESTTFPRTKQSITEDLKALGLKKGMTVLVHSSLSSIGWVNGGAVAVIQALIDVVT --------------------------2222------3333-------------------1 EEGTIVMPSQSVELSDPKEWGNPPVPEEWWDIIRESMPAYNSNYTPTTRGMGQIVELFRS 111-------3333-3333------3333--3333-----1111---3333-----3333 YPEVKRSNHPNYSFVAWGKHKNKILNQHPLEFGLGEQSPLGKLYIRESYVLLLGADFDSS 2222-------------1111-------------2222-1111------------33333 TCFHLAEYRIPYQKIINRGAPIIVEGKRVWKEYKELEFREELFQEVGQAFEAEHNMKVGK 3333333----------------%%%%-----------3333------3333-------- VGSANCRLFSLTEAVDFAEKWFINNDSKNI !!!!-------------------------- >PUTATIVE DIOXYGENASE; SWP:Q13JM0; PDB:2NYHA; GTFRDTSAIASWHAHVYFDASSRDAAWTLREQIEAHWSGKLQLGRFHERPVGPHPWSYQL ----3333----------3333--------------%%%%----------!!!!------ AFTQEQFADLVGWLTLNHGALDIFLHPNTGDALRDHRDAAVWIGHSHELVLSAL --1111-----------!!!!----------------------------3333- >UNKNOWN PROTEIN; SWP:NA; PDB:2NYIA; ETQSFVVSVAGSDRVGIVHDFSWALKNISANVESSRACLGGDFAIVLVSLNAKDGKLIQS -------------------------1111---------iiii------------------ ALESALPGFQISTRRASSVVSPDTREYELYVEGPDSEGIVEAVTAVLAKKGANIVELETE -----2222-----------1111-----------1111--------1111--------- TLPAPFAGFTLFRGSRVAFPFPLYQEVVTALSRVEEEFGVDIDLEEVV -------------------3333------------------------- >PHENYLALANINE/HISTIDINE A; SWP:Q3M5Z3; PDB:2NYNA; NVIIGNQKLTINDVARVARNGTLVSLTNNTDILQGIQASCDYINNAVESGISREQASELQ ------------------------------------------------------------ TNLVWFLKTGAGNKLPLADVRAAMLLRANSHMRGASGIRLELIKRMEIFLNAGVTPYVYE ----1111-------3333-------------------3333------------------ FGSIGDLVPLSYITGSLIGLDPSFKVDFNGKEMDAPTALRQLNLSPLTLLPKEGLAMMNG ------3333----------3333---------------1111------2222------- 
TSVMTGIAANCVYDTQILTAIAMGVHALDIQALNGTNQSFHPFIHNSKPHPGQLWAADQM ------------------------------------3333----1111------------ ISLLANSQLVRDELDGKIQDRYSLRCLPQYLGPIVDGISQIAKQIEIEINSVTDNPLIDV ---2222-----1111----3333-----------------------1111-------33 DNQASYHGGNFLGQYVGMGMDHLRYYIGLLAKHLDVQIALLASPEFSNGLPPSLLGNRER 33------11113333--------------------------3333iiii2222--3333 KVNMGLKGLQICGNSIMPLLTFYGNSIADRFPTHAEQFNQNINSQGYTSATLARRSVDIF ---!!!!-------------------1111-1111%%%%--------------------- QNYVAIALMFGVQAVDLRTYKKTGHYDARACLSPATERLYSAVRHVVGQKPTSDRPYIWN --------------------------3333--3333---------------3333----1 DNEQGLDEHIARISADIAAGGVIVQAVQDIL 111-3333--------1111------1111- >AGR_C_3712P; SWP:Q8U561; PDB:2NYSA; QDHIRYDILAQDALRGVIRKVLGEVAATGRLPGDHHFFITFLTGAPGVRISQHLKSKYAE -------------------------------!!!!------3333--------------- QTIVIQHQFWDKVTETGFEIGLSFSDTPEKLVIPYNAIRGFYDPSVNFELEFDVP -------------3333------iiii------1111-----3333--------- >PROBABLE C to U-EDITING E; SWP:Q9Y235; PDB:2NYTA; SGGGMIVTGERLPANFFKFQFRNVEYSSGRNKTFLCYVVEAQGKGGQVQASRGYLEDEHA -----------3333---1111-------------------------------------- AAHAEEAFFNTILPAFDPALRYNVTWYVSSSPCAACADRIIKTLSKTKNLRLLILVGRLF --33331111------3333--------------------------1111---------- MWEEPEIQAALKKLKEAGCKLRIMKPQDFEYVWQNFVEQEEAFQPWEDIQENFLYYEEKL 3333--------------------3333-------------------------------- ADILK ----- >PUTATIVE RIBOSOMAL RNA ME; SWP:Q9UI43; PDB:2NYUA; SYRSRSAFKLLEVNERHQILRPGLRVLDCGAAPGAWSQVAVQKVNAAGTDPSSPVGFVLG ---3333-------------2222--------------------1111-1111------- VDLLHIFPLEGATFLCPADVTDPRTSQRILEVLPGRRADVILSDMAPNATGFRDLDHDRL --------2222------1111----------2222------------------------ ISLCLTLLSVTPDILQPGGTFLCKTWAGSQSRRLQRRLTEEFQNVRIIKPEVYFLATQYH ---------3333--2222--------1111----------------------------- G - >PHOSPHOGLYCOLATE PHOSPHAT; SWP:O67359; PDB:2NYVA; SLRVILFDLDGTLIDSAKDIALALEKTLKELGLEEYYPDNVTKYIGGGVRALLEKVLKDK --------2222--------------------3333---3333-------------!!!! 
FREEYVEVFRKHYLENPVVYTKPYPEIPYTLEALKSKGFKLAVVSNKLEELSKKILDILN -3333------------------2222-------1111---------------------- LSGYFDLIVGGDTFGEKKPSPTPVLKTLEILGEEPEKALIVGDTDADIEAGKRAGTKTAL 3333-----1111-1111---------------3333----------------------- ALWGYVKLNSQIPDFTLSRPSDLVKLDNHIVEFEG 1111--------------3333------------- >Probable transcriptional ; SWP:P71672; PDB:2NYXA; PATAEESVDVITDALLTASRLLVAISAHSIAQVDENITIPQFRTLVILSNHGPINLATLA ---------------------------------1111----------------------- TLLGVQPSATGRVDRLVGAELIDRLPHPTSRRELLAALTKRGRDVVRQVTEHRRTEIARI ----------------1111---------------------------------------- VEQAPAERHGLVRALTAFTEAGGE ---3333----------------- >BOTULINUM NEUROTOXIN TYPE; SWP:NA; PDB:2NYYD; VQLQESGGGLVQPGGSLRLSCAASGFTFKYDYMYWVRQAPGKGLEWVATISDGGSYTYYS -----------2222-----------1111--------------------1111-----3 DSVEGRFTTSRDNSKNTLYLQMNSLRAEDTAIYYCSRYRYDDAMDYWGQGTLVTVSSAST 333---------1111---------1111------------------------------- KGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSGLY -------------------------------------%%%%-------------3333-- SLSSVVTVPSSSLGTQTYICNVNHKPSNTKVDKKVEP ------------------------------------- >ARGININOSUCCINATE SYNTHAS; SWP:NA; PDB:2NZ2A; KGSVVLAYSGGLDTSCILVWLKEQGYDVIAYLANIGQKEDFEEARKKALKLGAKKVFIED ---------------------1111----------------------------------- VSREFVEEFIWPAIQSSALYEDRYLLGTSLARPCIARKQVEIAQREGAKYVSHGATGKGN -------------------------3333-3333---------------------11113 DQVRFELSCYSLAPQIKVIAPWRMPEFYNRFKRNDLMEYAKQHGIPIPVTPKNPWSMDEN 333---------1111---3333----3333-3333----1111---------------3 LMHISYEAGILENPKNQAPPGLYTKTQDPAKAPNTPDILEIEFKKGVPVKVTNVKDGTTH 333----!!!!-3333--1111-----1111------------iiii------------- QTSLELFMYLNEVAGKHGVGRIDIVENRFIGMKSRGIYETPAGTILYHAHLDIEAFTMDR ---------------------------1111---------3333---------------- EVRKIKQGLGLKFAELVYTGFWHSPECEFVRHCIAKSQERVEGKVQVSVLKGQVYILGRE -----------------------3333--------1111----------iiii------- SPLSLYNEELVSNVQGDYEPTDATGFININSLRLKEYHRLQS 
1111--3333--------------------3333-------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2NZCA; EKRFYILTIVVEDREKAYRQVNELLHNFSEDILLRVGYPVREENAIIFLVLKTDNDTIGA ---------------------------3333--------1111----------------- LSGKLGQISGVRVKTVPLK -------2222-------- >CHOLERA ENTEROTOXIN SUBUN; SWP:NA; PDB:2NZGD; APQNITELCSEYHNTQIYTINDKILSYTESLRGKREMAIITFKNGATFQVEVPGQKKAIE -------------------------------2222------1111--------------- RMKDTLRIAYLTEAKVEKLCVWNNKTPNSIAAISMA ------------------------------------ >GTP-BINDING PROTEIN REM 1; SWP:O75628; PDB:2NZJA; MALYRVVLLGDPGVGKTSLASLFAGKHEQLGEDVYERTLTVDGEDTTLVVVDTWSWSQES ----------2222--------------------------iiii---------------1 CLQGGSAYVIVYSIADRGSFESASELRIQLRRTHVPIILVGNKADLARCREVSVEEGRAC 111---------1111--------------------------3333-------------- AVVFDCKFIETSATLQHNVAELFEGVVRQLRLRRR ------------1111------------------- >HYDROXYACID OXIDASE 1; SWP:Q9UJM8; PDB:2NZLA; RLICINDYEQHAKSVLPKSIYDYYRSGANDEETLADNIAAFSRWKLYPRMLRNVAETDLS ---3333---------3333-------!!!!-------3333------------------ TSVLGQRVSMPICVGATAMQRMAHVDGELATVRACQSLGTGMMLSSWATSSIEEVAEAGP --iiii------------1111-1111-------------------------------11 EALRWLQLYIYKDREVTKKLVRQAEKMGYKAIFVTVDTPYLGNRLDDVRNRFKLPPQLRM 11----------------------1111--------------------------1111-- KNFGLAAYVAKAIDPSISWEDIKWLRTSLPIVAKGILRGDDAREAVKHGLNGILVSNHGA -------------11113333------------------------1111-------%%%% RQLDGVPATIDVLPEIVEAVEGKVEVFLDGGVRKGTDVLKALALGAKAVFVGRPIVWGLA -------3333--------iiii----------3333----1111------3333----- FQGEKGVQDVLEILKEEFRLAMALSGCQNVKVIDKTLVR ----------------------1111--3333-1111-- >HEXOKINASE-2; SWP:P52789; PDB:2NZTA; DQVQKVDQYLYHMRLSDETLLEISKRFRKEMEKGLGATTHPTAAVKMLPTFVRSTPDGTE -1111----1111-------------------------3333------------------ HGEFLALDLGGTNFRVLWVKVVEMENQIYAIPEDIMRGSGTQLFDHIAECLANFMDKLQI -------------------------------3333------------------------1 KDKKLPLGFTFSFPCHQTKLDESFLVSWTKGFKSSGVEGRDVVALIRKAIQRRGDFDIDI 111-------------------------!!!!----2222---------1111------- 
VAVVNDTVGTMMTCGYDDHNCEIGLIVGTGSNACYMEEMRHIDMVEGDEGRMCINMEWGA -------------33331111----------------11111111-----------3333 FGDDGSLNDIRTEFDQEIDMGSLNPGKQLFEKMISGMYMGELVRLILVKMAKEELLFGGK -1111-3333--------1111-2222---1111----------------1111-%%%%- LSPELLNTGRFETKDISDIEGEKDGIRKAREVLMRLGLDPTQEDCVATHRICQIVSTRSA -3333------3333------------------1111---3333---------------- SLCAATLAAVLQRIKENKGEERLRSTIGVDGSVYKKHPHFAKRLHKTVRRLVPGCDVRFL ------------------------------3333-----------------1111----- RSEDGSGKGAAMVTAVAYRLADQHRARQKTLEHLQLSHDQLLEVKRRMKVEMERGLSKET ----3333---------------------3333------------------------111 HASAPVKMLPTYVCGDFLALDLGGTNFRVLLVRVRGVEMHNKIYAIPQEVMHGTGDELFD 1-----------------------------------------------3333-------- HIVQCIADFLEYMGMSLPLGFTFSFPCQQNSLDESILLKWTKGFKASGCEGEDVVTLLKE ------------------------------1111------!!!!----2222-------- AIHRRDLDVVAVVNDTVGTMMTCGFEDPHCEVGLIVGTGSNACYMEEMRNVELVEGEEGR -3333-----------------33331111----------------33331111------ MCVNMEWGAFGDNGCLDDFRTEFDVAVDELSLNPGKQRFEKMISGMYLGEIVRNILIDFT -----3333-1111-3333-3333---1111-22223333---3333------------1 KRGLLFRGRISERLKTRGIFETKFLSQIESDCLALLQVRAILQHLGLESTCDDSIIVKEV 111-%%%%--1111-2222-3333--1111------------1111-------------- CTVVARRAAQLCGAGMAAVVDRIRENRGLDALKVTVGVDGTLYKLHPHFAKVMHETVKDL ------------------------1111-------------------------------- APKCDVSFLQSEDGSGKGAALITAVACRIRE 1111---------3333-------------- >ALPHA1,3-FUCOSYLTRANSFERA; SWP:O30511; PDB:2NZXA; MFQPLLDAYVESASIEKMASKSPPPLKIAVANWWGDEEIKEFKNSVLYFILSQRYTITLH ----------1111----------------1111----------------1111------ QNPNEFSDLVFGNPYQNAKRVFYTGENESPNFNLFDYAIGFDELDFNDRYLRMPLYYDRL --------------1111------------3333-----------!!!!----------- HHKAESVNDTTAPYKLKDNSLYALKKPSHCFKEKHPNLCAVVNDESDPLKRGFASFVASN --------1111----22221111-----3333-------------3333---------- PNAPIRNAFYDALNSIEPVTGGGSVRNTLGYNVKNKNEFLSQYKFNLCFENTQGYGYVTE -------------1111-----------------3333-1111----------2222--3 
KIIDAYFSHTIPIYWGSPSVAKDFNPKSFVNVHDFKNFDEAIDYIKYLHTHKNAYLDMLY 333--1111-------1111----1111--3333-----------------------111 ENPLNTLDGKAYFYQNLSFKKILAFFKTILENDTIYHDNP 1-----iiii--2222------------------------ >PROBABLE ZINC UPTAKE REGU; SWP:O05839; PDB:2O03A; ASAAGVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGLTTVYRTLQSMASSGLV ---3333-----------1111-------------1111--------------------- DTLHTDTGESVYRRCSEHHHHHLVCRSCGSTIEVGDHEVEAWAAEVATKHGFSDVSHTIE ----3333---------------------------1111--------------------- IFGTCSDCR ----3333- >SPERMIDINE SYNTHASE; SWP:P19623; PDB:2O07A; IREGWFRETCSLWPGQALSLQVEQLLHHRRSRYQDILVFRSKTYGNVLVLDGVIQCTERD -iiii----1111------------------------------------iiii---3333 EFSYQEMIANLPLCSHPNPRKVLIIGGGDGGVLREVVKHPSVESVVQCEIDEDVIQVSKK ------------1111----------3333--------3333------------------ FLPGMAIGYSSSKLTLHVGDGFEFMKQNQDAFDVIITDSSESYYQLMKTALKEDGVLCCQ -----3333-1111--------3333-------------------------1111----- GECQWLHLDLIKEMRQFCQSLFPVVAYAYCTIPTYPSGQIGFMLCSKNPSTNFQEPVQPL --1111-------------------------1111------------11113333----- TQQQVAQMQLKYYNSDVHRAAFVLPEFARKALND -------------------1111-3333------ >BH1327 PROTEIN; SWP:Q9KD90; PDB:2O08A; GNRGKALQLVKPHLTEHRYQHTIGVETAIDLAKLYGADQQKAELAAIFHDYAKFRDKNER ---------3333----------------------------------111111113333- TLIREKLSQQDILFYGDELLHAPCGAYYVREEVGIEDEDVLQAIRFHTTGRPNSLLEKII --------3333---3333----------------------------------------- FLADYIEPNRQFPGVEKVRTQAKTDLNGAIISSLVNTITFLLKKNQPIYPDTLATYNQLL ------1111-2222---------------------------------3333-------- LEQ --- >ALR2278 PROTEIN; SWP:Q8YUQ7; PDB:2O09A; MYGLVNKAIQDMISKHHGEDTWEAIKQKAGLEDIDFFVGMEAYSDDVTYHLVGAASEVLG -3333-------------------------3333---1111--3333------------- KPAEELLIAFGEYWVTYTSEEGYGELLASAGDSLPEFMENLDNLHARVGLSFPQLRPPAF ---------------------------1111--------------------1111----- ECQHTSSKSMELHYQSTRCGLAPMVLGLLHGLGKRFQTKVEVTQTAFRETGEDHDIFSIK --------------------3333---------1111---------3333---------- YE -- >TRANSCRIPTIONAL REGULATOR; SWP:Q833I7; PDB:2O0MA; 
HQIEKETQYFGIQRCIVVAGDSDIQKKVLSDFGDVLTNTLNLLLPNGENTIAVGGTTAVA 3333---1111---------33333333------------------------------33 ENGSLETEKRHNLFVPARGGIGEAVSVQANSISAVANKTGGNYRALYVPEQLSRETYNSL 33----1111-------------3333---------1111-------------------- LQEPSIQEVLTLISHANCVVHSIGRALHAARRKSDDEVLKQKNAVAESFGYFFDEEGKVV ------------1111--------3333-11113333--1111----iiii--1111--- YKIPRIGLQLKNLQEIPYVVAIAGGKTKAKAIRAYKNAPKQTWLITDEAAANEILK --------33331111--------3333----------1111-------------- >HYPOTHETICAL PROTEIN CC05; SWP:Q9AAR9; PDB:2O0QA; TLIYKILSRAEWDAAKAQGRFEGSAVDLADGFIHLSAGEQAQETAAKWFRGQANLVLLAV ------------------------------------3333--------2222-------- EAEPLGEDLKWEASRGGARFPHLYRPLLVSEVTREADLDLDADGVPQLGDHLAL -11111111----2222----------3333---------1111----3333-- >Rv0858c (N-Succinyldiamin; SWP:A1QPT3; PDB:2O0RA; ATVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQAAQDAIAGGVNQYPPG --11111111-3333-----------------------3333-------1111-----11 PGSAPLRRAIAAQRRRHFGVDYDPETEVLVTVGATEAIAAAVLGLVEPGSEVLLIEPFYD 11--------------------3333--------------------2222--------11 SYSPVVAMAGAHRVTVPLVPDGRGFALDADALRRAVTPRTRALIINSPHNPTGAVLSATE 11------------------!!!!--------33331111-------------------- LAAIAEIAVAANLVVITDEVYEHLVFDHARHLPLAGFDGMAERTITISSAAMFNCTGWKI ------------------1111---!!!!---33332222--------------1111-- GWACGPAELIAGVRAAKQYLSYVGGAPFQPAVALALDTEDAWVAALRNSLRARRDRLAAG ----------------1111----1111-------------------------------- LTEIGFAVHDSYGTYFLCADPRPLGYDDSTEFCAALPEKVGVAAIPMSAFCDPADVWNHL --------------------3333-----------------------1111----3333- VRFTFCKRDDTLDEAIRRLSVLAE -------------------1111- >DIAMINOPIMELATE DECARBOXY; SWP:A2VHI9; PDB:2O0TA; NELLHLAPNVWPRNTTRDEVGVVCIAGIPLTQLAQEYGTPLFVIDEDDFRSRCRETAAAF -3333-3333-------1111---iiii-------------------------------- GSGANVHYAAAFLCSEVARWISEEGLCLDVCTGGELAVALHASFPPERITLHGNNKSVSE -3333----------------------------------1111-3333------------ LTAAVKAGVGHIVVDSMTEIERLDAIAGEAGIVQDVLVRLTVGVEAHTHEFISTAHEDQK 
----------------------------------------------!!!!---------- FGLSVASGAAMAAVRRVFATDHLRLVGLHSHIGSQIFDVDGFELAAHRVIGLLRDVVGEF ----1111---------------------------------------------------- GPEKTAQIATVDLGGGLGISYLPSDDPPPIAELAAKLGTIVSDESTAVGLPTPKLVVEPG ----1111-------------3333---3333-------------1111----------3 RAIAGPGTITLYEVGTVKDVDVSATAHRRYVSVDGGMSDNIRTALYGAQYDVRLVSRVSD 333--------------------------------3333-3333---------------- APPVPARLVGKHCESGDIIVRDTWVPDDIRPGDLVAVAATGAYCYSLSSRYNMVGRPAVV -------------1111--------11112222----------3333--2222------- AVHAGNARLVLRRETVDDLLSLEVR --iiii--------3333-1111-- >TRANSCRIPTIONAL REGULATOR; SWP:NA; PDB:2O0YA; GVRSVTRVIDLLELFDAAHPTRSLKELVEGTKLPKTTVVRLVATCARSVLTSRADGSYSL -----------33333333--------------3333---------------1111---- GPELRWVRLAGRTWAPPEEVVDIRQLSADTGETVNLYIRQGLSRVVVAQCESTATVRSVI ---------------------------------------!!!!----------------- PLGVPYPLWAGAAGKILLLAAPELIDDVAADSPHGPEFADQLREKVEDGRERGYQLVHGE 2222----------------1111-------33331111--------------------- RELGSSGLSFPLVDSHGTVVAALTLGGPTGRFTEDRTPHYIECTRAAAEEISAIGLPGL -2222--------1111----------3333-1111-------------------1111 >HYPOTHETICAL PROTEIN YXIM; SWP:P42304; PDB:2O14A; KVYQFDFGSGSEPGYIGVRASDRYDRSKGYGFQTPENRDVAASGAGVKSDAVEFLAYGTK -----------2222---1111--3333-----3333-------!!!!---------111 SNNTFNVDLPNGLYEVKVTLGNTARASVAAEGVFQVINTGDGAEDTFQIPVTDGQLNLLV 1----------------------------iiii------2222----------------- TEGKAGTAFTLSALKIKKLSDQPVTNRTIYVGGDSTVCNYYPLNSSKQAGWGQLPHYIDK ---2222--------------------------1111----1111--------1111-11 HTFQVRNASGGQIARGFRNDGQLEAILKYIKPGDYFLQLGINDTNPKHKESEAEFKEVRD 11------2222-------------3333-2222------33333333------------ IRQVKAKGADVILSTPQGRATDFTSEGIHSSVNRWYRASILALAEEEKTYLIDLNVLSSA ----1111----------1111-1111---1111-3333--------------------- YFTSIGPERTLGLYDGDTLHPNRAGADALARLAVQELKRQGIAGF ----------1111-----------------------11112222 >ACETOIN UTILIZATION PROTE; SWP:Q9KTZ3; PDB:2O16A; SLMIKVEDMMTRHPHTLLRTHTLNDAKHLMEALDIRHVPIVDANKKLLGIVSQRDLLAAQ 
----3333---------1111--------------------1111--------------- ESSLQFETPLFEVMHTDVTSVAPQAGLKESAIYMQKHKIGCLPVVAKDVLVGIITDSDFV --------3333---------1111--------------------iiii-----3333-- TIAINLLELQEE ------------ >THIAMINE BIOSYNTHESIS LIP; SWP:P0AB85; PDB:2O18A; TEVTVLEGKTGTFWRASIPGIDAKRSAELKEKIQTQLDADDQLLSTYKKDSALRFNDSQS -----------------------------------------------1111---3333-- LSPWPVSEAADIVTTSLRIGAKTDGADITVGPLVNLWGFGPEQQPVQIPSQEQIDAKAKT ----------------------iiii1111---3333--1111---------33331111 GLQHLTVINQSHQQYLQKDLPDLYVDLSTVGEGYAADHLARLEQEGISRYLVSVGGALNS 1111----------------------1111------------1111-------!!!!--- RGNGEGLPWRVAIQKPTDKQAVVDINGHGISTSGSYRNRLSHVIDPQTGRPIEHNLVSVT ------------------------2222-------------------------------- VIAPTALEADAWDTGLVLGPEKAKEVVRREGLAVYITKEGDSFKTWSPQFKSFLVSE ----------------------------------------------33331111--- >AMINOTRANSFERASE, CLASS I; SWP:Q5HCZ0; PDB:2O1BA; ISNKLANIPDSYFGEHGPLPLINAVGIPDGPTPQGIIDHFQKALTIPENQKYGAFHGKEA -3333-----1111------------------3333-----33333333----1111--- FKQAIVDFYQRQYNVTLDKEDEVCILYGTKNGLVAVPTCVINPGDYVLLPDPGYTDYLAG -----------------3333------33333333-3333-2222----------3333- VLLADGKPVPLNLEPPHYLPDWSKVDSQIIDKTKLIYLTYPNNPTGSTATKEVFDEAIAK --------------------3333-----1111--------------------------- FKGTDTKIVHDFAYGAFGFDAKNPSILASENGKDVAIEIYSLSKGYNSGFRVGFAVGNKD 2222-------1111---------11112222-----------11113333--------- IQALKKYQTHTNAGFGALQDAAIYALNHYDDFLEEQSNVFKTRRDRFEALAKADLPFVHA ------3333----------------------------------------1111------ KGGIYVWLETPPGYDSEQFEQFLVQEKSILVAPGKPFGENGNRYVRISLALDDQKLDEAA ----------2222-------------------33331111------------------- IRLTELAYLYE -----3333-- >YCDH; SWP:O34966; PDB:2O1EA; KLHVVTTFYPYEFTKQIVKDKGDVDLLIPSSVEPHDWEPTPKDIANIQDADLFVYNSEYE -----------------!!!!-----------3333-----------------------1 TWVPSAEKSGQGHAVFVNASKGIDLEGHADPHVWLSPVLAQKEVKNITAQIVKQDPDNKE 111--------------1111----------3333-------------------3333-- 
YYEKNSKEYIAKLQDLDKLYRTTAKKAEKKEFITQHTAFGYLAKEYGLKQVPIAGLSPDQ ------------------------------------1111-------------------- EPSAASLAKLKTYAKEHNVKVIYFEEIASSKVADTLASEIGAKTEVLNTLEGLSKEEQDK --3333-----1111--------------------------------------------- GLGYIDIKQNLDALKDSLLV ---3333-----3333---- >NONSTRUCTURAL PROTEIN NSP; SWP:Q9PYC1; PDB:2O1JA; IETQMDRVVKEMRRQLEMIDKLTTREIEQVELLKRIYDKLTVRT ---1111------------------------------------- ------------------------------------------- >Probable amino-acid ABC t; SWP:O34852; PDB:2O1MA; KVQTITVGTGTQFPNICFIDEKGDLTGYDVELIKELDKRLPHYKFTFKTEFSNLLVSLGQ -------------------3333----------------1111------1111------- HKVDIVAHQEKSKEREKKFLFNKVAYNHFPLKITVLQNNDTIRGIEDLKGKRVITSATSN -----------3333--------------------1111----11112222--------- GALVLKKWNEDNGRPFEIAYEGQGANETANQLKSGRADATISTPFAVDFQNKTSTIKEKT ------------------------------------------------------------ VGNVLSNAKVYFFNKNEQTLSDDIDKALQEIIDDGTLKRLSLKWLGDDY -------------1111--------------1111-------------- >putative acetyl/propionyl; SWP:A2SM23; PDB:2O1QA; LKSKIKEEYVQDQVDWKPFPAAFSTGGIRWKLLHVSPEGSWTAIFDCPAGSSFAAHVHVG -------------------3333------------------------2222--------- PGEYFLTKGKDVRGGKAAGGDTAIAPGYGYESANARHDKTEFPVASEFYSFLGPLTFVKP -----------iiii1111------------2222-----------------------11 DGSPIAVIGWEDAQGAWAA 11----------------- >UPF0053 PROTEIN HI0107; SWP:Q57017; PDB:2O1RA; AIQQSDGSIIGSANLRDLNKFNWELDTEDARTFNGLILEHLEEIPDEGTICEIDGLLITI ---1111---111--------------------------------2222---iiii---- LEVGDNIKQAKVVKL --------------- >1-DEOXY-D-XYLULOSE-5-PHOS; SWP:P77488; PDB:2O1SA; FDIAKYPTLALVDSTQELRLLPKESLPKLCDELRRYLLDSVSRSSGHFASGLGTVELTVA -1111--3333--333333333333----------------3333----3333------- LHYVYNTPFDQLIWDVGHQAYPHKILTGRRDKIGTIRQKGGLHPFPWRGESEYDVLSVGH ----------------1111--------11111111-2222-----11111111------ SSTSISAGIGIAVAAEKEGKNRRTVCVIGDGAITAGAFEANHAGDIRPDLVILNDNEPGT ----------------------------3333---------3333--------------- 
LFEELGFNYIGPVDGHDVLGLITTLKNRDLKGPQFLHITKKGRGYEPALPSYSKIFGDWL ------------------------------------------------------------ CETAAKDNKLAITPAREGSGVEFSRKFPDRYFDVAIAEQHAVTFAAGLAIGGYKPIVAIY --1111----------11113333--1111-----------------------------3 STFLQRAYDQVLHDVAIQKLPVLFAIDRAGIVGADGQTHQGAFDLSYLRCIPEVITPSDE 33311113333----1111-------------3333------1111-1111--------- NECRQLYTGYHYNDGPSAVRYPRGNAVGVELTPLEKLPIGKGIVKRRGEKLAILNFGTLP --------1111------------------------------------------------ EAAKVAESLNATLVDRFVKPLDEALILEAASHEALVTVEENAIGGAGSGVNEVLAHRKPV ----------------------------1111---------------------------- PVLNIGLPDFFIPQGTQEERAELGLDAAGEAKIKAWLA ---------------3333-1111-3333--------- >1-DEOXY-D-XYLULOSE-5-PHOS; SWP:Q9RUB5; PDB:2O1XA; PGTSDTPLLDQIHGPKDLKRLSREQLPALTEELRGEIVRVCSRGGLHLASSLGAVDIITA -----3333--------33333333----------------------------------- LHYVLDSPRDRILFDVGHQAYAHKILTGRRDQMADIKKEGGISGFTKVSESEHDAITVGH ----------------1111--------33331111-2222-----11111111------ ASTSLTNALGMALARDAQGKDFHVAAVIGDGSLTGGMALAALNTIGDMGRKMLIVLNDNE ----------------------------3333---------------------------- MSISENVGAMNKFMSVNPFAAMGVRYVGPVDGHNVQELVWLLERLVDLDGPTILHIVTTK --------3333----1111------------------------1111----------22 GKGLSYAEADPIYWHGPAKFDPATGEYVPSSAYSWSAAFGEAVTEWAKTDPRTFVVTPAM 22---------1111------------------------------33331111------- REGSGLVEFSRVHPHRYLDVGIAEEVAVTTAAGMALQGMRPVVAIYSTFLQRAYDQVLHD -1111-3333--1111------------------1111-------33333333------- VAIEHLNVTFCIDRAGIVGADGATHNGVFDLSFLRSIPGVRIGLPKDAAELRGMLKYAQT -1111-------------1111------3333-3333----------------------- HDGPFAIRYPRGNTAQVPAGTWPDLKWGEWERLKGGDDVVILAGGKALDYALKAAEDLPG -----------------2222----2222---------------3333----1111-333 VGVVNARFVKPLDEEMLREVGGRARALITVEDNTVVGGFGGAVLEALNSMNLHPTVRVLG 3----------------------------------------------1111--------- IPDEFQEHATAESVHARAGIDAPAIRTVLAELGVDVPI -------------------------------------- >RIBONUCLEOTIDE REDUCTASE ; SWP:P50650; PDB:2O1ZA; 
KKFSDLQKSKEANEKILSKETDRFTLYPILYPDVWDFYKKAEASFWTAEEIDLSSDLKDF 3333---1111--1111------------------------1111-3333--3333---3 EKLNDNEKHFIKHVLAFFAASLASKFLRQVKITEAKKFYAFQIAVENIHSETYSLLIDNY 333------------------3333----------------------------------- IKDEKERMNLFHAIENIPAVKNKALWAAKWINDTNSFAERIVANACVEGILFSGSFCAIF ------------------------------------------------------------ WFKKQNKLHGLTFSNELISRDEGLHTDFNCLIYSLLENKLPEEVVQNIVKEAVEVERSFI --1111--------------------------1111------------------------ CESLPCDLIGMNSRLMSQYIEFVADRLLECLGSPKIFHAKNPFNWMDL -----3333-------------------1111----------1111-- >HADH2 PROTEIN; SWP:Q6IBS9; PDB:2O23A; RSVKGLVAVITGGASGLGLATAERLVGQGASAVLLDLPNSGGEAQAKKLGNNCVFAPADV --2222-----1111---------------------2222---------1111-----11 TSEKDVQTALALAKGKFGRVDVAVNCAGIAVASKTYNLKKGQTHTLEDFQRVLDVNLMGT 11-----------------------------------1111------------------- FNVIRLVAGEMGQNEPDQGGQRGVIINTASVAAFEGQVGQAAYSASKGGIVGMTLPIARD ----------3333--1111----------------2222-------------------- LAPIGIRVMTIAPGLFGTPNFLASQVPFPSRLGDPAEYAHLVQAIIENPFLNGEVIRLDG 3333---------------3333----------3333----------1111-------%% AIRMQPGS %%------ >KIT LIGAND; SWP:NA; PDB:2O26U; PPSIHPAQSELIVEAGDTLSLTCIDPDFVRWTFKTYFNEMVENKKNEWIQEKAEATRTGT -----------------------------------------------------3333--- YTCSNSNGLTSSIYVFVRDPAKLFLVGLPLFGKEDSDALVRCPLTDPQVSQYSLIECDGK ----3333------------------------2222-------------------1111- SLPTDLTFVPNPKAGITIKNVKRAYHRLCVRCAAQRDGTWLHSDKFTLKVREAIKAIPVV --1111----1111-------1111----------------------------------- SVPETSHLLKKGDTFTVVCTIKDVSTSVNSMWLKMNPQPQHIAQVKHNSWHRGDFNYERQ ------------------------------------------------------------ ETLTISSARVDDSGVFMCYANNTFGSANVTTTLKV --------3333----------------------- >KIT LIGAND; SWP:Q64384; PDB:2O27A; CGNNVKDITKLVANLPNDYMITLNYVAGMDVLPSHCWLRDMVIQLSLSLTTLLDKFSNIS -----------11111111------2222---3333----------------1111---- EGLSNYSIIDKLGKIVDDLVLCMEENAPKNISPKRPETRSFTPEEFFSIFNRSIDAFKDF 
------------------------------------------------------------ SDCVLS ------ >GLUCOSAMINE 6-PHOSPHATE N; SWP:Q96EK6; PDB:2O28A; PDETPMFDPSLLKEVDWSQNTATFSPAISPTHPGEGLVLRPLCTADLNRGFFKVLGQLTE -------3333----3333---------3333-2222-----3333--------1111-- TGVVSPEQFMKSFEHMKKSGDYYVTVVEDVTLGQIVATATLIIEHKFIHSCAKRGRVEDV -----------------------------1111--------------%%%%--------- VVSDECRGKQLGKLLLSTLTLLSKKLNCYKITLECLPQNVGFYKKFGYTVSEENYMCRRF --3333-----------------------------3333-3333---------------- LK -- >HYPOTHETICAL PROTEIN GBS1; SWP:Q8E4I8; PDB:2O2AA; AEVIREQEFVNQYHYDARNLEWEEENGTPKTNFEVTFQLANRDEAAKVTSIVAVLQFVIV -------------------------------------------1111------------- RDEFVISGVISQAHIQGRLINEPSEFSQDEVENLAAPLLEIVKRLTYEVTEIALDRPGVT 1111-----------------3333----------------------------------- LEF --- >DIENELACTONE HYDROLASE; SWP:Q3M5Q1; PDB:2O2GA; HQPQEYAVSVSVGEVKLKGNLVIPNGATGIVLFAHGSGSSRYSPRNRYVAEVLQQAGLAT -----------!!!!--------2222--------22221111----------1111--- LLIDLLTQEEEEIDLRTRHLRFDIGLLASRLVGATDWLTHNPDTQHLKVGYFGASTGGGA ------------3333---1111-----------------3333---------------- ALVAAAERPETVQAVVSRGGRPDLAPSALPHVKAPTLLIVGGYDLPVIANEDALEQLQTS --------------------3333-3333-----------11113333----3333---- KRLVIIPRASHLFEEPGALTAVAQLASEWFHYLR ---------1111-2222---------------- >METHIONINE SYNTHASE; SWP:Q99707; PDB:2O2KA; ERRYLPLSQARKSGFQMDWLSEPHPVKPTFIGTQVFEEYDLQKLVDYIDWKPFFDVWQLR ----------1111---3333------------------33333333------------3 GKYPNRGFPKIFNDKGEARKVYDDAHNMLNTLISQKKLRARGVVGFWPAQSIQDDIHLYA 333-2222--1111-------------------------------------!!!!----1 EAAVPQAAEPIATFYGLRQQAENSTEPYYCLSDFIAPLHSGIRDYLGLFAVACFGVEELS 1113333----------------------3333---1111-------------------- KAYEDDGDDYSSIMVKALGDRLAEAFAEELHERVRRELWAYCGSEQLDVADLRRLRYKGI ---1111----------------------------------1111------1111----- RPAPGYPSQPDHTEKLTMWRLADIEQSTGIRLTESLAMAPASAVSGLYFSNLKSKYFAVG --2222-------------1111-3333----1111--------------1111------ KISKDQVEDYALRKNISVAEVEKWLGPILGYD 
------------------------3333---- >FORMYLTETRAHYDROFOLATE DE; SWP:Q5HZB2; PDB:2O2PA; VINYVEKAVNKLTLQMPYQLFIGGEFVDAEGSKTYNTINPTDGSVICQVSLAQVSDVDKA --------%%%%---------iiii---2222---------------------------- VAAAKEAFENGLWGKINARDRGRLLYRLADVMEQHQEELATIEALDAGAVYTLALKTHVG ----------3333---------------------------------------------- MSIQTFRYFAGWCDKIQGATIPINQARPNRNLTLTKKEPVGVCGIVIPWNYPLMMLSWKT ---------3333-------------------------------------3333------ AACLAAGNTVVIKPAQVTPLTALKFAELTLKAGIPKGVVNILPGSGSLVGQRLSDHPDVR ---1111-------1111----------------2222------3333------------ KIGFTGSTEVGKHIMKSCALSNVKKVSLELGGKSPLIIFADCDLNKAVQMGMSSVFFNKG --------------------------------------1111--------------%%%% ENCIAAGRLFVEESIHNQFVQKVVEEVEKMKIGNPLERDTNHGPQNHEAHLRKLVEYCQR -1111------3333-----------1111---1111----------------------- GVKEGATLVCGGNQVPRPGFFFQPTVFTDVEDHMYIAKEESFGPIMIISRFADGDVDAVL ------------------------------11111111-------------2222----- SRANATEFGLASGVFTRDINKALYVSDKLQAGTVFINTYNKTDVAAPFGGFKQSGFGKDL ------------------------------------------1111----!!!!------ GEAALNEYLRIKTVTFEY 33333333---------- >ENOYL-ACYL CARRIER REDUCT; SWP:Q6UCJ9; PDB:2O2SA; PIDLRGQTAFVAGVADSHGYGWAIAKHLASAGARVALGTWPPVLGLFQKSLQSGRLDEDR ---2222---------------------1111-------3333-------3333-3333- KLPDGSLIEFAGVYPLDAAFDKPEDVPQDIKDNKRYAGVDGYTIKEVAVKVKQDLGNIDI -1111----------------3333-3333--3333------------------------ LVHSLANGPEVTKPLLETSRKGYLAASSNSAYSFVSLLQHFGPIMNEGGSAVTLSYLAAE -------1111--3333-3333------------------3333-2222------3333- RVVPGYGGGMSSAKAALESDTRTLAWEAGQKYGVRVNAISAGPLKSRAASAIGKSGEKSF --2222---------------------------------------3333-2222------ IDYAIDYSYNNAPLRRDLHSDDVGGAALFLLSPLARAVSGVTLYVDNGLHAMGQAVDSRS -------------------------------3333----------iiii--------111 MPP 1-- >MULTIPLE PDZ DOMAIN PROTE; SWP:O75970; PDB:2O2TA; CDEFDQLIKNMAQGRHVEVFELLKPPSGGLGFSVVGLRSENELGIFVQEIQEGSVAHRDG 3333-------iiii-----------------------------------2222------ RLKETDQILAINGQALDQTITHQQAISILQKAKDTVQLVIARGSLPQYYKV 
--2222----iiii--3333------------------------3333--- >HYPOTHETICAL PROTEIN; SWP:Q98I56; PDB:2O2XA; PHPLTEPGVWIERIGGRVFPPHLPALFLDRDGTINVDTDYPSDPAEIVLRPQLPAIATAN -----1111--------------------2222---------3333-------------1 RAGIPVVVVTNQSGIARGYFGWSAFAAVNGRVLELLREEGVFVDVLACAYHEAGVGPLAI 111---------------------------------1111----------1111-1111- PDHPRKPNPGLVEAGKRLALDLQRSLIVGDKLADQAGKRAGLAQGWLVDGEAAVQPGFAI --------------------3333------3333---1111-----2222---------- RPLRDSSELGDLLAAIETLGRDNR ----------------1111---- >ENOYL-ACYL CARRIER REDUCT; SWP:Q9BH77; PDB:2O2YA; DICFIAGIGDTNGYGWGIAKELSKRNVKIIFGIWPPVYNIFMKNYKNGKFDNDMIIDKDK ----------------------1111-------3333----------11111111----- KMNILDMLPFDASFDTANDIDEETKNNKRYNMLQNYTIEDVANLIHQKYGKINMLVHSLA ---------------3333-3333--3333------------------------------ NAKEVQKDLLNTSRKGYLDALSKSSYSLISLCKYFVNIMKPQSSIISLTYHASQKVVPGY ---33333333-----------------------3333-2222------3333---2222 GGGMSSAKAALESDTRVLAYHLGRNYNIRINTISAGPLKSRAATAINYTFIDYAIEYSEK iiii-----------------------------------3333----------------- YAPLRQKLLSTDIGSVASFLLSRESRAITGQTIYVDNGLNIMFLPDDIYR --------1111---------3333------------3333--------- >HYPOTHETICAL PROTEIN; SWP:Q9K706; PDB:2O2ZA; GKKKNVIVFGGGTGLSVLLRGLKTFPVSITAIVTVADDGGSSGRLRKELDIPPPGDVRNV --------------------3333----------------------1111---------- LVALSEVEPLLEQLFQHRFENGGLSGHSLGNLLLAGTSITGDFARGISESKVLNVRGKVL -----------------------2222-----------------------1111------ PASNRSIILHGEEDGTIVTGESSIPKAGKKIKRVFLTPKDTKPLREGLEAIRKADVIVIG -------------------11111111----------1111--3333------------- PGSLYTSVLPNLLVPGICEAIKQSTARKVYICNVTQNGETDGYTASDHLQAIDHCGVGIV ----------1111---------------------22222222----------------- DDILVHGEPISDTVKAKYAKEKAEPVIVDEHKLKALGVGTISDYFVLEQVLRHNASKVSE ------------------1111-----------1111----------------------- AILE ---- >NUCLEAR MOVEMENT PROTEIN; SWP:Q8SSJ3; PDB:2O30A; AKYTWDQELNEINIQFPVTDSSAIKIRVGKKICVKNQGEIVIDGELLHEVDVSSLWWVIN -------1111----------------!!!!----iiii-----------3333-----! 
GDVVDVNVTKKRNEWWDSLLV !!!------------------ >HYPOTHETICAL PROTEIN; SWP:Q72D34; PDB:2O34A; SLPQHVHTSPVRDYRNRCARREGETVFQVVVEETDLRVTALAELATPAAYVGELRAQLKV ------------------------------!!!!-------------------------- WEFQPAFRHSLVPVEVPEGAPEVVRRAHGARLVGVGPFAAVAGTIAQVAERFVDVSPELI ---1111---------11113333-----------3333------------3333----- VENGGDLYLYSERDRVVGILPDPASGDVGILVRAGTAPVSLCGSSARIGHSLSLGDGDLA --!!!!----------------3333------2222------------------------ VVRARDASLADAAATAFGNLRRADDVAAVTERAAQLASIGIEGVYAQCGGRIGIWGDELA ----------------------3333---------3333--------iiii--------- V - >HYPOTHETICAL PROTEIN DUF1; SWP:Q92M60; PDB:2O35A; SEISPEQRTAFEAAVFRRLLEHLRERSDVQNIDLNLAGFCRNCLSNWYREAAEASGVPSK -------------------------11113333-------------------1111---- EESREIVYGPYEEWRT ---------3333--- >THIMET OLIGOPEPTIDASE; SWP:P52888; PDB:2O36A; LRWDLSAQQIEERTRELIEQTKRVYDQVGTQEFEDVSYESTLKALADVEVTYTVQRNILD ---------------------------11113333-3333-------------------- FPQHVSPSKDIRTASTEADKKLSEFDVEMSMREDVYQRIVWLQEKVQKDSLRPEAARYLE 3333------------------------------------------2222---------- RLIKLGRRNGLHLPRETQENIKRIKKKLSLLCIDFNKNLNEDTTFLPFTLQELGGLPEDF ---------1111-----------------------------------3333----3333 LNSLEKMEDGKLKVTLKYPHYFPLLKKCHVPETRRKVEEAFNSRCKEENSAILKELVTLR 1111--3333------3333---------------------------------------- AQKSRLLGFHTHADYVLEMNMAKTSQTVATFLDELAQKLKPLGEQERAVILELKRAECER ----1111-------33333333------------------------------------- RGLPFDGRIRAWDMRYYMNQVEETRYCVDQNLLKEYFPVQVVTHGLLGIYQELLGLAFHH ---------1111----------------------------------------------- EEGASAWHEDVRLYTARDAASGEVVGKFYLDLYPREGKYGHAACFGLQPGCLRQDGSRQI -------1111-----------------------2222--------------1111---- AIAAMVANFTKPTADAPSLLQHDEVRTYFHEFGHVMHQLCSQAEFAMFSGTHVETDFVEA --------------------3333--------------------3333!!!!-1111--- PSQMLENWVWEQEPLLRMSRHYRTGSAVPRELLEKLIESRQANTGLFNLRQIVLAKVDQA -----3333-----3333------------------------------------------ LHTQTDADPAEEYARLCQEILGVPATPGTNMPATFGHLAGGYDAQYYGYLWSEVYSMDMF 
-------------------------22223333-3333---2222--------------- HTRFKQEGVLNSKVGMDYRSCILRPGGSEDASAMLRRFLGRDPKQDAFLLSKGL ----------------------3333-----------------------1111- >PROTEIN SIS1; SWP:P25294; PDB:2O37A; MVKETKLYDLLGVSPSANEQELKKGYRKAALKYHPDKPTGDTEKFKEISEAFEILNDPQK --------1111-1111----------------1111----------------1111--- REIYDQYGLEAARSGGPSFGP ------------22223333- >HYPOTHETICAL PROTEIN; SWP:Q6N370; PDB:2O38A; PDAEERQTKLRLAYALNAVIDRARLSQAAAAARLGINQPKVSALRNYKLEGFSVERLTLL -------------------------------------------1111-11113333---- NALDQDVEIVIRKKPRSRAAARISVVAA 1111------------------------ >FIBER PROTEIN; SWP:P35774; PDB:2O39A; DNINTLWTGVNPTEANCQIMNSSESNDCKLILTLVKTGALVTAFVYVIGVSNNFNMLTTH -------------------1111----------------------------3333-1111 RNINFTAELFFDSTGNLLTRLSSLKTPLNHKSGQNMATGAITNAKGFMPSTTAYPFNDNS -----------1111---1111---------!!!!-------3333-----------111 REKENYIYGTCYYTASDRTAFPIDISVMLNRRAINDETSYCIRITWSWNTGDAPEVQTSA 11111---------1111--------------------------------------1111 TTLVTSPFTFYYIREDD ----------------- >Membrane cofactor protein; SWP:P15529; PDB:2O39C; CEEPPTFEAMELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRNHTWLPVSDDAC ------1111----------2222------2222----------------------1111 YRETCPYIRDPLNGQAVPANGTYEFGYQMHFICNEGYYLIGEEILYCELKGSVAIWSGKP -----------------1111--------------------------------------- PICEKV ------ >UPF0106 PROTEIN AF_0751; SWP:O29507; PDB:2O3AA; LEVYVLRLGHRPDKRISTHVALTARAFGAKGIYFDTEDKSVFESVRDVVERWGGDFFIKA ------------------------1111-------------------------------- VSWKKLLREFDGLKVHLTMYGIPLPQKLEEIKRADKVLVVVGPPEVYELCDLNISIGTQP -----------------1111-3333----------------1111-------------- HSEVAALAVFLDRVLGKVFDISFDDAKIKVIPSERGKRVVS -3333-----------3333--3333--------------- >Sugar-non-specific nuclea; SWP:Q7A260; PDB:2O3BB; STKTNSEILEQLKQASDGLLFMSESEYPFEVFLWEGSAPPVTHEIVLQQTGHGQDAPFKV ---------------2222----------------------------------------- VDIDSFFSRATTPQDWYEDEENAVVAKFQKLLEVIKSNLKNPQVYRLGEVELDVYVIGET 
--------1111-1111------------------------------------------1 PAGNLAGISTKVVET 111------------ >NEUROLYSIN; SWP:P42676; PDB:2O3EA; MSSYTAAGRNVLRWDLSPEQIKTRTEQLIAQTKQVYDTVGTIALKEVTYENCLQVLADIE -----2222-----------------------------11113333-3333--------- VTYIVERTMLDFPQHVSSDREVRAASTEADKKLSRFDIEMSMREDVFQRIVHLQETCDLE -----3333--3333------------------------------------------333 KIKPEARRYLEKSIKMGKRNGLHLSEHIRNEIKSMKKRMSELCIDFNKNLNEDDTSLVFS 3-------------------1111-----------------------------------3 KAELGALPDDFIDSLEKTDEDKYKVTLKYPHYFPVMKKCCVPETRRKMEMAFHTRCKQEN 3332222----1111------------3333----------------------------- TAILQQLLPLRAQVAKLLGYNTHADFVLELNTAKSTSRVAAFLDDLSQKLKPLGEAEREF ---------------1111-------33333333-------------------------- ILSLKKKECEERGFEYDGKINAWDLHYYMTQTEELKYSVDQESLKEYFPIEVVTEGLLSI --------------------1111---------------3333----------------- YQELLGLSFEQVPDAHVWNKSVSLYTVKDKATGEVLGQFYLDLYPREGKYNHAACFGLQP ------------------1111-----------------------2222----------- GCLLPDGSRMMSVAALVVNFSQPVAGRPSLLRHDEVETYFHEFGHVMHQICAQTDFARFS ---1111----------------iiii----------------------------1111! 
GTNVERDFVEVPSQMLENWVWDVDSLRKLSKHYKDGHPITDELLEKLVASRLVNTGLLTL !!!-1111----33333333--3333---------------------11112222----- RQIVLSKVDQSLHTNATLDAASEYAKYCTEILGVAATPGTNMPATFGHLAGGYDGQYYGY ------------------------------------22223333-3333---2222---- LWSEVFSMDMFHSCFKKEGIMNPEVGMKYRNLILKPGGSLDGMDMLQNFLQREPNQKAFL ------------------1111-----------1111----------------------- MSRGL 1111- >PUTATIVE HTH-TYPE TRANSCR; SWP:Q45581; PDB:2O3FA; ATGGLAIIQSHLPPSERKLADYILAHPHAIESTVNEISALANSSDAAVIRLCSLGLKGFQ ------------3333-----------------------------------------333 DLRVAGDLAKPTFQG 3--------3333-- >PUTATIVE PROTEIN; SWP:Q9JYP7; PDB:2O3GA; ESLTVEGALEYVELAPQLNLPQQEEDADFHTVAGLIEELQTIPDVGDFADFHGWRFEVVE ---------33333333------1111---------3333---2222---iiii------ KEGQRIERVKITKLP --------------- >HYPOTHETICAL PROTEIN; SWP:Q7NTB2; PDB:2O3IA; AFELSPSDLEPLLQGACFFGSGGGGTISARHLAANFRKGDYYPTDKVRVVDVDEATDGDC ----3333-----------iiii---------1111--3333--------1111------ VVAYGAPDAINQVQWPNGPVEAALAARQRLESQGRKLAYVVAPESGALGFVVASLVAAKL ------3333--------------------1111-------------------------- GLAVVDADGAGRAVPSLPLTYAAAGVPPTPAFLAGESGLCVELGVRPPPDREDISTVVEQ ------------------------------------------------------------ LRPILTNPQFGQFGGLAWSPAQLGGALPVRGTLSRALKLGRALQDGKVKTAEALDFLRRE -3333-3333--------3333-3333----------------------3333------- LDIKGKLLFGPATLASPGKVVLEDGERRCTVLYQNESLLAWDSALSHPLATAPDAISYFV -----------------------------------------1111--------------- EGEGQHVFSNGDLSGNDHGLDPSVRGRKAAVIALPAAAPLSEGLILQSFADELAQLGYLG --------3333---------3333-----------3333-------------1111--- PYAPVD ------ >UDP-GLUCOSE 6-DEHYDROGENA; SWP:Q19905; PDB:2O3JA; DQVFGKVSKVVCVGAGYVGGPTCAMIAHKCPHITVTVVDMNTAKIAEWNSDKLPIYEPGL -----------------------------1111-------------1111------2222 DEIVFAARGRNLFFSSDIPKAIAEADLIFISVNTPTKMYGRGKGMAPDLKYVESVSRTIA 3333--2222-----------------------------2222----------------- QYAGGPKIVVEKSTVPVKAAESIGCILREAQKLKFQVLSNPEFLAEGTAMKDLANPDRVL ---------------2222--------1111-------------2222------------ 
IGGESSPEGLQAVAELVRIYENWVPRNRIITTNTWSSELSKLVANAFLAQRISSINSISA ------------------------1111-------------------------------- VCEATGAEISEVAHAVGYDTRIGSKFLQASVGFGGSCFQKDVLSLVYLCESLNLPQVADY -------------------------------------------------1111------- WQGVININNWQRRRFADKIIAELFNTVTDKKIAIFGFAFKKNTGDTRESSAIHVIKHLME ----------------------%%%%2222------------------------------ EHAKLSVYDPKVQKSQMLNDLASVTSAQDVERLITVESDPYAAARGAHAIVVLTEWDEFV ---------------------1111-------------3333-2222--------3333- ELNYSQIHNDMQHPAAIFDGRLILDQKALREIGFRTFAIGTSPDQ -------1111--------------------------2222---- >HYPOTHETICAL PROTEIN; SWP:Q734F7; PDB:2O3LA; GEYKARVAALPEDYQFVFKKIQNYWNFSAGNGDLHIQYELIDLFEAGAAEGRQVLDITGE 33333333--------------------------------------------3333---- DVASFADELVANAKTYV ----------3333--- >ADP-RIBOSYL CYCLASE 1; SWP:P28907; PDB:2O3SA; FWRQTWSGPGTTKRFPETVLARCVKYTEIHPEMRHVDCQSVWDAFKGAFISKHPCDITEE -----------2222--------------3333---------------2222-----333 DYQPLMKLGTQTVPCNKILLWSRIKDLAHQFTQVQRDMFTLEDTLLGYLADDLTWCGEFD 3-----1111---1111----------------------1111------2222----222 TSKINYQSCPDWRKDCSNNPVSVFWKTVSRRFAEAACDVVHVMLDGSRSKIFDKDSTFGS 2---------3333----3333----------1111----------------11113333 VGVHNLQPEKVQTLEAWVIHGGREDSRDLCQDPTIKELESIISKRNIQFSCKNIYRPDKF -1111-3333-----------------1111-----------1111-------------- LQCVKNPEDSSC -----1111--- >COVALENT DIMER HIV-1 PROT; SWP:NA; PDB:2O40A; PQITLWKRPLVTIRIGGQLKEALLDTGADDTVIEENLPGWKPKIGGIGGFIKVRQYDQIP --------------iiii------1111-----------------1111----------- VEIGHKAIGTVLVGPTPVNIIGRNLLTQIGTLNFGGGGPQITLWKRPLVTIRIGGQLKEA ---------------------33333333-----------------------iiii---- LLDTGADDTVIEENLPGWKPKIGGIGGFIKVRQYDQIPVEIGHKAIGTVLVGPTPVNIIG --1111-----------------2222--------------------------------- RNLLTQIGTLNF --3333------ >M11L PROTEIN; SWP:Q85295; PDB:2O42A; SRLKTAVYDYLNDVDITECTEDLLCQLSNCCDFINETYAKNYDTLYDIERDILSYNIVNI ---------1111--------------------------------------1111----- 
KNTLTFALRDASPSVKLATLTLLASVIKKLNKIQHTDAAFSEVIDGIVAEEQQVIGFIQK -----1111--------------------3333--------------------------- KCKYNTTYYNVRS --------1111- >ERYTHRONATE-4-PHOSPHATE D; SWP:Q9I3W9; PDB:2O4CA; MRILADENIPVVDAFFADQGSIRRLPGRAIDRAALAEVDVLLVRSVTEVSRAALAGSPVR -----1111-3333-1111------3333-33331111-----3333--33332222--- FVGTCTIGTDHLDLDYFAEAGIAWSSAPGCNARGVVDYVLGCLLAMAEVRGADLAERTYG --------1111--------------2222----------------------1111---- VVGAGQVGGRLVEVLRGLGWKVLVCDPPRQAREPDGEFVSLERLLAEADVISLHTPLNRD ---------------1111-------------1111------------------------ GEHPTRHLLDEPRLAALRPGTWLVNASRGAVVDNQALRRLLEGGADLEVALDVWEGEPQA -------------11112222------1111----------------------------- DPELAARCLIATPHIAGYSLEGKLRGTAQIYQAYCAWRGIAERVSLQDVLPETWLAGLQL 33331111------11113333----------------------3333------------ NPGCDPAWALATLCRAVYDPRSDDAAFRRSLTGDSATRRAAFDALRKHYPPRREITGLRV 1111------------------------1111---------------------3333--- ATGGQAELQRVVRALGAQLV -%%%%--------------- >HYPOTHETICAL PROTEIN PA02; SWP:Q9I6M1; PDB:2O4DA; TTRLEWAKASPDAYAALGLEKALAKAGLERPLIELVYLRTSQINGCAYCVNHANDARKAG ----3333--------------1111--------------------3333------1111 ETEQRLQALCVWQETPYFTPRERAALAWTEQLARLSQGALPHGLLDELREHFDDKEIAEL -------3333----------------------1111---1111---3333--------- TLAVSAINAWNRFGVGGQPE -------------------- >VITAMIN D3 RECEPTOR; SWP:VDR_RAT; PDB:2O4JA; KLSEEQQHIIAILLDAHHKTYDPTYADFRDFRPPVRMPLSMLPHLADLVSYSIQKVIGFA ---------------------1111--1111-------1111------------------ KMIPGFRDLTSDDQIVLLKSSAIEVIMLRSNQSFTMDDMSWDCGSQDYKYDVTDVSKAGH --2222----------------------3333------------3333------------ TLELIEPLIKFQVGLKKLNLHEEEHVLLMAICIVSPDRPGVQDAKLVEAIQDRLSNTLQT 3333-----------3333---------------1111---------------------- YIRCRHPPPGSHQLYAKMIQKLADLRSLNEEHSKQYRSLSFQPENSMKLTPLVLEVFGNE --------1111-----------------------------33331111----------- >GAG-POL POLYPROTEIN (PR16; SWP:P03367; PDB:2O4NA; PQITLWKRPLVTVKIGGQLKEALLDTGADDTIFEEMSLPGRWKPIMIGGIGGFIKVRQYD 
--------------iiii------1111--------------------2222-------- QILIEICGHKAIGTVLVGPTPLNVIGRNLLTQIGCTLNF -----iiii----------------3333-1111----- >BH3976 PROTEIN; SWP:Q9K5W1; PDB:2O4TA; HVSRVEKLPKDYQIVYKEIQKYLFKVGPVELNEGIGLLSEILGFFEEGAAAGKGVLDVTG --3333------------------------------------------1111-3333--- TDVAAFCDALIGDSKTYADLYQESIQQHVD --------------------------3333 >RAS-RELATED PROTEIN RAB-4; SWP:P61018; PDB:2O52A; SDFLFKFLVIGSAGTGKSCLLHQFIEVEFGSRVVNVGGKTVKLQIWDTAGQERFRSVTRS -----------2222--------------------%%%%----------3333----333 YYRGAAGALLVYDITSRETYNSLAAWLTDARTLASPNIVVILCGNKKDLDPEREVTFLEA 32222-------1111------------------1111-------33331111------- SRFAQENELMFLETSALTGENVEEAFLKCARTILNKIDSGELDPERM ----1111----------2222--------------------1111- >PUTATIVE GLYCEROPHOSPHODI; SWP:NA; PDB:2O55A; SKVIIPKIVGHRGVGKEGLAPENTLRSFVLCERNIPYIETDLRVCKTGEIVLFHGTPEGT -------------!!!!------3333-----------------1111------------ IPFYKDGTSRIGDLSLEELKRLDVGGGHTIPSLEELFVAIEEQKFNLKLNLELKGEEWKR 3333-----3333--3333-------------3333--------------------3333 KESGDHQRLLLLVEKYHQERVDYCSFHHEALAHLKALCPDVKITYLFNYGQPTPLDFVEQ -------------------------------------3333------------1111--- ACYGDANGVSLFHYLTKEQVCTAHEKGLSVTVWPWIFDDSEEDWKKCLELQVDLICSNYP ----------3333---------1111------3333--3333---------------33 FGLNFLSN 33--3333 >PUTATIVE MANDELATE RACEMA; SWP:Q8ZKY6; PDB:2O56A; LMKITSVDIIDVANDFKWRPVVVKINTDEGISGFGEVGLAYGVGASAGIGMAKDLSAIII --------------------------1111-------------3333-------333322 GMDPMNNEAIWEKMLKKTFWGQGGGGIFSAAMSGIDIALWDIKGKAWGVPLYKMLGGKSR 223333------------3333---------------------------3333------- EKIRTYASQLQFGWGDGSDKDMLTEPEQYAQAALTAVSEGYDAIKVDTVAMDRHGNWNQQ ---------1111-2222------3333--------1111-----------1111----- NLNGPLTDKILRLGYDRMAAIRDAVGPDVDIIAEMHAFTDTTSAIQFGRMIEELGIFYYE ------3333------------------------%%%%------------1111------ EPVMPLNPAQMKQVADKVNIPLAAGERIYWRWGYRPFLENGSLSVIQPDICTCGGITEVK ------------------------1111--3333--------------1111---3333- 
KICDMAHVYDKTVQIHVCGGPISTAVALHMETAIPNFVIHELHRYALLEPNTQTCKYNYL ----3333----------------------------------3333--3333-------- PKNGMYEVPELPGIGQELTEETMKKSPTITVK -iiii------!!!!-------1111------ >putative sarcosine dimeth; SWP:NA; PDB:2O57A; SKTVKDNAEIYYDDDDSDRFYFHVWGGEDIHVGLYKEPVDQDEIREASLRTDEWLASELA ------------------------!!!!---------3333------------------- TGVLQRQAKGLDLGAGYGGAARFLVRKFGVSIDCLNIAPVQNKRNEEYNNQAGLADNITV ----2222------!!!!-----------------------------------3333--- KYGSFLEIPCEDNSYDFIWSQDAFLHSPDKLKVFQECARVLKPRGVAITDPKEDGIDKSS ---1111---------------3333---------------2222-------22223333 IQPILDRIKLHDGSLGLYRSLAKECGLVTLRTFSRPDSLVHHYSKVKAELIKRSSEIASF -----1111---------------------------------------------1111-- CSPEFQANKRGLEHWIEGGRAGKLTWGGLFRKSDKI ------------------1111-------------- >BH1328 PROTEIN; SWP:Q9KD89; PDB:2O5AA; SNQELLQLAVNAVDDKKAEQVVALNKGISLDFFLICHGNSEKQVQAIAHELKKVAQEQGI ------------------------------------------------------------ EIKRLEGYEQARWVLIDLGDVVVHVFHKDERAYYNLEKLWPTVELEG -------3333---------------111133333333--------- >HYPOTHETICAL PROTEIN; SWP:Q9K0R7_NEIMB; PDB:2O5HA; ARLNNHDVHKRYQDRLEEDVEFTINYELPLSCLWSTIKDFSSDFEEKTEAFFILFKELLR -333------3333--------------3333----1111-------------------- RGHLKLQRDGQIIGHTPEEWEQIFREVWPEYEIEPNPFDIGWLTVEAPAYAVWIDPEDGS -------iiii------------------------------------------------- EYW --- >GLUTAMYL-TRNA SYNTHETASE ; SWP:Q9X172; PDB:2O5RA; VRVRFAPSPTGFLHVGGARTALFNFLFARKEKGKFILRIEDTDLERSEREYEEKLESLRW ----------------------------1111--------11113333------------ LGLLWDEGPDVGGDHGPYRQSERVEIYREHAERLVKEGKAYYVYAYPEEIEEREKLLSEG ------------1111--3333------------1111-------3333-------1111 KAPHYSQEFEKFDTPERRREYEEKGLRPAVFFKPRKDYVLNDVVKGEVVFKTGAIGDFVI --------3333---------1111-------------------------2222------ RSNGLPTYNFACVVDDLEITHVIRGDDHLSNTLRQLALYEAFEKAPPVFAHVSTILGPDG 1111--------------------33333333------------------------1111 KKLSKRHGATSVEAFRDGYLPEALVNYLALLGWSHPEGKELLTLEELISSFSLDRLSPNP 
---3333------------3333----1111---1111-------------3333----- AIFDPQKLKWNGYYLRNPIEKLAELAKPFFEKAGIKIIDEEYFKKVLEITKERVEVLSEF ------------------------------1111---------------3333--3333- PEESRFFFEDPAPVEIPEEKEVFSQLKEELQNVRWTEEITPVFKKVLKQHGVKPKEFYTL 33333333------------3333---1111---------------------3333---- RRVLTGREEGPELVNIIPLLGKEIFLRRIERSLG -----------33333333---------3333-- >DNA REPLICATION AND REPAI; SWP:Q9RVE0; PDB:2O5VA; GDVRLSALSTLNYRNLAPGTLNFPEGVTGIYGENGAGKTNLLEAAYLALTGQTDAPRIEQ ------------2222----------------2222--------------------3333 LIQAGETEAYVRADLQQGGSLSIQEVGLGRGRRQLKVDGVRARTGDLPRGGAVWIRPEDS --2222----------!!!!--------iiii----iiii--3333---------11113 ELVFGPPSGRRAYLDSLLSRLSARYGEQLSRYERTVSQRNAALRGGEEWAMHVWDDVLLK 333--3333---------------------------------11113333-1111----- LGTEIMLFRRRALTRLDELAREANAQLGSRKTLALTLTESTSPETYAADLRGRRAEELAR -----------------------------------------3333--------------- GSTVTGPHRDDLLLTLGDFPASDYASRGEGRTVALALRRAELELLREKFGEDPVLLLDDF -----1111------iiii3333-----------------------------------33 TAELDPHRRQYLLDLAASVPQAIVTGTELAPGAALTLRAQAGRFTPVADEEMQAEGTA 33-------------------------------------%%%%-----3333------ >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2O62A; KSQWECFLQNLGVWEGSFSNFSPEGTLLNDTSSRLCLEGLNNNQTVRLTLSRSGKDDVIR -------1111----------1111---------------%%%%-------2222----- EFRSVGGGLLFFENGSFSEGLIQLGPFSEFGGELAFVHENRRLRLVQLFDRNGHLNGLTL -----!!!!--1111---------1111---------!!!!--------1111------- IREHLAGTPVAERPLLQINDLLGEWRGQAVTIYRDLRPPDIYSTTLKIQLDDAGRLQSTS ----2222--------3333------------1111------------------------ FGERTITSTATIKGSIVLFDQDPEKQVQVLLLPDGASATSPLKVQLRQPLFLEAGWLIQS !!!!--------!!!!-----1111-------%%%%------------------------ DLRQRIRSYNDKGEWVSLTLVTEERV -------------------------- >PII PROTEIN; SWP:Q9ZST4_ARATH; PDB:2O66A; SSDYIPDSKFYKVEAIVRPWRIQQVSSALLKIGIRGVTVSDVRGFGEDKFVAKVKMEIVV -----------------1111--------1111--------------------------- KKDQVESVINTIIEGARTGEIGDGKIFVLPVSDVIRVRTGERGEKAE 1111--------------------------------------3333- 
>34 KDA MEMBRANE ANTIGEN; SWP:P19478; PDB:2O6FA; DEFPIGEDRDVGPLHVGGVYFQPVEMHPAPGAQPSKEEADCHIEADIHANEAGKDLGYGV ----------!!!!---------------!!!!-3333-----------33331111-22 GDFVPYLRVVAFLQKHGSEKVQKVMFAPMNAGDGPHYGANVKFEEGLGTYKVRFEIAAPS 22------------2222------------1111--------1111-------------1 HDEYSLHIDEQTGVSGRFWSEPLVAEWDDFEWKGPQW 111-----3333------------------------- >UPF0346 PROTEIN MW1311; SWP:Q7BEF3; PDB:2O6KA; YSFYQFVTVRGRHDDKGRLAEEIFDDLAFPKHDDDFNILSDYIETHGDFTLPSVFDDLYE -3333---2222-------------1111------------------------------- EYTEWLKFLE ---------- >IRON-REGULATED SURFACE DE; SWP:Q1Y717; PDB:2O6PA; GSDSGTLNYEVYKYNTNDTSIANDYFNKPAKYIKKNGKLYVQITVNHSHWITGMSIEGHK -------------------3333-----------iiii--------3333-----iiii- ENIISKNTAKDERTSEFEVSKLNGKIDGKIDVYIDEKVNGKPFKYDHHYNITYKFNGPTD -------1111--------------------------iiii------------------- VAG --- >VARIABLE LYMPHOCYTE RECEP; SWP:Q32R26; PDB:2O6QA; NEALCKKDGGVCSCNNNKNSVDCSSKKLTAIPSNIPADTKKLDLQSNKLSSLPSKAFHRL ----3333-------1111----------------1111--------------------1 TKLRLLYLNDNKLQTLPAGIFKELKNLETLWVTDNKLQALPIGVFDQLVNLAELRLDRNQ 111-------------11111111------------------1111-------------- LKSLPPRVFDSLTKLTYLSLGYNELQSLPKGVFDKLTSLKELRLYNNQLKRVPEGAFDKL ------1111-1111-------------22223333------------------1111-1 TELKTLKLDNNQLKRVPEGAFDSLEKLKMLQLQENPWDCTCNGIIYMAKWLKKKADEGLG 111-------------22221111------------------------------1111-- GVDTAGCEKGGKAVLEITEKDAASDCVSPN 1111--------3333-------------- >VARIABLE LYMPHOCYTE RECEP; SWP:Q4G1L2; PDB:2O6RA; CPSRCSCSGTEIRCNSKGLTSVPTGIPSSATRLELESNKLQSLPHGVFDKLTQLTKLSLS -1111--!!!!---------------1111-------------22221111--------- QNQIQSLPDGVFDKLTKLTILYLHENKLQSLPNGVFDKLTQLKELALDTNQLKSVPDGIF -------22221111----------------22221111----------------22221 DRLTSLQKIWLHTNPWDCSCPRIDYLSRWLNKNSQKEQGSAKCSGSGKPVRSIICPT 111----------------3333---------3333------------3333----- >VARIABLE LYMPHOCYTE RECEP; SWP:Q4G1L3; PDB:2O6SA; CPSRCSCSGTTVECYSQGRTSVPTGIPAQTTYLDLETNSLKSLPNGVFDELTSLTQLYLG 
-2222---------------------1111-------------22221111--------- GNKLQSLPNGVFNKLTSLTYLNLSTNQLQSLPNGVFDKLTQLKELALNTNQLQSLPDGVF -------22221111----------------22221111----------------22221 DKLTQLKDLRLYQNQLKSVPDGVFDRLTSLQYIWLHDNPWDCTCPGIRYLSEWINKHSGV 111----------------22221111-----------------1111--------1111 VRNSAGSVAPDSAKCSGSGKPVRSIICP --3333--1111--------1111---- >Ubiquitin [Fragment]; SWP:Q45TR8; PDB:2O6VB; MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGQLEDGRTLSDYNI ------1111-------1111---------------3333---------33333333--- QRESTLHLVLRLRGG 2222----------- >PUTATIVE HISTIDINE AMMONI; SWP:Q3IWB0; PDB:2O6YA; PKPAVELDRHIDLDQAHAVASGGARIVLAPPARDRCRASEARLGAVIREARHVYGLTTGF ----------------------------------------------------2222---! GPLANRLISGENVRTLQANLVHHLASGVGPVLDWTTARAMVLARLVSIAQGASGASEGTI !!!-----1111------------------------------------------------ ARLIDLLNSELAPAVPSRGTVGDLTPLAHMVLCLQGRGDFLDRDGTRLDGAEGLRRGRLQ --------------------------------1111-----1111---------1111-- PLDLSHRDALALVNGTSAMTGIALVNAHACRHLGNWAVALTALLAECLRGRTEAWAAALS ---11111111---------------------------------------3333------ DLRPHPGQKDAAARLRARVDGSARVVRHVIAERRLDAGDIGTEPEAGQDAYSLRCAPQVL ----------------1111--------3333---1111----------3333------- GAGFDTLAWHDRVLTIELNAVTDNPVFPPDGSVPALHGGNFMGQHVALTSDALATAVTVL ---------------------------1111-------1111------------------ AGLAERQIARLTDERLNRGLPPFLHRGPAGLNSGFMGAQVTATALLAEMRATGPASIHSI ------------1111iiii2222--------!!!!------------------1111-- STNAANQDVVSLGTIAARLCREKIDRWAEILAILALCLAQAAELRCGSGLDGVSPAGKKL --iiii----------------------------------------1111---------- VQALREQFPPLETDRPLGQEIAALATHLLQQSPV ---3333-------------------3333---- >DEATH DOMAIN-CONTAINING P; SWP:P78560; PDB:2O71A; HILNSSPSDRQINQLAQRLGPEWEPMVLSLGLSQTDIYRCKANHPHNVQSQVVEAFIRWR 3333-----------------------1111------------1111------------- QRFGKQATFQSLHNGLRAVEVDPSLLLHMLE --!!!!---------------------1111 >PROBABLE RNA POLYMERASE S; SWP:P66809; PDB:2O7GA; 
TASDDEAVTALALSAAKGNGRALEAFIKATQQDVWRFVAYLSDVGSADDLTQETFLRAIG ------------------------------------------3333-------------- AIPRFSARSSARTWLLAIARHVVADHIR 3333--------------------1111 >Oligopeptide ABC transpor; SWP:Q9WXN8; PDB:2O7IA; SLPREDTVYIGGALWGPATTWNLYAPQSTWGTDQFMYLPAFQYDLGRDAWIPVIAERYEF --3333---------------1111---2222------------1111------------ VDDKTLRIYIRPEARWSDGVPITADDFVYALELTKELGIGPGGGWDTYIEYVKAVDTKVV -1111-----3333-1111---3333---------------2222----------1111- EFKAKEENLNYFQFLSYSLGAQPMPKHVYERIRAQMNIKDWINDKPEEQVVSGPYKLYYY -----------------1111---3333--------3333----3333------------ DPNIVVYQRVDDWWGKDIFGLPRPKYLAHVIYKDNPSASLAFERGDIDWNGLFIPSVWEL 1111--------1111-------------------------1111----------3333- WEKKGLPVGTWYKKEPYFIPDGVGFVYVNNTKPGLSDPAVRKAIAYAIPYNEMLKKAYFG ----------------------------1111-1111------1111------------- YGSQAHPSMVIDLFEPYKQYIDYELAKKTFGTEDGRIPFDLDMANKILDEAGYKKGPDGV -----3333----33331111----------1111--------------------1111- RVGPDGTKLGPYTISVPYGWTDWMMMCEMIAKNLRSIGIDVKTEFPDFSVWADRMTKGTF ----------------2222---------------------------------------- DLIISWSVGPSFDHPFNIYRFVLDKRLSKPVGEVTWAGDWERYDNDEVVELLDKAVSTLD ----------1111---------3333--2222-----1111------------------ PEVRKQAYFRIQQIIYRDMPSIPAFYTAHWYEYSTKYWINWPSEDNPAWFRPSPWHADAW ------------------------------------------1111------1111--33 PTLFIISKKSDPQPVPSWLGTVDEGGIEIPTAKIFEDLQKAT 33-----1111----3333-3333-----3333-----1111 >CXE CARBOXYLESTERASE; SWP:Q0ZPV7; PDB:2O7RA; LLKYLPIVLNPDRTITRPIQIPSTAASPDPTSSSPVLTKDLALNPLHNTFVRLFLPRHAL 3333------------------------1111-----------3333--------3333- YNSAKLPLVVYFHGGGFILFSAASTIFHDFCCEMAVHAGVVIASVDYRLAPEHRLPAAYD --------------iiii--1111------------------------------------ DAMEALQWIKDSRDEWLTNFADFSNCFIMGESAGGNIAYHAGLRAAAVADELLPLKIKGL ---------------------1111----------------------3333--------- VLDEPGFGGSKRTGSELRLANDSRLPTFVLDLIWELSLPMGADRDHEYCNPTPLYSFDKI ------------------1111----------------22221111-------3333--- 
RSLGWRVMVVGCHGDPMIDRQMELAERLEKKGVDVVAQFDVGGYHAVKLEDPEKAKQFFV --------------1111----------1111-----------2222------------- ILKKFVV ------- >TRANSCRIPTIONAL REGULATOR; SWP:Q8NQ14; PDB:2O7TA; RADALKRREHIITTTCNLYRTHHHDSLTENIAEQAGVGVATLYRNFPDRFTLDACAQYLF ----------------------1111---------------------3333--------- NVVISLQLQAISTFPTDPEGVWTSFNQLLFDRGLGSLVPALAPESLDDLPDEVSALRRTT ------------3333----------------3333--------3333------------ EKNTTTLINLAKQHGLVHHDIAPGTYIVGLITISRPPITALATISENSHKALLGLYLSGL -----------1111--1111----------1111--3333-3333-------------- KHG --- >HYPOTHETICAL PROTEIN ATU2; SWP:Q8UD01; PDB:2O8IA; VTREEFVARFGGVFEHSPFIAERAYDAGGAGLELTAKAVHGALCAQFRVASEAERLGVLR ---------1111--------------3333---------------------------11 AHPDLAGKLAIAGELTGLDRLSPQEHARFTQLNSAYTEKFGFPFIIAVKGLNRHDILSAF 11---------3333-1111---------------------------2222--------- DTRIDNNAAQEFATATGQVEKIAWLRLASLPEG -3333-3333----------------------- >Histone-lysine N-methyltr; SWP:A2BED6; PDB:2O8JA; KIICRDVARGYENVPIPCVNGVDGEPCPEDYKYISENCETSTMNIDRNITHLQHCTCVDD -----1111--------------------------------------1111--------- CSSSNCLCGQLSIRCWYDKDGRLLQEFNKIEPPLIFECNQACSCWRNCKNRVVQSGIKVR --1111---1111----1111--1111-----------1111--1111--3333------ LQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVREDDSYLFDLDEVYCIDARY -------------------2222----------33331111------------------- YGNISRFINHLCDPNIIPVRVFMLHQDLRFPRIAFFSSRDIRTGEELGFDYGDRFWDYFT ----1111------------------3333-----------2222------3333----- CQCGSEKCKHSAEA ----1111--3333 >V8 PROTEASE; SWP:Q99V45; PDB:2O8LA; VILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDP ------------11111111--------1111-----------------------iiii3 HALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNN 333-----------1111-----------------------1111-1111---------1 AETQTNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEV 1112222-------1111---------------!!!!----------2222---1111-- IGIHWGGVPNEFNGAVFINENVRNFLKQNIEDINFA -------2222------------------1111--- >14-3-3 DOMAIN CONTAINING ; 
SWP:Q5CYG0; PDB:2O8PA; DERLLQKYRAQVFEWGGCFDKFEALKSLIYLSEFENSEFDDEERHLLTLCIKHKISDYRT -----------------3333--------------------------------------- TSQVLQEQTKQLNNDELVKICSEYVFSLRKDIKAFLQSFEDCVDRLVEKSFFSKFFKLKV -------1111-------------------------------1111-------------- KSDISRYKLEFGLCSLEDSKKIHQDAFTLLCEHPDKIEQLPLGFIQNLAYILSEKYGEKK --------1111--------------------333311113333---------------- QVFNLNSLGKILELQIKEQENDRKAQITVYLQGIK ----------------------------------- >HYPOTHETICAL PROTEIN; SWP:Q13HN3; PDB:2O8QA; GKLQTTIQHEPKDGSGFDRREFFEYRDTGVNEATGGFGAHVIRAIPPTWHTHTVGFQLFY -------------------1111-----3333---------------------------- VLRGWVEFEYEDIGAVLEAGGSAFQPPGVRHRELRHSDDLEVLEIVSPAGFATSVVDLE -----------------2222----2222-------1111------------------- >POLYPHOSPHATE KINASE; SWP:Q7MTR1; PDB:2O8RA; SAYPFFRRDSWLSFNERVLEAADRTLPVYDRIKFLSIFSSNLEEFYTVRVAYHQAVLQKH -------------------1111---3333-------------------------3333- ILQAIRETVIRQDELYYRIFYDQILPTLEEHGIRLRTHAPTHPDHKAYLRRFFHEEIFPL --------------------------------------------------------3333 LYPLLLPSKVRTFIRSGRVYLAVRLKEKETDEAYSYALLNVPTDGLPRFVELPRLQTDTF -----3333--------------------------------1111---------1111-- YYYSFLEDIIKEHLDVVFPGYEVDSYSIKVSRDADLLLDAPTRFYDGRPDEVLRYICSSC ------------3333-1111--------------------------------------- DIDPEEAIRSGNYVNLQDLALPNPFAPRLETLTPEPLLSKHLEQAPSLEGIRRKDYLIHV ----------------3333------------------3333------3333-------- PYYTYDYVVRLLEAAISPDVSEIRLTQYRVAENSSIISALEAAAQSGKKVSVFVELKARF ------------33333333-----------------------1111------------- NLRLSERRRSGIRIVYSPGLKVHAKTALILYHTPAGERPQGIALLSTGNFNETTARIYSD -------3333------------------------------------------------- TTLTANTDIVHDVYRLFRILDGDPEPARFSRLLVARYNGEAITNLIEREIENVKRGKRGY -----3333---------1111-----------2222---------------1111---- LLKNGLQDKNVITQLYRASEAGVEIDLIVRGICCLVPDPQSRNIRVTRLVDYLEHSRIWC ----------------------------------------1111---------------- FHNGGKEEVFISSADWKRNLYNRIETACPVLDPTLRREIIDILEIQLRDNIKACRIDSSL 
----------------3333-------------------------1111-------1111 NNIYKHNSDEKPVRAQAAIYRYLKGKEETT ------3333---3333-------3333-- >AGR_C_984P; SWP:Q7D182_AGRT5; PDB:2O8SA; PPSPQPVSHKVTSTYTSYRLISQDIGKSLERVSKQPDVARETEYYREKIGSVKSIDDFAD -----------------------3333--------------------3333--3333--- TRLYNYALKAHGLEDAYAKAFIRKVLTEGASDKNAFANKLSDNRYAELAKSLDFAGLGAA --------1111-------------------11111111-------------3333!!!! ATATEAAKSGVIGNYARQTLEQEAGDDNNGVRLALYFERKAPTIKSGLDFLADDALAQVF ---3333----------------33333333---------3333--------3333---- RTTFNLAADVDKQAALIEKSINIKDLQDPEKVGKLLERFTIWEQNP -----------3333------1111--------------------- >PROBABLE RNA POLYMERASE S; SWP:P66809; PDB:2O8XA; MGFEDLVEVTTMIADLTTDQREALLLTQLLGLSYADAAAVCGCPVGTIRSRVARARDALL -3333-----3333---3333----------------------3333------------- A - >DNA-binding protein HU-be; SWP:DBHB_ECOLI; PDB:2O97B; MNKSQLIDKIAAGADSKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAKVPSF ----------------------------------1111---------------------- RAGKALKDAVN ----------- >BACTERIOPHYTOCHROME; SWP:Q9RZA4; PDB:2O9CA; DPLPFFPPLYLGGPEITTENCEREPIHIPGSIQPHGALLTADGHSGEVLQMSLNAATFLG -------1111-----111111111111----1111---------------1111----- QEPTVLRGQTLAALLPEQWPALQAALPPGCPDALQYRATLDWPAAGHLSLTVHRVGELLI -333322223333-------------22221111--------------------!!!!-- LEFEPTEAWDSTGPHALRNAMFALESAPNLRALAEVATQTVRELTGFDRVMLYKFAPDAT ------1111---------------------------------------------1111- GEVIAEARREGLHAFLGHRFPASDIPAQARALYTRHLLRLTADTRAAAVPLDPVLNPQTN --------2222--2222--3333------------------1111-------------- APTPLGGAVLRATSPMHMQYLRNMGVGSSLSVSVVVGGQLWGLIACHHQTPYVLPPDLRT ----1111-------------1111----------iiii--------------------- TLESLGRLLSLQVQVKEAH ------------------- >AQUAPORIN Z; SWP:AQPZ_ECOLI; PDB:2O9GA; HMFRKLAAESFGTFWLVFGGSGSAVLAAGFPELGIGFAGVALAFGLTVLTMAFAVGHISG 3333--------------------------------------------------1111-- GHFNPAVTIGLWAGGRFPAKEVVGYVIAQVVGGIVAAALLYLIASGKTGFDAAASGFASN ---3333----1111--3333-------------------------22223333-%%%%- 
GYGEHSPGGYSMLSALVVELVLSAGFLLVIHGATDKFAPAGFAPIAIGLACTLIHLISIP --1111----------------------------111122223333-----------333 VTNTSVNPARSTAVAIFQGGWALEQLWFFWVVPIVGGIIGGLIYRTLLEKR 3------------3333---3333-3333---------------------- >Monellin chain B; SWP:P02882; PDB:2O9UX; GEWEIIDIGPFTQNLGKFAVDEENKIGQYGRLTFNKVIRPCMKKTIYENEGFREIKGYEY -----------------------3333--------------------------------- QLYVYASDKLFRADISEDYKTRGRKLLRFNGPVPPP -----%%%%---------1111-------------- >BH2720 PROTEIN; SWP:Q9K9C9; PDB:2OA2A; VTDHGPRPFVVNIEDETKRNRAFRRALWTGDHLQVTLSIQVGEDIGLEIHPHLDQFLRVE ---------------------------------------2222------1111------- EGRGLVQGHRQDNLHFQEEVFDDYAILIPAGTWHNVRNTGNRPLKLYSIYAPPQHPHGTV ---------1111-------2222----2222-----------------------2222- HETKAIAAA --3333--- >SIR5; SWP:Q5LST8; PDB:2OA4A; MMFLRKVEGPRSVTLPDGSIMTRADLPPANTRRWVASRKIAVVRGVIYGLITLAEAKQTY -------3333----------------------------------------3333----- GLSDEEFNSWVSALAEHGKDALKVTALKKYRQLLEHHHHHH -----------------------3333-------------- >HYPOTHETICAL PROTEIN BQLF; SWP:P88989_MHV68; PDB:2OA5A; PDKTYEEVKEVERLKLENKTLKQKVDSILTAAKRESIIVSSSRALGAVARKIEAKVRSRA ---3333--------------1111----------------------------------3 AKAVTEQELTSLLQSLTLRVDVSEELEHH 333-3333----1111------------- >ARISTOLOCHENE SYNTHASE; SWP:NA; PDB:2OA6A; SLEPPPSTFQPLCHPLVEEVSKEVDGYFLQHWNFPNEKARKKFVAAGFSRVTCLYFPKAL -------------1111--------------------------------------1111- DDRIHFACRLLTVLFLIDDLLEYMSFEEGSAYNEKLIPISRGDVLPDRSIPVEYIIYDLW --------------------1111------------------------------------ ESMRAHDREMADEILEPVFLFMRAQTDRTRARPMGLGGYLEYRERDVGKELLAALMRFSM ------------------------------------------------------------ GLKLSPSELQRVREIDANCSKHLSVVNDIYSYEKELYTLCTSVQILAQEADVTAEAAKRV --------------------------------3333------------------------ LFVMCREWELRHQLLVARLSAEGLETPGLAAYVEGLEYQMSGNELWSQTTLRYSV -------------------------------------------------3333-- >THREE PRIME REPAIR EXONUC; SWP:Q91XB0; PDB:2OA8A; TLPHGHQTLIFLDLEATGLPSSRPEVTELCLLAVHRRALENTQGHPPPVPRPPRVVDKLS 
------------------1111------------3333---------------------- LCIAPGKACSPGASEITGLSKAELEVQGRQRFDDNLAILLRAFLQRQPQPCCLVAHNGDR ------------------------1111---------------1111---------3333 YDFPLLQTELARLSTPSPLDGTFCVDSIAALKALEQASKSYSLGSIYTRLYWQAPTDSHT ------------------1111-------------------------------------- AEGDVLTLLSICQWKPQALLQWVDEHARPFSTVKPYGT ----------1111--------------3333------ >R.MVAI; SWP:Q8RNV5; PDB:2OAAA; SMSEYLNLLKEAIQNVVDGGWHETKRKGNTGIGKTFEDLLEKEEDNLDAPDFHDIEIKTH --1111----------1111-------3333------1111----------!!!!----- ETAAKSLLTLFTKSPTNPRGANTMLRNRYGKKDEYGNNILHQTVSGNRKTNSNSYNYDFK 3333-------------2222-----------1111------------------------ IDIDWESQVVRLEVFDKQDIMIDNSVYWSFDSLQNQLDKKLKYIAVISAESKIENEKKYY ---------------1111----------------------------------%%%%--- KYNSANLFTDLTVQSLCRGIENGDIKVDIRIGAYHSGKKKGKTHDHGTAFRINMEKLLEY ------------------------------------1111------------11111111 GEVKVIV ------- >THIOESTERASE SUPERFAMILY; SWP:Q28UM1; PDB:2OAFA; QPRPDSAFVHDVRVTWGDCDPAKIAYTGHLPRFALEAIDAWWSEYHGPGGWYHLELDTNV ---1111-------1111-1111--3333-----------------2222---------- GTPFVRLEDFKSPVTPRHILKCHTWPTRLGTKSITFRVDGVQDGVTCFVGAFTCVFTIAD -----------------------------------------iiii------------333 QFKSQPAPDHLRALIEPHIPA 3-------------3333--- >HEMOLYSIN; SWP:Q87DZ3_XYLFT; PDB:2OAIA; EDALVTREDGSFLIDGTLPIEELREVLGANNYHTLAGCISYFGRIPHVGEYFDWAGWRIE ------1111----1111---------------3333--3333---2222---iiii--- IVDLDGARIDLLLQRLN ----!!!!--------- >Type II secretion system ; SWP:O29598_ARCFU; PDB:2OAP1; HYDILRRHIRSEDLLETPEFGSGSRIVEEYWIQEPFTKAIIVENEDEFRNVYYALEPTVS -3333------------------------------------------------------3 SEEAEVISALYDDLKKILVLQDVSVDLEERAEVLVRAIEKTDNFYSRLYYLFRDFFGYGL 333---------33331111-33333333------3333-------------------11 IDPLEDTNVEDISCDGYNIPIFIYHQKYGNVETNIVLDQEKLDRVLRLTQRSGKHISIAN 11------------------------------------3333--------------3333 PIVDATLPDGSRLQATFGTEVTPRGSSFTIRKFTIEPLTPIDLIEKGTVPSGVLAYLWLA 
------1111--------1111-------------------------------------- IEHKFSAIVVGETASGKTTTLNAIFIPPDAKVVSIEDTREIKLYHENWIAEVTRTGGEGE -----------2222-----------1111------------------------------ IDYDLLRAALRQRPDYIIVGEVRGREAQTLFQASTGHASYSTLHAGDINQVYRLESEPLK --------3333----------------------------------3333---------- VPRSLQFLDIALVQTWVRGNTRLRRTKEVNEILGIDPVDKNLLVNQFVKWDPKEDKHIEV ----1111-------------------------------------------1111----- SPKKLEKADFLGVSVQEVYDELSRKRYLELLKRGIRNYKEVTRYIHAYYRNPELATKEEG -3333-------------------------1111-------------------------- L - >4-HYDROXYBUTYRATE COENZYM; SWP:Q8EG98; PDB:2OASA; PAIVCQSALEAVSLIRSGETLWTHSGATPKVLLDALAKHALTLDNITLLQLHTEGAESLS -----------3333-------------3333-------1111------------3333- HPSLLGHLRHRCFFGGVPTRPLLQSGDADYVPIFLSEVPKLFRSGEQKIDTAIIQVSPPD 3333-----------1111---1111-------3333----------------------1 KHGCSLGISVEATLAACQVAGKIIAHINPQPRTHGDGFIHIDRFAAVYEQSASLPIHSFA 111--!!!!!!!!--------------------------1111----------------- TGDAVSLAIGQHVAELVRDGDCLQGIGAIPDAVLSCLTGHKDLGVHTELFSDGILQLVEK -------------1111-------------------1111----------3333---111 GVINNTKKRFYPGKLVTGFALGSQKLYDYVDDNPAVIFDIEQVNDTSIIRKNPNVAINSA 1---1111------------------------1111--3333--3333------------ LQVDLTGQVCADSIGTKIYSGVGGQDFIRGAGLSEGGRSVIALPSTAAGGRISRIASVLS ---1111------!!!!----------------2222---------iiii---------2 PGAGVVTTRAHVHYIVTEYGAANLKGRSLRERAQALINIAHPDFREQLSRDAFEVWGLNL 222----3333-----1111---2222--------1111-1111---------------- >ORNITHINE AMINOTRANSFERAS; SWP:P04181; PDB:2OATA; GPPTSDDIFEREYKYGAHNYHPLPVALERGKGIYLWDVEGRKYFDFLSSYSAVNQGHCHP ------------------------------!!!!--1111------%%%%--1111---- KIVNALKSQVDKLTLTSRAFYNNVLGEYEEYITKLFNYHKVLPMNTGVEAGETACKLARK --------1111------------------------------------------------ WGYTVKGIQKYKAKIVFAAGNFWGRTLSAISSSTDPTSYDGFGPFMPGFDIIPYNDLPAL -----------------2222----33331111-33332222------------------ ERALQDPNVAAFMVEPIQGEAGVVVPDPGYLMGVRELCTRHQVLFIADEIQTGLARTGRW -----1111---------3333----2222------------------------1111-- 
LAVDYENVRPDIVLLGKALSGGLYPVSAVLCDDDIMLTIKPGEHGSTYGGNPLGCRVAIA 3333-----------!!!!iiii--------3333----2222--1111----------- ALEVLEEENLAENADKLGIILRNELMKLPSDVVTAVRGKGLLNAIVIKETKDWDAWKVCL -----1111---------------33331111------!!!!-------1111------- RLRDNGLLAKPTHGDIIRFAPPLVIKEDELRESIEIINKTILSF ---------------------1111------------------- >HUMAN MAK3 HOMOLOG; SWP:Q9GZZ1; PDB:2OB0A; SKGSRIELGDVTPHNIKQLKRLNQVIFPVSYNDKFYKDVLEVGELAKLAYFNDIAVGAVC 2222-------3333----------------3333------!!!!-----%%%%------ CRVDHSQNQKRLYITLGCLAPYRRLGIGTKLNHVLNICEKDGTFDNIYLHVQISNESAID -----%%%%---------3333---3333----------------------1111----- FYRKFGFEIIETKKNYYKRIEPADAHVLQKNL --1111-------------------------- >PARATHION HYDROLASE; SWP:P0A433; PDB:2OB3A; DRINTVRGPITISEAGFTLTHEHICGSSAGFLRAWPEFFGSRKALAEKAVRGLRRARAAG ----1111--3333-------------222233333333-----------------1111 VRTIVDVSTFDIGRDVSLLAEVSRAADVHIVAATGLWFDPPLSMRLRSVEELTQFFLREI --------1111----------------------------3333---------------- QYGIEDTGIRAGIIVATTGKATPFQELVLKAAARASLATGVPVTTHTAASQRDGEQQAAI ---!!!!----------------------------------------3333--------- FESEGLSPSRVCIGHSDDTDDLSYLTALAARGYLIGLDHIPYSAIGLEDNASASALLGIR -1111-3333----1111----------1111------1111-2222------------- SWQTRALLIKALIDQGYMKQILVSNDWTFGFSSYVTNIMDVMDRVNPDGMAFIPLRVIPF ------------11113333--------------1111-------3333----------- LREKGVPQETLAGITVTNPARFLSPTLRA -1111-3333------------------- >Ubiquitin-conjugating enz; SWP:P49427; PDB:2OB4A; SQKALLLELKGLQEEPVEGFRVTLVDEGDLYNWEVAIFGPPNTYYEGGYFKARLKFPIDY ----------------2222-----1111-------------1111----------1111 PYSPPAFRFLTKMWHPNIYETGDVCISILHPPVQNVRTILLSVISLLNEPNTFSPANVDA --------------11113333--3333-------------------------------- SVMYRKWKESKGKDREYTDIIRKQVLGTKVDAERDGV ------------------------------------- >HYPOTHETICAL PROTEIN ATU2; SWP:Q8UDV3_AGRT5; PDB:2OB5A; GLKNIDPALNADVLHALRAGHGDTLVISDTNFPSDSVARQTTVGKVLHIDNVSAARAKAI -22223333----------2222-----------------3333--------3333---- 
LSVLPLDTPLQPSVGREVGAPDQLEPVQVEVQQEIDAAEGKSAPYGIERFAFYEKAKQAY ------3333---------1111------------------------------------- CVITTGETRFYGCFLLTKGVIP ---------------------- >PROBABLE 6-PYRUVOYL TETRA; SWP:Q9I0H2; PDB:2OBAA; HELFKEFTFESAHRLPHVPEGHKCGRLHGHSFRVAIHIEGEVDPHTGWIRDFAEIKAIFK ------------------11111111------------------------3333------ PIYEQLDHNYLNDIPGLENPTSENLCRWIWQQLKPLLPELSKVRVHETCTSGCEYRGD ---------33332222---------------33331111------------------ >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2OBBA; ATIAVDFDGTIVEHRYPRIGEEIPFAVETLKLLQQEKHRLILWSVREGELLDEAIEWCRA ------2222------------2222--------------------------------11 RGLEFYAANKDYPEEHQGFSRKLKADLFIDDRNVGGIPDWGIIYEIKEKKTFADIYSQ 11---------------------------1111------------1111--------- >CHOLESTERYL ESTER TRANSFE; SWP:P11597; PDB:2OBDA; TSHEAGIVCRITKPALLVLNHETAKVIQTAFQRASYPDITGEKAMMLLGQVKYGLHNIQI --------------------------------------------2222------------ SHLSIASSQVELVEAKSIDVSIQDVSVVFKGTLKYGYTTAWWLGIDQSIDFEIDSAIDLQ ------------2222-----------------------1111----------------- INTQLTADSGRVRTDAPDCYLSFHKLLLHLQGEREPGWIKQLFTNFISFTLKLVLKGQIC -------iiii------------------2222-----3333------------------ KEINVISNIMADFVQTRAASILSDGDIGVDISLTGDPVITASYLESHHKGHFIYKDVSED -------------------1111!!!!------------1111----------%%%%--- LPLPTFSPTLLGDSRMLYFWFSERVFHSLAKVAFQDGRLMLSLMGDEFKAVLETWGFNTN ----------------------------------------------------1111-111 QEIFQEVVGGFPSQAQVTVHCLKMPKISCQNKGVVVDSSVMVKFLFPRPDQQHSVAYTFE 11111-----1111---------------3333----------------3333------- EDIVTTVQASYSKKKLFLSLLDFQITPKTVSNLTESSSESIQSFLQSMITAVGIPEVMSR -----------%%%%--------------------------------------------- LEVVFTALMNSKGVSLFDIINPEIITRDGFLLLQMDFGFPEHLLVDFLQSLS --------33333333----------2222-----------------1111- >SELT/SELW/SELH SELENOPROT; SWP:Q4KGC5; PDB:2OBKA; RKPEVIITYCTQCQWLLRAAWLAQELLSTFSDDLGKVSLEPATGGAFRITCDGVQIWERK ----------3333------------------------------------iiii-----1 ADGGFPEAKVLKQRVRDQIDPERD 111--------------------- >ESCN; SWP:O52140; 
PDB:2OBLA; SHKIRVGDALLGRLIDGIGRPMESNIVAPYLPFERSLYAEPPDPLLRQVIDQPFILGVRA ------1111-----1111--------------------------------------333 IDGLLTCGIGQRIGIFAGSGVGKSTLLGMICNGASADIIVLALIGERGREVNEFLALLPQ 3------2222-----------------------------------------------33 STLSKCVLVVTTSDRPALERMKAAFTATTIAEYFRDQGKNVLLMMDSVTRYARAARDVGL 331111-----3333-------------------1111---------------------- ASGEPDVRGGFPPSVFSSLPKLLERAGPAPKGSITAIYTVLLESDNVNDPIGDEVRSILD -------iiii3333-------3333----------------------------1111-- GHIVLTRELAEENHFPAIDIGLSASRVMHNVVTSEHLRAAAECKKLIATYKNPELLIRIG ---------1111-----3333--1111-----------------------33333333- EYTMGQDPEADKAIKNRKLIQNFIQQSTKDISSYEKTIESLFKVVA --------------------------1111-----------3333- >HYPOTHETICAL PROTEIN; SWP:Q3M7B8; PDB:2OBNA; NQRVAILLHEGTTGTIGKTGLALLRYSEAPIVAVIDRNCAGQSLREITGIYRYVPIVKSV -------2222------------------------3333---3333------------33 EAALEYKPQVLVIGIAPKGGIPDDYWIELKTALQAGSLVNGLHTPLANIPDLNALLQPGQ 333333---------------1111--------------------11113333---2222 LIWDVRKEPANLDVASGAARTLPCRRVLTVGTDAIGKSTSLELHWAAKLRGWRSKFLATG ---1111---------3333---------------------------------------3 QTGVLEGDGVALDAVRVDFAAGAVEQVRYGKNYDILHIEGQGSLLHPGSTATLPLIRGSQ 333-------3333-3333---------1111----------1111----------1111 PTQLVLVHRAGQTHNGNNPHVPIPPLPEVIRLYETVASGGGAFGTVPVVGIALNTAHLDE --------2222--3333------3333---------%%%%-------------3333-- YAAKEAIAHTIAETGLPCTDVVRFGADVLLDAVQN -------------------3333------------ >PUTATIVE DNA-BINDING PROT; SWP:Q46TT3; PDB:2OBPA; GIDPAIVEVLLVLREAGIENGATPWSLPKIAKRAQLPSVLRRVLTQLQAAGLADVSVEAD --------------11112222-------------------------1111------111 GRGHASLTQEGAALAAQLFP 1------------------- >S-ADENOSYLMETHIONINE SYNT; SWP:Q00266; PDB:2OBVA; VFMFTSESVGEGHPDKICDQISDAVLDAHLKQDPNAKVACETVCKTGMVLLCGEITSMAM ---------1111---------------33331111--------2222------------ VDYQRVVRDTIKHIGYDDSAKGFDFKTCNVLVALEQQSPDIAQCVHLDRNEEDVGAGDQG -----------------3333--3333----------3333----22223333------- 
LMFGYATDETEECMPLTIILAHKLNARMADLRRSGLLPWLRPDSKTQVTVQYMQDNGAVI ---------1111-----------------------1111--------------iiii-- PVRIHTIVISVQHNEDITLEEMRRALKEQVIRAVVPAKYLDEDTVYHLQPSGRFVIGGPQ -----------------------------3333--3333-1111----3333-----333 GDAGVTGRKIIVDTYGGWGAHGGGAFSGKDYTKVDRSAAYAARWVAKSLVKAGLCRRVLV 3-------------iiii-------22223333----------------1111------- QVSYAIGVAEPLSISIFTYGTSQKTERELLDVVHKNFDLRPGVIVRDLDLKKPIYQKTAC ----2222---------iiii------------------1111--1111----3333--- YGHFGRSEFPWEVPRKLVF -----33331111------ >HYDROXYACYLGLUTATHIONE HY; SWP:Q8ZRM2; PDB:2OBWA; SMNLNSIPAFQDNYIWVLTNDEGRCVIVDPGEAAPVLKAIAEHKWMPEAIFLTHHHHDHV -------------------1111--------------------------------11111 GGVKELLQHFPQMTVYGPAETQDKGATHLVGDGDTIRVLGEKFTLFATPGHTLGHVCYFS 111------3333----1111---------2222---iiii------------------- RPYLFCGDTLFSGGCGRLFEGTPSQMYQSLMKINSLPDDTLICCAHEYTLANIKFALSIL -----!!!!-2222------------------11111111-------------------1 PHDSFINEYYRKVKELRVKKQMTLPVILKNERKINLFLRTEDIDLINEINKETILQQPEA 111-------------1111------3333333311111111--------------3333 RFAWLRSKKDTF -------3333- >PUTATIVE QUINONE OXIDORED; SWP:QORX_HUMAN; PDB:2OBYA; PMLAVHFDKPGGPENLYVKEVAPSPGEGEVLLKVAASALNRADLMQRQGQYDPPPGASNI -----------3333------------------------3333----------------- LGLEASGHVAELGPGCQGHWKIGDTAMALLPGGGQAQYVTVPEGLLMPIPEGLTLTQAAA --------------------2222-----------------3333----22223333--- IPEAWLTAFQLLHLVGNVQAGDYVLIHAGLSGVGTAAIQLTRMAGAIPLVTAGSQKKLQM -3333-------------2222--------3333-------------------------- AEKLGAAAGFNYKKEDFSEATLKFTKGAGVNLILDCIGGSYWEKNVNCLALDGRWVLYGL -1111-----3333----------iiii----------1111---11112222------- MGGGDINGPLFSKLLFKRGSLITSLLRSRDNKYKQMLVNAFTEQILPHFSTEGPQRLLPV -------------------------1111-----------------------1111---- LDRIYPVTEIQEAHKYMEANKNIGKIVLELPQ -----3333-------1111------------ >Tyrosine-protein phosphat; SWP:Q99952; PDB:2OC3A; DSASFLERLAVLAGEFSDIQACSAAWKADGVCSTVAGSRPENVRKNRYKDVLPYDQTRVI -----3333------------------------3333-33331111-1111--3333--- 
LSLLQEEGHSDYINGNFIRGVDGSLAYIATQGPLPHTLLDFWRLVWEFGVKVILMACREI -2222--------------1111----------1111----------------------- ENGRKRCERYWAQEQEPLQTGLFCITLIKEKWLNEDIMLRTLKVTFQKESRSVYQLQYMS iiii--------2222---!!!!----------1111--------iiii----------- WPDRGVPSSPDHMLAMVEEARRLQGSGPEPLCVHCSAGCGRTGVLCTVDYVRQLLLTQMI ------------------------------------------------------1111-- PPDFSLFDVVLKMRKQRPAAVQTEEQYRFLYHTVAQMFC 1111--------11112222------------------- >HYPOTHETICAL PROTEIN; SWP:Q7V6D4; PDB:2OC5A; EALPDFTSDRYKDAYSRINAIVIEGEQEAHDNYIAIGTLLPDHVEELKRLAKERHKKGFT ----1111-------------------------------3333----------------- ACGKNLGVEADDFAREFFAPLRDNFQTALGQGKTPTCLLIQALLIEAFAISAYHTYIPVS ---1111---------------------1111-----------------------3333- DPFARKITEGVVKDEYTHLNYGEAWLKANLESCREELLEANRENLPLIRRLDQVAGDAAV ----------------------------3333-------------------1111----- LQDKEDLIEDFLIAYQESLTEIGFNTREITRAAAAL -------------------------------3333- >YDHG PROTEIN; SWP:Q797E6; PDB:2OC6A; GDVFSEYLAGIADPFHRERTEEVLTWIKNKYPNLHTEIKWNQPFTDHGTFIIGFSVSKKH -1111-------------------------1111----%%%%---iiii-------1111 LAVAPEKVTIAHVEDDIVKAGYDYTEQLIRIPWNGPVDYTLLEKIEFNILDKADCSTFWR ------------------------1111---1111--------------1111------- K - >RAS-RELATED PROTEIN RAB-9; SWP:Q9NP90; PDB:2OCBA; GKSLLLKVILLGDGGVGKSSLMNRYVTNKFDSQAFHTIGVEFLNRDLEVDGRFVTLQIWD ------------2222--------------------------------%%%%-------- TAGQERFKSLRTPFYRGADCCLLTFSVDDRQSFENLGNWQKEFIYYADVKDPEHFPFVVL ---3333---33332222-------1111---------------1111--3333------ GNKVDKEDRQVTTEEAQTWCMENGDYPYLETSAKDDTNVTVAFEEAVRQVLAV --3333--------------------------1111-------------1111 >L-ASPARAGINASE I; SWP:A2PBS8; PDB:2OCDA; ARKHIYIAYTGGTIGKKSDPVAGFEKQLASPEFHRPEPLFTIHEYDPLDSSDTPADWQLI -----------3333---------------1111--------------1111-------- ADDIAANYDKYDGFVILHGTDTAYTASALSFFENLGKPVIVTGSQIPLADLRSDGQANLL ----1111-------------------3333---------------1111---------- NALHVAANYPINEVTLFFNNRLRGNRSRKSHADGFSAFSSPNLPPLLEAGINIELSTNVK 
-----------------%%%%-1111-------------1111------------1111- VDEKPSGEFKVNPITPQPIGVITYPGISHEVIRNTLLQPVNAILLTFGVGNAPQNPELLA -----------------------2222-------------------!!!!---------- QLKAASERGVIVVNLTQCLAGKVNGGCALADAGVISGYDTPEAALAKLHYLLSQNLSYEE -----1111-----------------------------------------3333------ VKAKQQVLRGETL ------------- >HYPOTHETICAL PROTEIN DNJ-; SWP:O45502; PDB:2OCHA; KETGYYDVLGVKPDASDNELKKAYRKMALKFHPDKNPDGAEQFKQISQAYEVLSDEKKRQ -----------1111----------------11111111------------1111----- IYDQGG ------ >DEOXYGUANOSINE KINASE; SWP:Q16854; PDB:2OCPA; GPRRLSIEGNIAVGKSTFVKLLTKTYPEWHVATEPVATWQNIQLGNLLDMMYREPARWSY ---------2222------------3333-----1111---------------3333--- TFQTFSFLSRLKVQLEPFPEKLLQARKPVQIFERSVYSDRYIFAKNLFENGSLSDIEWHI ------------1111----3333----------3333---------1111--------- YQDWHSFLLWEFASRITLHGFIYLQASPQVCLKRLYQRAREEEKGIELAYLEQLHGQHEA -----------3333-------------------------3333--3333---------- WLIHKTTKLHFEALMNIPVLVLDVNDDFSEEVTKQEDLMREVNTFVKNL ----------3333----------------------------------- >NA(+)/H(+) EXCHANGE REGUL; SWP:Q15599; PDB:2OCSA; MPRLCRLVRGEQGYGFHLHGERGQFIRRVEPGSPAEAAALRAGDRLVEVNGVNVEGETHH ---------3333----------------2222--1111-2222----iiii-2222--- QVVQRIKAVEGQTRLLVVDQEDTSV --------2222------------- >3-DEHYDROQUINATE DEHYDRAT; SWP:Q1JCG7; PDB:2OCZA; ARIVAPVPRHFDEAQAIDISKYEDVNLIEWRADFLPKDEIVAVAPAIFEKFAGKEIIFTL -------------1111----1111-----3333-11113333-----1111-------- RTVQEGGNITLSSQEYVDIIKEINAIYNPDYIDFEYFTHKSVFQELDFPNLILSYHNFEE -3333-----------------------------33333333------------------ TPENLEAFSETKLAPRVVKIAVPQSEQDVLDLNYTRGFKTLNPEQEFATISGKLGRLSRF -1111-----3333----------3333-------------1111------3333----- AGDVIGSSWTYVSLGQVTLNDKRIIEVLE -3333------------3333-------- >HYPOTHETICAL PROTEIN VP10; SWP:Q87QX1; PDB:2OD0A; KPILKDSKLFEALGTIKSRSFGGFGLFADETFALVVNNQLHIRADQQTSSDFETQGLKPY -3333-----1111-------------%%%%----%%%%-------------1111---- VYKKRGFPVVTKYYAISSELWESSDRLIEVAKKSLENAKL ---iiii---------3333----------------1111 
>HYPOTHETICAL PROTEIN; SWP:NA; PDB:2OD4A; GFAGSIPYIRVVSITAQSKLQFDTVTYFENVWSPKVISLGAISAEFVQSNENSGYIIHYP -----------------3333---------------1111-------------------- DKQTAISVFDKIKPEVDEVRTQNRIQITEGKRLFRVD ------------------3333--------------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2OD5A; ETESKTVRIREKIKKFLGDRPRNTAEILEHINSTRHGTTSQQLGNVLSKDKDIVKVGYIK ------------------------------1111---------------1111------- RSGILSGGYDICEWATRNWVAEHCPEWTE ----------------------------- >HYPOTHETICAL PROTEIN; SWP:NA; PDB:2OD6A; GAEPKFTSFTTADFINDVDELFIDAVEKTAPVWVKEKSRGLLKFSNRVWNKGEVFRVVTY ---------------3333---------3333---------------------------- EYKDRASFEANIAYLEDTFGKNPVFLQLVTTAKFTTSRCLVVEV ----------------------------1111------------ >REGULATOR OF G-PROTEIN SI; SWP:P08754; PDB:2ODEA; EVKLLLLGAGESGKSTIVKQMKIIHEDGYSEDECKQYKVVVYSNTIQSIIAIIRAMGRLK --------2222--------------------------------------------1111 IDFGEAARADDARQLFVLAGSAEEGVMTPELAGVIKRLWRDGGVQACFSRSREYQLNDSA ----3333---------33331111------------------------3333---1111 SYYLNDLDRISQSNYIPTQQDVLRTRVKTTGIVETHFTFKDLYFKMFDVGGQRSERKKWI --3333-33331111------1111-------------%%%%---------333311111 HCFEGVTAIIFCVALSDYDLVLAEDEEMNRMHESMKLFDSICNNKWFTETSIILFLNKKD 111----------1111----1111---------------11111111------------ LFEEKIKRSPLTICYPEYTGSNTYEEAAAYIQCQFEDLNRRKDTKEIYTHFTCATDTKNV ----3333-3333-1111----3333--------------3333--------1111---- QFVFDAVTDVIIKNNL ---------------- >HYPOTHETICAL PROTEIN ATU2; SWP:Q8UDI1; PDB:2ODFA; RFFTEAEGKAVGVENAAAKGDVLLVCEHASATIPQKYGTLGLSADVLSSHAAWDPGALAV ---3333-------1111---------------3333-%%%%3333--3333-2222--- ARLLSEKFHATLVYQRFSRLVYDCNRPPESPSAMPVKSEIYDIPGNFDLDEAERFARTSA -----1111--------3333-11113333-------!!!!-3333-------------- LYVPFHDRVSEIIAERQAAGRKVVVVTIHSFTPVYHGRFREVEIGILHDNDSRLADAMLA ----------------1111--------------iiii---------------------1 GAEGASLTVRRNDPYGPEDGVTHTLRLHALPDGLLNVMIEIRNDLIANEGEQAAIAGFLH 111------------3333----------1111--------3333--------------- ELMGKALSSIEE --------1111 >R.BCNI; SWP:Q8RNV8; 
PDB:2ODIA; KIWSKEEVVNKLHEIKNKGYLSVPTDFRTDDGVVGQILERQFGVQENNITLGDLGEFELK ---------------3333----------1111------1111----33331111----- GRNRKAKSNLTLFHKKPVAGQTVIQIFNRFGYVKPSSRNPEVKKKLFTTIKGGRLNNLGL -----------------------------------3333----------------1111- TLNAKHASEINLYYQDEYLSTWDLNLSKIEKLVLVFAETIGRANSPEEQFHFTKAYLTEI -----1111----!!!!--------3333------------2222--------------- NDITSLINDGVLVDLCIDQDLSKSKGPHDRGPHLRIPISKLDKLYRNIERLL ------1111---------3333-------------3333------------ >HYPOTHETICAL PROTEIN; SWP:Q82T22; PDB:2ODKA; HVWPVQDAKARFSEFLDACITEGPQIVSRRGAEEAVLVPIGEWRRLQAAA ----------------------------%%%%------------------ >COMPLEMENT C2; SWP:Q95IG1; PDB:2ODPA; KIQIQRSGHLNLYLLLDASQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPK ------------------3333-------------------1111--------------- VLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMA ---3333----------------11111111-----------------------111133 WQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWREL 33---------------------------------33331111----------------- NELGSKKDGERHAFILQDTKALHQVFEHMLDVSKLTDTICGVGNMSANASDQERTPWHVT ------2222--------------3333---1111----------11113333-1111-- IKPTCRGALISDQWVLTAAHCFWRVNVGDPKSQWGKEFLIEKAVISPGFDVFAKKNQGIL -----------------3333-------1111-------------333311113333--- EFYGDDIALLKLAQKVKMSTHARPICLPCTMEANLALRRPQGSTCRDHENELLNKQSVPA ------------------1111-----------------1111----------------- HFVALNGSKLNINLKMGVEWTSCAEVVSQEKTMFPNLTDVREVVTDQFLCSGTQEDESPC ---1111---------3333------------------1111--1111----!!!!---1 KGESGGAVFLERRFRFFQVGLVSWGLYNPCLNSRKRAPRSKVPPPRDFHINLFRMQPWLR 111--------%%%%------------1111------1111---------3333------ QHLGDVLNFL --1111---- >Inositol-tetrakisphosphat; SWP:Q13572; PDB:2ODTX; KGKRVGYWLSEKKIKKLNFQAFAELCRKRGEVVQLNLSRPIEEQGPLDVIIHKLTDVILE -------------------------3333------11113333----------------- ADQNDSQSLELVHRFQEYIDAHPETIVLDPLPAIRTLLDRSKSYELIRKIEAYEDDRICS 1111-----------------1111----3333-----3333------------1111-- 
PPFELTSLCGDDTRLLEKNGLTFPFICKTRVAHGTNSHEAIVFNQEGLNAIQPPCVVQNF ---------1111---1111-------------1111------33333333--------- INHNAVLYKVFVVGESYTVVQRPSLKNFSAGTSDRESIFFNSHNVSKPESVLTELDKIEG --%%%%------!!!!------------------------3333-----3333------- VFERPSDEVIRELSRALRQALGVSLFGIDIIINNQTGQHAVIDINAFPGYEGVSEFFTDL -----3333--------------------------------------------------- LNHIATVLQG ------3333 >PLECTIN 1; SWP:Q6S379; PDB:2ODVA; ELQLRWQEYRELVLLLLQWMRHHTAAFEERFPSSFEEIEILWSQFLKFKEMELPAKEADK ------------------------------------------------------------ NRSKGIYQSLEGAVQAGQLKVPPGYHPLDVEKEWGKLHVAILEREKQLRSEFERLEALQR ---------------------22223333------------------------------- IVTKLQMEAGLAEEQLNQADALLQSDVRLLAAGKVPQRAGEVERDLDKADSMIRLLFNDV ------------------------------------------------------------ QTLKDGRHPQGEQMYRRVYRLHKRLVAIRTEYNLRLK ---11111111-------------------------- >CYTOCHROME C OXIDASE POLY; SWP:P04037; PDB:2ODXA; MKDPIIIESYDDYRYVGCTGSPAGSHTIMWLKPTVNEVARCWECGSVYKLNPVG ---------------------------------1111----------------- >HYPOTHETICAL PROTEIN; SWP:O28417; PDB:2OEBA; DIREIEQERASFAFKVVSDIKDKYSQNKKVQGKYSSYAEKAPTIILNNGLGATLAFFLSK -----------------------1111-----------------------------3333 LEKPIDDVDYKSINPESFGNAENIAYAFLYKHLSTWLAEGNGKDSAFSGLTNGEDPLKYI --------3333-1111---------------------!!!!--------iiii------ MEKTAIDVAISTEEALSILNWIKKFAKAMLEE -------------------------------- >UPF0342 PROTEIN YHEA; SWP:YHEA_BACSU; PDB:2OEEA; VNFYDVAYDLENALRGSEEFTRLKNLYDEVNADESAKRFENFRDVQLQAQKTVALVQQHE ---------------------------------3333-------------3333------ KISQLEAEQRSLIGELNKIIKPLEELY --------------------------- >UTP-glucose-1-phosphate u; SWP:A4HXX2; PDB:2OEGA; SLSAAAQACVKKRDAKVNEACIRTFIAQHVVSKGETGSIPDSAIPVDSLDALDSLTIECD ------------1111--------------1111-----3333-------3333------ NAVLQSTVVLKLNGGLGTGGLCDAKTLLEVKDGKTFLDFTALQVQYLRQHCSEHLRFLDS ---1111----------------3333---2222-----------------1111----3 FNTSASTKSFLKARYPWLYQVFDSEVELQNQVPKILQDTLEPAAWAENPAYEWAPPGHGD 
333-----------3333--------------------------3333-1111------- IYTALYGSGKLQELVEQGYRYFVSNGDNLGATIDKRVLAYEKEKIDFLEVCRRTESDKKG -----1111---------------1111-----3333---1111---------1111--- GHLARQTVYVKGKDGQPDAEKRVLLLRESAQCPKADESFQDINKYSFFNTNNLWIRLPVL ---------------------------3333-11113333-------------------- LETQEHGGTLPLPVIRNEKTVDSSNSASPKVYQLETAGAAIAFESASAIVVPRSRFAPVK -----%%%%------------3333-----------------1111-----3333----- TCADLLALRSDAYVVTDDFRLVLDDRCHGHPPVVDLDSAHYKNGFEKLVQHGVPSLVECK ---------3333--1111----3333---------3333--33331111-----1111- RVTVKGLVQFGAGNVLTGTVTIENTDSASAFVIPDGAKLNDTTASP -----------------------3333------------------- >2,3-diketo-5-methylthiope; SWP:Q5L1E2; PDB:2OEMA; SAVMATYLLHDETDIRKKAEGIALGLTIGTWTDLPALEQEQLRKHKGEVVAIEELGESER ------------------------------1111-------3333--------------- VNAYFGKRLKRAIVKIAYPTVNFSADLPALLVTTFGKLSLDGEVRLLDLEFPDEWKRQFP ------------------1111-------------3333------------33331111- GPRFGIDGIRDRVGVHNRPLLMSIFKGMIGRDLAYLTSELKKQALGGVDLVDDEILFDSE ---------------------------2222------------1111-----1111--33 LLPFEKRITEGKAALQEVYEQTGKRTLYAVNLTGKTFALKDKAKRAAELGADVLLFNVFA 33--------------------------------1111------------------3333 YGLDVLQALREDEEIAVPIMAHPAFSGAVTPSEFYGVAPSLWLGKLLRLAGADFVLFPSP -3333------1111-------2222---------------------------------- YGSVALEREQALGIARALTDDQEPFARAFPVPSAGIHPGLVPLIIRDFGLDTIVNAGGGI ------------------------------------3333----------------3333 HGHPDGAIGGGRAFRAAIDAVLAGRPLRAAAAENEALQKAIDRWGVV --1111----------------------------------------- >PROTEIN OF UNKNOWN FUNCTI; SWP:Q5L2A5; PDB:2OEQA; EPLHALARQLEQAIRASEPFQQLKRAYEDVRRDETAYRFANVRDIQLRLHEKQRGAAILP ---3333---------------------------1111---------------------- DEIEQAQKAALAQQNEKLARLALEQQSITIAEVQQIAKPLEELHRSF ----------------------------------------------- >PROBABLE TRANSCRIPTIONAL ; SWP:Q9I3U1; PDB:2OERA; SELVASILEAAVQVQRFTTARVAERAGVSIGSLYQYFPNKAAILFRLQSDEWRRTTRLLG --------------------------------3333------------------------ 
EILEDTTRPPLERLRRLVLAFVRSECEEAAIRVALSDAAPLYEAREVKAEGARVFQAFLR --------------------------------------1111------------------ EALPEVAEAERSLAGDLLTTTLGAVGKQFSEQPRSEAEIERYAEALADLCAYLAALGE --1111---------------------1111--------------------------- >UPF0289 PROTEIN VP2528; SWP:Q87LT3; PDB:2OEZA; ATTHKFEHPLNEKTRIYLRVESLLRQAHLASGFADNHQYQLFFRALFDVEIFEQIQLKSE -------------------------------------------------3333------- LAKDLEKQRLSYRHWLNVEGVDQEALNSLLNEIDVVHSQLGAERFGQALKEDRFLSSIRQ ------------1111---------------------------2222------------3 RFNLCCFDLPALHYWLHLPIERKKHDANQWQKSLKPLSDALTLWLKLARETGHFKAQIAR 333--1111-----1111-----------3333--------------------------i AGFFQSDADEANILRLHIPKYGVYPISGHKNRFAIKFAFENGQACSQDVEFELAVC iii-------------------------!!!!-----3333--------------- >ZYG-9; SWP:O61442; PDB:2OF3A; AELLLSDNEDKKQRIKEEKQLKLVKWNFQAPTDEHISQLQTLLGNQAKVSLMSQLFHKDF -----------------1111--------------------------------1111--- KQHLAALDSLVRLADTSPRSLLSNSDLLLKWCTLRFFETNPAALIKVLELCKVIVELIRD ----------3333------------------3333------------------------ TETPMSQEEVSAFVPYLLLKTGEAKDNMRTSVRDIVNVLSDVVGPLKMTPMLLDALKSKN -----------------1111-----------------------1111------------ ARQRSECLLVIEYYITNAGISPLKSLSVEKTVAPFVGDKDVNVRNAAINVLVACFKFEGD -------------------3333111133333333------------------------- QMWKAAGRMADKDKSLVEERIKRTGV ----------------------1111 >PUTATIVE TETR-FAMILY TRAN; SWP:Q9EWH2; PDB:2OF7A; GLRERKKTRTREAIRAATYGLIRQQGYEATTVEQIAERAEVSPSTVLRYFPTREDIVLTD --------------------------3333-----------1111-------3333---- EYDPVAAELAARPAGEPWSDSLRHVLRKALGLGAGEEAELIRLRTRLLAEVPAVRARLEN ------------------------------------------------------------ SDTGRLARAIADRTGLDPDGLEVRIVSSLVGGLEVSRYWAEHDHEESLAELVDRALDALE ----------------11113333-----3333--------3333--------------- NGLPA ----- >TRAO; SWP:A0PBC5; PDB:2OFQA; AGAKNYQYVMSEQPEMRSIQPVHVWDNYRFTRFEFPANAELPQVYMISASGKETLPNSHV ------------33331111---------------1111--------------------- VGENRNIIEVETVAKEWRIRLGDKVVGVRNNNFAP ----------------------------------- 
>PUTATIVE XRE-FAMILY TRANS; SWP:Q0S9B8; PDB:2OFYA; RVPLTAEELERGQRLGELLRSARGDSVTVAFDAGISVETLRKIETGRIATPAFFTIAAVA ------------------------------------------------------------ RVLDLSLDDVAAVVTFGPVS 1111---------------- >2-hydroxy-6-oxo-6-phenylh; SWP:P47229; PDB:2OG1A; TALTESSTSKFVKINEKGFSDFNIHYNEAGNGETVIMLHGGGPGAGGWSNYYRNVGPFVD ---3333--------2222----------------------22223333-1111----11 AGYRVILKDSPGFNKSDAVVMDEQRGLVNARAVKGLMDALDIDRAHLVGNSMGGATALNF 11-------2222------------------------1111------------------- ALEYPDRIGKLILMGPGGLGPSMFAPMPMEGIKLLFKLYAEPSYETLKQMLQVFLYDQSL ---1111--------------------------------------------1111-1111 ITEELLQGRWEAIQRQPEHLKNFLISAQKAPLSTWDVTARLGEIKAKTFITWGRDDRFVP ---------------3333-------------111133333333---------------3 LDHGLKLLWNIDDARLHVFSKCGHWAQWEHADEFNRLVIDFLRHA 333---------------------3333----------------- >PRE-MRNA-SPLICING FACTOR ; SWP:P33334; PDB:2OG4A; SKNEWRKSAIANTLLYLRLKNIYVSADDFVEEQNVYVLPKNLLKKFIEISDVKIQVAAFI ---------1111--3333----------3333-------------1111---------- YGMSAKDHPKVKEIKTVVLVPQLGHVGSVQISNIPDIGDLPDTEGLELLGWIHTQTEELK -------1111-------------2222--------1111--2222-------------- FMAASEVATHSKLFADKKRDCIDISIFSTPGSVSLSAYNLTDEGYQWGEENKDIMNVLSE -----------------1111-------2222---------------1111-------22 GFEPTFSTHAQLLLSDRITGNFIIPSGNVWNYTFMGTAFNQEGDYNFKYGIPLEFYNEMH 221111------------------1111---111111111111----------1111111 RPVHFLQFS 1-------- >PUTATIVE OXYGENASE; SWP:Q9Z4Z5; PDB:2OG5A; SRYDVTLDQSDAELVEEIAWKLATQATGRPDDAEWVEAARNAWHAWPATLRRDLAGFRRD ----------------------------1111--------3333---------------- SGPDGAIVLRGLPVDSMGLPPTPRVNGSVQREASLGAAVLLMTACGLGDPGAFLPEKNGA -1111---------3333----------------------------------1111%%%% LVQDVVPVPGMEEFQGNAGSTLLTFHNENAFHEHRPDFVMLLCLRADPTGRAGLRTACVR -------2222----1111---------3333--------------1111-------333 RVLPLLSDSTVDALWAPEFRTAPPPSFQAPAPVLLGDRSDPDLRVDLAATEPVTERAAEA 33333-------1111-------3333------------------3333----------- LRELQAHFDATAVTHRLLPGELAIVDNRVTVHGRTEFTPRYDGTDRWLQRTFVLTDLRRS 
-----------------2222----------------------------------33333 RAMRPADGYVLGAAP 3332222-------- >MANDELATE RACEMASE/MUCONA; SWP:Q12GE3; PDB:2OG9A; PSDRITWVRISSCYLPLATPITEIAILFAEIETAGGHQGLGFSYSKRAGGPGQFAHAREI --------------------------------1111------------------------ APALIGEDPSDIAKLWDKLCWAGASAGRSGLSTQAIGAFDVALWDLKAKRAGLSLAKLLG -1111--3333-----------1111----------------------1111-------- SYRDSVRCYNTSGGFLHTPIDQLVNASASIERGIGGIKLKVGQPDGALDIARVTAVRKHL ----------%%%%11113333-------------------------------------- GDAVPLVDANQQWDRPTAQRCRIFEPFNLVWIEEPLDAYDHEGHAALALQFDTPIATGEL 1111----%%%%---------3333-----------1111-------------------- TSAAEHGDLIRHRAADYLPDAPRVGGITPFLKIASLAEHAGLLAPHFAELHVHLAAAYPR ---------1111------3333-------------------------------1111-- EPWVEHFEWLEPLFNERIEIRDGRLVPTRPGLGLTLSGQVKAWTREEAQVGTRP -------1111---------iiii-----!!!!---33331111---------- >HYPOTHETICAL PROTEIN MJ04; SWP:Q57851; PDB:2OGFA; SLRVEETEVFKKYFKNLTDRERAVFEGGITLGALFHQFVGTPVSKYNKESLERAIEEAKN --3333-3333--1111--------------------2222--3333------------- QPCVYDIKVKIRNVGEKYVSLDGKLDVDLKIKINKTVAHLKLEYIPEIDYPLYVKKFE 2222----------------------------!!!!--------3333---------- >TREHALOSE OPERON TRANSCRI; SWP:P39796; PDB:2OGGA; TKTTVHKFGLEPPSELIQKQLRANLDDDIWEVIRSRKIDGEHVILDKDYFFRKHVPHLTK ------------------1111----------------------------3333----33 EICENSIYEYIEGELGLSISYAQKEIVAEPCTDEDRELLDLRGYDHVVVRNYVFLEDTSL 33------------1111-------------3333-------------------1111-- FQYTESRHRLDKFRFVDFARRGK --------3333----------- >HYPOTHETICAL PROTEIN SAG1; SWP:Q8DY32; PDB:2OGIA; YKDYTGLDRTELLSKVRHSDKRFNHVLGVERAAIELAERYGYDKEKAGLAALLHDYAKEL 3333------------------------------------------------11111111 SDDEFLRLIDKYQPDPDLKKWGNNIWHGLVGIYKIQEDLAIKDQDILAAIAKHTVGSAQS --------------3333---3333--1111----------------------------- TLDKIVYVADYIEHNRDFPGVEEARELAKVDLNKAVAYETARTVAFLASKAQPIYPKTIE ------------3333-------------------------------1111---3333-- TYNAYIPYLD ----3333-- >DIHYDROOROTASE; SWP:Q8UAV1; PDB:2OGJA; 
QAPILLTNVKPVGFGKGASQSSTDILIGGDGKIAAVGSALQAPADTQRIDAAFISPGWVD --------------1111---------1111----------------------------- LHVHIWHGGTDISIRPSECGAERGVTTLVDAGSAGEANFHGFREYIIEPSRERIKAFLNL --------------3333-3333-----------------------1111---------- GSIGLVACNRVPELRDIKDIDLDRILECYAENSEHIVGLVRASHVITGSWGVTPVKLGKK 1111--2222-----3333------------1111-------3333!!!!---------- IAKILKVPVHVGEPPALYDEVLEILGPGDVVTHCFNGKSGSSIEDEDLFNLAERCEGIRL -------------------------2222--------2222------------------- DIGHGGASFSFKVAEAAIARGLLPFSISTDLHGHSNFPVWDLATTSKLLSVDPFENVVEA -----------------1111-------------------3333---3333-33333333 VTRNPASVIRLDENRLDVGQRADFTVFDLVDADLEATDSNGDVSRLKRLFEPRYAVIGAE ----3333-----1111---------------------------------------!!!! AIAASRYIPRA ----------- >HYPOTHETICAL PROTEIN; SWP:O27966; PDB:2OGKA; GKIEWVRVSAVVHSTEDREKVGEAISTLFPFEFEIAVSKMEYLEVELTKSSEIKKFWKNL ------------11113333----3333-------------------------------- LELLGEQAEEILSTLEDRIDEQNVLHIRIDKQKAYLGEVSLTSGGDPIAVKLRLVTYPSK ----------3333-----1111------------------------------------3 REKVIEFARELCT 333---------- >THERMOSTABLE CARBOXYLESTE; SWP:Q8GCC7; PDB:2OGTA; RTVVETRYGRLRGEMNEGVFVWKGIPYAKAPVGERRFLPPEPPDAWDGVREATSFGPVVM -----1111------iiii------------!!!!------------------------- QPSPSEDGLYLNIWSPAADGKKRPVLFWIHGGAFLFGSGSSPWYDGTAFAKHGDVVVVTI -------------------------------%%%%--11111111--------------- NYRMNVFGFLHLGDSFGEAYAQAGNLGILDQVAALRWVKENIAAFGGDPDNITIFGESAG ---!!!!----!!!!-3333-3333---------------3333---1111--------- AASVGVLLSLPEASGLFRRAMLQSGSGSLLLRSPETAMAMTERILDKAGIRPGDRERLLS ---------1111------------1111---------------------2222------ IPAEELLRAALSLGPGVMYGPVVDGRVLRRHPIEALRYGAASGIPILIGVTKDEYNLFTL ----------1111----------------33333333--1111--------3333---- TDPSWTKLGEKELLDRINREVGPVPEEAIRYYWQTWLRIMTYRVFVEGMLRTADAQAAQG -3333---------------------------3333--------------------1111 ADVYMYRFDYETPVCHALELPFVFHNLHQPGVANFVGNRPEREAIANEMHYAWLSFARTG ---------------------11111111------------------------------- 
DPNGAHLPEAWPAYTNERKAAFVFSAASHVEDDPFGRERAAWQ ---1111-------3333--------------1111------- >High-affinity zinc uptake; SWP:ZNUA_ECOLI; PDB:2OGWA; AVVASLKPVGFIASAIADGVTETEVLLPDGASEHDYSLRPSDVKRLQNADLVVWVGPEEA ----------------2222-------22221111------------------------- FQKPVSKLPGAKQVTIAQLEDVKPLLKDFNHLWLSPEIARATAVAIHGKLVELPQSRAKL --3333--3333--333333331111----1111-------------------1111--- DANLKDFEAQLASTETQVGNELAPLKGKGYFVFHDAYGYFEKQFGLTPLGHFTVNPEIQP ---------------------1111-----------------------------3333-- GAQRLHEIRTQLVEQKATCVFAEPQFRPAVVESVARGTSVRGTLDPLGTNIKLGKTSYSE ------------1111------1111------------------1111-----1111--- FLSQLANQYASCLK -------------- >ACETYLTRANSFERASE, GNAT F; SWP:Q722P7; PDB:2OH1A; NKITAGGLEFLVRFAAPTDRLKINDLIDTARWLKESGSTQWSDILHGFDVHNIEQRIELG ----%%%%-------3333--------------------3333-----3333----1111 EVALFETEAGALAGAIIRKTPSDWDTDLWEDLAIDKAYYLHRIVSRAFSGISLSKQIYFA ------1111------------------!!!!------------3333---3333----- EKLGIESVPFIRLDCIESNETLNQYVRYGFQFSGKKNGFYLYQKELSQK ---------------11113333--1111------iiii---------- >COG1633: UNCHARACTERIZED ; SWP:Q2VZ87; PDB:2OH3A; YTLAEFLAHAIALETEAAERYVELADEAHNNLDTATVFRDARFSTLHGDEIKQRSRALEL --------------------------1111------------------------------ PKLSWQYRWKTPPEVGDEHYLTPYHALRYARDNEIRGEYYKEAAANSADPEVKRLGADFA ---3333----------------------------------------------------- AEEAEHVVALDKWIEKTPRPSIT -------------1111------ >POLYHEDRIN; SWP:O10693; PDB:2OH5A; ADVAGTSNRDFRGREQRLFNSEQYNYNNSLNGEVSVWVYAYYSDGSVLVINKNSQYKVGI ----------------------------1111---------1111--------------- SETFKALKEYRKGQHNDSYDEYEVNQSIYYPNGGDARKFHSNAKPRAIQIIFSPSVNVRT --3333----2222---------------2222-------1111--------11113333 IKMAKGNAVSVPDEYLQRSHPWEATGIKYRKIKRDGEIVGYSHYFELPHEYNSISLAVSG ------1111----3333------3333-----iiii-----------1111-------- VHKNPSSYNVGSAHNVMDVFQSCDLALRFCNRYWAELELVNHYISPNAYPYLDINNHSYG --------1111--------------------------------1111----1111---- VALSNRQ --2222- >YUEI PROTEIN; SWP:O32092; PDB:2OHWA; 
EDKMDLYLQQGMYGPLETKPDERHLFLGSLRERVVLALTKGQVLRSKPYKEAEHELKNSH 3333---------------------iiii3333-----3333-------------1111- NVTLLINGELQYQSYSSYIQMASRYGVPFKIVSDLQFHTPLGIVIAADIAVNRELIYIQD ------33333333--------1111---------------------------------- DIYNRSVL -------- >PUTATIVE REGULATORY PROTE; SWP:Q9KXS8; PDB:2OI8A; TPRERYRTQVRAEIKDHAWEQIATAGASALSLNAIAKRGSGPALYRYFDGRDELITELIR ---------------------------------------33331111------------- DAYRSQADSLRAAAASGADLAGLAHALRAWALDDPQRYFLIFGTPVPGYRAPDDITEIAA -------------1111----------------------------2222-----3333-- ETAVIVDACAAGTDGAFDAHLDTHRQWADRPAPSSALHRALSFWSRLHGVLSLELAGQFT -------------------3333---------------------------------1111 GGFDSALLFEAELKDLLGP ------------3333--- >INTERLEUKIN-1 RECEPTOR-AS; SWP:Q69FE1; PDB:2OIBA; DTRFHSFSFYELKNVTNNFDERPISVGGNKMGEGVVYKGYVNNTTVAVKKTTEELKQQFD -------3333-----%%%%--3333--------------iiii---------------- QEIKVMAKCQHENLVELLGFSSDGDDLCLVYVYMPNGSLLDRLSCLDGTPPLSWHMRCKI ----------1111-------------------1111-------2222------------ AQGAANGINFLHENHHIHRDIKSANILLDEAFTAKISDFGLARASEVMRIVGTTAYMAPE ---------------------3333---1111------1111----------3333-333 ALRGEITPKSDIYSFGVVLLEIITGLPAVDEHREPQLLLDIKEEIEDEEKTIEDYIDKKM 3----------------------------1111---3333----------3333------ NDADSTSVEAMYSVASQCLHEKKNKRPDIKKVQQLLQEMT ----------------1111-1111--------------- >RS21-C6; SWP:Q9QY93; PDB:2OIEA; RPFRFSPEPTLEDIRRLHAEFAAERDWEQFHQPRNLLLALVGEVGELAELFQWKSDTEPG --------------------------3333----------------33331111-----3 PQAWPPKERAALQEELSDVLIYLVALAARCHVDLPQAVISKM 333------------------------1111-------1111 >HISTIDINE TRIAD (HIT) PRO; SWP:Q1GYB6; PDB:2OIKA; SFHKNCELCTTAGGEILWQDALCRVVHVENQDYPGFCRVILNRHVKESDLRPAERDHLLV --11113333---------3333------1111--------------------------- VFAVEEAVREVRPDKINLASLGNTPHVHWHVIPRFKRDRHFPNSVWGETKRESLPQALDQ -------------------------------------1111--1111------------- GSTTALKKAISVRLD --------------- >RAS-RELATED PROTEIN RAB-2; SWP:P57735; PDB:2OILA; 
EDYNFVFKVVLIGESGVGKTNLLSRFTRNEFSHDSRTTIGVEFSTRTVMLGTAAVKAQIW -------------2222------------------------------------------- DTAGLERYRAITSAYYRGAVGALLVFDLTKHQTYAVVERWLKELYDHAEATIVVMLVGNK -----3333------2222-------11113333--------------1111-------3 SDLSQAREVPTEEARMFAENNGLLFLETSALDSTNVELAFETVLKEIFAKVSKQ 3331111------------------------------------------3333- >NUCLEOPORIN 214KDA; SWP:Q3KQZ0; PDB:2OITA; MGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAVSNKYGLVFAGGASGLQ ------------------------------------------------------3333-- IFPTKNLLIQNKPGDDPNKIVDKVQGLLVPMKFPIHHLALSCDNLTLSACMMSSEYGSII --3333---------1111---------------------1111---------------- AFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVT ---3333--1111-------------1111----------1111---------------- ETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYESDHP ---------3333-------1111------1111-----1111--------11111111- VRVLDVLWIGTYVFAIVYAAADGTLETSPDVVMALLPKKEEKHPEIFVNFMEPCYGSCTE ---------1111------1111------------------------------------- RQHHYYLSYIEEWDLVLAASAASTEVSILARQSDQINWESWLLEDSSRAELPVTDKSDDS ---------3333------1111--------1111--------1111------------- LPMGVVVDYTNQVEITISDEKTLPPAPVLMLLSTDGVLCPFYMINQNPGVKSLIKTPERL -----------------1111-----------1111------------------------ SLEGERQPKSPGST -2222--------- >PUTATIVE 4-HYDROXYBENZOYL; SWP:NA; PDB:2OIWA; AFTTVITPRVSETDGVGHINNTTVPVWFEAGRHEIFKLFTPDLSFKRWRVIIREVDYVNQ --------3333-3333--3333----------------11113333------------- YYGQDVTVYTGIERIGNTSLTIYEEIHQNGVVCAKGRSVYVNFNFDTGRPEPIPDDIRVK ---------------1111--------%%%%-------------1111------------ LREHVWQP -------- >REGULATOR OF G-PROTEIN SI; SWP:Q8WV02; PDB:2OJ4A; SEEALKWGESLEKLLVHKYGLAVFQAFLRTEFSEENLEFWLACEDFKKVKSQSKMASKAK -333311113333---------------1111-------------1111----------- KIFAEYIAIQACKEVNLDSYTREHTKDNLQSVTRGCFDLAQKRIFGLMEKDSYPRFLRSD -------2222---------------------1111------------------------ LYLDLIN -3333-- >VIRAL ATTACHMENT PROTEIN ; SWP:Q86329; PDB:2OJ5A; 
PNLRYPIADVSGGIGMSPNYRFRQSMWIGIVSYSGSGLNWRVQVNSDIFIVDDYIHICLP ---------iiii---3333--------------iiii------------!!!!------ AFDGFSIADGGDLSLNFVTGLLPPLLTGDTEPAFHNDVVTYGAQTVAIGLSSGGTPQYMS -------------------------3333--1111------------------------- KNLWVEQWQDGVLRLRVEGGGSITHSNSKWPAMTVSYPRSFT --------iiii------------------------------ >CYTOCHROME P450 2R1; SWP:Q6VVX0; PDB:2OJDA; FPPGPPGLPFIGNIYSLAASSELPHVYMRKQSQVYGEIFSLDLGGISTVVLNGYDVVKEC ----------!!!!--1111---------3333---------iiii-------------- LVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRRLAVNSFRYFGYGQKSFESKI ----3333------------%%%%-1111--------------------3333------- LEETKFFNDAIETYKGRPFDFKQLITNAVSNITNLIIFGERFTYEDTDFQHMIELFSENV -------------iiii-------------------------1111-------------- ELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYDFLSRLIEKASVNRKPQLPQHFVD ----------------3333------------------------------2222------ AYLDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETTTNVLRWAILFMALYPNIQGQVQK ------1111-1111-----------------------------------3333------ EIDLIMGPNGKPSWDDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSIPKGT ------------33331111------------------------------iiii--2222 TVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKKEALVPFSLGRRHCLGEHLARMEM -------11113333--1111-3333--1111----11111111-1111----------- FLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYLICAERRH ---------------%%%%------------------------- >UNCHARACTERIZED PROTEIN A; SWP:Q8UEU8; PDB:2OJHA; GSRSSIEIFNIRTRKRVVWQTPELFEAPNWSPDGKYLLLNSEGLLYRLSLAGDPSPEKVD ------------------------------1111------iiii---------------- TGFATICNNDHGISPDGALYAISDKVEFGKSAIYLLPSTGGTPRLTKNLPSYWHGWSPDG !!!!---------1111-------------------1111----------------1111 KSFTYCGIRDQVFDIYSDIDSGVETRLTHGEGRNDGPDYSPDGRWIYFNSSRTGQQIWRV --------%%%%-----3333------------------1111-------1111------ RVDGSSVERITDSAYGDWFPHPSPSGDKVVFVSYDADVFDHPRDLDVRVQLDDGGNVETL 1111------------------1111--------1111---------------------- FDLFGGQGTNSPNWSPDGDEFAYVRYFPV -----2222-----1111----------- >HYPOTHETICAL PROTEIN; SWP:Q7VWT7; PDB:2OJLA; 
HPPRIAIQYCTQCQWLLRAAWAQELLSTFGADLGEVALVPGTGGVFRIHYNGAPLWDREV ----------1111--------------!!!!---------2222----iiii---3333 DGGFPEAKVLKQRVRDHL ------------------ >PROGRAMMED CELL DEATH 6-I; SWP:Q4W4Y1; PDB:2OJQA; PVSVQQSLAAYNQRKADLVNRSIAQREATTLANGVLASLNLPAAIEDVSGDTVPQSILTK ------------------------------------------------------------ SRSVIEQGGIQTVDQLIKELPELLQRNREILDESLRLLDEEEATDNDLRAKFKERWQRTP ---3333-3333----------------------------------------1111---3 SNELYKPLRAEGTNFRTVLDKAVQADGQVKECYQSHRDTIVLLCKPEPELNAAIPSANPA 333--------------------------------33331111---3333---------- KTQGSEVVSVLKSLLSNLDEVKKEREGLENDLKSVNFDTSKFLTALAQDGVINEEALSVT ----1111------------------------------3333------------------ ELDRVYGGLTTKVQESLKKQEGLLKNIQVSHQEFSKKQSNNEANLREEVLKNLATAYDNF -----3333---1111------------------------3333---------------- VELVANLKEGTKFYNELTEILVRFQNKCSDIVFARKTER --------------------------------------- >LACTOPEROXIDASE; SWP:A3F9D6; PDB:2OJVA; SWEVGCGAPVPLVTCDEQSPYRTITGDCNNRRSPALGAANRALARWLPAEYEDGLAVPFG ----------------------1111---3333-2222------------1111---222 WTQRKTRNGFRVPLAREVSNKIVGYLDEEGVLDQNRSLLFMQWGQIVDHDLDFAPETELG 21111-iiii-----------------2222----------------------------- SSEHSKVQCEEYCVQGDECFPIMFPKNDPKLKTQGKCMPFFRAGFVCPTPPYQSLARDQI ----1111-------!!!!-----22223333---------------------------- NAVTSFLDASLVYGSEPSLASRLRNLSSPLGLMAVNQEAWDHGLAYPPFNNVKPSPCEFI --------------------1111----------------iiii----------3333-- NTTAHVPCFQAGDSRASEQILLATVHTLLLREHNRLARELKRLNPHWDGEMLYQEARKIL ------------1111---------------------------1111------------- GAFIQIITFRDYLPIVLGSEMQKWIPPYQGYNNSVDPRISNVFTFAFRFGHMEVPSTVSR -----------3333-!!!!-----------1111----3333----------------- LDENYQPWGPEAELPLHTLFFNTWRIIKDGGIDPLVRGLLAKNSKLMNQNKMVTSELRNK -1111---------3333----3333-----3333------------1111---1111-- LFQPTHKVHGFDLAAINLQRCRDHGMPGYNSWRGFCGLSQPKTLKGLQAVLKNKVLAKKL --1111---------------------------1111----------------------- LDLYKTPDNIDIWIGGNAEPMVERGRVGPLLACLLGRQFQQIRDGDRFWWENPGVFTEKQ 
-----3333-3333-1111--2222-----------------1111--1111-------- RDSLQKVSFSRLICDNTHITKVPLHAFQANNYPHDFVDCSAVDKLDLSPWASREN --3333------------------3333----1111-3333-----3333----- >GLUTAMINE SYNTHETASE; SWP:P15104; PDB:2OJWA; YFQSMASSHLNKGIKQVYMSLPQGEKVQAMYIWIDGTGEGLRCKTRTLDSEPKCVEELPE ----3333--3333---1111-------------1111---------------3333--- WNFDGSSTLQSEGSNSDMYLVPAAMFRDPFRKDPNKLVLCEVFKYNRRPAETNLRHTCKR ---1111----3333----------------------------1111--1111------- IMDMVSNQHPWFGMEQEYTLMGDGHPFGWPSNGFPGPQGPYYCGVGADRAYGRDIVEAHY ----1111-----------------2222----------------1111--3333----- RACLYAGVKIAGTNAEVMPAQWEFQIGPCEGISMGDHLWVARFILHRVCEDFGVIATFDP -----------------2222--------!!!!--------------------------- KPIPGNWNGAGCHTNFSTKAMREENGLKYIEEAIEKLSKRHQYHIRAYDPKGGLDNARRL -----------------3333-2222----------------------1111---3333- TSNINDFSAGVANRSASIRIPRTVGQEKKGYFEDRRPSANCDPFSVTEALIRTCLLNETG --1111------1111----3333-------------1111------------------- >PUTATIVE FERREDOXIN--NADP; SWP:Q6LF82_PLAF7; PDB:2OK8A; NFINLYTVKNPLKCKIVDKINLVRPNSPNEVYHLEINHNGLFKYLEGHTCGIIPYYNERC -2222-1111-------------1111----------iiii---2222----2222---- ARLYSISSSNNMENLSVAIKIHKYETNYGYCSGFIKNLKINDDIYLTGAHGYFNLPNDAI --------1111--------------------------2222-------------33331 QKNTNFIFIATGTGISPYISFLKKLFAYDRNSNYTGYITIYYGVYNEDSILYLNELEYFQ 111--------------------1111------------------3333----------- KMYPNNINIHYVFSYKQNSDATSFYVQDEIYKRKTEFLNLFNNYKCELYICGHKSIRYKV --1111-----------3333--------------------------------------- MDILKSHDQFDEKKKKRVHVEVY --------------1111----- >HYPOTHETICAL PROTEIN; SWP:Q9HYQ7; PDB:2OKAA; AKPEIVITYCTQCQWLLRAAWLAQELLSTFADDLGKVCLEPGTGGVFRITCDGVQVWERK ----------1111------------------------------------iiii-----1 ADGGFPEAKALKQRVRDRIDPQRD 111--------------------- >TYPE I RESTRICTION ENZYME; SWP:Q89Z59; PDB:2OKCA; QSLTKKVWNLATTLAGQGIGFTDYITQLTYLLFLKDAENVEFGEESAIPTGYQWADLIAF -------------------------------------3333-------222233333333 DGLDLVKQYEETLKLLSELDNLIGTIYTKAQNKIDKPVYLKKVITIDEEQWLIDGDVKGA 
-3333----------1111--------------------------1111----------- IYESILEKNGQDKKSGAGQYFTPRPLIQAVDCINPQGETVCDPACGTGGFLLTAYDYKGQ -----------1111-3333----------3333----------!!!!------------ SSKEKRDFLRDKALHGVDNTPLVVTLASNLYLHGIGTDRSPIVCEDSLEKEPSTLVDVIL ------------------------------1111-----------3333----------- ANPPFGTRPAGSVDINRPDFYVETKNNQLNFLQHLLKTGGRAAVVLPDNVLFEAGAGETI --------2222----1111----------------2222------3333---------- RKRLLQDFNLHTILRLPTGIFYAQGVKANVLFFSKGQPTKEIWFYDYRTDIKHTLATNKL -----------------------------------------------2222--------- ERHHLDDFVSCYNNRVEIYDAENNPQGRWRKYPVDEIIARDKTSLDITWIKPG 3333----------------------------3333---2222---------- >FDXN ELEMENT EXCISION CON; SWP:Q3M7W6; PDB:2OKFA; RDVFHEVVKTALKKDGWQITDDPLTISVGGVNLKLIAAERQGQKIAVEVKSFLKQSSAIS ------------1111-----------iiii--------iiii---------3333---- EFHTALGQFINYRGALRKVEPDRVLYLAVPLTTYKTFFQLDFPKEIIIENQVKLVYDVEQ -------------------1111------------------------1111------111 EVIFQWIN 1------- >CENTRAL GLYCOLYTIC GENE R; SWP:O32253; PDB:2OKGA; NAKDVLGLTLLEKTLKERLNLKDAIIVSGDSDQSPWVKKEGRAAVACKKRFSGKNIVAVT ---1111----------------------33333333----------1111--------- GGTTIEAVAETPDSKNRELLFVPARGGLGKNQANTICAHAEKASGTYRLLFVPGQLSQGA ------------1111--------------3333------1111---------------- YSSIIEEPSVKEVLNTIKSASLVHGIGEAKTAQRRNTPLEDLKKIDDNDAVTEAFGYYFN ----------------1111-------3333--------------1111----iiii--1 ADGEVVHKVHSVGQLDDIDAIPDIIAVAGGSSKAEAIEAYFKKPRNTVLVTDEGAAKKLL 111----------33331111--------3333-----1111------------------ R - >O-SUCCINYLBENZOIC ACID SY; SWP:Q5HEY3; PDB:2OKTA; SLKLTALHFYKYSEPFKSQIVTPKVTLTHRDCLFIELIDDKGNAYFGECNAFQTDWYDHE ---------------------3333-------------1111------------------ TIASVKHVIEQWFEDNRNKSFETYEAALKLVDSLENTPAARATIVMALYQMFHVLPSFSV --------------------------33331111--------------3333-------- AYGATASGLSNKQLESLKATKPTRIKLKWTPQIMHQIRVLRELDFHFQLVIDANESLDRQ -----------------------------1111-------------------%%%%-333 DFTQLQLLAREQVLYIEEPFKDISMLDEVADGTIPPIALDEKATSLLDIINLIELYNVKV 
3------1111------------3333--2222------1111----------------- VVLKPFRLGGIDKVQTAIDTLKSHGAKVVIGGMYEYGLSRYFTAMLARKGDYPGDVTPAG ---3333--3333--------1111--------------------3333--------222 YYFEQDVVAHSGILKEGRLEFRPPLVDITQLQPYEGHHHHHH 2-------------iiii--------3333-----1111--- >ACYL-COA DEHYDROGENASE FA; SWP:Q7MW70; PDB:2OKUA; QVVAAIRHITTGTYIARIREEYQQTEVKPELQPKEALARTDRAEALIAFVTEQKDQELLD 3333----------------1111---3333------------------3333------- FQARRLVETAHAVFGHLLLAANDDDSFRQSAEVYLRYGQAEQEKIDSYVRAFRPEELT --------------------33331111------------------------3333-- >PROBABLE D-TYROSYL-TRNA(T; SWP:Q8TEA8; PDB:2OKVA; MKAVVQRVTRASVTVGGEQISAIGRGICVLLGISLEDTQKELEHMVRKILNLRVFEDESG --------------iiii---------------1111-------------------1111 KHWSKSVMDKQYEILCVSQFTLQCVLKGNKPDFHLAMPTEQAEGFYNSFLEQLRKTYRPE -----3333-----------------------1111-3333----------------333 LIKDGKFGAYMQVHIQNDGPVTIELESPA 3----3333-------------------- >PLASMA SERINE PROTEASE IN; SWP:P05154; PDB:2OL2A; DFTFDLYRALASAAPSQNIFFSPVSISMSLAMLSLGAGSSTKMQILEGLGLNLQKSSEKE -------------2222-----------------------------1111-2222----- LHRGFQQLLQELNQPRDGFQLSLGNALFTDLVVDLQDTFVSAMKTLYLADTFPTNFRDSA ----------1111---------------3333--3333---------------333333 GAMKQINDYVAKQTKGKIVDLLKNLDSNAVVIMVNYIFFKAKWETSFNHKGTQEQDFYVT 33---------1111----------1111------------------1111--------1 SETVVRVPMMSREDQYHYLLDRNLSCRVVGVPYQGNATALFILPSEGKMQQVENGLSEKT 111-----------------------------------------2222----1111---- LRKWLKMFKKRQLELYLPKFSIEGSYQLEKVLPSLGISNVFTSHADLSGISNHSNIQVSE --------------------------11113333---33331111-3333---------- MVHKAVVEVDESGTRAAAATGTIFTFRSARLNSQRLVFNRPFLMFIVDNNILFLGKVNRP ------------------------------------------------------------ >PAI 2 PROTEIN; SWP:NA; PDB:2OL5A; DPDVAYQVIEENSFATLVSHQRELFATHLPLLLDREKTCLYGHFARSNPQWNDIQHQTVL --3333-------------%%%%----------1111-------33333333-------- AIFHGPHCYISPSWYETNQAVPTWNYVAVHVYGNVELINDQGEVQSLHDVEKYEAPGSRY ----------1111-------------------------3333-----------1111-- 
QLLSGNKGIQAFKIIIKRIEGKAKLSQNHPAHRQERIIKQLEQPFENEKRIASLKK -------------------------1111--------------------------- >PRO-PHENOLOXIDASE ACTIVAT; SWP:O97366; PDB:2OLGA; RNRRPELLPNDCGYQVEADKILNGDDTVPEEFPWTAMIGYKNSSNFEQFACGGSLINNRY ---1111--------------------1111----------1111--------------- IVTAAHCVAGRVLRVVGALNKVRLGEWNTATDPDCYGAVRVCVPDKPIDLGIEETIQHPD ---1111--3333--------------3333--------------------------111 YVDGSKDRYHDIALIRLNRQVEFTNYIRPVCLPQPNEEVQVGQRLTVVGWGRTETGQYST 12222------------------1111------1111--2222---------1111---- IKQKLAVPVVHAEQCAKTFGAAGVRVRSSQLCAGGEKAKDSCGGDSGGPLLAERANQQFF ------------11113333------1111----1111------2222------%%%%-- LEGLVSFGATCGTEGWPGIYTKVGKYRDWIEGNIRP -----------1111------3333----1111--- >NUCLEOPORIN-LIKE PROTEIN ; SWP:Q8K2K6; PDB:2OLMA; SSAKRKQEEKHLKMLRDMTGLPHNRKCFDCDQRGPTYVNMTVGSFVCTSCSGSLRGLNPP --------------------3333----------------------------3333---- HRVKSISMTTFTQQEIEFLQKHGNEVCKQIWLGLFDDRSSAIPDFRDPQKVKEFLQEKYE ----1111----------1111---------11113333----1111------------- KKRWYVPPEQAKV -1111-3333--- >PHOSPHOENOLPYRUVATE SYNTH; SWP:A1KSM6; PDB:2OLSA; NYVIWFENLRMTDVERVGGKNASLGEMISQLTEKGVRVPGGFATTAEAYRAFLAHNGLSE ----3333-1111-----------------3333----------------------3333 RISAALAKLDVEDVAELARVGKEIRQWILDTPFPEQLDAEIEAAWNKMVADAGGADISVA ---------1111----------------------------------------------- VRSSATAFAGQQETFLNINGLDNVKEAMHHVFASLYNDRAISYRVHKGFDIVALSAGVQR ------------------------------------------------------------ MVRSDSGASGVMFTLDTESGYDQVVFVTSSYGLGENVVQGAVNPDEFYVFKPTLKAGKPA --3333-----------------------------------------------1111--- ILRKTMGSKHIKMIFTDKAEAGKSVTNVDVPEEDRNRFSITDEEITELAHYALTIEKHYG --------------------------------3333------------------------ RPMDIEWGRDGLDGKLYILQARPETLCEGRAQKVGQGKVRDVLVTDMTDPDWEPVMKRAS ------------------------------------------------3333-3333--- AIVTNRGGRTCHAAIIAREPAVVGCGNATELLKNGQEVTVSCADTGFIYAGLMPKAPVKV -------111133333333----------------------------------------- 
MMNVGNPELAFSFANLPSEGIGLARMEFIINRQIGIHPKALLEFDKQDDELKAEITRRIA -----33333333------------------------3333-3333-3333-----1111 GYASPVDFYVDKIAEGVATLAASVYPRKTIVRMSDFKSNEYANLVGGNVYEPHEENPMLG ------------------------------------3333---22221111----3333- FRGAARYVADNFKDCFALECKALKRVRDEMGLTNVEIMIPFVRTLGEAEAVVKALKENGL --------33333333---------------1111--------------------1111- ERGKNGLRLIMMCELPSNAVLAEQFLQYFDGFSIGSNDMTQLTLGLDRDSGLVSESFDER 2222----------3333-------1111-----------------3333---1111111 NPAVKVMLHLAISACRKQNKYVGICGQGPSDHPDFAKWLVEEGIESVSLNPDTVIETWLY 1--------------1111-------3333---------1111------1111------- LANEL -3333 >HYPOTHETICAL PROTEIN; SWP:Q8EAX1; PDB:2OLTA; SPIKPLQEHDKVYDCASLLVPFFEATITGNWDDAVQIRKQISLAEKQGDSLKREIRLTLP 1111---------------------1111----------------------------333 SGLFPVERTDLLELLTQQDKIANKAKDISGRVIGRQLLIPQALQVPFIAYLQRCIDAVGL 3--------------------------------------3333----------------- AQQVINELDDLLEAGFRGREVDFVAKINELDIIEEDTDDLQIQLRRQLFALESELNPVDV --------------------------------------------------3333-3333- FLYKTIEWVGGLADLAERVGSRLELLARV -------------------------3333 >PENICILLIN-BINDING PROTEI; SWP:Q8KHY3; PDB:2OLVA; AKLQDPIPAKIYDKNGELVKTLDNGQRHEHVNLKDVPKSKDAVLATEDNRFYEHGALDYK ------------1111------%%%%-----3333-------3333---1111-----11 RLFGAIGKGASTLTQQVVKDAFLSQHKSIGRKAQEAYLSYRLEQEYSKDDIFQVYLNKIY 11-3333----------1111--------3333--------------------------- YSDGVTGIKAAAKYYFNKDLKDLNLAEEAYLAGLPQVPNNYNIYDHPKAAEDRKNTVLYL -iiii-------------3333---------------33333333--------------- HYHKRITDKQWEDAKKIDLKANLVNRTPEERQNIDTNQDSEYNSYVNFVKSELNNKAFKD 1111-------------1111-----3333--------3333------------------ ENLGNVLQSGIKIYTNDKDVQKTLQNDVDNGSFYKNKDQQVGATILDSKTGGLVAISGGR --3333-----------------------------1111-------------------22 DFKDVVNRNQATDPHPTGSSLKPFLAYGPAIENKWATNHAIQDESSYQVDGSTFRNYDTK 22------3333----!!!!3333-------------------------------3333- SHGTVSIYDALRQSFNIPALKAWQSVKQNAGNDAPKKFAAKLGLNYEGDIGPSEVLGGSA ------------------------------1111-----1111-------3333---!!! 
SEFSPTQLASAFAAIANGGTYNNAHSIQKVVTRDGETIEYDHTSHKASDYTAYLAELKGT !----------3333----------------1111------------3333------333 FKPYGSAYGHGVSGVNGAKTGTGTYGAETYSQYNLPDNAAKDVWINGFTPQYTSVWGFSK 32222-2222---------------3333-------------------3333-------- VKQYGENSFVGHSQQEYPQFLYENVSKISSRDGEDFKRPSSVSGSIPSINVSGSQDNNTT --iiii-----1111-----------------------------------2222------ NRSTH ----- >C-TERMINAL-BINDING PROTEI; SWP:Q86SV0; PDB:2OMEA; RPLVALLDGRDCTVEMPILKDLATVAFCDAQSTQEIHEKVLNEAVGAMMYHTITLTREDL -----------3333---1111---------3333--------------------33333 EKFKALRVIVRIGSGYDNVDIKAAGELGIAVCNIPSAAVEETADSTICHILNLYRRNTWL 333------------1111----------------------------------------- YQALREGTRVQSVEQIREVASGAARIRGETLGLIGFGRTGQAVAVRAKAFGFSVIFYDPY -------------------2222--2222-------3333------3333-------111 LQDGIERSLGVQRVYTLQDLLYQSDCVSLHCNLNEHNHHLINDFTIKQMRQGAFLVNAAR 1-----1111-----------------------1111----333311112222------3 GGLVDEKALAQALKEGRIRGAALDVHESEPFSFAQGPLKDAPNLICTPHTAWYSEQASLE 333----------------------------11111111----------3333------- MREAAATEIRRAITGRIPESLRNCVNKEFF -----------------1111--------- >HYPOTHETICAL PROTEIN; SWP:Q8A545_BACTN; PDB:2OMKA; AMINEHIPQAIILANGEYPAHELPLRLLAEAQFVVCCAANEYISRGHTPDVIIGDGDSLL ------------------------------------------1111--------3333-- PEYKKRFSSIILQISDQETNDQTKAVHYLQSKGIRKIAIVGATGKREDHTLGNISLLVEY ------1111-------------------1111--------------------------- MRSGMEVRTVTDYGTFIPVSDTQFSYPGQQVSIINFGAKGLKAEGLFYPLSDFTNWWQGT -------------------------2222-------------------------3333-- LNEAIADEFTIHCTGEYLVFLAY ----------------------- >RIBOSOMAL LARGE SUBUNIT P; SWP:P75966; PDB:2OMLA; NQPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPVQGVYAAGRLDRDSEGLLVLTNNGALQ ---------------------2222-3333-------------1111------------- ARLTQPGKRTGKIYYVQVEGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPAWLWPRNP ----2222---------------------------1111------------1111----- PIRRKSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSLDNLANGEWREVT ---1111---------------------1111----------!!!!-22222222----- D - >DUF176; 
SWP:Q82WP3_NITEU; PDB:2OMOA; HYVTIVYASVKTDKTEAFKEATRNHEQSIREPGNRFDILQSADDPTRFVLYEAYKTRKDA ----------1111------------33332222------1111---------------- AAHKETAHYLTWRDTVADWAEPRKGVIYGGLYPTG 3333-----------1111---------------- >HYPOTHETICAL PROTEIN TA01; SWP:Q9HLN2; PDB:2ONFA; GHVYESDVSWIDDRRTEVSVGDHRIEVDSPPEFGGPEGQLYPETLFPSVLASCLLTTFLE -------------------!!!!------3333-------3333---------------- FKDRGINLKSWNSHVTAELGPSPEKGFKFHRIKIHVKIGVNDEDKEKIPRAQLAEKYCFI ---------------------1111---------------33331111----------33 SRAIRNNVEEIVDYEFV 33-2222---------- >4S-LIMONENE SYNTHASE; SWP:Q40322; PDB:2ONHA; MRRSGNYNPSRWDVNFIQSLLSDYKEDKHVIRASELVTLVKMELEKETDQIRQLELIDDL ------------3333-------------------------------------------- QRMGLSDHFQNEFKEILSSIYLDHHYYKNPFPKEERDLYSTSLAFRLLREHGFQVAQEVF 11113333-----------1111-------------------------1111---3333- DSFKNEEGEFKESLSDDTRGLLQLYEASFLLTEGETTLESAREFATKFLEEKVNEGGVDG ----------3333--------------------3333---------------------- DLLTRIAYSLDIPLHWRIKRPNAPVWIEWYRKRPDMNPVVLELAILDLNIVQAQFQEELK ------------3333-------3333-----3333------------------------ ESFRWWRNTGFVEKLPFARDRLVECYFWNTGIIEPRQHASARIMMGKVNALITVIDDIYD ---------3333-1111---------1111---3333---------------------- VYGTLEELEQFTDLIRRWDINSIDQLPDYMQLCFLALNNFVDDTSYDVMKEKGVNVIPYL ------------------1111----3333-------------------------3333- RQSWVDLADKYMVEARWFYGGHKPSLEEYLENSWQSISGPCMLTHIFFRVTDSFTKETVD -------------------------------3333------------------------- SLYKYHDLVRWSSFVLRLADDLGTSVEEVSRGDVPKSLQCYMSDYNASEAEARKHVKWLI ------------------------------------------------------------ AEVWKKMNAERVSKDSPFGKDFIGCAVDLGRMAQLMYHNGDGHGTQHPIIHQQMTRTLFE ------------------3333------------1111---------------------- PFA --- >UPF0100 PROTEIN AF_0094; SWP:O30142; PDB:2ONSA; GHMNVKLKVFHAGSLTEPMKAFKRAFEEKHPNVEVQTEAAGSAATIRKVTELGRKADVIA 1111-------3333-3333---------1111--------------------------- TADYTLIQKMMYPEFANWTIMFAKNQIVLAYRNDSRYADEINSQNWYEILKRPDVRFGFS -------------------------------1111-3333-1111------1111----- 
NPNDDPCGYRSLMAIQLAELYYNDPTIFDELVAKNSNLRFSEDNGSYVLRMPSSERIEIN 3333---------------1111--------1111-----------------3333---- KSKIMIRSMEMELIHLVESGELDYFFIYKSVAKQHGFNFVELPVEIDLSSPDYAELYSKV --------3333------------------------------3333---3333------- KVVLANGKEVTGKPIVYGITIPKNAENRELAVEFVKLVISEEGQEILRELGQEPLVPPRA ---1111--------------1111----------------------1111--------- DTAVPSLKAMVEVS ---11111111--- >UBIQUITIN-CONJUGATING ENZ; SWP:NA; PDB:2ONUA; SLTRKQCDFTKLIMAGYDLELNNGSTQDFDVMFHGPNGTAYEGGIWKVHVTLPDDYPFAS --3333---------------%%%%----------2222-2222--------1111---- PSIGFMNKLLHPNVDEASGSVCLDVINQTWTPLYSLVNVFEVFLPQLLTYPNPSDPLNSD ----------1111-----------3333-11113333---------------------- AASLLMKDKNIYEEKVKEYVKLYASKDLWE ------------------------3333-- >BROMODOMAIN-CONTAINING PR; SWP:Q5T1R6; PDB:2OO1A; KLSEHLRYCDSILREMLSKKHAAYAWPFYKPVDAEALELHDYHDIIKHPMDLSTVKRKMD -----------------3333---3333----3333--1111----------------11 GREYPDAQGFAADVRLMFSNCYKYNPPDHEVVAMARKLQDVFEMRFAKMP 11---3333----------------1111----------------1111- >HYPOTHETICAL PROTEIN AF_1; SWP:O28492; PDB:2OO2A; SLEEELRRETLKWLERIEERVKEIEGDEGFMRNIEAYISDSRYFLEKGDLVRAFECVVWA ------------------3333-------------------------------------- WAWLEIGLEVGKLHET ---------------- >PROTEIN INVOLVED IN CATAB; SWP:Q5ZVZ2; PDB:2OO3A; HAGNFADVIKHITLTRLLAYLTHKDKPLFYLETHSGRGIYDLKDKTEEYKEGINPVWLDR 2222---------------1111-----------------------------------11 ENLPSLFLEYISVIKQINLNSTLSYYPGSPYFAINQLRSQDRLYLCELHPTEYNFLLKLP 11-3333------------------------------1111--------------1111- HFNKKVYVNHTDGVSKLNALLPPPEKRGLIFIDPSYERKEEYKEIPYAIKNAYSKFSTGL !!!!-------3333-3333--3333-----------3333-----------1111---- YCVWYPVVNKAWTEQFLRKMREISSKSVRIELHLNPLINEGMTGCGLWIINPPYTFPSEI ----------------------------------------------------2222---- KLVLETLTTYFNPGSSSYMIESGSKLC -----------2222-------1111- >HYPOTHETICAL L-ALANINE-DL; SWP:Q13PB7; PDB:2OO6A; LKVVSVDTLCCDAGWRNYHFVKLTTDEGIVGWSEFDEGFGSPGVTAVIEQLGKRLVGASV ------------------------1111------------2222------3333222233 
MEHERFFAEAYCLTRPATGGVVSEGIGAIENALLDAKAKTLNVPCYELLGGKLRDRVPVY 33-----------3333---------------------1111-3333------------- WSHCPTWRINHPKFFGPPVTDLDGVKRTAEEARERQFRAIKTNIFIHDDGPLHAWRPGFA -------------------------------------------------------3333- VPFQPALNVDRKVLRNLRAHLEALRDGAGPDVEILLDLNFNAKPEGYLKILRELADFDLF ---3333--------------------------------------------1111----- WVEIDSYSPQGLAYVRNHSPHPISSCETLFGIREFKPFFDANAVDVAIVDTIWNGVWQSM ---------------1111------1111-3333----1111-------3333--3333- KIAAFADAHDINVAPHNFYGHLCTMINANFAAAVPNLRIMETDIDRLAWEDELFTHAPEY ------1111--------------------1111------------11111111------ QNGELIIPDRPGWGTDPVEEAILAHPPKVGGLL %%%%-------------------------!!!! >E3 UBIQUITIN-PROTEIN LIGA; SWP:CBL_HUMAN; PDB:2OO9A; GSQLSSEIENLSQGYSYQDIQKALVIAQNNIEAKNILREFAAAS --------------------------%%%%-------------- >E3 UBIQUITIN-PROTEIN LIGA; SWP:Q63Z43; PDB:2OOAA; VDAKIAKLGEGYAFEEVKRALEIAQNNVEVARSILREFAFP -----------------------%%%%-------------- >HISTIDINE PHOSPHOTRANSFER; SWP:Q9A980; PDB:2OOCA; GAVDFAYLEGFAAGDFAVVDEVLALFREQAALWAPLDPTHPGWKDAVHTVKGAARGVGAF ---------1111-----------------------1111-------------------- NLGEVCERCEAGQESLEGVRTALDAALLDIAAYAHEQALRSLK --------1111-------------------------------